Setting Up and Using ONNX Runtime for C++ in Linux
This article explains how to set up ONNX Runtime, a fast inference engine for machine learning models, to run in a native C++ application on Linux. It covers installing the necessary tools, downloading the ONNX Runtime library and an ONNX model, and configuring a CMake project to use the runtime.
Why it matters
Integrating ONNX Runtime into a C++ application lets developers run machine learning models natively, without the overhead of shipping a full Python environment.
Key Points
- ONNX Runtime allows running machine learning models in a C++ application without a full Python environment
- The article provides step-by-step instructions for setting up the ONNX Runtime library and an ONNX model
- A CMake configuration is provided to link the C++ application against the ONNX Runtime library
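The CMake configuration described in the third point might look roughly like the sketch below. The project name, source file, ONNX Runtime directory, and release version are placeholders, not taken from the article; adjust them to wherever you extracted the ONNX Runtime release.

```cmake
cmake_minimum_required(VERSION 3.16)
project(onnx_demo CXX)

# Enable C++17, as the article's configuration does
set(CMAKE_CXX_STANDARD 17)
set(CMAKE_CXX_STANDARD_REQUIRED ON)

# Placeholder path to the extracted ONNX Runtime release archive
set(ORT_ROOT "${CMAKE_SOURCE_DIR}/onnxruntime-linux-x64-1.17.0")

add_executable(onnx_demo main.cpp)

# Point the compiler at the ONNX Runtime headers and link the shared library
target_include_directories(onnx_demo PRIVATE "${ORT_ROOT}/include")
target_link_libraries(onnx_demo PRIVATE "${ORT_ROOT}/lib/libonnxruntime.so")

# Set an rpath so the executable can find libonnxruntime.so
# during local development without installing it system-wide
set_target_properties(onnx_demo PROPERTIES
    BUILD_RPATH "${ORT_ROOT}/lib")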
Details
ONNX Runtime is a practical tool for running machine learning models in a native C++ application on Linux. It provides a fast inference engine for models stored in the ONNX format, allowing developers to train or export models elsewhere and then deploy them in a lightweight C++ program without needing a full Python environment. This is especially useful when the application is already written in C++, such as a backend service, robotics stack, desktop application, or embedded system, where Linux and CMake are likely already part of the workflow.

The article walks through the setup process, including installing the necessary tools, downloading the ONNX Runtime library and an ONNX image classification model, and configuring a CMake project to link the C++ application against the ONNX Runtime library. The provided CMake configuration sets up the project, enables C++17, points to the ONNX Runtime library and headers, links the executable against the library, and sets an rpath so the executable can find the ONNX Runtime shared library during local development.
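Once the library is linked, running a model from C++ comes down to creating an environment, loading a session, and calling `Run`. The following is a minimal sketch using the ONNX Runtime C++ API; the model path, tensor shape, and the `"input"`/`"output"` names are placeholders for whatever classification model you downloaded (query the session or inspect the model to find the real names and shapes).

```cpp
#include <onnxruntime_cxx_api.h>

#include <array>
#include <iostream>
#include <vector>

int main() {
    // One Ort::Env per process; it owns logging and global state
    Ort::Env env(ORT_LOGGING_LEVEL_WARNING, "demo");
    Ort::SessionOptions opts;

    // Placeholder model path; point this at the ONNX model you downloaded
    Ort::Session session(env, "model.onnx", opts);

    // Placeholder shape for a typical image classifier: NCHW, 224x224 RGB
    std::array<int64_t, 4> shape{1, 3, 224, 224};
    std::vector<float> input(1 * 3 * 224 * 224, 0.0f);

    // Wrap the host buffer as a CPU tensor without copying it
    Ort::MemoryInfo mem =
        Ort::MemoryInfo::CreateCpu(OrtArenaAllocator, OrtMemTypeDefault);
    Ort::Value tensor = Ort::Value::CreateTensor<float>(
        mem, input.data(), input.size(), shape.data(), shape.size());

    // Placeholder input/output names; real models define their own
    const char* in_names[] = {"input"};
    const char* out_names[] = {"output"};
    auto outputs = session.Run(Ort::RunOptions{nullptr},
                               in_names, &tensor, 1, out_names, 1);

    // For a classifier, the output is typically a vector of class scores
    float* scores = outputs[0].GetTensorMutableData<float>();
    std::cout << "first score: " << scores[0] << "\n";
    return 0;
}
```

A design note: `CreateTensor` borrows the `input` buffer rather than copying it, so the `std::vector` must outlive the `Run` call.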