Open Neural Network Exchange

ONNX Runtime
開発元	Microsoft
初版	2018年11月30日 (6年前)
最新版	v1.20.1 / 2024年11月22日 (5か月前)
リポジトリ	onnxruntime - GitHub
プログラミング; 言語	Python, C++, C#, C言語, Java, JavaScript, Objective-C, WinRT
対応OS	Windows, Linux, macOS, Android, iOS, ウェブブラウザ
プラットフォーム	多数（本記事参照）
ライセンス	MIT License
公式サイト	onnxruntime.ai
	テンプレートを表示

Open Neural Network Exchange (ONNX)
開発元	Facebook, Microsoft
初版	2017年9月7日 (7年前)
最新版	v1.17.0 / 2024年10月2日 (7か月前)
リポジトリ	onnx - GitHub
種別	人工知能機械学習
ライセンス	Apache License 2.0
公式サイト	onnx.ai
	テンプレートを表示

Open Neural Network Exchange（略称：ONNX）とは、オープンソースで開発されている機械学習や人工知能のモデルを表現する為の代表的なフォーマットである^[3]。実行エンジンとしてONNX Runtimeも開発されている。

概要

機械学習、特にニューラルネットワークモデルは様々なフレームワーク上で学習され、また様々なハードウェア上で実行（推論）される。各環境に特化したモデルは他のフレームワーク・ハードウェアで利用できず相互運用性を欠いてしまう。また実装者は環境ごとにサポートをおこなう必要があり大きな労力を必要とする。

ONNXはモデルを記述する統一インターフェース（フォーマット）を提供し、これらの問題を解決する。各フレームワークは学習したモデルをONNX形式で出力する。各ハードウェアはONNX実行環境を提供することで、どのフレームワークで学習されたかを問わずモデル推論を実行する。このように相互運用可能なモデルフォーマットとしてONNXは開発されている。

2017年に開発が開始された。

開発背景

以下の特性を補完する意図にて開発が進められた。

フレームワークの相互運用性

開発工程や機械学習の高速処理、ネットワークの基本設計における柔軟性やモバイルデバイスでの推論などの特定の段階において、開発者が複数のフレームワークでのデータのやり取りを簡単に行えるようにする^[4]。

最適化の共有

ハードウェアベンダーなどは、ONNXを対象に調整を行うことで、複数のフレームワークにおけるニューラルネットワークのパフォーマンスを一度に改善することができる^[4]。

沿革

2017年9月に、FacebookとMicrosoftは、PyTorchやCaffe2などの機械学習フレームワーク間において相互運用を可能にする為の取り組みとして、このプロジェクトを始動した。その後、IBM、Huawei、Intel、AMD、ARM、Qualcommがこの取り組みに対して積極的な支援を表明した^[3]。

2017年10月に、MicrosoftはCognitive ToolkitおよびProject Brainwaveプラットフォームにおいて、ONNXのサポートを発表した^[3]。

2019年11月、ONNXはLinux Foundation AIの卒業生プロジェクトとして承認された。

構成

ONNXは、推論（評価）に焦点を当て、拡張可能な計算グラフモデル、組み込み演算子、および標準データ型の定義を提供する^[4]。

それぞれのデータフローグラフは、有向非巡回グラフを形成するノードのリストになっている。ノードには入力と出力があり、各ノードが処理を呼び出すようになっている。メタデータはグラフを文書化する。組み込み演算子は、ONNXをサポートする各フレームワークで利用可能である^[4]。

グラフは Protocol Buffers を使用して拡張子 .onnx のバイナリファイルとして保存可能である^[5]。このファイルは様々な機械学習のライブラリから読み書き可能である。

ONNX仕様は2つのサブ仕様、IRとOperatorからなる。この2つの仕様はそれぞれバージョニングされており、ONNX仕様のバージョンはこの2つのサブ仕様の特定版を指定したものとなっている。2021-12-22現在の最新バージョンは version 1.10.2 であり、これはIR v8とOperator v15-v2-v1 から成る^[6]。

ONNX IR

Open Neural Network Exchange Intermediate Representation (ONNX IR) はONNXの基本データ型と計算グラフを定義するサブ仕様である^[7]。ONNX IRは計算グラフを構成する Model, Graph, Node 等の要素、入出力 Tensor, Sequence, Map およびデータ FLOAT, INT8, BFLOAT16 等の基本データ型を定義する。2021-12-22現在の最新バージョンは version 8 である^[6]。

ONNX IRが定義する要素として以下が挙げられる。

Graph: 計算グラフを表現する要素。Graph入出力を指定するinput/initializer^[8]^[9]/output、計算ノード群を指定するnode、メタデータを収納するname/doc_string/value_info、の属性をもつ。
Node: 計算ノードを表現する要素。Node入出力を指定するinput/output、演算子とそのパラメータを指定するdomain/op_type/attribute、メタデータを収納するname/doc_string/value_info、の属性をもつ。

すなわちGraphに収納された各Nodeが入出力をもった演算 (例: Conv) になっており、Node/Graph入出力名に基づいてNode群がグラフ構造を取っている。

拡張演算子

ONNX IRはONNX Operatorで定義される標準演算子に追加して、独自の拡張演算子を受け入れられるように設計されている^[10]。これによりONNXの "Extensible/拡張可能" 特性を実現している^[11]。拡張演算子セットを Model の opset_import 属性に指定することで実行エンジン側へ拡張演算子の利用を通知する仕組みである^[12]。ONNXを受け取った実行エンジンは opset_import を確認し、指定された演算子セット全てをサポートしていれば受け入れ、そうでなければ Model 全体を拒絶する^[13]。

ONNX Operator

ONNXのビルトイン演算子はサブ仕様 Operator specifications により定義される^[14]。3種類の演算子セット（Opset）ai.onnx, ai.onnx.ml, ai.onnx.training が定義されており、ai.onnx がデフォルトである。2022年12月12日現在、ai.onnx の最新バージョンは version 18 である^[6]。

例えばOpset ai.onnx v15ではRNN系演算子として RNN 、LSTM 、GRU が定義されている。

量子化

ONNXは入出力の量子化やそれに対する操作を演算子として持つ。QuantizeLinearはスケール・シフトパラメータに基づく線形量子化をおこなう^[15]。DynamicQuantizeLinearは入力ベクトルのmin/maxに基づく動的uint8量子化をおこなう^[16]。int8入力に対する演算にはMatMulInteger、QLinearMatMul、ConvInteger、QLinearConvなどがある。

ONNX Runtime

ONNX Runtime (略称: ORT^[17]) は様々な環境におけるONNXモデルの推論・学習高速化を目的としたオープンソースプロジェクトである^[18]。フレームワーク・OS・ハードウェアを問わず単一のRuntime APIを介してONNXモデルを利用できる^[19]。またデプロイ環境に合わせた最適化を自動でおこなう^[20]。ONNX Runtimeは設計方針としてアクセラレータ・ランタイム抽象化とパフォーマンス最適化の両立を掲げており、ONNXモデルの自動分割と最適アクセラレータによるサブモデル実行によりこれを実現している^[21]。

ONNX Runtimeがサポートする最適化には以下が挙げられる。

モデル量子化: 8-bit Model Quantization^[22]
グラフ最適化^[23]: Basic (不要ノード除去・一部のop fusions^[24]), Extended (op fusions^[25]), Layout (NCHWc Optimizer^[26]) の三段階

対応デバイス

ONNX Runtime は共有ライブラリの Execution Providers によって多数のバックエンドをサポートしている^[27]。これにはIntel の OpenVINO バックエンド (onnxruntime-openvino) 及び oneDNN バックエンド、NVIDIAの CUDA バックエンド (onnxruntime-gpu) 及び TensorRT バックエンド (onnxruntime-gpu)、AMDの ROCm バックエンド及び MIGraphX バックエンド、Windows の DirectML バックエンド (onnxruntime-directmlなど)、macOS / iOS の CoreML バックエンド、Android の NNAPI バックエンド、Microsoft Azure向けの Azure バックエンドなどが存在する^[28]^[27]。

以下のデバイスがONNX Runtimeに対応している。^[29]

CPU
- x86, x86-64, ARM32, ARM64, POWER, RISC-V^[30]
- Intel OpenVINO, Intel oneAPI Deep Neural Network Library - OpenVINOはIntelのCPU・GPU・NPUおよびARMのCPUに対応^[31]
- Google XNNPACK - ARM, x86, WebAssembly, RISC-VのCPUに対応
- Apache TVM（プレビュー）
GPU
- NVIDIA CUDA, TensorRT
- AMD ROCm, MIGraphX
- Intel OpenVINO
- Microsoft DirectML
- Apache TVM（プレビュー）
エッジやモバイル（NPU）
- Intel OpenVINO - Intel Core Ultraなどに内蔵するNPUに対応
- Qualcomm QNN - QualcommのCPU・GPU・NPUに対応
- Microsoft DirectML - IntelやQualcommなどのNPUに対応^[32]^[33]
- Google XNNPACK
- Android Neural Networks API - ただしAndroid15よりNNAPI自体が非推奨^[34]
- AMD Vitis AI（プレビュー） - AMD Ryzenなどに内蔵するNPUに対応^[35]
- Arm Compute Library, Arm NN（プレビュー） - ARMのCPU・GPU・NPUに対応
- Apple CoreML（プレビュー）
- Apache TVM（プレビュー）
- Hailo（プレビュー）^[36]
その他
- Rockchip NPU （プレビュー）
- AMD Vitis AI（プレビュー） - AMD Versal, Zynq, Alveo等に対応^[37]
- Huawei Compute Architecture for Neural Networks (CANN) （プレビュー）
- Microsoft Azure（プレビュー）

主要なデバイスとしては Google の TPU, Edge TPU、AWS の Trainium, Inferentia が対応していない。Edge TPU は LiteRT のみだが、それ以外のデバイスは、JAX、TensorFlow、PyTorch に対応しているので、これらに変換することで動作し、AWS は Trainium, Inferentia は PyTorch 経由の動作を薦めている^[38]。

ONNX Runtime Web

ONNX Runtime Webはウェブブラウザを含めたJavaScript上で動作するONNX Rutime。Execution Providerとして、Web Neural Network (WebNN)、WebGPU、WebGL、WebAssemblyが利用可能である。WebNNはONNXのようにニューラルネットワークのモデルを表現できるので、GPUやNPUなどのアクセラレーションが有効であれば、これが最速である。WindowsではWebNNからDirectML経由でGPUやNPUで実行される。^[39]^[40]^[41]

ONNXモデル

ONNXへの変換

ONNXのモデルはPythonスクリプトから生成したり（#例を参照）、他のフレームワークから変換したりすることで作ることができる。他のフレームワークからの変換には以下のような方法が存在する：

ONNXMLTools^[42] - 様々なフレームワークからの変換を行う。
PyTorchの標準ONNXエクスポータ (torch^[43])
tf2onnx - TensorFlow、LiteRT（旧称TensorFlow Lite）、Keras、TensorFlow.jsからの変換を行う^[44]
sklearn-onnx (skl2onnx^[43]) - scikit-learnからの変換を行う

ONNXからの変換

onnx2tf - TensorFlow、LiteRT（旧称TensorFlow Lite）、Kerasへの変換を行う^[45]
onnx2pytorch^[46], onnx2torch^[47] - PyTorchへの変換
JAX ONNX Runtime - JAXへの変換^[48]
ONNX からNVIDIA GPU向けのTensorRTバイナリ (.trt) を生成するものとして NVIDIA の Polygraphy^[49] や trtexec も存在する^[49]。
OONX を LLVM の MLIR によってコンパイルするための onnx-mlir も存在する^[50]。

モデル集

ONNXのモデル集としては以下が存在する。

ONNX Model Zoo^[51]

例

線形回帰モデルの学習結果として $y=2x+3$ が得られたとして、それを ONNX ファイルに保存する Python での実装例。

import numpy as np
import onnx
from onnx import TensorProto, numpy_helper
from onnx.helper import make_model, make_node, make_graph, make_tensor_value_info

A = numpy_helper.from_array(np.array(2.0), "A")
B = numpy_helper.from_array(np.array(3.0), "B")
X = make_tensor_value_info("X", TensorProto.DOUBLE, [])
Y = make_tensor_value_info("Y", TensorProto.DOUBLE, [])

graph = make_graph([
	make_node("Mul", ["A", "X"], ["AX"]),
	make_node("Add", ["AX", "B"], ["Y"]),
], "Linear Regression", [X], [Y], [A, B])
onnx.save(make_model(graph), "2x_3.onnx")

それを ONNX Runtime を使い実行する Python での実装例。

import numpy as np
import onnxruntime as ort
ort_sess = ort.InferenceSession("2x_3.onnx")
y = ort_sess.run(None, {"X": np.array(4.0)})[0]

脚注

[脚注の使い方]

出典

^ “Releases · onnx/onnx”. 2025年1月20日閲覧。
^ “Releases · microsoft/onnxruntime”. 2025年1月20日閲覧。
^ ^a ^b ^c “Microsoft and Facebook's open AI ecosystem gains more support” (英語). Engadget 2017年10月11日閲覧。
^ ^a ^b ^c ^d “Microsoft and Facebook create open ecosystem for AI model interoperability - Microsoft Cognitive Toolkit” (英語). Microsoft Cognitive Toolkit. (2017年9月7日) 2017年10月11日閲覧。
^ onnx/IR.md at main onnx/onnx - GitHub
^ ^a ^b ^c ONNX Versioning. onnx/onnx.
^ " 1. A definition of an extensible computation graph model. 2. Definitions of standard data types. #1 and #2 together make up the ONNX Intermediate Representation, or 'IR', specification" Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx
^ 同名のGraph入力デフォルト値、あるいは定数Graph入力扱い "When an initializer has the same name as a graph input, it specifies a default value for that input. When an initializer has a name different from all graph inputs, it specifies a constant value." Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx. 2022-06-09閲覧.
^ "When an initializer has the same name as a graph input, it specifies a default value for that input." Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx. 2022-06-09閲覧.
^ "An implementation MAY extend ONNX by adding operators expressing semantics beyond the standard set of operators" Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx
^ "Extensible computation graph model ... expressing semantics beyond the standard set of operators" Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx
^ "The mechanism for this is adding operator sets to the opset_import property in a model that depends on the extension operators." Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx
^ "An implementation must support all operators in the set or reject the model." Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx
^ "Operator specifications that may be referenced by a given ONNX graph." ONNX Versioning. onnx/onnx.
^ "QuantizeLinear The linear quantization operator. It consumes a high precision tensor, a scale, and a zero point ... The quantization formula is y = saturate ((x / y_scale) + y_zero_point). ... For (x / y_scale), it's rounding to nearest ties to even." Operator Schemas. ONNX. 2022-03-13閲覧.
^ "DynamicQuantizeLinear for Scale, Zero Point and FP32->8Bit convertion of FP32 Input data" Operator Schemas. ONNX. 2022-03-13閲覧.
^ "ONNX Runtime (ORT)" Welcome to ONNX Runtime (ORT). ONNX Runtime.
^ "ONNX Runtime is an open source project that is designed to accelerate machine learning across a wide range of frameworks, operating systems, and hardware platforms." About. ONNX Runtime.
^ "It enables acceleration of machine learning inferencing across all of your deployment targets using a single set of API." About. ONNX Runtime.
^ "ONNX Runtime automatically parses through your model to identify optimization opportunities and provides access to the best hardware acceleration available." About. ONNX Runtime.
^ "Design principles ONNX Runtime abstracts custom accelerators and runtimes to maximize their benefits across an ONNX model. ... ONNX Runtime partitions the ONNX model graph into subgraphs that align with available custom accelerators and runtimes." About. ONNX Runtime.
^ Quantize ONNX Models. ONNX Runtime.
^ Graph Optimizations in ONNX Runtime. ONNX Runtime.
^ "Redundant node eliminations ... Semantics-preserving node fusions" Graph Optimizations in ONNX Runtime. ONNX Runtime.
^ "These optimizations include complex node fusions." Graph Optimizations in ONNX Runtime. ONNX Runtime.
^ "These optimizations change the data layout ... Optimizes the graph by using NCHWc layout instead of NCHW layout." Graph Optimizations in ONNX Runtime. ONNX Runtime.
^ ^a ^b Build ONNX Runtime with Execution Providers Microsoft
^ Optimize and Accelerate Machine Learning Inferencing and Training Microsoft
^ “Execution Providers”. onnxruntime. 2025年1月18日閲覧。
^ “Build for inferencing”. onnxruntime. 2025年1月18日閲覧。
^ “System Requirements — OpenVINO™ documentationCopy to clipboardCopy to clipboard — Version(2024)”. docs.openvino.ai. 2025年1月20日閲覧。
^ Pralle, Chad. “Introducing Neural Processor Unit (NPU) support in DirectML (developer preview) - DirectX Developer Blog”. DirectX Developer Blog. 2025年1月24日閲覧。
^ Blog, Windows Developer. “DirectML expands NPU support to Copilot+ PCs and WebNN”. Windows Developer Blog. 2025年1月24日閲覧。
^ “NNAPI 移行ガイド”. 2025年1月18日閲覧。
^ “AMD - Vitis AI”. onnxruntime. 2025年1月18日閲覧。
^ Hailo. “hailo-ai/onnxruntime: ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator”. 2025年1月18日閲覧。
^ “Vitis AI — Vitis™ AI 3.5 documentation”. xilinx.github.io. 2025年1月20日閲覧。
^ “ONNX FAQ — AWS Neuron Documentation”. awsdocs-neuron.readthedocs-hosted.com. 2025年1月19日閲覧。
^ “Web”. onnxruntime. 2025年1月24日閲覧。
^ “WebNN Overview” (2024年8月20日). 2025年1月24日閲覧。
^ “Web Neural Network API”. w3.org. 2025年1月24日閲覧。
^ ONNX モデル Microsoft
^ ^a ^b Install ONNX to export the model Microsoft
^ “onnx/tensorflow-onnx: Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX”. 2025年1月18日閲覧。
^ “PINTO0309/onnx2tf: Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.”. 2025年1月18日閲覧。
^ “Talmaj/onnx2pytorch: Transform ONNX model to PyTorch representation”. 2025年1月18日閲覧。
^ “ENOT-AutoDL/onnx2torch: Convert ONNX models to PyTorch.”. 2025年1月19日閲覧。
^ “google/jaxonnxruntime: A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.”. 2025年1月19日閲覧。
^ ^a ^b NVIDIA Deep Learning TensorRT Documentation NVIDIA
^ Users of MLIR LLVM Project
^ “onnx/models: A collection of pre-trained, state-of-the-art models in the ONNX format”. 2025年1月18日閲覧。

外部リンク

[1] “Releases · onnx/onnx”. 2025年1月20日閲覧。

[2] “Releases · microsoft/onnxruntime”. 2025年1月20日閲覧。

[:02-3] “Microsoft and Facebook's open AI ecosystem gains more support” (英語). Engadget 2017年10月11日閲覧。

[:0-4] “Microsoft and Facebook create open ecosystem for AI model interoperability - Microsoft Cognitive Toolkit” (英語). Microsoft Cognitive Toolkit. (2017年9月7日) 2017年10月11日閲覧。

[5] x/IR.md at main onnx/onnx - GitHub

[:1-6] ONNX Versioning. onnx/onnx.

[7] " 1. A definition of an extensible computation graph model. 2. Definitions of standard data types. #1 and #2 together make up the ONNX Intermediate Representation, or 'IR', specification" Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx

[8] 同名のGraph入力デフォルト値、あるいは定数Graph入力扱い "When an initializer has the same name as a graph input, it specifies a default value for that input. When an initializer has a name different from all graph inputs, it specifies a constant value." Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx. 2022-06-09閲覧.

[9] "When an initializer has the same name as a graph input, it specifies a default value for that input." Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx. 2022-06-09閲覧.

[10] "An implementation MAY extend ONNX by adding operators expressing semantics beyond the standard set of operators" Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx

[11] "Extensible computation graph model ... expressing semantics beyond the standard set of operators" Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx

[12] "The mechanism for this is adding operator sets to the opset_import property in a model that depends on the extension operators." Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx

[13] "An implementation must support all operators in the set or reject the model." Open Neural Network Exchange Intermediate Representation (ONNX IR) Specification. onnx/onnx

[14] "Operator specifications that may be referenced by a given ONNX graph." ONNX Versioning. onnx/onnx.

[15] "QuantizeLinear The linear quantization operator. It consumes a high precision tensor, a scale, and a zero point ... The quantization formula is y = saturate ((x / y_scale) + y_zero_point). ... For (x / y_scale), it's rounding to nearest ties to even." Operator Schemas. ONNX. 2022-03-13閲覧.

[16] "DynamicQuantizeLinear for Scale, Zero Point and FP32->8Bit convertion of FP32 Input data" Operator Schemas. ONNX. 2022-03-13閲覧.

[17] "ONNX Runtime (ORT)" Welcome to ONNX Runtime (ORT). ONNX Runtime.

[18] "ONNX Runtime is an open source project that is designed to accelerate machine learning across a wide range of frameworks, operating systems, and hardware platforms." About. ONNX Runtime.

[19] "It enables acceleration of machine learning inferencing across all of your deployment targets using a single set of API." About. ONNX Runtime.

[20] "ONNX Runtime automatically parses through your model to identify optimization opportunities and provides access to the best hardware acceleration available." About. ONNX Runtime.

[21] "Design principles ONNX Runtime abstracts custom accelerators and runtimes to maximize their benefits across an ONNX model. ... ONNX Runtime partitions the ONNX model graph into subgraphs that align with available custom accelerators and runtimes." About. ONNX Runtime.

[22] Quantize ONNX Models. ONNX Runtime.

[23] Graph Optimizations in ONNX Runtime. ONNX Runtime.

[24] "Redundant node eliminations ... Semantics-preserving node fusions" Graph Optimizations in ONNX Runtime. ONNX Runtime.

[25] "These optimizations include complex node fusions." Graph Optimizations in ONNX Runtime. ONNX Runtime.

[26] "These optimizations change the data layout ... Optimizes the graph by using NCHWc layout instead of NCHW layout." Graph Optimizations in ONNX Runtime. ONNX Runtime.

[onnxr-eps-27] Build ONNX Runtime with Execution Providers Microsoft

[28] Optimize and Accelerate Machine Learning Inferencing and Training Microsoft

[29] “Execution Providers”. onnxruntime. 2025年1月18日閲覧。

[30] “Build for inferencing”. onnxruntime. 2025年1月18日閲覧。

[31] “System Requirements — OpenVINO™ documentationCopy to clipboardCopy to clipboard — Version(2024)”. docs.openvino.ai. 2025年1月20日閲覧。

[32] Pralle, Chad. “Introducing Neural Processor Unit (NPU) support in DirectML (developer preview) - DirectX Developer Blog”. DirectX Developer Blog. 2025年1月24日閲覧。

[33] Blog, Windows Developer. “DirectML expands NPU support to Copilot+ PCs and WebNN”. Windows Developer Blog. 2025年1月24日閲覧。

[34] “NNAPI 移行ガイド”. 2025年1月18日閲覧。

[35] “AMD - Vitis AI”. onnxruntime. 2025年1月18日閲覧。

[36] Hailo. “hailo-ai/onnxruntime: ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator”. 2025年1月18日閲覧。

[37] “Vitis AI — Vitis™ AI 3.5 documentation”. xilinx.github.io. 2025年1月20日閲覧。

[38] “ONNX FAQ — AWS Neuron Documentation”. awsdocs-neuron.readthedocs-hosted.com. 2025年1月19日閲覧。

[39] “Web”. onnxruntime. 2025年1月24日閲覧。

[40] “WebNN Overview” (2024年8月20日). 2025年1月24日閲覧。

[41] “Web Neural Network API”. w3.org. 2025年1月24日閲覧。

[windows-ml-onnx-42] ONNX モデル Microsoft

[onnx-export-43] Install ONNX to export the model Microsoft

[44] “onnx/tensorflow-onnx: Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX”. 2025年1月18日閲覧。

[onnx2tf-45] “PINTO0309/onnx2tf: Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.”. 2025年1月18日閲覧。

[46] “Talmaj/onnx2pytorch: Transform ONNX model to PyTorch representation”. 2025年1月18日閲覧。

[47] “ENOT-AutoDL/onnx2torch: Convert ONNX models to PyTorch.”. 2025年1月19日閲覧。

[jaxonnxruntime-48] “google/jaxonnxruntime: A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.”. 2025年1月19日閲覧。

[nv-trt-guide-49] NVIDIA Deep Learning TensorRT Documentation NVIDIA

[50] Users of MLIR LLVM Project

[51] “onnx/models: A collection of pre-trained, state-of-the-art models in the ONNX format”. 2025年1月18日閲覧。

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]