# ATC Model Converter

Complete guide for converting ONNX models to a format compatible with Ascend AI processors using the ATC (Ascend Tensor Compiler) tool.

**Supported CANN Versions:** 8.3.RC1, 8.5.0

## ⚠️ Critical Compatibility Requirements

Before starting, ensure your environment meets these requirements:

| Component | Requirement | Why |
|---|---|---|
| Python | 3.7, 3.8, 3.9, or 3.10 | Python 3.11+ is incompatible with CANN 8.1.RC1 |
| NumPy | < 2.0 (e.g., 1.26.4) | CANN uses a deprecated NumPy API |
| ONNX Opset | 11 or 13 (for CANN 8.1.RC1) | Higher opset versions are not supported |

**Quick Environment Setup:**
```bash
# Create a Python 3.10 environment (recommended)
conda create -n atc_py310 python=3.10 -y
conda activate atc_py310

# Install compatible dependencies
pip install torch torchvision ultralytics onnx onnxruntime
pip install "numpy<2.0" --force-reinstall
pip install decorator attrs absl-py psutil protobuf sympy
```

## ⚠️ IMPORTANT: SoC Version Must Match Exactly

The SoC version used in ATC conversion must exactly match your target device!
```bash
# Get the exact SoC version from your device
npu-smi info | grep Name
# Output: Name: 910B3  → Use: --soc_version=Ascend910B3
# Output: Name: 310P3  → Use: --soc_version=Ascend310P3
```
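To avoid transcription mistakes, the `Name` value can be turned into the ATC flag programmatically. A minimal sketch (the helper name is ours, not part of the toolkit):

```python
def soc_version_arg(npu_smi_name_line: str) -> str:
    """Map the 'Name' line from `npu-smi info` to an ATC --soc_version flag."""
    # e.g. "Name    : 910B3" or "Name: 310P3"
    name = npu_smi_name_line.split(":")[-1].strip()
    return f"--soc_version=Ascend{name}"

print(soc_version_arg("Name: 910B3"))  # --soc_version=Ascend910B3
```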
**Common Error:** `[ACL ERROR] EE1001: supported socVersion=Ascend910B3, but the model socVersion=Ascend910B`

**Fix:** Use the exact SoC version reported by `npu-smi info`, not the generic version!

## Quick Start
```bash
# 1. Check your CANN version and environment
./scripts/check_env_enhanced.sh
```
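If the helper script is unavailable, the two rules from the compatibility table above can be checked by hand. A minimal sketch (the function name and structure are ours):

```python
import sys

def check_python_numpy(py=sys.version_info, numpy_version=None):
    """Return a list of compatibility problems (empty list = OK)."""
    problems = []
    if not ((3, 7) <= (py[0], py[1]) <= (3, 10)):
        problems.append(f"Python {py[0]}.{py[1]} unsupported; use 3.7-3.10")
    if numpy_version is not None and int(numpy_version.split(".")[0]) >= 2:
        problems.append(f"NumPy {numpy_version} too new; install numpy<2.0")
    return problems

print(check_python_numpy(py=(3, 11, 0), numpy_version="2.1.0"))
```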
```bash
# 2. Source the appropriate environment
source /usr/local/Ascend/ascend-toolkit/set_env.sh  # For 8.1.RC1/8.3.RC1
# OR
source /usr/local/Ascend/cann/set_env.sh            # For 8.5.0+
```
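A quick way to confirm the environment took effect is to check that `atc` is now on `PATH`. A small sketch:

```python
import shutil

def atc_available() -> bool:
    """True if the `atc` binary is on PATH, i.e. a set_env.sh was sourced."""
    return shutil.which("atc") is not None

if not atc_available():
    print("atc not found on PATH - source the set_env.sh for your CANN version first")
```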
```bash
# 3. Basic ONNX to OM conversion
atc --model=model.onnx --framework=5 --output=output_model \
    --soc_version=Ascend910B3

# With input shape specification
atc --model=model.onnx --framework=5 --output=output_model \
    --soc_version=Ascend910B3 \
    --input_shape="input:1,3,640,640"
```

## YOLO Model Conversion & Inference

### YOLO Task Types & Output Formats

| Task | Model Example | ONNX Output | Post-processing |
|---|---|---|---|
| Detection | yolo26n.pt | (1, 84, 8400) | decode + NMS |
| Pose | yolo26n-pose.pt | (1, 300, 57) | filter only |
| Segmentation | yolo26n-seg.pt | (1, 116, 8400) | decode + NMS + mask |
| OBB | yolo26n-obb.pt | (1, 15, 8400) | decode + NMS |

**Note:** YOLO ONNX outputs are raw feature maps, not processed detections. The `yolo_om_infer.py` script handles decode + NMS automatically.

### Step 1: Export YOLO to ONNX

```python
from ultralytics import YOLO

model = YOLO('yolo26n.pt')  # or yolo26n-pose.pt, yolo26n-seg.pt, etc.

# Export with opset 11 for CANN 8.1.RC1 compatibility
model.export(format='onnx', imgsz=640, opset=11, simplify=True)
```

### Step 2: Convert to OM
```bash
# Get your SoC version first
npu-smi info | grep Name

# Convert
atc --model=yolo26n.onnx --framework=5 --output=yolo26n \
    --soc_version=Ascend910B3 \
    --input_shape="images:1,3,640,640"
```

### Step 3: Run Inference
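As noted above, the detection head emits a raw `(1, 84, 8400)` tensor: 4 box coordinates (cx, cy, w, h) plus 80 class scores per anchor. `yolo_om_infer.py` performs the decode + NMS for you; the numpy sketch below only illustrates that layout, with arbitrary default thresholds:

```python
import numpy as np

def decode_detections(raw, conf_thres=0.25, iou_thres=0.45):
    """Decode a raw (1, 84, N) YOLO detection output: cxcywh + 80 class scores."""
    preds = raw[0].T                          # (N, 84)
    boxes_cxcywh, scores = preds[:, :4], preds[:, 4:]
    conf = scores.max(axis=1)                 # best class score per anchor
    cls = scores.argmax(axis=1)
    keep = conf > conf_thres                  # confidence filter
    boxes_cxcywh, conf, cls = boxes_cxcywh[keep], conf[keep], cls[keep]
    # cxcywh -> xyxy
    cx, cy, w, h = boxes_cxcywh.T
    boxes = np.stack([cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2], axis=1)
    # Greedy NMS over the surviving boxes
    areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    order = conf.argsort()[::-1]
    kept = []
    while order.size:
        i = order[0]
        kept.append(i)
        if order.size == 1:
            break
        rest = order[1:]
        x1 = np.maximum(boxes[i, 0], boxes[rest, 0])
        y1 = np.maximum(boxes[i, 1], boxes[rest, 1])
        x2 = np.minimum(boxes[i, 2], boxes[rest, 2])
        y2 = np.minimum(boxes[i, 3], boxes[rest, 3])
        inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
        iou = inter / (areas[i] + areas[rest] - inter + 1e-9)
        order = rest[iou <= iou_thres]        # drop heavily overlapping boxes
    return boxes[kept], conf[kept], cls[kept]
```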
```bash
# Detection (default)
python3 scripts/yolo_om_infer.py --model yolo26n.om \
    --source image.jpg --task detect --output result.jpg

# Pose estimation
python3 scripts/yolo_om_infer.py --model yolo26n-pose.om \
    --source image.jpg --task pose --output result_pose.jpg

# Segmentation
python3 scripts/yolo_om_infer.py --model yolo26n-seg.om \
    --source image.jpg --task segment --output result_seg.jpg

# Oriented Bounding Box
python3 scripts/yolo_om_infer.py --model yolo26n-obb.om \
    --source image.jpg --task obb --output result_obb.jpg
```

### YOLO Python API

```python
from yolo_om_infer import YoloOMInferencer, draw_results

# Initialize for detection
inferencer = YoloOMInferencer(
    model_path="yolo26n.om",
    task="detect",      # or "pose", "segment", "obb"
    device_id=0,
    conf_thres=0.25,
    iou_thres=0.45,
)

# Run inference
result = inferencer("image.jpg")

# Access results
print(f"Detections: {result['num_detections']}")
print(f"Inference time: {result['timing']['infer_ms']:.1f} ms")
for det in result['detections']:
    print(f"  {det['cls_name']}: {det['conf']:.2f} at {det['box']}")

# Cleanup
inferencer.free_resource()
```

For a detailed YOLO guide, see `YOLO_GUIDE.md`.

## OM Model Inference (General)

After converting your model to OM format, use `ais_bench` for Python inference.

### Install ais_bench
```bash
# Download pre-built wheel packages (recommended)
# See: https://gitee.com/ascend/tools/blob/master/ais-bench_workload/tool/ais_bench/README.md

# Example for Python 3.10, aarch64:
wget https://aisbench.obs.myhuaweicloud.com/packet/ais_bench_infer/0.0.2/ait/aclruntime-0.0.2-cp310-cp310-linux_aarch64.whl
wget https://aisbench.obs.myhuaweicloud.com/packet/ais_bench_infer/0.0.2/ait/ais_bench-0.0.2-py3-none-any.whl
pip3 install ./aclruntime-*.whl ./ais_bench-*.whl
```

### Basic Inference
```bash
# Print model info
python3 scripts/infer_om.py --model model.om --info

# Run inference with random input
python3 scripts/infer_om.py --model model.om --input-shape "1,3,640,640"

# Run inference with actual input
python3 scripts/infer_om.py --model model.om --input test.npy --output result.npy
```

### Python API

```python
from ais_bench.infer.interface import InferSession
import numpy as np

session = InferSession(device_id=0, model_path="model.om")
print("Inputs:", [(i.name, i.shape) for i in session.get_inputs()])
print("Outputs:", [(o.name, o.shape) for o in session.get_outputs()])

input_data = np.random.randn(1, 3, 640, 640).astype(np.float32)
outputs = session.infer([input_data], mode='static')
print(f"Inference time: {session.summary().exec_time_list[-1]:.3f} ms")

session.free_resource()
```

See `INFERENCE.md` for detailed `ais_bench` usage.

## Precision Comparison

Verify conversion accuracy by comparing ONNX (CPU) outputs against OM (NPU) outputs.
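`compare_precision.py` reports the metrics listed below; conceptually they reduce to a few lines of numpy. A sketch, not the script's exact implementation:

```python
import numpy as np

def compare_outputs(onnx_out, om_out, atol=1e-3, rtol=1e-2):
    """Compute cosine similarity, max absolute difference, and a pass/fail flag."""
    a = onnx_out.ravel().astype(np.float64)
    b = om_out.ravel().astype(np.float64)
    cosine = float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))
    return {
        "cosine_similarity": cosine,
        "max_abs_diff": float(np.abs(a - b).max()),
        "is_close": bool(np.allclose(a, b, atol=atol, rtol=rtol)),
    }
```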
```bash
# Basic comparison
python3 scripts/compare_precision.py --onnx model.onnx --om model.om --input test.npy

# With custom tolerances
python3 scripts/compare_precision.py --onnx model.onnx --om model.om --input test.npy \
    --atol 1e-3 --rtol 1e-2
```

| Metric | Description | Good Value |
|---|---|---|
| cosine_similarity | 1.0 = identical | > 0.99 |
| max_abs_diff | Maximum absolute difference | < 1e-3 (FP32) |
| is_close | Pass/fail based on atol/rtol | True |

## CANN Version Guide

| CANN Version | Environment Path | Notes |
|---|---|---|
| 8.3.RC1 | /usr/local/Ascend/ascend-toolkit/set_env.sh | Standard installation |
| 8.5.0+ | /usr/local/Ascend/cann/set_env.sh | Must install matching ops package |
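The path convention in the table can be encoded in a small helper. A sketch (the two paths are from the table; the function and fallback behavior are ours):

```python
import os

# Paths from the CANN Version Guide table; 8.1.RC1 shares the 8.3.RC1 layout.
ENV_SCRIPTS = {
    "8.1": "/usr/local/Ascend/ascend-toolkit/set_env.sh",
    "8.3": "/usr/local/Ascend/ascend-toolkit/set_env.sh",
    "8.5": "/usr/local/Ascend/cann/set_env.sh",
}

def set_env_path(cann_version: str) -> str:
    """Pick the set_env.sh path for a CANN version like '8.3.RC1' or '8.5.0'."""
    major_minor = ".".join(cann_version.split(".")[:2])
    if major_minor in ENV_SCRIPTS:
        return ENV_SCRIPTS[major_minor]
    # Unknown version: fall back to whichever install actually exists
    for path in ENV_SCRIPTS.values():
        if os.path.exists(path):
            return path
    raise FileNotFoundError("No CANN set_env.sh found")
```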
```bash
# Auto-detect CANN version
./scripts/setup_env.sh
```

## Core Parameters

| Parameter | Required | Description | Example |
|---|---|---|---|
| --model | Yes | Input ONNX model path | --model=resnet50.onnx |
| --framework | Yes | Framework type (5 = ONNX) | --framework=5 |
| --output | Yes | Output OM model path | --output=resnet50 |
| --soc_version | Yes | Must match the device exactly | --soc_version=Ascend910B3 |
| --input_shape | Optional | Input tensor shapes | --input_shape="input:1,3,224,224" |
| --precision_mode | Optional | Precision mode | --precision_mode=force_fp16 |

For the complete parameter list, see `PARAMETERS.md`.

## SoC Version Reference

| Device | SoC Version | How to Check |
|---|---|---|
| Atlas 910B3 | Ascend910B3 | npu-smi info \| grep Name |
| Atlas 310P | Ascend310P1/P3 | npu-smi info \| grep Name |
| Atlas 200I DK A2 | Ascend310B4 | npu-smi info \| grep Name |

Always verify with `npu-smi info` - do not assume the version!

## Troubleshooting

### Error: `Opname not found in model`
```bash
# Verify input names
python3 scripts/get_onnx_info.py model.onnx

# Use the correct name in the conversion
atc --model=model.onnx --input_shape="correct_name:1,3,224,224" ...
```

### Error: `Invalid soc_version`
```bash
# Check the actual chip version - it must be an EXACT match
npu-smi info | grep Name
# Use: Ascend + Name value (e.g., Ascend910B3, not Ascend910B)
```
### Conversion Too Slow

```bash
export TE_PARALLEL_COMPILER=16
atc --model=model.onnx ...
```

### YOLO Detection Results Look Wrong

- Ensure you're using the correct `--task` parameter
- Detection models need decode + NMS (the script handles this)
- Pose models output the top-300 detections (no NMS needed)

See `FAQ.md` for more troubleshooting.

## Resources

### scripts/

**Conversion & Environment:**

- `check_env_enhanced.sh` - ⭐ Comprehensive compatibility check
- `get_onnx_info.py` - Inspect ONNX model inputs/outputs
- `setup_env.sh` - Auto-setup CANN environment with SoC warning
- `convert_onnx.sh` - Batch conversion helper

**Inference & Testing:**

- `yolo_om_infer.py` - ⭐ End-to-end YOLO inference (detect/pose/segment/obb)
- `infer_om.py` - ⭐ Python inference for OM models using ais_bench
- `compare_precision.py` - ⭐ Compare ONNX vs OM output precision

### references/

- `YOLO_GUIDE.md` - ⭐ Detailed YOLO guide (formats, post-processing)
- `PARAMETERS.md` - Complete ATC parameter reference
- `AIPP_CONFIG.md` - AIPP configuration guide
- `INFERENCE.md` - ais_bench inference guide
- `FAQ.md` - Frequently asked questions
- `CANN_VERSIONS.md` - Version-specific guidance