Nordic Edge AI 

Ultra-low-power edge AI across the industry's broadest range of applications 

Your data. Your model. Your edge.

From saving bandwidth and energy to more responsive real-time performance, implementing AI in your embedded applications offers massive benefits beyond the buzzwords. Nordic Semiconductor offers two unique technologies, Neuton models and Axon NPU, exclusively to our customers, to cover the industry's broadest range of devices, applications, and customer needs.

Axon NPU

  Axon_Logo_horizontal_Gradient_RGB copy.png   

 

Edge AI without compromise

For years, adding AI to wireless IoT devices meant trading battery life for performance. Running TensorFlow Lite on a CPU was often too slow and memory-intensive, while discrete NPUs added cost and complexity. Although Neuton models already enable efficient edge AI on the nRF54L Series CPU, demanding workloads like audio, imaging, and high-rate sensor data need dedicated acceleration.

Integrated AI acceleration

Axon is Nordic’s proprietary NPU, integrated into the high-memory nRF54LM20B SoC. It accelerates TensorFlow Lite models with up to 15x faster inference than the CPU, and delivers up to 7x higher performance and up to 8x better efficiency than the closest competing wireless NPU, bringing powerful edge AI to ultra-low-power devices.

Simpler designs and broader possibilities

By integrating the NPU on-chip, Axon removes the need for discrete accelerators, reducing power, BoM cost, and development complexity. Axon enables industry-leading energy efficiency across use cases ranging from anomaly detection and biometrics to sound, keyword, and image recognition – leveraging fully accelerated edge AI.

Benchmarks

Axon NPU running @ 128 MHz in nRF54LM20B SoC

Benchmarks from ML Commons, MLPerf Inference: Tiny

Benchmak source: https://github.com/mlcommons/tiny/tree/master/benchmark

Axon NPU average current and energy consumption

Benchmark  Inference time Avg. Current @ 3 V Energy
KWS (DS-CNN)

4.5 ms 3.0 mA 40.5 µJ
IC (ResNet-8)   17.9 ms 2.7 mA 145 µJ
AD (Deep AutoEncoder) 1.3 ms 3.5 mA 13.7 µJ
VWW (MNV1 .25x) 14.4 ms 2.8 mA 121 µJ

 

 

 

Axon NPU inference times compared to Arm Cortex-M33

Benchmark  Arm Cortex-M33 @ 128 MHz Axon NPU @ 128 MHz Performance improvement
KWS (DS-CNN)

70 ms 4.5 ms 15x
IC (ResNet-8)   303 ms 17.9 ms 17x
AD (Deep AutoEncoder) 7 ms 1.3 ms 5x
VWW (MNV1 .25x) 223 ms 14.4 ms 15x

Axon NPU

Products containing our AI accelerator

nRF54LM20B SoC

Ultra-low-power wireless SoC with integrated Axon NPU, for hardware-accelerated edge AI applications, supporting Bluetooth LE, Channel Sounding, Bluetooth Mesh, Zigbee, Thread, Matter, Aliro, and 2.4 GHz proprietary protocols.

128 MHz Arm Cortex-M33
2 MB NVM, 512 KB RAM
128 MHz integrated Axon NPU
High-speed USB