Introduction
Many applications like smart manufacturing, surveillance and augmented reality require Artificial Intelligence to be implemented on the EDGE. Some of the challenges in implementing AI on the EDGE are –
- Achieving lowest possible latency
- Network pruning to improve performance without compromising on accuracy
- Flexible hardware that can keep pace with innovation
The Xilinx Edge AI platform is well suited for such applications as it is built on adaptable compute platform that retains the flexibility to change as per latest innovation (allowing only domain specific architectures to be updated) while delivering on low latency and end to end acceleration. The Xilinx Edge AI Platform provides comprehensive tools and models which utilize unique deep compression and hardware-accelerated Deep Learning technology. The Xilinx Edge platform delivers exceptional performance using pruning technology which is discussed in detail in Part 2 of this blog post.
Following are steps involved in implementing an EDGE AI solution for a specific application:
In this case study we will discuss the implementation of a classification model that we have developed for a retail use case using Xilinx Edge AI platform on a Zynq UltraScale+ Evaluation board (ZCU106).
Xilinx Edge AI Architecture
The Xilinx Edge AI platform provides comprehensive tools to implement Deep Learning on Zynq hardware.
YantraVision has extensive capability in developing solutions using Deep Learning for applications in the Retail and Manufacturing industry. For the purpose of this case study we have implemented the classification model that we use in our Retail solution on a Zynq UltraScale+ Evaluation board using Xilinx Edge AI platform.
Evaluation Results for Retail Application
CNN | Vgg16Net |
Elapsed CPU time for the training (sec) | 3600 seconds |
Parameters | 15,250,250 |
Epochs | 60 |
Dataset | fmnist |
TF Output Node | dense_2/softmax |
Training Accuracy | 0.91360 |
Frozen graph top-1 accuracy | 0.8854 |
Quantized graph top-1 accuracy | 0.8854 |
Run time accuracy in board | 0.95 |
Throughput(fps) in board | 54.8419 |
Conclusion
The Xilinx Edge AI platform is built on adaptable compute platform that retains the flexibility to change as per latest innovation (allowing only domain specific architectures to be updated) while delivering on low latency and end to end acceleration. The Xilinx Deep Learning Processor Unit (DPU) is part of the EDGE AI stack and is a programmable engine dedicated for convolutional neural network (CNN). The DPU IP can be integrated as a block in the programmable logic (PL) of the selected Zynq-7000 SoC and Zynq UltraScale™+ MPSoC devices with direct connections to the processing system (PS).
Yantravision has extensive capability in building and training our own neural network models. We also have the capability to implement on Deep Learning using Xilinx Edge AI.
For details regarding building a Deep Learning solution on Xilinx from scratch write to us at sales@yantravision.com