Analyzing Deep Learning Model Inferences for Image Classification using OpenVINO
It may be desirable to execute deep learning model inferences on an integrated GPU at the edge. While such GPUs are much less powerful than discrete GPUs, it is able to deliver higher floating-point operations per second than a CPU located on the same die. For edge devices, the benefit of moving to lower precision with minimal loss of accuracy to obtain higher performance is also attractive. Hence, we chose 14 deep learning models for image classification to evaluate their inference performance with the OpenVINO toolkit. Then, we analyzed the implementation of the fastest inference model of all the models. The experimental results are promising. Compared to the performance of full-precision (FP32) models, the speedup of the 8-bit (INT8) quantization ranges from 1.02 to 1.56 on an Intel (R) Xeon (R) 4-core CPU, and the speedup of the FP16 models ranges from 1.1 to 2 on an Intel (R) Iris (TM) Pro GPU. For the FP32 models, the GPU is on average 1.5X faster than the CPU.
- Research Organization:
- Argonne National Laboratory (ANL)
- Sponsoring Organization:
- USDOE Office of Science
- DOE Contract Number:
- AC02-06CH11357
- OSTI ID:
- 1804060
- Country of Publication:
- United States
- Language:
- English
Similar Records
Analyzing inference workloads for spatiotemporal modeling
A GPU accelerated mixed-precision Smoothed Particle Hydrodynamics framework with cell-based relative coordinates
Accelerating gravitational microlensing simulations using the Xeon Phi coprocessor
Journal Article
·
Mon Sep 16 20:00:00 EDT 2024
· Future Generations Computer Systems
·
OSTI ID:2513464
A GPU accelerated mixed-precision Smoothed Particle Hydrodynamics framework with cell-based relative coordinates
Journal Article
·
Sun Jan 28 19:00:00 EST 2024
· Engineering Analysis with Boundary Elements
·
OSTI ID:2283916
Accelerating gravitational microlensing simulations using the Xeon Phi coprocessor
Journal Article
·
Thu Apr 06 20:00:00 EDT 2017
· Astronomy and Computing
·
OSTI ID:1543509