Benchmarking NLP and Computer Vision Models on Domain-Specific Architectures: Standard vs. TensorRT-Optimized Performance

Laila Nassef, Rana A. Tarabishi, Sara A. Abo Alnasor

Abstract

In this work, we systematically study the performance of deep learning models for Natural Language Processing (NLP) and Computer Vision (CV) tasks using two popular representative architectures, ResNet-50 and BERT, across two configurations: a standard ONNX Runtime configuration and an optimized TensorRT configuration. The main goal is to measure and compare inference time, throughput, CPU and GPU utilization, and memory usage of both models on Domain-Specific Architectures (DSAs), in this case an NVIDIA GeForce RTX 3060 GPU. We experimentally demonstrate the trade-offs between latency-focused and throughput-focused optimizations and their implications for at-scale deployment in realistic, resource-constrained environments. The main findings show that the TensorRT-optimized configuration yields much higher throughput (up to 432 inferences per second with ResNet-50), while the standard configuration shows lower inference time, making it more appropriate for latency-sensitive applications. Notably, BERT's dense transformer structure and large parameter count give it much higher resource demands than ResNet-50, highlighting that model choices must balance performance against the constraints imposed by the deployment environment. Analysis of CPU and GPU utilization further illustrates the efficiency gains and potential bottlenecks associated with each configuration. Along with the benchmarking results, we also describe model-serving optimizations, namely dynamic batching, mixed precision, and memory management techniques, that improve both throughput and inference time. This study gives practitioners the information needed to choose between model configurations and optimization strategies for effective deployment of NLP and CV models on DSAs.
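As a concrete illustration of the two configurations compared above, the sketch below times a ResNet-50 ONNX model under ONNX Runtime's CUDA execution provider (the standard setup) and its TensorRT execution provider (the optimized setup), reporting mean latency and throughput. This is a minimal sketch, not the paper's benchmarking harness: the model file name, batch size, input shape, and iteration counts are illustrative assumptions.

```python
# Minimal latency/throughput benchmarking sketch. Assumes onnxruntime-gpu with
# TensorRT support is installed and a ResNet-50 ONNX file exists locally; the
# file name, batch size, and iteration counts below are hypothetical choices.
import time
import numpy as np
import onnxruntime as ort

MODEL_PATH = "resnet50.onnx"   # hypothetical path, not taken from the paper
BATCH_SIZE = 1
WARMUP, RUNS = 20, 200

def benchmark(providers):
    sess = ort.InferenceSession(MODEL_PATH, providers=providers)
    name = sess.get_inputs()[0].name
    x = np.random.rand(BATCH_SIZE, 3, 224, 224).astype(np.float32)
    # Warm-up runs so TensorRT engine building / lazy initialization is excluded.
    for _ in range(WARMUP):
        sess.run(None, {name: x})
    start = time.perf_counter()
    for _ in range(RUNS):
        sess.run(None, {name: x})
    elapsed = time.perf_counter() - start
    latency_ms = elapsed / RUNS * 1000.0
    throughput = RUNS * BATCH_SIZE / elapsed   # inferences per second
    return latency_ms, throughput

# Standard configuration: plain CUDA execution provider.
print("Standard :", benchmark(["CUDAExecutionProvider", "CPUExecutionProvider"]))
# TensorRT-optimized configuration: TensorRT EP with CUDA as fallback.
print("TensorRT :", benchmark(["TensorrtExecutionProvider", "CUDAExecutionProvider"]))
```

Larger batch sizes in the same loop would favor the throughput-oriented TensorRT configuration, while single-sample runs highlight the latency behavior discussed in the abstract.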
