Design and Implementation of Scalable Test Platforms for LLM Deployments


Reena Chandra

Abstract

As the adoption of Large Language Models (LLMs) and machine learning (ML) accelerates across domains, there is a growing need for scalable, cost-efficient, and reproducible deployment frameworks. This study introduces a cloud-native benchmarking architecture that integrates Google Colab for rapid model development with Amazon Web Services (AWS) for deployment simulation. Four ML models, namely Random Forest, XGBoost, LightGBM, and a Multi-Layer Perceptron (MLP), are trained and evaluated on a tabular classification dataset, then aligned with suitable AWS services (Lambda, EC2, SageMaker) based on their computational and concurrency profiles. The models are assessed on classification metrics, latency, cold start behaviour, and cost per inference. Findings reveal that XGBoost is optimal for stateless, serverless deployment via AWS Lambda, while the MLP is better suited to EC2 due to its memory demands. LightGBM benefits from SageMaker's managed scalability. The framework demonstrates the viability of surrogate model benchmarking for LLM scenarios using lightweight ML models, and offers a reproducible, low-cost pipeline to support MLOps practices in cloud environments.
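The benchmarking loop described in the abstract can be outlined in a short Python sketch. The synthetic dataset, hyperparameters, and metric choices below are illustrative assumptions rather than the paper's exact configuration; the sketch only shows how the four surrogate models might be trained and timed side by side before being mapped to AWS services.

```python
# Minimal sketch of a Colab-side benchmarking loop, under assumed settings.
# Dataset, hyperparameters, and metrics are placeholders, not the paper's setup.
import time
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score, f1_score
from xgboost import XGBClassifier
from lightgbm import LGBMClassifier

# Synthetic tabular classification data stands in for the study's dataset.
X, y = make_classification(n_samples=10_000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

models = {
    "RandomForest": RandomForestClassifier(n_estimators=200, random_state=42),
    "XGBoost": XGBClassifier(n_estimators=200, eval_metric="logloss"),
    "LightGBM": LGBMClassifier(n_estimators=200),
    "MLP": MLPClassifier(hidden_layer_sizes=(64, 32), max_iter=300, random_state=42),
}

for name, model in models.items():
    model.fit(X_train, y_train)

    # Time batch inference and report a per-sample latency estimate.
    start = time.perf_counter()
    preds = model.predict(X_test)
    latency_ms = (time.perf_counter() - start) / len(X_test) * 1e3

    print(
        f"{name:12s} acc={accuracy_score(y_test, preds):.3f} "
        f"f1={f1_score(y_test, preds):.3f} latency={latency_ms:.4f} ms/sample"
    )
```

In the study's framework, per-sample latency and memory profiles gathered this way would then inform the mapping of each model to Lambda, EC2, or SageMaker; cold start and cost-per-inference measurements require the deployment-side simulation described in the paper.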
