Home
World Journal of Advanced Engineering Technology and Sciences
International, Peer reviewed, Referred, Open access | ISSN Approved Journal

Main navigation

  • Home
    • Journal Information
    • Abstracting and Indexing
    • Editorial Board Members
    • Reviewer Panel
    • Journal Policies
    • WJAETS CrossMark Policy
    • Publication Ethics
    • Instructions for Authors
    • Article processing fee
    • Track Manuscript Status
    • Get Publication Certificate
    • Issue in Progress
    • Current Issue
    • Past Issues
    • Become a Reviewer panel member
    • Join as Editorial Board Member
  • Contact us
  • Downloads

ISSN: 2582-8266 (Online)  || UGC Compliant Journal || Google Indexed || Impact Factor: 9.48 || Crossref DOI

Fast Publication within 2 days || Low Article Processing charges || Peer reviewed and Referred Journal

Research and review articles are invited for publication in Volume 18, Issue 2 (February 2026).... Submit articles

Serverless computing for ML workloads: The convergence of on-demand resources and model deployment

Breadcrumb

  • Home
  • Serverless computing for ML workloads: The convergence of on-demand resources and model deployment

Ramya Boorugula *

Srinivasa Institute of Technology and Management Studies, India.

Review Article

World Journal of Advanced Engineering Technology and Sciences, 2025, 15(02), 918-924

Article DOI: 10.30574/wjaets.2025.15.2.0637

DOI url: https://doi.org/10.30574/wjaets.2025.15.2.0637

Received on 28 March 2025; revised on 03 May 2025; accepted on 06 May 2025

Serverless computing represents a transformative approach for machine learning deployments, offering event-driven execution, automatic scaling, and pay-per-use billing models that address longstanding operational challenges. This article explores the convergence of serverless architectures with machine learning workloads, examining how this integration reshapes deployment practices and operational economics. The global serverless architecture market continues rapid expansion, with ML deployments representing an increasingly significant segment. The evolution from traditional server-based deployments through containerization to serverless paradigms reveals quantifiable benefits in resource utilization, operational overhead reduction, and cost efficiency for intermittent workloads. Current serverless ML solutions demonstrate substantial improvements in cold start latencies, memory limitations, and specialized hardware access compared to earlier implementations. Performance analysis reveals nuanced tradeoffs between dedicated and serverless infrastructures across dimensions of latency, throughput, cost efficiency, resource utilization, and operational overhead. Implementation strategies including hybrid architectures, model optimization techniques, effective resource provisioning, and targeted cost management approaches collectively enable organizations to maximize benefits while mitigating limitations. This comprehensive article provides ML practitioners and architects with actionable insights to navigate the evolving serverless ML landscape and make informed decisions about where serverless approaches offer maximum value in deployment strategies. 

Serverless Computing; Machine Learning Deployment; Automatic Scaling; Cost Optimization; Hybrid Architectures

https://wjaets.com/sites/default/files/fulltext_pdf/WJAETS-2025-0637.pdf

Preview Article PDF

Ramya Boorugula. Serverless computing for ML workloads: The convergence of on-demand resources and model deployment. World Journal of Advanced Engineering Technology and Sciences, 2025, 15(02), 918-924. Article DOI: https://doi.org/10.30574/wjaets.2025.15.2.0637.

Get Certificates

Get Publication Certificate

Download LoA

Check Corssref DOI details

Issue details

Issue Cover Page

Editorial Board

Table of content


Copyright © Author(s). All rights reserved. This article is published under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the original author(s) and source, a link to the license is provided, and any changes made are indicated.


Copyright © 2026 World Journal of Advanced Engineering Technology and Sciences

Developed & Designed by VS Infosolution