Serverless architectures for agentic AI deployment

Gaurav Samdani *, Kabita Paul and Flavia Saldanha

Independent Publisher, USA.
 
Research Article
World Journal of Advanced Engineering Technology and Sciences, 2022, 07(02), 320-333.
Article DOI: 10.30574/wjaets.2022.7.2.0144
Publication history: 
Received on 01 November 2022; revised on 22 December 2022; accepted on 25 December 2022
 
Abstract: 
This paper presents directions on improving scalabilities, costs, and flexibility in serverless architectures incorporating agentic AI deployment. Using event-driven and a pay-as-you-go model, Serverless computing is shown to be an optimal way to deploy agentic AI systems due to their need for flexibility. The research objectives include the assessment of the possibilities for serverless platforms, the assessment of the effectiveness of its case applications, and the development of a solid methodology for its application in real life. The methodology uses case studies, comparative analysis, and evaluation metrics to determine the benefits of serverless computing to AI workloads. The main findings emphasize latency optimization, cost-effectiveness, and flexibility of operations. These insights are mostly general for businesses, developers, and cloud providers interested in AI effectiveness and deployment. Consequently, this research finds that linking serverless architectures to agentic AI endorses innovation possibilities in deploying AI.
 
Keywords: 
Agentic AI; Serverless Computing; Dynamic Scalability; Cost Efficiency; Event-Driven Models; Infrastructure Automation
 
Full text article in PDF: