Staff Software Engineer, VISA USA, INC.
World Journal of Advanced Engineering Technology and Sciences, 2025, 14(03), 538-546
Article DOI: 10.30574/wjaets.2025.14.3.0179
Received on 17 February 2025; revised on 28 March 2025; accepted on 30 March 2025
Dynamic load balancing is a key challenge in AI-enabled cloud infrastructures with volatile resource demand. This results in resource utilization drifting away from balance and creating performance loss, so the infrastructure starts to operate inefficiently. In this paper, we introduce a principled approach based on reinforcement learning and algorithmic optimization to dynamically allocate the load across the infrastructure. Our approach is based on reinforcement learning, providing instructions on what the ideal actions for load balancing in an ever-changing environment are. It takes advantage of a deep neural network to capture the complex interactions from historical states and associated load-balancing actions. The best actions are selected by maximizing the sum of rewards, taking into account short-term and long-term objectives. To increase the efficiency of the load balancing even further, we then apply algorithmic optimization approaches like genetic algorithms and ant colony optimization. Smart load-balancing strategies: These are done using an introduction of deep Q-learning algorithms, which helps in the optimization of the decision-making process of such reinforcement learning agent targeting for highly intelligent and efficient load-balancing act aggregate. Experimental results based on simulations and real-world experiments show that our framework can help network programs highly efficiently balance workloads and significantly improve the performance of the infrastructure. It can adjust to changing resource demands and conditions as well, so it should prove effective against such a dynamic environment. Overall, we present a new paradigm for implementing dynamic load balancing for AI cloud infrastructures. By combining the best of reinforcement learning and algorithmic optimization, it can improve resource utilization, delivering high-performance servers.
Load Balancing; Data Processing; Dynamic Environment; Reinforcement Learning; Flexibility; Efficient Resource
Preview Article PDF
Navya Krishna Alapati. Real-time data analytics and processing for adaptive load balancing in cloud infrastructures. World Journal of Advanced Engineering Technology and Sciences, 2025, 14(03), 538-546. Article DOI: https://doi.org/10.30574/wjaets.2025.14.3.0179.