Akamai Technologies Inc. USA.
World Journal of Advanced Engineering Technology and Sciences, 2025, 15(03), 1197–1206
Article DOI: 10.30574/wjaets.2025.15.3.1040
Received on 30 April 2025; revised on 10 June 2025; accepted on 12 June 2025
This article presents a comprehensive framework for addressing reliability challenges in modern cloud computing environments. The article explores the evolution from traditional redundancy-based approaches to sophisticated predictive analytics and integrated security postures essential for maintaining high availability in distributed systems. The article examines how real-time monitoring methodologies, combined with machine learning techniques like Modified Sequential Minimal Optimization and Weibull Distribution Analysis, can anticipate and prevent service disruptions before they impact users. The article analyzes architectural considerations that minimize complexity and decouple components to contain failure propagation while evaluating how service-level agreements must evolve to reflect multidimensional reliability requirements. Through an enterprise-scale case study, the article demonstrates the practical implementation of these principles and their transformative impact on both technical metrics and business outcomes. The article highlights emerging trends in cloud reliability engineering, including observability platforms, AIOps capabilities, and reliability-as-code approaches, while identifying research gaps and future opportunities. This article contributes to the growing field of cloud resilience by integrating technical, organizational, and economic perspectives into a holistic reliability strategy suitable for increasingly complex cloud deployments.
Cloud Reliability Engineering; Predictive Failure Analytics; Observability; Service Level Objectives; Chaos Engineering
Preview Article PDF
Anil Kumar Gottepu. Capturing and mitigating reliability issues in cloud computing: A comprehensive approach. World Journal of Advanced Engineering Technology and Sciences, 2025, 15(03), 1197-1206. Article DOI: 10.30574/wjaets.2025.15.3.1040.