Home
World Journal of Advanced Engineering Technology and Sciences
International, Peer reviewed, Referred, Open access | ISSN Approved Journal

Main navigation

  • Home
    • Journal Information
    • Abstracting and Indexing
    • Editorial Board Members
    • Reviewer Panel
    • Journal Policies
    • WJAETS CrossMark Policy
    • Publication Ethics
    • Instructions for Authors
    • Article processing fee
    • Track Manuscript Status
    • Get Publication Certificate
    • Issue in Progress
    • Current Issue
    • Past Issues
    • Become a Reviewer panel member
    • Join as Editorial Board Member
  • Contact us
  • Downloads

ISSN: 2582-8266 (Online)  || UGC Compliant Journal || Google Indexed || Impact Factor: 9.48 || Crossref DOI

Fast Publication within 2 days || Low Article Processing charges || Peer reviewed and Referred Journal

Research and review articles are invited for publication in Volume 18, Issue 3 (March 2026).... Submit articles

Enhancing data processing with Apache spark: A technical deep dive

Breadcrumb

  • Home
  • Enhancing data processing with Apache spark: A technical deep dive

Avinash Dulam *

Osmania University, Hyderabad, India 

Review Article

World Journal of Advanced Engineering Technology and Sciences, 2025, 15(03), 1279–1284

Article DOI: 10.30574/wjaets.2025.15.3.0910

DOI url: https://doi.org/10.30574/wjaets.2025.15.3.0910

Received on 02 May 2025; revised on 10 June 2025; accepted on 12 June 2025

Apache Spark has revolutionized big data processing by introducing a unified computing framework that addresses the challenges of distributed data processing, real-time analytics, and machine learning at scale. The framework's architecture, built on Resilient Distributed Datasets (RDDs), enables fault-tolerant parallel operations while providing sophisticated optimization techniques for enhanced performance. Through advanced features like Structured Streaming, DataFrame abstractions, and MLlib integration, Spark offers comprehensive solutions for modern data processing needs, from batch processing to real-time analytics, effectively supporting organizations in managing exponentially growing data volumes while maintaining processing efficiency and scalability. The platform's innovative approach to data abstraction, combined with its robust optimization capabilities and integration with modern computing paradigms, establishes it as a cornerstone technology for enterprises seeking to harness the power of big data while minimizing operational complexity and maximizing resource utilization across diverse processing environments.

Distributed Computing; Data Processing Optimization; Stream Processing; Machine Learning Integration; Resource Management

https://wjaets.com/sites/default/files/fulltext_pdf/WJAETS-2025-0910.pdf

Preview Article PDF

Avinash Dulam. Enhancing data processing with Apache spark: A technical deep dive. World Journal of Advanced Engineering Technology and Sciences, 2025, 15(03), 1279-1284. Article DOI: 10.30574/wjaets.2025.15.3.0910.

Get Certificates

Get Publication Certificate

Download LoA

Check Corssref DOI details

Issue details

Issue Cover Page

Editorial Board

Table of content


Copyright © Author(s). All rights reserved. This article is published under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the original author(s) and source, a link to the license is provided, and any changes made are indicated.


Copyright © 2026 World Journal of Advanced Engineering Technology and Sciences

Developed & Designed by VS Infosolution