Home
World Journal of Advanced Engineering Technology and Sciences
International, Peer reviewed, Referred, Open access | ISSN Approved Journal

Main navigation

  • Home
    • Journal Information
    • Abstracting and Indexing
    • Editorial Board Members
    • Reviewer Panel
    • Journal Policies
    • WJAETS CrossMark Policy
    • Publication Ethics
    • Instructions for Authors
    • Article processing fee
    • Track Manuscript Status
    • Get Publication Certificate
    • Issue in Progress
    • Current Issue
    • Past Issues
    • Become a Reviewer panel member
    • Join as Editorial Board Member
  • Contact us
  • Downloads

ISSN: 2582-8266 (Online)  || UGC Compliant Journal || Google Indexed || Impact Factor: 9.48 || Crossref DOI

Fast Publication within 2 days || Low Article Processing charges || Peer reviewed and Referred Journal

Research and review articles are invited for publication in Volume 18, Issue 3 (March 2026).... Submit articles

Cloud ETL optimization with AWS glue and spark

Breadcrumb

  • Home
  • Cloud ETL optimization with AWS glue and spark

Sarvesh Kumar Gupta *

Western Governors University, Utah, USA.

Review Article

 

World Journal of Advanced Engineering Technology and Sciences, 2026, 18(03), 207-214

Article DOI: 10.30574/wjaets.2026.18.3.0076

DOI url: https://doi.org/10.30574/wjaets.2026.18.3.0076

Received on 26 December 2025; revised on 18 February 2026; accepted on 21 February 2026

Cloud-native ETL has become a cornerstone of modern data architectures, enabling real-time analytics, scalable machine learning pipelines, and cost-efficient data processing. AWS Glue and Apache Spark represent a powerful duo for building robust and serverless ETL frameworks. This review has examined their capabilities in depth—covering architecture, tuning methods, and best practices. It also highlights experimental benchmarks, key optimization strategies, and emerging trends that define the future of ETL. The findings suggest that with the right design patterns and tuning, organizations can significantly boost performance while reducing both cost and operational complexity.

Cloud ETL; AWS Glue; Apache Spark; DataFrames; DynamicFrames; Partition Pruning; Predicate Pushdown; Parquet; Delta Lake; Data Lakehouse; Serverless Data Pipelines

https://wjaets.com/sites/default/files/fulltext_pdf/WJAETS-2026-0076.pdf

Get Your e Certificate of Publication using below link

Download Certificate

Preview Article PDF

Sarvesh Kumar Gupta. Cloud ETL optimization with AWS glue and spark. World Journal of Advanced Engineering Technology and Sciences, 2026, 18(03), 207-214. Article DOI: https://doi.org/10.30574/wjaets.2026.18.3.0076

Get Certificates

Get Publication Certificate

Download LoA

Check Corssref DOI details

Issue details

Issue Cover Page

Editorial Board

Table of content


Copyright © Author(s). All rights reserved. This article is published under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the original author(s) and source, a link to the license is provided, and any changes made are indicated.


Copyright © 2026 World Journal of Advanced Engineering Technology and Sciences

Developed & Designed by VS Infosolution