TechSnap #2: Why Do We Need Spark?

Playback speed

Share post at current time

Share from 0:00

0:00

Transcript

TechSnap #2: Why Do We Need Spark?

Huong Vuong

Jun 01, 2025

Transcript

Download PDF here

Why Do We Need Spark

13.9MB ∙ PDF file

Download

🚀 TechSnap Ep. 2 is live!
In this episode, we take a very high-level look at Apache Spark — what it is, why it matters, and how it’s powering large-scale data processing across industries.

💡 With Spark 4.0 rolling out recently, it’s a great time to revisit the basics. The ecosystem is evolving fast — and understanding the why behind Spark can help you see where data engineering is heading next.

✨ This episode is not about tuning configs or diving deep into cluster architecture. It’s for those who’ve never used Spark — tech leaders, analysts, or anyone curious about modern data platforms.

We walk through:

The core business challenges of big data: volume, velocity, and flexibility
Why Spark more than just running SQL — it's a flexible engine that supports Python, Scala, and more
4 real-world use cases from finance, e-commerce, healthcare, and telecom

🎙 On a personal note: I've worked with Spark for years. Most of my past sharing has been deep-dive problem-solving for specific Spark issues.
But recently, a Vietnamese colleague asked me, “Is Spark the same as Presto?” — and it hit me: many smart folks still haven’t had the chance to understand Spark at the foundational level.

So this episode is my way of bridging that gap — simple and beginner-friendly.

📺 Give it a watch, and I’d love to hear your feedback.
This is just the beginning of our TechSnap journey into data engineering, analytics, and AI — made simple and accessible.

Because learning tech shouldn’t take forever. Sometimes, all you need is a spark. ⚡

Code Cook Cash

TechSnap #2: Why Do We Need Spark?

Discussion about this video