This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.
…
continue reading
Welcome to The Data Flowcast: Mastering Airflow for Data Engineering & AI — the podcast where we keep you up to date with insights and ideas propelling the Airflow community forward. Join us each week, as we explore the current state, future and potential of Airflow with leading thinkers in the community, and discover how best to leverage this workflow management system to meet the ever-evolving needs of data engineering and AI ecosystems. Podcast Webpage: https://www.astronomer.io/podcast/
…
continue reading
The Data Engineering Show is a podcast for data engineering and BI practitioners to go beyond theory. Learn from the biggest influencers in tech about their practical day-to-day data challenges and solutions in a casual and fun setting. SEASON 1 DATA BROS Eldad and Boaz Farkash shared the same stuffed toys growing up as well as a big passion for data. After founding Sisense and building it to become a high-growth analytics unicorn, they moved on to their next venture, Firebolt, a leading hig ...
…
continue reading
Discussions around Data Engineering
…
continue reading
Databases and data engineering episodes of Software Engineering Daily
…
continue reading
Unlocking the Power of Data: A Guide for Leaders and Executives" As a leader or executive, you know the importance of data in driving business decisions and staying ahead of the competition. But, with the increasing amount of data generated daily, it can be overwhelming to know where to start and how to utilize this valuable asset effectively. This blog, with multiple topics, addresses the technical terminology in data engineering and analytics on the cloud.
…
continue reading

1
Airflow’s Role in the Rise of DataOps with Andy Byron
26:15
26:15
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
26:15The orchestration layer is evolving into a critical component of the modern data stack. Understanding its role in DataOps is key to optimizing workflows, improving reliability and reducing complexity. In this episode, Andy Byron, CEO at Astronomer, discusses the rapid growth of Apache Airflow, the increasing importance of orchestration and how Astr…
…
continue reading

1
Bringing AI Into The Inner Loop of Data Engineering With Ascend
52:47
52:47
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
52:47Summary In this episode of the Data Engineering Podcast Sean Knapp, CEO of Ascend.io, explores the intersection of AI and data engineering. He discusses the evolution of data engineering and the role of AI in automating processes, alleviating burdens on data engineers, and enabling them to focus on complex tasks and innovation. The conversation cov…
…
continue reading
In this episode of The Data Engineering Show, host Benjamin and co-host Eldad sit with CEO DuckDB Labs and co-creator DuckDB, Hannes Mühleisen. Together, they: Talk about the journey of DuckDB, an open-source analytical database system designed as a universal wrangling tool. Explain how DuckDB differs from SQLite, highlighting the analytical and tr…
…
continue reading

1
The Software Risk That Affects Everyone and How To Address It with Michael Winser and Jarek Potiuk
28:27
28:27
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
28:27The security of open-source software is a growing concern, especially as dependencies and regulations become more complex, making it essential to understand how to manage software supply chains effectively. In this episode, we sit down with Michael Winser, Co-Founder at Alpha-Omega and Security Strategy Ambassador at Eclipse Foundation, and Jarek P…
…
continue reading

1
Astronomer's Role in the Airflow Ecosystem: A Deep Dive with Pete DeJoy
51:41
51:41
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
51:41Summary In this episode of the Data Engineering Podcast Pete DeJoy, co-founder and product lead at Astronomer, talks about building and managing Airflow pipelines on Astronomer and the upcoming improvements in Airflow 3. Pete shares his journey into data engineering, discusses Astronomer's contributions to the Airflow project, and highlights the cr…
…
continue reading

1
Building Scalable ML Infrastructure at Outerbounds with Savin Goyal
36:46
36:46
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
36:46Machine learning is changing fast, and companies need better tools to handle AI workloads. The right infrastructure helps data scientists focus on solving problems instead of managing complex systems. In this episode, we talk with Savin Goyal, Co-Founder and CTO at Outerbounds, about building ML infrastructure, how orchestration makes workflows eas…
…
continue reading

1
Accelerated Computing in Modern Data Centers With Datapelago
55:36
55:36
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
55:36Summary In this episode of the Data Engineering Podcast Rajan Goyal, CEO and co-founder of Datapelago, talks about improving efficiencies in data processing by reimagining system architecture. Rajan explains the shift from hyperconverged to disaggregated and composable infrastructure, highlighting the importance of accelerated computing in modern d…
…
continue reading

1
Customizing Airflow for Complex Data Environments at Stripe with Nick Bilozerov and Sharadh Krishnamurthy
27:40
27:40
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
27:40Keeping data pipelines reliable at scale requires more than just the right tools — it demands constant innovation. In this episode, Nick Bilozerov, Senior Data Engineer at Stripe, and Sharadh Krishnamurthy, Engineering Manager at Stripe, discuss how Stripe customizes Airflow for its needs, the evolution of its data orchestration framework and the t…
…
continue reading

1
Harnessing Airflow for Data-Driven Policy Research at CSET with Jennifer Melot
17:54
17:54
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
17:54Turning complex datasets into meaningful analysis requires robust data infrastructure and seamless orchestration. In this episode, we’re joined by Jennifer Melot, Technical Lead at the Center for Security and Emerging Technology (CSET) at Georgetown University, to explore how Airflow powers data-driven insights in technology policy research. Jennif…
…
continue reading

1
The Future of Data Engineering: AI, LLMs, and Automation
59:39
59:39
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
59:39Summary In this episode of the Data Engineering Podcast Gleb Mezhanskiy, CEO and co-founder of DataFold, talks about the intersection of AI and data engineering. He discusses the challenges and opportunities of integrating AI into data engineering, particularly using large language models (LLMs) to enhance productivity and reduce manual toil. The c…
…
continue reading

1
Leveraging Airflow To Build Scalable and Reliable Data Platforms at 99acres.com with Samyak Jain
25:08
25:08
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
25:08Data orchestration is evolving rapidly, with dynamic workflows becoming the cornerstone of modern data engineering. In this episode, we are joined by Samyak Jain, Senior Software Engineer - Big Data at 99acres.com. Samyak shares insights from his journey with Apache Airflow, exploring how his team built a self-service platform that enables non-tech…
…
continue reading

1
Evolving Responsibilities in AI Data Management
38:57
38:57
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
38:57Summary In this episode of the Data Engineering Podcast Bartosz Mikulski talks about preparing data for AI applications. Bartosz shares his journey from data engineering to MLOps and emphasizes the importance of data testing over software development in AI contexts. He discusses the types of data assets required for AI applications, including exten…
…
continue reading

1
Hybrid Testing Solutions for Autonomous Driving at Bosch with Jens Scheffler and Christian Schilling
33:45
33:45
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
33:45Testing autonomous vehicles demands precision, scalability and powerful orchestration tools — enter Apache Airflow, a key component of Bosch’s cutting-edge testing framework. In this episode, we sit down with Jens Scheffler, Test Execution Cluster Technical Architect, and Christian Schilling, Product Owner Open Loop Testing Automated Driving, both …
…
continue reading

1
AI and Data Movement: Trends and Best Practices with Estuary’s Daniel Pálma
30:33
30:33
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
30:33In this episode of The Data Engineering Show, the bros sit with Daniel Pálma, Head of Marketing at Estuary. Join them as they; Talk about Daniel’s career transition from data engineering to marketing and how his background in data engineering has been a tremendous help to his marketing competence. Discuss the role of AI in the evolution of data mov…
…
continue reading

1
Overcoming Airflow Scaling Challenges at Monzo Bank with Jonathan Rainer
43:39
43:39
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
43:39Scaling a data orchestration platform to manage thousands of tasks daily demands innovative solutions and strategic problem-solving. In this episode, we explore the complexities of scaling Airflow and the challenges of orchestrating thousands of tasks in dynamic data environments. Jonathan Rainer, Former Platform Engineer at Monzo Bank, joins us to…
…
continue reading

1
Orchestrating Analytics and AI Workflows at Telia with Arjun Anandkumar
26:00
26:00
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
26:00The future of data engineering lies in seamless orchestration and automation. In this episode, Arjun Anandkumar, Data Engineer at Telia, shares how his team uses Airflow to drive analytics and AI workflows. He highlights the challenges of scaling data platforms and how adopting best practices can simplify complex processes for teams across the orga…
…
continue reading

1
The Role of Airflow in Finance Transformation at Etraveli Group with Mihir Samant
21:19
21:19
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
21:19Transforming bottlenecked finance processes into streamlined, automated systems requires the right tools and a forward-thinking approach. In this episode, Mihir Samant, Senior Data Analyst at Etraveli Group, joins us to share how his team leverages Airflow to revolutionize finance automation. With extensive experience in data workflows and a passio…
…
continue reading

1
Inside Ford’s Data Transformation: Advanced Orchestration Strategies with Vasantha Kosuri-Marshall
38:54
38:54
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
38:54Data engineering is entering a new era, where orchestration and automation are redefining how large-scale projects operate. This episode features Vasantha Kosuri-Marshall, Data and ML Ops Engineer at Ford Motor Company. Vasantha shares her expertise in managing complex data pipelines. She takes us through Ford's transition to cloud platforms, the a…
…
continue reading

1
CSVs Will Never Die And OneSchema Is Counting On It
54:40
54:40
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
54:40Summary In this episode of the Data Engineering Podcast Andrew Luo, CEO of OneSchema, talks about handling CSV data in business operations. Andrew shares his background in data engineering and CRM migration, which led to the creation of OneSchema, a platform designed to automate CSV imports and improve data validation processes. He discusses the ch…
…
continue reading

1
Powering Finance With Advanced Data Solutions at Ramp with Ryan Delgado
24:35
24:35
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
24:35Data is the backbone of every modern business, but unlocking its full potential requires the right tools and strategies. In this episode, Ryan Delgado, Director of Engineering at Ramp, joins us to explore how innovative data platforms can transform business operations and fuel growth. He shares insights on integrating Apache Airflow, optimizing dat…
…
continue reading

1
AI and Data Change Management with Chad Sanderson, CEO Gable AI
36:43
36:43
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
36:43In this episode of The Data Engineering Show, host Benjamin and co-host Eldad sit with Chad Sanderson, CEO and co-founder of Gable AI to explore the interesting world of data change management. Join them as they: Delve into challenges of data quality, how it degrades over time and the one-sided data quality checks on the “last mile” of the data sup…
…
continue reading

1
Breaking Down Data Silos: AI and ML in Master Data Management
57:30
57:30
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
57:30Summary In this episode of the Data Engineering Podcast Dan Bruckner, co-founder and CTO of Tamr, talks about the application of machine learning (ML) and artificial intelligence (AI) in master data management (MDM). Dan shares his journey from working at CERN to becoming a data expert and discusses the challenges of reconciling large-scale organiz…
…
continue reading

1
Building a Data Vision Board: A Guide to Strategic Planning
49:59
49:59
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
49:59Summary In this episode of the Data Engineering Podcast Lior Barak shares his insights on developing a three-year strategic vision for data management. He discusses the importance of having a strategic plan for data, highlighting the need for data teams to focus on impact rather than just enablement. He introduces the concept of a "data vision boar…
…
continue reading

1
Exploring the Power of Airflow 3 at Astronomer with Amogh Desai
30:24
30:24
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
30:24What does it take to go from fixing a broken link to becoming a committer for one of the world’s leading open-source projects? Amogh Desai, Senior Software Engineer at Astronomer, takes us through his journey with Apache Airflow. From small contributions to building meaningful connections in the open-source community, Amogh’s story provides actiona…
…
continue reading

1
How Orchestration Impacts Data Platform Architecture
59:39
59:39
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
59:39Summary The core task of data engineering is managing the flows of data through an organization. In order to ensure those flows are executing on schedule and without error is the role of the data orchestrator. Which orchestration engine you choose impacts the ways that you architect the rest of your data platform. In this episode Hugo Lu shares his…
…
continue reading

1
Using Airflow To Power Machine Learning Pipelines at Optimove with Vasyl Vasyuta
24:11
24:11
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
24:11Data orchestration and machine learning are shaping how organizations handle massive datasets and drive customer-focused strategies. Tools like Apache Airflow are central to this transformation. In this episode, Vasyl Vasyuta, R&D Team Leader at Optimove, joins us to discuss how his team leverages Airflow to optimize data processing, orchestrate ma…
…
continue reading

1
An Exploration Of The Impediments To Reusable Data Pipelines
51:32
51:32
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
51:32Summary In this episode of the Data Engineering Podcast the inimitable Max Beauchemin talks about reusability in data pipelines. The conversation explores the "write everything twice" problem, where similar pipelines are built without code reuse, and discusses the challenges of managing different SQL dialects and relational databases. Max also touc…
…
continue reading

1
Maximizing Business Impact Through Data at GlossGenius with Katie Bauer
25:49
25:49
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
25:49Bridging the gap between data teams and business priorities is essential for maximizing impact and building value-driven workflows. Katie Bauer, Senior Director of Data at GlossGenius, joins us to share her principles for creating effective, aligned data teams. In this episode, Katie draws from her experience at GlossGenius, Reddit and Twitter to h…
…
continue reading

1
Optimizing Large-Scale Deployments at LinkedIn with Rahul Gade
27:47
27:47
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
27:47Scaling deployments for a billion users demands innovation, precision and resilience. In this episode, we dive into how LinkedIn optimizes its continuous deployment process using Apache Airflow. Rahul Gade, Staff Software Engineer at LinkedIn, shares his insights on building scalable systems and democratizing deployments for over 10,000 engineers. …
…
continue reading
Summary In this episode of the Data Engineering Podcast Sam Kleinman talks about the pivotal role of databases in software engineering. Sam shares his journey into the world of data and discusses the complexities of database selection, highlighting the trade-offs between different database architectures and how these choices affect system design, q…
…
continue reading

1
Bridging Code and UI in Data Orchestration with Kestra
44:30
44:30
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
44:30Summary In this episode of the Data Engineering Podcast, Anna Geller talks about the integration of code and UI-driven interfaces for data orchestration. Anna defines data orchestration as automating the coordination of workflow nodes that interact with data across various business functions, discussing how it goes beyond ETL and analytics to enabl…
…
continue reading

1
Tech Stacks and Tradeoffs: Xudo's Founder on Picking the Right Tools for BI Success
24:56
24:56
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
24:56Wouter Trappers is the founder of Xudo and shares his slightly unconventional path from philosopher to data consultant with the Bros in this latest episode of The Data Engineering Show. Wouter’s grounding in philosophy has proved to be a shaping influence on his approach to business intelligence. Much more than just a software solution, for Wouter,…
…
continue reading

1
Streaming Data Into The Lakehouse With Iceberg And Trino At Going
39:49
39:49
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
39:49In this episode, I had the pleasure of speaking with Ken Pickering, VP of Engineering at Going, about the intricacies of streaming data into a Trino and Iceberg lakehouse. Ken shared his journey from product engineering to becoming deeply involved in data-centric roles, highlighting his experiences in ecommerce and InsurTech. At Going, Ken leads th…
…
continue reading

1
How Uber Manages 1 Million Daily Tasks Using Airflow, with Shobhit Shah and Sumit Maheshwari
28:44
28:44
「あとで再生する」
「あとで再生する」
リスト
気に入り
気に入った
28:44When data orchestration reaches Uber’s scale, innovation becomes a necessity, not a luxury. In this episode, we discuss the innovations behind Uber’s unique Airflow setup. With our guests Shobhit Shah and Sumit Maheshwari, both Staff Software Engineers at Uber, we explore how their team manages one of the largest data workflow systems in the world.…
…
continue reading