Beta

Explorez tous les épisodes de The Data Stack Show

Plongez dans la liste complète des épisodes de The Data Stack Show. Chaque épisode est catalogué accompagné de descriptions détaillées, ce qui facilite la recherche et l'exploration de sujets spécifiques. Suivez tous les épisodes de votre podcast préféré et ne manquez aucun contenu pertinent.

Rows per page:

1–50 of 426

DateTitreDurée
14 Aug 2023The PRQL: How Can Reverse ETL Revolutionize Marketing Data Management? Featuring Chris Sell of GrowthLoop00:03:34
In this bonus episode, Eric and Kostas preview their upcoming conversation with Chris Sell of GrowthLoop.
25 May 202288: What Is Data Observability? With Tristan Spaulding of Acceldata01:01:46

Highlights from this week’s conversation include:

  • Tristan’s background and career journey (2:43)
  • Updating old technology (11:40)
  • Defining “data observability” (18:44)
  • The primary user of a data observability tool (29:56)
  • Handling an incident (33:01)
  • Why multipliers for data observability (37:06)
  • Early symptoms of a data drift (43:12)
  • Tuning in the context of data engineering (50:11)
  • What keeps Tristan working with data (55:12)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

31 Jul 2023The PRQL: Turning Data Into an API with Matteo Pelati and Vivek Gudapuri of Dozer00:06:59
In this bonus episode, Eric and Kostas preview their upcoming conversation with Matteo Pelati and Vivek Gudapuri of Dozer.
05 Dec 2022The PRQL: Data Analytics: Same As It Ever Was00:06:44
In this bonus episode, Eric and Kostas preview their upcoming conversation with Aron Clymer of Data Clymer.
11 Dec 2024219: The First 90 Days of Data Leadership: What the LinkedIn Posts Don't Tell You with Matt Kelliher-Gibson, The Cynical Data Guy00:33:21

Highlights from this week’s conversation include:

  • Lightning Round Setup (1:15)
  • Scenarios for New Data Leaders (2:33)
  • Optimism vs. Reality (3:14)
  • Cynical Perspective on Data Roles (5:32)
  • Monitoring Systems Discussion (9:31)
  • Executive Alignment Challenges (12:54)
  • Understanding Team Dynamics (17:32)
  • Head of Data vs. Head of Product (20:13)
  • Product Development Steps (22:14)
  • Consequences of Product Decisions (24:14)
  • Challenges in Data Team Dynamics (26:03)
  • Attribution Reporting Complexity (28:24)
  • Long-Term Vision for Data Teams (29:22)
  • AI Summaries Discussion (30:19)
  • Closing Thoughts on AI Nuance (32:02)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

16 Oct 2023The PRQL: The Intersection of Physics, Data Science, and Product Development with Santona Tuli of Upsolver00:05:47
In this bonus episode, Eric and Kostas preview their upcoming conversation with Santona Tuli of Upsolver.
20 May 2022The PRQL: Does Data Exist if We Do Not Observe It?00:03:41
Eric and Kostas preview their upcoming conversation with Tristan Spaulding of Acceldata.
29 Jul 2022The PRQL: Farm to Table Abstract Mathematics00:04:01
Eric and Kostas preview their upcoming conversation with Eric Damlier of Conexus AI.
31 Jan 2024175: The Parts, Pieces, and Future of Composable Data Systems, Featuring Wes McKinney, Pedro Pedreira, Chris Riccomini, and Ryan Blue01:18:30

Highlights from this week’s conversation include:

  • Introduction of the panel (0:05)
  • Defining composable data stack (5:22)
  • Components of a composable data stack (7:49)
  • Challenges and incentives for composable components (10:37)
  • Specialization and modularity in data workloads (13:05)
  • Organic evolution of composable systems (17:50)
  • Efficiency and common layers in data management systems (22:09)
  • The IR and Data Computation (23:00)
  • Components of the Storage Layer (26:16)
  • Decoupling Language and Execution (29:42)
  • Apache Calcite and Modular Frontend (36:46)
  • Data Types and Coercion (39:27)
  • Describing Data Sets and Schema (42:00)
  • Open Standards and Frontiers (46:22)
  • Challenges of standardizing APIs (48:15)
  • Trade-offs in building composable systems (54:04)
  • Evolution of data system composability (56:32)
  • Exciting new projects in data systems (1:01:57)
  • Final thoughts and takeaways (1:17:25)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

10 Jul 2024197: Deep Dive: How to Build AI Features and Why it is So Dang Hard with Barry McCardle of Hex01:03:35

Highlights from this week’s conversation include:

  • Overview of Hex and its Purpose (0:51)
  • Discussion on AI and Data Collaboration (1:42)
  • Product Updates in Hex (2:14)
  • Challenges of Building AI Features (13:29)
  • Magic Features and AI Context (15:22)
  • Chatbots and UI (17:31)
  • Benchmarking AI Models (19:06)
  • AI as a Judge Pattern (23:32)
  • Challenges in AI Development (25:31)
  • AI in Production and Product Integration (28:43)
  • Difficulties in AI Feature Prediction (33:38)
  • Deterministic template selection and AI model uncertainty (36:21)
  • Infrastructure for AI experimentation and evaluation (40:11)
  • Consolidation and competition in the data stack industry (42:27)
  • Data gravity, integration, and market dynamics (47:12)
  • Enterprise adoption and the bundling and unbundling of platforms (51:03)
  • The open source databases and the middle ground (53:18)
  • Building successful open source businesses (57:00)
  • The fun approach to product launch video (1:01:14)
  • Final thoughts and takeaways (01:03:15)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

07 Sep 2022103: Everyone Is Invited to the Data Lakehouse with Kyle Weller of Onehouse.ai00:55:46

Highlights from this week’s conversation include:

  • Kyle’s background and career journey (2:38)
  • Unique challenges in building data engineering products (9:33)
  • The problem set Databricks resolves (13:46)
  • About Onehouse (17:15)
  • From Microsoft to Onehouse (20:59)
  • Why there’s so much distance between data powers (24:45)
  • Why the data lake is not enough (30:15)
  • Who should have a lake house (39:03)
  • Why we have all three data platforms (43:53)
  • How to step into the data lake house world (49:48)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

07 Jan 2022The PRQL: What Old Tech Concepts Were Borrowed to Build the Data Lake House?00:04:37
Eric and Kostas preview the upcoming show as they talk about data lakes and data warehouses and why these are important.
27 Apr 2023Data Council Week (Ep 6) - All About Debezium and Change Data Capture With Gunnar Morling of Decodable00:39:57

Highlights from this week’s conversation include:

  • Gunner’s background in data (0:32)
  • Setting the vision in early days of Red Hat and spearheading Debezium (6:20)
  • Replication of data in Debezium (9:47)
  • The patterns and processes of Debezium (16:21)
  • Debezium working with Kafka (19:03)
  • Building a diverse system while incorporating common interfaces (24:09)
  • The importance of documentation in open-sourced projects (27:59)
  • Debezium’s vision moving forward (31:32)
  • Why aren’t there more CDC open-sourced solutions? (34:35)
  • Connecting with Gunnar (37:27)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

24 Jan 2024174: Does Your Data Stack Need a Semantic Layer? Featuring Artyom Keydunov of Cube Dev00:58:14

Highlights from this week’s conversation include:

  • Artyom’s background in the data space (0:32)
  • The growth and changes at Cube (5:58)
  • Pain points of managing metrics definitions across different tools (9:39)
  • Trade-offs between coupled and decoupled semantic layers (12:12)
  • Making a case for implementing a semantic layer (14:17)
  • The evolution of semantic layers (23:28)
  • Challenges in designing a decoupled semantic layer (24:16)
  • Different approaches to solving the interface problem (26:58)
  • Implementing a SQL engine in Cube (35:58)
  • Overhead and debugging in semantic layers (39:08)
  • The semantic layer and its importance (46:26)
  • The need for semantics in data products (47:34)
  • What’s the future of semantic layers and user experience? (51:49)
  • Final thoughts and takeaways (57:34)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

29 Mar 2023132: Data Quality and Data Contracts with Chad Sanderson of Data Quality Camp01:06:34

Highlights from this week’s conversation include:

  • Chad’s background in data (2:10)
  • Breaking down data quality (4:02)
  • Semantic and logical layers of data (10:04)
  • What are data contracts and how do they work? (17:41)
  • Implicit contracts at companies (24:01)
  • Where do data contracts fit in data infrastructure? (28:14)
  • The value of data contracts to the producer and consumer (31:18)
  • Tools needed in effective data contracts (46:13)
  • The importance of community in data quality (50:53)
  • Getting connected to Data Quality Camp (1:00:55)
  • Final thoughts and takeaways (1:01:53)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

04 May 202285: You Can Stop Doing Data Fire Drills with Barr Moses of Monte Carlo00:51:39

Highlights from this week’s conversation include:

  • Barr’s background and career journey (2:12)
  • Trust: a technical or human problem? (9:47)
  • Behind the name “Monte Carlo” (15:41)
  • Defining data accuracy and reliability (17:36)
  • How much can be done with standardization (22:27)
  • How to avoid frustration when generating data about data (25:49)
  • Defining “resolution” (28:59)
  • Understanding the concept of SLAs (33:25)
  • Building a company for a category that doesn’t exist yet (37:40)
  • What it looks like to use Monte Carlo (44:07)
  • The best part about working with data teams (47:28)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

10 Jun 2024The PRQL: The Cynical Data Guy’s Origin Story00:03:08

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

23 Dec 2024The PRQL: Building Software with Intuition: Lessons from Multiple Successful Startups with Sokratis Vidros of Novu00:02:46

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

17 Jul 2024198: Building AI Search and Customer-Enabled Fine-Tuning with Jesse Clark of Marqo.ai00:52:11

Highlights from this week’s conversation include:

  • Jesse’s background and work in data (0:35)
  • E-commerce Application for Search (1:23)
  • Ph.D. in Physics Experience Then Working in Data (2:27)
  • Early Machine Learning Journey (4:35)
  • Machine Learning at Stitch Fix (7:28)
  • Machine Learning at Amazon (10:39)
  • Myths and Realities of AI (13:49)
  • Bolt-On AI vs. Native AI (17:26)
  • Overview of Marqo (19:46)
  • Product launch and fine-tuning models (23:02)
  • Importance of data quality (25:38)
  • The power of machine learning in search (32:02)
  • Future of domain-specific knowledge and product data (34:08)
  • Unstructured data and AI (37:19)
  • Technical aspects of Marqo's system (39:42)
  • Challenges of vector search (43:27)
  • Evolution of search technology (48:15)
  • Future of search interfaces (50:43)
  • Final thoughts and takeaways (51:53)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

08 Jul 2024The PRQL: Why is Building Great AI Features so Hard? Featuring Barry McCardel of Hex00:02:22

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

06 Mar 2023The PRQL: Time-Series Data 10100:03:37
In this bonus episode, Eric and Kostas preview their upcoming conversation with David Kohn of Timescale.
20 Dec 2023169: Data Models: From Warehouse to Business Impact with Tasso Argyros of ActionIQ01:05:54

Highlights from this week’s conversation include:

  • The Evolution of Databases and Data Systems (2:33)
  • Abstracting Data for Business Users (4:31)
  • Building a Database for Google-like Search (7:58)
  • The Big Data Explosion (11:10)
  • Selling Myspace as First Customer (13:14)
  • Starting ActionIQ (16:57)
  • The customer-centric organization (22:46)
  • Transitioning to customer data focus (23:53)
  • Understanding business users' needs (28:30)
  • Supporting Arbitrary Queries and Data Models (34:42)
  • Unique Technical Perspective of Clickstream Data (37:01)
  • The value per terabyte of data (46:45)
  • Building a product for multiple personas (50:45)
  • Composability and Benefits (58:05)
  • Evolution of Storage and Compute (1:00:09)
  • Composability and Treasure Data (1:02:10)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

07 Apr 202132: Cooking with Data Ops with Chris Bergh from DataKitchen00:58:31

On this week's episode of The Data Stack Show, Eric and Kostas talk with Chris Bergh, the CEO and head chef at Data Kitchen. DataKitchen’s mission is to provide the software, service, and knowledge that makes it possible for every data and analytics team to realize their full potential with DataOps.

Highlights from this week's episode include: 

  • Chris' background and how the lessons learned in the Peace Corps and at NASA apply to him today (2:03)
  • Why AI left Chris feeling like a jilted lover (7:49)
  • Most projects that people do in data analytics fail (10:12)
  • Three things that DataOps focuses on (16:37)
  • Comparing and contrasting DevOps and DataOps (22:30)
  • The types of data that DataKitchen handles and building a product or a service around DataOps (29:29)
  • Fixing problems at the source instead of just offering a tool to slightly improve things downstream (37:17)
  • Where we are at in the process of how companies are going to run on data (41:43)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

13 May 2022The PRQL: Can You Trust AI Enabled Analytics?00:03:17
Eric and Kostas preview their upcoming conversation with Cindi Howson of ThoughtSpot and Host of The Data Chief Podcast.
15 Oct 2021Data Debrief: Can Tools Help Solve Data Quality Organizational Challenges?00:06:55
On this Data Debrief, Eric and Kostas are joined by Brian from Rudderstack to talk about Data Quality.
22 Sep 202154: The Center of the Modern Data Stack with Neil Rahilly of Mixpanel01:08:53

Highlights from this week’s conversation include:

  • Neil’s programming hobby turned into a career and how he cold-contacted Mixpanel for a job (2:28)
  • Lessons learned from nine years at Mixpanel (5:05)
  • Defining product analytics (8:06)
  • How Mixpanel has evolved into the product it is today (10:56)
  • The importance of Mixpanel’s real-time analysis (19:52)
  • Looking at Arb, Mixpanel’s own arbitrary segmentation database (23:44)
  • The business impact that the rise of the cloud data warehouse had on Mixpanel (34:56)
  • Sub-second latencies and real-time use cases (49:05)
  • Career advice from Neil (1:02:02)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

12 Aug 2022The PRQL: What’s the Hardest Part About Data Quality?00:04:06
Eric and Kostas preview their upcoming conversation with James Campbell at Superconductive.
23 Aug 2023152: Three Steps To Enhance Product Analytics with Ken Fine of Heap01:07:19

Highlights from this week’s conversation include:

  • Ken’s background and journey to Heap (2:32)
  • Heap’s problem-solving approach (8:19)
  • Auto-capture and its significance in the marketplace (13:03)
  • Providing qualitative context: sessions and surveys (16:23)
  • Collection and storage of data (25:42) 
  • Challenges of real-time data collection (26:40)
  • The true gap in the market today (37:39)
  • Consolidation and aggregation of data solutions (41:58)
  • Simplifying the data stack (47:32)
  • A different approach in engineering and software development (51:12)
  • Skills and Stages in Company Growth (55:58)
  • Final thoughts and takeaways (1:02:52)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

11 Mar 2024The PRQL: Making the Data Stack Serverless in the Cloud with Mike Driscoll of Rill Data00:05:36

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

19 Jul 2023147: Where Data and Infrastructure Converge Featuring Lars Kamp of Resoto00:58:18

Highlights from this week’s conversation include:

  • Lars work on Resoto in helping to cut cloud costs for organizations (2:02)
  • The trend of large resources to micro resources (5:59)
  • What are some of the typical resource drains in data infrastructure (8:56)
  • Managing cost on the backend with scale and experimentation (12:51)
  • Solutions for resource management problems (17:38)
  • How Resoto is solving pain points in resource management (26:17)
  • Navigating the complexities of data infrastructure (29:01)
  • Resoto’s solution for interpreting difficult cloud data products (36:35)
  • Exploring relationships of data points and finding solutions (43:40)
  • Querying in graph database (47:46)
  • How to go from graph to SQL (49:13)
  • How can data teams plan for costs in the coming years (50:53)
  • Final thoughts and takeaways (53:49)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

05 Oct 2022107: Building Modern Data Teams with dbt Labs, REI, and Robinhood01:02:42

Highlights from this week’s conversation include:

  • Introducing our guests (3:05)
  • Defining “data team” (4:40)
  • How data teams emerge and evolve (14:11)
  • The need that forces the creation of a data team (21:12)
  • The backbone of the data team (26:23)
  • Building a career within a data team (36:39)
  • Advice for new data team managers (47:35)
  • Question and answer time (52:38)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

.

14 Oct 2022The PRQL: What Comes to Mind When You Think of ‘Headless’?00:04:40
In this bonus episode, Eric and Kostas preview their upcoming conversation with Artyom Keydunov & Pavel Tiunov of CubeJS.
02 Jan 2024The PRQL: Does Machine Learning Need Its Own Orchestrator? Featuring Sandy Ryza of Dagster00:03:48
In this bonus episode, Eric and Kostas preview their upcoming conversation with Sandy Ryza of Dagster.
20 Apr 202284: Why Are Analytics Still So Hard? With Kaycee Lai of Promethium00:56:00

Highlights from this week’s conversation include:

  • Kaycee’s background and career journey (2:34)
  • Why analytics are hard (7:28)
  • Defining “data management” (11:47)
  • Defining “data virtualization” (15:57)
  • The relationship between data virtualization and ETL (18:34)
  • Where a company should invest first (21:40)
  • Building without a Frankenstein stack (25:19)
  • How Promethium solves data stack issues (27:53)
  • Giving context to data (35:14)
  • Cataloging: background, at Promethium, future (39:29)
  • Who uses data catalogs (48:00)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

31 Aug 2022102: Building Pinot for Real-Time, Interactive User Analytics with Kishore Gopalakrishna of StarTree00:48:35

Highlights from this week’s conversation include:

  • Kishore’s background and career journey (2:30)
  • Internal analytics versus user-facing analytics (3:49)
  • New ways of thinking about analytics (8:06)
  • What makes Pinot different (13:45)
  • How Pinot transforms systems (21:53)
  • Understanding the data landscape (32:40)
  • The Pinot user experience (36:27)
  • Something exciting about StarTree (40:05)
  • When you should adopt this technology (43:15)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

31 Jul 202000: Welcome to the Data Stack Show00:01:13
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data
31 May 2023140: Stream Processing for Machine Learning with Davor Bonaci of DataStax01:01:30

Highlights from this week’s conversation include:

  • Davor’s journey from Google and what he was building there (3:32)
  • How work in stream processing changed Davor’s journey (5:10)
  • Analytical predictive models and infrastructure (9:39)
  • How Kaskada serves as a recommendation engine with data (14:05)
  • Kaskada’s user experience as an event processing platform (20:06)
  • Enhancing typical feature store architecture to achieve better results (23:34)
  • What is needed to improve stream and batch processes (27:39)
  • Using another syntax instead of SQL (36:44)
  • DataStax acquiring Kaskada and what will come from that merger (40:24)
  • Operationalizing and democratizing ML (47:54)
  • Final thoughts and takeaways (56:04) 

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

07 Nov 2022The PRQL: Who Needs a Stream Processing Engine?00:05:10
In this bonus episode, Eric and Kostas preview their upcoming conversation with Zander Matheson of bytewax.
23 Oct 2024212: Scaling Startups with Purpose: Jonathan Bragdon’s Blueprint for Capital-Efficient Growth01:00:10

Highlights from this week’s conversation include:

  • Jonathan's Background in Data and VC (1:05)
  • Working with CPG Data (3:45)
  • Details of Purchase Data (6:20)
  • Funding Fast-Growth Companies (12:21)
  • Venture Studio Model (16:34)
  • Learnings from Consulting and Incubation (19:35)
  • Founder Obsession (21:54)
  • Capital Stack Introduction (24:18)
  • Capital Efficiency in Startups (28:05)
  • Value Creation in Venture Capital (33:37)
  • Revenue-Based Financing (38:43)
  • Exit Aperture and Dilution (39:39)
  • Importance of Fit in Investment (41:51)
  • Setting Expectations Early (44:06)
  • Aligning Financial and Problem-Solving Goals (46:21)
  • Technical and Process Focus in Startups (49:21)
  • Identifying Tech and Process Debt (52:39)
  • Advice for Aspiring Founders (54:53)
  • The Craft vs. Problem Focus (57:09)
  • Final Thoughts and Takeaways (58:48)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

09 Dec 2024The PRQL: The Unspoken Truths of Data Leadership with Matt Kelliher-Gibson, The Cynical Data00:02:53

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

01 Jul 2024The PRQL: Google Cloud Deep Dive and Observability AI with David Wynn of Edge Delta00:02:58

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

30 Sep 202008: When data alone is not enough - Reinventing book shopping at Bookshop.org with Mason Stewart00:54:06

In this week’s episode of The Data Stack Show, Kostas Pardalis and Eric Dodds chat with Mason Stewart, the lead engineer at Bookshop.org. Bookshop is an online bookstore with a mission to financially support local, independent bookstores. Their hope is to help strengthen the fragile ecosystem and margins around bookselling and keep local bookstores an integral part of our culture and communities.

Among other topics, today’s conversation talked about making what some might call boring decisions with the data stack that are better described as mature decisions and the intertwining of human interaction with data for problem solving and recommendations.

  • Background on Mason and Bookshop.org (3:28)
  • Technical challenges of keeping up with a rapidly expanding business (10:00)
  • Interacting with data from fulfillment partners (14:36)
  • Data schema for books and dealing with Elasticsearch (24:46)
  • Human intervention in recognizing problems and exceptions (31:38)
  • In-depth look at Bookshop’s data stack (37:06)
  • Using curated lists from bookstores instead of algorithmic recommendations (43:50)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

18 Dec 2024220: From Box Office to Big Data: Bridging Marketing and Technology Through Data-Driven Leadership with Brian Schwartz of SIZE01:01:40

Highlights from this week’s conversation include:

  • Brian’s Background and Journey in Data and Marketing (0:56)
  • AI and Data Strategy (2:12)
  • Experience at DreamWorks Animation (3:15)
  • Marketing Timeline for Movies (5:18)
  • Data-Driven Decisions at Expedia (9:04)
  • Advising High-Growth Companies (14:59)
  • LinkedIn Connections and Networking (17:57)
  • Tension Between Marketing and Data Teams (19:59)
  • Technology Spending in Marketing (22:07)
  • Advice for Tech Leaders Facing Brand Marketers (25:50)
  • Frequency of Replatforming (30:11)
  • Understanding Data Accessibility (33:58)
  • Data as a Product (00:37:58)
  • Overhyped AI Applications (39:00)
  • Underutilized AI Opportunities (00:41:51)
  • AI's ROI Challenges (47:01)
  • Effective AI Support Systems (52:33)
  • Potential Ventures to Pursue Outside of Data (56:04)
  • Final Thoughts and Takeaways (1:00:11)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

13 Nov 2023The PRQL: Navigating the World of Data Overload with Travis Henry and Hillary Carpio of Snowflake00:04:39
In this bonus episode, Eric and Kostas preview their upcoming conversation with Travis Henry and Hillary Carpio of Snowflake.
03 Jun 2024The PRQL: From Programming Tic Tac Toe to Building an Operating System for Natural Language Programs With Binny Gill of Kognitos00:02:59

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

18 Oct 2023160: Closing the Gap Between Dev Teams and Data Teams with Santona Tuli of Upsolver01:05:42

Highlights from this week’s conversation include:

  • Santona’s journey from nuclear physics to data science (4:59)
  • The appeal of startups and wearing multiple hats (8:12)
  • The challenge of pseudoscience in the news (10:24)
  • Approaching data with creativity and rigor (13:22)
  • Challenges and differences in data workflows (14:39)
  • Schema Evolution and Quality Problems (27:01)
  • Real-time Data Monitoring and Anomaly Detection (30:34)
  • The importance of data as a business differentiator (35:48)
  • The SQL job creation process (46:25)
  • Different options for creating solver jobs (47:20)
  • Adding column-level expectations (50:17)
  • Discussing the differences of working with data as a scientist and in a startup (1:00:18)
  • Final thoughts and takeaways (1:04:01)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

14 Aug 2024202: Predicting the Impact of Competitive Entrants With Synthetic Controls with Evan Wimpey of Elder Research00:50:01

Highlights from this week’s conversation include:

  • Evan's Background and Journey in Data (0:40)
  • Discussion on Synthetic Controls (1:04)
  • Evan's Educational Journey and Marine Corps Experience (2:54)
  • Joining Elder Research (4:38)
  • Synthetic Controls Explained (6:54)
  • Measuring Impact with Synthetic Controls (9:05)
  • Building the Control Group (12:54)
  • Qualitative Context in Data Analysis (14:50)
  • Final Steps with Synthetic Controls (16:29)
  • Client Analytics Maturity (18:56)
  • Outsourcing Decisions in Analytics (21:09)
  • Cohesion Between Analytics Teams (24:18)
  • Validation of Predictive Models (26:37)
  • Confidence in Marketing Predictions (29:01)
  • Setting Expectations for Data Science (36:09)
  • Evan's Background in Data Comedy (39:44)
  • The Journey to Award-Winning Jokes (41:29)
  • Creating New Jokes (46:22)
  • Get the Joke Book and Final Thoughts in the Episode (48:46)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

27 Jan 2025The PRQL: From Data Chaos to Marketing Truth: An Engineer's Guide to Attribution with Lew Dawson of Momentum Consulting00:03:38

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

13 May 2024The PRQL: How to Get Business Teams Closer to Customer Data (The Right Way)00:03:00

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

19 Feb 2024The PRQL: Building a Future-Proof Data Stack from Day Zero? Featuring Peter Chapman00:05:47
In this bonus episode, Eric and Kostas preview their upcoming conversation with Peter Chapman, a GTM consultant.
09 Mar 202278: The Etymology of Reverse ETL & Why It’s a Key Piece Of The Modern Data Stack with Boris Jabes of Census01:05:51

Highlights from this week’s conversation include:

  • Boris’ background career journey (2:32)
  • The origins of “reverse ETL” (6:39)
  • Reverse Fivetran (16:35)
  • Product as an experience (22:41)
  • Fivetran users vs Census users (24:14)
  • How to add value to a data dump (26:56)
  • Ways companies are creating IP (33:48)
  • The cascade effect of the modern data stack (37:56)
  • Defining “data federation” (43:51)
  • Lessons from building a product (49:10)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

08 Mar 2023129: Databases, Data Warehouses, and Timeseries Data with David Kogn of Timescale01:09:22

Highlights from this week’s conversation include:

  • David’s background and journey to Timescale (2:12)
  • What are time series databases? (14:13)
  • How Timescale would have impacted David’s trajectory early in his career (17:51)
  • Innovation in postgreSQL (21:02)
  • Why does Timescale build their timeseries databases differently? (27:08)
  • The challenges of building a new database on top of an old software (32:22)
  • Writing outside of SQL and Timescale’s secret sauce (37:47)
  • The importance of the developer experience in Timescale (54:08)
  • How does someone know when they need to implement time series functionality (56:51)
  • Final thoughts and takeaways (1:04:57)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

11 Sep 2023The PRQL: Making ERP Systems More User-Friendly with Emilie Schario of Turbine00:06:42
In this bonus episode, Eric and Kostas preview their upcoming conversation with Emilie Schario of Turbine.
13 Nov 2024215: Data Sharing and the Truth About Data Clean Rooms with Patrik Devlin of Wilde AI00:52:56

Highlights from this week’s conversation include:

  • Patrik's Background and Journey to Wilde (1:12)
  • The Evolution of QR Codes (4:09)
  • Marketing Analytics and Clean Rooms (9:52)
  • Challenges in Data Sharing (13:20)
  • Technical Challenges with Clean Rooms (15:37)
  • Exploring Current Data Infrastructure (19:11)
  • Data Orchestration Tools (22:50)
  • Performance Tuning and Data Syncing (24:00)
  • Choosing Data Tools (26:08)
  • Mother Duck and Data Warehousing (00:30:31)
  • Flexible Data Architecture (32:40)
  • DuckDB Implementation (35:36)
  • Data Marketplace Concept (38:34)
  • Asset Availability in Data Queries (42:21)
  • Transition from Software Engineering to Data Stack (46:36)
  • Data Contracts and Type Safety (49:10)
  • Database Schema Perspectives (50:27)
  • Final Thoughts and Takeaways (51:35)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

22 Jul 2022The PRQL: If You Were Building a Data Team What Would Your First Hire Be?00:04:19
Eric and Kostas preview their upcoming conversation with Emilie Schario from Amplify Partners.
14 Apr 202133: ML is a Data Quality Problem with Peter Gao from Aquarium Learning00:56:35

On this week's episode of The Data Stack Show, Eric and Kostas talk with Peter Gao, co-founder, and CEO at Aquarium Learning. A former engineer at Cruise Automation, Peter and Aquarium Learning help ML teams improve their model performances by improving their data.

Highlights from this week's episode include:

  • How getting hit by a drunk driver made researching self-driving cars personal for Peter (2:12)
  • Filtering out the hype in self-driving car news to get a clear picture of its state today (6:52)
  • The data required for a self-driving vehicle (13:56)
  • Operation Vacation and how Aquarium can help provide the tools to make models better (16:53)
  • Utilizing neural networks to index data (20:41)
  • How Aquarium fits in the ML stack (30:25)
  • Interesting use cases of Aquarium (33:59)
  • Distinguishing subclasses of machine learning (40:05)
  • Human involvement in machine learning (46:13)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

27 Feb 2023The PRQL: Boundaries Between Synthetic Data and Prediction Models00:03:46
In this bonus episode, Eric and Kostas preview their upcoming conversation with Alex Watson of Gretel.ai.
01 May 2024187: Startup Lessons and Torch Passing with Kostas Pardalis00:46:06

Highlights from this week’s conversation include:

  • Kostas Passes the Baton as Co-Host of the Podcast (0:24)
  • Reflecting on the Podcast (2:56)
  • New Co-Host John Wessel and His Background in Data (4:34)
  • Kostas Journey in Data (10:55)
  • Rudderstack's Explosive Growth (21:28)
  • The Podcast's Inception and Marketing Activities (24:19)
  • Evolution of the podcast (27:22)
  • Memorable guests and experiences (28:29)
  • Connecting with industry leaders and key innovators in the space (33:05)
  • Kostas' new venture (36:26)
  • Advice for the new co-host (42:17)
  • Final Thoughts and Takeaways (44:47)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

08 Feb 2023125: Authorization Is A Data Problem with Jeff Chao of Abbey Labs00:55:29

Highlights from this week’s conversation include:

  • Jeff’s background at Netflix and Stripe leading him to Abbey Labs (2:22)
  • What Abbey is solving in the space (5:16)
  • Tackling permissions in an organization (7:30)
  • Opportunities to improve the availability of data (10:14)
  • The challenge of tackling a new problem area at a new company (14:59)
  • What is the most common challenges in the identity and security space (18:43)
  • Importance of identity and the ability to track it in data (22:46)
  • Connecting all the different platforms without frustrating the user (30:32)
  • What are the parts of access data that needing to be tracked (36:10)
  • Dealing with the varieties of data in security and managing permissions (40:26)
  • Final thoughts and takeaways (51:52)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

02 Oct 2023The PRQL: The Power of Data Orchestration: A Game-Changer for Data Infrastructure, Featuring Nick Schrock of Dagster Labs00:03:29
In this bonus episode, Eric and Kostas preview their upcoming conversation with Nick Schrock of Dagster Labs.
14 Oct 202010: The Evolution of the BI Market with Huy Nguyen of Holistics00:56:28

In this week’s episode of The Data Stack Show, Kostas Pardalis and Eric Dodds are joined by CTO and Co-Founder of Holistics, Huy Nguyen. Holistics takes an approach to business intelligence and data analytics that they call DataOps. They focus on data team productivity and company-wide access to insights. 

Important points in the conversation included:

  • Introduction to Huy and Holistics (3:12)
  • Approaching BI with more than just visualization (8:59)
  • How friction between different roles within an organization is addressed by Holistics (15:20)
  • Holistics as a complementary tool (23:25)
  • Describing their own data stack (34:47)
  • History of BI and trends for the future (39:33)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

26 Feb 2025230: The Cynical Data Guy: Data Tech Debt, Data Mesh, and Dashboard Directives00:25:20

Highlights from this week’s conversation include:

  • The Return of the Cynical Data Guy (0:14)
  • Risks of SQL Complexity (2:16)
  • Technical Debt in Data (4:34)
  • Data Mesh Critique (6:38)
  • Governance vs. Decentralization (9:55)
  • Never Let a Stakeholder Tell You They Need a Dashboard (12:05)
  • Dashboard vs. Table (13:34)
  • Organizational Dynamics in Data Requests (16:35)
  • AI and Prompt Writing (19:43)
  • Search Techniques and User Behavior (21:20)
  • Discussion on Code Optimization Tools (23:19)
  • Final Thoughts and Takeaways (24:47)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

22 Mar 2023131: How Data Teams Interact With Marketing Tools with Jason Davis of Simon Data00:47:34

Highlights from this week’s conversation include:

  • Defining CDPs (2:28)
  • The data team's role in marketing (7:41)
  • Leveraging commonalities across businesses (12:49)
  • Building a CDP with customer data (18:05)
  • Challenges in identity modeling (23:00)
  • CDP lifecycle and one-to-one data (30:06)
  • Segmentation and optimization (33:23)
  • Real-time data in the cloud (40:37)
  • The future of AI and machine learning (43:02)
  • Final thoughts and takeaways (46:42)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

26 Apr 2023Data Council Week (Ep 4) - Using Data Anonymization for Identity Protection With Will Thompson of Privacy Dynamics00:46:04

Highlights from this week’s conversation include:

  • Will’s background in data (0:28)
  • Privacy dynamics and data anonymization (4:18)
  • Addressing data privacy problems in the space (10:33)
  • Developer experience with Privacy Dynamics (13:49)
  • How does Privacy Dynamics work? (21:09)
  • Update of real-time anonymized data (26:29)
  • The problem of dates and other complexities in data (31:24)
  • Being a data engineer in a startup (34:44)
  • Moving at the speed of a startup (41:01)
  • Connecting with Will and Privacy Dynamics (43:28)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

02 Jun 202138: Graph Databases & Data Governance with David Allen of Neo4j00:50:48

Highlights from this week's episode include: 

  • David’s background in comparative databases (1:50)
  • David’s experience and lessons he learned from writing his book (3:23)
  • How writing a technical book compares to writing technical documentation (4:41)
  • The process of writing a book (6:30)
  • The best and worst part of David’s book writing experience (8:02)
  • An introduction to what Neo4j is (9:08)
  • What you need to graph (11:13)
  • Typical problems a graph database is a good solution for (13:00)
  • The difference between performance and relational databases (18:41)
  • How Neo4j addresses performance and ergonomics (23:30)
  • Neo4j and scalability (26:20)
  • How Neo4j fits in the modern data stack (31:48)
  • Neo4j use cases (35:45)
  • Practical implementation of Neo4j (40:51)
  • Neo4j’s relationship with open source (45:50)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

08 Nov 2021The PRQL: Will we ever get rid of the CSV?00:09:30
08 Jun 202290: The Modern Data Stack Has a Join Problem with Ahmed Elsamadisi of Narrator AI00:56:34

Highlights from this week’s conversation include:

  • Ahmed’s background and career journey (2:27)
  • Why the modern data stack “sucks” (4:53)
  • The limitations of progress (9:13)
  • Showing data with only 11 columns (11:55)
  • Managing one table that rules them all (19:02)
  • Viewing the world as timestamped activities (32:40)
  • When this model becomes harder to use (35:15)
  • The two parts you need in a company (44:41)
  • Those who use Narrator (48:32)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

05 Feb 2024The PRQL: The Evolution of Application Orchestration Featuring Viren Baraiya of orkes.io00:04:01
In this bonus episode, Eric and Kostas preview their upcoming conversation with Viren Baraiya of orkes.io.
24 Jul 2024199: How To Use Data Analytics and AI To Increase Profitability With Smarter Procurement, Featuring Cameron Jagoe of ProcureVue00:49:29

Highlights from this week’s conversation include:

  • Cameron's Background and Journey in Data (1:49)
  • Running a Bakery (3:03)
  • Applying Analytics to Bakery Operations (7:07)
  • Reevaluating Business Operations (18:08)
  • Optimizing for Profitability (19:09)
  • Working at Newell Rubbermaid (20:11)
  • Value Engineering Projects (22:11)
  • Starting a Center of Excellence (24:53)
  • Productizing the Approach (29:48)
  • Tech Stack for Data Analysis (31:40)
  • Data Cleaning and Classification (35:16)
  • Market Build and Pricing Accuracy (37:13)
  • The AI Tool as a Pointy Stick (38:20)
  • Sourcing and Sales as Two Sides of the Coin (41:04)
  • Challenges with Parsing Data (44:06)
  • Personal Journey and Company Success (46:44)
  • Final thoughts and takeaways (47:45)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

11 Nov 202014: Breaking Down Electronic Money Transfers and Modernizing Real Estate Transactions with Dan Jeffords of Earnnest00:48:07

This week on The Data Stack Show, Kostas and Eric chat with Daniel Jeffords, CTO and co-founder of Earnnest, a financial tool for the real estate industry. Earnnest’s digital platform allows buyers to securely and electronically deposit funds directly to an escrow holder and keeps agents, buyers, and escrow holders in the loop with automated emails and tracking information.

Highlights from this week’s episode include:

  • Earnnest’s approach to the way payments are handled in an antiquated real estate industry (2:12)
  • Clearing up the differences in the way money changes hands, ACH, wire, and checks (12:39)
  • How Earnnest works and who are the involved parties (21:06)
  • Disrupting a highly regulated industry (24:24)
  • Emphasizing security and transparency (30:09)
  • Erlang, Elixir, Dwolla and more. How Earnnest uses data (33:40)
  • Trying very hard to store very little data (42:58)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

28 Aug 2023The PRQL: Exploring the Evolution of Notebooks with Jakub Jurových of Deepnote00:04:23
In this bonus episode, Eric and Kostas preview their upcoming conversation with Jakub Jurových of Deepnote.
25 Apr 2022Data Council Week (Ep 1): Discussing Firebolt’s Engine With Benjamin HoppDiscussing Firebolt’s Engine With Benjamin Hopp00:28:23

Highlights from this week’s conversation include:

  • Ben’s career journey (2:55)
  • What makes Firebolt different (3:58)
  • Firebolt’s data product family (7:37)
  • Table engines and Firebolt (10:57)
  • Ben’s favorite part of ClickHouse (12:52)
  • The experience of building an optimizer (15:19)
  • Where Firebolt fits into architecture (17:27)
  • Working in the data space: to love and dislike (19:51)
  • Coming soon in the near future (24:35)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com

.

28 Apr 202134: The Intersection of Data Engineering and Marketing with John Marbachm of Grafana Labs00:49:07

On this week's episode of The Data Stack Show, Eric and Kostas talk with John Marbach, senior growth manager at Grafana Labs. In this conversation, John discusses marketing ops and the blending of roles of data engineering and marketing.

Highlights from this week's episode include:

  • Grafana Labs John Marbach Senior Growth Manager
  • Introduction to John Marbach and working in the blurred lines between marketing and data engineering (2:14)
  • How managing pipeline building and consuming data influences the use of downstream tools (6:28)
  • Experiments in marketing (11:28)
  • Exploring the role of marketing ops (15:35)
  • How accruing technical debt can grind things to a halt (20:35)
  • Matching the stack with the company's scale (24:48)
  • CDPs and marketing to developers (28:40)
  • Biggest challenges and barriers between data engineering and marketing (35:19)
  • Takes on reverse ETL (39:07)
  • Thoughts on cryptocurrency and the blockchain (44:08)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

20 Mar 2024182: Building a Dynamic Data Infrastructure at Enterprise Scale Featuring Kevin Liu of Stripe01:00:54

Highlights from this week’s conversation include:

  • Kevin’s background and work at Stripe (0:31)
  • Evolution of Data Infrastructure at Stripe (2:18)
  • Kevin's Interest in Data (5:29)
  • Software Engineer or Data Engineer? (8:27)
  • Speech Recognition Work at Amazon (11:06)
  • Efficiency and Cost Management (15:50)
  • Metadata and Query Analysis (18:38)
  • Surprising Discoveries in Metadata Analysis (21:43)
  • Optimizing Cost and Value (23:55)
  • Product Sizing Stripe Data (26:39)
  • Popular Tool for Data Interaction (30:08)
  • Enabling Data Infrastructure Integration (35:22)
  • Value of Data Pipelining for Stripe (39:32)
  • Next Generation Product and Technology (43:54)
  • Maximizing value in a decentralized environment (51:34)
  • Future of open source projects in data infrastructure (57:59)
  • Final thoughts and takeaways (59:02)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

23 Jun 202141: Doing MLOps on Top of Apache Pulsar and Trino with Joshua Odmark of Pandio00:50:20

Highlights from this week’s episode:

  • Joshua started his first company at age 15 and then sold two more startups after that (2:15)
  • Embracing the open source movement and not reinventing the wheel if you don't have to (12:15)
  • Pulsar seemed built to address Kafka's weaknesses (17:23)
  • Using Redis as a coordinator for federated learning and taking advantage of its portability (23:05)
  • The pillars of Pandio and some practical use cases (31:24)
  • Feature stores and model versioning (38:23)
  • Seeing Pulsar as the future because of the ability to run tens of millions of topics (41:04)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

11 Sep 2024206: Reviving Old-School Customer Experiences Through Modern Data Strategies, Featuring Edward Chenard, Seasoned Data Leader and Analytics Officer00:48:26

Highlights from this week’s conversation include:

  • Edward's Background and Journey in Data (0:44)
  • P&L Ownership Discussion (1:15)
  • Challenges in Profit Ownership (3:38)
  • Data Team Dynamics (5:52)
  • Role Clarity Between CFO and CDO (7:31)
  • Nuances of Data Leadership (11:24)
  • Focus on Relevance in Data Work (14:05)
  • Best Buy's Personalization Project (18:39)
  • Building a Data Stack (21:00)
  • Crowd-Driven Algorithms (25:26)
  • Event-Based Personalization (28:12)
  • Corporate Politics and Implementation (31:00)
  • In-Store Experience Innovations (33:16)
  • Impact of Data Science at Best Buy (37:14)
  • The Importance of Data Teams in AI Implementation (39:19)
  • Using AI Conversationally (42:09)
  • Book Recommendations for Data Leaders (44:24)
  • Final Thoughts and Takeaways (47:05)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

24 Dec 2024221: The Art and Science of Technical Leadership in Early-Stage Startups: Building World-Class Engineering Teams from Scratch with Sokratis Vidros of Novu00:46:13

Highlights from this week’s conversation include:

  • Sokratis’ Background and Journey in Data (1:19)
  • Engineers Wearing Multiple Hats (2:17)
  • The Era of Early Startups (3:32)
  • Lessons from Building Software (7:15)
  • Importance of Team Dynamics (9:12)
  • Balancing Creativity and Stability (15:00)
  • Version Control in Data Analysis (18:57)
  • Opinionated Modern Data Software (21:14)
  • Creating Dashboards for Company Stats (22:41)
  • Hiring for Intuition in Tech (27:38)
  • Interview Process Insights (30:15)
  • Protecting Intuitive Thinkers in Companies (35:08)
  • The Challenge of Trust (39:21)
  • Loss of Control in Delegation (40:14)
  • Founder Work-Life Balance (42:15)
  • Advice for Early-Stage Engineers (44:03)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

25 Sep 2024208: The Intersection of AI Safety and Innovation: Insights from Soheil Koushan on LLMs, Vision, and Responsible AI Development00:44:04

Highlights from this week’s conversation include:

  • Soheil’s Background and Journey in AI (0:40)
  • Anthropic's Philosophy on Safety (1:21)
  • Key Moments in AI Discovery (2:52)
  • Computer Vision Applications (4:42)
  • Magic vs. Reality in AI (7:35)
  • Product Development at Anthropic (12:57)
  • Tension Between Research and Product (14:36)
  • Safety as a Capability (17:33)
  • Community Notes and Democracy in AI (20:41)
  • Expert Panels for Safety (21:38)
  • Post-Training Data Quality (23:32)
  • User Data and Privacy (25:32)
  • Test Time Compute Paradigm (30:54)
  • The Future of AI Interfaces (36:04)
  • Advancements in Computer Vision (38:46)
  • The Role of AGI in AI Development (41:52)
  • Final Thoughts and Takeaways (43:07)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

06 Oct 202156: Stream Processing and Observability with Jeff Chao of Stripe01:03:55

Highlights from this week’s conversation include:

  • Jeff’s history with stream processing (2:52)
  • Working with Mantis to address the impact of Netflix downtime (4:20)
  • Defining observability as operational insight (6:58)
  • Time series data and the value of data today (18:52)
  • Data integration’s shift from batch to streaming (29:34)
  • The current state of change data capture (32:20)
  • How an engineer thinks of the end-user (56:21)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

07 Dec 2022116: Data Democratization & Self Service with Aron Clymer of Data Clymer00:54:23

Highlights from this week’s conversation include:

  • Aron’s background in the world of data (2:18)
  • Recent Clients and major projects (3:30)
  • Helping to spearhead data-driven growth at Salesforce (6:50)
  • Stories about Marc Benioff, co-founder of Salesforce (16:12)
  • Biggest learnings as a consultant in the data strategy space (17:58)
  • The need for data democratization (23:33)
  • Advice for Aron’s younger self in consulting (28:45)
  • Current trends in data democratization and sales service (35:01)
  • Aron’s favorite tools and platforms to use (42:19)
  • Favorite part of the consulting process (47:45)
  • Final thoughts and takeaways (50:29)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

22 Jun 202292: Building a Decentralized Storage System for Media File Collaboration with Tejas Chopra00:55:36

Highlights from this week’s conversation include:

  • Tejas’ background and career journey (2:49, 43:04)
  • Digital collaboration with Netflix Drive (7:57)
  • A formal version control component (23:44)
  • Centralized store vs. local affairs (31:05)
  • The different skill sets a data engineer needs (37:38)
  • How to get into data engineering (40:57)
  • New technologies coming into day-to-day work (44:39)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

02 Oct 2024209: Storytime with Cynical Data Guy: Data Projects, $50K Web Scraping Fails, and the Role of CDOs00:30:57

Highlights from this week’s conversation include:

  • Previewing the Next Cynical Data Guy Episode (0:13)
  • Story Time: Coolest Data Project You’ve Worked On (1:13)
  • Failed Web Scraping Project (3:40)
  • Building a Neural Net for Matching (5:22)
  • Rebuilding the Project Strategy (7:04)
  • Project Completion and Politics (9:35)
  • Agreeable Data Guy's Pricing Story (11:00)
  • Balancing Advanced and Simple Solutions (14:15)
  • Insights from Pricing Team Meetings (16:19)
  • Building for Scale vs. Immediate Needs (18:29)
  • Open Source Data Formats (19:46)
  • Disaster Recovery Experiences (22:34)
  • Reflections on Chief Data Officers (25:01)
  • Cynicism in Data Projects (28:19)
  • Final Thoughts and Takeaways (30:20)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

21 Jul 202145: Open Source and Attribution with Ophir Prusak of Codesmith00:55:59

Highlights from today's conversation include:

  • Ophir's decision to switch from software engineering to marketing and riding the startup train (2:39)
  • Open sourcing in the world of software (5:55)
  • How open source has changed Ophir's life as a marketeer working at startups (10:28)
  • Chartio's sunsetting drove Ophir to search for a data tooling replacement (27:27)
  • Discussing trends in adoption of tools for small scale and large scale companies (35:01)
  • Data challenges related to attribution--how wrong do you want to be?  (44:07)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

04 Oct 2023158: The Orchestration Layer as the Data Platform Control Plane With Nick Schrock of Dagster Labs01:02:18

Highlights from this week’s conversation include:

  • Nick’s background and journey in data (2:28)
  • Founding Dagster Labs (7:50)
  • The evolution of data engineering (12:32)
  • Fragmentation in data infrastructure (15:04)
  • The role of orchestration in data platforms (19:53)
  • The importance of operational tools for data pipelines (25:01)
  • Lessons learned from working with GraphQL (26:19)
  • The role of the orchestrator in data engineering (34:51)
  • The boundaries between data infrastructure and product engineering (37:33)
  • Different orchestrators in the data infrastructure landscape(42:03)
  • The role of MLOps in data engineering (46:04)
  • Data Quality and Orchestration (51:04)
  • Future of Data Teams and Orchestration (54:27)
  • Final thoughts and takeaways from (58:01)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

26 Jun 2024195: Supply Chain Data Stacks and Snowflake Optimization Pro Tips with Jeff Skoldberg of Green Mountain Data Solutions00:48:51

Highlights from this week’s conversation include:

  • Jeff's Background and Transition to Independent Consulting (0:03)
  • Working at Keurig and Business Model Changes (2:16)
  • Tech Stack Evolution and SAP HANA Implementation (7:33)
  • Adoption of Tableau and Data Pipelines (11:21)
  • Supply Chain Analytics and Timeless Data Modeling (15:49)
  • Impact of Cloud Computing on Cost Optimization (18:35)
  • Challenges of Managing Variable Costs (20:59)
  • Democratization of Data and Cost Impact (23:52)
  • Quality of Fivetran Connectors (27:29)
  • Data Ingestion and Cost Awareness (29:44)
  • Virtual Warehouse Cost Management (31:22)
  • Auto-Scaling and Performance Optimization (33:09)
  • Cost-Saving Frameworks for Business Problems (38:19)
  • Dashboard Frameworks (40:53)
  • Increasing Dashboards (43:29)
  • Final thoughts and takeaways (46:28)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

28 Nov 2022The PRQL: Managing Complexities of Financial Data00:06:06
In this bonus episode, Eric and Kostas preview their upcoming conversation with Ashwin Kamath of Spectre.
10 Feb 202124: Demystifying AI with Duc Haba00:51:00

On this week’s episode of The Data Stack Show, Eric is joined by Duc Haba, an AI researcher and enterprise mobility solution architect consultant who most recently did AI consulting work with Cognizant. Their discussion revolves around demystifying artificial intelligence and why so many people either fear AI or place too much trust in it. Duc talks about some of the AI projects he has worked on, some successes and some failures, and points to how the data biases that humans bring into the models can radically alter the outcome of those endeavors.

Highlights from this week’s episode include:

  • Duc's background with AI and getting to work with LeVar Burton (1:44)
  • Demystifying AI and coming up with a definition for it (3:34)
  • Misplaced fears of AI (7:53)
  • Misplaced trust in AI (10:36)
  • Public versus hidden AI (13:58)
  • Acquiring the data needed for to train AI models (23:11)
  • Examples of interesting AI projects Duc has worked on (27:58)
  • Where to go to learn more about AI (35:06)
  • Thinking of AI as something that can help your business do something better with what it's already been doing (39:53)
  • Anticipating the near-future of AI (44:16)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

26 Apr 2023Data Council Week (Ep 5) - The Difference Between Data Platforms and ML Platforms with Michael Del Balso of Tecton00:43:00

Highlights from this week’s conversation include:

  • Michael’s journey to co-founding Tecton (0:22)
  • The evolution of MLops and platform teams (3:50)
  • Understanding boundaries between the data platform and the MLops (8:42)
  • Differences in machine learning vs data pipelines (16:58)
  • The systems needed to handle all these types of data (22:22)
  • Developer experience in Tecton (25:15)
  • Automating challenges in ML development (32:30)
  • The most difficult part of the life cycle of prediction (37:24)
  • Exciting new developments at Tecton (39:27)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

03 Oct 2022Shop Talk With Eric and Kostas: Transitioning From Consumer to Builder00:27:47
In this bonus episode, Eric and Kostas talk shop regarding transitioning from consumer to builder.
24 Apr 2024186: Data Fusion and The Future Of Specialized Databases with Andrew Lamb of InfluxData00:58:26

Highlights from this week’s conversation include:

  • The Evolution of Data Systems (0:47)
  • The Role of Open Source Software (2:39)
  • Challenges of Time Series Data (6:38)
  • Architecting InfluxDB (9:34)
  • High Cardinality Concepts (11:36)
  • Trade-Offs in Time Series Databases (15:35)
  • High Cardinality Data (18:24)
  • Evolution to InfluxDB 3.0 (21:06)
  • Modern Data Stack (23:04)
  • Evolution of Database Systems (29:48)
  • InfluxDB Re-Architecture (33:14)
  • Building an Analytic System with Data Fusion (37:33)
  • Challenges of Mapping Time Series Data into Relational Model (44:55)
  • Adoption and Future of Data Fusion (46:51)
  • Externalized Joins and Technical Challenges (51:11)
  • Exciting Opportunities in Data Tooling (55:20)
  • Emergence of New Architectures (56:35)
  • Final thoughts and takeaways (57:47)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

01 Jun 202289: Solving Microservice Orchestration Issues at Netflix with Viren Baraiya of Orkes00:51:40

Highlights from this week’s conversation include:

  • Viren’s background and career journey (2:23)
  • Engineering challenges in Netflix transitions (6:05)
  • How Conductor changed the process (9:30)
  • Building a lot more microservices (16:04)
  • Open sourcing Conductor (17:38)
  • Defining “orchestration” (22:05)
  • Using an orchestrator written in Java (31:04)
  • Building a cloud service around microservices (34:59)
  • Differentiating product experiences (37:17)
  • Orchestration platforms in new environments (42:15)
  • Advice for those early on in their career (46:10)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

08 Jul 2022The PRQL: Data Marts Aren’t Just for the Enterprise00:03:55
Eric and Kostas preview their upcoming conversation with Nick Hansel from Transform.
06 Nov 2023The PRQL: The Shortcomings of Apache Kafka with David Yaffe and Johnny Graettinger of Estuary00:03:51
In this bonus episode, Eric and Kostas preview their upcoming conversation with David Yaffe and Johnny Graettinger of Estuary.
30 Oct 2023The PRQL: How LLMs are Transforming Enterprise Workflows with Mark Huang of Gradient00:03:36
In this bonus episode, Eric and Kostas preview their upcoming conversation with Mark Huang of Gradient.
21 Feb 2024178: How to Build a Data Stack to Win PLG, Featuring Peter Chapman00:57:17

Highlights from this week’s conversation include:

  • Peter's background and journey in data (0:26)
  • Introduction to PLG (4:18)
  • Starting in data at Heroku (6:05)
  • Building the data stack at Heroku (8:13)
  • Data stack requirements for early-stage companies (12:00)
  • Differentiating PLG companies from open source companies (19:26)
  • Venture capital and open source as a lever for growth (22:56)
  • Initial data modeling and analysis (25:38)
  • Operationalizing Data (29:16)
  • Sales and Marketing Operationalization (31:52)
  • Identifying Signals (34:16)
  • Challenges in Developing Signals (37:07)
  • Account Management for Developer Tools (42:30)
  • Challenges in Achieving Margins (45:02)
  • Leveraging Infrastructure for Margins (47:35)
  • Inference vs Training (54:55)
  • Final thoughts and takeaways (57:02)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

09 Sep 2022The PRQL: What Does 10 Years in the Data Space Give You?00:04:05
In this bonus episode, Eric and Kostas preview their upcoming conversation with Benn Stancil of Mode.
27 Apr 2022Data Council Week (Ep 3): Product Analytics the Right Way With James Greenhill of PostHog00:29:27

Highlights from this week’s conversation include:

  • How James got started in data (2:42)
  • What makes PostHog different (10:43)
  • Why we need product analytics (13:40)
  • Capturing and collecting data (15:17)
  • Dealing with drift on a platform like PostHog (19:45)
  • Starting from the metrics versus events (22:50)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

30 Nov 2022115: What Is Production Grade Data? Featuring Ashwin Kamath of Spectre00:54:55

Highlights from this week’s conversation include:

  • Ashwin’s background in the data space (2:43)
  • The unique nature of working with data in finance (7:32)
  • Technological challenges of working in the finance data space (13:55)
  • The third-party data factor and judging if it is reliable enough (17:07)
  • What made Ashwin decide to go out and build his own company? (31:47)
  • Defining data decay and data storing and why both are important (37:52)
  • Advice on the importance of data quality (42:10)
  • Final takeaways and wrap-up (50:49)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

02 Aug 2023149: Turning Tables Into APIs for Real-time Data Apps, Featuring Matteo Pelati and Vivek Gudapuri of Dozer01:03:46

Highlights from this week’s conversation include:

  • Building Dozer: Simplifying Data Sources into APIs (1:13)
  • Bridging Data Engineering with Application Engineering (4:19)
  • Turning Data Sources into APIs (7:46)
  • The cost of caching (12:59)
  • Challenges with legacy systems (14:30)
  • Real-time data integration (19:31)
  • YAML and SQL experience (25:37)
  • Behind the scenes of Dozer (29:18)
  • Heavy Workloads and Low Latency (42:00)
  • Use Cases of Dozer (45:51)
  • Reliability and storing data from different connectors (51:35)
  • Importance of observability in serving data to customers (53:24)
  • Final thoughts and takeaways (56:34)

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

.

30 Jun 202142: Scaling Data Science with Ryan Boyer of Shipt00:52:53

Highlights from this week’s episode include:

  • Ryan's full circle path from stocking shelves at Target to using data science for a company owned by Target (2:00)
  • Building great tools and wielding them effectively (5:04)
  • Changes at Shipt since being acquired (9:29)
  • How people’s bias impacts models built by data scientists (12:30)
  • The different data sources Shipt incorporates (22:02)
  • How Ryan's work as a data scientist has changed as Shipt has grown (25:29)
  • How data science helps marketing (31:38)
  • Improving search experience (34:23)
  • Shipt's evolving data stack (38:27)
  • New trends in data science (47:06)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Améliorez votre compréhension de The Data Stack Show avec My Podcast Data

Chez My Podcast Data, nous nous efforçons de fournir des analyses approfondies et basées sur des données tangibles. Que vous soyez auditeur passionné, créateur de podcast ou un annonceur, les statistiques et analyses détaillées que nous proposons peuvent vous aider à mieux comprendre les performances et les tendances de The Data Stack Show. De la fréquence des épisodes aux liens partagés en passant par la santé des flux RSS, notre objectif est de vous fournir les connaissances dont vous avez besoin pour vous tenir à jour. Explorez plus d'émissions et découvrez les données qui font avancer l'industrie du podcast.
© My Podcast Data