The Preset Podcast

By Preset

Welcome to the Preset Podcast, the home of "Analytics Everywhere" and "Designated Driver". Analytics Everywhere discusses wide-ranging topics in business intelligence and data engineering, and Designated Driver is a great way to get to know the database platforms of the world over a beer. These podcasts are dedicated to explore next-generation data tools and the impact they have on data teams.

Listen on Spotify

Available on

Report content on Spotify

Designated Driver #8 • Altinity with Robert Hodges

The Preset PodcastNov 11, 2024

00:00

41:17

Designated Driver #8 • Altinity with Robert Hodges

Anyone up for a Boulevardier? Of course you are! Join special guest Robert Hodges, CEO of Altinity as we learn about databases and oh-so-much more. Grab a snack/drink/whatever, and join us.

Try Altinity for free at https://altinity.com/
Learn more about Apache Superset at https://superset.apache.org/

Get Preset for free, forever, at https://preset.io

Nov 11, 202441:17

Analytics Everywhere #10 • Alexander Gallego of Redpanda

Join us for an in-depth conversation with Redpanda founder & CEO Alexander Gallego, as we dive into all sorts of topics from the project's origin story, to trends in streaming data, AI, and much more. Check out Redpanda at ⁠https://redpanda.com Try Preset for free, forever, at ⁠https://preset.io/ Learn more about how Preset can help your business: https://preset.io/contact-sales/

Sep 30, 202456:56

Analytics Everywhere #9 • Devin Stein of Dosu

Today we welcome Devin Stein, Founder & CEO of Dosu. Dosu offers GitHub repo automation tools built around custom LLM agents, trained on your repo's issues/PRs and related content. Learn about how it's been helping the Apache Superset project and can help your repo too! Try out Dosu⁠

Try Preset for free

Learn more about how Preset can help your business

Aug 20, 202443:43

Designated Driver #7 • Apache Pinot / StarTree

It's happy hour again! Join Viktor Gamoc from Apache Pinot and StarTree to learn about him, the project, and their use cases with Superset. Grab a snack/drink/whatever, and join us. Try StarTree for free at https://startree.ai/ Learn more about Apache Pinot at https://pinot.apache.org/ Learn more about Apache Superset at https://superset.apache.org/ Get Preset for free, forever, at https://preset.io

Aug 08, 202446:42

Designated Driver Episode #6 • Confluent/Apache Kafka

Join us with special guest Tim Berglund from Confluent and Apache Kafka. We'll spend our happy (half) hour (depending on time zone, of course!) learning all about the project's background, its use cases, and how that all flows into Superset. Grab a beverage, and let's dive in!

Try Confluent for free at https://www.confluent.io/

Try Apache Superset/Preset for free at https://preset.io

Learn more about Apache Kafka at https://kafka.apache.org/

Learn more about Apache Superset at https://superset.apache.org/

Jul 12, 202425:43

Analytics Everywhere #8: Avi Press of Scarf

Today we welcome Avi Press, Founder & CEO at Scarf. Scarf offers telemetry and analytics to capture anonymized open-source usage data and find actionable insights to grow your open-source project or fuel your sales and marketing strategies.

Check out Scarf at https://scarf.sh/

Try Preset for free at https://preset.io/

Jul 09, 202447:44

Designated Driver Episode #5 • Shillelagh

Time to take a walk on the Irish side, as Beto tells us all about Shillelah⁠! We explore the history, use cases, and inner workings of this Python library that allows you to query many resources (APIs, files, and more) using SQL. It's not just compatible with Apache Superset, it's built for Superset!

Want to get started with Shillelagh and Superset today? Try Preset! You can start a free-forever account or contact our sales team to access even more Superset features.

Jun 28, 202433:59

Designated Driver Episode #4 • InfluxDB

Please join us in welcoming Anais Dotis of InfluxDB/InfluxData on our 4th installment of Designated Driver - and our first episode with an AI-generated theme song! The future is now! Grab a beer and join us!

Jun 04, 202424:41

Designated Driver Episode #3 • Apache Drill

Meet Apache Drill PMC Chair Charles Givre, and learn the back story of this DB — we even get a demo this time!

May 28, 202431:57

Designated Driver Episode #2 • MotherDuck / DuckDB

In this episode, we hang out with MotherDuck founder and CEO Jordan Tigani over a beer (and/or a kombucha). Learn all about what makes DuckDB/MotherDuck an emerging winner in the space, and a great pairing with Apache Superset.

Apr 18, 202425:35

Designated Driver Episode #1 • CelerData / StarRocks

In our first episode of Designated Driver (having a beer... database drivers.. get it?), Beto and Evan meet up with Albert Wong from CelerData and StarRocks to have a beer and talk shop. Let's get rollin'!

Jan 23, 202439:46

Analytics Everywhere #7: Tarush Aggarwal of 5X

Today Max and Evan are joined by Tarush Aggarwal, CEO of 5X, as we discuss the intersection of data teams with the modern data stack, and how these tools went from a monolith model to a sea of disconnected tools and patterns. Where is this headed, and how can data teams be most effective in this new ecosystem?

Dec 08, 202301:10:22

Analytics Everywhere #6: Semantic Layer for Machine Learning with Byron Allen

Welcome to this episode of the Analytics Everywhere podcast! In episode #6, Max chats with Byron Allen, the ML Practice Lead at Contino. In this episode, we chat about a variety of topics around the challenges of operationalizing data:

- organizational difficulties around data (data mesh vs centralized data warehouse / governance)

- what a semantic layer really is

- thin vs thick semantic layers

- entity / dataset centric modelling

- the idea of a unified semantic layer for ML and BI

- batch vs real-time ML use cases

- experimentation frameworks

- and much much more!

We hope you enjoy this episode!

Links:

Byron Allen: https://www.linkedin.com/in/byronaallen/

Byron Allen's conversation about entity centric modeling: https://youtu.be/9YcLBSqZNzE?t=2977

Drew Banin's talk at apply(): https://www.tecton.ai/apply/session-video-archive/the-dbt-semantic-layer/

Preset: https://preset.io/product

Subscribe to the podcast here: https://anchor.fm/analytics-everywhere

Sep 23, 202201:07:06

Analytics Everywhere #5: Data Integration with Michel Tricot

Welcome to this live episode of the Analytics Everywhere podcast! In episode #4, Max chats with Michel Tricot, the co-founder and CEO of Airbyte. In this episode, we talk about the challenges of data integration, the importance of establishing protocols for data connectors, how traditional REST API's fall short when it comes to data exhaust, the original idea for Airbyte that got them into YCombinator, and much much more.

We hope you enjoy this episode!

Links:

Michel Tricot: https://www.linkedin.com/in/micheltricot/

Airbyte: https://airbyte.com/

Preset: https://preset.io/product

Subscribe to the podcast here: https://anchor.fm/analytics-everywhere

Sep 08, 202201:27:35

Analytics Everywhere #4: dbt with Drew Banin

Welcome to this live episode of the Analytics Everywhere podcast! In episode #4, Max chats with dbt co-founder Drew Banin about the the history of dbt, Airflow vs the dbt view of data engineering, the dbt semantic layer, and more. We hope you enjoy this episode!

Links:

Drew Banin: https://www.linkedin.com/in/drewbanin

Preset: https://preset.io/product/

dbt: http://dbt.com/

Subscribe to the Analytics Everywhere Podcast here: https://anchor.fm/analytics-everywhere

Jun 23, 202201:12:49

Analytics Everywhere #3: Building Data Products with Zach Wilson

Welcome to this live episode of the Analytics Everywhere podcast! In episode #3, Max chats with Zach Wilson from Airbnb about building data products. They talk about some of the data engineering patterns they witnessed at both small and large tech companies, data applications, and more. We end this episode with some great Q&A from the audience.

Watch the video recording here: https://preset.io/events/the-analytics-everywhere-podcast-live-episode-with-zach-wilson/

Zach Wilson: https://www.linkedin.com/in/eczachly/

Max Beauchemin: https://www.linkedin.com/in/maximebeauchemin/

Srini Kadamati: https://www.linkedin.com/in/srinivasakadamati/

Apr 20, 202201:28:12

Analytics Everywhere #2: Headless BI with Pavel Tiunov

In this episode, we sit down with Pavel Tiunov, the CTO of Cube. Cube is a headless BI platform that seeks to be the serving layer for interactive analytics applications. Cube’s story is pretty interesting, because they started out as a different company with a different name before pivoting completely. This conversation is all about headless BI, a recent trend within the modern data stack that revolves around pushing charts, dashboards, and analytics experiences out of the traditional container of internal BI.

Apr 08, 202255:33

Analytics Everywhere #1: The Data Mesh with Chris Riccomini

Welcome to the Analytics Everywhere podcast, presented by Preset (the experts of Apache Superset). This podcast is dedicated to understanding the perspectives of the builders of next generation data tools and the impact those tools seek to have on the end user analytics experience.

In episode 1, Max and Srini from Preset speak with Chris Riccomini on the data mesh.

Who:

Chris Riccomini created Apache Samza, a streaming framework on top of Apache Kafka, while at LinkedIn and then went on to help manage data engineering at WePay. Max Beauchemin is the original creator of Apache Airflow and Apache Superset and is Preset's founder and CEO. Srini Kadamati is a Senior Developer Advocate at Preset.

Episode Overview:

If you're new to the data mesh, don't worry! We start by getting a great definition of the data mesh by Chris. The episode then goes on to discuss the challenges of large scale data infrastructure that both Chris and Max have encountered at organizations like Airbnb, Lyft, Facebook, and LinkedIn. We also discuss Snowflake's data clean room, building data products more generally, data tools that Max and Chris wish existed, and even Matt Damon!

We hope you enjoy this first episode! If you have thoughts or feedback, please reach out to me at [email protected]

Mar 02, 202201:02:55