
The Preset Podcast
By Preset


Designated Driver #8 • Altinity with Robert Hodges
Anyone up for a Boulevardier? Of course you are! Join special guest Robert Hodges, CEO of Altinity as we learn about databases and oh-so-much more. Grab a snack/drink/whatever, and join us.
- Try Altinity for free at https://altinity.com/
- Learn more about Apache Superset at https://superset.apache.org/
- Get Preset for free, forever, at https://preset.io

Analytics Everywhere #10 • Alexander Gallego of Redpanda
Join us for an in-depth conversation with Redpanda founder & CEO Alexander Gallego, as we dive into all sorts of topics from the project's origin story, to trends in streaming data, AI, and much more. Check out Redpanda at https://redpanda.com Try Preset for free, forever, at https://preset.io/ Learn more about how Preset can help your business: https://preset.io/contact-sales/

Analytics Everywhere #9 • Devin Stein of Dosu
Today we welcome Devin Stein, Founder & CEO of Dosu. Dosu offers GitHub repo automation tools built around custom LLM agents, trained on your repo's issues/PRs and related content. Learn about how it's been helping the Apache Superset project and can help your repo too! Try out Dosu

Designated Driver #7 • Apache Pinot / StarTree
It's happy hour again! Join Viktor Gamoc from Apache Pinot and StarTree to learn about him, the project, and their use cases with Superset. Grab a snack/drink/whatever, and join us. Try StarTree for free at https://startree.ai/ Learn more about Apache Pinot at https://pinot.apache.org/ Learn more about Apache Superset at https://superset.apache.org/ Get Preset for free, forever, at https://preset.io

Designated Driver Episode #6 • Confluent/Apache Kafka
Join us with special guest Tim Berglund from Confluent and Apache Kafka. We'll spend our happy (half) hour (depending on time zone, of course!) learning all about the project's background, its use cases, and how that all flows into Superset. Grab a beverage, and let's dive in!
Try Confluent for free at https://www.confluent.io/
Try Apache Superset/Preset for free at https://preset.io
Learn more about Apache Kafka at https://kafka.apache.org/
Learn more about Apache Superset at https://superset.apache.org/

Analytics Everywhere #8: Avi Press of Scarf
Today we welcome Avi Press, Founder & CEO at Scarf. Scarf offers telemetry and analytics to capture anonymized open-source usage data and find actionable insights to grow your open-source project or fuel your sales and marketing strategies.
Check out Scarf at https://scarf.sh/
Try Preset for free at https://preset.io/

Designated Driver Episode #5 • Shillelagh
Time to take a walk on the Irish side, as Beto tells us all about Shillelah! We explore the history, use cases, and inner workings of this Python library that allows you to query many resources (APIs, files, and more) using SQL. It's not just compatible with Apache Superset, it's built for Superset!
Want to get started with Shillelagh and Superset today? Try Preset! You can start a free-forever account or contact our sales team to access even more Superset features.

Designated Driver Episode #4 • InfluxDB
Please join us in welcoming Anais Dotis of InfluxDB/InfluxData on our 4th installment of Designated Driver - and our first episode with an AI-generated theme song! The future is now! Grab a beer and join us!

Designated Driver Episode #3 • Apache Drill
Meet Apache Drill PMC Chair Charles Givre, and learn the back story of this DB — we even get a demo this time!

Designated Driver Episode #2 • MotherDuck / DuckDB
In this episode, we hang out with MotherDuck founder and CEO Jordan Tigani over a beer (and/or a kombucha). Learn all about what makes DuckDB/MotherDuck an emerging winner in the space, and a great pairing with Apache Superset.

Designated Driver Episode #1 • CelerData / StarRocks
In our first episode of Designated Driver (having a beer... database drivers.. get it?), Beto and Evan meet up with Albert Wong from CelerData and StarRocks to have a beer and talk shop. Let's get rollin'!

Analytics Everywhere #7: Tarush Aggarwal of 5X
Today Max and Evan are joined by Tarush Aggarwal, CEO of 5X, as we discuss the intersection of data teams with the modern data stack, and how these tools went from a monolith model to a sea of disconnected tools and patterns. Where is this headed, and how can data teams be most effective in this new ecosystem?

Analytics Everywhere #6: Semantic Layer for Machine Learning with Byron Allen
Welcome to this episode of the Analytics Everywhere podcast! In episode #6, Max chats with Byron Allen, the ML Practice Lead at Contino. In this episode, we chat about a variety of topics around the challenges of operationalizing data:
- organizational difficulties around data (data mesh vs centralized data warehouse / governance)
- what a semantic layer really is
- thin vs thick semantic layers
- entity / dataset centric modelling
- the idea of a unified semantic layer for ML and BI
- batch vs real-time ML use cases
- experimentation frameworks
- and much much more!
We hope you enjoy this episode!
Links:
Byron Allen: https://www.linkedin.com/in/byronaallen/
Byron Allen's conversation about entity centric modeling: https://youtu.be/9YcLBSqZNzE?t=2977
Drew Banin's talk at apply(): https://www.tecton.ai/apply/session-video-archive/the-dbt-semantic-layer/
Preset: https://preset.io/product
Subscribe to the podcast here: https://anchor.fm/analytics-everywhere

Analytics Everywhere #5: Data Integration with Michel Tricot
Welcome to this live episode of the Analytics Everywhere podcast! In episode #4, Max chats with Michel Tricot, the co-founder and CEO of Airbyte. In this episode, we talk about the challenges of data integration, the importance of establishing protocols for data connectors, how traditional REST API's fall short when it comes to data exhaust, the original idea for Airbyte that got them into YCombinator, and much much more.
We hope you enjoy this episode!
--
Links:
Michel Tricot: https://www.linkedin.com/in/micheltricot/
Airbyte: https://airbyte.com/
Preset: https://preset.io/product
Subscribe to the podcast here: https://anchor.fm/analytics-everywhere

Analytics Everywhere #4: dbt with Drew Banin
Welcome to this live episode of the Analytics Everywhere podcast! In episode #4, Max chats with dbt co-founder Drew Banin about the the history of dbt, Airflow vs the dbt view of data engineering, the dbt semantic layer, and more. We hope you enjoy this episode!
Links:
Drew Banin: https://www.linkedin.com/in/drewbanin
Preset: https://preset.io/product/
dbt: http://dbt.com/
Subscribe to the Analytics Everywhere Podcast here: https://anchor.fm/analytics-everywhere

Analytics Everywhere #3: Building Data Products with Zach Wilson
Welcome to this live episode of the Analytics Everywhere podcast! In episode #3, Max chats with Zach Wilson from Airbnb about building data products. They talk about some of the data engineering patterns they witnessed at both small and large tech companies, data applications, and more. We end this episode with some great Q&A from the audience.
--
Watch the video recording here: https://preset.io/events/the-analytics-everywhere-podcast-live-episode-with-zach-wilson/
Zach Wilson: https://www.linkedin.com/in/eczachly/
Max Beauchemin: https://www.linkedin.com/in/maximebeauchemin/
Srini Kadamati: https://www.linkedin.com/in/srinivasakadamati/

Analytics Everywhere #2: Headless BI with Pavel Tiunov
In this episode, we sit down with Pavel Tiunov, the CTO of Cube. Cube is a headless BI platform that seeks to be the serving layer for interactive analytics applications. Cube’s story is pretty interesting, because they started out as a different company with a different name before pivoting completely. This conversation is all about headless BI, a recent trend within the modern data stack that revolves around pushing charts, dashboards, and analytics experiences out of the traditional container of internal BI.

Analytics Everywhere #1: The Data Mesh with Chris Riccomini
Welcome to the Analytics Everywhere podcast, presented by Preset (the experts of Apache Superset). This podcast is dedicated to understanding the perspectives of the builders of next generation data tools and the impact those tools seek to have on the end user analytics experience.
In episode 1, Max and Srini from Preset speak with Chris Riccomini on the data mesh.
Who:
Chris Riccomini created Apache Samza, a streaming framework on top of Apache Kafka, while at LinkedIn and then went on to help manage data engineering at WePay. Max Beauchemin is the original creator of Apache Airflow and Apache Superset and is Preset's founder and CEO. Srini Kadamati is a Senior Developer Advocate at Preset.
Episode Overview:
If you're new to the data mesh, don't worry! We start by getting a great definition of the data mesh by Chris. The episode then goes on to discuss the challenges of large scale data infrastructure that both Chris and Max have encountered at organizations like Airbnb, Lyft, Facebook, and LinkedIn. We also discuss Snowflake's data clean room, building data products more generally, data tools that Max and Chris wish existed, and even Matt Damon!
We hope you enjoy this first episode! If you have thoughts or feedback, please reach out to me at [email protected]