The Data Wranglers

Follow The Data Wranglers
Share on
Copy link to clipboard

Join co-hosts Joe Hellerstein, Jeff Heer and their expert guest wranglers on Thursdays for witty dissections on data engineering and all things modern data management. The Data Wranglers brings you insights, deep dives and short takes on anything from scrubbing to data analysis. If you’re a pro in the data biz whether you’re a visualizer, engineer or a newbie learning about it as a student, The Data Wranglers will bring you information and insights that’ll make you do better with your data and sound smarter at parties. Bring your fire for data and burning questions to this unscripted, unplugged, fun and spirited conversation. A Trifacta Podcast. #TheDataWranglers Visit www.Trifacta.com/podcast for more info.

Trifacta


    • Mar 3, 2022 LATEST EPISODE
    • weekly NEW EPISODES
    • 36m AVG DURATION
    • 16 EPISODES


    Latest episodes from The Data Wranglers

    Season Finale – Data Minds Tell How They Earned It

    Play Episode Listen Later Mar 3, 2022 55:34


    In the season finale, Data Wranglers Joe Hellerstein and Jeffrey Heer tell all about what it takes to wrangle a venture into success. Trifacta CEO Adam Wilson talks with them about how they did it, how to turn a vision into reality and how to win data by design. For more data insights, listen to #TheDataWranglers.

    Machine Learning with Carlos Guestrin

    Play Episode Listen Later Feb 24, 2022 40:33


    What keeps an Apple developer up at night? Meet Carlos Guestrin -- decorated professor of Computer Science at Stanford and former Senior Director of AI and Machine Learning at Apple. Carlos talks with Data Wranglers Joe Hellerstein and Jeffrey Heer about the birth of the AI-driven Face ID and other machine-learning features he and his team created at Apple. He also talks about distribution shifts and what kept him up at night: the “unknown unknowns” about data, privacy and user experiences. #TheDataWranglers

    Cloud Databases with Sailesh Krishnamurthy

    Play Episode Listen Later Feb 10, 2022 34:35


    What are the fastest-growing cloud databases, and why? Sailesh Krishnamurthy talks with Data Wranglers Joe Hellerstein and Jeffrey Heer about the massive cloud databases he has managed, including his extensive work at Google where he engineers three huge relational cloud databases that support Google's services such as YouTube, photos, transactions and ads. And, hear insights from his career at Amazon Web Services, IBM, Truviso (which he co-founded), and Cisco. Plus, Sailesh probes Big Data and why SQL databases are back — and are here to stay. #TheDataWranglers

    Better Data Science in Marketing with Zack Pike

    Play Episode Listen Later Feb 3, 2022 41:36


    Would you save $50 billion -- if you had it? Data can help to reduce marketing waste. Zack Pike, founder of Magnetic Data Science, and veteran of the marketing analytics industry, talks to Data Wranglers Joe Hellerstein and Jeffrey Heer about using data science to help marketers reduce waste and uncertainty in their campaigns. #TheDataWranglers

    The Modern Data Stack with Sarah Catanzaro

    Play Episode Listen Later Jan 27, 2022 41:02


    How does data relate to pirates? Sarah Catanzaro, a partner at Amplify Partners, talks to Data Wranglers Joe Hellerstein and Jeffrey Heer about what's new and exciting in the modern data stack, machine learning and observable data. Her work has focused on startups that bring technological advances in machine intelligence and enterprise infrastructure to solve real-world problems. Having previously worked with the U.S. Secret Service, the Center for Advanced Defense Studies, and Palantir, Sarah learned how data can be used to understand and disrupt Somali pirate networks and other insurgent groups: like startups. As data practitioner-turned-venture capitalist, Sarah also offers insights about her career as a woman investor. #TheDataWranglers

    Data Science with DJ Patil

    Play Episode Listen Later Jan 20, 2022 53:53


    How do you unleash the power of data to benefit all Americans? DJ Patil became the first U.S. Chief Data Scientist under President Obama, and discusses with The Data Wranglers Joe Hellerstein and Jeffrey Heer how he helped to define what data science means and does for government, public policy and all Americans. Perhaps one of the most influential data scientists in the world, DJ discusses how data scientists played an integral role in the startup of LinkedIn and in the early discovery of the Covid pandemic. DJ also discusses the role of data science in public health and the criminal justice systems. Trifacta CEO Adam Wilson drops in to chat with DJ and announce an industry-changing acquisition by Alteryx. #TheDataWranglers

    Tidy Data with Hadley Wickham

    Play Episode Listen Later Jan 13, 2022 39:47


    What is the Tidyverse and why is it important? Hadley Wickham is a leading data scientist and advocate for improving data science with tidy data and data hygiene. He's the Chief Scientist at RStudio and an Adjunct Professor of Statistics at the University of Oakland, Stanford University and Rice University. Join the Tidyverse discussion with The Data Wranglers Joe Hellerstein and Jeffrey Heer. #TheDataWranglers

    Data Trends and Fragmentation with Bill Hostmann

    Play Episode Listen Later Jan 6, 2022 33:13


    How will the pendulum swing in data engineering? Join Bill Hostmann, senior research fellow at Dresner Advisory Services, and Data Wranglers Joe Hellerstein and Jeffrey Heer as they talk about fragmentation in the data industry and how to improve the performance of distributed database design. #TheDataWranglers

    2021 The Year in Data

    Play Episode Listen Later Dec 23, 2021 23:45


    What's above the cloud? Learn what's hot and what's not, as The Data Wranglers Joe Hellerstein and Jeffrey Heer look back at the year in data. Facebook went down, SQL is back, and Hadoop is dead. What's up with Spark, and are streaming databases finally real? The cloud continues to rise, there are issues in data ethics, and data is helping to combat Covid-19. And then there's the kerfuffle between Snowflake and Databricks. #TheDataWranglers

    Data with a Purpose with Moritz Stefaner

    Play Episode Listen Later Dec 16, 2021 37:16


    Meet Moritz Stefaner, a data designer who uses data for storytelling and who helped design the official German Covid-19 vaccine data dashboard. Moritz tells The Data Wranglers -- Jeffrey Heer and Adam Wilson -- how he creates a character from a dataset to give it emotional meaning and talks about the Covid vaccine clock he created. And, he dives into his data visualizations for train traffic on a German railroad network, the promises and pitfalls of using machine learning for data design, and what it took to visualize 175 years of text from Scientific American. Moritz hosts the popular podcast, Data Stories. #TheDataWranglers

    D3 and Data Visualization Insights with Mike Bostock

    Play Episode Listen Later Dec 9, 2021 34:34


    What's the secret for D3's long-time success? Mike Bostock, the creator of D3 shares the reasons for his data visualization tool's longevity, and why it won the 10-year Test-of-Time award from the IEEE. Mike goes deep on D3 and Observable, which he also founded, and talks about all things visualization with The Data Wranglers Joe Hellerstein and Jeffrey Heer, including when it's OK to use a bar-chart for getting quick data insights and the applications of time zone wrangling. #TheDataWranglers

    Best Use Cases of Apache Kafka with Jun Rao

    Play Episode Listen Later Dec 2, 2021 37:12


    Can Kafka be my database? Jun Rao, a co-founder at Confluent, the rocket-ship startup behind Apache Kafka®, answers this question and more in a round of database bingo with The Data Wranglers Joe Hellerstein and Jeffrey Heer. Rao discusses best use cases for Kafka, both traditional and newer applications, along with how to use SQL and variations for data transformation. Rao, who goes deep in both open-source and the roots of the database industry, is the co-author of more than 20 reference research papers and the co-inventor of more than a dozen U.S. software patents. #TheDataWranglers

    The Inside Story of Apache Airflow with Steven Hillion

    Play Episode Listen Later Nov 18, 2021 39:03


    What data orchestration platform is downloaded more than 10,000 times a day? Data scientist Steven Hillion joins The Data Wranglers Joe Hellerstein and Jeffrey Heer to give the inside story on Apache Airflow, used by data scientists and data engineers around the world. Apache Airflow is managed commercially by Astronomer.io, where Hillion is Head of Data and in his spare time, is writing a book of poems from mathematic formulas. #TheDataWranglers

    Redefining Metadata and Data Science with Shirshanka Das

    Play Episode Listen Later Nov 11, 2021 33:56


    What does metadata really mean? Data scientist Shirshanka Das joins The Data Wranglers, Joe Hellerstein, Jeffrey Heer and Adam Wilson, to re-define metadata. Das discusses his innovative work in data, including a decade at LinkedIn where he was part of a now-legendary data cabal that coined the term "data science" and built the open-source engineering tools Kafka, Pinot and DataHub. Recently, Shirshanka co-founded a new company, Acryl Data, to support the DataHub open-source project. #TheDataWranglers

    Data Science, Engineering and Wrangling: What's the Diff?

    Play Episode Listen Later Nov 11, 2021 28:42


    What's a trumpet have to do with data management?  Meet co-hosts Joe Hellerstein and Jeff Heer in this first episode of The Data Wranglers, where they discuss and riff on data engineering, analytics, data science and all things modern data management.

    Introducing: The Data Wranglers

    Play Episode Listen Later Nov 11, 2021 3:21


    Meet The Data Wranglers, with co-hosts Joe Hellerstein, Jeff Heer and their data wrangling expert guests. On Thursdays, The Data Wranglers will discuss and riff on data engineering, analytics, data science and all things modern data management. Don't be surprised if Adam Wilson joins from time to time with insights on all things data. A Trifacta Podcast. #TheDataWranglers Visit www.Trifacta.com/podcast for more info.

    Claim The Data Wranglers

    In order to claim this podcast we'll send an email to with a verification link. Simply click the link and you will be able to edit tags, request a refresh, and other features to take control of your podcast page!

    Claim Cancel