DataTalks.Club

DataTalks.Club - the place to talk about data!

Latest episode: Jul 17, 2026
New episodes: every other week
Average duration: 56m
Episodes: 222

Latest episodes from DataTalks.Club

Thriving in the AI Era with Human Skills - Maryam Ramezani-Bartsch

Jul 17, 2026 (60:02)

In this talk, Maryam Ramezani-Bartsch, Data and AI Leader with over 20 years of experience at companies like adidas and Zalando, shares her career journey, from building foundational ML systems at adidas to coaching data experts through the modern AI landscape. We explore the intersection of technical strategy and the human skills needed to thrive in the AI era.

LINKS:
- https://maryamramezani.com/designyourdatacareer

You will learn about:
- The surprising similarities and critical differences between the current Generative AI boom and the previous Big Data era.
- Why the traditional boundaries between data roles are disappearing and the specific T-shaped profile companies are actually hiring for today.
- The hidden danger of perfectionism in corporate tech, and what a healthy margin of failure actually looks like in practice.
- How to stop leaving your career trajectory to chance by applying product Design Thinking to your own life.
- A practical framework for navigating industry uncertainty and tech career anxiety without burning out.
- Battle-tested strategies for regaining your footing and standing out in a highly competitive job market after a layoff.
- The specific non-technical human skills that will become your ultimate career moat against automation.

TIMECODES:
00:00 Human Skills in the AI Era
07:07 Building ML Systems at Adidas
12:49 Generative AI vs Big Data Era
18:39 T-Shaped Data Engineering Roles
24:04 Overcoming Perfectionism in Tech
30:34 Design Thinking for Data Careers
38:59 Managing Tech Career Anxiety
45:07 Aligning Passion with Tech Skills
50:18 Job Search Strategies After Layoffs
56:03 Essential Soft Skills for Juniors

This talk is essential for data professionals, software engineers, and tech leaders looking to future-proof their careers in an increasingly automated world. Whether you are a junior developer navigating a tough job market, an engineer bouncing back from layoffs, or a senior professional looking to strategically design your next pivot, this session provides the tools to build a highly resilient career.

Connect with Maryam:
- LinkedIn - https://www.linkedin.com/in/maryam-ramezani-bartsch/
- Website - https://maryamramezani.com/
- Substack - https://maryamramezani.substack.com/

Connect with DataTalks.Club:
- Join the community - https://datatalks.club/slack.html
- Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ
- Check other upcoming events - https://lu.ma/dtc-events
- GitHub - https://github.com/DataTalksClub
- LinkedIn - https://www.linkedin.com/company/datatalks-club/
- Twitter - https://twitter.com/DataTalksClub
- Website - https://datatalks.club/

Building a Career in AI From Real Estate to AI Engineering - Gustaf Gyllensporre

Jul 10, 2026 (62:24)

In this talk, Gustaf Gyllensporre, Senior AI Engineer and PropTech Founder, shares his unconventional career journey from selling Miami real estate to shipping production AI systems. We explore the tactical steps for breaking into the tech space as a self-taught developer and how to bypass traditional industry gatekeepers.

Links:
- @PropTechFounder - https://youtu.be/leXRiJ5TuQo?si=ymK03qKVEC7hAt9N
- https://x.com/gostak_dd

You will learn about:
- The strategic approach to crafting an AI engineering resume that actually gets noticed by hiring managers.
- Why building another generic RAG chatbot might be hurting your portfolio and the specific high-impact projects you should build instead.
- The massive difference between interviewing for AI roles at dynamic startups versus traditional big tech companies.
- How to leverage open source contributions to prove your technical mastery without a computer science degree.
- The surprisingly simple networking tactics and developer ambassador programs that can unlock exclusive job opportunities.
- Actionable ways to improve the critical social and communication skills that most developers completely ignore.

TIMECODES:
00:00 AI Engineering Field Guide
05:01 Self Taught AI Engineer Pivot
09:39 CPython Open Source Contributions
13:49 Tech YouTube Channel Growth
18:26 AI Engineer Resume Optimization
22:44 AI Engineering Portfolio Projects
29:23 Startup vs Big Tech Interviews
33:47 Open Source AI Project Ideas
38:55 Building Deep Research AI Agents
42:52 Landing Your First AI Job
46:48 Tech Networking Strategies
51:16 Technical Project Demo Videos
54:53 Self Taught Developer Mistakes
58:53 Soft Skills for Software Engineers

Connect with Gustaf:
- LinkedIn - https://www.linkedin.com/in/gustaf-g/

How to Build AI that actually Ships in Production - Aleksandr Kim

Jul 3, 2026 (57:57)

In this talk, Aleksandr Kim, Senior Data Scientist at Intuit, shares his expertise in building AI-powered features in production, from fine-tuning BERT models in cyber security to engineering scalable data verification platforms. We explore the reality of moving beyond messy research code to build observable, cost-effective AI agents and automated pipelines.

You'll learn about:
- Translating traditional machine learning metrics into actionable business outcomes
- Validating large language model behavior through robust evaluation and alignment techniques
- Pivoting from a generic chatbot project to high-value Slack automation workflows
- Structuring outputs and guided reasoning layers to eliminate trivial AI summaries
- Defining the overlapping skills between AI engineers, data scientists, and full-stack software engineers
- Implementing multi-LLM routing logic and token caching to minimize enterprise API expenses
- Identifying critical data infrastructure bottlenecks to determine when to pivot or drop an AI pilot

TIMECODES:
00:00 AI Engineering Production and Scalability
06:12 Intuit Ecosystem and QuickBooks Products
12:17 Aligning ML Metrics with Business Outcomes
18:52 AI Engineers Conducting Customer Interviews
25:13 Structured Output and Guided Reasoning
31:13 Defining AI Engineering vs Software Engineering
37:20 Cost Optimization and Multi LLM Routing
43:26 UI Trends and Token Management in Industry
49:33 Future Career Trends in AI Engineering
55:46 Data Infrastructure Bottlenecks and ML Failures

This session is designed for mid-to-senior level Data Scientists, Machine Learning Engineers, and Software Engineers who want to develop a highly practical, production-first approach to generative AI. It is especially useful for technology leads focused on reducing token overhead and building self-correcting agentic systems.

Connect with Aleksandr:
- Website - https://alexkimds.github.io/
- LinkedIn - https://www.linkedin.com/in/aleksandrkim/

AI Adoption in Enterprise Beyond Writing Code - Ivan Bilan

Jun 26, 2026 (62:25)

In this talk, Ivan, Senior Engineering Manager at Personio, shares his expertise in the data and software space, from his early days building traditional NLP systems and massive ETL pipelines to his current leadership role in Identity and Access Management (IAM). We explore the rapid evolution of Generative AI, the reality of managing AI agents in production, and the emerging field of context engineering to optimize developer workflows.

You'll learn about:
- The buy vs. build dilemma for AI infrastructure and local LLMs.
- How AI agents are shifting workloads and evolving code reviews.
- Why AI is currently better at fixing tech debt than building from scratch.
- Measuring the ROI of AI integration using DORA metrics and cycle times.
- Strategies to manage vendor lock-in and minimize AI provider dependency.
- Using "context engineering" and specification-driven development to maximize LLM quality.
- Why hiring junior engineers is still essential and how AI accelerates their onboarding.

TIMECODES:
00:00 Career Journey in Data Science and NLP
07:37 Industry Adoption of Generative AI and Agents
11:45 Buy vs Build Dilemma for AI Infrastructure
15:46 AI Capability Limits in Fixing Tech Debt
19:32 Developer Workloads and AI Code Contributions
24:49 Experimentation with Open Source AI Agent Architectures
30:06 Measuring ROI and Business Value of AI Integration
35:10 Tracking AI Impact Using DORA Metrics
39:51 Impact of AI Code Generation on CI/CD System Reliability
43:00 Best Practices for Team AI Tool Adoption
48:20 Managing Vendor Lock-In Risks with AI Providers
51:27 Importance of Hiring Junior Software Engineers
56:28 Accelerated Junior Developer Onboarding with AI Assistants
01:00:12 Specification-Driven Development and Context Engineering

This talk is perfect for software engineers, engineering managers, and technical leaders looking to practically integrate AI tools into their teams without sacrificing code quality or system reliability. It is especially valuable for tech professionals navigating the complexities of AI adoption, CI/CD pipeline management, and organizational scaling in the GenAI era.

Connect with Ivan:
- LinkedIn - https://www.linkedin.com/in/ivan-bilan/
- Twitter - https://x.com/demiourgosua
- GitHub - https://github.com/ivan-bilan
- Website - https://github.com/ivan-bilan

Applied AI 2026 Berlin Conference Interview

Jun 19, 2026 (54:43)

The conference highlighted a critical shift in the technology and engineering ecosystem, moving away from passive implementations toward autonomous AI systems, collaborative communities, and robust engineering guardrails. Discussions centered on the practical architecture required to scale AI safely, the evolution of modern developer tools, and the importance of cross-border technical collaboration. Ultimately, the insights underscored that the future of technology relies on blending rigorous infrastructure with human-centric ecosystem growth.

Florian Hönicke, an expert in engineering infrastructure, explored operational shifts in cloud services and the challenges of secure temporary access provisioning. He detailed strategies for managing transient credentials for large groups and autonomous agents using automated serverless functions without exposing long-lived access keys. His central thesis is that true engineering rigor requires deterministic, self-expiring security layers at the container level.

Stella Buhalis, a technical community and developer relations leader, addressed the human dynamics fueling open-source ecosystems and community-driven adoption. She emphasized that long-term project viability stems from structured developer onboarding and lower cognitive barriers rather than pure marketing outreach. Her key insight is that building trusted technical communities acts as the ultimate feedback loop for improving developer experience and software reliability.

Błażej Nowakowski, a backend systems architect, focused on database migration paradigms and the optimization of high-dimensional vector search at the network edge. He analyzed real-world infrastructure friction points, specifically isolating SQLite database lock conflicts and remote data sync latencies on serverless architectures. He noted that decoupling persistent remote backends from the core runtime is crucial for maintaining low-latency, multi-cloud application performance.

Alena Astrakhantseva, a talent strategy and engineering education specialist, outlined the rapid evolution of technical training as the industry shifts from traditional development to autonomous AI flows. She analyzed how continuous testing, real-time monitoring, and structured evaluation frameworks must become core competencies for new developers. Her perspective is that the next wave of technical talent must be hired for systemic engineering rigor over simple syntax mastery.

Zhen Ming Ng (Babypro), an open-source library maintainer and developer, demonstrated automation workflows for package deployment and baseline library compliance. He focused on minimizing framework overhead by substituting heavy, resource-intensive dependencies with lightweight tokenizers and compact client drivers. His core perspective is that library design must prioritize minimalism to remain functional across edge-native runtime environments.

Connect with speakers:
- Florian Hönicke, Cloud Infrastructure & DevOps Engineer Specialist - https://www.linkedin.com/in/florian-h%C3%B6nicke-b902b6aa
- Stella Buhalis, Developer Relations & Technical Community Lead - https://www.linkedin.com/in/stella-buhalis
- Błażej Nowakowski, Backend Systems Architect & Database Engineer - https://www.linkedin.com/in/b%C5%82a%C5%BCej-nowakowski-096716168/
- Alena Astrakhantseva, Technical Talent Strategist & Engineering Educator - https://www.linkedin.com/in/alenaastra/
- Zhen Ming Ng (Babypro), Open Source Software Maintainer & Core Developer - https://www.linkedin.com/in/ming91/

From GenAI Pilots to Production - Nikita Kozodoi

Jun 5, 2026 (63:32)

In this talk, Nikita, Senior Applied Data Scientist at the AWS Generative AI Innovation Center, shares his expertise in bringing enterprise artificial intelligence out of the sandbox, from his early days optimizing traditional machine learning models like gradient boosting to deploying advanced production-grade GenAI pipelines. We explore what it really takes to move generative AI systems from pilot prototypes to production environments.

Links:
- AWS Generative AI Innovation Center: https://aws.amazon.com/ai/generative-ai/innovation-center/

You'll learn about:
- Deploying multi-layered defenses independent of backend LLMs.
- Evaluating parameter-efficient methods like LoRA and QLoRA for small models.
- Balancing long-term domain expertise with real-time documentation retrieval.
- Utilizing multi-agent orchestration for search and anomaly explanation.
- Setting up robust LLM-as-a-judge frameworks verified by human metrics.
- Leveraging Amazon Bedrock components for memory and runtime scalability.

TIMECODES:
05:52 Shifting from traditional ML to generative AI
07:49 Hybrid pipelines blending classical ML and LLMs
11:25 Production guardrails and multi-layered system defense
16:15 Prompt bypasses, input attacks, and AI red teaming
20:49 Newsletter localization and translation with Zalando
27:24 Evaluation frameworks and human-in-the-loop metrics
33:07 Aligning LLM-as-a-judge with few-shot prompts
34:49 Fine-tuning small language models versus prompting
41:18 Complementary mechanics of RAG and fine-tuning
43:00 Agentic web search tools for anomaly explanation
47:01 Automated text generation from real-time sports sensors
49:58 AWS project scoping and proof of concept timelines
54:58 Interview requirements and career skills for AWS roles
57:59 Enterprise architecture patterns and system observability
01:00:42 Reusable infrastructure blocks on Amazon Bedrock

This session is designed for machine learning engineers, data scientists, and technical product managers looking to architect reliable, production-ready GenAI workflows. It is highly valuable for teams aiming to bridge the gap between experimental AI prototypes and secure enterprise software.

Connect with Nikita:
- LinkedIn - https://www.linkedin.com/in/kozodoi/
- GitHub - https://github.com/kozodoi
- Website and blog - https://www.kozodoi.me/

From Notebook to Production: Building End-to-End AI Systems - Mariano Semelman

May 29, 2026 (67:54)

In this talk, Mariano, Lead Data Scientist and ML Engineer at OLX, shares his journey building high-impact AI media solutions. We explore the transition from traditional e-commerce models to Generative AI and Agentic tools, focusing on how to take AI products from a notebook to full-scale production.

You'll learn about:
- How to master the full product cycle from requirement gathering to deployment.
- Using video-to-ad technology to automate car listings and seller experiences.
- Essential modern tools like FastAPI, Arize, and why UV is a game-changer.
- When to use LLMs versus specialized vision models like CLIP and YOLO.
- Why production pipelines are moving from Jupyter notebooks to CLI tools.
- How agentic coding and AI assistants are 10x-ing development speed.

TIMECODES:
0:00 Community Introduction and Slack Engagement
4:16 Career Journey: From Argentina to Barcelona
7:16 Product-Driven AI vs. Traditional Reporting
9:41 AI Media Solutions for E-Commerce Sellers
10:55 Video-to-Ad: The Future of Marketplaces
13:45 Automated Content Creation for Sellers
17:10 Defining End-to-End Ownership in Data Science
21:12 The Longevity of the CRISP-DM Framework
25:33 Impact of Agentic Coding and GitHub Copilot
31:42 Why LLMs Aren't Always the Best Solution
37:39 Translating Business Needs to ML Requirements
41:18 Managing Explicit and Implicit Feedback Loops
48:26 Architecture Deep Dive: Image Description Logic
55:28 The Declining Role of Notebooks in Production
1:02:53 The Modern Tech Stack: FastAPI, UV, and Arize

Connect with Mariano:
- LinkedIn - https://www.linkedin.com/in/msemelman/

Data Makers Fest 2026 Conference Interviews

May 22, 2026 (66:22)

At Data Makers Fest, a recurring theme was the tension between GenAI hype and production reality. Speakers stressed that classical ML, MLOps, evaluation, data quality, and governance remain essential, especially in regulated sectors like fintech and healthcare. Another strong theme was inclusivity: building AI that serves smaller languages, diverse communities, and practitioners beyond the English-centric ecosystem.

Ryan Chaves. Head of ML at a Dutch fintech, Ryan focused on the gap between AI demos and production systems. He argued that classical ML remains critical for fraud detection and risk scoring, while GenAI works best as an accelerator on top of existing systems. He also emphasized storytelling, stakeholder communication, and mentorship as core engineering skills.

Alp Öktem. Computational linguist and researcher Alp explored the imbalance between AI progress in English and low-resource languages. Through Mozilla Data Collective, he highlighted how open datasets, speech corpora, and synthetic data can expand AI access to underrepresented communities. His broader warning: fluent AI can still fail culturally, linguistically, and ethically.

Agnieszka Kamińska. Working in pharmaceutical ML engineering, Agnieszka discussed extracting scientific knowledge from research documents into knowledge graphs. Her focus was reliability: LLMs help with entity extraction and relationship discovery, but trustworthy systems still require ontologies, validation layers, and production-minded engineering. She advocated a pragmatic middle ground between AI hype and skepticism.

Nemanja Radojković. An MLOps engineer in finance, Nemanja reflected on how GenAI is changing software engineering itself. He argued that coding assistants improve productivity but risk weakening engineers' understanding if overused. His central point: governance, reproducibility, and platform engineering will become even more important as organizations deploy AI agents at scale.

Filipa Castro. Leading AI initiatives at Euronext, Filipa described how GenAI is integrated into regulated financial workflows. Her team uses LLMs to automate document-heavy operational processes while preserving human validation. Her broader message: successful enterprise AI depends less on flashy models and more on infrastructure foundations like CI/CD, monitoring, governance, and operational rigor.

Beatriz Silva. As a student volunteer pursuing a master's in data science, Beatriz represented the conference's educational and community dimension. For her, the event was about access: networking with companies, exploring thesis opportunities, and connecting academic learning with industry practice. Her perspective highlighted how conferences like Data Makers Fest help shape the next generation of AI practitioners.

Connect with speakers:
- Ryan Chaves. Head of Machine Learning at a Dutch fintech focused on fraud detection, risk systems, and production ML. LinkedIn
- Alp Öktem. Computational linguist and researcher focused on low-resource languages, inclusive AI, and open language datasets. LinkedIn
- Agnieszka Kamińska. Machine Learning Engineer working on scientific knowledge extraction, knowledge graphs, and AI systems in pharma. LinkedIn
- Nemanja Radojković. Senior MLOps Engineer specializing in regulated financial systems, AI governance, and platform engineering. LinkedIn
- Filipa Castro. AI Lead at Euronext focused on enterprise GenAI systems, operational AI strategy, and financial services automation. LinkedIn
- Beatriz Silva. Data science master's student and conference volunteer exploring opportunities in ML and computer vision. LinkedIn

Competitions: Beyond the Kaggle Leaderboard - Tatiana Habruseva

May 1, 2026 (65:00)

In this talk, Tatiana, Staff Software Engineer at LinkedIn, shares her journey from academic physics to becoming a Kaggle Master and winning the Sound Demixing Challenge. We explore how to use machine learning competitions as a strategic tool to build a high-impact career and bridge the gap between theory and production.

You'll learn about:
- Turning competition code into professional GitHub repos.
- Converting results into papers for NIPS and CVPR.
- How LLMs are changing the benchmark for AI competitions.
- Why hands-on implementation beats passive learning.
- Using Topcoder and AICrowd for research-driven goals.
- Practical steps for your very first model submission.

Links:
- Rise: 3 Practical Steps for Advancing Your Career, Standing Out as a Leader, and Liking Your Life, by Patty Azzarello - https://www.porchlightbooks.com/pages/author/Patty_Azzarello-16156396 - awesome book about why doing good is not enough, and what else you need to do to promote your career (same applies to competitions)
- AICrowd - https://www.aicrowd.com/challenges
- Grand challenges - https://grand-challenge.org/challenges/
- Kaggle competitions - https://www.kaggle.com/competitions
- TopCoder challenge SpaceNet 9 - https://www.topcoder.com/challenges/9620f66a-767e-40ac-81d5-5cc61274b186 (no current active competitions, but they appear)
- Medium blog post with instruction - https://medium.com/data-science/writing-papers-tech-reports-after-kaggle-competitions-ee504fc0c4c1
- Kaggle Solution Write-Up Documentation - https://www.kaggle.com/solution-write-up-documentation
- Evaluating Machine Learning Agents on Machine Learning Engineering - https://arxiv.org/abs/2410.07095
- Machine Learning Engineering Agent via Search and Targeted Refinement - https://arxiv.org/html/2506.15692v2
- AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench - https://arxiv.org/pdf/2507.02554

TIMECODES:
00:00 Tatiana's journey from academia to staff software engineer
06:01 Machine learning applications in physics and signal processing
09:13 Skill development and domain diversification on Kaggle
13:35 Agentic AI benchmarks and automated competition entries
17:43 Deep technical mastery versus leaderboard gamification
23:04 Hands-on implementation and the illusion of learning
26:01 Specialized platforms and fair competition environments
31:35 Academic publications and research from silver medals
35:24 GitHub repositories and engineering portfolio building
39:02 Technical marketing via blog posts and LinkedIn
43:25 Innovative approaches for academic conference submissions
47:21 Research challenges at NIPS and CVPR workshops
52:51 Medical imaging platforms and specialized recommendations
57:46 First submission strategies for beginners
01:00:56 Asynchronous collaboration and competition team dynamics

Perfect for data scientists and engineers looking to transition from academia or build a formal portfolio using Kaggle as a career-advancement tool.

Connect with Tatiana:
- LinkedIn - https://www.linkedin.com/in/tatigabru/

PyConDE 2026 Conference Interviews

Apr 24, 2026 (82:31)

At PyConDE 2026, community leaders, educators, and Python tooling builders explored how Python is evolving in the age of AI, and why human connection, mentorship, and strong fundamentals matter more than ever.

Jessica Greene (Ecosia / PyLadies Berlin) spoke about her work as a machine learning engineer and community organizer. She highlighted PyLadies Berlin's role in creating inclusive spaces for learning, networking, and career growth, and emphasized that AI should be seen as an amplification tool, not a replacement for solid engineering or people skills.

Cheuk Ting Ho (JetBrains) discussed her role on the PyCharm team, where conferences are key for gathering feedback and staying connected to the community. She shared insights from her talk on free-threaded Python and her approach to technical storytelling across talks, blogs, videos, and informal interviews.

Sebastian Raschka reflected on his work as an AI educator focused on “from scratch” explanations of machine learning and LLMs. Driven by curiosity, he prefers creating new talks over repeating old ones and aims to help people understand what happens under the hood, especially with reasoning models.

Kyle Into (Meta) introduced Pyrefly, a Rust-based Python type checker designed for large codebases. He explained how type checking improves both human and AI-assisted development by making interfaces explicit, reducing risk, and strengthening project structure.

Valerio Maggio shared his journey from data science into developer advocacy and community organizing. He emphasized that conferences rely on volunteers, that lightning talks boost accessibility and energy, and that sustainable processes are essential to avoid burnout.

Tereza Iofciu discussed her “Data Diplomat” coaching framework, helping data professionals navigate leadership and uncertainty. She noted that AI and lean teams are raising expectations, making it crucial to think strategically, build fundamentals, and invest in real networks.

Irina Saribekova described her transition from organizing Python events in Saint Petersburg to supporting PyData Berlin and PyConDE. She highlighted that conferences are built on trust, relationships, and clear systems, and that developer relations extends this work through talks, writing, and community engagement.

Connect with speakers:
- Jessica Greene - Machine Learning Engineer at Ecosia, PyLadies Berlin co-organizer, and chair of the PyLadies Germany fund. Connect: https://www.linkedin.com/in/jessica0greene/
- Cheuk Ting Ho - Developer Advocate at JetBrains working with the PyCharm team and active in the global Python community. Connect: https://www.linkedin.com/in/cheukting-ho/
- Sebastian Raschka - AI educator, author, and machine learning researcher focused on LLMs, reasoning models, and educational “from scratch” implementations. Connect: https://www.linkedin.com/in/sebastianraschka/
- Kyle Into - Engineer at Meta working on Pyrefly, a fast Python type checker built for large-scale codebases and AI-assisted development. Connect: https://www.linkedin.com/in/kyleinto/
- Valerio Maggio - Data scientist, developer advocate, community organizer, and long-time contributor to PyCon Italia and PyConDE. Connect: https://www.linkedin.com/in/valeriomaggio/
- Tereza Iofciu - Data coach, trainer, community contributor, and creator of the Data Diplomat framework for data professionals and leaders. Connect: https://www.linkedin.com/in/tereza-iofciu/
- Irina Saribekova - Developer relations specialist and Python community organizer involved in PyData Berlin, PyConDE, and conference community building. Connect: https://www.linkedin.com/in/irinasaribekova/

Starting a Data Conference: The Data Makers Fest Story - Leonid Kholkine

Apr 17, 2026 (63:57)

In this talk, Leonid Kholkine, Head of Research & Development at DareData and Co-founder of Data Makers Fest, shares his journey from leading international student organizations to building one of Europe's premier data conferences. We explore the behind-the-scenes reality of community building, the evolution of the Portuguese data scene, and the technical challenges of managing AI observability at an enterprise scale.

You'll learn about:
- Understanding the hybrid role between product engineering and high-touch consultancy.
- How organizing meetups and leagues creates a professional reputation and high-trust networks.
- The hidden complexities of moving from local meetups to large-scale international conferences (venues, AV, and timing).
- How Leonid used custom code and embeddings to automate speaker scheduling and timetable optimization.
- Why community is the essential antidote for data practitioners working as the "only one" in their company.
- A look into R&D at DareData and the future of monitoring and self-improving generative AI workflows.

Links:
- www.datamakersfest.com
- Data Lead Club - http://dataleadclub.ripply.net/
- DareData - https://www.daredata.ai/
- GenOS by DareData - https://www.daredata.ai/gen-os

TIMECODES:
00:00 Community Building in Data and AI
03:02 Computer Engineering and International Leadership Roots
06:13 Machine Learning Research in Sports Physiology
10:18 Data Lead Club and Executive Networking Retreats
14:03 AI Observability and R&D at DareData
18:50 Professional Growth through Community Organizing
22:11 The Origins of Data Science Portugal
27:57 Logistical Challenges of In-Person Conferences
31:24 Strategic Event Scheduling and Venue Selection
36:52 Automated Timetable Optimization with Custom Code
41:22 Curating Quality Speaker Proposals in the AI Era
45:08 Sponsorship Value and Student Ticket Accessibility
50:23 Partnership Outreach and Network Development
54:44 The Forward Deployed Engineer Role and Methodology
58:35 Professional Development for Junior Data Scientists

This video is a must-watch for data practitioners, aspiring community leaders, and event organizers. It provides deep value for anyone looking to understand the intersection of technical R&D and the "human stack" of networking and professional development.

Connect with Leonid:
- LinkedIn - https://www.linkedin.com/in/kholkine/

Understanding the AI Engineer Role - Nasser Qadri

Play Episode Listen Later Apr 10, 2026 62:18

In this talk, Nasser Qadri, AI Engineering Manager at Google, shares his unique career journey—from a PhD in Politics and International Relations to leading high-stakes AI initiatives. We explore the evolution of the AI Engineer role, the critical intersection of social science and machine learning, and how to build robust agentic workflows with engineering rigor.You'll learn about:- Moving beyond simple API calls to implementing full-stack engineering principles and "Agent Ops."- How a background in qualitative research and statistics provides a unique "moral compass" for building ethical AI.- A strategic roadmap for transitioning from non-traditional backgrounds into elite AI engineering roles.- Using design thinking and personal "pain points" to drive meaningful technical innovation.- Why traditional ML and model distillation will remain vital as we move from generalist LLMs to specialized, high-speed agents.- How to navigate the complex landscape of AI frameworks and build depth in your technical stack.TIMECODES:00:00 Transitioning from Social Science to Software Engineering07:45 Applying Statistical Rigor to Generative AI Evaluation12:10 Balancing Research Mindsets with Engineering Speed16:30 Managing Non-Deterministic Systems and Model Creativity20:15 Comparing AI Roles in Big Tech vs Startups24:40 Learning by Building: Solving Personal Pain Points31:50 Mental Frameworks for Problem Finders and Solvers36:15 Human-Centered Design in the Age of LLMs42:05 Beyond API Calls: Software Engineering Rigor for Agents45:50 Orchestration and the Rise of Agent Ops51:30 Depth vs Breadth in AI Framework Selection56:10 The Future of Latency and Traditional ML Integration1:01:20 When to Prioritize Model Distillation and Fine-Tuning1:02:10 Closing Thoughts and Future OutlookThis conversation is designed for software engineers, data scientists, and career-switchers looking to transition into the Generative AI space. 
It is particularly valuable for technical leaders in large organizations and startups who need to balance rapid AI prototyping with long-term system reliability.Connect with Nasser- Linkedin - https://www.linkedin.com/in/nasserq/

Data Engineer Career in 2026: Roles, Specializations, and What Companies Look for - Slawomir Tulski

Play Episode Listen Later Mar 27, 2026 68:43

In this talk, Slawomir Tulski, Data Leadership Consultant and former Meta Data Engineering Manager, shares his ten-year journey through the evolution of data systems—from researching glaciers in Poland to scaling the ads ranking infrastructure at one of the world's largest tech giants. We explore the shifting definition of the Data Engineer, the "Actionable Data" philosophy, and how to navigate the 2026 hiring market amidst the rise of AI.You'll learn about:- How to distinguish between Platform DE, Product DE, and Analytics Engineering.- Why most teams over-engineer their stacks and how to build "Value-First" instead of "Tool-First."- Why being "cloud-cost-conscious" is the most underrated competitive advantage in modern data teams.- How to identify "Legacy Traps" and choose a company culture that fosters growth.- Why strategic builders will thrive while "DBT Monkeys" and manual triaging roles are at risk of automation.- How to frame side projects and end-to-end "Toy Platforms" to stand out to recruiters without a Big Tech pedigree.TIMECODES:00:00 From Measuring Glaciers to London's Tech Scene06:47 Hadoop vs. AI: Lessons from the Original Big Data Hype11:54 The Data Identity Crisis: Platform vs. Product Engineering17:29 Tech-Native vs. Tech-by-Necessity Company Cultures25:33 The Competitive Advantage of Cost-Aware Engineering30:56 Avoiding Over-Engineered Platforms and Modern Data Stacks38:01 The Real-Time Myth: When to Use Kafka and Spark42:08 Breaking into Data Engineering: 2026 Market Reality51:04 AI Automation: Why Strategic Builders Outlast "DBT Monkeys"57:35 Portfolio Strategy: Framing Side Projects for Maximum Impact1:04:42 The Ultimate Portfolio Project: Building End-to-End Platforms1:07:49 Networking Advice and Local Gdansk CultureThis talk is designed for ambitious data professionals including engineers, analysts, and career-switchers who want a pragmatic, "fluff-free" roadmap for surviving and thriving in the 2026 data landscape. 
It is particularly valuable for hiring managers and senior leaders looking to audit their recruitment processes, as well as those in traditional corporate environments seeking to implement the agile, high-impact engineering cultures found in Big Tech giants like Meta.Connect with Slawomir:- Linkedin - https://www.linkedin.com/in/slawomir-tulski-091611116/- Form for DE role Ebook - https://docs.google.com/forms/d/e/1FAIpQLSdSCLaBdTtuRlgV_nukKckumR60VOovECtlRIRI5DMUIk36EQ/viewform?usp=dialog

Inside the AI Engineer Role: Tools, Skills, and Career Path - Ruslan Shchuchkin

Play Episode Listen Later Mar 20, 2026 67:47

In this talk, Ruslan Shchuchkin, GenAI Engineer at Finance Guru, shares his unique career evolution from business administration and account management to building production-grade generative AI systems. We explore the transition from traditional Data Science to the modern AI Engineer role, defined by the "universal soldier" mindset and the ability to ship end-to-end products.You'll learn about:- Why modern AI engineers must bridge the gap between frontend, backend, and LLM logic.- How building in public and creating personal projects like Branch GPT can fast-track your hiring process.- Why understanding human behavior and user needs is the ultimate safeguard against AI replacement.- How to use tools like Cursor and Claude to accelerate development without losing your technical edge.- How traditional roles are evolving and why evaluation is the new superpower for data professionals.- Practical tips for starting local AI meetups and side hustles (like the Catch a Flat extension) without perfectionism.- Why the industry is shifting toward specific project track records and energy over formal degrees.Links: - https://www.swyx.io/create-luckTIMECODES:00:00 From Account Management to Data Science07:51 Building Branch GPT and Side Project Philosophy10:41 Transitioning to AI Engineering Full-Time15:26 Maximizing Your "Luck Surface Area"19:48 The AI Engineer as a Universal Soldier23:19 Humans vs. AI in Product Discovery28:31 Staying Sharp with X, Grok, and Meetups33:21 How to Launch a Lean Local AI Community38:49 Catch a Flat: Vibe Coding and Side Hustles43:04 Learning the Business Side through Small Projects48:48 Sourcing Project Inspiration from Daily Life52:28 The Future and Longevity of Data Science57:39 Skills over Degrees: The Realities of Hiring01:03:12 Using AI to Learn Instead of Just CodingThis talk is for Data Scientists and Software Engineers looking to transition into AI Engineering or GenAI roles. 
It is equally valuable for developers interested in building side projects, maximizing their career visibility, and staying updated in a rapidly shifting tech landscape.Connect with Ruslan- Linkedin - https://www.linkedin.com/in/ruslanshchuchkin/

How to Become an AI Engineer After a Career Break - Revathy Ramalingam

Play Episode Listen Later Mar 13, 2026 5:25

In this episode Revathy Ramalingam, Senior Software Engineer and AI Engineer at a healthcare startup, shares her inspiring personal journey from over nine years in telecom software architecture to successfully transitioning back into the industry after a seven-year career break. We explore the evolution of the AI engineer role, the practical application of RAG pipelines, and the strategic use of AI tools to rebuild a technical career.You'll learn about:- AI Career Mapping: Using LLMs to design an upskilling roadmap.- Vibe Coding: Leveraging AI tools for rapid prototyping.- RAG Implementation: Building retrieval systems with LangChain.- Interview Strategy: Proving technical skills after a career gap.- Learning in Public: Building a network through community projects.TIMECODES:00:00 Why Move to AI? Using ChatGPT to Plan a Career Pivot11:00 Learning in Public: The Power of Community Support15:35 Telecom Capstone: Predicting Network Slices with ML22:15 "Vibe Coding" & Building Prototypes with AI Dev Tools28:00 The Interview Process: Navigating a 7-Year Career Break33:45 Practical Interview Tasks: Building a PDF Q&A Assistant39:40 Career Advice: Clear Plans, AI Mentors, and Hard Work44:30 Closing Thoughts: Scaling the Learning LadderThis talk is for developers and career-changers looking for a blueprint to enter the AI engineering space. 
It is ideal for those interested in RAG, healthcare tech, and practical career resets.Connect with Revathy- Github - https://github.com/RevathyRamalingam- Linkedin - https://www.linkedin.com/in/revathy-ramalingam/

The Future of AI Agents - Aditya Gautam

Play Episode Listen Later Mar 6, 2026 68:39

In this talk, Aditya, an experienced AI Researcher and Engineer, shares his technical evolution, from his roots in embedded systems to building complex, large-scale AI agent architectures. We explore the practical challenges of enterprise AI adoption, the shifting economics of LLMs, and the infrastructure required to deploy reliable multi-agent systems.You'll learn about:- The ROI of Fine-Tuning: How to decide between specialized small models and general-purpose APIs based on cost and latency.- Agent MLOps Stack: The essential roles of guardrails, data lineage, and auditability in AI workflows.- Reliability in High-Stakes Verticals: Navigating the unique AI deployment challenges in the legal and healthcare sectors.- Evaluation Frameworks: How to design robust evals for multi-tenancy systems at scale.- Human-in-the-Loop: Strategies for aligning "LLM as a judge" with human-labeled ground truth to eliminate bias.- The Future of AGI: What to expect from the next wave of multimodal agents and autonomous systems.TIMECODES: 00:00 Aditya's path from embedded systems to AI08:52 Enterprise AI research and adoption gaps 13:13 AI reliability in legal and healthcare 19:16 Specialized models and agent governance 24:58 LLM economics: Fine-tuning vs. API ROI 30:26 Agent MLOps: Guardrails and data lineage 36:55 Iterating on agents with user feedback 43:30 AI evals for multi-tenancy and scale 50:18 Aligning LLM judges with human labels 56:40 Agent infrastructure and deployment risks 1:02:35 Future of AGI and multimodal agentsThis talk is designed for Machine Learning Engineers, Data Scientists, and Technical Product Managers who are moving beyond AI prototypes and into production-grade agentic workflows. 
It is especially relevant for those working in regulated industries or managing high-volume API budgets.Connect with Aditya:- Linkedin - https://www.linkedin.com/in/aditya-gautam-68233a30/

Analytics Engineering with dbt Workshop - Juan Manuel Perafan

Play Episode Listen Later Feb 27, 2026 83:57

In this talk, Juan, Analytics Engineer and author of Fundamentals of Analytics Engineering, shares his professional journey from studying psychological research in Colombia to becoming one of the first analytics engineers in the Netherlands. We explore the evolution of the role, the shift toward engineering rigor in data modeling, and how the landscape of tools like dbt and Databricks is changing the way teams work.You'll learn about:- The fundamental differences between traditional BI engineering and modern analytics engineering.- How to bridge the gap between business stakeholders and technical data infrastructure.- The technical "glue" that connects Python and SQL for robust data pipelines.- The importance of automated testing (generic vs. singular tests) to prevent "silent" data failures.- Strategies for modeling messy, fragmented source data into a unified "business reality."- The current state of the "Lakehouse" paradigm and how it impacts storage and compute costs.- Expert advice on navigating the dbt ecosystem and its emerging competitors.Links:- DE Course: https://github.com/DataTalksClub/data-engineering-zoomcamp- Luma: https://luma.com/0uf7mmupTIMECODES:0:00 Juan's psychological research and transition to data4:36 Riding the wave: The early days of analytics engineering7:56 Breaking down the gap between analysts and engineers11:03 The art of turning business reality into clean data16:25 Why data engineering is about safety, not just speed20:53 Reimagining data modeling in the modern era26:53 To split or not to split: Finding the right team roles30:35 Python, SQL, and the technical toolkit for success38:41 How to stop manually testing your data dashboards46:34 Bringing software engineering rigor to data workflows49:50 Must-read books and resources for mastering the craft55:42 The future of dbt and the shifting tool landscape1:00:29 Deciphering the lakehouse: Warehousing in the cloud1:11:16 Pro-tips for starting your data engineering journey1:14:40 The big debate: Databricks vs. Snowflake1:18:28 Why every data professional needs a local communityThis talk is designed for data analysts looking to level up their engineering skills, data engineers interested in the business-logic layer, and data leaders trying to structure their teams more effectively. 
It is particularly valuable for those preparing for the Data Engineering Zoomcamp or anyone looking to transition into an Analytics Engineering role.Connect with Juan- Linkedin - https://www.linkedin.com/in/jmperafan/ - Website - https://juanalytics.com/

AI Engineering: Skill Stack, Agents, LLMOps, and How to Ship AI Products - Paul Iusztin

Play Episode Listen Later Feb 6, 2026 67:15

In this episode of DataTalks.Club, Paul Iusztin, founding AI engineer and author of the LLM Engineer's Handbook, breaks down the transition from traditional software development to production-grade AI engineering. We explore the essential skill stack for 2026, the shift from "PoC purgatory" to shipping real products, and why the future of the field belongs to the full-stack generalist.You'll learn about:- Why the role is evolving into the "new software engineer" and how to own the full product lifecycle.- Identifying when to use traditional ML (like XGBoost) over LLMs to avoid over-engineering.- The architectural shift from fine-tuning to mastering data pipelines and semantic search.- Reliable Agentic Workflows- How to use coding assistants like Claude and Cursor to act as an architect rather than just a coder.- Why human-in-the-loop evaluation is the most critical bottleneck in shipping reliable AI.- How to build a "Second Brain" portfolio project that proves your end-to-end engineering value.Links:- Course link: https://academy.towardsai.net/courses/agent-engineering?ref=b3ab31- Decoding AI Magazine: https://www.decodingai.com/TIMECODES:00:00 From code to cars: Paul's journey to AI07:08 Deep learning and the autonomous driving challenge12:09 The transition to global product engineering15:13 Survival guide: Data science vs. AI engineering22:29 The full-stack AI engineer skill stack29:12 Mastering RAG and knowledge management32:27 The generalist edge: Learning with AI42:21 Technical pillars for shipping AI products54:05 Portfolio secrets and the "second brain"58:01 The future of the LLM engineer's handbookThis talk is designed for software engineers, data scientists, and ML engineers looking to move beyond proof-of-concepts and master the engineering rigors of shipping AI products in a production environment. 
It is particularly valuable for those aiming for founding or lead AI roles in startups.Connect with Paul- Linkedin - https://www.linkedin.com/in/pauliusztin/- Website - https://www.pauliusztin.ai/

Applying ML: An Ongoing Personal Journey

Play Episode Listen Later Jan 9, 2026 64:30

In this talk, Rileen, a Senior Computational Biologist and Cancer Data Scientist, shares his professional journey from physics and computer science to cutting-edge cancer genomics and applied machine learning. From his early work in alternative splicing models to deep learning in medical imaging, Rileen explains how biology, data science, and AI intersect to transform cancer research.TIMECODES:00:00 Rileen's Career Journey and Education06:14 Understanding Alternative Splicing in Computational Biology10:56 Modeling Alternative Splicing with Machine Learning14:52 Model Error Analysis and Transition to Cancer Research18:37 What Is Cancer? Mutational Theory Explained21:45 Cancer Treatments and Causes24:57 Cancer Genomics and Tumor Models28:59 Comparing Cell Lines and Tumor Samples (Multi-omics Analysis)32:32 Machine Learning Applications in Cancer Research35:38 Deep Learning for Medical Imaging and Pathology39:17 Data Privacy and Applied ML Course Projects42:50 Learning Outcomes and Future Plans46:36 Industry Experience in Pharmaceutical Research50:14 Day in the Life of a Computational Biologist55:02 Advice for Current ML Students58:40 Project Management and Challenges in Genomics1:02:23 Public Data Sets and Cancer Research in GermanyConnect with Rileen:- Twitter - https://x.com/RileenSinha- Linkedin - https://www.linkedin.com/in/rileen-sinha-a644692/- Github - https://github.com/Optimistix

Building Pet Health Tech: ML, Sensors, and Dog Behavior Data

Play Episode Listen Later Dec 12, 2025 61:14

In this session, Sofya shares her journey building a pet-tech startup that blends machine learning, sensor data, and canine behavior analytics. She walks through her path from early programming explorations to launching a health monitoring device designed around anomaly detection and long-term behavioral baselines.TIMECODES: 00:00 Sofya's pet tech startup with machine learning, sensor data, and behavior pattern analytics10:00 Journey from programming hobby to full-time software development career17:20 Career growth after skipping university and building practical experience24:07 Puppy adoption story and family influence on pet-focused innovation32:16 Dog health monitoring framed as anomaly detection in real-world machine learning37:05 Collecting canine data with emphasis on sleep patterns and cycle tracking43:35 Establishing a dog's normal baseline through long-term data observation49:34 Startup funding through personal savings and early-stage bootstrapping55:28 Finding cofounders and collaborators through meetups and coworking communities59:48 Closing insights on Sofya's educational path and early device prototypesConnect with Sofya- Website - https://www.fit-tails.com/ - Linkedin - https://www.linkedin.com/in/sofya-yulpatova/

From Full-Time Mom to Head of Data and Cloud - Xia He-Bleinagel

Play Episode Listen Later Nov 28, 2025 62:14

In this talk, Xia He-Bleinagel, Head of Data & Cloud at NOW GmbH, shares her remarkable journey from studying automotive engineering across Europe to leading modern data, cloud, and engineering teams in Germany.We dive into her transition from hands-on engineering to leadership, how she balanced family with career growth, and what it really takes to succeed in today's cloud, data, and AI job market.TIMECODES:00:00 Studying Automotive Engineering Across Europe08:15 How Andrew Ng Sparked a Machine Learning Journey11:45 Import–Export Work as an Unexpected Career Boost17:05 Balancing Family Life with Data Engineering Studies20:50 From Data Engineer to Head of Data & Cloud27:46 Building Data Teams & Tackling Tech Debt30:56 Learning Leadership Through Coaching & Observation34:17 Management vs. IC: Finding Your Best Fit38:52 Boosting Developer Productivity with AI Tools42:47 Succeeding in Germany's Competitive Data Job Market46:03 Fast-Track Your Cloud & Data Career50:03 Mentorship & Supporting Working Moms in Tech53:03 Cultural & Economic Factors Shaping Women's Careers57:13 Top Networking Groups for Women in Data1:00:13 Turning Domain Expertise into a Data Career AdvantageConnect with Xia- Linkedin - https://www.linkedin.com/in/xia-he-bleinagel-51773585/- Github - https://github.com/Data-Think-2021- Website - https://datathinker.de/

From Black-Box Systems to Augmented Decision-Making - Anusha Akkina

Play Episode Listen Later Nov 28, 2025 62:48

In this talk, Anusha Akkina, co-founder of Auralytix, shares her journey from working as a Chartered Accountant and Auditor at Deloitte to building an AI-powered finance intelligence platform designed to augment, not replace, human decision-making. Together with host Alexey from DataTalks.Club, she explores how AI is transforming finance operations beyond spreadsheets, from tackling ERP limitations to creating real-time insights that drive strategic business outcomes.TIMECODES:00:00 Building trust in AI finance and introducing Auralytix02:22 From accounting roots to auditing at Deloitte and Paraxel08:20 Moving to Germany and pivoting into corporate finance11:50 The data struggle in strategic finance and the need for change13:23 How Auralytix was born: bridging AI and financial compliance17:15 Why ERP systems fail finance teams and how spreadsheets fill the gap24:31 The real cost of ERP rigidity and lessons from failed transformations29:10 The hidden risks of spreadsheet dependency and knowledge loss37:30 Experimenting with ChatGPT and coding the first AI finance prototype43:34 Identifying finance's biggest pain points through user research47:24 Empowering finance teams with AI-driven, real-time decision insights50:59 Developing an entrepreneurial mindset through strategy and learning54:31 Essential resources and finding the right AI co-founderConnect with Anusha- Linkedin - https://www.linkedin.com/in/anusha-akkina-acma-cgma-56154547/- Website - https://aurelytix.com/

Qdrant 2025 Conference Interviews

Play Episode Listen Later Nov 28, 2025 51:59

At Qdrant Conference, builders, researchers, and industry practitioners shared how vector search, retrieval infrastructure, and LLM-driven workflows are evolving across developer tooling, AI platforms, analytics teams, and modern search research.Andrey Vasnetsov (Qdrant) explained how Qdrant was born from the need to combine database-style querying with vector similarity search—something he first built during the COVID lockdowns. He highlighted how vector search has shifted from an ML specialty to a standard developer tool and why hosting an in-person conference matters for gathering honest, real-time feedback from the growing community.Slava Dubrov (HubSpot) described how his team uses Qdrant to power AI Signals, a platform for embeddings, similarity search, and contextual recommendations that support HubSpot's AI agents. He shared practical use cases like look-alike company search, reflected on evaluating agentic frameworks, and offered career advice for engineers moving toward technical leadership.Marina Ariamnova (SumUp) presented her internally built LLM analytics assistant that turns natural-language questions into SQL, executes queries, and returns clean summaries—cutting request times from days to minutes. She discussed balancing analytics and engineering work, learning through real projects, and how LLM tools help analysts scale routine workflows without replacing human expertise.Evgeniya (Jenny) Sukhodolskaya (Qdrant) discussed the multi-disciplinary nature of DevRel and her focus on retrieval research. 
She shared her work on sparse neural retrieval, relevance feedback, and hybrid search models that blend lexical precision with semantic understanding—contributing methods like Mini-COIL and shaping Qdrant's search quality roadmap through end-to-end experimentation and community education.SpeakersAndrey VasnetsovCo-founder & CTO of Qdrant, leading the engineering and platform vision behind a developer-focused vector database and vector-native infrastructure.Connect: https://www.linkedin.com/in/andrey-vasnetsov-75268897/Slava DubrovTechnical Lead at HubSpot working on AI Signals—embedding models, similarity search, and context systems for AI agents.Connect: https://www.linkedin.com/in/slavadubrov/Marina AriamnovaData Lead at SumUp, managing analytics and financial data workflows while prototyping LLM tools that automate routine analysis.Connect: https://www.linkedin.com/in/marina-ariamnova/Evgeniya (Jenny) SukhodolskayaDeveloper Relations Engineer at Qdrant specializing in retrieval research, sparse neural methods, and educational ML content.Connect: https://www.linkedin.com/in/evgeniya-sukhodolskaya/

covid-19 ai conference cto ml hubspot llm sql devrel sumup

How to Build and Evaluate AI systems in the Age of LLMs - Hugo Bowne-Anderson

Play Episode Listen Later Oct 24, 2025 61:40

In this talk, Hugo Bowne-Anderson, an independent data and AI consultant, educator, and host of the podcasts Vanishing Gradients and High Signal, shares his journey from academic research and curriculum design at DataCamp to advising teams at Netflix, Meta, and the US Air Force. Together, we explore how to build reliable, production-ready AI systems, from prompt evaluation and dataset design to embedding agents into everyday workflows.
You'll learn about:
- How to structure teams and incentives for successful AI adoption
- Practical prompting techniques for accurate timestamp and data generation
- Building and maintaining evaluation sets to avoid “prompt overfitting”
- Cost-effective methods for LLM evaluation and monitoring
- Tools and frameworks for debugging and observing AI behavior (Logfire, Braintrust, Arize Phoenix)
- The evolution of AI agents, from simple RAG systems to proactive, embedded assistants
- How to escape “proof of concept purgatory” and prioritize AI projects that drive business value
- Step-by-step guidance for building reliable, evaluable AI agents
This session is ideal for AI engineers, data scientists, ML product managers, and startup founders looking to move beyond experimentation into robust, scalable AI systems. Whether you're optimizing RAG pipelines, evaluating prompts, or embedding AI into products, this talk offers actionable frameworks to guide you from concept to production.
LINKS:
- Escaping POC Purgatory: Evaluation-Driven Development for AI Systems - https://www.oreilly.com/radar/escaping-poc-purgatory-evaluation-driven-development-for-ai-systems/
- Stop Building AI Agents - https://www.decodingai.com/p/stop-building-ai-agents
- How to Evaluate LLM Apps Before You Launch - https://www.youtube.com/watch?si=90fXJJQThSwGCaYv&v=TTr7zPLoTJI&feature=youtu.be
- My Vanishing Gradients Substack - https://hugobowne.substack.com/
- Building LLM Applications for Data Scientists and Software Engineers - https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=datatalksclub
TIMECODES:
00:00 Introduction and Expertise
04:04 Transition to Freelance Consulting and Advising
08:49 Restructuring Teams and Incentivizing AI Adoption
12:22 Improving Prompting for Timestamp Generation
17:38 Evaluation Sets and Failure Analysis for Reliable Software
23:00 Evaluating Prompts: The Cost and Size of Gold Test Sets
27:38 Software Tools for Evaluation and Monitoring
33:14 Evolution of AI Tools: Proactivity and Embedded Agents
40:12 The Future of AI is Not Just Chat
44:38 Avoiding Proof of Concept Purgatory: Prioritizing RAG for Business Value
50:19 RAG vs. Agents: Complexity and Power Trade-Offs
56:21 Recommended Steps for Building Agents
59:57 Defining Memory in Multi-Turn Conversations
Connect with Hugo:
- Twitter - https://x.com/hugobowne
- LinkedIn - https://www.linkedin.com/in/hugo-bowne-anderson-045939a5/
- GitHub - https://github.com/hugobowne
- Website - https://hugobowne.github.io/
Connect with DataTalks.Club:
- Join the community - https://datatalks.club/slack.html
- Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ
- Check other upcoming events - https://lu.ma/dtc-events
- GitHub - https://github.com/DataTalksClub
- LinkedIn - https://www.linkedin.com/company/datatalks-club/
- Twitter - https://twitter.com/DataTalksClub
- Website - https://datatalks.club/

From Biotechnology to Bioinformatics Software - Sebastian Ayala Ruano

Play Episode Listen Later Oct 24, 2025 55:36

In this talk, Sebastian, a bioinformatics researcher and software engineer, shares his journey from wet lab biotechnology to computational bioinformatics. Hosted by DataTalks.Club, this session explores how data science, AI, and open-source tools are transforming modern biological research, from DNA sequencing to metagenomics and protein structure prediction.
You'll learn about:
- The difference between wet lab and dry lab workflows in biotechnology
- How bioinformatics enables faster insights through data-driven modeling
- The MicW2Graph project and its role in studying wastewater microbiomes
- Using co-abundance networks and the CCLasso algorithm to map microbial interactions
- How AlphaFold revolutionized protein structure prediction
- Building scientific knowledge graphs to integrate biological metadata
- Open-source tools like VueGen and VueCore for automating reports and visualizations
- The growing impact of AI and large language models (LLMs) in research and documentation
- Key differences between the R (Bioconductor) and Python ecosystems for bioinformatics
This talk is ideal for data scientists, bioinformaticians, biotech researchers, and AI enthusiasts who want to understand how data science, AI, and biology intersect. Whether you work in genomics, computational biology, or scientific software, you'll gain insights into real-world tools and workflows shaping the future of bioinformatics.
Links:
- MicW2Graph: https://zenodo.org/records/12507444
- VueGen: https://github.com/Multiomics-Analytics-Group/vuegen
- Awesome-Bioinformatics: https://github.com/danielecook/Awesome-Bioinformatics
TIMECODES:
00:00 Sebastian's Journey into Bioinformatics
06:02 From Wet Lab to Computational Biology
08:23 Wet Lab vs Dry Lab Explained
12:35 Bioinformatics as Data Science for Biology
15:30 How DNA Sequencing Works
19:29 MicW2Graph and Wastewater Microbiomes
23:10 Building Microbial Networks with CCLasso
26:54 Protein–Ligand Simulation Basics
29:58 Predicting Protein Folding in 3D
33:30 AlphaFold Revolution in Protein Prediction
36:45 Inside the MicW2Graph Knowledge Graph
39:54 VueGen: Automating Scientific Reports
43:56 VueCore: Visualizing Omics Data
47:50 Using AI and LLMs in Bioinformatics
50:25 R vs Python in Bioinformatics Tools
53:17 Closing Thoughts from Ecuador
Connect with Sebastian:
- Twitter - https://twitter.com/sayalaruano
- LinkedIn - https://linkedin.com/in/sayalaruano
- GitHub - https://github.com/sayalaruano
- Website - https://sayalaruano.github.io/
Connect with DataTalks.Club:
- Join the community - https://datatalks.club/slack.html
- Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ
- Check other upcoming events - https://lu.ma/dtc-events
- GitHub - https://github.com/DataTalksClub
- LinkedIn - https://www.linkedin.com/company/datatalks-club/
- Twitter - https://twitter.com/DataTalksClub
- Website - https://datatalks.club/

ai google building dna open software using ai python data science github closing thoughts biotechnology ayala bioinformatics ruano

Lessons from Applied AI: Tesla, Waymo, and Beyond - Aishwarya Jadhav

Play Episode Listen Later Oct 10, 2025 59:17

In this episode, we talked with Aishwarya Jadhav, a machine learning engineer whose career has spanned Morgan Stanley, Tesla, and now Waymo. Aishwarya shares her journey from big data in finance to applied AI in self-driving, gesture understanding, and computer vision. She discusses building an AI guide dog for the visually impaired, contributing to malaria mapping in Africa, and the challenges of deploying safe autonomous systems. We also explore the intersection of computer vision, NLP, and LLMs, and what it takes to break into the self-driving AI industry.
TIMECODES:
00:51 Aishwarya's career journey from finance to self-driving AI
05:45 Building AI guide dog for the visually impaired
12:03 Exploring LiDAR, radar, and Tesla's camera-based approach
16:24 Trust, regulation, and challenges in self-driving adoption
19:39 Waymo, ride-hailing, and gesture recognition for traffic control
24:18 Malaria mapping in Africa and AI for social good
29:40 Deployment, safety, and testing in self-driving systems
37:00 Transition from NLP to computer vision and deep learning
43:37 Reinforcement learning, robotics, and self-driving constraints
51:28 Testing processes, evaluations, and staged rollouts for autonomous driving
52:53 Can multimodal LLMs be applied to self-driving?
55:33 How to get started in self-driving AI careers
Connect with Aishwarya:
- LinkedIn - https://www.linkedin.com/in/aishwaryajadhav8/
Connect with DataTalks.Club:
- Join the community - https://datatalks.club/slack.html
- Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ
- Check other upcoming events - https://lu.ma/dtc-events
- GitHub - https://github.com/DataTalksClub
- LinkedIn - https://www.linkedin.com/company/datatalks-club/
- Twitter - https://twitter.com/DataTalksClub
- Website - https://datatalks.club/

trust ai google lessons africa transition testing tesla nlp github morgan stanley deployment malaria waymo reinforcement aishwarya applied ai jadhav

Building reliable AI products in the era of Gen AI and Agents - Ranjitha Kulkarni

Play Episode Listen Later Oct 10, 2025 59:44

In this episode, we talked with Ranjitha Kulkarni, a machine learning engineer with a rich career spanning Microsoft, Dropbox, and now NeuBird AI. Ranjitha shares her journey into ML and NLP, her work building recommendation systems, early AI agents, and cutting-edge LLM-powered products. She offers insights into designing reliable AI systems in the new era of generative AI and agents, and how context engineering and dynamic planning shape the future of AI products.
TIMECODES:
00:00 Career journey and early curiosity
04:25 Speech recognition at Microsoft
05:52 Recommendation systems and early agents at Dropbox
07:44 Joining NeuBird AI
12:01 Defining agents and LLM orchestration
16:11 Agent planning strategies
18:23 Agent implementation approaches
22:50 Context engineering essentials
30:27 RAG evolution in agent systems
37:39 RAG vs agent use cases
40:30 Dynamic planning in AI assistants
43:00 AI productivity tools at Dropbox
46:00 Evaluating AI agents
53:20 Reliable tool usage challenges
58:17 Future of agents in engineering
Connect with Ranjitha:
- LinkedIn - https://www.linkedin.com/in/ranjitha-gurunath-kulkarni
Connect with DataTalks.Club:
- Join the community - https://datatalks.club/slack.html
- Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ
- Check other upcoming events - https://lu.ma/dtc-events
- GitHub - https://github.com/DataTalksClub
- LinkedIn - https://www.linkedin.com/company/datatalks-club/
- Twitter - https://twitter.com/DataTalksClub
- Website - https://datatalks.club/

ai google future career microsoft defining speech products agent context recommendations dynamic nlp reliable ml dropbox github llm genai rag kulkarni

From Theme Parks to Tesla: Building Data Products That Work

Play Episode Listen Later Oct 10, 2025 60:45

In this episode, we talked with Abouzar Abbaspour, a data engineer whose career spans software engineering in Iran, building crowd and recommendation systems at a Dutch theme park, deploying large-scale ML models at Bol.com, and now working at Tesla. Abouzar shares how he bridged diverse industries, tackled real-world data challenges, and adapted to new roles while keeping a hands-on approach to machine learning and engineering.
TIMECODES:
00:00 Career journey and early motivations
06:17 Moving to Europe for data science
12:18 Working with theme parks and crowd modeling
18:29 Lessons from ride and visitor data
23:06 Building recommendation systems at Efteling
27:26 Joining Bol.com and the Dutch e-commerce industry
32:49 Product and brand recommendation logic
36:09 Experimenting with "Tinder for brands"
40:26 Engagement metrics and product validation
43:02 From ML engineering to data engineering roles
52:04 Hands-on skills at Tesla and industry expectations
57:43 Career growth, learning, and advice
Connect with Abouzar:
- LinkedIn - / abouzar-abbaspour
- Website - https://www.abouzar-abbaspour.com/
Connect with DataTalks.Club:
- Join the community - https://datatalks.club/slack.html
- Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/...
- Check other upcoming events - https://lu.ma/dtc-events
- GitHub - https://github.com/DataTalksClub
- LinkedIn - / datatalks-club
- Twitter - / datatalksclub
- Website - https://datatalks.club/

europe google lessons moving building career data hands iran tesla engagement product tinder products dutch ml theme parks bol experimenting

From Semiconductors to Machine Learning: A Career in Data and Teaching

Play Episode Listen Later Oct 10, 2025 73:25

In this episode, we chat with Dashel Ruiz, whose journey spans semiconductors, machine learning, and teaching. Dashel shares how he transitioned from hardware to data science, navigated complex projects in diverse industries, and now combines technical expertise with a passion for teaching. Tune in to hear insights on building a career in data, mastering new technologies, and making an impact both in the lab and the classroom.
TIMECODES:
00:00 Dashel's unique career path from music to semiconductors
06:16 The transition into data and software engineering at Microchip
11:44 Discovering machine learning to solve real problems in semiconductor manufacturing
20:40 How Dashel found the Machine Learning Zoomcamp and his experience with it
29:33 The practical advantages of DataTalks.Club courses over other platforms
39:52 Overcoming challenges and the value of the learning community
48:10 Hands-on project experience: From image classification to Kaggle competitions
54:12 Staying motivated throughout the long-term course
59:55 The importance of deployment and full-stack ML skills
1:07:36 Closing thoughts on teaching and future courses
Connect with Dashel:
- LinkedIn - https://www.linkedin.com/in/dashel-ruiz-perez-2b036172/
Connect with DataTalks.Club:
- Join the community - https://datatalks.club/slack.html
- Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ
- Check other upcoming events - https://lu.ma/dtc-events
- GitHub - https://github.com/DataTalksClub
- LinkedIn - https://www.linkedin.com/company/datatalks-club/
- Twitter - https://twitter.com/DataTalksClub
- Website - https://datatalks.club/

google career club teaching data overcoming staying hands discovering machine learning ml semiconductors kaggle dashel

Lessons from Two Decades of AI - Micheal Lanham

Play Episode Listen Later Sep 26, 2025 59:58

In this episode, we talk with Micheal Lanham, an AI and software innovator with over two decades of experience spanning game development, fintech, oil and gas, and agricultural tech. Micheal shares his journey from building neural network-based games and evolutionary algorithms to writing influential books on AI agents and deep learning. He offers insights into the evolving AI landscape, practical uses of AI agents, and the future of generative AI in gaming and beyond.
TIMECODES:
00:00 Micheal Lanham's career journey and AI agent books
05:45 Publishing journey: AR, Pokémon Go, sound design, and reinforcement learning
10:00 Evolution of AI: evolutionary algorithms, deep learning, and agents
13:33 Evolutionary algorithms in prompt engineering and LLMs
18:13 AI agent books second edition and practical applications
20:57 AI agent workflows: minimalism, task breakdown, and collaboration
26:25 Collaboration and orchestration among AI agents
31:24 Tools and reasoning servers for agent communication
35:17 AI agents in game development and generative AI impact
38:57 Future of generative AI in gaming and immersive content
41:42 Coding agents, new LLMs, and local deployment
45:40 AI model trends and data scientist career advice
53:36 Cognitive testing, evaluation, and monitoring in AI
58:50 Publishing details and closing remarks
Connect with Micheal:
- LinkedIn - https://www.linkedin.com/in/micheal-lanham-189693123/
Connect with DataTalks.Club:
- Join the community - https://datatalks.club/slack.html
- Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/...
- Check other upcoming events - https://lu.ma/dtc-events
- GitHub - https://github.com/DataTalksClub
- LinkedIn - / datatalks-club
- Twitter - / datatalksclub
- Website - https://datatalks.club/

ai google lessons future evolution tools collaboration publishing pok cognitive coding micheal evolutionary two decades lanham

Berlin PyData 2025 Conference Interviews

Play Episode Listen Later Sep 26, 2025 49:21

At PyData Berlin, community members and industry voices highlighted how AI and data tooling are evolving across knowledge graphs, MLOps, small-model fine-tuning, explainability, and developer advocacy.
- Igor Kvachenok (Leuphana University / ProKube) combined knowledge graphs with LLMs for structured data extraction in the polymer industry, and noted how MLOps is shifting toward LLM-focused workflows.
- Selim Nowicki (Distill Labs) introduced a platform that uses knowledge distillation to fine-tune smaller models efficiently, making model specialization faster and more accessible.
- Gülsah Durmaz (Architect & Developer) shared her transition from architecture to coding, creating Python tools for design automation and volunteering with PyData through PyLadies.
- Yashasvi Misra (Pure Storage) spoke on explainable AI, stressing accountability and compliance, and shared her perspective as both a data engineer and active Python community organizer.
- Mehdi Ouazza (MotherDuck) reflected on developer advocacy through video, workshops, and branding, showing how creative communication boosts adoption of open-source tools like DuckDB.
Speakers:
- Igor Kvachenok: Master's student in Data Science at Leuphana University of Lüneburg, writing a thesis on LLM-enhanced data extraction for the polymer industry. Builds RDF knowledge graphs from semi-structured documents and works at ProKube on MLOps platforms powered by Kubeflow and Kubernetes. Connect: https://www.linkedin.com/in/igor-kvachenok/
- Selim Nowicki: Founder of Distill Labs, a startup making small-model fine-tuning simple and fast with knowledge distillation. Previously led data teams at Berlin startups like Delivery Hero, Trade Republic, and Tier Mobility. Sees parallels between today's ML tooling and dbt's impact on analytics. Connect: https://www.linkedin.com/in/selim-nowicki/
- Gülsah Durmaz: Architect turned developer, creating Python-based tools for architectural design automation with Rhino and Grasshopper. Active in PyLadies and a volunteer at PyData Berlin, she values the community for networking and learning, and aims to bring ML into architecture workflows. Connect: https://www.linkedin.com/in/gulsah-durmaz/
- Yashasvi (Yashi) Misra: Data Engineer at Pure Storage, community organizer with PyLadies India, PyCon India, and Women Techmakers. Advocates for inclusive spaces in tech and speaks on explainable AI, bridging her day-to-day in data engineering with her passion for ethical ML. Connect: https://www.linkedin.com/in/misrayashasvi/
- Mehdi Ouazza: Developer Advocate at MotherDuck, formerly a data engineer, now focused on building community and education around DuckDB. Runs popular YouTube channels ("mehdio DataTV" and "MotherDuck") and delivered a hands-on workshop at PyData Berlin. Blends technical clarity with creative storytelling. Connect: https://www.linkedin.com/in/mehd-io/

From Astronomy to Applied ML - Daniel Egbo

Play Episode Listen Later Sep 26, 2025 63:54

In this episode, we talk with Daniel, an astrophysicist turned machine learning engineer and AI ambassador. Daniel shares his journey bridging astronomy and data science, how he leveraged live courses and public knowledge sharing to grow his skills, and his experiences working on cutting-edge radio astronomy projects and AI deployments. He also discusses practical advice for beginners in data and astronomy, and insights on career growth through community and continuous learning.
TIMECODES:
00:00 Lunar eclipse story and Daniel's astronomy career
04:12 Electromagnetic spectrum and MeerKAT data explained
10:39 Data analysis and positional cross-correlation challenges
15:25 Physics behind radio star detection and observation limits
16:35 Radio astronomy's advantage and machine learning potential
20:37 Radio astronomy progress and Daniel's ML journey
26:00 Python tools and experience with Zoomcamps
31:26 Intel internship and exploring LLMs
41:04 Sharing progress and course projects with orchestration tools
44:49 Setting up Airflow 3.0 and building data pipelines
47:39 AI startups, training resources, and NVIDIA courses
50:20 Student access to education, NVIDIA experience, and beginner astronomy programs
57:59 Skills, projects, and career advice for beginners
59:19 Starting with data science or engineering
1:00:07 Course sponsorship, data tools, and learning resources
Connect with Daniel:
- LinkedIn - / egbodaniel
Connect with DataTalks.Club:
- Join the community - https://datatalks.club/slack.html
- Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/...
- Check other upcoming events - https://lu.ma/dtc-events
- GitHub - https://github.com/DataTalksClub
- LinkedIn - / datatalks-club
- Twitter - / datatalksclub
- Website - https://datatalks.club/

ai google starting sharing data radio student skills intel physics nvidia python applied ml astronomy lunar meerkats electromagnetic airflow

Berlin Buzzwords 2025 Conference Interviews

Play Episode Listen Later Sep 12, 2025 67:42

At Berlin Buzzwords, industry voices highlighted how search is evolving with AI and LLMs.
- Kacper Łukawski (Qdrant) stressed hybrid search (semantic + keyword) as core for RAG systems and promoted efficient embedding models for smaller-scale use.
- Manish Gill (ClickHouse) discussed auto-scaling OLAP databases on Kubernetes, combining infrastructure and database knowledge.
- André Charton (Kleinanzeigen) reflected on scaling search for millions of classifieds, moving from Solr/Elasticsearch toward vector search, while returning to a hands-on technical role.
- Filip Makraduli (Superlinked) introduced a vector-first framework that fuses multiple encoders into one representation for nuanced e-commerce and recommendation search.
- Brian Goldin (Voyager Search) emphasized spatial context in retrieval, combining geospatial data with AI enrichment to add the “where” to search.
- Atita Arora (Voyager Search) highlighted geospatial AI models, the renewed importance of retrieval in RAG, and the cautious but promising rise of AI agents.
Together, their perspectives show a common thread: search is regaining center stage in AI, with scaling, hybridization, multimodality, and domain-specific enrichment shaping the next generation of retrieval systems.
Speakers:
- Kacper Łukawski: Senior Developer Advocate at Qdrant, he educates users on vector and hybrid search. He highlighted Qdrant's support for dense and sparse vectors, the role of search with LLMs, and his interest in cost-effective models like static embeddings for smaller companies and edge apps. Connect: https://www.linkedin.com/in/kacperlukawski/
- Manish Gill: Engineering Manager at ClickHouse, he spoke about running ClickHouse on Kubernetes, tackling auto-scaling and stateful sets. His team focuses on making ClickHouse scale automatically in the cloud. He credited its speed to careful engineering and reflected on the shift from IC to manager. Connect: https://www.linkedin.com/in/manishgill/
- André Charton: Head of Search at Kleinanzeigen, he discussed shaping the company's search tech, moving from Solr to Elasticsearch and now vector search with Vespa. Kleinanzeigen handles 60M items, 1M new listings daily, and 50k requests/sec. André explained his career shift back to hands-on engineering. Connect: https://www.linkedin.com/in/andrecharton/
- Filip Makraduli: Founding ML DevRel engineer at Superlinked, an open-source framework for AI search and recommendations. Its vector-first approach fuses multiple encoders (text, images, structured fields) into composite vectors for single-shot retrieval. His Berlin Buzzwords demo showed e-commerce search with natural-language queries and filters. Connect: https://www.linkedin.com/in/filipmakraduli/
- Brian Goldin: Founder and CEO of Voyager Search, which began with geospatial search and expanded into documents and metadata enrichment. Voyager indexes spatial data and enriches pipelines with NLP, OCR, and AI models to detect entities like oil spills or windmills. He stressed adding spatial context (“the where”) as critical for search and highlighted Voyager's 12 years of enterprise experience. Connect: https://www.linkedin.com/in/brian-goldin-04170a1/
- Atita Arora: Director of AI at Voyager Search, with nearly 20 years in retrieval systems, now focused on geospatial AI for Earth observation data. At Berlin Buzzwords she hosted sessions, attended talks on Lucene, GPUs, and Solr, and emphasized retrieval quality in RAG systems. She is cautiously optimistic about AI agents and values the event as both learning hub and professional reunion. Connect: https://www.linkedin.com/in/atitaarora/

From Medicine to Machine Learning: How Public Learning Turned into a Career - Pastor Soto

Play Episode Listen Later Aug 22, 2025 59:31

In this episode, we talked with Pastor, a medical doctor who built a career in machine learning while studying medicine. Pastor shares how he balanced both fields, leveraged live courses and public sharing to grow his skills, and found opportunities through freelancing and mentoring.
TIMECODES:
00:00 Pastor's background and early programming journey
06:05 Learning new tools and skills on the job while studying medicine
11:44 Balancing medical studies with data science work and motivation
13:48 Applying medical knowledge to data science and vice versa
18:44 Starting freelance work on Upwork and overcoming language challenges
24:03 Joining the machine learning engineering course and benefits of live cohorts
27:41 Engaging with the course community and sharing progress publicly
35:16 Using LinkedIn and social media for career growth and interview opportunities
41:03 Building reputation, structuring learning, and leveraging course projects
50:53 Volunteering and mentoring with DeepLearning.AI and Stanford Coding Place
57:00 Managing time and staying productive while studying medicine and machine learning
Connect with Pastor:
- Twitter - https://x.com/PastorSotoB1
- LinkedIn - / pastorsoto
- GitHub - https://github.com/sotoblanco
- Website - https://substack.com/@pastorsoto
Connect with DataTalks.Club:
- Join the community - https://datatalks.club/slack.html
- Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/...
- Check other upcoming events - https://lu.ma/dtc-events
- GitHub - https://github.com/DataTalksClub
- LinkedIn - / datatalks-club
- Twitter - / datatalksclub
- Website - https://datatalks.club/

learning ai google starting building career managing medicine public balancing pastor engaging machine learning volunteering github upwork deep learning pastor soto

How to Rebuild Data Trust? Mindful Data Strategy and Maintenance vs Innovation - Lior Barak

Play Episode Listen Later Aug 15, 2025 61:30

Struggling with data trust issues, dashboard drama, or constant pipeline firefighting? In this deep-dive interview, Lior Barak shows you how to shift from a reactive “fix-it” culture to a mindful, impact-driven practice rooted in Zen/Wabi-Sabi principles.
You'll learn:
- Why 97% of CEOs say they use data, but only 24% call themselves data-driven
- The traffic-light dashboard pattern (green / yellow / red) that instantly tells execs whether numbers are safe to use
- A practical rule for balancing maintenance, rollout, and innovation while avoiding team burnout
- How to quantify ROI on data products, kill failing legacy systems, and handle ad-hoc exec requests without derailing roadmaps
- Turning “imperfect” data into business value with mindful communication, root-cause logs, and automated incident review loops

community trust ai google career innovation product ceos struggling roi accepting traffic mindful rebuild zen maintenance barak regaining lior data strategy wabi usea

From Simulations to Freelance Data Engineering: Orell's Journey Out of Academia and Into Consulting - Orell Garten

Play Episode Listen Later Aug 1, 2025 58:22

In this episode, we talk with Orell about his journey from electrical engineering to freelancing in data engineering. We explore lessons from startup life, working with messy industrial data, the realities of freelancing, and how to stay up to date with new tools.
Topics covered:
- Why Orell left a PhD and a simulation-focused start-up after Covid hit
- What he learned trying (and failing) to commercialise medical-imaging simulations
- The first freelance project and the long, quiet months that followed
- How he now finds clients, keeps projects small, and delivers value quickly
- Typical work he does for industrial companies: parsing messy machine logs, building simple pipelines, adding structure later
- Favorite everyday tools (Python, DuckDB, a bit of C++) and the habit of blocking time for learning
- Advice for anyone thinking about freelancing: cash runway, networking, and focusing on problems rather than “perfect” tech choices
A practical conversation for listeners who are curious about moving from research or permanent roles into freelance data engineering.

google phd tools staying startups exploring consulting academia freelance python github garten freelancing simulations data engineering duckdb orell

Can You Quit Your Job and Still Succeed as a Data Freelancer?

Play Episode Listen Later Jul 25, 2025 58:14

Thinking about swapping your 9-to-5 for client work, but worried that a long German-style notice period will kill your chances? In this live interview, seven-year data-freelance veteran Dimitri walks through his experience of taking his freelance career to the next level.
About the Speaker: Dimitri Visnadi is an independent data consultant with a focus on data strategy. He has consulted for companies leading the marketing data space, such as Unilever, Ferrero, Heineken, and Red Bull. He has lived and worked in 6 countries across Europe in both corporate and startup organizations. He was part of data departments at Hewlett-Packard (HP) and a Google-partnered consulting firm, where he worked on data products and strategy. Having received a Master's in Business Analytics with Computer Science from University College London and a Bachelor's in Business Administration from John Cabot University, Dimitri still has close ties to academia and holds a mentor position in entrepreneurship at both institutions.

From Hackathons to Developer Advocacy - Will Russell

Play Episode Listen Later May 26, 2025 57:10

In this podcast episode, we talked with Will Russell about his path from hackathons to developer advocacy.
About the Speaker: Will Russell is a Developer Advocate at Kestra, known for his videos on workflow orchestration. Previously, Will built open source education programs to help up-and-coming developers make their first contributions in open source. With a passion for developer education, Will creates technical video content and documentation that makes technologies more approachable for developers.
In this episode, we sit down with Will, a developer advocate, content creator, and passionate community builder. We'll hear about his unique path through tech, the lessons he's learned, and his approach to making complex topics accessible and engaging. Whether you're curious about open source, hackathons, or what it's like to bridge the gap between developers and the broader tech community, this conversation is full of insights and inspiration.

google mentorship developers russel hackathons developer advocate developer advocacy kestra will russell

Build a Strong Career in Data - Lavanya Gupta

Play Episode Listen Later May 9, 2025 51:59

In this podcast episode, we talked with Lavanya Gupta about building a strong career in data. About the Speaker: Lavanya is a Carnegie Mellon University (CMU) alumna of the Language Technologies Institute (LTI). She works as a Sr. AI/ML Applied Associate at JPMorgan Chase in their specialized Machine Learning Center of Excellence (MLCOE) vertical. Her latest research on long-context evaluation of LLMs was published in EMNLP 2024. In addition to having a strong industrial research background of 5+ years, she is also an enthusiastic technical speaker. She has delivered talks at events such as Women in Data Science (WiDS) 2021, PyData, Illuminate AI 2021, TensorFlow User Group (TFUG), and MindHack! Summit. She also serves as a reviewer at top-tier NLP conferences (NeurIPS 2024, ICLR 2025, NAACL 2025). Additionally, through her collaborations with various prestigious organizations, like AnitaB.org and Women in Coding and Data Science (WiCDS), she is committed to mentoring aspiring machine learning enthusiasts. In this episode, we talk about Lavanya Gupta's journey from software engineer to AI researcher. She shares how hackathons sparked her passion for machine learning, her transition into NLP, and her current work benchmarking large language models in finance. Tune in for practical insights on building a strong data career and navigating the evolving AI landscape.


From Supply Chain Management to Digital Warehousing and FinOps - Eddy Zulkifly

Play Episode Listen Later Apr 4, 2025 52:08

In this podcast episode, we talked with Eddy Zulkifly about his journey from supply chain management to digital warehousing and FinOps. About the Speaker: Eddy Zulkifly is a Staff Data Engineer at Kinaxis, building robust data platforms across Google Cloud, Azure, and AWS. With a decade of experience in data, he actively shares his expertise as a Mentor on ADPList and Teaching Assistant at Uplimit. Previously, he was a Senior Data Engineer at Home Depot, specializing in e-commerce and supply chain analytics. Currently pursuing a Master's in Analytics at the Georgia Institute of Technology, Eddy is also passionate about open-source data projects and enjoys watching and exploring the analytics behind the Fantasy Premier League. In this episode, we dive into the world of data engineering and FinOps with Eddy Zulkifly, Staff Data Engineer at Kinaxis. Eddy shares his unconventional career journey, from optimizing physical warehouses with Excel to building digital data platforms in the cloud.

Data Intensive AI - Bartosz Mikulski

Play Episode Listen Later Mar 21, 2025 54:54

In this podcast episode, we talked with Bartosz Mikulski about Data Intensive AI. About the Speaker: Bartosz is an AI and data engineer. He specializes in moving AI projects from the good-enough-for-a-demo phase to production by building a testing infrastructure and fixing the issues detected by tests. On top of that, he teaches programmers and non-programmers how to use AI. He contributed one chapter to the book 97 Things Every Data Engineer Should Know, and he was a speaker at several conferences, including Data Natives, Berlin Buzzwords, and Global AI Developer Days. In this episode, we discuss Bartosz's career journey, the importance of testing in data pipelines, and how AI tools like ChatGPT and Cursor are transforming development workflows. From prompt engineering to building Chrome extensions with AI, we dive into practical use cases, tools, and insights for anyone working in data-intensive AI projects. Whether you're a data engineer, AI enthusiast, or just curious about the future of AI in tech, this episode offers valuable takeaways and real-world experiences. 0:00 Introduction to Bartosz and his background 4:00 Bartosz's career journey from Java development to AI engineering 9:05 The importance of testing in data engineering 11:19 How to create tests for data pipelines 13:14 Tools and approaches for testing data pipelines 17:10 Choosing Spark for data engineering projects 19:05 The connection between data engineering and AI tools 21:39 Use cases of AI in data engineering and MLOps 25:13 Prompt engineering techniques and best practices 31:45 Prompt compression and caching in AI models 33:35 Thoughts on DeepSeek and open-source AI models 35:54 Using AI for lead classification and LinkedIn automation 41:04 Building Chrome extensions with AI integration 43:51 Comparing Cursor and GitHub Copilot for coding 47:11 Using ChatGPT and Perplexity for AI-assisted tasks 52:09 Hosting static websites and using AI for development 54:27 How blogging helps attract clients and share knowledge 58:15 Using AI to assist with writing and content creation


MLOps in Corporations and Startups - Nemanja Radojkovic

Play Episode Listen Later Mar 14, 2025 58:03

In this podcast episode, we talked with Nemanja Radojkovic about MLOps in Corporations and Startups. About the Speaker: Nemanja Radojkovic is Senior Machine Learning Engineer at Euroclear. In this event, we're diving into the world of MLOps, comparing life in startups versus big corporations. Joining us again is Nemanja, a seasoned machine learning engineer with experience spanning Fortune 500 companies and agile startups. We explore the challenges of scaling MLOps on a shoestring budget, the trade-offs between corporate stability and startup agility, and practical advice for engineers deciding between these two career paths. Whether you're navigating legacy frameworks or experimenting with cutting-edge tools. 1:00 MLOps in corporations versus startups 6:03 The agility and pace of startups 7:54 MLOps on a shoestring budget 12:54 Cloud solutions for startups 15:06 Challenges of cloud complexity versus on-premise 19:19 Selecting tools and avoiding vendor lock-in 22:22 Choosing between a startup and a corporation 27:30 Flexibility and risks in startups 29:37 Bureaucracy and processes in corporations 33:17 The role of frameworks in corporations 34:32 Advantages of large teams in corporations 40:01 Challenges of technical debt in startups 43:12 Career advice for junior data scientists 44:10 Tools and frameworks for MLOps projects 49:00 Balancing new and old technologies in skill development 55:43 Data engineering challenges and reliability in LLMs 57:09 On-premise vs. cloud solutions in data-sensitive industries 59:29 Alternatives like Dask for distributed systems


Trends in Data Engineering – Adrian Brudaru

Play Episode Listen Later Mar 7, 2025 56:59

In this podcast episode, we talked with Adrian Brudaru about the past, present and future of data engineering. About the speaker: Adrian Brudaru studied economics in Romania but soon got bored with how creative the industry was, and chose to go instead for the more factual side. He ended up in Berlin at the age of 25 and started a role as a business analyst. At the age of 30, he had enough of startups and decided to join a corporation, but quickly found out that it did not provide the challenge he wanted. As going back to startups was not a desirable option either, he decided to postpone his decision by taking freelance work and has never looked back since. Five years later, he co-founded a company in the data space to try new things. This company is also looking to release open source tools to help democratize data engineering. 0:00 Introduction to DataTalks.Club 1:05 Discussing trends in data engineering with Adrian 2:03 Adrian's background and journey into data engineering 5:04 Growth and updates on Adrian's company, DLT Hub 9:05 Challenges and specialization in data engineering today 13:00 Opportunities for data engineers entering the field 15:00 The "Modern Data Stack" and its evolution 17:25 Emerging trends: AI integration and Iceberg technology 27:40 DuckDB and the emergence of portable, cost-effective data stacks 32:14 The rise and impact of dbt in data engineering 34:08 Alternatives to dbt: SQLMesh and others 35:25 Workflow orchestration tools: Airflow, Dagster, Prefect, and GitHub Actions 37:20 Audience questions: Career focus in data roles and AI engineering overlaps 39:00 The role of semantics in data and AI workflows 41:11 Focusing on learning concepts over tools when entering the field 45:15 Transitioning from backend to data engineering: challenges and opportunities 47:48 Current state of the data engineering job market in Europe and beyond 49:05 Introduction to Apache Iceberg, Delta, and Hudi file formats 50:40 Suitability of these formats for batch and streaming workloads 52:29 Tools for streaming: Kafka, SQS, and related trends 58:07 Building AI agents and enabling intelligent data applications 59:09 Closing discussion on the place of tools like DBT in the ecosystem

Competitive Machine Learning And Teaching – Alexander Guschin

Play Episode Listen Later Feb 14, 2025 53:27

In this podcast episode, we talked with Alexander Guschin about launching a career off Kaggle. About the Speaker: Alexander Guschin is a Machine Learning Engineer with 10+ years of experience, a Kaggle Grandmaster ranked 5th globally, and a teacher to 100K+ students. He leads DS and SE teams and contributes to open-source ML tools. 00:00 Starting with Machine Learning: Challenges and Early Steps 13:05 Community and Learning Through Kaggle Sessions 17:10 Broadening Skills Through Kaggle Participation 18:54 Early Competitions and Lessons Learned 21:10 Transitioning to Simpler Solutions Over Time 23:51 Benefits of Kaggle for Starting a Career in Machine Learning 29:08 Teamwork vs. Solo Participation in Competitions 31:14 Schoolchildren in AI Competitions 42:33 Transition to Industry and MLOps 50:13 Encouraging teamwork in student projects 50:48 Designing competitive machine learning tasks 52:22 Leaderboard types for tracking performance 53:44 Managing small-scale university classes 54:17 Experience with Coursera and online teaching 59:40 Convincing managers about Kaggle's value 61:38 Secrets of Kaggle competition success 63:11 Generative AI's impact on competitive ML 65:13 Evolution of automated ML solutions 66:22 Reflecting on competitive data science experience

Redefining AI Infrastructure: Open-Source, Chips, and the Future Beyond Kubernetes

Play Episode Listen Later Jan 31, 2025 56:55

In this podcast episode, we talked with Andrey Cheptsov about The future of AI infrastructure. About the Speaker: Andrey Cheptsov is the founder and CEO of dstack, an open-source alternative to Kubernetes and Slurm, built to simplify the orchestration of AI infrastructure. Before dstack, Andrey worked at JetBrains for over a decade helping different teams make the best developer tools. During the event, the guest, Andrey Cheptsov, founder and CEO of dstack, discussed the complexities of AI infrastructure. We explore topics like the challenges of using Kubernetes for AI workloads, the need to rethink container orchestration, and the future of hybrid and cloud-only infrastructures. Andrey also shares insights into the role of on-premise and bare-metal solutions, edge computing, and federated learning. 0:00 Andrey's Career Journey: From JetBrains to DStack 5:00 The Motivation Behind DStack 7:00 Challenges in Machine Learning Infrastructure 10:00 Transitioning from Cloud to On-Prem Solutions 14:30 Reflections on OpenAI's Evolution 17:30 Open Source vs Proprietary Models: A Balanced Perspective 21:01 Monolithic vs. Decentralized AI businesses 22:05 The role of privacy and control in AI for industries like banking and healthcare 30:00 Challenges in training large AI models: GPUs and distributed systems 37:03 DeepSpeed's efficient training approach vs. brute force methods 39:00 Challenges for small and medium businesses: hosting and fine-tuning models 47:01 Managing Kubernetes challenges for AI teams 52:00 Hybrid vs. cloud-only infrastructure 56:03 On-premise vs. bare-metal solutions 58:05 Exploring edge computing and its challenges

Linguistics and Fairness - Tamara Atanasoska

Play Episode Listen Later Jan 17, 2025 54:13

In this podcast episode, we talked with Tamara Atanasoska about building fair AI systems. About the Speaker: Tamara works on ML explainability, interpretability and fairness as Open Source Software Engineer at probable. She is a maintainer of fairlearn, contributor to scikit-learn and skops. Tamara has both a computer science/software engineering and a computational linguistics (NLP) background. During the event, the guest discussed their career journey from software engineering to open-source contributions, focusing on explainability in AI through Scikit-learn and Fairlearn. They explored fairness in AI, including challenges in credit loans, hiring, and decision-making, and emphasized the importance of tools, human judgment, and collaboration. The guest also shared their involvement with PyLadies and encouraged contributions to Fairlearn. 0:00 Introduction to the event and the community 1:51 Topic introduction: Linguistic fairness and socio-technical perspectives in AI 2:37 Guest introduction: Tamara's background and career 3:18 Tamara's career journey: Software engineering, music tech, and computational linguistics 9:53 Tamara's background in language and computer science 14:52 Exploring fairness in AI and its impact on society 21:20 Fairness in AI models 26:21 Automating fairness analysis in models 32:32 Balancing technical and domain expertise in decision-making 37:13 The role of humans in the loop for fairness 40:02 Joining Probable and working on open-source projects 46:20 skops library and its integration with Hugging Face 50:48 PyLadies and community involvement 55:41 The ethos of Scikit-learn and Fairlearn


Career choices, transitions and promotions in and out of tech - Agita Jaunzeme

Play Episode Listen Later Jan 10, 2025 55:21

In this podcast episode, we talked with Agita Jaunzeme about career choices, transitions and promotions in and out of tech. About the Speaker: Agita has designed a career spanning DevOps/DataOps engineering, management, community building, education, and facilitation. She has worked on projects across corporate, startup, open source, and non-governmental sectors. Following her passion, she founded an NGO focusing on the inclusion of expats and locals in Porto. Embodying the values of innovation, automation, and continuous learning, Agita provides practical insights on promotions, career pivots, and aligning work with passion and purpose. During this event, the guest discussed their career journey, starting with their transition from art school to programming and later into DevOps, eventually taking on leadership roles. They explored the challenges of burnout and the importance of volunteering, founding an NGO to support inclusion, gender equality, and sustainability. The conversation also covered key topics like mentorship, the differences between data engineering and data science, and the dynamics of managing volunteers versus employees. Additionally, the guest shared insights on community management, developer relations, and the importance of product vision and team collaboration. 0:00 Introduction and Welcome 1:28 Guest Introduction: Agita's Background and Career Highlights 3:05 Transition to Tech: From Art School to Programming 5:40 Exploring DevOps and Growing into Leadership Roles 7:24 Burnout, Volunteering, and Founding an NGO 11:00 Volunteering and Mentorship Initiatives 14:00 Discovering Programming Skills and Early Career Challenges 15:50 Automating Work Processes and Earning a Promotion 19:00 Transitioning from DevOps to Volunteering and Project Management 24:00 Managing Volunteers vs. Employees and Building Organizational Skills 31:07 Personality traits in engineering vs. data roles 33:14 Differences in focus between data engineers and data scientists 36:24 Transitioning from volunteering to corporate work 37:38 The role and responsibilities of a community manager 39:06 Community management vs. developer relations activities 41:01 Product vision and team collaboration 43:35 Starting an NGO and legal processes 46:13 NGO goals: inclusion, gender equality, and sustainability 49:02 Community meetups and activities 51:57 Living off-grid in a forest and sustainability 55:02 Unemployment party and brainstorming session 59:03 Unemployment party: the process and structure

Career advice, learning, and featuring women in ML and AI - Isabella Bicalho

Play Episode Listen Later Dec 13, 2024 54:40

In this podcast episode, we talked with Isabella Bicalho about career advice, learning, and featuring women in ML and AI. About the Speaker: Isabella is a Machine Learning Engineer and Data Scientist with three years of hands-on AI development experience. She draws upon her early computational research expertise to develop ML solutions. While contributing to open-source projects, she runs a newsletter dedicated to showcasing women's accomplishments in data science. During this event, the guest discussed her transition into machine learning, her freelance work in AI, and the growing AI scene in France. She shared insights on freelancing versus full-time work, the value of open-source contributions, and developing both technical and soft skills. The conversation also covered career advice, mentorship, and her Substack series on women in data science, emphasizing leadership, motivation, and career opportunities in tech. 0:00 Introduction 1:23 Background of Isabella Bicalho 2:02 Transition to machine learning 4:03 Study and work experience 5:00 Living in France and language learning 6:03 Internship experience 8:45 Focus areas of Inria 9:37 AI development in France 10:37 Current freelance work 11:03 Freelancing in machine learning 13:31 Moving from research to freelancing 14:03 Freelance vs. full-time data science 17:00 Finding first freelance client 18:00 Involvement in open-source projects 20:17 Passion for open-source and teamwork 23:52 Starting new projects 25:03 Community project experience 26:02 Teaching and learning 29:04 Contributing to open-source projects 32:05 Open-source tools vs. projects 33:32 Importance of community-driven projects 34:03 Learning resources 36:07 Green space segmentation project 39:02 Developing technical and soft skills 40:31 Gaining insights from industry experts 41:15 Understanding data science roles 41:31 Project challenges and team dynamics 42:05 Turnover in open-source projects 43:05 Managing expectations in open-source work 44:50 Mentorship in projects 46:17 Role of AI tools in learning 47:59 Overcoming learning challenges 48:52 Discussion on Substack 49:01 Interview series on women in data 50:15 Insights from women in data science 51:20 Impactful stories from Substack 53:01 Leadership challenges in projects 54:19 Career advice and opportunities 56:07 Motivating others to step out of comfort zone 57:06 Contacting for Substack story sharing 58:00 Closing remarks and connections

AI in Industry: Trust, Return on Investment and Future - Maria Sukhareva

Play Episode Listen Later Dec 6, 2024 52:58

Reflection on an Almost Two-Year Journey of Generative AI in Industry – Maria Sukhareva. About the speaker: Maria Sukhareva is a principal key expert in Artificial Intelligence at Siemens with over 15 years of experience at the forefront of generative AI technologies. Known for her keen eye for technological innovation, Maria excels at transforming cutting-edge AI research into practical, value-driven tools that address real-world needs. Her approach is both hands-on and results-focused, with a commitment to creating scalable, long-term solutions that improve communication, streamline complex processes, and empower smarter decision-making. Maria's work reflects a balanced vision, where the power of innovation is met with ethical responsibility, ensuring that her AI projects deliver impactful and production-ready outcomes. We talked about: 00:00 DataTalks.Club intro 02:13 Career journey: From linguistics to AI 08:02 The Evolution of AI Expertise and its Future 13:10 AI vulnerabilities: Bypassing bot restrictions 17:00 Non-LLM classifiers as a more robust solution 22:56 Risks of chatbot deployment: Reputational and financial 27:13 The role of AI as a tool, not a replacement for human workers 31:41 The role of human translators in the age of AI 34:49 Evolution of English and its Germanic roots 38:44 Beowulf and Old English 39:43 Impact of the Norman occupation on English grammar 42:34 Identifying mushrooms with AI apps and safety precautions 45:08 Decoding ancient languages like Sumerian 49:43 The evolution of machine translation and multilingual models 53:01 Challenges with low-resource languages and inconsistent orthography 57:28 Transition from academia to industry in AI Join our Slack: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Large Hadron Collider and Mentorship – Anastasia Karavdina

Play Episode Listen Later Nov 22, 2024 54:13

We talked about: 00:00 DataTalks.Club intro 00:00 Large Hadron Collider and Mentorship 02:35 Career overview and transition from physics to data science 07:02 Working at the Large Hadron Collider 09:19 How particles collide and the role of detectors 11:03 Data analysis challenges in particle physics and data science similarities 13:32 Team structure at the Large Hadron Collider 20:05 Explaining the connection between particle physics and data science 23:21 Software engineering practices in particle physics 26:11 Challenges during interviews for data science roles 29:30 Mentoring and offering advice to job seekers 40:03 The STAR method and its value in interviews 50:32 Paid vs unpaid mentorship and finding the right fit About the speaker: Anastasia is a particle physicist turned data scientist, with experience in large-scale experiments like those at the Large Hadron Collider. She also worked at Blue Yonder, scaling AI-driven solutions for global supply chain giants, and at Kaufland e-commerce, focusing on NLP and search. Anastasia is a mentor for ML/AI, dedicated to helping her mentees achieve their goals. She is passionate about growing the next generation of data science elite in Germany: from Data Analysts up to ML Engineers. Join our Slack: https://datatalks.club/slack.html


MLOps as a Team - Raphaël Hoogvliets

Play Episode Listen Later Nov 8, 2024 55:36

We talked about: 00:00 DataTalks.Club intro 02:34 Career journey and transition into MLOps 08:41 Dutch agriculture and its challenges 10:36 The concept of "technical debt" in MLOps 13:37 Trade-offs in MLOps: moving fast vs. doing things right 14:05 Building teams and the role of coordination in MLOps 16:58 Key roles in an MLOps team: evangelists and tech translators 23:01 Role of the MLOps team in an organization 25:19 How MLOps teams assist product teams 27:56 Standardizing practices in MLOps 32:46 Getting feedback and creating buy-in from data scientists 36:55 The importance of addressing pain points in MLOps 39:06 Best practices and tools for standardizing MLOps processes 42:31 Value of data versioning and reproducibility 44:22 When to start thinking about data versioning 45:10 Importance of data science experience for MLOps 46:06 Skill mix needed in MLOps teams 47:33 Building a diverse MLOps team 48:18 Best practices for implementing MLOps in new teams 49:52 Starting with CI/CD in MLOps 51:21 Key components for a complete MLOps setup 53:08 Role of package registries in MLOps 54:12 Using Docker vs. packages in MLOps 57:56 Examples of MLOps success and failure stories 1:00:54 What MLOps is in simple terms 1:01:58 The complexity of achieving easy deployment, monitoring, and maintenance Join our Slack: https://datatalks.club/slack.html




