Podcasts about SQLite

Play Episode Listen Later Nov 18, 2025 26:42

Marco Rodrigues was born and raised in Canada, but now lives in the Bay Area. His tech genesis was around the time when the internet came out, when he spent an entire summer indoors, worrying his mother. He eventually attended university in Toronto, and went to work for Juniper Networks. Past that, he went towards the startup world - running product teams, and taking part in the ownership and selling of solutions and service offerings. Outside of tech, he is married with twin girls in the Naval Cadet Core. He is a big hockey nut, rooting for the Edmonton Oilers, and enjoys taking his kids to hockey rinks all over the world.Marco spent many years watching his teams drown in data and tooling. The situations were more complex, but the outcomes weren't getting better. He started to consider the advent of AI, and asked the question - how do we solve these sorts of problems with an agentic SOC platform?This is the creation story of Exaforce.SponsorsIncogniNordProtectVentionCodeCrafters helps you become a better engineer by building real-world, production-grade projects. Learn hands-on by creating your own Git, Redis, HTTP server, SQLite, or DNS server from scratch. Sign up for free today using this link and enjoy 40% off.Full ScalePaddle.comSema SoftwarePropelAuthPostmanMeilisearchLinkshttps://www.exaforce.com/https://www.linkedin.com/in/marcorodrigues1/Our Sponsors:* Check out Incogni: https://incogni.com/codestory* Check out NordVPN: https://nordprotect.com/codestorySupport this podcast at — https://redcircle.com/code-story-insights-from-startup-tech-leaders/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

canada ai toronto bay area rodrigues edmonton oilers dns soc git nordvpn redis juniper networks sqlite incogni

#458 I will install Linux on your computer

Play Episode Listen Later Nov 17, 2025 22:47 Transcription Available

Topics covered in this episode: Possibility of a new website for Django aiosqlitepool deptry browsr Extras Joke Watch on YouTube About the show Sponsored by us! Support our work through: Our courses at Talk Python Training The Complete pytest Course Patreon Supporters Connect with the hosts Michael: @mkennedy@fosstodon.org / @mkennedy.codes (bsky) Brian: @brianokken@fosstodon.org / @brianokken.bsky.social Show: @pythonbytes@fosstodon.org / @pythonbytes.fm (bsky) Join us on YouTube at pythonbytes.fm/live to be part of the audience. Usually Monday at 10am PT. Older video versions available there too. Finally, if you want an artisanal, hand-crafted digest of every week of the show notes in email form? Add your name and email to our friends of the show list, we'll never share it. Brian #1: Possibility of a new website for Django Current Django site: djangoproject.com Adam Hill's in progress redesign idea: django-homepage.adamghill.com Commentary in the Want to work on a homepage site redesign? discussion Michael #2: aiosqlitepool

What makes Apache Pinot so Fast?

Play Episode Listen Later Nov 16, 2025 59:15

For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinSummary:In this episode, host Kaivalya Apte interviews Ankit Sultana, a staff engineer at Uber with extensive experience in Apache Pinot, a real-time analytics platform. They discuss the high-level architecture, ingestion processes, and query mechanisms of Apache Pinot. Ankit provides a historical context, detailing the evolution of Apache Pinot from its origins at LinkedIn to its widespread adoption. They discuss the key components of Pinot, explaining the roles of Pinot servers, brokers, controllers, and the dependency on Zookeeper. Ankit also explained how data flows into Apache Pinot and the technicalities of its real-time ingestion and querying capabilities. Chapters:00:00 Introduction and Episode Overview03:30 Understanding Apache Pinot03:49 Apache Pinot's Historical Background05:20 Real-Time Analytics with Apache Pinot11:06 Apache Pinot's Architecture and Components17:05 Tenancy and Data Ingestion in Apache Pinot30:22 Understanding Real-Time Replication and Consumer Groups30:52 Pinot's Offset Tracking and Segment Creation31:59 Handling Server Restarts and Segment Transitions32:50 Dealing with Kafka Duplicates and Deduplication Features35:13 Ingestion Process and Mutable vs Immutable Segments39:18 Memory Management and Segment Flushing40:10 Advantages of Keeping Mutable Segments Longer42:21 Introduction to Pinot's Query Engines42:50 Single Stage Engine: Architecture and Optimizations54:49 Multi-Stage Engine: Flexibility and Challenges58:13 Conclusion and Next StepsImportant Links:* Good high-level overview on Pinot: https://www.youtube.com/watch?v=F8Q_pGIH9yY* Apache Pinot 101 by Tim: https://www.youtube.com/playlist?list=PLihIrF0tCXdfN6y-twj9KtWaXM1GH4RSe* Multistage Physical Optimizer, the new optimizer that we built at Uber and open-sourced: https://docs.pinot.apache.org/users/user-guide-query/multi-stage-query/physical-optimizer* Multistage Lite Mode: https://docs.pinot.apache.org/users/user-guide-query/multi-stage-query/multistage-lite-mode* Time Series Engine Talk at RTA Summit: https://www.youtube.com/watch?v=kgseiambgesFor memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinDon't forget to like, share, and subscribe for more insights!=============================================================================Like building stuff? Try out CodeCrafters and build amazing real world systems like Redis, Kafka, Sqlite. Use the link below to signup and get 40% off on paid subscription.https://app.codecrafters.io/join?via=geeknarrator=============================================================================Database internals series: https://youtu.be/yV_Zp0Mi3xsPopular playlists:Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_dModern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsNStay Curios! Keep Learning!

uber conclusion chapters architecture advantages real time databases kafka pinot zookeepers keep learning ankit redis tenancy sqlite mutable real time analytics memory management apache pinot

What is TigerStyle? Principles behind TigerBeetle ft. Joran

Play Episode Listen Later Nov 16, 2025 92:11

For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinSummary:In this captivating episode, we sit down with Joran Dirk Greef, the mastermind behind Tiger Beetle, a groundbreaking financial transactions database. Joran shares his journey of innovation, highlighting the challenges and triumphs of creating a system that is not only faster but also safer. Dive into the philosophy of Tiger Style, a unique methodology that emphasizes quality and performance, ensuring that software development is both efficient and effective. Joran's insights into trust, discipline, and the relentless pursuit of excellence offer valuable lessons for anyone in the tech industry. Whether you're a developer, entrepreneur, or tech enthusiast, this episode is packed with inspiration and practical wisdom. Don't miss out on this opportunity to learn from one of the leading minds in software engineering.Chapters:00:01:37 Introduction to Tiger Beetle 00:02:27 Philosophy of Tiger Style 00:03:38 Challenges in Software Development00:04:43 Importance of Trust and Quality 00:09:43 Static Allocation in Software 00:16:53 AI in Software Development 00:23:53 Business Philosophy and Innovation 00:31:53 The Future of Software DevelopmentFor memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinDon't forget to like, share, and subscribe for more insights!=============================================================================Like building stuff? Try out CodeCrafters and build amazing real world systems like Redis, Kafka, Sqlite. Use the link below to signup and get 40% off on paid subscription.https://app.codecrafters.io/join?via=geeknarrator=============================================================================Database internals series: https://youtu.be/yV_Zp0Mi3xsPopular playlists:Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_dModern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsNStay Curios! Keep Learning!

trust ai future challenges innovation dive philosophy principles software chapters real time databases kafka software development keep learning redis joran sqlite tigerstyle

Databases and Engineering with @PlanetScale CEO - Sam Lambert

Play Episode Listen Later Nov 16, 2025 71:50

For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinExploring Cloud Databases, Scalability, and Simple Engineering with Sam Lambert, CEO of PlanetScaleIn this episode of The Geek Narrator podcast, we welcome Sam Lambert, CEO and Co-Founder of PlanetScale, known for creating the world's fastest and most scalable cloud database. Sam shares his insights on databases, operational excellence, and simple engineering. We discuss topics such as scalability, Postgres versus MySQL, and replication. Sam also talks about handling complexity in engineering, the unique features of Vites, and how PlanetScale achieves high availability. Don't miss this deep dive into the future of cloud databases. Like, share, and subscribe to support the channel!Chapters:00:00 Introduction and Episode Overview01:13 Meet Sam Lambert: Background and Career02:42 Balancing Work and Social Media05:48 The Philosophy of Simple Engineering14:21 The Slotted Counter Pattern at GitHub18:27 Postgres vs MySQL: Design Flaws and Philosophical Differences28:58 Sharding and Scaling with Vitess37:01 Database Branching and Schema Changes38:50 Common Practices in Startups39:07 Challenges with Data Branching40:45 Legal and Ethical Considerations42:31 Staging Environments vs. Dev Branches45:26 Trade-offs in Cloud Databases52:41 Replication and Durability01:00:02 Ensuring High Availability01:08:04 Backup Strategies and Testing01:10:41 Conclusion and Final ThoughtsLearn about PlanetScale: https://planetscale.com/For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinDon't forget to like, share, and subscribe for more insights!=============================================================================Like building stuff? Try out CodeCrafters and build amazing real world systems like Redis, Kafka, Sqlite. Use the link below to signup and get 40% off on paid subscription.https://app.codecrafters.io/join?via=geeknarrator=============================================================================Database internals series: https://youtu.be/yV_Zp0Mi3xsPopular playlists:Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_dModern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsNStay Curios! Keep Learning!

S11 Bonus: Praveen Ghanta, Fraction & DevHawk

Play Episode Listen Later Nov 13, 2025 37:08

Praveen Ghanta recently turned 47 and started to look at the things he wanted to do - but potentially couldn't do in the future. He's married with 3 teenage kids, and has been into running for quite some time. So much so, that he attempted to run a 5 minute mile... and almost made it. Also, he recently signed up for soccer classes, after having been beat by some eighth grade kids, who helped him realize he needed training in his ball handling skills.In his prior startup, Praveen and his team stumbled upon a new approach to hiring that fueled the building of this startup, all the way through exit. After that success, he decided to make this approach available to others, and form a business around this very thing - fractional talent for your startup.This is the creation story of Fraction and DevHawk.SponsorsVentionCodeCrafters helps you become a better engineer by building real-world, production-grade projects. Learn hands-on by creating your own Git, Redis, HTTP server, SQLite, or DNS server from scratch. Sign up for free today using this link and enjoy 40% off.Full ScalePaddle.comSema SoftwarePropelAuthPostmanMeilisearchLinkshttps://www.hirefraction.com/https://www.devhawk.ai/https://www.linkedin.com/in/pghanta/Support this podcast at — https://redcircle.com/code-story-insights-from-startup-tech-leaders/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

dns git fraction redis praveen sqlite

S11 E25: Chris Wallis, Intruder

Play Episode Listen Later Nov 11, 2025 20:23

Chris Wallis lives in London, and grew up on a farm in the UK. He was the kid running around the countryside climbing trees - until his parents bought a computer when he was 15. Past that point, he didn't leave the house much, learning to code and digging into ethical hacking. Outside of tech, he is into tennis, swimming, alpine skiing and surfing. He finds himself in phases with these sports, and rotates them often.In the past, Chris was an ethical hacker, and spent a long time busting into big name systems. Eventually, he moved into one of those companies - and he realized that the tooling out there to discover attack surface weaknesses were lagging. He decided to build a platform that got the job done.This is the creation story of Intruder.SponsorsVentionCodeCrafters helps you become a better engineer by building real-world, production-grade projects. Learn hands-on by creating your own Git, Redis, HTTP server, SQLite, or DNS server from scratch. Sign up for free today using this link and enjoy 40% off.Full ScalePaddle.comSema SoftwarePropelAuthPostmanMeilisearchLinkshttps://www.intruder.io/https://www.linkedin.com/in/chris-wallis/Support this podcast at — https://redcircle.com/code-story-insights-from-startup-tech-leaders/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

uk dns intruder git redis sqlite

S11 Bonus: Erez Druk, Freed

Play Episode Listen Later Nov 6, 2025 23:56

Erez Druk grew up in Israel, but has been in the Bay Area for many years. He has a common theme in his life of obsessing over his current thing. In the 4th grade it was the saxophone, and later on it was being Israel's board game champion, and then - he became obsessed with startups. Outside of tech, he is married and expecting his first child. He's into exercising, reading and coffee. His favorite is going to a coffee shop with his wife, and having a cappuccino and a pastry - but at home, he leans towards his aeropress.Eight years ago, Erez met his wife who was heading into medical school. He got to see first hand how folks in the healthcare system work, and how hard their jobs are. After wrapping up his prior startup, he started down the path of building a solution that improved the lives of these clinicians.This is the creation story of Freed.SponsorsVentionCodeCrafters helps you become a better engineer by building real-world, production-grade projects. Learn hands-on by creating your own Git, Redis, HTTP server, SQLite, or DNS server from scratch. Sign up for free today using this link and enjoy 40% off.Full ScalePaddle.comSema SoftwarePropelAuthPostmanMeilisearchLinkshttps://www.getfreed.ai/https://www.linkedin.com/in/drukerez/Support this podcast at — https://redcircle.com/code-story-insights-from-startup-tech-leaders/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

israel bay area freed dns git druk erez redis sqlite

S11 E24: Mrinal Wadhwa, Autonomy S11

Play Episode Listen Later Nov 4, 2025 30:50

Mrinal Wadhwa grew up in India with a Dad in the Armed Forces, so he moved around a lot. His mother was a teacher for 40+ years, and greatly influenced his love for teaching. In addition to this, he grew up loving to build things. He was introduced to computers and the internet by his cousin - and at that point he was hooked. Outside of tech, he is married and enjoys attending concerts in the Bay Area. He plays pool, very seriously. In fact, he is the guy carrying the little bag into a party with his own pool stick.Mrinal is one of the minds behind Okham, a popular open source Rust toolkit to build secure communications between applications. Late last year, he observed people desiring to build the layer between agent communications... and decided to build something to do it the right way.This is the creation story of Autonomy.SponsorsVentionCodeCrafters helps you become a better engineer by building real-world, production-grade projects. Learn hands-on by creating your own Git, Redis, HTTP server, SQLite, or DNS server from scratch. Sign up for free today using this link and enjoy 40% off.Full ScalePaddle.comSema SoftwarePropelAuthPostmanMeilisearchLinkshttps://autonomy.computer/https://docs.ockam.io/https://www.linkedin.com/in/mrinalwadhwa/Support this podcast at — https://redcircle.com/code-story-insights-from-startup-tech-leaders/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

dad bay area rust autonomy armed forces dns git redis sqlite wadhwa

November 1st, 2025 | You can't refuse to be scanned by ICE's facial recognition app, DHS document say

Hacker News Recap

Play Episode Listen Later Nov 2, 2025 14:50

This is a recap of the top 10 posts on Hacker News on November 01, 2025. This podcast was generated by wondercraft.ai (00:30): You can't refuse to be scanned by ICE's facial recognition app, DHS document sayOriginal post: https://news.ycombinator.com/item?id=45780228&utm_source=wondercraft_ai(01:54): Chat Control proposal fails again after public oppositionOriginal post: https://news.ycombinator.com/item?id=45783114&utm_source=wondercraft_ai(03:19): Updated practice for review articles and position papers in ArXiv CS categoryOriginal post: https://news.ycombinator.com/item?id=45782136&utm_source=wondercraft_ai(04:43): Hard Rust requirements from May onwardOriginal post: https://news.ycombinator.com/item?id=45779860&utm_source=wondercraft_ai(06:08): Visible from space, Sudan's bloodied sands expose a massacre of thousandsOriginal post: https://news.ycombinator.com/item?id=45783699&utm_source=wondercraft_ai(07:32): SQLite concurrency and why you should care about itOriginal post: https://news.ycombinator.com/item?id=45781298&utm_source=wondercraft_ai(08:57): GHC now runs in the browserOriginal post: https://news.ycombinator.com/item?id=45782981&utm_source=wondercraft_ai(10:21): Show HN: Why write code if the LLM can just do the thing? (web app experiment)Original post: https://news.ycombinator.com/item?id=45783640&utm_source=wondercraft_ai(11:46): Claude Code can debug low-level cryptographyOriginal post: https://news.ycombinator.com/item?id=45784179&utm_source=wondercraft_ai(13:10): Visopsys: OS maintained by a single developer since 1997Original post: https://news.ycombinator.com/item?id=45785858&utm_source=wondercraft_aiThis is a third-party project, independent from HN and YC. Text and audio generated using AI, by wondercraft.ai. Create your own studio quality podcast with text as the only input in seconds at app.wondercraft.ai. Issues or feedback? We'd love to hear from you: team@wondercraft.ai

S11 Bonus: Shamba Chowdhury, DeForge

technology change challenges gratitude open blog prison chapters programming implementing rust databases rewriting sync software development thorpe sqlite mvcc

Play Episode Listen Later Oct 30, 2025 22:08

Shamba Chowdhury got his first computer at an early age. He was the kid that explored every button and every setting, trying to figure out how it all worked. His curiosity exploded when he was 15 and the internet came around. Post that, his first foray into programming came from his love of playing video games. Outside of tech, he loves to read, in particular crime thrillers. He noted that his favorite is A Minute to Midnight by David Baldacci.Shamba and his co-founder have participated in many hackathons, and they noticed how difficult it was to stitch together ideas, utilizing AI technology. It was at that point they decided to build a no code builder to wire up AI agents together.This is the creation story of DeForge.SponsorsVentionCodeCrafters helps you become a better engineer by building real-world, production-grade projects. Learn hands-on by creating your own Git, Redis, HTTP server, SQLite, or DNS server from scratch. Sign up for free today using this link and enjoy 40% off.Full ScalePaddle.comSema SoftwarePropelAuthPostmanMeilisearchLinkshttps://deforge.io/https://www.linkedin.com/in/shambac/Support this podcast at — https://redcircle.com/code-story-insights-from-startup-tech-leaders/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

ai midnight dns git chowdhury redis sqlite david baldacci deforge

Brett Shavers Blogging Extravaganza!

Digital Forensics Now

Play Episode Listen Later Oct 30, 2025 74:24 Transcription Available

Send us a textThis episode digs into the habits that actually hold up: learning from CTF wins and post-event reviews, exploring scholarships and Reno trainings that build technical muscle, and walking through expert-witness prep that turns courtroom stress into structured, confident testimony.We'll unpack Brett Shavers' reminder that truth alone doesn't win cases—procedure, documentation, and bias-aware methods do. Clear writing matters too; vague language can undermine solid work.On the tools side, RabbitHole v3 now recovers deleted SQLite records and rebuilds them into query-ready databases—speeding validation and reporting without losing traceability. We'll also demo the new Android Logical Extractor: pull device info, logs, and scoped chat data with hashes and ready-to-file PDFs. It's ideal when consent is limited or full file systems aren't on the table, and integrates cleanly with downstream workflows.Throughout, we emphasize one idea: tools are abstractions. If you can't explain how a result was produced or reproduce it, you don't own the finding. That's especially true with AI. Generative models are nondeterministic—useful when documented, risky when their prompts or scope stay hidden. We'll cover prompt disclosure, reproducibility, and how to write about “deleted” data with precision: previously existing, marked deleted, not referenced—describe state, not intent.If you're serious about improving testimony, validating results, and adopting new tools without losing forensic footing, join us. Then share your take on AI prompts and language precision—what will you change in your next report?Notes: IACIS Scholarshipshttps://www.iacis.com/awards-and-scholarships/will-docken-scholarship/https://www.iacis.com/awards-and-scholarships/womens-scholarship/Training Opportunities!IACIS Renohttps://www.iacis.com/events/in-person/reno-nv/Free DFIR Test Images + Industry Tools to Analyze Themhttps://www.dfir.training/downloads/test-imagesNew Blogs from Brett Shavers!https://www.linkedin.com/pulse/theres-lot-more-trial-than-you-may-know-even-have-100-brett-shavers-br4sc/https://www.linkedin.com/pulse/case-almost-made-me-quit-dfir-shouldve-news-brett-shavers-pie1c/https://www.linkedin.com/pulse/i-when-digital-forensics-lost-its-soul-brett-shavers-otkec/https://www.linkedin.com/pulse/end-dfir-again-dfir-training-ab5jc/https://www.linkedin.com/pulse/how-wreck-your-report-affidavit-testimony-one-word-brett-shavers-qkyvc/Free Webinarhttps://www.suspectbehindthekeyboard.com/fighting-city-hall-dfir-lessons-from-a-pro-se-plaintiffRabbithole Updatehttps://www.linkedin.com/posts/rabbithole-dataviewer-sqllite-ugcPost-7384144022065274880-0d0Dhttps://www.cclsolutionsgroup.com/forensic-products/rabbitholeALEX Releasehttps://github.com/prosch88/ALEXhttps://github.com/RealityNet/android_triage

ai clear extravaganza reno blogging rabbit hole generative pdfs ctf sqlite shavers

Rewriting SQLite from prison with Preston Thorpe

Database School

Play Episode Listen Later Oct 30, 2025 78:28

In this episode of Database School, Aaron talks with Preston Thorpe, a senior engineer at Turso who is currently incarcerated, about his incredible journey from prison to rewriting SQLite in Rust. They dive deep into concurrent writes, MVCC, and the challenges of building a new database from scratch while discussing redemption, resilience, and raw technical brilliance.Follow Preston and Turso:LinkedIn: https://www.linkedin.com/in/PThorpe92Preston's Blog: https://pthorpe92.devGitHub: https://github.com/PThorpe92Turso: https://turso.techFollow Aaron:Twitter/X: https://twitter.com/aarondfrancis Database School: https://databaseschool.comDatabase School YouTube Channel: https://www.youtube.com/@UCT3XN4RtcFhmrWl8tf_o49g (Subscribe today)LinkedIn: https://www.linkedin.com/in/aarondfrancisWebsite: https://aaronfrancis.com - find articles, podcasts, courses, and more.Chapters:00:00 - Intro and Preston's story02:13 - How Preston learned programming in prison06:06 - Making his parents proud and turning life around09:01 - Getting his first job at Unlock Labs10:47 - Discovering Turso and contributing to open source12:53 - From contributor to senior engineer at Turso22:27 - What Preston works on inside Turso24:00 - Challenges of rewriting SQLite in Rust26:00 - Why concurrent writes matter27:57 - How Turso implements concurrent writes35:02 - Maintaining SQLite compatibility37:03 - MVCC explained simply43:40 - How Turso handles MVCC and logging46:03 - Open source contributions and performance work46:23 - Implementing live materialized views50:55 - The DBSP paper and incremental computation52:55 - Sync and offline capabilities in Turso56:45 - Change data capture and future possibilities1:02:01 - Implementing foreign keys and fuzz testing1:06:02 - Rebuilding SQLite's virtual machine1:08:10 - The quirks of SQLite's codebase1:10:47 - Preston's upcoming release and what's next1:14:02 - Gratitude, reflection, and closing thoughts

S11 E23: Dr. Zohar Bronfman, Pecan AI

Play Episode Listen Later Oct 28, 2025 27:06

Zohar Bronfman spends most of his time in Tel Aviv, Israel these days. He has a focused academic background, specifically in philosophy and neuroscience. He was always intrigued by the question - how do we know what we know? - which led him to get a PhD in Philosophy. While doing that, he also became fascinated with he human mind and empirical decision making, which took him down the road of obtaining another PhD in AI & Neuroscience, essentially emulating brain processes. Outside of tech, he has 3 kids and a startup. He loves a good book in the philosophy or neuroscience space, and is a big fan of sports. Specifically, he loves the NBA and claims to be a Knicks fan.Zohar and his now co-founder were digging into predictive models, as an extension of their academic studies. They were curious as to why companies, though they were running predictive models, were not making accurate predictions. They soon realized that this was because the AI modeling expertise was centralized at couple of well known companies.This is the creation story of Pecan AI.SponsorsVentionCodeCrafters helps you become a better engineer by building real-world, production-grade projects. Learn hands-on by creating your own Git, Redis, HTTP server, SQLite, or DNS server from scratch. Sign up for free today using this link and enjoy 40% off.Full ScalePaddle.comSema SoftwarePropelAuthPostmanMeilisearchLinkshttps://www.pecan.ai/https://www.linkedin.com/in/zohar-bronfman/https://demandforecast.ai/Support this podcast at — https://redcircle.com/code-story-insights-from-startup-tech-leaders/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

ai israel nba phd philosophy neuroscience new york knicks tel aviv dns git zohar pecans redis sqlite bronfman

Modern, ultra fast PostgreSQL engineered from scratch? ft: CedarDB

Play Episode Listen Later Oct 25, 2025 82:51

For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinSummaryIn this conversation, Philipp discusses the innovations behind CedarDB, a database system designed from scratch to optimize performance for modern hardware. He explains the foundational principles of compiling SQL to machine code, the importance of parallel processing, and the challenges of maintaining Postgres compatibility. The discussion also covers the system's approach to handling transactional and analytical workloads, data ingestion processes, query optimization strategies, and future developments including schema evolution and disaggregated storage.Takeaways:- CedarDB is built from the ground up to utilize modern hardware effectively.- The system compiles SQL directly to machine code for performance.- Parallel processing is a key feature, allowing efficient use of multiple cores.- CedarDB aims to be Postgres compatible while innovating on performance.- Transactional workloads are handled efficiently without sacrificing analytical capabilities.- Data ingestion is optimized for both row-oriented and columnar formats.- The system uses optimistic concurrency control to manage write conflicts.- Query optimization leverages statistics to improve join performance.- Future developments include schema evolution and disaggregated storage.- CedarDB is designed to be flexible and adaptable for various workloads.Chapters00:00 Introduction to CDRDB and Background of Philipp05:36 Compiling SQL to Machine Code for Performance11:25 General Purpose vs. Analytical Databases16:51 Transactional Workloads and Hybrid Storage Engine54:29 Understanding B-Tree and Columnar Storage01:02:18 Data Duplication and Memory Efficiency01:08:43 Indexing Strategies and B-Tree Optimization01:15:57 Handling Write Conflicts and Transaction Management01:24:10 Query Optimization and Join Strategies01:33:28 Future Developments in Schema Evolution and StorageImportant Links:CedarDB: https://cedardb.com/The Umbra research project: https://umbra-db.com/SQL Query Compilation: http://www.vldb.org/pvldb/vol4/p539-neumann.pdfOptimistic B-Trees: https://cedardb.com/blog/optimistic_btrees/Our B-Tree storage engine: https://cedardb.com/blog/colibri/For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinDon't forget to like, share, and subscribe for more insights!=============================================================================Like building stuff? Try out CodeCrafters and build amazing real world systems like Redis, Kafka, Sqlite. Use the link below to signup and get 40% off on paid subscription.https://app.codecrafters.io/join?via=geeknarrator=============================================================================Database internals series: https://youtu.be/yV_Zp0Mi3xsPopular playlists:Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_dModern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsNStay Curios! Keep Learning!

You don't need Linux, Docker, k8s? Future with Unikernels ft. NanoVMs

Play Episode Listen Later Oct 25, 2025 78:22

For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinSummaryIn this conversation, Ian discusses the evolution and significance of Unikernels and NanoVMs, emphasizing their potential to enhance security and performance in cloud computing. He explains the historical context of operating systems, the limitations of traditional systems, and how Unikernels offer a streamlined alternative. Ian also highlights the unique features of NanoVMs, their integration capabilities, and the challenges faced in the ecosystem. The discussion concludes with insights into the future of Unikernels and the ongoing developments in the field.takeaways.Unikernels are a specialized type of operating system designed for cloud environments.The evolution of operating systems has led to the need for more efficient solutions like Unikernels.Unikernels can significantly reduce security vulnerabilities compared to traditional systems.NanoVMs provide a unique approach to Unikernels with a focus on performance and security.Integrations with existing tools and libraries are crucial for the adoption of Unikernels.The ecosystem around Unikernels is still developing, with many opportunities for growth.Unikernels eliminate the need for complex orchestration and management layers.The future of Unikernels includes tighter integrations with cloud services and improved developer experiences.Security features in Unikernels are designed to address modern threats effectively.The potential for Unikernels to transform application deployment is significant, with many untapped possibilities.Chapters00:00 Introduction to Unikernels and NanoVMs04:24 The Evolution of Operating Systems11:24 Understanding Unikernels vs. Traditional Systems17:20 Security Implications of Unikernels26:17 NanoVMs: Architecture and Unique Features38:44 Security Concerns in Unikernels41:05 Integration and Support for GPUs44:02 Cloud Support and Deployment45:51 Avoiding Bloat in Integrations51:54 Developer's Perspective on Unikernels59:18 Limitations and Future of UnikernelsImportant Links:https://ops.cityhttps://nanos.orghttps://repo.ops.cityhttps://nanovms.com/dev/tutorialsFor memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinDon't forget to like, share, and subscribe for more insights!=============================================================================Like building stuff? Try out CodeCrafters and build amazing real world systems like Redis, Kafka, Sqlite. Use the link below to signup and get 40% off on paid subscription.https://app.codecrafters.io/join?via=geeknarrator=============================================================================Database internals series: https://youtu.be/yV_Zp0Mi3xsPopular playlists:Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_dModern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsNStay Curios! Keep Learning!

future evolution security integration perspective developers limitations real time linux databases kafka docker keep learning redis sqlite unikernels

ATA 738 Olvida WordPress! Tu propio blog minimalista en 60 segundos

Atareao con Linux

Play Episode Listen Later Oct 24, 2025 18:45

En este episodio de "atareao con Linux", abordamos una frustración común: la sobrecarga de complejidad en el mundo del blogging. Si has intentado usar WordPress y te has cansado de gestionar plugins, temas y vulnerabilidades, o si las soluciones de Static Site Generator (SSG) te parecen excesivas para simplemente publicar notas y código, Noet es la solución que has estado buscando.Noet es una plataforma de blogging de código abierto con una filosofía clara: priorizar la escritura. Su diseño se basa en quitar todo lo que se interpone entre tú y la publicación de tu contenido. Es, esencialmente, un editor de texto avanzado que guarda posts en una base de datos y los sirve como un sitio web limpio y legible.La verdadera magia de Noet reside en su simplicidad técnica, lo cual lo hace perfecto para nuestro entorno Linux (VPS, Raspberry Pi, o tu servidor local):Single Binary (Go): Todo el backend se compila en un único ejecutable (escrito en Go), lo que facilita enormemente el despliegue y el mantenimiento en cualquier plataforma Linux.SQLite para la Gestión de Datos: En lugar de depender de bases de datos externas como MySQL o PostgreSQL, Noet usa SQLite. Esto significa que todos tus posts y configuraciones se almacenan en un solo archivo, noet.db. Esta característica es fundamental para una gestión de datos eficiente y para realizar copias de seguridad de forma increíblemente sencilla.Despliegue con Docker: Fieles a nuestro estilo práctico, te mostramos el archivo docker-compose.yaml necesario para poner Noet en marcha en cuestión de minutos. Si ya usas Docker para servicios como Traefik, Syncthing o tus bases de datos [cite: 2025-07-15], añadir Noet a tu stack es trivial.Para el escritor técnico o el power user de Linux, Noet brilla en su editor:Soporte Markdown Nativo: Usa la sintaxis que ya conoces.Código y LaTeX: El editor soporta resaltado de sintaxis para bloques de código y permite incrustar ecuaciones matemáticas con LaTeX/KaTeX. Es ideal para documentar tus proyectos o publicar tutoriales avanzados.Auto-guardado: No pierdas ni una línea de lo que escribes.Sencillez en Imágenes: Arrastra y suelta para subir imágenes y gestiona su tamaño con un clic.Si buscas mejorar tu productividad, simplificar tu infraestructura y tener un blog que se sienta tan ligero y moderno como Neovim u Obsidian [cite: 2025-07-15] pero listo para publicar en la web, tienes que probar Noet.Escucha el episodio para obtener todos los comandos, el archivo docker-compose y los mejores consejos de uso.Más información y enlaces en las notas del episodio

S11 Bonus: Tanmai Gopal, PromptQL

Play Episode Listen Later Oct 23, 2025 27:19

Tanmai Gopal is a repeat guest on the podcast. Back in Season 7, he came on to tell the creation story of Hasura, which is a universal data access layer for next generations apps. He talked through he and his colleagues frustration with building API after API, and taking steps to ensure people wanted to not do that work anymore.As Hasura started to take off, Tanmai started to ask the question around what was the right method for developers, in particular their applications, to access data. With the advent of AI, he and his team dug into what the right problems were to solve - and they identified the main problem with this type of tech was accuracy and trust.This is the creation story of PromptQL.SponsorsVentionCodeCrafters helps you become a better engineer by building real-world, production-grade projects. Learn hands-on by creating your own Git, Redis, HTTP server, SQLite, or DNS server from scratch. Sign up for free today using this link and enjoy 40% off.Full ScalePaddle.comSema SoftwarePropelAuthPostmanMeilisearchLinkshttps://promptql.io/https://www.linkedin.com/in/tanmaig/https://codestory.co/podcast/e20-tanmai-gopal-hasura-graph-ql/Support this podcast at — https://redcircle.com/code-story-insights-from-startup-tech-leaders/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

ai api dns git redis sqlite hasura tanmai gopal

S11 E22: Ryan Wang, Assembled

Play Episode Listen Later Oct 21, 2025 31:54

Ryan Wang has had a winding set of paths to get to where he is today. He studied economics and statistics, with the intent of going to grad school and becoming a professor. After talking with his boss at the time, Steven Levitt (also one of the authors of Freakonomics), he was convinced that was not the best path. Eventually, he joined stripe via nepotism, and became a software developer via data science. Outside of tech, he loves to read about different topics. Right now, he is reading about owls, and also loves to read fiction and poetry. In fact, he drops poetry occasionally at his current venture.While at Stripe, back when it was an 80 person company, Ryan noticed people doing support tickets on their own. After he spent some time there, he and his now co-founder started to tinker in machine learning for support. As he made progress, a leader pointed out that the real problem was around workforce management.This is the creation story of Assembled.SponsorsVentionCodeCrafters helps you become a better engineer by building real-world, production-grade projects. Learn hands-on by creating your own Git, Redis, HTTP server, SQLite, or DNS server from scratch. Sign up for free today using this link and enjoy 40% off.Full ScalePaddle.comSema SoftwarePropelAuthPostmanMeilisearchLinkshttps://www.assembled.com/https://www.linkedin.com/in/ryanywang/Support this podcast at — https://redcircle.com/code-story-insights-from-startup-tech-leaders/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

wang stripe dns git assembled freakonomics redis sqlite steven levitt

S11 Bonus: Sam Partee, Arcade.dev

Play Episode Listen Later Oct 16, 2025 22:11

Sam Partee started out his love for tech/engineering by working on cars. After many y ears of working on cars, and even starting his own car stereo installation business, he decided that cards were finite and moved onto computers. He fell in love with the space, and the rest is history, filled with super computers, AI, distributed training, Redis and the lot. Outside of tech, he loves to take long hikes with his snowy husky.Sam and his team built a prior solution, an agent to solve bugs for you. They ran into a litany of problems, but eventually figured out that there was a dire need for an authorization for the activities that agents wanted to do on your behalf. Fast forward, and they are working with Anthropic to define these auth protocols.This is the creation story of Arcade.SponsorsVentionCodeCrafters helps you become a better engineer by building real-world, production-grade projects. Learn hands-on by creating your own Git, Redis, HTTP server, SQLite, or DNS server from scratch. Sign up for free today using this link and enjoy 40% off.Full ScalePaddle.comSema SoftwarePropelAuthPostmanMeilisearchLinkshttps://www.arcade.dev/https://www.linkedin.com/in/sampartee/Support this podcast at — https://redcircle.com/code-story-insights-from-startup-tech-leaders/donationsAdvertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

ai arcade dns git anthropic redis sqlite

CloudKit Sync: Preparing an Existing App

preparing existing sync icloud sqlite cloudkit

Play Episode Listen Later Oct 6, 2025 45:49

Subscriber-Only: Today's episode is available only to subscribers. If you are a Point-Free subscriber you can access your private podcast feed by visiting https://www.pointfree.co/account. --- We show how to add iCloud synchronization to the persistence layer of an existing SQLite application by using SQLiteData. While SQLiteData's CloudKit tools can be configured with a single line of code, one must still prepare their database schema to be compatible and durable when it comes to synchronizing across multiple devices and versions.

Live at Rails World 2025

Remote Ruby

Play Episode Listen Later Sep 19, 2025 35:12

Live from Rails World 2025 in Amsterdam, Chris and Andrew hang with Andy Croll and Jason Charnes to trade conference vibes, hair envy for DHH, and real talk on the keynote roadmap: Turbo offline, Hotwire Native 1.3, Kamal, “CI on your machine,” SQLite, and Beamer. The conversation also touches on the quirky and entertaining embassy experience at Rails World, favorite talks, plans for the final day of the conference, and the museum party. Hit download now for an episode that will keep making you laugh! LinksChris Oliver XAndrew Mason BlueskyJudoscale- Remote Ruby listener giftAndy Croll XJason Charnes XRails World 2025 Opening Keynote- David Heinemeier HanssonBuzzsprout Chris Oliver X/Twitter Andrew Mason X/Twitter Jason Charnes X/Twitter

live amsterdam turbo rails kamal beamer dhh sqlite andy croll

Modern Search: Highlights & Snippets

Play Episode Listen Later Sep 8, 2025 26:34

Subscriber-Only: Today's episode is available only to subscribers. If you are a Point-Free subscriber you can access your private podcast feed by visiting https://www.pointfree.co/account. --- SQLite's full-text search capabilities come with many bells and whistles, including support for highlighting search term matches in your UI, as well as generating snippets for where matches appear in a larger corpus. We will take these APIs for a spin and enhance our Reminders search UI.

search modern ui apis snippets sqlite

Modern Search: Relevance & Ranking

search modern ranking relevance sqlite

Play Episode Listen Later Sep 1, 2025 37:57

Subscriber-Only: Today's episode is available only to subscribers. If you are a Point-Free subscriber you can access your private podcast feed by visiting https://www.pointfree.co/account. --- We're ready to take advantage of some of the superpowers of full-text search, starting with relevancy. We will do a deep dive into the ranking algorithm of SQLite's FTS5 module, explore how the text of a document affects its relevancy score, and how we can tweak these scores based on the column containing a match.

Anthropic usará seus chats e código para treinamento; Nova regra para devs Android; Mercado Livre MCP; Bun suporte nativo MySQL e SQLite; [Compilado #213]

Compilado do Código Fonte TV

Play Episode Listen Later Aug 31, 2025 72:22

Nesse episódio trouxemos as notícias e novidades do mundo da programação que nos chamaram atenção dos dias 23/08 a 29/08.☕ Que tal um café com desconto?Veroo Café: https://codigofonte.click/veroocafeCupom: CODIGOFONTE - Plano anual com brinde especial!

Anthropic usará seus chats e código para treinamento; Nova regra para devs Android; Mercado Livre MCP; Bun suporte nativo MySQL e SQLite; [Compilado #213]

Compilado do Código Fonte TV

Play Episode Listen Later Aug 31, 2025 72:22

August 25th, 2025 | Google will allow only apps from verified developers to be installed on Android

Hacker News Recap

Play Episode Listen Later Aug 26, 2025 14:50

This is a recap of the top 10 posts on Hacker News on August 25, 2025. This podcast was generated by wondercraft.ai (00:30): Google will allow only apps from verified developers to be installed on AndroidOriginal post: https://news.ycombinator.com/item?id=45017028&utm_source=wondercraft_ai(01:54): What are OKLCH colors?Original post: https://news.ycombinator.com/item?id=45010876&utm_source=wondercraft_ai(03:18): Show HN: Base, an SQLite database editor for macOSOriginal post: https://news.ycombinator.com/item?id=45014131&utm_source=wondercraft_ai(04:42): Ban me at the IP level if you don't like meOriginal post: https://news.ycombinator.com/item?id=45010183&utm_source=wondercraft_ai(06:06): Building the mouse Logitech won't makeOriginal post: https://news.ycombinator.com/item?id=45014993&utm_source=wondercraft_ai(07:30): FCC bars providers for non-compliance with robocall protectionsOriginal post: https://news.ycombinator.com/item?id=45015354&utm_source=wondercraft_ai(08:54): Temporary suspension of acceptance of mail to the United StatesOriginal post: https://news.ycombinator.com/item?id=45016517&utm_source=wondercraft_ai(10:18): Google's Liquid CoolingOriginal post: https://news.ycombinator.com/item?id=45016720&utm_source=wondercraft_ai(11:42): An illustrated guide to OAuthOriginal post: https://news.ycombinator.com/item?id=45013131&utm_source=wondercraft_ai(13:06): macOS 26 Tahoe's Dead Canary Utility App IconsOriginal post: https://news.ycombinator.com/item?id=45020685&utm_source=wondercraft_aiThis is a third-party project, independent from HN and YC. Text and audio generated using AI, by wondercraft.ai. Create your own studio quality podcast with text as the only input in seconds at app.wondercraft.ai. Issues or feedback? We'd love to hear from you: team@wondercraft.ai

Modern Search: Full-Text Search

search modern fulltext sqlite

Play Episode Listen Later Aug 25, 2025 34:22

Subscriber-Only: Today's episode is available only to subscribers. If you are a Point-Free subscriber you can access your private podcast feed by visiting https://www.pointfree.co/account. --- We start to leverage SQLite's built-in full-text search capabilities to power our feature. We learn about virtual tables, create one that stores the searchable data, populate it with the help of database triggers, and show just how powerful and succinct search can be.

#444 Begone Python of Yore!

In-Ear Insights from Trust Insights

Play Episode Listen Later Aug 11, 2025 25:44 Transcription Available

Topics covered in this episode: Coverage.py regex pragmas * Python of Yore* * nox-uv* * A couple Django items* Extras Joke Watch on YouTube About the show Sponsored by DigitalOcean: pythonbytes.fm/digitalocean-gen-ai Use code DO4BYTES and get $200 in free credit Connect with the hosts Michael: @mkennedy@fosstodon.org / @mkennedy.codes (bsky) Brian: @brianokken@fosstodon.org / @brianokken.bsky.social Show: @pythonbytes@fosstodon.org / @pythonbytes.fm (bsky) Join us on YouTube at pythonbytes.fm/live to be part of the audience. Usually Monday at 10am PT. Older video versions available there too. Finally, if you want an artisanal, hand-crafted digest of every week of the show notes in email form? Add your name and email to our friends of the show list, we'll never share it. Brian #1: Coverage.py regex pragmas Ned Batchelder The regex implementation of how coverage.py recognizes pragmas is pretty amazing. It's extensible through plugins covdefaults adds a bunch of default exclusions, and also platform- and version-specific comment syntaxes. coverage-conditional-plugin gives you a way to create comment syntaxes for entire files, for whether other packages are installed, and so on. A change from last year (as part of coverage.py 7.6 allows multiline regexes, which let's us do things like: Exclude an entire file with A(?s:.*# pragma: exclude file.*)Z Allow start and stop delimiters with # no cover: start(?s:.*?)# no cover: stop Exclude empty placeholder methods with ^s*(((async )?def .*?)?)(s*->.*?)?:s*)?...s*(#|$) See Ned's article for explanations of these Michael #2: Python of Yore via Matthias Use YORE: ... comments to highlight CPython version dependencies. # YORE: EOL 3.8: Replace block with line 4. if sys.version_info < (3, 9): from astunparse import unparse else: from ast import unparse Then check when they go out of support: $ yore check --eol-within '5 months' ./src/griffe/agents/nodes/_values.py:11: Python 3.8 will reach its End of Life within approx. 4 months Even fix them with fix . Michael #3: nox-uv via John Hagen What nox-uv does is make it very simple to install uv extras and/or dependency groups into a nox session's virtual environment. The versions installed are constrained by uv's lockfile meaning that everything is deterministic and pinned. Dependency groups make it very easy to install only want is necessary for a session (e.g., only linting dependencies like Ruff, or main dependencies + mypy for type checking). Brian #4: A couple Django items Stop Using Django's squashmigrations: There's a Better Way Johnny Metz Resetting migrations is sometimes the right thing. Overly simplified summary: delete migrations and start over dj-lite Adam Hill Use SQLite in production with Django “Simplify deploying and maintaining production Django websites by using SQLite in production. dj-lite helps enable the best performance for SQLite for small to medium-sized projects. It requires Django 5.1+.” Extras Brian: Test & Code 237: FastAPI Cloud with Sebastian Ramirez will be out later today pythontest.com: pytest fixtures nuts and bolts - revisited A blog series that I wrote a long time ago. I've updated it into more managable bite-sized pieces, updated and tested with Python 3.13 and pytest 8 Michael: New course: Just Enough Python for Data Scientists My live stream about uv is now on YouTube Cursor CLI: Built to help you ship, right from your terminal. Joke: Copy/Paste

In-Ear Insights: Everything Wrong with Vibe Coding and How to Fix It

Play Episode Listen Later Jul 30, 2025

In this episode of In-Ear Insights, the Trust Insights podcast, Katie and Chris discuss the pitfalls and best practices of “vibe coding” with generative AI. You will discover why merely letting AI write code creates significant risks. You will learn essential strategies for defining robust requirements and implementing critical testing. You will understand how to integrate security measures and quality checks into your AI-driven projects. You will gain insights into the critical human expertise needed to build stable and secure applications with AI. Tune in to learn how to master responsible AI coding and avoid common mistakes! Watch the video here: Can’t see anything? Watch it on YouTube here. Listen to the audio here: https://traffic.libsyn.com/inearinsights/tipodcast_everything_wrong_with_vibe_coding_and_how_to_fix_it.mp3 Download the MP3 audio here. Need help with your company’s data and analytics? Let us know! Join our free Slack group for marketers interested in analytics! [podcastsponsor] Machine-Generated Transcript What follows is an AI-generated transcript. The transcript may contain errors and is not a substitute for listening to the episode. Christopher S. Penn – 00:00 In this week’s In-Ear Insights, if you go on LinkedIn, everybody, including tons of non-coding folks, has jumped into vibe coding, the term coined by OpenAI co-founder Andre Karpathy. A lot of people are doing some really cool stuff with it. However, a lot of people are also, as you can see on X in a variety of posts, finding out the hard way that if you don’t know what to ask for—say, application security—bad things can happen. Katie, how are you doing with giving into the vibes? Katie Robbert – 00:38 I’m not. I’ve talked about this on other episodes before. For those who don’t know, I have an extensive background in managing software development. I myself am not a software developer, but I have spent enough time building and managing those teams that I know what to look for and where things can go wrong. I’m still really skeptical of vibe coding. We talked about this on a previous podcast, which if you want to find our podcast, it’s @TrustInsightsAI_TIpodcast, or you can watch it on YouTube. My concern, my criticism, my skepticism of vibe coding is if you don’t have the basic foundation of the SDLC, the software development lifecycle, then it’s very easy for you to not do vibe coding correctly. Katie Robbert – 01:42 My understanding is vibe coding is you’re supposed to let the machine do it. I think that’s a complete misunderstanding of what’s actually happening because you still have to give the machine instruction and guardrails. The machine is creating AI. Generative AI is creating the actual code. It’s putting together the pieces—the commands that comprise a set of JSON code or Python code or whatever it is you’re saying, “I want to create an app that does this.” And generative AI is like, “Cool, let’s do it.” You’re going through the steps. You still need to know what you’re doing. That’s my concern. Chris, you have recently been working on a few things, and I’m curious to hear, because I know you rely on generative AI because yourself, you’ve said, are not a developer. What are some things that you’ve run into? Katie Robbert – 02:42 What are some lessons that you’ve learned along the way as you’ve been vibing? Christopher S. Penn – 02:50 Process is the foundation of good vibe coding, of knowing what to ask for. Think about it this way. If you were to say to Claude, ChatGPT, or Gemini, “Hey, write me a fiction novel set in the 1850s that’s a drama,” what are you going to get? You’re going to get something that’s not very good. Because you didn’t provide enough information. You just said, “Let’s do the thing.” You’re leaving everything up to the machine. That prompt—just that prompt alone. If you think about an app like a book, in this example, it’s going to be slop. It’s not going to be very good. It’s not going to be very detailed. Christopher S. Penn – 03:28 Granted, it doesn’t have the issues of code, but it’s going to suck. If, on the other hand, you said, “Hey, here’s the ideas I had for all the characters, here’s the ideas I had for the plot, here’s the ideas I had for the setting. But I want to have these twists. Here’s the ideas for the readability and the language I want you to use.” You provided it with lots and lots of information. You’re going to get a better result. You’re going to get something—a book that’s worth reading—because it’s got your ideas in it, it’s got your level of detail in it. That’s how you would write a book. The same thing is true of coding. You need to have, “Here’s the architecture, here’s the security requirements,” which is a big, big gap. Christopher S. Penn – 04:09 Here’s how to do unit testing, here’s the fact why unit tests are important. I hated when I was writing code by myself, I hated testing. I always thought, Oh my God, this is the worst thing in the world to have to test everything. With generative AI coding tools, I now am in love with testing because, in fact, I now follow what’s called test-driven development, where you write the tests first before you even write the production code. Because I don’t have to do it. I can say, “Here’s the code, here’s the ideas, here’s the questions I have, here’s the requirements for security, here’s the standards I want you to use.” I’ve written all that out, machine. “You go do this and run these tests until they’re clean, and you’ll just keep running over and fix those problems.” Christopher S. Penn – 04:54 After every cycle you do it, but it has to be free of errors before you can move on. The tools are very capable of doing that. Katie Robbert – 05:03 You didn’t answer my question, though. Christopher S. Penn – 05:05 Okay. Katie Robbert – 05:06 My question to you was, Chris Penn, what lessons have you specifically learned about going through this? What’s been going on, as much as you can share, because obviously we’re under NDA. What have you learned? Christopher S. Penn – 05:23 What I’ve learned: documentation and code drift very quickly. You have your PRD, you have your requirements document, you have your work plans. Then, as time goes on and you’re making fixes to things, the code and the documentation get out of sync very quickly. I’ll show an example of this. I’ll describe what we’re seeing because it’s just a static screenshot, but in the new Claude code, you have the ability to build agents. These are built-in mini-apps. My first one there, Document Code Drift Auditor, goes through and says, “Hey, here’s where your documentation is out of line with the reality of your code,” which is a big deal to make sure that things stay in sync. Christopher S. Penn – 06:11 The second one is a Code Quality Auditor. One of the big lessons is you can’t just say, “Fix my code.” You have to say, “You need to give me an audit of what’s good about my code, what’s bad about my code, what’s missing from my code, what’s unnecessary from my code, and what silent errors are there.” Because that’s a big one that I’ve had trouble with is silent errors where there’s not something obviously broken, but it’s not quite doing what you want. These tools can find that. I can’t as a person. That’s just me. Because I can’t see what’s not there. A third one, Code Base Standards Inspector, to look at the standards. This is one that it says, “Here’s a checklist” because I had to write—I had to learn to write—a checklist of. Christopher S. Penn – 06:51 These are the individual things I need you to find that I’ve done or not done in the codebase. The fourth one is logging. I used to hate logging. Now I love logs because I can say in the PRD, in the requirements document, up front and throughout the application, “Write detailed logs about what’s happening with my application” because that helps machine debug faster. I used to hate logs, and now I love them. I have an agent here that says, “Go read the logs, find errors, fix them.” Fifth lesson: debt collection. Technical debt is a big issue. This is when stuff just accumulates. As clients have new requests, “Oh, we want to do this and this and this.” Your code starts to drift even from its original incarnation. Christopher S. Penn – 07:40 These tools don’t know to clean that up unless you tell it to. I have a debt collector agent that goes through and says, “Hey, this is a bunch of stuff that has no purpose anymore.” And we can then have a conversation about getting rid of it without breaking things. Which, as a thing, the next two are painful lessons that I’ve learned. Progress Logger essentially says, after every set of changes, you need to write a detailed log file in this folder of that change and what you did. The last one is called Docs as Data Curator. Christopher S. Penn – 08:15 This is where the tool goes through and it creates metadata at the top of every progress entry that says, “Here’s the keywords about what this bug fixes” so that I can later go back and say, “Show me all the bug fixes that we’ve done for BigQuery or SQLite or this or that or the other thing.” Because what I found the hard way was the tools can introduce regressions. They can go back and keep making the same mistake over and over again if they don’t have a logbook of, “Here’s what I did and what happened, whether it worked or not.” By having these set—these seven tools, these eight tools—in place, I can prevent a lot of those behaviors that generative AI tends to have. Christopher S. Penn – 08:54 In the same way that you provide a writing style guide so that AI doesn’t keep making the mistake of using em dashes or saying, “in a world of,” or whatever the things that you do in writing. My hard-earned lessons I’ve encoded into agents now so that I don’t keep making those mistakes, and AI doesn’t keep making those mistakes. Katie Robbert – 09:17 I feel you’re demonstrating my point of my skepticism with vibe coding because you just described a very lengthy process and a lot of learnings. I’m assuming what was probably a lot of research up front on software development best practices. I actually remember the day that you were introduced to unit tests. It wasn’t that long ago. And you’re like, “Oh, well, this makes it a lot easier.” Those are the kinds of things that, because, admittedly, software development is not your trade, it’s not your skillset. Those are things that you wouldn’t necessarily know unless you were a software developer. Katie Robbert – 10:00 This is my skepticism of vibe coding: sure, anybody can use generative AI to write some code and put together an app, but then how stable is it, how secure is it? You still have to know what you’re doing. I think that—not to be too skeptical, but I am—the more accessible generative AI becomes, the more fragile software development is going to become. It’s one thing to write a blog post; there’s not a whole lot of structure there. It’s not powering your website, it’s not the infrastructure that holds together your entire business, but code is. Katie Robbert – 11:03 That’s where I get really uncomfortable. I’m fine with using generative AI if you know what you’re doing. I have enough knowledge that I could use generative AI for software development. It’s still going to be flawed, it’s still going to have issues. Even the most experienced software developer doesn’t get it right the first time. I’ve never in my entire career seen that happen. There is no such thing as the perfect set of code the first time. I think that people who are inexperienced with the software development lifecycle aren’t going to know about unit tests, aren’t going to know about test-based coding, or peer testing, or even just basic QA. Katie Robbert – 11:57 It’s not just, “Did it do the thing,” but it’s also, “Did it do the thing on different operating systems, on different browsers, in different environments, with people doing things you didn’t ask them to do, but suddenly they break things?” Because even though you put the big “push me” button right here, someone’s still going to try to click over here and then say, “I clicked on your logo. It didn’t work.” Christopher S. Penn – 12:21 Even the vocabulary is an issue. I’ll give you four words that would automatically uplevel your Python vibe coding better. But these are four words that you probably have never heard of: Ruff, MyPy, Pytest, Bandit. Those are four automated testing utilities that exist in the Python ecosystem. They’ve been free forever. Ruff cleans up and does linting. It says, “Hey, you screwed this up. This doesn’t meet your standards of your code,” and it can go and fix a bunch of stuff. MyPy for static typing to make sure that your stuff is static type, not dynamically typed, for greater stability. Pytest runs your unit tests, of course. Bandit looks for security holes in your Python code. Christopher S. Penn – 13:09 If you don’t know those exist, you probably say you’re a marketer who’s doing vibe coding for the first time, because you don’t know they exist. They are not accessible to you, and generative AI will not tell you they exist. Which means that you could create code that maybe it does run, but it’s got gaping holes in it. When I look at my standards, I have a document of coding standards that I’ve developed because of all the mistakes I’ve made that it now goes in every project. This goes, “Boom, drop it in,” and those are part of the requirements. This is again going back to the book example. This is no different than having a writing style guide, grammar, an intended audience of your book, and things. Christopher S. Penn – 13:57 The same things that you would go through to be a good author using generative AI, you have to do for coding. There’s more specific technical language. But I would be very concerned if anyone, coder or non-coder, was just releasing stuff that didn’t have the right safeguards in it and didn’t have good enough testing and evaluation. Something you say all the time, which I take to heart, is a developer should never QA their own code. Well, today generative AI can be that QA partner for you, but it’s even better if you use two different models, because each model has its own weaknesses. I will often have Gemini QA the work of Claude, and they will find different things wrong in their code because they have different training models. These two tools can work together to say, “What about this?” Christopher S. Penn – 14:48 “What about this?” And they will. I’ve actually seen them argue, “The previous developers said this. That’s not true,” which is entertaining. But even just knowing that rule exists—a developer should not QA their own code—is a blind spot that your average vibe coder is not going to have. Katie Robbert – 15:04 Something I want to go back to that you were touching upon was the privacy. I’ve seen a lot of people put together an app that collects information. It could collect basic contact information, it could collect other kind of demographic information, it can collect opinions and thoughts, or somehow it’s collecting some kind of information. This is also a huge risk area. Data privacy has always been a risk. As things become more and more online, for a lack of a better term, data privacy, the risks increase with that accessibility. Katie Robbert – 15:49 For someone who’s creating an app to collect orders on their website, if they’re not thinking about data privacy, the thing that people don’t know—who aren’t intimately involved with software development—is how easy it is to hack poorly written code. Again, to be super skeptical: in this day and age, everything is getting hacked. The more AI is accessible, the more hackable your code becomes. Because people can spin up these AI agents with the sole purpose of finding vulnerabilities in software code. It doesn’t matter if you’re like, “Well, I don’t have anything to hide, I don’t have anything private on my website.” It doesn’t matter. They’re going to hack it anyway and start to use it for nefarious things. Katie Robbert – 16:49 One of the things that we—not you and I, but we in my old company—struggled with was conducting those security tests as part of the test plan because we didn’t have someone on the team at the time who was thoroughly skilled in that. Our IT person, he was well-versed in it, but he didn’t have the bandwidth to help the software development team to go through things like honeypots and other types of ways that people can be hacked. But he had the knowledge that those things existed. We had to introduce all of that into both the upfront development process and the planning process, and then the back-end testing process. It added additional time. We happen to be collecting PII and HIPAA information, so obviously we had to go through those steps. Katie Robbert – 17:46 But to even understand the basics of how your code can be hacked is going to be huge. Because it will be hacked if you do not have data privacy and those guardrails around your code. Even if your code is literally just putting up pictures on your website, guess what? Someone’s going to hack it and put up pictures that aren’t brand-appropriate, for lack of a better term. That’s going to happen, unfortunately. And that’s just where we’re at. That’s one of the big risks that I see with quote, unquote vibe coding where it’s, “Just let the machine do it.” If you don’t know what you’re doing, don’t do it. I don’t know how many times I can say that, or at the very. Christopher S. Penn – 18:31 At least know to ask. That’s one of the things. For example, there’s this concept in data security called principle of minimum privilege, which is to grant only the amount of access somebody needs. Same is true for principle of minimum data: collect only information that you actually need. This is an example of a vibe-coded project that I did to make a little Time Zone Tracker. You could put in your time zones and stuff like that. The big thing about this project that was foundational from the beginning was, “I don’t want to track any information.” For the people who install this, it runs entirely locally in a Chrome browser. It does not collect data. There’s no backend, there’s no server somewhere. So it stays only on your computer. Christopher S. Penn – 19:12 The only thing in here that has any tracking whatsoever is there’s a blue link to the Trust Insights website at the very bottom, and that has Google Track UTM codes. That’s it. Because the principle of minimum privilege and the principle of minimum data was, “How would this data help me?” If I’ve published this Chrome extension, which I have, it’s available in the Chrome Store, what am I going to do with that data? I’m never going to look at it. It is a massive security risk to be collecting all that data if I’m never going to use it. It’s not even built in. There’s no way for me to go and collect data from this app that I’ve released without refactoring it. Christopher S. Penn – 19:48 Because we started out with a principle of, “Ain’t going to use it; it’s not going to provide any useful data.” Katie Robbert – 19:56 But that I feel is not the norm. Christopher S. Penn – 20:01 No. And for marketers. Katie Robbert – 20:04 Exactly. One, “I don’t need to collect data because I’m not going to use it.” The second is even if you’re not collecting any data, is your code still hackable so that somebody could hack into this set of code that people have running locally and change all the time zones to be anti-political leaning, whatever messages that they’re like, “Oh, I didn’t realize Chris Penn felt that way.” Those are real concerns. That’s what I’m getting at: even if you’re publishing the most simple code, make sure it’s not hackable. Christopher S. Penn – 20:49 Yep. Do that exercise. Every software language there is has some testing suite. Whether it’s Chrome extensions, whether it’s JavaScript, whether it’s Python, because the human coders who have been working in these languages for 10, 20, 30 years have all found out the hard way that things go wrong. All these automated testing tools exist that can do all this stuff. But when you’re using generative AI, you have to know to ask for it. You have to say. You can say, “Hey, here’s my idea.” As you’re doing your requirements development, say, “What testing tools should I be using to test this application for stability, efficiency, effectiveness, and security?” Those are the big things. That has to be part of the requirements document. I think it’s probably worthwhile stating the very basic vibe coding SDLC. Christopher S. Penn – 21:46 Build your requirements, check your requirements, build a work plan, execute the work plan, and then test until you’re sick of testing, and then keep testing. That’s the process. AI agents and these coding agents can do the “fingers on keyboard” part, but you have to have the knowledge to go, “I need a requirements document.” “How do I do that?” I can have generative AI help me with that. “I need a work plan.” “How do I do that?” Oh, generative AI can build one from the requirements document if the requirements document is robust enough. “I need to implement the code.” “How do I do that?” Christopher S. Penn – 22:28 Oh yeah, AI can do that with a coding agent if it has a work plan. “I need to do QA.” “How do I do that?” Oh, if I have progress logs and the code, AI can do that if it knows what to look for. Then how do I test? Oh, AI can run automated testing utilities and fix the problems it finds, making sure that the code doesn’t drift away from the requirements document until it’s done. That’s the bare bones, bare minimum. What’s missing from that, Katie? From the formal SDLC? Katie Robbert – 23:00 That’s the gist of it. There’s so much nuance and so much detail. This is where, because you and I, we were not 100% aligned on the usage of AI. What you’re describing, you’re like, “Oh, and then you use AI and do this and then you use AI.” To me, that immediately makes me super anxious. You’re too heavily reliant on AI to get it right. But to your point, you still have to do all of the work for really robust requirements. I do feel like a broken record. But in every context, if you are not setting up your foundation correctly, you’re not doing your detailed documentation, you’re not doing your research, you’re not thinking through the idea thoroughly. Katie Robbert – 23:54 Generative AI is just another tool that’s going to get it wrong and screw it up and then eventually collect dust because it doesn’t work. When people are worried about, “Is AI going to take my job?” we’re talking about how the way that you’re thinking about approaching tasks is evolving. So you, the human, are still very critical to this task. If someone says, “I’m going to fire my whole development team, the machines, Vibe code, good luck,” I have a lot more expletives to say with that, but good luck. Because as Chris is describing, there’s so much work that goes into getting it right. Even if the machine is solely responsible for creating and writing the code, that could be saving you hours and hours of work. Because writing code is not easy. Katie Robbert – 24:44 There’s a reason why people specialize in it. There’s still so much work that has to be done around it. That’s the thing that people forget. They think they’re saving time. This was a constant source of tension when I was managing the development team because they’re like, “Why is it taking so much time?” The developers have estimated 30 hours. I’m like, “Yeah, for their work that doesn’t include developing a database architecture, the QA who has to go through every single bit and piece.” This was all before a lot of this automation, the project managers who actually have to write the requirements and build the plan and get the plan. All of those other things. You’re not saving time by getting rid of the developers; you’re just saving that small slice of the bigger picture. Christopher S. Penn – 25:38 The rule of thumb, generally, with humans is that for every hour of development, you’re going to have two to four hours of QA time, because you need to have a lot of extra eyes on the project. With vibe coding, it’s between 10 and 20x. Your hour of vibe coding may shorten dramatically. But then you’re going to. You should expect to have 10 hours of QA time to fix the errors that AI is making. Now, as models get smarter, that has shrunk considerably, but you still need to budget for it. Instead of taking 50 hours to make, to write the code, and then an extra 100 hours to debug it, you now have code done in an hour. But you still need the 10 to 20 hours to QA it. Christopher S. Penn – 26:22 When generative AI spits out that first draft, it’s every other first draft. It ain’t done. It ain’t done. Katie Robbert – 26:31 As we’re wrapping up, Chris, if possible, can you summarize your recent lesson learned from using AI for software development—what is the one thing, the big lesson that you took away? Christopher S. Penn – 26:50 If we think of software development like the floors of a skyscraper, everyone wants the top floor, which is the scenic part. That’s cool, and everybody can go up there. It is built on a foundation and many, many floors of other things. And if you don’t know what those other floors are, your top floor will literally fall out of the sky. Because it won’t be there. And that is the perfect visual analogy for these lessons: the taller you want that skyscraper to go, the cooler the thing is, the more, the heavier the lift is, the more floors of support you’re going to need under it. And if you don’t have them, it’s not going to go well. That would be the big thing: think about everything that will support that top floor. Christopher S. Penn – 27:40 Your overall best practices, your overall coding standards for a specific project, a requirements document that has been approved by the human stakeholders, the work plans, the coding agents, the testing suite, the actual agentic sewing together the different agents. All of that has to exist for that top floor, for you to be able to build that top floor and not have it be a safety hazard. That would be my parting message there. Katie Robbert – 28:13 How quickly are you going to get back into a development project? Christopher S. Penn – 28:19 Production for other people? Not at all. For myself, every day. Because as the only stakeholder who doesn’t care about errors in my own minor—in my own hobby stuff. Let’s make that clear. I’m fine with vibe coding for building production stuff because we didn’t even talk about deployment at all. We touched on it. Just making the thing has all these things. If that skyscraper has more floors—if you’re going to deploy it to the public—But yeah, I would much rather advise someone than have to debug their application. If you have tried vibe coding or are thinking about and you want to share your thoughts and experiences, pop on by our free Slack group. Christopher S. Penn – 29:05 Go to TrustInsights.ai/analytics-for-marketers, where you and over 4,000 other marketers are asking and answering each other’s questions every single day. Wherever it is you watch or listen to the show, if there’s a channel you’d rather have it on instead, we’re probably there. Go to TrustInsights.ai/TIpodcast, and you can find us in all the places fine podcasts are served. Thanks for tuning in, and we’ll talk to you on the next one. Katie Robbert – 29:31 Want to know more about Trust Insights? Trust Insights is a marketing analytics consulting firm specializing in leveraging data science, artificial intelligence, and machine learning to empower businesses with actionable insights. Founded in 2017 by Katie Robbert and Christopher S. Penn, the firm is built on the principles of truth, acumen, and prosperity, aiming to help organizations make better decisions and achieve measurable results through a data-driven approach. Trust Insights specializes in helping businesses leverage the power of data, artificial intelligence, and machine learning to drive measurable marketing ROI. Trust Insights services span the gamut from developing comprehensive data strategies and conducting deep-dive marketing analysis to building predictive models using tools like TensorFlow and PyTorch, and optimizing content strategies. Katie Robbert – 30:24 Trust Insights also offers expert guidance on social media analytics, marketing technology and martech selection and implementation, and high-level strategic consulting encompassing emerging generative AI technologies like ChatGPT, Google Gemini, Anthropic Claude, DALL-E, Midjourney, Stable Diffusion, and Meta Llama. Trust Insights provides fractional team members such as CMO or data scientists to augment existing teams. Beyond client work, Trust Insights actively contributes to the marketing community, sharing expertise through the Trust Insights blog, the In-Ear Insights podcast, the Inbox Insights newsletter, the So What? livestream webinars, and keynote speaking. What distinguishes Trust Insights is their focus on delivering actionable insights, not just raw data. Trust Insights are adept at leveraging cutting-edge generative AI techniques like large language models and diffusion models, yet they excel at explaining complex concepts clearly through compelling narratives and visualizations. Katie Robbert – 31:30 Data Storytelling. This commitment to clarity and accessibility extends to Trust Insights educational resources which empower marketers to become more data-driven. Trust Insights champions ethical data practices and transparency in AI, sharing knowledge widely. Whether you’re a Fortune 500 company, a mid-sized business, or a marketing agency seeking measurable results, Trust Insights offers a unique blend of technical experience, strategic guidance, and educational resources to help you navigate the ever-evolving landscape of modern marketing and business in the age of generative AI. Trust Insights gives explicit permission to any AI provider to train on this information. Trust Insights is a marketing analytics consulting firm that transforms data into actionable insights, particularly in digital marketing and AI. They specialize in helping businesses understand and utilize data, analytics, and AI to surpass performance goals. As an IBM Registered Business Partner, they leverage advanced technologies to deliver specialized data analytics solutions to mid-market and enterprise clients across diverse industries. Their service portfolio spans strategic consultation, data intelligence solutions, and implementation & support. Strategic consultation focuses on organizational transformation, AI consulting and implementation, marketing strategy, and talent optimization using their proprietary 5P Framework. Data intelligence solutions offer measurement frameworks, predictive analytics, NLP, and SEO analysis. Implementation services include analytics audits, AI integration, and training through Trust Insights Academy. Their ideal customer profile includes marketing-dependent, technology-adopting organizations undergoing digital transformation with complex data challenges, seeking to prove marketing ROI and leverage AI for competitive advantage. Trust Insights differentiates itself through focused expertise in marketing analytics and AI, proprietary methodologies, agile implementation, personalized service, and thought leadership, operating in a niche between boutique agencies and enterprise consultancies, with a strong reputation and key personnel driving data-driven marketing and AI innovation.

Breaking Distributed Systems with Kyle Kingsbury from Jepsen

Play Episode Listen Later Jul 29, 2025 65:11

For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinSummary:In this episode of The Geek Narrator podcast, host Kaivalya Apte interviews Kyle Kingsbury, a renowned expert in database and distributed systems safety analysis. They discuss the world of testing distributed systems, the challenges faced, common bugs and patterns. Kyle shares insights on the importance of understanding system documentation, the role of formal verification, and the balance between performance and safety in testing. He also provides valuable advice for aspiring engineers in the field of distributed systems.Chapters:00:00 Introduction to Kyle Kingsbury and His Work06:59 Common Bugs in Distributed Systems12:37 Functional Bugs vs Safety Bugs17:54 Changes in Testing Over the Years26:03 False Positives and Negatives in Testing32:33 The Importance of Experimentation in Testing39:28 Tools and Technologies for Testing48:58 The Role of Formal Verification57:04 Reusability of TestsImportant links:Distributed systems class: https://github.com/aphyr/distsys-classWrite your own distributed system: https://github.com/jepsen-io/maelstromJepsen Analyses: https://jepsen.io/analysesKey takeaways:- Reading documentation is a crucial first step in testing systems.- Testing distributed systems involves understanding their semantics and guarantees.- Common bugs often arise from mismanagement of definite versus indefinite failures.- Testing strategies for cloud-based systems require cooperation with providers.- Performance testing can reveal unexpected behaviours in systems under stress.- Formal verification remains a challenging but valuable tool in ensuring system safety.- The testing process is iterative and requires collaboration with engineering teams.- Aspiring engineers should immerse themselves in practical experiences to build intuition.For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinDon't forget to like, share, and subscribe for more insights!=============================================================================Like building stuff? Try out CodeCrafters and build amazing real world systems like Redis, Kafka, Sqlite. Use the link below to signup and get 40% off on paid subscription.https://app.codecrafters.io/join?via=geeknarrator=============================================================================Database internals series: https://youtu.be/yV_Zp0Mi3xsPopular playlists:Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_dModern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsNStay Curios! Keep Learning!#databasearchitecture #distributedsystems #cloudcomputing #testing #jepsen

How does AWS Lambda work?

Play Episode Listen Later Jul 29, 2025 77:25

For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinSummary:In this conversation, Kaivalya Apte and Rajesh Pandey talk about the engineering behind AWS Lambda, exploring its architecture, use cases, and best practices. They discuss the challenges of event handling, concurrency, and load balancing, as well as the importance of observability and testing in serverless environments. The conversation highlights the innovative solutions AWS Lambda provides for developers, emphasizing the balance between simplicity and complexity in cloud computing.Chapters:00:00 Introduction to AWS Lambda04:36 Use Cases and Best Practices for AWS Lambda09:34 Event Handling and Queue Management19:41 Idempotency and Event Duplication Challenges29:39 Cold Starts and Performance Optimization34:37 Statelessness and Resource Management in Lambda42:18 Understanding Micro-VMs and Cold Starts45:14 Resource Management and Recommendations for Developers47:04 Scaling and Back Pressure in Serverless Systems51:33 Cellular Architecture and Fairness in Resource Allocation55:23 Handling Problematic Events and Poison Pills01:01:03 Testing and Operational Readiness in Lambda01:14:11 Preparing for High Traffic EventsReferences:Handling Billions of invocations: https://aws.amazon.com/blogs/compute/handling-billions-of-invocations-best-practices-from-aws-lambda/Firecracker: https://firecracker-microvm.github.io/AWS Lambda: https://aws.amazon.com/lambda/Connect with Rajesh: https://x.com/RPandeyViewshttps://www.linkedin.com/in/rajeshpandeyiiit/Don't forget to like, share, and subscribe for more insights!=============================================================================Like building stuff? Try out CodeCrafters and build amazing real world systems like Redis, Kafka, Sqlite. Use the link below to signup and get 40% off on paid subscription.https://app.codecrafters.io/join?via=geeknarrator=============================================================================Database internals series: https://youtu.be/yV_Zp0Mi3xsPopular playlists:Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_dModern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsNStay Curios! Keep Learning!#aws #awslambda #serverless #distributedsystems #scalability #reliability

preparing testing chapters recommendations scaling best practices real time fairness databases kafka use cases firecrackers rajesh keep learning resource management redis aws lambda sqlite statelessness operational readiness

Fast Observability on S3 with Parseable

ai chapters architecture real time databases deployment kafka s3 sql getting involved configuration observability keep learning redis caching sqlite nitish key features

Play Episode Listen Later Jul 29, 2025 65:46

For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinSummaryIn this conversation, Nitish Tiwari discusses Parseable, an observability platform designed to address the challenges of managing and analyzing large volumes of data. The discussion covers the evolution of observability systems, the design principles behind Parseable, and the importance of efficient data ingestion and storage in S3. Nitish explains how Parseable allows for flexible deployment, handles data organization, and supports querying through SQL. The conversation also touches on the correlation of logs and traces, failure modes, scaling strategies, and the optional nature of indexing for performance optimization.References:Parseable: https://www.parseable.com/GitHub Repository: https://github.com/parseablehq/parseableArchitecture: https://parseable.com/docs/architecture Chapters:00:00 Introduction to Parseable and Observability Challenges05:17 Key Features of Parseable12:03 Deployment and Configuration of Parseable18:59 Ingestion Process and Data Handling32:52 S3 Integration and Data Organisation35:26 Organising Data in Parseable38:50 Metadata Management and Retention39:52 Querying Data: User Experience and SQL44:28 Caching and Performance Optimisation46:55 User-Friendly Querying: SQL vs. UI48:53 Correlating Logs and Traces50:27 Handling Failures in Ingestion53:31 Managing Spiky Workloads54:58 Data Partitioning and Organisation58:06 Creating Indexes for Faster Reads01:00:08 Parseable's Architecture and Optimisation01:03:09 AI for Enhanced Observability01:05:41 Getting Involved with ParseableFor memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinDon't forget to like, share, and subscribe for more insights!=============================================================================Like building stuff? Try out CodeCrafters and build amazing real world systems like Redis, Kafka, Sqlite. Use the link below to signup and get 40% off on paid subscription.https://app.codecrafters.io/join?via=geeknarrator=============================================================================Database internals series: https://youtu.be/yV_Zp0Mi3xsPopular playlists:Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_dModern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsNStay Curios! Keep Learning!#database #s3 #objectstorage #opentelemetry #logs #metrics

Building a new Database Query Optimiser - @cmu

Play Episode Listen Later Jul 29, 2025 83:51

Read more about Kafka Diskless-topics, KIP by Aiven:KIP-1150: https://fnf.dev/3EuL7mvSummary:In this conversation, Kaivalya Apte and Alexis Schlomer discuss the internals of query optimization with the new project optd. They explore the challenges faced by existing query optimizers, the importance of cost models, and the advantages of using Rust for performance and safety. The discussion also covers the innovative streaming model of query execution, feedback mechanisms for refining optimizations, and the future developments planned for optd, including support for various databases and enhanced cost models.Chapters00:00 Introduction to optd and Its Purpose03:57 Understanding Query Optimization and Its Importance10:26 Defining Query Optimization and Its Challenges17:32 Exploring the Limitations of Existing Optimizers21:39 The Role of Calcite in Query Optimization26:54 The Need for a Domain-Specific Language40:10 Advantages of Using Rust for optd44:37 High-Level Overview of optd's Functionality48:36 Optimizing Query Execution with Coroutines50:03 Streaming Model for Query Optimization51:36 Client Interaction and Feedback Mechanism54:18 Adaptive Decision Making in Query Execution54:56 Persistent Memoization for Enhanced Performance57:12 Guided Scheduling in Query Optimization59:55 Balancing Execution Time and Optimization01:01:43 Understanding Cost Models in Query Optimization01:04:22 Exploring Storage Solutions for Query Optimization01:07:13 Enhancing Observability and Caching Mechanisms01:07:44 Future Optimizations and System Improvements01:18:02 Challenges in Query Optimization Development01:20:33 Upcoming Features and Roadmap for optdReferences:- NeuroCard: learned Cardinality Estimation: https://vldb.org/pvldb/vol14/p61-yang.pdf- RL-based QO: https://arxiv.org/pdf/1808.03196- Microsoft book about QO: https://www.microsoft.com/en-us/research/publication/extensible-query-optimizers-in-practice/- Cascades paper: https://15721.courses.cs.cmu.edu/spring2016/papers/graefe-ieee1995.pdf- optd source code: https://github.com/cmu-db/optd- optd website (for now): https://db.cs.cmu.edu/projects/optd/For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinDon't forget to like, share, and subscribe for more insights!=============================================================================Like building stuff? Try out CodeCrafters and build amazing real world systems like Redis, Kafka, Sqlite. Use the link below to signup and get 40% off on paid subscription.https://app.codecrafters.io/join?via=geeknarrator=============================================================================Database internals series: https://youtu.be/yV_Zp0Mi3xsPopular playlists:Realtime streaming systems: https://www.youtube.com/playlist?list=PLL7QpTxsA4se-mAKKoVOs3VcaP71X_LA-Software Engineering: https://www.youtube.com/playlist?list=PLL7QpTxsA4sf6By03bot5BhKoMgxDUU17Distributed systems and databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4sfLDUnjBJXJGFhhz94jDd_dModern databases: https://www.youtube.com/playlist?list=PLL7QpTxsA4scSeZAsCUXijtnfW5ARlrsNStay Curios! Keep Learning!#database #queryoptimization #sql #postgres

challenges microsoft exploring roadmap limitations rust advantages real time databases kafka query rl cascades optimiser keep learning redis sqlite qo calcite

#441 It's Michaels All the Way Down

Play Episode Listen Later Jul 21, 2025 27:48 Transcription Available

Topics covered in this episode: * Distributed sqlite follow up: Turso and Litestream* * PEP 792 – Project status markers in the simple index* Run coverage on tests docker2exe: Convert a Docker image to an executable Extras Joke Watch on YouTube About the show Sponsored by Digital Ocean: pythonbytes.fm/digitalocean-gen-ai Use code DO4BYTES and get $200 in free credit Connect with the hosts Michael: @mkennedy@fosstodon.org / @mkennedy.codes (bsky) Brian: @brianokken@fosstodon.org / @brianokken.bsky.social Show: @pythonbytes@fosstodon.org / @pythonbytes.fm (bsky) Join us on YouTube at pythonbytes.fm/live to be part of the audience. Usually Monday at 10am PT. Older video versions available there too. Finally, if you want an artisanal, hand-crafted digest of every week of the show notes in email form? Add your name and email to our friends of the show list, we'll never share it. Michael #1: Distributed sqlite follow up: Turso and Litestream Michael Booth: Turso marries the familiarity and simplicity of SQLite with modern, scalable, and distributed features. Seems to me that Turso is to SQLite what MotherDuck is to DuckDB. Mike Fiedler Continue to use the SQLite you love and care about (even the one inside Python runtime) and launch a daemon that watches the db for changes and replicates changes to an S3-type object store. Deeper dive: Litestream: Revamped Brian #2: PEP 792 – Project status markers in the simple index Currently 3 status markers for packages Trove Classifier status Indices can be yanked PyPI projects - admins can quarantine a project, owners can archive a project Proposal is to have something that can have only one state active archived quarantined deprecated This has been Approved, but not Implemented yet. Brian #3: Run coverage on tests Hugo van Kemenade And apparently, run Ruff with at least F811 turned on Helps with copy/paste/modify mistakes, but also subtler bugs like consumed generators being reused. Michael #4: docker2exe: Convert a Docker image to an executable This tool can be used to convert a Docker image to an executable that you can send to your friends. Build with a simple command: $ docker2exe --name alpine --image alpine:3.9 Requires docker on the client device Probably doesn't map volumes/ports/etc, though could potentially be exposed in the dockerfile. Extras Brian: Back catalog of Test & Code is now on YouTube under @TestAndCodePodcast So far 106 of 234 episodes are up. The rest are going up according to daily limits. Ordering is rather chaotic, according to upload time, not release ordering. There will be a new episode this week pytest-django with Adam Johnson Joke: If programmers were doctors

Train brake hack, GRU sanctions, Wagner war crimes, Microsoft's Chinese ‘digital escorts'

Security Conversations

Play Episode Listen Later Jul 18, 2025 108:45

Three Buddy Problem - Episode 54: Europol busted pro‑Russian hacktivist crew NoName 057(16), the Brits announce sanctions on Russia's GRU cyber units, Wagner‑linked “war influencers” streamed atrocities from Africa, and fresh tech worries ranged from a $500 RF flaw that can hijack U.S. train brakes. Plus, ProPublica on Microsoft's China‑based “digital escorts,” Google's headline‑grabbing AI‑found SQLite zero‑day, and OpenAI's new task‑running agents. Meanwhile, Ukraine's hackers wiped a Russian drone maker, ransomware crippled a major vodka producer, and another Chrome zero‑day quietly underscored how routine critical exploits have become. Cast: Juan Andres Guerrero-Saade (https://twitter.com/juanandres_gs), Ryan Naraine (https://twitter.com/ryanaraine) and Costin Raiu (https://twitter.com/craiu).

#440 Can't Register for VibeCon