Podcasts about AI engineer

  • 103 podcasts
  • 190 episodes
  • 47m avg duration
  • 5 weekly new episodes
  • Latest: Jan 16, 2026


Best podcasts about AI engineer

Latest podcast episodes about AI engineer

The Next 100 Days Podcast
#508 - Faiza Patan - From Student to AI Engineer

Jan 16, 2026 · 51:53


Faiza has gone from student to AI Engineer, developing valuable solutions for MicroYES and Finely Fettled clients. Her skills include AWS, Linux, and DevOps. She hails from Southern India and will complete her MSc in International Management at York St John University in early 2026. She is currently developing lead generation AI solutions for Finely Fettled and MicroYES clients.

Summary of Podcast

Key Takeaways
- Faiza's career progressed from student to AI Engineer via a structured path: internship → placement → full-time hire.
- Her role involves building AI agents (e.g., "Phone to Agent") and Answer Engine Optimisation (AEO) to help clients get found in LLM answers, a critical shift from traditional SEO.
- The hiring process used Handshake, a university student-focused job platform, and video interviews, where key advice for students is to speak up, slow down, smile, and make eye contact.
- AI is shifting the workforce from manual research to higher-value roles like AI architecture, with low-code/no-code tools enabling non-technical entry.

Faiza's Career Progression
- Background: From Kadapa, Southern India, with a Bachelor of Commerce.
- Early Skill-Building: Completed a 6-month course in AWS, Linux, and DevOps in Bangalore while working in inside sales.
- UK Education: Chose York St John University for its placement year option, which Manchester Metropolitan lacks.
- Hiring Process:
  - Platform: Found via Handshake, a university job platform.
  - Video Interview: A key step where students answer AI-generated questions on camera.
- Career Path:
  - Internship: Initial role at Finely Fettled and its brand MicroYES.
  - Placement: Extended 9-month contract.
  - Full-Time: Hired as an AI Engineer/Architect and Marketing Manager.

AI in Business & Marketing
- MeclabsAI Platform: Faiza's work on this AI solutions platform includes:
  - AI Agent Delivery Systems: Personalised agents, not generic chatbots.
  - AI Workflows: Self-service tools, like a database query workflow on the https://finelyfettled.co.uk website.
  - "Phone to Agent": A new service for small businesses. An AI agent answers calls using the client's specific policies and pricing, designed for natural conversation (e.g., "mm-hmm" confirmations, background noise). Rationale: Provides cost-effective, consistent phone support for busy professionals and small businesses.
- Answer Engine Optimisation (AEO):
  - Rationale: Anticipates ChatGPT providing more answers than Google by early 2028, making AEO a critical marketing strategy.
  - Goal: Structure website content to be found and cited in LLM answers.
  - Execution: An AI agent guides clients through the process.
- The Value of Diversity: Kevin noted Faiza's value comes from her diverse perspective (age, gender, culture), which provides fresh insights.

Advice for Students
- Set a Clear Goal: Define a career path and stay focused.
- Use University Resources: Actively leverage career services and platforms like...

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0
Artificial Analysis: Independent LLM Evals as a Service — with George Cameron and Micah-Hill Smith

Jan 8, 2026 · 78:24


Happy New Year! You may have noticed that in 2025 we had moved toward YouTube as our primary podcasting platform. As we'll explain in the next State of Latent Space post, we'll be doubling down on Substack again and improving the experience for the over 100,000 of you who look out for our emails and website updates!

We first mentioned Artificial Analysis in 2024, when it was still a side project in a Sydney basement. They were then one of the few companies in Nat Friedman and Daniel Gross' AI Grant to raise a full seed round from them, and have now become the independent gold standard for AI benchmarking—trusted by developers, enterprises, and every major lab to navigate the exploding landscape of models, providers, and capabilities.

We have chatted with both Clementine Fourrier of HuggingFace's Open LLM Leaderboard and Anastasios Angelopoulos of LMArena (freshly valued at $1.7B) on their approaches to LLM evals and trendspotting, but Artificial Analysis have staked out an enduring and important place in the toolkit of the modern AI Engineer by doing the best job of independently running the most comprehensive set of evals across the widest range of open and closed models, and charting their progress for broad industry analyst use.

George Cameron and Micah Hill-Smith have spent two years building Artificial Analysis into the platform that answers the questions no one else will: Which model is actually best for your use case? What are the real speed-cost trade-offs? And how open is "open" really?

We discuss:
* The origin story: built as a side project in 2023 while Micah was building a legal AI assistant, launched publicly in January 2024, and went viral after swyx's retweet
* Why they run evals themselves: labs prompt models differently, cherry-pick chain-of-thought examples (Google Gemini 1.0 Ultra used 32-shot prompts to beat GPT-4 on MMLU), and self-report inflated numbers
* The mystery shopper policy: they register accounts not on their own domain and run intelligence + performance benchmarks incognito to prevent labs from serving different models on private endpoints
* How they make money: an enterprise benchmarking insights subscription (standardized reports on model deployment: serverless vs. managed vs. leasing chips) and private custom benchmarking for AI companies (no one pays to be on the public leaderboard)
* The Intelligence Index (V3): synthesizes 10 eval datasets (MMLU, GPQA, agentic benchmarks, long-context reasoning) into a single score, with 95% confidence intervals via repeated runs
* Omniscience Index (hallucination rate): scores models from -100 to +100 (penalizing incorrect answers, rewarding "I don't know"), and Claude models lead with the lowest hallucination rates despite not always being the smartest
* GDPVal AA: their version of OpenAI's GDPval (44 white-collar tasks with spreadsheets, PDFs, PowerPoints), run through their Stirrup agent harness (up to 100 turns, code execution, web search, file system), graded by Gemini 3 Pro as an LLM judge (tested extensively, no self-preference bias)
* The Openness Index: scores models 0-18 on transparency of pre-training data, post-training data, methodology, training code, and licensing (AI2 OLMo 2 leads, followed by Nous Hermes and NVIDIA Nemotron)
* The smiling curve of AI costs: GPT-4-level intelligence is 100-1000x cheaper than at launch (thanks to smaller models like Amazon Nova), but frontier reasoning models in agentic workflows cost more than ever (sparsity, long context, multi-turn agents)
* Why sparsity might go way lower than 5%: GPT-4.5 is ~5% active, Gemini models might be ~3%, and Omniscience accuracy correlates with total parameters (not active), suggesting massive sparse models are the future
* Token efficiency vs. turn efficiency: GPT-5 costs more per token but solves Tau-bench in fewer turns (cheaper overall), and models are getting better at using more tokens only when needed (5.1 Codex has tighter token distributions)
* V4 of the Intelligence Index coming soon: adding GDPVal AA, Critical Point, hallucination rate, and dropping some saturated benchmarks (HumanEval-style coding is now trivial for small models)

Links to Artificial Analysis
* Website: https://artificialanalysis.ai
* George Cameron on X: https://x.com/georgecameron
* Micah Hill-Smith on X: https://x.com/micahhsmith

Full Episode on YouTube

Timestamps
* 00:00 Introduction: Full Circle Moment and Artificial Analysis Origins
* 01:19 Business Model: Independence and Revenue Streams
* 04:33 Origin Story: From Legal AI to Benchmarking Need
* 16:22 AI Grant and Moving to San Francisco
* 19:21 Intelligence Index Evolution: From V1 to V3
* 11:47 Benchmarking Challenges: Variance, Contamination, and Methodology
* 13:52 Mystery Shopper Policy and Maintaining Independence
* 28:01 New Benchmarks: Omniscience Index for Hallucination Detection
* 33:36 Critical Point: Hard Physics Problems and Research-Level Reasoning
* 23:01 GDPVal AA: Agentic Benchmark for Real Work Tasks
* 50:19 Stirrup Agent Harness: Open Source Agentic Framework
* 52:43 Openness Index: Measuring Model Transparency Beyond Licenses
* 58:25 The Smiling Curve: Cost Falling While Spend Rising
* 1:02:32 Hardware Efficiency: Blackwell Gains and Sparsity Limits
* 1:06:23 Reasoning Models and Token Efficiency: The Spectrum Emerges
* 1:11:00 Multimodal Benchmarking: Image, Video, and Speech Arenas
* 1:15:05 Looking Ahead: Intelligence Index V4 and Future Directions
* 1:16:50 Closing: The Insatiable Demand for Intelligence

Transcript

Micah [00:00:06]: This is kind of a full circle moment for us in a way, because the first time Artificial Analysis got mentioned on a podcast was you and Alessio on Latent Space. Amazing. swyx [00:00:17]: Which was January 2024. I don't even remember doing that, but yeah, it was very influential to me.
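The Intelligence Index bullet above describes combining roughly ten eval datasets into one score, with 95% confidence intervals obtained from repeated runs. A minimal sketch of that style of aggregation, under illustrative assumptions (made-up dataset names and scores, equal weighting, and a simple normal-approximation interval), not Artificial Analysis's actual methodology:

```python
import math
from statistics import mean, stdev

# Hypothetical per-eval accuracies from repeated runs (fractions correct).
# Names and numbers are placeholders, not real Artificial Analysis data.
runs = {
    "mmlu_pro":      [0.81, 0.79, 0.80, 0.82],
    "gpqa_diamond":  [0.58, 0.61, 0.57, 0.60],
    "agentic_tasks": [0.44, 0.47, 0.45, 0.46],
    "long_context":  [0.52, 0.50, 0.53, 0.51],
}

def mean_and_ci95(samples: list[float]) -> tuple[float, float]:
    """Mean and half-width of a normal-approximation 95% CI over repeated runs."""
    m = mean(samples)
    half = 1.96 * stdev(samples) / math.sqrt(len(samples))
    return m, half

# Equal weights here; a real index would weight datasets deliberately.
per_eval = {name: mean_and_ci95(scores) for name, scores in runs.items()}
index = 100 * mean(m for m, _ in per_eval.values())
# Propagate per-eval uncertainty assuming independence between datasets.
index_half = 100 * math.sqrt(sum(h ** 2 for _, h in per_eval.values())) / len(per_eval)

print(f"Index-style score: {index:.1f} ± {index_half:.1f} (95% CI)")
```

Repeating runs until the combined interval is acceptably tight is the same logic discussed later in the episode, where more repeats directly multiply the cost of running the index.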
Yeah, I'm looking at AI News for Jan 17, or Jan 16, 2024. I said, this gem of a models and host comparison site was just launched. And then I put in a few screenshots, and I said, it's an independent third party. It clearly outlines the quality versus throughput trade-off, and it breaks out by model and hosting provider. I did give you s**t for missing fireworks, and how do you have a model benchmarking thing without fireworks? But you had together, you had perplexity, and I think we just started chatting there. Welcome, George and Micah, to Latent Space. I've been following your progress. Congrats on... It's been an amazing year. You guys have really come together to be the presumptive new gardener of AI, right? Which is something that...George [00:01:09]: Yeah, but you can't pay us for better results.swyx [00:01:12]: Yes, exactly.George [00:01:13]: Very important.Micah [00:01:14]: Start off with a spicy take.swyx [00:01:18]: Okay, how do I pay you?Micah [00:01:20]: Let's get right into that.swyx [00:01:21]: How do you make money?Micah [00:01:24]: Well, very happy to talk about that. So it's been a big journey the last couple of years. Artificial analysis is going to be two years old in January 2026. Which is pretty soon now. We first run the website for free, obviously, and give away a ton of data to help developers and companies navigate AI and make decisions about models, providers, technologies across the AI stack for building stuff. We're very committed to doing that and tend to keep doing that. We have, along the way, built a business that is working out pretty sustainably. We've got just over 20 people now and two main customer groups. So we want to be... We want to be who enterprise look to for data and insights on AI, so we want to help them with their decisions about models and technologies for building stuff. And then on the other side, we do private benchmarking for companies throughout the AI stack who build AI stuff. So no one pays to be on the website. We've been very clear about that from the very start because there's no use doing what we do unless it's independent AI benchmarking. Yeah. But turns out a bunch of our stuff can be pretty useful to companies building AI stuff.swyx [00:02:38]: And is it like, I am a Fortune 500, I need advisors on objective analysis, and I call you guys and you pull up a custom report for me, you come into my office and give me a workshop? What kind of engagement is that?George [00:02:53]: So we have a benchmarking and insight subscription, which looks like standardized reports that cover key topics or key challenges enterprises face when looking to understand AI and choose between all the technologies. And so, for instance, one of the report is a model deployment report, how to think about choosing between serverless inference, managed deployment solutions, or leasing chips. And running inference yourself is an example kind of decision that big enterprises face, and it's hard to reason through, like this AI stuff is really new to everybody. And so we try and help with our reports and insight subscription. Companies navigate that. We also do custom private benchmarking. And so that's very different from the public benchmarking that we publicize, and there's no commercial model around that. For private benchmarking, we'll at times create benchmarks, run benchmarks to specs that enterprises want. And we'll also do that sometimes for AI companies who have built things, and we help them understand what they've built with private benchmarking. 
Yeah. So that's a piece mainly that we've developed through trying to support everybody publicly with our public benchmarks. Yeah.swyx [00:04:09]: Let's talk about TechStack behind that. But okay, I'm going to rewind all the way to when you guys started this project. You were all the way in Sydney? Yeah. Well, Sydney, Australia for me.Micah [00:04:19]: George was an SF, but he's Australian, but he moved here already. Yeah.swyx [00:04:22]: And I remember I had the Zoom call with you. What was the impetus for starting artificial analysis in the first place? You know, you started with public benchmarks. And so let's start there. We'll go to the private benchmark. Yeah.George [00:04:33]: Why don't we even go back a little bit to like why we, you know, thought that it was needed? Yeah.Micah [00:04:40]: The story kind of begins like in 2022, 2023, like both George and I have been into AI stuff for quite a while. In 2023 specifically, I was trying to build a legal AI research assistant. So it actually worked pretty well for its era, I would say. Yeah. Yeah. So I was finding that the more you go into building something using LLMs, the more each bit of what you're doing ends up being a benchmarking problem. So had like this multistage algorithm thing, trying to figure out what the minimum viable model for each bit was, trying to optimize every bit of it as you build that out, right? Like you're trying to think about accuracy, a bunch of other metrics and performance and cost. And mostly just no one was doing anything to independently evaluate all the models. And certainly not to look at the trade-offs for speed and cost. So we basically set out just to build a thing that developers could look at to see the trade-offs between all of those things measured independently across all the models and providers. Honestly, it was probably meant to be a side project when we first started doing it.swyx [00:05:49]: Like we didn't like get together and say like, Hey, like we're going to stop working on all this stuff. I'm like, this is going to be our main thing. When I first called you, I think you hadn't decided on starting a company yet.Micah [00:05:58]: That's actually true. I don't even think we'd pause like, like George had an acquittance job. I didn't quit working on my legal AI thing. Like it was genuinely a side project.George [00:06:05]: We built it because we needed it as people building in the space and thought, Oh, other people might find it useful too. So we'll buy domain and link it to the Vercel deployment that we had and tweet about it. And, but very quickly it started getting attention. Thank you, Swyx for, I think doing an initial retweet and spotlighting it there. This project that we released. And then very quickly though, it was useful to others, but very quickly it became more useful as the number of models released accelerated. We had Mixtrel 8x7B and it was a key. That's a fun one. Yeah. Like a open source model that really changed the landscape and opened up people's eyes to other serverless inference providers and thinking about speed, thinking about cost. And so that was a key. And so it became more useful quite quickly. Yeah.swyx [00:07:02]: What I love talking to people like you who sit across the ecosystem is, well, I have theories about what people want, but you have data and that's obviously more relevant. But I want to stay on the origin story a little bit more. 
When you started out, I would say, I think the status quo at the time was every paper would come out and they would report their numbers versus competitor numbers. And that's basically it. And I remember I did the legwork. I think everyone has some knowledge of this. I think there's some version of an Excel sheet or a Google Sheet where you just, like, copy and paste the numbers from every paper and post them up there. And then sometimes they don't line up, because they're independently run. And so your numbers are going to look better than... Your reproductions of other people's numbers are going to look worse, because you don't hold their models correctly or whatever the excuse is. I think then Stanford HELM, Percy Liang's project, would also have some of these numbers. And I don't know if there's any other source that you can cite. The way that, if I were to start Artificial Analysis at the same time you guys started, I would have used EleutherAI's eval harness. Yup. Micah [00:08:06]: Yup. That was some cool stuff. At the end of the day, running these evals, it's like, if it's a simple Q&A eval, all you're doing is asking a list of questions and checking if the answers are right, which shouldn't be that crazy. But it turns out there are an enormous number of things that you've got to control for. And I mean, back when we started the website. Yeah. Yeah. Like, one of the reasons why we realized that we had to run the evals ourselves and couldn't just take results from the labs was just that they would all prompt the models differently. And when you're competing over a few points, then you can pretty easily get... You can put the answer into the model. Yeah. That, in the extreme. And, like, you get crazy cases, like back when Google did Gemini 1.0 Ultra and needed a number that would say it was better than GPT-4, and, like, constructed, I think never published, chain-of-thought examples, 32 of them, in every topic in MMLU to run it, to get the score. Like, there are so many things that you... They never shipped Ultra, right? That's the one that never made it out. Not widely. Yeah. Yeah. Yeah. I mean, I'm sure it existed, but yeah. So we were pretty sure that we needed to run them ourselves and just run them in the same way across all the models. Yeah. And we were, we were also certain from the start that you couldn't look at those in isolation. You needed to look at them alongside the cost and performance stuff. Yeah. swyx [00:09:24]: Okay. A couple of technical questions. I mean, so obviously I also thought about this and I didn't do it because of cost. Yep. Did you not worry about costs? Were you funded already? Clearly not, but you know. No. Well, we definitely weren't at the start. Micah [00:09:36]: So, like, I mean, we were paying for it personally at the start. That's a lot of money. Well, the numbers weren't nearly as bad a couple of years ago. So we certainly incurred some costs, but we were probably in the order of, like, hundreds of dollars of spend across all the benchmarking that we were doing. Yeah. So, nothing. Yeah. It was, like, kind of fine. Yeah. Yeah. These days that's gone up an enormous amount, for a bunch of reasons that we can talk about. But yeah, it wasn't that bad, because you have to also remember that, like, the number of models we were dealing with was hardly any, and the complexity of the stuff that we wanted to do to evaluate them was a lot less. Like, we were just asking some Q&A-type questions, and then one specific thing was, for a lot of evals initially, we were just, like, sampling an answer.
You know, like, what's the answer for this? Like, we didn't want to go into the answer directly without letting the models think. We weren't even doing chain of thought stuff initially. And that was the most useful way to get some results initially. Yeah.swyx [00:10:33]: And so for people who haven't done this work, literally parsing the responses is a whole thing, right? Like because sometimes the models, the models can answer any way they feel fit and sometimes they actually do have the right answer, but they just returned the wrong format and they will get a zero for that unless you work it into your parser. And that involves more work. And so, I mean, but there's an open question whether you should give it points for not following your instructions on the format.Micah [00:11:00]: It depends what you're looking at, right? Because you can, if you're trying to see whether or not it can solve a particular type of reasoning problem, and you don't want to test it on its ability to do answer formatting at the same time, then you might want to use an LLM as answer extractor approach to make sure that you get the answer out no matter how unanswered. But these days, it's mostly less of a problem. Like, if you instruct a model and give it examples of what the answers should look like, it can get the answers in your format, and then you can do, like, a simple regex.swyx [00:11:28]: Yeah, yeah. And then there's other questions around, I guess, sometimes if you have a multiple choice question, sometimes there's a bias towards the first answer, so you have to randomize the responses. All these nuances, like, once you dig into benchmarks, you're like, I don't know how anyone believes the numbers on all these things. It's so dark magic.Micah [00:11:47]: You've also got, like… You've got, like, the different degrees of variance in different benchmarks, right? Yeah. So, if you run four-question multi-choice on a modern reasoning model at the temperatures suggested by the labs for their own models, the variance that you can see on a four-question multi-choice eval is pretty enormous if you only do a single run of it and it has a small number of questions, especially. So, like, one of the things that we do is run an enormous number of all of our evals when we're developing new ones and doing upgrades to our intelligence index to bring in new things. Yeah. So, that we can dial in the right number of repeats so that we can get to the 95% confidence intervals that we're comfortable with so that when we pull that together, we can be confident in intelligence index to at least as tight as, like, a plus or minus one at a 95% confidence. Yeah.swyx [00:12:32]: And, again, that just adds a straight multiple to the cost. Oh, yeah. Yeah, yeah.George [00:12:37]: So, that's one of many reasons that cost has gone up a lot more than linearly over the last couple of years. We report a cost to run the artificial analysis. We report a cost to run the artificial analysis intelligence index on our website, and currently that's assuming one repeat in terms of how we report it because we want to reflect a bit about the weighting of the index. But our cost is actually a lot higher than what we report there because of the repeats.swyx [00:13:03]: Yeah, yeah, yeah. And probably this is true, but just checking, you don't have any special deals with the labs. They don't discount it. You just pay out of pocket or out of your sort of customer funds. Oh, there is a mix. 
So, the issue is that sometimes they may give you a special end point, which is… Ah, 100%.Micah [00:13:21]: Yeah, yeah, yeah. Exactly. So, we laser focus, like, on everything we do on having the best independent metrics and making sure that no one can manipulate them in any way. There are quite a lot of processes we've developed over the last couple of years to make that true for, like, the one you bring up, like, right here of the fact that if we're working with a lab, if they're giving us a private endpoint to evaluate a model, that it is totally possible. That what's sitting behind that black box is not the same as they serve on a public endpoint. We're very aware of that. We have what we call a mystery shopper policy. And so, and we're totally transparent with all the labs we work with about this, that we will register accounts not on our own domain and run both intelligence evals and performance benchmarks… Yeah, that's the job. …without them being able to identify it. And no one's ever had a problem with that. Because, like, a thing that turns out to actually be quite a good… …good factor in the industry is that they all want to believe that none of their competitors could manipulate what we're doing either.swyx [00:14:23]: That's true. I never thought about that. I've been in the database data industry prior, and there's a lot of shenanigans around benchmarking, right? So I'm just kind of going through the mental laundry list. Did I miss anything else in this category of shenanigans? Oh, potential shenanigans.Micah [00:14:36]: I mean, okay, the biggest one, like, that I'll bring up, like, is more of a conceptual one, actually, than, like, direct shenanigans. It's that the things that get measured become things that get targeted by labs that they're trying to build, right? Exactly. So that doesn't mean anything that we should really call shenanigans. Like, I'm not talking about training on test set. But if you know that you're going to be great at another particular thing, if you're a researcher, there are a whole bunch of things that you can do to try to get better at that thing that preferably are going to be helpful for a wide range of how actual users want to use the thing that you're building. But will not necessarily work. Will not necessarily do that. So, for instance, the models are exceptional now at answering competition maths problems. There is some relevance of that type of reasoning, that type of work, to, like, how we might use modern coding agents and stuff. But it's clearly not one for one. So the thing that we have to be aware of is that once an eval becomes the thing that everyone's looking at, scores can get better on it without there being a reflection of overall generalized intelligence of these models. Getting better. That has been true for the last couple of years. It'll be true for the next couple of years. There's no silver bullet to defeat that other than building new stuff to stay relevant and measure the capabilities that matter most to real users. Yeah.swyx [00:15:58]: And we'll cover some of the new stuff that you guys are building as well, which is cool. Like, you used to just run other people's evals, but now you're coming up with your own. And I think, obviously, that is a necessary path once you're at the frontier. You've exhausted all the existing evals. I think the next point in history that I have for you is AI Grant that you guys decided to join and move here. What was it like? I think you were in, like, batch two? Batch four. Batch four. 
Okay.Micah [00:16:26]: I mean, it was great. Nat and Daniel are obviously great. And it's a really cool group of companies that we were in AI Grant alongside. It was really great to get Nat and Daniel on board. Obviously, they've done a whole lot of great work in the space with a lot of leading companies and were extremely aligned. With the mission of what we were trying to do. Like, we're not quite typical of, like, a lot of the other AI startups that they've invested in.swyx [00:16:53]: And they were very much here for the mission of what we want to do. Did they say any advice that really affected you in some way or, like, were one of the events very impactful? That's an interesting question.Micah [00:17:03]: I mean, I remember fondly a bunch of the speakers who came and did fireside chats at AI Grant.swyx [00:17:09]: Which is also, like, a crazy list. Yeah.George [00:17:11]: Oh, totally. Yeah, yeah, yeah. There was something about, you know, speaking to Nat and Daniel about the challenges of working through a startup and just working through the questions that don't have, like, clear answers and how to work through those kind of methodically and just, like, work through the hard decisions. And they've been great mentors to us as we've built artificial analysis. Another benefit for us was that other companies in the batch and other companies in AI Grant are pushing the capabilities. Yeah. And I think that's a big part of what AI can do at this time. And so being in contact with them, making sure that artificial analysis is useful to them has been fantastic for supporting us in working out how should we build out artificial analysis to continue to being useful to those, like, you know, building on AI.swyx [00:17:59]: I think to some extent, I'm mixed opinion on that one because to some extent, your target audience is not people in AI Grants who are obviously at the frontier. Yeah. Do you disagree?Micah [00:18:09]: To some extent. To some extent. But then, so a lot of what the AI Grant companies are doing is taking capabilities coming out of the labs and trying to push the limits of what they can do across the entire stack for building great applications, which actually makes some of them pretty archetypical power users of artificial analysis. Some of the people with the strongest opinions about what we're doing well and what we're not doing well and what they want to see next from us. Yeah. Yeah. Because when you're building any kind of AI application now, chances are you're using a whole bunch of different models. You're maybe switching reasonably frequently for different models and different parts of your application to optimize what you're able to do with them at an accuracy level and to get better speed and cost characteristics. So for many of them, no, they're like not commercial customers of ours, like we don't charge for all our data on the website. Yeah. They are absolutely some of our power users.swyx [00:19:07]: So let's talk about just the evals as well. So you start out from the general like MMU and GPQA stuff. What's next? How do you sort of build up to the overall index? What was in V1 and how did you evolve it? Okay.Micah [00:19:22]: So first, just like background, like we're talking about the artificial analysis intelligence index, which is our synthesis metric that we pulled together currently from 10 different eval data sets to give what? We're pretty much the same as that. Pretty confident is the best single number to look at for how smart the models are. 
Obviously, it doesn't tell the whole story. That's why we published the whole website of all the charts to dive into every part of it and look at the trade-offs. But best single number. So right now, it's got a bunch of Q&A type data sets that have been very important to the industry, like a couple that you just mentioned. It's also got a couple of agentic data sets. It's got our own long context reasoning data set and some other use case focused stuff. As time goes on. The things that we're most interested in that are going to be important to the capabilities that are becoming more important for AI, what developers are caring about, are going to be first around agentic capabilities. So surprise, surprise. We're all loving our coding agents and how the model is going to perform like that and then do similar things for different types of work are really important to us. The linking to use cases to economically valuable use cases are extremely important to us. And then we've got some of the. Yeah. These things that the models still struggle with, like working really well over long contexts that are not going to go away as specific capabilities and use cases that we need to keep evaluating.swyx [00:20:46]: But I guess one thing I was driving was like the V1 versus the V2 and how bad it was over time.Micah [00:20:53]: Like how we've changed the index to where we are.swyx [00:20:55]: And I think that reflects on the change in the industry. Right. So that's a nice way to tell that story.Micah [00:21:00]: Well, V1 would be completely saturated right now. Almost every model coming out because doing things like writing the Python functions and human evil is now pretty trivial. It's easy to forget, actually, I think how much progress has been made in the last two years. Like we obviously play the game constantly of like the today's version versus last week's version and the week before and all of the small changes in the horse race between the current frontier and who has the best like smaller than 10B model like right now this week. Right. And that's very important to a lot of developers and people and especially in this particular city of San Francisco. But when you zoom out a couple of years ago, literally most of what we were doing to evaluate the models then would all be 100% solved by even pretty small models today. And that's been one of the key things, by the way, that's driven down the cost of intelligence at every tier of intelligence. We can talk about more in a bit. So V1, V2, V3, we made things harder. We covered a wider range of use cases. And we tried to get closer to things developers care about as opposed to like just the Q&A type stuff that MMLU and GPQA represented. Yeah.swyx [00:22:12]: I don't know if you have anything to add there. Or we could just go right into showing people the benchmark and like looking around and asking questions about it. Yeah.Micah [00:22:21]: Let's do it. Okay. This would be a pretty good way to chat about a few of the new things we've launched recently. Yeah.George [00:22:26]: And I think a little bit about the direction that we want to take it. And we want to push benchmarks. Currently, the intelligence index and evals focus a lot on kind of raw intelligence. But we kind of want to diversify how we think about intelligence. And we can talk about it. But kind of new evals that we've kind of built and partnered on focus on topics like hallucination. And we've got a lot of topics that I think are not covered by the current eval set that should be. 
And so we want to bring that forth. But before we get into that. swyx [00:23:01]: And so for listeners, just as a timestamp, right now, number one is Gemini 3 Pro High. Then followed by Claude Opus at 70, GPT-5.1 High. You don't have 5.2 yet. And Kimi K2 Thinking. Wow. Still hanging in there. So those are the top four. That will date this podcast quickly. Yeah. Yeah. I mean, I love it. I love it. No, no. 100%. Look back this time next year and go, how cute. Yep. George [00:23:25]: Totally. A quick view of that is, okay, there's a lot. I love it. I love this chart. Yeah. Micah [00:23:30]: This is such a favorite, right? Yeah. And almost every talk that George or I give at conferences and stuff, we always put this one up first to just talk about situating where we are in this moment in history. This, I think, is the visual version of what I was saying before about the zooming out and remembering how much progress there's been. If we go back to just over a year ago, before o1, before Claude Sonnet 3.5, we didn't have reasoning models or coding agents as a thing. And the game was very, very different. If we go back even a little bit before then, we're in the era where, when you look at this chart, OpenAI was untouchable for well over a year. And, I mean, you would remember that time period well, of there being very open questions about whether or not AI was going to be competitive, like, full stop, whether or not OpenAI would just run away with it, whether we would have a few frontier labs and no one else would really be able to do anything other than consume their APIs. I am quite happy overall that the world that we have ended up in is one where... Multi-model. Absolutely. And strictly more competitive every quarter over the last few years. Yeah. This year has been insane. Yeah. George [00:24:42]: You can see it. This chart with everything added is hard to read currently. There are so many dots on it, but I think it reflects a little bit what we felt, like how crazy it's been. swyx [00:24:54]: Why 14 as the default? Is that a manual choice? Because you've got ServiceNow in there, which is a less traditional name. Yeah. George [00:25:01]: It's models that we're kind of highlighting by default in our charts, in our Intelligence Index. Okay. swyx [00:25:07]: You just have a manually curated list of stuff. George [00:25:10]: Yeah, that's right. But something that I actually don't think every Artificial Analysis user knows is that you can customize our charts and choose what models are highlighted. Yeah. And so if we take off a few names, it gets a little easier to read. swyx [00:25:25]: Yeah, yeah. A little easier to read. Totally. Yeah. But I love that you can see the o1 jump. Look at that. September 2024. And the DeepSeek jump. Yeah. George [00:25:34]: Which got close to OpenAI's leadership. They were so close. I think, yeah, we remember that moment. Around this time last year, actually. Micah [00:25:44]: Yeah, yeah, yeah. I agree. Yeah, well, a couple of weeks. It was Boxing Day in New Zealand when DeepSeek V3 came out. And we'd been tracking DeepSeek and a bunch of the other global players that were less known over the second half of 2024, and had run evals on the earlier ones and stuff. I very distinctly remember Boxing Day in New Zealand, because I was with family for Christmas and stuff, running the evals and getting back result by result on DeepSeek V3. So this was the first of their V3 architecture, the 671B MoE. Micah [00:26:19]: And we were very, very impressed.
That was the moment where we were sure that DeepSeek was no longer just one of many players, but had jumped up to be a thing. The world really noticed when they followed that up with the RL working on top of v3 and R1 succeeding a few weeks later. But the groundwork for that absolutely was laid with just extremely strong base model, completely open weights that we had as the best open weights model. So, yeah, that's the thing that you really see in the game. But I think that we got a lot of good feedback on Boxing Day. us on Boxing Day last year.George [00:26:48]: Boxing Day is the day after Christmas for those not familiar.George [00:26:54]: I'm from Singapore.swyx [00:26:55]: A lot of us remember Boxing Day for a different reason, for the tsunami that happened. Oh, of course. Yeah, but that was a long time ago. So yeah. So this is the rough pitch of AAQI. Is it A-A-Q-I or A-A-I-I? I-I. Okay. Good memory, though.Micah [00:27:11]: I don't know. I'm not used to it. Once upon a time, we did call it Quality Index, and we would talk about quality, performance, and price, but we changed it to intelligence.George [00:27:20]: There's been a few naming changes. We added hardware benchmarking to the site, and so benchmarks at a kind of system level. And so then we changed our throughput metric to, we now call it output speed, and thenswyx [00:27:32]: throughput makes sense at a system level, so we took that name. Take me through more charts. What should people know? Obviously, the way you look at the site is probably different than how a beginner might look at it.Micah [00:27:42]: Yeah, that's fair. There's a lot of fun stuff to dive into. Maybe so we can hit past all the, like, we have lots and lots of emails and stuff. The interesting ones to talk about today that would be great to bring up are a few of our recent things, I think, that probably not many people will be familiar with yet. So first one of those is our omniscience index. So this one is a little bit different to most of the intelligence evils that we've run. We built it specifically to look at the embedded knowledge in the models and to test hallucination by looking at when the model doesn't know the answer, so not able to get it correct, what's its probability of saying, I don't know, or giving an incorrect answer. So the metric that we use for omniscience goes from negative 100 to positive 100. Because we're simply taking off a point if you give an incorrect answer to the question. We're pretty convinced that this is an example of where it makes most sense to do that, because it's strictly more helpful to say, I don't know, instead of giving a wrong answer to factual knowledge question. And one of our goals is to shift the incentive that evils create for models and the labs creating them to get higher scores. And almost every evil across all of AI up until this point, it's been graded by simple percentage correct as the main metric, the main thing that gets hyped. And so you should take a shot at everything. There's no incentive to say, I don't know. So we did that for this one here.swyx [00:29:22]: I think there's a general field of calibration as well, like the confidence in your answer versus the rightness of the answer. Yeah, we completely agree. Yeah. Yeah.George [00:29:31]: On that. And one reason that we didn't do that is because. Or put that into this index is that we think that the, the way to do that is not to ask the models how confident they are.swyx [00:29:43]: I don't know. Maybe it might be though. 
You put it like a JSON field, say, say confidence, and maybe it spits out something. Yeah. You know, we have done a few evals podcasts over the, over the years. And when we did one with Clementine of Hugging Face, who maintains the Open LLM Leaderboard, this was one of her top requests, which is some kind of hallucination slash lack-of-confidence calibration thing. And so, hey, this is one of them. Micah [00:30:05]: And I mean, like anything that we do, it's not a perfect metric or the whole story of everything that you think about as hallucination. But yeah, it's pretty useful and has some interesting results. Like, one of the things that we saw in the hallucination rate is that Anthropic's Claude models are at the, the very left-hand side here with the lowest hallucination rates out of the models that we've evaluated Omniscience on. That is an interesting fact. I think it probably correlates with a lot of the previously not really measured vibes stuff that people like about some of the Claude models. Is the dataset public, or is there a held-out set? There's a held-out set for this one. So we, we have published a public test set, but we've only published 10% of it. The reason is that for this one here specifically, it would be very, very easy to, like, have data contamination, because it is just factual knowledge questions. We'll update it over time to also prevent that, but, yeah, we've kept most of it held out so that we can keep it reliable for a long time. It leads us to a bunch of really cool things, including breaking down quite granularly by topic. And so we've got some of that disclosed on the website publicly right now, and there's lots more coming in terms of our ability to break out very specific topics. Yeah. swyx [00:31:23]: I would be interested. Let's, let's dwell a little bit on this hallucination one. I noticed that Haiku hallucinates less than Sonnet, which hallucinates less than Opus. Would that be the other way around in a normal capability environment? I don't know. What do you make of that? George [00:31:37]: One interesting aspect is that we've found that there's not really a strong correlation between intelligence and hallucination, right? That's to say that how smart the models are in a general sense isn't correlated with their ability to, when they don't know something, say that they don't know. It's interesting that Gemini 3 Pro Preview was a big leap over here, over Gemini 2.5 Flash and 2.5 Pro, but, and if I add Pro quickly here. swyx [00:32:07]: I bet Pro's really good. Uh, actually no, I meant, I meant, uh, the GPT Pros. George [00:32:12]: Oh yeah. swyx [00:32:13]: 'Cause the GPT Pros are rumored, we don't know for a fact, to be like eight runs and then with an LLM judge on top. Yeah. George [00:32:20]: So we saw a big jump in, this is accuracy. So this is just the percent that they get, uh, correct, and Gemini 3 Pro knew a lot more than the other models. And so, big jump in accuracy. But relatively no change between the Google Gemini models, between releases. And the hallucination rate. Exactly. And so it's likely just kind of a different post-training recipe between the Gemini and the Claude models. Yeah. Micah [00:32:45]: Um, that's, that's driven this. Yeah. You can, uh, you can partially blame us and how we define intelligence, having until now not defined hallucination as a negative in the way that we think about intelligence. swyx [00:32:56]: And so that's what we're changing.
Uh, I know many smart people who are confidently incorrect. George [00:33:02]: Uh, look, look at that. That, that is very human. Very true. And there's a time and a place for that. I think our view is that hallucination rate makes sense in this context, where it's around knowledge, but in many cases, people want the models to hallucinate, to have a go. Often that's the case in coding or when you're trying to generate newer ideas. One eval that we added to Artificial Analysis is, is Critical Point, and it's really hard, uh, physics problems. Okay. swyx [00:33:32]: And is it sort of like a HumanEval type or something different, or like a FrontierMath type? George [00:33:37]: It's not dissimilar to FrontierMath. So these are kind of research questions that academics in the physics world would be able to answer, but models really struggled to answer. So the top score here is only about 9%. swyx [00:33:51]: And the people that, that created this, like Minway and, and, actually, Ofir, who was kind of behind SWE-bench, what organization is this? Oh, is this, it's Princeton. George [00:34:01]: Kind of a range of academics from, from, uh, different academic institutions, really smart people. They talked about how they turn the models up in terms of the temperature, as high a temperature as they can, when they're trying to explore kind of new ideas in physics as a, as a thought partner, just because they, they want the models to hallucinate. Um, yeah, sometimes it's something new. Yeah, exactly. swyx [00:34:21]: Um, so not right in every situation, but, um, I think it makes sense, you know, to test hallucination in scenarios where it makes sense. Also, the obvious question is, uh, this is one of many; every lab has a system card that shows some kind of hallucination number, and you've chosen to not, uh, endorse that, and you've made your own. And I think that's a, that's a choice. Um, totally, in some sense, the rest of Artificial Analysis is public benchmarks that other people can independently rerun. You provide it as a service here. You have to fight the, well, who are we to, to, like, do this? And your, your answer is that we have a lot of customers and, you know, but like, I guess, how do you convince the individual? Micah [00:35:08]: I mean, I think, I think for hallucinations specifically, there are a bunch of different things that you might care about reasonably, and that you'd measure quite differently, like we've called this an Omniscience hallucination rate, not trying to declare that, like, it's humanity's last hallucination eval. You could, uh, you could have some interesting naming conventions and all this stuff. Um, the bigger-picture answer to that is something that I actually wanted to mention just as George was explaining Critical Point as well: so, as we go forward, we are building evals internally, we're partnering with academia, and we're partnering with AI companies to build great evals. We have pretty strong views, in various ways for different parts of the AI stack, on where there are things that are not being measured well, or things that developers care about that should be measured more and better. And we intend to be doing that. We're not necessarily obsessed with the idea that everything we do, we have to do entirely within our own team. Critical Point is a cool example of where we were a launch partner for it, working with academia, and we've got some partnerships coming up with a couple of leading companies.
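A minimal sketch of the scoring rule described above for Omniscience, assuming a simple +1 / 0 / -1 per-question rule rescaled to a -100..+100 range, and a hallucination rate defined as the share of not-correct answers that are wrong guesses rather than abstentions. The grading fields and toy data are placeholders, not the actual benchmark harness.

```python
from dataclasses import dataclass

@dataclass
class Item:
    question: str
    answer: str        # model's answer text
    correct: bool      # graded against the reference answer
    abstained: bool    # model said "I don't know" (or equivalent)

def omniscience_style_score(items: list[Item]) -> tuple[float, float]:
    """Return (index on -100..+100, hallucination rate on 0..1).

    +1 per correct answer, -1 per incorrect answer, 0 for abstentions;
    hallucination rate = incorrect / (incorrect + abstained), i.e. of the
    questions the model could not answer correctly, how often it guessed
    wrong instead of saying "I don't know".
    """
    points = sum(1 if it.correct else (0 if it.abstained else -1) for it in items)
    index = 100 * points / len(items)

    not_correct = [it for it in items if not it.correct]
    wrong = sum(1 for it in not_correct if not it.abstained)
    halluc_rate = wrong / len(not_correct) if not_correct else 0.0
    return index, halluc_rate

# Toy example: 3 correct, 1 confident wrong guess, 1 abstention.
items = [
    Item("Q1", "Paris", True, False),
    Item("Q2", "1969", True, False),
    Item("Q3", "Avogadro", True, False),
    Item("Q4", "a confident wrong guess", False, False),
    Item("Q5", "I don't know", False, True),
]
print(omniscience_style_score(items))  # (40.0, 0.5)
```

Because wrong answers cost a point while abstentions cost nothing, the metric removes the usual incentive to guess on every question, which is the shift in lab incentives discussed above.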
Those ones, obviously we have to be careful with on some of the independent stuff, but with the right disclosure, like we're completely comfortable with that. A lot of the labs have released great data sets in the past that we've used to great success independently. And so it's between all of those techniques, we're going to be releasing more stuff in the future. Cool.swyx [00:36:26]: Let's cover the last couple. And then we'll, I want to talk about your trends analysis stuff, you know? Totally.Micah [00:36:31]: So that actually, I have one like little factoid on omniscience. If you go back up to accuracy on omniscience, an interesting thing about this accuracy metric is that it tracks more closely than anything else that we measure. The total parameter count of models makes a lot of sense intuitively, right? Because this is a knowledge eval. This is the pure knowledge metric. We're not looking at the index and the hallucination rate stuff that we think is much more about how the models are trained. This is just what facts did they recall? And yeah, it tracks parameter count extremely closely. Okay.swyx [00:37:05]: What's the rumored size of GPT-3 Pro? And to be clear, not confirmed for any official source, just rumors. But rumors do fly around. Rumors. I get, I hear all sorts of numbers. I don't know what to trust.Micah [00:37:17]: So if you, if you draw the line on omniscience accuracy versus total parameters, we've got all the open ways models, you can squint and see that likely the leading frontier models right now are quite a lot bigger than the ones that we're seeing right now. And the one trillion parameters that the open weights models cap out at, and the ones that we're looking at here, there's an interesting extra data point that Elon Musk revealed recently about XAI that for three trillion parameters for GROK 3 and 4, 6 trillion for GROK 5, but that's not out yet. Take those together, have a look. You might reasonably form a view that there's a pretty good chance that Gemini 3 Pro is bigger than that, that it could be in the 5 to 10 trillion parameters. To be clear, I have absolutely no idea, but just based on this chart, like that's where you would, you would land if you have a look at it. Yeah.swyx [00:38:07]: And to some extent, I actually kind of discourage people from guessing too much because what does it really matter? Like as long as they can serve it as a sustainable cost, that's about it. Like, yeah, totally.George [00:38:17]: They've also got different incentives in play compared to like open weights models who are thinking to supporting others in self-deployment for the labs who are doing inference at scale. It's I think less about total parameters in many cases. When thinking about inference costs and more around number of active parameters. And so there's a bit of an incentive towards larger sparser models. Agreed.Micah [00:38:38]: Understood. Yeah. Great. I mean, obviously if you're a developer or company using these things, not exactly as you say, it doesn't matter. You should be looking at all the different ways that we measure intelligence. You should be looking at cost to run index number and the different ways of thinking about token efficiency and cost efficiency based on the list prices, because that's all it matters.swyx [00:38:56]: It's not as good for the content creator rumor mill where I can say. Oh, GPT-4 is this small circle. Look at GPT-5 is this big circle. And then there used to be a thing for a while. 
Yeah.Micah [00:39:07]: But that is like on its own, actually a very interesting one, right? That is it just purely that chances are the last couple of years haven't seen a dramatic scaling up in the total size of these models. And so there's a lot of room to go up properly in total size of the models, especially with the upcoming hardware generations. Yes.swyx [00:39:29]: So, you know. Taking off my shitposting face for a minute. Yes. Yes. At the same time, I do feel like, you know, especially coming back from Europe, people do feel like Ilya is probably right that the paradigm is doesn't have many more orders of magnitude to scale out more. And therefore we need to start exploring at least a different path. GDPVal, I think it's like only like a month or so old. I was also very positive when it first came out. I actually talked to Tejo, who was the lead researcher on that. Oh, cool. And you have your own version.George [00:39:59]: It's a fantastic. It's a fantastic data set. Yeah.swyx [00:40:01]: And maybe it will recap for people who are still out of it. It's like 44 tasks based on some kind of GDP cutoff that's like meant to represent broad white collar work that is not just coding. Yeah.Micah [00:40:12]: Each of the tasks have a whole bunch of detailed instructions, some input files for a lot of them. It's within the 44 is divided into like two hundred and twenty two to five, maybe subtasks that are the level of that we run through the agenda. And yeah, they're really interesting. I will say that it doesn't. It doesn't necessarily capture like all the stuff that people do at work. No avail is perfect is always going to be more things to look at, largely because in order to make the tasks well enough to find that you can run them, they need to only have a handful of input files and very specific instructions for that task. And so I think the easiest way to think about them are that they're like quite hard take home exam tasks that you might do in an interview process.swyx [00:40:56]: Yeah, for listeners, it is not no longer like a long prompt. It is like, well, here's a zip file with like a spreadsheet or a PowerPoint deck or a PDF and go nuts and answer this question.George [00:41:06]: OpenAI released a great data set and they released a good paper which looks at performance across the different web chat bots on the data set. It's a great paper, encourage people to read it. What we've done is taken that data set and turned it into an eval that can be run on any model. So we created a reference agentic harness that can run. Run the models on the data set, and then we developed evaluator approach to compare outputs. That's kind of AI enabled, so it uses Gemini 3 Pro Preview to compare results, which we tested pretty comprehensively to ensure that it's aligned to human preferences. One data point there is that even as an evaluator, Gemini 3 Pro, interestingly, doesn't do actually that well. So that's kind of a good example of what we've done in GDPVal AA.swyx [00:42:01]: Yeah, the thing that you have to watch out for with LLM judge is self-preference that models usually prefer their own output, and in this case, it was not. 
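To make the LLM-as-judge step just described more concrete, here is a hedged sketch of criteria-based pairwise grading with a check for position bias and self-preference. The prompt wording and the `judge` callable are assumptions for illustration, not Artificial Analysis's actual grading pipeline or any specific provider SDK.

```python
JUDGE_PROMPT = """You are grading two candidate deliverables for the same task.

Task criteria:
{criteria}

Candidate A:
{output_a}

Candidate B:
{output_b}

Which candidate better satisfies the criteria? Reply with exactly "A" or "B"."""

def pairwise_judge(criteria: str, output_a: str, output_b: str, judge) -> str:
    """Ask a judge model which output better meets the task criteria.

    `judge` is any callable mapping a prompt string to the model's reply,
    so this sketch stays independent of a particular provider SDK. Running
    the comparison in both orders and keeping only consistent verdicts is
    one simple way to reduce position bias.
    """
    first = judge(JUDGE_PROMPT.format(criteria=criteria, output_a=output_a, output_b=output_b)).strip()
    swapped = judge(JUDGE_PROMPT.format(criteria=criteria, output_a=output_b, output_b=output_a)).strip()
    if first == "A" and swapped == "B":
        return "A"
    if first == "B" and swapped == "A":
        return "B"
    return "tie"  # inconsistent across orderings: treat as a tie or rerun
```

Grading against explicit task criteria, rather than asking the judge which answer it "likes", is also how the conversation frames why a modern judge can avoid the self-preference problem.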
Totally.Micah [00:42:08]: I think the way that we're thinking about the places where it makes sense to use an LLM as judge approach now, like quite different to some of the early LLM as judge stuff a couple of years ago, because some of that and MTV was a great project that was a good example of some of this a while ago was about judging conversations and like a lot of style type stuff. Here, we've got the task that the grader and grading model is doing is quite different to the task of taking the test. When you're taking the test, you've got all of the agentic tools you're working with, the code interpreter and web search, the file system to go through many, many turns to try to create the documents. Then on the other side, when we're grading it, we're running it through a pipeline to extract visual and text versions of the files and be able to provide that to Gemini, and we're providing the criteria for the task and getting it to pick which one more effectively meets the criteria of the task. Yeah. So we've got the task out of two potential outcomes. It turns out that we proved that it's just very, very good at getting that right, matched with human preference a lot of the time, because I think it's got the raw intelligence, but it's combined with the correct representation of the outputs, the fact that the outputs were created with an agentic task that is quite different to the way the grading model works, and we're comparing it against criteria, not just kind of zero shot trying to ask the model to pick which one is better.swyx [00:43:26]: Got it. Why is this an ELO? And not a percentage, like GDP-VAL?George [00:43:31]: So the outputs look like documents, and there's video outputs or audio outputs from some of the tasks. It has to make a video? Yeah, for some of the tasks. Some of the tasks.swyx [00:43:43]: What task is that?George [00:43:45]: I mean, it's in the data set. Like be a YouTuber? It's a marketing video.Micah [00:43:49]: Oh, wow. What? Like model has to go find clips on the internet and try to put it together. The models are not that good at doing that one, for now, to be clear. It's pretty hard to do that with a code editor. I mean, the computer stuff doesn't work quite well enough and so on and so on, but yeah.George [00:44:02]: And so there's no kind of ground truth, necessarily, to compare against, to work out percentage correct. It's hard to come up with correct or incorrect there. And so it's on a relative basis. And so we use an ELO approach to compare outputs from each of the models between the task.swyx [00:44:23]: You know what you should do? You should pay a contractor, a human, to do the same task. And then give it an ELO and then so you have, you have human there. It's just, I think what's helpful about GDPVal, the OpenAI one, is that 50% is meant to be normal human and maybe Domain Expert is higher than that, but 50% was the bar for like, well, if you've crossed 50, you are superhuman. Yeah.Micah [00:44:47]: So we like, haven't grounded this score in that exactly. I agree that it can be helpful, but we wanted to generalize this to a very large number. It's one of the reasons that presenting it as ELO is quite helpful and allows us to add models and it'll stay relevant for quite a long time. I also think it, it can be tricky looking at these exact tasks compared to the human performance, because the way that you would go about it as a human is quite different to how the models would go about it. 
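As a sketch of how pairwise verdicts like the above can be turned into the relative, Elo-style score George describes for GDPVal AA: the K-factor and 1000-point starting rating below are conventional defaults, and the model names are placeholders, not the values or models Artificial Analysis actually uses.

```python
from collections import defaultdict

def expected(r_a: float, r_b: float) -> float:
    """Probability model A beats model B under the Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400))

def update_elo(ratings, model_a, model_b, winner, k=32.0):
    """Update ratings in place after one pairwise comparison ('A', 'B', or 'tie')."""
    score_a = {"A": 1.0, "B": 0.0, "tie": 0.5}[winner]
    e_a = expected(ratings[model_a], ratings[model_b])
    ratings[model_a] += k * (score_a - e_a)
    ratings[model_b] += k * ((1.0 - score_a) - (1.0 - e_a))

# Toy run over a few judged match-ups (model names are placeholders).
ratings = defaultdict(lambda: 1000.0)
matchups = [("model-x", "model-y", "A"), ("model-y", "model-z", "tie"), ("model-x", "model-z", "A")]
for a, b, w in matchups:
    update_elo(ratings, a, b, w)
print(dict(ratings))
```

In practice a batch of comparisons is often fit all at once (e.g. a Bradley-Terry model) rather than updated sequentially, but either way the appeal is the one George gives: new models can be added later and the relative scale stays meaningful without a ground-truth percentage.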
Yeah.
swyx [00:45:15]: I also liked that you included Llama 4 Maverick in there. Is that like just one last, like...
Micah [00:45:20]: Well, no, it is the best model released by Meta. So it makes it into the homepage default set, still for now.
George [00:45:31]: Another inclusion that's quite interesting is we also ran it across the latest versions of the web chatbots. And so we have...
swyx [00:45:39]: Oh, that's right.
George [00:45:40]: Oh, sorry.
swyx [00:45:41]: Yeah, I completely missed that. Okay.
George [00:45:43]: No, not at all. So that's the one with the checkered pattern. So that is their harness, not yours, is what you're saying. Exactly. And what's really interesting is that if you compare, for instance, Claude 4.5 Opus using the Claude web chatbot, it performs worse than the model in our agentic harness. In every case, the model performs better in our agentic harness than its web chatbot counterpart, the harness that they created.
swyx [00:46:13]: My backwards explanation for that would be that, well, it's meant for consumer use cases and here you're pushing it for something else.
Micah [00:46:19]: The constraints are different, and the amount of freedom you can give the model is different. Also, you have a cost goal. We let the models work as long as they want, basically. Do you copy-paste manually into the chatbot? Yeah, that was how we got the chatbot reference. We're not going to be keeping those updated at quite the same scale as hundreds of models.
swyx [00:46:38]: Well, I don't know, talk to Browserbase. They'll automate it for you. I have thought about how we should turn these chatbot versions into an API, because they are legitimately different agents in themselves. Yes. Right. Yeah.
Micah [00:46:53]: And that's grown a huge amount over the last year, right? The tools that are available have actually diverged, in my opinion, a fair bit across the major chatbot apps, and the amount of data sources you can connect them to has gone up a lot, meaning that your experience and the way you're using the model is more different than ever.
swyx [00:47:10]: What tools and what data connections come to mind? What's notable work that people have done?
Micah [00:47:15]: Oh, okay. So my favorite example on this is that until very recently, I would argue it was basically impossible to get an LLM to draft an email for me in any useful way. Because most times you're sending an email, you're not just writing something for the sake of writing it. Chances are the context required is a whole bunch of historical emails. Maybe it's notes that you've made, maybe it's meeting notes, maybe it's pulling something from wherever you store stuff at work. So for me, Google Drive, OneDrive, our Supabase databases if we need to do some analysis on some data. Preferably the model can be plugged into all of those things and can go do some useful work based on it. The thing I find most impressive currently, that I am somewhat surprised works really well in late 2025, is that I can have models use the Supabase MCP to query, read-only of course, run a whole bunch of SQL queries to do pretty significant data analysis, and make charts and stuff, and it can read my Gmail and my Notion. And okay. You actually use that. That's good. Is that a Claude thing?
To varying degrees, but ChatGPT and Claude right now. I would say that this stuff barely works, in fairness, right now.
George [00:48:33]: Because people are actually going to try this after they hear it. If you get an email from Micah, odds are it wasn't written by a chatbot.
Micah [00:48:38]: So, yeah, I think it is true that I have never actually sent anyone an email drafted by a chatbot. Yet.
swyx [00:48:46]: And so you can feel it, right. And yeah, this time next year we'll come back and see where it's going. Totally. Supabase, shout out, another famous Kiwi. I don't know if you've had any conversations with him about anything in particular on AI building and AI infra.
George [00:49:03]: We have had Twitter DMs with him, because we're quite big Supabase users and power users. And we probably do some things more manually than we should in Supabase, and their support line has been super friendly. One extra point regarding GDPVal AA is that, on the basis of the overperformance of the models compared to the chatbots, we realized that the reference harness we built actually works quite well on generalist agentic tasks. This proves it, in a sense. And the agent harness is very minimalist. I think it follows some of the ideas that are in Claude Code, and all that we give it is context management capabilities, a web search and web browsing tool, and a code execution environment. Anything else?
Micah [00:50:02]: I mean, we can equip it with more tools, but by default, yeah, that's it. For GDPVal we give it a tool to view an image specifically, because the models can just use a terminal to pull stuff in text form into context, but to pull visual stuff into context we had to give them a custom tool. But yeah, exactly.
George [00:50:21]: It turned out that we created a good generalist agentic harness, and so we released that on GitHub yesterday. It's called stirrup. So if people want to check it out, it's a great base for building a generalist agent for more specific tasks.
Micah [00:50:39]: I'd say the best way to use it is git clone and then have your favorite coding agent make changes to it to do whatever you want, because it's not that many lines of code and the coding agents can work with it super well.
swyx [00:50:51]: Well, that's nice for the community to explore and share and hack on. I think in other similar environments, the Terminal-Bench guys have done Harbor. It's a bundle of: we need our minimal harness, which for them is Terminus, and we also need the RL environments or Docker deployment thing to run independently. I don't know if you've looked at Harbor at all. Is that a standard that people want to adopt?
George [00:51:19]: Yeah, we've looked at it from an evals perspective, and we love Terminal-Bench and host Terminal-Bench benchmarks on Artificial Analysis. We've looked at it from a coding agent perspective, but could see it being a great basis for any kind of agent. I think where we're getting to is that these models have gotten smart enough.
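To make the "minimalist harness" idea above more concrete, here is a generic sketch of the core loop such a harness runs. This is not the actual stirrup code (that lives on GitHub); the tool set and the `call_llm` placeholder are assumptions for illustration.

```python
import subprocess


def web_search(query: str) -> str:
    """Placeholder: a real harness would hit a search API here."""
    return f"[search results for: {query}]"


def run_code(source: str) -> str:
    """Execute a small Python snippet in a subprocess and return its output."""
    proc = subprocess.run(["python", "-c", source], capture_output=True, text=True, timeout=60)
    return proc.stdout + proc.stderr


TOOLS = {"web_search": web_search, "run_code": run_code}


def call_llm(messages: list[dict]) -> dict:
    """Placeholder for the model call. A real implementation would return either
    {'tool': name, 'args': {...}} or {'final': answer} based on the conversation so far."""
    return {"final": "done"}


def run_agent(task: str, max_turns: int = 20) -> str:
    """Let the model drive: pick tools turn by turn until it declares a final answer."""
    messages = [{"role": "user", "content": task}]
    for _ in range(max_turns):
        action = call_llm(messages)
        if "final" in action:
            return action["final"]
        result = TOOLS[action["tool"]](**action["args"])
        # Crude context management: keep tool output but truncate very long results.
        messages.append({"role": "tool", "content": result[:8000]})
    return "max turns reached"


if __name__ == "__main__":
    print(run_agent("Summarise the latest GDP figures into a one-page brief."))
```

The point of keeping the loop this small is exactly what George describes: the model, not the framework, decides which tool to call next and when it is done.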
They've gotten better tools, and they can perform better when just given a minimalist set of tools and let run, letting the model control the agentic workflow rather than using a more built-out framework that tries to dictate the flow. Awesome.
swyx [00:51:56]: Let's cover the Openness Index and then let's go into the report stuff. So that's the last of the proprietary AA numbers, I guess. I don't know how you classify all these. Yeah.
Micah [00:52:07]: Or call it the last of the three new things we're talking about from the last few weeks. Because we do a mix of stuff: where we're using open source, where we open source what we do, and proprietary stuff that we don't always open source. The long context reasoning data set last year we did open source. And then all of the work on performance benchmarks across the site, some of them we're looking to open source, but some of them we're constantly iterating on. So there's a huge mix, I would say, of stuff that is open source and not across the site. So that's LCR, for people. Yeah.
swyx [00:52:41]: But let's talk about open.
Micah [00:52:42]: Let's talk about the Openness Index. This is, call it, a new way to think about how open models are. We have for a long time tracked whether the models are open weights and what the licenses on them are. That's pretty useful: it tells you what you're allowed to do with the weights of a model. But there is this whole other dimension to how open models are that is pretty important and that we haven't tracked until now, and that's how much is disclosed about how it was made. So transparency about data, pre-training data and post-training data, and whether you're allowed to use that data, and transparency about methodology and training code. Basically, those are the components. We bring them together to score an openness index for models, so that you can get the full picture of how open models are in one place.
swyx [00:53:32]: I feel like I've seen a couple of other people try to do this, but they're not maintained. I do think this does matter. I don't know what the numbers mean apart from, is there a max number? Is this out of 20?
George [00:53:44]: It's out of 18 currently, and so we've got an Openness Index page, but essentially these are points. You get points for being more open across these different categories, and the maximum you can achieve is 18. So AI2, with their extremely open Olmo 3 32B Think model, is the leader in a sense.
swyx [00:54:04]: What about Hugging Face?
George [00:54:05]: Oh, with their smaller model. It's coming soon. I think we need to run, we need to get the intelligence benchmarks right to get it on the site.
swyx [00:54:12]: You can't have an openness index and not include Hugging Face. We love Hugging Face. We'll have that up very soon. I mean, you know, RefinedWeb and all that stuff. It's amazing. Or is it called FineWeb? FineWeb. FineWeb.
Micah [00:54:23]: Yeah, yeah, no, totally. Yep. One of the reasons this is cool, right, is that if you're trying to understand the holistic picture of the models and what you can do with all the stuff the companies are contributing, this gives you that picture. And so we are going to keep it up to date alongside all the models that we do the Intelligence Index on, on the site.
And it's just an extra view to understand.
swyx [00:54:43]: Can you scroll down to this? The trade-offs chart. Yeah, that one. This really matters, right? Obviously, because you can b
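To make the points-based scoring George describes concrete, here is a rough sketch of how an openness score out of 18 could be computed. The category names and point values below are illustrative guesses, not Artificial Analysis's published rubric; only the maximum of 18 comes from the conversation above.

```python
from dataclasses import dataclass

# Illustrative rubric: each criterion is worth a few points, summing to 18.
# The real Openness Index categories and point values live on the AA openness page.
RUBRIC = {
    "weights_released": 4,
    "permissive_license": 3,
    "pretraining_data_disclosed": 3,
    "posttraining_data_disclosed": 3,
    "training_code_released": 3,
    "methodology_report": 2,
}
MAX_SCORE = sum(RUBRIC.values())  # 18


@dataclass
class ModelDisclosure:
    name: str
    criteria: dict[str, bool]

    def openness_score(self) -> int:
        """Sum the points for every criterion the model satisfies."""
        return sum(points for crit, points in RUBRIC.items() if self.criteria.get(crit, False))


if __name__ == "__main__":
    fully_open = ModelDisclosure("fully-open-example", {c: True for c in RUBRIC})
    weights_only = ModelDisclosure("weights-only-example", {"weights_released": True, "permissive_license": True})
    for m in (fully_open, weights_only):
        print(f"{m.name}: {m.openness_score()}/{MAX_SCORE}")
```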

The top AI news from the past week, every ThursdAI
ThursdAI - Jan 1 2026 - Will Brown Interview + Nvidia buys Groq, Meta buys Manus, Qwen Image 2412 & Alex New Year greetings

The top AI news from the past week, every ThursdAI

Play Episode Listen Later Jan 1, 2026 29:42


Hey all, Happy New Year! This is Alex, writing to you at the very fresh start of this year. It's 2026 already, can you believe it? There was no live stream today, I figured the co-hosts deserve a break and honestly it was a very slow week. Even the Chinese labs, who don't really celebrate X-mas and New Year's, didn't come out with a banger AFAIK. ThursdAI - AI moves fast, we're here to make sure you never miss a thing! Subscribe :) Though I thought it was an incredible opportunity to finally post the Will Brown interview I recorded in November during the AI Engineer conference. Will is a researcher at Prime Intellect (big fans of WandB btw!) and is very well known on X as a hot-takes ML person, often going viral with tons of memes! Will is the creator and maintainer of the Verifiers library (Github), and his talk at AI Engineer was all about RL Environments (what they are, you can hear in the interview, I asked him!)
TL;DR last week of 2025 in AI
Besides this, my job here is to keep you up to date, and honestly this was very easy this week, as almost nothing has happened, but here we go:
Meta buys Manus
The year ended with 2 huge acquisitions / acqui-hires. First we got the news from Alex Wang that Meta has bought Manus.ai, the agentic AI startup we covered back in March, for an undisclosed amount (folks claim $2-3B). The most interesting thing here is that Manus is a Chinese company, and this deal requires very specific severance from Chinese operations.
Jensen goes on a new year's spending spree, Nvidia buys Groq (not GROK) for $20B
Groq, which we covered often here, and are great friends, is going to NVIDIA in a very interesting acqui-hire, which is a "non-binding license" plus most of Groq's top employees apparently going to NVIDIA. Jonathan Ross, the CEO of Groq, was the co-creator of the TPU chips at Google before founding Groq, so this seems like a very strategic acqui-hire for NVIDIA! Congrats to our friends from Groq on this amazing news for the new year!
Tencent open-sources HY-MT1.5 translation models with 1.8B edge-deployable and 7B cloud variants supporting 33 languages (X, HF, HF, GitHub)
It seems that everyone is trying to dethrone Whisper, and this latest attempt from Tencent is an interesting one: 1.8B and 7B translation models with very interesting stats.
Alibaba's Qwen-Image-2512 drops on New Year's Eve as strongest open-source text-to-image model, topping AI Arena with photorealistic humans and sharper textures (X, HF, Arxiv)
Our friends at Tongyi decided to give us a New Year's present in the form of an updated Qwen-Image, with much improved realism.
That's it folks, this was a quick one. Hopefully you all had an amazing new year celebration and are gearing up for an eventful and crazy 2026. I wish you all happiness, excitement and energy to keep up with everything in the new year, and we'll make sure we're here to keep you up to date as always! P.S - I got a little news of my own yesterday, not related to AI. She said yes

The New Stack Podcast
From Group Science Project to Enterprise Service: Rethinking OpenTelemetry

The New Stack Podcast

Play Episode Listen Later Dec 30, 2025 17:20


Ari Zilka, founder of MyDecisive.ai and former Hortonworks CPO, argues that most observability vendors now offer essentially identical, reactive dashboards that highlight problems only after systems are already broken. After speaking with all 23 observability vendors at KubeCon + CloudNativeCon North America 2025, Zilka said these tools fail to meaningfully reduce mean time to resolution (MTTR), a long-standing demand he heard repeatedly from thousands of CIOs during his time at New Relic.Zilka believes observability must shift from reactive monitoring to proactive operations, where systems automatically respond to telemetry in real time. MyDecisive.ai is his attempt to solve this, acting as a “bump in the wire” that intercepts telemetry and uses AI-driven logic to trigger actions like rolling back faulty releases.He also criticized the rising cost and complexity of OpenTelemetry adoption, noting that many companies now require large, specialized teams just to maintain OTel stacks. MyDecisive aims to turn OpenTelemetry into an enterprise-ready service that reduces human intervention and operational overhead.Learn more from The New Stack about OpenTelemetry:Observability Is Stuck in the Past. Your Users Aren't. Setting Up OpenTelemetry on the Frontend Because I Hate MyselfHow to Make OpenTelemetry Better in the BrowserJoin our community of newsletter subscribers to stay on top of the news and at the top of your game.  Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.
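As a toy illustration of the "bump in the wire" idea described above (this is not MyDecisive.ai's implementation; the error-rate threshold, window size, and rollback hook are all invented), the core pattern is a pipeline stage that inspects telemetry and triggers an action instead of only drawing a dashboard:

```python
from collections import deque


def rollback(service: str, release: str) -> None:
    """Placeholder action hook: a real system would call the CD pipeline or cluster API."""
    print(f"rolling back {service} (current release: {release})")


class ErrorRateGate:
    """Keeps a sliding window of request outcomes and reacts when the error rate spikes."""

    def __init__(self, service: str, release: str, window: int = 200, threshold: float = 0.05):
        self.service, self.release = service, release
        self.outcomes: deque[bool] = deque(maxlen=window)
        self.threshold = threshold
        self.triggered = False

    def observe(self, span: dict) -> None:
        """Feed one span-like record through the gate; fire the rollback at most once."""
        self.outcomes.append(bool(span.get("error", False)))
        if len(self.outcomes) == self.outcomes.maxlen and not self.triggered:
            error_rate = sum(self.outcomes) / len(self.outcomes)
            if error_rate > self.threshold:
                self.triggered = True
                rollback(self.service, self.release)


if __name__ == "__main__":
    gate = ErrorRateGate("checkout", "v1.42.0", window=10, threshold=0.3)
    for i in range(20):
        gate.observe({"error": i % 2 == 0})  # simulate a bad release with a 50% error rate
```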

Algoritmi
Il PAGELLONE AI del 2025

Algoritmi

Play Episode Listen Later Dec 29, 2025 22:21


In this episode we go through the year's key releases month by month: from DeepSeek-R1 to Gemini 3. A lot happened!

The Enrollify Podcast
Pulse Check: Building the Modern Campus - A Higher-Ed Project Management Playbook — Pt. 4

The Enrollify Podcast

Play Episode Listen Later Dec 25, 2025 34:36


Guest Name: Ben Burke, Senior Data Scientist, Slalom
Guest Social: https://www.linkedin.com/in/ben-burke-data/
Guest Bio: Ben is a Sr. Data Scientist and AI Engineer consultant developing Generative AI solutions for Fortune 1000 companies. He's known for his practical, human-centered approach to AI adoption, and for teaching professionals how to partner with AI to improve clarity, collaboration, and decision-making. His business, Between The Data, helps teams using AI 'build the right things'. You can find him on LinkedIn where he posts about AI, team formation, project management, and his family.
- - - -
Connect With Our Host: Mallory Willsea https://www.linkedin.com/in/mallorywillsea/ https://twitter.com/mallorywillsea
About The Enrollify Podcast Network: The Higher Ed Pulse is a part of the Enrollify Podcast Network. If you like this podcast, chances are you'll like other Enrollify shows too! Enrollify is made possible by Element451 — The AI Workforce Platform for Higher Ed. Learn more at element451.com. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The New Stack Podcast
Do All Your AI Workloads Actually Require Expensive GPUs?

The New Stack Podcast

Play Episode Listen Later Dec 18, 2025 29:49


GPUs dominate today's AI landscape, but Google argues they are not necessary for every workload. As AI adoption has grown, customers have increasingly demanded compute options that deliver high performance with lower cost and power consumption. Drawing on its long history of custom silicon, Google introduced Axion CPUs in 2024 to meet needs for massive scale, flexibility, and general-purpose computing alongside AI workloads. The Axion-based C4A instance is generally available, while the newer N4A virtual machines promise up to 2x price performance. In this episode, recorded at KubeCon + CloudNativeCon North America in Atlanta, Andrei Gueletii, a technical solutions consultant for Google Cloud, joined Gari Singh, a product manager for Google Kubernetes Engine (GKE), and Pranay Bakre, a principal solutions engineer at Arm. Built on Arm Neoverse V2 cores, Axion processors emphasize energy efficiency and customization, including flexible machine shapes that let users tailor memory and CPU resources. These features are particularly valuable for platform engineering teams, which must optimize centralized infrastructure for cost, FinOps goals, and price performance as they scale. Importantly, many AI tasks—such as inference for smaller models or batch-oriented jobs—do not require GPUs. CPUs can be more efficient when GPU memory is underutilized or latency demands are low. By decoupling workloads and choosing the right compute for each task, organizations can significantly reduce AI compute costs. Learn more from The New Stack about the Axion-based C4A: Beyond Speed: Why Your Next App Must Be Multi-Architecture; Arm: See a Demo About Migrating a x86-Based App to ARM64. Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The New Stack Podcast
Breaking Data Team Silos Is the Key to Getting AI to Production

The New Stack Podcast

Play Episode Listen Later Dec 17, 2025 30:47


Enterprises are racing to deploy AI services, but the teams responsible for running them in production are seeing familiar problems reemerge—most notably, silos between data scientists and operations teams, reminiscent of the old DevOps divide. In a discussion recorded at AWS re:Invent 2025, IBM's Thanos Matzanas and Martin Fuentes argue that the challenge isn't new technology but repeating organizational patterns. As data teams move from internal projects to revenue-critical, customer-facing applications, they face new pressures around reliability, observability, and accountability.The speakers stress that many existing observability and governance practices still apply. Standard metrics, KPIs, SLOs, access controls, and audit logs remain essential foundations, even as AI introduces non-determinism and a heavier reliance on human feedback to assess quality. Tools like OpenTelemetry provide common ground, but culture matters more than tooling.Both emphasize starting with business value and breaking down silos early by involving data teams in production discussions. Rather than replacing observability professionals, AI should augment human expertise, especially in critical systems where trust, safety, and compliance are paramount.Learn more from The New Stack about enabling AI with silos: Are Your AI Co-Pilots Trapping Data in Isolated Silos?Break the AI Gridlock at the Intersection of Velocity and TrustTaming AI Observability: Control Is the Key to SuccessJoin our community of newsletter subscribers to stay on top of the news and at the top of your game.  Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The New Stack Podcast
Why AI Parallelization Will Be One of the Biggest Challenges of 2026

The New Stack Podcast

Play Episode Listen Later Dec 16, 2025 24:05


Rob Whiteley, CEO of Coder, argues that the biggest winners in today's AI boom resemble the “picks and shovels” sellers of the California Gold Rush: companies that provide tools enabling others to build with AI. Speaking on The New Stack Makers at AWS re:Invent, Whiteley described the current AI moment as the fastest-moving shift he's seen in 25 years of tech. Developers are rapidly adopting AI tools, while platform teams face pressure to approve them, as saying “no” is no longer viable. Whiteley warns of a widening gap between organizations that extract real value from AI and those that don't, driven by skills shortages and insufficient investment in training. He sees parallels with the cloud-native transition and predicts the rise of “AI-native” companies. As agentic AI grows, developers increasingly act as managers overseeing many parallel AI agents, creating new challenges around governance, security, and state management. To address this, Coder introduced Mux, an open source coding agent multiplexer designed to help developers manage and evaluate large volumes of AI-generated code efficiently. Learn more from The New Stack about AI parallelization: The Production Generative AI Stack: Architecture and Components; Enable Parallel Frontend/Backend Development to Unlock Velocity. Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The New Stack Podcast
Kubernetes GPU Management Just Got a Major Upgrade

The New Stack Podcast

Play Episode Listen Later Dec 11, 2025 35:26


Nvidia Distinguished Engineer Kevin Klues noted that low-level systems work is invisible when done well and highly visible when it fails — a dynamic that frames current Kubernetes innovations for AI. At KubeCon + CloudNativeCon North America 2025, Klues and AWS product manager Jesse Butler discussed two emerging capabilities: dynamic resource allocation (DRA) and a new workload abstraction designed for sophisticated AI scheduling.DRA, now generally available in Kubernetes 1.34, fixes long-standing limitations in GPU requests. Instead of simply asking for a number of GPUs, users can specify types and configurations. Modeled after persistent volumes, DRA allows any specialized hardware to be exposed through standardized interfaces, enabling vendors to deliver custom device drivers cleanly. Butler called it one of the most elegant designs in Kubernetes.Yet complex AI workloads require more coordination. A forthcoming workload abstraction, debuting in Kubernetes 1.35, will let users define pod groups with strict scheduling and topology rules — ensuring multi-node jobs start fully or not at all. Klues emphasized that this abstraction will shape Kubernetes' AI trajectory for the next decade and encouraged community involvement.Learn more from The New Stack about dynamic resource allocation: Kubernetes Primer: Dynamic Resource Allocation (DRA) for GPU WorkloadsKubernetes v1.34 Introduces Benefits but Also New Blind SpotsJoin our community of newsletter subscribers to stay on top of the news and at the top of your game.   Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Practical AI
The AI engineer skills gap

Practical AI

Play Episode Listen Later Dec 10, 2025 45:33 Transcription Available


Chris and Daniel talk with returning guest, Ramin Mohammadi, about how those seeking AI Engineer and Data Science jobs are expected to come in as mid-level engineers (not entry level). They explore this growing gap along with what should (or could) be done in academia to focus on real-world skills vs. theoretical knowledge.
Featuring:
Ramin Mohammadi – LinkedIn
Chris Benson – Website, LinkedIn, Bluesky, GitHub, X
Daniel Whitenack – Website, GitHub, X
Sponsors:
Shopify – The commerce platform trusted by millions. From idea to checkout, Shopify gives you everything you need to launch and scale your business—no matter your level of experience. Build beautiful storefronts, market with built-in AI tools, and tap into the platform powering 10% of all U.S. eCommerce. Start your one-dollar trial at shopify.com/practicalai
Upcoming Events: Register for upcoming webinars here!

The New Stack Podcast
The Rise of the Cognitive Architect

The New Stack Podcast

Play Episode Listen Later Dec 10, 2025 22:53


At KubeCon North America 2025, GitLab's Emilio Salvador outlined how developers are shifting from individual coders to leaders of hybrid human–AI teams. He envisions developers evolving into “cognitive architects,” responsible for breaking down large, complex problems and distributing work across both AI agents and humans. Complementing this is the emerging role of the “AI guardian,” reflecting growing skepticism around AI-generated code. Even as AI produces more code, humans remain accountable for reviewing quality, security, and compliance.Salvador also described GitLab's “AI paradox”: developers may code faster with AI, but overall productivity stalls because testing, security, and compliance processes haven't kept pace. To fix this, he argues organizations must apply AI across the entire development lifecycle, not just in coding. GitLab's Duo Agent Platform aims to support that end-to-end transformation.Looking ahead, Salvador predicts the rise of a proactive “meta agent” that functions like a full team member. Still, he warns that enterprise adoption remains slow and advises organizations to start small, build skills, and scale gradually.Learn more from The New Stack about the evolving role of "cognitive architects":The Engineer in the AI Age: The Orchestrator and ArchitectThe New Role of Enterprise Architecture in the AI EraThe Architect's Guide to Understanding Agentic AIJoin our community of newsletter subscribers to stay on top of the news and at the top of your game.  Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The New Stack Podcast
Why the CNCF's New Executive Director is Obsessed With Inference

The New Stack Podcast

Play Episode Listen Later Dec 9, 2025 25:09


Jonathan Bryce, the new CNCF executive director, argues that inference—not model training—will define the next decade of computing. Speaking at KubeCon North America 2025, he emphasized that while the industry obsesses over massive LLM training runs, the real opportunity lies in efficiently serving these models at scale. Cloud-native infrastructure, he says, is uniquely suited to this shift because inference requires real-time deployment, security, scaling, and observability—strengths of the CNCF ecosystem. Bryce believes Kubernetes is already central to modern inference stacks, with projects like Ray, KServe, and emerging GPU-oriented tooling enabling teams to deploy and operationalize models. To bring consistency to this fast-moving space, the CNCF launched a Kubernetes AI Conformance Program, ensuring environments support GPU workloads and Dynamic Resource Allocation. With AI agents poised to multiply inference demand by executing parallel, multi-step tasks, efficiency becomes essential. Bryce predicts that smaller, task-specific models and cloud-native routing optimizations will drive major performance gains. Ultimately, he sees CNCF technologies forming the foundation for what he calls “the biggest workload mankind will ever have.” Learn more from The New Stack about inference: Confronting AI's Next Big Challenge: Inference Compute Deep Infra Is Building an AI Inference Cloud for Developers Join our community of newsletter subscribers to stay on top of the news and at the top of your game.  Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The New Stack Podcast
Kubernetes Gets an AI Conformance Program — and VMware Is Already On Board

The New Stack Podcast

Play Episode Listen Later Dec 8, 2025 30:40


The Cloud Native Computing Foundation has introduced the Certified Kubernetes AI Conformance Program to bring consistency to an increasingly fragmented AI ecosystem. Announced at KubeCon + CloudNativeCon North America 2025, the program establishes open, community-driven standards to ensure AI applications run reliably and portably across different Kubernetes platforms. VMware by Broadcom's vSphere Kubernetes Service (VKS) is among the first platforms to achieve certification.In an interview with The New Stack, Broadcom leaders Dilpreet Bindra and Himanshu Singh explained that the program applies lessons from Kubernetes' early evolution, aiming to reduce the “muddiness” in AI tooling and improve cross-platform interoperability. They emphasized portability as a core value: organizations should be able to move AI workloads between public and private clouds with minimal friction.VKS integrates tightly with vSphere, using Kubernetes APIs directly to manage infrastructure components declaratively. This approach, along with new add-on management capabilities, reflects Kubernetes' growing maturity. According to Bindra and Singh, this stability now enables enterprises to trust Kubernetes as a foundation for production-grade AI. Learn more from The New Stack about Broadcom's latest updates with Kubernetes: Has VMware Finally Caught Up with Kubernetes?VMware VCF 9.0 Finally Unifies Container and VM ManagementJoin our community of newsletter subscribers to stay on top of the news and at the top of your game.  Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Supermanagers
AI Automates Email, Meetings & Internal Workflows with Mike Potter

Supermanagers

Play Episode Listen Later Dec 4, 2025 51:44


Aydin sits down with Mike Potter, CEO and co-founder of Rewind, to talk about how AI is changing both the risk and opportunity landscape for SaaS companies. They cover how AI agents are now deleting real customer data, why backup is more critical than ever, and how Rewind became an AI-native org with dedicated AI ownership, monthly Lunch & Learns, and real internal workflows.
Mike walks through the exact N8N workflows he uses to:
Auto-triage his Gmail into multiple inboxes using AI
Generate a daily AI brief based on tasks, calendar events, and past email context
Analyze churn, win/loss, and internal product data using Claude and MCP
They close with Mike's “dream automation”: a full AI-generated business review that looks across financials, CRM data, and benchmarks.
Timestamps:
0:00 — Welcome to the show
0:31 — Mike's intro & what Rewind backs up across SaaS ecosystems
1:40 — AI agents as a new failure mode and how Rewind “saves you from your AI”
4:05 — Turning Rewind into an AI-native company early on
4:53 — First attempt at AI-built integrations (why it failed then, why it might work now)
7:23 — Developers trading tedious integration maintenance for more interesting AI work
9:45 — Code vs architecture: the Shopify webhooks story and handling 1.1B+ events
14:03 — Hiring an AI Engineer: scope, responsibilities, and why background mattered
15:33 — How Rewind drove AI adoption: Lunch & Learns, “use it in your personal life,” experimentation
20:53 — How AI Lunch & Learns actually run across multiple offices and remote folks
23:10 — Examples: CS tools, Alloy prototypes, AI video voiceovers, end-to-end workflows
25:13 — Churn workflows: combining uninstall reasons from multiple marketplaces into Claude
27:06 — Win/loss and internal analytics using Claude Projects + MCP server into an internal DB
29:14 — Choosing between Claude, ChatGPT, and Gemini depending on the task (and re-testing every few months)
31:23 — Mike's Gmail system: multiple inboxes + N8N + AI classification
36:07 — Inside the email-classifier prompt and AI-powered spam that beats Gmail filters
41:34 — The “Daily AI Brief”: pulling tasks, meetings, and prior email threads into a single morning email
45:02 — Letting AI write and debug N8N workflows (and how assistants in tools are getting better)
48:58 — Wishlist: automated AI business review across finance, Salesforce, and SaaS benchmarks
51:23 — Closing thoughts: so many useful tools are possible, but GTM is the hard part
Tools & Technologies Mentioned
Rewind – Backup and restore for mission-critical SaaS applications.
Claude – LLM used for analysis, projects, agents, and internal tools.
ChatGPT / OpenAI (GPT-4.1, GPT-4.1 mini) – LLMs used for code, prompts, and workflow JSON.
N8N – Automation platform used to build email and daily-brief workflows.
Gmail – Email client where AI-powered labels drive multiple inboxes.
Google Calendar – Calendar data powering the daily AI agenda.
Google Tasks – Task list feeding into the morning brief email.
MCP (Model Context Protocol) – Connects Claude to Rewind's internal databases.
Alloy – Tool for building interactive product UI prototypes.
Salesforce – CRM used for pipeline, churn, and win/loss analysis.
Gumloop – Workflow tool with an embedded AI assistant.
Zapier – Automation platform referenced for plain-English workflow creation.
Fellow – AI meeting assistant for summaries, action items, and insights.
Subscribe at thisnewway.com to get the step-by-step playbooks, tools, and workflows.
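For flavor, here is a minimal sketch of the auto-triage idea discussed in the episode. This is not Mike's actual N8N workflow; the label set, the prompt, and the `call_llm` placeholder are invented for illustration.

```python
# Hypothetical sketch: route incoming email into labelled inboxes with an LLM classifier.
LABELS = ["action-needed", "waiting-on-me", "newsletters", "spam-ish"]

PROMPT = """You label emails for triage. Reply with exactly one label from: {labels}

From: {sender}
Subject: {subject}
Body (truncated): {body}
"""


def call_llm(prompt: str) -> str:
    """Placeholder for whatever model the workflow uses; it should return one of LABELS."""
    return "newsletters"


def triage(email: dict) -> str:
    prompt = PROMPT.format(labels=", ".join(LABELS), sender=email["from"],
                           subject=email["subject"], body=email["body"][:2000])
    label = call_llm(prompt).strip().lower()
    # Fail safe: if the model answers off-script, surface the email to the human.
    return label if label in LABELS else "action-needed"


if __name__ == "__main__":
    print(triage({"from": "news@example.com", "subject": "Weekly digest", "body": "Top stories..."}))
```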

Data Hackers
O que você precisa saber sobre a Carreira de AI Engineer ? Data Hackers Podcast #118

Data Hackers

Play Episode Listen Later Dec 3, 2025 56:10


The AI Engineer career has established itself as one of the most sought-after in the tech market. But what is actually expected of this professional in practice?
In this episode of Data Hackers, we discuss in depth the path to becoming an AI Engineer, looking at the key technical skills, the differences from other data and engineering roles, academic training versus hands-on experience, the day-to-day routine at companies, and the impact of Generative AI, RAG, and AI Agents on the role.
To enrich the debate, we use data from the State of Data Brazil survey to understand the current state of the Brazilian market, identify trends in demand for skills, the most sought-after professional profiles, and the main challenges faced by those looking to enter or grow in this career.
If you want to move into AI, prepare for real opportunities, or figure out whether this is your next professional step in data, this episode is for you. Don't forget to fill out the State of Data Brazil survey: https://www.stateofdata.com.br/
Our Data Hackers panel:
Paulo Vasconcellos — Co-founder of Data Hackers and Principal Data Scientist at Hotmart.
Gabriel Lages — Co-founder of Data Hackers and Director of Data & AI at Hotmart

The New Stack Podcast
Helm 4: What's New in the Open Source Kubernetes Package Manager?

The New Stack Podcast

Play Episode Listen Later Dec 3, 2025 24:45


Helm — originally a hackathon project called Kate's Place — turned 10 in 2025, marking the milestone with the release of Helm 4, its first major update in six years. Created by Matt Butcher and colleagues as a playful take on “K8s,” the early project won a small prize but quickly grew into a serious effort when Deis leadership recognized the need for a Kubernetes package manager. Renamed Helm, it rapidly expanded with community contributors and became one of the first CNCF graduating projects. Helm 4 reflects years of accumulated design debt and evolving use cases. After the rapid iterations of Helm 1, 2, and 3, the latest version modernizes logging, improves dependency management, and introduces WebAssembly-based plugins for cross-platform portability—addressing the growing diversity of operating systems and architectures. Beyond headline features, maintainers emphasize that mature projects increasingly deliver “boring” but essential improvements, such as better logging, which simplify workflows and integrate more cleanly with other tools. Helm's re-architected internals also lay the foundation for new chart and package capabilities in upcoming 4.x releases. Learn more from The New Stack about Helm: The Super Helm Chart: To Deploy or Not To Deploy?; Kubernetes Gets a New Resource Orchestrator in the Form of Kro. Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The New Stack Podcast
All About Cedar, an Open Source Solution for Fine-Tuning Kubernetes Authorization

The New Stack Podcast

Play Episode Listen Later Dec 2, 2025 16:13


Kubernetes has relied on role-based access control (RBAC) since 2017, but its simplicity limits what developers can express, said Micah Hausler, principal engineer at AWS, on The New Stack Makers. RBAC only allows actions; it can't enforce conditions, denials, or attribute-based rules. Seeking a more expressive authorization model for Kubernetes, Hausler explored Cedar, an authorization engine and policy language created at AWS in 2022 and later open-sourced. Although not designed specifically for Kubernetes, Cedar proved capable of modeling its authorization needs in a concise, readable way. Hausler highlighted Cedar's clarity—nontechnical users can often understand policies at a glance—as well as its schema validation, autocomplete support, and formal verification, which ensures policies are correct and produce only allow or deny outcomes.Now onboarding to the CNCF sandbox, Cedar is used by companies like Cloudflare and MongoDB and offers language-agnostic tooling, including a Go implementation donated by StrongDM. The project is actively seeking contributors, especially to expand bindings for languages like TypeScript, JavaScript, and Python.Learn more from The New Stack about Cedar:Ceph: 20 Years of Cutting-Edge Storage at the Edge The Cedar Programming Language: Authorization SimplifiedJoin our community of newsletter subscribers to stay on top of the news and at the top of your game.  Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The top AI news from the past week, every ThursdAI
ThursdAI Special: Google's New Anti-Gravity IDE, Gemini 3 & Nano Banana Pro Explained (ft. Kevin Hou, Ammaar Reshi & Kat Kampf)

The top AI news from the past week, every ThursdAI

Play Episode Listen Later Dec 2, 2025 46:04


Hey, Alex here. I recorded these conversations just in front of the AI Engineer auditorium, back to back, after these great folks gave their talks, and at the peak of the most epic AI week we've seen since I started recording ThursdAI. This is less our traditional live recording and more a real podcast-y conversation with great folks, inspired by Latent.Space. I hope you enjoy this format as much as I've enjoyed recording and editing it.
AntiGravity with Kevin
Kevin Hou and team just launched Antigravity, Google's brand new agentic IDE based on VSCode, and Kevin (a second-timer on ThursdAI) was awesome enough to hop on and talk about some of the product decisions they made, what makes Antigravity special, and highlighted Artifacts as a completely new primitive.
Gemini 3 in AI Studio
If you aren't using Google's AI Studio (ai.dev) then you're missing out! We talk about AI Studio all the time on the show, and I'm a daily user! I generate most of my images with Nano Banana Pro in there, and most of my Gemini conversations happen there as well! Ammaar and Kat were so fun to talk to, as they covered the newly shipped “build mode” which allows you to vibe code full apps and experiences inside AI Studio, and we also covered Gemini 3's features, multimodality understanding, and UI capabilities. These folks gave a LOT of Gemini 3 demos, so they know everything there is to know about this model's capabilities!
Tried new things with this one: multi-camera angles, conversations with great folks. If you found this content valuable, please subscribe :)
Topics Covered:
* Inside Google's new “AntiGravity” IDE
* How the “Agent Manager” changes coding workflows
* Gemini 3's new multimodal capabilities
* The power of “Artifacts” and dynamic memory
* Deep dive into AI Studio updates & Vibe Coding
* Generating 4K assets with Nano Banana Pro
Timestamps for your viewing convenience:
00:00 - Introduction and Overview
01:13 - Conversation with Kevin Hou: Anti-Gravity IDE
01:58 - Gemini 3 and Nano Banana Pro Launch Insights
03:06 - Innovations in Anti-Gravity IDE
06:56 - Artifacts and Dynamic Memory
09:48 - Agent Manager and Multimodal Capabilities
11:32 - Chrome Integration and Future Prospects
20:11 - Conversation with Ammaar and Kat: AI Studio Team
21:21 - Introduction to AI Studio
21:51 - What is AI Studio?
22:52 - Ease of Use and User Feedback
24:06 - Live Demos and Launch Week
26:00 - Design Innovations in AI Studio
30:54 - Generative UIs and Vibe Coding
33:53 - Nano Banana Pro and Image Generation
39:45 - Voice Interaction and Future Roadmap
44:41 - Conclusion and Final Thoughts
Looking forward to seeing you on Thursday

The New Stack Podcast
2026 Will Be the Year of Agentic Workloads in Production on Amazon EKS

The New Stack Podcast

Play Episode Listen Later Nov 28, 2025 23:16


AWS's approach to Elastic Kubernetes Service has evolved significantly since its 2018 launch. According to Mike Stefanik, Senior Manager of Product Management for EKS and ECR, today's users increasingly represent the late majority—teams that want Kubernetes without managing every component themselves. In a conversation on The New Stack Makers, Stefanik described how AI workloads are reshaping Kubernetes operations and why AWS open-sourced an MCP server for EKS. Early feedback showed that meaningful, task-oriented tool names—not simple API mirrors—made MCP servers more effective for LLMs, prompting AWS to design tools focused on troubleshooting, runbooks, and full application workflows. AWS also introduced a hosted knowledge base built from years of support cases to power more capable agents. While “agentic AI” gets plenty of buzz, most customers still rely on human-in-the-loop workflows. Stefanik expects that to shift, predicting 2026 as the year agentic workloads move into production. For experimentation, he recommends the open-source Strands SDK. Internally, he has already seen major productivity gains from BI agents that automate complex data analysis tasks. Learn more from The New Stack about Amazon Web Services' approach to Elastic Kubernetes Service: How Amazon EKS Auto Mode Simplifies Kubernetes Cluster Management (Part 1); A Deep Dive Into Amazon EKS Auto (Part 2). Join our community of newsletter subscribers to stay on top of the news and at the top of your game. Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.
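To illustrate the point about task-oriented tool naming, here is a small, purely hypothetical contrast between an API-mirror tool spec and a task-oriented one. These dictionaries are invented for illustration and are not the actual EKS MCP server schema.

```python
# An API-mirror tool: faithful to the underlying call, but gives the LLM little to go on.
api_mirror_tool = {
    "name": "describe_cluster",
    "description": "Calls DescribeCluster.",
    "parameters": {"cluster_name": "string"},
}

# A task-oriented tool: the name and description say when and why an agent should use it.
task_oriented_tool = {
    "name": "troubleshoot_failing_pods",
    "description": (
        "Given a cluster and namespace, gather pod status, recent events and container logs, "
        "and summarise the most likely cause of CrashLoopBackOff or Pending pods."
    ),
    "parameters": {"cluster_name": "string", "namespace": "string"},
}

if __name__ == "__main__":
    for tool in (api_mirror_tool, task_oriented_tool):
        print(tool["name"], "->", tool["description"])
```

The second spec encodes the workflow (troubleshooting) rather than the API surface, which is the design shift Stefanik describes.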

The New Stack Podcast
Amazon CTO Werner Vogels' Predictions for 2026

The New Stack Podcast

Play Episode Listen Later Nov 25, 2025 54:43


AWS re:Invent has long featured CTO Werner Vogels' closing keynote, but this year he signaled it may be his last, emphasizing it's time for “younger voices” at Amazon. After 21 years with the company, Vogels reflected on arriving as an academic and being stunned by Amazon's technical scale—an energy that still drives him today. He released his annual predictions ahead of re:Invent, with this year's five themes focused heavily on AI and broader societal impacts.Vogels highlights technology's growing role in addressing loneliness, noting how devices like Alexa can offer comfort to those who feel isolated. He foresees a “Renaissance developer,” where engineers must pair deep expertise with broad business and creative awareness. He warns quantum-safe encryption is becoming urgent as data harvested today may be decrypted within five years. Military innovations, he notes, continue to influence civilian tech, for better and worse. Finally, he argues personalized learning can preserve children's curiosity and better support teachers, which he views as essential for future education.Learn more from The New Stack about evolving role of technology systems from past to future: Werner Vogels' 6 Lessons for Keeping Systems Simple50 Years Later: Remembering How the Future Looked in 1974Join our community of newsletter subscribers to stay on top of the news and at the top of your game.   Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

The RevOps Review
"AI Won't Fix Broken Processes": GTM Strategy, RevOps, and the Rise of the AI Engineer with Kristina McMillan

The RevOps Review

Play Episode Listen Later Nov 14, 2025 23:47


In this episode, Kristina McMillan, Executive in Residence at Scale Venture Partners, shares what she's seeing across Scale's portfolio when it comes to AI adoption in revenue teams. From the rise of the go-to-market engineer to the three levels of AI maturity, Kristina breaks down what's working, what's hype, and why RevOps needs to lead with strategy, not just tools. We also get into AI's real impact on metrics like ARR per employee, the role of internal AI hackathons, and how top teams are choosing between building and buying. If you're feeling overwhelmed by the pace of change, this episode will give you clarity and a tactical playbook.

Develop Yourself
#287 - From Smoothie King to AI Engineer

Develop Yourself

Play Episode Listen Later Nov 13, 2025 19:19 Transcription Available


Ryan is a current student at Parsity who built an app for his employer, Smoothie King, to suggest drinks in a chat interface using a powerful and lesser-known AI technology: RAG. RAG stands for retrieval augmented generation: basically, providing information (like smoothie recipes) to an AI model so it can return a highly specific response. Ryan breaks down how he finds the time to build side projects like this and how he built the app. Want to build your own AI-powered app? Check out this project: parsity.io/ai-with-rag Connect with Ryan here: https://www.linkedin.com/in/rhardin378/
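A bare-bones sketch of the RAG pattern described above: retrieve the most relevant snippets, then ground the model's answer in them. This is illustrative only; the recipe snippets are made up, and the embedding and generation calls are placeholders, not the app Ryan built.

```python
# Minimal RAG sketch: retrieve relevant context, then build a grounded prompt.
RECIPES = [
    "Mango Madness: mango, pineapple, orange juice, turbinado.",
    "Berry Boost: strawberries, blueberries, banana, almond milk.",
    "Green Detox: kale, apple, ginger, lemon, coconut water.",
]


def embed(text: str) -> list[float]:
    """Toy embedding based on keyword counts; real systems call an embedding model here."""
    return [text.lower().count(w) for w in ("mango", "berry", "kale", "banana", "ginger")]


def retrieve(query: str, k: int = 2) -> list[str]:
    """Rank recipes by dot-product similarity to the query and keep the top k."""
    q = embed(query)
    scored = sorted(RECIPES, key=lambda r: -sum(a * b for a, b in zip(q, embed(r))))
    return scored[:k]


def answer(query: str) -> str:
    context = "\n".join(retrieve(query))
    prompt = f"Using only these recipes:\n{context}\n\nSuggest a drink for: {query}"
    return prompt  # placeholder: a real app would send this prompt to an LLM


if __name__ == "__main__":
    print(answer("something tropical with mango"))
```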

TechTopia
Techtopia 386: Hvad er vibe coding?

TechTopia

Play Episode Listen Later Nov 10, 2025 49:17


AI-assisted software development has moved from experiment to reality. But what actually works – and what is just hype? Kasper Junge and Christian Bech Nørhave take you into the engine room, where AI is already part of the development team's everyday work. They share their experiences with AI in practice. It's not about hype, but about what works in practice.
What AI actually can (and cannot) do in software development
A shared language and processes: make AI a colleague, not a gadget
Speed requires direction: clear goals, code quality and accountability
Use AI as a force multiplier – without losing control
Featuring:
Christian Bech Nørhave
20+ years of experience in digitalization consulting
200+ talks on AI
Building a Nordic MSP in collaboration with Devoteam
Kasper Junge
AI Engineer at Dinero
Host of the Verbos Podcast
Nordic AI Influencer DAIR Award Winner
Link: vibe-coding.dk

Open Tech Talks : Technology worth Talking| Blogging |Lifestyle

Building Career Resilience in the Age of Generative AI
Every week, we explore how AI and technology are changing the way we work and learn. This episode dives into the question I get asked the most: How is Generative AI changing every career? Let's unpack why it matters, how it's shifting roles and skills, and what you can do to lead this change instead of chasing it.
In this solo episode of Open Tech Talks, host Kashif Manzoor, AI Engineer and Strategist, and author of AI Tech Circle, dives deep into one of the biggest career questions of our time: How is Generative AI reshaping every profession? Whether you're a developer, analyst, marketer, finance expert, or operations lead, the rise of Gen AI is transforming how work gets done. Kashif combines real-world enterprise experience, current research from McKinsey and Goldman Sachs, and his personal journey building the Gen AI Maturity Framework and Portal to uncover how you can stay relevant, resilient, and ready for AI-driven change. He shares first-hand stories from his own AI adoption journey, how enterprise teams are shifting from cloud architecture to AI architecture and from isolated use cases to full-scale agentic AI strategies, and the lessons learned while guiding organizations through transformation. This episode is both a roadmap and a reflection: how to experiment weekly, build your portfolio, upskill smartly, reposition your role, and teach and share as you grow.
Episode # 173
What You'll Learn
Why Generative AI matters now and how it differs from traditional AI
How tasks, roles, and careers are evolving across industries
Real-world examples from finance, marketing, and software engineering
The five practical steps to future-proof your career with Gen AI
Insights from McKinsey, ResearchGate, and Goldman Sachs on AI productivity impact
How to move from "knowing AI tools" to using AI strategically in daily work
A behind-the-scenes look at the creation of the Gen AI Maturity Framework
Why the future of work is not about jobs lost but roles transformed
External References
McKinsey Global Institute – Generative AI and the Future of Work
Deloitte – Generative AI and the Future of Work
Goldman Sachs – How Will AI Affect the Global Workforce
Robert Half – How GenAI Is Changing Creative Careers
Mäkelä & Stephany (2024) – Complement or Substitute?

MultiFamily Podcast
MFP E. 44: The Next Frontier: AI, Automation, and the Future of Work in Multifamily with Ben Infantino, AI Engineer at Apartment SEO

MultiFamily Podcast

Play Episode Listen Later Oct 29, 2025 33:21


Welcome back to The Multifamily Podcast with Ronn and Martin, powered by ApartmentSEO.com. Today, we're diving into a topic that's moving faster than almost anything in history—artificial intelligence. Our guest is Ben, an AI Engineer with Apartment SEO, who's been right in the thick of these changes. From the big bang ChatGPT moment to the new era of GPT-5, agent mode, and the future of work, we'll unpack what all of this means not just for tech, but for industries like multifamily real estate. Welcome to The Multifamily Podcast, Ben!

London Tech Talk
SRE から AI Engineer へ転身 (Asai)

London Tech Talk

Play Episode Listen Later Oct 25, 2025 54:19


We invited Asai on as a guest and caught up on his recent news. In the first half, we talked about the birth of his second child and the childcare situation in Geneva and London. After that, we asked Asai about the background to his decision to move from SRE to AI Engineer. Related posts: "From SRE to AI Engineer - impressions from the first week" and "Rethinking business strategy". We welcome your comments and feedback via this Google Form.

DGTL Voices with Ed Marx
From AC/DC to AI... Engineer to CEO (ft. Eduardo Conrado)

DGTL Voices with Ed Marx

Play Episode Listen Later Oct 14, 2025 23:57


On this episode of DGTL Voices, Ed interviews Eduardo Conrado, the incoming CEO of Ascension, discussing his journey from engineering to healthcare leadership. They explore the role of data-driven insights, and strategies for career growth. Eduardo shares his experiences and insights on how CIOs and technology leaders can effectively connect with operations to drive transformation in the healthcare sector.

UBC News World
What Is the Pathway to Become an AI Engineer? 5 Skills Developers Need Most

UBC News World

Play Episode Listen Later Oct 10, 2025 4:07


Is there a defined pathway to becoming an AI engineer? While school curricula are still taking shape, the must-have skills have been more or less identified; we tackle the major ones in this segment. Find out more at https://interviewcamp.ai/ interviewcamp.ai City: New York Address: 430 Park Ave Website: https://interviewcamp.ai

The New Stack Podcast
How the EU's Cyber Act Burdens Lone Open Source Developers

The New Stack Podcast

Play Episode Listen Later Sep 11, 2025 19:30


The European Union's upcoming Cyber Resilience Act (CRA) goes into effect in  October 2026, with the remainder of the requirements going into effect in December 2027, and introduces significant cybersecurity compliance requirements for software vendors, including those who rely heavily on open source components. At the Open Source Summit Europe, Christopher "CRob" Robinson of the Open Source Security Foundation highlighted concerns about how these regulations could impact open source maintainers. Many open source projects begin as personal solutions to shared problems and grow in popularity, often ending up embedded in critical systems across industries like automotive and energy. Despite this widespread use—Robinson noted up to 97% of commercial software contains open source—these projects are frequently maintained by individuals or small teams with limited resources.Developers often have no visibility into how their code is used, yet they're increasingly burdened by legal and compliance demands from downstream users, such as requests for Software Bills of Materials (SBOMs) and conformity assessments. The CRA raises the stakes, with potential penalties in the billions for noncompliance, putting immense pressure on the open source ecosystem. Learn more from The New Stack about Open Source Security:Open Source Propels the Fall of Security by ObscurityThere Is Just One Way To Do Open Source Security: TogetherJoin our community of newsletter subscribers to stay on top of the news and at the top of your game. 

Keys to the Commonwealth
E82 - Joseph Thacker, Leveraging AI's Impact in a Changing World

Keys to the Commonwealth

Play Episode Listen Later Sep 8, 2025 64:09


As a security researcher who specializes in application security and AI, Joseph Thacker shares his knowledge on the growing influence of AI across our culture. He is the principal AI Engineer at AppOmni and has helped multiple Fortune 500 companies find vulnerabilities that could have cost them millions. He is incredibly knowledgeable and offers great insight into this growing industry.
_______________________________
Find Joseph Thacker on
LinkedIn: https://www.linkedin.com/in/josephthacker?original_referer=https%3A%2F%2Fwww.google.com%2F
His new website and course for parents: https://aisafetyforparents.com/
X: @rez0__
Instagram: @thackandforth
Website: https://josephthacker.com/
_______________________________
Show hosted by Landry Fields
https://www.x.com/landryfieldz
https://www.linkedin.com/in/landryfields/
https://www.instagram.com/landryfields_
https://www.youtube.com/@landryfields_
www.novainsurancegroup.com
859-687-2004

Beyond Coding
Stop Hiring Junior Engineers Because of AI?

Beyond Coding

Play Episode Listen Later Sep 3, 2025 49:41


As AI accelerates development, many companies are halting junior hiring, believing AI tools can replace them. Shahin Shahkarami, Director of Data & AI at Ikea Retail, argues this is a massive mistake and that now is actually the best time to invest in new talent.
In this episode/video, we cover:
Why companies should hire junior talent despite the rise of AI.
How the role of a data scientist is evolving with generative AI.
The most valuable business use cases for AI beyond chatbots.
This conversation is for tech leaders, hiring managers, and aspiring developers looking to understand how to build and grow their careers in the age of AI.
Connect with Shahin: https://www.linkedin.com/in/shahin-shahkarami
Full episode on YouTube ▶️ https://youtu.be/Jui-8Lx6kvk
Beyond Coding Podcast

PodRocket - A web development podcast from LogRocket
Navigating the AI bubble, the 10x AI engineer, and the Cloudflare vs. Perplexity data grab

PodRocket - A web development podcast from LogRocket

Play Episode Listen Later Aug 28, 2025 44:26


Is the AI industry an unsustainable bubble built on burning billions in cash? We break down the AI hype cycle, the tough job market for developers, and whether a crash is on the horizon. In this panel discussion with Josh Goldberg, Paige Niedringhaus, Paul Mikulskis, and Noel Minchow, we tackle the biggest questions in tech today. * We debate if AI is just another Web3-style hype cycle * Why the "10x AI engineer" is a myth that ignores the reality of software development * The ethical controversy around AI crawlers and data scraping, highlighted by Cloudflare's recent actions Plus, we cover the latest industry news, including Vercel's powerful new AI SDK V5 and what GitHub's leadership shakeup means for the future of developers. Resources Anthropic Is Bleeding Out: https://www.wheresyoured.at/anthropic-is-bleeding-out The Hater's Guide To The AI Bubble: https://www.wheresyoured.at/the-haters-gui No, AI is not Making Engineers 10x as Productive: https://colton.dev/blog/curing-your-ai-10x-engineer-imposter-syndrome Cloudflare Is Blocking AI Crawlers by Default: https://www.wired.com/story/cloudflare-blocks-ai-crawlers-default Perplexity is using stealth, undeclared crawlers to evade website no-crawl directives: https://blog.cloudflare.com/perplexity-is-using-stealth-undeclared-crawlers-to-evade-website-no-crawl-directives GitHub just got less independent at Microsoft after CEO resignation: https://www.theverge.com/news/757461/microsoft-github-thomas-dohmke-resignation-coreai-team-transition Chapters 0:00 Is the AI Industry Burning Cash Unsustainably? 01:06 Anthropic and the "AI Bubble Euphoria" 04:42 How the AI Hype Cycle is Different from Web3 & VR 08:24 The Problem with "Slapping AI" on Every App 11:54 The "10x AI Engineer" is a Myth and Why 17:55 Real-World AI Success Stories 21:26 Cloudflare vs. AI Crawlers: The Ethics of Data Scraping 30:05 Vercel's New AI SDK V5: What's Changed? 33:45 GitHub's CEO Steps Down: What It Means for Developers 38:54 Hot Takes: The Future of AI Startups, the Job Market, and More We want to hear from you! How did you find us? Did you see us on Twitter? In a newsletter? Or maybe we were recommended by a friend? Fill out our listener survey (https://t.co/oKVAEXipxu)! Let us know by sending an email to our producer, Em, at emily.kochanek@logrocket.com (mailto:emily.kochanek@logrocket.com), or tweet at us at PodRocketPod (https://twitter.com/PodRocketpod). Follow us. Get free stickers. Follow us on Apple Podcasts, fill out this form (https://podrocket.logrocket.com/get-podrocket-stickers), and we'll send you free PodRocket stickers! What does LogRocket do? LogRocket provides AI-first session replay and analytics that surfaces the UX and technical issues impacting user experiences. Start understanding where your users are struggling by trying it for free at LogRocket.com. Try LogRocket for free today. (https://logrocket.com/signup/?pdr)

The New Stack Podcast
Is Your Data Strategy Ready for the Agentic AI Era?

The New Stack Podcast

Play Episode Listen Later Aug 28, 2025 27:58


Enterprise AI is still in its infancy, with less than 1% of enterprise data currently used to fuel AI, according to Raj Verma, CEO of SingleStore. While consumer AI is slightly more advanced, most organizations are only beginning to understand the scale of infrastructure needed for true AI adoption. Verma predicts AI will evolve in three phases: first, the easy tasks will be automated; next, complex tasks will become easier; and finally, the seemingly impossible will become achievable, likely within three years. However, to reach that point, enterprises must align their data strategies with their AI ambitions. Many have rushed into AI fearing obsolescence, but without preparing their data infrastructure, they're at risk of failure. Current legacy systems are not designed for the massive concurrency demands of agentic AI, potentially leading to underperformance. Verma emphasizes the need to move beyond siloed or "swim lane" databases toward unified, high-performance data platforms tailored for the scale and complexity of the AI era. Learn more from The New Stack about the latest evolution in AI infrastructure: "How To Use AI To Design Intelligent, Adaptable Infrastructure" and "How to Support Developers in Building AI Workloads." Join our community of newsletter subscribers to stay on top of the news and at the top of your game.

Develop Yourself
#267 - Step-by-Step: Build a Real AI Project with Next.js & RAG

Develop Yourself

Play Episode Listen Later Aug 25, 2025 24:54 Transcription Available


What does it actually mean to be an "AI Engineer"? Honestly, not much. The title is overloaded and vague. But what is meaningful right now is knowing how to build real projects with AI that go beyond toy chatbots and portfolio fluff. In this episode, I walk you through the exact project I've been building at two different AI startups: a Retrieval Augmented Generation (RAG) app. You'll learn how to: scrape and store content in a vector database; use embeddings to turn your text into something a model can understand; stream responses back to your frontend with Next.js + TypeScript; reduce hallucinations and add structured, reliable outputs; and understand why this is the skillset employers are actually hiring for right now.
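To make the retrieval step in that episode description concrete, here is a minimal TypeScript sketch of the core RAG loop: embed content, keep it in a small in-memory index, and pull the closest chunks into a grounded prompt. It is a sketch only; the `embed` function is a toy placeholder for a real embedding API, the array stands in for a vector database, and in a Next.js app the final prompt would be sent to an LLM and streamed back from a route handler.

```typescript
// Minimal RAG retrieval sketch. Illustrative only: `embed` is a toy placeholder
// for a real embedding API, and the in-memory array stands in for a vector DB.

type Chunk = { text: string; vector: number[] };

// Placeholder embedding: a real app would call an embedding model here.
async function embed(text: string): Promise<number[]> {
  const vec: number[] = new Array(64).fill(0);
  for (let i = 0; i < text.length; i++) vec[i % 64] += text.charCodeAt(i) / 255;
  return vec;
}

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb) || 1);
}

const index: Chunk[] = [];

// "Scrape and store": embed each chunk and keep it next to its vector.
async function addDocument(text: string): Promise<void> {
  index.push({ text, vector: await embed(text) });
}

// Retrieve the top-k chunks for a question and build a grounded prompt.
async function buildPrompt(question: string, k = 3): Promise<string> {
  const qVec = await embed(question);
  const context = index
    .map((c) => ({ text: c.text, score: cosine(qVec, c.vector) }))
    .sort((a, b) => b.score - a.score)
    .slice(0, k)
    .map((c) => c.text)
    .join("\n---\n");
  return `Answer using ONLY the context below. If the answer is not there, say so.\n\nContext:\n${context}\n\nQuestion: ${question}`;
}

// Usage: the resulting prompt would go to an LLM and be streamed back to the
// frontend, e.g. from a Next.js route handler.
(async () => {
  await addDocument("Our refund policy allows returns within 30 days.");
  await addDocument("Support hours are 9am-5pm CET on weekdays.");
  console.log(await buildPrompt("When can I return a product?"));
})();
```

Constraining the model to answer only from the retrieved context is one simple way to reduce hallucinations; structured outputs (for example, JSON schemas) push further in the same direction.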

Prodcast: Поиск работы в IT и переезд в США
45% of developers spend more time debugging AI-generated code than writing it from scratch. Evgeny Volchkov

Prodcast: Поиск работы в IT и переезд в США

Play Episode Listen Later Aug 11, 2025 100:14


A new study, the Stack Overflow 2025 Developer Survey, which gathered responses from more than 49,000 developers in 177 countries, revealed a paradoxical problem: 45% of programmers report that debugging AI-generated code takes longer than expected. 84% of developers use AI, yet most don't trust it! https://survey.stackoverflow.co/2025/ai#2-accuracy-of-ai-tools The main culprit is "AI solutions that are almost right, but not quite," which 66% of developers run into. Such code looks functional but requires careful review and fixing of hidden bugs, turning the promised time savings into extra work. These findings contrast with recent statements by OpenAI CEO Sam Altman, who claims in his latest blog post "The Gentle Singularity" that "2025 has seen the arrival of agents that can do real cognitive work; writing computer code will never be the same." In the same post, however, Altman acknowledges that change will be gradual, noting that "the world won't change all at once" and that people will find "new ways to be useful to each other," even though those ways "may not look very much like today's jobs." https://blog.samaltman.com/the-gentle-singularity For now, reality shows the opposite: AI is creating additional work instead of reducing it, forcing developers to spend time verifying and fixing "almost right" code. Evgeny Volchkov, Engineering Manager at iManage (ex-Bank of America and Verizon). LinkedIn: https://www.linkedin.com/in/valchkou/ Related episodes: - AI Engineer: the future or just hype? Which programmers will be in demand, and which will be left behind? Evgeny Volchkov https://youtube.com/live/5T6be4jjzrY - The California paradox: why is local AI talent with a Berkeley degree not in demand? Savva Vyatkin https://youtu.be/PAJ_R2hBie8 - A new era: AI, work, and the professions of the future. How AI is changing the rules of the job market. Nick Bereza https://youtube.com/live/eO9PghMknOY - IT trends 2025: venture capital, startups, artificial intelligence. Alexey Moiseenkov. https://youtube.com/live/1d7hRZrJTkM - What awaits us in 2025? Crisis, mass layoffs, startup failures. Where to look for a job? Denis Kalyshkin https://youtube.com/live/ZbYm10zrfEA *** Book a career consultation (resume, LinkedIn, career strategy, job search in the US): https://annanaumova.com Coaching (impostor syndrome, procrastination, self-doubt, fears, laziness): https://annanaumova.notion.site/3f6ea5ce89694c93afb1156df3c903ab Video course on writing resumes for international companies, "The Perfect American Resume": https://go.mbastrategy.com/resumecoursemain Guide "The Perfect American Resume": https://go.mbastrategy.com/usresume Subscribe to my Telegram channel: https://t.me/prodcastUSA Subscribe to my Instagram: https://www.instagram.com/prodcast.us Guide "How to set up your LinkedIn profile so recruiters can't pass you by": https://go.mbastrategy.com/linkedinguide ⏰ Timecodes ⏰ 00:00 Start 15:12 The AI paradox. Will it replace developers? 30:50 Will financial markets change? 41:08 Is the IT market in crisis? 54:59 How to find a remote AI job in the US? 1:01:26 Will AI replace data scientists? 1:11:07 Where are people relocating from the US? 1:12:27 How will specialists' salaries change? 1:17:14 On layoffs and crises 1:21:28 What's ahead for QA specialists? 1:30:18 AI Action Plan 1:34:25 What would you advise those who are afraid of AI?

Razib Khan's Unsupervised Learning
Nikolai Yakovenko: the $200 million AI engineer

Razib Khan's Unsupervised Learning

Play Episode Listen Later Aug 2, 2025 80:48


On this episode of Unsupervised Learning, in the wake of Elon Musk's xAI Grok chatbot turning anti-Semitic following a recent update, Razib catches up with Nikolai Yakovenko about the state of AI in the summer of 2025. Nearly three years after their first conversations on the topic, they catch up on everything since ChatGPT's release and the anticipation of massive macroeconomic transformations driven by the automation of knowledge work. Yakovenko is a former professional poker player and research scientist at Google, Twitter (now X) and Nvidia (now the first $4 trillion company). With more than a decade on the leading edge of computer science, Yakovenko has been at the forefront of the large-language-model revolution that was a necessary precursor to the rise of companies like OpenAI, Anthropic and Perplexity, as well as hundreds of smaller startups. Currently, he is the CEO of DeepNewz, an AI-driven news startup that leverages the latest models to retrieve the ground truth on news stories. Disclosure: Razib actively uses and recommends the service and is an advisor to the company. Razib and Yakovenko first tackle why Mark Zuckerberg's Meta is offering individual pay packages north of $200 million, poaching some of OpenAI's top individual contributors. Yakovenko observes that it seems Meta is giving up on its open-source Llama project, their competitor to the models that underpin OpenAI and ChatGPT (he also comments that it seems that engineers at xAI are disappointed in the latest version of Grok). Overall, though the pay packages of AI engineers and researchers are high, there is now a big shakeout as massive companies with the money and engineering researchers pull away from their competitors. Additionally, in terms of cutting-edge models, the US and China are the only two international players (Yakovenko notes parenthetically that Chinese engineers are also the primary labor base of American AI firms). They also discuss how it is notable that, almost three years after the current boom of repeated AI hype cycles began to crest, we are still no closer to “artificial general intelligence” and the “intelligence super-explosion” that Ray Kurzweil has been predicting for generations. AI is partially behind the rise of companies like Waymo that are on the verge of transforming the economy, but overall, even though AI is still casting around for its killer app, big tech has fully bought in and believes that the next decade will determine who wins the future.

Prodcast: Поиск работы в IT и переезд в США
Where is the American IT market heading? On remote work, IT salaries, local candidates, and full stack.

Prodcast: Поиск работы в IT и переезд в США

Play Episode Listen Later Jun 16, 2025 94:31


How do you clean digital clutter out of your resume and attract the HR bots? Is remote work really dead, with hybrid now ruling? Has LinkedIn turned into Tinder for programmers? Why has $200k become the new $100k in IT salaries? Do startups now pick for vibe over technical skills? Does a frontend developer need to know DevOps, or is that just a way to save on salary? Masha (Maria) Podolyak (Marsha Podolyak), author of the Telegram channel "

You + Happy
You + Happy Replay with Comedian and Engineer Jashan Kaleka

You + Happy

Play Episode Listen Later Jun 10, 2025 109:24


Find out more about Jashan on Instagram @Jashan_Kaleka. You + Happy podcast on Instagram @YouPlusHappy. Host @Selena_Marshae. Future of AI: Job Impact, Career Success, and More with AI Engineer & Comedian Jashan Kaleka.

Prodcast: Поиск работы в IT и переезд в США
AI Engineer: the future or just fashionable hype? Which programmers will be in demand, and which will be left behind?

Prodcast: Поиск работы в IT и переезд в США

Play Episode Listen Later Jun 9, 2025 103:26


Will AI replace all developers, or will it create millions of new jobs? Which programming skills will become useless within two years? Why has getting a degree at 30 become the norm in IT? Why is a junior with three programming languages a red flag? What matters more in 2025: a degree or real-world IT experience? Which AI skills are worth learning right now so you don't get left behind? What should product managers, project managers, marketers, and QA do? Will the dot-com story repeat itself with AI startups, or are these different times? Evgeny Volchkov, Engineering Manager at iManage (ex-Bank of America and Verizon). LinkedIn: https://www.linkedin.com/in/valchkou/ Related videos: - Hiring is broken. Thousands of candidates, but none that fit? Why is it so hard to find a programmer in 2025? Yulia Tarasova https://youtube.com/live/6uVCZsF4aQE - A new era: AI, work, and the professions of the future. How AI is changing the rules of the job market. Nick Bereza. https://youtube.com/live/eO9PghMknOY - IT trends 2025: venture capital, startups, artificial intelligence. Alexey Moiseenkov. https://youtube.com/live/1d7hRZrJTkM - Outsourcing in IT. Is cheap code the new reality? Who will push whom out of the development market? Valery Shirokov and Evgeny Volchkov https://youtube.com/live/LVrEzC3zai4 *** Book a career consultation (resume, LinkedIn, career strategy, job search in the US): https://annanaumova.com Coaching (impostor syndrome, procrastination, self-doubt, fears, laziness): https://annanaumova.notion.site/3f6ea5ce89694c93afb1156df3c903ab Video course on writing resumes for international companies, "The Perfect American Resume": https://go.mbastrategy.com/resumecoursemain Guide "The Perfect American Resume": https://go.mbastrategy.com/usresume Subscribe to my Telegram channel: https://t.me/prodcastUSA Subscribe to my Instagram: https://www.instagram.com/prodcast.us Guide "How to set up your LinkedIn profile so recruiters can't pass you by": https://go.mbastrategy.com/linkedinguide ⏰ Timecodes ⏰ 00:00 Start 9:10 What has changed in the US hiring market? 24:20 Questions from the chat 31:12 What should you learn in AI right now? 1:12:56 Whom will AI replace?

The AI Breakdown: Daily Artificial Intelligence News and Discussions
The Biggest Trends from the AI Engineer World's Fair

The AI Breakdown: Daily Artificial Intelligence News and Discussions

Play Episode Listen Later Jun 7, 2025 23:43


The AI Engineer World's Fair highlighted key shifts in the AI and agent world. Top themes: evals, tiny teams, agent swarms, and the rise of coding agents. NLW breaks down the key trends and the alpha that exists in the program. Get Ad Free AI Daily Brief: https://patreon.com/AIDailyBrief Brought to you by: KPMG – Go to https://kpmg.com/ai to learn more about how KPMG can help you drive value with our AI solutions. Blitzy.com – Go to https://blitzy.com/ to build enterprise software in days, not months. AGNTCY – The AGNTCY is an open-source collective dedicated to building the Internet of Agents, enabling AI agents to communicate and collaborate seamlessly across frameworks. Join a community of engineers focused on high-quality multi-agent software and support the initiative at agntcy.org – https://agntcy.org/?utm_campaign=fy25q4_agntcy_amer_paid-media_agntcy-aidailybrief_podcast&utm_channel=podcast&utm_source=podcast Vanta – Simplify compliance – https://vanta.com/nlw Plumb – The automation platform for AI experts and consultants: https://useplumb.com/ The Agent Readiness Audit from Superintelligent – Go to https://besuper.ai/ to request your company's agent readiness score. The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614 Subscribe to the newsletter: https://aidailybrief.beehiiv.com/ Join our Discord: https://bit.ly/aibreakdown Interested in sponsoring the show? nlw@breakdown.network

ChatGPT & Prompt Engineering Podcast
Say hi at AI Engineer World's Fair

ChatGPT & Prompt Engineering Podcast

Play Episode Listen Later Jun 2, 2025 1:37 Transcription Available


It's been a while since I've released an episode. I'm heading to the AI Engineer World's Fair tomorrow. If you're going to be there, I am going to be wearing a Superman shirt, so come say hi! I would love to talk to listeners and hear how your prompting journey has been going. I'm also planning on restarting the podcast, with one of a couple different directions: agents, vibe coding, or using reasoning models. Which one would you find most useful? Poll here: https://forms.gle/fLqiKeouDPazuU3s5 Stay in touch on: YouTube: youtube.com/@PromptEngineeringPodcast Telegram: https://t.me/PromptEngineeringMastermind LinkedIn: https://www.linkedin.com/groups/14231334/ Support the show

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0
[AIEWF Preview] CloudChef: Your Robot Chef - Michelin-Star food at $12/hr (w/ Kitchen tour!)

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later May 31, 2025


One of the new tracks at next week's AI Engineer conference in SF is a new focus on LLMs + Robotics, ft. household names like Waymo and Physical Intelligence. However, there are many other companies applying LLMs and VLMs in the real world! CloudChef, the first industrial-scale kitchen robotics company with one-shot demonstration learning and an incredibly simple business model, will be serving tasty treats all day with Zippy (https://www.cloudchef.co/zippy), their AI Chef platform. This is a lightning pod with CEO Nikhil Abraham to preview what Zippy is capable of! https://www.cloudchef.co/platform See a real chef comparison: https://www.youtube.com/watch?v=INDhZ7LwSeo&t=64s See it in the AI Engineer Expo at SF next week: https://ai.engineer Chapters 00:00 Welcome and Introductions 00:58 What is Cloud Chef? 01:36 How the Robots Work: Culinary Intelligence 05:57 Commercial Applications and Early Success 07:02 The Software-First Approach 10:09 Business Model and Pricing 13:10 Demonstration Learning: Training the Robots 16:03 Call to Action and Engineering Opportunities 18:45 Final Thoughts and Technical Details

MLOps.community
AI, Marketing, and Human Decision Making // Fausto Albers // #313

MLOps.community

Play Episode Listen Later May 14, 2025 49:40


AI, Marketing, and Human Decision Making // MLOps Podcast #313 with Fausto Albers, AI Engineer & Community Lead at AI Builders Club.Join the Community: https://go.mlops.community/YTJoinIn Get the newsletter: https://go.mlops.community/YTNewsletter // AbstractDemetrios and Fausto Albers explore how generative AI transforms creative work, decision-making, and human connection, highlighting both the promise of automation and the risks of losing critical thinking and social nuance.// BioFausto Albers is a relentless explorer of the unconventional—a techno-optimist with a foundation in sociology and behavioral economics, always connecting seemingly absurd ideas that, upon closer inspection, turn out to be the missing pieces of a bigger puzzle. He thrives in paradox: he overcomplicates the simple, oversimplifies the complex, and yet somehow lands on solutions that feel inevitable in hindsight. He believes that true innovation exists in the tension between chaos and structure—too much of either, and you're stuck.His career has been anything but linear. He's owned and operated successful restaurants, served high-stakes cocktails while juggling bottles on London's bar tops, and later traded spirits for code—designing digital waiters, recommender systems, and AI-driven accounting tools. Now, he leads the AI Builders Club Amsterdam, a fast-growing community where AI engineers, researchers, and founders push the boundaries of intelligent systems.Ask him about RAG, and he'll insist on specificity—because, as he puts it, discussing retrieval-augmented generation without clear definitions is as useful as declaring that “AI will have an impact on the world.” An engaging communicator, a sharp systems thinker, and a builder of both technology and communities, Fausto is here to challenge perspectives, deconstruct assumptions, and remix the future of AI.// Related LinksWebsite: aibuilders.clubMoravec's paradox: https://en.wikipedia.org/wiki/Moravec%27s_paradox?utm_source=chatgpt.comBehavior Modeling, Secondary AI Effects, Bias Reduction & Synthetic Data // Devansh Devansh // #311: https://youtu.be/jJXee5rMtHI~~~~~~~~ ✌️Connect With Us ✌️ ~~~~~~~Catch all episodes, blogs, newsletters, and more: https://go.mlops.community/TYExploreJoin our Slack community [https://go.mlops.community/slack]Follow us on X/Twitter [@mlopscommunity](https://x.com/mlopscommunity) or [LinkedIn](https://go.mlops.community/linkedin)] Sign up for the next meetup: [https://go.mlops.community/register]MLOps Swag/Merch: [https://shop.mlops.community/]Connect with Demetrios on LinkedIn: /dpbrinkmConnect with Fausto on LinkedIn: /stepintoliquidTimestamps:[00:00] Fausto's preferred coffee[00:26] Takeaways[01:18] Automated Ad Creative Generation[07:14] AI in Marketing Workflows[13:23] MCP and System Bottlenecks[21:45] Forward Compatibility vs Optimization[29:57] Unlocking Workflow Speed[33:48] AI Dependency vs Critical Thinking[37:44] AI Realism and Paradoxes[42:30] Outsourcing Decision-Making Risks[46:22] Human Value in Automation[49:02] Wrap up

Lenny's Podcast: Product | Growth | Career
Inside Devin: The world's first autonomous AI engineer that's set to write 50% of its company's code by end of year | Scott Wu (CEO and co-founder of Cognition)

Lenny's Podcast: Product | Growth | Career

Play Episode Listen Later May 4, 2025 92:31


Scott Wu is the co-founder and CEO of Cognition, the company behind Devin—the world's first autonomous AI software engineer. Unlike other AI coding tools, Devin works like an autonomous engineer that you can interact with through Slack, Linear, and GitHub, just like with a remote engineer. With Scott's background in competitive programming and a previous AI-powered startup, Lunchclub, teaching AI to code has become his ultimate passion.What you'll learn:1. How a team of “Devins” are already producing 25% of Cognition's pull requests, and they are on track to hit 50% by year's end2. How each engineer on Cognition's 15-person engineering team works with about five Devins each3. How Devin has evolved from a “high school CS student” to a “junior engineer” over the past year4. Why engineering will shift from “bricklayers” to “architects”5. Why AI tools will lead to more engineering jobs rather than fewer6. How Devin creates its own wiki to understand and document complex codebases7. The eight pivots Cognition went through before landing on their current approach8. The cultural shifts required to successfully adopt AI engineers—Brought to you by:Enterpret—Transform customer feedback into product growthParagon—Ship every SaaS integration your customers wantAttio—The powerful, flexible CRM for fast-growing startups—Where to find Scott Wu:• X: https://x.com/scottwu46• LinkedIn: https://www.linkedin.com/in/scott-wu-8b94ab96/—Where to find Lenny:• Newsletter: https://www.lennysnewsletter.com• X: https://twitter.com/lennysan• LinkedIn: https://www.linkedin.com/in/lennyrachitsky/—In this episode, we cover:(00:00) Introduction to Scott Wu and Devin(09:13) Scaling and future prospects(10:23) Devin's origin story(17:26) The idea of Devin as a person(22:19) How a team of “Devins” are already producing 25% of Cognition's pull requests(25:17) Important skills in the AI era(30:21) How Cognition's engineering team works with Devin's(34:37) Live demo(42:20) Devin's codebase integration(44:50) Automation with Linear(46:53) What Devin does best(52:56) The future of AI in software engineering(57:13) Moats and stickiness in AI(01:01:57) The tech that enables Devin(01:04:14) AI will be the biggest technology shift of our lives(01:07:25) Adopting Devin in your company(01:15:13) Startup wisdom and hiring practices(01:22:32) Lightning round and final thoughts—Referenced:• Devin: https://devin.ai/• GitHub: https://github.com/• Linear: https://linear.app/• Waymo: https://waymo.com/• GitHub Copilot: https://github.com/features/copilot• Cursor: https://www.cursor.com/• Anysphere: https://anysphere.inc/• Bolt: https://bolt.new/• StackBlitz: https://stackblitz.com/• Cognition: https://cognition.ai/• v0: https://v0.dev/• Vercel: https://vercel.com/• Everyone's an engineer now: Inside v0's mission to create a hundred million builders | Guillermo Rauch (founder and CEO of Vercel, creators of v0 and Next.js): https://www.lennysnewsletter.com/p/everyones-an-engineer-now-guillermo-rauch• Inside Bolt: From near-death to ~$40m ARR in 5 months—one of the fastest-growing products in history | Eric Simons (founder and CEO of StackBlitz): https://www.lennysnewsletter.com/p/inside-bolt-eric-simons• Assembly: https://en.wikipedia.org/wiki/Assembly_language• Pascal: https://en.wikipedia.org/wiki/Pascal_(programming_language)• Python: https://www.python.org/• Jevons paradox: https://en.wikipedia.org/wiki/Jevons_paradox• Datadog: https://www.datadoghq.com/• Bending the universe in your favor | Claire Vo (LaunchDarkly, Color, Optimizely, 
ChatPRD): https://www.lennysnewsletter.com/p/bending-the-universe-in-your-favor• OpenAI's CPO on how AI changes must-have skills, moats, coding, startup playbooks, more | Kevin Weil (CPO at OpenAI, ex-Instagram, Twitter): https://www.lennysnewsletter.com/p/kevin-weil-open-ai• Behind the product: Replit | Amjad Masad (co-founder and CEO): https://www.lennysnewsletter.com/p/behind-the-product-replit-amjad-masad• Windsurf: https://windsurf.com/• COBOL: https://en.wikipedia.org/wiki/COBOL• Fortran: https://en.wikipedia.org/wiki/Fortran• Magic the Gathering: https://magic.wizards.com/en• Aura frames: https://auraframes.com/• AirPods: https://www.apple.com/airpods/• Steven Hao on LinkedIn: https://www.linkedin.com/in/steven-hao-160b9638/• Walden Yan on LinkedIn: https://www.linkedin.com/in/waldenyan/—Recommended books:• How to Win Friends & Influence People: https://www.amazon.com/How-Win-Friends-Influence-People/dp/0671027034• The Power Law: Venture Capital and the Making of the New Future: https://www.amazon.com/Power-Law-Venture-Capital-Making/dp/052555999X• The Great Gatsby: https://www.amazon.com/Great-Gatsby-F-Scott-Fitzgerald/dp/0743273567—Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email podcast@lennyrachitsky.com.—Lenny may be an investor in the companies discussed. Get full access to Lenny's Newsletter at www.lennysnewsletter.com/subscribe

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

We are calling for the world's best AI Engineer talks for AI Architects, /r/localLlama, Model Context Protocol (MCP), GraphRAG, AI in Action, Evals, Agent Reliability, Reasoning and RL, Retrieval/Search/RecSys , Security, Infrastructure, Generative Media, AI Design & Novel AI UX, AI Product Management, Autonomy, Robotics, and Embodied Agents, Computer-Using Agents (CUA), SWE Agents, Vibe Coding, Voice, Sales/Support Agents at AIEWF 2025! Fill out the 2025 State of AI Eng survey for $250 in Amazon cards and see you from Jun 3-5 in SF!Coreweave's now-successful IPO has led to a lot of questions about the GPU Neocloud market, which Dylan Patel has written extensively about on SemiAnalysis. Understanding markets requires an interesting mix of technical and financial expertise, so this will be a different kind of episode than our usual LS domain.When we first published $2 H100s: How the GPU Rental Bubble Burst, we got 2 kinds of reactions on Hacker News:* “Ah, now the AI bubble is imploding!”* “Duh, this is how it works in every GPU cycle, are you new here?”We don't think either reaction is quite right. Specifically, it is not normal for the prices of one of the world's most important resources right now to swing from $1 to $8 per hour based on drastically inelastic demand AND supply curves - from 3 year lock-in contracts to stupendously competitive over-ordering dynamics for NVIDIA allocations — especially with increasing baseline compute needed for even the simplest academic ML research and for new AI startups getting off the ground.We're fortunate today to have Evan Conrad, CEO of SFCompute, one of the most exciting GPU marketplace startups, talk us through his theory of the economics of GPU markets, and why he thinks CoreWeave and Modal are well positioned, but Digital Ocean and Together are not.However, more broadly, the entire point of SFC is creating liquidity between GPU owners and consumers and making it broadly tradable, even programmable:As we explore, these are the primitives that you can then use to create your own, high quality, custom GPU availability for your time and money budget, similar to how Amazon Spot Instances automated the selective buying of unused compute.The ultimate end state of where all this is going is GPU that trade like other perishable, staple commodities of the world - oil, soybeans, milk. Because the contracts and markets are so well established, the price swings also are not nearly as drastic, and people can also start hedging and managing the risk of one of the biggest costs of their business, just like we have risk-managed commodities risks of all other sorts for centuries. 
As a former derivatives trader, you can bet that swyx doubleclicked on that…Show Notes* SF Compute* Evan Conrad* Ethan Anderson* John Phamous* The Curve talk* CoreWeave* Andromeda ClusterFull Video PodLike and subscribe!Timestamps* [00:00:05] Introductions* [00:00:12] Introduction of guest Evan Conrad from SF Compute* [00:00:12] CoreWeave Business Model Discussion* [00:05:37] CoreWeave as a Real Estate Business* [00:08:59] Interest Rate Risk and GPU Market Strategy Framework* [00:16:33] Why Together and DigitalOcean will lose money on their clusters* [00:20:37] SF Compute's AI Lab Origins* [00:25:49] Utilization Rates and Benefits of SF Compute Market Model* [00:30:00] H100 GPU Glut, Supply Chain Issues, and Future Demand Forecast* [00:34:00] P2P GPU networks* [00:36:50] Customer stories* [00:38:23] VC-Provided GPU Clusters and Credit Risk Arbitrage* [00:41:58] Market Pricing Dynamics and Preemptible GPU Pricing Model* [00:48:00] Future Plans for Financialization?* [00:52:59] Cluster auditing and quality control* [00:58:00] Futures Contracts for GPUs* [01:01:20] Branding and Aesthetic Choices Behind SF Compute* [01:06:30] Lessons from Previous Startups* [01:09:07] Hiring at SF ComputeTranscriptAlessio [00:00:05]: Hey everyone, welcome to the Latent Space podcast. This is Alessio, partner and CTO at Decibel, and I'm joined by my co-host Swyx, founder of Smol AI.Swyx [00:00:12]: Hey, and today we're so excited to be finally in the studio with Evan Conrad from SF Compute. Welcome. I've been fortunate enough to be your friend before you were famous, and also we've hung out at various social things. So it's really cool to see that SF Compute is coming into its own thing, and it's a significant presence, at least in the San Francisco community, which of course, it's in the name, so you couldn't help but be. Evan: Indeed, indeed. I think we have a long way to go, but yeah, thanks. Swyx: Of course, yeah. One way I was thinking about kicking on this conversation is we will likely release this right after CoreWeave IPO. And I was watching, I was looking, doing some research on you. You did a talk at The Curve. I think I may have been viewer number 70. It was a great talk. More people should go see it, Evan Conrad at The Curve. But we have like three orders of magnitude more people. And I just wanted to, to highlight, like, what is your analysis of what CoreWeave did that went so right for them? Evan: Sell locked-in long-term contracts and don't really do much short-term at all. I think like a lot of people had this assumption that GPUs would work a lot like CPUs and the like standard business model of any sort of CPU cloud is you buy commodity hardware, then you lay on services that are mostly software, and that gives you high margins and pretty much all your value comes from those services. Not really the underlying. Compute in any capacity and because it's commodity hardware and it's not actually that expensive, most of that can be sort of on-demand compute. And while you do want locked-in contracts for folks, it's mostly just a sort of de-risk situation. It helps you plan revenue because you don't know if people are going to scale up or down. But fundamentally, people are like buying hourly and that's how your business is structured and you make 50 percent margins or higher. This like doesn't really work in GPUs. And the reason why it doesn't work is because you end up with like super price sensitive customers. 
And that isn't because necessarily it's just way more expensive, though that's totally the case. So in a CPU cloud, you might have like, you know, let's say if you had a million dollars of hardware in GPUs, you have a billion dollars of hardware. And so your customers are buying at much higher volumes than you otherwise expect. And it's also smaller customers who are buying at higher amounts of volume. So relative to what they're spending in general. But in GPUs in particular, your customer cares about the scaling law. So if you take like Gusto, for example, or Rippling or an HR service like this, when they're buying from an AWS or a GCP, they're buying CPUs and they're running web servers, those web servers, they kind of buy up to the capacity that they need, they buy enough, like CPUs, and then they don't buy any more, like, they don't buy any more at all. Yeah, you have a chart that goes like this and then flat. Correct. And it's like a complete flat. It's not even like an incremental tiny amount. It's not like you could just like turn on some more nodes. Yeah. And then suddenly, you know, they would make an incremental amount of money more, like Gusto isn't going to make like, you know, 5% more money, they're gonna make zero, like literally zero money from every incremental GPU or CPU after a certain point. This is not the case for anyone who is training models. And it's not the case for anyone who's doing test time inference or like inference that has scales at test time. Because like you, your scaling laws mean that you may have some diminishing returns, but there's always returns. Adding GPUs always means your model does actually get. And that actually does translate into revenue for you. And then for test time inference, you actually can just like run the inference longer and get a better performance. Or maybe you can run more customers faster and then charge for that. It actually does translate into revenue. Every incremental GPU translates to revenue. And what that means from the customer's perspective is you've got like a flat budget and you're trying to max the amount of GPUs you have for that budget. And it's very distinctly different than like where Augusto or Rippling might think, where they think, oh, we need this amount of CPUs. How do we, you know, reduce that? How do we reduce our amount of money that we're spending on this to get the same amount of CPUs? What that translates to is customers who are spending in really high volume, but also customers who are super price sensitive, who don't give a s**t. Can I swear on this? Can I swear? Yeah. Who don't give a s**t at all about your software. Because a 10% difference in a billion dollars of hardware is like $100 million of value for you. So if you have a 10% margin increase because you have great software, on your billion, the customers are that price sensitive. They will immediately switch off if they can. Because why wouldn't you? You would just take that $100 million. You'd spend $50 million on hiring a software engineering team to replicate anything that you possibly did. So that means that the best way to make money in GPUs was to do basically exactly what CoreWeave did, which is go out and sign only long-term contracts, pretty much ignore the bottom end of the market completely, and then maximize your long-term contracts. With customers who don't have credit risk, who won't sue you, or are unlikely to sue you for frivolous reasons. 
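A back-of-the-envelope version of the price-sensitivity argument above, as a small TypeScript calculation. The figures (a billion dollars of hardware, a 10% software markup, a $50M engineering team) are the illustrative ballpark numbers from the conversation, not data from any specific provider:

```typescript
// Back-of-the-envelope GPU price sensitivity. All numbers are illustrative
// assumptions echoing the ballpark figures in the conversation.

const hardwareSpendUsd = 1_000_000_000; // ~$1B of GPU spend for a large training customer
const softwareMarkup = 0.10;            // a 10% premium charged for bundled software/services

// What that markup costs the customer over the life of the deal.
const markupCostUsd = hardwareSpendUsd * softwareMarkup; // $100M

// Rough cost of hiring a team to rebuild the provider's software layer in-house.
const inHouseTeamCostUsd = 50_000_000;

const savingsIfTheySwitch = markupCostUsd - inHouseTeamCostUsd;

console.log(`Markup paid: $${(markupCostUsd / 1e6).toFixed(0)}M`);
console.log(`Net savings from rebuilding in-house: $${(savingsIfTheySwitch / 1e6).toFixed(0)}M`);
// At this scale the customer defects the moment a cheaper bare-metal deal exists,
// which is why software margins don't stick to GPU rentals.
```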
And then because they don't have credit risk and they won't sue you for frivolous reasons, you can go back to your lender and you can say, look, this is a really low risk situation for us to do. You should give me prime, prime interest rate. You should give me the lowest cost of capital you possibly can. And when you do that, you just make tons of money. The problem that I think lots of people are going to talk about with CoreWeave is it doesn't really look like a cloud platform. It doesn't really look like a cloud provider financially. It also doesn't really look like a software company financially.Swyx [00:05:37]: It's a bank.Evan [00:05:38]: It's a bank. It's a real estate company. And it's very hard to not be that. The problem of that that people have tricked themselves into is thinking that CoreWeave is a bad business. I don't think CoreWeave is explicitly a bad business. There's a bunch of people, there's kind of like two versions of the CoreWeave take at the moment. There's, oh my God, CoreWeave, amazing. CoreWeave is this great new cloud provider competitive with the hyperscalers. And to some extent, this is true from a structural perspective. Like, they are indeed a real sort of thing against the cloud providers in this particular category. And the other take is, oh my gosh, CoreWeave is this horrible business and so on and blah, blah, blah. And I think it's just like a set of perception or perspective. If you think CoreWeave's business is supposed to look like the traditional cloud providers, you're going to be really upset to learn that GPUs don't look like that at all. And in fact, for the hyperscalers, it doesn't look like this either. My intuition is that the hyperscalers are probably going to lose a lot of money, and they know they're going to lose a lot of money on reselling NVIDIA GPUs, at least. Hyperscalers, but I want to, Microsoft, AWS, Google. Correct, yeah. The Microsoft, AWS, and Google. Does Google resell? I mean, Google has TPUs. Google has TPUs, but I think you can also get H100s and so on. But there are like two ways they can make money. One is by selling to small customers who aren't actually buying in any serious volume. They're testing around, they're playing around. And if they get big, they're immediately going to do one of two things. They're going to ask you for a discount. Because they're not going to pay your crazy sort of margin that you have locked into your business. Because for CPUs, you need that. They're going to pay your massive per hour price. And so they want you to sign a long-term contract. And so that's your other way that you can make money, is you can basically do exactly what CoreWeave does, which is have them pay as much as possible upfront and lock in the contract for a long time. Or you can have small customers. But the problem is that for a hyperscaler, the GPUs to... To sell on the low margins relative to what your other business, your CPUs are, is a worse business than what you are currently doing. Because you could have spent the same money on those GPUs. And you could have trained model and you could have made a model on top of it and then turn that into a product and had high margins from your product. Or you could have taken that same money and you could have competed with NVIDIA. And you could have cut into their margin instead. But just simply reselling NVIDIA GPUs doesn't work like your CPU business. Where you're able to capture high margins from big customers and so on. 
And then they never leave you because your customers aren't actually price sensitive. And so they won't switch off if your prices are a little higher. You actually had a really nice chart, again, on that talk of this two by two. Sure. Of like where you want to be. And you also had some hot takes on who's making money and who isn't. Swyx: So CoreUv locked up long-term contracts. Get that. Yes. Maybe share your mental framework. Just verbally describe it because we're trying to help the audio listeners as well. Sure. People can look up the chart if they want to. Evan: Sure. Okay. So this is a graph of interest rates. And on the y-axis, it's a probability you're able to sell your GPUs from zero to one. And on the x-axis, it's how much they'll depreciate in cost from zero to one. And then you had ISO cost curves or ISO interest rate curves. Yeah. So they kind of shape in a sort of concave fashion. Yeah. The lowest interest rates enable the most aggressive. form of this cost curve. And the higher interest rates go, the more you have to push out to the top right. Yeah. And then you had some analysis of where every player sits in this, including CoreUv, but also Together and Modal and all these other guys. I thought that was super insightful. So I just wanted to elaborate. Basically, it's like a graph of risk and the genres of places where you can be and what the risk is associated with that. The optimal thing for you to do, if you can, is to lock in long-term contracts that are paid all up front or in with a situation in which you trust the other party to pay you over time. So if you're, you know, selling to Microsoft or something or OpenAI. Which are together 77% of the revenue of CoreUv. Yeah. So if you're doing that, that's a great business to be in because your interest rate that you can pitch for is really low because no one thinks Microsoft is going to default. And like maybe OpenAI will default, but the backing by Microsoft kind of doesn't. And I think there's enough, like, generally, it looks like OpenAI is winning that you can make it's just a much better case than if you're selling to the pre-seed startup that just raised $30 million or something pre-revenue. It's like way easier to make the case that the OpenAI is not going to default than the pre-seed startup. And so the optimal place to be is selling to the maximally low risk customer for as long as possible. And then you never have to worry about depreciation and you make lots of money. The less. Good. Good place to be is you could sell long-term contracts to people who might default on you. And then if you're not bringing it to the present, so you're not like saying, hey, you have to pay us all up front, then you're in this like more risky territory. So is it top left of the chart? If I have the chart right, maybe. Large contracts paid over time. Yeah. Large contracts paid over time is like top left. So it's more risky, but you could still probably get away with it. And then the other opportunity is that you could sell short-term contracts for really high prices. And so lots of people tried that too, because this is actually closer to the original business model that people thought would work in cloud providers for CPUs. It works for CPUs, but it doesn't really work for GPUs. And I don't think people were trying this because they were thinking about the risk associated with it. 
I think a lot of people are just come from a software background, have not really thought about like cogs or margins or inventory risk or things that you have to worry about in the physical world. And I think they were just like copy pasting the same business model onto CPUs. And also, I remember fundraising like a few years ago. And I know based on. Like what we knew other people were saying who were in a very similar business to us versus what we were saying. And we know that our pitch was way worse at the time, because in the beginning of SF Compute, we looked very similar to pretty much every other GPU cloud, not on purpose, but sort of accidentally. And I know that the correct pitch to give to an investor was we will look like a traditional CPU cloud with high margins and we'll sell to everyone. And that is a bad business model because your customers are price sensitive. And so what happens is if you. Sell at high prices, which is the price that you would need to sell it in order to de-risk your loss on the depreciation curve, and specifically what I mean by that is like, let's say you're selling it like $5 an hour and you're paying $1.50 an hour for the GPU under the hood. It's a little bit different than that, but you know, nice numbers, $5 an hour, $1.50 an hour. Great. Excellent. Well, you're charging a really high price per GPU hour because over time the price will go down and you'll get competed out. And what you need is to make sure that you never go under, or if you do go under your underlying cost. You've made so much money in the first part of it that the later end of it, like doesn't matter because from the whole structure of the deal, you've made money. The problem is that just, you think that you're going to be able to retain your customers with software. And actually what happens is your customers are super price sensitive and push you down and push you down and push you down and push you down, um, that they don't care about your software at all. And then the other problem that you have is you have, um, really big players like the hyperscalers who are looking to win the market and they have way more money than you, and they can push down on margin. Much better than you can. And so if they have to, and they don't, they don't necessarily all the time, um, I think they actually keep pride of higher margin, but if they needed to, they could totally just like wreck your margin at any point, um, and push you down, which meant that that quadrant over there where you're charging a high price, um, and just to make up for the risk completely got destroyed, like did not work at all for many places because of the price sensitivity, because people could just shove you down instead that pushed everybody up to the top right-hand corner of that, which is selling short-term. Contracts for low prices paid over time, which is the worst place to be in, um, the worst financial place to be in because it has the highest interest rate, um, which means that your, um, your costs go up at the same time, your, uh, your incoming cash goes down and squeezes your margins and squeezes your margins. The nice thing for like a core weave is that most of their business is over on the, on the other sides of those quadrants that the ones that survive. The only remaining question I have with core weave, and I promise I get to ask if I can compute, and I promise this is relevant to SOF Compute in general, because the framework is important, right? Sure. To understand the company. 
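Here is a rough TypeScript sketch of the $5-an-hour-versus-$1.50-an-hour reasoning above: charge a high hourly price early and check whether the early margin outruns the later period once the market rate decays below the underlying cost. The decline schedule and horizon are assumptions for illustration, not market data:

```typescript
// Does a high early price cover the later months when GPU rental prices fall
// below cost? All numbers are illustrative assumptions based on the
// $5/hr price vs $1.50/hr underlying cost mentioned above.

const underlyingCostPerHour = 1.5;  // what the operator pays per GPU-hour
const startingPricePerHour = 5.0;   // what the market will pay at launch
const monthlyPriceDecline = 0.10;   // assumed: market price drops 10% per month
const hoursPerMonth = 730;
const horizonMonths = 36;           // rough 3-year depreciation horizon

let cumulativeMarginPerGpu = 0;
for (let month = 0; month < horizonMonths; month++) {
  const marketPrice = startingPricePerHour * (1 - monthlyPriceDecline) ** month;
  // The operator keeps selling at the going rate, even once it dips below cost.
  cumulativeMarginPerGpu += (marketPrice - underlyingCostPerHour) * hoursPerMonth;
}

console.log(
  `Cumulative margin per GPU over ${horizonMonths} months: $${cumulativeMarginPerGpu.toFixed(0)}`
);
```

With these particular assumptions the cumulative margin actually ends up negative: the later loss-making months wipe out the early gains, which is the failure mode of the high-price, short-term quadrant once price-sensitive buyers compress prices.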
So why didn't NVIDIA or Microsoft, both of which have more money than core weave, do core weave, right? Why didn't they do core weave? Why have this middleman when either NVIDIA or Microsoft have more money than God, and they could have done an internal core weave, which is effectively like a self-funding vehicle, like a financial instrument. Why does there have to be a third party? Your question is like... Why didn't Microsoft, or why didn't NVIDIA just do core weave? Why didn't they just set up their own cloud provider? I think, and I don't know, and so correct me if I'm wrong, and lots of people will have different opinions here, or I mean, not opinions, they'll have actual facts that differ from my facts. Those aren't opinions. Those are actually indeed differences of reality, is that NVIDIA doesn't want to compete with their customers. They make a large amount of money by selling to existing clouds. If they launched their own core weave, then it would be a lot more money. It'd make it much harder for them to sell to the hyperscalers, and so they have a complex relationship with there. So not great for them. Second is that, at least for a while, I think they were dealing with antitrust concerns or fears that if they're going through, if they own too much layers of the stack, I could imagine that could be a problem for them. I don't know if that's actually true, but that's where my mind would go, I guess. Mostly, I think it's the first one. It's that they would be competing directly with their primary customers. Then Microsoft could have done it, right? That's the other question. Yeah, so Microsoft didn't do it. And my guess is that... NVIDIA doesn't want Microsoft to do it, and so they would limit the capacity because from NVIDIA's perspective, both they don't want to necessarily launch their own cloud provider because it's competing with their customers, but also they don't want only one customer or only a few customers. It's really bad for NVIDIA if you have customer concentration, and Microsoft and Google and Amazon, like Oracle, to buy up your entire supply, and then you have four or five customers or so who pretty much get to set prices. Monopsony. Yeah, monopsony. And so the optimal thing for you is a diverse set of customers who all are willing to pay at whatever price, because if you don't, somebody else will. And so it's really optimal for NVIDIA to have lots of other customers who are all competing against each other. Great. Just wanted to establish that. It's unintuitive for people who have never thought about it, and you think about it all day long. Yeah. Swyx: The last thing I'll call out from the talk, which is kind of cool, and then I promise we'll get to SF Compute, is why will DigitalOcean and Together lose money on their clusters? Why will DigitalOcean and Together lose money on their clusters?Evan [00:16:33]: I'm going to start by clarifying that all of these businesses are excellent and fantastic. That Together and DigitalOcean and Lambda, I think, are wonderful businesses who build excellent products. But my general intuition is that if you try to couple the software and the hardware together, you're going to lose money. That if you go out and you buy a long-term contract from someone and then you layer on services, or you buy the hardware yourself and you spin it up and you get a bunch of debt, you're going to run into the same problem that everybody else did, the same problem we did, same problem the hyperscalers did. 
And that's exactly what the hyperscalers are doing, which is you cannot add software and make high margins like a cloud provider can. You can pitch that into investors and it will totally make sense, and it's like the correct play in CPUs, but there isn't software you could make to make this occur. If you're spending a billion dollars on hardware, you need to make a billion dollars of software. There isn't a billion dollars of software that you can realistically make, and if you do, you're going to look like SAP. And that's not a knock on SAP. SAP makes a f**k ton of money, right? Right. Right. Right. Right. There aren't that many pieces of software that you could make, that you can realistically sell, like a billion dollars of software, and you're probably not going to do it to price-sensitive customers who are spending their entire budget already on compute. They don't have any more money to give you. It's a very hard proposition to do. And so many parties have been trying to do this, like, buy their own compute, because that's what a traditional cloud does. It doesn't really work for them. You know that meme where there's, like, the Grim Reaper? And he's, like, knocking on the door, and then he keeps knocking on the next door? We have just seen door after door after door of the Grim Reeker comes by, and the economic realities of the compute market come knocking. And so the thing we encourage folks to do is if you are thinking about buying a big GPU cluster and you are going to layer on software on top, don't. There are so many dead bodies in the wake there. We would recommend not doing that. And we, as SF Compute, our entire business is structured to help you not do that. It's helped disintegrate these. The GPU clouds are fantastic real estate businesses. If you treat them like real estate businesses, you will make a lot of money. The cloud services you can make on that, all the software you want to make on that, you can do that fantastically. If you don't own the underlying hardware, if you mix these businesses together, you get shot in the head. But if you combine, if you split them, and that's what the market does, it helps you split them, it allows you to buy, like, layer on services, but just buy from the market, you can make lots of money. So companies like Modal, who don't own the underlying compute, like they don't own it, lots of money, fantastic product. And then companies like Corbeave, who are functionally like really, really good real estate businesses, lots of money, fantastic product. But if you combine them, you die. That's the economic reality of compute. I think it also splits into trading versus inference, which are different kinds of workloads. Yeah. And then, yeah, one comment about the price sensitivity thing before we leave this. This topic, I want to credit Martin Casado for coining or naming this thing, which is like, you know, you said, you said this thing about like, you don't have room for a 10% margin on GPUs for software. Yep. And Martin actually played it out further. It's his first one I ever saw doing this at large enough runs. So let's say GPT-4 and O1 both had a total trading cost of like a $500 billion is the rough estimate. When you get the $5 billion runs, when you get the $50 billion runs, it is actually makes sense to build your own. You're going to have to get into chips, like for OpenEI to get into chip design, which is so funny. I would make an ASIC for this run. Yeah, maybe. 
I think a caveat of that argument that is not super well thought about is that it only works if you're really confident. It only works if you really know which chip you're going to build. If you don't, then it's a little harder. So in my head it makes more sense for inference, where you've already established the workload. For training there's so much experimentation that generality is much more useful. Yeah. In some sense, Google is like six generations into their TPUs. Yeah. Okay, cool. Maybe we should go into SF Compute now. Sure. Yeah. Alessio [00:20:37]: Yeah. So you kind of talked about the different providers. Why did you decide to go with this approach, and maybe talk a bit about how the market dynamics have evolved since you started the company? Evan [00:20:47]: So originally we were not doing this at all. We were definitely forced into this to some extent. SF Compute started because we wanted to go train models for music and audio in general. We were going to do a sort of generic audio model at some point, and then we were going to do a music model at some point. It was an early company; we hadn't really specced down on a particular thing. But yeah, we were going to do a music model and an audio model. The first thing you do when you start any AI lab is go out and buy a big cluster. The thing we had seen everybody else do was raise a really big round and then get stuck, because if you raise the amount of money that you need to train a model initially, like the $50 million pre-seed, pre-revenue, your valuation is so high or you get diluted so much that you can't raise the next round. And that's a very big ask to make. And also, I don't know, we just felt like we couldn't do it. We probably could have in retrospect, but one, we didn't really feel like we could do it, and two, it felt like if we did, we would have been stuck later on. We didn't want to raise the big round. So instead we thought, surely by now we would be able to just go to any provider and buy what a traditional CPU cloud would offer you, buy on demand, or buy a month at a time, and so on. And this worked for small incremental things, and that's what we were basing it off of. We just assumed we could go to Lambda or somewhere and buy thousands of, at the time, A100s. And that was just not at all the case. So we started doing all the sales calls with people and we said, okay, can we just get month to month? Can we get one month of compute? Everyone told us at the time: no, you need a year-long contract or longer, or you're out of luck, sorry. And at the time, we were just pissed off. Why will nobody sell us a month at a time? Nowadays we totally understand why, because it's the same economic reason: if they had sold us month to month and we had canceled, they would carry massive risk on that. And so the optimal thing for them to do was to just completely abandon that section of the market. We didn't like that. So our plan was: we were going to buy a year-long contract anyway, we would use a month, and then we would sublease the other eleven months. We were locked in for a year, but we only had to pay month by month. And so we did this.
But then immediately we said: oh, s**t, now we have a cloud provider, not a training-models company, not an AI lab, because every 30 days we owed about five hundred thousand dollars, and we had about five hundred thousand dollars in the bank. So that meant that every single month, if we did not sell out our cluster, we would just go bankrupt. That's what we did for the first year of the company. And when you're in that position, you try to think how in the world you get out of that position. What that transitioned into was: okay, we tend to be pretty good at selling this cluster every month, because we haven't died yet, so what we should do is basically be this broker for other people. We would be more like GPU real estate, or a GPU realtor. So we started doing that for a while, where we would go to someone who was trying to sell, say, a year-long contract, and we'd find another person who maybe wanted six months, and somebody else who wanted the other six months, and we'd combine all these people together to make the deal happen. We'd organize these one-off bespoke deals that basically ended up with us taking a bunch of customers, us signing with a vendor, taking some cut, and then us operating the cluster for people, typically bare metal. So we were doing this, but it was definitely an "oh s**t, oh s**t, how do we get out of our current situation" and less a strategic plan of any sort. But while we were doing this, since the beginning of the company, we had been thinking about how to buy GPU clusters and how to sell them effectively, because we'd seen every part of it. And what we ended up with, because we were these GPU brokers, was a book of everybody who's trying to buy and everybody who's trying to sell. And that turned into what is today SF Compute, which is a compute market, which we think is functionally the most liquid GPU market of any capacity. Honestly, I think we're the only thing that actually is a real market, where there are bids and asks and there's a trading engine that combines everything. And so I think we're the only place where you can do the things that a market should be able to do. You can go on SF Compute today and get thousands of H100s for an hour if you want, and that's because there is a price for thousands of GPUs for an hour. That is not a thing you can reasonably do on any other cloud provider, because nobody should realistically sell you thousands of GPUs for an hour; they should sell it to you for a year or so. But one of the nice things about a market is that you can buy the year on SF Compute, and then if you need to sell back, you can sell back as well. And that opens up all these little pockets of liquidity, where somebody who's just trying to buy for a little bit of time, some burst capacity (people don't normally buy for an hour, that's not actually a realistic thing, but it's in that range), somebody who, like us, needed to buy for a month, can actually buy for a month. They can place the order and there is actually a price for that. And it typically comes from somebody else who's selling back.
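As a rough illustration of the "bids and asks plus a trading engine" idea described here, below is a hypothetical sketch of matching buy and sell orders for a single delivery hour. None of these class or field names come from SF Compute's actual system, which handles multi-hour blocks, co-location constraints, and much more; this only shows the basic crossing logic.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical sketch of a single-hour GPU order book; not SF Compute's actual engine.

@dataclass
class Order:
    side: str          # "buy" or "sell"
    price: float       # dollars per GPU-hour
    gpus: int          # quantity of GPUs for this hour

@dataclass
class HourlyBook:
    bids: List[Order] = field(default_factory=list)   # buys, best (highest) price first
    asks: List[Order] = field(default_factory=list)   # sells, best (lowest) price first

    def add(self, order: Order) -> None:
        book = self.bids if order.side == "buy" else self.asks
        book.append(order)
        book.sort(key=lambda o: o.price, reverse=(order.side == "buy"))

    def match(self) -> List[tuple]:
        """Cross the book: trade whenever the best bid meets or exceeds the best ask."""
        trades = []
        while self.bids and self.asks and self.bids[0].price >= self.asks[0].price:
            bid, ask = self.bids[0], self.asks[0]
            qty = min(bid.gpus, ask.gpus)
            trades.append((ask.price, qty))          # fill at the resting ask price
            bid.gpus -= qty
            ask.gpus -= qty
            if bid.gpus == 0:
                self.bids.pop(0)
            if ask.gpus == 0:
                self.asks.pop(0)
        return trades

book = HourlyBook()
book.add(Order("sell", 1.10, 512))    # a cloud offering idle capacity for this hour
book.add(Order("sell", 1.40, 1024))
book.add(Order("buy", 1.25, 768))     # a lab that wants a short burst
print(book.match())                   # -> [(1.1, 512)]; the rest of the bid rests unfilled
```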
Typically that's somebody who bought a longer-term contract for some period of time, their code doesn't work, and now they need to sell off a little bit.Alessio [00:25:49]: What are the utilization rates at which a market like this works? What do you see as the usual GPU utilization rate, and at what point does the market get saturated?Evan [00:26:00]: Assuming there are no hardware problems or software problems, the utilization rate is near 100 percent, because the price dips until the utilization is 100 percent. The price actually has to dip quite a lot for the utilization not to be 100 percent. That's not always the case, because you just have logistical problems: you get a cluster and parts of the InfiniBand fabric are broken, there's some issue with some switch somewhere, and so you have to take some portion of the cluster offline, or stuff like this. There are just underlying physical realities of the clusters. But nominally we have better utilization than basically anybody. That's utilization of the cluster, which doesn't necessarily translate directly into revenue, but I actually do think we make more overall money for our underlying vendors than pretty much anybody else. We work with the other GPU clouds, and the basic pitch to them is: one, we can be your broker, so we can find you the long-term contracts at the prices that you want; and meanwhile, while your cluster is idle, we can increase your utilization and get you more money, because we can sell that idle cluster for you, and the moment we find the bigger, longer-term customer and they come on, you can kick off those people and move them to the other capacity. You get the mix of selling your cluster at whatever price you can get on the market, and selling your cluster at the big price that you want for a long-term contract, which is your ideal business model. And then the benefit of the whole thing being on the market is that you can pitch your customer that they can cancel their long-term contract, which is not a thing you can reasonably do if you are just the GPU cloud. If you're just the GPU cloud, you can never let them cancel the contract, because that introduces so much risk that you would otherwise not get your cheap cost of capital or whatever. But if you're selling it through the market, or you're selling it with us, then you can say: hey, look, you can cancel for a fee, and that fee is the difference between the price on the market and the price that they paid. Which means that if they cancel, you have the ability to offer that flexibility, but you don't have to take the risk of it. The money's already there, you got paid, it's just being sold to somebody else. One of our top pieces from last year was talking about the H100 glut from all the long-term contracts that were not being fully utilized and were being put onto the market. You have on here one-dollar-per-hour contracts, and it goes up to two. Actually, I think you were involved. You were obliquely quoted in that article. I think you remember. I remember, because this was hidden. Well, we hid your name, but then you were like, yeah, it's us. Yeah. Could you talk about the supply and demand of H100s? Was that just a normal cycle? Was that a super cycle because of all the VC funding that went in in 2023? What was that like? GPU prices have come down.
Yeah, GPU prices have come down. Some part of that is the normal depreciation cycle. Some part of that is that there were a lot of startups that bought GPUs and never used them, and now they're renting them out, and therefore you exist. There are a lot of theories as to why this happened. I dislike all of them, because they're often said with really high confidence, and I think the market is much more complicated than that. Of course. And so everything I'm going to say is very hedged. But there was a series of places where a bunch of the orders were placed, and people were pitching to their customers and their investors and the broader market that the clusters would arrive on time. And that is not how the world works. Because there was such a quick build-out, you would end up with bottlenecks in the supply chain somewhere that had nothing to do with the chip itself: it's the InfiniBand cables, or the NICs, or you need a bunch of generators, or you don't have data center space. There's always some bottleneck somewhere else. And so a lot of the clusters didn't come online within the expected period of time. But then all the bottlenecks got sorted out, and then they all came online at the same time. So I think you saw a shortage because supply chains are hard, and then you saw a glut because the supply chain eventually figured itself out. And specifically, people over-ordered in order to get the allocation that they wanted, then they got the allocations, and then they went under, or whatever. There were just a lot of shenanigans. A caveat to this: every time you see somebody say "over-ordered," there is this assumption that the problem was that demand went down. I don't think that's the case at all, and I want to clarify that. It still looks like a shortage in the sense that there's more demand for GPUs than there ever was; it's just that there was also more supply. So at the moment, I think there is still functionally a glut. But the difference that I think is happening is mostly the test-time inference stuff: you just need way more chips for that than you did before. And whenever you make a statement about the current market, people take your words and assume you're making a statement about the future market. So if you say there's a glut now, people will continue to think there's a glut. But my general prediction is that by the winter, we will be back towards shortage. Then again, this very much depends on the rollout of future chips, and that comes with its own uncertainty. I'm trying to give you a good "here's Evan's forecast," but I don't know if my forecast is right. You don't have to. Nobody is going to hold you to it. But I think people want to know what's true and what's not, and there's a lot of vague speculation from people who are not actually that close to the market, and you are. I think I'm closer to the market, but also a vague speculator. There are a lot of really highly confident speculators, and I am indeed a vague speculator. I think I have more information than a lot of other people, and this makes me a vaguer speculator, because I feel less certain and less confident than I think a lot of other people do.
The thing I do feel reasonably confident about saying is that test-time inference is probably going to quite significantly expand the amount of compute used for inference. A caveat: pretty much all the inference demand today is in a few companies. A good example is that lots of bio and pharma companies were using H100s to train bio models of sorts. They would come along and buy thousands of H100s for training, and then not a lot for inference, not relative to an OpenAI or an Anthropic, because they don't have a consumer product. For their inference, if they can do it right, there's really only one inference event that matters. Obviously they're going to run it in batch, and they're not going to literally run one inference event, but the one that produces the drug is the important one. And I'm dumb and I don't know anything about biology, so I could be completely wrong here, but my understanding is that's the gist. I can check that for you. You can check that for me. But my understanding is the one that produces the sequence that is the drug that cures cancer or whatever, that's the important one. And a lot of models look like this, where they're more enterprise use cases, or they predate anything that looks like test-time inference: lots and lots of demand for training, and then it pretty much entirely fell off for inference. We looked at OpenRouter, for example, and the entirety of OpenRouter that was not Anthropic or Gemini or OpenAI was something like 10 H100 nodes. It's just not that much. It's not that many GPUs to service that entire demand, and that's a really sizable portion of the open source market, but the actual amount of compute needed for it was not that much. Whereas what an OpenAI needs for GPT-4 is tremendously big, but that's because it's a consumer product that has almost all the inference demand. Yeah, that matches a message we've had: open source AI compared to closed AI is roughly 5%. Yeah, it's super small. But test-time inference changes that quite significantly. So I will expect that to increase overall demand. But my question on whether or not that actually affects the compute price is entirely based on how quickly we roll out the next chips. The way that you burst is different for test time.Alessio [00:34:01]: Any thoughts on the third part of the market, which is the more peer-to-peer, distributed, sometimes crypto-enabled providers, like Hyperbolic, Prime Intellect, and all of that? Where do those fit? Do you see a lot of people wanting to participate in a peer-to-peer market? Or, just because of the capital requirements, does it not really matter at the end of the day?Evan [00:34:20]: I'm wildly skeptical of these, to be frank. The dream is like SETI@home, right? I've got this 5090, nobody has a 5090, this 4090 sitting at home, and I can rent it out. Yeah. I just don't really think this is ever going to be more efficient than a fully interconnected cluster with InfiniBand or whatever the next spec might be. I could be completely wrong. But speaking of which, the speed of light is really hard to beat.
And regardless of what you're using, you just can't get around that physical limitation. You could imagine a decentralized market that still has a lot of places where there's co-location, but then you would get something that looks like SF Compute, and that's what we do. Our general take is: on SF Compute, you're not buying from random people, you're buying from the other GPU clouds, functionally. You're buying from data centers, the same genre of people you would work with already, and you can specify: oh, I want all these nodes to be co-located. I don't think you're really going to get around that. And I buy crypto for the purposes of transferring money; the financial system is quite painful, and so on. I can understand the uses of it to incentivize an initial market, or to try to get around the cold-start problem. We've been able to get around the cold-start problem just fine, so we didn't actually need that at all. What I do think is totally possible is that you could launch a token and then subsidize the compute prices for a bit, and maybe that will help you. I think that's what Nous is doing. Yeah, I think there are lots of people trying to do things like this, but at some point that runs out. So I would generally agree. I think the only threat to that model is very fine-grained mixture of experts, where the algorithms can shift to adapt to hardware realities. And the hardware reality is: okay, it's annoying to do large co-located clusters, so we'll just redesign attention, or whatever, in our architecture to distribute it more. There was a little bit of buzz around block attention last year that Strong Compute made a big push on. In a world where we have 200 experts in an MoE model, it starts to look a little bit better. I don't disagree with this. I can imagine a world in which you've redesigned things to be more parallelizable across space.Evan [00:36:43]: But without that, your hardware limitation is your speed-of-light limitation, and that's a very hard one to get around.Alessio [00:36:50]: Any customers or stories that you want to shout out, maybe things that wouldn't have been economically viable otherwise? I know there's some sensitivity on that.Evan [00:37:00]: My favorites are grad students, folks who are trying to do things that would normally require the scale of a big lab. Grad students are the worst possible customers for the traditional GPU clouds, because they will immediately churn if you sell them a thing: they're going to graduate, and that project isn't going to keep spending lots of money. Sometimes it does, but not if you're working with a university or a lab of some sort. A lot of the time, the ability for us to offer big burst capacity is lovely and wonderful, and it's one of my favorite things to do, because all those folks look like we did. And I have a special place in my heart for that, for young hackers and young grad students and researchers who are trying to do the same genre of thing that we are doing.
For the same reason, I have a special place in my heart for the startups, the people who are actively trying to compete at the same scale but can't afford it time-wise, though they can afford it spike-wise. Yeah, I liked your example of: I have a grant of 100K and it's expiring, so I've got to spend it. That's really beautiful. Yeah. Interesting. Has there been interesting work coming out of that? Anything you want to mention? Yeah. So from a startup perspective, Standard Intelligence and Phind, P-H-I-N-D. We've had them on the pod.Swyx [00:38:23]: Yeah. Yeah.Evan [00:38:23]: That was great. And then from a grad student perspective, we've worked a lot with the Schmidt Futures grantees of various sorts. My fear is that if I talk about their research, I will be completely wrong to an almost insulting degree, because I am very dumb. But yeah. I think one thing that's maybe also relevant, startups and GPUs-wise, is that there was a brief moment where it kind of made sense for VCs to provide GPU clusters. And obviously you worked at AI Grant, which set up Andromeda, which is supposedly a $100 million cluster. Yeah. I can explain why that's the case, or why anybody would think that would be smart, because I remember before any of that happened, we were asking for it to happen. Yeah. And the general reason is credit risk. Again, it's a bank. Yeah. I have lower risk than you due to credit transformation. I take your risk onto my balance sheet. Correct. Exactly. For a while, if you wanted to go set up a GPU cluster, you had to be the one that actually bought the hardware and racked it and stacked it, co-located it somewhere with someone. Functionally, it was on your balance sheet, which means you had to get a loan. And you cannot get a loan for $50 million as a startup, not really. You can get venture debt and stuff, but it's very, very difficult to get a loan of any serious size. But it's not that difficult to get a loan for $50 million if you already have a fund, or you already have a lot of assets under management somewhere, or you personally can do a personal guarantee for it, or something like this. If you have a lot of money, it is way easier for you to get a loan than if you don't have a lot of money. And so the hack of a VC or some capital partner offering equity for compute is always some arbitrage on the credit risk. That's amazing. Yeah. That's a hack. You should do that. I don't think people should do it right now. I think it made sense at the time, and it was helpful and useful for the people who did it at the time, but it was a one-time arbitrage, because now there are lots of other sources that can do it. It made sense when no one else was doing it and you were the only person doing it, but now it's an arbitrage that gets competed down. Sure. So it's no longer super effective; I wouldn't totally recommend it. It's great that Andromeda did it, but the marginal benefit of somebody else doing it now is not super high. I don't think that many people have followed in their footsteps. I think maybe Andreessen did it. Yeah. That's it. I think that's just because pretty much all the value flows through Andromeda. What? That cannot be true. How many companies are in AI Grant? Like 50? My understanding of Andromeda is that it works with all the NFDG companies, or at least several of the NFDG companies.
But I might be wrong about that. Again, you know, something something, Nat, don't kill me, I could be completely wrong. But I think Andromeda was an excellent idea to do at the right time in which it occurred. Perfect. His timing is impeccable. Yeah. Nat and Daniel are, I mean, there are lots of people who are like... Seers? Yeah. Seers, like S-E-E-R. Oh, seers. Like seers of the Valley. Yeah. For years and years before the ChatGPT moment or anything, they had fully understood what was going to happen, way, way before. AI Grant is like five, six, seven years old, depending on where you start counting. The nonprofit version. Yeah, the nonprofit version had been happening for a while; it's been going on for quite a bit of time. And Nat and Daniel were early investors in a lot of the early AI labs of various sorts. They've been doing this for a bit.Alessio [00:41:58]: I was looking at your pricing yesterday, and we were kind of talking about it before. There's this weird thing where one week is more expensive than both one day and one month. Yeah. What are some of the market pricing dynamics? To somebody who is not in the business, this looks really weird, so I'm curious if you have an explanation for it, or if that looks normal to you. Yeah.Evan [00:42:18]: So the simple answer is that preemptible pricing is cheaper than non-preemptible pricing, and the same economic principle is the reason why that's the case here, although that's not entirely how it works on SF Compute. SF Compute doesn't really have the concept of preemptible. Instead, what it has is very short reservations. You go to a traditional cloud provider and you can say, hey, I want a reserved contract for a year. We will let you do a reserved contract for one hour, which is part of the point of SF Compute. And what you can do is just buy every single hour continuously: you're reserving just for that hour, and then the next hour you reserve just for that next hour. This is obviously an automation that you can build. But what you're seeing when you see the cheap price is somebody who's buying the next hour, but not necessarily buying the hour after that. So if the price goes up too much, they might not get that next hour. And the underlying part of the market where that's coming from: you can imagine day-old milk, or milk that's about to go old. It might drop its price until it's expired, because nobody wants to buy milk that's in the past, or maybe you can't legally sell it. Compute is the same way. You can't sell a block of compute that is in the past. And so what you should do in the market, and what people do do, is take a block of compute and drop the price, and drop it and drop it, down to a floor price right before it's about to expire, and keep dropping it until it clears. So anything that is idle drops until some point. If you go on the website and set that chart to a week from now, what you'll see is much more normal-looking curves. But if you say, oh, I want to start right now, that immediate, instant, here-is-the-compute-I-want-right-now price is functionally the preemptible price.
That's where most people are getting the best compute prices from. The caveat is that you can do really fun stuff on SFC if you want. Because it's not actually preemptible, it's reserved, but only reserved for an hour, the optimal way to use SF Compute is to just buy at the market price but set a limit price that is much higher. So you can set a limit price of, say, four dollars and say: if the market ever happens to spike up to four dollars, then don't buy; I don't want to buy at that price for an hour. But otherwise, just buy at the cheapest price. And if you're comfortable with the volatility of that, you're actually going to get really good prices, like close to a dollar an hour, sometimes down to 80 cents or whatever. You said four, though. Yeah. So that's the thing: four is your max price, four is where you basically want to pull the plug and say don't do it, because the actual average price, the preemptible price, doesn't actually look like that. So what you're doing when you're saying four is: always, always, always give me this compute, continue to buy every hour, don't preempt me, don't kick me off, and just buy at the preemptible price. The only time you get kicked off is if there is a big price spike, say one day out of the year there's a four-dollar-an-hour price because of some weird fluke or something. All the other periods of time, you're actually getting a much lower price. It makes sense. The average cost that you're actually paying is way better, and your trade-off is that you don't know exactly what price you're going to get, so it's volatile. But historically, everyone who's done this has gotten wildly better prices. This is one of the clever things you can do with the market: if you're willing to make those trade-offs, you can get really good prices. You can also do other things, like only buy at night. The price goes down at night, so you can say, oh, I only want to buy if the price is lower than 90 cents, and if you have some long-running job, you can make it only run at 90 cents and then resume later, and so on. Yeah. So what you can kind of create is like a spot instance, which is what the CPU world has. Yes. But you've created a system where you can manufacture the exact profile that you want. Exactly. Not just whatever the hyperscalers offer you, which is usually just one thing. Correct. SF Compute is like the power tool; the underlying primitive of hourly compute is there. Correct. Yeah, it's pretty interesting. I've often asked OpenAI, and all these guys, Claude as well: they do batch APIs, so it's half off of whatever your price is, and the only contract is that they'll return results within 24 hours. Sure. Right. And I was like, 24 hours is good, but sometimes I want one hour, I want four hours, I want something in between. And so, based off of SF Compute's system, you could actually create that kind of guarantee. Totally. It would be, you know, not 24 hours, but within eight hours, within four hours, like half of a workday, I can return your results to you.
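Here is a hedged sketch of the two strategies described above: buying every hour at the market price with a high limit, and a deadline-tolerant batch job that only buys cheap hours until it has to run. The `get_market_price` and `reserve_hour` helpers, and the prices they produce, are hypothetical stand-ins rather than SF Compute's actual API.

```python
import random

# Hypothetical helpers standing in for a real market API; not SF Compute's actual interface.
def get_market_price(hour: int) -> float:
    """Pretend spot price in $/GPU-hour: cheaper at night, with rare spikes."""
    base = 0.85 if hour % 24 < 6 else 1.20
    spike = 4.00 if random.random() < 0.01 else 0.0
    return base + spike

def reserve_hour(hour: int, price: float, gpus: int) -> None:
    print(f"hour {hour:3d}: reserved {gpus} GPUs at ${price:.2f}/GPU-hr")

def run_with_limit(hours: range, gpus: int, limit: float) -> float:
    """Buy every hour at market unless the price spikes above the limit."""
    total = 0.0
    for hour in hours:
        price = get_market_price(hour)
        if price > limit:
            continue                      # accept a pause this hour instead of overpaying
        reserve_hour(hour, price, gpus)
        total += price * gpus
    return total

def run_batch_before_deadline(deadline_hours: int, gpu_hours_needed: int,
                              gpus: int, threshold: float = 0.90) -> bool:
    """Deadline-tolerant batch job: only buy hours cheaper than the threshold,
    but buy at any price once the remaining slack runs out."""
    done = 0
    for hour in range(deadline_hours):
        remaining_hours = deadline_hours - hour
        must_run = (gpu_hours_needed - done) >= remaining_hours * gpus
        price = get_market_price(hour)
        if price <= threshold or must_run:
            reserve_hour(hour, price, gpus)
            done += gpus
        if done >= gpu_hours_needed:
            return True
    return done >= gpu_hours_needed

print(f"total spend: ${run_with_limit(range(48), gpus=8, limit=4.00):,.2f}")
print("met deadline:", run_batch_before_deadline(deadline_hours=8, gpu_hours_needed=32, gpus=8))
```

The second function is the "return your results within four or eight hours instead of 24" idea: it chases cheap hours while it can, and falls back to paying the going rate only when the deadline forces it to.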
And if your latency requirements are that relaxed, it actually works fine. Yes. Correct. Yeah. You can carve that out; you can financially engineer that on SFC. Yeah. I mean, I think to me that unlocks a lot of agent use cases that I want, which is: yeah, work in the background, but I don't want you to take a day. Yeah. Correct. Take a couple hours or something. Yeah. This touches a lot of my background, because I used to be a derivatives trader, and this is a forward market, a futures or forward market, whatever you call it. Not a future. Very explicitly not a future. Not yet a futures market. Yes. But I don't know if you have any other points to talk about. So, you recognize that you are a marketplace, and you've hired for it. I met Alex Epstein at your launch event, and you're building out the financialization of GPUs. Yeah. So part of that's legal. Mm-hmm. Totally. Part of that is listing on an exchange. Yep. Maybe you're the exchange, I don't know how that works, but just talk to me about that: the legal side, the standardization, where is this all headed? Is this eventually fully listed on the Chicago Mercantile Exchange or whatever? What we're trying to do is create an underlying spot market that gives you an index price that you can use. And then with that index price, you can create a cash-settled future. And with a cash-settled future, you can go back to the data centers and say: lock in your price now and de-risk your entire position, which lets you get cheaper cost of capital, and so on. And that, we think, will improve the entire industry, because the marginal cost of compute is the risk. It's risk, as shown by that graph and basically every part of this conversation; it's risk that causes the price to be all sorts of funky. And we think a future is the correct solution to this. So that's the eventual goal. Right now you have to make the underlying spot market in order to make this occur. And then to make the spot market work, you actually have to solve a lot of technology problems. You really cannot make a spot market work if you don't run the clusters, if you don't have control over them, if you don't know how to audit them, because these are supercomputers, not soybeans. They have to work, and it's just a lot simpler to deliver a soybean than it is to deliver compute. I don't know, talk to the soybean guys. Sure. You know? Yeah. But you have to have a delivery mechanism. Somebody somewhere has to actually get the compute at some point, and it actually has to work, and it is really complicated. So that is the other part of our business: we build a bare-metal infrastructure stack, and we also do auditing of all the clusters. You de-risk the technical side, and that allows you to eventually de-risk the financial side. And that is kind of the pitch of SF Compute. Yeah. I'll double-click on the auditing of the clusters. This is something I've had conversations with Yi Tay about. He started Reka, and I think he had a blog post which shone a light on how unreliable some clusters are versus others. Correct. Yeah. And sometimes you kind of have to season them and age them a little bit to find the bad cards. You have to burn them in. Yeah. So what do you do to audit them? There's a burn-in process, a suite of tests, and then active checking and passive checking.
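Before the detailed walkthrough that follows, here is a minimal, hypothetical sketch of what a burn-in plus active/passive check loop can look like. The `run_stress_test` and `passive_metrics_ok` helpers are stubs standing in for real tooling such as LINPACK runs or vendor diagnostics; none of the names or failure rates come from SF Compute.

```python
import random
from dataclasses import dataclass

# Hypothetical sketch of a cluster audit loop; the helpers below are stubs, not real tooling.

@dataclass
class Node:
    name: str
    healthy: bool = True

def run_stress_test(node: Node, hours: int) -> bool:
    """Stand-in for a burn-in run (e.g. LINPACK for 48h+): returns False if the node fails."""
    return random.random() > 0.05          # pretend ~5% of nodes die under sustained load

def passive_metrics_ok(node: Node) -> bool:
    """Stand-in for passive checks that watch error counters, thermals, link flaps, etc.
    while customer workloads are running."""
    return random.random() > 0.02

def burn_in(nodes: list[Node], hours: int = 48) -> list[Node]:
    """Kill the weak hardware before any customer touches it."""
    for node in nodes:
        if not run_stress_test(node, hours):
            node.healthy = False
            print(f"{node.name}: failed burn-in, removing from pool")
    return [n for n in nodes if n.healthy]

def monitor(nodes: list[Node]) -> None:
    """Passive checks only flag nodes; a disruptive active test runs later, in an idle window."""
    for node in nodes:
        if not passive_metrics_ok(node):
            print(f"{node.name}: flagged by passive check, schedule an active test")

cluster = [Node(f"node-{i:03d}") for i in range(16)]
sellable = burn_in(cluster)
monitor(sellable)
print(f"{len(sellable)}/{len(cluster)} nodes passed burn-in")
```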
The burn-in process is where you typically run LINPACK. LINPACK is a thing that runs a bunch of linear algebra equations to stress-test the GPUs. Is this a proprietary thing that you wrote? No, no, no. LINPACK is the most common form of burn-in; when people say burn-in, they typically literally just mean LINPACK, and there's an NVIDIA reference version of it. Again, NVIDIA could run this before they ship, but now the customers have to do it. It's annoying. You're not just checking the GPU itself, you're checking the whole system, all the hardware. And it's a lot of work. It's an integration test. It's an integration test, yeah. So what you're doing when you're running LINPACK, or burn-in in general, is stress-testing the GPUs for some period of time, 48 hours for example, maybe seven days, and you're trying to kill off all the dead GPUs, or any components in the system that are broken. We've had experiences where we ran LINPACK on a cluster and it browned out, sort of came offline, when we ran it. That's a pretty good sign that maybe there is a problem with that cluster. Yeah. So LINPACK is the most common standard test, but beyond that, we have a series of performance tests that replicate a much more realistic environment, which we run once LINPACK passes at all. And then while the GPUs are in operation, you're also doing active tests and passive tests. Passive tests are things that run in the background while some other workload is running. Active tests run during idle periods: you're running some sort of check that would otherwise interrupt something. An active test will take something offline, basically, or a passive check might mark it to get taken offline later, and so on. And then the thing we are working on, which we have working partially but not entirely, is automated refunds, because it is just the case that the hardware breaks a lot. There's only so much that we can do, and it affects pretty much the entire industry. A pretty common thing that happens to basically everybody in the space is: a customer comes online, they experience your cluster, and your cluster has the same problem that any cluster has, or, I mean, a different problem every time, but they experience one of the problems of HPC, and their experience is bad, and you have to negotiate a refund or some other thing like that. It's always case by case. And a lot of people just eat the cost. Correct. So one of the nice things about a market that we can do as we get bigger, and have been doing as we get bigger, is that we can immediately give you something else, and we can also automatically refund you. You're still going to experience the hardware problems; those aren't going away until the underlying vendors fix things, and honestly, I don't think that's likely, because you're always pushing the limits of HPC. That's the nature of trying to build a supercomputer. But one of the nice things we can do is switch you out for somebody else somewhere, and then automatically refund you, or prorate, or whatever the correct move is. One of the things that you said in a conversation with me was: you know a provider is good when they guarantee automatic refunds.
Which doesn't happen. But yeah, that's in our contract with all the underlying cloud providers. You built it in already. Yeah. So we have a quite strict SLA that we pass on to you. The reason why

MLOps.community
Beyond the Matrix: AI and the Future of Human Creativity

MLOps.community

Play Episode Listen Later Mar 30, 2025 55:08


Beyond the Matrix: AI and the Future of Human Creativity // MLOps Podcast #300 with Fausto Albers, AI Engineer & Community Lead at AI Builders Club.
Join the Community: https://go.mlops.community/YTJoinIn
Get the newsletter: https://go.mlops.community/YTNewsletter

// Abstract
Fausto Albers discusses the intersection of AI and human creativity. He explores AI's role in job interviews, personalized AI assistants, and the evolving nature of human-computer interaction. Key topics include AI-driven self-analysis, context-aware AI systems, and the impact of AI on optimizing human decision-making. The conversation highlights how AI can enhance creativity, collaboration, and efficiency by reducing cognitive load and making intelligent suggestions in real time.

// Bio
Fausto Albers is a relentless explorer of the unconventional: a techno-optimist with a foundation in sociology and behavioral economics, always connecting seemingly absurd ideas that, upon closer inspection, turn out to be the missing pieces of a bigger puzzle. He thrives in paradox: he overcomplicates the simple, oversimplifies the complex, and yet somehow lands on solutions that feel inevitable in hindsight. He believes that true innovation exists in the tension between chaos and structure; too much of either, and you're stuck.
His career has been anything but linear. He's owned and operated successful restaurants, served high-stakes cocktails while juggling bottles on London's bar tops, and later traded spirits for code, designing digital waiters, recommender systems, and AI-driven accounting tools. Now, he leads the AI Builders Club Amsterdam, a fast-growing community where AI engineers, researchers, and founders push the boundaries of intelligent systems.
Ask him about RAG, and he'll insist on specificity, because, as he puts it, discussing retrieval-augmented generation without clear definitions is as useful as declaring that "AI will have an impact on the world." An engaging communicator, a sharp systems thinker, and a builder of both technology and communities, Fausto is here to challenge perspectives, deconstruct assumptions, and remix the future of AI.

// Related Links
Website: aibuilders.club

Connect With Us:
Catch all episodes, blogs, newsletters, and more: https://go.mlops.community/TYExplore
Join our Slack community: https://go.mlops.community/slack
Follow us on X/Twitter (https://x.com/mlopscommunity) or LinkedIn (https://go.mlops.community/linkedin)
Sign up for the next meetup: https://go.mlops.community/register
MLOps Swag/Merch: https://shop.mlops.community/
Connect with Demetrios on LinkedIn: /dpbrinkm
Connect with Fausto on LinkedIn: /stepintoliquid

Uncomplicated Marketing
Thrive Sync: Women, Wellness, and AI

Uncomplicated Marketing

Play Episode Listen Later Jan 29, 2025 46:01


Zara Hajihashemi, AI Engineer and Founder of Cybele Health, joins the podcast to share her journey from Apple tech lead to femtech entrepreneur, driven by a mission to revolutionize women's health with AI-driven insights. With a PhD in machine learning, Zara spent six years at Apple leading cross-functional AI projects before founding Cybele Health to address the inefficiencies in healthcare for professional women and working mothers.

In this episode, you'll discover:
The Evolution from AI Engineer to Founder: Learn how Zara's experience at Apple, coupled with her PhD research, shaped her vision for Cybele Health and the need for AI-powered, personalized healthcare solutions.
Bridging the Healthcare Gap with AI: Zara discusses how Cybele Health is leveraging AI to provide 360-degree visibility into women's health, improving communication between patients and providers to create personalized wellness strategies.
The Importance of Personalized Health: Discover how diet, mental health, and physical activity should be aligned with a woman's biological cycle to optimize well-being and productivity.
The Role of Functional Medicine and Preventative Care: Zara explains why being proactive rather than reactive in healthcare is crucial, and how AI can assist in creating sustainable, individualized health plans.
The Future of AI in Femtech: Explore how AI is revolutionizing the health industry by acting as a 24/7 health assistant, providing predictive insights, and closing gaps in traditional medical care.
Building a Health-Tech Startup: Zara shares her journey of founding Cybele Health, securing early users, and the marketing strategies she is employing to drive adoption among both providers and consumers.

Zara's Top Health and Wellness Tips:
Read labels and avoid processed foods with unrecognizable ingredients.
Sync your diet, workouts, and daily habits with your biological cycle for optimal results.
Prioritize functional medicine approaches for proactive rather than reactive health management.

Connect with Zara and Learn More:
Website (join the waitlist): Cybele Health
LinkedIn: Zara Hajihashemi