Podcasts about compute

Activity that uses computers

  • 659 podcasts
  • 1,918 episodes
  • 43m average duration
  • 5 new episodes weekly
  • Latest episode: May 19, 2025

Latest podcast episodes about compute

AWS Podcast
#721: AWS News: Amazon Nova Premier takes on complex tasks and model distillation, and more

May 19, 2025 · 28:42


Nova Premier is our most advanced AI model yet, featuring a million-token context window and enhanced capabilities at nearly half the cost of competitors. Dive into this update and more with hosts Simon and Jillian. 00:00 - Intro, 00:31 - Amazon Nova Premier, 02:56 - Analytics, 04:46 - Artificial Intelligence, 11:02 - Business Applications, 11:38 - Cloud Financial Management, 11:57 - Compute, 12:10 - Contact Center, 14:50 - Containers, 15:13 - Database, 17:52 - Developer Tools, 18:08 - Management and Governance, 20:25 - Networking, 22:48 - Marketplace, 24:04 - Security, Identity, and Compliance, 26:09 - Storage, 27:56 - Outro Show Notes: https://dqkop6u6q45rj.cloudfront.net/shownotes-20250516-191312.html

POLITICO Dispatch
‘Compute, not crude': How American AI is defining the new Middle East

May 15, 2025 · 24:47


A large contingent of Silicon Valley CEOs followed President Donald Trump to Saudi Arabia this week, where a number of them announced billions of dollars in AI-related investments and business partnerships. Mohammed Soliman, a senior fellow at the Middle East Institute, says this is the new Middle East, where the relationship with the U.S. is driven by tech and innovation, not just oil and security. On POLITICO Tech, Soliman tells host Steven Overly how this new arrangement benefits tech companies and Gulf nations, and why it's necessary if the U.S. hopes to stay ahead of China. Steven Overly is the host of POLITICO Tech and covers the intersection of trade and technology. Nirmal Mulaikal is the co-host and producer of POLITICO Energy and producer of POLITICO Tech.

Enterprise Podcast Network – EPN
Powering the AI Revolution: How TensorWave's AMD Supercloud Is Solving Compute Bottlenecks

May 15, 2025 · 8:54


Piotr Tomasik, Co-Founder & President of TensorWave, who's powering the next wave of AI compute with AMD-optimized super-cloud infrastructure and building Las Vegas into a …

Azeem Azhar's Exponential View
China's catching up to US AI… Here's why it won't matter

May 14, 2025 · 49:17


Lennart Heim, a researcher and information scientist at RAND Corporation, joins Azeem Azhar to unpack a provocative claim: China is catching up with US AI capabilities, but it doesn't matter.

Timestamps:
(00:00) Episode trailer
(01:19) Lennart's core thesis
(03:26) Why compute matters so much
(07:31) The investment split between model R&D and model execution
(11:18) How test-time compute impacts costs
(16:14) The geopolitics of compute
(21:32) Why does the U.S have more compute capacity than China?
(25:01) The trade-off between economic needs and national-security needs
(31:54) How technology change might shift the battlegrounds
(35:33) Dealing with compute and power concentration
(48:19) Concluding quick-fire question

Lennart's links:
Twitter/X: https://twitter.com/ohlennart
Personal blog: https://heim.xyz/

Azeem's links:
Substack: https://www.exponentialview.co/
Website: https://www.azeemazhar.com/
LinkedIn: https://www.linkedin.com/in/azhar
Twitter/X: https://x.com/azeem

This was originally recorded for "Friday with Azeem Azhar", a new show that takes place every Friday at 9am PT and 12pm ET. You can tune in through Exponential View on Substack. Produced by supermix.io and EPIIPLUS1 Ltd

Compute This
Compute This 5-11-2025

May 11, 2025 · 53:51


Born In Silicon Valley
Smarter Compute Slashes AI Costs

May 8, 2025 · 51:52


Hey there, listeners! Welcome back to the show! Today, we've got an incredible episode lined up for you. Host Jake Aaron Villarreal sits down with Gennady Pekhimenko, the co-founder and CEO of CentML, for a deep dive into the wild world of AI and machine learning. Gennady takes us on his fascinating journey, from a math and programming whiz in Russia to a trailblazer in the AI industry. He breaks down the evolution of AI, spills the tea on why optimizing machine learning workloads is a game-changer, and gets real about the balancing act of being a professor and a startup CEO.

Gennady doesn't hold back, sharing sharp insights on the hidden costs of AI (think training and inference) and who CentML's solutions are built for. He's all about solving real-world problems and unlocking transformative impact through smarter, more efficient compute solutions. Plus, he dishes on the challenges and opportunities in the AI industry, from the skyrocketing demand for computational power to the art of building a killer team and nailing hiring strategies in a startup. Gennady also gives us a sneak peek into CentML's future, teasing their growth plans and how customer feedback is shaping their roadmap.

Buckle up for a conversation packed with big ideas, practical wisdom, and a front-row seat to the future of AI. Let's dive in!

Host: Jake Aaron Villarreal leads the top AI recruitment firm in Silicon Valley, www.matchrelevant.com, uncovering stories of funded startups and going behind the scenes to tell their founders' journeys. If you are growing an AI startup or have a great story to tell, email us at: jake.villarreal@matchrelevant.com

Data Protection Gumbo
298: The Battle for AI Supremacy Isn't About Models—It's About Infrastructure - Thunder Compute

May 6, 2025 · 33:32


Carl Peterson, CEO of Thunder Compute, uncovers how Thunder Compute is redefining GPU utilization by enabling network-attached virtual GPUs, dramatically slashing costs and democratizing access. Carl shares the startup's Y Combinator origin story, the impact of DeepSeek, and how virtualization is transforming AI development for individuals and enterprises alike. We also unpack GPU security, job disruption from AI, and the accelerating arms race in model development. A must-listen for anyone navigating AI, compute efficiency, and data protection.

AWS Podcast
#719: AWS News: Amazon Q Developer brings powerful new AI capabilities to GitLab Duo

May 5, 2025 · 26:12


Learn how you can use the all new Amazon Q Developer integration with GitLab Duo to automate code generation and review, plus even more updates from AWS. 00:00:00 - Intro, 00:00:28 - SWE-PolyBench, 00:04:31 - Analytics, 00:06:49 - Application Integration, 00:07:14 - Artificial Intelligence, 00:08:53 - Amazon Bedrock Data Automation, 00:14:11 - AWS HealthOmics, 00:14:21 - Compute, 00:16:37 - Contact Centers, 00:17:25 - Containers, 00:17:46 - Databases, 00:18:18 - Frontend Web and Mobile, 00:18:59 - Management and Governance, 00:20:07 - Migration and Transfer, 00:20:17 - Networking and Content Delivery, 00:20:44 - Security, Identity, and Compliance, 00:23:24 - Serverless, 00:24:01 - Storage, 00:24:41 - Wrap up Show Notes: https://d29iemol7wxagg.cloudfront.net/719ExtendedShownotes.html

Compute This
Compute This 5-4-2025

May 5, 2025 · 53:51


LessWrong Curated Podcast
“Slowdown After 2028: Compute, RLVR Uncertainty, MoE Data Wall” by Vladimir_Nesov

May 3, 2025 · 11:33


It'll take until ~2050 to repeat the level of scaling that pretraining compute is experiencing this decade, as increasing funding can't sustain the current pace beyond ~2029 if AI doesn't deliver a transformative commercial success by then. Natural text data will also run out around that time, and there are signs that current methods of reasoning training might be mostly eliciting capabilities from the base model. If scaling of reasoning training doesn't bear out actual creation of new capabilities that are sufficiently general, and pretraining at ~2030 levels of compute together with the low hanging fruit of scaffolding doesn't bring AI to crucial capability thresholds, then it might take a while. Possibly decades, since training compute will be growing 3x-4x slower after 2027-2029 than it does now, and the ~6 years of scaling since the ChatGPT moment stretch to 20-25 subsequent years, not even having access to any [...]

---
Outline:
(01:14) Training Compute Slowdown
(04:43) Bounded Potential of Thinking Training
(07:43) Data Inefficiency of MoE

The original text contained 4 footnotes which were omitted from this narration.

---
First published: May 1st, 2025
Source: https://www.lesswrong.com/posts/XiMRyQcEyKCryST8T/slowdown-after-2028-compute-rlvr-uncertainty-moe-data-wall

---
Narrated by TYPE III AUDIO.

Deep Papers
Sleep-time Compute: Beyond Inference Scaling at Test-time

May 2, 2025 · 30:24


What if your LLM could think ahead, preparing answers before questions are even asked?

In this week's paper read, we dive into a groundbreaking new paper from researchers at Letta, introducing sleep-time compute: a novel technique that lets models do their heavy lifting offline, well before the user query arrives. By predicting likely questions and precomputing key reasoning steps, sleep-time compute dramatically reduces test-time latency and cost without sacrificing performance.

We explore new benchmarks (Stateful GSM-Symbolic, Stateful AIME, and the multi-query extension of GSM) that show up to 5x lower compute at inference, 2.5x lower cost per query, and up to 18% higher accuracy when scaled.

You'll also see how this method applies to realistic agent use cases and what makes it most effective. If you care about LLM efficiency, scalability, or cutting-edge research, this episode is for you.

Explore more AI research, or sign up to hear the next session live: arize.com/ai-research-papers

Learn more about AI observability and evaluation, join the Arize AI Slack community or get the latest on LinkedIn and X.
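
For readers who want a feel for the idea in the description above, here is a minimal toy sketch of the "precompute offline, answer cheaply online" pattern. It is an analogy only, not the method from the Letta paper: the class name `SleepTimeCache`, the set of anticipated questions, and the operation counter are all hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class SleepTimeCache:
    """Toy stand-in (hypothetical) for a model that pre-processes a context while idle."""

    context: list[float]
    precomputed: dict[str, float] = field(default_factory=dict)
    test_time_ops: int = 0  # rough count of work done while the user is waiting

    def sleep(self) -> None:
        # Offline phase: anticipate likely questions and answer them ahead of time.
        total = sum(self.context)
        self.precomputed["sum"] = total
        self.precomputed["mean"] = total / len(self.context)
        self.precomputed["max"] = max(self.context)

    def answer(self, question: str) -> float:
        # Online phase: reuse a precomputed result when the question was anticipated.
        if question in self.precomputed:
            self.test_time_ops += 1  # a single lookup
            return self.precomputed[question]
        # Unanticipated question: fall back to scanning the whole context.
        self.test_time_ops += len(self.context)
        if question == "min":
            return min(self.context)
        raise ValueError(f"unsupported question: {question}")


cache = SleepTimeCache([3.0, 1.0, 4.0, 1.0, 5.0])
cache.sleep()                # work done before any query arrives
print(cache.answer("mean"))  # anticipated: served from the cache
print(cache.answer("min"))   # not anticipated: full scan at query time
print(cache.test_time_ops)   # lookups are cheap, fallbacks cost a full pass
```

The more queries that hit anticipated questions on the same context, the more the offline work is amortized, which is the intuition behind the lower cost per query mentioned for the multi-query setting.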

FloppyDays Vintage Computing Podcast
Floppy Days 150 - Interview with David Greelish, Apple Lisa Documentary

Apr 28, 2025 · 68:50


Interview with David Greelish, Apple Lisa Documentary

Patreon: https://www.patreon.com/FloppyDays
Sponsors: 8-Bit Classics, Arcade Shopper, FutureVision Research

Hello, and welcome to episode 150 of the Floppy Days Podcast for April, 2025. My name is Randy Kindig and I'm the host for this journey through the annals of home computer history. This month, I'm going to step aside from the ongoing series of episodes about the HP 97/67 programmable calculators to bring you a timely interview with a good friend about an interesting topic. That friend is David Greelish, a computer historian, and the topic is his recent publication of a film documentary about the Apple Lisa, called "Before Macintosh: The Apple Lisa". David tells us all about the film, why he produced it, why the Apple Lisa was an important part of home computer history, who he interviewed for the film (he had some amazing guests) and much more. It's a great film and should interest a lot of the listeners, so please consider going out and purchasing the film in order to support David's efforts. For upcoming shows, we do have one more episode in the series on the HP97 with HP calculator historian Wlodek Mier-Jedrzejowicz. I will air that episode very soon.

New Acquisitions/What I've Been Up To
  • Indy Classic Expo - https://www.indyclassic.org
  • Vintage Computer Center - https://www.vintagecomputercenter.com
  • OmniView 80 card for Atari 800 - https://archive.org/details/Atari_OMNIVIEW_manual
  • Commodore 16 - https://en.wikipedia.org/wiki/Commodore_16
  • 6502 Plus 4 upgrade for C16 from Lotharek - https://lotharek.pl/productdetail.php?id=257

News
  • Reboot of Compute's Gazette Magazine - https://www.computesgazette.com/iconic-computes-gazette-magazine-returns-after-35-years-expanding-focus-to-entire-retro-computing-community/

Upcoming Shows
  • The 32nd Annual “Last” Chicago CoCoFEST! - May 2-3, 2025 - Holiday Inn & Suites Chicago-Carol Stream (Wheaton), Carol Stream, Illinois - https://www.glensideccc.com/cocofest/
  • VCF Europe - May 3-4 - Munich, Germany - https://vcfe.org/E/
  • Retrofest 2025 - May 31-June 1 - Steam Museum of the Great Western Railway, Swindon, UK - https://retrofest.uk/
  • Vancouver Retro Gaming Expo - June 14 - New Westminster, BC, Canada - https://www.vancouvergamingexpo.com/index.html
  • VCF Southwest - June 20-22, 2025 - Davidson-Gundy Alumni Center at UT Dallas - https://www.vcfsw.org/
  • Southern Fried Gaming Expo and VCF Southeast - June 20-22, 2025 - Atlanta, GA - https://gameatl.com/
  • Pacific Commodore Expo NW v4 - June 21-22 - Old Rainier Brewery Intraspace, Seattle, WA - https://www.portcommodore.com/dokuwiki/doku.php?id=pacommex:start
  • KansasFest - July 18-20 - Virtual only - https://www.kansasfest.org/
  • VCF West - August 1-2 - Computer History Museum in Mountain View, CA - https://vcfed.org/2025/03/05/vcf-west-2025-save-the-date/
  • VCF Midwest - September 13-14, 2025 - Renaissance Schaumburg Convention Center in Schaumburg, IL - http://vcfmw.org/
  • Tandy Assembly - September 26-28 - Courtyard by Marriott Springfield - Springfield, OH - http://www.tandyassembly.com/
  • Portland Retro Gaming Expo - October 17-19 - Oregon Convention Center, Portland, OR - https://retrogamingexpo.com/
  • Chicago TI International World Faire - October 25 - Evanston Public Library, Evanston, IL - https://www.chicagotiug.org/home
  • Schedule Published on Floppy Days Website - https://docs.google.com/document/d/e/2PACX-1vSeLsg4hf5KZKtpxwUQgacCIsqeIdQeZniq3yE881wOCCYskpLVs5OO1PZLqRRF2t5fUUiaKByqQrgA/pub

Documentary and Classic Computing Links
  • Classic Computing Website - https://www.classiccomputing.com/Classic_Computing/Blog/Blog.html
  • "Before Macintosh: The Apple Lisa" Full Documentary Film - https://www.youtube.com/watch?v=psAeTDYezdo
  • Exidy Sorcerer at VCFSE 2 - https://floppydays.libsyn.com/floppy-days-episode-17-the-exidy-sorcerer-live-from-vcfse-20
  • Stan Veit podcast - https://www.classiccomputing.com/CCPodcasts/Stan_Veit/Stan_Veit.html
  • Classic Computing - the book! - https://www.classiccomputing.com/Classic_Computing/My_Book.html
  • Documentary link at IMDB - https://www.imdb.com/title/tt31122934/

CryptoNews Podcast
#434: Daniel Marin, Founder of Nexus, on Enabling the Verifiable Internet, Aggregating Unused Compute Power, ZK Tech, and Verifiable AI

Apr 28, 2025 · 34:59


Daniel Marin is the Founder and Chief Executive Officer of Nexus. Daniel founded Nexus in 2022 while he was at Stanford with the mission to enable the Verifiable Internet, which will redefine digital trust and create a more transparent, secure, and efficient world. To achieve this mission, Nexus is building a globally distributed Layer-1 blockchain powered by a zkVM engine.

Daniel earned a Bachelor of Science in Computer Science from Stanford University. He was named to Forbes' '30 Under 30' list in 2025, and earned Bronze medals at the International Physics Olympiad in 2018 and 2019.

In this conversation, we discuss:
- Are we back?
- Enabling the Verifiable Internet
- Parallels between AI and ZK
- Aggregating unused compute power
- Verifiable AI
- Solving critical issues around privacy, trust, and security
- 2.1 million users and 3.6 million nodes already connected to the network
- With Nexus, more nodes = faster blockchain
- Verifiable computation will impact many markets, blockchain is just one example
- The power of zkEVM
- The future of AI & Blockchain

Nexus
Website: nexus.xyz
X: @NexusLabs
Discord: discord.gg/nexus-xyz

Daniel Marin
X: @danielmarinq
LinkedIn: Daniel Marin

---

This episode is brought to you by PrimeXBT. PrimeXBT offers a robust trading system for both beginners and professional traders that demand highly reliable market data and performance. Traders of all experience levels can easily design and customize layouts and widgets to best fit their trading style. PrimeXBT is always offering innovative products and professional trading conditions to all customers.

PrimeXBT is running an exclusive promotion for listeners of the podcast. After making your first deposit, 50% of that first deposit will be credited to your account as a bonus that can be used as additional collateral to open positions. Code: CRYPTONEWS50. This promotion is available for a month after activation.

Compute This
Compute This 4-27-2025

Apr 27, 2025 · 53:51


The Asianometry Podcast
Quantum Compute with Single Photons

Apr 24, 2025


Last year during my trip to Silicon Valley, I was invited to visit a company called PsiQuantum. When you think about quantum computing, your mind might conjure up those chandeliers. Qubits plunged to super cold temperatures. PsiQuantum is working on something a little different. Quantum computing using photons. In this video, a form of quantum compute with intriguing possibilities. Does it “work” like silicon does today? Is quantum compute really here? I can't really answer those questions in this video. But we can explore the ideas and the ideas are certainly mind-bending.

Open at Intel
Data Privacy and Efficiency with Bacalhau Compute Over Data

Apr 24, 2025 · 23:10


In this episode, David Aronchick, CEO and Co-founder of Expanso, discusses his experiences and insights from working with Kubernetes since its early days at Google. David shares his journey from working on Kubernetes to co-founding Kubeflow and his latest project, Bacalhau, which focuses on combining compute and data management in distributed systems. Highlighting the challenges of data processing and privacy, particularly in edge computing and regulated environments, David emphasizes cost-saving benefits and the importance of local data processing. Throughout, privacy and regulatory concerns are underscored along with solutions for efficient and secure data handling.

00:00 Introduction and Welcome
00:23 Early Days of Kubernetes
01:05 Kubernetes Community and Evolution
02:23 AI, ML, and Kubeflow
03:40 Current Work and Data Challenges
08:20 Privacy and Security Concerns
14:21 Real-World Applications and Benefits
20:42 Conclusion

Guest: David Aronchick, Founder and CEO at Expanso, formerly led open source machine learning strategy at Azure, managed Kubernetes product development at Google, and co-founded Kubeflow. Previous roles at Microsoft, Amazon, and Chef.

AWS Podcast
#717: Conversational AI with Amazon Nova Sonic, Amazon Bedrock Guardrails announces new capabilities

Apr 21, 2025 · 38:03


Learn about the latest new FM in the Nova family that simplifies conversational AI with low latency, and build safely with new capabilities for Amazon Bedrock Guardrails. 00:00 - Intro, 00:27 - Amazon Nova Sonic, 03:13 - Amazon Bedrock Guardrails, 05:23 - Analytics, 08:18 - Application Integration, 08:37 - Artificial Intelligence, 12:06 - Business Applications, 13:01 - Cloud Financial Management, 13:44 - Compute, 15:04 - Contact Center, 16:29 - Containers, 16:49 - Databases, 19:57 - Developer Tools, 20:59 - Frontend Web and Mobile, 21:20 - Management and Governance, 23:39 - Media Services, 25:37 - Migration and Transfer, 26:46 - Networking and Content Delivery, 28:45 - Artificial Intelligence, 29:58 - Security, Identity, and Compliance, 32:51 - Serverless, 33:57 - Storage, 37:29 - Wrap up Show Notes: https://dqkop6u6q45rj.cloudfront.net/run-sheet-20250418-173723.html

Compute This
Compute This 4-20-2025

Apr 21, 2025 · 53:51


Eye On A.I.
#249 Brice Challamel: How Moderna is Using AI to Disrupt Modern Healthcare

Apr 20, 2025 · 50:04


This episode is sponsored by Oracle. OCI is the next-generation cloud designed for every workload, where you can run any application, including any AI projects, faster and more securely for less. On average, OCI costs 50% less for compute, 70% less for storage, and 80% less for networking. Join Modal, Skydance Animation, and today's innovative AI tech companies who upgraded to OCI…and saved.

Offer only for new US customers with a minimum financial commitment. See if you qualify for half off at http://oracle.com/eyeonai

In this episode of Eye on AI, Craig Smith sits down with Brice Challamel, Head of AI Products and Innovation at Moderna, to explore how one of the world's leading biotech companies is embedding artificial intelligence across every layer of its business, from drug discovery to regulatory approval.

Brice breaks down how Moderna treats AI not just as a tool, but as a utility (much like electricity or the internet) designed to empower every employee and drive innovation at scale. With over 1,800 GPTs in production and thousands of AI solutions running on internal platforms like Compute and MChat, Moderna is redefining what it means to be an AI-native company.

Key topics covered in this episode:
  • How Moderna operationalizes AI at scale
  • GenAI as the new interface for machine learning
  • AI's role in speeding up drug approvals and clinical trials
  • The future of personalized cancer treatment (INT)
  • Moderna's platform mindset: AI + mRNA = next-gen medicine
  • Collaborating with the FDA using AI-powered systems

Don't forget to like, comment, and subscribe for more interviews at the intersection of AI and innovation.

Stay Updated:
Craig Smith on X: https://x.com/craigss
Eye On A.I. on X: https://x.com/EyeOn_AI

(00:00) Preview
(02:49) Brice Challamel's Background and Role at Moderna
(05:51) Why AI Is Treated as a Utility at Moderna
(09:01) Moderna's AI Infrastructure
(11:53) GenAI vs Traditional ML
(14:59) Combining mRNA and AI as Dual Platforms
(18:15) AI's Impact on Regulatory & Clinical Acceleration
(23:46) The Five Core Applications of AI at Moderna
(26:33) How Teams Identify AI Use Cases Across the Business
(29:01) Collaborating with the FDA Using AI Tools
(33:55) How Moderna Is Personalizing Cancer Treatments
(36:59) The Role of GenAI in Medical Care
(40:10) Producing Personalized mRNA Medicines
(42:33) Why Moderna Doesn't Sell AI Tools
(45:30) The Future: AI and Democratized Biotech

Complex Systems with Patrick McKenzie (patio11)
The AI energy bottleneck, with Tim Fist

Apr 17, 2025 · 66:05


In this episode, Patrick McKenzie (@patio11) is joined by Tim Fist, Director of Emerging Technologies at the Institute for Progress, to discuss how energy constraints could bottleneck AI development. They explore how AI training clusters will soon require gigawatts of power (equivalent to multiple nuclear plants), with projections showing a single cluster needing 5 gigawatts by 2030. Tim explains why behind-the-meter generation and geothermal energy offer promising solutions while regulatory hurdles like NEPA and transmission permitting create "litigation doom loops" that threaten America's competitiveness. The conversation covers the global race for compute infrastructure, with China and the UAE making aggressive investments while the US struggles with permitting delays, highlighting how energy policy will determine which nations lead the AI revolution.

Full transcript available here: www.complexsystemspodcast.com/the-ai-energy-bottleneck-with-tim-fist/

Sponsor: Vanta
Vanta automates security compliance and builds trust, helping companies streamline ISO, SOC 2, and AI framework certifications. Learn more at https://vanta.com/complex

Recommended in this episode:
  • Compute in America https://ifp.org/compute-in-america/
  • Tim Fist on Twitter https://x.com/fiiiiiist
  • The Enchippening by Sarah Constantin https://sarahconstantin.substack.com/p/the-enchippening
  • Solar economics with Casey Handmer https://open.spotify.com/episode/0GHegWgLSubYxvATmbWhQu?si=VKJYaSwaRJq_YcK8kJIdvQ
  • AI & Power economics with Azeem Azhar https://open.spotify.com/episode/3KkvPiYpGvXCRukWxHP7Ch?si=RPEjrs67S9CFA0lLak6OVA
  • Fracking with Austin Vernon https://open.spotify.com/episode/0YDV1XyjUCM2RtuTcBGYH9?si=hSniC3N0QkqhF74ra-XAcA
  • Economics of the grid with Travis Dauwalter https://open.spotify.com/episode/5JY8e84sEXmHFlc8IR2kRb?si=BsqMZGu6Qr-2F7-RSyyEhw

Timestamps:
(00:00) Intro
(00:40) Energy bottlenecks in AI development
(02:56) Technical and policy solutions for energy needs
(05:18) Challenges in transmission infrastructure
(12:14) Behind the meter generation explained
(17:50) Solar and storage: The future of energy
(18:47) Sponsor: Vanta
(20:05) Solar and storage: The future of energy (part 2)
(29:07) Power purchase agreements and financing
(33:17) Financing geothermal wells
(33:53) The promise of geothermal energy
(35:25) Challenges in geothermal adoption
(36:59) Industrial applications of geothermal heat
(45:01) Geothermal energy and national security
(49:27) Global investments in AI and energy infrastructure
(56:29) Policy and technical expertise in AI
(01:00:54) The role of government in technological advancements
(01:05:07) Wrap

AWS Podcast
#716: Concrete, Cooling, and Compute: Reinventing Data Centers for the AI Age

Apr 14, 2025 · 29:54


Dive deep into the fascinating world of modern data centers with Sr. Principal Engineer at AWS, Stephen Callahan. Discover how AI is revolutionizing data center design, why nothing is uninteresting at scale, and the innovative ways AWS is tackling sustainability while powering the future of cloud computing. Learn more: AWS Global Infrastructure: https://aws.amazon.com/about-aws/global-infrastructure/ More about Data Center Innovations: https://press.aboutamazon.com/2024/12/aws-announces-new-data-center-components-to-support-ai-innovation-and-further-improve-energy-efficiency

Compute This
Compute This 4-13-2025

Apr 13, 2025 · 53:51


Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Evan Conrad, co-founder of SF Compute, joined us to talk about how they started as an AI lab that avoided bankruptcy by selling GPU clusters, why CoreWeave financials look like a real estate business, and how GPUs are turning into a commodities market.

Chapters:
00:00:05 - Introductions
00:00:12 - Introduction of guest Evan Conrad from SF Compute
00:00:12 - CoreWeave Business Model Discussion
00:05:37 - CoreWeave as a Real Estate Business
00:08:59 - Interest Rate Risk and GPU Market Strategy Framework
00:16:33 - Why Together and DigitalOcean will lose money on their clusters
00:20:37 - SF Compute's AI Lab Origins
00:25:49 - Utilization Rates and Benefits of SF Compute Market Model
00:30:00 - H100 GPU Glut, Supply Chain Issues, and Future Demand Forecast
00:34:00 - P2P GPU networks
00:36:50 - Customer stories
00:38:23 - VC-Provided GPU Clusters and Credit Risk Arbitrage
00:41:58 - Market Pricing Dynamics and Preemptible GPU Pricing Model
00:48:00 - Future Plans for Financialization?
00:52:59 - Cluster auditing and quality control
00:58:00 - Futures Contracts for GPUs
01:01:20 - Branding and Aesthetic Choices Behind SF Compute
01:06:30 - Lessons from Previous Startups
01:09:07 - Hiring at SF Compute

Chapters
00:00:00 Introduction and Background
00:00:58 Analysis of GPU Business Models
00:01:53 Challenges with GPU Pricing
00:02:48 Revenue and Scaling with GPUs
00:03:46 Customer Sensitivity to GPU Pricing
00:04:44 CoreWeave's Business Strategy
00:05:41 CoreWeave's Market Perception
00:06:40 Hyperscalers and GPU Market Dynamics
00:07:37 Financial Strategies for GPU Sales
00:08:35 Interest Rates and GPU Market Risks
00:09:30 Optimal GPU Contract Strategies
00:10:27 Risks in GPU Market Contracts
00:11:25 Price Sensitivity and Market Competition
00:12:21 Market Dynamics and GPU Contracts
00:13:18 Hyperscalers and GPU Market Strategies
00:14:15 Nvidia and Market Competition
00:15:12 Microsoft's Role in GPU Market
00:16:10 Challenges in GPU Market Dynamics
00:17:07 Economic Realities of the GPU Market
00:18:03 Real Estate Model for GPU Clouds
00:18:59 Price Sensitivity and Chip Design
00:19:55 SF Compute's Beginnings and Challenges
00:20:54 Navigating the GPU Market
00:21:54 Pivoting to a GPU Cloud Provider
00:22:53 Building a GPU Market
00:23:52 SF Compute as a GPU Marketplace
00:24:49 Market Liquidity and GPU Pricing
00:25:47 Utilization Rates in GPU Markets
00:26:44 Brokerage and Market Flexibility
00:27:42 H100 Glut and Market Cycles
00:28:40 Supply Chain Challenges and GPU Glut
00:29:35 Future Predictions for the GPU Market
00:30:33 Speculations on Test Time Inference
00:31:29 Market Demand and Test Time Inference
00:32:26 Open Source vs. Closed AI Demand
00:33:24 Future of Inference Demand
00:34:24 Peer-to-Peer GPU Markets
00:35:17 Decentralized GPU Market Skepticism
00:36:15 Redesigning Architectures for New Markets
00:37:14 Supporting Grad Students and Startups
00:38:11 Successful Startups Using SF Compute
00:39:11 VCs and GPU Infrastructure
00:40:09 VCs as GPU Credit Transformators
00:41:06 Market Timing and GPU Infrastructure
00:42:02 Understanding GPU Pricing Dynamics
00:43:01 Market Pricing and Preemptible Compute
00:43:55 Price Volatility and Market Optimization
00:44:52 Customizing Compute Contracts
00:45:50 Creating Flexible Compute Guarantees
00:46:45 Financialization of GPU Markets
00:47:44 Building a Spot Market for GPUs
00:48:40 Auditing and Standardizing Clusters
00:49:40 Ensuring Cluster Reliability
00:50:36 Active Monitoring and Refunds
00:51:33 Automating Customer Refunds
00:52:33 Challenges in Cluster Maintenance
00:53:29 Remote Cluster Management
00:54:29 Standardizing Compute Contracts
00:55:28 Unified Infrastructure for Clusters
00:56:24 Creating a Commodity Market for GPUs
00:57:22 Futures Market and Risk Management
00:58:18 Reducing Risk with GPU Futures
00:59:14 Stabilizing the GPU Market
01:00:10 SF Compute's Anti-Hype Approach
01:01:07 Calm Branding and Expectations
01:02:07 Promoting San Francisco's Beauty
01:03:03 Design Philosophy at SF Compute
01:04:02 Artistic Influence on Branding
01:05:00 Past Projects and Burnout
01:05:59 Challenges in Building an Email Client
01:06:57 Persistence and Iteration in Startups
01:07:57 Email Market Challenges
01:08:53 SF Compute Job Opportunities
01:09:53 Hiring for Systems Engineering
01:10:50 Financial Systems Engineering Role
01:11:50 Conclusion and Farewell

This Week in Pre-IPO Stocks
E196: Thinking Machines targets $10B valuation with $2B seed round; ByteDance revenue hits $155B in 2024; Anysphere(Cursor) revenue quadruples, eyes $10B valuation; Nuro raises $106M at $6B valuation; Base Power raises $200M to scale affordable home batte

This Week in Pre-IPO Stocks

Play Episode Listen Later Apr 11, 2025 13:52


Subscribe to AG Dillon Pre-IPO Stock Research at agdillon.com/subscribe. Wednesdays = secondary market valuations, revenue multiples, performance, index fact sheets. Saturdays = pre-IPO news and insights. 00:00 - Intro 00:08 - Thinking Machines Targets $10B Valuation with $2B Seed Round 01:12 - ByteDance Revenue Hits $155B; Valuation Diverges 02:15 - Anysphere Revenue Quadruples; Eyes $10B Valuation 03:01 - Nuro Raises $106M at $6B Valuation 03:51 - Base Power Raises $200M to Scale Affordable Home Batteries 05:06 - Anthropic Launches Claude Max, Valued at $61.5B 06:15 - Ripple Acquires Hidden Road for $1.25B 07:13 - Canva Adds GenAI Tools; Valued at $37.9B 08:19 - Electricity Demand for AI Surges Globally 10:31 - OpenAI Rolls Out ChatGPT Memory Feature 11:30 - Google Joins Anthropic's Model Context Protocol 12:43 - Safe Superintelligence Taps Google Cloud for Compute

Web3 with Sam Kamani
244: From Harvard to Hardware: Hoansoo from Exabits on Web3 Compute and Scaling with GPUs

Web3 with Sam Kamani

Play Episode Listen Later Apr 9, 2025 26:12


In this episode of Web3 with Sam Kamani, Sam is joined by co-host Amanda Whitcroft to interview Hoansoo Lee, co-founder of Exabits.ai. With a PhD from Harvard and deep expertise in edge computing, Hoansoo shares how Exabits is decentralizing the GPU cloud for AI by combining high-performance chips like the H100 and Blackwell with tokenized infrastructure on Web3 rails. They explore why AI compute is the "new energy," how Exabits differentiates from competitors like CoreWeave, and the opportunities for DeFi and structured finance in this emerging landscape. Hoansoo also discusses the limitations of decentralized compute, the challenges around AI experimentation, and how data, compute, and causality intersect in building next-gen AI. Whether you're a founder building in AI, a researcher, or a curious investor, this episode is packed with deep insights into the future of decentralized compute and what's next in the AI x Web3 convergence. Key Timestamps: [00:00:00] Introduction: Sam introduces co-host Amanda and guest Hoansoo Lee from Exabits.ai. [00:01:00] What is Exabits?: Hoansoo explains Exabits in one sentence—high-quality GPU compute for AI. [00:02:00] Who Uses It: Discussing their customer base across Web2 and Web3. [00:03:00] Hardware Stack: Exabits runs 60,000+ GPUs including H100s and Blackwells. [00:04:00] Competitive Landscape: Why Exabits is different from other Web3 dePIN projects. [00:05:00] Founding Story: How a background in edge computing led to building Exabits. [00:06:00] Go-to-Market: Customer acquisition through partnerships, referrals, and conferences. [00:07:00] Growth Opportunity: Why structured finance and GPU financialization is the next big thing. [00:08:00] AI Efficiency vs. Demand: DeepSeek, scaling laws, and the compute boom. [00:10:00] Energy + Compute: AI's demand for energy and its parallels to historical tech trends. [00:11:00] Decentralized Compute: Limitations of latency-sensitive decentralized AI infrastructure. [00:13:00] AI = Bitcoin Mining 2.0: The evolution from minting Bitcoin to minting intelligence. [00:14:00] Pillars of AI: From compute/data/models to experimentation and causal inference. [00:17:00] AI Limits: Why synthetic data can't replace real-world experimentation. [00:18:00] Scarcity & Innovation: How chip scarcity could spark further innovation. [00:20:00] In-House Servers: Why building H200 racks in-house is a differentiator. [00:21:00] How It Works: A user's experience on Exabits from login to compute access. [00:23:00] Founder Advice: Hoansoo's take on building something with real customers and solid fundamentals. [00:24:00] Roadmap: Data center expansion, orchestration features, and governance via staking. [00:25:00] TGE Ahead: Exabits' upcoming token generation event and next steps. Connect: https://www.exabits.ai/ https://www.linkedin.com/company/exabitsai/ https://x.com/exa_bits https://www.linkedin.com/in/hoansoo-lee-21586b9/ https://www.linkedin.com/in/amanda-whitcroft-324879164/ Disclaimer: Nothing mentioned in this podcast is investment advice and please do your own research. Finally, it would mean a lot if you can leave a review of this podcast on Apple Podcasts or Spotify and share this podcast with a friend. Be a guest on the podcast or contact us - https://www.web3pod.xyz/

AWS Podcast
#715: AWS News: Be your own data analyst with Amazon Q in Quicksight, and more

AWS Podcast

Play Episode Listen Later Apr 7, 2025 24:07


Hosts Simon and Jillian discuss how you can uncover hidden trends and make data-driven decisions, all through natural conversation, with Amazon Q in QuickSight, plus more of the latest updates from AWS. 00:00 - Intro, 00:22 - Top Stories, 02:50 - Analytics, 03:35 - Application Integrations, 04:48 - Amazon SageMaker, 05:29 - Amazon Bedrock Knowledge Bases, 05:48 - Amazon Polly, 06:46 - Amazon Bedrock, 07:31 - Amazon Bedrock Model Evolution LLM, 08:29 - Business Application, 08:58 - Compute, 09:51 - Contact Centers, 10:54 - Containers, 11:12 - Database, 14:21 - Developer Tools, 15:20 - Front End Web and Mobile, 15:45 - Games, 16:04 - Management and Governance, 16:35 - Media Services, 16:47 - Network and Content Delivery, 19:39 - Security Identity and Compliance, 20:24 - Serverless, 21:48 - Storage, 22:43 - Wrap up Show Notes: https://dqkop6u6q45rj.cloudfront.net/shownotes-20250404-184823.html

The GeekNarrator
Can Math simplify incremental compute?

The GeekNarrator

Play Episode Listen Later Apr 6, 2025 77:13


In this episode of The Geek Narrator podcast, Lalith Suresh, CEO of Feldera, joins us to share insights on incremental view maintenance and its significance in modern data processing. We discuss the challenges posed by distributed systems, the mathematical foundation of DBSP, and how Feldera's architecture addresses these challenges. Performance optimization, handling late events, the future of stream processing, and the importance of SQL in creating efficient data workflows: it's all in here. Chapters: 00:00 Introduction to Incremental View Maintenance 06:30 Challenges in Distributed Systems 11:46 Batch Processing vs Stream Processing 16:27 Understanding DBSP: The Mathematical Foundation 27:46 Architecture of Feldera and Data Flow 39:23 Partitioning and Storage Layer in Feldera 42:51 Understanding Co-Design Storage Layers 45:52 Foreground and Background Workers in DBSP 49:16 Tuning Background Workers for Performance 49:41 Synchronous Compute Model and View Propagation 51:35 Zsets and Batch Processing in Stream Workloads 54:00 Data Model Optimization in Feldera 57:22 Handling Late Events and Lateness in Feldera 01:01:18 Watermarks and Lateness Annotations 01:04:20 Error Handling and Idempotency in Feldera 01:11:05 Feldera's Differentiators and Future Roadmap

Hashtag Trending
Preparing for the Intelligence Explosion, Project Synapse on Hashtag Trending Weekend

Hashtag Trending

Play Episode Listen Later Apr 5, 2025 68:26 Transcription Available


In this episode of Project Synapse, hosts discuss the underestimated changes brought about by advanced AI systems, the need for critical thinking, and preparedness for scenarios triggered by rapid technological advancements. The conversation covers the impactful paper 'Preparing for the Intelligence Explosion' by Will MacAskill and Fin Moorhouse, which emphasizes the acceleration of AI and the potential consequences on society. Amidst AI's advancements in diverse fields like manufacturing and cybersecurity, the hosts shed light on the importance of foresight and human adaptability to maintain balance and progress in an AI-driven future. 00:00 A Quiet Week and Unexpected Snow 00:17 Surviving the Ice Storm 01:48 Generator Troubles and Perplexity AI 03:23 Discussing the Intelligence Explosion Paper 04:44 Implications of Rapid AI Advancements 08:42 Historical Comparisons and Accelerated Change 12:47 Challenges in Organizational Change 17:14 Security Concerns in the Age of AI 22:34 Exponential Growth in AI Efficiency 34:18 AI Designing AI: The Future of Scalability 34:48 The Implications of Autonomous Warfare 35:44 Efficiency in AI Training and Compute 36:33 The Countdown to Superintelligence 37:35 Tariffs and Trade Imbalances: A Misunderstanding 44:01 Critical Thinking in the Age of AI 59:00 The Importance of Scenario Planning 01:00:47 The Future of Employment and Automation 01:03:14 The Human Element in a Technological World 01:04:43 Embracing AI in the Workplace 01:07:58 Concluding Thoughts: Imagining a Harmonious Future

Digital Currents
The AI Funding Frenzy: OpenAI, AMD & the Battle for Compute Power

Digital Currents

Play Episode Listen Later Apr 4, 2025 48:44


In this episode of the ABCDs Roundup, we break down OpenAI's massive $40 billion funding round, led by SoftBank, and its impact on AI infrastructure. We explore AMD's $4.9 billion acquisition of ZT Systems as it challenges Nvidia in the AI data center wars and take a broader look at tariffs affecting the AI and blockchain industries. We also cover the latest developments in TikTok's U.S. ownership battle and Fidelity's Bitcoin market update, which predicts a potential acceleration phase in this week's chart. Remember to Stay Current! To learn more, visit us on the web at https://www.morgancreekcap.com/morgan-creek-digital/. To speak to a team member or sign up for additional content, please email mcdigital@morgancreekcap.com   Legal Disclaimer This podcast is for informational purposes only and should not be construed as investment advice or a solicitation for the sale of any security, advisory, or other service. Investments related to the themes and ideas discussed may be owned by funds managed by the host and podcast guests. Any conflicts mentioned by the host are subject to change. Listeners should consult their personal financial advisors before making any investment decisions.

ESG Currents
How AWS Fosters Weeding Robots, Other Climate Tech

ESG Currents

Play Episode Listen Later Apr 2, 2025 33:50 Transcription Available


AI’s role in climate is often framed around increased energy use and emissions, but what if it could help solve the crisis? In this episode of Bloomberg Intelligence’s ESG Currents, we explore how Amazon Web Services is helping foster AI-driven climate-tech companies, including through the Compute for Climate Fellowship. AWS’ Head of Climate Tech Business Development, Startups and Venture Capital Lisbeth Kaufman joins BI director of ESG research Eric Kane to discuss how she drew inspiration from The Toxic Avenger, and highlights real-world applications of computing and AI in fusion energy, crop yields, pest mitigation and textile production. This episode was recorded on March 17. Learn more about the climate-tech startups discussed or apply for the fellowship here. See omnystudio.com/listener for privacy information.

The Twenty Minute VC: Venture Capital | Startup Funding | The Pitch
20VC: Microsoft CTO on Where Value Accrues in an AI World | Why Scaling Laws are BS | An Evaluation of Deepseek and How We Underestimate the Chinese | The Future of Software Development and The Future of Agents with Kevin Scott

The Twenty Minute VC: Venture Capital | Startup Funding | The Pitch

Play Episode Listen Later Mar 31, 2025 45:08


Kevin Scott is the CTO of Microsoft, where he leads the company's AI and technology strategy at global scale and played a pivotal role in Microsoft's partnership with OpenAI. Prior to Microsoft, Kevin spent six years at LinkedIn as SVP of Engineering. Kevin has also held advisory positions with Pinterest, Box, Code.org and more. In Today's Episode We Discuss: 04:10 Where is Enduring Value in a World of AI 10:53 Why Scaling Laws are BS 12:26 What is the Bottleneck Today: Data, Compute or Algorithms 15:38 In 10 Years Time: What % of Data Usage will be Synthetic 20:04 How Will AI Agents Evolve Over the Next Five Years 23:34 DeepSeek Evaluation: Do We Underestimate China 28:34 The Future of Software Development 31:53 The Thing That Most Excites Me in AI is Tech Debt 35:01 Leadership Lessons from Satya Nadella 41:13 Quickfire Round

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Unsupervised Learning is a podcast that interviews the sharpest minds in AI about what's real today, what will be real in the future and what it means for businesses and the world - helping builders, researchers and founders deconstruct and understand the biggest breakthroughs. Top guests: Noam Shazeer, Bob McGrew, Noam Brown, Dylan Patel, Percy Liang, David Luan https://www.latent.space/p/unsupervised-learning Timestamps 00:00 Introduction and Excitement for Collaboration 00:27 Reflecting on Surprises in AI Over the Past Year 01:44 Open Source Models and Their Adoption 06:01 The Rise of GPT Wrappers 06:55 AI Builders and Low-Code Platforms 09:35 Overhyped and Underhyped AI Trends 22:17 Product Market Fit in AI 28:23 Google's Current Momentum 28:33 Customer Support and AI 29:54 AI's Impact on Cost and Growth 31:05 Voice AI and Scheduling 32:59 Emerging AI Applications 34:12 Education and AI 36:34 Defensibility in AI Applications 40:10 Infrastructure and AI 47:08 Challenges and Future of AI 52:15 Quick Fire Round and Closing Remarks Chapters 00:00:00 Introduction and Collab Excitement 00:00:58 Open Source and Model Adoption 00:01:58 Enterprise Use of Open Source Models 00:02:57 The Competitive Edge of Closed Source Models 00:03:56 DeepSeek and Open Source Model Releases 00:04:54 Market Narrative and DeepSeek Impact 00:05:53 AI Engineering and GPT Wrappers 00:06:53 AI Builders and Low-Code Platforms 00:07:50 Innovating Beyond Existing Paradigms 00:08:50 Apple and AI Product Development 00:09:48 Overhyped and Underhyped AI Trends 00:10:46 Frameworks and Protocols in AI Development 00:11:45 Emerging Opportunities in AI 00:12:44 Stateful AI and Memory Innovation 00:13:44 Challenges with Memory in AI Agents 00:14:44 The Future of Model Training Companies 00:15:44 Specialized Use Cases for AI Models 00:16:44 Vertical Models vs General Purpose Models 00:17:42 General Purpose vs Domain-Specific Models 00:18:42 Reflections on Model Companies 00:19:39 Model Companies Entering 
Product Space 00:20:38 Competition in AI Model and Product Sectors 00:21:35 Coding Agents and Market Dynamics 00:22:35 Defensibility in AI Applications 00:23:35 Investing in Underappreciated AI Ventures 00:24:32 Analyzing Market Fit in AI 00:25:31 AI Applications with Product Market Fit 00:26:31 OpenAI's Impact on the Market 00:27:31 Google and OpenAI Competition 00:28:31 Exploring Google's Advancements 00:29:29 Customer Support and AI Applications 00:30:27 The Future of AI in Customer Support 00:31:26 Cost-Cutting vs Growth in AI 00:32:23 Voice AI and Real-World Applications 00:33:23 Scaling AI Applications for Demand 00:34:22 Summarization and Conversational AI 00:35:20 Future AI Use Cases and Market Fit 00:36:20 AI Education and Model Capabilities 00:37:17 Reforming Education with AI 00:38:15 Defensibility in AI Apps 00:39:13 Network Effects and AI 00:40:12 AI Brand and Market Positioning 00:41:11 AI Application Defensibility 00:42:09 LLM OS and AI Infrastructure 00:43:06 Security and AI Application 00:44:06 OpenAI's Role in AI Infrastructure 00:45:02 The Balance of AI Applications and Infrastructure 00:46:02 Capital Efficiency in AI Infrastructure 00:47:01 Challenges in AI DevOps and Infrastructure 00:47:59 AI SRE and Monitoring 00:48:59 Scaling AI and Hardware Challenges 00:49:58 Reliability and Compute in AI 00:50:57 Nvidia's Dominance and AI Hardware 00:51:57 Emerging Competition in AI Silicon 00:52:54 Agent Authentication Challenges 00:53:53 Dream Podcast Guests 00:54:51 Favorite News Sources and Startups 00:55:50 The Value of In-Person Conversations 00:56:50 Private vs Public AI Discourse 00:57:48 Latent Space and Podcasting 00:58:46 Conclusion and Final Thoughts

EUVC
EUVC | E433 | Inflection's Jonatan Luther-Bergquist on the development of crypto to sovereign compute

EUVC

Play Episode Listen Later Mar 28, 2025 44:09


In today's episode, Andreas Munk Holm talks with Jonatan Luther-Bergquist from Inflection.xyz, who shares his journey in framing crypto fundamentally as a compute technology and formulating their thesis on sovereign compute back in 2023. Jonatan explains that while the early work in crypto helped lay the groundwork for disruptive technology, he saw a bigger picture emerging - a need for Europe and other regions to improve their digital independence. In this conversation, Jonatan dives into the three pillars of his sovereign compute thesis: scaling compute capabilities, building resilient systems, and ensuring access to data flows. He illustrates his points with real-world examples, from photonics and semiconductor technologies to communications that can make all the difference in modern defense scenarios. By drawing parallels between evolving technology and the pressing geopolitical challenges, like those seen in Ukraine, Jonatan makes a compelling case for why investing in robust, forward-thinking compute solutions is essential for a secure and prosperous future. Chapters: 00:23 The Importance of Compute for Sovereignty 00:38 Key Areas for Improvement in Compute 01:27 Integrating Compute into Daily Life 02:54 The Story of Inflection 04:03 Inflection's Early Focus on Crypto 05:18 Transition to Sovereign Compute 11:22 Challenges in the Crypto Space 17:22 Understanding Sovereign Compute 21:24 Equipping Compute Resources with Quality Data 22:04 Understanding the Concept of Flow in Compute 22:15 Evaluating Vertical Focus in VC Firms 24:14 The Importance of Quality in Investment Decisions 28:52 Exploring Technologies in Sovereign Compute 29:40 Innovative Compute Solutions: From Semiconductors to Brain Tissue 31:47 The Role of Communication in Modern Warfare 33:48 The Geopolitical Importance of Sovereignty 34:27 Personal and Professional Journey in Defense Tech 41:26 Transitioning from Crypto to Sovereign Compute

AWS Podcast
#713: AWS News: Meet the Next Generation of Amazon SageMaker, Multi-Agent Collaboration on Bedrock

AWS Podcast

Play Episode Listen Later Mar 24, 2025 24:27


New game-changing AI developments are here, from SageMaker Unified Studio to Bedrock's new multi-agent capabilities. Join your hosts Simon and Jillian for the latest updates from AWS. 00:00:00 - Intro 00:00:49 - Top Stories 00:02:31 - Amazon Bedrock 00:05:35 - Analytics 00:06:08 - Application Integration 00:06:41 - AWS Step Functions Workflow Studio 00:06:59 - Amazon Bedrock 00:07:26 - GraphRAG 00:09:08 - Amazon Nova Pro Foundation Model 00:09:32 - Amazon S3 Tables and SageMaker Lakehouse 00:12:00 - Compute 00:13:30 - Customer Engagement 00:14:39 - Databases 00:15:09 - Developer Tools 00:17:09 - End User Computing 00:17:25 - Front End Web and Mobile 00:18:08 - Games, Internet of Things 00:20:12 - Management and Governance 00:20:31 - Networking and Content Delivery 00:20:41 - AWS Application Load Balancer 00:21:06 - Security, Identity, and Compliance 00:22:32 - Storage 00:23:47 - Wrap up

Fund/Build/Scale
Is AI Compute Broken? Tim Davis on Modular's $130M Bet

Fund/Build/Scale

Play Episode Listen Later Mar 18, 2025 46:27


Would you leave a stable, high-paying job at Google to build something that competes with NVIDIA, Intel, and AMD? That's exactly what Tim Davis, co-founder and president of Modular, did. Since then, his company has raised $130M to reimagine AI compute infrastructure, but are AI startups really desperate for a new compute layer? And what's it like to build a startup when your biggest competitors are trillion-dollar giants? In this episode of Fund/Build/Scale, Tim shares his vision for the future of AI compute, why talent is the real key to success, and some of the tough lessons he's learned from three startups. RUNTIME 46:27 EPISODE BREAKDOWN (1:26) “We are building a new accelerated execution platform for compute.” (6:41) “It will exist all over the place and it already does, but AI will be everywhere that compute is.” (11:18) “You only you only have so much time in a week. What is the thing that you're best at?” (15:13) “We have decided to start from the hardest part of the software stack.” (22:44) “For the most talented people in the world, the risk is actually not as great as what you think.” (30:24) “Growing up in Australia, my view of the of the United States was very much driven from the media and from Hollywood.” (33:26) “I sat in a room for six weeks and just met everyone that I could. And that really was the beginning of a journey to the United States.” (37:48) “I still think there's a special place in the Bay Area, and in the United States, there is a different risk appetite.” (40:41) The one question Tim would have to ask the CEO before he'd take a job at someone else's early-stage startup. LINKS Tim Davis, co-founder, president timdavis.com Chris Lattner, co-founder, CEO Modular AI startup Modular raises $100 mln in General Catalyst-led funding, 8/24/2023, Reuters

This Week in Machine Learning & Artificial Intelligence (AI) Podcast
Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723

This Week in Machine Learning & Artificial Intelligence (AI) Podcast

Play Episode Listen Later Mar 17, 2025 58:38


Today, we're joined by Jonas Geiping, research group leader at Ellis Institute and the Max Planck Institute for Intelligent Systems to discuss his recent paper, “Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach.” This paper proposes a novel language model architecture which uses recurrent depth to enable “thinking in latent space.” We dig into “internal reasoning” versus “verbalized reasoning”—analogous to non-verbalized and verbalized thinking in humans, and discuss how the model searches in latent space to predict the next token and dynamically allocates more compute based on token difficulty. We also explore how the recurrent depth architecture simplifies LLMs, the parallels to diffusion models, the model's performance on reasoning tasks, the challenges of comparing models with varying compute budgets, and architectural advantages such as zero-shot adaptive exits and natural speculative decoding. The complete show notes for this episode can be found at https://twimlai.com/go/723.

ChinaTalk
Building Compute in America

ChinaTalk

Play Episode Listen Later Mar 17, 2025 75:41


Despite leading the world in AI innovation, there's no guarantee that America will rise to meet the challenge of AI infrastructure. Specifically, the key technological barrier for data center construction within the next 5 years is new power capacity. To discuss policy solutions, ChinaTalk interviewed Ben Della Rocca, who helped write the AI infrastructure executive order and formerly served as director for technology and national security on Biden's NSC, as well as Arnab Datta, director at IFP and managing director at Employ America, and Tim Fist, a director at IFP. Arnab and Tim just published a fantastic three-part series exploring the policy changes needed to ensure that AGI is invented in the USA and deployed through American data centers. In today's interview, we discuss… The need for new power generation driven by ballooning demand for compute, The impact of the January 2025 executive order on AI infrastructure, Which energy technologies can (and can't) power gigawatt-scale AI training facilities (and why Jordan is all-in on GEOTHERMAL), Challenges for financing moonshot green power ideas and the role of government action, The failure of the market to prioritize AI lab security, and what can be done to fend off threats from adversaries and non-state actors. Outro music: Ghost Crew - 蝴蝶武士 (Butterfly Warriors) (YouTube link)

Unsupervised Learning
Ep 58: Google Researchers Noam Shazeer and Jack Rae on Scaling Test-time Compute, Reactions to Ilya & AGI

Unsupervised Learning

Play Episode Listen Later Mar 17, 2025 69:28


ChinaEconTalk
Building Compute in America

ChinaEconTalk

Play Episode Listen Later Mar 17, 2025 75:41


Despite leading the world in AI innovation, there's no guarantee that America will rise to meet the challenge of AI infrastructure. Specifically, the key technological barrier for data center construction within the next 5 years is new power capacity. To discuss policy solutions, ChinaTalk interviewed Ben Della Rocca, who helped write the AI infrastructure executive order and formerly served as director for technology and national security on Biden's NSC, as well as Arnab Datta, director at IFP and managing director at Employ America, and Tim Fist, a director at IFP. Arnab and Tim just published a fantastic three-part series exploring the policy changes needed to ensure that AGI is invented in the USA and deployed through American data centers. In today's interview, we discuss… The need for new power generation driven by ballooning demand for compute, The impact of the January 2025 executive order on AI infrastructure, Which energy technologies can (and can't) power gigawatt-scale AI training facilities (and why Jordan is all-in on GEOTHERMAL), Challenges for financing moonshot green power ideas and the role of government action, The failure of the market to prioritize AI lab security, and what can be done to fend off threats from adversaries and non-state actors. Outro music: Ghost Crew - 蝴蝶武士 (Butterfly Warriors) (YouTube link)

Azizi Podcast
#117 - AI, Decentralization & The Future of Compute | Mori Zihayat, Heisenberg Network

Azizi Podcast

Play Episode Listen Later Mar 16, 2025 46:08


In this episode of the Azizi Podcast, host Samir Azizi sits down with Mori Zihayat, a core contributor to Heisenberg Network (Heisenberg.so). They dive deep into the world of AI, decentralization, and the future of compute, covering topics like: - Mori's journey into AI, blockchain, and decentralized computing - The problems facing AI projects today and why many struggle - How AI depends on structured data and what most people get wrong - Heisenberg Network's mission to revolutionize AI compute and data processing - How anyone can contribute their unused CPU power and earn crypto If you're interested in the future of AI, decentralized infrastructure, and how you can profit from the AI revolution, this episode is for you. Learn more about Heisenberg Network: Website: https://www.heisenberg.so/ Join the Heisenberg Node Waitlist: https://www.heisenberg.so/heisenberg-node Follow Mori Zihayat: X: https://x.com/MoriZihayat LinkedIn: https://www.linkedin.com/in/morteza-zihayat/ Follow Heisenberg Network: X: https://x.com/HeisenbergNet LinkedIn: https://www.linkedin.com/company/heisenbergnet Subscribe for more AI, blockchain, and tech deep dives.  

ANTIC The Atari 8-bit Podcast
ANTIC Episode 115 - Exhausting Games for your Atari

ANTIC The Atari 8-bit Podcast

Play Episode Listen Later Mar 10, 2025 88:05


ANTIC Episode 115 In this episode of ANTIC The Atari 8-Bit Computer Podcast… we talk lots of contest news, Mr. Paint, a DIY Atari-themed monitor, and lots of other Atari 8-bit news.  Plus, we find a book on “exhausting” Atari games! READY! Recurring Links  Floppy Days Podcast  AtariArchives.org  AtariMagazines.com  Kay's Book “Terrible Nerd”  New Atari books scans at archive.org  ANTIC feedback at AtariAge  Atari interview discussion thread on AtariAge  Interview index: here  ANTIC Facebook Page  AHCS  Eaten By a Grue  Next Without For  Links for Items Mentioned in Show: What we've been up to Scanned stuff from Timothy Onders https://archive.org/details/stx_Atari_400_800_Personal_Computer_System_Operating_System_Listing_1981-02_CO16579 https://archive.org/details/stx_Atari_400_800_Personal_Computer_System_Hardware_Manual_CO16555_1980-10 https://archive.org/details/APX_Isopleth_Map-Making_Package_manual_APX-20103_1982-06 Pilot book - “Atari 400/800 Student Pilot Reference Guide” by Atari - https://archive.org/details/atari_pilot-student-guide  Scanned JACG (Jersey Atari Computer Group) newsletters: October, 1985 - https://archive.org/details/jacg-newsletter-1985-october-vol-5-no-2  November, 1985 - https://archive.org/details/jacg-newsletter-1985-november-vol-5-no-3  Atari newsletters at Internet Archive - https://docs.google.com/spreadsheets/d/1RkznDDlOL2O_K-RrbkajIuo6DvYof6Ajrn7j9NTcoDM/edit?usp=sharing  Recent Interviews ANTIC Interview 453 - Giann Velasquez, Atariteca - https://ataripodcast.libsyn.com/antic-interview-453-giann-velasquez-atariteca  ANTIC Interview 452 - Dean Garraghty, DGS Software  ANTIC Interview 454 - Steve Kranish, Parker Brothers Frogger  News Mr. Paint by Wade Ripkowski: https://github.com/Ripjetski6502/MrPaint  https://forums.atariage.com/topic/379270-mr-paint/  Atari ‘faux neon' LED logo sign, $40 on pre-order - https://atari.com/products/atari-neon-led-sign-white-12-x-13  “errant” on git - using Atari as a keyboard for a PC. 
Code and instructions posted: https://git.sdf.org/errant/keytari  https://voidptr.org/  Arcade Centipede emulated on Atari 800XL - https://forums.atariage.com/topic/379015-centipede-emulator-for-the-atari-800xl/  FujiCup 2024 Results Announced:  https://fujicup.pl/  results page for 2024 - https://fujicup.pl/wyniki2024  Video - https://www.youtube.com/watch?v=xW-z9tD1OW4  Download all 2024 games in ZIP archive  Atari Homebrew Awards 2024: https://www.youtube.com/watch?v=0b3g4Czr0BE  Best Atari 8-Bit/5200 Homebrew (Original) - https://forums.atariage.com/topic/379180-7th-annual-atari-homebrew-awards-atari-8-bit5200-homebrew-original/  Best Atari 8-Bit/5200 Homebrew (Port) - ​​https://forums.atariage.com/topic/379181-7th-annual-atari-homebrew-awards-atari-8-bit5200-homebrew-port/  Best Atari 8-Bit/5200 WIP (Original) - https://forums.atariage.com/topic/379182-7th-annual-atari-homebrew-awards-atari-8-bit5200-wip-original/  Best Atari 8-Bit/5200 WIP (Port) - https://forums.atariage.com/topic/379183-7th-annual-atari-homebrew-awards-atari-8-bit5200-wip-port/  800XL gets a mention in Hackaday article - https://hackaday.com/2025/02/21/genetic-algorithm-runs-on-atari-800-xl/  XCL10 Monitor - Marcin "Fokaszalot" - Baran - https://atarionline.pl/v01/index.php?ct=nowinki&ucat=1&subaction=showfull&id=1740334426  BASIC 10-Liner Contest - https://gkanold.wixsite.com/homeputerium/copy-of-games-list-2024  Via bill kendrick - https://www.timeextension.com/features/interview-it-was-a-suicide-mission-larry-siegel-reflects-on-ataris-failed-war-on-nintendo  Compute! 
Magazine ATR by Issue #4 to #95 - Rory McMahon - https://discord.com/channels/1071168010427060324/1071168010427060327/1340108131690348607  https://www.eurogamer.net/40-years-on-rescue-on-fractalus-remains-a-rare-reminder-of-the-magic-of-lucasfilm-games  Computer Dealer Demos: Selling Home Computers with Bouncing Balls and Animated Logos by Patryk Wasiak, Institute for Cultural Studies, University of Wrocław, Poland - https://www.academia.edu/10744534/Computer_Dealer_Demos_Selling_Home_Computers_with_Bouncing_Balls_and_Animated_Logos?email_work_card=title  Why the N tools?” By Thomas Cherryhomes:  https://fujinet.online/2025/02/21/atari-why-the-n-tools/  Video - https://youtu.be/BUR_KRTRWk0  1090XL Expansion case: https://forums.atariage.com/topic/318373-1090xl-remake/page/41/#findComment-5620900  Link to STLs: https://makerworld.com/en/models/1084156  Upcoming Shows Midwest Gaming Classic - April 4-6 - Baird Center, Milwaukee, WI - https://www.midwestgamingclassic.com/  VCF East - April 4-6, 2025 - Wall, NJ - http://www.vcfed.org  Indy Classic Computer and Video Game Expo - April 12-13 - Crowne Plaza Airport Hotel, Indianapolis, IN - https://indyclassic.org/  VCF Europe - May 3-4 - Munich, Germany - https://vcfe.org/E/  Retrofest 2025 - May 31-June1 - Steam Museum of the Great Western Railway, Swindon, UK - https://retrofest.uk/  Vancouver Retro Gaming Expo - June 14 - New Westminster, BC, Canada - https://www.vancouvergamingexpo.com/index.html  VCF Southwest - June 20-22, 2025 - Davidson-Gundy Alumni Center at UT Dallas - https://www.vcfsw.org/  Southern Fried Gaming Expo and VCF Southeast - June 20-22, 2025 - Atlanta, GA - https://gameatl.com/  Silly Venture SE (Summer Edition) - July 31-Aug. 
3 - Gdansk, Poland - https://www.demoparty.net/silly-venture/silly-venture-2025-se   Fujiama - August 11-17 - Lengenfeld, Germany - http://atarixle.ddns.net/fuji/2025/  VCF Midwest - September 13-14, 2025 - Renaissance Schaumburg Convention Center in Schaumburg, IL - http://vcfmw.org/  Portland Retro Gaming Expo - October 17-19 - Oregon Convention Center, Portland, OR - https://retrogamingexpo.com/  Event page on Floppy Days Website - https://docs.google.com/document/d/e/2PACX-1vSeLsg4hf5KZKtpxwUQgacCIsqeIdQeZniq3yE881wOCCYskpLVs5OO1PZLqRRF2t5fUUiaKByqQrgA/pub  YouTube Videos The Atari 800 Quick Repair Guide ! - Paul Westphal - https://www.youtube.com/watch?v=5R7CpvJLERk  Atari Pioneers Spill: 80s Gaming's Untold Stories! - Convention Coverage - https://www.youtube.com/watch?v=YexxqfHUeik  Cutting Edge, Atari XL/XE 64 bytes intro - Freddy Offenga - https://www.youtube.com/watch?v=bcoGgFd-3Nc (From LoveByte 2025 - https://lovebyte.party/ ) "Abundance" 128 Byte Intro Atari XL/XE - gorgh Atari - https://www.youtube.com/watch?v=T6HmWxcGVrg  New at Archive.org  https://archive.org/details/addison-wesley-adventures-voor-uw-atari-xlxe  https://archive.org/details/addison-wesley-afmattende-spelen-voor-uw-atari-600-xl-800-xl  https://archive.org/details/great-lakes-atari-digest-june-1989-vol-1-no-4  https://archive.org/details/great-lakes-atari-digest-october-1989-vol-1-no-8  https://archive.org/details/catch-on-to-computers-with-atari-logo-post-cereal  https://archive.org/details/computer-shopper-april-1987-vol-7-num-4-atari-articles  https://archive.org/details/salespersons-guide-to-the-atari-400-home-computer-system/page/n1/mode/2up  https://archive.org/details/excalibur-magazine/  https://archive.org/details/capitol-hill-atari-owners-society-software-library-disk-catalog-march-1987  https://archive.org/details/atari-price-list-june-1982-and-letters/mode/2up  https://archive.org/details/grand-rapids-atari-systems-supporters-software-library-disk-catalog-1987  Commercial 
Atari XE Computer System Commercial (1988) - https://www.youtube.com/watch?v=LjWEE5r8Rak  Feedback Chris Lorenzo - Vintage Gaming Memories (YouTube) - Atari Addict Collectors Issue Magazine 

AWS Podcast
#711: AWS News: Claude 3.7 Meets Amazon Bedrock, Bedrock Data Automation

Mar 10, 2025 23:25


Anthropic's most advanced AI model yet is now on Amazon Bedrock, plus multimodal content analysis with Bedrock Data Automation. Keep up with these updates and more on this week's AWS News. Chapters: 00:00:00 - Intro 00:01:02 - Anthropic Claude 3.7 00:03:14 - Amazon Bedrock Data Automation 00:05:54 - Analytics 00:06:50 - Artificial Intelligence 00:09:23 - Compute 00:12:37 - Customer Engagement 00:13:50 - Databases 00:16:11 - Developer Tools 00:17:05 - End User Computing 00:17:23 - Front End Web and Mobile 00:18:31 - Management and Governance 00:19:34 - Migration and Modernization 00:20:43 - Security, Identity, and Compliance 00:21:56 - Storage 00:22:12 - Outro

The Fintech Blueprint
Building a Trustless Supercomputer for Web3 and AI, with Arweave's Founder Sam Williams

Mar 10, 2025 46:04


Lex interviews Sam Williams, founder of Arweave. This episode delves into Arweave, a protocol designed for permanent data storage and computation within the Web3 ecosystem. The discussion ranges from the economic models underpinning Arweave to its potential applications in decentralized finance (DeFi) and beyond.
Notable discussion points:
The Founding of Arweave and its Mission – Sam Williams' interest in distributed computing and concerns about authoritarianism led him to create Arweave in 2017. Inspired by the Snowden leaks, he saw the need for a blockchain-based permanent storage solution to protect journalism, historical records, and digital assets from censorship.
Decentralized vs. Distributed Storage – Williams explained how Arweave differs from alternatives like IPFS and Filecoin. Unlike traditional storage, which requires ongoing payments, Arweave uses a one-time payment model. This storage endowment leverages declining storage costs to ensure long-term data persistence without relying on centralized infrastructure.
Arweave's Expansion into Decentralized Compute – Arweave has evolved beyond storage to develop decentralized computing through "Arweave IO." This enables parallelized smart contract execution, making it possible to run AI models, financial automation, and decentralized apps on-chain, in line with Web3's shift toward autonomous, intelligent systems.
Topics: Arweave, permanent data storage, Web3, decentralized systems, distributed systems, blockchain, economic models, IPFS, Filecoin, decentralized computing, decentralized finance, compute

Game and Compute
Website Audit Series: high-end fashion and apparel niche (Shopify, social media conversions, blogs, logos)

Mar 9, 2025 55:49


Free Website Audit Series: high-end fashion and apparel niche (Shopify, social media conversions, blogs, logos). A very good job on this website, and a nice Shopify store! I am providing this free website audit content for the Discord community I am a part of, and for all you Game and Compute listeners out there. Enjoy!

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups
National Security Strategy and AI Evals on the Eve of Superintelligence with Dan Hendrycks

Mar 5, 2025 36:24


This week on No Priors, Sarah is joined by Dan Hendrycks, director of the Center for AI Safety. Dan serves as an advisor to xAI and Scale AI. He is a longtime AI researcher, publisher of interesting AI evals such as "Humanity's Last Exam," and co-author of a new national security paper, "Superintelligence Strategy," along with Scale founder-CEO Alex Wang and former Google CEO Eric Schmidt. They explore AI safety, geopolitical implications, and the potential weaponization of AI, along with policy recommendations. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @DanHendrycks Show Notes: 0:00 Introduction 0:36 Dan's path to focusing on AI Safety 1:25 Safety efforts in large labs 3:12 Distinguishing alignment and safety 4:48 AI's impact on national security 9:59 How might AI be weaponized? 14:43 Immigration policies for AI talent 17:50 Mutually assured AI malfunction 22:54 Policy suggestions for current administration 25:34 Compute security 30:37 Current state of evals

The Lawfare Podcast
Lawfare Daily: Tim Fist and Arnab Datta on the Race to Build AI Infrastructure in America

Mar 4, 2025 41:56


Tim Fist, Director of Emerging Technology Policy at the Institute for Progress (IFP), and Arnab Datta, Director of Infrastructure Policy at IFP and Managing Director of Policy Implementation at Employ America, join Kevin Frazier, a Contributing Editor at Lawfare and adjunct professor at Delaware Law, to dive into the weeds of their thorough report on building America's AI infrastructure. The duo extensively studied the gulf between the stated goals of America's AI leaders and the practical hurdles to realizing those ambitious aims. Check out the entire report series here: Compute in America. To receive ad-free podcasts, become a Lawfare Material Supporter at www.patreon.com/lawfare. You can also support Lawfare by making a one-time donation at https://givebutter.com/lawfare-institute. Support this show: http://supporter.acast.com/lawfare. Hosted on Acast. See acast.com/privacy for more information.

The AI Breakdown: Daily Artificial Intelligence News and Discussions
Why AI Compute Consumption Isn't Slowing Down

Feb 28, 2025 20:51


Despite reports of Microsoft canceling data center leases, AI compute demand is still accelerating. This episode breaks down Wall Street's ongoing AI skepticism and the reasons why companies like Meta, OpenAI, and Apple are making massive infrastructure bets. Plus, key takeaways from Nvidia's latest earnings and what reasoning models mean for the future of computing.
Brought to you by:
KPMG – Go to www.kpmg.us/ai to learn more about how KPMG can help you drive value with our AI solutions.
Vanta - Simplify compliance - https://vanta.com/nlw
The Agent Readiness Audit from Superintelligent - Go to https://besuper.ai/ to request your company's agent readiness score.
The AI Daily Brief helps you understand the most important news and discussions in AI.
Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614
Subscribe to the newsletter: https://aidailybrief.beehiiv.com/
Join our Discord: https://bit.ly/aibreakdown

The Twenty Minute VC: Venture Capital | Startup Funding | The Pitch
20VC: Why Google Will Win the AI Arms Race & OpenAI Will Not | NVIDIA vs AMD: Who Wins and Why | The Future of Inference vs Training | The Economics of Compute & Why To Win You Must Have Product, Data & Compute with Steeve Morin @ ZML

Feb 24, 2025 72:32


Steeve Morin is the Founder & CEO @ ZML, a next-generation inference engine enabling peak performance on a wide range of chips. Prior to founding ZML, Steeve was VP of Engineering at Zenly for 7 years, leading the engineering team as the product grew to millions of users and was acquired by Snap. In Today's Episode We Discuss: 04:17 How Will Inference Change and Evolve Over the Next 5 Years 09:17 Challenges and Innovations in AI Hardware 15:38 The Economics of AI Compute 18:01 Training vs. Inference: Infrastructure Needs 25:08 The Future of AI Chips and Market Dynamics 34:43 Nvidia's Market Position and Competitors 38:18 Challenges of Incremental Gains in the Market 39:12 The Zero Buy-In Strategy 39:34 Switching Between Compute Providers 40:40 The Importance of a Top-Down Strategy for Microsoft and Google 41:42 Microsoft's Strategy with AMD 45:50 Data Center Investments and Training 46:40 How to Succeed in AI: The Triangle of Products, Data, and Compute 48:25 Scaling Laws and Model Efficiency 49:52 Future of AI Models and Architectures 57:08 Retrieval Augmented Generation (RAG) 01:00:52 Why OpenAI's Position is Not as Strong as People Think 01:06:47 Challenges in AI Hardware Supply