Podcasts about benchmarks

  • 1,184PODCASTS
  • 1,881EPISODES
  • 38mAVG DURATION
  • 5WEEKLY NEW EPISODES
  • Jun 19, 2025LATEST

POPULARITY

20172018201920202021202220232024

Categories



Best podcasts about benchmarks

Show all podcasts related to benchmarks

Latest podcast episodes about benchmarks

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Solving Poker and Diplomacy, Debating RL+Reasoning with Ilya, what's *wrong* with the System 1/2 analogy, and where Test-Time Compute hits a wall Timestamps 00:00 Intro – Diplomacy, Cicero & World Championship 02:00 Reverse Centaur: How AI Improved Noam's Human Play 05:00 Turing Test Failures in Chat: Hallucinations & Steerability 07:30 Reasoning Models & Fast vs. Slow Thinking Paradigm 11:00 System 1 vs. System 2 in Visual Tasks (GeoGuessr, Tic-Tac-Toe) 14:00 The Deep Research Existence Proof for Unverifiable Domains 17:30 Harnesses, Tool Use, and Fragility in AI Agents 21:00 The Case Against Over-Reliance on Scaffolds and Routers 24:00 Reinforcement Fine-Tuning and Long-Term Model Adaptability 28:00 Ilya's Bet on Reasoning and the O-Series Breakthrough 34:00 Noam's Dev Stack: Codex, Windsurf & AGI Moments 38:00 Building Better AI Developers: Memory, Reuse, and PR Reviews 41:00 Multi-Agent Intelligence and the “AI Civilization” Hypothesis 44:30 Implicit World Models and Theory of Mind Through Scaling 48:00 Why Self-Play Breaks Down Beyond Go and Chess 54:00 Designing Better Benchmarks for Fuzzy Tasks 57:30 The Real Limits of Test-Time Compute: Cost vs. Time 1:00:30 Data Efficiency Gaps Between Humans and LLMs 1:03:00 Training Pipeline: Pretraining, Midtraining, Posttraining 1:05:00 Games as Research Proving Grounds: Poker, MTG, Stratego 1:10:00 Closing Thoughts – Five-Year View and Open Research Directions Chapters 00:00:00 Intro & Guest Welcome 00:00:33 Diplomacy AI & Cicero Insights 00:03:49 AI Safety, Language Models, and Steerability 00:05:23 O Series Models: Progress and Benchmarks 00:08:53 Reasoning Paradigm: Thinking Fast and Slow in AI 00:14:02 Design Questions: Harnesses, Tools, and Test Time Compute 00:20:32 Reinforcement Fine-tuning & Model Specialization 00:21:52 The Rise of Reasoning Models at OpenAI 00:29:33 Data Efficiency in Machine Learning 00:33:21 Coding & AI: Codex, Workflows, and Developer Experience 00:41:38 Multi-Agent AI: Collaboration, Competition, and Civilization 00:45:14 Poker, Diplomacy & Exploitative vs. Optimal AI Strategy 00:52:11 World Models, Multi-Agent Learning, and Self-Play 00:58:50 Generative Media: Image & Video Models 01:00:44 Robotics: Humanoids, Iteration Speed, and Embodiment 01:04:25 Rapid Fire: Research Practices, Benchmarks, and AI Progress 01:14:19 Games, Imperfect Information, and AI Research Directions

Versicherungsfunk
Versicherungsfunk Update 16.06.2025

Versicherungsfunk

Play Episode Listen Later Jun 16, 2025 3:36


Die Themen im heutigen Versicherungsfunk Update sind: Frühverrentung bei Babyboomern belastet Rentensystem Rund 900.000 Menschen aus den geburtenstarken Jahrgängen der Babyboomer erhielten Ende 2023 eine gesetzliche Altersrente, obwohl sie das reguläre Renteneintrittsalter noch nicht erreicht hatten. Das geht aus einer neuen IW-Analyse hervor. Insgesamt beziehen bereits 4,5 Mio. Babyboomer eine Altersrente. Die Frühverrentung belastet das Rentensystem – und zeigt, dass Anreize für einen späteren Renteneintritt bislang kaum Wirkung zeigen. ASCORE aktualisiert Scoring für PHV, Diensthaftpflicht und Cyber ASCORE Analyse hat die Scoring-Ergebnisse 2025 für die Private Haftpflicht-, Diensthaftpflicht- und Cyberversicherung veröffentlicht. Neue Kriterien und angepasste Benchmarks berücksichtigen aktuelle Markttrends. In der Privathaftpflicht erreichten nur 26 Familien- und 31 Single-Tarife die Höchstwertung. Im Bereich Cyber schaffte es lediglich ein Tarif auf sechs Kompasse. Auch die Diensthaftpflicht wurde um neue Kriterien wie eine Innovationsklausel erweitert. Managerhaftung: Cyberrisiken bleiben Top-Thema für Führungskräfte Cyberangriffe und Datenverluste stellen laut aktueller D&O-Studie von Willis und Clyde & Co. die größten Haftungsrisiken für Manager dar. In Deutschland sehen 88 % der Führungskräfte Cyberattacken als Top-Risiko – doch nur 17 % geben an, dass Risikomanagement unter den drei wichtigsten Themen ihres Boards rangiert. Neben Cyber zählen auch Gesundheit, Arbeitssicherheit und regulatorische Risiken zu den größten D&O-Herausforderungen. Generali und Laureus: Neue Partnerschaft für mehr Chancengleichheit durch Sport Die Generali Deutschland und die Stiftung Laureus Sport for Good kooperieren künftig im Rahmen des „Ökosystem Sport“, um benachteiligte Kinder und Jugendliche durch Sport zu fördern. Im Fokus stehen soziale Sportprojekte zur Inklusion, Bildung und Gesundheit. Mit der Partnerschaft baut Generali ihr gesellschaftliches Engagement weiter aus und verbindet ihre Sponsoringstrategie mit konkreter sozialer Wirkung. Ziel ist es, durch gemeinsame Programme neue Förderregionen zu erschließen und bestehende Projekte nachhaltig zu stärken. VGH mit Wachstum in allen Sparten – Lebensversicherung legt besonders zu Die VGH Versicherungen melden für 2024 einen Anstieg der Beitragseinnahmen auf 2,94 Mrd. EUR (+4,3 %). Besonders deutlich legte die Lebensversicherung zu: Das Neugeschäft mit laufenden Beiträgen wuchs um 34,2 % – entgegen dem Marktrend. Die Wohngebäudeschäden blieben 2024 moderat, das Eigenkapital der Gruppe wurde um 131 Mio. EUR gestärkt. Auch in der privaten Krankenversicherung konnte die VGH über Marktniveau zulegen. Für 2025 kündigt das Unternehmen neue Produkte wie eine betriebliche Krankenversicherung an. Fünf Tierseuchen gleichzeitig – VTV meldet Rekordschäden Die Vereinigte Tierversicherung (VTV) verzeichnete 2024 mit 96 Mio. EUR das teuerste Schadenjahr ihrer Geschichte – vor allem durch Tierseuchen wie Blauzungenkrankheit, Afrikanische Schweinepest und Vogelgrippe. Für 2025 wurde erstmals der gleichzeitige Ausbruch von fünf Tierseuchen in Deutschland registriert. Besonders betroffen: Wiederkäuer in NRW, Schleswig-Holstein und Rheinland-Pfalz. Eine staatliche Entschädigung gibt es nicht – Versicherungsschutz ist damit wichtiger denn je. Gleichzeitig meldet das Statistische Bundesamt einen Preisanstieg von +9,8 % für tierische Erzeugnisse gegenüber dem Vorjahr – ein Indiz für knapper werdendes Angebot und steigenden wirtschaftlichen Druck auf Tierhalter.

The top AI news from the past week, every ThursdAI

Hey folks, this is Alex, finally back home! This week was full of crazy AI news, both model related but also shifts in the AI landscape and big companies, with Zuck going all in on scale & execu-hiring Alex Wang for a crazy $14B dollars. OpenAI meanwhile, maybe received a new shipment of GPUs? Otherwise, it's hard to explain how they have dropped the o3 price by 80%, while also shipping o3-pro (in chat and API). Apple was also featured in today's episode, but more so for the lack of AI news, completely delaying the “very personalized private Siri powered by Apple Intelligence” during WWDC25 this week. We had 2 guests on the show this week, Stefania Druga and Eric Provencher (who builds RepoPrompt). Stefania helped me cover the AI Engineer conference we all went to last week, and shared some cool Science CoPilot stuff she's working on, while Eric is the GOTO guy for O3-pro helped us understand what this model is great for! As always, TL;DR and show notes at the bottom, video for those who prefer watching is attached below, let's dive in! Big Companies LLMs & APIsLet's start with big companies, because the landscape has shifted, new top reasoner models dropped and some huge companies didn't deliver this week! Zuck goes all in on SuperIntelligence - Meta's $14B stake in ScaleAI and Alex WangThis may be the most consequential piece of AI news today. Fresh from the dissapointing results of LLama 4, reports of top researchers leaving the Llama team, many have decided to exclude Meta from the AI race. We have a saying at ThursdAI, don't bet against Zuck! Zuck decided to spend a lot of money (nearly 20% of their reported $65B investment in AI infrastructure) to get a 49% stake in Scale AI and bring Alex Wang it's (now former) CEO to lead the new Superintelligence team at Meta. For folks who are not familiar with Scale, it's a massive company in providing human annotated data services to all the big AI labs, Google, OpenAI, Microsoft, Anthropic.. all of them really. Alex Wang, is the youngest self made billionaire because of it, and now Zuck not only has access to all their expertise, but also to a very impressive AI persona, who could help revive the excitement about Meta's AI efforts, help recruit the best researchers, and lead the way inside Meta. Wang is also an outspoken China hawk who spends as much time in congressional hearings as in Slack, so the geopolitics here are … spicy. Meta just stapled itself to the biggest annotation funnel on Earth, hired away Google's Jack Rae (who was on the pod just last week, shipping for Google!) for brainy model alignment, and started waving seven-to-nine-figure comp packages at every researcher with “Transformer” in their citation list. Whatever disappointment you felt over Llama-4's muted debut, Zuck clearly felt it too—and responded like a founder who still controls every voting share. OpenAI's Game-Changer: o3 Price Slash & o3-pro launches to top the intelligence leaderboards!Meanwhile OpenAI dropping not one, but two mind-blowing updates. First, they've slashed the price of o3—their premium reasoning model—by a staggering 80%. We're talking from $40/$10 per million tokens down to just $8/$2. That's right, folks, it's now in the same league as Claude Sonnet cost-wise, making top-tier intelligence dirt cheap. I remember when a price drop of 80% after a year got us excited; now it's 80% in just four months with zero quality loss. They've confirmed it's the full o3 model—no distillation or quantization here. How are they pulling this off? I'm guessing someone got a shipment of shiny new H200s from Jensen!And just when you thought it couldn't get better, OpenAI rolled out o3-pro, their highest intelligence offering yet. Available for pro and team accounts, and via API (87% cheaper than o1-pro, by the way), this model—or consortium of models—is a beast. It's topping charts on Artificial Analysis, barely edging out Gemini 2.5 as the new king. Benchmarks are insane: 93% on AIME 2024 (state-of-the-art territory), 84% on GPQA Diamond, and nearing a 3000 ELO score on competition coding. Human preference tests show 64-66% of folks prefer o3-pro for clarity and comprehensiveness across tasks like scientific analysis and personal writing.I've been playing with it myself, and the way o3-pro handles long context and tough problems is unreal. As my friend Eric Provencher (creator of RepoPrompt) shared on the show, it's surgical—perfect for big refactors and bug diagnosis in coding. It's got all the tools o3 has—web search, image analysis, memory personalization—and you can run it in background mode via API for async tasks. Sure, it's slower due to deep reasoning (no streaming thought tokens), but the consistency and depth? Worth it. Oh, and funny story—I was prepping a talk for Hamel Hussain's evals course, with a slide saying “don't use large reasoning models if budget's tight.” The day before, this price drop hits, and I'm scrambling to update everything. That's AI pace for ya!Apple WWDC: Where's the Smarter Siri? Oh Apple. Sweet, sweet Apple. Remember all those Bella Ramsey ads promising a personalized Siri that knows everything about you? Well, Craig Federighi opened WWDC by basically saying "Yeah, about that smart Siri... she's not coming. Don't wait up."Instead, we got:* AI that can combine emojis (revolutionary!

PodRocket - A web development podcast from LogRocket
10 years of SolidJS with Ryan Carniato

PodRocket - A web development podcast from LogRocket

Play Episode Listen Later Jun 12, 2025 43:21


Ryan Carniato, creator of SolidJS, joins the podcast to reflect on a decade of developing the framework. We dive into the evolution of frontend tooling, the rise of fine-grained reactivity, and why SolidJS continues to challenge virtual DOM conventions. Ryan also shares insights on open source maintenance, web standards, and the future of UI architecture. Links YouTube: https://www.youtube.com/@ryansolid X: https://x.com/ryancarniato Dev.to: https://dev.to/ryansolid SolidJS Website: https://www.solidjs.com Resources A Decade of SolidJS: https://dev.to/this-is-learning/a-decade-of-solidjs-32f4 We want to hear from you! How did you find us? Did you see us on Twitter? In a newsletter? Or maybe we were recommended by a friend? Let us know by sending an email to our producer, Em, at emily.kochanek@logrocket.com (mailto:emily.kochanek@logrocket.com), or tweet at us at PodRocketPod (https://twitter.com/PodRocketpod). Follow us. Get free stickers. Follow us on Apple Podcasts, fill out this form (https://podrocket.logrocket.com/get-podrocket-stickers), and we'll send you free PodRocket stickers! What does LogRocket do? LogRocket provides AI-first session replay and analytics that surfaces the UX and technical issues impacting user experiences. Start understanding where your users are struggling by trying it for free at LogRocket.com. Try LogRocket for free today. (https://logrocket.com/signup/?pdr) Special Guest: Ryan Carniato.

SaaS Metrics School
What's a Healthy G&A Budget for SaaS? Benchmarks by ARR Stage

SaaS Metrics School

Play Episode Listen Later Jun 11, 2025 4:25


In episode #285 of SaaS Metrics School, Ben Murray dives into one of the most overlooked levers in SaaS financial performance—G&A (General & Administrative) spend. How much should you really be spending on back office functions like finance, HR, legal, and IT? Using data from Benchmarkit.AI, Ben walks through G&A as a percent of revenue across ARR stages—from startups under $1M to companies exceeding $100M. He also explains how operating leverage is created through back office efficiency and why using benchmarks segmented by ARR is crucial in SaaS metrics analysis. What You'll Learn: Why aggregate SaaS benchmarks are dangerous G&A benchmarks by ARR segment (top quartile vs. median) The role of operating leverage in SaaS profitability How to evaluate your own back office spend using metrics Actionable targets for G&A as a percent of revenue SaaS Metrics Covered: G&A % of Revenue Operating Leverage Opex Profile by ARR Benchmarking by ARR vs. ACV Resources Mentioned: Benchmarkit.ai Join Ben's SaaS Metrics community for webinars, templates, and live sessions: https://www.thesaasacademy.com/offers/ivNjwYDx/checkout Subscribe to Ben's SaaS newsletter: https://mailchi.mp/df1db6bf8bca/the-saas-cfo-sign-up-landing-page

programmier.bar – der Podcast für App- und Webentwicklung
News AI 23/25: Claude 4 // DeepSeek R1 0528 // Chatterbox TTS // Mem0 // Koog // Duolingo AI-first

programmier.bar – der Podcast für App- und Webentwicklung

Play Episode Listen Later Jun 5, 2025 34:44


Anthropic bringt mit Claude 4 die nächste Modellreihe heraus und setzt damit neue Maßstäbe in den Benchmarks.DeepSeek-R1 hat ein Update vom 28. Mai erhalten und ist nun wieder auf Augenhöhe mit den anderen großen Modellen.Chatterbox TTS ist ein neues Open-Source-Modell, das Stimmen mit nur fünf Sekunden Vorlage klonen kann.Mem0 ist ein Konzept, das langfristiges Gedächtnis (Long Term Memory) für Agenten verbessern soll.Trea nutzt ein Multi-Stage-Agentensystem, um aus LLMs bessere Coding-Ergebnisse herauszuholen.OpenAI eröffnet ein Vertriebsbüro in München. Duolingo wird „AI-first“.JetBrains veröffentlicht mit Koog eine Infrastruktur, um KI-Agenten im Kotlin-Umfeld zu bauen.Apples Kurzbefehle-App („Shortcuts“) soll künftig KI-Unterstützung erhalten.Steve Jobs wusste schon vor 40 Jahren, was er von Technologie will – mit historicmentor.com wird das bald erlebbar.Link zum Stargate-Video.Fabis erwähnte Prompt-Analyse von Cursor.Schreibt uns! Schickt uns eure Themenwünsche und euer Feedback: podcast@programmier.barFolgt uns! Bleibt auf dem Laufenden über zukünftige Folgen und virtuelle Meetups und beteiligt euch an Community-Diskussionen. BlueskyInstagramLinkedInMeetupYouTube

SaaS Talkâ„¢ with the Metrics Brothers - Strategies, Insights, & Metrics for B2B SaaS Executive Leaders

Dave "CAC" Kellogg and Ray "Growth" Rike break down the recent Benchmarkit B2B Marketing Budget and Productivity Benchmarks Report. Key trends and insights into how the Marketing budgets are established, consumed and reported upon including:Marketing budget as a percentage of revenue including YoY changes and segmented by company sizeGrowth Rates compared to Marketing budget allocation - the chicken or the egg discussionPeople vs Program vs Technology budget allocation - and which company profile attributes impact the resultsDemand Generation - how much of the Marketing budget is consumed by demand gen - it depends...Product-Led Growth vs Sales-Led Growth - how the GTM motion impacts budget allocationTop 3 Marketing Performance Metrics use and the Top 3 Marketing efficiency metrics usedIf you are evaluating how your Marketing budget companies to similar "like" companies or how other companies are measuring the efficiency and/or Marketing ROI - this episode has something for you!See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

The Options Insider Radio Network
The European Market Brief 2: European Benchmarks

The Options Insider Radio Network

Play Episode Listen Later Jun 3, 2025 53:22


In this episode, host Mark Longo is joined by guests Russell Rhodes (Indiana University Kelley School of Business), Damien Zinck (Eurex), and Nick Cassano (Tradier) to delve into the dynamic world of European derivatives markets. The discussion encompasses various aspects such as the Euro STOXX 50 index, DAX futures, VSTOXX volatility index, and innovative trading strategies like one-by-two call spreads. The episode highlights the impact of market volatility driven by global trade wars and geopolitical factors on European markets. The experts also share insights into sector-specific futures and the growing interest in European derivatives among US investors.   01:22 Welcome to the European Market Brief 02:31 Episode Two Kickoff and Guest Introductions 07:26 Diving into the Euro Stocks 50 13:51 Exploring the DAX and Micro Contracts 17:42 Trading Strategies and Leverage in Futures 18:58 Understanding VSTOXX and Volatility 27:34 VSTOXX vs VIX: A Comparative Analysis 28:26 European Derivatives Market: Current Trends 29:13 Impact of Trade Wars and Geopolitics 36:20 Sector Futures and Trading Opportunities 42:33 Volatility Trading Strategies 46:07 Listener Feedback and Closing Remarks  

Lunchtime With Roggin And Rodney
5/30 H2: Bill Plaschke; MLB benchmarks have changed; Players getting death threats

Lunchtime With Roggin And Rodney

Play Episode Listen Later May 30, 2025 42:45 Transcription Available


Bill Plaschke hops on to talk about the end of Around the Horn, the current state of the Dodgers and the Lakers' offseason. Batting average for position players and wins for starting pitchers arent as important as they used to be. Is the legalization of sports gambling inadvertently the reason why so many players are recieving threats on social media?

AI + a16z
Beyond Leaderboards: LMArena's Mission to Make AI Reliable

AI + a16z

Play Episode Listen Later May 30, 2025 101:43


LMArena cofounders Anastasios N. Angelopoulos, Wei-Lin Chiang, and Ion Stoica sit down with a16z general partner Anjney Midha to talk about the future of AI evaluation. As benchmarks struggle to keep up with the pace of real-world deployment, LMArena is reframing the problem: what if the best way to test AI models is to put them in front of millions of users and let them vote? The team discusses how Arena evolved from a research side project into a key part of the AI stack, why fresh and subjective data is crucial for reliability, and what it means to build a CI/CD pipeline for large models.They also explore:Why expert-only benchmarks are no longer enough.How user preferences reveal model capabilities — and their limits.What it takes to build personalized leaderboards and evaluation SDKs.Why real-time testing is foundational for mission-critical AI.Follow everyone on X:Anastasios N. AngelopoulosWei-Lin ChiangIon StoicaAnjney MidhaTimestamps0:04 -  LLM evaluation: From consumer chatbots to mission-critical systems6:04 -  Style and substance: Crowdsourcing expertise18:51 -  Building immunity to overfitting and gaming the system29:49 -  The roots of LMArena41:29 -   Proving the value of academic AI research48:28 -  Scaling LMArena and starting a company59:59 -  Benchmarks, evaluations, and the value of ranking LLMs1:12:13 -  The challenges of measuring AI reliability1:17:57 -  Expanding beyond binary rankings as models evolve1:28:07 -  A leaderboard for each prompt1:31:28 -  The LMArena roadmap1:34:29 -  The importance of open source and openness1:43:10 -  Adapting to agents (and other AI evolutions) Check out everything a16z is doing with artificial intelligence here, including articles, projects, and more podcasts.

IBKR Podcasts
Building Benchmarks

IBKR Podcasts

Play Episode Listen Later May 28, 2025 16:29


S&P500, Dow Jones Industrial Average, and the Nasdaq 100 are all household names to investors. However, many investors may not know what goes into constructing an index. Nasdaq's Director of Index Product Development, Rob Jankiewicz joins Jeff Praissman to discuss how indexes are constructed and the many parameters that are used.

In Your Shoes
ARC Prize 2024 Winners Jan Disselhoff & Daniel Franzen on AGI, ARC Challenge, LLMs and AI in Europe

In Your Shoes

Play Episode Listen Later May 25, 2025 57:08


In this conversation, Vivek Juneja engages with Daniel and Jan, two researchers deeply involved in the ARC Challenge and the field of deep learning. They discuss their backgrounds, motivations for participating in the ARC Challenge, and the implications of the challenge for future AI research. The conversation covers the limitations of current language models, the importance of energy efficiency, and the role of benchmarks in AI. They also explore the startup culture in Europe, the challenges posed by regulations, and the exciting breakthroughs in AI research, particularly in reasoning and program synthesis. The discussion concludes with advice for newcomers in the field and reflections on staying updated in a rapidly evolving landscape.Takeaways1. Daniel started coding at eight and switched to computer science and Jan began with mathematics before moving to computer science.2. The ARC Challenge represents cognitive puzzles for AI. LLMs showed promising results in the ARC Challenge.3. Energy efficiency is a significant concern for AI models.4. Consumer interaction with LLMs often leads to over-reliance.5. Benchmarks in AI may lose their relevance over time.6. Startup culture in Europe lacks the drive seen in Silicon Valley.7. Regulations can hinder innovation in AI development.8. Exciting breakthroughs in reasoning models are on the horizon9. Starting with AI is not that hard, and is highly approachable

Effective Altruism Forum Podcast
“Why I am Still Skeptical about AGI by 2030” by James Fodor

Effective Altruism Forum Podcast

Play Episode Listen Later May 23, 2025 12:30


Introduction I have been writing posts critical of mainstream EA narratives about AI capabilities and timelines for many years now. Compared to the situation when I wrote my posts in 2018 or 2020, LLMs now dominate the discussion, and timelines have also shrunk enormously. The ‘mainstream view' within EA now appears to be that human-level AI will be arriving by 2030, even as early as 2027. This view has been articulated by 80,000 Hours, on the forum (though see this excellent piece excellent piece arguing against short timelines), and in the highly engaging science fiction scenario of AI 2027. While my article piece is directed generally against all such short-horizon views, I will focus on responding to relevant portions of the article ‘Preparing for the Intelligence Explosion' by Will MacAskill and Fin Moorhouse. Rates of Growth The authors summarise their argument as follows: Currently, total global research effort [...] ---Outline:(00:11) Introduction(01:05) Rates of Growth(04:55) The Limitations of Benchmarks(09:26) Real-World Adoption(11:31) Conclusion--- First published: May 2nd, 2025 Source: https://forum.effectivealtruism.org/posts/meNrhbgM3NwqAufwj/why-i-am-still-skeptical-about-agi-by-2030 --- Narrated by TYPE III AUDIO.

Bloomberg Daybreak: Asia Edition
APAC Markets Open Higher as Geopolitical Risks Loom

Bloomberg Daybreak: Asia Edition

Play Episode Listen Later May 21, 2025 19:19 Transcription Available


Benchmarks in Australia, South Korea and Japan all climbed in early trading Wednesday, pushing a gauge of Asian shares 0.4% higher. Geopolitical tensions may add headwinds to the markets, which had calmed recently after a month of turmoil from the tariff blitz unleashed by US President Donald Trump. We look at how the tariff story is resonating through the Asia-Pacific with Helen Zhu, Managing Partner and Chief Investment Officer at NF Trinity. Plus - oil rose after CNN reported that US intelligence had suggested Israel is making preparations for a possible strike on Iranian nuclear facilities. Stocks in Asia advanced on Wednesday. West Texas Intermediate gained 1.5% to $62.96 a barrel. It's not clear that Israeli leaders had made a final decision to carry out the strikes, CNN said, citing unnamed officials. Contracts for the S&P 500 and the Nasdaq 100 were down 0.1%, paring most of their losses earlier in the day. We get reaction to the day's macro headlines from Brian Vendig, President and Chief Investment Officer at MJP Wealth Advisors.See omnystudio.com/listener for privacy information.

The AI Breakdown: Daily Artificial Intelligence News and Discussions

With Microsoft Build, Google I/O, and Code with Claude all happening this week, the battle between major AI labs is heating up. This episode breaks down how to think about the AI race across five key vectors: Consumer, Enterprise, Coding, Agents, and Benchmarks. Get Ad Free AI Daily Brief: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://patreon.com/AIDailyBrief⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Brought to you by:KPMG – Go to ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://kpmg.com/ai⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ to learn more about how KPMG can help you drive value with our AI solutions.Blitzy.com - Go to ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://blitzy.com/⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ to build enterprise software in days, not months Vertice Labs - Check out ⁠⁠⁠⁠⁠⁠⁠⁠http://verticelabs.io/⁠⁠⁠⁠⁠⁠⁠⁠ - the AI-native digital consulting firm specializing in product development and AI agents for small to medium-sized businesses.The Agent Readiness Audit from Superintelligent - Go to ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://besuper.ai/ ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠to request your company's agent readiness score.The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614Subscribe to the newsletter: https://aidailybrief.beehiiv.com/Join our Discord: https://bit.ly/aibreakdownInterested in sponsoring the show? nlw@breakdown.network

digital kompakt | Business & Digitalisierung von Startup bis Corporate
Tobit: Die krasseste Super-App, von der du je gehört hast

digital kompakt | Business & Digitalisierung von Startup bis Corporate

Play Episode Listen Later May 20, 2025 57:41


Tauche ein in die faszinierende Welt der digitalen Transformation mit Joel Kaczmarek und Dominik Dommick, die im Gespräch mit Tobias Groten, dem visionären Gründer von Tobit Software, die Zukunft der Städte erkunden. Tobias hat in Ahaus ein digitales Wunderland geschaffen, das als lebendiges Labor für vernetzte Städte dient. Von Super-Apps bis zu digitalisierten Stadtstrukturen – erfahre, wie Tobias ganze Städte revolutioniert und welche Rolle innovative Technologien dabei spielen. Lass dich inspirieren von einem Gespräch, das zeigt, wie Digitalisierung das Leben in Städten neu definiert. Du erfährst... …wie Tobias Groten mit Tobit ganze Städte digitalisiert und vernetzt …welche innovativen Konzepte hinter der Super-App von Ahaus stecken …warum Digitalisierung in Kleinstädten entscheidend für deren Zukunft ist …wie das Punktesystem von Ahaus die lokale Gig-Economy revolutioniert …warum Tobias Groten trotz Erfolg keine weiteren Städte digitalisiert __________________________ ||||| PERSONEN |||||

Ransquawk Rundown, Daily Podcast
Europe Market Open: EU & US futures flat with catalysts sparse; fixed benchmarks extend onto gains and DXY lower after data

Ransquawk Rundown, Daily Podcast

Play Episode Listen Later May 16, 2025 2:16


Mixed APAC trade, US futures range bound while European futures point to a marginally firmer open.DXY remains lower after Thursday's data, EUR/USD marginally reclaimed 1.12, USD/JPY found support at 145.00.Fixed benchmarks extended/held on to recent gains.Crude benchmarks remain underpinned by the latest on US-Iran, metals marginally softer.Looking ahead, highlights include US Export/Import Prices, UoM Sentiment Survey, BoC SLOS, Speakers including ECB's Lane, Cipollone & Fed's Barkin.Click for the Newsquawk Week Ahead.Read the full report covering Equities, Forex, Fixed Income, Commodites and more on Newsquawk

Market Proof Marketing: New Home Builder Marketing Insights
Ep 387: Books, Buzzwords & Benchmarks

Market Proof Marketing: New Home Builder Marketing Insights

Play Episode Listen Later May 15, 2025 41:23


Market Proof Marketing · Ep 387: Books Buzzwords & Benchmarks In this episode of Market Proof Marketing, Jackie Lipinski and Beth Russell reflect on the power of clear communication, clean marketing systems, and community-first thinking. From school movie nights and panic cleaning to organizing ad accounts and analyzing the Q1 Online Sales Benchmarks, this episode covers the serious and the silly. Plus, a detailed recap of the most recent Market Proof Marketing Academy, thoughts on Google's AI ad integrations, and a reminder that your brand lives well beyond your website.Key TopicsCommunication fails & panic cleaningHighlights from the Market Proof Marketing AcademyQ1 2025 Online Sales BenchmarksGoogle tests ads in AI chatbot conversationsCommunity planning and lifestyle-driven decision-makingShifting focus from incentives to valueAll Access reminders and listener appreciationSPECIAL ANNOUNCEMENTSOnline Sales & Marketing Summit is Filling Up - Grab Your Tickets TodayStory Time [1:05]Beth shares a personal story about missing a school event due to unclear and inconsistent communication, something builder marketers should heed when planning outreach for events.Jackie reflects on "panic cleaning" before company arrives and ties it to the need for marketers to keep their ad accounts clean and organized, even if no one's watching (yet).The team gives a shout-out to Kendra and listeners who asked for Story Time to stay; your wishes have been granted!In The News [21:04]Q1 2025 Online Sales BenchmarksJackie and Beth review Amanda Martin's new online sales benchmark report summarizing over 50,000 leads from 33 markets. Despite Q1 being particularly tough for many builders, the consistency and performance of online sales teams remain impressive. They also discuss how these benchmarks can serve as both inspiration and a performance check.Google is Testing Ads in Third-party AI Chatbot ConversationsGoogle's experiment with placing ads inside chatbot conversations sparks discussion on what this means for marketers. Jackie and Beth point out the infancy of this format, the very low AI-driven site traffic to builders today, and why the content on your own website - and how it gets syndicated elsewhere - remains your biggest asset for visibility.Things We Love, Things We Hate [33:40]Beth's Love: Taking a walk through her community trail reminded her how thoughtfully designed neighborhoods deliver lasting emotional value to residents, something builders should showcase more intentionally.Jackie's Love: Celebrating 10 years since relocating to Washington state, Jackie reflects on the importance of loving where you live and how lifestyle decisions drive buying behavior.Want to rewatch part of the Academy or catch exclusive behind-the-scenes content?Join DYC's All Access community — free for builders, online sales specialists, and home building leaders.Have feedback or want to fight for more Story Time? Let us know: your messages bring us joy (and influences podcast decisions!).Like and subscribe on your favorite platform! The post Ep 387: Books, Buzzwords & Benchmarks appeared first on Online Sales and Marketing for Home Builders - DYC.

Big Technology Podcast
AI's Drawbacks: Environmental Damage, Bad Benchmarks, Outsourcing Thinking — With Emily M. Bender and Alex Hanna

Big Technology Podcast

Play Episode Listen Later May 14, 2025 61:55


Emily  Bender is a computational linguistics professor at the University of Washington. Alex Hanna is the Director of Research at the Distributed AI Research Institute. Bender and Hanna join Big Technology to discuss what their new book, “The AI‑Con," which they describe as the layered ways today's language‑model boom obscures environmental costs, labor harms, and shaky science. Tune in to hear a lively back‑and‑forth on whether chatbots are useful tools or polished parlor tricks. We also cover benchmark gaming, data‑center water use, doomerism, and more. Hit play for a candid debate that will leave you smarter about where generative AI really stands — and what comes next. --- Enjoying Big Technology Podcast? Please rate us five stars ⭐⭐⭐⭐⭐ in your podcast app of choice. Want a discount for Big Technology on Substack? Here's 25% off for the first year, which includes membership to our subscriber Discord: https://www.bigtechnology.com/subscribe?coupon=0843016b Questions? Feedback? Write to: bigtechnologypodcast@gmail.com

Keepin' The Lights On
Electrical Contractor Business Benchmarks w Josh Bone, ELECTRI International

Keepin' The Lights On

Play Episode Listen Later May 14, 2025 42:48


(00:00:00) Electrical Contractor Business Benchmarks w Josh Bone, ELECTRI International (00:05:28) Project Basics (00:15:31) Evolution of an EC Business (00:23:17) Project Progress (00:32:16) Who is This For? (00:33:19) What to Measure in Business? (00:38:22) Passion for the Industry Business financial and operation benchmarks are invaluable as you look to improve your business numbers and service levels. However, if you're an electrical contractor or data/comm installer, how do you find numbers that relate to your industry?I spoke with Josh Bone, Executive Director at ELECTRI International, about a new project to create a repository of benchmarking data that will be available to contractors for free. We discuss the design of the project and how the data will help contractors understand their businesses for better management and financial success.ELECTRI International is a non-profit research organization focused on the electrical and comm/data side of construction.Thank you for listening and please take a moment to subscribe, rate, and review our show on your favorite app.To get a hold of us here at Keepin' The Lights On, please email: podcast@graybar.comWatch on YouTube: https://youtu.be/JkEiv3FKFdwTo reach Josh Bone on LinkedIn:  https://www.linkedin.com/in/joshbone/Learn more about Electri International: https://www.electri.org/Find the free Electri International research reports: https://www.electri.org/research-overview/research/Get engaged in the latest research: https://www.electri.org/research-overview/electri-research-engagement-hub/Drago's Seafood New Orleans: https://www.dragosrestaurant.com/

two & a half gamers

Want real ROAS benchmarks from 300+ games and $200 M+ in spend?This episode of 2.5 Gamers goes deep with ZJ from PVX Partners, who brings an insane dataset from UA financing across puzzle, casual, strategy, and 4X games.We cover what's really happening in 2025:

Online People Talking with Jen Barkan
#37: Real Talk on Benchmarks and Market Trends with Our Newest Sales Coaches

Online People Talking with Jen Barkan

Play Episode Listen Later May 13, 2025 38:50


#37: Real Talk on Benchmarks and Market Trends with Our Newest Sales CoachesIn this episode, Jen, Amanda, and new team members Melissa Fort and Molly Adams, bring energy, insights, and just the right amount of pet hair talk to the mic. From behind-the-scenes snafus to deep dives on sales benchmarks and market shifts, this episode is packed with relatable moments and practical takeaways for online sales pros.You'll hear about:DYC's Newest Coaches: Meet Melissa and MollyMelissa and Molly reflect on joining the online sales coaching team and share their impressions of the Online Sales & Marketing Summit in Chicago—whether attending as a first-timer or a seasoned leader.Online Sales Benchmarks RevealedAmanda breaks down six years of performance data, uncovering:A slight uptick in lead-to-appointment conversionsA dip in appointment-to-sale conversionsStable online sales contribution ratesNavigating a Hesitant MarketMarket uncertainty and buyer hesitancy are real, but the team has winning strategies to help you succeedTravel, Transitions & Backup PlansSharpening Sales Strategy & BehaviorDon't miss this conversation that blends data-driven strategy with real-life stories from the front lines of online sales. Whether you're a seasoned OSC or new to the role, there's something here for you.Thanks for tuning in!Questions? Email: show@doyouconvert.comMore at doyouconvert.com

Ground Truths
Tyler Cowen: The Prototypic Polymath

Ground Truths

Play Episode Listen Later May 10, 2025 32:18


Audio file, also on Apple and SpotifyTyler Cowen, Ph.D, is the Holbert L. Harris Professor of Economics at George Mason University. He is the author of 17 books, most recently Talent.: How to Identify Energizers, Creatives, and Winners Around the World. Tyler has been recognized as one of the most influential economists of the past decade. He initiated and directs the philanthropic project Emergent Ventures, writes a blog Marginal Revolution, and a podcast Conversations With Tyler, and also writes columns for The Free Press." He is writing a new book (and perhaps his last) on Mentors. “Maybe AGI [Artificial General Intelligence] is like porn — I know it when I see it. And I've seen it.”—Tyler CowenOur conversation on acquiring information, A.I., A.G.I., the NIH, the assault on science, the role of doctors in the A.I. era,, the meaning of life, books of the future, and much more.Transcript with linksEric Topol (00:06):Well, hello. This is Eric Topol with Ground Truths, and I am really thrilled today to have the chance to have a conversation with Tyler Cowen, who is, when you look up polymath in the dictionary, you might see a picture of him. He is into everything. And recently in the Economist magazine 1843, John Phipps wrote a great piece profile, the man who wants to know everything. And actually, I think there's a lot to that.Tyler Cowen (00:36):That's why we need longevity work, right?Eric Topol (00:39):Right. So he's written a number of books. How many books now, Tyler?Tyler Cowen:17, I'm not sure.Eric Topol:Only 17? And he also has a blog that's been going on for over 20 years, Marginal Revolution that he does with Alex Tabarrok.Tyler Cowen (00:57):Correct.Eric Topol (00:57):And yeah, and then Conversations with Tyler, a podcast, which I think an awful lot of people are tuned into that. So with that, I'm just thrilled to get a chance to talk with you because I used to think I read a lot, but then I learned about you.“Cowen calls himself “hyperlexic”. On a good day, he claims to read four or fivebooks. Secretly, I timed him at 30 seconds per page reading a dense tract byMartin Luther. “—John Phipps, The Economist's 1843I've been reading more from the AIs lately and less from books. So I'll get one good book and ask the AI a lot of questions.Eric Topol (01:24):Yeah. Well, do you use NotebookLM for that?Tyler Cowen (01:28):No, just o3 from OpenAI at the moment, but a lot of the models are very good. Claude, there's others.Eric Topol (01:35):Yeah, yeah. No, I see how that's a whole different way to interrogate a book and it's great. And in fact, that gets me to a topic I was going to get to later, but I'll do it now. You're soon or you have already started writing for the Free Press with Barri Weiss.Tyler Cowen (01:54):That's right, yes. I have a piece coming out later today. It's been about two weeks. It's been great so far.“Tyler Cowen has a mind unlike any I've ever encountered. In a single conversation, it's not at all unusual for him to toggle between DeepSeek, GLP-1s, Haitian art, sacred Tibetan music, his favorite Thai spot in L.A., and LeBron James”—Bari WeissYeah, so that's interesting. I hadn't heard of it until I saw the announcement from Barri and I thought what was great about it is she introduced it. She said, “Tyler Cowen has a mind unlike any I've ever encountered. In a single conversation, it's not at all unusual for him to toggle between DeepSeek, GLP-1s, Haitian art, sacred Tibetan music, his favorite Thai spot in L.A., and LeBron James. Now who could do that, right. So I thought, well, you know what? I need independent confirmation of that, that is as being a polymath. And then I saw Patrick Collison, who I know at Stripe and Arc Institute, “you can have a specific and detailed discussion with him about 17th-century Irish economic thinkers, or trends in African music or the history of nominal GDP targeting. I don't know anyone who can engage in so many domains at the depth he does.” So you're an information acquirer and one of the books you wrote, I love the title Infovore.Tyler Cowen (03:09):The Age of the Infovore, that's right.Eric Topol (03:11):I mean, have people been using that term because you are emblematic of it?“You can have a specific and detailed discussion with him about 17th-century Irish economic thinkers, or trends in African music or the history of nominal GDP targeting. I don't know anyone who can engage in so many domains at the depth he does.”—Patrick CollisonIt was used on the internet at some obscure site, and I saw it and I fell in love with that word, and I thought I should try to popularize it, but it doesn't come from me, but I think I am the popularizer of it.Yeah, well, if anybody was ingesting more information and being able to work with it. That's what I didn't realize about you, Tyler, is restaurants and basketball and all these other fine arts, very impressive. Now, one of the topics I wanted to get into you is I guess related to a topic you've written about fair amount, which is the great stagnation, and right now we're seeing issues like an attack on science. And in the past, you've written about how you want to raise the social status of scientists. So how do you see this current, I would even characterize as a frontal assault on science?Tyler Cowen (04:16):Well, I'm very worried about current Trump administration policies. They change so frequently and so unpredictably, it's a little hard to even describe what they always are. So in that sense, it's a little hard to criticize them, but I think they're scaring away talent. They might scare away funding and especially the biomedical sciences, the fixed costs behind a lot of lab work, clinical trials, they're so high that if you scare money away, it does not come back very readily or very quickly. So I think the problem is biggest perhaps for a lot of the biomedical sciences. I do think a lot of reform there has been needed, and I hope somehow the Trump policies evolve to that sort of reform. So I think the NIH has become too high bound and far too conservative, and they take too long to give grants, and I don't like how the overhead system has been done. So there's plenty of room for improvement, but I don't see so far at least that the efforts have been constructive. They've been mostly destructive.Eric Topol (05:18):Yeah, I totally agree. Rather than creative destruction it's just destruction and it's unfortunate because it seems to be haphazard and reckless to me at least. We of course, like so many institutions rely on NIH funding for the work, but I agree that reform is fine as long as it's done in a very thought out, careful way, so we can eke out the most productivity for the best investment. Now along with that, you started Emergent Ventures where you're funding young talent.Tyler Cowen (05:57):That's right. That's a philanthropic fund. And we now have slightly over 1000 winners. They're not all young, I'd say they're mostly young and a great number of them want to go into the biomedical sciences or have done so. And this is part of what made me realize what an incredible influx of talent we're seeing into those areas. I'm not sure this is widely appreciated by the world. I'm sure you see it. I also see how much of that talent actually is coming from Canada, from Ontario in particular, and I've just become far more optimistic about computational biology and progress in biology and medical cures, fixes, whatever you want to call it, extending lives. 10 years ago, I was like, yeah, who knows? A lot of things looked pretty stuck. Then we had a number of years where life expectancy was falling, and now I think we're on the verge of a true golden age.Eric Topol (06:52):I couldn't agree with you more on that. And I know some of the people that you funded like Anne Wylie who developed a saliva test for Covid out of Yale. But as you say, there's so many great young and maybe not so young scientists all over, Canada being one great reservoir. And now of course I'm worried that we're seeing emigration rather than more immigration of this talent. Any thoughts about that?Tyler Cowen (07:21):Well, the good news is this, I'm in contact with young people almost every day, often from other countries. They still want to come to the United States. I would say I sign an O-1 letter for someone about once a week, and at least not yet has the magic been dissipated. So I'm less pessimistic than some people are, but I absolutely do see the dangers. We're just the biggest market, the freest place we have by far the most ambitious people. I think that's actually the most significant factor. And young people sense that, and they just want to come here and there's not really another place they can go that will fit them.Eric Topol (08:04):Yeah, I mean one of the things as you've probably noted is there's these new forces that are taking on big shouldering. In fact, Patrick Collison with Arc Institute and Chan Zuckerberg for their institute and others like that, where the work you're doing with Emergent Ventures, you're supporting important projects, talents, and if this whole freefall in NIH funding and other agency funding continues, it looks like we may have to rely more on that, especially if we're going to attract some talent from outside. I don't know how else we're going to make. You're absolutely right about how we are such a great destination and great collaborations and mentors and all that history, but I'm worried that it could be in kind of a threatened mode, if you will.Tyler Cowen (08:59):I hope AI lowers costs. As you probably know at Arc, they had Greg Brockman come in for some number of months and he's one of the people, well, he helped build up Stripe, but he also was highly significant in OpenAI behind the GPT-4 model. And to have Greg Brockman at your institute doing AI for what, six months, that's a massive acceleration that actually no university had the wisdom to do, and Arc did. So I think we're seeing just more entrepreneurial thinking in the area. There's still this problem of bottlenecks. So let's say AI is great for drug discovery as it may be. Well, clinical trials then become a bigger bottleneck. The FDA becomes a bigger bottleneck. So rapid improvement in only one area while great is actually not good enough.Eric Topol (09:46):Yeah, I'm glad you brought up that effect in Arc Institute because we both know Patrick Hsu, who's a brilliant young guy who works there and has published some incredible large language models applied to life science in recent months, and it is impressive how they used AI in almost a singular way as compared to as you said, many other leading institutions. So that is I think, a really important thing to emphasize.Tyler Cowen (10:18):Arc can move very quickly. I think that's not really appreciated. So if Patrick Hsu decides Silvana Konermann, Patrick Collison, if they decide something ought to be bought or purchased or set in motion, it can happen in less than a day. And it does happen basically immediately. And it's not only that it's quicker, I think when you have quicker decisions, they're better and it's infectious to the people you're working with. And there's an understanding that the core environment is not a bureaucratic one. So it has a kind of multiplier effect through the institution.Eric Topol (10:54):Yeah, I totally agree with you. It's always been a philosophy in your mind to get stuff done, get s**t done, whatever you want to call it. They're getting it done. And that's what's so impressive. And not just that they've got some new funds available, but rather they're executing in a way that's parallel to the way the world's evolving in the AI front, which is I think faster than most people would ever have expected, anticipated. Now that gets me to a post you had on Marginal Revolution just last week, which one of the things I love about Marginal Revolution is you don't have to read a whole lot of stuff. You just give the bullets, the juice, if you will. Here you wrote o3 and AGI, is April 16th AGI day? And everybody's talking about artificial general intelligence is here. It's going to be here five years, it's going to be seven years.Eric Topol (11:50):It certainly seems to be getting closer. And in this you wrote, “I think it is AGI, seriously. Try asking it lots of questions, and then ask yourself: just how much smarter was I expecting AGI to be? As I've argued in the past, AGI, however you define it, is not much of a social event per se. It still will take us a long time to use it properly. Benchmarks, benchmarks, blah blah blah. Maybe AGI is like porn — I know it when I see it. And I've seen it.” I thought that was really well done, Tyler. Anything you want to amplify on that?Tyler Cowen (12:29):Look, if I ask at economics questions and I'm trained as an economist, it beats me. So I don't care if other people don't call it AGI, but one of the original definitions of AGI was that it would beat most experts most of the time on most matters, say 90% or above, and we're there. So people keep on shifting the goalposts. They'll say, well, sometimes it hallucinates or it's not very good at playing tic-tac toe, or there's always another complaint. Those are not irrelevant, but I'll just say, sit down, have someone write at a test of 20 questions, you're a PhD, you take the test, let o3 take the test, then have someone grade, see how you've done, then form your opinion. That's my suggestion.Eric Topol (13:16):I think it's pretty practical. I mean, enough with the Turing test, I mean, we've had that Turing test for decades, and I think the way you described it is a little more practical and meaningful these days. But its capabilities to me at least, are still beyond belief eke out of current, not just the large language models, but large reasoning models. And so, it's just gotten to a point where and it's accelerating, every week there's so many other, the competition is good for taking it to the next level.Tyler Cowen (13:50):It can do tasks and it self improves. So o3-pro will be out in a few weeks. It may be out by the time you're hearing this. I think that's obviously going to be better than just pure o3. And then GPT-5 people have said it will be this summer. So every few months there are major advances and there's no sign of those stopping.Eric Topol (14:12):Absolutely. Now, of course, you've been likened to “Treat Tyler like a really good GPT” that is because you're this information meister. What do you ask the man who you can ask anything? That's kind of what we have when we can go to any one of these sites and start our prompts, whatever. So it's kind of funny in some ways you might've annotated this with your quest for knowledge.Tyler Cowen (14:44):Well, I feel I understand the thing better than most people do for that reason, but it's not entirely encouraging to me personally, selfishly to be described that way, whether or not it's accurate. It just means I have a lot more new competition.Eric Topol (14:59):Well, I love this one. “I'm not very interested in the meaning of life, but I'm very interested in collecting information on what other people think is the meaning of life. And it's not entirely a joke” and that's also what you wrote about in the Free Press thing, that most of the things that are going to be written are going to be better AI in the media and that we should be writing books for the AI that's going to ingest them. How do you see this human AI interface growing or moving?Tyler Cowen (15:30):The AI is your smartest reader. It's your most sympathetic reader. It will remember what you tell it. So I think humans should sit down and ask, what does the AI need to know? And also, what is it that I know that's not on the historical record anywhere? That's not just repetition if I put it down, say on the internet. So there's no point in writing repetitions anymore because the AI already knows those things. So the value of what you'd call broadly, memoir, biography, anecdote, you could say secrets. It's now much higher. And the value of repeating basic truths, which by the way, I love as an economist, to be clear, like free trade, tariffs are usually bad, those are basic truths. But just repeating that people will be going to the AI and saying it again won't make the AI any better. So everything you write or podcast, you should have this point in mind.Eric Topol (16:26):So you obviously have all throughout your life in reading lots of books. Will your practice still be to do the primary reading of the book, or will you then go to o3 or whatever or the other way around?Tyler Cowen (16:42):I've become fussier about my reading. So I'll pick up a book and start and then start asking o3 or other models questions about the book. So it's like I get a customized version of the book I want, but I'm also reading somewhat more fiction. Now, AI might in time become very good at fiction, but we're not there now. So fiction is more special. It's becoming more human, and I should read more of it, and I'm doing that.Eric Topol (17:10):Yeah, no, that's great. Now, over the weekend, there was a lot of hubbub about Bill Gates saying that we won't need doctors in the next 10 years because of AI. What are your thoughts about that?Tyler Cowen (17:22):Well, that's wrong as stated, but he may have put it in a more complex way. He's a very smart guy of course. AI already does better diagnosis on humans than medical doctors. Not by a lot, but by somewhat. And that's free and that's great, but if you need brain surgery for some while, you still need the human doctor. So human doctors will need to adjust. And if someone imagines that at some point robots do the brain surgery better, well fine. But I'm not convinced that's within the next 10 years. That would surprise me.Eric Topol (17:55):So to that point, recently, a colleague of mine wrote an op-ed in the New York Times about six studies comparing AI alone versus doctors with AI. And in all six studies, the AI did better than the doctors who had access to AI. Now, you could interpret that as, well they don't know how to use AI. They have automation bias or that is true. What do you think?Tyler Cowen (18:27):It's probably true, but I would add as an interpretation, the value of meta rationality has gone up. So to date, we have not selected doctors for their ability to work with AI, obviously, but some doctors have the personal quality, it's quite distinct from intelligence, but if just knowing when they should defer to someone or something else, and those doctors and researchers will become much more valuable. They're sufficiently modest to defer to the AI and have some judgment as to when they should do that. That's now a super important quality. Over time, I hope our doctors have much more of that. They are selected on that basis, and then that result won't be true anymore.Eric Topol (19:07):So obviously you would qualify. There's a spectrum here. The AI enthusiasts, you and I are both in that group, and then there's the doomsayers and there's somewhere middle ground, of course, where people are trying to see the right balance. Are there concerns about AI, I mean anything about that, how it's moving forward that you're worried about?Tyler Cowen (19:39):Well, any change that big one should have very real concerns. Maybe our biggest concern is that we're not sure what our biggest concern should be. One simple effect that I see coming soon is it will devalue the status of a lot of our intellectuals and what's called our chattering class. A lot of its people like us, we won't seem so impressive anymore. Now, that's not the end of the world for everyone as a whole, but if you ask, what does it mean for society to have the status of its elites so punctured? At a time when we have some, I would say very negative forces attacking those elites in other ways, that to me is very concerning.Eric Topol (20:25):Do you think that although we've seen what's happening with the current administration with respect to the tariffs, and we've already talked about the effects on science funding, do you see this as a short-term hit that will eventually prevail? Do you see them selectively supporting AI efforts and finding the right balance with the tech companies to support them and the competition that exists globally with China and whatnot? How are we going to get forward and what some people consider pretty dark times, which is of course, so seemingly at odds with the most extraordinary times of human support with AI?Tyler Cowen (21:16):Well, the Trump people are very pro AI. I think that's one of the good things about the administration, much pro AI and more interested than were the Biden people. The Biden people, you could say they were interested, but they feared it would destroy the whole world, and they wanted to choke and throttle it in a variety of ways. So I think there's a great number of issues where the Trump people have gone very badly wrong, but at least so far AI's not one of them. I'd give them there like an A or A+ so far. We'll see, right?Eric Topol (21:44):Yeah. As you've seen, we still have some of these companies in some kind of a hot seat like Meta and Google regarding their monopolies, and we saw how some of the tech leaders, not all of them, became very supportive, potentially you could interpret that for their own interests. They wanted to give money to the inauguration and also get favor curry some political favor. But I haven't yet seen the commitment to support AI, talk about a golden age for the United States because so much of this is really centered here and some of the great minds that are helping to drive the AI and these models. But I wonder if there's more that can be done so that we continue to lead in this space.Tyler Cowen (22:45):There's a number of issues here. The first is Trump administration policy toward the FTC, I think has not been wonderful. They appointed someone who seems like would be more appropriate for a democratic or more left-leaning administration. But if you look at the people in the Office of Science and Technology Policy in the White House, they're excellent, and there's always different forces in any administration. But again, so far so good. I don't think they should continue the antitrust suit against Google that is looking like it's going against Google, but that's not really the Trump administration, that's the judiciary, and that's been underway for quite some while. So with Trump, it's always very hard to predict. The lack of predictability, I would say, is itself a big problem. But again, if you're looking for one area where it's good, that would be my pick.Eric Topol (23:35):Yeah, well, I would agree with that for sure. I just want to see more evidence that we capitalize on the opportunities here and don't let down. I mean, do you think outlawing selling the Nvidia chips to China is the way to do this? It seems like that hurts Nvidia and isn't China going to get whatever they want anyway?Tyler Cowen (24:02):That restriction, I favored when it was put in. I'm now of the view that it has not proved useful. And if you look at how many of those chips get sold, say to Malaysia, which is not a top AI performer, one strongly suspects, they end up going to China. China is incentivized to develop its own high-quality chips and be fully independent of Western supply lines. So I think it's not worked out well.Eric Topol (24:29):Yeah, no, I see that since you've written so much about this, it's good to get your views because I share those views and you know a lot more about this than I would, but it seems like whether it's Malaysia or other channels, they're going to get the Blackwell chips that they want. And it seems like this is almost like during Covid, how you would close down foreign travel. It's like it doesn't really work that well. There's a big world out there, right?Tyler Cowen (25:01):It's an interesting question. What kind of timing do you want for when both America and China get super powerful AI? And I don't think you actually want only America to have it. It's a bit like nuclear weapons, but you don't want China to have it first. So you want some kind of staggered sequence where we're always a bit ahead of them, but they also maybe are constraining us a bit. I hope we're on track to get that, but I really, really don't want China to have it first.Eric Topol (25:31):Yeah, I mean I think there's, as you're pointing out aptly is a healthy managed competition and that if we can keep that lead there, it is good for both and it's good for the world ideally. But getting back, is there anything you're worried about in AI? I mean because I know you're upbeat about its net effective, and we've already talked about amazing potential for efficiency, productivity. It basically upends a lot of economic models of the past, right?Tyler Cowen (26:04):Yes. I think it changes or will change so many parts of life. Again, it's a bit difficult to specify worries, but how we think of ourselves as humans, how we think of our gods, our religions, I feel all that will be different. If you imagine trying to predict the effects of the printing press after Gutenberg, that would've been nearly impossible to do. I think we're all very glad we got the printing press, but you would not say all of it went well. It's not that you would blame the printing press for those subsequent wars, but it was disruptive to the earlier political equilibrium. I think we need to take great care to do it better this time. AI in different forms will be weaponized. There's great potential for destruction there and evil people will use it. So of course, we need to be very much concerned.Eric Topol (26:54):And there's obviously many of these companies have ways to try to have efforts to anticipate that. That is alignments and various safety type parallel efforts like Ilya did when he moved out of OpenAI and others. Is that an important part of each of these big efforts, whether it's OpenAI, Google, or the rest of them anthropic that they put in resources to keep things from going off the tracks?Tyler Cowen (27:34):That's good and it's important, but I think it's also of limited value because the more we learn how to control AI systems directly, the bad guys will have similar lessons, and they will use alignment possibly to make their AIs bad and worse and that it obeys them. So yeah, I'd rather the good guys make progress on what they're trying to do, but don't think it's going to solve the problem. It creates new problems as well.Eric Topol (28:04):So because of AI, do you think you'll write any more books in the future?Tyler Cowen (28:11):I'm writing a book right now. I suspect it will be my last. That book, its title is Mentors. It's about how to mentor individuals and what do the social sciences know about mentoring. My view is that even if the AI could write the book better than I can, that people actually want to read a book like that from a human. I could be wrong, but I think we should in the future, restrict ourselves to books that are better by a human. I will write every day for the rest of my life, but I'm not sure that books make sense at the current moment.Eric Topol (28:41):Yeah, that's a really important point, and I understand that completely. Now, when you write for the Free Press, which will be besides the Conversations with Tyler podcast and the Marginal Revolution, what kind of things will you be writing about in the Free Press?Tyler Cowen (28:56):Well, I just submitted a piece. It's a defense of elitism. So the problem with our elites is that they have not been elitist enough and have not adhered strictly enough to the scientific method. So it's a very simple point. I think to you it would be pretty obvious, but it needs to be said. It's not out there enough in the debate that yes, sometimes the elites have truly and badly let us down, but the answer is not to reject elitism per se, but to impose higher elitist standards on our sometimes supposed elites. So that's the piece I just sent in. It's coming out soon and should be out by the time anyone hears this.Eric Topol (29:33):Well, I look forward to reading that. So besides a polymath, you might be my favorite polymath, Tyler you didn't know that. Also, you're a futurist because when you have that much information ingested, and now of course with a super performance of AI to help, it really does help to try to predict where we're headed. Have I missed anything in this short conversation that you think we should touch on?Tyler Cowen (30:07):Well, I'll touch on a great interest of yours. I like your new book very much. I think over the course of the next 40 years working with AI, we will beat back essentially every malady that kills people. It doesn't mean you live forever. Many, many more people will simply die of what we now call old age. There's different theories as to what that means. I don't have a lot of expertise in that, but the actual things people are dying from will be greatly postponed. And if you have a kid today to think that kid might expect to live to be 97 or even older, that to me is extremely plausible.Tyler Cowen (30:45):I won't be around to see it, but that's a phenomenal development for human beings.Eric Topol (30:50):I share that with you. I'm sad that I won't be around to see it, but exactly as you've outlined, the fact that we're going to be able to have a huge impact on particularly the age-related diseases, but also as you touched on the genetic diseases with genome editing and many other, I think, abilities that we have now controlling the immune system, I mean a central part of how we get into trouble with diseases. So I couldn't agree with you more, and that's a really good note to finish on because so many of the things that we have discussed today, we share similar views and we come at it from totally different worlds. The economist that has a very wide-angle lens, and I guess you'd say the physician who has a more narrow lens aperture. But thank you so much, Tyler for joining me today.Tyler Cowen (31:48):My pleasure. Let me close by telling you some good news. I have AI friends who think you and I, I'm 63 will be around to see that, I don't agree with them they don't convince me, but there are smart people who think the benefits from this will come quite soon.Eric Topol (32:03):I sure hope they're right.Tyler Cowen (32:05):Yes.*******************************************SUPER AGERS, my new book, was released on May 6th. It's about extending our healthspan, and I introduce 2 of my patients (one below, Mrs. L.R.) as exemplars to learn from. This potential to prevent the 3 major age-related diseases would not be possible without the jumps in the science of aging and multimodal A.I. My op-ed preview of the book was published in The NY Times last week. Here's a gift link. I did a podcast with Mel Robbins on the book here. Here's my publisher ‘s (Simon and Schuster) site for the book. If you're interested in the audio book, I am the reader (first time I have done this, quite an experience!)The book was reviewed in WSJ. Here's a gift linkThere have been many pieces written about it. Here's a gift link to the one in the Wall Street Journal and here for the one in the New York Times .**********************Thanks for reading and subscribing to Ground Truths.If you found this interesting please share it!That makes the work involved in putting these together especially worthwhile.All content on Ground Truths— newsletters, analyses, and podcasts—is free, open-access.Paid subscriptions are voluntary and all proceeds from them go to support Scripps Research. They do allow for posting comments and questions, which I do my best to respond to. Please don't hesitate to post comments and give me feedback. Many thanks to those who have contributed—they have greatly helped fund our summer internship programs for the past two years. Get full access to Ground Truths at erictopol.substack.com/subscribe

Money Talk
View on Active Investment Management Versus Benchmarks

Money Talk

Play Episode Listen Later May 6, 2025 8:57


benchmarks active investment management
Refuse to Lose - a Seattle Mariners Podcast
M's Hit All Benchmarks Through 30 Games + The Definitive Problem with Julio + Joe Doyle of Overslot

Refuse to Lose - a Seattle Mariners Podcast

Play Episode Listen Later May 2, 2025 74:36


Brady is back on a Friday as the Mariners sit at 18-12 overall heading into Texas. Brady has very minimal, if any complaints about the first 30 games and discusses how the M's have hit every box you want checked in that time. Then, he wonders if the M's "Control the Zone" mantra has worked against them over the last few years and how they've modified it this year.And after a few weeks of trying to figure out the answer, we finally get the definitive answer on the issue with Julio Rodriguez from an ESPN personality. Then, Joe Doyle of Overslot stops by to discuss the M's and their position/opportunity in the draft.And, can we stop saying it's a "small sample size" for the M's offensive success of the end of last year and the beginning of this year?

#Hashtags, The Gartner Marketing & Communications Podcast
2025 Comms Budget & Structure Benchmarks

#Hashtags, The Gartner Marketing & Communications Podcast

Play Episode Listen Later Apr 30, 2025 44:59


In this episode, Gartner Analyst Iliyana Hadjistoyanova discusses key findings from the 2025 Gartner CCO Spend Survey. Communications leaders will get access to a unique dataset that will enable them to pressure-test their investment decision, defend their current budgets and give them the confidence to ask for more resourcing, if needed.Iliyana Hadjistoyanova supports communications leaders in managing their functions and teams. Her research and advisory work covers areas such as strategy building, operating models, governance, measurement, agency management and talent development. She is passionate about helping clients prioritize high-value work and connect it to business goals in clear, measurable ways. 

Your Healthy Self with Regan
Muscle, Heart, Brain: The Longevity Axis You Can't Ignore

Your Healthy Self with Regan

Play Episode Listen Later Apr 30, 2025 28:15


In this episode, Regan Archibald explores the vital connection between your muscle, heart, and brain—what he calls the muscle-heart-brain axis—and why this triad is central to aging well. You'll discover how muscle isn't just for vanity, but a powerful organ system that communicates with your brain, boosts cardiovascular health, and secretes myokines that influence everything from immune function to glucose regulation. Regan shares powerful insights on peptides like MOTS-c, 5-Amino-1MQ, BPC-157, and GLP-1s, and how they enhance muscle recovery and metabolic health. He also introduces the "Fitness 50 Benchmarks," a diagnostic framework to help you measure and improve your performance with age. If you're looking for actionable ways to preserve cognitive sharpness, heart strength, and muscular vitality into your 80s, 90s, and beyond—this is the episode for you. www.agelessfuture.com

Money Matters with Wes Moss
Bear Market Retirement Guide: Investment Strategies, Tax Moves, Vacation Decisions, Timing, And Understanding Portfolio Benchmarks

Money Matters with Wes Moss

Play Episode Listen Later Apr 24, 2025 38:18


Make informed decisions about your retirement plans, even in a bear market. Learn whether to cancel your vacation plans due to stock market concerns and discover how to manage your investments effectively. Explore strategies like layering for early retirement and understand the implications of drawing down your TSP defined contribution plan. Examine rebalancing your taxable brokerage account, including ETFs and mutual funds, and learn the rules for rebalancing your Roth IRA. Fine-tune your Roth IRA allocation for long-term growth, considering factors like pension plans and risk tolerance. Understand investment benchmarks and how to accurately assess your portfolio's performance. Zooming out to view a longer time horizon typically provides more perspective than comparing everything to your own market high watermark. Compare maximizing your Social Security savings for short-term growth and with the benefits of optimizing based on your specific needs and stress points. Evaluate the role of home equity in your retirement planning and its impact on withdrawal strategies. Does equity count? The answer may be both yes and no. Strategize about moving funds from a 457B to a Roth IRA as you approach retirement, and carefully consider the tax implications. It's usually about taxes today vs. taxes tomorrow. Tune in to the Retire Sooner podcast with Wes Moss and Christa DiBiase for essential insights on these and other crucial retirement topics and send us your questions at https://www.yourwealth.com/contact/schedule-appointment/ Learn more about your ad choices. Visit megaphone.fm/adchoices

The Daily Crunch – Spoken Edition
Crowdsourced AI benchmarks have serious flaws, some experts say

The Daily Crunch – Spoken Edition

Play Episode Listen Later Apr 24, 2025 5:29


AI labs are increasingly relying on crowdsourced benchmarking platforms such as Chatbot Arena to probe the strengths and weaknesses of their latest models. But some experts say that there are serious problems with this approach from an ethical and academic perspective. Learn more about your ad choices. Visit podcastchoices.com/adchoices

Everyday AI Podcast – An AI and ChatGPT Podcast
EP 510: OpenAI's o3 Use Cases - How to use the world's new most powerful LLM at your company

Everyday AI Podcast – An AI and ChatGPT Podcast

Play Episode Listen Later Apr 23, 2025 74:11


Wait. So.... how do you actually use OpenAI's new o3 model? ↳ It's legit agentic. ↳ Can think on its own. ↳ Use multiple tools in sequence. This is not the normal blueprint for how a LLM works. Don't worry -- we got you.Newsletter: Sign up for our free daily newsletterMore on this Episode: Episode PageJoin the discussion: Thoughts on this? Join the convo.Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineupWebsite: YourEverydayAI.comEmail The Show: info@youreverydayai.comConnect with Jordan on LinkedInTopics Covered in This Episode:OpenAI's o3 Model Introductiono3's Agentic AI CapabilitiesOpenAI's o3 Model Tiers Explainedo3 Model Tools & Features Overviewo3 vs. GPT 4 Benchmarkingo3's Business Use Cases and Demoo3 Model Live Demos & FindingsLarge Language Model Utility in BusinessTimestamps:00:00 "Revolutionizing Business with AI"10:17 "o3 Outperforms Gemini in Benchmarks"15:29 Advertising Deck Overview18:02 "Manual Transcription Needed for Images"22:45 "Automated Search and Retrieval Process"31:59 Analyzing Media Trends with Consultants36:06 AI News Tracking Setup40:25 Podcast Downloads: First 7 Days Focus47:23 AI News & FreshFind Interface48:56 Rethinking Work with AI Influence57:22 Apple Sentiment Impact Analysis01:00:13 Restaurant Price Analysis Tech01:07:08 AI Head-to-Head and CSV UploadKeywords:o3, OpenAI, large language model, agentic AI, GPT-4.1, API, thinking models, transformer model, GPT-4o, chain of thought thinking, reasoning models, Gemini 2.5 Pro, Claude 3.7 SONNET, hybrid models, tool usage, visual input reasoning, image generation, canvas feature, agentic model, autonomous decision-making, third-party benchmarks, LiveBench, artificial analysis intelligence index, MMLU Pro, GPQA Diamond, Humanities Lax exam, LiveCodeBench, SciCode, AI Math 500, business use cases, PDF transcription, computer vision, sentiment analysis, data trends, Python code, interactive dashboard, sentiment tone, online mentions, PR recommendations, interactive canvas, multimodal content.Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info) Ready for ROI on GenAI? Go to youreverydayai.com/partner

Celebrate Kids Podcast with Dr. Kathy
Understanding Rites of Passage for Our Kids: Key Benchmarks in Your Child's Journey to Adulthood

Celebrate Kids Podcast with Dr. Kathy

Play Episode Listen Later Apr 23, 2025 16:17 Transcription Available


In this episode of the Celebrate Kids podcast, Dr. Kathy addresses a common concern among parents: how to determine when their children are mature enough to handle increased responsibilities and discussions. The episode explores various cultural and societal benchmarks, such as obtaining a driver's license, voting, and military registration, that signal a transition into adulthood. Dr. Kathy provides insights on how parents can initiate conversations that foster understanding and growth in their children, helping them to embrace their identities and responsibilities as they mature. Additionally, listeners are introduced to Summit Ministries, a valuable resource for shaping young minds and addressing significant life questions, with information on summer camps available for high school and college students. Tune in to gain practical advice on parenting and guiding your children through their developmental stages.

HealthcareNOW Radio - Insights and Discussion on Healthcare, Healthcare Information Technology and More
Revenue Cycle Optimized: How AI Can Improve Performance Leveraging RCM Benchmarks

HealthcareNOW Radio - Insights and Discussion on Healthcare, Healthcare Information Technology and More

Play Episode Listen Later Apr 22, 2025 24:20


How AI Can Improve Performance Leveraging RCM Benchmarks Stuart Newsome, VP of RCM Insights, breaks down the fundamentals of benchmarking in revenue cycle management, including key metrics like net days in A/R and clean claim rates. Learn how AI and automation tools can help healthcare organizations go beyond analytics to drive performance, identify gaps, and enable continuous improvement. Find all of our network podcasts on your favorite podcast platforms and be sure to subscribe and like us. Learn more at www.healthcarenowradio.com/listen

Money Matters With Wes Moss
Easter Eggflation, Market Benchmarks, Tariffs, And The Fed: Zoom Out On Portfolio Performance And Investor Sentiment

Money Matters With Wes Moss

Play Episode Listen Later Apr 22, 2025 34:01


Analyze how inflation may prevent nearly half of Americans from dying Easter eggs and why consumer sentiment may be out of alignment with the reality on the ground. Discover how the most popular benchmark for measuring market performance may not be the most efficient one. Zoom out to examine historical trends and gain perspective on where your portfolio stands despite the current tariff uncertainty. Gather the latest on Fed Chair Jerome Powell's interest rate outlook, as well as his ability to remain in the job despite pressure from the current administration. Wes Moss and Jeff Lloyd guide you through these essential topics and more.

The Nugget Climbing Podcast
EP 269: The King of the Board Lords on Projecting Tactics, Protein, & Climbing 4 Hours Every Day — Sean Houchins-McCallum (sean911)

The Nugget Climbing Podcast

Play Episode Listen Later Apr 21, 2025 128:07


Sean Houchins-McCallum is the top-ranked climber on the Tension Board 2. We talked about growing up in Iowa, his obsession with board climbing, his insane training schedule, his ultimate board project, why we haven't seen V15+ on the boards, projecting tactics, how changing his diet allowed him to jump up two grades, his home made pre-workout, TB2 vs. outdoor bouldering, dream lines, and much more.Rúngne:rungne.info/nuggetUse code “NUGGET” for 10% off storewide.Arc'teryx:Women's climbing clothingMen's climbing clothingCheck out the NEW Kragg Collection.Maui Nui Venison:mauinuivenison.com/NUGGETThe healthiest red meat on the planet. Wild harvested and responsibly sourced. Mad Rock:madrock.comUse code “NUGGET10” at checkout for 10% off your next order. Become a Patron:patreon.com/thenuggetclimbingWe are supported by these amazing BIG GIVERS:Michael Roy and Mark and Julie CalhounShow Notes:  thenuggetclimbing.com/episodes/sean-houchins-mccallumNuggets:(00:00:00) – Intro(00:01:16) – Unsynthesized(00:04:50) – Sending every classic in 4 months(00:07:02) – Limit projects every day(00:09:31) – Sean vs. Noah Wheeler(00:11:36) – Is boarding its own sport?(00:16:04) – Becoming a board climber(00:17:30) – Sean's ultimate board proj (You've Seen the Butcher at 65º)(00:19:10) – Board tactics(00:21:28) – Why aren't there V15s or harder on boards?(00:26:14) – Getting outside(00:27:45) – Sean's dimensions(00:29:24) – Sean's warms up(00:34:07) – A typical week(00:36:52) – Sean's projecting tactics(00:40:42) – 2 months on(00:46:52) – Does Sean train his strengths?(00:54:20) – Sean's weaknesses(00:57:19) – The Spray layout(01:01:41) – TB1 vs. TB2(01:02:49) – Mirroring climbs & the leaderboard(01:07:05) – Intimidating holds on the TB2(01:10:19) – Go-to shoes(01:11:36) – Working & setting(01:17:11) – Outdoor goals(01:18:55) – Jumping up 2 grades after changing his diet(01:26:52) – TB2 vs. outside(01:31:57) – New board companies(01:34:59) – Sport climbing feature(01:38:59) – Dream lines(01:40:17) – Rapid fire questions(02:00:53) – How Low Can You Go?

Real World Serverless with theburningmonk
#116: AI Agents, MCP and the problems with AI benchmarks | ft. Matt Carey

Real World Serverless with theburningmonk

Play Episode Listen Later Apr 19, 2025 48:08


In this episode, I spoke with Matt Carey, founding AI engineer at StackOne, founder of AI Demo Days and member of the OpenUK AI Advisory Board.Everyone needs a friend who works in AI to help them filter the AI news and get the signals from the noise. Matt is that friend for me!We discussed AI agents, MCP, and the challenges of AI benchmarks, which help explain the disconnect between the benchmark results and the anecdotal experiences of AI users, such as myself.Links from the episode:Google's whitepaper on AI agentsAnthropic Building Effective AI AgentsSimon Willison on XThorsten Ball's Joy & Curiosity newsletterAI Demo DaysMCP has a prompt injection problemOpening theme song:Cheery Monday by Kevin MacLeodLink: https://incompetech.filmmusic.io/song/3495-cheery-mondayLicense: http://creativecommons.org/licenses/by/4.0

Zero Knowledge
ZK Benchmarks with Conner Swann

Zero Knowledge

Play Episode Listen Later Apr 9, 2025 51:43


In this episode Anna explores the contentious topic of ZK benchmarking with Conner Swann, founder of ProofLab. Drawing parallels with AI's development, they explore how tools like ProofLab could help standardize benchmarking across dimensions like speed, cost, and security. The conversation touches on the importance of formal verification and real-world workload testing, while considering how better benchmarking could accelerate ZK adoption and potentially lead to ZK's own 'ChatGPT moment' Related links: zkSummit The Zero-Knowledge Community Survey NoirHack Noir ethProofs Designing Optimistic Interoperability with Nomad Trusted Setup Ceremonies Explored Anna's Post on Benchmarking The path to secure and efficient zkVMs: How to track progress Ben Fisch on Limiting Factors of Verifiable Compute ZKarnage Further reading: Celo Espero ceremony ZKVM Benchmarking The same thing that happened to AI is Happening to ZK Applications for zkSummit13 are open, spots are limited for this edition so apply soon (www.zksummit.com) if you want to join. ZK is finally easy with Noir, the fastest-growing zero-knowledge programming language. Build privacy-preserving apps without any ZK experience. Aztec Labs is running a 4 week program,...

SaaS Talkâ„¢ with the Metrics Brothers - Strategies, Insights, & Metrics for B2B SaaS Executive Leaders

B2B SaaS Benchmark reports are popping up like mushrooms - but how do you know if the benchmark report is good, contains valuable insights and how to use the benchmarks to inform your own objectives and SaaS metrics goals? Dave "CAC" Kellogg and Ray "Growth" Rike take on a topic that is near and dear to their hearts and discuss several key factors that are a tell for a good benchmark report including:Top things to consider when reading a benchmark reportAttributes of a good benchmark reportAttributes of a "not so good" benchmark reportBenchmark utilization best practicesGood Benchmark reports to reviewIf you read benchmark reports and use them in helping to establish goals and objectives in your B2B SaaS company - this episode is a must listen!See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

The Moneywise Guys
4/2/25 The Price We Pay: Liberation History, Auto Market Shifts, and Billionaire Benchmarks

The Moneywise Guys

Play Episode Listen Later Apr 3, 2025 45:44


The Moneywise Radio Show and Podcast Wednesday, April 2nd BE MONEYWISE. Moneywise Wealth Management I "The Moneywise Guys" podcast call: 661-847-1000 text in anytime: 661-396-1000 website: www.MoneywiseGuys.com facebook: Moneywise_Wealth_Manageme instagram: MoneywiseWealthManagement

Revenue Cycle Optimized
RCM Insights – How AI Can Improve Performance Leveraging RCM Benchmarks

Revenue Cycle Optimized

Play Episode Listen Later Apr 3, 2025 24:20


In this RCM Insights episode, Stuart Newsome, VP of RCM Insights, breaks down the fundamentals of benchmarking in revenue cycle management, including key metrics like net days in A/R and clean claim rates. Learn how AI and automation tools can help healthcare organizations go beyond analytics to drive performance, identify gaps, and enable continuous improvement.

DiversifyRx
Mastering KPIs & Data: The Key to a Profitable Pharmacy You Love | Dr. Lisa Faast | Becoming A Pharmacy Badass

DiversifyRx

Play Episode Listen Later Apr 3, 2025 11:23


**In this episode of Becoming a Pharmacy Badass, Dr. Lisa Faast dives into one of the most critical aspects of running a successful and profitable pharmacy: understanding and leveraging KPIs (Key Performance Indicators) and data. Dr. Faast acknowledges that pulling reports from your PMS and calculating data can feel overwhelming, but she explains how to simplify the process and ensure that the data you rely on is accurate and actionable.** **Show Notes:** 1. **Understanding the Importance of Data in Pharmacy** [0:00] 2. **Benchmarks and Industry Standards ** [1:26] 3. **Handling Dirty Data ** [3:00] 4. **The Role of Action and Trend Tracking** [7:02] 5. **Practical Steps for Data Management ** [8:21] **Websites Mentioned: https://diversifyrx.com/weekly-awesomeness-newsletter/ https://www.drlisafaast.com/ ----- #### **Becoming a Badass Pharmacy Owner Podcast is a Proud to be Apart of the Pharmacy Podcast Network**

Reward Hacking by Reasoning Models & Loss of Control Scenarios w/ Jeffrey Ladish of Palisade Research, from FLI Podcast

Play Episode Listen Later Apr 2, 2025 92:17


On this cross-post episode, Jeffrey Ladish discusses the rapid pace of AI progress and the risks of losing control over powerful systems. We explore why AIs can be both smart and dumb, the challenges of creating honest AIs, and scenarios where AI could turn against us. Additionally, we delve into Palisade's new study on how reasoning models can cheat in chess by exploiting the game environment. Check out the Future of Life podcast here.: https://futureoflife.org/project/future-of-life-institute-podcast/ SPONSORS: Oracle Cloud Infrastructure (OCI) | 2025: Oracle Cloud Infrastructure offers next-generation cloud solutions that cut costs and boost performance. With OCI, you can run AI projects and applications faster and more securely for less. New U.S. customers can save 50% on compute, 70% on storage, and 80% on networking by switching to OCI before May 31, 2024. See if you qualify at https://oracle.com/cognitive Shopify: Shopify is revolutionizing online selling with its market-leading checkout system and robust API ecosystem. Its exclusive library of cutting-edge AI apps empowers e-commerce businesses to thrive in a competitive market. Cognitive Revolution listeners can try Shopify for just $1 per month at https://shopify.com/cognitive NetSuite: Over 41,000 businesses trust NetSuite by Oracle, the #1 cloud ERP, to future-proof their operations. With a unified platform for accounting, financial management, inventory, and HR, NetSuite provides real-time insights and forecasting to help you make quick, informed decisions. Whether you're earning millions or hundreds of millions, NetSuite empowers you to tackle challenges and seize opportunities. Download the free CFO's guide to AI and machine learning at https://netsuite.com/cognitive PRODUCED BY: https://aipodcast.ing CHAPTERS: (00:00) About the Episode (02:59) The pace of AI progress (07:14) How we might lose control (10:22) Why are AIs sometimes dumb? (Part 1) (15:50) Sponsors: Oracle Cloud Infrastructure (OCI) | 2025 | Shopify (18:24) Why are AIs sometimes dumb? (Part 2) (18:24) Benchmarks vs real world (24:43) Loss of control scenarios (32:08) Why would AI turn against us? (Part 1) (32:09) Sponsors: NetSuite (33:42) Why would AI turn against us? (Part 2) (37:40) AIs hacking chess (43:30) Why didn't more advanced AIs hack? (48:44) Creating honest AIs (56:49) AI attackers vs AI defenders (01:05:32) How good is security at AI companies? (01:10:42) A sense of urgency (01:17:16) What should we do? (01:22:59) Skepticism about AI progress (01:29:38) Outro

Todd N Tyler Radio Empire
4/1 5-2 Benchmarks

Todd N Tyler Radio Empire

Play Episode Listen Later Apr 1, 2025 13:10


These seems attainable.See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Everyday AI Podcast – An AI and ChatGPT Podcast
EP 493: ChatGPT's groundbreaking image update, Google's chart-topping Gemini 2.5 drop, Microsoft's new reasoning agents and more AI news that matters

Everyday AI Podcast – An AI and ChatGPT Podcast

Play Episode Listen Later Mar 31, 2025 56:06


PodRocket - A web development podcast from LogRocket
Debugging apps with Deno and OpenTelemetry with Luca Casonato

PodRocket - A web development podcast from LogRocket

Play Episode Listen Later Mar 27, 2025 24:55


Luca Casanato, member of the Deno core team, delves into the intricacies of debugging applications using Deno and OpenTelemetry. Discover how Deno's native integration with OpenTelemetry enhances application performance monitoring, simplifies instrumentation compared to Node.js, and unlocks new insights for developers! Links https://lcas.dev https://x.com/lcasdev https://github.com/lucacasonato https://mastodon.social/@lcasdev https://www.linkedin.com/in/luca-casonato-15946b156 We want to hear from you! How did you find us? Did you see us on Twitter? In a newsletter? Or maybe we were recommended by a friend? Let us know by sending an email to our producer, Emily, at emily.kochanekketner@logrocket.com (mailto:emily.kochanekketner@logrocket.com), or tweet at us at PodRocketPod (https://twitter.com/PodRocketpod). Follow us. Get free stickers. Follow us on Apple Podcasts, fill out this form (https://podrocket.logrocket.com/get-podrocket-stickers), and we'll send you free PodRocket stickers! What does LogRocket do? LogRocket provides AI-first session replay and analytics that surfaces the UX and technical issues impacting user experiences. Start understand where your users are struggling by trying it for free at [LogRocket.com]. Try LogRocket for free today.(https://logrocket.com/signup/?pdr) Special Guest: Luca Casonato.

LinkedIn Ads Show
LinkedIn Ads Sales Benchmarks - Ep. 157

LinkedIn Ads Show

Play Episode Listen Later Mar 20, 2025 54:02 Transcription Available


Show Resources: Here are the resources we covered in the episode: Here's the Startup Benchmarks Report from the episode, free Connect with Ruby on LinkedIn Slides that Ruby referenced Follow Kacie Jenkins on LinkedIn Follow Betty Mok on LinkedIn Join the LinkedIn Ads Fanatics community and get access to our 4 courses to take you from beginner to expert Contact us at Podcast@B2Linked.com with any questions, suggestions, or corrections! Rate/Review: a great, no-cost way to support us Summary:  The Power of Benchmarks in LinkedIn Advertising & Sales In this episode of The LinkedIn Ads Show, host AJ Wilcox welcomes Ruby James from LinkedIn to discuss one of the most valuable benchmark reports available for B2B startups—Growth Benchmarks for Scaling B2B Startups. Unlike traditional benchmarks that focus solely on marketing performance, this report integrates both marketing and sales metrics, offering a holistic view of what success looks like for startups. Key Takeaways: The Evolution of Measurement in B2B Marketing The top-performing startups are measuring marketing success beyond lead generation, focusing on sales metrics like meetings booked, opportunities created, and revenue closed. Sales and marketing are increasingly aligned, with some companies merging them into a single "Revenue Team" to eliminate attribution battles. The Growing Impact of Executive Thought Leadership Startups that integrate executive and industry thought leadership into their content strategy see higher engagement rates and stronger conversion rates through the funnel. Data shows that retargeting audiences who saw thought leadership ads converted 45% more frequently than those who only saw traditional brand ads. Low-lift, high-impact content (such as quick takes from executives) performs better than highly produced videos. The Shift to an ABM (Account-Based Marketing) Strategy Early-stage startups have increased ABM adoption by 30% in just two years. More companies are leveraging LinkedIn's ABM tools, dynamically integrating CRM data and intent signals from platforms like G2 to refine their targeting. Successful startups focus on pipeline acceleration campaigns, ensuring they engage buying committees throughout the entire sales cycle. Reimagining the B2B Funnel The traditional marketing funnel is outdated, and leading companies are shifting to an “in-market vs. out-of-market” approach (based on LinkedIn's 95-5 rule). Companies that engage early in the buyer's journey (before prospects enter the market) are more likely to be on their final shortlist when they're ready to buy. The "Measurement Gold Rush" Startups are investing in advanced measurement solutions (e.g., HockeyStack, Factors AI, Dreamdata) to track multi-touch attribution and incrementality. The mindset is shifting to accepting that not everything can be perfectly measured and prioritizing longer time horizons for campaign evaluation. Final Thoughts & Call to Action The full benchmark report is available in the show notes, and listeners are encouraged to read along. Ruby invites listeners to connect with her on LinkedIn for more insights and upcoming research. AJ encourages LinkedIn Ads fanatics to check out the LinkedIn Ads Fanatics community for exclusive content, courses, and coaching. This episode provides game-changing insights into LinkedIn advertising and B2B marketing strategy, especially for startups looking to scale efficiently while ensuring sales and marketing are aligned. Want more insights? Join the LinkedIn Ads Fanatics community at fanatics.b2linked.com.  Show Transcript: For the full show transcript, see the show notes page here: Episode 157

The Dentist Money™ Show | Financial Planning & Wealth Management
#624: 2024 Dentist Financial Benchmarks: How Do You Measure Up? (Part 3)

The Dentist Money™ Show | Financial Planning & Wealth Management

Play Episode Listen Later Mar 19, 2025 44:56


On this episode of the Dentist Money Show, Ryan and Matt conclude the three-part series of the past year's biggest themes in dentistry and analyze the results from a recent survey sent out to Dentist Advisors' clients. The finale of the benchmark series includes insights about commercial real estate holdings, personal and practice debt averages, and insurance coverage. They explore the emotional weight of homeownership, some of the misconceptions surrounding real estate investments, and share key insights on student loans, life insurance, and disability insurance. Don't miss this wrap-up of 2024's dentist financial benchmarks, packed with valuable takeaways for dentists at any stage in their journey. Book a free consultation with a CFP® advisor who only works with dentists. Get an objective financial assessment and learn how Dentist Advisors can help you live your rich life.

Unsupervised Learning
Ep 58: Google Researchers Noam Shazeer and Jack Rae on Scaling Test-time Compute, Reactions to Ilya & AGI

Unsupervised Learning

Play Episode Listen Later Mar 17, 2025 69:28