In this episode of Journey Of Hope, a podcast produced by Heart For Lebanon, host Elio Constantine speaks with Milad Nassar, Field Director of the Heart For Lebanon Ministry Center in Southern Lebanon and pastor of the Hope Evangelical Church. Elio and Milad are in South Lebanon with an update from the field on the latest developments in the aftermath of the most recent conflict in Lebanon, as well as a new way that Heart For Lebanon is sharing the hope of Jesus with the community while meeting some very practical restoration needs.
Heart For Lebanon website: Home - Heart for Lebanon
Email Elio and the Journey Of Hope Podcast: podcast@heartforlebanon.org
Join our prayer ministry: Prayer Ministry - Heart for Lebanon
On this Wednesday, April 16 edition of Los Tenores, our panelists had an intense debate over the schedule change for Universidad de Chile's match against Deportes La Serena, which opens the eighth round of the 2025 Campeonato Nacional. Pancho Mouat, Danilo Díaz, Rodrigo Hernández, Víctor Cruces, and Carlos Costas listened to the frustration of the U's coach, Gustavo Álvarez, over the change. They also heard Pablo Milad's reflections after his visit to the Chamber of Deputies, and covered the resignation of Marko Biskupovic as the ANFP's national-teams manager. In addition, they had an interesting conversation in the ADN studios with Juan Pablo Pavez, a director of Azul Azul, who spoke in critical terms about the legal situation of Michael Clark, the club's president. Relive this Wednesday, April 16 edition of Los Tenores and don't miss a single detail of the "clásico de las 2".
Today on the show, Lisa was joined by Milad Parsa of Lost In Separation! We dive into Milad's first memories of music and the impact it made, the faithful Dallas music scene, having full-circle moments, how music helps Milad process his emotions, the new music on the way, and so much more.
Find Lost In Separation:
Instagram: https://www.instagram.com/lostinseparation
All other links via Linktree: https://linktr.ee/lostinsep
Spotify: https://open.spotify.com/artist/4cndLBc8ieDlCoknDkeDVn?si=DzB0cbzYRj-c7bfeRK7mgw
Apple Music: https://music.apple.com/us/artist/lost-in-separation/1232417865
Lost In Conversation (we talk about): https://youtu.be/Al02b5H0Nks?si=8veEWuGOzs706kKN
TikTok: https://www.tiktok.com/@lostinseparation?is_from_webapp=1&sender_device=pc
Find Stereo Therapy:
Instagram: https://www.instagram.com/stereo.therapy
Website: https://www.stereotherapypod.com
Stereo Therapy Theme Song by Walwin:
Instagram: https://www.instagram.com/walwin.mp3
Episode 270 of Podcast Juventus Club Svezia, with guest Milad Kazeroni, a writer on the Milan desk at SvenskaFans, who returns to the pod after appearing in episode 216. We discussed Milan's difficult spell, above all this season, and dissected the club from the RedBird ownership, to a Zlatan Ibrahimovic with an unclear role, to the mid-season coaching change, all the way down to the squad, and how to turn things around. There were also comparisons with Juventus, who are fighting for a top-four spot again this season, now in their fifth straight year without a Scudetto and in a relative slump with coaching changes in Turin as well. Support the pod too and become a patron: https://www.patreon.com/podcastjuventusclubsvezia
Intro/Outro Podcast Juventus Club Svezia, created by: Roger Myrehag - Oboogie Music
In this special father-daughter double feature, host Jeff Gross sits down with Pastor Milad Soleimani—founder of Leo Miami Sport and a spiritual leader at King Jesus Ministry—and his daughter Ludmilla, a viral content creator with over 11 million followers on TikTok.
On this Friday, March 28 edition of Los Tenores, our panelists kept tabs on the movements of Pablo Milad and Ricardo Gareca ahead of the meeting that could settle the future of La Roja's coach. Danilo Díaz, Cristian Arcos, Leo Burgueño, Víctor Cruces, and Carlos Costas also picked apart Universidad de Chile's loss to Everton, marked by Javier Altamirano's early sending-off for the azul side. They also heard from Aníbal Mosa after a publicity event at Colo Colo, the day after the draw against Palestino. Relive this Friday, March 28 edition of Los Tenores and don't miss a single detail of the "clásico de las 2", where they also previewed this afternoon's clash between Unión Española and Universidad Católica.
Milad Dagher is the Co-Founder and CEO of Nomic Bio.
The study investigates how recommender systems shape providers' dynamics and content offerings on platforms, and provides insights into algorithm designs for achieving better platform outcomes. It reveals that recommender systems can introduce biases into providers' understanding of user preferences, thereby affecting the variety of offerings on platforms. Moreover, it identifies algorithm design as a critical factor: item-based collaborative filters show superior performance in contexts where customers exhibit selectivity, while user-based models prove more effective in scenarios where recommendations significantly sway user decisions, ultimately boosting sales. Authors: Milad Mohammadi Darani and Sina Aghaie
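To make the algorithmic contrast concrete, here is a toy sketch, illustrative only and not the authors' implementation (the ratings matrix and function names are made up), of the two collaborative-filtering families the study compares:

```python
# Toy contrast of user-based vs item-based collaborative filtering.
# `ratings` is a small user-by-item matrix where 0 means "not rated".
import numpy as np

ratings = np.array([
    [5, 3, 0, 1],
    [4, 0, 0, 1],
    [1, 1, 0, 5],
    [0, 0, 5, 4],
], dtype=float)

def cosine_sim(M):
    # Pairwise cosine similarity between the rows of M.
    norms = np.linalg.norm(M, axis=1, keepdims=True)
    norms[norms == 0] = 1.0
    U = M / norms
    return U @ U.T

def predict_user_based(user, item):
    # Weight other users' ratings of `item` by their similarity to `user`.
    sims = cosine_sim(ratings)[user]
    rated = ratings[:, item] > 0
    if not rated.any():
        return 0.0
    w = sims[rated]
    return float(w @ ratings[rated, item] / (np.abs(w).sum() or 1.0))

def predict_item_based(user, item):
    # Weight `user`'s own ratings of other items by their similarity to `item`.
    sims = cosine_sim(ratings.T)[item]
    rated = ratings[user, :] > 0
    if not rated.any():
        return 0.0
    w = sims[rated]
    return float(w @ ratings[user, rated] / (np.abs(w).sum() or 1.0))

print(predict_user_based(1, 1), predict_item_based(1, 1))
```

User-based CF scores an item by what similar users liked; item-based CF scores it by the user's own history on similar items, which is one intuition for why the two families react differently to selective customers versus recommendation-driven demand.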
In southern Lebanon, the war between Israel and Hezbollah has left Christian communities in ruins. Milad Eid and his son Elieh walk through the remains of their hotel in Alma Shaab. The destruction is total. He was one of the last to leave this Christian border village before Israeli troops invaded at the end of September. When Israel withdrew at the end of January, he returned to find his life's work, built over 20 years, reduced to rubble.
On this Thursday, March 6 edition of Los Tenores, our panelists covered Unión Española's penalty-shootout victory over Everton to qualify for the Copa Sudamericana group stage, and spoke with the hispano hero, Franco Torgnascioli, who saved three penalties in the shootout. Cristian Arcos, Víctor Cruces, Leonardo Burgueño, Rodrigo Hernández, and Carlos Costas commented on Pablo Milad's remarks about Fernando Zampedri, the controversies of the Liga 2D, and the level of La Roja's U-20 side. They also analyzed the state of the pitch at Estadio Santa Laura, which could affect the staging of the match between Universidad de Chile and Unión Española, the administrative yellow cards handed to the azules, and Arturo Vidal's work to recover from his injury. Relive this Thursday, March 6 edition of Los Tenores and don't miss a single detail of the "clásico de las 2".
Milad Monshipour, co-founder and former CEO of Tapsi, the man who took the first Iranian startup public and one of the most influential figures in Iran's startup ecosystem. During his time as CEO, Tapsi became one of the main players in the market and opened a new path for Iranian startups.

In this episode, we talk about his path after leaving Tapsi. Milad speaks frankly and without pretense about why he decided to step aside, and about the challenges and successes of building a startup from zero to scale. We chat about the management style that drove Tapsi's growth, and ask what he is busy with these days and what his plans for the future are.

00:00 - Introduction and meet Milad Monshipour
02:30 - Why did he step away from Tapsi?
08:15 - The challenges of launching Tapsi
14:45 - Management style and Tapsi's growth
22:10 - The experience of going public
30:20 - What is he doing after Tapsi?
37:50 - The future of Iranian startups
45:00 - Investing and Milad's view of the ecosystem
52:30 - Personal experience of growth and failure
59:10 - The most important decisions of his time as CEO
1:05:45 - Tapsi's cultural and social impact
1:12:20 - Advice for entrepreneurs
1:18:50 - Milad's future plans
1:25:00 - Wrap-up and closing words

Milad Monshipour, the visionary co-founder and former CEO of Tapsi, one of Iran's leading ride-hailing platforms. Under his leadership, Tapsi achieved a historic milestone by becoming the first Iranian startup to go public, marking a significant moment in the nation's tech industry.

In this episode, we discuss Milad's journey after his departure from Tapsi. He candidly shares the reasons behind his decision to step down, offering insights into the challenges and triumphs of steering a startup to success. We also explore his innovative management strategies that propelled Tapsi to new heights and discuss his current ventures and future aspirations.

Join us for an inspiring conversation with one of Iran's most influential tech entrepreneurs as we uncover the story behind Tapsi's rise and what's next for Milad Monshipour.

This episode's sponsor: Shatel Business, enterprise solutions from Shatel, an enabler for businesses in managing their communication services. https://www.instagram.com/shatelbusiness/

More information about the Tabaghe 16 podcast and links to the audio episodes: https://linktr.ee/tabaghe16
In this powerful episode of Journey of Hope, host Elio Constantine reconnects with Milad, Heart for Lebanon's field manager and a lead pastor, to discuss the recent challenges and triumphs faced by communities in southern Lebanon. As the region emerges from a period of intense conflict, they reflect on the impact of the September escalation between Israel and Hezbollah, which displaced hundreds of thousands of people and forced Heart for Lebanon to temporarily halt operations. However, with the recent ceasefire and the formation of a new government, hope is beginning to flourish once again.

Milad shares heartwarming stories of answered prayers, highlighting the resilience of individuals like Malak, who feared for her daughter's safety during the turmoil but found solace in God's presence. The episode emphasizes how the community has rallied together, with the Hope Center reopening its doors to provide education and spiritual support, nurturing the next generation in a safe environment.

Listeners will be inspired by testimonies of transformation, including the story of Basam and his son, Raad, who overcame anxiety through prayer and faith. Milad also recounts the experience of Fadia, who found refuge after losing her home, demonstrating how God's love shines through even in the darkest times. As the episode unfolds, Milad reveals a growing interest in the Gospel among individuals from diverse backgrounds, showcasing the power of faith to unite and heal. With an increasing number of seekers attending Bible studies, the message of hope is spreading across the region.

Join Elio and Milad as they celebrate the resilience of the human spirit and the unwavering faith that continues to guide the people of Lebanon. This episode is a testament to the transformative power of prayer and community support in overcoming adversity. Don't miss this uplifting conversation that reminds us all of the hope that can emerge from despair. Subscribe, share, and engage with Heart for Lebanon's prayer ministry at heartforlebanon.org/prayer. Your support is crucial in bringing hope and change to those in need.
Milad Hajizade (clinical psychologist) comes back on the show to talk about grief. In this episode he opens up about the loss of both of his parents within one week of each other.
-------------------------
To learn more about psychedelic therapy, go to my brother Mehran's page at https://www.mindbodyintegration.ca/ or to https://www.legacyjourneys.ca/ for his next retreat.
*** Masty o Rasty is not responsible for, and does not condone, the views and opinions expressed by our guests. ***
--------
Support the show: https://paypal.me/raamemami
Venmo + Revolut: @KingRaam
The #1 podcast for entrepreneurs | 7DTV | Ronnie Overgoor in conversation with inspiring entrepreneurs
From refugee to successful entrepreneur: Milad Zandi shares his inspiring story, full of perseverance and ambition.
Lieske, Tanya www.deutschlandfunk.de, Tag für Tag
This episode is sponsored by https://WE-PN.com. Become your own VPN provider. To get 50% off, enter promo code: kingraam50
-------------------------
This episode is sponsored by BetterHelp. Give online therapy a try at https://betterhelp.com/MASTYORASTY and get on your way to being your best self.
-------------------------
In this episode Raam talks to Milad, who works in waste management. But the origin of their connection goes back to a tweet Milad made one day about Raam that caused a lot of controversy and went viral. Transparent communication is the key to healthier relationships and friendships.
Duration: 00:05:39 - L'Invité de la rédaction de France Bleu Maine - Settled in Le Mans for nearly thirty years, Syrian cardiologist Milad Aleid experienced "intensely" the fall of Bashar al-Assad in Syria on Sunday, December 8.
Agile in Hardware: The Future of Agile Hardware Development, A Case Study From the High-Power Semiconductor Industry With Milad Maleki and Markus Thut

In this Agile in Hardware episode, Milad Maleki and Markus Thut of Hitachi Energy Ltd. describe the pioneering methods and challenges of Agile hardware development in high-power semiconductor manufacturing. From introducing cutting-edge RoadPak technology for Formula E racing to optimizing consumer EV solutions, they share a fresh perspective on agile practices beyond the traditional software domain. Join us as we uncover the intricacies of an iterative approach to hardware design, production integration, and actionable insights for advancing Agile principles in complex hardware manufacturing.

The RoadPak Project: Pioneering eMobility Innovation

Milad and Markus kick off the discussion with the story behind the RoadPak project, a powerful technology supporting electric mobility and racing industries, including Formula E and Formula 1. Developed initially as a small-scale prototype, RoadPak has since evolved into a versatile solution with wide-reaching applications in consumer electric vehicles and fast-charging stations.

"From racing to consumer EVs, RoadPak's journey showcases the transformative potential of scaling innovation from concept to consumer solutions."

Redefining Agile in Hardware: An Iterative Revolution

Unlike traditional hardware development's linear approach, the team adopted an Agile model to adapt and evolve both the product and its manufacturing processes at the same time. By designing the production line in tandem with the product, they created a collaborative environment where feedback directly informed product design and production-line adjustments.

"In Agile hardware, the manufacturing process becomes part of the product itself—a continual feedback loop between design, production, and customer needs."

Iterative Sample Development: The A, B, C, and D Samples

Milad and Markus discuss the use of progressive sample iterations (A, B, C, and D) to refine RoadPak's development. Within each sample phase they also iterated quickly, moving from A1 to A2 to A2b, for example. This approach provided invaluable insights, allowing for cost-effective tools and small-scale prototypes that rapidly incorporated feedback from customers and the production line.

"Every iteration helped us with fast and impactful learning cycles which refined both product design and manufacturing. Proving that fast feedback is crucial—even in hardware."

Customer Feedback and Early Prototyping: Shortening the Feedback Loop

To ensure RoadPak met real-world requirements, the team engaged customers early and often. They relied on simulation, rapid prototyping, and laser-cut parts to accelerate the feedback process. A specialized "evaluation kit" enabled customers to test the component in their own environments, exemplifying how quick delivery - even in hardware projects - can significantly speed up product development.

"Early customer feedback is critical; our evaluation kit bridged the gap, turning theoretical design into practical functionality for real-world testing."

Integrated Development: Product and Manufacturing as Partners

This episode emphasizes the unique challenges of developing the product and the manufacturing process concurrently. By focusing on early quality control and optimizing the process on-site, they achieved higher yield and product reliability, setting the foundation for scalable, high-quality production.
"For any new product, designing the manufacturing process alongside the product itself isn't optional—it's essential for quick feedback, and long-term success and quality."

Key Success Factors in Agile Hardware Development

Markus and Milad highlight the importance of cross-functional teams, communication, and a focus on dedicated resources. By streamlining their team's goals and processes, they maintained agility and clarity in the development cycle. The episode wraps up with tips and resources for those looking to apply Agile principles to hardware, emphasizing the value of flexible, collaborative workflows.

"Focus and communication drive success in Agile hardware; with the right team alignment, you're equipped to adapt quickly and effectively."

Recommended Resources

Milad and Markus suggest practical resources to deepen listeners' understanding of Agile in hardware. The book Scrum Essentials: Agile Software Development and Agile Project Management for Project Managers, Scrum Masters, Product Owners, and Stakeholders by Troy Dimes serves as an adaptable foundation.

"Books and frameworks are starting points, but adapting Agile to hardware means integrating experimentation as a core part of the process."

About Markus Thut and Milad Maleki

Markus Thut is a lead engineer at Hitachi Energy Ltd.'s semiconductor production in Lenzburg, Switzerland, specializing in high-power semiconductors and eMobility innovations. Markus is recognized for his forward-thinking approach to automation and industrial innovation, rooted in Swiss precision and a dedication to bringing visionary ideas to life. You can link with Markus Thut on LinkedIn and connect with Markus Thut on Twitter.

Milad Maleki is the Head of R&D for high-power semiconductors at Hitachi Energy. With a PhD from École Polytechnique Fédérale de Lausanne (EPFL), Milad has led groundbreaking research and development initiatives in the semiconductor field, championing collaboration and innovation to power a sustainable energy future. You can link with Milad Maleki on LinkedIn and connect with Milad Maleki on Twitter.
Milad Hajizade (clinical psychologist) comes back on the show to talk about cyberbullying and its effects on the human condition.
https://www.instagram.com/miladhajizade/
-------------------------
King Raam Tour:
Nov 21, 2024 - Boston
Nov 23, 2024 - New York
Nov 24, 2024 - Washington, DC
Nov 26, 2024 - Atlanta
Nov 29, 2024 - Houston
Nov 30, 2024 - Dallas, TX
Dec 3, 2024 - San Diego
Dec 4, 2024 - Los Angeles
Dec 5, 2024 - San Francisco
Dec 8, 2024 - Portland
Dec 9, 2024 - Seattle
kingraam.com/tour
In this informational episode of Journey of Hope, host Elio Constantine takes us through the heart-wrenching realities faced by Lebanon amidst the ongoing conflict between Israel and Hezbollah. Against the backdrop of Lebanon's economic crisis, Elio provides historical context for the current conflict, tracing its roots back to the establishment of Israel in 1948 and the subsequent formation of resistance movements like Hezbollah. Joining Elio is Camille Melki, co-founder of Heart for Lebanon, who shares the harrowing day-to-day experiences of Lebanese families caught in the crossfire. Camille underscores the critical role of the Church and Heart for Lebanon in providing holistic care to those in need, addressing not only physical but also emotional and spiritual impoverishment. Through their efforts, they offer a beacon of hope and resilience in the face of despair.

Listeners will hear poignant stories from the field, including that of Delana, a Kurdish woman displaced twice, and Nisreen, who finds solace in Bible study amidst her family's displacement. These narratives, shared by field managers Bashir and Milad, highlight the transformative impact of Heart for Lebanon's ministry on individuals and communities.

Camille emphasizes the importance of prayer, awareness, and being messengers of peace. He calls on listeners to engage in prayer for peace, repentance, and the ongoing work of Heart for Lebanon. As the episode concludes, Elio urges listeners to share the podcast, engage with the Heart for Lebanon community, and join in prayer for the safety and wisdom of the team working tirelessly on the ground.

Don't miss this powerful episode that sheds light on the struggles and triumphs of those living in Lebanon. For more information or to join the prayer ministry, visit heartforlebanon.org/prayer. Your support and prayers can make a difference.
Episode 135: a conversation with Milad Daniel, brother of Mina Daniel, who was killed by a bullet to the chest during the Maspero events. Why did Mina go down to Tahrir Square? What did he ask of his friends regarding his funeral? What was the position of Pope Shenouda, and later Pope Tawadros, on the Maspero events? Why did Anba Yoannes say he would close the cemetery and not allow Mina to be buried there? Why did the family leave Egypt, and what happened to them at Incheon Airport in South Korea?
Milad Hajizade (clinical psychologist) comes back on the show to talk about sex and relationships in the modern age.
This week, we revisit our interview with John Milad. John has 25 years of experience as a start-up executive, venture investor, and banker specializing in healthcare innovation across the UK, Europe, and the US. He has 10 years of life-sciences venture experience at a family office, Atlas Venture, NBGI Ventures, and Downing. Notable exits include Symetis, ACT/EPIX, BoneSupport, and Medcast/WebMD. John is a co-founder of Quanta Dialysis Technologies, which he led as CEO for 8 years. Notable outcomes include creating an award-winning, disruptive portable dialysis system; raising nearly $400m in funding; commercially launching in the UK and US; and establishing trans-Atlantic commercial operations. For another 2 years, he served as an executive at Nitec Pharma, which was ultimately sold to Horizon Therapeutics. John has held multiple board positions across the sector, served as an advisor to the NIHR i4i programme, and been a mentor at the Enterprise Hub of the Royal Academy of Engineering. He has been recognized in the Sunday Times Maserati list of the UK's Top 100 "Game Changing Innovators and Entrepreneurs", and won the 2022 MacRobert Award from the Royal Academy of Engineering for outstanding engineering innovation.
(We are terribly sorry about the audio quality in this episode; there were some audio glitches that we unfortunately didn't notice.)
Milad Hajizadeh is a clinical psychologist, currently on a trip in Nepal. He joins Raam to talk about the rise of psychology and mental-health awareness in the age of social media.
This week Ellen Halliday and Alona Ferber discuss a vital question: what is the reality of life under occupation for Palestinians? And how much longer can the world ignore it? They are joined by Nathan Thrall, the Pulitzer Prize-winning author of A Day in the Life of Abed Salama: Anatomy of a Jerusalem Tragedy, which follows the life of a Palestinian man whose son Milad died in a 2012 road accident in the West Bank.

Thrall, who is based in Jerusalem, discusses how Salama's story provides a window into the wider tragedy of Israel-Palestine.
The Misguidance of Meelad (Mawlid) - Moulavi Rahmatullah Firdousi | Rahmatullah Firdousi. 06-09-2024, Jumma
Today's guest, Nicholas Carlini, a research scientist at DeepMind, argues that we should be focusing more on what AI can do for us individually, rather than trying to have an answer for everyone.

"How I Use AI" - A Pragmatic Approach

Carlini's blog post "How I Use AI" went viral for good reason. Instead of giving a personal opinion about AI's potential, he simply laid out how he, as a security researcher, uses AI tools in his daily work. He divided it into 12 sections:
* To make applications
* As a tutor
* To get started
* To simplify code
* For boring tasks
* To automate tasks
* As an API reference
* As a search engine
* To solve one-offs
* To teach me
* Solving solved problems
* To fix errors
Each of the sections has specific examples, so we recommend going through it. It also includes all the prompts used; in the "make applications" case, it's 30,000 words total!

My personal takeaway is that the majority of the work AI can do successfully is what humans dislike doing: writing boilerplate code, looking up docs, taking repetitive actions, etc. These are usually boring tasks with little creativity but a lot of structure. This is one of the strongest arguments for why LLMs, especially for code, are more beneficial to senior employees: if you can get the boring stuff out of the way, there's a lot more value you can generate. This is less and less true for entry-level jobs, which consist mostly of boring and repetitive tasks. Nicholas argues both sides ~21:34 in the pod.

A New Approach to LLM Benchmarks

We recently did a Benchmarks 201 episode, a follow-up to our original Benchmarks 101, and some of the issues have stayed the same. Notably, there's a big discrepancy between what benchmarks like MMLU test and what the models are actually used for. Carlini created his own domain-specific language for writing personalized LLM benchmarks. The idea is simple but powerful:
* Take tasks you've actually needed AI for in the past.
* Turn them into benchmark tests.
* Use these to evaluate new models based on your specific needs.
It can represent very complex tasks, from a single code generation to drawing a US flag using C:

"Write hello world in python" >> LLMRun() >> PythonRun() >> SubstringEvaluator("hello world")

"Write a C program that draws an american flag to stdout." >> LLMRun() >> CRun() >> VisionLLMRun("What flag is shown in this image?") >> (SubstringEvaluator("United States") | SubstringEvaluator("USA"))

This approach solves a few problems:
* It measures what's actually useful to you, not abstract capabilities.
* It's harder for model creators to "game" your specific benchmark, a problem that has plagued standardized tests.
* It gives you a concrete way to decide if a new model is worth switching to, similar to how developers might run benchmarks before adopting a new library or framework.
Carlini argues that if even a small percentage of AI users created personal benchmarks, we'd have a much better picture of model capabilities in practice.

AI Security

While much of the AI security discussion focuses on either jailbreaks or existential risks, Carlini's research targets the space in between. Some highlights from his recent work:
* LAION 400M data poisoning: By buying expired domains referenced in the dataset, Carlini's team could inject arbitrary images into models trained on LAION 400M. You can read the paper "Poisoning Web-Scale Training Datasets is Practical" for all the details.
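A quick aside on the benchmark DSL above: the ">>" chaining is mostly operator overloading. Here is a toy reconstruction of how such a pipeline could be wired up; this is not Carlini's actual framework, and query_llm is an assumed stand-in you would replace with a real model call:

```python
# Toy reconstruction of a ">>"-style eval pipeline via operator overloading;
# not Carlini's actual framework. query_llm is an assumed stand-in helper.
import subprocess

def query_llm(prompt):
    # Stand-in for a real model call (OpenAI, Gemini, a local model, ...).
    return 'print("hello world")'

class Node:
    def run(self, value):
        raise NotImplementedError
    def __rshift__(self, other):            # stage >> stage chains two stages
        first, second = self, other
        chained = Node()
        chained.run = lambda v: second.run(first.run(v))
        return chained
    def __rrshift__(self, prompt):          # lets a bare string start a chain
        return Constant(prompt) >> self
    def __or__(self, other):                # evaluator | evaluator: either passes
        a, b = self, other
        union = Node()
        union.run = lambda v: a.run(v) or b.run(v)
        return union

class Constant(Node):
    def __init__(self, value): self.value = value
    def run(self, _): return self.value

class LLMRun(Node):
    def run(self, prompt): return query_llm(prompt)

class PythonRun(Node):
    def run(self, code):                    # run generated code, capture stdout
        return subprocess.run(["python", "-c", code],
                              capture_output=True, text=True).stdout

class SubstringEvaluator(Node):
    def __init__(self, needle): self.needle = needle
    def run(self, text): return self.needle in text

test = "Write hello world in python" >> LLMRun() >> PythonRun() \
       >> SubstringEvaluator("hello world")
print(test.run(None))                       # True if the generated program passes
```

And the poisoning result is easier to appreciate once you see how cheap its precondition is to check. A hypothetical, stdlib-only sketch (not the paper's tooling) that scans a list of dataset image URLs for domains that no longer resolve:

```python
# Hypothetical sketch of the attack's precondition: find dangling domains in a
# list of dataset image URLs. A dead domain could potentially be re-registered,
# after which the attacker controls whatever image those dataset URLs point to.
# Note: failing to resolve is only a first filter; actual availability for
# purchase would require a registrar check.
import socket
from urllib.parse import urlparse

def dangling_domains(urls):
    dead = set()
    for url in urls:
        host = urlparse(url).hostname
        if not host:
            continue
        try:
            socket.gethostbyname(host)      # still resolves: not interesting
        except socket.gaierror:
            dead.add(host)                  # candidate for re-registration
    return dead

# Usage (hypothetical file with one URL per line):
# print(dangling_domains(open("laion_urls.txt").read().split()))
```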
The LAION attack is a great example of expanding the scope beyond the model itself and looking at how the whole system can become vulnerable.
* Stealing model weights: They demonstrated how to extract parts of production language models (like OpenAI's) through careful API queries. This research, "Stealing Part of a Production Language Model", shows that even black-box access can leak sensitive information.
* Extracting training data: In some cases, they found ways to make models regurgitate verbatim snippets from their training data. He and Milad Nasr wrote a paper on this as well: "Scalable Extraction of Training Data from (Production) Language Models". They also think this might be applicable to extracting RAG results from a generation.
These aren't just theoretical attacks. They've led to real changes in how companies like OpenAI design their APIs and handle data. If you really miss logit_bias and logit results by token, you can blame Nicholas :)
We had a ton of fun also chatting about things like Conway's Game of Life, how much data can fit on a piece of paper, and porting Doom to JavaScript. Enjoy!

Show Notes
* How I Use AI
* My Benchmark for LLMs
* Doom Javascript port
* Conway's Game of Life
* Tic-Tac-Toe in one printf statement
* International Obfuscated C Code Contest
* Cursor
* LAION 400M poisoning paper
* Man vs Machine at Black Hat
* Model Stealing from OpenAI
* Milad Nasr
* H.D. Moore
* Vijay Bolina
* Cosine.sh
* uuencode

Timestamps
* [00:00:00] Introductions
* [00:01:14] Why Nicholas writes
* [00:02:09] The Game of Life
* [00:05:07] "How I Use AI" blog post origin story
* [00:08:24] Do we need software engineering agents?
* [00:11:03] Using AI to kickstart a project
* [00:14:08] Ephemeral software
* [00:17:37] Using AI to accelerate research
* [00:21:34] Experts vs non-expert users as beneficiaries of AI
* [00:24:02] Research on generating less secure code with LLMs
* [00:27:22] Learning and explaining code with AI
* [00:30:12] AGI speculations?
* [00:32:50] Distributing content without social media
* [00:35:39] How much data do you think you can put on a single piece of paper?
* [00:37:37] Building personal AI benchmarks
* [00:43:04] Evolution of prompt engineering and its relevance
* [00:46:06] Model vs task benchmarking
* [00:52:14] Poisoning LAION 400M through expired domains
* [00:55:38] Stealing OpenAI models from their API
* [01:01:29] Data stealing and recovering training data from models
* [01:03:30] Finding motivation in your work

Transcript
Alessio [00:00:00]: Hey everyone, welcome to the Latent Space podcast. This is Alessio, partner and CTO-in-Residence at Decibel Partners, and I'm joined by my co-host Swyx, founder of Smol AI.
Swyx [00:00:12]: Hey, and today we're in the in-person studio, which Alessio has gorgeously set up for us, with Nicholas Carlini. Welcome. Thank you. You're a research scientist at DeepMind. You work at the intersection of machine learning and computer security. You got your PhD from Berkeley in 2018, and also your BA from Berkeley as well. And mostly we're here to talk about your blogs, because you are so generous in just writing up what you know. Well, actually, why do you write?
Nicholas [00:00:41]: Because I like, I feel like it's fun to share what you've done. I don't like writing, sufficiently didn't like writing, I almost didn't do a PhD, because I knew how much writing was involved in writing papers. I was terrible at writing when I was younger. I did, like, the remedial writing classes when I was in university, because I was really bad at it.
So I don't actually enjoy, I still don't enjoy the act of writing. But I feel like it is useful to share what you're doing, and I like being able to talk about the things that I'm doing that I think are fun. And so I write because I think I want to have something to say, not because I enjoy the act of writing.
Swyx [00:01:14]: But yeah. It's a tool for thought, as they often say. Is there any sort of background or thing that people should know about you as a person? Yeah.
Nicholas [00:01:23]: So I tend to focus on, like you said, I do security work, I try to like attacking things and I want to do like high quality security research. And that's mostly what I spend my actual time trying to be productive members of society doing that. But then I get distracted by things, and I just like, you know, working on random fun projects. Like a Doom clone in JavaScript.
Swyx [00:01:44]: Yes.
Nicholas [00:01:45]: Like that. Or, you know, I've done a number of things that have absolutely no utility. But are fun things to have done. And so it's interesting to say, like, you should work on fun things that just are interesting, even if they're not useful in any real way. And so that's what I tend to put up there is after I have completed something I think is fun, or if I think it's sufficiently interesting, write something down there.
Alessio [00:02:09]: Before we go into like AI, LLMs and whatnot, why are you obsessed with the game of life? So you built multiplexing circuits in the game of life, which is mind boggling. So where did that come from? And then how do you go from just clicking boxes on the UI web version to like building multiplexing circuits?
Nicholas [00:02:29]: I like Turing completeness. The definition of Turing completeness is a computer that can run anything, essentially. And the game of life, Conway's game of life is a very simple cellular 2D automaton where you have cells that are either on or off. And a cell becomes on if in the previous generation some configuration holds true and off otherwise. It turns out there's a proof that the game of life is Turing complete, that you can run any program in principle using Conway's game of life. I don't know. And so you can, therefore someone should. And so I wanted to do it. Some other people have done some similar things, but I got obsessed into like, if you're going to try and make it work, like we already know it's possible in theory. I want to try and like actually make something I can run on my computer, like a real computer I can run. And so yeah, I've been going on this rabbit hole of trying to make a CPU that I can run semi real time on the game of life. And I have been making some reasonable progress there. And yeah, but you know, Turing completeness is just like a very fun trap you can go down. A while ago, as part of a research paper, I was able to show that in C, if you call into printf, it's Turing complete. Like printf, you know, like, which like, you know, you can print numbers or whatever, right?
Swyx [00:03:39]: Yeah, but there should be no like control flow stuff.
Nicholas [00:03:42]: Because printf has a percent n specifier that lets you write an arbitrary amount of data to an arbitrary location. And the printf format specifier has an index into where it is in the loop that is in memory. So you can overwrite the location of where printf is currently indexing using percent n. So you can get loops, you can get conditionals, and you can get arbitrary data rates again.
So we sort of have another Turing complete language using printf, which again, like this has essentially zero practical utility, but like, it's just, I feel like a lot of people get into programming because they enjoy the art of doing these things. And then they go work on developing some software application and lose all joy with the boys. And I want to still have joy in doing these things. And so on occasion, I try to stop doing productive, meaningful things and just like, what's a fun thing that we can do and try and make that happen.
Alessio [00:04:39]: Awesome. So you've been kind of like a pioneer in the AI security space. You've done a lot of talks starting back in 2018. We'll kind of leave that to the end because I know the security part is, there's maybe a smaller audience, but it's a very intense audience. So I think that'll be fun. But everybody in our Discord started posting your how I use AI blog post and we were like, we should get Carlini on the podcast. And then you were so nice to just, yeah, and then I sent you an email and you're like, okay, I'll come.
Swyx [00:05:07]: And I was like, oh, I thought that would be harder.
Alessio [00:05:10]: I think there's, as you said in the blog post, a lot of misunderstanding about what LLMs can actually be used for. What are they useful at? What are they not good at? And whether or not it's even worth arguing what they're not good at, because they're obviously not. So if you cannot count the R's in a word, they're like, it's just not what it does. So how painful was it to write such a long post, given that you just said that you don't like to write? Yeah. And then we can kind of run through the things, but maybe just talk about the motivation, why you thought it was important to do it.
Nicholas [00:05:39]: Yeah. So I wanted to do this because I feel like most people who write about language models being good or bad, some underlying message of like, you know, they have their camp and their camp is like, AI is bad or AI is good or whatever. And they like, they spin whatever they're going to say according to their ideology. And they don't actually just look at what is true in the world. So I've read a lot of things where people say how amazing they are and how all programmers are going to be obsolete by 2024. And I've read a lot of things where people who say like, they can't do anything useful at all. And, you know, like, they're just like, it's only the people who've come off of, you know, blockchain crypto stuff and are here to like make another quick buck and move on. And I don't really agree with either of these. And I'm not someone who cares really one way or the other how these things go. And so I wanted to write something that just says like, look, like, let's sort of ground reality and what we can actually do with these things. Because my actual research is in like security and showing that these models have lots of problems. Like this is like my day to day job is saying like, we probably shouldn't be using these in lots of cases. I thought I could have a little bit of credibility of in saying, it is true. They have lots of problems. We maybe shouldn't be deploying them lots of situations. And still, they are also useful. And that is the like, the bit that I wanted to get across is to say, I'm not here to try and sell you on anything. I just think that they're useful for the kinds of work that I do. And hopefully, some people would listen. And it turned out that a lot more people liked it than I thought.
But yeah, that was the motivation behind why I wanted to write this.
Alessio [00:07:15]: So you had about a dozen sections of like how you actually use AI. Maybe we can just kind of run through them all. And then maybe the ones where you have extra commentary to add, we can... Sure.
Nicholas [00:07:27]: Yeah, yeah. I didn't put as much thought into this as maybe was deserved. I probably spent, I don't know, definitely less than 10 hours putting this together.
Swyx [00:07:38]: Wow.
Alessio [00:07:39]: It took me close to that to do a podcast episode. So that's pretty impressive.
Nicholas [00:07:43]: Yeah. I wrote it in one pass. I've gotten a number of emails of like, you got this editing thing wrong, you got this sort of other thing wrong. It's like, I haven't just haven't looked at it. I tend to try it. I feel like I still don't like writing. And so because of this, the way I tend to treat this is like, I will put it together into the best format that I can at a time, and then put it on the internet, and then never change it. And this is an aspect of like the research side of me is like, once a paper is published, like it is done as an artifact that exists in the world. I could forever edit the very first thing I ever put to make it the most perfect version of what it is, and I would do nothing else. And so I feel like I find it useful to be like, this is the artifact, I will spend some certain amount of hours on it, which is what I think it is worth. And then I will just...
Swyx [00:08:22]: Yeah.
Nicholas [00:08:23]: Timeboxing.
Alessio [00:08:24]: Yeah. Stop. Yeah. Okay. We just recorded an episode with the founder of Cosine, which is like an AI software engineer colleague. You said it took you 30,000 words to get GPT-4 to build you the, can GPT-4 solve this kind of like app. Where are we in the spectrum where ChatGPT is all you need to actually build something versus I need a full-on agent that does everything for me?
Nicholas [00:08:46]: Yeah. Okay. So this was an... So I built a web app last year sometime that was just like a fun demo where you can guess if you can predict whether or not GPT-4 at the time could solve a given task. This is, as far as web apps go, very straightforward. You need basic HTML, CSS, you have a little slider that moves, you have a button, sort of animate the text coming to the screen. The reason people are going here is not because they want to see my wonderful HTML, right? I used to know how to do modern HTML in 2007, 2008. I was very good at fighting with IE6 and these kinds of things. I knew how to do that. I have no longer had to build any web app stuff in the meantime, which means that I know how everything works, but I don't know any of the new... Flexbox is new to me. Flexbox is like 10 years old at this point, but it's just amazing being able to go to the model and just say, write me this thing and it will give me all of the boilerplate that I need to get going. Of course it's imperfect. It's not going to get you the right answer, and it doesn't do anything that's complicated right now, but it gets you to the point where the only remaining work that needs to be done is the interesting hard part for me, the actual novel part. Even the current models, I think, are entirely good enough at doing this kind of thing, that they're very useful. It may be the case that if you had something, like you were saying, a smarter agent that could debug problems by itself, that might be even more useful.
Currently though, I make a model into an agent by just copying and pasting error messages for the most part. That's what I do, is you run it and it gives you some code that doesn't work, and either I'll fix the code, or it will give me buggy code and I won't know how to fix it, and I'll just copy and paste the error message and say, it tells me this. What do I do? And it will just tell me how to fix it. You can't trust these things blindly, but I feel like most people on the internet already understand that things on the internet, you can't trust blindly. And so this is not like a big mental shift you have to go through to understand that it is possible to read something and find it useful, even if it is not completely perfect in its output.
Swyx [00:10:54]: It's very human-like in that sense. It's the same ring of trust, I kind of think about it that way, if you had trust levels.
Alessio [00:11:03]: And there's maybe a couple that tie together. So there was like, to make applications, and then there's to get started, which is a similar you know, kickstart, maybe like a project that you know the LLM cannot solve. It's kind of how you think about it.
Nicholas [00:11:15]: Yeah. So getting started on things is one of the cases where I think it's really great for some of these things, where I sort of use it as a personalized, help me use this technology I've never used before. So for example, I had never used Docker before January. I know what Docker is. Lucky you. Yeah, like I'm a computer security person, like I sort of, I have read lots of papers on, you know, all the technology behind how these things work. You know, I know all the exploits on them, I've done some of these things, but I had never actually used Docker. But I wanted it to be able to, I could run the outputs of language model stuff in some controlled contained environment, which I know is the right application. So I just ask it like, I want to use Docker to do this thing, like, tell me how to run a Python program in a Docker container. And it like gives me a thing. I'm like, step back. You said Docker compose, I do not know what this word Docker compose is. Is this Docker? Help me. And like, you'll sort of tell me all of these things. And I'm sure there's this knowledge that's out there on the internet, like this is not some groundbreaking thing that I'm doing, but I just wanted it as a small piece of one thing I was working on. And I didn't want to learn Docker from first principles. Like I, at some point, if I need it, I can do that. Like I have the background that I can make that happen. But what I wanted to do was thing one. And it's very easy to get bogged down in the details of this other thing that helps you accomplish your end goal. And I just want to like, tell me enough about Docker so I can do this particular thing. And I can check that it's doing the safe thing. I sort of know enough about that from, you know, my other background. And so I can just have the model help teach me exactly the one thing I want to know and nothing more. I don't need to worry about other things that the writer of this thinks is important that actually isn't. Like I can just like stop the conversation and say, no, boring to me. Explain this detail. I don't understand. I think that's what that was very useful for me.
It would have taken me, you know, several hours to figure out some things that take 10 minutes if you could just ask exactly the question you want the answer to.
Alessio [00:13:05]: Have you had any issues with like newer tools? Have you felt any meaningful kind of like a cutoff day where like there's not enough data on the internet or? I'm sure that the answer to this is yes.
Nicholas [00:13:16]: But I tend to just not use most of these things. Like I feel like this is like the significant way in which I use machine learning models is probably very different than most people is that I'm a researcher and I get to pick what tools that I use and most of the things that I work on are fairly small projects. And so I can, I can entirely see how someone who is in a big giant company where they have their own proprietary legacy code base of a hundred million lines of code or whatever and like you just might not be able to use things the same way that I do. I still think there are lots of use cases there that are entirely reasonable that are not the same ones that I've put down. But I wanted to talk about what I have personal experience in being able to say is useful. And I would like it very much if someone who is in one of these environments would be able to describe the ways in which they find current models useful to them. And not, you know, philosophize on what someone else might be able to find useful, but actually say like, here are real things that I have done that I found useful for me.
Swyx [00:14:08]: Yeah, this is what I often do to encourage people to write more, to share their experiences because they often fear being attacked on the internet. But you are the ultimate authority on how you use things and that's objectively true. So they cannot be debated. One thing that people are very excited about is the concept of ephemeral software or like personal software. This use case in particular basically lowers the activation energy for creating software, which I like as a vision. I don't think I have taken as much advantage of it as I could. I feel guilty about that. But also, we're trending towards there.
Nicholas [00:14:47]: Yeah. No, I mean, I do think that this is a direction that is exciting to me. One of the things I wrote that was like, a lot of the ways that I use these models are for one-off things that I just need to happen that I'm going to throw away in five minutes. And you can.
Swyx [00:15:01]: Yeah, exactly.
Nicholas [00:15:02]: Right. It's like the kind of thing where it would not have been worth it for me to have spent 45 minutes writing this, because I don't need the answer that badly. But if it will only take me five minutes, then I'll just figure it out, run the program and then get it right. And if it turns out that you ask the thing, it doesn't give you the right answer. Well, I didn't actually need the answer that badly in the first place. Like either I can decide to dedicate the 45 minutes or I cannot, but like the cost of doing it is fairly low. You see what the model can do. And if it can't, then, okay, when you're using these models, if you're getting the answer you want always, it means you're not asking them hard enough questions.
Swyx [00:15:35]: Say more.
Nicholas [00:15:37]: Lots of people only use them for very small particular use cases and like it always does the thing that they want. Yeah.
Swyx [00:15:43]: Like they use it like a search engine.
Nicholas [00:15:44]: Yeah. Or like one particular case.
And if you're finding that when you're using these, it's always giving you the answer that you want, then probably it has more capabilities than you're actually using. And so I oftentimes try when I have something that I'm curious about to just feed into the model and be like, well, maybe it's just solved my problem for me. You know, most of the time it doesn't, but like on occasion, it's like, it's done things that would have taken me, you know, a couple hours that it's been great and just like solved everything immediately. And if it doesn't, then it's usually easier to verify whether or not the answer is correct than to have written it in the first place. And so you check, you're like, well, that's just, you're entirely misguided. Nothing here is right. It's just like, I'm not going to do this. I'm going to go write it myself or whatever.
Alessio [00:16:21]: Even for non-tech, I had to fix my irrigation system. I had an old irrigation system. I didn't know how it worked to program it. I took a photo, I sent it to Claude and it's like, oh yeah, that's like the RT 900. This is exactly, I was like, oh wow, you know, you know, a lot of stuff.
Swyx [00:16:34]: Was it right?
Alessio [00:16:35]: Yeah, it was right.
Swyx [00:16:36]: It worked. Did you compare with OpenAI?
Alessio [00:16:38]: No, I canceled my OpenAI subscription, so I'm a Claude boy. Do you have a way to think about this like one-off software thing? One way I talk to people about it is like LLMs are kind of converging to like semantic serverless functions, you know, like you can say something and like it can run the function in a way and then that's it. It just kind of dies there. Do you have a mental model to just think about how long it should live for and like anything like that?
Nicholas [00:17:02]: I don't think I have anything interesting to say here, no. I will take whatever tools are available in front of me and try and see if I can use them in meaningful ways. And if they're helpful, then great. If they're not, then fine. And like, you know, there are lots of people that I'm very excited about seeing all these people who are trying to make better applications that use these or all these kinds of things. And I think that's amazing. I would like to see more of it, but I do not spend my time thinking about how to make this any better.
Alessio [00:17:27]: What's the most underrated thing in the list? I know there's like simplified code, solving boring tasks, or maybe is there something that you forgot to add that you want to throw in there?
Nicholas [00:17:37]: I mean, so in the list, I only put things that people could look at and go, I understand how this solved my problem. I didn't want to put things where the model was very useful to me, but it would not be clear to someone else that it was actually useful. So for example, one of the things that I use it a lot for is debugging errors. But the errors that I have are very much not the errors that anyone else in the world will have. And in order to understand whether or not the solution was right, you just have to trust me on it. Because, you know, like I got my machine in a state that like CUDA was not talking to whatever some other thing, the versions were mismatched, something, something, something, and everything was broken. And like, I could figure it out with interaction with the model, and it gave it like told me the steps I needed to take. But at the end of the day, when you look at the conversation, you just have to trust me that it worked.
And I didn't want to write things online that were this, like, you have to trust me on what I'm saying. I want everything that I said to like have evidence that like, here's the conversation, you can go and check whether or not this actually solved the task as I said that the model does. Because a lot of people I feel like say, I used a model to solve this very complicated task. And what they mean is the model did 10%, and I did the other 90% or something, I wanted everything to be verifiable. And so one of the biggest use cases for me, I didn't describe even at all, because it's not the kind of thing that other people could have verified by themselves. So that maybe is like, one of the things that I wish I maybe had said a little bit more about, and just stated that the way that this is done, because I feel like that this didn't come across quite as well. But yeah, of the things that I talked about, the thing that I think is most underrated is the ability of it to solve the uninteresting parts of problems for me right now, where people always say, this is one of the biggest arguments that I don't understand why people say is, the model can only do things that people have done before. Therefore, the model is not going to be helpful in doing new research or like discovering new things. And as someone whose day job is to do new things, like what is research? Research is doing something literally no one else in the world has ever done before. So this is what I do every single day, 90% of this is not doing something new, 90% of this is doing things a million people have done before, and then a little bit of something that was new. There's a reason why we say we stand on the shoulders of giants. It's true. Almost everything that I do is something that's been done many, many times before. And that is the piece that can be automated. Even if the thing that I'm doing as a whole is new, it is almost certainly the case that the small pieces that build up to it are not. And a number of people who use these models, I feel like expect that they can either solve the entire task or none of the task. But now I find myself very often, even when doing something very new and very hard, having models write the easy parts for me. And the reason I think this is so valuable, everyone who programs understands this, like you're currently trying to solve some problem and then you get distracted. And whatever the case may be, someone comes and talks to you, you have to go look up something online, whatever it is. You lose a lot of time to that. And one of the ways we currently don't think about being distracted is you're solving some hard problem and you realize you need a helper function that does X, where X is like, it's a known algorithm. Any person in the world, you say like, give me the algorithm that, have a dense graph or a sparse graph, I need to make it dense. You can do this by doing some matrix multiplies. It's like, this is a solved problem. I knew how to do this 15 years ago, but it distracts me from the problem I'm thinking about in my mind. I needed this done. And so instead of using my mental capacity and solving that problem and then coming back to the problem I was originally trying to solve, you could just ask the model, please solve this problem for me. It gives you the answer. You run it. You can check that it works very, very quickly. And now you go back to solving the problem without having lost all the mental state.
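The helper Nicholas describes here is exactly the kind of thing a model can one-shot and a human can verify in seconds. A hypothetical sketch of that sparse-to-dense conversion (illustrative only, not his actual prompt output):

```python
# Hypothetical sketch of the throwaway helper described above: turn a sparse
# edge list into a dense adjacency matrix.
import numpy as np

def edges_to_dense(num_nodes, edges):
    A = np.zeros((num_nodes, num_nodes), dtype=int)
    for i, j in edges:
        A[i, j] = 1
        A[j, i] = 1          # assuming an undirected graph
    return A

print(edges_to_dense(3, [(0, 1), (1, 2)]))
```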
Swyx [00:21:34]: In terms of this concept of expert users versus non-expert users, floors versus ceilings, you had a strong opinion that it's actually more beneficial for non-experts.

Nicholas [00:21:46]: Yeah, I don't know. I think it could go either way. Let me give you the argument for both. I can only speak on the expert user's behalf, because I've been doing computers for a long time. The cases where it's useful for me are exactly the cases where I can check the output. Anything the model can do, I could have done, and I could have done better, so I can check every single thing the model is doing and make sure it's correct in every way. So I can definitely say it's been useful for me. But I also see a world in which this could be very useful for the kinds of people who do not have this knowledge, with caveats, because I'm not one of those people and I don't have that direct experience. One of the big ways I can see this working is for things you can check fairly easily. Someone who could never have written a program themselves to do a certain task can just ask for the program that does the thing. Some of the time it won't get it right, but some of the time it will, and they'll have the thing in front of them that they just couldn't have had before. And we see a lot of people trying to build applications for this, like integrating language models into spreadsheets. Spreadsheets run the world. There are some people who know how to do all the complicated spreadsheet equations, and other people who don't, who use the spreadsheet program but do everything manually, one cell at a time. This is a case where a model could give you a solution. But the person has to be rigorous in testing that the solution actually does the correct thing, and that is the part I'm worried about most: depending on these systems in ways that we shouldn't. This is what my research says; my research is entirely on this. You probably shouldn't trust these models in adversarial situations. I understand this very deeply. So I think it's possible for people who don't have this knowledge to make use of these tools in meaningful ways, but I'm worried it might end up in a world where people blindly trust them, deploy them in situations they probably shouldn't, and then someone like me gets to come along and break everything, because everything is terrible. I am very, very worried about that being the case, but I think if done carefully, these could be very useful.

Swyx [00:23:54]: Yeah, there is some research out there showing that when people use LLMs to generate code, they generate less secure code.

Nicholas [00:24:02]: Yeah, Dan Boneh has a nice paper on this. There are a bunch of papers that touch on exactly this.

Swyx [00:24:07]: My slight issue is, you know, is there an agenda here?

Nicholas [00:24:10]: I mean, okay, the Dan Boneh one at least, I fully trust.

Swyx [00:24:15]: Sorry, I don't know who Dan is.

Nicholas [00:24:17]: He's a professor at Stanford. He and some students have some things on this. Yeah, there are a number.
I agree that a lot of the stuff feels like people have an agenda behind it. There are some that don't, and I trust them to have done the right thing. But even here we have to be careful, because whenever someone says X is true about language models, you should always append the suffix "for current models." I'll be the first to admit I was one of the people who was very much of the opinion that these language models are fun toys and are going to have absolutely no practical utility. If you had asked me this in, let's say, 2020, I still would have said the same thing. I had seen GPT-2; I had written a couple of papers studying GPT-2 very carefully. I still would have told you these things are toys. And when I first read the RLHF paper and the instruction tuning paper, I thought, nope, this is just a thing these weird AI people are doing, making analogies to people that make no sense. I didn't even care to read it; I saw what it was about and just didn't look further. I was obviously wrong. These things can be useful. And I feel like a lot of people had the same mentality I did and decided not to change their mind. That's what I want people to be careful about. I want them to at least know what is true about the world today, so they can see that maybe they should reconsider opinions they formed four or five years ago that may just not be true of today's models.

Swyx [00:25:47]: Specifically because you brought up spreadsheets, I want to share my personal experience, because I think Google has done a really good job that people don't know about: if you use Google Sheets, Gemini is integrated inside of Google Sheets and it helps you write formulas. Great.

Nicholas [00:26:00]: That's news to me.

Swyx [00:26:01]: Right? Maybe they don't do a good job of telling people. Unless you watched Google I/O, there was no other opportunity to learn that Gemini is now in your Google Sheets. So I just don't write formulas manually anymore. I prompt Gemini to do it for me, and it does it.

Nicholas [00:26:15]: One of the problems these machine learning models have is a discoverability problem. I think this will be figured out; it's the same problem you have with any assistant. You're given a blank box and you're like, what do I do with it? I think this is great, and it would be good for more of these things to exist, but I want them to exist in ways where we can actually make sure they're done correctly. I don't want them just pushed into more and more things blindly. There are far too many "X plus AI" products, where X is some arbitrary thing in the world that has nothing to do with AI and could not benefit from it at all, and they're doing it just because they want to use the word. I don't want that to happen.

Swyx [00:26:58]: You don't want an AI fridge?

Nicholas [00:27:00]: No. I do not want my fridge on the internet.

Swyx [00:27:03]: Okay.

Nicholas [00:27:05]: Anyway, let's not go down that rabbit hole. I understand why some of that happens, because people want to sell things or whatever. But I feel like a lot of people see that and then write off everything as a result. And I just want to say: there are allowed to be people trying to do things that don't make any sense. Just ignore them.
Do the things that make sense.

Alessio [00:27:22]: Another chunk of use cases was learning: explaining code, being an API reference, all these different things. Any suggestions on how to go at it? One way is to generate code and then have it explained. Another is to just ask about a technology. Another is, hey, I read this online, help me understand it. Any best practices for getting the most out of it?

Nicholas [00:27:47]: I don't know if I have best practices; I have how I use them. I find it very useful for cases where I understand the underlying ideas but have never used them in this particular way before. I know what I'm looking for, but I just don't know how to get there. As an API reference is a great example. The tool everyone always picks on is FFmpeg: no one in the world knows the command line arguments to do what they want. You say, make the thing faster, I want a lower bitrate, is it dash V, or what? Once you tell me what the answer is, I can check it. It's great for these kinds of things.

Or, in other cases, things where I don't really care whether the answer is 100% correct. For example, I do a lot of security work. Most of security work is reading some code you've never seen before and finding out which pieces of it are actually important, because most of the program doesn't have anything to do with security: it has the display piece or some other piece or whatever, and you want to ignore all of that. So one very fun use of models is to have them describe all the functions, then skim the descriptions and ask, which ones look approximately like the right things to look at? Because otherwise, what are you going to do? You're going to have to read them all manually. And when you read them manually, you skim each function anyway and don't figure out what's going on perfectly. You already know that what you're going to do is figure out roughly what's going on, then delve into the details. This is a great way of doing exactly that, but faster, because the model abstracts most of it correctly. It's going to be wrong some of the time. I don't care. I would have been wrong too. As long as you treat it that way, I think it's great.

One particular use case I have in the post is decompiling binaries. Oftentimes people will release a binary without the source code, and you want to figure out how to attack it. One thing you could do is run some kind of decompiler; it turns out that for the thing I wanted, none existed. So I spent too many hours doing it by hand before I finally thought, why am I doing this? I should just check whether the model can do it for me. And it turns out it can. It can turn the compiled code, which is essentially impossible for any human to understand, into Python code that is entirely reasonable to understand. It doesn't run; it has a bunch of problems. But it's so much nicer that it's immediately a win for me. I can figure out approximately where I should be looking, and then spend all of my time doing that by hand. And again, you get a big win there.
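Here is a minimal sketch of the function-triage workflow described above, assuming the OpenAI Python client; the model name, prompt wording, and example input are illustrative assumptions, not Carlini's actual tooling:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def summarize_function(source: str) -> str:
    """Ask the model for a one-line description of a single function."""
    resp = client.chat.completions.create(
        model="gpt-4o",  # illustrative model choice
        messages=[{
            "role": "user",
            "content": "In one line, what does this function do?\n\n" + source,
        }],
    )
    return resp.choices[0].message.content

# Skim the one-liners and only read the functions that look security-relevant.
for fn_source in ["def check_password(pw): ..."]:  # hypothetical input
    print(summarize_function(fn_source))
```

The tolerance for error is the design point: a wrong one-liner costs nothing, because you would have skimmed imperfectly by hand anyway.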
Swyx [00:30:12]: I fully agree with all those use cases, especially for you as a security researcher having to dive into multiple things; I imagine that's super helpful. I do want to move to your other blog post, but you ended your post with a little teaser about your next post and your speculations. What are you thinking about?

Nicholas [00:30:34]: So I want to write something, and I will at some point when I have time, maybe after I'm done writing my current papers for ICLR or something, where I talk about some thoughts I have for where language models are going in the near-term future. The reason I want to talk about this is that the discussion tends to be dominated by people who are either very much "AGI by 2027," or...

Swyx [00:30:55]: Always five years away.

Nicholas [00:31:00]: ...or people who make statements of the form, LLMs are the wrong path, and we should be abandoning this and doing something else instead. People look at these two polarizing options and go, well, those are obviously both very far extremes; what's a more nuanced take here? So I have some opinions I want to put down, just saying: I have wide margins of error, and I think you should too. If you say there's a 0% chance the models will get very, very good in the next five years, you're probably wrong. If you say there's a 100% chance, you're also probably wrong. And to be fair, most people, if you read behind the headlines, actually say something like this. But it's very hard to get clicks on the internet for "some things may be good in the future." Everyone wants either "nothing is going to be good, this is entirely wrong" or "it's going to be amazing." I want people who have negative reactions to these extreme views to at least be able to hear: there is something real here. It may not solve all of our problems, but it's probably going to get better; I don't know by how much. That's basically what I want to say. And then at some point I'll talk about the safety and security implications, because the way security intersects with these things depends a lot on exactly how people use these tools. If it turns out that these models get truly amazing and can solve tasks completely autonomously, that's a very different security world to be living in than if there's always a human in the loop, and the types of security questions I would want to ask would be very different. So in some very large part, understanding what the future will look like a couple of years ahead of time is helpful for figuring out which problems, as a security person, I want to solve now.

Alessio [00:32:50]: You mentioned getting clicks on the internet, but you don't even have an X account or anything. How do you get people to read your stuff? What's your distribution strategy? Because this post was popping up everywhere, and people on Twitter were like, Nicholas Carlini wrote this, what's his handle? He doesn't have one. How did you find it? What's the story?

Nicholas [00:33:07]: So I have an RSS feed and an email list. And that's it.
I don't like most social media things. On principle, I feel like they have some harms. Personally, I have a problem when people say things that are wrong on the internet, and I would get nothing done if I had a Twitter account; I would spend all of my time correcting people and getting into fights. So it's useful for me that this just isn't an option, and I tend to just post things online. It's a very good question; I don't know how people find it. I feel like for some things I write, it resonates with other people, and then they put it on Twitter.

Swyx [00:33:43]: Hacker News as well.

Nicholas [00:33:44]: Sure, yeah. Because my day job is doing research, I get no value from having this be picked up. I don't need this other thing in order to give talks. So I feel like I can just say what I want to say, and if people find it useful, they'll share it widely. This one went pretty wide. I wrote a thing sometime late last year about how to recover data off an Apple ProFile drive from the 1980s. That got, I think, about 1000x fewer views than this one. But I don't care; that's not why I'm doing this. This is the benefit of having a thing I actually care about, which is my research: I would care much more if that didn't get seen. This is just a thing I write because I have some thoughts I want to put down.

Swyx [00:34:32]: Yeah. I think it's the long-form thoughtfulness and authenticity, which is sadly lacking sometimes in modern discourse, that makes it attractive. And now you have a bit of a brand as an independent thinker and writer, someone people tune in to for whatever comes next.

Nicholas [00:34:52]: Yeah, this kind of worries me a little. I don't like it when I have a popular thing and then I write another thing which is entirely unrelated.

Swyx [00:35:01]: You should actually just throw people off right now.

Nicholas [00:35:02]: Exactly. I need to put something else online. The last two or three things I've done in a row have been things that people should actually care about. I have a couple of things, and I'm trying to figure out which one to put online to cull the list of people who have subscribed to my email. To tell them: no, what you're here for is not informed, well-thought-through takes; what you're here for is whatever I want to talk about. And if you're not up for that, then go away. This is not what I want out of my personal website.

Swyx [00:35:27]: So, like, a top 10 enemies list or something.

Alessio [00:35:30]: What's the next project you're going to work on that is completely unrelated to LLM research? Or what games do you want to port into the browser next?

Nicholas [00:35:41]: Okay, here's a fun question. How much data do you think you can put on a single piece of paper?

Swyx [00:35:47]: I mean, you can think about bits and atoms. Yeah.

Nicholas [00:35:49]: No, a normal printer. I give you an office printer. How much data can you put on a piece of paper?

Alessio [00:35:54]: Can you re-decode it? So you can use base64 or whatever.
Nicholas [00:35:59]: Yeah, whatever you want. You get a normal off-the-shelf printer and an off-the-shelf scanner. How much data?

Swyx [00:36:03]: I'll just throw out there: 10 megabytes.

Nicholas [00:36:07]: That's enormous. That's a lot.

Swyx [00:36:10]: Really small fonts. That's my answer.

Nicholas [00:36:12]: So, I have a thing. It does about a megabyte.

Swyx [00:36:14]: Okay, so I was off by an order of magnitude.

Nicholas [00:36:16]: In particular, it's about 1.44 megabytes. A floppy disk.

Swyx [00:36:24]: A paper is a floppy disk.

Nicholas [00:36:25]: Yeah, exactly. That's supposed to be the title at some point: it's a floppy disk. This is a little hard, because you can do the math: you get 8.5 by 11 inches, you can print at 300 by 300 DPI, and this gives you around 2 megabytes. And to actually decode it off the paper, you need to recover every single pixel with something like 90-plus, 95, 99-point-something percent accuracy. This is one of the things I'm considering. I need to get a couple more things working for it; again, I'm running into some random problems. But this will probably be one thing I talk about. There's also this contest called the International Obfuscated C Code Contest, which is amazing: people try to write the most obfuscated C code they can. I have a submission for whenever they open the next one, and I'll write about it. It's a very fun gate-level emulation of an old CPU that runs fully precisely. It's a fun kind of thing.

Swyx [00:37:20]: Interesting. Your comment about the piece of paper reminds me of when I was in college and you would have one cheat sheet you could write. So you had a theoretical limit for bits per inch, and that's how much I would squeeze in, really, really small.

Nicholas [00:37:36]: Yeah, definitely.
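As a sanity check on the numbers in this exchange, here is the back-of-the-envelope arithmetic, assuming one recoverable bit per printed pixel (an assumption for illustration; the usable capacity depends on the encoding, and error correction eats into the raw budget):

```python
# Raw pixel budget of one printed page at the stated resolution.
width_px = 8.5 * 300    # 8.5 inches at 300 DPI
height_px = 11 * 300    # 11 inches at 300 DPI
raw_bits = width_px * height_px  # one bit per pixel, by assumption

print(raw_bits / 8 / 1e6)  # ~1.05 raw megabytes per side
```

That lands right around the "about a megabyte" figure quoted above, which is why pushing to 1.44 MB, a floppy disk, is the hard part.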
Swyx [00:37:37]: We were also going to talk about your benchmarking, because you released your own benchmark that got some attention, thanks to some friends on the internet. What's the story behind your own benchmark? Do you not trust the open source benchmarks? What's going on there?

Nicholas [00:37:51]: Okay. Benchmarks tell you how well the model solves the task the benchmark is designed to solve. For a long time, models were not useful, and so the benchmark you tracked was just something someone came up with, because you need to track something. All of deep learning exists because people tried to make models classify digits and classify images into a thousand classes. There is no one in the world who cares specifically about the problem of distinguishing between 300 breeds of dog in an image that's 224 by 224 pixels. And yet this is what drove a lot of progress. People did this not because they cared about the problem, but because they wanted some way to measure progress. A lot of benchmarks are of this flavor: you construct a task that is hard, and you measure progress on the benchmark, not because you care about the problem per se, but because you know that progress on it correlates in some way with making better models. And that is fine when you don't actually want to use the models you have.

But when you want to actually make use of them, it's important to find benchmarks that track whether or not they're useful to you. What I was finding is that model after model after model would be released, each finding some benchmark it could claim state-of-the-art on and then saying, therefore, ours is the best. And that wasn't helpful to me in deciding whether I should switch to it. So the argument I tried to lay out in this post is that more people should make benchmarks that are tailored to them. What I did is write a domain-specific language that anyone can write for: you take tasks that you have actually wanted models to solve for you and put them into your own benchmark of the things you care about. Then, when a new model comes out, you benchmark it on the things you care about, and you know you care about them because you've actually asked for those answers before. If the model scores well, then you know that for the kinds of things you have asked models for in the past, it can solve them well for you. This has been useful for me: when another model comes out, I can run it and see whether it solves the kinds of things I care about. Sometimes the answer is yes, sometimes no, and then I can decide whether or not to use that model.

I don't want to say that existing benchmarks are not useful. They're very good at measuring the thing they're designed to measure. But in many cases, what they're designed to measure is not the thing I actually want to use the model for, and I expect that the way I want to use it is different from the way you want to use it. I would just like more people to have these things out there in the world. And the final reason is that it is very easy, if you want to make a model good at some benchmark, to make it good at that benchmark: you find the distribution of data you need, train the model on that distribution, and then you have a model that solves the benchmark well. By having a benchmark that is not very popular, you can be relatively certain that no one has tried to optimize their model for your benchmark. And I would like this to be...

Swyx [00:40:40]: So publishing your benchmark is a little bit...

Nicholas [00:40:43]: Contextualized. My hope in doing this was not that people would use mine as theirs.

Swyx [00:40:43]: You should make yours.

Nicholas [00:40:43]: Yes, you should make your own benchmark. If even a very small fraction of people, 0.1%, made a benchmark that was useful for them, that would still be hundreds of new benchmarks. I might not want to make one myself, but I might know that the kind of work I do is a little like this person's and a little like that person's, so I'll go check how a model does on their benchmarks, and I'll get a rough sense of what's going on. Because the alternative is the vibes-based evaluation thing, where you interact with the model five times on your toy questions and see if it worked. But five questions is a very small signal about whether it works for your use case. If you can automate running a hundred questions, it's a much better evaluation. So that's why I did this.
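For concreteness, here is a minimal sketch of the personal-benchmark idea: each test is a prompt plus a programmatic check of the model's answer. The `Task` structure, `ask_model` parameter, and the example task are illustrative stand-ins, not Carlini's actual domain-specific language:

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Task:
    prompt: str
    check: Callable[[str], bool]  # returns True if the answer is acceptable

def run_benchmark(tasks: list, ask_model: Callable[[str], str]) -> float:
    """Return the fraction of personal tasks a model solves."""
    passed = sum(task.check(ask_model(task.prompt)) for task in tasks)
    return passed / len(tasks)

# Tasks should come from questions you have genuinely asked before,
# not toy puzzles; this one is a placeholder.
tasks = [Task(prompt="Write a Python expression that evaluates to 2**10.",
              check=lambda ans: "1024" in ans or "2**10" in ans)]
```

When a new model ships, you point `ask_model` at it, rerun, and compare scores on the distribution of work you actually do.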
Swyx [00:41:37]: Yeah, I like the idea of going through your chat history and actually pulling out real-life examples. I regret to say that my chat history isn't used as much these days, because I'm using Cursor, the AI-native IDE. So your examples are all coding-related. The immediate question is, now that you've written the How I Use AI post, which is a little broader, are you able to translate all those things into evals? Are some things unevaluable?

Nicholas [00:42:03]: Right. A number of things I do are harder to evaluate. This is the problem with a benchmark: you need some way to check whether or not the output was correct. So the kinds of things I can put into the benchmark are the kinds of things that can be checked. You can check more things than you might think, if you do a little work on the back end. For example, for all of the code I have the model write, the benchmark runs the code and sees whether the answer is correct. Or in some cases, it runs the code, feeds the output to another language model, and the language model judges whether the output was correct. Is using a language model to judge here perfect? No. But what's the alternative? The alternative is to not do it at all. What I care about is whether the thing is broadly useful for the kinds of questions I have, so as long as the accuracy is better than roughly random, I'm okay with this. I've inspected the outputs of these judgments, and they're almost always correct; if you ask the model to judge these things in the right way, it's very good at telling. So yeah, I think this is a useful thing for people to do.
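Here is a minimal sketch of the two checking strategies just described: run the generated code and compare against an expected answer, and, where exact matching isn't possible, fall back to a second model as a judge. `ask_model` is a placeholder for whatever client you use, and the judging prompt is an illustrative assumption:

```python
import subprocess

def check_by_execution(code: str, expected: str) -> bool:
    """Run generated Python in a subprocess and compare its stdout."""
    out = subprocess.run(["python", "-c", code],
                         capture_output=True, text=True, timeout=30)
    return out.stdout.strip() == expected

def check_by_judge(question: str, answer: str, ask_model) -> bool:
    """Ask a second model to grade a free-form answer; imperfect but cheap."""
    verdict = ask_model(f"Question: {question}\nAnswer: {answer}\n"
                        "Reply with exactly YES if the answer is correct, "
                        "otherwise reply NO.")
    return verdict.strip().upper().startswith("YES")
```

Execution-based checks are preferable wherever they apply; the judge is the fallback, and as noted above it only needs to beat random to be worth having.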
Alessio [00:43:04]: You complain about prompting and being lazy, and how you don't want to tip your model or threaten to murder a kitten just to get the right answer. How do you see the evolution of prompt engineering? Even 18 months ago it was really hot and people wanted to build companies around it. Today the models are getting good. Do you think it's going to be less and less relevant going forward? What's the minimum viable prompt?

Nicholas [00:43:29]: Yeah, I don't know. I feel like a big part of making an agent is just a fancy prompt that calls back to the model again. I have no strong opinion; maybe it turns out this is really important, maybe it isn't. The only comment I was making is that oftentimes, when I use a model and find it's not useful and I talk to the people who helped make it, the answer they usually give me is, you're using it wrong. Which reminds me very much of the "you're holding it wrong" iPhone thing, right? I don't care that I'm holding it wrong. I'm holding it that way, and if the thing doesn't work for me, it's not useful for me. It may be that there exists a way to ask the model such that it gives me the correct answer, but that's not the way I'm asking. If I have to spend so much time thinking about how to frame the question that it would have been faster to just find the answer myself, it didn't save me any time. So oftentimes what I do is dump in whatever current thought I have, in whatever ill-formed way, and I expect the answer to be correct. And if the answer is not correct, in some sense maybe the model was right to give me the wrong answer, because I may have asked the wrong question, but I want the right answer anyway. Maybe the way to fix this is to have some default prompt that always goes into all the models, or something clever like that. It would be great if someone had a way to package this up; I think that's entirely reasonable. Maybe it turns out that as models get better, you won't need to prompt them as much. I just want to use the things that are in front of me.

Alessio [00:44:55]: Do you think that's a limitation of how models work? At the end of the day, you're using the prompt to steer the model in latent space. Do you think there's a way to make the prompt not really matter and have the model figure it out? What's the...

Nicholas [00:45:10]: I mean, you could fine-tune it into the model, for example. It seems like many recent models have done this: if you ask them a question, say computing an integral of some function, they'll say, let's think through this step by step, and then go through the step-by-step answer. I didn't tell it to. Two years ago I would have had to prompt it: think step by step on solving the following thing. Now you ask the question and the model says, here's how I'm going to do it, I'm going to take the following approach, and it sort of self-prompts itself.

Swyx [00:45:34]: Is this the right way?

Nicholas [00:45:35]: Seems reasonable. Maybe you don't have to do it; I don't know. That's for the people whose job is to make these things better. I just want to use them.

Swyx [00:45:43]: For listeners, that would be Orca and AgentInstruct. That's the SOTA on this stuff.

Alessio [00:45:49]: What about few-shot? Is that included in the lazy prompting? Do you do few-shot prompting, collect some examples to put in?

Nicholas [00:45:57]: I don't, because usually when I want the answer, I just want to get the answer.

Swyx [00:46:03]: Brutal. This is hard mode.

Nicholas [00:46:04]: Yeah, exactly. But this is fine. I want to be clear: there's a difference between testing the ultimate capability level of the model and testing the thing that I'm doing with it. What I'm doing is not exercising its full capability level, because there are almost certainly better ways to ask the questions that would really show how good the model is. If you're evaluating a model for being state of the art, that's ultimately what you care about, and I'm entirely fine with people doing fancy prompting to show what the true capability level could be; it's really useful to know the ultimate level of the model. But I think it's also important just to have available to you how good the model is if you don't do fancy things.
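Here is a minimal sketch of the "default prompt that always goes into all the models" idea floated above, wrapping every lazy, ill-formed query in a fixed preamble so the user never has to think about prompting. The preamble text and function name are invented for illustration:

```python
# Hypothetical fixed preamble; the wording here is an assumption,
# not a prompt from the conversation.
DEFAULT_PREAMBLE = (
    "The following question may be tersely or poorly phrased. "
    "Infer the intent, think step by step, and answer the intended question."
)

def lazy_ask(question: str, ask_model) -> str:
    """Prepend the default preamble to whatever ill-formed query comes in."""
    return ask_model(DEFAULT_PREAMBLE + "\n\n" + question)
```

As the exchange notes, instruction tuning is increasingly baking this behavior into the model itself, which may make such wrappers unnecessary over time.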
Swyx [00:46:39]: Yeah, I would say there's a divergence between how models are marketed these days and how people actually use them. When they test MMLU, they'll do five-shot, 25-shot, 50-shot. And no one's providing 50 examples.

Nicholas [00:46:54]: I completely agree. For these numbers, the problem is everyone wants to get state of the art on the benchmark, and so you find the way you can ask the model the questions so that you get state of the art. And it's legitimately good to know the model can do this thing if only you try hard enough, because it means that if I have some task I want solved, I know what the capability level is, and I could get there if I were willing to work hard enough. The question then is: should I work harder and figure out how to ask the model the question, or should I just do the thing myself? For me, I have programmed for many, many, many years; it's often just faster to do the thing than to figure out the incantation to ask the model. But I can imagine that someone who has never programmed before might be perfectly fine writing five paragraphs in English describing exactly what they want and having the model build it for them, if the alternative is not having it at all. But again, this goes back to all those questions of how they're going to validate it, and whether they should trust the output.

Swyx [00:47:49]: One problem with your eval paradigm, and most eval paradigms, I'm not picking on you, is that we're actually training these things for chat, for interactive back and forth. And you reveal much more information that way, in the same way that asking 20 questions reveals more information, in a tree-search, branching sort of way. This is also, by the way, the problem with the LMSYS arena, where the vast majority of prompts are single question, single answer, eval, done. But the way we actually use chat, even in the stuff you posted in your How I Use AI post, you have maybe 20 turns of back and forth. How do you eval that?

Nicholas [00:48:25]: Yeah, very good question. This is something I think many people should be doing more of: I would like more multi-turn evals. I might write a paper on this at some point, if I get around to it. A couple of the evals in my benchmark are already multi-turn. You mentioned 20 questions; I have a 20-questions eval in there, just for fun. But I have a couple of others that are like, I tell the model, here's my git repo, figure out how to cherry-pick this commit off this other branch and move it over here. And so what I do is basically build a tiny little agent-y thing: I ask the model what to do, I run it on Linux, and this is what I want Docker for. I spin up a Docker container, I run whatever commands the model tells me to run, I feed the output back into the model, and I repeat this for many rounds. Then I check at the very end: does the git commit history show that it was correctly cherry-picked?
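Here is a minimal sketch of that multi-turn, containerized eval loop, assuming Docker is installed, a container with the target git repo already exists, and `ask_model` wraps whatever client you use; the task description and the final verification command are illustrative assumptions:

```python
import subprocess

def sh(container: str, cmd: str) -> str:
    """Run a shell command inside the container and capture its output."""
    out = subprocess.run(["docker", "exec", container, "sh", "-c", cmd],
                         capture_output=True, text=True)
    return out.stdout + out.stderr

def run_multiturn_eval(ask_model, container: str, rounds: int = 10) -> bool:
    # Hypothetical task phrasing; real tasks come from your own history.
    transcript = "Task: cherry-pick commit abc123 from branch feature onto main.\n"
    for _ in range(rounds):
        cmd = ask_model(transcript + "\nReply with the next shell command only.")
        transcript += f"\n$ {cmd}\n{sh(container, cmd)}"
    # Final programmatic check: does the commit now appear on main?
    return "abc123" in sh(container, "git log --oneline main")
</ gr_code>
```

The container gives each run a throwaway, reproducible environment, so a model that wrecks the repo mid-eval costs nothing.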
In this fourth installment of our podcast series on key industry leaders in neuromodulation, we speak with Milad Girgis, VP and General Manager of the Brain franchise at Boston Scientific. Milad has dedicated over 25 years to the medical device industry, including two decades at Boston Scientific. Before diving into his impressive tenure at Boston Scientific, we explore the earlier phases of his career and identify key turning points that shaped his professional journey. Milad provides insights into what his typical day at Boston Scientific looks like. We also discuss the competitive DBS landscape and how his team creates unique selling points for their program. Milad shares exciting developments and future plans on the roadmap, giving us a glimpse into the innovations driving the industry forward. Finally, we illuminate Boston Scientific's approach to DBS, with its distinctive strategy and focus on image guidance, among other concepts. Our conversation also addresses the academic community's curiosity about transitioning to industry roles, with our guest presenting a compelling case for why a career in the medical device industry can be rewarding. Join us as we explore and map some new potential horizons for neuromodulation.
Join host Japhet De Oliveira as he sits down with Milad Pezeshki, Director of Outpatient Operations at Adventist Health Simi Valley, for a moving conversation about what language he dreams in, how his mother was an advocate for his education, the beauty found in complicated meals, and acknowledging risks but not fearing failure.
Send us a Text Message. Get your fermentation on with Sarah and The Shrub, covering kombucha! Also a little story (some might say rant...) from a recent weird event, and a spicy face-off between two shrubs.
***
- "Kombucha" (Wikipedia). https://en.wikipedia.org/wiki/Kombucha
- Zeratsky, Katherine. Mayo Clinic Consumer Health. https://www.mayoclinic.org/healthy-lifestyle/consumer-health/expert-answers/kombucha-tea/faq-20058126
- MacKeen, Dawn. "Are There Benefits to Drinking Kombucha?" The New York Times, 16 Oct. 2019. https://www.nytimes.com/2019/10/16/style/self-care/kombucha-benefits.html
- Maicas, Sergi. "The Role of Yeasts in Fermentation." Microorganisms, August 2020. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7466055/#:~:text=Upon%20a%20strictly%20biochemical%20point,by%20converting%20sugar%20into%20alcohol.
- Davani-Davari, Dorna; Negahdarjpour, Manica; Karimzadeh, Iman; Seifan, Mostafa; Mohkam, Milad; Masoumi, Seyed Jalil; Berenjian, Aydin; Ghasemi, Younes. "Prebiotics: Definition, Types, Sources, Mechanisms, and Clinical Applications." Foods, March 2019. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6463098/
- Grushkin, Daniel. "Meet the Woman Who Wants to Grow Clothing in a Lab." Popular Science, 17 Feb. 2015. https://www.popsci.com/meet-woman-who-wants-growing-clothing-lab/
- https://www.youtube.com/watch?v=D3Axb37lMWI
- https://www.youtube.com/@YouBrewKombucha
***
ig: @shrubbish_pod
email: shrubbishpodcast@gmail.com
While I want to bring levity to the table, this podcast does contain descriptions of substance abuse. If you or someone you know needs help, the SAMHSA (Substance Abuse and Mental Health Services Administration) hotline is available 24/7 at 1-800-662-HELP (4357).
For many people, Abraham remains a religious crisis, one that touches, attracts, and repels us to this day, but does not let us go, says Ahmad Milad Karimi, philosopher of religion and scholar of Islamic studies. Production: Alexandra Mantler – an ORF in-house production, broadcast on Ö1 on June 15, 2024.
Corporeality creates relationship, closeness, touch, community, and intimacy, says Ahmad Milad Karimi, philosopher of religion and scholar of Islamic studies. Production: Alexandra Mantler – an ORF in-house production, broadcast on Ö1 on June 14, 2024.
At the beginning of the human story stands fallibility, says Ahmad Milad Karimi, philosopher of religion and scholar of Islamic studies. Production: Alexandra Mantler – an ORF in-house production, broadcast on Ö1 on June 13, 2024.
Never before has the human being become as questionable in his own humanity as in the present, says Ahmad Milad Karimi, philosopher of religion and scholar of Islamic studies. Production: Alexandra Mantler – an ORF in-house production, broadcast on Ö1 on June 12, 2024.
Never before has the human being become as questionable in his own humanity as in the present, says Ahmad Milad Karimi, philosopher of religion and scholar of Islamic studies. Production: Alexandra Mantler – an ORF in-house production, broadcast on Ö1 on June 11, 2024.
When religion is politicized, it is falsified, says Ahmad Milad Karimi, philosopher of religion and scholar of Islamic studies. Production: Alexandra Mantler – an ORF in-house production, broadcast on Ö1 on June 10, 2024.
"Health is number one. Without energy, you can't start tackling everything." - Milad Rahnama Many people are seeking ways to improve their overall well-being. The key to a fulfilling life is finding a balance between faith, fitness, and finance. Individuals can transform their lives and achieve their full potential by focusing on these three pillars. In this episode, Milad Rahnama, a Rare Movement founder, shares his transformation and the philosophy behind his holistic approach to fitness. Milad shares his journey from rock bottom to rebuilding his life, addressing his health, finances, and spirituality along the way. He emphasizes taking ownership, removing temptations, and finding flow-state activities to replace unhealthy coping mechanisms. Milad also offers insights into managing emotional spending and the power of hiring coaches who have overcome similar challenges. Don't miss this inspiring episode on transforming your life from the inside out! Topic Cover: 1:38 - A Journey from Vice to Victory 3:38 - Emotions and the Illusion of Rational Financial Decisions 4:53 - Transformation Through Sobriety, Fitness, Finance, and Faith 6:21 - Owning Mistakes and Pursuing Transformation 8:48 - Finding Strength Through Adversity and Self-Reflection 12:26 - Healing Through Sharing and Self-Awareness 14:05 - Prioritizing Health for Energy and Life Balance 18:52 - Overcoming Gym Anxiety Through Outdoor Exercise and Meditation 19:30 - Embracing Incremental Progress and Honest Health Journeys 21:26 - Overcoming Addiction Through Eliminating Temptations 24:03 - Maintaining Momentum: The Challenge of Consistency 25:41 - Identity Transformation and Consistent Rituals Across Environments 29:20 - Transforming Health, Relationships, and Finances 32:32 - Balancing Emotional Spending With Financial Goals 33:08 - Transforming Lives Through Mindset and System Changes 33:53 - Strategizing Personal Finance for Life Enjoyment and Security 37:01 - Emotional Connections With Food and Consumer Choices 38:04 - Choosing Mentors Based on Their Lifestyles, Not Just Words 39:11 - Insights on Coaching and Overcoming Personal Struggles Key Takeaways: How to transform morning routines into personal empowerment Replace negative coping mechanisms with flow-state activities What are the hidden Impact of emotional spending Strategies to control emotional spending Connect with Milad Rahnama: Website Instagram Facebook Connect with Sherry Shaban: Website Facebook Instagram TikTok LinkedIn YouTube If you're struggling with self-sabotaging behavior and other non-serving habits that have been keeping you from hitting your health goals, I'd like to invite you to join me in Transformation in Paradise: Metamorphosis Greece this October 12–19, 2024, in Lefkada. RESERVE YOUR SPOT NOW! Visit www.tranformationinparadiseretreat.com for more details. If this sounds like something you'd like to learn more about, email me at sherry@sherryshaban.com and let's get in touch to go over all the details and answer your questions to determine if this retreat is the right fit for you. Keep it up, Athletes! Sherry
Hello, wonderful listeners! Today, we have the incredible Milad Rahnama with us. He's going to share his inspiring journey through the challenges of the financial world and personal struggles. Let's dive in! In the world of finance, success often comes hand in hand with stress. For 17 years, Milad navigated the tumultuous waters of the financial industry, striving to provide for his family while battling his own personal demons. Behind the facade of stability, he was trapped in a toxic marriage with a narcissist. The weight of the relationship took its toll, leading him down a dark path of drugs and alcohol. As his addiction spiraled out of control, he faced the terrifying possibility of losing custody of his children, the very reason he fought so hard to keep pushing forward. But amidst the chaos, a flicker of hope emerged. With unwavering determination, Milad clawed his way out of the depths of addiction, shedding his vices one by one. In the courtroom, he fought tooth and nail for his children, refusing to let them become collateral damage in the wreckage of his past. Emerging from the ashes of his former life, he found a new purpose. Drawing from his own experiences, Milad dedicated himself to helping other men break free from the chains of addiction, reclaim their physical health, and master their finances. As a divorced single father with shared custody, he knows firsthand the struggles of balancing parenthood with personal growth. For more insights and personalized coaching, connect with Milad on Instagram: https://instagram.com/alpha_fitnessdad?igshid=MTk0NTkyODZkYg%3D%3D&utm_source=qr Hosted on Acast. See acast.com/privacy for more information.
The upcoming EU election is considered a direction-setting vote. What do we appreciate about the European Union? Where does it frustrate us? What is missing? What kind of Europe do we want? We discuss this on the show with our guests and listeners. Tekin, Funda; Tabesch, Milad www.deutschlandfunkkultur.de, Im Gespräch
Here we delve into the essence of spiritual formation and the role of Bible studies in fostering a journey towards Christ-like transformation amidst the complexities of Lebanon's cultural mosaic. Host Elio Constantine is joined by Milad Nassar, Heart for Lebanon's South Ministry Center Field Manager and Hope Evangelical Church Ghazieh's Lead Pastor, to discuss the organization's unconditional approach to ministry. They explore how this approach transcends sectarian divides, offering love and dignity to all, irrespective of background or belief.You'll hear the inspiring testimonies of Idris, a Kurdish refugee who discovers the Bible as a source of love and peace, and Eliane, whose journey with Heart for Lebanon leads her to a life of discipleship, even beyond the borders of Lebanon.The episode culminates with the remarkable story of Milad and Claude—once on opposing sides of conflict, now united in Christ, demonstrating that true reconciliation and unity are possible through the transformative power of God's love.Join us as we explore the path from unconditional ministry to discipleship, and how Heart for Lebanon is breaking down barriers to build a family in faith. For those moved by the stories of hope and change, visit heartforlebanon.org to learn how you can contribute through prayer and engagement.Send your questions or comments to podcast@heartforlebanon.org.
Perfectionism. When you are on a path of growth and development, you are likely to set high standards for yourself and your business. This can be a good thing, but it can also become a problem. In this episode we talk about perfectionism with Milad Monshipour, co-founder and CEO of Tapsi, in honor of the inspiring and influential journey this project has taken from its launch until today. A journey that began behind a shared co-working desk and today employs more than a thousand people; hundreds of thousands of people rely on it as a first or second job, and millions of users have improved the quality of their city travel with Tapsi, all the way to becoming the first IPO in Iran's startup ecosystem. Episode forty-nine of the Amifar podcast is out. You can watch the video version of every episode on the Amifar YouTube channel and listen to the audio version on all podcast platforms under the name Amifar.
Milad Monshipour's Instagram page: https://www.instagram.com/miladmonshipour
Amifar podcast's Instagram page: https://instagram.com/amifarpodcast
Amifar podcast's YouTube channel: https://youtube.com/@amirhosseinfarzaneh
Welcome to the Voices Of Value Podcast with Collis Stutzer. GUEST Milad Ghasempour instagram.com/persianhalfblood PODCAST Voices of Value Podcast linktr.ee/voicesofvaluepodcast HOST instagram.com/coachstutzer PRODUCER NSPN linktr.ee/nosolutionsproductions --- Send in a voice message: https://podcasters.spotify.com/pod/show/voicesofvalue/message
Writer Nathan Thrall joins Kate Wolf to talk about his recent book, A Day in the Life of Abed Salama: Anatomy of a Jerusalem Tragedy, which was published last October and named one of the best books of the year by The New Yorker, The Economist, The New Republic and the Financial Times. It is an account of a horrific accident that took place in the outskirts of Jerusalem on a rainy day in 2012, when a school bus full of kindergarten students on their way to a class trip collided with a semi-trailer and caught on fire. Thrall follows the lives of a number of people who were directly impacted by the tragedy, delving into their pasts and the ways in which the decades-old conflict between Israel and Palestine has indelibly shaped their trajectories. Chief among them is Abed Salama, a Palestinian, and father of five-year-old Milad, who was a passenger on the bus. In looking closely at the material conditions of Salama's life, and the way they play out within the worst circumstances imaginable, Thrall evinces the toll of occupation in the most human of terms. Also, Kohei Saito, author of Slow Down: The Degrowth Manifesto, returns to recommend Naomi Klein's No Logo.
GUEST Milad Ghasempour instagram.com/mjghasem PODCAST Voices of Value Podcast linktr.ee/voicesofvaluepodcast HOST instagram.com/coachstutzer PRODUCER NSPN linktr.ee/nosolutionsproductions --- Send in a voice message: https://podcasters.spotify.com/pod/show/voicesofvalue/message