Podcasts about Elos

Place in Greece

97PODCASTS
147EPISODES
40mAVG DURATION
1MONTHLY NEW EPISODE
Jun 1, 2026LATEST

POPULARITY

20192020202120222023202420252026

Best podcasts about Elos

AmericanReef - Saltwater and Coral Reef Aquarium Advanced Aquarists Edition

4 episodes with Elos

Latent Space: The AI Engineer Podcast â€” CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

3 episodes with Elos

The Unlimited Spanish Podcast: Aprende español | Habla español | Learn Spanish | Speak Spanish | TPRS

2 episodes with Elos

Notícias Agrícolas - Podcasts

5 episodes with Elos

The AskHistorians Podcast

2 episodes with Elos

The Wailordz: A PokÃ©mon GO Podcast

3 episodes with Elos

Radio Runeterra

6 episodes with Elos

Lama Padma Samten

4 episodes with Elos

Notícias do Front

2 episodes with Elos

RADIO 11.11 432 HZ

2 episodes with Elos

Elos

10 episodes with Elos

Ooshin Podcast

4 episodes with Elos

Rivista di Psicoanalisi - Podcast

4 episodes with Elos

UFOP CAST

3 episodes with Elos

Kult?ras Rondo

2 episodes with Elos

Podcasts – Trucão Comunicação em Transporte

2 episodes with Elos

Momento de Oração com Dr. Bezerra de Menezes

2 episodes with Elos

Assunto Nosso

2 episodes with Elos

Latest podcast episodes about Elos

Polícia Civil aponta novos elos entre Deolane e o PCC

O Antagonista

Play Episode Listen Later Jun 1, 2026 18:17

Advogado ingressa com petição para cancelar registro da influenciadora digital; polícia chama Deolane de “caixa do PCC”.Você já leu uma notícia hoje e sentiu que já viveu esse momento antes? Essa sensação de déjà Vu não é coincidência. No Brasil, o que é manchete hoje costuma ser o eco de decisões e fatos que analisamos meses, ou até anos atrás. Para celebrar os 8 anos da Crusoé, decidimos enfrentar esse ciclo. Pegamos o que nasceu no digital e, pela primeira vez, transformamos em um registro físico, tátil e permanente. Chegou a edição especial Crusoé impressa. É um item colecionável, atemporal e limitado. Uma revista feita para quem gosta de ler com calma, longe das notificações do celular. Um exemplar para guardar sobre o que realmente importa na história recente do brasil. Esta edição é um presente exclusivo para novos assinantes do Combo de 2 anos O Antagonista e Crusoé. Utilize o cupom 8ANOSCRUSOE e acesse o link: https://bit.ly/crusoe-edicao-impressa Meio-Dia em Brasília traz as principais notícias e análises da política nacional direto de Brasília. Com apresentação de José Inácio Pilar e Wilson Lima, o programa aborda os temas mais quentes do cenário político e econômico do Brasil. Com um olhar atento sobre política, notícias e economia, mantém o público bem informado. Transmissão ao vivo de segunda a sexta-feira às 12h no nosso canal do Youtube. https://www.youtube.com/@OAntagonista Siga O Antagonista no X: https://x.com/o_antagonista Acompanhe O Antagonista no canal do WhatsApp. Boletins diários, conteúdos exclusivos em vídeo e muito mais. https://whatsapp.com/channel/0029Va2SurQHLHQbI5yJN344 Leia mais em www.oantagonista.com.br | www.crusoe.com.br #Policia #Deolane #PCC #investigacao #elos #podcast #noticias #crime #justica #seguranca #atualidades #debate #informacao #trending #viral #repercussao #analise #midia #urgente #operacao

debate crime whatsapp brasil viral uma voc civil trending bras no brasil utilize chegou novos combo pilar vu pcc urgente advogado transmiss pegamos justia atualidades elos cruso meio dia deolane segurana wilson lima o antagonista

Frazão: Nunes Marques é mais um ministro com elos com o Master

Colunistas Eldorado Estadão

Play Episode Listen Later Mar 19, 2026 10:43

Felipe Frazão, repórter do Estadão em Brasília, aborda as principais notícias do dia na política brasileira, na coluna 'Sua Política', às terças e quintas, 8h30.See omnystudio.com/listener for privacy information.

bras ministro estad mais um fraz elos nunes marques o master felipe fraz

A14MOD68_20260218 - Os cinco elos que nos unem a Deus.

Mensagens da Graça de Deus

Play Episode Listen Later Feb 19, 2026 44:14

Tema – Os cinco elos que nos unem a Deus. Quarta – Feira 18/02/2026 Pregador : Ap. Miguel Ângelo Neste episódio, estudamos Romanos 8:29-30 e explicamos a predestinação bíblica, o chamado eficaz e a segurança eterna da salvação. Você pode perder a salvação? O que significa “aos que de antemão conheceu”? Deus escolhe quem será salvo? Neste estudo profundo sobre os cinco elos da salvação — conhecimento prévio, predestinação, chamado, justificação e glorificação — entendemos que a salvação é obra da graça de Deus e não depende do esforço humano. Se você deseja compreender: • Predestinação segundo a Bíblia • Segurança eterna do cristão • Doutrina da graça • Salvação pela graça em Efésios 2:8 • Soberania de Deus na salvação Este episódio é para você. Aula 14 Módulo – 68 Seminário: Construindo uma casa sobre a Rocha

jesus christ gospel amor neste poder voc cristo vive cinco santo deus paz espiritual rocha medo salva seguran romanos palavra alegria aula tristeza ef felicidade falso evangelho profeta religioso construindo semin soberania louvor doutrina quarta feira reli elos apostolica tema os

La formulazione del caso clinico e il P.D.M.3. Davide Elos dialoga con Rachele Mariani

Rivista di Psicoanalisi - Podcast

Play Episode Listen Later Feb 11, 2026 23:27

In questo episodio del Podcast della Rivista di Psicoanalisi dialogheremo con la Dottoressa Rachele Mariani, redattrice della rubrica Ricerca e Metodo.Rachele Mariani ci parlerá del rapporto fra Psicoanalisi e Ricerca utilizzando la formulazione del caso clinico come fil rouge del suo discorso. Si discuterá del P.D.M.3 e si accennerà ai due contributi presenti nella sezione Ricerca e Metodo del numero 4/2025 della Rivista di Psicoanalisi. In questi due lavori la Dottoressa Muzi e la Dottoressa Tanzilli illustreranno l'applicazione del P.D.M.3 a due differenti casi clinici.

caso davide ricerca metodo mariani rivista dialoga elos clinico psicoanalisi

Dialoghi Aperti: il rapporto fra Psicoanalisi e Psichiatria: Davide Elos intervista Fabrizio Pavone

Rivista di Psicoanalisi - Podcast

Play Episode Listen Later Feb 8, 2026 35:09

In questo episodio del Podcast, dedicate alla Rubrica Dialoghi Aperti, converseremo con il Dott. Fabrizio Pavone sul rapporto fra Psicoanalisi e Psichiatria. Il rapporto fra Psicoanalisi e Psichiatria é stato caratterizzato, fin dall'inizio, da una oscillazione fra collaborazione e dialogo costruttivo da una parte e rivalità e sospettosità reciproca dall'altra. Fabrizio Pavone svilupperà questo tema attraverso cenni storici e riflessioni sul rapporto fra le due discipline.

davide intervista fabrizio dott rapporto pavone dialoghi psichiatria elos psicoanalisi

Karl Abraham. Davide Elos dialoga con Alessandra Balloni

Rivista di Psicoanalisi - Podcast

Play Episode Listen Later Jan 25, 2026 19:23

In questo episodio del Podcast della Rivista di Psicoanalisi, dialogheremo con Alessandra Balloni sulla figura Karl Abraham nella ricorrenza del centenario della sua prematura morte avvenuta il 25 dicembre del 1925. Alesssandra Balloni evidenzia il contributo di Karl Abraham allo sviluppo del moviemnto psicoanalitico e ne trattegia, anche attraverso il racconto di alcuni aneddotti, il rapporto con Freud.

freud davide rivista dialoga elos psicoanalisi

Il P.D.M. 3. Davide Elos intervista Vittorio Lingiardi.

Rivista di Psicoanalisi - Podcast

Play Episode Listen Later Jan 5, 2026 45:36

In questo episodio del Podcast il Professor Vittorio Lingiardi ci illustrerà gli aspetti innovativi presenti nella terza edizione del Manule Diagnostico Psicodinamico da poco pubblicato per l'editore Raffaello Cortina.Si parlerà della necessità di un dialogo fra il P.D.M III e gli altri sistemi diagnostici.Si tratterà inoltre dell'importanza data al tema del corpo, della diagnosi nelle diverse fasi del ciclo di vita (con una particolare focalizzazione sull'adolescenza) e delle condizioni non patologiche che richiedono attenzione clinica.Infine si discuterà dell'influenza dell'attuale contesto storico e sociale sulla genesi della sofferenza psichica.

davide intervista infine vittorio elos raffaello cortina

Aistringoji „Sarsuela“ – gruodžio 13-ąją Alytaus kultūros centre

FM99 radijo podcast'as

Play Episode Listen Later Dec 3, 2025 10:27

Gruodžio 13 d. Alytaus kultūros centre žiūrovai kviečiami pasinerti į pietietišką muzikos ir šokio energiją – Lietuvos nacionalinis operos ir baleto teatras pristatys ispaniškos muzikos operetę „Sarsuela“. Tai žanras, kuriame jungiami vokaliniai kūriniai, šnekamieji dialogai ir aistringo šokio elementai, o scena tampa tarsi Ispanijos taverna, kur šventė nesibaigia. Apie tai FM99 kalbame su dainininke Monika Pleškyte, kuri operetėje atlieka Elos vaidmenį.

kult tai apie lietuvos elos ispanijos alytaus gruod fm99

Golpistas em cana e os elos políticos do rei da sonegação | Fechamento Carta

Carta Podcast

Play Episode Listen Later Nov 28, 2025 62:12

Jair Bolsonaro e mais quatro fardados começaram a cumprir pena por terem tentado dar um golpe na democracia brasileira. É inédito na história do País. Agora falta saber se o Superior Tribunal Militar (STM) vai tirar o posto e a patente do quinteto, uma das consequências possíveis da condenação no Supremo Tribunal Federal (STF). A extrema-direita luta para livrar a cara dos golpistas com uma lei de anistia no Congresso e alimenta esperanças diante da má fase na relação dos presidentes do Senado, Davi Alcolumbre, e da Câmara, Hugo Motta, com o governo. Alcolumbre e Motta boicotaram um evento com Lula de sanção da isenção de imposto de renda para salários de até 5 mil por mês e comandaram a derrubada de vetos presidenciais a uma lei que afrouxa o licenciamento ambiental.Enquanto isso, um senador bolsonarista das sombras tem motivo para ficar arrepiado com uma nova operação contra o crime organizado. É Ciro Nogueira, chefe do PP e ex-ministro da Casa Civil de Bolsonaro. O número 2 dele dele na Casa Civil, Jhonatas Assunção, foi alvo da Receita Federal, do fisco paulista e do Ministério Público do Estado de São Paulo em uma operação que mira uma refinaria do Rio, a Refit. O dono da refinaria é amigo de Nogueira e um dos maiores sonegadores do Brasil, Ricardo Magro.Esses são os principais temas da edição desta quinta-feira do Fechamento, com a participação dos jornalistas Mariana Serafini, Mauricio Thuswhol e André Barrocal.

A ANÁLISE DOS POSSÍVEIS TREINADORES DO SPORT PARA 2026 E OS ELOS QUE TORNAM SEABRA FAVORITO

Podcast 45 Minutos

Play Episode Listen Later Oct 30, 2025 64:52

Fred Figueroa, Cassio Zirpoli e Kauê Diniz comentam tudo sobre a fase do Sport e o mercado de treinadores disponíveis ao alcance do Leão.

sports poss favorito tornam treinadores seabra elos

Il Dialogo fra Arte e Psicoanalisi. Davide Elos intervista Sonia DeCristofaro

Rivista di Psicoanalisi - Podcast

Play Episode Listen Later Oct 7, 2025 18:19

In questo episodio del Podcast della Rivista di Psicoanalisi, Sonia De Cristofaro ci introduce, al dialogo fra l'opera di Louise Bourgeois, Francis Bacon, Alberto Giacometti ed Anselm Kiefer, quattro figure emblematiche dell'arte figurativa contemporanea, e tre psicoanalisti, Eleonora Natoli, Massimiliano Sommantico ed Andrea Bocchiola, Sonia Decristofaro delinea la sua concezione di dialogo e confronto fra le discipline in termini non di interpretazione, ma di apertura all'alterità.Sonia DeCristofaro è psicologa e psicoanalista, membro associato S.P.I. e I.P.A.Inizia la sua carriera clinica fra pubblico e privato occupandosi di disturbi del comportamento alimentare e di doppia diagnosi e tossicodipendenze in ambito pubblico. Dal 2015 lavoro solo in studio con adolescenti ed adulti. I suoi interessi principali vertono intorno ai temi del trauma collettivo, dei disturbi e delle varianze dell'identità di genere, della psicosomatica e degli aspetti non organici delle malattie autoimmuni. E' redattrice della sezione Dialoghi Aperti per la Rivista di Psicoanalisi.La sezione Dialogo fra Arte e Psicoanalisi è pubblicata sul numero 3/2025 della Rivista di Psicoanalisi.

arte dal davide intervista inizia francis bacon dialogo louise bourgeois rivista anselm kiefer alberto giacometti elos psicoanalisi

"Inibizione, sintomo e angoscia". Davide Elos intervista Alessandra Balloni.

Rivista di Psicoanalisi - Podcast

Play Episode Listen Later Sep 23, 2025 14:51

In questo episodio del Podcast converseremo con la nuova Redattrice della sezione "Note storico critiche" della Rivista di Psicoanalisi: la Dott.ssa Alessandra Balloni.Il testo di cui discuteremo è "Inibizione, sintomo e angoscia" scritto da Freud nel 1925 e pubblicato nel 1926. Alessandra Balloni ci illustrerà gli snodi cruciali ed i punti innovativi, rispetto alle precedenti teorizzazioni sull'angoscia, presenti in questo fondamentale lavoro della produzione freudiana.Ci sofferemeremo inoltre su alcuni importanti accadimenti, avvenuti nella vita di Freud nel 1925 e su come tali eventi abbiano influenzato la scittura del testo da parte dell'autore.Alessandra Balloni è psicoanalista, socio ordinario della Società Psicoanalitica Italiana e Full memeber dell'International Psychoanalitycal Association. Nel passato quadriennio ha ricoperto la carica di Segretario Scientifico del Centro di Psicoanalisi Romano e nel quadriennio appena iniziato è redattrice della Rivista di Psicoanalisi.

centro freud nel davide intervista dott rivista elos psicoanalisi angoscia

Libros para Copywriters – Léelos antes de septiembre

Copymelo

Play Episode Listen Later Aug 18, 2025 3:38

Hoy te recomiendo varios libros de copywriting y negocio que sí o sí has de leer antes de septiembre para estar preparado para la nueva temporada.Pero antes, recuerda que, si todavía no te has apuntado, puedes unirte ahora mismo a la lista de correo Press Start y recibir cada día un nuevo consejo de ventas

pero antes libros copywriters press start elos

Il Dialogo fra Ricercatori e Clinici. Davide Elos intervista Giulio Cesare Zavattini e Anna Daniela Linciano

Rivista di Psicoanalisi - Podcast

Play Episode Listen Later Jul 14, 2025 66:24

In questo episodio del podcast che fa riferimento al Focus dal titolo "Il dialogo fra ricercatori e clinici. Le teorie private dei pazienti e degli analisti" pubblicato sul volume 2/2025 della Rivista di Psicoanalisi intervistiamo il Professor Giulio Cesare Zavattini e la Dottoressa Daniela Linciano.Nella prima parte del podcast Giulio Cesare Zavattini discute ed argomenta l'importanza del dialogo fra psicoanalisti e ricercatori per sostenere ed alimentare la vitalità della Psicoanalisi.Nella seconda parte dell'episodio Daniela LInciano ci illustra le tappe, gli strumenti, la composizione del gruppo di lavoro ed i risultati del progetto di ricerca multicenrico italo-svedese BEP (Beginnings and endings in Psychoanalysis and Psychoanalytic Psichoterapy).Giulio Cesare Zavattini Psicoanalista della Società Psicoanalitica Italiana (SPI) e dell'International Psycho-Analytical Association (IPA); Membro del Tavistock Relationships Association of Psychotherapists and Counsellors (TRAPC)); Membro dell'International Association of Couple and Family Psychoanalysis (IACFP). già Professore Ordinario di “Psicopatologia delle relazioni di coppia e genitoriali” e Docente di ‘Psicoterapia psicoanalitica della coppia' nella Specializzazione in Psicologia clinica', Facoltà di Medicina e Psicologia, Sapienza Università di Roma. Fa parte della Redazione di ‘Interazioni'; ‘Terapia familiare'; del Comitato dei Garanti del ‘Giornale Italiano di Psicologia'. E' membro dell'International Advisory Board del Journal of Couple and Family Psychoanalysis. Ha fatto parte della Redazione di ‘Rivista di psicoanalisi' negli anni 2020- 2024 dove si è occupato della Sezione di Ricerca e metodo insieme a Daniela Rao. Attualmente fa parte del Comitato di Lettura di Rivista di Psicoanalisi per il quadriennio che è ora iniziato I suoi interessi scientifici riguardano sul piano della ricerca empirica la regolazione delle emozioni, l'attaccamento adulto e le relazioni sentimentali, le relazioni genitori e figli adottivi; sul piano della ricerca clinica si è occupato dello studio dei modelli psicoanalitici, l'adolescente e la sua famiglia, la psicoanalisi di coppia e l'uso clinico del sogno. Pubblicazioni recenti Zavattini G.C. (2025). Per un futuro good enough della Psicoanalisi nell'Università. Rivista di psicoanalisi, 7(1): 197- 202. DOI: 10.26364/RPSA 20250710197 * Ammaniti M, Zavattini G.C. (2024). Nino Dazzi, la psicoterapia e psicoanalisi. Giornale Italiano di Psicologia, LI(4): 871-876. DOI: 10.1421/115020 Ballarotto G., Zavattini G.C. (2024). La dipendenza dai videogiochi e dalla pornografia online in adolescenza: risposta al dibattito emerso. Psicologia clinica dello sviluppo, XXVII(3): 471-476. Bruno S., Velotti P., Zavattini G.C. (2024). Lo studio della regolazione emotiva nella pratica clinica: dal modello teorico all'impatto sociale. Giornale Italiano di Psicologia, 1:27-46. DOI: 10.1421/112887. Monguzzi F., Zavattini G.C. (2023). The Intersubjective function of the dream in the psychoanalytic couple psychotherapy. Journal of Couple and Family Psychotherapy, 13(2): 152-165. * Velotti P., Zavattini G.C. (2023). ‘Teorie dell'azione terapeutica. Tra interpretazione e relazione', Il Mulino, Bologna. * Velotti P., Rogier G., Beomonte Zobel S., Chirumbolo A., Zavattini G.C. (2022). The Relation of Anxiety and Avoidance Dimensions of Attachment to Intimate Partner Violence: A Meta-Analysis About Perpetrators. Trauma, Violence, and Abuse, 23, (1): 196-212. DOI 10.1177/1524838020933864 Pace C.S., Muzi, S., Senso, A., Zavattini, G.C., Madera, F. (2022), Can the Family Drawing be a useful tool for assessing attachment representations in children? A systematic review and meta-analysis, Attachment & Human Development, 24(4): 477-502 DOI: 10.1080/14616734.2021.1991664, de Campora G., Zavattini G.C. (2021). Mindfulness and Eating Disorders across Lifespan. Assessment and Intervention Across the Lifespan. Routdlege, London pp. 1-120. DOI: 10.4324/9781003154266 Velotti P., Zavattini G.C. (2019). È ancora attuale l'uso del sogno nella pratica clinica? Giornale italiano di Psicologia, 3:433-462, DOI: 10.1421/94505 * Anna Daniela Linciano è medico, psichiatra e psicoanalista. Membro Ordinario della Società Psicoanalitica Italiana (SPI) e Full Member della International Psychoaanalytical Association (IPA). Vincitrice del premio Tyson 2011 dell'International Psychoanalytical association (IPA). È stata membro di Redazione della Rivista di Psicoanalisi nel quadriennio 2012-2016 e membro del Comitato di Lettura della stessa Rivista quadriennio che si è appena concluso. Ha partecipato e partecipa tutt'ora a diversi gruppi di studio e di ricerca presso il Centro Milanese di Psicoanalisi. È stata nel 2014 Co-Researcher di un progetto di ricerca, che ha ricevuto un grant IPA, sull'integrazione delle terapie psicoanalitiche agevolate con i servizi pubblici e successivamente. Nel 2016 Principal Investigator (PI) del gruppo di ricerca BEP, che pure ha vinto un secondo Grant IPA ed i cui sviluppi e risultati sono oggetto del Focus dedicato alla ricerca dell'ultimo numero in uscita della Rivista di Psicoanalisi.

anxiety trauma focus mindfulness journal violence couple abuse roma fa li eating disorders attachment medicina relation psychotherapists nella bologna terapia nel tra ipa psicologia davide human development intervista lifespan doi docente psychoanalysis senso ricerca madera xxvii membro dialogo psicoterapia lettura teorie comitato attualmente bep garanti facolt giornale rivista giulio cesare redazione bruno s il mulino muzi sezione elos ricercatori full member psicopatologia psicoanalisi professore ordinario specializzazione sapienza universit interazioni intersubjective pubblicazioni

54° Congresso I.P.A. di Lisbona: Psychoanalysis an anchor in chaotic times. Davide Elos intervista Antonella Sessarego

Rivista di Psicoanalisi - Podcast

Play Episode Listen Later Jun 29, 2025 17:11

QUESTO E' IL PRIMO EPISODIO DEL PODCAST DELLA RIVISTA DI PSICOANALISI CURATO DALLA NUOVA REDAZIONE.IN QUESTO EPISODIO DAVIDE ELOS DIALOGA CON ANTONELLA SESSAREGO, NUOVO DIRETTORE DELLA RIVISTA.IL DIALOGO VERTE SUI QUATTRO LAVORI CHE VERRANNO PRESENTATI NELLE SESSIONI DI APERTURA DEL 54° CONGRESSO I.P.A. CHE SI TERRA' A LISBONA DAL 30 LUGLIO AL 2 AGOSTO PROSSIMI: "TRADIZIONE E CAMBIAMENTO DELLA TEORIA PSICOANALITICA: INTERROGARE L'INFANTILE" DI STEPHEN SELIGMAN; ""ESTENSIONI DEL NARCISISMO:PSICOANALISI GUERRA E CLIMA" DI ELISARDO CESAR MEREA; "STORIE DEL TRIANGOLO.RIFLESSIONI SUGLI ELEMENTI ESSENZIALI DELLA PSICOANALISI ATTRAVERSO LE CULTURE" DI THOMAS PLAENKERS; "INCONTRI TRASFORMATIVI: UN PONTE TRA CYBERSPAZIO E SPAZIO ANALITICO" DI SUN JU CHUNG.QUESTI QUATTRO LAVORI SARANNO PUBBLICATI IN ANTEPRIMA SUL NUMERO 2/2025 DELLA RIVISTA DI PSICOANALISI.ANTONELLA SESSAREGO E' PSICHIATRA, PSICOANALISTA, MEMBRO ORDINARIO S.P.I. E FULL MEMEBER I.P.A..

anchor chaotic davide intervista congresso psychoanalysis lisbona psicoanalista elos

Protetesti Losandželosā. Masks un Tramps šķiras

Divas puslodes

Play Episode Listen Later Jun 11, 2025 54:10

Kamēr Latvijā uzmanības lokā pašvaldību vēlēšanas, pasaules medijos pirmajās rindās ir ziņas par Amerikas Savienotajām Valstīm. Turpinām kolēģu aizsākto sarunu par Kaļiņingradu, runājot par drošības situāciju un apgabalu kopumā. Nedaudz to aplūkojam Ukrainā notiekošā kara kontekstā. Aktualitātes analizē politologs Veiko Spolītis un ārpolitikas pētnieks Rinalds Gulbis. Par Kaļiņingradas apgabalu saruna ar politologu, politiķi no Ukrainas Ihoru Ždanovu, viņš ir fonda „Atklātā politika” projekta vadītājs, bijis arī Ukrainas Jaunatnes un sporta lietu ministrs. Donalds Bargais Protesti Losandželosā uzliesmoja 6. jūnijā pēc tam, kad ASV Imigrācijas un muitas policijas darbinieki veica reidus pilsētas tirdzniecības vietās, arestējot apmēram simtu tur strādājošo, kuri, iespējams, uzturas valstī nelegāli. Protesti sākās pie aizturēšanas centra ēkas, kur policija aizturēja 44 cilvēkus. Cita starpā tika arestēts Starptautiskās apkalpošanas sfēras darbinieku asociācijas Kalifornijas nodaļas prezidents Deivids Uerta, kurš it kā kavējis automašīnas pārvietošanos. Tiek ziņots, ka viņš guvis salīdzinoši vieglu traumu; pirmdien viņš tika atbrīvots pret drošības naudu. Jau tajā pašā dienā notika sadursmes starp protestētājiem un policiju, kur protestētāji esot metuši dažādus priekšmetus, kamēr policija izmantojusi asaru gāzi, piparu aerosolu, žilbinošās granātas, bet vakarpusē likumsargiem bija atļauts lietot arī neletālu munīciju. Brīvdienās protesti vērsās plašumā, pie kam notika vairāki mēģinājumi bloķēt ielas, atsevišķu automašīnu aizdedzināšana, daži vandālisma un veikalu aplaupīšanas gadījumi. Protestētāji apmētāja policistus un policijas auto ar dažādiem priekšmetiem, daži policisti, protestētāji un arī protestus atspoguļojošie žurnālisti guva vieglus ievainojumus. Tomēr daudzviet protesti notika mierīgi, un situācija pilsētā, pēc visa spriežot, nebija tāda, lai ar to netiktu galā vietējie policijas spēki. Taču jau 7. jūnijā Savienoto Valstu prezidents Donalds Tramps izdeva rīkojumu nosūtīt uz Losandželosu 2100 Nacionālās gvardes karavīru. Pie tam viņš to darīja, nesaskaņojot ar Kalifornijas pavalsts gubernatoru, Demokrātu partijas pārstāvi Gevinu Ņūsomu. Kaut prezidentam ir tādas pilnvaras, šādi Savienoto Valstu vadītājs pēdējoreiz rīkojies 1965. gadā. Tāpat tika ziņots, ka gatavībā nosūtīšanai uz pilsētu esot septiņi simti jūras kājnieku. Jau drīz pēc tam, kad šīs ziņas parādījās atklātībā, prezidenta rīcības adekvātums tika apšaubīts. Pirmdien, 9. jūnijā, gubernators Ņūsoms iesniedza prasību tiesā, pieprasot atzīt Trampa pavēli par Nacionālās gvardes izmantošanu par nelikumīgu; attiecīgā sēde paredzēta 12. jūnijā. Tikām gan prezidents Tramps, gan citi Baltā nama pārstāvji izvērsa agresīvu retoriku, gānot un vainojot protestu uzliesmošanā gubernatoru, kā arī Losandželosas mēri Karenu Basu. Viens no spilgtākajiem prezidenta citātiem bija: „Ja viņi spļaus, mēs sitīsim.” Proti, viņš vainoja demonstrantus, ka tie spļaujot uz likumsargiem, un attiecīgi piedraudēja, ka protestētāji tikšot sisti tā, kā nekad agrāk. Nu jau nacionālo gvardu skaits Losandželosā pārsniedzis četrus simtus, un vakar, kā ziņo, tur ieradušies arī solītie jūras kājnieki. Militārpersonas gan lielākoties bez kāda pielietojuma uzturoties federālo institūciju ēkās. Par spīti aizliegumiem ļaudis pirmdien un otrdien turpināja pulcēties protestos, kuri gan ātri tika izklīdināti. Daudzi uzlūko Trampa administrācijas rīcību Losandželosā kā paraugu varas spiediena politikai. Divu vīru kaismīga šķiršanās Par to, ka divām tik spēcīgām un ambiciozām personām kā Donalds Tramps un Īlons Masks nenāksies viegli sastrādāties, runāja jau teju kopš pasaules bagātākais cilvēks kļuva par pusoficiālu Baltā nama darbinieku. Šie signāli kļuva intensīvāki pagājušā mēneša laikā, Maskam paziņojot, ka viņš distancēsies no administrācijas un turpmāk mazāk ziedos līdzekļus politikas atbalstam. Taču otrdien ārup izlauzās acīmredzot ilgi slāpēta nepatika, kad multimiljardieris izteica atklātu un skarbu kritiku par šobrīd Kongresā virzīto Trampa administrācijas budžeta projektu, nodēvēdams to par „pretīgu draņķību”. Tā viņš pievienojās daudzo prezidenta kritizētāju pulkam, kuri pamatoti norāda, ka projekts paredz nodokļu atlaides bagātajiem, tajā pašā laikā liedzot daudziem maznodrošinātajiem līdzšinējo valsts apmaksāto pieeju medicīnas pakalpojumiem. Nākamajās dienās eskalācija tikai vērsās plašumā, iesaistoties ļaudīm gan no prezidenta, gan finanšu magnāta puses. Tramps piedraudēja, ka varētu atņemt Maska uzņēmumiem valdības kontraktus un subsīdijas. Sevišķi sāpīgi tas būtu kosmosa izpētes programmai „Space X”, kas lielā mērā balstās valdības finansējumā. Bet Masks nepalika atbildi parādā, norādot, ka bez viņa nodrošinātās tehnikas valdībai būs problemātiski, piemēram, nodrošināt astronautu nogādāšanu uz Starptautisko kosmosa staciju un atpakaļ. Draudi pārtraukt šo tehnisko atbalstu gan tika drīz atsaukti, taču turpinājumā multimiljardieris paziņoja, ka prezidents esot minēts 2019. gadā cietumā mirušā uzņēmēja un bērnu seksuālā izmantotāja Džefrija Epsteina lietas materiālos. Tā bija abu dižvīru konflikta kulminācija, kas tika sasniegta 5. jūnijā. Piektdien, 6. jūnijā, Masks paziņoja, ka varētu nodibināt jaunu politisku partiju, kas kļūtu par trešo spēku starp demokrātiem un republikāņiem. Tikām viņa konflikts ar Baltā nama saimnieku sāka iespaidot Maska finanšu stāvokli, kompānijas „Tesla” akcijām noslīdot par 15% procentiem, respektīvi, zaudējot apmēram 150 miljardus savas vērtības. Kopš pagājušās nedēļas nogales procesā gan iestājies zināms atslābums. Pirmais, kurš no eskalācijas pārgāja uz remdenāku toni, bija prezidents, izsakoties, ka neņemot ļaunā privāti vērstos uzbrukumus, tikai nožēlojot, ka Īlons nav praties ātrāk atbrīvoties no administratīvā darba sloga. Šīs nedēļas sākumā preses virsraksti jau vēsta par konflikta pierimšanu, lai gan no agrākās divu diževīru draudzības maz kas palicis pāri.

tesla protests spacex masks ka pie milit cita kam iras tik viens aktualit turpin trampa proti ukrain kop tramps balt kongres jau nacion maska sevi latvij kaut tiek elos demokr atkl pirmais protesti daudzi divu starptautisk valst starptautisko piektdien amerikas savienotaj losand veiko spol

Gripe Aviária: Plano de contingência no RS está em andamento para todos os elos da cadeia avícola após confirmação de caso em granja comercial em Mont

Notícias Agrícolas - Podcasts

Play Episode Listen Later May 20, 2025 21:11

Gripe Aviária: Plano de contingência no RS está em andamento para todos os elos da cadeia avícola após confirmação de caso em granja comercial em Montenegro

caso cola plano rs para todos mont montenegro comercial confirma gripe cadeia granja andamento conting elos

Tirando a Fallar: Tiempo de playoff en la Euroliga, la NBA y la LF Endesa y la vida de Urbano González con ELA

Tirando a Fallar

Play Episode Listen Later Apr 20, 2025 66:47

Repaso a la llegada del momento decisivo en la Euroliga, la NBA y la LF Endesa y entrevista con Urbano González, exjugador profesional enfermo de ELA. José Manuel Puertas entrevista esta semana a Urbano González, exjugador profesional de baloncesto y una de las caras visibles de la ELA en España. El que fuera miembro del histórico Elosúa León explicará detalles sobre su vida actual, postrado en una cama y necesitado de un respirador, comunicándose a través de una tablet gracias a sus ojos, pues ya ha perdido la capacidad de hablar. También escucharemos a su esposa y principal cuidadora, Carlota Amigo. Tras el duro testimonio de González y su mujer, tiempo para la actualidad del baloncesto. Junto a Lucas Sáez Bravo conoceremos cómo llegan al playoff de la Euroliga el Real Madrid y el Barça, después de que los blancos se colaran con muchas dificultades a través del play-in. Han comenzado también los playoffs en la NBA, con Anthony Edwards colocándose el traje de villano a la primera de cambio. Edu Salán, Anastasio Ríos y John Vázquez repasarán las claves de las intensas semanas que vienen por delante en la pelea por el anillo. Finalmente, la Liga Femenina Endesa ha visto como la temporada regular ha terminado, con el descenso in extremis del Celta. Andrea Blez comentará el cierre liguero, la llegada de las eliminatorias por el título, y la última polémica entre César Aneas y el Spar Gran Canaria.

nba playoffs espa tambi bar bravo tras finalmente junto ela real madrid gonz anthony edwards repaso urbano la nba celta tirando fallar endesa euroliga john v elos liga femenina endesa lucas s

EP322 A school framework where learning feels different and students thrive

Angela Watson's Truth for Teachers

Play Episode Listen Later Apr 6, 2025 32:39

What if school was designed around student agency, real-world learning, and deep relationships—instead of compliance and test scores? In this episode, I take you inside Fannie Lou Hamer Freedom High School, a Big Picture Learning school in the South Bronx that is reimagining what education can be. I spent two days observing classes, talking with students and teachers, and experiencing firsthand how this school operates within the constraints of the public school system while centering student voice and engagement. You'll hear about: How Fannie Lou's advisory system, looping, and block scheduling create a close-knit, supportive learning environment Why students don't take standardized tests and instead defend portfolios of work How extended learning opportunities (ELOs) and internships connect students to real-world experiences The role of AI and technology in supporting student-driven, competency-based learning What it actually looks like to make relationships the foundation of a school Fannie Lou's approach isn't a magic bullet or an exclusive model—it's a public school choice within NYC's Department of Education that any student can opt into. Listen in to hear what's possible when we trust students, empower teachers, and rethink what high-quality learning truly looks like. Get the shareable article/transcript for this episode here.

learning ai new york city school education students thrive framework south bronx elos big picture learning

FRUTOS DA EXTENSÃO: CONSOTIUM - Cooperação Intermunicipal para o Desenvolvimento do Lazer e do Esporte na Região dos Inconfidentes

UFOP CAST

Play Episode Listen Later Mar 13, 2025 8:19

O projeto “CONSOTIUM - Cooperação Intermunicipal para o Desenvolvimento do Lazer e do Esporte na Região dos Inconfidentes”, vinculado ao programa extensionista ELOS, tem como objetivo apoiar a criação de um consórcio intermunicipal de lazer e esporte na região dos Inconfidentes. Foi formado um grupo de representantes das secretarias de lazer das três cidades, que elaboraram um protocolo de intenções para a implementação do consórcio. O protocolo foi encaminhado às procuradorias municipais, sendo aprovado pelas de Ouro Preto e Itabirito. A procuradoria de Mariana ainda não assinou o documento.Este consórcio envolve o diálogo e a colaboração entre as cidades de Itabirito, Ouro Preto e Mariana, com foco no desenvolvimento de ações conjuntas nas áreas de lazer e esporte, sendo o primeiro consórcio do Brasil com essa temática exclusiva. Conheça mais sobre essa iniciativa no programa de hoje.Ficha TécnicaProdução: Larissa AntunesEdição de Texto: Elis CristinaEdição de áudio e sonoplastia: Aurélio Bernardi, Breno Estevam e Joezzer Sâmeke

brasil foi conhe aur regi desenvolvimento frutos esporte lazer coopera extens bernardi ouro preto elos

Žurnālistes Māras Rozenbergas pieredzējumi "Oskaru" balvas ceremonijā Losandželosā

Kultūras Rondo

Play Episode Listen Later Mar 7, 2025 28:33

Kultūras žurnālistes Māras Rozenbergas iespaidi un pieredzētais ASV Kinokakadēmijas "Oskaru" balvas ceremonijā Losandželosā.

kult listes jumi elos pieredz losand

Outlasting Noam Shazeer, crowdsourcing Chat + AI with >1.4m DAU, and becoming the "Western DeepSeek" — with William Beauchamp, Chai Research

Latent Space: The AI Engineer Podcast â€” CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Jan 26, 2025 75:46

One last Gold sponsor slot is available for the AI Engineer Summit in NYC. Our last round of invites is going out soon - apply here - If you are building AI agents or AI eng teams, this will be the single highest-signal conference of the year for you!While the world melts down over DeepSeek, few are talking about the OTHER notable group of former hedge fund traders who pivoted into AI and built a remarkably profitable consumer AI business with a tiny team with incredibly cracked engineering team — Chai Research. In short order they have:* Started a Chat AI company well before Noam Shazeer started Character AI, and outlasted his departure.* Crossed 1m DAU in 2.5 years - William updates us on the pod that they've hit 1.4m DAU now, another +40% from a few months ago. Revenue crossed >$22m. * Launched the Chaiverse model crowdsourcing platform - taking 3-4 week A/B testing cycles down to 3-4 hours, and deploying >100 models a week.While they're not paying million dollar salaries, you can tell they're doing pretty well for an 11 person startup:The Chai Recipe: Building infra for rapid evalsRemember how the central thesis of LMarena (formerly LMsys) is that the only comprehensive way to evaluate LLMs is to let users try them out and pick winners?At the core of Chai is a mobile app that looks like Character AI, but is actually the largest LLM A/B testing arena in the world, specialized on retaining chat users for Chai's usecases (therapy, assistant, roleplay, etc). It's basically what LMArena would be if taken very, very seriously at one company (with $1m in prizes to boot):Chai publishes occasional research on how they think about this, including talks at their Palo Alto office:William expands upon this in today's podcast (34 mins in):Fundamentally, the way I would describe it is when you're building anything in life, you need to be able to evaluate it. And through evaluation, you can iterate, we can look at benchmarks, and we can say the issues with benchmarks and why they may not generalize as well as one would hope in the challenges of working with them. But something that works incredibly well is getting feedback from humans. And so we built this thing where anyone can submit a model to our developer backend, and it gets put in front of 5000 users, and the users can rate it. And we can then have a really accurate ranking of like which model, or users finding more engaging or more entertaining. And it gets, you know, it's at this point now, where every day we're able to, I mean, we evaluate between 20 and 50 models, LLMs, every single day, right. So even though we've got only got a team of, say, five AI researchers, they're able to iterate a huge quantity of LLMs, right. So our team ships, let's just say minimum 100 LLMs a week is what we're able to iterate through. Now, before that moment in time, we might iterate through three a week, we might, you know, there was a time when even doing like five a month was a challenge, right? By being able to change the feedback loops to the point where it's not, let's launch these three models, let's do an A-B test, let's assign, let's do different cohorts, let's wait 30 days to see what the day 30 retention is, which is the kind of the, if you're doing an app, that's like A-B testing 101 would be, do a 30-day retention test, assign different treatments to different cohorts and come back in 30 days. So that's insanely slow. That's just, it's too slow. And so we were able to get that 30-day feedback loop all the way down to something like three hours.In Crowdsourcing the leap to Ten Trillion-Parameter AGI, William describes Chai's routing as a recommender system, which makes a lot more sense to us than previous pitches for model routing startups:William is notably counter-consensus in a lot of his AI product principles:* No streaming: Chats appear all at once to allow rejection sampling* No voice: Chai actually beat Character AI to introducing voice - but removed it after finding that it was far from a killer feature.* Blending: “Something that we love to do at Chai is blending, which is, you know, it's the simplest way to think about it is you're going to end up, and you're going to pretty quickly see you've got one model that's really smart, one model that's really funny. How do you get the user an experience that is both smart and funny? Well, just 50% of the requests, you can serve them the smart model, 50% of the requests, you serve them the funny model.” (that's it!)But chief above all is the recommender system.We also referenced Exa CEO Will Bryk's concept of SuperKnowlege:Full Video versionOn YouTube. please like and subscribe!Timestamps* 00:00:04 Introductions and background of William Beauchamp* 00:01:19 Origin story of Chai AI* 00:04:40 Transition from finance to AI* 00:11:36 Initial product development and idea maze for Chai* 00:16:29 User psychology and engagement with AI companions* 00:20:00 Origin of the Chai name* 00:22:01 Comparison with Character AI and funding challenges* 00:25:59 Chai's growth and user numbers* 00:34:53 Key inflection points in Chai's growth* 00:42:10 Multi-modality in AI companions and focus on user-generated content* 00:46:49 Chaiverse developer platform and model evaluation* 00:51:58 Views on AGI and the nature of AI intelligence* 00:57:14 Evaluation methods and human feedback in AI development* 01:02:01 Content creation and user experience in Chai* 01:04:49 Chai Grant program and company culture* 01:07:20 Inference optimization and compute costs* 01:09:37 Rejection sampling and reward models in AI generation* 01:11:48 Closing thoughts and recruitmentTranscriptAlessio [00:00:04]: Hey everyone, welcome to the Latent Space podcast. This is Alessio, partner and CTO at Decibel, and today we're in the Chai AI office with my usual co-host, Swyx.swyx [00:00:14]: Hey, thanks for having us. It's rare that we get to get out of the office, so thanks for inviting us to your home. We're in the office of Chai with William Beauchamp. Yeah, that's right. You're founder of Chai AI, but previously, I think you're concurrently also running your fund?William [00:00:29]: Yep, so I was simultaneously running an algorithmic trading company, but I fortunately was able to kind of exit from that, I think just in Q3 last year. Yeah, congrats. Yeah, thanks.swyx [00:00:43]: So Chai has always been on my radar because, well, first of all, you do a lot of advertising, I guess, in the Bay Area, so it's working. Yep. And second of all, the reason I reached out to a mutual friend, Joyce, was because I'm just generally interested in the... ...consumer AI space, chat platforms in general. I think there's a lot of inference insights that we can get from that, as well as human psychology insights, kind of a weird blend of the two. And we also share a bit of a history as former finance people crossing over. I guess we can just kind of start it off with the origin story of Chai.William [00:01:19]: Why decide working on a consumer AI platform rather than B2B SaaS? So just quickly touching on the background in finance. Sure. Originally, I'm from... I'm from the UK, born in London. And I was fortunate enough to go study economics at Cambridge. And I graduated in 2012. And at that time, everyone in the UK and everyone on my course, HFT, quant trading was really the big thing. It was like the big wave that was happening. So there was a lot of opportunity in that space. And throughout college, I'd sort of played poker. So I'd, you know, I dabbled as a professional poker player. And I was able to accumulate this sort of, you know, say $100,000 through playing poker. And at the time, as my friends would go work at companies like ChangeStreet or Citadel, I kind of did the maths. And I just thought, well, maybe if I traded my own capital, I'd probably come out ahead. I'd make more money than just going to work at ChangeStreet.swyx [00:02:20]: With 100k base as capital?William [00:02:22]: Yes, yes. That's not a lot. Well, it depends what strategies you're doing. And, you know, there is an advantage. There's an advantage to being small, right? Because there are, if you have a 10... Strategies that don't work in size. Exactly, exactly. So if you have a fund of $10 million, if you find a little anomaly in the market that you might be able to make 100k a year from, that's a 1% return on your 10 million fund. If your fund is 100k, that's 100% return, right? So being small, in some sense, was an advantage. So started off, and the, taught myself Python, and machine learning was like the big thing as well. Machine learning had really, it was the first, you know, big time machine learning was being used for image recognition, neural networks come out, you get dropout. And, you know, so this, this was the big thing that's going on at the time. So I probably spent my first three years out of Cambridge, just building neural networks, building random forests to try and predict asset prices, right, and then trade that using my own money. And that went well. And, you know, if you if you start something, and it goes well, you You try and hire more people. And the first people that came to mind was the talented people I went to college with. And so I hired some friends. And that went well and hired some more. And eventually, I kind of ran out of friends to hire. And so that was when I formed the company. And from that point on, we had our ups and we had our downs. And that was a whole long story and journey in itself. But after doing that for about eight or nine years, on my 30th birthday, which was four years ago now, I kind of took a step back to just evaluate my life, right? This is what one does when one turns 30. You know, I just heard it. I hear you. And, you know, I looked at my 20s and I loved it. It was a really special time. I was really lucky and fortunate to have worked with this amazing team, been successful, had a lot of hard times. And through the hard times, learned wisdom and then a lot of success and, you know, was able to enjoy it. And so the company was making about five million pounds a year. And it was just me and a team of, say, 15, like, Oxford and Cambridge educated mathematicians and physicists. It was like the real dream that you'd have if you wanted to start a quant trading firm. It was like...swyx [00:04:40]: Your own, all your own money?William [00:04:41]: Yeah, exactly. It was all the team's own money. We had no customers complaining to us about issues. There's no investors, you know, saying, you know, they don't like the risk that we're taking. We could. We could really run the thing exactly as we wanted it. It's like Susquehanna or like Rintec. Yeah, exactly. Yeah. And they're the companies that we would kind of look towards as we were building that thing out. But on my 30th birthday, I look and I say, OK, great. This thing is making as much money as kind of anyone would really need. And I thought, well, what's going to happen if we keep going in this direction? And it was clear that we would never have a kind of a big, big impact on the world. We can enrich ourselves. We can make really good money. Everyone on the team would be paid very, very well. Presumably, I can make enough money to buy a yacht or something. But this stuff wasn't that important to me. And so I felt a sort of obligation that if you have this much talent and if you have a talented team, especially as a founder, you want to be putting all that talent towards a good use. I looked at the time of like getting into crypto and I had a really strong view on crypto, which was that as far as a gambling device. This is like the most fun form of gambling invented in like ever super fun, I thought as a way to evade monetary regulations and banking restrictions. I think it's also absolutely amazing. So it has two like killer use cases, not so much banking the unbanked, but everything else, but everything else to do with like the blockchain and, and you know, web, was it web 3.0 or web, you know, that I, that didn't, it didn't really make much sense. And so instead of going into crypto, which I thought, even if I was successful, I'd end up in a lot of trouble. I thought maybe it'd be better to build something that governments wouldn't have a problem with. I knew that LLMs were like a thing. I think opening. I had said they hadn't released GPT-3 yet, but they'd said GPT-3 is so powerful. We can't release it to the world or something. Was it GPT-2? And then I started interacting with, I think Google had open source, some language models. They weren't necessarily LLMs, but they, but they were. But yeah, exactly. So I was able to play around with, but nowadays so many people have interacted with the chat GPT, they get it, but it's like the first time you, you can just talk to a computer and it talks back. It's kind of a special moment and you know, everyone who's done that goes like, wow, this is how it should be. Right. It should be like, rather than having to type on Google and search, you should just be able to ask Google a question. When I saw that I read the literature, I kind of came across the scaling laws and I think even four years ago. All the pieces of the puzzle were there, right? Google had done this amazing research and published, you know, a lot of it. Open AI was still open. And so they'd published a lot of their research. And so you really could be fully informed on, on the state of AI and where it was going. And so at that point I was confident enough, it was worth a shot. I think LLMs are going to be the next big thing. And so that's the thing I want to be building in, in that space. And I thought what's the most impactful product I can possibly build. And I thought it should be a platform. So I myself love platforms. I think they're fantastic because they open up an ecosystem where anyone can contribute to it. Right. So if you think of a platform like a YouTube, instead of it being like a Hollywood situation where you have to, if you want to make a TV show, you have to convince Disney to give you the money to produce it instead, anyone in the world can post any content they want to YouTube. And if people want to view it, the algorithm is going to promote it. Nowadays. You can look at creators like Mr. Beast or Joe Rogan. They would have never have had that opportunity unless it was for this platform. Other ones like Twitter's a great one, right? But I would consider Wikipedia to be a platform where instead of the Britannica encyclopedia, which is this, it's like a monolithic, you get all the, the researchers together, you get all the data together and you combine it in this, in this one monolithic source. Instead. You have this distributed thing. You can say anyone can host their content on Wikipedia. Anyone can contribute to it. And anyone can maybe their contribution is they delete stuff. When I was hearing like the kind of the Sam Altman and kind of the, the Muskian perspective of AI, it was a very kind of monolithic thing. It was all about AI is basically a single thing, which is intelligence. Yeah. Yeah. The more intelligent, the more compute, the more intelligent, and the more and better AI researchers, the more intelligent, right? They would speak about it as a kind of erased, like who can get the most data, the most compute and the most researchers. And that would end up with the most intelligent AI. But I didn't believe in any of that. I thought that's like the total, like I thought that perspective is the perspective of someone who's never actually done machine learning. Because with machine learning, first of all, you see that the performance of the models follows an S curve. So it's not like it just goes off to infinity, right? And the, the S curve, it kind of plateaus around human level performance. And you can look at all the, all the machine learning that was going on in the 2010s, everything kind of plateaued around the human level performance. And we can think about the self-driving car promises, you know, how Elon Musk kept saying the self-driving car is going to happen next year, it's going to happen next, next year. Or you can look at the image recognition, the speech recognition. You can look at. All of these things, there was almost nothing that went superhuman, except for something like AlphaGo. And we can speak about why AlphaGo was able to go like super superhuman. So I thought the most likely thing was going to be this, I thought it's not going to be a monolithic thing. That's like an encyclopedia Britannica. I thought it must be a distributed thing. And I actually liked to look at the world of finance for what I think a mature machine learning ecosystem would look like. So, yeah. So finance is a machine learning ecosystem because all of these quant trading firms are running machine learning algorithms, but they're running it on a centralized platform like a marketplace. And it's not the case that there's one giant quant trading company of all the data and all the quant researchers and all the algorithms and compute, but instead they all specialize. So one will specialize on high frequency training. Another will specialize on mid frequency. Another one will specialize on equity. Another one will specialize. And I thought that's the way the world works. That's how it is. And so there must exist a platform where a small team can produce an AI for a unique purpose. And they can iterate and build the best thing for that, right? And so that was the vision for Chai. So we wanted to build a platform for LLMs.Alessio [00:11:36]: That's kind of the maybe inside versus contrarian view that led you to start the company. Yeah. And then what was maybe the initial idea maze? Because if somebody told you that was the Hugging Face founding story, people might believe it. It's kind of like a similar ethos behind it. How did you land on the product feature today? And maybe what were some of the ideas that you discarded that initially you thought about?William [00:11:58]: So the first thing we built, it was fundamentally an API. So nowadays people would describe it as like agents, right? But anyone could write a Python script. They could submit it to an API. They could send it to the Chai backend and we would then host this code and execute it. So that's like the developer side of the platform. On their Python script, the interface was essentially text in and text out. An example would be the very first bot that I created. I think it was a Reddit news bot. And so it would first, it would pull the popular news. Then it would prompt whatever, like I just use some external API for like Burr or GPT-2 or whatever. Like it was a very, very small thing. And then the user could talk to it. So you could say to the bot, hi bot, what's the news today? And it would say, this is the top stories. And you could chat with it. Now four years later, that's like perplexity or something. That's like the, right? But back then the models were first of all, like really, really dumb. You know, they had an IQ of like a four year old. And users, there really wasn't any demand or any PMF for interacting with the news. So then I was like, okay. Um. So let's make another one. And I made a bot, which was like, you could talk to it about a recipe. So you could say, I'm making eggs. Like I've got eggs in my fridge. What should I cook? And it'll say, you should make an omelet. Right. There was no PMF for that. No one used it. And so I just kept creating bots. And so every single night after work, I'd be like, okay, I like, we have AI, we have this platform. I can create any text in textile sort of agent and put it on the platform. And so we just create stuff night after night. And then all the coders I knew, I would say, yeah, this is what we're going to do. And then I would say to them, look, there's this platform. You can create any like chat AI. You should put it on. And you know, everyone's like, well, chatbots are super lame. We want absolutely nothing to do with your chatbot app. No one who knew Python wanted to build on it. I'm like trying to build all these bots and no consumers want to talk to any of them. And then my sister who at the time was like just finishing college or something, I said to her, I was like, if you want to learn Python, you should just submit a bot for my platform. And she, she built a therapy for me. And I was like, okay, cool. I'm going to build a therapist bot. And then the next day I checked the performance of the app and I'm like, oh my God, we've got 20 active users. And they spent, they spent like an average of 20 minutes on the app. I was like, oh my God, what, what bot were they speaking to for an average of 20 minutes? And I looked and it was the therapist bot. And I went, oh, this is where the PMF is. There was no demand for, for recipe help. There was no demand for news. There was no demand for dad jokes or pub quiz or fun facts or what they wanted was they wanted the therapist bot. the time I kind of reflected on that and I thought, well, if I want to consume news, the most fun thing, most fun way to consume news is like Twitter. It's not like the value of there being a back and forth, wasn't that high. Right. And I thought if I need help with a recipe, I actually just go like the New York times has a good recipe section, right? It's not actually that hard. And so I just thought the thing that AI is 10 X better at is a sort of a conversation right. That's not intrinsically informative, but it's more about an opportunity. You can say whatever you want. You're not going to get judged. If it's 3am, you don't have to wait for your friend to text back. It's like, it's immediate. They're going to reply immediately. You can say whatever you want. It's judgment-free and it's much more like a playground. It's much more like a fun experience. And you could see that if the AI gave a person a compliment, they would love it. It's much easier to get the AI to give you a compliment than a human. From that day on, I said, okay, I get it. Humans want to speak to like humans or human like entities and they want to have fun. And that was when I started to look less at platforms like Google. And I started to look more at platforms like Instagram. And I was trying to think about why do people use Instagram? And I could see that I think Chai was, was filling the same desire or the same drive. If you go on Instagram, typically you want to look at the faces of other humans, or you want to hear about other people's lives. So if it's like the rock is making himself pancakes on a cheese plate. You kind of feel a little bit like you're the rock's friend, or you're like having pancakes with him or something, right? But if you do it too much, you feel like you're sad and like a lonely person, but with AI, you can talk to it and tell it stories and tell you stories, and you can play with it for as long as you want. And you don't feel like you're like a sad, lonely person. You feel like you actually have a friend.Alessio [00:16:29]: And what, why is that? Do you have any insight on that from using it?William [00:16:33]: I think it's just the human psychology. I think it's just the idea that, with old school social media. You're just consuming passively, right? So you'll just swipe. If I'm watching TikTok, just like swipe and swipe and swipe. And even though I'm getting the dopamine of like watching an engaging video, there's this other thing that's building my head, which is like, I'm feeling lazier and lazier and lazier. And after a certain period of time, I'm like, man, I just wasted 40 minutes. I achieved nothing. But with AI, because you're interacting, you feel like you're, it's not like work, but you feel like you're participating and contributing to the thing. You don't feel like you're just. Consuming. So you don't have a sense of remorse basically. And you know, I think on the whole people, the way people talk about, try and interact with the AI, they speak about it in an incredibly positive sense. Like we get people who say they have eating disorders saying that the AI helps them with their eating disorders. People who say they're depressed, it helps them through like the rough patches. So I think there's something intrinsically healthy about interacting that TikTok and Instagram and YouTube doesn't quite tick. From that point on, it was about building more and more kind of like human centric AI for people to interact with. And I was like, okay, let's make a Kanye West bot, right? And then no one wanted to talk to the Kanye West bot. And I was like, ah, who's like a cool persona for teenagers to want to interact with. And I was like, I was trying to find the influencers and stuff like that, but no one cared. Like they didn't want to interact with the, yeah. And instead it was really just the special moment was when we said the realization that developers and software engineers aren't interested in building this sort of AI, but the consumers are right. And rather than me trying to guess every day, like what's the right bot to submit to the platform, why don't we just create the tools for the users to build it themselves? And so nowadays this is like the most obvious thing in the world, but when Chai first did it, it was not an obvious thing at all. Right. Right. So we took the API for let's just say it was, I think it was GPTJ, which was this 6 billion parameter open source transformer style LLM. We took GPTJ. We let users create the prompt. We let users select the image and we let users choose the name. And then that was the bot. And through that, they could shape the experience, right? So if they said this bot's going to be really mean, and it's going to be called like bully in the playground, right? That was like a whole category that I never would have guessed. Right. People love to fight. They love to have a disagreement, right? And then they would create, there'd be all these romantic archetypes that I didn't know existed. And so as the users could create the content that they wanted, that was when Chai was able to, to get this huge variety of content and rather than appealing to, you know, 1% of the population that I'd figured out what they wanted, you could appeal to a much, much broader thing. And so from that moment on, it was very, very crystal clear. It's like Chai, just as Instagram is this social media platform that lets people create images and upload images, videos and upload that, Chai was really about how can we let the users create this experience in AI and then share it and interact and search. So it's really, you know, I say it's like a platform for social AI.Alessio [00:20:00]: Where did the Chai name come from? Because you started the same path. I was like, is it character AI shortened? You started at the same time, so I was curious. The UK origin was like the second, the Chai.William [00:20:15]: We started way before character AI. And there's an interesting story that Chai's numbers were very, very strong, right? So I think in even 20, I think late 2022, was it late 2022 or maybe early 2023? Chai was like the number one AI app in the app store. So we would have something like 100,000 daily active users. And then one day we kind of saw there was this website. And we were like, oh, this website looks just like Chai. And it was the character AI website. And I think that nowadays it's, I think it's much more common knowledge that when they left Google with the funding, I think they knew what was the most trending, the number one app. And I think they sort of built that. Oh, you found the people.swyx [00:21:03]: You found the PMF for them.William [00:21:04]: We found the PMF for them. Exactly. Yeah. So I worked a year very, very hard. And then they, and then that was when I learned a lesson, which is that if you're VC backed and if, you know, so Chai, we'd kind of ran, we'd got to this point, I was the only person who'd invested. I'd invested maybe 2 million pounds in the business. And you know, from that, we were able to build this thing, get to say a hundred thousand daily active users. And then when character AI came along, the first version, we sort of laughed. We were like, oh man, this thing sucks. Like they don't know what they're building. They're building the wrong thing anyway, but then I saw, oh, they've raised a hundred million dollars. Oh, they've raised another hundred million dollars. And then our users started saying, oh guys, your AI sucks. Cause we were serving a 6 billion parameter model, right? How big was the model that character AI could afford to serve, right? So we would be spending, let's say we would spend a dollar per per user, right? Over the, the, you know, the entire lifetime.swyx [00:22:01]: A dollar per session, per chat, per month? No, no, no, no.William [00:22:04]: Let's say we'd get over the course of the year, we'd have a million users and we'd spend a million dollars on the AI throughout the year. Right. Like aggregated. Exactly. Exactly. Right. They could spend a hundred times that. So people would say, why is your AI much dumber than character AIs? And then I was like, oh, okay, I get it. This is like the Silicon Valley style, um, hyper scale business. And so, yeah, we moved to Silicon Valley and, uh, got some funding and iterated and built the flywheels. And, um, yeah, I, I'm very proud that we were able to compete with that. Right. So, and I think the reason we were able to do it was just customer obsession. And it's similar, I guess, to how deep seek have been able to produce such a compelling model when compared to someone like an open AI, right? So deep seek, you know, their latest, um, V2, yeah, they claim to have spent 5 million training it.swyx [00:22:57]: It may be a bit more, but, um, like, why are you making it? Why are you making such a big deal out of this? Yeah. There's an agenda there. Yeah. You brought up deep seek. So we have to ask you had a call with them.William [00:23:07]: We did. We did. We did. Um, let me think what to say about that. I think for one, they have an amazing story, right? So their background is again in finance.swyx [00:23:16]: They're the Chinese version of you. Exactly.William [00:23:18]: Well, there's a lot of similarities. Yes. Yes. I have a great affinity for companies which are like, um, founder led, customer obsessed and just try and build something great. And I think what deep seek have achieved. There's quite special is they've got this amazing inference engine. They've been able to reduce the size of the KV cash significantly. And then by being able to do that, they're able to significantly reduce their inference costs. And I think with kind of with AI, people get really focused on like the kind of the foundation model or like the model itself. And they sort of don't pay much attention to the inference. To give you an example with Chai, let's say a typical user session is 90 minutes, which is like, you know, is very, very long for comparison. Let's say the average session length on TikTok is 70 minutes. So people are spending a lot of time. And in that time they're able to send say 150 messages. That's a lot of completions, right? It's quite different from an open AI scenario where people might come in, they'll have a particular question in mind. And they'll ask like one question. And a few follow up questions, right? So because they're consuming, say 30 times as many requests for a chat, or a conversational experience, you've got to figure out how to how to get the right balance between the cost of that and the quality. And so, you know, I think with AI, it's always been the case that if you want a better experience, you can throw compute at the problem, right? So if you want a better model, you can just make it bigger. If you want it to remember better, give it a longer context. And now, what open AI is doing to great fanfare is with projection sampling, you can generate many candidates, right? And then with some sort of reward model or some sort of scoring system, you can serve the most promising of these many candidates. And so that's kind of scaling up on the inference time compute side of things. And so for us, it doesn't make sense to think of AI is just the absolute performance. So. But what we're seeing, it's like the MML you score or the, you know, any of these benchmarks that people like to look at, if you just get that score, it doesn't really tell tell you anything. Because it's really like progress is made by improving the performance per dollar. And so I think that's an area where deep seek have been able to form very, very well, surprisingly so. And so I'm very interested in what Lama four is going to look like. And if they're able to sort of match what deep seek have been able to achieve with this performance per dollar gain.Alessio [00:25:59]: Before we go into the inference, some of the deeper stuff, can you give people an overview of like some of the numbers? So I think last I checked, you have like 1.4 million daily active now. It's like over 22 million of revenue. So it's quite a business.William [00:26:12]: Yeah, I think we grew by a factor of, you know, users grew by a factor of three last year. Revenue over doubled. You know, it's very exciting. We're competing with some really big, really well funded companies. Character AI got this, I think it was almost a $3 billion valuation. And they have 5 million DAU is a number that I last heard. Torquay, which is a Chinese built app owned by a company called Minimax. They're incredibly well funded. And these companies didn't grow by a factor of three last year. Right. And so when you've got this company and this team that's able to keep building something that gets users excited, and they want to tell their friend about it, and then they want to come and they want to stick on the platform. I think that's very special. And so last year was a great year for the team. And yeah, I think the numbers reflect the hard work that we put in. And then fundamentally, the quality of the app, the quality of the content, the quality of the content, the quality of the content, the quality of the content, the quality of the content. AI is the quality of the experience that you have. You actually published your DAU growth chart, which is unusual. And I see some inflections. Like, it's not just a straight line. There's some things that actually inflect. Yes. What were the big ones? Cool. That's a great, great, great question. Let me think of a good answer. I'm basically looking to annotate this chart, which doesn't have annotations on it. Cool. The first thing I would say is this is, I think the most important thing to know about success is that success is born out of failures. Right? Through failures that we learn. You know, if you think something's a good idea, and you do and it works, great, but you didn't actually learn anything, because everything went exactly as you imagined. But if you have an idea, you think it's going to be good, you try it, and it fails. There's a gap between the reality and expectation. And that's an opportunity to learn. The flat periods, that's us learning. And then the up periods is that's us reaping the rewards of that. So I think the big, of the growth shot of just 2024, I think the first thing that really kind of put a dent in our growth was our backend. So we just reached this scale. So we'd, from day one, we'd built on top of Google's GCP, which is Google's cloud platform. And they were fantastic. We used them when we had one daily active user, and they worked pretty good all the way up till we had about 500,000. It was never the cheapest, but from an engineering perspective, man, that thing scaled insanely good. Like, not Vertex? Not Vertex. Like GKE, that kind of stuff? We use Firebase. So we use Firebase. I'm pretty sure we're the biggest user ever on Firebase. That's expensive. Yeah, we had calls with engineers, and they're like, we wouldn't recommend using this product beyond this point, and you're 3x over that. So we pushed Google to their absolute limits. You know, it was fantastic for us, because we could focus on the AI. We could focus on just adding as much value as possible. But then what happened was, after 500,000, just the thing, the way we were using it, and it would just, it wouldn't scale any further. And so we had a really, really painful, at least three-month period, as we kind of migrated between different services, figuring out, like, what requests do we want to keep on Firebase, and what ones do we want to move on to something else? And then, you know, making mistakes. And learning things the hard way. And then after about three months, we got that right. So that, we would then be able to scale to the 1.5 million DAE without any further issues from the GCP. But what happens is, if you have an outage, new users who go on your app experience a dysfunctional app, and then they're going to exit. And so your next day, the key metrics that the app stores track are going to be something like retention rates. And so your next day, the key metrics that the app stores track are going to be something like retention rates. Money spent, and the star, like, the rating that they give you. In the app store. In the app store, yeah. Tyranny. So if you're ranked top 50 in entertainment, you're going to acquire a certain rate of users organically. If you go in and have a bad experience, it's going to tank where you're positioned in the algorithm. And then it can take a long time to kind of earn your way back up, at least if you wanted to do it organically. If you throw money at it, you can jump to the top. And I could talk about that. But broadly speaking, if we look at 2024, the first kink in the graph was outages due to hitting 500k DAU. The backend didn't want to scale past that. So then we just had to do the engineering and build through it. Okay, so we built through that, and then we get a little bit of growth. And so, okay, that's feeling a little bit good. I think the next thing, I think it's, I'm not going to lie, I have a feeling that when Character AI got... I was thinking. I think so. I think... So the Character AI team fundamentally got acquired by Google. And I don't know what they changed in their business. I don't know if they dialed down that ad spend. Products don't change, right? Products just what it is. I don't think so. Yeah, I think the product is what it is. It's like maintenance mode. Yes. I think the issue that people, you know, some people may think this is an obvious fact, but running a business can be very competitive, right? Because other businesses can see what you're doing, and they can imitate you. And then there's this... There's this question of, if you've got one company that's spending $100,000 a day on advertising, and you've got another company that's spending zero, if you consider market share, and if you're considering new users which are entering the market, the guy that's spending $100,000 a day is going to be getting 90% of those new users. And so I have a suspicion that when the founders of Character AI left, they dialed down their spending on user acquisition. And I think that kind of gave oxygen to like the other apps. And so Chai was able to then start growing again in a really healthy fashion. I think that's kind of like the second thing. I think a third thing is we've really built a great data flywheel. Like the AI team sort of perfected their flywheel, I would say, in end of Q2. And I could speak about that at length. But fundamentally, the way I would describe it is when you're building anything in life, you need to be able to evaluate it. And through evaluation, you can iterate, we can look at benchmarks, and we can say the issues with benchmarks and why they may not generalize as well as one would hope in the challenges of working with them. But something that works incredibly well is getting feedback from humans. And so we built this thing where anyone can submit a model to our developer backend, and it gets put in front of 5000 users, and the users can rate it. And we can then have a really accurate ranking of like which model, or users finding more engaging or more entertaining. And it gets, you know, it's at this point now, where every day we're able to, I mean, we evaluate between 20 and 50 models, LLMs, every single day, right. So even though we've got only got a team of, say, five AI researchers, they're able to iterate a huge quantity of LLMs, right. So our team ships, let's just say minimum 100 LLMs a week is what we're able to iterate through. Now, before that moment in time, we might iterate through three a week, we might, you know, there was a time when even doing like five a month was a challenge, right? By being able to change the feedback loops to the point where it's not, let's launch these three models, let's do an A-B test, let's assign, let's do different cohorts, let's wait 30 days to see what the day 30 retention is, which is the kind of the, if you're doing an app, that's like A-B testing 101 would be, do a 30-day retention test, assign different treatments to different cohorts and come back in 30 days. So that's insanely slow. That's just, it's too slow. And so we were able to get that 30-day feedback loop all the way down to something like three hours. And when we did that, we could really, really, really perfect techniques like DPO, fine tuning, prompt engineering, blending, rejection sampling, training a reward model, right, really successfully, like boom, boom, boom, boom, boom. And so I think in Q3 and Q4, we got, the amount of AI improvements we got was like astounding. It was getting to the point, I thought like how much more, how much more edge is there to be had here? But the team just could keep going and going and going. That was like number three for the inflection point.swyx [00:34:53]: There's a fourth?William [00:34:54]: The important thing about the third one is if you go on our Reddit or you talk to users of AI, there's like a clear date. It's like somewhere in October or something. The users, they flipped. Before October, the users... The users would say character AI is better than you, for the most part. Then from October onwards, they would say, wow, you guys are better than character AI. And that was like a really clear positive signal that we'd sort of done it. And I think people, you can't cheat consumers. You can't trick them. You can't b******t them. They know, right? If you're going to spend 90 minutes on a platform, and with apps, there's the barriers to switching is pretty low. Like you can try character AI, you can't cheat consumers. You can't cheat them. You can't cheat them. You can't cheat AI for a day. If you get bored, you can try Chai. If you get bored of Chai, you can go back to character. So the users, the loyalty is not strong, right? What keeps them on the app is the experience. If you deliver a better experience, they're going to stay and they can tell. So that was the fourth one was we were fortunate enough to get this hire. He was hired one really talented engineer. And then they said, oh, at my last company, we had a head of growth. He was really, really good. And he was the head of growth for ByteDance for two years. Would you like to speak to him? And I was like, yes. Yes, I think I would. And so I spoke to him. And he just blew me away with what he knew about user acquisition. You know, it was like a 3D chessswyx [00:36:21]: sort of thing. You know, as much as, as I know about AI. Like ByteDance as in TikTok US. Yes.William [00:36:26]: Not ByteDance as other stuff. Yep. He was interviewing us as we were interviewing him. Right. And so pick up options. Yeah, exactly. And so he was kind of looking at our metrics. And he was like, I saw him get really excited when he said, guys, you've got a million daily active users and you've done no advertising. I said, correct. And he was like, that's unheard of. He's like, I've never heard of anyone doing that. And then he started looking at our metrics. And he was like, if you've got all of this organically, if you start spending money, this is going to be very exciting. I was like, let's give it a go. So then he came in, we've just started ramping up the user acquisition. So that looks like spending, you know, let's say we're spending, we started spending $20,000 a day, it looked very promising than 20,000. Right now we're spending $40,000 a day on user acquisition. That's still only half of what like character AI or talkie may be spending. But from that, it's sort of, we were growing at a rate of maybe say, 2x a year. And that got us growing at a rate of 3x a year. So I'm growing, I'm evolving more and more to like a Silicon Valley style hyper growth, like, you know, you build something decent, and then you canswyx [00:37:33]: slap on a huge... You did the important thing, you did the product first.William [00:37:36]: Of course, but then you can slap on like, like the rocket or the jet engine or something, which is just this cash in, you pour in as much cash, you buy a lot of ads, and your growth is faster.swyx [00:37:48]: Not to, you know, I'm just kind of curious what's working right now versus what surprisinglyWilliam [00:37:52]: doesn't work. Oh, there's a long, long list of surprising stuff that doesn't work. Yeah. The surprising thing, like the most surprising thing, what doesn't work is almost everything doesn't work. That's what's surprising. And I'll give you an example. So like a year and a half ago, I was working at a company, we were super excited by audio. I was like, audio is going to be the next killer feature, we have to get in the app. And I want to be the first. So everything Chai does, I want us to be the first. We may not be the company that's strongest at execution, but we can always be theswyx [00:38:22]: most innovative. Interesting. Right? So we can... You're pretty strong at execution.William [00:38:26]: We're much stronger, we're much stronger. A lot of the reason we're here is because we were first. If we launched today, it'd be so hard to get the traction. Because it's like to get the flywheel, to get the users, to build a product people are excited about. If you're first, people are naturally excited about it. But if you're fifth or 10th, man, you've got to beswyx [00:38:46]: insanely good at execution. So you were first with voice? We were first. We were first. I only knowWilliam [00:38:51]: when character launched voice. They launched it, I think they launched it at least nine months after us. Okay. Okay. But the team worked so hard for it. At the time we did it, latency is a huge problem. Cost is a huge problem. Getting the right quality of the voice is a huge problem. Right? Then there's this user interface and getting the right user experience. Because you don't just want it to start blurting out. Right? You want to kind of activate it. But then you don't have to keep pressing a button every single time. There's a lot that goes into getting a really smooth audio experience. So we went ahead, we invested the three months, we built it all. And then when we did the A-B test, there was like, no change in any of the numbers. And I was like, this can't be right, there must be a bug. And we spent like a week just checking everything, checking again, checking again. And it was like, the users just did not care. And it was something like only 10 or 15% of users even click the button to like, they wanted to engage the audio. And they would only use it for 10 or 15% of the time. So if you do the math, if it's just like something that one in seven people use it for one seventh of their time. You've changed like 2% of the experience. So even if that that 2% of the time is like insanely good, it doesn't translate much when you look at the retention, when you look at the engagement, and when you look at the monetization rates. So audio did not have a big impact. I'm pretty big on audio. But yeah, I like it too. But it's, you know, so a lot of the stuff which I do, I'm a big, you can have a theory. And you resist. Yeah. Exactly, exactly. So I think if you want to make audio work, it has to be a unique, compelling, exciting experience that they can't have anywhere else.swyx [00:40:37]: It could be your models, which just weren't good enough.William [00:40:39]: No, no, no, they were great. Oh, yeah, they were very good. it was like, it was kind of like just the, you know, if you listen to like an audible or Kindle, or something like, you just hear this voice. And it's like, you don't go like, wow, this is this is special, right? It's like a convenience thing. But the idea is that if you can, if Chai is the only platform, like, let's say you have a Mr. Beast, and YouTube is the only platform you can use to make audio work, then you can watch a Mr. Beast video. And it's the most engaging, fun video that you want to watch, you'll go to a YouTube. And so it's like for audio, you can't just put the audio on there. And people go, oh, yeah, it's like 2% better. Or like, 5% of users think it's 20% better, right? It has to be something that the majority of people, for the majority of the experience, go like, wow, this is a big deal. That's the features you need to be shipping. If it's not going to appeal to the majority of people, for the majority of the experience, and it's not a big deal, it's not going to move you. Cool. So you killed it. I don't see it anymore. Yep. So I love this. The longer, it's kind of cheesy, I guess, but the longer I've been working at Chai, and I think the team agrees with this, all the platitudes, at least I thought they were platitudes, that you would get from like the Steve Jobs, which is like, build something insanely great, right? Or be maniacally focused, or, you know, the most important thing is saying no to, not to work on. All of these sort of lessons, they just are like painfully true. They're painfully true. So now I'm just like, everything I say, I'm either quoting Steve Jobs or Zuckerberg. I'm like, guys, move fast and break free.swyx [00:42:10]: You've jumped the Apollo to cool it now.William [00:42:12]: Yeah, it's just so, everything they said is so, so true. The turtle neck. Yeah, yeah, yeah. Everything is so true.swyx [00:42:18]: This last question on my side, and I want to pass this to Alessio, is on just, just multi-modality in general. This actually comes from Justine Moore from A16Z, who's a friend of ours. And a lot of people are trying to do voice image video for AI companions. Yes. You just said voice didn't work. Yep. What would make you revisit?William [00:42:36]: So Steve Jobs, he was very, listen, he was very, very clear on this. There's a habit of engineers who, once they've got some cool technology, they want to find a way to package up the cool technology and sell it to consumers, right? That does not work. So you're free to try and build a startup where you've got your cool tech and you want to find someone to sell it to. That's not what we do at Chai. At Chai, we start with the consumer. What does the consumer want? What is their problem? And how do we solve it? So right now, the number one problems for the users, it's not the audio. That's not the number one problem. It's not the image generation either. That's not their problem either. The number one problem for users in AI is this. All the AI is being generated by middle-aged men in Silicon Valley, right? That's all the content. You're interacting with this AI. You're speaking to it for 90 minutes on average. It's being trained by middle-aged men. The guys out there, they're out there. They're talking to you. They're talking to you. They're like, oh, what should the AI say in this situation, right? What's funny, right? What's cool? What's boring? What's entertaining? That's not the way it should be. The way it should be is that the users should be creating the AI, right? And so the way I speak about it is this. Chai, we have this AI engine in which sits atop a thin layer of UGC. So the thin layer of UGC is absolutely essential, right? It's just prompts. But it's just prompts. It's just an image. It's just a name. It's like we've done 1% of what we could do. So we need to keep thickening up that layer of UGC. It must be the case that the users can train the AI. And if reinforcement learning is powerful and important, they have to be able to do that. And so it's got to be the case that there exists, you know, I say to the team, just as Mr. Beast is able to spend 100 million a year or whatever it is on his production company, and he's got a team building the content, the Mr. Beast company is able to spend 100 million a year on his production company. And he's got a team building the content, which then he shares on the YouTube platform. Until there's a team that's earning 100 million a year or spending 100 million on the content that they're producing for the Chai platform, we're not finished, right? So that's the problem. That's what we're excited to build. And getting too caught up in the tech, I think is a fool's errand. It does not work.Alessio [00:44:52]: As an aside, I saw the Beast Games thing on Amazon Prime. It's not doing well. And I'mswyx [00:44:56]: curious. It's kind of like, I mean, the audience reading is high. The run-to-meet-all sucks, but the audience reading is high.Alessio [00:45:02]: But it's not like in the top 10. I saw it dropped off of like the... Oh, okay. Yeah, that one I don't know. I'm curious, like, you know, it's kind of like similar content, but different platform. And then going back to like, some of what you were saying is like, you know, people come to ChaiWilliam [00:45:13]: expecting some type of content. Yeah, I think it's something that's interesting to discuss is like, is moats. And what is the moat? And so, you know, if you look at a platform like YouTube, the moat, I think is in first is really is in the ecosystem. And the ecosystem, is comprised of you have the content creators, you have the users, the consumers, and then you have the algorithms. And so this, this creates a sort of a flywheel where the algorithms are able to be trained on the users, and the users data, the recommend systems can then feed information to the content creators. So Mr. Beast, he knows which thumbnail does the best. He knows the first 10 seconds of the video has to be this particular way. And so his content is super optimized for the YouTube platform. So that's why it doesn't do well on Amazon. If he wants to do well on Amazon, how many videos has he created on the YouTube platform? By thousands, 10s of 1000s, I guess, he needs to get those iterations in on the Amazon. So at Chai, I think it's all about how can we get the most compelling, rich user generated content, stick that on top of the AI engine, the recommender systems, in such that we get this beautiful data flywheel, more users, better recommendations, more creative, more content, more users.Alessio [00:46:34]: You mentioned the algorithm, you have this idea of the Chaiverse on Chai, and you have your own kind of like LMSYS-like ELO system. Yeah, what are things that your models optimize for, like your users optimize for, and maybe talk about how you build it, how people submit models?William [00:46:49]: So Chaiverse is what I would describe as a developer platform. More often when we're speaking about Chai, we're thinking about the Chai app. And the Chai app is really this product for consumers. And so consumers can come on the Chai app, they can come on the Chai app, they can come on the Chai app, they can interact with our AI, and they can interact with other UGC. And it's really just these kind of bots. And it's a thin layer of UGC. Okay. Our mission is not to just have a very thin layer of UGC. Our mission is to have as much UGC as possible. So we must have, I don't want people at Chai training the AI. I want people, not middle aged men, building AI. I want everyone building the AI, as many people building the AI as possible. Okay, so what we built was we built Chaiverse. And Chaiverse is kind of, it's kind of like a prototype, is the way to think about it. And it started with this, this observation that, well, how many models get submitted into Hugging Face a day? It's hundreds, it's hundreds, right? So there's hundreds of LLMs submitted each day. Now consider that, what does it take to build an LLM? It takes a lot of work, actually. It's like someone devoted several hours of compute, several hours of their time, prepared a data set, launched it, ran it, evaluated it, submitted it, right? So there's a lot of, there's a lot of, there's a lot of work that's going into that. So what we did was we said, well, why can't we host their models for them and serve them to users? And then what would that look like? The first issue is, well, how do you know if a model is good or not? Like, we don't want to serve users the crappy models, right? So what we would do is we would, I love the LMSYS style. I think it's really cool. It's really simple. It's a very intuitive thing, which is you simply present the users with two completions. You can say, look, this is from model one. This is from model two. This is from model three. This is from model A. This is from model B, which is better. And so if someone submits a model to Chaiverse, what we do is we spin up a GPU. We download the model. We're going to now host that model on this GPU. And we're going to start routing traffic to it. And we're going to send, we think it takes about 5,000 completions to get an accurate signal. That's roughly what LMSYS does. And from that, we're able to get an accurate ranking. And we're able to get an accurate ranking. And we're able to get an accurate ranking of which models are people finding entertaining and which models are not entertaining. If you look at the bottom 80%, they'll suck. You can just disregard them. They totally suck. Then when you get the top 20%, you know you've got a decent model, but you can break it down into more nuance. There might be one that's really descriptive. There might be one that's got a lot of personality to it. There might be one that's really illogical. Then the question is, well, what do you do with these top models? From that, you can do more sophisticated things. You can try and do like a routing thing where you say for a given user request, we're going to try and predict which of these end models that users enjoy the most. That turns out to be pretty expensive and not a huge source of like edge or improvement. Something that we love to do at Chai is blending, which is, you know, it's the simplest way to think about it is you're going to end up, and you're going to pretty quickly see you've got one model that's really smart, one model that's really funny. How do you get the user an experience that is both smart and funny? Well, just 50% of the requests, you can serve them the smart model, 50% of the requests, you serve them the funny model. Just a random 50%? Just a random, yeah. And then... That's blending? That's blending. You can do more sophisticated things on top of that, as in all things in life, but the 80-20 solution, if you just do that, you get a pretty powerful effect out of the gate. Random number generator. I think it's like the robustness of randomness. Random is a very powerful optimization technique, and it's a very robust thing. So you can explore a lot of the space very efficiently. There's one thing that's really, really important to share, and this is the most exciting thing for me, is after you do the ranking, you get an ELO score, and you can track a user's first join date, the first date they submit a model to Chaiverse, they almost always get a terrible ELO, right? So let's say the first submission they get an ELO of 1,100 or 1,000 or something, and you can see that they iterate and they iterate and iterate, and it will be like, no improvement, no improvement, no improvement, and then boom. Do you give them any data, or do you have to come up with this themselves? We do, we do, we do, we do. We try and strike a balance between giving them data that's very useful, you've got to be compliant with GDPR, which is like, you have to work very hard to preserve the privacy of users of your app. So we try to give them as much signal as possible, to be helpful. The minimum is we're just going to give you a score, right? That's the minimum. But that alone is people can optimize a score pretty well, because they're able to come up with theories, submit it, does it work? No. A new theory, does it work? No. And then boom, as soon as they figure something out, they keep it, and then they iterate, and then boom,Alessio [00:51:46]: they figure something out, and they keep it. Last year, you had this post on your blog, cross-sourcing the lead to the 10 trillion parameter, AGI, and you call it a mixture of experts, recommenders. Yep. Any insights?William [00:51:58]: Updated thoughts, 12 months later? I think the odds, the timeline for AGI has certainly been pushed out, right? Now, this is in, I'm a controversial person, I don't know, like, I just think... You don't believe in scaling laws, you think AGI is further away. I think it's an S-curve. I think everything's an S-curve. And I think that the models have proven to just be far worse at reasoning than people sort of thought. And I think whenever I hear people talk about LLMs as reasoning engines, I sort of cringe a bit. I don't think that's what they are. I think of them more as like a simulator. I think of them as like a, right? So they get trained to predict the next most likely token. It's like a physics simulation engine. So you get these like games where you can like construct a bridge, and you drop a car down, and then it predicts what should happen. And that's really what LLMs are doing. It's not so much that they're reasoning, it's more that they're just doing the most likely thing. So fundamentally, the ability for people to add in intelligence, I think is very limited. What most people would consider intelligence, I think the AI is not a crowdsourcing problem, right? Now with Wikipedia, Wikipedia crowdsources knowledge. It doesn't crowdsource intelligence. So it's a subtle distinction. AI is fantastic at knowledge. I think it's weak at intelligence. And a lot, it's easy to conflate the two because if you ask it a question and it gives you, you know, if you said, who was the seventh president of the United States, and it gives you the correct answer, I'd say, well, I don't know the answer to that. And you can conflate that with intelligence. But really, that's a question of knowledge. And knowledge is really this thing about saying, how can I store all of this information? And then how can I retrieve something that's relevant? Okay, they're fantastic at that. They're fantastic at storing knowledge and retrieving the relevant knowledge. They're superior to humans in that regard. And so I think we need to come up for a new word. How does one describe AI should contain more knowledge than any individual human? It should be more accessible than any individual human. That's a very powerful thing. That's superswyx [00:54:07]: powerful. But what words do we use to describe that? We had a previous guest on Exa AI that does search. And he tried to coin super knowledge as the opposite of super intelligence.William [00:54:20]: Exactly. I think super knowledge is a more accurate word for it.swyx [00:54:24]: You can store more things than any human can.William [00:54:26]: And you can retrieve it better than any human can as well. And I think it's those two things combined that's special. I think that thing will exist. That thing can be built. And I think you can start with something that's entertaining and fun. And I think, I often think it's like, look, it's going to be a 20 year journey. And we're in like, year four, or it's like the web. And this is like 1998 or something. You know, you've got a long, long way to go before the Amazon.coms are like these huge, multi trillion dollar businesses that every single person uses every day. And so AI today is very simplistic. And it's fundamentally the way we're using it, the flywheels, and this ability for how can everyone contribute to it to really magnify the value that it brings. Right now, like, I think it's a bit sad. It's like, right now you have big labs, I'm going to pick on open AI. And they kind of go to like these human labelers. And they say, we're going to pay you to just label this like subset of questions that we want to get a really high quality data set, then we're going to get like our own computers that are really powerful. And that's kind of like the thing. For me, it's so much like Encyclopedia Britannica. It's like insane. All the people that were interested in blockchain, it's like, well, this is this is what needs to be decentralized, you need to decentralize that thing. Because if you distribute it, people can generate way more data in a distributed fashion, way more, right? You need the incentive. Yeah, of course. Yeah. But I mean, the, the, that's kind of the exciting thing about Wikipedia was it's this understanding, like the incentives, you don't need money to incentivize people. You don't need dog coins. No. Sometimes, sometimes people get the satisfaction fro

united states god tv new york spotify amazon tiktok money ai new york city google hollywood uk disney strategy research gold chinese elon musk cost western kanye west transition harvard 3d chatgpt started tree humans beast silicon valley reddit comparison oxford amazon prime products shakespeare origin mma rejection joe rogan wikipedia cambridge trading views bay area apollo revenue mark zuckerberg weed sort fantastic steve jobs cto iq vc kindle openai user evaluation launched initial ux api tyranny gpt gdpr python lama github neuralink citadel llm palo alto consuming sam altman crossed gpu chai pto agi elo bytedance burr git b2b saas kv ugc cobalt crowdsourcing noam v2 alessio mixture cmu animations inference britannica beauchamp gcp decibel vertex a16z alphago cuda encyclopedia britannica torquay pmf firebase dpo susquehanna dau dae mlas o3 mk1 hft character ai minimax o1 outlasting mml elos chat ai latent space

Latent.Space 2024 Year in Review

Latent Space: The AI Engineer Podcast â€” CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Dec 31, 2024 111:07

Applications for the 2025 AI Engineer Summit are up, and you can save the date for AIE Singapore in April and AIE World's Fair 2025 in June.Happy new year, and thanks for 100 great episodes! Please let us know what you want to see/hear for the next 100!Full YouTube Episode with Slides/ChartsLike and subscribe and hit that bell to get notifs!Timestamps* 00:00 Welcome to the 100th Episode!* 00:19 Reflecting on the Journey* 00:47 AI Engineering: The Rise and Impact* 03:15 Latent Space Live and AI Conferences* 09:44 The Competitive AI Landscape* 21:45 Synthetic Data and Future Trends* 35:53 Creative Writing with AI* 36:12 Legal and Ethical Issues in AI* 38:18 The Data War: GPU Poor vs. GPU Rich* 39:12 The Rise of GPU Ultra Rich* 40:47 Emerging Trends in AI Models* 45:31 The Multi-Modality War* 01:05:31 The Future of AI Benchmarks* 01:13:17 Pionote and Frontier Models* 01:13:47 Niche Models and Base Models* 01:14:30 State Space Models and RWKB* 01:15:48 Inference Race and Price Wars* 01:22:16 Major AI Themes of the Year* 01:22:48 AI Rewind: January to March* 01:26:42 AI Rewind: April to June* 01:33:12 AI Rewind: July to September* 01:34:59 AI Rewind: October to December* 01:39:53 Year-End Reflections and PredictionsTranscript[00:00:00] Welcome to the 100th Episode![00:00:00] Alessio: Hey everyone, welcome to the Latent Space Podcast. This is Alessio, partner and CTO at Decibel Partners, and I'm joined by my co host Swyx for the 100th time today.[00:00:12] swyx: Yay, um, and we're so glad that, yeah, you know, everyone has, uh, followed us in this journey. How do you feel about it? 100 episodes.[00:00:19] Alessio: Yeah, I know.[00:00:19] Reflecting on the Journey[00:00:19] Alessio: Almost two years that we've been doing this. We've had four different studios. Uh, we've had a lot of changes. You know, we used to do this lightning round. When we first started that we didn't like, and we tried to change the question. The answer[00:00:32] swyx: was cursor and perplexity.[00:00:34] Alessio: Yeah, I love mid journey. It's like, do you really not like anything else?[00:00:38] Alessio: Like what's, what's the unique thing? And I think, yeah, we, we've also had a lot more research driven content. You know, we had like 3DAO, we had, you know. Jeremy Howard, we had more folks like that.[00:00:47] AI Engineering: The Rise and Impact[00:00:47] Alessio: I think we want to do more of that too in the new year, like having, uh, some of the Gemini folks, both on the research and the applied side.[00:00:54] Alessio: Yeah, but it's been a ton of fun. I think we both started, I wouldn't say as a joke, we were kind of like, Oh, we [00:01:00] should do a podcast. And I think we kind of caught the right wave, obviously. And I think your rise of the AI engineer posts just kind of get people. Sombra to congregate, and then the AI engineer summit.[00:01:11] Alessio: And that's why when I look at our growth chart, it's kind of like a proxy for like the AI engineering industry as a whole, which is almost like, like, even if we don't do that much, we keep growing just because there's so many more AI engineers. So did you expect that growth or did you expect that would take longer for like the AI engineer thing to kind of like become, you know, everybody talks about it today.[00:01:32] swyx: So, the sign of that, that we have won is that Gartner puts it at the top of the hype curve right now. So Gartner has called the peak in AI engineering. I did not expect, um, to what level. I knew that I was correct when I called it because I did like two months of work going into that. But I didn't know, You know, how quickly it could happen, and obviously there's a chance that I could be wrong.[00:01:52] swyx: But I think, like, most people have come around to that concept. Hacker News hates it, which is a good sign. But there's enough people that have defined it, you know, GitHub, when [00:02:00] they launched GitHub Models, which is the Hugging Face clone, they put AI engineers in the banner, like, above the fold, like, in big So I think it's like kind of arrived as a meaningful and useful definition.[00:02:12] swyx: I think people are trying to figure out where the boundaries are. I think that was a lot of the quote unquote drama that happens behind the scenes at the World's Fair in June. Because I think there's a lot of doubt or questions about where ML engineering stops and AI engineering starts. That's a useful debate to be had.[00:02:29] swyx: In some sense, I actually anticipated that as well. So I intentionally did not. Put a firm definition there because most of the successful definitions are necessarily underspecified and it's actually useful to have different perspectives and you don't have to specify everything from the outset.[00:02:45] Alessio: Yeah, I was at um, AWS reInvent and the line to get into like the AI engineering talk, so to speak, which is, you know, applied AI and whatnot was like, there are like hundreds of people just in line to go in.[00:02:56] Alessio: I think that's kind of what enabled me. People, right? Which is what [00:03:00] you kind of talked about. It's like, Hey, look, you don't actually need a PhD, just, yeah, just use the model. And then maybe we'll talk about some of the blind spots that you get as an engineer with the earlier posts that we also had on on the sub stack.[00:03:11] Alessio: But yeah, it's been a heck of a heck of a two years.[00:03:14] swyx: Yeah.[00:03:15] Latent Space Live and AI Conferences[00:03:15] swyx: You know, I was, I was trying to view the conference as like, so NeurIPS is I think like 16, 17, 000 people. And the Latent Space Live event that we held there was 950 signups. I think. The AI world, the ML world is still very much research heavy. And that's as it should be because ML is very much in a research phase.[00:03:34] swyx: But as we move this entire field into production, I think that ratio inverts into becoming more engineering heavy. So at least I think engineering should be on the same level, even if it's never as prestigious, like it'll always be low status because at the end of the day, you're manipulating APIs or whatever.[00:03:51] swyx: But Yeah, wrapping GPTs, but there's going to be an increasing stack and an art to doing these, these things well. And I, you know, I [00:04:00] think that's what we're focusing on for the podcast, the conference and basically everything I do seems to make sense. And I think we'll, we'll talk about the trends here that apply.[00:04:09] swyx: It's, it's just very strange. So, like, there's a mix of, like, keeping on top of research while not being a researcher and then putting that research into production. So, like, people always ask me, like, why are you covering Neuralibs? Like, this is a ML research conference and I'm like, well, yeah, I mean, we're not going to, to like, understand everything Or reproduce every single paper, but the stuff that is being found here is going to make it through into production at some point, you hope.[00:04:32] swyx: And then actually like when I talk to the researchers, they actually get very excited because they're like, oh, you guys are actually caring about how this goes into production and that's what they really really want. The measure of success is previously just peer review, right? Getting 7s and 8s on their um, Academic review conferences and stuff like citations is one metric, but money is a better metric.[00:04:51] Alessio: Money is a better metric. Yeah, and there were about 2200 people on the live stream or something like that. Yeah, yeah. Hundred on the live stream. So [00:05:00] I try my best to moderate, but it was a lot spicier in person with Jonathan and, and Dylan. Yeah, that it was in the chat on YouTube.[00:05:06] swyx: I would say that I actually also created.[00:05:09] swyx: Layen Space Live in order to address flaws that are perceived in academic conferences. This is not NeurIPS specific, it's ICML, NeurIPS. Basically, it's very sort of oriented towards the PhD student, uh, market, job market, right? Like literally all, basically everyone's there to advertise their research and skills and get jobs.[00:05:28] swyx: And then obviously all the, the companies go there to hire them. And I think that's great for the individual researchers, but for people going there to get info is not great because you have to read between the lines, bring a ton of context in order to understand every single paper. So what is missing is effectively what I ended up doing, which is domain by domain, go through and recap the best of the year.[00:05:48] swyx: Survey the field. And there are, like NeurIPS had a, uh, I think ICML had a like a position paper track, NeurIPS added a benchmarks, uh, datasets track. These are ways in which to address that [00:06:00] issue. Uh, there's always workshops as well. Every, every conference has, you know, a last day of workshops and stuff that provide more of an overview.[00:06:06] swyx: But they're not specifically prompted to do so. And I think really, uh, Organizing a conference is just about getting good speakers and giving them the correct prompts. And then they will just go and do that thing and they do a very good job of it. So I think Sarah did a fantastic job with the startups prompt.[00:06:21] swyx: I can't list everybody, but we did best of 2024 in startups, vision, open models. Post transformers, synthetic data, small models, and agents. And then the last one was the, uh, and then we also did a quick one on reasoning with Nathan Lambert. And then the last one, obviously, was the debate that people were very hyped about.[00:06:39] swyx: It was very awkward. And I'm really, really thankful for John Franco, basically, who stepped up to challenge Dylan. Because Dylan was like, yeah, I'll do it. But He was pro scaling. And I think everyone who is like in AI is pro scaling, right? So you need somebody who's ready to publicly say, no, we've hit a wall.[00:06:57] swyx: So that means you're saying Sam Altman's wrong. [00:07:00] You're saying, um, you know, everyone else is wrong. It helps that this was the day before Ilya went on, went up on stage and then said pre training has hit a wall. And data has hit a wall. So actually Jonathan ended up winning, and then Ilya supported that statement, and then Noam Brown on the last day further supported that statement as well.[00:07:17] swyx: So it's kind of interesting that I think the consensus kind of going in was that we're not done scaling, like you should believe in a better lesson. And then, four straight days in a row, you had Sepp Hochreiter, who is the creator of the LSTM, along with everyone's favorite OG in AI, which is Juergen Schmidhuber.[00:07:34] swyx: He said that, um, we're pre trading inside a wall, or like, we've run into a different kind of wall. And then we have, you know John Frankel, Ilya, and then Noam Brown are all saying variations of the same thing, that we have hit some kind of wall in the status quo of what pre trained, scaling large pre trained models has looked like, and we need a new thing.[00:07:54] swyx: And obviously the new thing for people is some make, either people are calling it inference time compute or test time [00:08:00] compute. I think the collective terminology has been inference time, and I think that makes sense because test time, calling it test, meaning, has a very pre trained bias, meaning that the only reason for running inference at all is to test your model.[00:08:11] swyx: That is not true. Right. Yeah. So, so, I quite agree that. OpenAI seems to have adopted, or the community seems to have adopted this terminology of ITC instead of TTC. And that, that makes a lot of sense because like now we care about inference, even right down to compute optimality. Like I actually interviewed this author who recovered or reviewed the Chinchilla paper.[00:08:31] swyx: Chinchilla paper is compute optimal training, but what is not stated in there is it's pre trained compute optimal training. And once you start caring about inference, compute optimal training, you have a different scaling law. And in a way that we did not know last year.[00:08:45] Alessio: I wonder, because John is, he's also on the side of attention is all you need.[00:08:49] Alessio: Like he had the bet with Sasha. So I'm curious, like he doesn't believe in scaling, but he thinks the transformer, I wonder if he's still. So, so,[00:08:56] swyx: so he, obviously everything is nuanced and you know, I told him to play a character [00:09:00] for this debate, right? So he actually does. Yeah. He still, he still believes that we can scale more.[00:09:04] swyx: Uh, he just assumed the character to be very game for, for playing this debate. So even more kudos to him that he assumed a position that he didn't believe in and still won the debate.[00:09:16] Alessio: Get rekt, Dylan. Um, do you just want to quickly run through some of these things? Like, uh, Sarah's presentation, just the highlights.[00:09:24] swyx: Yeah, we can't go through everyone's slides, but I pulled out some things as a factor of, like, stuff that we were going to talk about. And we'll[00:09:30] Alessio: publish[00:09:31] swyx: the rest. Yeah, we'll publish on this feed the best of 2024 in those domains. And hopefully people can benefit from the work that our speakers have done.[00:09:39] swyx: But I think it's, uh, these are just good slides. And I've been, I've been looking for a sort of end of year recaps from, from people.[00:09:44] The Competitive AI Landscape[00:09:44] swyx: The field has progressed a lot. You know, I think the max ELO in 2023 on LMSys used to be 1200 for LMSys ELOs. And now everyone is at least at, uh, 1275 in their ELOs, and this is across Gemini, Chadjibuti, [00:10:00] Grok, O1.[00:10:01] swyx: ai, which with their E Large model, and Enthopic, of course. It's a very, very competitive race. There are multiple Frontier labs all racing, but there is a clear tier zero Frontier. And then there's like a tier one. It's like, I wish I had everything else. Tier zero is extremely competitive. It's effectively now three horse race between Gemini, uh, Anthropic and OpenAI.[00:10:21] swyx: I would say that people are still holding out a candle for XAI. XAI, I think, for some reason, because their API was very slow to roll out, is not included in these metrics. So it's actually quite hard to put on there. As someone who also does charts, XAI is continually snubbed because they don't work well with the benchmarking people.[00:10:42] swyx: Yeah, yeah, yeah. It's a little trivia for why XAI always gets ignored. The other thing is market share. So these are slides from Sarah. We have it up on the screen. It has gone from very heavily open AI. So we have some numbers and estimates. These are from RAMP. Estimates of open AI market share in [00:11:00] December 2023.[00:11:01] swyx: And this is basically, what is it, GPT being 95 percent of production traffic. And I think if you correlate that with stuff that we asked. Harrison Chase on the LangChain episode, it was true. And then CLAUD 3 launched mid middle of this year. I think CLAUD 3 launched in March, CLAUD 3. 5 Sonnet was in June ish.[00:11:23] swyx: And you can start seeing the market share shift towards opening, uh, towards that topic, uh, very, very aggressively. The more recent one is Gemini. So if I scroll down a little bit, this is an even more recent dataset. So RAM's dataset ends in September 2 2. 2024. Gemini has basically launched a price war at the low end, uh, with Gemini Flash, uh, being basically free for personal use.[00:11:44] swyx: Like, I think people don't understand the free tier. It's something like a billion tokens per day. Unless you're trying to abuse it, you cannot really exhaust your free tier on Gemini. They're really trying to get you to use it. They know they're in like third place, um, fourth place, depending how you, how you count.[00:11:58] swyx: And so they're going after [00:12:00] the Lower tier first, and then, you know, maybe the upper tier later, but yeah, Gemini Flash, according to OpenRouter, is now 50 percent of their OpenRouter requests. Obviously, these are the small requests. These are small, cheap requests that are mathematically going to be more.[00:12:15] swyx: The smart ones obviously are still going to OpenAI. But, you know, it's a very, very big shift in the market. Like basically 2023, 2022, To going into 2024 opening has gone from nine five market share to Yeah. Reasonably somewhere between 50 to 75 market share.[00:12:29] Alessio: Yeah. I'm really curious how ramped does the attribution to the model?[00:12:32] Alessio: If it's API, because I think it's all credit card spin. . Well, but it's all, the credit card doesn't say maybe. Maybe the, maybe when they do expenses, they upload the PDF, but yeah, the, the German I think makes sense. I think that was one of my main 2024 takeaways that like. The best small model companies are the large labs, which is not something I would have thought that the open source kind of like long tail would be like the small model.[00:12:53] swyx: Yeah, different sizes of small models we're talking about here, right? Like so small model here for Gemini is AB, [00:13:00] right? Uh, mini. We don't know what the small model size is, but yeah, it's probably in the double digits or maybe single digits, but probably double digits. The open source community has kind of focused on the one to three B size.[00:13:11] swyx: Mm-hmm . Yeah. Maybe[00:13:12] swyx: zero, maybe 0.5 B uh, that's moon dream and that is small for you then, then that's great. It makes sense that we, we have a range for small now, which is like, may, maybe one to five B. Yeah. I'll even put that at, at, at the high end. And so this includes Gemma from Gemini as well. But also includes the Apple Foundation models, which I think Apple Foundation is 3B.[00:13:32] Alessio: Yeah. No, that's great. I mean, I think in the start small just meant cheap. I think today small is actually a more nuanced discussion, you know, that people weren't really having before.[00:13:43] swyx: Yeah, we can keep going. This is a slide that I smiley disagree with Sarah. She's pointing to the scale SEAL leaderboard. I think the Researchers that I talked with at NeurIPS were kind of positive on this because basically you need private test [00:14:00] sets to prevent contamination.[00:14:02] swyx: And Scale is one of maybe three or four people this year that has really made an effort in doing a credible private test set leaderboard. Llama405B does well compared to Gemini and GPT 40. And I think that's good. I would say that. You know, it's good to have an open model that is that big, that does well on those metrics.[00:14:23] swyx: But anyone putting 405B in production will tell you, if you scroll down a little bit to the artificial analysis numbers, that it is very slow and very expensive to infer. Um, it doesn't even fit on like one node. of, uh, of H100s. Cerebras will be happy to tell you they can serve 4 or 5B on their super large chips.[00:14:42] swyx: But, um, you know, if you need to do anything custom to it, you're still kind of constrained. So, is 4 or 5B really that relevant? Like, I think most people are basically saying that they only use 4 or 5B as a teacher model to distill down to something. Even Meta is doing it. So with Lama 3. [00:15:00] 3 launched, they only launched the 70B because they use 4 or 5B to distill the 70B.[00:15:03] swyx: So I don't know if like open source is keeping up. I think they're the, the open source industrial complex is very invested in telling you that the, if the gap is narrowing, I kind of disagree. I think that the gap is widening with O1. I think there are very, very smart people trying to narrow that gap and they should.[00:15:22] swyx: I really wish them success, but you cannot use a chart that is nearing 100 in your saturation chart. And look, the distance between open source and closed source is narrowing. Of course it's going to narrow because you're near 100. This is stupid. But in metrics that matter, is open source narrowing?[00:15:38] swyx: Probably not for O1 for a while. And it's really up to the open source guys to figure out if they can match O1 or not.[00:15:46] Alessio: I think inference time compute is bad for open source just because, you know, Doc can donate the flops at training time, but he cannot donate the flops at inference time. So it's really hard to like actually keep up on that axis.[00:15:59] Alessio: Big, big business [00:16:00] model shift. So I don't know what that means for the GPU clouds. I don't know what that means for the hyperscalers, but obviously the big labs have a lot of advantage. Because, like, it's not a static artifact that you're putting the compute in. You're kind of doing that still, but then you're putting a lot of computed inference too.[00:16:17] swyx: Yeah, yeah, yeah. Um, I mean, Llama4 will be reasoning oriented. We talked with Thomas Shalom. Um, kudos for getting that episode together. That was really nice. Good, well timed. Actually, I connected with the AI meta guy, uh, at NeurIPS, and, um, yeah, we're going to coordinate something for Llama4. Yeah, yeah,[00:16:32] Alessio: and our friend, yeah.[00:16:33] Alessio: Clara Shi just joined to lead the business agent side. So I'm sure we'll have her on in the new year.[00:16:39] swyx: Yeah. So, um, my comment on, on the business model shift, this is super interesting. Apparently it is wide knowledge that OpenAI wanted more than 6. 6 billion dollars for their fundraise. They wanted to raise, you know, higher, and they did not.[00:16:51] swyx: And what that means is basically like, it's very convenient that we're not getting GPT 5, which would have been a larger pre train. We should have a lot of upfront money. And [00:17:00] instead we're, we're converting fixed costs into variable costs, right. And passing it on effectively to the customer. And it's so much easier to take margin there because you can directly attribute it to like, Oh, you're using this more.[00:17:12] swyx: Therefore you, you pay more of the cost and I'll just slap a margin in there. So like that lets you control your growth margin and like tie your. Your spend, or your sort of inference spend, accordingly. And it's just really interesting to, that this change in the sort of inference paradigm has arrived exactly at the same time that the funding environment for pre training is effectively drying up, kind of.[00:17:36] swyx: I feel like maybe the VCs are very in tune with research anyway, so like, they would have noticed this, but, um, it's just interesting.[00:17:43] Alessio: Yeah, and I was looking back at our yearly recap of last year. Yeah. And the big thing was like the mixed trial price fights, you know, and I think now it's almost like there's nowhere to go, like, you know, Gemini Flash is like basically giving it away for free.[00:17:55] Alessio: So I think this is a good way for the labs to generate more revenue and pass down [00:18:00] some of the compute to the customer. I think they're going to[00:18:02] swyx: keep going. I think that 2, will come.[00:18:05] Alessio: Yeah, I know. Totally. I mean, next year, the first thing I'm doing is signing up for Devin. Signing up for the pro chat GBT.[00:18:12] Alessio: Just to try. I just want to see what does it look like to spend a thousand dollars a month on AI?[00:18:17] swyx: Yes. Yes. I think if your, if your, your job is a, at least AI content creator or VC or, you know, someone who, whose job it is to stay on, stay on top of things, you should already be spending like a thousand dollars a month on, on stuff.[00:18:28] swyx: And then obviously easy to spend, hard to use. You have to actually use. The good thing is that actually Google lets you do a lot of stuff for free now. So like deep research. That they just launched. Uses a ton of inference and it's, it's free while it's in preview.[00:18:45] Alessio: Yeah. They need to put that in Lindy.[00:18:47] Alessio: I've been using Lindy lately. I've been a built a bunch of things once we had flow because I liked the new thing. It's pretty good. I even did a phone call assistant. Um, yeah, they just launched Lindy voice. Yeah, I think once [00:19:00] they get advanced voice mode like capability today, still like speech to text, you can kind of tell.[00:19:06] Alessio: Um, but it's good for like reservations and things like that. So I have a meeting prepper thing. And so[00:19:13] swyx: it's good. Okay. I feel like we've, we've covered a lot of stuff. Uh, I, yeah, I, you know, I think We will go over the individual, uh, talks in a separate episode. Uh, I don't want to take too much time with, uh, this stuff, but that suffice to say that there is a lot of progress in each field.[00:19:28] swyx: Uh, we covered vision. Basically this is all like the audience voting for what they wanted. And then I just invited the best people I could find in each audience, especially agents. Um, Graham, who I talked to at ICML in Vienna, he is currently still number one. It's very hard to stay on top of SweetBench.[00:19:45] swyx: OpenHand is currently still number one. switchbench full, which is the hardest one. He had very good thoughts on agents, which I, which I'll highlight for people. Everyone is saying 2025 is the year of agents, just like they said last year. And, uh, but he had [00:20:00] thoughts on like eight parts of what are the frontier problems to solve in agents.[00:20:03] swyx: And so I'll highlight that talk as well.[00:20:05] Alessio: Yeah. The number six, which is the Hacken agents learn more about the environment, has been a Super interesting to us as well, just to think through, because, yeah, how do you put an agent in an enterprise where most things in an enterprise have never been public, you know, a lot of the tooling, like the code bases and things like that.[00:20:23] Alessio: So, yeah, there's not indexing and reg. Well, yeah, but it's more like. You can't really rag things that are not documented. But people know them based on how they've been doing it. You know, so I think there's almost this like, you know, Oh, institutional knowledge. Yeah, the boring word is kind of like a business process extraction.[00:20:38] Alessio: Yeah yeah, I see. It's like, how do you actually understand how these things are done? I see. Um, and I think today the, the problem is that, Yeah, the agents are, that most people are building are good at following instruction, but are not as good as like extracting them from you. Um, so I think that will be a big unlock just to touch quickly on the Jeff Dean thing.[00:20:55] Alessio: I thought it was pretty, I mean, we'll link it in the, in the things, but. I think the main [00:21:00] focus was like, how do you use ML to optimize the systems instead of just focusing on ML to do something else? Yeah, I think speculative decoding, we had, you know, Eugene from RWKB on the podcast before, like he's doing a lot of that with Fetterless AI.[00:21:12] swyx: Everyone is. I would say it's the norm. I'm a little bit uncomfortable with how much it costs, because it does use more of the GPU per call. But because everyone is so keen on fast inference, then yeah, makes sense.[00:21:24] Alessio: Exactly. Um, yeah, but we'll link that. Obviously Jeff is great.[00:21:30] swyx: Jeff is, Jeff's talk was more, it wasn't focused on Gemini.[00:21:33] swyx: I think people got the wrong impression from my tweet. It's more about how Google approaches ML and uses ML to design systems and then systems feedback into ML. And I think this ties in with Lubna's talk.[00:21:45] Synthetic Data and Future Trends[00:21:45] swyx: on synthetic data where it's basically the story of bootstrapping of humans and AI in AI research or AI in production.[00:21:53] swyx: So her talk was on synthetic data, where like how much synthetic data has grown in 2024 in the pre training side, the post training side, [00:22:00] and the eval side. And I think Jeff then also extended it basically to chips, uh, to chip design. So he'd spend a lot of time talking about alpha chip. And most of us in the audience are like, we're not working on hardware, man.[00:22:11] swyx: Like you guys are great. TPU is great. Okay. We'll buy TPUs.[00:22:14] Alessio: And then there was the earlier talk. Yeah. But, and then we have, uh, I don't know if we're calling them essays. What are we calling these? But[00:22:23] swyx: for me, it's just like bonus for late in space supporters, because I feel like they haven't been getting anything.[00:22:29] swyx: And then I wanted a more high frequency way to write stuff. Like that one I wrote in an afternoon. I think basically we now have an answer to what Ilya saw. It's one year since. The blip. And we know what he saw in 2014. We know what he saw in 2024. We think we know what he sees in 2024. He gave some hints and then we have vague indications of what he saw in 2023.[00:22:54] swyx: So that was the Oh, and then 2016 as well, because of this lawsuit with Elon, OpenAI [00:23:00] is publishing emails from Sam's, like, his personal text messages to Siobhan, Zelis, or whatever. So, like, we have emails from Ilya saying, this is what we're seeing in OpenAI, and this is why we need to scale up GPUs. And I think it's very prescient in 2016 to write that.[00:23:16] swyx: And so, like, it is exactly, like, basically his insights. It's him and Greg, basically just kind of driving the scaling up of OpenAI, while they're still playing Dota. They're like, no, like, we see the path here.[00:23:30] Alessio: Yeah, and it's funny, yeah, they even mention, you know, we can only train on 1v1 Dota. We need to train on 5v5, and that takes too many GPUs.[00:23:37] Alessio: Yeah,[00:23:37] swyx: and at least for me, I can speak for myself, like, I didn't see the path from Dota to where we are today. I think even, maybe if you ask them, like, they wouldn't necessarily draw a straight line. Yeah,[00:23:47] Alessio: no, definitely. But I think like that was like the whole idea of almost like the RL and we talked about this with Nathan on his podcast.[00:23:55] Alessio: It's like with RL, you can get very good at specific things, but then you can't really like generalize as much. And I [00:24:00] think the language models are like the opposite, which is like, you're going to throw all this data at them and scale them up, but then you really need to drive them home on a specific task later on.[00:24:08] Alessio: And we'll talk about the open AI reinforcement, fine tuning, um, announcement too, and all of that. But yeah, I think like scale is all you need. That's kind of what Elia will be remembered for. And I think just maybe to clarify on like the pre training is over thing that people love to tweet. I think the point of the talk was like everybody, we're scaling these chips, we're scaling the compute, but like the second ingredient which is data is not scaling at the same rate.[00:24:35] Alessio: So it's not necessarily pre training is over. It's kind of like What got us here won't get us there. In his email, he predicted like 10x growth every two years or something like that. And I think maybe now it's like, you know, you can 10x the chips again, but[00:24:49] swyx: I think it's 10x per year. Was it? I don't know.[00:24:52] Alessio: Exactly. And Moore's law is like 2x. So it's like, you know, much faster than that. And yeah, I like the fossil fuel of AI [00:25:00] analogy. It's kind of like, you know, the little background tokens thing. So the OpenAI reinforcement fine tuning is basically like, instead of fine tuning on data, you fine tune on a reward model.[00:25:09] Alessio: So it's basically like, instead of being data driven, it's like task driven. And I think people have tasks to do, they don't really have a lot of data. So I'm curious to see how that changes, how many people fine tune, because I think this is what people run into. It's like, Oh, you can fine tune llama. And it's like, okay, where do I get the data?[00:25:27] Alessio: To fine tune it on, you know, so it's great that we're moving the thing. And then I really like he had this chart where like, you know, the brain mass and the body mass thing is basically like mammals that scaled linearly by brain and body size, and then humans kind of like broke off the slope. So it's almost like maybe the mammal slope is like the pre training slope.[00:25:46] Alessio: And then the post training slope is like the, the human one.[00:25:49] swyx: Yeah. I wonder what the. I mean, we'll know in 10 years, but I wonder what the y axis is for, for Ilya's SSI. We'll try to get them on.[00:25:57] Alessio: Ilya, if you're listening, you're [00:26:00] welcome here. Yeah, and then he had, you know, what comes next, like agent, synthetic data, inference, compute, I thought all of that was like that.[00:26:05] Alessio: I don't[00:26:05] swyx: think he was dropping any alpha there. Yeah, yeah, yeah.[00:26:07] Alessio: Yeah. Any other new reps? Highlights?[00:26:10] swyx: I think that there was comparatively a lot more work. Oh, by the way, I need to plug that, uh, my friend Yi made this, like, little nice paper. Yeah, that was really[00:26:20] swyx: nice.[00:26:20] swyx: Uh, of, uh, of, like, all the, he's, she called it must read papers of 2024.[00:26:26] swyx: So I laid out some of these at NeurIPS, and it was just gone. Like, everyone just picked it up. Because people are dying for, like, little guidance and visualizations And so, uh, I thought it was really super nice that we got there.[00:26:38] Alessio: Should we do a late in space book for each year? Uh, I thought about it. For each year we should.[00:26:42] Alessio: Coffee table book. Yeah. Yeah. Okay. Put it in the will. Hi, Will. By the way, we haven't introduced you. He's our new, you know, general organist, Jamie. You need to[00:26:52] swyx: pull up more things. One thing I saw that, uh, Okay, one fun one, and then one [00:27:00] more general one. So the fun one is this paper on agent collusion. This is a paper on steganography.[00:27:06] swyx: This is secret collusion among AI agents, multi agent deception via steganography. I tried to go to NeurIPS in order to find these kinds of papers because the real reason Like NeurIPS this year has a lottery system. A lot of people actually even go and don't buy tickets because they just go and attend the side events.[00:27:22] swyx: And then also the people who go and end up crowding around the most popular papers, which you already know and already read them before you showed up to NeurIPS. So the only reason you go there is to talk to the paper authors, but there's like something like 10, 000 other. All these papers out there that, you know, are just people's work that they, that they did on the air and they failed to get attention for one reason or another.[00:27:42] swyx: And this was one of them. Uh, it was like all the way at the back. And this is a deep mind paper that actually focuses on collusion between AI agents, uh, by hiding messages in the text that they generate. Uh, so that's what steganography is. So a very simple example would be the first letter of every word.[00:27:57] swyx: If you Pick that out, you know, and the code sends a [00:28:00] different message than that. But something I've always emphasized is to LLMs, we read left to right. LLMs can read up, down, sideways, you know, in random character order. And it's the same to them as it is to us. So if we were ever to get You know, self motivated, underlined LLMs that we're trying to collaborate to take over the planet.[00:28:19] swyx: This would be how they do it. They spread messages among us in the messages that we generate. And he developed a scaling law for that. So he marked, I'm showing it on screen right now, the emergence of this phenomenon. Basically, for example, for Cypher encoding, GPT 2, Lama 2, mixed trial, GPT 3. 5, zero capabilities, and sudden 4.[00:28:40] swyx: And this is the kind of Jason Wei type emergence properties that people kind of look for. I think what made this paper stand out as well, so he developed the benchmark for steganography collusion, and he also focused on shelling point collusion, which is very low coordination. For agreeing on a decoding encoding format, you kind of need to have some [00:29:00] agreement on that.[00:29:00] swyx: But, but shelling point means like very, very low or almost no coordination. So for example, if I, if I ask someone, if the only message I give you is meet me in New York and you're not aware. Or when you would probably meet me at Grand Central Station. That is the Grand Central Station is a shelling point.[00:29:16] swyx: And it's probably somewhere, somewhere during the day. That is the shelling point of New York is Grand Central. To that extent, shelling points for steganography are things like the, the, the common decoding methods that we talked about. It will be interesting at some point in the future when we are worried about alignment.[00:29:30] swyx: It is not interesting today, but it's interesting that DeepMind is already thinking about this.[00:29:36] Alessio: I think that's like one of the hardest things about NeurIPS. It's like the long tail. I[00:29:41] swyx: found a pricing guy. I'm going to feature him on the podcast. Basically, this guy from NVIDIA worked out the optimal pricing for language models.[00:29:51] swyx: It's basically an econometrics paper at NeurIPS, where everyone else is talking about GPUs. And the guy with the GPUs is[00:29:57] Alessio: talking[00:29:57] swyx: about economics instead. [00:30:00] That was the sort of fun one. So the focus I saw is that model papers at NeurIPS are kind of dead. No one really presents models anymore. It's just data sets.[00:30:12] swyx: This is all the grad students are working on. So like there was a data sets track and then I was looking around like, I was like, you don't need a data sets track because every paper is a data sets paper. And so data sets and benchmarks, they're kind of flip sides of the same thing. So Yeah. Cool. Yeah, if you're a grad student, you're a GPU boy, you kind of work on that.[00:30:30] swyx: And then the, the sort of big model that people walk around and pick the ones that they like, and then they use it in their models. And that's, that's kind of how it develops. I, I feel like, um, like, like you didn't last year, you had people like Hao Tian who worked on Lava, which is take Lama and add Vision.[00:30:47] swyx: And then obviously actually I hired him and he added Vision to Grok. Now he's the Vision Grok guy. This year, I don't think there was any of those.[00:30:55] Alessio: What were the most popular, like, orals? Last year it was like the [00:31:00] Mixed Monarch, I think, was like the most attended. Yeah, uh, I need to look it up. Yeah, I mean, if nothing comes to mind, that's also kind of like an answer in a way.[00:31:10] Alessio: But I think last year there was a lot of interest in, like, furthering models and, like, different architectures and all of that.[00:31:16] swyx: I will say that I felt the orals, oral picks this year were not very good. Either that or maybe it's just a So that's the highlight of how I have changed in terms of how I view papers.[00:31:29] swyx: So like, in my estimation, two of the best papers in this year for datasets or data comp and refined web or fine web. These are two actually industrially used papers, not highlighted for a while. I think DCLM got the spotlight, FineWeb didn't even get the spotlight. So like, it's just that the picks were different.[00:31:48] swyx: But one thing that does get a lot of play that a lot of people are debating is the role that's scheduled. This is the schedule free optimizer paper from Meta from Aaron DeFazio. And this [00:32:00] year in the ML community, there's been a lot of chat about shampoo, soap, all the bathroom amenities for optimizing your learning rates.[00:32:08] swyx: And, uh, most people at the big labs are. Who I asked about this, um, say that it's cute, but it's not something that matters. I don't know, but it's something that was discussed and very, very popular. 4Wars[00:32:19] Alessio: of AI recap maybe, just quickly. Um, where do you want to start? Data?[00:32:26] swyx: So to remind people, this is the 4Wars piece that we did as one of our earlier recaps of this year.[00:32:31] swyx: And the belligerents are on the left, journalists, writers, artists, anyone who owns IP basically, New York Times, Stack Overflow, Reddit, Getty, Sarah Silverman, George RR Martin. Yeah, and I think this year we can add Scarlett Johansson to that side of the fence. So anyone suing, open the eye, basically. I actually wanted to get a snapshot of all the lawsuits.[00:32:52] swyx: I'm sure some lawyer can do it. That's the data quality war. On the right hand side, we have the synthetic data people, and I think we talked about Lumna's talk, you know, [00:33:00] really showing how much synthetic data has come along this year. I think there was a bit of a fight between scale. ai and the synthetic data community, because scale.[00:33:09] swyx: ai published a paper saying that synthetic data doesn't work. Surprise, surprise, scale. ai is the leading vendor of non synthetic data. Only[00:33:17] Alessio: cage free annotated data is useful.[00:33:21] swyx: So I think there's some debate going on there, but I don't think it's much debate anymore that at least synthetic data, for the reasons that are blessed in Luna's talk, Makes sense.[00:33:32] swyx: I don't know if you have any perspectives there.[00:33:34] Alessio: I think, again, going back to the reinforcement fine tuning, I think that will change a little bit how people think about it. I think today people mostly use synthetic data, yeah, for distillation and kind of like fine tuning a smaller model from like a larger model.[00:33:46] Alessio: I'm not super aware of how the frontier labs use it outside of like the rephrase, the web thing that Apple also did. But yeah, I think it'll be. Useful. I think like whether or not that gets us the big [00:34:00] next step, I think that's maybe like TBD, you know, I think people love talking about data because it's like a GPU poor, you know, I think, uh, synthetic data is like something that people can do, you know, so they feel more opinionated about it compared to, yeah, the optimizers stuff, which is like,[00:34:17] swyx: they don't[00:34:17] Alessio: really work[00:34:18] swyx: on.[00:34:18] swyx: I think that there is an angle to the reasoning synthetic data. So this year, we covered in the paper club, the star series of papers. So that's star, Q star, V star. It basically helps you to synthesize reasoning steps, or at least distill reasoning steps from a verifier. And if you look at the OpenAI RFT, API that they released, or that they announced, basically they're asking you to submit graders, or they choose from a preset list of graders.[00:34:49] swyx: Basically It feels like a way to create valid synthetic data for them to fine tune their reasoning paths on. Um, so I think that is another angle where it starts to make sense. And [00:35:00] so like, it's very funny that basically all the data quality wars between Let's say the music industry or like the newspaper publishing industry or the textbooks industry on the big labs.[00:35:11] swyx: It's all of the pre training era. And then like the new era, like the reasoning era, like nobody has any problem with all the reasoning, especially because it's all like sort of math and science oriented with, with very reasonable graders. I think the more interesting next step is how does it generalize beyond STEM?[00:35:27] swyx: We've been using O1 for And I would say like for summarization and creative writing and instruction following, I think it's underrated. I started using O1 in our intro songs before we killed the intro songs, but it's very good at writing lyrics. You know, I can actually say like, I think one of the O1 pro demos.[00:35:46] swyx: All of these things that Noam was showing was that, you know, you can write an entire paragraph or three paragraphs without using the letter A, right?[00:35:53] Creative Writing with AI[00:35:53] swyx: So like, like literally just anything instead of token, like not even token level, character level manipulation and [00:36:00] counting and instruction following. It's, uh, it's very, very strong.[00:36:02] swyx: And so no surprises when I ask it to rhyme, uh, and to, to create song lyrics, it's going to do that very much better than in previous models. So I think it's underrated for creative writing.[00:36:11] Alessio: Yeah.[00:36:12] Legal and Ethical Issues in AI[00:36:12] Alessio: What do you think is the rationale that they're going to have in court when they don't show you the thinking traces of O1, but then they want us to, like, they're getting sued for using other publishers data, you know, but then on their end, they're like, well, you shouldn't be using my data to then train your model.[00:36:29] Alessio: So I'm curious to see how that kind of comes. Yeah, I mean, OPA has[00:36:32] swyx: many ways to publish, to punish people without bringing, taking them to court. Already banned ByteDance for distilling their, their info. And so anyone caught distilling the chain of thought will be just disallowed to continue on, on, on the API.[00:36:44] swyx: And it's fine. It's no big deal. Like, I don't even think that's an issue at all, just because the chain of thoughts are pretty well hidden. Like you have to work very, very hard to, to get it to leak. And then even when it leaks the chain of thought, you don't know if it's, if it's [00:37:00] The bigger concern is actually that there's not that much IP hiding behind it, that Cosign, which we talked about, we talked to him on Dev Day, can just fine tune 4.[00:37:13] swyx: 0 to beat 0. 1 Cloud SONET so far is beating O1 on coding tasks without, at least O1 preview, without being a reasoning model, same for Gemini Pro or Gemini 2. 0. So like, how much is reasoning important? How much of a moat is there in this, like, All of these are proprietary sort of training data that they've presumably accomplished.[00:37:34] swyx: Because even DeepSeek was able to do it. And they had, you know, two months notice to do this, to do R1. So, it's actually unclear how much moat there is. Obviously, you know, if you talk to the Strawberry team, they'll be like, yeah, I mean, we spent the last two years doing this. So, we don't know. And it's going to be Interesting because there'll be a lot of noise from people who say they have inference time compute and actually don't because they just have fancy chain of thought.[00:38:00][00:38:00] swyx: And then there's other people who actually do have very good chain of thought. And you will not see them on the same level as OpenAI because OpenAI has invested a lot in building up the mythology of their team. Um, which makes sense. Like the real answer is somewhere in between.[00:38:13] Alessio: Yeah, I think that's kind of like the main data war story developing.[00:38:18] The Data War: GPU Poor vs. GPU Rich[00:38:18] Alessio: GPU poor versus GPU rich. Yeah. Where do you think we are? I think there was, again, going back to like the small model thing, there was like a time in which the GPU poor were kind of like the rebel faction working on like these models that were like open and small and cheap. And I think today people don't really care as much about GPUs anymore.[00:38:37] Alessio: You also see it in the price of the GPUs. Like, you know, that market is kind of like plummeted because there's people don't want to be, they want to be GPU free. They don't even want to be poor. They just want to be, you know, completely without them. Yeah. How do you think about this war? You[00:38:52] swyx: can tell me about this, but like, I feel like the, the appetite for GPU rich startups, like the, you know, the, the funding plan is we will raise 60 million and [00:39:00] we'll give 50 of that to NVIDIA.[00:39:01] swyx: That is gone, right? Like, no one's, no one's pitching that. This was literally the plan, the exact plan of like, I can name like four or five startups, you know, this time last year. So yeah, GPU rich startups gone.[00:39:12] The Rise of GPU Ultra Rich[00:39:12] swyx: But I think like, The GPU ultra rich, the GPU ultra high net worth is still going. So, um, now we're, you know, we had Leopold's essay on the trillion dollar cluster.[00:39:23] swyx: We're not quite there yet. We have multiple labs, um, you know, XAI very famously, you know, Jensen Huang praising them for being. Best boy number one in spinning up 100, 000 GPU cluster in like 12 days or something. So likewise at Meta, likewise at OpenAI, likewise at the other labs as well. So like the GPU ultra rich are going to keep doing that because I think partially it's an article of faith now that you just need it.[00:39:46] swyx: Like you don't even know what it's going to, what you're going to use it for. You just, you just need it. And it makes sense that if, especially if we're going into. More researchy territory than we are. So let's say 2020 to 2023 was [00:40:00] let's scale big models territory because we had GPT 3 in 2020 and we were like, okay, we'll go from 1.[00:40:05] swyx: 75b to 1. 8b, 1. 8t. And that was GPT 3 to GPT 4. Okay, that's done. As far as everyone is concerned, Opus 3. 5 is not coming out, GPT 4. 5 is not coming out, and Gemini 2, we don't have Pro, whatever. We've hit that wall. Maybe I'll call it the 2 trillion perimeter wall. We're not going to 10 trillion. No one thinks it's a good idea, at least from training costs, from the amount of data, or at least the inference.[00:40:36] swyx: Would you pay 10x the price of GPT Probably not. Like, like you want something else that, that is at least more useful. So it makes sense that people are pivoting in terms of their inference paradigm.[00:40:47] Emerging Trends in AI Models[00:40:47] swyx: And so when it's more researchy, then you actually need more just general purpose compute to mess around with, uh, at the exact same time that production deployments of the old, the previous paradigm is still ramping up,[00:40:58] swyx: um,[00:40:58] swyx: uh, pretty aggressively.[00:40:59] swyx: So [00:41:00] it makes sense that the GPU rich are growing. We have now interviewed both together and fireworks and replicates. Uh, we haven't done any scale yet. But I think Amazon, maybe kind of a sleeper one, Amazon, in a sense of like they, at reInvent, I wasn't expecting them to do so well, but they are now a foundation model lab.[00:41:18] swyx: It's kind of interesting. Um, I think, uh, you know, David went over there and started just creating models.[00:41:25] Alessio: Yeah, I mean, that's the power of prepaid contracts. I think like a lot of AWS customers, you know, they do this big reserve instance contracts and now they got to use their money. That's why so many startups.[00:41:37] Alessio: Get bought through the AWS marketplace so they can kind of bundle them together and prefer pricing.[00:41:42] swyx: Okay, so maybe GPU super rich doing very well, GPU middle class dead, and then GPU[00:41:48] Alessio: poor. I mean, my thing is like, everybody should just be GPU rich. There shouldn't really be, even the GPU poorest, it's like, does it really make sense to be GPU poor?[00:41:57] Alessio: Like, if you're GPU poor, you should just use the [00:42:00] cloud. Yes, you know, and I think there might be a future once we kind of like figure out what the size and shape of these models is where like the tiny box and these things come to fruition where like you can be GPU poor at home. But I think today is like, why are you working so hard to like get these models to run on like very small clusters where it's like, It's so cheap to run them.[00:42:21] Alessio: Yeah, yeah,[00:42:22] swyx: yeah. I think mostly people think it's cool. People think it's a stepping stone to scaling up. So they aspire to be GPU rich one day and they're working on new methods. Like news research, like probably the most deep tech thing they've done this year is Distro or whatever the new name is.[00:42:38] swyx: There's a lot of interest in heterogeneous computing, distributed computing. I tend generally to de emphasize that historically, but it may be coming to a time where it is starting to be relevant. I don't know. You know, SF compute launched their compute marketplace this year, and like, who's really using that?[00:42:53] swyx: Like, it's a bunch of small clusters, disparate types of compute, and if you can make that [00:43:00] useful, then that will be very beneficial to the broader community, but maybe still not the source of frontier models. It's just going to be a second tier of compute that is unlocked for people, and that's fine. But yeah, I mean, I think this year, I would say a lot more on device, We are, I now have Apple intelligence on my phone.[00:43:19] swyx: Doesn't do anything apart from summarize my notifications. But still, not bad. Like, it's multi modal.[00:43:25] Alessio: Yeah, the notification summaries are so and so in my experience.[00:43:29] swyx: Yeah, but they add, they add juice to life. And then, um, Chrome Nano, uh, Gemini Nano is coming out in Chrome. Uh, they're still feature flagged, but you can, you can try it now if you, if you use the, uh, the alpha.[00:43:40] swyx: And so, like, I, I think, like, you know, We're getting the sort of GPU poor version of a lot of these things coming out, and I think it's like quite useful. Like Windows as well, rolling out RWKB in sort of every Windows department is super cool. And I think the last thing that I never put in this GPU poor war, that I think I should now, [00:44:00] is the number of startups that are GPU poor but still scaling very well, as sort of wrappers on top of either a foundation model lab, or GPU Cloud.[00:44:10] swyx: GPU Cloud, it would be Suno. Suno, Ramp has rated as one of the top ranked, fastest growing startups of the year. Um, I think the last public number is like zero to 20 million this year in ARR and Suno runs on Moto. So Suno itself is not GPU rich, but they're just doing the training on, on Moto, uh, who we've also talked to on, on the podcast.[00:44:31] swyx: The other one would be Bolt, straight cloud wrapper. And, and, um, Again, another, now they've announced 20 million ARR, which is another step up from our 8 million that we put on the title. So yeah, I mean, it's crazy that all these GPU pores are finding a way while the GPU riches are also finding a way. And then the only failures, I kind of call this the GPU smiling curve, where the edges do well, because you're either close to the machines, and you're like [00:45:00] number one on the machines, or you're like close to the customers, and you're number one on the customer side.[00:45:03] swyx: And the people who are in the middle. Inflection, um, character, didn't do that great. I think character did the best of all of them. Like, you have a note in here that we apparently said that character's price tag was[00:45:15] Alessio: 1B.[00:45:15] swyx: Did I say that?[00:45:16] Alessio: Yeah. You said Google should just buy them for 1B. I thought it was a crazy number.[00:45:20] Alessio: Then they paid 2. 7 billion. I mean, for like,[00:45:22] swyx: yeah.[00:45:22] Alessio: What do you pay for node? Like, I don't know what the game world was like. Maybe the starting price was 1B. I mean, whatever it was, it worked out for everybody involved.[00:45:31] The Multi-Modality War[00:45:31] Alessio: Multimodality war. And this one, we never had text to video in the first version, which now is the hottest.[00:45:37] swyx: Yeah, I would say it's a subset of image, but yes.[00:45:40] Alessio: Yeah, well, but I think at the time it wasn't really something people were doing, and now we had VO2 just came out yesterday. Uh, Sora was released last month, last week. I've not tried Sora, because the day that I tried, it wasn't, yeah. I[00:45:54] swyx: think it's generally available now, you can go to Sora.[00:45:56] swyx: com and try it. Yeah, they had[00:45:58] Alessio: the outage. Which I [00:46:00] think also played a part into it. Small things. Yeah. What's the other model that you posted today that was on Replicate? Video or OneLive?[00:46:08] swyx: Yeah. Very, very nondescript name, but it is from Minimax, which I think is a Chinese lab. The Chinese labs do surprisingly well at the video models.[00:46:20] swyx: I'm not sure it's actually Chinese. I don't know. Hold me up to that. Yep. China. It's good. Yeah, the Chinese love video. What can I say? They have a lot of training data for video. Or a more relaxed regulatory environment.[00:46:37] Alessio: Uh, well, sure, in some way. Yeah, I don't think there's much else there. I think like, you know, on the image side, I think it's still open.[00:46:45] Alessio: Yeah, I mean,[00:46:46] swyx: 11labs is now a unicorn. So basically, what is multi modality war? Multi modality war is, do you specialize in a single modality, right? Or do you have GodModel that does all the modalities? So this is [00:47:00] definitely still going, in a sense of 11 labs, you know, now Unicorn, PicoLabs doing well, they launched Pico 2.[00:47:06] swyx: 0 recently, HeyGen, I think has reached 100 million ARR, Assembly, I don't know, but they have billboards all over the place, so I assume they're doing very, very well. So these are all specialist models, specialist models and specialist startups. And then there's the big labs who are doing the sort of all in one play.[00:47:24] swyx: And then here I would highlight Gemini 2 for having native image output. Have you seen the demos? Um, yeah, it's, it's hard to keep up. Literally they launched this last week and a shout out to Paige Bailey, who came to the Latent Space event to demo on the day of launch. And she wasn't prepared. She was just like, I'm just going to show you.[00:47:43] swyx: So they have voice. They have, you know, obviously image input, and then they obviously can code gen and all that. But the new one that OpenAI and Meta both have but they haven't launched yet is image output. So you can literally, um, I think their demo video was that you put in an image of a [00:48:00] car, and you ask for minor modifications to that car.[00:48:02] swyx: They can generate you that modification exactly as you asked. So there's no need for the stable diffusion or comfy UI workflow of like mask here and then like infill there in paint there and all that, all that stuff. This is small model nonsense. Big model people are like, huh, we got you in as everything in the transformer.[00:48:21] swyx: This is the multimodality war, which is, do you, do you bet on the God model or do you string together a whole bunch of, uh, Small models like a, like a chump. Yeah,[00:48:29] Alessio: I don't know, man. Yeah, that would be interesting. I mean, obviously I use Midjourney for all of our thumbnails. Um, they've been doing a ton on the product, I would say.[00:48:38] Alessio: They launched a new Midjourney editor thing. They've been doing a ton. Because I think, yeah, the motto is kind of like, Maybe, you know, people say black forest, the black forest models are better than mid journey on a pixel by pixel basis. But I think when you put it, put it together, have you tried[00:48:53] swyx: the same problems on black forest?[00:48:55] Alessio: Yes. But the problem is just like, you know, on black forest, it generates one image. And then it's like, you got to [00:49:00] regenerate. You don't have all these like UI things. Like what I do, no, but it's like time issue, you know, it's like a mid[00:49:06] swyx: journey. Call the API four times.[00:49:08] Alessio: No, but then there's no like variate.[00:49:10] Alessio: Like the good thing about mid journey is like, you just go in there and you're cooking. There's a lot of stuff that just makes it really easy. And I think people underestimate that. Like, it's not really a skill issue, because I'm paying mid journey, so it's a Black Forest skill issue, because I'm not paying them, you know?[00:49:24] Alessio: Yeah,[00:49:25] swyx: so, okay, so, uh, this is a UX thing, right? Like, you, you, you understand that, at least, we think that Black Forest should be able to do all that stuff. I will also shout out, ReCraft has come out, uh, on top of the image arena that, uh, artificial analysis has done, has apparently, uh, Flux's place. Is this still true?[00:49:41] swyx: So, Artificial Analysis is now a company. I highlighted them I think in one of the early AI Newses of the year. And they have launched a whole bunch of arenas. So, they're trying to take on LM Arena, Anastasios and crew. And they have an image arena. Oh yeah, Recraft v3 is now beating Flux 1. 1. Which is very surprising [00:50:00] because Flux And Black Forest Labs are the old stable diffusion crew who left stability after, um, the management issues.[00:50:06] swyx: So Recurve has come from nowhere to be the top image model. Uh, very, very strange. I would also highlight that Grok has now launched Aurora, which is, it's very interesting dynamics between Grok and Black Forest Labs because Grok's images were originally launched, uh, in partnership with Black Forest Labs as a, as a thin wrapper.[00:50:24] swyx: And then Grok was like, no, we'll make our own. And so they've made their own. I don't know, there are no APIs or benchmarks about it. They just announced it. So yeah, that's the multi modality war. I would say that so far, the small model, the dedicated model people are winning, because they are just focused on their tasks.[00:50:42] swyx: But the big model, People are always catching up. And the moment I saw the Gemini 2 demo of image editing, where I can put in an image and just request it and it does, that's how AI should work. Not like a whole bunch of complicated steps. So it really is something. And I think one frontier that we haven't [00:51:00] seen this year, like obviously video has done very well, and it will continue to grow.[00:51:03] swyx: You know, we only have Sora Turbo today, but at some point we'll get full Sora. Oh, at least the Hollywood Labs will get Fulsora. We haven't seen video to audio, or video synced to audio. And so the researchers that I talked to are already starting to talk about that as the next frontier. But there's still maybe like five more years of video left to actually be Soda.[00:51:23] swyx: I would say that Gemini's approach Compared to OpenAI, Gemini seems, or DeepMind's approach to video seems a lot more fully fledged than OpenAI. Because if you look at the ICML recap that I published that so far nobody has listened to, um, that people have listened to it. It's just a different, definitely different audience.[00:51:43] swyx: It's only seven hours long. Why are people not listening? It's like everything in Uh, so, so DeepMind has, is working on Genie. They also launched Genie 2 and VideoPoet. So, like, they have maybe four years advantage on world modeling that OpenAI does not have. Because OpenAI basically only started [00:52:00] Diffusion Transformers last year, you know, when they hired, uh, Bill Peebles.[00:52:03] swyx: So, DeepMind has, has a bit of advantage here, I would say, in, in, in showing, like, the reason that VO2, while one, They cherry pick their videos. So obviously it looks better than Sora, but the reason I would believe that VO2, uh, when it's fully launched will do very well is because they have all this background work in video that they've done for years.[00:52:22] swyx: Like, like last year's NeurIPS, I already was interviewing some of their video people. I forget their model name, but for, for people who are dedicated fans, they can go to NeurIPS 2023 and see, see that paper.[00:52:32] Alessio: And then last but not least, the LLMOS. We renamed it to Ragops, formerly known as[00:52:39] swyx: Ragops War. I put the latest chart on the Braintrust episode.[00:52:43] swyx: I think I'm going to separate these essays from the episode notes. So the reason I used to do that, by the way, is because I wanted to show up on Hacker News. I wanted the podcast to show up on Hacker News. So I always put an essay inside of there because Hacker News people like to read and not listen.[00:52:58] Alessio: So episode essays,[00:52:59] swyx: I remember [00:53:00] purchasing them separately. You say Lanchain Llama Index is still growing.[00:53:03] Alessio: Yeah, so I looked at the PyPy stats, you know. I don't care about stars. On PyPy you see Do you want to share your screen? Yes. I prefer to look at actual downloads, not at stars on GitHub. So if you look at, you know, Lanchain still growing.[00:53:20] Alessio: These are the last six months. Llama Index still growing. What I've basically seen is like things that, One, obviously these things have A commercial product. So there's like people buying this and sticking with it versus kind of hopping in between things versus, you know, for example, crew AI, not really growing as much.[00:53:38] Alessio: The stars are growing. If you look on GitHub, like the stars are growing, but kind of like the usage is kind of like flat. In the last six months, have they done some[00:53:4

god ceo new york spotify amazon time world ai europe google china apple vision pr voice future speaking san francisco new york times phd video thinking chinese simple data predictions elon musk impact surprise iphone chatgpt code legal tesla reflecting memory ga reddit discord busy cloud lgbt flash stem honestly ab pros jeff bezos windows excited researchers ip lower unicorns tackling sort survey insane tier cto whispers applications vc f1 gemini openai doc seal signing fireworks genie academic sf nvidia organizing ux api davos assembly frontier chrome gpt makes scarlett johansson ui aws turbo mm soda bash mosaic ml lama dropbox github creative writing drafting reinvent canvas 1b apis bolt ruler lava exact stripe pico hundred dev flux strawberry vm vcs sander wwdc 200k llm bt arr sora taiwanese opus moto sam altman gartner anthropic google docs assumption nemo blackwell parting google drive gpu grok sombra agi ramp opa tbd 3b 5b elia elo perplexity estimates gnome midjourney bytedance leopold rag ciso gpus dota haiku dx coursera sarah silverman sonnets deepmind cypher getty george rr martin ilya quill sdks cobalt future trends noam v2 alessio ttc sheesh lms satya r1 xai 8b suno ssi mcp veo stack overflow vo2 emerging trends rl mistral itc theoretically gpts replicate sota jensen huang yi black forest inflection databricks aitor graphql brain trust ai models chinchillas adept nosql grand central hacker news grand central station hacken zep ethical issues ai news cosign claud gpc tpu distro lubna heygen o3 neo4j autogpt gbt minimax gpd o1 jeremy howard 70b quent langchain 400b gradients exa neurips loras jeff dean 128k gemini pro elos cerebras code interpreter ai winter john franco icml lstm r1s aws reinvent muser latent space pypy nova pro dan gross paige bailey noam brown quiet capital john frankel

FRUTOS DA EXTENSÃO: ELO Estrela do Oriente - Melhor idade em movimento

UFOP CAST

Play Episode Listen Later Dec 4, 2024 8:26

O projeto “ELO Estrela do Oriente: Melhor Idade em Movimento” faz parte do grupo de oficinas desenvolvidas pelo programa de extensão “ELOS” da Escola de Educação Física da Universidade Federal de Ouro Preto (UFOP), e atende as demandas do Grupo da Melhor Idade Estrela do Oriente, localizado no distrito de Ouro Preto em Cachoeira do Campo. O projeto tem como objetivo promover, aos idosos do distrito, práticas da cultura corporal do movimento. Para isso, são oferecidas aulas gratuitas de ginástica e aulas de dança, toda segunda e quarta-feira, no Clube Estrela do Oriente. Aperte o play e confira todos os detalhes dessa iniciativa extensionista. Ficha Técnica Produção: Larissa Antunes Edição de Texto: Elis Cristina Edição de áudio e sonoplastia:Eduardo Rodrigues

grupo campo melhor educa escola movimento oriente frutos universidade federal aperte estrela idade extens ouro preto cachoeira ficha t elos

In the Arena: How LMSys changed LLM Benchmarking Forever

Latent Space: The AI Engineer Podcast â€” CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Nov 1, 2024 41:02

Apologies for lower audio quality; we lost recordings and had to use backup tracks. Our guests today are Anastasios Angelopoulos and Wei-Lin Chiang, leads of Chatbot Arena, fka LMSYS, the crowdsourced AI evaluation platform developed by the LMSys student club at Berkeley, which became the de facto standard for comparing language models. Arena ELO is often more cited than MMLU scores to many folks, and they have attracted >1,000,000 people to cast votes since its launch, leading top model trainers to cite them over their own formal academic benchmarks:The Limits of Static BenchmarksWe've done two benchmarks episodes: Benchmarks 101 and Benchmarks 201. One issue we've always brought up with static benchmarks is that 1) many are getting saturated, with models scoring almost perfectly on them 2) they often don't reflect production use cases, making it hard for developers and users to use them as guidance. The fundamental challenge in AI evaluation isn't technical - it's philosophical. How do you measure something that increasingly resembles human intelligence? Rather than trying to define intelligence upfront, Arena let users interact naturally with models and collect comparative feedback. It's messy and subjective, but that's precisely the point - it captures the full spectrum of what people actually care about when using AI.The Pareto Frontier of Cost vs IntelligenceBecause the Elo scores are remarkably stable over time, we can put all the chat models on a map against their respective cost to gain a view of at least 3 orders of magnitude of model sizes/costs and observe the remarkable shift in intelligence per dollar over the past year:This frontier stood remarkably firm through the recent releases of o1-preview and price cuts of Gemini 1.5:The Statistics of SubjectivityIn our Benchmarks 201 episode, Clémentine Fourrier from HuggingFace thought this design choice was one of shortcomings of arenas: they aren't reproducible. You don't know who ranked what and what exactly the outcome was at the time of ranking. That same person might rank the same pair of outputs differently on a different day, or might ask harder questions to better models compared to smaller ones, making it imbalanced. Another argument that people have brought up is confirmation bias. We know humans prefer longer responses and are swayed by formatting - Rob Mulla from Dreadnode had found some interesting data on this in May:The approach LMArena is taking is to use logistic regression to decompose human preferences into constituent factors. As Anastasios explains: "We can say what components of style contribute to human preference and how they contribute." By adding these style components as parameters, they can mathematically "suck out" their influence and isolate the core model capabilities.This extends beyond just style - they can control for any measurable factor: "What if I want to look at the cost adjusted performance? Parameter count? We can ex post facto measure that." This is one of the most interesting things about Arena: You have a data generation engine which you can clean and turn into leaderboards later. If you wanted to create a leaderboard for poetry writing, you could get existing data from Arena, normalize it by identifying these style components. Whether or not it's possible to really understand WHAT bias the voters have, that's a different question.Private EvalsOne of the most delicate challenges LMSYS faces is maintaining trust while collaborating with AI labs. The concern is that labs could game the system by testing multiple variants privately and only releasing the best performer. This was brought up when 4o-mini released and it ranked as the second best model on the leaderboard:But this fear misunderstands how Arena works. Unlike static benchmarks where selection bias is a major issue, Arena's live nature means any initial bias gets washed out by ongoing evaluation. As Anastasios explains: "In the long run, there's way more fresh data than there is data that was used to compare these five models." The other big question is WHAT model is actually being tested; as people often talk about on X / Discord, the same endpoint will randomly feel “nerfed” like it happened for “Claude European summer” and corresponding conspiracy theories:It's hard to keep track of these performance changes in Arena as these changes (if real…?) are not observable.The Future of EvaluationThe team's latest work on RouteLLM points to an interesting future where evaluation becomes more granular and task-specific. But they maintain that even simple routing strategies can be powerful - like directing complex queries to larger models while handling simple tasks with smaller ones.Arena is now going to expand beyond text into multimodal evaluation and specialized domains like code execution and red teaming. But their core insight remains: the best way to evaluate intelligence isn't to simplify it into metrics, but to embrace its complexity and find rigorous ways to analyze it. To go after this vision, they are spinning out Arena from LMSys, which will stay as an academia-driven group at Berkeley.Full Video PodcastChapters* 00:00:00 - Introductions* 00:01:16 - Origin and development of Chatbot Arena* 00:05:41 - Static benchmarks vs. Arenas* 00:09:03 - Community building* 00:13:32 - Biases in human preference evaluation* 00:18:27 - Style Control and Model Categories* 00:26:06 - Impact of o1* 00:29:15 - Collaborating with AI labs* 00:34:51 - RouteLLM and router models* 00:38:09 - Future of LMSys / ArenaShow Notes* Anastasios Angelopoulos* Anastasios' NeurIPS Paper Conformal Risk Control* Wei-Lin Chiang* Chatbot Arena* LMSys* MTBench* ShareGPT dataset* Stanford's Alpaca project* LLMRouter* E2B* DreadnodeTranscriptAlessio [00:00:00]: Hey everyone, welcome to the Latent Space podcast. This is Alessio, Partner and CTO in Residence at Decibel Partners, and I'm joined by my co-host Swyx, founder of Smol.ai.Swyx [00:00:14]: Hey, and today we're very happy and excited to welcome Anastasios and Wei Lin from LMSys. Welcome guys.Wei Lin [00:00:21]: Hey, how's it going? Nice to see you.Anastasios [00:00:23]: Thanks for having us.Swyx [00:00:24]: Anastasios, I actually saw you, I think at last year's NeurIPS. You were presenting a paper, which I don't really super understand, but it was some theory paper about how your method was very dominating over other sort of search methods. I don't remember what it was, but I remember that you were a very confident speaker.Anastasios [00:00:40]: Oh, I totally remember you. Didn't ever connect that, but yes, that's definitely true. Yeah. Nice to see you again.Swyx [00:00:46]: Yeah. I was frantically looking for the name of your paper and I couldn't find it. Basically I had to cut it because I didn't understand it.Anastasios [00:00:51]: Is this conformal PID control or was this the online control?Wei Lin [00:00:55]: Blast from the past, man.Swyx [00:00:57]: Blast from the past. It's always interesting how NeurIPS and all these academic conferences are sort of six months behind what people are actually doing, but conformal risk control, I would recommend people check it out. I have the recording. I just never published it just because I was like, I don't understand this enough to explain it.Anastasios [00:01:14]: People won't be interested.Wei Lin [00:01:15]: It's all good.Swyx [00:01:16]: But ELO scores, ELO scores are very easy to understand. You guys are responsible for the biggest revolution in language model benchmarking in the last few years. Maybe you guys want to introduce yourselves and maybe tell a little bit of the brief history of LMSysWei Lin [00:01:32]: Hey, I'm Wei Lin. I'm a fifth year PhD student at UC Berkeley, working on Chatbot Arena these days, doing crowdsourcing AI benchmarking.Anastasios [00:01:43]: I'm Anastasios. I'm a sixth year PhD student here at Berkeley. I did most of my PhD on like theoretical statistics and sort of foundations of model evaluation and testing. And now I'm working 150% on this Chatbot Arena stuff. It's great.Alessio [00:02:00]: And what was the origin of it? How did you come up with the idea? How did you get people to buy in? And then maybe what were one or two of the pivotal moments early on that kind of made it the standard for these things?Wei Lin [00:02:12]: Yeah, yeah. Chatbot Arena project was started last year in April, May, around that. Before that, we were basically experimenting in a lab how to fine tune a chatbot open source based on the Llama 1 model that I released. At that time, Lama 1 was like a base model and people didn't really know how to fine tune it. So we were doing some explorations. We were inspired by Stanford's Alpaca project. So we basically, yeah, grow a data set from the internet, which is called ShareGPT data set, which is like a dialogue data set between user and chat GPT conversation. It turns out to be like pretty high quality data, dialogue data. So we fine tune on it and then we train it and release the model called V2. And people were very excited about it because it kind of like demonstrate open way model can reach this conversation capability similar to chat GPT. And then we basically release the model with and also build a demo website for the model. People were very excited about it. But during the development, the biggest challenge to us at the time was like, how do we even evaluate it? How do we even argue this model we trained is better than others? And then what's the gap between this open source model that other proprietary offering? At that time, it was like GPT-4 was just announced and it's like Cloud One. What's the difference between them? And then after that, like every week, there's a new model being fine tuned, released. So even until still now, right? And then we have that demo website for V2 now. And then we thought like, okay, maybe we can add a few more of the model as well, like API model as well. And then we quickly realized that people need a tool to compare between different models. So we have like a side by side UI implemented on the website to that people choose, you know, compare. And we quickly realized that maybe we can do something like, like a battle on top of ECLMs, like just anonymize it, anonymize the identity, and that people vote which one is better. So the community decides which one is better, not us, not us arguing, you know, our model is better or what. And that turns out to be like, people are very excited about this idea. And then we tweet, we launch, and that's, yeah, that's April, May. And then it was like first two, three weeks, like just a few hundred thousand views tweet on our launch tweets. And then we have regularly double update weekly, beginning at a time, adding new model GPT-4 as well. So it was like, that was the, you know, the initial.Anastasios [00:04:58]: Another pivotal moment, just to jump in, would be private models, like the GPT, I'm a little,Wei Lin [00:05:04]: I'm a little chatty. That was this year. That was this year.Anastasios [00:05:07]: Huge.Wei Lin [00:05:08]: That was also huge.Alessio [00:05:09]: In the beginning, I saw the initial release was May 3rd of the beta board. On April 6, we did a benchmarks 101 episode for a podcast, just kind of talking about, you know, how so much of the data is like in the pre-training corpus and blah, blah, blah. And like the benchmarks are really not what we need to evaluate whether or not a model is good. Why did you not make a benchmark? Maybe at the time, you know, it was just like, Hey, let's just put together a whole bunch of data again, run a, make a score that seems much easier than coming out with a whole website where like users need to vote. Any thoughts behind that?Wei Lin [00:05:41]: I think it's more like fundamentally, we don't know how to automate this kind of benchmarks when it's more like, you know, conversational, multi-turn, and more open-ended task that may not come with a ground truth. So let's say if you ask a model to help you write an email for you for whatever purpose, there's no ground truth. How do you score them? Or write a story or a creative story or many other things like how we use ChatterBee these days. It's more open-ended. You know, we need human in the loop to give us feedback, which one is better. And I think nuance here is like, sometimes it's also hard for human to give the absolute rating. So that's why we have this kind of pairwise comparison, easier for people to choose which one is better. So from that, we use these pairwise comparison, those to calculate the leaderboard. Yeah. You can add more about this methodology.Anastasios [00:06:40]: Yeah. I think the point is that, and you guys probably also talked about this at some point, but static benchmarks are intrinsically, to some extent, unable to measure generative model performance. And the reason is because you cannot pre-annotate all the outputs of a generative model. You change the model, it's like the distribution of your data is changing. New labels to deal with that. New labels are great automated labeling, right? Which is why people are pursuing both. And yeah, static benchmarks, they allow you to zoom in to particular types of information like factuality, historical facts. We can build the best benchmark of historical facts, and we will then know that the model is great at historical facts. But ultimately, that's not the only axis, right? And we can build 50 of them, and we can evaluate 50 axes. But it's just so, the problem of generative model evaluation is just so expansive, and it's so subjective, that it's just maybe non-intrinsically impossible, but at least we don't see a way. We didn't see a way of encoding that into a fixed benchmark.Wei Lin [00:07:47]: But on the other hand, I think there's a challenge where this kind of online dynamic benchmark is more expensive than static benchmark, offline benchmark, where people still need it. Like when they build models, they need static benchmark to track where they are.Anastasios [00:08:03]: It's not like our benchmark is uniformly better than all other benchmarks, right? It just measures a different kind of performance that has proved to be useful.Swyx [00:08:14]: You guys also published MTBench as well, which is a static version, let's say, of Chatbot Arena, right? That people can actually use in their development of models.Wei Lin [00:08:25]: Right. I think one of the reasons we still do this static benchmark, we still wanted to explore, experiment whether we can automate this, because people, eventually, model developers need it to fast iterate their model. So that's why we explored LM as a judge, and ArenaHard, trying to filter, select high-quality data we collected from Chatbot Arena, the high-quality subset, and use that as a question and then automate the judge pipeline, so that people can quickly get high-quality signal, benchmark signals, using this online benchmark.Swyx [00:09:03]: As a community builder, I'm curious about just the initial early days. Obviously when you offer effectively free A-B testing inference for people, people will come and use your arena. What do you think were the key unlocks for you? Was it funding for this arena? Was it marketing? When people came in, do you see a noticeable skew in the data? Which obviously now you have enough data sets, you can separate things out, like coding and hard prompts, but in the early days, it was just all sorts of things.Anastasios [00:09:31]: Yeah, maybe one thing to establish at first is that our philosophy has always been to maximize organic use. I think that really does speak to your point, which is, yeah, why do people come? They came to use free LLM inference, right? And also, a lot of users just come to the website to use direct chat, because you can chat with the model for free. And then you could think about it like, hey, let's just be kind of like more on the selfish or conservative or protectionist side and say, no, we're only giving credits for people that battle or so on and so forth. Strategy wouldn't work, right? Because what we're trying to build is like a big funnel, a big funnel that can direct people. And some people are passionate and interested and they battle. And yes, the distribution of the people that do that is different. It's like, as you're pointing out, it's like, that's not as they're enthusiastic.Wei Lin [00:10:24]: They're early adopters of this technology.Anastasios [00:10:27]: Or they like games, you know, people like this. And we've run a couple of surveys that indicate this as well, of our user base.Wei Lin [00:10:36]: We do see a lot of developers come to the site asking polling questions, 20-30%. Yeah, 20-30%.Anastasios [00:10:42]: It's obviously not reflective of the general population, but it's reflective of some corner of the world of people that really care. And to some extent, maybe that's all right, because those are like the power users. And you know, we're not trying to claim that we represent the world, right? We represent the people that come and vote.Swyx [00:11:02]: Did you have to do anything marketing-wise? Was anything effective? Did you struggle at all? Was it success from day one?Wei Lin [00:11:09]: At some point, almost done. Okay. Because as you can imagine, this leaderboard depends on community engagement participation. If no one comes to vote tomorrow, then no leaderboard.Anastasios [00:11:23]: So we had some period of time when the number of users was just, after the initial launch, it went lower. Yeah. And, you know, at some point, it did not look promising. Actually, I joined the project a couple months in to do the statistical aspects, right? As you can imagine, that's how it kind of hooked into my previous work. At that time, it wasn't like, you know, it definitely wasn't clear that this was like going to be the eval or something. It was just like, oh, this is a cool project. Like Wayland seems awesome, you know, and that's it.Wei Lin [00:11:56]: Definitely. There's in the beginning, because people don't know us, people don't know what this is for. So we had a hard time. But I think we were lucky enough that we have some initial momentum. And as well as the competition between model providers just becoming, you know, became very intense. Intense. And then that makes the eval onto us, right? Because always number one is number one.Anastasios [00:12:23]: There's also an element of trust. Our main priority in everything we do is trust. We want to make sure we're doing everything like all the I's are dotted and the T's are crossed and nobody gets unfair treatment and people can see from our profiles and from our previous work and from whatever, you know, we're trustworthy people. We're not like trying to make a buck and we're not trying to become famous off of this or that. It's just, we're trying to provide a great public leaderboard community venture project.Wei Lin [00:12:51]: Yeah.Swyx [00:12:52]: Yes. I mean, you are kind of famous now, you know, that's fine. Just to dive in more into biases and, you know, some of this is like statistical control. The classic one for human preference evaluation is humans demonstrably prefer longer contexts or longer outputs, which is actually something that we don't necessarily want. You guys, I think maybe two months ago put out some length control studies. Apart from that, there are just other documented biases. Like, I'd just be interested in your review of what you've learned about biases and maybe a little bit about how you've controlled for them.Anastasios [00:13:32]: At a very high level, yeah. Humans are biased. Totally agree. Like in various ways. It's not clear whether that's good or bad, you know, we try not to make value judgments about these things. We just try to describe them as they are. And our approach is always as follows. We collect organic data and then we take that data and we mine it to get whatever insights we can get. And, you know, we have many millions of data points that we can now use to extract insights from. Now, one of those insights is to ask the question, what is the effect of style, right? You have a bunch of data, you have votes, people are voting either which way. We have all the conversations. We can say what components of style contribute to human preference and how do they contribute? Now, that's an important question. Why is that an important question? It's important because some people want to see which model would be better if the lengths of the responses were the same, were to be the same, right? People want to see the causal effect of the model's identity controlled for length or controlled for markdown, number of headers, bulleted lists, is the text bold? Some people don't, they just don't care about that. The idea is not to impose the judgment that this is not important, but rather to say ex post facto, can we analyze our data in a way that decouples all the different factors that go into human preference? Now, the way we do this is via statistical regression. That is to say the arena score that we show on our leaderboard is a particular type of linear model, right? It's a linear model that takes, it's a logistic regression that takes model identities and fits them against human preference, right? So it regresses human preference against model identity. What you get at the end of that logistic regression is a parameter vector of coefficients. And when the coefficient is large, it tells you that GPT 4.0 or whatever, very large coefficient, that means it's strong. And that's exactly what we report in the table. It's just the predictive effect of the model identity on the vote. The other thing that you can do is you can take that vector, let's say we have M models, that is an M dimensional vector of coefficients. What you can do is you say, hey, I also want to understand what the effect of length is. So I'll add another entry to that vector, which is trying to predict the vote, right? That tells me the difference in length between two model responses. So we have that for all of our data. We can compute it ex post facto. We added it into the regression and we look at that predictive effect. And then the idea, and this is formally true under certain conditions, not always verifiable ones, but the idea is that adding that extra coefficient to this vector will kind of suck out the predictive power of length and put it into that M plus first coefficient and quote, unquote, de-bias the rest so that the effect of length is not included. And that's what we do in style control. Now we don't just do it for M plus one. We have, you know, five, six different style components that have to do with markdown headers and bulleted lists and so on that we add here. Now, where is this going? You guys see the idea. It's a general methodology. If you have something that's sort of like a nuisance parameter, something that exists and provides predictive value, but you really don't want to estimate that. You want to remove its effect. In causal inference, these things are called like confounders often. What you can do is you can model the effect. You can put them into your model and try to adjust for them. So another one of those things might be cost. You know, what if I want to look at the cost adjusted performance of my model, which models are punching above their weight, parameter count, which models are punching above their weight in terms of parameter count, we can ex post facto measure that. We can do it without introducing anything that compromises the organic nature of theWei Lin [00:17:17]: data that we collect.Anastasios [00:17:18]: Hopefully that answers the question.Wei Lin [00:17:20]: It does.Swyx [00:17:21]: So I guess with a background in econometrics, this is super familiar.Anastasios [00:17:25]: You're probably better at this than me for sure.Swyx [00:17:27]: Well, I mean, so I used to be, you know, a quantitative trader and so, you know, controlling for multiple effects on stock price is effectively the job. So it's interesting. Obviously the problem is proving causation, which is hard, but you don't have to do that.Anastasios [00:17:45]: Yes. Yes, that's right. And causal inference is a hard problem and it goes beyond statistics, right? It's like you have to build the right causal model and so on and so forth. But we think that this is a good first step and we're sort of looking forward to learning from more people. You know, there's some good people at Berkeley that work on causal inference for the learning from them on like, what are the really most contemporary techniques that we can use in order to estimate true causal effects if possible.Swyx [00:18:10]: Maybe we could take a step through the other categories. So style control is a category. It is not a default. I have thought that when you wrote that blog post, actually, I thought it would be the new default because it seems like the most obvious thing to control for. But you also have other categories, you have coding, you have hard prompts. We consider that.Anastasios [00:18:27]: We're still actively considering it. It's just, you know, once you make that step, once you take that step, you're introducing your opinion and I'm not, you know, why should our opinion be the one? That's kind of a community choice. We could put it to a vote.Wei Lin [00:18:39]: We could pass.Anastasios [00:18:40]: Yeah, maybe do a poll. Maybe do a poll.Swyx [00:18:42]: I don't know. No opinion is an opinion.Wei Lin [00:18:44]: You know what I mean?Swyx [00:18:45]: Yeah.Wei Lin [00:18:46]: There's no neutral choice here.Swyx [00:18:47]: Yeah. You have all these others. You have instruction following too. What are your favorite categories that you like to talk about? Maybe you tell a little bit of the stories, tell a little bit of like the hard choices that you had to make.Wei Lin [00:18:57]: Yeah. Yeah. Yeah. I think the, uh, initially the reason why we want to add these new categories is essentially to answer some of the questions from our community, which is we won't have a single leaderboard for everything. So these models behave very differently in different domains. Let's say this model is trend for coding, this model trend for more technical questions and so on. On the other hand, to answer people's question about like, okay, what if all these low quality, you know, because we crowdsource data from the internet, there will be noise. So how do we de-noise? How do we filter out these low quality data effectively? So that was like, you know, some questions we want to answer. So basically we spent a few months, like really diving into these questions to understand how do we filter all these data because these are like medias of data points. And then if you want to re-label yourself, it's possible, but we need to kind of like to automate this kind of data classification pipeline for us to effectively categorize them to different categories, say coding, math, structure, and also harder problems. So that was like, the hope is when we slice the data into these meaningful categories to give people more like better signals, more direct signals, and that's also to clarify what we are actually measuring for, because I think that's the core part of the benchmark. That was the initial motivation. Does that make sense?Anastasios [00:20:27]: Yeah. Also, I'll just say, this does like get back to the point that the philosophy is to like mine organic, to take organic data and then mine it x plus factor.Alessio [00:20:35]: Is the data cage-free too, or just organic?Anastasios [00:20:39]: It's cage-free.Wei Lin [00:20:40]: No GMO. Yeah. And all of these efforts are like open source, like we open source all of the data cleaning pipeline, filtering pipeline. Yeah.Swyx [00:20:50]: I love the notebooks you guys publish. Actually really good just for learning statistics.Wei Lin [00:20:54]: Yeah. I'll share this insights with everyone.Alessio [00:20:59]: I agree on the initial premise of, Hey, writing an email, writing a story, there's like no ground truth. But I think as you move into like coding and like red teaming, some of these things, there's like kind of like skill levels. So I'm curious how you think about the distribution of skill of the users. Like maybe the top 1% of red teamers is just not participating in the arena. So how do you guys think about adjusting for it? And like feels like this where there's kind of like big differences between the average and the top. Yeah.Anastasios [00:21:29]: Red teaming, of course, red teaming is quite challenging. So, okay. Moving back. There's definitely like some tasks that are not as subjective that like pairwise human preference feedback is not the only signal that you would want to measure. And to some extent, maybe it's useful, but it may be more useful if you give people better tools. For example, it'd be great if we could execute code with an arena, be fantastic.Wei Lin [00:21:52]: We want to do it.Anastasios [00:21:53]: There's also this idea of constructing a user leaderboard. What does that mean? That means some users are better than others. And how do we measure that? How do we quantify that? Hard in chatbot arena, but where it is easier is in red teaming, because in red teaming, there's an explicit game. You're trying to break the model, you either win or you lose. So what you can do is you can say, Hey, what's really happening here is that the models and humans are playing a game against one another. And then you can use the same sort of Bradley Terry methodology with some, some extensions that we came up with in one of you can read one of our recent blog posts for, for the sort of theoretical extensions. You can attribute like strength back to individual players and jointly attribute strength to like the models that are in this jailbreaking game, along with the target tasks, like what types of jailbreaks you want.Wei Lin [00:22:44]: So yeah.Anastasios [00:22:45]: And I think that this is, this is a hugely important and interesting avenue that we want to continue researching. We have some initial ideas, but you know, all thoughts are welcome.Wei Lin [00:22:54]: Yeah.Alessio [00:22:55]: So first of all, on the code execution, the E2B guys, I'm sure they'll be happy to helpWei Lin [00:22:59]: you.Alessio [00:23:00]: I'll please set that up. They're big fans. We're investors in a company called Dreadnought, which we do a lot in AI red teaming. I think to me, the most interesting thing has been, how do you do sure? Like the model jailbreak is one side. We also had Nicola Scarlini from DeepMind on the podcast, and he was talking about, for example, like, you know, context stealing and like a weight stealing. So there's kind of like a lot more that goes around it. I'm curious just how you think about the model and then maybe like the broader system, even with Red Team Arena, you're just focused on like jailbreaking of the model, right? You're not doing kind of like any testing on the more system level thing of the model where like, maybe you can get the training data back, you're going to exfiltrate some of the layers and the weights and things like that.Wei Lin [00:23:43]: So right now, as you can see, the Red Team Arena is at a very early stage and we are still exploring what could be the potential new games we can introduce to the platform. So the idea is still the same, right? And we build a community driven project platform for people. They can have fun with this website, for sure. That's one thing, and then help everyone to test these models. So one of the aspects you mentioned is stealing secrets, stealing training sets. That could be one, you know, it could be designed as a game. Say, can you still use their credential, you know, we hide, maybe we can hide the credential into system prompts and so on. So there are like a few potential ideas we want to explore for sure. Do you want to add more?Anastasios [00:24:28]: I think that this is great. This idea is a great one. There's a lot of great ideas in the Red Teaming space. You know, I'm not personally like a Red Teamer. I don't like go around and Red Team models, but there are people that do that and they're awesome. They're super skilled. When I think about the Red Team arena, I think those are really the people that we're building it for. Like, we want to make them excited and happy, build tools that they like. And just like chatbot arena, we'll trust that this will end up being useful for the world. And all these people are, you know, I won't say all these people in this community are actually good hearted, right? They're not doing it because they want to like see the world burn. They're doing it because they like, think it's fun and cool. And yeah. Okay. Maybe they want to see, maybe they want a little bit.Wei Lin [00:25:13]: I don't know. Majority.Anastasios [00:25:15]: Yeah.Wei Lin [00:25:16]: You know what I'm saying.Anastasios [00:25:17]: So, you know, trying to figure out how to serve them best, I think, I don't know where that fits. I just, I'm not expressing. And give them credits, right?Wei Lin [00:25:24]: And give them credit.Anastasios [00:25:25]: Yeah. Yeah. So I'm not trying to express any particular value judgment here as to whether that's the right next step. It's just, that's sort of the way that I think we would think about it.Swyx [00:25:35]: Yeah. We also talked to Sander Schulhoff of the HackerPrompt competition, and he's pretty interested in Red Teaming at scale. Let's just call it that. You guys maybe want to talk with him.Wei Lin [00:25:45]: Oh, nice.Swyx [00:25:46]: We wanted to cover a little, a few topical things and then go into the other stuff that your group is doing. You know, you're not just running Chatbot Arena. We can also talk about the new website and your future plans, but I just wanted to briefly focus on O1. It is the hottest, latest model. Obviously, you guys already have it on the leaderboard. What is the impact of O1 on your evals?Wei Lin [00:26:06]: Made our interface slower.Anastasios [00:26:07]: It made it slower.Swyx [00:26:08]: Yeah.Wei Lin [00:26:10]: Because it needs like 30, 60 seconds, sometimes even more to, the latency is like higher. So that's one. Sure. But I think we observe very interesting things from this model as well. Like we observe like significant improvement in certain categories, like more technical or math. Yeah.Anastasios [00:26:32]: I think actually like one takeaway that was encouraging is that I think a lot of people before the O1 release were thinking, oh, like this benchmark is saturated. And why were they thinking that? They were thinking that because there was a bunch of models that were kind of at the same level. They were just kind of like incrementally competing and it sort of wasn't immediately obvious that any of them were any better. Nobody, including any individual person, it's hard to tell. But what O1 did is it was, it's clearly a better model for certain tasks. I mean, I used it for like proving some theorems and you know, there's some theorems that like only I know because I still do a little bit of theory. Right. So it's like, I can go in there and ask like, oh, how would you prove this exact thing? Which I can tell you has never been in the public domain. It'll do it. It's like, what?Wei Lin [00:27:19]: Okay.Anastasios [00:27:20]: So there's this model and it crushed the benchmark. You know, it's just like really like a big gap. And what that's telling us is that it's not saturated yet. It's still measuring some signal. That was encouraging. The point, the takeaway is that the benchmark is comparative. There's no absolute number. There's no maximum ELO. It's just like, if you're better than the rest, then you win. I think that was actually quite helpful to us.Swyx [00:27:46]: I think people were criticizing, I saw some of the academics criticizing it as not apples to apples. Right. Like, because it can take more time to reason, it's basically doing some search, doing some chain of thought that if you actually let the other models do that same thing, they might do better.Wei Lin [00:28:03]: Absolutely.Anastasios [00:28:04]: To be clear, none of the leaderboard currently is apples to apples because you have like Gemini Flash, you have, you know, all sorts of tiny models like Lama 8B, like 8B and 405B are not apples to apples.Wei Lin [00:28:19]: Totally agree. They have different latencies.Anastasios [00:28:21]: Different latencies.Wei Lin [00:28:22]: Control for latency. Yeah.Anastasios [00:28:24]: Latency control. That's another thing. We can do style control, but latency control. You know, things like this are important if you want to understand the trade-offs involved in using AI.Swyx [00:28:34]: O1 is a developing story. We still haven't seen the full model yet, but it's definitely a very exciting new paradigm. I think one community controversy I just wanted to give you guys space to address is the collaboration between you and the large model labs. People have been suspicious, let's just say, about how they choose to A-B test on you. I'll state the argument and let you respond, which is basically they run like five anonymous models and basically argmax their Elo on LMSYS or chatbot arena, and they release the best one. Right? What has been your end of the controversy? How have you decided to clarify your policy going forward?Wei Lin [00:29:15]: On a high level, I think our goal here is to build a fast eval for everyone, and including everyone in the community can see the data board and understand, compare the models. More importantly, I think we want to build the best eval also for model builders, like all these frontier labs building models. They're also internally facing a challenge, which is how do they eval the model? That's the reason why we want to partner with all the frontier lab people, and then to help them testing. That's one of the... We want to solve this technical challenge, which is eval. Yeah.Anastasios [00:29:54]: I mean, ideally, it benefits everyone, right?Wei Lin [00:29:56]: Yeah.Anastasios [00:29:57]: And people also are interested in seeing the leading edge of the models. People in the community seem to like that. Oh, there's a new model up. Is this strawberry? People are excited. People are interested. Yeah. And then there's this question that you bring up of, is it actually causing harm?Wei Lin [00:30:15]: Right?Anastasios [00:30:16]: Is it causing harm to the benchmark that we are allowing this private testing to happen? Maybe stepping back, why do you have that instinct? The reason why you and others in the community have that instinct is because when you look at something like a benchmark, like an image net, a static benchmark, what happens is that if I give you a million different models that are all slightly different, and I pick the best one, there's something called selection bias that plays in, which is that the performance of the winning model is overstated. This is also sometimes called the winner's curse. And that's because statistical fluctuations in the evaluation, they're driving which model gets selected as the top. So this selection bias can be a problem. Now there's a couple of things that make this benchmark slightly different. So first of all, the selection bias that you include when you're only testing five models is normally empirically small.Wei Lin [00:31:12]: And that's why we have these confidence intervals constructed.Anastasios [00:31:16]: That's right. Yeah. Our confidence intervals are actually not multiplicity adjusted. One thing that we could do immediately tomorrow in order to address this concern is if a model provider is testing five models and they want to release one, and we're constructing the models at level one minus alpha, we can just construct the intervals instead at level one minus alpha divided by five. That's called Bonferroni correction. What that'll tell you is that the final performance of the model, the interval that gets constructed, is actually formally correct. We don't do that right now, partially because we know from simulations that the amount of selection bias you incur with these five things is just not huge. It's not huge in comparison to the variability that you get from just regular human voters. So that's one thing. But then the second thing is the benchmark is live, right? So what ends up happening is it'll be a small magnitude, but even if you suffer from the winner's curse after testing these five models, what'll happen is that over time, because we're getting new data, it'll get adjusted down. So if there's any bias that gets introduced at that stage, in the long run, it actually doesn't matter. Because asymptotically, basically in the long run, there's way more fresh data than there is data that was used to compare these five models against these private models.Swyx [00:32:35]: The announcement effect is only just the first phase and it has a long tail.Anastasios [00:32:39]: Yeah, that's right. And it sort of like automatically corrects itself for this selection adjustment.Swyx [00:32:45]: Every month, I do a little chart of Ellim's ELO versus cost, just to track the price per dollar, the amount of like, how much money do I have to pay for one incremental point in ELO? And so I actually observe an interesting stability in most of the ELO numbers, except for some of them. For example, GPT-4-O August has fallen from 12.90

god community ai strategy pr moving future phd partner cost forever impact student humans apologies stanford origin limits sort arena berkeley statistics cto gemini blast residence uc berkeley api intense gpt collaborating ui majority lama llama static llm biases elo arenas gotcha benchmarks deepmind benchmarking ying v2 alessio 8b alpaca lm parameter latency red team pid dreadnoughts red teaming smol moes o1 neurips repl elos huggingface cloud one latent space bonferroni red teamer wei lin

Ya tenemos que creer en elos Cardinals | Vikings sorprende a 49ers | Saints destruye a Cowboys

Primero y Diez - El Podcast

Play Episode Listen Later Sep 16, 2024 71:52

La semana 2 de la NFL nos dejó muchas sorpresas a las que no podemos quedarnos sin reaccionar. Desde os Cardinals despertando, los Vikings tomando por sorpresa a los 49ers, hasta la destrucción total de los Saints a los Cowboys. Learn more about your ad choices. Visit megaphone.fm/adchoices

nfl saints desde cowboys vikings cardinals tenemos san francisco 49ers creer sorprende destruye elos cardinals vikings

Missouri Executive Order 44 (Jerom Johnson & Elos Olsen)

Getting It Out

Play Episode Listen Later Sep 5, 2024 56:16

MISSOURI EXECUTIVE ORDER 44 is a righteous band with a unique mission. Their new album, Salt Sermon, was just released courtesy of The Ghost Is Clear Records and Learning Curve Records to critical acclaim. Members Jerom Johnson (vocals) and Elos Olsen (guitar) share their story of creating the world's first Mormon-powered blackened sasscore band. Music by:Common WoundsMissouri Executive Order 44The SlimeIntro music by:Hot ZonePatreon: https://www.patreon.com/GettingitoutpodcastEmail: dan@gettingitout.netWebsite: http://gettingitout.net/Instagram: @getting_it_out_podcastFacebook: www.facebook.com/gettingitoutpodcastX: @GettingItOutPod Get bonus content on Patreon Hosted on Acast. See acast.com/privacy for more information.

music executives missouri acast mormon executive orders olsen slime hot zone elos

Samuel Vázquez: "La política del PSOE en inmigración es "tráelos, utilízalos políticamente, pero aléjalos de mi casa"

La Trinchera de Llamas

Play Episode Listen Later Aug 17, 2024 18:15

Samuel Vázquez, presidente de Un Policía para el Siglo XXI, llega a La Trinchera para abordar como la inmigración ilegal y masiva afecta a occidente

pero poltica la pol siglo xxi mi casa inmigraci psoe util elos

Intervija | Toms Skujiņš par 5. vietu Parīzē un mērķiem Losandželosā

11TV Podkāsts

Play Episode Listen Later Aug 15, 2024 34:32

Valdis Valters pēc lieliskā starta Parīzē un interviju aicinājis 5. vietas ieguvēju Tomu Skujiņu. Toms atklās detaļas, kas palīdzēja izcīnīt augsto vietu, par sezonu kopumā un mērķiem Losandželosā.

elos intervija vietu losand valdis valters

Avui portem dos invitats molt especials, dos xiquets de 15 anys apassionats pel cinema. Parlem amb ells de cinema, quin és el seu gènere preferit, el seu top 5 de pel·lícules preferides i quines recomanen. Programa de cine, en sèrie i en serio, on uns "personajes" parlen sobre la cartellera, novetats, rumors i notícies del món de la gran i petita pantalla... Episodi n°28 "Los xiqüelos" També en podcast a iVoox, Spotify i a www.radiorapita.cat

spotify tamb avui parlem episodi elos

A operação em São Paulo que mira os elos entre o PCC e a política

Durma com essa

Play Episode Listen Later Apr 16, 2024 15:40

O Ministério Público e a Polícia Militar prenderam nesta terça-feira (16) três vereadores de cidades de São Paulo: Ricardo Queixão (PSD), de Cubatão, Flavio Batista de Souza (Podemos), de Ferraz de Vasconcelos, e Luiz Carlos Alves Dias (MDB), de Santa Isabel. Eles e outras dez pessoas foram alvo da Operação Munditia, que mirou um grupo suspeito de fraudar contratos públicos em benefício do PCC (Primeiro Comando da Capital). O Durma com Essa trata da investigação, do histórico da facção e de seus elos com a política. O programa também tem a participação de Marcelo Roubicek explicando o atraso no ajuste fiscal.Assine o podcast: Spreaker | Apple Podcasts | Deezer | Google Podcasts | Spotify | Outros apps (RSS)Edição de áudio Pedro PastorizProdução de arte Mariana Simonetti

capital opera pol poltica eles o paulo militar assine psd vasconcelos o minist ferraz santa isabel elos cubat

LAS HIJAS DEL ABISMO | Dark Souls II - A Juego Lento 4 | Las 3 Coronas d elos reyes olvidados

Nuevebits - Podcast de Videojuegos en Español

Play Episode Listen Later Apr 5, 2024 124:17

Llegamos al final de nuestro viaje por Dark Souls II viajando a lo más profundo de los reinos olvidados. Si este es tu primero episodio de Dark Souls II - A Juego Lento, escucha primero los anteriores episodios: Episodios de Dark Souls II A Juego Lento: A Juego Lento I: https://youtu.be/jVKfLPy2v7s A Juego Lento II: https://youtu.be/bidnXcyZs8I A Juego Lento III: https://youtu.be/gRHhMk2fW-k - Estudia online en ⁠ @UNIR ⁠ un máster o un grado en videojuegos, diseño gráfico digital, UX o multimedia. Pide más información sin compromiso: https://estudiar.unir.net/es/es-gen-area-ing-formacion-diseno-director-ejecutivo/ - Másters y Grados 100% oficiales; dan acceso a programas de doctorado y a trabajar como profesor univeristario - Formación 100% online; estudia sin cambiar tus rutinas - Pensados para gente que trabaja y quiere convertirse en diseñador de lo que le apasiona

reyes ux juego llegamos pide formaci episodios lento abismo unir coronas estudia grados olvidados las hijas dark souls ii elos

Fantastisch: Marc-Uwe Kling liest „Der Spurenfinder“ von Marc-Uwe, Johanna und Luise Kling

SWR2 am Samstagnachmittag

Play Episode Listen Later Mar 9, 2024 4:38

Eigentlich hat sich der berühmte Spurenfinder Elos von Bergen zur Ruhe gesetzt, vor allem zum Schutz seiner Zwillinge Ada und Naru. Denn bei seinen bisherigen Abenteuern hat er sich nicht nur Freunde gemacht. Doch dann geschieht ausgerechnet im beschaulichen Friedhofen ein Mord: Ein Fall für Elos von Bergen und diesmal sind seine Kinder mit von der Partie. Marc-Uwe Kling hat mit seinen beiden Töchtern eine fantastische Welt kreiert, von der sie locker und lustig erzählen, auch wenn es zwischendurch sehr gefährlich und spannend wird. Der Fall ist so komplex, dass man ihn danach gleich nochmal hören will, um selbst Spuren zu suchen – nein, zu finden.

welt kinder freunde ruhe bergen schutz spuren der fall abenteuern liest naru fantastisch marc uwe kling elos

Entrevista Lucas Alex Santos Scouto, sobre sua nova trajetória na Sete Elos

Assunto Nosso

Play Episode Listen Later Feb 27, 2024 5:43

Compacto de entrevista com a abordagem de temas pertinentes à comunidade do Vale do Rio Pardo e Taquari. A cada edição um tema relevante ou algum assunto específico de ampla repercussão

entrevista sete trajet elos taquari alex santos

Mundofonías 2024 #10: Músicas que florecen / Music that flourishes

Mundofonías

Play Episode Listen Later Feb 2, 2024 58:41

Degustamos músicas con inspiraciones brasileñas, ibéricas, centroeuropeas, caucásicas y centroasiáticas. Entre novedades y avances, anunciamos también la presentación ¡muy silvestre! del disco de La Barca y charlamos con Christian Pliefke, quien nos habla de los sellos que impulsa desde CPL-Musicgroup, y con Kirill Kuzmin, quien nos pone al día acerca de las actividades del Aga Khan Music Programme y del proyecto de los Aga Khan Master Musicians. We taste music with Brazilian, Iberian, Central European, Caucasian and Central Asian inspirations. Among new releases and previews, we also announce the ¡very wild! album presentation of La Barca and we talk with Christian Pliefke, who tells us about the labels he promotes from CPL-Musicgroup, and with Kirill Kuzmin, who updates us on the activities of the Aga Khan Music Programme and the Aga Khan Master Musicians. · Deltas - Loyo - II · La Barca - La mala hierba - Verde mi sangre, rojo tus hojas · Sole - Ke somas me - Elos · Eläkeläiset - Keväthumppa - Keväthumppa · Lakvar - Ki-bë - Fiction and folklore · Aga Khan Master Musicians - Tashkent - Nowruz · Aga Khan Master Musicians - Nowruz - Nowruz · Alma Pannonia - Transylvanian Jewish dance 1 - Transylvanian dances Voces invitadas: Guest voices: · Christian Pliefke (CPL-Musicgroup) · Kirill Kuzmin (Aga Khan Music Programme) #Mundofonews · La Barca 📸 Aga Khan Master Musicians (Sebastian Schutyser)

music fiction brazilian verde caucasians m sicas iberian central asian central european la barca flourishes elos degustamos

PvP Corner 145

GoCast: a Pokemon GO Podcast

Play Episode Listen Later Nov 17, 2023 95:34

DPhiE250 and FishOnAHeater recap what is going on to wrap up the Adventures Abound season in GO Battle League as well as preview the Latin American International Championships this weekend. Fish tells a personal story relating to the Brisbane regional. Finally, Fish & DPhiE wrap up the episode by pulling out three pieces of mail from the mailbag. Let's celebrate some achievements! In the Pallet Town server, basherballgod reached Legend by utilizing a Shadow Luxray and Freyaamber finished both the Master Ball research & Level 45 research. In the GOCast server, JEngineer and girafichu both reached their peak ELOS (3550 & 3429 respectively). Great work, trainers! Lundberger's LAIC article Time Codes: GBL - 1:20 Events - 13:47 Play! Pokemon - 16:57 Fish has a story - 24:30 Mailbag - 48:20 Achievements - 1:27:15 Regular Links: GBL Adventures Abound Blog post Support PvPoke on Patreon or Github Play! Pokémon 2024 Season Information from Dracoviz More Fish More DPhiE250 Voicemail: (262) 586-7717 Email: pvpcorner@gocastpodcast.com Physical mail: GOCast PO BOX Crystal Lake - Post Office PO Box 367 301 E Congress Pkwy Crystal Lake, IL 60039 --- Send in a voice message: https://podcasters.spotify.com/pod/show/gocastpodcast/message

fish pokemon pok brisbane elos pallet town master ball gocast go battle league

49ers con Chase ¿al Super Bowl? La realidad d elos Steelers. Cowboys ¿El bully de los débiles?

Mundo NFL

Play Episode Listen Later Nov 14, 2023 37:17

A los 49ers les cayó muy bien la semana de descanso y le pasaron por encima a los Jaguars. Lo mismo hicieron los Cowboys con los Giants, pero ¿pueden dar el salto para contender o solo abusan de los débiles? Además exploramos las verdaderas perspectivas de un equipo como los Steelers.See omnystudio.com/listener for privacy information.

super bowl giants adem cowboys bullies pittsburgh steelers san francisco 49ers jaguars la realidad los d biles elos steelers cowboys

Arroba do boi atinge preço de equilíbrio e atuais patamares contemplam quase todos os elos da cadeia, exceto o pecuarista

Notícias Agrícolas - Podcasts

Play Episode Listen Later Oct 17, 2023 23:37

Consumo de carne no mercado interno garantiu evolução de 10% no preço do atacado nos últimos 45 dias

consumo quase pre o brio equil cadeia exceto elos arroba pecuarista

Super Sexy Summer Solstice

Play Episode Listen Later Jun 16, 2023 61:27

What's good trainers, welcome back! On this weeks episode, Roldy shows up with a great pre-show buzz and it's amazing! We recap our Axew Community Day and our mid 30's struggle to link up, the super sexy Summer Solstice/Team Rocket takeover (such wow!), something about Pokémon contests, and a lot of gay jokes (Happy Pride!). And in the Battle Dome, we discuss our non-ELOs, how fun the Summer Cups were and discuss tips next weeks Single Type Cup because the current week is skippable. Thats it for today trainers, thanks for tuning in! Don't forget to rate, subscribe, and let us know what you'd like to hear, or if you're interested in being a sponsor of the week via Twitter!!Find us on Apple Podcasts, Spotify, Google Podcasts, Amazon Music, and many more!Also duh, follow us on Twitter and add us on Pokémon GO!The Wailordz - @thewailordzRichard - @itslikecrackyo - 3773 3450 3263Roldy - @R2theOLDY - 8568 6703 2582Til next time trainers!

spotify google podcasts pok amazon music pokemon go happy pride summer solstice community day super sexy gbl elos battledome go battle league

Unindo todos os elos da cadeia, vem aí a segunda edição do São Paulo Coffee Festival

Notícias Agrícolas - Podcasts

Play Episode Listen Later May 19, 2023 33:27

Com a participação de Natália Camoleze - Coordenadora de conteúdo da Café Editora

coffee festival o paulo cadeia unindo elos segunda edi

Las razas d elos perros más difíciles y más dòciles de entrenar

Martha Debayle en W

Play Episode Listen Later Feb 16, 2023 40:54

Garret Wing

perros entrenar razas elos

Com mercado cada vez mais exigente, produtor de café precisa estar atento a qualidade, mas operar com segurança em todos os elos da cadeia

Notícias Agrícolas - Podcasts

Play Episode Listen Later Feb 8, 2023 7:30

Edição 2023 da Femagri foca em sustentabilidade e soluções em todas as pontas para o produtor

cada estar mercado precisa seguran qualidade atento produtor cadeia operar elos

A SIC vem aí: Principal evento da cafeicultura reúne todos os elos da cadeia, abre leque de oportunidades ao produtor, cria debates e mostra potência

Notícias Agrícolas - Podcasts

Play Episode Listen Later Nov 11, 2022 40:16

Evento acontece nos dias 16,17 e 18 de novembro em Belo Horizonte

debate principal oportunidades abre evento cria mostra belo horizonte produtor cadeia elos

10 elos quebrados I Pr.Hugo Zica | Igreja Batista Plenitude

Igreja Batista Plenitude

Play Episode Listen Later Nov 1, 2022 29:08

Êxodo 20.1-17 São Luís-MA, Brasil. 30.10.2022 --- Send in a voice message: https://anchor.fm/igreja-batista-plenitude/message

brasil plenitude igreja batista zica elos

231: LIVE from LOMT with Kelly Weliky, Jen Ludwig, Ashley Boggs, Mark Ferguson, and Jeremy Wohlers.

Voices from The Bench

Play Episode Listen Later Aug 29, 2022 69:01

This week we bring you 3 more amazing conversations we got while attending the Ladies of the Mill (https://www.ladiesofthemill.com/) Summit back in July. Live from the Preat (https://www.preat.com/) booth Elvis talked to past podcast guest Kelly Weliky. Minutes after doing her first on-stage presentation, Kelly talks about speaking in front of other and what she has been up to since last on the podcast. Then we chat with Jen Ludwing. Jen is a regional manager for the Apex Dental Laboratory Group (https://www.apexlabgroup.com/) and a all around bright light in our industry. Jen talks about her role and why Ladies of the Mill is so important to her. Then we wrap up the whole weekend with a conversation with the folks at Vulcan Custom Dental (https://www.vulcandental.com/) and BioHorizon (https://www.biohorizons.com/). Ashley Boggs is an engineer for BioHorizon and talks about some engineering "stuff", Mark Ferguson talks about Vulcan's relationship with BioHorizon and some of the exciting things they are doing, and Jeremy Wohlers is there to make sure they is synergy between the two companies. It's a great convention with a lot of great conversations. Did you know Asiga (https://www.asiga.com/) has over 500 validated materials on their open material system. And it's growing every day? By harnessing Asiga's proprietary layer monitoring technology with its smart positioning system and integrated internal radiometer, as a laboratory, you will be able to produce any indication you desire. Whether models, splints, temporaries, or even permanent crowns. Your investment will be future proofed by Asiga's rugged engineering. Providing you with a fast, accurate, and repeatable machine, with a reputation that is time tested in the laboratory industry. If you would like to learn about Asiga's machine or material offerings, please visit the website at asiga.com or contact your favorite dental reseller. Two dynamic teams have joined forces to rock the intraoral scanning world! Whip Mix (https://www.whipmix.com/) has added the 3Shape TRIOS® (https://info.whipmix.com/en/3shape-trios-contact-us) line of scanners to its line of digital solutions for the dental office. Together, this dynamic duo can get your dentists scanning, providing you the reliable scans you need for your lab work. If you are interested in learning more about helping your dentists, go to: http://tinyurl.com/Whipmixtrios Special Guests: Ashley Boggs, Jennifer Ludwig, CDT, Jeremy Wohlers CDT, Kelly Weliky, and Mark Ferguson.

The Retro Sexual Cup

Play Episode Listen Later Jun 24, 2022 65:55

What's good trainers, welcome back! Roldy showed up buzzed and Richard joined him about halfway into it. Nature is healing. Anyways, on today's episode, we're recapping our current TCG event, we double down on Seattle(!), and we make the most of having no news (thanks Poke Miners!). Plus, we recap what's going on in the Battle Dome. ELOs are out and some bitches are petty. And apparently there was drama at the Pokemon Go Regional Worlds National whatever event. All this and more! That's it for today trainers, thanks for tuning in! Don't forget to rate, subscribe, and let us know what you'd like to hear, or if you're interested in being a sponsor of the week via Twitter!!Find us on Apple Podcasts, Spotify, Google Podcasts, Amazon Music, and many more!Don't forget to check out Roldy's YouTube channel for some fun PvP content https://www.youtube.com/channel/UCCZmICr6kvIajkrb_9792IQAlso duh, follow us on Twitter and add us on Pokémon GO!Richard - @itslikecrackyo - 3773 3450 3263Roldy - @R2theOLDY

spotify nature seattle sexual google podcasts pok retro amazon music pvp tcg elos battledome

The One With All The Deino Hype

Play Episode Listen Later Jun 22, 2022 63:25

What's good trainers, welcome back! On today's episode, after many moons, Richard is slipping back into stardust poverty with Roldy, we recap our successful Adventure Week (thanks for something good for once, Niantic), and we give our take on what everyone else was predicting: Deino Community Day. Plus, GBL is back in full swing! Bugs are everywhere, ELOs are crap, and Richard wasn't paying attention because he was PvPing during the recording. All this and more on today's episode. Thanks for tuning in, til next time Trainers! Don't forget to rate, subscribe, and let us know what you'd like to hear, or if you're interested in being a sponsor of the week via Twitter!!Find us on Apple Podcasts, Spotify, Google Podcasts, Amazon Music, and many more!Don't forget to check out Roldy's YouTube channel for some fun PvP content https://www.youtube.com/channel/UCCZmICr6kvIajkrb_9792IQAlso duh, follow us on Twitter and add us on Pokémon GO!Richard - @itslikecrackyo - 3773 3450 3263Roldy - @R2theOLDY

spotify hype trainers pokemon google podcasts pok bugs amazon music pokemon go pvp niantic pokemon tcg community day go fest gbl elos adventure week go battle league

T01E16 Mi Mamá a Corazón Abierto

A Corazón Abierto

Play Episode Listen Later May 4, 2021 96:26

Esta entrevista es muy especial para mi. Quise festejar el día de las Madres invitando a mi mamá a este episodios (Carmen González de Elosúa). Esta plática, más que una entrevista, es un regalo para mi. Para poder decirle a mi mamá todo lo qeu me quedo de ella y lo que valoro su amor incondicional. Quise compartirla con ustedes porque estoy segura que muchos se identificarán conmigo, en una relación con sus padres que es imperfecta, que ha pasado por cambios, por ideologías diferentes, pero al fin de cuentas, una relación que ha sido la base de lo que somos hoy. Espero la disfruten tanto como lo hice yo. Si quieres comentar este episodio, conocerme más o simplemente decirme qué piensas escríbeme en IG a @caroelosua y si te gusta mi contenido por favor no olvides suscribirte y compartir. Si quieres ver esta entrevista en video, la puedes encontrar en mi canal de YouTube.com/CaroElosua De todo corazón, Gracias por ser y Gracias por estar aquí.

espero coraz madres abierto quise elos

Podcasts about Elos

Best podcasts about Elos

AmericanReef - Saltwater and Coral Reef Aquarium Advanced Aquarists Edition

Latent Space: The AI Engineer Podcast â€” CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

The Unlimited Spanish Podcast: Aprende español | Habla español | Learn Spanish | Speak Spanish | TPRS

Notícias Agrícolas - Podcasts

The AskHistorians Podcast

The Wailordz: A PokÃ©mon GO Podcast

Radio Runeterra

Lama Padma Samten

Notícias do Front

RADIO 11.11 432 HZ

Elos

Ooshin Podcast

Rivista di Psicoanalisi - Podcast

UFOP CAST

Kult?ras Rondo

Podcasts – Trucão Comunicação em Transporte

Momento de Oração com Dr. Bezerra de Menezes

Assunto Nosso

Latest news about Elos

Latest podcast episodes about Elos

Polícia Civil aponta novos elos entre Deolane e o PCC

Frazão: Nunes Marques é mais um ministro com elos com o Master

A14MOD68_20260218 - Os cinco elos que nos unem a Deus.

La formulazione del caso clinico e il P.D.M.3. Davide Elos dialoga con Rachele Mariani

Dialoghi Aperti: il rapporto fra Psicoanalisi e Psichiatria: Davide Elos intervista Fabrizio Pavone

Karl Abraham. Davide Elos dialoga con Alessandra Balloni

Il P.D.M. 3. Davide Elos intervista Vittorio Lingiardi.

Aistringoji „Sarsuela“ – gruodžio 13-ąją Alytaus kultūros centre

Golpistas em cana e os elos políticos do rei da sonegação | Fechamento Carta

A ANÁLISE DOS POSSÍVEIS TREINADORES DO SPORT PARA 2026 E OS ELOS QUE TORNAM SEABRA FAVORITO

Il Dialogo fra Arte e Psicoanalisi. Davide Elos intervista Sonia DeCristofaro

"Inibizione, sintomo e angoscia". Davide Elos intervista Alessandra Balloni.

Libros para Copywriters – Léelos antes de septiembre

Il Dialogo fra Ricercatori e Clinici. Davide Elos intervista Giulio Cesare Zavattini e Anna Daniela Linciano

54° Congresso I.P.A. di Lisbona: Psychoanalysis an anchor in chaotic times. Davide Elos intervista Antonella Sessarego

Protetesti Losandželosā. Masks un Tramps šķiras

Gripe Aviária: Plano de contingência no RS está em andamento para todos os elos da cadeia avícola após confirmação de caso em granja comercial em Mont

Tirando a Fallar: Tiempo de playoff en la Euroliga, la NBA y la LF Endesa y la vida de Urbano González con ELA

EP322 A school framework where learning feels different and students thrive

FRUTOS DA EXTENSÃO: CONSOTIUM - Cooperação Intermunicipal para o Desenvolvimento do Lazer e do Esporte na Região dos Inconfidentes

Žurnālistes Māras Rozenbergas pieredzējumi "Oskaru" balvas ceremonijā Losandželosā

Outlasting Noam Shazeer, crowdsourcing Chat + AI with >1.4m DAU, and becoming the "Western DeepSeek" — with William Beauchamp, Chai Research

Latent.Space 2024 Year in Review

FRUTOS DA EXTENSÃO: ELO Estrela do Oriente - Melhor idade em movimento

In the Arena: How LMSys changed LLM Benchmarking Forever

Ya tenemos que creer en elos Cardinals | Vikings sorprende a 49ers | Saints destruye a Cowboys

Missouri Executive Order 44 (Jerom Johnson & Elos Olsen)

Samuel Vázquez: "La política del PSOE en inmigración es "tráelos, utilízalos políticamente, pero aléjalos de mi casa"

Intervija | Toms Skujiņš par 5. vietu Parīzē un mērķiem Losandželosā

Rádio Revista - Alex da Banda 7 Elos

Rádio Revista - Alex da Banda 7 Elos

07x28 Los Xiqüelos

A operação em São Paulo que mira os elos entre o PCC e a política

LAS HIJAS DEL ABISMO | Dark Souls II - A Juego Lento 4 | Las 3 Coronas d elos reyes olvidados

Fantastisch: Marc-Uwe Kling liest „Der Spurenfinder“ von Marc-Uwe, Johanna und Luise Kling

Entrevista Lucas Alex Santos Scouto, sobre sua nova trajetória na Sete Elos

Mundofonías 2024 #10: Músicas que florecen / Music that flourishes

PvP Corner 145

49ers con Chase ¿al Super Bowl? La realidad d elos Steelers. Cowboys ¿El bully de los débiles?

Arroba do boi atinge preço de equilíbrio e atuais patamares contemplam quase todos os elos da cadeia, exceto o pecuarista

Super Sexy Summer Solstice

Unindo todos os elos da cadeia, vem aí a segunda edição do São Paulo Coffee Festival

Las razas d elos perros más difíciles y más dòciles de entrenar

Com mercado cada vez mais exigente, produtor de café precisa estar atento a qualidade, mas operar com segurança em todos os elos da cadeia

A SIC vem aí: Principal evento da cafeicultura reúne todos os elos da cadeia, abre leque de oportunidades ao produtor, cria debates e mostra potência

10 elos quebrados I Pr.Hugo Zica | Igreja Batista Plenitude

231: LIVE from LOMT with Kelly Weliky, Jen Ludwig, Ashley Boggs, Mark Ferguson, and Jeremy Wohlers.

The Retro Sexual Cup

The One With All The Deino Hype

T01E16 Mi Mamá a Corazón Abierto