City in Scania, Sweden
POPULARITY
Categories
In this episode of DSC's Campfires, host Larry Weishuhn shares the thrilling details of his recent European roe deer hunt. He recounts his journey through Sweden, Poland, and Scotland alongside friends Mark and Carol Little, as well as expert guides Stefan and Sofia Bengtsson from Scandinavian Pro Hunters. Larry's deep passion for the sport and the delicious rewards of the hunt shine through as he details the beautiful landscapes, his gear, and the magnificent animals he encountered along the way. Larry compares the European roe deer to the whitetailed deer of European countries. These deer have a notably small body size, typically weighing between 50 and 60 pounds. Beyond the challenge of the hunt, Larry notes that the meat from these deer is absolutely delicious. The three-nation hunt was initially created as a donation for the DSC Foundation Gala in 2025. The package was generously put together by Larry alongside Stefan and Sofia Bengtsson of Scandinavian Pro Hunters. Mark and Carol Little purchased the hunt at the gala to help fund various DSC Foundation projects. The actual hunt came to fruition shortly before recording, taking place in May 2026. Larry traveled from Austin to London to Copenhagen, eventually driving across the waterway to the southernmost tip of Sweden. The group hunted on a large estate managed by Scandinavian Pro Hunters. This property featured a beautifully rehabilitated waterway that now supports abundant wildlife and sea-run trout. Carol Little successfully harvested a great deer early in the trip. Larry hunted from a "hochsitz" (an open-top deer blind) using Stefan's single-shot K95 Blaser rifle chambered in 6.5 Creedmoor. He harvested a mature, silver-medal buck with gorgeous ivory-colored antler tips. The group traveled by train from Malmo to Copenhagen, then flew directly into Warsaw, Poland. The Polish hunting grounds were highly agricultural, filled with apple orchards, blackberries, and strawberries. Larry hunted with Stefan and a local English-speaking guide named Derek. Using an R8 Blaser in .243, Larry shot a buck late in the evening. The local property manager used a thermal drone, which is completely legal in Poland, to recover the deer in the dark. This buck scored as a silver medal and featured a unique 3x4 point configuration. On a subsequent outing in Poland, Larry successfully used an 8x57 JS rifle to take another beautiful silver-medal buck. The final leg involved flying into Glasgow, Scotland, and driving to an area near Inverness. Larry hunted with a local Scottish guide named George. Larry used George's 6.5x55 Swede rifle to stalk and harvest another incredible roe deer buck. In total, Larry took four roe deer during the trip, including three 3x3s and two featuring a fourth point. Larry will be attending the Texas Wildlife Association Convention in San Antonio from July 23rd to July 26th. He will also be present at the upcoming DSC Summer Expo and DSC Gala. Another Scandinavian Pro Hunter trip will be auctioned off at the DSC Gala. Listeners can find full written stories and photos on Larry's website at larryweishuhn.net. Video coverage of the hunt will be available on Carbon TV and his YouTube channel, A Sportsman's Life. More information on the DSC Foundation and their conservation efforts can be found at dscf.org and biggame.org. Learn more about your ad choices. Visit megaphone.fm/adchoices
Myles Downey's new book, The Sessions, marks a transition, a metamorphosis in his influential body of work. And that becomes the theme of this, his third appearance on The Coach's Journey Podcast.What happens when it is no longer sustainable to remain what you have been? What is it like to have an idea - like Myles did with The Sessions more than 30 years ago - and not be able to let it go? And for Padraig, the protagonist of The Sessions, what is it like for someone to step into their own shoes, becoming a leader, becoming oneself as a leader? That is just the start, though, in a wide-ranging conversation about Myles' work, his new book, and what he has coming up next as part of what has already been an incredibly influential and impactful career, which started working closely with Sir John Whitmore in the early days of coaching in the UK and passed through The School of Coaching, his bestselling book Effective Modern Coaching and his powerful work on genius, which culminated in the book, Enabling Genius.In particular, we talk about: Whether it is our duty and responsibility to express ourselves. Poetry, prose, fiction, non-fiction, and Myles' relationship to reading and writing (and what this has to do with coaching).Myles' passion for human nature, creativity and how human beings should be more of themselves, not less of themselves. The close relationship between genius and being seen as mad, and why some leaders and CEOs see genius as dangerous. One of the most powerful coaching exercises Myles did with Robbie when he was Robbie's coach. And at the end, Myles tells a moving story about the influence of his wife, Jo, on The Sessions and his wider work, and gives a tantalising trailer for what might be his next book: The Inner Game of Leadership - what it takes to express oneself, configured around the notion of leadership. To listen to Myles' first appearance on The Coach's Journey Podcast, visit https://www.thecoachsjourney.com/podcast/episode-31-myles-downeyTo listen to his second appearance on the show, visit: https://www.thecoachsjourney.com/podcast/69-myles-downey-what-is-effective-coaching-performance-coaching-when-to-be-directive-changing-clients-maps-of-reality-and-moreFor more information about Myles, visit www.mylesdowney.com or find him on LinkedIn at www.linkedin.com/in/mylesdowneyFor more information about Robbie Swale, visit www.robbieswale.com.Read more about The Coach's Journey at www.thecoachsjourney.com.Music by My Good Man William: listen on Spotify: https://open.spotify.com/artist/4KmeQUcTbeE31uFynHQLQgTo support the Coach's Journey, visit www.patreon.com/thecoachsjourney and to join the Coach's Journey Community visit www.thecoachsjourney.com/community. Things and people we mentioned (that you might be interested in):Myles' previous appearances on The Coach's Journey Podcast: https://www.thecoachsjourney.com/podcast/episode-31-myles-downeyhttps://www.thecoachsjourney.com/podcast/69-myles-downey-what-is-effective-coaching-performance-coaching-when-to-be-directive-changing-clients-maps-of-reality-and-moreThe Sessions by Myles Downey: https://thesessions.online/ and https://www.amazon.co.uk/Sessions-tale-leadership-Dublin-pubs-ebook/dp/B0GKCWPD4B/The Inner Game of Tennis by Timothy Gallwey: https://www.amazon.co.uk/Inner-Game-Tennis-ultimate-performance/dp/1447288505/Myles' books: Effective Modern Coaching, The Enabling Manager, Enabling Genius: https://www.amazon.co.uk/Effective-Modern-Coaching-principles-successful/dp/191595116X/The fantasy novels where Robbie learned why poetry matters was The Long Price Quartet by Daniel Abraham: https://www.amazon.co.uk/The-Long-Price-Quartet/dp/B074CFZ29BJoseph Campbell: http://robbieswale.com/the-12-minute-blog/2024/10/9/the-insecure-way-is-really-the-secure-wayStrong Ground by Brene Brown: https://www.amazon.co.uk/Strong-Ground-Leadership-Tenacity-Vermilion/dp/178504320XRoberto Assagioli: https://en.wikipedia.org/wiki/Roberto_AssagioliBIOGRAPHY FROM MYLESMyles Downey is a recognized authority on Performance, Coaching and Leadership and the author of three classics in the coaching and performance arena: ‘Effective Modern Coaching', (330,00+ books sold); ‘Enabling Genius – a mindset for success in the 21st Century'; ‘The Enabling Manager – how to get the best out of your team'. Myles is one of the leading executive performance coaches in Europe, with global experience (Europe, North and South America, Asia-Pacific, UAE). He has worked across the C-suite in many prestigious organisations over the past 35 years, in a wide variety of industries, including Banking and Financial Services, Manufacturing, Oil and Gas, Professional Services, Tech, the Public Sector and Sports.Myles was the founder of The School of Coaching (1996 to 2012), at the time the premier institution for coach training in Europe.Myles has deployed his performance coaching programs with the Senior Coaches of the England Rugby Team and with the New Zealand Elite Rugby Coaches.Myles has been a speaker at many leadership conferences, including the BBC's Conference for its top 400 senior executives, the McKinsey Global Partners Conference in Singapore and the ICF Global Conference in Malmo, to name a few. After 33 years living in London, Myles now lives in Norfolk, England. He was born in Dublin, Ireland in 1959. Myles plays for Norfolk Veterans Tennis and competes occasionally on the ITF Masters Tour.
Fabi and Cristian break down a wild stretch in chess. Hans Niemann wins his first ever super tournament in Warsaw to kick off the Grand Chess Tour, but admits he felt like the rabbit being caught by the turtle as Fabi closed the gap in the final rounds. We dig into the American trio's standout performances, Alireza's secret conflict between the GCT and the Esports World Cup, and Magnus winning Tepe Sigeman in Malmo, ahead of Erigaisi and Erdogmus. Plus, 12-year-old Faustino Oro officially becomes a Grandmaster, the second youngest in chess history! Episode sponsored by our friends at https://www.chessreps.com/. The fastest way to learn openings and actually remember them. Train smarter, play sharper.♟️Play With Us: https://www.chess.com/join/csqpod?ref_id=207174611
Fabi and Cristian are live from Warsaw ahead of the Grand Chess Tour's first leg, breaking down the biggest stories in chess. Magnus Carlsen returns to classical at TePe Sigeman in Malmo and faces 14-year-old Yagiz Kaan Erdogmus for the first time ever. Webster University shuts down the legendary SPICE program, ending one of the most decorated collegiate chess dynasties in U.S. history — and Susan Polgar isn't taking it quietly. Plus the red or blue button debate splitting chess Twitter, and Fabi's POV heading into GCT Warsaw. Episode sponsored by our friends at https://www.chessreps.com/ ♟️Play With Us: https://www.chess.com/join/csqpod?ref_id=207174611
For the Glory KC is back with the 167th episode of the show!Sporting Kansas City finally did it! They made a signing that everyone expects to be a starter. Lasse Berg Johnsen joins from Malmo in Sweden and should immediately be penciled into the defensive midfielder role when his P-1 visa arrives. We break down the signing and how he'll fit next to Manu Garcia.The majority of this week's episode is focused on the two big preseason wins for Sporting KC. They finished 3-3-0 in preseason, a three game and five goal improvement over the 2025 preseason. Sure, against NYCFC it was a win over backups, but that Austin FC game was a starting lineup that got beaten on before some late red cards let the game get out of hand. Rapha Wicky seemed none too happy with Austin's violent fouls that led to cards either.The results of the games made it look like Wicky's system just needed a little more time to take hold (or maybe they just needed to not be playing Minnesota United). Regardless, there were plenty of positives to take away and I even took a shot at predicting who should start this weekend against the Quakes.In this episode I also spent time on the multiple rumors for Sporting KC this past week, SKC adding to their front office, Forbes' MLS valuations, Nemanja Radoja's new landing spot, SKC II signings and some former KC player updates.The KC Current made big headlines this week too. Some for good reasons and others that weren't quite so good. The team traded Claire Hutton, one of the best young players in the world. Then they immediately added another young star in Croix Bethune. That, plus the 1-1 draw against San Diego in preseason.In the Digital Crawl, we hit on a few more topics, including:KS Astras keep winningCatarina Macario to the San Diego Wave?Here is a rundown of topics and start times:Sporting KC - 0:00KC Current - 33:44Digital Crawl - 41:57As a special gift to For the Glory KC listeners and KC Soccer Journal readers, Backheeled dot com is giving away 30 days of their amazing, independent American soccer coverage for free. If you decide you want to turn that into a paid membership, they'll give you 10 percent off too. Just follow this link!Big thanks to Splitter Conspiracy (listen to them here) for our theme music made with the permission of the KC Cauldron.
Malmo Taproom had a tap takeover with Saez and Son from the Valenciana region of Spain. They had lots to try from TIPA to Pastry sours. It was tough to pick so we went with the two more interesting beers. Twisted Mary is a tomato gose that tastes more like vegetable or tomato juice than beer. Produced by their side project LOKO*, this beer will be really devisive. If you don't like tomato juice, you'll want to give it a pass. Palm Beach is a 6% ABV pastry sour with mango, platano and coconut milk. This is a rich delicous brew that you want to keep drinking. #beer #craftbeer #drinks #gose #pastrysour
Based in the Swedish city of Malmo, the Disgusting Food Museum features more than 80 foods from dozens of countries, including fermented shark, baby mice wine, fertilized egg and yes, even Vegemite.
Our great coach on this episode is Anders Tortensson.However this is not just a story about Anders, its also about the triumph of his team Mjallby in winning the Swedish top tier football competition in 2025.Mjallby is a town of 14,000 people with a budget that is only 12% of the richest club in the league, Malmo. Their stadium holds only 6000 people, and many of the players live in flats together and car pool into training.In 2017 they were in the 4th flight of Swedish football, and now after theirn victory they have qualified to take part in the Champions League qualifiers.They are led by Anders. A former school principal and miltary officer who has been involved with the club on and off since being a player there in the 1980's. In 2023 he became the head coach on a full time, professional basis and has led them to a a championship that many believe is one of the greatest underdog stories in European football.What can I do better to work with people's inner and outmer motivation?How can I use my voice better to convey a sense of calm?If you would like to send us any feedback or if you know a great coach, who has a unique story to share, then we would love to hear from you, please contact us at paul@thegreatcoachespodcast.com or contact us through our website thegreatcoachespodcast.com Hosted on Acast. See acast.com/privacy for more information.
Nottingham Forest's greatest ever player, John Robertson, died at the age of 72 on Christmas Day after suffering from Parkinson's disease for a number of years. Robertson was a league title winner and the man who set up Trevor Francis' goal against Malmo in the 1979 European Cup final. A year later he scored the winner himself against Hamburg. In our tribute, we hear from Robertson's former teammate Garry Birtles, broadcaster Darren Fletcher and podcast regular Steve Battlemuch to give the fan point of view on a great player and great man. #nffc #nottinghamforest
We celebrate our 200th episode with some reminiscing, and some recounts of cold days and playing with action figures. Sporting CP hit another Triple double this year, with goals spread around as they destroy AVS Futebol SAD 6-0. The Special one is finding his form with SL Benfica as they snag points away from Napoli in the Champions league and then lean on Pavlidis to put a hurting on Moreirense. Porto quietly continue their dominant streak in Portugal and Europe, with a big goal week for Samu, as he scores against Malmo and Estrela de Amadora. Famalicao are unleashing some impressive talents upon the league, but more than anything, bringing some great vibes. Another big win in the league as they strike down Estoril 4-0.We debate some other fun stuff like Eusebio, Cristiano Ronaldo, Green tea, Frankenstein, Lenny & Carl, Hamilton , and more. Enjoy!
Gabriel Alves destaca as exibições de Mora e Samu, mas sublinha a desconcentração defensiva dos azuis e brancos. Ainda acrescenta a habilidade "excelente" do guarda-redes do Malmo.See omnystudio.com/listener for privacy information.
Gabriel Alves destaca as exibições de Mora e Samu, mas sublinha a desconcentração defensiva dos azuis e brancos. Ainda acrescenta a habilidade "excelente" do guarda-redes do Malmo.See omnystudio.com/listener for privacy information.
Driven by Susie Wolff Life in the fast lane, with a perspective like no other: Formula One superstar Susie Wolff lifts the bonnet on her incredible journey to the top of motorsport, and her ambitions for what comes next. Unlucky for Some by Tom Wood In his latest gripping adventure, the world's deadliest assassin is all out of luck. Malmo, Sweden. What should have been a simple job - to kill the heir of an organised crime family - spirals out of control when assassin-for-hire Victor is shot and wounded. Forced to hide out in a hostile city while he recuperates, he doesn't realise that multiple enemies are circling to take advantage of his circumstances. Faced with a vengeful criminal organisation, corrupt officials, ruthless bounty hunters, elite mercenaries, and the most dangerous assassin he's ever faced, Victor's luck has finally run out. LISTEN ABOVE See omnystudio.com/listener for privacy information.
Nottingham Forest are looking to continue their transformation under Sean Dyche and make it four wins in a row when they host Brighton and Hove Albion at the City Ground in the Premier League on Sunday. We'll preview the game in full and reflect on the Europa League win against Malmo as Matt Davies is joined by Mikey Clarke and former Reds winger Franz Carr. #nffc #nottinghamforest
Matt Davies is joined by former Nottingham Forest defender and BBC Radio Nottingham summariser Brian Laws to discuss the news Morgan Gibbs-White will miss the Europa League game against Malmo through injury. We also reflect on Forest's 3-0 win at Liverpool and on Sean Dyche's impact on the club since his appointment as manager. #nffc #nottinghamforest
Nottingham Forest faced Malmo in a repeat of the 1979 European Cup final when the sides meet in the Europa League at the City Ground. It was a routine win this time for the Reds thanks to goals from Ryan Yates, Arnauld Kalimuendo and Nikola Milenkovic. There was an injury scare though for Murillo as he went off holding his hamstring in the second half. Matt Davies is joined by Simon Fotheringham and former Reds midfielder Sam Collins to discuss whatever happens in full. #nffc #nottinghamforest
Matt Davies is joined by Swedish football expert Josip Ladan to tell us more about Malmo FF ahead of their Europa League game against Nottingham Forest. We all know about the 1979 European Cup final but Josip tells us about their recent dominance of Swedish football before dramatic decline in the season which recently ended. We also discuss their best players, style of play and the chances of an upset at the City Ground with Malmo rated as huge underdogs. #nffc #nottinghamforest
Sean Dyche has had in Press conference before the Europa League game! For Exclusive Content Join FFTV Patreon: https://www.patreon.com/ForestFanTV Sean Dyche faced the media today in his pre-Malmö press conference, delivering some devastating injury news just 24 hours before our crucial Europa League clash. The biggest shock came with the confirmation that star midfielder Morgan Gibbs-White will be unavailable for tomorrow's game against the Swedish champions. This is a massive blow to the team's creative engine, and we break down exactly what this means for our tactical approach as the Reds look to secure vital European points away from home. The manager also provided crucial updates on the fitness of striker Chris Wood and defender Ola Aina. Wood and Aina's availability has been a major concern for the club, and while Dyche detailed their recovery status, questions remain over whether they will be fit enough to provide cover and rotation depth in this busy period. With a key playmaker suddenly ruled out, Dyche's lineup selection is now tougher than ever, and we analyse the immediate tactical adjustments that Forest will be forced to make against a difficult Malmö side. Don't miss this essential, full breakdown of all the news directly from the manager's press conference. How severely will the loss of Gibbs-White affect the formation, and which player needs to step up in midfield? Let us know in the comments below your predicted starting XI for the Malmö match given these late injury setbacks! Hit the like button, subscribe for all the best NFFC coverage, and ring the bell so you never miss a vital injury or transfer update! #nffc #europaleague #nottinghamforest Learn more about your ad choices. Visit podcastchoices.com/adchoices
Nottingham Forest face one of the most famous names in their history in Malmo on Thursday night. It's a repeat of the 1979 European Cup final, but this time the match in the Europa League at the City Ground. We'll preview the match in full as Forest look to climb the table against a team which has one point from four games and finished their domestic season three weeks ago. Matt Davies is joined by Emily Williamson and Steve Battlemuch. #nffc #nottinghamforest
Forest play historic European rival from Sweden in Malmo on Thursday! ⚽Football Prizes⚽ Massive 2999 Instant Win Prizes!: https://footballprizes.co.uk/product/yates-2/ The European journey continues! This Thursday, Nottingham Forest welcomes Swedish giants Malmö FF to the City Ground for a crucial Europa League clash. This is more than just a group stage fixture; it's a meeting steeped in history, harking back to that unforgettable 1979 European Cup Final in Munich! Join Wolfie as he guides you through everything you need to know ahead of this intriguing European encounter. The biggest question facing Sean Dyche after the incredible 3-0 victory over Liverpool is selection! With a congested fixture list, who will get rested, and who will start to maintain momentum? Wolfie runs through the potential starting lineup, debating whether key players should be wrapped in cotton wool or if Dyche will stick with a winning formula for this important European tie. We analyse the balance between progression in Europe and freshness for the Premier League, discussing who might get their chance to shine on the European stage. Facing Malmö FF, the dominant force in Swedish football, will be a stern test of our tactical discipline. We break down Malmö's style and discuss how Forest can exploit their high defensive line. We want your input! Jump in the comments and let us know your preferred starting XI for Thursday night! Does history favour the Reds, or will Malmö spoil the party? Like, share, and subscribe to Forest Fan TV for all the best build-up! #nffc #europaleague #nottinghamforest Learn more about your ad choices. Visit podcastchoices.com/adchoices
Join us in Malmo for a real-life #DividendTalk meetup. Signup here:
Join us in Malmo for a real-life #DividendTalk meetup. Sign up here:
Carl Mankert is a ceramic artist working in Chicago and Malmo, Sweden. His work ranges from functional terracotta gardenware to decorative raku and mosaics. The work often explores themes of emigration and immigration, belonging and blending in. His company, Chicago Kiln Service, Inc. provides sales and service of studio equipment. https://ThePottersCast.com/1178
In this episode we talk with Michelle of @OhthePlacesYoullGoTravels about the Denmark and Sweden part of their 10-day family trip to European, which proves that travel with kids can be affordable, flexible, and genuinely kid-friendly when you use points, stopovers, and creative lodging. Michelle walks us through the itinerary that started with a Flying Blue stopover in Amsterdam, an unforgettable day at LEGO House in Billund, and then a multi-night stay split between Copenhagen and Malmo, Sweden. She explains how she stitched the trip together using points for flights, a Mr & Mrs Smith penthouse booked on Hyatt points, a budget Kindred home-exchange apartment in Malmo, and an unexpected bargain Story Hotel stay by Hyatt — and how those choices kept the trip family-friendly and economical.We share concrete how-to tips you can use: how to book Flying Blue stopovers, when staying overnight in Billund (or renting a car) makes sense, where to store luggage and how LEGO House's zones keep kids entertained for hours, and why Malmo can be a smart, lower-cost base with a 30-minute train into Copenhagen. Michelle also breaks down the numbers and how with points they spent LESS out of pocket for flights, hotels, food and excursions for a family of 4 over 10 days and across many countries — a life-changing contrast to her family's earlier Florida road trip that was astronomically more! Join us to hear about this amazing, underrated and family friendly destination!Find Us On OnlineMary Ellen | JoFacebook GroupWonderland On Points BlogOhtheplacesyoullgotravel.comEpisode SponsorComfrt Hoodie EXTRA 15% OFF of Black Friday Sale Prices!Youtube Review of Comfrt HoodiesAffiliate LinksRakuten- Mary Ellen (Get 5000 AMEX POINTS)Rakuten- Joanna (Get 5000 AMEX POINTS)Chase/Capital One/Amex Card Links30% off the CardPointers subscription!FlyKitt- the BEST Jet Lag Solution!Tripiamo Driving TutorialsOur Favorite Travel NecessitiesWe receive a small commission when you choose to use any of our links to purchase your products or apply for your cards! We SO appreciate when you choose to give back to the podcast in this way!
We head to Sweden where a handful of mechanics are taking on one of the world's richest companies - and the world's richest man Elon Musk.The strike at Tesla's workshops has now reached its second anniversary. It's a clash of ideologies, in which each side has much at stake. And after two years there is still no prospect of a resolution.Presenter: Tim Mansel Sound mix: Annie Gardiner(Image: Striking worker Janis on the picket line in Malmo, Sweden.)
Warhammer Meta Chasers is a weekly competitive Warhammer 40k hype show. We run down some of the biggest and best events coming up this weekend where we discuss Warhammer 40k Factions in attendance and highlight army lists from some of the top ranked players around the globe. We talk about what the meta is, what it will be and how you can stack up against it. The show is hosted by Paul Murphy, Adam Camilleri, and Dustin Henshaw. The show runs LIVE every week on YouTube around 9pm EST every Thursday. We sincerely invite you to join us in chat if you can make it. The show is pushed to the Podcast aggregators soon after! We have an amazing chat community. Check out our Patreon here: https://www.patreon.com/WarhammerMetaChasers Join us live each and every Thursday on YouTube and join in our awesome chat community. Want to message the show another way? Hit up Paul on twitter @warmaster_tpm or on Instagram @fightswithdice
We've talked about Malmö Taproom many times but did you know they have a second bar (some would say hidden) bar? Just behind Malmö Taproom is Malmo Mead and Cocktail Bar. It's a small room with lots of meads, cocktails and craft beer. We tried the cleverly titled Ask Staff IPA. This is a solid IPA that wasn't on the Taproom menu. We followed that up with a couple of meads, Tropical Blueberry and Exclusive Peach. These are part of the new trend of meads of being bright, fruity and not honey syrup. Tropical Blueberry is a meger 7% ABV while Exclusive Peach comes in at 12%. It's a fun place to check. You'll find some extra beers that aren't part of Malmö Taprooms menu. #beer #craftbeer #drinks #mead #ipa #malmotaproom
durée : 00:14:41 - Le Disque classique du jour du lundi 15 septembre 2025 - Ce nouvel opus qui paraît sous le label Naxos nous fait découvrir les deux concertos pour violon et le concerto pour piano de la compositrice noire américaine Florence Price. Vous aimez ce podcast ? Pour écouter tous les autres épisodes sans limite, rendez-vous sur Radio France.
durée : 00:14:41 - Le Disque classique du jour du lundi 15 septembre 2025 - Ce nouvel opus qui paraît sous le label Naxos nous fait découvrir les deux concertos pour violon et le concerto pour piano de la compositrice noire américaine Florence Price. Vous aimez ce podcast ? Pour écouter tous les autres épisodes sans limite, rendez-vous sur Radio France.
The Europa League draw took 12pm UK time as Nottingham Forest return to European competition for the first time since 1996. Forest have drawn Porto (home), Braga (away), Ferencvaros (home), Betis (away), Midtylland (home), Sturm Graz (away), Malmo (home) and Utrecht (away). We've put the podcast out unedited as it was pretty funny as we explain how the draw works, watch the draw unfold live and then we'll discuss the latest Reds news as Elliot Anderson is named in the England squad with Morgan Gibbs-White by Thomas Tuchel for the first time. Meanwhile, Nicolo Savona is set to complete his transfer from Juventus. We'll also ask if Jose Mourinho could be the next Forest manager after he left Fenerbahce today. Finally, David Carmo has completed a loan transfer away from the club. Matt Davies is joined by Greg Mitchell, Michael Temple and Forest fan and travel expert Rob Adcock to tell us more about the formalities of the draw and each team Forest get. #NFFC #nottinghamforest
Max Hayes reflects on Forest's Europa League draw and Elliot Anderson being called up to the England squad. EXCLUSIVE NordVPN Deal ➼ https://nordvpn.com/garibaldi Try it risk-free now with a 30-day money-back guarantee
On the latest episode of ‘New Classical Tracks,' John Jeter leads the Malmo Symphony Orchestra, alongside soloists Han Chen and Fanny Clamagirand, in music by Florence Price and William Grant Still. Listen now with host Julie Amacher!
Recorded and published originally in February 2021Dotun and Tim are joined by Jonny Owen the director of 'I Believe in Miracles' and broadcaster to discuss Nottingham Forest's first European Cup win.They talk Brian Clough, music and Thatcher. Watch this episode on Youtube:https://youtu.be/IlnSadpuiPYFOLLOW THE BRAZILIAN SHIRT NAME ON INSTAGRAM: https://www.instagram.com/brazilshirtpod/ FOLLOW THE BRAZILIAN SHIRT NAME ON FACEBOOK: https://www.facebook.com/BrazilShirtPod FOLLOW THE BRAZILIAN SHIRT NAME ON TWITTER: https://twitter.com/BrazilShirtPod PURCHASE DOTUN'S LATEST BOOK, EFFRIES HERE: https://amzn.to/4cM260f
The Öresund Bridge, which connects Sweden and Demark, opened exactly 25 years ago today. It is one of Europe's most iconic bridges, carrying road and rail passengers between Copenhagen and Malmo. We look at how it has helped boost business and tourism and find out what other cities and countries might learn from its cross-border successes and challenges. Producer/presenter: Maddy Savage(Image: The bridge on a calm day. Credit Getty Images)
This podcast has introduced me to some wonderful and amazing people. One of them provided this recording of Robert Plant and Saving Grace with Suzi Dian, in Malmo, Sweden on May 9, 2025. I play a strong acoustic Four Sticks, a lovely Down to the Sea (off Fate of Nations), and a haunting Gillian Welch song, Orphan Girl. It's nice to hear Robert out and about and enjoying himself.
This episode was recorded before the tragic loss of Li. She was an amazing woman and helped create in the Malmö beer scene. She will be greatly missed. Our hearts go out to Anders, Malmö Taproom and everyone affected by her passing. We haven't reviewed a Malmo Brewing beer from Malmö Taproom in a long time. They got a new head brewer last year and we felt it was time to check in with them to see if things have changed. Björn from Björnguiden joins us this episode. Goodnik is a new hazy IPA made with nelson, waiti and nectaron. This is a really good start and a vast improvement over the hazy IPAs they have been making over the past few years. Malmö Brings all the Boys to the Yard is an imperial milkshake IPA at 10.1% ABV. It's a collab with Finnish brewers Tuju. Made with lactose, peach and vanilla, it was surprisingly drinkable and the 10.1% ABV could not be found. Lastly we tried an imperial pastry stout called Invisible Thunder. Filled with coconut, espresso beans and vanilla, it's also a heavy hitter at 11.3% AVB, This was quite the improvement over what we have tried in the past. Things are really starting to turn around! #beer #craftbeer #drinks #ipa #neipa #hazyipa #imperialstout #milkshakeipa
17:54 04.05.2025 On my way to visit my cousin in the west of Sweden. Spent two days in Stockholm. Beautiful city with its own atmosphere. Extremely similar to Germany but endearing in its design, spaciousness, and the things you can get away with because of how much people keep to themselves. Met cool people in the hostel. I'm trying something new by adding in voicenotes I've sent to other people. This one has a 10 min one to J dawg and one to my training mentor in German.
Based in Malmo, Sweden,Lotta & Per-Anders Jörgensen are the founders of the legendary food magazine Fool. Lotta is an art director and Per-Anders, or P.A. as I have come to call him, is a photographer. This is a magazine that launched in 2012 and has put out, thus far, 8 issues, very sporadically. It has been a few years since. The last issue, but as they reveal in the episode, there will be a #9.Aside of its unpredictable publishing schedule, Fool is a rare kind of magazine. In a world where everything moves so fast, where writing about food is mostly oriented towards minuscule bits of information on social media that keep coming at a rapid pace, one after the other, Fool is slow. It's thoughtful. It's reflective. It's stories are about interesting humans that work in food and their ideas, regardless of how well known they are. It's creative, with beautiful illustrations and photography, and stories that have always gone a little bit deeper than anywhere else. I had the pleasure of writing a few feature stories there and there was never any indication of what the word count should be. Just make it as long as you think it should be, they would say. That kind of collaboration is a dream for a writer or contributor of any sort. When you pick up an issue, you can read it like a book. A decade later, the stories remain relevant.Lotta and PA also create books, such as the Burnt Ends book, which we talked about with that restaurant's chef, Dave Pynt, in the previous episode. They've also worked with Andoni Luis Aduriz of Mugaritz, and quite a few other truly iconic chefs. There is also a documentary series they have created that they will launch soon, or at least soonish, or when it feels right. Anyway, their work has always been a big inspiration for me so it was a pleasure to have them on.READ MORE AT NEW WORLDER.
The idea of industrial food systems is flat, heavy, and feels complex to access. It brings up connotations of very bland, hyper-processed foods made to reach a large number of people at a low cost. There are important consequences to these food systems choices, though some are louder ones than others. My guest today, Franco Fubini, tackles an often under-appreciated one: flavours of ingredients. Franco Fubini is the founder and CEO of Natoora, and takes a unique approach to seasonality and sourcing for chefs and consumers across London, Paris, Milan, Copenhagen, Malmo, New York, LA, Miami, and Melbourne. He is also a professor of Sustainability Management at Columbia University in NYC. Franco is driven by his belief that engaging people with the real flavour of fruits and vegetables, arguing that we can collectively transform how food is being farmed and supplied if we focus more on a supply chain rooted in flavour, transparency, and direct relationships. He is also the author of In Search of the Perfect Peach: Why Flavour Holds the Answer to Fixing Our Food System. In today's episode, we look at the role that flavour plays in our food systems, and how flavour's decline has been connected to wartime economies and contemporary agricultural systems. Franco speaks to the work he's doing through Natoora, and how both old and new strategies are needed to model more sustainable, resilient, and locally-grounded food systems for the future. Learn More About Franco In Search of the Perfect Peach Natoora Website Instagram: @natoora and @francofubini
This cast ties together the 90's era of our story to the current ERA.💫 (if you want some XTRA background you can re-listen to the episodes 'Moving to LA' & 'Moving to Portland' as those two set this one UP.) The opening catches listeners up to Portland, Oregon in 1999 and the emotional STATE of things... but this cast could ultimately be called "The beginning of GRAILS". The bottom line is that one often has to throw everything out the window to open up a new horizon of possibilities. And that horizon, by definition, can often begin with a pathetic and destitute situation by LAW. But from that desert floor,, grow the WEEDS that become the terrain in every hero's journey rite? This cast features a couple very special guests... Alex Hall (Grails co-founder, who's living out in Malmo, Sweden) weighs in on the frame by frame situation from his perspective. And then Emil & Alex realize that they didn't so much 'start' Grails but that their mutual friend Brad Adkins (who introduced them) forced them to start the band against their WILL. So not only do you often begin the dream out in the desert with no food or water,, but sometimes there's a GUN to yr back too.🔥🚣♂️🔥 Learn more about your ad choices. Visit megaphone.fm/adchoices
Happy holidays! We'll be sharing snippets from Latent Space LIVE! through the break bringing you the best of 2024! We want to express our deepest appreciation to event sponsors AWS, Daylight Computer, Thoth.ai, StrongCompute, Notable Capital, and most of all all our LS supporters who helped fund the gorgeous venue and A/V production!For NeurIPS last year we did our standard conference podcast coverage interviewing selected papers (that we have now also done for ICLR and ICML), however we felt that we could be doing more to help AI Engineers 1) get more industry-relevant content, and 2) recap 2024 year in review from experts. As a result, we organized the first Latent Space LIVE!, our first in person miniconference, at NeurIPS 2024 in Vancouver.The single most requested domain was computer vision, and we could think of no one better to help us recap 2024 than our friends at Roboflow, who was one of our earliest guests in 2023 and had one of this year's top episodes in 2024 again. Roboflow has since raised a $40m Series B!LinksTheir slides are here:All the trends and papers they picked:* Isaac Robinson* Sora (see our Video Diffusion pod) - extending diffusion from images to video* SAM 2: Segment Anything in Images and Videos (see our SAM2 pod) - extending prompted masks to full video object segmentation* DETR Dominancy: DETRs show Pareto improvement over YOLOs* RT-DETR: DETRs Beat YOLOs on Real-time Object Detection* LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection* D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement* Peter Robicheaux* MMVP (Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs)* * Florence 2 (Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks) * PalíGemma / PaliGemma 2* PaliGemma: A versatile 3B VLM for transfer* PaliGemma 2: A Family of Versatile VLMs for Transfer* AlMv2 (Multimodal Autoregressive Pre-training of Large Vision Encoders) * Vik Korrapati - MoondreamFull Talk on YouTubeWant more content like this? Like and subscribe to stay updated on our latest talks, interviews, and podcasts.Transcript/Timestamps[00:00:00] Intro[00:00:05] AI Charlie: welcome to Latent Space Live, our first mini conference held at NeurIPS 2024 in Vancouver. This is Charlie, your AI co host. When we were thinking of ways to add value to our academic conference coverage, we realized that there was a lack of good talks, just recapping the best of 2024, going domain by domain.[00:00:36] AI Charlie: We sent out a survey to the over 900 of you. who told us what you wanted, and then invited the best speakers in the Latent Space Network to cover each field. 200 of you joined us in person throughout the day, with over 2, 200 watching live online. Our second featured keynote is The Best of Vision 2024, with Peter Robichaud and Isaac [00:01:00] Robinson of Roboflow, with a special appearance from Vic Corrapati of Moondream.[00:01:05] AI Charlie: When we did a poll of our attendees, the highest interest domain of the year was vision. And so our first port of call was our friends at Roboflow. Joseph Nelson helped us kickstart our vision coverage in episode 7 last year, and this year came back as a guest host with Nikki Ravey of Meta to cover segment Anything 2.[00:01:25] AI Charlie: Roboflow have consistently been the leaders in open source vision models and tooling. With their SuperVision library recently eclipsing PyTorch's Vision library. And Roboflow Universe hosting hundreds of thousands of open source vision datasets and models. They have since announced a 40 million Series B led by Google Ventures.[00:01:46] AI Charlie: Woohoo.[00:01:48] Isaac's picks[00:01:48] Isaac Robinson: Hi, we're Isaac and Peter from Roboflow, and we're going to talk about the best papers of 2024 in computer vision. So, for us, we defined best as what made [00:02:00] the biggest shifts in the space. And to determine that, we looked at what are some major trends that happened and what papers most contributed to those trends.[00:02:09] Isaac Robinson: So I'm going to talk about a couple trends, Peter's going to talk about a trend, And then we're going to hand it off to Moondream. So, the trends that I'm interested in talking about are These are a major transition from models that run on per image basis to models that run using the same basic ideas on video.[00:02:28] Isaac Robinson: And then also how debtors are starting to take over the real time object detection scene from the YOLOs, which have been dominant for years.[00:02:37] Sora, OpenSora and Video Vision vs Generation[00:02:37] Isaac Robinson: So as a highlight we're going to talk about Sora, which from my perspective is the biggest paper of 2024, even though it came out in February. Is the what?[00:02:48] Isaac Robinson: Yeah. Yeah. So just it's a, SORA is just a a post. So I'm going to fill it in with details from replication efforts, including open SORA and related work, such as a stable [00:03:00] diffusion video. And then we're also going to talk about SAM2, which applies the SAM strategy to video. And then how debtors, These are the improvements in 2024 to debtors that are making them a Pareto improvement to YOLO based models.[00:03:15] Isaac Robinson: So to start this off, we're going to talk about the state of the art of video generation at the end of 2023, MagVIT MagVIT is a discrete token, video tokenizer akin to VQ, GAN, but applied to video sequences. And it actually outperforms state of the art handcrafted video compression frameworks.[00:03:38] Isaac Robinson: In terms of the bit rate versus human preference for quality and videos generated by autoregressing on these discrete tokens generate some pretty nice stuff, but up to like five seconds length and, you know, not super detailed. And then suddenly a few months later we have this, which when I saw it, it was totally mind blowing to me.[00:03:59] Isaac Robinson: 1080p, [00:04:00] a whole minute long. We've got light reflecting in puddles. That's reflective. Reminds me of those RTX demonstrations for next generation video games, such as Cyberpunk, but with better graphics. You can see some issues in the background if you look closely, but they're kind of, as with a lot of these models, the issues tend to be things that people aren't going to pay attention to unless they're looking for.[00:04:24] Isaac Robinson: In the same way that like six fingers on a hand. You're not going to notice is a giveaway unless you're looking for it. So yeah, as we said, SORA does not have a paper. So we're going to be filling it in with context from the rest of the computer vision scene attempting to replicate these efforts. So the first step, you have an LLM caption, a huge amount of videos.[00:04:48] Isaac Robinson: This, this is a trick that they introduced in Dolly 3, where they train a image captioning model to just generate very high quality captions for a huge corpus and then train a diffusion model [00:05:00] on that. Their Sora and their application efforts also show a bunch of other steps that are necessary for good video generation.[00:05:09] Isaac Robinson: Including filtering by aesthetic score and filtering by making sure the videos have enough motion. So they're not just like kind of the generators not learning to just generate static frames. So. Then we encode our video into a series of space time latents. Once again, SORA, very sparse in details.[00:05:29] Isaac Robinson: So the replication related works, OpenSORA actually uses a MAG VIT V2 itself to do this, but swapping out the discretization step with a classic VAE autoencoder framework. They show that there's a lot of benefit from getting the temporal compression, which makes a lot of sense as the Each sequential frames and videos have mostly redundant information.[00:05:53] Isaac Robinson: So by compressing against, compressing in the temporal space, you allow the latent to hold [00:06:00] a lot more semantic information while avoiding that duplicate. So, we've got our spacetime latents. Possibly via, there's some 3D VAE, presumably a MAG VATV2 and then you throw it into a diffusion transformer.[00:06:19] Isaac Robinson: So I think it's personally interesting to note that OpenSORA is using a MAG VATV2, which originally used an autoregressive transformer decoder to model the latent space, but is now using a diffusion diffusion transformer. So it's still a transformer happening. Just the question is like, is it?[00:06:37] Isaac Robinson: Parameterizing the stochastic differential equation is, or parameterizing a conditional distribution via autoregression. It's also it's also worth noting that most diffusion models today, the, the very high performance ones are switching away from the classic, like DDPM denoising diffusion probability modeling framework to rectified flows.[00:06:57] Isaac Robinson: Rectified flows have a very interesting property that as [00:07:00] they converge, they actually get closer to being able to be sampled with a single step. Which means that in practice, you can actually generate high quality samples much faster. Major problem of DDPM and related models for the past four years is just that they require many, many steps to generate high quality samples.[00:07:22] Isaac Robinson: So, and naturally, the third step is throwing lots of compute at the problem. So I didn't, I never figured out how to manage to get this video to loop, but we see very little compute, medium compute, lots of compute. This is so interesting because the the original diffusion transformer paper from Facebook actually showed that, in fact, the specific hyperparameters of the transformer didn't really matter that much.[00:07:48] Isaac Robinson: What mattered was that you were just increasing the amount of compute that the model had. So, I love how in the, once again, little blog posts, they don't even talk about [00:08:00] like the specific hyperparameters. They say, we're using a diffusion transformer, and we're just throwing more compute at it, and this is what happens.[00:08:08] Isaac Robinson: OpenSora shows similar results. The primary issue I think here is that no one else has 32x compute budget. So we end up with these we end up in the middle of the domain and most of the related work, which is still super, super cool. It's just a little disappointing considering the context. So I think this is a beautiful extension of the framework that was introduced in 22 and 23 for these very high quality per image generation and then extending that to videos.[00:08:39] Isaac Robinson: It's awesome. And it's GA as of Monday, except no one can seem to get access to it because they keep shutting down the login.[00:08:46] SAM and SAM2[00:08:46] Isaac Robinson: The next, so next paper I wanted to talk about is SAM. So we at Roboflow allow users to label data and train models on that data. Sam, for us, has saved our users 75 years of [00:09:00] labeling time.[00:09:00] Isaac Robinson: We are the, to the best of my knowledge, the largest SAM API that exists. We also, SAM also allows us to have our users train just pure bounding box regression models and use those to generate high quality masks which has the great side effect of requiring less training data to have a meaningful convergence.[00:09:20] Isaac Robinson: So most people are data limited in the real world. So anything that requires less data to get to a useful thing is that super useful. Most of our users actually run their object per frame object detectors on every frame in a video, or maybe not most, but many, many. And so Sam follows into this category of taking, Sam 2 falls into this category of taking something that really really works and applying it to a video which has the wonderful benefit of being plug and play with most of our Many of our users use cases.[00:09:53] Isaac Robinson: We're, we're still building out a sufficiently mature pipeline to take advantage of that, but it's, it's in the works. [00:10:00] So here we've got a great example. We can click on cells and then follow them. You even notice the cell goes away and comes back and we can still keep track of it which is very challenging for existing object trackers.[00:10:14] Isaac Robinson: High level overview of how SAM2 works. We there's a simple pipeline here where we can give, provide some type of prompt and it fills out the rest of the likely masks for that object throughout the rest of the video. So here we're giving a bounding box in the first frame, a set of positive negative points, or even just a simple mask.[00:10:36] Isaac Robinson: I'm going to assume people are somewhat familiar with SAM. So I'm going to just give a high level overview of how SAM works. You have an image encoder that runs on every frame. SAM two can be used on a single image, in which case the only difference between SAM two and SAM is that image encoder, which Sam used a standard VIT [00:11:00] Sam two replaced that with a hara hierarchical encoder, which gets approximately the same results, but leads to a six times faster inference, which is.[00:11:11] Isaac Robinson: Excellent, especially considering how in a trend of 23 was replacing the VAT with more efficient backbones. In the case where you're doing video segmentation, the difference is that you actually create a memory bank and you cross attend the features from the image encoder based on the memory bank.[00:11:31] Isaac Robinson: So the feature set that is created is essentially well, I'll go more into it in a couple of slides, but we take the features from the past couple frames, plus a set of object pointers and the set of prompts and use that to generate our new masks. Then we then fuse the new masks for this frame with the.[00:11:57] Isaac Robinson: Image features and add that to the memory bank. [00:12:00] It's, well, I'll say more in a minute. The just like SAM, the SAM2 actually uses a data engine to create its data set in that people are, they assembled a huge amount of reference data, used people to label some of it and train the model used the model to label more of it and asked people to refine the predictions of the model.[00:12:20] Isaac Robinson: And then ultimately the data set is just created from the engine Final output of the model on the reference data. It's very interesting. This paradigm is so interesting to me because it unifies a model in a dataset in a way that is very unique. It seems unlikely that another model could come in and have such a tight.[00:12:37] Isaac Robinson: So brief overview of how the memory bank works, the paper did not have a great visual, so I'm just, I'm going to fill in a bit more. So we take the last couple of frames from our video. And we take the last couple of frames from our video attend that, along with the set of prompts that we provided, they could come from the future, [00:13:00] they could come from anywhere in the video, as well as reference object pointers, saying, by the way, here's what we've found so far attending to the last few frames has the interesting benefit of allowing it to model complex object motion without actually[00:13:18] Isaac Robinson: By limiting the amount of frames that you attend to, you manage to keep the model running in real time. This is such an interesting topic for me because one would assume that attending to all of the frames is super essential, or having some type of summarization of all the frames is super essential for high performance.[00:13:35] Isaac Robinson: But we see in their later ablation that that actually is not the case. So here, just to make sure that there is some benchmarking happening, we just compared to some of the stuff that's came out prior, and indeed the SAM2 strategy does improve on the state of the art. This ablation deep in their dependencies was super interesting to me.[00:13:59] Isaac Robinson: [00:14:00] We see in section C, the number of memories. One would assume that increasing the count of memories would meaningfully increase performance. And we see that it has some impact, but not the type that you'd expect. And that it meaningfully decreases speed, which justifies, in my mind, just having this FIFO queue of memories.[00:14:20] Isaac Robinson: Although in the future, I'm super interested to see A more dedicated summarization of all of the last video, not just a stacking of the last frames. So that another extension of beautiful per frame work into the video domain.[00:14:42] Realtime detection: DETRs > YOLO[00:14:42] Isaac Robinson: The next trend I'm interested in talking about is this interesting at RoboFlow, we're super interested in training real time object detectors.[00:14:50] Isaac Robinson: Those are bread and butter. And so we're doing a lot to keep track of what is actually happening in that space. We are finally starting to see something change. So, [00:15:00] for years, YOLOs have been the dominant way of doing real time object detection, and we can see here that they've essentially stagnated.[00:15:08] Isaac Robinson: The performance between 10 and 11 is not meaningfully different, at least, you know, in this type of high level chart. And even from the last couple series, there's not. A major change so YOLOs have hit a plateau, debtors have not. So we can look here and see the YOLO series has this plateau. And then these RT debtor, LW debtor, and Define have meaningfully changed that plateau so that in fact, the best Define models are plus 4.[00:15:43] Isaac Robinson: 6 AP on Cocoa at the same latency. So three major steps to accomplish this. The first RT deditor, which is technically a 2023 paper preprint, but published officially in 24, so I'm going to include that. I hope that's okay. [00:16:00] That is showed that RT deditor showed that we could actually match or out speed YOLOs.[00:16:04] Isaac Robinson: And then LWdebtor showed that pre training is hugely effective on debtors and much less so on YOLOs. And then DeFine added the types of bells and whistles that we expect from these types, this, this arena. So the major improvements that RTdebtor shows was Taking the multi scale features that debtors typically pass into their encoder and decoupling them into a much more efficient transformer encoder.[00:16:30] Isaac Robinson: The transformer is of course, quadratic complexity. So decreasing the amount of stuff that you pass in at once is super helpful for increasing your runtime or increasing your throughput. So that change basically brought us up to yellow speed and then they do a hardcore analysis on. Benchmarking YOLOs, including the NMS step.[00:16:54] Isaac Robinson: Once you once you include the NMS in the latency calculation, you see that in fact, these debtors [00:17:00] are outperforming, at least this time, the the, the YOLOs that existed. Then LW debtor goes in and suggests that in fact, the frame, the huge boost here is from pre training. So, this is the define line, and this is the define line without pre training.[00:17:19] Isaac Robinson: It's within range, it's still an improvement over the YOLOs, but Really huge boost comes from the benefit of pre training. When YOLOx came out in 2021, they showed that they got much better results by having a much, much longer training time, but they found that when they did that, they actually did not benefit from pre training.[00:17:40] Isaac Robinson: So, you see in this graph from LWdebtor, in fact, YOLOs do have a real benefit from pre training, but it goes away as we increase the training time. Then, the debtors converge much faster. LWdebtor trains for only 50 epochs, RTdebtor is 60 epochs. So, one could assume that, in fact, [00:18:00] the entire extra gain from pre training is that you're not destroying your original weights.[00:18:06] Isaac Robinson: By relying on this long training cycle. And then LWdebtor also shows superior performance to our favorite data set, Roboflow 100 which means that they do better on the real world, not just on Cocoa. Then Define throws all the bells and whistles at it. Yellow models tend to have a lot of very specific complicated loss functions.[00:18:26] Isaac Robinson: This Define brings that into the debtor world and shows consistent improvement on a variety of debtor based frameworks. So bring these all together and we see that suddenly we have almost 60 AP on Cocoa while running in like 10 milliseconds. Huge, huge stuff. So we're spending a lot of time trying to build models that work better with less data and debtors are clearly becoming a promising step in that direction.[00:18:56] Isaac Robinson: The, what we're interested in seeing [00:19:00] from the debtors in this, this trend to next is. Codetter and the models that are currently sitting on the top of the leaderboard for large scale inference scale really well as you switch out the backbone. We're very interested in seeing and having people publish a paper, potentially us, on what happens if you take these real time ones and then throw a Swingy at it.[00:19:23] Isaac Robinson: Like, do we have a Pareto curve that extends from the real time domain all the way up to the super, super slow but high performance domain? We also want to see people benchmarking in RF100 more, because that type of data is what's relevant for most users. And we want to see more pre training, because pre training works now.[00:19:43] Isaac Robinson: It's super cool.[00:19:48] Peter's Picks[00:19:48] Peter Robicheaux: Alright, so, yeah, so in that theme one of the big things that we're focusing on is how do we get more out of our pre trained models. And one of the lenses to look at this is through sort of [00:20:00] this, this new requirement for like, how Fine grained visual details and your representations that are extracted from your foundation model.[00:20:08] Peter Robicheaux: So it's sort of a hook for this Oh, yeah, this is just a list of all the the papers that I'm going to mention I just want to make sure I set an actual paper so you can find it later[00:20:18] MMVP (Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs)[00:20:18] Peter Robicheaux: Yeah, so sort of the big hook here is that I make the claim that LLMs can't see if you go to if you go to Claude or ChatGPT you ask it to see this Watch and tell me what time it is, it fails, right?[00:20:34] Peter Robicheaux: And so you could say, like, maybe, maybe the Like, this is, like, a very classic test of an LLM, but you could say, Okay, maybe this, this image is, like, too zoomed out, And it just, like, it'll do better if we increase the resolution, And it has easier time finding these fine grained features, Like, where the watch hands are pointing.[00:20:53] Peter Robicheaux: Nodice. And you can say, okay, well, maybe the model just doesn't know how to tell time from knowing the position of the hands. But if you actually prompt [00:21:00] it textually, it's very easy for it to tell the time. So this to me is proof that these LLMs literally cannot see the position of the watch hands and it can't see those details.[00:21:08] Peter Robicheaux: So the question is sort of why? And for you anthropic heads out there, cloud fails too. So the, the, my first pick for best paper of 2024 Envision is this MMVP paper, which tries to investigate the Why do LLMs not have the ability to see fine grained details? And so, for instance, it comes up with a lot of images like this, where you ask it a question that seems very visually apparent to us, like, which way is the school bus facing?[00:21:32] Peter Robicheaux: And it gets it wrong, and then, of course, it makes up details to support its wrong claim. And so, the process by which it finds these images is sort of contained in its hypothesis for why it can't. See these details. So it hypothesizes that models that have been initialized with, with Clip as their vision encoder, they don't have fine grained details and the, the features extracted using Clip because Clip sort of doesn't need to find these fine grained [00:22:00] details to do its job correctly, which is just to match captions and images, right?[00:22:04] Peter Robicheaux: And sort of at a high level, even if ChatGPT wasn't initialized with Clip and wasn't trained contrastively at all. The vision encoder wasn't trained contrastively at all. Still, in order to do its job of capturing the image it could do a pretty good job without actually finding the exact position of all the objects and visual features in the image, right?[00:22:21] Peter Robicheaux: So This paper finds a set of difficult images for these types of models. And the way it does it is it looks for embeddings that are similar in clip space, but far in DynaV2 space. So DynaV2 is a foundation model that was trained self supervised purely on image data. And it kind of uses like some complex student teacher framework, but essentially, and like, it patches out like certain areas of the image or like crops with certain areas of the image and tries to make sure that those have consistent representations, which is a way for it to learn very fine grained visual features.[00:22:54] Peter Robicheaux: And so if you take things that are very close in clip space and very far in DynaV2 space, you get a set of images [00:23:00] that Basically, pairs of images that are hard for a chat GPT and other big language models to distinguish. So, if you then ask it questions about this image, well, as you can see from this chart, it's going to answer the same way for both images, right?[00:23:14] Peter Robicheaux: Because to, to, from the perspective of the vision encoder, they're the same image. And so if you ask a question like, how many eyes does this animal have? It answers the same for both. And like all these other models, including Lava do the same thing, right? And so this is the benchmark that they create, which is like finding clip, like clip line pairs, which is pairs of images that are similar in clip space and creating a data set of multiple choice questions based off of those.[00:23:39] Peter Robicheaux: And so how do these models do? Well, really bad. Lava, I think, So, so, chat2BT and Jim and I do a little bit better than random guessing, but, like, half of the performance of humans who find these problems to be very easy. Lava is, interestingly, extremely negatively correlated with this dataset. It does much, much, much, much worse [00:24:00] than random guessing, which means that this process has done a very good job of identifying hard images for, for Lava, specifically.[00:24:07] Peter Robicheaux: And that's because Lava is basically not trained for very long and is initialized from Clip, and so You would expect it to do poorly on this dataset. So, one of the proposed solutions that this paper attempts is by basically saying, Okay, well if clip features aren't enough, What if we train the visual encoder of the language model also on dyno features?[00:24:27] Peter Robicheaux: And so it, it proposes two different ways of doing this. One, additively which is basically interpolating between the two features, and then one is interleaving, which is just kind of like training one on the combination of both features. So there's this really interesting trend when you do the additive mixture of features.[00:24:45] Peter Robicheaux: So zero is all clip features and one is all DynaV2 features. So. It, as you, so I think it's helpful to look at the right most chart first, which is as you increase the number of DynaV2 features, your model does worse and worse and [00:25:00] worse on the actual language modeling task. And that's because DynaV2 features were trained completely from a self supervised manner and completely in image space.[00:25:08] Peter Robicheaux: It knows nothing about text. These features aren't really compatible with these text models. And so you can train an adapter all you want, but it seems that it's in such an alien language that it's like a very hard optimization for this. These models to solve. And so that kind of supports what's happening on the left, which is that, yeah, it gets better at answering these questions if as you include more dyna V two features up to a point, but then you, when you oversaturate, it completely loses its ability to like.[00:25:36] Peter Robicheaux: Answer language and do language tasks. So you can also see with the interleaving, like they essentially double the number of tokens that are going into these models and just train on both, and it still doesn't really solve the MMVP task. It gets Lava 1. 5 above random guessing by a little bit, but it's still not close to ChachiPT or, you know, Any like human performance, obviously.[00:25:59] Peter Robicheaux: [00:26:00] So clearly this proposed solution of just using DynaV2 features directly, isn't going to work. And basically what that means is that as a as a vision foundation model, DynaV2 is going to be insufficient for language tasks, right?[00:26:14] Florence 2 (Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks)[00:26:14] Peter Robicheaux: So my next pick for best paper of 2024 would be Florence 2, which tries to solve this problem by incorporating not only This dimension of spatial hierarchy, which is to say pixel level understanding, but also in making sure to include what they call semantic granularity, which ends up, the goal is basically to have features that are sufficient for finding objects in the image, so they're, they're, they have enough pixel information, but also can be talked about and can be reasoned about.[00:26:44] Peter Robicheaux: And that's on the semantic granularity axis. So here's an example of basically three different paradigms of labeling that they do. So they, they create a big dataset. One is text, which is just captioning. And you would expect a model that's trained [00:27:00] only on captioning to have similar performance like chat2BT and like not have spatial hierarchy, not have features that are meaningful at the pixel level.[00:27:08] Peter Robicheaux: And so they add another type, which is region text pairs, which is essentially either classifying a region or You're doing object detection or doing instance segmentation on that region or captioning that region. And then they have text phrased region annotations, which is essentially a triple. And basically, not only do you have a region that you've described, you also find it's like, It's placed in a descriptive paragraph about the image, which is basically trying to introduce even more like semantic understanding of these regions.[00:27:39] Peter Robicheaux: And so like, for instance, if you're saying a woman riding on the road, right, you have to know what a woman is and what the road is and that she's on top of it. And that's, that's basically composing a bunch of objects in this visual space, but also thinking about it semantically, right? And so the way that they do this is they take basically they just dump Features from a vision encoder [00:28:00] straight into a encoder decoder transformer.[00:28:03] Peter Robicheaux: And then they train a bunch of different tasks like object detection and so on as a language task. And I think that's one of the big things that we saw in 2024 is these, these vision language models operating in, on pixel space linguistically. So they introduced a bunch of new tokens to point to locations and[00:28:22] Peter Robicheaux: So how does it work? How does it actually do? We can see if you look at the graph on the right, which is using the, the Dino, the the Dino framework your, your pre trained Florence 2 models transfer very, very well. They get 60%, 60 percent map on Cocoa, which is like approaching state of the art and they train[00:28:42] Vik Korrapati: with, and they[00:28:43] Peter Robicheaux: train with a much more more efficiently.[00:28:47] Peter Robicheaux: So they, they converge a lot faster, which both of these things are pointing to the fact that they're actually leveraging their pre trained weights effectively. So where is it falling short? So these models, I forgot to mention, Florence is a 0. 2 [00:29:00] billion and a 0. 7 billion parameter count. So they're very, very small in terms of being a language model.[00:29:05] Peter Robicheaux: And I think that. This framework, you can see saturation. So, what this graph is showing is that if you train a Florence 2 model purely on the image level and region level annotations and not including the pixel level annotations, like this, segmentation, it actually performs better as an object detector.[00:29:25] Peter Robicheaux: And what that means is that it's not able to actually learn all the visual tasks that it's trying to learn because it doesn't have enough capacity.[00:29:32] PalíGemma / PaliGemma 2[00:29:32] Peter Robicheaux: So I'd like to see this paper explore larger model sizes, which brings us to our next big paper of 2024 or two papers. So PolyGemma came out earlier this year.[00:29:42] Peter Robicheaux: PolyGemma 2 was released, I think like a week or two ago. Oh, I forgot to mention, you can actually train You can, like, label text datasets on RoboFlow and you can train a Florence 2 model and you can actually train a PolyGemma 2 model on RoboFlow, which we got into the platform within, like, 14 hours of release, which I was really excited about.[00:29:59] Peter Robicheaux: So, anyway, so [00:30:00] PolyGemma 2, so PolyGemma is essentially doing the same thing, but instead of doing an encoder decoder, it just dumps everything into a decoder only transformer model. But it also introduced the concept of location tokens to point to objects in pixel space. PolyGemma 2, so PolyGemma uses Gemma as the language encoder, and it uses Gemma2B.[00:30:17] Peter Robicheaux: PolyGemma 2 introduces using multiple different sizes of language encoders. So, the way that they sort of get around having to do encoder decoder is they use the concept of prefix loss. Which basically means that when it's generating, tokens autoregressively, it's all those tokens in the prefix, which is like the image that it's looking at and like a description of the task that it's trying to do.[00:30:41] Peter Robicheaux: They're attending to each other fully, full attention. Which means that, you know, it can sort of. Find high level it's easier for the, the prefix to color, to color the output of the suffix and also to just find like features easily. So this is sort of [00:31:00] an example of like one of the tasks that was trained on, which is like, you describe the task in English and then you give it all these, like, You're asking for it to segment these two classes of objects, and then it finds, like, their locations using these tokens, and it finds their masks using some encoding of the masks into tokens.[00:31:24] Peter Robicheaux: And, yeah, so, one of my critiques, I guess, of PolyGemma 1, at least, is that You find that performance saturates as a pre trained model after only 300 million examples seen. So, what this graph is representing is each blue dot is a performance on some downstream task. And you can see that after seeing 300 million examples, It sort of does equally well on all of the downtrend tasks that they tried it on, which was a lot as 1 billion examples, which to me also kind of suggests a lack of capacity for this model.[00:31:58] Peter Robicheaux: PolyGemma2, [00:32:00] you can see the results on object detection. So these were transferred to to Coco. And you can see that this sort of also points to an increase in capacity being helpful to the model. You can see as. Both the resolution increases, and the parameter count of the language model increases, performance increases.[00:32:16] Peter Robicheaux: So resolution makes sense, obviously, it helps to find small images, or small objects in the image. But it also makes sense for another reason, which is that it kind of gives the model a thinking register, and it gives it more tokens to, like, process when making its predictions. But yeah, you could, you could say, oh, 43.[00:32:30] Peter Robicheaux: 6, that's not that great, like Florence 2 got 60. But this is not Training a dino or a debtor on top of this language or this image encoder. It's doing the raw language modeling task on Cocoa. So it doesn't have any of the bells and whistles. It doesn't have any of the fancy losses. It doesn't even have bipartite graph matching or anything like that.[00:32:52] Peter Robicheaux: Okay, the big result and one of the reasons that I was really excited about this paper is that they blow everything else away [00:33:00] on MMVP. I mean, 47. 3, sure, that's nowhere near human accuracy, which, again, is 94%, but for a, you know, a 2 billion language, 2 billion parameter language model to be chat2BT, that's quite the achievement.[00:33:12] Peter Robicheaux: And that sort of brings us to our final pick for paper of the year, which is AIMV2. So, AIMV2 sort of says, okay, Maybe this language model, like, maybe coming up with all these specific annotations to find features and with high fidelity and pixel space isn't actually necessary. And we can come up with an even simpler, more beautiful idea for combining you know, image tokens and pixel tokens in a way that's interfaceable for language tasks.[00:33:44] Peter Robicheaux: And this is nice because it can scale, you can come up with lots more data if you don't have to come up with all these annotations, right? So the way that it works. is it does something very, very similar to PolyGemo, where you have a vision encoder that dumps image tokens into a decoder only transformer.[00:33:59] Peter Robicheaux: But [00:34:00] the interesting thing is that it also autoregressively tries to learn the mean squared error of the image tokens. So instead of having to come up with fancy object detection or semantic, or segment, or segmentation labels, you can just try to reconstruct the image and have it learn fine grained features that way.[00:34:16] Peter Robicheaux: And it does this in kind of, I think, a beautiful way that's kind of compatible with the PolyGemma line of thinking, which is randomly sampling a prefix line of thinking Prefix length and using only this number of image tokens as the prefix. And so doing a similar thing with the causal. So the causal with prefix is the, the attention mask on the right.[00:34:35] Peter Robicheaux: So it's doing full block attention with some randomly sampled number of image tokens to then reconstruct the rest of the image and the downstream caption for that image. And so, This is the dataset that they train on. It's image or internet scale data, very high quality data created by the data filtering networks paper, essentially which is maybe The best clip data that exists.[00:34:59] Peter Robicheaux: [00:35:00] And we can see that this is finally a model that doesn't saturate. It's even at the highest parameter count, it's, it appears to be, oh, at the highest parameter account, it appears to be improving in performance with more and more samples seen. And so you can sort of think that. You know, if we just keep bumping the parameter count and increasing the example scene, which is the, the, the line of thinking for language models, then it'll keep getting better.[00:35:27] Peter Robicheaux: So how does it actually do at finding, oh, it also improves with resolution, which you would expect for a model that This is the ImageNet classification accuracy, but yeah, it does better if you increase the resolution, which means that it's actually leveraging and finding fine grained visual features.[00:35:44] Peter Robicheaux: And so how does that actually do compared to CLIP on Cocoa? Well, you can see that if you slap a transformer detection head on it, Entry now in Cocoa, it's just 60. 2, which is also within spitting distance of Soda, which means that it does a very good job of [00:36:00] finding visual features, but you could say, okay, well, wait a second.[00:36:03] Peter Robicheaux: Clip got to 59. 1, so. Like, how does this prove your claim at all? Because doesn't that mean like clip, which is known to be clip blind and do badly on MMVP, it's able to achieve a very high performance on fine, on this fine grained visual features task of object detection, well, they train on like, Tons of data.[00:36:24] Peter Robicheaux: They train on like objects, 365, Cocoa, Flickr and everything else. And so I think that this benchmark doesn't do a great job of selling how good of a pre trained model MV2 is. And we would like to see the performance on fewer data as examples and not trained to convergence on object detection. So seeing it in the real world on like a dataset, like RoboFlow 100, I think would be quite interesting.[00:36:48] Peter Robicheaux: And our, our, I guess our final, final pick for paper of 2024 would be Moondream. So introducing Vic to talk about that.[00:36:54] swyx: But overall, that was exactly what I was looking for. Like best of 2024, an amazing job. Yeah, you can, [00:37:00] if there's any other questions while Vic gets set up, like vision stuff,[00:37:07] swyx: yeah,[00:37:11] swyx: Vic, go ahead. Hi,[00:37:13] Vik Korrapati / Moondream[00:37:13] question: well, while we're getting set up, hi, over here, thanks for the really awesome talk. One of the things that's been weird and surprising is that the foundation model companies Even these MLMs, they're just like worse than RT Tether at detection still. Like, if you wanted to pay a bunch of money to auto label your detection dataset, If you gave it to OpenAI or Cloud, that would be like a big waste.[00:37:37] question: So I'm curious, just like, even Pali Gemma 2, like is worse. So, so I'm curious to hear your thoughts on like, how come, Nobody's cracked the code on like a generalist that really you know, beats a specialist model in computer vision like they have in in LLM land.[00:38:00][00:38:01] Isaac Robinson: Okay. It's a very, very interesting question. I think it depends on the specific domain. For image classification, it's basically there. In the, in AIMv2 showed, a simple attentional probe on the pre trained features gets like 90%, which is as well as anyone does. The, the, the, the bigger question, like, why isn't it transferring to object detection, especially like real time object detection.[00:38:25] Isaac Robinson: I think, in my mind, there are two answers. One is, object detection is really, really, really the architectures are super domain specific. You know, we see these, all these super, super complicated things, and it's not super easy to, to, to build something that just transfers naturally like that, whereas image classification, you know, clip pre training transfers super, super quickly.[00:38:48] Isaac Robinson: And the other thing is, until recently, the real time object detectors didn't even really benefit from pre training. Like, you see the YOLOs that are like, essentially saturated, showing very little [00:39:00] difference with pre training improvements, with using pre trained model at all. It's not surprising, necessarily, that People aren't looking at the effects of better and better pre training on real time detection.[00:39:12] Isaac Robinson: Maybe that'll change in the next year. Does that answer your question?[00:39:17] Peter Robicheaux: Can you guys hear me? Yeah, one thing I want to add is just like, or just to summarize, basically, is that like, Until 2024, you know, we haven't really seen a combination of transformer based object detectors and fancy losses, and PolyGemma suffers from the same problem, which is basically to say that these ResNet, or like the convolutional models, they have all these, like, extreme optimizations for doing object detection, but essentially, I think it's kind of been shown now that convolution models like just don't benefit from pre training and just don't like have the level of intelligence of transformer models.[00:39:56] swyx: Awesome. Hi,[00:39:59] Vik Korrapati: can [00:40:00] you hear me?[00:40:01] swyx: Cool. I hear you. See you. Are you sharing your screen?[00:40:04] Vik Korrapati: Hi. Might have forgotten to do that. Let me do[00:40:07] swyx: that. Sorry, should have done[00:40:08] Vik Korrapati: that.[00:40:17] swyx: Here's your screen. Oh, classic. You might have to quit zoom and restart. What? It's fine. We have a capture of your screen.[00:40:34] swyx: So let's get to it.[00:40:35] Vik Korrapati: Okay, easy enough.[00:40:49] Vik Korrapati: All right. Hi, everyone. My name is Vic. I've been working on Moondream for almost a year now. Like Shawn mentioned, I just went and looked and it turns out the first version I released December [00:41:00] 29, 2023. It's been a fascinating journey. So Moonbeam started off as a tiny vision language model. Since then, we've expanded scope a little bit to also try and build some tooling, client libraries, et cetera, to help people really deploy it.[00:41:13] Vik Korrapati: Unlike traditional large models that are focused at assistant type use cases, we're laser focused on building capabilities that developers can, sorry, it's yeah, we're basically focused on building capabilities that developers can use to build vision applications that can run anywhere. So, in a lot of cases for vision more so than for text, you really care about being able to run on the edge, run in real time, etc.[00:41:40] Vik Korrapati: So That's really important. We have we have different output modalities that we support. There's query where you can ask general English questions about an image and get back human like answers. There's captioning, which a lot of our users use for generating synthetic datasets to then train diffusion models and whatnot.[00:41:57] Vik Korrapati: We've done a lot of work to minimize those sessions there. [00:42:00] So that's. Use lot. We have open vocabulary object detection built in similar to a couple of more recent models like Palagem, et cetera, where rather than having to train a dedicated model, you can just say show me soccer balls in this image or show me if there are any deer in this image, it'll detect it.[00:42:14] Vik Korrapati: More recently, earlier this month, we released pointing capability where if all you're interested in is the center of an object you can just ask it to point out where that is. This is very useful when you're doing, you know, I automation type stuff. Let's see, LA we, we have two models out right now.[00:42:33] Vik Korrapati: There's a general purpose to be para model, which runs fair. Like it's, it's it's fine if you're running on server. It's good for our local Amma desktop friends and it can run on flagship, flagship mobile phones, but it never. so much for joining us today, and we'll see you in the [00:43:00] next one. Less memory even with our not yet fully optimized inference client.[00:43:06] Vik Korrapati: So the way we built our 0. 5b model was to start with the 2 billion parameter model and prune it while doing continual training to retain performance. We, our objective during the pruning was to preserve accuracy across a broad set of benchmarks. So the way we went about it was to estimate the importance of different components of the model, like attention heads, channels MLP rows and whatnot using basically a technique based on the gradient.[00:43:37] Vik Korrapati: I'm not sure how much people want to know details. We'll be writing a paper about this, but feel free to grab me if you have more questions. Then we iteratively prune a small chunk that will minimize loss and performance retrain the model to recover performance and bring it back. The 0. 5b we released is more of a proof of concept that this is possible.[00:43:54] Vik Korrapati: I think the thing that's really exciting about this is it makes it possible for for developers to build using the 2B param [00:44:00] model and just explore, build their application, and then once they're ready to deploy figure out what exactly they need out of the model and prune those capabilities into a smaller form factor that makes sense for their deployment target.[00:44:12] Vik Korrapati: So yeah, very excited about that. Let me talk to you folks a little bit about another problem I've been working on recently, which is similar to the clocks example we've been talking about. We had a customer reach out who was talking about, like, who had a bunch of gauges out in the field. This is very common in manufacturing and oil and gas, where you have a bunch of analog devices that you need to monitor.[00:44:34] Vik Korrapati: It's expensive to. And I was like, okay, let's have humans look at that and monitor stuff and make sure that the system gets shut down when the temperature goes over 80 or something. So I was like, yeah, this seems easy enough. Happy to, happy to help you distill that. Let's, let's get it going. Turns out our model couldn't do it at all.[00:44:51] Vik Korrapati: I went and looked at other open source models to see if I could just generate a bunch of data and learn from that. Did not work either. So I was like, let's look at what the folks with [00:45:00] hundreds of billions of dollars in market cap have to offer. And yeah, that doesn't work either. My hypothesis is that like the, the way these models are trained are using a large amount of image text data scraped from the internet.[00:45:15] Vik Korrapati: And that can be biased. In the case of gauges, most gauge images aren't gauges in the wild, they're product images. Detail images like these, where it's always set to zero. It's paired with an alt text that says something like GIVTO, pressure sensor, PSI, zero to 30 or something. And so the models are fairly good at picking up those details.[00:45:35] Vik Korrapati: It'll tell you that it's a pressure gauge. It'll tell you what the brand is, but it doesn't really learn to pay attention to the needle over there. And so, yeah, that's a gap we need to address. So naturally my mind goes to like, let's use synthetic data to, Solve this problem. That works, but it's problematic because it turned out we needed millions of synthetic gauge images to get to reasonable performance.[00:45:57] Vik Korrapati: And thinking about it, reading a gauge is like [00:46:00] not a one, like it's not a zero short process in our minds, right? Like if you had to tell me the reading in Celsius for this, Real world gauge. There's two dials on there. So first you have to figure out which one you have to be paying attention to, like the inner one or the outer one.[00:46:14] Vik Korrapati: You look at the tip of the needle, you look at what labels it's between, and you count how many and do some math to figure out what that probably is. So what happens if we just add that as a Chain of thought to give the model better understanding of the different sub, to allow the model to better learn the subtasks it needs to perform to accomplish this goal.[00:46:37] Vik Korrapati: So you can see in this example, this was actually generated by the latest version of our model. It's like, okay, Celsius is the inner scale. It's between 50 and 60. There's 10 ticks. So the second tick, it's a little debatable here, like there's a weird shadow situation going on, the dial is off, so I don't know what the ground truth is, but it works okay.[00:46:57] Vik Korrapati: There's points on there that are, the points [00:47:00] over there are actually grounded. I don't know if this is easy to see, but when I click on those, there's a little red dot that moves around on the image. The model actually has to predict where this points are, I was already trying to do this with bounding boxes, but then Malmo came out with pointing capabilities.[00:47:15] Vik Korrapati: And it's like pointing is a much better paradigm to to represent this. We see pretty good results. This one's actually for clock reading. I couldn't find our chart for gauge reading at the last minute. So the light. Blue chart is with our rounded chain of thought. This measures, we have, we built a clock reading benchmark about 500 images.[00:47:37] Vik Korrapati: This measures accuracy on that. You can see it's a lot more sample efficient when you're using the chain of thought to model. Another big benefit from this approach is like, you can kind of understand how the model is. it and how it's failing. So in this example, the actual correct reading is 54 Celsius, the model output [00:48:00] 56, not too bad but you can actually go and see where it messed up. Like it got a lot of these right, except instead of saying it was on the 7th tick, it actually predicted that it was the 8th tick and that's why it went with 56.[00:48:14] Vik Korrapati: So now that you know that this. Failing in this way, you can adjust how you're doing the chain of thought to maybe say like, actually count out each tick from 40, instead of just trying to say it's the eighth tick. Or you might say like, okay, I see that there's that middle thing, I'll count from there instead of all the way from 40.[00:48:31] Vik Korrapati: So helps a ton. The other thing I'm excited about is a few short prompting or test time training with this. Like if a customer has a specific gauge that like we're seeing minor errors on, they can give us a couple of examples where like, if it's miss detecting the. Needle, they can go in and correct that in the chain of thought.[00:48:49] Vik Korrapati: And hopefully that works the next time. Now, exciting approach, we only apply it to clocks and gauges. The real question is, is it going to generalize? Probably, like, there's some science [00:49:00] from text models that when you train on a broad number of tasks, it does generalize. And I'm seeing some science with our model as well.[00:49:05] Vik Korrapati: So, in addition to the image based chain of thought stuff, I also added some spelling based chain of thought to help it understand better understand OCR, I guess. I don't understand why everyone doesn't do this, by the way. Like, it's trivial benchmark question. It's Very, very easy to nail. But I also wanted to support it for stuff like license plate, partial matching, like, hey, does any license plate in this image start with WHA or whatever?[00:49:29] Vik Korrapati: So yeah, that sort of worked. All right, that, that ends my story about the gauges. If you think about what's going on over here it's interesting that like LLMs are showing enormous. Progress in reasoning, especially with the latest set of models that we've seen, but we're not really seeing, I have a feeling that VLMs are lagging behind, as we can see with these tasks that should be very simple for a human to do [00:50:00] that are very easy to find VLMs failing at.[00:50:04] Vik Korrapati: My hypothesis on why this is the case is because On the internet, there's a ton of data that talks about how to reason. There's books about how to solve problems. There's books critiquing the books about how to solve problems. But humans are just so good at perception that we never really talk about it.[00:50:20] Vik Korrapati: Like, maybe in art books where it's like, hey, to show that that mountain is further away, you need to desaturate it a bit or whatever. But the actual data on how to, like, look at images is, isn't really present. Also, the Data we have is kind of sketched. The best source of data we have is like image all text pairs on the internet and that's pretty low quality.[00:50:40] Vik Korrapati: So yeah, I, I think our solution here is really just we need to teach them how to operate on individual tasks and figure out how to scale that out. All right. Yep. So conclusion. At Moondream we're trying to build amazing PLMs that run everywhere. Very hard problem. Much work ahead, but we're making a ton of progress and I'm really excited [00:51:00] about If anyone wants to chat about more technical details about how we're doing this or interest in collaborating, please, please hit me up.[00:51:08] Isaac Robinson: Yeah,[00:51:09] swyx: like, I always, when people say, when people say multi modality, like, you know, I always think about vision as the first among equals in all the modalities. So, I really appreciate having the experts in the room. Get full access to Latent Space at www.latent.space/subscribe
In light of controversies at this year's Eurovision in Malmo, a new code of conduct is being introduced by the European Broadcasting Union...but what does it entail?Joining Seán to discuss is Richard Taylor, contributor to Eurovision Ireland - the official Irish Eurovision Blog.
After a disastrous performance against Levadiakos, Olympiacos fought back by winning their first away match in the Europa League this season against Malmo. Jose Luis Mendilibar's side is starting to look like a serious contender for a spot in the top eight, which leads to automatic qualification for the Last 16 --- Support this podcast: https://podcasters.spotify.com/pod/show/gate7/support
Warhammer Meta Chasers is a weekly competitive Warhammer 40k hype show. We run down some of the biggest and best events coming up this weekend where we discuss Warhammer 40k Factions in attendance and highlight army lists from some of the top ranked players around the globe. We talk about what the meta is, what it will be and how you can stack up against it. The show is hosted by Paul Murphy, Adam Camilleri, and Dustin Henshaw. The show runs LIVE every week on YouTube around 9pm EST every Thursday. We sincerely invite you to join us in chat if you can make it. The show is pushed to the Podcast aggregators soon after! We have an amazing chat community. Check out our Patreon here: https://www.patreon.com/WarhammerMetaChasers Join us live each and every Thursday on YouTube and join in our awesome chat community. Want to message the show another way? Hit up Paul on twitter @warmaster_tpm or on Instagram @fightswithdice
It's time for The Truth!Sam Tighe and Dougie Critchley take a look back through the annals of history to analyse the career of Zlatan Ibrahimovic, and ask the question - was he a world class striker in the pantheons of the all-time greats, or just an excellent player in his day?Zlatan has long divided opinion, even in a world before the hyper-sensationalisation of social media that we see today. From dancing through players on his debut, scoring some of the best goals that the game has ever seen, referring to himself exclusively in the third person, needing to be the lightning rod for whichever team he played for, and constant references to lions and gods, there is no denying the man was pure entertainment. But entertaining and being one of the greats are two different things. So is Zlatan right up there with the greatest ever to play the game? Or was his own perception of himself bigger than his legacy on the pitch?The Truth is somewhere in the middle... Hosts: Sam Tighe & Dougie CritchleyProduction & Editing: Jack Collins Studio Recording: Footwork Media And remember, if you'd like more from the Rank Squad, including extra podcasts every Monday and Friday (including our weekly Postbox taking a look at the whole weekend of football) and access to our brilliant Discord community, then why not join us here on Patreon?
Have you visited a food hall recently? It's a venue bringing together multiple independent food and drink businesses, often with communal seating. We look at the ways in which food halls are being used to bring consumers and spend to new areas, raising the value of surrounding offices, apartments and other businesses. In this programme Devina Gupta visits Society food hall in Manchester in the UK, where she meets Julia Martinelli, who manages the pizza offering from Noi Quattro restaurant and Reece Gibson, operations manager for Vocation Brewery which runs the bar. Mariko Oi in Singapore reports from the Maxwell Hawker Centre in Singapore, to explore how today's food halls have evolved from street food traders. Frode Rønne Malmo from Mathallen in Oslo, Norway and Spiros Loukopoulos, from Reffen in Copenhagen, Denmark talk about the ways in which their food halls have brought people to the surrounding area. Food hall consultant Philip Colicchio in New York in the US explains why this business model has been so popular. Presented by Devina Gupta. Produced by Beatrice Pickup. Additional reporting by Mariko Oi. (Image: a man and a woman enjoying plates of food in a food hall. Credit: Getty Images/BBC)
Greta Thunberg joins anti-Semitic protesters in Malmo, Sweden; Joe Biden declares that he won't leave anyone behind while abandoning American hostages to Hamas; and The New York Times says that the Republicans are the real antisemites. Click here to join the member exclusive portion of my show: https://utm.io/ueSEj Ep.1962 - - - DailyWire+: Watch the premiere of our new animated sitcom Mr. Birchum this Sunday, May 12th at 9 PM ET on DailyWire+: https://bit.ly/4akO7wC Get 25% off your DailyWire+ Membership here: https://bit.ly/4akO7wC - - - Today's Sponsors: Eight Sleep - Exclusive discount for my listeners with promo code: SHAPIRO at https://www.eightsleep.com/shapiro/ Beam - Get 40% off for a limited time! http://www.ShopBeam.com/BEN Bambee - Visit https://www.bambee.com/ and type in ‘Ben Shapiro' when you sign up. Blinds - Exclusive Discount for my Listeners! Tell them The Ben Shapiro Show sent you! https://www.blinds.com/ Robinhood - Learn more by downloading the Robinhood app or by visiting http://www.Robinhood.com *Returns are not guaranteed. Interest is earned on uninvested cash swept from your brokerage account to program banks. The cash sweep program is offered through Robinhood Financial LLC. Terms apply. Robinhood is not a bank. Bigger instant deposits are only available if your instant deposit status is in good standing. Robinhood Financial LLC (member SIPC) is a registered broker-dealer. - - - Socials: Follow on Twitter: https://bit.ly/3cXUn53 Follow on Instagram: https://bit.ly/3QtuibJ Follow on Facebook: https://bit.ly/3TTirqd Subscribe on YouTube: https://bit.ly/3RPyBiB
It's that time of year again when the entirety of Europe (and a few other countries) come together to celebrate kitschy, bombastic songwriting through the Eurovision Song Contest! This year's competition, held in Malmo, Sweden, features everything from rave-pop on behalf of the Netherlands, to folk-rapping hybrids courtesy of Ukraine – and Charlie and Nate are here to musicologically unpack the craziest tracks that have the potential to win it all. For more on the controversy surrounding this year's contest, check out Charlie's appearance on Vox's podcast Today, Explained. Songs discussed: Joost – Europapa Angelina Mango – La noia alyona alyona, Jerry Heil – Teresa & Maria Nemo – The Code Baby Lasagna – Rim Tim Tagi Dim Windows95Man – No Rules! Kaleen – We Will Rave Olly Alexander – Dizzy Bambie Thug – Doomsday Blue Ladaniva – Jako Learn more about your ad choices. Visit podcastchoices.com/adchoices