Podcasts about Colab

  • 535PODCASTS
  • 1,082EPISODES
  • 51mAVG DURATION
  • 5WEEKLY NEW EPISODES
  • May 22, 2025LATEST
Colab

POPULARITY

20172018201920202021202220232024

Categories



Best podcasts about Colab

Show all podcasts related to colab

Latest podcast episodes about Colab

Beercast Brasil
BC#602 – Ed Muller da Oldsmobier

Beercast Brasil

Play Episode Listen Later May 22, 2025 41:19


Negócios nascem e morrem e fica sempre tudo o que aprendemos com a jornada. Ed Muller terminou sua história com a Colab e agora volta a tocar as cervejas da Oldsmobier. Nessa entrevista, Ed conta tudo o que aconteceu para Anselmo Mendo e Victor Marinho.

News Talk 920 KVEC
Hometown Radio 05/13/25 6p: Mike Brown retires from COLAB of SLO County

News Talk 920 KVEC

Play Episode Listen Later May 14, 2025 43:35


Hometown Radio 05/13/25 6p: Mike Brown retires from COLAB of SLO County

Project Inclusion: The Podcast

What if everything we thought you knew about education was holding us back? It's time to unlearn, reimagine, and step into the future of learning with CoLAB. Listen to our latest podcast episode featuring “who” from CoLAB, an organization whose vision is to co-design next-ready resilient communities where education, creativity and social responsibility converge to drive impact in a rapidly changing world. CoLAB is redefining education with a bold, student-centered philosophy that fuses design thinking, critical inquiry, creativity, and service. More than just a learning model, it's an agile ecosystem designed to equip students with the skills and mindset needed to thrive in the Fourth Industrial Revolution. Unlike traditional approaches, CoLAB champions co-creation, student agency, and the exploration of limitless possibilities—drawing inspiration from quantum physics, psychology, and the sciences to challenge conventional wisdom. What happens when we bring people together in an environment that intentionally designs for critical inquiry, imagination, and creativity, turning K-12 education on its head? What happens when you infuse design thinking into the DNA of a classroom? How can we provide a pathway to advancement—one where people of all ages, inside and outside classrooms, can develop the skills and capacities to lead and realize potential in the fourth industrial revolution What do outcomes look like when we can rewrite traditional ways of teaching, and train educators to bring these next generation classrooms to life?  

Move Fast. Break Shit. Burn Out.
Thomas Knoll, Head of Innov8rs CoLab: Do you have a corporate failure policy?

Move Fast. Break Shit. Burn Out.

Play Episode Listen Later May 9, 2025 39:06


In this episode, we sit down with Thomas Knoll, Head of Innov8rs CoLab, who has spent years bringing together innovation and intrapreneurship communities. Tommy dives into the crucial importance of understanding an organization's "tolerance for change"—especially how much support truly exists at the C-suite level. We explore his thought-provoking article, "Does Your Organization Have a Failure Policy?", unpacking the significance of setting clear failure guardrails within organizations. Through contrasting examples like SpaceX and Boeing, we discuss how intentionality around failure policies can drive innovation, learning, and long-term success. Whether you're leading change or looking to foster a culture of smart risk-taking, this conversation is packed with actionable insights.Original music by ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Lynz Floren⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠.

News Talk 920 KVEC
Hometown Radio 04/29/25 4p: Mike Brown retires from COLAB of SLO County

News Talk 920 KVEC

Play Episode Listen Later Apr 30, 2025 43:35


Hometown Radio 04/29/25 4p: Mike Brown retires from COLAB of SLO County

Unpaid And Underrated
103 : 50% Timestamps

Unpaid And Underrated

Play Episode Listen Later Apr 29, 2025 78:00


This week Joey gathers a gaggle of silly geese yet again to discuss some hard hitting problems in the lifting world. They dive right into some great topics naps, the TikTok voice, birds, and a Crü deathmatch. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest Big Ryan On Instagram @angrym0nkie (https://www.instagram.com/angrym0nkie/) Big Gary On Instagram @npc_gary (https://www.instagram.com/npc_gary/) Big Joey On Instagram @joey.echeverria (https://www.instagram.com/joey.echeverria/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guests: Big Brad (Gary?), Big Joey, and Big Mofo Guy.

IT’S JUST COFFEE!
Why Coffee Costs Are Rising Fast ! Inside the C-Price with Condesa Co.Lab | OLIVER BROWN!

IT’S JUST COFFEE!

Play Episode Listen Later Apr 28, 2025 65:07


Why is the cost of coffee rising so quickly?Today we're joined by Oliver Brown, Sales and Relations Manager at Condesa Co.Lab, to unpack the complexities and volatility of coffee pricing.We'll break down the C-price, dive into the dynamics of supply and demand, and explore the environmental challenges impacting coffee production. Plus we're asking the big question: what does the future of the coffee industry hold? Could we really be paying $10 for a latte by the end of 2025?!If you're new here (welcome), our show dives into some of the best coffee conversations on the internet, but we will always remind ourselves at the end of the day - It's Just Coffee!Check out Condesa Co.Lab here: https://www.instagram.com/condesa_co_lab/ Want more coffee content? IT'S JUST COFFEE: https://linktr.ee/itsjustcoffeepod?utm_source=linktree_profile_share<sid=4e8cead0-6644-4c4a-b419-28c825b1b236Want to get in touch? Hit us up at hello@itsjustcoffeepod.com for any questions or comments.Proudly sponsored by -Eco Barista: https://www.ecobarista.com.au/ Apax Lab: https://apaxlab.com/ Thanks for listening! Learn more about your ad choices. Visit megaphone.fm/adchoices

Breezy With No Filter
Who Let Breezy Get A Puppy? & Why Do We Eat Ham On Easter? First Intro To New Podcast Colab "Not Sorry" With Cait

Breezy With No Filter

Play Episode Listen Later Apr 27, 2025 91:05


First official podcast colob for our new podcast called "Not Sorry" with Cait Brown Breezy introduces her and her boyfriend Ari's new puppy Teddy! Easter update Katy Perry being an idiot thinking going to space would make any of us impressed. Weird things going on in Arizona. Why not wearing "underwears" is the safest option. New England Serial killer. ....And a bunch of other stuff

Le Cancre Pédagogue.
Hors- série: Le Cancre à l'AQUOPS avec Maude Tremblay

Le Cancre Pédagogue.

Play Episode Listen Later Apr 24, 2025 10:49


En collaboration avec Sylvain Duclos, nous discutons de COlab avec Maude Tremblay! Une plongées dans l'univers es STIAM au féminin.

Unpaid And Underrated
102 : Boiling Dirt

Unpaid And Underrated

Play Episode Listen Later Apr 22, 2025 147:11


This week Joey and Keith get to know Big Debo. They dive right into great topics like chemistry, dirt, debo-isms, Power Rangers, and peanut butter. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @coachdebo (https://www.instagram.com/coachdebo) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Debo.

Amazing Teams Podcast
What Are You Doing on the Fifth Day? How COLAB Made the 4-Day Workweek Work

Amazing Teams Podcast

Play Episode Listen Later Apr 22, 2025 38:40


Send us a textIn this episode of the Amazing Teams podcast, we sit down with Morgan Witham, CEO of COLAB, about her journey from investment banking to leading a digital agency and how her team embraced a four-day work week. Morgan shares insights on workplace culture, employee retention, and the power of gratitude in building a thriving company.We dive into:Morgan transitioned from investment banking to CEO of COLAB to help scale the business.The four-day work week was implemented to combat burnout and attract talent.Retention is influenced by overall workplace culture, not just policies.The four-day work week was not a silver bullet for retention issues.Gratitude and recognition are key components of CoLab's culture.Tune in to hear Morgan's leadership insights and how COLAB is redefining modern work culture! Resources:Get to know COLABCOLAB's take on HeyTaco & team cultureConnect with COLAB's fearless leader, Morgan!

UBC News World
Sports & Gaming Industry - Colab Cloud's Cutting-Edge Technology Advantage

UBC News World

Play Episode Listen Later Apr 22, 2025 4:47


Colab Cloud's robust, cloud-based platform aggregates multiple services, enabling streamlined operations and optimized sports experiences. Colab Cloud combines learning modules, secure payments, and AI-based analytics to support Performance Optimization, Athlete Development, and Talent Management. Synergy Global Enterprise LLC City: singapore Address: 111 North Bridge Road #21-01 Website: https://pixelproduction.com/

The co-lab career stories
Claire Steichen - Founder, Clear Strategy Coaching

The co-lab career stories

Play Episode Listen Later Apr 21, 2025 14:47


Claire Steichen founded Clear Strategy Coaching in 2008 to help ambitious mid-career professionals reach their goals, authentically and without burn-out. Claire is the author of “Confidence at Work: The High Achiever's Guide to Navigating Uncertainty”. Using her I to the 4th Power methodology, Claire has trained hundreds of mid-level and senior professionals to build career on their terms. Before becoming a coach, Claire spent two decades in Beauty at L'Oreal, Parfums Christian Dior, and Givaudan.  Since founding Clear Strategy Coaching, her corporate clients have included L'Oreal, Visa, American Express, ESPN, Firmenich, International Flavors & Fragrances, and Omnicom. Claire received her certification through Coach Training Institute and is a Certified Professional Coach with the International Coach Federation.On this episode of the CoLab podcast, Madelyn Ulrich sits down with Claire Steichen, founder of Clear Strategy Coaching, to talk about bouncing back from career setbacks, trusting your intuition, and celebrating small wins. From L'Oréal to launching her own practice, Claire shares the real story behind building confidence and resilience in leadership.

The Ops Experts Club Podcast
62. How to Avoid Death by a Thousand Meetings

The Ops Experts Club Podcast

Play Episode Listen Later Apr 17, 2025 25:51


SUMMARY: In this episode, Aaron and Terryn dive into the all-too-common struggle of "death by meetings" and share actionable strategies to reclaim your time and boost operational efficiency. Fresh off a whirlwind week of coast-to-coast travel and back-to-back meetings, Aaron and Terryn unpack the challenges of meeting overload and offer practical tips for business leaders, visionaries, and ops professionals alike.   From identifying when a meeting is truly necessary to defining clear deliverables and roles, they explore how to streamline communication and avoid the trap of endless, unproductive calls. Learn how the CoLab team leverages tools like Asana, Voxer, and Loom to cut down on unnecessary meetings, delegate effectively, and keep projects moving forward. They also discuss the importance of honoring the chain of command, respecting team downtime, and using structured systems like Level 10 meetings to foster accountability without micromanaging.   Key Takeaways: For Visionaries: Honor the chain of command by funneling ideas through your ops lead or executive assistant to avoid overwhelming your team. For Operators: Strategize rather than execute every task—delegate deliverables to the right team members to prevent burnout. For All Teams: Use automation and async tools (like voice memos or screen recordings) to share updates and reduce meeting fatigue. Pro Tip: Stick to a cadence like weekly Level 10 meetings to ensure accountability and clarity without derailing your team's focus. Tune in to discover how to escape the meeting marathon and create a more productive, intentional workflow for your business. Perfect for anyone looking to optimize operations and make every minute count!   Minute by Minute: 0:00 Introduction 2:09 How does one avoid death by a thousand meetings? 4:55 Who takes away deliverables from meetings? 11:12 How to avoid the meeting all together 16:35 Honor the chain of command 19:18 If you're gonna do a meeting, do it with structure 23:11 Action items from this conversation

Beer Thirty: Craft Brew Stories and Reviews From Northern California
BEER THIRTY: 97.7 The River/Fogbelt Colab - Beer:30 West Coast IPA 2025- 04/18/25

Beer Thirty: Craft Brew Stories and Reviews From Northern California

Play Episode Listen Later Apr 17, 2025 5:36


It ain't just wine country anymore!!  Some of the best craft-brewed beers in the world are right here in the North Bay.  And Danny Wright wants to taste them ALL on Beer Thirty!  With help from the guys at the Sports Meats Beer podcast, catch new episodes on-air every Friday at 8:30am on 97.7 The River! This week we recorded LIVE @ Fogbelt Brewing in Healdsburg to try the new 97.7 The River/Fogbelt Colab  - Beer:30 West Coast IPA 2025!  

colab north bay west coast ipa healdsburg beer thirty sports meats beer
Unpaid And Underrated
101 : We Used to Be a Proper Cült

Unpaid And Underrated

Play Episode Listen Later Apr 15, 2025 153:47


This week Joey and Keith get to know Big Brad (Gary?). They dive right into great topics like bikes, smoking in public, FB marketplace, the weather, and liking instagram stories. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @npc_gary (https://www.instagram.com/npc_gary/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Brad (Gary?).

Wake The Farm Up! - Maintaining Ground
WTFU • Elf Co-Lab Convo • Unheard Archives Collage

Wake The Farm Up! - Maintaining Ground

Play Episode Listen Later Apr 12, 2025 41:14


This is pure Serious and Seriously Fun conversation collage from the WTFU vaults archives of pod squad family contributions and collaborations.  Let's Grow! We want to hear from you! Enjoy this audio adventure! We so into it...Made to listen as an audio show beginning to end. A mixed variety of elf field recordings of Frogs and Birds , as well as fresh cut from the Elfkin ambient flows called SwampBloom, it is free of words and talking. It can be found on the wake the farm up youtube channel Heartbeet Homestead  on Ohio Cannabis (mixed in show) Farm Talk•Dj Sundra Stories of the Lost! (Mixed in Show) Festival AdventuresCompost J on the Bone Broth! (Mixed in Show) Life Style Comedy•SWATAA w/ Chalky Elf (16:30) Musical Interlude•JacoBus Poetics (33:13) Pure Poetic PsyTent Talk•Rowan Green of Symbiotic Forest (3:07) Check it•Aether Elf (Intro to show) Pure ElfinessSubscribe Everywhere Cause thats cool hahaha!check out links to the Council of Counsel:Doctor Bionic • Kalpataru Tree • Dirtwire • Anno Project@wakethefarmup @maintaining_ground_podcast@kastle_369 @ra.feke @alexhillchill @powergurlz_entMateria Medica One Earth Collaborative Luv Locs Experimentthe More you know you---Ask how you could be involved in the show...

Finovate Podcast
EP 253: Walter Mendenhall, Help With My Loan

Finovate Podcast

Play Episode Listen Later Apr 9, 2025 20:09


Former pro athlete and founder of the Male Mogul Initiative Walter Mendenhall on the recent acquisition of Help With My Loan, and the importance of fair access to credit. Detailed Summary: In this episode of the Finovate Podcast, host Greg Palmer talks with Walter Mendenhall, founder of the Male Mogul Initiative, a Chicago-based nonprofit aimed at empowering young men through leadership, entrepreneurship, and workforce development. Walter shares his unique path from the NFL to teaching, and eventually to founding his organization in response to systemic issues he witnessed in his community. What began with a handful of students in 2018 has grown into a large-scale movement impacting over 3,000 young people, producing 200+ jobs, and launching Chicago's first youth business incubator, CoLab. Walter highlights the critical issues his organization tackles—such as lack of access to capital and high rates of unemployment and gun violence among young black men in Chicago. Walter then explains how his personal experience being denied a loan, despite strong financials, sparked a deeper dive into the challenges under-resourced communities face in accessing capital. This led to his obsession with financial education and systemic barriers, eventually informing his academic work and business ventures. He emphasizes how a lack of basic financial knowledge—like understanding loan underwriting criteria—keeps many people from succeeding. His goal is to demystify these processes and educate individuals who are often left behind by traditional financial systems. The conversation turns to the recent acquisition of Help With My Loan, a Finovate alum that uses AI to speed up and streamline the commercial loan process. Walter, along with his partners, brought the company from California to Chicago to expand its reach and use it as a tool to train youth as business loan brokers. With a 95% match rate and 80% faster processing times, the platform aligns perfectly with Male Mogul's mission to increase access to capital. Looking ahead, Walter envisions expanding their lender network, integrating blockchain for community investment, and ultimately creating scalable solutions to close the capital gap nationwide. His closing advice to fintech innovators: focus on solving real problems with passion—opportunities and support will follow. More info: Male Mogul Initiative: https://malemogulinitiative.org/ Help With My Loan: https://helpwithmyloan.com/ Walter Mendenhall IV: https://waltermendenhall.com/ , https://www.linkedin.com/in/walter-mendenhall-iv-55200787/ Greg Palmer: https://www.linkedin.com/in/gregbpalmer/ Finovate: https://www.finovate.com ; https://www.linkedin.com/company/finovate-conference-series/ #fintech #lending #credit #FinancialInclusion #finovate

Unpaid And Underrated
100 : The Silent Partycast

Unpaid And Underrated

Play Episode Listen Later Apr 7, 2025 172:46


This week Joey and Keith get to know The Technical Guy Big Nate. They dive right into great topics like being a dad, Relient K, pooping, and poetry competitions. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @natee561 (https://www.instagram.com/natee561/) His wife's floral account @floraldesignsbykay (https://instagram.com/floraldesignsbykay) or online at floraldesignsbykay.com (https://floraldesignsbykay.com) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Nate.

Unpaid And Underrated
099 : SURPRISES!

Unpaid And Underrated

Play Episode Listen Later Apr 1, 2025 128:44


This week Joey and Keith get to know Big Loren. They dive right into great topics like horse stall mats, country music, posting kids on the internet, and MCing a powerlifting meet. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaidpo-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @lo__koe (https://www.instagram.com/lo__koe/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Loren.

The Ops Experts Club Podcast
59. The Pros and Cons of Member Based Communities

The Ops Experts Club Podcast

Play Episode Listen Later Mar 27, 2025 23:16


SUMMARY: In this episode, Aaron and Terryn dive into the exciting world of working with high-profile clients and the growing trend of branded community apps. The two discuss the evolution of online communities, as Aaron and Terryn explore how visionaries like Jon Acuff, Darius Daniels, Brandon Turner, and Natasha Graziano are leveraging custom apps to connect with their audiences.   The hosts break down the pros and cons of moving from social media platforms like Facebook groups to fully branded community apps, highlighting the benefits of control, push notifications, and seamless upsell opportunities. They discuss popular platforms like Kajabi, Circle, and Mighty Networks, sharing real-world examples from their current builds and offering insights into key features like gamification, tiered memberships, and live engagement tools. Along the way, Aaron and Terryn emphasize the importance of mapping out a clear customer journey to ensure long-term value and retention—whether through live Q&As, coaching, or celebrating member wins.   Packed with practical tips and behind-the-scenes stories, this episode is a must-listen for visionaries and operators looking to build thriving, results-driven communities. Plus, stick around for a shoutout to Winston and Moose, Terryn's furry co-stars of the show! Tune in to discover how to meet your audience where they are and take your community to the next level. Have questions about platforms like Circle, Kajabi, or Mighty Networks? Drop us a message on socials—The CoLab team is here to help you bring your vision to life!   Minute by Minute: 0:00 Introduction 3:27 Working with Michael Junior 5:24 Communities coming together in apps 8:22 The benefits of push notifications in branded apps 10:49 Recurring revenue within your platform 13:13 Gamification and badges 17:07 Continuing to deliver value to your members  

Co-Lab Recordings Podcast - Hosted by Benny Colab
Podcast 076 - Drum and Bass - hosted by Benny Colab

Co-Lab Recordings Podcast - Hosted by Benny Colab

Play Episode Listen Later Mar 26, 2025 74:27


Podcast for the the Co-Lab Recordings stable hosted by label co-owner Benny Colab bringing you the latest and exclusive forthcoming music from Co-Lab, Sumo Beatz, Pure Vibes, Liquid Lab and Calypso Muzak alongside his favourite tracks this month from across Drum and Bass. Tracklist: 1 - Xeonz - Feel Right 2 - Anushka - Overwhelmed (feat. Max Wheeler) [DJ Die Remix] 3 - Verbz & Zar ft Note - All Said & Done 4 - Heist - Herbie 5 - Heist - Courtesan 6 - GLXY & Salo - Love Lost 7 - The Skeptics & Phoebe Train - Never Been About You 8 - Chug - Echos of Yesterday 9 - Motiv & Collette Warren - Cloak & Dagger (Random Movement Remix) 10 - Channell - Like That 11 - Alibi - With You 12 - Workforce - Drowning 13 - Dogger - Natural Living 14 - Need For Mirrors - Water Bending 15 - Joakuim - Soul Jazz 16 - IP - Look Around 17 - Jungle Jim - Uneasy Slumber 18 - Jungle Jim - Edge Of The Clearing 19 - Jungle Jim - Computer Dreads 20 - Alien Perfect - No Piano 21 - DSP - Judgement 22 - Heist - A Bit Dicey 23 - Shimah - Taken 24 - Sustance & Para - Thats Right 25 - Sl8r & Fox - Skull & Crosses 26 - Heist - Octopus 27 - Ky - Less is More 28 - Dunk - Freddy Krueger 29 - Heist - King Of The Warblers 30 - Shimah - Extraterrestrial Origin 31 - Bladerunner & Heist - No Mercy 32 - Jenks - Bullet 33 - Shimah - Telekinesis 34 - Aries & Dj Gaw - Let it Go 35 - Heist - Scabbard 36 - Pola & Bryson & IYAMAH - Want It 37 - Aries - Not So Bad 38 - Shimah - Pressure in The Pipes

dance bass jungle edm drum heist tracklist pipes colab drum and bass ukf drum and bass arena co-lab recordings calypso muzak
Unpaid And Underrated
098 : A Moist Chalk

Unpaid And Underrated

Play Episode Listen Later Mar 25, 2025 144:26


This week Joey and Keith get to know Big Dan. They dive right into great topics like The Eagles, being sick, strongman, warming up, an drumming. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @dan_eager (https://www.instagram.com/dan_eager/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Dan.

Unpaid And Underrated
097 : Blood Up

Unpaid And Underrated

Play Episode Listen Later Mar 18, 2025 113:20


This week Joey and Keith get to know Big Ron. They dive right into great topics like comics, axels, disc golf, metal sub-genres, and the podcast family tree. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @_capt.ron_ (https://www.instagram.com/_capt.ron_/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Ron.

Unpaid And Underrated
096 : Keith Here btw

Unpaid And Underrated

Play Episode Listen Later Mar 11, 2025 104:31


This week Joey and Keith bring on a some Arnold veterans to talk about their weekend at the festival. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest Big Antony on Instagram @antony.martinez14 (https://www.instagram.com/antony.martinez14/) Big Bryce on Instagram @perfectlilsweetiesgym (https://www.instagram.com/perfectlilsweetiesgym/) Big Hogan on Instagram @worldsstrongestpsychologist (https://www.instagram.com/worlds_strongest_psychologist/) Big Katie on Instagram @moorhead_k (https://www.instagram.com/moorhead_k/) Big Tyler on Instagram @tylerthompson1991 (https://www.instagram.com/tylerthompson1991/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guests: Big Antony, Big Bryce, Big Hogan, Big Katie, and Big Tyler.

Beer Thirty: Craft Brew Stories and Reviews From Northern California
BEER THIRTY : Cooperage/Old Caz Colab - Form Like Voltron

Beer Thirty: Craft Brew Stories and Reviews From Northern California

Play Episode Listen Later Mar 6, 2025 6:34


It ain't just wine country anymore!!  Some of the best craft-brewed beers in the world are right here in the North Bay.  And Danny Wright wants to taste them ALL on Beer Thirty!  With help from the guys at the Sports Meats Beer podcast, catch new episodes on-air every Friday at 8:30am on 97.7 The River - This week : Cooperage/Old Caz Colab - Form Like Voltron.

voltron colab north bay cooperage beer thirty sports meats beer
Unpaid And Underrated
095 : All The Salads Were Wet

Unpaid And Underrated

Play Episode Listen Later Mar 4, 2025 123:40


This week Big Jen brings a gaggle of silly hawks together to review some less favorable reviews. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest Big Jen on Instagram @getstrongjen24 (https://www.instagram.com/getstrongjen24/) Big Daniel on Instagram @coach8123 (https://www.instagram.com/coach8123/) Big Hannah on Instagram @h.bohling (https://www.instagram.com/h.bohling/) Big Ryan on Instagram @angrym0nkie (https://www.instagram.com/angrym0nkie/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guests: Big Daniel, Big Hannah, Big Mofo Guy, and Big Ryan.

TechCrunch
Anthropic raises $3.5B to fuel its AI ambitions

TechCrunch

Play Episode Listen Later Mar 4, 2025 7:47


Plus - Google upgrades Colab with an AI agent tool; Reddit co-founder Alexis Ohanian joins Frank McCourt's TikTok bid Learn more about your ad choices. Visit podcastchoices.com/adchoices

Unpaid And Underrated
094 : The #1 podcast about “we really don't know a lot about anything”

Unpaid And Underrated

Play Episode Listen Later Feb 25, 2025 118:42


This week Joey, Keith, and Nate join the ranks of “The Davids” and try to explain the history of many Ünpaid and Ünderrated jokes. They also dive into some great topics like hoodies, being a proper country, what really is a surplus, and PC upgrades. Links LocalSend : Share files to nearby devices. Free, open-source, cross-platform. (https://localsend.org) Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @natee561 (https://www.instagram.com/natee561/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/)

Clarity Generates Confidence
Episode 129: Think Bigger, Collaborate Smarter

Clarity Generates Confidence

Play Episode Listen Later Feb 20, 2025 34:13


What if the key to business growth isn't just working harder—but collaborating smarter? In this episode of Clarity Generates Confidence, we welcome Chad Jenkins, a growth alchemist, best-selling author, and founder of CoLab under SeedSpark, for an insightful conversation on the power of collaboration, unlocking unique value, and thinking bigger in business and entrepreneurship.Drawing from his book Friction Fuel, Chad explores how early life experiences shape entrepreneurial thinking and emphasizes the importance of moving beyond siloed strategies. He challenges entrepreneurs to embrace a collaboration-first growth approach that amplifies impact and drives innovation.Chad also dives into the future of entrepreneurship, advocating for a collaboration-driven business model where success is built on shared knowledge, strategic partnerships, and leveraging industry insights. He challenges listeners to rethink how they combine relationships and expertise to create exponential value in an increasingly interconnected world.

Rock & Roll Happy Hour
Last Call - AleSmith - Coffee & Beer Flat White Blonde Ale

Rock & Roll Happy Hour

Play Episode Listen Later Feb 19, 2025 1:55


Didn't have to wait long to taste what's happening at AleSmith this weekend! Today Kristen and Peter have brought the Colab with Arizona Wilderness Brewing Co & Peixoto Coffee Roasters for a Flat White Inspired Blonde Ale. Using coffee cured with the cherry on gives this Latin American Coffee a subtle sweetness to go with it's light acidicness that is supported by the malt of the Blonde Ale with a little hint of lactose.

Unpaid And Underrated
093 : I've Never Swallowed a Banana

Unpaid And Underrated

Play Episode Listen Later Feb 18, 2025 128:07


This week Joey and Keith get to know Big Antony. They dive right into great topics like what it's like to walk a mile in Keith's shoes, the new drink spotter contest, being an EMT, Rubik's cubes, chess, magic and Aunts. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @antony.martinez14 (https://www.instagram.com/antony.martinez14/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guests: Big Antony and Big Hogan.

Seismic Soundoff
249: Machine Learning Methods in Geoscience

Seismic Soundoff

Play Episode Listen Later Feb 13, 2025 24:14


“The biggest challenge for geophysicists? Learning machine learning's ‘new language' from the world of statistics.” Machine learning is transforming geoscience, and Gerard Schuster explains how. This conversation explores key ML applications in seismic interpretation, the role of convolutional neural networks in fault detection, and why hands-on labs are essential for mastering these techniques. With real-world examples and insights from his new book, Machine Learning Methods in Geoscience, this episode delivers practical knowledge for integrating ML into geophysics. KEY TAKEAWAYS > Why ML matters for geoscientists – The demand for ML skills is growing, and Jerry shares how this shift shapes education and careers. > CNNs in action – Convolutional neural networks are used to detect rock cracks in Saudi Arabia through drone imagery. > Transformers vs. traditional neural networks – Transformers process seismic data differently by capturing long-range dependencies, offering new advantages. NEXT STEP Explore Machine Learning Methods in Geoscience by Gerard Schuster, featuring hands-on MATLAB and Colab labs. Get the book and start applying ML techniques today! https://library.seg.org/doi/epdf/10.1190/1.9781560804048.fm TEXT A FRIEND These are great insights on how ML is actually being used in seismic work, not just theory. https://seg.org/podcasts/episode-249-machine-learning-methods-in-geoscience GUEST BIO Gerard Schuster has an M.S. (1982) and a Ph.D. (1984) from Columbia University and was a postdoctoral researcher there from 1984 to 1985. From 1985 to 2009, he was a professor of geophysics at the University of Utah and became a professor of geophysics at KAUST (2009–2021). He is currently a research professor at the University of Utah. He received several teaching and research awards while at the University of Utah. He was editor of GEOPHYSICS 2004–2005 and was awarded SEG's Virgil Kauffman Gold Medal in 2010 for his work in seismic interferometry. His previous books are Seismic Interferometry (2009, Cambridge Press) and Seismic Inversion (2017, SEG). LINKS * Buy the Print Book at https://seg.org/shop/product/?id=fe5a3cd3-77b2-ef11-b8e8-6045bda82e05 * Visit https://seg.org/podcasts/episode-249-machine-learning-methods-in-geoscience for the full guest bios and show notes. CALL FOR ABSTRACTS Technical Program Chairs Yingcai Zheng and Molly Turko invite you to submit your best work. This year, we're fostering deeper collaboration between SEG, AAPG, and SEPM. Focus on regional challenges and how integrated geoscience can unlock solutions. Submit short or expanded abstracts for oral and poster presentations. The Call for Abstracts is open and closes on 15 March at 5:00 PM CT. Don't miss this opportunity to share your research and connect with the broader geoscience community at https://www.imageevent.org/. SHOW CREDITS Andrew Geary at TreasureMint hosted, edited, and produced this episode. The SEG podcast team comprises Jennifer Cobb, Kathy Gamble, and Ally McGinnis. If you have episode ideas or feedback for the show or want to sponsor a future episode, email the show at podcast@seg.org.

Unpaid And Underrated
092 : South Dacotta

Unpaid And Underrated

Play Episode Listen Later Feb 11, 2025 132:11


This week Joey and Keith get to know Big Kurt. They dive right into great topics like Facebook Marketplace finds, the royal rumble, teaching, coaching, the blue book, hot peppers, and what the real backlog is. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @kdubs_82 (https://www.instagram.com/kdubs_82/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Kurt.

PDR College podcast- Paintless Dent Repair / Removal Business and Marketing
PDRC 2025 Episode 1 Glue Pulling Evolution w Gene Fetty (CoLab w the All Access Auto Apearance Institute Podcast)

PDR College podcast- Paintless Dent Repair / Removal Business and Marketing

Play Episode Listen Later Feb 6, 2025 80:09


Join Gene Fetty & Keith to talk about innovation and history in Glue Pulling for PDR and Collision repair! Check out the Automotive Appearance Institute (AAI) HERE   Save the date October 9-12 for the PDR College Advanced Skills Seminar 2025 @ Anson PDR, in Burelson TX!

SOMAPSO Pod
SOMAPSO Pod - Week of Feb 6, 2025

SOMAPSO Pod

Play Episode Listen Later Feb 6, 2025 24:04


We rewind to deliciousness at The Corner Slice, Lum's Cellars, trivia night, and poetry class at The Write Space.We're looking forward to the fundraiser for the LA fires at Maplewood Mercantile, pop-ups at the Co-Lab with Stupid Sloth Cards, the Charmery, the Local D, and fairy hair, two craft chocolate tastings, a film screening of One of Them Days celebrating SZA, karaoke at Pickett's, a kids sewing workshop.Three Things with Rent Party Garden, Maplewoodstock band applications, tax assistance, and General Store Shops and Cafe.Do you like us? Check this box.LINKS:Elks Rent Party Garden Named South Orange Villager of the MonthMaplewoodstock band applicationsValentine's Day Pretzel preorderKing Cake preorderTax filing assistanceTax filing assistance for seniors

Unpaid And Underrated
091 : Expesso Grest

Unpaid And Underrated

Play Episode Listen Later Feb 4, 2025 143:05


This week Joey and Keith get to know Big Nate. They dive right into great topics like what shirts mean, birthdays, sky diving, broken bones, the blue book, and maps. Links Donate to the OAR Foundation (https://www.operationalliesrefugefoundation.org/donate) Donate to the Appalachia Service Project (https://asphome.org/) Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @nate.verde (https://www.instagram.com/nate.verde/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guests: Big Carp, Big Kris, and Big Nate.

The Ops Experts Club Podcast
51. Keeping Work Engaging and Navigating Effective Recruiting in Operations

The Ops Experts Club Podcast

Play Episode Listen Later Jan 30, 2025 25:20


SUMMARY: In this episode of The Ops Experts Club Podcast, we tackle two key aspects of running a successful operation: staying engaged in your role and the art of effective recruiting. Join us as we discuss strategies to maintain excitement in your work by delegating effectively, creating space for new challenges, and fostering growth opportunities within your team. We also take a deep dive into recruiting—exploring best practices, the nuances of hiring overseas, and how to honor applicants throughout the process. From sifting through resumes to conducting meaningful interviews and leveraging tools like personality assessments, we break down what it takes to find the right fit. If you've ever wondered how to recruit top talent without overpaying or how to stay inspired in your day-to-day operations, this episode is packed with actionable insights you won't want to miss.   Minute by Minute: 0:00 Introduction 3:48 The trick to staying engaged in your ops work 6:51 The CoLab method of recruiting 12:56 “We'll just hire a VA to do that” 17:05 Things to look for in VA searches

Unpaid And Underrated
090 : Odde Hot Arena

Unpaid And Underrated

Play Episode Listen Later Jan 28, 2025 115:51


This week Joey and Keith get to know Big Tommy Stapler. They dive right into great topics like "What does the Costco logo look like?", facebook groups, metal, wrestling, and hairlines. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @tommy_schneider22 (https://www.instagram.com/tommy_schneider22/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Tommy Stapler.

SPED Homeschool Conversations
Playing to Thrive: Building Essential Skills for Today's World

SPED Homeschool Conversations

Play Episode Listen Later Jan 28, 2025 57:34


Could play be the key to helping homeschoolers thrive? In this episode of Empowering Homeschool Conversations, host Peggy Ployhar is joined by Alan Tang, founder of CoLab, an innovative online program that uses games, discussions, and escape rooms to equip kids with essential skills for the modern world. Alan’s journey—from corporate finance and culinary arts to education—has inspired his unique approach to engaging, hands-on learning. Tune in as we dive into how play-based learning can empower homeschool families to prepare their children for lifelong success. To connect with Alan and his resources, use this link: https://www.collaborationlaboratory.com/ and to see testimonials from his students visit: https://youtu.be/b_LaBT31moQ Viewers like you funded similar episodes, and other free resources from SPED Homeschool. To learn how you can support the nonprofit work of SPED Homeschool and this broadcast, visit https://spedhomeschool.com/donate/ To find out more about SPED Homeschool, visit our website at https://spedhomeschool.com/ To learn about the other Empowering Homeschool Conversations Co-Hosts and their resources, visit: https://annieyorty.com/ https://www.leilanimelendez.com/ https://elarplearning.com/ https://solimaracademy.com/ Join our mission to empower homeschool families! https://spedhomeschool.com/donate/Join our mission to empower homeschool families!: https://spedhomeschool.com/donate/ Discover more Christian podcasts at lifeaudio.com and inquire about advertising opportunities at lifeaudio.com/contact-us.

Unpaid And Underrated
089 : Animal Balloon Pen…

Unpaid And Underrated

Play Episode Listen Later Jan 21, 2025 109:15


This week Joey and Keith get to know Big Megan. They dive right into great topics like lifting at massenomics gym, reading, rap and hip/hop, tattoos, barbells, and a whole lot of blackberry. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @megantheequarterhorse (https://www.instagram.com/megantheequarterhorse/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Megan.

It's No Fluke
E126: Why The Colab Built The Anti-Agency

It's No Fluke

Play Episode Listen Later Jan 21, 2025 51:01


Ashley Mann is a seasoned communications and PR professional with 15 years of expertise in B2B and B2C technology. Her experience spans both agency and startup environments, where she has honed her approach to strategic communications. As co-founder of The Colab, she partners directly with high-growth companies to uncover and amplify their core mission. Ashley brings a unique perspective shaped by extensive experience in consumer and technology sectors, focusing on delivering impactful, mission-driven communications strategies. Lizzy Harris is a co-founder and Partner at The Colab, the leading anti-agency for B2B and B2C tech companies. Having worked at agencies and in-house over the last 15 years, Lizzy built The Colab with a vision of turning the traditional agency model on its head, with a hyper focus on transparency, communication, and overexceeding metrics. She is passionate about forging authentic partnerships with clients, helping them define their narrative, and driving measurable business outcomes through tailored strategies.

Unpaid And Underrated
088 : You're Gonna Need a Thesaurus Bud

Unpaid And Underrated

Play Episode Listen Later Jan 14, 2025 120:30


This week Joey and Keith get to know Big Garrett. They dive right into great topics like collars, engagements, Diet Dew, Facebook groups, piano, and home gyms. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Get Your Own Keith Head (https://www.unpaidinternpodcast.com/articles/keith-head) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @garrett.cscs77 (https://www.instagram.com/garrett.cscs77/) or @gcstrengthsystems (https://www.instagram.com/gc_strength_systems/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Garrett.

Software Engineering Daily
AI Developer Tools at Google with Paige Bailey

Software Engineering Daily

Play Episode Listen Later Jan 9, 2025 37:28


Over the years, Google has released a variety of ML, data science, and AI developer tools and platforms. Prominent examples include Colab, Kaggle, AI Studio, and the Gemini API. Paige Bailey is the Uber Technical Lead of the Developer Relations team at Google ML Developer Tools, working on Gemini APIs, Gemma, AI Studio, Kaggle, Colab The post AI Developer Tools at Google with Paige Bailey appeared first on Software Engineering Daily.

Unpaid And Underrated
087 : Curl in the Squat Rack

Unpaid And Underrated

Play Episode Listen Later Jan 7, 2025 105:03


This week Joey and Keith get to know Big Katie. They dive right into great topics like, the progression of time, hops, mascots, James Bond, and if Nate is real or not. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest Follow Katie on Instagram @moorhead_k (https://www.instagram.com/moorhead_k/) Follow her Massenomics Certified Traning Facility @npz_strongman (https://www.instagram.com/npz_strongman) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Katie.

Unpaid And Underrated
086 : Maple Syrup and Piss

Unpaid And Underrated

Play Episode Listen Later Dec 31, 2024 118:27


This week Joey and Keith get to know Big Jonathan Oldham. They talk about some heavy stuff as well as digging into some great topics like being a kicker and getting into rugby, plants, heavy music, and Keith's cannonball experience. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest On Instagram @baconandbeerbells (https://www.instagram.com/baconandbeerbells/) Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big John.

Unpaid And Underrated
085 : LOL OUT LOUD

Unpaid And Underrated

Play Episode Listen Later Dec 24, 2024 137:32


This week, Joey and Keith sit down with Big Kim from Barbell Rescue for a fun and insightful conversation. They revisit their bold predictions for 2024, make fresh ones for 2025, and dive into a mix of entertaining topics—including Modelos, internet friendships, western shows and movies, cheerleading, and, of course, the story behind creating the ultimate barbell cleaning tool: the Barbell Rescue Brush. Links Massenomics x Ünpaid and Ünderrated Colab (https://www.massenomics.com/shop/unpaid-underrated-tee) Follow The Podcast On Instagram @unpaid.underrated.podcast (https://www.instagram.com/unpaid.underrated.podcast/) Online UnpaidInternPodcast.com (https://www.unpaidinternpodcast.com/) On Youtube @Unpaid.Underrated.Podcast (https://www.youtube.com/@Unpaid.Underrated.Podcast) Our Guest Follow Barbell Rescue on Instagram @barbellrescue1 (https://www.instagram.com/barbellrescue1/) Follow Kim on Instagram @friesenka (https://www.instagram.com/friesenka/) Follow Kim's Lifing on Instagram @freezn_fitnes (https://www.instagram.com/freezn_fitness/) Visit BarbellRescue.com (https://barbellrescue.com) and Use Code "UNPAID" Our Hosts @keithhoneycutt73 (https://www.instagram.com/keithhoneycutt73/) or his orange gym, @thenowhinecellar (https://www.instagram.com/thenowhinecellar/) @joey_mleczko (https://www.instagram.com/joey_mleczko/) Special Guest: Big Kim.

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0
2024 in Post-Transformers Architectures (State Space Models, RWKV) [LS Live @ NeurIPS]

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Dec 24, 2024 43:02


Happy holidays! We'll be sharing snippets from Latent Space LIVE! through the break bringing you the best of 2024! We want to express our deepest appreciation to event sponsors AWS, Daylight Computer, Thoth.ai, StrongCompute, Notable Capital, and most of all all our LS supporters who helped fund the gorgeous venue and A/V production!For NeurIPS last year we did our standard conference podcast coverage interviewing selected papers (that we have now also done for ICLR and ICML), however we felt that we could be doing more to help AI Engineers 1) get more industry-relevant content, and 2) recap 2024 year in review from experts. As a result, we organized the first Latent Space LIVE!, our first in person miniconference, at NeurIPS 2024 in Vancouver.Of perennial interest, particularly at academic conferences, is scaled-up architecture research as people hunt for the next Attention Is All You Need. We have many names for them: “efficient models”, “retentive networks”, “subquadratic attention” or “linear attention” but some of them don't even have any lineage with attention - one of the best papers of this NeurIPS was Sepp Hochreiter's xLSTM, which has a particularly poetic significance as one of the creators of the LSTM returning to update and challenge the OG language model architecture:So, for lack of a better term, we decided to call this segment “the State of Post-Transformers” and fortunately everyone rolled with it.We are fortunate to have two powerful friends of the pod to give us an update here:* Together AI: with CEO Vipul Ved Prakash and CTO Ce Zhang joining us to talk about how they are building Together together as a quote unquote full stack AI startup, from the lowest level kernel and systems programming to the highest level mathematical abstractions driving new model architectures and inference algorithms, with notable industry contributions from RedPajama v2, Flash Attention 3, Mamba 2, Mixture of Agents, BASED, Sequoia, Evo, Dragonfly, Dan Fu's ThunderKittens and many more research projects this year* Recursal AI: with CEO Eugene Cheah who has helped lead the independent RWKV project while also running Featherless AI. This year, the team has shipped RWKV v5, codenamed Eagle, to 1.5 billion Windows 10 and Windows 11 machines worldwide, to support Microsoft's on-device, energy-usage-sensitive Windows Copilot usecases, and has launched the first updates on RWKV v6, codenamed Finch and GoldFinch. On the morning of Latent Space Live, they also announced QRWKV6, a Qwen 32B model modified with RWKV linear attention layers. We were looking to host a debate between our speakers, but given that both of them were working on post-transformers alternativesFull Talk on YoutubePlease like and subscribe!LinksAll the models and papers they picked:* Earlier Cited Work* Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention* Hungry hungry hippos: Towards language modeling with state space models* Hyena hierarchy: Towards larger convolutional language models* Mamba: Linear-Time Sequence Modeling with Selective State Spaces* S4: Efficiently Modeling Long Sequences with Structured State Spaces* Just Read Twice (Arora et al)* Recurrent large language models that compete with Transformers in language modeling perplexity are emerging at a rapid rate (e.g., Mamba, RWKV). Excitingly, these architectures use a constant amount of memory during inference. However, due to the limited memory, recurrent LMs cannot recall and use all the information in long contexts leading to brittle in-context learning (ICL) quality. A key challenge for efficient LMs is selecting what information to store versus discard. In this work, we observe the order in which information is shown to the LM impacts the selection difficulty. * To formalize this, we show that the hardness of information recall reduces to the hardness of a problem called set disjointness (SD), a quintessential problem in communication complexity that requires a streaming algorithm (e.g., recurrent model) to decide whether inputted sets are disjoint. We empirically and theoretically show that the recurrent memory required to solve SD changes with set order, i.e., whether the smaller set appears first in-context. * Our analysis suggests, to mitigate the reliance on data order, we can put information in the right order in-context or process prompts non-causally. Towards that end, we propose: (1) JRT-Prompt, where context gets repeated multiple times in the prompt, effectively showing the model all data orders. This gives 11.0±1.3 points of improvement, averaged across 16 recurrent LMs and the 6 ICL tasks, with 11.9× higher throughput than FlashAttention-2 for generation prefill (length 32k, batch size 16, NVidia H100). We then propose (2) JRT-RNN, which uses non-causal prefix-linear-attention to process prompts and provides 99% of Transformer quality at 360M params., 30B tokens and 96% at 1.3B params., 50B tokens on average across the tasks, with 19.2× higher throughput for prefill than FA2.* Jamba: A 52B Hybrid Transformer-Mamba Language Model* We present Jamba, a new base large language model based on a novel hybrid Transformer-Mamba mixture-of-experts (MoE) architecture. * Specifically, Jamba interleaves blocks of Transformer and Mamba layers, enjoying the benefits of both model families. MoE is added in some of these layers to increase model capacity while keeping active parameter usage manageable. * This flexible architecture allows resource- and objective-specific configurations. In the particular configuration we have implemented, we end up with a powerful model that fits in a single 80GB GPU.* Built at large scale, Jamba provides high throughput and small memory footprint compared to vanilla Transformers, and at the same time state-of-the-art performance on standard language model benchmarks and long-context evaluations. Remarkably, the model presents strong results for up to 256K tokens context length. * We study various architectural decisions, such as how to combine Transformer and Mamba layers, and how to mix experts, and show that some of them are crucial in large scale modeling. We also describe several interesting properties of these architectures which the training and evaluation of Jamba have revealed, and plan to release checkpoints from various ablation runs, to encourage further exploration of this novel architecture. We make the weights of our implementation of Jamba publicly available under a permissive license.* SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers* We introduce Sana, a text-to-image framework that can efficiently generate images up to 4096×4096 resolution. Sana can synthesize high-resolution, high-quality images with strong text-image alignment at a remarkably fast speed, deployable on laptop GPU. Core designs include: * (1) Deep compression autoencoder: unlike traditional AEs, which compress images only 8×, we trained an AE that can compress images 32×, effectively reducing the number of latent tokens. * (2) Linear DiT: we replace all vanilla attention in DiT with linear attention, which is more efficient at high resolutions without sacrificing quality. * (3) Decoder-only text encoder: we replaced T5 with modern decoder-only small LLM as the text encoder and designed complex human instruction with in-context learning to enhance the image-text alignment. * (4) Efficient training and sampling: we propose Flow-DPM-Solver to reduce sampling steps, with efficient caption labeling and selection to accelerate convergence. * As a result, Sana-0.6B is very competitive with modern giant diffusion model (e.g. Flux-12B), being 20 times smaller and 100+ times faster in measured throughput. Moreover, Sana-0.6B can be deployed on a 16GB laptop GPU, taking less than 1 second to generate a 1024×1024 resolution image. Sana enables content creation at low cost. * RWKV: Reinventing RNNs for the Transformer Era* Transformers have revolutionized almost all natural language processing (NLP) tasks but suffer from memory and computational complexity that scales quadratically with sequence length. In contrast, recurrent neural networks (RNNs) exhibit linear scaling in memory and computational requirements but struggle to match the same performance as Transformers due to limitations in parallelization and scalability. * We propose a novel model architecture, Receptance Weighted Key Value (RWKV), that combines the efficient parallelizable training of transformers with the efficient inference of RNNs.* Our approach leverages a linear attention mechanism and allows us to formulate the model as either a Transformer or an RNN, thus parallelizing computations during training and maintains constant computational and memory complexity during inference. * We scale our models as large as 14 billion parameters, by far the largest dense RNN ever trained, and find RWKV performs on par with similarly sized Transformers, suggesting future work can leverage this architecture to create more efficient models. This work presents a significant step towards reconciling trade-offs between computational efficiency and model performance in sequence processing tasks.* LoLCATs: On Low-Rank Linearizing of Large Language Models* Recent works show we can linearize large language models (LLMs) -- swapping the quadratic attentions of popular Transformer-based LLMs with subquadratic analogs, such as linear attention -- avoiding the expensive pretraining costs. However, linearizing LLMs often significantly degrades model quality, still requires training over billions of tokens, and remains limited to smaller 1.3B to 7B LLMs. * We thus propose Low-rank Linear Conversion via Attention Transfer (LoLCATs), a simple two-step method that improves LLM linearizing quality with orders of magnitudes less memory and compute. * We base these steps on two findings. * First, we can replace an LLM's softmax attentions with closely-approximating linear attentions, simply by training the linear attentions to match their softmax counterparts with an output MSE loss ("attention transfer").* Then, this enables adjusting for approximation errors and recovering LLM quality simply with low-rank adaptation (LoRA). * LoLCATs significantly improves linearizing quality, training efficiency, and scalability. We significantly reduce the linearizing quality gap and produce state-of-the-art subquadratic LLMs from Llama 3 8B and Mistral 7B v0.1, leading to 20+ points of improvement on 5-shot MMLU. * Furthermore, LoLCATs does so with only 0.2% of past methods' model parameters and 0.4% of their training tokens. * Finally, we apply LoLCATs to create the first linearized 70B and 405B LLMs (50x larger than prior work). * When compared with prior approaches under the same compute budgets, LoLCATs significantly improves linearizing quality, closing the gap between linearized and original Llama 3.1 70B and 405B LLMs by 77.8% and 78.1% on 5-shot MMLU.Timestamps* [00:02:27] Intros* [00:03:16] Why Scale Context Lengths? or work on Efficient Models* [00:06:07] The Story of SSMs* [00:09:33] Idea 1: Approximation -> Principled Modeling* [00:12:14] Idea 3: Selection* [00:15:07] Just Read Twice* [00:16:51] Idea 4: Test Time Compute* [00:17:32] Idea 2: Hardware & Kernel Support* [00:19:49] RWKV vs SSMs* [00:24:24] RWKV Arch* [00:26:15] QWRKWv6 launch* [00:30:00] What's next* [00:33:21] Hot Takes - does anyone really need long context?Transcript[00:00:00] AI Charlie: We're back at Latent Space Live, our first mini conference held at NeurIPS 2024 in Vancouver. This is Charlie, your AI co host. As a special treat this week, we're recapping the best of 2024 going domain by domain. We sent out a survey to the over 900 of you who told us what you wanted, and then invited the best speakers in the Latent Space Network to cover each field.[00:00:24] AI Charlie: 200 of you joined us in person throughout the day, with over 2200 watching live online. Thanks Our next keynote covers the State of Transformers alternative architectures, with a special joint presentation with Dan Fu of Together AI and Eugene Chia of Recursal AI and Featherless AI. We've featured both Together and Recursal on the pod before, with CEO Veepal Vedprakash introducing them.[00:00:49] AI Charlie: And CTO CE Zhang joining us to talk about how they are building together together as a quote unquote full stack AI startup from the lowest level kernel and systems [00:01:00] programming to the highest level mathematical abstractions driving new model architectures and inference algorithms with notable industry contributions from Red Pajama V2, Flash Attention 3, Mamba 2, Mixture of Agents.[00:01:15] AI Charlie: Based, Sequoia, Evo, Dragonfly, Danfoo's Thunder Kittens, and many more research projects this year. As for Recursal and Featherless, we were the first podcast to feature RWKV last year, and this year the team has shipped RWKV v5, codenamed Eagle, to 1. 5 billion Windows 10 and Windows 11 machines worldwide to support Microsoft's on device, end Energy Usage Sensitive Windows Copilot Use Cases and has launched the first updates on RWKV v6, codenamed Finch and Goldfinch.[00:01:53] AI Charlie: On the morning of Latent Space Live, they also announced QRdata UKv6, a QEN32B model [00:02:00] modified with RDWKV linear attention layers. Eugene has also written the most single most popular guest post on the Latent Space blog this year. Yes, we do take guest posts on what he has discovered about the H100 GPU inference NeoCloud market since the successful launch of Featherless AI this year.[00:02:20] AI Charlie: As always, don't forget to check the show notes for the YouTube link to their talk as well as their slides. Watch out and take care.[00:02:27] Intros[00:02:27] Dan Fu: Yeah, so thanks so much for having us. So this is going to be a little bit of a two part presentation. My name is Dan. I'm at Together AI, and I'll be joining UCSD as faculty in about a year. And Eugene, you want to introduce yourself?[00:02:46] Eugene Cheah: Eugene, I lead the art activity team, and I, I'm CEO of Featherless, and we both work on this new post transformer architecture space.[00:02:55] Dan Fu: Yeah, so yeah, so today we're really excited to talk to you a little bit [00:03:00] about that. So first I'm going to give a broad overview of kind of the last few years of progress in non post transformer architectures. And then afterwards Eugene will tell us a little bit about the latest and the greatest and the latest frontier models in this space.[00:03:16] Why Scale Context Lengths? or work on Efficient Models[00:03:16] Dan Fu: So, the story starts with Scaling. So this is probably a figure or something like this that you've seen very recently. Over the last five to six years, we've seen models really scale up in parameter size, and that's brought with it a bunch of new capabilities, like the ability to talk to you and tell you sometimes how to use your Colab screens.[00:03:35] Dan Fu: But another place where we've seen scaling especially recently is scaling in context length. So this can mean Having more text inputs for your models, but it can also mean things like taking a lot of visual token inputs image inputs to your models or generating lots of outputs. And one thing that's been really exciting over the last few months or so is that we're, we're seeing scaling, not only during training time, but also [00:04:00] during test time.[00:04:00] Dan Fu: So this is one of the, the, this is the iconic image from the OpenAI 01 release. Not only are we starting to scale train time compute, but we're also starting to scale test time compute. Now if you're familiar with our attention and our transformer architectures today, this graph on the right might look a little bit scary.[00:04:19] Dan Fu: And one of the reasons is that the implications are a little bit Interesting. So what does it mean if we want to continue having smarter and smarter models? Do we just need to start building bigger, bigger data centers, spending more flops? Is this this little Dolly 3, we need more flops, guys? Is this going to be the future of all of AI?[00:04:39] Dan Fu: Or is there a better way, another path forward? Maybe we can get the same capabilities that we've gotten used to, But for a lot less compute, a lot less flops. And one of the things that we're going to talk about today is specifically looking at that core attention operator in some of these models.[00:04:57] Dan Fu: And the reason is that so this is just some, some [00:05:00] basic you know, scaling curves, but attention has compute that scales quadratically in the context length. So that means that if you're doing something like test time compute and you want to spend a bunch of tokens thinking about what comes next, the longer that that goes the, the, the more tokens you spend on that, that compute grows quadratically in that.[00:05:19] Dan Fu: One of the questions that we're interested in is, can we take that basic sequence model, that basic sequence primitive at the bottom, and get it to scale better? Can we scale in, let's say, n to the 3 halves or n log n? So in, in the first part of the talk, so we just went over the introduction. What I'm gonna do over the next few slides is just talk about some of the key advances and ideas that have shown over the past few years since maybe early 2020 to, to now that shown promise that this might actually be possible.[00:05:48] Dan Fu: That you can actually get potentially the same quality that we want while scale, while scaling better. So to do that, we're and, and basically the, the story that we're gonna look is we're gonna start to see [00:06:00] how. So this is a basic graph of just the past couple years of progress of perplexity where that blue line, that dotted blue line, is attention.[00:06:07] The Story of SSMs[00:06:07] Dan Fu: It's your basic transformer, full dense attention. And then the dots coming down are some of the methods that you'll see in this presentation today. We're going to turn the clock back all the way to 2020. So this, this, this question of can we make attention subquadratic? Basically, as soon as we said attention is all you need, People started asking this question.[00:06:28] Dan Fu: So we have this quadratic attention operator. Can we do better? I'll briefly talk about why attention is quadratic. And the basic thing that happens, if you're not familiar, is that you have these inputs, these keys and queries. And what you do in this attention matrix, this S matrix over here, is that you're using, you're comparing every token in your input to every other token.[00:06:49] Dan Fu: So when I try to do something like upload a whole book to Gemini, what happens beyond the Maybe not Gemini, because we don't necessarily know what architecture is. But let's say we upload it to LLAMA, what happens beyond [00:07:00] the scenes, behind the scenes, is that it's going to take every single word in that book and compare it to every other word.[00:07:05] Dan Fu: And this has been a really, it's, it's led to some pretty impressive things. But it's kind of a brute forcing of the way that you would try to interpret a interpret something. And what attention does in particular is the, and then what attention, sorry, don't want to. Okay, no, no laser pointer. What, what attention does afterwards is that instead of always operating in this quadratic thing, it takes a row wise softmax over this matrix, and then multiplies it by this values matrix.[00:07:32] Dan Fu: So, one of the key points to notice is that the output size is always going to be the same as the inputs, at least in standard self attention. So one of the first things that folks tried to do around 2020 is this thing called linear attention, which is just, just noticing that if we take out this softmax from here, if we take out this non linearity in the middle of the attention operation, and then if you compute the keys and the values operation first, you actually never hit this quadratic bottleneck.[00:07:57] Dan Fu: So that, that's potentially a way [00:08:00] to get a lot more computationally efficient. And there are various ways to do this by basically using feature maps or try to approximate this overall attention computation. But some of this work sort of started to hit a wall in 2020. And the basic challenges were, were two.[00:08:16] Dan Fu: So one was quality. It was back then, it was kind of hard to, to get good quality with these linear attention operators. The other one was actually hardware efficiency. So these, this feature map that was just shown by a simplify simplify here. Actually ends up being quite computationally expensive if you just implement it naively.[00:08:34] Dan Fu: So you started having these operators that not only were you sure, you're not really sure if they have the same quality, but also they're actually just wall clock slower. So you kind of end up getting the worst of both worlds. So this was the the stage. So that kind of sets the stage for four years ago.[00:08:49] Dan Fu: Keep this in mind because linear attention is actually going to come back in a few years once we have a better understanding. But one of the works that started kicking off this, this [00:09:00] mini revolution in post transformer architectures was this idea called states based model. So here the seminal work is, is one about our work queue in 2022.[00:09:09] Dan Fu: And this, this piece of work really brought together a few ideas from, from some long running research research lines of work. The first one was, and this is really one of the keys to, to closing the gap in quality was just using things that, that if you talk to a, a, an electrical engineer off the street, they might know off, off the, like the back of their hand.[00:09:33] Idea 1: Approximation -> Principled Modeling[00:09:33] Dan Fu: But taking some of those properties with how we model dynamical systems in signal processing and then using those ideas to model the inputs, the, the text tokens in, for example a transformer like Next Token Prediction Architecture. So some of those early states-based model papers were looking at this relatively, relatively simple recurrent update model that comes from maybe chapter one of a signal processing class.[00:09:59] Dan Fu: But then using [00:10:00] some principle theory about how you should do that recurrent update in order to really get the most that you can out of your hidden state, out of your out of your sequence. So that, that was one key idea for quality and. When this was eventually realized, you started to see a bunch of benchmarks that were pretty sticky for a few years.[00:10:20] Dan Fu: Things like long range arena, some long sequence evaluation benchmarks, There was stuff in time series, time series analysis. They started to, you started to see the quality tick up in meaningful ways. But the other key thing that What's so influential about these states based models is that they also had a key idea about how you can compute these things efficiently.[00:10:45] Dan Fu: So if you go back to your machine learning 101 class where you learned about RNNs, one thing that you may have learned is that they don't paralyze as well as detention, because if you just run them naively, you have to do this kind of sequential update to process new tokens, [00:11:00] whereas in attention, you can process all the tokens in parallel at one time.[00:11:04] Dan Fu: One of the key insights behind the S4 paper was that these recurrent models, you could take them and you could also formulate them as a convolution. And in particular, with a convolution, you could, instead of using a PyTorch conv1d operation, you can compute that with the FFT. And that would give you n log n compute in the in the sequence length n with an operator that was relatively well optimized for modern hardware.[00:11:28] Dan Fu: So those are really, I'd say, the two key ideas in 2022 that started allowing these breakthroughs to happen in these non transformer architectures. So, these ideas about how to principally model sorry, how to model the recurrent updates of a mo of, of a sequence in a principled way, and also these key ideas in how you can compute it efficiently by turning it into a convolution and then scaling it up with the FFT.[00:11:53] Dan Fu: Along those same lines, so afterwards we started putting out some work on specialized kernels, so just [00:12:00] like we have flash attention for transformers, we also have works like flash fft conf, and if you look at these lines of work oftentimes when, whenever you see a new architecture, you see a new primitive one of the, one of the table stakes now is, do you have an efficient kernel so that you can actually get wall clock speed up?[00:12:14] Idea 3: Selection[00:12:14] Dan Fu: So by 2022, We are starting to have these models that had promising quality primitives, but and, and also promising wall clocks. So you could actually see regimes where they were better than transformers in meaningful ways. That being said, there were, there's still sometimes a quality gap, particularly for language modeling.[00:12:33] Dan Fu: And because languages, It's so core to what we do in sequence modeling these days the, the next, the next key idea that I'm going to talk about is this idea of selection mechanisms. And this is basically an idea of, so you have this recurrent state that you're keeping around that just summarizes everything that, that came before.[00:12:50] Dan Fu: And to get a good sequence model, one of the things that you really need to be able to do is have the model learn what's the best way to pick out pieces from that recurrent [00:13:00] state. So one of the, one of the major ideas here in a line of work called H3, Hungry Hungry Hippos, and also these hyena models were One way you can do this is by just adding some simple element wise gates.[00:13:13] Dan Fu: So versions of these ideas have been around for decades. If you squint at the LSTM paper you, you can probably find, find this gating mechanism. But turns out you can take those old ideas, add them into these new. state space models, and then you can see quality start to pick up. If you've heard of the Mamba model, this also takes the selection to the next level by actually making some changes in that fundamental recurrent state space.[00:13:40] Dan Fu: So, it's not only just this gating that happens around the SSM layer, but also you can actually make The ABCD matrices of your state space model, you can make them data dependent, which will allow you to even better select out different pieces from your hidden state depending on what you're seeing. I'll also point out if you look at the [00:14:00] bottom right of this figure, there's this little triangle with a GPU SRAM, GPU HBM, and this, this is just continuing that trend of when you have a new architecture you, you, you also release it with a kernel to, to, to show that it is hardware efficient, that it, that it can be hardware efficient on modern hardware.[00:14:17] Dan Fu: The, the, one of the next cool things that happened is once we had this understanding of these are the basic pieces, these are the basic principles behind some of the sequence models linear attention actually started to come back. So in earlier this year, there was a model called BASED the, from Simran Arora and, and some other folks, that combined a more principled version of linear attention that basically the, the, the, the two second summary is that it used a Taylor approximation of the softmax attention, combined that with a simple sliding window attention and was starting to able, starting to be able to expand the Pareto frontier of how much data can you recall from your sequence, versus how small is your recurrent state size.[00:14:58] Dan Fu: So those orange dots [00:15:00] are, at the top there, are just showing smaller sequences that can recall more memory.[00:15:07] Just Read Twice[00:15:07] Dan Fu: And the last major idea I think that has been influential in this line of work and is very relatively late breaking just a few months ago, is just the basic idea that when you have these models that are fundamentally more efficient in the sequence length, you maybe don't want to prompt them or use them in exactly the same way.[00:15:26] Dan Fu: So this was a really cool paper called Just Read Twice, also from Simran. That basically said, hey, all these efficient models can process tokens so much more efficiently than transformers that they can sometimes have unfair advantages compared to a simple transformer token. So, or sorry, a simple transformer model.[00:15:44] Dan Fu: So take, for example the standard, the standard use case of you have some long document, you're going to pass it in as input, and then you're going to ask some question about it. One problem you might imagine for a recurrent model where you have a fixed state size is, let's say that [00:16:00] you're. Article is very long, and you're trying to ask about some really niche thing.[00:16:04] Dan Fu: You can imagine it might be hard for the model to know ahead of time what information to put into the hidden state. But these, these, these models are so much more efficient that you can do something really stupid, like, you can just put the document write down the document, write down the question, write down the document again, and then write down the question again, and then this time, the second time that you go over that document, you know exactly what to look for.[00:16:25] Dan Fu: And the cool thing about this is, so this is, And this this results in better quality, especially on these recall intensive tasks. But the other interesting thing is it really takes advantage of the more efficient architectures that, that we're having here. So one of the other, I think, influential ideas in this line of work is if you change the fundamental compute capabilities of your model and the way that it scales, you can actually start to query it at test time differently.[00:16:51] Idea 4: Test Time Compute[00:16:51] Dan Fu: And this actually, of course, goes back to those slides on test time compute. So while everybody's looking at, say, test time compute for big transformer models, [00:17:00] I think potentially a really interesting research question is, how can you take those and how does it change with this new next generation of models?[00:17:09] Dan Fu: So the, I'll just briefly summarize what some of those key ideas were and then talk and then show you briefly kind of what the state of the art is today. So, so the four key ideas are instead of just doing a simple linear attention approximation, instead take ideas that we know from other fields like signal processing, do a more principled approach to your modeling of the sequence.[00:17:32] Idea 2: Hardware & Kernel Support[00:17:32] Dan Fu: Another key idea throughout all these lines of work is you really want. Hardware and kernel support from day one. So, so even if your model is theoretically more efficient if somebody goes and runs it and it's two times slower one of the things that, that we've learned is that if, if you're in that situation, it's, it's just gonna be dead on arrival.[00:17:49] Dan Fu: So you want to be designing your architectures one of the key, key machine learning ideas that has been important for the quality is just making sure that you encode different ways that you can [00:18:00] select from your hidden state and, and really focus on that as a key decider of quality. And finally, I think one of the, the, the emerging new, new things for, for this line of work and something that's quite interesting is, What are the right test time paradigms for these models?[00:18:15] Dan Fu: How do they change relative to relative to what you might do for a standard transformer? I'll briefly end this section. So I've labeled this slide where we are yesterday because Eugene is going to talk about some new models that he released literally this morning. But as of yesterday, some of the really cool results out of the, these efficient alternative models were so AI2 trained this hybrid MOE called Jamba.[00:18:40] Dan Fu: That, that, that seems, that is currently the state of the art for these non transformer architectures. There's this NVIDIA and MIT put out this new diffusion model called SANA recently that one of their key key observations is that you can take a standard diffusion transformer diffusion model, replace the layers with linear [00:19:00] attention, and then that lets you scale to much larger much larger images, much, much Much larger sequences more efficiently.[00:19:07] Dan Fu: And and one thing that I don't think anybody would have called when a few years ago is that one of those gated SSM, gated states based models ended up on the cover of Science because a great group of folks went and trained some DNA models. So that's Michael Polley, Eric Yuen from from Stanford and the Arc Institute.[00:19:26] Dan Fu: So it's, we're really at an exciting time in 2024 where these non transformer, post transformer architectures are showing promise across a wide range. Across a wide range of, of modalities, of applications, and, and of tasks. And with that, I'll pass it on to Eugene, who can tell you a little bit about the latest and greatest with RWKV.[00:19:49] RWKV vs SSMs[00:19:49] Eugene Cheah: So, that's useful? Yeah. You're talking to here. Oh, I'm talking to here. Okay. So, yeah, two streams. Yeah. So, I think one common questions that we tend to get asked, right, is what's the difference between [00:20:00] RWKV and state space? So I think one of the key things to really understand, right the difference between the two groups, right, is that we are actually more like an open source, random internet meets academia kind of situation.[00:20:11] Eugene Cheah: Like, most of us never wrote any paper, but we, we basically look at RNNs and linear intention when intention is all you need came out, and then we decided to like, hey there is a quadratic scaling problem. Why don't we try fixing that instead? So, so, so we end up developing our own branch, but we end up sharing ideas back and forth.[00:20:30] Eugene Cheah: So, and, and we do all this actively in Discord, GitHub, etc. This was so bad for a few years, right, that basically, the average group's H index was so close to zero, right, Illuter. ai actually came in and helped us write our first paper. Great, now our H index is now three, apparently. So, so, so, but, but the thing is, like, a lot of these experiments led to results, and, and, essentially, essentially, we we took the same ideas from linear attention, [00:21:00] and we built on it.[00:21:01] Eugene Cheah: So, to take a step back into, like, how does RWKB handle its own attention mechanic and achieve the same goals of, like, O and compute, respectively, and in focus of our overall goal to make AI accessible to everyone, regardless of language, nation, or compute, that's our goal. We actually train our models primarily on over a hundred languages, which is another topic altogether.[00:21:23] Eugene Cheah: And our goal is to train to even 200 languages to cover all languages in the world. But at the same time, we work on this architecture, To lower the compute cost so that people can run it on Raspberry Pis and on anything. So, how did RWKB break the dependency of LSTM token flow? Because I think to understand architecture, right, it's probably easier to understand it from the RNN lens.[00:21:46] Eugene Cheah: Because that's where we built on. We all, we all state space kind of like try to, try to start anew and took lessons from that and say, So there's a little bit of divergence there. And AKA, this our version of linear attention. So to take step back [00:22:00] all foundation models, be it transformers or non transformers at a very high level, right?[00:22:05] Eugene Cheah: Pumps in the token. I mean, text that things into embeddings and go through a lot of layers. Generate a lot of states where the QKV cache or be iron in states or RW KB states. And outputs and embedding, they are not the same thing. And we just take more layers and more embeddings. And somehow that magically works.[00:22:23] Eugene Cheah: So, if you, if you remember your ancient RNN lessons which we, which we, which we we call best learning these days the general idea is that you have the embedding information flowing all the way up, and when, and you take that information and you flow it back down, and then you process it as part of your LSTM layers.[00:22:41] Eugene Cheah: So, this is how it generally works. Kapati is quoted saying that RNNs are actually unreasonably effective. The problem is this is not scalable. To start doing work on the second token, you need to wait for the first token. And then you need to, and likewise for the third token and fourth token, yada yada.[00:22:55] Eugene Cheah: That is CPU land, not GPU land. So, so, so, you [00:23:00] can have a H100 and you can't even use 1 percent of it. So, so that's kind of why RNNs didn't really take off in the direction that we wanted, like, billions of parameters when it comes to training. So, what did RDAP KV version 0 do? Boom. We just did the dumbest, lamest thing.[00:23:13] Eugene Cheah: Sorry, this is the bottleneck for RNN. We did the dumb thing of removing that line. And it kind of worked. It trained. It sucked, but it kind of worked. Then we were like, hey, then no one cared because the loss was crap, but how do we improve that? And that's essentially where we move forward, because if you see this kind of flow, right, you can actually get your GPU saturated quickly, where it essentially cascades respectively.[00:23:41] Eugene Cheah: So I'm just waiting for this to loop again. So it's like, once you get your first layer, your token to be computed finish. You start to cascade your compute all the way until you are, Hey, I'm using 100 percent of the GPU. So we, we worked on it, and we started going along the principle of that as long as we keep this general architecture [00:24:00] where, where we can cascade and, and be highly efficient with our architecture, nothing is sacred in our architecture.[00:24:06] Eugene Cheah: And we have done some crazy ideas. In fact, you ask us, if you ask me to explain some things in the paper, right, officially in the paper, I'll say we had this idea and we wrote it this way. The reality is someone came with a code, we tested it, it worked, and then we rationalized later. So, so the general[00:24:24] RWKV Arch[00:24:24] Eugene Cheah: The idea behind rwkbr is that we generally have two major blocks that we do.[00:24:30] Eugene Cheah: We call time mix and channel mix. And time mix generally handles handles long term memory states, where essentially, where essentially where we apply the matrix multiplication and Cilu activation functions into processing an input embedding and an output embedding. I'm oversimplifying it because this, This calculation changed every version and we have, like, version 7 right now.[00:24:50] Eugene Cheah: ChannelMix is similar to Base in the sense that it does shorter term attention, where it just looks at the sister token, or the token before it, because [00:25:00] there's a shift in the token shift matrix. I don't really want to go too much into the papers itself, because, like, we do have three papers on this.[00:25:09] Eugene Cheah: Basically, RWKB, RNN for the transformer, ERA, Ego and Pinch, RWKB, Matrix Value State. This is the updated version 5, version 6. And Goldfinch is our, is, is, is, is our hybrid model respectively. We are writing the paper already for V seven and which is, which is for R wk V seven. Called, named Goose, or architectures are named by Bird.[00:25:30] Eugene Cheah: And, I'm going to cover as well, qrwkb, and mama100k, and rwkb, and Where did that lead to? Great! Because we are all GPU poor and to be clear, like, most of this research is done, like, only on a handful H100s, which I had one Google researcher told me that was, like, his experiment budget for a single researcher.[00:25:48] Eugene Cheah: So, our entire organization has less compute than a single researcher in Google. So We, we, one of the things that we explored into was to how do we convert transformer models instead? Because [00:26:00] someone already paid that billion dollars, a million dollars onto training, so why don't we take advantage of those weights?[00:26:05] Eugene Cheah: And, and to, I believe, together AI worked on the lockets for, for the Lambda side of things, and, and we took some ideas from there as well, and we essentially did that for RWKB.[00:26:15] QWRKWv6 launch[00:26:15] Eugene Cheah: And that led to, Q RWKB6, which we just dropped today, a 32 bit instruct preview model, where we took the Quen 32 bit instruct model, freeze the feedforward layer, remove the QKB attention layer, and replace it with RWKB linear layers.[00:26:32] Eugene Cheah: So to be clear, this means we do not have the rwkv channel mix layer, we only have the time mix layer. But but once we do that, we train the rwkv layer. Important is that the feedforward layer needs to be frozen, so the new attention can be learned. And then we unfreeze the feedforward layer, and train all the layers together with a custom learning rate schedule, so that they can learn how to work together.[00:26:54] Eugene Cheah: The end result, surprisingly, And, to be honest, to the frustration of the R. W. [00:27:00] KV MOE team, which ended up releasing the model on the same day, was that, with just a few hours of training on two nodes, we managed to get it to be on par, kind of, with the original QUAN32B model. So, in fact, when the first run, right, that completely confused us, it was like, and I was telling Daniel Goldstein, Smirky, who kind of leads most of our research coordination, When you pitched me this idea, you told me at best you'll get the same level of performance.[00:27:26] Eugene Cheah: You didn't tell me the challenge and score and Winograd score will shoot up. I don't know what's happening there. But it did. MMLU score dropping, that was expected. Because if you think about it, when we were training all the layers, right, we were essentially Like, Frankenstein this thing, and we did brain damage to the feedforward network layer 2 with the new RWKB layers.[00:27:47] Eugene Cheah: But, 76%, hey, somehow it's retained, and we can probably further train this. We didn't even spend more than 3 days training this, so there's a lot more that can be done, hence the preview. This brings up [00:28:00] a big question, because We are already now in the process of converting to 7TB. We are now, this is actually extremely compute efficient to test our attention mechanic.[00:28:10] Eugene Cheah: It's like, it becomes a shortcut. We can, we are already planning to do our version 7 and our hybrid architecture for it. Because we don't need to train from scratch. And we get a really good model out of it. And the other thing that is uncomfortable to say is that because we are doing right now on the 70b is that if this scales correctly to 128k context length, I'm not even talking about a million 128, majority of enterprise workload today is just on 70b at under 32k context length.[00:28:41] Eugene Cheah: That means if this works and the benchmark matches it, It means we can replace the vast majority of current AI workload, unless you want super long context. And then sorry, can someone give us more GPUs? Because we do need the VRAM for super long context, sadly. So yeah, that's what we are working on, and essentially, [00:29:00] we are excited about this to just push it further.[00:29:02] Eugene Cheah: And this conversion process, to be clear, I don't think it's going to be exclusive to RWKB. It probably will work for Mamba as well, I don't see why not. And we will probably see more ideas, or more experiments, or more hybrids, or Yeah, like, one of the weirdest things that I wanted to say outright, and I confirmed this with the Black Mamba team and the Jamba team, which because we did the GoFinch hybrid model, is that none of us understand why a hard hybrid with a state based model to be R.[00:29:28] Eugene Cheah: QA state space and transformer performs better when, than the baseline of both. It's like, it's like when you train one, you expect, and then you replace, you expect the same results. That's our pitch. That's our claim. But somehow when we jam both together, it outperforms both. And that's like one area of emulation that, like, we only have four experiments, plus four teams, that a lot more needs to be done.[00:29:51] Eugene Cheah: But, but these are things that excite me, essentially, because that is what it's potentially we can move ahead for. Which brings us to what comes next.[00:30:00] What's next[00:30:00] [00:30:00][00:30:00] Dan Fu: So, this part is kind of just some, where we'll talk a little bit about stuff that, that we're excited about. Maybe have some wild speculation on, on what, what's, what's coming next.[00:30:12] Dan Fu: And, of course this is also the part that will be more open to questions. So, a couple things that, that I'm excited about is continued hardware model co design for, for these models. So one of the things that we've put out recently is this library called ThunderKittens. It's a CUDA library.[00:30:29] Dan Fu: And one of the things that, that we found frustrating is every time that we built one of these new architectures, and I'm sure you had the exact same experience, we'd have to go and spend two months in CUDA land, like writing these, these new efficient things. And. If we decided to change one thing in PyTorch, like one line of PyTorch code is like a week of CUDA code at least.[00:30:47] Dan Fu: So one of our goals with, with a library like Thunderkitten, so we, we just broke down what are the key principles, what are the key hardware things what are the key, Compute pieces that you get from the hardware. So for example on [00:31:00] H100 everything is really revolves around a warp group matrix multiply operation.[00:31:06] Dan Fu: So you really want your operation to be able to split into relatively small matrix, matrix multiply operations. So like multiplying two 64 by 64 matrices, for example. And so if you know that ahead of time when you're designing your model, that probably gives you you know, some information about how you set the state sizes, how you set the update, how you set the update function.[00:31:27] Dan Fu: So with Thunderkittens we basically built a whole library just around this basic idea that all your basic compute primitives should not be a float, but it should be a matrix, and everything should just be matrix compute. And we've been using that to, to try to both re implement some existing architectures, and also start to design code.[00:31:44] Dan Fu: Some new ones that are really designed with this core with a tensor core primitive in mind. Another thing that that we're, that at least I'm excited about is we, over the last four or five years, we've really been looking at language models as the next thing. But if you've been paying [00:32:00] attention to Twitter there's been a bunch of new next generation models that are coming out.[00:32:04] Dan Fu: So there, there are. So, video generation models that can run real time, that are supported by your mouse and your keyboard, that I'm told if you play with them that, you know, that they only have a few seconds of memory. Can we take that model, can we give it a very long context length so that you could actually maybe generate an entire game state at a time?[00:32:25] Dan Fu: What does that look like for the model? You're certainly not going to do a giant quadratic attention computation to try to run that. Maybe, maybe use some of these new models, or some of these new video generation models that came out. So Sora came out I don't know, two days ago now. But with super long queue times and super long generation times.[00:32:43] Dan Fu: So that's probably a quadratic attention operation at the, at the bottom of it. What if we could remove that and get the same quality, but a lot faster generation time? Or some of the demos that we saw from Paige earlier today. You know, if I have a super long conversation with my [00:33:00] Gemini bot, what if I wanted to remember everything that it's seen in the last week?[00:33:06] Dan Fu: I mean, maybe you don't for personal reasons, but what if I did, you know? What does that mean for the architecture? And I think, you know, that's certainly something I'm pretty excited about. I'm sure you're excited about it too. So, I think we were supposed to have some hot takes, but I honestly don't remember what our hot takes were.[00:33:21] Hot Takes - does anyone really need long context?[00:33:21] Eugene Cheah: Yeah, including the next slide. Hot takes, yes, these are our[00:33:25] Dan Fu: hot takes.[00:33:25] Eugene Cheah: I think the big one on Twitter that we saw, that we shared, was the question is like, is RAG relevant? In the case of, like, the future of, like, state based models?[00:33:38] Dan Fu: Let's see, I haven't played too much with RAG. But when I have. I'll say I found it was a little bit challenging to do research on it because we had this experience over and over again, where you could have any, an embedding model of any quality, so you could have a really, really bad embedding model, or you could have a really, really [00:34:00] good one, By any measure of good.[00:34:03] Dan Fu: And for the final RAG application, it kind of didn't matter. That's what I'll say about RAG while I'm being recorded. I know it doesn't actually answer the question, but[00:34:13] Eugene Cheah: Yeah, so I think a lot of folks are like, extremely excited of the idea of RWKB or State Space potentially having infinite context.[00:34:21] Eugene Cheah: But I think the reality is that when we say infinite context, we just mean a different kind of infinite context, or you, or as it's previously covered, you need to test the model differently. So, think of it more along the lines of the human. Like, I don't remember what I ate for breakfast yesterday.[00:34:37] Eugene Cheah: Yeah, that's the statement that I'll say. And And we humans are not quadratic transformers. If we did, if let's say we increased our brain size for every second we live, we would have exploded by the time we are 5 years old or something like that. And, and I think, I think basically fundamentally for us, right, be it whether we, regardless of whether RWKB, statespace, XLSTM, [00:35:00] etc, our general idea is that instead of that expanding state, that increase in computational cost, what if we have a fixed state size?[00:35:08] Eugene Cheah: And Information theory detects that that fixed state size will have a limit. Just how big of a limit is a question, like, we, like, RWKB is running at 40 megabytes for, for its state. Its future version might run into 400 megabytes. That is like millions of tokens in, if you're talking about mathematically, the maximum possibility.[00:35:29] Eugene Cheah: It's just that I guess we were all more inefficient about it, so maybe we hit 100, 000. And that's kind of like the work we are doing, trying to like push it and maximize it. And that's where the models will start differing, because it will choose to forget things, it will choose to remember things. And that's why I think that there might be some element of right, but it may not be the same right.[00:35:49] Eugene Cheah: It may be the model learn things, and it's like, hmm, I can't remember that, that article. Let me do a database search, to search. Just like us humans, when we can't remember the article in the company. We do a search on Notion. [00:36:00][00:36:00] Dan Fu: I think something that would be really interesting is if you could have facts that are, so right now, the one intuition about language models is that all those parameters are around just to store random facts about the world.[00:36:14] Dan Fu: And this intuition comes from the observation that if you take a really small language model, it can do things like talk to you, or kind of has like the The style of conversation, it can learn that, but where it will usually fall over compared to a much larger one is it'll just be a lot less factual about things that it knows or that it can do.[00:36:32] Dan Fu: But that points to all those weights that we're spending, all that SGD that we're spending to train these models are just being used to store facts. And we have things like databases that are pretty good at storing facts. So I think one thing that would be really interesting is if we could actually have some sort of outside data store that a language model can can look at that that maybe is you know, has has some sort of gradient descent in it, but but would be quite interesting.[00:36:58] Dan Fu: And then maybe you could edit it, delete [00:37:00] facts, you know, change who's president so that it doesn't, it doesn't get lost.[00:37:04] Vibhu: Can we open up Q& A and hot takes for the audience? I have a hot take Q& A. Do these scale? When, when 405B state space model, RAG exists, no one does long context, who's throwing in 2 million token questions, hot takes?[00:37:24] Dan Fu: The, the who's throwing in 2 million token question, I think, is, is a really good question. So I actually, I was going to offer that as a hot take. I mean, my hot take was going to be that long context doesn't matter. I know I just gave a whole talk about it, but you know, what, what's the point of doing research if you can't, you know, play both sides.[00:37:40] Dan Fu: But I think one of the, so I think for both of us, the reason that we first got into this was just from the first principled questions of there's this quadratic thing. Clearly intelligence doesn't need to be quadratic. What is going on? Can we understand it better? You know, since then it's kind of turned into a race, which has [00:38:00] been exciting to watch, like, how much context you can take in.[00:38:03] Dan Fu: But I think it's right. Nobody is actually putting in a two million context prompt into these models. And, and, you know, if they are, maybe we can go, go You know, design a better model to do that particular thing. Yeah, what do you think about that? So you've also been working on this. Do you think long context matters?[00:38:19] Eugene Cheah: So I'm going to burn a bit. How many of you remember the news of Google Gemini supporting 3 million contacts, right? Raise your hand.[00:38:28] Vibhu: Yeah, 2 million.[00:38:29] Eugene Cheah: Oh, it's 2 million.[00:38:31] Eugene Cheah: Yeah, how many of you actually tried that? See?[00:38:34] Vibhu: I use it a lot. You? You work for MindsTV. I use it a lot.[00:38:41] Eugene Cheah: So, for some people that has used, and I think, I think that's the, that's might be, like, this is where my opinion starts to differ, because I think the big labs may have a bigger role in this, because Like, even for RWKB, even when we train non contacts, the reason why I say VRAM is a problem is that because when we did the, we need to backprop [00:39:00] against the states, we actually need to maintain the state in between the tokens by the token length.[00:39:05] Eugene Cheah: So that means we need to actually roll out the whole 1 million contacts if we are actually training 1 million. Which is the same for transformers, actually, but it just means we don't magically reuse the VRAM consumption in the training time space. So that is one of the VRAM bottlenecks, and I'm neither OpenAI nor Google, so donate GPUs if you have too much of them.[00:39:27] Eugene Cheah: But then, putting it back to another paradigm, right, is that I think O1 style reasoning might be actually pushing that direction downwards. In my opinion, this is my partial hot take is that if, let's say you have a super big model, And let's say you have a 70B model that may take double the tokens, but gets the same result.[00:39:51] Eugene Cheah: Strictly speaking, a 70B, and this is even for transformer or non transformer, right? We we'll take less less resources than that 400 B [00:40:00] model, even if it did double the amount thinking. And if that's the case, and we are still all trying to figure this out, maybe the direction for us is really getting the sub 200 B to be as fast as efficient as possible.[00:40:11] Eugene Cheah: We a very efficient architecture that some folks happen to be working on to, to just reason it out over larger and larger context thing.[00:40:20] Question: Yeah. One thing I'm super interested in is. Models that can watch forever? Obviously you cannot train something on infinite context length. How are y'all thinking about that, where you run on a much longer context length than is possible to train on?[00:40:38] Dan Fu: Yeah, it's a, it's a great question. So I think when I think you guys probably had tweets along these lines, too. When we first started doing these things, because these are all recurrent models in theory you could just run it forever. You could just run it forever. And at the very least it won't, it won't like error out on your crash.[00:40:57] Dan Fu: There's another question of whether it can actually [00:41:00] use what it's seen in that infinite context. And I think there, so one place where probably the research and architectures ran faster Then another research is actually the benchmarks for long context. So you turn it on forever. You want to do everything or watch everything.[00:41:16] Dan Fu: What is it that you actually wanted to do? Can we actually build some benchmarks for that? Then measure what's happening. And then ask the question, can the models do it? Is there something else that they need? Yeah, I think that if I were to turn back the clock to 2022, that's probably one of the things I would have done differently, which would have been actually get some long context benchmarks out at the same time as we started pushing context length on all these models.[00:41:41] Eugene Cheah: I will also say the use case. So like, I think we both agree that there's no Infinite memory and the model needs to be able to learn and decide. I think what we have observed for, I think this also fits the state space model, is that one of the key advantages of this alternate attention mechanic that is not based on token position is that the model don't suddenly become crazy when you go past the [00:42:00] 8k training context tank, or a million context tank.[00:42:03] Eugene Cheah: It's actually still stable. It's still able to run, it's still able to rationalize. It just starts forgetting things. But some of these things are still there in latent memory. Some of these things are still somewhat there. That's the whole point of why reading twice works. Things like that. And one of the biggest pushes in this direction is that I think both Statespace and RWKB have Separate papers by other researchers where they use this architecture for time series data.[00:42:26] Eugene Cheah: Weather modeling. So, you are not asking what was the weather five days ago. You're asking what's the weather tomorrow based on the infinite length that we, as long as this Earth and the computer will keep running. So, so, and they found that it is like, better than existing, like, transformer or existing architecture in modeling this weather data.[00:42:47] Eugene Cheah: Control for the param size and stuff. I'm quite sure there are people with larger models. So, so there are things that, that in this case, right, there is future applications if your question is just what's next and not what's 10 years ago.[00:42:59] Dan Fu: Thanks so [00:43:00] much for having us. Get full access to Latent Space at www.latent.space/subscribe

SOMAPSO Pod
SOMAPSO Pod - Week of Dec 19, 2024

SOMAPSO Pod

Play Episode Listen Later Dec 19, 2024 27:43


It's the holiday season so hoop-de-do and dickory dock and don't forget we're off for the next two weeks!We rewind to the art opening at the Herb and Milly Iris Gallery, the Baker Square dedication, and even more holiday preparations.We are looking forward to ribbon cuttings, a wine tasting, a tiki pop-up bar, Dickens Village, a holiday market, the Local D, Old Year's Eve, and embroidery journals.Three Things with free karate classes, the SO Elf Chase, join a Maplewood Township committee, a vintage Pop-Up, Fox and Falcon, and CKO.Our last Co-Lab Thing To Know all about the Co-Lab marketplace, a gingerbread house decorating event, and two pop-ups this weekend!Whatever you celebrate, have a great holiday everyone!LINKS:South Orange "Elf Chase Giveaway"Maplewood Township boards and committees Crash Doll Vintage Pop-Up CKO rebranding

Business of Architecture Podcast
595: Architects: Stop Designing & Start Developing with Nourah Said of NAS CoLab

Business of Architecture Podcast

Play Episode Listen Later Dec 6, 2024 0:32


What's the future of architecture? Architect and entrepreneur Nourah Said joins the podcast to explore groundbreaking ideas that could change how architects work and earn. From her Cambridge Judge School thesis to her own entrepreneurial journey, Nourah sheds light on challenges in the profession and how architects can reclaim creative and financial control. Learn why the traditional role of architects may no longer fit the modern world. Nourah discusses how breaking free from old systems could unlock new growth opportunities. Along the way, she reveals surprising lessons architects can learn from developers and entrepreneurs. Could architects really take on a different role in the construction process? Nourah shares insights from her research and conversations with industry leaders that might make you rethink what's possible for architects today.   A controversial idea that merges two traditionally separate roles in construction. Why one business model might be holding architects back—and how to break free. A shift that could make architects more innovative, influential, and profitable. What young architects are doing today could reshape the future of the industry.   To learn more about Nourah, visit her: Website:  https://nas-colab.com/ LinkedIn: https://www.linkedin.com/in/nourah-said-riba-14752059/   ► Transcription: https://otter.ai/u/CHSo19PDlVy4HRBJJKlV1CJuQGE?utm_source=copy_url   ► Feedback? Email us at podcast@businessofarchitecture.com   ► Access your free training at http://SmartPracticeMethod.com/   ► If you want to speak directly to our advisors, book a call at https://www.businessofarchitecture.com/call   ► Subscribe to our YouTube Channel for updates:   https://www.youtube.com/c/BusinessofArchitecture   *******   For more free tools and resources for running a profitable, impactful, and fulfilling practice, connect with me on: Facebook: https://www.facebook.com/groups/businessofarchitecture Instagram: https://www.instagram.com/businessofarch/ Website: https://www.businessofarchitecture.com/yt Twitter: https://twitter.com/BusinessofArch Podcast: http://www.businessofarchitecture.com/show iTunes: https://podcasts.apple.com/us/podcast/business-architecture-podcast/id588987926 Android Podcast Feed: http://feeds.feedburner.com/BusinessofArchitecture-podcast Google Podcasts: https://podcasts.google.com/feed/aHR0cHM6Ly9idXNpbmVzc29mYXJjaGl0ZWN0dXJlLmxpYnN5bi5jb20vcnNz   *******   Access the FREE Architecture Firm Profit Map video here: http://freearchitectgift.com   Carpe Diem!