Podcasts about ideogram

  • 64 PODCASTS
  • 101 EPISODES
  • 40m AVG DURATION
  • 1 EPISODE EVERY OTHER WEEK
  • Aug 27, 2025 LATEST

POPULARITY (chart by year, 2017 to 2024)


Best podcasts about ideogram

Latest podcast episodes about ideogram

Making a Scene Presents
AI for Merch & Fan Product Design: Turning Ideas into Unique Fan Experiences

Aug 25, 2025 · 11:11


Merch has always been one of the most powerful tools indie artists have. Not only does it bring in money, but it also builds identity. A shirt, a vinyl sleeve, or even a sticker isn't just an object; it's a way for fans to carry your music into the world. The problem for many artists has always been cost. Professional designers can be expensive, and trying to do it yourself often feels limited.

That's where artificial intelligence comes in. Generative AI tools like Midjourney, Ideogram, and Stable Diffusion aren't here to take creativity away from artists. They're here to amplify it. Instead of thinking of AI as a replacement for human imagination, think of it as a creative partner that can unlock ideas you never would have thought possible. http://www.makingascene.org

This Day in AI Podcast
GPT-5 A Week Later, Ideogram Character Reference & gaggle poaching - EP99.13-THINKING-MINI

Aug 15, 2025 · 61:33


Join Simtheory: https://simtheory.ai
----
CHAPTERS:
00:00 - Simtheory plug
00:48 - GPT-5 1 Week Later, Reaction to GPT-5 & Our Thoughts on Future of AI Models
30:12 - Ideogram Character Reference Fun + Disturbing Photos of Us
37:33 - Using creative MCPs together for photos, videos and 3D objects
43:16 - MCP output combinations and the explosion of MCPs
51:18 - What is needed from the next models like Gemini 3.0 Pro
54:30 - Sundar Pendant Design & Final Thoughts
56:20 - Final LOLz of week: gaggle poaching
58:10 - Surprise GPT-5 Indie Song
Thanks for all of your support and for listening to the show! xoxox

Everyday AI Podcast – An AI and ChatGPT Podcast
EP 581: Microsoft and OpenAI renegotiating, Google launches new model, and more AI News That Matters

Aug 4, 2025 · 38:33


There's a new most powerful AI model in town. Apple is trying to make a ChatGPT competitor. And OpenAI? Well... they're in a capacity crunch.

Big Tech made some BIG moves in AI this week. And you probably missed them. Don't worry. We gotchyu. On Mondays, Everyday AI brings you the AI News that Matters. No B.S. No marketing fluff. Just what you need to know to be the smartest person in AI at your company.

Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Join the discussion: Thoughts on this? Join the convo and connect with other AI leaders on LinkedIn.
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn

Topics Covered in This Episode:
OpenAI Study Mode in ChatGPT Launch
Google Gemini 2.5 Deep Think Release
Gemini 2.5 Parallel Thinking and Coding Benchmarks
Google AI Mode: PDF and Canvas Features
Notebook LM Video Overviews Customization
Microsoft Edge Copilot Mode Experimental Rollout
OpenAI GPT-5 Model Launch Delays
Apple Building In-House ChatGPT Competitor
Microsoft and OpenAI Partnership Renegotiation
Additional AI Tool Updates: Runway, Midjourney, Ideogram

Timestamps:
00:00 AI Industry Updates and Competition
03:22 ChatGPT's Study Mode Promotes Critical Thinking
09:02 "Google AI Search Mode Enhancements"
10:21 Google AI Enhances Learning Tools
16:14 Microsoft Edge Introduces Copilot Mode
20:18 OpenAI GPT-5 Delayed Speculation
22:42 Apple Developing In-House ChatGPT Rival
27:06 Microsoft-OpenAI Partnership Renegotiation
30:51 Microsoft-OpenAI Partnership Concerns Rise
33:23 AI Updates: Video, Characters, Amazon

Keywords:
Microsoft and OpenAI renegotiation, Copilot, OpenAI, GPT-5, AI model, Google Gemini 2.5, Deep Think mode, Google AI mode, Canvas mode, NotebookLM, AI browser, Agentic browser, Edge browser, Perplexity Comet, Sora, AI video tool, AI image editor, Apple AI chatbot, ChatGPT competitor, Siri integration, Artificial General Intelligence, AGI, Large Language Models, AI education tools, Study Mode, Academic cheating, Reinforcement learning, Parallel thinking, Code Bench Competition, Scientific reasoning, Chrome, Google Lens, Search Live, AI-powered search, PDF upload, Google Drive integration, Anthropic, Meta, Superintelligent labs, Amazon Alexa, Fable Showrunner, Ideogram, Midjourney, Luma Dream Machine, Zhipu GLM 4.5, Runway Aleph, Adobe Photoshop harmonize, AI funding, AI product delays, AI feature rollout, AI training, AI onboarding, AI-powered presentations, AI-generated overviews, AI in business, AI technology partnership, AI investment, AI talent acq

Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)
Ready for ROI on GenAI? Go to youreverydayai.com/partner

VP Land
Runway Aleph Changes VFX Forever, Adobe Harmonize, Wan 2.2 & More! | This Week in AI for Filmmakers

Aug 1, 2025 · 48:43


Runway Aleph allows you to manipulate your videos using simple text prompts, with no more tedious VFX workflows. This week, Addy and Joey break down the wave of new AI tools transforming post-production, including Luma AI's Modify with Instructions feature, Wan 2.2's open-source model, Ideogram's one-shot character generator, Adobe's new Harmonize tool, and more. Plus, we explore what these tools mean for VFX artists, virtual production, and the future of filmmaking workflows.

The views and opinions expressed in this podcast are the personal views of the hosts and do not necessarily reflect the views or positions of their respective employers or organizations. This show is independently produced by VP Land without the use of any outside company resources, confidential information, or affiliations.

Machine Learning Guide
MLA 025 AI Image Generation: Midjourney vs Stable Diffusion, GPT-4o, Imagen & Firefly

Jul 9, 2025 · 72:33


The 2025 generative AI image market is a trade-off between aesthetic quality, instruction-following, and user control. This episode analyzes the key platforms, comparing Midjourney's artistic output against the superior text generation and prompt adherence of GPT-4o and Imagen 4, the commercial safety of Adobe Firefly, and the total customization of Stable Diffusion.

Links
Notes and resources at ocdevel.com/mlg/mla-25
Try a walking desk - stay healthy & sharp while you learn & code
Build the future of multi-agent software with AGNTCY.

The State of the Market
The market is split by three core philosophies:
The "Artist" (Midjourney): Prioritizes aesthetic excellence and cinematic output, sacrificing precise user control and instruction following.
The "Collaborator" (GPT-4o, Imagen 4): Extensions of LLMs that excel at conversational co-creation, complex instruction following, and integration into productivity workflows.
The "Sovereign Toolkit" (Stable Diffusion): An open-source engine offering users unparalleled control, customization, and privacy in exchange for technical engagement.

Table 1: 2025 Generative AI Image Tool At-a-Glance Comparison
Tool | Parent Company | Access Method(s) | Pricing | Core Strength | Best For
Midjourney v7 | Midjourney, Inc. | Web App, Discord | Subscription | Artistic Aesthetics & Photorealism | Fine Art, Concept Design, Stylized Visuals
GPT-4o | OpenAI | ChatGPT, API | Freemium/Sub | Conversational Control & Instruction Following | Marketing Materials, UI/UX Mockups, Logos
Google Imagen 4 | Google | Gemini, Workspace, Vertex AI | Freemium/Sub | Ecosystem Integration & Speed | Business Presentations, Educational Content
Stable Diffusion 3 | Stability AI | Local Install, Web UIs, API | Open Source | Ultimate Customization & Control | Developers, Power Users, Bespoke Workflows
Adobe Firefly | Adobe | Creative Cloud Apps, Web App | Subscription | Commercial Safety & Workflow Integration | Professional Designers, Agencies, Enterprise

Core Platforms
Midjourney v7: Premium choice for artistic quality. Features: Web UI with Draft Mode, user personalization, emerging video/3D. Weaknesses: Poor text generation, poor prompt adherence, public images on cheap plans, no API/bans automation.
OpenAI GPT-4o: An intelligent co-creator for controlled generation. Features: Conversational refinement, superior text rendering, understands uploaded image context. Weaknesses: Slower than competitors, generates one image at a time, strict content filters.
Google Imagen 4: Pragmatic tool focused on speed and ecosystem integration. Features: High-quality photorealism, fast generation, strong text rendering, multilingual. Weaknesses: Less artistic flair; value is dependent on Google ecosystem investment.
Stable Diffusion 3: Open-source engine for maximum user control. Features: MMDiT architecture improves prompt/text handling, scalable models, vast ecosystem (LoRAs/ControlNet). Weaknesses: Steep learning curve, quality is user-dependent.
Adobe Firefly: Focused on commercial safety and professional workflow integration. Features: Trained on Adobe Stock for legal indemnity, Generative Fill/Expand tools. Weaknesses: Creative range limited by training data, requires Adobe subscription/credits.

Tools and Concepts
In-painting: Modifying a masked area inside an image.
Out-painting: Extending an image beyond its original borders.
LoRA (Low-Rank Adaptation): A small file that applies a fine-tuned style, character, or concept to a base model.
ControlNet: Uses a reference image (e.g., pose, sketch) to enforce the composition, structure, or pose of the output.
A1111 vs. ComfyUI: Two main UIs for Stable Diffusion. A1111 is a beginner-friendly tabbed interface; ComfyUI is a node-based interface for complex, efficient, and automated workflows.

Workflows
"Best of Both Worlds": Generate aesthetic base images in Midjourney, then composite, edit, and add text with precision in Photoshop/Firefly.
Single-Ecosystem: Work entirely within Adobe Creative Cloud or Google Workspace for seamless integration, commercial safety (Adobe), and convenience (Google).
"Build Your Own Factory": Use ComfyUI to build automated, multi-step pipelines for consistent character generation, advanced upscaling, and video.

Decision Framework
Choose by Goal:
Fine Art/Concept Art: Midjourney.
Logos/Ads with Text: GPT-4o, Google Imagen 4, or specialist Ideogram.
Consistent Character in Specific Pose: Stable Diffusion with a Character LoRA and ControlNet (OpenPose).
Editing/Expanding an Existing Photo: Adobe Photoshop with Firefly.
Exclusion Rules:
If you need legible text, exclude Midjourney.
If you need absolute privacy or zero cost (post-hardware), Stable Diffusion is the only option.
If you need guaranteed commercial legal safety, use Adobe Firefly.
If you need an API for a product, use OpenAI or Google; automating Midjourney is a bannable offense.
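The exclusion rules in these show notes can be read as a simple filter over the five tools. The sketch below is one possible encoding, not something from the episode: the `Needs` class, its field names, and the `shortlist` function are all hypothetical, and the rules are applied exactly as worded above (for example, the API rule keeps only the OpenAI and Google tools, as the notes advise).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Needs:
    """Hypothetical project requirements; field names are illustrative only."""
    legible_text: bool = False
    privacy_or_zero_cost: bool = False
    legal_safety: bool = False
    api_access: bool = False


# Candidate tools named in the show notes.
TOOLS = ["Midjourney", "GPT-4o", "Google Imagen 4", "Stable Diffusion", "Adobe Firefly"]


def shortlist(needs: Needs) -> list[str]:
    """Apply each exclusion rule in turn and return the tools that survive."""
    tools = list(TOOLS)
    if needs.legible_text:
        # "If you need legible text, exclude Midjourney."
        tools = [t for t in tools if t != "Midjourney"]
    if needs.privacy_or_zero_cost:
        # "Stable Diffusion is the only option."
        tools = [t for t in tools if t == "Stable Diffusion"]
    if needs.legal_safety:
        # "Use Adobe Firefly."
        tools = [t for t in tools if t == "Adobe Firefly"]
    if needs.api_access:
        # "Use OpenAI or Google."
        tools = [t for t in tools if t in ("GPT-4o", "Google Imagen 4")]
    return tools


print(shortlist(Needs(legible_text=True, api_access=True)))
```

An empty result (for instance, requiring both legal safety and an API) would signal that the stated requirements conflict under these rules, which is a cue to relax one of them rather than a verdict on any tool.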

a16z
What You Missed in AI This Week (Google, Apple, ChatGPT)

Jun 13, 2025 · 28:50


Things in consumer AI are moving fast. In this episode, Justine and Olivia Moore, investing partners (and identical twins!) at a16z, break down what's real, what's overhyped, and what's next across the consumer AI space.

They cover:
Veo 3: how Google's video model unlocked a new genre of content
OpenAI's Advanced Voice Mode: upgrades, realism, and... um, human-like hesitation
Apple's AI announcements
11Labs V3: expressive voice tags, real-time interruptions, and narrative tools for creators
New data from a16z: AI consumer startups are ramping revenue faster than ever, and they show you how
Justine walks through how she used ChatGPT, Ideogram, and Krea to launch a fully AI-assisted brand prototype (store photos and all)

Timecodes:
00:00 Introduction
00:28 Meet the Hosts: Justine and Olivia
00:44 Veo 3: The Game-Changer in AI Video
06:34 ChatGPT's Advanced Voice Mode Updates
10:22 Apple's AI Announcements and Siri's Shortcomings
12:18 11 Labs' New Voice Model: 11 V3
15:50 Report from a16z: AI Revenue Growth
23:14 Demo of the Week: AI in Brand Creation

Resources:
Read ‘What “Working” Means in the Era of AI Apps': https://a16z.com/revenue-benchmarks-a...
Find Justine on X: https://x.com/venturetwins
Find Olivia on X: https://x.com/omooretweets

Tools Discussed:
Veo 3: https://gemini.google/overview/video-...
OpenAI: https://openai.com/chatgpt
11Labs (V3 voice model): https://elevenlabs.io/
Ideogram (logo/image generation): https://ideogram.ai/
Black Forest Labs/Flux Context (image editing via Krea): https://www.krea.ai/
Flux Context demo (Krea launch post): https://www.krea.ai/blog/flux-context
Hedra: https://www.hedra.com/

Stay Updated:
Let us know what you think: https://ratethispodcast.com/a16z
Find a16z on Twitter: https://twitter.com/a16z
Find a16z on LinkedIn: https://www.linkedin.com/company/a16z
Subscribe on your favorite podcast app: https://a16z.simplecast.com/
Follow our host: https://x.com/eriktorenberg

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.

da Brand a Friend
#374 - ContentFinderToolkit

Jun 1, 2025 · 13:44


The set of books I love most: the Edward Tufte collection.
100% free digital tools to: a) remove backgrounds from photos; b) download videos from YouTube and other sites.
A directory-catalog of hundreds of sites and resources where you can find free content, images, sounds, and videos that you can reuse. https://bit.ly/contentfindertoolkit
_______________
Useful Info
• Support this podcast: get in touch with me, get feedback, and receive advice on your online project: https://Patreon.com/Robin_Good
• Music in this episode: "Favela Beat" by Birocratic, available on Bandcamp
• Cover photo: image generated with Ideogram.ai.
• Listen to and share this podcast: https://www.spreaker.com/show/dabrandafriend
Complete archive organized by topic: https://start.me/p/kxENzk/da-brand-a-friend-archivio-podcast
• Follow me on Telegram: https://t.me/RobinGoodItalia
Instagram channel (unposed moments of life, what my eyes see): https://instagram.com/giggi_canali
• Newsletters in English:
https://robingood.substack.com - Focus on building trust for online entrepreneurs
https://goodtools.substack.com - Zero-cost alternative tools
https://curationmonetized.substack.com - Examples of how to monetize by organizing information.

da Brand a Friend
#372 - Produttività e Salute

May 18, 2025 · 22:11


5 useful recommendations of tools and products that I use to improve my productivity and to keep my body fit and in excellent health.
Links to the resources mentioned in the episode:
Rambull Newsletter
Productivity
Sound environments:
1) MyNoise.net + interview with Stephane Pigeon, the audio engineer behind this fascinating project
2) ChillyATC.com
3) Flocus Virtual Café
4) Other interesting sound environments: Background Sounds To Write, Focus and Relax To - GT #28
Health
1) Alkavital
2) DMSO 99.9%
_______________
Useful Info
• Music in this episode: "Let's Go Surfing" by Joakim Karud, available on Bandcamp
• Cover photo: Monkey with coconut. Generated with Ideogram.

da Brand a Friend
#371 - Scimmie Anticocco

May 11, 2025 · 21:18


Chat, reflections, thoughts. Nothing special or extraordinary. What I see, what I hear, and what goes through my head.
New mini-apps created with AI:
1) WebPage TimeDetector - Chrome extension
2) Video Finder - web app
Available free of charge to all my supporters on Patreon and Substack.
_______________
Useful Info
• Music in this episode: "Sleepyface" by Birocratic, available on Bandcamp
• Cover photo: Monkey with coconut. Generated with Ideogram.

Leveraging AI
187 | How Top Creators Use AI to Create Scroll-Stopping LinkedIn Content with MJ Jaindl

May 6, 2025 · 47:19


Ryan's Method: Passive Income Podcast
Create Profitable Print on Demand Designs in 2 Minutes (or less)

Apr 15, 2025 · 8:55


One of the most popular design tools for print on demand just integrated Ideogram natively into their design app, allowing me to create 10 designs in 10 minutes!

Ryan's Method: Passive Income Podcast
Watch me Create Best-Selling Designs in 27 Seconds

Apr 2, 2025 · 14:17


Ideogram has an incredible ability to 'remix' existing designs and allow us to sell in the highest-demand print on demand niches with no graphic design experience or ability.

Let's Talk AI
#205 - Gemini 2.5, ChatGPT Image Gen, Thoughts of LLMs

Apr 1, 2025 · 94:18


Our 205th episode with a summary and discussion of last week's big AI news! Recorded on 03/28/2025.
Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai
Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
Join our Discord here! https://discord.gg/nTyezGSKwP

In this episode:
OpenAI's new image generation capabilities represent significant advancements in AI tools, showcasing impressive benchmarks and multimodal functionalities.
OpenAI is finalizing a historic $40 billion funding round led by SoftBank, and Sam Altman shifts focus to technical direction while COO Brad Lightcap takes on more operational responsibilities.
Anthropic unveils groundbreaking interpretability research, introducing cross-layer tracers and showcasing deep insights into model reasoning through applications on Claude 3.5.
New challenging benchmarks such as ARC AGI 2 and complex Sudoku variations aim to push the boundaries of reasoning and problem-solving capabilities in AI models.

Timestamps + Links:
(00:00:00) Intro / Banter
(00:01:01) News Preview

Tools & Apps
(00:02:46) Gemini 2.5: Our most intelligent AI model
(00:08:41) OpenAI rolls out image generation powered by GPT-4o to ChatGPT
(00:16:14) Ideogram presents version 3.0 of its AI image generation system
(00:19:20) New Reve Image Generator Beats AI Art Heavyweights MidJourney and Flux at a Penny Per Image
(00:21:56) Alibaba Releases Qwen2.5 Omni, Adds Voice and Video Modes to Qwen Chat
(00:23:58) The official version of Tencent's Hunyuan Deep Thinking Model T1 is here, with fast articulation, instant responses, and a decoding speed increase of 2 times

Applications & Business
(00:25:45) OpenAI Close to Finalizing $40 Billion SoftBank-Led Funding
(00:29:26) OpenAI reshuffles leadership as Sam Altman pivots to technical focus
(00:33:23) Nvidia shows off Rubin Ultra with 600,000-Watt Kyber racks and infrastructure, coming in 2027
(00:35:23) China's SiCarrier emerges as challenger to ASML, other chip tool titans
(00:38:24) Pony.ai wins first permit for fully driverless taxi operation in the center of China's Silicon Valley

Projects & Open Source
(00:40:27) A new, challenging AGI test stumps most AI models
(00:45:16) Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models
(00:48:13) Wan: Open and Advanced Large-Scale Video Generative Models
(00:50:38) DeepSeek V3-0324 tops non-reasoning AI models in open-source first
(00:54:46) OpenAI adopts rival Anthropic's standard for connecting AI models to data

Research & Advancements
(00:55:56) Anthropic can now track the bizarre inner workings of a large language model
(01:06:00) Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models
(01:11:50) Inside-Out: Hidden Factual Knowledge in LLMs
(01:15:14) Sakana AI super-powers AI reasoning using Japan's own Sudoku Puzzles

Policy & Safety
(01:18:38) Senator Wiener Introduces Legislation to Protect AI Whistleblowers & Boost Responsible AI Development
(01:21:50) NVIDIA & Other Tech Giants Demand Trump Administration To Reconsider “AI Diffusion” Policy Which Is Set To Be Effective By May 15
(01:23:17) U.S. blacklists over 50 Chinese companies in bid to curb Beijing's AI, chip capabilities
(01:26:44) Netflix's Reed Hastings Gives $50 Million to Bowdoin for A.I. Program
(01:27:55) Judge allows 'New York Times' copyright case against OpenAI to go forward
(01:29:48) Judge rules that AI can continue training on copyrighted lyrics, for now

Trench Tech
[Extrait] L'iA peut-elle sauver le monde ? - Lou Welgryn

Mar 29, 2025 · 6:57


What if AI weren't so magical after all? Let's be frank: is artificial intelligence the problem or the solution? This podcast explores the social and environmental damage caused by AI and how to mobilize citizens and tech professionals to act differently. Listen to the full episode, Pirates de la Tech : Cap sur le bien commun, with Lou Welgryn:

VP Land
ChatGPT's Crazy Image Upgrade (Plus Reve & Ideogram 3.0), Hollywood's Global Shift, and Roblox Goes Generative

Mar 28, 2025 · 34:58


We analyze the image generation capabilities of ChatGPT 4o, Reve, and Ideogram 3.0, examining their improved text handling and what it means for creators. Then, we dive into the Studio Ghibli AI controversy, Hollywood's production exodus overseas, and why Rob Lowe is shooting American game shows in Dublin. Plus, Roblox enters the generative 3D space with their new Cube tool, potentially changing how virtual worlds are built at scale.

TeknoSafari's Podcast
Okullarda AI Öğretmen Dönemi Başladı! Test Sonuçları Uçuşa Geçti

Mar 28, 2025 · 22:54


1. Gemini has heated up the competition in image processing. It can analyze video, and with 2.5 it is playing for the top spot.
2. GPT got a new image processing engine, and it arrived in style.
3. Ideogram immediately rolled out Version 3.
4. REVE is very, very good.
5. SORA is unlimited for Plus subscribers.
6. Video generation is expected to come to GROK.
7. DeepSeek V3 has arrived. R2 is expected. Those who have tested it say it is nearly AGI. Meanwhile, DeepSeek is barring some employees from traveling abroad freely.
8. BAIDU introduced ERNIE 4.5 and X1. As a deep-thinking reasoning model with multimodal capabilities, ERNIE X1 matches the performance of DeepSeek R1 at only half the price.
9. Videos can be uploaded to QWEN.
10. BYD's Zhengzhou factory will be BIGGER than San Francisco, and 10 times BIGGER than Tesla Gigafactory Nevada.
11. The NotebookLM podcast feature is now in Gemini.
12. Mind maps have come to NotebookLM.
13. A new Microsoft 365 Copilot was introduced that brings Adobe's best features directly into your workflow.
14. Adobe will allow third-party software. SORA, Runway, Pika, Flux, and many other applications are entering the ecosystem.
15. Perplexity released an ad mocking Google. A bold move.
16. Texas schools' use of an 'AI teacher' launched student test scores into the top 2% in the country. Administrators say students are learning 'better' and 'faster'.
17. The Glif called Steal Their Look strips off your clothes. Glif.app is a platform that lets users create small AI-based apps and chatbots. "Glifs" is the name given to the AI-powered generators built on this platform, which produce text, images, video, or combinations of these based on user input (text, images, or button clicks).
18. Apple is stepping into the AI data center game with a $1 billion order for NVIDIA GB300 NVL72. Hard to believe.
19. Nilüfer Municipality became the first district municipality to set up an Artificial Intelligence Office!

AI Unraveled: Latest AI News & Trends, Master GPT, Gemini, Generative AI, LLMs, Prompting, GPT Store

OpenAI faced copyright discussions over its Ghibli-style image generation while projecting substantial revenue growth, despite ongoing significant investment. Simultaneously, Ideogram launched a sophisticated image generation model, outperforming competitors. BMW and Alibaba partnered to integrate AI into vehicles, and Alibaba also released a versatile AI model for mobile devices and other applications, with plans for its adoption by major tech companies. Furthermore, Bill Gates predicted widespread replacement of doctors and teachers by AI, and North Korea revealed new AI-powered military drones, raising security considerations. The day also saw OpenAI enhance ChatGPT with image generation and adopt Anthropic's open-source protocol, alongside various other AI developments from companies like Microsoft, Amazon, and Midjourney, as well as regulatory actions.

AI For Humans
OpenAI's New 4o Image Gen Dominates The Internet, Google Gemini 2.5 & More AI News

Mar 27, 2025 · 62:41


OpenAI's new 4o Image Gen is the best AI image model we've seen to date and it has absolutely taken over the Internet. Plus, Gemini 2.5 is no slouch and a ton of new robots! Plus, OpenAI's new OpenAI.fm lets you prompt AI voices in new ways, DeepSeek's new model is actually better (at times) than GPT 4.5, a new Cursor for 3D modeling and, we're so sorry for this, but a LOT of talk about AI Big Booty Bears.

**GO AND VISIT OUR SPONSOR Y'ALL** bubble.io/aiforhumans

Join the discord: https://discord.gg/muD2TYgC8f
Join our Patreon: https://www.patreon.com/AIForHumansShow
AI For Humans Newsletter: https://aiforhumans.beehiiv.com/
Follow us for more on X @AIForHumansShow
Join our TikTok @aiforhumansshow
To book us for speaking, please visit our website: https://www.aiforhumans.show/

// Show Links //
OpenAI's GPT-4o Image Gen is Here https://openai.com/index/introducing-4o-image-generation/
Live demo (with Sam) https://www.youtube.com/live/2f3K43FHRKo?si=vL_0QC8ygRx4MgOF
OpenAI Causes The Great Giblification of the Internet https://x.com/heyBarsee/status/1904891940522647662
husbandt: https://x.com/squirtle_says/status/1904816587108213244
trump/vance: https://x.com/LukasMikelionis/status/1904873083246084364
movie scenes: https://x.com/MDurbar/status/1904872441899339963
brain meme: https://x.com/TechMemeKing/status/1904867629644267980
vibe ghibling: https://x.com/EMostaque/status/1904714479906283878
Sam Altman Says More Creative Freedom https://x.com/sama/status/1904598788687487422
Gavin's Knight + Rotisserie Chicken Photo Reddit Post https://www.reddit.com/r/ChatGPT/comments/1jk0p3v/tried_to_push_the_new_image_model_with_an/
Kevin's Aladdin Sane + Katamari Williams Images https://x.com/Attack/status/1904743185760608316
Big Butt Bear Video https://x.com/AIForHumansShow/status/1904687617758945674
Google Gemini 2.5 https://x.com/NoamShazeer/status/1904581813215125787
Largest Score Jump Ever on LMSYS https://x.com/AndrewCurran_/status/1904590242792996959
One Shot Coding Demos From Matt Berman https://x.com/MatthewBerman/status/1904714953095078004
Reve - Brand New Image Model Ranked #1 https://preview.reve.art/
Ideogram 3.0 https://x.com/ideogram_ai/status/1904927717281456188
OpenAI FM + new voice API https://x.com/OpenAIDevs/status/1902773579323674710
New DeepSeek Model is Actually Much Better https://www.reuters.com/technology/artificial-intelligence/chinas-deepseek-releases-ai-model-upgrade-intensifies-rivalry-with-openai-2025-03-25/
Figure 01 “Natural” Walking https://youtu.be/z6KiwXT_yAM?si=RRsmjvs0qpRU0cqX
WPP Makes Robots Into Camera Operators https://x.com/TheHumanoidHub/status/1903173205155815431
H&M is making AI clones of 30 models https://www.inc.com/kit-eaton/clothing-giant-hm-will-use-models-ai-made-digital-twins-consent-included/91166352
Cursor for 3D Modeling https://x.com/_martinsit/status/1904234440198615204
Seeing Eye Robot Dogs https://x.com/iconphas/status/1904259348815352029
SynCity https://x.com/shtedritski/status/1903112129420443712
Gavin's Dial-up Diaries Video https://x.com/AIForHumansShow/status/1904244229783892207
Kevin's OpenAI Real Time Voice Test https://x.com/Attack/status/1904541254257643797

Ryan's Method: Passive Income Podcast
IDEOGRAM 3.0 IS AMAZING FOR PRINT ON DEMAND!

Mar 27, 2025 · 11:12


I'm recreating 5 best-selling Etsy shirts using the new, powerful Ideogram 3.0 AI image generator!

Brave New Bookshelf
34 - Dana Sacco and Bootstrapping Your Publishing Career with AI Tools

Mar 13, 2025 · 42:33


In this episode of Brave New Bookshelf, we sit down with Dana Sacco, a businesswoman-turned-author who has mastered the art of bootstrapping her publishing career using AI tools. Dana shares how she transitioned from being an avid reader to a multi-genre author, all while leveraging affordable and innovative AI solutions like ChatGPT, Claude, and Ideogram to streamline her workflow. Visit our website https://bravenewbookshelf.com to view the full episode notes, links and apps mentioned in the episode, and the full transcript.

Experts Unleashed with Joel Erway
Paid Ads Unleashed: Proven Strategies for Growth | EU 131 with Joe Stolte

Mar 12, 2025 · 29:19


In this episode of Experts Unleashed, I sit down with Joe Stolte from Daily AI to dive into his strategies for running successful paid ad campaigns. Joe shares how he spends around $15,000 a month on ads to drive free trials for his AI-powered email newsletter software, designed for thought leaders and small businesses. We talk about the power of retargeting, optimizing ad creatives, and using AI tools like Ideogram for image generation. Joe also highlights the value of partnering with social media creators to amplify results. Throughout our conversation, Joe emphasizes one key takeaway: having a strong offer and consistently testing are crucial for refining your marketing efforts.  

Ryan's Method: Passive Income Podcast
This Beginner-Friendly AI is Even Easier to Use Now

Mar 7, 2025 · 10:30


One of the most popular design tools for print on demand just integrated Ideogram natively into their design app, allowing me to create 10 designs in 10 minutes!

AI Unraveled: Latest AI News & Trends, Master GPT, Gemini, Generative AI, LLMs, Prompting, GPT Store

OpenAI released GPT-4.5, touting enhanced reasoning and emotional intelligence, while Tencent unveiled a rapid decision-making model aimed at real-time applications. Meta is developing a standalone AI app to compete with other chatbot platforms, and Amazon introduced its first quantum computing chip. Elsewhere, Ideogram is working on faster visual and textual processing, and Canada is scrutinising X's use of personal data for AI training. These developments, alongside anxieties about job displacement, signal a continued acceleration and broadening of AI's impact across various industries and societal domains.

The Post-Christian Podcast
A.I. for Church Leaders with Dr. David Thorne

Jan 14, 2025 · 21:42


In this episode of the Post-Christian Podcast, host Dr. Eric Bryant interviews Dr. David Thorne of AI for Churches, who also leads the AI Ethics Collective. From their website: "David earned his doctorate in Leadership Studies and masters degrees in management and leadership, counseling, and practical theology. Passionate about marketing strategy and helping churches utilize technology to achieve outreach goals and foster community engagement, David has a deep understanding of the hurdles church leaders face in a changing world."

Latent Space: The AI Engineer Podcast — CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Applications for the NYC AI Engineer Summit, focused on Agents at Work, are open!

When we first started Latent Space, in the lightning round we'd always ask guests: “What's your favorite AI product?”. The majority would say Midjourney. The simple UI of prompt → very aesthetic image turned it into a $300M+ ARR bootstrapped business as it rode the first wave of AI image generation.

In open source land, StableDiffusion was congregating around AUTOMATIC1111 as the de-facto web UI. Unlike Midjourney, which offered some flags but was mostly prompt-driven, A1111 let users play with a lot more parameters, supported additional modalities like img2img, and allowed users to load in custom models. If you're interested in some of the SD history, you can look at our episodes with Lexica, Replicate, and Playground.

One of the people involved with that community, comfyanonymous, who was also part of the Stability team in 2023, decided to build an alternative called ComfyUI. It is now one of the fastest growing open source projects in generative images, and the preferred partner for folks like Black Forest Labs's Flux Tools on Day 1. The idea behind it was simple: “Everyone is trying to make easy to use interfaces. Let me try to make a powerful interface that's not easy to use.”

Unlike its predecessors, ComfyUI does not have an input text box. Everything is based around the idea of a node: there's a text input node, a CLIP node, a checkpoint loader node, a KSampler node, a VAE node, etc. While daunting for simple image generation, the tool is amazing for more complex workflows since you can break down every step of the process, and then chain many of them together rather than manually switching between tools. You can also re-start execution halfway instead of from the beginning, which can save a lot of time when using larger models.

To give you an idea of some of the new use cases that this type of UI enables:
* Sketch something → Generate an image with SD from sketch → feed it into SD Video to animate
* Generate an image of an object → Turn into a 3D asset → Feed into interactive experiences
* Input audio → Generate audio-reactive videos

Their Examples page also includes some of the more common use cases like AnimateDiff, etc. They recently launched the Comfy Registry, an online library of different nodes that users can pull from rather than having to build everything from scratch. The project has >60,000 Github stars, and as the community grows, some of the projects that people build have gotten quite complex.

The most interesting thing about Comfy is that it's not a UI, it's a runtime. You can build full applications on top of image models simply by using Comfy. You can expose Comfy workflows as an endpoint and chain them together just like you chain a single node. We're seeing the rise of AI Engineering applied to art.

Major Tom's ComfyUI Resources from the Latent Space Discord

Major shoutouts to Major Tom on the LS Discord, an image generation expert who offered these pointers:
* “best thing about comfy is the fact it supports almost immediately every new thing that comes out - unlike A1111 or forge, which still don't support flux cnet for instance. It will be perfect tool when conflicting nodes will be resolved”
* AP Workflows from Alessandro Perili are a nice example of an all-in-one train-evaluate-generate system built atop Comfy
* ComfyUI YouTubers to learn from:
  * @sebastiankamph
  * @NerdyRodent
  * @OlivioSarikas
  * @sedetweiler
  * @pixaroma
* ComfyUI Nodes to check out:
  * https://github.com/kijai/ComfyUI-IC-Light
  * https://github.com/MrForExample/ComfyUI-3D-Pack
  * https://github.com/PowerHouseMan/ComfyUI-AdvancedLivePortrait
  * https://github.com/pydn/ComfyUI-to-Python-Extension
  * https://github.com/THtianhao/ComfyUI-Portrait-Maker
  * https://github.com/ssitu/ComfyUI_NestedNodeBuilder
  * https://github.com/longgui0318/comfyui-magic-clothing
  * https://github.com/atmaranto/ComfyUI-SaveAsScript
  * https://github.com/ZHO-ZHO-ZHO/ComfyUI-InstantID
  * https://github.com/AIFSH/ComfyUI-FishSpeech
  * https://github.com/coolzilj/ComfyUI-Photopea
  * https://github.com/lks-ai/anynode
* Sarav: https://www.youtube.com/@mickmumpitz/videos (applied stuff)
* Sarav: https://www.youtube.com/@latentvision (technical, but infrequent)
* look for comfyui node for https://github.com/magic-quill/MagicQuill
* “Comfy for Video” resources
  * Kijai (https://github.com/kijai) pushing out support for Mochi, CogVideoX, AnimateDiff, LivePortrait etc
  * Comfyui node support like LTX https://github.com/Lightricks/ComfyUI-LTXVideo , and HunyuanVideo
* FloraFauna AI
* Communities: https://www.reddit.com/r/StableDiffusion/, https://www.reddit.com/r/comfyui/

Full YouTube Episode

As usual, you can find the full video episode on our YouTube (and don't forget to like and subscribe!)

Timestamps
* 00:00:04 Introduction of hosts and anonymous guest
* 00:00:35 Origins of Comfy UI and early Stable Diffusion landscape
* 00:02:58 Comfy's background and development of high-res fix
* 00:05:37 Area conditioning and compositing in image generation
* 00:07:20 Discussion on different AI image models (SD, Flux, etc.)
* 00:11:10 Closed source model APIs and community discussions on SD versions
* 00:14:41 LoRAs and textual inversion in image generation
* 00:18:43 Evaluation methods in the Comfy community
* 00:20:05 CLIP models and text encoders in image generation
* 00:23:05 Prompt weighting and negative prompting
* 00:26:22 Comfy UI's unique features and design choices
* 00:31:00 Memory management in Comfy UI
* 00:33:50 GPU market share and compatibility issues
* 00:35:40 Node design and parameter settings in Comfy UI
* 00:38:44 Custom nodes and community contributions
* 00:41:40 Video generation models and capabilities
* 00:44:47 Comfy UI's development timeline and rise to popularity
* 00:48:13 Current state of Comfy UI team and future plans
* 00:50:11 Discussion on other Comfy startups and potential text generation support

Transcript

Alessio [00:00:04]: Hey everyone, welcome to the Latent Space podcast. This is Alessio, partner and CTO at Decibel Partners, and I'm joined by my co-host Swyx, founder of Smol AI.
swyx [00:00:12]: Hey everyone, we are in the Chroma Studio again, but with our first ever anonymous guest, Comfy Anonymous, welcome.
Comfy [00:00:19]: Hello.
swyx [00:00:21]: I feel like that's your full name, you just go by Comfy, right?
Comfy [00:00:24]: Yeah, well, a lot of people just call me Comfy, even when they know my real name. Hey, Comfy.
Alessio [00:00:32]: Swyx is the same. You know, not a lot of people call you Shawn.
swyx [00:00:35]: Yeah, you have a professional name, right, that people know you by, and then you have a legal name. Yeah, it's fine. How do I phrase this? I think people who are in the know, know that Comfy is like the tool for image generation and now other multimodality stuff. I would say that when I first got started with Stable Diffusion, the star of the show was Automatic1111, right? And I actually looked back at my notes from 2022-ish, like Comfy was already getting started back then, but it was kind of like the up and comer, and your main feature was the flowchart.
Can you just kind of rewind to that moment, that year and like, you know, how you looked at the landscape there and decided to start Comfy?
Comfy [00:01:10]: Yeah, I discovered Stable Diffusion in 2022, in October 2022. And, well, I kind of started playing around with it. Yes, I, and back then I was using Automatic, which was what everyone was using back then. And so I started with that because I had, it was when I started, I had no idea like how diffusion works. I didn't know how diffusion models work, how any of this works, so.
swyx [00:01:36]: Oh, yeah. What was your prior background as an engineer?
Comfy [00:01:39]: Just a software engineer. Yeah. Boring software engineer.
swyx [00:01:44]: But like any, any image stuff, any orchestration, distributed systems, GPUs?
Comfy [00:01:49]: No, I was doing basically nothing interesting. CRUD, web development? Yeah, a lot of web development, just, yeah, some basic, maybe some basic like automation stuff. Okay. Just. Yeah, no, like, no big companies or anything.
swyx [00:02:08]: Yeah, but like already some interest in automations, probably a lot of Python.
Comfy [00:02:12]: Yeah, yeah, of course, Python. But I wasn't actually used to like the node graph interface before I started Comfy UI. It was just, I just thought it was like, oh, like, what's the best way to represent the diffusion process in the user interface? And then like, oh, well. Well, like, naturally, oh, this is the best way I've found. And this was like with the node interface. So how I got started was, yeah, so basically October 2022, just like I hadn't written a line of PyTorch before that. So it's completely new. What happened was I kind of got addicted to generating images.
Alessio [00:02:58]: As we all did. Yeah.
Comfy [00:03:00]: And then I started. I started experimenting with like the high-res fix in Auto, which was, for those that don't know, the high-res fix is just since the diffusion models back then could only generate at low resolution.
So what you would do, you would generate a low-resolution image, then upscale, then refine it again. And that was kind of the hack to generate high-resolution images. I really liked generating, like, higher resolution images. So I was experimenting with that. And so I modified the code a bit. Okay. What happens if I, if I use different samplers on the second pass? I edited the code of Auto. So what happens if I use a different sampler? What happens if I use a different, like a different settings, different number of steps? And because back then the high-res fix was very basic, just, so. Yeah.
swyx [00:04:05]: Now there's a whole library of just, uh, the upsamplers.
Comfy [00:04:08]: I think, I think they added a bunch of, uh, of options to the high-res fix since, uh, since, since then. But before that was just so basic. So I wanted to go further. I wanted to try it. What happens if I use a different model for the second, the second pass? And then, well, then the Auto code base was, wasn't good enough for. Like, it would have been, uh, harder to implement that in the Auto interface than to create my own interface. So that's when I decided to create my own. And you were doing that mostly on your own when you started, or did you already have kind of like a subgroup of people? No, I was, uh, on my own because, because it was just me experimenting with stuff. So yeah, that was it. Then, so I started writing the code January 1, 2023, and then I released the first version on GitHub, January 16th, 2023. That's how things got started.
Alessio [00:05:11]: And what's, what's the name? Comfy UI right away or? Yeah.
Comfy [00:05:14]: Comfy UI. The reason the name, my name is Comfy is people thought my pictures were comfy, so I just, uh, just named it, uh, uh, it's my Comfy UI. So yeah, that's, uh,
swyx [00:05:27]: Is there a particular segment of the community that you targeted as users?
Like more intensive workflow artists, you know, compared to the Automatic crowd or, you know,
Comfy [00:05:37]: This was my way of like experimenting with, uh, with new things, like the high-res fix thing I mentioned, which was like in Comfy, the first thing you could easily do was just chain different models together. And then one of the first things, I think the first times it got a bit of popularity was when I started experimenting with the different, like applying prompts to different areas of the image. Yeah. I called it area conditioning, posted it on Reddit and it got a bunch of upvotes. So I think that's when, like, when people first learned of Comfy UI.
swyx [00:06:17]: Is that mostly like fixing hands?
Comfy [00:06:19]: Uh, no, no, no. That was just, uh, like, let's say, well, it was very, well, it still is kind of difficult to like, let's say you want a mountain, you have an image and then, okay. I'm like, okay. I want the mountain here and I want the, like a, a fox here.
swyx [00:06:37]: Yeah. So compositing the image. Yeah.
Comfy [00:06:40]: My way was very easy. It was just like, oh, when you run the diffusion process, you kind of generate, okay. You do one pass through the diffusion, every step you do one pass. Okay. This place of the image with this prompt, this place of the image with the other prompt. And then the entire image with another prompt, and then just average everything together, every step, and that was, uh, area composition, which I call it. And then, then a month later, there was a paper that came out called MultiDiffusion, which was the same thing, but yeah, that's, uh,
Alessio [00:07:20]: Could you do area composition with different models or because you're averaging out, you kind of need the same model.
Comfy [00:07:26]: Could do it with, but yeah, I hadn't implemented it.
For different models, but, uh, you, you can do it with, uh, with different models if you want, as long as the models share the same latent space, like we, we're supposed to ring a bell every time someone says, yeah, like, for example, you couldn't use like SDXL and SD 1.5, because those have a different latent space, but like, uh, yeah, like SD 1.5 models, different ones. You could, you could do that.
swyx [00:07:59]: There's some models that try to work in pixel space, right?
Comfy [00:08:03]: Yeah. They're very slow. Of course. That's the problem. That that's the, the reason why Stable Diffusion actually became like popular, like, cause was because of the latent space.
swyx [00:08:14]: Small and yeah. Because it used to be latent diffusion models and then they trained it up.
Comfy [00:08:19]: Yeah. Cause pixel diffusion models are just too slow. So. Yeah.
swyx [00:08:25]: Have you ever tried to talk to like, like Stability, the latent diffusion guys, like, you know, Robin Rombach, that, that crew. Yeah.
Comfy [00:08:32]: Well, I used to work at Stability.
swyx [00:08:34]: Oh, I actually didn't know. Yeah.
Comfy [00:08:35]: I used to work at Stability. I got, uh, I got hired, uh, in June, 2023.
swyx [00:08:42]: Ah, that's the part of the story I didn't know about. Okay. Yeah.
Comfy [00:08:46]: So the, the reason I was hired is because they were doing, uh, SDXL at the time and they were basically SDXL. I don't know if you remember it was a base model and then a refiner model. Basically they wanted to experiment, like chaining them together. And then, uh, they saw, oh, right. Oh, this, we can use this to do that. Well, let's hire that guy.
swyx [00:09:10]: But they didn't, they didn't pursue it for like SD3. What do you mean? Like the SDXL approach. Yeah.
Comfy [00:09:16]: The reason for that approach was because basically they had two models and then they wanted to publish both of them. So they, they trained one on lower time steps, which was the refiner model.
And then they, the first one was trained normally. And then during their tests, they realized, oh, like if we string these models together, our like quality increases. So let's publish that. It worked. Yeah. But like right now, I don't think many people actually use the refiner anymore, even though it is actually a full diffusion model. Like you can use it on its own. And it's going to generate images. I don't think anyone, people have mostly forgotten about it. But, uh.
Alessio [00:10:05]: Can we talk about models a little bit? So Stable Diffusion, obviously is the most known. I know Flux has gotten a lot of traction. Are there any underrated models that people should use more or what's the state of the union?
Comfy [00:10:17]: Well, the, the latest, uh, state of the art, at least, yeah, for images there's, uh, yeah, there's Flux. There's also SD3.5. SD3.5 is two models. There's a, there's a small one, 2.5B and there's the bigger one, 8B. So it's, it's smaller than Flux. So, and it's more, uh, creative in a way, but Flux, yeah, Flux is the best. People should give SD3.5 a try cause it's, uh, it's different. I won't say it's better. Well, it's better for some like specific use cases. Right. If you want some to make something more like creative, maybe SD3.5. If you want to make something more consistent, then Flux is probably better.
swyx [00:11:06]: Do you ever consider supporting the closed source model APIs?
Comfy [00:11:10]: Uh, well, they, we do support them as custom nodes. We actually have some, uh, official custom nodes from, uh, different. Ideogram.
swyx [00:11:20]: Yeah. I guess DALL-E would have one. Yeah.
Comfy [00:11:23]: That's, uh, it's just not, I'm not the person that handles that. Sure.
swyx [00:11:28]: Sure. Quick question on, on SD. There's a lot of community discussion about the transition from SD1.5 to SD2 and then SD2 to SD3. People still like, you know, very loyal to the previous generations of SDs?
Comfy [00:11:41]: Uh, yeah.
SD1.5 then still has a lot of, a lot of users.
swyx [00:11:46]: The last based model.
Comfy [00:11:49]: Yeah. Then SD2 was mostly ignored. It wasn't, uh, it wasn't a big enough improvement over the previous one. Okay.
swyx [00:11:58]: So SD1.5, SD3, Flux and whatever else. SDXL. SDXL.
Comfy [00:12:03]: That's the main one. Stable Cascade. Stable Cascade. That was a good model. But, uh, that's, uh, the problem with that one is, uh, it got, uh, like SD3 was announced one week after. Yeah.
swyx [00:12:16]: It was like a weird release. Uh, what was it like inside of Stability actually? I mean, statute of limitations. Yeah. The statute of limitations expired. You know, management has moved. So it's easier to talk about now. Yeah.
Comfy [00:12:27]: And inside Stability, actually that model was ready, uh, like three months before, but it got, uh, stuck in, uh, red teaming. So basically the product, if that model had released or was supposed to be released by the authors, then it would probably have gotten very popular since it's a, it's a step up from SDXL. But it got all of its momentum stolen. It got stolen by the SD3 announcement. So people kind of didn't develop anything on top of it, even though it's, uh, yeah. It was a good model, at least, uh, completely mostly ignored for some reason. Like
swyx [00:13:07]: I think the naming as well matters. It seemed like a branch off of the main, main tree of development. Yeah.
Comfy [00:13:15]: Well, it was different researchers that did it. Yeah. Yeah. Very like, uh, good model. Like it's the Würstchen authors. I don't know if I'm pronouncing it correctly. Yeah. Yeah. Yeah.
swyx [00:13:28]: I actually met them in Vienna. Yeah.
Comfy [00:13:30]: They worked at Stability for a bit and they left right after the Cascade release.
swyx [00:13:35]: This is Dustin, right? No. Uh, Dustin's SD3. Yeah.
Comfy [00:13:38]: Dustin is a SD3 SDXL. That's, uh, Pablo and Dome. I think I'm pronouncing his name correctly. Yeah. Yeah. Yeah. Yeah.
That's very good.
swyx [00:13:51]: It seems like the community is very, they move very quickly. Yeah. Like when there's a new model out, they just drop whatever the current one is. And they just all move wholesale over. Like they don't really stay to explore the full capabilities. Like if, if the Stable Cascade was that good, they would have A/B tested a bit more. Instead they're like, okay, SD3 is out. Let's go. You know?
Comfy [00:14:11]: Well, I find the opposite actually. The community doesn't like, they only jump on a new model when there's a significant improvement. Like if there's a, only like a incremental improvement, which is what, uh, most of these models are going to have, especially if you, cause, uh, stay the same parameter count. Yeah. Like you're not going to get a massive improvement, uh, into like, unless there's something big that, that changes. So, uh. Yeah.
swyx [00:14:41]: And how are they evaluating these improvements? Like, um, because there's, it's a whole chain of, you know, Comfy workflows. Yeah. How does, how does one part of the chain actually affect the whole process?
Comfy [00:14:52]: Are you talking on the model side specific?
swyx [00:14:54]: Model specific, right? But like once you have your whole workflow based on a model, it's very hard to move.
Comfy [00:15:01]: Uh, not, well, not really. Well, it depends on your, uh, depends on their specific kind of the workflow. Yeah.
swyx [00:15:09]: So I do a lot of like text and image. Yeah.
Comfy [00:15:12]: When you do change, like most workflows are kind of going to be complete. Yeah. It's just like, you might have to completely change your prompt completely change. Okay.
swyx [00:15:24]: Well, I mean, then maybe the question is really about evals. Like what does the Comfy community do for evals? Just, you know,
Comfy [00:15:31]: Well, that they don't really do that. It's more like, oh, I think this image is nice.
So that's, uh,
swyx [00:15:38]: They just subscribe to Fofr AI and just see like, you know, what Fofr is doing. Yeah.
Comfy [00:15:43]: Well, they just, they just generate like it. Like, I don't see anyone really doing it. Like, uh, at least on the Comfy side, Comfy users, they, it's more like, oh, generate images and see, oh, this one's nice. It's like, yeah, it's not, uh, like the, the more, uh, like, uh, scientific, uh, like, uh, like checking that's more on specifically on like model side. If, uh, yeah, but there is a lot of, uh, vibes also, cause it is a like, uh, artistic, uh, you can create a very good model that doesn't generate nice images. Cause most images on the internet are ugly. So if you, if that's like, if you just, oh, I have the best model at 10th giant, it's super smart. I created on all the, like I've trained on just all the images on the internet. The images are not going to look good. So yeah.
Alessio [00:16:42]: Yeah.
Comfy [00:16:43]: They're going to be very consistent. But yeah. People like, it's not going to be like the, the look that people are going to be expecting from, uh, from a model. So. Yeah.
swyx [00:16:54]: Can we talk about LoRAs? Cause we thought we talked about models then like the next step is probably LoRAs. Before, I actually, I'm kind of curious how LoRAs entered the tool set of the image community because the LoRA paper was 2021. And then like, there was like other methods like textual inversion that was popular at the early SD stage. Yeah.
Comfy [00:17:13]: I can't even explain the difference between that. Yeah. Textual inversions. That's basically what you're doing is you're, you're training a, cause well, yeah. Stable Diffusion. You have the diffusion model, you have text encoder. So basically what you're doing is training a vector that you're going to pass to the text encoder. It's basically you're training a new word. Yeah.
swyx [00:17:37]: It's a little bit like representation engineering now.
Yeah.
Comfy [00:17:40]: Yeah. Basically. Yeah. You're just, so yeah, if you know how like the text encoder works, basically you have, you take your, your words of your prompt, you convert those into tokens with the tokenizer and those are converted into vectors. Basically. Yeah. Each token represents a different vector. So each word represents a vector. And those, depending on your words, that's the list of vectors that get passed to the text encoder, which is just. Yeah. Yeah. Um, just a stack of, of attention. Like basically it's a very close to LLM architecture. Yeah. Yeah. So basically what you're doing is just training a new vector. We're saying, well, I have all these images and I want to know which word does that represent? And it's going to get like, you train this vector and then, and then when you use this vector, it hopefully generates like something similar to your images. Yeah.
swyx [00:18:43]: I would say it's like surprisingly sample efficient in picking up the concept that you're trying to train it on. Yeah.
Comfy [00:18:48]: Well, people have kind of stopped doing that even though back as like when I was at Stability, we, we actually did train internally some like textual inversions on like T5-XXL, actually worked pretty well. But for some reason, yeah, people don't use them. And also they might also work like, like, yeah, this is something and probably have to test, but maybe if you train a textual inversion, like on T5-XXL, it might also work with all the other models that use T5-XXL because same thing with like, like the textual inversions that, that were trained for SD 1.5, they also kind of work on SDXL because SDXL has the, has two text encoders. And one of them is the same as the, as the SD 1.5 CLIP-L. So those, they actually would, they don't work as strongly because they're only applied to one of the text encoders. But, and the same thing for SD3. SD3 has three text encoders. So it works.
It's still, you can still use your SD 1.5 textual inversion on SD3, but it's just a lot weaker because now there's three text encoders. So it gets even more diluted. Yeah.
swyx [00:20:05]: Do people experiment a lot on, just on the CLIP side, there's like SigLIP, there's BLIP, like do people experiment a lot on those?
Comfy [00:20:12]: You can't really replace. Yeah.
swyx [00:20:14]: Because they're trained together, right? Yeah.
Comfy [00:20:15]: They're trained together. So you can't like, well, what I've seen people experimenting with is a long CLIP. So basically someone fine tuned the CLIP model to accept longer prompts.
swyx [00:20:27]: Oh, it's kind of like long context fine tuning. Yeah.
Comfy [00:20:31]: So, so like it's, it's actually supported in core Comfy.
swyx [00:20:35]: How long is long?
Comfy [00:20:36]: Regular CLIP is 77 tokens. Yeah. Long CLIP is 256. Okay. So, but the hack that like you've, if you use Stable Diffusion 1.5, you've probably noticed, oh, it still works if I, if I use long prompts, prompts longer than 77 words. Well, that's because the hack is to just, well, you split, you split it up in chunks of 77, your whole big prompt. Let's say you, you give it like the massive text, like the Bible or something, and it would split it up in chunks of 77 and then just pass each one through the CLIP and then just concat everything together at the end. It's not ideal, but it actually works.
swyx [00:21:26]: Like the positioning of the words really, really matters then, right? Like this is why order matters in prompts. Yeah.
Comfy [00:21:33]: Yeah. Like it, it works, but it's, it's not ideal, but it's what people expect. Like if, if someone gives a huge prompt, they expect at least some of the concepts at the end to be like present in the image. But usually when they give long prompts, they, they don't, they like, they don't expect like detail, I think.
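The long-prompt hack described above (split into chunks of 77, encode each chunk, concatenate the results) can be sketched roughly as follows. This is a minimal illustration rather than ComfyUI's actual code: `toy_encoder` is a hypothetical stand-in for the CLIP text encoder, and the sketch ignores the start/end and padding tokens a real tokenizer adds to each chunk.

```python
from typing import Callable, List, Sequence

CHUNK_SIZE = 77  # CLIP context length in tokens, as stated in the episode

Embedding = List[float]


def split_into_chunks(tokens: Sequence[int], size: int = CHUNK_SIZE) -> List[List[int]]:
    """Split a long token list into consecutive chunks of at most `size` tokens."""
    return [list(tokens[i:i + size]) for i in range(0, len(tokens), size)]


def encode_long_prompt(
    tokens: Sequence[int],
    encode_chunk: Callable[[List[int]], List[Embedding]],
    size: int = CHUNK_SIZE,
) -> List[Embedding]:
    """Encode every chunk on its own, then concatenate the per-token outputs."""
    outputs: List[Embedding] = []
    for chunk in split_into_chunks(tokens, size):
        outputs.extend(encode_chunk(chunk))
    return outputs


def toy_encoder(chunk: List[int]) -> List[Embedding]:
    # Hypothetical encoder: one 1-dimensional "embedding" per token.
    return [[float(token)] for token in chunk]


tokens = list(range(200))  # pretend this is a tokenized 200-token prompt
chunks = split_into_chunks(tokens)
encoded = encode_long_prompt(tokens, toy_encoder)
print([len(c) for c in chunks])  # [77, 77, 46]
print(len(encoded))              # 200
```

Because each chunk is encoded independently, tokens in one chunk never attend to tokens in another, which is one reason word position in a long prompt matters.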
So that's why it works very well.
swyx [00:21:58]: And while we're on this topic, prompt weighting, negative prompting, all, all sort of similar part of this layer of the stack. Yeah.
Comfy [00:22:05]: The, the hack for that, which works on CLIP, like it, basically it's just for SD 1.5, well, for SD 1.5, the prompt weighting works well because CLIP-L is a, is not a very deep model. So you have a very high correlation between, you have the input token, the index of the input token vector. And the output token, they're very, the concepts are very close, closely linked. So that means if you interpolate the vector from what, well, the, the way Comfy UI does it is it has, okay, you have the vector, you have an empty prompt. So you have a, a chunk, like a CLIP output for the empty prompt, and then you have the one for your prompt. And then it interpolates from that, depending on your prompt. Yeah.
Comfy [00:23:07]: So that's how it, how it does prompt weighting. But this stops working the deeper your text encoder is. So on T5-XXL, it doesn't work at all. So. Wow.
swyx [00:23:20]: Is that a problem for people? I mean, cause I'm used to just move, moving up numbers. Probably not. Yeah.
Comfy [00:23:25]: Well.
swyx [00:23:26]: So you just use words to describe, right? Cause it's a bigger language model. Yeah.
Comfy [00:23:30]: Yeah. So. Yeah. So honestly it might be good, but I haven't seen many complaints on Flux that it's not working. So, cause I guess people can sort of get around it with, with language. So. Yeah.
swyx [00:23:46]: Yeah. And then coming back to LoRAs, now the, the popular way to, to customize models is LoRAs. And I saw you also support LoCon and LoHa, which I've never heard of before.
Comfy [00:23:56]: There's a bunch of, cause what, what the LoRA is essentially is. Instead of like, okay, you have your, your model and then you want to fine tune it.
So instead of like, what you could do is you could fine tune the entire thing, but that's a bit heavy. So to speed things up and make things less heavy, what you can do is just fine tune some smaller weights, like basically two, two matrices that when you multiply like two low rank matrices and when you multiply them together, gives a, represents a difference between trained weights and your base weights. So by training those two smaller matrices, that's a lot less heavy. Yeah.
Alessio [00:24:45]: And they're portable. So you're going to share them. Yeah. It's like easier. And also smaller.
Comfy [00:24:49]: Yeah. That's the, how LoRAs work. So basically, so when, when inferencing you, you can inference with them pretty efficiently, like how Comfy UI does it. It just, when you use a LoRA, it just applies it straight on the weights so that there's only a small delay at the base, like before the sampling to when it applies the weights and then it just same speed as, as before. So for, for inference, it's, it's not that bad, but, and then you have, so basically all the LoRA types like LoHa, LoCon, everything, that's just different ways of representing that like. Basically, you can call it kind of like compression, even though it's not really compression, it's just different ways of representing, like just, okay, I want to train the difference on the weights. What's the best way to represent that difference? There's the basic LoRA, which is just, oh, let's multiply these two matrices together. And then there's all the other ones, which are all different algorithms. So. Yeah.
Alessio [00:25:57]: So let's talk about LoRA. Let's talk about what Comfy UI actually is. I think most people have heard of it. Some people might've seen screenshots. I think fewer people have built very complex workflows. So when you started, Automatic was like the super simple way. What were some of the choices that you made?
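The basic LoRA idea from the discussion above (two small matrices whose product represents the difference between the trained and base weights, applied straight onto the weights before sampling) can be sketched as below. This is a toy illustration under stated assumptions, not ComfyUI's implementation: the names `up`, `down`, and the `strength` multiplier are hypothetical, and plain Python lists stand in for tensors.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product of an (n x r) and an (r x m) matrix."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def merge_lora(base: Matrix, up: Matrix, down: Matrix, strength: float = 1.0) -> Matrix:
    """Return base + strength * (up @ down), i.e. the weights with the LoRA merged in."""
    delta = matmul(up, down)  # low-rank weight difference
    return [
        [w + strength * d for w, d in zip(base_row, delta_row)]
        for base_row, delta_row in zip(base, delta)
    ]


# A 3x3 base weight (9 values) patched by a rank-1 LoRA (3 + 3 = 6 values).
base = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
up = [[1.0], [2.0], [3.0]]   # 3 x 1
down = [[0.5, 0.0, -0.5]]    # 1 x 3
merged = merge_lora(base, up, down)
print(merged)  # [[1.5, 0.0, -0.5], [1.0, 1.0, -1.0], [1.5, 0.0, -0.5]]
```

Once merged, the patched matrix has the same shape as the base one, so sampling runs at the same speed as before; the only extra cost is the one-time merge. The saving in stored values grows with matrix size: a rank-r patch for an n x m weight needs r * (n + m) values instead of n * m.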
So the node workflow, is there anything else that stands out as like, this was like a unique take on how to do image generation workflows?
Comfy [00:26:22]: Well, I feel like, yeah, back then everyone was trying to make like easy to use interface. Yeah. So I'm like, well, everyone's trying to make an easy to use interface.
swyx [00:26:32]: Let's make a hard to use interface.
Comfy [00:26:37]: Like, so like, I like, I don't need to do that, everyone else doing it. So let me try something like, let me try to make a powerful interface that's not easy to use. So.
swyx [00:26:52]: So like, yeah, there's a sort of node execution engine. Yeah. Yeah. And it actually lists, it has this really good list of features of things you prioritize, right? Like let me see, like sort of re-executing from, from any parts of the workflow that was changed, asynchronous queue system, smart memory management, like all this seems like a lot of engineering that. Yeah.
Comfy [00:27:12]: There's a lot of engineering in the back end to make things, cause I was always focused on making things work locally very well. Cause that's cause I was using it locally. So everything. So there's a lot of, a lot of thought and work in getting everything to run as well as possible. So yeah. Comfy UI is actually more of a back end, at least, well, now the front end's getting a lot more development, but, but before, before it was, I was pretty much only focused on the backend. Yeah.
swyx [00:27:50]: So v0.1 was only August this year. Yeah.
Comfy [00:27:54]: With the new front end. Before there was no versioning. So yeah. Yeah. Yeah.
swyx [00:27:57]: And so what was the big rewrite for the 0.1 and then the 1.0?
Comfy [00:28:02]: Well, that's more on the front end side. That's cause before that it was just like the UI, what, cause when I first wrote it, I just, I said, okay, how can I make, like, I can do web development, but I don't like doing it. Like what's the easiest way I can slap a node interface on this.
And then I found this library. Yeah. A JavaScript library.
swyx [00:28:26]: Litegraph?
Comfy [00:28:27]: Litegraph.
swyx [00:28:28]: Usually people will go for something like React Flow for a flow builder. Yeah.
Comfy [00:28:31]: But that seemed too complicated. I didn't really want to spend time developing the front end. So I'm like, well, oh, litegraph. This has the whole node interface. So, okay, let me just plug that into my back end.
swyx [00:28:49]: I feel like if Streamlit or Gradio offered something, you would have used Streamlit or Gradio, because it's Python. Yeah.
Comfy [00:28:54]: Yeah.
Comfy [00:29:14]: ... logic and your back end logic and just sticks them together.
swyx [00:29:20]: It's supposed to be easy for you guys. If you're a Python main, you know, I'm a JS main, right? Okay. If you're a Python main, it's supposed to be easy.
Comfy [00:29:26]: Yeah, it's easy, but it makes your whole software a huge mess.
swyx [00:29:30]: I see, I see. So you're mixing concerns instead of separating concerns?
Comfy [00:29:34]: Well, it's because... Like front end and back end. Front end and back end should be well separated with a defined API. That's how you're supposed to do it. Smart people disagree. It just sticks everything together. It makes it easy to make a huge mess. And also, there's a lot of issues with Gradio. It's very good if all you want to do is slap a quick interface on your ML project to show it off. That's what it's made for. There's no problem using it like, oh, I have my code, I just want a quick interface on it. That's perfect. Use Gradio. But if you want to make real software that will last a long time and will be easy to maintain, then I would avoid it. Yeah.
swyx [00:30:32]: So your criticism is Streamlit and Gradio are the same. 
I mean, those are the same criticisms.
Comfy [00:30:37]: Yeah, Streamlit I haven't used as much. I just looked at it a bit.
swyx [00:30:43]: Similar philosophy.
Comfy [00:30:44]: Yeah, it's similar. It just seems to me like, okay, for quick AI demos, it's perfect.
swyx [00:30:51]: Yeah. Going back to the core tech, like asynchronous queues, re-execution, smart memory management, anything that you were very proud of or that was very hard to figure out?
Comfy [00:31:00]: Yeah. The thing that's the biggest pain in the ass is probably the memory management. Yeah.
swyx [00:31:05]: Were you just paging models in and out or? Yeah.
Comfy [00:31:08]: Before, it was just: load the model, then completely unload it. That works well when your models are small, but if your models are big, let's say someone has a 4090 and the model size is 10 gigabytes, it can take a few seconds to load and unload, load and unload, so you want to keep things in GPU memory as much as possible. What ComfyUI does right now is it tries to estimate: okay, you're going to sample this model, it's probably going to take this amount of memory, so let's free that amount of memory from the models already loaded on the GPU, and then just execute it. But there's a fine line, because you want to remove the fewest models that are already loaded. One other problem is the NVIDIA driver on Windows. By default (there's an option to disable that feature), if you overflow your GPU memory, the driver automatically starts paging to RAM. But the problem with that is it makes everything extremely slow. 
So when you see people complaining, oh, this model works, but oh, s**t, it starts slowing down a lot, that's probably what's happening. So basically you have to try to use as much memory as possible, but not too much, or else things start slowing down or people run out of memory, and just try to find that line where the driver on Windows starts paging and stuff. Yeah. And the problem with PyTorch is it's high level, so you don't have that much fine-grained control over specific memory stuff. You kind of have to leave the memory freeing to Python and PyTorch, which can be annoying sometimes.
swyx [00:33:32]: So, you know, I think one thing is, as a maintainer of this project, you're designing for a very wide surface area of compute. You even support CPUs.
Comfy [00:33:42]: Yeah, well, that's just PyTorch. PyTorch supports CPUs, so that's not hard to support.
swyx [00:33:50]: First of all, is there a market share estimate? Is it, like, 70% NVIDIA, 30% AMD, and then miscellaneous on Apple Silicon or whatever?
Comfy [00:33:59]: For Comfy? Yeah. Yeah, I don't know the market share.
swyx [00:34:03]: Can you guess?
Comfy [00:34:04]: I think it's mostly NVIDIA. Right. Because the problem is, AMD works horribly on Windows. On Linux, it works fine. It's slower than the price-equivalent NVIDIA GPU, but it works: you can use it, you generate images, everything works. On Windows, you might have a hard time, so that's the problem, and I think most people who bought AMD probably use Windows. They probably aren't going to switch to Linux, so... Yeah. 
So, until AMD actually ports their ROCm to Windows properly, and then there's actually PyTorch (I think they're in the process of doing that), until they get a good PyTorch ROCm build that works on Windows, they're going to have a hard time. Yeah.
Alessio [00:35:06]: We got to get George on it. Yeah. Well, he's trying to get Lisa Su to do it, but... Let's talk a bit about the node design. So, unlike all the other text-to-image tools, you have a very deep... so you have a separate node for CLIP encode, you have a separate node for the KSampler, you have all these nodes. Going back to the making it easy versus making it hard, how much do people actually play with all the settings, you know? How do you guide people to, like, hey, this is actually going to be very impactful versus this is maybe less impactful, but we still want to expose it to you?
Comfy [00:35:40]: Well, I try to expose everything. But for things like the samplers, for example, there's, yeah, four different sampler nodes, which go from easiest to most advanced. So if you use the easy node, the regular sampler node, you have just the basic settings. But if you use the custom advanced node, you'll see you have different nodes.
Alessio [00:36:19]: I'm looking it up now. Yeah. What are the most impactful parameters that you use? So, you know, you can have more, but which ones really make a difference?
Comfy [00:36:30]: Yeah, they all do. They all have their own... For example, steps. Usually you want steps to be as low as possible. 
But if you're optimizing your workflow, you lower the steps until the images start deteriorating too much. Because that's the number of steps you're running the diffusion process for. So if you want things to be faster, lower is better. But CFG, you can kind of see that as the contrast of the image. If your image looks too burnt, then you can lower the CFG. CFG is how strongly the negative versus positive prompt is applied. Because when you sample a diffusion model with a negative prompt, it's basically just positive prediction minus negative prediction.
swyx [00:37:32]: Contrastive loss. Yeah.
Comfy [00:37:34]: It's positive minus negative, and the CFG is the multiplier. Yeah.
Alessio [00:37:41]: What are good resources to understand what the parameters do? I think most people start with Automatic, and then they move over, and it's like: steps, CFG, sampler name, scheduler, denoise. Read it.
Comfy [00:37:53]: Honestly, it's more something you should try out yourself. You don't necessarily need to know how it works to know what it does. Because even if you know, like, CFG, oh, it's positive minus negative prompt... The only thing you need to know about CFG is that if it's 1.0, the negative prompt isn't applied. It also means sampling is two times faster. But other than that, you should really just see what it does to the images yourself, and you'll probably get a more intuitive understanding of what these things do.
Alessio [00:38:34]: Any other nodes or things you want to shout out? I know AnimateDiff and IP-Adapter. Those are some of the most popular ones. Yeah. What else comes to mind?
Comfy [00:38:44]: Not nodes, but what I like is when people make things that use ComfyUI as their backend. 
Like, there's a plugin for Krita that uses ComfyUI as its backend. So you can use all the models that work in Comfy in Krita. I think I've tried it once. But I know a lot of people use it, and it's probably really nice.
Alessio [00:39:15]: What's the craziest node that people have built, like, the most complicated?
Comfy [00:39:21]: Craziest node? Yeah, I know some people have made video games in Comfy, stuff like that. I remember, I think it was last year, someone made a Wolfenstein 3D in Comfy. Of course. And then one of the inputs was, oh, you can generate a texture, and then it changes the texture in the game. So you can plug it into the workflow. If you look, there's a lot of crazy things people do. Yeah.
Alessio [00:39:59]: And now there's a node registry that people can use to download nodes. Yeah.
Comfy [00:40:04]: Well, there's always been the ComfyUI Manager. But we're trying to make this more, I don't know, official, with the node registry. Because before the node registry, how did your custom node get into ComfyUI Manager? The guy running it searched GitHub every day for new custom nodes and added them manually to his custom node manager. So we're trying to make it less effort for him, basically. Yeah.
Alessio [00:40:40]: Yeah. But I was looking, I mean, there's a YouTube download node. This is almost, you know, a data pipeline more than an image generation thing at this point. You can get data in, you can apply filters to it, you can generate data out.
Comfy [00:40:54]: Yeah. You can do a lot of different things. I think what I did is I made it easy to make custom nodes. So I think that helped a lot. 
I think that helped a lot for the ecosystem, because it is very easy to just make a node. So, yeah, a bit too easy sometimes. Then we have the issue where there's a lot of custom node packs which share similar nodes. But that's something we're trying to solve by maybe bringing some of the functionality into the core. Yeah.
Alessio [00:41:36]: And then there's video. People can do video generation. Yeah.
Comfy [00:41:40]: Video, well, the first video model was Stable Video Diffusion, which was exactly last year, I think. Like, one year ago. But that wasn't a true video model. So it was...
swyx [00:41:55]: It was, like, moving images? Yeah.
Comfy [00:41:57]: I generated video. What I mean by that is it's still 2D latents. It's basically what I'm trying to do. So what they did is they took SD2, and then they added some temporal attention to it, and then trained it on videos and all. So it's kind of like AnimateDiff, same idea, basically. Why I say it's not a true video model is that you still have the 2D latents. A true video model, like Mochi, for example, would have 3D latents. Mm-hmm.
Alessio [00:42:32]: Which means you can move through the space, basically. It's the difference. You're not just kind of reorienting. Yeah.
Comfy [00:42:39]: And it's also because you have a temporal VAE. Mm-hmm. Mochi has a temporal VAE that compresses in the temporal direction also. So that's something you don't have with AnimateDiff and Stable Video Diffusion. They only compress spatially, not temporally. Mm-hmm. Right. So that's why I call those true video models. There's actually a few of them, but the one I've implemented in Comfy is Mochi, because that seems to be the best one so far. Yeah.
swyx [00:43:15]: We had AJ come and speak at the stable diffusion meetup. 
The other open one I think I've seen is CogVideo. Yeah.
Comfy [00:43:21]: CogVideo. Yeah. That one also seems decent, but, yeah. Chinese, so we don't use it. No, it's fine. It's just, yeah, I could. Yeah. It's just that it's not the only one. There's also a few others, which I...
swyx [00:43:36]: The rest are closed source, right? Like Kling. Yeah.
Comfy [00:43:39]: Closed source, there's a bunch of them. But I mean open. I've seen a few of them. I can't remember their names, but there's CogVideo, the big one. Then there's also a few that released at the same time. There's one that released at the same time as SD 3.5, same day, which is why I don't remember the name.
swyx [00:44:02]: We should have a release schedule so we don't conflict on each of these things. Yeah.
Comfy [00:44:06]: I think SD 3.5 and Mochi released on the same day. So everything else was completely drowned out. For some reason, lots of people picked that day to release their stuff.
Comfy [00:44:21]: Yeah. Which is, well, a shame for those. And I think OmniGen also released the same day, which also seems interesting. Yeah.
Alessio [00:44:30]: What's Comfy? So you are Comfy. And then there's comfy.org. I know we do a lot of things with, like, Nous Research, and those guys also have kind of a more open source thing going on. How do you work? You mentioned you mostly work on the core piece of it. And then what...
Comfy [00:44:47]: Maybe I should, yeah, I feel like I only explained part of the story. Right. Maybe I should explain the rest. So, basically, January 16, 2023, that's when ComfyUI was first released to the public. Then I did a Reddit post about the area composition thing somewhere around, I don't remember exactly, maybe end of January, beginning of February. 
And then someone, a YouTuber, Olivio, made a video about ComfyUI in March 2023. I think that's when there was a real burst of attention. By that time, I was continuing to develop it and people were starting to use it more, which unfortunately meant that, while I had first written it to do experiments, my time to do experiments started going down, because people were actually starting to use it. I said, well, yeah, time to add all these features and stuff. And then I got hired by Stability in June 2023. Basically, they hired me because they wanted SDXL. So I got SDXL working very well with the UI, because they were experimenting with Comfy in-house. Actually, how the SDXL release worked is that, for some reason, they released the code first, but they didn't release the model checkpoint. So they released the code. And then, well, since the research code was released, I released the code in ComfyUI too. And the checkpoints were basically early access. People had to sign up, and they mostly allowed people with edu emails. If you had an edu email, they gave you access, basically, to SDXL 0.9. And, well, that leaked. Right. Of course, because of course it's going to leak if you do that. Well, the only way people could easily use it was with Comfy. So, yeah, people started using it. And then I fixed a few of the issues people had. So then the big 1.0 release happened. And, well, ComfyUI was the only way a lot of people could actually run it on their computers. Because Automatic was so inefficient and bad that for most people it just wouldn't work. Because he did a quick implementation. So people were forced to use ComfyUI, and that's how it became popular: because people had no choice.
swyx [00:47:55]: The growth hack.
Comfy [00:47:56]: Yeah.
swyx [00:47:56]: Yeah.
Comfy [00:47:57]: Like everywhere, people who didn't have a 4090, who just had regular GPUs, didn't have a choice.
Alessio [00:48:05]: So yeah, I got a 4070. So think of me. And so today, is there a core Comfy team or?
Comfy [00:48:13]: Yeah, well, right now, we are hiring. Okay. Actually, right now the core itself, it's me. But the reason is that the focus has been mostly on the front end, because that's the thing that's been neglected for a long time. So most of the focus right now is on the front end, but we will soon get more people to help me with the actual back end stuff. Once we have our V1 release, which will be the packaged ComfyUI with the nice interface, easy to install on Windows and hopefully Mac... Once we have that, we're going to have lots of stuff to do on the back end side and also the front end side.
Alessio [00:49:14]: What's the release? I'm on the waitlist. What's the timing?
Comfy [00:49:18]: Soon. Yeah, I don't want to promise a release date. We do have a release date we're targeting, but I'm not sure if it's public. And we're still going to continue doing the open source, making ComfyUI the best way to run Stable Diffusion models. At least on the open source side, it's going to be the best way to run models locally. But we will have a few things to make money from it, like cloud inference or that type of thing. And maybe some things for enterprises.
swyx [00:50:08]: I mean, a few questions on that. 
How do you feel about the other Comfy startups?
Comfy [00:50:11]: I mean, I think it's great. They're using your name. Yeah, well, it's better they use Comfy than something else. Yeah, that's true. It's fine. We want people to use Comfy. Like I said, it's better that people use Comfy than something else. So as long as they use Comfy, I think it helps the ecosystem. Because even if they don't contribute directly, the fact that they are using Comfy means that people are more likely to join the ecosystem. So, yeah.
swyx [00:50:57]: And then would you ever do text?
Comfy [00:50:59]: Yeah, well, you can already do text with some custom nodes. It's something I've wanted to eventually add to core, but it's not a very high priority. A lot of people use text for prompt enhancement and other things like that. It's just that my focus has always been on diffusion models. Yeah, unless some text diffusion model comes out.
swyx [00:51:30]: Yeah, David Holz is investing a lot in text diffusion.
Comfy [00:51:34]: Yeah, well, if a good one comes out, then we'll probably implement it, since it fits with the whole...
swyx [00:51:39]: Yeah, I mean, I imagine it's going to be closed source at Midjourney. Yeah.
Comfy [00:51:43]: Well, if an open one comes out, then I'll probably implement it.
Alessio [00:51:54]: Cool, Comfy. Thanks so much for coming on. This was fun. Bye.
Get full access to Latent Space at www.latent.space/subscribe
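A minimal sketch of the LoRA idea Comfy describes in the Latent Space transcript above: two small low-rank matrices whose product stands in for the difference between the trained weights and the base weights, added straight onto the weights before sampling. This is an illustration only, not ComfyUI's code. The names `matmul`, `apply_lora`, `up`, `down`, and `strength` are assumptions made up for this example, and it uses plain Python lists rather than PyTorch tensors.

```python
def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def apply_lora(base, up, down, strength=1.0):
    """Return base + strength * (up @ down) without modifying base."""
    delta = matmul(up, down)  # full-size difference rebuilt from the two small matrices
    return [
        [w + strength * d for w, d in zip(base_row, delta_row)]
        for base_row, delta_row in zip(base, delta)
    ]


# Toy 4x4 "layer" (identity) and a rank-1 LoRA pair: up is 4x1, down is 1x4.
# All values here are invented for illustration.
base = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
up = [[1.0], [2.0], [0.0], [0.0]]
down = [[0.5, 0.0, 0.0, 1.0]]

patched = apply_lora(base, up, down, strength=2.0)

# Compare how many numbers each representation has to store.
full_params = sum(len(row) for row in base)
lora_params = sum(len(row) for row in up) + sum(len(row) for row in down)
```

In this toy case the full layer holds 16 weights while the rank-1 pair holds 8 numbers; the gap widens quickly as layers get larger, which is consistent with the point in the conversation that LoRAs are smaller and easier to share.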

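The CFG description in the Latent Space transcript above (positive prediction minus negative prediction, with CFG as the multiplier) can be sketched as follows. This assumes the usual classifier-free guidance form, negative + cfg * (positive - negative), which reduces to the positive prediction alone when CFG is 1.0. `toy_model`, `cfg_combine`, and `guided_prediction` are hypothetical names for illustration, not ComfyUI functions.

```python
def cfg_combine(positive, negative, cfg):
    """Blend two predictions: negative + cfg * (positive - negative)."""
    return [n + cfg * (p - n) for p, n in zip(positive, negative)]


def guided_prediction(model, latent, positive_prompt, negative_prompt, cfg):
    """Run one guided step, skipping the negative pass when cfg is exactly 1.0."""
    positive = model(latent, positive_prompt)
    if cfg == 1.0:
        # The negative term cancels out, so only one model call is needed.
        return positive
    negative = model(latent, negative_prompt)
    return cfg_combine(positive, negative, cfg)


calls = []  # records every model call so the cost difference is visible


def toy_model(latent, prompt):
    """Stand-in for a diffusion model: shifts the latent by 1.0 for the positive prompt."""
    calls.append(prompt)
    offset = 1.0 if prompt == "positive" else 0.0
    return [x + offset for x in latent]


latent = [0.0, 1.0]

calls.clear()
at_one = guided_prediction(toy_model, latent, "positive", "negative", cfg=1.0)
calls_at_one = len(calls)

calls.clear()
at_two = guided_prediction(toy_model, latent, "positive", "negative", cfg=2.0)
calls_at_two = len(calls)
```

With CFG at 1.0 the toy model is called once, and with any other value it is called twice, which matches the remark in the interview that sampling is two times faster when the negative prompt is not applied.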
The Agents of Change: SEO, Social Media, and Mobile Marketing for Small Business

What if you could turn your creative visions into reality without hiring an expensive designer? That's where AI tools like MidJourney and Ideogram come in. On this episode of The Agents of Change, I sit down with Jeff Sieh, a content creator, live streamer, and AI enthusiast, to explore how marketers and creators can use AI to craft stunning visuals and repurpose content more efficiently. Jeff's insights into prompt engineering, style references, and blending tools will help you cut through the noise and stand out in a crowded feed. If you've ever struggled with getting the perfect image or want to better leverage AI in your marketing efforts, this episode is for you. https://www.theagentsofchange.com/562

TeknoSafari's Podcast
If a Sly Intern Plays AI Agent - This Week in AI S2 E4

TeknoSafari's Podcast

Play Episode Listen Later Nov 14, 2024 19:57


1. Claude can see your computer! 2. Perplexity is trying to rival NotebookLM, and it is also ambitious on reasoning. Meanwhile, #WallStreetJournal and #NewYorkPost are suing #PerplexityAI, accusing it of copyright infringement and brand damage. 3. Ideogram has reached the top with Canvas. 4. State of the art in video and open source: Mochi 1 is definitely worth trying. 5. The matter was not settled when Flux arrived; Stable Diffusion 3.5 is here. 6. RunwayML is on the attack with Act-One. 7. OpenAI has opened advanced voice to Europe as well. 8. A ByteDance intern was fired for planting malicious code in AI models. 9. Elon has released the xAI API. Grok can be added to your apps. If Grok 3 arrives, things will get chaotic. 10. The GPT Windows app has arrived. #yapayzeka #teknolojihaberleri #bilim

Let's Talk AI
#187 - Anthropic Agents, Mochi1, 3.4B data center, OpenAI's FAST image gen

Let's Talk AI

Play Episode Listen Later Oct 28, 2024 129:38


Our 187th episode with a summary and discussion of last week's big AI news, now with Jeremie co-hosting once again! With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris)
Read out our text newsletter and comment on the podcast at https://lastweekin.ai/. If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form. Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai
Timestamps + Links:
(00:00:00) Intro / Banter
(00:03:07) Response to listener comments / corrections
(00:05:13) Sponsor Read
Tools & Apps
(00:06:22) Anthropic's latest AI update can use a computer on its own
(00:18:09) AI video startup Genmo launches Mochi 1, an open source rival to Runway, Kling, and others
(00:20:37) Canva has a shiny new text-to-image generator
(00:23:35) Canvas Beta brings Remix, Extend, and Magic Fill to Ideogram users
(00:26:16) StabilityAI releases Stable Diffusion 3.5
(00:28:27) Bringing Agentic Workflows into Inflection for Enterprise
Applications & Business
(00:32:35) Crusoe's $3.4B joint venture to build AI data center campus with up to 100,000 GPUs
(00:39:08) Anthropic reportedly in early talks to raise new funding on up to $40B valuation
(00:45:47) Longtime policy researcher Miles Brundage leaves OpenAI
(00:49:53) NVIDIA's Blackwell GB200 AI Servers Ready For Mass Deployment In December
(00:52:41) Foxconn building Nvidia superchip facility in Mexico, executives say
(00:55:27) xAI, Elon Musk's AI startup, launches an API
Projects & Open Source
(00:58:32) INTELLECT-1: The First Decentralized 10-Billion-Parameter AI Model Training
(01:06:34) Meta FAIR Releases Eight New AI Research Artifacts: Models, Datasets, and Tools to Inspire the AI Community
(01:10:02) Google DeepMind is making its AI text watermark open source
Research & Advancements
(01:13:21) OpenAI researchers develop new model that speeds up media generation by 50X
(01:17:54) How much AI compute is out there, and who owns it?
(01:25:28) Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
(01:33:30) Inference Scaling for Long-Context Retrieval Augmented Generation
Policy & Safety
(01:41:50) Announcing our updated Responsible Scaling Policy
(01:48:52) Anthropic is testing AI's capacity for sabotage
(01:56:30) OpenAI asked US to approve energy-guzzling 5GW data centers, report says
(02:00:05) US Probes TSMC's Dealings with Huawei
(02:03:03) TikTok owner ByteDance taps TSMC to make its own AI GPUs to stop relying on Nvidia - the company has reportedly spent over $2 billion on Nvidia AI GPUs
(02:06:37) Outro

AI For Humans
Anthropic's New AI Agent, OpenAI Plays Catch-up, Runway's Act-One & More AI News

AI For Humans

Play Episode Listen Later Oct 24, 2024 50:12


AI NEWS: Agents are here from Anthropic with Computer Use in Claude Sonnet 3.5 (new) and likely coming from OpenAI, O1 keeps getting better and might get upgraded soon, Runway's new Act-One lets you puppet AI video, Ideogram's new Canvas upgrades AI imaging, Unitree's robots are getting WAY better and we show you how to make Google's NotebookLM uncensored. AND OH SO MUCH MORE.
It's a big, massive week of AI news. And we are here, for you.
Join our Patreon: https://www.patreon.com/AIForHumansShow
Jump in our Discord: https://discord.gg/muD2TYgC8f
Follow us for more on X @AIForHumansShow
Join our TikTok @aiforhumansshow
And to contact or book us for speaking/consultation, please visit our website: https://www.aiforhumans.show/
// Show Links //
Anthropic Drops “Computer Use” In Sonnet 3.5 aka AI Agents https://www.anthropic.com/news/3-5-models-and-computer-use
Claude Coding 90s Website: https://youtu.be/vH2f7cjXjKI?si=XqTRKVxHZx1bK36b
Picks the first link on Google: https://x.com/AnthropicAI/status/1848742757151498717
What Computer Use Can't Do https://x.com/forgebitz/status/1848764235729244254
OpenAI's Noam Brown on O1 https://v.redd.it/7dic62adm3wd1
OpenAI Feels The Pressure, Close To Releasing Coding Bot https://www.theinformation.com/articles/openai-in-duel-with-anthropic-doubles-down-on-ai-that-writes-software
OpenAI Agentic Rumors Involving Microsoft https://x.com/flowersslop/status/1848506100435304852
Sam Altman Teases ChatGPT Update For Second Birthday https://x.com/sama/status/1848487309211275398
Satya Nadella Says We're “Using AI Tools to Build Better AI” https://x.com/tsarnick/status/1848472478257189374
Runway Act-One https://runwayml.com/research/introducing-act-one
Teaser Video https://x.com/runwayml/status/1848785907723473001
Two actors in a scene https://x.com/runwayml/status/1848785913918218517
Mochi 1 -- New OpenSource AI Video From Genmo https://x.com/genmoai/status/1848762405779574990
Ideogram Canvas Feature https://x.com/ideogram_ai/status/1848757699606983143
Stable Diffusion 3.5 https://x.com/StabilityAI/status/1848729212250951911
Unitree Robot Exercise Videos https://youtu.be/G6JE7mNYz2A?si=KLiXYznOUy7Qz4Rh
TANGO https://x.com/dreamingtulpa/status/1847310594434584922
Trump at a McDonald's https://x.com/aliensupershow/status/1848438728148111822
NotebookLM Uncensored https://www.reddit.com/r/notebooklm/comments/1g64iyi/holy_shit_listeners_notebooklm_can_generate_18/

Marketing Against The Grain
NotebookLM is INSANE! How to Use Google's AI Tool for Marketing In 2024

Marketing Against The Grain

Play Episode Listen Later Oct 10, 2024 18:24


Ep. 268 "Is AI about to revolutionize storytelling forever?" Kipp dives into the groundbreaking potential of Google's NotebookLM and how it might reshape marketing strategies in 2024. Learn more about leveraging AI for creating engaging media, how engineers are becoming pivotal in marketing, and tips on automating content creation efficiently. Mentions Andrej Karpathy https://x.com/karpathy Histories of Mysteries podcast https://open.spotify.com/show/3K4LRyMCP44kBbiOziwJjb?si=432a337c28f14d97&nd=1&dlsi=1e2cd9b320094415 NotebookLM https://notebooklm.google/ Ideogram https://ideogram.ai/ Claude https://claude.ai/ Nathan Barry https://x.com/nathanbarry Resource [Free] Steal our favorite AI Prompts featured on the show! Grab them here: https://clickhubspot.com/aip We're on Social Media! Follow us for everyday marketing wisdom straight to your feed YouTube: ​​https://www.youtube.com/channel/UCGtXqPiNV8YC0GMUzY-EUFg  Twitter: https://twitter.com/matgpod  TikTok: https://www.tiktok.com/@matgpod  Join our community https://landing.connect.com/matg Thank you for tuning into Marketing Against The Grain! Don't forget to hit subscribe and follow us on Apple Podcasts (so you never miss an episode)! https://podcasts.apple.com/us/podcast/marketing-against-the-grain/id1616700934   If you love this show, please leave us a 5-Star Review https://link.chtbl.com/h9_sjBKH and share your favorite episodes with friends. We really appreciate your support. Host Links: Kipp Bodnar, https://twitter.com/kippbodnar   Kieran Flanagan, https://twitter.com/searchbrat  ‘Marketing Against The Grain' is a HubSpot Original Podcast // Brought to you by The HubSpot Podcast Network // Produced by Darren Clarke.

The Next Wave - Your Chief A.I. Officer
5+ AI Workflows You Can Copy For Your Business in 2024

The Next Wave - Your Chief A.I. Officer

Play Episode Listen Later Sep 10, 2024 43:08


Episode 23: How can AI simplify complex workflows and enrich language learning? Matt Wolfe (https://x.com/mreflow) and Nathan Lands (https://x.com/NathanLands) take you on an insightful journey exploring diverse AI use cases and tools. In this episode, Matt and Nathan delve into the intricacies of learning Japanese and coding with AI, leveraging tools like Claude and Perplexity to streamline and enhance these processes. Nathan shares his experience simplifying Japanese language learning through targeted translation techniques, while Matt reveals his tips for efficient coding using AI, along with strategies for optimizing content with AI tools like Perplexity and Ideogram. The duo also discusses workflow automation, potential SEO hacks, and meeting management with AI, rounding out the episode with engaging and valuable insights. Check out The Next Wave YouTube Channel if you want to see Matt and Nathan on screen: https://lnk.to/thenextwavepd
Show Notes:
(00:00) Exploring advanced AI use cases for businesses.
(04:40) AI recreates Doom from gameplay videos accurately.
(07:30) Useful for newsletters, business documents, and data management.
(09:55) Custom instructions enhance model quality and results.
(13:09) Create shorthands for tasks using custom instructions.
(18:19) Perplexity's page feature generates mini Wikipedia entries.
(25:25) Curious about AI-generated YouTube thumbnail prompts.
(26:48) Automating business workflows.
(31:49) Automated workflows simplify data analysis with make.com.
(35:31) AI efficiently simplifies meeting agenda creation process.
(37:21) Google Meet now summarizes and generates meeting notes.
Mentions:
Riley Brown: https://www.youtube.com/channel/UCMcoud_ZW7cfxeIugBflSBw
Ideogram 2.0: https://about.ideogram.ai/2.0
Perplexity: https://www.perplexity.ai/
Claude: https://claude.ai/
Make.com: https://www.make.com/en
Check Out Matt's Stuff:
• Future Tools - https://futuretools.beehiiv.com/
• Blog - https://www.mattwolfe.com/
• YouTube - https://www.youtube.com/@mreflow
Check Out Nathan's Stuff:
• Newsletter - https://news.lore.com/
• Blog - https://lore.com/
The Next Wave is a HubSpot Original Podcast // Brought to you by The HubSpot Podcast Network // Production by Darren Clarke // Editing by Ezra Bakker Trupiano

The Family History AI Show
EP12: Hollywood AI Blunder, AI Image Generator Roundup, Google Lens Saves You Time Researching, Use AI For Translation

The Family History AI Show

Play Episode Listen Later Sep 10, 2024 57:09


Hosts Mark Thompson and Steve Little expertly navigate the rapidly evolving AI landscape, offering practical insights on leveraging AI for your family history research. Mark and Steve open this episode by discussing lessons that genealogists can learn from Hollywood on the importance of fact-checking. Next, they provide a comprehensive roundup of the top AI image-generation tools. Then, how Google Lens' big upgrade can greatly simplify your research. In this week's Tip of the Week, learn how AI translation can help you research in another language, and so much more. With a mix of news, analysis, and hands-on advice, this podcast equips you with the knowledge to harness AI's power in uncovering your family stories. Whether you're a tech-savvy researcher or new to AI, this show offers valuable insights.
Timestamps
I. AI In the News
00:01:16 Hollywood AI Mistake: Lessons for genealogists on fact-checking.
00:04:18 AI Image Generators: Overview of popular tools and their applications.
00:24:22 Google Lens Upgrade: New features for image and text search.
II. Tip of the Week
00:30:13 AI Building Blocks - Translation: Exploring translation and applications in genealogy.
III. RapidFire
00:41:57 Microsoft Edge Tab Organizer: New AI-powered feature for efficient research.
00:44:18 "The AI Scientist": Discussion of AI agents and complex problem-solving.
00:49:10 Eleven Labs Reader Upgrade: Text-to-speech tool now supports 32 languages.
00:53:03 OpenAI's Condé Nast Deal: Improves AI training and new ways to access publications.
Resource Links
ChatGPT: https://chat.openai.com/
DALL-E: https://openai.com/dall-e-2
MidJourney: https://www.midjourney.com/
Google Lens: https://lens.google.com/
Imagen: https://deepmind.google/technologies/imagen-3/
Ideogram: https://ideogram.ai/
Adobe Firefly: https://www.adobe.com/sensei/generative-ai/firefly.html
Eleven Labs Reader: https://elevenlabs.io/
Google Translate: https://translate.google.com/
Microsoft Edge (for Tab Organizer): https://www.microsoft.com/edge
Transkribus: https://readcoop.eu/transkribus/
OpenAI: https://openai.com/
Tags
Artificial Intelligence, Genealogy, Family History, AI Tools, Image Generation, Google Lens, OCR Technology, Language Translation, AI Ethics, Research Techniques, DALL-E, Midjourney, Adobe Firefly, Google Imagen, Microsoft Edge, Tab Management, AI Agents, Text-to-Speech, Eleven Labs Reader, OpenAI Partnerships

Inteligencia Artificial
New AI Tools: Ideogram, Grok 2, and Cerebras

Inteligencia Artificial

Play Episode Listen Later Sep 4, 2024


In the latest episode of the "Inteligencia Artificial" podcast, we analyze the newest innovations in the world of AI. Today we explore three key tools that are transforming the field of artificial intelligence: Ideogram v2, Grok 2, and Cerebras. If you are interested in the present and future of AI, read on to learn about the […] Source

Let's Talk AI
#180 - Ideogram v2, Imagen 3, AI in 2030, Agent Q, SB 1047

Let's Talk AI

Play Episode Listen Later Sep 3, 2024 125:23 Transcription Available


Our 180th episode with a summary and discussion of last week's big AI news! With hosts Andrey Kurenkov (https://twitter.com/andrey_kurenkov) and Jeremie Harris (https://twitter.com/jeremiecharris) If you would like to get a sneak peek and help test Andrey's generative AI application, go to Astrocade.com to join the waitlist and the discord. Read our text newsletter and comment on the podcast at https://lastweekin.ai/ If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form. Email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Episode Highlights: Ideogram AI's new features, Google's Imagen 3, Dream Machine 1.5, and Runway's Gen3 Alpha Turbo model advancements. Perplexity's integration of Flux image generation models and code interpreter updates for enhanced search results. Exploration of the feasibility and investment needed for scaling advanced AI models like GPT-4 and Agent Q architecture enhancements. Analysis of California's AI regulation bill SB1047 and legal issues related to synthetic media, copyright, and online personhood credentials.
Timestamps + Links: (00:00:00) Intro / Banter (00:01:08) Response to Listener Comments / Corrections Tools & Apps (00:03:58) Ideogram AI expands its features with v2 model and color palette options (00:07:48) Google Releases Powerful AI Image Generator You Can Use for Free (00:11:41) Perplexity adds Flux.1 model for Pro users alongside Playground v3 update (00:13:58) Luma drops Dream Machine 1.5 — here's what's new (00:17:49) Runway's Gen-3 Alpha Turbo is here and can make AI videos faster than you can type (00:20:21) Perplexity's latest update improves code interpreter, charts included Applications & Business (00:24:14) AMD buying server maker ZT Systems for $4.9 billion as chipmakers strengthen AI capabilities (00:28:55) Ars Technica content is now available in OpenAI services (00:34:08) Anysphere, a GitHub Copilot rival, has raised $60M Series A at $400M valuation from a16z, Thrive, sources say (00:38:32) Stability AI appoints new Chief Technology Officer (00:41:45) Cruise's robotaxis are coming to the Uber app in 2025 Projects & Open Source (00:44:16) AI21 Introduces the Jamba Model Family: The most powerful and efficient long-context models for the enterprise (00:53:47) Microsoft reveals Phi-3.5 — this new small AI model outperforms Gemini and GPT-4o (00:57:33) Nvidia's Llama-3.1-Minitron 4B is a small language model that punches above its weight (01:00:58) Open source Dracarys models ignite generative AI fired coding Research & Advancements (01:12:35) Can AI Scaling Continue Through 2030?
(01:15:35) Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents (01:23:58) Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models (01:31:18) Loss of plasticity in deep continual learning Policy & Safety (01:38:20) California weakens bill to prevent AI disasters before final vote, taking advice from Anthropic (01:48:14) Personhood credentials: Artificial intelligence and the value of privacy-preserving tools to distinguish who is real online (01:52:44) Showing SAE Latents Are Not Atomic Using Meta-SAEs Synthetic Media & Art (01:58:33) Authors sue Claude AI chatbot creator Anthropic for copyright infringement (01:59:32) Artists' lawsuit against Stability AI and Midjourney gets more punch (02:01:43) Outro

The AI Breakdown: Daily Artificial Intelligence News and Discussions
The 5 Most Important Stories in AI This Week

The AI Breakdown: Daily Artificial Intelligence News and Discussions

Play Episode Listen Later Aug 24, 2024 12:33


Covering the five most significant stories in AI this week. Major product releases like MidJourney's new web-based interface and Ideogram 2.0's text generation capabilities are highlighted, along with key enterprise AI developments from Salesforce. The ongoing debate over California's AI safety bill SB 1047 is also discussed, as well as how intense competition among AI companies is driving benefits for consumers and startups. Stay updated on the latest in AI! Concerned about being spied on? Tired of censored responses? AI Daily Brief listeners receive a 20% discount on Venice Pro. Visit https://venice.ai/nlw and enter the discount code NLWDAILYBRIEF. Learn how to use AI with the world's biggest library of fun and useful tutorials: https://besuper.ai/ Use code 'podcast' for 50% off your first month. The AI Daily Brief helps you understand the most important news and discussions in AI. Subscribe to the podcast version of The AI Daily Brief wherever you listen: https://pod.link/1680633614 Subscribe to the newsletter: https://aidailybrief.beehiiv.com/ Join our Discord: https://bit.ly/aibreakdown

This Day in AI Podcast
EP74: Human Eggs with Ideogram 2.0, Phi 3.5 Boom Factor + AI-Free Startups

This Day in AI Podcast

Play Episode Listen Later Aug 23, 2024 73:09


Sign up to Simtheory for an AI workspace: https://simtheory.ai
Try Ideogram 2.0 on Simtheory
---
CHAPTERS:
00:00 - Ideogram 2.0: Your new AI graphics designer?
23:46 - Microsoft Phi 3.5 Initial Impressions & Thoughts + Boom Factor
38:51 - AI workspace productivity: how much is your productivity worth?
55:08 - Procreate's Anti AI Movement: Marketing or a New Category?
1:07:06 - Chris's thoughts on Phi-3.5 Fine Tuning & Lack of Documentation, Accessibility of Models to Try
---
To see images from the show join our Discord community: https://thisdayinai.com
Show notes: https://thisdayinai.com/bookmarks/68-ep74
Thanks for listening, your comments, reviews and support of the show. We really appreciate it and love hearing from you.
PS. Tasmanian YouTuber Chris mentions: https://www.youtube.com/@UCalOFVbIxEAWIV5LHGkKcnw

Everyday AI Podcast – An AI and ChatGPT Podcast
EP 342: New AI Image Generators You Won't Believe - Flux, Ideogram 2, and more

Everyday AI Podcast – An AI and ChatGPT Podcast

Play Episode Listen Later Aug 22, 2024 54:13


Send Everyday AI and Jordan a text message
Win a free year of ChatGPT or other prizes! Find out how.
Y'all won't believe these new AI image generators.... They're starting to blur the line between fact and fiction. (In both a good and a bad way.) And in the past few weeks, the entire game has changed. We go over what's new and what you need to know.
Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Join the discussion: Ask Jordan questions on AI image generators
Related Episodes:
Ep 218: Winning the Probability Game in AI Visuals
Ep 198: Midjourney V6 – What's new and producing powerful ad creatives
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn
Topics Covered in This Episode:
1. Different image generators and their uses
2. Importance of AI images
3. AI image generation timeline
4. Potential misuse of AI image generators
5. AI image generation prompts
Timestamps:
02:50 Daily AI news
06:20 AI image vs real image test
09:12 AI creates images from text descriptions using AI.
10:05 AI image generators use diffusion models, face copyright concerns.
13:43 DALL-E and OpenAI in AI image game.
18:14 Live demonstration, prompt, show, vote for best.
21:30 Open source model used for image generators.
27:49 Issues showing small URLs, presenting photo options.
30:46 Interface is a love-hate experience with AI.
34:47 Generate, observe real-time photo realistic beach image.
38:25 Imprinting painting with quick color changes attempt.
41:57 Results of AI image generator with social platform.
42:41 Explore AI image generators for various industries.
48:36 Former president spreads problematic AI-generated images.
51:16 AI image generators creating videos are powerful.
Keywords:
AI tools, photorealistic images, ideogram, Imagine 3, Flux 1, image generation, Grok, Google DeepMind, Jordan Wilson, Midjourney, user control, AI image generators, misinformation and disinformation, open source models, copyright concerns, DALL E, diffusion models, AI-generated images, AI demonstration, Chat GPT course, webinar, audience vote, AI impact on creativity and business, Google and California deal, Microsoft AI feature recall, Neuralink brain chip, misuse of image generators, video creation, ad campaigns, newsletter giveaway
Get more out of ChatGPT by learning our PPP method in this live, interactive and free training! Sign up now: https://youreverydayai.com/ppp-registration/

AI For Humans
Ideogram & Flux Make AI Imaging Too Realistic, OpenAI Updates & More AI News

AI For Humans

Play Episode Listen Later Aug 22, 2024 47:55


Join our Patreon: https://www.patreon.com/AIForHumansShow AI news coming in HOT: Ideogram 2.0 dropped and, along with Flux community updates & Google's IMAGEN-3, continues to show how AI imaging tools keep improving… but also opens the door to whole new messes. Plus, OpenAI drops new blog posts (wee!), Unitree sends their cheap humanoid robot into production and we use Hedra and a bunch of other AI tools to interview Baby Joe Brogan. It's quite a moment. Follow us for more on X @AIForHumansShow Join our TikTok @aiforhumansshow And to contact or book us for speaking/consultation, please visit our website: https://www.aiforhumans.show/ // SHOW LINKS // Ideogram https://ideogram.ai/t/explore Ideogram 2.0 Launch Trailer https://x.com/i/status/1826277550798278804 Fluxstanza https://civitai.com/models/657252/fluxstanza?modelVersionId=735368 https://x.com/itspoidaman/status/1824957283803308536 FAL.ai https://fal.ai/models Political AI Image Controversy https://www.nytimes.com/2024/08/19/us/politics/trump-taylor-swift-ai-images.html Imagen-3 https://deepmind.google/technologies/imagen-3/ Procreate Boss Says They'll Never Use AI https://www.theverge.com/2024/8/19/24223473/procreate-anti-generative-ai-pledge-digital-illustration-creatives OpenAI Fine Tuning Launched For GPT-4o (explain what this is…) https://openai.com/index/gpt-4o-fine-tuning/ Partnering with Conde Nast https://openai.com/index/conde-nast/ Matthew Berman SearchGPT Video https://youtu.be/DV9I_fu0ba8?si=Kcd4wb54wFgwaQuJ Unitree's 16k Humanoid Robot Goes Into Production https://x.com/UnitreeRobotics/status/1789931753974517820 GoT AI Rave https://x.com/andr3_ai/status/1825600625754911091 Space Vets Children's Series: https://www.storybookstudios.ai/space-vets Demis Hassabis on the Google Deep Mind Podcast https://x.com/GoogleDeepMind/status/1824447036847993292 Runway GEN-3 Turbo https://venturebeat.com/ai/runways-gen-3-alpha-turbo-is-here-and-can-make-ai-videos-faster-than-you-can-type/ Hedra 1.5 Character Generation https://x.com/hedra_labs/status/1824113944757457157

a16z
When AI Meets Art

a16z

Play Episode Listen Later Jul 30, 2024 43:20


On June 27th, the a16z team headed to New York City for the first-ever AI Artist Retreat at their office. This event brought together the builders behind some of the most popular AI creative tools, along with 16 artists, filmmakers, and designers who are exploring the capabilities of AI in their work.
In this episode, we hear from the innovators pushing the boundaries of AI creativity. Joined by Anish Acharya, General Partner, and Justine Moore, Partner on the Consumer team, we feature insights from:
Ammaar Reshi - Head of Design, ElevenLabs
Justin Maier - Cofounder & CEO, Civitai
Maxfield Hulker - Cofounder & COO, Civitai
Diego Rodriguez - Cofounder & CTO, Krea
Victor Perez - Cofounder & CEO, Krea
Mohammad Norouzi - Cofounder & CEO, Ideogram
Hang Chu - Cofounder & CEO, Viggle
Conor Durkan - Cofounder, Udio
These leaders highlight the surprising commonalities between founders and artists, and the interdisciplinary nature of their work. The episode covers the origin stories behind these innovative tools, their viral moments, and their future visions.
You'll also hear about the exciting potential for AI in various creative modalities, including image, video, music, 3D, and speech.
Keep an eye out for more in our series highlighting the founders building groundbreaking foundation models and AI applications for video, audio, photography, animation, and more.
Learn more and see videos on artists leveraging AI at: a16z.com/aiart
Find Ammaar on Twitter: https://x.com/ammaar
Learn more about ElevenLabs: https://elevenlabs.io
Find Justin on Twitter: https://x.com/justmaier
Find Max on LinkedIn: https://www.linkedin.com/in/maxfield-hulker-5222aa230/
Learn more about Civitai: https://civitai.com
Find Diego on Twitter: https://x.com/asciidiego?lang=en
Find Victor on Twitter: https://x.com/viccpoes
Learn more about Krea: https://www.krea.ai/home
Find Mohammad on Twitter: https://x.com/mo_norouzi
Learn more about Ideogram: https://ideogram.ai/t/explore
Find Conor on Twitter: https://x.com/conormdurkan
Learn more about Udio: https://www.udio.com/home
Find Hang on Twitter: https://x.com/chuhang1122
Learn more about Viggle: https://viggle.ai/
Stay Updated:
Let us know what you think: https://ratethispodcast.com/a16z
Find a16z on Twitter: https://twitter.com/a16z
Find a16z on LinkedIn: https://www.linkedin.com/company/a16z
Subscribe on your favorite podcast app: https://a16z.simplecast.com/
Follow our host: https://twitter.com/stephsmithio
Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.

Everyday VOpreneur
Boost Your Marketing With AI: Elevate Your Graphics And Images with Mandi Kaye

Everyday VOpreneur

Play Episode Listen Later Apr 11, 2024 41:49


Mandi Kaye joins Marc Scott to discuss AI Image Generation and how voice actors can use it in their marketing efforts. Mandi shares her experience and insights on using various AI tools for image generation, including Dall-E, Ideogram and Leonardo. They discuss the evolution of AI technology, the benefits and limitations of different tools, and the importance of prompts in generating desired images. Mandi shares how she uses AI image generation to create social media posts and marketing materials for her voiceover business. She explains that she starts by writing the text she wants to accompany the image and then asks ChatGPT to generate an image that matches. She iterates on the generated images until she finds the perfect one. Mandi also discusses the importance of creating detailed prompts to get the desired results. She shares examples of prompts she has used and emphasizes the need to be specific and thorough. Potential concerns about using AI-generated content are also discussed. CONNECT WITH MANDI KAYE Mandi Kaye Website - https://mandikaye.com Mandi Kaye on Instagram - https://www.instagram.com/themandikaye Mandi Kaye on LinkedIn - https://www.linkedin.com/in/mandisorensen Marc Scott on Instagram - @marcscott AI RESOURCES MENTIONED IN THIS EPISODE Dall-E - https://chat.openai.com Canva - https://canva.com Midjourney - https://www.midjourney.com Adobe Firefly - https://firefly.adobe.com Ideogram - https://ideogram.ai Leonardo - https://leonardo.ai RESOURCES FOR VOICE ACTORS * The VOpreneur Guide to Testimonials Visit https://vopreneur.com/testimonials * Get an instant $25 credit when you sign up for VoiceZam Visit https://voicezam.com/marcscott * For voice over services: Visit https://marcscottvoiceover.com * Want VOpreneur Swag? Visit https://teespring.com/stores/vopreneur * Join the VOpreneur Facebook Group Visit https://facebook.com/groups/vopreneur EVERYDAY VOPRENEURS IN THIS EPISODE * Thanks to "Uncle Roy" for production assistance!
Visit https://antlandproductions.com * Thanks to Christy Harst for VO contributions! Visit https://christyharst.com * Thanks to Krysta Wallrauch for VO contributions! Visit https://krystawallrauch.com If you need guidance with your voice over business or learning how to more effectively market, I can help. Book a 15 minute free consultation with me to discuss your specific needs. Book Your Consult KEY TAKEAWAYS AI image generation offers exciting possibilities for voice actors. Different AI tools have different strengths and weaknesses, and it's important to choose the right tool for the desired outcome. Prompts play a crucial role in generating desired images, and experimenting with prompts can lead to better results. AI image generation can be used for various applications, such as creating quote graphics and social media graphics for voiceover businesses. AI image generation can be used to create social media posts and marketing materials for businesses Writing detailed prompts is crucial to getting the desired results from AI image generation AI-generated content should be used with real content to avoid any potential issues AI image generation can save time and help with brainstorming and ideation  

Marketing Against The Grain
Matt Wolfe Ranks The Best AI Tools For Marketers In 2024

Marketing Against The Grain

Play Episode Listen Later Apr 9, 2024 38:22


Ep. 215 What are the hottest and most important AI apps to use in your marketing today? Kipp and guest Matt Wolfe (entrepreneur and YouTuber) dive into the intricate world of AI marketing tools, exploring what it means to effectively integrate AI into your toolkit. Learn more about how Hume can decipher the subtleties of human emotion, the creative potential of using Suno for song creation, and the practical uses of both Recraft and Ideogram for crafting compelling marketing visuals. Check out Matt's new podcast The Next Wave here: https://link.chtbl.com/uqnVaUip Mentions Hume https://www.hume.ai/ Suno https://www.suno.ai/ Recraft https://www.recraft.ai/ Ideogram https://ideogram.ai/ We're on Social Media! Follow us for everyday marketing wisdom straight to your feed YouTube: https://www.youtube.com/channel/UCGtXqPiNV8YC0GMUzY-EUFg Twitter: https://twitter.com/matgpod TikTok: https://www.tiktok.com/@matgpod Join our community https://landing.connect.com/matg Thank you for tuning into Marketing Against The Grain! Don't forget to hit subscribe and follow us on Apple Podcasts (so you never miss an episode)! https://podcasts.apple.com/us/podcast/marketing-against-the-grain/id1616700934 If you love this show, please leave us a 5-Star Review https://link.chtbl.com/h9_sjBKH and share your favorite episodes with friends. We really appreciate your support. Host Links: Kipp Bodnar, https://twitter.com/kippbodnar Kieran Flanagan, https://twitter.com/searchbrat 'Marketing Against The Grain' is a HubSpot Original Podcast // Brought to you by The HubSpot Podcast Network // Produced by Darren Clarke.

AI For Humans
OpenAI vs Elon, Anthropic's Claude 3 Is Great & AIs Debate Toilet Paper | Ep47

AI For Humans

Play Episode Listen Later Mar 7, 2024 93:23


This week… Elon Musk sues OpenAI, Anthropic's Claude 3 is the best LLM we've used, Google's Sergey Brin knows they screwed up & AI tax bots gone very wrong. Plus, Gavin dives into text-to-image newness with Ideogram 1.0, Kevin shows off Dust3r which does simple 3D modeling, another dancing robot and a group of hackers takes on Humane's AI pin.  AND THEN…it's the return of the AI For Humans AI Debate! We pit OpenAI's GPT-4 against the newly released Claude 3 and the results will both surprise and, dare we say, SHOCK YOU TO YOUR CORE. This week's AI co-host is one of our two debaters, Dr. Cornelius “Corny” Quckenbush, who has come to take on GPT-4 and also tell us of his deep love of rubber ducks. It's an endless cavalcade of ridiculous and informative AI news, AI tools, and AI entertainment cooked up just for you. Follow us for more AI discussions, AI news updates, and AI tool reviews on X @AIForHumansShow Join our vibrant community on TikTok @aiforhumansshow For more info, visit our website at https://www.aiforhumans.show/   /// Show links /// Anthropic's Claude 3 https://claude.ai/ Elon Vs OpenAI https://www.nytimes.com/2024/03/02/technology/elon-musk-openai-lawsuit-microsoft-research.html Cecilia Ziniti's Twitter Thread https://twitter.com/CeciliaZin/status/1763849318396752151 Claude 3 Is Here https://x.com/AnthropicAI/status/1764653830468428150?s=20 Claude3 Has Awareness of Doing a Test https://x.com/alexalbert__/status/1764722513014329620?s=20 Sergey Brin: We Messed Up https://fortune.com/2024/03/04/sergey-brin-google-definitely-messed-up-gemini-image-generation/ Laurie Anderson Brings Lou Reed Back https://www.theguardian.com/music/2024/feb/28/laurie-anderson-ai-chatbot-lou-reed-ill-be-your-mirror-exhibition-adelaide-festival?utm_source=aisecret.us&utm_medium=Aisecret.us&utm_campaign=Daily AI Political Generated Images That Aren't Real https://www.theguardian.com/us-news/2024/mar/04/trump-ai-generated-images-black-voters Washington Post: Tax Chatbots Are 
Screwing Up https://www.washingtonpost.com/technology/2024/03/04/ai-taxes-turbotax-hrblock-chatbot/ AI App Helps Detect Ear Infections In Children https://www.cbsnews.com/pittsburgh/news/ai-smartphone-app-diagnose-ear-infections-pittsburgh/ Whomane: Open Source AI Pin https://x.com/kodjima33/status/1764472814353183199?s=20 Expressive Whole Body Control for Humanoid Robots https://youtu.be/UGA9YAg3e-M?si=cYqM30UoOPIsNuzU WTFaldo (Waldo Animation) https://x.com/CitizenPlain/status/1764763312107970592?s=20 Dust3r https://dust3r.europe.naverlabs.com/ Ideogram https://ideogram.ai/t/explore  

Unsupervised Learning
Ep 29: Salesforce AI CEO Clara Shih on Future of Slack, How Gucci Uses AI and Working with Marc Benioff

Unsupervised Learning

Play Episode Listen Later Mar 6, 2024 52:37


There's an ongoing debate about where the most value will accrue in AI between incumbents and startups. Of the incumbents, few have shipped product faster than SalesforceAI. Today on Unsupervised Learning we had on Clara Shih, CEO of SalesforceAI and one of Time Magazine's 100 Most Influential People in AI.
(0:00) intro
(0:50) work practices that will become irrelevant
(1:37) revolutionizing reply recommendations and case summaries
(4:57) newest Salesforce products
(5:53) structuring teams
(7:22) engineering trust into AI products
(11:58) combining in-house models with ChatGPT
(13:33) Gucci's AI adoption
(16:01) how does Salesforce choose who to share their data with?
(20:29) AI costs
(26:29) creating unique voices for brands
(27:45) AI incumbents vs. startups
(29:54) what Clara would build if she had the time
(32:28) the future of Slack
(35:55) what percent of customer support questions can be answered by AI?
(38:37) over-hyped/under-hyped
(39:32) working with Marc Benioff
(40:46) Jacob and Pat debrief
(44:42) Slack is the perfect interface for generative AI
(46:10) Abridge investment
(48:15) Ideogram investment
With your co-hosts:
@jacobeffron - Partner at Redpoint, Former PM Flatiron Health
@patrickachase - Partner at Redpoint, Former ML Engineer LinkedIn
@ericabrescia - Former COO Github, Founder Bitnami (acq'd by VMWare)
@jordan_segall - Partner at Redpoint

How to Sell Your Stuff on Etsy
Ep 115 | Turn Your Handmade Business into a Multi-Stream Machine – with The Product Boss

How to Sell Your Stuff on Etsy

Play Episode Listen Later Feb 8, 2024 53:13


Have you ever heard the phrase, "Don't put all your eggs in the Etsy basket?" Today the Product Boss herself is joining us for an inspiring conversation about how you can scale your product-based business into multiple streams of income. Listen in to learn what you can do to safeguard and grow your Etsy business into a multi-stream machine from the best in the business. **"How to Sell Your Stuff on Etsy" is not affiliated with or endorsed by Etsy.com STUFF I MENTIONED: Check out the "How to Sell Your Stuff" Youtube Channel: THANKS for your support!!! FREE Best Seller Secret 5 Day Challenge (Feb 12-16, 2024): https://www.theproductboss.com/etsy Multi Stream Machine (available 2/14/24): https://theproductboss.samcart.com/referral/msm/QUGSlkiR2rCqAnii The Product Boss Podcast: https://www.theproductboss.com/podcast Instagram: http://instagram.com/theproductboss Facebook: https://www.facebook.com/theproductboss WHAT'S HAPPENING NOW: ⭐The A.I. Print on Demand LIVE Workshop just happened on 1/17/24! Get a copy of the recording, prompts for your own mockups + POD designs, tutorials for Midjourney, Ideogram, and Dalle-3 and MORE (Use code POD50 to save $50): https://www.howtosellyourstuff.com/ai-POD-workshop-enrollment ⭐ Get my free list of 100 in Demand Micro-Niche Keywords: https://www.howtosellyourstuff.com/Micro-Niche-Demand ⭐ Join the $4.99 monthly subscription to the In Demand Micro-Niche Keywords List: https://www.howtosellyourstuff.com/micro-niche-member (You'll get ongoing access to the ever-growing list of hundreds--soon to be thousands-- of micro-niche opportunities on Etsy!)
---------------------------------------------   ⭐Book a one-on-one Etsy coaching session with Lizzie: https://www.howtosellyourstuff.com/coaching ⭐Apply to be a Podcast Guest: https://bit.ly/48hFD8X Find me on Instagram and TikTok @HowtoSellYourStuff   FREE ETSY MASTERCLASS: https://www.howtosellyourstuff.com/masterclass FREE PDF DOWNLOAD: “4 Strategies I Used to Grow My Etsy Shop from $25 to $6000k/month”: https://www.howtosellyourstuff.com/site/4-strategies-opt-in   Grab my UPDATED Etsy Course for physical product sellers: “Listings that Sell 2.0” and learn how to skyrocket your Etsy business: https://www.howtosellyourstuff.com/etsy-listings-that-sell    ----- HOW TO SELL YOUR STUFF WEBSITE: https://www.howtosellyourstuff.com/ HOW TO SELL YOUR STUFF INSTAGRAM: https://www.instagram.com/howtosellyourstuff/ HOW TO SELL YOUR STUFF SHOWNOTES: https://www.howtosellyourstuff.com/blog/115 ------- THIS EPISODE IS SPONSORED BY: My most popular freebie— the FREE PDF download where I share the “4 Strategies I Used to Grow My Etsy Shop from $25/month to $6000+/month.” Grab a copy and start leveling up your Etsy shop: https://www.howtosellyourstuff.com/site/4-strategies-opt-in AND   Do you use special fonts, graphics, svgs, or other digital goods to create your products or run your Etsy business? You NEED Creative Fabrica! Creative Fabrica is a website where you can access UNLIMITED digital goods for just $9 per month. They have over 6 MILLION fonts, graphics, and other digital resources that you will gain full access to. (It's essentially the top Etsy seller's best kept secret!) AND on top of all that Creative Fabrica discovered this podcast and reached out to me because they wanted to offer you guys a special little perk: you can now get a one-month free trial for up to 10 product downloads to test drive it and see if it's a good fit for you. Learn more at: https://www.creativefabrica.com/ref/2877703 (Now through 2/14/24--- get the FULL YEAR for only $47! 
It's a crazy deal!)   *Some of the links above are affiliate links which means I'll receive a commission if you purchase through my link, at no extra cost to you. You can see my affiliate disclosure here: https://www.howtosellyourstuff.com/affiliate-disclosure

How to Sell Your Stuff on Etsy
Ep 114 | Sewing Hobby Turned College Etsy Side Hustle

How to Sell Your Stuff on Etsy

Play Episode Listen Later Feb 1, 2024 53:03


Miss Maddy learned how to sew as a child and fell in love with the hobby. As a teenager she turned her hobby into an Etsy side-hustle that helped her pay the bills all through college. Today her Cat Costume Shop has charmed almost 6,000 customers and continued to bring her joy, smiles, and a wonderful supplemental income. You're going to absolutely adore Maddy and her story! **"How to Sell Your Stuff on Etsy" is not affiliated with or endorsed by Etsy.com STUFF I MENTIONED: Where to find Maddy: Etsy: https://www.etsy.com/shop/MissMaddyMakes Instagram: @miss.maddy.makes WHAT'S HAPPENING NOW: ⭐The A.I. Print on Demand LIVE Workshop just happened on 1/17/24! Get a copy of the recording, prompts for your own mockups + POD designs, tutorials for Midjourney, Ideogram, and Dalle-3 and MORE (Use code POD50 to save $50): https://www.howtosellyourstuff.com/ai-POD-workshop-enrollment ⭐ Get my free list of 100 in Demand Micro-Niche Keywords: https://www.howtosellyourstuff.com/Micro-Niche-Demand ⭐ Join the $4.99 monthly subscription to the In Demand Micro-Niche Keywords List: https://www.howtosellyourstuff.com/micro-niche-member (You'll get ongoing access to the ever-growing list of hundreds--soon to be thousands-- of micro-niche opportunities on Etsy!)
---------------------------------------------   ⭐Book a one-on-one Etsy coaching session with Lizzie: https://www.howtosellyourstuff.com/coaching ⭐Apply to be a Podcast Guest: https://bit.ly/48hFD8X Find me on Instagram and TikTok @HowtoSellYourStuff   FREE ETSY MASTERCLASS: https://www.howtosellyourstuff.com/masterclass FREE PDF DOWNLOAD: “4 Strategies I Used to Grow My Etsy Shop from $25 to $6000k/month”: https://www.howtosellyourstuff.com/site/4-strategies-opt-in   Grab my UPDATED Etsy Course for physical product sellers: “Listings that Sell 2.0” and learn how to skyrocket your Etsy business: https://www.howtosellyourstuff.com/etsy-listings-that-sell    ----- HOW TO SELL YOUR STUFF WEBSITE: https://www.howtosellyourstuff.com/ HOW TO SELL YOUR STUFF INSTAGRAM: https://www.instagram.com/howtosellyourstuff/ HOW TO SELL YOUR STUFF SHOWNOTES: https://www.howtosellyourstuff.com/blog/114 ------- THIS EPISODE IS SPONSORED BY: My business and Etsy coaching services:  Sometimes we just need a pair of expert eyes to help us see our path forward more clearly! Whether you need help troubleshooting in your Etsy shop, pivoting to a new product, expanding your business to sell courses, and so much more-- you can hire me by the hour to provide a recorded zoom coaching session. We will work together to figure out your next steps so you can work smarter, not harder!  Book your session today: https://www.howtosellyourstuff.com/coaching AND My Customer Service Templates & Mini-Course: Learn my exact customer service strategy AND get access to over 20 templates of my word-for-word responses to customers in everyday and difficult situations:  https://www.howtosellyourstuff.com/offers/wUXKPzRG/checkout

How to Sell Your Stuff on Etsy
Ep 113 | Niching Down Strikes Again--Over 750 sales in 2 Years-- with Linda Sortino

How to Sell Your Stuff on Etsy

Play Episode Listen Later Jan 25, 2024 54:10


Does SEO matter? Yes. Do great pictures matter? Yes. But even MORE than all the typical Etsy advice--finding DEMAND and niching down to serve that demand will help new (and old) sellers win every time! This week I'm interviewing Linda Sortino, who has made over 700 sales selling bead boards. You may have never heard of this product--but her customers can't wait to get their hands on one. Listen in to hear how serving a very specific customer and niche can build an incredible Etsy business. **“How to Sell Your Stuff on Etsy” is not affiliated with or endorsed by Etsy.com STUFF I MENTIONED: Linda's Favorite Episodes: #70 Print on Demand Insight, Inspo, and Tips You Won't Want to Miss: https://www.howtosellyourstuff.com/blog/best-pod-for-etsy #102 Fast Success on Etsy in a “Saturated” Niche: https://www.howtosellyourstuff.com/blog/102   Where to find Linda: Blog: www.comebeadwithme.com   Product Website: https://beadboardenvy.com/ https://www.facebook.com/comebeadwithlinda/ https://www.pinterest.com/beadwithlinda/   WHAT'S HAPPENING NOW: ⭐The A.I. Print on Demand LIVE Workshop just happened on 1/17/24! Get a copy of the recording, prompts for your own mockups + POD designs, tutorials for Midjourney, Ideogram, and Dalle-3 and MORE (Use code POD50 to save $50): https://www.howtosellyourstuff.com/ai-POD-workshop-enrollment ⭐ Get my free list of 100 In-Demand Micro-Niche Keywords: https://www.howtosellyourstuff.com/Micro-Niche-Demand  ⭐ Join the $4.99 monthly subscription to the In-Demand Micro-Niche Keywords List: https://www.howtosellyourstuff.com/micro-niche-member  (You'll get ongoing access to the ever-growing list of hundreds--soon to be thousands--of micro-niche opportunities on Etsy!)
---------------------------------------------   ⭐Book a one-on-one Etsy coaching session with Lizzie: https://www.howtosellyourstuff.com/coaching ⭐Apply to be a Podcast Guest: https://bit.ly/48hFD8X Find me on Instagram and TikTok @HowtoSellYourStuff   FREE ETSY MASTERCLASS: https://www.howtosellyourstuff.com/masterclass FREE PDF DOWNLOAD: “4 Strategies I Used to Grow My Etsy Shop from $25 to $6000k/month”: https://www.howtosellyourstuff.com/site/4-strategies-opt-in   Grab my UPDATED Etsy Course for physical product sellers: “Listings that Sell 2.0” and learn how to skyrocket your Etsy business: https://www.howtosellyourstuff.com/etsy-listings-that-sell    ----- HOW TO SELL YOUR STUFF WEBSITE: https://www.howtosellyourstuff.com/ HOW TO SELL YOUR STUFF INSTAGRAM: https://www.instagram.com/howtosellyourstuff/ HOW TO SELL YOUR STUFF SHOWNOTES: https://www.howtosellyourstuff.com/blog/113 ------- THIS EPISODE IS SPONSORED BY: 100 KEYWORDS: Get the FREE list of 100 Keywords in various micro niches that all have demand without crazy competition ➡️ https://www.howtosellyourstuff.com/Micro-Niche-Demand  AND Listings that Sell 2.0 Learn all the secrets to build a 6 figure physical product shop with my flagship course Listings that Sell 2.0: https://www.howtosellyourstuff.com/etsy-listings-that-sell

How to Sell Your Stuff on Etsy
Ep 112 | Bailey has earned over $1 million in 3 years selling PNGs on Etsy – with Bailey Designed Co

How to Sell Your Stuff on Etsy

Play Episode Listen Later Jan 18, 2024 48:07


This digital product success story will blow your mind and inspire you to no end! Just three years ago, Bailey opened her Etsy shop selling PNG designs for sublimation tumblers, and since then she's become a top 0.1% Etsy Seller earning $45k per month passively through digital downloads. Listen in as she shares her story, key strategies that helped her grow, and how she's scaling and securing her business for the future. **“How to Sell Your Stuff on Etsy” is not affiliated with or endorsed by Etsy.com STUFF I MENTIONED: Bulk listing and editing tool Bailey mentioned (Vela): https://welcome.getvela.com/ How to Create Your Own Font in Midjourney: (Go to the Fonts and Calligraphy Chapters of the video): https://youtu.be/OY427QKSXcM?si=MlQRbJmRMSVc1QIX   Bailey's Digitally Purposed Community and Training: https://www.digitallypurposed.com/digitallypurposedsales-7407?am_id=lizzie295 Find Bailey: YouTube: https://bit.ly/BaileyYouTube   Digitally Purposed: https://bit.ly/digitallypurposed   WHAT'S HAPPENING NOW: ⭐The A.I. Print on Demand LIVE Workshop just happened on 1/17/24! Get a copy of the recording, prompts for your own mockups + POD designs, tutorials for Midjourney, Ideogram, and Dalle-3 and MORE (Use code POD50 to save $50): https://www.howtosellyourstuff.com/ai-POD-workshop-enrollment ⭐ Get my free list of 100 In-Demand Micro-Niche Keywords: https://www.howtosellyourstuff.com/Micro-Niche-Demand  ⭐ Join the $4.99 monthly subscription to the In-Demand Micro-Niche Keywords List: https://www.howtosellyourstuff.com/micro-niche-member  (You'll get ongoing access to the ever-growing list of hundreds--soon to be thousands--of micro-niche opportunities on Etsy!)
---------------------------------------------   ⭐Book a one-on-one Etsy coaching session with Lizzie: https://www.howtosellyourstuff.com/coaching ⭐Apply to be a Podcast Guest: https://bit.ly/48hFD8X Find me on Instagram and TikTok @HowtoSellYourStuff   FREE ETSY MASTERCLASS: https://www.howtosellyourstuff.com/masterclass FREE PDF DOWNLOAD: “4 Strategies I Used to Grow My Etsy Shop from $25 to $6000k/month”: https://www.howtosellyourstuff.com/site/4-strategies-opt-in   Grab my UPDATED Etsy Course for physical product sellers: “Listings that Sell 2.0” and learn how to skyrocket your Etsy business: https://www.howtosellyourstuff.com/etsy-listings-that-sell    ----- HOW TO SELL YOUR STUFF WEBSITE: https://www.howtosellyourstuff.com/ HOW TO SELL YOUR STUFF INSTAGRAM: https://www.instagram.com/howtosellyourstuff/ HOW TO SELL YOUR STUFF SHOWNOTES: https://www.howtosellyourstuff.com/blog/112 ------- THIS EPISODE IS SPONSORED BY: My Resources Me! And the Resources section on my website. If you have questions specific to your personal niche on Etsy, you should definitely come check out my Resource page at www.HowtoSellYourStuff.com/Resources where I will connect you with my favorite free and paid resources created by experts I have personally vetted. Whether you sell POD, digital products, printable, physical products and more—there's info waiting for you to help you on your Etsy journey!  Recommended Resources: https://www.howtosellyourstuff.com/resources AND Paige Hulse Law and the Creative Law Shop Whether you're just getting started on Etsy or you've been selling for years but never quite got around to the legal setup, I want to make sure you know about Attorney Paige Hulse and her Creative Law Shop. 
If you need legal assistance for your Etsy shop, want to register a trademark, or are looking for help with forming a business—contact Paige at https://paigehulse.com/ AND If you're looking for a well-crafted legal document that is tailored to creative entrepreneurship, but don't want the cash outlay of hiring an attorney by the hour, you can get everything you need from an LLC operating agreement, multi-person LLC agreements for partnerships, special provisions for your Etsy Shop Policies, affiliate agreements, influencer contracts, photography releases, and so much more. There are over 80 contracts available plus free resources and educational tools waiting for you at https://www.shopcreativelaw.com/ Make sure you use the code smiley10 for 10% off of anything from the Creative Law Shop! *Some of the links above are affiliate links which means I'll receive a commission if you purchase through my link, at no extra cost to you. You can see my affiliate disclosure here: https://www.howtosellyourstuff.com/affiliate-disclosure

Everyday AI Podcast – An AI and ChatGPT Podcast
EP 129: AI Image Generators - The Good, The Bad, and The Awesome

Everyday AI Podcast – An AI and ChatGPT Podcast

Play Episode Listen Later Oct 24, 2023 40:38


There are so many amazing and powerful AI image generators, from industry leaders like Midjourney and DALL-E to newer image generators getting released every week. So which one is right for you? Leonard Rodman, ChatGPT and Midjourney Consultant at Rodman.ai, joins us to go over the good, bad, and awesome image generators and how to prompt them to get what you're looking for.

Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Join the discussion: Ask Leonard and Jordan questions about AI image generators
Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn

Timestamps:
[00:01:15] Daily AI news
[00:03:40] About Leonard and Rodman.ai
[00:08:35] Good beginner AI image generator
[00:14:05] Ideogram - one of the originals
[00:17:00] Recognizing AI-generated photos
[00:25:00] Midjourney breakdown
[00:31:15] Audience questions

Topics Covered in This Episode:
1. Introduction to AI Image Generators
2. The Evolution of AI Image Generators
3. Techniques and Tips for Effective AI Image Generation
4. Identifying AI-Generated Images and Legal Considerations

Keywords: AI image generators, image generation, Leonardo, Midjourney, AI art, AI-generated images, AI technology, image recognition, digital photography, AI advancements, AI-generated storytelling, digital image manipulation, copyright laws, AI-generated venture capital pitches, marketing and advertising, image composition, AI-generated text, image modification, image quality, AI-generated photography, legal issues, DALL-E, digital cameras, AI-generated marketing, AI-generated storytelling, style subsets, image prompts, camera specifications, image ownership, image copyright, AI in different fields

Get more out of ChatGPT by learning our PPP method in this live, interactive and free training! Sign up now: https://youreverydayai.com/ppp-registration/