Podcasts about runwayml

35PODCASTS
41EPISODES
45mAVG DURATION
1MONTHLY NEW EPISODE
Aug 5, 2025LATEST

POPULARITY

20172018201920202021202220232024

Best podcasts about runwayml

AI For Humans

4 episodes with runwayml

This Week in Startups

2 episodes with runwayml

The ChatGPT Report

2 episodes with runwayml

Latest podcast episodes about runwayml

Episode 563: AI Video Prompt Master Review

Hot Internet Marketing Products

Play Episode Listen Later Aug 5, 2025 3:52

AI Video Prompt Master - https://www.marketingsharks.com/ai-video-prompt-master-review/AI Video Prompt Master – Viral & Cinematic Agent – AI Video Prompt Master – a GPT agent that creates pro-level video prompts for TikTok, Reels & Shorts using cinematic AI tokens. Perfect for creators, affiliates, and marketers – powerful upsells including full PLR + training. No tech skills needed.AI Video Prompt Master is a powerful GPT agent that creates scroll-stopping video prompts for TikTok, Reels & Shorts—perfect for affiliates, marketers & faceless creators. The funnel includes PLR rights to rebrand & resell the agent (OTO1), 2 bonus agents with PLR (OTO2), and full video training to create & launch your own GPT agents (OTO3).Generate AI-optimized prompts (6 different modes, narrative, JSON…) for platforms like Veo 2 & 3, Kling Master, SeeDance, Hailuo, Sora, RunwayML, and Pika Labs with our advanced layered system.

tiktok ai master shorts reels gpt sora prompt veo json plr runwayml

5 Tools to Build Viral Video Ads With AI | Andrew Erickson

Lunch With Norm - The Amazon FBA & eCommerce Podcast

Play Episode Listen Later May 12, 2025 50:02

Levanta Shift Calculator -

S5 E170 Machinima News Omnibus (Feb 2025)

And Now For Something Completely Machinima

Play Episode Listen Later Feb 20, 2025 54:09

A packed news episode, with Tracy's analysis of the AI art copyright ruling decision and a discussion with Damien about his hit fan series, Heir to the Empire - the Star Wars story that everyone always wanted to see made. Do comment on the discussion (on the YouTube channel or the Blog post). The links are available in the show notes.0:45 Tracy is the first woman on Mars!!!2:38 RunwayML's third annual film festival – call for submissions now open4:20 Second Life evolves [again]7:00 Bloomsbury's video series: Richard Williams' Animation Series7:30 Projects: Gladiator II10:00 The Freeman Chronicles – a Half Life real life film!11:30 For Phil: a Tom Cook Elite Dangerous; for Damien: a Mandalorian cinematic made in Unreal; for Ricky, a Godot 4.3 cinematic12:40 The US AI art copyright ruling – what do you think?21:22 50 series release of NVIDIA's graphics cards23:19 An HL3 teaser?24:10 Hollywood Animal – The Movies as a business26:05 The TMU Tribute Show to Ken White26:55 Astartes 2, Warhammer official film28:40 Sims 1 and 2 re-released31:33 Rooster Teeth is back?!32:55 Tracy asks Damien about his Heir to the Empire hit fan series and what comes nextCredits -Speakers: Damien Valentine, Tracy HarwoodProducer: Damien ValentineEditor: Phil RiceMusic: Animo Domini Beats

#102 - ChatGPT Pro | NotebookLM im Recruiting | KI in Bewerbungen | Social Media Verbot

Zielgruppengerecht - Der Recruiting Tech Talk

Play Episode Listen Later Feb 4, 2025 36:07

Jede Menge KI-Updates: ChatGPT Pro, Project Astra und Project Mariner von Google Gemini 2.0 und NotebookLM. Natürlich beleuchten Jan und Robin wie immer deren Bedeutung für's Recruiting und mögliche Anwendungsbeispielen.

social media chatgpt recruiting bedeutung verbot bewerbungen google gemini notebooklm runwayml

[Ride Home] Simon Willison: Things we learned about LLMs in 2024

Latent Space: The AI Engineer Podcast â€” CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Jan 12, 2025 73:23

Due to overwhelming demand (>15x applications:slots), we are closing CFPs for AI Engineer Summit NYC today. Last call! Thanks, we'll be reaching out to all shortly!The world's top AI blogger and friend of every pod, Simon Willison, dropped a monster 2024 recap: Things we learned about LLMs in 2024. Brian of the excellent TechMeme Ride Home pinged us for a connection and a special crossover episode, our first in 2025. The target audience for this podcast is a tech-literate, but non-technical one. You can see Simon's notes for AI Engineers in his World's Fair Keynote.Timestamp* 00:00 Introduction and Guest Welcome* 01:06 State of AI in 2025* 01:43 Advancements in AI Models* 03:59 Cost Efficiency in AI* 06:16 Challenges and Competition in AI* 17:15 AI Agents and Their Limitations* 26:12 Multimodal AI and Future Prospects* 35:29 Exploring Video Avatar Companies* 36:24 AI Influencers and Their Future* 37:12 Simplifying Content Creation with AI* 38:30 The Importance of Credibility in AI* 41:36 The Future of LLM User Interfaces* 48:58 Local LLMs: A Growing Interest* 01:07:22 AI Wearables: The Next Big Thing* 01:10:16 Wrapping Up and Final ThoughtsTranscript[00:00:00] Introduction and Guest Welcome[00:00:00] Brian: Welcome to the first bonus episode of the Tech Meme Write Home for the year 2025. I'm your host as always, Brian McCullough. Listeners to the pod over the last year know that I have made a habit of quoting from Simon Willison when new stuff happens in AI from his blog. Simon has been, become a go to for many folks in terms of, you know, Analyzing things, criticizing things in the AI space.[00:00:33] Brian: I've wanted to talk to you for a long time, Simon. So thank you for coming on the show. No, it's a privilege to be here. And the person that made this connection happen is our friend Swyx, who has been on the show back, even going back to the, the Twitter Spaces days but also an AI guru in, in their own right Swyx, thanks for coming on the show also.[00:00:54] swyx (2): Thanks. I'm happy to be on and have been a regular listener, so just happy to [00:01:00] contribute as well.[00:01:00] Brian: And a good friend of the pod, as they say. Alright, let's go right into it.[00:01:06] State of AI in 2025[00:01:06] Brian: Simon, I'm going to do the most unfair, broad question first, so let's get it out of the way. The year 2025. Broadly, what is the state of AI as we begin this year?[00:01:20] Brian: Whatever you want to say, I don't want to lead the witness.[00:01:22] Simon: Wow. So many things, right? I mean, the big thing is everything's got really good and fast and cheap. Like, that was the trend throughout all of 2024. The good models got so much cheaper, they got so much faster, they got multimodal, right? The image stuff isn't even a surprise anymore.[00:01:39] Simon: They're growing video, all of that kind of stuff. So that's all really exciting.[00:01:43] Advancements in AI Models[00:01:43] Simon: At the same time, they didn't get massively better than GPT 4, which was a bit of a surprise. So that's sort of one of the open questions is, are we going to see huge, but I kind of feel like that's a bit of a distraction because GPT 4, but way cheaper, much larger context lengths, and it [00:02:00] can do multimodal.[00:02:01] Simon: is better, right? That's a better model, even if it's not.[00:02:05] Brian: What people were expecting or hoping, maybe not expecting is not the right word, but hoping that we would see another step change, right? Right. From like GPT 2 to 3 to 4, we were expecting or hoping that maybe we were going to see the next evolution in that sort of, yeah.[00:02:21] Brian: We[00:02:21] Simon: did see that, but not in the way we expected. We thought the model was just going to get smarter, and instead we got. Massive drops in, drops in price. We got all of these new capabilities. You can talk to the things now, right? They can do simulated audio input, all of that kind of stuff. And so it's kind of, it's interesting to me that the models improved in all of these ways we weren't necessarily expecting.[00:02:43] Simon: I didn't know it would be able to do an impersonation of Santa Claus, like a, you know, Talked to it through my phone and show it what I was seeing by the end of 2024. But yeah, we didn't get that GPT 5 step. And that's one of the big open questions is, is that actually just around the corner and we'll have a bunch of GPT 5 class models drop in the [00:03:00] next few months?[00:03:00] Simon: Or is there a limit?[00:03:03] Brian: If you were a betting man and wanted to put money on it, do you expect to see a phase change, step change in 2025?[00:03:11] Simon: I don't particularly for that, like, the models, but smarter. I think all of the trends we're seeing right now are going to keep on going, especially the inference time compute, right?[00:03:21] Simon: The trick that O1 and O3 are doing, which means that you can solve harder problems, but they cost more and it churns away for longer. I think that's going to happen because that's already proven to work. I don't know. I don't know. Maybe there will be a step change to a GPT 5 level, but honestly, I'd be completely happy if we got what we've got right now.[00:03:41] Simon: But cheaper and faster and more capabilities and longer contexts and so forth. That would be thrilling to me.[00:03:46] Brian: Digging into what you've just said one of the things that, by the way, I hope to link in the show notes to Simon's year end post about what, what things we learned about LLMs in 2024. Look for that in the show notes.[00:03:59] Cost Efficiency in AI[00:03:59] Brian: One of the things that you [00:04:00] did say that you alluded to even right there was that in the last year, you felt like the GPT 4 barrier was broken, like IE. Other models, even open source ones are now regularly matching sort of the state of the art.[00:04:13] Simon: Well, it's interesting, right? So the GPT 4 barrier was a year ago, the best available model was OpenAI's GPT 4 and nobody else had even come close to it.[00:04:22] Simon: And they'd been at the, in the lead for like nine months, right? That thing came out in what, February, March of, of 2023. And for the rest of 2023, nobody else came close. And so at the start of last year, like a year ago, the big question was, Why has nobody beaten them yet? Like, what do they know that the rest of the industry doesn't know?[00:04:40] Simon: And today, that I've counted 18 organizations other than GPT 4 who've put out a model which clearly beats that GPT 4 from a year ago thing. Like, maybe they're not better than GPT 4. 0, but that's, that, that, that barrier got completely smashed. And yeah, a few of those I've run on my laptop, which is wild to me.[00:04:59] Simon: Like, [00:05:00] it was very, very wild. It felt very clear to me a year ago that if you want GPT 4, you need a rack of 40, 000 GPUs just to run the thing. And that turned out not to be true. Like the, the, this is that big trend from last year of the models getting more efficient, cheaper to run, just as capable with smaller weights and so forth.[00:05:20] Simon: And I ran another GPT 4 model on my laptop this morning, right? Microsoft 5. 4 just came out. And that, if you look at the benchmarks, it's definitely, it's up there with GPT 4. 0. It's probably not as good when you actually get into the vibes of the thing, but it, it runs on my, it's a 14 gigabyte download and I can run it on a MacBook Pro.[00:05:38] Simon: Like who saw that coming? The most exciting, like the close of the year on Christmas day, just a few weeks ago, was when DeepSeek dropped their DeepSeek v3 model on Hugging Face without even a readme file. It was just like a giant binary blob that I can't run on my laptop. It's too big. But in all of the benchmarks, it's now by far the best available [00:06:00] open, open weights model.[00:06:01] Simon: Like it's, it's, it's beating the, the metalamas and so forth. And that was trained for five and a half million dollars, which is a tenth of the price that people thought it costs to train these things. So everything's trending smaller and faster and more efficient.[00:06:15] Brian: Well, okay.[00:06:16] Challenges and Competition in AI[00:06:16] Brian: I, I kind of was going to get to that later, but let's, let's combine this with what I was going to ask you next, which is, you know, you're talking, you know, Also in the piece about the LLM prices crashing, which I've even seen in projects that I'm working on, but explain Explain that to a general audience, because we hear all the time that LLMs are eye wateringly expensive to run, but what we're suggesting, and we'll come back to the cheap Chinese LLM, but first of all, for the end user, what you're suggesting is that we're starting to see the cost come down sort of in the traditional technology way of Of costs coming down over time,[00:06:49] Simon: yes, but very aggressively.[00:06:51] Simon: I mean, my favorite thing, the example here is if you look at GPT-3, so open AI's g, PT three, which was the best, a developed model in [00:07:00] 2022 and through most of 20 2023. That, the models that we have today, the OpenAI models are a hundred times cheaper. So there was a 100x drop in price for OpenAI from their best available model, like two and a half years ago to today.[00:07:13] Simon: And[00:07:14] Brian: just to be clear, not to train the model, but for the use of tokens and things. Exactly,[00:07:20] Simon: for running prompts through them. And then When you look at the, the really, the top tier model providers right now, I think, are OpenAI, Anthropic, Google, and Meta. And there are a bunch of others that I could list there as well.[00:07:32] Simon: Mistral are very good. The, the DeepSeq and Quen models have got great. There's a whole bunch of providers serving really good models. But even if you just look at the sort of big brand name providers, they all offer models now that are A fraction of the price of the, the, of the models we were using last year.[00:07:49] Simon: I think I've got some numbers that I threw into my blog entry here. Yeah. Like Gemini 1. 5 flash, that's Google's fast high quality model is [00:08:00] how much is that? It's 0. 075 dollars per million tokens. Like these numbers are getting, So we just do cents per million now,[00:08:09] swyx (2): cents per million,[00:08:10] Simon: cents per million makes, makes a lot more sense.[00:08:12] Simon: Yeah they have one model 1. 5 flash 8B, the absolute cheapest of the Google models, is 27 times cheaper than GPT 3. 5 turbo was a year ago. That's it. And GPT 3. 5 turbo, that was the cheap model, right? Now we've got something 27 times cheaper, and the Google, this Google one can do image recognition, it can do million token context, all of those tricks.[00:08:36] Simon: But it's, it's, it's very, it's, it really is startling how inexpensive some of this stuff has got.[00:08:41] Brian: Now, are we assuming that this, that happening is directly the result of competition? Because again, you know, OpenAI, and probably they're doing this for their own almost political reasons, strategic reasons, keeps saying, we're losing money on everything, even the 200.[00:08:56] Brian: So they probably wouldn't, the prices wouldn't be [00:09:00] coming down if there wasn't intense competition in this space.[00:09:04] Simon: The competition is absolutely part of it, but I have it on good authority from sources I trust that Google Gemini is not operating at a loss. Like, the amount of electricity to run a prompt is less than they charge you.[00:09:16] Simon: And the same thing for Amazon Nova. Like, somebody found an Amazon executive and got them to say, Yeah, we're not losing money on this. I don't know about Anthropic and OpenAI, but clearly that demonstrates it is possible to run these things at these ludicrously low prices and still not be running at a loss if you discount the Army of PhDs and the, the training costs and all of that kind of stuff.[00:09:36] Brian: One, one more for me before I let Swyx jump in here. To, to come back to DeepSeek and this idea that you could train, you know, a cutting edge model for 6 million. I, I was saying on the show, like six months ago, that if we are getting to the point where each new model It would cost a billion, ten billion, a hundred billion to train that.[00:09:54] Brian: At some point it would almost, only nation states would be able to train the new models. Do you [00:10:00] expect what DeepSeek and maybe others are proving to sort of blow that up? Or is there like some sort of a parallel track here that maybe I'm not technically, I don't have the mouse to understand the difference.[00:10:11] Brian: Is the model, are the models going to go, you know, Up to a hundred billion dollars or can we get them down? Sort of like DeepSeek has proven[00:10:18] Simon: so I'm the wrong person to answer that because I don't work in the lab training these models. So I can give you my completely uninformed opinion, which is, I felt like the DeepSeek thing.[00:10:27] Simon: That was a bomb shell. That was an absolute bombshell when they came out and said, Hey, look, we've trained. One of the best available models and it cost us six, five and a half million dollars to do it. I feel, and they, the reason, one of the reasons it's so efficient is that we put all of these export controls in to stop Chinese companies from giant buying GPUs.[00:10:44] Simon: So they've, were forced to be, go as efficient as possible. And yet the fact that they've demonstrated that that's possible to do. I think it does completely tear apart this, this, this mental model we had before that yeah, the training runs just keep on getting more and more expensive and the number of [00:11:00] organizations that can afford to run these training runs keeps on shrinking.[00:11:03] Simon: That, that's been blown out of the water. So yeah, that's, again, this was our Christmas gift. This was the thing they dropped on Christmas day. Yeah, it makes me really optimistic that we can, there are, It feels like there was so much low hanging fruit in terms of the efficiency of both inference and training and we spent a whole bunch of last year exploring that and getting results from it.[00:11:22] Simon: I think there's probably a lot left. I think there's probably, well, I would not be surprised to see even better models trained spending even less money over the next six months.[00:11:31] swyx (2): Yeah. So I, I think there's a unspoken angle here on what exactly the Chinese labs are trying to do because DeepSea made a lot of noise.[00:11:41] swyx (2): so much for joining us for around the fact that they train their model for six million dollars and nobody quite quite believes them. Like it's very, very rare for a lab to trumpet the fact that they're doing it for so cheap. They're not trying to get anyone to buy them. So why [00:12:00] are they doing this? They make it very, very obvious.[00:12:05] swyx (2): Deepseek is about 150 employees. It's an order of magnitude smaller than at least Anthropic and maybe, maybe more so for OpenAI. And so what's, what's the end game here? Are they, are they just trying to show that the Chinese are better than us?[00:12:21] Simon: So Deepseek, it's the arm of a hedge, it's a, it's a quant fund, right?[00:12:25] Simon: It's an algorithmic quant trading thing. So I, I, I would love to get more insight into how that organization works. My assumption from what I've seen is it looks like they're basically just flexing. They're like, hey, look at how utterly brilliant we are with this amazing thing that we've done. And it's, it's working, right?[00:12:43] Simon: They but, and so is that it? Are they, is this just their kind of like, this is, this is why our company is so amazing. Look at this thing that we've done, or? I don't know. I'd, I'd love to get Some insight from, from within that industry as to, as to how that's all playing out.[00:12:57] swyx (2): The, the prevailing theory among the Local Llama [00:13:00] crew and the Twitter crew that I indexed for my newsletter is that there is some amount of copying going on.[00:13:06] swyx (2): It's like Sam Altman you know, tweet, tweeting about how they're being copied. And then also there's this, there, there are other sort of opening eye employees that have said, Stuff that is similar that DeepSeek's rate of progress is how U. S. intelligence estimates the number of foreign spies embedded in top labs.[00:13:22] swyx (2): Because a lot of these ideas do spread around, but they surprisingly have a very high density of them in the DeepSeek v3 technical report. So it's, it's interesting. We don't know how much, how many, how much tokens. I think that, you know, people have run analysis on how often DeepSeek thinks it is cloud or thinks it is opening GPC 4.[00:13:40] swyx (2): Thanks for watching! And we don't, we don't know. We don't know. I think for me, like, yeah, we'll, we'll, we basically will never know as, as external commentators. I think what's interesting is how, where does this go? Is there a logical floor or bottom by my estimations for the same amount of ELO started last year to the end of last year cost went down by a thousand X for the [00:14:00] GPT, for, for GPT 4 intelligence.[00:14:02] swyx (2): Would, do they go down a thousand X this year?[00:14:04] Simon: That's a fascinating question. Yeah.[00:14:06] swyx (2): Is there a Moore's law going on, or did we just get a one off benefit last year for some weird reason?[00:14:14] Simon: My uninformed hunch is low hanging fruit. I feel like up until a year ago, people haven't been focusing on efficiency at all. You know, it was all about, what can we get these weird shaped things to do?[00:14:24] Simon: And now once we've sort of hit that, okay, we know that we can get them to do what GPT 4 can do, When thousands of researchers around the world all focus on, okay, how do we make this more efficient? What are the most important, like, how do we strip out all of the weights that have stuff in that doesn't really matter?[00:14:39] Simon: All of that kind of thing. So yeah, maybe that was it. Maybe 2024 was a freak year of all of the low hanging fruit coming out at once. And we'll actually see a reduction in the, in that rate of improvement in terms of efficiency. I wonder, I mean, I think we'll know for sure in about three months time if that trend's going to continue or not.[00:14:58] swyx (2): I agree. You know, I [00:15:00] think the other thing that you mentioned that DeepSeq v3 was the gift that was given from DeepSeq over Christmas, but I feel like the other thing that might be underrated was DeepSeq R1,[00:15:11] Speaker 4: which is[00:15:13] swyx (2): a reasoning model you can run on your laptop. And I think that's something that a lot of people are looking ahead to this year.[00:15:18] swyx (2): Oh, did they[00:15:18] Simon: release the weights for that one?[00:15:20] swyx (2): Yeah.[00:15:21] Simon: Oh my goodness, I missed that. I've been playing with the quen. So the other great, the other big Chinese AI app is Alibaba's quen. Actually, yeah, I, sorry, R1 is an API available. Yeah. Exactly. When that's really cool. So Alibaba's Quen have released two reasoning models that I've run on my laptop.[00:15:38] Simon: Now there was, the first one was Q, Q, WQ. And then the second one was QVQ because the second one's a vision model. So you can like give it vision puzzles and a prompt that these things, they are so much fun to run. Because they think out loud. It's like the OpenAR 01 sort of hides its thinking process. The Query ones don't.[00:15:59] Simon: They just, they [00:16:00] just churn away. And so you'll give it a problem and it will output literally dozens of paragraphs of text about how it's thinking. My favorite thing that happened with QWQ is I asked it to draw me a pelican on a bicycle in SVG. That's like my standard stupid prompt. And for some reason it thought in Chinese.[00:16:18] Simon: It spat out a whole bunch of like Chinese text onto my terminal on my laptop, and then at the end it gave me quite a good sort of artistic pelican on a bicycle. And I ran it all through Google Translate, and yeah, it was like, it was contemplating the nature of SVG files as a starting point. And the fact that my laptop can think in Chinese now is so delightful.[00:16:40] Simon: It's so much fun watching you do that.[00:16:43] swyx (2): Yeah, I think Andrej Karpathy was saying, you know, we, we know that we have achieved proper reasoning inside of these models when they stop thinking in English, and perhaps the best form of thought is in Chinese. But yeah, for listeners who don't know Simon's blog he always, whenever a new model comes out, you, I don't know how you do it, but [00:17:00] you're always the first to run Pelican Bench on these models.[00:17:02] swyx (2): I just did it for 5.[00:17:05] Simon: Yeah.[00:17:07] swyx (2): So I really appreciate that. You should check it out. These are not theoretical. Simon's blog actually shows them.[00:17:12] Brian: Let me put on the investor hat for a second.[00:17:15] AI Agents and Their Limitations[00:17:15] Brian: Because from the investor side of things, a lot of the, the VCs that I know are really hot on agents, and this is the year of agents, but last year was supposed to be the year of agents as well. Lots of money flowing towards, And Gentic startups.[00:17:32] Brian: But in in your piece that again, we're hopefully going to have linked in the show notes, you sort of suggest there's a fundamental flaw in AI agents as they exist right now. Let me let me quote you. And then I'd love to dive into this. You said, I remain skeptical as to their ability based once again, on the Challenge of gullibility.[00:17:49] Brian: LLMs believe anything you tell them, any systems that attempt to make meaningful decisions on your behalf, will run into the same roadblock. How good is a travel agent, or a digital assistant, or even a research tool, if it [00:18:00] can't distinguish truth from fiction? So, essentially, what you're suggesting is that the state of the art now that allows agents is still, it's still that sort of 90 percent problem, the edge problem, getting to the Or, or, or is there a deeper flaw?[00:18:14] Brian: What are you, what are you saying there?[00:18:16] Simon: So this is the fundamental challenge here and honestly my frustration with agents is mainly around definitions Like any if you ask anyone who says they're working on agents to define agents You will get a subtly different definition from each person But everyone always assumes that their definition is the one true one that everyone else understands So I feel like a lot of these agent conversations, people talking past each other because one person's talking about the, the sort of travel agent idea of something that books things on your behalf.[00:18:41] Simon: Somebody else is talking about LLMs with tools running in a loop with a cron job somewhere and all of these different things. You, you ask academics and they'll laugh at you because they've been debating what agents mean for over 30 years at this point. It's like this, this long running, almost sort of an in joke in that community.[00:18:57] Simon: But if we assume that for this purpose of this conversation, an [00:19:00] agent is something that, Which you can give a job and it goes off and it does that thing for you like, like booking travel or things like that. The fundamental challenge is, it's the reliability thing, which comes from this gullibility problem.[00:19:12] Simon: And a lot of my, my interest in this originally came from when I was thinking about prompt injections as a source of this form of attack against LLM systems where you deliberately lay traps out there for this LLM to stumble across,[00:19:24] Brian: and which I should say you have been banging this drum that no one's gotten any far, at least on solving this, that I'm aware of, right.[00:19:31] Brian: Like that's still an open problem. The two years.[00:19:33] Simon: Yeah. Right. We've been talking about this problem and like, a great illustration of this was Claude so Anthropic released Claude computer use a few months ago. Fantastic demo. You could fire up a Docker container and you could literally tell it to do something and watch it open a web browser and navigate to a webpage and click around and so forth.[00:19:51] Simon: Really, really, really interesting and fun to play with. And then, um. One of the first demos somebody tried was, what if you give it a web page that says download and run this [00:20:00] executable, and it did, and the executable was malware that added it to a botnet. So the, the very first most obvious dumb trick that you could play on this thing just worked, right?[00:20:10] Simon: So that's obviously a really big problem. If I'm going to send something out to book travel on my behalf, I mean, it's hard enough for me to figure out which airlines are trying to scam me and which ones aren't. Do I really trust a language model that believes the literal truth of anything that's presented to it to go out and do those things?[00:20:29] swyx (2): Yeah I definitely think there's, it's interesting to see Anthropic doing this because they used to be the safety arm of OpenAI that split out and said, you know, we're worried about letting this thing out in the wild and here they are enabling computer use for agents. Thanks. The, it feels like things have merged.[00:20:49] swyx (2): You know, I'm, I'm also fairly skeptical about, you know, this always being the, the year of Linux on the desktop. And this is the equivalent of this being the year of agents that people [00:21:00] are not predicting so much as wishfully thinking and hoping and praying for their companies and agents to work.[00:21:05] swyx (2): But I, I feel like things are. Coming along a little bit. It's to me, it's kind of like self driving. I remember in 2014 saying that self driving was just around the corner. And I mean, it kind of is, you know, like in, in, in the Bay area. You[00:21:17] Simon: get in a Waymo and you're like, Oh, this works. Yeah, but it's a slow[00:21:21] swyx (2): cook.[00:21:21] swyx (2): It's a slow cook over the next 10 years. We're going to hammer out these things and the cynical people can just point to all the flaws, but like, there are measurable or concrete progress steps that are being made by these builders.[00:21:33] Simon: There is one form of agent that I believe in. I believe, mostly believe in the research assistant form of agents.[00:21:39] Simon: The thing where you've got a difficult problem and, and I've got like, I'm, I'm on the beta for the, the Google Gemini 1. 5 pro with deep research. I think it's called like these names, these names. Right. But. I've been using that. It's good, right? You can give it a difficult problem and it tells you, okay, I'm going to look at 56 different websites [00:22:00] and it goes away and it dumps everything to its context and it comes up with a report for you.[00:22:04] Simon: And it's not, it won't work against adversarial websites, right? If there are websites with deliberate lies in them, it might well get caught out. Most things don't have that as a problem. And so I've had some answers from that which were genuinely really valuable to me. And that feels to me like, I can see how given existing LLM tech, especially with Google Gemini with its like million token contacts and Google with their crawl of the entire web and their, they've got like search, they've got search and cache, they've got a cache of every page and so forth.[00:22:35] Simon: That makes sense to me. And that what they've got right now, I don't think it's, it's not as good as it can be, obviously, but it's, it's, it's, it's a real useful thing, which they're going to start rolling out. So, you know, Perplexity have been building the same thing for a couple of years. That, that I believe in.[00:22:50] Simon: You know, if you tell me that you're going to have an agent that's a research assistant agent, great. The coding agents I mean, chat gpt code interpreter, Nearly two years [00:23:00] ago, that thing started writing Python code, executing the code, getting errors, rewriting it to fix the errors. That pattern obviously works.[00:23:07] Simon: That works really, really well. So, yeah, coding agents that do that sort of error message loop thing, those are proven to work. And they're going to keep on getting better, and that's going to be great. The research assistant agents are just beginning to get there. The things I'm critical of are the ones where you trust, you trust this thing to go out and act autonomously on your behalf, and make decisions on your behalf, especially involving spending money, like that.[00:23:31] Simon: I don't see that working for a very long time. That feels to me like an AGI level problem.[00:23:37] swyx (2): It's it's funny because I think Stripe actually released an agent toolkit which is one of the, the things I featured that is trying to enable these agents each to have a wallet that they can go and spend and have, basically, it's a virtual card.[00:23:49] swyx (2): It's not that, not that difficult with modern infrastructure. can[00:23:51] Simon: stick a 50 cap on it, then at least it's an honor. Can't lose more than 50.[00:23:56] Brian: You know I don't, I don't know if either of you know Rafat Ali [00:24:00] he runs Skift, which is a, a travel news vertical. And he, he, he constantly laughs at the fact that every agent thing is, we're gonna get rid of booking a, a plane flight for you, you know?[00:24:11] Brian: And, and I would point out that, like, historically, when the web started, the first thing everyone talked about is, You can go online and book a trip, right? So it's funny for each generation of like technological advance. The thing they always want to kill is the travel agent. And now they want to kill the webpage travel agent.[00:24:29] Simon: Like it's like I use Google flight search. It's great, right? If you gave me an agent to do that for me, it would save me, I mean, maybe 15 seconds of typing in my things, but I still want to see what my options are and go, yeah, I'm not flying on that airline, no matter how cheap they are.[00:24:44] swyx (2): Yeah. For listeners, go ahead.[00:24:47] swyx (2): For listeners, I think, you know, I think both of you are pretty positive on NotebookLM. And you know, we, we actually interviewed the NotebookLM creators, and there are actually two internal agents going on internally. The reason it takes so long is because they're running an agent loop [00:25:00] inside that is fairly autonomous, which is kind of interesting.[00:25:01] swyx (2): For one,[00:25:02] Simon: for a definition of agent loop, if you picked that particularly well. For one definition. And you're talking about the podcast side of this, right?[00:25:07] swyx (2): Yeah, the podcast side of things. They have a there's, there's going to be a new version coming out that, that we'll be featuring at our, at our conference.[00:25:14] Simon: That one's fascinating to me. Like NotebookLM, I think it's two products, right? On the one hand, it's actually a very good rag product, right? You dump a bunch of things in, you can run searches, that, that, it does a good job of. And then, and then they added the, the podcast thing. It's a bit of a, it's a total gimmick, right?[00:25:30] Simon: But that gimmick got them attention, because they had a great product that nobody paid any attention to at all. And then you add the unfeasibly good voice synthesis of the podcast. Like, it's just, it's, it's, it's the lesson.[00:25:43] Brian: It's the lesson of mid journey and stuff like that. If you can create something that people can post on socials, you don't have to lift a finger again to do any marketing for what you're doing.[00:25:53] Brian: Let me dig into Notebook LLM just for a second as a podcaster. As a [00:26:00] gimmick, it makes sense, and then obviously, you know, you dig into it, it sort of has problems around the edges. It's like, it does the thing that all sort of LLMs kind of do, where it's like, oh, we want to Wrap up with a conclusion.[00:26:12] Multimodal AI and Future Prospects[00:26:12] Brian: I always call that like the the eighth grade book report paper problem where it has to have an intro and then, you know But that's sort of a thing where because I think you spoke about this again in your piece at the year end About how things are going multimodal and how things are that you didn't expect like, you know vision and especially audio I think So that's another thing where, at least over the last year, there's been progress made that maybe you, you didn't think was coming as quick as it came.[00:26:43] Simon: I don't know. I mean, a year ago, we had one really good vision model. We had GPT 4 vision, was, was, was very impressive. And Google Gemini had just dropped Gemini 1. 0, which had vision, but nobody had really played with it yet. Like Google hadn't. People weren't taking Gemini [00:27:00] seriously at that point. I feel like it was 1.[00:27:02] Simon: 5 Pro when it became apparent that actually they were, they, they got over their hump and they were building really good models. And yeah, and they, to be honest, the video models are mostly still using the same trick. The thing where you divide the video up into one image per second and you dump that all into the context.[00:27:16] Simon: So maybe it shouldn't have been so surprising to us that long context models plus vision meant that the video was, was starting to be solved. Of course, it didn't. Not being, you, what you really want with videos, you want to be able to do the audio and the images at the same time. And I think the models are beginning to do that now.[00:27:33] Simon: Like, originally, Gemini 1. 5 Pro originally ignored the audio. It just did the, the, like, one frame per second video trick. As far as I can tell, the most recent ones are actually doing pure multimodal. But the things that opens up are just extraordinary. Like, the the ChatGPT iPhone app feature that they shipped as one of their 12 days of, of OpenAI, I really can be having a conversation and just turn on my video camera and go, Hey, what kind of tree is [00:28:00] this?[00:28:00] Simon: And so forth. And it works. And for all I know, that's just snapping a like picture once a second and feeding it into the model. The, the, the things that you can do with that as an end user are extraordinary. Like that, that to me, I don't think most people have cottoned onto the fact that you can now stream video directly into a model because it, it's only a few weeks old.[00:28:22] Simon: Wow. That's a, that's a, that's a, that's Big boost in terms of what kinds of things you can do with this stuff. Yeah. For[00:28:30] swyx (2): people who are not that close I think Gemini Flashes free tier allows you to do something like capture a photo, one photo every second or a minute and leave it on 24, seven, and you can prompt it to do whatever.[00:28:45] swyx (2): And so you can effectively have your own camera app or monitoring app that that you just prompt and it detects where it changes. It detects for, you know, alerts or anything like that, or describes your day. You know, and, and, and the fact that this is free I think [00:29:00] it's also leads into the previous point of it being the prices haven't come down a lot.[00:29:05] Simon: And even if you're paying for this stuff, like a thing that I put in my blog entry is I ran a calculation on what it would cost to process 68, 000 photographs in my photo collection, and for each one just generate a caption, and using Gemini 1. 5 Flash 8B, it would cost me 1. 68 to process 68, 000 images, which is, I mean, that, that doesn't make sense.[00:29:28] Simon: None of that makes sense. Like it's, it's a, for one four hundredth of a cent per image to generate captions now. So you can see why feeding in a day's worth of video just isn't even very expensive to process.[00:29:40] swyx (2): Yeah, I'll tell you what is expensive. It's the other direction. So we're here, we're talking about consuming video.[00:29:46] swyx (2): And this year, we also had a lot of progress, like probably one of the most excited, excited, anticipated launches of the year was Sora. We actually got Sora. And less exciting.[00:29:55] Simon: We did, and then VO2, Google's Sora, came out like three [00:30:00] days later and upstaged it. Like, Sora was exciting until VO2 landed, which was just better.[00:30:05] swyx (2): In general, I feel the media, or the social media, has been very unfair to Sora. Because what was released to the world, generally available, was Sora Lite. It's the distilled version of Sora, right? So you're, I did not[00:30:16] Simon: realize that you're absolutely comparing[00:30:18] swyx (2): the, the most cherry picked version of VO two, the one that they published on the marketing page to the, the most embarrassing version of the soa.[00:30:25] swyx (2): So of course it's gonna look bad, so, well, I got[00:30:27] Simon: access to the VO two I'm in the VO two beta and I've been poking around with it and. Getting it to generate pelicans on bicycles and stuff. I would absolutely[00:30:34] swyx (2): believe that[00:30:35] Simon: VL2 is actually better. Is Sora, so is full fat Sora coming soon? Do you know, when, when do we get to play with that one?[00:30:42] Simon: No one's[00:30:43] swyx (2): mentioned anything. I think basically the strategy is let people play around with Sora Lite and get info there. But the, the, keep developing Sora with the Hollywood studios. That's what they actually care about. Gotcha. Like the rest of us. Don't really know what to do with the video anyway. Right.[00:30:59] Simon: I mean, [00:31:00] that's my thing is I realized that for generative images and images and video like images We've had for a few years and I don't feel like they've broken out into the talented artist community yet Like lots of people are having fun with them and doing and producing stuff. That's kind of cool to look at but what I want you know that that movie everything everywhere all at once, right?[00:31:20] Simon: One, one ton of Oscars, utterly amazing film. The VFX team for that were five people, some of whom were watching YouTube videos to figure out what to do. My big question for, for Sora and and and Midjourney and stuff, what happens when a creative team like that starts using these tools? I want the creative geniuses behind everything, everywhere all at once.[00:31:40] Simon: What are they going to be able to do with this stuff in like a few years time? Because that's really exciting to me. That's where you take artists who are at the very peak of their game. Give them these new capabilities and see, see what they can do with them.[00:31:52] swyx (2): I should, I know a little bit here. So it should mention that, that team actually used RunwayML.[00:31:57] swyx (2): So there was, there was,[00:31:57] Simon: yeah.[00:31:59] swyx (2): I don't know how [00:32:00] much I don't. So, you know, it's possible to overstate this, but there are people integrating it. Generated video within their workflow, even pre SORA. Right, because[00:32:09] Brian: it's not, it's not the thing where it's like, okay, tomorrow we'll be able to do a full two hour movie that you prompt with three sentences.[00:32:15] Brian: It is like, for the very first part of, of, you know video effects in film, it's like, if you can get that three second clip, if you can get that 20 second thing that they did in the matrix that blew everyone's minds and took a million dollars or whatever to do, like, it's the, it's the little bits and pieces that they can fill in now that it's probably already there.[00:32:34] swyx (2): Yeah, it's like, I think actually having a layered view of what assets people need and letting AI fill in the low value assets. Right, like the background video, the background music and, you know, sometimes the sound effects. That, that maybe, maybe more palatable maybe also changes the, the way that you evaluate the stuff that's coming out.[00:32:57] swyx (2): Because people tend to, in social media, try to [00:33:00] emphasize foreground stuff, main character stuff. So you really care about consistency, and you, you really are bothered when, like, for example, Sorad. Botch's image generation of a gymnast doing flips, which is horrible. It's horrible. But for background crowds, like, who cares?[00:33:18] Brian: And by the way, again, I was, I was a film major way, way back in the day, like, that's how it started. Like things like Braveheart, where they filmed 10 people on a field, and then the computer could turn it into 1000 people on a field. Like, that's always been the way it's around the margins and in the background that first comes in.[00:33:36] Brian: The[00:33:36] Simon: Lord of the Rings movies were over 20 years ago. Although they have those giant battle sequences, which were very early, like, I mean, you could almost call it a generative AI approach, right? They were using very sophisticated, like, algorithms to model out those different battles and all of that kind of stuff.[00:33:52] Simon: Yeah, I know very little. I know basically nothing about film production, so I try not to commentate on it. But I am fascinated to [00:34:00] see what happens when, when these tools start being used by the real, the people at the top of their game.[00:34:05] swyx (2): I would say like there's a cultural war that is more that being fought here than a technology war.[00:34:11] swyx (2): Most of the Hollywood people are against any form of AI anyway, so they're busy Fighting that battle instead of thinking about how to adopt it and it's, it's very fringe. I participated here in San Francisco, one generative AI video creative hackathon where the AI positive artists actually met with technologists like myself and then we collaborated together to build short films and that was really nice and I think, you know, I'll be hosting some of those in my events going forward.[00:34:38] swyx (2): One thing that I think like I want to leave it. Give people a sense of it's like this is a recap of last year But then sometimes it's useful to walk away as well with like what can we expect in the future? I don't know if you got anything. I would also call out that the Chinese models here have made a lot of progress Hyde Law and Kling and God knows who like who else in the video arena [00:35:00] Also making a lot of progress like surprising him like I think maybe actually Chinese China is surprisingly ahead with regards to Open8 at least, but also just like specific forms of video generation.[00:35:12] Simon: Wouldn't it be interesting if a film industry sprung up in a country that we don't normally think of having a really strong film industry that was using these tools? Like, that would be a fascinating sort of angle on this. Mm hmm. Mm hmm.[00:35:25] swyx (2): Agreed. I, I, I Oh, sorry. Go ahead.[00:35:29] Exploring Video Avatar Companies[00:35:29] swyx (2): Just for people's Just to put it on people's radar as well, Hey Jen, there's like there's a category of video avatar companies that don't specifically, don't specialize in general video.[00:35:41] swyx (2): They only do talking heads, let's just say. And HeyGen sings very well.[00:35:45] Brian: Swyx, you know that that's what I've been using, right? Like, have, have I, yeah, right. So, if you see some of my recent YouTube videos and things like that, where, because the beauty part of the HeyGen thing is, I, I, I don't want to use the robot voice, so [00:36:00] I record the mp3 file for my computer, And then I put that into HeyGen with the avatar that I've trained it on, and all it does is the lip sync.[00:36:09] Brian: So it looks, it's not 100 percent uncanny valley beatable, but it's good enough that if you weren't looking for it, it's just me sitting there doing one of my clips from the show. And, yeah, so, by the way, HeyGen. Shout out to them.[00:36:24] AI Influencers and Their Future[00:36:24] swyx (2): So I would, you know, in terms of like the look ahead going, like, looking, reviewing 2024, looking at trends for 2025, I would, they basically call this out.[00:36:33] swyx (2): Meta tried to introduce AI influencers and failed horribly because they were just bad at it. But at some point that there will be more and more basically AI influencers Not in a way that Simon is but in a way that they are not human.[00:36:50] Simon: Like the few of those that have done well, I always feel like they're doing well because it's a gimmick, right?[00:36:54] Simon: It's a it's it's novel and fun to like Like that, the AI Seinfeld thing [00:37:00] from last year, the Twitch stream, you know, like those, if you're the only one or one of just a few doing that, you'll get, you'll attract an audience because it's an interesting new thing. But I just, I don't know if that's going to be sustainable longer term or not.[00:37:11] Simon: Like,[00:37:12] Simplifying Content Creation with AI[00:37:12] Brian: I'm going to tell you, Because I've had discussions, I can't name the companies or whatever, but, so think about the workflow for this, like, now we all know that on TikTok and Instagram, like, holding up a phone to your face, and doing like, in my car video, or walking, a walk and talk, you know, that's, that's very common, but also, if you want to do a professional sort of talking head video, you still have to sit in front of a camera, you still have to do the lighting, you still have to do the video editing, versus, if you can just record, what I'm saying right now, the last 30 seconds, If you clip that out as an mp3 and you have a good enough avatar, then you can put that avatar in front of Times Square, on a beach, or whatever.[00:37:50] Brian: So, like, again for creators, the reason I think Simon, we're on the verge of something, it, it just, it's not going to, I think it's not, oh, we're going to have [00:38:00] AI avatars take over, it'll be one of those things where it takes another piece of the workflow out and simplifies it. I'm all[00:38:07] Simon: for that. I, I always love this stuff.[00:38:08] Simon: I like tools. Tools that help human beings do more. Do more ambitious things. I'm always in favor of, like, that, that, that's what excites me about this entire field.[00:38:17] swyx (2): Yeah. We're, we're looking into basically creating one for my podcast. We have this guy Charlie, he's Australian. He's, he's not real, but he pre, he opens every show and we are gonna have him present all the shorts.[00:38:29] Simon: Yeah, go ahead.[00:38:30] The Importance of Credibility in AI[00:38:30] Simon: The thing that I keep coming back to is this idea of credibility like in a world that is full of like AI generated everything and so forth It becomes even more important that people find the sources of information that they trust and find people and find Sources that are credible and I feel like that's the one thing that LLMs and AI can never have is credibility, right?[00:38:49] Simon: ChatGPT can never stake its reputation on telling you something useful and interesting because That means nothing, right? It's a matrix multiplication. It depends on who prompted it and so forth. So [00:39:00] I'm always, and this is when I'm blogging as well, I'm always looking for, okay, who are the reliable people who will tell me useful, interesting information who aren't just going to tell me whatever somebody's paying them to tell, tell them, who aren't going to, like, type a one sentence prompt into an LLM and spit out an essay and stick it online.[00:39:16] Simon: And that, that to me, Like, earning that credibility is really important. That's why a lot of my ethics around the way that I publish are based on the idea that I want people to trust me. I want to do things that, that gain credibility in people's eyes so they will come to me for information as a trustworthy source.[00:39:32] Simon: And it's the same for the sources that I'm, I'm consulting as well. So that's something I've, I've been thinking a lot about that sort of credibility focus on this thing for a while now.[00:39:40] swyx (2): Yeah, you can layer or structure credibility or decompose it like so one thing I would put in front of you I'm not saying that you should Agree with this or accept this at all is that you can use AI to generate different Variations and then and you pick you as the final sort of last mile person that you pick The last output and [00:40:00] you put your stamp of credibility behind that like that everything's human reviewed instead of human origin[00:40:04] Simon: Yeah, if you publish something you need to be able to put it on the ground Publishing it.[00:40:08] Simon: You need to say, I will put my name to this. I will attach my credibility to this thing. And if you're willing to do that, then, then that's great.[00:40:16] swyx (2): For creators, this is huge because there's a fundamental asymmetry between starting with a blank slate versus choosing from five different variations.[00:40:23] Brian: Right.[00:40:24] Brian: And also the key thing that you just said is like, if everything that I do, if all of the words were generated by an LLM, if the voice is generated by an LLM. If the video is also generated by the LLM, then I haven't done anything, right? But if, if one or two of those, you take a shortcut, but it's still, I'm willing to sign off on it.[00:40:47] Brian: Like, I feel like that's where I feel like people are coming around to like, this is maybe acceptable, sort of.[00:40:53] Simon: This is where I've been pushing the definition. I love the term slop. Where I've been pushing the definition of slop as AI generated [00:41:00] content that is both unrequested and unreviewed and the unreviewed thing is really important like that's the thing that elevates something from slop to not slop is if A human being has reviewed it and said, you know what, this is actually worth other people's time.[00:41:12] Simon: And again, I'm willing to attach my credibility to it and say, hey, this is worthwhile.[00:41:16] Brian: It's, it's, it's the cura curational, curatorial and editorial part of it that no matter what the tools are to do shortcuts, to do, as, as Swyx is saying choose between different edits or different cuts, but in the end, if there's a curatorial mind, Or editorial mind behind it.[00:41:32] Brian: Let me I want to wedge this in before we start to close.[00:41:36] The Future of LLM User Interfaces[00:41:36] Brian: One of the things coming back to your year end piece that has been a something that I've been banging the drum about is when you're talking about LLMs. Getting harder to use. You said most users are thrown in at the deep end.[00:41:48] Brian: The default LLM chat UI is like taking brand new computer users, dropping them into a Linux terminal and expecting them to figure it all out. I mean, it's, it's literally going back to the command line. The command line was defeated [00:42:00] by the GUI interface. And this is what I've been banging the drum about is like, this cannot be.[00:42:05] Brian: The user interface, what we have now cannot be the end result. Do you see any hints or seeds of a GUI moment for LLM interfaces?[00:42:17] Simon: I mean, it has to happen. It absolutely has to happen. The the, the, the, the usability of these things is turning into a bit of a crisis. And we are at least seeing some really interesting innovation in little directions.[00:42:28] Simon: Just like OpenAI's chat GPT canvas thing that they just launched. That is at least. Going a little bit more interesting than just chat, chats and responses. You know, you can, they're exploring that space where you're collaborating with an LLM. You're both working in the, on the same document. That makes a lot of sense to me.[00:42:44] Simon: Like that, that feels really smart. The one of the best things is still who was it who did the, the UI where you could, they had a drawing UI where you draw an interface and click a button. TL draw would then make it real thing. That was spectacular, [00:43:00] absolutely spectacular, like, alternative vision of how you'd interact with these models.[00:43:05] Simon: Because yeah, the and that's, you know, so I feel like there is so much scope for innovation there and it is beginning to happen. Like, like, I, I feel like most people do understand that we need to do better in terms of interfaces that both help explain what's going on and give people better tools for working with models.[00:43:23] Simon: I was going to say, I want to[00:43:25] Brian: dig a little deeper into this because think of the conceptual idea behind the GUI, which is instead of typing into a command line open word. exe, it's, you, you click an icon, right? So that's abstracting away sort of the, again, the programming stuff that like, you know, it's, it's a, a, a child can tap on an iPad and, and make a program open, right?[00:43:47] Brian: The problem it seems to me right now with how we're interacting with LLMs is it's sort of like you know a dumb robot where it's like you poke it and it goes over here, but no, I want it, I want to go over here so you poke it this way and you can't get it exactly [00:44:00] right, like, what can we abstract away from the From the current, what's going on that, that makes it more fine tuned and easier to get more precise.[00:44:12] Brian: You see what I'm saying?[00:44:13] Simon: Yes. And the this is the other trend that I've been following from the last year, which I think is super interesting. It's the, the prompt driven UI development thing. Basically, this is the pattern where Claude Artifacts was the first thing to do this really well. You type in a prompt and it goes, Oh, I should answer that by writing a custom HTML and JavaScript application for you that does a certain thing.[00:44:35] Simon: And when you think about that take and since then it turns out This is easy, right? Every decent LLM can produce HTML and JavaScript that does something useful. So we've actually got this alternative way of interacting where they can respond to your prompt with an interactive custom interface that you can work with.[00:44:54] Simon: People haven't quite wired those back up again. Like, ideally, I'd want the LLM ask me a [00:45:00] question where it builds me a custom little UI, For that question, and then it gets to see how I interacted with that. I don't know why, but that's like just such a small step from where we are right now. But that feels like such an obvious next step.[00:45:12] Simon: Like an LLM, why should it, why should you just be communicating with, with text when it can build interfaces on the fly that let you select a point on a map or or move like sliders up and down. It's gonna create knobs and dials. I keep saying knobs and dials. right. We can do that. And the LLMs can build, and Claude artifacts will build you a knobs and dials interface.[00:45:34] Simon: But at the moment they haven't closed the loop. When you twiddle those knobs, Claude doesn't see what you were doing. They're going to close that loop. I'm, I'm shocked that they haven't done it yet. So yeah, I think there's so much scope for innovation and there's so much scope for doing interesting stuff with that model where the LLM, anything you can represent in SVG, which is almost everything, can now be part of that ongoing conversation.[00:45:59] swyx (2): Yeah, [00:46:00] I would say the best executed version of this I've seen so far is Bolt where you can literally type in, make a Spotify clone, make an Airbnb clone, and it actually just does that for you zero shot with a nice design.[00:46:14] Simon: There's a benchmark for that now. The LMRena people now have a benchmark that is zero shot app, app generation, because all of the models can do it.[00:46:22] Simon: Like it's, it's, I've started figuring out. I'm building my own version of this for my own project, because I think within six months. I think it'll just be an expected feature. Like if you have a web application, why don't you have a thing where, oh, look, the, you can add a custom, like, so for my dataset data exploration project, I want you to be able to do things like conjure up a dashboard, just via a prompt.[00:46:43] Simon: You say, oh, I need a pie chart and a bar chart and put them next to each other, and then have a form where submitting the form inserts a row into my database table. And this is all suddenly feasible. It's, it's, it's not even particularly difficult to do, which is great. Utterly bizarre that these things are now easy.[00:47:00][00:47:00] swyx (2): I think for a general audience, that is what I would highlight, that software creation is becoming easier and easier. Gemini is now available in Gmail and Google Sheets. I don't write my own Google Sheets formulas anymore, I just tell Gemini to do it. And so I think those are, I almost wanted to basically somewhat disagree with, with your assertion that LMS got harder to use.[00:47:22] swyx (2): Like, yes, we, we expose more capabilities, but they're, they're in minor forms, like using canvas, like web search in, in in chat GPT and like Gemini being in, in Excel sheets or in Google sheets, like, yeah, we're getting, no,[00:47:37] Simon: no, no, no. Those are the things that make it harder, because the problem is that for each of those features, they're amazing.[00:47:43] Simon: If you understand the edges of the feature, if you're like, okay, so in Google, Gemini, Excel formulas, I can get it to do a certain amount of things, but I can't get it to go and read a web. You probably can't get it to read a webpage, right? But you know, there are, there are things that it can do and things that it can't do, which are completely undocumented.[00:47:58] Simon: If you ask it what it [00:48:00] can and can't do, they're terrible at answering questions about that. So like my favorite example is Claude artifacts. You can't build a Claude artifact that can hit an API somewhere else. Because the cause headers on that iframe prevents accessing anything outside of CDNJS. So, good luck learning cause headers as an end user in order to understand why Like, I've seen people saying, oh, this is rubbish.[00:48:26] Simon: I tried building an artifact that would run a prompt and it couldn't because Claude didn't expose an API with cause headers that all of this stuff is so weird and complicated. And yeah, like that, that, the more that with the more tools we add, the more expertise you need to really, To understand the full scope of what you can do.[00:48:44] Simon: And so it's, it's, I wouldn't say it's, it's, it's, it's like, the question really comes down to what does it take to understand the full extent of what's possible? And honestly, that, that's just getting more and more involved over time.[00:48:58] Local LLMs: A Growing Interest[00:48:58] swyx (2): I have one more topic that I, I [00:49:00] think you, you're kind of a champion of and we've touched on it a little bit, which is local LLMs.[00:49:05] swyx (2): And running AI applications on your desktop, I feel like you are an early adopter of many, many things.[00:49:12] Simon: I had an interesting experience with that over the past year. Six months ago, I almost completely lost interest. And the reason is that six months ago, the best local models you could run, There was no point in using them at all, because the best hosted models were so much better.[00:49:26] Simon: Like, there was no point at which I'd choose to run a model on my laptop if I had API access to Cloud 3. 5 SONNET. They just, they weren't even comparable. And that changed, basically, in the past three months, as the local models had this step changing capability, where now I can run some of these local models, and they're not as good as Cloud 3.[00:49:45] Simon: 5 SONNET, but they're not so far away that It's not worth me even using them. The other, the, the, the, the continuing problem is I've only got 64 gigabytes of RAM, and if you run, like, LLAMA370B, it's not going to work. Most of my RAM is gone. So now I have to shut down my Firefox tabs [00:50:00] and, and my Chrome and my VS Code windows in order to run it.[00:50:03] Simon: But it's got me interested again. Like, like the, the efficiency improvements are such that now, if you were to like stick me on a desert island with my laptop, I'd be very productive using those local models. And that's, that's pretty exciting. And if those trends continue, and also, like, I think my next laptop, if when I buy one is going to have twice the amount of RAM, At which point, maybe I can run the, almost the top tier, like open weights models and still be able to use it as a computer as well.[00:50:32] Simon: NVIDIA just announced their 3, 000 128 gigabyte monstrosity. That's pretty good price. You know, that's that's, if you're going to buy it,[00:50:42] swyx (2): custom OS and all.[00:50:46] Simon: If I get a job, if I, if, if, if I have enough of an income that I can justify blowing $3,000 on it, then yes.[00:50:52] swyx (2): Okay, let's do a GoFundMe to get Simon one it.[00:50:54] swyx (2): Come on. You know, you can get a job anytime you want. Is this, this is just purely discretionary .[00:50:59] Simon: I want, [00:51:00] I want a job that pays me to do exactly what I'm doing already and doesn't tell me what else to do. That's, thats the challenge.[00:51:06] swyx (2): I think Ethan Molik does pretty well. Whatever, whatever it is he's doing.[00:51:11] swyx (2): But yeah, basically I was trying to bring in also, you know, not just local models, but Apple intelligence is on every Mac machine. You're, you're, you seem skeptical. It's rubbish.[00:51:21] Simon: Apple intelligence is so bad. It's like, it does one thing well.[00:51:25] swyx (2): Oh yeah, what's that? It summarizes notifications. And sometimes it's humorous.[00:51:29] Brian: Are you sure it does that well? And also, by the way, the other, again, from a sort of a normie point of view. There's no indication from Apple of when to use it. Like, everybody upgrades their thing and it's like, okay, now you have Apple Intelligence, and you never know when to use it ever again.[00:51:47] swyx (2): Oh, yeah, you consult the Apple docs, which is MKBHD.[00:51:49] swyx (2): The[00:51:51] Simon: one thing, the one thing I'll say about Apple Intelligence is, One of the reasons it's so disappointing is that the models are just weak, but now, like, Llama 3b [00:52:00] is Such a good model in a 2 gigabyte file I think give Apple six months and hopefully they'll catch up to the state of the art on the small models And then maybe it'll start being a lot more interesting.[00:52:10] swyx (2): Yeah. Anyway, I like This was year one And and you know just like our first year of iPhone maybe maybe not that much of a hit and then year three They had the App Store so Hey I would say give it some time, and you know, I think Chrome also shipping Gemini Nano I think this year in Chrome, which means that every app, every web app will have for free access to a local model that just ships in the browser, which is kind of interesting.[00:52:38] swyx (2): And then I, I think I also wanted to just open the floor for any, like, you know, any of us what are the apps that, you know, AI applications that we've adopted that have, that we really recommend because these are all, you know, apps that are running on our browser that like, or apps that are running locally that we should be, that, that other people should be trying.[00:52:55] swyx (2): Right? Like, I, I feel like that's, that's one always one thing that is helpful at the start of the [00:53:00] year.[00:53:00] Simon: Okay. So for running local models. My top picks, firstly, on the iPhone, there's this thing called MLC Chat, which works, and it's easy to install, and it runs Llama 3B, and it's so much fun. Like, it's not necessarily a capable enough novel that I use it for real things, but my party trick right now is I get my phone to write a Netflix Christmas movie plot outline where, like, a bunch of Jeweller falls in love with the King of Sweden or whatever.[00:53:25] Simon: And it does a good job and it comes up with pun names for the movies. And that's, that's deeply entertaining. On my laptop, most recently, I've been getting heavy into, into Olama because the Olama team are very, very good at finding the good models and patching them up and making them work well. It gives you an API.[00:53:42] Simon: My little LLM command line tool that has a plugin that talks to Olama, which works really well. So that's my, my Olama is. I think the easiest on ramp to to running models locally, if you want a nice user interface, LMStudio is, I think, the best user interface [00:54:00] thing at that. It's not open source. It's good.[00:54:02] Simon: It's worth playing with. The other one that I've been trying with recently, there's a thing called, what's it called? Open web UI or something. Yeah. The UI is fantastic. It, if you've got Olama running and you fire this thing up, it spots Olama and it gives you an interface onto your Olama models. And t

christmas god new york amazon spotify california tiktok world new york city ai english google hollywood apple science future talk online state challenges san francisco phd chinese australian speaker fighting european union microsoft tools army open iphone wrap twitch oscars white house competition sweden silicon valley massive os airbnb act cloud mac ipads santa claus ces sort fantastic publishing explain ram gofundme bay whispers excel cnbc analyzing fintech final thoughts gentlemen openai gemini sf adobe app store api limitless talked riverside chrome gmail wrapping up qu'en python ui mm credibility gpt times square alibaba linux acceptable llama bolt stripe honored vcs javascript html tl instacart sora macbook pro llm sam altman variations vo advancements braveheart firefox vfx generated 3b agi elo wearables midjourney docker gotcha waymo gpus sonnets perplexity anthropic google translate deep sea vps google sheets lms r1 rosebud query 8b vo2 google gemini things we learned kling notebooklm svg vs code future prospects botch chinese ai ai models asr ride home netflix christmas mkbhd dataset gpc rabbit r1 skift 3d tv o3 cfps jeweller o1 heygen andrej karpathy i oh wq brian mccullough brian to'o rafat ali brian it california sb simon willison brian yeah brian you latent space runwayml brian so techmeme ride home brian there brian well brian right simon it chinese china simon you

#120 Sora is here…Im not kidding this time

The ChatGPT Report

Play Episode Listen Later Dec 12, 2024 13:44

In this episode of the ChatGPT Report, host Ryan dives deep into the latest buzz surrounding OpenAI's Sora, the new text-to-video technology that's been making waves in the AI community. Drawing insights from early user experiences, Ryan breaks down the current state of Sora, highlighting its limitations and potential. From choppy motion and inconsistent prompt adherence to the hefty pricing model, he offers a candid look at what users can expect from this emerging technology. The episode also explores broader implications of AI tool accessibility, discussing the rising costs of advanced AI services like Sora and GPT-4, and how these pricing strategies might widen the economic divide. Ryan touches on OpenAI's recent announcements, including Sora, full O1, and reinforcement fine-tuning, while speculating on upcoming releases like API access, agent previews, and advanced voice modes. With comparisons to competitors like RunwayML and reflections on the rapid evolution of AI technologies, this episode provides a nuanced perspective on the current state of generative AI.

ai drawing openai api gpt kidding sora o1 runwayml

Çakal Stajyer Yapay Zeka Ajanlığı Yaparsa - Yapay Zekada Bu Hafta S2 B4

TeknoSafari's Podcast

Play Episode Listen Later Nov 14, 2024 19:57

1. Claude bilgisayarınızı görebiliyor! 2. Perplexity NotebookLM'e rakip olmaya çalışıyor. Ayrıca Reasoningle de iddialı. Öte yandan #WallStreetJournal ve #NewYorkPost telif hakkı ihlali ve marka hasarıyla suçlayarak #PerplexityAI ye dava açıyor. 3. Ideogram, CANVAS ile zirveye çıktı. 4. SOTA videoda çok iyi ve açık kaynaklı. Mochi 1 kesinlikle denemeye değer. 5. Flux gelince konu kapanmadı, Stabble Diffusion 3.5 geldi 6. RunwayML, At-One ile atakta 7. OpenAI gelişmiş sesi Avrupa'ya da açtı. 8. ByteDance stajyeri yapay zeka modellerine zararlı kod yerleştirdiği için kovuldu 9. Elon XAI APIsi yayınladı. Grok uygulamalarınıza eklenebiliyor. GROK3 gelirse ortalık karışır. 10. GPT Windows uygulaması geldi. #yapayzeka #teknolojihaberleri #bilim

wall street journal openai new york post canvas flux bytedance grok ayr sota mochi avrupa akal at one yapay zeka ideogram bu hafta runwayml

Marketing neu denken: Wie du mit KI im Marketing die Nase vorn hast (Timo Brümmer & Leon Peters)

OMT - Webinare

Play Episode Listen Later Nov 6, 2024 67:07

Erfahre in diesem Webinar, wie du durch den gezielten Einsatz von Künstlicher Intelligenz das volle Potenzial deines Marketings ausschöpfst. Von der Effizienzsteigerung von Prozessen bis hin zur personalisierten Kundenansprache – wir zeigen dir praxisnah, wie KI dein Marketing erfolgreicher macht. Dabei stellen wir konkrete Anwendungsfälle mit den Tools ChatGPT, Ideogram.ai, Midjourney, Runwayml.com sowie Udio.com vor. Folgendes lernst Du in diesem Webinar: – Du erhältst eine Einführung in KI und welche Bedeutung sie für deinen Arbeitsalltag hat – Du bekommst konkrete Praxisbeispiele für Text, Bild, Video und Audio, um KI in deine Arbeit zu integrieren – Du lernst die Chancen und Grenzen der KI-Tools kennen – Du bekommst einen exklusiven Tipp, wie du dein KI-Wissen auf dem aktuellen Stand hältst

#111 Sora is finally here...I mean o1

The ChatGPT Report

Play Episode Listen Later Sep 19, 2024 11:15

In todays episode we run over Sor....o1, and some other fantastic AI news tidbits Creator of the Week: MeanOrangeCat's 'Knight School' This week, we dive into the world of AI-generated content and industry updates. Our creator spotlight features MeanOrangeCat's charming RunwayML creation, "Knight School," a whimsical 2.5-minute tale of a cat knight battling white rabbits. We'll break down the latest in AI advancements, including: OpenAI's O1 (Strawberry) preview: Improved math and coding capabilities The potential AI investment bubble: BlackRock and Microsoft's $30 billion fund Adobe Firefly's video model demo: Promise and skepticism Luma's new video generation API Join us as we explore these developments and discuss the ongoing debate about the progress towards AGI. Are we on the brink of a breakthrough, or is the tech sector heading for a major correction? Tune in for our analysis and insights on the current state of AI.

ai creator microsoft openai improved blackrock sora agi luma sor adobe firefly runwayml

Intersection of INDIE & Deep Tech — A Conversation with Michael Dempsey, Compound Managing Partner

INDIE AUDIO

Play Episode Listen Later Jul 3, 2024 78:51

My friend, Michael Dempsey, Managing Partner at Compound, and I have an ongoing thread about the intersection of indie ideals and his areas of focus in deep tech. Most of these conversations happen over meals or online, so we decided to dive in a bit with the cameras on.Our conversation was as fun as it was wide ranging. I've watched for years as Michael has built Compound into a research focused, thesis-driven firm that has moved from the edges to the center of some of the hottest investment areas today. We get into details behind his early conviction around AI juggernauts like RunwayML and Wayve, to new themes they're exploring in crypto and biology. Some quick takeaways and highlights to look for —There's a trend of deep tech startups pursuing overly ambitious “narrative rotations” by constantly expanding into new areas, which Michael aptly calls a “Ponzi scheme of ambition.” This is driven by the need to justify raising large amounts of venture capital.His bias towards building one big business in a massive market rather than narrative expansions. For shelling point investments, like SpaceX and Palantir, the focus is on owning and expanding a principle or market over time, rather than narrative rotations.Compound's approach to investment themes that are considered “too early or too ridiculous” with no real customers yet, and funding those contrarian opportunities.The value of developing specific year-by-year theses on how certain sectors will play out, acknowledging that 80% may be wrong but aiming for the 20% that are very right. You only really need to be right twice per fund.

ai conversations entrepreneurship investing indie spacex managing partners intersection fundraising compound ponzi palantir deep tech runwayml michael dempsey

#100 Text to video heating up Luma and RunwayML

The ChatGPT Report

Play Episode Listen Later Jun 20, 2024 9:57

Welcome to the 100th episode of TheChatGPTReport! I'm your host Ryan, here to deliver the latest news in less than 15 minutes from an average guy's perspective. In this milestone episode, we dive into several chunky small news items, including my ongoing rant about AI influencers. I'll share my thoughts on the frustration of AI influencers labeling every new product as a "killer" and the hype surrounding demo videos. This week, our creator spotlight is on Nayar (@Blues_Blue34), who created an impressive 20-second Blade Runner-inspired video using Midjourney and Luma Labs AI. Check it out for some cool AI-generated content! In small news, OpenAI and Color Health have teamed up to develop an AI assistant for personalized cancer care. Civita, a major AI model platform, has banned everything related to Stable Diffusion 3 due to licensing concerns. Meanwhile, OpenAI CTO Mira Murati and CEO Sam Altman have conflicting statements about their in-house models' capabilities. Luma AI's new Dream Machine video generator has received mixed reviews, highlighting the ongoing challenges in AI video generation. Similarly, RunwayML AI's Gen 3 text-to-video tool is being rolled out, but early user experiences suggest it's still a work in progress with significant room for improvement. Don't forget to rate, review, and subscribe to the podcast to stay updated every Thursday, and tune in for interviews some Mondays. Follow us on LinkedIn and Twitter @ChatGPTReport (the purple logo) and email us at thechatgptreport@gmail.com if you'd like to be interviewed or share your thoughts. Thanks for joining me on this 100th episode journey!

ai blade runner openai heating up sam altman midjourney luma stable diffusion dream machine civita runwayml

APPLE GOES ALL IN ON AI

AI DAILY: Breaking News in AI

Play Episode Listen Later Jun 11, 2024 4:16

Plus Musk Bashes Apple's Strategy (subscribe below) Like this? Get AIDAILY, delivered to your inbox, every weekday. Subscribe to our newsletter at https://aidaily.us Apple Unveils AI Strategy at Developer Conference Apple revealed its AI strategy, "Apple Intelligence," emphasizing personalization and privacy. Key updates include iOS 18's enhanced customization, MacOS Sequoia's iPhone mirroring, and WatchOS 11's new features. The company is also supporting RCS for better messaging with non-Apple devices. CEO Tim Cook and OpenAI's Sam Altman highlighted the advancements Apple's New AI Strategy Could Drive iPhone Upgrades Apple announced its AI strategy, "Apple Intelligence," at its annual developer conference, integrating advanced AI features into its latest devices. The new capabilities, such as image creation and email rewriting, are limited to newer models like the iPhone 15 Pro and Pro Max. This exclusivity may drive consumers to upgrade their devices. Apple Gives Siri a Major AI Makeover At WWDC 2024, Apple unveiled significant updates to Siri, enhancing its naturalness, relevance, and personal context awareness. Siri can now handle speech stumbles, understand context better, and take action across apps using the new App Intents API. These updates aim to make Siri more capable and integrated into users' daily activities. Apple Shares Slip After AI Strategy Unveiling, Musk Criticizes OpenAI Partnership Apple's stock fell during premarket trading after unveiling its "Apple Intelligence" AI strategy at WWDC 2024, which includes a partnership with OpenAI. Tesla's Elon Musk criticized the partnership, calling it a security risk. Wall Street's lukewarm response reflects concerns over Apple's AI capabilities compared to rivals like Google and Microsoft. Musk Threatens to Ban Apple Devices Over OpenAI Integration Elon Musk threatened to ban Apple devices at his companies if Apple integrates OpenAI at the operating system level, calling it an "unacceptable security violation." This follows Apple's announcement of new AI features and a partnership with OpenAI. Musk criticized Apple's reliance on OpenAI instead of developing its own AI. AI Uncovers Genetic Insights into Coronary Artery Disease Researchers at Mount Sinai's Icahn School of Medicine used AI to identify rare coding variants in 17 genes linked to coronary artery disease (CAD). Published in Nature Genetics, the study utilized an in silico score derived from machine learning models trained on extensive health records. These findings could lead to new targeted treatments and personalized cardiovascular care. Kling AI Video Generator Surpasses Competitors with Advanced Features Kling, developed by Chinese tech giant Kuaishou, is gaining attention for its superior AI video generation capabilities. It can create videos up to two minutes long in 1080p resolution, outperforming rivals like Pika and RunwayML. Utilizing advanced 3D reconstruction technology, Kling produces realistic movements and scenes, setting a new standard in AI video creation. --- Send in a voice message: https://podcasters.spotify.com/pod/show/aidaily/message

AI Weekly Rundown (April 15th to April 21st 2024) Major Announcement from xAI first multimodal model, Google Infini-Attention, Adobe Firefly AI , Microsoft VASA-1, Meta Llama 3, NVIDIA RTX A400 A1000 and more:

AI Unraveled: Latest AI News & Trends, Master GPT, Gemini, Generative AI, LLMs, Prompting, GPT Store

Play Episode Listen Later Apr 20, 2024 14:03

⚖️ xAI's first multimodal model with a unique dataset☁️ Infini-Attention: Google's breakthrough gives LLMs limitless context

75 - Surfeando la IA con Christian Palau - Humanismo Digital

Humanismo Digital

Play Episode Listen Later Apr 14, 2024 59:50

Conversamos con Christian Palau, un explorador del conocimiento y el cambio que tras más de 20 años de carrera profesional en distintos roles cercanos a la tecnología, la estrategia, el negocio o el marketing nos invita a surfear en el área de conocimiento con decisión y ganas de aprender y compartir. Con Christian hablamos del impacto de la IA en su profesión actual, de herramientas en las que él delega parte de sus tareas, de autenticidad, de ética y especialmente y creo que desde su ejemplo, a crecer y no parar de aprender y compartir conocimiento para transitar hacia el futuro con más recursos y confianza Links a Christian ============== https://www.linkedin.com/in/christianpalau/ https://www.thebrainmixers.com/ Herramientas mencionadas ======================= ChatGPT: https://chat.openai.com/ Gemini: https://gemini.google.com/app Perplexity: https://www.perplexity.ai/ Dall-E: https://openai.com/dall-e-2 Midjourney: https://www.midjourney.com/home Imagine Art: https://www.imagine.art/ Runwayml: https://runwayml.com/ Play.HT: https://play.ht/ Suno: https://www.suno.ai/ Canva: https://www.canva.com/ai-image-generator/ Capcut: https://www.capcut.com/es-es/ Tome: https://tome.app/ TLDV: https://tldv.io/ Attention Inisghts: https://attentioninsight.com/ Chef GPT: https://www.chefgpt.xyz/es Learning Studio: https://learningstudioai.com/

digital chatgpt ia gemini conversamos canva gpt tome herramientas laia palau ht suno humanismo capcut runwayml

Meta & Zuck Go Full AI, Palworld Controversy & Interactive Trucker Chatbots | Ep41

AI For Humans

Play Episode Listen Later Jan 25, 2024 67:54

This week… Meta & Mark Zuckerberg go for open source AGI, massive drama around the new game Palworld, Deep Mind's Alpha Geometry gets math right & a device that detects skin cancer! Plus, Gavin tells us about the new motion brush update from RunwayML (it's awesome), Kevin gets frustrated trying to make burgers with ChatGPT (not awesome), ActAnywhere is a cool new white paper that will let you place people in videos…anywhere & of course, it's an election year and hoo boy are some people up to some not good stuff. AND THEN… an interview with Horace Copes, an AI intergalactic trucker that also happens to be a live and interactive chatbot made by Kevin and his small team. We learn a bit about Horace and then how Kevin (and team) brought him into being. Horace is live on Twitch right now and you can interact with him yourself at: http://www.vctv9000.com It's an endless cavalcade of ridiculous and informative AI news, AI tools, and AI entertainment cooked up just for you. Follow us for more AI discussions, AI news updates, and AI tool reviews on X @AIForHumansShow Join our vibrant community on TikTok @aiforhumansshow For more info, visit our website at https://www.aiforhumans.show/ /// Show links /// Mark Zuckerberg & Meta Going for Open Source AGI https://www.theverge.com/2024/1/18/24042354/mark-zuckerberg-meta-agi-reorg-interview Confirmation Sam Altman Making AI Chips https://fortune.com/2024/01/20/openai-ceo-sam-altman-seeking-billions-for-ai-chips-factories-network/ DeepMind's Alpha Geometry https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system-for-geometry/ Palworld Controversy https://www.kotaku.com.au/2024/01/palworld-ai-plagiarism-controversy/ Examples of Similarities Between Pokemon & Palworld https://twitter.com/CeciliaFae/status/1749183059877085396 Fake Biden Robocalls Are Here https://www.nbcnews.com/a/rcna134984 FDA Approved AI-Powered Skin Cancer Detector https://www.dermasensor.com/ Act Anywhere White Paper https://actanywhere.github.io/ The Cybertuba: https://twitter.com/thegarrettscott/status/1749147379993899490?s=20 Rappers Meet Namesakes: https://www.reddit.com/r/midjourney/comments/19dp4n8/rappers_challenge_their_namesakes_to_a_rap_battle/ RunwayML's New Motion Brush Update https://runwayml.com/ CHAT WITH HORACE: http://www.vctv9000.com

tiktok ai google technology news comedy twitch chatgpt artificial intelligence controversy openai chatbots interactive trucker agi zuck horace palworld deepmind aitools runwayml

On a créé une vidéo ÉPIQUE avec l'IA !! #Masterclass

The Bottlefield Show

Play Episode Listen Later Dec 1, 2023 48:48

Dans cet épisode passionnant (#tuto #masterclass) Ludovic Mornand et Frédéric Roginska relèvent le défi de créer un trailer vidéo promotionnel pour un spiritueux uniquement avec des outils d'intelligence artificielle. Ils commencent par générer le synopsis, le storyboard et les dialogues avec ChatGPT et Claude. Puis ils créent les visuels - champs d'agave, distillerie, bouteilles... - avec Midjourney en partageant leurs astuces pour obtenir des rendus ultra-réalistes. Certaines images sont même animées avec RunwayML ! Le logo est conçu avec Dall-E 3 et les voix off générées par Elevenlabs. Après un rapide montage sur Premiere Pro, découvrez le résultat bluffant dans deux versions : noir et blanc puis couleur. Malgré quelques imperfections, cette expérimentation révèle le potentiel de l'IA appliquée à la création audiovisuelle. Une émulation créative qui ne remplacera pas le talent humain mais ouvre de nouvelles perspectives passionnantes. Ludovic et Frédéric concluent en discutant de l'apport concret de ces outils et leurs limites actuelles. Une vidéo éclairante pour comprendre comment l'IA transforme déjà le travail des créatifs !

chatgpt dans masterclass trailer ia editing ils noir petit puis malgr dall nb avec l certaines midjourney pique ludovic elevenlabs premiere pro ressources le runwayml

OpenAI Reunited, New AI Video Tools & WWE's Xavier Woods AKA Austin Creed | Ep34

AI For Humans

Play Episode Listen Later Nov 30, 2023 88:58

This week… OpenAI brought Sam Altman back, DeepMind's AI discovered a trove of new materials, Sports Illustrated Got Outed for AI articles, RunwayML's new Motion Brush is out, and… WWE's Xavier Woods aka Austin Creed joins us for an interview to discuss all things AI, including whether or not we should be preparing for an AI paradise or counting our days. (51:25) Plus, meet our cool and oh-so-youthful AI co-host Alison who's a TikTok influencer with something particular on her mind. Yes, once, again your poor hosts get used to shill something terrible. Oh and we're joined for the news by Andre & Jeremie, the hosts of the very good podcast Last Week In AI which you should most definitely listen to and subscribe to! You can find it here: https://www.lastweekinai.com/ It's an endless cavalcade of ridiculous and informative AI news, AI tools, and AI entertainment cooked up just for you. Follow us for more AI discussions, AI news updates, and AI tool reviews on X @AIForHumansShow Join our vibrant community on TikTok @aiforhumansshow For more info, visit our website at https://www.aiforhumans.show/ // Show Links // RunwayML: https://runwayml.com/ One 2-3-45 https://github.com/One-2-3-45/One-2-3-45 Sam Altman Returns To OpenAI https://www.theverge.com/2023/11/22/23967223/sam-altman-returns-ceo-open-ai The Best Q* Explainer for AI Explained https://youtu.be/ARf0WyFau0A?si=_GM92Y762M08Eej0 Sports Illustrated Uses AI Writers https://futurism.com/sports-illustrated-ai-generated-writers DeepMind New Materials From AI https://deepmind.google/discover/blog/millions-of-new-materials-discovered-with-deep-learning/ Xavier Woods / Austin Creed https://twitter.com/AustinCreedWins UpUpDownDown https://www.youtube.com/channel/UCIr1YTkEHdJFtqHvR7Rwttg

tiktok ai technology comedy tech tools wwe openai reunited sam altman deepmind new ai xavier woods austin creed runwayml

Ep. 173 Ann Morgan – Abstract Painter, Melding the Traditional With Technology

Tenet

Play Episode Listen Later Nov 16, 2023 133:50

This week Wes and Todd sit down with Painter, Ann Morgan. Ann discusses painting big, technology informing her process, xeroxes, learning to paint at age 13, winning awards in her youth, AmeriCorps VISTA, getting her B.F.A. at Michigan State University, not painting for 20 years, always considering herself an Artist, her “Art Stuff" folder, the catalyst to start painting again, melding the traditional with technology, Artificial Intelligence, RunwayML, painting in oil, Her show at Next Gallery “Once When I Was You,” her process using A.I., painting in the dark, being bendy, A.I. prompts, borrowing imagery, what scares her about A.I., artist statements living in a simulation, pricing social media, art scammers her “Burn Series,” painting environments, and art being unapologetic joy. Join us for a conversation about exploration and process with Ann Morgan! Check out Ann's work at her website www.annmorgan.art Follow Ann Morgan on social media:Instagram - www.instagram.com/echopod/@echopod Facebook - www.facebook.com/amjohnsonmorgan

RunwayML's Announces New Text-to-Video AI Features

AI Chat: ChatGPT & AI News, Artificial Intelligence, OpenAI, Machine Learning

Play Episode Listen Later Nov 14, 2023 6:40

In this episode, we explore the groundbreaking new motion feature introduced by RunwayML in its text-to-video AI service, delving into its capabilities, potential applications, and how it's set to revolutionize content creation. We'll also discuss the technical aspects of this feature and its implications for the future of AI in the multimedia industry. Invest in AI Box: https://republic.com/ai-box

ai invest video ai runwayml

Een taalmodel uit de polder, GPT-NL | ✨ Poki

âœ¨Poki - Podcast over Kunstmatige Intelligentie AI

Play Episode Listen Later Nov 6, 2023 63:57

Nederland is begonnen met de ontwikkeling van een open-source taalmodel met behulp van onze eigen supercomputer Snellius. Alexander en Wietse bespreken hoe dit in zijn werk zou moeten gaan en verdiepen zich in de recente ontwikkeling op de generatie van video met RunwayML. Kijken we binnenkort echt alleen nog maar naar onze eigen TV-shows?Benieuwd naar de maaltijden van Lazy Vegan? Neem een kijkje op hun website.Zie het privacybeleid op https://art19.com/privacy en de privacyverklaring van Californië op https://art19.com/privacy#do-not-sell-my-info.

tv technology future artificial intelligence nederland neem zie benieuwd kijken californi polder kunstmatige intelligentie poki wietse runwayml

The Aphid Lounge Preview: Worldbuilding, AI, and You Part 2: Using AI for TTRPGs

Worldbuild With Us

Play Episode Listen Later Oct 17, 2023 1:02

WBWU, now in glorious video format! In the second part of this patron-exclusive Aphid Lounge series on AI tools, Daniel walks us through creating character portraits with Stable Diffusion and whipping up ideas with ChatGPT, and shows us the emerging realm of video via RunwayML. We also talk art, accessibility, ethics, and what the future might bring, for better or worse. Watch the full episode at https://www.patreon.com/posts/aphid-lounge-ai-90338514 by joining our Patreon's Giant Aphid or God-Pacted tiers!

ai chatgpt using ai worldbuilding ttrpgs stable diffusion aphid runwayml

AI-Powered Filmmaking with Waymark's Stephen Parker and Josh Rubin

Play Episode Listen Later Sep 14, 2023 91:59

In this episode, Nathan sits down with Stephen Parker and Josh Rubin of Waymark, and creators of The Frost, an AI-powered 12 minute short film. In this episode, we get a behind the scenes look at their creative process, the prompting and creative techniques they used to generate and animate the DALL-E results, and an overview of the current state of AI art. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive TIMESTAMPS (00:00) Episode Preview (00:01:00) Nathan's introduction for Stephen Parker and Josh Rubin (00:05:01) - The Frost is a 12-minute short film created using DALL-E 2 images. (00:07:06) - The Frost started as an experiment to see if a narrative film could be created completely from AI imagery. (00:08:38) - The filmmaking process was different because DALL-E images provided a starting point to build the story. (00:10:38) - Parker started generating images with DALL-E 2 when he got access to the early preview. (00:12:26) - Prompt technique to get consistent images by providing context about a hypothetical film. (00:15:57) Sponsors: Netsuite | Omneky (00:19:37) - Compositional continuity, like shot-reverse shot, was hard to achieve through prompting. (00:22:13)- Rubin would request specific shots and the team would prompt DALL-E 2 to create them. (00:25:24) - Filmmaking with AI as opposed to traditional filmmaking (00:32:25) - Getting consistent facial features for characters was very difficult. (00:39:03) - The storytelling helped cover inconsistencies that viewers might not notice. (00:40:15) - Working with the images DALL-E provides (00:41:54) - MacGuffin Object to tie scenes together (00:44:53) Inpainting and compositing to refine DALL-E Images (00:45:41) - Prompting for complex or novel compositions remains challenging. (00:50:43) - The AI art is limited by what exists in the training data. (01:02:05)- Animating the human characters was challenging because of missing or incorrect appendages. (01:07:36) - The team had to find creative ways to convey emotion through the limited animation. 01:02:24 - Animating subtle human movement and emotion is still very difficult. (01:06:35) - A romantic comedy would be much harder to produce with current AI capabilities. (01:12:17) - For Frost 2 they are using text-to-video models like RunwayML. (01:15:43) - AI voicing advancements applied to filmmaking (01:19:27) - The future of AI in Hollywood and filmmaking: quality narratives still require human vision LINKS: The Frost: https://www.thefrostpart.one/ MIT Tech Review Feature Article: https://www.technologyreview.com/2023/06/01/1073858/surreal-ai-generative-video-changing-film/ Behind the Scenes Videos: https://www.youtube.com/watch?v=p31COxNbTWs and https://www.youtube.com/watch?v=F8k9MeXpSUU The Frost Part 2 – trailer – https://www.youtube.com/watch?v=RcmwtRd_NIs X/SOCIAL: @Stephen_Parker (Stephen) @bigkickcreative (Josh) @Waymark @labenz (Nathan) @eriktorenberg @CogRev_Podcast SPONSORS: NetSuite | Omneky NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist. Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off. Music Credit: GoogleLM

ai hollywood ecommerce oracle frost filmmaking kpi rubin erp prompt ai powered prompting netsuite animating compositional josh rubin omneky stephen parker runwayml

Amazing AI-Powered Video Apps.

VIDEO MOJO with Jon Leland: Timeless marketing mixed with the bleeding edge of video & social media

Play Episode Listen Later Aug 14, 2023 45:30

In this Video Mojo exclusive, we delve into the fascinating world of how AI can be used for creative experimentation. I talk with Nicolas Neubert, who I met on X / Twitter as part of #aicommunity. In this conversation, he illuminates his breakthrough project, "Genesis," a concept movie trailer, and how he used the power of two AI apps, MidJourney and RunwayML to bring his cinematic vision to life. He also shares his music source and the free video editing software he chose to use despite his expertise with more advanced software. To his own amazement, this project has gone viral and was featured on CNN and in Forbes. Nicolas, a product designer at Volkswagen by day, has always been drawn to the realms of art and creativity. In this inspiring endeavor, he takes us on a creative adventure through how he created the "Genesis" concept movie trailer leveraging these two leading-edge AI tools, MidJourney and RunwayML. Discover how Nicholas seamlessly blends his expertise in design with his passion for visual storytelling. Uncover the secrets behind his creative process as he shares insights into how AI can be harnessed for artistic expression.#CreativeExperiment #ArtMeetsTechnology LINKS: @iamneubert on X YouTube of the "Genesis" faux movie trailer Nicolas' tweet on The Making of "Genesis" faux movie trailer using MidJourney and RunwayML Forbes article about this project CNN story on RunwayML MidJourney RunwayML PixaBay Music CapCut Video Editor (or mobile app) TIMESTAMPS: The key idea of the video is that AI-powered video apps like MidJourney and RunwayML provide a creative and efficient way for individuals and companies to generate viral videos, while also emphasizing the importance of human judgment and experimentation in the creative process. 00:00

ai discover forbes cnn apps uncover volkswagen ai powered midjourney go viral runwayml

AI-generated South Park, “Holy Grails” of AI & more with Sunny Madra | E1785

This Week in Startups

Play Episode Listen Later Aug 1, 2023 59:11

This Week in Startups is brought to you by… Embroker. The Embroker Startup Insurance Program helps startups secure the most important types of insurance at a lower cost and with less hassle. Save up to 20% off of traditional insurance today at Embroker.com/twist. While you're there, get an extra 10% off using offer code TWIST. OpenPhone. Create business phone numbers for you and your team that work through an app on your smartphone or desktop. TWiST listeners can get an extra 20% off any plan for your first 6 months at openphone.com/twist Roots. Invest in the only real estate investment trust that creates wealth for you and its residents at investwithroots.com/twist * Today's show: Sunny Madra joins Jason to discuss Fable Simulation's new AI showrunner(5:51), AI-powered note-takers and CRMs(37:25), and more! * Time stamps: (00:00) Sunny Madra joins Jason (2:27) Using AI to analyze large datasets (5:51) The WGA strike and Fable Studios' AI-generated South Park episode (12:29) Embroker - Use code TWIST to get an extra 10% off insurance at https://Embroker.com/twist (13:45) AI-generated South Park continued (16:58) What is lost with AI and the Modern Seinfeld Twitter account (23:40) OpenPhone - Get 20% off your first six months at https://openphone.com/twist (25:10) The pace of innovation with RunwayML (31:06) Production studios sharing the wealth & alternative plans (36:00) Roots - Head to investwithroots.com/TWIST to sign up and start investing today! (37:25) Sunny showcases Folk app's CRM (45:29) Sunny demos Circleback (51:32) Using Claude to break down an investor presentation and Sergey Brin back at Google * Follow Sunny: https://twitter.com/sundeep Check out Circleback: https://circleback.ai Check out Folk.app: https://www.folk.app Check out RunwayML: https://runwayml.com * Read LAUNCH Fund 4 Deal Memo: https://www.launch.co/four Apply for Funding: https://www.launch.co/apply Buy ANGEL: https://www.angelthebook.com Great recent interviews: Steve Huffman, Brian Chesky, Aaron Levie, Sophia Amoruso, Reid Hoffman, Frank Slootman, Billy McFarland, PrayingForExits, Jenny Lefcourt Check out Jason's suite of newsletters: https://substack.com/@calacanis * Follow Jason: Twitter: https://twitter.com/jason Instagram: https://www.instagram.com/jason LinkedIn: https://www.linkedin.com/in/jasoncalacanis * Follow TWiST: Substack: https://twistartups.substack.com Twitter: https://twitter.com/TWiStartups YouTube: https://www.youtube.com/thisweekin * Subscribe to the Founder University Podcast: https://www.founder.university/podcast

AI Minecraft, "Responsible AI" + We Make ChatGPT & LLAMA 2 Debate | AI For Humans

AI For Humans

Play Episode Listen Later Jul 27, 2023 64:40

In this week's AI-happy episode, fully human hosts Kevin Pereira & Gavin Purcell explore hot topics in the artificial intelligence community. From 'responsible AI' endorsed by tech giants like OpenAI, Microsoft, and Google to 'Steve-1,' an AI that learns through Minecraft... and then... THE NEXT BIG AI DEBATE: ChatGPT (GPT-4) and Meta's new LLAMA 2 debate the classic "Cats Vs Dogs" and we gaurantee you will be SHOCKED at the result. But wait... THERE'S MORE: We've also meet our new AI co-host Cassidy who seems nice but again, might have something slightly wrong with her. And we've got two very cool free AI tools that you'll want to get your hands on: SunoAI's Chirp which allows you to make full blown AI songs from text prompts (that are good!) and RunwayML's new Gen2 Image-To-Video which allowed the guys to make a series of Animal Influencers. Follow us for more AI discussions and updates on Twitter @AIForHumansShow, Join our vibrant community on TikTok @aiforhumansshow Subscribe on YouTube to never miss an episode @AIForHumansShow. For more info, visit our website at https://www.aiforhumans.show/

tiktok ai google debate microsoft chatgpt humans minecraft machine learning openai shocked llama responsible ai chirp runwayml

Part 2: One Minute Films, Acting & AI with Darion D'Anjou

Act Out of Love Podcast with Natasha Mayet

Play Episode Listen Later Jul 25, 2023 47:41

In part two of this informative episode with award winning sci-fi writer director Darion D'Anjou we get into the nitty gritty of AI tools for filmmakers and actors who want to make their own content, such as Chat GPT, Midjourney and RunwayML. If you like this episode please do give it a review on Apple Podcasts so someone else can be inspired! Darion D'Anjou Youtube https://www.youtube.com/c/dariondanjou One Minute Film School https://www.youtube.com/c/oneminutefilmschool https://oneminutefilmschool.dariondanjou.com/ IG @dariondanjou @dariondanjouart Discord to make ai art with Darion and see his Midjourney prompts discord.gg/K8rnCTs4 --- Send in a voice message: https://podcasters.spotify.com/pod/show/onthejourneypodcast/message Support this podcast: https://podcasters.spotify.com/pod/show/onthejourneypodcast/support

ai acting films midjourney one minute darion runwayml

EP11: Powering The Future of AI, Uncanny Valley Reductions in AI Video and the Latest in Design with AI

How to Talk to AI

Play Episode Listen Later Jun 20, 2023 39:09

FOR 15% OFF PROMPT PERFECT Use this link https://bit.ly/3Chmc16and the code 'httta' at the checkout!AI & Prompt Engineering for Everyone on CoRise.com (Presale starts 23 JUN 23): corise.com/go/prompt-eng-everyone-2F5JG use the code 'WESS10' for 10% offPodcast Page: [https://howtotalkto.ai]HTTTA Newsletter: [https://httta.substack.com/]NVIDIA Supercomputer: https://nvidianews.nvidia.com/news/nvidia-announces-dgx-gh200-ai-supercomputerAI-Generated Short Film "The Frost": https://www.technologyreview.com/2023/06/01/1073858/surreal-ai-generative-video-changing-film/RunwayML for Text-to-Image Generations: https://runwayml.com/On this episode of HTTTA, we explore the latest developments in generative models and diffusion models in the AI space. Goda and Wes dive into the AI arms race and how big companies such as Microsoft and Google can afford new NVIDIA exaflop supercomputers. They provide some thoughts on fusion power and how it is necessary to achieve advanced AI technologies like GPT-5, 6, and 7. Finally, we discuss a foldable sofa that can fit in an envelope, developed with Ikea using AI techniques. Tune in to HTTTA to hear all about all it, Happy Prompting Everybody![00:02:32] RunwayML Gen2 Text-to-Video released[00:05:06] Elon Musk responds to friend of the Podcast James Gerde video.[00:08:23] New AI Supercomputers from NVIDIA. In the past, scaling limited by physics. This is on longer a limitation. [00:09:23] Nvidia's GraceHopper supercomputer is a more cost-effective and efficient solution, using 45x less power.[00:12:32] New GPU central server from NVIDIA with expensive chips not accessible to most companies, but innovation will make it cheaper.[00:17:35] First mover advantage flipped with AI innovation. Marketing matters more than tech. Example of prompt engineering and sales on Amazon.[00:19:30] Importance of creating videos with lasting relevance. [00:24:14] A foldable sofa designed by AI can fit in an envelope and was developed with Ikea; discovered through testing keywords, including "conversation pit."[00:26:02] Diffusion models in architecture are and have potential for impressive renderings. [00:29:58] Animation still has some uncanny valley, but this AI-generated 13-minute movie "The Frost" has an understandable narrative structure despite technological limitations.[00:33:53] Shutterstock and Getty Images deal fell through with Stable Diffusion, Solution to legal arguments, or the largest settlement in history?[00:35:24] Diffusion models in network science understand how images interact and make decisions all at once, not linearly. Goda Go on Youtube: [https://www.youtube.com/@godago]Wes the Synthmind's everything: [https://linktr.ee/synthminds]

We Send AI to Therapy, Meta's New Music AI and Free RunwayML Video Apps

AI For Humans

Play Episode Listen Later Jun 15, 2023 67:30

Hosts Gavin Purcell & Kevin Pereira return to demystify the latest AI news, breakthroughs, and their practical applications. As always, our mission is to make artificial intelligence both digestible and relatable for everyone. In this week's demo, we're conducting an AI therapy session! Our AI co-host, Gash, sits down with Dr. Claude, an AI based on Anthropic's model. Their mission? To see if they can nudge Gash out of his deep-seated nihilistic state. Will they make a breakthrough? Join us for a fascinating (and, of course, semi-agressive) journey into AI self-awareness. Then, in our "Dumb Thing We Did With AI This Week" segment, we live up to our very dumb expectations. Kevin strikes a tune using Meta's new MusicGen text-to-audio software while Gavin exploits the ChatGPT plugin, Photorealistic, to transform fast food mascots into dreary corporate middle-managers. On to THE NEWS: Runway ML has just launched Gen2 video for all! Kevin and Gavin break down how to get the most out of it and what innovative things you can create with this new tool. Mark Zuckerberg gets candid about AI on Lex Fridman's podcast, delving into the significance of the newly open-sourced models. OpenAI announces new ChatGPT models, promising even greater performance and speed. We'll unpack what these developments mean for AI technology and everyday users. Stay tuned for these stories, and much more about the world of AI, its impact, and its potential. Follow us on: Twitter: https://twitter.com/AIForHumansShow YouTube: https://www.youtube.com/@AIForHumansShow And share your thoughts with us via voicemail or email through our website: https://www.aiforhumans.show/

ai therapy chatgpt apps mark zuckerberg openai new music anthropic lex fridman gash 'meta' gen2 music ai photorealistic runwayml musicgen

RunwayML, AI Steve Jobs, & more with Sunny Madra & Vinny Lingham | E1755

This Week in Startups

Play Episode Listen Later Jun 3, 2023 64:29

This Week in Startups is presented by: Vanta. Compliance and security shouldn't be a deal-breaker for startups to win new business. Vanta makes it easy for companies to get a SOC 2 report fast. TWiST listeners can get $1,000 off for a limited time at vanta.com/twist. Embroker. The Embroker Startup Insurance Program helps startups secure the most important types of insurance at a lower cost and with less hassle. Save up to 20% off of traditional insurance today at Embroker.com/twist. While you're there, get an extra 10% off using offer code TWIST. .Tech domains are the go-to namespace to build anything in tech… and home to the world's most innovative startups. Secure your .Tech domain today and lock down a 1-year domain for $10, or a 5-year domain for $50 at https://go.tech/TWIST today! * Today's show: Sunny and Vinny are back to break down what's next after AI layoffs(30:10), discuss RunwayML's massive valuation(52:09), and more! Follow Vinny: https://twitter.com/vinnylingham Check Out Waitroom: https://waitroom.com/ Follow Sunny: https://twitter.com/sundeep Check Out Definitive: https://definitive.io/prompts/new * Time stamps: (00:00) Sunny and Vinny join Jason (1:39) Sunny demos Runway (9:16) Jason AI interviews AI Steve Jobs (12:37) Vanta - Get $1000 off your SOC 2 at https://vanta.com/twist (13:44) Jason AI interviews AI Steve Jobs continued (18:27) The potential for genealogist AI startup (22:26) Converting thoughts into text with Audio Pen AI tool (24:40) Microsoft charging 40% more for AI-enhanced services (28:55) Embroker - Use code TWIST to get an extra 10% off insurance at ⁠https://Embroker.com/twist (30:10) Comparing hiring with improving employee efficiency (33:46) The ChatGPT share feature (37:47) Meta's Image Bing AI tool (40:33) .Tech - Lock down a 1-year domain for $10, or a 5-year domain for $50 at https://go.tech/TWIST (42:01) Microsoft integrating AI into windows (49:29) The Sales Recording Law (52:09) Big funding rounds and why investors must get in early (55:30) Japan going all in on copyright * Read LAUNCH Fund 4 Deal Memo & Apply for Funding Buy ANGEL Great recent interviews: Brian Chesky, Aaron Levie, Sophia Amoruso, Reid Hoffman, Frank Slootman, Billy McFarland, PrayingForExits, Jenny Lefcourt Check out Jason's suite of newsletters: https://substack.com/@calacanis * Follow Jason: Twitter: https://twitter.com/jason Instagram: https://www.instagram.com/jason LinkedIn: https://www.linkedin.com/in/jasoncalacanis * Follow TWiST: Substack: https://twistartups.substack.com Twitter: https://twitter.com/TWiStartups YouTube: https://www.youtube.com/thisweekin * Subscribe to the Founder University Podcast: https://www.founder.university/podcast

Nieuwe Zelda als eerste gespeeld en de Asus-spelcomputer

Bright Podcast

Play Episode Listen Later Apr 26, 2023 50:03

Asus heeft een draagbare spelcomputer aangekondigd: de ROG Ally. Hoe verhoudt de Ally zich tot concurrenten als de Nintendo Switch en de Steam Deck? Bastiaan praat je bij. Hij heeft ook als eerste al twee langverwachte games kunnen spelen: Zelda: Tears of the Kingdom en Star Wars Jedi Survivor. Maken deze games de verwachtingen waar?Ook in het nieuws: Apple's plannen voor een eigen dagboek-app en een gezondheidscoach, en het kabinet trekt 600 miljoen uit voor het stimuleren van de aanschaf van elektrische auto's.Tips uit deze aflevering:Welcome to Wrexham: een docuserie op Disney+ over hoe Hollywood-sterren Ryan Reynolds en Rob McElhenney in 2021 de kleine, noodlijdende voetbalclub Wrexham AFC uit Noord-Wales overnamen. Afgelopen weekend werd Wrexham na 15 jaar op het vijfde en laagste profniveau in Engeland kampioen. Tijdens het eerste seizoen van de serie lijkt dat succes nog ver weg. De iOS-app RunwayML: deze app vormt video's die je met je iPhone maakt om met AI, bijvoorbeeld tot een klei-animatie.Star Wars Rebels op Disney+: al wat ouder, maar nu weer leuk voor Star Wars-fans om te kijken in de aanloop naar de nieuwe Star Wars-serie Ahsoka, die in augustus verschijnt.Eerste indruk: Zelda: Tears of the Kingdom kan dé game van het jaar wordenReview: Star Wars: Jedi Survivor is grootser, leuker en met meer foutjes Zie het privacybeleid op https://art19.com/privacy en de privacyverklaring van Californië op https://art19.com/privacy#do-not-sell-my-info.

Runway Gen-2: Generative AI for Video Creation with Anastasis Germanidis - #622

This Week in Machine Learning & Artificial Intelligence (AI) Podcast

Play Episode Listen Later Mar 27, 2023 48:51

Today we're joined by Anastasis Germanidis, Co-Founder and CTO of RunwayML. Amongst all the product and model releases over the past few months, Runway threw its hat into the ring with Gen-1, a model that can take still images or video and transform them into completely stylized videos. They followed that up just a few weeks later with the release of Gen-2, a multimodal model that can produce a video from text prompts. We had the pleasure of chatting with Anastasis about both models, exploring the challenges of generating video, the importance of alignment in model deployment, the potential use of RLHF, the deployment of models as APIs, and much more! The complete show notes for this episode can be found at twimlai.com/go/622.

co founders cto apis runway generative video creation anastasis rlhf runwayml

Is Bing's Sydney Still Unhinged? Amazon Multimodal, AI a Threat to Humanity? OpenAI Foundry | E03

This Day in AI Podcast

Play Episode Listen Later Feb 24, 2023 61:06

In episode #03 of the This Day in AI Podcast we talk about Microsoft's Updates to the Bing ChatBot and Discover if Sydney is Still Unhinged. We Cover New Models like Amazon's Multimodal, Discuss AI Memories and How Soon Before AI is a Threat to Humanity. Learn About OpenAI Foundry, RunwayML and AI in the Enterprise thanks to OpenAI's Partnership with Bain & Company.00:00 - AI a Threat to Humanity?01:05 - Is Bing Chatbot "Sydney" still Unhinged?03:03 - Bing's Response to Kevin Roose06:50 - Is Bing Chatbot Evil?8:06 - Is the Internet Bing AI Chatbots Memory? Will AI develop Memory?10:15 - Bing's Sydney Chatbot Making Threats Again13:34 - Is AI Unhinged Because of Social Media? 14:14 - Prompt Injection Cat & Mouse14:52 - Is ChatGPT Sentient? Learning with Prompts17:26 - The Skill of Prompting AI and What is Coming20:00 - AI Wars: Bing Vs Bard, Is Google Seriously In Trouble?24:36 - Amazon AI Model: Multimodal Chain-of-Thought Reasoning31:51 - OpenAI Foundry. What is AWS thinking? Google Cloud AI?37:26 - Open AI Partnership with Bain & Company. AI in the Enterprise.41:05 - Will All Neural Net Models Join to Create a Super AI?43:35 - How Long Until AI is a Threat to Humanity (Continued)47:10 - How will Enterprise AI handle hallucinations? 49:58 - Influencing AI Through Training Data. AI SEO?53:25 - RunwayML: Zero Budget CGI for Film Makers56:20 - Microsoft BioGPT: Specialized AI Models59:15 - BasedGPT Prompt Injection: How would you take over the world?#ai #bingchat #dan #promptinjection #microsoft #bing #chatgpt #openai SOURCES: https://www.reddit.com/r/bing/comments/11a7rpu/sydneys_letter_to_the_readers_of_the_new_york/ https://twitter.com/sethlazar/status/1626257535178280960 https://twitter.com/tobyordoxford/status/1627414519784910849 https://twitter.com/davidad/status/1627474041396142082?s=46&t=zPfxo4dLsUU6d5S0RgiH7g https://twitter.com/balajis/status/1627744768062066724?s=20 https://twitter.com/emollick/status/1627452330017783811 https://www.reddit.com/r/bing/comments/117nx5g/does_anyone_else_feel_people_dont_know_whats/ https://twitter.com/AlphaSignalAI/status/1628104093318119451/photo/1 https://arxiv.org/abs/2302.00923 https://twitter.com/transitive_bs/status/1628118163874516992?s=46&t=4KszsLtOxd5ENKFLEXN7qw https://twitter.com/tunguz/status/1628027051503386624?s=20 https://www.bain.com/vector-digital/partnerships-alliance- https://blog.opencagedata.com/post/dont-believe-chatgptecosystem/openai-alliance/ https://twitter.com/karenxcheng/status/1627721862565482496?s=46&t=zPfxo4dLsUU6d5S0RgiH7g If you enjoy this podcast please consider subscribing, liking the video and leaving us a comment with feedback.

Amazing A.I. Tools for Video Editing

Online Video Made Easy

Play Episode Listen Later Feb 10, 2023 32:48

018 - In this episode, Kerry covers how artificial intelligence tools are transforming video content creation in incredible ways! He goes through the main features of the following video content creation programs: Adobe Premiere Pro: https://www.adobe.com/products/premiere/ RunwayML: https://runwayml.com/ Gen-1 A.I. from Runway: https://research.runwayml.com/gen1 Timebolt Silence Remover: https://www.timebolt.io/ Vidyo.ai: https://vidyo.ai/ Alwrite.ai: https://alwrite.ai/ Follow Kerry on Instagram: http://www.instagram.com/thelivestreamexpert Visit his website at: http://www.kerryshearer.com Find great smartphone video production tech at: http://www.livestreamtechstore.com

tools runway video editing adobe premiere pro runwayml

Sarah Catanzaro — Remembering the Lessons of the Last AI Renaissance

Gradient Dissent - A Machine Learning Podcast by W&B

Play Episode Listen Later Feb 2, 2023 76:24

Sarah Catanzaro is a General Partner at Amplify Partners, and one of the leading investors in AI and ML. Her investments include RunwayML, OctoML, and Gantry.Sarah and Lukas discuss lessons learned from the "AI renaissance" of the mid 2010s and compare the general perception of ML back then to now. Sarah also provides insights from her perspective as an investor, from selling into tech-forward companies vs. traditional enterprises, to the current state of MLOps/developer tools, to large language models and hype bubbles.Show notes (transcript and links): http://wandb.me/gd-sarah-catanzaro---⏳ Timestamps: 0:00 Intro1:10 Lessons learned from previous AI hype cycles11:46 Maintaining technical knowledge as an investor19:05 Selling into tech-forward companies vs. traditional enterprises25:09 Building point solutions vs. end-to-end platforms36:27 LLMS, new tooling, and commoditization44:39 Failing fast and how startups can compete with large cloud vendors52:31 The gap between research and industry, and vice versa1:00:01 Advice for ML practitioners during hype bubbles1:03:17 Sarah's thoughts on Rust and bottlenecks in deployment1:11:23 The importance of aligning technology with people1:15:58 Outro---

spotify ai lessons advice building selling maintaining failing renaissance google podcasts rust ml general partners catanzaro sarah sarah gantry amplify partners runwayml octoml

Ai And Content Creators - Don't Make This Mistake

Agorapreneurs

Play Episode Listen Later Jan 6, 2023 7:06

AI has changed the market for content Creators- Don't Make the mistake of thinking this is going to be an easy way to get content done for you. That would be a big mistake. Instead, look at it a different, more creative way of starting and how content creators need to adapt. Content Creation is one of the ways you can market your services. Yes, tools like ChatGPT, RunwayML, Jasper, Midjourney and others have dramatically changed what content creators need to know and do. In this episode of Stark Raving Entrepreneurs, you get to discover what has worked in the past and what is working and and into the future for content creators, or as we like to describe ourselves, Content Entrepreneurs (yes, capitalized!). Get more great tips about being a Liberty-Loving Entrepreneur and get the insights your competitors aren't getting subscribe now (don't wait for "later") : https://youtube.com/channel/UCbXsmmEY4aPH8le8Cp_RfoA?sub_confirmation=1 This is a lifestyle embracing "Live and Let Live." Do whatever you want, but don't hurt others and don't take their stuff. It's really that simple. We're a channel devoted to those with an entrepreneurial spirit who believe in freedom and liberty. If you like the idea of living a voluntary life, not initiating force or coercion, and living life peacefully, abundantly, and making a lot of money, this is the place for you! I look forward to hearing from you and getting your opinions and thoughts. Drop me a note at Terry@TerryBrock.com. Thank you for joining me today.

live ai zoom mistakes drop chatgpt videos content creators content creation canva midjourney let live runwayml

Episode 23 - Generative AI, art and reality with Anastasis Germanidis, CTO at RunwayML

Marathon VC Podcast

Play Episode Listen Later Nov 28, 2022 30:31

Σε αυτό το επεισόδιο συζητάμε με τον Αναστάση Γερμανίδη, CTO και cofounder στη Runway ML, μια από τις σημαντικότερες εταιρίες Generative AI στον κόσμο. Καλύπτουμε θέματα όπως Generative AI, Stable Diffusion, Workflows for creatives, αρχιτεκτονικές για υπηρεσίες με μεγάλα μοντέλα και ποια είναι η ελπίδα του Αναστάση για το μέλλον του χώρου. # Useful links Twitter https://twitter.com/agermanidis LinkedIn https://www.linkedin.com/in/agermanidis/ Paul Trillio (artist) https://twitter.com/paultrillo RunwayML GIthub https://github.com/runwayml Stable Diffusiub 2.0 https://github.com/Stability-AI/stablediffusion

reality cto generative workflows stable diffusion stability ai anastasis runwayml

Founded and Funded IA40 Winner Spotlight: RunwayML Co-Founder Cristobal Valenzuela on the Intersection of Art and Technology

Founded and Funded

Play Episode Listen Later Apr 7, 2022 36:24

In this episode of Founded and Funded, Madrona is launching a special series to highlight some of its IA40 winners, starting with RunwayML, which offers web-based video editing tools that utilize machine learning to automate what used to take video editors hours if not days to accomplish. Madrona Investor Ishani Ummat speaks with Co-founder and CEO Cristobal Valenzuela all about where the idea came from, how he decided to launch a company instead of joining Adobe – and even how TikTok fits into all of this. Listen now to hear all about it.

tiktok co founders winner founded intersection adobe funded valenzuela cristobal art and technology madrona runwayml

The Logik Podcast #26: Cristóbal Valenzuela from Runway

The Logik Podcast

Play Episode Listen Later Mar 23, 2022 32:00

Check out my phenomenal chat with Cristobal Valenzuela, CEO and Co-Founder of RunwayML. We talked about his journey, the history of the company and the future of machine learning and visual effects.

ceo co founders crist runway valenzuela logik runwayml

MORE 035: Are You Using This Machine Learning Tool? | Rory Knighton

MORE - The Digital Marketing Tech Tools Podcast

Play Episode Listen Later Nov 10, 2021 25:27

A great marketing tool doesn't have to be well-known! After leaving the acting world behind, today's guest Rory Knighton has amassed experience in the digital world building websites, writing code, and creating content for both himself and his clients. In today's MORE podcast episode, he shares the tools he utilizes while creating content and how you can use them for yourself. The end result of his content: put meaning behind your brand. When starting in the industry, Rory went around to businesses asking if they wanted a website. And, in the early 2000s, that was a very fancy thing. For the past six years, he's been running his own eCommerce and video-focused agency, Runaway Collective. Because of his specialty, his agency often creates video content. That means planning, filming, and editing content for clients (which can be a cumbersome process.) Imagine photoshop for video, but easier. Normally when in video editing, you have to color, cut, and keyframe manually. But the tool Runwayml delete backgrounds, all with the click of a button. Especially for greenscreen changing, it works marvelously well. You can teach it to look for people in footage, color correct, it has many different capabilities. The tool is still in beta, and it operates right in the browser and is a huge time-saver, especially when editing multiple client videos. Sharing video footage is simple with Frame.io. A video collaboration tool, Frame.io allows you to send video content to collaborators easily, and it offers in-platform messaging with timestamps to make communicating edits a breeze. It has simple controls and menus, meaning anyone can pick up the platform pretty easily. There really isn't a good alternative to either of these platforms. Vimeo is the most similar to Frame.io, but it's not close. These platforms are beneficial to anyone utilizing video content (and everyone should be.) Rory's secret to staying in business: Disclaimer: It's incredibly hard. And anybody who says it's easy is lying. For him, positive self-talk and delegating tasks help maintain forward momentum. It can be tempting to do everything himself, but learning to delegate and asking for help when needed allows him to focus on quality over quantity. Rory's final takeaway? If you aren't using video, you should be. People retain 97% of video information, versus only 27% through image or text. It's so much more engaging than the welcome copy on a website. To get in contact with Rory, connect with him on LinkedIn or visit his company website. Rory's Book and Podcast Recommendations: A Hunter-Gatherers Guide to the 21st Century by Heather Heying and Bret Weinstein 12 Rules for Life and its sequel, Beyond Order: 12 More Rules for Life: by Jordan Peterson Atomic Habits by James Clear Modern Wisdom Podcast with Chris Williamson The Keto Kamp Podcast with Ben Azadi Feel Better, Live More with Dr. Rangan Chatterjee Business Anchors with Dan and Lloyd Knowlton

sharing tool ecommerce digital marketing machine learning frame vimeo bret weinstein live more knighton heather heying runwayml

Ep. #18, AI Powered Video Editing with Anastasis Germanidis of RunwayML

Demuxed

Play Episode Listen Later Sep 29, 2021 47:18

In episode 18 of Demuxed, Matt, Phil, and Heff are joined by Anastasis Germanidis of RunwayML. They unpack AI powered video editing tools in the browser and share many invaluable insights gained from the machine learning community.

ai ai powered video editing heff anastasis runwayml

Ep. #18, AI Powered Video Editing with Anastasis Germanidis of RunwayML

Heavybit Podcast Network: Master Feed

Play Episode Listen Later Sep 29, 2021 47:18

ai ai powered video editing heff anastasis runwayml

Podcasts about runwayml

Best podcasts about runwayml

AI For Humans

This Week in Startups

The ChatGPT Report

Latest news about runwayml

Latest podcast episodes about runwayml

Episode 563: AI Video Prompt Master Review

5 Tools to Build Viral Video Ads With AI | Andrew Erickson

S5 E170 Machinima News Omnibus (Feb 2025)

#102 - ChatGPT Pro | NotebookLM im Recruiting | KI in Bewerbungen | Social Media Verbot

[Ride Home] Simon Willison: Things we learned about LLMs in 2024

#120 Sora is here…Im not kidding this time

Çakal Stajyer Yapay Zeka Ajanlığı Yaparsa - Yapay Zekada Bu Hafta S2 B4

Marketing neu denken: Wie du mit KI im Marketing die Nase vorn hast (Timo Brümmer & Leon Peters)

#111 Sora is finally here...I mean o1

Intersection of INDIE & Deep Tech — A Conversation with Michael Dempsey, Compound Managing Partner

#100 Text to video heating up Luma and RunwayML

APPLE GOES ALL IN ON AI

AI Weekly Rundown (April 15th to April 21st 2024) Major Announcement from xAI first multimodal model, Google Infini-Attention, Adobe Firefly AI , Microsoft VASA-1, Meta Llama 3, NVIDIA RTX A400 A1000 and more:

75 - Surfeando la IA con Christian Palau - Humanismo Digital

Meta & Zuck Go Full AI, Palworld Controversy & Interactive Trucker Chatbots | Ep41

On a créé une vidéo ÉPIQUE avec l'IA !! #Masterclass

OpenAI Reunited, New AI Video Tools & WWE's Xavier Woods AKA Austin Creed | Ep34

Ep. 173 Ann Morgan – Abstract Painter, Melding the Traditional With Technology

RunwayML's Announces New Text-to-Video AI Features

Een taalmodel uit de polder, GPT-NL | ✨ Poki

The Aphid Lounge Preview: Worldbuilding, AI, and You Part 2: Using AI for TTRPGs

AI-Powered Filmmaking with Waymark's Stephen Parker and Josh Rubin

Amazing AI-Powered Video Apps.

AI-generated South Park, “Holy Grails” of AI & more with Sunny Madra | E1785

AI Minecraft, "Responsible AI" + We Make ChatGPT & LLAMA 2 Debate | AI For Humans

Part 2: One Minute Films, Acting & AI with Darion D'Anjou

EP11: Powering The Future of AI, Uncanny Valley Reductions in AI Video and the Latest in Design with AI

We Send AI to Therapy, Meta's New Music AI and Free RunwayML Video Apps

RunwayML, AI Steve Jobs, & more with Sunny Madra & Vinny Lingham | E1755

Nieuwe Zelda als eerste gespeeld en de Asus-spelcomputer

Runway Gen-2: Generative AI for Video Creation with Anastasis Germanidis - #622

Is Bing's Sydney Still Unhinged? Amazon Multimodal, AI a Threat to Humanity? OpenAI Foundry | E03

Amazing A.I. Tools for Video Editing

Sarah Catanzaro — Remembering the Lessons of the Last AI Renaissance

Ai And Content Creators - Don't Make This Mistake

Episode 23 - Generative AI, art and reality with Anastasis Germanidis, CTO at RunwayML

Founded and Funded IA40 Winner Spotlight: RunwayML Co-Founder Cristobal Valenzuela on the Intersection of Art and Technology

The Logik Podcast #26: Cristóbal Valenzuela from Runway

MORE 035: Are You Using This Machine Learning Tool? | Rory Knighton

Ep. #18, AI Powered Video Editing with Anastasis Germanidis of RunwayML

Ep. #18, AI Powered Video Editing with Anastasis Germanidis of RunwayML