Podcasts about no priors

19PODCASTS
121EPISODES
41mAVG DURATION
1MONTHLY NEW EPISODE
Jun 3, 2026LATEST

POPULARITY

20192020202120222023202420252026

Best podcasts about no priors

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

96 episodes with no priors

The Valmy

3 episodes with no priors

Latest podcast episodes about no priors

⚡️Satya Nadella: No Priors x Latent Space Crossover Special at Microsoft Build

Latent Space: The AI Engineer Podcast â€” CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Jun 3, 2026 38:58

We've informally heard that Satya is a listener to LS for a couple years now, but it was still absolutely surreal to meet him and do a live pod at Build, together with our friends at No Priors, the leading VC AI Podcast that we also greatly admire!We covered the MAI model technical takeaways on yesterday's AINews, so I will focus our recap of Satya's main messages around three elements:* Satya's adaptation of the Bill Gates Line for positioning Microsoft as the Frontier Intelligence Platform — customers must gain much more value from the Microsoft ecosystem than Microsoft itself, by building on multi-model harnesses like OpenClaw and Scout, drawing on the full enterprise context exposed by context layers like Work IQ (heavily dogfooded by his C-suite), and building up private evals and traces as a new form of Token IP* AI ROI: On one hand, enterprises are having difficult conversations around Tokenmaxxing and Layoffs, and on the other hand, there are serious re-evaluations of the End of SaaS since the Build vs Buy equation has changed so much. Our previous SemiAnalysis guest had… interesting comments on Microsoft's position on this as the ur-SaaS titan, and Satya had great answers* Making the Impossible Possible: Kevin Scott's inspiring framing around what the most ambitious version of applying AI and technology at large to business and social problems, like education and social impact.Enjoy!Full VideoTranscriptVoiceover: Welcome swyx, Sarah Guo, Elad Gil,, and Chairman and Chief Executive Officer of Microsoft, Satya NadellaSarah Guo: Welcome to a crossover episode of No Priors and Lane Space with Satya Nadella. Um, congratulations on an amazing build. No, thank you so much, and it's great to be with both of you. I listen to both of you or b- both the podcasts all the time. It's great to be on it.Thank you so much. [00:01:00] So you're just talking about, um, these amazing, uh, announcements from across the Microsoft estate all morning for, I think, three hours. What is the, uh, what's the most important reflection or takeaway you have?AI as an Ecosystem PlatformSarah Guo: I, I'd say there are, uh, perhaps the, the biggest one for me is let's sort of conceptualize this more as an ecosystem play as opposed to a single model or even a single platform, right?Satya Nadella: I mean, you know, whatever I... At least for me, having grown up at Microsoft, having seen, whatever, four major platform shifts, uh, I sort of fall into that, um, uh, camp where a platform is defined by fundamentally its ability to create more value about the platform versus what's captured in the platform. And so if you, you view what's happening right now, I think this morning's keynote was how can any company, whether it's an AI native company or a traditional enterprise company, participate as a first-class participant where they can point to AI they created, [00:02:00] right?It's not that they don't use other people's AI. Of course they will. But to me, what's the path? What's the recipe? How do I do it? What does a stack look like? What does the tooling look like? What is valuable? How do you do that? That's it. That's sort of our job to do. Yeah. Ecosystem strategy is, uh, very complicated, right?Sarah Guo: Because you end up building certain components, partnering for certain components, supporting them. You just announced this big suite of models. Like, tell us a little bit about the, uh, training strategy for Microsoft now. Yeah.MAI Models & Training StrategySarah Guo: So, so the thing that we wanted to do with the MAI models was to build, and as Mustafa talked about, first of all, a great lineage, right?Satya Nadella: Starting with pre-training, uh, with very good data quality, uh, doing all the ablations, making sure because in, in some sense it's becoming even harder to build a clean lineage model just because there's so much stuff out there, uh, that you truly need to ablate out to be able to have a fantastic [00:03:00] pre-trained model.In fact, that's one of the challenges of a lot of the open weight models is they look great on one benchmark or two, but they're not great on practice. So that's why, in fact, even in the RFDEs are, they, they are pretty gone really excited about these MAI models because how the heck can a small five B model hill climb?Uh, and it goes back a little bit to what I think is ultimately the key thing to do, which is try to pursue finding that cognitive core. Uh, so to me, starting with a clean lineage- Then creating that ability for companies to be able to use this, right? Not just as a generalist, but to create their own specialist by building this hill climbing scaffold around it, right?So it's not just the model, but you have a hill climb scaffold around it, then you will start building your RLE. You will start collecting the traces. Most importantly, you'll have private evals because we know all the evals out there are good, interesting, [00:04:00] but they're not really that critical- They're work, yeahSwyx: at this point because they all can be maxed. And so the point is each company will have its own private eval. And so that end-to-end platform story around our models is sort of, uh, what I think is interesting. And then the one other thing, Sarah, since you brought that up, is I do feel there's a new frontier.Satya Nadella: Like people talk about the frontier and are you operating at the frontier. Um, interestingly enough, if you add a little temporality to it, you can use, let's say, in, in, in fact, the, the Lando Lakes demo we showed was pretty cool. We used, whatever, GPT-55, right? Then you collected a bunch of traces, and then you took a 5B reasoning model and achieved higher.Sarah Guo: Uh, so that is another aspect of what it means to appear... uh, you know, operate at the frontier Yeah. I, I think, uh, I first of all have to congratulate you on basically building a frontier neo lab inside of Microsoft in two years. Um, I'm wondering, you know, you have all this AI strategy that you're rolling out.Lessons from Two Years of AI DevelopmentSwyx: I'm wondering, what do you know now that you wish you would tell yourself two years ago where- or two or [00:05:00] three years ago? Three years for the Jensen partnership, two years for, uh, MEI. Yeah, I mean, I think the, the thing when, that I reflect quite a bit, right, which is sort of obviously I got into all this when I got excited by the, the scaling laws paper and, you know, when, you know, even the OpenAI partnership came about when those folks said, “Hey, we're gonna really throw a lot of computer transformers.”Satya Nadella: Uh, and they've helped. I- the thing that I always look back and say, “Wow, these things, uh, do have capability that they're climbing up.” W- I mean, this, you know, this crude way of saying it is intelligence is log of compute kind of works. Now what I think we underestimated perhaps is the real-world complexity of deploying these so that they actually deliver the value in the real world, right?So the outcomes as measured by any benchmark is interestingly important, but the true eval is when people out there are able to do unique things that they only can value, and it's very [00:06:00] measurable, right? That I wish we had sort of even, like, had more in our consciousness, right? Which is as an industry.Sarah Guo: Because right now I think when people say, “Wow, I don't want a token max,” it's an artifact of us not having thought ourselves as an industry that we are using tokens to create value every step of the way. So I think that's kind of what I wish we had gotten there, but I'm glad we are here.Real-World Value & Use CasesSarah Guo: What are some of the use cases that you've seen that have created the most value for your customers?Because I know that people talk a lot about code, and I think it's pretty clear that that's something that's having very large scale impact. Are there other areas that you find in common that your customers are really benefiting from? Yeah. I think, yeah, to your point, obviously coding is now got... But it's interesting, by the way, Elijah, to even talk about the coding, right?Satya Nadella: Which is coding has worked so well that we now have to rebuild the IDE, right? I mean, it's kind of nuts to see what we sh- launched is like, oh my God, I have these hundred agent sessions. I... The cognitive load it transfers back to me as a human is so [00:07:00] excessive that now I need a new UI. Uh, oh, by the way, I, like the, the chat as the only artifact was also impossible, so that's why we need a canvas.So it's kind of interesting for all the things about where is software needed or where is UI needed, uh, you kind of need that even for code, right? In a fully agentic world. But that said, one of the things that we are starting to see, we started seeing with co-work, but even some of the work we, we showed with auto com- uh, um, autopilot Right on what you see with claws is a good one because if you sort of think about a lot of human capital is doing the glue work, right?If you now can augment that with tokens/agents that are long-running, durable, right, then your ability to scale even what is still judgment and glue work gets amplified like coding does. Uh, so you can... Like, I'm positive that six months from now we'll all be saying, “Oh, wow,” like, all through ni- the night there was a bunch of stuff that [00:08:00] all these autopilots that I have working on my behalf with my delegated authority, so to speak, right?I can... Sort of given even my identity, did a bunch of work, then of course I'll need my new ADE to say, “Well, what did you do?” Like, I might... “Did I do this work?” And so on. So I think that that's where compressing of workflows, uh, completing of tasks, uh, that's where I think a lot of the value gets created. I think you raised a really interesting point, which is there's the actual agent that's doing the code, and then there's a harness around it, and that's the environment, that's the context, that's everything you're setting up as a developer around actually a coding agent.The Harness Concept for Enterprise AISarah Guo: What is the harness for the enterprise? Is there an equivalent concept for broader productivity work, or how do you think about that concept sort of generalized? That's right. So, so in some sense you kind of want the harness to define the models, the, the data, uh, and the tools, and so that you have a loop across those three.Satya Nadella: And so what we are trying to, first of all, make sure is each of our products that we build, right, whether it's GitHub Copilot or the security copi- the, the [00:09:00] stuff we showed with MDASH or even the discovery for science, it doesn't matter, all of them are multi-model harnesses, um, with tools access so that you can do this progressive, uh, disclosure of tools even so that they're token efficient.Uh, and then you're feeding it with very rich context because that's sort of the other hard lesson we have learned in the last two years is, oh my God, the amount of work you need to do to prep the context layer, uh, such that your plan can execute in the most efficient way is where the magic is. So we have, in our case, we have the GitHub harness, which essentially we're using across all our products.It's available in Foundry, and we are open, like you can use your Llama harness, whatever. Or you can use the, um, uh, you know, any open harness or any harness of yours and train with your tools and multiple models and your context. And so that's the pitch. Because right now a lot of dialogue is, um, “Hey, if I train the harness plus tools and the model together, you get [00:10:00] evals.”Elad Gil: And what we are proving out is... And the best example of that is what we did with MDASH, right? Because when it launched, uh, it found bugs or vulnerabilities that were not found by Mythos Uh, and so there is existence proof, I would claim, that you can have a multimodal harness, uh, that can in fact be more, uh, performant in the real world So a premise behind the, uh, training at the independent frontier labs is really, you know, we're gonna have these models, and we'll have an API business, and we'll support enterprises and startups.Sarah Guo: ButPlatform Strategy & Developer EcosystemSarah Guo: a first-party product, be it productivity or code or search, drives the majority of revenue. That's a different value equation than you're describing, I think, with the Microsoft ecosystem. Uh, if, if that's the case, tell me if it's the case, uh, ‘cause obviously you have first-party products and you have enablement products.Satya Nadella: Um, what is the role of the develop- Like what is gonna be hard and the set of skills and the value capture the developer has in that world? Yeah. So I think that there's always [00:11:00] gonna be the case that someone who is super successful in- as a platform builder can also have first-party products. It was true with Windows.It is true, uh, with, uh, the, the SaaS side and the cloud side as well with us and others and so on. But the thing that is, is it should not be a limiter to other people achieving that same success, right? That I think is the core difference, which is the, the network effects this time around, around intelligence are such because they learn from data, and not really lots of data.It's just a few samples that you have to see to understand what's novel about something. So that's why the game becomes how to protect. So that's why I would say every company, having private evals may be the biggest IP, right? Think about it, like what's that private eval that you can then use even a frontier model to hill climb on and not leak the traces may be one of the biggest [00:12:00] drivers, uh, of IP.Like, so in other words, another te- acid test is you have an eval that's private. You're using, uh, a g- a Model A. Can you switch it to Model B and e- you know, climb up? If you can, then you're in control. If you can't, you're not in control, and that's where even the harness decision becomes super important, right?swyx So therefore, having an open harness, letting all models come in, having your evals, your context, your tools help you hill climb, I think is the skills that an AI native startup needs, a SaaS company needs, or every enterprise needs. Yeah, I think in, in a very real way you are ... Microsoft historically is an operating systems company and th- then become a cloud company.Maybe like the third act is that you're a harness or evals company. Whatever w- ... whatever the, the sort of conglomerate of concepts that you wanna put together. Um, and, and I think like enabling every company to have like frontier intelligence or what- what- Yeah ... I forget the, the [00:13:00] exact term that you used, um, is the, is the mission, right?Satya Nadella: That's it. Like that is, that is the platform promise, that you build with us, you will get your intelligence, uh, for your data. That's it. That ... To, to me, that is the ... Like if there was one tagline, uh, for this entire developer conference is- Can everybody operate at the frontier with their frontier intelligence, right?To me, that is so important because otherwise it, I, I don't know how you achieve stable equilibrium, right? Which is how do I then go and say, “Well, my company is gonna have a terminal value because I now know how to continuously compound-” Yeah ... on top of what's a platform that gets better,” right? So when, like Windows obviously came out, Adobe built, Autodesk built, uh, or even like take what Jensen said.We built DX and he built, you know, CUDA on top of it. Um, right? I mean, I always say to Jensen, “God, I got the short end of that,” right? “I wish, uh, we had recognized it.” But nevertheless, but that, that idea that you can build a platform layer [00:14:00] that someone else can then extend out, um, and build their own intelligence layer in this case, I think is everything, right?Without it, why have a developer conference? I can just come and have you all sort of just worship at the altar of one model. Yeah. But that's not a developer conference. Uh,IP, Evals & Company Valueswyx: backstage we, we had a discussion about what is IP or what is the, the value in a company. It used to be the length of, uh, human experience at a company, and now it's this other thing which is the evals, the, uh, experience in sort of applying agents to the company. Can you... I just want you to like flesh that out a bit more ‘cause- Yeah ... it was very insightful.Satya Nadella: It's a great way to frame it, right? Because yeah, at the end of the day, every company is gonna have both the human capital that is still gonna be super valuable, uh, because humans, uh, and their ability to find the gaps that exist at all times is going to be the way we all will create value, right?I mean, so I'm definitely in the camp that this is going to be about expressing new forms of human agency and ambition even as token capital goes up, right? So let's say a cor- any corporation [00:15:00] has lots of tokens and lot of human capital. The question is how do you compound the two? So if you have a... Like if you take in Teams I have a bunch of agents doing work and a bunch of humans doing work, and the traces between those, that is really important context of how that enterprise is creating value.Then that goes back to train not a generalist model, but to train the company veteran agent, uh, right? That is super valuable again, right? Which is when a company goes says, “It should in fact go onto the balance sheet,” is how I think about it, right? That's so... In fact, there may be... Like human capital was never possible to go put on a balance sheet, uh, because you didn't know how to capture the tacit knowledge.swyx: Whereas now I think you can with the agents that have learned through the h- through, through time, through all the traces. Uh, so that's what at least we think will happen. I, I think the SEC is gonna have to have accounting standards- ... for token, uh, expertise Uh, y- y- you're talking about the equilibrium [00:16:00] state, um, and a stable equilibrium where companies have this compounding value and can see terminal value for themselves.Future of SaaS & Business ModelsSarah Guo: Another challenge to, you know, the considered equilibrium of, okay, there are applications and workflows that are sort of common to a vertical or a horizontal. Um, and this was, like, the generation of SaaS companies and, you know, Microsoft has lots of SaaS properties as well. And then there are things that are very specific to every enterprise that they're differentiated against.Elad Gil: Um, I'm sure you have heard much and participate in much of the debate about the end of software because all these workflows are, are cheap to generate now. Um, do you think the equilibrium looks different between what agents get built- Yeah ... in enterprises versus in their vendors in the future? Yeah. So I think what's happening there is, see, we, we had a particular way we captured, um, I would say workflow in apps, right?Satya Nadella: Because we built a, a data model, right? We schematized some part of some business process. Mm-hmm. We then built a bunch of business logic. Yep. And then we put a bunch of UI [00:17:00] on top of it, right? So that's kind of what every SaaS company- And a little configuration. For, like, 20, 20 years that was the plan.Right, that- Yeah ... and that was it. So interestingly enough, now you kind of get to re-litigate that vertical stacking, right? So I still think, for example, that data model that you built underneath every SaaS application is super good, right? Like, why reinvent it? Like, I, I, my general ledger better be a general ledger.I don't need new schema creation. No. Uh, in fact, that entity relationship, uh, is actually pretty good, robust thing that I want to feed. And you want it to be stable. That's right. Yeah. Then same thing with business logic, right? If, if you look at, uh... We have this product called Power BI, right? It is like dashboards galore people created.The beauty underneath that dashboard is a very rich semantic model, right? Someone took the pain to create a dashboard and do all the measures, and you want that. That's business logic, right? I want that to be available to me. So I think the [00:18:00] challenge of the SaaS business model is we packaged one way. We now have to learn how to unbundle these things and rebundle in new ways and discover new business models, right?I mean, if you look at it, d- what's happening today with Microsoft 365 is a great example, right? We have this thing called Work IQ. In fact, like, what we are realizing is, oh my God, like, you know, if you look at... In fact, there's a pa- historical parallel too, right? We sold first Exchange and SharePoint and, uh, you know, before Teams, we had a thing called Lync Server and what have you, and we thought, “Oh, that's all gonna move to the cloud.”But little did we realize that, um, the number of people who will use servers in the cloud is 10X, 100X, right? Because people were not buying servers, they were just buying a subscription. Mm-hmm. The same thing is now happening with M365 because with Work IQ, we have exposed what is perhaps the most important database in a company that never got used as a database because it was only captive to our apps.Mm-hmm. Right? It, it was all email operated on it, Teams operated [00:19:00] on it, Word, Excel, PowerPoint, SharePoint. But now, like this is one of the coo- coolest things I get to do with Work IQ. I go to a GitHub repo and I say, “Hey, I attended a bunch of design meetings last week related to this repo. Can you capture all that and tell me what changes I should make?”I mean, think about that, right? It literally can go look at all those transcripts, come back with a plan to change a code base, right? Previously, you could never have thought of using M365 for something like that. So the value creation opportunity now in the agent world is in fact 10X more, but it does require us to have...Sarah Guo: For example, there's going to be usage around M365, right? Which is going to be perhaps more than even the e- end users and we have to even re-architect. Like, in fact, like what I use to serve an inbox or a mailbox cannot be used to serve an agent. Uh, and so that's sort of what we are doing.Pricing Models: Per-User, Consumption & OutcomesSarah Guo: I don't believe in, like, permanent business models for any of these domains, but in the [00:20:00] near term, do you have a prediction between, uh, you know, outcomes-based pricing, token-based pricing?Elad Gil: Enterprise bundles Yeah. The way I- I think about this is always we've had... Like, let's even take the per-user pricing. Mm-hmm. The per-user pricing is really an artifact of someone creating a budget needing certainty, right? Because it's the most important thing. Like, somebody wants a budget- Mm-hmm ... they need a per user.Satya Nadella: And, and per user is just a set of entitlements to usage, right? That's kind of what it is. And so the way is, if the first bundling will be take some usage, bundle it into per user stacks and, you know, then sell subscriptions. So subscriptions I think are gonna be there, per user is gonna be there. Then the next big thing will be consumption.So people will say, “I want consumption.” And it's also possible that people will say, “I don't even want to pay for any of the subscriptions or the consumption's outcome.” Mm. But remember, most people love outcomes until they have an outcome, because once you have an outcome, it's like giving away royalty, [00:21:00] right?Mm. I mean, like I, I've talked to customers who love, you know, outcome-based pricing, and I say, “I'm all in,” until they, “Oh my God,” like, “what are you talking about? You're sharing in my outcome? No, no, no. I want you to go back to per-user pricing, and I want you to consumption price,” right? So I think that debate will go on.Uh, but and all, all, all of these business models have a particular time and a place versus one to rule them all. And if anything, if you're a SaaS vendor or you're a platform vendor, having that flexibility... And quite frankly, we face this with GitHub, right? We just recently announced a per-user pricing on GitHub because little, you know, we- GitHub Copilot was constructed at a per-user level before we understood even, uh, the intensity of usage of agents, right?It was an interactive way for a developer to use code complete, maybe tasks. It was not like, oh, I launched 10,000, you know, agents that are going on all day, right? So that is what the adjustment is about. So now that we really want, there will [00:22:00] always be a per user, but there will have to be a consumption meter.Durability of SaaS & Build vs BuySarah Guo: How do you think about the durability of SaaS more generally? One thing I've observed is in a lot of enterprises internally, there will be teams that almost have agent euphoria. They're so excited about the explosion of things they can build that they're trying to rebuild a lot of applications or going to their SaaS vendors and saying, “We're not gonna work with you anymore,” or, “We're considering an internal project.”And it seems like in six to nine months, maybe some of those people will come back and say, “Actually, we, we can't rebuild everything.” How do you think about what's durable in this world and what isn't? Yeah, it's a... It... I think we have to go through one full budget cycle on this to really see the, um- Uh, the sort of the emergence of the equilibrium, because at the end of the day, there's marginal cost to even generating the app, right?Elad Gil: In, in fact, there can be even a, a simple way to say it, like if you should always acquire something if the marginal cost of building and maintaining, uh, something on your own is higher. Uh, right? That should be like it's a quantifiable- Yeah. Right? A quantifiable thing. And [00:23:00] the maintenance part is important, right?Even, like you got to remember like, hey, you know, all the security stuff that now AI will find, you better fix them too fast. Uh, of course, there's a coding agent to help you with, but then that burns tokens, right? So whose responsibility is it? It's kind of like a, a cycle that you've got to think through.And I think we have gone through the excitement that I can generate a lot of software. I think the next thing would be what software do I really want to generate? Mm-hmm. What software do I want to use from others? How do I compose these two into some agentic workflow that I have agency over, right?Sarah Guo: Because I think there'll be very little tolerance for anybody who's inflexible, uh, at the vendor level. Uh, but at the same time, I think that anyone who has got that flexibility shows up, delivers the value, will be back at again, right? We're selling software, uh, but with just different business models, in fact Uh, speaking about building software, um, one of my favorite moments from, I think, a previous build maybe one or two years ago was they had a b- they, they...Swyx: There was a section of you building your [00:24:00] own software. I'm curious if you're building anything now. Yeah. So I, I think the... You know, first of all, let's face it, right? Building software has made it possible for even the incompetence of a CEO of a company- ... like ours, uh, you can build, so thank God. But that said, I, I, I, I do feel that, you know, something like, um, GitHub Copilot to me, and especially the new Sessions app or the new app, has just made it so much more possible for you to have agency over artifacts that you felt you couldn't touch before, right?Satya Nadella: So to, for me as a CEO, even to go to a code base, uh, to be able to learn about it, like I remember joining Microsoft long back, you know, first and then you say, man, everybody had to go in and look at, you know, whatever, Cutler's, Malik, or what have you to learn how to do good C, uh, C++ code. Um, so now that ability to be more full stack up and down is so good, but that doesn't mean every one of us should be doing the same thing.The question is: [00:25:00] how do you then have the ability to inspect things, learn things, see things, um, I think is just so much more. And so to me, what I'm building a lot of is these long-running Foundry agents. Uh, right? So there's autopilots. So the easiest thing is, to me, I think I just built one, uh, even last week, where the idea was, hey, can I have an agent that is continuously monitoring essentially my own chief of staff autopilot, right?We're gonna have that obviously in, uh, Scout. That's what, uh, uh, we showed. But it is so easy and trivial to build. I took Work IQ. I said, “Take Work IQ, go, uh, and build a Foundry long-running agent.” Uh, store all the memory in, um, uh, using Ray Fin, right? Basically at my backend as a service. And lo and behold, it built it, and not only built it, I could say publish to Teams, and it published the damn thing to Teams.Sarah Guo: So the ability, uh, to have a, you know, some end-to-end project like this complete is just pretty [00:26:00] miraculous. How do you think, uh,Future Engineering RolesSarah Guo: that impacts the different types of engineering roles that exist in the future? Because right now I think there's, you know, a dozen different types of engineers that you can be, from QA, front end, et cetera.You know, there's a big swath. I've heard some people argue that in four or five years we'll basically end up with four engineering roles. It'll be people who are managing agents, it'll be four deployed engineers or FDEs, it'll be security engineers, and then people working on large scale infrastructure for a small number of services, and then everything else just collapses into the agentic world.Satya Nadella: Yeah, I- Do you think that's a correct view of the world? Yeah, I mean, I think, I think we'll have to experiment our way through it. But what you said is what... There are some very at scale things. At LinkedIn, they did structurally change- Mm-hmm ... uh, and it, you know, basically built up a new discipline called full stack builder, right?So they went and said, “Hey, let's bring, uh, people from design and product management, front end engineering, all put them together.” Uh, but also have an edge, right? It's not like the design person still doesn't have the design edge, or the front end [00:27:00] person doesn't have the front end edge, but you can give yourself bigger scope in roles so that you're not confined to one role.Um, and then r- equally, infrastructure has become very critical, right? So in other words, like, I mean, RLEs, I mean, one thing we've realized is even for the Excel team, for example. Mm-hmm. Building the RLE in which a reward can be learned is actually one of the hardest sort of infrastructure problems.Mm-hmm. Uh, and so you kind of need even new talent, right? Distributed systems people even in what was considered an end user app team, uh, because it's a different skill set. So yes, infrastructure, science is the other one, obviously. Um, so I think we'll see how these evolve, right? Where's the s- real... I mean, always the world will have a bunch of specialists.Okay. Um, you know, I think the generalist role is going to be the most exciting, right? Because the leverage of a generalist- Mm-hmm ... um, is where we are going to see the maximum returns, right? When, when you said, “Hey, are you coding?” I'm now a gen- Like, what... I've basically translated [00:28:00] knowledge work Right?Which I did, where I created a Word document or a spreadsheet, or even, uh... And now I can build an app, right? It's in the same sentence. Uh, right? That idea that, “Oh, wow, my generalist skills have gotten higher leverage,” I think is what we're gonna see across the board. Music to the ears of CEOs and VCs that are, like, a little dangerous and a lot of- Golden age for idea peopleSarah Guo: idea people. Yeah. Uh- With a lot of agency. I- if you take that idea of personal agency and you just zoom it out to the organizational context, um, uh, my partner Mike Renall, who, uh, actually started his career at Microsoft, just wrote an essay where one of the big takeaways is i- it's an age where you can be much more ambitious, and you need to be, given the pace of the environment and how quickly, actually, users and companies are open to adopting new technologies.Satya Nadella: Um, how do you think about... I, I feel silly asking this of somebody running a, you know, trillion-dollar-plus company already, butAmbition & Making the Impossible PossibleSatya Nadella: how do you think about how Microsoft can be more ambitious now? It's a great question. Um, I [00:29:00] think, um- I think the, the thing in these type of transitions is to have a conceptual model of how work can change to go after outcomes that you could hardly imagine previously, right?In fact, Kevin Scott has this nice line, right, which is, um, when you can make the impossible... Like, when you're making hard things easier, that's sort of one point of leverage. But true ambition is about making the impossible possible. So now the thing that is missing a little bit in all of our organizations is what is that new conceptual model of what can we build?What was impossible and what can we build? And I'll give you one example of this, right, which is I take great inspiration from sort of the people who were managing the Azure net- network. And they came to the... This was from even last year. You know, we were scaling. You saw that I, I [00:30:00] talked about sort of how we built in the last 15 months more Azure capacity than we built in the first 15 years.I mean, it's crazy. Wild. Yeah. Right? It's pretty wild. And it's the same team. So they saw that and they said, “Bob, this just ain't gonna work if we don't reconceptualize our work.” So they built... Essentially they said, “Our job is not to do Azure networking. Our job is to build the agentic system does, that, that does Azure networking,” right?These are the folks managing the 500-plus fiber operators managing the VAN, right, all over. And fiber operations ultimately is a physical operation. Things get cut, things get, uh, you know, have to be repaired. You know, we have fancy words called DevOps and so on. Basically, emails are coming in and you gotta go respond to them, take care of it.So they built this agentic system. They even have a character for it. It's called Miles, and it sort of does all this stuff, right? They started sort of screaming for more tokens and so on. And so they were saying, “Look, uh, we don't need a headcount. We need tokens in order to be able to [00:31:00] manage, uh, our operation.”That reconceptualization- Mm-hmm ... of what their work is, right? They, they basically took their work and made it meta, right? That meta work is now their new work. Mm-hmm. Right? In the ‘80s, if somebody had come to us and said, “4 billion people are gonna get up in the morning and start typing,” my model would've been, we need 4 billion typists?But we're not doing typing, we're doing knowledge work. So that, to me, I think is it, right, which is whether it's Microsoft or whether it's any organization, is to give ourselves permission to do new types of metacognition, meta work, using these new tools to change the outputs that matter, uh, and then really make the impossible possible.Sarah Guo: So completing that dot or the, the connective tissue across those, I think, is where a lot of the enterprise value will get created.Data Center Build-Out & Community ImpactSarah Guo: Should we talk about data centers? Yeah, please ask. Oh, okay. Well, uh, uh, w- we-- this leads nicely into the data center build-up. I always think, I- I just-- I'm just impressed at the sheer scale of the [00:32:00] build-out from Microsoft, but also everyone else, that this is redefining what it means to be a hyperscaler.And I just feel like that, that, that is at unprecedented scale on finances, uh, on the way you run the company, but also the communities that are, that are impacted. Um, yeah, just talk a bit more about what you're seeing on the ground, like when you visit your- Yeah, I think there are two aspects of it.Satya Nadella: Obviously, the, the build-out is, uh, extraordinary. Um, you know, nothing like this has happened, and it's great to be, uh, one of the participants in it. Uh, but you brought up the other part, right? I think at this point it's clear that unless we as an industry, uh, are very principled about ensuring that the benefits of all the stuff we're talking about are felt in real ways, uh, at the community level, right?Because this is not just a, a campaign, um, right? It has to be real, where people are saying, “Look, this is not ch- changing the prices on energy for me.” In fact, if anything, it's bringing down prices because long term there's going to be a better [00:33:00] grid, there is going to be more energy. Water consumption is, in fact, not sort of, uh...In fact, water is being replenished, right? You gotta really, you know, educate folks on truly what's happening, the cl- uh, the closed loop systems we are building. We have to invest in the training, the jobs, the tax base. In fact, the least talked about stuff is the amount of jobs that get created during construction, after construction.What's the tax base that's there in the community? And, and all this has to be real. Um, and, and if that is the case, then we will have permission. If it is not, we won't have permission. It's as simple as that, right? Which is, uh, we, we... I think we have to take it as an industry pretty seriously. Uh, I think it's good for communities to be skeptical, ask the hard questions, for us to do the hard work, earn that.Um, but at the end of the day, if there's-- if we can really be the produ-- Wait. I've always felt like in human history, if you use a lot of energy but also create a lot of value for society- The story has been fantastic. If you don't [00:34:00] do that, it's not been that great. And this time around, I'm a firm believer that ultimately if you do have a token economy that drives productivity, that drives economic growth, that drives broad spread, um, you know, participation, better health outcomes, um, then I think we'll be in a great place.Sarah Guo: Uh, and that's at least what we all have to be focused on. Yeah. It, it makes me think actually that with all these initiatives that you're doing, might be e- easier to see ROI in the communities first before in enterprise. Yeah. I, I mean, I think both sides. Yeah. In fact, it comes back together. It has to be the people in the communities are going to be employed, are going to be participants, uh, in the real economy, right?Satya Nadella: That's I think the question is. Like, if we- if the broad economy is doing well and the communities are doing well, the dots get connected. It's sort of the market forces are such that we will connect the dots. And that I think is it. Like, you ought to be able to see the evidence. You can't be about o- any one company, uh, but it has to be broad economic growth and broad [00:35:00] ec- you know, community permission.Elad Gil: Yeah. I guess I wanna talk aboutSocietal Impact & Optimism About AIElad Gil: what you're most optimistic about currently or what have you most updated your personal models on regarding societal impact of AI? So you're saying what's the, the, the- What have you updated most on in terms of societal impact of AI? Yeah. I think the, um, the p- the most, um- Critical thing is the first question we even started with, which is we need to tell the story and make it real that everybody has a real shot to participate as a first-class participant in this new economy.Satya Nadella: Right? That's kind of, I think we- in the next 12 months, 18 months, we need a way for people to say, “Oh, wow, I get it.” Right? There's going to be tremendous capability, tremendous amount of infrastructure, but I can see what is going to happen, whether it's the benefits like health outcomes or my ability to create a startup or my ability to run my [00:36:00] local sort of, uh, store more efficiently.It's just happening, and I see that, uh, benefit myself, right? That to me, you know, earning that permission in a path-dependent way, we can't wait. See, the one thing, Eli, that I've now learned is I think the world is gonna be very skeptical of tech and tech companies that say, “Trust us, we've got it. The g- future is gonna be glorious.”Sarah Guo: Uh, you kind of have to deliver tangible benefits. Um, and quite frankly, politicians winning elections, uh, because they have advocated for that. That will be at least my adjustment because without it, um, thinking that somehow... Because it's too important this time around. It's too much of the economy for it not to be the case So one very simple framework I have for, you know, what are, what is gonna be the broad benefit of AI, um, beyond the communities just working in technology, are, are sort of wealth creation- Yepit's [00:37:00] gonna happen in a ton of different companies, startups and large companies. Then you have healthcare. Uh, you, you had amazing demos today. There are companies like Open Evidence. I think that is happening. Um,Education & Future of LearningSarah Guo: education seems like another one that's an- Yep ... obvious good where we haven't seen as much impact as I'd expect.Swyx: Do you have a hypothesis on why that might be, or if it'll come? Yeah, I mean, I think this is where, again, how we think about education, how... You know, recently I met with, uh, the founders of Alpha School and learnt a lot about what they were going and going about, and it's fascinating to listen, uh, to how to even rethink- MmSatya Nadella: uh, what does education really look like. Because I think it's actually very important. Mm. Uh, and I'm not saying anything traditionally being done is less important, right? I was even looking at the, uh... It's fascinating to see. I, I, I forget the which Stanford class it was, uh, the, the Asian guidelines for CS something.Mm. Uh, because you still need people to learn. Uh, like it was an interesting AI class that they were making sure people were learning how to apply softmax appropriately versus saying, “Hey, fix my training run.” Mm-hmm. Uh, so I think learning concepts is important. It's going to [00:38:00] be, uh, critical. But the way we create the incentives, what are the credentials, how we value those credentials, what is the employment opportunity for those credentials?So I think that there's a complete change that has to happen, uh, given the way to get to information, way to educate yourself, way to continuously keep yourself updated has changed so much. So I think interestingly enough, maybe the next big startup and success story could be someone who builds a new university, um, or a new, um, pedagogy even of how to get someone to go through a curriculum and find economic opportunity, uh, that's highly valuable.Well, that has felt, uh, perhaps impossible for a long time, but it's a great note to end on and something that might be possible. It's still possible. Yeah. Thank you, Satya. Thank you so much. Thank you. Yeah. I appreciate it. Thank you all. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit www.latent.space/subscribe

GitHub's plan for Agents — Kyle Daigle, GitHub

Latent Space: The AI Engineer Podcast â€” CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Jun 2, 2026 83:27

I'm excited to work with Microsoft once again as the presenting sponsors of the AI Engineer World's Fair! We'll streaming live from MS Build today for a special crossover pod with our friends at No Priors and the one and only Satya Nadella. However we did not hold back with this interview - we asked all the burning questions about uptime and Copilot that we know you have in your minds. Lets go!For almost two decades, GitHub has been the home of software, where both open source and closed flow, through commits, pull requests, reviews, actions, etc.This ecosystem flourished as open-source maintainers and contributors would continue shipping code for the benefit of the community. However as coding agents began to ship mass quantities of code - growing 1400% in 2026, it marked a new era that was both extremely exciting and challenging for GitHub.While these agents help more people ship more projects, they also significantly increase the floor of how much code is shipped, how often it is shipped, how many people commit code, and basically orders of magnitude multiples in every dimension of GitHub infrastructure:Now GitHub inevitably experiences more pressure on their infrastructure which was originally designed around human developers moving at human speed. This has resulted in a very publicly notable uptime story:So it begs the question of whether current systems around code can absorb what AI produces. Can CI/CD keep up when every idea becomes a build? Can open source maintainers survive floods of AI-generated slop contributions? Can GitHub preserve the human social contract of software while becoming the operating layer for agents?Which brings us to the perfect person to answer these questions: GitHub COO Kyle Daigle. In this episode, he joins swyx to unpack what happens when AI doesn't just autocomplete code, but starts changing how companies operate, how open source works, how pull requests get reviewed, and how GitHub itself has to scale. We go deep on GitHub's internal AI workflows: micro-skills, WorkIQ, MCP, Slack, Teams, email, Copilot workflows, the new Copilot desktop app, CLI, cloud agents, and how Kyle uses agents to look backwards across company context before deciding what to do next. Kyle also reflects on GitHub's history building webhooks, APIs, Actions, npm, Dependabot, and Semmle, why the AI era is breaking GitHub in new ways, how Actions became a general-purpose compute layer, and what Copilot becomes after code completion.Full Video PodWe discuss:* Kyle's expanded role across GitHub* How AI got Kyle coding again after years in leadership* Why GitHub rolls out AI through existing workflows instead of forcing new tools* WorkIQ, MCP, Slack, Teams, email, and GitHub as company context* Why massive “mega-skills” are giving way to small, atomic micro-skills* How AI changes summarization, communications, marketing, and analyst work* Why former developers in leadership may have a unique advantage in the AI era* Kyle's “15 agents on Saturday” workflow* How Kyle built an AI-generated executive presentation for CRO/CFO teams* Why AI changes the chief of staff role without removing the human work* GitHub Actions, webhooks, arbitrary code execution, and secure agent compute* The npm acquisition, supply-chain security, 2FA, and token invalidation* Slop forks, vendoring, and whether AI agents change dependency management* What pull requests become when most PRs come from agents* Prompt requests, vouching, AI review, and trust in open source* What counts as a “developer” when AI lowers the barrier to building* GitHub Spark, low-code, and why GitHub refuses to hide the code* 14x commit growth, Actions load, databases, monorepos, and availability* Copilot's evolution from completion to CLI, desktop app, cloud agents, and SDK* Context, memory, rules, and making GitHub “act like Kyle wants it to act”* Ambient AI, OpenClaw, enterprise security, and the new operating system for agents* What swyx should ask Satya Nadella about Microsoft's AI futureKyle Daigle* LinkedIn: https://www.linkedin.com/in/kyledaigle* X: https://x.com/kdaigleTimestamps00:00:00 Introduction00:03:36 Why AI Got Kyle Coding Again00:07:04 Running GitHub with AI: WorkIQ, MCP, Slack, Teams, and Skills00:15:39 The Golden Age for Former Developers in Leadership00:17:31 15 Agents on Saturday and AI-Generated Executive Work00:20:20 How AI Changes the Chief of Staff Role00:21:45 GitHub's History: Actions, npm, Webhooks, and Open Source00:28:45 Slop Forks, Vendoring, and AI Dependency Management00:33:57 Pull Requests, Prompt Requests, and Trust in Agent-Generated Code00:41:21 GitHub Stars, 200M+ Developers, and the New AI Builder Wave00:45:15 GitHub Spark, Low-Code, and Why GitHub Still Shows the Code00:47:38 GitHub's Hardest Era: 14x Growth, Reliability, and Scale00:59:21 Actions as the Compute Layer for CI/CD and Automation01:02:04 The State and Future of GitHub Copilot01:08:24 Ambient AI, Background Agents, and the Future of the SDLC01:13:09 OpenClaw, Enterprise Security, and the New OS for Agents01:18:03 Build Announcements, WorkIQ, FoundryIQ, and Microsoft Context01:21:41 What Should swyx Ask Satya?TranscriptIntroduction: Kyle Daigle's Expanded Role at GitHub and MicrosoftSwyx [00:00:00]: We're here with Kyle Daigle, COO of GitHub. Welcome.Kyle [00:00:07]: Hey, thanks for having me.Swyx [00:00:08]: You're not just CEO of GitHub. People know you as that. You have a new role.Kyle [00:00:11]: So I have an expanded role now. I've been working at GitHub for thirteen years and doing all things developer. Joined as a developer myself. And now, I'm also responsible as the CMO of Developer for Microsoft. And so all the kind of learnings and passion for developers and how we work with them and how we communicate and how we bring our products to market, we're also bringing that expertise to the broader Microsoft ecosystem and helping every developer that uses a Microsoft product or would like to have a sort of similar experience that they've had with GitHub over the years. So it's a different role in some ways, but it's also just building on the experience that I've had at GitHub of just sort of tell the truth, be authentic, show people how to use it and then let the products speak for themselves. Now just doing that with, all of Microsoft.Swyx [00:01:09]: We'll be releasing this in conjunction with Build. You got lots of stuff planned, and we can sort of touch on that whenever it's appropriate. I think one of the interesting things is I rarely meet a COO who's also a CMO. I think you're a very outward facing and you're very confident publicly. That's rare. Do you actually view yourself as COO? What's What is your thing?From GitHub Developer to COO/CMO: Building the Platform and Operating GitHubKyle [00:01:33]: I think for me, it's been funny. The titles have always been, a— have always felt a little strange to me. I joined GitHub as a developer? I wrote so much of theSwyx [00:01:46]: Let's bring that up. You wrote the back ends?Kyle [00:01:48]: I was going through, I was going through, some old photos, when folks were talking about how things were being built or how there was a build GitHub. I built, webhooks and worked with teams building the API, built the platform layer. Anything that integrated with GitHub, up until really twenty eighteen, I built or ran the engineering teams. And that's kind of where my the beginning of my passion always was helping people build things, deliver them to, their customers. And so being a developer, building for developers was always super unique. In a— I think as my role expanded, it became my ability to talk to not just developers, but also enterprise customers or business leaders and have this translation layer. And then through all those years, GitHub has always operated pretty uniquely. Post-pandemic, working remotely was not as novel as it was when GitHub started in two thousand and eight. But all that expertise of running remote teams, doing it well, became this sort of bigger role, ultimately turning into the COO role of how do we operate GitHub in the way that GitHub's always operated after the Microsoft acquisition. And kind of so on from there. So like for me, I think the— I've, I still code. I love coding but the problem has always been, people. It's a much harder problem to both support our own employees, a harder problem to communicate to developers and enterprise buyers what we're building why it matters, ‘cause those are two very different messages. And so getting to work in the mix of COO, CMO, also just being a dev, I think is what's kept me at GitHub for so long.AI Workflows for Leadership: Commits, Retrospectives, and ContextSwyx [00:03:40]: Apparently, you have— your commits have gone up. What's this? What's going on?Kyle [00:03:45]: Rui's called me out pretty aggressively. So I think— as you can imagine, right, you can see my normal era of being a dev In the twenty thirteen, twenty fourteen era, and then moving into management, and then ultimately the COO role. I think what you see there is me, really getting back to coding thanks to AI. I— similar to, attaching problems between how to market and how to operate a business and how to code, I find, building agents and workflows that are connecting very disparate problems to be what's driving this. So that's, some of it's writing software. A lot of it is, connecting a ton of a different data sources to, help me out. But that is completely me really diving in on the AI side in trying out our tools, trying out everyone's tools, But building for me, building for the non-technical leader, though I'm technical and how we're, able to use these tools more than just the simple, call and response that I think a lot of the non-technical, your employers, you have to get— you have to use AI, and so everyone uses, ChatGPT or Copilot or Claude or whatever. To really get into, how is this going to help me out, it— I find that it's not the I need to write a blog post, I need to those simple examples. Helping people find the workflows of, “Okay, I need you to go through all the PRs today. I need you to go through everything that we've posted online. I need you to go through what we did the last three months. Go through all of my Obsidian notes for any mentions of this then go through my transcripts at work.” We use, Teams, so, using WorkIQ, go call that MCP server, grab all the transcripts, go through all the Slack, and then build me out the plan of, what this week's messaging actually was. That's something that was, impossible because for me, I find AI in a what most of this launch here is actually, less building forward. It's actually, a recursive loop backwards. I'm always looking at what had happened first. Go back through the week and tell me what we did, what worked, what didn't work? And then tell me in the next three or four days-What would you tweak based on this sort of like looking backwards and then looking ahead a little bit? I find that to be so much more valuable, especially for like non-technical, because that retrospection is actually LLMs are very good at that. Like finding all the patterns, pulling them out, and then applying that retrospection to just a couple of days or just like a short period of time. Is all a bunch of apps that I've built and launched a bunch of, internal tools. I use the new, GitHub Copilot app, the desktop app with workflows. Every time I crack open my laptop, it's running workflows for me. It's just a ton of different stuff and of course, it all ends up on, it all ends up on GitHub.Swyx [00:06:47]: Of course. That's where, that's where, stuff is hosted. Man, there's so much to ask you. I was going to leave the how do you run a company with AI thing at the end. I have to ask one— double click one thing. You said, you are looking back at the week. You're, you're understanding what happens. When you say we That's three thousand people. How?Rolling Out AI Internally: Skills, CLIs, and Company ContextKyle [00:07:09]: I think when we started rolling out AI internally beyond engineering, right? One of the things that I was really, passionate about is like we have to do this in a way where no one has to change how they work. I don't want to have to teach you a tool. I don't want to have to teach you something new. And so for us, we tried out a few tools. Most of them don't work because I got to get you on board? I got to teach you how to use it. What we've actually ended up doing is we've built like a set of skills internally. We have we each have our set of skills, and we've just been distributing even to the non-technical folks, the CLI. And then effectively, we're just giving it access to like read about everything that we're writing. So that's for us, that's usually GitHub, Teams, Email, and Slack. So Teams for, video chat, generally speaking.Swyx [00:08:03]: Teams and Slack?Kyle [00:08:04]: so we use Teams for video communication, but we don't use it for chat. W-we— GitHub for a long history, right? We're alwaysSwyx [00:08:13]: Also SlackKyle [00:08:14]: Talking about ChatOps and like everything is built into Slack. Like every command, every flow.Swyx [00:08:18]: So even though you have been acquired for I don't know, eight years nowKyle [00:08:22]: we stillSwyx [00:08:23]: You still use Slack?Kyle [00:08:23]: it's a purpose-built tool for us, and I think the reality is that moving off of it would be so bluntly expensive? Simply because all the tooling is, baked in with that paradigm. And they both have their pros and cons but they don't work the same way at all. We still use a bunch of different tools Because it's the purpose-built tools that We need. And thenSwyx [00:08:47]: Well, the same doesn't go for the rest of Microsoft, presumably.Kyle [00:08:50]: like the like various teams like operateSwyx [00:08:53]: They make their own decisionsKyle [00:08:54]: Various ways. I think it just matters what you're trying to what you're trying to do. But we do we do work across kind of every tool that we use, and then by giving everyone access to all of that context and the new WorkIQ MCP server, which is quite cool if you do live in the M365 like world. I can ask it all these backwards-facing questions, and it's incredibly important for our teams that are working remotely. There's a lot of stuff you miss when you're not in an office, and we are spread out all over the world. So most of that is looking back. And then we post, we post either auto-automatically into GitHub issues or discussions, these sorts of like findings or like our industry reports. Like what's happening this morning, today, yesterday. A little automation gets run. We'll use the app. We might use GitHub Actions like with, our agentic workflows just to go do that run, and then we push it into GitHub, and w-we keep having a conversation. So usually for us, it's about that sort of like looking back, looking forward on the non-technical side. And then of course for a lot of those folks, it's also building an app, pushing it to GitHub pages or pushing it somewhere to host it et cetera. But it's just like enabling everyone with that power of it's going to take me a week to figure this out. Instead, we're going “Okay I built a skill. Let's put it into a repo. We'll all share that skill together, and then we'll use the CLI or now the app-” “just to run it.”Micro Skills vs. Mega Skills: How GitHub Uses AI at WorkSwyx [00:10:26]: All right. I think, I think we're going straight into like the team management and productivity thing. I think a lot of people are getting various levels of LLM psychosis. How do you manage the bloat of skills? Like everyone Has their thing, and they're Like trying to promote it to the rest of their peers in their org, right? And obviously, whoever becomes a skill influencer internally becomes like an AI leader, right? Of sorts. I assume you have those.Kyle [00:10:50]: like I think we haveSwyx [00:10:52]: And I assume it's a mess a Yeah.Kyle [00:10:54]: there's like I— like I think the reality is there's two pieces. Like first is I think that we're ending the era of these like massive, beautiful, perfect skills that are just like not any of those things. ‘cause for a while, right every tweet every day is like go download the skills, the perfectly managed thing to do this entire workflow. And I think that like what we've found and what— I was just with my team, this week, and we were talking about the skill side, and we're really talking about these like incredibly micro skills that are just doing one thing for us very well Versus a skill that's going to do I said, that full report. That doesn't really exist on our side anymore. It's usually how do— like a single skill that's going to identify the most important marketing information given any MCP server. Like this is the most important thing. Less about stitch a bunch of tools together and have it produce this mega output because then weeks go by, months go by, things change, and you want to tweakSwyx [00:11:58]: It's brittleKyle [00:11:58]: Your mega skill and you're screwed? You can't do that. And so now we're really just talking about the Legos we're using and just letting the instruction book be something we're all putting together. Whereas I think a lot of AI skills for a while have been that mega instruction book style.Swyx [00:12:15]: I've, thought a lot about Postel's law. I don't know if that's a term that is, means things to folks. It's the idea that you should be liberal in what you accept and strict in what you output, right? And I think that's like a good framing principle for skills. This is my skills, obviously on GitHub. I feel like everyone should have like how like some repos In GitHub are special repos? I feel like we should sort of reify the slash skills and everyone like give it some kind of special presentation. Anyway, so, yeah, this is one of those like download Download anything, transcribe anything, and then you can string together the atomic skills that do one thing well Into like some kind of orchestration skill that calls other skills. I assume, does that match?Kyle [00:12:56]: I like I think so. I think that theSwyx [00:13:00]: Summarize anything.Kyle [00:13:01]: Like I think the- For me, summarizing something for I do communications and PR and analyst relations and marketing and customer activities, and so my summarize everything is very different for each one of those like Contexts. What ‘Cause if I'm summarizing something for an analyst, that's a very different thing than, probably how I'm going to summarize something for like a customer meeting or an engagement. So that's I think like the difference when we're talking about the like the tools I might use on Saturday or the skills I might use on a Saturday when it's just for Kyle. Yeah, those are kind of like they have an atomic actual tool underneath or maybe skill, and then Kyle cares about X. But I think when we're talking about work and enabling the the marketers, communicators there, it's the atomic, this is what good summarization is, and then this is what I care about as for marketing for communications For whatever. And that I think is like the interesting matrix problem when we go from like a developer set of concerns to all kinds of different professions, is that what that word means to me is different than it means to you is different than it means to the analyst or the salesperson, and that's where I think the matrix mess is that we're starting to like still starting to find. It's about these mega skills but they're all just slight permutations, but those permutations are really important. It's the difference between someone reading this and going “Did AI make this?” what Or “This makes total sense, and I would expect this when I'm giving a briefing to Gartner,” or like whatever else.Swyx [00:14:37]: I think the beauty of it maybe is that you don't have to be that careful about what goes in there. It doesn't have to exactly fit as long as it like roughly is contained in there. I used to complain about plugin hell, basically. Like when you have a framework and then you have a hundred things that you need to integrate, everyone does like the GitHub used to be bloated full of these things. And now we don't need them anymore ‘cause now you just use skills.Former Developers in Leadership: AI as a Creation MultiplierKyle [00:15:00]: And like I think the most magical thing is the just that like I can just also crack it open. Like Like yes, I could go like change the how the plugin is coded, or like I could go do that now with AI, but I think there's just something more magical about getting a response back and being “That's not right,” and then you just crack the skill open, you just type English words and it's different. That building block is just, I think very unique. Once I get everyone to kind of understand how to best how to best make those changes to get the most power out of them.Swyx [00:15:36]: Is there a— you have a your peer group that Of people like you. Is there a common framing for Something I'm feeling is, which is true, is that is this a golden age for former developers who are now in leadership? Because you can wield the tools, you would know the right words, you're maybe not too close to the details. Doesn't matter. But like you're more effective than someone who doesn't come from that background.Kyle [00:15:59]: I think that like the secret has always been your ability to identify patterns and solve problems, and I think that for folks that like myself that don't code day to day anymore, that has made me successful as a developer, made me successful as a COO and now CMO. And so now that I have access to get and write code, I'm now applying that sort of like pattern finding and problem solving, and I know enough still about how to then go and say, “Oh, I want to make an app, but I don't want to break into jail or create something that's not going to be able to work or to be deployed scale or whatever.” that ability to apply all that additional business knowledge and still code I think is what makes that so interesting to me. Slightly different than I think some of the other like technical leaders that became business leaders and now are going back to their apps and updating them. Good for them? But I think the more, much more interesting thing is, well, now I have this whole new set of expertise over ten plus years. Why not take that and use that as a developer with these AI tools? So I definitely think that makes me more powerful, but I think that's true for like every dev as well. Most of the dev friends I still have also have some other underlying skill and passion. There's really talented, very kind of linear computer science software devs, absolutely. I just find that the folks that came from a different career, went to school for something else, went off and did this random thing, and then became a software dev, or were a dev, did a random thing, came back. Learning that extra set of information, learning those extra skills, and now having the power of an AI where I can crank up fifteen agents on Saturday while my kids are doing lacrosse, That's like really powerful. And I think it gets me back to that feeling of like creation, and it's very hard to replicate that in most other senses? That first time you build an app and you click it and you show someone that's magical. And so being able to do that not just in code, but across all kinds of different assets that's, that's huge. We were doing we're doing our every year we do our revenue planning. We talk about okay, what is it going to look like for next year? And of course as you imagine, there's, slideshows everywhere talking about what are we going to talk about, what's the narrative, et cetera. And so as you said I'm “Okay, well, I could probably just like build something to build this and then that way I don't have to go build the whole spreadsheet or I have to pass it to my team.” So we went through this process, and I got all the information and used the skills I mentioned. I built like a little app just to make it so I could look at some of the information in a SQLite database, more easily. And I ultimately built this entire presentation without touching any of it and I was “Okay, I'm just going to present this to our CRO, the CFO, their teams,” without mentioning I'd built it with AI. I like built a skill to make it look very much not AI driven. Just not pretty.AI-Generated Presentations, Human Taste, and the Changing Chief of Staff RoleSwyx [00:19:03]: Like a design. Yeah.Kyle [00:19:03]: Not pretty. But just like very clearly not AI. Kind of like don't do anything interesting.Swyx [00:19:08]: That's, yeah, that is valuable.Kyle [00:19:08]: Just go Exactly. We did the whole thing through. It used my notes from Obsidian, it used all the context I mentioned before, the plans, and Never came up once that it was AI generated.Swyx [00:19:20]: It didn't matter.Kyle [00:19:20]: Never once. D It didn't matter. And so now I takeSwyx [00:19:23]: This is a toolKyle [00:19:23]: I can take that tool and go, “Look, I don't want you to go build slideshows.” They're just helping us share information with each other. If this thing can do it With a little bit of crafting from you and then we can look at it together, awesome. There's no value in all that extra work. I think that the ability to, make it look humanly bad and and build a little app to, manipulate the data I think is part of, that upside for devs that are now in leadership roles. Because, the thing that I feel like I said before, this that's all a people, that's all a people problem. I know if you've used a coworker or not to build a slide deck, unless you spent a bunch of time to not do it.Swyx [00:20:07]: I know, but like it was so, I think there's a certain charm to just being blatantly AI. ‘Cause I think that you're well, you're just honest about There may be mistakes here that I cannot vouch for. So how much value is there? But anyway I think, actually the real question I want to ask is, there's a— You were a chief of staff To Thomas. And in the pre-AI world, the that job would've been a chief of staff job of like Can you prep me these slides and all that? And now you do it yourself.Kyle [00:20:35]: I still, I still have a chief of staff. Because, the difference is it's sort of the discussion every time we have some sort of technology evolution is it's not that the jobs the roles don't all go away, they just change? And so yeah, I don't have someone spending all their time building out slides for me and presentations ‘cause I don't need that anymore. But now I need that person that is able to go and find all the different connections between humans in those discussions to help me find out, okay, I should be meeting with this group and this team, and they have an opportunity, and I'm going to be in San Francisco today, I'm going to be in Seattle tomorrow. Those sorts of human connection aspects are still incredibly valuable and has always been a big part of that chief of staff role. But now just like chiefs of staff are not opening up, letters to process, they're doing emails. What It's the same thing. And now they're, they're not building out as many of these presentations because they have the the ability to have a AI take it on for, and share that with me and great. Let's keep moving ‘cause it's allowing us to go faster and make better decisions more quickly.Swyx [00:21:45]: Awesome. Well, so we can dive into more sort of, Productivity insights as you go. I did want to do a little bit of a brief history of colleague and hub. Because, we started here. And then you also involved the NPM acquisition. I did, I do want to touch upon that. And then more recently, I just want to bring up to present day where we're having uptime issues Which transparently we've already Addressed publicly, but we'll, we'll discuss in the pod. Did I miss anything? Like what, any other major highlights? Obviously, it's, it's a lot of years to cover.A Brief History of GitHub: Webhooks, Actions, Acquisitions, and Platform EvolutionKyle [00:22:15]: No the I think one of one highlight was right before the acquisition closed in twenty eighteen, I got to launch the first version of ActionsSwyx [00:22:27]: OhKyle [00:22:27]: At GitHub Universe. So it was OSwyx [00:22:29]: They're that young?Kyle [00:22:30]: It was October of twenty eighteen, I think. Yeah. Yeah.Swyx [00:22:33]: Gee, Jesus.Kyle [00:22:34]: I got to I was the engineering leader on that project and got to launch that. And then, yeah, we did acquisitions of NPM you said, Semmle, Dependabot Pul Panda a whole bunch of things. That was a bigSwyx [00:22:47]: Pul Panda.Kyle [00:22:48]: Abi is doing well.Swyx [00:22:51]: DX. Holy crap.Kyle [00:22:52]: Did well on DX. I and like that was a that was the big shift, after the acquisition. I had to join the sort of business side.Swyx [00:23:00]: So I need to hit you on some of these things ‘cause you were there. Right? And how often do I get to talk to someone who was there? But yeah, Actions. Is that the number one source of security issues on GitHub?Kyle [00:23:11]: Oh, sh I think that the number one source of, security issues is probably like all, the literal code in everyone's like underlying repositories. I would say back further than that is, if you remember I had to show in this graph was this is, I'm, didn't say this before, this is ultimately webhooks.Swyx [00:23:30]: You yeah.Kyle [00:23:31]: Like circa whatever it was.Swyx [00:23:32]: It says Hookshot in there.Kyle [00:23:32]: I forget. Yeah. Yeah, Hookshot's in there. And so like back then, it says GitHub Services. Do you see, it says Hookshot FE for front end, and then it says GitHub Services. GitHub Services back in the old days, right? You we had a repository that was Ruby code, and you could write any Ruby code in there, and then we would execute that On your behalf As a service, and then that way if an if you were trying to integrate with something, it didn't we would run it for you.Swyx [00:23:57]: And of course no containers ‘causeKyle [00:23:58]: No, ‘cause it wasSwyx [00:23:59]: Well, no containersKyle [00:24:00]: Twenty fourteen. And so there was some isolation obviously, but it was mostly the separations on the server level. That's like an example as long as the very old version of Pages, which ran on its own containerization infrastructure, not on Actions.Swyx [00:24:15]: Which like all-time great product.Kyle [00:24:16]: Pages powers the internet at this point to some degree. Those were places where like clearly there were no like issues like to my knowledge. But it was those things where I'm looking at and going “Okay, well we can't be running arbitrary Ruby code,” like on everyone's behalf. Then containerizing all of that up intoUh into actions now where yeah the containerization, is r-really good. The pinning most folks aren't pinning it the like to a particularSwyx [00:24:48]: ImagesKyle [00:24:48]: Sha, et cetera like their workflows, and so that's a big that's a big place Of pain for folks if they're just doing similar to any dependency management, just V1 or newest or latest, I think. But, that journey from that day to “Okay, we're just going to run all this arbitrary code, and, it'll basically be okay,” to now, no, we have, really good containerization. We have a new, underlying, ag-agent, containerization, service. It's like we're using it under the hood. It's through Azure. They recently announced it. The Azure, Dev Compute, but it's, very fast, very fast compute to be able to, spin up your own cloud agents, or whatnot. We're using it under the hood for some parts of the new,Swyx [00:25:36]: Microsoft Dev Box?Kyle [00:25:37]: No. Dev Compute, yeah.Swyx [00:25:41]: Hmm. Not finding it just yet.Kyle [00:25:44]: Oh, it's, it's in there somewhere.Swyx [00:25:46]: All right. Well, we'll cut that out.Kyle [00:25:47]: Sorry. But with, Dev Compute, you can, run, really fast, spin up really, small VMs really quickly, so you're doing a tool callSwyx [00:25:58]: Same conceptKyle [00:25:58]: Just do it containerize exact-exactly. So we're using that so definitely moving that direction to protect us from every every piece of code that we're ultimately running.Swyx [00:26:07]: look, that grows into the full SDLC? Code hosting was just the start and and then it's grown beyond that. Let's talk about NPM may-maybe ‘cause I think that's also, a very major point in the industry. I do think, it was looking for a home. It was, kind of struggling as a business, right? I don't know, I don't know how you would characterize that whole acquisition and how itNPM, Package Security, and Keeping the Internet RunningKyle [00:26:33]: like when we were talking to the team, I think the big thing for the both of us was to find a way to keep NPM, which was basically powering the internet then and way more so now to some degree running. Keep it going keep continuing to scale. It was having scaling problems, if I recall, back at that time. They were doing some rewrites. ItSwyx [00:27:00]: that's cute compared to now.Kyle [00:27:01]: Well, that's the thing is like when I'm talking to folks now, there's there's so many more underlying uses of NPM than there were back when we had them join in with GitHub. But that was ultimately the goal. It was really okay, we used to have pages. We have, the world's code. Let's make sure that we can keep NPM running well for the world. And we put a bunch of time and investment into fixing some of the underlying backend, changes, some of which we talked about some of the manifest work, et cetera. And then now, really trying to bring the the security posture of NPM up to speed. But, it is a unique challenge in that every move that we make to make it more secure will break a lot of people. And security is paramount. And also, we take it very seriously. We're, the any time that we have a problem with GitHub or we make a change that makes us more secure but hurts, there's, a snow day for developers or a really bad fire that they have to go put out. And so we've, have changed the 2FA policies. We've changed the way the tokens work. When we find tokens that have been exposed or potentially, exposed, we invalidate them, andSwyx [00:28:22]: I love that feature in GitHub. Yeah, it's greatKyle [00:28:23]: That creates issues, but, the but that's the thing is we're trying to push the community, forward without necessarily, doing something that is going to break the contract that's been for 15 years or close to it or some amount of years on NPM.Slop Forks, Vendoring, and the Future of Open Source Supply ChainsSwyx [00:28:43]: I think the— So now we're talking about, open source and publishing. And I think there's something here with what people are calling slop forks, which, I think Malta from Vercel is doing. And, part of me thinks, well, the way to get past any vulnerabilities, we just, let's just get rid of the concept of NPM. And we only publish source code. And anytime you want to import it you have your coding agent look at it and then adapt whatever subset you're going to use into your vendor it. But, the AI vendor it. Is that realistic? I don't know. Is it— Will that solve all our security issues? I don't know.Kyle [00:29:24]: I don't think it'll solve I so Mitchell was just talking Mitchell Hashimoto Was just talking about this today, and I think that I-in some ways, it's all all things, old or new again? Yeah, absolutely vendoring everything. Like I do I do remember twenty thirteen, twenty fourteen.Swyx [00:29:42]: This is Yeah. Let's, we must return toKyle [00:29:43]: That's what is We were vendoring everything. We were having actual discussions around, or at least I remember we were “Should we take this full thing?” “Why is this so big? We only need this one file.” And so I do think there's something true there where having either taking only what you need or the dependencies just getting incredibly small over time, I think will help to some degree, but it's not going to solve the fundamental problem, I don't think, because the vulnerabilities in an agent looking at them, there's time and time again, there's a million different ways in which we can convince an agent that this thing is, secure or not and pull it in. Or we can do static code analysis or runtime testing to say whether the code works or not. That is, I think, the step that needs to continue to be, invested in. The question is just on, how much scope. Should it be this enormous project that I'm pulling down, or should it be this piece? Either most companies are running some amount of security checking on the on the packages that they're bringing in or vendoring. That I think won't change. That's like what advanced security does to some degree, Socket does some degree. Like everyone is doing a piece of that. How we each do that like especially when we're talking to enterprise customers, is just like very different. No there's no one wants one single way to do it. And I think that's always been GitHub's, unique position in the world. I talk a lot to maintainers, I talk a lot to folks about this. It's we're— we rarely start like a process and a practice and like push it onto the community. We usually wait for the sort of like RFC process socially or literally, everyone agreeing, and then we'll cement something in. Because otherwise we'reMaintainers, RFCs, Vouching, and the Social Layer of TrustSwyx [00:31:35]: That fits your role in the ecosystem, yeahKyle [00:31:36]: We're GitHub. Yeah, we don't want to shape the whole thing. We want it to be figured out. But like how do you balance that like sort of Role in the industry to keep everything as secure as is possible and make sure that you're you're not going to be compromised as a human, ‘cause that's usually how it all happens. And Not not create a process or lock us into a flow that you're not going to or like Mitchell's not going to or other open source projects aren't going to like. That's always been a tricky balance for us, and I think that's something that we haven't talked about enough is we're not going to be able to fix everything for everyone in a way that everyone is going to like. So tell, help us, tell us what is working. When Mitchell was talking about, the Upvote, the upSwyx [00:32:22]: I was going to bring up his thing. Yeah.Kyle [00:32:23]: I forget what it Yeah. When he's talking to us, I was chatting with him and talking to him about this and I put it on Twitter and we talked to, also over DM, was “We're going to keep working.” but I think the important thing is I do actually want to hear what isn't working for you. And as, be as specific and clear for your project as is possible. And to every piece of credit over the many years that we've known each other through the industry, he's always done that and I appreciate that ‘cause there are places that we need to fix up, and we hear from him, and we'll fix up just like we do all other kinds of maintainers. But that that process between making those types of improvements and being more secure and like creating, I forget what he calls it's not the proof process, not the claims process. Do what I'm talking about? He has that he his projects have a way for you to kind of like,Swyx [00:33:13]: VouchKyle [00:33:13]: Vouch. Thank you. Yeah. He has like the vouch system for saying, “Hey, you should accept my PRs.” That's beenSwyx [00:33:20]: I just built this into GitHub. I don't know.Kyle [00:33:22]: Well, see, but that's the thing is that you say that and like he and his community really likes this and then I'll go talk to other maintainers and other maintainers, globally, and they're “No, this doesn't work for me.” And that is the tension, but also the kind of beauty of GitHub, depending on which way you look at it is we want to help maintainers, so we create all these tools to let you have more control over how much you take in from AI and PRs. But you can also use this. What You can go use this project, and if it takes off and becomes the kind of mostly standard, then yeah, we probably wouldn't enforce it but we would add it in because that's the flow that we tend to do?Swyx [00:34:02]: I hear a lot of people don't know the history of the pull request. And like like that's how, that's something that GitHub standardized basically.Kyle [00:34:08]: Yeah. It was a very messy process Like beforehand, and now the we have the benefit of it being the process? And now we have to go and Figure out the next best process or what adaptations change, or what does a pull request look like when eighty percent of your PRs are just coming from your agents and not From other devs?Swyx [00:34:31]: Do you like the prompt request idea from Peter?Kyle [00:34:34]: like I think that for each like each idea I think has its merits. I'm not, I'm not avoiding saying anything good or bad, but I feel like I've seen a version of we have that we have entire Thomas' store. Take all the assets of what you've built and put that in. I think that's got great ideas. There's all these various permutations of the PR flow, but I think the reason why there's not a single answer is ultimately we're trying to codify trust. We're trying to say “Okay, if Sean reviews this I'm going to trust it because you're Sean or you're the senior dev or you're the whatever.” And right now, when we are working in a flow where an agent writes code and another agent reviews code and then Kyle goes and looks at it the trust is kind of diffuse. And most of the tools that we're talking about are talking more about verification flows. We have more assets to look at, so I can probably say whether this is a good PR or not. But that still doesn't solve, I think, the human problem of I'm looking at a PR and I want to know if I can trust it. And we're still, we still tend to use human signals for that? Mitchell approving it or Kyle approving it or whatever. And so I think that's, I think that's why most of these options haven't really solved it is because, it's a social problem ultimately. It's a it's a human problem to review it and agree. Or you fully trust the tool and you're imbuing that tool with full trust Which I think in some cases that absolutely exists.AI-Generated PRs, Trust, and the Waymo AnalogySwyx [00:36:08]: And so like in the same way that there will be a tipping point in society when we don't allow humans to drive anymore Because machines are measurably better than Than humans. I'm looking for that tipping point, right? Like Mythos is ridiculously expensive. Someday we'll have Mythos on a desktop. I don't know. Will, does that change the equation?Kyle [00:36:30]: I think it's more I took a Waymo here, and I was on my phone and not looking around at all. There are other, self-driving, vehicles that I would not trust while, staring at the road. And I think that trust is something that isSwyx [00:36:48]: Is this a Zoox thing? What is itKyle [00:36:50]: I think that is both. I think that is both. LikeSwyx [00:36:53]: There's Zoox in this robo taxi. That's it. It'sKyle [00:36:56]: Well, depending on what level Of self-driving. But, my point is sort of that I think part of that is I strongly believe that's, a mixture of verifiable proof. Like how many accidents, how much data, and so on, and the human aspect of how I feel when I'm in this car, what it tells me, et cetera. And so that's why I think some of the like Some of these some of our AI tools tend to, imbue me with more of that feeling of trust, even if the data says this is 100% accurate. I feel like it takes more time for us to go, “Should I trust this or not?” And that's in the soft sense of, startups with high agency, weekend projects, and open source. And then there's enterprises and regulated industries and everything else, and that is an even harder problem to go solve because even when it is fully verified, not only do you have to have trust from the humans on the team, you probably have to have trust from multinational,Swyx [00:37:55]: Oh my GodKyle [00:37:55]: Multi governments around the world and regulating agencies. And so that's where I feel like until we tip over to your point on the sort of like human EQ side of it. I feel okay this feels okay I've been proven enough. Then the ball will start to roll a lot faster, where we'll end up getting to the “Okay, we can trust this,” and feel good about it in the Most difficult of cases.Reputation, Sponsors, Stars, and Bot Activity on GitHubSwyx [00:38:18]: If human trust is the thing that matters, I feel like GitHub as the developer social network could maybe do more there. Like vouchers are one system But, we have star counts, and then we have Contributor rights, and that's it. And I feel like there should be more in that space. I don't know if there's any other design decisions there.Kyle [00:38:37]: I think that one of the places that we don't really expose right now in this sort of way is, some degree of like hard trust and support, which would like for me is like sponsors is a good example of that.Swyx [00:38:49]: Ah.Kyle [00:38:49]: It like costs you something. To prove that I believe in your project and I trust you To some degree or I want to support you at the very least.Swyx [00:38:56]: Solve payments for open source. Why not?Kyle [00:38:58]: I think that I think that like as we keep moving forward, right, there's more and more projects where I'm, adding more and more dollars into sponsors personally because I want to like support them, but I also like know of I've probably never met them in person, but, I know of enough of their work that I want to support them. I think the thing that I don't love about stars or commit counts or anything else is ultimately, even with all of the various, abuse and de-spamming and deduplication work that we do or anti-abuse work that we do, these are all, not active social signals. They're passive ones that are ultimately gamifiable. And you may trust me, but another open source maintainer may not. And on what heuristic should you be, trusting me? That I think, is kind of where some of our thinking is right now. What signal from me is most important to you? You— If you can define that potentially, honestly in an agentic workflow that's what we see some of these open source projects do, where you have GitHub actions, and then you have like an agentic workflow that's calling AI, and you're setting these rules. Like if Kyle has submitted and gotten accepted PRs across any given project and has a social handle tied to his account in GitHub, and that social account's older than a certain amount. Really complex measures that matter to you ‘cause most open source projects have that heuristic built into their heads, if not written down in the contributing guidelines. You could take that and then go apply that and then just say, “Oh, we're not going to accept this PR.” Building something that is, I think, malleable to everyone's needs, is a little bit better, rather than going “Hmm, this account's too young.” Because what happens? The attackers just go and go and create a multitude of accounts, and they wait Until it ages up. Needs to have a certain amount of stars. That's how star inflation happens. Need to have a certain amount of reposSwyx [00:40:46]: Oh my God. YeahKyle [00:40:47]: With PRs. They all just create repos and submit PRs to each other, and then they come in and do something nefarious. And so, it's hard. It's hard to find the measure. So I think we're, we're looking more at how can we provide you tools so you can kind of choose what's best for you. And of course, we'll give you some standards. But the trust vector, gets down to I don't know, some version of like human digital ID like everyone's been talking about. Like how do I prove that it's meSwyx [00:41:13]: Give me your eyeballsKyle [00:41:14]: On the internet. Give me your eyeballs. Exactly.Swyx [00:41:18]: The I got to keep moving on Topics, but obviously I can go all day on this stuff because, I've been involved in GitHub and open source My entire professional career. Stars. Very superficial. Everyone knows it. But I think time to one hundred thousand stars is the fastest I've ever seen. Like people just reached that in I don't know, months. And then like at the same time I don't trust it right? Like how many of these are real or bot or like whatever. I don't know how to ask this but like what can we do about it? LikeKyle [00:41:49]: JustSwyx [00:41:49]: Is stars broken? Is stars fine?Kyle [00:41:51]: I think that there's kind of two, there's like two pieces. Obviously we're constantly like trying to find ways in which like your users are producing spam, which would, I would include like be like only doing star gamification. When we find them, we pluck ‘em out and we,Swyx [00:42:08]: But it's like a Whac-A-MoleKyle [00:42:10]: It's a hundred percent like a Whac-A-MoleSwyx [00:42:11]: There's no wayKyle [00:42:11]: Now, powered by AI to be helpful. But I think more so what I'm seeing is, a lot of the like fastest time to X tends to be because we're now inviting so many more people into like software development on GitHub That like the zeitgeist is just swarming? And it'sSwyx [00:42:32]: It's not just developers anymoreKyle [00:42:33]: And it's not you and I. Like like however you want to say like what a developer is it's not just folks who have been coding for a very long time. It's folks that have maybe started coding or only joined in since the AI era. And nowSwyx [00:42:44]: what's the latest Octoverse number? I know eighty million was my lastRem- member that a number of developers on GitHubKyle [00:42:50]: Oh, we're over 200 million now.Swyx [00:42:53]: Okay. Well, so you see?Kyle [00:42:55]: Like over 200 million developers now.Swyx [00:42:56]: But it's not developers, right? It's, it's people with a GitHub account.What Counts as a Developer in the AI Era?Kyle [00:43:00]: So, so this is, this is the biggest debate that I would say, everyone loves to have at GitHub at this point. From my perspective, right, I think that there's, there's clearly a difference between, professional enterprise developer and then developers. But I think that I think that the idea that we should be I don't know, splitting hairs or segmenting developers in the early era of software development is, not worth our not worth the time. SoSwyx [00:43:29]: When you get into gatekeepingKyle [00:43:31]: 100%Swyx [00:43:31]: What is a developer?Kyle [00:43:31]: 100%. ‘Cause I wasn't a developer when I started writing code? I was going toSwyx [00:43:36]: Oh, no. I made— I cloned a thing, seven years before I learned to code. And then I and then I wrote about my learning to code journey, and people Just called me a fraud ‘cause I had a GitHub account. And I'm “Well, no, I just use GitHub, but I don't know-” “I didn't know what I was doing.”Kyle [00:43:49]: I I remember that. I remember those sets of posts, and like that's, that's b******t. So I fight very clearly on the line of, if you create code, if you have an idea and you create it into some way of, I'm, I'm going to run it and use the app right now, you may still use AI in that moment, but that's okay. At some point you're going to do the next thing. You're going to create a big— You're going to have to learn about this database. You're going to fix a bug, whatever. We're all on some same journey, and those people are also hearing about the great new agent skill package or a new CLI tool or a new whatever. And those projects are going up because you want to be a part of this moment, just like I wanted to be a part of the Ruby community when Ruby was popping off when I started becoming a developer, and now I can just click the star button. And so I think that yes, there's clearly some amount of like spamming and game gamification that we're working against, but I really think we're just seeing this whole new cohort of folks that are moving from technology to technology because they're not working on a 20-year-old software application. They're working on a side app that they built on the weekend for their friends or for their new idea or whatever. And that's how you see these enormous charts going up and to the right with With stars.Swyx [00:44:59]: I think something that's remarkable is the persistence or, that GitHub extends to those folks. Usually when I see platforms go into a new audience, they usually have to, have like a second platform with a different name that wraps the main platform. But somehow GitHub has been able to sort of persist and extend, and it's friendly and whatever? So it's, it's nice.Spark, Low-Code, and Always Showing the CodeKyle [00:45:19]: I that's partially why I think as we've tried to move into I don't know, more like low-code-y things. We so we started working on Spark as like a way to, build an app and run it. I think that the reality is that we anytime we try to, kind of put even a veneer on top of it without when we put a veneer on top of something, we still always show you the code. That's kind of like a tenant. We're never going to, hide the code from you ever, because whatSwyx [00:45:52]: Why would you?Kyle [00:45:52]: That's, yeah, that's the whole point? However, I think that what we learned with things like Spark is that really the value of Spark for most devs is, easy runtime. And you may have a runtime or a host that you're going to use for that or you just build something and run it but, the package of making that even more simple isn't really needed for folks that are trying to build software and not just trying to build, an app, which is, slightly different, a slightly different goal. So I want to get you in, I want to get you comfortable. I think the best thing for me as, someone that did not traditionally come into software dev way back, I want anyone to be able to breach that chasm and not be in the I don't know, I feel like we're, we're still in an era of, STEM. I've got a 12-year-old and an eight-year-old, and it's “We got to get ‘em into STEM,”? Over and over. And I like I do, I do the things that good parents do. I was “Oh, you want to do coding?” “Yes, I want to do coding.” Do coding classes. But now they're just not afraid of doing software. And that's, I think, the thing that's honestly kept me at GitHub for so long. Anyone should be able to go and build a thing, just like I can go change a light switch in my house. I'm not going to go into the breaker box ‘cause I'll probably kill myself? But, I can go change that light switch. Everyone should be able to go and say, “This fricking app doesn't do what I want. I want it to work like this.” And that I think, is what's kind of kept us all connected with GitHub through the years and some and during the easiest of times or in the hard times because of that opportunity of, we're the home for all developers, and we want everyone to be able to have that feeling that we've had of, had an idea, I created it and holy s**t here it is.Swyx [00:47:37]: Here it is. All right, I'm going to try to do more spicy questions.GitHub's Hardest Scaling Moment: Growth, Agents, and UptimeKyle [00:47:42]: Great.Swyx [00:47:42]: Is it an easy time now or a hard time?Kyle [00:47:45]: Oh at GitHub? It's a hard time. Like, it's a hard time and also, I was just with my team and I said, “This is also, the best and most exciting time that I think I can remember at GitHub.” BecauseSwyx [00:47:57]: Best of times, worst of times. It's never oneKyle [00:47:59]: ‘cause we've we were talking about Octoverse reports and, usually we do an Octoverse report once a year, and we look at the numbers, and we say, “Oh my goodness.” I was at Universe in October saying, “This was the fastest year of growth that we've ever had,” right? And now we're doing more in a month than we did in a year last year.Swyx [00:48:20]: You're talking about PRs.Kyle [00:48:21]: Commits.Swyx [00:48:21]: Commits, yeah.Kyle [00:48:22]: PRs. Kind of like you name it by roughly every measure that we're looking at, there's some amount of sort of growth that is much bigger, and that is breaking our system in new ways, not old ways. Like webhooks were always notoriously, unreliable over the years?Swyx [00:48:38]: Whose fault is that?Kyle [00:48:39]: not anymore mine, but for a period of time, I'm sure you could pull up a tweet that was “It was me. I'm sorry.” but, now, that got rewritten at a scale level that is still working and is not having problems today. Now what we're finding isn't just the isn't the-The simple stuff that folks are on the sometimes on Twitter or on the internet are “Hey, why is this like this?” Sure. There's absolutely silly problems that we shouldn't exist. But now we're talking about, unique, novel permission problems that happen only at a scale across all different objects or whatever, that now we have to go rewrite this underlying system. And so it's, there are problems that yeah, caught us off guard, which I think I said. Like the growth is astronomical, but also we're making such material progress in that I'm excited once we're once we've kind of like reimagined the underlying foundation layer, or pieces of it at least, what's going to be possible when it's not just all of us and all the new people that are being developers and all of their agents and all the tools like working together. Because that'll still happen in that in that GitHub tool, that GitHub community. But it's a it's a hard day anytime we can't give you what you're looking for. We have the same problem internally. We operate through github. Com. Of course, we have backups when things go down and whatnot for our own operations but we feel it too. If it's not working it's not working for us, and that's kind of like the promise of dogfooding for GitHub. It's always been true. We're using the same tool you're using. We're not using a super secret version. We and so we also need it to be great for us for our customers of course for open source. And now an exponential growth of agents, Doing it too.Swyx [00:50:32]: I wanted to load for audio listeners who maybe haven't seen your tweets, whatever. So one billion commits in twenty-five. Now it's two hundred and seventy-five million per week on pace for fourteen billion this year, if growth remains linear. Is that still the pace? I don't know. It's been aKyle [00:50:48]: it's, it's speedingSwyx [00:50:50]: Roughly.Kyle [00:50:50]: It's still speeding up.Swyx [00:50:51]: It's, it's April, so yeah.Kyle [00:50:51]: Exactly. This was in April.Swyx [00:50:53]: All right. So basically you have fourteen x growth, right? Year on year on year. And I think that's a scaling issue. I think, I'm going to like try to really steel man this thing. People have experienced fourteen x growth. They haven't had your downtime. And that's like— C-can we go dig into that? Why? Like what's the— what broke? What are we doing to fix it? Like just anything for the community to reassure them.Why GitHub Reliability Is Breaking in New WaysKyle [00:51:18]: so there's a Like I was saying, there's a couple different places that we've seen the growth issues. Some of the growth issues, which is why we're t— I was talking about pushing hard on more CPUs is in actions in particular. More tools, more agents, more PRs mean more builds, more builds mean more CPUs. And so we are expanding through not just our data center, but obviously we were talking about moving to Azure and moving to, adding an additional cloud compute because we simply need more CPUs. Not as much GPUs. We definitely need GPUs too, but now CPUs are becoming a factor.Swyx [00:51:53]: It's very CPU heavy.Kyle [00:51:54]: Underneath the hood when it comes to some of the underlying services, we've been breaking up over the years our database infrastructure, so that way we have, more cognitive separation between our the various services. The place that we continue to have pain is in, permissioning. And so right now m-many of our permissioning layers sit into a database that we like internally call MySQL One, and old Hubbers will know what I'm talking about. And so we've been pulling things out of MySQL One for many years, because like and we use we use Vitess and we use other technologies to shard and we do it as one bigSwyx [00:52:31]: Famous thing, PlanetScale was born from this andKyle [00:52:32]: A hundred percent. Sam Old Hubber and friend. And so finding these opportunities to like break this out and then do that globally. The other thing that I think is interesting and both a unique opportunity and tricky is we also run everything I just talked about in a black box container with GitHub Enterprise Server for people that work on-prem. So we take everything I just said, and we also do it on-prem, and we also do all of that and we do it in a data residence setup for customers that need to have their data in a single location. Each of these has the unique characteristic around how we're sort of storing that data in MySQL or in a permissioning setup. That's where some of these outages have oc-occurred, where you're seeing it more like across the board rather than just like the one pieceSwyx [00:53:17]: Filling the databaseKyle [00:53:17]: Isn't quite working. Exactly. And so part of it is that. I think there's been some other places where agents are much more or more projects appear to be moving towards monorepo versus we were going the other direction for many years in the industry. Repos were smaller, but there were more of them, and now we're seeing the opposite. Repos are bigger, and there's, not fewer of them per se ‘cause there's new growth, but, we're just seeing many more big repos. Big repos, big monorepos have always had, a unique performance problem. Because each one, is slightly different if, particularly if the underlying blobs are incredibly big Inside the repos. And so we've done a ton of work that you pro— like most people haven't probably experienced, unless you're in this case of the monorepo. But that Git, infrastructure layer improvement does help the overall, system because, many of the improvements that make monorepos work better make all repo infrastructure work better. And so, I could kind of keep going down the line where it's another thing where we're moving out of, We're changing how we do j I'll just say job queuing for lack of a better, explanation changing the underlying technologies there.Swyx [00:54:32]: I spent two years being a job queuing guy, so.Kyle [00:54:34]: And so it's kind of a little bit of a little bit of piece by piece, and it's mostly because as we were— as it was built, we built everything in a way that assumed, I guess in some ways that the size of the pipe of work was going to remain the same. There's just going to be more people coming through each of those pipes. But instead now in places whereA git push was, generally a certain size for example, is now, no longer true.Swyx [00:55:03]: Oh, yeah.Kyle [00:55:03]: OrSwyx [00:55:05]: I push a thousandKyle [00:55:06]: On the average. 100%Swyx [00:55:06]: A thousand line commits like dailyKyle [00:55:07]: Same thing with PRs. Like PRs same thing. And like we've talked about optimizing that and making changes where, and there were technology choices that did not work there? And it got slow, and it didn't It was not fast. It did not do what the users wanted. And so we've been reeling that all out and going “Okay, that's just not right. Let's stop putting good money after bad and do it the do it the right way or the right way now.” So there's It's a it's a lot of things, not quite when I've experienced scale at GitHub historically, it's almost always two options that we've used. We go vertical scaling, particularly with databases, right? And we go horizontal scaling. Oh, we just have more people using this service. Great. We're going to add more servers, and we rack them in our data center, or we use it in a cloud. And now we're sort of in a like diagonal, where like vertical doesn't really work anymore. Horizontal isn't work either because we're all We all have some CPU or GPU constraints in the world now, and now we have to go in and like crack open services that have been running for 10 or 15 years and go, “Okay, the rules of this service have legitimately changed, and now we have to rewrite them.” None of this is an excuse. This is like we're We have to do the work. We have to make it better.Swyx [00:56:22]: actually as an infra guy, I'm “This is like one of the most fascinating scaling challenges I've ever seen.”Kyle [00:56:26]: That's that's, that's the thing that's the thing that it's hard for Like when we weren't talking about it publicly, and I was like I came out, and I was “Hey, I just want to explain what's going on.” Part of it comes from a very old GitHub ethos, which is it's our it's our uptime. It's down. W What I know you're a developer, so you're, you're inclined to want to understand more what's going on. But at the same time us going “Hey, this service didn't, perform the way we expected, and now we have to go change it,” we weren't We're not trying to hide anything from you i

god jesus christ ceo amazon learning trust ai english hollywood man pr growth future talk state building san francisco writing seattle stars microsoft dm holy universe chief hall of fame fortune chatgpt uber code goal drink os productivity coo id figure lego taste mac platform stem windows twenty spark solve cfo saas developers filling cmo slack golden age reputation pages api malta dit eq acquisitions brief history contributors linux github mythos slightly cro apis someday fleet stripe llm claw reliability azure underneath abi copilot prompt gartner versus commits gpu cpu agi sha gee obsidian waymo horizontal prs git gpus dx slop addressed sdks satya nadella satya mac mini mcp compute udo ci cd rui cpus 2fa summarize contexts v1 cli repos low code mysql github copilot vms externally microsoft build rfc npm socket zoox postgres retrospectives github actions sqlite sdlc m365 cvp pull requests enterprise security vouch postel sugo webhooks jeff dean like like upvote planetscale rfcs nat friedman clis chatops leadership00 kyle daigle no priors

Giving Agents Computers — Ivan Burazin, Daytona

Latent Space: The AI Engineer Podcast â€” CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later May 21, 2026 70:27

Take the 2026 AI Engineering Survey and get >$2k in credits and AIE WF tickets!On the product side, everyone is getting Computer - Perplexity, Manus, Cursor, and so on. Meanwhile on the research side, agentic evals like TerminalBench and GDPVal are also assuming computer (Harbor). On both ends, the consolidating LLM OS stack has become a standard toolkit, and Daytona is one of a small set of AI Infra companies that are booming because of it.“The end of localhost” has been Ivan Burazin's obsession for more than a decade.Something that is all too familiar…Long before agents became the default way people talked about software development, Ivan was already chasing the idea that development should not depend on a fragile local machine. CodeAnywhere, one of the first browser-based IDEs, was an early attempt at that future: move the development environment into the cloud, make setup reproducible, and free developers from the endless “works on my machine” tax.The thesis was directionally right, but the market wasn't ready yet.However, agents changed that. They do not care about a laptop, desk setup, or favorite editor. They need a computer they can access through an API: something stateful enough to keep working, fast enough to spin up instantly, flexible enough to resize, isolated enough to be safe, and composable enough to run the messy real-world workflows that real software engineering actually requires.Daytona isn't just selling “sandboxes” in the narrow code-execution sense. It is the latest version of Ivan's original localhost thesis.In this episode, Daytona's CEO joins swyx to explain why AI agents need more than code execution boxes: they need composable computers, stateful sandboxes, instant startup, dynamic resources, and infrastructure that can survive workloads going from zero to 100,000 CPUs.We go deep on the new agent compute market: Daytona's hard pivot from human dev environments to AI sandboxes, the New Year's Eve MVP that customers begged for, why Daytona runs on bare metal with its own scheduler, how one customer runs almost 850,000 sandboxes a day, and why RL/eval workloads went from 0% to roughly 50% of usage in just months. Ivan also explains why agents need Windows and macOS machines, why CLI may matter more than MCP, why Kubernetes is painful for this workload, and why the future AI cloud may look more like Stripe than AWS.We discuss:* How Daytona grew out of CodeAnywhere, Shift, and the “end of localhost” thesis* Why Daytona pivoted from human dev environments to AI sandboxes* Why agents need composable computers instead of disposable code execution boxes* The New Year's Eve MVP that customers chased API keys for* Why Daytona chose bare metal, stateful snapshots, and its own scheduler* How Daytona spins up one sandbox in ~60ms and 50,000 sandboxes in ~75 seconds* Why Daytona's biggest customer runs ~850,000 sandboxes a day* How RL/eval workloads create zero-to-100,000 CPU spikes* Why RL workloads went from 0% to roughly 50% of Daytona usage* Why customers compare Daytona against EKS/GKS and say they're “never going back”* Why every AI agent may need a computer, including Windows and macOS environments* The Apple licensing constraints that make macOS sandboxes hard* Why CLI gives agents more power than MCP* How open source helps agents integrate Daytona* Why agent-generated PRs may break today's CI/CD assumptions* Why AI SaaS companies reselling tokens may face a cold shower* Why the AI cloud may look more like Stripe than AWSIvan Burazin* LinkedIn: https://www.linkedin.com/in/ivanburazin* X: https://x.com/ivanburazinDaytona* Website: https://www.daytona.io* X: https://x.com/daytonaioTimestamps* 00:00:00 Hook* 00:01:12 Introduction* 00:03:15 CodeAnywhere, Shift, and the end of localhost* 00:05:58 What Daytona is: composable computers for AI agents* 00:08:07 The pivot from dev environments to AI sandboxes* 00:10:17 The New Year's Eve MVP and customers begging for API keys* 00:12:56 Bare metal, stateful sandboxes, and Daytona's scheduler* 00:17:28 60ms startup, 50,000 sandboxes, and 850K daily runs* 00:21:53 Spiky RL/eval workloads and the new agent infra problem* 00:28:12 RL workloads, Kubernetes pain, and dynamic resizing* 00:33:31 Why every AI agent needs a computer* 00:38:48 macOS sandboxes and Apple's licensing problem* 00:44:28 Why CLI may matter more than MCP* 00:48:11 Open source, GitHub stars, and agent integration* 00:53:11 Git, CI/CD, and agent collaboration bottlenecks* 00:58:15 Founder life and building a 25-person infra company* 01:02:44 AI SaaS, token resale, and API-first business models* 01:06:10 GPU sandboxes, data centers, and compute growth* 01:09:48 Why the AI cloud may look more like Stripe than AWS* 01:11:26 Closing thoughtsTranscriptIntroduction: Daytona, CodeAnywhere, and the End of LocalhostSwyx [00:00:02]: Okay, we're in the studio with Ivan Burazin, CEO of Daytona. Welcome.Ivan [00:00:07]: Thanks for having me, man.Swyx [00:00:08]: Ivan, you and I go back.Ivan [00:00:10]: Way back.Swyx [00:00:11]: How I don't even know how, you found, did you reach out or, for Shift.Ivan [00:00:17]: I reached out to you. The reason was you - we were just - we were thinking about I was one of the co-founders of CodeAnywhere, the first browser-based IDE, and so we were thinking a long time of, localhost should die. And you had this article.Swyx [00:00:29]: End of localhost.Ivan [00:00:30]: Then I reached out to you because of that, and then we talked, and I was actually at a different job and learning about I was the head of, developer experience, and you were quite well-versed in that, and I actually reached out to you, among other people, how do we go about that? What are the key things and whatnot at this point in time? And you were nice enough to take the call, and I remember I was late on your call with you.Swyx [00:00:51]: I don't remember.Ivan [00:00:52]: I remember because I was with my then I'm thinking of a girlfriend or wife at that point in time, I'm not sure. It's the same person, so that's great, and I was late ‘cause we were, in, Italy on, vacation, and then I was late for something. I felt so bad, and you were so nice to be, good about.Swyx [00:01:10]: The reason I'm nice is because I'm also late to other people, so it's like, who's, who's without sin here, yeah, so I have to, for those who don't know, InfoBip Shift, there's this whole thing that, you did in the past, and, and that was basically one of the inspirations for me starting AI Engineer, which is like, I have to thank you for giving me that push to be like, “Oh, you can, you can build and sell conferences?”Ivan [00:01:34]: I remember you asked you asked me at the beginning to give me advisory shares, and I was so focused on what we were doing, I said no, and I should've took the advisory shares. So I'm sorry, dude. But anyway.Swyx [00:01:43]: We're not, we're not venture backed.Ivan [00:01:44]: No, it doesn't matter.Swyx [00:01:45]: It's Yeah, anyway, so I think what's impressive about you is that CodeAnywhere is the thing that you've been trying to build, and, you kind of put it on hold and then came back after InfoBip. Just give us the story, do you - the story and the origin story, going into Daytona.From CodeAnywhere and Shift to DaytonaIvan [00:02:05]: Sure. Like, really way back, me and my co-founder have been together. I say this, I've said this multiple times, it's like we were married and divorced and married. Some people actually ask me is my co-founder my partner. they thought it literally. It's not literally, but we have done multiple companies together, and to your point, we had this shift where we went from the CodeAnywhere to the conference called Shift, and then back to, Daytona. We originally started stacking servers, doing like virtualization in the early 2000s and, routers and doing basically all these things, at a foundational level, and that was a services company which we sold to focus on what my co-founder actually invented, which was the very first browser-based IDE, right, I say the first. Before us was actually Heroku. They did it for a very short time until they became Heroku. But outside of them, we were the only one, and it was called.Swyx [00:02:55]: There was Cloud9.Ivan [00:02:57]: Cloud9 came out slightly after us. There was Replit, which came out when we stopped doing it, Replit came out, and they have been successful since then, which is great. There was Nitrous.io. There was quite a few that existed at the time, but it was like too early. But the interesting part is that we, at that point in time, because there was no VS Code, there was no Kubernetes, and Docker had just started when we Or I'm not sure if it was even public at that point in time. And so we had to build everything to the whole stack ourselves and that was the key learning that we brought into and that we've been using in Daytona today. So it was super early. There's about 3 million people used CodeAnywhere. It was slightly, it was angel-backed more than venture-backed. We ended up paying everyone back because it didn't have that sort of scale. But, three years ago, we started something similar with Daytona, which is not what we are today, but it was automating dev environments for human engineers, the basically the underlying stack of CodeAnywhere. And then we did a hard pivot last January to sandboxes. And so here we are.Swyx [00:04:01]: Historic pivot, yeah, and, it's one of those things where, I had independently invested in CodeAnywhere, but also in E2B, and then both of you pivoted into the same thing, and I'm like, “F**k.”Ivan [00:04:12]: You invested, you invested in Daytona. You invested in Daytona. But you were the first If we had not got your check, we wouldn't have done it.Swyx [00:04:18]: No way.Ivan [00:04:19]: No, it was like, “We have to get him on board first,” and you were that kicker that we, that got us off the ground.Swyx [00:04:23]: No, because you were putting me on your pitch deck, man. I was like, “Man, this is like a good trip if I don't invest.”Ivan [00:04:29]: That's because it was your quote. It's like we.Swyx [00:04:30]: Yeah. It's the end of localhost.Ivan [00:04:31]: Did a bunch of research about end of localhost and who was interested in that,.Swyx [00:04:34]: No, that's like, I put, I wrote that blog post, and every single company in that field reached out to me, and then every VC who was receiving those pitches then also had to call me and, talk it, talk through it with me.Ivan [00:04:47]: It's finally happening though.Swyx [00:04:48]: It was really super interesting.Ivan [00:04:48]: It's finally happening.Swyx [00:04:49]: It's finally happening.Ivan [00:04:49]: Yeah, it's finally.Swyx [00:04:49]: It's finally happening, with maybe sort of non-human users. Yeah, so what is Daytona today? Let's get like a quick description. I'm wearing the shirt.What Daytona Is Today: Composable Computers for AI AgentsIvan [00:04:58]: You're wearing the shirt. Yes,.Swyx [00:04:59]: It says, I think your branding is very good. Like, it's very consistent. It runs AI code. Like, it cannot be simpler.Ivan [00:05:05]: Exactly, but we're gonna probably have to change that.Swyx [00:05:07]: Oh, s**t.Ivan [00:05:07]: It's also a subset of what we do. Unfortunately, we really love this, Run AI Code is super simple. People interpret it different ways. I think we've given out 5,000, 6,000 of these shirts. People wear them with pride because it doesn't really market about us.Swyx [00:05:21]: Yeah, Daytona's on the back.Ivan [00:05:22]: It markets the back. It markets to the person itself, so I think we did a really good job on that one. But it is also a subset of what we do, because people, when they think about Run AI Code, they just think about these small, let's call it isolates, code execution boxes that, you send some code, you get an output. Whereas what Daytona is today is essentially composable computers for AI agents. It is, the market calls them sandboxes which can be misleading.Swyx [00:05:44]: All these things. All these things on.Ivan [00:05:45]: Yeah, exactly, ‘cause it can be misleading ‘cause people usually think about sandboxes as a demo or a test environment versus a production-grade environment. But what Daytona does, if you think of the laptop that you have in front of you or the computer that's over there, or, my wife is an architect, so she has like a Windows with a 3D graphics card inside to do 3D rendering. Like, as humans, we have different computers or different compositions of computers. And our belief is strongly that agents today and going forward will need all these different compositions of computers to do different types of tasks. And so we offer that basically through an API.Swyx [00:06:19]: Yeah, to give people - I'm trying to sort of front-load all the aha moments or the wow moments so that people can, stay engaged and click like and subscribe. the market is exploding, right? Like, you have been reporting 74% month-on-month growth, and it also, it's just been growing for a while. Like, it's been going like this. And every single - It's not just you guys. It's every single.Ivan [00:06:41]: Everyone, yeah.Swyx [00:06:42]: Sort of, compute provider. I don't know if you agree with me saying compute provider or not.Ivan [00:06:48]: It's fine.Swyx [00:06:48]: Yeah. So like organically PLG-driven growth, but also enterprise is doing super well, I think I wanna rewind to January of last year when you did the pivot. Like, so you obviously called this market early, and you were positioned for it, and you are now one of the market leaders. But what was the insight that made you do the pivot?The Pivot: From Human Dev Environments to Agent SandboxesIvan [00:07:06]: The insight that made us do this pivot is the quarter before that, so end of 2024, when we had - Basically, we did a demo with - I don't I think we discussed this as well, Devin was not public. You actually gave me access to Devin at that time. So Devin.Swyx [00:07:25]: I did?Ivan [00:07:26]: Yeah, you gave me access.Swyx [00:07:26]: I don't think I was supposed.Ivan [00:07:27]: Yeah, exactly.Swyx [00:07:28]: Yeah, I.Ivan [00:07:28]: So it doesn't matter. You.Swyx [00:07:29]: Yeah. I gave like three friends access.Ivan [00:07:31]: Yeah, or it was a call and you showed it to me. It doesn't matter. but OpenDevin was available, which is now called OpenHands. And so we're like, “Oh, this seems to be a thing. This is not public. Let's take our for human automation of dev environments and take, OpenDevin and launch that as a SaaS.” And we did that. Not very many people signed up and used it, but a lot of people reached out that were building agents, and they were like, “Hey, my agent needs a compute sandbox runtime,” whatever you wanna call it. I forgot what it was called at that point. And then we were like, “Oh, amazing. This is a new market. Here is our infrastructure. Here's our product, and go.” And what we found really fast, soon, was that people did not like what we had built. It didn't work. And I remember talking to people at the beginning when we're doing this, the sandbox we're building for agents. People were like, “Oh, why is it different? It's the same thing. We have like EC2, we have VMs, we have all these things.” But we saw that everyone we gave it to, it was like 20, 30 people, they all said, “No.” Like, “This is not what we need. This sort of breaks.” And basically, me and my co-founder not knowing a lot about - ‘cause we're infra people. We're not AI people. So I basically took it upon myself to like watch every single podcast that exists, including all of, all of these and all that, and sort of get up to date, read all the blogs, like get, understand what's going on.Swyx [00:08:45]: Do you wanna shout out who else was useful, just in case people are also looking.Ivan [00:08:49]: Generally we -, I looked at There's a few of podcast, different segments and different types. So there's you guys, No Priors, Bill Gurley's was great while.Swyx [00:09:04]: VG2, yeah.Ivan [00:09:05]: Yeah, while it was around. So there's a few. 20VC is interesting from a different dynamic, and some are different dynamic. But there was, also Red Points.Swyx [00:09:14]: We're not really about the compute market.Ivan [00:09:15]: It was also already - Sorry?Swyx [00:09:16]: You're, you want - You're looking at the agent infra market.Ivan [00:09:19]: I was looking at the agent market and the AI market in general and sort of understanding who are the players, what the perception, and how that goes. And like obviously you complement this with like going to conferences, going to events, going to meetups, reading white papers, like doing all the things that you have to do to understand what's happening. And so when we figured, when we sort of had an idea of what we had to build, literally over the New Year's Eve, literally on New Year's Eve, I half vibe coded the first MVP, first minimal viable product of what Daytona is today. And I went to sleep at like 3:00 AM or something like that. I was doing - I just put my like baby daughter and wife to sleep and, Happy New Year's, and go back to just, doing this. And I sent it to my co-founder, my CTO, and he saw it in the morning. He's like, “This is absolute garbage.” “Do not show this to anybody at all, but the idea is good.” And so he took two weeks, and he rebuilt it.Swyx [00:10:09]: Did it like look like that? Listen, I - It was rough idea.Ivan [00:10:12]: Oh, not even, not even close. Like it was it was way worse. But it was like a very - It was a simplistic view of what it should be. Like, it worked, but it was not ideal. And so he went, we went down the whole, which is his job as CTO, to go, and he came back with this version. We then called all the people that had said like, “This is garbage,” a quarter ago. And we set up these calls, and we gave it to - We just demoed it to everyone. And all the calls went long, every single one. They were 15-minute calls, and they all went to like 25, 30 minutes or whatnot. And everyone said, “We need, we want access.” There was no login, just an API key, ‘cause it was just a beta or an alpha. And they said, “Oh, we want access.” And we're like, “Sure, yeah. Okay, thank you very much.” But after like the next day, if we'd not send it, every single one, like every call that we did, everyone came back, “Where is my API key?” Like everyone wanted it. We're like, “S**t.” Like this is it. Like I've never felt So one, the understanding to your point was like most people thought it was the same infrastructure for humans and agents. We understood a quarter ago it's not. We just didn't know what was the right primitive. And then when we came, and we can talk about what that is, and we gave it to these people, I've never seen, I've never experienced - I've done multiple companies in my life. I've never experienced this, that people literally call you if you do not give them access. Like they want access right now. And so it's like, okay, they don't want this. the thing that they want doesn't seem to exist, or they have not found it, and they really want what we want. And then when we understood that we're onto something, and then when you think about the size of the market, like the market for human engineers and enterprise is a very large market, so think GitLab or whatnot. But the market for every single agent that will exist ever in the future is just like, what is that market? How big is that? And we're like, “We are all in on this.” And so that is where we made sort of the cut between the old product and the new one.Bare Metal, Stateful Sandboxes, and the Lambda + EC2 ModelSwyx [00:12:02]: Yeah. But it wasn't composable at the time?Ivan [00:12:05]: It was very - It was basically just a Linux box that you could change, that you could define number of CPUs, disk, and RAM. Like that is what you could do, but you couldn't have multiple operating systems, you couldn't resize it on the fly, you couldn't add a GPU, you couldn't do like all the things. It was just the, just the first sort of variation of that, yeah.Swyx [00:12:22]: Was it bare metal from the start?Ivan [00:12:24]: It was bare metal from the start. And so the interesting thing that we thought about right away, so our.Swyx [00:12:29]: Which, give people the background, what is the normal path?Ivan [00:12:32]: Yeah, so, basically most providers run this on top of VMs. And also.Swyx [00:12:37]: Firecracker.Ivan [00:12:38]: Yeah, they run on Firecracker and VM. And so we also fire - We can get - We have multiple isolation layers and we can do that. But the common way to do it is that they, one, that the state of the machine, or the hard disk is not part of the sandbox itself. And the other thing is they're not meant to last forever. So most of them are preemptible, like they can There's a time that they can live. And so our thought was when we were going into this is, agents will be like humans in the sense of you don't want your laptop to be shut down until you're done with work. Like, and you want to close the lid and open the lid, it's the same state. So you - Agents would want that, like the pause and come back. They want those two things. But also agents really want speed, right? Can they get it? So when we thought about it's like we need something insanely fast, how to make it fast, how to make it long-running, and stateful. And so those two things, it's like combining a Lambda and an EC2, right? Those two things together. And so we didn't have an idea how others did it, ‘cause we didn't know too that there was a market around this. It was more like, okay, this is what we need, what they need. And we looked at Kubernetes, it wasn't wasn't good enough for that. We looked at Nomad, it didn't enable that. And so our history in rewriting our own scheduler at CodeAnywhere is basically what my CTO came up with. Like, he's like, “Oh, the learnings from there,” and he brought it. And the funny thing is, our third co-founder, when he saw it, he's like, “Dude, what is this? This is like 2008.” Like, we went back in time, and he's like, “Exactly.” And so the reason why Daytona is like super fast, and you see this on benchmarks, is we essentially, we run on bare metal. We have our own scheduler, we use the underlying, disk, CPU, and RAM of the underlying machine, which means your IOPS are insanely fast because there's no, there's no network between an EBS or something like that. But also the snapshot, the point in time, the templates, are also preloaded on the bare metal machines. So when you fire off a sandbox from a template or a snapshot, you're essentially directed to the bare metal machine where that snapshot is based on that NVMe drive, and then it literally just turns on that machine, and it's local. There's no network latency, anything on there. And so that is sort of the specificities that we, when we're thinking from first principles, what a computer would look like for an agent, that is what we came up with, and that's what we created.Benchmarks, 60ms Startup, and 50,000 SandboxesSwyx [00:15:02]: Yeah. I should maybe, I don't know if you endorse this, but there's someone that does compute SDK, you guys do very well on there, with like the TTI, right? I. is this a, is this a is this a relevant benchmark for you guys? I don't know.Ivan [00:15:16]: I don't know, and it changes every day. So today RKL is.Swyx [00:15:18]: I don't know what RKL is. Never heard of it.Ivan [00:15:20]: Yeah. RK, yeah, so it is there.Swyx [00:15:22]: You are, at least a third of the next tier of performance, and then, there's a lot of other better-known names that are very slow to start.Ivan [00:15:31]: Yeah. We've been the number one by far for a long time, and now there's different, there's different definitions also of sandboxes, different isolation patterns, different other things. So RKL runs it literally on the S3, the data, so it's very different, and they spin up a sandbox, spin up a container for that, so it's a different type of thing. So the definition of a sandbox is something that we can all, we all need to get along with. But yeah, we're insanely fast on getting these things, up and running. And so you can see even there that it's a zero point 0.10 to 0.11, so.Swyx [00:16:03]: Close enough. Yeah. what else do you need, right?Ivan [00:16:05]: Yeah. So the benchmarks itself, so, in this, in I don't think the benchmarks equate to market ownership or revenue or anything like that. and I've seen this with multiple benchmarks, not just in sandboxes, but in general benchmarks around.Swyx [00:16:20]: It's table stakes. It's just like.Ivan [00:16:21]: Exactly. But it doesn't hurt.Swyx [00:16:22]: Just roughly check.Ivan [00:16:22]: Like you definitely have to be up there and you have to be competing so that people know that, oh, this is definitely one of the top. Because this is only one dimension of what customers look for. There's other things like how many can you spin up consecutively? There's a feature set, there's support, there's like all different things that people look at, but you definitely have to be there, on the benchmarks.Swyx [00:16:40]: How many people do people spin up consecutively?Ivan [00:16:43]: So we have.Swyx [00:16:43]: Or concurrently, is the Concurrency, right?Ivan [00:16:45]: There's three metrics that we look at. And so one is like time to spin up one, and so our time to spin up one is 60 milliseconds with network latency. So request, spin up, reply, 60, the whole thing, 60 milliseconds. That is one. But if you wanna spin up 50,000 at once, we are now at about 75 seconds. So it takes about 75 seconds to spin up concurrently 50,000. Some others, there's public data around this, like take 2,000 seconds, which is 30 minutes. Like there's different variations of that. And then there is the so it is speed of one, speed of like multiple, and then how many can you consistently have up and running. And so we basically have right now no limit to how much we can add because we basically own our own metal. But the biggest customer of ours does like about 850,000 every single day is sort of where they're, where they're just shy of a million every single day that they're running, we do have a request for half a million concurrent, which is literally half a million CPUs somewhere running. So that's an interesting.Swyx [00:17:44]: They pay by like vCPU seconds.Ivan [00:17:47]: By seconds, yeah.Swyx [00:17:47]: Or whatever. Yeah. Okay, and so and then, and the other thing is, the sleeping and the resuming, ‘cause it's all the stateful resumption of all these things, how, what kind of workload are people putting through this, right? Like how is it Do we measure by gigabytes in memory, gigabytes in storage? I don't In like network attached storage. I, what are the costly ones of, out of all these features?Workload Economics: CPU, RAM, Network, and StorageIvan [00:18:15]: The most expensive thing are CPU.Swyx [00:18:18]: Okay. Yeah, of course.Ivan [00:18:18]: The second one, yeah Then it's RAM, then it's disk. We actually don't charge.Swyx [00:18:22]: Which is snapshotting, right?Ivan [00:18:23]: No, it's actually the, snapshotting's part of it, but basically the size of your hard disk, of your machine. So do you have 10 gigabytes, do you have 20, do you have 50, do you have whatever? And then the transference of that. Right now, currently we don't charge for, network at all at Polychron.Swyx [00:18:37]: Oh, you gotta, yeah, you gotta fix.Ivan [00:18:38]: Yeah. It is very much a it's a larger and larger part of our bill, so we're working around, that part there. Obviously, that is the least, expensive, so the hard disk is the least expensive, so it's basically CPU, RAM, for us network, ‘cause we don't charge the customer, and then hard disk, is how it's split up. But there's also different types of workloads, so we basically split it up into two types of workloads in Daytona. One is what we call background agents or long-running agents. and the other is, basically RLs and evals, which I put sort of together. And so they have very different patterns of usage, and if you look at the usage of a background And I'll just name names of companies, not specifically.Background Agents vs. RL/Evals: Two Usage ShapesSwyx [00:19:21]: Yeah, open, all hands.Ivan [00:19:23]: Yeah. So like a background agent's a Cognition, a Lovable, a like all these things are Harvey. These are all long-running, background agents. And so if you look at their usage patterns, their usage patterns are similar to human, which is like follow the sun. Basically, the usage patterns of that is like noon is probably the highest, and the midnight is the lowest, and then weekends are lower. weekday is higher.Swyx [00:19:42]: Yeah, that's a fun question. How global is it? Is it very US-centric or?Ivan [00:19:46]: The US is a large part, but we have currently, we have Asia, Europe, and the US regions.Swyx [00:19:52]: So it's quite global.Ivan [00:19:53]: Yeah, it's quite global. We have it all over. It's interesting that our I talked to you a bit about this. Our number one city by user.Swyx [00:20:01]: Hmm.Ivan [00:20:02]: Is Singapore.Swyx [00:20:04]: Oh, wow. Amazing.Ivan [00:20:05]: Which is an interesting one, right? Not by revenue, just by just like by individual head count.Swyx [00:20:09]: Really?Ivan [00:20:09]: Just like an interesting thing.Swyx [00:20:10]: Singapore is, Singapore is weirdly high in the adoption charts of AI for the population. It's like an, seven, eight million population. And it's like keeps showing up.Ivan [00:20:20]: No, it's quite interesting. We were quite shocked, and I was like, “Oh, this is interesting.” And also one that's up there.Swyx [00:20:24]: There's a reason I'm doing AI using Singapore. it's because I'm from there.Ivan [00:20:27]: We're there. We're gonna, we're gonna be there as well. and it's interesting that Japan is in the top or like Tokyo's in the top, which is in all the tech cycles it has never been. It has never been, so it's quite interesting that they're.Swyx [00:20:39]: I think the Japanese just love AI. Yeah. It's that, and then it's Brazil. That's it.Ivan [00:20:44]: Brazil has always been in.Swyx [00:20:45]: I think.Ivan [00:20:46]: Even when I look, if you look at like GitHub's data and ask historically with CodeAnywhere, it was always like US, Western Europe, and then you'd have like India, Brazil, China, like that would be there. But like Singapore was not in, specifically Japan was never in sort of that top, that top.Swyx [00:21:01]: Yeah. Weird pockets.Ivan [00:21:01]: Weird. Yeah, so it's very global.Swyx [00:21:02]: Okay, so actually that, but that's helps you to distribute your load through, all time?Ivan [00:21:08]: The interesting thing is like we have those kind of loads, but if you look at the researcher loads, they're quite different. So what they are is like if you give them concurrency of 10,000 or 50,000 or 100,000 CPUs at ARMb, when they fire off a run, it's just 100%. And then it just runs, and then it stops. So it's very, the usage pattern is squares basically, right? And it's also not follow the sun, because people will fire it off at midnight before they go to sleep but then wake up and so it's very unpredictable, so you don't know where that is. So the shapes of the usage are quite different than we have had before. And also what's interesting is when it's sort of a follow the sun, even if you have a high growth company, you can sort of predict your usage patterns and have enough capacity for that, because it's sort of, it grows in a, in a way you can project. When you have companies doing sort of like evals and RL, they're super spiky. So they're gonna come in, it's like, “We're gonna use nothing, then can we have 100,000?” Right? And then go back down. And then 100,000, go back down. So it's very different, right? And.Swyx [00:22:09]: Do you want to lock them into commits so.Ivan [00:22:11]: Yeah, we do.Swyx [00:22:12]: Yeah, okay.Ivan [00:22:12]: We so we have to lock them into some sort of commits to have that capacity, because we have to have, basically we have to have the capacity for peak. Right? And so right now, Daytona's mean utilization is 15%, 1-5.Swyx [00:22:25]: Oh my God.Ivan [00:22:26]: So it's very low.Swyx [00:22:27]: Because it's very spiky.Ivan [00:22:27]: It's very spiky, but we get up to 90%. so we have these things. And so what we're, what we're looking at right now as a company is similar to Cloudflare where you can like geo move things around, but that works really well for basically the background agent where it's follow the sun. But this, it's not. Like it's a very different shape. Obviously with scale you figure these things out, but that's an interesting new problem that we have, as a compute provider in the agent space. And when we were doing the conference recently, and so we talked to like Nikita from Neon and.Swyx [00:22:57]: I should bring it up.Ivan [00:22:58]: Parag from Parallel and whatnot, everyone has the same problem. Whereas the usage is super spiky, and this is something that has not happened before, that you have these types of like it was always, it the amplitudes were not this high, right? So it's quite interesting use case and problem solve.Compute Conference and Spiky Agent InfrastructureSwyx [00:23:12]: Yeah, I don't know if we're gonna bring this up again, but let's just talk about the conference, you had like 1,000 something people at the Warriors game, at the Sorry, where is it? What's.Ivan [00:23:22]: Chase Center.Swyx [00:23:23]: Chase Center.Ivan [00:23:23]: Chase Center.Swyx [00:23:24]: I went. It was, it was very impressive. Obviously, you can, how to throw a conference, what did you learn? you put, you pulled together all these impressive names.Ivan [00:23:33]: What I.Swyx [00:23:34]: What were you looking for?Ivan [00:23:35]: My thesis behind the Compute Conference was let's bring together people that are building infrastructure for AI agents. Because when I think of what we're building, it is the agent is the primary user, what are the ergonomics and usage patterns of agents, and so we can do that. And what I found, this was a theory, it wasn't proven, is that we all have these problems, as I touched onto. And I was, as I was talking on stage, it was like we all have the same underlying infra problems, which is this spiky workloads, unpredictable workloads that we've never had before, in human, compute or human infrastructure. And it's, again, it's the same when I was talking to Parag or when I was talking.Swyx [00:24:20]: Lynn. Nikita.Ivan [00:24:21]: Lynn, Nikita. Lynn especially, I was talking to her the other day as well. Like the It is a very interesting type of problem to solve because I can touch on Cloudflare because there's a lot of like talk about that recently as to how they solve that, which is they have a bunch of geos, and basically, as users work in different places, and depending on your tier, they can move you around the geos. And so that how, that's how they get the higher utilization. But you can sort of predict these, and it's If it's something in You'll rarely get a spike that is 10 orders of magnitude. Like you'll get a like let's say one of your customers has some like an exponential curve. What is that to I'm using Cloudflare as an example. 10%, 20%, whatever it is. I don't, I don't have this data, I'm just assessing. It's surely not 10x, right? It's surely not something there. And so how do you go out and solve this problem? And we're all solving this in different ways. So we have.Swyx [00:25:11]: She also has the same thing.Ivan [00:25:12]: Yeah, I know specifically that like Neon had that issue as well. Like how are we solving these spiky loads and things like that ‘cause we talked about it. And so the interesting thing for me to actually internalize was, yes, everyone that's building for agents first is going through this, and we're all solving similar problems, which is quite.Swyx [00:25:28]: Let me let me double-click on this. Okay. So for example, Neon, I happen to know that they're very sort of S3 oriented, right? so they're just like fully bet on S3. And you get to benefit from S3's distribution and infrastructure. So I would imagine that Neon doesn't have to care, whereas Lynn maybe has to care a bit more because obviously she's doing GPU inference. And, for listeners, we did an episode with her, one and a half years ago. And you have to care. But like, right?Ivan [00:25:54]: Parag cares for sure, and Nikita.Swyx [00:25:58]: And Parag is C of, Parallel.Ivan [00:25:59]: Parallel, yeah.Swyx [00:26:00]: Former CTO of Twitter.Ivan [00:26:01]: Twitter, yeah.Swyx [00:26:02]: They are the search.Ivan [00:26:03]: Yeah, they're search, yeah.Swyx [00:26:03]: I You and I know but the listeners don't know.Ivan [00:26:08]: Yeah, we can put it down in the screen, and so ‘cause we, when we were talking.Swyx [00:26:11]: I'll put it up on the, on the screen.Ivan [00:26:12]: Yeah, right.Swyx [00:26:12]: People can look it up if they need.Ivan [00:26:14]: Look it up. And, yes, but they still have CPU and RAM, allocation that you have to have up and running. And so CPU and RAM, you have to allocate that and have that ready. And so there's basically two ways to do it. One is you either over-provision and you can handle the bursts, or two, you basically have, I don't know if this is a term, just-in-time compute, which is like as your load becomes, as your usage comes in, you can fire off requests for VMs or bare metals at other cloud providers and then get them up and running.Swyx [00:26:43]: This is if you go above 100%, right?Ivan [00:26:45]: Yeah, this is.Swyx [00:26:46]: Like your overflow.Ivan [00:26:46]: If your overflow, like spillage or whatever you do.Swyx [00:26:48]: You probably lose money on it, but it doesn't matter, right?Ivan [00:26:50]: It, not Well, you might, you might not That is a more cost-effective way to do it but it's a slower way to do it. Because basically what you have to do is you have to like queue your requests, spin up these just-in-time compute, get it all ready, provision it, and then get your workload there. And so if the time isn't important that much, that's fine, and you can do that. But if your customer, and especially for, let's say, the RL training runs, the reason why a lot of people come to us is because GPUs are more expensive than CPUs, right? So you want your GPU running at, what, 100% the entire time. And so when you're running runs on CPUs, when the when the CPU cycle is like down and spinning up the next one, you want that to be instantaneous so that your GPU doesn't go down, right? And if you then have to like go out and provision machines, you're essentially telling the GPU that it has to wait, and that's incurring our cost. So there's things that you have to try to solve for there.RL Workloads, Declarative Images, and Kubernetes ReplacementSwyx [00:27:43]: Yeah, let's talk about the different workload, right? You said that, what was it? A few months ago, you had zero RL workload and now it's 50%.Ivan [00:27:52]: It will be this one, 50%, yeah.Swyx [00:27:54]: Let's talk about how different it is, right? Like I imagine, for example, a lot less dynamic code generation of like arbitrary code. Like here, it's probably all the same code. You're just doing parallel runs or something, I don't know.Ivan [00:28:05]: Yeah. So you'll have multiple Depends on the like for each run, you'll have a snapshot. And they, for the most part, they actually do use our declarative image builder, which is like, “Oh, we, the agent wants these dependencies, these env vars.”Swyx [00:28:17]: These ones, yeah.Ivan [00:28:18]: Yeah, the declarative image builder, it.Swyx [00:28:20]: Which is a very modal like thing that they.Ivan [00:28:22]: Yeah. And so we build it on the fly and then we propagate that snapshot, and you can spin up as many sandboxes as you want against that snapshot. And then if you have to do changes, the model can, or like it could be also be automated. It's like, “Oh, now for the next run, we need to install these things or remove these things or whatever to get, a task done,” and then it goes off and runs that. So yes, that is something that it seems that they prefer. The number one reason I found, or should I say, let's take a step back. What we are competing against in that environment is essentially managed Kubernetes. So EKS, GKE, whatever. That is what the vast majority run on. And anyone that has tried Daytona versus GKE, EKS is like, “I'm never going back.” That has always been. There's a few reasons. One is the ergonomics. So if you have, if you're using Kubernetes to spin that up, you have to essentially manage the interface interactions with that. Daytona, although as a compute provider, it's more akin to a Twilio and Stripe from a consumption perspective than it is an AWS. Like you have an API, an SDK, it's quite like easy and seamless to get these things up and running, that's one. The other is the speed to which we spin up, which we mentioned earlier, which is much faster, and the scale to which we can go to. We haven't got into features, but an interesting feature is that it's very hard to OOM, or out of memory, our sandboxes, because we can dynamically on the fly.Swyx [00:29:48]: Resize.Ivan [00:29:49]: Resize, which is like impossible on almost any other thing. There are some technologies that enable you to do that, but it's like a very hard thing. And so we actually saw this when, the Terminal Revenge team is, brought us actually. So thank you, Alex and the team, that brought us into this whole space.Swyx [00:30:05]: It's just very rare that, a framework would just say, “Guys, just use Daytona.”Ivan [00:30:11]: Yeah, I think it says it somewhere. Yeah.Swyx [00:30:13]: Yeah. I was like, “What is this?”Ivan [00:30:15]: There's all, there's multiple there, but they also mention a few other places. and so Daytona specifically-We have, the, just jumping on themes here We, I don't know where it says Data Center.Swyx [00:30:27]: I, there.Ivan [00:30:27]: Doesn't matter.Swyx [00:30:28]: There's a very strong recommendation, which is, very unusual. Which is, it's.Ivan [00:30:33]: We do not pay them for this, just.Swyx [00:30:34]: I know, yeah. They just like you.Ivan [00:30:35]: Yeah, they like us. yeah, and also a thing, so, Data Center has multiple isolation sets underneath. The customer doesn't have to know what they are. But basically we have Docker, which is a container, that's hardened with Sysbox. So it's Docker's, isolation that is a security equivalent to a VM, but it's still a container. And that is the default, and they, especially in these training workloads, really like that as an interface to be able to use just a basic Docker container, and we enable Docker and Docker. Which for these RL runs, if you need to do a Docker compose or Kubernetes, you can spin up a K3S inside of these things, which unlocks a huge amount of workloads that you can do that you cannot do on other providers. So just on that part is much more interesting. And so we went that, through that. We showed them that we could do that, and they enjoyed that quite a bit. They being the general venture people.Swyx [00:31:28]: Those people, yeah.Ivan [00:31:29]: And Harbor people.Swyx [00:31:29]: Harbor people, do are they, are they a company yet?Ivan [00:31:33]: As far, I do not know.Customer Pull, Slack Connect, and the Computer Use BetSwyx [00:31:35]: Okay. All right. Yeah. It's like super obvious that like, there's a lot of excitement and success around these things, okay, so yeah, tell us more, right? Like, this is an exploding workload, Harbor adopted you, which helped speed things along. But what are you learning as this new workload comes online?Ivan [00:31:53]: There's a couple things that we learned, which we chat about in the beginning. We, and this has led our story, as we mentioned, we like talked to a lot of customers along the way, and we add more features and more tool sets as we talk to customers. And it's interesting that And I think it's that the ecosystem is so small and/or the models get smarter, where when we see one user come with a request, we know it goes on a roadmap if like three to five customers come with the same request in that week. It's like very bizarre. It happens so many times, which is.Swyx [00:32:27]: Because they're all friends.Ivan [00:32:28]: Sorry?Swyx [00:32:28]: They all, they're all friends. They're all in the same group chat.Ivan [00:32:30]: Yeah, probably, yeah. ‘Cause and they're like, “Oh, can you do this?” And I'm like, “Okay, this is interesting. We'll put it on a feature request.” And then the next one's like, “Oh, can you do this?” “Okay.” It's all the same, right? It's always the same. And so what we try to do, and I personally try to do, I try to be on as many call, quote-unquote “sales calls” I can. I'm in every Slack channel. We literally have about 1,000 Slack Connect channels, something like that. It's an interesting, there's so many interesting things you find out when you have all the Slack channels. You can also see where people, transfer between companies. You see leave Slack channel, enter Slack channel. It's an interesting thing. Also, just I digress, I feel that Slack Connect is literally LinkedIn what it should be. You have a list.Swyx [00:33:08]: LinkedIn charges you to, use your own connections, but Slack doesn't, right? Slack is like, do it for free. It's more lock-in. It's great.Ivan [00:33:15]: Yeah. It's amazing. Yeah. It's one of the reasons.Swyx [00:33:17]: You're gonna pay Slack for life.Ivan [00:33:18]: Exactly. You're there for life. So that's interesting. And so one of the things, the newer things we were talking about earlier is we made a big bet and put a lot of investment on computer use. that is not seen publicly the light of day. We haven't GA'd that yet, but we have.Swyx [00:33:32]: Is there a thing I can pull up?Ivan [00:33:33]: There is computer use there. It's right up a bit.Swyx [00:33:36]: Oh, yeah. Okay.Ivan [00:33:38]: What we have, what we talked about and what we've seen publicly is there's this theme now about, the human emulator where And Elon from XAI has talked about this publicly, and if you think about the models today, they're actually quite sophisticated and they can do a lot of work, but they still don't have access to all the tools. Like, I'm a strong believer that the most efficient way for an agent to work is essentially headless or through, terminal or whatnot. But if we, if we look at knowledge work in general, there's about 100 million knowledge workers in the US, about a billion in the world, and knowledge workers, and the salaries of them aggregate to 10 trillion in the US 50 trillion worldwide.Swyx [00:34:24]: Wow.Ivan [00:34:25]: Something like that. And if we look at, the five most important sectors of that, so like healthcare and government and financial services and whatnot, that's about 56% of that. So let's say it's about half of that. So in the US it's about 25 trillion, and most of them, most of that work is actually still locked into legacy apps inside of Windows, which is not going anywhere for a very long time. Like, people just won't invest in that. How much of it? our assumption is the following: if, in the RPA market, which is similar market, well, not the same 25% of, these white collar, workers', work is automated. If an agent is more sophisticated, can go through more runs, figure stuff out, let's say it's, 40%, right? And so if you take 40% of that, you get to essentially, $10 trillion a year.Swyx [00:35:17]: That's a TAM.Ivan [00:35:18]: That is a that is a TAM. So that's the TAM of the models, right? That's not our, essentially ours. But you get to that size, and to be able to do that, you essentially have to give agents these computers with the legacy. So computer use, either Mac or Windows or Linux. Linux we also obviously have and others have. But Windows specifically is something very new, and the only option right now is an EC2 with, Windows or on Azure. Both of them take anywhere from three to five minutes to spin up. We've created an actual sandbox, so it's a second instead of milliseconds, but you have, point in time snapshots, you have, forking, you have all the things that you have from a sandbox, but essentially enables you to hopefully unlock all this value. And so that's been our big push and bet, but we've sort of, kept our ear to the ground. What is sort of the next things in the market?RPA Returns: Why Agents Still Need ComputersSwyx [00:36:06]: Yeah, knowledge work, and building, and sort of RPA, the next wave of RPA. I got very excited about RPA kind of during COVID times. The UI path was IPO-ing. And it was, a very hot Isn't it, Eastern European?Ivan [00:36:20]: It is, Romanian.Swyx [00:36:21]: Romanian?Yeah, it might be the only Romanian, big unicorn okay, yeah. This I don't I don't, I don't have like a I think there's, I think there's a stage being set for the resurgence of RPA, ‘cause everyone understands that, yeah, no one wants to deal with these shitty apps and no one's gonna rewrite them. Like, you just have to do, a remote operation and programmatic operation of them.Ivan [00:36:45]: If you wanna unlock it, my own setup was basically the following. So I was doing a board deck recently, last month, whatever, and I'm like, “Okay, let's just, let's just do automated.” So, all our data's in, ClickHouse and PostHog and QuickBooks, where everyone else's is, and I'm basically, connected that all to, my Cloud code, like go off and go Cloud code whatever. Go off and, here's the integrations, go do that. It pulled out the first report, which was great. It connected to Brex and all these things, pulled it, which was great, and then I say, “Okay, now pull out this, and this,” and I kept getting, really well McKinsey-style design reports, but the data said partial data. all the missing data, partial data. Like, it can't access all the things, and I got so frustrated, and so I got, I got, my Mac Mini virtual sandbox with OpenClaw. I gave it its own account in our company, and then I went to all these services and created a read-only account, so literally like an intern in your company. And so I would say, “Now go and do this report,” and it would get the same, or like, “I can't via the MCP or the API or whatever. I can't get all the information.” I'm like, “Go log in.” And it will log into the website, then go in, export the data. It'll export the data and do the thing end to end. So even for things that have today APIs, not all of it is exposed, and I to get value, I get immense value right now, but it has to be a computer usage, unfortunately, and so I spend a bunch of tokens just on that, but I get the job done. And so if even a startup like ours, and using all the hottest tools, still needs a computer agent what hope does, Goldman have to have a headless, right?Swyx [00:38:22]: Yeah, what a - Why isn't Microsoft doing this?Ivan [00:38:27]: I'm pretty sure, Satya had a post yesterday.Swyx [00:38:29]: Oh, okay. I see.Ivan [00:38:29]: Which was like, “Every agent needs a computer.”Swyx [00:38:31]: I see, I see.Ivan [00:38:32]: So they have launched something recently.Swyx [00:38:34]: Yeah, they have Microsoft Power Automate, I'm sure, I'm sure, they're gonna have their version.macOS Sandboxes, Apple Constraints, and the Windows OpportunityIvan [00:38:39]: Version of that, yeah.Swyx [00:38:39]: You're gonna try to do yours, and it - I always know there's always demand for Mac, but I know it's, tricky to host, macOS sandboxes.Ivan [00:38:49]: We will have macOS sandboxes fairly soon. The problem with macOS, OS sandboxes is, I'm deep in this, I don't know how much interesting is.Swyx [00:38:55]: No, it's.Ivan [00:38:56]: MacOS has this problem.Swyx [00:38:57]: It's a licensing thing, right?Ivan [00:38:58]: Licensing thing. So one, you're allowed to run only two parallel VMs per machine, so that's one. Two, you can only license to a different user every 24 hours. So if you come in and theoretically, if I wanna charge you per second and I charge you one second, I have to have it idle for the rest of the day. I can't have anyone else doing that. So the pricing will be different in the sense that I will have to - we would have to charge for 24 hours, and that's not even, that's not even the most difficult thing. But the, thing above that is, from a security perspective, they enable you to do memory snapshot, pause, resume, but only on the same physical drive, physical machine. And so what you can do in, Windows world or Linux world is that I can move in the background, your snapshot from one to the other and manage load, right? Here, if you wanna do that, you essentially have to have your.Swyx [00:39:49]: Yeah, snapshots. Yeah.Ivan [00:39:50]: Your.Swyx [00:39:51]: It's like.Ivan [00:39:51]: Physical machine.Swyx [00:39:52]: You can't break it up.Ivan [00:39:53]: You can't, you can't move things around that, and all of that is, that part is, from a security standpoint, if it is written. Like, I understand the security aspect of that, but it disables you from doing these agentic, like really scalable agentic workloads.Swyx [00:40:08]: You need to do a vibe-coded, clean room implementation on macOS that you can then - That's like Clean OS or something. I don't know.Ivan [00:40:17]: So. We have.Swyx [00:40:18]: ‘cause like Linux was originally like a clean room rewrite of Unix.Ivan [00:40:21]: Okay. Yeah.Swyx [00:40:21]: Or something like that, right? Like same thing to macOS. Someone needs to do it.Ivan [00:40:25]: Someone will do that, and someone will have some long-running agents for a few days to figure this stuff out. But yeah. So definitely we - we're really close to offering something ‘cause people do want it, but the pricing will be different, and the feature set will be sort of stringent.Swyx [00:40:38]: Yeah, nobody's gonna use this. like, the labs, the labs will because they want to automate macOS.Ivan [00:40:42]: They have to do RL. They have to do RL again. But even if you The - So the point is with the RL part, if you, if you do RL on macOS, then the next iteration of the model comes out, it will be able to use these tools significantly. Then you actually need to run those, that somewhere. So you're gonna have to have that, later on. And from, if anyone at Apple is listening, I very much feel that they are shooting themselves in the foot of the scale of the revenue of compute or licensing they could get if they would just enable a concurrency model similar to what you can get on a Windows and a, and Linux.Swyx [00:41:17]: Yeah. Yeah. And I'm sure they've heard this before. They just don't care. Yeah, it's And maybe they will change their mind with the new CEO.Ivan [00:41:24]: Yeah. We'll see.Swyx [00:41:25]: We'll see.Ivan [00:41:25]: High hopes.Swyx [00:41:26]: High hopes.Ivan [00:41:26]: High hopes.Swyx [00:41:27]: Okay. But I, it's very clear the market opportunity is huge in Windows, and you can go for a long time on just Windows, but your customers are gonna want both. and I think, it is interesting to me that, this is the sort of God application of agents, right? Like, I don't It was - How big was OpenClaw for you guys? Like, was it, was there, a significant bump.OpenClaw, Agent Labs, and the B2B2C Sandbox MarketIvan [00:41:54]: Not for us because we.Swyx [00:41:54]: Because you already.Ivan [00:41:55]: We're kind of positioned differently. Whereas although it's completely PLG and we have individual developers that use it, most of the users that use Daytona are sort of a B2B2C. Sort of it's either B2B or B2B2C. So, in the researcher world, it's B2B, so you're selling to, labs and neo labs and things like that. But on the long-running agents, it's mostly, from a scale revenue perspective, it's mostly B2B2C, where you have a app layer agent that uses you at a big scale.Swyx [00:42:26]: Like a Manus. Yeah.Ivan [00:42:28]: Like a Manus Lovable type of thing.Swyx [00:42:31]: Yeah. I think that's the question of, well how, um-Uh, yeah, B2B to C is basically to me what I've been calling an agent lab, which is kind of like you're not in a model lab, but you're making a very good wrapper that is a platform that other people can sign up so they don't have to code those things. Yeah, it sound, it sounds like a much better market than the direct OpenClaw market.Ivan [00:42:56]: I've like - We I've done multiple things. So the CodeAnywhere's part of our career path R in the calendar, was very much an end user developer product. And so that is great. It You can get a lot of developer love, and I feel that we do as a company have a bunch of developer love. But it's a different type, where it's people building these things. Again, it's more akin to a Twilio because you don't really run - As a person, you wouldn't run Twilio. I don't know how many people remember. It was like ask your developer billboard and whatnot. And people really love Twilio, but they only used it inside of like, “Oh, I'm building this app or service for thing.” And so we're very much directly to that. And you also know that I used to work for a competitor for Twilio, so it's kind of ingrained, in my DNA.Swyx [00:43:35]: People don't know InfoBip is that big.Ivan [00:43:38]: Yeah, it's.Swyx [00:43:39]: Because.Ivan [00:43:40]: It's a billion euro.Swyx [00:43:40]: They're all American. They're like, “Whatever's in Europe doesn't matter to me.” But like it's the, it's the same size or bigger? Same size?Ivan [00:43:46]: It's about half the size.Swyx [00:43:47]: Half the size?Ivan [00:43:48]: Yeah, about half the size.Swyx [00:43:48]: It's like, yeah.Ivan [00:43:48]: Still huge. Multiple billions a year. Yes.Swyx [00:43:51]: That's crazy.Ivan [00:43:51]: Exactly, and so that - These are like really interesting and large revenue-generating, very sticky businesses. Whereas when you're selling to the - When your focus is the end developer, it is a very hard sell because they're very price sensitive, very price conscious, very around that. And there's very It's very hard to scale. Your cap is the number of people that are willing to spin up - First of all, wanna spin that up, and then spin up multiple of these. Whereas if you're in the enterprise one, like we know everyone's talking about like how many tokens they're spending, I'm spending. Like a lot of companies today are like, “If this is our company, spend as much as you can.” Like basically that is where we're going. And so if you think about that paradigm, where you're selling to companies that say, “Spend as much as you can to generate, productivity,” versus, “Oh, I'm a single person. I have this much budget, and I'm doing this thing because it's fun or it's helping me out or whatever.” Like it is a different, it's a different go-to-market, I think, strategy.MCP, CLIs, and Sandboxes as the Agent RuntimeSwyx [00:44:50]: Yeah, there's a lot of discussion. I'm just kind of going through like the mental list of things that are in your favor, which is, for example, MCP versus CLI. Like obviously you want CLI. It's been very good for you. I feel like it's maybe a drop in the bucket or maybe it's huge. I'm just checking whether it's like these are big trends.Ivan [00:45:10]: Those things you - work well in our favor, to your point just because every.Swyx [00:45:13]: They're kind of drop in the bucket, right?Ivan [00:45:15]: I think it's like sort of all the things come together. And so there's so many things that impact that. To your point, like OpenClaw wasn't huge for us, but like having the agent SDK, from Anthropic, so or Cloud Claude Code was very interesting. The reason why it was interesting is that a lot of, let's call them app I don't know what to call them, app layer agent companies, essentially they are like, “Oh, I can create this new app, this new agent. All I need, I just use Claude Code, and I throw it into a sandbox, and then I have my interface to the human to that.” And so that enabled so many more companies to actually offer this, and then they would pull on sandbox. So that was, that was interesting. And to your point, like MCP, versus the CLI, the MCP is an interface against an API, whereas the CLI is like you can actually go do things. Like this is it. The difference between integrations and actually running scripts or data or analysis against a thing. So being able to use a CLI very well enables the agent to do more things, and it's because that people will invoke a sandbox, they'll run it in the CLI, and but it'll do anal-analysis on that data and then give you an actual result versus just, pulling data from an API source.Swyx [00:46:29]: Yeah, it's a layer of indirection basically, it's the same thing as agentic search versus RAG, which where you're.Ivan [00:46:34]: Exactly, yeah.Swyx [00:46:34]: Just like you just win whenever people put more agents into their workflow. And so like it doesn't really matter, but I'm just kinda teasing out like what else have people heard about that like it's sort of, “Oh yeah, this is another sandbox use case. Oh yeah, that's another one.” Am I, am I missing any big ones?Ivan [00:46:51]: The thing, the thing that people, which is the computer use stuff, which I think is probably the most interesting one, is, and to your point, we've talked to so many people over the last year. It's like, “Oh, like why do you need a sandbox? Why do you need this? Why this?” And to your point, it's like, “Oh, I need sandbox for this. I need sandbox for that. I need sandbox-” It's like, “Oh, I need it for every single thing.” And so basically what I, what I - and it sounds like a broken record, it's like you use a laptop every single day, right? And you are n of one. It's just you. But now imagine how And by the way, the laptop, the computer PC market, the PC market is about equal to the cloud market in total. So it's about 150, 180 billion a year. Something like that. It's about roughly the three cloud hyperscalers is about equal to like Apple, HP, Lenovo, whatever, It's a little bit less, but it's sort of like that. And now imagine And that's just like, so how big is the addressable market? What, how many people are there in the world now? What's the last data?Swyx [00:47:45]: Let's call it eight billion.Ivan [00:47:46]: Eight billion. And so let's say you can have two computer, like you have one personal and one business, whatever. Like so it's double that, right? and so that's 16 billion, right? How many agents are gonna be running in two years, in 10 years, in 100 years? Like And for every single task, they will need one of these. And so how big is that? That market is essentially quote unquote “infinite”. You will get to the point, and Dylan Patel was at the conference talking about, from SemiAnalysis, that talks usually about GPUs, was also talking about how CPUs will now be a bottleneck because it will be the constraint. You won't be able to grow, or we won't be able to have enough of these because there won't be enough CPUs to basically do.Swyx [00:48:23]: Yeah. Well, I actually had a really good podcast with Doug Oliphant, who, which was his president at SemiAnalysis, where they've basically been like, yeah, it's been a GPU shortage first, but then it's cascaded down to memory and now to CPUs.Ivan [00:48:35]: CPU, yeah.Swyx [00:48:35]: It-What's next? So networking. So, networking actually has been in shortage for a while if you're looking at, just GPU networking. But, yeah, it's really crazy the amount of computer use that's going on, yeah, cool. I, other questions are, just the one very big part is the open sourceness which you didn't have to do, your competitors don't do, like it's not, a lot of people are worried about keeping their projects open source because some competitor can just slot fork it. I don't know if there's any reflections on just being an open source company.Open Source, Trust, and Enterprise ProcurementIvan [00:49:15]: Yeah. There's a bunch. So we the original product that we did was open source.Swyx [00:49:19]: Yeah. CodeAnywhere.Ivan [00:49:20]: So doing that was actually very good for us. There's basically a saying of, What's the saying? Like, companies that are, that are doing really well, measure themselves against, free cashflow, that are kinda okay, it's EBITDA, then, it's, it goes all the way down.Swyx [00:49:36]: The worst is like GitHub stars.Ivan [00:49:37]: GitHub stars. GitHub stars are the worst, yeah. So you go all the way down to GitHub stars. And so our original one was GitHub stars. That's what we talked about, we're at the point we're talking about revenue, so we're we've gone up the stack on that. And so we started.Swyx [00:49:47]: No, profit.Ivan [00:49:48]: Yeah. We haven't, we're, we'll get there. We'll get there. But basically at that point we did stars and GitHub and it was useful, and the original variation that we did, it we split the core into its own repo and it was Apache 2.0, so very, permissive. And then we basically would bundl

covid-19 christmas god ceo american new year founders trust ai europe china apple man japan giving happy new year san francisco chinese performance italy data sales japanese dna microsoft losing open dad brazil network fortune startups built 3d weird shift tokyo daughter mvp os ga pc circle warriors charge cloud singapore mac computers agent windows b2b workers guys historic sort saas insane intel ram cto vc entire ipo slack openai hook salesforce nvidia hp api mckinsey croatia bare generally parallel open source ui aws nomad daytona linux github neon goldman technically tam apis licensing romanian stripe data centers vm azure deployment cad macos temporal apache cognition anthropic western europe harbor nikita railways gpu kudos cpu eastern europeans lenovo huddle quickbooks s3 ides ide ebitda cloudflare prs docker rag git kubernetes gpus benchmarks rpa gtm sdks twilio lambda lovable yc gitlab satya manus xai mac mini mcp iit rl ci cd cursor firecrackers json unix cpus cli allbirds cloud9 vms vs code plg benioff brex b2b2c chase center heroku ityou equinix bsl startup culture ebs replit eks parag nitrous nvme sqlite rls ec2 stickiness concurrency tti armb ai saas 850k bill gurley sandboxes ai engineer kua oom resize iops clickhouse clis gke rkl microsoft power automate no priors

From SaaS to AI-First: How Companies Are Reshaping Innovation

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Feb 19, 2026 40:41

In this episode of No Priors, Sarah and Elad dive into the evolving landscape of software, exploring how AI is transforming the traditional SaaS model. They discuss whether SaaS as we know it is coming to an end, what new business and sales strategies are emerging, and how AI is reshaping the way software is built, sold, and scaled. The conversation also examines whether or not these shifts are a good thing for both big and small companies, and how coders and software experts are reacting to abrupt AI transitions. They also dig into how AI is reshaping sales, automating workflows, and enabling more predictive customer strategies. Beyond individual companies, they examine how tech giants are increasingly dominating the S&P 500, and what this concentration of power means for the future of startups, innovation, and the broader entrepreneurial ecosystem. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | Chapters: 00:00 – Cold Open 00:35 – The SaaS-polcalypse discussion 4:55 – AI Change Management in Large vs. Small Companies 05:43 – “Is Software Eating the World?” 08:38 – Addressing the Unsolved Problems 14:00 – The Noise of the Last Month vs. Excitement 21:32 – What Proportion of GDP is Tech? 23:20 – Market Cap Shifts 25:02 – As a Company, When Should You Sell? 29:05 – Multi-Product Bundle Defense 30:45 – Conclusion

world ai tech innovation companies addressing large conclusion saas excitement gdp reshaping elad no priors

No Priors Live: Building Durable Software in the AI Age with MongoDB President & CEO CJ Desai

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Jan 22, 2026 36:37

Why are there only a handful of companies in the world with over $10 billion in pure-play software revenue? CJ Desai believes the reason is that products are replaceable, but platforms are forever. For No Priors' very first live from MongoDB.local SF, Sarah Guo is joined by CJ Desai, CEO and President of software developer MongoDB, to discuss the shifting landscape of enterprise software. CJ discusses whether AI will erode the value of software, and what truly constitutes a “moat” in the age of generative AI. CJ also talks about why AI adoption with Fortune 500-sized companies is still lagging, the importance of customer relationships, and why the “bear thesis” on SaaS may be overblown. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @cj_mongodb | @MongoDB Chapters: 00:00 – Cold Open 00:58 – CJ Desai Introduction 01:38 – The AI Stack and the Future of Software 04:18 – Why Platforms, Not Products, Are Sticky 09:59 – Vibe Coding and the Threat of On-Demand Apps 12:15 – Paths to Success for Software Vendor Incumbents 14:24 – How CJ Chose MongoDB 18:55 – Debunking the SaaS Bear Thesis 22:07 – Fortune 500 Perspectives on AI Value 24:24 – Can AI Native Startups Replace Systems of Record? 28:10 – The Importance of Customer Relationships 31:46 – Managing Through Massive Technology Transitions 36:37 – Conclusion

ceo president ai success future fortune record threats software conclusion perspectives saas debunking paths sf cj durable desai mongodb ai age no priors

10 contrarian leadership truths every leader needs to hear | Matt MacInnis (Rippling)

Podcast Notes Playlist: Latest Episodes

Play Episode Listen Later Jan 14, 2026

Lenny's Podcast: Product | Growth | Career ✓ Claim : Read the notes at at podcastnotes.org. Don't forget to subscribe for free to our newsletter, the top 10 ideas of the week, every Monday --------- Matt MacInnis is the chief product officer and former longtime COO at Rippling, a unified workforce management platform valued at over $16 billion.We discuss:1. Why “extraordinary results demand extraordinary efforts”2. Why you should deliberately understaff projects, and how to know when you've gone too far3. Matt's transition from COO to CPO and what surprised him about leading product4. The “high alpha, low beta” framework for evaluating people, processes, and products5. When founders should quit their startups (hint: much earlier than VCs want you to)6. How to fight entropy in your organization through relentless energy and intensity—Brought to you by:Google Gemini—Your everyday AI assistant: https://ai.dev/Datadog—Now home to Eppo, the leading experimentation and feature flagging platform: https://www.datadoghq.com/lennyGoFundMe Giving Funds—Make year-end giving easy: http://gofundme.com/lenny—Transcript: https://www.lennysnewsletter.com/p/10-contrarian-leadership-truths—My biggest takeaways (for paid newsletter subscribers): https://www.lennysnewsletter.com/i/181916584/my-biggest-takeaways-from-this-conversation—Where to find Matt MacInnis:• X: https://x.com/stanine• LinkedIn: https://www.linkedin.com/in/macinnis• Email: macinnis@rippling.com—Where to find Lenny:• Newsletter: https://www.lennysnewsletter.com• X: https://twitter.com/lennysan• LinkedIn: https://www.linkedin.com/in/lennyrachitsky/—In this episode, we cover:(00:00) Introduction to Matt MacInnis and Rippling(04:38) The importance of extraordinary efforts(08:37) The challenges and rewards of relentless effort(10:11) Your job as a leader is to preserve intensity(12:39) You learn far more from success than failure(16:34) Transitioning to chief product officer(19:54) Fixing product management at Rippling(25:27) The “high alpha, low beta” framework(28:55) The PQL framework(35:16) Hiring frameworks and team dynamics(36:52) A helpful interview tactic(40:00) Leading as a COO vs. a CPO(42:34) The reality of product-market fit(46:38) The problem with venture capital(49:29) When founders should quit their startups(41:48) The immutable market(54:13) Lessons from Notion's success(57:43) Investment strategies and narrative violations(01:00:42) The power of compounding, power law, and entropy(01:07:02) Maintaining intensity and fighting entropy(01:11:33) The importance of feedback and escalations(01:14:31) Rippling's vision and success(01:17:48) AI's impact on SaaS and business software(01:23:42) AI corner(01:26:23) Final thoughts and lightning round—Referenced:• Rippling: https://www.rippling.com• Sunil Raman on LinkedIn: https://www.linkedin.com/in/sunilraman• Dan Gill on LinkedIn: https://www.linkedin.com/in/dangill• Carvana: https://www.carvana.com• Brian Chesky's new playbook: https://www.lennysnewsletter.com/p/brian-cheskys-contrarian-approach• Parker Conrad on LinkedIn: https://www.linkedin.com/in/parkerconrad• Inkling: https://www.inkling.com• Akshay Kothari on LinkedIn: https://www.linkedin.com/in/akothari• Notion: https://www.notion.com• Conway's law: https://en.wikipedia.org/wiki/Conway%27s_law• Seeking Alpha: https://seekingalpha.com• Dennis Rodman's website: https://dennisrodman.com• Dancing pickle emoji: https://slackmojis.com/emojis/456-dancing_pickle• Pickle Rick: https://en.wikipedia.org/wiki/Pickle_Rick• SPOTAK: The Six Traits I Look for When I'm Hiring: https://finance.yahoo.com/news/spotak-six-traits-look-m-181335267.html• Geoff Lewis on LinkedIn: https://www.linkedin.com/in/geofflewis1• Zenefits: https://en.wikipedia.org/wiki/TriNet_Zenefits• New banking records prove Deel paid thief who stole trade secrets from Rippling: https://www.rippling.com/blog/new-banking-records-prove-deel-paid-thief-who-stole-trade-secrets-from-rippling• Workday: https://www.workday.com• Matic robots: https://maticrobots.com• Wall-E: https://www.imdb.com/title/tt0910970• Conviction: https://www.conviction.com• Mike Vernal on X: https://x.com/mvernal• Sarah Guo on X: https://x.com/saranormous• No Priors: https://linktr.ee/nopriors• Gemini: https://gemini.google.com• ChatGPT: https://chatgpt.com• Claude: https://claude.ai• Bryan Schreier on LinkedIn: https://www.linkedin.com/in/bryanschreier• Heated Rivalry on HBO Max: https://www.hbomax.com/shows/heated-rivalry/50cd4e99-04ee-427b-a3b4-da721ed05d9c• Fellow coffee maker: https://fellowproducts.com/products/aiden-precision-coffee-maker—Recommended books:• Pale Blue Dot: A Vision of the Human Future in Space: https://www.amazon.com/Pale-Blue-Dot-Vision-Future/dp/0345376595• Conscious Business: How to Build Value Through Values: https://www.amazon.com/Conscious-Business-Build-through-Values/dp/1622032020• Thinking in Systems: https://www.amazon.com/Thinking-Systems-Donella-H-Meadows/dp/1603580557• The Effective Executive: The Definitive Guide to Getting the Right Things Done: https://www.amazon.com/Effective-Executive-Definitive-Harperbusiness-Essentials/dp/0060833459—Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email podcast@lennyrachitsky.com.—Lenny may be an investor in the companies discussed. To hear more, visit www.lennysnewsletter.com

10 contrarian leadership truths every leader needs to hear | Matt MacInnis (Rippling)

Podcast Notes Playlist: Business

Play Episode Listen Later Jan 14, 2026 96:17

Lenny's Podcast: Product | Growth | Career ✓ Claim Key Takeaways Deliberately understaff projects: Constraints force creativity and prevent bloat from politics and bureaucracyYou've gone too far when teams can't ship basic functionalityThe sweet spot is uncomfortable but productive tensionGood teams get tired; great teams run in the red constantly and destroy good teams in that momentHigh alpha, low beta framework: Evaluate people and processes on upside potential (alpha) versus volatility (beta)Prioritize high alpha opportunities even with higher betaProcesses exist solely to lower beta but suppress alpha as a trade-offThe nuanced dance is decreasing volatility where needed (like payroll) without killing upside in innovation areasKnow when to quit your startup: If you're not certain you have product-market fit, you don't have itCompanies that hit big do so quickly; the “never quit” mentality is VC propaganda designed to extract value from founders, not protect themPivot twice or three times maximum, typically by year fourYou're running an experiment to see if the universe has binding receptors for your product. If not, move on… Leadership means fighting entropy relentlesslyTeams naturally optimize for local comfort and disorderExecutives must demand 99% energy levels daily, or the system decaysTop performers don't get 10% more rewards; they get 10-100xBeing “chill” accomplishes nothingWithholding negative feedback is selfish because you prioritize your comfort over making teammates and the company better Read the full notes @ podcastnotes.orgMatt MacInnis is the chief product officer and former longtime COO at Rippling, a unified workforce management platform valued at over $16 billion.We discuss:1. Why “extraordinary results demand extraordinary efforts”2. Why you should deliberately understaff projects, and how to know when you've gone too far3. Matt's transition from COO to CPO and what surprised him about leading product4. The “high alpha, low beta” framework for evaluating people, processes, and products5. When founders should quit their startups (hint: much earlier than VCs want you to)6. How to fight entropy in your organization through relentless energy and intensity—Brought to you by:Google Gemini—Your everyday AI assistant: https://ai.dev/Datadog—Now home to Eppo, the leading experimentation and feature flagging platform: https://www.datadoghq.com/lennyGoFundMe Giving Funds—Make year-end giving easy: http://gofundme.com/lenny—Transcript: https://www.lennysnewsletter.com/p/10-contrarian-leadership-truths—My biggest takeaways (for paid newsletter subscribers): https://www.lennysnewsletter.com/i/181916584/my-biggest-takeaways-from-this-conversation—Where to find Matt MacInnis:• X: https://x.com/stanine• LinkedIn: https://www.linkedin.com/in/macinnis• Email: macinnis@rippling.com—Where to find Lenny:• Newsletter: https://www.lennysnewsletter.com• X: https://twitter.com/lennysan• LinkedIn: https://www.linkedin.com/in/lennyrachitsky/—In this episode, we cover:(00:00) Introduction to Matt MacInnis and Rippling(04:38) The importance of extraordinary efforts(08:37) The challenges and rewards of relentless effort(10:11) Your job as a leader is to preserve intensity(12:39) You learn far more from success than failure(16:34) Transitioning to chief product officer(19:54) Fixing product management at Rippling(25:27) The “high alpha, low beta” framework(28:55) The PQL framework(35:16) Hiring frameworks and team dynamics(36:52) A helpful interview tactic(40:00) Leading as a COO vs. a CPO(42:34) The reality of product-market fit(46:38) The problem with venture capital(49:29) When founders should quit their startups(41:48) The immutable market(54:13) Lessons from Notion's success(57:43) Investment strategies and narrative violations(01:00:42) The power of compounding, power law, and entropy(01:07:02) Maintaining intensity and fighting entropy(01:11:33) The importance of feedback and escalations(01:14:31) Rippling's vision and success(01:17:48) AI's impact on SaaS and business software(01:23:42) AI corner(01:26:23) Final thoughts and lightning round—Referenced:• Rippling: https://www.rippling.com• Sunil Raman on LinkedIn: https://www.linkedin.com/in/sunilraman• Dan Gill on LinkedIn: https://www.linkedin.com/in/dangill• Carvana: https://www.carvana.com• Brian Chesky's new playbook: https://www.lennysnewsletter.com/p/brian-cheskys-contrarian-approach• Parker Conrad on LinkedIn: https://www.linkedin.com/in/parkerconrad• Inkling: https://www.inkling.com• Akshay Kothari on LinkedIn: https://www.linkedin.com/in/akothari• Notion: https://www.notion.com• Conway's law: https://en.wikipedia.org/wiki/Conway%27s_law• Seeking Alpha: https://seekingalpha.com• Dennis Rodman's website: https://dennisrodman.com• Dancing pickle emoji: https://slackmojis.com/emojis/456-dancing_pickle• Pickle Rick: https://en.wikipedia.org/wiki/Pickle_Rick• SPOTAK: The Six Traits I Look for When I'm Hiring: https://finance.yahoo.com/news/spotak-six-traits-look-m-181335267.html• Geoff Lewis on LinkedIn: https://www.linkedin.com/in/geofflewis1• Zenefits: https://en.wikipedia.org/wiki/TriNet_Zenefits• New banking records prove Deel paid thief who stole trade secrets from Rippling: https://www.rippling.com/blog/new-banking-records-prove-deel-paid-thief-who-stole-trade-secrets-from-rippling• Workday: https://www.workday.com• Matic robots: https://maticrobots.com• Wall-E: https://www.imdb.com/title/tt0910970• Conviction: https://www.conviction.com• Mike Vernal on X: https://x.com/mvernal• Sarah Guo on X: https://x.com/saranormous• No Priors: https://linktr.ee/nopriors• Gemini: https://gemini.google.com• ChatGPT: https://chatgpt.com• Claude: https://claude.ai• Bryan Schreier on LinkedIn: https://www.linkedin.com/in/bryanschreier• Heated Rivalry on HBO Max: https://www.hbomax.com/shows/heated-rivalry/50cd4e99-04ee-427b-a3b4-da721ed05d9c• Fellow coffee maker: https://fellowproducts.com/products/aiden-precision-coffee-maker—Recommended books:• Pale Blue Dot: A Vision of the Human Future in Space: https://www.amazon.com/Pale-Blue-Dot-Vision-Future/dp/0345376595• Conscious Business: How to Build Value Through Values: https://www.amazon.com/Conscious-Business-Build-through-Values/dp/1622032020• Thinking in Systems: https://www.amazon.com/Thinking-Systems-Donella-H-Meadows/dp/1603580557• The Effective Executive: The Definitive Guide to Getting the Right Things Done: https://www.amazon.com/Effective-Executive-Definitive-Harperbusiness-Essentials/dp/0060833459—Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email podcast@lennyrachitsky.com.—Lenny may be an investor in the companies discussed. To hear more, visit www.lennysnewsletter.com

10 contrarian leadership truths every leader needs to hear | Matt MacInnis (Rippling)

Podcast Notes Playlist: Startup

Play Episode Listen Later Jan 14, 2026 96:17

10 contrarian leadership truths every leader needs to hear | Matt MacInnis (Rippling)

Lenny's Podcast: Product | Growth | Career

Play Episode Listen Later Dec 28, 2025 96:17

Matt MacInnis is the chief product officer and former longtime COO at Rippling, a unified workforce management platform valued at over $16 billion.We discuss:1. Why “extraordinary results demand extraordinary efforts”2. Why you should deliberately understaff projects, and how to know when you've gone too far3. Matt's transition from COO to CPO and what surprised him about leading product4. The “high alpha, low beta” framework for evaluating people, processes, and products5. When founders should quit their startups (hint: much earlier than VCs want you to)6. How to fight entropy in your organization through relentless energy and intensity—Brought to you by:Google Gemini—Your everyday AI assistant: https://ai.dev/Datadog—Now home to Eppo, the leading experimentation and feature flagging platform: https://www.datadoghq.com/lennyGoFundMe Giving Funds—Make year-end giving easy: http://gofundme.com/lenny—Transcript: https://www.lennysnewsletter.com/p/10-contrarian-leadership-truths—My biggest takeaways (for paid newsletter subscribers): https://www.lennysnewsletter.com/i/181916584/my-biggest-takeaways-from-this-conversation—Where to find Matt MacInnis:• X: https://x.com/stanine• LinkedIn: https://www.linkedin.com/in/macinnis• Email: macinnis@rippling.com—Where to find Lenny:• Newsletter: https://www.lennysnewsletter.com• X: https://twitter.com/lennysan• LinkedIn: https://www.linkedin.com/in/lennyrachitsky/—In this episode, we cover:(00:00) Introduction to Matt MacInnis and Rippling(04:38) The importance of extraordinary efforts(08:37) The challenges and rewards of relentless effort(10:11) Your job as a leader is to preserve intensity(12:39) You learn far more from success than failure(16:34) Transitioning to chief product officer(19:54) Fixing product management at Rippling(25:27) The “high alpha, low beta” framework(28:55) The PQL framework(35:16) Hiring frameworks and team dynamics(36:52) A helpful interview tactic(40:00) Leading as a COO vs. a CPO(42:34) The reality of product-market fit(46:38) The problem with venture capital(49:29) When founders should quit their startups(41:48) The immutable market(54:13) Lessons from Notion's success(57:43) Investment strategies and narrative violations(01:00:42) The power of compounding, power law, and entropy(01:07:02) Maintaining intensity and fighting entropy(01:11:33) The importance of feedback and escalations(01:14:31) Rippling's vision and success(01:17:48) AI's impact on SaaS and business software(01:23:42) AI corner(01:26:23) Final thoughts and lightning round—Referenced:• Rippling: https://www.rippling.com• Sunil Raman on LinkedIn: https://www.linkedin.com/in/sunilraman• Dan Gill on LinkedIn: https://www.linkedin.com/in/dangill• Carvana: https://www.carvana.com• Brian Chesky's new playbook: https://www.lennysnewsletter.com/p/brian-cheskys-contrarian-approach• Parker Conrad on LinkedIn: https://www.linkedin.com/in/parkerconrad• Inkling: https://www.inkling.com• Akshay Kothari on LinkedIn: https://www.linkedin.com/in/akothari• Notion: https://www.notion.com• Conway's law: https://en.wikipedia.org/wiki/Conway%27s_law• Seeking Alpha: https://seekingalpha.com• Dennis Rodman's website: https://dennisrodman.com• Dancing pickle emoji: https://slackmojis.com/emojis/456-dancing_pickle• Pickle Rick: https://en.wikipedia.org/wiki/Pickle_Rick• SPOTAK: The Six Traits I Look for When I'm Hiring: https://finance.yahoo.com/news/spotak-six-traits-look-m-181335267.html• Geoff Lewis on LinkedIn: https://www.linkedin.com/in/geofflewis1• Zenefits: https://en.wikipedia.org/wiki/TriNet_Zenefits• New banking records prove Deel paid thief who stole trade secrets from Rippling: https://www.rippling.com/blog/new-banking-records-prove-deel-paid-thief-who-stole-trade-secrets-from-rippling• Workday: https://www.workday.com• Matic robots: https://maticrobots.com• Wall-E: https://www.imdb.com/title/tt0910970• Conviction: https://www.conviction.com• Mike Vernal on X: https://x.com/mvernal• Sarah Guo on X: https://x.com/saranormous• No Priors: https://linktr.ee/nopriors• Gemini: https://gemini.google.com• ChatGPT: https://chatgpt.com• Claude: https://claude.ai• Bryan Schreier on LinkedIn: https://www.linkedin.com/in/bryanschreier• Heated Rivalry on HBO Max: https://www.hbomax.com/shows/heated-rivalry/50cd4e99-04ee-427b-a3b4-da721ed05d9c• Fellow coffee maker: https://fellowproducts.com/products/aiden-precision-coffee-maker—Recommended books:• Pale Blue Dot: A Vision of the Human Future in Space: https://www.amazon.com/Pale-Blue-Dot-Vision-Future/dp/0345376595• Conscious Business: How to Build Value Through Values: https://www.amazon.com/Conscious-Business-Build-through-Values/dp/1622032020• Thinking in Systems: https://www.amazon.com/Thinking-Systems-Donella-H-Meadows/dp/1603580557• The Effective Executive: The Definitive Guide to Getting the Right Things Done: https://www.amazon.com/Effective-Executive-Definitive-Harperbusiness-Essentials/dp/0060833459—Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email podcast@lennyrachitsky.com.—Lenny may be an investor in the companies discussed. To hear more, visit www.lennysnewsletter.com

The Best of 2025 (So Far) with Sarah Guo and Elad Gil

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Oct 31, 2025 18:59

2025 has thus far been a year of great leaps and advances in AI technology. And Sarah and Elad have spoken with some of the most enterprising founders and scientific minds in the field of AI today. So we're revisiting a few of our favorite conversations on No Priors so far in 2025 – Winston Weinberg (Harvey), Dr. Fei-Fei Li (World Labs), Brendan Foody (Mercor), Dan Hendrycks (Center for AI Safety), Noubar Afeyan (Flagship Pioneering), Brandon McKinzie and Eric Mitchell (OpenAI o3), Isa Fulford (OpenAI), Arvind Jain (Glen), and Dr. Shiv Rao (Abridge). Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil Chapters: 00:00 – Episode Introduction 0:21 – Winston Weinberg on Leaning into New Capabilities 02:01 – Dr. Fei-Fei Li on Spatial Intelligence 04:13 – Brendan Foody on AI Disruption in the Workforce 06:10 – Dan Hendrycks on the Geopolitics of Superintelligence 08:06 – Noubar Afeyan on Entrepreneurship 10:38 – Brandon McKinzie and Eric Mitchell on Reasoning Models 12:41 – Isa Fulford on Training Deep Research 13:49 – Arvind Jain on Innovating Enterprise Search 16:21 – Dr. Shiv Rao on AI's Human Impact 18:58 – Conclusion

ai entrepreneurship conclusion leaning geopolitics elad fei fei li elad gil eric mitchell no priors

EP#39【有活力的年轻人系列】美元基金XIAOXIAOFUND 创始人晓晗：希望你内在生出勇气

Lead Her Way

Play Episode Listen Later Oct 4, 2025 108:37

听众朋友们，如果你有好奇心和耐心听“lead her way有活力的年轻人系列” 我最想和大家共勉的有两点，人天生慕强，也许你想听已经成功的人讲讲成功的公式，但是成功是没有公式的，纽约客的主编曾经说过，他写人物的时候，最感兴趣的并不是成功的故事，而是这个人是如何”becoming” who he/she is today. 这个系列里的年轻人，让我看到了becoming 的可能性。第二，为什么会采访在ai 大潮里弄潮儿的95后，00后？是因为很多我们这些在职场拼搏多年的职场人，因为经验形成的观念和看问题的角度，都会在这个新时代受到挑战，很多东西我们觉得没道理啊，但是我想这个时代最重要的能力，其实正是对一切保持好奇心，拥有开放的心态。本期嘉宾：· 晓晗：美元基金 XIAO XIAO FUND 的Solo GP。95后中文系毕业生，曾在硬科技投资领域深耕六年，30岁时华丽转身，创立专注于投资全球AI领域“小黑马”的美元基金。· Lysa（返场嘉宾）：千原传媒创始人，帮助中国科技及AI公司在海外增长品牌与用户。(Lysa 专访请听上期节目EP #38)收听指南 & 精彩亮点：1. 【04:01】30岁的“人生主线”觉醒o 晓晗如何在30岁节点，从“结婚生子”的社会剧本中挣脱，梳理出“自我成长，自由创造”的人生主线，并毅然决定创立自己的基金。2. 【09:15】为什么是Solo VC + AI“小黑马”？o 洞察到AI降低了创新门槛，年轻创业者需要更灵活、小规模的基金产品。o “小黑马”定义：区别于资源雄厚的“大白马”，他们是年轻、有强烈创造欲和动手能力、通过“野路子”摸索增长的创业者，是“五彩斑斓的黑”。3. 【15:25】一个人如何运营一支基金？o 面对“你凭什么？”的质疑，她如何通过外包中后台职能、利用AI工具，构建一个高效运转的“一人组织”，实践她所信仰的“小团队具备大能量”。4. 【44:28】在AI的混沌中，如何做判断？o 在鱼龙混杂的AI热潮中，她更看重创始人“如何讲述”（How）而非“讲述什么”（What），投资本质是“基于感觉，愿不愿意相信”。5. 【51:41】文科生在AI投资中的优势o 作为中文系背景的投资人，她认为早期投资核心是“识人”，文科生的敏感度、共情力和对人性细微差别的洞察，在技术工具化的未来愈发珍贵。6. 【58:18】给所有人的AI时代生存指南o 核心心智：拥抱混沌，保持开放，对抗僵化与经验主义。o 行动建议：梳理工作中重复性流程，主动寻找AI工具进行自我提效，这是应对未来的第一步。7. 【01:25:59】女性叙事：重构成功与幸福o 应对生育焦虑：晓晗分享了“冻卵”如何为她带来巨大的心理自由，让她能更从容地规划事业与人生。o 重新定义成功：成功不在于外在标配，而在于内心的“自洽”——所做之事与本性相符，不拧巴，不内耗。节目中提到的资源：· 晓晗的内容阵地：o 公众号：小XIAO说o YouTube：@XIAOXIAOTalks· AI学习资源推荐：o 英文播客（前沿洞察）： No Priors, a16z Podcast, YC Podcast, Uncapped with Altman; OpenAI, DeepMind, Anthropic官方频道。o 中文内容（趋势解读）：§ 海外独角兽（公众号）：硅谷视角，讲大趋势，适合入门。§ 张小珺JUN（公众号）：商业漫谈，跟踪AI人物。§ 葬AI（公众号）：文风犀利，提供批判性视角。· 晓晗的精神食粮：o 人生信条： “Live boldly, push yourself, don't settle.” — 来自电影《遇见你之前》o 近期在听：歌曲《凡人诀》（陈楚生版）o 近期在读：书籍《缱绻与决绝》o 崇拜的女性：王菲（因其自洽、不受规训的人生态度）主播寄语：这期节目不仅关于AI投资，更是一个关于“成为自己”的故事。晓晗的经历告诉我们，真正的“Lead Her Way”，是先勇敢地“Find Her Own Way”。在这个瞬息万变的时代，清晰的自我认知、在混沌中保持开放的能力，远比任何具体的知识更能帮助我们应对不确定性。希望她的故事能让你内在生出勇气，去探索属于自己的道路。Lysa 的播客：出海合伙人（小宇宙）

live ai openai anthropic altman deepmind xiao uncapped no priors

Man w No Priors Kills Wife, Lives w Rotting Corpse, Quips "Go Big or Go Home" to Cops| Crime Alert 7PM 07.22.2025

Crime Alert with Nancy Grace

Play Episode Listen Later Jul 22, 2025 7:04 Transcription Available

A Texas man with no criminal history admits to killing his wife and living with her rotting corpse. When confronted by cops, he chillingly says, "Go big or go home." A monster is behind bars for savagely murdering a 5yo boy with Autism & then discarding his remains in a dumpster like trash. Plus, a mom feeling blue, and the heat, after ditching her tot in a hot car to see the "Smurfs" movie. Jennifer GouldSee omnystudio.com/listener for privacy information.

texas wife crime cops autism kills go home corpse smurfs go big rotting quips no priors

Teaching AI to Understand the Physical World, with Dr. Fei-Fei Li of World Labs

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Jun 5, 2025 35:53

In this episode of No Priors, Sarah and Elad are joined by Dr. Fei-Fei Li, AI pioneer, co-director of Stanford's Human-Centered AI Institute, and founder of World Labs. Fei-Fei shares why she's building at the intersection of embodiment and intelligence, and what today's AI systems are still missing. From the early days of ImageNet to her vision for the next generation of robotics, she unpacks the human and technical motivations behind World Labs. They also discuss the challenges of 3D world modeling, her approach to building exceptional teams, and the special qualities that have led her students like Andrej Karpathy to make major breakthroughs. Show Notes: 0:00 Why and what Dr. Fei-Fei Li is building 3:00 World models at World Labs 6:44 Missing gaps in the AI future 9:16 Robotics and physical intelligence 16:15 Greatest challenges of 3D 19:08 Fei-Fei's work in PhD in ImageNet 23:05 Special moments in Dr. Li's career 29:33 Building teams 32:05 Human-centered AI

world ai building phd teaching missing 3d human stanford li robotics labs greatest elad physical world andrej karpathy fei fei li imagenet fei fei no priors

AI Consolidation, Biotech Opportunities, and World Models with Sarah and Elad

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later May 29, 2025 25:13

In this episode of No Priors, Sarah and Elad unpack the current state of the AI market - whether it's consolidating, what's enabling or blocking key mergers, and where the most promising untapped opportunities lie, particularly in biotech. They also explore the rise of world models and how AI's novel methods for understanding complex systems may ultimately reshape how humans approach discovery and problem-solving. Show Notes: 0:00 Is the AI market consolidating into clear winners? + the physics of the current landscape 7:01 Why more companies don't merge (even when it makes sense) 10:09 Exploring biotech's biggest commercial opportunities and the challenges founders face 17:14 Building world models 21:34 How AI is expanding the way humans reason, design, and evolve systems

ai building opportunities exploring models biotech consolidation elad no priors

AI is Making Enterprise Search Relevant, with Arvind Jain of Glean

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later May 15, 2025 31:34

Arvind Jain joins Sarah and Elad on this episode of No Priors. Arvind is the founder and CEO of Glean, an AI-powered enterprise search platform. He previously co-founded Rubrik and spent over a decade as an engineering leader at Google. In this episode, Arvind shares how LLMs are transforming enterprise search, why most tools in the space have failed, and the opportunity to build apps powered by internal knowledge. He discusses how much customization is still needed on top of foundation models, what made building Glean uniquely challenging compared to Arvind's previous ventures, and what's next for the company. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @jainarvind Show Notes: 0:00 Introduction 0:58 How LLMs are changing search 2:05 Building out Glean's platform 5:09 Why most search companies failed 8:41 Out of the box vs. bespoke models 10:26 Creating apps on top of internal knowledge 15:34 User behaviors & insights 19:11 Unique challenges of building Glean 21:51 Product-led growth vs. enterprise sales 25:00 Succeeding in traditionally bad markets 27:08 What Glean is excited to build next

ceo ai google building unique product relevant user succeeding jain rubrik glean arvind elad enterprise search no priors

Gaming as the Future of Education with Duolingo CEO Luis von Ahn

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later May 8, 2025 32:14

On this episode of No Priors, Sarah talks to Luis von Ahn, founder and CEO of Duolingo, the world's most popular education app with over 116 million monthly users and a market cap of approximately $17 billion. Controversially, it has recently committed to being “AI-first.” They discuss why motivation is the biggest challenge in education, how Duolingo harnesses game mechanics and behavioral insights to keep learners engaged, and the company's efforts to leverage AI to personalize education at scale. Luis also shares thoughts on the Duolingo brand, courses beyond language (chess and math), and the broader impact of AI on content creation. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @LuisvonAhn Links: Duolingo is now AI-First: http://bit.ly/3RQzny3 Show Notes: 0:00 Introduction 4:01 Optimizing learning behavior through tech 11:20 Adopting AI at Duolingo 17:25 AI's threat to content companies 18:34 An unhinged corporate brand 21:28 How do people learn? 25:16 What people misunderstand about Duolingo? 26:24 How AI is transforming learning at scale 30:28 Leveraging AI across the business

ceo ai gaming optimizing duolingo leveraging ai future of education ahn controversially luis von ahn no priors

O3 and the Next Leap in Reasoning with OpenAI's Eric Mitchell and Brandon McKinzie

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later May 1, 2025 39:13

This week on No Priors, Elad and Sarah sit down with Eric Mitchell and Brandon McKinzie, two of the minds behind OpenAI's O3 model. They discuss what makes O3 unique, including its focus on reasoning, the role of reinforcement learning, and how tool use enables more powerful interactions. The conversation explores the unification of model capabilities, what the next generation of human-AI interfaces could look like, and how models will continue to advance in the years ahead. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @mckbrando | @ericmitchellai Show Notes: 0:00 What is o3? 3:21 Reinforcement learning in o3 4:44 Unification of models 8:56 Why tool use helps test time scaling 11:10 Deep research 16:00 Future ways to interact with models 22:03 General purpose vs specialized models 25:30 Simulating AI interacting with the world 29:36 How will models advance?

ai future deep leap openai reasoning unification reinforcement elad o3 mckinzie eric mitchell no priors

Inside Deep Research with Isa Fulford: Building the Future of AI Agents

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Apr 24, 2025 30:45

On this episode of No Priors, Sarah sits down with Isa Fulford, one of the masterminds behind deep research. They unpack how the initiative began, the role of human expert data, and what it takes to build agents with real-world capability and even taste. Isa shares the differences between deep research and OpenAI's o3 model, the challenges around latency, and how she sees agent capabilities evolving. Plus, OpenAI has announced that deep research is free for all US users starting today. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @IsaFulf Show Notes: 0:00 Deep research's inception & evolution 6:12 Data creation 7:20 Reinforcement fine-tuning 9:05 Why human expert data matters 11:23 Failure modes of agents 13:55 The roadmap ahead for Deep Research 18:32 How do agents develop taste? 19:29 Experience and path to building a broadly capable agent 22:03 Deep research vs. o3 25:55 Latency 27:56 Predictions for agent capabilities

deep research failure data predictions openai future of ai building the future reinforcement latency fulford no priors

Predicting the Earth with Josh Goldman: How KoBold Uses AI to Find Critical Minerals

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Apr 17, 2025 40:53

This week on No Priors, Sarah and Elad are joined by Josh Goldman, cofounder and president of KoBold Metals. KoBold is using AI to transform how we discover critical minerals like lithium and cobalt, making the exploration process faster, more precise, and more scalable than traditional methods. In this episode, Josh explains how KoBold is rethinking the fundamentals of mineral exploration by combining unique datasets, scientific modeling, and predictive algorithms. They dive into the company's driving philosophy and technical approach, how they validate underground hypotheses, and why regulatory knowledge and a localized approach are crucial. Josh also discusses what success looks like in exploration today and the scarcity of world-class deposits. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @KoBold_Metals Show Notes: 0:00 Introduction 0:29 KoBold Metals 3:14 Using unique datasets 6:20 Traditional methods of lithium exploration 8:38 Regulatory vs. rarity constraints 13:40 Technical approach 16:25 Validating hypotheses 23:56 Redefining success in mineral exploration 25:44 Scarcity of good projects and deposits 32:44 Philosophy behind prediction 36:46 KoBold's origin story

ai earth philosophy traditional redefining technical predicting scarcity regulatory validating kobold critical minerals elad josh goldman no priors

From Job Displacement to AI Trainers, Brendan Foody on Work in the AI Age

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Apr 10, 2025 41:52

On this episode of No Priors, Sarah and Elad sit down with Brendan Foody, CEO and cofounder of Mercor, to discuss the company's rapid growth and their vision for the future of the labor market. They dive into how AI is reshaping the workforce in real, tangible ways and what skills are worth investing in today. Brendan shares insights on evaluating talent in an AI-driven world, including how models might identify outlier or 10x candidates and even assess “taste.” The conversation also touches on the evolving role of human data, the future of hiring in fast-scaling startups, and whether AI will act as an individual contributor or a data-centric manager. Show Notes: 0:00 Introduction 0:16 Building Mercor 3:00 Identifying outlier talent with AI 9:07 How AI is reshaping the workforce: job displacement & evolution 11:18 What skills should we invest in now? 12:18 Verifiability 13:36 Evaluating models 16:07 What should kids learn today? 17:05 Evaluating taste in talent assessments 18:45 Future of data collection 26:07 Humans' role in the AI economy 28:53 AI as a contributor vs. a manager 33:03 Mercor's goals 34:50 Evolution of labor markets 36:00 Hiring advice

ceo ai future evolution humans hiring identifying trainers evaluating displacement elad ai age foody no priors

Public Markets, Image Gen, and Specialized Models, with Sarah and Elad

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Apr 3, 2025 27:44

In this episode of No Priors, Sarah and Elad examine the current state of AI. They break down the recent dip in public markets, how tariffs could impact the tech industry, and where opportunities remain in large language models. They highlight the opportunities in more specialized models, new approaches to model development, and how the market is beginning to standardize with integrations like the Model Context Protocol (MCP). The episode ends with a look at early consumer AI applications and what types of expertise will matter most in the coming years. Show Notes: 0:00 Improvements in image gen 4:42 Public markets 8:08 Effects of tariffs on tech 9:42 Today's large model market 11:34 Opportunities in specialized models 16:30 Research advances in model approaches 21:10 What expertise will matter? 24:30 Anthropic's Model Context Protocol 26:30 Consumer applications

ai opportunities research public effects consumer models improvements anthropic specialized elad public markets no priors

Conversations Are the Source of Truth in Healthcare with Abridge CEO Shiv Rao

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Mar 27, 2025 38:42

In this episode of No Priors, Elad and Sarah chat with Shiv Rao, MD, founder and CEO of Abridge. They dive into how Abridge is reshaping healthcare by creating AI tools that enhance clinical documentation and improve doctor-patient interactions. Shiv shares his thoughts on building trust with established healthcare systems, giving agency and time back to clinicians, and what makes the healthcare AI opportunity different today. They also discuss Abridge's approach to developing and launching AI products, along with Shiv's journey in founding Abridge. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @ShivdevRao Show Notes: 0:00 Introduction 0:35 Abridge's Story and Vision 5:30 Strategy for Customer Choice 7:41 Healthcare AI Opportunities 11:24 Navigating Incumbent Partnerships 14:26 Doctor-Centric AI Solutions 19:54 Abridge's Future Plans 22:13 AI's Impact on Healthcare 28:43 Shipping and Iterating Products 32:50 Shiv's Journey to Abridge

ceo ai conversations strategy vision story impact healthcare md shipping future plans shiv elad no priors

The Robotics Revolution, with Physical Intelligence's Cofounder Chelsea Finn

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Mar 20, 2025 35:14

This week on No Priors, Elad speaks with Chelsea Finn, cofounder of Physical Intelligence and currently Associate Professor at Stanford, leading the Intelligence through Learning and Interaction Lab. They dive into how robots learn, the challenges of training AI models for the physical world, and the importance of diverse data in reaching generalizable intelligence. Chelsea explains the evolving landscape of open-source vs. closed-source robotics and where AI models are likely to have the biggest impact first. They also compare the development of robotics to self-driving cars, explore the future of humanoid and non-humanoid robots, and discuss what's still missing for AI to function effectively in the real world. If you're curious about the next phase of AI beyond the digital space, this episode is a must-listen. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @ChelseaFinn Show Notes: 0:00 Introduction 0:31 Chelsea's background in robotics 3:10 Physical Intelligence 5:13 Defining their approach and model architecture 7:39 Reaching generalizability and diversifying robot data 9:46 Open source vs. closed source 12:32 Where will PI's models integrate first? 14:34 Humanoid as a form factor 16:28 Embodied intelligence 17:36 Key turning points in robotics progress 20:05 Hierarchical interactive robot and decision-making 22:21 Choosing data inputs 26:25 Self driving vs robotics market 28:37 Advice to robotics founders 29:24 Observational data and data generation 31:57 Future robotic forms

learning ai future advice revolution open defining stanford reaching associate professor intelligence pi robotics embodied humanoid elad observational hierarchical chelsea finn no priors

Copilot, Agent Mode, and the New World of Dev Tools with GitHub's CEO Thomas Dohmke

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Mar 13, 2025 50:34

This week on No Priors, Sarah and Elad talk with GitHub CEO Thomas Dohmke about the rise of AI-powered software development and the success of Copilot. They discuss how Copilot is reshaping the developer workflow, GitHub's new Agent Mode, and competition in the developer tooling market. They also explore how AI-driven coding impacts software pricing, the future of open source vs. proprietary APIs, and what Copilot's success means for Microsoft. Plus, Thomas shares insights from his journey growing up in East Berlin and navigating rapidly changing worlds. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @ThomasDohmke Show Notes: 0:00 Introduction 0:37 GitHub Copilot's capabilities 4:12 Will agents replace developers? 6:04 Copilot's development cycle 8:34 Winning the developer market 10:40 Agent mode 13:25 Where GitHub is headed 16:45 Building for the new challenges of AI 21:50 Dev tools market formation 29:56 Copilot's broader impact 32:17 How AI changes software pricing 39:16 Open source vs. proprietary APIs 48:01 Growing up in East Berlin

ai building winning microsoft open agent new world github apis dev copilot github copilot elad east berlin devtools no priors

National Security Strategy and AI Evals on the Eve of Superintelligence with Dan Hendrycks

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Mar 5, 2025 36:24

This week on No Priors, Sarah is joined by Dan Hendrycks, director of the Center of AI Safety. Dan serves as an advisor to xAI and Scale AI. He is a longtime AI researcher, publisher of interesting AI evals such as "Humanity's Last Exam," and co-author of a new paper on National Security "Superintelligence Strategy" along with Scale founder-CEO Alex Wang and former Google CEO Eric Schmidt. They explore AI safety, geopolitical implications, the potential weaponization of AI, along with policy recommendations. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @DanHendrycks Show Notes: 0:00 Introduction 0:36 Dan's path to focusing on AI Safety 1:25 Safety efforts in large labs 3:12 Distinguishing alignment and safety 4:48 AI's impact on national security 9:59 How might AI be weaponized? 14:43 Immigration policies for AI talent 17:50 Mutually assured AI malfunction 22:54 Policy suggestions for current administration 25:34 Compute security 30:37 Current state of evals

ai safety current humanity policy scale immigration distinguishing eric schmidt xai compute mutually superintelligence national security strategy scale ai alex wang no priors

Building the Platform for Scientific Breakthrough, with Noubar Afeyan of Moderna and Flagship Pioneering

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Feb 27, 2025 40:32

This week on No Priors, Sarah sits down with Noubar Afeyan, Co-founder and CEO of Flagship Pioneering, the biotech firm behind groundbreaking companies like Moderna. They explore how Flagship creates the conditions for scientific breakthroughs, tackles regulatory uncertainty, and pushes the boundaries of discovery. Noubar shares insights on AI's role in healthcare, the challenges of bringing new therapies to market, and lessons learned from past pandemics. He also discusses Flagship's platform approach to biotech innovation and introduces the idea of polyintelligence. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @NoubarAfeyan Show Notes: 0:00 Introduction 0:48 Founding Flagship 5:51 Fostering environments for emergence 11:17 Expanding into new frontiers 14:26 Developing technology amid regulatory uncertainty and risk 19:12 How Flagship has evolved 22:47 AI applications in healthcare 27:30 Bottlenecks in bringing new therapies to market 32:20 Lessons for the next pandemic 34:11 Building a platform 38:10 Polyintelligence

ceo ai lessons building developing platform breakthrough expanding scientific fostering moderna pioneering flagship bottlenecks no priors

Virtual Cell Models, Tahoe-100 and Data for AI-in-Bio with Vevo Therapeutics and the Arc Institute

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Feb 25, 2025 57:40

On this week's episode of No Priors, Sarah Guo is joined by leading members of the teams at Vevo Therapeutics and the Arc Institute – Nima Alidoust, CEO/Co-Founder at Vevo Therapeutics; Johnny Yu, CSO/Co-Founder at Vevo Therapeutics; Patrick Hsu, CEO/Co-Founder at Arc Institute; Dave Burke, CTO at Arc Institute; and Hani Goodarzi, Core Investigator at Arc Institute. Predicting protein structure (AlphaFold 3, Chai-1, Evo 2) was a big AI/biology breakthrough. The next big leap is modeling entire human cells—how they behave in disease, or how they respond to new therapeutics. The same way LLMs needed enormous text corpora to become truly powerful, Virtual Cell Models need massive, high-quality cellular datasets to train on. In this episode, the teams discuss the groundbreaking release of the Tahoe-100M single cell dataset, Arc Atlas, and how these advancements could transform drug discovery. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @Nalidoust | @IAmJohnnyYu | @PDHsh | @Davey_Burke | @Genophoria Download the Tahoe Dataset Show Notes: 0:00 Introduction 1:40 Significance of Tahoe-100M dataset 4:22 Where we are with virtual cell models and protein language models 10:26 Significance of perturbational data 17:39 Challenges and innovations in data collection 24:42 Open sourcing and community collaboration 33:51 Predictive ability and importance of virtual cell models 35:27 Drug discovery and virtual cell models 44:27 Platform vs. single hypothesis companies 46:05 Rise of Chinese biotechs 51:36 AI in drug discovery

ai challenges co founders chinese data open institute virtual platform drug significance models cto predicting evo chai predictive tahoe therapeutics alphafold vevo dave burke no priors

Building Hard Tech in Hard Markets: Kyle Vogt on Cruise, Twitch, and The Bot Company

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Feb 20, 2025 38:14

Kyle Vogt joins Sarah and Elad on this week's episode of No Priors. A serial entrepreneur, Kyle co-founded Twitch, transforming live streaming, and later Cruise, the autonomous vehicle company acquired by GM for $1 billion. Now he's taking on AI-powered home robotics with The Bot Company. In this episode, Kyle shares his journey building transformative tech companies, the challenges of scaling autonomous systems, and why he believes home robots are the next frontier. They also discuss the parallels between AVs and robotics, overcoming consumer skepticism, US vs. China manufacturing, and the policies needed to foster a competitive robotics industry. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @KVogt Show Notes: 0:00 Introduction 0:29 Founding Cruise 3:12 Tesla vs. Waymo approach 4:44 Scaling autonomous vehicles 10:03 The Bot Company 16:35 Deploying robots in the home 17:56 Parallels between robots and AV markets 20:51 Personifying robots and overcoming consumer skepticism 25:00 Timeline on consumer robots 26:47 Chinese vs. US manufacturing 29:15 Fostering a competitive domestic robotics industry 34:00 Lessons from Cruise & personal philosophies

ai china lessons tech chinese twitch tesla gm scaling markets timeline cruise bots fostering av parallels waymo deploying avs elad kyle vogt no priors

How Harvey AI is Changing the Legal Industry with Winston Weinberg

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Feb 14, 2025 49:35

This week on No Priors, Sarah sits down with Harvey Co-Founder and CEO Winston Weinberg. Harvey is one of the leading application layer AI companies. Harvey is building domain-specific AI for law firms, professional service providers, and the Fortune 500. They are already working with companies like Bridgewater, KKR, PWC, and O'Melveny with over $500M in funding from OpenAI, Sequoia, Kleiner, GV and Elad and Sarah. In this episode, Sarah and Winston cover everything from approaching customers with AI solutions, expanding across domains, and building to volume capability improvements. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @WinstonWeinberg Show Notes: 0:00 Introduction 2:39 Harvey's founding story 3:46 Capability improvement 6:39 Building teams around AI capabilities 9:17 End to end task ahead 12:37 Beginning with huge industry clients - 17:21 Working with users who are skeptical of automation 20:40 Being a lawyer today and in the future 26:02 Applying learnings and adapting product for other domains 26:58 Hiring 30:39 Lessons and mistakes as a founder 32:53 Winston's personal drive 40:21 Advice to other founders finding their idea 44:35 Prediction for next ChatGPT moment

ai lessons advice building predictions fortune chatgpt openai pwc 500m sequoia kleiner capability bridgewater weinberg kkr gv elad legalindustry no priors

Rick Caruso on LA's Wildfires, Policy Failures, and the Path Forward

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Jan 30, 2025 27:11

This week on No Priors, Elad sits down with Rick Caruso, LA real estate developer and runner-up in the 2022 mayoral race. With experience serving under three LA mayors, as well as on the police commission and the board of water and power, Rick offers a unique perspective on the systemic failures that contributed to the devastation of the January 2025 wildfires in communities like the Palisades and Altadena. He discusses the steps he took to build more resilient infrastructure in his properties and how California can rebuild smarter to better prepare for future disasters. They also explore the state's water management, rising crime, and how to leverage California's vast natural resources and budget to create a better future for all residents. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @RickCarusoLA Show Notes 0:00 Introduction 0:56 Caruso's history in business and public service 3:36 Failures in fire prevention and response 5:58 How Caruso's properties survived 8:26 Water shortages and infrastructure failures 9:47 Arson, looting, and crime in LA 15:03 Rebuilding 20:50 Allocating California's resources effectively 26:15 Caruso's future plans

california water policy rebuilding failures wildfires path forward arson caruso palisades altadena elad rick caruso no priors

What Can We Do About Wildfires? With Convective Capital's Bill Clerico

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Jan 23, 2025 37:59

This week on No Priors, Sarah and Elad sit down with Bill Clerico, founder of Convective Capital, an early stage venture fund focused on technology-driven solutions for wildfire mitigation and climate resilience. The wildfires in Los Angeles have caused unprecedented property damage and immense hardship for countless individuals and families. This episode is devoted to diving into understanding what happened and what we can do in the future. Bill shares his insights into the increasing severity of wildfires, the role of policy, and how infrastructure issues, like outdated building codes and underfunded utilities, are contributing to the crisis. They discuss the latest innovations in fire-fighting technology, from advanced detection to drones, and how these tools can help mitigate future damage. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @BillClerico Show Notes: 0:00 Introduction 1:02 Why are wildfires getting worse? 5:37 Policies and regulatory decisions 10:47 Housing: building codes and permitting 13:19 Key factors in response 16:20 Improving water supply and city infrastructure 19:10 Preventing wildfires 21:26 Underinvestment in California's utilities 26:53 Innovative fire fighting technology 29:35 Accelerating Los Angeles' recovery 34:29 Actions homeowners, insurance companies, and governments can take

california los angeles capital improving housing innovative preventing policies wildfires elad convective no priors bill clerico

How AI Agents Are Transforming Customer Support, with Decagon's Jesse Zhang

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Jan 16, 2025 30:09

Today on No Priors, co-founder and CEO of Decagon, Jesse Zhang, joins Elad to discuss the future of agentic customer support. Decagon provides AI-powered customer interactions for companies like Rippling, Notion, Duolingo, Classpass, Substack, Vanta, Eventbrite, and more. Jesse shares the thesis behind starting Decagon, why he sees customer support as the ideal entry point for agentic technology, and what areas of AI excite him most. They also discuss voice-based interfaces, issues with latency in current capabilities, and the connection between young math olympiad communities and today's AI startups. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @TheJesseZhang Show Notes: 0:00 Introduction 0:30 Starting Decagon 3:15 Business impact of adopting agents for customer support and customer ops 8:00 AI infrastructure and models for customer success agents 12:05 Voice-based capabilities and text-to-speech engines 15:00 Combatting latency 16:25 Crossover of math and AI communities 21:12 Exciting areas of AI 25:29 Strengths and weaknesses of agents

ceo ai business voice transforming exciting substack crossover strengths notion zhang combatting duolingo eventbrite customer support classpass elad rippling vanta no priors

Erik Bernhardsson on Creating Tools That Make AI Feel Effortless

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Jan 9, 2025 23:36

Today on No Priors, Elad chats with Erik Bernhardsson, founder and CEO of Modal Labs, a platform simplifying ML workflows by providing a serverless infrastructure designed to streamline deployment, scaling, and development for AI engineers. Erik talks about his early work on Spotify's ML algorithms, what Modal offers today, and his vision for building an end-to-end solution for AI engineers. They dive into GPU trends, cloud vs on-premise setups, and when to train custom models vs use off-the-shelf solutions. Erik also shares his thoughts on the evolving role of AI in fields like coding, physics, and music. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Bernhardsson Show Notes: 0:00 Introduction 0:22 Erik's early interest in ML infra 1:22 Founding Modal Labs 4:17 State of GPU use today and what's to come 7:14 Modal's end-to-end vision 9:00 Differentiating amongst competition 10:20 Cloud vs on-premise 12:35 Popular AI models 13:20 Gaps in AI infrastructure 14:55 Insights on vector databases 16:48 Training models vs off-the-shelf models 17:47 AI's impact on coding and physics 22:14 AI's impact on music

ceo spotify ai training state tools cloud gaps ml differentiating gpu effortless modal elad no priors

The Best of 2024 with Sarah Guo and Elad Gil

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Dec 26, 2024 27:07

2024 has been a year of transformative technological progress, marked by conversations that have reshaped our understanding of AI's evolution and what lies ahead. Throughout the year, Sarah and Elad have had the privilege of speaking with some of the brightest minds in the field. As we look back on the past months, we're excited to share highlights from some of our favorite No Priors podcast episodes. Featured guests include Jensen Huang (Nvidia), Andrej Karpathy (OpenAI, Tesla), Bret Taylor (Sierra), Aditya Ramesh, Tim Brooks, and Bill Peebles (OpenAI's Sora Team), Dmitri Dolgov (Waymo), Dylan Field (Figma), and Alexandr Wang (Scale). Want to dive deeper? Listen to the full episodes here: NVIDIA's Jensen Huang on AI Chip Design, Scaling Data Centers, and his 10-Year Bet No Priors Ep. 89 | With NVIDIA CEO Jensen Huang The Road to Autonomous Intelligence, With Andrej Karpathy from OpenAI and Tesla No Priors Ep. 80 | With Andrej Karpathy from OpenAI and Tesla Transforming Customer Service through Company Agents, with Sierra's Bret Taylor No Priors Ep. 82 | With CEO of Sierra Bret Taylor OpenAI's Sora team thinks we've only seen the "GPT-1 of video models" No Priors Ep.61 | OpenAI's Sora Leaders Aditya Ramesh, Tim Brooks and Bill Peebles Waymo's Journey to Full Autonomy: AI Breakthroughs, Safety, and Scaling No Priors Ep. 87 | With Co-CEO of Waymo Dmitri Dolgov Designing the Future: Dylan Field on AI, Collaboration, and Independence No Priors Ep. 55 | With Figma CEO Dylan Field The Data Foundry for AI with Alexandr Wang from Scale No Priors Ep. 65 | With Scale AI CEO Alexandr Wang Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil Timecodes: 0:00 Introduction 0:15 Jensen Huang on building at data-center scale 4:00 Andrej Karpathy on the AI exo-cortex, model control, and a shift to smaller models 7:14 Bret Taylor on the agentic future of business interactions 11:17 OpenAI's Sora team on visual models and their role in AGI 15:53 Waymo's Dmitri Dolgov on bridging the gap to full autonomy and the challenge of 100% accuracy 19:00 Figma's Dylan Field on the future of interfaces and new modalities 23:29 Scale AI's Alexandr Wang on the journey to AGI 26:29 Outro

ai safety tesla collaboration openai nvidia gpt sora agi waymo figma jensen huang elad scale ai andrej karpathy elad gil dylan field tim brooks no priors

2024 in AI Startups [LS Live @ NeurIPS]

Latent Space: The AI Engineer Podcast â€” CodeGen, Agents, Computer Vision, Data Science, AI UX and all things Software 3.0

Play Episode Listen Later Dec 21, 2024 52:23

Happy holidays! We'll be sharing snippets from Latent Space LIVE! through the break bringing you the best of 2024 from friends of the pod!For NeurIPS last year we did our standard conference podcast coverage interviewing selected papers (that we have now also done for ICLR and ICML), however we felt that we could be doing more to help AI Engineers 1) get more industry-relevant content, and 2) recap 2024 year in review from experts. As a result, we organized the first Latent Space LIVE!, our first in person miniconference, at NeurIPS 2024 in Vancouver. For our opening keynote, we could think of no one better to cover 'The State of AI Startups' than our friend Sarah Guo (AI superinvestor, founder of Conviction, host of No Priors!) and Pranav Reddy (Conviction partner) to share their takes on how the AI landscape evolved in 2024 examine the evolving AI landscape and what it means for startups, enterprises, and the industry as a whole! They completely understood the assignment.Recorded live with 200+ in-person and 2200+ online attendees at NeurIPS 2024, this keynote kicks off our mini-conference series exploring different domains of AI development in 2024. Enjoy!LinksSlides: https://x.com/saranormous/status/1866933642401886707Sarh Guo: https://x.com/saranormousPranav Reddy: https://x.com/prnvrdyFull Video on YouTubeWant more content like this? Like and subscribe to stay updated on our latest talks, interviews, and podcasts. Get full access to Latent Space at www.latent.space/subscribe

ai startups vancouver conviction year in review thestate neurips icml iclr latent space no priors

Revolutionizing Customer Success with Agency's Elias Torres

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Dec 19, 2024 31:18

Today on No Priors, Sarah sits down with Elias Torres, CEO and founder of Agency, an AI agent for customer success teams. Elias shares his journey from growing up in Nicaragua to founding several companies, leading engineering at HubSpot, and selling Drift for $1B. He also discusses his work consulting with OpenAI in 2022, which deepened his understanding of the business opportunity LLMs presented and inspired him to start Agency. In this episode, Elias offers a unique perspective on the future of AI and customer success, explaining how current software has fallen short and his vision for a new generation where customers have direct relationships, spend less time on tasks, and have software working invisibly on their behalf. They also discuss the evolving landscape of hiring as teams shrink, and Elias reveals his ambitious plan to reach $1B in revenue with fewer than 100 employees. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @EliasT Show notes: 0:00 Introduction 0:34 Elias' journey to entrepreneurship 2:36 Growing HubSpot to IPO and founding Drift 6:19 Consulting with OpenAI, learning about LLMs, and diving into AI 9:35 Founding Agency to focus on customer success and AI-driven solutions 11:40 What will a customer experience look like in 5 years? 15:48 Company building in an era of AI, as a 5th time founder 18:32 Reducing headcount while raising the bar on hiring 20:35 Key challenges in building Agency and crafting a standout product 23:06 Addressing software flaws and transitioning to an era of intuitive, self-operating solutions 26:27 Timeline for the next-gen software revolution + the power of building from first principles

ceo ai addressing agency consulting timeline reducing torres ipo openai nicaragua revolutionizing drift 1b hubspot customer success no priors

How Diamond Cooling Could Power the Future of AI, with Akash Systems

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Dec 12, 2024 42:21

In this episode of No Priors, Sarah sits down with Felix Ejeckam and Ty Mitchell, founders of Akash Systems, a company pioneering diamond-based cooling technology for semiconductors used in space applications and large-scale AI data centers. Felix and Ty discuss how their backgrounds in materials science led them to tackle one of the most pressing challenges in tech today: thermal efficiency and heat management at scale. They explore how Akash is overcoming the limitations of traditional semiconductors and how their innovations could significantly boost AI performance. Felix and Ty also talk about their collaboration with India's sovereign cloud provider, the importance of strengthening U.S. manufacturing in the AI chip market, and the role Akash Systems could play in advancing satellite technologies. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil |@AkashSystems | @FelixEjeckam Show Notes: 0:00 Introduction 0:30 What is Akash Systems? 2:12 Felix's personal path to building Akash Systems 4:45 Ty's approach to acquiring customers 6:40 Challenges of operating in space 7:54 Live demo on diamond's conductivity 9:50 Heat issues in data centers 15:38 Heat as a fundamental limit to technological progress 20:44 Akash's role in the semiconductor market 22:54 Growing diamonds 25:10 Collaborating with India's sovereign cloud provider 28:15 Importance of American manufacturing for AI chips and outlook on current data capacity 29:45 The Chips Act 31:22 Future of national security lies in satellite and radar tech 32:46 Critical issues in the U.S. AI supply chain 36:34 Deep learning's role in material science discovery 40:16 The future: AI expanding our possibilities

american live ai future challenges deep heat diamond collaborating cooling future of ai akash power the future no priors

Bolt's Eric Simons on Enabling Everyone to Generate Websites with AI

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Dec 5, 2024 38:17

In this episode of No Priors, Sarah talks with Eric Simons, co-founder and CEO of StackBlitz. The company has experienced explosive growth since the launch 2 months ago of Bolt.new, an AI application that lets users prompt, run, edit, and deploy full-stack applications directly in the browser. Eric talks about the years-long journey that led to overnight success, why so many non-technical users are forming a community around Bolt, and the democratization of coding. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @EricSimons40 Show Notes: 0:00 Introduction 0:36 Bolt.new 2:04 How Bolt stands out from other coding assistants 3:28 Building beyond ChatGPT wrappers 6:13 Driving growth through community 9:42 Evals 13:29 Eric's favorite use cases and startups leveraging Bolt 17:10 Why engineers are embracing no- code tools 24:32 The years long journey of StackBlitz 31:50 Balancing an Ironman, a newborn, and a product launch 35:18 Predictions for developers and code generation tools

ceo ai building predictions chatgpt driving balancing iron man websites generate enabling bolt eric simons stackblitz no priors

Model Plateaus and Enterprise AI Adoption with Cohere's Aidan Gomez

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Nov 21, 2024 44:15

In this episode of No Priors, Sarah is joined by Aidan Gomez, cofounder and CEO of Cohere. Aidan reflects on his journey to co-authoring the groundbreaking 2017 paper, “Attention is All You Need,” during his internship, and shares his motivations for building Cohere, which delivers AI-powered language models and solutions for businesses. The discussion explores the current state of enterprise AI adoption and Aidan's advice for companies navigating the build vs. buy decision for AI tools. They also examine the drivers behind the flattening of model improvements and discuss where large language models (LLMs) fall short for predictive tasks. The conversation explores what the market has yet to account for in the rapidly evolving AI ecosystem, as well as Aidan's personal perspectives on AGI—what it might look like and when it could arrive. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @AidanGomez Show Notes: 0:00 Introduction 0:36 Co-authoring “Attention is all you need” 2:27 Leaving Google and founding Cohere 4:04 Cohere's mission and models 6:15 Pitfalls of current AI 8:14 How enterprises are deploying AI today 10:58 Build vs. buy strategy for AI tools 14:37 Barriers to enterprise adoption 20:04 Which types of companies should pretrain models? 24:25 Addressing flaws in open-source models 25:12 Current and expected progress in scaling laws 29:54 Advances in multi-step problem solving and reasoning 32:29 Key drivers behind the flattening curve of model improvements 36:25 Exploring AGI 39:59 Limitations of LLMs 42:10 What the market has mispriced

ceo ai model current attention addressing adoption barriers limitations pitfalls gomez advances agi plateaus all you need enterprise ai cohere no priors

AI and the Future of Math, with DeepMind's AlphaProof Team

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Nov 14, 2024 39:21

In this week's episode of No Priors, Sarah and Elad sit down with the Google DeepMind team behind AlphaProof, Laurent Sartran, Rishi Mehta, and Thomas Hubert. AlphaProof is a new reinforcement learning-based system for formal math reasoning that recently reached a silver-medal standard in solving International Mathematical Olympiad problems. They dive deep into AI and its role in solving complex mathematical problems, featuring insights into AlphaProof and its capabilities. They cover its functionality, unique strengths in reasoning, and the challenges it faces as it scales. The conversation also explores the motivations behind AI in math, practical applications, and how verifiability and human input come into play within a reinforcement learning approach. The DeepMind team shares advice and future perspectives on where math and AI are headed. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Rishicomplex | @LaurentSartran | @ThomasHubert Show Notes: 0:00 Personal introductions 2:19 Achieving silver medal in IMO competition 3:52 How AlphaProof works 5:56 AlphaProof's strengths within mathematical reasoning 8:56 Challenges in scaling AlphaProof 13:40 Why solve math? 17:50 Pursuing knowledge versus practical applications 21:30 Insights on verifying correctness within reinforcement learning 28:27 How AI could foster more collaboration among mathematicians 30:28 Surprising insights from AI proof generation 34:17 Future of math and AI: advice for math enthusiasts and researchers

ai future personal challenges math achieving surprising pursuing imo deepmind google deepmind elad international mathematical olympiad no priors

NVIDIA's Jensen Huang on AI Chip Design, Scaling Data Centers, and his 10-Year Bets

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Nov 7, 2024 36:53

In this week's episode of No Priors, Sarah and Elad sit down with Jensen Huang, CEO of NVIDIA, for the second time to reflect on the company's extraordinary growth over the past year. Jensen discusses AI's takeover of datacenters and NVIDIA's rapid development of x.AI's supercluster. The conversation also covers Nvidia's decade-long infrastructure bets, software longevity, and innovations like NVLink. Jensen shares his views on the future of embodied AI, digital employees, and how AI is transforming scientific discovery. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Nvidia Show Notes: 00:00 Introduction 1:22 NVIDIA's 10-year bets 2:28 Outpacing Moore's Law 3:42 Data centers and NVLink 7:16 Infrastructure flexibility for large-scale training and inference 10:40 Building and optimizing data centers 13:30 Maintaining software and architecture compatibility 15:00 X.AI's supercluster 18:55 Challenges of super scaling data centers 20:39 AI's role in chip design 22:23 NVIDIA's market cap surge and company evolution 27:03 Embodied AI 28:33 AI employees 31:25 Impact of AI on science and engineering 35:40 Jensen's personal use of AI tools

ceo ai challenges building design data impact maintaining chip scaling infrastructure nvidia bets data centers jensen huang elad nvlink no priors

Forecasting the Future with Kalshi: America's First Regulated Prediction Market

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Oct 31, 2024 35:36

In this week's episode of No Priors, Sarah sits down with Tarek Mansour, CEO of Kalshi—the first CFTC-regulated prediction market exchange in the U.S. They dive into Kalshi's recent victory to legalize election betting, explore ethical questions around trading on elections, and discuss whether prediction markets can offer more accuracy than traditional polls. Tarek shares insights on the history of futures markets, the line between gambling and financial trading, and the psychology behind betting. Plus, Sarah makes a live election bet, and Tarek reveals some of Kalshi's most intriguing markets. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @MansourTarek Show Notes: 0:00 Introduction 1:22 Sarah makes a live election bet on Kalshi 3:35 Getting approved and regulated by CFTC 5:48 Going up against the CFTC to legalize election betting 7:21 Debating the ethics of trading on elections 8:12 Gambling vs. trading 9:12 Context and purpose of futures markets 12:38 The human psychology behind speculating /Humans conditioned to risk taking 17:17 Building a healthy exchange and scaling liquidity 19:30 Introducing leverage and working with clearinghouses 22:29 Polls vs. prediction markets 24:59 Conditional markets 26:38 What makes Kalshi's markets accurate 31:29 Tarek's insights on the most interesting trades and markets on the platform

america ceo building predictions market humans context gambling polls debating forecasting regulated conditional tarek cftc kalshi no priors

Waymo's Journey to Full Autonomy: AI Breakthroughs, Safety, and Scaling

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Oct 24, 2024 44:30

In this episode of No Priors, Dmitri Dolgov, Co-CEO of Waymo, joins Sarah and Elad to explore the evolution and advancements of Waymo's self-driving technology from its inception at Google to its current real-world deployment. Dmitri also shares insights into the technological breakthroughs and complexities of achieving full autonomy, the design innovations of Waymo's sixth generation driverless cars, and the broader applications of Waymo's advanced technology. They also discuss Waymo's strategic approach to scaling amidst regulation, deployment in cities like Phoenix and San Francisco, and the transformative potential of autonomous driving on car ownership and urban infrastructure. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Dmitri_Dolgov Shownotes: 00:00 Introduction 00:15 History of Self-Driving at Google 00:29 DARPA Challenges and Early Involvement 01:39 Formation of Waymo 01:53 Industry Lineage and Early Skepticism 03:05 Initial Goals and Milestones 4:33 Pivot to Full Autonomy 04:50 Scaling and Deployment 05:29 Generational Breakthroughs 06:59 Choosing Deployment Cities 09:26 Technological Advancements 11:01 Evaluating Safety 14:41 Regulatory Stance and Trust 16:52 Future of Autonomous Driving 23:19 Business Strategy and Partnerships 26:06 Changing Urban Mobility Trends 26:40 Challenges and Misconceptions in Self-Driving Timelines 28:43 The Role of Traditional OEMs in an Autonomous Future 30:54 Designing Cars for Autonomous Ride-Hailing 33:42 Scaling Responsibly 35:18 Generalizability and Future Applications of AI 37:10 The Complexity of Achieving Full Autonomy 42:58 The Importance of Data and Iteration in AI Development 46:13 Reflecting on the Journey and Future of Waymo

Gaming, Nobel Prizes and At-Risk Businesses in the AI Era

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Oct 17, 2024 29:29

In this episode of No Priors, Sarah and Elad explore how AI is transforming consumer apps and entertainment, with a focus on potential integrations in gaming and dating that could shift traditional societal incentives. They reflect on AI researchers winning Nobel Prizes in Science and Chemistry for the first time, discussing what this trend means for scientific discovery. The episode also covers recent AI releases, including their thoughts on OpenAI's O1 model and Google's NotebookLM, and examines which companies and job functions are most at risk—or resilient—in the face of AI advancements. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil Show Notes: (0:00) Introduction (0:47) Google releases NotebookLM (5:20) Integrating AI into consumer apps and gaming (9:11) Future of AI companionship and procreation (14:45) OpenAI o1 model improves on iterative reasoning (18:06) Sarah and Elad reflect on Nobel Prizes going to AI researchers (21:23) Jobs and businesses at risk of disruption (27:18) AI-durable companies

ai google science future risk jobs gaming businesses prizes chemistry openai nobel prize notebooklm integrating ai elad o1 no priors

Launching AI products with Braintrust's CEO Ankur Goyal

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Oct 8, 2024 38:28

Today on No Priors, Elad is joined by Ankur Goyal, founder and CEO of Braintrust. Braintrust enables companies like Notion, Airtable, Instacart, Zapier, and Vercel to deploy AI solutions at scale by efficiently evaluating and managing complex, non-deterministic AI applications. Ankur shares his insights into emerging trends in the use of AI tooling and coding languages, the rise of open-source, and the future of data infrastructure. Ankur also reflects on building resilient AI products, his philosophy on coding as a CEO, and the importance of a startup's initial customer base. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Ankrgyl Show Notes: (0:00) Introduction (0:38) Ankur's path to Braintrust (3:05) Braintrust's solution (5:46) AI tooling trends (7:58) Instruction tuning vs. fine-tuning (8:57) Open-source AI adoption (10:42) Future of data infrastructure and synthetic data (14:45) Designing technical interviews (18:04) Rethinking agent-based approaches (19:34) Building out an AI team (23:35) Typescript as the language of AI (25:12) The shift away from using frameworks (26:02) Vendor consolidation among enterprises (27:16) Coding as a CEO (30:16) Collaborating with customers (33:00) Future of Braintrust and evals

The Sheriff of Silicon Valley: Lina Khan's FTC agenda for M&A, AI Acquisitions, and Non-Competes

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Oct 3, 2024 26:08

Lina Khan's FTC has been the most active in decades, notably challenging tech giants and adopting a more hands-on approach to regulating the digital age. On today's episode of No Priors, Lina Khan joins Elad and Sarah to discuss her regulatory philosophy for tech markets and what the industry can expect for future M&A deals. She shares her approach to overseeing emerging technology sectors, including AI at the model layer, and her work to ban non-competes on a federal level. Khan also offers insights into the realities of leading a government agency, the scarcity of young leaders in power, and how she measures the FTC's impact. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @LinaKhanFTC Show Notes: (0:00) Introduction (0:56) Lina Khan's background and path to the FTC (2:35) Amazon's Antitrust Paradox (4:20) Frameworks for regulating M&A in young markets (8:50) Khan's perspective on AI acquisitions (12:18) What founders can expect from Khan's M&A environment (14:55) Promoting competition at the large model layer (17:01) Creating fair AI regulation (18:40) FTC's work to ban non-competes (20:31) Why so few young people hold power in government today (22:18) The realities of running a government agency (24:20) Measuring the impact of FTC

amazon ai silicon valley measuring sheriffs promoting khan acquisitions ftc frameworks elad lina khan non competes no priors

Using AI to evaluate employee performance with Rippling's COO Matt MacInnis

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Sep 25, 2024 31:28

In this episode of No Priors, Sarah and Elad sit down with Matt MacInnis, COO of Rippling, to discuss the company's unique product strategy and the advantages of being a compound startup. Matt introduces Talent Signal, Rippling's AI-powered employee performance tool, and explains how early adopters are using it to gain a competitive edge. They explore Rippling's approach to choosing which AI products to build and how they plan to leverage their rich data sources. The conversation also delves into how AI shapes real-world decision-making and how to realistically integrate these tools into organizational workflows. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Stanine Show Notes: 0:00 Introduction 0:32 Rippling's mission and product offerings 2:13 Compound startups 3:53 Evaluating human performance with Talent Signal 13:19 Incorporating AI evaluations into decision-making at Rippling 14:56 Leveraging work outputs as inputs for models 18:23 How Rippling chose which AI product to build first 20:53 Building out bundled products 23:26 Merging and scaling diverse data sources 25:16 Early adopters and integrating AI into decision-making processes

ai building employees coo leveraging evaluating evaluate using ai merging compound elad rippling employee performance no priors

Transforming Customer Service through Company Agents, with Sierra's Bret Taylor

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Sep 19, 2024 48:30

Bret Taylor, Cofounder of Sierra, Chairman of the board at OpenAI, and former co-CEO of Salesforce and CTO of Facebook, joins Sarah and Elad in this week's episode of No Priors. Bret discusses building company-branded AI agents with unique personalities, goals, and guardrails at Sierra, and their potential to revolutionize customer engagement while cutting costs. The conversation explores the next sectors for enterprise AI adoption, building resilient AI products, and the parallels between today's AI market and the evolution of the cloud industry. Bret also shares his unique insights on future business models and upcoming technology shifts. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Btaylor Show Notes: (0:00) Intro (0:42) Defining agentic systems and types of agents (3:55) Customer-facing company agents (5:43) Sierra AI (8:11) Transforming customer service and reducing costs (9:57) Challenges in implementing LLMs for company agents (14:45) Drawing parallels between AI and the cloud market's evolution (17:50) Future of the AI landscape (19:15) Building durable AI products (24:39) Outcome-based business models and tangible ROI in AI solutions (29:22) Next wave of AI sectors for enterprise adoption (31:15) Customizing goals and guardrails with customers (35:55) Creating distinct personalities for Sierra's agents (41:05) Bret's insights on upcoming technology and hardware shifts (46:50) How AI software could enhance human agency

ceo ai future challenges building drawing defining transforming roi cto openai customer service salesforce outcome bret customizing elad no priors

Future of LLM Markets, Consolidation, and Small Models with Sarah and Elad

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Sep 12, 2024 26:28

In this episode of No Priors, Sarah and Elad go deep into what's on everyone's mind. They break down new partnerships and consolidation in the LLM market, specialization of AI models, and AMD's strategic moves. Plus, Elad is looking for a humanoid robot. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil Show Notes: (0:00) Introduction (0:24) LLM market consolidation (2:18) Competition and decreasing API costs (3:58) Innovation in LLM productization (8:20) Comparing the LLM and social network market (11:40) Increasing competition in image generation (13:21) Trend in smaller models with higher performance (14:43) Areas of innovation (17:33) Legacy of AirBnB and Uber pushing boundaries (24:19) AMD Acquires ZT (25:49) Elad's looking for a Robot

ai innovation robots uber competition airbnb trend increasing markets comparing models areas api amd llm consolidation elad no priors

The Road to Autonomous Intelligence with Andrej Karpathy

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

Play Episode Listen Later Sep 5, 2024 44:16

Andrej Karpathy joins Sarah and Elad in this week of No Priors. Andrej, who was a founding team member of OpenAI and former Senior Director of AI at Tesla, needs no introduction. In this episode, Andrej discusses the evolution of self-driving cars, comparing Tesla and Waymo's approaches, and the technical challenges ahead. They also cover Tesla's Optimus humanoid robot, the bottlenecks of AI development today, and how AI capabilities could be further integrated with human cognition. Andrej shares more about his new company Eureka Labs and his insights into AI-driven education, peer networks, and what young people should study to prepare for the reality ahead. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Karpathy Show Notes: (0:00) Introduction (0:33) Evolution of self-driving cars (2:23) The Tesla vs. Waymo approach to self-driving (6:32) Training Optimus with automotive models (10:26) Reasoning behind the humanoid form factor (13:22) Existing challenges in robotics (16:12) Bottlenecks of AI progress (20:27) Parallels between human cognition and AI models (22:12) Merging human cognition with AI capabilities (27:10) Building high performance small models (30:33) Andrej's current work in AI-enabled education (36:17) How AI-driven education reshapes knowledge networks and status (41:26) Eureka Labs (42:25) What young people study to prepare for the future

ai building evolution tesla intelligence senior director openai existing autonomous merging parallels reasoning waymo optimus andrej bottlenecks elad andrej karpathy no priors

Podcasts about no priors

Best podcasts about no priors

No Priors: Artificial Intelligence | Machine Learning | Technology | Startups

The Valmy

Latest news about no priors

Latest podcast episodes about no priors

⚡️Satya Nadella: No Priors x Latent Space Crossover Special at Microsoft Build

GitHub's plan for Agents — Kyle Daigle, GitHub

Giving Agents Computers — Ivan Burazin, Daytona

From SaaS to AI-First: How Companies Are Reshaping Innovation

No Priors Live: Building Durable Software in the AI Age with MongoDB President & CEO CJ Desai

10 contrarian leadership truths every leader needs to hear | Matt MacInnis (Rippling)

10 contrarian leadership truths every leader needs to hear | Matt MacInnis (Rippling)

10 contrarian leadership truths every leader needs to hear | Matt MacInnis (Rippling)

10 contrarian leadership truths every leader needs to hear | Matt MacInnis (Rippling)

The Best of 2025 (So Far) with Sarah Guo and Elad Gil

EP#39【有活力的年轻人系列】美元基金XIAOXIAOFUND 创始人晓晗：希望你内在生出勇气

Man w No Priors Kills Wife, Lives w Rotting Corpse, Quips "Go Big or Go Home" to Cops| Crime Alert 7PM 07.22.2025

Teaching AI to Understand the Physical World, with Dr. Fei-Fei Li of World Labs

AI Consolidation, Biotech Opportunities, and World Models with Sarah and Elad

AI is Making Enterprise Search Relevant, with Arvind Jain of Glean

Gaming as the Future of Education with Duolingo CEO Luis von Ahn

O3 and the Next Leap in Reasoning with OpenAI's Eric Mitchell and Brandon McKinzie

Inside Deep Research with Isa Fulford: Building the Future of AI Agents

Predicting the Earth with Josh Goldman: How KoBold Uses AI to Find Critical Minerals

From Job Displacement to AI Trainers, Brendan Foody on Work in the AI Age

Public Markets, Image Gen, and Specialized Models, with Sarah and Elad

Conversations Are the Source of Truth in Healthcare with Abridge CEO Shiv Rao

The Robotics Revolution, with Physical Intelligence's Cofounder Chelsea Finn

Copilot, Agent Mode, and the New World of Dev Tools with GitHub's CEO Thomas Dohmke

National Security Strategy and AI Evals on the Eve of Superintelligence with Dan Hendrycks

Building the Platform for Scientific Breakthrough, with Noubar Afeyan of Moderna and Flagship Pioneering

Virtual Cell Models, Tahoe-100 and Data for AI-in-Bio with Vevo Therapeutics and the Arc Institute

Building Hard Tech in Hard Markets: Kyle Vogt on Cruise, Twitch, and The Bot Company

How Harvey AI is Changing the Legal Industry with Winston Weinberg

Rick Caruso on LA's Wildfires, Policy Failures, and the Path Forward

What Can We Do About Wildfires? With Convective Capital's Bill Clerico

How AI Agents Are Transforming Customer Support, with Decagon's Jesse Zhang

Erik Bernhardsson on Creating Tools That Make AI Feel Effortless

The Best of 2024 with Sarah Guo and Elad Gil

2024 in AI Startups [LS Live @ NeurIPS]

Revolutionizing Customer Success with Agency's Elias Torres

How Diamond Cooling Could Power the Future of AI, with Akash Systems

Bolt's Eric Simons on Enabling Everyone to Generate Websites with AI

Model Plateaus and Enterprise AI Adoption with Cohere's Aidan Gomez

AI and the Future of Math, with DeepMind's AlphaProof Team

NVIDIA's Jensen Huang on AI Chip Design, Scaling Data Centers, and his 10-Year Bets

Forecasting the Future with Kalshi: America's First Regulated Prediction Market

Waymo's Journey to Full Autonomy: AI Breakthroughs, Safety, and Scaling

Gaming, Nobel Prizes and At-Risk Businesses in the AI Era

Launching AI products with Braintrust's CEO Ankur Goyal

The Sheriff of Silicon Valley: Lina Khan's FTC agenda for M&A, AI Acquisitions, and Non-Competes

Using AI to evaluate employee performance with Rippling's COO Matt MacInnis

Transforming Customer Service through Company Agents, with Sierra's Bret Taylor

Future of LLM Markets, Consolidation, and Small Models with Sarah and Elad

The Road to Autonomous Intelligence with Andrej Karpathy