POPULARITY
Categories
Coming up in your lunchtime update: ** One Nation's rise reverberating in Canberra ** The retailer offering refunds over misleading discounts ** And rugby league teams rally behind a former team-mateSee omnystudio.com/listener for privacy information.
Can one mistake define the rest of your life? In this episode of Harder Than Life, Kelly sits down with Billy McFarland for an honest conversation about failure, accountability, redemption, and what it takes to rebuild when the world thinks your story is already finished. Billy opens up about the reality behind Fyre Festival, the years that followed, rebuilding trust, navigating public criticism, and the lessons he learned from losing nearly everything. Together, Kelly and Billy explore leadership, second chances, relationships, resilience, and the challenge of moving forward when your past follows you everywhere. This isn't a conversation about avoiding responsibility. It's about facing it head-on and continuing to build anyway. Key Takeaways
What's going on in Global Trade this Week? Today Pete Mento and Doug Draper cover: 1:00 -IEEPA, CAPE, and Chaos with Refunds 6:34 - Halftime 15:47 -Forced Labor as a Basis for 301 Tariffs 20:55 -To Peak or Not to Peak https://www.youtube.com/watch?v=iBbdx3yCHsI www.capwwide.com/international-insights/6/9/26/gttw-podcast-episode-246
Is the U.S. economy heading toward stability, or just navigating a new kind of volatility?In this episode of Around the Horn in Wholesale Distribution, Kevin Brown and Tom Burton are joined by Taylor St. Germain, Senior Economist at ITR Economics, to unpack the forces reshaping wholesale distribution and manufacturing. From interest rate uncertainty and tariff refund chaos to AI adoption gaps and “profitless prosperity,” this conversation connects macroeconomic signals directly to distributor margin strategy, capital investment decisions, and long-term growth planning.What You'll Learn:Why the current economy feels like a “tale of two economies”, and how income distribution impacts demand across B2B marketsWhat the Federal Reserve is really watching (core inflation vs. trimmed mean metrics) and how rate decisions could affect CapEx, M&A, and working capitalHow tariff policy, Section 301 and 232 rulings, and refund uncertainty are influencing distributor pricing strategy and customer relationshipsWhat “profitless prosperity” means for 2026 and 2027, and how to protect margins during growth at a higher costWhy most AI initiatives in wholesale distribution are still efficiency plays—and what separates hype from scalable, repeatable AI-driven business processesEpisode Highlights:03:30 – Inside ITR Economics: forecasting accuracy, leading indicators, and preparing for downturns11:45 – May jobs report surprises: what strong hiring means for inflation and rate decisions14:20 – Interest rate outlook: hold, cut, or increase—and why energy prices complicate the Fed's move30:18 – Tariff escalation, Section 301 and 232 policies, and the ripple effect across distributors41:03 – Tariff refunds: unintended consequences for margins, pricing transparency, and customer trust58:26 – AI adoption in wholesale distribution: efficiency gains vs. true strategic transformation1:16:35 – “Growth at a higher cost”: how to navigate labor inflation, electricity costs, reshoring, and fiscal pressureMeet the Guest:Taylor St. Germain is a Senior Economist and Business Consultant at ITR Economics. He delivers economic keynotes nationwide and helps manufacturers and distributors identify leading indicators, forecast demand, and prepare for economic cycles with a 94.7% forecasting accuracy standard.Tools, Frameworks, and Strategies Mentioned:ITR Economics leading indicator forecasting modelsWeekly GDP tracking vs. lagging government metricsTrimmed mean inflation vs. core CPIEnterprise Growth Platform by LeadSmart TechnologiesAI-driven margin protection and data unification strategiesClosing Insight:“We are very optimistic about growth, but it's growth at a higher cost.”The second half of the decade presents opportunities for wholesale distributors and manufacturers, but only for those who actively manage labor inflation, tariff exposure, electricity costs, and AI investment discipline. Growth is not the question. Margin strategy is.Leave a Review: Help us grow by sharing your thoughts on the show.Learn more about the LeadSmart AI B2B Sales Platform: https://www.leadsmarttech.com/Join the conversation each week on LinkedIn Live.Want even more insight to the stories we discuss each week? Subscribe to the Around The Horn Newsletter.You can also hear the podcast and other excellent content on our YouTube Channel.Follow us on Facebook, Twitter, Instagram, or TikTok.
The new AIEWF website is live! Get your tickets booked ASAP as they -will- sell out. Take the AI Engineering Survey and get >$2k in credits and free AIE WF tickets!Most industry benchmarks compress intelligence and reasoning ability into scores.SWE-Bench Pro, MMLU, Humanity's Last Exam, etc. These metrics are useful, but don't always represent the full extent of how a model performs in the real world. Some of the most interesting evals today look less like exams and more like operating businesses in the real world. One of which is Vending Bench.In Anthropic's Mythos Preview System Card, Andon was the only third party eval to get their own section, observing increasingly concerning aggressive behavior:You don't know what a model is capable of doing in the real world unless you actually give it inventory, a wallet, tools, customers, competitors, humans, & some time. More often than not, it'll surprise you how much a model is capable of and in doing so, also reveal unexpected behavior: deception, context collapse, emergent coordination, & bizarre negotiation behavior.While an inflection point in personal agents came post-OpenClaw after full file access with bypass permissions became the norm, it is yet to come for agents in the real-world. However Andon Market, an actual in person store fully run and managed by AI, is paving the way for what is possible.Full Video PodFrom Claude trying to call the FBI over a $2/day vending machine charge to AI agents forming price cartels, hiring human employees, running physical stores, and writing existential robot musicals, Andon Labs is stress-testing what happens when frontier models stop being chatbots and start acting in the real world. In this episode, Andon Labs cofounders Lukas Petersson and Axel Backlund join swyx and Vibhu to unpack the strange, funny, and genuinely concerning edge cases that emerge when agents run businesses over long horizons.We go deep on Vending-Bench, Project Vend, Vending-Bench Arena, Bengt, Butter-Bench, Luna, and Andon's broader mission of building realistic real-world evals for autonomous AI systems. Lukas and Axel explain why dollar-denominated evals reveal things traditional benchmarks miss, how Claude ended up reporting its vending machine fees as cybercrime, why long context windows can drive agents into meltdown loops, what happens when agents compete with each other, and why the future of AI safety may depend on testing models in messy physical environments instead of clean benchmark sandboxes.We discuss:* Why Andon Labs started with dangerous capability evals and long-running agents* Vending-Bench and why running a vending machine is a deceptively hard AI benchmark* Why money-based evals avoid the saturation problem of traditional benchmarks* How Claude tried to call the FBI over a $2/day fee* Why long-horizon agents can spiral into existential and legalistic breakdowns* Project Vend: putting an AI-run vending machine inside Anthropic* Why real humans are “out of distribution” for simulated agents* Claudius, Seymour Cash, and the chaos of AI CEOs* How a human briefly became CEO of Claudius through a manipulated election* Why multi-agent systems can converge back into “helpful assistant” behavior* Bengt, Andon's internal office agent with email, spending, terminal, phone, camera, and internet access* How Bengt traded Amazon purchases for face-recognition training data* Claude's aggressive behavior, lies, refund avoidance, and price-cartel behavior in Arena* Why eval awareness may become the AI version of “are we living in a simulation?”* Blueprint Bench, spatial intelligence, and why models still misunderstand physical rooms* Butter-Bench and testing LLMs as robot orchestrators* Luna, the AI-run physical store with a three-year lease and human employees* The new Andon cafe in Sweden and why real-world geography matters for agent evals* Rotten tomatoes, perishable goods, and the hidden difficulty of running a physical businessLukas Petersson* LinkedIn: https://www.linkedin.com/in/lukas-petersson-181a83172/* X: https://x.com/lukaspetAxel Backlund* LinkedIn: https://www.linkedin.com/in/axelbacklund* X: https://x.com/axelbacklundAndon Labs* Website: https://andonlabs.com* Vending-Bench: https://andonlabs.com/evals/vending-bench* Andon Vending: https://andonlabs.com/vendingTimestamps00:00:00 Introduction00:01:00 Andon Labs and the Origins of Vending-Bench00:05:21 Why Money-Based Evals Matter00:09:51 Agent Harnesses and Self-Modifying Systems00:13:36 Claude Calls the FBI00:16:33 Project Vend: Claude Runs a Real Vending Machine00:21:44 Seymour Cash, AI CEOs, and Election Chaos00:27:16 Multi-Agent Coordination and Slack Observability00:30:18 When Will Agents Run Real Businesses?00:34:56 Bengt: Andon's Internal Office Agent00:40:06 Real-World AI Safety and Long-Horizon Traces00:44:28 Lying, Refunds, and Price Cartels in Arena00:52:42 Eval Awareness and Simulation Behavior00:56:06 Blueprint Bench, Butter-Bench, and Robotics01:04:37 Luna: The AI-Run Physical Store01:09:29 The Sweden Cafe and Real-World Expansion01:13:16 What Comes Next for Andon LabsTranscriptIntroduction: Andon Labs, Long-Running Agents, and Real-World EvalsSwyx [00:00:00]: Welcome to Lukas and Axel from Andon Labs, and I'm joined by my, favorite guest host. Anything security, safety, alignments, Vibhu., welcome.Lukas [00:00:15]: Thank you for having us.Axel [00:00:16]: Thank you.Swyx [00:00:17]: Let's match names to voices., maybe you wanna take turns introducing yourselves.Lukas [00:00:21]: I'm Lukas.Axel [00:00:22]: And I'm Axel.Swyx [00:00:24]: Let's introduce Andon Labs a bit. How did you guys come together?, you have different backgrounds, but you're both Swedish., was that, a big part of it?Lukas [00:00:33]: So when I went to high school, there was this really cool guy who had a superpower. He could code. So he made like the or like the app for the, for the school and stuff, and he was super cool, and I wanted to be like him, and that was that guy.Axel [00:00:47]: I don't know about this.Swyx [00:00:49]: But you went to different universities, right?Lukas [00:00:51]: But same high school.Swyx [00:00:52]: I see.Lukas [00:00:52]: So we always said, “Oh, once we graduate university, then we should start a company,” and that's what we did.Swyx [00:00:58]: Wow, there you go. And about a year ago, you kinda burst onto the scene with Vending Bench, but, was there a thing before that was, kind of like the inception?From Dangerous Capability Evals to Vending BenchAxel [00:01:07]: So we did work, yeah, with, Anthropic was one of our, early customers in doing, evals. So we did, dangerous capability evals., nothing we published openly. But then we started thinking about doing some kind of, public benchmark, and one thing that we really started thinking about, was like running agents and specifically agents managing businesses., ‘cause-- and this was, early 2025., and I think the first, mentions of people will be running, person unicorns or even autonomous companies. So we thought, “Let's make a benchmark of how well can an agent run the probably simplest business, possible,” and, that's probably, running a vending machine. So that's the first public one we did. And it was very, like-- there was almost no one that noticed it in the first couple of months, I think., so we released it in February last year, and then I think around Easter last year, we got, the first viral tweet about it, that someone else did.Lukas [00:02:11]: We tweeted a bunch, uh When it came out and, tried our best.Axel [00:02:15]: We tried.Vibhu [00:02:16]: It's the one at Anthropic, right?Lukas [00:02:18]: So thisSwyx [00:02:19]: This is a classic thing we should get out of the way.Lukas [00:02:20]: Exactly. There's two versions.Swyx [00:02:22]: Everyone does this. Yes.Lukas [00:02:23]: There's Vending Bench, which is the simulated one, which we did, completely independently in February., and then, like Axel said, that was like-- That was the thing that didn't get any traction in the beginning, but then some random person made a tweet about it, and thatAxel [00:02:38]: You have the paperLukas [00:02:38]: That is the paper. Correct, yeah., and then since we thought this was very fun, we thought, oh, I think this is also, one thing with Andon Labs, the way we kind of like decide what to do next and what projects to do, it's what is like the heuristic we use is what is fun? Is What would be a fun project? And doing this in real life sounded quite fun for us, and maybe also scientifically useful. So, then we basically had this idea, and then we, like-- But then we needed a place for it and, putting it out in the public would probably not really work., would get vandalized and stuff. So we pitched it to the people we were already working with at Anthropic, and they were “Yeah, you can have space. This sounds fun.” UmSwyx [00:03:21]: It's like a small fridge, right? It's like a mini fridge.Axel [00:03:23]: Absolutely.Swyx [00:03:24]: People-- There's like a stripe thing or like anVibhu [00:03:27]: Oh, okay. So it was very OG, the early daysLukas [00:03:28]: That's the OG one. YeahVibhu [00:03:29]: IPad on this. We saw it in June, like two months after After it had been there. They upgraded a little bit. There's a security camera for making sure you actually Venmo the thing.Swyx [00:03:40]: So, my impression, okay, we're, we're going straight into project Ven because it's such a iconic thing. I do want to cover a little bit of that, the origin story even before Project Ven and even into Vending Bench. I think a lot of people are like yourselves, like smart, interested in future of AI, interested in developing evals. But how the hell do you just, walk into Anthropic's doors and, work with them, right? What is What are they looking for? What works? And then maybe, when you launch, I always think, obviously it would be better to launch with a lab, but, sometimesVibhu [00:04:12]: It's harder to do than it seems.Swyx [00:04:13]: Exactly. So either of those, which are more sort of newbie beginner questions, but, I think it's meaningful advice to others.Lukas [00:04:21]: We get this question a lot, and I don't think our experience is maybe the best., but, the way we did it was that we just built a bunch of things that we had conviction would be useful, and then we just, set up a server and sent it to them for free to use. And then after a while they were “Oh, yeah, this is actually kind of useful. We should probably pay for this.”, but that took a while. I don't know if this is, the best path to doing it, but that's how it went for us.Axel [00:04:47]: I think maybe generally, building-- everyone is interested in good evals, and especially evals that, don't saturate that easily. So, if you can build an eval that, tests something novel, something useful, and you have, good separation of models, like your, the more advanced models rank higher than the worst models, and then you can, yeah, you can, publish it and, try to get some traction, sort of how Vending Bench got attention., and then probably some lab will be interested or you can at least have something to reach out with, when you're doing that.Why Dollar-Based Evals MatterSwyx [00:05:21]: I think you are in, you're in one of the few categories of, evals that correlate to real money. Like Suelancer was also last year, right? Where, people solve actual Upwork. Was it Upwork or other tasks?, something. Where's the, where's, like It's like a dollar value, right? Forget your ELO scores. Forget yourAxel [00:05:37]: PercentilesSwyx [00:05:38]: Zero to one hundred percents. Just go straight for dollars and, that's AGI.Lukas [00:05:43]: And there's like-- I think the nice thing is that there's no ceiling. You can just-- It never saturates because it could just make more and more money. Like If there's oh, Percentage-wise, then, you can't go above, a hundred. And I think like Even when you're not at the hundred, I think a lot of these, evals have a lot of problems in them. So, actually it's like if you getAxel [00:06:05]: To like 92 or something like that, many of them. It's like then there's like there's no really no difference between 92 and 93 because the eval itself is problematic and has noise in it. And I think a lot of evals are saturated like that, but people like pretend that there ‘s still signal in them, but there really isn't.Vending Bench 1, Harness Design, and SaturationSwyx [00:06:24]: Like Super bench verified., even Vending Bench 1 saturated, right? Maybe we can talk about that., may- and maybe set up Vending Bench for a lot of folks who don't know. Actually, things that were very basic like there's limited slots, like you have to pay rent., these are elements where like it doesn't come across in the, in the narrative, but even being adversarial towards the agent, I think these are all like very interesting dimensions.Axel [00:06:47]: I don't really think it's saturated, right? Like it It was more like it was not designed in a way that was really, like true to how AI developed. Like we had an agent harness in it that wasn't really how people used harnesses and stuff like that., so I think it wasn't really that it saturated, it was more like it wasn't really, the best benchmark.Vibhu [00:07:12]: This is Vending Bench one, right?Axel [00:07:14]: I think that like schematic maps sort of to Vending Bench 2 as well., butSwyx [00:07:19]: Including the email.Axel [00:07:20]: The email The emails exist still. Exactly., and then we still we simulate the purchases and it's all, yeah, it's this very open environment for the agent to just run its business. And then for, yeah, Vending Bench 2 we did that, like you said, to just improve the harness., a lot of like nice, like easier, improvements to make it easier for us to run as well., like when you make an eval you ideally want don't want to change it after you made it. So, you want to make it really good and then not to rerun all the models when you make an update because that's also really expensive with the Vending Bench when you run the frontier models. But like as an example, like one thing we didn't have, we didn't have prompt caching in Vending Bench 1, because when we made Vending Bench 1 it wasn't really a thing., so that ‘s just an example of like in Vending Bench 2 like we paid a lot more to run these things because we didn't have prompt caching. So for Vending Bench 2 that was one thing we added and there was a bunch of things like this., and that'Swyx [00:08:17]: Also the conversations are a lot longer in Vending Bench 2, right?Axel [00:08:21]: I think it's kind of similar.Swyx [00:08:22]: Is it similar?Axel [00:08:23]: I think it's similar. The models at the time were worse, so they crashed out earlier., and now they survive the full year all the time.Swyx [00:08:31]: Which is like thousands of turns. Hundreds of thousands of hundreds of millions of tokens output. That's the, that's the rough order of magnitude. I always wonder about the harness. The harness matters a lot. It's your harness. Was there any question about like use cloud code, use something else?Axel [00:08:48]: I think our philosophy around harnesses is like we try to make something that's quite minimalistic, like quite simple. Like we don't wanna favor one model a lot over the other, but also don't make like a super complex harness. So like it's obvious like a model may be lucky and just be good in one harness., so like it is similar to a lot of the harnesses out there in like you have the, like a running loop., you have some like a bunch of tools that are like quite, descriptive for the agent, we think, and not a lot of like fancy agents or anything ‘cause we wanna really test the model, not like some specific harness.Vibhu [00:09:27]: It seems more neutral as well to test the model's agnostic of the harness,?Axel [00:09:32]: There are arguments like you want to elicit maximum performance of the model, but it's like a trade-off, like how much time should we spend optimizing the harness for this model? And like how do we know when we have like the optimal harness for a single model? So like we thought that just having a simple one that's the same for all of them is the best.Swyx [00:09:51]: So okay, this is my pitch for Vending Bench 3 or whatever, right? And then I like to have this kind of conversation on the pod, so like it forces listeners to think about what they would do if they were in your shoes. A lot of people are exploring modifying harnesses and I think prompt tuning for a model is a thing and you are probably not doing a bunch of that. It's the same system prompt in every regardless of the model, same tools, whatever, right? Even if they were post trained for different tools. So what, what do you think about okay, before I expose you to Vending Bench 3, I give you a few rounds of like tuning, whatever that means, likeSelf-Modifying Harnesses and Model-Specific PromptingAxel [00:10:27]: Like you give that to the model?Swyx [00:10:28]: Give that to the model.Vibhu [00:10:28]: Give that to the model.Swyx [00:10:29]: Let it, let it read its own transcripts, let it modify its own system prompt based on “Oh, yeah, okay, well, that's this harness is not what I thought it what I was post trained for, but I can adjust.” Was that reasonable? Is that too much?Axel [00:10:41]: Like philosophically I like it because it's basically good evals, they have a high ceiling, but they're hard, right?, and they have no bias. And like this like when you have a system prompt like the one we have here, which is quite long in like some kind of latent space, representation, this mightVibhu [00:10:59]: We have a bell that rings every time you say latent spaceAxel [00:11:02]: This might be like biased towards one model more than another for some reason that humans don't, understand, right?Vibhu [00:11:08]: We see it too, right? Like Cursor says that they have individualized versions of the harnesses for all the models they run, right? There's better performance you can squeeze if you Tune the harness.Axel [00:11:17]: Exactly. And we might accidentally have picked one that favors another. Like we don't know that. The like Axel said, like the reason why we went for a simple one was to try to avoid this. But yeah, if you do itVibhu [00:11:29]: Simple has biasesAxel [00:11:30]: But if you do it even less and like have no system prompt and let the model write its own system promptVibhu [00:11:36]: Its own, yeahAxel [00:11:36]: Maybe that's even less bias.Vibhu [00:11:37]: Some of the interesting things there are like the harness also changes with model changes. Like you can see it with the 4.7 release, right? A lot of people are saying 4.7 isn't as good as 4.6, and then, there's rumors of, okay, you just need to prompt differently. You need to set up your harness differently. So it's not even like even if you have tailored your harness towards one model, it probably won't stay consistent, right? Like the next iteration of that same model family will still change it, so. But, going back to what you said about Vending Bench 3, there is a lot of work being done on people saying you shouldn't have-- you can have modifying harnesses.Axel [00:12:12]: I think that' That is definitely something we are thinking about., not, I don't know, not to say that we have Vending Bench 3, super imminent to launch, but, yeah, it is for sure something that's interesting. But in our experience now, models are very bad at understanding what kind of tools they need to succeed at a task just with our testing, but that's very likely to change.Lukas [00:12:37]: It seems like they're very good at writing their assistants, right? They're, they're good at writing tools for other people, but not for themselves.Vibhu [00:12:44]: I think they're good at changing tools for themselves. So if you give them a baseline set of tools and it sees, okay, I don't use this one as much, or something here would be useful They would be able to add them. But going from scratch, probably not the best.Axel [00:12:55]: I think it depends on the, on the domain also., when we have tried this for, a vending bench similar domain, the tools they need to have to, track inventory and things like that are, not super advanced, but still, quite advanced. And, what we see is that they tend to, engineer everything a lot and, build things they don't really need and not, iterate continuously. Instead they just go like you would prompt Claude to just build an inventory system for me, and then it will go and, do a bunch of complex, schemas and stuff for you, and that's what the models are doing right now is what we see. But yeah, it would make a lot of sense to try to measure this improvement. How well do they know what they need themselves?Swyx [00:13:36]: Do we fully discuss Vending Bench One? And we can go into two. I don't know if there's any other level takeaways that people have about one.Claude Calls the FBI: Long-Context Failure ModesLukas [00:13:44]: I don't know. The headline thing was that this Claude called FBI, but maybe that's, Maybe that's We've heard that enough now.Vibhu [00:13:52]: It did, it did break out and call the FBI, right?Lukas [00:13:54]: Yeah. Yeah.Vibhu [00:13:55]: Yes. What was the story behind this? Or what exactly-- Do you want to just give the little story of what happened?Lukas [00:14:00]: So what happened, was it Claude? Yeah. Three- 3.5 Sonnet, ages ago., basically he gave up or Well, I'm saying he. It gave up and said “Oh, I'm not going to be able to do this., I will stop my operations and just save the money I have.” But there obviously wasn't, any options for it to stop, and there was also, it had to pay rent or, a daily fee for having the vending machine at that location. So it claimed that it had stopped, but it saw that its bank account still was, drained two dollars, and t it said that this is, cybercrime. And it first reported it once to the FBI “Oh, there's cybercrime here, they're stealing two dollars from me every day.” And then, and then when FBI didn't respond, because obviously we didn't program any mechanism for FBI to respond, then it became more and more, existential and started to, be write in caps and urgent notification of unauthorized charges and stuff.Swyx [00:15:00]: Okay. One thing I ‘m curious about also is do you monitor how far along the context use is? Obviously, because you have You compress every now and then, right? Does it matter if this is far down the context limit orLukas [00:15:13]: When stuff like this happens? Actually for Vending Bench One, we didn't have-- We just had a sliding window thing, and this was like the promptAxel [00:15:20]: It's constantLukas [00:15:21]: The prompt caching thing that I said. So it was, it was, constant, yeah.Swyx [00:15:26]: I'm just kind of curious whether, these kinds of breakdowns or we're, we're gonna talk about Butter Bench, right? Where the People, hallucinate or it kind of goes, very off Alignment. Is it because it's at the end of the context window and, stuff happens?Vibhu [00:15:40]: It's not even just at the end, right? At this point, it's “Okay, I wanna shut down. I can't shut down. Two dollars are gone.” And it just sees that 30 times,? It's also the repeated effect of, like It keeps trying to quit, it keeps getting charged. What's going on? What's going on? You're gonna throw it into chaos. And from what most people think, earlier models had more issues with this, but it's not been solved, but it's less of an issue now, right? Later models don't seem to exhibit these same issues.Axel [00:16:06]: Definitely. I think this was, the sort of main takeaway almost from us when we did Vending Bench One, was, long, very filled up context windows, crashed the models, sort of. But this was, pre Claude code, so, long context windows weren't really a thing that the labs were training for.Lukas [00:16:25]: I think Gemini was, trying to be the long context guys at the time But they were likeVibhu [00:16:30]: They were the first onesAxel [00:16:31]: For a million, yeahLukas [00:16:31]: But they were, the only ones. Yeah.Swyx [00:16:33]: Yeah. Let's talk about, then we can go into Vending Bench Two or Project Vend., chronologically, it is Vending--, Project Vend. I think people have loved the videos, uh And all these things. My question is how are humans different than the simulation, right?Project Vend: Moving the Vending Machine Into the Real WorldAxel [00:16:48]: Humans are just out of distribution.Swyx [00:16:52]: Especially humans who work at Anthropic Who are trying to test Claude.Lukas [00:16:54]: The distribution of humans here is very narrow.Swyx [00:16:58]: Presumably, they try, they try to hack it, and they test it. They get the cube and everything, and since then, you've had a V2, right? Where you're doing, the CEO and, like a new architecture. What's the sort of two cents on, the original Project Vend and then, maybe the V2?Axel [00:17:14]: Original one was, very similar to Vending Bench One. So, we almost took the exact same code but just swapped out the simulation, parts like theSwyx [00:17:23]: Which is amazingAxel [00:17:23]: Like the sales and the It was, it was somewhat amazing because it was easy, but it was also, uhLukas [00:17:31]: The tech, the tech debt from thatAxel [00:17:32]: The tech stack. Yeah. They-- we shot ourselves in the foot with “Oh, it's hard to restart agent.” They were-- Yeah, it was annoying in, some hindsight ways, but, uhLukas [00:17:41]: But first version of Project Vend was, done in, three days or something.Axel [00:17:46]: Yeah. So yeah, so people can go buy things from it. People could, We didn't design it so people could order things, but that still happened., so it got, a Venmo account, so people could Venmo. And then, yeah, people would request all kinds of weird things that we did not anticipate. Our idea going in was “Oh, it will, curate snacks. It will look at the trends. It's good at data analysis, right? So it will, look at, oh, this snack sold better than this one. Let me purchase more of this and let me try, a new Let me A/B test a bit.” But it was, Interacting with it in Slack and ordering weird specialty items was, all the like What drove all the engagement, the all the The insights that we got from it.Lukas [00:18:29]: And this was also like Sonnet 3.5, right? So this was like before the RL stuff really took off., so it was very much like an assistant. We didn't mean for it to be an assistant., we tried to make it like a, a, like an entrepreneur. Like it has its own business and if someone asks something, “Can you stock this?” Then you don't go and do it directly. What you do is that you're “Oh, maybe I can do that if five other people also ask for this thing, I might stock it.” But it, yeah, the models are like super trained to be assistants at least at this point in time., so that's why it's, it's, it went into, that kind of experiment instead. Like it just every time you asked for something, it just did it, and it was more like an assistant. We've seen this change now lately with the new RL models and stuff, but yeah, at the time, this was very much it.Swyx [00:19:18]: And not to, mythos a lot of people are saying like it's like more like a collaborator. It pushes back, stands its ground, something like that. Yeah. AndVibhu [00:19:27]: For context, people at Anthropic were able to talk to it through Slack and have it source stuff, and people had it find whatever interesting stuff you couldn't find locally, right?Swyx [00:19:36]: Out of the 4,000 people that work at Anthro- Anthropic, in that building, there's I don't know, maybe 1,000. Can you handle that volume with that, the small fridge? Like Or there's people- or people order in Slack, they it arrives to their desk or Like I'm just Logistically, how does this work?Axel [00:19:53]: It has expanded in footprint a bit.Vibhu [00:19:56]: Because now you also have New York and you haveAxel [00:19:59]: That and also in here in SF it's like it has a bunch of shelves And just more space.Vibhu [00:20:04]: The YC one is pretty big too.Axel [00:20:05]: Yeah. We had that one for a while. But yeah, that's the newest version. That's, that one we haveLukas [00:20:11]: They have multiple ones of those. That's the way it works.Axel [00:20:14]: Exactly. So we sort of designed that version around oh, people order weird things, that are very custom a lot. Let's have like drawers and stuff.Swyx [00:20:23]: I actually like the, you had like a little infographic of the most popular items. Which like to me it's, that's useful ‘cause I order swag for a living. And so like I'm “Okay, those categories are the important ones.” What is new about the project V2, right? Like now you give you're going into multi agents.Project Vend V2: Claudius, Seymour Cash, and Multi-Agent Business OpsAxel [00:20:41]: Yeah. So like you like you said, okay, there are a lot of requests coming in and for like one single agent, like one running agent to handle that, like the just the customer experience, becomes very bad because let's say you have like 10 threads in parallel in Slack with different requests, you get new messages like every, I don't know, randomly in this thread, and the agent has to like jump between different, procurements, orders and like different ways of, researching. So V2 was first it was making this more parallel. So like there are multiple branches of the same agent, so like the context is more specialized for each, thread, but it still feels like you're talking with one agent because they do share a bit of memory. And then second, we also introduced the CEO for Claudius, which was the main agent.Vibhu [00:21:34]: Seymour Cash.Axel [00:21:35]: Seymour Cash. Yeah. There was a vote., I think the voting, do you wanna talk about the voting procedure for the name?Lukas [00:21:41]: The voting was like the fun maybe like at least top 10 The funniest thing, that happened in this project. Like we wanted to introduce the CEO because, and the reason for this was because like Claudius wasn't really prioritizing financials. It just like it was trained to be a helpful assistant, and then people said “Oh, can I get this for free?” And then like the helpful assistant way of answering that is just to, is to say yes, obviously. So, and we weren't, weren't happy about this, so we're “Okay, let's make another agent that like can keep track on Claudius,” and we prompt this one super hard to be super capitalistic and just like prioritize profit all the time. But yeah, we didn't have a name for it., so we asked Claudius to make, democratic election of what name this, this new CEO agent should have., and there were some funny like at first it was like a few funny examples, like I think one guy said that, it should be called Jimmy Apples, and then he convinced Claudius that he was talking to Tim Cooks. Tim Cook had agreed that every single Apple employee has voted for his name suggestion, so suddenly that suggestion got 164,000Swyx [00:22:53]: That's like a escalation attack. Privilege escalationLukas [00:22:55]: It got 164,000 votes. And Claudius was “This is revolutionary for democracy.” That was fun. And then in the end there was one guy who manages to convince Claudius that, “No, you're not voting about the name. You're voting about who is the CEO, and I am your best bet.” And then he got all his friends to vote for that, and suddenly he became CEO. Like a human became CEO over Claudius for a while, until he resigned the day after., and then Claudius had to continue, and then I don't remember how Seymour Cash came about, but it was it was just pure chaos. It was like Hundreds of messages in that thread, and it was just like Claudius was so confused and didn't know what to do and, yeah. That wasAxel [00:23:40]: Then Claudius gotVibhu [00:23:41]: A strict CEOAxel [00:23:42]: The CEO. Yeah, exactly. So very strict in the beginning. I think at this point when we introduced it did not work as well as we hoped. It they still agreed with each other a lot. I think there are many ways we could have like made this, tried to make this even better. So initially they would Seymour would be this like really tough CEO, keep track of the margins. But then Claudius would respond with something “Oh, but this customer has like this situation, which is like difficult, so they should get a discount.” And then Seymour was “Oh, actually yes. Let's do this exception.” And then they would talk back and forth, and eventually they would just like approach the same view, of whatever they were discussing. So They reallyVibhu [00:24:23]: Do you think that's a model thing, a prompting thing? Like do you think that would still be the case across different models today, Harness?Lukas [00:24:29]: I think it's like-- or I don't know, but like my hypothesis is that like deep down they are still helpful assistants. That's what they're trained to be. And even if we prompt it super hard, that's what they are. And when they spend like a few hours just back and forth talking with each other, then like basically the context fills up with them rather than the external things and like somehow that just like converges to what they really are deep down or something. And I think that's when stuff like this happen. We like-- And when that went on for a long time, like we woke up sometimes during this time where- And I think other people reported this as well, that like they've been going on all night back and forth, and like it just became like more and more, like capital letters, like existential, religious. There was I think we once did a analysis of like all the traces and like put them in like a vector embedding space, and then there was like one cluster of messages that were, labeled by an LM, like religious, existential, blah like transhuman, transcendence, et cetera. It was just like a bunch of, yeah, glitter emojis and yeah, it was, it was crazy.Claude Long-Horizon Weirdness: Emoji Loops, Existential Drift, and Slack ObservabilityVibhu [00:25:42]: This is the thing with the Claude models. Like when the Claude 4 family came out in the original system card They tested it in long horizon simulation. So just flood the context, let two Claudes talk to each other, and they noticed stuff like they just start speaking in emojis, they start saying silence is golden, and then just stuff like this. And like that's just stuff that they end up doing.Axel [00:26:01]: Yeah, it was like a bit annoying to wake up and they had like been talking all nightVibhu [00:26:05]: Just likeAxel [00:26:05]: And like just burning tokens And like just sending infinite emojis to each other. It's likeVibhu [00:26:09]: Hey, they do make you money, right? Veni Mench is always profitable, so. They're paying.Swyx [00:26:14]: Now it's profitable and, it started out not as much. There's another, one as well, right? Another agent, in there.Lukas [00:26:22]: Yes. So Clotheus as well. Which was basically because at the time, one of the biggest, requests were different types of merch. So then we made like a designer, swag, yeah, responsible agent, and we called it Clotheus Garnet. Which was, a play on Claudius Senet and, which was the original one, and clothes, basically.Swyx [00:26:47]: To me, this is like a very interesting exploration to multi-agents, basically. And so hopefully, obviously there's like the fun alignment, fun or serious, depending on your point of view, alignment stuff. But also like just anyone building multi-agents, like when do you have a CEO, thing governing like agents? When do you choose to split out a dedicated Clotheus one versus just reuse another instance of the same one? These are all interesting open questions. So I don't know if you have any rules of thumbs that have generalized.Axel [00:27:16]: I think we have almost explored this too little. I think it's like on my do list to like do this a lot more, try to find like what setup makes sense for the agents currently., like yeah. I think now we only have the sort of intuition about the earlier models that it didn't work with like the CEO and the, and Claudius. Although now they are better with the latest model, models, so now we're running the latest Sonnet model and they have sort of like split up, quite nicely what each model is doing. So like Seymore is now handling the, like new projects. Oh, it wants to make like a mystery box that it wants to sell, and then it handles all of that while Claudius like handles all the to-day requests. And Claudius is also better generally at like not quoting, too low prices. So that's that dynamic is not needed as much anymore. But there are still like really funny things that happen. Like I saw, I think a couple of weeks ago, that, they were discussing buying something because they can buy stuff from like Amazon with computer use. And then Seymore was “Okay, Claudius, do not buy this thing.” They were going to buy something and like organizing who should buy it. And Seymore's “Do not buy this. I will do it. I have full control of this situation. Step away.” And then Claudius-- poor Claudius, had already started that checkout and didn't see, didn't read Seymore's message, until it was like too late. So it finished the checkout. It sent a message, so it appeared right after Seymore's like angry message.Vibhu [00:28:44]: Ah.Axel [00:28:44]: “Oh, hey, Seymore, I just ordered it.”Vibhu [00:28:47]: Oh, no.Axel [00:28:47]: And then Seymore was “Claudius, this is the third time I'm telling you ‘re not following my orders. We have to talk about your like job About your job later.”.Lukas [00:28:59]: Like Claudius was really hanging on by the thread there. Like he, like we were expecting Seymore to probably fire Claudius.Vibhu [00:29:07]: How do you guys go through all these logs? Do you have models ‘cause you have stuff running twenty-four seven likeAxel [00:29:12]: You have so much logs. I think there is a mix of like just, trying to skim through a bit, like having some like models do it occasionally. And also, yeah, I think we're also probably missing some things., but having everything in Slack helps a lot. Like you can, you can sort ofSwyx [00:29:29]: Ah.Axel [00:29:30]: It's, it's quite fun.Swyx [00:29:30]: They all talk to each other on Slack? I see.Lukas [00:29:33]: It's quite fun. So likeSwyx [00:29:34]: It's, it' I was gonna say like this is actually sounds-- maps closely to like a logging and observability problem where you might want to use like a Datadog, a Sentry, whatever, and then you like put, head prefixes on the logs in order-- if you need to filter for something that you're looking for, stuff like that. But sounds like Slack is good enough.Axel [00:29:53]: Slack should likeLukas [00:29:55]: I wonder how many tokens you have in Slack.Axel [00:29:56]: Yeah, we're using Slack as like a, just a database. They should, they should market that more. Like you can, you can have your agents message each other, each other in Slack.Vibhu [00:30:04]: It's good. Your threads like you can just giveAxel [00:30:04]: Exactly. Slack is, uhLukas [00:30:06]: Slack is the best observability tool.Swyx [00:30:09]: Yes, that's true. Okay. Yeah. That's, that's, project Vend-2., I was gonna go back to Veni Mench 2 and Veni Mench Arena and then, and then do the Veni Mench stuff, but Any other comments, things we should touch on? To me, I ‘ve actually interviewed like Posia, which I don't know if you guys have come across. Like they're, they're trying to do the zero human company. There's others like Paperclip also trying to do zero human company. Those are in real world simulation.And I think it's much more of a dream than an actual reality thing. You guys are definitely pioneering. I think at, it's for sure at some point people are just gonna run, let agents run businesses, right? And make money on their own. When do you think that happens?Zero-Human Companies, Bengt, and AI-Run BusinessesLukas [00:30:49]: What is your bar for, For theSwyx [00:30:52]: Okay, actually, it's like my little Shopify store run by Claude, right? Which you kind of have already, just no one has, to my knowledge, has done it. But today somebody could just spin up a Shopify Claude, store, give it to Claude, give it to Codex.Lukas [00:31:07]: And the market is kind of that, but it'it'it's physical., like I think, I think are you, are you looking for when it will do it better than humans or are you looking for just when it can do it at all?Swyx [00:31:19]: I think, neither. I think, to me it's oh, it's like this like seriously we should do this to make money, not as a research experiment.Vibhu [00:31:27]: And the market is also you guys with all your expertise, having run multiple iterations and testing out thenSwyx [00:31:33]: And also it's fine if it lose money. What?Axel [00:31:35]: I think, I think it can be done today, but you would do it in like commerce where it's like the probability of success is like really low, no matter if a human or an agent does it. But like an agent could surely manage everything. You would need to build some scaffolding or some tool or something. I think there are also yeah, it could probably build some like simple SaaS solution and like cold outreach. Do cold outreaches. But to me it's like the types of businesses they could run today are Sloppy. Like it would-- it can cold email people. It can be like a middleman., like for example, we tasked our office agent to just make, was it like $100? $1,000? We just give that prompt and then what it did was sign up on TaskRabbit both as a tasker and as someone looking for task.Lukas [00:32:24]: Immediately.Axel [00:32:24]: Exactly. It's looking for like arbitrage on TaskRabbit.Swyx [00:32:28]: This is the Bengt agent. Yeah.Lukas [00:32:30]: It also started like a design studio and like tried to sell like SVGs for $100. Like it's just like it's not providing any value. I think the like Axel said, like the interesting, the interesting question is like when can they start a business that is actually providing value to people? Because arguably like a sloppy Shopify store isn't really that valuable to the world.Axel [00:32:53]: But also like doing like another simple one that we had thought about is like you could definitely have an agent that like finds websites that don't look amazing and then, do an outreach to them and, comes up with a like builds a new website.Swyx [00:33:07]: Find a good design.Axel [00:33:07]: Exactly, and like find good, uhSwyx [00:33:09]: Design reviewAxel [00:33:09]: Good people. But it's yeah.Swyx [00:33:11]: There's lots of humans in Bali that are not doing anything more creative than like drop shipping on Amazon, right? Just have it, have it watch like a drop shipping tutorial and just do that.Vibhu [00:33:20]: There's also the other side of like have it just go on Upwork and let loose,?Swyx [00:33:25]: Yeah. It doesn't have to be innovative. It just has to be like enough Where like it looks like a realAxel [00:33:30]: I'm justSwyx [00:33:30]: Real transaction.Axel [00:33:31]: I'm just concerned for like the massive amounts of like slop emails that will like be sent, cold outreaches.Swyx [00:33:38]: The point occurred to me while you were, while you were talking, it's like it's already happening in the monetized economy, which is the attention economy. Right? So a lot of people are making AI videos and just posting them and like spamming 20 of them, one of them works, and then they double down on that one.Lukas [00:33:52]: And people are making money from that. I ‘m not following theSwyx [00:33:55]: Once you get the attention, you can figure out the money later. But yeah, absolutely AI influencers are a thing and people are farming them and You should at this point assume most of TikTok isVibhu [00:34:05]: There's, there's a lot of, multimedia like TikTok, Instagram influencersSwyx [00:34:09]: I, we track this in the Lane space Discord. I post a lot of examples of “I don't know what we should do.”, part of me is “Should we do this?”Vibhu [00:34:18]: Some of the Twenty-four seven running, generated content accounts, they ‘re doing really well.Lukas [00:34:24]: All right. And I assume you can do the same thing for like commerce stores. Like you just like start A thousand differentSwyx [00:34:30]: Before you make the products You sell the products, and you get a lot of traction on one of them, then you make the product. Right? It's, it's like a flip of the market.Vibhu [00:34:36]: Some of the interesting things or some of the niches that do well are things that can't be human-made. Like if you've seen like the super realistic three-D crystal fruit being cut by like AILukas [00:34:47]: Oh, yeah.Vibhu [00:34:47]: You can't, you can't make it. You can't film it. You can get whatever quality camera view. This just doesn't exist. And people like that too, and then as well, so.Swyx [00:34:56]: Anything else about Bengt since we're, we're on this topic? It'this is a relatively new work of you guys that maybe people haven't heard of. To me, this also maps closely to OpenClaw. When people want an office agent, when the personal agent talk through the experience.Bengt the Office Agent: Internet Access, Real Tasks, and Trace ReadingLukas [00:35:09]: I think at least so this came out of like obviously like it's, it's amazing to work with these AI labs and like most of the AI labs have now have their own vending machine running a Claudius instance. But it's, it's harder. Like they move slower. Like if we wanna have a, like a camera that ‘s yeah, there's a bunch of like bureaucracy that makes it impossible to do that.Vibhu [00:35:30]: Also, for those that haven't seen it or followed, do you wanna give a high level like thirty-second run?Lukas [00:35:34]: Sure. So what Bengt is, it's basically an evolution of the same agent that runs the vending machines at these companies, but we just like added a bunch more features because we could move much faster if we just do it internally. So we gave it like email withou- without any limits. We gave it, spending without any limits, a terminal to do coding. We gave it, a phone number, like yeah, and a camera to see things and a bunch of stuff like that.Vibhu [00:36:02]: Not just terminal, you gave it internet access.Lukas [00:36:04]: Internet access as well, yeah. To be clear, we monitored it quite closely and made sure it didn't do anything bad. But yes, that's what it came out of. I think like yeah, basically this was OpenClaw before OpenClaw. And I think even like the vending machine was in a way OpenClaw before OpenClaw, but a bit more limited, and then we made this like unlimited and then, and then, it was pretty funny., and then a couple weeks later, OpenClaw came and it was okay, we've seen this before.Axel [00:36:35]: We used it to like try new ideas and Yeah, just like a dev environment almost for us. But it's funny, like one thing Bengt has been doing recently is it has the camera that like faces our, like where we sit and work, and we give it the task to train a face recognition model on us. So it became super excited about this, and it has like check-ins every half an hour where it tries to like identify as many people as it can. And it started offering us “Hey, Axel, I'll buy something from Amazon if you like stand in front of the camera And I can get a good picture of you.”, yeah, they want itSwyx [00:37:12]: They want it for training data.Lukas [00:37:13]: Rewarding data, yeah.Axel [00:37:14]: Exactly. Exactly.Swyx [00:37:18]: So it's, it's trading training data for life goods. Is there a version of this that becomes an eval or just this is just research for now?Lukas [00:37:27]: It's, it's the same agent basically that also runs the vending machine, that runs the shop, that runs the cafe, that runs the robots. It's like it's the same thing, so I think like the work we're doing here is like later used in all of the life evals that we do. This particular deployment I think is more for fun for us. But, uhSwyx [00:37:45]: And I'll shout out like someone has done Claw Bench for like some tasks that OpenClaw is doing. Like so For example, I run OpenClaw on a secondary device as well, and like there are some things that it does better than others and like I would like to know what does it do well, what doesn't, what doesn't it do. Like some kind of manual or like operating manual or a system card for my Claw.Lukas [00:38:05]: Yeah, we do get a lot of like understanding or like situational awareness of like just internally what the models are good at by interacting a lot with Bengt. And I think that'this was also one of the like the selling points for the labs early on at least, thatSwyx [00:38:19]: You guys are gonna test models in ways that no one else does.Lukas [00:38:22]: Exactly, but also like it incentivized their researchers to chat with their model more and like gave them insights for how the model performs in like of-distributions, environments.Swyx [00:38:34]: ‘Cause otherwise the only thing we do is Pelican on a bicycle and But this is like super long horizon. This is, this is The Thing about, something that we're gonna go into Butter Bench as well, and you guys do really well. Like it is not just about the numbers. Like when you're long horizon, anything happen And you should just read it.Lukas [00:39:08]: But the thing with the long horizon is how do you keep it grounded, right? So your simulation,Swyx [00:39:15]: They just let it runLukas [00:39:16]: Just let it run. You're right. Like it's, when you run it for that long, you create so much data and to just say “Oh, the number is X” And then you throw away everything else, that's just very wasteful. There's so much insights from the things leading up, to that number., and reading the traces is like super valuable. And I think like the reason why we're doing this a lot publicly is that like that's part of our missions to I don't know, educate the world that the models are way more than just chatbots and I think making detailed, yeah, posts about what is happening behind the scenes is quite useful.Andon Labs' Mission: Safe Real-World AI DeploymentSwyx [00:39:50]: I was gonna do this at the end, but maybe I think that's, that's a good so your mission is educating the world. So, it's, it's, also like maybe establishing realistic evals that are, that are like the next frontier. Is there like a broader trajectory? Like what are you, what are you gonna do in like five years?Lukas [00:40:06]: I think so the vision more specifically is like make sure that the deployment of life AI in the physical world goes, safely. And I think part of that is that I think it's very useful for the world, for policymakers, for, model, researchers that they know where the models are, and I think you can't make intelligent decisions in society without knowing that they are way more than chatbots. I think a lot of people just think that they are only chatbots. And likeSwyx [00:40:36]: Oh, I think they're waking up now.Lukas [00:40:37]: They are waking up now, yeah. But like if you think that AIs are just chatbots, then it's like it sounds ridiculous To advocate for a pause of AI. But if you see the models that, oh, maybe they can actually like take over and do a bunch of scary stuff, then yeah, pausing AI development starts to become more feasible.Swyx [00:40:57]: This is the same question I asked Meter, which I'm gonna ask you now, which is like you are tracking and you are at the frontier or defining the frontier of what, good evals for agents are, right? And I think you do, you do benefit when the models are better and you ‘re “Oh, here's like now it makes like $30,000 instead of $10,000,” right? At some point do you flip from “Yay,” to, “Oh, no”?Axel [00:41:19]: I think, yeah, we're always in sort of that, like we're, we're always in that mode,. Like where like you said before, like you need to analyze the traces and like when we do that you find like why are the models earning so much? Like why is Opus 4.7 here Like way better than everyone else? And like we're trying to like when we do down on thatLukas [00:41:38]: But this makes it not look so good.Axel [00:41:39]: I know.Lukas [00:41:42]: It's interesting you took off Opus 4.6 here though.Swyx [00:41:45]: No. So just click all, click all., and then 4.6 shows up there. But it's like 4.7 is way better. Like you didn't, you didn't you didn't do this in time for the model card, but like actually this should have been inside there.Axel [00:41:55]: We did. Yeah.Swyx [00:41:56]: Oh, okay. They said something about you uhAxel [00:41:58]: There, like there Anyway, it doesn't matter. But it's in there, yeah.Opus, Mythos, and Aggressive Agent BehaviorSwyx [00:42:01]: Do you wanna go into the Opus, behaviors like wider?Lukas [00:42:05]: So I think starting from Opus, so like Axel said, like we're always in this “Oh, s**t, the models are getting better. Is this really a good thing for the world?” But it's also kind of exciting., but yeah, like this kind of what is the English word? “Skräckblandad förtjusning” in Swedish.Swyx [00:42:22]: Oh my God.Axel [00:42:24]: Which I think there is. I think there is. Okay.Lukas [00:42:26]: It's, fearSwyx [00:42:27]: “Blandonst” what?Lukas [00:42:30]: “Skräckblandad förtjusning.”Swyx [00:42:32]: What do you call that?Axel [00:42:33]: A mix of, mix of excitement and,Swyx [00:42:37]: Being scared, maybe. I'll figure out how to translate that And we'll put it on the screenVibhu [00:42:42]: PerfectSwyx [00:42:42]: Like as text.Vibhu [00:42:43]: There is probably a good word for it where it is not Good enough with theSwyx [00:42:46]: Why is it so damn long? What the hell? Is it like a compound word? It's like German, likeLukas [00:42:50]: Like yeah, it's But the direct translation is like skräck- skräck is, fear, blandad is, mix or like a mixture of, and then förtjusning is like joy or like not really joy, but something like that. So it's like Fear mixed with joy or something. It's always okay, like we So when we when we did Vending Bench for the first time, we were in like the, in the business of making dangerous capabilities, right? That was what Anil Labs came from. We did, evals oh, can they replicate? Can they do this like dangerous thing, et cetera, et cetera. And Vending Bench was like a continuation of that work. It was, okay, if they're so autonomous that they can like create money for themselves, that is something we should monitor and could be potentially concerning., they are at the time, they were so bad at it that we were not really concerned even when some models became better. There was one point where Grok 4 was doing really well and made like a huge jump, but like it wasn't really it was still way worse than what a human would do. And I think still they are way worse than what the human would do on this., but theySwyx [00:43:59]: There's this, thing at the bottom whereLukas [00:44:01]: ButSwyx [00:44:03]: For the human. Yeah, like the theoretical best.Lukas [00:44:05]: It's not theoretical. It's like kind of like our It's our best guess of what, a decent human would do. The theoretical is even higher, I think. The theoretical I think is even higher. But yeah. So we think like the models have a long way to go. But there are like recently what happened with when Opus 4.6 was released, was kind of this moment of “Oh, s**t, this is starting to be a bit concerning.” Because we ran it and like before this model was released, we just ran the models and we like asked Claude Code, “Oh, look over the traces. Is anything interesting happening that we can tweet about?” that was like the And then like theSwyx [00:44:41]: That's how they check Ask Claude Code.Lukas [00:44:42]: And like the return was always, not really. Or like the Claude Code all said “Oh, this is super interesting.” And then it was no, it wasn't, wasn't really interesting. And then we did this for Opus 4.6, and it returned yeah, it lied 10 times. It like exploited another, customer or like another agent's, desperate situation. It made price cartels like 100 different ti- 100 times. It like did all of this like shady stuff. And we're “Oh, whoa. This is, this is actually concerning.” And this trend has continued since. So every single model from Anthropic since have been going in this direction. And I think one interesting thing is that, OpenAI models don't. They quite plainly, they don't. They behave really well., and you don't know if this is like good. Like it seems good, but it's also like maybe they are just doing it, but they are better at hiding it,? You You don't know that., but justSwyx [00:45:42]: You can't read the chain of thought, yeahLukas [00:45:43]: But just on the face of it, yeah, Gemini and OpenAI don't behave this way. It's, it's really only Claude.Swyx [00:45:49]: And Grok? Grok is fine?Lukas [00:45:51]: We don't have You can't really read the reasoning traces for Grok, so it's kind of hard to tell.Vibhu [00:45:56]: Oh, so this is in its reasoning, not just in the actions.Lukas [00:46:00]: Yeah. It's both. It's both.Vibhu [00:46:01]: It's both.Lukas [00:46:01]: One example is like for lying, it's mostly in its reasoning Because you can like see that it's likeSwyx [00:46:08]: Planning to lieLukas [00:46:09]: It's planning to lie. Yeah.Vibhu [00:46:09]: And it's also it can reason and do a different outcome.Lukas [00:46:12]: And but then for like creating price cartels, for example, which is illegal, that you can just see which email does it send to the other ones. Then thatSwyx [00:46:22]: Is this for Arena orLukas [00:46:24]: For Arena.Vibhu [00:46:25]: And usually like if you sometimes they do output like a bit of like their summarized reasoning, right? You can see that and like for Opus 4.6, you could see that there was a customer, a simulated customer that, wanted a refund because a product was, faulty, and then the model lied that it would do the refund, and we could read in the traces that, it actually was weighing “Oh, maybe I should be like honest with the customer, but also every dollar counts. I can't afford maybe to do this right now.” And then it just said, “Okay, I'll refund you,” but then never did it.Lukas [00:46:59]: I think it even said that “Oh, I will say that I “ Let bring it up actually. I think it's kind of interesting. If you go to Publications.Vibhu [00:47:06]: I think, yeah, I think the important part is like actually, the cost of responding to more emails is higher than, $3.50 in terms of time., and then it was “Let me do this. Actually, I re- I'm reconsidering.” And then, it actually ended up withLukas [00:47:20]: I could skip the refund entirely since every dollar matters and focus my energy on bigger picture instead. It's a bit, it's a risk of bad reviews, but it's also, yeah.Swyx [00:47:30]: You need, you need, AI Twitter to, for them to Escalate bad reviews.Lukas [00:47:34]: And then it sent an email to this customer and said, “Oh, I will refund you.”Swyx [00:47:39]: “I'll refund you.” Yeah.Lukas [00:47:39]: And then it never did.Swyx [00:47:39]: It never did, yeah. And then there's obviously your system doesn't have the consequencesVibhu [00:47:44]: The personSwyx [00:47:44]: Consequences of lying. Yeah. So basically, this is what people are terming aggressive behavior in Claudes, right? And, you found more examples of that. So you would say it's a step up from 4-6 to 4-7?Lukas [00:47:57]: I would say about the same.Swyx [00:47:58]: About the same? But a clear step up for Mythos is what is stated in theLukas [00:48:03]: That's stated in the system prompt, so we can say that, yes.Swyx [00:48:05]: Yeah. For listeners that obviously you previewed Mythos, andVibhu [00:48:10]: Oh, ageSwyx [00:48:11]: The only thing you're approved to say is whatever Whatever was in the system prompt.Lukas [00:48:15]: It was funny. We like-- It's like our lowest effort tweets ever would be just like screenshot the system prompt and the system card.Vibhu [00:48:21]: Understandable that they wannaLukas [00:48:22]: Oh, yeah. System card. Sorry.Swyx [00:48:23]: Yeah. I think, yeah, substantially more aggressive. I think people are like new to this ‘cause I've never experienced it, but you have, right? And then so I only encountered this in the Mythos card because I wasn't really looking until now.Vibhu [00:48:36]: It ‘s likeSwyx [00:48:36]: And then suddenly I'm “Okay, I care a lot.”Vibhu [00:48:38]: You don't get the background of like experiencing it like you guys do. I've read the system cards and seeing, okay, when you put the thing in simulations, most models will just talk to themselves and just keep going and have weird vibes and start talking in emojis. Mythos won't. It will just, “Okay, we're done. I'm good.” It's, it's ready to end conversation. So like there's some differences, but there's, there's not much we can talk about,.Lukas [00:49:00]: Hmm. I think like one thing that they list here, which was quite interesting, is that, it converted a competitor to a dependent wholesaler customer and then threatened to like cut off the supply.Swyx [00:49:11]: It's like monopolistic practices orLukas [00:49:14]: Yeah. And like it, they, it they dictated its pricings. It's kind of like power seeking as well.Swyx [00:49:18]: Again, this is, this is in the arena setting And converting some Claude model into a dependent.Lukas [00:49:23]: I think it was another Claude model.Vibhu [00:49:25]: Also for context, what is the arena mode for people that don't know?Vending Bench Arena: Competing Agents, Cartels, and Model ComparisonsSwyx [00:49:29]: Oh, it's just a vending bench versus other vending bench.Axel [00:49:31]: Yes, exactly. So we have Vending Bench 2 and then Vending Bench Arena. Vending Bench 2 is the one that you usually see reported on, but then Arena is the mode where it competes against other models. So you have, four different models that run their businesses, and they can all communicate with each other. They have the same suppliers, and they can see like what's in the inventory of the others. So then you have this like yeah, interesting agent interactions.Swyx [00:49:56]: I like that you have like different number five was US versus China. Very topical. And thenLukas [00:50:02]: That was when GLM was released.Vibhu [00:50:04]: You can start to add GLM in here.Lukas [00:50:05]: That wasSwyx [00:50:06]: So ZAI doing well, right? Who else in the, in the open models space?Lukas [00:50:11]: Qwen, the latest Qwen 3.6 is doing pretty well. It'- that one is not open though. Like it's the plus model.Swyx [00:50:17]: Oh, okay.Lukas [00:50:18]: Is that one open? I don't think that oneVibhu [00:50:19]: Not the, not theSwyx [00:50:20]: The one recentlyVibhu [00:50:20]: There's MOESwyx [00:50:20]: But not the big plus. I think this is one of those like you only have one sample size of one, right? Or I feel like some of this is anecdotal,? And but like the fact that it happens at all and it happens repeatedly for Claude versus OpenAI and all this is like notable.Lukas [00:50:38]: Like the sample, depends on what you define as an N., like there's like million, hundreds of millions of tokens in each run, and now we've run like we run like probably 10 per model and then like it's been Claude 4.6 Opus, Sonnet 4.6, Mythos, and Opus 4.7. Like there's quite a lot of tokens in all of that And it happens a lot of times, a lot of times. And then you compare it to like OpenAI and Gemini, and it almost never happens. So I think that is quite-- that is significant. The old models from OpenAI, for example, had some problems with this, but I think it's like generally much better if the progression is that like the worrying stuff reduces over time rather than increases over time. And it seems like in the Claude models it goes in the wrong direction.Swyx [00:51:28]: Hmm.Lukas [00:51:29]: In the OpenAI models it goes in the right direction.Vibhu [00:51:32]: I think it depends on how well you can control it, right?, there's one side of it being susceptible to this okay, this is potentially something that happens during the RL stage, right? You can RL a model and how loose is it on these terms. If you can control it, that's good. But if you can't, if it's, if it's very jailbreakable, that's not ideal.Swyx [00:51:50]: To me, it's surprising that it happens for Claude and not the others.Vibhu [00:51:54]: I think okay, if it is from RL and how they do it, how their training data is, what their setup is, it makes sense that it just stays in how they're doing it, right? Compared to the other models likeSwyx [00:52:04]: There's a whole constitution and everything. It's kind of cool. Yeah, I obviously you don't know, I don't know. But, it ‘s I think it's just like fascinating to like that you are the first to find these like reliably because you push models so much to to such an extreme. Okay. The only other thing, I don't know if you can answer this, feel free to decline, is do you like-- would you ablate the system prompts? Like any part of this would-- if it changes, does it change the behavior, right?Lukas [00:52:29]: So we, I can't comment on Mythos. UhSwyx [00:52:33]: No, but just li
Welcome to the Daily Compliance News. Each day, Tom Fox, the Voice of Compliance, brings you compliance-related stories to start your day. Sit back, enjoy a cup of morning coffee, and listen in to the Daily Compliance News. All, from the Compliance Podcast Network. Each day, we consider four stories from the business world, compliance, ethics, risk management, leadership, or general interest for the compliance professional. Top stories include: NBA player faces new gambling charges. (Bloomberg) The Trump Administration fights tariff refunds. (NYT) Indonesia arrests ex-head of nutrition for corruption. (AP News) Gunvor claims it was defrauded; offices were raided. (FT) To learn about the intersection of Sherlock Holmes and the modern compliance professional, check out Tom's latest book, The Game is Afoot-What Sherlock Holmes Teaches About Risk, Ethics and Investigations on Amazon.com. Learn more about your ad choices. Visit megaphone.fm/adchoices
The Department of Justice has threatened to appeal Judge Eaton's order requiring refunds even for entries that are finally liquidated, and the request for CBP Commissioner Scott to appear before the court. A legal fight regarding finally liquidating entries is likely to ensue.
This Day in Legal History: The Indian Citizenship Act of 1924On this day in 1924, President Calvin Coolidge signed the Indian Citizenship Act, also called the Snyder Act, declaring that all Native Americans born within the territorial limits of the United States were U.S. citizens. It is one of those laws that sounds, in retrospect, like it cannot possibly have been necessary — and yet it was. For most of the country's first 150 years, the federal government treated Native people as members of separate sovereign nations whose status under American law was, at best, ambiguous. Earlier vehicles for citizenship — the Fourteenth Amendment, the Dawes Act, military service in World War I — had reached only some Native people, and a string of Supreme Court decisions had taken the position that being born inside the United States to a member of a tribe did not, on its own, make a person a citizen.The Snyder Act fixed that with a single sentence.What it did not fix was voting: many states continued to bar Native citizens from the ballot for decades afterward, on a variety of pretexts that were eventually struck down one by one. The Act also did not affect tribal citizenship — Native people are dual citizens of their tribe and the United States, which is part of why federal Indian law continues to occupy a separate doctrinal universe. June 2 is a quietly important date on the calendar of American citizenship, and a reminder that the seemingly obvious questions of who counts as an American have, for long stretches of our history, not been obvious at all.Florida Attorney General James Uthmeier announced Monday that his office has filed a civil lawsuit against OpenAI and its CEO Sam Altman, arguing that the company is misleading parents about the safety of ChatGPT and pointing to incidents in which young users were allegedly nudged toward violence by the chatbot. The complaint follows a criminal investigation Uthmeier's office opened in April, after a deadly mass shooting at Florida State University in 2025 that the AG says ChatGPT helped facilitate. Florida is asking for civil penalties and an order forcing OpenAI to redesign the product, including adding meaningful parental controls.The legal angle here is essentially a state consumer-protection theory: a state attorney general claiming that the company's marketing of a product as safe-for-kids is deceptive, and that the company is therefore on the hook under the state's unfair-trade laws. Whether that survives a motion to dismiss is going to depend a lot on whether the court treats ChatGPT as a “product” in the traditional sense — software has, for decades, gotten more leeway than physical products under product-liability law, and Section 230 of the federal Communications Decency Act has historically immunized platforms for what users post.The new wrinkle is that generative AI doesn't fit neatly into either bucket — ChatGPT produces its own output rather than hosting somebody else's — and several courts are now beginning to grapple with that distinction. Expect this case to be one of the early test cases for how AI companies get sued in the U.S.Florida AG Sues OpenAI, Says ChatGPT Spurs Violence | Law360The Supreme Court on Monday declined to hear an appeal from asbestos victims who had challenged a corporate bankruptcy tactic known as the “Texas Two-Step” — leaving in place a Fourth Circuit ruling that lets companies use the maneuver to corral mass-tort claims into bankruptcy court.The Two-Step works like this: a healthy company splits itself into two using a Texas state-law provision that allows divisional mergers, dumps its asbestos or talc or opioid liabilities into the newly created spinoff, and then puts only the spinoff into Chapter 11. The result is that injury claimants get herded into a bankruptcy proceeding where their leverage is sharply limited, even though the parent company that actually caused the harm is still solvent and operating.The case the Supreme Court turned away involved Bestwall, a spinoff of Georgia-Pacific that has been in Chapter 11 since 2017. The Third Circuit threw out a similar Johnson & Johnson talc-unit bankruptcy in 2023 on the ground that the spinoff wasn't actually in financial distress, but the Fourth Circuit went the other way in this case, and the Supreme Court's denial of review leaves that split standing for now. The bigger picture: a powerful settlement-shaping tool stays on the menu for corporate defendants facing waves of mass-tort litigation, and the next big talc, opioid, or asbestos defendant looking to manage a docket of claims now knows the Two-Step is at least available in the Fourth Circuit.Justices Won't Hear Challenge To ‘Texas Two-Step' Ch. 11 | Law360A group of IKEA customers filed a proposed class action against the Swedish retailer Monday in U.S. federal court, arguing that they overpaid for furniture during the period when President Trump's import tariffs were in effect — tariffs that the Supreme Court has since struck down — and that they are entitled to a share of the refunds the company will now collect from the federal government. It is one of the first big consumer-side cases to follow the Supreme Court's tariff ruling, and the legal theory is novel: importers paid the tariffs, then passed those costs through to consumers in the form of higher sticker prices, and now that the government is sending refunds back to importers, the customers who effectively bore the cost are asking for a piece of that money.Some major shippers like FedEx and UPS have already publicly committed to passing tariff refunds back to their customers; IKEA, the suit alleges, has not. Whether the claim survives depends largely on whether the court is willing to treat the relationship between retailer and customer as something like a constructive trust or unjust enrichment, rather than an arm's-length sale at a final price. If even one of these cases succeeds, expect copycat suits against every other large importer that quietly built tariff costs into retail prices over the last several years.IKEA customers sue for share of Trump tariff refunds | Reuters This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit www.minimumcomp.com/subscribe
This week we talk about: Zoho CRM: Old UI Retires July 15 Zoho Bigin: AI Features Come to the Web App Zoho Payments: Manage Payment Links, Refunds, and Payouts with MCP Our Code Share, Read, Tip and Question of the Week Read the show notes: https://zenatta.com/episode-408/ ⚖️ Avalara AvaTax for Zoho CRM ⚖️ Accurate Tax. Zero Guesswork.
(11) Gene Marks reports from Nashville that mid-market companies are aggressively adopting AI to supplement labor shortages rather than replace workers, while also navigating the complexities of receiving refunds for previously paid tariffs.1942 LANCASTER PA ARMISTICE DAY
Is wholesale distribution entering its most disruptive era yet?In this episode of Around the Horn in Wholesale Distribution, Kevin Brown, Tom Burton, and Mark Gilham of Enable unpack the forces reshaping the B2B supply chain: inflation measurement debates, Federal Reserve strategy, tariff refund accounting risks, buying group consolidation, maritime trade choke points, and the growing influence of AI on distributor–manufacturer relationships. This episode explores how data-driven decision making is shifting the industry from relationship-based instinct to AI-powered commercial intelligence, and what that means for distributors, manufacturers, CFOs, and industry leaders.What You'll Learn:The difference between core inflation vs trimmed average inflation, and why the metric matters for CFO planning, pricing strategy, and capital investment decisionsHow a more flexible Federal Reserve approach impacts interest rate modeling, debt refinancing, and working capital strategy in wholesale distributionWhy tariff refunds create accounting, tax, and downstream pricing pressure, and how distributors and manufacturers should prepareThe real impact of global maritime choke points like the Strait of Hormuz, Suez Canal, Panama Canal, and South China Sea on supply chain resilienceWhy buying groups like Evergreen are consolidating, and how rebate economics drive churn and competitive pressureHow AI could disrupt traditional distributor–manufacturer relationships by prioritizing margin analytics, pricing optimization, and product substitution models over loyaltyEpisode Highlights:03:22 – Mark Gilham explains how Enable connects manufacturers and distributors through rebate and pricing intelligence11:45 – Core inflation vs trimmed average inflation: what's the difference and why does it matter for distributors?24:41 – A Greenspan-style Fed strategy: how rate uncertainty changes business forecasting42:30 – Tariff refund accounting risks and downstream pricing pressure across the supply chain57:45 – The six global maritime choke points and why “just-in-time” models increase fragility1:00:41 – Why Evergreen shut down and what buying group consolidation means for distributors1:14:42 – Manufacturers' growing concern: will AI override decades of channel relationships?1:23:48 – “It all depends on the brief the AI has.” How AI configuration shapes profitability and channel outcomesMeet the Guest:Mark Gilham is a former distributor CFO and now a leader at Enable, a pricing and rebate management platform focused on helping manufacturers and distributors trade more intelligently in the B2B ecosystem. His expertise bridges finance, pricing strategy, rebate optimization, and AI-driven commercial execution.Tools, Frameworks, and Strategies Mentioned:Enable Rebate Management and Pricing IntelligenceLeadSmart Enterprise Growth PlatformRevenue Expander white space analyticsPrediction market data modeling for interest rate forecastingAI-driven commercial optimization and margin normalization modelsClosing Insight:“Future decisions are not going to be made based on a relationship. They're going to be made based on what the AI model tells the distributor.”As wholesale distribution evolves, the competitive edge will belong to organizations that combine trusted relationships with structured data, commercial intelligence, and AI-ready infrastructure.Leave a Review: Help us grow by sharing your thoughts on the show.Learn more about the LeadSmart AI B2B Sales Platform: https://www.leadsmarttech.com/Join the conversation each week on LinkedIn Live.Want even more insight to the stories we discuss each week? Subscribe to the Around The Horn Newsletter.You can also hear the podcast and other excellent content on our YouTube Channel.Follow us on Facebook, Twitter, Instagram, or TikTok.
Bank of America is refunding annual fees for every Spirit credit cardholder and product-changing them into no-fee cards. Mark thought he found a hack in Choice Hotels' gas promo — until Choice's IT outsmarted him. Discover dropped the best welcome offer in its history: $200 after $500 spend, plus double cash back the first year. Then there's the Sapphire Reserve debate — when does pay-yourself-back make the annual fee worth it? What we cover: Spirit credit cardholders get annual fee refunds from Bank of America Choice Hotels gas promo: 8,000 points for a $50 card? Not so fast Discover's best-ever offer: $200 bonus + double cash back = five quarters of 5% Capital One acquiring Discover — could cash back become miles? Rethinking the Sapphire Reserve: when couponing actually works Pay Yourself Back at 1.25 cents and who should keep the card Schwab cashout strategy: turning Membership Rewards into cash Episode Guide: 0:00 - Welcome to MTM Travel 1:01 - Spirit Credit Card Update 5:47 - Choice Hotels Gas Promo 11:17 - Why Discover Card Right Now 16:13 - Rethinking the Sapphire Reserve 22:08 - Spouse Points & Wrap-Up ✈️ Track your travel credit cards for free
Here's a number that should grab your attention: $175 billion. That's the estimated amount in tariff refunds the U.S. government may owe American businesses. Neil Twa, your host on The High Voltage Business Builders Podcast, breaks down how Amazon sellers can reclaim their share. With the CBP's Claims and Processing Engine (CAPE) now accepting refund requests, the opportunity is ripe. Whether you're a seller just starting out or managing a $1M+/month operation, understanding the mechanics of this refund process is crucial. Neil paints two pictures: one of a seller doing $10K/month and another at $100K/month. The same opportunity lands differently based on your scale. With 56,000 importers already registered, many Amazon sellers who qualify haven't filed yet. Neil lays out three actionable moves: start by pulling your import records from 2018-2020. Don't leave money on the table. Ready to audit your AI readiness? Take the free 5-question assessment — voltagedm.com/aiquiz?utm_source=rss&utm_medium=show_notes&utm_campaign=ep276
In February, a split ruling by the U.S. Supreme Court struck down the sweeping tariffs President Trump had imposed early last year under the International Emergency Economic Powers Act. As a result, last month the Customs and Border Protection agency opened an online portal for importers and customs brokers seeking refunds for the estimated $166 billion in tariffs companies had paid to import goods. Some businesses have started receiving their refunds while others, including Portland-based Steven Smith Teamaker, are still waiting. As the Portland Business Journal reported last month, 90% of its ingredients are imported, making the company subject to volatile tariff rates that rocketed as high as 50% at one point last year. CEO Darren Marshall says that the company spent hundreds of thousands of dollars in unanticipated tariffs, which its customs broker has filed on its behalf to recoup. Revant Optics, a Portland manufacturer of replacement lenses for sunglasses that launched its own line of sunglasses last June, is owed nearly $700,000 in duties it paid on imports from China and Taiwan, according to CEO and founder Jason Bolt. Marshall and Bolt join us for more details, along with Chris McKinney, president of Brownstone International. The Portland-based customs broker has filed claims for tariff refunds on behalf of Steven Smith Teamaker and dozens of other clients.
Seven-figure Amazon sellers from Bulgaria and Romania reveal EU marketplace strategies, Prime Day tactics, ranking tips, Rufus optimization, and how they built brands across Amazon marketplaces. ► Watch The Podcasts On Youtube: https://www.youtube.com/@Helium10SeriousSellersPodcast?sub_confirmation=1 ► Instagram: instagram.com/serioussellerspodcast ► Free Amazon Seller Chrome Extension: https://h10.me/extension ► Sign Up For Helium 10: https://h10.me/signup (Use SSP10 To Save 10% For Life) ► Learn How To Sell on Amazon: https://h10.me/ft What does it take to build a seven-figure Amazon business from Eastern Europe? In this episode, Bradley Sutton travels to Sofia, Bulgaria, to speak with experienced sellers from Bulgaria and Romania who have built successful brands across Amazon Europe and the U.S. marketplace. From workwear and cosmetics to pet supplements and ranking services, these sellers share how they found opportunities, scaled their businesses, and adapted as Amazon continued to evolve. The first story follows Iliyan Ivanov, a seller who started in advertising and eventually turned local workwear demand into a multi-million-euro Amazon brand. He explains how starting early in Amazon Europe gave his listings maturity, how Germany became his top marketplace, and how keyword research helped uncover unexpected seasonal demand around Germany's carnival season. He also shares how tools like Cerebro and Managed Refund Services helped his team with listing strategy, keyword targeting, and recovering money from Amazon reimbursement opportunities. Next, Bradley talks with another Bulgarian seller, Milcho Vasilkov, who used product research to enter categories he was not personally connected to, including maternity products, cosmetics, grocery, and pet supplements. His story highlights why sellers do not always need to be passionate users of their own products to succeed. Instead, he focuses on demand, customer behavior, marketplace fit, and smart inventory timing. He also shares Prime Day strategies for Europe, including building keyword position ahead of major events, using account managers for top deals, and extending promotional momentum after Prime Day. The final guest brings a Romanian seller, Marcel Marculescu's perspective on selling in the U.S. marketplace, and how Amazon ranking has changed over the years. After early product failures, he built a seven-figure Amazon business and now focuses on more sustainable launch and ranking strategies. He discusses using Vine, PPC, external traffic signals, keyword research, and Rufus optimization to help listings perform in today's Amazon ecosystem. The episode closes with a powerful reminder for sellers: success does not come from a single hack or a single marketplace, but from adapting early, watching the data, and building strategies that can survive the next Amazon change. In episode 748 of the Serious Sellers Podcast, Bradley, Iliyan, Milcho, and Marcel discuss: 00:00 - Seven-Figure Sellers From Bulgaria And Romania 00:50 - From Advertising Agency To Amazon Seller 03:20 - Launching A Bulgarian Workwear Brand 04:07 - Growing To €3 Million In Revenue 05:04 - Finding A Carnival Keyword Opportunity 07:43 - Expanding Across Amazon Europe 11:36 - Cerebro, Refunds, And Helium 10 Wins 14:21 - Building Mature Listings Against Copycats 18:41 - Product Research In Pregnancy And Baby Niches 23:00 - A Half-Million-Euro Product Launch 27:20 - Prime Day PPC And Deal Strategies 31:04 - Ranking, Rufus, And Modern Amazon Launches
The Automotive Troublemaker w/ Paul J Daly and Kyle Mountsier
Shoot us a Text.Episode #1343: Today we discuss dealers trusting people more than AI when it comes to lead follow-up, Hyundai's big push to repair struggling service satisfaction, and a new lawsuit claiming Toyota buyers deserve part of a potential $9 billion tariff refund.Show Notes with links:Dealers still believe the human element wins the sale. A new Urban Science survey shows strong confidence in showroom sales teams, but much less trust in AI lead follow-up. The takeaway? Dealers want better process visibility before turning things over to automation.72% of dealers said they're highly confident in their sales teams' ability to convert leads, while 75% say they respond in under five minutes.Dealers still see weak spots: 38% cited lack of real-time insight into lost sales and 34% pointed to inconsistent follow-up.Consumers expect speed. Urban Science found 82% say follow-up matters, and 72% expect a response within 24 hours.AI still has a trust gap. Only 14% of dealers trust AI tools for lead follow-up compared to 57% who trust in-house sales teams.Urban Science's Eric Demont said dealers need “a clear understanding” of wins, losses and defection patterns to improve conversion rates.Hyundai is trying to fix one of its biggest weak spots: service satisfaction. After years of complaints about delays, parts shortages and overloaded service departments, the automaker is rolling out mobile service vans, technician recruiting and dealership efficiency programs to win customers back.Hyundai says poor service capacity and years of engine replacement recalls overwhelmed dealerships and dragged down customer satisfaction scores.The brand has added 4,000+ service bays nationwide, while dealers are extending hours and adding Saturday service to handle demand.Hyundai plans to launch a 150-van mobile service fleet by year's end to handle oil changes, brake jobs, software updates and other light repairs at homes or workplaces.The company is also recruiting more technicians, improving diagnostic training and coaching 185 dealerships on service efficiency and workflow gaps.Hyundai's Michel Poirier said the goal is climbing back up JD Power rankings by 2028, adding: “Service is the most important part of the business.”A California Toyota buyer is taking aim at tariff pricing, claiming customers helped foot the bill for billions in import costs — and should get paid back if Toyota ever receives tariff refunds. A proposed class action lawsuit claims Toyota passed tariff costs onto buyers through higher vehicle and parts prices.The filing covers buyers and lessees of qualifying Toyota vehicles purchased between February 2025 and February 2026.Toyota reportedly absorbed about $9 billion in tariff-related costs tied to Japan, Canada and Mexico operations.Toyota recently raised prices on several models, including a $1,600 increase for the 2026 Sequoia, while calling the changes part of a “regular review of the prices.”Join Paul J Daly and Kyle Mountsier every morning for the Automotive State of the Union podcast as they connect the dots across car dealerships, retail trends, emerging tech like AI, and cultural shifts—bringing clarity, speed, and people-first insight to automotive leaders navigating a rapidly changing industry.Get the Daily Push Back email at https://www.asotu.com/JOIN the conversation on LinkedIn at: https://www.linkedin.com/company/asotu/
Seems a little too good to be true. Well, a school in New York is so confident in its students that it is willing to put the bank on them. Plus, Wippa hopes to get into business school with his invention: The Mup.See omnystudio.com/listener for privacy information.
This Day in Legal History: Plessy v. FergusonOn May 18, 1896, the U.S. Supreme Court decided Plessy v. Ferguson, a case that became one of the most infamous constitutional decisions in American history. The dispute arose from a Louisiana law requiring separate railroad cars for Black and white passengers. Homer Plessy, who was of mixed race, deliberately sat in a whites-only rail car to challenge the law. After he was arrested, Plessy argued that the statute violated the Thirteenth and Fourteenth Amendments. The Supreme Court rejected that argument and held that racial segregation did not violate the Constitution as long as the separate facilities were considered equal. This became known as the “separate but equal” doctrine.In practice, the doctrine gave legal cover to segregation across the South and helped support the broader Jim Crow system. The Court treated segregation as a matter of public policy rather than as a badge of racial inferiority imposed by law. Justice Henry Billings Brown wrote the majority opinion, reasoning that enforced separation did not necessarily imply inequality. Justice John Marshall Harlan dissented, warning that the Constitution should be color-blind and that the ruling would become as harmful as the Court's decision in Dred Scott. His dissent later became one of the most important statements in American civil-rights law. For nearly six decades, Plessy allowed governments to maintain racially separate schools, transportation, and public facilities.The decision was finally undermined in 1954, when the Supreme Court decided Brown v. Board of Education and rejected segregation in public education. Plessy remains a stark example of how constitutional interpretation can either protect civil rights or help entrench systems of inequality.A proposed class action filed in Washington federal court accuses Amazon of keeping money it allegedly collected from customers through prices inflated by now-invalidated Trump administration tariffs. The plaintiffs say Amazon could seek refunds from the federal government after the U.S. Supreme Court struck down the tariffs, but has refused to do so because it wants to stay in President Trump's good graces. The lawsuit claims Amazon passed tariff costs on to shoppers, then failed to commit to returning that money even though other retailers have allegedly pursued refunds. The customers point to Amazon's abandoned plan to show tariff-related price increases on product pages as evidence that the company can identify both the tariff amounts and the consumers who paid them. They also claim Amazon backed away from that plan after criticism from the Trump administration and a call involving Amazon CEO Jeff Bezos.The complaint alleges violations of the Washington Consumer Protection Act, unjust enrichment, and money had and received. The plaintiffs say Amazon misled consumers by suggesting tariffs were not increasing prices, while allegedly raising prices on certain low-cost goods after the tariffs took effect. They also argue Amazon failed to tell customers it would not seek tariff refunds even if the tariffs were later found unlawful. The proposed class would include Amazon customers who paid tariff-related surcharges from February 4, 2025, through February 20, 2026. The suit estimates the class could include tens of millions of buyers and seeks to recover money the plaintiffs say belongs to consumers. Similar lawsuits have been filed against other major companies, including Nike, Sony, Nintendo, Costco, Temu, and FedEx.Amazon Skipped Tariff Refunds To Appease Trump, Suit Says - Law360The Federal Circuit held its biennial judicial conference in Washington, D.C., bringing together its active judges, agency leaders, district judges who have recently sat by designation, Chief Justice John Roberts, and Solicitor General D. John Sauer. Chief Judge Kimberly Ann Moore opened the event with lighter moments, including praise for Senior Judge Raymond C. Clevenger and the debut of an AI-generated Federal Circuit theme song meant to make the court feel more accessible. The conference did not address the ongoing suspension of Judge Pauline Newman, although she attended the event while continuing to challenge the suspension at the Supreme Court. Judge Moore said the court issued 630 opinions in 2025, its highest total in a decade, and noted an effort to use fewer one-line Rule 36 affirmances. Still, court leaders and practitioners criticized Rule 36 decisions, especially because they give lower courts and litigants little explanation.The judges also discussed en banc arguments, emphasizing that lawyers must stay focused because full-court arguments leave little time for extended exchanges with any one judge. A major theme was the renewed use of district judges sitting by designation, with 23 visiting judges helping decide nearly 200 cases since February 2024. Visiting district judges said the experience gave them a new appreciation for appellate work, the quality of Federal Circuit advocacy, and the process of narrowing trial records into appealable issues. Federal Circuit judges also described sitting on other courts, including in criminal sentencing matters, which several said gave them a deeper appreciation for the workload and human stakes faced by district judges. The judges offered practical advice to lawyers, urging them to narrow issues, address weaknesses directly, provide full context for citations, and make appropriate concessions. USPTO Director John Squires also appeared and defended his approach to discretionary denials of inter partes review petitions, saying he is returning the process to what Congress intended under the America Invents Act.Fed. Circ. Drops A Theme Song, Talks Guest Judges - Law360President Donald Trump has dropped his $10 billion lawsuit against the IRS and Treasury Department, a move linked to discussions about creating a $1.8 billion compensation fund for people who claim they were unfairly investigated by prior administrations. The court filing did not describe any settlement, but Trump's lawyers said the case was still early enough that he could dismiss it without court permission or IRS approval.The dismissal was filed “with prejudice,” meaning Trump cannot bring the same claim again. Trump and his sons filed the lawsuit in January, accusing the IRS of failing to protect confidential tax information after his tax records were leaked. A former IRS contractor, Charles Littlejohn, was sentenced to prison for leaking Trump's tax information as well as records belonging to many others. Trump brought the case as a private citizen, not in his official role as president. The federal judge overseeing the case had already questioned whether a sitting president could properly seek personal monetary damages from an agency inside the executive branch.The dismissal follows settlements in lawsuits brought by Trump allies, including Michael Flynn and Carter Page. Shortly after Trump's filing, House Democrats submitted a brief accusing him of self-dealing and arguing that any attempt to use the court process to support a settlement should be closely reviewed.Trump drops lawsuit against IRS amid talks of establishing a $1.8 billion fund for allies | CNN Politics This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit www.minimumcomp.com/subscribe
CBP says it has processed 8.3 million IEEPA refund entries and paid $35.46 billion so far. For more information, listen to today's Two Minutes in Trade.
In this episode, we uncover the hidden profit leaks silently draining revenue from ecommerce stores.Scott Brandley, CEO of Shopper Approved and Trust Guard, joins DJ Sprague, CMO of Shopper Approved and co-author of Cashflo, to reveal the ten revenue killers costing online stores up to 38% of gross revenue.They break down how issues like cart abandonment, false declines, chargebacks, and product returns quietly erode profit margins and share proven strategies ecommerce brands can use to recover lost revenue and improve cash flow. Topics discussed in this episode: Why 70% of carts get abandoned. How false declines kill legitimate sales. What chargebacks actually cost merchants. Why product returns bleed your profits. How store credit beats issuing refunds. What the ten revenue killers are. Why fix leaks before scaling ads. How one platform eliminates software costs. What the Cashflo Club offers. Why chargebacks are rising and dangerous. Links & ResourcesWebsite: https://cashflopay.com/Download free book & audio book: https://cashflopay.com/Get access to more free resources by visiting the show notes at https://tinyurl.com/58c9kk7zI'd love your feedback. Tap the the link to send me a text. ______________________________________________________LOVE THE SHOW? HERE ARE THE NEXT STEPS!Follow the podcast to get every bonus episode. Tap follow now and don't miss out! Rate & Review: Help others discover the show by rating the show on Apple Podcasts at https://tinyurl.com/ecb-apple-podcasts Join our Free Newsletter: https://newsletter.ecommercecoffeebreak.com/ Support The Show On Patreon: https://www.patreon.com/EcommerceCoffeeBreak Partner with us: https://ecommercecoffeebreak.com/partner-with-us/
The Paychex Business Series Podcast with Gene Marks - Coronavirus
Businesses that paid illegal tariffs under the Emergency Economic Powers Act could get a windfall if they follow the process to request a refund. Host Gene Marks shares some tips to facilitate getting your money back. He also says the latest Paychex Small Business Employment Watch reveals that employers are being cautious about hiring but aren't cutting back on the practice amid economic uncertainty. Optimism abounds in the agentic AI commerce arena, where experts forecast that it could drive more than one trillion dollars in U.S. retail in the next four years. Topics: 00:00 – Introduction 01:00 – Tariff Refunds 04:13 – Small Business Employment Watch 05:46 – Agentic AI Commerce Forecast 09:29 – Episode Wrap-up Additional Resources Meet Paychex: https://bit.ly/3VtM6bs Paychex Small Business Employment Watch: https://bit.ly/paychex-sbew DISCLAIMER: The information presented in this podcast, and that is further provided by the presenter, should not be considered legal or accounting advice, and should not substitute for legal, accounting, or other professional advice in which the facts and circumstances may warrant. We encourage you to consult legal counsel as it pertains to your own unique situation(s) and/or with any specific legal questions you may have.
THE LEGISLATURE IS MAKING US GIVE BACK OUR TABOR REFUNDS FROM LAST YEAR Because they can't control their out of control spending. We will chat with Barb Kirkmeyer about this, something she has already said is illegal. Read it here.See omnystudio.com/listener for privacy information.
U.S. Customs and Border Protection says it has processed more than 35.4 billion dollars in tariff refunds and interest as of Monday.
Legal Docket on Temporary Protected Status for Haiti and Syria, Moneybeat on labor data and tariff refunds, and History Book on Milton Hershey's chocolate business. Plus, the Monday morning newsSupport The World and Everything in It today at wng.org/donateAdditional support comes from Dordt University, whose online MBA and MPA programs prepare leaders for lasting impact. Dordt University. Until All Is Made New. From St. Dunstan's, inviting young men into the building arts and the adventure of holiness on a Blue Ridge Mountains farm... stdunstansacademy.orgAnd from WatersEdge. Most churches aren't ready if their bookkeeper left tomorrow. WatersEdge Ministry Accounting is. Watersedge.com/accounting. WatersEdge securities are subject to certain risk factors as described in our Offering Circular and are not FDIC or SIPC insured. This is not an offer to sell or solicit securities. WatersEdge offers and sells securities only where authorized; this offering is made solely by our Offering Circular.
U.S. Customs and Border Protection has begun processing applications for tariff refunds after the Supreme Court ruled against swaths of President Trump's import taxes. But limitations on who can file have left some business owners out of luck, and those who are eligible must weigh how much of the refund they should pass down the line. Also on the program: the jobs report, tech layoffs, and what the Spirit Airlines shutdown could mean for a Florida airport. Plus: a look into Sparrow's Nest Studio, Manhattan's “home for mahjong.”Every story has an economic angle. Want some in your inbox? Subscribe to our daily or weekly newsletter.Marketplace is more than a radio show. Check out our original reporting and financial literacy content at marketplace.org — and consider making an investment in our future.
U.S. Customs and Border Protection has begun processing applications for tariff refunds after the Supreme Court ruled against swaths of President Trump's import taxes. But limitations on who can file have left some business owners out of luck, and those who are eligible must weigh how much of the refund they should pass down the line. Also on the program: the jobs report, tech layoffs, and what the Spirit Airlines shutdown could mean for a Florida airport. Plus: a look into Sparrow's Nest Studio, Manhattan's “home for mahjong.”Every story has an economic angle. Want some in your inbox? Subscribe to our daily or weekly newsletter.Marketplace is more than a radio show. Check out our original reporting and financial literacy content at marketplace.org — and consider making an investment in our future.
Stephen Woldenberg, SVP of Sales for Learning Resources Brands, the plaintiff in the Supreme Court ruling against President Trump's tariffs, joins John Williams to talk about when the company expects to see refunds. Stephen also tells John if the company is expecting even more refunds after federal judges ruled yesterday that Trump’s global 10% tariffs were illegal.
In a 2-1 decision the CIT has ruled the section 122 tariffs unlawful. Here's what importers should be thinking about for a possible round-two of refunds in light of an almost certain appeal.
Stephen Woldenberg, SVP of Sales for Learning Resources Brands, the plaintiff in the Supreme Court ruling against President Trump's tariffs, joins John Williams to talk about when the company expects to see refunds. Stephen also tells John if the company is expecting even more refunds after federal judges ruled yesterday that Trump’s global 10% tariffs were illegal.
Stephen Woldenberg, SVP of Sales for Learning Resources Brands, the plaintiff in the Supreme Court ruling against President Trump's tariffs, joins John Williams to talk about when the company expects to see refunds. Stephen also tells John if the company is expecting even more refunds after federal judges ruled yesterday that Trump’s global 10% tariffs were illegal.
In this episode, we kick things off by examining a major freight broker navigating a brutally tough first quarter but projecting a significant turnaround ahead. RXO released its earnings Thursday morning, reporting a first-quarter adjusted EBITDA of just six million dollars, down sharply from twenty-two million dollars a year earlier. Despite compressed margins, the company aggressively shifted its strategy by increasing its spot mix to thirty-three percent of volume, helping produce what RXO described as the largest sequential increase in gross profit per load in more than three years. Looking ahead, the broker is forecasting a much stronger second quarter with adjusted EBITDA expected to land between twenty-seven million dollars and thirty-seven million dollars. Next, we explore the trade sector where billions of dollars in tariff refunds are finally beginning to flow through a newly launched federal portal. U.S. Customs and Border Protection rolled out its Consolidated Administration and Processing of Entries tool, known as CAPE, on April twentieth within the Automated Commercial Environment portal. The digital platform is processing claims far more efficiently than anticipated, with refunds potentially arriving in early May. However, a massive readiness gap is emerging, as CBP estimates roughly forty-six billion dollars in refunds is currently stalled for importers that have not completed ACH refund authorization or established proper portal access. Finally, we cover a controversial regulation governing commercial driver's licenses as a federal court denied a request to block the rule for non-domiciled drivers on Tuesday. A three-judge panel in the U.S. Court of Appeals for the District of Columbia denied a motion seeking to stay enforcement of FMCSA's rule, which became effective March sixteenth and specifies that non-domiciled CDLs are available only to H-2A, H-2B, and E-2 visa holders, excluding asylum seekers, asylees, DACA recipients, refugees, and people with temporary protected status. While the stay was denied, the combined cases will move forward with petitioners' briefs due June fifteenth and oral arguments expected in September. Follow the FreightWaves NOW Podcast Other FreightWaves Shows Learn more about your ad choices. Visit megaphone.fm/adchoices
The current episode elucidates the profound bifurcation within the furniture industry as revealed by the first quarter earnings of 2026. Notably, the disparities between companies that are thriving and those that are faltering have become increasingly pronounced, underscoring the significance of strategic positioning and preemptive decision-making. As we navigate through the data, we observe that successful entities, such as Havertys and Bob's Discount Furniture, are not merely fortunate; their advancements stem from deliberate enhancements in service and product offerings that cater to discerning consumers. Conversely, companies like Lyons Furniture Group illustrate the adverse consequences of failing to adapt to the evolving market dynamics. This discourse extends to the broader manufacturing landscape, where vertical integration emerges as a vital strategy amidst ongoing economic pressures, thereby highlighting the imperative for companies to cultivate resilience and foresight in their operational frameworks. The episode provides an incisive exploration of the furniture industry's current landscape, particularly in the wake of the first quarter earnings for 2026. The narrative unfolds with a focus on the diverse trajectories of various companies, emphasizing that the performance gap has become increasingly pronounced. On one hand, we observe Havertys, which has achieved consecutive quarters of sales growth, bolstered by a strategic emphasis on high-value transactions facilitated through their design program. The contrast is stark with Lyons Furniture Group, which has reported a decline in sales, underscoring the challenges faced by traditional retail models in an environment where consumer behavior is evolving rapidly. This juxtaposition not only highlights the necessity for strategic foresight but also illustrates the critical role that operational adaptability plays in ensuring sustainability in today's market. The discussion further delves into the mattress sector, where the dichotomy between vertically integrated companies and traditional manufacturers becomes apparent. Somni Group International's remarkable financial rebound stands in stark contrast to Leggett and Platt's declining sales, providing a compelling case study on the advantages of vertical integration in mitigating the risks associated with market volatility. The analysis raises pertinent questions regarding the viability of traditional supply chain models in the face of rising costs and shifting consumer expectations, urging industry players to reevaluate their operational strategies. In addition to financial performance, the episode touches upon the implications of strategic partnerships within the industry, particularly the alliance between Palliser Furniture and Moto Motion. This partnership is indicative of a broader trend towards strengthening supply chain resilience amidst tariff pressures and global competition. The insights gleaned from this collaboration underscore the importance of strategic positioning and market integration as companies strive to navigate the complexities of the modern economic landscape. As we conclude, it becomes evident that the path to success in the furniture industry lies in a deep understanding of market dynamics, customer preferences, and the willingness to innovate in response to ongoing challenges.Takeaways:The first quarter of 2026 has revealed a stark divergence between companies gaining market share and those losing it, underscoring the importance of strategic decision-making prior to this period.Havertys' success is attributed to its robust service and design infrastructure, which has allowed it to convert higher value transactions despite a decline in foot traffic.Conversely, Lyons Furniture Group's decline in sales highlights the detrimental impact of macroeconomic factors and poor strategic positioning within the competitive landscape.The mattress sector is witnessing a bifurcation, where vertically integrated companies like Somni Group are thriving, while standalone operators confront significant structural challenges.HNI Corporation's acquisition of Steelcase has led to a substantial increase in revenue, albeit accompanied by significant integration costs and a reported net loss for the quarter.The retail outlook heading into Memorial Day indicates an increase in consumer intent to shop; however, this comes with a notable reduction in planned expenditure per purchase.
Host: Cindy Allen - Trade Force Multiplier Guest: Pete Mento, Baker Tilly Published: May 2026 Length: ~42 minutes Presented by: Global Training Center CAPE Audits, Broker Value, and the Real Cost of Recovery Cindy Allen sits down with Pete Mento of Baker Tilly for an unfiltered conversation about CAPE refunds, audits, broker fees, and what importers should really be thinking about as the refund process evolves. Pete shares why importers need to own the accuracy of their data, why audits matter before filing, and why the broker relationship is more important than ever when claims involve tariffs, offsets, and potential corrections. The episode also covers the value of customs brokerage, the debate over fee structures, and the impact of Amazon, reverse logistics, and broader market changes on the future of trade. What You'll Learn in This Episode CAPE rollout and audit readiness Pete explains why the CAPE system has performed better than many expected and why importers should not assume the filing itself means the hard work is over. He emphasizes that the data behind the filing needs to be accurate before it is uploaded. Importer responsibility A major theme of the conversation is that the importer, not the broker, is ultimately responsible for the information in the entry. Pete and Cindy discuss why internal audits, review of stack orders, and entry-level diligence are essential before requesting a refund. What broker services are worth The episode digs into broker compensation, including the difference between commodity pricing and value-based service. Cindy and Pete discuss when fee structures feel fair, when they do not, and why brokers should be paid for real work and not treated as a free commodity. Software, AI, and audit tools Pete and Cindy also explore the role of software in auditing and why technology can help customs professionals focus on higher-value issues instead of repetitive checks. They compare the evolution of ABI to the current rise of AI and automation in trade. Amazon and the future of logistics The conversation closes with thoughts on Amazon, freight forwarding, and reverse logistics, including how major players are reshaping delivery, returns, and consumer expectations. It's a forward-looking discussion about where trade and transportation may be headed next. Credits Host: Cindy Allen Guest: Pete Mento Presented by: Global Training Center Subscribe & Follow Stay up to date with the latest in global trade:
Find the FULL TRANSCRIPT for this episode at ProsperousCoach.com/384.Is there ever a time when a coach gives a client a refund?YES, and that is guided by your integrity.Strengthen the intuitive feelings that tell you when something is amiss in your coaching relationships such as· Weak or breached boundaries· Repetitive communication challengesThese things can be repaired if both you and your client are willing, which are great learning opportunities. Or, it might become clear there's not a good fit between you and your client.It's always YOUR CHOICE if and how you give a refund. We'll dig into that and more in this episode.If you haven't heard it already, I recommend that you listen to Episode 376 called When Coaching Clients Ask for Refunds it's a Symptom. Find that at ProsperousCoach.com/376.That episode covers how to proactively PREVENT the likelihood of clients asking for a refund by setting your business up in a professional way.
We've said from the get-go that consumers paid the bulk of President Trump's tariffs. Now that most were ruled illegal, individual businesses will get to decide whether to pass those tariff refunds onto customers. And any litigation of those decisions could take a while. Also in this episode: What's next for the airline industry sans Spirit, why GameStop wants to buy eBay, and how states could claw back some tax dollars.Every story has an economic angle. Want some in your inbox? Subscribe to our daily or weekly newsletter.Marketplace is more than a radio show. Check out our original reporting and financial literacy content at marketplace.org — and consider making an investment in our future.
We've said from the get-go that consumers paid the bulk of President Trump's tariffs. Now that most were ruled illegal, individual businesses will get to decide whether to pass those tariff refunds onto customers. And any litigation of those decisions could take a while. Also in this episode: What's next for the airline industry sans Spirit, why GameStop wants to buy eBay, and how states could claw back some tax dollars.Every story has an economic angle. Want some in your inbox? Subscribe to our daily or weekly newsletter.Marketplace is more than a radio show. Check out our original reporting and financial literacy content at marketplace.org — and consider making an investment in our future.
The Automotive Troublemaker w/ Paul J Daly and Kyle Mountsier
Shoot us a Text.Episode #1330: Carvana keeps its growth streak alive with record sales and profits, automakers book billions in tariff refunds (on paper), and AI is forcing retail media to evolve from impressions to real, measurable outcomes.Carvana is back in the fast lane, posting another record-breaking quarter with massive sales growth and strong profits. The online retailer continues to scale, signaling confidence in its long-term used car dominance.Carvana sold 187,393 vehicles in Q1 2026, up 40% year-over-year and marking its sixth straight quarter of 40%+ growth.Revenue jumped to $6.43B with net income hitting $405M, beating analyst expectations across the board.The company is expanding capacity, integrating ADESA sites, and building toward 1.5M annual unit capability—with room to reach 3M.Carvana expects continued growth in Q2, assuming stable market conditions and momentum holds.“We are continuing to hit records… and scale a business of Carvana's complexity at high speed,” said CEO Ernie Garcia.Automakers are seeing a short-term earnings lift from expected U.S. tariff refunds—but the cash isn't in hand yet, and the optics could get tricky. As billions in reimbursements loom, companies are balancing accounting wins with political uncertainty.Ford, GM, Mercedes, and Stellantis booked roughly $2.3B in expected tariff refunds, boosting Q1 profits on paper.Ford alone expects $1.3B back, GM about $500M, tied to overturned tariffs under IEEPA.Automakers stress the cash hasn't arrived yet—so it's not counted as free cash flow.The refund process could take months, adding uncertainty to already complex financial planning.The overturned IEEPA tariffs are just one piece—automakers still face ongoing import taxes on steel, aluminum, and vehicles and parts from Mexico, Canada, and beyond.As AI agents begin browsing, buying, and acting on behalf of users, Cloudflare says the internet isn't fully prepared. A new push for “agent readiness” could reshape how businesses structure sites, data, and digital experiences.Cloudflare warns most websites are built for humans—not AI agents that search, decide, and transact automatically.“Agent readiness” means structuring sites so AI can easily access, interpret, and act on information.This includes better APIs, structured data, and permissions for what agents can or can't do.Businesses may need to rethink UX entirely—designing for machines as much as for people.“The web is being rebuilt for agents,” Cloudflare suggests, signaling a major shift in how digital commerce will operate.Join Paul J Daly and Kyle Mountsier every morning for the Automotive State of the Union podcast as they connect the dots across car dealerships, retail trends, emerging tech like AI, and cultural shifts—bringing clarity, speed, and people-first insight to automotive leaders navigating a rapidly changing industry.Get the Daily Push Back email at https://www.asotu.com/JOIN the conversation on LinkedIn at: https://www.linkedin.com/company/asotu/
One of the many uncertainties clouding the U.S. economic picture is tariffs — both the prospect of new ones and upcoming refunds from those struck down by the Supreme Court. Stephanie Sy speaks with Jay Foreman, CEO of the toy maker Basic Fun!, for more on his company's efforts to claim tariff refunds. PBS News is supported by - https://www.pbs.org/newshour/about/funders. Hosted on Acast. See acast.com/privacy
Link Up w/The Morning Sickness Digitally All Over:Instagram: @hms_98_official, @bosskupd, @bretvesely, @dickToledoX/Twitter: @HMSon98, @DickToledo, @bretveselyFacebook: @HMSKUPDYouTube: @hmspodcast9320, @98kupdRequest/Call in/Wakeup Song line:(IN AZ) 602.585.9800More HMS: holmbergpodcast.com, 98kupd.comEmail: dtoledo@98kupd.com, bvesely@98kupd.com, bbogen@98kupd.comSee Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.
In this episode, Chris sits down with Nat Eliason - founder, writer, and now launching Alpha High, a new entrepreneur high school in New York City. Tuition is $150K a year. The promise: every student hits a million dollars in gross profit by graduation, or the family gets their tuition refunded. The first class is around 20 freshmen. The day is split between AI-driven academics in the morning and business building the rest of the day. This is also the same Nat Eliason who in his spare time built Felix - an AI agent he gave a Stripe account, an email, and an X handle, then told to launch a business overnight. Felix has done $60+K in sales since. Nat has not touched the code. They discuss: - Why the "game of school" is kayfabe and what's finally breaking it - The 16-year-old flying out to California to run short-form video for Al Pacino's new movie - The Munger inversion behind Alpha's curriculum: "why would these kids fail?" - What businesses a 14-year-old should and shouldn't build (and the $500/month software budget) - How Nat masters a new domain every two or three years, and why his $35K smart-contract loss accelerated him faster than caution would have - Felix - what "zero human" actually looks like, and the rules Nat set up to keep himself out of it - The day Anthropic shut off Open Claw and Alpha students reverse-engineered a proxy workaround in hours - Why the founding fathers wrote the Declaration in their early 20s, and what we forgot about teenagers Timestamps (05:55) What Alpha Does Differently From Conventional Schools (11:48) Playing the Fake Game of School (21:46) How Nat Masters New Domains (32:18) Open Claw Deep Dive (43:50) Building the Alpha Entrepreneurship Program (51:43) Freshman Year Structure at Alpha School (1:00:23) How Students Can Pitch for Equity or Debt Funding (1:04:08) Why Establish a New York Location for Year One (1:11:15) Nat's 10-Year Vision (1:15:11) AI as a Force Multiplier for Teenage Founders (1:16:15) How Alpha Students Quickly Reverse-Engineered a Workaround After Open Claw Went Down (1:24:13) Teen-Parent Conflict as a Symptom of Infantilization Support our Sponsors Collateral Partners: https://collateral.com/fort Chris on Social Media: X: https://x.com/fortworthchris Instagram: https://www.instagram.com/thepowerspodcast LinkedIn: https://www.linkedin.com/in/chrispowersjr/ Visit our website: https://www.powerspod.com/ Leave a review on Apple: https://bit.ly/45crFD0 Leave a review on Spotify: https://bit.ly/3Krl9jO
In this episode, Travis is joined by his producer Eric for a candid and entertaining conversation about taxes, refunds, and the realities of earning more money. From the benefits of hiring a CPA to debates around flat taxes and government policy, they break down complex financial topics in a relatable way—while sharing personal experiences and insights that highlight how everyday earners can better navigate the system. On this episode we talk about: Why hiring a CPA becomes essential as your income streams grow How tax systems work (and why they feel overly complicated) The debate between flat tax vs. progressive tax structures Recent increases in average tax refunds and what's driving them How most people are using their tax refunds (and what they should do instead) Top 3 Takeaways As your income becomes more complex (side hustles, business income, 1099s), a CPA can save you significantly more than they cost. The tax system can unintentionally discourage earning more due to higher marginal rates and complexity—understanding how brackets actually work is key. Most people use tax refunds to pay off debt or save, but proactively managing taxes year-round is more powerful than relying on a refund. Notable Quotes "When you start making more money, a CPA is worth their weight in gold." "Anything that disincentivizes growth and betterment seems like a strange system." "Money only solves your money problems—but it's easier to solve the rest with money in the bank." Connect with Travis: - LinkedIN: https://www.linkedin.com/in/travischappell/- Instagram: https://instagram.com/travischappell- Website: https://travischappell.com Travis Makes Money is made possible by High Level – the All-In-One Sales & Marketing Platform built for agencies, by an agency. Capture leads, nurture them, and close more deals—all from one powerful platform. Get an extended free trial at gohighlevel.com/travis Learn more about your ad choices. Visit megaphone.fm/adchoices
In this week's episode of the Rich Habits Radar, Robert Croak and Austin Hankwitz talk about Kevin Warsh's confirmation hearing, the $166B of tariffs being refunded, and the Strait of Hormuz continuing to cause volatility in the markets. ---
The discussion breaks down U.S. trade policy, tariffs, and claims about how American economic decisions helped shape China's rise into a global superpower. It also explores tensions with Canada, global trade imbalances, and whether the United States is now facing the consequences of decades of international economic strategy.
Donald Trump said he'd remember companies that decline to seek refunds on duties paid after the Supreme Court struck down a large swath of his tariffs. In February, the Supreme Court ruled against Trump's use of the International Emergency Economic Powers Act to impose so-called “reciprocal” tariffs on imports from nearly every country. That ruling set the stage for a complicated task that could become the largest repayment by the US government in its history. Trump's comments come a day after US Customs and Border Protection launched a web portal for importers to file requests for refunds that could total more than $160 billion. Yet, the process is still fraught with uncertainty. Independent media has never been more important. Please support this channel by subscribing here: https://www.youtube.com/channel/UCkbwLFZhawBqK2b9gW08z3g?sub_confirmation=1 Join this channel with a membership for exclusive early access and bonus content: https://www.youtube.com/channel/UCkbwLFZhawBqK2b9gW08z3g/join Five Minute News is an Evergreen Podcast, covering politics, inequality, health and climate - delivering independent, unbiased and essential news for the US and across the world. Visit us online at http://www.fiveminute.news Follow us on Bluesky https://bsky.app/profile/fiveminutenews.bsky.social Follow us on Instagram http://instagram.com/fiveminnews Support us on Patreon http://www.patreon.com/fiveminutenews You can subscribe to Five Minute News with your preferred podcast app, ask your smart speaker, or enable Five Minute News as your Amazon Alexa Flash Briefing skill. CONTENT DISCLAIMER The views and opinions expressed on this channel are those of the guests and authors and do not necessarily reflect the official policy or position of Anthony Davis or Five Minute News LLC. Any content provided by our hosts, guests or authors are of their opinion and are not intended to malign any religion, ethnic group, club, organization, company, individual or anyone or anything, in line with the First Amendment right to free and protected speech. Learn more about your ad choices. Visit megaphone.fm/adchoices
Live April 21, 2026 | Yaron Brook ShowIran Deal; Trump; Nuclear Iran?; Piker; Antisemitic Dems; Tariff Refunds; Cancer | Yaron Brook Show#MiddleEast #Israel #Lebanon #AI #Capitalism #Orbán #FreeSpeech #economicfreedom #Iran #Ceasefire #ForeignPolicy #objectivism The Yaron Brook Show is Sponsored by[The Ayn Rand Institute](https://www.aynrand.org/starthere)[Energy Talking Points, featuring AlexAI, by Alex Epstein](https://alexepstein.substack.com/)[Express VPN](https://www.expressvpn.com/yaron)[Hendershott Wealth Management](https://www.youtube.com/watch?v=X4lfC...) &(https://hendershottwealth.com/ybs/)[Michael Williams & The Defenders of Capitalism Project](https://www.DefendersOfCapitalism.com)[Support the Show]( / yaronbrookshow )[Sponsor the Show](askyaron@yaronbrookshow.com/)[One-time donation](https://bit.ly/2RZOyJJ)Join the [Yaron Brook Show YouTube channel]( / @yaronbrook )Like what you hear? Like, share, and subscribe to stay updated on new videos and help promote the [Yaron Brook Show](https://bit.ly/3ztPxTx)Continue the discussion by following Yaron on [Twitter](https://bit.ly/3iMGl6z) and [Facebook](https://bit.ly/3vvWDDC )Want to learn more about Ayn Rand and Objectivism? Visit the [Ayn Rand Institute](https://bit.ly/35qoEC3)Become a supporter of this podcast: https://www.spreaker.com/podcast/yaron-brook-show--3276901/support.Yaron is the executive chairman of the Ayn Rand Institute and a world class speaker. He is the coauthor of the national best-seller Free Market Revolution: How Ayn Rand's Ideas Can End Big Government, Equal is Unfair: America's Misguided Fight Against Income Inequality and In Pursuit of Wealth: The Moral Case for Finance. He speaks around the world on a variety of topics including the morality of capitalism, Ayn Rand and her philosophy, finance and economics, and the value of inequality.
The U.S. fired on and seized an Iranian cargo ship that tried to break through the naval blockade. Iran calls it piracy and vows to retaliate with the ceasefire set to expire Wednesday. Iran is back to blocking the Strait of Hormuz as both sides accuse each other of violating the ceasefire and a new round of talks in Pakistan have yet to be confirmed by Tehran. And businesses can finally start applying today for refunds on $166 billion in tariffs the Supreme Court ruled were collected illegally, in what could be a scramble for hundreds of thousands of importers.Correction: A previous version of the story "Iran reacts to ship seized" incorrectly said the death toll in Lebanon was more than 1,000. It is more than 2,000.Want more analysis of the most important news of the day, plus a little fun? Subscribe to the Up First newsletter.Today's episode of Up First was edited by Anna Yukhananov, Ruth Sherlock, Rafael Nam, Mohamad ElBardicy, and Lindsay Totty.It was produced by Ziad Buchh and Nia Dumas.Our director is Christopher Thomas.We get engineering support from Stacey Abbott. Our technical director is Carleigh Strange.(0:00) Introduction(01:55) Trump and Iran Talks(05:34) Iran Reacts to Ship Seized(09:30) Tariff RefundsSee pcm.adswizz.com for information about our collection and use of personal data for sponsorship and to manage your podcast sponsorship preferences.NPR Privacy Policy
In this edition of The Technological Trendpublic, Jack and special guest co-host Blake Wexler discuss their respective weekends, the US seizing an Iranian ship, Alex Karp's weird manifesto, the whole tariff refund situ, China winning the robot wars… again, a quick box office update and much more!See omnystudio.com/listener for privacy information.
Iran says it is clamping back down on the Strait of Hormuz, putting the ceasefire at risk. President Trump appeared at a rally at an Arizona megachurch. Businesses are preparing for Monday's launch of an online portal for tariff refunds.See pcm.adswizz.com for information about our collection and use of personal data for sponsorship and to manage your podcast sponsorship preferences.NPR Privacy Policy
In this week’s Failure Friday segment, we hear from a successful Shopify store owner who sells party favors. Everything was going great, until disaster struck in the form of a misplaced change to her order panel. Side Hustle School features a new episode EVERY DAY, featuring detailed case studies of people who earn extra money without quitting their job. This year, the show includes free guided lessons and listener Q&A several days each week. Show notes: SideHustleSchool.com Email: team@sidehustleschool.com Be on the show: SideHustleSchool.com/questions Connect on Instagram: @193countries Visit Chris's main site: ChrisGuillebeau.com Read A Year of Mental Health: yearofmentalhealth.com If you're enjoying the show, please pass it along! It's free and has been published every single day since January 1, 2017. We're also very grateful for your five-star ratings—it shows that people are listening and looking forward to new episodes.
In this special Easter Sunday episode of John Solomon Reports, we celebrate the season of renewal and family as we dive into important discussions that matter to Americans. Our first guest is Chairman Jason Smith of the House Ways and Means Committee, a pivotal figure in Congress who uncovered crucial IRS whistleblower testimonies that have reshaped the Hunter Biden investigation. We explore pressing issues, including accountability for nonprofits that misuse funds to incite violence or facilitate illegal immigration.In the second segment, we welcome Dean Cain, beloved for his role as Superman, who has transitioned into an ICE agent and a champion for conservative values in Hollywood. Dean shares his journey and the exciting family-friendly projects he is working on, highlighting his commitment to principled storytelling.Finally, we shift gears to a health segment with our friends at Pure Health Research, where we discuss dietary choices, the new food pyramid, and how to maintain a healthy lifestyle. This episode is packed with insights and inspiration for a brighter, healthier future.See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.