Front End Toolbox is a podcast for designer and front-end developers. Each show will focus in on a bite-sized tip on a tool or a hack that you should reach for in your next project.
In this episode, we dive into the latest developments in AI as 2024 draws to a close, focusing on major updates from both ChatGPT and Google's Gemini. The discussion covers ChatGPT's new universal search feature and mobile vision capabilities, along with OpenAI's Sora text-to-video model. The episode highlights Google's comeback with Gemini 2.0 and explores exciting new projects like Project Astra and Project Mariner, which aim to revolutionize how AI interacts with our physical and digital worlds. Special attention is given to Gemini Deep Research, a powerful new tool that can generate comprehensive research papers with proper citations by analyzing multiple web sources. This episode offers valuable insights into the rapidly evolving AI landscape and sets the stage for what's to come in the new year. Hosted on Acast. See acast.com/privacy for more information.
ChatGPT Search is finally here, but is it any good? We're digging into this new AI search engine in a casual, no-nonsense way, exploring its features, quirks, and comparing it to the reigning champ, Google. I'll walk you through my own experiences using ChatGPT Search, showing you what it can do, where it excels, and where it falls a little short. We'll talk about the integration with ChatGPT Plus, the sourcing and citations (because accuracy matters!), and whether it's really a more seamless experience. Find out if ChatGPT Search is the game-changer it promises to be, or just another shiny new tech toy! Join the conversation and let me know what you think of AI search in the comments! Hosted on Acast. See acast.com/privacy for more information.
In this episode of Stack Snacks, we dive into the challenges and surprises of living without Google Search. Our host shares their experience of replacing Google with AI-powered alternatives like Perplexity, ChatGPT, and Claude. From accidental searches on Android to missing out on Google's interactive sports updates, discover the hurdles and unexpected revelations of this digital detox. Whether you're an AI enthusiast or simply curious about reducing your reliance on tech giants, this episode offers valuable insights into the current state of search technology and the potential for AI-driven alternatives. Tune in for a candid exploration of life beyond the Google search bar and stay tuned for future updates on this ongoing experiment. Hosted on Acast. See acast.com/privacy for more information.
In this episode of StackSnacks, we dive into the game-changing impact of Claude 3.5 Sonnet and its Artifacts feature on software development. Our host shares personal experiences of how this AI tool significantly boosted productivity, allowing complex tasks to be completed in a fraction of the usual time. We explore the potential shift in development practices, the growing importance of well-documented frameworks and libraries, and the exciting future possibilities of AI-assisted coding. Whether you're a seasoned developer or just curious about AI's role in tech, this episode offers valuable insights into the evolving landscape of software creation. Join us as we discuss the benefits, potential drawbacks, and the inevitable changes coming to the world of coding. Hosted on Acast. See acast.com/privacy for more information.
In this episode of Stack Snacks, we dive into the latest and most exciting updates to Google's Gemini AI. We explore how Gemini is now integrating with essential Google apps like Keep, Calendar, and Docs, potentially revolutionizing personal productivity. The spotlight then turns to Gemini Live, Google's new voice-based AI companion, as we share first-hand experiences and speculate on its future impact. Join us for an insightful discussion on how these advancements might reshape our daily interactions with AI and what it means for the future of personal digital assistants. Hosted on Acast. See acast.com/privacy for more information.
Hosted on Acast. See acast.com/privacy for more information.
In this episode of the Stack Snacks Podcast, host John Siwicki discusses the recent release of Claude 3.5 Sonnet by Anthropic. He describes it as a mid-tier model in the Claude family and shares his positive first impressions, particularly praising its coding capabilities and the 'Artifacts' feature. John encourages listeners to try out Claude 3.5 Sonnet and share their experiences and demos.00:00 Introduction and Welcome00:07 Claude 3.5 Sonnet Announcement00:31 Initial Impressions and Benchmarks01:30 Coding Capabilities and Features02:30 Recommendations and Final Thoughts03:03 Conclusion and Call to Action Hosted on Acast. See acast.com/privacy for more information.
Recap of Apple's Exciting WWDC AnnouncementsIn this episode of Stack Snacks, we dive into the latest Apple Intelligence announcements from their WWDC event. Highlights include new writing tools for Mail, Notes, and Pages, significant enhancements to Siri's natural language understanding and context awareness, as well as upcoming features like image creation from text prompts. We also explore the security aspects of device-based versus cloud-based AI integrations. Join us as we discuss how these developments could revolutionize user interaction and privacy in Apple's ecosystem.00:00 Introduction and Overview00:26 Apple's New Writing Tools00:58 Revamping Siri02:06 Image Creation and Privacy03:25 Siri and Chat Integration04:41 Final Thoughts and Future Prospects06:16 Conclusion Hosted on Acast. See acast.com/privacy for more information.
On this episode of the Stack Snacks Podcast as he explores the latest AI features introduced by two popular apps—ClickUp and Coda. Discover how ClickUp's new tool, ClickUp Brain, enhances project management through AI-driven features like knowledge management, task automation, and an AI writer. Then, delve into Coda's innovative Coda Brain, which integrates seamlessly with other apps like Google Drive and Slack to enhance document creation and data management. Learn about these exciting new developments and how they aim to make AI more accessible and useful for everyone.https://clickup.com/aihttps://coda.io/product/coda-brain Hosted on Acast. See acast.com/privacy for more information.
In this episode, we dive into VoiceNotes, an innovative tool that transcribes your voice recordings and offers a variety of useful features. From creating summaries and to-do lists to generating tweets, blog posts, and emails, VoiceNotes enhances productivity for busy individuals. We share our experiences using the tool, highlighting its convenience, especially for new parents. Discover how VoiceNotes can simplify your workflow, and learn about its export options and subscription plans. Tune in to see if VoiceNotes is the right fit for you!https://voicenotes.com Hosted on Acast. See acast.com/privacy for more information.
Welcome to another episode of Stack Snacks! In this episode, we dive into the exciting new release from Perplexity Search called Perplexity Pages.Episode Highlights:Introduction to Perplexity Search and its unique ability to summarize search results and cite sources.Overview of the new Perplexity Pages feature, which allows users to create mini Wikipedia-like pages.Personal experiences with creating and experimenting with Perplexity Pages.Discussion on potential use cases for Perplexity Pages, including knowledge management and information organization.Insights on the current availability of this feature on the paid tier and hopes for its future accessibility to all users.Encouragement for listeners to explore Perplexity Pages and share their thoughts and experiences.Links & Resources:Perplexity Search(Link to a published Perplexity Page if available)Get in Touch:Have you tried Perplexity Pages? Let us know your thoughts and experiences in the comments or reach out to us on social media.Follow us on YouTube and visit our website for more episodes and content.Thanks for listening, and see you in the next episode! Hosted on Acast. See acast.com/privacy for more information.
Explore the latest updates and speculations surrounding OpenAI's ChatGPT and Google's Gemini ahead of major tech events.Dive into potential AI enhancements and what they could mean for everyday technology interactions.Discussion on the integration of AI with personal and work applications, and how this could revolutionize our digital experience.Anticipate the impact of new developments from OpenAI and Google, and what these could mean for the tech landscape.Tune in for expert analysis on upcoming AI trends and predictions for future technology integrations. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
Welcome to another episode of Stack Snacks! In this episode, host John Siwicki explores Facebook Meta's launch of their Meta AI product. He discusses the recent release of Llama 3, an open-source model that rivals GPT-4 in benchmarks. John then dives into the more interesting aspect of the product launch, which is the integration of Meta AI into WhatsApp, Facebook, Instagram, and the standalone site. He takes a closer look at the features and functionality of Meta AI, highlighting its image generation tool and search capabilities. John shares his thoughts on the product's performance and its potential impact on the market.Key Takeaways:* Facebook Meta has launched Meta AI, a product that integrates with various platforms, including WhatsApp, Facebook, Instagram, and a standalone site.* Llama 3, an open-source model released by Facebook Meta, is comparable to GPT-4 in terms of performance.* Meta AI offers an image generation tool called "Imagine" and a search feature that leverages both Bing and Google search results.* The product provides summaries and sources for search queries, making it useful for finding information quickly.* Meta AI has the potential to become a significant player in the market, given its integration with popular platforms and the performance of Llama 3. Thank you for listening to this episode of Stack Snacks. Be sure to tune in next time for more insightful discussions. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
This episode dives into the latest advancements in the field of artificial intelligence, focusing on recent updates to two major language models: Google's Gemini 1.5 Pro and OpenAI's ChatGPT 4 Turbo.Gemini 1.5 Pro:Currently available for developers through Google's AI Studio (free) and Poe (subscription).Major updates include:Ability to upload and understand up to 9.5 hours of audio.Support for video input.Expanded context window of 1 million tokens, enabling more comprehensive and informed responses.Advanced reasoning and understanding capabilities.JSON mode for developers.Potential use cases discussed, such as analyzing meeting recordings, generating transcripts, and processing large amounts of text and audio data.ChatGPT 4 Turbo:New update available for ChatGPT Plus users.Focuses on improving the quality and accuracy of responses, particularly in areas like math, logic, reasoning, and code generation.Aims to address previous user concerns regarding verbosity, clarity, and effectiveness of code generation.Early impressions suggest ChatGPT 4 Turbo has reclaimed its position as the leading language model, surpassing competitors like Claude.Conclusion:The episode concludes by emphasizing the rapid evolution of AI technology and the exciting possibilities these advancements offer. Listeners are encouraged to explore the capabilities of these updated AI models and consider their potential applications in various fields. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
In this episode, we're diving into the recent launch of Google's Gemini Advanced model. Join us as we explore whether it's worth your time and investment. Here's what we've got on the menu:* Introduction to Gemini Advanced: A week after its launch, we take a closer look at Google's new AI offering. Is it the chatbot competitor we've been waiting for?* Understanding the Subscription Model: To access Gemini Advanced, powered by the Gemini Ultra 1.0 model, you'll need to navigate Google's branding maze. We break down the subscription process for you.* First Impressions: After a week of testing, we share our initial thoughts on Gemini's performance. From coding assistance to real-time searches, find out how it stacks up against ChatGPT.* Improvements and Quirks: The Gemini team has been busy ironing out the initial kinks. We discuss the quality improvements and the quirky challenges that remain.* Pricing and Value: With a $20/month subscription fee as part of the Google One plan, we evaluate the cost-effectiveness of Gemini Advanced. Is it worth switching from ChatGPT Plus?* Integration with Google Ecosystem: Despite its strengths, Gemini's integration with Google's suite of services leaves room for improvement. We highlight what's missing and the potential for a more cohesive user experience.* Future Prospects: Google's rapid improvements signal a promising future for Gemini Advanced. We speculate on what's next and how it could revolutionize our interaction with AI.Thank you for joining us on this exploration of Google's Gemini Advanced. Your curiosity fuels our journey into the ever-evolving world of AI. See you in the next episode! This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
John Siwicki discusses the highlights of OpenAI's dev day, focusing on the new updates and features. He mentions the launch of GPT-4 Turbo, a larger model with a 128K context window. He also talks about the new Assistance API, which allows users to deploy their own assistants or agents. Additionally, he mentions the availability of DALL·E Three in the API and the introduction of the Text-to-Speech Whisper API. John also discusses the pricing for GPT-4 Turbo with vision, the upgrade to GPT-3 and a half turbo, and the new features in the GPT interface. He concludes by expressing his excitement about the potential of these new developments.Key Takeaways:* OpenAI has launched GPT-4 Turbo, a larger model with a 128K context window.* The Assistance API allows users to deploy their own assistants or agents.* DALL·E Three is now available in the API, and there is a new Text-to-Speech Whisper API.* GPT Four Turbo with vision enables users to pass images into the API, with pricing based on image size.* The GPT interface has been upgraded, allowing users to create their own GPTs and share them with others. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
John Swicki provides a news roundup of recent developments in AI. He starts by discussing Canva's new feature called Magic Studio, which includes text-to-design, text-to-video, and text-to-presentation capabilities. He is impressed by the range of features and the ease of use offered by Canva. Next, he talks about Arcmax, a web browser that has integrated AI components. He highlights features such as a five-second preview of web pages, smartly renamed downloads, and the ability to ask questions based on the content of a web page. John also mentions Google's addition of Bard to Google Assistant, which will allow users to interact with Assistant through text, voice, and photos. He believes this integration has the potential to make Assistant even more powerful. He briefly mentions Snipped, a podcast app that now offers podcast summaries, and Bing Chat, which now allows users to use Dalle3 for image generation.Key Takeaways:* Canva's Magic Studio offers a range of AI-powered features, including text-to-design, text-to-video, and text-to-presentation capabilities.* Arc Max, a web browser, has integrated AI components such as a five-second preview of web pages and smartly renamed downloads.* Google is adding Bard to Google Assistant, allowing users to interact with Assistant through text, voice, and photos.* Snipped, a podcast app, now offers podcast summaries, making it easier for users to get a quick overview of lengthy episodes.* Bing Chat now allows users to use Dalle3 for image generation.Links: https://www.canva.com/newsroom/news/magic-studio/https://blog.google/products/assistant/google-assistant-bard-generative-ai/https://www.anthropic.com/index/evaluating-ai-systemshttps://www.snipd.com/promo/summaryhttps://blogs.bing.com/search/october-2023/DALL-E-3-now-available-in-Bing-Chat-and-Bing-com-create-for-freehttps://arc.net/e/D25B2EEA-7506-4850-A169-3B2A00802889 This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
We discus the recent announcements made by OpenAI regarding the updates to their Chat GPT and the introduction of DALL·E 3, their text image system. He highlights the improvements in image generation and the integration of Dolly Three into ChatGPT. He also explores the new features of back-and-forth conversations and voice capabilities in ChatGPT, emphasizing the impressive quality of the voices. John expresses his excitement about the possibilities these updates bring and the potential for more robust conversations and problem-solving. He concludes by mentioning OpenAI's upcoming developer conference and the anticipation surrounding these updates.Key Takeaways:* OpenAI has announced significant updates to ChatGPT and the introduction of Dolly Three, a text image system.* DALL·E 3 improves image generation and brings it up to par with other platforms.* ChatGPT now offers back-and-forth conversations and voice capabilities with impressive quality.* The new features allow for more interactive and problem-solving conversations.* OpenAI's updates have generated excitement and anticipation among users.Quotes:* "DALL·E 3 is definitely right up there with some of the best image generation platforms I've seen so far."* "The quality of the voices in ChatGPT is really quite good. I couldn't tell the difference between the AI voice and the human voice in the demo."* "The back-and-forth conversations and problem-solving capabilities in ChatGPT are impressive."* "OpenAI's updates have brought their product up to par with other platforms."* "I can't wait to try out these new features and see what ChatGPT can do."Links: Stack SnacksOpenAI's DALL·E 3 Announcement: [https://openai.com/dall-e-3] OpenAI's ChatGPT Voice Demo: [https://openai.com/blog/chatgpt-can-now-see-hear-and-speak] This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
Welcome to another episode of Snack Snacks, where we make the complex world of AI digestible! Join John Siwicki at the ungodly hour of 3:45 AM as he dives into the latest updates in the AI landscape. In this episode, we explore:* Anthropic's Claude Pro: Is the subscription model the future of AI? We discuss the pros and cons of Anthropic's new offering and how it stacks up against other subscription-based AI services.* Prompting Tips from Anthropic: Learn how to make your prompts more effective with XML tags. A quick guide that could change the way you interact with AI models.* Repl.it's Ghostwriter: Discover how Repl.it is integrating GPT-4 into their online IDE, and what it means for developers.* Zoom's AI Features: Zoom is stepping up its game with AI companions in meetings. We delve into the practicality and privacy concerns surrounding this new feature.* OpenAI's Developer Conference: Mark your calendars for November 6th! We discuss what to expect and why you should be excited.* Dust, the AA Assistant: A first look at an assistant that combines GPT-4 with your internal company knowledge, connecting platforms like Notion, Google Drive, Slack, and GitHub.Whether you're an AI enthusiast, a developer, or just curious about the future of technology, this episode has something for you. So grab a snack and tune in!Key Takeaways:* Subscription models are becoming increasingly prevalent in the AI space, offering both advantages and drawbacks.* Effective prompting can enhance your interaction with AI models.* AI is becoming deeply integrated into everyday tools, from IDEs to video conferencing platforms.* Upcoming events like OpenAI's Developer Conference are crucial to keep an eye on for the latest innovations.Don't miss out on this comprehensive rundown. See you in the next episode! This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
John Siwicki is the host of Stack Snacks, a show that aims to make learning about AI simple and fun. He provides insights and commentary on various AI-related topics and shares interesting links and resources with his audience.Summary:In this episode of Stack Snacks, John Siwicki discusses several interesting AI-related topics. He starts by introducing Ideogram, an image-to-text generator that handles text in a unique way. He highlights some of the interesting results it produces, such as generating T-shirt designs and novelty items. John also mentions that Ideogram is free to try, but users should be cautious as their prompts are public-facing.Next, John talks about OpenAI's guide for teachers on how to use GPT in the classroom. He emphasizes that even though it's targeted at teachers, it provides valuable insights and thought exercises for everyone. The guide covers topics like how GPT works, its limitations, and biases. John encourages his audience to check it out and explore different ways of writing prompts.Moving on, John discusses Loom AI, a new feature added to the screen recording app Loom. He highlights the various enhancements Loom AI offers, such as automatically generating titles, creating summarizations, adding chapters, and providing editing features similar to Descript. John finds the addition of AI-powered talking points and the ability to generate personalized versions of recordings with variables particularly interesting. He mentions that Loom AI is available as an add-on with a monthly fee.John then introduces Audio Read, a tool that converts articles into voiceovers. He mentions that what sets Audio Read apart is its ability to generate an RSS feed, allowing users to subscribe to the voiceovers in their podcast player. He sees this as a convenient way to consume articles while on the go.Lastly, John briefly mentions a new Canva plugin called Chat GPT. Although he couldn't find much information about it, he demonstrates how it integrates with Canva and allows users to create logos based on prompts.Key Takeaways:* Ideogram is a text-to-image generator that produces high-quality and interesting results.* OpenAI's guide for teachers on using GPT in the classroom provides valuable insights for everyone, not just teachers.* Loom AI offers several enhancements to the screen recording app, including automatic title generation, summarizations, chapters, and advanced editing features.* Audio Read converts articles into voiceovers and provides an RSS feed for easy podcast-style consumption.* Canva has a new Chat GPT plugin that integrates with the design platform and allows users to create logos based on prompts.Quotes:* "Ideogram is a high-quality, free-to-use text-to-image generator that can produce interesting results."* "OpenAI's guide for teachers on using GPT in the classroom provides valuable insights and thought exercises for everyone."* "Loom AI offers enhancements like automatic title generation, summarizations, chapters, and advanced editing features."* "Audio Read converts articles into voiceovers and provides an RSS feed for easy podcast-style consumption."* "Canva's Chat GPT plugin allows users to create logos based on prompts and integrates seamlessly with the design platform." This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
John Siwicki discusses several interesting topics in this episode. He starts by introducing the Text FX project from Google, which offers AI-powered tools for rappers, writers, and wordsmiths. He explores the different tools available, such as Simile Explode, Unexpected Alliteration, and Acronym Fuse Scene Unfold.Next, he mentions a project called "Visualizing AI" by Google's DeepMind. This project showcases artwork and animations created by artists based on AI. The visuals are captivating and provide a unique perspective on AI.John then moves on to discuss a new plugin called Jambot by Figma. Jambot brings Chat GPT inside Figma's whiteboard software, allowing users to utilize AI-generated responses during ideation and collaboration sessions.He also highlights the launch of Code Llama by Meta Facebook, a model specifically designed for code writing. The model is open-sourced, and Perpexity Labs provides a playground for users to experiment with it.Lastly, John shares the big news of the week: OpenAI's update to their API, which now allows fine-tuning of Chat GPT Three and a Half Turbo. Fine-tuning enables users to train the model on their own data, resulting in higher quality results, shorter prompts, lower costs, and lower latency requests.Key Takeaways:* Google's Text FX project offers AI-powered tools for rappers, writers, and wordsmiths.* "Visualizing AI" by Google's DeepMind showcases artwork and animations inspired by AI.* Figma's Jambot plugin brings Chat GPT inside their whiteboard software for collaborative ideation.* Meta Facebook launched Code Llama, a model tuned for code writing, which is open-sourced and available for experimentation.* OpenAI's API update allows fine-tuning of Chat GPT Three and a Half Turbo, resulting in higher quality results, lower costs, and lower latency requests.Links: * https://www.figma.com/community/widget/1274481464484630971* https://labs.perplexity.ai/* https://platform.openai.com/docs/guides/fine-tuning* https://www.deepmind.com/visualising-ai * https://textfx.withgoogle.com/* https://www.youtube.com/@stacksnacks This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
About The Guest(s):John Siwicki is a tech enthusiast and content creator who shares updates and insights on the latest trends in the tech industry. John provides valuable information and reviews on various platforms and tools.Summary:In this episode, John Siwicki shares updates and new developments in the tech industry, particularly in the video and AI space. He discusses the launch of Heygen 2.0, an AI video platform with lifelike avatars, and demonstrates how to create an AI video using Canva. John also highlights the extension of Runway's gen two videos to 18 seconds, allowing for more creative possibilities. He then explores the new features released by Vimeo, including an AI script generator and teleprompter feature. John introduces two content repurposing tools, Decipher AI and Cast Magic, which offer transcription and audiogram generation capabilities. Lastly, he mentions Creerio, a data-driven app with natural language searchability for SQL commands.Key Takeaways:* Heygen 2.0 introduces lifelike AI avatars, expanding the possibilities for video creation.* Runway now supports gen two videos up to 18 seconds, enabling more creative opportunities.* Vimeo's AI script generator and teleprompter feature simplify video creation and enhance the user experience.* Decipher AI and Cast Magic offer content repurposing capabilities, including transcription and audiogram generation.* Creerio provides a user-friendly interface for executing SQL commands with natural language searchability.Quotes:* "Heygen 2.0 introduces incredible lifelike AI avatars, revolutionizing video creation." * "Vimeo's AI script generator and teleprompter feature make video creation a breeze." * "Decipher AI and Cast Magic offer powerful tools for content repurposing and transcription." * "Creerio simplifies data analysis with its natural language searchability for SQL commands." - https://www.deciphr.ai/ - https://vimeo.com/features/ai-script-generator -https://twitter.com/joshua_xu_/status/1689019874667024384?s=20 - - https://twitter.com/runwayml/status/1689630007746764803?s=20- https://www.deciphr.ai/ - https://www.castmagic.io - https://www.querio.ai This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
In this episode, John Siwicki dives into the latest updates from the AI world. He starts with six new features rolling out for ChatGPT, including prompt examples, suggested replies, and the default use of ChatGPT-4 for Plus subscribers. He also discusses the new ability to upload multiple files for the code interpreter and the introduction of keyboard shortcuts.John then moves on to Google's AI-powered search experience, which has received three significant updates: the inclusion of more videos and images in search results, increased speed, and the addition of cited sources and links. He also touches on Perplexity AI's new file upload feature and how it can quickly summarize PDFs.Finally, he explores Meta's AudioCraft, a tool for sound generation, and its impressive examples. From sirens to the sound of typing on a typewriter, AudioCraft is pushing the boundaries of AI-generated audio.Tune in for a comprehensive look at these updates and how they're shaping the AI landscape.Links: https://help.openai.com/en/articles/6825453-chatgpt-release-noteshttps://audiocraft.metademolab.com/audiogen.htmlhttps://blog.google/products/search/google-search-generative-ai-august-update/ This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
Welcome back to another episode with your host, John Swicki. This week, we're diving into some exciting new releases from AWS, learning how to train a llama for your own purposes, and exploring a few other intriguing topics.AWS's new release, HealthScribe, is our first point of discussion. This innovative tool automatically generates clinical notes from patient and clinical conversations, enhancing clinical productivity with AI-generated notes that are easy to reference, edit, and finalize.We delve into the implications of HealthScribe, discussing its potential impact on medical workflows and patient consultations.Next up, we found a fascinating blog post detailing how to fine-tune your own LlamaTube model, Facebook's open-source project.We walk through the Python-heavy tutorial, discussing how you can use this model for your own purposes.We then turn our attention to LangChain, a tool designed to assist developers in building applications they previously could not.We discuss the potential of LangChain, highlighting its ability to combine different sources of computation and knowledge.Finally, we discuss a thought-provoking blog post titled "AI and the Decline of the Open Web." We explore the implications of login walls, the transformation from forums to private groups, and the rise of walled gardens.We delve into the potential fragmentation of data and the open web, considering the future of the advertisement model and the increasing value of data sets.Links: https://aws.amazon.com/healthscribe/ https://mlabonne.github.io/blog/posts/Fine_Tune_Your_Own_Llama_2_Model_in_a_Colab_Notebook.html https://blog.langchain.dev/automating-web-research/ https://tanay.substack.com/p/ai-and-the-decline-of-the-open-webhttps://www.youtube.com/@stacksnacks This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
In this episode, host John Swicki takes a deep dive into the current state of AI, discussing newly released open-source models, on-device applications, and possible future developments from major tech companies.Highlights of this episode include:* Llama 2's Arrival: John kicks off the episode by reviewing Facebook Meta's recently open-sourced AI model, Llama 2. Discussing its complexity, sizes (7 billion, 13 billion, and 70 billion parameters), and potential for both research and commercial use, John predicts interesting developments as more developers gain access to this tool.* Qualcomm's Ambition: Next, the focus shifts to Qualcomm's plan to run Llama 2 on devices. John explores the potential for increased personalization, better security, and the avoidance of cloud infrastructure reliance.* OpenAI's ChatGPT Custom Instructions: Then, John breaks down OpenAI's new feature - 'Custom Instructions' for ChatGPT. This enhancement allows users to provide ChatGPT with personalized information for improved response quality, changing the way we interact with AI.* Apple's AI Potential: To conclude the episode, John speculates about Apple's rumored AI project, internally referred to as Apple GPT. How might this affect Siri and Apple's privacy-centric approach? Stay tuned to find out.Links mentioned in this episode:* Perplexity Labs' Llama Playground: https://labs.perplexity.ai/* Stack Snacks Youtube: https://www.youtube.com/channel/UC-lzS2PWvUztxHIMdf55S5QJoin John next time for more insights into the world of artificial intelligence, and don't forget to subscribe to our podcast so you never miss an episode! This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
Welcome to another episode where we dive deep into the exciting world of artificial intelligence. Here's what we cover in this episode:* OpenAI-AP Collaboration: We discuss the new agreement between OpenAI and the Associated Press to share select news content and technology. OpenAI will have access to part of AP's text archive to refine its model, and this collaboration could possibly mark the start of a new era in news services.* Claude 2 Update: Anthropic recently launched Claude 2, a significant update from its predecessor. We explore the features of this AI model and its user-friendly interface, accessible now at claude.ai. We also delve into its performance and its newly-introduced file upload feature.* Shopify's Sidekick: Shopify is set to release a new feature - a co-pilot named Sidekick, aimed at aiding users in setting up their Shopify account. We touch upon its features and potential impact on e-commerce business owners.* Poe.com Update: Poe.com has introduced some significant updates, including file uploading across all their offerings and even processing URLs to enhance user experience. We also discuss their two new OpenAI models with larger context windows and the introduction of their Mac app.* Google's SoundStorm: We explore Google's newly-published research paper 'SoundStorm', on audio generation and what it could possibly mean for the future of AI.* Other Updates: We discuss some quick updates about Bard, another AI tool, and the link to a separate video covering its updates in more detail.Links to all topics covered, as well as some related resources, are included in the podcast description. We encourage listeners to check them out for a deeper understanding of each subject.Thank you for joining us in this episode. We appreciate your interest and look forward to bringing you more updates from the fascinating world of AI!Links: https://www.anthropic.com/index/claude-2 https://www.ap.org/press-releases/2023/ap-open-ai-agree-to-share-select-news-content-and-technology-in-new-collaborationhttps://twitter.com/poe_platform/status/1678843136452468754 https://twitter.com/Shopify/status/1679119634685984768https://ai.googleblog.com/2023/06/soundstorm-efficient-parallel-audio.htmlBard Video This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
Show Notes* Chatbot PI's New Update - The chatbot PI has released a new update for their iOS app where users can now speak to it. The bot will talk back, allowing for more interactive conversations. The update also includes six different voice options. However, the feature has not been tested yet due to hardware limitations. [Timestamp: Start - 5:00]* GPT API Release - The long-awaited GPT API is now available for everyone to start experimenting with. The API doesn't seem to have any drastic price cuts, but it's expected that more people will start integrating it into their workflows and apps. [Timestamp: 5:01 - 10:00]* OpenAI's Super Alignment Team - OpenAI has announced the creation of a super alignment team. The team will be dedicated to steering AI systems that are much smarter than humans. They aim to build AI aligners that can do a better job than human alignment teams. OpenAI is dedicating 20% of their compute to this effort. [Timestamp: 10:01 - 15:00]* Public.com's New Co-pilot - Public.com, an investment app, is building a co-pilot into their trading app. The co-pilot will be able to provide customized insights relevant to your portfolio and your goals. The feature will be rolled out in three phases, with the first phase already available. [Timestamp: 15:01 - End]* Try out PI's new iOS update and share your experience in the comments.* Check out the GPT API and explore its potential applications.* Follow OpenAI's super alignment team's progress and understand their efforts.* Test Public.com's new co-pilot feature and see how it can help with your investments.Closing Remarks In this episode, we looked at PI's new iOS update, the rollout of the GPT API, OpenAI's super alignment team, and Public.com's new co-pilot feature. Stay tuned for more updates in the world of AI. See you next time!Links:https://twitter.com/heypi_ai/status/1676998045567680514https://openai.com/blog/introducing-superalignmenthttps://public.com/alpha This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
* Stable Diffusion 0.9: A significant release from Stability AI. The new model has impressive demo images, including an alien walking through Las Vegas and a wolf in Yosemite National Park. Notably, the model has made significant improvements in rendering hands and faces. * ClipDrop.co: A toolset from Stability AI where you can experiment with the 0.9 model. Examples include a lion with wings, a lighter cake, and a farmer cat in a garden. * Image Generation: Discussion of various generated images, including a scene inside a glass bottle and a hand holding a heart.* Adobe Express: Similar to Canva, Adobe Express is a straightforward, templated design tool. It now incorporates Adobe Firefly, allowing users to generate images directly into their workflow. * Color GPT: A simple tool for generating color palettes based on user prompts. * Scribble Diffusion: A fun program that turns sketches into images. https://clipdrop.co/stable-diffusionhttps://stability.ai/blog/sdxl-09-stable-diffusionhttps://scribblediffusion.com/https://new.express.adobe.com/https://www.colourgpt.app/ This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
OpenAI API Updates* Notable reduction in the cost of some models:* 75% cost reduction on embeddings* 25% cost reduction for input tokens for GPT 3.5 Turbo* Function Calling Upgrade for GPT3 and GPT3.5:* This new feature allows conversion of natural language queries into API calls or database queries.* Examples include converting queries like "Email someone to see if she wants to get coffee on Friday" to function calls such as "send an email."* It can even convert database queries like "What are my top 10 customers this month?" into API calls like "get customers by revenue."* More complex examples involve using this feature to answer questions or pull specific data.* Excitement for the new bigger context windows which would allow for new applications.OpenAI API UpdatesFacebook and Meta's New Generative AI Model* Facebook and Meta have developed a generative AI model for speech with state-of-the-art performance.* This model seems to be a major advancement in voice generation.* Check out the linked 2-minute video for an overview of this new technology.Facebook and Meta's New Generative AI ModelAI Speech Classifier from Eleven Labs* A tool that allows users to upload a 60-second audio file to detect if the speech was AI-generated.* Discussion on the potential benefits of these types of classifiers, particularly in the early stages of AI development and use.* Speculation about future tools for detection of AI-generated content, including potential metadata requirements for certain photos.AI Speech Classifier from Eleven Labs This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
Welcome back, everyone, to our latest deep dive into some of the most interesting technology and design tools out there. Without further ado, let's delve straight in.Music Gen: A Novel Music Creation ToolOur first highlight is an impressive demo from Music Gen, a creation of Facebook's research group. Similar to Google's LLM, Music Gen facilitates music creation with a unique twist - the ability to condition melodies. Despite a few connection hitches during our demo, it eventually lived up to expectations, crafting a 90s rock song with heavy drums and an electric guitar, just as we had specified.Not only does it give you several pre-filled prompts to work with, but you also get the chance to experiment with creating your own tunes. The more I explore it, the more I realize how much potential there is for non-musicians like me to tap into their latent creativity. With Music Gen, music production is becoming increasingly accessible, and we look forward to seeing how this evolves.ClipDrop by Stability AI: Powerful Image ManipulationNext up is ClipDrop by Stability AI, a site teeming with valuable image manipulation tools. Whether you're looking to replace a background, upscale an image, remove unwanted elements, or clean up your images, ClipDrop has got you covered.A standout feature is the uncropping tool, quite similar to Adobe's Generative Fill, which seamlessly fills in the edges of an image. It performed remarkably well even on abstract art, offering a broader view of the initial image. Furthermore, their Re-Imagine XL tool provides various alternatives from a single image - perfect for when you need a fresh perspective.While ClipDrop's free account has some limitations, like watermarked images, the range of offerings in their paid version is worth considering, especially if it becomes part of your workflow.Bard's Continuous Improvements: Google Sheets and MoreBard recently rolled out some understated updates that further enhance its user experience. They now provide the option to export data directly to Google Sheets, a feature that data analysts might find particularly useful.Bard also upgraded its tech computational prompts, improving its ability to tackle mathematical tasks, coding questions, and string manipulations. They run code in the background using Python, akin to ChatGPT's code interpreter plugin. While there is no option for file uploads yet, this step towards executing code on behalf of users indicates a promising trajectory.Automator for Figma: Design Automation at its BestLast but certainly not least is Automator for Figma. This plugin allows you to automate tasks such as spec'ing out your design or importing data from Airtable, saving you valuable time. It even comes with a find and replace feature.What's more, the community page offers user-created automations, providing inspiration on how to best utilize the plugin and streamline your design process.That's it for this week's exploration of tech and design tools. We hope you found these insights helpful and feel inspired to check out some of these impressive tools. Stay tuned for more exciting updates in our next piece. Thank you for joining us, and we'll see you next time!https://huggingface.co/spaces/facebook/MusicGenhttps://clipdrop.co/https://bard.google.com/updateshttps://automator.design/ This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
Welcome back to another exciting installment with me, John Siwicki. This week, we have an eclectic mix of fascinating things that caught our attention. Let's dive straight in!First up, we have some intriguing updates from Bard. A couple of days ago, Bard started incorporating location information into their searches, promising a leap towards more localized search results. However, it's important to note that Bard still labels this feature as an experiment.In a test search, we asked Bard to find a place for an oil change - a standard request, right? The first results interestingly pointed us to Colorado, despite our location being elsewhere. Clearly, Bard's location feature has some kinks to iron out, but the potential here is undeniable. The local search results, complemented by added images about oil changes, show promising strides. We eagerly await future improvements to Bard's location-based services.Next, we'll shift gears and explore the ChatGPT plugin. Early on, many of these plugins felt somewhat nebulous, with unclear functions. However, a recent discovery caught our eye: the 'Show Me' plugin. This fantastic tool shows you how certain files work. For instance, it vividly demonstrated how a CSS file operates within a web browser. An added bonus? You can edit the diagrams it generates, providing hands-on learning!Another exciting development is the recent addition of a search feature to the ChatGPT plugin store. Now, finding plugins like 'Show Me' is easier than ever. This proves immensely helpful for your charting needs.Moving on, we discuss the impressive work by po.com, a site run by Cora. It provides a unique platform for testing various language models, even those typically challenging to access. One such feature is the 'Mid Journey Bot,' which assists you in writing more engaging mid-journey prompts.Here's an example: we entered a prompt about a house in Miami after a storm, with a child in a raincoat in the foreground. The bot expanded upon this simple scene, weaving a detailed and evocative narrative. The depth it added to the original prompt was truly remarkable, hinting at the immense creative potential of such tools.Lastly, we're excited to introduce you to an intriguing app discovered on Twitter: Audio Pen.ai. This innovative platform offers an easy way to convert your spoken thoughts into structured text. Record your ideas into the app, and it organizes them into a clear and coherent flow. We found this tool especially helpful for managing voice notes.That's it for this week! We'll continue to explore these promising developments and share our findings with you. We hope you found these insights as exciting as we did, particularly the potential of Audio Pen. Remember, we'll include all relevant links in the show notes.See you all next week. Thank you for joining us!Links: https://poe.com/Midjourneyhttps://audiopen.ai https://bard.google.com/updates This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
We kick things off by dissecting Microsoft's latest innovation, Windows Co-Pilot. We delve into its range of functionalities, such as interacting with system settings, summarizing PDFs, and initiating Spotify playlists. Listen in as we shed light on its potential capabilities and anticipated impact on user experience.Then, we turn our attention to Adobe's newest beta update for Photoshop - the Generative Fill. Accompany us through a detailed walkthrough of this revolutionary feature, which utilizes text-based commands to modify images. We contemplate its potential to transform the landscape of graphic editing.Wrapping up the episode, we discuss a novel feature in ChatGPT, enabling users to share chat links. We explore how this unique innovation may alter our engagement with this AI chatbot, potentially introducing a fresh dimension of sharing and interaction.https://www.adobe.com/products/photoshop/generative-fill.html https://blogs.windows.com/windowsdeveloper/2023/05/23/bringing-the-power-of-ai-to-windows-11-unlocking-a-new-era-of-productivity-for-customers-and-developers-with-windows-copilot-and-dev-home/ This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
Links: https://openai.com/blog/introducing-the-chatgpt-app-for-ioshttps://aitestkitchen.withgoogle.com/experiments/music-lmhttps://zoo.replicate.dev/?id=a-comic-book-panel-of-ocean-waves-pixel-art-by-salvador-dali-lpe55wlgTranscript: Hello, everyone. Welcome back to the show. I'm John Siwicki, and I appreciate you joining us on our weekly AI roundup. Today, we're going to explore three intriguing developments. A few of these are quite entertaining as well.Let's begin with something recent: the release of the Chat GPT iOS app. It's the first version, and an Android version is not available yet. The app has a few interesting features. It integrates with the Whisper API, which ensures excellent text-to-speech capabilities. Additionally, if you're a Plus subscriber, you can use GPT-4 on it, which is an impressive feature.Currently, we're hoping for an Android app soon. I'm curious to see if a well-crafted mobile app will influence user habits and possibly spark new use cases. I've used the web app on my mobile device; it got the job done, albeit not smoothly.Therefore, it will be fascinating to observe how a well-designed mobile app changes user behavior. Will people start using it more frequently? Will they opt for it over Siri or Google Assistant? The next few weeks will certainly provide some insights, especially if the app evolves and introduces new features.Moving on to the second topic: Google's new AI test kitchen. One of their projects is Music LM, which enables text to audio conversion. I tried one demo, inputting "80s synth music that I can listen to on a run after a breakup." I think it's a fascinating tool, especially for those who aren't musically talented. I've been experimenting with it all weekend, and I recommend getting on the waitlist if you can.Lastly, we'll discuss a text-image playground by Replicate called Zoo dot replicate dev. It lets you experiment with different text-image combinations. For listeners, I used a comic panel of "Ocean Waves Pickle Solar" by Salvador Dali and compared the results by different models. It's enjoyable to experiment with, especially if you're using varied images in your project.That's all for this week's roundup. We'll include all the links in the show notes. Thank you for spending your time with us today. We'll be back soon. Until then, take care, everyone. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
* Boston Dynamics and ChatGPT Integration: Boston Dynamics has integrated ChatGPT into its robots. There's a two-minute video showing their team interacting with a ChatGPT-integrated robot that is definitely worth watching.* ChatGPT Updates: ChatGPT has released new settings. You can now export your data and turn off data collection in the settings. There are also some visual changes they've made. Furthermore, updates have been rolled out to their plugins, including the browsing plugin that now works with ChatGPT-4.* AI Learning Resources:* OpenAI and Deep Learning Prompt Engineering Course: This course provides insights into prompt engineering. It's a quick study and is recommended even for those not intending to write code to interact with the API.* Microsoft's AI for Beginners: This robust document can be found on Microsoft's GitHub page. It covers topics such as neural networks, computer vision, and natural language processing. It is suitable for those who want to delve into the technical aspects of AI. Links: Boston Dynamics ChatGPTOpenAI and Deep Learning Prompt Engineering CourseMicrosoft's AI for Beginners This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
In this episode, we dive into some fascinating discoveries, intriguing discussions, and thought-provoking dilemmas surrounding the world of artificial intelligence and the digital future. We start by discussing the latest substantial update to Google's Bard AI, which now boasts impressive coding capabilities and seamless integration with Google Collab and Google Sheets.Next, we contemplate an insightful tweet by Marie Haynes, an SEO expert, that speculates the possibility of a radical shift in how businesses are found online due to Google's new AI search engine. We ponder the implications of AI interacting with businesses on our behalf and how that could change our experiences.Finally, we recommend a thought-provoking talk from the Center for Humane Technology by Tristan Harris, who explores the potential catastrophic risks that AI poses to society, the race to deploy AI without adequate safety measures, and the need to upgrade our institutions for a post-AI world. We emphasize the importance of examining the pros and cons of AI and exploring different perspectives on this transformative technology.Linkshttps://bard.google.com/updateshttps://twitter.com/Marie_Haynes/status/1649755997466947586 This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
In this episode, we g into the exciting world of artificial intelligence, focusing on Google's upcoming AI features and their new project. We examine the challenges Google faces in the competitive AI landscape, especially following the lackluster performance of "Bard," and how it stacks up against ChatGPT. Furthermore, we discuss Google's unique opportunity to leverage its extensive suite of services—like Google Calendar and Google My Business—to develop a groundbreaking personal assistant that could outshine Siri, Google Assistant, and Alexa. Thanks for tuning in! This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
In this episode, we dive into the fascinating world of Poe.com user bots, interactive simulacra of human behavior, and the AI accountability policy request for comment. Join us as we discuss the implications of these technologies and policy initiatives on society, privacy, and the future of artificial intelligence.Show Notes:Poe.com User Bots* Discussion of Adam D'Angelo's tweet: https://twitter.com/adamdangelo/status/1644435126343077888?s=12&t=4DFLsyWB2nlmdJs1ZUkj1AGenerative Agents: Interactive Simulacra of Human Behavior* Introduction to the research paper: https://arxiv.org/abs/2304.03442v1AI Accountability Policy Request for Comment* Introduction to the NTIA's request for comments: https://ntia.gov/issues/artificial-intelligence/request-for-comments This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.stack-snacks.com
Links Mentioned In The Show https://www.smashingmagazine.com/2022/04/designing-better-breadcrumbs/ https://a11yphant.com/ https://blog.openreplay.com/build-a-lightweight-web-component-with-lit-js https://stackdiary.com/centering-in-css/ https://www.builder.io/blog/the-ultimate-guide-to-optimizing-javascript-for-quick-page-loads --- Support this podcast: https://anchor.fm/stacksnacks/support
This week Cloudflare rolled out of a beta of a Google Tag Manager alternative Cloudflare Zaraz. https://blog.cloudflare.com/why-cloudflare-bought-zaraz/ https://www.stack-snacks.com/ --- Support this podcast: https://anchor.fm/stacksnacks/support
A recap of our latest newsletter on stack-snacks.com
We walk through Google's new page speed metrics and how to measure them.
On today's, show we look at Glitch. Glitch is a community where you can create, share and remix some web projects. The unique thing about Glitch is it allows more than just HTML/CSS/JS you can build node apps in the browser.
On todays show we look at serverless functions and one service that is going to help you write and deploy those quicker.
Our goal for this show to showcase tools and products that are going to help all of us get better and grow as developers and designers.