Speech recognition podcasts

Zoho Just Built Its Own LLM—Here's Why It Matters

Aug 6, 2025 22:01

Zoho launches its own large language model, Zia LLM: built in India, designed for business, and powered by privacy-first AI agents that redefine what digital employees can do.
Zoho is taking a bold step into the AI future with the launch of its own large language model (LLM) and a suite of enterprise-ready AI agents, all developed in-house, not in Silicon Valley but in India. In this conversation, Zoho executive Chandrasekhar “LSP” joins Your Tech Report to unpack what makes Zoho's approach to AI different and why it could reshape how businesses automate, analyze, and serve customers. With its own infrastructure, private data policies, and “no AI tax” pricing model, Zoho aims to give businesses control over their data, their automation, and their outcomes. LSP explains how Zoho's custom-built LLMs are trained on licensed datasets, operate within customer firewalls, and are tailored to specific business contexts, unlike consumer LLMs from OpenAI or Google. We also dive into Zoho's digital employee framework, the Zoho Directory's access guardrails, and the new Zia agent marketplace, which enables developers to create and monetize AI agents. From speech recognition to interoperability across platforms, this episode offers a deep look into Zoho's vision for AI, one grounded in privacy, performance, and purpose.
0:00 – Zoho's Big AI Announcement
3:25 – Why Zoho Built Its Own LLM from Scratch
8:40 – Privacy by Design: No Data Sharing, No AI Tax
12:20 – Digital Employees vs Traditional Agents
16:10 – Zoho Directory & Enterprise Guardrails
21:15 – Zia Marketplace and Multi-Agent Workflows
27:10 – Speech Recognition and Low-Resource Language Support
31:00 – Staying Grounded Through the AI Hype
35:45 – Zoho's Vision for Accessible, Affordable AI
38:00 – Zoholics Conference Preview
#ZohoZia #AIPrivacy #LLM #DigitalEmployees #EnterpriseAI #YourTechReport #Zoholics

Season 5 - 011 - Speech recognition in ATC, UFA's ATTranscribe and stepping out of simulation

Radar Contact

May 20, 2025 16:48

This episode with Dale Drake, Director ATC Sales at UFA, explores their latest product, ATTranscribe, which can be used for voice transcription in incident investigation. We dig deeper into how voice recognition can be used in ATC and discuss how ATTranscribe is UFA's first step out of simulation.

Season 5 - 007 - Speech Recognition for Air Traffic Control applications - Eric Button, EnhancedRadar

Radar Contact

May 6, 2025 19:25

Speech recognition has been around for a long time but never achieved results deemed sufficient for use in Air Traffic Control. Recent advances in Artificial Intelligence and constant increases in computing power could change that. Eric Button, founder and CEO of EnhancedRadar, discusses this and how the company focuses exclusively on this topic. EnhancedRadar will be present at AirspaceWorld 2025 at booth number H21600.

Gladia: Breaking Language Barriers with AI-Powered Speech Recognition, Podcast

Telecom Reseller

Feb 21, 2025

“AI shouldn't erase languages; it should amplify them.” – Jean-Louis Quéguiner (JL), CEO of Gladia
In a world where AI-driven communication tools are shaping industries, Gladia is setting a new standard in multilingual, real-time speech recognition. In this episode of Technology Reseller News, Doug Green interviews Jean-Louis Quéguiner (JL), CEO of Gladia, to discuss how their automatic speech recognition (ASR) technology is not only solving technical challenges but also protecting and amplifying underrepresented languages.
Bridging the AI Divide in Speech Recognition
Gladia is not just another transcription company. It is the only ASR provider capable of real-time, multilingual transcription with low latency, supporting over 100 languages, including accents, dialects, and code-switching.
Key Differentiators:
* High-Accuracy Transcription Across 100+ Languages – Handles everything from English and Spanish to Tagalog and Zulu
* Real-Time Code-Switching – Seamlessly processes multiple languages in a single conversation
* Bias-Free AI – Ensures equitable representation for underserved languages
As JL explains, many AI tools ignore so-called “niche” languages, despite their vast number of speakers. With 1.5 billion people in India and 120 million Tagalog speakers, Gladia is filling a critical gap in global communication.
AI and the Future of Contact Centers
Gladia's speech AI technology is already being integrated into call centers, CCaaS platforms, and voice AI agents to:
* Improve real-time CX with hyper-accurate transcription & sentiment analysis
* Expand global market reach by enabling companies to support more languages
* Enable AI-powered call automation while maintaining human oversight
JL emphasizes that AI voice agents are evolving rapidly, but rather than replacing human interaction, they should work alongside contact center professionals. In the next five years, call center roles will shift from volume-based operations to high-skill, high-quality customer interactions.
Expanding into the U.S.: Building the Right Partnerships
As Gladia expands its footprint in North America, the company is actively looking for:
* Channel partners in cloud communications
* Technology integrations with CCaaS and AI-powered platforms
* Enterprises looking to enhance multilingual CX
JL himself is relocating to New York to foster relationships and drive business development in the U.S. market.
Learn More About Gladia
Website: www.gladia.io
Contact for Partnerships: Connect with Jean-Louis Quéguiner on LinkedIn
#AI #SpeechRecognition #CX #CallCenters #MultilingualAI #ContactCenter #ASR #CloudCommunications #LanguageTech

E182 'Revolutionising Medical Speech Recognition' with T-Pro's Meghan Dowling

AI in Action Ireland

Jan 23, 2025 10:36

Today's guest is Meghan Dowling, computational speech and language scientist at T-Pro. Meghan has gained recognition for her impactful work, including being a finalist in the 2024 AI Awards Women in AI Ambassador of the Year category. Meghan speaks about her work within Automatic Speech Recognition (ASR) at T-Pro, an exciting company making strides in medical documentation.
Topics include:
0:00 Her role and journey into working with Automatic Speech Recognition
2:06 How T-Pro reduces clinicians' documentation burden with speech technologies
3:12 Why Medical ASR requires high accuracy for patient safety
4:21 Tackling accuracy, terminology, hallucinations and bias challenges
7:19 The importance of mentors and supporting women in tech
9:19 A new Co-pilot project to enhance doctor-patient interactions with ASR

Speechmatics CTO - Next-Generation Speech Recognition

Machine Learning Street Talk

Oct 23, 2024 106:23

Will Williams is CTO of Speechmatics in Cambridge. In this sponsored episode, he shares deep technical insights into modern speech recognition technology and system architecture. The episode covers several key technical areas:
* Speechmatics' hybrid approach to ASR, which focusses on unsupervised learning methods, achieving comparable results with 100x less data than fully supervised approaches. Williams explains why this is more efficient and generalizable than end-to-end models like Whisper.
* Their production architecture implementing multiple operating points for different latency-accuracy trade-offs, with careful latency padding (up to 1.8 seconds) to ensure consistent user experience. The system uses lattice-based decoding with language model integration for improved accuracy.
* The challenges and solutions in real-time ASR, including their approach to diarization (speaker identification), handling cross-talk, and implicit source separation. Williams explains why these problems remain difficult even with modern deep learning approaches.
* Their testing and deployment infrastructure, including the use of mirrored environments for catching edge cases in production, and their strategy of maintaining global models rather than allowing customer-specific fine-tuning.
* Technical evolution in ASR, from early days of custom CUDA kernels and manual memory management to modern frameworks, with Williams offering interesting critiques of current PyTorch memory management approaches and arguing for more efficient direct memory allocation in production systems.
Get coding with their API! This is their URL: https://www.speechmatics.com/
DO YOU WANT TO WORK ON ARC with the MindsAI team (current ARC winners)? MLST is sponsored by Tufa Labs. Focus: ARC, LLMs, test-time-compute, active inference, system2 reasoning, and more. Interested? Apply for an ML research position: benjamin@tufa.ai
TOC
1. ASR Core Technology & Real-time Architecture
[00:00:00] 1.1 ASR and Diarization Fundamentals
[00:05:25] 1.2 Real-time Conversational AI Architecture
[00:09:21] 1.3 Neural Network Streaming Implementation
[00:12:49] 1.4 Multi-modal System Integration
2. Production System Optimization
[00:29:38] 2.1 Production Deployment and Testing Infrastructure
[00:35:40] 2.2 Model Architecture and Deployment Strategy
[00:37:12] 2.3 Latency-Accuracy Trade-offs
[00:39:15] 2.4 Language Model Integration
[00:40:32] 2.5 Lattice-based Decoding Architecture
3. Performance Evaluation & Ethical Considerations
[00:44:00] 3.1 ASR Performance Metrics and Capabilities
[00:46:35] 3.2 AI Regulation and Evaluation Methods
[00:51:09] 3.3 Benchmark and Testing Challenges
[00:54:30] 3.4 Real-world Implementation Metrics
[01:00:51] 3.5 Ethics and Privacy Considerations
4. ASR Technical Evolution
[01:09:00] 4.1 WER Calculation and Evaluation Methodologies
[01:10:21] 4.2 Supervised vs Self-Supervised Learning Approaches
[01:21:02] 4.3 Temporal Learning and Feature Processing
[01:24:45] 4.4 Feature Engineering to Automated ML
5. Enterprise Implementation & Scale
[01:27:55] 5.1 Future AI Systems and Adaptation
[01:31:52] 5.2 Technical Foundations and History
[01:34:53] 5.3 Infrastructure and Team Scaling
[01:38:05] 5.4 Research and Talent Strategy
[01:41:11] 5.5 Engineering Practice Evolution
Shownotes: https://www.dropbox.com/scl/fi/d94b1jcgph9o8au8shdym/Speechmatics.pdf?rlkey=bi55wvktzomzx0y5sic6jz99y&st=6qwofv8t&dl=0
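The show notes above list WER (word error rate) under section 4.1 without defining it. As a rough illustration only, and not a description of how Speechmatics scores its systems, WER is commonly taken to be the word-level edit distance between a reference transcript and the recognizer's output, divided by the number of reference words. The function name `word_error_rate` and the plain whitespace tokenization are assumptions made for this sketch; real scoring pipelines usually normalize case and punctuation first.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()  # assumption: whitespace tokenization, no normalization
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution (sat -> sit) and one deletion (the) over six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because insertions count as errors while the denominator is the reference length, this ratio can exceed 1.0 when the hypothesis is much longer than the reference.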

How to Systematically Test and Evaluate Your LLMs Apps // Gideon Mendels // #269

MLOps.community

Oct 18, 2024 61:42

Gideon Mendels is the Chief Executive Officer at Comet, the leading solution for managing machine learning workflows. How to Systematically Test and Evaluate Your LLMs Apps // MLOps Podcast #269 with Gideon Mendels, CEO of Comet.
// Abstract
When building LLM applications, developers need to take a hybrid approach that draws on both ML and software engineering best practices. They need to define eval metrics and track their entire experimentation to see what is and is not working. They also need to define comprehensive unit tests for their particular use case so they can confidently check whether their LLM app is ready to be deployed.
// Bio
Gideon Mendels is the CEO and co-founder of Comet, the leading solution for managing machine learning workflows from experimentation to production. He is a computer scientist, ML researcher and entrepreneur at his core. Before Comet, Gideon co-founded GroupWize, where they trained and deployed NLP models processing billions of chats. His journey with NLP and Speech Recognition models began at Columbia University and Google, where he worked on hate speech and deception detection.
// MLOps Swag/Merch
https://mlops-community.myshopify.com/
// Related Links
Website: https://www.comet.com/site/
All the Hard Stuff with LLMs in Product Development // Phillip Carter // MLOps Podcast #170: https://youtu.be/DZgXln3v85s
Opik by Comet: https://www.comet.com/site/products/opik/
--------------- ✌️Connect With Us ✌️ -------------
Join our slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Gideon on LinkedIn: https://www.linkedin.com/in/gideon-mendels/
Timestamps:
[00:00] Gideon's preferred coffee
[00:17] Takeaways
[01:50] A huge shout-out to Comet ML for sponsoring this episode!
[02:09] Please like, share, leave a review, and subscribe to our MLOps channels!
[03:30] Evaluation metrics in AI
[06:55] LLM Evaluation in Practice
[10:57] LLM testing methodologies
[16:56] LLM as a judge
[18:53] Opik track function overview
[20:33] Tracking user response value
[26:32] Exploring AI metrics integration
[29:05] Experiment tracking and LLMs
[34:27] Micro Macro collaboration in AI
[38:20] RAG Pipeline Reproducibility Snapshot
[40:15] Collaborative experiment tracking
[45:29] Feature flags in CI/CD
[48:55] Labeling challenges and solutions
[54:31] LLM output quality alerts
[56:32] Anomaly detection in model outputs
[1:01:07] Wrap up

Human-Centered AI for Disordered Speech Recognition - Katarzyna Foremniak

DataTalks.Club

Oct 4, 2024 48:01

About the speaker: Katarzyna is a computational linguist with over 10 years of experience in NLP and speech recognition. She has developed language models for automotive brands like Audi and Porsche and specializes in phonetics, morpho-syntax, and sentiment analysis. Kasia also teaches at the University of Warsaw and is passionate about human-centered AI and multilingual NLP. Join our slack: https://datatalks.club/slack.html

The Valley Current®: How is A.I. Exponentially Accelerating Innovation?

THE VALLEY CURRENT®️ COMPUTERLAW GROUP LLP

Sep 4, 2024 64:02

In this episode of The Valley Current®, host Jack Russo sits down with tech visionary Ronjon Nag, whose journey through the world of AI and technology is nothing short of inspiring. From his days at Cambridge and MIT to founding a groundbreaking handwriting recognition company with just $500, Ronjon has always been ahead of the curve. His insights into the evolution of mobile technology, including his role in launching the first mobile app store, will captivate anyone interested in the tech industry's rapid transformation. But it doesn't stop there. Ronjon's current venture, R.42, is pushing the boundaries of AI, longevity, and healthcare, and he's got some bold predictions about the future of these fields. As a Stanford professor, he's not just shaping technology—he's shaping minds, teaching the next generation about the ethical and practical implications of AI. Listen in as Ronjon educates Jack with fascinating stories, forward-thinking ideas, and a deep dive into the cutting-edge innovations that could change our lives. Whether you're a tech enthusiast or just curious about the future, this conversation is one you won't want to miss. Check out these links to learn more about Ronjon Nag or even chat with him via his avatar! https://www.r42group.com/ https://www.superbio.ai/ https://app.mimio.ai/ronjon/chat Jack Russo Managing Partner Jrusso@computerlaw.com www.computerlaw.com https://www.linkedin.com/in/jackrusso "Every Entrepreneur Imagines a Better World"®️

From Dark Matter to Voice AI with Deepgram Founder Scott Stephenson

Founded and Funded

Aug 28, 2024 31:20

Today, Madrona Managing Director Karan Mahandru talks with Scott Stephenson, co-founder and CEO of Deepgram, a foundational AI company building a voice AI platform that provides APIs for speech-to-text and text-to-speech. From medical transcription to autonomous agents, Deepgram is the go-to for developers of voice AI experiences, and they're already working with over 500 companies, including NASA, Spotify, and Twilio. Scott and Karan dive into the realities of building a foundational AI company, meaning they're building models and modalities from scratch. They discuss the challenges of moving from prototype to production, how startups need to out-fox the hyperscalers while also partnering with them, and, of course, how Scott went from being a particle physicist working on detecting dark matter to building large language models for speech recognition. This is a must-listen for anyone building in AI.
Full Transcript: http://www.madrona.com/founded-funded-deepgram-scott-stephenson
Chapters:
(00:00) Introduction
(01:15) From Particle Physics to Voice AI
(03:16) The Birth of Deepgram
(03:40) Building a Developer-Centric AI Company
(06:11) Challenges and Early Decisions
(09:49) Navigating the AI Market
(13:33) OpenAI's Whisper and Deepgram's Response
(17:30) The Future of AI and Speech Recognition
(21:59) Deepgram's Real-World Applications
(31:19) From Prototype to Production

#200 Trevor Back: How Speechmatics is Shaping the Future of Conversational AI

Eye On A.I.

Aug 1, 2024 56:24

In this episode of the Eye on AI podcast, we explore the forefront of voice-powered AI technology with Trevor Back, Chief Product Officer at Speechmatics. Discover how Speechmatics is pushing the boundaries of speech recognition and conversational AI with their latest innovation, Flow. Trevor shares his journey from a background in computational astrophysics to becoming a key figure in AI at DeepMind and now Speechmatics. He delves into the development and potential of Flow, a groundbreaking tool combining automatic speech recognition (ASR), large language models (LLMs), and text-to-speech synthesis, aimed at creating seamless and responsive voice interactions. We explore the wide-ranging applications of Speechmatics' technology across industries, including media, call centers, and education. Trevor discusses the challenges of achieving high accuracy in speech recognition, especially in diverse and noisy environments, and how Speechmatics addresses these challenges with their unique approach to training models. Listen in as we uncover the intricacies of handling multiple languages, improving diarization, and the future goals of understanding complex audio cues like emotion and sarcasm. Learn about the company's vision for integrating voice technology into everyday products, making technology more accessible and user-friendly. Don't miss this insightful conversation on the future of voice technology, AI in business, and its role in the evolving landscape of AI. Like, subscribe, and hit the notification bell for more expert discussions on cutting-edge advancements in AI. This episode is sponsored by Shopify. Shopify is a commerce platform that allows anyone to set up an online store and sell their products. Whether you're selling online, on social media, or in person, Shopify has you covered on every base. With Shopify you can sell physical and digital products. You can sell services, memberships, ticketed events, rentals and even classes and lessons. 
Sign up for a $1 per month trial period at http://shopify.com/eyeonai
Check out Speechmatics, the most accurate AI speech technology, with AI transcription & real-time translation components: https://www.speechmatics.com/
Stay Updated:
Craig Smith Twitter: https://twitter.com/craigss
Eye on A.I. Twitter: https://twitter.com/EyeOn_AI
(00:00) Introduction and Background
(01:49) Trevor Back's Journey into AI
(04:02) DeepMind and Early AI Applications
(07:30) Speechmatics' Mission and Focus
(12:06) Key Applications of Speechmatics Technology
(14:25) Achieving High Accuracy and Low Latency
(17:52) Language Coverage and Challenges
(21:27) Future of Voice Technology and AGI
(24:52) Integrating Large Language Models
(27:31) Handling Multiple Voices and Diarization
(29:32) Real-world Applications and Challenges
(35:20) Demonstration of Flow and Capabilities
(41:14) Endpoint Prediction and Interruption
(43:53) Real-time Interactions and Future Prospects
(45:34) Launch Event and Future Plans
(50:13) New Language Releases and Compliance

Intron Health gets backing for its speech recognition tool that recognizes African accents

TechCrunch Startups – Spoken Edition

Jul 26, 2024 7:37

Voice recognition is getting integrated in nearly all facets of modern living, but there remains a big gap: speakers of minority languages, and those with thick accents or speech disorders like stuttering are typically less able to use speech recognition tools that control applications, transcribe or automate tasks, among other functions.

From J2ME, over Bluetooth and Speech Recognition to AI

airhacks.fm podcast with adam bien

Jul 21, 2024 50:10

An airhacks.fm conversation with Bruce Hopkins about: transition from Basic to Java, work on Bluetooth technology and writing a book on Bluetooth for Java, involvement with Sun Microsystems and Java ME, becoming a Java Champion, shift to AI and natural language processing research, development of speech recognition and hands-free web navigation systems using pure Java, use of Hugging Face libraries for NLP in 2016, writing for Linux Magazine about mesh VPNs, discovery and exploration of ChatGPT, writing a book on integrating ChatGPT with Java, shared experiences and parallel paths in Java development, discussion about Sun Microsystems vs Oracle's approach to Java, mention of various Java-related technologies like JXTA, Sphinx, FreeTTS, and Dalvik, brief explanation of mesh VPNs and Tailscale, plans for a future podcast episode focused on Bruce's JavaChatGPT book

OpenAI's GPT-4o mini

GPT Reviews

Jul 19, 2024 14:36

OpenAI has released their newest model, GPT-4o mini, which is more cost-efficient and excels in mathematical reasoning and coding tasks. NVIDIA's Mistral NeMo 12B is a state-of-the-art language model with unprecedented accuracy and enterprise-grade support. A new speech recognition keyboard and service for Android called Transcribro has been developed, which is private and on-device. Research papers explore the impact of vocabulary size on language model scaling, the use of large datastores for retrieval-based language models, and a method for generating long sequences of views of a cityscape using AI and computer vision.
Contact: sergi@earkind.com
Timestamps:
00:34 Introduction
01:40 OpenAI Announces GPT 4o mini
03:11 Mistral AI and NVIDIA Unveil Mistral NeMo 12B, a Cutting-Edge Enterprise AI Model
05:28 Transcribro: Private and on-device speech recognition keyboard and service for Android
06:43 Fake sponsor
08:49 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
10:19 Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
11:49 Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion
13:26 Outro

Why don't speech recognition systems understand African American English?

Across Acoustics

Jul 8, 2024 17:18 Transcription Available

Most people have encountered speech recognition software in their day-to-day lives, whether through personal digital assistants, auto transcription, or other such modern marvels. As the technology advances, though, it still fails to understand speakers of African American English (AAE). In this episode, we talk to Michelle Cohn (Google Research and University of California Davis) and Zion Mengesha (Google Research and Stanford University) about their research into why these problems with speech recognition software seem to persist and what can be done to make sure more voices are understood by the technology.
Associated paper: Michelle Cohn, Zion Mengesha, Michal Lahav, and Courtney Heldreth. "African American English speakers' pitch variation and rate adjustments for imagined technological and human addressees." JASA Express Letters 4, 047601 (2024). https://doi.org/10.1121/10.0025484.
Read more from JASA Express Letters. Learn more about Acoustical Society of America Publications.
Music: Min 2019 by minwbu from Pixabay.

Ep1106: Amir Haramaty: Innovating Business Processes through AI and Speech Recognition

20 Minute Leaders

Jun 13, 2024 22:42

Amir Haramaty shares his journey from cybersecurity to pioneering AI solutions, focusing on the power of voice-driven technology to revolutionize business processes. He discusses the challenges of integrating AI into traditional industries and highlights how his team's innovative solutions are making complex tasks simpler and more efficient. Amir's insights reveal the practical applications of AI, offering a glimpse into a future where technology enhances everyday work and drives measurable value in various fields.

Unlocking AI use cases with speech recognition with Ricardo Herreros-Symons

VUX World

May 29, 2024 60:54

Ricardo Herreros Symons is the VP Corporate Development and Strategy at AI speech-to-text company Speechmatics.

KPMG Announces Stellar Judging Panel for Fourth Global Tech Innovator Competition

Irish Tech News Audio Articles

May 29, 2024 6:25

KPMG has revealed the esteemed judging panel for the Irish round of its fourth annual Global Tech Innovator competition. The winner will represent Ireland, competing against 22 other countries in the global final in November. The competition is accepting applications from tech start-ups in the Republic of Ireland and Northern Ireland until midnight on Friday, May 31st, 2024. This year's KPMG Global Tech Innovator competition will be hosted by award-winning tech journalist and broadcaster Jess Kelly. As the host of Tech Talk on Newstalk, Ireland's only national radio show dedicated to technology, Jess brings a wealth of knowledge and insight to the competition.
The Judges
The judging panel comprises industry leaders, including investors, founders, and advisors, all with significant expertise in the tech sector. Alan Bromell, Head of Private Enterprise at KPMG, said: "We're thrilled to have such an outstanding panel of judges for this year's competition. Their diverse expertise, spanning investment, leadership, and advisory roles, ensures we are well-equipped to identify Ireland's next top tech innovator."
Will Prendergast, Partner and Co-founder, Frontline
Will is a Partner on Frontline Ventures' European Seed fund and co-founded the firm in 2012. He has since led over 25 investments at Frontline and has a particular interest in backing mission-driven founders building in Ireland, the Netherlands and the DACH region, and supporting them to expand stateside. He has lived and worked in the U.S. himself and continues to spend significant time there to develop investor and corporate relationships for the benefit of Frontline's portfolio.
Dr. Patricia Scanlon, Founder, SoapBox Labs
Patricia holds a PhD in Artificial Intelligence (AI) and Speech Recognition and a bachelor's degree in electronic engineering. Her 25 years' experience in AI spans both academia and industry, including roles at Columbia University, Bell Labs and IBM. In 2013, she founded SoapBox Labs, the world's leading provider of ethical Voice AI for children, which was acquired by US-based Curriculum Associates in 2023. In 2022, Dr Scanlon was appointed Ireland's first Artificial Intelligence Ambassador, to lead a national conversation on Artificial Intelligence, working with the Department of Enterprise, Trade and Employment. In 2023 she was appointed Chairperson of Ireland's new expert AI Advisory Council, providing the government with early foresight of emerging trends, challenges, risks and opportunities.
Eimear Hennessy, Global Head of Enterprise Services, Stripe
Eimear leads Stripe's Global Enterprise Services, based in Dublin, Ireland. This team delivers Stripe's proactive, preventative technical account management partnerships to Stripe's largest users. Her career spans 20 years across technology and engineering consulting. Building is the common thread in Eimear's career, from her start as a civil engineer through to joining Google in the second decade of its existence, building businesses, solutions and teams. Now at Stripe, she is heavily invested in building Stripe as it too enters its second decade of existence. Eimear's experience is hugely diverse, having led organisations across Engineering (working in sectors such as Oil, Pharma, Nuclear power, Commercial construction), Advertising, Cloud, Publishing and now Fintech, working across Sales, Support, Partnerships, Project management and Strategy.
Barry Napier, CEO, Cubic Telecom
Barry has extensive experience and a proven track record in building innovative technology companies. Barry possesses a rare combination of leadership and skills in strategy, business and corporate development, well suited to establishing high performance teams to grow and transform technology organisations. Most recently Barry led Cubic in a strategic partnership with Softbank Corp whereby they invested €473 million, valuing the company at over €900 million. Barry also holds board seats on a range of high-growth companies ...

The AI Edge: AI Innovations Driving Supply Chain Efficiency

The Digital Supply Chain podcast

May 6, 2024 41:00 Transcription Available

In this episode of the Sustainable Supply Chain Podcast, I'm joined by Amir Haramaty, CEO of aiOla, to explore how artificial intelligence is reshaping business processes with high accuracy in speech recognition. Amir delves into the core functions of aiOla, a tool designed to bridge the gap between human speech and actionable data, thereby streamlining operations across industries.
Amir outlines how aiOla not only captures spoken language but converts it into structured, usable data that integrates seamlessly with existing ERP and CRM systems. This transformation is particularly crucial in sectors where precision and speed are paramount, such as logistics, pharmaceuticals, and manufacturing. By enhancing data capture, aiOla facilitates more informed decision-making and operational efficiency.
A key focus of our discussion centres on sustainability: how aiOla's technology minimises waste and optimises resource use by eliminating paper processes and improving data accuracy. These enhancements have tangible impacts on the bottom line and environmental sustainability.
Tune in to hear how Amir's technology is making significant strides in making business processes smarter, safer, and more sustainable. Whether it's improving pre-op inspections in food processing or ensuring compliance in pharmaceuticals, aiOla is setting a new standard for integrating AI into daily operations.
Join us to discover how integrating AI into your supply chain can lead to substantial efficiency gains and a more sustainable future.
Don't forget to check out the video version of this episode at https://youtu.be/MQtz0fP3ytk
Elevate your brand with the ‘Sustainable Supply Chain' podcast, the voice of supply chain sustainability. Last year, this podcast's episodes were downloaded over 113,000 times by senior supply chain executives around the world. Become a sponsor. Lead the conversation. Contact me for sponsorship opportunities and turn downloads into dialogues. Act today. Influence the future.
Podcast supporters
I'd like to sincerely thank this podcast's generous supporters: Lorcan Sheehan, Olivier Brusle, Alicia Farag, Luis Olavarria, Alvaro Aguilar.
And remember you too can Support the Podcast - it is really easy and hugely important as it will enable me to continue to create more excellent Digital Supply Chain episodes like this one.
Podcast Sponsorship Opportunities:
If you/your organisation is interested in sponsoring this podcast - I have several options available. Let's talk!
Finally
If you have any comments/suggestions or questions for the podcast - feel free to just send me a direct message on Twitter/LinkedIn. If you liked this show, please don't forget to rate and/or review it. It makes a big difference to help new people discover it. Thanks for listening.

This fluent Mandarin speaker worked for Google training speech recognition systems: Interview with Isaac Myers

I'm Learning Mandarin

Play Episode Listen Later Apr 3, 2024 32:41

Peak Mandarin Newsletter: https://www.peakmandarin.com/free-ebook Isaac's Mandarin from the Ground Up podcast: https://www.mftgu.com/ -- On today's podcast, I speak to language teacher and podcast host, Isaac Myers. Isaac has a fascinating Mandarin learning backstory which involves extensive travel around Taiwan and China and working at Google training language models for speech recognition systems. Through his podcast, Mandarin from the Ground Up, he teaches Chinese using the same imitation techniques we all used to learn our first language. All of which gives him a unique perspective and makes his insights on learning Chinese well worth listening to!

google china training chinese speaker taiwan worked myers mandarin ground up fluent speech recognition

MacVoices #24091: MVL - Roku's Onerous Terms; Apple, Google, and AI; AirTags Under Fire...Again

MacVoices Audio

Play Episode Listen Later Mar 27, 2024 27:18

The privacy theme rolls on as Chuck Joiner, Brian Flanigan-Arthurs, Eric Bolden, Marty Jencius, Jim Rea, Jeff Gamet, and David Ginsburg look at the almost unbelievable Terms of Service in the most recent Roku update and how they apply to each Roku device, channel, and app. Then, the MacVoices panel delivers some initial thoughts on what appears to be a relationship between Apple and Google over AI capabilities, and yet another lawsuit targeting AirTags. No other Bluetooth or GPS tracker, just AirTags. This edition of MacVoices is supported by The MacVoices Slack. Available to all Patrons of MacVoices. Sign up at Patreon.com/macvoices. Show Notes: Chapters: 01:12 Roku's Onerous Terms 04:15 Audience Experiences with Roku Software 08:34 AI Collaboration Between Apple and Google 13:40 Speculations on Apple and Google AI Partnership 16:46 Targeted AI Models and Speech Recognition 20:28 AirTags Anti-Stalking Lawsuit 24:20 Discussion on AirTag Lawsuit and Tracking Devices Links: Your Roku will stop working unless you agree to its new terms — what to know and how to get around it https://www.tomsguide.com/tvs/your-roku-will-stop-working-unless-you-agree-to-its-new-terms-what-to-know-and-how-to-get-around-it Roku Dispute Resolution Terms https://docs.roku.com/published/disputeresolution/en/ca?cjdata=MXxOfDB8WXww&Ref=CJ&utm_source=cj&utm_medium=affiliate&utm_campaign=cj_affiliate_sale_6361382&utm_content=3486349_Future+Publishing+Limited&utm_term=13571892&cjevent=0a7baf1ee65511ee819000020a82b820&AID=13571892&PID=6361382&SID=trd-us-9097961795919355163 Apple might use Google Gemini to power some AI features on the iPhone https://9to5google.com/2024/03/17/gemini-apple-iphone-talks/ AirTag anti-stalking class-action lawsuit given the green light https://appleinsider.com/articles/24/03/17/airtag-anti-stalking-class-action-lawsuit-given-the-green-light Guests: Eric Bolden is into macOS, plants, sci-fi, food, and is a rural internet supporter. 
You can connect with him on Twitter, by email at embolden@mac.com, on Mastodon at @eabolden@techhub.social, and on his blog, Trending At Work. Brian Flanigan-Arthurs is an educator with a passion for providing results-driven, innovative learning strategies for all students, but particularly those who are at-risk. He is also a tech enthusiast who has had a particular affinity for Apple since he first used the Apple IIGS as a student. You can contact Brian on Twitter as @brian8944. He also recently opened a Mastodon account at @brian8944@mastodon.cloud. Jeff Gamet is a technology blogger, podcaster, author, and public speaker. Previously, he was The Mac Observer's Managing Editor, and the TextExpander Evangelist for Smile. He has presented at Macworld Expo, RSA Conference, and several WordCamp events, along with many other conferences. You can find him on several podcasts such as The Mac Show, The Big Show, MacVoices, Mac OS Ken, This Week in iOS, and more. Jeff is easy to find on social media as @jgamet on Twitter and Instagram, jeffgamet on LinkedIn, @jgamet@mastodon.social on Mastodon, and on his YouTube Channel at YouTube.com/jgamet. David Ginsburg is the host of the weekly podcast In Touch With iOS where he discusses all things iOS, iPhone, iPad, Apple TV, Apple Watch, and related technologies. He is an IT professional supporting Mac, iOS and Windows users. Visit his YouTube channel at https://youtube.com/daveg65 and find and follow him on Twitter @daveg65 and on Mastodon at @daveg65@mastodon.cloud Dr. Marty Jencius has been an Associate Professor of Counseling at Kent State University since 2000. He has over 120 publications in books, chapters, journal articles, and others, along with 200 podcasts related to counseling, counselor education, and faculty life. 
His technology interest led him to develop the counseling profession ‘firsts,' including listservs, a web-based peer-reviewed journal, The Journal of Technology in Counseling, teaching and conferencing in virtual worlds as the founder of Counselor Education in Second Life, and podcast founder/producer of CounselorAudioSource.net and ThePodTalk.net. Currently, he produces a podcast about counseling and life questions, the Circular Firing Squad, and digital video interviews with legacies capturing the history of the counseling field. Generally, Marty is chasing the newest tech trends, which explains his interest in A.I. for teaching, research, and productivity. Marty is an active presenter and past president of the NorthEast Ohio Apple Corp (NEOAC). Jim Rea built his own computer from scratch in 1975, started programming in 1977, and has been an independent Mac developer continuously since 1984. He is the founder of ProVUE Development, and the author of Panorama X, ProVUE's ultra fast RAM based database software for the macOS platform. He's been a speaker at MacTech, MacWorld Expo and other industry conferences. Follow Jim at provue.com and via @provuejim@techhub.social on Mastodon. Support: Become a MacVoices Patron on Patreon http://patreon.com/macvoices Enjoy this episode? 
Make a one-time donation with PayPal Connect: Web: http://macvoices.com Twitter: http://www.twitter.com/chuckjoiner http://www.twitter.com/macvoices Mastodon: https://mastodon.cloud/@chuckjoiner Facebook: http://www.facebook.com/chuck.joiner MacVoices Page on Facebook: http://www.facebook.com/macvoices/ MacVoices Group on Facebook: http://www.facebook.com/groups/macvoice LinkedIn: https://www.linkedin.com/in/chuckjoiner/ Instagram: https://www.instagram.com/chuckjoiner/ Subscribe: Audio in iTunes Video in iTunes Subscribe manually via iTunes or any podcatcher: Audio: http://www.macvoices.com/rss/macvoicesrss Video: http://www.macvoices.com/rss/macvoicesvideorss

ai google apple technology service video iphone security journal mac associate professor smile ios gps ipads windows privacy terms apple tv ram counseling lawsuit speculation generally apple watches bluetooth roku managing editors aid mastodon macos big show second life kent state university airtags airtag trackers apple google google gemini pid apple airtags counselor education rsa conference terms of service wordcamp speech recognition macworld expo onerous jeff gamet david ginsburg apple iigs mac observer chuck joiner macvoices mactech mac os ken in touch with ios macvoices group provue macvoices page

Empowering Young Readers: AI's Role in Enhancing Reading Skills with Dr. Phil Hickman

Open Tech Talks : Technology worth Talking| Blogging |Lifestyle

Play Episode Listen Later Mar 17, 2024 30:52

Integrating Artificial Intelligence (AI) in the educational technology sector is revolutionizing the learning and teaching landscape. As AI continues to advance, its application within education is not just enhancing educational experiences but is fundamentally transforming the industry. From personalized learning paths to intelligent tutoring systems, AI in education unlocks unprecedented opportunities for students and educators alike, setting the stage for a future where education is more accessible, efficient, and tailored to individual needs. In our earlier podcast session 126, we talked to the founders of AI Teacher about how they are using Generative AI for secondary school teachers. In today's enlightening episode, we had the privilege of interviewing the founder of Plabook, a revolutionary educational tool designed to transform how we approach oral reading fluency and comprehension in students. Plabook stands out by leveraging advanced speech recognition technology to listen to students as they read, providing instant, personalized feedback and smart recommendations. This innovative platform assesses reading abilities and delves deep into understanding each student's unique strengths and weaknesses through diagnostic reports. These reports are powerful tools for teachers and parents, offering insights to support and enhance the learning journey. Throughout our discussion, we explored Plabook's inception, development, and impact on the educational landscape, uncovering the vision and challenges behind integrating AI into education. Episode # 131 Today's Guest: Dr. Phil Hickman, Founder / CEO PlaBook Website: PlaBookEducation Linkedin: Dr. Phil This podcast offers invaluable insights into how AI and technology can bridge educational gaps, challenge traditional learning methods, and pave the way for a future where every student has the tools to succeed. 
What Listeners Will Learn: In today's podcast, you'll gain insights into several key areas: Exploring the Use of Speech Recognition for Feedback: Discover the role of advanced speech recognition in providing real-time, tailored feedback to students. Understanding the Impact of Plabook on English Reading Comprehension: Gain insights into how Plabook is transforming the approach to teaching and assessing reading fluency. The Journey from Pilot to Validation: Learn about Plabook's initial testing phases and the validation of its innovative concept. Adapting to Diverse Dialects and Languages: Find out how Plabook accommodates the linguistic diversity of its users. The Educational Sector's Response to AI Innovations: Delve into how new technologies like Plabook are being received and adopted in educational settings. Anticipating Future Educational Innovations: Get a glimpse of potential advancements and changes in the education sector influenced by AI technology. Resources: Website: PlaBookEducation Linkedin: Dr. Phil Meet AI Teacher: The Future of AI in Education Unveiled with Dr Pauldy Otermans and Dev Aditya

ai pilot impact empowering adapting enhancing hickman young readers speech recognition reading skills

The Power of Speech Recognition to Transform Clinical Documentation Practices with nVoq

Home Health Revealed

Play Episode Listen Later Feb 21, 2024 39:14

In this podcast, nVoq Chief Revenue Officer Chris Moran discusses the transformative role of technology in the home health industry. nVoq, a cloud-based, medically relevant, and HIPAA-compliant speech recognition platform, is enabling in-home healthcare caregivers. The platform ensures accurate transcriptions, maintains patient information security, and keeps home health agencies compliant with healthcare regulations. It also facilitates interoperability with other healthcare systems and enhances the accuracy and completeness of documentation, ultimately giving clinicians more time back in their day. nVoq focuses on organizational readiness and easy adoption for effective use of the technology. To learn more, visit nVoq's website: https://www.nvoq.com/ This podcast is brought to you by HealthRev Partners, offering revenue cycle management services powered by Velocity, the most advanced coding and billing software in the market.

transform practices clinical documentation velocity hipaa speech recognition

AI-Powered Business Communications with Dan O'Connell, Chief AI & Strategy Officer at Dialpad

Play Episode Listen Later Jan 9, 2024 82:51

In this episode, Nathan sits down with Dan O'Connell, Chief AI and Strategy Officer at Dialpad. They discuss how Dialpad built its own language models using 5 billion minutes of business calls, custom speech recognition models for every customer, and the challenges of bringing AI into business. If you need an ecommerce platform, check out our sponsor Shopify: https://shopify.com/cognitive for a $1/month trial period. We're sharing a few of Nathan's favorite AI scouting episodes from other shows. Today, Shane Legg, Cofounder at DeepMind and its current Chief AGI Scientist, shares his insights with Dwarkesh Patel on AGI's timeline, the new architectures needed for AGI, and why multimodality will be the next big landmark. We're hiring across the board at Turpentine and for Erik's personal team on other projects he's incubating. He's hiring a Chief of Staff, EA, Head of Special Projects, Investment Associate, and more. For a list of JDs, check out: eriktorenberg.com. --- SPONSORS: Shopify is the global commerce platform that helps you sell at every stage of your business. Shopify powers 10% of ALL eCommerce in the US. And Shopify's the global force behind Allbirds, Rothy's, and Brooklinen, and millions of other entrepreneurs across 175 countries. From their all-in-one e-commerce platform, to their in-person POS system – wherever and whatever you're selling, Shopify's got you covered. With free Shopify Magic, sell more with less effort by whipping up captivating content that converts – from blog posts to product descriptions using AI. Sign up for $1/month trial period: https://shopify.com/cognitive Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. 
Mention "Cog Rev" for 10% off www.omneky.com NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform, head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist. X/SOCIAL: @labenz (Nathan) @dialdoc (Dan) @dialpad @CogRev_Podcast (Cognitive Revolution) TIMESTAMPS: (00:00) - Introduction and Welcome (06:50) - Interview with Dan O'Connell, Chief AI and Strategy Officer at Dialpad (07:13) - The Functionality and Utility of Dialpad (17:20) - The Development of Dialpad's Large Language Model Trained on 5 Billion Minutes of Calls (19:56) - The Future of AI in Business (22:21) - Sponsor Break: Shopify (23:56) - The Challenges and Opportunities of AI Development (31:17) - Prioritizing latency, capacity, and cost when evaluating AI (39:41) - Most Loved AI Features in Dialpad (42:01) - The Role of AI in Quality Assurance (43:10) - The Future of Transcription Accuracy (44:06) - The Importance of Speech Recognition in Business (46:59) - Personalizing AI for Better Business Interactions (47:01) - The Role of AI in Content Generation (52:47) - The Challenges and Opportunities of AI in Sales and Support

Beyond Bard: A Conversation With Google's Yury Pinsky On AI & The Future Of SEO - EP325

Search Engine Nerds

Play Episode Listen Later Nov 10, 2023 40:27

Bard is our creative collaborator. It's a place where you can come in and have a conversation with the large language model which really helps you to boost your productivity and bring your ideas to life. –Yury Pinsky, 02:16 Step behind the curtain and into the world of Google's Bard with its Director of Product Management, Yury Pinsky, in an exclusive conversation with SEJ Editor-in-Chief Amanda Zantal-Wiener. Hear about the origins and journey to Bard's unveiling, and discover how the team behind it envisions a collaborative future with AI. SEO pros and seasoned digital marketers alike will get an up-close look at the nuances of generative AI and a glimpse at predictions for what's next. So prep your popcorn -- grab your notetaking method of choice -- and tune in to learn how to incorporate Google's current and forward-looking AI initiatives into your own business innovation. [07:11] - The origin of Bard and its market niche. [13:23] - Impact of generative AI and Bard on SEO and content creation. [17:37] - Using Bard for audience evaluation in content creation. [23:54] - Distinctive features of Bard compared to other AI models. [28:33] - Most interesting prompts seen in Bard. [33:29] - Future vision for Bard and generative AI. I'm inspired by this idea that technology can work together with us, and we can bring Bard in as a creative partner in your editorial work or when we're trying to write a document for work or something in our personal lives. –Yury Pinsky, 09:46 It's a very vibrant, fast-paced, fast moving industry right now. I think some of the unique things we have with Bard are things like the ability to plug into Google tools. –Yury Pinsky, 23:54 In the sciences and the medical field, there could be lots of interesting breakthroughs in drug discovery in climate applications. How can they use the power of these foundational models to really benefit all of us in some way? 
–Yury Pinsky, 25:44 Your ideas still have to be your own in order for AI to work with you best and work for you best. –Yury Pinsky, 28:33 It is not the end of search. Bard is an experiment. It's complementary to search. It's this conversational collaborator. –Yury Pinsky, 32:47 Connect with Yury Pinsky: Yury is a Product Manager for Bard, leading areas including Extensions, Factuality, and multi-modality. Yury is passionate about cutting edge technology and finding ways to bring it to users around the world. Prior to serving in his current role, Yury led product teams around Natural Language and Speech Recognition for the Google Assistant, spent time building wearables in Google [X], and helped build out Google Search on mobile devices. Outside of work, Yury enjoys spending time with his family, planning his next vacation, and the daily logistics of kids' extracurricular activities. Connect on LinkedIn: https://www.linkedin.com/in/ypinsky Connect with Amanda Zantal-Wiener: Follow her on Twitter: https://twitter.com/Amanda_ZW Connect with her on LinkedIn: https://www.linkedin.com/in/amandazantalwiener/

#50: Executive Function and Voice Typing Apps

The Personal Brain Trainer Podcast: Embodying Executive Functions

Play Episode Listen Later Oct 20, 2023 59:05

In this episode of The Personal Brain Trainer, Darius and Erica explore the world of voice typing and how it helps with executive function. They discuss the cognitive benefits of voice typing, its applications for individuals with dyslexia and ADHD, and practical strategies for using voice typing to enhance productivity. Discover the power of assistive technology, the future of note-taking apps, and how to streamline your writing process. Tune in for valuable insights and tips to unlock your mind's potential in this engaging and informative discussion. Links: 1. Built-in Software Speech-to-Text Functions: Siri: https://www.apple.com/siri/ Apple Dictation: https://tinyurl.com/4rvxumdt Windows 10 Speech Recognition: https://tinyurl.com/37383efk Google Voice Typing: https://tinyurl.com/3d2susur 2. Speech-to-Text Apps: Otter: https://otter.ai/ Google Docs Voice Typing: https://tinyurl.com/2fa85tuc Gboard: https://tinyurl.com/58mff5r2 3. Desktop Software: Read Write Gold: https://tinyurl.com/4wa5drzp Dragon by Nuance: https://www.nuance.com/dragon.html SpeechTexter: https://www.speechtexter.com/ 4. Other: Executive Functioning Competency Screener: https://mymemorymentor.com/efcs/ One to one sessions with Dr. Warren: https://learningtolearn.biz/ One to one sessions with Darius: www.dyslexiawork.com Executive functions and Study Skills Course: https://tinyurl.com/n86mf2bx BulletMap Academy: https://bulletmapacademy.com/ Learning Specialist Courses: https://www.learningspecialistcourses.com/ Good Sensory Learning: https://goodsensorylearning.com/ Dyslexia at Work: www.dyslexiawork.com Brought to you by: https://goodsensorylearning.com https://learningspecialistcourses.com https://bulletmapacademy.com https://www.dyslexiaproductivitycoaching.com

work voice discover executives built dragon adhd speech apps windows dyslexia nuance typing executive functions speech recognition gboard

Generative AI News This Week - Is ChatGPT Dying? Generative AI Market Data, Meta Challenges OpenAI, ElevenLabs & More - Voicebot Podcast Ep 347

The Voicebot Podcast

Play Episode Listen Later Aug 31, 2023 60:36

The Generative AI News (GAIN) rundown is back for August 24, 2023. Special segments this week include: What does the market data say about generative AI adoption? We look at 10 charts that explain a lot about what is happening, why it is happening, and where we are headed. Meta challenges OpenAI with an open-source automated speech recognition and translation system. Game on! Generative AI winners and losers of the week. Eric Schwartz, Voicebot.ai's head writer, and Bret Kinsella gathered again this week to break down the top generative AI stories and a few other useful pieces of information. Generative AI News Links to the stories we covered this week are included below. Like Perplexity AI, we give you source links! Top Stories of the Week - Market Data

Embedded Executive: Speech Recognition on an FPGA, Achronix

Embedded Executive

Play Episode Listen Later Aug 30, 2023 12:01

Speech recognition on an FPGA? That doesn't sound like the most effective path, but Bill Jenkins of Achronix has a different opinion. Hear his take on why the FPGA is the right way to go for this application on this week's Embedded Executive podcast.

executives speech embedded fpga speech recognition bill jenkins

Voice of the Future: Exploring AI, Speech Tech, Ethics, and Regulations | A conversation with Nigel Cannings | Redefining Society with Marco Ciappelli

ITSPmagazine | Technology. Cybersecurity. Society

Play Episode Listen Later Aug 24, 2023 42:17

Guest: Nigel Cannings, CTO at Intelligent Voice [@intelligentvox]
Bio: Nigel Cannings is the CTO at Intelligent Voice. He has over 25 years' experience in both Law and Technology, is the founder of Intelligent Voice Ltd and a pioneer in all things voice. Nigel is also a regular speaker at industry events such as NVIDIA GTC and holds multiple patents in Speech, NLP and Confidential Computing technologies. He is an Industrial Fellow at the University of East London.
On LinkedIn | https://www.linkedin.com/in/nigelcannings/?originalSubdomain=uk
Google Scholar | https://scholar.google.co.uk/citations?user=zHL1sngAAAAJ&hl=en
Host: Marco Ciappelli, Co-Founder at ITSPmagazine [@ITSPmagazine] and Host of Redefining Society Podcast
On ITSPmagazine | https://www.itspmagazine.com/itspmagazine-podcast-radio-hosts/marco-ciappelli
This Episode's Sponsors: BlackCloak

152. Revisiting The Canadian Down Syndrome's Project Understood - Training Speech Recognition Technology

If We Knew Then - Down Syndrome Podcast

Play Episode Listen Later Aug 14, 2023 57:53

This episode is the entire conversation we had with Matt MacNeil and Ed Casagrande from the Canadian Down Syndrome Society concerning their collaboration with Google AI to create a database that can help train Google's speech recognition technology to better understand people with Down syndrome. Donate your voice at: https://projectunderstood.ca Learn more about the CDSS: https://cdss.ca Episode Transcript: https://ifweknewthen701833686.wordpress.com/2023/08/13/152-revisiting-the-canadian-down-syndromes-project-understood-training-speech-technology/2/ Please follow us on Twitter @ifweknewthenPOD, drop us a line on our Facebook page @ifweknewthenPOD, or visit our website https://www.IfWeKnewThen.com to send us an email with questions and comments. You can join our mailing list there and get alerts about future podcast episodes. Thank you again, and we look forward to you joining us on the next episode of IF WE KNEW THEN.

google technology training canadian project down syndrome understood google ai speech recognition cdss

The future is bright: Speech recognition and natural language understanding

Inside Angle

Play Episode Listen Later Jul 18, 2023 17:53

Speech recognition technology has been around for longer than you might think. Discover how it has evolved and advanced over the years from Thomas Schaaf, principal research scientist at 3M HIS. He started his career in the speech recognition environment in the 1990s, working for companies like Amazon and Toshiba. Listen as he shares his insights into the future of the technology for health care and beyond.

amazon discover speech bright toshiba speech recognition natural language understanding

Engines of Our Ingenuity 2719: The Mathematics of Language

Engines of Our Ingenuity

Play Episode Listen Later Jul 5, 2023 3:49

Episode: 2719 The Mathematics of Language. Today, let's see what mathematics can tell us about language.

language mathematics engines ingenuity dasher speech recognition

AI Today Podcast: AI Glossary Series – Cloud ML, On-Premise, Edge Device, Machine Learning -as-a-Service (MLaaS)

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

Play Episode Listen Later Jun 30, 2023 17:26

In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the terms Cloud ML, On-Premise, Edge Device, Machine Learning -as-a-Service (MLaaS), explain how these terms relate to AI and why it's important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certification AI Glossary Glossary Series: Training Data, Epoch, Batch, Learning Curve Glossary Series: (Artificial) Neural Networks, Node (Neuron), Layer Glossary Series: Bias, Weight, Activation Function, Convergence, ReLU Glossary Series: Perceptron Glossary Series: Hidden Layer, Deep Learning Glossary Series: Loss Function, Cost Function & Gradient Descent Glossary Series: Backpropagation, Learning Rate, Optimizer Glossary Series: Feed-Forward Neural Network Glossary Series: OpenAI, GPT, DALL-E, Stable Diffusion Glossary Series: Natural Language Processing (NLP), NLU, NLG, Speech-to-Text, TTS, Speech Recognition AI Glossary Series – Machine Learning, Algorithm, Model AI Today Podcast: AI Glossary Series – Batch Prediction, Microservice, Real-time Prediction, Stream Learning, Cold-Path Analytics, Hot-Path Analytics This episode is sponsored by Algolia: Algolia Powers Discovery. Continue reading AI Today Podcast: AI Glossary Series – Cloud ML, On-Premise, Edge Device, Machine Learning -as-a-Service (MLaaS) at AI & Data Today.

AI Today Podcast: AI Glossary Series – Batch Prediction, Microservice, Real-time Prediction, Stream Learning, Cold-Path Analytics, Hot-Path Analytics

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

Play Episode Listen Later Jun 28, 2023 19:43

In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the terms Batch Prediction, Microservice, Real-time Prediction, Stream Learning, Cold-Path Analytics, and Hot-Path Analytics, explain how these terms relate to AI and why it's important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certification AI Glossary Glossary Series: Training Data, Epoch, Batch, Learning Curve Glossary Series: (Artificial) Neural Networks, Node (Neuron), Layer Glossary Series: Bias, Weight, Activation Function, Convergence, ReLU Glossary Series: Perceptron Glossary Series: Hidden Layer, Deep Learning Glossary Series: Loss Function, Cost Function & Gradient Descent Glossary Series: Backpropagation, Learning Rate, Optimizer Glossary Series: Feed-Forward Neural Network Glossary Series: OpenAI, GPT, DALL-E, Stable Diffusion Glossary Series: Natural Language Processing (NLP), NLU, NLG, Speech-to-Text, TTS, Speech Recognition AI Glossary Series – Machine Learning, Algorithm, Model AI Glossary Series – Model Tuning and Hyperparameter AI Glossary Series: Overfitting, Underfitting, Bias, Variance, Bias/Variance Tradeoff AI Glossary Series: Operationalization Interview with Alex Measure, BLS This episode is sponsored by Algolia: Algolia Powers Discovery. Continue reading AI Today Podcast: AI Glossary Series – Batch Prediction, Microservice, Real-time Prediction, Stream Learning, Cold-Path Analytics, Hot-Path Analytics at AI & Data Today.

AI Today Podcast: AI Glossary Series – Operationalization

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

Play Episode Listen Later Jun 23, 2023 11:57

In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the term Operationalization, explain how this term relates to AI and why it's important to know about it. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certification AI Glossary Glossary Series: Training Data, Epoch, Batch, Learning Curve Glossary Series: (Artificial) Neural Networks, Node (Neuron), Layer Glossary Series: Bias, Weight, Activation Function, Convergence, ReLU Glossary Series: Perceptron Glossary Series: Hidden Layer, Deep Learning Glossary Series: Loss Function, Cost Function & Gradient Descent Glossary Series: Backpropagation, Learning Rate, Optimizer Glossary Series: Feed-Forward Neural Network Glossary Series: OpenAI, GPT, DALL-E, Stable Diffusion Glossary Series: Natural Language Processing (NLP), NLU, NLG, Speech-to-Text, TTS, Speech Recognition AI Glossary Series – Machine Learning, Algorithm, Model For more information, visit the Algolia website Glossary Series: Tokenization, Vectorization This episode is sponsored by Algolia: Algolia Powers Discovery. Continue reading AI Today Podcast: AI Glossary Series – Operationalization at AI & Data Today.

AI Today Podcast: AI Glossary – Digital Transformation, Return on Investment (ROI), Key Performance Indicator (KPI)

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

Play Episode Listen Later Jun 21, 2023 13:41

In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the terms Digital Transformation, Return on Investment (ROI), and Key Performance Indicator (KPI), explain how these terms relate to AI and why it's important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certification AI Glossary For more information, visit the Algolia website Glossary Series: Natural Language Processing (NLP), NLU, NLG, Speech-to-Text, TTS, Speech Recognition Glossary Series: Tokenization, Vectorization This episode is sponsored by Algolia: Algolia Powers Discovery. Continue reading AI Today Podcast: AI Glossary – Digital Transformation, Return on Investment (ROI), Key Performance Indicator (KPI) at AI & Data Today.

ai speech certification digital transformation key performance indicators return on investment glossary tts investment roi algolia speech recognition nlu ai today nlg free intro kathleen walch ron schmelzer

AI Today Podcast: AI Glossary Series – Confusion Matrix, Accuracy, Precision, F1, Recall, Sensitivity, Specificity, Receiver-Operating Characteristic (ROC) Curve

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

Play Episode Listen Later Jun 16, 2023 16:04

In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the terms Confusion Matrix, Accuracy, Precision, F1, Recall, Sensitivity, Specificity, Receiver-Operating Characteristic (ROC) Curve, explain how these terms relate to AI and why it's important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certification AI Glossary Glossary Series: Training Data, Epoch, Batch, Learning Curve Glossary Series: (Artificial) Neural Networks, Node (Neuron), Layer Glossary Series: Bias, Weight, Activation Function, Convergence, ReLU Glossary Series: Perceptron Glossary Series: Hidden Layer, Deep Learning Glossary Series: Loss Function, Cost Function & Gradient Descent Glossary Series: Backpropagation, Learning Rate, Optimizer Glossary Series: Feed-Forward Neural Network Glossary Series: OpenAI, GPT, DALL-E, Stable Diffusion Glossary Series: Natural Language Processing (NLP), NLU, NLG, Speech-to-Text, TTS, Speech Recognition AI Glossary Series – Machine Learning, Algorithm, Model AI Glossary Series – Model Tuning and Hyperparameter AI Glossary Series: Overfitting, Underfitting, Bias, Variance, Bias/Variance Tradeoff Glossary Series: Classification & Classifier, Binary Classifier, Multiclass Classifier, Decision Boundary Continue reading AI Today Podcast: AI Glossary Series – Confusion Matrix, Accuracy, Precision, F1, Recall, Sensitivity, Specificity, Receiver-Operating Characteristic (ROC) Curve at AI & Data Today.

#20 - Hidden Markov Models (HMMs): Sequential Data Analysis and Applications

The AI Frontier Podcast

Play Episode Listen Later Jun 4, 2023 9:12

Discover the fascinating world of Hidden Markov Models (HMMs) in this episode of "The AI Frontier" podcast. Explore the fundamentals of HMMs, their applications in fields like speech recognition, bioinformatics, and finance, and learn about their limitations and alternatives. Gain insights into the theoretical concepts and real-world use cases, and stay up-to-date with emerging trends in sequential data analysis. Join us on this journey to uncover the power and potential of HMMs in artificial intelligence and machine learning. Support the Show. Keep AI insights flowing – become a supporter of the show! Click the link for details

ai discover explore finance hidden artificial intelligence models applications machine learning robotics data analysis bioinformatics sequential markov speech recognition hmms

Assembly AI: Generative AI for Speech Recognition with CEO Dylan Fox

The MAD Podcast with Matt Turck

Play Episode Listen Later May 10, 2023 27:04

AssemblyAI Founder & CEO Dylan Fox joined FirstMark Managing Partner Matt Turck for Data Driven NYC! AssemblyAI is the fastest way to build with AI for audio. With a simple API, get access to production-ready AI models to transcribe and understand speech. AssemblyAI has raised $63M+.

ceo ai artificial intelligence api assembly generative speech recognition dylan fox ai generative ai matt turck assemblyai

AI Today Podcast: AI Glossary Series – CPU, GPU, TPU, and Federated Learning

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

Play Episode Listen Later May 5, 2023 11:24

In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the terms CPU, GPU, TPU, and Federated Learning, explain how these terms relate to AI and why it's important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certification AI Glossary Glossary Series: Artificial Intelligence AI Glossary Series – Machine Learning, Algorithm, Model Glossary Series: (Artificial) Neural Networks, Node (Neuron), Layer Glossary Series: Natural Language Processing (NLP), NLU, NLG, Speech-to-Text, TTS, Speech Recognition Continue reading AI Today Podcast: AI Glossary Series – CPU, GPU, TPU, and Federated Learning at AI & Data Today.

ai model speech algorithms certification layer cpu gpu glossary tts speech recognition tpu federated learning nlu ai today nlg free intro cpu gpu kathleen walch ron schmelzer

AI Today Podcast: AI Glossary Series – Tokenization and Vectorization

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

Play Episode Listen Later May 3, 2023 11:20

In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the terms Tokenization and Vectorization, explain how these terms relate to AI and why it's important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certification AI Glossary Glossary Series: Artificial Intelligence AI Glossary Series – Machine Learning, Algorithm, Model Glossary Series: (Artificial) Neural Networks, Node (Neuron), Layer Glossary Series: Natural Language Processing (NLP), NLU, NLG, Speech-to-Text, TTS, Speech Recognition Continue reading AI Today Podcast: AI Glossary Series – Tokenization and Vectorization at AI & Data Today.

ai model speech algorithms certification layer glossary tokenization tts speech recognition nlu ai today nlg free intro kathleen walch ron schmelzer

Homily for the Fourth Sunday of Easter - Voice Recognition not Speech Recognition

Fr. Brendan McGuire - Podcasts that Break open the Word of God

Play Episode Listen Later May 3, 2023 8:43

Today's gospel tells us that the Good Shepherd is the voice we are called to listen to because he will bring us to all truth, all goodness, and all beauty. We need to make sure that what we are listening to has that voice. (Read more…) Here is my homily from the Fourth Sunday of Easter. I hope you are enjoying this Easter Season. Alleluia, He is Risen Indeed!

good shepherd homily fourth sunday alleluia risen indeed easter season voice recognition speech recognition

344 - The power of clinical speech recognition. Dr Pieter Nel & Dr Andrew Brier - Mackay Hospital & Health Service (HHS)

Talking HealthTech

Play Episode Listen Later Apr 17, 2023 34:16

hospitals clinical keen mackay pieter health services speech recognition

AI-powered speech recognition for proactive compliance with Nigel Cannings, CTO of Intelligent Voice (UK)

Voice of FinTech

Play Episode Listen Later Mar 28, 2023 29:51

Nigel Cannings, CTO at Intelligent Voice, spoke to Rudolf Falat, founder of the Voice of FinTech podcast, about leveraging AI to ensure banks and others trade responsibly and in line with regulations. Here is what they talked about:
- Nigel's backstory
- What problem is Intelligent Voice solving and why is it worth solving?
- Key clients
- Tech angle?
- How does Intelligent Voice (IV) keep improving? Is human monitoring needed?
- Why is this solution better than the competition or incumbent solutions?
- How IV addresses the privacy concerns of individuals and companies
- How do they ensure their AI is ethical and responsible?
- Interoperability of Intelligent Voice: how easy is it to plug into the client's enterprise systems?
- Success stories in Financial Services, e.g., Daiwa
- What are your plans for the rest of the year? Hint: international expansion?
- Recommended info channels: Medium
- What's the best way to reach out? LinkedIn and website: Nigel Cannings and Intelligent Voice.

Let's Hear Dr. Michael Canfarotta - Speech Recognition Outcome Vs. Cut Off Frequency

MED-EL Podcast

Play Episode Listen Later Jan 17, 2023 11:51

Drawing on his research outcomes and clinical experience, Dr. Michael Canfarotta shares what the minimal angular insertion depth of a CI electrode should be in order to optimize hearing outcomes.

frequency outcome cutoff hear dr speech recognition electrodes

Speech Recognition Technology and UX: How Consistent Content Powers Both at Riot Games, with Cheryl Platz

WordBirds

Play Episode Listen Later Jan 3, 2023 25:42

Games like Disney Friends are changing how people communicate and relate to video game characters. You can now tell Disney favorites like Stitch that you love him. Today's guest helped develop Disney Friends and is the Director of User Experience at Riot Games. Join Cheryl Platz as she discusses speech recognition technology in the video game industry. Cheryl also talks about what it's like to work at Riot Games and how everything has to be consistent now that they have more games. You can also learn about the latest game tech from her book, Design Beyond Devices: Creating Multimodal, Cross-Device Experiences. Discover the future of the video game industry today! Love the show? Subscribe, rate, review & share! https://www.acrolinx.com/wordbirds

love director disney technology discover games powers consistent platz stitch user experience riot games speech recognition

AI Today Podcast: AI Glossary Series: Natural Language Processing (NLP), NLU, NLG, Speech-to-Text, TTS, Speech Recognition

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

Play Episode Listen Later Dec 16, 2022 11:18

In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define Natural Language Processing (NLP), Natural Language Understanding (NLU), Natural Language Generation (NLG), Speech-to-Text, Text-to-Speech, and (Automated) Speech Recognition. We share how these terms are related and how they fit into AI. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certification AI Glossary AI Glossary Series – Content Summarization & Analysis, Sentiment Analysis AI Glossary Series – Conversational Systems, Chatbots, Voice Assistants, Machine Translation AI Today Podcast #104: Patterns of AI – Conversation / Human Interaction Continue reading AI Today Podcast: AI Glossary Series: Natural Language Processing (NLP), NLU, NLG, Speech-to-Text, TTS, Speech Recognition at Cognilytica.

HPR3731: Speech recognition in Kdenlive

Hacker Public Radio

Play Episode Listen Later Nov 21, 2022

Recently I returned to Kdenlive after about a 10-year break, and was pleased to discover the speech recognition feature. https://docs.kdenlive.org/en/effects_and_compositions/speech_to_text.html#install-python

speech recognition kdenlive

From science fiction to reality: Speech recognition evolution

Inside Angle

Play Episode Listen Later Oct 25, 2022 30:54

In this episode, AI evangelist Juggy Jagannathan, PhD, discusses the advancement of speech recognition technology with Detlef Koll, global vice president of research and development at 3M Health Information Systems. Travel along the timeline of speech recognition history, starting with isolated word speech recognition all the way to continuous word speech recognition and automatic transcription technologies that create time to care for physicians.

ai reality travel phd evolution automation science fiction machine learning speech recognition 3m health information systems

OpenAI Whisper: General-Purpose Speech Recognition | SDS 620

SuperDataScience

Play Episode Listen Later Oct 21, 2022 6:34

What's your secret to superb audio recognition? Whisper it. We mean that literally—Whisper is the latest in OpenAI's growing suite of models aimed to benefit humanity. On this episode of Five-Minute Friday, host Jon Krohn reviews OpenAI's latest model, Whisper. This tool will vastly improve the way human speech is recognized and converted to text. Jon gets under the hood to show how the team managed to get such a powerfully accurate recognition model. Listen to the episode and find out how you can try it yourself, for free! Additional materials: www.superdatascience.com/620 Interested in sponsoring a SuperDataScience Podcast episode? Email natalie@jonkrohn.com for sponsorship information.

whispers openai speech recognition general purpose five minute friday jon krohn

Podcasts about Speech recognition

Best podcasts about Speech recognition

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

VUX World

Tech Ease 4 All: Windows

The AI Eye: stock news & deal tracker

Inside Angle

TechCrunch Startups – Spoken Edition

MLOps.community

Assistive Technology FAQ (ATFAQ) Podcast

Algorithms + Data Structures = Programs
