scikit learn podcasts

Emergent Intelligence: Sanjay Krishnan On AI's Rapid Evolution

Play Episode Listen Later Aug 4, 2025 42:08

What's driving the rapid evolution of AI? Where do emergent behaviors come from? In today's episode, Sanjay Krishnan joins us to unpack how modern machine learning models are developing specialized skill sets, solving increasingly complex problems, and adapting in surprising ways to real-world data. Sanjay is an Assistant Professor of Computer Science at the University of Chicago, where his research group focuses on making data systems more reliable – even when the information they handle is messy or incomplete. In the real world, data is often missing, corrupted, or uncertain, and Sanjay's work aims to help databases and information systems still function accurately under those conditions. Currently, Sanjay and his team are exploring several key challenges. One area of focus is building systems that can still provide accurate answers even when the underlying data is only partially complete. They're also developing ways to measure the trustworthiness of a result when the data has been corrupted. Another project involves automatically detecting when sensitive data may have leaked, helping improve both privacy and security in data systems. This conversation outlines: The natural progression of AI over the past 15 years. Three factors that make AI powerful. The role that human-provided data plays in machine learning. What AI poisoning is, and how it can be identified. You can learn more about Sanjay and his research by clicking here!

university chicago ai evolution silicon valley shop intelligence assistant professor rapid computer science sanjay emergent krishnan keras scikit learn

#314.src - Au delà de la prédiction: Pousser les Limites de l'IA avec Vincent Maladiere

IFTTD - If This Then Dev

Play Episode Listen Later Mar 19, 2025 58:05

"C'est le travail sur ces sujets qui a permis toute ces avancées des LLMs"Le D.E.V. de la semaine est Vincent Maladiere, co-founder @ Probabl. Vincent et Bruno explorent le rôle des modèles prédictifs dans l'analyse des données et leur aptitude à dévoiler la causalité, avec un focus sur l'intelligence artificielle. Ils discutent également des avancées dans Scikit-Learn et de la nécessité de repérer les liens causaux dépassant une simple corrélation. Certains défis évoqués sont le biais de sélection, l'analyse de survie et les complications liées aux données manquantes. Le duo conclut en réitérant que, malgré les progrès des modèles de langage, ils devraient compléter, plutôt que remplacer, les méthodes traditionnelles. Vincent termine en soulignant l'importance de l'apprentissage continu.Chapitrages00:00:53 : Introduction à la Prédiction et Causalité00:01:34 : Présentation de Vincent00:03:16 : Concepts Mathématiques et Philosophiques00:03:27 : Découverte de Scikit-Learn00:06:38 : Évolution de Scikit-Learn vers l'Industrie00:07:45 : Signaux, Causalité et Modèles00:11:30 : Prédiction vs Causalité00:15:48 : Exemples de Causalité dans Divers Domaines00:20:11 : Évaluation de Tests A/B et Causalité00:28:52 : Automatisation des Décisions Basées sur des Modèles00:30:53 : Projet ROAS et Prédiction00:33:11 : Importance des Cohortes dans l'Analyse00:35:03 : La Notion de Survie en Statistiques00:43:12 : Feature Engineering et Causalité00:47:36 : L'Impact des LLM sur la Prédiction00:52:53 : Mélange de Techniques Traditionnelles et Modernes00:55:44 : Recommandations de Contenus00:56:29 : Conclusion et Remerciements Liens évoqués pendant l'émissionSilicon Valley**Recrutez les meilleurs développeurs grâce à Indeed !** "Trouver des développeurs compétents et passionnés, comme les auditeurs d'If This Then Dev, peut être un vrai défi. Avec Indeed, connectez-vous rapidement avec des candidats qualifiés qui sauront s'épanouir dans votre entreprise. Profitez dès maintenant d'un crédit de 100 euros pour sponsoriser votre offre d'emploi : go.indeed.com/IFTTD."🎙️ Soutenez le podcast If This Then Dev ! 🎙️ Chaque contribution aide à maintenir et améliorer nos épisodes. Cliquez ici pour nous soutenir sur Tipeee 🙏Archives | Site | Boutique | TikTok | Discord | Twitter | LinkedIn | Instagram | Youtube | Twitch | Job Board |Distribué par Audiomeans. Visitez audiomeans.fr/politique-de-confidentialite pour plus d'informations.

scikit-learn & data science you own

Practical AI

Play Episode Listen Later Nov 19, 2024 52:04

We are at GenAI saturation, so let's talk about scikit-learn, a long time favorite for data scientists building classifiers, time series analyzers, dimensionality reducers, and more! Scikit-learn is deployed across industry and driving a significant portion of the "AI" that is actually in production. :probabl is a new kind of company that is stewarding this project along with a variety of other open source projects. Yann Lechelle and Guillaume Lemaitre share some of the vision behind the company and talk about the future of scikit-learn!

ai data science genai scikit learn chris benson scikit daniel whitenack

scikit-learn & data science you own (Practical AI #296)

Changelog Master Feed

Play Episode Listen Later Nov 19, 2024 52:04

We are at GenAI saturation, so let's talk about scikit-learn, a long time favorite for data scientists building classifiers, time series analyzers, dimensionality reducers, and more! Scikit-learn is deployed across industry and driving a significant portion of the "AI" that is actually in production. :probabl is a new kind of company that is stewarding this project along with a variety of other open source projects. Yann Lechelle and Guillaume Lemaitre share some of the vision behind the company and talk about the future of scikit-learn!

ai data science genai practical ai scikit learn chris benson scikit daniel whitenack

#16 - Innovation, open source et impact social : le parcours inspirant de Gaël Varoquaux

IA pas que la Data

Play Episode Listen Later Nov 13, 2024 54:47

Plongez dans l'univers de l'IA avec Gaël Varoquaux, chercheur à l'Inria, Co-fondateur de scikit-learn et ProbablComment un chercheur transforme la science des données et révolutionne le machine learning ? Dans cet épisode passionnant, Gaël Varoquaux, co-fondateur de Scikit-learn et pionnier de l'intelligence artificielle, nous raconte son parcours, de ses débuts en open-source à ses initiatives pour un futur de l'IA plus éthique et durable. Entre anecdotes inspirantes et perspectives éclairantes, découvrez les défis et les ambitions derrière la création de Probabl et le rôle croissant de l'IA dans les sciences sociales.(00:00) - Introduction à Scikit-Learn et à Gaël Varoquaux (02:51) - Parcours de Gaël Varoquaux et ses contributions (06:02) - Recherche fondamentale vs appliquée (08:57) - L'impact de Scikit-Learn sur la communauté (12:06) - Différences entre machine learning et intelligence artificielle (15:04) - Transition vers l'entrepreneuriat avec :Probabl (17:52) - Soda : IA au service des sciences sociales (20:58) - Perspectives sur l'avenir de l'IA et du machine learning (24:07) - Conclusion et réflexions finales

What it is like to maintain the scikit-learn docs

Sample Space

Play Episode Listen Later Oct 31, 2024 55:01

Scikit-learn's documentation pages are celebrated. But not everyone is aware that the project actually has somebody on payroll to take care of it. In this episode we talk to Arturo about stories from the scikit-learn documentation. In particular, the docs have a recommender that few folks are aware of. People just assume that it is manually curated, but there are a few base scikit-learn tools under the hood there. Link to the official scikit-learn MOOC: https://inria.github.io/scikit-learn-mooc/ We have a Discord these days, feel free to discuss the podcast with us there! https://discord.probabl.ai You can follow the podcast on most podcast players including apple podcasts, spotify and rss.com. - https://podcasts.apple.com/us/podcast/sample-space/id1739598572 - https://open.spotify.com/show/0BnwEHuyOlHgeZfselpn1n - https://rss.com/podcasts/sample-space/ This podcast is part of the open efforts over at probabl. To learn more you can check out website or reach out to us on social media. Website: https://probabl.ai/ Bluesky: https://bsky.app/profile/probabl.bsky.social LinkedIn: https://www.linkedin.com/company/probabl Twitter: https://x.com/probabl_ai

discord maintain blue sky docs mooc scikit learn scikit

Pausing to think about scikit-learn & OpenAI o1

Practical AI

Play Episode Listen Later Sep 17, 2024 50:12

Recently the company stewarding the open source library scikit-learn announced their seed funding. Also, OpenAI released "o1" with new behavior in which it pauses to "think" about complex tasks. Chris and Daniel take some time to do their own thinking about o1 and the contrast to the scikit-learn ecosystem, which has the goal to promote "data science that you own."

openai pausing scikit learn chris benson daniel whitenack

Pausing to think about scikit-learn & OpenAI o1 (Practical AI #287)

Changelog Master Feed

Play Episode Listen Later Sep 17, 2024 50:12

Recently the company stewarding the open source library scikit-learn announced their seed funding. Also, OpenAI released "o1" with new behavior in which it pauses to "think" about complex tasks. Chris and Daniel take some time to do their own thinking about o1 and the contrast to the scikit-learn ecosystem, which has the goal to promote "data science that you own."

openai pausing practical ai scikit learn chris benson daniel whitenack

Working as a Core Developer in the Scikit-Learn Universe - Guillaume Lemaître

DataTalks.Club

Play Episode Listen Later Jul 26, 2024 52:30

In this podcast episode, we talked with Guillaume Lemaître about navigating scikit-learn and imbalanced-learn.

google universe analytics developers guillaume llm lema stock markets scikit learn

Model safety, that's a pickle! with Adrin Jalali - scikit-learn maintainer

Sample Space

Play Episode Listen Later Jun 27, 2024 61:47

Historically it's always been the case that you would use a pickle file to store a trained scikit-learn model on disk for deployment. Pickles make sense because these are so flexible, but they do carry a security concern. Adrin has been working on a remedy called skops, which is the main topic of this podcast. To learn more about skops, make sure to check the documentation: https://skops.readthedocs.io/en/stable/

safety model historically pickle pickles maintainer scikit learn adrin

A career in data science: what is it, who is it for and how do you get into it? - with Nicholas Cifuentes of WorldQuant University

Change Work Life

Play Episode Listen Later May 7, 2024 48:17 Transcription Available

#179: Nicholas Ciufentes-Goodbody transitioned from a career in teaching to becoming the Chief Data Scientist at WorldQuant University. He explains what data science involves, what a career in data science looks like, and why it's such a popular field to work in. What you'll learn[1:45] What WorldQuant University is and the types of programs they run. [3:42] How a free university is possible in America.[4:13] The motivation for a hedge fund to run a free university. [4:50] What data science is and how it's used. [6:03] The biggest employers of data scientists. [8:21] What a typical day as a data scientist is like. [10:12] The different specialities within data science. [12:08] What it means to be an AI engineer. [12:40] What qualifications you need to become a data scientist. [14:43] The level of education you need to become a data scientist. [16:27] Character traits that successful data scientists share. [18:10] The amount of nerds working in data science. [20:08] Why musicians become data scientists and doctors. [21:15] How to transition your career to become a data scientist. [27:37] What you need on your resume when applying for a data science role. [29:07] The best ways to learn the skills you need to become a data scientist. [30:29] How to edit your CV when applying for a data science job. [31:55] How to find a good data science boot camp. [33:51] The base knowledge you need prior to joining a data science boot camp. [36:30] The income potential of a data scientist. [38:17] The career path of a data scientist. [39:28] The impact AI will have on data scientists. [42:12] The ever-changing nature of data science. Resources mentioned in this episodePlease note that some of these are affiliate links and we may get a commission in the event that you make a purchase. This helps us to cover our expenses and is at no additional cost to you.DataCampKaggleopenAFRICACourseraUdemyAn Introduction to Statistical Learning, Gareth JamesHands-on machine learning with Scikit-Learn, Keras, and TensorFlow, Aurelien GeronFluent Python, Luciano RamalhoMathematics for Human Flourishing, Francis SuFor the show notes for this episode, including a full transcript and links to all the resources mentioned, visit:https://changeworklife.com/using-chatgpt-to-supercharge-your-career/Re-assessing your career? Know you need a change but don't really know where to start? Check out these two exercises to start the journey of working out what career is right for you!

america university ai career character cv data science adviser human flourishing chief data scientist tensorflow cifuentes keras scikit learn

Cientista de Dados Sênior em Lisboa, Portugal - Dev Sem Fronteiras #137

Dev Sem Fronteiras

Play Episode Listen Later Apr 24, 2024 42:20

O natalense Bernardo se mudou para Brasília aos 8 anos de idade e, após se formar em Engenharia Elétrica com Especialização em Telecomunicações, passou a trabalhar na área, no negócio da família. Depois de constituir a própria família, Bernardo passou a pensar em morar no exterior, em busca de uma maior sensação de segurança. Como já possuía raízes e conhecidos em Portugal, mudou-se para lá onde, com seu background de engenharia (e aconselhado pelo hoje co-apresentador do podcast Let's Data, Leon Sólon), passou a estudar Ciência de Dados. Hoje, além de manter o podcast e a iniciativa Let's Data, Bernardo trabalha como cientista de dados na NielsenIQ e nos conta sobre a necessidade de se ter uma educação formal para buscar trabalho, além das particularidades – boas e talvez não tão boas – de se morar na terra do pastel de Belém nata. Fabrício Carraro, o seu viajante poliglota Bernardo Lago, Cientista de Dados Sênior em Lisboa, Portugal Links: Let's Data Podcast Let's Data Instagram do Bernardo Conheça a Escola de Data Science da Alura, descubra as diferentes possibilidades de análise de dados, do Excel ao Python, e mergulhe em frameworks e bibliotecas, como Pandas, Scikit-Learn e Seaborn. TechGuide.sh, um mapeamento das principais tecnologias demandadas pelo mercado para diferentes carreiras, com nossas sugestões e opiniões. #7DaysOfCode: Coloque em prática os seus conhecimentos de programação em desafios diários e gratuitos. Acesse https://7daysofcode.io/ Ouvintes do podcast Dev Sem Fronteiras têm 10% de desconto em todos os planos da Alura Língua. Basta ir a https://www.aluralingua.com.br/promocao/devsemfronteiras/e começar a aprender inglês e espanhol hoje mesmo! Produção e conteúdo: Alura Língua Cursos online de Idiomas – https://www.aluralingua.com.br/ Alura Cursos online de Tecnologia – https://www.alura.com.br/ Edição e sonorização: Rede Gigahertz de Podcasts

[EN] scikit-learn: Software is People

Code for Thought

Play Episode Listen Later Apr 1, 2024 43:56

scikit-learn is a highly successful and popular Python library for data science and machine learning. It is open source and has a large contributor base. I had the pleasure to meet with some of the scikit-learn team to talk about how they got involved and how it is possible to run an Open Source project of this size and scale. If you like to get involved, here are a few links to their home page and GitHub repository.https://scikit-learn.org/stable/index.html scikit-learn homepagehttps://github.com/scikit-learn/scikit-learn GitHub repohttps://blog.scikit-learn.org Blog postGo to the community page of scikit-learn to get links to LinkedIn, Twitter and othersSupport the Show.Thank you for listening and your ongoing support. It means the world to us! Support the show on Patreon https://www.patreon.com/codeforthought Get in touch: Email mailto:code4thought@proton.me UK RSE Slack (ukrse.slack.com): @code4thought or @piddie US RSE Slack (usrse.slack.com): @Peter Schmidt Mastadon: https://fosstodon.org/@code4thought or @code4thought@fosstodon.org LinkedIn: https://www.linkedin.com/in/pweschmidt/ (personal Profile)LinkedIn: https://www.linkedin.com/company/codeforthought/ (Code for Thought Profile) This podcast is licensed under the Creative Commons Licence: https://creativecommons.org/licenses/by-sa/4.0/

code software open source python github scikit learn

Probabl is a new AI company built around popular library scikit-learn

TechCrunch Startups – Spoken Edition

Play Episode Listen Later Feb 3, 2024 2:59

Probabl isn't your average AI startup as this new French company is an Inria spin-off company that revolves around an open-source data science library called scikit-learn Learn more about your ad choices. Visit megaphone.fm/adchoices

ai french built popular library new ai inria scikit learn

737: scikit-learn's Past, Present and Future, with scikit-learn co-founder Dr. Gaël Varoquaux

SuperDataScience

Play Episode Listen Later Dec 5, 2023 90:04

scikit-learn co-founder Gaël Varoquaux and Jon Krohn are live at the historic Sorbonne in Paris, where they discuss the evolution of scikit-learn. From its origins as a memory-efficient Python implementation of support vector machines to its present-day status as a pivotal resource in machine learning, Gaël paints a vivid picture of its remarkable growth. Join us for a glimpse into scikit-learn's evolution, the realm of open-source collaboration, and the transformative power of data-driven insights in today's dynamic data landscape. This episode is brought to you by Gurobi (gurobi.com/sds), the Decision Intelligence Leader, by Data Universe (https://datauniverse2024.com), the out-of-this-world data conference, and by CloudWolf (www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • The early beginnings and growth of scikit-learn [05:34] • Development principles of scikit-learn [18:05] • How to apply scikit-learn to your ML problem [21:16] • Resource-efficiency and scikit-learn development [25:32] • How to contribute to an open-source project like scikit-learn yourself [38:21] • The future of scikit-learn [51:13] • Gaël on the social-impact data projects in his Soda lab [1:02:33] • Why domain expertise and statistical rigor are more important than ever [1:11:24] Additional materials: www.superdatascience.com/737

co founders development ga resource past present python soda ml sorbonne co founder dr scikit learn jon krohn

Machine Learning in Elixir vs. Python, SQL, and Matlab with Katelynn Burns & Alexis Carpenter

Smart Software with SmartLogic

Play Episode Listen Later Nov 23, 2023 31:19

In this episode of Elixir Wizards, Katelynn Burns, software engineer at LaunchScout, and Alexis Carpenter, senior data scientist at cars.com, join Host Dan Ivovich to discuss machine learning with Elixir, Python, SQL, and MATLAB. They compare notes on available tools, preprocessing, working with pre-trained models, and training models for specific jobs. The discussion inspires collaboration and learning across communities while revealing the foundational aspects of ML, such as understanding data and asking the right questions to solve problems effectively. Topics discussed: Using pre-trained models in Bumblebee for Elixir projects Training models using Python and SQL The importance of data preprocessing before building models Popular tools used for machine learning in different languages Getting started with ML by picking a personal project topic of interest Resources for ML aspirants, such as online courses, tutorials, and books The potential for Elixir to train more customized models in the future Similarities between ML approaches in different languages Collaboration opportunities across programming communities Choosing the right ML approach for the problem you're trying to solve Productionalizing models like fine-tuned LLM's The need for hands-on practice for learning ML skills Continued maturation of tools like Bumblebee in Elixir Katelynn's upcoming CodeBeam talk on advanced motion tracking Links mentioned in this episode https://launchscout.com/ https://www.cars.com/ Genetic Algorithms in Elixir (https://pragprog.com/titles/smgaelixir/genetic-algorithms-in-elixir/) by Sean Moriarity Machine Learning in Elixir (https://pragprog.com/titles/smelixir/machine-learning-in-elixir/) by Sean Moriarity https://github.com/elixir-nx/bumblebee https://github.com/huggingface https://www.docker.com/products/docker-hub/ Programming with MATLAB (https://www.mathworks.com/products/matlab/programming-with-matlab.html) https://elixirforum.com/ https://pypi.org/project/pyspark/ Machine Learning Course (https://online.stanford.edu/courses/cs229-machine-learning) from Stanford School of Engineering Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow (https://www.oreilly.com/library/view/hands-on-machine-learning/9781492032632/) by Aurélien Géron Data Science for Business (https://data-science-for-biz.com/) by Foster Provost & Tom Fawcett https://medium.com/@carscomtech https://github.com/k-burns Code Beam America (https://codebeamamerica.com/) March, 2024 Special Guests: Alexis Carpenter and Katelynn Burns.

LO SPORCO SEGRETO di scikit-learn che probabilmente ha fatto perdere MILIONI a svariate compagnie

La Miniera

Play Episode Listen Later Nov 16, 2023 58:45

⬇⬇⬇APRIMI⬇⬇ Abbonati qui: https://www.youtube.com/economiaitalia/join https://www.patreon.com/join/EconomiaItalia? LO SPORCO SEGRETO di scikit-learn che probabilmente ha fatto perdere MILIONI a svariate compagnie In un mondo di modelli predittivi, la calibrazione delle probabilità è essenziale. Immagina di vendere due tazze diverse e dover prevedere quale un cliente comprerebbe. La "regressione isotonica" può migliorare la precisione, allineando le probabilità stimate con quelle reali. Una calibrazione accurata è cruciale per decisioni commerciali sagge. Qui per segnalare temi: https://tellonym.me/dr.elegantia Podcast (su tutte le piattaforme): https://www.spreaker.com/show/dr-elegantia-podcast COME SOSTENERCI: Il nostro nuovo libro sull'economia: Guida Terrestre per Autoeconomisti https://www.poliniani.com/product-page/guida-terrestre link acquisto Amazon: https://amzn.to/36XTXs8 Acquistando le nostre T-shirt dedicate ai dati stampate in Serigrafia Artigianale con passione e orgoglio dai detenuti del Carcere Lorusso e Cutugno di Torino https://bit.ly/3zNsdkd e HTTPS://urly.it/3nga1 Guida al VOTO 2022: https://amzn.to/3KflXHd DonazionI Paypal: https://paypal.me/appuntiUAB Vuoi sostenermi ma non sborsare nemmeno un euro? Usa questo link per per il tuo prossimo acquisto su Amazon: https://amzn.to/2JGRyGT Qui trovi i libri che consiglio per iniziare a capirne di più sull'economia: https://www.youtube.com/watch?v=uEaIk8wQ3z8 Dove ci trovi: https://www.umbertobertonelli.it/info/ https://linktr.ee/economiaitalia La mia postazione: Logitech streamcam https://amzn.to/3HR6xq0 Luci https://amzn.to/3n6qtgP Shure MV7https://amzn.to/3HRh7k1 Asta https://amzn.to/3HSRvzY #economiaitalia #drelegantia #economia

united states amazon torino voto logitech guida compagnie fatto perdere milioni probabilmente segreto immagina scikit learn sporco

LO SPORCO SEGRETO di scikit-learn che probabilmente ha fatto perdere MILIONI a svariate compagnie

Dr.Elegantia podcast

Play Episode Listen Later Nov 16, 2023 58:45

⬇⬇⬇APRIMI⬇⬇ Abbonati qui: https://www.youtube.com/economiaitalia/join https://www.patreon.com/join/EconomiaItalia? LO SPORCO SEGRETO di scikit-learn che probabilmente ha fatto perdere MILIONI a svariate compagnie In un mondo di modelli predittivi, la calibrazione delle probabilità è essenziale. Immagina di vendere due tazze diverse e dover prevedere quale un cliente comprerebbe. La "regressione isotonica" può migliorare la precisione, allineando le probabilità stimate con quelle reali. Una calibrazione accurata è cruciale per decisioni commerciali sagge. Qui per segnalare temi: https://tellonym.me/dr.elegantia Podcast (su tutte le piattaforme): https://www.spreaker.com/show/dr-elegantia-podcast COME SOSTENERCI: Il nostro nuovo libro sull'economia: Guida Terrestre per Autoeconomisti https://www.poliniani.com/product-page/guida-terrestre link acquisto Amazon: https://amzn.to/36XTXs8 Acquistando le nostre T-shirt dedicate ai dati stampate in Serigrafia Artigianale con passione e orgoglio dai detenuti del Carcere Lorusso e Cutugno di Torino https://bit.ly/3zNsdkd e HTTPS://urly.it/3nga1 Guida al VOTO 2022: https://amzn.to/3KflXHd DonazionI Paypal: https://paypal.me/appuntiUAB Vuoi sostenermi ma non sborsare nemmeno un euro? Usa questo link per per il tuo prossimo acquisto su Amazon: https://amzn.to/2JGRyGT Qui trovi i libri che consiglio per iniziare a capirne di più sull'economia: https://www.youtube.com/watch?v=uEaIk8wQ3z8 Dove ci trovi: https://www.umbertobertonelli.it/info/ https://linktr.ee/economiaitalia La mia postazione: Logitech streamcam https://amzn.to/3HR6xq0 Luci https://amzn.to/3n6qtgP Shure MV7https://amzn.to/3HRh7k1 Asta https://amzn.to/3HSRvzY #economiaitalia #drelegantia #economiaDiventa un supporter di questo podcast: https://www.spreaker.com/podcast/dr-elegantia-podcast--5692498/support.

united states amazon torino voto logitech guida compagnie fatto perdere milioni probabilmente segreto immagina scikit learn sporco

AI Today Podcast: AI Glossary Series – Machine Learning Tools: Keras, PyTorch, Scikit Learn, TensorFlow, Apache Spark, Kaggle

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

Play Episode Listen Later Jul 14, 2023 17:24

In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the terms Machine Learning Tools: Keras, PyTorch, Scikit Learn, TensorFlow, Apache Spark, Kaggle, explain how these terms relate to AI and why it's important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certification AI Glossary Glossary Series: (Artificial) Neural Networks, Node (Neuron), Layer Glossary Series: Bias, Weight, Activation Function, Convergence, ReLU Glossary Series: Perceptron Glossary Series: Hidden Layer, Deep Learning Glossary Series: Loss Function, Cost Function & Gradient Descent Glossary Series: Backpropagation, Learning Rate, Optimizer Glossary Series: Feed-Forward Neural Network AI Glossary Series – Machine Learning, Algorithm, Model Continue reading AI Today Podcast: AI Glossary Series – Machine Learning Tools: Keras, PyTorch, Scikit Learn, TensorFlow, Apache Spark, Kaggle at Cognilytica.

#71: Alex O'Connor — Transformers, Generative AI, and the Deep Learning Revolution

Getting Simple

Play Episode Listen Later Apr 26, 2023 105:32

Alex O'Connor—researcher and ML manager—on the latest trends of generative AI. Language and image models, prompt engineering, the latent space, fine-tuning, tokenization, textual inversion, adversarial attacks, and more. Alex O'Connor got his PhD in Computer Science from Trinity College, Dublin. He was a postdoctoral researcher and funded investigator for the ADAPT Centre for digital content, at both TCD and later DCU. In 2017, he joined Pivotus, a Fintech startup, as Director of Research. Alex has been Sr Manager for Data Science & Machine Learning at Autodesk for the past few years, leading a team that delivers machine learning for e-commerce, including personalization and natural language processing. Favorite quotes “None of these models can read.” “Art in the future may not be good, but it will be prompt.” Mastodon Books Machine Learning Systems Design by Chip Huyen Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by Aurélien Géron Papers The Illustrated Transformer by Jay Alammar Attention Is All You Need by Google Brain Transformers: a Primer by Justin Seonyong Lee Links Alex in Mastodon ★ Training Dream Booth Multimodal Art on HuggingFace by @akhaliq NeurIPS arxiv.org: Where most papers get published Nono's Discord Suggestive Drawing: Nono's master's thesis Crungus is a fictional character from Stable Diffusion's latent space Machine learning models Stable Diffusion Arcane Style Stable Diffusion fine-tuned model ★ Imagen DALL-E CLIP GPT and ChatGPT BERT, ALBERT & RoBERTa Bloom word2vec Mupert.ai and Google's MusicLM t-SNE and UMAP: Dimensionality reduction techniques char-rnn Sites TensorFlow Hub HuggingFace Spaces ★ DreamBooth Jasper AI Midjourney Distill.pub ★ Concepts High-performance computing (HPC) Transformers and Attention Sequence transformers Quadratic growth Super resolution Recurrent neural networks (RNNs) Long short-term memory networks (LSTMs) Gated recurrent units (GRUs) Bayesian classifiers Machine translation Encoder-decoder Gradio Tokenization ★ Embeddings ★ Latent space The distributional hypothesis Textual inversion ★ Pretrained models Zero-shot learning Mercator projection People mentioned Ted Underwood UIUC Chip Huyen Aurélien Géron Chapters 00:00 · Introduction 00:40 · Machine learning 02:36 · Spam and scams 15:57 · Adversarial attacks 20:50 · Deep learning revolution 23:06 · Transformers 31:23 · Language models 37:09 · Zero-shot learning 42:16 · Prompt engineering 43:45 · Training costs and hardware 47:56 · Open contributions 51:26 · BERT and Stable Diffusion 54:42 · Tokenization 59:36 · Latent space 01:05:33 · Ethics 01:10:39 · Fine-tuning and pretrained models 01:18:43 · Textual inversion 01:22:46 · Dimensionality reduction 01:25:21 · Mission 01:27:34 · Advice for beginners 01:30:15 · Books and papers 01:34:17 · The lab notebook 01:44:57 · Thanks I'd love to hear from you. Submit a question about this or any previous episodes. Join the Discord community. Meet other curious minds. If you enjoy the show, would you please consider leaving a short review on Apple Podcasts/iTunes? It takes less than 60 seconds and really helps. Show notes, transcripts, and past episodes at gettingsimple.com/podcast. Thanks to Andrea Villalón Paredes for editing this interview. Sleep and A Loop to Kill For songs by Steve Combs under CC BY 4.0. Follow Nono Twitter.com/nonoesp Instagram.com/nonoesp Facebook.com/nonomartinezalonso YouTube.com/nonomartinezalonso

PNE 004: PyCon US - Carla Florida

Pizza de Dados

Play Episode Listen Later Apr 18, 2023 9:50

A entrevistada, Carla Florida, desse episódio trás experiências como pesquisadora de inteligência artifical na ICI UPB (universidade) da Bolívia. Vem assistir e dizer pra gente o que tu achou. Apoie o pizza: Se você ou sua empresa deseja apadrinhar episódios do Pizza e contribuir com a disseminação do conteúdo de ciência de dados em português, manda um e-mail pra gente. -------- Participantes: Carla Florida Instagram Ana Cecília Vieira Twitter Escute agora: Recomendação de leitura: Mãos à Obra: Aprendizado de Máquina com Scikit-Learn e TensorFlow (Hands–On Machine Learning with Scikit–Learn and TensorFlow) -------- Créditos: Produção Esse episódio foi produzido com a colaboração de Ana Cecília Vieira. -------- Escute:

pizza vem bol vieira escute recomenda pycon scikit learn ana cec

Sebastian Raschka: Learning ML, Responsible AI, AGI | Learning from Machine Learning #4

Learning from Machine Learning

Play Episode Play 35 sec Highlight Listen Later Mar 26, 2023 67:58

This episode we welcome Sebastian Raschka, Lead AI Educator at Lightning and author of Machine Learning with Pytorch and Scikit-Learn to discuss the best ways to learn machine learning, his open source work, how to use chatGPT, AGI, responsible AI and so much more. Sebastian is a fountain of knowledge and it was a pleasure to get his insights on this fast moving industry. Learning from Machine Learning, a podcast that explores more than just algorithms and data: Life lessons from the experts. Resources to learn more about Sebastian Raschka and his work:https://sebastianraschka.com/https://lightning.ai/Machine Learning with Pytorch and Scikit-LearnMachine Learning Q and AIResources to learn more about Learning from Machine Learning and the host: https://www.linkedin.com/company/learning-from-machine-learninghttps://www.linkedin.com/in/sethplevine/https://medium.com/@levine.seth.ptwitterReferences from Episodehttps://scikit-learn.org/stable/http://rasbt.github.io/mlxtend/https://github.com/BioPandas/biopandasUnderstanding and Coding the Self-Attention Mechanism of Large Language Models From ScratchAndrew Ng - https://www.andrewng.org/Andrej Karpathy - https://karpathy.ai/Paige Bailey - https://github.com/dynamicwebpaigeContents01:15 - Career Background05:18 - Industry vs. Academia08:18 - First Project in ML15:04 - Open Source Projects Involvement20:00 - Machine Learning: Q&AI24:18 - ChatGPT as Brainstorm Assistant25:38 - Hype vs. Reality27:55 - AGI31:00 - Use Cases for Generative Models34:01 - Should the goal to be to replicate human intelligence?39:18 - Delegating Tasks using LLM42:26 - ML Models are overconfident on Out of Distribution44:54 - Responsible AI and ML45:59 - Complexity of ML Systems47:26 - Trend for ML Practitioners to move to AI Ethics49:27 - What advice would you give to someone just starting out?52:20 - Advice that you've received that has helped you54:08 - Andrew Ng Advice55:20 - Exercise of Implementing Algorithms from Scratch59:00 - Who else has influenced you?01:01:18 - Production and Real-World Applications - Don't reinvent the wheel01:03:00 - What has a career in ML taught you about life?

learning ai advice exercise chatgpt production hype trend lightning machine learning complexity coding ml agi use cases responsible ai pytorch andrej karpathy scikit learn paige bailey

#419: Free AI Lab (ft Dr Mike Pound of Computerphile fame)

David Bombal

Play Episode Listen Later Mar 3, 2023 28:33

Train your own AI using this free Lab created by Dr Mike Pound. Big thanks to Brilliant for sponsoring this video! Get started with a free 30 day trial and 20% discount: https://brilliant.org/DavidBombal How do you capitalize on this trend and learn AI? Dr Mike Pound of Computerphile fame shows us practically how to train your own AI. And the great news is that he has shared his Google colab lab with us to you can start learning for free! If you are into cybersecurity or any other tech field, you probably want to learn about AI and ML. They can really help your resume and help you increase the $$$ you earn. Machine Learning / Artificial Intelligence is a fantastic opportunity for you to get a better job. Start learning this amazing technology today and start learning with one of the best! // LAB // Go here to access the lab: https://colab.research.google.com/dri... // Previous Videos // Roadmap to ChatGPT and AI mastery: • Roadmap to ChatGP... I challenged ChatGPT to code and hack: • I challenged Chat... The truth about AI and why you should learn it - Computerphile explains: • The truth about A... // Dr Mike's recommend AI Book // Deep learning by Ian Goodfellow, Yoshua Bengio and Aaron Courville: https://amzn.to/3vmu4LP // Dawid's recommend Books // 1. Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: https://amzn.to/3IrGCHi 2. Pattern Recognition and Machine Learning: https://amzn.to/3IWVm2v 3. Machine Learning: A Probabilistic Perspective: https://amzn.to/3xYFM05 4. Python Machine Learning: https://amzn.to/3y0r08Q 5. Deep Learning: https://amzn.to/3kxSbVu 6. The Elements of Statistical Learning: https://amzn.to/3Iwuuox 7. Linear Algebra and Its Applications: https://amzn.to/3EGwMAs 8. Probability Theory: https://amzn.to/3IrGeZm 9. Calculus: Early Transcendentals: https://amzn.to/3Z3Eugh 10. Discrete Mathematics with Applications: https://amzn.to/3Zpzpyt 11. Mathematics for Machine Learning: https://amzn.to/3m8jp5N 12. A Hands-On Introduction to Data Science: https://amzn.to/3Szob8c 13. Introduction to Algorithms: https://amzn.to/3xXo50K 14. Artificial Intelligence: https://amzn.to/3Z2fqGv // Courses and tutorials // AI For Everyone by Andrew Ng: https://www.coursera.org/learn/ai-for... PyTorch Tutorial From Research to Production: https://www.infoq.com/presentations/p... Scikit-learn Machine Learning in Python: https://scikit-learn.org/stable/ // PyTorch // Github: https://github.com/pytorch Website: https://pytorch.org/ Documentation: https://ai.facebook.com/tools/pytorch/ // Mike SOCIAL // Twitter: https://twitter.com/_mikepound YouTube: / computerphile Website: https://www.nottingham.ac.uk/research... // David SOCIAL // Discord: https://discord.com/invite/usKSyzb Twitter: https://www.twitter.com/davidbombal Instagram: https://www.instagram.com/davidbombal LinkedIn: https://www.linkedin.com/in/davidbombal Facebook: https://www.facebook.com/davidbombal.co TikTok: http://tiktok.com/@davidbombal YouTube: / davidbombal // MY STUFF // https://www.amazon.com/shop/davidbombal // SPONSORS // Interested in sponsoring my videos? Reach out to my team here: sponsors@davidbombal.com Please note that links listed may be affiliate links and provide me with a small percentage/kickback should you use them to purchase any of the items listed or recommended. Thank you for supporting me and this channel! #chatgpt #computerphile #ai

Making Machine Learning more accessible | Sebastian Raschka

Machine Learning Podcast - Jay Shah

Play Episode Listen Later Dec 29, 2022 82:39

Sebastian Raschka is the lead AI educator at GridAI. He is the author of the book "Machine Learning with PyTorch and Scikit Learn" and also a few other books that cover the fundamentals of #machinelearning and #deeplearning techniques and implementing them with Python. He is also an Assistant Professor of Statistics at the University of Wisconsin-Madison and has been actively involved in making ML more accessible to beginners through his blogs, video tutorials, tweets and of course his books. He also holds a doctorate in Computational and Quantitative Biology from Michigan State University.Time Stamps of the Podcast00:00:00 Introductions00:02:40 Entry point in AI/ML that made you interested in it00:05:30 How did you go about learning the basics and implementation of various methods?00:11:45 What makes Python ideal for learning Machine Learning recently?00:21:54 What is your book about and who is this for?00:33:55 What goes into writing a good technical book?00:40:50 Applying ML to toy datasets vs real-world research problems00:47:40 Choosing b/w machine learning methods & deep learning methods00:56:22 Large models vs architecture efficient models 01:01:25 Interpretability & Explainability in AI01:08:45 Insights for people interested in machine learning research, academia or PhD01:14:17 Keeping up with research in deep learningSebastian's homepage: https://sebastianraschka.com/Twitter: https://mobile.twitter.com/rasbtLinkedIn: https://www.linkedin.com/in/sebastianraschka/His book: https://www.amazon.com/Machine-Learning-PyTorch-Scikit-Learn-scikit-learn-ebook-dp-B09NW48MR1/dp/B09NW48MR1/Video Tutorials: @SebastianRaschka About the Host:Jay is a Ph.D. student at Arizona State University.Linkedin: https://www.linkedin.com/in/shahjay22/Twitter: https://twitter.com/jaygshah22Reach out to https://www.public.asu.edu/~jgshah1/ for any queries.Stay tuned for upcoming webinars!***Disclaimer: The information contained in this video represents the views and opinions of the speaker and does not necessarily represent the views or opinions of any institution. It does not constitute an endorsement by any Institution or its affiliates of such video content.***Checkout these Podcasts on YouTube: https://www.youtube.com/c/JayShahmlAbout the author: https://www.public.asu.edu/~jgshah1/

Emmanuel Turlay, Founder and CEO of Sematic and machine learning pioneer, discusses what's required to turn every software engineer into an ML engineer

AI and the Future of Work

Play Episode Listen Later Dec 4, 2022 45:10

Emmanuel Turlay spent more than a decade in engineering roles at tech-first companies like Instacart and Cruise before realizing machine learning engineers need a better solution. Emmanuel started Sematic earlier this year and was part of the YC summer 2022 batch. He recently raised a $3M seed round from investors including Race Capital and Soma Capital. Thanks to friend of the podcast and former guest Hina Dixit from Samsung NEXT for the intro to Emmanuel.I've been involved with the AutoML space for five years and, for full disclosure, I'm on the board of Auger which is in a related space. I've seen the space evolve and know how much room there is for innovation. This one's a great education about what's broken and what's ahead from a true machine learning pioneer.Listen and learn...How to turn every software engineer into a machine learning engineerHow AutoML platforms are automating tasks performed in traditional ML toolsHow Emmanuel translated learning from Cruise, the self-driving car company, into an open source platform available to all data engineering teamsHow to move from building an ML model locally to deploying it to the cloud and creating a data pipeline... in hoursWhat you should know about self-driving cars... from one of the experts who developed the brains that power themWhy 80% of AI and ML projects failReferences in this episode:Unscrupulous users manipulate LLMs to spew hateHina Dixit from Samsung NEXT on AI and the Future of WorkApache BeamEliot Shmukler, Anomalo CEO, on AI and the Future of Work

Marko Plahuta o analizah slovenskega koščka Twitterja in Mastodonu

Odbita do bita

Play Episode Listen Later Nov 24, 2022 33:15

Twitter zaradi prevzema Elona Muska izgublja uporabnike in prešteva kritike strokovnjakov, zaposlenih in tviterašev. Kakšno je razpoloženje na slovenskem delu Twitterja, koliko uporabnikov še tvita v slovenščini, o čem govorijo in kako natančno lahko na podlagi Twitterja napovemo volilne rezultate? Omrežju Mastodon, ki velja za alternativo Twitterju, se je do sredine novembra pridružilo milijon uporabnikov, število narašča, povprečje dnevnih uporabnikov se povečuje. Marko Plahuta je programer, ki se ukvarja s strojnim učenjem na področju obdelave jezika. Z raziskovanjem in vizualizacijo se ukvarja v prostem času. Zapiski: About the author - Virostatiq CENTER ZA JEZIKOVNE VIRE IN TEHNOLOGIJE Filmski pojmovnik – Slovenska kinoteka Kviz! Kaj Marko uporablja: Elastic Search za shranjevanje, iskanje in preproste agregacije Twitterjev API za zajemanje podatkov s Twitterja Naučene jezikovne modele, dostopne na HuggingFace, kot osnovo za klasifikatorje in generativne modele To zgoraj skupaj s knjižnicami TensorFlow/Keras in PyTorch spaCy, ki je nedavno izšel za slovenščino Classla, ki je podoben spaCyju, a temelji na Stanfordovi tehnologiji Starejše jezikovne tehnologije, zbrane v knjižnicah Gensim in Scikit-Learn UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction — umap 0.5 documentation GitHub - facebookresearch/faiss: A library for efficient similarity search and clustering of dense vectors. The hdbscan Clustering Library — hdbscan 0.8.1 documentation GitHub - eliorc/node2vec: Implementation of the node2vec algorithm. Zanimivosti iz tehnološkega sveta pošiljava tudi v elektronske nabiralnike. Naročilnica na Odbito pismo je tukaj. Razpravi o odbitih temah se lahko pridružite na Twitterju. Dosegljiva sva tudi na naslovu: odbita@rtvslo.si. Podkast Odbita do bita je brezplačno na voljo v vseh aplikacijah za podkaste. Naročite se in podkast ocenite.

Machine Learning for Autonomous Vehicles • Oscar Beijbom & Prayson Daniel

GOTO - Today, Tomorrow and the Future

Play Episode Listen Later Nov 11, 2022 42:52 Transcription Available

This interview was recorded for GOTO Unscripted.gotopia.techRead the full transcription of this interview hereOscar Beijbom - Co-Founder at NyckelPrayson Daniel - Principal Data Scientist at NTT DATADESCRIPTIONSelf-driving vehicles have been a hot topic for a while now and everyone is waiting for the next breakthrough. Prayson Daniel, principal data scientist at NTT DATA, and Oscar Beijbom, co-founder at Nyckel, stuck their heads together to review what type of machine learning data is needed to run an autonomous vehicle. Furthermore, they talked about topics such as security, language choices and when the time of deploying a model has come.RECOMMENDED BOOKSPhil Winder • Reinforcement LearningKelleher & Tierney • Data Science (The MIT Press Essential Knowledge series)Lakshmanan, Robinson & Munn • Machine Learning Design PatternsLakshmanan, Görner & Gillard • Practical Machine Learning for Computer VisionAurélien Géron • Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlowTwitterLinkedInFacebookLooking for a unique learning experience?Attend the next GOTO conference near you! Get your ticket: gotopia.techSUBSCRIBE TO OUR YOUTUBE CHANNEL - new videos posted almost daily

ai artificial intelligence ethics robinson programming machine learning av data science ml self driving cars autonomous vehicles autonomous cars ai models keras ntt data nyckel scikit learn

Sebastian Raschka: AI Education and Research

The Gradient Podcast

Play Episode Listen Later Jul 29, 2022 63:32

In episode 36 of The Gradient Podcast, Daniel Bashir speaks to Sebastian Raschka.Sebastian is an Assistant Professor of Statistics at the University of Wisconsin-Madison and Lead AI Educator at Lightning AI. He has written two bestselling books: Python Machine Learning and Machine Learning with PyTorch and Scikit-Learn.Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSSFollow The Gradient on TwitterSections:(00:00) Intro(01:10) Sebastian's intro to AI(05:15) Sebastian's process for learning new things(12:15) Learning style varies with purpose(16:10) Ordinal Regression(31:00) Solving rank inconsistency with conditional probability(35:00) Semi-Adversarial Networks(44:15) Why Sebastian got into education(52:45) Lightning AI(1:00:00) Sebastian's advice for educators(1:03:30) Be cool like Sebastian and follow the Gradient(1:03:40) OutroEpisode Links:Sebastian's HomepageSebastian's TwitterSebastian's Books Get full access to The Gradient at thegradientpub.substack.com/subscribe

university learning ai education research assistant professor statistics machine learning wisconsin madison gradient pytorch ai education scikit learn books get

What's the Life Cycle of a Data Scientist? • Ekaterina Sirazitdinova, Prayson Daniel & Nicholai Stålung

GOTO - Today, Tomorrow and the Future

Play Episode Listen Later Jun 17, 2022 28:38 Transcription Available

This interview was recorded at GOTO Copenhagen 2021 for GOTO Unscripted.gotopia.techRead the full transcription of this interview hereEkaterina Sirazitdinova - Data Scientist for Computer Vision, Video Analytics & Deep Learning at NVIDIAPrayson Daniel - Principal Data Scientist at NTT DATANicholai Stålung - Lead Data Scientist at TriforkDESCRIPTIONData science is so much more than collecting, sorting and analyzing data. What does it take to be a data scientist and how does a day in the life of a data scientist look like? Ekaterina Sirazitdinova, Prayson Daniel and Nicholai Stålung will give you an insight into this and more. RECOMMENDED BOOKSPhil Winder • Reinforcement LearningKelleher & Tierney • Data Science (The MIT Press Essential Knowledge series)Lakshmanan, Robinson & Munn • Machine Learning Design PatternsLakshmanan, Görner & Gillard • Practical Machine Learning for Computer VisionAurélien Géron • Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlowTwitterLinkedInFacebookLooking for a unique learning experience?Attend the next GOTO conference near you! Get your ticket at gotopia.techSUBSCRIBE TO OUR YOUTUBE CHANNEL - new videos posted almost daily.Discovery MattersA collection of stories and insights on matters of discovery that advance life...Listen on: Apple Podcasts Spotify

ai artificial intelligence robinson programming machine learning nvidia data science ml lung lifecycle data scientists deep learning computer vision keras ntt data lead data scientist scikit learn nicholai discovery mattersa

Streaming data with ease. With Chip Kent from Deephaven Data Labs (Ep. 198)

Data Science at Home

Play Episode Listen Later May 27, 2022 23:48

In this episode, I am with Chip Kent, chief data scientist at Deephaven Data Labs. We speak about streaming data, real-time, and other powerful tools part of the Deephaven platform. Links Deephaven - https://deephaven.io Deephaven Community Core Documentation - https://deephaven.io/core/docs/ Deephaven Community Slack - https://join.slack.com/t/deephavencommunity/shared_invite/zt-11x3hiufp-DmOMWDAvXv_pNDUlVkagLQ GitHub: Deephaven Community Core - https://github.com/deephaven/deephaven-core Barrage - https://github.com/deephaven/barrage Deephaven web components - https://github.com/deephaven/web-client-ui YouTube Channel - https://www.youtube.com/channel/UCoaYOlkX555PSTTJz8ZaI_w Blog posts Real-time classification with Deephaven and SciKit-Learn - https://deephaven.io/blog/2022/02/02/learn-scikit/ Display a quadrillion rows of data in the browser - https://deephaven.io/blog/2022/01/24/displaying-a-quadrillion-rows/ A performance comparison between Materialize and Deephaven - https://deephaven.io/blog/2022/03/05/deephaven-materialize-study/ Careers https://deephaven.io/company/careers/ Community Slack http://deephaven.io/slack.

data streaming chip careers kent ease labs display materialize scikit learn

Kubeflow for Machine Learning • Holden Karau & Adi Polak

GOTO - Today, Tomorrow and the Future

Play Episode Listen Later May 20, 2022 49:33 Transcription Available

This interview was recorded for the GOTO Book Club.gotopia.tech/bookclubRead the full transcription of the interview hereHolden Karau - Co-Author of "Kubeflow for Machine Learning" & Open Source Engineer at NetflixAdi Polak - VP of Developer Experience at Treeverse & Contributing to lakeFS OSSDESCRIPTIONMachine Learning has been declared dead several times but that's far from true. Join Adi Polak, vice president of developer experience at Treeverse, and Holden Karau, open source engineer at Netflix, in their conversation about Kubeflow and how it provides better tooling in the ML space. The discussion touches on Holden's book “Kubeflow for Machine Learning” and expands to cover the worlds of Ray and Dask.RECOMMENDED BOOKSHolden Karau, Trevor Grant, Boris Lublinsky, Richard Liu & Ilan Filonenko • Kubeflow for Machine LearningHolden Karau • Distributed Computing 4 KidsHolden Karau • Scaling Python with DaskHolden Karau & Boris Lublinsky • Scaling Python with RayHolden Karau & Rachel Warren • High Performance SparkHolden Karau, Konwinski, Wendell & Zaharia • Learning SparkHolden Karau & Krishna Sankar • Fast Data Processing with Spark 2nd EditionHolden Karau • Fast Data Processing with Spark 1st EditionAdi Polak • Machine Learning with Apache SparkPhil Winder • Reinforcement LearningAurélien Géron • Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlowTwitterLinkedInFacebookLooking for a unique learning experience?Attend the next GOTO conference near you! Get your ticket at gotopia.techSUBSCRIBE TO OUR YOUTUBE CHANNEL - new videos posted almost daily.Discovery MattersA collection of stories and insights on matters of discovery that advance life...Listen on: Apple Podcasts Spotify Health, Wellness & Performance Catalyst w/ Dr. Brad CooperLooking for a catalyst to optimize your health, wellness & performance? You've found it!!Listen on: Apple Podcasts Spotify

netflix ai artificial intelligence cloud spark machine learning ml contributing wendell kubernetes polak tf das k cloud native tensorflow keras developer experience k8s apache spark scikit learn scikit richard liu trevor grant holden karau discovery mattersa

Practical Machine Learning and Deep learning

The Data Exchange with Ben Lorica

Play Episode Listen Later May 12, 2022 48:27

Sebastian Raschka is lead author of a new book from Packt entitled “Machine Learning with PyTorch and Scikit-Learn”. He is also an Assistant Professor of Statistics at the University of Wisconsin (Madison), and serves as the Lead AI Educator at Grid.ai. Download a FREE copy of our recent NLP Industry Survey Results: https://gradientflow.com/2021nlpsurvey/Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.Detailed show notes can be found on The Data Exchange web site.

university practical assistant professor statistics machine learning grid detailed wisconsin madison deep learning packt pytorch scikit learn

GRID.ai's Lead AI Educator Sebastian Raschka

How AI Happens

Play Episode Listen Later Apr 21, 2022 31:08

Joining us today on How AI Happens is Sebastian Raschka, Lead AI educator at GRID.ai and Assistant Professor of Statistics at the University of Wisconsin-Madison. Sebastian fills us in on the coursework he's creating in his role at GRID.ai, and we find out what can be attributed to the crossover of machine learning in academia and the private sector. We speculate on the pros and cons of the commodification of deep learning models and which machine learning framework is better: PyTorch or TensorFlow. Key Points From This Episode:Sebastian Raschka's journey from the computation of biology to AI and machine learning.The focus of his current role as Lead AI educator at GRID.ai.The ideal applications and outcomes of the coursework Sebastian is developing.The crossover of machine learning in academia and the private sector; the theory versus the application.Deep learning versus machine learning and what constitutes a deep learning problem.The importance of sufficient data for deep learning to be effective.The applications of the BERT text model.The pros and cons of developing more accessible models.Why Sebastian set out to write Machine Learning with PyTorch and Scikit-Learn.The structure of the book, including theory and application.Why Sebastian prefers PyTorch over TensorFlow.What he finds most exciting in the current deep learning space.The emerging opportunities to use deep learning!Tweetables:“In academia, the focus is more on understanding how deep learning works… On the other hand, in the industry, there are [many] use cases of machine learning.” — @rasbt [0:10:10]“Often it is hard to formulate answers as a human to complex questions.” — @rasbt [0:12:53]“In my experience, deep learning can be very powerful but you need a lot of data to make it work well.” — @rasbt [0:14:06]“In [Machine Learning with PyTorch and Scikit-Learn], I tried to provide a resource that is a hybrid between more theoretical books and more applied books.” — @rasbt [0:23:21]“Why I like PyTorch is that it gives me the readability [and] flexibility to customize things.” — @rasbt [0:25:55]Links Mentioned in Today's Episode:Sebastian RaschkaSebastian Raschka on TwitterGRID.aiMachine Learning with PyTorch and Scikit-Learn

university ai deep educators assistant professor statistics machine learning grid wisconsin madison tensorflow pytorch scikit learn

Advancing Hands-On Machine Learning Education with Sebastian Raschka - #565

This Week in Machine Learning & Artificial Intelligence (AI) Podcast

Play Episode Listen Later Mar 28, 2022 39:56

Today we're joined by Sebastian Raschka, an assistant professor at the University of Wisconsin-Madison and lead AI educator at Grid.ai. In our conversation with Sebastian, we explore his work around AI education, including the “hands-on” philosophy that he takes when building these courses, his recent book Machine Learning with PyTorch and Scikit-Learn, his advise to beginners in the field when they're trying to choose tools and frameworks, and more. We also discuss his work on Pytorch Lightning, a platform that allows users to organize their code and integrate it into other technologies, before switching gears and discuss his recent research efforts around ordinal regression, including a ton of great references that we'll link on the show notes page below! The complete show notes for this episode can be found at twimlai.com/go/565

university ai hands machine learning advancing grid wisconsin madison pytorch scikit learn learning education

Episode 114 - Puget Systems Podcast w/ Special Guest: Harrison "sentdex" Kinsley - Scientific Computing & Python Programming Expert

Puget Systems Podcast

Play Episode Listen Later Mar 18, 2022 71:33

This week we have scientific computing educator extraordinaire, Harrison Kinsley on the show! Harrison is the founder of multiple businesses, all of which leverage the Python programming language. From using Flask web development on all of his business sites to Scikit Learn and TensorFlow for machine learning and data analysis with Ensmo.com to the Natural Language Toolkit for natural language processing with Sentdex.com to teaching a massive variety of Python programming topics on PythonProgramming.net -- Python and programming is a major part of Harrison's life and work. Harrison likes to learn and build with technology which he teaches on his YouTube channel with over a million subscribers. Harrison believes programming is a superpower, and the social impact of making programming education easily accessible to anyone is one of the most important things he can do with his life. Harrison's Python Website: www.pythonprogramming.net Harrison's Python YouTube: www.youtube.com/c/sentdex Harrison's Twitter: www.twitter.com/sentdex Harrison's Instagram: www.instagram.com/sentdex Who is Puget Systems? Puget Systems is based in the Seattle suburb of Auburn, WA, and specializes in high-performance, custom-built computers. We believe that computers should be a pleasure to purchase and own. They should get your work done, and not be a hindrance. Our goal is to provide each client with the best possible computer for their needs and budget. Learn more about Puget Systems: www.pugetsystems.com Learn more about our Scientific Computing Solutions: www.pugetsystems.com/solutions/scientific/index.php --- Send in a voice message: https://anchor.fm/puget-systems/message

seattle wa programming auburn python tensorflow flask scikit learn scientific computing puget systems

#356: Tips for ML / AI startups

Talk Python To Me - Python conversations for passionate developers

Play Episode Listen Later Mar 14, 2022 66:27

Have you been considering launching a product or even a business based on Python's AI / ML stack? We have a great guest on the episode this week, Dylan Fox, who is the cofounder of AssemblyAI and has been building his startup successfully over the past few years. He has interesting stories of 100s of GPUs in the cloud, evolving ML models, and much more that I know you'll enjoy hearing. Links from the show Dylan Twitter: @YouveGotFox AssemblyAI: assemblyai.com TensorFlow: tensorflow.org PyTorch: pytorch.org hugging face: huggingface.co SciKit-Learn: scikit-learn.org GeForce Card: nvidia.com pLS: twitter.com This journalist's Otter.ai scare is a reminder that cloud transcription isn't completely private: theverge.com Programming language trends: insights.stackoverflow.com Can My Water Cooled Raspberry Pi Cluster Beat My MacBook?: the-diy-life.com PyTorch vs TensorFlow in 2022: assemblyai.com/blog/pytorch-vs-tensorflow Watch this episode on YouTube: youtube.com Episode transcripts: talkpython.fm --- Stay in touch with us --- Subscribe on YouTube: youtube.com Follow Talk Python on Twitter: @talkpython Follow Michael on Twitter: @mkennedy Sponsors Stack Overflow Sentry Error Monitoring, Code TALKPYTHON Talk Python Training

online training startups web software developers programming python data science ml online courses otter cloud computing ide gpus ai ml pls software developers web development stack overflow mongodb tensorflow nosql pytorch ml ai scikit learn pycharm dylan fox assemblyai talk python training python3 python2

Dok Talks #111 - Scheduled Scaling with Dask and Argo Workflows

Data on Kubernetes Community

Play Episode Listen Later Jan 19, 2022 65:30

https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Complex computational workloads in Python are a common sight these days, especially in the context of processing large and complex datasets. Battle-hardened modules such as Numpy, Pandas, and Scikit-Learn can perform low-level tasks, while tools like Dask makes it easy to parallelize these workloads across distributed computational environments. Meanwhile, Argo Workflows offers a Kubernetes-native solution to provisioning cloud resources in Kubernetes and triggering workflows on a regular schedule. Being Kubernetes-native, Argo Workflows also meshes nicely with other Kubernetes tools. This talk discusses the combination of these two worlds by showcasing a set-up for Argo-managed workflows which schedule and automatically scale-out Dask-powered data pipelines in Python. BIO Former academic in the field of renewable energy simulation and energy systems analysis. Currently responsible for architecting and maintaining the cloud- and data strategy at ACCURE Battery Intelligence KEY TAKE-AWAYS FROM THE TALK Argo Workflows + Dask is a nice combination for data-processing pipelines. There are a a few "gotchyas" to be on the look-out for, but in nevertheless this is still a generally-applicable and powerful combination. https://github.com/sevberg

battle scaling python scheduled pandas argo workflows kubernetes das k numpy scikit learn

#034 ML Валерий Бабушкин. О карьере, ML в бизнесе и гранд-мастерстве Kaggle

Machine Learning Podcast

Play Episode Listen Later Nov 29, 2021 48:38

Думаю, для тех, кто следит за индустрией искусственного интеллекта, Валерий Бабушкин в особом представлении не нуждается. Специалист высочайшего класса, за которым охотятся самые крутые технологические компании! И, вот, сегодня Валерий в гостях Machine Learning Podcast. Поговорили о карьерном пути Валерия, о том, как бизнесу внедрять машинное обучение в свои процессы, какие книги читать, чтобы держать себя в тонусе, зачем фейсбук борется с альтернативными ватсап-клиентами, зачем нужен Kaggle в карьере дата-сайентиста и еще много о чем! Ссылки выпуска: Реферальная программа Intel, по которой можно получить 70000 рублей за каждого трудоустроенного, по вашей рекомендации, сотрудника в компанию (http://career.intel.com/tp/rj6_qk7Te_e_K) Вакансии компании Intel (http://career.intel.com/tp/rj6_itirV.e_K) Kaggle (https://www.kaggle.com/) Книга "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow" (https://www.oreilly.com/library/view/hands-on-machine-learning/9781492032632/) Книга "The Minimum Description Length Principle" (https://mitpress.mit.edu/books/minimum-description-length-principle) Книга "Нетрадиционные методы многомерного статистического анализа" (https://www.biznesbooks.com/books/ekonomika/efron-b-netradicionnye-metody-mnogomernogo-statisticheskogo-analiza) Буду благодарен за обратную связь! Поддерживайте подкаст на Patreon (https://www.patreon.com/machinelearningpodcast) Оставляйте ваши комментарии там, где можно. Например, в Apple Podcasts. Они помогут сделать подкаст лучше! Напишите что вам было понятно, что не очень, какие темы раскрыть, каких гостей пригласить, ну, и вообще в какую сторону катить этот подкаст :) Подписывайтесь на телеграм-канал "Стать специалистом по машинному обучению" (https://t.me/toBeAnMLspecialist) Телеграм автора подкаста (https://t.me/kmsint) Со мной также можно связаться по электронной почте: kms101@yandex.ru Также теперь подкаст можно найти на YouTube (https://www.youtube.com/channel/UCzvfXLNpB2Bbf32dc7a8oDQ?) и Яндекс.Музыке https://music.yandex.ru/album/9781458

intel tensorflow keras kaggle scikit learn

Python 3.10

Python Podcast

Play Episode Listen Later Oct 27, 2021 92:55

Gemischte Dinge. Unter anderem Python 3.10. Jochen und Dominik haben sich mit dieser Episode etwas länger Zeit gelassen. Viel Kram zu tun. Das wird wieder besser. Versprochen. Shownotes Unsere E-Mail für Fragen, Anregungen & Kommentare: hallo@python-podcast.de News aus der Szene - Der Python 3.10 Release Stream - Official Python 3.10 Release - PEP 0617 zum neuen PEG-Parser - Helge Schneider über Werbung - Tribute to Sebastian Ramírez: - FastAPI - SQLModel - Typer - Buch Elixir in Action - Inkrementelles black: darker - Guido zur Zukunft von Python. - Ein Struct in C. - High performance code execution engine: Python-piston. - RustPython - Django 4.0 - JavaScript Fetch in den MDN Web Docs und im Modern JavaScript Tutorial - SciKit Learn 1.0 - Fail2Ban CVE - Azure OMIGOD - Twitch Breach - Jochens Stream zur Implementierung eines Naive Bayes Spamfilters - htmx - high power tools for HTML - The Asset Pipeline in ruby on rails Picks - django-upgrade - textual / rich - humanize

#65 scikit-learn 1.0のリリースノートを見ながら気になったポイントを喋る

Work In Progress

Play Episode Listen Later Oct 3, 2021 30:45

こんにちは！今回は、scikit-learn 1.0がリリースされたので、その内容について話しました！ ======================== ◎関連リンク・scikit-learn 1.0 リリースノート https://scikit-learn.org/stable/auto_examples/release_highlights/plot_release_highlights_1_0_0.html https://scikit-learn.org/dev/whats_new/v1.0.html ・バチェラー始まる https://twitter.com/BachelorJapan/status/1443893459471962134 ・トド英語が待望のリリース https://apps.apple.com/JP/app/id1561705273?mt=8 ======================== ◎質問や感想、扱って欲しいトピックなど、お気軽にコメントお待ちしております. ・Googleフォーム http://urx.space/VNOD ・Twitterのハッシュタグ https://twitter.com/hashtag/wipfm ・Twitterアカウント https://twitter.com/wipfm0509 ======================== ◎"Work In Progress"とは機械学習エンジニアの@takapyとデータアナリストの@yaginuuunが、テクノロジーやキャリア、ビジネスなどの話題についてカジュアルに話すPodcast番組です. 公式サイト: https://bit.ly/2UbGXIX @takapy: https://twitter.com/takapy0210 @yaginuuun: https://twitter.com/yaginuuun See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

google jp work in progress scikit learn

#250 skorch your scikit-learn together with PyTorch

Python Bytes

Play Episode Listen Later Sep 15, 2021 41:52

Watch the live stream: Watch on YouTube About the show Sponsored by us: Check out the courses over at Talk Python And Brian's book too! Special guest: Prayson Daniel Brain #1: Exciting New Ways To Be Told That Your Python Code is Bad Two new pylint errors consider-ternary-expression if condition(): x = 4 else: x = 5 x = 4 if condition() else 5 while-used it unconditionally flags every use of while expressions. generally, while should be avoided. Michael #2: GitHub Readme Stats via Роман Великий Dynamically generated stats for your github readmes This are for your repo or your stats (others too I suppose) posted somewhere outside of github Card for a project: https://github-readme-stats.vercel.app/api/pin/?username=mikeckennedy&repo=python-switch Card for a user: https://github-readme-stats.vercel.app/api?username=mikeckennedy&show_icons=true&theme=radical Card for your languages: https://github-readme-stats.vercel.app/api/top-langs/?username=mikeckennedy&repo=python-switch Prayson #3: Nox Nox appeared as “footnotes” in Episodes 182 and 248 (Hypermodern Python …) It does tox what invoke did (substituting GNU Make) Brian #4: Two tools for dealing with text python-easyfrontmatter - a small package to load and parse files (or just text) with YAML (or JSON, TOML or other) front matter. >>> post = frontmatter.load('tests/yaml/hello-world.txt') >>> print(post['title']) Hello, world! Tried it with a helper script I'm using with Hugo, and it parses Hugo metadata in blog posts like a dream. ftfy - fixes text for you “Take in bad Unicode and output good Unicode” >>> import ftfy >>> ftfy.fix_text('âœ” No problems') '✔ No problems' Michael #5: MPIRE (MultiProcessing Is Really Easy) A Python package for easy multiprocessing, but faster than multiprocessing It combines the convenience of map like functions of multiprocessing.Pool with the benefits of using copy-on-write shared objects of multiprocessing.Process, together with easy-to-use worker state, worker insights, and progress bar functionality. Many features Requisite shoutout to unsync too. Prayson #6: skorch Going deep learning with scikit-learn pipelines (Breaking limits of multi-layer perceptron (MLP)) Using PyTorch, skorch provides an API to extend neural networks models in scikit-learn. Example: Penguins Classification shameless Gist Extras Michael vim + jupyter, via Marco Gorelli PyBay talk Prayson python-decouple Joke: Adoption

Scikit-learn: Machine Learning in Python

Papers Read on AI

Play Episode Listen Later Jul 25, 2021 11:28

Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing machine learning to non-specialists using a general-purpose high-level language. 2011: Fabian Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, Mathieu Blondel, Gilles Louppe, P. Prettenhofer, Ron Weiss, Ron J. Weiss, J. Vanderplas, Alexandre Passos, D. Cournapeau, M. Brucher, M. Perrot, E. Duchesnay Keywords: Machine learning, Documentation, Python, scikit-learn, Unsupervised learning, High-level programming language, High- and low-level, Usability, BSD, Binary file, Hypertext Transfer Protocol, Consistency model, Application programming interface, Algorithm, General-purpose markup language https://arxiv.org/pdf/1201.0490v4.pdf

consistency application algorithms machine learning weiss python documentation binary usability unsupervised bsd perrot grisel scikit learn scikit thirion

Scikit-learn: Machine Learning in Python

Papers Read on AI

Play Episode Listen Later Jul 23, 2021 11:28

Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing machine learning to non-specialists using a general-purpose high-level language. 2011: Fabian Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, Mathieu Blondel, Gilles Louppe, P. Prettenhofer, Ron Weiss, Ron J. Weiss, J. Vanderplas, Alexandre Passos, D. Cournapeau, M. Brucher, M. Perrot, E. Duchesnay Keywords: Machine learning, Documentation, Python, scikit-learn, Unsupervised learning, High-level programming language, High- and low-level, Usability, BSD, Binary file, Hypertext Transfer Protocol, Consistency model, Application programming interface, Algorithm, General-purpose markup language https://arxiv.org/pdf/1201.0490v4.pdf

consistency application algorithms machine learning weiss python documentation binary usability unsupervised bsd perrot grisel scikit learn scikit thirion

112 データ分析コンペで獲得した賞金の税金

regonn&curry.fm

Play Episode Listen Later Feb 1, 2021 36:37

話した内容ScrapBox このポッドキャストでは、Kaggleを中心としたデータサイエンスに関連する情報を配信していきます。今回は、データ分析コンペの賞金の税金、Scikit Learn 0.24、SIGNATE金融コンペ、Notion について話しています。

notion kaggle scikit learn

Chris Sanchez: Data Science-related Master's Degrees vs. Bootcamps | The Data Canteen #02

The Data Canteen

Play Episode Listen Later Jan 21, 2021 83:38

So, you've decided that you're ready to take the plunge and seriously upskill into data science & machine learning…but should you do a master's degree or a boot camp? In this episode, I'm joined by Chris Sanchez, a retired Navy SEAL and fellow member of the Veterans in Data Science & Machine Learning community. The conversation starts with an overview of his journey through the military and into data science. Chris' unique perspective extends beyond just his background with Special Operations Forces (SOF). He's also rather rare in the sense that he attended both a data science graduate program (UC Berkley's Master of Information and Data Science) and a data science boot camp (Galvanize Data Science Immersive Bootcamp). Consequently, Chris and I dive deep into a candid compare & contrast of his experiences with both learning approaches, where each excelled, and where they didn't. Chris starts the comparison by telling us about the best qualities of each training approach. Then, we zoom in to continue the comparison, looking at seven key facets: - Cost - Duration - Financing - Prerequisites - Pedagogy - Networking - Career Support Services The episode wraps-up with Chris sharing some of his favorite data science & ML learning resources: - DATAQUEST: https://www.dataquest.io/ - Learn Python 3 the Hard Way: https://www.amazon.com/Learn-Python-Hard-Way-Introduction-ebook/dp/B07378P8W6 - Andrew Ng's Stanford CS229 – Machine Learning: https://see.stanford.edu/Course/CS229 - Hands-on Machine Learning with Scikit-Learn, Keras & TensorFlow: https://www.amazon.com/Hands-Machine-Learning-Scikit-Learn-TensorFlow/dp/1492032646 - Fast.ai: https://www.fast.ai/ - Practical Deep Learning for Coders (Fast.ai's MOOC): https://course.fast.ai/ Go here to connect with Chris on LinkedIn (be sure to include a short contextual note with your connection request!): https://www.linkedin.com/in/excellenceisahabit/ Go here to see Chris' GitHub portfolio: https://americanthinker.github.io/ There is so much packed into this episode that it's impossible to put it all in the show notes. One recommendation that I do have is that you listen to this podcast, then listen again while taking notes. =================================== Have a question for me? Leave me a voicemail or send me a message at: http://www.vetsindatascience.com/thedatacanteen The Data Canteen's parent organization: http://www.vetsindatascience.com/ Join the Veterans in Data Science & Machine Learning Community on LinkedIn: https://www.linkedin.com/groups/8989903/ SUPPORT THE DATA CANTEEN (LIKE PBS, WE'RE LISTENER SUPPORTED!): Donate: https://vetsindatascience.com/support-join

master veterans machine learning navy seals degrees data science ml github hard way bootcamps mooc canteen uc berkley scikit learn special operations forces sof chris sanchez learn python practical deep learning

Episode 11: ML 011: History of AI in the UK with Laurence Moroney

Adventures in Machine Learning

Play Episode Listen Later Oct 27, 2020 55:48

In this episode of Adventures in Machine Learning, the panelists chat with Laurence Moroney about the history of AI in the UK. We talk about the AI overlords, ethics, and the need for teaching. Industry revolutions and how we can adapt them to improve life. Laurence is working on new synthetic datasets so tune in and check it out! Sponsors Machine Learning for Software Engineers by Educative.io Audible.com CacheFly Panel Charles Max Wood Gant Laborde Jason Mayes Daniel Svoboda Guest Laurence Moroney Picks Jason Mayes: TensorFlow.js Show and Tell on Youtube Gant Laborde: https://www.tensorflowtictactoe.co/ Daniel Svoboda: Youtube: Kaggle Laurence Moroney: Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems The Manga Guide to Machine Learning Follow Adventures in Machine Learning on Twitter > @podcast_ml

history ai uk tools adventures panel audible machine learning laurence software engineers cid tensorflow 3e moroney cachefly afid scikit learn adid charles max wood gant laborde

Ep. 18 - Ferramentas para acelerar o desenvolvimento dos modelos - Dia 18 - Desafio Mestres do Scikit-learn

Mario Filho - Data Science/Machine Learning

Play Episode Listen Later Mar 8, 2020 8:25

Neste episódio compartilho o que aprendi lendo a documentação do Scikit-learn sobre pipelines e composições de estimadores. Se você quiser aprender a habilidade mais importante para trabalhar com Data Science, acesse http://CursoDeDataScience.com Juntei todas as minhas dicas de machine learning num e-book. Conheça http://www.ManualDeDataScience.com Me siga no Instagram e receba dicas exclusivas: http://www.instagram.com/mariofilhoml

neste conhe data science desenvolvimento modelos desafio ferramentas mestres acelerar scikit learn scikit

21: Why Scikit-Learn and Keras are Awesome for ML

The Data Life Podcast

Play Episode Listen Later Jan 26, 2020 19:54

In this episode, we talk about why the two libraries Scikit-Learn and Keras are great for machine learning. These two libraries combined with Pandas form the 3 core libraries in Python for a data scientist today. We cover things like: 1) Data Exploration and data cleaning - how Pandas and Jupyter notebooks provide a good way to get started here. 2) Data Transformation - how Scikit-Learn provides many useful functions like train_test_split, Scalers, PCA etc. 3) Data Fitting - how Scikit-Learn provides good shallow models and Keras provides great support to quickly get started with neural networks. We also cover various tidbits on things to take note in building ML pipelines and preparing models to be deployed in production, so tune into the episode to find out! Fantastic Resources: 1) Book by head of Youtube DS team Aurelien Geron: https://www.amazon.com/dp/1492032646/?tag=omnilence-20 This is one of the best book I have read on this topic as it covers practical tips incl. Scikit-Learn API etc. 2) Developing Scikit-Learn estimators: https://scikit-learn.org/stable/developers/develop.html 3) Guide to Keras Sequential API: https://keras.io/getting-started/sequential-model-guide/ 4) Guide to Keras Functional API: https://keras.io/getting-started/functional-api-guide/ 5) My previous episode on Pandas: https://podcasts.apple.com/us/podcast/17-why-pandas-is-the-new-excel/id1453716761?i=1000454831790 Thanks for listening! Please consider supporting this podcast from the link in the end. --- Send in a voice message: https://anchor.fm/the-data-life-podcast/message Support this podcast: https://anchor.fm/the-data-life-podcast/support

guide python ml pandas pca keras jupyter data transformation scikit learn scalers

Finding Community in Data Science with Reshama Shaikh, key scikit-Learn sprint organizer

The Banana Data Podcast

Play Episode Listen Later Jan 3, 2020 24:36

Now that we've covered how open source works, we're looking to pull back the curtain and see who's actually contributing. In part 2/2 of our series on open source, we sat down with Reshama Shaikh, a statistician and key organizer of scikit-Learn sprints, to learn about the ups & downs of open source contributing, as well how a Sprint in Nairobi benefits Fortune 500 companies in the US. Reshama Shaikh is an independent data scientist/statistician and MBA with skills in Python, R and SAS. I worked for over 10 years as a biostatistician in the pharmaceutical industry.Further Reading: Stack Overflow Developer Survey; Open Source Contributors: https://insights.stackoverflow.com/survey/2019#developer-profile-_-contributing-to-open-sourceHow to Organize a Scikit Learn Spring by Reshama Shaikh: https://reshamas.github.io/how-to-organize-a-scikit-learn-sprint/Reshama Shaikh's Website: https://reshamas.github.io/Contributing to scikit-Learn: https://scikit-learn.org/stable/developers/contributing.htmlGitter scikit-Learn: https://gitter.im/scikit-learn/scikit-learnscikitLearn Mailing List: https://mail.python.org/mailman/listinfo/scikit-learnJoin Dataiku's Paris scikit-Learn sprint this January: https://github.com/scikit-learn/scikit-learn/wiki/Paris-scikit-learn-Sprint-of-the-Decade

Why Open Source? feat. Andreas Mueller, a Core Contributor of scikit-Learn

The Banana Data Podcast

Play Episode Listen Later Dec 20, 2019 27:26

Open Source software such as scikit-Learn, Python, and Spark form the backbone of data science. In a two-part series, we're covering the ins and outs of open source - and how this special type of software supports 98% of enterprise-level companies' data science efforts.In part 1, we're chatting with Andreas Mueller, a core contributor of scikit-Learn aboutthe value in open source versus corporate software, and what it looks like to run and govern this type of community-written (and driven) project.Join our Paris scikit-Learn sprint this January: https://github.com/scikit-learn/scikit-learn/wiki/Paris-scikit-learn-Sprint-of-the-DecadeAndreas Mueller is a lecturer at the Data Science Institute at Columbia University and author of the O'Reilly book “Introduction to Machine Learning with Python”, describing a practical approach to machine learning with python and scikit-learn. He is one of the core developers of the scikit-learn machine learning library, and he has been co-maintaining it for several years. He is also a Software Carpentry instructor. In the past, he worked at the NYU Center for Data Science on open source and open science, and as Machine Learning Scientist at Amazon. You can find his full cv here. His mission is to create open tools to lower the barrier of entry for machine learning applications, promote reproducible science and democratize the access to high-quality machine learning algorithms.

Podcasts about scikit learn

Best podcasts about scikit learn

Mario Filho - Data Science/Machine Learning

Talk Python To Me - Python conversations for passionate developers

Embedded

Practical AI

GOTO - Today, Tomorrow and the Future

Changelog Master Feed

Sample Space

Papers Read on AI

The Banana Data Podcast

Latest news about scikit learn

Latest podcast episodes about scikit learn

Emergent Intelligence: Sanjay Krishnan On AI's Rapid Evolution

#314.src - Au delà de la prédiction: Pousser les Limites de l'IA avec Vincent Maladiere

scikit-learn & data science you own

scikit-learn & data science you own (Practical AI #296)

#16 - Innovation, open source et impact social : le parcours inspirant de Gaël Varoquaux

What it is like to maintain the scikit-learn docs

Pausing to think about scikit-learn & OpenAI o1

Pausing to think about scikit-learn & OpenAI o1 (Practical AI #287)

Working as a Core Developer in the Scikit-Learn Universe - Guillaume Lemaître

Model safety, that's a pickle! with Adrin Jalali - scikit-learn maintainer

A career in data science: what is it, who is it for and how do you get into it? - with Nicholas Cifuentes of WorldQuant University

Cientista de Dados Sênior em Lisboa, Portugal - Dev Sem Fronteiras #137

[EN] scikit-learn: Software is People

Probabl is a new AI company built around popular library scikit-learn

737: scikit-learn's Past, Present and Future, with scikit-learn co-founder Dr. Gaël Varoquaux

Machine Learning in Elixir vs. Python, SQL, and Matlab with Katelynn Burns & Alexis Carpenter

LO SPORCO SEGRETO di scikit-learn che probabilmente ha fatto perdere MILIONI a svariate compagnie

LO SPORCO SEGRETO di scikit-learn che probabilmente ha fatto perdere MILIONI a svariate compagnie

AI Today Podcast: AI Glossary Series – Machine Learning Tools: Keras, PyTorch, Scikit Learn, TensorFlow, Apache Spark, Kaggle

#71: Alex O'Connor — Transformers, Generative AI, and the Deep Learning Revolution

PNE 004: PyCon US - Carla Florida

Sebastian Raschka: Learning ML, Responsible AI, AGI | Learning from Machine Learning #4

#419: Free AI Lab (ft Dr Mike Pound of Computerphile fame)

Making Machine Learning more accessible | Sebastian Raschka

Emmanuel Turlay, Founder and CEO of Sematic and machine learning pioneer, discusses what's required to turn every software engineer into an ML engineer

Marko Plahuta o analizah slovenskega koščka Twitterja in Mastodonu

Machine Learning for Autonomous Vehicles • Oscar Beijbom & Prayson Daniel

Sebastian Raschka: AI Education and Research

What's the Life Cycle of a Data Scientist? • Ekaterina Sirazitdinova, Prayson Daniel & Nicholai Stålung

Streaming data with ease. With Chip Kent from Deephaven Data Labs (Ep. 198)

Kubeflow for Machine Learning • Holden Karau & Adi Polak

Practical Machine Learning and Deep learning

GRID.ai's Lead AI Educator Sebastian Raschka

Advancing Hands-On Machine Learning Education with Sebastian Raschka - #565

Episode 114 - Puget Systems Podcast w/ Special Guest: Harrison "sentdex" Kinsley - Scientific Computing & Python Programming Expert

#356: Tips for ML / AI startups

Dok Talks #111 - Scheduled Scaling with Dask and Argo Workflows

#034 ML Валерий Бабушкин. О карьере, ML в бизнесе и гранд-мастерстве Kaggle

Python 3.10

#65 scikit-learn 1.0のリリースノートを見ながら気になったポイントを喋る

#250 skorch your scikit-learn together with PyTorch

Scikit-learn: Machine Learning in Python

Scikit-learn: Machine Learning in Python

112 データ分析コンペで獲得した賞金の税金

Chris Sanchez: Data Science-related Master's Degrees vs. Bootcamps | The Data Canteen #02

Episode 11: ML 011: History of AI in the UK with Laurence Moroney

Ep. 18 - Ferramentas para acelerar o desenvolvimento dos modelos - Dia 18 - Desafio Mestres do Scikit-learn

21: Why Scikit-Learn and Keras are Awesome for ML

Finding Community in Data Science with Reshama Shaikh, key scikit-Learn sprint organizer

Why Open Source? feat. Andreas Mueller, a Core Contributor of scikit-Learn