Machine Learning Guide

Teaches the high level fundamentals of machine learning and artificial intelligence. I teach basic intuition, algorithms, and math. I discuss languages and frameworks, deep learning, and more. Audio may seem inferior, but it's a great supplement during exercise/commute/chores. Where your other resou…

OCDevel

Feb 26, 2026 LATEST EPISODE
monthly NEW EPISODES
38m AVG DURATION
68 EPISODES
1 SEASONS

4.9 from 734 ratings Listeners of Machine Learning Guide that love the show mention: machine learning, ml, coursera, thank you tyler, tyler's, nlp, repetition, great real, math, complement, engineering, supplement, overview, scientist, programming, high level, also appreciate, concepts, computer, beginners.

Latest episodes from Machine Learning Guide

001 Introduction

Play Episode Listen Later Jan 31, 2017 12:35

learning ai intelligence artificial teaches ml

002 What is AI / ML

Play Episode Listen Later Feb 8, 2017 32:05

What is artificial intelligence and machine learning? What's the difference? How about compared to statistics and data science? AI history. ocdevel.com/mlg/2 for notes and resources

learning ai intelligence artificial ml ai ml

MLG 002 What is AI, ML, DS

Play Episode Listen Later Feb 9, 2017 64:10

Show notes at ocdevel.com/mlg/2. Updated! Skip to [00:29:36] for Data Science (new content) if you've already heard this episode.
What is artificial intelligence, machine learning, and data science? What are their differences? AI history.
Hierarchical breakdown: DS(AI(ML)). Data science: any profession dealing with data (including AI & ML). Artificial intelligence is simulated intellectual tasks. Machine learning is algorithms trained on data to learn patterns to make predictions.
Artificial Intelligence (AI) - Wikipedia. Oxford Languages: the theory and development of computer systems able to perform tasks that normally require human intelligence, such as visual perception, speech recognition, decision-making, and translation between languages. AlphaGo Movie, very good!
Sub-disciplines: reasoning and problem solving; knowledge representation; planning; learning; natural language processing; perception; motion and manipulation; social intelligence; general intelligence.
Applications: autonomous vehicles (drones, self-driving cars); medical diagnosis; creating art (such as poetry); proving mathematical theorems; playing games (such as Chess or Go); search engines; online assistants (such as Siri); image recognition in photographs; spam filtering; prediction of judicial decisions; targeting online advertisements.
Machine Learning (ML) - Wikipedia. Oxford Languages: the use and development of computer systems that are able to learn and adapt without following explicit instructions, by using algorithms and statistical models to analyze and draw inferences from patterns in data.
Data Science (DS) - Wikipedia. Wikipedia: Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from noisy, structured and unstructured data, and apply knowledge and actionable insights from data across a broad range of application domains. Data science is related to data mining, machine learning and big data.
History:
Greek mythology, Golems.
First attempt: Ramon Llull, 13th century.
Da Vinci's walking animals.
Descartes, Leibniz.
1700s-1800s: Statistics & mathematical decision making. Thomas Bayes: reasoning about the probability of events. George Boole: logical reasoning / binary algebra. Gottlob Frege: Propositional logic.
1832: Charles Babbage & Ada Byron / Lovelace: designed Analytical Engine (1832), programmable mechanical calculating machines.
1936: Universal Turing Machine. Turing's later paper Computing Machinery and Intelligence (1950) explored AI!
1946: John von Neumann Universal Computing Machine.
1943: Warren McCulloch & Walter Pitts: cogsci rep of neuron; Frank Rosenblatt uses it to create the Perceptron (-> neural networks by way of MLP).
50s-70s: "AI" coined at the Dartmouth workshop, 1956, with the goal to simulate all aspects of intelligence. John McCarthy, Marvin Minsky, Arthur Samuel, Oliver Selfridge, Ray Solomonoff, Allen Newell, Herbert Simon. Newell & Simon: Heuristics -> Logic Theorist, General Problem Solver. Selfridge: Computer Vision. NLP. Stanford Research Institute: Shakey. Feigenbaum: Expert systems. GOFAI / symbolism: operations research / management science; logic-based; knowledge-based / expert systems.
70s: Lighthill report (James Lighthill), big promises -> AI Winter.
90s: Data, Computation, Practical Application -> AI back.
Connectionism optimizations: Geoffrey Hinton: 2006, optimized back propagation.
Bloomberg: 2015 was a whopper for AI in industry.
AlphaGo & DeepMind.

003 Inspiration

Play Episode Listen Later Feb 9, 2017 17:41

Why should you care about AI? Inspirational topics about economic revolution, the singularity, consciousness, and fear. ocdevel.com/mlg/3 for notes and resources

learning ai inspiration intelligence artificial ml

004 Algorithms - Intuition

Play Episode Listen Later Feb 11, 2017 21:54

Overview of machine learning algorithms. Infer/predict, error/loss, train/learn. Supervised, unsupervised, reinforcement learning. ocdevel.com/mlg/4 for notes and resources

learning ai intuition intelligence artificial algorithms ml supervised infer

005 Linear Regression

Play Episode Listen Later Feb 15, 2017 33:40

Introduction to the first machine-learning algorithm, the 'hello world' of supervised learning - Linear Regression ocdevel.com/mlg/5 for notes and resources

learning ai intelligence artificial ml linear regression

006 Certificates & Degrees

Play Episode Listen Later Feb 16, 2017 15:37

Discussion on certificates and degrees from Udacity to a Masters degree. ocdevel.com/mlg/6 for notes and resources

learning ai masters intelligence artificial degrees certificates ml udacity

007 Logistic Regression

Play Episode Listen Later Feb 18, 2017 34:20

Your first classifier: Logistic Regression. That plus Linear Regression, and you're a 101 supervised learner! ocdevel.com/mlg/7 for notes and resources

learning ai intelligence artificial ml linear regression logistic regression

008 Math

Play Episode Listen Later Feb 22, 2017 27:24

Introduction to the branches of mathematics used in machine learning. Linear algebra, statistics, calculus. ocdevel.com/mlg/8 for notes and resources

learning ai math intelligence artificial ml linear

009 Deep Learning

Play Episode Listen Later Mar 3, 2017 51:09

Deep learning and neural networks. How to stack our logistic regression units into a multi-layer perceptron. ocdevel.com/mlg/9 for notes and resources

learning ai deep intelligence artificial ml deep learning

010 Languages & Frameworks

Play Episode Listen Later Mar 6, 2017 44:17

Languages & frameworks comparison. Languages: Python, R, MATLAB/Octave, Julia, Java/Scala, C/C++. Frameworks: Hadoop/Spark, Deeplearning4J, Theano, Torch, TensorFlow. ocdevel.com/mlg/10 for notes and resources

learning ai intelligence artificial languages cc ml torch frameworks tensorflow theano

011 Checkpoint

Play Episode Listen Later Mar 7, 2017 7:45

Checkpoint - start learning the material offline! ocdevel.com/mlg/11 for notes and resources

learning ai intelligence artificial ml checkpoint

012 Shallow Algos 1

Play Episode Listen Later Mar 19, 2017 53:17

Speed-run of some shallow algorithms: K Nearest Neighbors (KNN); K-means; Apriori; PCA; Decision Trees ocdevel.com/mlg/12 for notes and resources

learning ai speed intelligence artificial ml shallow pca algos decision trees

013 Shallow Algos 2

Play Episode Listen Later Apr 9, 2017 55:13

Speed run of Support Vector Machines (SVMs) and Naive Bayes Classifier. ocdevel.com/mlg/13 for notes and resources

learning ai speed intelligence artificial ml shallow algos

014 Shallow Algos 3

Play Episode Listen Later Apr 23, 2017 48:07

Speed run of Anomaly Detection, Recommenders (Content Filtering vs Collaborative Filtering), and Markov Chain Monte Carlo (MCMC). ocdevel.com/mlg/14 for notes and resources

learning ai speed intelligence artificial ml shallow algos anomaly detection markov chain monte carlo mcmc

015 Performance

Play Episode Listen Later May 7, 2017 41:24

Performance evaluation & improvement. ocdevel.com/mlg/15 for notes and resources

learning ai performance intelligence artificial ml

016 Consciousness

Play Episode Listen Later May 21, 2017 73:45

Can AI be conscious? ocdevel.com/mlg/16 for notes and resources

learning ai intelligence consciousness artificial ml

017 Checkpoint

Play Episode Listen Later Jun 4, 2017 7:00

Checkpoint - learn the material offline! ocdevel.com/mlg/17 for notes and resources

learning ai intelligence artificial ml checkpoint

018 Natural Language Processing 1

Play Episode Listen Later Jun 25, 2017 57:48

Introduction to Natural Language Processing (NLP) topics. ocdevel.com/mlg/18 for notes and resources

learning ai intelligence artificial ml natural language processing natural language processing nlp

019 Natural Language Processing 2

Play Episode Listen Later Jul 10, 2017 65:33

Natural Language Processing classical/shallow algorithms. ocdevel.com/mlg/19 for notes and resources

learning ai intelligence artificial ml natural language processing

020 Natural Language Processing 3

Play Episode Listen Later Jul 23, 2017 40:26

Natural Language Processing classical/shallow algorithms. ocdevel.com/mlg/20 for notes and resources

learning ai intelligence artificial ml natural language processing

021 New Series: Machine Learning Applied

Play Episode Listen Later Jul 27, 2017 1:50

Introducing a new podcast series on Patreon: Machine Learning Applied. ocdevel.com/mlg/21 for notes and resources

learning ai intelligence artificial machine learning applied ml new series

022 Deep NLP 1

Play Episode Listen Later Jul 28, 2017 49:21

Recurrent Neural Networks (RNNs) and Word2Vec. ocdevel.com/mlg/22 for notes and resources

learning ai deep intelligence artificial ml word2vec

023 Deep NLP 2

Play Episode Listen Later Aug 20, 2017 42:45

RNN review, bi-directional RNNs, LSTM & GRU cells. ocdevel.com/mlg/23 for notes and resources

learning ai deep intelligence artificial ml rnn rnns

024 Tech Stack

Play Episode Listen Later Oct 6, 2017 61:17

TensorFlow, Pandas, Numpy, Scikit-Learn, Keras, TensorForce. ocdevel.com/mlg/24 for notes and resources

learning ai intelligence artificial ml pandas tech stack tensorflow keras numpy scikit learn

025 Convolutional Neural Networks

Play Episode Listen Later Oct 30, 2017 44:21

Convnets or CNNs. Filters, feature maps, window/stride/padding, max-pooling. ocdevel.com/mlg/25 for notes and resources
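The window/stride/padding terms above determine how large a feature map comes out of a convolution or pooling layer. The sketch below is a minimal illustration of that arithmetic for one spatial dimension, not code from the episode; the function names `conv_output_size` and `max_pool_1d` are made up for this example.

```python
def conv_output_size(input_size: int, window: int, stride: int = 1, padding: int = 0) -> int:
    """Length of one spatial dimension after sliding a window over it.

    Standard formula: floor((input + 2 * padding - window) / stride) + 1.
    """
    if window > input_size + 2 * padding:
        raise ValueError("window is larger than the padded input")
    return (input_size + 2 * padding - window) // stride + 1


def max_pool_1d(values, window: int, stride: int):
    """Max-pooling over a 1-D list: keep the largest value in each window."""
    last_start = len(values) - window
    return [max(values[i:i + window]) for i in range(0, last_start + 1, stride)]


# A 3-wide filter with stride 1 and padding 1 preserves the input size.
same_size = conv_output_size(28, window=3, stride=1, padding=1)

# 2-wide max-pooling with stride 2 halves the size.
pooled = max_pool_1d([1, 3, 2, 5, 4, 0], window=2, stride=2)
```

The same formula applies independently to height and width, so a 28x28 image through the padded 3x3 filter stays 28x28, and 2x2 pooling with stride 2 reduces it to 14x14.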

learning ai cnn intelligence artificial ml filters convolutional neural networks

026 Project Bitcoin Trader

Play Episode Listen Later Jan 26, 2018 38:20

Community project & intro to Bitcoin/crypto + trading. ocdevel.com/mlg/26 for notes and resources

community learning ai project bitcoin intelligence artificial ml trader

027 Hyperparameters 1

Play Episode Listen Later Jan 27, 2018 46:09

Hyperparameters part 1: network architecture. ocdevel.com/mlg/27 for notes and resources

learning ai intelligence artificial ml

028 Hyperparameters 2

Play Episode Listen Later Feb 4, 2018 50:10

Hyperparameters part 2: hyper-search, regularization, SGD optimizers, scaling. ocdevel.com/mlg/28 for notes and resources

learning ai intelligence artificial ml sgd

029 Reinforcement Learning Intro

Play Episode Listen Later Feb 5, 2018 42:27

Introduction to reinforcement learning concepts. ocdevel.com/mlg/29 for notes and resources.

learning ai intelligence artificial ml reinforcement learning

MLA 001 Certificates & Degrees

Play Episode Listen Later May 24, 2018 11:20

Reboot of the MLG episode, with more confident recommendations.

ai artificial intelligence reboot machine learning degrees certificates data science mlg

MLA 002 Numpy & Pandas

Play Episode Listen Later May 24, 2018 17:26

Some numerical data nitty-gritty in Python.

ai artificial intelligence machine learning python data science pandas numpy

MLA 003 Storage: HDF, Pickle, Postgres

Play Episode Listen Later May 24, 2018 17:02

Comparison of different data storage options when working with your ML models.

ai artificial intelligence comparison machine learning storage data science ml pickle postgres

030 New Series: Machine Learning Applied

Play Episode Listen Later May 24, 2018 5:29

MLG: I'm rebooting this series to fix mistakes & add more shallows (Bayesian methods, Tree methods, etc). I'm adding Patreon rewards, including access to a new podcast series: Machine Learning Applied, discussing applied/practical 10-20m frequent episodes. ocdevel.com/mlg/30 for notes and resources

tree machine learning applied new series bayesian

MLA 004 Study Tips

Play Episode Listen Later May 28, 2018 6:45

Two tips that helped me the most while learning ML.

ai artificial intelligence machine learning data science ml study tips

MLA 005 Shapes & Sizes

Play Episode Listen Later Jun 9, 2018 26:30

Dimensions, size, and shape of Numpy ndarrays / TensorFlow tensors, and methods for transforming those.

ai artificial intelligence machine learning shapes data science dimensions sizes tensorflow numpy

MLA 006 Salary

Play Episode Listen Later Jul 19, 2018 18:52

Salary based on location, gender, age, tech... from O'Reilly.

ai artificial intelligence machine learning salary data science

MLA 007 Jupyter Notebooks

Play Episode Listen Later Oct 16, 2018 16:09

Run your code + visualizations in the browser: iPython / Jupyter Notebooks.

ai artificial intelligence run machine learning data science jupyter notebooks

MLA 008 Exploratory Data Analysis

Play Episode Listen Later Oct 26, 2018 24:23

EDA + charting. DataFrame info/describe, imputing strategies. Useful charts like histograms and correlation matrices.

ai artificial intelligence machine learning data science data analysis exploratory

MLA 009 Charting tools

Play Episode Listen Later Nov 6, 2018 24:00

matplotlib, Seaborn, Bokeh, D3, Tableau, Power BI, QlikView, Excel

ai tools artificial intelligence excel machine learning data science charting d3 tableau power bi bokeh seaborn qlikview

031 The Podcasts Return

Play Episode Listen Later Oct 28, 2020 7:57

The podcasts return with new content, especially about NLP: BERT, transformers, spaCy, Gensim, NLTK. Accompanied by a community project - Gnothi, a journal that uses AI to provide insights and resources. Website https://gnothiai.com, project https://github.com/lefnire/gnothi. Share the website on social media and email me a link/screenshot for free access to Machine Learning Applied for 3 months; contribute to the Github repository for free access for life.

learning ai podcasts intelligence artificial ml github accompanied spacy nltk

MLA 010 NLP packages: transformers, spaCy, Gensim, NLTK

Play Episode Listen Later Oct 28, 2020 25:33

NLTK: swiss army knife. Gensim: LDA topic modeling, n-grams. spaCy: linguistics. transformers: high-level business NLP tasks.

ai artificial intelligence nlp transformers machine learning data science packages spacy nltk

MLA 011 Practical Clustering

Play Episode Listen Later Nov 8, 2020 34:08

Kmeans (sklearn vs FAISS), finding n_clusters via inertia/silhouette, Agglomerative, DBSCAN/HDBSCAN

ai practical artificial intelligence machine learning data science clustering k means

032 Cartesian Similarity Metrics

Play Episode Listen Later Nov 8, 2020 42:28

Share Gnothi on social media and email me a screenshot/link for 3-month access to Machine Learning Applied; commit code to the Github repository for life-access.
Normed distances: A norm is a function that assigns a strictly positive length to each non-zero vector in a vector space.
Minkowski is the generalized form: (sum(abs(xi-yi)^p))^(1/p), with p = 1, 2, ... giving the cases below.
L1: Manhattan/city-block/taxicab. abs(x2-x1)+abs(y2-y1). Grid-like distance (triangle legs). Preferred for high-dim space.
L2: Euclidean. sqrt((x2-x1)^2+(y2-y1)^2), i.e. the square root of the difference vector's dot product with itself. Straight-line distance; min distance (hypotenuse of the Pythagorean triangle).
Others: Mahalanobis, Chebyshev (p=inf), etc.
Dot product: A type of inner product. Outer-product: lies outside the involved planes. Inner-product: dot product lies inside the planes/axes involved. Dot product: inner product on a finite dimensional Euclidean space.
Cosine (normalized dot)
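As a rough illustration of the metrics listed in these notes, here is a small pure-Python sketch. It is not code from the episode; the helper names (`minkowski`, `chebyshev`, `dot`, `cosine_similarity`) are chosen for this example, and in practice a library such as NumPy or SciPy would normally be used instead.

```python
import math


def minkowski(x, y, p):
    """Minkowski distance: (sum(|xi - yi|^p))^(1/p). p=1 is Manhattan, p=2 is Euclidean."""
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1 / p)


def chebyshev(x, y):
    """Limit of Minkowski as p -> infinity: the largest per-axis difference."""
    return max(abs(a - b) for a, b in zip(x, y))


def dot(x, y):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(x, y))


def cosine_similarity(x, y):
    """Dot product normalized by both vector lengths; depends on angle only."""
    return dot(x, y) / (math.sqrt(dot(x, x)) * math.sqrt(dot(y, y)))


origin, point = (0.0, 0.0), (3.0, 4.0)
manhattan = minkowski(origin, point, 1)   # triangle legs: 3 + 4
euclidean = minkowski(origin, point, 2)   # hypotenuse of the 3-4-5 triangle
largest_axis = chebyshev(origin, point)   # max(3, 4)
```

Cosine similarity ignores magnitude: `cosine_similarity((1, 2), (2, 4))` is approximately 1 because the vectors point the same way, while perpendicular vectors score 0.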

learning ai social straight intelligence artificial metrics grid ml outer preferred similarity cartesian pythagorean euclidean minkowski cosine

MLA 012 Docker

Play Episode Listen Later Nov 9, 2020 30:58

Use Docker for env setup on localhost & cloud deployment, instead of pyenv / Anaconda. I recommend Windows for your desktop.

ai artificial intelligence windows machine learning data science anaconda docker

MLA 013 Customer Facing Tech Stack

Play Episode Listen Later Jan 3, 2021 46:54

Client, server, database, etc.

ai clients facing artificial intelligence machine learning data science tech stack

MLA 014 Machine Learning Server

Play Episode Listen Later Jan 18, 2021 51:50

Server-side ML. Training & hosting for inference, with a goal towards serverless. AWS SageMaker, Batch, Lambda, EFS, Cortex.dev

ai training artificial intelligence machine learning data science ml server batch lambda cortex efs

MLA 015 SageMaker 1

Play Episode Listen Later Nov 4, 2021 46:46

Show notes Part 1 of deploying your ML models to the cloud with SageMaker (MLOps) MLOps is deploying your ML models to the cloud. See MadeWithML for an overview of tooling (also generally a great ML educational run-down.) SageMaker DataWrangler Feature Store Ground Truth Clarify Studio AutoPilot Debugger Distributed Training And I forgot to mention JumpStart, I'll mention next time.

ai artificial intelligence machine learning data science ml sagemaker

MLA 016 SageMaker 2

Play Episode Listen Later Nov 5, 2021 59:43

Part 2 of deploying your ML models to the cloud with SageMaker (MLOps) MLOps is deploying your ML models to the cloud. See MadeWithML for an overview of tooling (also generally a great ML educational run-down.) SageMaker Jumpstart Deploy Pipelines Monitor Kubernetes Neo

ai artificial intelligence machine learning data science ml sagemaker

MLA 017 AWS Local Development

Play Episode Listen Later Nov 6, 2021 64:04

Show notes: ocdevel.com/mlg/mla-17
Developing on AWS first (SageMaker or other): Consider developing against AWS as your local development environment, rather than only your cloud deployment environment.
Solutions:
Stick to AWS Cloud IDEs (Lambda, SageMaker Studio, Cloud9)
Connect to deployed infrastructure via Client VPN: Terraform example, YouTube tutorial, Creating the keys
LocalStack
Infrastructure as Code: Terraform, CDK, Serverless

ai development local developing artificial intelligence machine learning aws data science sagemaker

MLA 018 Descript

Play Episode Listen Later Nov 7, 2021 6:22

(Optional episode) just showcasing a cool application using machine learning Dept uses Descript for some of their podcasting. I'm using it like a maniac, I think they're surprised at how into it I am. Check out the transcript & see how it performed. Descript The Ship It Podcast How to ship software, from the front lines. We talk with software developers about their craft, developer tools, developer productivity and what makes software development awesome. Hosted by your friends at Rocket Insights. AKA shipit.io Brandbeats Podcast by BASIC An agency podcast with views on design, technology, art, and culture. Explore the new microsite at www.brandbeats.basicagency.com

ai explore artificial intelligence aka machine learning data science optional descript

MLA 019 DevOps

Play Episode Listen Later Jan 13, 2022 74:38

Chatting with co-workers about the role of DevOps in a machine learning engineer's life Expert coworkers at Dept Matt Merrill - Principal Software Developer Jirawat Uttayaya - DevOps Lead The Ship It Podcast (where Matt features often) Devops tools Terraform Ansible Pictures (funny and serious) Which AWS container service should I use? A visual guide on troubleshooting Kubernetes deployments Public Cloud Services Comparison Killed by Google aCloudGuru AWS curriculum

ai artificial intelligence expert chatting machine learning data science devops kubernetes

MLA 020 Kubeflow

Play Episode Listen Later Jan 29, 2022 67:57

Conversation with Dirk-Jan Kubeflow (vs cloud native solutions like SageMaker) Dirk-Jan Verdoorn - Data Scientist at Dept Agency Kubeflow. (From the website:) The Machine Learning Toolkit for Kubernetes. The Kubeflow project is dedicated to making deployments of machine learning (ML) workflows on Kubernetes simple, portable and scalable. Our goal is not to recreate other services, but to provide a straightforward way to deploy best-of-breed open-source systems for ML to diverse infrastructures. Anywhere you are running Kubernetes, you should be able to run Kubeflow. TensorFlow Extended (TFX). If using TensorFlow with Kubeflow, combine with TFX for maximum power. (From the website:) TensorFlow Extended (TFX) is an end-to-end platform for deploying production ML pipelines. When you're ready to move your models from research to production, use TFX to create and manage a production pipeline. Alternatives: Airflow MLflow

conversations artificial intelligence machine learning data science ml data scientists kubernetes tensorflow tfx

MLA 021 Databricks

Play Episode Listen Later Jun 22, 2022 25:45

Discussing Databricks with Ming Chang from Raybeam (part of DEPT®)

machine learning data science databricks

MLG 033 Transformers

Play Episode Listen Later Feb 9, 2025 42:14

Try a walking desk while studying ML or working on your projects! 3Blue1Brown videos
Background & Motivation:
RNN Limitations: Sequential processing prevents full parallelization, even with attention tweaks, making RNNs inefficient on modern hardware.
Breakthrough: “Attention Is All You Need” replaced recurrence with self-attention, unlocking massive parallelism and scalability.
Core Architecture:
Layer Stack: Consists of alternating self-attention and feed-forward (MLP) layers, each wrapped in residual connections and layer normalization.
Positional Encodings: Self-attention by itself has no notion of token order (permuting the input tokens simply permutes the outputs), so sinusoidal or learned positional embeddings are added to inject sequence order.
Self-Attention Mechanism:
Q, K, V Explained: Query (Q): The representation of the token seeking contextual info. Key (K): The representation of tokens being compared against. Value (V): The information to be aggregated based on the attention scores.
Multi-Head Attention: Splits Q, K, V into multiple “heads” to capture diverse relationships and nuances across different subspaces.
Dot-Product & Scaling: Computes similarity between Q and K, scaled by the square root of the key dimension so that large dot products do not saturate the softmax and shrink its gradients, then applies softmax to weigh V accordingly.
Masking:
Causal Masking: In autoregressive models, prevents a token from “seeing” future tokens, ensuring proper generation.
Padding Masks: Ignore padded (non-informative) parts of sequences to maintain meaningful attention distributions.
Feed-Forward Networks (MLPs):
Transformation & Storage: Post-attention MLPs apply non-linear transformations; many argue they're where the “facts” or learned knowledge really get stored.
Depth & Expressivity: Their layered nature deepens the model's capacity to represent complex patterns.
Residual Connections & Normalization:
Residual Links: Crucial for gradient flow in deep architectures, helping to prevent vanishing gradients.
Layer Normalization: Stabilizes training by normalizing across features, enhancing convergence.
Scalability & Efficiency Considerations:
Parallelization Advantage: Entire architecture is designed to exploit modern parallel hardware, a huge win over RNNs.
Complexity Trade-offs: Self-attention's quadratic complexity with sequence length remains a challenge and has spurred innovations like sparse or linearized attention.
Training Paradigms & Emergent Properties:
Pretraining & Fine-Tuning: Massive self-supervised pretraining on diverse data, followed by task-specific fine-tuning, is the norm.
Emergent Behavior: With scale come abilities like in-context learning and few-shot adaptation, aspects that are still being unpacked.
Interpretability & Knowledge Distribution:
Distributed Representation: “Facts” aren't stored in a single layer but are embedded throughout both attention heads and MLP layers.
Debate on Attention: While some see attention weights as interpretable, a growing view is that real “knowledge” is diffused across the network's parameters.
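To make the Q/K/V, scaling, softmax, and causal-masking steps in these notes concrete, here is a minimal single-head sketch in plain Python. It is an illustration under simplifying assumptions, not code from the episode: there are no learned projection matrices (the same toy vectors are used directly as Q, K, and V), no batching, and no multi-head split. The names `softmax` and `attention` are chosen for this example.

```python
import math


def softmax(scores):
    """Numerically stable softmax; a score of -inf gets weight 0."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(Q, K, V, causal=False):
    """Scaled dot-product attention over lists of vectors.

    Returns (outputs, weights). With causal=True, position i may only
    attend to positions j <= i, as in autoregressive models.
    """
    d_k = len(K[0])
    outputs, weights = [], []
    for i, q in enumerate(Q):
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d_k) for k in K]
        if causal:
            scores = [s if j <= i else float("-inf") for j, s in enumerate(scores)]
        row = softmax(scores)
        weights.append(row)
        # Weighted sum of the value vectors.
        outputs.append([sum(w * v[c] for w, v in zip(row, V)) for c in range(len(V[0]))])
    return outputs, weights


tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # toy 3-token sequence, d_k = 2
outputs, weights = attention(tokens, tokens, tokens, causal=True)
```

With the causal mask on, the first token can only attend to itself, so its attention row is `[1.0, 0.0, 0.0]` and its output equals its own value vector. Each row of `weights` sums to 1, and the nested loop over queries and keys is the quadratic cost in sequence length mentioned in the notes.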

ai debate artificial intelligence transformers machine learning data science ml mlp mlps 3blue1brown rnns

MLA 022 Code AI Tools

Play Episode Listen Later Feb 9, 2025 46:35

Try a walking desk while studying ML or working on your projects! https://ocdevel.com/walk Show notes: https://ocdevel.com/mlg/mla-22 Tools discussed: Windsurf: https://codeium.com/windsurf Copilot: https://github.com/features/copilot Cursor: https://www.cursor.com/ Cline: https://github.com/cline/cline Roo Code: https://github.com/RooVetGit/Roo-Code Aider: https://aider.chat/ Other: Leaderboards: https://aider.chat/docs/leaderboards/ Video of speed-demon: https://www.youtube.com/watch?v=QlUt06XLbJE&feature=youtu.be Reddit: https://www.reddit.com/r/chatgptcoding/ Examines the rapidly evolving world of AI coding tools designed to boost programming productivity by acting as a pair programming partner. The discussion groups these tools into three categories: • Hands-Off Tools: These include solutions that work on fixed monthly fees and require minimal user intervention. GitHub Copilot started with simple tab completions and now offers an agent mode similar to Cursor, which stands out for its advanced codebase indexing and intelligent file searching. Windsurf is noted for its simplicity—accepting prompts and performing automated edits—but some users report performance throttling after prolonged use. • Hands-On Tools: Aider is presented as a command-line utility that demands configuration and user involvement. It allows developers to specify files and settings, and it efficiently manages token usage by sending prompts in diff format. Aider also implements an “architect versus edit” approach: a reasoning model (such as DeepSeek R1) first outlines a sequence of changes, then an editor model (like Claude 3.5 Sonnet) produces precise code edits. This dual-model strategy enhances accuracy and reduces token costs, especially for complex tasks. • Intermediate Power Tools: Open-source tools such as Cline and its more advanced fork, RooCode, require users to supply their own API keys and pay per token. 
These tools offer robust, agentic features, including codebase indexing, file editing, and even browser automation. RooCode stands out with its ability to autonomously expand functionality through integrations (for example, managing cloud resources or querying issue trackers), making it particularly attractive for tinkerers and power users. A decision framework is suggested: for those new to AI coding assistants or with limited budgets, starting with Cursor (or cautiously exploring Copilot's new features) is recommended. For developers who want to customize their workflow and dive deep into the tooling, RooCode or Cline offer greater control—always paired with Aider for precise and token-efficient code edits. Also reviews model performance using a coding benchmark leaderboard that updates frequently. The current top-performing combination uses DeepSeek R1 as the architect and Claude 3.5 Sonnet as the editor, with alternatives such as OpenAI's O1 and O3 Mini available. Tools like Open Router are mentioned as a way to consolidate API key management and reduce token costs.

ai video tools code artificial intelligence reddit programming openai api ml cline copilot aider sonnets cursor github copilot windsurf o1

MLA 023 Code AI Models & Modes

Play Episode Listen Later Apr 13, 2025 37:35

Notes and resources at ocdevel.com/mlg/mla-23
Try a walking desk to stay healthy while you study or work!
Current Model Leaders
According to the Aider Leaderboard (as of April 12, 2025), leading models for vibe-coding include:
Gemini 2.5 Pro Preview 03-25: the most accurate and cost-effective option as of that date.
Claude 3.7 Sonnet: Performs well in both architect and code modes with reasoning flags enabled.
DeepSeek R1 with Claude 3.5 Sonnet: A popular combination for its balance of cost and performance between reasoning and non-reasoning tasks.
Local Models
Tools for Local Models: Ollama is the standard tool to manage local models, enabling usage without internet connectivity.
Privacy and Security: Using local models enhances data security, which suits sensitive projects or corporate environments that require data to remain onsite.
Performance Trade-offs: Local models, due to distillation and size constraints, often perform slightly worse than cloud-hosted models but offer privacy benefits.
Fine-Tuning Models
Customization: Developers can fine-tune pre-trained models to specialize them for their specific codebase, enhancing relevance and accuracy.
Advanced Usage: Suitable for long-term projects, fine-tuning helps models understand unique aspects of a project, resulting in consistent code quality improvements.
Tips and Best Practices
Judicious Use of the @ Key: Improves model efficiency by specifying the context of commands, reducing the need for AI-initiated searches. Examples include specifying file paths, URLs, or git commits to inform AI actions more precisely.
Concurrent Feature Implementation: Leverage tools like Boomerang mode to manage multiple features simultaneously, acting more as a manager overseeing several tasks at once.
Continued Learning: Stay up to date with documentation, particularly Roo Code's, given its comprehensive feature set and versatility among AI coding tools.

ai tips local code artificial intelligence privacy gemini openai machine learning cline boomerang copilot modes anthropic urls cursor ai models

MLA 024 Code AI MCP Servers, ML Engineering

Play Episode Listen Later Apr 13, 2025 43:38

Tool Use and Model Context Protocol (MCP) Notes and resources at ocdevel.com/mlg/mla-24 Try a walking desk to stay healthy while you study or work! Tool Use in Vibe Coding Agents File Operations: Agents can read, edit, and search files using sophisticated regular expressions. Executable Commands: They can recommend and perform installations like pip or npm installs, with user approval. Browser Integration: Allows agents to perform actions and verify outcomes through browser interactions. Model Context Protocol (MCP) Standardization: MCP was created by Anthropic to standardize how AI tools and agents communicate with each other and with external tools. Implementation: MCP Client: Converts AI agent requests into structured commands. MCP Server: Executes commands and sends structured responses back to the client. Local and Cloud Frameworks: Local (S-T-D-I-O MCP): Examples include utilizing Playwright for local browser automation and connecting to local databases like Postgres. Cloud (SSE MCP): SaaS providers offer cloud-hosted MCPs to enhance external integrations. Expanding AI Capabilities with MCP Servers Directories: Various directories exist listing MCP servers for diverse functions beyond programming. modelcontextprotocol/servers Use Cases: Automation Beyond Coding: Implementing MCPs that extend automation into non-programming tasks like sales, marketing, or personal project management. Creative Solutions: Encourages innovation in automating routine tasks by integrating diverse MCP functionalities. AI Tools in Machine Learning Automating ML Process: Auto ML and Feature Engineering: AI tools assist in transforming raw data, optimizing hyperparameters, and inventing new ML solutions. Pipeline Construction and Deployment: Facilitates the use of infrastructure as code for deploying ML models efficiently. Active Experimentation: Jupyter Integration Challenges: While integrations are possible, they often lag and may not support the latest models. 
Practical Strategies: Suggests alternating between Jupyter and traditional Python files to maximize tool efficiency. Conclusion Action Plan for ML Engineers: Set up structured folders and documentation to leverage AI tools effectively. Encourage systematic exploration of MCPs to enhance both direct programming tasks and associated workflows.
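
The client/server split described under "Implementation" above can be sketched in a few lines of Python. This is a toy, in-process illustration rather than the official MCP SDK: MCP messages follow JSON-RPC 2.0, but the `word_count` tool, the `TOOLS` registry, and the helper names are hypothetical, and the message fields are a simplified approximation of a real tool call.

```python
import json


# Hypothetical tool exposed by the toy server (not part of any real MCP server).
def word_count(arguments: dict) -> str:
    return str(len(arguments["text"].split()))


TOOLS = {"word_count": word_count}


def client_build_request(request_id: int, tool: str, arguments: dict) -> str:
    """Client side: turn an agent's intent into a structured JSON-RPC command."""
    return json.dumps({
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    })


def server_handle(raw_request: str) -> str:
    """Server side: execute the command and send back a structured response."""
    request = json.loads(raw_request)
    params = request.get("params", {})
    tool = TOOLS.get(params.get("name"))
    if request.get("method") != "tools/call" or tool is None:
        # Simplified error handling for the sketch: one error code for all failures.
        return json.dumps({
            "jsonrpc": "2.0",
            "id": request.get("id"),
            "error": {"code": -32601, "message": "Method not found"},
        })
    text = tool(params.get("arguments", {}))
    return json.dumps({
        "jsonrpc": "2.0",
        "id": request.get("id"),
        "result": {"content": [{"type": "text", "text": text}]},
    })


request = client_build_request(1, "word_count", {"text": "tool use with MCP"})
response = json.loads(server_handle(request))
print(response["result"]["content"][0]["text"])  # prints 4
```

In a real deployment the two functions would live in separate processes and exchange these strings over stdio (local) or SSE (cloud), as described above.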

ai local code engineering encourage machine learning python ml cline playwright anthropic servers mcp cursor postgres jupyter mcps tool use

MLG 034 Large Language Models 1

Play Episode Listen Later May 7, 2025 50:48

Explains advancements in large language models (LLMs): scaling laws (the relationships among model size, data size, and compute) and how emergent abilities such as in-context learning, multi-step reasoning, and instruction following arise once certain scaling thresholds are crossed. Covers the evolution of the transformer architecture with Mixture of Experts (MoE), describes the three-phase training process culminating in Reinforcement Learning from Human Feedback (RLHF) for model alignment, and explores advanced reasoning techniques such as chain-of-thought prompting, which significantly improve complex task performance. Links Notes and resources at ocdevel.com/mlg/mlg34 Build the future of multi-agent software with AGNTCY Try a walking desk - stay healthy & sharp while you learn & code Transformer Foundations and Scaling Laws Transformers: Introduced by the 2017 "Attention is All You Need" paper, transformers allow for parallel training and inference of sequences using self-attention, in contrast to the sequential nature of RNNs. Scaling Laws: Empirical research revealed that LLM performance improves predictably as model size (parameters), data size (training tokens), and compute are increased together, with diminishing returns if only one variable is scaled disproportionately. The "Chinchilla scaling law" (DeepMind, 2022) established the optimal model/data/compute ratio for efficient model performance: earlier large models like GPT-3 were undertrained relative to their size, whereas right-sized models with more training data (e.g., Chinchilla, LLaMA series) proved more compute- and inference-efficient. Emergent Abilities in LLMs Emergence: When trained beyond a certain scale, LLMs display abilities not present in smaller models, including: In-Context Learning (ICL): Performing new tasks based solely on prompt examples at inference time. Instruction Following: Executing natural language tasks not seen during training. 
Multi-Step Reasoning & Chain of Thought (CoT): Solving arithmetic, logic, or symbolic reasoning by generating intermediate reasoning steps. Discontinuity & Debate: These abilities appear abruptly in larger models, though recent research suggests that this could result from non-linearities in evaluation metrics rather than innate model properties. Architectural Evolutions: Mixture of Experts (MoE) MoE Layers: Modern LLMs often replace standard feed-forward layers with MoE structures. These are composed of many independent "expert" networks specializing in different subdomains or latent structures. A gating network routes tokens to the most relevant experts per input, activating only a subset of parameters; this is called "sparse activation." It enables much larger overall models without proportional increases in compute per inference, but requires the entire model in memory and introduces new challenges like load balancing and communication overhead. Specialization & Efficiency: Experts learn different data/knowledge types, boosting model specialization and throughput, though care is needed to avoid overfitting and underutilization of specialists. The Three-Phase Training Process 1. Unsupervised Pre-Training: Next-token prediction on massive datasets; builds a foundation model capturing general language patterns. 2. Supervised Fine Tuning (SFT): Training on labeled prompt-response pairs to teach the model how to perform specific tasks (e.g., question answering, summarization, code generation). Overfitting and "catastrophic forgetting" are risks if not carefully managed. 3. Reinforcement Learning from Human Feedback (RLHF): Collects human preference data by generating multiple responses to prompts and then having annotators rank them. Builds a reward model based on these rankings, then updates the LLM with a reinforcement learning algorithm (often PPO) to maximize alignment with human preferences (helpfulness, harmlessness, truthfulness). 
Introduces complexity and risk of reward hacking (specification gaming), where the model may exploit the reward system in unanticipated ways. Advanced Reasoning Techniques Prompt Engineering: The art/science of crafting prompts that elicit better model responses, shown to dramatically affect model output quality. Chain of Thought (CoT) Prompting: Guides models to elaborate step-by-step reasoning before arriving at final answers—demonstrably improves results on complex tasks. Variants include zero-shot CoT ("let's think step by step"), few-shot CoT with worked examples, self-consistency (voting among multiple reasoning chains), and Tree of Thought (explores multiple reasoning branches in parallel). Automated Reasoning Optimization: Frontier models selectively apply these advanced reasoning techniques, balancing compute costs with gains in accuracy and transparency. Optimization for Training and Inference Tradeoffs: The optimal balance between model size, data, and compute is determined not only for pretraining but also for inference efficiency, as lifetime inference costs may exceed initial training costs. Current Trends: Efficient scaling, model specialization (MoE), careful fine-tuning, RLHF alignment, and automated reasoning techniques define state-of-the-art LLM development.
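
The sparse-activation routing described in the MoE section above can be illustrated with a minimal, dependency-free sketch. Everything here is a toy assumption: the four "experts" are simple scaling functions, the gate weights are hand-picked, and `moe_forward` is a hypothetical name. Real MoE layers use learned feed-forward experts and add load-balancing terms.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def moe_forward(token, gate_weights, experts, top_k=2):
    """Route one token vector to its top_k experts and mix their outputs."""
    # One gate logit per expert: dot product of the token with that expert's gate row.
    logits = [sum(t * w for t, w in zip(token, row)) for row in gate_weights]
    # Sparse activation: only the top_k highest-scoring experts are evaluated.
    chosen = sorted(range(len(experts)), key=lambda i: logits[i], reverse=True)[:top_k]
    mix = softmax([logits[i] for i in chosen])
    output = [0.0] * len(token)
    for weight, index in zip(mix, chosen):
        expert_out = experts[index](token)
        output = [o + weight * e for o, e in zip(output, expert_out)]
    return output, chosen


# Toy experts: each one just scales the token by a different factor.
experts = [lambda x, s=s: [s * v for v in x] for s in (1.0, 2.0, 3.0, 4.0)]
gate_weights = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]

output, chosen = moe_forward([2.0, 1.0], gate_weights, experts, top_k=2)
print(chosen)  # indices of the experts that actually ran
print(output)
```

Only two of the four experts run for this token, which is why compute per token stays roughly flat as experts are added, while all experts still have to be held in memory.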

MLG 035 Large Language Models 2

Play Episode Listen Later May 8, 2025 45:25

At inference, large language models use in-context learning with zero-, one-, or few-shot examples to perform new tasks without weight updates, and can be grounded with Retrieval Augmented Generation (RAG) by embedding documents into vector databases for real-time factual lookup using cosine similarity. LLM agents autonomously plan, act, and use external tools via orchestrated loops with persistent memory, while recent benchmarks like GPQA (STEM reasoning), SWE Bench (agentic coding), and MMMU (multimodal college-level tasks) test performance alongside prompt engineering techniques such as chain-of-thought reasoning, structured few-shot prompts, positive instruction framing, and iterative self-correction. Links Notes and resources at ocdevel.com/mlg/mlg35 Build the future of multi-agent software with AGNTCY Try a walking desk stay healthy & sharp while you learn & code In-Context Learning (ICL) Definition: LLMs can perform tasks by learning from examples provided directly in the prompt without updating their parameters. Types: Zero-shot: Direct query, no examples provided. One-shot: Single example provided. Few-shot: Multiple examples, balancing quantity with context window limitations. Mechanism: ICL works through analogy and Bayesian inference, using examples as semantic priors to activate relevant internal representations. Emergent Properties: ICL is an "inference-time training" approach, leveraging the model's pre-trained knowledge without gradient updates; its effectiveness can be enhanced with diverse, non-redundant examples. Retrieval Augmented Generation (RAG) and Grounding Grounding: Connecting LLMs with external knowledge bases to supplement or update static training data. Motivation: LLMs' training data becomes outdated or lacks proprietary/specialized knowledge. Benefit: Reduces hallucinations and improves factual accuracy by incorporating current or domain-specific information. 
RAG Workflow: Embedding: Documents are converted into vector embeddings (using sentence transformers or representation models). Storage: Vectors are stored in a vector database (e.g., FAISS, ChromaDB, Qdrant). Retrieval: When a query is made, relevant chunks are extracted based on similarity, possibly with re-ranking or additional query processing. Augmentation: Retrieved chunks are added to the prompt to provide up-to-date context for generation. Generation: The LLM generates responses informed by the augmented context. Advanced RAG: Includes agentic approaches—self-correction, aggregation, or multi-agent contribution to source ingestion, and can integrate external document sources (e.g., web search for real-time info, or custom datasets for private knowledge). LLM Agents Overview: Agents extend LLMs by providing goal-oriented, iterative problem-solving through interaction, memory, planning, and tool usage. Key Components: Reasoning Engine (LLM Core): Interprets goals, states, and makes decisions. Planning Module: Breaks down complex tasks using strategies such as Chain of Thought or ReAct; can incorporate reflection and adjustment. Memory: Short-term via context window; long-term via persistent storage like RAG-integrated databases or special memory systems. Tools and APIs: Agents select and use external functions—file manipulation, browser control, code execution, database queries, or invoking smaller/fine-tuned models. Capabilities: Support self-evaluation, correction, and multi-step planning; allow integration with other agents (multi-agent systems); face limitations in memory continuity, adaptivity, and controllability. Current Trends: Research and development are shifting toward these agentic paradigms as LLM core scaling saturates. Multimodal Large Language Models (MLLMs) Definition: Models capable of ingesting and generating across different modalities (text, image, audio, video). 
Architecture: Modality-Specific Encoders: Convert raw modalities (text, image, audio) into numeric embeddings (e.g., vision transformers for images). Fusion/Alignment Layer: Embeddings from different modalities are projected into a shared space, often via cross-attention or concatenation, allowing the model to jointly reason about their content. Unified Transformer Backbone: Processes fused embeddings to allow cross-modal reasoning and generates outputs in the required format. Recent Advances: Unified architectures (e.g., GPT-4o) use a single model for all modalities rather than switching between separate sub-models. Functionality: Enables actions such as image analysis via text prompts, visual Q&A, and integrated speech recognition/generation. Advanced LLM Architectures and Training Directions Predictive Abstract Representation: Incorporating latent concept prediction alongside token prediction (e.g., via autoencoders). Patch-Level Training: Predicting larger “patches” of tokens to reduce sequence lengths and computation. Concept-Centric Modeling: Moving from next-token prediction to predicting sequences of semantic concepts (e.g., Meta's Large Concept Model). Multi-Token Prediction: Training models to predict multiple future tokens for broader context capture. Evaluation Benchmarks (as of 2025) Key Benchmarks Used for LLM Evaluation: GPQA (Diamond): Graduate-level STEM reasoning. SWE Bench Verified: Real-world software engineering, verifying agentic code abilities. MMMU: Multimodal, college-level cross-disciplinary reasoning. HumanEval: Python coding correctness. HLE (Humanity's Last Exam): Extremely challenging, multimodal knowledge assessment. LiveCodeBench: Coding with contamination-free, up-to-date problems. MLPerf Inference v5.0 Long Context: Throughput/latency for processing long contexts. MultiChallenge Conversational AI: Multiturn dialogue, in-context reasoning. TAU-bench/BFCL: Tool utilization in agentic tasks. 
TruthfulQA: Measures tendency toward factual accuracy/robustness against misinformation. Prompt Engineering: High-Impact Techniques Foundational Approaches: Few-Shot Prompting: Provide pairs of inputs and desired outputs to steer the LLM. Chain of Thought: Instructing the LLM to think step-by-step, either explicitly or through internal self-reprompting, enhances reasoning and output quality. Clarity and Structure: Use clear, detailed, and structured instructions: task definition, context, constraints, output format, and delimiters or markdown structuring. Affirmative Directives: Phrase instructions positively (“write a concise summary” instead of “don't write a long summary”). Iterative Self-Refinement: Prompt the LLM to review and improve its prior response for better completeness, clarity, and factuality. System Prompt/Role Assignment: Assign a persona or role to the LLM for tailored behavior (e.g., “You are an expert Python programmer”). Guideline: Regularly consult official prompting guides from model developers as model capabilities evolve. Trends and Research Outlook Inference-time compute is increasingly important for pushing the boundaries of LLM task performance. Agentic LLMs and multimodal reasoning represent the primary frontiers for innovation. Prompt engineering and benchmarking remain essential for extracting optimal performance and assessing progress. Models are expected to continue evolving with research into new architectures, memory systems, and integration techniques.
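
The RAG workflow above (embedding, storage, retrieval by cosine similarity, augmentation) can be sketched without any external services. This is a minimal illustration under loud assumptions: `embed` is a toy bag-of-words stand-in for a real embedding model, the in-memory `store` list stands in for a vector database, and the final LLM call is left out entirely.

```python
import math
from collections import Counter


def embed(text):
    """Toy embedding: word counts. A real system would use a learned embedding model."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, store, k=1):
    """Return the k stored chunks most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(store, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]


def build_prompt(query, chunks):
    """Augmentation step: prepend the retrieved chunks to the user question."""
    context = "\n".join(f"- {chunk}" for chunk in chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


documents = [
    "MCP servers expose tools to agents",
    "Autoencoders compress inputs into a code",
    "Chinchilla showed GPT-3 was undertrained",
]
store = [(doc, embed(doc)) for doc in documents]  # stand-in for a vector database

query = "what do autoencoders compress"
top = retrieve(query, store, k=1)
prompt = build_prompt(query, top)
print(prompt)
```

Swapping `embed` for a sentence-transformer model and `store` for FAISS, ChromaDB, or Qdrant changes the components but not the shape of the loop.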

ai tools single chatgpt artificial intelligence clarity stem models react gemini chain machine learning gpt python ml llm prompt rag large language models bayesian

MLG 036 Autoencoders

Play Episode Listen Later May 30, 2025 65:55

Autoencoders are neural networks that compress data into a smaller "code," enabling dimensionality reduction, data cleaning, and lossy compression by reconstructing original inputs from this code. Advanced autoencoder types, such as denoising, sparse, and variational autoencoders, extend these concepts for applications in generative modeling, interpretability, and synthetic data generation. Links Notes and resources at ocdevel.com/mlg/36 Try a walking desk - stay healthy & sharp while you learn & code Build the future of multi-agent software with AGNTCY. Thanks to T.J. Wilder from intrep.io for recording this episode! Fundamentals of Autoencoders Autoencoders are neural networks designed to reconstruct their input data by passing data through a compressed intermediate representation called a “code.” The architecture typically follows an hourglass shape: a wide input and output separated by a narrower bottleneck layer that enforces information compression. The encoder compresses input data into the code, while the decoder reconstructs the original input from this code. Comparison with Supervised Learning Unlike traditional supervised learning, where the output differs from the input (e.g., image classification), autoencoders use the same vector for both input and output. Use Cases: Dimensionality Reduction and Representation Autoencoders perform dimensionality reduction by learning compressed forms of high-dimensional data, making it easier to visualize and process data with many features. The compressed code can be used for clustering, visualization in 2D or 3D graphs, and input into subsequent machine learning models, saving computational resources and improving scalability. Feature Learning and Embeddings Autoencoders enable feature learning by extracting abstract representations from the input data, similar in concept to learned embeddings in large language models (LLMs). 
While effective for many data types, autoencoder-based encodings are less suited for variable-length text compared to LLM embeddings. Data Search, Clustering, and Compression By reducing dimensionality, autoencoders facilitate vector searches, efficient clustering, and similarity retrieval. The compressed codes enable lossy compression analogous to audio codecs like MP3, with the difference that autoencoders lack domain-specific optimizations for preserving perceptually important data. Reconstruction Fidelity and Loss Types Loss functions in autoencoders are defined to compare reconstructed outputs to original inputs, often using different loss types depending on input variable types (e.g., Boolean vs. continuous). Compression via autoencoders is typically lossy, meaning some information from the input is lost during reconstruction, and the areas of information lost may not be easily controlled. Outlier Detection and Noise Reduction Since reconstruction errors tend to move data toward the mean, autoencoders can be used to reduce noise and identify data outliers. Large reconstruction errors can signal atypical or outlier samples in the dataset. Denoising Autoencoders Denoising autoencoders are trained to reconstruct clean data from noisy inputs, making them valuable for applications in image and audio de-noising as well as signal smoothing. Iterative denoising as a principle forms the basis for diffusion models, where repeated application of a denoising autoencoder can gradually turn random noise into structured output. Data Imputation Autoencoders can aid in data imputation by filling in missing values: training on complete records and reconstructing missing entries for incomplete records using learned code representations. This approach leverages the model's propensity to output ‘plausible' values learned from overall data structure. 
Cryptographic Analogy The separation of encoding and decoding can draw parallels to encryption and decryption, though autoencoders are not intended or suitable for secure communication due to their inherent lossiness. Advanced Architectures: Sparse and Overcomplete Autoencoders Sparse autoencoders use constraints to encourage code representations with only a few active values, increasing interpretability and explainability. Overcomplete autoencoders have a code size larger than the input, often in applications that require extraction of distinct, interpretable features from complex model states. Interpretability and Research Example Research such as Anthropic's “Towards Monosemanticity” applies sparse autoencoders to the internal activations of language models to identify interpretable features correlated with concrete linguistic or semantic concepts. These models can be used to monitor and potentially control model behaviors (e.g., detecting specific language usage or enforcing safety constraints) by manipulating feature activations. Variational Autoencoders (VAEs) VAEs extend autoencoder architecture by encoding inputs as distributions (means and standard deviations) instead of point values, enforcing a continuous, normalized code space. Decoding from sampled points within this space enables synthetic data generation, as any point near the center of the code space corresponds to plausible data according to the model. VAEs for Synthetic Data and Rare Event Amplification VAEs are powerful in domains with sparse data or rare events (e.g., healthcare), allowing generation of synthetic samples representing underrepresented cases. They can increase model performance by augmenting datasets without requiring changes to existing model pipelines. 
Conditional Generative Techniques Conditional autoencoders extend VAEs by allowing controlled generation based on specified conditions (e.g., generating a house with a pool), through additional decoder inputs and conditional loss terms. Practical Considerations and Limitations Training autoencoders and their variants requires computational resources, and their stochastic training can produce differing code representations across runs. Lossy reconstruction, lack of domain-specific optimizations, and limited code interpretability restrict some use cases, particularly where exact data preservation or meaningful decompositions are required.
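
The VAE idea above (encoding each input as a distribution and keeping the code space close to a standard normal) can be made concrete with a small sketch. Assumptions to note: the encoder output is taken to be a mean and a log-variance per code dimension, which is a common parameterization but not one the episode notes specify, and the function names are hypothetical. No network is trained here; only the sampling step and the regularization term are shown.

```python
import math
import random


def sample_code(mu, log_var, rng):
    """Reparameterization: z = mu + sigma * eps, with eps drawn from N(0, 1)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, sigma^2) || N(0, 1) ), summed over code dimensions."""
    return 0.5 * sum(m * m + math.exp(lv) - 1.0 - lv for m, lv in zip(mu, log_var))


rng = random.Random(0)

# Hypothetical encoder output for one input: a 2-dimensional code distribution.
mu, log_var = [1.0, -0.5], [0.0, -1.0]

z = sample_code(mu, log_var, rng)            # a point to feed the decoder
penalty = kl_to_standard_normal(mu, log_var)  # added to the reconstruction loss
print(z, penalty)

# Generation: sample directly from the prior and decode (decoder not shown).
z_new = [rng.gauss(0.0, 1.0) for _ in mu]
```

The KL penalty is zero only when the encoder outputs exactly a standard normal, which is what pulls codes toward the center of the space and makes points sampled there decode to plausible data.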

MLA 025 AI Image Generation: Midjourney vs Stable Diffusion, GPT-4o, Imagen & Firefly

Play Episode Listen Later Jul 9, 2025 72:33

The 2025 generative AI image market is a trade-off between aesthetic quality, instruction-following, and user control. This episode analyzes the key platforms, comparing Midjourney's artistic output against the superior text generation and prompt adherence of GPT-4o and Imagen 4, the commercial safety of Adobe Firefly, and the total customization of Stable Diffusion. Links Notes and resources at ocdevel.com/mlg/mla-25 Try a walking desk - stay healthy & sharp while you learn & code Build the future of multi-agent software with AGNTCY. The State of the Market The market is split by three core philosophies: The "Artist" (Midjourney): Prioritizes aesthetic excellence and cinematic output, sacrificing precise user control and instruction following. The "Collaborator" (GPT-4o, Imagen 4): Extensions of LLMs that excel at conversational co-creation, complex instruction following, and integration into productivity workflows. The "Sovereign Toolkit" (Stable Diffusion): An open-source engine offering users unparalleled control, customization, and privacy in exchange for technical engagement.
Table 1: 2025 Generative AI Image Tool At-a-Glance Comparison
Tool | Parent Company | Access Method(s) | Pricing | Core Strength | Best For
Midjourney v7 | Midjourney, Inc. | Web App, Discord | Subscription | Artistic Aesthetics & Photorealism | Fine Art, Concept Design, Stylized Visuals
GPT-4o | OpenAI | ChatGPT, API | Freemium/Sub | Conversational Control & Instruction Following | Marketing Materials, UI/UX Mockups, Logos
Google Imagen 4 | Google | Gemini, Workspace, Vertex AI | Freemium/Sub | Ecosystem Integration & Speed | Business Presentations, Educational Content
Stable Diffusion 3 | Stability AI | Local Install, Web UIs, API | Open Source | Ultimate Customization & Control | Developers, Power Users, Bespoke Workflows
Adobe Firefly | Adobe | Creative Cloud Apps, Web App | Subscription | Commercial Safety & Workflow Integration | Professional Designers, Agencies, Enterprise
Core Platforms Midjourney v7: Premium choice for artistic quality. 
Features: Web UI with Draft Mode, user personalization, emerging video/3D. Weaknesses: Poor text generation, poor prompt adherence, public images on cheap plans, no API/bans automation. OpenAI GPT-4o: An intelligent co-creator for controlled generation. Features: Conversational refinement, superior text rendering, understands uploaded image context. Weaknesses: Slower than competitors, generates one image at a time, strict content filters. Google Imagen 4: Pragmatic tool focused on speed and ecosystem integration. Features: High-quality photorealism, fast generation, strong text rendering, multilingual. Weaknesses: Less artistic flair; value is dependent on Google ecosystem investment. Stable Diffusion 3: Open-source engine for maximum user control. Features: MMDiT architecture improves prompt/text handling, scalable models, vast ecosystem (LoRAs/ControlNet). Weaknesses: Steep learning curve, quality is user-dependent. Adobe Firefly: Focused on commercial safety and professional workflow integration. Features: Trained on Adobe Stock for legal indemnity, Generative Fill/Expand tools. Weaknesses: Creative range limited by training data, requires Adobe subscription/credits. Tools and Concepts In-painting: Modifying a masked area inside an image. Out-painting: Extending an image beyond its original borders. LoRA (Low-Rank Adaptation): A small file that applies a fine-tuned style, character, or concept to a base model. ControlNet: Uses a reference image (e.g., pose, sketch) to enforce the composition, structure, or pose of the output. A1111 vs. ComfyUI: Two main UIs for Stable Diffusion. A1111 is a beginner-friendly tabbed interface; ComfyUI is a node-based interface for complex, efficient, and automated workflows. Workflows "Best of Both Worlds": Generate aesthetic base images in Midjourney, then composite, edit, and add text with precision in Photoshop/Firefly. 
Single-Ecosystem: Work entirely within Adobe Creative Cloud or Google Workspace for seamless integration, commercial safety (Adobe), and convenience (Google). "Build Your Own Factory": Use ComfyUI to build automated, multi-step pipelines for consistent character generation, advanced upscaling, and video. Decision Framework Choose by Goal: Fine Art/Concept Art: Midjourney. Logos/Ads with Text: GPT-4o, Google Imagen 4, or specialist Ideogram. Consistent Character in Specific Pose: Stable Diffusion with a Character LoRA and ControlNet (OpenPose). Editing/Expanding an Existing Photo: Adobe Photoshop with Firefly. Exclusion Rules: If you need legible text, exclude Midjourney. If you need absolute privacy or zero cost (post-hardware), Stable Diffusion is the only option. If you need guaranteed commercial legal safety, use Adobe Firefly. If you need an API for a product, use OpenAI or Google; automating Midjourney is a bannable offense.
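
Why a LoRA is "a small file" follows from the low-rank update it stores. The sketch below is a toy illustration with plain Python lists: a frozen weight matrix `W` is adjusted by the product of two thin matrices `B` and `A`, scaled by `alpha / rank`. The matrix sizes and helper names are illustrative assumptions, not values taken from any particular Stable Diffusion checkpoint.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_merge(W, A, B, alpha, rank):
    """Return W + (alpha / rank) * (B @ A), the base weights with the LoRA applied."""
    delta = matmul(B, A)
    scale = alpha / rank
    return [[w + scale * d for w, d in zip(w_row, d_row)] for w_row, d_row in zip(W, delta)]


def parameter_counts(d_out, d_in, rank):
    """Compare a full weight update with the two thin LoRA matrices."""
    return d_out * d_in, rank * (d_out + d_in)


# Tiny worked example: a 2x2 base matrix and a rank-1 adapter.
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [2.0]]   # shape d_out x rank
A = [[3.0, 4.0]]     # shape rank x d_in
merged = lora_merge(W, A, B, alpha=2.0, rank=1)
print(merged)  # [[7.0, 8.0], [12.0, 17.0]]

full, lora = parameter_counts(4096, 4096, 8)
print(full, lora)  # 16777216 65536
```

For a single 4096 x 4096 layer at rank 8, the adapter stores 256 times fewer values than a full fine-tuned copy, which is why style or character LoRAs can be shared as small add-on files.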

MLA 026 AI Video Generation: Veo 3 vs Sora, Kling, Runway, Stable Video Diffusion

Play Episode Listen Later Jul 12, 2025 40:39

Google Veo leads the generative video market with superior 4K photorealism and integrated audio, an advantage derived from its YouTube training data. OpenAI Sora is the top tool for narrative storytelling, while Kuaishou Kling excels at animating static images with realistic, high-speed motion. Links Notes and resources at ocdevel.com/mlg/mla-26 Try a walking desk - stay healthy & sharp while you learn & code Build the future of multi-agent software with AGNTCY. S-Tier: Google Veo The market leader due to superior visual quality, physics simulation, 4K resolution, and integrated audio generation, which removes post-production steps. It accurately interprets cinematic prompts ("timelapse," "aerial shots"). Its primary advantage is its integration with Google products, using YouTube's vast video library for rapid model improvement. The professional focus is clear with its filmmaking tool, "Flow." A-Tier: Sora & Kling OpenAI Sora: Excels at interpreting complex narrative prompts and has wide distribution through ChatGPT. Features include in-video editing tools like "Remix" and a "Storyboard" function for multi-shot scenes. Its main limits are 1080p resolution and no native audio. Kuaishou Kling: A leader in image-to-video quality and realistic high-speed motion. It maintains character consistency and has proven commercial viability (RMB 150M in Q1 2025). Its text-to-video interface is less intuitive than Sora's. Summary: Sora is best for storytellers starting with a narrative idea; Kling is best for artists animating a specific image. Control and Customization: Runway & Stable Diffusion Runway: An integrated creative suite with a full video editor and "AI Magic Tools" like Motion Brush and Director Mode. Its value is in generating, editing, and finishing in one platform, offering precise control over stylization and in-shot object alteration. 
Stable Diffusion: An open-source ecosystem (SVD, AnimateDiff) offering maximum control through technical interfaces like ComfyUI. Its strength is a large community developing custom models, LoRAs, and ControlNets for specific tasks like VFX integration. It has a steep learning curve. Niche Tools: Midjourney & More Midjourney Video: The best tool for animating static Midjourney images (image-to-video only), preserving their unique aesthetic. Avatar Platforms (HeyGen, Synthesia): Built for scalable corporate and marketing videos, featuring realistic talking avatars, voice cloning, and multi-language translation with accurate lip-sync.
Head-to-Head Comparison
Feature | Google Veo (S-Tier) | OpenAI Sora (A-Tier) | Kuaishou Kling (A-Tier) | Runway (Power-User Tier)
Photorealism | Winner. Best 4K detail and physics. | Excellent, but can have a stylistic "AI" look. | Very strong, especially with human subjects. | Good, but a step below the top tier.
Consistency | Strong, especially with Flow's scene-building. | Co-Winner. Storyboard feature is built for this. | Co-Winner. Excels in image-to-video consistency. | Good, with character reference tools.
Prompt Adherence | Winner (Language). Best understanding of cinematic terms. | Best for imaginative/narrative prompts. | Strong on motion, less on camera specifics. | Good, but relies more on UI tools.
Directorial Control | Strong via prompt. | Moderate, via prompt and storyboard. | Moderate, focused on motion. | Winner (Interface). Motion Brush & Director Mode offer direct control.
Integrated Audio | Winner. Native dialogue, SFX, and music. Major workflow advantage. | No. Requires post-production. | No. Requires post-production. | No. Requires post-production.
Advanced Multi-Tool Workflows High-Quality Animation: Combine Midjourney (for key-frame art) with Kling or Runway (for motion), then use an AI upscaler like Topaz for 4K finishing. 
VFX Compositing: Use Stable Diffusion (AnimateDiff/ControlNets) to generate specific elements for integration into live-action footage using professional software like Nuke or After Effects. All-in-one models lack the required layer-based control. High-Volume Marketing: Use Veo for the main concept, Runway for creating dozens of variations, and HeyGen for personalized avatar messaging to achieve speed and scale.
Decision Matrix: Who Should Use What?
User Profile | Primary Goal | Recommendation | Justification
The Indie Filmmaker | Pre-visualization, short films. | OpenAI Sora (Primary), Google Veo (Secondary) | Sora's storyboard feature is best for narrative construction. Veo is best for high-quality final shots.
The VFX Artist | Creating animated elements for live-action. | Stable Diffusion (AnimateDiff/ComfyUI) | Offers the layer-based control and pipeline integration needed for professional VFX.
The Creative Agency | Rapid prototyping, social content. | Runway (Primary Suite), Google Veo (For Hero Shots) | Runway's editing/variation tools are built for agency speed. Veo provides the highest quality for the main asset.
The AI Artist / Animator | Art-directed animated pieces. | Midjourney + Kling | Pairs the best image generator with a top-tier motion engine for maximum aesthetic control.
The Corporate Trainer | Training and personalized marketing videos. | HeyGen / Synthesia | Specialized tools for avatar-based video production at scale (voice cloning, translation).
Future Trajectory Pipeline Collapse: More models will integrate audio and editing, pressuring silent-only video generators. The Control Arms Race: Competition will shift from quality to providing more sophisticated directorial tools. Rise of Aggregators: Platforms like OpenArt that provide access to multiple models through a single interface will become essential.

MLA 027 AI Video End-to-End Workflow

Play Episode Listen Later Jul 14, 2025 71:37

How to maintain character consistency, style consistency, etc in an AI video. Prosumers can use Google Veo 3's "High-Quality Chaining" for fast social media content. Indie filmmakers can achieve narrative consistency by combining Midjourney V7 for style, Kling for lip-synced dialogue, and Runway Gen-4 for camera control, while professional studios gain full control with a layered ComfyUI pipeline to output multi-layer EXR files for standard VFX compositing. Links Notes and resources at ocdevel.com/mlg/mla-27 Try a walking desk - stay healthy & sharp while you learn & code Descript - my favorite AI audio/video editor AI Audio Tool Selection Music: Use Suno for complete songs or Udio for high-quality components for professional editing. Sound Effects: Use ElevenLabs' SFX for integrated podcast production or SFX Engine for large, licensed asset libraries for games and film. Voice: ElevenLabs gives the most realistic voice output. Murf.ai offers an all-in-one studio for marketing, and Play.ht has a low-latency API for developers. Open-Source TTS: For local use, StyleTTS 2 generates human-level speech, Coqui's XTTS-v2 is best for voice cloning from minimal input, and Piper TTS is a fast, CPU-friendly option. I. Prosumer Workflow: Viral Video Goal: Rapidly produce branded, short-form video for social media. This method bypasses Veo 3's weaker native "Extend" feature. Toolchain Image Concept: GPT-4o (API: GPT-Image-1) for its strong prompt adherence, text rendering, and conversational refinement. Video Generation: Google Veo 3 for high single-shot quality and integrated ambient audio. Soundtrack: Udio for creating unique, "viral-style" music. Assembly: CapCut for its standard short-form editing features. Workflow Create Character Sheet (GPT-4o): Generate a primary character image with a detailed "locking" prompt, then use conversational follow-ups to create variations (poses, expressions) for visual consistency. Generate Video (Veo 3): Use "High-Quality Chaining." 
Clip 1: Generate an 8s clip from a character sheet image. Extract Final Frame: Save the last frame of Clip 1. Clip 2: Use the extracted frame as the image input for the next clip, using a "this then that" prompt to continue the action. Repeat as needed. Create Music (Udio): Use Manual Mode with structured prompts ([Genre: ...], [Mood: ...]) to generate and extend a music track. Final Edit (CapCut): Assemble clips, layer the Udio track over Veo's ambient audio, add text, and use "Auto Captions." Export in 9:16. II. Indie Filmmaker Workflow: Narrative Shorts Goal: Create cinematic short films with consistent characters and storytelling focus, using a hybrid of specialized tools. Toolchain Visual Foundation: Midjourney V7 to establish character and style with --cref and --sref parameters. Dialogue Scenes: Kling for its superior lip-sync and character realism. B-Roll/Action: Runway Gen-4 for its Director Mode camera controls and Multi-Motion Brush. Voice Generation: ElevenLabs for emotive, high-fidelity voices. Edit & Color: DaVinci Resolve for its integrated edit, color, and VFX suite and favorable cost model. Workflow Create Visual Foundation (Midjourney V7): Generate a "hero" character image. Use its URL with --cref --cw 100 to create consistent character poses and with --sref to replicate the visual style in other shots. Assemble a reference set. Create Dialogue Scenes (ElevenLabs -> Kling): Generate the dialogue track in ElevenLabs and download the audio. In Kling, generate a video of the character from a reference image with their mouth closed. Use Kling's "Lip Sync" feature to apply the ElevenLabs audio to the neutral video for a perfect match. Create B-Roll (Runway Gen-4): Use reference images from Midjourney. Apply precise camera moves with Director Mode or add localized, layered motion to static scenes with the Multi-Motion Brush. Assemble & Grade (DaVinci Resolve): Edit clips and audio on the Edit page. 
On the Color page, use node-based tools to match shots from Kling and Runway, then apply a final creative look. III. Professional Studio Workflow: Full Control Goal: Achieve absolute pixel-level control, actor likeness, and integration into standard VFX pipelines using an open-source, modular approach. Toolchain Core Engine: ComfyUI with Stable Diffusion models (e.g., SD3, FLUX). VFX Compositing: DaVinci Resolve (Fusion page) for node-based, multi-layer EXR compositing. Control Stack & Workflow Train Character LoRA: Train a custom LoRA on a 15-30 image dataset of the actor in ComfyUI to ensure true likeness. Build ComfyUI Node Graph: Construct a generation pipeline in this order: Loaders: Load base model, custom character LoRA, and text prompts (with LoRA trigger word). ControlNet Stack: Chain multiple ControlNets to define structure (e.g., OpenPose for skeleton, Depth map for 3D layout). IPAdapter-FaceID: Use the Plus v2 model as a final reinforcement layer to lock facial identity before animation. AnimateDiff: Apply deterministic camera motion using Motion LoRAs (e.g., v2_lora_PanLeft.ckpt). KSampler -> VAE Decode: Generate the image sequence. Export Multi-Layer EXR: Use a node like mrv2SaveEXRImage to save the output as an EXR sequence (.exr). Configure for a professional pipeline: 32-bit float, linear color space, and PIZ/ZIP lossless compression. This preserves render passes (diffuse, specular, mattes) in a single file. Composite in Fusion: In DaVinci Resolve, import the EXR sequence. Use Fusion's node graph to access individual layers, allowing separate adjustments to elements like color, highlights, and masks before integrating the AI asset into a final shot with a background plate.
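The "High-Quality Chaining" steps in the prosumer workflow above (generate a clip, save its last frame, feed that frame into the next clip) can be sketched as a small loop. This is a minimal sketch under stated assumptions, not Veo 3's API: `generate_clip` and `extract_last_frame` are hypothetical callables you would supply, the `frame_NN.png` naming is made up for illustration, and the notes do not say which tool saves the final frame, so the ffmpeg command here is an assumption.

```python
from typing import Callable, List


def last_frame_cmd(clip_path: str, frame_path: str) -> List[str]:
    """Build an ffmpeg command that saves the final frame of a clip.

    Assumption: the ffmpeg CLI is installed (the notes name no tool).
    -sseof -1 starts reading one second before the end of the input;
    -update 1 keeps overwriting the same image file, so the last decoded
    frame is what remains on disk.
    """
    return ["ffmpeg", "-y", "-sseof", "-1", "-i", clip_path,
            "-update", "1", frame_path]


def chain_clips(first_image: str,
                prompts: List[str],
                generate_clip: Callable[[str, str], str],
                extract_last_frame: Callable[[str, str], None]) -> List[str]:
    """Generate one clip per prompt, seeding each clip with the previous clip's last frame."""
    clips: List[str] = []
    seed_image = first_image  # clip 1 starts from the character sheet image
    for i, prompt in enumerate(prompts):
        # Hypothetical generator: (image path, prompt) -> path of the new clip.
        clip = generate_clip(seed_image, prompt)
        clips.append(clip)
        if i < len(prompts) - 1:
            # Only clips that have a successor need their final frame saved.
            seed_image = f"frame_{i:02d}.png"
            extract_last_frame(clip, seed_image)
    return clips
```

In practice `extract_last_frame` could run `last_frame_cmd(clip, frame)` through `subprocess.run(..., check=True)`, and `generate_clip` would wrap whatever image-to-video interface you use.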

MLA 028 AI Agents

Play Episode Listen Later Feb 22, 2026 37:46

AI agents differ from chatbots by pursuing autonomous goals through the ReAct loop rather than responding to turn-based prompts. While coding agents are currently the most reliable due to verifiable feedback loops, the market is expanding into desktop and browser automation via tools like Claude Cowork and OpenClaw. Links Notes and resources at ocdevel.com/mlg/mla-28 Try a walking desk - stay healthy & sharp while you learn & code Generate a podcast - use my voice to listen to any AI generated content you want Fundamental Definitions Agent vs. Chatbot: Chatbots are turn-based and human-driven. Agents receive objectives and dynamically direct their own processes. The ReAct Loop: Every modern agent uses the cycle: Thought -> Action -> Observation. This interleaved reasoning and tool usage allows agents to update plans and handle exceptions. Performance: Models using agentic loops with self-correction outperform stronger zero-shot models. GPT-3.5 with an agent loop scored 95.1% on HumanEval, while zero-shot GPT-4 scored 67.0%. The Agentic Spectrum Chat: No tools or autonomy. Chat + Tools: Human-driven web search or code execution. Workflows: LLMs used in predefined code paths. The human designs the flow, the AI adds intelligence at specific nodes. Agents: LLMs dynamically choose their own path and tools based on observations. Tool Categories and Market Players Developer Frameworks: Use LangGraph for complex, stateful graphs or CrewAI for role-based multi-agent delegation. OpenAI Agents SDK provides minimalist primitives (Handoffs, Sessions), while the Claude Agent SDK focuses on local computer interaction. Workflow Automation: n8n and Zapier provide low-code interfaces. These are stable for repeatable business tasks but limited by fixed paths and a lack of persistent memory between runs. Coding Agents: Claude Code, Cursor, and GitHub Copilot are the most advanced agents. They succeed because code provides an unambiguous feedback loop (pass/fail) for the ReAct cycle. 
Desktop and Browser Agents: Claude Cowork (released Jan 2026) operates in isolated VMs to produce documents. ChatGPT Atlas is a Chromium-based browser with integrated agent capabilities for web tasks. Autonomous Agents: OpenClaw is an open-source, local system with broad permissions across messaging, file systems, and hardware. While powerful, it carries high security risks, including 512 identified vulnerabilities and potential data exfiltration. Infrastructure and Standards MCP (Model Context Protocol): A universal standard for connecting agents to tools. It has 10,000+ servers and is used by Anthropic, OpenAI, and Google. Future Outlook: By 2028, multi-agent coordination will be the default architecture. Gartner predicts 38% of organizations will utilize AI agents as formal team members, and the developer role will transition primarily to objective specification and output evaluation.
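The Thought -> Action -> Observation cycle described in these notes can be sketched in a few lines. This is a minimal sketch, not any framework's API: the "policy" below is a scripted stand-in for an LLM, and `run_agent`, `scripted_policy`, and the `add` tool are hypothetical names chosen for illustration.

```python
from typing import Callable, Dict, List, Tuple

# (thought, action name, action input). A real agent would get this from an LLM.
Step = Tuple[str, str, str]


def run_agent(goal: str,
              policy: Callable[[List[str]], Step],
              tools: Dict[str, Callable[[str], str]],
              max_steps: int = 5) -> Tuple[str, List[str]]:
    """Loop Thought -> Action -> Observation until the policy finishes or steps run out."""
    transcript = [f"Goal: {goal}"]
    for _ in range(max_steps):
        thought, action, arg = policy(transcript)
        transcript.append(f"Thought: {thought}")
        if action == "finish":
            return arg, transcript
        tool = tools.get(action)
        # An unknown tool becomes an observation the policy can react to.
        observation = tool(arg) if tool else f"error: unknown tool {action!r}"
        transcript.append(f"Action: {action}({arg})")
        transcript.append(f"Observation: {observation}")
    return "stopped: step limit reached", transcript


def scripted_policy(transcript: List[str]) -> Step:
    """Stand-in for the model: call the tool once, then report its result."""
    last = transcript[-1]
    if last.startswith("Goal:"):
        return ("I need the sum first.", "add", "2,3")
    if last.startswith("Observation:"):
        return ("I have the answer.", "finish", last.split(": ", 1)[1])
    return ("Unexpected state.", "finish", "unknown")


def add(arg: str) -> str:
    a, b = (int(x) for x in arg.split(","))
    return str(a + b)


answer, log = run_agent("What is 2 + 3?", scripted_policy, {"add": add})
```

The step limit matters as much as the loop itself: without it, a policy that never emits `finish` would run forever.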

ai google infrastructure react openai generate gpt gartner desktops zapier anthropic cursor chromium vms github copilot handoffs

MLA 029 OpenClaw

Play Episode Listen Later Feb 22, 2026 30:14

OpenClaw is a self-hosted AI agent daemon that executes autonomous tasks through messaging apps like WhatsApp and Telegram using persistent memory. It integrates with Claude Code to enable software development and administrative automation directly from mobile devices. Links Notes and resources at ocdevel.com/mlg/mla-29 Try a walking desk - stay healthy & sharp while you learn & code Generate a podcast - use my voice to listen to any AI generated content you want OpenClaw is a self-hosted AI agent daemon (Node.js, port 18789) that executes autonomous tasks via messaging apps like WhatsApp or Telegram. Developed by Peter Steinberger in November 2025, the project reached 196,000 GitHub stars in three months. Architecture and Persistent Memory Operational Loop: Gateway receives message, loads SOUL.md (personality), USER.md (user context), and MEMORY.md (persistent history), calls LLM for tool execution, streams response, and logs data. Memory System: Compounds context over months. Users should prompt the agent to remember specific preferences to update MEMORY.md. Heartbeats: Proactive cron-style triggers for automated actions, such as 6:30 AM briefings or inbox triage. Skills: 5,705+ community plugins via ClawHub. The agent can author its own skills by reading API documentation and writing TypeScript scripts. Claude Code Integration Mobile to Deploy Workflow: The claude-code-skill bridge provides OpenClaw access to Bash, Read, Edit, and Git tools via Telegram. Agent Teams: claude-team manages multiple workers in isolated git worktrees to perform parallel refactors or issue resolution. Interoperability: Use mcporter to share MCP servers between Claude Code and OpenClaw. Industry Comparisons vs n8n: Use n8n for deterministic, zero-variance pipelines. Use OpenClaw for reasoning and ambiguous natural language tasks. vs Claude Cowork: Cowork is a sandboxed, desktop-only proprietary app. OpenClaw is an open-source, mobile-first, 24/7 daemon with full system access. 
Professional Applications Therapy: Voice to SOAP note transcription. PHI requires local Ollama models due to a lack of encryption at rest in OpenClaw. Marketing: claw-ads for multi-platform ad management, Mixpost for scheduling, and SearXNG for search. Finance: Receipt OCR and Google Drive filing. Requires human review to mitigate non-deterministic LLM errors. Real Estate: Proactive transaction deadline monitoring and memory-driven buyer matching. Security and Operations Hardening: Bind to localhost, set auth tokens, and use Tailscale for remote access. Default settings are unsafe, exposing over 135,000 instances. Injection Defense: Add instructions to SOUL.md to treat external emails and web pages as hostile. Costs: Software is MIT-licensed. API costs are paid per-token or bundled via a Claude subscription key. Onboarding: Run the BOOTSTRAP.md flow immediately after installation to define agent personality before requesting tasks.
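The operational loop above (load SOUL.md, USER.md, and MEMORY.md, then call the LLM) can be illustrated with a small context-assembly sketch. This is not OpenClaw's code, which the notes describe as a Node.js daemon: only the three file names come from the notes, while `build_context`, `remember`, the section headers, and the bullet format in MEMORY.md are assumptions made for illustration.

```python
from pathlib import Path

# File names as given in the notes; the load order mirrors the notes.
CONTEXT_FILES = ("SOUL.md", "USER.md", "MEMORY.md")


def build_context(workspace: Path, message: str) -> str:
    """Concatenate whichever context files exist, followed by the incoming message."""
    parts = []
    for name in CONTEXT_FILES:
        path = workspace / name
        if path.exists():
            body = path.read_text(encoding="utf-8").strip()
            parts.append(f"## {name}\n{body}")
    parts.append(f"## Message\n{message}")
    return "\n\n".join(parts)


def remember(workspace: Path, note: str) -> None:
    """Append a preference to MEMORY.md so later messages include it (hypothetical format)."""
    with (workspace / "MEMORY.md").open("a", encoding="utf-8") as f:
        f.write(f"- {note}\n")
```

Because every message re-reads the files, anything appended by `remember` is visible on the next turn, which is the sense in which memory "compounds" over time.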

MLA 004 AI Job Displacement

Play Episode Listen Later Feb 26, 2026 35:35

AI is already displacing workers in targeted ways - entry-level knowledge workers are being quietly erased from hiring pipelines, freelancers are getting crushed, and the career ladder is being sawed off at the bottom rungs. Yet ML engineer demand has surged 89% with a 3.2:1 talent deficit and $187K median salary. Covers the real displacement data, lessons from the artist bloodbath, the trades escape hatch, the orchestrator treadmill, expert disagreements on timelines, and concrete short- and long-term career moves for ML engineers. Links Notes and resources at ocdevel.com/mlg/mla-4 Try a walking desk - stay healthy & sharp while you learn & code Generate a podcast - use my voice to listen to any AI generated content you want Market Metrics and Displacement Dynamics ML Market: H1 2025 demand rose 89% with a 3.2 to 1 talent deficit. Median salary is $187,500, while Generative AI specialists earn a 40 to 60 percent premium. The "Quiet" Decline: Macro data shows only 4.5% of total layoffs are AI-attributed, but entry-level hiring is collapsing. Stanford/ADP data shows a 13 to 16 percent employment drop for workers aged 22 to 25 in AI-exposed roles since late 2022. UK graduate job postings fell 67%. Corporate Attrition: Salesforce cut 4,000 roles after AI absorbed 30 to 50 percent of workloads. Microsoft cut 15,000 roles as AI began generating 30% of its code. Amazon cut 30,000 jobs while spending $100 billion on AI infrastructure. Sector Analysis: Creative and Trades Illustrators: Jobs in China's gaming sector fell 70% in one year. Clients accept "good enough" work (80% quality) at 5% of the cost. Western freelance graphic design and writing jobs fell 18.5% and 30% respectively within eight months of ChatGPT's launch. Manual Labor: The U.S. construction industry lacks 1.7 million workers annually, but apprenticeships take five years. Humanoid robotics are advancing, with Unitree's R1 priced at $5,900 and Figure AI robots completing 1,250 runtime hours at BMW. 
Full automation is 10 to 15 years away, but partial displacement via smaller crews is closer. The Orchestration Treadmill Obsolescence Speed: Prompt engineering roles went from $375,000 salaries to obsolescence in 24 months. AI coding agents like Claude Code now resolve 72% of medium-complexity GitHub issues autonomously. Fragile Expertise: Replacing junior workers with AI prevents the development of future senior talent. New engineers risk "fragile expertise," directed by tools they cannot debug during novel failure modes. Economic and Expert Outlook Macro Risks: Daron Acemoglu warns of "so-so automation" that cuts costs without raising productivity, predicting only 0.66% growth over ten years. "Ghost GDP" describes AI-inflated accounts that fail to circulate because machines do not consume. Expert Camps: Accelerationists (Anthropic, OpenAI) predict human-level AI by 2027. Skeptics (LeCun, Marcus) argue LLMs are a dead end lacking world models. Pragmatists (Andrew Ng) suggest shifting from implementation to specification as the cost of code nears zero. Tactical Adaptation for ML Engineers Immediate Skills: Master production ML systems, MLOps, LLM evaluation, and safety engineering. Ability to manage deployment risks and hallucination detection is the primary hiring differentiator. Long-term Moats: Focus on "Small AI" (on-device, private), mechanistic interpretability, and deep domain knowledge in healthcare, logistics, or climate science. The Playbook: Optimize for the current three to five year window. Move from being a model builder to a product-focused engineer who understands business tradeoffs and regulatory compliance.

amazon ai uk china western microsoft clients chatgpt economic ability openai bmw covers generate ml github llm displacement median r1 humanoid

MLA 030 AI Job Displacement & ML Careers

Play Episode Listen Later Feb 26, 2026 42:17

ML engineering demand remains high with a 3.2 to 1 job-to-candidate ratio, but entry-level hiring is collapsing as AI automates routine programming and data tasks. Career longevity requires shifting from model training to production operations, deep domain expertise, and mastering AI-augmented workflows before standard implementation becomes a commodity. Links Notes and resources at ocdevel.com/mlg/mla-30 Try a walking desk - stay healthy & sharp while you learn & code Generate a podcast - use my voice to listen to any AI generated content you want Market Data and Displacement ML engineering demand rose 89% in early 2025. Median salary is $187,500, with senior roles reaching $550,000. There are 3.2 open jobs for every qualified candidate. AI-exposed roles for workers aged 22 to 25 declined 13 to 16%, while workers over 30 saw 6 to 12% growth. Professional service job openings dropped 20% year-over-year by January 2025. Microsoft cut 15,000 roles, targeting software engineers, and 30% of its code is now AI-generated. Salesforce reduced support headcount from 9,000 to 5,000 after AI handled 30 to 50% of its workload. Sector Comparisons Creative: Chinese illustrator jobs fell 70% in one year. AI increased output from 1 to 40 scenes per day, crashing commission rates by 90%. Trades: US construction lacks 1.7 million workers. Licensing takes 5 years, and the career fatality risk is 1 in 200. High suicide rates (56 per 100,000) and emerging robotics like the $5,900 Unitree R1 indicate a 10 to 15 year window before automation. Orchestration: Prompt engineering roles paying $375,000 became nearly obsolete in 24 months. Claude Code solves 72% of GitHub issues in under eight minutes. Technical Specialization Priorities Model Ops: Move from training to deployment using vLLM or TensorRT. Set up drift detection and monitoring via MLflow or Weights & Biases. Evaluation: Use DeepEval or RAGAS to test for hallucinations, PII leaks, and adversarial robustness. 
Agentic Workflows: Build multi-step systems with LangGraph or CrewAI. Include human-in-the-loop checkpoints and observability. Optimization: Focus on quantization and distillation for on-device, air-gapped deployment. Domain Expertise: 57.7% of ML postings prefer specialists in healthcare, finance, or climate over generalists. Industry Perspectives Accelerationists (Amodei, Altman): Predict major disruption within 1 to 5 years. Skeptics (LeCun, Marcus): Argue LLMs lack causal reasoning, extending the adoption timeline to 10 to 15 years. Pragmatists (Andrew Ng): Argue that as code gets cheap, the bottleneck shifts from implementation to specification.
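To make "drift detection" in the Model Ops item concrete, here is a minimal sketch of one common drift statistic, the Population Stability Index (PSI). PSI is a swapped-in example: the notes name MLflow and Weights & Biases for monitoring but do not specify a metric, and the function name, bin count, and floor value below are assumptions.

```python
import math
from typing import List, Sequence


def psi(expected: Sequence[float], actual: Sequence[float], bins: int = 10) -> float:
    """Population Stability Index between a reference sample and a live sample."""
    lo, hi = min(expected), max(expected)
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 if all reference values are equal

    def fractions(values: Sequence[float]) -> List[float]:
        counts = [0] * bins
        for v in values:
            idx = int((v - lo) / width)
            idx = min(max(idx, 0), bins - 1)  # clamp live values outside the reference range
            counts[idx] += 1
        # A small floor keeps log() defined when a bin is empty.
        return [max(c / len(values), 1e-6) for c in counts]

    e, a = fractions(expected), fractions(actual)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))


reference = [i / 100 for i in range(100)]   # training-time feature values
shifted = [x + 0.5 for x in reference]      # live values after a shift
stable_score = psi(reference, reference)
drift_score = psi(reference, shifted)
```

A common rule of thumb treats PSI above roughly 0.25 as a significant shift; a score like this is the kind of number you would log per feature to a tracking tool and alert on.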

ai career microsoft professional careers salesforce generate ml github licensing displacement median pii market data ragas mlflow

MLA 026 AI Video Generation: Veo 3 vs Sora, Kling, Runway, Stable Video Diffusion

Play Episode Listen Later Jul 12, 2025 40:39

MLA 025 AI Image Generation: Midjourney vs Stable Diffusion, GPT-4o, Imagen & Firefly

Play Episode Listen Later Jul 9, 2025 72:33

MLG 036 Autoencoders

Play Episode Listen Later May 30, 2025 65:55

MLG 035 Large Language Models 2

Play Episode Listen Later May 8, 2025 45:25

ai tools single chatgpt artificial intelligence clarity stem models react gemini chain machine learning gpt python ml llm prompt rag large language models bayesian

MLG 034 Large Language Models 1

Play Episode Listen Later May 7, 2025 50:48

MLA 024 Code AI MCP Servers, ML Engineering

Play Episode Listen Later Apr 13, 2025 43:38

ai local code engineering encourage machine learning python ml cline playwright anthropic servers mcp cursor postgres jupyter mcps tool use

MLA 023 Code AI Models & Modes

Play Episode Listen Later Apr 13, 2025 37:35

ai tips local code artificial intelligence privacy gemini openai machine learning cline boomerang copilot modes anthropic urls cursor ai models

MLA 022 Code AI Tools

Play Episode Listen Later Feb 9, 2025 46:35

ai video tools code artificial intelligence reddit programming openai api ml cline copilot aider sonnets cursor github copilot windsurf o1

MLG 033 Transformers

Play Episode Listen Later Feb 9, 2025 42:14

ai debate artificial intelligence transformers machine learning data science ml mlp mlps 3blue1brown rnns

MLA 021 Databricks

Play Episode Listen Later Jun 22, 2022 25:45

Discussing Databricks with Ming Chang from Raybeam (part of DEPT®)

machine learning data science databricks

MLA 020 Kubeflow

Play Episode Listen Later Jan 29, 2022 67:57

conversations artificial intelligence machine learning data science ml data scientists kubernetes tensorflow tfx

MLA 019 DevOps

Play Episode Listen Later Jan 13, 2022 74:38

ai artificial intelligence expert chatting machine learning data science devops kubernetes

MLA 018 Descript

Play Episode Listen Later Nov 7, 2021 6:22

ai explore artificial intelligence aka machine learning data science optional descript

MLA 017 AWS Local Development

Play Episode Listen Later Nov 6, 2021 64:04

ai development local developing artificial intelligence machine learning aws data science sagemaker

MLA 016 SageMaker 2

Play Episode Listen Later Nov 5, 2021 59:43

ai artificial intelligence machine learning data science ml sagemaker

MLA 015 SageMaker 1

Play Episode Listen Later Nov 4, 2021 46:46

ai artificial intelligence machine learning data science ml sagemaker

MLA 014 Machine Learning Server

Play Episode Listen Later Jan 18, 2021 51:50

Server-side ML. Training & hosting for inference, with a goal towards serverless. AWS SageMaker, Batch, Lambda, EFS, Cortex.dev

ai training artificial intelligence machine learning data science ml server batch lambda cortex efs

MLA 013 Customer Facing Tech Stack

Play Episode Listen Later Jan 3, 2021 46:54

Client, server, database, etc.

ai clients facing artificial intelligence machine learning data science tech stack

MLA 012 Docker

Play Episode Listen Later Nov 9, 2020 30:58

Use Docker for env setup on localhost & cloud deployment, instead of pyenv / Anaconda. I recommend Windows for your desktop.

ai artificial intelligence windows machine learning data science anaconda docker

032 Cartesian Similarity Metrics

Play Episode Listen Later Nov 8, 2020 42:28

learning ai social straight intelligence artificial metrics grid ml outer preferred similarity cartesian pythagorean euclidean minkowski cosine

MLA 011 Practical Clustering

Play Episode Listen Later Nov 8, 2020 34:08

K-means (sklearn vs FAISS), finding n_clusters via inertia/silhouette, Agglomerative, DBSCAN/HDBSCAN
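Choosing n_clusters via inertia (the "elbow" method) can be sketched without any library. This is a minimal pure-Python sketch, not the episode's code: it uses a deterministic farthest-point initialisation for reproducibility, which is an assumption, and it does not compute silhouette scores. In scikit-learn the equivalents are the `inertia_` attribute of a fitted `KMeans` and `sklearn.metrics.silhouette_score`.

```python
import math
from typing import Dict, List, Sequence, Tuple

Point = Tuple[float, float]


def kmeans_inertia(points: Sequence[Point], k: int, iters: int = 20) -> float:
    """Run a small k-means and return inertia: the sum of squared distances to the nearest centre."""
    # Deterministic farthest-point initialisation (assumption, chosen for reproducibility).
    centers: List[Point] = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    for _ in range(iters):
        # Assignment step: each point joins its nearest centre.
        groups: List[List[Point]] = [[] for _ in centers]
        for p in points:
            nearest = min(range(k), key=lambda i: math.dist(p, centers[i]))
            groups[nearest].append(p)
        # Update step: each centre moves to the mean of its group (unchanged if the group is empty).
        centers = [
            (sum(x for x, _ in g) / len(g), sum(y for _, y in g) / len(g)) if g else c
            for g, c in zip(groups, centers)
        ]
    return sum(min(math.dist(p, c) for c in centers) ** 2 for p in points)


# Three well-separated blobs: inertia drops sharply up to k=3, then flattens.
blobs: List[Point] = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10),
                      (20, 0), (20, 1), (21, 0)]
inertias: Dict[int, float] = {k: kmeans_inertia(blobs, k) for k in range(1, 6)}
```

Plotting `inertias` against k shows the bend at k=3; past that point extra clusters only split already-tight groups, which is the signal the elbow method looks for.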

ai practical artificial intelligence machine learning data science clustering k means

MLA 010 NLP packages: transformers, spaCy, Gensim, NLTK

Play Episode Listen Later Oct 28, 2020 25:33

NLTK: swiss army knife. Gensim: LDA topic modeling, n-grams. spaCy: linguistics. transformers: high-level business NLP tasks.

ai artificial intelligence nlp transformers machine learning data science packages spacy nltk

031 The Podcasts Return

Play Episode Listen Later Oct 28, 2020 7:57

learning ai podcasts intelligence artificial ml github accompanied spacy nltk

MLA 009 Charting tools

Play Episode Listen Later Nov 6, 2018 24:00

matplotlib, Seaborn, Bokeh, D3, Tableau, Power BI, QlikView, Excel

ai tools artificial intelligence excel machine learning data science charting d3 tableau power bi bokeh seaborn qlikview

MLA 008 Exploratory Data Analysis

Play Episode Listen Later Oct 26, 2018 24:23

EDA + charting. DataFrame info/describe, imputing strategies. Useful charts like histograms and correlation matrices.

ai artificial intelligence machine learning data science data analysis exploratory

MLA 007 Jupyter Notebooks

Play Episode Listen Later Oct 16, 2018 16:09

Run your code + visualizations in the browser: IPython / Jupyter Notebooks.

ai artificial intelligence run machine learning data science jupyter notebooks

MLA 006 Salary

Play Episode Listen Later Jul 19, 2018 18:52

Salary based on location, gender, age, tech... from O'Reilly.

ai artificial intelligence machine learning salary data science

MLA 005 Shapes & Sizes

Play Episode Listen Later Jun 9, 2018 26:30

Dimensions, size, and shape of Numpy ndarrays / TensorFlow tensors, and methods for transforming those.

ai artificial intelligence machine learning shapes data science dimensions sizes tensorflow numpy

MLA 004 Study Tips

Play Episode Listen Later May 28, 2018 6:45

Two tips that helped me the most while learning ML.

ai artificial intelligence machine learning data science ml study tips

030 New Series: Machine Learning Applied

Play Episode Listen Later May 24, 2018 5:29

tree machine learning applied new series bayesian

MLA 003 Storage: HDF, Pickle, Postgres

Play Episode Listen Later May 24, 2018 17:02

Comparison of different data storage options when working with your ML models.

ai artificial intelligence comparison machine learning storage data science ml pickle postgres

MLA 002 Numpy & Pandas

Play Episode Listen Later May 24, 2018 17:26

Some numerical data nitty-gritty in Python.

ai artificial intelligence machine learning python data science pandas numpy

MLA 001 Certificates & Degrees

Play Episode Listen Later May 24, 2018 11:20

Reboot of the MLG episode, with more confident recommendations.

ai artificial intelligence reboot machine learning degrees certificates data science mlg

029 Reinforcement Learning Intro

Play Episode Listen Later Feb 5, 2018 42:27

Introduction to reinforcement learning concepts. ocdevel.com/mlg/29 for notes and resources.

learning ai intelligence artificial ml reinforcement learning

028 Hyperparameters 2

Play Episode Listen Later Feb 4, 2018 50:10

Hyperparameters part 2: hyper-search, regularization, SGD optimizers, scaling. ocdevel.com/mlg/28 for notes and resources

learning ai intelligence artificial ml sgd

027 Hyperparameters 1

Play Episode Listen Later Jan 27, 2018 46:09

Hyperparameters part 1: network architecture. ocdevel.com/mlg/27 for notes and resources

learning ai intelligence artificial ml

026 Project Bitcoin Trader

Play Episode Listen Later Jan 26, 2018 38:20

Community project & intro to Bitcoin/crypto + trading. ocdevel.com/mlg/26 for notes and resources

community learning ai project bitcoin intelligence artificial ml trader

025 Convolutional Neural Networks

Play Episode Listen Later Oct 30, 2017 44:21

Convnets or CNNs. Filters, feature maps, window/stride/padding, max-pooling. ocdevel.com/mlg/25 for notes and resources
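How window, stride, and padding determine the size of a feature map can be shown with the standard output-size formula, floor((n + 2p - f) / s) + 1, applied per axis. A minimal sketch follows; the function name and parameter names are my own, and dilation is ignored.

```python
def conv_output_size(n: int, window: int, stride: int = 1, padding: int = 0) -> int:
    """Output length along one axis: floor((n + 2*padding - window) / stride) + 1."""
    if window > n + 2 * padding:
        raise ValueError("window is larger than the padded input")
    return (n + 2 * padding - window) // stride + 1


# A 28-pixel axis with a 3-wide filter, stride 1:
no_pad = conv_output_size(28, window=3)               # shrinks by window - 1
same_pad = conv_output_size(28, window=3, padding=1)  # padding 1 preserves the size
# 2-wide max-pooling with stride 2 halves the axis:
pooled = conv_output_size(28, window=2, stride=2)
```

The same formula covers max-pooling, since pooling slides a window with a stride exactly as a filter does; only the operation inside the window differs.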

learning ai cnn intelligence artificial ml filters convolutional neural networks

024 Tech Stack

Play Episode Listen Later Oct 6, 2017 61:17

TensorFlow, Pandas, Numpy, Scikit-Learn, Keras, TensorForce. ocdevel.com/mlg/24 for notes and resources

learning ai intelligence artificial ml pandas tech stack tensorflow keras numpy scikit learn

023 Deep NLP 2

Play Episode Listen Later Aug 20, 2017 42:45

RNN review, bi-directional RNNs, LSTM & GRU cells. ocdevel.com/mlg/23 for notes and resources

learning ai deep intelligence artificial ml rnn rnns

022 Deep NLP 1

Play Episode Listen Later Jul 28, 2017 49:21

Recurrent Neural Networks (RNNs) and Word2Vec. ocdevel.com/mlg/22 for notes and resources

learning ai deep intelligence artificial ml word2vec

021 New Series: Machine Learning Applied

Play Episode Listen Later Jul 27, 2017 1:50

Introducing a new podcast series on Patreon: Machine Learning Applied. ocdevel.com/mlg/21 for notes and resources

learning ai intelligence artificial machine learning applied ml new series

020 Natural Language Processing 3

Play Episode Listen Later Jul 23, 2017 40:26

Natural Language Processing classical/shallow algorithms. ocdevel.com/mlg/20 for notes and resources

learning ai intelligence artificial ml natural language processing

019 Natural Language Processing 2

Play Episode Listen Later Jul 10, 2017 65:33

Natural Language Processing classical/shallow algorithms. ocdevel.com/mlg/19 for notes and resources

learning ai intelligence artificial ml natural language processing

018 Natural Language Processing 1

Play Episode Listen Later Jun 25, 2017 57:48

Introduction to Natural Language Processing (NLP) topics. ocdevel.com/mlg/18 for notes and resources

learning ai intelligence artificial ml natural language processing natural language processing nlp

017 Checkpoint

Play Episode Listen Later Jun 4, 2017 7:00

Checkpoint - learn the material offline! ocdevel.com/mlg/17 for notes and resources

learning ai intelligence artificial ml checkpoint
