Podcasts about Data lake

System or repository of data stored in its natural/raw format

  • 230 PODCASTS
  • 357 EPISODES
  • 40m AVG DURATION
  • 1 MONTHLY NEW EPISODE
  • May 4, 2025 LATEST

POPULARITY

Popularity by year, 2017-2024 (chart)


Latest podcast episodes about Data lake

The Cloudcast
The Early AI Journey and Learning Curve

The Cloudcast

Play Episode Listen Later May 4, 2025 21:45


As more companies begin to adopt AI into their workforce and day-to-day processes, it will be interesting to watch how their learning curve is spread across knowledge workers.

SHOW: 920
SHOW TRANSCRIPT: The Cloudcast #920 Transcript
SHOW VIDEO: https://youtube.com/@TheCloudcastNET
CLOUD NEWS OF THE WEEK: http://bit.ly/cloudcast-cnotw
CHECK OUT OUR NEW PODCAST: "CLOUDCAST BASICS"

SHOW SPONSORS:
• Cut Enterprise IT Support Costs by 30-50% with US Cloud

SHOW NOTES:
• AI Horseless Carriages (AI user-experiences)
• How will we view an AI agent in the context of humans or "users"?
• The low-hanging fruit, simple on-ramp is the key to early AI adoption
• Google and Microsoft are already showing revenue increases, likely through the productivity apps bundling
• Expect prices to increase slowly, but frequently, as adoption happens and companies get used to the knowledge worker productivity increases (or expectations)
• Curious how knowledge workers are adopting, sharing, and increasing their learning curve
• Sharing still seems to be lacking within the AI tools. Not just sharing of an individual task, but sharing of learning curves, best practices, and datasets
• Is there a dataset collection opportunity? This feels like Big Data or Data Lake 5.0.

FEEDBACK?
Email: show at the cloudcast dot net
Twitter/X: @cloudcastpod
BlueSky: @cloudcastpod.bsky.social
Instagram: @cloudcastpod
TikTok: @cloudcastpod

Digital Health Talks - Changemakers Focused on Fixing Healthcare
AI-Driven Healthcare: Sutter Health's Journey to Scale Clinical Innovation

Digital Health Talks - Changemakers Focused on Fixing Healthcare

Play Episode Listen Later Mar 25, 2025 27:38


Join Kiran Mysore, Chief Data & Analytics Officer at Sutter Health, as he shares insights on scaling AI adoption, building sustainable innovation infrastructure, and transforming healthcare delivery through data-driven approaches. Learn how one of the nation's largest health systems is successfully integrating advanced analytics and AI into clinical practice while maintaining governance and ethical standards.

• Real-world implementation of clinical AI at scale
• Building sustainable innovation infrastructure
• Data strategy for improved patient and provider experiences
• Governance frameworks for responsible AI adoption
• Implementation and impact, from digital scribes to diagnostics

Kiran Mysore, Chief Data & Analytics Officer at Sutter Health
Shahid Shah, Chairman of the Board, Netspective Foundation

So klingt Wirtschaft
Innovation durch Daten: Warum das Mindset entscheidend ist

So klingt Wirtschaft

Play Episode Listen Later Feb 12, 2025 16:24 Transcription Available


Companies need to change how they handle data and AI: away from goal-driven thinking and toward data-driven curiosity. Niels Strohkirch of Fujitsu explains how it's done.

Steady Lads
What Is Data Lake? (Deep Dive On LAKE Token)

Steady Lads

Play Episode Listen Later Feb 8, 2025 26:24


In this episode I talk to Oliver Slapal, co-founder of Data Lake. We do a deep dive into Data Lake: what it is, their new token launchpad Lakedotfun, the LAKE token and how it works, as well as a ton more.

THE OBSIDIAN COUNCIL PREMIUM MEMBERSHIP

Run The Numbers
The Analytics Escalator: Unlocking Value in Finance with Mambu CFO Jesper Sorensen

Run The Numbers

Play Episode Listen Later Jan 23, 2025 51:49


CJ delves deep into the world of analytics in this interview with Jesper Sorensen, the CFO of Mambu, a leading fintech and banking platform in Europe valued at over $5 billion. Jesper, who has authored three books on analytics, introduces the Analytics Escalator—a framework for unlocking real value through descriptive, diagnostic, predictive, and prescriptive analytics. In the discussion, Jesper covers everything from getting started with analytics and deciding where it should sit within an organization to identifying key opportunities, building a solid BI tech stack, and understanding the role of analytics tools and data lakes. He outlines the journey to creating a high-impact analytics function within a company, emphasizing the critical interplay between people, processes, and systems in fostering a thriving analytics culture—and shares practical advice on how to achieve it.

SPONSORS:
RightRev automates the revenue recognition process from end to end, gives you real-time insights, and ensures ASC 606 / IFRS 15 compliance—all while closing books faster. Whether it's multi-element arrangements, subscription renewals, or complex usage-based contracts, RightRev takes care of it all. That means fewer spreadsheets, fewer errors, and more time for your team to focus on growth. For modern revenue recognition simplified, visit rightrev.com and schedule a demo.
Brex offers the world's smartest corporate card on a full-stack global platform that is everything CFOs need to manage their finances on an elite level. Plus they offer modern banking and treasury as well as intuitive expenses and accounting automation, bill pay, and travel. Brex makes it easy to control spend before it happens, automate annoying tasks, and optimize your finances. Find out how Brex can help you make every dollar count at brex.com/metrics.
Planful is a financial performance management platform designed to streamline financial tasks for businesses. It helps with budgeting, closing the books, and financial reporting, all on a cloud-based platform. By improving the efficiency and accuracy of these processes, Planful allows businesses to make better financial decisions. Find out more at www.planful.com/metrics.
Vanta's trust management platform takes the manual work out of your security and compliance process and replaces it with continuous automation. Over 9,000 businesses use it to automate compliance needs across over 35 frameworks like SOC 2 and ISO 27001. Centralize security workflows, complete questionnaires up to five times faster, and proactively manage vendor risk. For a limited time, get $1,000 off of Vanta at vanta.com/metrics.

FOLLOW US ON X: @cjgustafson222 (CJ)

TIMESTAMPS:
(00:00) Preview and Intro
(02:01) Sponsor – RightRev | Brex
(04:53) Jesper's Pre-CFO Career
(06:50) The Huge Success of Mambu
(10:40) Understanding What Analytics Is
(13:56) Sponsor – Planful | Vanta
(16:02) Where Analytics Should Sit in the Org
(20:12) Model for Serving a Team's BI Needs Internally
(22:03) Creating Value with the Analytics Escalator
(29:51) Prescriptive Analytics and How to Achieve Them
(31:08) The Steps on the Analytics Escalator
(33:36) Establishing People, Processes, and Systems
(34:54) Self-Service Versus Customer Service BI
(37:57) The Components of a Good BI Tech Stack
(38:50) Data Visualization Tools Versus Analytics Tools
(41:59) The Value of a Data Lake
(44:26) Build-Versus-Buy for Analytics Tools
(46:50) The Need for a Proof-of-Concept for Analytics Tools
(48:10) Managing the Trade-Off Between Performance and Cost
(50:19) Wrap

Get full access to Mostly metrics at www.mostlymetrics.com/subscribe

Identity At The Center
#325 - Theorycrafting Modern Identity Architecture with Ian Glazer

Identity At The Center

Play Episode Listen Later Jan 13, 2025 69:17


Welcome to the Identity at the Center podcast! In this episode, hosts Jeff and Jim dive deep into modern identity architecture with guest Ian Glazer. They discuss topics such as the importance of policy, data orchestration, and the evolving landscape of identity and access management (IAM). Ian shares his thoughts on the future of IAM, the integration of various data sources, the role of events in IAM, and the potential for real-time identity solutions. They also touch on upcoming conferences, the European Identity and Cloud Conference 2025, and the significance of engaging with the identity community. Tune in for a thought-provoking discussion on the advancements and future directions of digital identity!

Chapters
00:00 Introduction and Podcast Overview
00:11 Upcoming Plans and Challenges
01:03 Guest Invitation and Podcast Dynamics
03:31 Conference Announcements and Discounts
06:05 Welcoming the Guest: Ian Glazer
06:46 Fido Feud and Conference Experiences
16:29 Identity Market Trends and Innovations
19:19 Modern Identity Architectures
33:51 Identity First Security: A New Approach
34:50 Unified Data Tiers: Breaking Down Silos
36:14 Modern IAM: Opportunities and Challenges
37:02 Ephemeral Access and Zero Standing Privilege
39:18 Understanding Identity Data
41:30 Workforce Identity Data Platforms
47:14 Orchestration and Execution in IAM
51:09 Real-Time Event-Based Identity Systems
54:45 Future Directions and Community Engagement
59:03 Teaching and Sharing Knowledge
01:05:33 Closing Thoughts and Recommendations

Connect with Ian: https://www.linkedin.com/in/iglazer/
Notional architecture for modern IAM: Part 3 of 4 (blog): https://weaveidentity.com/blog/notional-architecture-for-modern-iam/
2025: The year we free our IAM data: https://weaveidentity.com/blog/2025-the-year-we-free-our-iam-data/
Learn more about Weave Identity: https://weaveidentity.com/
Digital Identity Advancement Foundation: https://digitalidadvancement.org/
Avoid the Noid! - https://en.wikipedia.org/wiki/The_Noid

Connect with us on LinkedIn:
Jim McDonald: https://www.linkedin.com/in/jimmcdonaldpmp/
Jeff Steadman: https://www.linkedin.com/in/jeffsteadman/

Visit the show on the web at http://idacpodcast.com

Keywords: IDAC, Identity at the Center, Jeff Steadman, Jim McDonald, Ian Glazer, Weave Identity, Identity and Access Management, IAM, Modern Identity Architectures, Modern IAM, Data Tier, Events, Orchestration, Zero Trust, ZTNA, Shared Signals Framework, EIC, Gartner, Black Hat, RSA, Identibeer, Data Lake, OIDs, IANS

SQL Data Partners Podcast
Episode 283: Data Lakehouse vs Data Warehouse vs My House

SQL Data Partners Podcast

Play Episode Listen Later Jan 2, 2025 48:59


Microsoft Fabric offers two enterprise-scale, open-standard format workloads for data storage: Warehouse and Lakehouse. Which service should you choose? In this episode, we dive into the technical components of OneLake, along with some of the decisions you'll be asked to make as you start to build out your data infrastructure. These are two good articles we mention in the podcast that could help inform your decision on the services to implement in your OneLake:

• Microsoft Fabric Decision Guide: Choose between Warehouse and Lakehouse - Microsoft Fabric | Microsoft Learn
• Lakehouse vs Data Warehouse vs Real-Time Analytics/KQL Database: Deep Dive into Use Cases, Differences, and Architecture Designs | Microsoft Fabric Blog | Microsoft Fabric

We hope you enjoyed this conversation on the nuances of data storage within Microsoft OneLake! If you have questions or comments, please send them our way. We would love to answer your questions on a future episode. Leave us a comment and some love ❤️ on LinkedIn, X, Facebook, or Instagram. The show notes for today's episode can be found at Episode 283: Data Lakehouse vs Data Warehouse vs My House. Have fun on the SQL Trail!

In Numbers We Trust - Der Data Science Podcast
#61: Technologische Must-Haves: Unser Survival-Guide für Data-Science-Projekte

In Numbers We Trust - Der Data Science Podcast

Play Episode Listen Later Dec 5, 2024 42:04


In summary, our must-haves:
• Database / DWH
• A data visualization solution
• A way to develop without friction (locally or in the web)
• Version control / CI/CD
• A deployment solution
• Separation of development and production environments
• Monitoring for models & resources

Related podcast episodes:
• Episode #2: Success factors for predictive analytics projects
• Episode #5: Data Warehouse vs. Data Lake vs. Data Mesh
• Episode #20: Is Continuous Integration (CI) a must for data scientists?
• Episode #21: Machine Learning Operations (MLOps)
• Episode #29: Spoiled for choice: data science platform vs. customized stack
• Episode #35: Success factors for machine learning projects, with Philipp Jackmuth of dida
• Episode #43: So things don't blow up in production: avoiding overfitting & data leakage
• Episode #54: Model deployment: how do I get my model into production?

Technologies & tools:
• Data visualization: Azure Databricks, AWS Quicksight, Redash
• Development environment: VSCode, INWT Python IDE V2, Remote Explorer, Pycharm
• Version control: GitHub, GitLab, Azure DevOps
• CI/CD: GitHub Actions, GitLab CI, Jenkins
• Deployment: Kubernetes, Docker, Helm, ArgoCD
• Experiment tracking: MLFlow, DVC, Tensorboard (see the short sketch after this list)
• Monitoring: Prometheus, Grafana, AWS Cloudwatch
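
One of the must-haves above is experiment and model tracking. As a minimal, hypothetical sketch of what that looks like in practice with MLflow (one of the tools named in the list), the snippet below logs a parameter and a metric for a single training run; the experiment name and values are placeholders, not taken from the episode.

```python
# pip install mlflow  -- experiment tracking, one of the "must-have" capabilities above
import mlflow

# Hypothetical experiment name and values, for illustration only.
mlflow.set_experiment("churn-model")

with mlflow.start_run(run_name="baseline"):
    mlflow.log_param("n_estimators", 200)   # a training configuration value
    mlflow.log_metric("auc", 0.87)          # an evaluation result for this run
```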

Project Geospatial
FOSS4G NA 2024 - Searching the Spatial Data Lake: Bring GeoParquet to Apache Lucene - Wes Richardet

Project Geospatial

Play Episode Listen Later Oct 28, 2024 25:32


Wes Richardet's talk at FOSS4G NA 2024 focuses on improving search capabilities within spatial data lakes using GeoParquet and Apache Lucene. He discusses the evolution of data storage, the need for efficient search solutions, and the integration of different technologies to enhance performance.

Dynamics Update
Interview - Scott Sewell JJ Yadav - SynapseLink and Fabric

Dynamics Update

Play Episode Listen Later Oct 11, 2024 50:59


Hi all! In this episode, we have the pleasure of speaking with Scott Sewell, Principal Program Manager, and JJ Yadav, Principal Solution Architect, from Microsoft. Our discussion centers around Link to Fabric and Synapse Link. We explore how these technologies should be integrated when implementing D365FO and how customers currently using Export to Data Lake or BYOD can benefit from these new solutions. Additionally, we delve into the scenarios where each solution is most appropriate.

Link to the Training/Workshop mentioned in the episode: https://github.com/microsoft/Dynamics-365-FastTrack-Implementation-Assets/tree/master/Analytics/DataverseLink/FabricWorkshop

Gustav and Johan

The MSDW Podcast
Community Summit 2024 Preview with Aqueducts Consulting

The MSDW Podcast

Play Episode Listen Later Oct 3, 2024 22:28


MSDW is previewing Community Summit North America 2024 with a new series of quick podcast episodes featuring exhibitors. In this episode, we speak with Joe Christensen, founder of Aqueducts Consulting. The team at Aqueducts thinks a lot about how to help Dynamics 365 F&O and AX customers find success in their reporting and data management efforts, Joe tells us. He has seen plenty of pain and frustration among customers over the years, and there are many good reasons why, from the technology to project planning to user adoption and organizational change. Reporting challenges often come down to a solution's reliability, Joe explains. When there's no reliability, a company's reporting system often reverts to becoming little more than key people and tools, rather than something designed to deliver reports to everyone. We also discuss the evolution of the Dynamics 365 F&O data strategy, first with the introduction of Export to Data Lake and now Synapse Link. The Aqueducts Consulting booth at Summit will give attendees a unique "data factory" experience, and Joe explains what that will look like, blending concepts, technology, and operational process.

More information:
• See the Aqueducts Consulting team at Booth 1817
• Partner Solution Showcase: Tuesday October 15th, 10:45-11:45 AM. Come by to cover reporting out of D365 F&O. Hear from your peers on how they designed their reporting out of F&O, with a special focus on Synapse Link and the lakehouse/data warehouse approaches. And ask me about Report Factory! It's our new way of making sure your team can survive when any of your key people leave the BI / reporting team.
• Demo Zone: Wednesday Oct 16th, 4:45-4:55 PM. Where we run through how to get the most progress on F&O reporting, from 14 years of reporting in the field.

Connect with Joe on LinkedIn: https://www.linkedin.com/in/joe-christensen-aqc/

BVL.digital Podcast
#230: Wie KI die Rolle der Supply Chain Planer verändert (Prof. Dr. Kai Hoberg, Kühne Logistics University)

BVL.digital Podcast

Play Episode Listen Later Oct 2, 2024 48:05


Prof. Dr. Kai Hoberg is Professor of Supply Chain and Operations Strategy at Kühne Logistics University in Hamburg and is currently focused on the question of how artificial intelligence can be used in supply chain planning and how AI will change the role of supply chain planners in the future. That is exactly what our host Boris Felgendreher discusses with Kai Hoberg in this episode of the BVL Podcast. Among other things, they cover the following topics:
- The current state of AI and automation in supply chain planning
- The evolution of the planning systems landscape and the emergence of new AI-driven players
- The challenges traditional software vendors like SAP face in the race to integrate AI
- The importance of data and the need for a "data lake" for training AI systems
- The success of AI in supply chain planning compared to other technologies such as blockchain
- Concrete examples of AI applications in supply chain planning, including demand forecasting and automated ordering
- The need for interaction between humans and AI in supply chain planning
- The importance of "upskilling" for supply chain planners in the age of AI
- The challenges arising from the "black box" nature of many AI systems
- The importance of customizing AI systems with industry-specific data
- The need to avoid "AI washing" and to evaluate solutions based on their value rather than their AI content
- The potential of large language models (LLMs) and generative AI in supply chain planning
- Practical advice for supply chain planners on how to get the most out of AI solutions
- The idea of integrating gamification elements into interactions with AI systems
- and much more

Helpful links:
Kühne Logistics University: https://www.klu.org/faculty-research/faculty/resident-faculty/kai-hoberg
Prof. Kai Hoberg on LinkedIn: https://www.linkedin.com/in/kai-hoberg/
Boris Felgendreher on LinkedIn: https://www.linkedin.com/in/borisfelgendreher/
BVL Supply Chain CX: https://www.bvl.de/cx

The Ravit Show
Enterprise Data & AI, Data Lake and much more!

The Ravit Show

Play Episode Listen Later Oct 2, 2024 7:56


Why should you build a Data Lake? I had a fantastic conversation with Olegs Kosels, Enterprise Data Architect at Jamf, on The Ravit Show. We discussed some key topics around Enterprise Data and AI, focusing on the practical challenges and opportunities. Olegs shared insights on how he built a Data Lake and how Jamf is using Dremio to streamline their data processes. We also discussed the overall data journey at Jamf and Olegs' thoughts on the future of Data and AI. Stay tuned for more such interviews from Big Data London! #data #ai #bigdatalondon2024 #theravitshow

Microsoft Mechanics Podcast
Generative AI with Microsoft Fabric

Microsoft Mechanics Podcast

Play Episode Listen Later Aug 15, 2024 2:52


Microsoft Fabric seamlessly integrates with generative AI to enhance data-driven decision-making across your organization. It unifies data management and analysis, allowing for real-time insights and actions. With Real Time Intelligence, keeping grounding data for large language models (LLMs) up-to-date is simplified. This ensures that generative AI responses are based on the most current information, enhancing the relevance and accuracy of outputs. Microsoft Fabric also infuses generative AI experiences throughout its platform, with tools like Copilot in Fabric and Azure AI Studio enabling easy connection of unified data to sophisticated AI models.   ► QUICK LINKS: 00:00 - Unify data with Microsoft Fabric 00:35 - Unified data storage & real-time analysis 01:08 - Security with Microsoft Purview 01:25 - Real-Time Intelligence 02:05 - Integration with Azure AI Studio   ► Link References This is Part 3 of 3 in our series on leveraging generative AI. Watch our playlist at https://aka.ms/GenAIwithAzureDBs   ► Unfamiliar with Microsoft Mechanics?  As Microsoft's official video series for IT, you can watch and share valuable content and demos of current and upcoming tech from the people who build it at Microsoft. • Subscribe to our YouTube: https://www.youtube.com/c/MicrosoftMechanicsSeries • Talk with other IT Pros, join us on the Microsoft Tech Community: https://techcommunity.microsoft.com/t5/microsoft-mechanics-blog/bg-p/MicrosoftMechanicsBlog • Watch or listen from anywhere, subscribe to our podcast: https://microsoftmechanics.libsyn.com/podcast   ► Keep getting this insider knowledge, join us on social: • Follow us on Twitter: https://twitter.com/MSFTMechanics  • Share knowledge on LinkedIn: https://www.linkedin.com/company/microsoft-mechanics/ • Enjoy us on Instagram: https://www.instagram.com/msftmechanics/ • Loosen up with us on TikTok: https://www.tiktok.com/@msftmechanics

Oracle University Podcast
Database Essentials

Oracle University Podcast

Play Episode Listen Later Jul 23, 2024 12:24


Join hosts Lois Houston and Nikita Abraham, along with Hope Fisher, Oracle's Product Manager for Database Technologies, as they break down the basics of databases, explore different database management systems, and delve into database development.   Whether you're a newcomer or just need a refresher, this quick, informative episode is sure to offer you some valuable insights.   Oracle MyLearn: https://mylearn.oracle.com/ou/course/database-essentials/133032/ Oracle University Learning Community: https://education.oracle.com/ou-community LinkedIn: https://www.linkedin.com/showcase/oracle-university/ X: https://twitter.com/Oracle_Edu   Special thanks to Arijit Ghosh, David Wright, Radhika Banka, and the OU Studio Team for helping us create this episode.   --------------------------------------------------------   Episode Transcript:   00:00 Welcome to the Oracle University Podcast, the first stop on your cloud journey. During this series of informative podcasts, we'll bring you foundational training on the most popular Oracle technologies. Let's get started! 00:26 Nikita: Hello and welcome to the Oracle University Podcast. I'm Nikita Abraham, Principal Technical Editor with Oracle University, and with me is Lois Houston, Director of Innovation Programs. Lois: Hi there! For the last seven weeks, we've been exploring the world of OCI Container Engine for Kubernetes with our senior instructor Mahendra Mehra. We covered key aspects of OKE to help you create, manage, and optimize Kubernetes clusters in Oracle Cloud Infrastructure. So, be sure you check out those episodes if you're interested in Kubernetes. 01:00 Nikita: Today, we're doing something a little different. We've had a lot of episodes on different aspects of Oracle Database, but what if you're just getting started in this world? We wanted you to have something that you could listen to as well. And so we have Hope Fisher with us today. Hope is a Product Manager for Database Technologies at Oracle, and we're going to ask her to take us through the basics of database, the different database management systems, and database development.  Lois: Hi Hope! Thanks for joining us for this episode. Before we dive straight into terminologies and concepts, I want to take a step back and really get down to the basics. We sometimes use the terms data and information interchangeably, but they're not the same, right? 01:43 Hope: Data is raw material or a set of facts and observations. Information is the meaning derived from the facts. The difference between data and information can be explained by using an example, such as test scores. In one class, if every student receives a numbered score and the scores can be calculated to determine a class average, the class average can be calculated to determine the school average. So in this scenario, each student's test score is one piece of data. And information is the class's average score or the school's average score. There is no value in data until you actually do something with it. 02:24 Nikita: Right, so then how do we make all this data useful? Do we create a database system?  Hope: A database system provides a simple function—treat data as a collection of information, organize it, and make the data usable by providing easy access to it and giving you a place where that data can be stored. Every organization needs to collect and maintain data to meet its requirements. Most organizations today use a database to automate their information systems. 
An information system can be defined as a formal system for storing and processing data. A database is an organized collection of data put together as a unit. The rationale of a database is to collect, store, and retrieve related data for use by database applications. A database application is a software program that interacts with the database to access and manipulate data. A database is usually managed by a Database Administrator, also known as a DBA. 03:25 Nikita: Hope, give us some examples of database systems. Hope: Popular examples of database systems include Oracle Database, MySQL, which is also owned by Oracle, Microsoft SQL server, Postgres, and others. There are relational database management systems. The acronym is DBMS. Some of the strengths of a DBMS include flexibility and scalability. Given the huge amounts of information that modern businesses need to handle, these are important factors to consider when surveying different types of databases. 03:59 Lois: This may seem a little bit silly, but why not just use spreadsheets, Hope? Why use databases? Hope: The easy answer is that spreadsheets are designed for specific problems, relatively small amounts of data and individual users. Databases are designed for lots of data, shared information use, and complex data analysis. Spreadsheets are typically used for specific problems or small amounts of data. Individual users generally use spreadsheets. In a database, cells contain records that come from external tables. Databases are designed for lots of data. They are intended to be shared and used for more complex data analysis. They need to be scalable, secure, and available to many users. This differentiation means that spreadsheets are static documents, while databases can be relational. 04:51 Nikita: Hope, what are some common database applications?  Hope: Database applications are used in far and wide use cases that most commonly can be grouped into three areas. Applications that run companies called enterprise applications. Enterprise applications are designed to integrate computer systems that run all phases of an enterprise's operations to facilitate cooperation and coordination of work across the enterprise. The intent is to integrate core business processes, like sales, accounting, finance, human resources, inventory, and manufacturing. Applications that do something very specific, like healthcare applications-- specialized software is software that's written for a specific task rather than for a broad application area.  And then there are also applications that are used to examine data and turn it into information, like a data warehouse, analytics, and data lake. 05:54 Lois: We've spoken about data lakes before. But since this is an episode about the basics of database, can you briefly tell us what a data lake is? Hope: A data lake is a place to store your structured and unstructured data as well as a method for organizing large volumes of highly diverse data from diverse sources. Data lakes are becoming increasingly important as people, especially in businesses and technology, want to perform broad data exploration and discovery. Bringing data together into a single place or most of it into a single place makes that simpler. 06:29 Nikita: Thanks for that, Hope. So, what kind of organizations use databases? And, who within these organizations uses databases the most? Hope: Almost every enterprise uses databases. Enterprises use databases for a variety of reasons and in a variety of ways. 
Data and databases are part of almost any process of the enterprise. Data is being collected to help solve business needs and drive value. Many people in an organization work with databases. These include the application developers who create applications that support and drive the business. The database administrator or DBA maintains and updates the database. And the end user uses the data as needed. 07:19 Do you want to stay ahead of the curve in the ever-evolving AI  landscape? Look no further than our brand-new OCI Generative AI Professional course and certification. For a limited time only, we're offering both the course and certification for free. So, don't miss out on this exclusive opportunity to get certified on Generative AI at no cost. Act fast because this offer is valid only until July 31, 2024. Visit https://education.oracle.com/genai to get started. That's https://education.oracle.com/genai. 07:57 Nikita: Welcome back. Now that we've discussed foundational database concepts, I want to move on to database management systems. Take us through what a database management system is, Hope. Hope: A Database Management System, DBMS, has the following elements. The kernel code manages memory and storage for the DBMS. The repository of metadata is called a data dictionary. The query language enables applications to access the data. Oracle database functions include data definitions, storage, structure, and security. Additional functionality also provides for user access control, backup and recovery, integrity, and communications. There are many different database types and management systems. The most common is the relational database management system. 08:51 Nikita: And how do relational databases store data?  Hope: Essentially and very simplistically, there are key elements of the relational database. Database table containing rows and columns; the data in the table, which is stored a row at a time; and the columns which contain attributes or related information. And then the different tables in a database relate to one another and share a column. 09:17 Lois: Customers usually have a mix of applications and data structures, and ideally, they should be able to implement a data management strategy that effectively uses all of their data in applications, right? How does Oracle approach this?  Hope: Oracle's approach to this enterprise data management strategy and architecture is converged database to all different data types and workloads. The converged database is a database that has native support for all modern data types and, of course, traditional relational data.  By providing support for all of these data types, a converged database can run all sorts of workloads, from transaction processing to analytics and machine learning to blockchain to support the applications and systems. Oracle provides a single database engine that supports all data models, process types, and development environments. It also addresses many kinds of workloads against the same data sets. And there's no need to use dozens of specialized databases. Deploying several single-purpose databases would increase costs, complexity, and risk. 10:25 Nikita: In the final part of our conversation today, I want to bring up database development. Hope, how are databases developed?  Hope: Data modeling is the first part of the database development process. Conceptual data modeling is the examination of a business and business data to determine the structure of business information and the rules that govern it. 
This structure forms the basis for database design. A conceptual model is relatively stable over long periods of time. Physical data modeling, or database building, is concerned with implementation in each technical software and hardware environment. The physical implementation is highly dependent on the current state of technology and is subject to change as available technologies rapidly change. Conceptual model captures the functional and informational needs of a business and is used to identify important entities and their relationships.  A logical model includes the entities and relationships. This is also called an entity relationship model and provides the details of the relationships.  11:34 Lois: I think that's a good place to wrap up our episode. To know more about the Oracle Database architecture, offerings, and so on, visit mylearn.oracle.com. Thanks for joining us today, Hope.  Nikita: Join us next week for another episode of the Oracle University Podcast. Until then, this is Nikita Abraham… Lois: And Lois Houston, signing off! 11:55 That's all for this episode of the Oracle University Podcast. If you enjoyed listening, please click Subscribe to get all the latest episodes. We'd also love it if you would take a moment to rate and review us on your podcast app. See you again on the next episode of the Oracle University Podcast.
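
To make Hope's description of the relational model concrete, here is a minimal sketch using Python's built-in sqlite3 module: two hypothetical tables that relate through a shared student_id column, plus a query that turns raw data (individual test scores) into information (a per-class average), echoing her data-versus-information example. The table and column names are illustrative assumptions, not taken from the episode or from any Oracle product.

```python
import sqlite3

# In-memory database; the schema and data below are illustrative only.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Two related tables: scores.student_id is the shared column that
# links each score back to a row in students.
cur.execute("CREATE TABLE students (student_id INTEGER PRIMARY KEY, name TEXT, class TEXT)")
cur.execute("CREATE TABLE scores (student_id INTEGER, test TEXT, score REAL)")

cur.executemany("INSERT INTO students VALUES (?, ?, ?)", [
    (1, "Ada", "5A"), (2, "Ben", "5A"), (3, "Cleo", "5B"),
])
cur.executemany("INSERT INTO scores VALUES (?, ?, ?)", [
    (1, "math", 91.0), (2, "math", 78.0), (3, "math", 85.0),
])

# Each individual score is a piece of data; the per-class average is
# information derived from that data.
cur.execute("""
    SELECT st.class, AVG(sc.score) AS class_average
    FROM students st JOIN scores sc ON st.student_id = sc.student_id
    GROUP BY st.class
""")
print(cur.fetchall())   # e.g. [('5A', 84.5), ('5B', 85.0)]
conn.close()
```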

The Cloud Pod
265: Swing and a WIF

The Cloud Pod

Play Episode Listen Later Jun 28, 2024 39:48


Welcome to episode 265 of the Cloud Pod Podcast – where the forecast is always cloudy! Justin and Matthew are with you this week, and even though it's a light news week, you're definitely going to want to stick around. We're looking forward to FinOps, talking about updates to Consul, WIF coming to Vault 1.17, and giving an intro to Databricks LakeFlow. Because we needed another lake product. Be sure to stick around for this week's Cloud Journey series too.

Titles we almost went with this week:
• The CloudPod lets the DataLake flow
• Amazon attempts an international incident in Taiwan
• What's your Vector Mysql?

A big thanks to this week's sponsor: We're sponsorless! Want to reach a dedicated audience of cloud engineers? Send us an email, or hit us up on our Slack Channel and let's chat!

General News

01:40 Consul 1.19 improves Kubernetes workflows, snapshot support, and Nomad integration
• Consul 1.19 is now generally available, improving the user experience, providing flexibility and enhancing integration points.
• Consul 1.19 introduces a new registration custom resource definition (CRD) that simplifies the process of registering external services into the mesh.
• Consul service mesh already supports routing to services outside of the mesh through terminating gateways. However, there are advantages to using the new Registration CRD.
• Consul snapshots can now be stored in multiple destinations; previously, you could only snapshot to a local path or to a remote object store destination, but not both. Now you can take a snapshot to NFS mounts, SAN-attached storage, or object storage.
• Consul API gateways can now be deployed on Nomad, combined with transparent proxy and enterprise features like admin partitions.

01:37 Matthew - "What I was surprised about, which I did not know, was that the Consul API gateway can now be deployed on Nomad. Was it not able to be deployed before? Just feels weird… you know, Consul should be able to be deployed on Nomad compared to that. You know, it's all the same company, but sometimes team A doesn't always talk to team B."

03:21 Vault 1.17 brings WIF, EST support for PKI, and more
• Vault 1.17 is now generally available with new secure workflows, better performance and improved secrets management scalability.
• Key new features: Workload Identity Federation (WIF) allows you to eliminate concerns around providing security credentials to Vault plugins. Using the new support for WIF, a trust relationship can be established between an external system and Vault.

The Datanation Podcast - Podcast for Data Engineers, Analysts and Scientists
60 – Interoperability of Data Lake Table Format (Apache Iceberg, Apache Hudi, Delta Lake)

The Datanation Podcast - Podcast for Data Engineers, Analysts and Scientists

Play Episode Listen Later Jun 28, 2024


Alex Merced discusses where interoperability tools like Apache XTable and Uniform fit in when working across data lake table formats such as Apache Iceberg, Apache Hudi, and Delta Lake.

Tech Lead Journal
#175 - How to Solve Real-World Data Analysis Problems - David Asboth

Tech Lead Journal

Play Episode Listen Later May 20, 2024 57:10


“All data scientists and analysts should spend more time in the business, outside the data sets, just to see how the actual business works. Because then you have the context, and then you understand the columns you're seeing in the data."

David Asboth, author of “Solve Any Data Analysis Problem” and co-host of the “Half Stack Data Science” podcast, shares practical tips for solving real-world data analysis challenges. He highlights the gap between academic training and industry demands, emphasizing the importance of understanding the business problem and maintaining a results-driven approach. David offers practical insights on data dictionaries, data modeling, data cleaning, data lakes, and prediction analysis. We also explore AI's impact on data analysis and the importance of critical thinking when leveraging AI solutions. Tune in to level up your skills and become an indispensable, results-driven data analyst.

Listen out for:
Career Journey - [00:01:38]
Half Stack Data Science Podcast - [00:06:33]
Real-World Data Analysis Gaps - [00:10:46]
Understanding the Business/Problem - [00:15:36]
Result-Driven Data Analysis - [00:18:28]
Feedback Iteration - [00:21:44]
Data Dictionary - [00:23:48]
Data Modeling - [00:27:18]
Data Cleaning - [00:30:43]
Data Lake - [00:35:05]
Common Data Analysis Tasks - [00:36:50]
Prediction Analysis - [00:40:23]
The Impact of AI on Data Analysis - [00:43:15]
Importance of Critical Thinking - [00:47:05]
Common Tasks Solved by AI - [00:50:07]
3 Tech Lead Wisdom - [00:53:10]

David Asboth's Bio
David is a “data generalist”; currently a freelance data consultant and educator with an MSc. in Data Science and a background in software and web development. With over 6 years of experience teaching, he has taught everyone from junior analysts up to C-level executives in industries like banking and management consulting about how to successfully apply data science, machine learning, and AI to their day-to-day roles. He co-hosts the Half Stack Data Science podcast about data science in the real world and is the author of Solve Any Data Analysis Problem, a book about the data skills that aspiring analysts actually need in their jobs, which will be published by Manning in 2024.

Follow David:
LinkedIn – linkedin.com/in/david-asboth-9256772
Website – davidasboth.com
Podcast – halfstackdatascience.com

Our Sponsors
Manning Publications is a premier publisher of technical books on computer and software development topics for both experienced developers and new learners alike. Manning prides itself on being independently owned and operated, and for paving the way for innovative initiatives, such as early access book content and protection-free PDF formats that are now industry standard. Get a 45% discount for Tech Lead Journal listeners by using the code techlead45 for all products in all formats.

Like this episode? Show notes & transcript: techleadjournal.dev/episodes/175. Follow @techleadjournal on LinkedIn, Twitter, and Instagram. Buy me a coffee or become a patron.

The Obsidian Table
START UP: My Top Pick for a 100x Token in DeSci | Data Lake on 100x Podcast

The Obsidian Table

Play Episode Listen Later Apr 25, 2024 46:29


Can Blockchain lead to breakthroughs in science and medicine? Can crypto help create healthier and longer lives? DataLake is doing exactly that by evolving the way patients are recruited for scientific trials. We talked in depth about their token on a past 100x Gem Show, but this time we're joined by their CEO to go deep on DeSci (decentralized science) for the first time in our podcast's history! This is not sponsored in any way whatsoever; we're just genuinely stoked about exploring one of crypto's more unique use-cases, and asking the question... Why should you be paying attention to Data Lake?

Arrow Bandwidth
Spotlight On Sophos UK&I, Episode 3 April 2024, Ireland, Integrations And Partner Care

Arrow Bandwidth

Play Episode Listen Later Apr 12, 2024 24:00


In Episode 3 of the "Spotlight on Sophos" podcast series, we have a guest host, Ross Collins, Arrow Technical Account Manager for Ireland, talking to Sophos's Jon Hope about the latest achievements and innovations at Sophos. As well as an update on Arrow Ireland, they highlight the benefits of Sophos's integration with Veeam and Cisco Umbrella; how the new Sophos Partner Care team can help partners, particularly with the NFR (Not For Resale) programme; and the hot-off-the-press Adaptive Attack Protection additions. Listeners will also gain valuable insights into Sophos' Data Lake control and industry-leading network security features. Tune in to get all the latest technical information in this short, compact, compelling podcast.

Salesforce Developer Podcast
215: The Future of AI with Salesforce's Einstein 1 Studio & Data Cloud featuring Danielle Larregui

Salesforce Developer Podcast

Play Episode Listen Later Mar 19, 2024 17:13


Join us as we welcome the Data Cloud Queen herself, Danielle Larregui. Get ready to witness the groundbreaking power of Einstein 1 Studio as Danielle unveils its transformative capabilities within the Salesforce Data Cloud. Discover how developers can effortlessly create AI models using a no-code or low-code approach directly with their Data Lake data. We'll explore the practicality of generating predictions, integrating external AI platforms, and leveraging built-in tools for assessing prediction accuracy. Brace yourself for the standout feature of 'Bring Your Own Model,' which allows seamless, real-time data sharing without the need for ETL processes. We'll discuss the availability of Snowflake's integration and the potential that lies with Google BigQuery. Imagine how these integrations can revolutionize your external data management, from segmentation to identity resolution.  Stay tuned to learn how Data Cloud Enrichment could further enhance your Salesforce CRM by leveraging the power of Data Cloud data. Show Highlights: Introduction of Einstein 1 Studio and Model Builder within Salesforce Data Cloud for creating AI models using no-code or low-code approaches. How the "Bring Your Own Model" feature enables real-time data sharing with Salesforce Data Cloud without ETL processes. How Data Cloud Enrichment allows Salesforce CRM records to be updated with Data Cloud data. Remote Data Cloud, which could unify data management for organizations with multiple Salesforce instances. Ability to use predictions made by AI models in Salesforce flows, Apex classes, and reporting within Data Cloud. Links: Bring Your Google Vertex AI Models To Data Cloud - https://developer.salesforce.com/blogs/2023/11/bring-your-google-vertex-ai-models-to-data-cloud Use Model Builder to Integrate Databricks Models with Salesforce - https://developer.salesforce.com/blogs/2024/03/use-model-builder-to-integrate-databricks-models-with-salesforce  

The Obsidian Table
100X GEM SHOW: Perion ($PERC) | Mendi Finance ($MENDI) | Data Lake ($LAKE) | Will These 100x?

The Obsidian Table

Play Episode Listen Later Mar 7, 2024 29:14


The 100x Gem Show: where we ask one simple question: can this token 100x? It's the start of a bull market, we're ready to make some crazy gains. Let's see if these tokens are ripe for them! Today's projects are Perion, Mendi Finance, and Data Lake. Here's where you can find the projects on CoinGecko: https://www.coingecko.com/en/coins/mendi-finance https://www.coingecko.com/en/coins/perion https://www.coingecko.com/en/coins/data-la This is not a paid episode. None of them were informed we'd be analyzing the token. Find our speakers this week: Matthew Walker - https://twitter.com/hawaiianmint Cesar Martinez: https://twitter.com/poppabigmac Our Current Partners: Astrabit Trading: https://astrabit.io/ Shrapnel: https://twitter.com/playSHRAPNEL Kadena: https://twitter.com/kadena_io Blocksquare: https://twitter.com/blocksquare_io FortBlockGames: https://twitter.com/FortBlockGames Disclosures: As always, we want to stress that nothing in this is financial investment advice. Our goal with these conversations is to give everyone listening one more tool in their belt to utilize while they do their own research and learn more about crypto. 100x Podcast Partners are not endorsements to purchase or invest. They are projects or brands who have (at a minimum) purchased ad space in our podcast (which is how we fund the podcast's operations). We meet with them, often have them on the podcast so you can hear from them directly, and often find additional ways to support each other (like introducing us to other cool guests). Please do your own research. Time stamps: Intro: 00:00 Partner Highlight (Astrabit, Shrapnel, FortBlockGames) : 01:13 Perion: 3:36 Partner Highlight (Kadena, Blocksquare): 13:20 Mendi Finance: 14:51 Data Lake: 20:25

ApartmentHacker Podcast
Vidur Gupta | Beekin | CollectiveConversations

ApartmentHacker Podcast

Play Episode Listen Later Feb 20, 2024 26:18


In this conversation, Mike Brewer interviews Vidur Gupta, the founder and CEO of Beacon, an analytics platform for real estate. They discuss the power of AI and machine learning in property management, the benefits of using data to make informed decisions, and the role of machine learning models in predicting resident behavior. Vidur explains how Beacon's products and solutions help investors and operators across the asset lifecycle, from underwriting to management and financing. He also highlights the importance of creating a culture of trust and empowerment within an organization. The conversation concludes with recommendations for operators on adopting AI and a book recommendation: 'The Age of AI' by Eric Schmidt. Takeaways Beacon is an analytics platform that uses AI and machine learning to help investors and operators make data-driven decisions in real estate. Machine learning models can analyze large pools of data to predict resident behavior, optimize pricing, and measure social impact. AI removes emotion from decision-making and provides a more objective and accurate approach. Creating a culture of trust and empowerment is essential for building a successful organization. Operators should have a framework for evaluating AI and understand its limitations and benefits. Chapters 00:00 Introduction and Origin of the Name Beacon 01:15 Unpacking What Beacon Does 02:12 The AI Component of Beacon 03:13 The Power of AI and Machine Learning 04:45 Data Lake and Modeling 05:38 Using Data to Make Informed Decisions 06:05 Machine Learning in Property Management 07:35 The Power of Machine Learning in Decision-Making 08:17 Weighting Variables in Machine Learning Models 09:09 The Dynamic Nature of Machine Learning Models 09:49 The Benefits of Machine Learning over Rules-Based Models 10:48 Applying Machine Learning to Real Estate 11:26 Removing Emotion from Decision-Making with AI 12:43 Using AI to Overcome Biases in Decision-Making 13:39 Building the Future State with Beacon 14:34 Products and Solutions Offered by Beacon 15:52 Predicting Resident Lease Renewals 16:53 Dynamic Pricing of Leases 17:22 Measuring Social Impact with a Score 19:03 Using Beacon for Acquisition and CapEx Planning 26:48 Creating a Culture of Trust and Empowerment 29:47 Drawing Inspiration as a Leader 33:34 Recommended Books: The Age of AI 34:34 Advice for Operators on Adopting AI --- Send in a voice message: https://podcasters.spotify.com/pod/show/mike-brewer/message Support this podcast: https://podcasters.spotify.com/pod/show/mike-brewer/support

Software Engineering Daily
Building a Data Lake with Adam Ferrari

Software Engineering Daily

Play Episode Listen Later Feb 6, 2024 46:19 Very Popular


Starburst is a data lake analytics platform. It's designed to help users work with structured data at scale, and is built on the open source platform, Trino. Adam Ferrari is the SVP of Engineering at Starburst. He joins the show to talk about Starburst, data engineering, and what it takes to build a data lake. The post Building a Data Lake with Adam Ferrari appeared first on Software Engineering Daily.
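
The episode description above notes that Starburst is built on the open source Trino engine. For readers who want a concrete picture of what querying a data lake through Trino looks like, here is a minimal, hypothetical sketch using the trino Python client; the host, catalog, schema, and table names are placeholders and are not taken from the episode.

```python
# pip install trino  -- the open source Trino Python client
import trino

# Connection details below are placeholders for illustration only.
conn = trino.dbapi.connect(
    host="trino.example.internal",
    port=8080,
    user="analyst",
    catalog="hive",        # could equally be an Iceberg or Delta catalog
    schema="web_logs",
)

cur = conn.cursor()
# A typical lake query: scan raw event files registered as a table
# and aggregate them with plain SQL.
cur.execute("""
    SELECT status_code, count(*) AS hits
    FROM requests
    WHERE event_date = DATE '2024-02-01'
    GROUP BY status_code
    ORDER BY hits DESC
""")
for status_code, hits in cur.fetchall():
    print(status_code, hits)
```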

The Crypto Conversation
Subsquid Network - The Web3 Data Lake

The Crypto Conversation

Play Episode Listen Later Feb 1, 2024 45:05


Dr. Dmitry Zhelezov is the Co-Founder and CEO and Marcel Fohrmann is the Co-Founder and CFO of Subsquid, a decentralized data lake and query engine that offers developers permissionless, cost-efficient access to on-chain data from over 100 chains and is integrated into a large ecosystem of Web2- and Web3-native developer tools.

Why you should listen
Subsquid Network is a decentralized query engine optimized for batch extraction of large volumes of data. It currently serves historical on-chain data ingested from 100+ EVM and Substrate networks, including event logs, transaction receipts, traces and per-transaction state diffs. In the future, it will additionally support general-purpose SQL queries and an ever-growing collection of structured data sets derived from on- and off-chain data.

Supporting links
Bitget
Bitget Academy
Bitget Research
Bitget Wallet
Subsquid
Andy on Twitter
Brave New Coin on Twitter
Brave New Coin

If you enjoyed the show please subscribe to the Crypto Conversation and give us a 5-star rating and a positive review in whatever podcast app you are using.

Data Engineering Podcast
Build A Data Lake For Your Security Logs With Scanner

Data Engineering Podcast

Play Episode Listen Later Jan 29, 2024 62:38


Summary
Monitoring and auditing IT systems for security events requires the ability to quickly analyze massive volumes of unstructured log data. The majority of products that are available either require too much effort to structure the logs, or aren't fast enough for interactive use cases. Cliff Crosland co-founded Scanner to provide fast querying of high scale log data for security auditing. In this episode he shares the story of how it got started, how it works, and how you can get started with it.

Announcements
Hello and welcome to the Data Engineering Podcast, the show about modern data management. Data lakes are notoriously complex. For data engineers who battle to build and scale high quality data workflows on the data lake, Starburst powers petabyte-scale SQL analytics fast, at a fraction of the cost of traditional methods, so that you can meet all your data needs ranging from AI to data applications to complete analytics. Trusted by teams of all sizes, including Comcast and Doordash, Starburst is a data lake analytics platform that delivers the adaptability and flexibility a lakehouse ecosystem promises. And Starburst does all of this on an open architecture with first-class support for Apache Iceberg, Delta Lake and Hudi, so you always maintain ownership of your data. Want to see Starburst in action? Go to dataengineeringpodcast.com/starburst (https://www.dataengineeringpodcast.com/starburst) and get $500 in credits to try Starburst Galaxy today, the easiest and fastest way to get started using Trino. Your host is Tobias Macey and today I'm interviewing Cliff Crosland about Scanner, a security data lake platform for analyzing security logs and identifying issues quickly and cost-effectively.

Interview
Introduction
How did you get involved in the area of data management?
Can you describe what Scanner is and the story behind it?
What were the shortcomings of other tools that are available in the ecosystem?
What is Scanner explicitly not trying to solve for in the security space? (e.g. SIEM)
A query engine is useless without data to analyze. What are the data acquisition paths/sources that you are designed to work with? (e.g. CloudTrail logs, app logs, etc.)
What are some of the other sources of signal for security monitoring that would be valuable to incorporate or integrate with through Scanner?
Log data is notoriously messy, with no strictly defined format. How do you handle introspection and querying across loosely structured records that might span multiple sources and inconsistent labelling strategies?
Can you describe the architecture of the Scanner platform?
What were the motivating constraints that led you to your current implementation?
How have the design and goals of the product changed since you first started working on it?
Given the security oriented customer base that you are targeting, how do you address trust/network boundaries for compliance with regulatory/organizational policies?
What are the personas of the end-users for Scanner? How has that influenced the way that you think about the query formats, APIs, user experience etc. for the product?
For teams who are working with Scanner can you describe how it fits into their workflow?
What are the most interesting, innovative, or unexpected ways that you have seen Scanner used?
What are the most interesting, unexpected, or challenging lessons that you have learned while working on Scanner?
When is Scanner the wrong choice?
What do you have planned for the future of Scanner?

Contact Info
LinkedIn (https://www.linkedin.com/in/cliftoncrosland/)

Parting Question
From your perspective, what is the biggest gap in the tooling or technology for data management today?

Closing Announcements
Thank you for listening! Don't forget to check out our other shows. Podcast.__init__ (https://www.pythonpodcast.com) covers the Python language, its community, and the innovative ways it is being used. The Machine Learning Podcast (https://www.themachinelearningpodcast.com) helps you go from idea to production with machine learning. Visit the site (https://www.dataengineeringpodcast.com) to subscribe to the show, sign up for the mailing list, and read the show notes. If you've learned something or tried out a project from the show then tell us about it! Email hosts@dataengineeringpodcast.com (mailto:hosts@dataengineeringpodcast.com) with your story.

Links
Scanner (https://scanner.dev/)
cURL (https://curl.se/)
Rust (https://www.rust-lang.org/)
Splunk (https://www.splunk.com/)
S3 (https://aws.amazon.com/s3/)
AWS Athena (https://aws.amazon.com/athena/)
Loki (https://grafana.com/oss/loki/)
Snowflake (https://www.snowflake.com/en/) - Podcast Episode (https://www.dataengineeringpodcast.com/snowflakedb-cloud-data-warehouse-episode-110/)
Presto (https://prestodb.io/)
Trino (https://trino.io/)
AWS CloudTrail (https://aws.amazon.com/cloudtrail/)
GitHub Audit Logs (https://docs.github.com/en/organizations/keeping-your-organization-secure/managing-security-settings-for-your-organization/reviewing-the-audit-log-for-your-organization)
Okta (https://www.okta.com/)
Cribl (https://cribl.io/)
Vector.dev (https://vector.dev/)
Tines (https://www.tines.com/)
Torq (https://torq.io/)
Jira (https://www.atlassian.com/software/jira)
Linear (https://linear.app/)
ECS Fargate (https://aws.amazon.com/fargate/)
SQS (https://aws.amazon.com/sqs/)
Monoid (https://en.wikipedia.org/wiki/Monoid)
Group Theory (https://en.wikipedia.org/wiki/Group_theory)
Avro (https://avro.apache.org/)
Parquet (https://parquet.apache.org/)
OCSF (https://github.com/ocsf/)
VPC Flow Logs (https://docs.aws.amazon.com/vpc/latest/userguide/flow-logs.html)

The intro and outro music is from The Hug (http://freemusicarchive.org/music/The_Freak_Fandango_Orchestra/Love_death_and_a_drunken_monkey/04_-_The_Hug) by The Freak Fandango Orchestra (http://freemusicarchive.org/music/The_Freak_Fandango_Orchestra/) / CC BY-SA (http://creativecommons.org/licenses/by-sa/3.0/)

Digitally Transformed
Unlock the Power of Your Data with Microsoft Fabric

Digitally Transformed

Play Episode Listen Later Dec 21, 2023 40:20


Join host Justin Starbird and industry experts Connor O'Neill, Director of Advanced Analytics, and Brian Smith, Senior Strategic Advisor from Infused Innovations, on the latest episode of Digitally Transformed, where we dive into the revolutionary landscape of Microsoft Fabric.

Explore the Tech Horizon: Get ready for a captivating discussion on the markers guiding businesses into the future of technology, with insights into planning for the next year, touching on the challenges and opportunities presented by Microsoft Fabric. Discover how this innovative project is poised to reshape data solutions and AI.

Microsoft Fabric Unveiled: Unravel the layers of Microsoft Fabric. Learn how this unified Data Lake platform simplifies data integration, engineering, and analytics. Explore specific use cases, from AI and machine learning to leveraging large language models. Discover how Fabric empowers businesses to harness the power of their data effectively.

Democratizing Data for All: Explore how Microsoft Fabric breaks down barriers, making advanced tools accessible to businesses of all sizes. Our experts shed light on the platform's dynamic pricing and ease of use, revolutionizing data analytics for small and medium-sized enterprises. The hosts also address potential concerns, providing valuable insights into the ongoing evolution of Fabric during its preview phase.

Embark on Your Digital Transformation Journey: Discover the benefits of transforming your data into a strategic asset by embracing technologies like Microsoft Fabric to gain a competitive edge in the rapidly evolving digital landscape.

Join Justin, Connor, and Brian as they demystify Microsoft Fabric and provide a compelling overview of its capabilities. Tune in today to gain deeper insights into the features and benefits of this game-changing technology! Digitally Transformed promises to keep you informed and empowered in the dynamic realm of digital transformation!

The New Stack Podcast
Integrating a Data Warehouse and a Data Lake

The New Stack Podcast

Play Episode Listen Later Nov 16, 2023 20:59


TNS host Alex Williams is joined by Florian Valeye, a data engineer at Back Market, to shed light on the evolving landscape of data engineering, particularly focusing on Delta Lake and his contributions to open source communities. As a member of the Delta Lake community, Valeye discusses the intersection of data warehouses and data lakes, emphasizing the need for a unified platform that breaks down traditional barriers. Delta Lake, initially created by Databricks and now under the Linux Foundation, aims to enhance reliability, performance, and quality in data lakes. Valeye explains how Delta Lake addresses the challenges posed by the separation of data warehouses and data lakes, emphasizing the importance of providing ACID transactions, real-time processing, and scalable metadata. Valeye's involvement in Delta Lake began as a response to the challenges faced at Back Market, a global marketplace for refurbished devices. The platform manages large datasets, and Delta Lake proved to be a pivotal solution in optimizing ETL processes and facilitating communication between data scientists and data engineers. The conversation delves into Valeye's journey with Delta Lake, his introduction to the Rust programming language, and his role as a maintainer of the Rust-based library for Delta Lake. Valeye emphasizes Rust's importance in providing a high-level API with reliability and efficiency, offering a balanced approach for developers. Looking ahead, Valeye envisions Delta Lake evolving beyond traditional data engineering, becoming a platform that seamlessly connects data scientists and engineers. He anticipates improvements in data storage optimization and envisions Delta Lake serving as a standard format for machine learning and AI applications. The conversation concludes with Valeye reflecting on his future contributions, expressing a passion for Rust programming and an eagerness to explore evolving projects in the open-source community. Learn more from The New Stack about Delta Lake and The Linux Foundation: Delta Lake: A Layer to Ensure Data Quality; Data in 2023: Revenge of the SQL Nerds; What Do You Know about Your Linux System?
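For readers curious what the Rust-based library discussed here feels like in practice, the following is a minimal sketch using the deltalake Python package (the Python bindings over the delta-rs Rust library). The table path and column names are invented for illustration; this is not code from the episode.

    import pandas as pd
    from deltalake import DeltaTable, write_deltalake

    # Append a small batch of rows; the Delta transaction log is what
    # provides ACID guarantees on top of plain Parquet files.
    orders = pd.DataFrame({"order_id": [1, 2, 3], "amount": [19.9, 45.0, 7.5]})
    write_deltalake("/tmp/orders_delta", orders, mode="append")

    # Read the table back; older versions remain addressable (time travel).
    dt = DeltaTable("/tmp/orders_delta")
    print(dt.version())     # current version of the table
    print(dt.to_pandas())   # materialize the latest snapshot as a DataFrame

The same table can be read by Spark, Trino, or other engines that understand the Delta protocol, which is the interoperability point the conversation keeps returning to.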

Cloud N Clear
The Future of Customer Data in the Hybrid Cloud: A Conversation with Industry Experts | EP 166

Cloud N Clear

Play Episode Listen Later Oct 31, 2023 21:10


Brian Suk, Associate CTO at SADA, hosts episode 166 of Cloud N Clear to discuss all things data in a hybrid cloud world. He is joined by Adrian Estala, VP Field CDO at Starburst – a data-driven company offering a full-featured data lake analytics platform, built on open source Trino. Learn more about simplifying customer pipelines to make data more useful, insights on Data Lakehouse tools, and how to get your data 'right' and get it right fast. Join us in this engaging episode, and don't forget to LIKE, SHARE, & SUBSCRIBE for more enlightening content! ✅

Ready, Set, Cloud Podcast!
How Scanner Built an Ultra-Fast Serverless Data Lake with Cliff Crosland

Ready, Set, Cloud Podcast!

Play Episode Listen Later Oct 20, 2023 27:42


Have you ever wondered why querying your data lake is so slow? Or, if you're like Allen, did you ever wonder what a data lake actually is? Join Cliff Crosland as he explains how the Scanner team has changed data lakes forever by going serverless. This episode is a showcase of some brilliant engineering to solve a problem in a serverless manner. About Cliff Cliff is the CEO and co-founder of Scanner.dev, a security data lake product built for scale, speed, and cost efficiency. Prior to founding Scanner, he was a Principal Engineer at Cisco where he led the backend infrastructure team for the Webex People Graph. He was also the engineering lead for the data platform team at Accompany before its acquisition by Cisco. He has a love-hate relationship with Rust, but it's mostly love these days. Links Twitter - https://twitter.com/CliftonCrosland LinkedIn - https://www.linkedin.com/in/cliftoncrosland Scanner - https://scanner.dev Blog - Serverless Speed: Rust vs Go, Java, and Python - https://blog.scanner.dev/serverless-speed-rust-vs-go-java-python-in-aws-lambda-functions --- Send in a voice message: https://podcasters.spotify.com/pod/show/readysetcloud/message Support this podcast: https://podcasters.spotify.com/pod/show/readysetcloud/support

The Cloud Pod
230: If I Ever Own a Sailboat, I Will Name it Kafka… and Sail it on a Data Lake

The Cloud Pod

Play Episode Listen Later Oct 11, 2023 54:50


Welcome to The Cloud Pod episode 230, where the forecast is always cloudy! This week we're sailing our pod across the data lake and talking about updates to managed delivery from Kafka. We also take a gander at Bedrock, as well as some new security tools from our friends over at Google. We're also back with our Cloud Journey Series, talking security theater. Stay Tuned! Titles we almost went with this week:

Microsoft Mechanics Podcast
Automate data-driven actions | Data Activator in Microsoft Fabric

Microsoft Mechanics Podcast

Play Episode Listen Later Oct 5, 2023 8:31


React fast to changes in data with an automated system of detection and action using Data Activator. Monitor and track changes at a granular level as they happen, instead of at an aggregate level where important insights can stay buried in the detail until they have already become a problem. For domain experts, this provides a no-code way to take data, whether real-time streaming from your IoT devices or batch data collected from your business systems, and dynamically monitor patterns by establishing conditions. When these conditions are met, Data Activator automatically triggers specific actions, such as notifying dedicated teams or initiating system-level remediations. Join Will Thompson, Group Product Manager for Data Activator, as he shares how to monitor high volumes of granular operational data and translate them into specific actions. ► QUICK LINKS: 00:00 - Monitor and track operational data in real-time 00:53 - Demo: Logistics company use case 02:49 - Add a condition 04:04 - Test actions 04:36 - Batch data 06:21 - Trigger an automated workflow 07:12 - How it works 08:12 - Wrap up ► Link References Get started at https://aka.ms/dataActivatorPreview Check out the Data Activator announcement blog at https://aka.ms/dataActivatorBlog ► Unfamiliar with Microsoft Mechanics? As Microsoft's official video series for IT, you can watch and share valuable content and demos of current and upcoming tech from the people who build it at Microsoft. • Subscribe to our YouTube: https://www.youtube.com/c/MicrosoftMechanicsSeries • Talk with other IT Pros, join us on the Microsoft Tech Community: https://techcommunity.microsoft.com/t5/microsoft-mechanics-blog/bg-p/MicrosoftMechanicsBlog • Watch or listen from anywhere, subscribe to our podcast: https://microsoftmechanics.libsyn.com/podcast ► Keep getting this insider knowledge, join us on social: • Follow us on Twitter: https://twitter.com/MSFTMechanics • Share knowledge on LinkedIn: https://www.linkedin.com/company/microsoft-mechanics/ • Enjoy us on Instagram: https://www.instagram.com/msftmechanics/ • Loosen up with us on TikTok: https://www.tiktok.com/@msftmechanics
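Data Activator itself is a no-code experience, so there is no SDK call to show here. Purely as a conceptual sketch of the condition-then-action pattern it automates, here is a hypothetical Python illustration; the event shape, threshold, and notify helper are invented and are not part of any Microsoft API.

    from typing import Dict, Iterable

    TEMP_LIMIT_C = 8.0  # hypothetical cold-chain threshold for the logistics demo

    def notify(team: str, message: str) -> None:
        # Stand-in for an email, Teams message, or webhook call.
        print(f"[alert -> {team}] {message}")

    def monitor(events: Iterable[Dict]) -> None:
        for event in events:
            # The condition is evaluated per event (granular), not on an aggregate.
            if event["temperature_c"] > TEMP_LIMIT_C:
                notify("logistics-ops",
                       f"package {event['package_id']} reached "
                       f"{event['temperature_c']} C (limit {TEMP_LIMIT_C} C)")

    monitor([
        {"package_id": "PKG-001", "temperature_c": 4.2},
        {"package_id": "PKG-002", "temperature_c": 9.7},  # triggers the action
    ])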

Defense in Depth
How to Prime Your Data Lake

Defense in Depth

Play Episode Listen Later Sep 14, 2023 27:18


All links and images for this episode can be found on CISO Series. A security data lake, a repository of all the data you need to analyze and have analyzed, sounds wonderful. But priming that lake, and stocking it with the data you need to get the insights you want, is a more difficult task than it seems. Check out this post for the discussion that is the basis of our conversation on this week's episode co-hosted by me, David Spark (@dspark), the producer of CISO Series, and Geoff Belknap (@geoffbelknap), CISO, LinkedIn. Joining us is our sponsored guest, Matt Tharp, Head of Field Engineering, Comcast DataBee. Thanks to our podcast sponsor, Comcast Technology Solutions. In this episode: What exactly is a data lake? How are people thinking about and handling the risks? If you want security data lakes to be successful, what customer problem are you trying to solve? How can you make it both dead simple to use AND highly effective?

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion
AI Today Podcast: AI Glossary Series – Data Warehouse, Data Lake, Extract Transform Load (ETL)

AI Today Podcast: Artificial Intelligence Insights, Experts, and Opinion

Play Episode Listen Later Sep 8, 2023 16:07


In this episode of the AI Today podcast hosts Kathleen Walch and Ron Schmelzer define the terms Data Warehouse, Data Lake, and Extract Transform Load (ETL), explain how these terms relate to AI, and why it's important to know about them. Show Notes: FREE Intro to CPMAI mini course CPMAI Training and Certification AI Glossary AI Glossary Series – DevOps, Machine Learning Operations (ML Ops) AI Glossary Series – Automated Machine Learning (AutoML) AI Glossary Series – Data Preparation, Data Cleaning, Data Splitting, Data Multiplication, Data Transformation AI Glossary Series – Data Augmentation, Data Labeling, Bounding box, Sensor fusion AI Glossary Series – Data, Dataset, Big Data, DIKUW Pyramid Continue reading AI Today Podcast: AI Glossary Series – Data Warehouse, Data Lake, Extract Transform Load (ETL) at Cognilytica.
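Since Extract Transform Load is one of the terms defined in this glossary episode, a toy ETL pass may help make it concrete. The file name, columns, and cleanup rule below are invented purely for illustration.

    import csv
    import sqlite3

    def extract(path: str) -> list:
        # Extract: read raw rows from a source system export.
        with open(path, newline="") as f:
            return list(csv.DictReader(f))

    def transform(rows: list) -> list:
        # Transform: normalize names and drop rows with a missing amount.
        return [(r["customer"].strip().title(), float(r["amount"]))
                for r in rows if r.get("amount")]

    def load(rows: list, db_path: str = "warehouse.db") -> None:
        # Load: write the cleaned rows into the analytical target.
        with sqlite3.connect(db_path) as conn:
            conn.execute("CREATE TABLE IF NOT EXISTS sales (customer TEXT, amount REAL)")
            conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)

    if __name__ == "__main__":
        load(transform(extract("sales.csv")))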

FemInnovation
Ep. 7: Data Donation and AI in Women's Health: A Conversation with Ligia Kornowska

FemInnovation

Play Episode Listen Later Sep 6, 2023 31:54


In this episode of FemInnovation, host Bethany Corbin sits down with Ligia Kornowska, an internationally recognized leader in AI in medicine, telemedicine, and medical data, and a 2x Forbes 30 under 30 honoree. Ligia is the Managing Director of the Polish Hospital Federation (the largest hospital organization in Poland), Co-Founder and Chair of the Board of Data Lake, and President of the Donate Your Data Foundation. She is also a recipient of the ESET Heroes of Progress award and has been listed among the 100 most influential people in Polish healthcare for the last 3 years. In this episode, Bethany and Ligia discuss democratizing access to medical data (with patient consent!) to help advance women's healthcare and close the gender data gap. They break down the use of blockchain technology to give medical organizations and startups access to essential data to help improve, test, and refine their products and offerings. Bethany and Ligia also discuss best practices and concerns with AI that femtech and healthtech startups should know, and the importance of breaking into emerging femtech markets worldwide. RELEVANT LINKS: More about Data Lake at this website. More about Ligia's work at the International Hospital Federation at this website. Connect with Our Guest, Ligia Kornowska: LINKEDIN | INSTAGRAM | TWITTER/X Follow Our Host: WEBSITE | LINKEDIN | TWITTER | INSTAGRAM | ADDITIONAL LINKS

Who's your Data? Podcast
Ep21: Ethical Sourcing of Medical Data

Who's your Data? Podcast

Play Episode Listen Later Aug 28, 2023 47:54


Progress in healthcare and medical research requires a lot of data. In the era of Big Data, health data is among the most valuable and most private information anyone can have. Two major hurdles in finding quality medical data are getting access to that data in a private and ethical way, and bias in the data due to the underrepresentation of women and of certain racial or ethnic groups. In this episode I talk to Dinidh O'Brien about how donateyourdata.org and Data Lake are trying to solve this problem with a patient-first approach to sourcing medical data for research. Data Lake is an EU-funded start-up creating a global medical data donation system based on blockchain technology, with privacy and informed consent as fundamental pillars. We discuss this data donation framework and how it addresses the issues of privacy, consent, and data monetization while working to minimize biases. Dinidh explains how they approach patients to opt in, how they vet organizations that request access to this data, and how they plan to expand throughout Europe and the US.

RunAs Radio
Microsoft Fabric with Andrew Snodgrass

RunAs Radio

Play Episode Listen Later Aug 23, 2023 41:27


What is Microsoft Fabric, and why do you want some? Richard talks to Andrew Snodgrass of Directions on Microsoft about Microsoft's recently announced Fabric product. Andrew explains that Fabric is an effort to integrate the various data products, including Power BI, Data Lake, Data Factory, and Data Warehousing, under a single banner. It is early days for Fabric, but it's a great time to take it out for a spin for those who haven't dug into Azure data analytics products. But if you have existing implementations of Power BI and many other data products, test carefully - the migration paths aren't simple! Links: Microsoft Fabric; Azure Synapse Analytics; Azure Data Factory; Power BI; Fabric Workspaces; OneLake; Kusto Query Language (KQL); Parquet Files in Fabric; Microsoft Purview; OneLake File Explorer. Recorded July 12, 2023

Data Protection Gumbo
205: Plumbing the Depths of Unstructured Data - Superna

Data Protection Gumbo

Play Episode Listen Later Jul 18, 2023 28:00


Alex Hesterberg, CEO at Superna, embarks on a captivating exploration of the expanding world of unstructured data and data security trends. The discussion gets fervid as Alex enlightens us about the amplified use of unstructured data platforms in recovery and resiliency, along with its application in tier-zero platforms like SAP HANA. His insights about data integration into business intelligence tools and the potential of AI and ML technologies like ChatGPT are truly riveting.

Equity
Is ChatGPT the iBeer of LLMs?

Equity

Play Episode Listen Later Jul 12, 2023 32:01


This week we had a very special guest on the podcast: Matthew Lynley, one of the founding hosts of Equity and a former TechCruncher. Since his Equity days, Lynley went off and started his very own AI-focused publication called Supervised. We brought him back on the show to ask him questions in a format where we can all learn together. Here's what we got into: From Transformers to GPT4: How attention became so critical inside of neural networks, and how transformers set the path for modern AI services. Recent acquisitions in the AI space, and what it means for the "LLM stack": With Databricks buying MosaicML and Snowflake already busy with its own checkbook, a lot of folks are working to build out a full-stack LLM data extravaganza. We talked about what that means. Where startups sit in the current AI race: While it's great to think about the majors, we also need to know what the startup angle is. The answer? It's a little early to say, but what is clear is that startups are taking some big swings at the industry and are hellbent to snag a piece of the pie. Thanks to everyone for hanging out with us. Equity is back on Friday for our weekly news roundup! For episode transcripts and more, head to Equity's Simplecast website. Equity drops at 7 a.m. PT every Monday, Wednesday and Friday, so subscribe to us on Apple Podcasts, Overcast, Spotify and all the casts. TechCrunch also has a great show on crypto, a show that interviews founders, one that details how our stories come together and more!

Programmatic Digest's podcast
127. Data Privacy, Data Clean Rooms, Identity Discussion with U of Digital's Myles Younger

Programmatic Digest's podcast

Play Episode Listen Later Jun 20, 2023 54:50


When this podcast was launched back in 2018, one of the biggest reasons was to share knowledge and highlight diversity. In the last 4-5 years, it grew into a community where we meet weekly and talk all things programmatic activations and industry trends.  With that said, one of our goals in 2023, was to invite more guests during the free community where members would have the opportunity to learn and ask questions directly.  Myles Younger joined us in our weekly community call, aka the Programmatic Meetup. In this episode, Myles talks about data privacy, data clean rooms, and identity from definition to hot takes. At the latter part of the episode, some of our ninjas had the chance to ask questions directly to Myles and discuss as a group. Truly a wonderful opportunity and experience!   Thanks to our friend at U of Digital!    About Us: Our mission is to teach historically excluded people how to get started in programmatic media buying and find a dream job.  We do so by providing on-demand lessons via the Reach and Frequency™️ program, a dope community with like-minded programmatic experts, and live free and paid group coaching.  Hélène Parker has over 10 years of experience in programmatic media buying, servicing agencies and brands in activation, strategy and planning, and leadership.  She now dedicates her time to recruiting and training programmatic traders while consulting companies on how to grow and scale a programmatic department.  Interested in training or hiring programmatic juniors? Book a Free Call   Timestamp: 00:00:29 - 2 Wins and a Challenge 00:03:42 - Myles Younger Introduction  00:06:25 - Myles' shift into programmatic 00:07:52 - Defining programmatic to a 5 years old 00:15:40 - Latest important news about data privacy 00:21:28 - Changing the meaning of third party cookies 00:26:20 - Data Clean Rooms 00:31:18 - DMP Obsoletion 00:33:12 - Data Clean Rooms Accessibility 00:36:16 - Data Clean Rooms difference from DMP or Data Lake 00:39:04 - Question and Answer 00:50:54 - Words of Wisdom from Myles Younger   Interested in finding out if you are a fit for a career in digital advertising and programmatic? Take our free Quiz: www.heleneparker.com/programmaticquiz   Guest Information: Myles Younger LinkedIn U of Digital Website | LinkedIn | Newsletter Meet Our Team: Hélène Parker - Chief Programmatic Coach Website | LinkedIn | Twitter | The Reach & Frequency Course Programmatic Digest - Youtube | LinkedIn | Instagram Alexa Gabrielle Ramos - Podcast Editor Instagram | Website | LinkedIn  S and S Creative Media - Podcast and Media Manager Instagram | Website | LinkedIn   Get this directly in your inbox weekly including more gems!  Let's keep in touch: Sign up to receive our weekly newsletter here: www.heleneparker.com/newsletter Join our next training program by signing up to our waitlist below: https://www.heleneparker.com/waitlist/     Also take a moment to check out: How To Optimise Data Segment: https://youtu.be/boj0SJF5kn8  Join Our Slack Channel for programmatic ninjas looking to level up and build a network: https://join.slack.com/t/theprogrammaticmeetup/shared_invite/zt-1nlaoighs-ES98OYwn67rkk1vqgC4i9Q  

Engenharia de Dados [Cast]
Simplify Data Engineering Projects in Your Lakehouse with Delta Lake Framework with Matthew Powers & Denny Lee, Developer Advocates at Databricks

Engenharia de Dados [Cast]

Play Episode Listen Later May 23, 2023 72:32


In today's episode, Luan Moreno and Mateus Oliveira interview Denny Lee and Matthew Powers, currently Developer Advocates at Databricks. Delta Lake is an open-source product, developed by the company founded by the creators of Apache Spark, that lets us build the well-known Data Lakehouse pattern {Data Lake + Data Warehouse}. Delta Lake solves Apache Spark's storage problem, handling data processing in the Data Lake in an optimized way. With Delta Lake you get the following benefits: a file format that behaves like a table; Time Travel; ACID transactions; unified batch and streaming. In this conversation we also talk about the following topics: the state of the art in data; Delta Lake. Learn more about Delta Lake and how to use it as a Data Lakehouse technology, together with the Databricks team that does the most to energize the community with content, releases, and events in support of this open-source product. Denny Lee - Linkedin Matthew Powers - Linkedin https://delta.io/ Luan Moreno = https://www.linkedin.com/in/luanmoreno/

The My Love of Golf Podcast
Brooks win brings a full team PGA Championship Review. Teepster & Data Lake Tips with Rossco, Rocket & Magic. | THE MLOG PODCAST EP239

The My Love of Golf Podcast

Play Episode Listen Later May 23, 2023 69:31


PGA Championship Review: What a great tournament; it delivered right until the end. The team breaks down Brooks Koepka's win and reviews some of the other notable moments from the PGA Championship. Of course, we look through the Teepster results and check in on this week's Charles Schwab being held at Colonial.

Data Engineering Podcast
Keep Your Data Lake Fresh With Real Time Streams Using Estuary

Data Engineering Podcast

Play Episode Listen Later May 21, 2023 55:50


Summary Batch vs. streaming is a long running debate in the world of data integration and transformation. Proponents of the streaming paradigm argue that stream processing engines can easily handle batched workloads, but the reverse isn't true. The batch world has been the default for years because of the complexities of running a reliable streaming system at scale. In order to remove that barrier, the team at Estuary have built the Gazette and Flow systems from the ground up to resolve the pain points of other streaming engines, while providing an intuitive interface for data and application engineers to build their streaming workflows. In this episode David Yaffe and Johnny Graettinger share the story behind the business and technology and how you can start using it today to build a real-time data lake without all of the headache. Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data management RudderStack helps you build a customer data platform on your warehouse or data lake. Instead of trapping data in a black box, they enable you to easily collect customer data from the entire stack and build an identity graph on your warehouse, giving you full visibility and control. Their SDKs make event streaming from any app or website easy, and their extensive library of integrations enable you to automatically send data to hundreds of downstream tools. Sign up free at dataengineeringpodcast.com/rudderstack (https://www.dataengineeringpodcast.com/rudderstack) Your host is Tobias Macey and today I'm interviewing David Yaffe and Johnny Graettinger about using streaming data to build a real-time data lake and how Estuary gives you a single path to integrating and transforming your various sources Interview Introduction How did you get involved in the area of data management? Can you describe what Estuary is and the story behind it? Stream processing technologies have been around for around a decade. How would you characterize the current state of the ecosystem? What was missing in the ecosystem of streaming engines that motivated you to create a new one from scratch? With the growth in tools that are focused on batch-oriented data integration and transformation, what are the reasons that an organization should still invest in streaming? What is the comparative level of difficulty and support for these disparate paradigms? What is the impact of continuous data flows on dags/orchestration of transforms? What role do modern table formats have on the viability of real-time data lakes? Can you describe the architecture of your Flow platform? What are the core capabilities that you are optimizing for in its design? What is involved in getting Flow/Estuary deployed and integrated with an organization's data systems? What does the workflow look like for a team using Estuary? How does it impact the overall system architecture for a data platform as compared to other prevalent paradigms? How do you manage the translation of poll vs. push availability and best practices for API and other non-CDC sources? What are the most interesting, innovative, or unexpected ways that you have seen Estuary used? What are the most interesting, unexpected, or challenging lessons that you have learned while working on Estuary? When is Estuary the wrong choice? What do you have planned for the future of Estuary? 
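Flow leans on JSON Schema (linked below) to keep continuously captured documents well typed as they move from sources to materializations. As a generic illustration of that idea, and not Estuary's actual catalog format, here is how one might validate a change-data-capture event against a schema in Python; the schema and event are invented.

    from jsonschema import ValidationError, validate

    # Invented schema for a CDC-style event; not Estuary's spec format.
    cdc_event_schema = {
        "type": "object",
        "required": ["op", "table", "after"],
        "properties": {
            "op": {"enum": ["c", "u", "d"]},   # create / update / delete
            "table": {"type": "string"},
            "after": {"type": "object"},
            "ts_ms": {"type": "integer"},
        },
    }

    event = {
        "op": "u",
        "table": "orders",
        "after": {"id": 42, "status": "shipped"},
        "ts_ms": 1700000000000,
    }

    try:
        validate(instance=event, schema=cdc_event_schema)
        print("event conforms to the collection schema")
    except ValidationError as err:
        print(f"rejected: {err.message}")

Enforcing a schema at capture time is what lets downstream consumers, whether a warehouse table or another stream, trust the shape of every document without re-validating it themselves.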
Contact Info Dave Y (mailto:dave@estuary.dev) Johnny G (mailto:johnny@estuary.dev) Parting Question From your perspective, what is the biggest gap in the tooling or technology for data management today? Closing Announcements Thank you for listening! Don't forget to check out our other shows. Podcast.__init__ (https://www.pythonpodcast.com) covers the Python language, its community, and the innovative ways it is being used. The Machine Learning Podcast (https://www.themachinelearningpodcast.com) helps you go from idea to production with machine learning. Visit the site (https://www.dataengineeringpodcast.com) to subscribe to the show, sign up for the mailing list, and read the show notes. If you've learned something or tried out a project from the show then tell us about it! Email hosts@dataengineeringpodcast.com (mailto:hosts@dataengineeringpodcast.com)) with your story. To help other people find the show please leave a review on Apple Podcasts (https://podcasts.apple.com/us/podcast/data-engineering-podcast/id1193040557) and tell your friends and co-workers Links Estuary (https://estuary.dev) Try Flow Free (https://dashboard.estuary.dev/register) Gazette (https://gazette.dev) Samza (https://samza.apache.org/) Flink (https://flink.apache.org/) Podcast Episode (https://www.dataengineeringpodcast.com/apache-flink-with-fabian-hueske-episode-57/) Storm (https://storm.apache.org/) Kafka Topic Partitioning (https://www.openlogic.com/blog/kafka-partitions) Trino (https://trino.io/) Avro (https://avro.apache.org/) Parquet (https://parquet.apache.org/) Fivetran (https://www.fivetran.com/) Podcast Episode (https://www.dataengineeringpodcast.com/fivetran-data-replication-episode-93/) Airbyte (https://www.dataengineeringpodcast.com/airbyte-open-source-data-integration-episode-173/) Snowflake (https://www.snowflake.com/en/) BigQuery (https://cloud.google.com/bigquery) Vector Database (https://learn.microsoft.com/en-us/semantic-kernel/concepts-ai/vectordb) CDC == Change Data Capture (https://en.wikipedia.org/wiki/Change_data_capture) Debezium (https://debezium.io/) Podcast Episode (https://www.dataengineeringpodcast.com/debezium-change-data-capture-episode-114/) MapReduce (https://en.wikipedia.org/wiki/MapReduce) Netflix DBLog (https://netflixtechblog.com/dblog-a-generic-change-data-capture-framework-69351fb9099b) JSON-Schema (http://json-schema.org/) The intro and outro music is from The Hug (http://freemusicarchive.org/music/The_Freak_Fandango_Orchestra/Love_death_and_a_drunken_monkey/04_-_The_Hug) by The Freak Fandango Orchestra (http://freemusicarchive.org/music/The_Freak_Fandango_Orchestra/) / CC BY-SA (http://creativecommons.org/licenses/by-sa/3.0/)

The CTO Advisor
Understanding the value of a Data Lake House

The CTO Advisor

Play Episode Listen Later May 17, 2023


Overview: This episode of the CTO Advisor podcast discusses IBM Watsonx.Data, a part of IBM's AI platform and toolset. The conversation focuses on the concept of a data lake house and its role in data governance and analytics. Tony Baer, …

Facts Not Feelings with Brooke C. Furniss
Data Activation in Automotive Industry: Myths, Risks, and Best Practices with Brian Davis of Orbee

Facts Not Feelings with Brooke C. Furniss

Play Episode Listen Later May 4, 2023 60:54


Welcome to another episode of Facts Not Feelings. Join us as we sit down with Brian Davis, the Vice President of Sales and Solutions at Orbee, who has extensive experience in helping automotive companies leverage their data for business success. In this episode, we explore the challenges faced by marketers in activating a data lake, the importance of vendor collaboration, and the potential risks associated with data activation. We also discuss the role of AI and machine learning in this field, as well as the evolution of first-party data usage and the ethical and transparent use of data. As the automotive industry continues to evolve and become increasingly data-driven, the insights and best practices shared by Brian will be invaluable to anyone looking to succeed in this space. So don't miss out on this opportunity to learn from a true expert in the field - tune in to this episode of Facts Not Feelings now! Connect with Brian Davis: https://qrco.de/bdujPL Let BZ Consultants Inspect What Should Be Expected

Datacenter Technical Deep Dives
Building a Security Data Lake in AWS with Richard Fan

Datacenter Technical Deep Dives

Play Episode Listen Later Apr 4, 2023 60:24


Richard Fan is an AWS Community Builder. In this episode he talks about architectural best practices when building a data lake dedicated to security concerns. Resources: https://www.linkedin.com/in/richardfan1126/ https://twitter.com/richardfan1126
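To make the idea of a security data lake on AWS a little more concrete, here is a hedged sketch of one common building block: running an Athena query over logs stored in S3 with boto3. The database, table, and bucket names are placeholders, and the episode's actual architecture may differ.

    import time
    import boto3

    athena = boto3.client("athena", region_name="us-east-1")

    # Find the API calls that most often get denied, e.g. across CloudTrail logs.
    resp = athena.start_query_execution(
        QueryString="""
            SELECT eventname, count(*) AS calls
            FROM cloudtrail_logs
            WHERE errorcode = 'AccessDenied'
            GROUP BY eventname
            ORDER BY calls DESC
            LIMIT 10
        """,
        QueryExecutionContext={"Database": "security_lake"},
        ResultConfiguration={"OutputLocation": "s3://my-athena-results/"},
    )

    query_id = resp["QueryExecutionId"]
    while True:
        status = athena.get_query_execution(QueryExecutionId=query_id)
        state = status["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(1)

    if state == "SUCCEEDED":
        rows = athena.get_query_results(QueryExecutionId=query_id)["ResultSet"]["Rows"]
        for row in rows:
            print([col.get("VarCharValue") for col in row["Data"]])

Keeping the raw logs in S3 and querying them in place is what makes this pattern cheap to store and easy to scale compared with shipping everything into a traditional SIEM.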

The Cloudcast
Data Lakehouses and Apache Hudi

The Cloudcast

Play Episode Listen Later Feb 15, 2023 30:59


Kyle Weller (@KyleJWeller, Head of Product @onehousehq) talks about the latest trends in OSS Data Lakes, Data Warehouses, and the evolution to "Data Lakehouses" with Apache Hudi. SHOW: 694. CLOUD NEWS OF THE WEEK - http://bit.ly/cloudcast-cnotw NEW TO CLOUD? CHECK OUT - "CLOUDCAST BASICS" SHOW SPONSORS: Datadog Synthetic Monitoring: Frontend and Backend Modern Monitoring - Ensure frontend issues don't impair user experience by detecting user-facing issues with API and browser tests with a free 14 day Datadog trial. Listeners of The Cloudcast will also receive a free Datadog T-shirt. Solve your IAM mess with Strata's Identity Orchestration platform - Have an identity challenge you thought was too big, too complicated, or too expensive to fix? Let us solve it for you! Visit strata.io/cloudcast to share your toughest IAM challenge and receive a set of AirPods Pro. How to Fix the Internet (A new podcast from the EFF). SHOW NOTES: Onehouse (homepage); Onehouse raises $25M Series A funding; Apache Hudi (homepage); Delta Lake (homepage); Apache Iceberg (homepage); Apache Hudi vs Delta Lake vs Apache Iceberg - Lakehouse Feature Comparison. Topic 1 - Welcome to the show. Tell us a little bit of your background, and where you focus your efforts at Onehouse? Topic 2 - Your focus is on an emerging open source project, Apache Hudi. Before we dive into the project and technologies, we're always interested in the background of what drove the creation of new projects. What problems existed before Hudi? Topic 3 - Let's dive into Hudi. Data lakes, Delta Lakes, Lake houses, Icebergs. What is going on with all these water metaphors? Topic 4 - Hudi is focused on streaming data lakes. What are some of the things (types of applications) that need a streaming data lake? Where do transactions come into play? Where do data warehouse capabilities come into play? Topic 5 - Stitching together open source projects and platforms can be complicated. How does the Onehouse platform simplify all of this for either data scientists or platform teams? Topic 6 - What are some examples of how companies are using Onehouse and Hudi today? FEEDBACK? Email: show at the cloudcast dot net Twitter: @thecloudcastnet
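As a rough companion to the conversation, here is a minimal sketch of writing to an Apache Hudi table from PySpark. The table name, record key, and path are invented, and exact options can vary by Hudi and Spark version; this is not code from the episode.

    from pyspark.sql import SparkSession

    # Assumes the Hudi Spark bundle jar is available on the classpath.
    spark = (SparkSession.builder
             .appName("hudi-sketch")
             .config("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
             .getOrCreate())

    df = spark.createDataFrame(
        [("trip-1", "2023-02-15 10:00:00", 12.5),
         ("trip-2", "2023-02-15 10:05:00", 3.9)],
        ["trip_id", "ts", "fare"],
    )

    hudi_options = {
        "hoodie.table.name": "trips",
        "hoodie.datasource.write.recordkey.field": "trip_id",
        "hoodie.datasource.write.precombine.field": "ts",
        "hoodie.datasource.write.operation": "upsert",
    }

    # Upserts (plus deletes and incremental pulls) are what make the table behave
    # like a streaming-friendly lakehouse rather than a pile of immutable files.
    df.write.format("hudi").options(**hudi_options).mode("append").save("/tmp/hudi/trips")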

Zavtracast (Завтракаст)
Magnitnoe Pole (Magnetic Field) #4 – Data Lake, Data Governance, and Data Management

Zavtracast (Завтракаст)

Play Episode Listen Later Dec 28, 2022 50:51


As 2022 comes to a close we are not saying goodbye; instead we are publishing the fourth episode of the Magnitnoe Pole ("Magnetic Field") podcast, which we produce together with the IT team of the retailer Magnit. As usual, we try to explain complex topics as simply as possible. This time we dig into the weeds of data management to figure out what Data Governance and a Data Lake are, how to validate data properly, how to store it, and why any of this is needed at all. Helping us is the guest of the fourth episode, Pavel Shorokhov, Chief Data Officer at Magnit. Among other things, he explains where that job title came from and what people who hold it are responsible for in large companies. It turned out to be fascinating, so we highly recommend it if you work in a field where you deal with terabytes, petabytes, or zettabytes of data. You can listen to this episode right in the Zavtracast podcast feed on any podcast service: https://podcast.ru/1068329384 You can also watch it on the Zavtracast YouTube channel: https://youtu.be/DjVfxSY9_PI You can find many interesting case studies and stories in the MagnIT blogs. On VC: https://vc.ru/magnit-tech On Habr: https://habr.com/ru/company/magnit Browse open positions and send your resume: https://magnit.tech The post Magnitnoe Pole #4 – Data Lake, Data Governance, and Data Management first appeared on Zavtracast.

Software Engineering Daily
Data Lake for Developers with Jorge Sancha

Software Engineering Daily

Play Episode Listen Later Sep 12, 2022 36:13


Data analytics technology and tools have seen significant improvements in the past decade. But it can still take weeks to prototype, build, and deploy new transformations and deployments, usually requiring considerable engineering resources. Plus, most data isn't real-time. Instead, most of it is still batch-processed. Tinybird Analytics provides an easy way to ingest and query The post Data Lake for Developers with Jorge Sancha appeared first on Software Engineering Daily.
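As a hedged illustration of the ingest side, the snippet below posts a JSON event to Tinybird's Events API over HTTP. The endpoint, the "name" query parameter, the data source name, and the token handling are assumptions to verify against Tinybird's current documentation; they are not taken from the episode.

    import json
    import os
    import requests

    # Token with append rights to the target data source (assumed env var name).
    TB_TOKEN = os.environ["TINYBIRD_TOKEN"]

    event = {"timestamp": "2022-09-12T10:00:00Z", "page": "/pricing", "duration_ms": 87}

    # Endpoint and "name" parameter are assumptions; confirm them in the docs.
    resp = requests.post(
        "https://api.tinybird.co/v0/events",
        params={"name": "page_views"},
        data=json.dumps(event),
        headers={"Authorization": f"Bearer {TB_TOKEN}"},
    )
    resp.raise_for_status()
    print(resp.status_code, resp.text)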