Podcasts about Wikidata

Free knowledge database project

  • 68 podcasts
  • 122 episodes
  • 50m average episode duration
  • 1 new episode per month
  • Latest episode: Nov 4, 2025
Wikidata

Popularity trend, 2017–2024 (chart)


Best podcasts about Wikidata

Latest podcast episodes about Wikidata

SEO Is Not That Hard
Entities Part 3 : The Knowledge Graph

SEO Is Not That Hard

Play Episode Listen Later Oct 10, 2025 10:59 Transcription Available


Most brands still try to "tell" Google who they are. We show how Google actually decides: by stitching together a ledger of facts from your site, LinkedIn, Wikipedia, news articles, and structured data, then trusting only what aligns. This is the Knowledge Graph at work, and it's quietly steering whether you earn a knowledge panel, sitelinks, and richer visibility across search. We break down the four streams feeding the graph (public web pages, licensed datasets, human-edited knowledge bases like Wikidata, and direct owner signals via schema.org) and explain how each contributes to a confidence score for your entity. If your about page says Jane Doe is CEO but LinkedIn shows John Smith, the score drops and your brand becomes ambiguous. If your website, LinkedIn, reputable press, and Wikidata all agree, trust rises and your facts become "truth" in search. From there, we get specific about what you can control. Use schema.org to describe your organisation, people, products, and identifiers in clear, machine-readable terms. Link out with sameAs to authoritative profiles so Google can triangulate identity. Audit your knowledge panel as a live diagnostic: check logos, dates, roles, and categories, and chase down any mismatch to the original source. Treat digital PR and reputation management as part of technical SEO, because today they are. By the end, you'll have a practical checklist for entity hygiene that helps you earn and keep a clean knowledge panel, avoid costly confusion, and unlock higher-trust features across the results page. If this helped clarify how entities power modern SEO, subscribe, share with a colleague, and leave a quick review with one takeaway you'll act on next.

SEO Is Not That Hard is hosted by Edd Dawson and brought to you by KeywordsPeopleUse.com. Help feed the algorithm and leave a review at ratethispodcast.com/seo. You can get your free copy of my 101 Quick SEO Tips at https://seotips.edddawson.com/101-quick-seo-tips. To get a personal no-obligation demo of how KeywordsPeopleUse could help you boost your SEO, and a 7-day free trial of our Standard Plan, book a demo with me now. See Edd's personal site at edddawson.com. Ask me a question and get on the show: click here to record a question. Find Edd on LinkedIn, Bluesky & Twitter. Find KeywordsPeopleUse on Twitter @kwds_ppl_use. "Werq" by Kevin MacLeod (incompetech.com), licensed under Creative Commons: By Attribution 4.0 License, http://creativecommons.org/licenses/by/4.0/
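The episode's advice about describing your organisation with schema.org and pointing sameAs at authoritative profiles can be captured in a small JSON-LD snippet. The sketch below builds one in Python; the organisation name, URLs, and the Wikidata Q-ID placeholder are hypothetical examples, not anything taken from the episode.

```python
import json

# Minimal sketch of schema.org Organization markup with sameAs links, as
# discussed in the episode. All names and URLs below are hypothetical
# placeholders; swap in your own profiles and your real Wikidata item ID.
organization = {
    "@context": "https://schema.org",
    "@type": "Organization",
    "name": "Example Widgets Ltd",
    "url": "https://www.example.com",
    "logo": "https://www.example.com/logo.png",
    "founder": {"@type": "Person", "name": "Jane Doe", "jobTitle": "CEO"},
    "sameAs": [
        "https://www.linkedin.com/company/example-widgets",
        "https://en.wikipedia.org/wiki/Example_Widgets",
        "https://www.wikidata.org/wiki/Q00000000",  # replace with your item's Q-ID
    ],
}

# This JSON-LD would typically sit in a <script type="application/ld+json"> tag.
print(json.dumps(organization, indent=2))
```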

SEO Is Not That Hard
Entities Part 2 : How Machines Learn To Read

SEO Is Not That Hard

Play Episode Listen Later Oct 8, 2025 11:59 Transcription Available


Keywords don't tell the whole story; entities do. We take you inside the three-step process machines use to read your content like a detective at a crime scene: highlighting potential entities, using context to resolve ambiguity, and linking each mention to a unique identifier in a global knowledge base. By the end, you'll see why "Jordan" only makes sense when surrounded by the right clues, and how to present those clues so search engines and AIs make the right call every time. We start with named entity recognition, the digital highlighter that picks out people, organisations, products, places, and dates across unstructured text. Then we move to entity disambiguation, where context (co-occurring teams, locations, or concepts) guides the system to the correct meaning. Finally, we close with entity linking, the moment a string becomes a node with a library card in Wikipedia or Wikidata. That linkage is the bridge into Google's Knowledge Graph, powering features like knowledge panels and richer, more confident results. Along the way, we dig into why Wikipedia and Wikidata matter far beyond vanity. Accurate, well-sourced entries create a feedback loop that improves how machines understand your brand, your founders, and your products. If you don't meet notability yet, don't force it; build authority elsewhere with consistent profiles, structured data, and content that names and connects related entities. We also share a simple action: search for your brand, founder, and main product on Wikipedia and Wikidata and assess accuracy. Want more like this? Follow the show, share it with a colleague, and leave a review so we can help more teams make sense of entity-first SEO.

SEO Is Not That Hard is hosted by Edd Dawson and brought to you by KeywordsPeopleUse.com. Help feed the algorithm and leave a review at ratethispodcast.com/seo. You can get your free copy of my 101 Quick SEO Tips at https://seotips.edddawson.com/101-quick-seo-tips. To get a personal no-obligation demo of how KeywordsPeopleUse could help you boost your SEO, and a 7-day free trial of our Standard Plan, book a demo with me now. See Edd's personal site at edddawson.com. Ask me a question and get on the show: click here to record a question. Find Edd on LinkedIn, Bluesky & Twitter. Find KeywordsPeopleUse on Twitter @kwds_ppl_use. "Werq" by Kevin MacLeod (incompetech.com), licensed under Creative Commons: By Attribution 4.0 License, http://creativecommons.org/licenses/by/4.0/
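The final step the episode describes, entity linking, can be tried against Wikidata directly: its public wbsearchentities API returns candidate items (Q-IDs) for a text mention. The sketch below is a rough illustration, not the episode's own code; the "Michael Jordan" mention is just an example, and a real linker would use the surrounding context to choose among the returned candidates.

```python
import requests

def link_entity(mention: str, language: str = "en"):
    """Return candidate Wikidata items (Q-ID, label, description) for a text mention."""
    resp = requests.get(
        "https://www.wikidata.org/w/api.php",
        params={
            "action": "wbsearchentities",
            "search": mention,
            "language": language,
            "format": "json",
        },
        timeout=10,
    )
    resp.raise_for_status()
    # Each candidate carries a stable identifier plus a short description;
    # disambiguation means picking the candidate that fits the surrounding context.
    return [
        (hit["id"], hit.get("label"), hit.get("description"))
        for hit in resp.json().get("search", [])
    ]

for qid, label, description in link_entity("Michael Jordan")[:5]:
    print(qid, label, "-", description)
```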

The Tech Blog Writer Podcast
3391: What Wikidata Reveals About the Good Side of the Internet

The Tech Blog Writer Podcast

Play Episode Listen Later Aug 19, 2025 19:09


When most people think of Wikipedia, they picture an endless scroll of human-readable pages. But there's another side to this ecosystem, one designed not just for people but also for machines. It's called Wikidata, and if you haven't heard of it, that's exactly why this conversation matters. In this episode of Tech Talks Daily, I'm joined by Lydia Pintscher, Wikidata Portfolio Manager at Wikimedia Deutschland, for a deep look into how structured, open data is quietly powering civic tech, cultural preservation, and knowledge equity across the globe. Wikidata is the backbone that helps turn static knowledge into something living, adaptable, and scalable. With over 117 million items, 1.65 billion semantic statements, and more than 2.34 billion edits, it's become one of the largest collaborative datasets in the world. But it's not just the size that makes it impressive. It's what people are doing with it. Lydia shares how volunteers and developers are building tools for everything from investigative journalism to public libraries, all without needing deep pockets or proprietary infrastructure. This isn't big tech. It's a global, grassroots movement making open data work for the public good. We explore how tools like Toolforge and the Wikidata Query Service lower the barrier to entry, allowing civil society groups to build sophisticated applications that would otherwise be out of reach. Whether it's helping connect citizens to government services or preserving disappearing languages, the use cases are multiplying fast. Lydia also reflects on how Wikidata fosters a sense of purpose for contributors, offering a rare example of what many call the good internet, where collaboration outweighs competition and building something meaningful beats chasing virality. If you're curious about where open knowledge is headed, how structured data can be a force for social impact, or why Wikidata might be the most important project you've never fully explored, this episode offers a window into a future where machines help humans build something better, together.
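The Wikidata Query Service mentioned here is a public SPARQL endpoint that anyone can call without credentials. As a hedged illustration of how low the barrier to entry is, the Python sketch below asks for the occupations recorded on item Q42 (Douglas Adams, the customary example item in Wikidata documentation); it is a minimal demo, not one of the civic-tech tools discussed in the episode.

```python
import requests

# Minimal sketch: query the public Wikidata Query Service (SPARQL endpoint).
# wdt:P106 is the "occupation" property; Q42 is Douglas Adams, used here only
# as a familiar example item.
SPARQL = """
SELECT ?occupationLabel WHERE {
  wd:Q42 wdt:P106 ?occupation .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }
}
"""

resp = requests.get(
    "https://query.wikidata.org/sparql",
    params={"query": SPARQL, "format": "json"},
    headers={"User-Agent": "wikidata-podcast-demo/0.1 (example contact)"},
    timeout=30,
)
resp.raise_for_status()
for row in resp.json()["results"]["bindings"]:
    print(row["occupationLabel"]["value"])
```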

Open||Source||Data
Building Open-Source LLMs with Philosophy | Anastasia Stasenko

Open||Source||Data

Play Episode Listen Later Jun 3, 2025 57:45


Join Charna Parkey as she welcomes Anastasia Stasenko, CEO and co-founder of pleias, and follows her unique journey from philosophy to building open-source, energy-efficient LLMs. Discover how pleias is revolutionizing the AI landscape by training models exclusively on open data and establishing a precedent for ethical and socially acceptable AI. Learn about the challenges and opportunities in creating multilingual models and contributing back to the open-source community.

[00:00:00] Introducing Anastasia and pleias
[00:02:00] From Philosophy to AI
[00:06:00] The Problem of Generic Models
[00:10:00] Open Weights vs. Open Source vs. Open Science
[00:14:00] Why Open Data Matters
[00:18:00] High-Quality, Specialized Models
[00:22:00] Multilingual Challenges
[00:26:00] Global Inclusion Requires Small Models
[00:30:00] Using and Contributing to Wikidata
[00:38:00] The Future: Specialized Models
[00:48:00] Advice for Newcomers
[00:54:00] Cultural Sensitivity and Data Representation
[00:50:00] Leo's Takeaways
[00:52:00] Charna on Ethical, Verifiable AI
[00:54:00] Representation vs. Exclusion
[00:56:00] Letting People Be More Human
[00:57:30] Applied, Transformative AI

QUOTES
Charna: "If you didn't make it represented in the data, then we're leaving another culture behind... So which one are you wanting to do, misrepresent them or just completely leave them behind from this technical revolution?"
Anastasia: "The real issue now is that the lack of diversity in the current AI labs leads to the situation where all LLMs look alike."
Anastasia: "Being able to design, to find, and also to create the appropriate data mix for large language models is something that we shouldn't really forget about when we talk about the success of what large language models are."

Sustain
Episode 260: Robert Douglass and contributing as a corporation to OSS

Sustain

Play Episode Listen Later Dec 13, 2024 32:55


Guest Robert Douglass Panelist Richard Littauer | Abby Cabunoc Mayes Show Notes In this episode of Sustain, hosts Richard Littauer and Abby Cabunoc Mayes speak with Robert Douglass, Entrepreneur in Residence at Open Strategy Partners, to delve into sustaining open source projects. They explore Robert's extensive history with Drupal, the role of Open Strategy Partners, and the innovative Drupal Certified Partner Program designed to address the maker-taker dilemma in open source. The episode also covers the recently launched RFP templates aimed at promoting open source software and certified partners. Robert shares insights on gamification, the economic aspects of contributing to Drupal, and future initiatives to ensure the continued sustainability of open source projects. Hit download now to hear more! [00:01:49] Robert shares his background in the Drupal ecosystem and his involvement with Open Strategy Partners, which provides strategic content marketing for B2B tech companies focusing on open source. [00:02:43] Robert explains Open Strategy Partners' focus on supporting open source projects and mentions clients like DDEV and TYPO3. [00:04:06] Richard and Robert discuss what it means to be an entrepreneur in residence, with Robert explaining his role in developing new products for Open Strategy Partners and the books he has written. [00:05:52] Robert reflects on the early days of Drupal and the challenges in making open source sustainable. He notes how the community was initially driven by passion, with few paid opportunities. [00:08:05] Robert introduces the Drupal Certified Partner Program, a system for supporting Drupal sustainability by encouraging companies to contribute both time and money. [00:10:03] The conversation covers how Drupal's contribution system gamifies the support companies provide to the ecosystem. Companies can earn contribution credits, which are visible on Drupal.org and benefit their reputation. [00:15:41] Abby asks about the potential downsides of gamification, especially regarding diversity. Robert explains how placing the system at the company level may mitigate some negative impacts. [00:18:17] Richard inquires about the financial structure of the Drupal Certified Partner Program. Robert clarifies that the funds collected support the Drupal Association's core mission, including maintaining Drupal.org and organizing events. [00:21:33] Robert discusses the development of RFP (Request for Proposal) templates to encourage companies to consider certified open source providers, explaining how this initiative promotes sustainability in the ecosystem. [00:25:56] Robert describes how the RFP templates allow purpose-driven organizations to incorporate open source values in their procurement process, aligning with their missions. [00:27:00] Robert invites listeners to explore and utilize the RFP templates, which are available under a Creative Commons Zero license, encouraging others to adapt and improve them. [00:29:47] Find out where you can follow Robert and his work online. Quotes [00:08:57] “Open Source is like a free puppy.” Spotlight [00:30:30] Abby's spotlight is Common Sort thrift shop in Toronto. [00:30:52] Richard's spotlight is Wikidata. [00:31:21] Robert's spotlight is Chad Whitacre and Sentry. 
Links SustainOSS (https://sustainoss.org/) podcast@sustainoss.org (mailto:podcast@sustainoss.org) richard@sustainoss.org (mailto:richard@sustainoss.org) SustainOSS Discourse (https://discourse.sustainoss.org/) SustainOSS Mastodon (https://mastodon.social/tags/sustainoss) Open Collective-SustainOSS (Contribute) (https://opencollective.com/sustainoss) Richard Littauer Socials (https://www.burntfen.com/2023-05-30/socials) Abby Cabunoc Mayes X (https://x.com/abbycabs?lang=en) Robert Douglass LinkedIn (https://www.linkedin.com/in/roberttdouglass/) Open Strategy Partners (https://openstrategypartners.com/) Open Strategy Partners Blog (https://openstrategypartners.com/blog/) Building Online Communities with Drupal, phpBB, and WordPress by Robert Douglass, Mike Little, Jared W. Smith (https://www.drupal.org/node/1850002) Drupal Certified Partner Program (https://www.drupal.org/association/become-a-drupal-certified-partner) Drupal (https://www.drupal.org/) How to Write an RFP for Open Source Solutions: Featuring Drupal Certified Partners (https://www.drupal.org/association/blog/how-to-write-an-rfp-for-open-source-solutions-featuring-drupal-certified-partners) OSP: Supporting Drupal Certified Partners (https://openstrategypartners.com/blog/osp-supporting-drupal-certified-partners/) Sustain Podcast-Episode 148: Ali Nehzat of thanks.dev and OSS Funding (https://podcast.sustainoss.org/148) Common Sort (https://commonsort.com/) Wikidata (https://www.wikidata.org/wiki/Wikidata:Main_Page) Chad Whitacre LinkedIn (https://www.linkedin.com/in/chadwhitacre/) Sentry (https://sentry.io/welcome/) Credits Produced by Richard Littauer (https://www.burntfen.com/) Edited by Paul M. Bahr at Peachtree Sound (https://www.peachtreesound.com/) Show notes by DeAnn Bahr Peachtree Sound (https://www.peachtreesound.com/) Special Guest: Robert Douglass.

#arthistoCast – der Podcast zur Digitalen Kunstgeschichte
Folge 16: Wissensgerechtigkeit auf Wikipedia: Kunsthistoriker*innen gestalten mit

#arthistoCast – der Podcast zur Digitalen Kunstgeschichte

Play Episode Listen Later Nov 6, 2024 48:22


In this episode, Jacqueline Klusik-Eckert talks with the art historians Anna Gnyp and Maria Merseburger about the relationship between Wikimedia and art history. Together they discuss how Wikipedia and Wikidata have become valuable resources for art-historical research, and why it matters, from the standpoint of knowledge equity, that subject specialists actively help shape these platforms. They also present the work of the AG Kuwiki working group, which promotes the visibility of art-historical knowledge on Wikipedia through several projects: the "Living Handbook" offers art historians an introduction to working on Wikipedia; "Wikipedia in der Lehre" (Wikipedia in teaching) aims to get students engaged with the platform early on and involve them actively; and "Kuwiki Loves Monuments, too" supports the documentation and dissemination of images of monuments and cultural heritage. A central concern is knowledge equity, in order to achieve more diversity on Wikipedia and Wikimedia Commons. The conversation also highlights the growing importance of Wikidata as a database-backed resource that is increasingly used in digital art-history projects. Anna and Maria show how museums, archives, and libraries can benefit from Wikidata and Wikimedia Commons to make their holdings publicly accessible and create new connections. They close with a plea for stronger cooperation and for "best practice" examples that can consolidate and enrich work with Wikimedia projects in art history. Anna Gnyp has been a member of the working group for almost two years. She is currently a researcher at the data competence centre "Sammlungen, Objekte, Datenkompetenz" at Humboldt-Universität Berlin, a joint project building a data competence centre for academic university collections. Dr Maria Merseburger has been part of AG Kuwiki from the beginning, under the user name Karatecoop, and is currently a researcher at the Museum für Kommunikation in Berlin. Supplementary material for the episodes can be found on the homepage at https://www.arthistoricum.net/themen/podcasts/arthistocast. All episodes of the podcast are archived at heidICON with metadata and a persistent identifier. The episodes are licensed under Creative Commons CC BY 4.0 and can be downloaded; you can find them at https://doi.org/10.11588/heidicon/1738702. For questions, suggestions, criticism, and of course praise, you can email us at podcast@digitale-kunstgeschichte.de.

Somewhere on Earth: The Global Tech Podcast
Could making Wikidata 'human' readable lead to better AI?

Somewhere on Earth: The Global Tech Podcast

Play Episode Listen Later Oct 10, 2024 31:41


Could making Wikidata 'human' readable lead to better AI? A new project is underway to allow Large Language Models (LLMs) to read Wikidata. The data is currently structured in a way that's machine readable, but LLMs read data more like humans than machines, meaning this vast amount of human curated, high quality data isn't accessible to this type of AI. By allowing access to Wikidata, LLMs could become more reliable. Ania spoke to Lydia Pintscher, the Portfolio Lead Product Manager at Wikidata Deutschland, to learn more about these developments. Most news websites block AI Chatbots Two thirds of high quality news websites block AI chatbots from accessing their information, according to a report by the misinformation monitoring organisation NewsGuard. This means that some of the world's most popular AI chatbots could be collecting data on misinformation from low quality news sources and even conspiracy and hoax sites. The Enterprise Editor at NewsGuard is Jack Brewster and he is on the show to explain their findings. The programme is presented by Gareth Mitchell and the studio expert is Ania Lichtarowicz. More on this week's stories: Wikidata and Artificial Intelligence: Simplified Access to Open Data for Open-Source Projects AI Chatbots Are Blocked by 67% of Top News Sites, Relying Instead on Low-Quality Sources Support the show Editor: Ania Lichtarowicz Production Manager: Liz Tuohy Recording and audio editing : Lansons | Team Farner For new episodes, subscribe wherever you get your podcasts or via this link: https://www.buzzsprout.com/2265960/supporters/new Follow us on all the socials: Join our Facebook group Instagram Twitter/X If you like Somewhere on Earth, please rate and review it on Apple Podcasts Contact us by email: hello@somewhereonearth.co Send us a voice note: via WhatsApp: +44 7486 329 484 Find a Story + Make it News = Change the World Learn more about your ad choices. Visit megaphone.fm/adchoices
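The project described here is about turning Wikidata's machine-readable statements into text an LLM can consume. As a toy sketch of that idea (assuming nothing about the actual project's code), the snippet below pulls an item's structured data from Wikidata's public Special:EntityData endpoint and renders its label and description as a plain sentence; a real pipeline would also resolve the property and value IDs in each claim to human-readable labels. Q42 (Douglas Adams) is used only as a familiar example item.

```python
import requests

def verbalise(qid: str, lang: str = "en") -> str:
    """Render a Wikidata item's label and description as a plain-English sentence."""
    entity = requests.get(
        f"https://www.wikidata.org/wiki/Special:EntityData/{qid}.json",
        timeout=10,
    ).json()["entities"][qid]
    label = entity["labels"][lang]["value"]
    description = entity["descriptions"][lang]["value"]
    n_statements = sum(len(group) for group in entity["claims"].values())
    # Naive sentence template; a fuller verbaliser would also spell out each claim.
    return f"{label} is {description}. Wikidata records {n_statements} statements about {label}."

print(verbalise("Q42"))
```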

Somewhere on Earth: The Global Tech Podcast
Could making Wikidata 'human' readable lead to better AI?

Somewhere on Earth: The Global Tech Podcast

Play Episode Listen Later Oct 8, 2024 27:26 Transcription Available


Could making Wikidata 'human' readable lead to better AI? A new project is underway to allow Large Language Models (LLMs) to read Wikidata. The data is currently structured in a way that's machine readable, but LLMs read data more like humans than machines, meaning this vast amount of human curated, high quality data isn't accessible to this type of AI. By allowing access to Wikidata, LLMs could become more reliable. Ania spoke to Lydia Pintscher, the Portfolio Lead Product Manager at Wikidata Deutschland, to learn more about these developments.

Most news websites block AI Chatbots
Two thirds of high quality news websites block AI chatbots from accessing their information, according to a report by the misinformation monitoring organisation NewsGuard. This means that some of the world's most popular AI chatbots could be collecting data on misinformation from low quality news sources and even conspiracy and hoax sites. The Enterprise Editor at NewsGuard is Jack Brewster and he is on the show to explain their findings.

The programme is presented by Gareth Mitchell and the studio expert is Ania Lichtarowicz.

More on this week's stories: Wikidata and Artificial Intelligence: Simplified Access to Open Data for Open-Source Projects; AI Chatbots Are Blocked by 67% of Top News Sites, Relying Instead on Low-Quality Sources.

Support the show. Editor: Ania Lichtarowicz. Production Manager: Liz Tuohy. Recording and audio editing: Lansons | Team Farner. For new episodes, subscribe wherever you get your podcasts or via this link: https://www.buzzsprout.com/2265960/supporters/new. Follow us on all the socials: join our Facebook group, Instagram, Twitter/X. If you like Somewhere on Earth, please rate and review it on Apple Podcasts. Contact us by email: hello@somewhereonearth.co. Send us a voice note via WhatsApp: +44 7486 329 484. Find a Story + Make it News = Change the World

Somewhere on Earth: The Global Tech Podcast

Subscriber-only episode. Could making Wikidata human readable lead to better AI? A new project is underway to allow Large Language Models (LLMs) to read Wikidata. The data is currently structured in a way that's machine readable, but LLMs read data more like humans than machines, meaning this vast amount of human curated, high quality data isn't accessible to this type of AI. By allowing access to Wikidata, LLMs could become more reliable. Ania spoke to Lydia Pintscher, the Portfolio Lead Product Manager at Wikidata Deutschland, to learn more about these developments.

Most news websites block AI Chatbots
Two thirds of high quality news websites block AI chatbots from accessing their information, according to a report by the misinformation monitoring organisation NewsGuard. This means that some of the world's most popular AI chatbots could be collecting data on misinformation from low quality news sources and even conspiracy and hoax sites. The Enterprise Editor at NewsGuard is Jack Brewster and he is on the show to explain their findings.

The programme is presented by Gareth Mitchell and the studio expert is Ania Lichtarowicz.

More on this week's stories: Wikidata and Artificial Intelligence: Simplified Access to Open Data for Open-Source Projects; AI Chatbots Are Blocked by 67% of Top News Sites, Relying Instead on Low-Quality Sources.

Editor: Ania Lichtarowicz. Production Manager: Liz Tuohy. Recording and audio editing: Lansons | Team Farner. For new episodes, subscribe wherever you get your podcasts or via this link: https://www.buzzsprout.com/2265960/supporters/new. Follow us on all the socials: join our Facebook group, Instagram, Twitter/X. If you like Somewhere on Earth, please rate and review it on Apple Podcasts. Contact us by email: hello@somewhereonearth.co. Send us a voice note via WhatsApp: +44 7486 329 484. Find a Story + Make it News = Change the World

Information Morning from CBC Radio Nova Scotia (Highlights)
Wikipedia edit-a-thon to improve Mi'kmaw content online

Information Morning from CBC Radio Nova Scotia (Highlights)

Play Episode Listen Later Aug 2, 2024 7:26


During National Indigenous History Month, Dalhousie University Libraries hosted an edit-a-thon to improve Wikipedia and Wikidata content related to Mi'kmaw people and Mi'kma'ki. A total of 19 Wikipedia articles were edited, 50 references added, and more than 3,300 words were contributed. One of the organizers fills us in. 

Pojačalo
EP 258: Goran S. Milovanović, DataKolektiv & Smartocto - Pojačalo podcast

Pojačalo

Play Episode Listen Later Mar 17, 2024 156:00


"I have always done only what I loved and what fired me up." Ivan Minić's guest in episode 258 of Pojačalo is Dr Goran S. Milovanović, a doctor of psychological sciences who did not take the path we usually associate with that field (psychologist, psychiatrist, or psychotherapist) but instead specialised in cognitive and data science and machine learning: fields that require a deep understanding of the human mind and reasoning, but also a high level of technological knowledge and expertise. Dr Milovanović's career path could fill three episodes of this length on its own, but this episode is devoted to his youth, the story of what set him on that path, his work on Wikidata, the largest knowledge base in the world, and the company Smartocto, which does predictive analytics for digital media. Topics in the episode: Introduction; Start of the conversation; When I grow up I'll be...; The good old days; University days; Fundamental science; Big data analytics; Cognitive biases; Wikidata and Smartocto; The problem of digital media; DataKolektiv. The production of the Pojačalo podcast would not be possible without our exceptional partners: Epson, the world's leading manufacturer of projectors and printers for every purpose: https://www.epson.rs/sr_RS, and Orion telekom, the provider of the fastest internet infrastructure in Serbia with over 30 years of experience: https://oriontelekom.rs. Support us on BuyMeACoffee: https://bit.ly/3uSBmoa. Read the transcript of this episode: https://bit.ly/3PrfigS. Visit our website and join our mailing list: http://bit.ly/2LUKSBG. Subscribe to our YouTube channel: http://bit.ly/2Rgnu7o. Follow Pojačalo on social media: Facebook: http://bit.ly/2FfwqCR Twitter: http://bit.ly/2CVZoGr Instagram: http://bit.ly/2RzGHjN

Glitterbrains
010 - Monogames Lesen

Glitterbrains

Play Episode Listen Later Jan 20, 2024 119:31


In this first episode of 2024, Letty and hukl have once again brought along an extremely colourful bouquet of topics for you. Shoes, game engines, drinking water and osmosis, Wikidata and ChatGPT, and much more finds its way into your ears and synapses. Please crackle along with your synapses! You'll find the mixtape playlists in the show notes (not 100% complete, because not everything is streamable).

Computer und Kommunikation Beiträge - Deutschlandfunk
Datenspende: Wikidata soll die Qualität von Large Language Modellen verbessern

Computer und Kommunikation Beiträge - Deutschlandfunk

Play Episode Listen Later Oct 28, 2023 6:56


Manfred Kloiber, www.deutschlandfunk.de, Forschung aktuell. Direct link to the audio file.

Pizzel Podcast
Pizzel Ep. 99 - Q177

Pizzel Podcast

Play Episode Listen Later Oct 12, 2023 67:02


Pedro moves house and gives us a first-hand account of his experiences with Japanese removal companies, and then brings us up to date on Wikipedia's big ambitions and the Wikidata project. Finally, Javier writes a love letter to one of his favourite musicians, Louis Cole, in the form of a podcast segment.

Escola Mobile. Biznes masz w kieszeni
Wikipedia: edukacja i technologia - Natalia Ćwik (CEO Wikimedia Polska) EM #156

Escola Mobile. Biznes masz w kieszeni

Play Episode Listen Later Sep 20, 2023 52:37


Do you remember the first time you used Wikipedia? Or has using the Wiki become so natural that you no longer even wonder where the entries in this project come from? In this episode of the podcast our guest is the CEO of Wikimedia Polska. Natalia, together with a huge number of committed people, is building the largest social project in the history of humanity. The Wikimedia Polska association works for universal access to knowledge. It supports and promotes Wikipedia and its sister projects (the Wikimedia projects) and is an independent partner of the Wikimedia Foundation. A project with "Wiki" in its name is more than the first result in a search engine. We talk about volunteers, that is, Wikipedians, and about broad cooperation for education: universities, schools, companies, startups, galleries and libraries, and anyone who would like to help create Wikipedia and spread education. Wikipedia sets trends and builds an educational resource thanks to its technological backbone and the energy of its people. As a result, Wikipedia is the third-largest provider of information in the European Union: with no ads, no need to commercialise, funded solely by donations from people who want to support access to free knowledge, constantly growing, and at a gigantic pace, available in more than 300 language versions and increasingly interconnected and coordinated. What does Wikipedia look like behind the scenes? Is AI already in Wikipedia, and how is vandalism on Wikipedia repaired? Listen to the podcast to find out what we cover in this episode.

Wikipedia sound logo: https://commons.m.wikimedia.org/wiki/File:Wikimedia_Sonic_Logo_-_4-seconds.wav
Patronite: https://patronite.pl/Wikipedia
Wikiszkoła project: https://wikiszkola.pl
Wikimedia Commons: https://commons.m.wikimedia.org/wiki/Strona_główna

The Wikimedia Foundation is an American non-profit organisation founded to develop Wikipedia and its sister projects (such as Wikimedia Commons, Wikidata, Wikisource, Wiktionary, etc.). Among other things, it maintains the servers that host all the world's language versions of Wikipedia. Wikipedia is the largest and best-known project run by the WMF. The WMF also supports the development of other language versions of Wikipedia and of the sister projects around the world. Wikimedia Polska is an independent partner of the Wikimedia Foundation, holding the exclusive right to use the Wikipedia brand in Poland. Wikimedia Polska supports the development of the Polish Wikipedia and its sister projects (e.g. Wikisłownik, Wikiźródła, Wikicytaty, Wikidane, Wikimedia Commons and others).

1 - Intro (00:01:10)
2 - About the guest (00:01:31)
3 - Wikipedia and Wikimedia (00:03:39)
4 - How Wikipedians work (00:05:56)
5 - What Wikipedia is today (00:12:38)
6 - Working at NBP (00:20:49)
7 - Wikipedia sets trends (00:23:51)
8 - How content is chosen on Wikipedia (00:25:44)
9 - Wikipedia on mobile (00:28:58)
10 - Accessibility (00:32:21)
11 - Working with technology providers (00:34:40)
12 - AI and Wikipedia (00:37:49)
13 - Vandalism on Wikipedia (00:41:49)
14 - The future of Wikipedia (00:46:53)
15 - Outro (00:50:22)

Music: "Werq" by Kevin MacLeod (incompetech.com), licensed under Creative Commons: By Attribution 4.0 License / mix by Jedrzej Paulus, https://creativecommons.org/licenses/by/4.0/
Rate our podcast on Apple Podcasts: https://bit.ly/EscolaMobileIT

#arthistoCast – der Podcast zur Digitalen Kunstgeschichte
Folge 6: Normdaten in der Kunstgeschichte

#arthistoCast – der Podcast zur Digitalen Kunstgeschichte

Play Episode Listen Later Sep 6, 2023 56:55


In this episode, Jacqueline Klusik-Eckert talks with Angela Kailus M.A. and Julia Rössel M.A. about the role of authority data (Normdaten) in art-historical research and practice. The origin of authority data lies in library systematisation efforts of the 1970s. How have the handling and the concepts of authority data changed in the course of digitisation? With a look behind the scenes of the Gemeinsame Normdatei (GND for short), they explain how an identification number relates to the information behind it. What added value do you gain for your own data by using authority data? For which terms or entities do authority records exist? Where do you find them? Where does this knowledge come from, and how should one work with an authority record? We also talk about the difference between an institutionally maintained and authorised authority file (the GND, via the German National Library) and crowd-based authority data (Wikidata). Beyond that, the question arises to what extent the use of authority data has already found its way into the discipline of art history. We look at the challenge collecting institutions face when cataloguing objects and enriching collection data with authority records. Which standards help with cataloguing, and why go to the trouble of using authority data at all? We examine different scenarios across the data life cycle: where do we as researchers encounter this data, how can we reuse it, and what responsibility do we ourselves bear as producers of research data when it comes to enriching our own data with authority records? Angela Kailus M.A. is deputy director of the Deutsches Dokumentationszentrum für Kunstgeschichte – Bildarchiv Foto Marburg, Philipps-Universität Marburg, and the contact person at NFDI4Culture for standardisation and data quality. Julia Rössel M.A. is an art historian and works at the Deutsche Digitale Bibliothek, Fachstelle Denkmalpflege, DDK, Marburg. Alongside her doctoral thesis on "Wechsel der Mediensysteme – Graphische Sammlung und ihre digitale Übersetzung", she has specialised in digitisation and museums, data quality, and standards. Supplementary material for the episodes can be found on the homepage at https://www.arthistoricum.net/themen/podcasts/arthistocast. All episodes of the podcast are archived at heidICON with metadata and a persistent identifier. The episodes are licensed under Creative Commons CC BY 4.0 and can be downloaded; you can find them at https://doi.org/10.11588/heidicon/1738702. For questions, suggestions, criticism, and of course praise, you can email us at podcast@digitale-kunstgeschichte.de.

Wikipediapodden
Avsnitt 230 – nu ringer vectorklockan

Wikipediapodden

Play Episode Listen Later Sep 6, 2023 21:26


We puzzle over how other sources reuse data from Wikipedia and Wikidata, how reference lists can be added en masse, the big Vector switchover that was just announced, and audio in a row.

On the Media
The Lasting Impact of the Library of Alexandria

On the Media

Play Episode Listen Later Aug 16, 2023 16:24


In the first half of the last school year, PEN America recorded almost 900 different books pulled from library shelves across the country. As long as libraries have existed, people have tried to police what goes in them. The burning of the Library of Alexandria is a metaphor that gets invoked any time we lose access to a treasure trove of books. But for centuries it has also inspired scientists and inventors, philosophers and programmers to dream about creating an ideal library, one that provides access to all the knowledge in the world. OTM producer Molly Schwartz goes to a birthday party for Wikidata at the Brooklyn Public Library, where she talks to Wikimedia New York City president Richard Knipel, Wikimedia software engineer James Forrester, and long-time Wikipedia editor Jim Henderson about how the free online encyclopedia has made strides toward providing access to the sum of human knowledge. She also speaks with library historian Alex Wright, author of the book Glut: Mastering Information Through the Ages, and software engineering consultant Gyula Lakatos, creator of the Library of Alexandria application suite, about the history of universal library projects and what keeps the dream alive.

On the Media
The Lasting Impact of the Library of Alexandria

On the Media

Play Episode Listen Later Aug 16, 2023 16:21


In the first half of the last school year, PEN America recorded almost 900 different books pulled from library shelves across the country. As long as libraries have existed, people have tried to police what goes in them. The burning of the Library of Alexandria is a metaphor that gets invoked any time we lose access to a treasure trove of books. But for centuries it has also inspired scientists and inventors, philosophers and programmers to dream about creating an ideal library, one that provides access to all the knowledge in the world. OTM producer Molly Schwartz goes to a birthday party for Wikidata at the Brooklyn Public Library, where she talks to Wikimedia New York City president Richard Knipel, Wikimedia software engineer James Forrester, and long-time Wikipedia editor Jim Henderson about how the free online encyclopedia has made strides toward providing access to the sum of human knowledge. She also speaks with library historian Alex Wright, author of the book Glut: Mastering Information Through the Ages, and software engineering consultant Gyula Lakatos, creator of the Library of Alexandria application suite, about the history of universal library projects and what keeps the dream alive.

The Nonlinear Library
AF - An LLM-based “exemplary actor” by Roman Leventov

The Nonlinear Library

Play Episode Listen Later May 29, 2023 24:24


Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: An LLM-based "exemplary actor", published by Roman Leventov on May 29, 2023 on The AI Alignment Forum. Intro and summary This post is the second section of "Aligning an H-JEPA agent via training on the outputs of an LLM-based "exemplary actor", posted separately because I think it could warrant a separate discussion, largely independent of the discussion of H-JEPA agent with GFlowNet actors. Here's the summary of this post, copied from the "Overview" section of the main article: In section 2, I describe the "exemplary actor", an LMCA (language model cognitive architecture) that takes a simple, "brute force" approach to alignment: a powerful LLM (think GPT-5/6 level, with a vast, or quasi-unlimited context) is given a list of "approved" textbooks on methodological and scientific disciplines: epistemology, rationality, ethics, physics, etc. Also, the LLM is given tools: narrow AIs (such as for protein folding or for predicting properties of materials, or for formal scientific modelling). Finally, the LLM is given a compute engine such as Wolfram and a knowledge base such as Wikidata or Wolfram Knowledgebase. The exemplary actor creates plans or predictions for given situations (described in language and fed to the LLM underlying the exemplary actor as prompts) and iteratively critiques and refines its own plans and predictions while putting different textbooks into the LLM context (first, with the textbook on rationality, then epistemology, then physics, etc., with potentially dozens of different textbooks relevant for a plan or prediction that is being criticised), for many iterations, until convergence. In section 2.1, I note that the type of alignment that the exemplary actor's architecture tries to ensure is called (world) model alignment and that it is stronger and also more essential than goal alignment. Then, I discuss the properties of the exemplary actor. In section 2.2, I discuss what I see as likely non-issues or straightforwardly addressable issues: the "divergent reasoning nature" of LLMs, the lack of grounded common sense reasoning, and the bias of the quick reactive network ("System 1"), if it is added to the architecture to make it more practically usable in lower-stakes reasoning settings. In section 2.3, I discuss the outstanding technical issues and risks of the exemplary actor's architecture: The risk of direct access to the underlying LLM (section 2.3.1). The exemplary actor's reasoning could still be partially directed by "alien" thinking patterns (i.e., the world model) of the underlying LLM even though these influences won't surface in the explanations of the plan (section 2.3.2). Iterated critique and refinement probably won't make plans strictly conforming to the theories described in the textbooks (section 2.3.3). In section 2.3.4, I discuss the alignment tax of the exemplary actor (compared with the baseline of a bare, minimally fine-tuned LLM) and conclude that the main source of alignment tax might happen to be the theory of ethics which may force the exemplary actor to refuse to participate in "games" (i.e., real-world situations and environments) where it doesn't see ethical ways of "winning", and thus will consider inaction (or some form of palliative action) the only ethical way forward.
This is not a technical problem with the exemplary actor per se, but rather a problem with a higher-level system, i.e., the current economic, social, and political structure of the world. I mention this and other kinds of “higher-level” risks of the plans to build and deploy the exemplary actor (i.e., roughly the plans that OpenAI and Anthropic are betting on, as it seems to me) in section 2.4. 2. An LLM-based “exemplary actor” Let's assume that we have three things: First, a very powerful auto-regressive LLM (think GPT-5/6 level) with the ability to effe...

Funny Science Fiction
S2E111 - A Vorta and a Bajoran Walk into a Spaceport ft. Kitty Swink

Funny Science Fiction

Play Episode Listen Later Dec 9, 2022 49:11


A Vorta and a Bajoran Walk into a Spaceport You may best recognize this week's guest from her work on Babylon 5, Star Trek Deep Space Nine, JAG, NYPD Blue, Becker and more! Kitty Swink is our guest this week, and we are excited to have her join us! Kitty talks about her work in the performing arts over the years, her love for Star Trek and how it's impacted her life, her fight with and survival of pancreatic cancer, along with her work with the Pancreatic Cancer Action Network and more! Be sure to check out her response to WikiData claiming they have proof that she's an actual human! You can find out more about the Pancreatic Cancer Action Network by visiting www.pancan.org For more information about Kitty Swink, please visit the following - Twitter - @kitswink For RSWOF Merch - https://www.teepublic.com/t-shirt/31938193-rswof?store_id=1397534 100% of all proceeds benefit Wish Upon a Teen For direct contributions - www.wishuponateen.org Join our Discord! https://discord.gg/cpry4fCDTq FSF PopCast on Twitter and Instagram - @fsfpopcast Buy us Coffee - https://ko-fi.com/fsfpopcast For more on our show partners - Big Boy Graphics - www.etsy.com/shop/bigboygraphics Bridgework Studios - https://www.teepublic.com/user/bridgework-studios Level Up Sabers - https://shareasale.com/r.cfm?b=2018189&u=3289465&m=124959&urllink=&afftrack= Support The FSF PopCast by contributing to their tip jar: https://tips.pinecast.com/jar/funny-science-fiction

The FSF PopCast
S2E111 -A Vorta and a Bajoran Walk into a Spaceport ft. Kitty Swink

The FSF PopCast

Play Episode Listen Later Dec 9, 2022 49:11


A Vorta and a Bajoran Walk into a Spaceport You may best recognize this week's guest from her work on Babylon 5, Star Trek Deep Space Nine, JAG, NYPD Blue, Becker and more! Kitty Swink is our guest this week, and we are excited to have her join us! Kitty talks about her work in the performing arts over the years, her love for Star Trek and how it's impacted her life, her fight with and survival of pancreatic cancer, along with her work with the Pancreatic Cancer Action Network and more! Be sure to check out her response to WikiData claiming they have proof that she's an actual human! You can find out more about the Pancreatic Cancer Action Network by visiting www.pancan.org For more information about Kitty Swink, please visit the following - Twitter - @kitswink For RSWOF Merch - https://www.teepublic.com/t-shirt/31938193-rswof?store_id=1397534 100% of all proceeds benefit Wish Upon a Teen For direct contributions - www.wishuponateen.org Join our Discord! https://discord.gg/cpry4fCDTq FSF PopCast on Twitter and Instagram - @fsfpopcast Buy us Coffee - https://ko-fi.com/fsfpopcast For more on our show partners - Big Boy Graphics - www.etsy.com/shop/bigboygraphics Bridgework Studios - https://www.teepublic.com/user/bridgework-studios Level Up Sabers - https://shareasale.com/r.cfm?b=2018189&u=3289465&m=124959&urllink=&afftrack= Support The FSF PopCast by contributing to their tip jar: https://tips.pinecast.com/jar/funny-science-fiction

Sustain
Episode 147: Jan Ainali and the Foundation for Public Code

Sustain

Play Episode Listen Later Dec 2, 2022 36:42


Guest Jan Ainali Panelists Richard Littauer | Justin Dorfman Show Notes Hello and welcome to Sustain! The podcast where we talk about sustaining open source for the long haul. On this episode, we have an amazing guest, Jan Ainali, who's a steward at the Foundation for Public Code, where they develop tools and processes and collect best practices for community building. Earlier, he ran a consultancy called Open by Default; he was swept into the 'open movement' through Wikipedia editing, which led him to co-found the Swedish Wikimedia chapter and become its chairman and later its CEO. Today, we'll learn all the details about the Foundation for Public Code, the process of becoming a member, what sets them apart from others, and their Standard for Public Code that is for anyone who wants to prepare their code to be collaborated on. Also, Jan shares what he's looking forward to in the next few months about the standard and the Foundation for Public Code. Download this episode now to learn much more! [00:02:02] Jan tells us about the Foundation for Public Code, as well as how many member organizations it has. [00:03:43] With only one member currently, we find out if Jan is trying to get others on to work with the foundation, he explains the process of becoming a member with them, and the team sizes that are directly working with them. [00:07:02] We learn what sets the Foundation for Public Code apart from the other trans-provincial and trans-governmental organizations that are doing the work of InnerSource Commons but with politics. He also goes into the policies that have gone into code that he's worked on. [00:09:46] Wikimedia and Wikipedia have chapters, and Jan reveals how big his was, how many other chapters there are in the world, and the difference between them. [00:11:15] Find out who Sverker Johansson is and what he did. [00:13:12] Jan tells us more about the Standard for Public Code, what it is, how it applies, who wrote it, and we hear the 16 criteria for it. [00:18:13] Jan explains the "must be in plain English" requirement and what the global efforts are for the Foundation for Public Code. [00:20:13] We learn how Jan is making it beneficial for everyone to join in a way that helps them out as being super awesome and grow the network that way. [00:21:02] Has Jan gotten any pushback from developers in other places? [00:22:24] Jan tells us about businesses he's working with to help push this initiative forward. [00:24:38] We hear if there's a sign-on process for getting people to use this standard, and Jan talks about the accreditation process. [00:29:14] Richard asks Jan if he knows of other standards that are already in this space and what sets his apart from the others. [00:30:32] Jan explains their level of standards, as well as what he's most looking forward to in the next few months about the standard and the foundation. [00:32:54] Find out where you can follow Jan on the web. Quotes [00:03:19] "We really think if you collaborate with everyone, that's better than to collaborate with just a few." [00:07:22] "We only work with code bases with the public purpose where someone tries to put a policy into code. That's where we're a little bit narrower than others." Spotlight [00:34:04] Justin's spotlight is WeasyPrint, an open source tool that turns simple HTML pages into gorgeous PDFs. [00:34:39] Richard's spotlight is Sverker Johansson. [00:35:10] Jan's spotlight is Denny Vrandečić, first project manager at Wikidata, and right now working on Abstract Wikipedia.
Links SustainOSS (https://sustainoss.org/) SustainOSS Twitter (https://twitter.com/SustainOSS?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor) SustainOSS Discourse (https://discourse.sustainoss.org/) podcast@sustainoss.org (mailto:podcast@sustainoss.org) Richard Littauer Twitter (https://twitter.com/richlitt?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor) Justin Dorfman Twitter (https://twitter.com/jdorfman?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor) Jan Ainali Twitter (https://twitter.com/Jan_Ainali?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor) Jan Ainali Website (https://ainali.com/) Foundation for Public Code (https://publiccode.net/) Standard for Public Code (https://standard.publiccode.net/) WeasyPrint (https://weasyprint.org/) Standard for Public Code (Book) (https://github.com/publiccodenet/standard#generating-a-pdf-of-the-standard-for-public-code) Sverker Johansson-Wikipedia (https://en.wikipedia.org/wiki/Sverker_Johansson) [The Dawn of Language by Sverker Johansson](https://www.amazon.com/The-Dawn-of-Language/dp/1529411416/ref=tmmpapswatch0?encoding=UTF8&qid=&sr=) xkcd-Standards (https://xkcd.com/927/) Wikidata (https://www.wikidata.org/wiki/Wikidata:Main_Page) Abstract Wikipedia (https://meta.wikimedia.org/wiki/Abstract_Wikipedia) Credits Produced by Richard Littauer (https://www.burntfen.com/) Edited by Paul M. Bahr at Peachtree Sound (https://www.peachtreesound.com/) Show notes by DeAnn Bahr Peachtree Sound (https://www.peachtreesound.com/) Special Guest: Jan Ainali.

Starke Frauen
#164 Maria Theresia von Österreich - prägende Monarchin der Ära des aufgeklärten Absolutismus

Starke Frauen

Play Episode Listen Later Nov 29, 2022 28:42


She was a princess of the powerful House of Habsburg, which since the Middle Ages has produced numerous European rulers, including all emperors of the Holy Roman Empire (of the German Nation) until 1806. Among them: Maria Theresa. From 1740 until her death she was Archduchess of Austria and Queen of, among other realms, Hungary (with Croatia) and Bohemia. She emerged as the winner of the War of the Austrian Succession: by 1748, in the Peace of Aachen, Maria Theresa had prevailed as the rightful heir of Charles VI, who died in 1740, although she still lost some territories such as Silesia, as well as the duchies of Parma and Piacenza. Maria Theresa is particularly known for political and social reforms that appear very modern from today's perspective, including a greater role for education in society. With this, Maria Theresa became a defining figure of enlightened absolutism. More about her in this episode. Photo credits: Wikimedia Commons, "Maria Theresia von Österreich-Este", lithograph by Franz Eybl after a painting by Moritz Daffinger, 1845. Date: 1845. Source: own photo of an original lithograph held by the ÖNB (Vienna). Author: Franz Eybl (1806–1880). Photo: Peter Geymayer at German Wikipedia. Editorial support: Daniel Jacob. More about us at: https://linktr.ee/starkefrauen **************** ADVERTISEMENT: Our podcast is kindly supported by CLARK, the digital insurance manager. Get yourself insured "up" now with the free CLARK app and find all the details about the shopping voucher with the code "STARKEFRAUEN" here: https://www.clark.de/landing/social/starke-frauen/ Would you like to treat Cathrin or Kim to a coffee and in return listen to the episodes ad-free? Then click on the following link: https://plus.acast.com/s/starke-frauen. Hosted on Acast. See acast.com/privacy for more information.

UVA Data Points
WikiProject Biography

UVA Data Points

Play Episode Listen Later Nov 21, 2022 36:42


This bonus episode features a conversation between Lane Rasberry, Wikimedian-In-Residence at the UVA School of Data Science, and Lloyd Sy, a Ph.D. candidate in the UVA Department of English. In this conversation, Lane and Lloyd take a deep dive into the expansive world of Wikidata and ask the existential question, "What makes a person a person?" Or, more specifically, what data points make up a person? To help answer this question, Lloyd developed a large-scale data model of the biographical data contained within the Wikidata platform. This project serves as the foundation for their conversation. They also take a wide view of biographical data as it pertains to research and academia, including the process of gathering the data, the ethics of utilizing the data, personal ownership of the data, and much more. Anyone interested in these concepts should find this discussion valuable. Links: WikiProject Biography Music: "Screen Saver" Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0 Licensehttp://creativecommons.org/licenses/by/4.0/
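For a concrete sense of which "data points make up a person" on Wikidata, the sketch below lists the properties used on a single person item via the public Special:EntityData endpoint. It is only an illustrative snippet, not Lloyd's data model; Q7186 is assumed here to be the item for Marie Curie, and any person's Q-ID could be substituted.

```python
import requests

# Illustrative only: list which Wikidata properties describe one person item.
# Q7186 is assumed to be Marie Curie's item; substitute any person's Q-ID.
qid = "Q7186"
entity = requests.get(
    f"https://www.wikidata.org/wiki/Special:EntityData/{qid}.json",
    timeout=10,
).json()["entities"][qid]

claims = entity["claims"]
label = entity["labels"]["en"]["value"]
print(f"{label} is described by {len(claims)} distinct properties, for example:")
# Show the ten properties with the most statements (e.g. P569 = date of birth).
for pid, statements in sorted(claims.items(), key=lambda kv: -len(kv[1]))[:10]:
    print(f"  {pid}: {len(statements)} statement(s)")
```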

Nova Ràdio Lloret
Bon Dia Bona Hora – 11 Agost 2022

Nova Ràdio Lloret

Play Episode Listen Later Aug 11, 2022 59:49


This Thursday we ponder the following question: who is the most popular personality born in Lloret de Mar? The most popular personality born in Lloret de Mar is the footballer Marc Muniesa, according to a map created by a Finnish geographer using data from Wikipedia and Wikidata. It is a piece of news, [...] The news item "Bon Dia Bona Hora – 11 Agost 2022" was published on the Nova Ràdio Lloret website.

Infotecarios Podcast
InfoTecarios Podcast 106: Wikipedia, Wikimedia Commons y Wikidata (Con Luis Alvaz)

Infotecarios Podcast

Play Episode Listen Later Jun 20, 2022 71:32


A new edition of the www.infotecarios.com podcast. In this edition we talk with Luis Álvarez Azcárraga about documenting, recording, and archiving public space through free-knowledge tools: Wikipedia, Wikimedia Commons and Wikidata. Known as Luis Alvaz within the Wikimedia México and Wikipedia community, Luis holds a PhD in Sociocultural Studies (UAA), a master's degree in Aesthetics and Art (BUAP) and a bachelor's degree in Communication Sciences (BUAP). He has taught in the bachelor's programmes in Art Studies and Cultural Management (UAA) and in Visual Arts (Universidad de las Artes, ICA) since 2015. His research areas are collaborative and participatory Internet communities, such as Wikipedia. He has also studied creative processes in the digital arts, as well as sound art and systems of collaboration in art and music. He is interested in free licences, copyleft culture, open science, and knowledge as a commons. He has edited Wikipedia since 2014 and has volunteered with Wikimedia México since that same year, organising workshops and edit-a-thons and bringing the tool into university teaching. He has been on the board of Wikimedia México since 2016 and has served as its vice-president since this year. He has also been a member of Creative Commons México since 2020. His artistic activities: he has composed music for documentaries and plays, such as Corazón gordito (Teatro escolar Aguascalientes, 2016-2017) and Clarita (Muestra Nacional de Teatro 2019), with the Ramas y Raíces collective. Contributor to the Se Oye Libre podcast of Wikimedia México (2021-2022). Radio producer, editor and presenter: El Grito del Silencio (2010-2011); El Ágora (RadioBuap, 2008); Comunicación al Aire (RadioBuap, 2006-2007). He is a section editor of the Revista Arte Imagen y Sonido of the Universidad Autónoma de Aguascalientes.

Podcast Libre à vous !
Software Heritage

Podcast Libre à vous !

Play Episode Listen Later Mar 1, 2022 58:10


References: Software Heritage; Morgane Gruenpeter's website; Software Stories (uses Wikidata); Save code now; the Software Heritage ambassadors; applying to Software Heritage; the best-practices guide; the Software Heritage archive; the Internet Archive; Libre à vous ! #13 of 19 February 2019, with Roberto Di Cosmo, one of the founders of the Software Heritage project; Libre à vous ! #130 on open science and free software. You can comment on the episodes, send us feedback to help us improve, or make suggestions. You can even leave a rating out of 5 stars if you like. Your feedback matters to us because, unlike at a conference, for example, we don't have an audience in front of us that can react. To do so, go to the dedicated page.

Her Royal Science
26 Community

Her Royal Science

Play Episode Listen Later Jan 26, 2022 27:52


In this episode, we speak with Dr Sabah Ul-Hasan, a bioinformatics postdoctoral scholar and lecturer who curates and integrates specialised medical databases into Wikidata. They are also one of the founders of The Biota Project, which intersects science education, outreach, and environmental justice to foster science and data literacy. We begin our conversation by exploring their first moments of fascination with science, and in particular, with wildlife biology. Later on, we discuss the value of data accessibility, the importance of community, and the valuable lessons that Dr Ul-Hasan has learned along their journey. Episode transcript available here: https://www.herroyalscience.com/post/26-community

Conversas em Código
Ep. 40 - Nada funciona nunca

Conversas em Código

Play Episode Listen Later Jan 20, 2022 17:57


A week full of problems, once again with Percy, with the Ember contributions, and a few more oddities. Follow us on Mastodon or on Twitter and join our Slack. Links: Wikidata; Ember.js on Wikidata; GitHub Wikidata bot; tool-new-release; Firefox multi-account containers; Percy; Choosy; Buffer; Mastodon scheduler. Conversas em Código is created by Hugo Peixoto and Ricardo Mendes.

From where does it STEM?
Objectivity vs. Subjectivity in Science: Dr. Sabah Ul-Hasan

From where does it STEM?

Play Episode Listen Later Nov 9, 2021 43:46


On this episode, I got the chance to sit down and chat about social justice in science with Dr. Sabah Ul-Hasan, a bioinformatician and postdoc at The Scripps Research Institute in San Diego under Dr. Andrew Su. They are currently working on the NIH-funded Wikidata biocuration project. Enjoy! --- Send in a voice message: https://podcasters.spotify.com/pod/show/fromwheredoesitstem/message

The World According to Wikipedia
S2 Ep20: That Wikidata Buzz

The World According to Wikipedia

Play Episode Listen Later Sep 22, 2021 34:21


In this episode, we will talk to Siobhan Leachman, a citizen scientist and prolific Wikidata editor. Rebecca tries to figure out which WikiFauna represents her best. And our Wiki hero is the incoming CEO Maryana Iskander. Logo design by Trish O'Flaherty: https://www.trishoflahertydesign.com/ Twitter: https://twitter.com/world_wikipedia Website: https://www.headstuff.org/the-world-according-to-wikipedia/ This show is part of the HeadStuff Podcast Network. For more, go to HeadStuffPodcasts.com, where you can also become a member of HeadStuff+ and get exclusive access to bonus material and lots more.

The World According to Wikipedia
S2 Ep19: Stanning Wikidata

The World According to Wikipedia

Play Episode Listen Later Sep 8, 2021 33:41


In this episode, we will talk to Jan Ainali, a Codebase Steward for the Foundation for Public Code and a complete Wikidata fanboy. The wiki-rule of the episode is that no-one owns an article and we celebrate the many Wikimedians of 2021, including past guest Netha Hussain.  Some interesting links from the Interview with Jan: https://www.datastory.org/services/election-tracker https://www.govdirectory.org/ Logo design by Trish O'Flaherty: https://www.trishoflahertydesign.com/ Twitter: https://twitter.com/world_wikipedia Website: https://www.headstuff.org/the-world-according-to-wikipedia/ This show is part of the HeadStuff Podcast Network. For more, go to HeadStuffPodcasts.com, where you can also become a member of HeadStuff+ and get exclusive access to bonus material and lots more.

R Weekly Highlights
Issue 2021-W25 Highlights

R Weekly Highlights

Play Episode Listen Later Jun 23, 2021 12:03


Projecting and tracking COVID-19 infection rates in England with R, leveraging Wikidata to tag scientific abstracts, and a new deep-learning workflow with the luz package Episode Links This week's curator: Robert Hickman (@robwhickman (https://twitter.com/robwhickman)) Tracking SARS-CoV-2 In England with {epidemia} (https://imperialcollegelondon.github.io/epidemia/articles/multiple-obs.html) Tagging the Scientific Abstracts with Wikidata Items (https://dwayzer.netlify.app/posts/2021-06-15-tagging-the-abstracts-with-wikidata-items) Que haja luz: More light for torch! (https://blogs.rstudio.com/tensorflow/posts/2021-06-17-luz) Entire issue available at rweekly.org/2021-W25 (https://rweekly.org/2021-W25.html) Supplemental Resources {epidemia} package documentation (https://imperialcollegelondon.github.io/epidemia/index.html) A COVID-19 Model for Local Authorities of the United Kingdom (https://rss.org.uk/RSS/media/File-library/News/2021/MishraScott.pdf) How epidemiology has shaped the COVID pandemic (https://www.nature.com/articles/d41586-021-00183-z)
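The abstract-tagging post linked above is written in R; purely as a rough illustration of the same idea, here is a minimal Python sketch that looks up candidate Wikidata item IDs for terms pulled out of an abstract using the public wbsearchentities API. The example terms and the helper name are hypothetical, and a real pipeline would need term extraction and disambiguation on top of this.

```python
# Minimal sketch (not the R code from the linked post): look up candidate
# Wikidata item IDs for terms extracted from a scientific abstract.
import requests

WIKIDATA_API = "https://www.wikidata.org/w/api.php"

def wikidata_candidates(term, limit=3):
    """Return (id, label, description) candidates for a search term."""
    params = {
        "action": "wbsearchentities",
        "search": term,
        "language": "en",
        "type": "item",
        "limit": limit,
        "format": "json",
    }
    data = requests.get(WIKIDATA_API, params=params, timeout=30).json()
    return [(hit["id"], hit.get("label", ""), hit.get("description", ""))
            for hit in data.get("search", [])]

if __name__ == "__main__":
    abstract_terms = ["SARS-CoV-2", "epidemiology"]  # hypothetical extracted terms
    for term in abstract_terms:
        for qid, label, desc in wikidata_candidates(term):
            print(f"{term} -> {qid}: {label} ({desc})")
```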

R Weekly Highlights
Issue 2021-W17 Highlights

R Weekly Highlights

Play Episode Listen Later Apr 27, 2021 12:37


Exploring wikidata with the {tidywikidatar} package, accessibility improvements in {knitr}, and the top 40 new CRAN packages for March. Episode Links This week's curator: Batool Almazrouq (@batool664) (https://twitter.com/batool664) What does Wikidata know about members of the European Parliament? (https://medium.com/european-data-journalism-network/a-new-r-package-for-exploring-the-wealth-of-information-stored-by-wikidata-fe85e82b6440) New in knitr: Improved accessibility with image alt text (https://blog.rstudio.com/2021/04/20/knitr-fig-alt/) March 2021: "Top 40" New CRAN Packages (https://rviews.rstudio.com/2021/04/22/march-2021-top-40-new-cran-packages/) Entire issue available at rweekly.org/2021-W17 (https://rweekly.org/2021-W17.html) Supplemental Resources https://news.yahoo.com/wikipedia-turns-20-aims-reach-035212015.html https://github.com/yihui/knitr/releases/tag/v1.32 https://cran.r-project.org/web/packages/pkglite/index.html https://flujoo.github.io/gm/
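The {tidywikidatar} post above works in R; as a hedged sketch of what the same kind of exploration looks like against the public Wikidata SPARQL endpoint, here is a small Python example. The identifiers are assumptions on my part (P39 for "position held" and Q27169 for "member of the European Parliament") and should be verified before relying on the query.

```python
# Rough sketch of querying Wikidata's public SPARQL endpoint directly, in the
# spirit of the MEP example above. Assumed identifiers: P39 = "position held",
# Q27169 = "member of the European Parliament" (verify both before use).
import requests

ENDPOINT = "https://query.wikidata.org/sparql"
QUERY = """
SELECT ?mep ?mepLabel WHERE {
  ?mep wdt:P39 wd:Q27169 .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }
}
LIMIT 10
"""

resp = requests.get(
    ENDPOINT,
    params={"query": QUERY, "format": "json"},
    headers={"User-Agent": "wikidata-example/0.1"},
    timeout=60,
)
for row in resp.json()["results"]["bindings"]:
    print(row["mep"]["value"], "-", row["mepLabel"]["value"])
```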

With Jason Barnard...
Advantages of Structured Data (Jarno van Driel and Jason Barnard)

With Jason Barnard...

Play Episode Listen Later Apr 6, 2021 43:56


Jarno van Driel talks with Jason Barnard about structured data. Jarno van Driel is a technical & semantic SEO expert with more than 20 years of experience in the field, whose main focus is to provide website optimization services and guide organizations into this new era of the semantic web. Contrary to what most marketers believe, structured data is much more than a simple final touch to get a nice Brand SERP – there are serious advantages in making structured data the foundation for your organization. You will learn amazing tips and insights, as well as what the future looks like in terms of the implementation of Schema Markup in e-commerce sites. What you'll learn from Jarno van Driel:
00:00 Jarno van Driel with Jason Barnard
00:40 Jarno van Driel's Brand SERP
02:35 Structured Data is MUCH more than just a “final sprinkle” to get a nice looking SERP
07:02 Structured Data as the foundation for your organization
08:36 Managing Data Layers in Google Tag Manager
11:15 How to deal with the current Schema Markup limitations?
13:58 Could Structured Data solve the language barrier issue on a search engine?
15:54 Wikipedia, Wikidata and DBpedia for entity disambiguation in different languages
17:15 Is there always a need to disambiguate?
17:53 What has changed in SEO copywriting since 2010
20:26 Does Google prefer that a page is about just one single entity (passage-based indexing)?
22:33 A discussion about Google's ability to identify content's relevancy
27:54 Using Schema Markup to avoid duplicate content on an e-commerce site at the category level
32:22 New implementations of Schema Markup in e-commerce sites
38:09 Google's Merchant Center feeds and the role of Schema Markup
41:06 Is Google “forcing our hand” with structured data?
Subscribe to the podcast: Subscribe here >> This episode was recorded live on video on April 6th 2021 at Kalicube Tuesdays (Digital Marketing Livestream Event Series). Watch the video now >>
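None of the markup below comes from the interview itself; it is just a generic, minimal illustration of the kind of schema.org Organization structured data discussed here, with sameAs links pointing at Wikipedia and Wikidata to help with entity disambiguation. All names, URLs, and IDs are placeholders.

```python
# Generic illustration (placeholder data, not from the interview) of
# schema.org Organization markup with sameAs links for disambiguation.
import json

organization = {
    "@context": "https://schema.org",
    "@type": "Organization",
    "name": "Example Co",                              # placeholder brand
    "url": "https://www.example.com/",
    "logo": "https://www.example.com/logo.png",
    "sameAs": [
        "https://en.wikipedia.org/wiki/Example",       # placeholder article
        "https://www.wikidata.org/wiki/Q_EXAMPLE",     # replace with the real item ID
        "https://www.linkedin.com/company/example",    # placeholder profile
    ],
}

# The output would be embedded on the page in a <script type="application/ld+json"> tag.
print(json.dumps(organization, indent=2))
```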

República Web
Wikimedia es mucho más que Wikipedia con Rubén Ojeda

República Web

Play Episode Listen Later Oct 25, 2020 78:00


For this episode we are joined by Rubén Ojeda, project coordinator at Wikimedia España, the non-profit association that promotes free knowledge and the Wikimedia projects, of which Wikipedia is the best known. But Wikimedia is much more than Wikipedia, and in this episode Rubén tells us about all the work the association does and how it fosters better access to culture and free knowledge. Wikimedia is behind outreach projects as interesting as Wikidata, Wikimedia Commons, Wikiquote, and Wikisource, as well as many others that together form a formidable repository of shared, free information. With Rubén Ojeda we talk a great deal about Wikipedia, but also about several questions concerning Wikimedia and its projects: What is the Wikimedia España association and how was it established here? What is a wiki and how is it organised editorially? How can citizens take part in the Wikimedia projects? Do the public sector and cultural institutions know how to share knowledge through your projects? What are the greatest threats to free knowledge in the digital era? How can the Wikimedia projects help fight disinformation? In short, a very complete conversation for learning about Wikimedia's work, digital rights, and free content, and for finding out how to collaborate on the projects.

The History of Computing

Welcome to the History of Computing Podcast, where we explore the history of information technology. Because understanding the past prepares us for the innovations of the future! Today's episode is on the history of Wikipedia. The very idea of a single location that could store all the known information in the world began with Ptolemy I, founder of the Greek dynasty that ruled Egypt following the death of Alexander the Great. He and his son amassed hundreds of thousands of scrolls in the Library of Alexandria from 331 BC onward. The Library was part of a great campus, the Musaeum, where they also supported great minds, starting with Ptolemy I's patronage of Euclid, the father of geometry, and later including Archimedes, the father of engineering, Hipparchus, the founder of trigonometry, Hero, the father of math, and Herophilus, who gave us the scientific method, and countless other great Hellenistic thinkers. The Library entered into a slow decline that began with the expulsion of intellectuals from Alexandria in 145 BC. Ptolemy VIII was responsible for that. Always be wary of people who attack those that they can't win over, especially when they start blaming the intellectual elite for the problems of the world. This began a slow decline of the library until it burned, first with a small fire accidentally set by Caesar in 48 BC and then for good in the 270s AD. In the centuries since there have been attempts here and there to gather great amounts of information. The first known encyclopedia was the Naturalis Historia by Pliny the Elder, never completed because he was killed in the eruption of Vesuvius. One of the better known is the Encyclopedia Britannica, starting off in 1768. Mass production of these was aided by the printing press, but given that there's a cost to producing those materials and a margin to be made in the sale of those materials, that encouraged a somewhat succinct exploration of certain topics. The advent of the computer era of course led to encyclopedias on CD and then to online encyclopedias. Encyclopedias at the time employed experts in certain fields and paid them for compiling and editing articles for volumes that would then be sold. As we say these days, this was a business model just waiting to be disrupted. Jimmy Wales was moderating an online discussion board on Objectivism and happened across Larry Sanger in the early 90s. They debated and became friends. Wales started Nupedia, which was supposed to be a free encyclopedia, funded by advertising revenue. As it was to be free, they were to recruit thousands of volunteer editors: people of the caliber that had previously been hired to research and write articles for encyclopedias. Sanger, who was pursuing a PhD in philosophy from Ohio State University, was hired on as editor-in-chief. This was a twist on the old model of compiling an encyclopedia, and a twist that didn't work out as intended. Volunteers were slow to sign up, but Nupedia went online in 2000. Later that year only two articles had made it through the review process. When Sanger told Ben Kovitz about this, he recommended looking at the emerging wiki culture. This had been started with WikiWikiWeb, developed by Ward Cunningham in 1994, named after a shuttle bus that ran between airport terminals at the Honolulu airport. WikiWikiWeb had been inspired by HyperCard but needed to be multi-user so people could collaborate on web pages, quickly producing content on new patterns in programming. He wanted to make non-writers feel OK about writing. 
Sanger proposed using a wiki to be able to accept submissions for articles and edits from anyone, while still having a complicated review process to accept changes. The reviewers weren't into that, so they started a side project they called Wikipedia in 2001 with a user-generated model for content, or article, generation. The plan was to generate articles on Wikipedia and then move or copy them into Nupedia once they were ready. But Wikipedia got mentioned on Slashdot. In 2001 there were nearly 30 million websites but half a billion people using the web. Back then a mention on the influential Slashdot could make a site. And it certainly helped. They grew and more and more people started to contribute. They hit 1,000 articles in March of 2001, a number that increased tenfold by September and another fourfold the next year. It started working independently of Nupedia. The dot-com bubble burst in 2000 and by 2002 Nupedia had to lay Sanger off and he left both projects. Nupedia slowly died and was finally shut down in 2003. Eventually the Wikimedia Foundation was created to help unlock the world's knowledge; it now owns and operates Wikipedia. Wikimedia also includes Commons for media, Wikibooks that includes free textbooks and manuals, Wikiquote for quotations, Wikiversity for free learning materials, MediaWiki the source code for the site, Wikidata for pulling large amounts of data from Wikimedia properties using APIs, Wikisource, a library of free content, Wikivoyage, a free travel guide, Wikinews, free news, and Wikispecies, a directory containing over 687,000 species. Many of the properties have very specific ways of organizing data, making it easier to work with en masse. The properties have grown because people like to be helpful and Wales allowed self-governance of articles. To this day he rarely gets involved in the day-to-day affairs of the Wikipedia site, other than the occasional puppy dog looks in banners asking for donations. You should donate. He does have 8 principles the site is run by: 1. Wikipedia's success to date is entirely a function of our open community. 2. Newcomers are always to be welcomed. 3. “You can edit this page right now” is a core guiding check on everything that we do. 4. Any changes to the software must be gradual and reversible. 5. The open and viral nature of the GNU Free Documentation License and the Creative Commons Attribution/Share-Alike License is fundamental to the long-term success of the site. 6. Wikipedia is an encyclopedia. 7. Anyone with a complaint should be treated with the utmost respect and dignity. 8. Diplomacy consists of combining honesty and politeness. This culminates in 5 pillars Wikipedia is built on: 1. Wikipedia is an encyclopedia. 2. Wikipedia is written from a neutral point of view. 3. Wikipedia is free content that anyone can use, edit, and distribute. 4. Wikipedia's editors should treat each other with respect and civility. 5. Wikipedia has no firm rules. Sanger went on to found Citizendium, which uses real names instead of handles, thinking maybe people will contribute better content if their names are attached to their work. The web is global. Throughout history there have been encyclopedias produced around the world, with the Four Great Books of Song coming out of 11th century China and the Encyclopedia of the Brethren of Purity coming out of 10th century Persia. When Wikipedia launched, it was in English. Wikipedia launched a German version using the deutsche.wikipedia.com subdomain. 
It now lives at de.wikipedia.org, and Wikipedia has gone from being 90% English to being almost 90% non-English, meaning that Wikipedia is able to pull in even more of the world's knowledge. Wikipedia picked up nearly 20,000 English articles in 2001, over 75,000 new articles in 2002, and that number has steadily climbed, reaching over 3,000,000 by 2010, and we're closing in on 6 million today. The English version is 10 terabytes of data uncompressed. If you wanted to buy a printed copy of Wikipedia today, it would be over 2,500 books. By 2009 Microsoft Encarta shut down. By 2010 Encyclopedia Britannica stopped printing their massive set of books and went online. You can still buy encyclopedias from specialty makers, such as the World Book. Ironically, Encyclopedia Britannica does now put real names of people on articles they produce on their website, in an ad-driven model. There are a lot of ads. And the content isn't linked to in as many places, nor is it as thorough. Creating a single location that could store all the known information in the world seems like a pretty daunting task. Compiling the non-copyrighted works of the world is now the mission of Wikipedia. The site receives the fifth most views per month and is read by nearly half a billion people a month with over 15 billion page views per month. Anyone who has gone down the rabbit hole of learning about Ptolemy I's involvement in developing the Library of Alexandria and then read up on his children and how his dynasty lasted until Cleopatra and how… well, you get the point… can understand how they get so much traffic. Today there are over 48,000,000 articles and over 37,000,000 registered users who have contributed articles, meaning that if we set 160 Great Libraries of Alexandria side by side we would have about the same amount of information Wikipedia has amassed. And it's done so because of the contributions of so many dedicated people. People who spend hours researching and building pages, taking on the need to provide references to cite the data in the articles (by the way, Wikipedia is not supposed to represent original research), more people who patrol and look for content contributed by people on a soapbox or with an agenda rather than just reporting the facts, and another team looking for articles that need more information. And they do these things for free. While you can occasionally see frustrations from contributors, it is truly one of the best things humanity has done. This allows us to rediscover our own history, effectively compiling all the facts that make up the world we live in, often linked to the opinions that shape them in the reference materials, which include the over 200 million works housed at the US Library of Congress, and over 25 million books scanned into Google Books (out of about 130 million). As with the Great Library of Alexandria, we do have to keep away those who seek to throw out the intellectuals of the world, and keep the great works being compiled from falling to waste due to inactivity. Wikipedia keeps a history of pages, to avoid revisionist history. The servers need to be maintained, but the database can be downloaded and is routinely downloaded by plenty of people. I think the idea of providing an encyclopedia for free that was sponsored by ads was sound. Pivoting the business model to make it open was revolutionary. 
With the availability of the data for machine learning and the ability to enrich it with other sources like genealogical research, actual books, maps, scientific data, and anything else you can manage, I suspect we'll see contributions we haven't even begun to think about! And thanks to all of this, we now have a real compendium of the world's knowledge, getting more and more accurate and holistic by the day. Thank you to everyone involved, from Jimbo and Larry, to the moderators, to the staff, and of course to the millions of people who contribute pages about all the history that makes up the world as we know it today. And thanks to you for listening to yet another episode of the History of Computing Podcast. We're lucky to have you. Have a great day! Note: This work was produced in large part due to the compilation of historical facts available at https://en.wikipedia.org/wiki/History_of_Wikipedia

WIRED Business – Spoken Edition
Inside the Alexa-Friendly World of Wikidata

WIRED Business – Spoken Edition

Play Episode Listen Later Feb 19, 2019 7:30


Humans pricked by info-hunger pangs used to hunt and peck for scraps of trivia on the savanna of the internet. Now we sit in screen-glow-flooded caves and grunt, “Alexa!” Virtual assistants do the dirty work for us. Problem is, computers can't really speak the language. Many of our densest, most reliable troves of knowledge, from Wikipedia to (ahem) the pages of WIRED, are encoded in an ancient technology largely opaque to machines—prose.
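To make the contrast with prose concrete, here is a small, illustrative Python sketch (not from the article) that pulls the machine-readable version of a Wikidata item through the public Special:EntityData endpoint; Q42 (Douglas Adams) is just a well-known example item.

```python
# Sketch of the structured, machine-readable side the article contrasts with
# prose: any Wikidata item can be fetched as JSON via Special:EntityData.
import requests

qid = "Q42"  # Douglas Adams, used here only as a familiar example
url = f"https://www.wikidata.org/wiki/Special:EntityData/{qid}.json"
entity = requests.get(url, timeout=30).json()["entities"][qid]

print("Label:", entity["labels"]["en"]["value"])
print("Description:", entity["descriptions"]["en"]["value"])
# Each property (e.g. P31, "instance of") maps to a list of statements.
for claim in entity["claims"].get("P31", []):
    print("instance of ->", claim["mainsnak"]["datavalue"]["value"]["id"])
```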

The Quiet Light Podcast
How to Uncover SEO Opportunities in an Acquisition

The Quiet Light Podcast

Play Episode Listen Later Jun 6, 2018 37:11


I had a buyer recently tell me: "I don't trust SEO traffic. You can't change it. I only trust paid acquisition channels." Times certainly have changed since just a few years ago. A lot of buyers look at natural SEO traffic as untrustworthy ever since the major index updates of Panda, Penguin, and Hummingbird. Others see SEO as a much more difficult (and possibly more expensive) avenue towards traffic. And some buyers think that the relevance of SEO will be discounted with new voice-enabled searches and paid advertising pushing natural search rankings down Google's SERPs. In this conversation with Corey Northcutt, we discuss the future of SEO, whether there is a good opportunity for buyers to exploit SEO opportunities, and what he would look out for on the SEO front before buying any online business. Episode Highlights:
Can we trust SEO long-term? Where is it going, and can you build a business on SEO traffic?
What key SEO factors should you look at before buying an online business
Google's primary goal, and why it is good for business owners
Why SEO traffic will always beat paid traffic
Why the major Google updates (Panda, Penguin, etc.) were a good thing (and still are a good thing)
Will paid ads continue to push organic rankings out of SERPs?
Are voice-based search devices going to destroy SEO?
Click-through rates for top rankings in organic listings vs. paid listings
Transcription: Joe: Hey Mark, how are you doing today? Mark: Doing great. Joe: I understand you recorded a podcast with Corey Northcutt and it was all about SEO. Mark: That's right so Corey and I know each other through Young Entrepreneurs Council which is a great collection of entrepreneurs, some of the best resources that you can find online for anybody that is an entrepreneur out there and is looking for good networking opportunities. You do have to meet certain thresholds in order to join and Corey obviously hits those. I've used his services for another business of mine and was really really impressed with what he had to offer. Brad who works with us is actually the one who first recommended him to me. This guy has been working in the online world in an SEO capacity forever. I mean he's a dinosaur in the SEO world, knows a ton about SEO. And one of the questions I posed for him and I really wanted to drill down on this podcast was whether or not there is value at all in SEO anymore from a buying standpoint; in other words can it be trusted? I had a buyer tell me just a few weeks ago we were talking on the phone about one of the businesses I was representing and he wanted to know where our customers are coming from on the websites and I said well he's got good rankings and then he also does paid. And he came back and he told me and said I don't really care about the organic rankings because I can't control that at all. In fact all I care about is a paid acquisition channel. I think we hear that more and more from people that they trust that paid acquisition channel more than the organic channel. But I think it's almost an overreaction and there's a lot of opportunity being lost because everyone focuses just on the paid and just treats the organic as a sort of bonus to everything else. A couple of the topics that we addressed in this podcast are whether or not the future of SEO is going to be strong. You know we have more and more devices being added for voice-based search. We have the paid creep that's been happening on organic rankings where organic listings are being pushed down the page. 
And so this is really kind of a step back to look at your online business and say should I be focusing on SEO, what's the opportunity here with SEO as well. Joe: Okay so I'm not going to say whether I think you know you should be focused on it or not. What I can say is that a business with multiple traffic sources and multiple channels of revenue is worth a lot more money and from my experience a long time ago SEO was a long term game. I survived the Penguin updates, Panda update … actually I sold before the Penguin Update but it didn't matter because I didn't know anything about link building anyway. All I did was good quality content over a long period of time and I was rewarded. It's a lot more complicated than that I think. So I'm really excited to see what Corey has to say. Let's go ahead and listen. Mark: Well one second, I'm going to give away a bit of a teaser on this and that is something that you said; I think what you said is perfect because it is the sense that a lot- Joe: Did you say that what I said was perfect just now? Mark: Dude, don't let it get to your head. Yeah it was actually perfect because a lot of people think that SEO is more complex. They think it's complicated. They think that it's a nut that's very difficult to crack. And a lot of us, especially those who have been around through the Panda, Penguin, Hummingbird, you know, all these sorts of updates, animal updates that happened, look at SEO and think my goodness you can't trust this. Look what happened back then, it got completely destroyed. One of the big takeaways from this and Corey gets into this is that SEO now is more predictable than it's ever been. Google's gotten better at what they're doing so they don't need to shake things up as much anymore. Now it is a little bit harder to rank, sure, because Google has done a better job of putting good information in front of people. But rankings are more stable now than they've ever been before. And so the big takeaway is that while it's somewhat complex there's a huge opportunity because people like you and I have kind of looked at SEO as that thing as a bonus out there. Joe: From a buyer's point of view what you just said might be very valuable. You know it takes … it's harder to rank now and so if we're listing a business and there's good organic rankings, that in itself could have more value to a buyer. Because most everybody else just cheats to get to the top and you can't actually do that anymore, but if you're at the top on page one that's really strong value. And I think hopefully in the long run any business will add more diverse revenue channels, which makes it more valuable. Mark: All right can we listen to Corey now? Joe: Absolutely, let's go. Mark: All right Corey, thank you for joining me on this quick episode here on SEO. Corey: Absolutely, thanks for having me. Mark: So yeah I know you've listened to a couple of the episodes before so you know that we like to have our guests introduce themselves. So why don't we provide everyone listening here with a background on yourself. Corey: Yeah I'd love to. So I've been doing SEO for going on 17 years. I've been … I guess running businesses of different shapes and forms for about as long. I came from web hosting and doing different IT brands and now I run an SEO agency called Northcutt. Mark: That's awesome. 
Yeah and I've actually … just full disclosure, I have used his services in the past, but with a lot of the guests that we have here on the Quiet Light Podcast, they're people whose services we've used in one capacity or another, so we trust the services, we trust you as far as your … the quality of work that you do. And I also know Brad who works with Quiet Light Brokerage; he has used your services quite a bit in the past as well which is how I was referred to you. It was actually through Brad. So that that's pretty cool. Now you have a past in web hosting as well, is that right? Corey: That's true. Yeah I started at a provider called Ubiquity that was eventually acquired by LeaseWeb but not before I actually exited the company and sold it back to my business partners. Mark: That's pretty cool, so my background is actually web hosting as well. My first job out of college was with a company called Alabanza Corporation and they were the first ones to create what people know as, like, a web hosting manager or cPanel. They were the first ones to actually come out with a control panel, and cPanel was competition to Alabanza. And I remember when we developed that the CEO talking about competition told me he's like I'm not worried about competition, it take … took us years to be able to create our, I think he called it the account management [inaudible 00:07:01.2] stupid like that. And of course the template was already out there so it didn't take them years to replicate what we were doing and they've done quite well; so cautionary tale there. But the first business I sold under Quiet Light Brokerage was a web hosting company, so very familiar with the space. So you sold that, how many businesses have you bought and sold over the years? Corey: So I like to call myself a three time founder in all my bios. It's tough to say though because I have partners and in that I've had a lot of failed ventures too. A lot of projects that I've spun up or that maybe sold for cheap so it's been all over the map. I did web hosting. We spun off a data center services brand from that. We had a Ventrilo provider called DarkStar Communications which was the largest provider of Ventrilo for quite a while. Most people I think don't even know what that is anymore. Mark: Yeah I've never heard of it. Corey: It was a big bank for a while. And it wasn't even my world but it was kids playing World of Warcraft; they would need voice chat for that, and it sounds insane today, we've got Zoom and Skype and all these tools that are free, but they would pay for it. So you'd have 50 or 100 people on one voice server and you needed tools to manage it. So a lot of different businesses; I kind of created a framework for how I like to build and market them but all in completely different spaces. Mark: You know I like to segment Internet entrepreneurs as to those that were started before the panda, penguin days and those that have come into it after because the world is so much different. You know people like you and I have been in the online world for … I'm going on 20 years here since I built my first website. And you know back then and actually back when I started Quiet Light Brokerage, when I decided to start Quiet Light Brokerage what did I do? I went out and I built a website from scratch and then I custom coded an affiliate program in there and that was kind of how I launched everything, was me going out coding, designing, launching, doing the SEO; everything top to bottom. 
And today I mean if you want to start a new site you can still do that but boy it's so much different today, you don't … you really wouldn't want to take that approach as much. It's gotten a lot more complex. Corey: It's true. Mark: Yeah. And you seem to come from that sort of past as an entrepreneur: this is a good idea, let me see if I can just build this out real quick. Corey: Yeah and yet it's changed so much. Yeah and I have … on one hand I love how quickly you can spin things up, like you've got Shopify, you see people with stores in 10 minutes; it's completely insane. On the other hand I feel like a lot of people don't go as deep with their businesses now. You've got projects where I can start a company and it might make 500 dollars a month and that's fine and it … I'll do 10 of those this week. The mindset has changed. Mark: Yeah I know definitely it's changed quite a bit. So I wanted to have you on to talk about SEO and I'll tell you kind of the question that spurred this on for me and I think it would be a good start for our discussion here. I was talking to a buyer the other day about one of my clients and he's Amazon and Magento mixed so he's got his own websites but you know a good portion of his business comes through Amazon. But we're looking at the websites because they're doing really well. Amazon is struggling but the websites are doing really, really well. And we're looking at the host and this buyer was asking where does the client … where did the clients come from, are they coming from organic rankings or are they coming from paid search or a good mature search campaign out there. So I told him well it's a mix, you know, there's really good SEO on the site, there's room to improve that as well, but they also have a paid campaign as well. And this buyer almost seemed to dismiss the SEO side and said well I can't control the SEO world at all, I'm interested in the paid acquisition. And I see this more and more and I think this is kind of the people waking up from the hangover of the Panda, Penguin updates, which is almost … we're coming up on seven years or something like that; is that right? Seven years, is that pretty close? Corey: Well- Mark: [crosstalk 00:11:06.9] try to also work. It's been a while since this happened and I think people have really adjusted their mindset to not trusting SEO at all. So my question to you, asked on behalf of everyone out there looking to buy an online business, can you trust SEO and can you build or grow a business on the back of SEO and have it be sustainable? Corey: Oh my God, yes. So there is one question that has been around since I feel like SEO began which is where this is going, it … can I trust it long term, and I think Google's actually been very transparent about what they want to accomplish. As much as we've had different updates like Panda and Penguin and Hummingbird to shake things up, there's been quite a few, but Google's always been forthcoming. And I feel like most of the media out there sensationalizes what's happening and it does a disservice to business owners. Because … and what does Google really care about? They want to reward an experience that is naturally relevant, popular, and enjoyable; that's it. And they've been working towards this goal for all of this time and I don't think it goes away. There is never going to be a point where paid advertising is a better experience for what they deliver. If they ever reach that point I think all bets are off. 
I think somebody disrupts them; Bing or somebody else overtakes them. There's no way people want that. It is a better experience when it's not simply rented. So that doesn't go away, it's just they keep iterating towards getting better at what they set out to do. Like we talk about that pre-Panda, pre-Penguin world, I think it did a lot of good. I look back at how I did SEO back then and you know it was a little gray. It was hard to … like that was the conversation we were having with people. It's like I think we should be as white hat as possible. At that time I feel like that was a source of a lot of that spam. Rand Fishkin at the time was not getting very much respect from professional SEOs and he was saying no, completely white hat, don't mess with anything, no schemes, no link wheel pyramid tetrahedron. Yeah like I'm sure you've seen all the different diagrams and wacky ideas that people were coming up with back then, and then Panda made him correct; just overnight everything shifted and it was like well yeah they finally got better and they really are rewarding people that aren't going against Google, like do you want to work with them. Mark: Let me play devil's advocate a little bit here and argue against SEO. Now this is not my personal position. I actually agree with you. I think there's a ton of opportunity in SEO and I actually think the world is a … the SEO world is a lot more stable today than it was back in the pre-update days, pre-Panda, Penguin, Hummingbird, mainly because the results are better and Google is delivering a better experience, and before it was very easy to game the search engines. You were doing grey hat, I was doing grey hat, everybody was doing grey hat back then. But anyways let me play devil's advocate. Two changes that people look at and they see as encroaching on the organic SEO. One would be the number of paid listings that show up above the fold on Google and where organic rankings start to be pushed down. And two, voice-enabled search. Let's start with the first one here, the placement of organic rankings. I have another business that I own and I absolutely absolutely hate bidding on my brand keyword. Because it's my brand keyword, I show up number one, I show up number two, I show up number three, but if I don't bid on it I've got four other people bidding on my brand and they're above me. So from that standpoint has SEO become less valuable for business owners or is that a trend that you think is going to continue where paid listings push out the organic rankings? Corey: I don't and in fact I … you know I saw the same trends. And by the way, just for the benefit of listeners, if somebody is pushing you out that way on your branded searches, if they mention you by name you can file a trademark request with Google and get them shut down. It … they're still able to use your brand name as a keyword but it can water down their messaging if somebody is getting too aggressive with that. So I don't see it going too much further and yeah that was a big story each time Google has experimented with expanding the ad block, but there's data on click-through rates that's out there. Rand Fishkin actually, through his new startup SparkToro, did a Jumpshot analysis on this and it's incredible how many people still click on organic over paid. The overwhelming majority still click on organic across the board even in the most extreme, like, biased examples I've seen. 
I actually just sent out our quarterly here a few weeks ago that looked at this. Ahrefs had their own click-through rate data of tons of searches that they've scraped and in their example they said the maximum went up to 46% who click on ads. Up to is the operative word there, I think that's the most extreme example. It's 46% where you know it's a branded search, your brand is number one; obviously, that's what they wanted. We're going to click that, sure, but the Jumpshot data said 3% was their average. So somewhere between 3% and 46% are clicking on paid ads. It's still the minority and I don't see that ever changing. Mark: 3% to 46% is pretty broad. Corey: It is pretty broad. The average is three. Mark: Okay. Corey: But yeah. Mark: That's amazing. Yeah and in the example I have we have a lot of brand confusion in our space and my main competitor has been very very good at causing brand confusion. So it's a personal annoyance for me right now, my personal mission to get them out of that number one spot even though I'm losing money on it. Corey: Not this. Mark: Yeah. So that's interesting. I would tend to agree that there's only so much real estate that they're going to give to the ad spot, to those ad blocks, because it's … they have been focused from day one on that user experience. So they want users, especially on brand searches, to be able to find the brand that they're looking for. What about voice-enabled search? I know for myself, if I want to find out some quick data or whatever, I've got a Google Pixel phone, I just give a little squeeze and Google's system comes up and I just ask it the question and more and more it's becoming intelligent in giving the response. More and more it's taking those responses of course from other websites and so they aren't getting any of the traffic from that. This wouldn't be so much a concern for e-commerce sites but for content sites, I mean is this something to be looking out for and maybe something that's going to encroach on their opportunities in the future? Corey: Yeah, there are definitely demographics that are going to be hit by this. You and I talked about famous quotations here a week ago and how that is an industry that got hit pretty hard by Panda. I think the nuance is any short simple information is going to have a hard time. Like just the example from last month, Google actually started returning no-results searches when people asked for the time of day. And there was a website, timeanddate.com, that ranked number one for all of these and I'm sure they were raking in a lot of AdSense doing it, not probably great ads for those people but still it was working for them in the moment. So there are really nuanced types of businesses that I think buyers should probably be a little wary of. If it doesn't give deep information it can't be [inaudible 00:19:11.5] by a simple answer from Google. But if it does go deeper I think it goes outside the scope of what Google can accomplish with voice search because it's going to be complex. There's going to be value in multiple results then so that's [inaudible 00:19:26.1]. 
And so getting that response, being that featured response at Google, will probably be a good thing because more people are going to click through to your page, right? Corey: Yup and there's also still value being lost right now from what they call no-click searches. Where maybe you appear within the knowledge card, like the top of the results; people see your brand, they see that it's from you; they don't click through and show up in your analytics. But at some point who cares, if they still saw your brand you still helped them, and if they still see you then you may have accomplished what you set out to do anyway. It's just not going to be attributed as well. Mark: Right; of course. I get it, the top of funnel is sort of just brand awareness, awareness of your brand, and what's better to vouch for you than Google, right? Corey: Right. Mark: If Google's going to feature you on their search result page that's a pretty good thing, and if people don't know what I'm talking about here, do you have an example that you know off the top of your head where a knowledge card will show up? Corey: Recipes are a big one now. I don't know any exactly off the top of my head but- Mark: Well didn't … wasn't there that one for a while which was why are fire engines red; do you remember seeing that? Corey: I think so, it sounds familiar. Mark: Yeah if people haven't looked at this, do a search for why are fire engines red and take a look at what the response is. They may have updated it since I last did it but it was just somebody had gamed the knowledge card and it was kind of a crazy response. Corey: Yeah, but it's still not that hard to do. Mark: So I … okay, you can't leave it at that and not go into it. Corey: Well we know what their data sources are so yeah you can … what Google is not good at is understanding what you're telling it, and that's what they're working on, fact checking, right? Being able to actually understand, is this good, and not just, are these words here and does the phrase maybe kind of mean something. And that's what I think they'll improve maybe next. I think it'll take a while because they're still behind what a lot of the articles give them credit for now, but we do know which way they're going. There's an analogy. I love AJ Kohn as an SEO blogger; his company is called The Blind Five Year Old because that's how he perceives Google still. Kind of hyperactively bouncing out of your sight not really knowing what it's doing but they are still moving in a direction that makes sense. And with the knowledge cards there's different sources of data where I mean you can literally just put it in and hope that Google crawls it. So you can update Wikidata at wikidata.org, put in some bunk information, and sure, they might index it. They might not catch that; it's not that hard to fool it. Mark: That's funny. All right, well I've got some pressing questions to get to here. This is fascinating and it would be fun to explore all the idiosyncrasies of the world of Google, but let's talk about … let's put ourselves in the position of a buyer looking to acquire a business who wants more opportunities for SEO, how to uncover some of those, and where some of the mistakes are. But before we do that let's talk about the due diligence side of things. So he's looking to buy a business, it receives a good amount of traffic from natural organic rankings. What are some of the things that people should be looking out for when doing due diligence? 
For example private blog networks, are these still something to look out for or are there other things that you may want to caution people on inspecting before they do an acquisition? Corey: Right so without a doubt I would never buy any website without looking at its backlink portfolio. There are basically two arms of SEO right, you've got what happens on the website and off of it. I'm not so concerned about what's happening on the website. I know just based on my background I can probably make it a lot better. But I know that it's not a danger zone, the links are. So first, are they trending upwards, that's a good sign; bad links tend to get moderated. It makes sense, if somebody spams a whole bunch of forums or blogs using a piece of software, it's going to get turned away, and there are Google patents that talk about this as a signal. Like if somebody blasts 100,000 links and all of a sudden they disappear I immediately know something's wrong. And even if something's not wrong, if they had a good reason for that to happen, I still haven't really seen one, but if they did it … that pattern looks really bad. So that's the first thing, okay, I guess I start to dig into it and I start to look for schemes like you mentioned; are there link wheels, are there … you mentioned that you and I are pre-Panda people a little bit here. I've … I know the schemes because I've used the schemes. I've tried the schemes and I know what all of them look like. There are maybe a dozen of them, which might go beyond our time right now but- Mark: So with some of these schemes how would somebody identify these? Obviously link patterns, so seeing declining backlinks would be an example of things being moderated away from low quality sites or even high quality sites where it's been spammed to a public place. But for like a Link Wheel or a PBN, are there tools that you would recommend somebody use for this or is it really just something where you need to hire somebody like you to be able to help identify these schemes? Corey: Sure. Well I won't go so far as to say someone has to hire me but I do have a lot of skepticism in the tools only because we see them throw a lot of false positives. They do good things too but I've got a team that's used every backlink audit tool I think at this point and they're flawed certainly. Especially when you pair them with the activity of disavowing links which is usually the natural next step. When you find bad links people tend to use the disavow form in Google Search Console and that's irreversible so it's really really dangerous. We've very frequently been approached by people that ran an automated link audit, got a lot of terrible advice, disavowed a lot of good links, their rankings went away, and they need help, and all we can say is well now you've just got to rebuild, like you shouldn't have done this. That was a bad idea. So I think it's just about recognizing the schemes and the most overarching litmus test in my mind is does this double as good marketing. Sometimes it's just a completely automated site, like you see a lot of these statistics websites and whois websites, those big automated plays. I would usually say if a site links to every site on the internet, which you can usually see, like is it linking to every domain alphabetically; you see that a lot on the backlink tools. I don't worry about those. I don't think you should disavow those because that's not a scheme. That's not a pattern that you need to weed out, and that's a flaw in every auditing tool I've used. 
So I wouldn't worry about those. I also wouldn't worry about anything that is editorially relevant. Like is it editorial, a guest post, a press release, a mention of any kind really that happened from a human. But if it didn't, and you can usually tell by kind of thumbing through the site a little bit, that usually means that your link is appearing beside other schemes. And if a link is really easy to get, that by definition kind of makes it a bad link, which is counterintuitive right? You've got all these SEO services that are offering fast easy links for everybody. That's flawed because if it's really for anybody, that means that your link appears beside porn sites, you know, pill affiliates, all sorts of really kind of sketchy looking stuff. It shouldn't be easy for everybody and that's really the way to tell it I think. Mark: So something that we see with Quiet Light Brokerage in our backlink profile is we'll get a piece published in Forbes or Entrepreneur or in [inaudible 00:27:36.5], a good piece, and obviously we love those backlinks. But then sure enough there are these really low quality sites that will take that article, that blog post, and they'll republish it and you know it's just a complete spammy site. You can tell that there's never a human that has touched that site other than initially [inaudible 00:27:55.7]. Are those backlinks, if somebody is doing a backlink analysis on the site and they see some good high quality backlinks but then they see a whole bunch of copycat stuff, is that anything to worry about in your opinion? Corey: It depends a little bit on the site. If they're purely just scraping Forbes, I'd say, well, do they link back? Because if they do it reminds me a little bit of press syndication which is perfectly natural and it's a signal that I think any grown-up brand is going to have. Like you've got basically every publicly traded company running out regular press releases, so if I put on my … like, if I'm Google, that actually looks okay. But if it's a really low quality site you might see them also doing other shady stuff so you might have to look at their backlink portfolio and kind of pick apart what they're doing. Mark: Okay fair enough, that is good advice. And if anybody is listening to this and you're completely lost as far as what Corey is talking about here, I'm sure you could reach out to him and get a little bit more insight into some of these things. The world of SEO is kind of this big old rabbit hole, you can understand it on a very basic level or you can get into [inaudible 00:29:02.3] sort of the more nuanced stuff. In which case you're talking about link wheels and different types of shapes as far as linking patterns, which I've thrown most of that out the window years ago when I started seeing a lot of the updates come through. So I want to talk about that, you know, we talked a lot about backlinks and backlink profiles, it's been my perception, and please correct me, you are the expert in this not me, it's my perception that backlinks haven't been so much devalued as might have been surpassed or might have … might be having other ranking indicators kind of come up alongside backlinks as being important. And one of the ones I've seen has been topic coverage, topical coverage on a page. 
So an example of that would be we have a blog post on what Seller's Discretionary Earnings is, well we also want to cover not only what Seller's Discretionary Earnings is but what it means for Accrual Basis versus Cash Basis Accounting and you know what is Net Income, what is Gross Revenue, because these are related topics to the one thing, so having all that content now is a good signal to Google. Am I correct or incorrect or off base when I say that the backlinks, while still important, are playing alongside some of these other newer ranking factors? Corey: Yeah I mean you'd be correct in saying on-page matters more and in more nuanced ways. It's tough to weigh, like do links matter less, because links are infinite really. On-page is still finite so I think in that math equation links can never matter less because you can always do more with links. You can't always do more with your site so that makes that equation interesting. But yeah since the Hilltop Algorithm, which I believe was written by Krishna Bharat, he published a paper that's actually really old, it was pre-Panda by a lot, and it broke down I think what they've been building upon for a really long time, which for the first time defined what they call topical experts. And if you really dig into the paper it appears to be talking about domains as experts, and they've played with that a little bit; you know, you had authorship on Google+ which I think was one sort of riff on that idea of trying to figure out who really knows about a topic. And around that time SEOs went crazy with the concept of relevance. People are saying well you only want links from relevant sites. I think that's bunk because, well, do I not want a link from CNN? They're not an SEO website, but obviously I do, obviously that's still a good link. But there's more value if I get somebody from within my space on average. So it's just one more metric, it's a little bump I feel like in their favor if they're relevant or if you're relevant. They're looking at the themes throughout your site definitely. So to your point yeah, that exact same idea, the more you cover a topic the more I think your ratio of expertise is strengthened there. And for the same reason a Mike & Mitch e-commerce shop should absolutely be able to outrank Amazon. They're generic, they don't have that focus and we see that a lot. Mark: That seems to be a recurring theme of this podcast here; how to beat Amazon at their own game. And I've talked to so many e-commerce business owners who get frustrated by … when they put their own listings up on Amazon and all of a sudden Amazon's outranking their niche store. But I think your point of if you have good topical coverage on your site, if you're doing … if you're making sure the on-page is right, you should be able to outrank Amazon because it is a specialized site. And that actually was a really nice link into the final section I want to cover, and that is opportunities for pretty much any buyer: when you're looking to acquire an online business, opportunities in SEO. I see huge opportunities with most of the stuff I look at in working on the on-page SEO. What are a few areas in your opinion that people should look at when they're looking to grow the SEO presence of either an e-commerce shop or a SaaS business or a content business, but really kind of looking at that SEO portion, what are a few areas that are common pieces of low hanging fruit that you see? Corey: Sure. 
Well, since Panda there are a lot more diamonds in the rough, I feel like, that just have broken on-page SEO, and the poster case study going all the way back was DaniWeb, right? Everyone was talking about DaniWeb, which was one of the biggest tech forums; they lost easily all of their rankings when Panda first hit, and they managed to recover by removing what people later called thin content — pages that might have fifty words on them, all the different individual profiles that people had, and there were millions of them. Most of them were a bad search engine experience. So when I see a site that has a lot of pages that don't offer value to Google but don't carry the noindex tag — the meta tag in the source code [inaudible 00:33:56.5], code in the source that says keep us out of Google's index — I know, hey, I can do that and overnight strengthen the stuff I want to keep and cut out the stuff that's never going to be of any value, and that's going to help a lot. I also look for sites that don't have a keyword strategy, and sites that for whatever reason have never had any links but still enjoyed some organic success. There are a lot of ways to play this. In total there are, I think, a couple hundred ranking factors. I basically just look for a couple that have been 100% neglected, because I feel like that's where people leave the most money on the table — basically where I can see a quick win.

Mark: Yeah, coming back to round out this discussion: after the updates, everybody was scared of organic traffic, and understandably so. It was very difficult for a lot of people, because they owned a business, an update happened overnight, and the rankings were gone, the revenue was gone. A lot of businesses were built on shaky SEO, and Google has done a good job of cleaning that up. But as a result, people now lean on the perceived safety of paid acquisition. I think there's a huge opportunity for buyers to take a look at pretty much any property that is not on Amazon — any web-based property, content sites, et cetera — and really grow that business through good SEO practices. As you said, looking at keyword strategies — is there any keyword strategy there, do they have good topical coverage, are they doing the basics to be able to rank well — because very few people are really doing that on-page SEO anymore; it's kind of amazing how quickly it's fallen out of favor. So let me ask you: if anybody wants to talk to you, what's the best way to reach you?

Corey: Sure. They can pop onto our website, which is just northcutt.com, drop me an email at corey@northcutt.com, or follow me on Twitter at corey_northcutt — my first name and last name. Any of those work.

Mark: All right, good. We'll link to those on the podcast page in the show notes so everyone can take a look. And again, we don't get kickbacks from guests or anything like that, but we do refer people we've used successfully in the past, and yours is definitely a service I've gotten good value out of. I know you did some work for me — I think it was back in October your group did some work for me — and those pages are doing quite well now, so thank you for that. I never gave you an update on that; they're doing pretty well.

Corey: Yeah, sure.

Mark: Yeah, so thanks so much for coming on. I think this is an interesting topic and maybe one that we need to explore again in the future.
Corey: Oh, I'd like that.

Mark: All right, thanks Corey.

Corey: Thanks, Mark.
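To make the thin-content cleanup Corey describes a bit more concrete, here is a minimal, hypothetical Python sketch (not from the episode) that checks whether a handful of pages already carry a robots noindex meta tag — the "code in the source that says keep us out of Google's index" he mentions. The example URLs are placeholders, and a real audit would also need to account for the X-Robots-Tag HTTP header and robots.txt.

```python
"""Minimal sketch, assuming hypothetical URLs: flag pages missing a robots noindex tag."""
from html.parser import HTMLParser
from urllib.request import urlopen


class RobotsMetaParser(HTMLParser):
    """Collects the content of any <meta name="robots" ...> tags in a page."""

    def __init__(self):
        super().__init__()
        self.robots_directives = []

    def handle_starttag(self, tag, attrs):
        if tag.lower() != "meta":
            return
        attr_map = {k.lower(): (v or "") for k, v in attrs}
        if attr_map.get("name", "").lower() == "robots":
            self.robots_directives.append(attr_map.get("content", "").lower())


def is_noindexed(url: str) -> bool:
    """Return True if the page at `url` declares a robots noindex directive."""
    with urlopen(url, timeout=10) as resp:
        html = resp.read().decode("utf-8", errors="replace")
    parser = RobotsMetaParser()
    parser.feed(html)
    return any("noindex" in directive for directive in parser.robots_directives)


if __name__ == "__main__":
    # Hypothetical thin pages you suspect add no search value (e.g. user profiles).
    pages = [
        "https://example.com/profile/12345",
        "https://example.com/tag/misc",
    ]
    for page in pages:
        status = "noindex" if is_noindexed(page) else "indexable"
        print(f"{status:10s} {page}")
```

The sketch uses only the Python standard library so it runs anywhere; at scale, a crawler or a site: search would surface the same thin-page candidates more quickly.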

Tech Policy Podcast
#113: Wikipedia for Data

Tech Policy Podcast

Play Episode Listen Later Aug 12, 2016 25:38


Most people are familiar with Wikipedia, but there's a lot more to "open data" than the convenience of checking how tall your favorite Olympic athlete is. Open databases can play a key role in supporting research and innovation, but they also raise questions about intellectual property and fair compensation for creators. How are databases like Wikidata regulated in Europe, and how does that approach differ from the U.S.? Julia Schuetze, a Euromasters student and tech strategist at Wikimedia Germany, joins the show.