Free knowledge database project
POPULARITY
Vi påar populär tävling Jan tävlingsleder, zvarar på zvåra bildfrågor, undrar om underlig artikel, varnar för viktigt val, och önskar specifika nya illustrationer. Shownotes Välkommen Veckans tävling Aktuellt från svenska Wikipedia Zara Larssons huvudbild Johan Jönssons blogg om bildval Relationen mellan kronprinsessan Mette-Marit av Norge och Jeffrey Epstein Wikimediarörelsen internationellt Val av Stewards pågår Veckans mall Jan: Illustrationsbehov Wikifikor, meetups och träffar i närtid Torsdag: Frågestund om nya relevanskriterieförslaget på Wikidata Fredag: DCW Conversation hour med Wikimedia Foundations kommunikationschef Fredag: Language and Product Localization Community meeting Erkännanden Bild: Hellomoto100, CC BY-SA 4.0 Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Vi lirkar isär Lennartar, brukar allehanda, talar om tunga termer, rättar upp relevansen, lobbar för lobbying och arkiverar anteckningar. Shownotes Rösta på årets nyuppladdade bild! Aktuellt från svenska Wikipedia Gammal sammanblandning utredd Jordbruk och eller lantbruk Finns elefantkyrkogården? Wikimediarörelsen internationellt Nytt förslag på relevanskriterier för Wikidata Nya policies för hur Wikipedia kan användas i lobbying Dags att spara dina etherpads Veckans mall Magnus: ping Wikifikor, meetups och träffar i närtid Lördag: Introduktion till OpenRefine Tisdag: Global GLAM call Tisdag: Mer om evenemangsverktyget Erkännanden Bild: Owltom at German Wikipedia, CC BY-SA 3.0 Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Vi kraxar om kartor, ojar oss om omdirigeringar, bladdrar om bundlad bevakningslista och hälsar välkomna till Jju och Nawat.[källa behövs] Shownotes Aktuellt från svenska Wikipedia Kartfunktionen mår inte så bra Flytt efter död? Wikimediarörelsen internationellt Nu kan man använda egendefinierade etiketter för att filtrera artiklar i bevakningslistan Nya Wikipediaspråkversioner: på Jju på Nawat Veckans mall Jan: Källa behövs Wikifikor, meetups och träffar i närtid Måndag: Make your first Query on Wikidata using SPARQL Erkännanden Bild: Text elements, CC0 Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Vi rabblar referenser, mumlar om mallar, antar att abstraktionen kommer, pratar om PTAC och kollar på koordinater. Shownotes Aktuellt från svenska Wikipedia Re-using references with different details – introducing sub-referencing Malla inte etablerade användare Wikimediarörelsen internationellt Testversion av Abstrakta Wikipedia kanske redan i första kvartalet, men hur borde det fungera? Tyck till om Product and Technology Advisory Council Veckans mall Magnus: Mall:coord Wikifikor, meetups och träffar i närtid Fredag: Wikimania 2026 Program Orientation Session Söndag: Cybersecurity Literacy For Wikimedians Måndag: Why and how to add data to Wikidata Måndag: Wikifunctions volontärshörn Erkännanden Bild: Claudius Clavis, CC0 Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Vi analyserar AI-användning, kollar kronologin, krockar konsonanter, uppmuntrar förslag och mobilredigerar Wikidata. Shownotes Aktuellt från svenska Wikipedia Förslag: Använda AI som analysverktyg före publicering av Wikipedia-artiklar Avlidna upp och ner Världsstjärnor Wikimediarörelsen internationellt Dags att föreslå sessioner till Wikimania Beta för redigering av Wikidata på mobilen Veckans mall Jan: Mall:Släktträd Wikifikor, meetups och träffar i närtid Erkännanden Bild: Etienne Tremblay, CC BY 3.0 Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Vi spanar framåt på ett 2026 som känns oförutsägbart och kaotiskt. Sen gissar vi vilt och lovar ännu vildare. Shownotes Trendspaning AI, fortsatt Mognadsår? WMF:s nya verksamhetschef Mer konkreta hot mot Wikimedia Foundation från den amerikanska administrationen Någon slags global mall i Wikifunctions får stor uppskattning Gissningar Antal artiklar vid årets slut (2017-01-01: 3 783 949, 2017-12-31: 3 788 406, 2018-12-31: 3 761 589, 2019-12-31: 3 745 330, 2020-12-31: 3 459 341, 2021-12-31: 2 769 496, 2022-12-31: 2 558 544. 2023-12-31: 2 573 444, 2024-12-31: 2 600 381, 2025-12-31: 2 621 078. Wikipedia: Statistik) Magnus: 2 645 172 Jan: 2 645 555 Antal aktiva användare vid årets slut (5 eller fler redigeringar, 2019-12: 1 018, 2020-12: 742, 2021-12: 683, 2022-12: 678, 2023-12: 643, 2024-12: 608, 2025-12: 564.Wikimediastatistik: 16 november 2024 – 1 januari 2026.)) Magnus: 532 Jan: 606 Antal administratörer vid årets slut (2019: 59, 2020: 61, 2021: 66, 2022: 66, 2023: 67, 2024: 66, 2025: 63. XTools: Admin Stats 1 januari – 31 december 2025.) Magnus: 65 Jan: 65 Totalt antal sidvisningar (2016: 1 617 963 712, 2017: 1 692 755 900, 2018: 1 592 055 771, 2019: 1 748 696 000, 2020: 1 685 603 596, 2021: 1 691 706 125, 2022: 1 560 812 002, 2023: 1 578 417 128, 2024: 1 471 473 949, 2025: 1 628 449 292. Wikimediastatistik: Totalt antal sidvisningar 1 januari 2025 – 1 januari 2026.) Magnus: 1 599 304 111 Jan: 1 800 000 000 Tre mest besökta sidorna (2021: Sverige, Lista över namnsdagar i Sverige i datumordning, Opinionsundersökningar inför riksdagsvalet i Sverige 2022, 2022: Nato, Opinionsundersökningar inför riksdagsvalet i Sverige 2022, Clark Olofsson, 2023: Sverige, Börje Salming, Loreen, 2024: Helikopterrånet i Västberga, Sverige, Sven-Göran Eriksson, 2025: Jan Stenbeck, Ed Gein, Sverige. Toppvisningsanalys 2025.) Magnus: Sverige, Gösta Engzell och Riksdagsvalet i Sverige 2026 Jan: Sverige, ännu ej skrivna artikeln om Trump och Grönland samt den just påtänkta artikeln om ännu ej skedda AI bubblan Näst mest redigerade sidan (2021: Bröderna Malmströms metallvarufabrik, 2022: Eritrea, 2023: Opinionsundersökningar inför riksdagsvalet i Sverige 2026, 2024: Gruppindelning av hundraser, 2025: 2025 [1]) Magnus: Opinionsundersökningar inför riksdagsvalet i Sverige 2026 Jan: Nato Nyårslöften Magnus: 350 dagar med redigering i artikelnamnrymden och > 199 dagar i rad. Jan: Hjälpa till att integrera/importera nya funktioner/tekniska mallar/moduler tvärs över projekten, en per kvartal. Erkännanden Bild: Ashleigh Thompson, CC BY-SA 2.0. Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Wikipediapodden med din återkommande årskrönika som sammanfattar året 2025 i världens största uppslagsverk! Shownotes Kort presentation av oss och vad vi gjort under året. Year in Review-verktyg Gissningar i början av året Antal artiklar vid årets slut (2017-01-01: 3 783 949, 2017-12-31: 3 788 406, 2018-12-31: 3 761 589, 2019-12-31: 3 745 330, 2020-12-31: 3 459 341, 2021-12-31: 2 769 589, 2022-12-31: 2 558 544, 2023-12-31: 2 573 444, 2024-12-31: 2 600 381, 2025-12-31: 2 621 078. Wikipedia: Statistik) Magnus: 2 635 172 Jan: 2 635 000 Antal aktiva användare vid årets slut (5 eller fler redigeringar, 2019-12: 1 018, 2020-12: 742, 2021-12: 683, 2022-12: 678, 2023-12: 643, 2024-12: 608, 2025-12: 564.Wikimediastatistik: 16 november 2024 – 1 januari 2026.) Magnus: 583 Jan: 620 Antal administratörer vid årets slut (2019: 59, 2020: 61, 2021: 66, 2022: 66, 2023: 67, 2024: 66, 2025: 63. XTools: Admin Stats 1 januari – 31 december 2025.) Magnus: 68 Jan: 67 Totalt antal sidvisningar (2016: 1 617 963 712, 2017: 1 692 755 900, 2018: 1 592 055 771, 2019: 1 748 696 000, 2020: 1 685 603 596, 2021: 1 691 706 125, 2022: 1 560 812 002, 2023: 1 578 417 128, 2024: 1 471 473 949, 2025: 1 628 449 292. Wikimediastatistik: Totalt antal sidvisningar 1 januari 2025 – 1 januari 2026.) Magnus: 1 451 563 534 Jan: 1 350 000 000 Tre mest besökta sidorna (2021: Sverige, Lista över namnsdagar i Sverige i datumordning, Opinionsundersökningar inför riksdagsvalet i Sverige 2022, 2022: Nato, Opinionsundersökningar inför riksdagsvalet i Sverige 2022, Clark Olofsson, 2023: Sverige, Börje Salming, Loreen, 2024: Helikopterrånet i Västberga, Sverige, Sven-Göran Eriksson, 2025: Jan Stenbeck, Ed Gein, Sverige. Toppvisningsanalys 2025.) Magnus: Sverige, Gösta Engzell, Lista över namnsdagar i Sverige i datumordning Jan: Sverige, Dubbelmordet i Linköping, Chat GPT Näst mest redigerade sidan (2021: Bröderna Malmströms metallvarufabrik, 2022: Eritrea, 2023: Opinionsundersökningar inför riksdagsvalet i Sverige 2026, 2024: Gruppindelning av hundraser, 2025: 2025 [1]) Magnus: Gösta Engzel Jan: Donald Trump Nyårslöften – hur gick det? Magnus: 350 dagar med redigering i artikelnamnrymden och > 199 dagar i rad. Jan: Se till att Wikimedians for Sustainable Development uppfyller hälsokriterier, skapa listor med videos och diagram Trendspaningen – hur gick det? AI, såklart Diskussionssammanfattningar Verktyg med AI-stöd Ökat tryck att stoppa klotter från oinloggade Afrikanska gemenskaper tar, och får, mer plats Årets hook / soundbite Magnus: Avsnitt 324: Att självdö är inte så farligt, (21:39 ff) Jan: Avsnitt 307: Barbara-Streisand-effekten, (11:42 ff) Statistik Mest lästa artiklar Antal dagar överst på Topp-100 Hundra mest besökta sidorna från Sverige Administratörsaktivitet Vem redigerar flest dagar? Vem redigerar flest gånger? Politik Grokipedia ”Bara britterna tror att Wikipedia är en porrsajt” Meta / Årets klapp på axeln Wikipediapoddens namn, ofrivillig paus Att Wikipediapodden fick behålla namnet Årets viktigaste tekniknyhet: Temporära konton Årets diskussioner Svårt att undvika Diskussion:Kulturmarxism, låg-och högintensiv samtidigt. Som även fått fötter. Incidental exposure of woman's nipples in article where it is not relevant, moralpanik eller demonstativt beteende. Nästa program Spaning på 2026 Erkännanden Bild: Unbuttered Parsnip, CC BY-SA 4.0 Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Ljudklipp också från avsnitt 307 och 324 (länkade ovan), CC BY-SA 4.0 samt Tape Player – Rewind av Poulpy2.0, CC 0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Vi avrundar året med prat om plotter och prepositioner, vinster i domstolar och fototävlingar, kollar på kollegor och ställer oss frågande om verksamheten. Shownotes Välkommen Govdirectory vann Award for Projects of Social Benefit Aktuellt från svenska Wikipedia Plottriga kartor Masskjutningen på Bondi Beach Wikimediarörelsen internationellt Vinst i fransk domstol Vinnarbilder i Wiki Loves Earth (och vinnarvideo!) ZedWiki Podcast – ny podcast på engelska Tyck till tidigt om Wikimedia Foundations nästa verksamhetsplan Veckans mall Magnus: Tunnelbanestation med ingångar Wikifikor, meetups och träffar i närtid Nästa program Juluppehåll, vi är tillbaks 14 januari. Årskrönika, ge oss era favoritklipp. Erkännanden Bild: LMarianne, CC BY-SA 4.0 Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Also a clip from episode 1 of Zedwiki podcast. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Tillsammans med en gäst funderar vi på falna flaggor, buttrar om byrackor, tjenar på chefen, nämner namn vid namn, och gör en dubbeltagning. Shownotes Aktuellt från svenska Wikipedia Reser flagg Reser ragg Wikimediarörelsen internationellt Ny chef på Wikimedia Foundation Abstrakta Wikipedias nya namn är… Abstrakta Wikipedia! Alla kan nu aktivera tvåfaktorsautentisering Veckans mall Johannes: Tidningsref Wikifikor, meetups och träffar i närtid Fredag: Wikimedia LGBT+ 2025 avslutning Erkännanden Bild: 3247 , Public Domain Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Vi skanderar om Skanderna, yallar till YouTube, arkiverar aktualiteter, finner fel med Ai och hejar på nya projekt. Shownotes Aktuellt från svenska Wikipedia Vi drar till fj… skanderna? Wikipediadagen på YouTube Wikimediarörelsen internationellt Wikinews föreslås att arkiveras Forskning och tester av AI som hittar fel på Wikipedia Nya projekt: Wikisource på Minangkabau Wikipedia på Toki Pona Wikiquote på Nigeriansk pidgin Veckans mall Jan: Faktamall biografi WD Wikifikor, meetups och träffar i närtid Tisdag: Global GLAM call Onsdag: Wikimedia projects and smell-related content Torsdag: Conversation with Trustees Erkännanden Bild: Olaus Magnus – ”Historia de gentibus septentrionalibus”, Public Domain Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
This is an episode recorded at WikiconNL 2025 in Leiden. The focus of the episode is the conference, what it and events like this mean to the community and organizational aspects. Shownotes First we meet Kirsten Jansen, office manager of Wikimedia Nederlands who talks about the importance of a smoothly running back office and having fun while doing it. Then we meet Thamara Groenleer, project lead for the conference, who talks about how themes can emerge and how to create environment that enables the participants. Then we hear from Vera de Kok, official event photographer, who talks about what well documented events can mean. After that we widen the perspective a bit with Julia Brungs, Lead Community Relations Specialist at the Wikimedia Foundation, and talk about communication in the community, in general and at events like this. Staying with a Wikimedia Foundation connection, we talk with Jan-Bart de Vreede, former chair of the Foundation and current chair of Wikimedia Nederland, about the importance and value of meetings and what it means to the the chapter and the community. We keep on that theme when talking to Sandra Rientjes, Executive Director of Wikimedia Nederland, and also talk about leadership in the movement as she is the longest serving ED for a few more weeks. All episodes in English (podcast feed) Credits The music and sound clips are from Surf Shimmy Kevin MacLeod (incompetech.com) [CC BY 3.0], via Wikimedia Commons, Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, and the audio from Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Image: WM 25 WikiconNL 2000 x 475.jpg by Grtrwrks, CC BY-SA 4.0 Discuss the episode on the project’s talk page. The episode is also available on Wikimedia Commons.
Vi tripplar om AI och handlingsavsnitt i filmer, röstar igen men slipper bli omdirigerade och ser att det är ordning på ekonomin. Shownotes Aktuellt från svenska Wikipedia AI-skapade handlingsavsnitt i artiklar om filmer, utvecklades på wikipediavis till en trippel: Hur vet man att ett handlingsavsnitt inte är AI-skapat med risk för ”hallucinationer”? Får man verkligen använda verket som källa? Är det verkligen ok att spoila, eller varför varnar vi inte? Wikimediarörelsen internationellt Rösta på nytt projektnamn Ingen mer omdirigering till mobildomän Wikimedia Foundations 24/25 ekonomiska rapport Veckans mall Magnus: Infobox tunnelbanestation Wikifikor, meetups och träffar i närtid Fredag: Digital Safety Office Hours Lördag: World AIDS Day 2025 online Erkännanden Bild: Sebastian Kraft, GNU General Public License Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Ein bisschen später als ursprünglich geplant tauchen wir wieder aus der Versenkung und begrüßen heute Jens Ohlig in der Runde, der über lange Zet das Geschehen bei Wikimedia mit geprägt hat und maßgeblich an der Entwicklung von Wikidata beteiligt war. Mit ihm sprechen wir über den Status Quo des Projekts und vor allem über die Herausforderung durch AI und LLMs. Vorher haben wir noch zahlreiche Tips für Dinge, die man eigentlich nicht braucht und reden viel über Südafrika, Schach, die Vision Pro und Smart Home. Also mal wieder für jeden was dabei, ein Podcast für die ganze Familie!
Vi ondgör oss på oinloggades beskrivningsklotter, tänker på tidlöshet, upplyser om utseendeförändringar, vinkar åt vinnare och mumlar om modeller. Shownotes Aktuellt från svenska Wikipedia Wikidata-beskrivningar från oinloggade Numera A new look for talk pages Wikimediarörelsen internationellt Vinnare i valet till Wikimedia Foundations styrelse Utkast för ny modell för affiliates Veckans mall Jan: Årtal Ämne Wikifikor, meetups och träffar i närtid Torsdag: Wikisource Transcribe-a-thon Söndag: Capacity Exchange community calls Tisdag Connection learning session 3 Erkännanden Bild: Mobilos, CC BY 3.0 Musiken och ljudklippen är från Surf Shimmy Kevin MacLeod (incompetech.com), CC BY 3.0, samt Wikimedia Sound Logo Finalist VQ97, Thaddeus Osborne, CC BY-SA 4.0, och ljudet från Wikidata’s 10th birthday video intro animation, Lea Lacroix (WMDE), CC BY-SA 4.0 och WP25 Anthem video.webm, CC BY-SA 4.0. Avsnittet hittas också på Wikimedia Commons. Diskutera avsnittet på projektsidans diskussion.
Send us a textMost brands still try to “tell” Google who they are. We show how Google actually decides: by stitching together a ledger of facts from your site, LinkedIn, Wikipedia, news articles, and structured data—then trusting only what aligns. This is the Knowledge Graph at work, and it's quietly steering whether you earn a knowledge panel, sitelinks, and richer visibility across search.We break down the four streams feeding the graph—public web pages, licensed datasets, human‑edited knowledge bases like Wikidata, and direct owner signals via schema.org—and explain how each contributes to a confidence score for your entity. If your about page says Jane Doe is CEO but LinkedIn shows John Smith, the score drops and your brand becomes ambiguous. If your website, LinkedIn, reputable press, and Wikidata all agree, trust rises and your facts become “truth” in search.From there, we get specific about what you can control. Use schema.org to describe your organisation, people, products, and identifiers in clear, machine‑readable terms. Link out with sameAs to authoritative profiles so Google can triangulate identity. Audit your knowledge panel as a live diagnostic: check logos, dates, roles, and categories, and chase down any mismatch to the original source. Treat digital PR and reputation management as part of technical SEO—because today they are.By the end, you'll have a practical checklist for entity hygiene that helps you earn and keep a clean knowledge panel, avoid costly confusion, and unlock higher‑trust features across the results page. If this helped clarify how entities power modern SEO, subscribe, share with a colleague, and leave a quick review with one takeaway you'll act on next.SEO Is Not That Hard is hosted by Edd Dawson and brought to you by KeywordsPeopleUse.com Help feed the algorithm and leave a review at ratethispodcast.com/seo You can get your free copy of my 101 Quick SEO Tips at: https://seotips.edddawson.com/101-quick-seo-tipsTo get a personal no-obligation demo of how KeywordsPeopleUse could help you boost your SEO and get a 7 day FREE trial of our Standard Plan book a demo with me nowSee Edd's personal site at edddawson.comAsk me a question and get on the show Click here to record a questionFind Edd on Linkedin, Bluesky & TwitterFind KeywordsPeopleUse on Twitter @kwds_ppl_use"Werq" Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0 Licensehttp://creativecommons.org/licenses/by/4.0/
Send us a textKeywords don't tell the whole story—entities do. We take you inside the three-step process machines use to read your content like a detective at a crime scene: highlighting potential entities, using context to resolve ambiguity, and linking each mention to a unique identifier in a global knowledge base. By the end, you'll see why “Jordan” only makes sense when surrounded by the right clues—and how to present those clues so search engines and AIs make the right call every time.We start with named entity recognition, the digital highlighter that picks out people, organisations, products, places, and dates across unstructured text. Then we move to entity disambiguation, where context—co-occurring teams, locations, or concepts—guides the system to the correct meaning. Finally, we close with entity linking, the moment a string becomes a node with a library card in Wikipedia or Wikidata. That linkage is the bridge into Google's Knowledge Graph, powering features like knowledge panels and richer, more confident results.Along the way, we dig into why Wikipedia and Wikidata matter far beyond vanity. Accurate, well-sourced entries create a feedback loop that improves how machines understand your brand, your founders, and your products. If you don't meet notability yet, don't force it; build authority elsewhere with consistent profiles, structured data, and content that names and connects related entities. We also share a simple action: search for your brand, founder, and main product on Wikipedia and Wikidata and assess accuracy. Want more like this? Follow the show, share it with a colleague, and leave a review so we can help more teams make sense of entity-first SEO.SEO Is Not That Hard is hosted by Edd Dawson and brought to you by KeywordsPeopleUse.com Help feed the algorithm and leave a review at ratethispodcast.com/seo You can get your free copy of my 101 Quick SEO Tips at: https://seotips.edddawson.com/101-quick-seo-tipsTo get a personal no-obligation demo of how KeywordsPeopleUse could help you boost your SEO and get a 7 day FREE trial of our Standard Plan book a demo with me nowSee Edd's personal site at edddawson.comAsk me a question and get on the show Click here to record a questionFind Edd on Linkedin, Bluesky & TwitterFind KeywordsPeopleUse on Twitter @kwds_ppl_use"Werq" Kevin MacLeod (incompetech.com)Licensed under Creative Commons: By Attribution 4.0 Licensehttp://creativecommons.org/licenses/by/4.0/
When most people think of Wikipedia, they picture an endless scroll of human-readable pages. But there's another side to this ecosystem, one designed not just for people but also for machines. It's called Wikidata, and if you haven't heard of it, that's exactly why this conversation matters. In this episode of Tech Talks Daily, I'm joined by Lydia Pintscher, Wikidata Portfolio Manager at Wikimedia Deutschland, for a deep look into how structured, open data is quietly powering civic tech, cultural preservation, and knowledge equity across the globe. Wikidata is the backbone that helps turn static knowledge into something living, adaptable, and scalable. With over 117 million items, 1.65 billion semantic statements, and more than 2.34 billion edits, it's become one of the largest collaborative datasets in the world. But it's not just the size that makes it impressive. It's what people are doing with it. Lydia shares how volunteers and developers are building tools for everything from investigative journalism to public libraries, all without needing deep pockets or proprietary infrastructure. This isn't big tech. It's a global, grassroots movement making open data work for the public good. We explore how tools like Toolforge and the Wikidata Query Service lower the barrier to entry, allowing civil society groups to build sophisticated applications that would otherwise be out of reach. Whether it's helping connect citizens to government services or preserving disappearing languages, the use cases are multiplying fast. Lydia also reflects on how Wikidata fosters a sense of purpose for contributors, offering a rare example of what many call the good internet, where collaboration outweighs competition and building something meaningful beats chasing virality. If you're curious about where open knowledge is headed, how structured data can be a force for social impact, or why Wikidata might be the most important project you've never fully explored, this episode offers a window into a future where machines help humans build something better, together.
Join Charna Parkey as she welcomes Anastasia Stasenko, CEO and co-founder of pleias, through her unique journey from philosophy to building open-source, energy-efficient LLMs. Discover how pleias is revolutionizing the AI landscape by training models exclusively on open data and establishing a precedent for ethical and socially acceptable AI. Learn about the challenges and opportunities in creating multilingual models and contributing back to the open-source community. QUOTES[00:00:00] Introducing Anastasia and pleias[00:02:00] From Philosophy to AI[00:06:00] The Problem of Generic Models[00:10:00] Open Weights vs. Open Source vs. Open Science[00:14:00] Why Open Data Matters[00:18:00] High-Quality, Specialized Models[00:22:00] Multilingual Challenges[00:26:00] Global Inclusion Requires Small Models[00:30:00] Using and Contributing to Wikidata[00:38:00] The Future: Specialized Models[00:48:00] Advice for Newcomers[00:54:00] Cultural Sensitivity and Data Representation[00:50:00] Leo's Takeaways[00:52:00] Charna on Ethical, Verifiable AI[00:54:00] Representation vs. Exclusion[00:56:00] Letting People Be More Human[00:57:30] Applied, Transformative AIQUOTESCharna:"If you didn't make it represented in the data, then we're leaving another culture behind... So which one are you wanting to do, misrepresent them or just completely leave them behind from this technical revolution?"Anastasia:"The real issue now is that the lack of diversity in the current AI labs leads to the situation where all LLMs look alike."Anastasia:"Being able to design, to find, and also to create the appropriate data mix for large language models is something that we shouldn't really forget about when we talk about the success of what large language models are."
Guest Robert Douglass Panelist Richard Littauer | Abby Cabunoc Mayes Show Notes In this episode of Sustain, hosts Richard Littauer and Abby Cabunoc Mayes speak with Robert Douglass, Entrepreneur in Residence at Open Strategy Partners, to delve into sustaining open source projects. They explore Robert's extensive history with Drupal, the role of Open Strategy Partners, and the innovative Drupal Certified Partner Program designed to address the maker-taker dilemma in open source. The episode also covers the recently launched RFP templates aimed at promoting open source software and certified partners. Robert shares insights on gamification, the economic aspects of contributing to Drupal, and future initiatives to ensure the continued sustainability of open source projects. Hit download now to hear more! [00:01:49] Robert shares his background in the Drupal ecosystem and his involvement with Open Strategy Partners, which provides strategic content marketing for B2B tech companies focusing on open source. [00:02:43] Robert explains Open Strategy Partners' focus on supporting open source projects and mentions clients like DDEV and TYPO3. [00:04:06] Richard and Robert discuss what it means to be an entrepreneur in residence, with Robert explaining his role in developing new products for Open Strategy Partners and the books he has written. [00:05:52] Robert reflects on the early days of Drupal and the challenges in making open source sustainable. He notes how the community was initially driven by passion, with few paid opportunities. [00:08:05] Robert introduces the Drupal Certified Partner Program, a system for supporting Drupal sustainability by encouraging companies to contribute both time and money. [00:10:03] The conversation covers how Drupal's contribution system gamifies the support companies provide to the ecosystem. Companies can earn contribution credits, which are visible on Drupal.org and benefit their reputation. [00:15:41] Abby asks about the potential downsides of gamification, especially regarding diversity. Robert explains how placing the system at the company level may mitigate some negative impacts. [00:18:17] Richard inquires about the financial structure of the Drupal Certified Partner Program. Robert clarifies that the funds collected support the Drupal Association's core mission, including maintaining Drupal.org and organizing events. [00:21:33] Robert discusses the development of RFP (Request for Proposal) templates to encourage companies to consider certified open source providers, explaining how this initiative promotes sustainability in the ecosystem. [00:25:56] Robert describes how the RFP templates allow purpose-driven organizations to incorporate open source values in their procurement process, aligning with their missions. [00:27:00] Robert invites listeners to explore and utilize the RFP templates, which are available under a Creative Commons Zero license, encouraging others to adapt and improve them. [00:29:47] Find out where you can follow Robert and his work online. Quotes [00:08:57] “Open Source is like a free puppy.” Spotlight [00:30:30] Abby's spotlight is Common Sort thrift shop in Toronto. [00:30:52] Richard's spotlight is Wikidata. [00:31:21] Robert's spotlight is Chad Whitacre and Sentry. Links SustainOSS (https://sustainoss.org/) podcast@sustainoss.org (mailto:podcast@sustainoss.org) richard@sustainoss.org (mailto:richard@sustainoss.org) SustainOSS Discourse (https://discourse.sustainoss.org/) SustainOSS Mastodon (https://mastodon.social/tags/sustainoss) Open Collective-SustainOSS (Contribute) (https://opencollective.com/sustainoss) Richard Littauer Socials (https://www.burntfen.com/2023-05-30/socials) Abby Cabunoc Mayes X (https://x.com/abbycabs?lang=en) Robert Douglass LinkedIn (https://www.linkedin.com/in/roberttdouglass/) Open Strategy Partners (https://openstrategypartners.com/) Open Strategy Partners Blog (https://openstrategypartners.com/blog/) Building Online Communities with Drupal, phpBB, and WordPress by Robert Douglass, Mike Little, Jared W. Smith (https://www.drupal.org/node/1850002) Drupal Certified Partner Program (https://www.drupal.org/association/become-a-drupal-certified-partner) Drupal (https://www.drupal.org/) How to Write an RFP for Open Source Solutions: Featuring Drupal Certified Partners (https://www.drupal.org/association/blog/how-to-write-an-rfp-for-open-source-solutions-featuring-drupal-certified-partners) OSP: Supporting Drupal Certified Partners (https://openstrategypartners.com/blog/osp-supporting-drupal-certified-partners/) Sustain Podcast-Episode 148: Ali Nehzat of thanks.dev and OSS Funding (https://podcast.sustainoss.org/148) Common Sort (https://commonsort.com/) Wikidata (https://www.wikidata.org/wiki/Wikidata:Main_Page) Chad Whitacre LinkedIn (https://www.linkedin.com/in/chadwhitacre/) Sentry (https://sentry.io/welcome/) Credits Produced by Richard Littauer (https://www.burntfen.com/) Edited by Paul M. Bahr at Peachtree Sound (https://www.peachtreesound.com/) Show notes by DeAnn Bahr Peachtree Sound (https://www.peachtreesound.com/) Special Guest: Robert Douglass.
In dieser Folge spricht Jacqueline Klusik-Eckert mit den Kunsthistorikerinnen Anna Gnyp und Maria Merseburger über das Verhältnis von Wikimedia und der Kunstgeschichte. Gemeinsam diskutieren sie, wie Wikipedia und Wikidata inzwischen zu wertvollen Ressourcen für kunsthistorische Forschung geworden sind und warum die aktive Mitgestaltung dieser Plattformen durch Fachwissenschaftler*innen unter dem Aspekt der Wissensgerechtigkeit wichtig ist. Dabei wird auch die Arbeit der AG Kuwiki vorgestellt, die mit mehreren Projekten die Sichtbarkeit kunsthistorischen Wissens auf Wikipedia fördert: Das „Living Handbook“ bietet eine Einführung in die Wikipedia-Arbeit für Kunsthistoriker. „Wikipedia in der Lehre“ zielt darauf ab, Studierende frühzeitig für die Plattform zu sensibilisieren und aktiv einzubinden. Und „Kuwiki Loves Monuments, too“ fördert die Dokumentation und Verbreitung von Bildern zu Denkmälern und Kulturgütern. Ein wichtiges Anliegen ist dabei die Wissensgerechtigkeit, um mehr Diversität auf Wikipedia und Wikimedia Commons zu erreichen.Das Gespräch beleuchtet auch die wachsende Bedeutung von Wikidata als datenbankgestützte Ressource, die zunehmend in digitalen kunsthistorischen Projekten genutzt wird. Anna und Maria zeigen auf, wie Museen, Archive und Bibliotheken von Wikidata und Wikimedia Commons profitieren können, um ihre Bestände öffentlich zugänglich zu machen und neue Vernetzungen zu schaffen. Abschließend plädieren sie für stärkere Kooperationen und „Best Practice“-Beispiele, die die Arbeit mit Wikimedia-Projekten in der Kunstwissenschaft festigen und bereichern können.Anna Gnyp, ist seit knapp zwei Jahren Mitglied der Arbeitsgemeinschaft. Aktuell ist sie Wissenschaftlerin im Datenkompetenzzentrum Sammlungen, Objekte, Datenkompetenz an der Humboldt-Universität Berlin. Das ist ein Verbundprojekt zum Aufbau eines Datenkompetenzzentrums für wissenschaftliche Universitätssammlungen.Dr. Maria Merseburger, ist seit Beginn im der AG KUwiki, hier unter dem Namen Karatecoop und aktuell Wissenschaftlerin am Museum für Kommunikation in Berlin.Begleitmaterial zu den Folgen findest du auf der Homepage unter https://www.arthistoricum.net/themen/podcasts/arthistocast.Alle Folgen des Podcasts werden bei heidICON mit Metadaten und persistentem Identifier gespeichert. Die Folgen haben die Creative-Commons-Lizenz CC BY 4.0 und können heruntergeladen werden. Du findest sie unter https://doi.org/10.11588/heidicon/1738702.Bei Fragen, Anregungen, Kritik und gerne auch Lob kannst du uns gerne per Mail kontaktieren unter podcast@digitale-kunstgeschichte.de.
Could making Wikidata 'human' readable lead to better AI? A new project is underway to allow Large Language Models (LLMs) to read Wikidata. The data is currently structured in a way that's machine readable, but LLMs read data more like humans than machines, meaning this vast amount of human curated, high quality data isn't accessible to this type of AI. By allowing access to Wikidata, LLMs could become more reliable. Ania spoke to Lydia Pintscher, the Portfolio Lead Product Manager at Wikidata Deutschland, to learn more about these developments. Most news websites block AI Chatbots Two thirds of high quality news websites block AI chatbots from accessing their information, according to a report by the misinformation monitoring organisation NewsGuard. This means that some of the world's most popular AI chatbots could be collecting data on misinformation from low quality news sources and even conspiracy and hoax sites. The Enterprise Editor at NewsGuard is Jack Brewster and he is on the show to explain their findings. The programme is presented by Gareth Mitchell and the studio expert is Ania Lichtarowicz. More on this week's stories: Wikidata and Artificial Intelligence: Simplified Access to Open Data for Open-Source Projects AI Chatbots Are Blocked by 67% of Top News Sites, Relying Instead on Low-Quality Sources Support the show Editor: Ania Lichtarowicz Production Manager: Liz Tuohy Recording and audio editing : Lansons | Team Farner For new episodes, subscribe wherever you get your podcasts or via this link: https://www.buzzsprout.com/2265960/supporters/new Follow us on all the socials: Join our Facebook group Instagram Twitter/X If you like Somewhere on Earth, please rate and review it on Apple Podcasts Contact us by email: hello@somewhereonearth.co Send us a voice note: via WhatsApp: +44 7486 329 484 Find a Story + Make it News = Change the World Learn more about your ad choices. Visit megaphone.fm/adchoices
Could making Wikidata 'human' readable lead to better AI?A new project is underway to allow Large Language Models (LLMs) to read Wikidata. The data is currently structured in a way that's machine readable, but LLMs read data more like humans than machines, meaning this vast amount of human curated, high quality data isn't accessible to this type of AI. By allowing access to Wikidata, LLMs could become more reliable. Ania spoke to Lydia Pintscher, the Portfolio Lead Product Manager at Wikidata Deutschland, to learn more about these developments.Most news websites block AI ChatbotsTwo thirds of high quality news websites block AI chatbots from accessing their information, according to a report by the misinformation monitoring organisation NewsGuard. This means that some of the world's most popular AI chatbots could be collecting data on misinformation from low quality news sources and even conspiracy and hoax sites. The Enterprise Editor at NewsGuard is Jack Brewster and he is on the show to explain their findings.The programme is presented by Gareth Mitchell and the studio expert is Ania Lichtarowicz.More on this week's stories:Wikidata and Artificial Intelligence: Simplified Access to Open Data for Open-Source ProjectsAI Chatbots Are Blocked by 67% of Top News Sites, Relying Instead on Low-Quality SourcesSupport the showEditor: Ania LichtarowiczProduction Manager: Liz TuohyRecording and audio editing : Lansons | Team FarnerFor new episodes, subscribe wherever you get your podcasts or via this link:https://www.buzzsprout.com/2265960/supporters/newFollow us on all the socials:Join our Facebook groupInstagramTwitter/XIf you like Somewhere on Earth, please rate and review it on Apple PodcastsContact us by email: hello@somewhereonearth.coSend us a voice note: via WhatsApp: +44 7486 329 484Find a Story + Make it News = Change the WorldLearn more about your ad choices. Visit megaphone.fm/adchoices
Could making Wikidata 'human' readable lead to better AI? A new project is underway to allow Large Language Models (LLMs) to read Wikidata. The data is currently structured in a way that's machine readable, but LLMs read data more like humans than machines, meaning this vast amount of human curated, high quality data isn't accessible to this type of AI. By allowing access to Wikidata, LLMs could become more reliable. Ania spoke to Lydia Pintscher, the Portfolio Lead Product Manager at Wikidata Deutschland, to learn more about these developments. Most news websites block AI ChatbotsTwo thirds of high quality news websites block AI chatbots from accessing their information, according to a report by the misinformation monitoring organisation NewsGuard. This means that some of the world's most popular AI chatbots could be collecting data on misinformation from low quality news sources and even conspiracy and hoax sites. The Enterprise Editor at NewsGuard is Jack Brewster and he is on the show to explain their findings.The programme is presented by Gareth Mitchell and the studio expert is Ania Lichtarowicz.More on this week's stories: Wikidata and Artificial Intelligence: Simplified Access to Open Data for Open-Source ProjectsAI Chatbots Are Blocked by 67% of Top News Sites, Relying Instead on Low-Quality SourcesSupport the showEditor: Ania LichtarowiczProduction Manager: Liz Tuohy Recording and audio editing : Lansons | Team Farner For new episodes, subscribe wherever you get your podcasts or via this link:https://www.buzzsprout.com/2265960/supporters/newFollow us on all the socials: Join our Facebook group Instagram Twitter/X If you like Somewhere on Earth, please rate and review it on Apple PodcastsContact us by email: hello@somewhereonearth.coSend us a voice note: via WhatsApp: +44 7486 329 484Find a Story + Make it News = Change the World
Subscriber-only episodeCould making Wikidata human readable lead to better AI? A new project is underway to allow Large Language Models (LLMs) to read Wikidata. The data is currently structured in a way that's machine readable, but LLMs read data more like humans than machines, meaning this vast amount of human curated, high quality data isn't accessible to this type of AI. By allowing access to Wikidata, LLMs could become more reliable. Ania spoke to Lydia Pintscher, the Portfolio Lead Product Manager at Wikidata Deutschland, to learn more about these developments. Most news websites block AI ChatbotsTwo thirds of high quality news websites block AI chatbots from accessing their information, according to a report by the misinformation monitoring organisation NewsGuard. This means that some of the world's most popular AI chatbots could be collecting data on misinformation from low quality news sources and even conspiracy and hoax sites. The Enterprise Editor at NewsGuard is Jack Brewster and he is on the show to explain their findings.The programme is presented by Gareth Mitchell and the studio expert is Ania Lichtarowicz.More on this week's stories: Wikidata and Artificial Intelligence: Simplified Access to Open Data for Open-Source ProjectsAI Chatbots Are Blocked by 67% of Top News Sites, Relying Instead on Low-Quality SourcesEditor: Ania LichtarowiczProduction Manager: Liz Tuohy Recording and audio editing : Lansons | Team Farner For new episodes, subscribe wherever you get your podcasts or via this link:https://www.buzzsprout.com/2265960/supporters/newFollow us on all the socials: Join our Facebook group Instagram Twitter/X If you like Somewhere on Earth, please rate and review it on Apple PodcastsContact us by email: hello@somewhereonearth.coSend us a voice note: via WhatsApp: +44 7486 329 484Find a Story + Make it News = Change the World
During National Indigenous History Month, Dalhousie University Libraries hosted an edit-a-thon to improve Wikipedia and Wikidata content related to Mi'kmaw people and Mi'kma'ki. A total of 19 Wikipedia articles were edited, 50 references added, and more than 3,300 words were contributed. One of the organizers fills us in.
"Uvek sam radio samo ono što sam voleo i što me ložilo." Gost Ivana Minića u 258. epizodi Pojačala je dr Goran S. Milovanović, doktor psiholoških nauka koji nije pošao putem koji obično asociramo sa ovim poljem - psiholog, psihijatar ili psihoterapeut, već se specijalizovao za kognitivne i podatkovne nauke i machine learning. Polja koja zahtevaju duboko razumevanje ljudskog uma i razmišljanja, ali takođe i visok nivo tehnološkog znanja i stručnosti. Karijerni put dr Milovanovića mogao bi sam po sebi ispuniti tri epizode ove dužine, ali ova epizoda posvećena je njegovoj mladosti, priči o tome šta ga je stavilo na taj put, radu u Wikidata, najvećoj bazi znanja na svetu kao i kompaniji Smartocto koja se bavi prediktivnom analitikom u digitalnim medijima. Teme u epizodi: - Uvod - Početak razgovora - Kad porastem biću... - Dobri stari dani - Fakultetski dani - Fundamentalna nauka - Big data analytics - Kognitivne pristrasnosti - Wikidata I Smartocto - Problem digitalnih medija - Data kolektiv Realizacija Pojačalo podkasta ne bi bila moguća bez naših izuzetnih partnera: - Kompanija Epson koja je vodeći svetski proizvođač projektora i štampača za sve namene: https://www.epson.rs/sr_RS - Kompanija Orion telekom provajtera najbrže internet infrastrukture u Srbiji sa preko 30 godina iskustva: https://oriontelekom.rs Podržite nas na BuyMeACoffee: https://bit.ly/3uSBmoa Pročitajte transkript ove epizode: https://bit.ly/3PrfigS Posetite naš sajt i prijavite se na našu mailing listu: http://bit.ly/2LUKSBG Prijavite se na naš YouTube kanal: http://bit.ly/2Rgnu7o Pratite Pojačalo na društvenim mrežama: Facebook: http://bit.ly/2FfwqCR Twitter: http://bit.ly/2CVZoGr Instagram: http://bit.ly/2RzGHjN
Pedro se muda y nos cuenta de primera mano sus experiencias con fletes japoneses y después nos pone al día sobre las grandes ambiciones de Wikipedia y el proyecto Wikidata. Por último Javier le escribe una carta de amor a uno de sus músicos favoritos, Louis Cole, en forma de sección de podcast.
In dieser Folge spricht Jacqueline Klusik-Eckert mit Angela Kailus M.A. und Julia Rössel M.A. über die Rolle von Normdaten in der kunsthistorischen Forschung und Praxis. Der Ursprung von Normdaten hängt mit einem bibliothekarischen Systematisierungsbestrebungen in den 1970er Jahren zusammen. Wie hat sich der Umgang und die Konzepte von Normdaten im Zuge der Digitalisierung verändert? Mit einem Blick hinter die Kulissen der Gemeinsamen Normdatei (kurz GND) werden die Zusammenhänge von Identifikationsnummer und den dahinterliegenden Informationen erklärt. Welchen Mehrwert für die eigenen Daten erzielt man durch die Verwendung von Normdaten? Für welche Begriffe bzw. Entitäten gibt es Normdaten? Wo findet man sie? Woher kommt dieses Wissen und wie muss man mit dem Normdatensatz umgehen? Wir sprechen auch über den Unterschied eines institutionell gepflegten und autorisierten Normdatensatzes (GND über die Deutsche Nationalbibliothek) im Vergleich zu crowd-based Normdatensätzen (Wikidata).Darüber hinaus stellt sich die Frage, inwieweit die Verwendung von Normdaten bereits Einzug in das Fach Kunstgeschichte gehalten hat.Wir beleuchten die Herausforderung für sammelnde Institutionen bei der Erfassung von Objekten und der Anreicherung der Sammlungsdaten mit Normdaten. Welche Standards helfen bei der Erfassung und wofür soll man den Aufwand mit Normdaten überhaupt betreiben?Dabei nehmen wir unterschiedliche Szenarien im Datenlebenszyklus unter die Lupe.Wo begegnen wir als Forscher*innen diesen Daten, wie können wir sie nachnutzen und welche Verantwortung haben wir selbst als Produzent*innen von Forschungsdaten, wenn es um die Anreicherung der eigenen Daten mit Normdaten geht?Angela Kailus M.A. ist stellvertretende Leiterin des Deutschen Dokumentationszentrums für Kunstgeschichte – Bildarchiv Foto Marburg, Philipps-Universität Marburg und Ansprechperson bei NFDI4Culture im Arbeitsbereich Standardisierung und Datenqualität.Julia Rössel M.A. ist Kunsthistorikerin und Mitarbeiterin an der Deutschen Digitalen Bibliothek, Fachstelle Denkmalpflege, DDK, Marburg . Neben ihrer Promotion über „Wechsel der Mediensysteme – Graphische Sammlung und ihre digitale Übersetzung“ hat sie sich in den Bereichen Digitalisierung und Museum, Datenqualität und Standards spezialisiert.Begleitmaterial zu den Folgen findest du auf der Homepage unter https://www.arthistoricum.net/themen/podcasts/arthistocastAlle Folgen des Podcasts werden bei heidICON mit Metadaten und persistentem Identifier gespeichert. Die Folgen haben die Creative-Commons-Lizenz CC BY 4.0 und können heruntergeladen werden. Du findest sie unterhttps://doi.org/10.11588/heidicon/1738702 Bei Fragen, Anregungen, Kritik und gerne auch Lob kannst du gerne per Mail an uns schicken unterpodcast@digitale-kunstgeschichte.de
Vi klurar på hur andra källor återanvänder data från Wikipedia och Wikidata, hur referenslistor kan läggas till en masse, om den stora Vektoromställningen som nyss annonserades och ljud i rad.
In the first half of the last school year, PEN America has recorded almost 900 different books pulled from library shelves across the country. As long as libraries have existed, people have tried to police what goes in them. The burning of the Library of Alexandria is a metaphor that gets invoked any time we lose access to a treasure trove of books. But for centuries it has also inspired scientists and inventors, philosophers and programmers to dream about creating an ideal library, one that provides access to all the knowledge in the world. OTM producer Molly Schwartz goes to a birthday party for Wikidata at the Brooklyn Public Library, where she talks to Wikimedia New York City president Richard Knipel, Wikimedia software engineer James Forrester, and long-time Wikipedia editor Jim Henderson about how the free online encyclopedia has made strides toward providing knowledge to the sum of human knowledge. She also speaks with library historian Alex Wright, author of the book Glut: Mastering Information Through the Ages, and software engineering consultant Gyula Lakatos, creator of the Library of Alexandria application suite, about the history of universal library projects and what keeps the dream alive.
In the first half of the last school year, PEN America has recorded almost 900 different books pulled from library shelves across the country. As long as libraries have existed, people have tried to police what goes in them. The burning of the Library of Alexandria is a metaphor that gets invoked any time we lose access to a treasure trove of books. But for centuries it has also inspired scientists and inventors, philosophers and programmers to dream about creating an ideal library, one that provides access to all the knowledge in the world. OTM producer Molly Schwartz goes to a birthday party for Wikidata at the Brooklyn Public Library, where she talks to Wikimedia New York City president Richard Knipel, Wikimedia software engineer James Forrester, and long-time Wikipedia editor Jim Henderson about how the free online encyclopedia has made strides toward providing knowledge to the sum of human knowledge. She also speaks with library historian Alex Wright, author of the book Glut: Mastering Information Through the Ages, and software engineering consultant Gyula Lakatos, creator of the Library of Alexandria application suite, about the history of universal library projects and what keeps the dream alive.
Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist and EA communities into audio. This is: An LLM-based “exemplary actor”, published by Roman Leventov on May 29, 2023 on The AI Alignment Forum. Into and summary This post is the second section of "Aligning an H-JEPA agent via training on the outputs of an LLM-based "exemplary actor", posted separately because I think it could warrant a separate discussion, largely independent of the discussion of H-JEPA agent with GFlowNet actors. Here's the summary of this post, copied from the "Overview" section of the main article: In section 2, I describe the “exemplary actor”, an LMCA (language model cognitive architecture) that takes a simple, “brute force” approach to alignment: a powerful LLM (think GPT-5/6 level, with a vast, or quasi-unlimited context) is given a list of “approved” textbooks on methodological and scientific disciplines: epistemology, rationality, ethics, physics, etc. Also, the LLM is given tools: narrow AIs (such as for protein folding or for predicting properties of materials, or for formal scientific modelling). Finally, the LLM is given a compute engine such as Wolfram and a knowledge base such as Wikidata or Wolfram Knowledgebase. The exemplary actor creates plans or predictions for given situations (described in language and fed to the LLM underlying the exemplary actor as prompts) and iteratively critiques and refines its own plans and predictions while putting different textbooks into the LLM context (first, with the textbook on rationality, then epistemology, then physics, etc., with potentially dozens of different textbooks relevant for a plan or prediction that is being criticised), for many iterations, until convergence. In section 2.1, I note that the type of alignment that the exemplary actor's architecture tries to ensure is called (world) model alignment and that is stronger and also more essential than goal alignment. Then, I discuss the properties of the exemplary actor. In section 2.2., I discuss what I see as likely non-issues or straightforwardly addressable issues: the “divergent reasoning nature” of LLMs, the lack of grounded common sense reasoning, and the bias of the quick reactive network (”System 1”), it it is added to the architecture to make it more practically usable in lower-stakes reasoning settings. In section 2.3, I discuss the outstanding technical issues and risks of the exemplary actor's architecture: The risk of direct access to the underlying LLM (section 2.3.1). The exemplary actor's reasoning could still be partially directed by “alien” thinking patterns (i.e., the world model) of the underlying LLM even though these influences won't surface in the explanations of the plan (section 2.3.2). Iterated critique and refinement probably won't make plans strictly conforming to the theories described in the textbooks (section 2.3.3). In section 2.3.4, I discuss the alignment tax of the exemplary actor (compared with the baseline of a bare, minimally fine-tuned LLM) and conclude that the main source of alignment tax might happen to be the theory of ethics which may force the exemplary actor to refuse to participate in “games” (i.e., real-world situations and environments) where it doesn't see ethical ways of “winning”, and thus will consider inaction (or some form of palliative action) the only ethical way forward. This is not a technical problem with the exemplary actor per se, but rather a problem with a higher-level system, i.e., the current economic, social, and political structure of the world. I mention this and other kinds of “higher-level” risks of the plans to build and deploy the exemplary actor (i.e., roughly the plans that OpenAI and Anthropic are betting on, as it seems to me) in section 2.4. 2. An LLM-based “exemplary actor” Let's assume that we have three things: First, a very powerful auto-regressive LLM (think GPT-5/6 level) with the ability to effe...
A Vorta and a Bajoran Walk into a Spaceport You may best recognize this week's guest from her work on Babylon 5, Star Trek Deep Space Nine, JAG, NYPD Blue, Becker and more! Kitty Swink is our guest this week, and we are excited to have her join us! Kitty talks about her work in the performing arts over the years, her love for Star Trek and how its impacted her life, her fight with pancreatic cancer and survival along with her work with the Pancreatic Cancer Action Network and more! Be sure to check out her response to WikiData claiming they have proof that she's an actual human! You can find out more about the Pancreatic Cancer Action Network by visiting www.pancan.org For more information about Kitty Swink, please visit the following - Twitter - @kitswink For RSWOF Merch - https://www.teepublic.com/t-shirt/31938193-rswof?store_id=1397534 100% of all proceeds Benefit Wish Upon a Teen For direct contributions - www.wishuponateen.org Join our Discord! https://discord.gg/cpry4fCDTq FSF PopCast on Twitter and Instagram - @fsfpopcast Buy us Coffee - https://ko-fi.com/fsfpopcast For more on our show partners - Big Boy Graphics - www.etsy.com/shop/bigboygraphics Bridgework Studios - https://www.teepublic.com/user/bridgework-studios Level Up Sabers - https://shareasale.com/r.cfm?b=2018189&u=3289465&m=124959&urllink=&afftrack= Support The FSF PopCast by contributing to their tip jar: https://tips.pinecast.com/jar/funny-science-fiction
A Vorta and a Bajoran Walk into a Spaceport You may best recognize this week's guest from her work on Babylon 5, Star Trek Deep Space Nine, JAG, NYPD Blue, Becker and more! Kitty Swink is our guest this week, and we are excited to have her join us! Kitty talks about her work in the performing arts over the years, her love for Star Trek and how its impacted her life, her fight with pancreatic cancer and survival along with her work with the Pancreatic Cancer Action Network and more! Be sure to check out her response to WikiData claiming they have proof that she's an actual human! You can find out more about the Pancreatic Cancer Action Network by visiting www.pancan.org For more information about Kitty Swink, please visit the following - Twitter - @kitswink For RSWOF Merch - https://www.teepublic.com/t-shirt/31938193-rswof?store_id=1397534 100% of all proceeds Benefit Wish Upon a Teen For direct contributions - www.wishuponateen.org Join our Discord! https://discord.gg/cpry4fCDTq FSF PopCast on Twitter and Instagram - @fsfpopcast Buy us Coffee - https://ko-fi.com/fsfpopcast For more on our show partners - Big Boy Graphics - www.etsy.com/shop/bigboygraphics Bridgework Studios - https://www.teepublic.com/user/bridgework-studios Level Up Sabers - https://shareasale.com/r.cfm?b=2018189&u=3289465&m=124959&urllink=&afftrack= Support The FSF PopCast by contributing to their tip jar: https://tips.pinecast.com/jar/funny-science-fiction
Guest Jan Ainali Panelists Richard Littauer | Justin Dorfman Show Notes Hello and welcome to Sustain! The podcast where we talk about sustaining open source for the long haul. On this episode, we have an amazing guest, Jan Ainali, who's a steward at the Foundation for Public Code, where they develop tools, processes, and collecting best practices for community building. Earlier, he ran a consultancy called Open by Default and he was swept into the ‘open movement' through Wikipedia editing and leading him to cofound the Swedish Wikimedia chapter and become chairman and later, CEO. Today, we'll learn all the details about the Foundation for Public Code, the process of becoming a member, what sets them apart from others, and their Standard for Public Code that is for anyone who wants to prepare their code to be collaborated on. Also, Jan shares what's he looking forward to in the next few months about the standard and the Foundation for Public Code. Download this episode now to learn much more! [00:02:02] Jan tells us about the Foundation for Public Code, as well as how many member organizations it has. [00:03:43] With only one member currently, we find out if Jan is trying to get others on to work with the foundation, he explains the process of becoming of member with them, and the team sizes that are directly working with them. [00:07:02] We learn what sets apart the Foundation for Public Code from the other trans-provincial and trans-governmental organizations that are doing the work of InnerSource Commons but with politics. He also goes into the policies that have gone into code that he's worked on. [00:09:46] Wikimedia and Wikipedia have chapters, and Jan reveals how big his was, how many other chapters there are in the world, and the difference between them. [00:11:15] Find out who Sverker Johansson is and what he did. [00:13:12] Jan tells us more about the Standard for Public Code, what it is, how it applies, who wrote it, and we hear the 16 criteria for it. [00:18:13] Jan explains the “must be in plain English” requirement and what the global efforts are for the Foundation for Public Code. [00:20:13] We learn how Jan is making it beneficial for everyone to join in a way that helps them out as being super awesome and grow the network that way. [00:21:02] Has Jan gotten any pushback from developers in other places? [00:22:24] Jan tells us businesses he's working with to help push this initiative forward. [00:24:38] We hear if there's a sign on process for getting people to use this standard, and Jan talks about the accreditation process. [00:29:14] Richard asks Jan if he knows of other standards that are already in this space and what sets his apart from the others. [00:30:32] Jan explains their level of standards, as well as what he's most looking forward to in the next few months about the standard and the foundation. [00:32:54] Find out where you can follow Jan on the web. Quotes [00:03:19] “We really think if you collaborate with everyone, that's better than to collaborate with just a few." [00:07:22] “We only work with code bases with the public purpose where someone tries to put a policy into code. That's where we're a little bit narrower than others.” Spotlight [00:34:04] Justin's spotlight is WeasyPrint. It's open source and turns simple HTML pages into gorgeous PDF's and open source. [00:34:39] Richard's spotlight is Sverker Johansson. [00:35:10] Jan's spotlight is Denny Vrandečić, first project manager at Wikidata, and right now working on Abstract Wikipedia. Links SustainOSS (https://sustainoss.org/) SustainOSS Twitter (https://twitter.com/SustainOSS?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor) SustainOSS Discourse (https://discourse.sustainoss.org/) podcast@sustainoss.org (mailto:podcast@sustainoss.org) Richard Littauer Twitter (https://twitter.com/richlitt?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor) Justin Dorfman Twitter (https://twitter.com/jdorfman?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor) Jan Ainali Twitter (https://twitter.com/Jan_Ainali?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor) Jan Ainali Website (https://ainali.com/) Foundation for Public Code (https://publiccode.net/) Standard for Public Code (https://standard.publiccode.net/) WeasyPrint (https://weasyprint.org/) Standard for Public Code (Book) (https://github.com/publiccodenet/standard#generating-a-pdf-of-the-standard-for-public-code) Sverker Johansson-Wikipedia (https://en.wikipedia.org/wiki/Sverker_Johansson) [The Dawn of Language by Sverker Johansson](https://www.amazon.com/The-Dawn-of-Language/dp/1529411416/ref=tmmpapswatch0?encoding=UTF8&qid=&sr=) xkcd-Standards (https://xkcd.com/927/) Wikidata (https://www.wikidata.org/wiki/Wikidata:Main_Page) Abstract Wikipedia (https://meta.wikimedia.org/wiki/Abstract_Wikipedia) Credits Produced by Richard Littauer (https://www.burntfen.com/) Edited by Paul M. Bahr at Peachtree Sound (https://www.peachtreesound.com/) Show notes by DeAnn Bahr Peachtree Sound (https://www.peachtreesound.com/) Special Guest: Jan Ainali.
On this episode, I got the chance to sit down and chat about social justice in science with Dr. Sabah Ul-Hasan, a bioinformatician and postdoc at The Scripps Research Institute in San Diego under Dr. Andrew Su. They are currently working on the NIH-funded Wikidata biocuration project. Enjoy! --- Send in a voice message: https://podcasters.spotify.com/pod/show/fromwheredoesitstem/message
In this episode, we will talk to Siobhan Leachman, a citizen scientist and prolific Wikidata editor. Rebecca tries to figure out which WikiFauna represents her best. And our Wiki hero is the incoming CEO Maryana Iskander. Logo design by Trish O'Flaherty: https://www.trishoflahertydesign.com/ Twitter: https://twitter.com/world_wikipedia Website: https://www.headstuff.org/the-world-according-to-wikipedia/ This show is part of the HeadStuff Podcast Network. For more, go to HeadStuffPodcasts.com, where you can also become a member of HeadStuff+ and get exclusive access to bonus material and lots more.
In this episode, we will talk to Jan Ainali, a Codebase Steward for the Foundation for Public Code and a complete Wikidata fanboy. The wiki-rule of the episode is that no-one owns an article and we celebrate the many Wikimedians of 2021, including past guest Netha Hussain. Some interesting links from the Interview with Jan: https://www.datastory.org/services/election-tracker https://www.govdirectory.org/ Logo design by Trish O'Flaherty: https://www.trishoflahertydesign.com/ Twitter: https://twitter.com/world_wikipedia Website: https://www.headstuff.org/the-world-according-to-wikipedia/ This show is part of the HeadStuff Podcast Network. For more, go to HeadStuffPodcasts.com, where you can also become a member of HeadStuff+ and get exclusive access to bonus material and lots more.
Projecting and tracking COVID-19 infection rates in England with R, leveraging Wikidata to tag scientific abstracts, and a new deep-learning workflow with the luz package Episode Links This week's curator: Robert Hickman (@robwhickman (https://twitter.com/robwhickman)) Tracking SARS-CoV-2 In England with {epidemia} (https://imperialcollegelondon.github.io/epidemia/articles/multiple-obs.html) Tagging the Scientific Abstracts with Wikidata Items (https://dwayzer.netlify.app/posts/2021-06-15-tagging-the-abstracts-with-wikidata-items) Que haja luz: More light for torch! (https://blogs.rstudio.com/tensorflow/posts/2021-06-17-luz) Entire issue available at rweekly.org/2021-W25 (https://rweekly.org/2021-W25.html) Supplemental Resources {epidemia} package documentation (https://imperialcollegelondon.github.io/epidemia/index.html) A COVID-19 Model for Local Authorities of the United Kingdom (https://rss.org.uk/RSS/media/File-library/News/2021/MishraScott.pdf) How epidemiology has shaped the COVID pandemic (https://www.nature.com/articles/d41586-021-00183-z)
Exploring wikidata with the {tidywikidatar} package, accessibility improvements in {knitr}, and the top 40 new CRAN packages for March. Episode Links This week's curator: Batool Almazrouq (@batool664) (https://twitter.com/batool664) What does Wikidata know about members of the European Parliament? (https://medium.com/european-data-journalism-network/a-new-r-package-for-exploring-the-wealth-of-information-stored-by-wikidata-fe85e82b6440) New in knitr: Improved accessibility with image alt text (https://blog.rstudio.com/2021/04/20/knitr-fig-alt/) March 2021: "Top 40" New CRAN Packages (https://rviews.rstudio.com/2021/04/22/march-2021-top-40-new-cran-packages/) Entire issue available at rweekly.org/2021-W17 (https://rweekly.org/2021-W17.html) Supplemental Resources https://news.yahoo.com/wikipedia-turns-20-aims-reach-035212015.html https://github.com/yihui/knitr/releases/tag/v1.32 https://cran.r-project.org/web/packages/pkglite/index.html https://flujoo.github.io/gm/
Jarno van Driel talks with Jason Barnard about structured data Jarno Van Driel is a technical & semantic SEO expert with more than 20 years of experience in the field, whose main focus is to provide website optimization services and guide organizations into this new era of the semantic web. Contrary to what most marketers believe, structured data is much more than a simple final touch to get a nice Brand SERP – there are serious advantages in making structured data the foundation for your organization. You will learn amazing tips and insights, as well as what the future looks like in terms of the implementation of Schema Markup in e-commerce sites. What you'll learn from Jarno van Driel 00:00 Jarno Van Driel with Jason Barnard00:40 Jarno Van Driel's Brand SERP02:35 Structured Data is MUCH more than just a “final sprinkle” to get a nice looking SERP07:02 Structured Data as the foundation for your organization08:36 Managing Data Layers in Google Tag Manager11:15 How to deal with the current Schema Markup limitations?13:58 Could Structured Data solve the language barrier issue on a search engine?15:54 Wikipedia, Wikidata and Dbpedia for entity disambiguation in different languages17:15 Is there always a need to disambiguate?17:53 What has changed in SEO copywriting since 201020:26 Does Google prefer that a page is about just one single entity (passage based indexing) ?22:33 A discussion about Google's ability to identify content's relevancy27:54 Using Schema Markup to avoid duplicate content on an e-commerce site at the category level32:22 New implementations of Schema Markup in e-commerce sites38:09 Google's Merchant Center feeds and the role of Schema Markup41:06 Is Google “forcing our hand” with structured data? Subscribe to the podcast Subscribe here >> This episode was recorded live on video April 6th 2021 Recorded live at Kalicube Tuesdays (Digital Marketing Livestream Event Series). Watch the video now >>
Jarno van Driel talks with Jason Barnard about structured data Jarno Van Driel is a technical & semantic SEO expert with more than 20 years of experience in the field, whose main focus is to provide website optimization services and guide organizations into this new era of the semantic web. Contrary to what most marketers believe, structured data is much more than a simple final touch to get a nice Brand SERP – there are serious advantages in making structured data the foundation for your organization. You will learn amazing tips and insights, as well as what the future looks like in terms of the implementation of Schema Markup in e-commerce sites. What you'll learn from Jarno van Driel 00:00 Jarno Van Driel with Jason Barnard00:40 Jarno Van Driel's Brand SERP02:35 Structured Data is MUCH more than just a “final sprinkle” to get a nice looking SERP07:02 Structured Data as the foundation for your organization08:36 Managing Data Layers in Google Tag Manager11:15 How to deal with the current Schema Markup limitations?13:58 Could Structured Data solve the language barrier issue on a search engine?15:54 Wikipedia, Wikidata and Dbpedia for entity disambiguation in different languages17:15 Is there always a need to disambiguate?17:53 What has changed in SEO copywriting since 201020:26 Does Google prefer that a page is about just one single entity (passage based indexing) ?22:33 A discussion about Google's ability to identify content's relevancy27:54 Using Schema Markup to avoid duplicate content on an e-commerce site at the category level32:22 New implementations of Schema Markup in e-commerce sites38:09 Google's Merchant Center feeds and the role of Schema Markup41:06 Is Google “forcing our hand” with structured data? Subscribe to the podcast Subscribe here >> This episode was recorded live on video April 6th 2021 Recorded live at Kalicube Tuesdays (Digital Marketing Livestream Event Series). Watch the video now >>