κατασκευή ιστοσελίδων ρόδος

TECH - WEB DEVELOPMENT NEWS

Get the latest tech - web development news and analysis on industry around the world.

  • HOME
You are here: Home / INDUSTRY NEWS / Wikimedia wants to make it easier for you and AI developers to search through its data
άμυνα
.

Wikimedia wants to make it easier for you and AI developers to search through its data

01/10/2025

Wikipedia’s sister project Wikidata just got a new database that is easier for AI models to ingest.Oct 1, 2025, 8:30 AM UTCElissa Welle is a NYC-based AI reporter and is currently supported by the Tarbell Center for AI Journalism. She covers AI companies, policies, and products.The late English writer Douglas Adams is best known as the author of the 1979 book The Hitchhiker’s Guide to the Galaxy. But there is much more to Adams than what is written in his Wikipedia entry. Whether or not you need to know that his birth sign is Pisces or that libraries worldwide store his books under the same string of numbers — 13230702 — you can if you head to an overlooked corner of the Wikimedia Foundation called Wikidata.There, images, text, keywords, and other information related to Adams are stored both in a webpage and, for the robots among us, in formats designed for machines like JSON.Now, Wikidata is getting a new AI-friendly database that makes it easier for large language models to ingest the information. The database comes from the Wikipedia Embedding Project out of the German chapter of the Wikimedia Foundation, Wikimedia Deutschland, which oversees Wikidata. The Berlin-based team spent the past year using a large language model to turn the 19 million entries within Wikidata from clunkily structured data into vectors that capture the context and meaning around the Wikidata entry.In this vectorized format, information is best imagined like a graph with dots and interconnected lines — Adams would be connected to “human” as well as the titles of his books, Lydia Pintscher, Wikidata portfolio lead, told The Verge.While the front-end user experience will remain the same — no, Wikipedia is not becoming a chatbot, the project leaders say — the back end will become easier for AI developers to access when building, for example, their own chatbots using the data.The goal of the project is to level the playing field for AI developers outside the monied core of Big Tech, Pintscher said. Companies like OpenAI and Anthropic have the resources to vectorize Wikidata, just like Pintscher and her team did. It’s the smaller outfits that most benefit from the new access to curated data stored in the vaults of Wikidata. “Really, for me, it’s about giving them that edge up and to at least give them a chance, right?” Pintscher said.She points to Govdirectory as an example project that harnessed Wikidata’s vast data curated by volunteers for good. The platform allows users to find the social media handles and emails for public officials across the world.Most AI chatbots prioritize popular words and topics across the internet. In addition to giving Little Tech a leg up, the team hopes that easier access to Wikidata will result in AI systems that better reflect niche topics not widely represented across the internet, Pintscher said. This could be a better way to get information into ChatGPT, for instance, than “generating a ton of content and then waiting for the next time for ChatGPT to retrain, and maybe, or maybe not, taking into account what you contributed,” Pintscher said.In practice, the vectors will allow AI systems to better access the context around information in addition to the information itself, Philippe Saadé, Wikidata AI project manager, told The Verge.The team used a model from AI company Jina AI to turn Wikidata’s structured data, captured through September 18th, 2024, into vectors. IBM company DataStax currently provides the infrastructure to store the vector database to the project for free.The team is waiting for feedback from developers who use the database before updating it with information added over the last year. While the current database does not include entirely new information added in the last year, Saadé says small edits or tweaks to existing Wikidata will not diminish the database’s usefulness. “At the end of the day, the vector that we’re computing is like a general idea of an item, so if some small edit has been made on Wikidata, it’s not going to be super relevant,” he said.Most Popular
Source: theverge.com

Filed Under: INDUSTRY NEWS Tagged With: Source-1

3 ways I make NotebookLM my personal sidekick

NotebookLM is marketed as a research and notes companion, but honestly, I don’t quite use it the way Google had intended. The “official” pitch is that you upload documents, and it helps you pull insights and keep track of sources. This is undoubtedly useful for things like academic work, but it can also get boring if that’s all you use it for. Source: xda-developers.com … [Read More...]

I didn't know the Obsidian Reminder plugin existed, but it's exactly what I needed

I run my entire life out of my Obsidian vault, but until now, something has been missing. I've tried multiple to-do apps, but none of those really checked all the boxes — and then I stumbled across the Obsidian Reminder plugin. This small, simple plugin has become an essential part of my workflow. I already struggle to remember things — if it isn't written down, it doesn't exist — but with a … [Read More...]

Opera is bringing a huge wave of free AI tools to its browsers

A few days ago, Opera broke the news that my favorite AI browser was getting a general release. It was fantastic news for me, because when I gave their test build out and used it to remake the classic game of Snake, I had a blast experimenting with all the different things I could ask it to do. Source: xda-developers.com … [Read More...]

All the quotes from Borderlands 4 CEO about the game that missed the mark

For a lot of years now, Randy Pitchford, the CEO of Gearbox Entertainment, has been one of the more... interesting personalities in the gaming space. The man certainly has a flair for dramatics, an unfiltered way of talking, and a knack for making headlines for the strangest reasons. Source: xda-developers.com … [Read More...]

4 cheap PC parts I’ll never buy again

Building a PC is an exciting rite of passage for any tech enthusiast. There's an overwhelming breadth of options available at every budget. However, this low barrier to entry is a double-edged sword. It’s incredibly tempting for a novice or first-time PC builder to look at a compatibility list and pair an upper-mid tier CPU and gaming GPU with an affordable $80 motherboard. I've been tempted too, … [Read More...]

I replaced WSL with a full Linux VM, and here’s why it's actually better

Windows Subsystem for Linux, or WSL, has been an incredibly welcome addition to Windows for those who enjoy developing and tinkering with Linux distributions. The lightweight, easy-to-setup nature of these instances is perfect for anyone looking to run Linux tools without needing to leave their primary OS. Once I started to use non-native packages and stepped outside of basic command-line tasks, I … [Read More...]

RGB is secretly the worst bloatware on your PC

If your PC features a lot of RGB lighting, chances are you've got software to manage it. While proprietary RGB software will give you complete control over your lighting, most RGB control software consumes hardware resources to run on a consistent basis. This is especially true of most OEMs' proprietary RGB management software, like Asus Armoury Crate, MSI Center, or Razer Synapse. Source: … [Read More...]

This cute open-source notes app killed Google Keep for me

Google Keep has been my go-to for jotting down quick ideas and making lists. It’s minimal, dead easy to navigate, and also just a tap away on my phone. I love it for its simplicity which allows me to quickly dump half-formed notes. But over time, I started to hit a ceiling. My notes became a messy scroll of colorful squares without folders or real structure. I also wasn’t a fan of my notes being … [Read More...]

The best smart rings for 2025

It’s getting increasingly difficult to say smart rings are just a niche inside the broader world of wearable technology. The raft of celebrities who are seen wearing them, the NBA’s use of Oura rings as an early warning system against COVID-19 and, last year, Samsung’s entry into the market has made them far more prominent in the minds of mainstream consumers. We’ve tested plenty of smart … [Read More...]

The clock is ticking: Savings of up to 20% on group passes end tonight for TechCrunch Disrupt 2025

The Founder and Investor bundle sale for TechCrunch Disrupt 2025 is live — but only until tonight at 11:59 p.m. PT. This is your only chance this year to lock in group bundle savings on Founder Passes and an even bigger discount on group Investor Passes. After today, these deals are gone. Disrupt 2025 brings together over 10,000 founders, investors, and operators from around the world to tackle … [Read More...]

Tags

Source-1 Source-2 Source-3 Source-4 Source-5 Source-6 Source-7 Source-8 Source-9 Source-10 Source-12 Source-13 Source-15 Source-16

Tech Web Development News

This is a PERSONAL and PRIVATE WEBPAGE. Please leave this page. Contact me via email : admin@news-6.com about anything you would like to ask or problem.

Tech News

Disclaimer!
In every post is written below the original source of the post. Copyrights belong on their owners.

Web Development News

HOTELS – CRUISES – CARS – TRAVEL

Recent Posts

  • 3 ways I make NotebookLM my personal sidekick
  • I didn't know the Obsidian Reminder plugin existed, but it's exactly what I needed
  • Opera is bringing a huge wave of free AI tools to its browsers
  • All the quotes from Borderlands 4 CEO about the game that missed the mark
  • 4 cheap PC parts I’ll never buy again

Technology - Seo

Categories

  • INDUSTRY NEWS

World Industry News

Privacy & Cookies: This site uses cookies.
To find out more, as well as how to remove or block these, see here: Our Cookie Policy
TECH - WEB DEVELOPMENT NEWS @ COPYRIGHTS 2023