DBpedia version 2016-10
DBpedia is now producing monthly releases on the Databus: Monthly Dataset Releases
This release took us longer than expected. We had to deal with multiple issues and included new data. Most notable is the addition of the NIF annotation datasets for each language, recording the whole wiki text, its basic structure (sections, titles, paragraphs, etc.) and the included text links. We hope that researchers and developers, working on NLP-related tasks, will find this addition most rewarding. The DBpedia Open Text Extraction Challenge (next deadline Mon 17 July for SEMANTiCS 2017) was introduced to instigate new fact extraction based on these datasets.
DBpedia version 2016-04
DBpedia is now producing monthly releases on the Databus: Monthly Dataset Releases
The DBpedia Data Set (2015-04)
Note: DBpedia is now producing monthly releases on the Databus: Monthly Dataset Releases
DBpedia Events
The English Wikipedia has more than a hundred edits per minute. A large part of the knowledge in Wikipedia is not static, but frequently updated, e.g., new movies or sports and political events. This makes Wikipedia an extremely rich, crowdsourced information hub for events. We have created a dat
Lexicalizations
Lexicalization is defined by WordNet as "the process of making a word to express a concept" [1].
DBpedia version 2014
DBpedia is now producing monthly releases on the Databus: Monthly Dataset Releases
Data Set 3.8
The DBpedia data set uses a large multi-domain ontology which has been derived from Wikipedia. The English version of the DBpedia 3.8 data set describes 3.77 million "things" with 400 million "facts".
Wiktionary RDF extraction
Currently available languages:
English, German, French, Russian, Greek, Vietnamese
In the works: Greece, Vietnamese