Data Set 3.6

Dataset category: 
Publication Year: 
2015

This pages provides downloads of the DBpedia datasets. The DBpedia datasets are licensed under the terms of the Creative Commons Attribution-ShareAlike License and the GNU Free Documentation License The downloads are provided as N-Triples ("nt") and N-Quads ("nq"); the N-Quads version contains additional provenance information for each statement. All files are bz2 packed.


Older Versions: DBpedia 3.5.1DBpedia 3.5DBpedia 3.4DBpedia 3.3DBpedia 3.2DBpedia 3.1DBpedia 3.0DBpedia 3.0RCDBpedia 2.0


See also the change log for recent changes and developments.


 

 


1 Wikipedia Input Files


The datasets were extracted from Wikipedia dumps generated in October/November 2010. Specific dates and times:

  en de fr pl it ja es nl hu sl hr el
Dump end  2010-10-11 2010-10-13 2010-10-17 2010-11-01 2010-10-20 2010-11-02 2010-10-23 2010-11-01 2010-10-27 2010-11-01 2010-10-30 2010-10-29


2 Core Datasets

Click on the dataset names to obtain additional information.

Mapping-based Datasets

These high-quality datasets are based on the mappings defined by the community on http://mappings.dbpedia.org/. All this data is in the /ontology/ namespace.

 

Dataset en de hu sl hr el
DBpedia Ontology preview ) owl -- -- -- -- --
Ontology Infobox Types preview ) nt nq nt nq nt nq nt nq nt nq nt nq
Ontology Infobox Properties preview ) nt nq nt nq nt nq nt nq nt nq nt nq
Ontology Infobox Properties (Specificpreview ) nt nq nt nq nt nq nt nq nt nq nt nq


Other Datasets

Dataset en de fr pl it ja es nl hu sl hr el
Titles preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Short Abstracts preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Extended Abstracts preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Images preview ) nt nq -- -- -- -- -- -- -- -- -- -- nt nq
Geographic Coordinates preview ) nt nq nt nq nt nq -- nt nq nt nq nt nq nt nq nt nq -- nt nq nt nq
Geo-Related preview ) nt -- -- -- -- -- -- -- -- -- -- --
Raw Infobox Properties preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Raw Infobox Property Definitions preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Homepages preview ) nt nq nt nq nt nq nt nq -- -- -- -- -- -- -- nt nq
Persondata preview ) nt nq nt nq -- -- -- -- -- -- -- -- -- --
PND preview ) nt nq nt nq -- -- -- -- -- -- -- -- -- --
Articles Categories preview ) nt nq -- -- -- -- -- -- -- -- -- -- --
Categories (Labelspreview ) nt nq -- -- -- -- -- -- -- -- -- -- --
Categories (Skospreview ) nt nq -- -- -- -- -- -- -- -- -- -- --
External Links preview ) nt nq -- -- -- -- -- -- -- -- -- -- --
Links to Wikipedia Article preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Wikipedia Pagelinks preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Redirects preview ) nt nq -- -- -- -- -- -- -- -- -- -- --
Disambiguation Links preview ) nt nq -- -- -- -- -- -- -- -- -- -- nt nq
Page IDs preview ) nt nq -- -- -- -- -- -- -- -- -- -- --
Revision IDs preview ) nt nq -- -- -- -- -- -- -- -- -- -- --

 

Download Server

The tables above list the DBpedia data for only a few languages. Please go to the DBpedia download server to choose from DBpedia data in 97 different languages, or download the complete archive of all languages:

 


3 Extended Datasets

Click on the dataset names to obtain additional information.

 

Dataset links
Links to RDF Bookmashup preview ) nt -
Links to DailyMed preview ) nt -
Links to DBLP preview ) nt -
Links to Diseasome preview ) nt -
Links to DrugBank preview ) nt -
Links to Eurostat preview ) nt -
Links to CIA Factbook preview ) nt -
Links to flickr wrappr preview ) nt -
Links to Freebase preview ) nt -
Links to Geonames preview ) nt -
Links to Project Gutenberg preview ) nt -
Links to MusicBrainz preview ) nt -
Links to New York Times preview ) nt -
Links to Cyc preview ) nt -
Links to Revyu preview ) nt -
Links to SIDER preview ) nt -
Links to TCMGeneDIT preview ) nt -
Links to US Census preview ) nt -
Links to WikiCompany preview ) nt -
Links to YAGO2 preview ) nt -
WordNet Classes preview ) nt -

 


4 Dataset Descriptions


 

DBpedia Ontology

The DBpedia ontology in OWL. See our JWS paper for more details.

Ontology Infobox Types

Contains triples of the form $object rdf:type $class from the ontology-based extraction.

Ontology Infobox Properties

High-quality data extracted from Infoboxes using the strict ontology-based extraction. The predicates in this dataset are in the /ontology/ namespace.

Note that this data is of much higher quality than the Raw Infobox Properties in the /property/ namespace. For example, there are three different raw Wikipedia infobox properties for the birth date of a person. In the the /ontology/ namespace, they are all mapped onto one relation http://dbpedia.org/ontology/birthDate. It is a strong point of DBpedia to unify these relations.

 

Ontology Infobox Properties (Specific)

Infoboxes Data from the loose ontology-based extraction.

Titles

Titles of all Wikipedia Articles in the corresponding language

Short Abstracts

Short Abstracts (max. 500 chars long) of Wikipedia Articles

Extended Abstracts

Additional, extended English abstracts.

Images

Thumbnail Links from Wikipedia Articles

Geographic Coordinates

Geographic coordinates extracted from Wikipedia.

Geo-Related

Relates articles to resources of countries, whose label appear in the name of the articles' categories.

Raw Infobox Properties

Information that has been extracted from Wikipedia infoboxes. Note that this data is in the less clean /property/ namespace. The Ontology Infobox Properties (/ontology/ namespace) should always be preferred over this data.

Raw Infobox Property Definitions

All properties / predicates used in infoboxes.

Homepages

Links to external webpages.

Persondata

Information about persons (date and place of birth etc.) extracted from the English and German Wikipedia, represented using the FOAF vocabulary.

PND 

Dataset containing PND (Personennamendatei) identifiers.

Articles Categories

Links from concepts to categories using the SKOS vocabulary.

Categories (Labels)

Labels for Categories.

Categories (Skos)

Information which concept is a category and how categories are related using the SKOS Vocabulary.

External Links

Links to external web pages about a concept.

Links to Wikipedia Article

Links to corresponding Articles in Wikipedia

Wikipedia Pagelinks

Dataset containing internal links between DBpedia instances. The dataset was created from the internal pagelinks between Wikipedia articles. The dataset might be useful for structural analysis, data mining or for ranking DBpedia instances using Page Rank or similar algorithms.

Redirects

Dataset containing redirects between Articles in Wikipedia

Disambiguation Links

Extraction from Disambiguation Templates

Page IDs 

Dataset containing the Wikipedia Page IDs.

Revision IDs 

Dataset containing the Wikipedia Revision IDs.

Links to RDF Bookmashup

Links between books in DBpedia and data about them provided by the RDF Book Mashup. Provided by Georgi Kobilarov. Update mechanism: unclear/copy over from previous release.

Links to DailyMed

Links between DBpedia and DailyMed. Update mechanism: unclear/copy over from previous release.

Links to DBLP

Links between computer scientists in DBpedia and their publications in the DBLP database. Links were created manually. Update mechanism: Copy over from previous release.

Links to Diseasome

Links between DBpedia and Diseasome. Update mechanism: unclear/copy over from previous release.

Links to DrugBank

Links between DBpedia and DrugBank. Update mechanism: unclear/copy over from previous release.

Links to Eurostat

Links between countries and regions in DBpedia and data about them from Eurostat. Links were created manually. Update mechanism: Copy over from previous release.

Links to CIA Factbook

Links between countries in DBpedia and data about them from CIA Factbook. Links were created manually. Update mechanism: Copy over from previous release.

Links to flickr wrappr

Links between DBpedia concepts and photo collections depicting them generated by the flikr wrappr. Update mechanism: script in SVN.

Links to Freebase

Links between DBpedia and Freebase (MIDs). Update mechanism: script in SVN.

Links to Geonames

Links between geographic places in DBpedia and data about them in the Geonames database. Provided by the Geonames people. Update mechanism: unclear/copy over from previous release.

Links to Project Gutenberg

Links between writers in DBpedia and data about them from Project Gutenberg. Update mechanism: script in SVN. Since this requires manual changes of files and a D2R installation, it will be copied over from the previous DBpedia version and updated between releases by the maintainers (Piet Hensel and Georgi Kobilarov).

Links to MusicBrainz

Links between artists, albums and songs in DBpedia and data about them from MusicBrainz. Created manually using the result of SPARQL queries. Update mechanism: unclear/copy over from previous release.

Links to New York Times

Links between New York Times subject headings and DBpedia concepts.

Links to Cyc

Links between DBpedia and Cyc concepts. Details. Update mechanism: awk script.

Links to Revyu

Links to Reviews about things in Revyu. Created manually by Tom Heath. Update mechanism: unclear/copy over from previous release.

Links to SIDER

Links between DBpedia and SIDER. Update mechanism: unclear/copy over from previous release.

Links to TCMGeneDIT

Links between DBpedia and TCMGeneDIT. Update mechanism: unclear/copy over from previous release.

Links to US Census

Links between US cities and states in DBpedia and data about them from US Census. Update mechanism: unclear/copy over from previous release.

Links to WikiCompany

Links between companies in DBpedia and companies in Wikicompany. Update mechanism: script in SVN.

Links to YAGO2

Dataset containing links between DBpedia and YAGO, YAGO type information for DBpedia resources and the YAGO class hierarchy. Currently maintained by Johannes Hoffart.

WordNet Classes

Classification links to RDF representations of WordNet classes. Update mechanism: unclear/copy over from previous release.