DBpedia 3.7 Downloads


This pages provides downloads of the DBpedia datasets. The DBpedia datasets are licensed under the terms of the Creative Commons Attribution-ShareAlike License and the GNU Free Documentation License. The downloads are provided as N-Triples and N-Quads, where the N-Quads version contains additional provenance information for each statement. All files are bz2 packed.


Older Versions: DBpedia 3.6, DBpedia 3.5.1, DBpedia 3.5, DBpedia 3.4, DBpedia 3.3, DBpedia 3.2, DBpedia 3.1, DBpedia 3.0, DBpedia 3.0RC, DBpedia 2.0


See also the change log for recent changes and developments.



1. Wikipedia Input Files


The datasets were extracted from Wikipedia dumps generated in late July 2011 (see also all specific dates and times).


2. Core Datasets

NOTE: You can find DBpedia dumps in 97 languages at our DBpedia download server.


Click on the dataset names to obtain additional information.

Datasetencadeelesfrgahrhuitnlplptrusltr
DBpedia Ontology ( preview ) owl ----------------------
Ontology Infobox Types ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Ontology Infobox Properties ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Ontology Infobox Properties (Specific) ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq -- nt nq nt nq -- nt nq nt nq

Datasetencadeelesfrgahrhuitnlplptrusltr
Titles ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Short Abstracts ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Extended Abstracts ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Images ( preview ) nt nq -- nt nq nt nq nt nq -------------- nt nq nt nq ----
Geographic Coordinates ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq -- nt nq
Raw Infobox Properties ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Raw Infobox Property Definitions ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Homepages ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq -------- nt nq nt nq nt nq ----
Persondata ( preview ) nt nq -- nt nq --------------------------
PND ( preview ) nt nq -- nt nq --------------------------


Datasetencadeelesfrgahrhuitnlplptrusltr
Articles Categories ( preview ) nt nq ------------------------------
Categories (Labels) ( preview ) nt nq ------------------------------
Categories (Skos) ( preview ) nt nq ------------------------------
External Links ( preview ) nt nq ------------------------------
Links to Wikipedia Article ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Wikipedia Pagelinks ( preview ) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Redirects ( preview ) nt nq ------------------------------
Disambiguation Links ( preview ) nt nq nt nq nt nq nt nq nt nq -------- nt nq -- nt nq nt nq nt nq ----
Page IDs ( preview ) nt nq ------------------------------
Revision IDs ( preview ) nt nq ------------------------------


3. i18n Datasets

These datasets contain all articles of the respective Wikipedia, including the ones that do not have an equivalent English article. more...


CAUTION: the URIs in these dumps have language-specific namespaces (e.g. http://el.dbpedia.org/...).


NOTE: You can find DBpedia dumps in 97 languages at our DBpedia download server.


Click on the dataset names to obtain additional information.

Datasetcadeelesfrgahrhuitnlplptrusltr
Ontology Infobox Types nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Ontology Infobox Properties nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Ontology Infobox Properties (Specific) nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq -- nt nq nt nq -- nt nq nt nq

Datasetcadeelesfrgahrhuitnlplptrusltr
Titles nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Short Abstracts nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Extended Abstracts nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Images -- nt nq nt nq nt nq -------------- nt nq nt nq ----
Geographic Coordinates nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq -- nt nq
Raw Infobox Properties nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Raw Infobox Property Definitions nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Homepages nt nq nt nq nt nq nt nq nt nq nt nq -------- nt nq nt nq nt nq ----
Persondata -- nt nq --------------------------
PND -- nt nq --------------------------

Datasetcadeelesfrgahrhuitnlplptrusltr
Links to Wikipedia Article nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Wikipedia Pagelinks nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Disambiguation Links nt nq nt nq nt nq nt nq -------- nt nq -- nt nq nt nq nt nq ----
Inter-language links nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq



Datasetcadeelesfrgahrhuitnlplptrusltr
Links to Wikipedia Article nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Wikipedia Pagelinks nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq
Disambiguation Links nt nq nt nq nt nq nt nq -------- nt nq -- nt nq nt nq nt nq ----
Inter-language links nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq nt nq


4. External Links

NOTE: You can find DBpedia dumps in 97 languages at our DBpedia download server.


Click on the dataset names to obtain additional information.

Datasetlinks
Links to RDF Bookmashup ( preview ) nt -
Links to Bricklink ( preview ) nt -
Links to DailyMed ( preview ) nt -
Links to DBLP ( preview ) nt -
Links to Diseasome ( preview ) nt -
Links to DrugBank ( preview ) nt -
Links to EUNIS ( preview ) nt -
Links to Eurostat ( preview ) nt -
Links to CIA Factbook ( preview ) nt -
Links to flickr wrappr ( preview ) nt -
Links to Freebase ( preview ) nt -
Links to GADM ( preview ) nt -
Links to Geonames ( preview ) nt -
Links to GeoSpecies ( preview ) nt -
Links to Project Gutenberg ( preview ) nt -
Links to Italian Public Schools ( preview ) nt -
Links to LinkedMDB ( preview ) nt -
Links to MusicBrainz ( preview ) nt -
Links to New York Times ( preview ) nt -
Links to Cyc ( preview ) nt -
Links to Revyu ( preview ) nt -
Links to SIDER ( preview ) nt -
Links to TCMGeneDIT ( preview ) nt -
Links to Umbel ( preview ) nt -
Links to US Census ( preview ) nt -
Links to WikiCompany ( preview ) nt -
Links to WordNet ( preview ) nt -
Links to YAGO2 ( preview ) nt -


5. Dataset Descriptions


Unknown action "a"

DBpedia Ontology

The DBpedia ontology in OWL. See PDFour JWS paper for more details.
Unknown action "a"

Ontology Infobox Types

Contains triples of the form $object rdf:type $class from the ontology-based extraction.
Unknown action "a"

Ontology Infobox Properties

High-quality data extracted from Infoboxes using the strict ontology-based extraction. The predicates in this dataset are in the /ontology/ namespace.

Note that this data is of much higher quality than the Raw Infobox Properties in the /property/ namespace. For example, there are three different raw Wikipedia infobox properties for the birth date of a person. In the the /ontology/ namespace, they are all mapped onto one relation http://dbpedia.org/ontology/birthDate. It is a strong point of DBpedia to unify these relations.

Unknown action "a"

Ontology Infobox Properties (Specific)

Infoboxes Data from the loose ontology-based extraction.
Unknown action "a"

Titles

Titles of all Wikipedia Articles in the corresponding language
Unknown action "a"

Short Abstracts

Short Abstracts (max. 500 chars long) of Wikipedia Articles
Unknown action "a"

Extended Abstracts

Additional, extended English abstracts.
Unknown action "a"

Images

Thumbnail Links from Wikipedia Articles
Unknown action "a"

Geographic Coordinates

Geographic coordinates extracted from Wikipedia.
Unknown action "a"

Raw Infobox Properties

Information that has been extracted from Wikipedia infoboxes. Note that this data is in the less clean /property/ namespace. The Ontology Infobox Properties (/ontology/ namespace) should always be preferred over this data.
Unknown action "a"

Raw Infobox Property Definitions

All properties / predicates used in infoboxes.
Unknown action "a"

Homepages

Links to external webpages.
Unknown action "a"

Persondata

Information about persons (date and place of birth etc.) extracted from the English and German Wikipedia, represented using the FOAF vocabulary.
Unknown action "a"

PND 

Dataset containing PND (Personennamendatei) identifiers.
Unknown action "a"

Articles Categories

Links from concepts to categories using the SKOS vocabulary.
Unknown action "a"

Categories (Labels)

Labels for Categories.
Unknown action "a"

Categories (Skos)

Information which concept is a category and how categories are related using the SKOS Vocabulary.
Unknown action "a"

External Links

Links to external web pages about a concept.
Unknown action "a"

Links to Wikipedia Article

Links to corresponding Articles in Wikipedia
Unknown action "a"

Wikipedia Pagelinks

Dataset containing internal links between DBpedia instances. The dataset was created from the internal pagelinks between Wikipedia articles. The dataset might be useful for structural analysis, data mining or for ranking DBpedia instances using Page Rank or similar algorithms.
Unknown action "a"

Redirects

Dataset containing redirects between Articles in Wikipedia
Unknown action "a"

Disambiguation Links

Extraction from Disambiguation Templates
Unknown action "a"

Page IDs 

Dataset containing the Wikipedia Page IDs.
Unknown action "a"

Revision IDs 

Dataset containing the Wikipedia Revision IDs.
Unknown action "a"

Inter-language links

Dataset containing links between the different DBpedia URIs for various languages.
Unknown action "a"

Links to RDF Bookmashup

Links between books in DBpedia and data about them provided by the RDF Book Mashup. Provided by Georgi Kobilarov. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to Bricklink

Links between DBpedia and Bricklink.
Unknown action "a"

Links to DailyMed

Links between DBpedia and DailyMed. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to DBLP

Links between computer scientists in DBpedia and their publications in the DBLP database. Links were created manually. Update mechanism: Copy over from previous release.
Unknown action "a"

Links to Diseasome

Links between DBpedia and Diseasome. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to DrugBank

Links between DBpedia and DrugBank. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to EUnis

TODO
Unknown action "a"

Links to Eurostat

Links between countries and regions in DBpedia and data about them from Eurostat. Links were created manually. Update mechanism: Copy over from previous release.
Unknown action "a"

Links to CIA Factbook

Links between countries in DBpedia and data about them from CIA Factbook. Links were created manually. Update mechanism: Copy over from previous release.
Unknown action "a"

Links to flickr wrappr

Links between DBpedia concepts and photo collections depicting them generated by the flikr wrappr. Update mechanism: script in Mercurial.
Unknown action "a"

Links to Freebase

Links between DBpedia and Freebase (MIDs). Update mechanism: script in Mercurial.
Unknown action "a"

Links to GADM

Links between places in DBpedia and GADM.
Unknown action "a"

Links to Geonames

Links between geographic places in DBpedia and data about them in the Geonames database. Provided by the Geonames people. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to GeoSpecies

//Links between species in DBpedia and GeoSpecies.
Unknown action "a"

Links to Project Gutenberg

Links between writers in DBpedia and data about them from Project Gutenberg. Update mechanism: script in Mercurial. Since this requires manual changes of files and a D2R installation, it will be copied over from the previous DBpedia version and updated between releases by the maintainers (Piet Hensel and Georgi Kobilarov).
Unknown action "a"

Links to Italian Public Schools

Links between DBpedia and Italian Public Schools.
Unknown action "a"

Links to LinkedMDB

TODO
Unknown action "a"

Links to MusicBrainz

Links between artists, albums and songs in DBpedia and data about them from MusicBrainz. Created manually using the result of SPARQL queries. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to New York Times

Links between New York Times subject headings and DBpedia concepts.
Unknown action "a"

Links to Cyc

Links between DBpedia and Cyc concepts. Details. Update mechanism: awk script.
Unknown action "a"

Links to Revyu

Links to Reviews about things in Revyu. Created manually by Tom Heath. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to SIDER

Links between DBpedia and SIDER. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to TCMGeneDIT

Links between DBpedia and TCMGeneDIT. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to Umbel

TODO
Unknown action "a"

Links to US Census

Links between US cities and states in DBpedia and data about them from US Census. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to WikiCompany

Links between companies in DBpedia and companies in Wikicompany. Update mechanism: script in Mercurial.
Unknown action "a"

Links to WordNet

Classification links to RDF representations of WordNet classes. Update mechanism: unclear/copy over from previous release.
Unknown action "a"

Links to YAGO2

Dataset containing links between DBpedia and YAGO, YAGO type information for DBpedia resources and the YAGO class hierarchy. Currently maintained by Johannes Hoffart.


6. NLP Datasets


DBpedia also includes a number of NLP Datasets — datasets specifically targeted at supporting Computational Linguistics and Natural Language Processing (NLP) tasks. Among those, we highlight the Lexicalization Dataset, Topic Signatures, Thematic Concepts and Grammatical Genders.