This pages provides downloads of the DBpedia datasets. The DBpedia datasets are licensed under the terms of GNU Free Documentation License. The downloads are provided as N-Triples and in CSV format. All files are bz2 packed.
en: 20080103, de: 20080108, fr: 20080119, es: 20080119, it: 20080117, pl: 20080112, nl: 20080109, pt: 20080116, sv: 20080114, ja: 20071121, ru: 20080109, zh: 20080114, fi: 20080115, no: 20080115
Move the mouse on the download links to obtain additional information.
Older Versions: DBpedia 3.0RC, DBpedia 2.0
1 Core Datasets
2 Extended Datasets
3 Dataset Descriptions
Titles of all Wikipedia Articles in the corresponding language
Short Abstracts (max. 500 chars long) of Wikipedia Articles
Additional, extended English abstracts (max. 3000 chars long).
Thumbnail Links from Wikipedia Articles
Links to Wikipedia Article
Links to corresponding Articles in Wikipedia
Links from concepts to categories using the SKOS vocabulary.
Links to external web pages about a concept.
Information that has been extracted from Wikipedia infoboxes.
All properties / predicates used in infoboxes.
Cleanded Wikipedia Category Class (CWCC) Hierarchy
The aim of this class hierarchy is to be close to the Wikipedia category system, but without some of its obstacles, e.g. cycles of categories, administrative categories, categories which represent instances instead of classes etc. However, the current extraction script contains some bugs and data cleansing still insufficient to be useful in applications. For this reason, the data set is not published in the SPARQL endpoint.
CWCC Hierarchy Instances
Wikipedia Articles connected with the CWCC Hierarchy
Links to external webpages.
Geographic coordinates extracted from Wikipedia.
Dataset containing internal links between DBpedia instances. The dataset was created from the internal pagelinks between Wikipedia articles. The dataset might be useful for structural analysis, data mining or for ranking DBpedia instances using Page Rank or similar algorithms. It's data is NOT available at our Sparql-Endpoint
Information about 80,200 persons (date and place of birth etc.) extracted from the German Wikipedia, represented using the FOAF vocabulary.
Dataset containing rdf:type Statements for all DBpedia instances using YAGO classification algorithm.
YAGO Class Hierarchy
RDFS Hierarchy of all Yago Classes
Dataset containing redirects between Articles in Wikipedia
Extraction from Disambiguation Templates
Classification links to W3C Wordnet.
Labels for Categories.
Information which concept is a category and how categories are related using the SKOS Vocabulary.
Links to Geonames
Links between geographic places in DBpedia and data about them in the Geonames database
Links to RDF Bookmashup
Links between books in DBpedia and data about them provided by the RDF Book Mashup.
Links to DBLP
Links between computer scientists in DBpedia and their publications in the DBLP database.
Links to Eurostat
Links between countries and regions in DBpedia and data about them from Eurostat.
Links to CIA Factbook
Links between countries in DBpedia and data about them from CIA Factbook.
Links to Project Gutenberg
Links between writers in DBpedia and data about them from Project Gutenberg.
Links to Musicbrainz
Links between artists, albums and songs in DBpedia and data about them from Musicbrainz.
Links to Quotationsbook
Links between persons in DBpedia and data about them from Quotationsbook.
Links to Revyu
Links to Reviews about things in Revyu.
Links to US Census
Links between US cities and states in DBpedia and data about them from US Census.
Links to flickr wrappr
Links between DBpedia concepts and photo collections depicting them generated by the flikr wrappr.
Links to WikiCompany
Links between companies in DBpedia and companies in Wikicompany
Links to Cyc
Links between DBpedia and Cyc concepts. Details.
[Note for Wiki Editors: The wiki code for this page is generated automatically. Please modify the files in http://dbpedia.svn.sourceforge.net/viewvc/dbpedia/related_apps/downloadpagecreator/ to make permanent changes.]