Data Set 3.0

This page provides downloads of the DBpedia datasets. The DBpedia datasets are licensed under the terms of the GNU Free Documentation License. The downloads are provided in N-Triples and CSV format. All files are bz2 packed.
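As a minimal sketch of how such a file might be consumed, the following Python streams triples from a bz2-packed N-Triples file without unpacking it to disk first. The function names and the simplified regular expression are assumptions made for illustration; a complete N-Triples parser also handles blank nodes and escape sequences.

```python
import bz2
import re

# Simplified N-Triples pattern: <subject> <predicate> object .
# Assumption: subject and predicate are always URIs in angle brackets.
TRIPLE_RE = re.compile(r'^<([^>]*)>\s+<([^>]*)>\s+(.*?)\s*\.\s*$')


def parse_line(line):
    """Return (subject, predicate, object), or None for blank, comment or unparsable lines."""
    line = line.strip()
    if not line or line.startswith('#'):
        return None
    match = TRIPLE_RE.match(line)
    if match is None:
        return None
    return match.group(1), match.group(2), match.group(3)


def iter_triples(path):
    """Stream triples from a bz2-packed N-Triples file, one line at a time."""
    with bz2.open(path, mode='rt', encoding='utf-8', errors='replace') as handle:
        for line in handle:
            triple = parse_line(line)
            if triple is not None:
                yield triple
```

Because `iter_triples` is a generator, even the larger dumps can be processed with constant memory, for example by counting triples per predicate in a single pass.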


Dump Dates:

en: 20080103, de: 20080108, fr: 20080119, es: 20080119, it: 20080117, pl: 20080112, nl: 20080109, pt: 20080116, sv: 20080114, ja: 20071121, ru: 20080109, zh: 20080114, fi: 20080115, no: 20080115
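The dump dates above appear to follow a YYYYMMDD pattern. Assuming that reading is correct, a short sketch like the following turns the list into date objects, which makes it easy to see which language edition is based on the oldest dump. The function name is hypothetical.

```python
from datetime import datetime

DUMP_DATES = (
    "en: 20080103, de: 20080108, fr: 20080119, es: 20080119, it: 20080117, "
    "pl: 20080112, nl: 20080109, pt: 20080116, sv: 20080114, ja: 20071121, "
    "ru: 20080109, zh: 20080114, fi: 20080115, no: 20080115"
)


def parse_dump_dates(text):
    """Map each language code to a datetime.date, assuming YYYYMMDD stamps."""
    result = {}
    for entry in text.split(','):
        lang, stamp = entry.split(':')
        result[lang.strip()] = datetime.strptime(stamp.strip(), '%Y%m%d').date()
    return result


dates = parse_dump_dates(DUMP_DATES)
oldest = min(dates, key=dates.get)  # language code with the earliest dump date
```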

 

 




Older Versions: DBpedia 3.0RC, DBpedia 2.0

1 Core Datasets

Dataset en de fr es it pl nl pt sv ja ru zh fi no
Titles nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv
Short Abstracts nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv
Extended Abstracts nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv
Images nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
Links to Wikipedia Article nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv
Articles Categories nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
External Links nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
Infoboxes nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv
Properties nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv
Cleaned Wikipedia Category Class (CWCC) Hierarchy (experimental/buggy) nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
CWCC Hierarchy Instances nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
Homepages nt/csv nt/csv nt/csv -- -- -- -- -- -- -- -- -- -- --
Geographic Coordinates nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
Pagelinks nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv nt/csv
Persondata -- nt/csv -- -- -- -- -- -- -- -- -- -- -- --
YAGO Classes nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
YAGO Class Hierarchy nt -- -- -- -- -- -- -- -- -- -- -- -- --
Redirects nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
Disambiguation Links nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
WordNet Classes nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
Categories (Labels) nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --
Categories (Skos) nt/csv -- -- -- -- -- -- -- -- -- -- -- -- --

2 Extended Datasets

Dataset en de fr es it pl nl pt sv ja ru zh fi no
Links to Geonames nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to RDF Bookmashup nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to DBLP nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to Eurostat nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to CIA Factbook nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to Project Gutenberg nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to Musicbrainz nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to Quotationsbook nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to Revyu nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to US Census nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to flickr wrappr nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to WikiCompany nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to Cyc nt -- -- -- -- -- -- -- -- -- -- -- -- --
Links to Yago nt -- -- -- -- -- -- -- -- -- -- -- -- --

 

 


3 Dataset Descriptions


 

Titles

Titles of all Wikipedia Articles in the corresponding language

Short Abstracts

Short abstracts (at most 500 characters long) of Wikipedia articles.

Extended Abstracts

Additional, extended English abstracts (max. 3000 chars long).

Images

Thumbnail Links from Wikipedia Articles

Links to Wikipedia Article

Links to corresponding Articles in Wikipedia

Articles Categories

Links from concepts to categories using the SKOS vocabulary.

External Links

Links to external web pages about a concept.

Infoboxes

Information that has been extracted from Wikipedia infoboxes.

Properties

All properties / predicates used in infoboxes.

Cleaned Wikipedia Category Class (CWCC) Hierarchy

The aim of this class hierarchy is to stay close to the Wikipedia category system while avoiding some of its obstacles, e.g. cycles of categories, administrative categories, and categories which represent instances instead of classes. However, the current extraction script contains some bugs, and the data cleansing is still insufficient for the data set to be useful in applications. For this reason, the data set is not published in the SPARQL endpoint.
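To illustrate what a cycle of categories means in practice, the following sketch detects one cycle in a category graph with a depth-first search. It is not the DBpedia extraction script. The input format (a dict mapping each category to its parent categories) and the function name are assumptions for illustration.

```python
def find_cycle(broader):
    """Return one cycle as a list of categories, or None if the graph is acyclic.

    broader: dict mapping a category to an iterable of its parent categories
    (hypothetical input format).
    """
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on the current path, finished
    state = {}

    def visit(node, path):
        state[node] = GREY
        path.append(node)
        for parent in broader.get(node, ()):
            parent_state = state.get(parent, WHITE)
            if parent_state == GREY:
                # The parent is already on the current path: a cycle is closed.
                return path[path.index(parent):] + [parent]
            if parent_state == WHITE:
                found = visit(parent, path)
                if found:
                    return found
        path.pop()
        state[node] = BLACK
        return None

    for node in list(broader):
        if state.get(node, WHITE) == WHITE:
            cycle = visit(node, [])
            if cycle:
                return cycle
    return None
```

The recursive form keeps the sketch short; for a category graph of Wikipedia's size an iterative traversal would be needed to stay within Python's recursion limit.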

CWCC Hierarchy Instances

Wikipedia Articles connected with the CWCC Hierarchy

Homepages

Links to external webpages.

Geographic Coordinates

Geographic coordinates extracted from Wikipedia.

Pagelinks

Dataset containing internal links between DBpedia instances. The dataset was created from the internal pagelinks between Wikipedia articles. The dataset might be useful for structural analysis, data mining or for ranking DBpedia instances using PageRank or similar algorithms. Its data is NOT available at our SPARQL endpoint.
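As a rough sketch of the ranking use case mentioned above, the following function runs a basic power-iteration PageRank over (source, target) link pairs. The damping factor of 0.85, the fixed iteration count and the function name are assumptions chosen for illustration; the pairs would have to be extracted from the Pagelinks file first.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over an iterable of (source, target) pairs."""
    nodes = set()
    outgoing = {}
    for source, target in links:
        nodes.add(source)
        nodes.add(target)
        outgoing.setdefault(source, set()).add(target)

    count = len(nodes)
    if count == 0:
        return {}

    rank = {node: 1.0 / count for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes without outgoing links is spread evenly over all nodes.
        dangling = sum(rank[node] for node in nodes if node not in outgoing)
        base = (1.0 - damping) / count + damping * dangling / count
        new_rank = {node: base for node in nodes}
        for source, targets in outgoing.items():
            share = damping * rank[source] / len(targets)
            for target in targets:
                new_rank[target] += share
        rank = new_rank
    return rank
```

For the full English Pagelinks dump, an in-memory dictionary of sets may be too large; an array-based or on-disk representation would be the more realistic choice.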

Persondata

Information about 80,200 persons (date and place of birth etc.) extracted from the German Wikipedia, represented using the FOAF vocabulary.

YAGO Classes

Dataset containing rdf:type statements for all DBpedia instances using the YAGO classification.

YAGO Class Hierarchy

RDFS hierarchy of all YAGO classes.
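Assuming the hierarchy is expressed with rdfs:subClassOf statements, a sketch like the following collects all direct and indirect superclasses of a class from a stream of (subject, predicate, object) triples. The function names are hypothetical, and the triples are expected with the angle brackets already stripped.

```python
RDFS_SUBCLASS_OF = 'http://www.w3.org/2000/01/rdf-schema#subClassOf'


def build_parents(triples):
    """Index rdfs:subClassOf statements as class -> set of direct superclasses."""
    parents = {}
    for subject, predicate, obj in triples:
        if predicate == RDFS_SUBCLASS_OF:
            parents.setdefault(subject, set()).add(obj)
    return parents


def ancestors(cls, parents):
    """Return all direct and indirect superclasses of cls."""
    seen = set()
    stack = [cls]
    while stack:
        current = stack.pop()
        for parent in parents.get(current, ()):
            if parent not in seen:  # also guards against cycles
                seen.add(parent)
                stack.append(parent)
    return seen
```

Combined with the rdf:type statements from the YAGO Classes dataset, this allows an instance to be matched against any of its more general classes, not only the one it is typed with directly.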

Redirects

Dataset containing redirects between Articles in Wikipedia
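Redirects can chain (A redirects to B, which redirects to C), so a consumer usually wants the final target. The following sketch follows such chains in a mapping loaded from the Redirects file, with a guard against loops. The mapping format, the hop limit and the function name are assumptions for illustration.

```python
def resolve_redirect(resource, redirects, max_hops=10):
    """Follow redirects from resource until a non-redirecting resource is reached.

    redirects: dict mapping a redirecting resource to its target
    (hypothetical format, built from the Redirects dataset).
    """
    seen = {resource}
    current = resource
    for _ in range(max_hops):
        target = redirects.get(current)
        if target is None:
            return current  # no further redirect: this is the final target
        if target in seen:
            return current  # redirect loop: stop instead of cycling forever
        seen.add(target)
        current = target
    return current  # hop limit reached
```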

Disambiguation Links

Extraction from Disambiguation Templates

WordNet Classes

Classification links to W3C Wordnet.

Categories (Labels)

Labels for Categories.

Categories (Skos)

Information which concept is a category and how categories are related using the SKOS Vocabulary.

Links to Geonames

Links between geographic places in DBpedia and data about them in the Geonames database.

Links to RDF Bookmashup

Links between books in DBpedia and data about them provided by the RDF Book Mashup.

Links to DBLP

Links between computer scientists in DBpedia and their publications in the DBLP database.

Links to Eurostat

Links between countries and regions in DBpedia and data about them from Eurostat.

Links to CIA Factbook

Links between countries in DBpedia and data about them from CIA Factbook.

Links to Project Gutenberg

Links between writers in DBpedia and data about them from Project Gutenberg.

Links to Musicbrainz

Links between artists, albums and songs in DBpedia and data about them from Musicbrainz.

Links to Quotationsbook

Links between persons in DBpedia and data about them from Quotationsbook.

Links to Revyu

Links to Reviews about things in Revyu.

Links to US Census

Links between US cities and states in DBpedia and data about them from US Census.

Links to flickr wrappr

Links between DBpedia concepts and photo collections depicting them, generated by the flickr wrappr.

Links to WikiCompany

Links between companies in DBpedia and companies in WikiCompany.

Links to Cyc

Links between DBpedia and Cyc concepts.

 


[Note for Wiki Editors: The wiki code for this page is generated automatically. Please modify the files in http://dbpedia.svn.sourceforge.net/viewvc/dbpedia/related_apps/downloadpagecreator/ to make permanent changes.]

Publication Year: 2015