DBpedia 3.0 Downloads


This pages provides downloads of the DBpedia datasets. The DBpedia datasets are licensed under the terms of GNU Free Documentation License. The downloads are provided as N-Triples and in CSV format. All files are bz2 packed.


Dump Dates:


en: 20080103, de: 20080108, fr: 20080119, es: 20080119, it: 20080117, pl: 20080112, nl: 20080109, pt: 20080116, sv: 20080114, ja: 20071121, ru: 20080109, zh: 20080114, fi: 20080115, no: 20080115


Contents


Move the mouse on the download links to obtain additional information.


Older Versions: DBpedia 3.0RC, DBpedia 2.0

1. Core Datasets

Datasetendefresitplnlptsvjaruzhfino
Titles ( preview ) nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv
Short Abstracts ( preview ) nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv
Extended Abstracts ( preview ) nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv
Images ( preview ) nt csv --------------------------
Links to Wikipedia Article ( preview ) nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv
Articles Categories ( preview ) nt csv --------------------------
External Links ( preview ) nt csv --------------------------
Infoboxes ( preview ) nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv
Properties ( preview ) nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv
Cleanded Wikipedia Category Class (CWCC) Hierarchy ( preview ) (experimental/buggy) nt csv --------------------------
CWCC Hierarchy Instances ( preview ) nt csv --------------------------
Homepages ( preview ) nt csv nt csv nt csv ----------------------
Geographic Coordinates ( preview ) nt csv --------------------------
Pagelinks ( preview ) nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv nt csv
Persondata ( preview )-- nt csv ------------------------
YAGO Classes ( preview ) nt csv --------------------------
YAGO Class Hierarchy ( preview ) nt ---------------------------
Redirects ( preview ) nt csv --------------------------
Disambiugation Links ( preview ) nt csv --------------------------
WordNet Classes ( preview ) nt csv --------------------------
Categories (Labels) ( preview ) nt csv --------------------------
Categories (Skos) ( preview ) nt csv --------------------------

2. Extended Datasets

Datasetendefresitplnlptsvjaruzhfino
Links to Geonames ( preview ) nt ---------------------------
Links to RDF Bookmashup ( preview ) nt ---------------------------
Links to DBLP ( preview ) nt ---------------------------
Links to Eurostat ( preview ) nt ---------------------------
Links to CIA Factbook ( preview ) nt ---------------------------
Links to Project Gutenberg ( preview ) nt ---------------------------
Links to Musicbrainz ( preview ) nt ---------------------------
Links to Quotationsbook ( preview ) nt ---------------------------
Links to Revyu ( preview ) nt ---------------------------
Links to US Census ( preview ) nt ---------------------------
Links to flickr wrappr ( preview ) nt ---------------------------
Links to WikiCompany ( preview ) nt ---------------------------
Links to Cyc ( preview ) nt ---------------------------
Links to Yago ( ) nt ---------------------------


3. Dataset Descriptions


Titles

Titles of all Wikipedia Articles in the corresponding language

Short Abstracts

Short Abstracts (max. 500 chars long) of Wikipedia Articles

Extended Abstracts

Additional, extended English abstracts (max. 3000 chars long).

Images

Thumbnail Links from Wikipedia Articles

Links to Wikipedia Article

Links to corresponding Articles in Wikipedia

Articles Categories

Links from concepts to categories using the SKOS vocabulary.

External Links

Links to external web pages about a concept.

Infoboxes

Information that has been extracted from Wikipedia infoboxes.

Properties

All properties / predicates used in infoboxes.

Cleanded Wikipedia Category Class (CWCC) Hierarchy

The aim of this class hierarchy is to be close to the Wikipedia category system, but without some of its obstacles, e.g. cycles of categories, administrative categories, categories which represent instances instead of classes etc. However, the current extraction script contains some bugs and data cleansing still insufficient to be useful in applications. For this reason, the data set is not published in the SPARQL endpoint.

CWCC Hierarchy Instances

Wikipedia Articles connected with the CWCC Hierarchy

Homepages

Links to external webpages.

Geographic Coordinates

Geographic coordinates extracted from Wikipedia.

Pagelinks

Dataset containing internal links between DBpedia instances. The dataset was created from the internal pagelinks between Wikipedia articles. The dataset might be useful for structural analysis, data mining or for ranking DBpedia instances using Page Rank or similar algorithms. It's data is NOT available at our Sparql-Endpoint

Persondata

Information about 80,200 persons (date and place of birth etc.) extracted from the German Wikipedia, represented using the FOAF vocabulary.

YAGO Classes

Dataset containing rdf:type Statements for all DBpedia instances using YAGO classification algorithm.

YAGO Class Hierarchy

RDFS Hierarchy of all Yago Classes

Redirects

Dataset containing redirects between Articles in Wikipedia

Disambiugation Links

Extraction from Disambiguation Templates

Word Net Classes

Classification links to W3C Wordnet.

Categories (Labels)

Labels for Categories.

Categories (Skos)

Information which concept is a category and how categories are related using the SKOS Vocabulary.

Links to Geonames

Links between geographic places in DBpedia and data about them in the Geonames database

Links to RDF Bookmashup

Links between books in DBpedia and data about them provided by the RDF Book Mashup.

Links to DBLP

Links between computer scientists in DBpedia and their publications in the DBLP database.

Links to Eurostat

Links between countries and regions in DBpedia and data about them from Eurostat.

Links to CIA Factbook

Links between countries in DBpedia and data about them from CIA Factbook.

Links to Project Gutenberg

Links between writers in DBpedia and data about them from Project Gutenberg.

Links to Musicbrainz

Links between artists, albums and songs in DBpedia and data about them from Musicbrainz.

Links to Quotationsbook

Links between persons in DBpedia and data about them from Quotationsbook.

Links to Revyu

Links to Reviews about things in Revyu.

Links to US Census

Links between US cities and states in DBpedia and data about them from US Census.

Links to flickr wrappr

Links between DBpedia concepts and photo collections depicting them generated by the flikr wrappr.

Links to Wiki Company

Links between companies in DBpedia and companies in Wikicompany

Links to Cyc

Links between DBpedia and Cyc concepts. Details.


[Note for Wiki Editors: The wiki code for this page is generated automatically. Please modify the files in  http://dbpedia.svn.sourceforge.net/viewvc/dbpedia/related_apps/downloadpagecreator/ to make permanent changes.]



 
There are no files on this page. [Display files/form]
There is no comment on this page. [Display comments/form]

Information

Last Modification: 2008-02-15 13:46:02 by Jens Lehmann