This page lists use cases for the DBpedia knowledge base together with references to ongoing work into these directions.
1. Revolutionize Wikipedia Search
Wikipedia currently only supports keyword-based search and does not allow more expressive queries like Give me all cities in New Jersey with more than 10,000 inhabitants or Give me all Italian musicians from the 18th century. This lowers the overall utility of Wikipedia.
One major application domain for the DBpedia data set is to enable sophisticated queries against Wikipedia, which could revolutionize the access to this valuable knowledge source.
Here are three prototypical search interfaces using different approaches to improve Wikipedia search:
- DBpedia Faceted Search – allows you to explore Wikipedia via a faceted browsing interface.
- The DBpedia Query Builder provides an easy-to-use interface for formulating queries against DBpedia as a set of matching patterns.
- The OpenLink iSPARQL visual query builder provides a graphical interface for formulating SPARQL queries against DBpedia.
2. Include DBpedia Data in Your Web Page
One nice thing about Wikipedia is that is kept up-to-date by a large community. Therefore, if you need a table on your Web page with, say, German cities, African musicians, Amiga computer games from the 90s, or whatever, you could generate this table with a SPARQL query against the DBpedia endpoint, and your table will stay up-to-date as Wikipedia changes.
Such tables can be implemented either by using Java Script on the client, or with a scripting language like PHP on the server. The second option also allows you to cache query results.
Besides tables, you can also use the text in your page to bring more data from DBpedia to your website. You can use a tool like DBpedia Spotlight to automatically create links to corresponding DBpedia resources.
3. Mobile and Geographic Applications
DBpedia contains information about geographic locations and is interlinked with other geo-related data sources such as Geonames, the US Census, Euro Stat, and the CIA world fact book. The data set contains geo-coordinates for many geographic locations which enable location-based SPARQL searches.
This makes DBpedia a valuable data source for location-based applications. DBpedia contains short abstracts about places which display nicely on mobile phones and PDAs.
As current generation mobiles and PDAs start having GPS receivers, it is possible to implement nice location-based information services for them based on DBpedia data and the DBpedia SPARQL endpoint.
One example of such an application is DBpedia Mobile which demonstrates how DBpedia can be used as entry point into the geo-spacial Semantic Web.
4. Document Classification, Annotation and Social Bookmarking
Terms from DBpedia can be used to annotate Web content. Compared to other subject hierarchies, like the classic ones used within libraries, DBpedia has the advantage that each subject is backed by a rich description including abstracts in 14 languages. Another advantage compared to static hierarchies is that DBpedia evolves as Wikipedia changes.
One application that uses DBpedia terms for the annotation of Web content is DBpedia Spotlight. DBpedia Spotlight automatically detects mentions of DBpedia terms in textual documents, allowing you to create links from those documents to DBpedia. Another example is Faviki, a social bookmarking tool which allows you to tag Web pages you want to remember with Wikipedia terms. This means that everybody uses the same names for tags from the world's largest collection of knowledge.
5. Multi-Domain Ontology
DBpedia is one of the largest multi-domain ontologies that currently exist. Compared to other ontologies which usually only cover specific domains, are created by relatively small groups of knowledge engineers, and are very cost intensive to keep up-to-date as domains change, DBpedia has the advantage that
- it covers many domains and contains lots of instances;
- it represents real community agreement; and
- it (automatically) evolves as Wikipedia changes.
The disadvantages of DBpedia compared to hand-crafted ontologies like SUMO, Open Cyc, or Wordnet are that
- DBpedia is less formally structured; and
- the data quality is lower and there are inconsistencies within DBpedia.
An approach to combine the advantages of both worlds is to interlink DBpedia with hand-crafted ontologies such as Open Cyc, SUMO, or Word Net, which enables applications to use the formal knowledge from these ontologies together with the instance data from DBpedia.
DBpedia already contains
- 42.000 RDF links into OpenCyc,
- 318,000 RDF links into WordNet,
- 3,36 million RDF links to Freebase,
- RDF links to UMBEL.
Interlinking DBpedia with these ontologies could further extend query capabilities. For instance, knowing that cities are geographic places and mountains are geographic places, a query engine could return cities as well as mountains for a query about geographic places.
6. Nucleus for the Web of Data
The Web is currently changing from a medium to publish and share text documents into a medium to publish and share data.
This transition is facilitated by ideas from the Semantic Web community and initiatives like the W3C Linking Open Data project.
As DBpedia covers many domains and provides data-backed identifiers for 3.64 million concepts, it is developing into an interlinking-hub for other data sets.
Please see Interlinking for an overview about the data sets that are currently interlinked with DBpedia and W3C Linking Open Data for a list of other data sets and ontologies that are published on the Web as Linked Data.
7. Support Wikipedia Authors with Editing Suggestions
One strength of Wikipedia, and a central factor for its growth, is that it does not restrict contributors. On the other hand, this leads to many inconsistencies within Wikipedia, especially between the 251 different language versions.
Extracting structured data from all 251 versions of DBpedia and interlinking this data with background knowledge from ontologies like Open Cyc, SUMO, or Word Net, allows different types of consistency checks. For instance:
- Population of Berlin within infoboxes in different languages.
- Classification of a person in the category German cities.
Therefore, one promising direction for future work is to use DBpedia knowledge for consistency checks and to develop tools that support Wikipedia authors by offering editing/correction suggestions.