DBpedia & DBpedia Spotlight @ GSoC 2016

For more general information about DBpedia & GSoC please read here.

If you are a candidate student

If you are a GSoC student who want to apply to our organization, here is a rough guideline on the steps to follow:

  1. Subscribe to the DBpedia-GSoC mailing list. All GSoC related questions (ideas, proposals, technical, etc) must go through this list. This it easier for you to search through the archives and for us to follow the discussion.
  2. Introduce yourself in the list. Tell us who you are, what you do and what projects you are interested in.
    • (Optional) Ask to be invited in the DBpedia #gsoc slack channel.
  3. Read carefully all the ideas we propose and see if any of these suits you. Also note that you can also submit your own idea.
  4. The ultimate goal in your proposal is to convince us that you understood how you will handle your project and to have a specific code-plan so, get as much information as possible for the ideas you like.
  5. Work on some of the Warm-Up task we suggest.
  6. Write your proposal. Use the template we sujject ans try to be as specific as possible (see step 4). Focus especially on the timeline
    • the template template can be found at the end of this page
  7. Queries on ideas should always be public, on the mailing list, slack or the ideas website (preferable on the ideas website). You application however can be private (if you choose to)
  8. For GSoC related queries you should look at the Google Summer of Code 2015 FAQs and the student guide they prepared.


As a general rule, we will treat the money Google is going to give us as if we would have to pay it ourselves. Therefore you should aim in your proposal to 

  1. convince all mentors that your proposal is worth receiving the money
  2. argue the benefit of your proposal for the DBpedia + DBpedia Spotlight project

Warm Up tasks

These are tasks that potential students might want to try in order to (1) get a feeling for the code, (2) learn initial skills, (3) get in contact with the community, and (4) earn an initial good standing.

We already prepared a few warm-up tasks for DBpedia & DBpedia Spotlight so, go ahead and show us your skills.

Our Ideas

This year we decided to put our ideas on a special Q&A website. We beleive this will make the idea discussion easier.

In order to make that work, all student questions about a project idea must be donw through the website. So, if you want to post a question, create an account and do that through the idea comment fields

We use the tag "gsoc" to mark gsoc ideas and the current project list is here: 

Our awesome mentors

Our mentors are all reliable members of the DBpedia and DBpedia Spotlight communities. Most of them have been active for more than one year now and have submitted several commits to our codebase or the mappings wiki. They all hold a stake in the DBpedia project, as their daily work relies in some form on the output produced by DBpedia and DBpedia Spotlight.
Below you can find the complete list, in alphabetical order.

  • Axel Ngonga (http://aksw.org/AxelNgonga) is the current head of AKSW, where his research focused on knowledge extraction, integration, fusion and access. Axel leads a number of research projects that can be found at http://aksw.org/projects.

  • Dimitris Kontokostas (jimkont, http://aksw.org/DimitrisKontokostas.html) Dimitris lives in Veria, Greece. He is a researcher at AKSW/KILT Group of Leipzig University and co-maintains the DBpedia project.

  • Nilesh Chakraborty (nilesh.chakraborty@informatik.uni-leipzig.de, nilesh@nileshc.com) is a researcher at the AKSW group, University of Leipzig, Germany, and is primarily working on entity linking, RDF knowledge fusion and application of distributed machine learning on RDF data.

  • Marco Fossati (marfos, fossati@spaziodati.eu, http://wed.fbk.eu/people/profile/fossati) lives in Trento, Italy. He is the Italian DBpedia chapter representative and has been a GSoC mentor for 2 years. He works as a data scientist at SpazioDati, in collaboration with the Web of Data research unit at Fondazione Bruno Kessler. Currently focused on Fact Extraction from Wikipedia Text, Crowdsourcing for Lexical Semantics, Taxonomy Learning from Wikipedia Categories.

  • Mariano Rico (mariano.rico, mariano.rico@upm.eshttp://sites.google.com/site/marianorico/welcome) is a postdoc researcher at the Ontology Engineering Group (OEG), within Universidad Politécnica de Madrid (UPM), Spain. He is creator and responsible for the Spanish chapter of DBpedia. His research is focused on enabling layman users with tools to provide an easy access to the linked data cloud.

  • Patrick Westphal (patrick.aksw, patrick.westphal@informatik.uni-leipzig.de, http://aksw.org/PatrickWestphal) is a PhD student at the AKSW Group, University of Leipzig, Germany and is combining Semantic Web technologies with machine learning techniques the life science domain.

  • Ricardo Usbeck (aksw.org/RicardoUsbeck.html) is a researcher at the AKSW Group of Leipzig University and his main interests are search engines, question answering, information retrieval and extraction. He wants to spark open source intelligent assistance systems using the wealth of DBpedia knowledge.

  • Robert Höhndorf (robert.hoehndorf@kaust.edu.sa, http://leechuck.de) is assistant professor (tenure track) at KAUST university, lives in Saudi Arabia, and is interested in semantic integration and biomedical ontologies. He wants to connect DBpedia to life science knowledge.

  • Sandro Athaide Coelho (sandro.coelho@ice.ufjf.br) is a full stack developer at AfferoLab, lives in Juiz de Fora, Brazil. He is a contributor to the DBpedia Spotlight Project

  • Ritesh Kumar Singh(ritesh.kumar.singh@informatik.uni-leipzig.de) is a researcher at the AKSW group, Leipzig university. His research interests include distributed and parallel computing, graph databases and semantic web.

Student Application Template

We have included in this page a template of what would be an awesome application to the Google Summer of Code 2014 with DBpedia. Feel free to use these questions as guiding questions for building your project proposal. While you are working on it, feel free to discuss your ideas with the DBpedia GSoC team on our mailing list. After your application is ready, you have to send it through the GSoC2014 website in order to be eligible.

Contact information

  • Your Name:
  • Email Address:
  • Instant messenger name and protocol (if any):
  • Any professional (e.g. github, linkedin) or personal (e.g. blog) URLs you would like to list:


  • Project Name: One of our suggestions?, or your own idea
  • Project Description: Your own description of the project and why it's interesting
  • If you would be willing and able to work on another of our suggested project ideas instead, which ones?
  • Please describe why you are interested in this specific project:
  • Please describe a tentative project architecture or an approach to it:
  • Please detail an expected project plan and timeline with milestones:
  • Please include in your plan how will you evaluate the performance of your contribution (in terms of time, or accuracy, or both), as well as which data sets you will use for that evaluation.

Technical skills

  • Please describe in a few lines your programming knowledge or experience (if any):
  • Please describe any other project related experience (if any):
  • Please provide one or more URLs to code samples that you have written in the past or to DBpedia Spotlight's SVN/issue tracker for which you have provided a fix in the form of a pull request. (Optional but highly appreciated if provided. Proof of code ownership is also required.):

Open Source

  • Please describe any previous Open Source development experience:
  • Why are you interested in Open Source development ?

Background & education

  • What school do you attend?
  • What is your specialty/major at the school?
  • How many years have you attended there?


  • What is your current research experience? Please point us to the best paper you have read (preferably in the context of your project proposal), and the best paper you have written, if any.
  • Would you be interested in co-authoring a conference paper with your mentors about your work in GSoC 2014?

Summer plans

  • What city/country will you be spending this summer in?
  • How much time do you expect to have for this project (est. per day and per week)?
  • Please list jobs, summer classes, and/or vacations that you'll need to work around:

GSoC Experience

  • Did you participate in a previous Summer of Code project? If so, please describe your project and experience:
  • Have you applied or do you plan to apply for any other 2014 Summer of Code projects? If so, which ones?
  • Why did you decide to apply for the Google Summer of Code ?
  • Why did you decide to apply for a DBpedia Spotlight project ?

And finally...

  • ... in 2 sentences, why should we take YOU ?


This application form template was built based on last year's proposal form, which was in turn based on XWiki's template for GSoC 2012.