Thursday, July 25, 2019 - 1:33pm
How is data edited in Wikipedia/Wikidata? Where does it come from? And how can we synchronize it globally?
The GlobalFactSync (GFS) Project — funded by the Wikimedia Foundation — started in June 2019 and has two goals:
- Answer the above-mentioned three questions.
- Build an information system to synchronize facts between all Wikipedia language-editions and Wikidata.
Now we are seven weeks into the project (10+ more months to go) and we are releasing our first prototypes to gather feedback.
How – Synchronization vs Consensus
We follow an absolute “Human(s)-in-the-loop” approach when we talk about synchronization. The final decision whether to synchronize a value or not should rest with a human editor who understands consensus and the implications. There will be no automatic imports. Our focus is to drastically reduce the time to research all references for individual facts.
A trivial example to illustrate our reasoning is the release date of the single “Boys Don’t Cry” (March 16th, 1989) in the English, Japanese, and French Wikipedia, Wikidata and finally in the external open database MusicBrainz. A human editor might need 15-30 minutes finding and opening all different sources, while our current prototype can spot differences and display them in 5 seconds.
We already had our first successful edit where a Wikipedia editor fixed the discrepancy with our prototype: “I’ve updated Wikidata so that all five sources are in agreement.” We are now working on the following tasks:
- Scaling the system to all infoboxes, Wikidata and selected external databases (see below on the difficulties there)
- Making the system:
- “live” without stale information
- “reliable” with less technical errors when extracting and indexing data
- “better referenced” by not only synchronizing facts but also references
Contributions and Feedback
To ensure that GlobalFactSync will serve and help the Wikiverse we encourage everyone to try our data and micro-services and leave us some feedback, either on our Meta-Wiki page or via email. In the following 10+ months, we intend to improve and build upon these initial results. At the same time, these microservices are available to every developer to exploit it and hack useful applications. The most promising contributions will be rewarded and receive the book “Engineering Agile Big-Data Systems”. Please post feedback or any tool or GUI here. In case you need changes to be made to the API, please let us know, too.
For the ambitious future developers among you, we have some budget left that we will dedicate to an internship. In order to apply, just mention it in your feedback post.
Data, APIs & Microservices (Technical prototypes)
Data Processing and Infobox Extraction
For GlobalFactSync we use data from Wikipedia infoboxes of different languages, as well as Wikidata, and DBpedia and fuse them to receive one big, consolidated dataset – a PreFusion dataset (in JSON-LD). More information on the fusion process, which is the engine behind GFS, can be found in the FlexiFusion paper. One of our next steps is to integrate MusicBrainz into this process as an external dataset. We hope to implement even more such external datasets to increase the amount of available information and references.
We deployed a set of microservices to show the current state of our toolchain.
- [Initial User Interface] The GlobalFactSync UI prototype (available at http://global.dbpedia.org) shows all extracted information available for one entity for different sources. It can be used to analyze the factual consensus between different Wikipedia articles for the same thing. Example: Look at the variety of population counts for Grimma.
- [PreFusion JSON API] While the UI allows simple, fast and easy browsing for one entity at a time, we also provide raw access to the underlying data (PreFusion dump). The query UI (http://global.dbpedia.org:8990 (user: read, pw: gfs) can be utilized to run simple analytical queries. Thus, we can determine the number of locations having at least one population value (1,194,007) but can also focus on examples with data quality problems (e.g. one of the 4,268 locations with more than 10 population values). Moreover, documentation about the PreFusion dataset and the download link for the data are available on the Databus website.
- [Reference Data Download] We ran the Reference Extraction Service over 10 Wikipedia languages. Download dumps here.
- [Reference Extraction Service] Good references are crucial for an import of facts from Wikipedia to Wikidata. We are currently working with colleagues from Poznań University of Economics and Business on reference extraction for facts from Wikipedia. A current development reference extraction microservice shows all references and the location where they were spotted in the Infobox – ad hoc – for a given article: http://dbpedia.informatik.uni-leipzig.de:8111/infobox/references?article=https://en.wikipedia.org/wiki/Facebook&format=json ( ‘&format=tsv’ also available)
- [Infobox Extraction Service] A similar ad hoc extraction of factual information from infoboxes and other Wikipedia article information is available here. This microservice displays information which can be extracted with the help of DBpedia mappings from an infobox e.g. from the German Facebook Wikipedia article: http://dbpedia.informatik.uni-leipzig.de:9998/server/extraction/en/extract?title=Facebook&revid=&format=trix&extractors=mappings. See here for more options: http://dbpedia.informatik.uni-leipzig.de:9999/server/extraction/.
- [ID service] Last but not least, we offer the Global ID Resolution Service. It ties together all available identifiers for one thing (i.e. at the moment all DBpedia/Wikipedia and Wikidata identifiers – MusicBrainz coming soon…) and shows their stable DBpedia Global ID.
Finding sync targets
In order to test out our algorithms, we started by looking at various groups of subjects, our so-called sync targets. Based on the different subjects a set of problems were identified with varying layers of complexity:
- identity check/check for ambiguity — Are we talking about the same entity?
- fixed vs. varying property — Some properties vary depending on nationality (e.g., release dates), or point in time (e.g., population count).
- reference — Depending on the entity’s identity check and the property’s fixed or varying state the reference might vary. Also, for some targets, no query-able online reference might be available.
- normalization/conversion of values — Depending on language/nationality of the article properties can have varying units (e.g., currency, metric vs imperial system).
The check for ambiguity is the most crucial step to ensure that the infoboxes that are being compared do refer to the same entity. We found, instances where the Wikipedia page and the infobox shown on that page were presenting information about different subjects (e.g., see here).
As a good sync target to start with the group ‘NBA players’ was identified. There are no ambiguity issues, it is a clearly defined group of persons, and the amount of varying properties is very limited. Information seems to be derived from mainly two web sites (nba.com and basketball-reference.com) and normalization is only a minor issue. ‘Video games’ also proved to be an easy sync target, with the main problem being varying properties such as different release dates for different platforms (Microsoft Windows, Linux, MacOS X, XBox) and different regions (NA vs EU).
More difficult topics, such as ‘cars’, ’music albums’, and ‘music singles’ showed more potential for ambiguity as well as property variability. A major concern we found was Wikipedia pages that contain multiple infoboxes (often seen for pages referring to a certain type of car, such as this one). Reference and fact extraction can be done for each infobox, but currently, we run into trouble once we fuse this data.
Further information about sync targets and their challenges can be found on our Meta-Wiki discussion page, where Wikipedians that deal with infoboxes on a regular basis can also share their insights on the matter. Some issues were also found regarding the mapping of properties. In order to make GlobalFactSync as applicable as possible, we rely on the DBpedia community to help us improve the mappings. If you are interested in participating, we will connect with you at http://mappings.dbpedia.org and in the DBpedia forum.
Bottomline – We value your feedback
Your DBpedia Association
The post Global Fact Sync – Synchronizing Wikidata & Wikipedia’s infoboxes appeared first on DBpedia Blog.
Thursday, July 18, 2019 - 11:39am
With timbr, WPSemantix and the DBpedia Association launch the first SQL Semantic Knowledge Graph that integrates Wikipedia and Wikidata Knowledge into SQL engines.
In part three of DBpedia’s growth hack blog series, we feature timbr, the latest development at DBpedia in collaboration with WPSemantix. Read on to find out how it works.
timbr – DBpedia SQL Semantic Knowledge Platform
Tel Aviv, Israel and Leipzig, Germany – July 18, 2019 – WP-Semantix (WPS) – the “SQL Knowledge Graph Company” and DBpedia Association – Institut für Angewandte Informatik e.V., announced today the launch of the timbr-DBpedia SQL Semantic Knowledge Platform, a unique version of WPS’ timbr SQL Semantic Knowledge Graph that integrates timbr-DBpedia ontology, timbr’s ontology explorer/visualizer and timbr’s SQL query service, to provide for the first time semantic access to DBpedia knowledge in SQL and to thus facilitate DBpedia knowledge integration into standard data warehouses and data lakes.
DBpedia is the crowd-sourced community effort to extract structured content from the information created in various Wikimedia projects and publish these as files on the Databus and via online databases. This structured information resembles an open knowledge graph which has been available for everyone on the Web for over a decade. Knowledge graphs are a new kind of databases developed to store knowledge in a machine-readable form, organized as connected, relationship-rich data. After the publication of DBpedia (in parallel to Freebase) 12 years ago, knowledge graphs have become very successful and Google uses a similar approach to create the knowledge cards displayed in search results.
Query the world’s knowledge in standard SQL
Amit Weitzner, founder and CEO at WPS commented: “Knowledge graphs use specialized languages, require resource-intensive, dedicated infrastructure and require costly ETL operations. That is, they did until timbr came along. timbr employs SQL – the most widely known database language, to eliminate the technological barriers to entry for using knowledge graphs and to implement Semantic Web principles to provide knowledge graph functionality in SQL. timbr enables modelling of data as connected, context-enriched concepts with inference and graph traversal capabilities while being queryable in standard SQL, to represent knowledge in data warehouses and data lakes. timbr-DBpedia is our first vertical application and we are very excited by the prospects of our cooperation with the DBpedia team to enable the largest user base to query the world’s knowledge in standard SQL.”
Sebastian Hellmann, executive director of the DBpedia Association commented that:
timbr will help to explore the power of semantic technologies
Prof. James Hendler, pioneer and a world-leading authority in Semantic Web technologies and WPS’ advisory board member commented “timbr can be a game-changing solution by enabling the semantic inference capabilities needed in many modelling applications to be done in SQL. This approach will enable many users to get the advantages of semantic AI technologies and data integration without the learning curve of many current systems. By giving more people access to the semantic version of Wikipedia, timbr-DBpedia will definitely contribute to allowing the majority of the market to explore the power of semantic technologies.”
timbr-DBpedia is available as a query service or licensed for use as SaaS or on-premises. See the DBpedia website: wiki.dbpedia.org/timbr.
WP-Semantix Ltd. (wpsemantix.com) is the developer of the timbr SQL semantic knowledge platform, a dynamic abstraction layer over relational and non-relational data, facilitating declaration and powerful exploration of semantically rich ontologies using a standard SQL query interface. timbr is natively accessible in Apache Spark, Python, R and SQL to empower data scientists to perform complex analytics and generate sophisticated ML algorithms. Its JDBC interface provides seamless integration with the most popular business intelligence solutions to make complex analytics accessible to analysts and domain experts across the organization.
WP-Semantix, timbr, “SQL Knowledge Graph”, “SQL Semantic Knowledge Graph” and associated marks and trademarks are registered trademarks of WP Semantix Ltd.
DBpedia is looking forward to this cooperation. Follow us on Twitter for the latest information and stay tuned for part four of our growth hack series. The next post features the GlobalFactSyncRe. Curious? You have to be a little more patient and wait till Thursday, July 25th.
Yours DBpedia Association
The post timbr – the DBpedia SQL Semantic Knowledge Platform appeared first on DBpedia Blog.
Thursday, July 11, 2019 - 12:54pm
With part II of our growth hack series, we would like to introduce you to the latest feature of our development – the new DBpedia Forum.
Why a new forum?
DBpedia has an inclusionist model and DBpedia is huge. At the core, there is data extracted from Wikipedia and Wikidata. Around this, there are derived datasets like the fusion/enrichment and also LHD. Additionally, we offer services such as DBpedia Spotlight, DBpedia Lookup, SameAs, and not to forget the main endpoint http://dbpedia.org/sparql as well as our DBpedia Chapters. All of this is surrounded by 25k academic papers and a vivid business network.
Since we have this inclusionist model, we believe that access to data and knowledge should be global and unified (and free where possible). That is exactly why we established the DBpedia Forum – to further this mission.
The DBpedia Forum is a shared community resource — a place to share skills, knowledge, and interests through an ongoing conversation about DBpedia and related topics. It is meant (among others) to replace our old support page for assistance with DBpedia. In the long run, we will shut down our (former) support page, as it is not serving our growing needs anymore.
This is what the forum currently looks like. Traffic and communication are still a little low. Start your conversation about DBpedia here and now.
Where are all the DBpedians?
We figured, most of you are already actively involved in exchange about DBpedia. However, the majority of that is scattered all over the web which makes it hard for us and others to keep track of. With the new forum, we offer you a playground for vivid exchange, and to meet and greet fellow DBpedians – a platform for everyone’s benefit.
The DBpedia Forum simplifies communication
Make this a great place for discussion by contributing yourself. It is super easy. Just visit https://forum.dbpedia.org/, browse the topics, and find the info that helps you or add your own. If you want to contribute just register and off you go. Improve the discussion by discovering ones that are already happening. Help us influence the future of the DBpedia community by engaging in discussions that make this forum an interesting place to be.
Transparency is all
To assist with maintaining an appropriate code of conduct the forum utilizes little discourse tools that enable the community to collectively identify the best (and worst) contributions. The forum tracks bookmarks, likes, flags, replies, edits, and many more. That is similar to the ranking in the old support system but much more transparent and much more fun.
For the hunter-gatherers among you, you can also earn batches for various activities – as long as you are active. And if you feel very passionate about a certain topic, we would gladly make you a moderator – just let us know.
Now is the time
Since you are already talking about DBpedia somewhere on the WWW, why not do it here and now for everyone else to follow? Your knowledge and skills are key, not only for individuals in this forum but also for the whole DBpedia community.
Happy posting and stay tuned for part III in the growth hack series. The next post will feature timbr – DBpedia SQL Semantic Knowledge Platform.
The post DBpedia Forum – New Ways to Exchange about DBpedia appeared first on DBpedia Blog.
Thursday, July 4, 2019 - 1:40pm
*UPDATE* – We are now 5 weeks in our growth hack. Read on below to find out how it all started. Click here to follow up on each of our milestones.
A growth hack – how come?
Things have gone a bit quiet around DBpedia. No new releases, no clear direction to go. Did DBpedia stop? Actually not. There were community and board member meetings, discussions, 500 messages per week on dbpedia.slack.com.
We are still there. We, as a community, restructured and now we are done, which means that DBpedia will now work more focused to build on its Technology Leadership role in the Web of Data and thus – with our very own DBpedia Growth Hack – bring new innovation and free fuel to everybody.
What is this growth hack?
We restructured in two areas:
- The agility of knowledge delivery – our release cycle was too slow and too expensive. We were unable to include substantial contributions from DBpedians. Therefore, quality and features stagnated.
- Transparent processes – DBpedia has a crafty community with highly skilled knowledge engineers backing it. At some point, we grew too much and became lumpy, with a big monolithic system that nobody could improve because of side effects. So we designed a massive curation infrastructure where information can be retrieved, adjusted and errors discussed and fixed.
We have been consistently working on this restructuring for two years now and we now have the infrastructure ready as horizontal prototype meaning each part works and everybody can start using it. We ate our own dog food and built the first application.
Now we will go through each part and polish & document it, and will report about it with a blog post each. Stay tuned !
Is DBpedia Academic or Industrial?
The Semantic Web has a history of being labelled as too academic and a part of it colored DBpedia as well. Here is our personal truth: It is an engineering project and therefore it swings both ways. It is a great academic success with 25,000 papers using the data and enabling research and innovation. The free data drives research on data-driven research. Also, we are probably THE fastest pathway from lab to market as our industry adoption has unprecedented speed. Proof will follow in the blog posts of the Growth Hack series.
Blog Posts of the Growth Hack series:
(not necessarily in that order, depending on how fast we can polish & document )
- Query DBpedia as SQL – a first service on the Databus
- DBpedia Live Extraction – Realtime updates of Wikipedia
- DBpedia Business Models – How to earn money with DBpedia & the Databus
- MARVIN Release Bot – together with https://blogs.tib.eu/wp/tib/ incl. an update of https://wiki.dbpedia.org/Datasets
- The new forum – https://forum.dbpedia.org is already ready to register, but needs some structure. Intended as replacement of support.dbpedia.org
In addition some announcements of on-going projects:
- GlobalFactSync (GFS) – Syncing facts between Wikipedia and Wikidata
- Energy Databus: LOD GEOSS project focusing on energy system data on the bus
- Supply-Chain-Management Databus – PLASS project focusing on SCM data on the bus
So, stay tuned for our upcoming posts and follow our journey.
Tuesday, June 18, 2019 - 2:52pm
For the second time now, we co-located one of our DBpedia community meetings with the LDK-conference. After the previous edition in Galway two years ago, It was Leipzig’s turn to host the event. Thus, the 13th DBpedia community meeting took place in this beautiful city which is also home to the DBpedia Association’s head office. Win, Win we’d say.
After a very successful LDK conference May 20th-21st, representatives of the European DBpedia community met at Villa Ida Mediencampus, on Thursday, May 23rd, to present their work with DBpedia and to exchange about the DBpedia Databus.
For those of you who missed it or for those who want a little retrospective on the day, this blog post provides you with a short LDK-wrap-up as well as a recap of our DBpedia Day.
First things first
First and foremost, we would like to thank LDK organizers for co-locating our meeting and thus enabling fruitful synergies, and a platform for the DBpedia community to exchange.
The first presentation that kicked-off the conference was given by Prof. Christiane Fellbaum from Princeton University. The topic of her talk was on “Mapping the Lexicons of Signs and Words” with the main focus on her research of mapping WordNet and SignStudy, a resource for American Sign Language. Shortly after, Prof Eduard Werner from Leipzig University gave a very exciting talk on the “Sorbian languages”. He discussed the nature of the Sorbian languages, their historical background, and the unfortunate imminent extinction of lower Sorbian due to a decline of native speakers.
The first day of LDK was full of exciting presentations related to various language-oriented topics. Researchers exchanged about linguistic vocabularies, SPARQL query recommendations, role and reference grammar, language detection, entity recognition, machine translation, under-resourced languages, metaphor identification, event detection and linked data in general. The first day ended with fruitful discussions during the poster session. After at the end of the first conference day, LDK visitors had the chance to mingle with locals in some of Leipzig’s most exciting bars during a pub crawl.
Prof. Christian Bizer from the University of Mannheim opened the second day with a keynote on “Schema.org Annotations and Web Tables: Underexploited Semantic Nuggets on the Web?”. In his talk, he gave a nice overview of the research on knowledge extraction around the large-scale Web Data Commons corpus, findings, open challenges and possible exploitations of this corpus.
The second day was busy with four sessions, each populated with presentations on exciting topics ranging from relation classification, dictionary linking and entity linking, to terminology models, topical thesauri and morphology.
The series of presentations was ended with an Organ Prelude played by David Timm, the University Music Director at the Leipzig University. Finally, the day and the conference was concluded with a conference dinner at Moritzbastei, one of Leipzig’s famous cultural centres.
On May 23rd, the DBpedia Community met for the 13th DBpedia community meeting. The event attracted more than 60 participants who extended their LDK experience or followed our call to Leipzig.
Opening & keynotes
The meeting was opened by Dr. Sebastian Hellmann, the executive director of the DBpedia Association. He gave an overview of the latest developments and achievements around DBpedia, with the main focus on the DBpedia Databus technologies. The first keynote was given by Dr. Peter Haase, from metaphacts, with an unusual interactive presentation on “Linked Data Fun with DBpedia”. The second keynote speaker was Prof. Heiko Paulheim, presenting findings, challenges and results from his work on the construction of the DBkWiki Knowledge Graph by exploiting the DBpedia extraction framework.
The showcases session started with a presentation given by Krzysztof Węcel on “Citations and references in DBpedia”, followed by Peter Nancke with a presentation on the “TeBaQA Question Answering System”, Maribel Acosta Deibe speaking about “Crowdsourcing the Quality of DBpedia” and finally, a presentation by Angus Addlesee on “Data Reconciliation using DBpedia”.
NLP & DBpedia session
The DBpedia & NLP session was opened by Diego Moussallem presenting the results from his work on “Generating Natural Language from RDF Data”. The second presentation was given by Christian Jilek on the topic of “Named Entity Recognition for Real-Time Applications”, which at the same time won the best research paper at the LDK conference. Next, Jonathan Kobbe presented the best student paper at the LDK conference on the topic of “Argumentative Relation Classification”. Finally, Edgard Marx closed the session with an overview presentation on “From the word to the resource”.
Side-Event – Hackathon
The “Artificial Intelligence for Smart Agriculture” Hackathon focused on enhancing the usability of automatic analysis tools which utilize semantic big data for agriculture, as well as conducting an outreach of the DataBio project for the DBpedia community. The event was supported by PNO, Spacebel, PSNC, and InfAI e.V.
In addition, we presented a paper about Linked Data publication pipelines, focusing on agri-related data, at the co-located LSWT conference.
After the event, DBpedians joined the DBpedia Association in the nearby pub Gosenschenke to delve into more vital talks about the Semantic Web world, Linked Data & DBpedia.
We are currently looking forward to the next DBpedia Community Meeting, on Sept, 12th in Karlsruhe, Germany. This meeting is co-located with the SEMANTiCS Conference. Contributions are still welcome. Just ping us via email@example.com and show us what you’ve got. You should also get in touch with us if you want to host a DBpedia Meetup yourself. We will help you with the program, the dissemination or organizational matters of the event if need be.
Your DBpedia Association
The post Home Sweet Home – The 13th DBpedia Community Meeting appeared first on DBpedia Blog.
Thursday, April 11, 2019 - 3:20pm
Artificial Intelligence (AI) is currently the central subject of the just announced ‘Year of Science’ by the Federal German Ministry. In recent years, new approaches were explored on how to facilitate AI, new mindsets were established and new tools were developed, new technologies implemented. AI is THE key technology of the 21st century. Together with Machine Learning (ML), it transforms society faster than ever before and, will lead humankind to its digital future.
In this digital transformation era, success will be based on using analytics to discover the insights locked in the massive volume of data being generated today. Success with AI and ML depends on having the right infrastructure to process the data.
The Value of Data Governance
One key element to facilitate ML and AI for the digital future of Europe, are ‘decentralized semantic data flows’, as stated by Sören Auer, a founding member of DBpedia and current director at TIB, during a meeting about the digital future in Germany at the Bundestag. He further commented that major AI breakthroughs were indeed facilitated by easily accessible datasets, whereas the Algorithms used were comparatively old.
In conclusion, Auer reasons that the actual value lies in data governance. Infact, in order to guarantee progress in AI, the development of a common and transparent understanding of data is necessary. 
DBpedia Databus – Digital Factory Platform
The DBpedia Databus – our digital factory Platform – is one of many drivers that will help to build the much-needed data infrastructure for ML and AI to prosper. With the DBpedia Databus, we create a hub that facilitates a ‘networked data-economy’ revolving around the publication of data. Upholding the motto, Unified and Global Access to Knowledge, the databus facilitates exchanging, curating and accessing data between multiple stakeholders – always, anywhere. Publishing data on the Databus means connecting and comparing (your) data to the network. Check our current DBpedia releases via http://downloads.dbpedia.org/repo/dev/.
DBpediaDay – & AI for Smart Agriculture
Furthermore, you can learn about the DBpedia Databus during our 13th DBpedia Community meeting, co-located with LDK conference, in Leipzig, May 2019. Additionally, as a special treat for you, we also offer an AI side-event on May 23rd, 2019.
May we present you the thinktank and hackathon – “Artificial Intelligence for Smart Agriculture”. The goal of this event is to develop new ideas and small tools which can demonstrate the use of AI in the agricultural domain or the use of AI for a sustainable bio-economy. In that regard, a special focus will be on the use and the impact of linked data for AI components.
In short, the two-part event, co-located with LSWT & DBpediaDay, comprises workshops, on-site team hacking as well as presentations of results. The activity is supported by the projects DataBio and Bridge2Era as well as CIAOTECH/PNO. All participating teams are invited to join and present their projects. Further Information are available here. Please submit your ideas and projects here.
Finally, the DBpedia Association is looking forward to meeting you in Leipzig, home of our head office. Pay us a visit!
 Zeus Kerravala; The Success of ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING Requires an Architectural Approach to Infrastructure. ZK Research: A Division of Kerravala Consulting © 2018 ZK Research, available via http://bit.ly/2UwTJRo
 Sören Auer; Statement at the Bundestag during a meeting in AI, Summary is available via https://www.tib.eu/de/service/aktuelles/detail/tib-direktor-als-experte-...
Sunday, March 24, 2019 - 1:40pm
Call for Participation LDK – Conference
With the advent of digital technologies, an ever-increasing amount of language data is now available across various application areas and industry sectors, thus making language data more and more valuable. In that context, we would like to draw your attention to the 2nd Language, Data and Knowledge conference, short LDK conference which will be held in Leipzig from May 20th till 22nd, 2019.
This new biennial conference series aims at bringing together researchers from across disciplines concerned with language data in data science and knowledge-based applications.
We are happy, that Christian Bizer, a founding member of DBpedia, will be one of the three amazing keynote speakers that open the LDK conference. Apart from Christian, Christiane Fellbaum from Princeton University and Eduart Werner, representative of Leipzig University will share their thoughts on current language data issues to start vital discussions revolving around language data.
Be part of this event in Leipzig and catch up with the latest research outcomes in the areas of acquisition, provenance, representation, maintenance, usability, quality as well as legal, organizational and infrastructure aspects of language data.
DBpedia Community Meeting
To get the full Leipzig experience, we also like to invite you to our DBpedia Community meeting, which is colocated with LDK and will be held on May, 23rd 2019. Contributions are still welcome. Just in get in touch via firstname.lastname@example.org .
We also offer an interesting side-event, the Thinktank and Hackathon “Artificial Intelligence for Smart Agriculture”. Visit our website for further information.
Join LDK conference 2019 and our DBpedia Community Meeting to catch up with the latest research and developments in the Semantic Web Community.
Yours DBpedia Association
The post Call for Participation – LDK Conference & DBpedia Day appeared first on DBpedia Blog.
Wednesday, February 27, 2019 - 10:52am
Pinky: Gee, Brain, what are we gonna do this year? Brain: The same thing we do every year, Pinky. Taking over GSoC.
Exactly what DBpedia plans to do this summer. We have been accepted as one of 206 open source organizations to participate in Google Summer of Code (GSoC) again. Yes, ONE OF 206, let that sink in. The upcoming GSoC marks the 15th consecutive year of the program and is the 8th year in a row for DBpedia.
What is GSoC?
Google Summer of Code is a global program focused on bringing student developers into open source software development. Funds will be given to students (BSc, MSc, Ph.D.) to work for three months on a specific task. For GSoC- Newbies, this short video and the information provided on their website will explain all there is to know about GSoC.
Time for a New Narrative
In the past years, we mentored many successful projects by female students but mostly male applicants. Now, it is time to change this narrative and work towards more diversity in science. This year, we at DBpedia are more determined than ever to encourage female students to apply for our projects. That being said, we already engaged excellent female mentors to also raise the female percentage in our mentor team. We are proud of all female DBpedians that help to shape the future DBpedia.
In the following four weeks, we invite all students, female and male, who are interested in Semantic Web and Open Source development to apply for our projects. You can also contribute your own ideas to work on during the summer. We are regularly growing our community through GSoC and can deliver more and more opportunities to you.
And this is how it works: 3 steps to GSoC stardom
- Open source organizations such as DBpedia announce their projects ideas.
- Students contact the mentor organizations they want to work with and write up a project proposal.
- After a selection phase, students are matched with a specific project and a set of mentors to work on the project during the summer.
To all the smart brains out there, if you are a student who wants to work with us during summer 2019, check our list of project ideas, warm-up tasks or come up with your own idea and get in touch with us.
Further information on the application procedure is available via our DBpedia -Guidelines. There you will find information on how to contact us and how to appropriately apply for GSoC. Please also note the official GSoC 2019 timeline for your proposal submission and make sure to submit on time. Unfortunately, extensions cannot be granted. Final submission deadline is April 9th, 2019, 8pm CET.
And in case you still have questions, please do not hesitate to contact us via email@example.com.
We are thrilled to meet you and your ideas.
Your DBpedia-GSoC -Team
Wednesday, February 13, 2019 - 1:18pm
After our meetups in Poland and France last year, we delighted the Czech DBpedia community with a DBpedia meetup. It was co-located with the XML Prague conference on February 7th, 2019.
First and foremost, we would like to thank Jirka Kosek (University of Economics, Prague), Milan Dojchinovski (AKSW/KILT, Czech Technical University in Prague), Tomáš Kliegr (KIZI/University of Economics, Prague) and, the XML Prague conference for co-hosting and support the event.
Opening the DBpedia community meetup
The Czech DBpedia community and the DBpedia Databus were in the focus of this meetup. Therefore, we invited local data scientists as well as DBpedia enthusiasts to discuss the state-of-the-art of the DBpedia databus. Sebastian Hellmann (AKSW/KILT) opened the meeting with an introduction to DBpedia and the DBpedia Databus. Following, Marvin Hofer explained how to use the DBpedia databus in combination with the Docker technology and, Johannes Frey (AKSW/KILT) presented the methods behind the DBpedia’s Data Fusion and Global ID Management.
Marek Dudáš (KIZI/UEP) started the DBpedia Showcase Session with a presentation on “Concept Maps with the help of DBpedia”, where he showed the audience how to create a “concept map” with the ContextMinds application. Furthermore, Tomáš Kliegr (KIZI/UEP) presented “Explainable Machine Learning and Knowledge Graphs”. He explained his contribution to a rule-based classifier for business use cases. Two other showcases followed: Václav Zeman (KIZI/UEP), who presented “RdfRules: Rule Mining from DBpedia” and Denis Streitmatter (AKSW/KILT), who demonstrated the “DBpedia API”.
Closing this Session, Miroslav Blasko (CTU, Prague) gave a presentation on “Ontology-based Dataset Exploration”. He explained a taxonomy developed for dataset description. Additionally, he presented several use cases that have the main goal of improving content-based descriptors.
Summing up, the DBpedia meetup in Prague brought together more than 50 DBpedia enthusiasts from all over Europe. They engaged in vital discussions about Linked Data, the DBpedia databus, as well as DBpedia use cases and services.
We are currently looking forward to the next DBpedia Community Meeting, on May 23rd, 2019 in Leipzig, Germany. This meeting is co-located with the Language, Data and Knowledge (LDK) conference. Stay tuned and check Twitter, Facebook and the website or subscribe to our newsletter for the latest news and updates.
Your DBpedia Association
Wednesday, December 26, 2018 - 7:37pm
We are happy to announce that the upcoming DBpedia meetup will be held in Prague, Czech Republic. During the XML conference Prague , Feb 7-9, the DBpedia Community will get together on February 7, 2019.
– Intro: DBpedia: Global and Unified Access to Knowledge (Graphs)
– DBpedia Databus presentation
– DBpedia Showcase Session
– Web URL: https://wiki.dbpedia.org/meetings/Prague2019
– When: February 7th, 2019
– Where: University of Economics, nam. W. Churchilla 4, 130 67 Prague 3, Czech Republic
- Please check the schedule for the upcoming DBpedia meetup here: https://wiki.dbpedia.org/meetings/Prague2019
– Attending the DBpedia Community Meetup costs €40. DBpedia members get free admission, please contact your nearest DBpedia chapter or the DBpedia Association for a promotion code.
– You need to buy a ticket. Please check all details here: http://www.xmlprague.cz/conference-registration/
Sponsors and Acknowledgments
– XML conference Prague (http://www.xmlprague.cz/)
– Institute for Applied Informatics (http://infai.org/en/AboutInfAI)
– OpenLink Software (http://www.openlinksw.com/)
-Milan Dojčinovski, AKSW/KILT
– Julia Holze, DBpedia Association
– Sebastian Hellmann, AKSW/KILT, DBpedia Association
– Tomáš Kliegr, KIZI/University of Economics, Prague
Tell us what cool things you do with DBpedia. If you would like to give a talk at the DBpedia meetup, please get in contact with the DBpedia Association.
We are looking forward to meeting you in Prague!
Your DBpedia Association
The post Call for Participation: DBpedia meetup @ XML Prague appeared first on DBpedia Blog.