Mapping Generation from Resource Descriptions - GSoC2020


DBpedia currently maintains mappings between Wikipedia infobox template properties to the DBpedia ontology, since several similar templates exist (in single as well as over multiple languages) to describe closely related types of infoboxes. The aim of the project is to enrich and possibly correct the existing mappings with a data-driven method to propose or generate mappings automatically by analyzing instance data from distinct language-specific datasets. This will be a follow-up of a previous GSoC project, which mainly mapped the classes to infobox templates.
A central goal is also to map Wikidata property identifiers.


Provide suggestions (eg by using statistical probabilities) for template parameters which properties from DBpedia ontology and from Wikidata should be mapped.


Increase the coverage for mapped languages and yet not mapped languages, which finally leads to better data quality.

Warm up tasks

Familiarize with and evaluate the results of the previous project code base (no fixed stipulation to re-use this).



mappings, knowledge base completion, data quality