DBpedia Dataset 2019-08-30 (Pre-Release)

@kurzum Looks pretty good now, over the last few days working with it (2020/07 dataset), everything looked fine.

One thing however, shouldn’t the foaf: properties be contained inside mappingbased-objects?

select count(*) { ?s foaf:gender ?o }

returns 1,418,206 in the 2016 public endpoint, but in my up to date local endpoint, these seem to be missing. I used these artifacts from my testing collection and the above query returns 0.

Same thing with dbo:thumbnail: 1,695,460 in 2016, 0 locally.

And with foaf:depiction (which is kinda the same thing as dbo:thumbnail anyway): 1,698,622 2016, 300 locally (??)

Is this another error or am I missing some dataset?

@phil294 could be this one: https://databus.dbpedia.org/dbpedia/generic/persondata/
we excluded it because it is broken anyhow. Not sure, if it worth repairing. Seems like they come from other datasets, i.e. wikidata or musicbrainz. The latest fusion has them:
https://databus.dbpedia.org/vehnem/flexifusion/fusion/2019.12.15
filter tag by gender.

see under: https://wiki.dbpedia.org/develop/datasets/latest-core-dataset-releases Missing:

mageExtractor was malfunctioning and disabled, i.e. only images from infoboxes are extracted, no clean licenses. (Will be fixed with https://databus.dbpedia.org/dbpedia/wikidata/images/)

Huh, so for so depictions/thumbnails, one needs to join wikidata resources over owl:sameAs and select its dbpedia/wikidata/images. That should work, thanks.

Not sure, if it worth repairing.

@kurzum I would assume it is worth – otherwise, the dataset doesnt contain even basic information like the gender of a person, which seems pretty important (?), without also taking the above mentioned route with owl:sameAs/wikidata-resource

For example, http://dbpedia.org/page/Hans_Sarpei knows that Hans Sarpei is male, https://dbpedia.demo.openlinksw.com/page/Hans_Sarpei doesnt.

@phil294 so you are right about the joining with other data. Wikipedia and wikidata are not suitable for that, since they are copies of other data.
We are integrating data from national libraries for example. The age of extraction is ending. the age of integration has started.