Hi,
What SPARQL queries [on the Databus endpoint] do I need to do in order to get latest DBpedia dataset(s) for a non-English language DBpedia?
In particular, I am looking for DBpedia files for the Latvian language (“lv”).
Best regards,
Uldis
Hi,
What SPARQL queries [on the Databus endpoint] do I need to do in order to get latest DBpedia dataset(s) for a non-English language DBpedia?
In particular, I am looking for DBpedia files for the Latvian language (“lv”).
Best regards,
Uldis
Hey @CaptSolo,
yes with a Databus account (log in required) this is quite easy. Databus Collections can be seen as a customizable dynamic shopping carts of data (files). You can just create you own customized collection based on EN latest core release and then change for all groups in the release the language if required/applicable.
The collection link for latest core files is https://databus.dbpedia.org/dbpedia/collections/latest-core . This collection updates automatically and always refers to the latest available files . A small part of data from DBpedia Extraction Groups (approx. 100 of 4000 files or 2.5%) is selected in the latest-core collection . If you would like to customize it, 1. register/login 2. go to the collection and click “Action” -> “Edit Copy”
See the gif below. Just repeat this for all groups in the tree where you would like to switch to LV.
you can also change/override the language for artifacts files only, since not for all files a Latvian variant is available, or just use a mix of EN and LV.
I hope that helps.
Hi,
I have downloaded the latest DBpedia dump files for “lv” (Latvian) based on the latest Core / English version.
However, I would also like to have “sameAs” links (1) to other DBpedia languages and (2) to Wikidata. I did not find these links in the collection mentioned above. What file / artefact should I use for that?
Is it this one? : https://databus.dbpedia.org/vehnem/replaced-iris/sameAs/2022.03.01
P.S. Latvian versions do not exist for some artefacts but I guess it’s normal that some artefacts are available in a smaller number of languages. Just in case, here is a list of what is missing a Latvian version:
https://databus.dbpedia.org/dbpedia/generic/commons-sameas-links/2022.03.01/
https://databus.dbpedia.org/dbpedia/generic/disambiguations/2022.03.01/
https://databus.dbpedia.org/dbpedia/generic/homepages/2022.03.01/
https://databus.dbpedia.org/dbpedia/generic/images/2022.03.01/
https://databus.dbpedia.org/dbpedia/generic/persondata/2022.03.01/
Best regards,
Uldis