During my analysis on this issue, I had some new interesting findings of the multilingualism of the aforementioned datasets. More specifically, what I have discovered is:
- Many attributes have “English” literals, even in non-English datasets, and also the other way around.
- The language tag is not useful for these attributes as it is often incorrect.
Hence, the problem resides not only on the “name” attribute but is distributed among all the attributes that compose a dataset. Following there are a few examples that better describe this issue:
<http://ja.dbpedia.org/resource/アムステルダム大学> <http://xmlns.com/foaf/0.1/name> “Athenaeum Illustre”@ja .
<http://ja.dbpedia.org/resource/ウォルター・ベッカー> <http://xmlns.com/foaf/0.1/givenName> “Walter Carl Becker”@ja .
<http://fr.dbpedia.org/resource/Chris_Claremont> <http://dbpedia.org/ontology/homage> “Comics Buyer’s Guide” .
<http://fr.dbpedia.org/resource/Michigan> <http://xmlns.com/foaf/0.1/nick> “The Wolverine State, The Great Lakes State”@fr .
<http://de.dbpedia.org/resource/Lakeway> <http://dbpedia.org/ontology/cityType> “City” .
<http://de.dbpedia.org/resource/New_Bedford_(Massachusetts)> <http://xmlns.com/foaf/0.1/nick> “The Whaling City”@de .
<http://dbpedia.org/resource/Kanako_Maeda> <http://xmlns.com/foaf/0.1/name> “前田 夏菜子“@en .
<http://dbpedia.org/resource/Bullet,_Switzerland> <http://dbpedia.org/ontology/demonym> “Lè Pi-Bot”@en .
<http://dbpedia.org/resource/North_Rhine-Westphalia> <http://xmlns.com/foaf/0.1/name> “Nordrhein-Westfalen”@en .
To make things even more explicit, clearly the literal “City” corresponding to the “cityType” attribute for the entity “Lakeway” in the German DBpedia is not in German, rather in English.
The question is now more general with respect to what I asked before: is there a way to get the literals corresponding to entity attributes in the “correct” source language (i.e., the one of the dataset) without using any external translation or language recognition tools?
Thank you so much for your help. Looking forward to discussing this.