When running the Apache Jena riot tool, I get the errors below.
ubuntu@dbpedia_data:~/data$ riot --validate freebase-links_lang\=en.ttl
15:20:42 WARN riot :: [line: 820, col: 31] Illegal character in IRI (codepoint 0x22, '"'): <http://dbpedia.org/resource/["]...>
15:20:42 WARN riot :: [line: 820, col: 50] Illegal character in IRI (codepoint 0x22, '"'): <http://dbpedia.org/resource/"populate_or_perish["]...>
15:20:42 WARN riot :: [line: 820, col: 1 ] Bad IRI: <http://dbpedia.org/resource/"populate_or_perish"> Code: 4/UNWISE_CHARACTER in PATH: The character matches no grammar rules of URIs/IRIs.
I ran wget on https://databus.dbpedia.org/dbpedia/transition/freebase-links/2019.02.10/freebase-links_lang=en.ttl.bz2 and then ran bunzip2 on the file to get the Turtle file, which was passed to riot as input.
The file in question is part of this dataset: dbpedia/collections/latest-core
The " character inside the IRIs is what triggers these errors. Has anyone had this issue before?
Note: there are multiple files in the dataset that have this same issue.
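To get a rough idea of how many IRIs are affected in a given file, something like the following could be used. This is only a minimal sketch, not part of Jena: the function name find_quoted_iris and the regular expression are my own, and it assumes each IRI sits on a single line and contains no whitespace or angle brackets. It does not parse Turtle, so a literal that happens to contain angle brackets could produce a false match.

```python
import re

# Matches an IRI reference <...> that contains a double quote (U+0022).
# Assumption: IRIs are written on one line without whitespace or nested
# angle brackets. This is a text scan, not a Turtle parser.
BAD_IRI = re.compile(r'<[^<>\s]*"[^<>\s]*>')


def find_quoted_iris(lines):
    """Yield (line_number, iri) for every IRI that contains a double quote."""
    for number, line in enumerate(lines, start=1):
        for match in BAD_IRI.finditer(line):
            yield number, match.group(0)
```

Passing an open file handle for freebase-links_lang=en.ttl (opened with encoding="utf-8") to find_quoted_iris and printing each result should list the line numbers and IRIs that riot complains about.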