Nice. A small tip regarding the URL https://databus.dbpedia.org/rogargon/collections/browsable_core : it is fine to put version info into the URL, e.g. you could call it browsable_core_2021 and later create browsable_core_2022. Alternatively, you could make it dynamic, i.e. update browsable_core so that it always points to the latest working subset. This decision is up to you.
Regarding the Databus Client problem: the Databus is also fully bash-compatible for downloading, so you do not strictly need the client:
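# Fetch the SPARQL query that defines the collection
query=$(curl -H "Accept:text/sparql" https://databus.dbpedia.org/rogargon/collections/browsable_core)
# Run the query, drop the CSV header line and strip the quotes around each URL
files=$(curl -H "Accept: text/csv" --data-urlencode "query=${query}" https://databus.dbpedia.org/repo/sparql | tail -n+2 | sed 's/"//g')
# Download every file; the URL is quoted so special characters survive
while IFS= read -r file ; do wget "$file"; done <<< "$files"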
So you could use this as it is, or adapt it to do the bzip2 to gzip conversion.
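As a rough sketch of the bzip2 to gzip conversion, something like the following might work. It assumes that curl, bzip2 and gzip are installed and that $files holds one download URL per line, as in the snippet above. The function names (recompress_bz2_to_gz, gz_name_for, fetch_as_gzip) are made up for illustration and are not part of the Databus or the Databus Client.

```shell
#!/usr/bin/env bash
# Sketch only: stream each download and re-compress bzip2 as gzip.
# All function names here are hypothetical helpers, not Databus tooling.

# Read bzip2 data on stdin, write gzip data on stdout.
recompress_bz2_to_gz() {
  bzip2 -dc | gzip -c
}

# Derive the local output name, e.g. .../data.ttl.bz2 -> data.ttl.gz
gz_name_for() {
  local base
  base=$(basename "$1")
  printf '%s.gz\n' "${base%.bz2}"
}

# Read URLs from stdin (one per line). Files ending in .bz2 are converted
# on the fly; everything else is downloaded unchanged.
fetch_as_gzip() {
  local url
  while IFS= read -r url; do
    [ -n "$url" ] || continue
    case "$url" in
      *.bz2) curl -sSfL "$url" | recompress_bz2_to_gz > "$(gz_name_for "$url")" ;;
      *)     curl -sSfLO "$url" ;;
    esac
  done
}
```

It would then be called as fetch_as_gzip <<< "$files". Because the conversion runs in a pipe, a failed download can leave a small, empty gzip file behind, so adding set -o pipefail or checking the results afterwards is probably a good idea.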
@eisenbahnplatte can you look at the problem? This seems like a very good use case for the Databus Client, but apparently it cannot be compiled?