I want to create a new extractor but i am unable to understand the following:
1: I want to create new output dataset file, just creating a new dataset in Dataset.scala is not working for me.
2: I want to iterate all the rdf triples in mappingsbased-objects-uncleaned,ttl.bz2 file, perform some processing and then generate new rdf triples in a newly created dataset file. It is also required to run this at last when all other extraction has been done.
In the gender extractor following comment is written:
// Even better: in the first extraction pass, extract all types. Use them in the second pass.
How this multipass functionality can be implemented?
Please tell me how can i perform above operations