Neural SPARQL Machine is a project that deals with building an end-to-end system to answer questions posed by user not versed with writing SPARQL queries.
Currently DBpedia hosts billions of such data points and corresponding relations in the RDF format. Accessing such data is difficult for a lay user, who does not know how to write a SPARQL query. This proposal tries to built upon a System: ( ​​https://github.com/AKSW/NSpM/tree/master ​) — which tries to make this humongous linked data available to a larger user base in their natural languages(now restricted to English) by improving, adding and amending upon the existing codebase.
Read through the blogs and the reading list to get a good understanding of the code. This will allow you to get a good idea about the project.
Run the pipelines in the gsoc/anand folder of the repository mentioned above. For a certain ontology.
Ideas
Now that you have a good understanding of the current state of the project, we suggest you to build proposals pondering on some of the following points, feel free to bring your own solutions to tackle the problems that the project faces.
Structure of the questions.
Basic Graph Pattern (BGP)
1. subordinate clauses or genitive (which / that / of / ’s)
2. con-/disjunctions (and / or / as well as)
3. modifiers (which + mod / what + mod / demonyms)
4. comparative (more than / -er than)
5. superlative (most … / -est)
6. numeric / quantitative (how many / long / tall)
Tackling out of vocabulary words
Using word embedding
Integrating fast-text
Updating to code-base to python3
Feel free to contact us for more information. We eagerly look forward to working with you and contributing towards making data accessible to all.
Before starting up with the proposal, I would suggest you to read the papers and complete the warm up tasks. Doing so will help you in writing a good proposal. Feel free to ask questions here.
Hi. I am interested in this project, but I have some issues running it because of the version of TensorFlow. I see that this project is developed with python 2, tensorflow1.12, but my PC is under system windows and the most recent version for python27 windows is tensorflow1.10. Do I need a Linux environment? Or do you have any suggestion?
We did have multiple discussions after the GSoC period ended, some of the discussed points have been added to the ideas section in the topic description mentioned above.
Sure, do share the draft proposal with us (Recommended platform: Google Docs, share with us privately).
As far as pull requests and merges are concerned, its not compulsory. But we do encourage you to interact with the code and create pull request for small issues, if you come across any.
On first glance they may seem similar but if you take a closer look, you will find that this project (briefly) focuses on the aspect of NSpM that deals with handling a wide range of compositional question currently limited to English Language (complex questions)(hint: Check out the Basic Graph Patterns and other ideas in the topic description above).
Whereas the project you are referring to focuses on the multilingual aspect of Neural QA. Thus, extending the NSpM framework to couple with the multilingualism challenge as stated in the corresponding page. You may find more information about DBNQA here: https://github.com/AKSW/DBNQA.
Indeed, I have gone through your proposal. Please provide us comment access, so that I can answer the questions you have asked in the doc files as comments.
Hello, My name is Mahesh Kulkarni.Currently I am in my final year of B.Tech degree from Vishwakarma Institute Of Technology, Pune , India. I have some prior experience with NLP , Deep Learning. I am finding interest in this project. I want to contribute to it. I have gone through warm up tasks.Any further helpful instructions so that I will get more clarifications about the project?
Thank you
Sounds, great! The description above contains all the information necessary to help you to get started. Draft a proposal with your ideas pertaining to this projects and share with us.
Thanks for quick reply, in your blog
Future aspects of this project:
Working on variable awareness :
can you elaborate this so i can get more idea?
also adding some SPARQL learning resources will be helpful for me.