QADO Question Answering Dataset RDFizer

This tool provides a web service to convert particular question answering datasets (represented in JSON format) into RDF Turtle. It uses the Question Answering Dataset Ontology (QADO) to represent the data in RDF.

Setup service

Configuration

This service requires a running instance of the QADO RML Applicator. Provide the host of that instance (e.g., http://localhost:8000) by setting the environment variable RML_APPLICATOR_HOST.
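Before starting the service, it can help to confirm that the variable is set and holds a plausible URL. The following is a minimal Python sketch under that assumption. The helper name `read_rml_applicator_host` is hypothetical and not part of this repository, and it only checks the shape of the value, not whether the applicator is actually reachable.

```python
import os
from urllib.parse import urlparse


def read_rml_applicator_host(env=os.environ):
    """Return the configured RML Applicator host or raise ValueError.

    Hypothetical helper for illustration only; the service itself may
    validate the variable differently.
    """
    host = env.get("RML_APPLICATOR_HOST", "").strip()
    if not host:
        raise ValueError("RML_APPLICATOR_HOST is not set")
    parsed = urlparse(host)
    # Accept only absolute http(s) URLs such as http://localhost:8000
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"RML_APPLICATOR_HOST is not an http(s) URL: {host!r}")
    # Drop a trailing slash so paths can be appended consistently
    return host.rstrip("/")
```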

Using Gradle

To run the service locally, execute ./gradlew run.

Using Docker

Instead of using Gradle, you can run the service in Docker by pulling the prepared image:

docker pull wseresearch/qado-rdfizer:latest

You can also build the Docker image from source. To start a Docker container, use the following command:

docker run -d --env RML_APPLICATOR_HOST="YOUR RML APPLICATOR HOST" -p "$EXTERNAL_PORT:8080" wseresearch/qado-rdfizer:latest

Accessing the service

Basic UI

This service provides a basic UI at $HOST:$PORT/ where you can transform a dataset and view the results directly in the web browser.

API endpoint

To transform a JSON file to RDF, send a POST request to $HOST:$PORT/json2rdf with a JSON payload of the following structure:

{
  "filePath": "URL of the JSON file",
  "format": "Mapping file name",
  "label": "Name for the generated RDF triples",
  "homepage": "URL of the data publisher",
  "language": "Language tag of the questions (required only for 'compositional_wikidata' format)"
}
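As a rough illustration of the payload structure above, the following Python sketch assembles the request body and enforces the one conditional rule stated there (the language tag for the compositional_wikidata format). The function name `build_payload` and its parameter names are assumptions made for this example; only the JSON field names come from this README.

```python
import json


def build_payload(file_path, fmt, label, homepage, language=None):
    """Assemble the JSON body for /json2rdf (illustrative helper)."""
    payload = {
        "filePath": file_path,   # URL of the JSON file
        "format": fmt,           # mapping file name / format identifier
        "label": label,          # name for the generated RDF triples
        "homepage": homepage,    # URL of the data publisher
    }
    # The language tag is required only for the compositional_wikidata format.
    if fmt == "compositional_wikidata" and not language:
        raise ValueError("'language' is required for compositional_wikidata")
    if language:
        payload["language"] = language
    return json.dumps(payload)
```

The returned string can be used directly as the body of the POST request.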

Supported Datasets

By default, the following datasets/formats are supported (using these RML mappings):

- QALD: Question Answering over Linked Data
  - format identifier: qald
  - supported versions: 5, 6, 8, 9, 9-plus, and 10
- LC-QuAD: Largescale Complex Question Answering Dataset
  - supported versions:
    - 1, format identifier: lc-quad
    - 2, format identifier: lc-quad-2
- RuBQ: A Russian Knowledge Base Question Answering and Machine Reading Comprehension Data Set
  - format identifier: rubq
  - supported versions: 1 and 2
- Mintaka: A complex, natural, and multilingual dataset for end-to-end question answering
  - format identifier: mintaka
- ComplexWebQuestions: A dataset for answering complex questions that require reasoning over multiple web snippets
  - format identifier: cwq
- (beta) Compositional Wikidata Questions
  - format identifier: compositional_wikidata

Supported output formats

You can choose the output format of the service by sending an Accept header with the request. The following media types are supported:

text/turtle (Turtle, default)
text/xml (TriX)
application/ld+json (JSON-LD)
application/n-triples (N-Triples)
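To show how the Accept header fits together with a request to the /json2rdf endpoint, here is a minimal Python sketch using only the standard library. The function name `build_request` and the short keys in `ACCEPT_TYPES` are assumptions chosen for this example; the media types themselves are the ones listed above. The function only constructs the request and does not send it.

```python
import json
import urllib.request

# Media types from the list above, keyed by short names chosen for this sketch.
ACCEPT_TYPES = {
    "turtle": "text/turtle",
    "trix": "text/xml",
    "json-ld": "application/ld+json",
    "n-triples": "application/n-triples",
}


def build_request(base_url, payload, output="turtle"):
    """Create (but do not send) a POST request for the /json2rdf endpoint."""
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/json2rdf",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Selects the RDF serialization of the response
            "Accept": ACCEPT_TYPES[output],
        },
        method="POST",
    )
```

Assuming the service is running, the request could then be sent with `urllib.request.urlopen(request)` and the response body decoded as UTF-8 text.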

Web Service Usage

The following cURL command can be used to convert a JSON file of the QALD benchmark into RDF using Turtle as the output format.

curl --location --request POST "http://$HOST:$PORT/json2rdf" \
--header 'Content-Type: application/json' \
--data-raw '{
    "filePath": "https://github.com/ag-sc/QALD/raw/master/6/data/qald-6-train-multilingual-raw.json",
    "format": "qald",
    "label": "QALD 6 train multilingual raw",
    "homepage": "https://github.com/ag-sc/QALD"
}'

Adding additional formats

To add new mapping rules, add a new mapping file NAME.ttl to app/mappings, where NAME must be in all caps. The mapping language is RML. To use the file within the web service, pass its base file name as the format parameter.

Statistics

This repository also provides a script for creating statistics about the generated datasets. See scripts/statistics for more details.
