Monorepo for managing and storing the data of the integration layer of Colonial Collections.
- Update the file
organizations.ttlin thedata-registryfolder. Add the details of the data provider (e.g. name and address). The details will automatically be added to the Colonial Collections knowledge graph if you commit the changes using Git
- Update the file
dataset-measurements.ttlin thedata-registryfolder. Add the measurements for the new dataset. The measurements will automatically be added to the Colonial Collections knowledge graph if you commit the changes using Git
- Create a folder in the root folder. By convention: a lower case name, consisting of the name of the data provider and the name of the dataset. For example:
wereldmuseum-collection-archives - Inside the folder, create a
queriesfolder - Create the file
iterate.rqin thequeriesfolder. Put a SPARQL query in this file (e.g. by copying one from the existing files). The query defines how the entities in the dataset can be retrieved from a SPARQL endpoint, e.g. the Colonial Collections knowledge graph - Create the file
generate.rqin thequeriesfolder. Put a SPARQL query in this file (e.g. by copying one from the existing files). The query defines how the entities in the dataset must be transformed, e.g. to the data model of the Colonial Collections search graph - Optionally, create the file
check.rqin thequeriesfolder. Put a SPARQL query in this file (e.g. by copying one from the existing files). The query detects if the dataset has been changed in the Colonial Collections knowledge graph. If that is the case, theiterate.rqandgenerate.rqqueries will be executed. Be aware: acheckquery only makes sense if the date of last modification of the dataset can be retrieved from a SPARQL endpoint - Inside the
.github/workflowsfolder, create a YAML file (e.g. by copying one from the existing files). By convention: a lower case name, consisting of the name of the data provider, the name of the dataset and thecreate-graphsuffix, e.g.wereldmuseum-collection-archives-create-graph.yaml. Put a GitHub Action workflow definition in this file. The definition describes which steps must be taken to execute theiterate.rq,generate.rqand - optionally - thecheck.rqqueries. The results of the queries will automatically be added to the Colonial Collections search graph