Data Publishing
About this guide
This guide walks you through the process of publishing data in the Databus.
What you will learn
You will learn how to publish data in the Databus. This guide covers:
Step 1: Prepare Your Data
Option 1: GitHub
Option 2: Google Drive
Option 3: Any other storage providing static file download URIs
Step 2: Publish Your Data Using GUI
What you need
No more than 15 minutes for each technique
Your favorite text editor or IDE
Terminal (console) with Unix capabilities
Step 1: Prepare Your Data
The DBpedia Databus stores file metadata, but it does not store the data itself, so the first step is to publish your data files in some data storage. The Databus keeps the file download URIs, so the storage you use must be able to provide them.
!!!NOTE!!! The URIs must be permanent links, and the data they point to must be static (not editable), because the Databus stores checksums of the files and changing the underlying data will invalidate those checksums.
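To illustrate why the data must stay static, here is a minimal sketch of a checksum comparison. It assumes a SHA-256 digest; the exact hash function the Databus records is an assumption here, and the file contents are hypothetical stand-ins.

```python
import hashlib
import io


def sha256_of_stream(stream, chunk_size=65536):
    """Return the hex SHA-256 digest of a binary stream, read in chunks."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


# Hypothetical file contents at publishing time and after a later edit.
original = io.BytesIO(b"abc")
edited = io.BytesIO(b"abd")

checksum_at_publish_time = sha256_of_stream(original)
checksum_after_edit = sha256_of_stream(edited)

# Any change to the bytes yields a different digest, so a checksum stored
# at publishing time no longer matches the file behind the URI.
print(checksum_at_publish_time == checksum_after_edit)  # prints False
```

The same function works on a real file opened with `open(path, "rb")`, which lets you confirm that a file has not changed since you published it.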
GitHub
You may store your data in a GitHub repository. To publish it in the Databus, you only need to obtain permanent links to the files.
To get permanent links, switch to the repository version at a particular commit, and then copy the raw links to the file data.
For example, a link to our README as of July 2023 is: https://raw.githubusercontent.com/dbpedia/databus/68f976e29e2db15472f1b664a6fd5807b88d1370/README.md
!NOTE! If you use links that refer to a branch rather than a commit, the files behind them may change over time, which will break the correspondence with the file hashes stored in the Databus.
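The structure of such a link can be sketched in code. The helper below is hypothetical (it is not part of the Databus or of any GitHub tooling); it assembles a raw link from its parts and rejects anything that is not a full 40-character commit hash, so that branch names cannot slip in.

```python
import re

# A full Git commit hash is 40 hexadecimal characters.
_COMMIT_SHA = re.compile(r"[0-9a-f]{40}")


def github_raw_permalink(owner, repo, commit, path):
    """Build a raw.githubusercontent.com link pinned to a specific commit."""
    if not _COMMIT_SHA.fullmatch(commit):
        # Branch or tag names can move, so only full commit hashes are accepted.
        raise ValueError(f"not a full commit hash: {commit!r}")
    return (
        "https://raw.githubusercontent.com/"
        f"{owner}/{repo}/{commit}/{path.lstrip('/')}"
    )


# Reproduces the README link used as an example in this guide.
print(github_raw_permalink(
    "dbpedia",
    "databus",
    "68f976e29e2db15472f1b664a6fd5807b88d1370",
    "README.md",
))
```

Calling the helper with a branch name such as `master` instead of a commit hash raises a `ValueError`.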
Google Drive
Google Drive does not provide direct download links, but you can easily generate them. Please follow the detailed instructions on this website: https://sites.google.com/site/gdocs2direct/.
!NOTE! Your Google Drive files should be static. Publishing files that you still edit will result in wrong file hashes in the Databus.
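As a rough sketch of what such a conversion involves, the snippet below rewrites a share link into a download link. It assumes the commonly used patterns `https://drive.google.com/file/d/<id>/view` for share links and `https://drive.google.com/uc?export=download&id=<id>` for downloads; both patterns are assumptions that Google may change, the function name is hypothetical, and the file ID in the example is a placeholder.

```python
import re

# Assumed shape of a share link: https://drive.google.com/file/d/<id>/view
_FILE_ID = re.compile(r"/file/d/([A-Za-z0-9_-]+)")


def drive_direct_link(share_url):
    """Turn a Google Drive share link into a direct download link."""
    match = _FILE_ID.search(share_url)
    if match is None:
        raise ValueError(f"no file ID found in: {share_url!r}")
    # Assumed shape of a download link; verify it in a browser before publishing.
    return f"https://drive.google.com/uc?export=download&id={match.group(1)}"


# Placeholder file ID, for illustration only.
print(drive_direct_link(
    "https://drive.google.com/file/d/FILE_ID_123/view?usp=sharing"
))
```

Before publishing, open the generated link in a browser to check that it downloads the file directly.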
Other Storage
For any other storage, prepare download links for the files you want to publish in a similar way to that shown above. Examples of possible storages: Apache or Nginx web servers, FTP servers, IPFS, WebDAV, or any other kind of file server.
Step 2: Publish Your Data Using GUI
Publishing your data using the Databus web interface is very simple.
Log in to your account.
Hover over your account icon and go to Publish Data.
Fill in the publishing form following the hints.
Publish!
After publishing, you can go on to query the data you published using collections, or check out the Data Download Guide.
More links
You can find more in our Data Download Guide or Quickstart Examples.