Databus Gitbook
Databus
Databus
  • Overview
  • Guides
    • Data Publishing
    • Data Download
  • Use Cases
    • Data Version Control
    • Populating Database with Data
    • Data Quality Control
    • Data Crawling
    • Automated Deployment
    • Building Data Repositories
  • Organising Your Data (Model)
    • How to Organise Your Data
    • URI Design
    • Versioning
    • Metadata
      • Group
      • Artifact
      • Version
      • Distribution
      • Collection
    • Content Variants
    • Persistence (HowTo)
  • Usage
    • Quickstart Examples (Publish, Download)
    • Web Interface
      • Publish
      • Collections
      • Auto-Completion
    • API
    • Databus Mods
    • Databus Client
    • Integration with CI (Jenkins)
  • Running Your Own Databus Server
    • Run with Docker
    • Configuration
    • HTTPS & Proxy Setup
  • Development Environment
Powered by GitBook
On this page
  • About this guide
  • What will you learn
  • What you need
  • Step 1: Prepare your Data
  • Step 2: Publish Your Data Using GUI
  • More links
  1. Guides

Data Publishing

About this guide

This guide walks you through the process of publishing data in the Databus.

What will you learn

You will learn different ways publishing data in the Databus. We will cover how to:

  • Step 1: Prepare Your Data

    • Option 1: Github

    • Option 2: Google Drive

    • Option 3: Any other storage providing static file download URIs

  • Step 2: Publish Your Data Using GUI

What you need

  • Not more than 15 min of time for each technique

  • Favorite text editor or IDE

  • Terminal (console) with Unix capabilities

Step 1: Prepare your Data

DBpedia Databus is the file metadata storage, but it does not store the data itself, so the first step for using databus would be publishing you data files in some data storage. Databus keeps the file download URIs, so the storage you use must be able to provide them.

Github

You may store your data in a Github repository. To publish it in Databus you simply obtain permanent links to the files.

To get permanent links you need to switch to repository version of particular commit:

then:

and then copy raw links to file data:

!NOTE! If you use links referring not to commit, but to branch, the files there may be changing over time, which will break corrspondence with the file hashes stored in Databus

Google Drive

!NOTE! Your Google Drive files should be static. Publishing files you still edit will result in wrong files hashes in Databus.

Other Storage

Please prepare file download links of the data your want to publish from any other storage in a similar way to shown above. Examples of the prossible storages: Apache or Nginx web-servers, FTP servers, IPFS, WebDav, any kind of file server, etc.

Step 2: Publish Your Data Using GUI

Publishing your data using web-interface in Databus is very simple.

  1. Log In to your account

  2. Go to Publish Data hovering over your account icon.

  1. Fill in the publishing form following the hints.

  2. Publish!

More links

PreviousGuidesNextData Download

Last updated 1 year ago

!!!NOTE!!! The URIs must be and the data to which they point must be static (not editable) because Databus stores checksums of the files and changing underlying data will break the checksums.

For example a link to our readme as of July 2023 will be:

Google Drive does not provide direct download links, but you can easily generate them. Please follow the detailed instructions on this website: .

After publishing, you can move on with querying the data you published using or also checkout out .

You can find more in our or

permanent links
https://raw.githubusercontent.com/dbpedia/databus/68f976e29e2db15472f1b664a6fd5807b88d1370/README.md
https://sites.google.com/site/gdocs2direct/
collections
Data Download Guide
Data Download Guide
Quickstart Examples
publish-guide-1.png
publish-guide-2.png
publish-guide-3.png
publish.png
Publish