Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

This section describes the software components that power the EMO-BON data workflow.

GitHub Actions

EMO-BON uses GitHub Actions extensively for automation:

Repository Constructor

Repository: repo-constructor-action

Purpose: Automatically creates observatory crate repositories from governance data.

Triggers: Changes to logsheets.csv in governance-data repository

What it does:

Logsheet Downloader

Repository: logsheet-downloader-action

Purpose: Downloads spreadsheets from Google Sheets as CSV files.

Schedule: Every 6 months

What it does:

Data Quality Control

Repository: data-quality-control-action

Purpose: Validates and repairs logsheet data.

What it does:

RO-Crate SemBench Setup

Repository: rocrate-sembench-setup

Purpose: Prepares for semantic uplifting.

What it does:

Semantify

Repository: semantify

Purpose: Generates RDF triples from structured data.

What it does:

RO-Crate Validate

Purpose: Validates RO-Crate structure and content.

Status: In development

What it will do:

RO-Crate to Pages

Purpose: Generates GitHub Pages site from RO-Crate.

Status: In development

What it will do:

Analysis Tools

MetaGOflow

Purpose: Bioinformatics workflow for sequence analysis.

What it does:

Post-Processing Tools

Purpose: Semantic uplifting of analysis results.

Implementation: Dockerized processes

What they do:

Data Management Tools

DVC (Data Version Control)

Purpose: Version control for large data files.

Usage: Managing large files in analysis results crates

What it does:

Python Packages

Infrastructure Tools

Docker

Usage: Containerizing workflows and services

Key Images (published to ghcr.io):

Triple Store

Purpose: Store and query the EMO-BON knowledge graph.

Implementation: Dockerized stack

Features:

GitHub Pages

Purpose: Static site hosting for documentation and data views.

Usage:

Development Tools

Git

Purpose: Version control for all repositories.

Branch Strategy: Main branch for stable releases, feature branches for development

GitHub

Purpose: Repository hosting, issue tracking, project management, CI/CD.

Features Used: