Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Diagram view of first stage pipeline (from sampling instrument to shared storage) #11

Merged
merged 31 commits into from
Aug 8, 2024

Conversation

metazool
Copy link
Collaborator

@metazool metazool commented Jul 18, 2024

Addresses part, but not all of #9

  • Adds some rudimentary graphviz diagrams that outline the workflow from instrument through the SAN to cloud storage - both the "as is" situation (with data gaps, manually run stages and personal credentials) and the "could be" (more of an outline with choice of options)
  • Adds a Github Action for publishing a basic Jekyll static site which includes a step that renders SVG output for anything underneath the docs/diagrams/ directory (lifted from the pages action in the discoverability project)

Publishes to here: https://nerc-ceh.github.io/plankton_ml/

Created to help initiate a discussion in the Science Computing user group internal forum, discover how other research groups solve the same issues and what the configuration differences are across the various physical sites. The "could be" view remains sketchy until more information is available.

The intention is to add more to this - a second phase showing the catalogue-to-analysis view, and some detail views in areas that are skimmed over, like the pipeline task graph which should include a binary classifier that sifts for interesting data so we're not filling the cloud storage with obvious detritus) but in the interests of reviewing less material, more frequently, left this as a starting point.

I'll also go back and add the publishing / rendering action to the process diagram PR to make those easier to share

@metazool metazool requested a review from a team July 18, 2024 13:46
Copy link

github-actions bot commented Jul 18, 2024

☂️ Python Coverage

current status: ✅

Overall Coverage

Lines Covered Coverage Threshold Status
98 83 85% 0% 🟢

New Files

No new covered files...

Modified Files

No covered modified files...

updated for commit: 830a4fb by action🐍

@metazool metazool merged commit 4171bb1 into main Aug 8, 2024
2 checks passed
@metazool metazool deleted the diagram_view branch October 2, 2024 08:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants