Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Site-specific ETL code. #52

Closed
wants to merge 2 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
15 changes: 15 additions & 0 deletions site_etl_scripts/c3g/manifest.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
description: Mapping of MUHC Melanoma (1st batch) dataset to MOHCCN format for CanDIG
# mapping is the csv file that contains the list of fields and mapping functions
mapping: moh_muhcMelanoma.csv
# the name of the top-level identifier column in the input data
identifier: submitter_donor_id
# a link to the openapi schema
schema: https://raw.githubusercontent.com/CanDIG/katsu/develop/chord_metadata_service/mohpackets/docs/schema.yml
# class of schema for validation:
schema_class: MoHSchema
# a reference date used to calculate date intervals, formatted as a mapping entry for the mapping template
reference_date: earliest_date(donor.date_resolution, donor.date_of_birth) # NEEDS TO BE CHANGED, ONCE DATES ARE REAL.
# one or more files (dataset_functions.py) that implement the mappings
# described in mapping file
functions:
- muhc_mappings
Loading
Loading