diff --git a/crates/medmodels-core/src/medrecord/advanced_example_dataset/synthetic_data/README.md b/crates/medmodels-core/src/medrecord/advanced_example_dataset/synthetic_data/README.md index e3ea243..543d756 100644 --- a/crates/medmodels-core/src/medrecord/advanced_example_dataset/synthetic_data/README.md +++ b/crates/medmodels-core/src/medrecord/advanced_example_dataset/synthetic_data/README.md @@ -12,7 +12,7 @@ The dataset is a synthetic dataset created to simulate a medium-scale medical re The patient data consists of 600 hypothetical patients. -- **patient_id:** A unique identifier assigned each patient +- **patient_id:** Unique identifier assigned each patient - **gender:** Recorded as Male, Female, or Non-Binary - **age:** Patient's age in years @@ -49,25 +49,25 @@ The drug data has 1 possible event There are 5741 diagnosis instances for the 600 patients. -- **patient_id:** Linked to the Patient Demographics +- **patient_id:** Unique identifier assigned each patient - **diagnosis_code:** Unique code identifiying a specific diagnosis - **time:** Date of the diagnosis - **duration_days** Duration in days in which the diagnosis held true ### 2. `patient_procedure.csv` -For all five patients there are 677 procedures in the dataset. +For all 600 patients there are 677 procedures in the dataset. -- **patient_id:** Linked to the Patient Demographics +- **patient_id:** Unique identifier assigned each patient - **procedure_code:** Unique identifier for each procedure - **time:** Exact time and date for the procedure - **duration_minutes:** How long the procedure needed in minutes ### 3. `patient_drug.csv` -There are also 20373 prescribed medications for the five patients combined. +There are also 20373 prescribed medications for the 600 patients combined. -- **patient_id:** Linked to the Patient Demographics +- **patient_id:** Unique identifier assigned each patient - **drug_code:** Unique identifier for the medication prescribed - **time:** Time and date when the medication was prescribed - **quantity:** how many doses were prescribed @@ -75,8 +75,8 @@ There are also 20373 prescribed medications for the five patients combined. ### 4. `patient_event.csv` -There are also 20373 prescribed medications for the five patients combined. +There are also 20373 prescribed medications for the 600 patients combined. -- **patient_id:** Linked to the Patient Demographics +- **patient_id:** Unique identifier assigned each patient - **event:** Name of the event linked to the patient - **time:** Time and date when the event occurred diff --git a/crates/medmodels-core/src/medrecord/example_dataset/synthetic_data/README.md b/crates/medmodels-core/src/medrecord/example_dataset/synthetic_data/README.md index ad872b4..ded0a08 100644 --- a/crates/medmodels-core/src/medrecord/example_dataset/synthetic_data/README.md +++ b/crates/medmodels-core/src/medrecord/example_dataset/synthetic_data/README.md @@ -43,7 +43,7 @@ The drug data has 19 different prescribed medications. There are 60 diagnosis instances for the five patients. -- **patient_id:** Linked to the Patient Demographics +- **patient_id:** Unique identifier assigned each patient - **diagnosis_code:** Unique code identifiying a specific diagnosis - **time:** Date of the diagnosis - **duration_days** Duration in days in which the diagnosis held true @@ -52,7 +52,7 @@ There are 60 diagnosis instances for the five patients. For all five patients there are 50 procedures in the dataset. -- **patient_id:** Linked to the Patient Demographics +- **patient_id:** Unique identifier assigned each patient - **procedure_code:** Unique identifier for each procedure - **time:** Exact time and date for the procedure - **duration_minutes:** How long the procedure needed in minutes @@ -61,7 +61,7 @@ For all five patients there are 50 procedures in the dataset. There are also 50 prescribed medications for the five patients combined. -- **patient_id:** Linked to the Patient Demographics +- **patient_id:** Unique identifier assigned each patient - **drug_code:** Unique identifier for the medication prescribed - **time:** Time and date when the medication was prescribed - **quantity:** how many doses were prescribed diff --git a/medmodels/medrecord/medrecord.py b/medmodels/medrecord/medrecord.py index 6e432a0..0be47ce 100644 --- a/medmodels/medrecord/medrecord.py +++ b/medmodels/medrecord/medrecord.py @@ -268,7 +268,8 @@ def from_advanced_example_dataset(cls) -> MedRecord: """Creates a MedRecord instance from an example dataset with a higher number of nodes and edges. The example dataset was created using the Synthea™ Patient Generator: - https://github.com/synthetichealth/synthea + https://github.com/synthetichealth/synthea and it includes 600 patients with + their diagnoses, prescriptions, procedures, and death events. Returns: MedRecord: A new instance created from the advanced example dataset.