Step 3: Prepare ML tables
Feb 12 – Feb 26 | Prepare ML tables
Feb 12 – Feb 26 | Prepare ML tables
The current Step 3 - Prepare ML tables step is divided into five parts, and involves preparing Machine Learning tables using the extracted features from of the Testing Phase as follows:
Reduce Extracted Features: Use the to reduce the large CSV files obtained from the previous step via Principal Component Analysis (PCA) and Spearman correlation.
Merge All Data: Combine the reduced extracted features with demographic embeddings into a master CSV table using the . Additionally, create MEDprofiles with the master table.
Create Learning and Holdout Sets: Use the to generate Learning and Holdout sets.
The goal of defining static time points is to simulate a longitudinal CDSS (Clinical Decision Support System) scenario using data aggregated over time. In of the Testing Phase, we will attempt to identify the point in time where we reach sufficient predictive power (the point in time when, in real-life, we could potentially intervene).
Before proceeding with Step 3 - Prepare ML tables of the MEDomicsLab Testing Phase, we recommend consulting the documentation of the .
Intro
Reduce extracted features
Merge all our data
Visualize MEDprofiles
Define static time points
Create learning and holdout sets