Family-Based Analysis with Pedigree Information

Workbench allows you to associate pedigree information with your samples, enabling trio or family-based analysis workflows. This guide walks you through the process of adding pedigree data and running workflows with family relationships.

Adding Pedigree Information

Upload Pedigree Files

Navigate to the Data page in the left sidebar.
Select the connected storage account containing your sequencing data.
Click the Add Pedigrees button in the top-right corner.
Select one or more .ped files (standard pedigree file format).
Click Upload to associate the pedigree information with your samples.

Workbench uses a relaxed version of the standard PED format. The traditional format includes family ID, individual ID, paternal ID, maternal ID, sex, and phenotype; Workbench lets you omit sex and phenotype if these details are not available. The .ped file must not contain a header row, otherwise the upload fails with an error.
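
As an illustration of the relaxed format described above, the following Python sketch parses headerless PED lines with four to six columns. It is not Workbench code: the names PedRecord and parse_ped_line are hypothetical, the sample IDs are made up, and it assumes whitespace-separated columns (as in the standard PED format) and that phenotype may be omitted on its own while sex is kept.

```python
from typing import NamedTuple, Optional


class PedRecord(NamedTuple):
    family_id: str
    individual_id: str
    paternal_id: str
    maternal_id: str
    sex: Optional[str]        # None when the column is omitted
    phenotype: Optional[str]  # None when the column is omitted


def parse_ped_line(line: str) -> PedRecord:
    """Parse one headerless PED line with 4 to 6 whitespace-separated columns."""
    fields = line.split()
    if not 4 <= len(fields) <= 6:
        raise ValueError(f"expected 4 to 6 columns, got {len(fields)}: {line!r}")
    # Pad the optional sex and phenotype columns with None.
    padded = fields + [None] * (6 - len(fields))
    return PedRecord(*padded)


# Hypothetical file content; note there is no header row.
example = """\
FAM1 child1 father1 mother1 1 2
FAM1 father1 0 0 1 1
FAM1 mother1 0 0
"""
records = [parse_ped_line(line) for line in example.splitlines() if line.strip()]
```

Checking a file this way before upload can catch a stray header row or a truncated line early, since either would typically produce an unexpected column count or an unknown parent ID.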

Pedigree Management

Cross-Storage Relationships: Family relationships are not storage-scoped. If you have the same samples in multiple storage accounts, the family relationships will be recognized across all accounts.
Updating Pedigrees: To update existing pedigree information, upload a new PED file with the updated relationships. Workbench will update the relationships accordingly.

Viewing Family Information

Once pedigree information is uploaded, you can:

View family groupings on the Samples page.
Filter samples by Family ID to see all samples within a particular family.

Running Workflows with Family Data

Running Family-Based Analysis

Navigate to the Data page and select your storage account.
From the Samples page, you can:
- Filter by a specific Family ID and select all family members.
- Manually select individual samples from the table.
Click Run Workflow after making your selection.

Workflow Submission Behavior

Family Grouping: When you select multiple samples belonging to the same family, Workbench will group them together in the workflow submission.
Automatic Trio Detection: If you select a proband with parents, Workbench will automatically identify and run them as a trio.
Mixed Selections: If you select samples from different families, each family will be processed as a group, and any remaining samples will be processed as singletons.
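
The grouping behavior above can be sketched in Python as follows. This is only a model of the documented behavior, not the actual Workbench implementation: plan_runs and the pedigree dictionary layout are hypothetical, and the sketch assumes that a selected sample with no other selected family members counts as one of the "remaining" singletons.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical pedigree lookup: sample ID -> (family ID, paternal ID, maternal ID)
Pedigree = Dict[str, Tuple[str, str, str]]


def plan_runs(selected: List[str], pedigree: Pedigree) -> List[dict]:
    """Group selected samples into trio, family, and singleton runs."""
    by_family = defaultdict(list)
    singletons = []
    for sample_id in selected:
        entry = pedigree.get(sample_id)
        if entry is None:
            singletons.append(sample_id)  # no pedigree information
        else:
            by_family[entry[0]].append(sample_id)

    runs = []
    for family_id, members in by_family.items():
        if len(members) == 1:
            # Assumption: a lone family member is processed as a singleton.
            singletons.extend(members)
            continue
        member_set = set(members)
        # A trio is a proband whose father and mother are both selected.
        has_trio = any(
            pedigree[m][1] in member_set and pedigree[m][2] in member_set
            for m in members
        )
        kind = "trio" if has_trio and len(members) == 3 else "family"
        runs.append({"kind": kind, "family_id": family_id, "samples": sorted(members)})

    for sample_id in singletons:
        runs.append({"kind": "singleton", "family_id": None, "samples": [sample_id]})
    return runs


# Hypothetical selection spanning two families and one unrelated sample.
pedigree = {
    "child1": ("FAM1", "father1", "mother1"),
    "father1": ("FAM1", "0", "0"),
    "mother1": ("FAM1", "0", "0"),
    "sibA": ("FAM2", "0", "0"),
    "sibB": ("FAM2", "0", "0"),
}
runs = plan_runs(["child1", "father1", "mother1", "sibA", "sibB", "loner"], pedigree)
```

The Preview Runs dialog remains the authoritative view of how a given selection will actually be grouped.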

Previewing Workflow Runs

After selecting samples and clicking Run Workflow, choose the desired workflow and version.
Click the eye icon to open the Preview Runs dialog.
Review how the samples will be grouped and processed before submitting.
Click Submit to start the analysis.

Supported Metadata Fields

The following metadata fields are supported for individual samples:

FAMILY_ID
- Cannot be null or empty.
- If individuals are designated as parent and child, they must have the same family ID.
INDIVIDUAL_ID
- String. ID of the individual.
- Cannot be null, empty, or 0.
PATERNAL_ID
- String. ID of the individual's father.
- Cannot be null or empty, but can be 0 if the father is not in the cohort.
- A non-zero ID must match an individual with male sex and the same family ID as the child.
MOTHER_ID
- String. ID of the individual's mother.
- Cannot be null or empty, but can be 0 if the mother is not in the cohort.
- A non-zero ID must match an individual with female sex and the same family ID as the child.
SEX (optional)
- Can be 0 = UNKNOWN, 1 = MALE, 2 = FEMALE, 3 = OTHER.
- If the individual is a father, sex must be 1 (male). If the individual is a mother, sex must be 2 (female).
PHENOTYPE/Affected status (optional)
- Values are currently converted to strings as follows: 1 = UNAFFECTED, 2 = AFFECTED, any other value = MISSING.
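
The field rules above can be checked before upload with a short Python sketch like the one below. It is a hedged illustration rather than Workbench's own validator: validate_pedigree, phenotype_label, and the lowercase dictionary keys are hypothetical names, and the sketch assumes that a parent whose sex column is omitted is tolerated, which the rules above do not state explicitly.

```python
from typing import Dict, List, Optional

Record = Dict[str, Optional[str]]

PHENOTYPE_LABELS = {"1": "UNAFFECTED", "2": "AFFECTED"}


def phenotype_label(code: Optional[str]) -> str:
    """Map a raw phenotype code to its label; anything else is MISSING."""
    return PHENOTYPE_LABELS.get(code, "MISSING")


def validate_pedigree(records: List[Record]) -> List[str]:
    """Return human-readable rule violations (empty list if none)."""
    errors: List[str] = []
    by_id = {r["individual_id"]: r for r in records if r.get("individual_id")}
    for r in records:
        iid = r.get("individual_id")
        if not r.get("family_id"):
            errors.append(f"{iid}: FAMILY_ID is null or empty")
        if not iid or iid == "0":
            errors.append(f"{iid}: INDIVIDUAL_ID is null, empty, or 0")
        # Fathers must be male (1), mothers must be female (2).
        for key, required_sex in (("paternal_id", "1"), ("maternal_id", "2")):
            parent_id = r.get(key)
            if not parent_id:
                errors.append(f"{iid}: {key} is null or empty")
                continue
            if parent_id == "0":
                continue  # parent is not in the cohort
            parent = by_id.get(parent_id)
            if parent is None:
                errors.append(f"{iid}: {key} {parent_id} matches no individual")
                continue
            if parent.get("family_id") != r.get("family_id"):
                errors.append(f"{iid}: {key} {parent_id} has a different family ID")
            # Assumption: an omitted sex on a parent is tolerated.
            parent_sex = parent.get("sex")
            if parent_sex is not None and parent_sex != required_sex:
                errors.append(f"{iid}: {key} {parent_id} must have sex {required_sex}")
    return errors


# Hypothetical trio used to exercise the checks.
family = [
    {"family_id": "FAM1", "individual_id": "child1",
     "paternal_id": "father1", "maternal_id": "mother1", "sex": "1", "phenotype": "2"},
    {"family_id": "FAM1", "individual_id": "father1",
     "paternal_id": "0", "maternal_id": "0", "sex": "1"},
    {"family_id": "FAM1", "individual_id": "mother1",
     "paternal_id": "0", "maternal_id": "0", "sex": "2"},
]
errors = validate_pedigree(family)
```

Returning a list of messages instead of raising on the first problem makes it easier to fix every issue in a PED file in one pass.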
