© DNAstack. All rights reserved.

Instruments


Last updated 2 months ago

DNAstack simplifies genomic data analysis by eliminating the need for expensive local infrastructure or specialized technical expertise. With Instruments, you can connect your raw sequencing data to Workbench, choose your Engine, and launch a workflow.

Overview

Instruments bridges the gap between primary sequencing data and advanced analysis workflows, simplifying the process of connecting sequencing data to your Workbench workflows.

Working with Instruments

DNAstack’s Instruments tool lets you connect data from physical sequencing platforms directly to Workbench. Instruments represent physical sequencing platforms, such as PacBio, that upload sequencing run outputs to a cloud storage location.

The Analysis Pipeline

Primary Analysis: Converts raw sequencing data from instruments into readable outputs.

Secondary Analysis: Processes sequencing data by applying quality control, alignment, and variant calling.

Tertiary Analysis: Interprets secondary processed data to uncover insights, validate findings, and drive scientific discovery.

Terminology

  • Storage: Operations for interacting with cloud storage from Workbench.

  • Storage Account: The entity that is created when a user saves credentials using the storage command.

  • Platform: A specific sequencing platform (e.g. PacBio, Illumina, Oxford Nanopore) that has uploaded the outputs of a sequencing run to a cloud storage location. Each platform has its own file and directory naming conventions for output data and the supporting files it generates. Renaming files or changing the directory structure that the sequencing platform generated leads to undefined behavior.

  • Instrument: An entity corresponding to a physical sequencing instrument.

  • Sample: A biological sample (e.g. hg013.m89015_220827.bam). Samples are unique within a namespace.

  • File: A file (e.g. FASTQ, BAM, VCF) belonging to a biological sample. Each file is associated with a sequencing platform and the specific instrument that generated it.
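
The relationships among these terms can be sketched as a small data model. This is only an illustrative sketch, not DNAstack's actual schema or API: every class name, field name, and the example storage path below are hypothetical. It assumes that a file reaches its platform through the instrument that generated it, and it models the rule that samples are unique within a namespace.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass(frozen=True)
class Platform:
    """A sequencing platform type, e.g. PacBio (hypothetical model)."""
    name: str


@dataclass(frozen=True)
class Instrument:
    """A physical sequencing instrument; assumed to belong to one platform."""
    instrument_id: str
    platform: Platform


@dataclass(frozen=True)
class File:
    """A file belonging to a sample, tied to the instrument that generated it."""
    path: str
    file_type: str  # e.g. "FASTQ", "BAM", "VCF"
    instrument: Instrument

    @property
    def platform(self) -> Platform:
        # The platform is derived from the instrument (an assumption of this sketch).
        return self.instrument.platform


@dataclass
class Sample:
    """A biological sample and the files that belong to it."""
    sample_id: str
    files: list[File] = field(default_factory=list)


@dataclass
class Namespace:
    """Holds samples and enforces that sample IDs are unique within it."""
    name: str
    samples: dict[str, Sample] = field(default_factory=dict)

    def add_sample(self, sample: Sample) -> None:
        if sample.sample_id in self.samples:
            raise ValueError(
                f"sample {sample.sample_id!r} already exists in namespace {self.name!r}"
            )
        self.samples[sample.sample_id] = sample


# Example usage with made-up identifiers and a made-up storage path.
pacbio = Platform("PacBio")
sequencer = Instrument("instrument-1", pacbio)
bam = File("s3://example-bucket/run-1/sample-1.bam", "BAM", sequencer)

namespace = Namespace("example-namespace")
namespace.add_sample(Sample("sample-1", [bam]))
```

In this sketch, adding a second sample with the ID `sample-1` to the same namespace raises a `ValueError`, while the same ID could still be used in a different namespace.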
