Workflow: BD Rhapsody™ WTA Analysis Pipeline
The BD Rhapsody™ WTA Analysis Pipeline is used to create sequencing libraries from single cell transcriptomes without having to specify a targeted panel. After sequencing, the analysis pipeline takes the FASTQ files, a reference genome file and a transcriptome annotation file for gene alignment. The pipeline generates molecular counts per cell, read counts per cell, metrics, and an alignment file.
- Selected
- |
- Default Values
- Nested Workflows
- Tools
- Inputs/Outputs
Inputs
ID | Type | Title | Doc |
---|---|---|---|
Reads | File[] | Reads | |
Seq_Run | String (Optional) | ||
Run_Name | String (Optional) | Run Name |
This is a name for output files, for example Experiment1_Metrics_Summary.csv. Default if left empty is to name run based on a library. Any non-alpha numeric characters will be changed to a hyphen. |
Use_DBEC | Boolean (Optional) | ||
AbSeq_UMI | Integer (Optional) | ||
Subsample | Float (Optional) | Subsample Reads |
Any number of reads >1 or a fraction between 0 < n < 1 to indicate the percentage of reads to subsample. |
Tag_Names | String[] (Optional) | Sample Tag Names |
Specify the Sample Tag number followed by - (hyphen) and a sample name to appear in the output files. For example: 4-Ramos. Should be alpha numeric, with + - and _ allowed. Any special characters: &, (), [], {}, <>, ?, | will be corrected to underscores. |
Extra_Seqs | File (Optional) | ||
Barcode_Num | Integer (Optional) | ||
VDJ_Version | https://w3id.org/cwl/view/git/c8d75785fd6accf3b47d1f9ee056675611e22e61/wf.cwl#main/VDJ_Version/VDJ_Version (Optional) | VDJ Species Version |
The VDJ species and chain types. This option should only be set for VDJ experiment. |
Generate_Bam | Boolean (Optional) | Generate Bam Output |
By default, a Bam file that is comprised of reads from all the input libraries is generated. However, this can consume a lot of compute and disk resources. By setting this field to false, the Bam is no longer created. |
MinChunkSize | Integer (Optional) | ||
Subsample_Tags | Float (Optional) | Subsample Sample Tags |
Any number of reads > 1 or a fraction between 0 < n < 1 to indicate the percentage of tag reads to subsample. |
Subsample_seed | Integer (Optional) | Subsample Seed |
For use when replicating a previous subsampling run only. Obtain the seed generated from the log file for the SplitFastQ node. |
AbSeq_Reference | File[] (Optional) | AbSeq Reference | |
Basic_Algo_Only | Boolean (Optional) | Disable Refined Putative Cell Calling |
Determine putative cells using only the basic algorithm (minimum second derivative along the cumulative reads curve). The refined algorithm attempts to remove false positives and recover false negatives, but may not be ideal for certain complex mixtures of cell types. Does not apply if Exact Cell Count is set. |
Read_Filter_Off | Boolean (Optional) | ||
Target_analysis | Boolean (Optional) | ||
Exact_Cell_Count | Integer (Optional) | Exact Cell Count |
Set a specific number (>=1) of cells as putative, based on those with the highest error-corrected read count |
Reference_Genome | File | Reference Genome | |
VDJ_JGene_Evalue | Float (Optional) | e-value threshold for J gene |
The e-value threshold for J gene call by IgBlast/PyIR, default is set as 0.001 |
VDJ_VGene_Evalue | Float (Optional) | e-value threshold for V gene |
The e-value threshold for V gene call by IgBlast/PyIR, default is set as 0.001 |
NumRecordsPerSplit | Long (Optional) | ||
Putative_Cell_Call | https://w3id.org/cwl/view/git/c8d75785fd6accf3b47d1f9ee056675611e22e61/wf.cwl#main/Putative_Cell_Call/Putative_Cell_Call (Optional) | Putative Cell Calling |
Specify the data to be used for putative cell calling. mRNA is the default selected option. |
Sample_Tags_Version | https://w3id.org/cwl/view/git/c8d75785fd6accf3b47d1f9ee056675611e22e61/wf.cwl#main/Sample_Tags_Version/Sample_Tags_Version (Optional) | Sample Tags Version |
The sample multiplexing kit version. This option should only be set for a multiplexed experiment. |
Exclude_Intronic_Reads | Boolean (Optional) | Exclude Intronic Reads |
By default, reads aligned to exons and introns are considered and represented in molecule counts. Including intronic reads may increase sensitivity, resulting in an increase in molecule counts and the number of genes per cell for both cellular and nuclei samples. Intronic reads may indicate unspliced mRNAs and are also useful, for example, in the study of nuclei and RNA velocity. When set to true, intronic reads will be excluded. |
Supplemental_Reference | File[] (Optional) | Supplemental Reference | |
Transcriptome_Annotation | File | Transcriptome Annotation |
Steps
ID | Runs | Label | Doc |
---|---|---|---|
AlignR2 |
wf.cwl#AlignR2.cwl
(CommandLineTool)
|
||
Metrics |
wf.cwl#Metrics.cwl
(CommandLineTool)
|
||
Version |
wf.cwl#Version.cwl
(CommandLineTool)
|
||
AddtoBam |
wf.cwl#AddtoBam.cwl
(CommandLineTool)
|
||
IndexBAM |
wf.cwl#IndexBAM.cwl
(CommandLineTool)
|
||
MergeBAM |
wf.cwl#MergeBAM.cwl
(CommandLineTool)
|
||
AnnotateR1 |
wf.cwl#AnnotateR1.cwl
(CommandLineTool)
|
||
AnnotateR2 |
wf.cwl#AnnotateR2.cwl
(CommandLineTool)
|
||
BundleLogs |
wf.cwl#BundleLogs.cwl
(ExpressionTool)
|
||
Start_Time |
a615493dba82c34c61e316741baca0d2
(ExpressionTool)
|
||
CheckFastqs |
wf.cwl#CheckFastqs.cwl
(CommandLineTool)
|
CheckFastqs does several quality control routines including: (1) ensuring that read pair file names are formatted correctly and contain a read pair mate; (2) disambiguating the \"Subsample Reads\" input and; (3) if not provided, generating a subsampling seed that the downstream instances can use. |
|
Bam_Settings |
wf.cwl#BamSettings.cwl
(ExpressionTool)
|
||
GetDataTable |
wf.cwl#GetDataTable.cwl
(CommandLineTool)
|
||
VDJ_Settings |
wf.cwl#VDJ_Settings.cwl
(ExpressionTool)
|
||
AnnotateReads |
wf.cwl#AnnotateReads.cwl
(CommandLineTool)
|
||
Name_Settings |
wf.cwl#NameSettings.cwl
(ExpressionTool)
|
||
PairReadFiles |
wf.cwl#PairReadFiles.cwl
(ExpressionTool)
|
PairReadFiles takes an array of split files and pairs them, such that an R1 file is transferred to the QualityFilter with its corresponding R2 file. The original FASTQ files are paired in CheckFastqs and then split and sub-sampled in SplitAndSubsample. The pairing information is taken from CheckFastqs. |
|
CheckReference |
wf.cwl#CheckReference.cwl
(CommandLineTool)
|
||
MergeMultiplex |
48704625cc45f4bceb403f3fc8656b6f
(ExpressionTool)
|
||
AnnotateMolecules |
wf.cwl#AnnotateMolecules.cwl
(CommandLineTool)
|
||
Internal_Settings |
wf.cwl#InternalSettings.cwl
(ExpressionTool)
|
||
Metadata_Settings |
wf.cwl#Metadata.cwl
(CommandLineTool)
|
||
SplitAndSubsample |
wf.cwl#SplitAndSubsample.cwl
(Workflow)
|
SplitAndSubsample splits, subsamples and formats read files to be deposited in QualityFilter. |
|
QualityFilterOuter |
wf.cwl#QualityFilterOuter.cwl
(Workflow)
|
||
Subsample_Settings |
wf.cwl#SubsampleSettings.cwl
(ExpressionTool)
|
||
VDJ_Compile_Results |
wf.cwl#VDJ_Compile_Results.cwl
(CommandLineTool)
|
||
Dense_to_Sparse_File |
wf.cwl#DensetoSparseFile.cwl
(CommandLineTool)
|
||
VDJ_Analyze_Reads_IG |
wf.cwl#VDJ_Analyze_Reads_IG.cwl
(Workflow)
|
||
Multiplexing_Settings |
wf.cwl#MultiplexingSettings.cwl
(ExpressionTool)
|
||
Uncompress_Datatables |
wf.cwl#UncompressDatatables.cwl
(Workflow)
|
||
VDJ_Analyze_Reads_TCR |
wf.cwl#VDJ_Analyze_Reads_TCR.cwl
(Workflow)
|
||
Intronic_Reads_Settings |
wf.cwl#IntronicReadsSettings.cwl
(ExpressionTool)
|
||
Dense_to_Sparse_Datatable |
wf.cwl#DensetoSparse.cwl
(CommandLineTool)
|
||
Putative_Cell_Calling_Settings |
wf.cwl#PutativeCellSettings.cwl
(ExpressionTool)
|
||
Dense_to_Sparse_Datatable_Unfiltered |
wf.cwl#DensetoSparse.cwl
(CommandLineTool)
|
Outputs
ID | Type | Label | Doc |
---|---|---|---|
Bam | File (Optional) | BAM File | |
Logs | Directory | Pipeline Logs | |
Bam_Index | File (Optional) | BAM Index | |
Multiplex | File[] (Optional) | ||
Data_Tables | File[] (Optional) | Data Tables | |
vdjMetricsCsv | File (Optional) | vdjMetricsCsv | |
Expression_Data | File (Optional) | Expression Matrix | |
Metrics_Summary | File | Metrics Summary | |
Bioproduct_Stats | File (Optional) | Bioproduct Statistics | |
vdjCellsDatatable | File (Optional) | vdjCellsDatatable | |
Dim_Reduction_Coord | File (Optional) | Dimensionality Reduction Coordinates | |
Visual_Metrics_html | File (Optional) | Pipeline Report HTML | |
Putative_Cells_Origin | File (Optional) | Putative Cells Origin | |
Data_Tables_Unfiltered | File[] (Optional) | Unfiltered Data Tables | |
vdjDominantContigsAIRR | File (Optional) | vdjDominantContigsAIRR | |
vdjUnfilteredContigsAIRR | File (Optional) | vdjUnfilteredContigsAIRR | |
Expression_Data_Unfiltered | File (Optional) | Unfiltered Expression Matrix | |
vdjCellsDatatableUncorrected | File (Optional) | vdjCellsDatatableUncorrected | |
Protein_Aggregates_Experimental | File (Optional) | Protein Aggregates (Experimental) | |
Immune_Cell_Classification(Experimental) | File (Optional) | Immune Cell Classification (Experimental) |
https://w3id.org/cwl/view/git/c8d75785fd6accf3b47d1f9ee056675611e22e61/wf.cwl?part=main