Explore Workflows
View already parsed workflows here or click here to add your own
Graph | Name | Retrieved From | View |
---|---|---|---|
|
tt_univec_wnode.cwl
|
![]() Path: task_types/tt_univec_wnode.cwl Branch/Commit ID: 708e141d99f6e5f30d9402d9f890562606a0d97e |
|
|
Apply filters to VCF file
|
![]() Path: definitions/subworkflows/filter_vcf.cwl Branch/Commit ID: 9c0b1497c467393e1a54735575043dced73e95c4 |
|
|
STAR-RNA-Seq alignment and transcript/gene abundance workflow
|
![]() Path: definitions/pipelines/rnaseq_star_fusion.cwl Branch/Commit ID: 87faba2fff8007ecc95160729b1c7cd0376e46f2 |
|
|
step_valuefrom5_wf_with_id_v1_0.cwl
|
![]() Path: testdata/step_valuefrom5_wf_with_id_v1_0.cwl Branch/Commit ID: e78db9870cb744fe36674f43b3223c688e9989e1 |
|
|
step-valuefrom2-wf_v1_1.cwl
|
![]() Path: testdata/step-valuefrom2-wf_v1_1.cwl Branch/Commit ID: c1875d54dedc41b1d2fa08634dcf1caa8f1bc631 |
|
|
VIRTUS.PE.singlevirus.cwl
|
![]() Path: workflow/VIRTUS.PE.singlevirus.cwl Branch/Commit ID: 43982758be93a31a0c079f448b377cae9fb9f3c7 |
|
|
Variant calling germline paired-end
A workflow for the Broad Institute's best practices gatk4 germline variant calling pipeline. ## __Outputs__ #### Primary Output files: - bqsr2_indels.vcf, filtered and recalibrated indels (IGV browser) - bqsr2_snps.vcf, filtered and recalibrated snps (IGV browser) - bqsr2_snps.ann.vcf, filtered and recalibrated snps with effect annotations #### Secondary Output files: - sorted_dedup_reads.bam, sorted deduplicated alignments (IGV browser) - raw_indels.vcf, first pass indel calls - raw_snps.vcf, first pass snp calls #### Reports: - overview.md (input list, alignment metrics, variant counts) - insert_size_histogram.pdf - recalibration_plots.pdf - snpEff_summary.html ## __Inputs__ #### General Info - Sample short name/Alias: unique name for sample - Experimental condition: condition, variable, etc name (e.g. \"control\" or \"20C 60min\") - Cells: name of cells used for the sample - Catalog No.: vender catalog number if available - BWA index: BWA index sample that contains reference genome FASTA with associated indices. - SNPEFF database: Name of SNPEFF database to use for SNP effect annotation. - Read 1 file: First FASTQ file (generally contains \"R1\" in the filename) - Read 2 file: Paired FASTQ file (generally contains \"R2\" in the filename) #### Advanced - Ploidy: number of copies per chromosome (default should be 2) - SNP filters: see Step 6 Notes: https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/ - Indel filters: see Step 7 Notes: https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/ #### SNPEFF notes: Get snpeff databases using `docker run --rm -ti gatk4-dev /bin/bash` then running `java -jar $SNPEFF_JAR databases`. Then, use the first column as SNPEFF input (e.g. \"hg38\"). - hg38, Homo_sapiens (USCS), http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_hg38.zip - mm10, Mus_musculus, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_mm10.zip - dm6.03, Drosophila_melanogaster, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_dm6.03.zip - Rnor_6.0.86, Rattus_norvegicus, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_Rnor_6.0.86.zip - R64-1-1.86, Saccharomyces_cerevisiae, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_R64-1-1.86.zip ### __Data Analysis Steps__ 1. Trimming the adapters with TrimGalore. - This step is particularly important when the reads are long and the fragments are short - resulting in sequencing adapters at the ends of reads. If adapter is not removed the read will not map. TrimGalore can recognize standard adapters, such as Illumina or Nextera/Tn5 adapters. 2. Generate quality control statistics of trimmed, unmapped sequence data 3. Run germline variant calling pipeline, custom wrapper script implementing Steps 1 - 17 of the Broad Institute's best practices gatk4 germline variant calling pipeline (https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/) ### __References__ 1. https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/ 2. https://gatk.broadinstitute.org/hc/en-us/articles/360035535932-Germline-short-variant-discovery-SNPs-Indels- 3. https://software.broadinstitute.org/software/igv/VCF |
![]() Path: workflows/vc-germline-pe.cwl Branch/Commit ID: 93b844a80f4008cc973ea9b5efedaff32a343895 |
|
|
step_valuefrom5_wf_with_id_v1_1.cwl
|
![]() Path: testdata/step_valuefrom5_wf_with_id_v1_1.cwl Branch/Commit ID: c1875d54dedc41b1d2fa08634dcf1caa8f1bc631 |
|
|
scatterfail.cwl
|
![]() Path: tests/wf/scatterfail.cwl Branch/Commit ID: 8ef515037de411abd2f84b569ad4d4a4f7a2c7a0 |
|
|
abundance
abundace profiles from annotated files, for protein and/or rna |
![]() Path: CWL/Workflows/abundance-clca.workflow.cwl Branch/Commit ID: 1844de830f6935901849ccd9966685fbf13e8042 |