Explore Workflows
View already parsed workflows here or click here to add your own
| Graph | Name | Retrieved From | View |
|---|---|---|---|
|
|
gcaccess_from_list
|
Path: task_types/tt_gcaccess_from_list.cwl Branch/Commit ID: be9d12a3f8e1924183a1dc6a0bda6ada4195ca71 |
|
|
|
protein_extract
|
Path: progs/protein_extract.cwl Branch/Commit ID: be9d12a3f8e1924183a1dc6a0bda6ada4195ca71 |
|
|
|
Filter Protein Seeds; Find ProSplign Alignments
|
Path: protein_alignment/wf_compart_filter_prosplign.cwl Branch/Commit ID: be9d12a3f8e1924183a1dc6a0bda6ada4195ca71 |
|
|
|
RNA-seq alelle specific pipeline for paired-end data
Allele specific RNA-Seq paired-end workflow |
Path: workflows/allele-rnaseq-pe.cwl Branch/Commit ID: dda9e6e06a656b7b3fa7504156474b962fe3953c |
|
|
|
analysis for assembled sequences
rna / protein - qc, annotation, index, abundance |
Path: CWL/Workflows/assembled.workflow.cwl Branch/Commit ID: 6a8727124baf77416ca797982fd4e0689c2a593a |
|
|
|
format_rrnas_from_seq_entry
|
Path: task_types/tt_format_rrnas_from_seq_entry.cwl Branch/Commit ID: 093b60e546237c06cfe7820d6ac8d66467e66725 |
|
|
|
Variant calling germline paired-end
A workflow for the Broad Institute's best practices gatk4 germline variant calling pipeline. ## __Outputs__ #### Primary Output files: - bqsr2_indels.vcf, filtered and recalibrated indels (IGV browser) - bqsr2_snps.vcf, filtered and recalibrated snps (IGV browser) - bqsr2_snps.ann.vcf, filtered and recalibrated snps with effect annotations #### Secondary Output files: - sorted_dedup_reads.bam, sorted deduplicated alignments (IGV browser) - raw_indels.vcf, first pass indel calls - raw_snps.vcf, first pass snp calls #### Reports: - overview.md (input list, alignment metrics, variant counts) - insert_size_histogram.pdf - recalibration_plots.pdf - snpEff_summary.html ## __Inputs__ #### General Info - Sample short name/Alias: unique name for sample - Experimental condition: condition, variable, etc name (e.g. \"control\" or \"20C 60min\") - Cells: name of cells used for the sample - Catalog No.: vender catalog number if available - BWA index: BWA index sample that contains reference genome FASTA with associated indices. - SNPEFF database: Name of SNPEFF database to use for SNP effect annotation. - Read 1 file: First FASTQ file (generally contains \"R1\" in the filename) - Read 2 file: Paired FASTQ file (generally contains \"R2\" in the filename) #### Advanced - Ploidy: number of copies per chromosome (default should be 2) - SNP filters: see Step 6 Notes: https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/ - Indel filters: see Step 7 Notes: https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/ #### SNPEFF notes: Get snpeff databases using `docker run --rm -ti gatk4-dev /bin/bash` then running `java -jar $SNPEFF_JAR databases`. Then, use the first column as SNPEFF input (e.g. \"hg38\"). - hg38, Homo_sapiens (USCS), http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_hg38.zip - mm10, Mus_musculus, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_mm10.zip - dm6.03, Drosophila_melanogaster, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_dm6.03.zip - Rnor_6.0.86, Rattus_norvegicus, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_Rnor_6.0.86.zip - R64-1-1.86, Saccharomyces_cerevisiae, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_R64-1-1.86.zip ### __Data Analysis Steps__ 1. Trimming the adapters with TrimGalore. - This step is particularly important when the reads are long and the fragments are short - resulting in sequencing adapters at the ends of reads. If adapter is not removed the read will not map. TrimGalore can recognize standard adapters, such as Illumina or Nextera/Tn5 adapters. 2. Generate quality control statistics of trimmed, unmapped sequence data 3. Run germline variant calling pipeline, custom wrapper script implementing Steps 1 - 17 of the Broad Institute's best practices gatk4 germline variant calling pipeline (https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/) ### __References__ 1. https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/ 2. https://gatk.broadinstitute.org/hc/en-us/articles/360035535932-Germline-short-variant-discovery-SNPs-Indels- 3. https://software.broadinstitute.org/software/igv/VCF |
Path: workflows/vc-germline-pe.cwl Branch/Commit ID: 36fd18f11e939d3908b1eca8d2939402f7a99b0f |
|
|
|
adapter for sequence_align_and_tag
Some workflow engines won't stage files in our nested structure, so parse it out here |
Path: definitions/subworkflows/sequence_align_and_tag_adapter.cwl Branch/Commit ID: 31602b94b34ff55876147c7299e1bec47e8d1a31 |
|
|
|
gp_makeblastdb
|
Path: progs/gp_makeblastdb.cwl Branch/Commit ID: cb15f907132fb90bc66b39bb0af3c211801feba1 |
|
|
|
Align reference proteins plane complete workflow
|
Path: protein_alignment/wf_protein_alignment.cwl Branch/Commit ID: cb15f907132fb90bc66b39bb0af3c211801feba1 |
