Explore Workflows

View already parsed workflows here or click here to add your own

Graph	Name	Retrieved From	View
	ROSE: rank ordering of super-enhancers Super-enhancers, consist of clusters of enhancers that are densely occupied by the master regulators and Mediator. Super-enhancers differ from typical enhancers in size, transcription factor density and content, ability to activate transcription, and sensitivity to perturbation. Use to create stitched enhancers, and to separate super-enhancers from typical enhancers using sequencing data (.bam) given a file of previously identified constituent enhancers (.gff)	https://github.com/datirium/workflows.git Path: workflows/super-enhancer.cwl Branch/Commit ID: 261c0232a7a40880f2480b811ed2d7e89c463869
	Single-Cell RNA-Seq Filtering Analysis Single-Cell RNA-Seq Filtering Analysis Removes low-quality cells from the outputs of the “Cell Ranger Count (RNA)”, “Cell Ranger Count (RNA+VDJ)”, and “Cell Ranger Aggregate (RNA, RNA+VDJ)” pipelines. The results of this workflow are used in the “Single-Cell RNA-Seq Dimensionality Reduction Analysis” pipeline.	https://github.com/datirium/workflows.git Path: workflows/sc-rna-filter.cwl Branch/Commit ID: 261c0232a7a40880f2480b811ed2d7e89c463869
	heatmap-prepare.cwl Workflow runs homer-make-tag-directory.cwl tool using scatter for the following inputs - bam_file - fragment_size - total_reads `dotproduct` is used as a `scatterMethod`, so one element will be taken from each array to construct each job: 1) bam_file[0] fragment_size[0] total_reads[0] 2) bam_file[1] fragment_size[1] total_reads[1] ... N) bam_file[N] fragment_size[N] total_reads[N] `bam_file`, `fragment_size` and `total_reads` arrays should have the identical order.	https://github.com/Barski-lab/workflows.git Path: tools/heatmap-prepare.cwl Branch/Commit ID: 2d54a7edc45b7dfbee41ecef200a634fd0cd5e97
	conflict-wf.cwl#collision	https://github.com/common-workflow-language/cwltool.git Path: cwltool/schemas/v1.0/v1.0/conflict-wf.cwl Branch/Commit ID: 10492acee927c177933160f6ad67085f9112b0d1 Packed ID: collision
	Variant calling germline paired-end A workflow for the Broad Institute's best practices gatk4 germline variant calling pipeline. ## __Outputs__ #### Primary Output files: - bqsr2_indels.vcf, filtered and recalibrated indels (IGV browser) - bqsr2_snps.vcf, filtered and recalibrated snps (IGV browser) - bqsr2_snps.ann.vcf, filtered and recalibrated snps with effect annotations #### Secondary Output files: - sorted_dedup_reads.bam, sorted deduplicated alignments (IGV browser) - raw_indels.vcf, first pass indel calls - raw_snps.vcf, first pass snp calls #### Reports: - overview.md (input list, alignment metrics, variant counts) - insert_size_histogram.pdf - recalibration_plots.pdf - snpEff_summary.html ## __Inputs__ #### General Info - Sample short name/Alias: unique name for sample - Experimental condition: condition, variable, etc name (e.g. \"control\" or \"20C 60min\") - Cells: name of cells used for the sample - Catalog No.: vender catalog number if available - BWA index: BWA index sample that contains reference genome FASTA with associated indices. - SNPEFF database: Name of SNPEFF database to use for SNP effect annotation. - Read 1 file: First FASTQ file (generally contains \"R1\" in the filename) - Read 2 file: Paired FASTQ file (generally contains \"R2\" in the filename) #### Advanced - Ploidy: number of copies per chromosome (default should be 2) - SNP filters: see Step 6 Notes: https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/ - Indel filters: see Step 7 Notes: https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/ #### SNPEFF notes: Get snpeff databases using `docker run --rm -ti gatk4-dev /bin/bash` then running `java -jar $SNPEFF_JAR databases`. Then, use the first column as SNPEFF input (e.g. \"hg38\"). - hg38, Homo_sapiens (USCS), http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_hg38.zip - mm10, Mus_musculus, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_mm10.zip - dm6.03, Drosophila_melanogaster, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_dm6.03.zip - Rnor_6.0.86, Rattus_norvegicus, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_Rnor_6.0.86.zip - R64-1-1.86, Saccharomyces_cerevisiae, http://downloads.sourceforge.net/project/snpeff/databases/v4_3/snpEff_v4_3_R64-1-1.86.zip ### __Data Analysis Steps__ 1. Trimming the adapters with TrimGalore. - This step is particularly important when the reads are long and the fragments are short - resulting in sequencing adapters at the ends of reads. If adapter is not removed the read will not map. TrimGalore can recognize standard adapters, such as Illumina or Nextera/Tn5 adapters. 2. Generate quality control statistics of trimmed, unmapped sequence data 3. Run germline variant calling pipeline, custom wrapper script implementing Steps 1 - 17 of the Broad Institute's best practices gatk4 germline variant calling pipeline (https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/) ### __References__ 1. https://gencore.bio.nyu.edu/variant-calling-pipeline-gatk4/ 2. https://gatk.broadinstitute.org/hc/en-us/articles/360035535932-Germline-short-variant-discovery-SNPs-Indels- 3. https://software.broadinstitute.org/software/igv/VCF	https://github.com/datirium/workflows.git Path: workflows/vc-germline-pe.cwl Branch/Commit ID: 261c0232a7a40880f2480b811ed2d7e89c463869
	MAnorm SE - quantitative comparison of ChIP-Seq single-read data What is MAnorm? -------------- MAnorm is a robust model for quantitative comparison of ChIP-Seq data sets of TFs (transcription factors) or epigenetic modifications and you can use it for: * Normalization of two ChIP-seq samples * Quantitative comparison (differential analysis) of two ChIP-seq samples * Evaluating the overlap enrichment of the protein binding sites(peaks) * Elucidating underlying mechanisms of cell-type specific gene regulation How MAnorm works? ---------------- MAnorm uses common peaks of two samples as a reference to build the rescaling model for normalization, which is based on the empirical assumption that if a chromatin-associated protein has a substantial number of peaks shared in two conditions, the binding at these common regions will tend to be determined by similar mechanisms, and thus should exhibit similar global binding intensities across samples. The observed differences on common peaks are presumed to reflect the scaling relationship of ChIP-Seq signals between two samples, which can be applied to all peaks. What do the inputs mean? ---------------- ### General Experiment short name/Alias * short name for you experiment to identify among the others ChIP-Seq SE sample 1 * previously analyzed ChIP-Seq single-read experiment to be used as Sample 1 ChIP-Seq SE sample 2 * previously analyzed ChIP-Seq single-read experiment to be used as Sample 2 Genome * Reference genome to be used for gene assigning ### Advanced Reads shift size for sample 1 * This value is used to shift reads towards 3' direction to determine the precise binding site. Set as half of the fragment length. Default 100 Reads shift size for sample 2 * This value is used to shift reads towards 5' direction to determine the precise binding site. Set as half of the fragment length. Default 100 M-value (log2-ratio) cutoff * Absolute M-value (log2-ratio) cutoff to define biased (differential binding) peaks. Default: 1.0 P-value cutoff * P-value cutoff to define biased peaks. Default: 0.01 Window size * Window size to count reads and calculate read densities. 2000 is recommended for sharp histone marks like H3K4me3 and H3K27ac, and 1000 for TFs or DNase-seq. Default: 2000	https://github.com/datirium/workflows.git Path: workflows/manorm-se.cwl Branch/Commit ID: 42dc4f70b117e78785b82865ec4c4b941ac1c259
	Chunked version of phmmer-v3.2.cwl	https://github.com/mscheremetjew/workflow-is-cwl.git Path: workflows/phmmer-v3.2-chunked-wf.cwl Branch/Commit ID: 72bbd5a80688e6a387bfdff5881db2cc3523f7b7
	nestedworkflows.cwl	https://github.com/common-workflow-language/user_guide.git Path: src/_includes/cwl/workflows/nestedworkflows.cwl Branch/Commit ID: 3bfa62397cece91175f3652e2df7d8b43beb0c15
	1st-workflow.cwl	https://github.com/common-workflow-language/user_guide.git Path: src/_includes/cwl/workflows/1st-workflow.cwl Branch/Commit ID: 3bfa62397cece91175f3652e2df7d8b43beb0c15
	wf-loadContents2.cwl	https://github.com/common-workflow-language/cwl-v1.2.git Path: tests/wf-loadContents2.cwl Branch/Commit ID: 5f27e234b4ca88ed1280dedf9e3391a01de12912