Explore Workflows

View already parsed workflows here or click here to add your own

Graph Name Retrieved From View
workflow graph Filter ChIP/ATAC peaks for Tag Density Profile or Motif Enrichment analyses

Filters ChIP/ATAC peaks with the neatest genes assigned for Tag Density Profile or Motif Enrichment analyses ============================================================================================================ Tool filters output from any ChIP/ATAC pipeline to create a file with regions of interest for Tag Density Profile or Motif Enrichment analyses. Peaks with duplicated coordinates are discarded.

https://github.com/datirium/workflows.git

Path: workflows/filter-peaks-for-heatmap.cwl

Branch/Commit ID: a1f6ca50fcb0881781b3ba0306dd61ebf555eaba

workflow graph wgs alignment and germline variant detection

https://github.com/genome/analysis-workflows.git

Path: definitions/pipelines/germline_wgs.cwl

Branch/Commit ID: 061d3a2fbcd8a1c39c0b38c549e528deb24a9d54

workflow graph Cut-n-Run pipeline paired-end

Experimental pipeline for Cut-n-Run analysis. Uses mapping results from the following experiment types: - `chipseq-pe.cwl` - `trim-chipseq-pe.cwl` - `trim-atacseq-pe.cwl` Note, the upstream analyses should not have duplicates removed

https://github.com/datirium/workflows.git

Path: workflows/trim-chipseq-pe-cut-n-run.cwl

Branch/Commit ID: 9d5cbdd3ea0bb417518115d8092584254598a440

workflow graph Motif Finding with HOMER with custom background regions

Motif Finding with HOMER with custom background regions --------------------------------------------------- HOMER contains a novel motif discovery algorithm that was designed for regulatory element analysis in genomics applications (DNA only, no protein). It is a differential motif discovery algorithm, which means that it takes two sets of sequences and tries to identify the regulatory elements that are specifically enriched in on set relative to the other. It uses ZOOPS scoring (zero or one occurrence per sequence) coupled with the hypergeometric enrichment calculations (or binomial) to determine motif enrichment. HOMER also tries its best to account for sequenced bias in the dataset. It was designed with ChIP-Seq and promoter analysis in mind, but can be applied to pretty much any nucleic acids motif finding problem. For more information please refer to: ------------------------------------- [Official documentation](http://homer.ucsd.edu/homer/motif/)

https://github.com/datirium/workflows.git

Path: workflows/homer-motif-analysis-bg.cwl

Branch/Commit ID: 9e3c3e65c19873cd1ed3cf7cc3b94ebc75ae0cc5

workflow graph kmer_cache_retrieve

https://github.com/ncbi/pgap.git

Path: task_types/tt_kmer_cache_retrieve.cwl

Branch/Commit ID: a34f47d1e37af51e387ecdfa5c3047f106c1146b

workflow graph Trim Galore RNA-Seq pipeline paired-end strand specific

Modified original [BioWardrobe's](https://biowardrobe.com) [PubMed ID:26248465](https://www.ncbi.nlm.nih.gov/pubmed/26248465) **RNA-Seq** basic analysis for a **pair-end** experiment. A corresponded input [FASTQ](http://maq.sourceforge.net/fastq.shtml) file has to be provided. Current workflow should be used only with the single-end RNA-Seq data. It performs the following steps: 1. Trim adapters from input FASTQ files 2. Use STAR to align reads from input FASTQ files according to the predefined reference indices; generate unsorted BAM file and alignment statistics file 3. Use fastx_quality_stats to analyze input FASTQ files and generate quality statistics files 4. Use samtools sort to generate coordinate sorted BAM(+BAI) file pair from the unsorted BAM file obtained on the step 1 (after running STAR) 5. Generate BigWig file on the base of sorted BAM file 6. Map input FASTQ files to predefined rRNA reference indices using Bowtie to define the level of rRNA contamination; export resulted statistics to file 7. Calculate isoform expression level for the sorted BAM file and GTF/TAB annotation file using GEEP reads-counting utility; export results to file

https://github.com/datirium/workflows.git

Path: workflows/trim-rnaseq-pe-dutp.cwl

Branch/Commit ID: a1f6ca50fcb0881781b3ba0306dd61ebf555eaba

workflow graph DESeq - differential gene expression analysis

Differential gene expression analysis ===================================== Differential gene expression analysis based on the negative binomial distribution Estimate variance-mean dependence in count data from high-throughput sequencing assays and test for differential expression based on a model using the negative binomial distribution. DESeq1 ------ High-throughput sequencing assays such as RNA-Seq, ChIP-Seq or barcode counting provide quantitative readouts in the form of count data. To infer differential signal in such data correctly and with good statistical power, estimation of data variability throughout the dynamic range and a suitable error model are required. Simon Anders and Wolfgang Huber propose a method based on the negative binomial distribution, with variance and mean linked by local regression and present an implementation, [DESeq](http://bioconductor.org/packages/release/bioc/html/DESeq.html), as an R/Bioconductor package DESeq2 ------ In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. [DESeq2](http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html), a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.

https://github.com/datirium/workflows.git

Path: workflows/deseq.cwl

Branch/Commit ID: d1bef74924efcb8bfaa00987b3f148d5a192b7a9

workflow graph tt_hmmsearch_wnode.cwl

https://github.com/ncbi/pgap.git

Path: task_types/tt_hmmsearch_wnode.cwl

Branch/Commit ID: 803f6367d1b279a7b6dc1a4e8ae43f1bbec9f760

workflow graph ani_top_n

https://github.com/ncbi/pgap.git

Path: task_types/tt_ani_top_n.cwl

Branch/Commit ID: 7f857f7f2d7c080d27c775b67a6d6f7d94bce31f

workflow graph Subworkflow that runs cnvkit in single sample mode and returns a vcf file

https://github.com/genome/analysis-workflows.git

Path: definitions/subworkflows/cnvkit_single_sample.cwl

Branch/Commit ID: ae75b938e6e8ae777a55686bbacad824b3c6788c