Explore Workflows

View already parsed workflows here or click here to add your own

Graph Name Retrieved From View
workflow graph Motif Finding with HOMER with custom background regions

Motif Finding with HOMER with custom background regions --------------------------------------------------- HOMER contains a novel motif discovery algorithm that was designed for regulatory element analysis in genomics applications (DNA only, no protein). It is a differential motif discovery algorithm, which means that it takes two sets of sequences and tries to identify the regulatory elements that are specifically enriched in on set relative to the other. It uses ZOOPS scoring (zero or one occurrence per sequence) coupled with the hypergeometric enrichment calculations (or binomial) to determine motif enrichment. HOMER also tries its best to account for sequenced bias in the dataset. It was designed with ChIP-Seq and promoter analysis in mind, but can be applied to pretty much any nucleic acids motif finding problem. For more information please refer to: ------------------------------------- [Official documentation](http://homer.ucsd.edu/homer/motif/)

https://github.com/datirium/workflows.git

Path: workflows/homer-motif-analysis-bg.cwl

Branch/Commit ID: ebbf23764ede324cabc064bd50647c1f643726fa

workflow graph Kraken2 Database installation pipeline

This workflow downloads the user-selected pre-built kraken2 database from: https://benlangmead.github.io/aws-indexes/k2 ### __Inputs__ Select a pre-built Kraken2 database to download and use for metagenomic classification: - Available options comprised of various combinations of RefSeq reference genome sets: - [Viral (0.5 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_viral_20221209.tar.gz), all refseq viral genomes - [MinusB (8.7 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_minusb_20221209.tar.gz), standard minus bacteria (archaea, viral, plasmid, human1, UniVec_Core) - [PlusPFP-16 (15.0 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_pluspfp_16gb_20221209.tar.gz), standard (archaea, bacteria, viral, plasmid, human1, UniVec_Core) + (protozoa, fungi & plant) capped at 16 GB (shrunk via random kmer downselect) - [EuPathDB46 (34.1 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_eupathdb48_20201113.tar.gz), eukaryotic pathogen genomes with contaminants removed (https://veupathdb.org/veupathdb/app) - [16S_gg_13_5 (73 MB)](https://genome-idx.s3.amazonaws.com/kraken/16S_Greengenes13.5_20200326.tgz), Greengenes 16S rRNA database ([release 13.5](https://greengenes.secondgenome.com/?prefix=downloads/greengenes_database/gg_13_5/), 20200326)\n - [16S_silva_138 (112 MB)](https://genome-idx.s3.amazonaws.com/kraken/16S_Silva138_20200326.tgz), SILVA 16S rRNA database ([release 138.1](https://www.arb-silva.de/documentation/release-1381/), 20200827) ### __Outputs__ - k2db, an upstream database used by kraken2 classification tool - compressed_k2db_tar, compressed and tarred kraken2 database directory file for download and use outside of scidap ### __Data Analysis Steps__ 1. download selected pre-built kraken2 database. 2. make available as upstream source for kraken2 metagenomic taxonomic classification. ### __References__ - Wood, D.E., Lu, J. & Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol 20, 257 (2019). https://doi.org/10.1186/s13059-019-1891-0

https://github.com/datirium/workflows.git

Path: workflows/kraken2-databases.cwl

Branch/Commit ID: 69643d8c15f5357a320aa7e2f6adb2e71302fd20

workflow graph chipseq-header.cwl

https://github.com/datirium/workflows.git

Path: metadata/chipseq-header.cwl

Branch/Commit ID: 69643d8c15f5357a320aa7e2f6adb2e71302fd20

workflow graph DiffBind - Differential Binding Analysis of ChIP-Seq or CUTß&RUN/Tag Peak Data

Differential Binding Analysis of ChIP-Seq or CUT&RUN/Tag Peak Data --------------------------------------------------- DiffBind processes ChIP-Seq or CUT&RUN/Tag data enriched for genomic loci where specific protein/DNA binding occurs, including peak sets identified by peak caller tools and aligned sequence read datasets. It is designed to work with multiple peak sets simultaneously, representing different ChIP or CUT&RUN/Tag experiments (antibodies, transcription factor and/or histone marks, experimental conditions, replicates) as well as managing the results of multiple peak callers. For more information please refer to: ------------------------------------- Ross-Innes CS, Stark R, Teschendorff AE, Holmes KA, Ali HR, Dunning MJ, Brown GD, Gojis O, Ellis IO, Green AR, Ali S, Chin S, Palmieri C, Caldas C, Carroll JS (2012). “Differential oestrogen receptor binding is associated with clinical outcome in breast cancer.” Nature, 481, -4.

https://github.com/datirium/workflows.git

Path: workflows/diffbind.cwl

Branch/Commit ID: 69643d8c15f5357a320aa7e2f6adb2e71302fd20

workflow graph SetParameterWorkflowMissing

This is a placeholder for a missing setting workflow.

https://github.com/gammasim/workflows.git

Path: workflows/SetParameterWorkflowMissing.cwl

Branch/Commit ID: bf4d4a44a543bcc04f4508ce020751c71550acf5

workflow graph genome-kallisto-index.cwl

Generates a FASTA file with the DNA sequences for all transcripts in a GFF file and builds kallisto index

https://github.com/Barski-lab/workflows.git

Path: tools/genome-kallisto-index.cwl

Branch/Commit ID: de847468843203ce92b6d19323c5fe77dc488e34

workflow graph copy_outputs.cwl

https://github.com/mskcc/argos-cwl.git

Path: workflows/copy_outputs.cwl

Branch/Commit ID: 0b9721d7d512352c7f80b27f42b3192a585ed5f6

workflow graph cond-single-source-wf-005.1.cwl

https://github.com/common-workflow-language/cwl-utils.git

Path: testdata/cond-single-source-wf-005.1.cwl

Branch/Commit ID: 5759b4275906e6cfe13912c8426de2a2237cb4b0

workflow graph tt_univec_wnode.cwl

https://github.com/ncbi/pgap.git

Path: task_types/tt_univec_wnode.cwl

Branch/Commit ID: c64599f5db2437f9323d41cc3d8d9efb20a2667e

workflow graph scatter-wf3.cwl#main

https://github.com/common-workflow-language/common-workflow-language.git

Path: v1.0/v1.0/scatter-wf3.cwl

Branch/Commit ID: 4fd45edb9531a03223c18a586e32d0baf0d5acb2

Packed ID: main