Explore Workflows

View already parsed workflows here or click here to add your own

Graph Name Retrieved From View
workflow graph FASTQ Vector Removal

This workflow clean up vectros from fastq files

https://github.com/ncbi/cwl-ngs-workflows-cbb.git

Path: workflows/Contamination/fastq-vector-removal.cwl

Branch/Commit ID: 3247592a89deafaa0d9c5910a1cb1d000ef9b098

workflow graph DESeq2 (LRT) - differential gene expression analysis using likelihood ratio test

Runs DESeq2 using LRT (Likelihood Ratio Test) ============================================= The LRT examines two models for the counts, a full model with a certain number of terms and a reduced model, in which some of the terms of the full model are removed. The test determines if the increased likelihood of the data using the extra terms in the full model is more than expected if those extra terms are truly zero. The LRT is therefore useful for testing multiple terms at once, for example testing 3 or more levels of a factor at once, or all interactions between two variables. The LRT for count data is conceptually similar to an analysis of variance (ANOVA) calculation in linear regression, except that in the case of the Negative Binomial GLM, we use an analysis of deviance (ANODEV), where the deviance captures the difference in likelihood between a full and a reduced model. When one performs a likelihood ratio test, the p values and the test statistic (the stat column) are values for the test that removes all of the variables which are present in the full design and not in the reduced design. This tests the null hypothesis that all the coefficients from these variables and levels of these factors are equal to zero. The likelihood ratio test p values therefore represent a test of all the variables and all the levels of factors which are among these variables. However, the results table only has space for one column of log fold change, so a single variable and a single comparison is shown (among the potentially multiple log fold changes which were tested in the likelihood ratio test). This indicates that the p value is for the likelihood ratio test of all the variables and all the levels, while the log fold change is a single comparison from among those variables and levels. **Technical notes** 1. At least two biological replicates are required for every compared category 2. Metadata file describes relations between compared experiments, for example ``` ,time,condition DH1,day5,WT DH2,day5,KO DH3,day7,WT DH4,day7,KO DH5,day7,KO ``` where `time, condition, day5, day7, WT, KO` should be a single words (without spaces) and `DH1, DH2, DH3, DH4, DH5` correspond to the experiment aliases set in **RNA-Seq experiments** input. 3. Design and reduced formulas should start with **~** and include categories or, optionally, their interactions from the metadata file header. See details in DESeq2 manual [here](https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#interactions) and [here](https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#likelihood-ratio-test) 4. Contrast should be set based on your metadata file header and available categories in a form of `Factor Numerator Denominator`, where `Factor` - column name from metadata file, `Numerator` - category from metadata file to be used as numerator in fold change calculation, `Denominator` - category from metadata file to be used as denominator in fold change calculation. For example `condition WT KO`.

https://github.com/datirium/workflows.git

Path: workflows/deseq-lrt.cwl

Branch/Commit ID: a68821bf3a9ceadc3b2ffbb535d601d9a645b377

workflow graph kmer_top_n_extract

https://github.com/ncbi/pgap.git

Path: task_types/tt_kmer_top_n_extract.cwl

Branch/Commit ID: ce433f771ebf5677c9f40858e2ae91b1a7e75d30

workflow graph allele-vcf-rnaseq-se.cwl

https://github.com/datirium/workflows.git

Path: workflows/allele-vcf-rnaseq-se.cwl

Branch/Commit ID: 4b8bb1a1ec39056253ca8eee976078e51f4a954e

workflow graph fp_filter workflow

https://github.com/genome/analysis-workflows.git

Path: definitions/subworkflows/fp_filter.cwl

Branch/Commit ID: e59c77629936fad069007ba642cad49fef7ad29f

workflow graph exome alignment and somatic variant detection

https://github.com/genome/analysis-workflows.git

Path: definitions/pipelines/somatic_exome.cwl

Branch/Commit ID: a9133c999502acf94b433af8d39897e6c2cdf65f

workflow graph Unaligned to aligned BAM

https://github.com/genome/analysis-workflows.git

Path: definitions/subworkflows/align.cwl

Branch/Commit ID: 0a9a4ce83b49ed4e7eee5bcc09d83725136a36b0

workflow graph scatter-valuefrom-wf1.cwl

https://github.com/common-workflow-language/cwl-v1.2.git

Path: tests/scatter-valuefrom-wf1.cwl

Branch/Commit ID: c7c97715b400ff2194aa29fc211d3401cea3a9bf

workflow graph sum-wf-noET.cwl

https://github.com/common-workflow-language/cwl-v1.2.git

Path: tests/sum-wf-noET.cwl

Branch/Commit ID: 1f3ef888d9ef2306c828065c460c1800604f0de4

workflow graph Generate ATDP heatmap using Homer

Generate ATDP heatmap centered on TSS from an array of input BAM files and genelist TSV file. Returns array of heatmap JSON files with the names that have the same basenames as input BAM files, but with .json extension

https://github.com/datirium/workflows.git

Path: workflows/heatmap.cwl

Branch/Commit ID: 4a80f5b8f86c83af39494ecc309b789aeda77964