Explore Workflows

View already parsed workflows here or click here to add your own

Graph Name Retrieved From View
workflow graph Bismark Methylation - pipeline for BS-Seq data analysis

Sequence reads are first cleaned from adapters and transformed into fully bisulfite-converted forward (C->T) and reverse read (G->A conversion of the forward strand) versions, before they are aligned to similarly converted versions of the genome (also C->T and G->A converted). Sequence reads that produce a unique best alignment from the four alignment processes against the bisulfite genomes (which are running in parallel) are then compared to the normal genomic sequence and the methylation state of all cytosine positions in the read is inferred. A read is considered to align uniquely if an alignment has a unique best alignment score (as reported by the AS:i field). If a read produces several alignments with the same number of mismatches or with the same alignment score (AS:i field), a read (or a read-pair) is discarded altogether. On the next step we extract the methylation call for every single C analysed. The position of every single C will be written out to a new output file, depending on its context (CpG, CHG or CHH), whereby methylated Cs will be labelled as forward reads (+), non-methylated Cs as reverse reads (-). The output of the methylation extractor is then transformed into a bedGraph and coverage file. The bedGraph counts output is then used to generate a genome-wide cytosine report which reports the number on every single CpG (optionally every single cytosine) in the genome, irrespective of whether it was covered by any reads or not. As this type of report is informative for cytosines on both strands the output may be fairly large (~46mn CpG positions or >1.2bn total cytosine positions in the human genome).

https://github.com/datirium/workflows.git

Path: workflows/bismark-methylation-se.cwl

Branch/Commit ID: 564156a9e1cc7c3679a926c479ba3ae133b1bfd4

workflow graph scatter-valuefrom-inputs-wf1.cwl

https://github.com/common-workflow-language/cwl-v1.1.git

Path: tests/scatter-valuefrom-inputs-wf1.cwl

Branch/Commit ID: 3e90671b25f7840ef2926ad2bacbf447772dda94

workflow graph cond-wf-010_nojs.cwl

https://github.com/common-workflow-language/cwl-v1.2.git

Path: tests/conditionals/cond-wf-010_nojs.cwl

Branch/Commit ID: 57baec040c99d7edef8242ef51b5470b1c82d733

workflow graph rnaseq-se-dutp-mitochondrial.cwl

RNA-Seq strand specific mitochondrial workflow for single-read experiment based on BioWardrobe's basic analysis.

https://github.com/datirium/workflows.git

Path: workflows/rnaseq-se-dutp-mitochondrial.cwl

Branch/Commit ID: e284e3f6dff25037b209895c52f2abd37a1ce1bf

workflow graph step_valuefrom5_wf_with_id_v1_0.cwl

https://github.com/common-workflow-language/cwl-utils.git

Path: testdata/step_valuefrom5_wf_with_id_v1_0.cwl

Branch/Commit ID: 77669d4dd1d1ebd2bdd9810f911608146d9b8e51

workflow graph timelimit3-wf.cwl

https://github.com/common-workflow-language/cwl-v1.2.git

Path: tests/timelimit3-wf.cwl

Branch/Commit ID: a5073143db4155e05df8d2e7eb59d9e62acd65a5

workflow graph DESeq2 (LRT) - differential gene expression analysis using likelihood ratio test

Runs DESeq2 using LRT (Likelihood Ratio Test) ============================================= The LRT examines two models for the counts, a full model with a certain number of terms and a reduced model, in which some of the terms of the full model are removed. The test determines if the increased likelihood of the data using the extra terms in the full model is more than expected if those extra terms are truly zero. The LRT is therefore useful for testing multiple terms at once, for example testing 3 or more levels of a factor at once, or all interactions between two variables. The LRT for count data is conceptually similar to an analysis of variance (ANOVA) calculation in linear regression, except that in the case of the Negative Binomial GLM, we use an analysis of deviance (ANODEV), where the deviance captures the difference in likelihood between a full and a reduced model. When one performs a likelihood ratio test, the p values and the test statistic (the stat column) are values for the test that removes all of the variables which are present in the full design and not in the reduced design. This tests the null hypothesis that all the coefficients from these variables and levels of these factors are equal to zero. The likelihood ratio test p values therefore represent a test of all the variables and all the levels of factors which are among these variables. However, the results table only has space for one column of log fold change, so a single variable and a single comparison is shown (among the potentially multiple log fold changes which were tested in the likelihood ratio test). This indicates that the p value is for the likelihood ratio test of all the variables and all the levels, while the log fold change is a single comparison from among those variables and levels. **Technical notes** 1. At least two biological replicates are required for every compared category 2. Metadata file describes relations between compared experiments, for example ``` ,time,condition DH1,day5,WT DH2,day5,KO DH3,day7,WT DH4,day7,KO DH5,day7,KO ``` where `time, condition, day5, day7, WT, KO` should be a single words (without spaces) and `DH1, DH2, DH3, DH4, DH5` correspond to the experiment aliases set in **RNA-Seq experiments** input. 3. Design and reduced formulas should start with **~** and include categories or, optionally, their interactions from the metadata file header. See details in DESeq2 manual [here](https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#interactions) and [here](https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#likelihood-ratio-test) 4. Contrast should be set based on your metadata file header and available categories in a form of `Factor Numerator Denominator`, where `Factor` - column name from metadata file, `Numerator` - category from metadata file to be used as numerator in fold change calculation, `Denominator` - category from metadata file to be used as denominator in fold change calculation. For example `condition WT KO`.

https://github.com/datirium/workflows.git

Path: workflows/deseq-lrt.cwl

Branch/Commit ID: 261c0232a7a40880f2480b811ed2d7e89c463869

workflow graph example.cwl

Example CWL workflow that uses some advanced features

https://github.com/mskcc/pluto-cwl.git

Path: cwl/example.cwl

Branch/Commit ID: 342e6f1f4f7a3839e579fbe96ccc8d6f7a61ac77

workflow graph Align reference proteins plane complete workflow

https://github.com/ncbi/pgap.git

Path: protein_alignment/wf_protein_alignment.cwl

Branch/Commit ID: 656113dcac0de7cef6cff6c688f61441ee05872a

workflow graph count-lines1-wf.cwl

https://github.com/common-workflow-language/cwltool.git

Path: cwltool/schemas/v1.0/v1.0/count-lines1-wf.cwl

Branch/Commit ID: ae401a813472ca453a99ad067a5e6fc3bd71112b