Explore Workflows

View already parsed workflows here or click here to add your own

Graph	Name	Retrieved From	View
	DESeq2 (LRT) - differential gene expression analysis using likelihood ratio test Runs DESeq2 using LRT (Likelihood Ratio Test) ============================================= The LRT examines two models for the counts, a full model with a certain number of terms and a reduced model, in which some of the terms of the full model are removed. The test determines if the increased likelihood of the data using the extra terms in the full model is more than expected if those extra terms are truly zero. The LRT is therefore useful for testing multiple terms at once, for example testing 3 or more levels of a factor at once, or all interactions between two variables. The LRT for count data is conceptually similar to an analysis of variance (ANOVA) calculation in linear regression, except that in the case of the Negative Binomial GLM, we use an analysis of deviance (ANODEV), where the deviance captures the difference in likelihood between a full and a reduced model. When one performs a likelihood ratio test, the p values and the test statistic (the stat column) are values for the test that removes all of the variables which are present in the full design and not in the reduced design. This tests the null hypothesis that all the coefficients from these variables and levels of these factors are equal to zero. The likelihood ratio test p values therefore represent a test of all the variables and all the levels of factors which are among these variables. However, the results table only has space for one column of log fold change, so a single variable and a single comparison is shown (among the potentially multiple log fold changes which were tested in the likelihood ratio test). This indicates that the p value is for the likelihood ratio test of all the variables and all the levels, while the log fold change is a single comparison from among those variables and levels. Technical notes 1. At least two biological replicates are required for every compared category 2. Metadata file describes relations between compared experiments, for example ``` ,time,condition DH1,day5,WT DH2,day5,KO DH3,day7,WT DH4,day7,KO DH5,day7,KO ``` where `time, condition, day5, day7, WT, KO` should be a single words (without spaces) and `DH1, DH2, DH3, DH4, DH5` correspond to the experiment aliases set in RNA-Seq experiments input. 3. Design and reduced formulas should start with ~ and include categories or, optionally, their interactions from the metadata file header. See details in DESeq2 manual [here](https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#interactions) and [here](https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#likelihood-ratio-test) 4. Contrast should be set based on your metadata file header and available categories in a form of `Factor Numerator Denominator`, where `Factor` - column name from metadata file, `Numerator` - category from metadata file to be used as numerator in fold change calculation, `Denominator` - category from metadata file to be used as denominator in fold change calculation. For example `condition WT KO`.	https://github.com/datirium/workflows.git Path: workflows/deseq-lrt.cwl Branch/Commit ID: 261c0232a7a40880f2480b811ed2d7e89c463869
	example.cwl Example CWL workflow that uses some advanced features	https://github.com/mskcc/pluto-cwl.git Path: cwl/example.cwl Branch/Commit ID: 342e6f1f4f7a3839e579fbe96ccc8d6f7a61ac77
	Align reference proteins plane complete workflow	https://github.com/ncbi/pgap.git Path: protein_alignment/wf_protein_alignment.cwl Branch/Commit ID: 656113dcac0de7cef6cff6c688f61441ee05872a
	count-lines1-wf.cwl	https://github.com/common-workflow-language/cwltool.git Path: cwltool/schemas/v1.0/v1.0/count-lines1-wf.cwl Branch/Commit ID: ae401a813472ca453a99ad067a5e6fc3bd71112b
	RNA-Seq pipeline paired-end stranded mitochondrial Slightly changed original [BioWardrobe's](https://biowardrobe.com) [PubMed ID:26248465](https://www.ncbi.nlm.nih.gov/pubmed/26248465) RNA-Seq basic analysis for strand specific pair-end experiment. An additional steps were added to map data to mitochondrial chromosome only and then merge the output. Experiment files in [FASTQ](http://maq.sourceforge.net/fastq.shtml) format either compressed or not can be used. Current workflow should be used only with the pair-end strand specific RNA-Seq data. It performs the following steps: 1. `STAR` to align reads from input FASTQ file according to the predefined reference indices; generate unsorted BAM file and alignment statistics file 2. `fastx_quality_stats` to analyze input FASTQ file and generate quality statistics file 3. `samtools sort` to generate coordinate sorted BAM(+BAI) file pair from the unsorted BAM file obtained on the step 1 (after running STAR) 5. Generate BigWig file on the base of sorted BAM file 6. Map input FASTQ file to predefined rRNA reference indices using Bowtie to define the level of rRNA contamination; export resulted statistics to file 7. Calculate isoform expression level for the sorted BAM file and GTF/TAB annotation file using `GEEP` reads-counting utility; export results to file	https://github.com/datirium/workflows.git Path: workflows/rnaseq-pe-dutp-mitochondrial.cwl Branch/Commit ID: 57437c1e9f881411b65f79acd64b7cf14df5b901
	kfdrc-gatk-haplotypecaller-wf.cwl	https://github.com/kids-first/kf-alignment-workflow.git Path: workflows/kfdrc-gatk-haplotypecaller-wf.cwl Branch/Commit ID: af97e25cb213233a4923c881f7c6210b57960cb9
	Feature expression merge - combines feature expression from several experiments Feature expression merge - combines feature expression from several experiments ========================================================================= Workflows merges RPKM (by default) gene expression from several experiments based on the values from GeneId, Chrom, TxStart, TxEnd and Strand columns (by default). Reported unique columns are renamed based on the experiments names.	https://github.com/datirium/workflows.git Path: workflows/feature-merge.cwl Branch/Commit ID: 2caa50434966ebdf4b33e5ca689c2e4df32f9058
	WGS QC workflow mouse	https://github.com/genome/analysis-workflows.git Path: definitions/subworkflows/qc_wgs_mouse.cwl Branch/Commit ID: 22fce2dbdada0c4135b6f0677f78535cf980cb07
	pass-unconnected.cwl	https://github.com/common-workflow-language/cwl-v1.2.git Path: tests/pass-unconnected.cwl Branch/Commit ID: 707ebcd2173889604459c5f4ffb55173c508abb3
	default-dir5.cwl	https://github.com/common-workflow-language/cwltool.git Path: tests/wf/default-dir5.cwl Branch/Commit ID: 55ccde7c2fe3e7899136ce8606a341e292d7050a