Explore Workflows
View already parsed workflows here or click here to add your own
Graph | Name | Retrieved From | View |
---|---|---|---|
|
count-lines11-wf-noET.cwl
|
![]() Path: tests/count-lines11-wf-noET.cwl Branch/Commit ID: 0e37d46e793e72b7c16b5ec03e22cb3ce1f55ba3 |
|
|
chipseq-gen-bigwig.cwl
|
![]() Path: subworkflows/chipseq-gen-bigwig.cwl Branch/Commit ID: 01c6af7a598eb44f6bcaa9b5eecf13229f28546e |
|
|
scRNA-seq pipeline using Salmon and Alevin
|
![]() Path: pipeline.cwl Branch/Commit ID: d71be688b76ed09a64cf5d8cf36182fe35cce6b3 |
|
|
DESeq - differential gene expression analysis
Differential gene expression analysis ===================================== Differential gene expression analysis based on the negative binomial distribution Estimate variance-mean dependence in count data from high-throughput sequencing assays and test for differential expression based on a model using the negative binomial distribution. DESeq1 ------ High-throughput sequencing assays such as RNA-Seq, ChIP-Seq or barcode counting provide quantitative readouts in the form of count data. To infer differential signal in such data correctly and with good statistical power, estimation of data variability throughout the dynamic range and a suitable error model are required. Simon Anders and Wolfgang Huber propose a method based on the negative binomial distribution, with variance and mean linked by local regression and present an implementation, [DESeq](http://bioconductor.org/packages/release/bioc/html/DESeq.html), as an R/Bioconductor package DESeq2 ------ In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. [DESeq2](http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html), a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression. |
![]() Path: workflows/deseq.cwl Branch/Commit ID: 954bb2f213d97dfef1cddaf9e830169a92ad0c6b |
|
|
scatter GATK HaplotypeCaller over intervals
|
![]() Path: definitions/subworkflows/gatk_haplotypecaller_iterator.cwl Branch/Commit ID: 0a9a4ce83b49ed4e7eee5bcc09d83725136a36b0 |
|
|
exome alignment and germline variant detection
|
![]() Path: definitions/subworkflows/germline_detect_variants.cwl Branch/Commit ID: 0a9a4ce83b49ed4e7eee5bcc09d83725136a36b0 |
|
|
DiffBind - Differential Binding Analysis of ChIP-Seq Peak Data
Differential Binding Analysis of ChIP-Seq Peak Data --------------------------------------------------- DiffBind processes ChIP-Seq data enriched for genomic loci where specific protein/DNA binding occurs, including peak sets identified by ChIP-Seq peak callers and aligned sequence read datasets. It is designed to work with multiple peak sets simultaneously, representing different ChIP experiments (antibodies, transcription factor and/or histone marks, experimental conditions, replicates) as well as managing the results of multiple peak callers. For more information please refer to: ------------------------------------- Ross-Innes CS, Stark R, Teschendorff AE, Holmes KA, Ali HR, Dunning MJ, Brown GD, Gojis O, Ellis IO, Green AR, Ali S, Chin S, Palmieri C, Caldas C, Carroll JS (2012). “Differential oestrogen receptor binding is associated with clinical outcome in breast cancer.” Nature, 481, -4. |
![]() Path: workflows/diffbind.cwl Branch/Commit ID: 954bb2f213d97dfef1cddaf9e830169a92ad0c6b |
|
|
genomics-workspace-cds.cwl
|
![]() Path: flow_genomicsWorkspace/genomics-workspace-cds.cwl Branch/Commit ID: 5910b4d88aca172252d9102ddb610a7dc9e1347f |
|
|
simple_magicblast.cwl
|
![]() Path: blast-pipelines/simple_magicblast.cwl Branch/Commit ID: 9cf84946390112ca1f27347ee2a90a370a92a4a2 |
|
|
DESeq2 (LRT) - differential gene expression analysis using likelihood ratio test
Runs DESeq2 using LRT (Likelihood Ratio Test) ============================================= The LRT examines two models for the counts, a full model with a certain number of terms and a reduced model, in which some of the terms of the full model are removed. The test determines if the increased likelihood of the data using the extra terms in the full model is more than expected if those extra terms are truly zero. The LRT is therefore useful for testing multiple terms at once, for example testing 3 or more levels of a factor at once, or all interactions between two variables. The LRT for count data is conceptually similar to an analysis of variance (ANOVA) calculation in linear regression, except that in the case of the Negative Binomial GLM, we use an analysis of deviance (ANODEV), where the deviance captures the difference in likelihood between a full and a reduced model. When one performs a likelihood ratio test, the p values and the test statistic (the stat column) are values for the test that removes all of the variables which are present in the full design and not in the reduced design. This tests the null hypothesis that all the coefficients from these variables and levels of these factors are equal to zero. The likelihood ratio test p values therefore represent a test of all the variables and all the levels of factors which are among these variables. However, the results table only has space for one column of log fold change, so a single variable and a single comparison is shown (among the potentially multiple log fold changes which were tested in the likelihood ratio test). This indicates that the p value is for the likelihood ratio test of all the variables and all the levels, while the log fold change is a single comparison from among those variables and levels. **Technical notes** 1. At least two biological replicates are required for every compared category 2. Metadata file describes relations between compared experiments, for example ``` ,time,condition DH1,day5,WT DH2,day5,KO DH3,day7,WT DH4,day7,KO DH5,day7,KO ``` where `time, condition, day5, day7, WT, KO` should be a single words (without spaces) and `DH1, DH2, DH3, DH4, DH5` correspond to the experiment aliases set in **RNA-Seq experiments** input. 3. Design and reduced formulas should start with **~** and include categories or, optionally, their interactions from the metadata file header. See details in DESeq2 manual [here](https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#interactions) and [here](https://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#likelihood-ratio-test) 4. Contrast should be set based on your metadata file header and available categories in a form of `Factor Numerator Denominator`, where `Factor` - column name from metadata file, `Numerator` - category from metadata file to be used as numerator in fold change calculation, `Denominator` - category from metadata file to be used as denominator in fold change calculation. For example `condition WT KO`. |
![]() Path: workflows/deseq-lrt.cwl Branch/Commit ID: a0b22644ca178b640fb74849d23b7c631022f0b5 |