Explore Workflows
View already parsed workflows here or click here to add your own
Graph | Name | Retrieved From | View |
---|---|---|---|
|
DiffBind - Differential Binding Analysis of ChIP-Seq Peak Data
Differential Binding Analysis of ChIP-Seq Peak Data --------------------------------------------------- DiffBind processes ChIP-Seq data enriched for genomic loci where specific protein/DNA binding occurs, including peak sets identified by ChIP-Seq peak callers and aligned sequence read datasets. It is designed to work with multiple peak sets simultaneously, representing different ChIP experiments (antibodies, transcription factor and/or histone marks, experimental conditions, replicates) as well as managing the results of multiple peak callers. For more information please refer to: ------------------------------------- Ross-Innes CS, Stark R, Teschendorff AE, Holmes KA, Ali HR, Dunning MJ, Brown GD, Gojis O, Ellis IO, Green AR, Ali S, Chin S, Palmieri C, Caldas C, Carroll JS (2012). “Differential oestrogen receptor binding is associated with clinical outcome in breast cancer.” Nature, 481, -4. |
![]() Path: workflows/diffbind.cwl Branch/Commit ID: 4f48ee6f8665a34cdf96e89c012ee807f80c7a3d |
|
|
CWL4IncorporateTSSintoGXF
CWL4IncorporateTSSintoGXF: mapping CAGE-seq data and add TSS information to GXF format (GFF/GTF) file |
![]() Path: workflow/cageseq_gtf_update_pe.cwl Branch/Commit ID: 9728a86f7b73f7657a1f261e77a14ca59bdd561b |
|
|
gcaccess_from_list
|
![]() Path: task_types/tt_gcaccess_from_list.cwl Branch/Commit ID: ce433f771ebf5677c9f40858e2ae91b1a7e75d30 |
|
|
directory.cwl
Inspect provided directory and return filenames. Generate a new directory and return it (including content). |
![]() Path: tests/wf/directory.cwl Branch/Commit ID: ecdfe1ee769d05790f70ac87a711131f441f3753 |
|
|
Differential Methylation Workflow
A basic differential methylation analysis workflow using BismarkCov formatted bed files as input to the RnBeads tool. Analysis is conducted on region and sites levels according to the sample groups specified by user (limited to 2 conditions in this workflow implementation). See report html files for detailed descriptions of analyses and results interpretation. ### __Inputs__ *General Info:* - Experiment short name/Alias* - a unique name for the sample (e.g. what was used on tubes while processing it) - Condition 1 name - name defining condition/group 1 - Condition 2 name - name defining condition/group 2 - Bismark coverage files* for condition1 - minumum of 2 is required for analysis - Bismark coverage files* for condition2 - minumum of 2 is required for analysis - Sample genome - available options: hg19, hg38, mm9, mm10, rn5 - Genome type - indicate mismark index used for upstream samples (input for conditions 1 and 2) *Advanced:* - Number of threads for steps that support multithreading - default set to `4` *[BismarkCov formatted bed](https://www.bioinformatics.babraham.ac.uk/projects/bismark/Bismark_User_Guide.pdf): The genome-wide cytosine report (optional) is tab-delimited in the following format (1-based coords): <chromosome> <position> <strand> <count methylated> <count unmethylated> <C-context> <trinucleotide context> ### __Outputs__ Intermediate and final downloadable outputs include: - sig_dm_sites.bed ([bed for IGV](https://genome.ucsc.edu/FAQ/FAQformat.html#format1); sig diff meth sites) - sig_dm_sites_annotated.tsv (tsv for TABLE; for each site above, closest single gene annotation) - Site_id, unique indentifer per methylated site - Site_Chr, chromosome of methylated site - Site_position, 1-based position in chr of methylated site - Site_strand, strand of methylated site - Log2_Meth_Quotient, log2 of the quotient in methylation: log2((mean.g1+epsilon)/(mean.g2+epsilon)), where epsilon:=0.01. In case of paired analysis, it is the mean of the pairwise quotients. - FDR, adjusted p-values, all <0.10 assumed to be significant - Coverage_score, value between 0-1000 reflects strength of mean coverage difference between conditions and equals [1000-(1000/(meancov_g1-meancov_g2)^2](https://www.wolframalpha.com/input?i=solve+1000-%281000%2F%28x%5E2%29%29), if meancov_g1-meancov_g2==0, score=0, elif score<1==1, else score - meancov_g1, mean coverage of condition1 - meancov_g2, mean coverage of condition2 - refSeq_id, RefSeq gene id - Gene_id, gene symbol - Chr, gene chromosome - txStart, gene transcription start position - tsEnd, gene transcription end position - txStrand, gene strand - stdout and stderr log files - Packaged RnBeads reports directory (reports.tar.gz) contains: reports/ ├── configuration ├── data_import.html ├── data_import_data ├── data_import_images ├── data_import_pdfs ├── differential_methylation.html ├── differential_methylation_data ├── differential_methylation_images ├── differential_methylation_pdfs ├── preprocessing.html ├── preprocessing_data ├── preprocessing_images ├── preprocessing_pdfs ├── quality_control.html ├── quality_control_data ├── quality_control_images ├── quality_control_pdfs ├── tracks_and_tables.html ├── tracks_and_tables_data ├── tracks_and_tables_images └── tracks_and_tables_pdfs Reported methylation is in the form of regions (genes, promoters, cpg, tiling) and specific sites: - genes - Ensembl gene definitions are downloaded using the biomaRt package. - promoters - A promoter is defined as the region spanning 1,500 bases upstream and 500 bases downstream of the transcription start site of the corresponding gene - cpg - the CpG islands from the UCSC Genome Browser - tiling - a window size of 5 kilobases are defined over the whole genome - sites - all cytosines in the context of CpGs in the respective genome ### __Data Analysis Steps__ 1. generate sample sheet with associated conditions for testing in RnBeads 2. setup rnbeads analyses in R, and run differential methylation analysis 3. process output diffmeth files for regions and sites 4. find single closest gene annotations for all significantly diffmeth sites 5. package and save rnbeads report directory 6. clean up report dir for html outputs ### __References__ - https://rnbeads.org/materials/example_3/differential_methylation.html - Makambi, K. (2003) Weighted inverse chi-square method for correlated significance tests. Journal of Applied Statistics, 30(2), 225234 - https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4216143/ - Assenov Y, Müller F, Lutsik P, Walter J, Lengauer T, Bock C. Comprehensive analysis of DNA methylation data with RnBeads. Nat Methods. 2014 Nov;11(11):1138-1140. doi: 10.1038/nmeth.3115. Epub 2014 Sep 28. PMID: 25262207; PMCID: PMC4216143. |
![]() Path: workflows/diffmeth.cwl Branch/Commit ID: b4d578c2ba4713a5a22163d9f8c7105acda1f22e |
|
|
phase VCF
|
![]() Path: definitions/subworkflows/phase_vcf.cwl Branch/Commit ID: adcae308fdccaa1190083616118dfadb4df65dca |
|
|
exome alignment and tumor-only variant detection
|
![]() Path: definitions/pipelines/exome.cwl Branch/Commit ID: 86fbeb95ef85111f3b4c6bc2bba8f06cef64e157 |
|
|
Immunotherapy Workflow
|
![]() Path: definitions/pipelines/immuno.cwl Branch/Commit ID: d3e4bf55753cd92f97537c7d701187ea92d1e5f0 |
|
|
count-lines7-wf.cwl
|
![]() Path: tests/count-lines7-wf.cwl Branch/Commit ID: 86c46cb397de029e4c91f02cca40fa2b54d22f37 |
|
|
gcaccess_from_list
|
![]() Path: task_types/tt_gcaccess_from_list.cwl Branch/Commit ID: be5ae41801b19ebc69a2889d8fdb39e8e2359611 |