Explore Workflows

View already parsed workflows here or click here to add your own

Graph Name Retrieved From View
workflow graph Bismark Methylation - pipeline for BS-Seq data analysis

Sequence reads are first cleaned from adapters and transformed into fully bisulfite-converted forward (C->T) and reverse read (G->A conversion of the forward strand) versions, before they are aligned to similarly converted versions of the genome (also C->T and G->A converted). Sequence reads that produce a unique best alignment from the four alignment processes against the bisulfite genomes (which are running in parallel) are then compared to the normal genomic sequence and the methylation state of all cytosine positions in the read is inferred. A read is considered to align uniquely if an alignment has a unique best alignment score (as reported by the AS:i field). If a read produces several alignments with the same number of mismatches or with the same alignment score (AS:i field), a read (or a read-pair) is discarded altogether. On the next step we extract the methylation call for every single C analysed. The position of every single C will be written out to a new output file, depending on its context (CpG, CHG or CHH), whereby methylated Cs will be labelled as forward reads (+), non-methylated Cs as reverse reads (-). The output of the methylation extractor is then transformed into a bedGraph and coverage file. The bedGraph counts output is then used to generate a genome-wide cytosine report which reports the number on every single CpG (optionally every single cytosine) in the genome, irrespective of whether it was covered by any reads or not. As this type of report is informative for cytosines on both strands the output may be fairly large (~46mn CpG positions or >1.2bn total cytosine positions in the human genome).

https://github.com/datirium/workflows.git

Path: workflows/bismark-methylation-se.cwl

Branch/Commit ID: c5bae2ca862c764911b83d1f15ff6af4e2a0db28

workflow graph scatter-wf2.cwl

https://github.com/common-workflow-language/cwl-v1.2.git

Path: tests/scatter-wf2.cwl

Branch/Commit ID: c7c97715b400ff2194aa29fc211d3401cea3a9bf

workflow graph Genomic regions intersection and visualization

Genomic regions intersection and visualization ============================================== 1. Merges intervals within each of the filtered peaks files from ChIP/ATAC experiments 2. Overlaps merged intervals and assigns the nearest genes to them

https://github.com/datirium/workflows.git

Path: workflows/intervene.cwl

Branch/Commit ID: 30031ca5e69cec603c4733681de54dc7bffa20a3

workflow graph FastQC - a quality control tool for high throughput sequence data

FastQC - a quality control tool for high throughput sequence data ===================================== FastQC aims to provide a simple way to do some quality control checks on raw sequence data coming from high throughput sequencing pipelines. It provides a modular set of analyses which you can use to give a quick impression of whether your data has any problems of which you should be aware before doing any further analysis. The main functions of FastQC are: - Import of data from FastQ files (any variant) - Providing a quick overview to tell you in which areas there may be problems - Summary graphs and tables to quickly assess your data - Export of results to an HTML based permanent report - Offline operation to allow automated generation of reports without running the interactive application

https://github.com/datirium/workflows.git

Path: workflows/fastqc.cwl

Branch/Commit ID: 30031ca5e69cec603c4733681de54dc7bffa20a3

workflow graph allele-alignreads-se-pe.cwl

Workflow maps FASTQ files from `fastq_files` input into reference genome `reference_star_indices_folder` and insilico generated `insilico_star_indices_folder` genome (concatenated genome for both `strain1` and `strain2` strains). For both genomes STAR is run with `outFilterMultimapNmax` parameter set to 1 to discard all of the multimapped reads. For insilico genome SAM file is generated. Then it's splitted into two SAM files based on strain names and then sorted by coordinates into the BAM format. For reference genome output BAM file from STAR slignment is also coordinate sorted.

https://github.com/Barski-lab/workflows.git

Path: subworkflows/allele-alignreads-se-pe.cwl

Branch/Commit ID: afbec98437a7796a509fffbad8c3370aa099f059

workflow graph scatter-valuefrom-wf2.cwl

https://github.com/common-workflow-language/cwltool.git

Path: cwltool/schemas/v1.0/v1.0/scatter-valuefrom-wf2.cwl

Branch/Commit ID: beab66d649dd3ee82a013322a5e830875e8556ba

workflow graph gcaccess_from_list

https://github.com/ncbi/pgap.git

Path: task_types/tt_gcaccess_from_list.cwl

Branch/Commit ID: 49732e54e2fe2eafd2f82df3c482c73e642f6d64

workflow graph count-lines11-wf.cwl

https://github.com/common-workflow-language/cwl-v1.2.git

Path: tests/count-lines11-wf.cwl

Branch/Commit ID: e62f99dd79d6cb9c157cceb458f74200da84f6e9

workflow graph conflict-wf.cwl#collision

https://github.com/common-workflow-language/cwltool.git

Path: cwltool/schemas/v1.0/v1.0/conflict-wf.cwl

Branch/Commit ID: 4a31f2a1c1163492ae37bbc748a299e8318c462c

Packed ID: collision

workflow graph Running cellranger count and lineage inference

https://github.com/genome/analysis-workflows.git

Path: definitions/subworkflows/single_cell_rnaseq.cwl

Branch/Commit ID: 93656ed6582073e434eab168c610625a835dce37