Explore Workflows
View already parsed workflows here or click here to add your own
| Graph | Name | Retrieved From | View |
|---|---|---|---|
|
|
Generate genome indices for STAR & bowtie
Creates indices for: * [STAR](https://github.com/alexdobin/STAR) v2.5.3a (03/17/2017) PMID: [23104886](https://www.ncbi.nlm.nih.gov/pubmed/23104886) * [bowtie](http://bowtie-bio.sourceforge.net/tutorial.shtml) v1.2.0 (12/30/2016) It performs the following steps: 1. `STAR --runMode genomeGenerate` to generate indices, based on [FASTA](http://zhanglab.ccmb.med.umich.edu/FASTA/) and [GTF](http://mblab.wustl.edu/GTF2.html) input files, returns results as an array of files 2. Outputs indices as [Direcotry](http://www.commonwl.org/v1.0/CommandLineTool.html#Directory) data type 3. Separates *chrNameLength.txt* file from Directory output 4. `bowtie-build` to generate indices requires genome [FASTA](http://zhanglab.ccmb.med.umich.edu/FASTA/) file as input, returns results as a group of main and secondary files |
Path: workflows/genome-indices.cwl Branch/Commit ID: 9ee330737f4603e4e959ffe786fbb2046db70a00 |
|
|
|
tt_blastn_wnode
|
Path: task_types/tt_blastn_wnode.cwl Branch/Commit ID: 50d161364e2859ed5c95ef07c9f7234f1431cf31 |
|
|
|
Bisulfite alignment and QC
|
Path: definitions/pipelines/bisulfite.cwl Branch/Commit ID: d2c2f2eb846ae2e9cdcab46e3bb88e42126cb3f5 |
|
|
|
FastQC - a quality control tool for high throughput sequence data
FastQC - a quality control tool for high throughput sequence data ===================================== FastQC aims to provide a simple way to do some quality control checks on raw sequence data coming from high throughput sequencing pipelines. It provides a modular set of analyses which you can use to give a quick impression of whether your data has any problems of which you should be aware before doing any further analysis. The main functions of FastQC are: - Import of data from FastQ files (any variant) - Providing a quick overview to tell you in which areas there may be problems - Summary graphs and tables to quickly assess your data - Export of results to an HTML based permanent report - Offline operation to allow automated generation of reports without running the interactive application |
Path: workflows/fastqc.cwl Branch/Commit ID: 7ae3b75bbe614e59cdeaba06047234a6c40c0fe9 |
|
|
|
conflict-wf.cwl#collision
|
Path: cwltool/schemas/v1.0/v1.0/conflict-wf.cwl Branch/Commit ID: 1e5ad10c6b0d1c5f531737d12ef64062a00baef2 Packed ID: collision |
|
|
|
running cellranger mkfastq and count
|
Path: definitions/subworkflows/cellranger_mkfastq_and_count.cwl Branch/Commit ID: f401b02285f30de1c12ac2859134099fe04be33f |
|
|
|
Trim Galore SMARTer RNA-Seq pipeline paired-end strand specific
https://chipster.csc.fi/manual/library-type-summary.html Modified original [BioWardrobe's](https://biowardrobe.com) [PubMed ID:26248465](https://www.ncbi.nlm.nih.gov/pubmed/26248465) **RNA-Seq** basic analysis for a **pair-end** experiment. A corresponded input [FASTQ](http://maq.sourceforge.net/fastq.shtml) file has to be provided. Current workflow should be used only with the single-end RNA-Seq data. It performs the following steps: 1. Trim adapters from input FASTQ files 2. Use STAR to align reads from input FASTQ files according to the predefined reference indices; generate unsorted BAM file and alignment statistics file 3. Use fastx_quality_stats to analyze input FASTQ files and generate quality statistics files 4. Use samtools sort to generate coordinate sorted BAM(+BAI) file pair from the unsorted BAM file obtained on the step 1 (after running STAR) 5. Generate BigWig file on the base of sorted BAM file 6. Map input FASTQ files to predefined rRNA reference indices using Bowtie to define the level of rRNA contamination; export resulted statistics to file 7. Calculate isoform expression level for the sorted BAM file and GTF/TAB annotation file using GEEP reads-counting utility; export results to file |
Path: workflows/trim-rnaseq-pe-smarter-dutp.cwl Branch/Commit ID: 22880e0f41d0420a17d643e8a6e8ee18165bbfbf |
|
|
|
group-isoforms-batch.cwl
Workflow runs group-isoforms.cwl tool using scatter for isoforms_file input. genes_filename and common_tss_filename inputs are ignored. |
Path: tools/group-isoforms-batch.cwl Branch/Commit ID: 92f1a6da9c4f85fb51340b01b32373a50fde0891 |
|
|
|
PCA - Principal Component Analysis
Principal Component Analysis --------------- Principal component analysis (PCA) is a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables (entities each of which takes on various numerical values) into a set of values of linearly uncorrelated variables called principal components. The calculation is done by a singular value decomposition of the (centered and possibly scaled) data matrix, not by using eigen on the covariance matrix. This is generally the preferred method for numerical accuracy. |
Path: workflows/pca.cwl Branch/Commit ID: c6bfa0de917efb536dd385624fc7702e6748e61d |
|
|
|
mpi_simple_wf.cwl
Simple 2 step workflow to check that workflow steps are independently picking up on the number of processes. First run the parallel get PIDs step (on the input num procs) then run (on a single proc) the line count. This should equal the input. |
Path: tests/wf/mpi_simple_wf.cwl Branch/Commit ID: 0e8110083bad6ea98fc487aa262953a6c5e010b5 |
