Explore Workflows

View already parsed workflows here or click here to add your own

Graph	Name	Retrieved From	View
	wf-loadContents4.cwl	https://github.com/common-workflow-language/cwl-v1.2.git Path: tests/wf-loadContents4.cwl Branch/Commit ID: a5073143db4155e05df8d2e7eb59d9e62acd65a5
	scatter GATK HaplotypeCaller over intervals	https://github.com/genome/analysis-workflows.git Path: definitions/subworkflows/gatk_haplotypecaller_iterator.cwl Branch/Commit ID: 6949082038c1ad36d6e9848b97a2537aef2d3805
	heatmap-prepare.cwl Workflow runs homer-make-tag-directory.cwl tool using scatter for the following inputs - bam_file - fragment_size - total_reads `dotproduct` is used as a `scatterMethod`, so one element will be taken from each array to construct each job: 1) bam_file[0] fragment_size[0] total_reads[0] 2) bam_file[1] fragment_size[1] total_reads[1] ... N) bam_file[N] fragment_size[N] total_reads[N] `bam_file`, `fragment_size` and `total_reads` arrays should have the identical order.	https://github.com/Barski-lab/workflows.git Path: subworkflows/heatmap-prepare.cwl Branch/Commit ID: 915ea871cc28f7b666a4d7b476fdaa7b454ce7c3
	qc_workflow_wo_waltz.cwl This workflow is intended to be used to test the QC module, without having to run the long waltz step	https://github.com/mskcc/innovation-pipeline.git Path: workflows/QC/qc_workflow_wo_waltz.cwl Branch/Commit ID: b0f226a9ac5152f3afe0d38c8cd54aa25b8b01cf
	Kraken2 Metagenomic pipeline paired-end This workflow taxonomically classifies paired-end sequencing reads in FASTQ format, that have been optionally adapter trimmed with trimgalore, using Kraken2 and a user-selected pre-built database from a list of [genomic index files](https://benlangmead.github.io/aws-indexes/k2). ### __Inputs__ Kraken2 database for taxonomic classification: - [Viral (0.5 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_viral_20221209.tar.gz), all refseq viral genomes - [MinusB (8.7 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_minusb_20221209.tar.gz), standard minus bacteria (archaea, viral, plasmid, human1, UniVec_Core) - [PlusPFP-16 (15.0 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_pluspfp_16gb_20221209.tar.gz), standard (archaea, bacteria, viral, plasmid, human1, UniVec_Core) + (protozoa, fungi & plant) capped at 16 GB (shrunk via random kmer downselect) - [EuPathDB46 (34.1 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_eupathdb48_20201113.tar.gz), eukaryotic pathogen genomes with contaminants removed (https://veupathdb.org/veupathdb/app) - [16S_gg_13_5 (73 MB)](https://genome-idx.s3.amazonaws.com/kraken/16S_Greengenes13.5_20200326.tgz), Greengenes 16S rRNA database ([release 13.5](https://greengenes.secondgenome.com/?prefix=downloads/greengenes_database/gg_13_5/), 20200326)\n - [16S_silva_138 (112 MB)](https://genome-idx.s3.amazonaws.com/kraken/16S_Silva138_20200326.tgz), SILVA 16S rRNA database ([release 138.1](https://www.arb-silva.de/documentation/release-1381/), 20200827) Read 1 file: - FASTA/Q input R1 from a paired end library Read 2 file: - FASTA/Q input R2 from a paired end library Number of threads for steps that support multithreading: - Number of threads for steps that support multithreading - default set to `4` Advanced Inputs Tab (Optional): - Number of bases to clip from the 3p end - Number of bases to clip from the 5p end ### __Outputs__ - k2db, an upstream database used by kraken2 classifier ### __Data Analysis Steps__ 1. Trimming the adapters with TrimGalore. - This step is particularly important when the reads are long and the fragments are short - resulting in sequencing adapters at the ends of reads. If adapter is not removed the read will not map. TrimGalore can recognize standard adapters, such as Illumina or Nextera/Tn5 adapters. 2. Generate quality control statistics of trimmed, unmapped sequence data 3. (Optional) Clipping of 5' and/or 3' end by the specified number of bases. 4. Mapping reads to primary genome index with Bowtie. ### __References__ - Wood, D.E., Lu, J. & Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol 20, 257 (2019). https://doi.org/10.1186/s13059-019-1891-0	https://github.com/datirium/workflows.git Path: workflows/kraken2-classify-pe.cwl Branch/Commit ID: aebf2355539fdf81fd9082616f8b21440d2691c6
	Cellranger reanalyze - reruns secondary analysis performed on the feature-barcode matrix Devel version of Single-Cell Cell Ranger Reanalyze ================================================== Workflow calls \"cellranger aggr\" command to rerun secondary analysis performed on the feature-barcode matrix (dimensionality reduction, clustering and visualization) using different parameter settings. As an input we use filtered feature-barcode matrices in HDF5 format from cellranger count or aggr experiments. Note, we don't pass aggregation_metadata from the upstream cellranger aggr step. Need to address this issue when needed.	https://github.com/datirium/workflows.git Path: workflows/cellranger-reanalyze.cwl Branch/Commit ID: 57437c1e9f881411b65f79acd64b7cf14df5b901
	Filter Protein Alignments	https://github.com/ncbi/pgap.git Path: protein_alignment/wf_align_filter.cwl Branch/Commit ID: 656113dcac0de7cef6cff6c688f61441ee05872a
	Build Bismark indices Copy fasta_file file to the folder and run run bismark_genome_preparation script to prepare indices for Bismark Methylation Analysis. Bowtie2 aligner is used by default. The name of the output indices folder is equal to the genome input.	https://github.com/datirium/workflows.git Path: workflows/bismark-index.cwl Branch/Commit ID: 954bb2f213d97dfef1cddaf9e830169a92ad0c6b
	Workflow to run pVACseq from detect_variants and rnaseq pipeline outputs	https://github.com/genome/analysis-workflows.git Path: definitions/pipelines/pvacseq.cwl Branch/Commit ID: 49508a2757ff2f49f1c200774a38af1c12b531bf
	Detect Variants workflow	https://github.com/genome/analysis-workflows.git Path: definitions/pipelines/detect_variants.cwl Branch/Commit ID: 49508a2757ff2f49f1c200774a38af1c12b531bf