Explore Workflows

View already parsed workflows here or click here to add your own

Graph	Name	Retrieved From	View
	Bacterial Annotation, pass 2, blastp-based functional annotation (first pass)	https://github.com/ncbi/pgap.git Path: bacterial_annot/wf_bacterial_annot_pass2.cwl Branch/Commit ID: 54c5074587af001a44eccb4762a4cb25fa24cb3e
	extract_gencoll_ids	https://github.com/ncbi/pgap.git Path: task_types/tt_extract_gencoll_ids.cwl Branch/Commit ID: 2afb5ebafd1353ba063cc74ee9a7eaf347afce5c
	Single-Cell Differential Abundance Analysis Single-Cell Differential Abundance Analysis Compares the composition of cell types between two tested conditions	https://github.com/datirium/workflows.git Path: workflows/sc-rna-da-cells.cwl Branch/Commit ID: 30031ca5e69cec603c4733681de54dc7bffa20a3
	Bacterial Annotation, ab initio (first pass) searched against AntiFam	https://github.com/ncbi/pgap.git Path: bacterial_annot/wf_ab_initio_antifam.cwl Branch/Commit ID: 54c5074587af001a44eccb4762a4cb25fa24cb3e
	Unaligned BAM to BQSR and VCF	https://github.com/genome/analysis-workflows.git Path: definitions/subworkflows/bam_to_bqsr_no_dup_marking.cwl Branch/Commit ID: 0805e8e0d358136468e0a9f49e06005e41965adc
	Kraken2 Database installation pipeline This workflow downloads the user-selected pre-built kraken2 database from: https://benlangmead.github.io/aws-indexes/k2 ### __Inputs__ Select a pre-built Kraken2 database to download and use for metagenomic classification: - Available options comprised of various combinations of RefSeq reference genome sets: - [Viral (0.5 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_viral_20221209.tar.gz), all refseq viral genomes - [MinusB (8.7 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_minusb_20221209.tar.gz), standard minus bacteria (archaea, viral, plasmid, human1, UniVec_Core) - [PlusPFP-16 (15.0 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_pluspfp_16gb_20221209.tar.gz), standard (archaea, bacteria, viral, plasmid, human1, UniVec_Core) + (protozoa, fungi & plant) capped at 16 GB (shrunk via random kmer downselect) - [EuPathDB46 (34.1 GB)](https://genome-idx.s3.amazonaws.com/kraken/k2_eupathdb48_20201113.tar.gz), eukaryotic pathogen genomes with contaminants removed (https://veupathdb.org/veupathdb/app) - [16S_gg_13_5 (73 MB)](https://genome-idx.s3.amazonaws.com/kraken/16S_Greengenes13.5_20200326.tgz), Greengenes 16S rRNA database ([release 13.5](https://greengenes.secondgenome.com/?prefix=downloads/greengenes_database/gg_13_5/), 20200326)\n - [16S_silva_138 (112 MB)](https://genome-idx.s3.amazonaws.com/kraken/16S_Silva138_20200326.tgz), SILVA 16S rRNA database ([release 138.1](https://www.arb-silva.de/documentation/release-1381/), 20200827) ### __Outputs__ - k2db, an upstream database used by kraken2 classification tool - compressed_k2db_tar, compressed and tarred kraken2 database directory file for download and use outside of scidap ### __Data Analysis Steps__ 1. download selected pre-built kraken2 database. 2. make available as upstream source for kraken2 metagenomic taxonomic classification. ### __References__ - Wood, D.E., Lu, J. & Langmead, B. Improved metagenomic analysis with Kraken 2. Genome Biol 20, 257 (2019). https://doi.org/10.1186/s13059-019-1891-0	https://github.com/datirium/workflows.git Path: workflows/kraken2-databases.cwl Branch/Commit ID: 30031ca5e69cec603c4733681de54dc7bffa20a3
	Subworkflow to allow calling cnvkit with cram instead of bam files	https://github.com/genome/analysis-workflows.git Path: definitions/subworkflows/cram_to_cnvkit.cwl Branch/Commit ID: 72e0bdc1ec449d86df4534132e9a30ad7e9b8afd
	Add snv and indel bam-readcount files to a vcf	https://github.com/genome/analysis-workflows.git Path: definitions/subworkflows/vcf_readcount_annotator.cwl Branch/Commit ID: 25aa4788dd4efb1cc8ed6f609cb7803896e4d28d
	iwdr_with_nested_dirs.cwl	https://github.com/common-workflow-language/cwltool.git Path: cwltool/schemas/v1.0/v1.0/iwdr_with_nested_dirs.cwl Branch/Commit ID: 8010fd2bf1e7090ba6df6ca8c84bbb96e2272d32
	DESeq - differential gene expression analysis Differential gene expression analysis ===================================== Differential gene expression analysis based on the negative binomial distribution Estimate variance-mean dependence in count data from high-throughput sequencing assays and test for differential expression based on a model using the negative binomial distribution. DESeq1 ------ High-throughput sequencing assays such as RNA-Seq, ChIP-Seq or barcode counting provide quantitative readouts in the form of count data. To infer differential signal in such data correctly and with good statistical power, estimation of data variability throughout the dynamic range and a suitable error model are required. Simon Anders and Wolfgang Huber propose a method based on the negative binomial distribution, with variance and mean linked by local regression and present an implementation, [DESeq](http://bioconductor.org/packages/release/bioc/html/DESeq.html), as an R/Bioconductor package DESeq2 ------ In comparative high-throughput sequencing assays, a fundamental task is the analysis of count data, such as read counts per gene in RNA-seq, for evidence of systematic changes across experimental conditions. Small replicate numbers, discreteness, large dynamic range and the presence of outliers require a suitable statistical approach. [DESeq2](http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html), a method for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates. This enables a more quantitative analysis focused on the strength rather than the mere presence of differential expression.	https://github.com/datirium/workflows.git Path: workflows/deseq.cwl Branch/Commit ID: b4d578c2ba4713a5a22163d9f8c7105acda1f22e