Explore Workflows

View already parsed workflows here or click here to add your own

Graph	Name	Retrieved From	View
	bacterial_kmer	https://github.com/ncbi/pgap.git Path: bacterial_kmer/wf_bacterial_kmer.cwl Branch/Commit ID: 369e2b6c7f4db75099d258729dec1326f55d2cc5
	group-isoforms-batch.cwl Workflow runs group-isoforms.cwl tool using scatter for isoforms_file input. genes_filename and common_tss_filename inputs are ignored.	https://github.com/datirium/workflows.git Path: tools/group-isoforms-batch.cwl Branch/Commit ID: b5e16e359007150647b14dc6e038f4eb8dccda79
	List ZIP content for zenodo community For a given Zenodo community, list file content of its downloadable .zip files*	https://github.com/stain/ro-index-paper.git Path: code/data-gathering/workflows/zenodo-zip-content.cwl Branch/Commit ID: dedbb79f76f110eaafd065aee8401052c8d51a0e
	Cellranger ATAC Aggregate Cellranger ATAC Aggregate Aggregates outputs from multiple runs of Cell Ranger Count Chromatin Accessibility experiments	https://github.com/datirium/workflows.git Path: workflows/cellranger-atac-aggr.cwl Branch/Commit ID: 22880e0f41d0420a17d643e8a6e8ee18165bbfbf
	Filter Protein Alignments	https://github.com/ncbi/pgap.git Path: protein_alignment/wf_align_filter.cwl Branch/Commit ID: 4b73bfeb967ee9f57a0410276f7c39e784f0846f
	ani_top_n	https://github.com/ncbi/pgap.git Path: task_types/tt_ani_top_n.cwl Branch/Commit ID: 861d9baa067af98d794ba0ed4e43aa42e37d8a24
	Salmon quantification, FASTQ -> H5AD count matrix	https://github.com/lux563624348/WDL-HuBMAP-salmon-rnaseq.git Path: steps/salmon-quantification.cwl Branch/Commit ID: d020fd86704208b32b513b3edfc2e2d1e0b85022
	Cell Ranger Count (RNA) Cell Ranger Count (RNA) Quantifies single-cell gene expression of the sequencing data from a single 10x Genomics library. The results of this workflow are primarily used in either “Single-Cell RNA-Seq Filtering Analysis” or “Cell Ranger Aggregate (RNA, RNA+VDJ)” pipelines.	https://github.com/datirium/workflows.git Path: workflows/single-cell-preprocess-cellranger.cwl Branch/Commit ID: cc6fa135d04737fdde3b4414d6e214cf8c812f6e
	SoupX Estimate SoupX Estimate ==============	https://github.com/datirium/workflows.git Path: workflows/soupx.cwl Branch/Commit ID: 00ea05e22788029370898fd4c17798b11edf0e57
	16S metagenomic paired-end QIIME2 Sample (preprocessing) A workflow for processing a single 16S sample via a QIIME2 pipeline. ## __Outputs__ #### Output files: - overview.md, list of inputs - demux.qzv, summary visualizations of imported data - alpha-rarefaction.qzv, plot of OTU rarefaction - taxa-bar-plots.qzv, relative frequency of taxomonies barplot ## __Inputs__ #### General Info - Sample short name/Alias: Used for samplename in downstream analyses. Ensure this is the same name used in the metadata samplesheet. - Environment: where the sample was collected - Catalog No.: catalog number if available (optional) - Read 1 FASTQ file: Read 1 FASTQ file from a paired-end sequencing run. - Read 2 FASTQ file: Read 2 FASTQ file that pairs with the input R1 file. - Trim 5' of R1: Recommended if adapters are still on the input sequences. Trims the first J bases from the 5' end of each forward read. - Trim 5' of R2: Recommended if adapters are still on the input sequences. Trims the first K bases from the 5' end of each reverse read. - Truncate 3' of R1: Recommended if quality drops off along the length of the read. Clips the forward read starting M bases from the 5' end (before trimming). - Truncate 3' of R2: Recommended if quality drops off along the length of the read. Clips the reverse read starting N bases from the 5' end (before trimming). - Threads: Number of threads to use for steps that support multithreading. ### __Data Analysis Steps__ 1. Generate FASTX quality statistics for visualization of unmapped, raw FASTQ reads. 2. Import the data, make a qiime artifact (demux.qza), and summary visualization 3. Denoising will detect and correct (where possible) Illumina amplicon sequence data. This process will additionally filter any phiX reads (commonly present in marker gene Illumina sequence data) that are identified in the sequencing data, and will filter chimeric sequences. 4. Generate a phylogenetic tree for diversity analyses and rarefaction processing and plotting. 5. Taxonomy classification of amplicons. Performed using a Naive Bayes classifier trained on the Greengenes2 database \"gg_2022_10_backbone_full_length.nb.qza\". ### __References__ 1. Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, Alexander H, Alm EJ, Arumugam M, Asnicar F, Bai Y, Bisanz JE, Bittinger K, Brejnrod A, Brislawn CJ, Brown CT, Callahan BJ, Caraballo-Rodríguez AM, Chase J, Cope EK, Da Silva R, Diener C, Dorrestein PC, Douglas GM, Durall DM, Duvallet C, Edwardson CF, Ernst M, Estaki M, Fouquier J, Gauglitz JM, Gibbons SM, Gibson DL, Gonzalez A, Gorlick K, Guo J, Hillmann B, Holmes S, Holste H, Huttenhower C, Huttley GA, Janssen S, Jarmusch AK, Jiang L, Kaehler BD, Kang KB, Keefe CR, Keim P, Kelley ST, Knights D, Koester I, Kosciolek T, Kreps J, Langille MGI, Lee J, Ley R, Liu YX, Loftfield E, Lozupone C, Maher M, Marotz C, Martin BD, McDonald D, McIver LJ, Melnik AV, Metcalf JL, Morgan SC, Morton JT, Naimey AT, Navas-Molina JA, Nothias LF, Orchanian SB, Pearson T, Peoples SL, Petras D, Preuss ML, Pruesse E, Rasmussen LB, Rivers A, Robeson MS, Rosenthal P, Segata N, Shaffer M, Shiffer A, Sinha R, Song SJ, Spear JR, Swafford AD, Thompson LR, Torres PJ, Trinh P, Tripathi A, Turnbaugh PJ, Ul-Hasan S, van der Hooft JJJ, Vargas F, Vázquez-Baeza Y, Vogtmann E, von Hippel M, Walters W, Wan Y, Wang M, Warren J, Weber KC, Williamson CHD, Willis AD, Xu ZZ, Zaneveld JR, Zhang Y, Zhu Q, Knight R, and Caporaso JG. 2019. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nature Biotechnology 37: 852–857. https://doi.org/10.1038/s41587-019-0209-9	https://github.com/datirium/workflows.git Path: workflows/qiime2-sample-pe.cwl Branch/Commit ID: cc6fa135d04737fdde3b4414d6e214cf8c812f6e