- Selected
- |
- Default Values
- Nested Workflows
- Tools
- Inputs/Outputs
Inputs
ID | Type | Title | Doc |
---|---|---|---|
nthreads | Integer | ||
fgbio_jar_path | String |
fgbio Java jar file |
|
picard_jar_path | String |
Picard Java jar file |
|
picard_java_opts | String (Optional) |
JVM arguments should be a quoted, space separated list (e.g. \"-Xms128m -Xmx512m\") |
|
regions_bed_file | File |
Regions bed file used to filter-in reads (used in samtools) |
|
genome_sizes_file | File |
Genome sizes tab-delimited file (used in samtools) |
|
input_fastq_umi_files | File[] |
Input fastq with UMIs files |
|
input_fastq_read1_files | File[] |
Input fastq paired-end read 1 files |
|
input_fastq_read2_files | File[] |
Input fastq paired-end read 2 files |
|
ENCODE_blacklist_bedfile | File |
Bedfile containing ENCODE consensus blacklist regions to be excluded. |
|
genome_ref_first_index_file | File |
Bowtie first index files for reference genome (e.g. *1.bt2). The rest of the files should be in the same folder. |
Steps
ID | Runs | Label | Doc |
---|---|---|---|
bowtie2 |
../map/bowtie2.cwl
(CommandLineTool)
|
Bowtie 2 version 2.2.8 by Ben Langmead (langmea@cs.jhu.edu, www.cs.jhu.edu/~langmea)
Usage:
bowtie2 [options]* -x <bt2-idx> {-1 <m1> -2 <m2> | -U <r>} [-S <sam>] |
|
basename |
../utils/basename.cwl
(ExpressionTool)
|
||
sort_bams |
../map/samtools-sort.cwl
(CommandLineTool)
|
||
index_bams |
../map/samtools-index.cwl
(CommandLineTool)
|
||
bam_to_bepe |
../map/bedtools-bamtobed.cwl
(CommandLineTool)
|
||
cut_to_bepe |
../utils/cut.cwl
(CommandLineTool)
|
Cut columns from input file. |
|
sort_to_bepe |
../utils/sort.cwl
(CommandLineTool)
|
Usage: sort [OPTION]... [FILE]...
Write sorted concatenation of all FILE(s) to standard output. |
|
preseq-c-curve |
../map/preseq-c_curve.cwl
(CommandLineTool)
|
Usage: c_curve [OPTIONS] <sorted-bed-file> |
|
mark_duplicates |
../map/picard-MarkDuplicates.cwl
(CommandLineTool)
|
||
sort_dedup_bams |
../map/samtools-sort.cwl
(CommandLineTool)
|
||
index_dedup_bams |
../map/samtools-index.cwl
(CommandLineTool)
|
||
sort_masked_bams |
../map/samtools-sort.cwl
(CommandLineTool)
|
||
index_masked_bams |
../map/samtools-index.cwl
(CommandLineTool)
|
||
remove_duplicates |
../map/samtools-view.cwl
(CommandLineTool)
|
||
extract_basename_1 |
../utils/extract-basename.cwl
(CommandLineTool)
|
Extracts the base name of a file |
|
extract_basename_2 |
../utils/remove-extension.cwl
(CommandLineTool)
|
Extracts the base name of a file |
|
sort_bams_by_tag_name |
../map/samtools-sort.cwl
(CommandLineTool)
|
||
index_dups_marked_bams |
../map/samtools-index.cwl
(CommandLineTool)
|
||
annotate_bams_with_umis |
../map/fgbio-AnnotateBamWithUmis.cwl
(CommandLineTool)
|
AnnotateBamWithUmis
------------------------------------------------------------------------------------------------------------------------
Annotates existing BAM files with UMIs (Unique Molecular Indices, aka Molecular IDs, Molecular barcodes) from a
separate FASTQ file. Takes an existing BAM file and a FASTQ file consisting of UMI reads, matches the reads between the
files based on read names, and produces an output BAM file where each record is annotated with an optional tag
(specified by 'attribute') that contains the read sequence of the UMI. Trailing read numbers ('/1' or '/2') are removed
from FASTQ read names, as is any text after whitespace, before matching. |
|
remove_encode_blacklist |
../map/bedtools-pairtobed.cwl
(CommandLineTool)
|
Tool: bedtools pairtobed (aka pairToBed)
Version: v2.25.0
Summary: Report overlaps between a BEDPE file and a BED/GFF/VCF file. |
|
filter_quality_alignments |
../map/samtools-view.cwl
(CommandLineTool)
|
Outputs
ID | Type | Label | Doc |
---|---|---|---|
output_bowtie_log | File[] |
Bowtie log file. |
|
output_data_bam_files | File[] |
BAM files with aligned reads. |
|
output_templates_files | File[] |
Tags/templates coordinates, sorted by chromosome and position (sort -k1,1 -k2,2g). |
|
output_data_dedup_bam_files | File[] |
Dedup BAM files with aligned reads. |
|
output_preseq_c_curve_files | File[] |
Preseq c_curve output files. |
|
output_data_unmapped_fastq_files | File[] |
FASTQ gzipped files with unmapped reads. |
|
output_picard_mark_duplicates_files | File[] |
Picard MarkDuplicates metrics files. |
https://w3id.org/cwl/view/git/6e008c1170ef818b6c4c63f0eec7baa4f7be7b3c/v1.0/STARR-seq_pipeline/03-map-pe-umis.cwl