Workflow: ACTseq_spike_in.cwl

Fetched 2024-05-21 13:14:56 GMT
children parents
Workflow as SVG
  • Selected
  • Default Values
  • Nested Workflows
  • Tools
  • Inputs/Outputs

Inputs

ID Type Title Doc
fastq1 File[]

List of fastq files containing the first mate of raw reads. Muliple files are provided if multiplexing of the same library has been done on multiple lanes. The reads comming from different fastq files are pooled after alignment. Also see parameter \"fastq2\".

fastq2 File[]

List of fastq files containing the second mate of raw reads in case of paired end (also see parameter \"fastq1\"). Important: this list has to be of same length as parameter \"fastq1\" no matter if paired or single end is used. In case of single end data specify \"null\" for every entry of fastq1.

genome File

Path to reference genome in fasta format. Bowtie2 index files (\".1.bt2\", \".2.bt2\", ...) as well as a samtools index (\".fai\") has to be located in the same directory.\n All of these files can be downloaded for the most common genome builds at https://support.illumina.com/sequencing/sequencing_software/igenome.html. Alternatively, you can use \"bowtie2-build\" or \"samtools index\" to create them yourself.

adapter1 String (Optional)

Adapter sequence for first reads. If not specified (set to \"null\"), trim_galore will try to autodetect whether ...\n - Illumina universal adapter (AGATCGGAAGAGC)\n - Nextera adapter (CTGTCTCTTATA)\n - Illumina Small RNA 3-prime Adapter (TGGAATTCTCGG)\n ... was used.\n You can directly choose one of the above configurations by setting the string to \"illumina\", \"nextera\", or \"small_rna\". Or you specify the adaptor string manually (e.g. \"AGATCGGAAGAGC\").

adapter2 String (Optional)

Adapter sequence for second reads (only relevant for paired end data). If it is not specified (set to \"null\"), trim_galore will try to autodetect whether ...\n - Illumina universal adapter (AGATCGGAAGAGC)\n - Nextera adapter (CTGTCTCTTATA)\n - Illumina Small RNA 3-prime Adapter (TGGAATTCTCGG)\n ... was used.\n You can directly choose one of the above configurations by setting the string to \"illumina\", \"nextera\", or \"small_rna\". Or you specify the adaptor string manually (e.g. \"AGATCGGAAGAGC\").

bin_size Integer

Bin size used for generation of coverage tracks. The larger the bin size the smaller are the coverage tracks, however, the less precise is the signal. For single bp resolution set to 1.

sample_id String

Sample ID used for naming the output files.

fragment_size Integer (Optional)

Mean library fragment size, used to reconstruct entire fragments from single end reads. Not relevant in case of paired end data.

is_paired_end Boolean

If paired end data is used set to true, else set to false.

genome_spike_in File

Path to reference genome of the spike-in organism in fasta format. Bowtie2 index files (\".1.bt2\", \".2.bt2\", ...) as well as a samtools index (\".fai\") has to be located in the same directory.\n All of these files can be downloaded for the most common genome builds at https://support.illumina.com/sequencing/sequencing_software/igenome.html. Alternatively, you can use \"bowtie2-build\" or \"samtools index\" to create them yourself.

effective_genome_size Long

The effectively mappable genome size, please see: https://deeptools.readthedocs.io/en/latest/content/feature/effectiveGenomeSize.html

ignoreForNormalization String (Optional)

List of space-delimited chromosome names that shall be ignored when calculating the scaling factor. Specify as space-delimited string. Default: \"chrX chrY chrM\"

Steps

ID Runs Label Doc
chip_qc
merge_filter
trim_and_map
get_spike_in_counts
indexing_shifted_bam
../tools/samtools_index_hack.cwl (CommandLineTool)
tn5_overhang_correction
../tools/tn5_overhang_correction.cwl (CommandLineTool)
create_summary_qc_report
../tools/multiqc_hack.cwl (CommandLineTool)
generate_coverage_tracks
../tools/deeptools_bamCoverage.cwl (CommandLineTool)

Outputs

ID Type Label Doc
bam File
bigwig File
fastqc_zip File[]
bowtie2_log File[]
fastqc_html File[]
multiqc_zip File
bam_spike_in File
multiqc_html File
raw_fastqc_zip 290f386e8ddc65a3565d3bde23271613[]
raw_fastqc_html b18815a98ee72b38c9d292c37513f7ad[]
spike_in_counts File
trim_galore_log 409c4a0dc993ce056c9dba7471cf2652[]
bam_tn5_corrected File
qc_crosscorr_plot File (Optional)
trimmed_fastqc_zip 36b35000c941ba9621e2bb6752c609c5[]
trimmed_fastqc_html 17199a5cfb98a6efbe072ff29cc18e71[]
qc_crosscorr_summary File (Optional)
qc_plot_coverage_tsv File
qc_plot_coverage_plot File
qc_plot_fingerprint_tsv File (Optional)
qc_plot_fingerprint_plot File (Optional)
qc_plot_fingerprint_stderr File
qc_phantompeakqualtools_stderr File (Optional)
Permalink: https://w3id.org/cwl/view/git/8526687739a6802eeb08e97b27c20010225d5eb5/CWL/workflows/ACTseq_spike_in.cwl