Workflow: Runs InterProScan on batches of sequences to retrieve functional annotations.

Fetched 2024-10-11 02:22:49 GMT
children parents
Workflow as SVG
  • Selected
  • Default Values
  • Nested Workflows
  • Tools
  • Inputs/Outputs

Inputs

ID Type Title Doc
seqtype https://w3id.org/cwl/view/git/72f702591368397f56d455128f60916902104dd2/workflows/InterProScan-v5-chunked-wf.cwl#seqtype/seqtype (Optional) Sequence type

Optional, the type of the input sequences (dna/rna (n) or protein (p)). The default sequence type is protein.

databases Directory
inputFile File [FASTA] Input file path

Optional, path to fasta file that should be loaded on Master startup. Alternatively, in CONVERT mode, the InterProScan 5 XML file to convert.

chunk_size Integer
applications https://w3id.org/cwl/view/git/72f702591368397f56d455128f60916902104dd2/tools/InterProScan/InterProScan-apps.yaml#apps[] (Optional) Analysis

Optional, comma separated list of analyses. If this option is not set, ALL analyses will be run.

outputFormat https://w3id.org/cwl/view/git/72f702591368397f56d455128f60916902104dd2/tools/InterProScan/InterProScan-protein_formats.yaml#protein_formats[] (Optional) output format

Optional, case-insensitive, comma separated list of output formats. Supported formats are TSV, XML, JSON, GFF3, HTML and SVG. Default for protein sequences are TSV, XML and GFF3, or for nucleotide sequences GFF3 and XML.

disableResidueAnnotation Boolean (Optional) Disables residue annotation

Optional, excludes sites from the XML, JSON output.

Steps

ID Runs Label Doc
split_seqs
../utils/fasta_chunker.cwl (CommandLineTool)
split FASTA by number of records

based upon code by developers from EMBL-EBI

combine_interproscan_results
../utils/concatenate.cwl (CommandLineTool)
Redirecting Multiple Files Contain in a Single File

The cat (short for “concatenate“) command is one of the most frequently used command in Linux/Unix like operating systems. cat command allows us to create single or multiple files, view contain of file, concatenate files and redirect output in terminal or files.

Outputs

ID Type Label Doc
i5Annotations File
Permalink: https://w3id.org/cwl/view/git/72f702591368397f56d455128f60916902104dd2/workflows/InterProScan-v5-chunked-wf.cwl