Workflow: protein similarities

Fetched 2024-11-28 04:44:48 GMT

run diamond on mutlple DBs and merge-sort results

children parents
Workflow as SVG
  • Selected
  • Default Values
  • Nested Workflows
  • Tools
  • Inputs/Outputs

Inputs

ID Type Title Doc
jobid String
m5nrFull File[]
sequences File

Steps

ID Runs Label Doc
diamond
../Tools/diamond.tool.cwl (CommandLineTool)
diamond

multi-threaded fast sequence search command line tool, protein only >diamond -t <tempdir> -b <blocksize> -d <database> -q <query> -o <output>

mergeSims
../Tools/sort.tool.cwl (CommandLineTool)
GNU sort

sort text file base on given field(s)

bleachSims
../Tools/bleachsims.tool.cwl (CommandLineTool)
bleachsims

filter similarity file by E-value and number of hits >bleachsims -s <input> -o <output> -m 20 -r 0 -c 3

Outputs

ID Type Label Doc
protSimsOut File
Permalink: https://w3id.org/cwl/view/git/f5839797da8209a9d3e441023f88130219751020/CWL/Workflows/protein-diamond.workflow.cwl