Workflow: protein similarities
run diamond on mutlple DBs and merge-sort results
- Selected
- |
- Default Values
- Nested Workflows
- Tools
- Inputs/Outputs
This workflow is Open Source and may be reused according to the terms of:
BSD 2-clause "Simplified" License
Note that the tools invoked by the workflow may have separate licenses.
Inputs
ID | Type | Title | Doc |
---|---|---|---|
jobid | String | ||
m5nrFull | File[] | ||
sequences | File |
Steps
ID | Runs | Label | Doc |
---|---|---|---|
diamond |
../Tools/diamond.tool.cwl
(CommandLineTool)
|
diamond |
multi-threaded fast sequence search command line tool, protein only >diamond -t <tempdir> -b <blocksize> -d <database> -q <query> -o <output> |
mergeSims |
../Tools/sort.tool.cwl
(CommandLineTool)
|
GNU sort |
sort text file base on given field(s) |
bleachSims |
../Tools/bleachsims.tool.cwl
(CommandLineTool)
|
bleachsims |
filter similarity file by E-value and number of hits >bleachsims -s <input> -o <output> -m 20 -r 0 -c 3 |
Outputs
ID | Type | Label | Doc |
---|---|---|---|
protSimsOut | File |
Permalink:
https://w3id.org/cwl/view/git/f5839797da8209a9d3e441023f88130219751020/CWL/Workflows/protein-diamond.workflow.cwl