Workflow: word-mapping-dir.cwl#word-mapping-wf.cwl

Fetched 2020-08-14 16:12:37 GMT
children parents
Workflow as SVG
  • Selected
  • Default Values
  • Nested Workflows
  • Tools
  • Inputs/Outputs

Inputs

ID Type Title Doc
gs_files File[]
lowercase Boolean (Optional)
align_c String (Optional)
ocr_files File[]
wm_name String (Optional)
language String
align_m String (Optional)

Steps

ID Runs Label Doc
align-texts-wf-2
merge-csv-2
word-mapping-dir.cwl#merge-csv.cwl (CommandLineTool)

Merge csv files (with the same header) into a single csv file.

normalize-whitespace-punctuation-2
word-mapping-dir.cwl#normalize-whitespace-punctuation.cwl (CommandLineTool)

Normalize whitespace and punctuation.

Replace multiple subsequent occurrences of whitespace characters and punctuation with a single occurrence.

pattern-1
word-mapping-dir.cwl#pattern.cwl (CommandLineTool)

Parse text using `pattern <https://www.clips.uantwerpen.be/pattern>`_.

Does tokenization, lemmatization and part of speech tagging. The default language is English, but other languages can be specified (``--language [en|es|de|fr|it|nl]``).

Output is `saf <https://github.com/vanatteveldt/saf>`_.

normalize-whitespace-punctuation-3
word-mapping-dir.cwl#normalize-whitespace-punctuation.cwl (CommandLineTool)

Normalize whitespace and punctuation.

Replace multiple subsequent occurrences of whitespace characters and punctuation with a single occurrence.

create-word-mappings-1
word-mapping-dir.cwl#create-word-mappings.cwl (CommandLineTool)

Outputs

ID Type Label Doc
wm_mapping File
Permalink: https://w3id.org/cwl/view/git/a62bf3b31df83784c017d30a83ed8e01d454bf1c/ochre/cwl/word-mapping-dir.cwl?part=word-mapping-wf.cwl