Workflow: retrieve metadata from Zenodo community
For a given Zenodo community, retrieve its repository records as Zenodo JSON and (eventually) schema.org JSON-LD and DataCite v4 XML.
Inputs
ID | Type | Title | Doc |
---|---|---|---|
community | String (Optional) | The short-name of the Zenodo community, e.g. "ro" for <https://zenodo.org/communities/ro>. Use null for all of Zenodo. |
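How the optional `community` input could shape the first request can be sketched as follows. This is a minimal illustration, not the workflow's own code: the endpoint `https://zenodo.org/oai2d`, the `oai_dc` metadata prefix, and the `user-<short-name>` set naming are assumptions not stated in this description, and `list_identifiers_url` is a hypothetical helper.

```python
from typing import Optional
from urllib.parse import urlencode

# Assumption: Zenodo's OAI-PMH endpoint; not given in the workflow description.
OAI_BASE = "https://zenodo.org/oai2d"


def list_identifiers_url(community: Optional[str] = None,
                         metadata_prefix: str = "oai_dc") -> str:
    """Build an OAI-PMH ListIdentifiers URL (hypothetical helper)."""
    params = {"verb": "ListIdentifiers", "metadataPrefix": metadata_prefix}
    if community is not None:
        # Assumption: a community is exposed as the OAI-PMH set "user-<short-name>".
        params["set"] = f"user-{community}"
    # With community=None no set is sent, i.e. all of Zenodo is listed.
    return f"{OAI_BASE}?{urlencode(params)}"


print(list_identifiers_url("ro"))
print(list_identifiers_url(None))
```

Leaving out the `set` parameter when the input is null mirrors the "Use null for all of Zenodo" behaviour described above.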
Steps
ID | Runs | Label | Doc |
---|---|---|---|
list-ids | ../tools/oai-pmh.cwl (CommandLineTool) | oai_pmh ListIdentifiers | List repository identifiers using the OAI-PMH protocol. |
make-uri | ../tools/sed.cwl (CommandLineTool) | sed search-replace | Search-replace a stream using regular expressions and other SED commands. |
gather-json | ../tools/gather-directory.cwl (ExpressionTool) | Gather directory | Gather an array of arrays of Files, flattened into a single Directory. Files are assumed to have unique filenames. Optionally, the files can be renamed to add a provided file extension. |
chunk-by-line | ../tools/split-lines.cwl (CommandLineTool) | split a file into smaller pieces | Output pieces of FILE by splitting it into multiple files, e.g. "xaa", "xab", "xac", ... where "x" is the default prefix. The filename suffixes are in alphabetical order, matching the order of the input, and use the suffix letters a-z. The suffix is expanded by 2 characters before it is exhausted at "z", making increasingly longer filenames that are still ordered correctly, e.g. "yy", "yz", "zaaa", "zaab", later "zyzz", "zzaaaa", "zzaaab". |
gather-jsonld | ../tools/gather-directory.cwl (ExpressionTool) | Gather directory | Gather an array of arrays of Files, flattened into a single Directory. Files are assumed to have unique filenames. Optionally, the files can be renamed to add a provided file extension. |
gather-datacite4 | ../tools/gather-directory.cwl (ExpressionTool) | Gather directory | Gather an array of arrays of Files, flattened into a single Directory. Files are assumed to have unique filenames. Optionally, the files can be renamed to add a provided file extension. |
fetch-zenodo-json | ../tools/curl-get-many.cwl (CommandLineTool) | curl: download HTTP files from URLs | curl will download the HTTP/HTTPS files from the given URLs, following any redirections. A list of files is returned, using filenames reflecting the URI path. |
split-ids-by-line | ../tools/split-by-line.cwl (ExpressionTool) | Split by line | Read file content and split by newline into array of strings. |
fetch-zenodo-jsonld | ../tools/curl-get-many.cwl (CommandLineTool) | curl: download HTTP files from URLs | curl will download the HTTP/HTTPS files from the given URLs, following any redirections. A list of files is returned, using filenames reflecting the URI path. |
fetch-zenodo-datacite4 | ../tools/curl-get-many.cwl (CommandLineTool) | curl: download HTTP files from URLs | curl will download the HTTP/HTTPS files from the given URLs, following any redirections. A list of files is returned, using filenames reflecting the URI path. |
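The suffix scheme described for `chunk-by-line` is easier to follow with a small generator. The sketch below reproduces only the naming pattern from the step's documentation ("aa" ... "yz", then "zaaa" ... "zyzz", then "zzaaaa" ...); it does not call the underlying tool, and `split_suffixes` is a hypothetical name introduced for illustration.

```python
from itertools import count, islice, product
from string import ascii_lowercase


def split_suffixes():
    """Yield suffixes in the order described for chunk-by-line (hypothetical helper)."""
    for level in count():
        # Each expansion adds a leading "z" and one more trailing letter,
        # so the suffix grows by 2 characters per level.
        width = 2 + level
        # The first varying letter stops at "y"; "z" is reserved as the expansion marker.
        for first in ascii_lowercase[:-1]:
            for rest in product(ascii_lowercase, repeat=width - 1):
                yield "z" * level + first + "".join(rest)


# First three chunk names with the default prefix "x".
print(["x" + s for s in islice(split_suffixes(), 3)])  # ['xaa', 'xab', 'xac']
```

Because "z" never appears as the first varying letter, every longer suffix sorts after every shorter one, which is why the generated filenames stay in input order under a plain lexicographic sort.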
Outputs
ID | Type | Label | Doc |
---|---|---|---|
zenodo-json | Directory | A directory of Zenodo JSON records retrieved from the given Zenodo community |
zenodo-jsonld | Directory | A directory of Zenodo schema.org JSON-LD records retrieved from the given Zenodo community |
zenodo-datacite4 | Directory | A directory of Zenodo DataCite v4 XML records retrieved from the given Zenodo community |
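Each output directory is produced by a "Gather directory" step, which flattens an array of arrays of Files into one Directory and can append a file extension. A rough file-system analogue is sketched below. The actual step is a CWL ExpressionTool, so this is only an illustration of the described behaviour under stated assumptions: `gather_directory` is a hypothetical helper, and copying files and raising on a name clash are choices made for this sketch.

```python
import shutil
from pathlib import Path
from typing import Iterable, Optional


def gather_directory(batches: Iterable[Iterable[Path]],
                     dest: Path,
                     extension: Optional[str] = None) -> Path:
    """Flatten batches of files into dest, optionally appending an extension."""
    dest.mkdir(parents=True, exist_ok=True)
    for batch in batches:
        for src in batch:
            # Optionally rename by appending the provided extension, e.g. ".json".
            target = dest / (src.name + (extension or ""))
            # The step assumes unique filenames; this sketch fails loudly
            # instead of silently overwriting (an assumption, not source behaviour).
            if target.exists():
                raise FileExistsError(f"duplicate filename: {target.name}")
            shutil.copy(src, target)
    return dest
```

For example, `gather_directory(batches, Path("zenodo-json"), extension=".json")` would collect every downloaded file from all chunks into a single `zenodo-json` directory.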
https://w3id.org/cwl/view/git/5f4b5aab2a9e0fa53eb3d49d62a52d47025a967d/code/data-gathering/workflows/zenodo-records.cwl