Explore Workflows


Graph Name Retrieved From View
workflow graph Motif Finding with HOMER with random background regions

Motif Finding with HOMER with random background regions
---------------------------------------------------

HOMER contains a novel motif discovery algorithm that was designed for regulatory element analysis in genomics applications (DNA only, no protein). It is a differential motif discovery algorithm, which means that it takes two sets of sequences and tries to identify the regulatory elements that are specifically enriched in one set relative to the other. It uses ZOOPS scoring (zero or one occurrence per sequence) coupled with hypergeometric (or binomial) enrichment calculations to determine motif enrichment. HOMER also tries its best to account for sequence bias in the dataset. It was designed with ChIP-Seq and promoter analysis in mind, but can be applied to pretty much any nucleic acid motif finding problem.

Here is how we generate background for Motifs Analysis
-------------------------------------

1. Take an input file with regions in the form "chr" "start" "end"
2. Sort this regions file and remove duplicates
3. Extend each region by 20 kb in both directions
4. Merge all overlapping extended regions
5. Subtract the original (non-extended) regions from the extended ones
6. Randomly distribute the original regions within the regions obtained in the previous step
7. Extract a FASTA file from these randomly distributed regions and use it as background

For more information please refer to:
-------------------------------------

[Official documentation](http://homer.ucsd.edu/homer/motif/)
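The background-generation steps in the description above can be sketched in plain Python. This is only an illustration under stated assumptions, not the workflow's actual implementation: the function names (`merge`, `subtract`, `background_regions`) are hypothetical, intervals are treated as half-open, extended regions are clipped at position 0 but not at chromosome ends, and the FASTA extraction of step 7 is left out.

```python
import random

FLANK = 20_000  # 20 kb extension, as stated in the description


def merge(intervals):
    """Merge overlapping or touching (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def subtract(intervals, holes):
    """Cut every hole out of every interval (both sorted and merged)."""
    result = []
    for start, end in intervals:
        cursor = start
        for h_start, h_end in holes:
            if h_end <= cursor or h_start >= end:
                continue  # hole does not touch the remaining part
            if h_start > cursor:
                result.append((cursor, h_start))
            cursor = max(cursor, h_end)
        if cursor < end:
            result.append((cursor, end))
    return result


def background_regions(regions, rng, flank=FLANK):
    """regions: iterable of (chrom, start, end) tuples.

    Returns one randomly placed region of the same length per unique
    input region, whenever a large enough flanking slot exists.
    """
    by_chrom = {}
    for chrom, start, end in sorted(set(regions)):  # step 2
        by_chrom.setdefault(chrom, []).append((start, end))

    background = []
    for chrom, originals in by_chrom.items():
        extended = merge([(max(0, s - flank), e + flank)
                          for s, e in originals])        # steps 3-4
        allowed = subtract(extended, merge(originals))   # step 5
        for start, end in originals:                     # step 6
            length = end - start
            slots = [(a, b) for a, b in allowed if b - a >= length]
            if not slots:
                continue  # assumption: skip regions that cannot be placed
            a, b = rng.choice(slots)
            new_start = rng.randint(a, b - length)
            background.append((chrom, new_start, new_start + length))
    return background
```

For example, `background_regions([("chr1", 100000, 100200), ("chr1", 150000, 150500)], random.Random(0))` returns two regions of 200 bp and 500 bp that lie within 20 kb of the inputs without overlapping them. In this sketch the random regions may overlap each other; whether the real workflow allows that is not stated in the description.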

https://github.com/datirium/workflows.git

Path: workflows/homer-motif-analysis.cwl

Branch/Commit ID: 9850a859de1f42d3d252c50e15701928856fe774

workflow graph gcaccess_from_list

https://github.com/ncbi/pgap.git

Path: task_types/tt_gcaccess_from_list.cwl

Branch/Commit ID: e2a6cbcc36212433d8fbc804919442787a5e2a49

workflow graph cmsearch-multimodel.cwl

https://github.com/EBI-Metagenomics/ebi-metagenomics-cwl.git

Path: workflows/cmsearch-multimodel.cwl

Branch/Commit ID: 7bb76f33bf40b5cd2604001cac46f967a209c47f

workflow graph final-workflow.cwl

https://github.com/NAL-i5K/Organism_Onboarding.git

Path: final-workflow.cwl

Branch/Commit ID: 0b58c250e8ab7c5efae29443f08ea74316127041

workflow graph readme-genePrediction-workflow.cwl

https://github.com/NAL-i5K/Organism_Onboarding.git

Path: flow_create_readme/readme-genePrediction-workflow.cwl

Branch/Commit ID: 7198756b4b1519d102178042924671bd677e9b17

workflow graph umi molecular alignment fastq workflow

https://github.com/genome/analysis-workflows.git

Path: definitions/pipelines/alignment_umi_molecular.cwl

Branch/Commit ID: 77ec4f26eb14ed82481828bd9f6ef659cfd8b40f

workflow graph format_rrnas_from_seq_entry

https://github.com/ncbi/pgap.git

Path: task_types/tt_format_rrnas_from_seq_entry.cwl

Branch/Commit ID: 001fab592188cb525afa1c4db6226b833faec106

workflow graph Motif Finding with HOMER with custom background regions

Motif Finding with HOMER with custom background regions
---------------------------------------------------

HOMER contains a novel motif discovery algorithm that was designed for regulatory element analysis in genomics applications (DNA only, no protein). It is a differential motif discovery algorithm, which means that it takes two sets of sequences and tries to identify the regulatory elements that are specifically enriched in one set relative to the other. It uses ZOOPS scoring (zero or one occurrence per sequence) coupled with hypergeometric (or binomial) enrichment calculations to determine motif enrichment. HOMER also tries its best to account for sequence bias in the dataset. It was designed with ChIP-Seq and promoter analysis in mind, but can be applied to pretty much any nucleic acid motif finding problem.

For more information please refer to:
-------------------------------------

[Official documentation](http://homer.ucsd.edu/homer/motif/)
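To make the ZOOPS-plus-hypergeometric idea in the description above concrete, here is a minimal sketch. It is not HOMER's code: the function names are hypothetical, and the motif is matched as an exact string, whereas HOMER itself works with degenerate motifs and probability matrices. Each sequence is counted at most once (zero or one occurrence), and the p-value is the upper tail of the hypergeometric distribution.

```python
from math import comb


def zoops_count(sequences, motif):
    """Count sequences that contain the motif at least once (ZOOPS)."""
    return sum(1 for seq in sequences if motif in seq.upper())


def hypergeom_enrichment(target, background, motif):
    """Return (k, K, p) for motif enrichment in target vs. background.

    k: target sequences with the motif
    K: all sequences (target + background) with the motif
    p: P(X >= k) when drawing len(target) sequences from the pooled set
    """
    n = len(target)
    N = n + len(background)
    k = zoops_count(target, motif)
    K = k + zoops_count(background, motif)
    # Upper tail of the hypergeometric distribution, computed exactly.
    tail = sum(comb(K, i) * comb(N - K, n - i)
               for i in range(k, min(K, n) + 1))
    return k, K, tail / comb(N, n)
```

A small p-value indicates that the motif occurs in more target sequences than expected if target and background were drawn from the same pool. The custom background set supplied to this workflow plays the role of `background` here.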

https://github.com/datirium/workflows.git

Path: workflows/homer-motif-analysis-bg.cwl

Branch/Commit ID: 2f0db4b3c515f91c5cfda19c78cf90d339390986

workflow graph Bismark Methylation - pipeline for BS-Seq data analysis

Sequence reads are first cleaned from adapters and transformed into fully bisulfite-converted forward (C->T) and reverse read (G->A conversion of the forward strand) versions, before they are aligned to similarly converted versions of the genome (also C->T and G->A converted). Sequence reads that produce a unique best alignment from the four alignment processes against the bisulfite genomes (which run in parallel) are then compared to the normal genomic sequence, and the methylation state of all cytosine positions in the read is inferred. A read is considered to align uniquely if an alignment has a unique best alignment score (as reported by the AS:i field). If a read produces several alignments with the same number of mismatches or with the same alignment score (AS:i field), the read (or read pair) is discarded altogether.

In the next step we extract the methylation call for every single C analysed. The position of every single C is written to a new output file, depending on its context (CpG, CHG or CHH), whereby methylated Cs are labelled as forward reads (+) and non-methylated Cs as reverse reads (-). The output of the methylation extractor is then transformed into a bedGraph and a coverage file. The bedGraph counts output is then used to generate a genome-wide cytosine report, which reports the methylation counts for every single CpG (optionally every single cytosine) in the genome, irrespective of whether it was covered by any reads or not. As this type of report is informative for cytosines on both strands, the output may be fairly large (~46 million CpG positions or >1.2 billion total cytosine positions in the human genome).
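The in-silico conversion and the per-cytosine methylation call described above can be illustrated with a short sketch. It is a simplification and not Bismark's implementation: the function names are hypothetical, only a forward-strand read aligned without gaps at a known offset is handled, and the alignment step itself is omitted.

```python
def c_to_t(seq):
    """Fully bisulfite-converted forward version (every C becomes T)."""
    return seq.upper().replace("C", "T")


def g_to_a(seq):
    """G->A conversion, used for the reverse-strand version."""
    return seq.upper().replace("G", "A")


def cytosine_context(genome, pos):
    """Classify the C at genome[pos] as CpG, CHG or CHH (H = A, C or T)."""
    if pos + 1 < len(genome) and genome[pos + 1] == "G":
        return "CpG"
    if pos + 2 < len(genome) and genome[pos + 2] == "G":
        return "CHG"
    if pos + 2 < len(genome):
        return "CHH"
    return "unknown"  # too close to the end of the sequence to decide


def call_methylation(read, genome, offset):
    """Compare an aligned read with the unconverted genome.

    Returns (genome_position, context, is_methylated) for every genomic C
    covered by the read: a C in the read means the base was protected
    (methylated), a T means it was converted (unmethylated).
    """
    calls = []
    for i, base in enumerate(read.upper()):
        pos = offset + i
        if genome[pos] != "C" or base not in "CT":
            continue
        calls.append((pos, cytosine_context(genome, pos), base == "C"))
    return calls
```

With `genome = "ACGTCAGTCTTA"` and the read `"ACGTTAGTCT"` aligned at offset 0, the calls are a methylated CpG at position 1, an unmethylated CHG at position 4 and a methylated CHH at position 8.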

https://github.com/datirium/workflows.git

Path: workflows/bismark-methylation-se.cwl

Branch/Commit ID: 8bf36bfad5624fbc8fc315e82783a44e9e5e4470

workflow graph Filter ChIP/ATAC peaks for Tag Density Profile or Motif Enrichment analyses

Filters ChIP/ATAC peaks with the nearest genes assigned for Tag Density Profile or Motif Enrichment analyses
============================================================================================================

The tool filters output from any ChIP/ATAC pipeline to create a file with regions of interest for Tag Density Profile or Motif Enrichment analyses. Peaks with duplicated coordinates are discarded.

https://github.com/datirium/workflows.git

Path: workflows/filter-peaks-for-heatmap.cwl

Branch/Commit ID: 2caa50434966ebdf4b33e5ca689c2e4df32f9058