Difference between revisions of "Analyze SNP list (GTRD) (workflow)"

From BioUML platform
Jump to: navigation, search
(Automatic synchronization with BioUML)
(Automatic synchronization with BioUML)
 
(One intermediate revision by one user not shown)
Line 8: Line 8:
 
This workflow is designed to analyze the SNP list and to predict TFBSs overlapping with regulatory SNPs.  Any list with SNP IDs (in the format rs…) can be used as an input.
 
This workflow is designed to analyze the SNP list and to predict TFBSs overlapping with regulatory SNPs.  Any list with SNP IDs (in the format rs…) can be used as an input.
  
In the first step, the “SNP matching” analysis maps input SNPs on the genome. The result of this step is an annotated SNP table and a corresponding track file with all SNPs.
+
In the first step, the “SNP matching” analysis maps input SNPs on the genome. The result of this step is an annotated SNP table and a corresponding track file with all SNPs.  
  
 
The track file with all SNPs is further converted into a gene set using the “Track to gene set” analysis. This step identifies genes located within the region -10000 to 10000 around each SNP in the list. The region is available in the input form and can be modified. This step returns a schematic map of SNPs in genes mapped to either exons, introns or flanking regions.  
 
The track file with all SNPs is further converted into a gene set using the “Track to gene set” analysis. This step identifies genes located within the region -10000 to 10000 around each SNP in the list. The region is available in the input form and can be modified. This step returns a schematic map of SNPs in genes mapped to either exons, introns or flanking regions.  
Line 14: Line 14:
 
The annotated SNP table is filtered using the “Filter table” analysis, which results in two tables, SNPs in exons and other SNPs.  
 
The annotated SNP table is filtered using the “Filter table” analysis, which results in two tables, SNPs in exons and other SNPs.  
  
The SNPs located in introns or in gene-flanking regions are referred to as regulatory SNPs. They are further analyzed for their overlap with TFBSs. To this end, the track with regulatory SNPs is further processed using “Site search on track” and “Site search results optimization” to find TFBSs enriched around regulatory SNPs as compared to random genomic positions. The workflow uses the default profile vertebrate_non_redundant_minFN from the TRANSFAC library, but other TRANSFAC profiles can be chosen by the user. The region around SNPs analyzed for TFBSs is 60 bp on each flank; it is available in the input form and can be modified.  
+
The SNPs located in introns or in gene-flanking regions are referred to as regulatory SNPs. They are further analyzed for their overlap with TFBSs. To this end, the track with regulatory SNPs is further processed using “Site search on track” and “Site search results optimization” to find TFBSs enriched around regulatory SNPs as compared to random genomic positions. The workflow uses the default profile moderate threshold from GTRD library, but other GTRD profiles can be chosen by the user. The region around SNPs analyzed for TFBSs is 50 bp on each flank; it is available in the input form and can be modified.  
  
 
The output is a result folder with three subfolders called “all SNPs”, “SNPs in exons” and “SNPs regulatory”, containing the resulting tables and tracks for the corresponding SNPs. All tracks can be used immediately to visualize SNPs, nearby located genes, as well as overlapping TFBS on chromosomes.
 
The output is a result folder with three subfolders called “all SNPs”, “SNPs in exons” and “SNPs regulatory”, containing the resulting tables and tracks for the corresponding SNPs. All tracks can be used immediately to visualize SNPs, nearby located genes, as well as overlapping TFBS on chromosomes.
Line 23: Line 23:
 
;Profile
 
;Profile
 
;SNP surrounding region, bp
 
;SNP surrounding region, bp
 +
:Set SNP surrounding region, bp
 
;Species
 
;Species
 
;Results Folder
 
;Results Folder

Latest revision as of 16:34, 12 March 2019

Workflow title
Analyze SNP list (GTRD)
Provider
geneXplain GmbH

[edit] Workflow overview

Analyze-SNP-list-GTRD-workflow-overview.png

[edit] Description

This workflow is designed to analyze the SNP list and to predict TFBSs overlapping with regulatory SNPs.  Any list with SNP IDs (in the format rs…) can be used as an input.

In the first step, the “SNP matching” analysis maps input SNPs on the genome. The result of this step is an annotated SNP table and a corresponding track file with all SNPs.

The track file with all SNPs is further converted into a gene set using the “Track to gene set” analysis. This step identifies genes located within the region -10000 to 10000 around each SNP in the list. The region is available in the input form and can be modified. This step returns a schematic map of SNPs in genes mapped to either exons, introns or flanking regions.

The annotated SNP table is filtered using the “Filter table” analysis, which results in two tables, SNPs in exons and other SNPs.

The SNPs located in introns or in gene-flanking regions are referred to as regulatory SNPs. They are further analyzed for their overlap with TFBSs. To this end, the track with regulatory SNPs is further processed using “Site search on track” and “Site search results optimization” to find TFBSs enriched around regulatory SNPs as compared to random genomic positions. The workflow uses the default profile moderate threshold from GTRD library, but other GTRD profiles can be chosen by the user. The region around SNPs analyzed for TFBSs is 50 bp on each flank; it is available in the input form and can be modified.

The output is a result folder with three subfolders called “all SNPs”, “SNPs in exons” and “SNPs regulatory”, containing the resulting tables and tracks for the corresponding SNPs. All tracks can be used immediately to visualize SNPs, nearby located genes, as well as overlapping TFBS on chromosomes.

[edit] Parameters

Input SNP Table
5' and 3' gene bound extension
Profile
SNP surrounding region, bp
Set SNP surrounding region, bp
Species
Results Folder
Personal tools
Namespaces

Variants
Actions
BioUML platform
Community
Modelling
Analysis & Workflows
Collaborative research
Development
Virtual biology
Wiki
Toolbox