IMECH-IR  > 力学所知识产出(1956-2008)
A combined statistical model for multiple motifs search
Gao LF(高丽锋); Liu X(刘鑫); Guan S(官山); Guan S
Source PublicationChinese Physics B
2008
Volume17Issue:12Pages:4396
ISSN1674-1056
Abstract

Transcription factor binding sites (TFBS) play key roles in genebior 6.8 wavelet expression and regulation. They are short sequence segments with de¯nite structure and can be recognized by the corresponding transcription factors correctly. From the viewpoint of statistics, the candidates of TFBS should be quite di®erent from the segments that are randomly combined together by nucleotide. This paper proposes a combined statistical model for ¯nding over- represented short sequence segments in di®erent kinds of data set. While the over-represented short sequence segment is described by position weight matrix, the nucleotide distribution at most sites of the segment should be far from the background nucleotide distribution. The central idea of this approach is to search for such kind of signals. This algorithm is tested on 3 data sets, including binding sites data set of cyclic AMP receptor protein in E.coli, PlantProm DB which is a non-redundant collection of proximal promoter sequences from di®erent species, collection of the intergenic sequences of the whole genome of E.Coli. Even though the complexity of these three data sets is quite di®erent, the results show that this model is rather general and sensible.

KeywordTranscription Factor Binding Sites Motif Position Weight Matrix
Subject Area生物力学
DOI10.1088/1674-1056/17/12/011
Indexed BySCI ; EI ; CSCD
Language英语
WOS IDWOS:000262494500011
WOS KeywordFACTOR-BINDING SITES; EM ALGORITHM; IDENTIFICATION; GENOMES; SEQUENCES; ALIGNMENT
CSCD IDCSCD:3437045
Citation statistics
Cited Times:2[WOS]   [WOS Record]     [Related Records in WOS]
Cited Times:2[CSCD]   [CSCD Record]
Document Type期刊论文
Identifierhttp://dspace.imech.ac.cn/handle/311007/33079
Collection力学所知识产出(1956-2008)
Corresponding AuthorGuan S
Recommended Citation
GB/T 7714
Gao LF,Liu X,Guan S,et al. A combined statistical model for multiple motifs search[J]. Chinese Physics B,2008,17,12,:4396.
APA Gao LF,Liu X,Guan S,&Guan S.(2008).A combined statistical model for multiple motifs search.Chinese Physics B,17(12),4396.
MLA Gao LF,et al."A combined statistical model for multiple motifs search".Chinese Physics B 17.12(2008):4396.
Files in This Item: Download All
File Name/Size DocType Version Access License
gs.pdf(1074KB)期刊论文作者接受稿开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Lanfanshu
Similar articles in Lanfanshu
[Gao LF(高丽锋)]'s Articles
[Liu X(刘鑫)]'s Articles
[Guan S(官山)]'s Articles
Baidu academic
Similar articles in Baidu academic
[Gao LF(高丽锋)]'s Articles
[Liu X(刘鑫)]'s Articles
[Guan S(官山)]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Gao LF(高丽锋)]'s Articles
[Liu X(刘鑫)]'s Articles
[Guan S(官山)]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: gs.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.