Knowledge Management System of Institute of Mechanics, CAS
A combined statistical model for multiple motifs search | |
Gao LF(高丽锋); Liu X(刘鑫) | |
Source Publication | Chinese Physics B |
Publication Year | 2008 |
Volume | 17, Issue: 12, Pages: 4396 |
ISSN | 1674-1056 |
Abstract | Transcription factor binding sites (TFBS) play key roles in gene expression and regulation. They are short sequence segments with definite structure that can be recognized correctly by the corresponding transcription factors. From a statistical viewpoint, TFBS candidates should differ markedly from segments formed by randomly combining nucleotides. This paper proposes a combined statistical model for finding over-represented short sequence segments in different kinds of data sets. When an over-represented short sequence segment is described by a position weight matrix, the nucleotide distribution at most sites of the segment should be far from the background nucleotide distribution. The central idea of this approach is to search for this kind of signal. The algorithm is tested on three data sets: the binding-site data set of cyclic AMP receptor protein in E. coli; PlantProm DB, a non-redundant collection of proximal promoter sequences from different species; and a collection of the intergenic sequences of the whole genome of E. coli. Even though the complexity of these three data sets is quite different, the results show that the model is rather general and sensible. |
Keyword | Transcription Factor Binding Sites; Motif; Position Weight Matrix |
Subject Area | Biomechanics |
DOI | 10.1088/1674-1056/17/12/011 |
Indexed By | SCI ; EI ; CSCD |
Language | English |
WOS ID | WOS:000262494500011 |
WOS Keyword | FACTOR-BINDING SITES; EM ALGORITHM; IDENTIFICATION; GENOMES; SEQUENCES; ALIGNMENT |
CSCD ID | CSCD:3437045 |
Citation statistics | Cited Times: 2 [CSCD] |
Document Type | Journal article |
Identifier | http://dspace.imech.ac.cn/handle/311007/33079 |
Collection | Knowledge Output of the Institute of Mechanics (1956-2008) |
Corresponding Author | Guan S |
Recommended Citation GB/T 7714 | Gao LF, Liu X, Guan S, et al. A combined statistical model for multiple motifs search[J]. Chinese Physics B, 2008, 17(12): 4396. |
APA | Gao LF, Liu X, Guan S, & Guan S. (2008). A combined statistical model for multiple motifs search. Chinese Physics B, 17(12), 4396. |
MLA | Gao LF, et al. "A combined statistical model for multiple motifs search". Chinese Physics B 17.12 (2008): 4396. |
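The abstract's central idea, that most positions of a motif described by a position weight matrix should lie far from the background nucleotide distribution, can be illustrated with a small sketch. This is not the authors' combined model: it only measures the per-position relative entropy (Kullback-Leibler divergence) of a frequency matrix against a background. The function names, the pseudocount, the threshold, the uniform background, and the toy sites are all assumptions made for illustration.

```python
import math

BASES = "ACGT"


def position_frequencies(sites, pseudocount=1.0):
    """Column-wise nucleotide frequencies for equal-length, uppercase ACGT sites."""
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    matrix = []
    for i in range(length):
        # Pseudocounts keep every frequency strictly positive.
        counts = {b: pseudocount for b in BASES}
        for s in sites:
            counts[s[i]] += 1
        total = sum(counts.values())
        matrix.append({b: counts[b] / total for b in BASES})
    return matrix


def relative_entropy(column, background):
    """Kullback-Leibler divergence (in bits) of one column from the background."""
    return sum(p * math.log2(p / background[b])
               for b, p in column.items() if p > 0)


def informative_positions(matrix, background, threshold=0.3):
    """Indices of columns whose divergence from the background exceeds threshold."""
    return [i for i, col in enumerate(matrix)
            if relative_entropy(col, background) > threshold]


# Hypothetical toy data: a uniform background and made-up aligned sites.
background = {b: 0.25 for b in BASES}
sites = ["TGTGA", "TGTGA", "TGCGA", "TGTGA"]

matrix = position_frequencies(sites)
scores = [relative_entropy(col, background) for col in matrix]
informative = informative_positions(matrix, background)
```

Here `scores` holds one divergence value per motif position, and `informative` lists the positions that clear the (arbitrary) threshold. A segment whose columns mostly score near zero would look like background under this simplified criterion.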
Files in This Item: |
File Name/Size | DocType | Version | Access | License |
gs.pdf (1074KB) | Journal article | Author's accepted manuscript | Open access | CC BY-NC-SA |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.