引用本文:
【打印本页】   【HTML】   【下载PDF全文】   View/Add Comment  【EndNote】   【RefMan】   【BibTex】
←前一篇|后一篇→ 过刊浏览    高级检索
本文已被:浏览 1882次   下载 1251 本文二维码信息
码上扫一扫!
分享到: 微信 更多
Frequency and Correlation of Nearest Neighboring Nucleotides in Human Genome
Neng-zhi Jin,Zi-xian Liu,Wen-yuan Qiu *
Author NameAffiliationE-mail
Neng-zhi Jin Department of Chemistry, State Key Laboratory of Applied Organic Chemistry, Lanzhou University, Lanzhou 730000, China  
Zi-xian Liu Department of Chemistry, State Key Laboratory of Applied Organic Chemistry, Lanzhou University, Lanzhou 730000, China  
Wen-yuan Qiu * Department of Chemistry, State Key Laboratory of Applied Organic Chemistry, Lanzhou University, Lanzhou 730000, China wyqiu@lzu.edu.cn 
Abstract:
Zipf's approach in linguistics is utilized to analyze the statistical features of frequency and correlation of 16 nearest neighboring nucleotides (AA, AC, AG, … , TT) in 12 human chro- mosomes (Y, 22, 21, 20, 19, 18, 17, 16, 15, 14, 13, and 12). It is found that these statistical features of nearest neighboring nucleotides in human genome: (i) the frequency distribution is a linear function, and (ii) the correlation distribution is an inverse function. The coeffi- cients of the linear function and inverse function depend on the GC content. It proposes the correlation distribution of nearest neighboring nucleotides for the first time and extends the descriptor about nearest neighboring nucleotides.
Key words:  Zipf's law, Nearest neighboring nucleotide, Frequency distribution, Correlation distribution
FundProject:
Frequency and Correlation of Nearest Neighboring Nucleotides in Human Genome
金能智,刘子贤,邱文元 *
摘要:
利用语言学中的Zipf方法分析了人类基因组12条染色体(Y、22、21、20、19、18、17、16、15、14、13、12)中16种紧邻核苷酸(AA、AC、AG、…、TT)的频率及关联度的统计特征,发现人类基因组紧邻核苷酸的统计特征,即频率分布满足线性函数关系;关联度分布满足逆函数关系,且线性函数和逆函数的拟合系数取决于GC含量.首次提出了紧邻核苷酸的关联度分布,增加了对紧邻核苷酸研究的描述符.
关键词:  Zipf定律,紧邻核苷酸,频率分布,关联度分布
DOI:10.1088/1674-0068/22/01/27-33
分类号: