Repeat Sequences and Base Correlations in Human Y ChromosomePalindromes
- Received Date: 2008-12-28
- Human Y chromosome, Palindrome, Mutual information, Long range correlation, Short range correlation
Abstract: On the basis of information theory and statistical methods, we use mutual information, n-tuple entropy and conditional entropy, combined with biological characteristics, to analyze the long range correlation and short range correlation in human Y chromosome palindromes.The magnitude distribution of the long range correlation which can be reflected by the mutual information is P5>P5a>P5b (P5a and P5b are the sequences that replace solely Alu repeats and all interspersed repeats with random uncorrelated sequences in human Y chromosome palindrome 5, respectively); and the magnitude distribution of the short range correlation which can be reflected by the n-tuple entropy and the conditional entropy is P5>P5a>P5b>random uncorrelated sequence. In other words, when the Alu repeats and all interspersed repeats replace with random uncorrelated sequence, the long range and short range correlation decrease gradually. However, the random uncorrelated sequence has no correlation. This research indicates that more repeat sequences result in stronger correlation between bases in human Y chromosome. The analyses may be helpful to understand the special structures of human Y chromosome palindromes profoundly.
|Citation:||Neng-zhi Jin, Zi-xian Liu, Yan-jiao Qi, Wen-yuan Qiu. Repeat Sequences and Base Correlations in Human Y ChromosomePalindromes[J]. Chinese Journal of Chemical Physics , 2009, 22(3): 255-261. doi: 10.1088/1674-0068/22/03/255-261|