首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于密码子偏好特征的原核基因组多拷贝基因序列分析
引用本文:陈清利,毕胜男,于家峰.基于密码子偏好特征的原核基因组多拷贝基因序列分析[J].德州学院学报,2014(6):21-25.
作者姓名:陈清利  毕胜男  于家峰
作者单位:1. 山东省功能大分子生物物理重点实验室,德州学院 生物物理研究所,山东 德州 253023; 德州学院 物理与电子信息学院,山东 德州 253023; 山东师范大学 生命科学学院,济南 250014
2. 山东省功能大分子生物物理重点实验室,德州学院 生物物理研究所,山东 德州 253023
基金项目:国家自然科学基金资助项目
摘    要:基因重复是普遍存在的现象,与基因组进化密切相关,是基因组和遗传系统分化的重要推动力.目前针对原核基因组中蛋白质编码基因序列中的重复基因的系统研究还很少.本文以四种具有不同GC%含量的原核生物基因组为研究对象,用CodonW软件对各基因组中完全相同的功能基因的密码子使用偏好进行分析,用CD-hit软件对各基因组中以80%为阈值的重复蛋白编码基因进行分析.结果表明四个基因组的蛋白编码基因中普遍存在基因重复序列,其比例占到2.77%~7.03%.对序列完全相同的功能已知基因的分析表明其序列长度分布在50bp到1000bp左右的范围,多数长度在500bp以下;功能分析表明所研究基因组中大部分重复基因与转座酶有关,还有少量的编码转移酶、水解酶、跨膜蛋白、阻遏蛋白等.对各基因组中重复基因中序列完全相同的基因的密码子偏好性分析表明这些多拷贝基因坐落在基因组中某一特定区域并集中分布,展现出明显的共性特征.本文的尝试性工作将为今后原核基因组研究提供新思路.

关 键 词:原核基因组  重复基因  多拷贝蛋白编码基因

Sequences Analysis of Multi-copied Genes Based on Codon Usage Analysis in Prokaryotic Genomes
CHEN Qing-li,BI Sheng-nan,YU Jia-feng.Sequences Analysis of Multi-copied Genes Based on Codon Usage Analysis in Prokaryotic Genomes[J].Journal of Dezhou University,2014(6):21-25.
Authors:CHEN Qing-li  BI Sheng-nan  YU Jia-feng
Institution:CHEN Qing-li,BI Sheng-nan,YU Jia-feng(1. Shandong Provincial Key Laboratory of Functional Macromolecular Biophysics, Institute of Biophysics, Dezhou University, Dezhou Shandong 253023, China 2. School of Physics and Electronic Engineering, Dezhou University, Dezhou Shandong 253023, China; 3. School of Life Science, Sbandong Normal University, Jinan 250014, China)
Abstract:Gene duplication is a general phenomenon in organism,which is related to the genome evolution as an important driving force of genome and genetic differentiation system.At present,much fewer re-searches have been performed on the duplicated genes in prokaryotic genomes.Four prokaryotic genomes with different GC contents are downloaded from Refseq database.CodonW program is adopted for codon usage analysis of the protein coding genes.CD-hit program is used to determine the duplicated genes with the threshold of 80%.Statistical results show that 2.77%~7.03% of the protein coding genes in the four genomes are duplicated.Further sequences analysis shows that sequence length of the multi-copied known function genes are below 1000bp.Function analysis showed that most of the multi-copied genes are related to transposons,with a small amount of genes that coding transferase,hydrolytic enzymes, transmembrane protein,repressor protein,etc.Codon usage bias analysis indicates that the most of the multi-copied genes locate in particular regions,which exhibit regular intrinsic sequences features.Then it is interesting for further study the evolutionary mechanisms of the multi-copied genes in future work.
Keywords:prokaryotic genome  duplicated gene  multi-copied protein coding genes
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号