寇晓艳1,刘静1,2,周志军1*,骞蕾阳1,石福明1,2015,优雅蝈螽VASA蛋白cDNA序列的克隆与生物信息学分析[J].环境昆虫学报,37(3):558-566
优雅蝈螽VASA蛋白cDNA序列的克隆与生物信息学分析
Cloning and bioinformatic analysis of VASA cDNA from Gampsocleis gratiosa
  
DOI:
中文关键词:  优雅蝈螽  vasa基因  转录组  cDNA
英文关键词:Gampsocleis gratiosa  vasagene  Transcriptome  cDNA
基金项目:国家自然科学基金项目(31101632,31471985);河北省自然科学基金项目(C2012201049)
作者单位
寇晓艳1,刘静1,2,周志军1*,骞蕾阳1,石福明1 (1. 河北大学生命科学学院河北保定071002 2.中国科学院成都生物研究所成都610041) 
摘要点击次数: 1418
全文下载次数: 0
中文摘要:
      Vasa基因属于DEAD-box家族,其功能主要是特定mRNA的翻译调控。在许多动物中,它都是生殖系细胞发育所必须,对生殖干细胞分化具有重要作用。为探究vasa基因在半变态类昆虫生殖系细胞发育中的作用,本研究首先从基于Illumina 高通量测序平台测得的优雅蝈螽Gampsocleis gratiosa成体转录组数据中筛选出一段长度为1215 bp的vasa基因片段,进而设计引物并利用RT-PCR和RACE技术获得其cDNA序列全长,最后利用生物信息学技术进行分析。结果显示:优雅蝈螽vasa基因的cDNA序列全长3359 bp,其中,5′端非编码区82 bp,3′端非编码区1306 bp,开放阅读框1971 bp编码656个氨基酸,理论蛋白相对分子量(Mw)72.3 kDa,等电点(pI)5.48。通过与GenBank数据库中收录的其他VASA蛋白序列比对,发现优雅蝈螽VASA蛋白具有DEAD-box蛋白家族所共有的9个保守基序,AxTGoGKT (I)、PTRELA (Ia)、TPGR(Ib)、DEAD (Ⅱ)、SAT (Ⅲ)、LVFVE (Ⅳ)、TDVuARGID (Ⅴ)、HRIGRTGR(Ⅵ) 和GaccPoh1Q(Q),其中,GaccPoh1Q(Q)的第3个氨基酸残基存在显著变化,建议将GaccPoh1Q(Q)修改为GaxcPoh1Q(Q)。此外,优雅蝈螽VASA蛋白的N端还具有10个RG和2个RGG重复序列、起始及终止密码子附近的色氨酸(W)、C末端的7个氨基酸残基中有4个为酸性氨基酸残基(E),表明其具有ATP依赖的RNA解旋酶活性。基于氨基酸序列聚类结果显示:优雅蝈螽位于六足动物分枝末梢,与双斑蟋 Gryllus bimaculatus的亲缘关系最近,这与二者的分类学地位相符。本研究表明基于短读长二代测序平台获得的转录组数据可以很好地服务于功能基因研究,所获得的优雅蝈螽vasa基因cDNA全长对于进一步深入研究VASA蛋白在半变态类昆虫生殖系细胞发育研究具有重要意义。
英文摘要:
      Vasa is a DEAD-box RNA helicase that functions in translational regulation of specific mRNAs. In many animals it is essential for germ line development and may have a more general stem cell role. In order to have a further known about its function, we conducted a study selecting the 1215 bp fragment of vasa gene from Gampsocleis gratiosa transcriptome sequencing data set. By conducting RT-PCR and RACE experiment, we get the whole sequence of its cDNA with the defined specific primer and do the preliminary bioinformatics analysis.The result shows that the whole sequence of its cDNA is 3359 bp, which contains 5′-untranslated region of 82 bp, 3′-untranslated region of 1306 bp, and an opening reading frame (ORF) of 1971 bp encoding 656 amino acids.Predicted VASA protein sequence contains 9 conserved regions of DEAD-box family protein, AxTGoGKT (I)、PTRELA (Ia)、TPGR(Ib)、DEAD (Ⅱ)、SAT (Ⅲ)、LVFVE (Ⅳ)、TDVuARGID (Ⅴ)、HRIGRTGR(Ⅵ) and GaccPoh1Q(Q), which indicates it is with ATP relied RNase active. Through VASA protein sequences alignment, we found that the third residues of GaccPoh1Q(Q)motif existed obvious variation, and it should be renamed GaxcPoh1Q(Q).In addition, there are 10 RG repeated sequences and 2 RGG repeated sequences exist in the N terminal, 7 amino acids including 4 acidic amino acids residue (E) in the C terminal, and two Try amino acids residues (W) near the initiation and termination codon. The phylogenetic analysis reveals that G. Gratiosa belongs to the Hexapoda clade, and has a closer relationship to Gryllus bimaculatus, which also belong to order Orthoptera, suborder Ensifera. Our research also indicates transcriptome data acquired using short-read Illumina sequencing platform could significantly serve for functional gene studies of nonmodel organisms in the future. Acquired vasa gene full-long cDNA of G. gratiosa plays a significant role in further studying the function of VASA protein during the germline development of Hemimetabolous insect.
查看全文  查看/发表评论  下载PDF阅读器
关闭