Transcriptome analysis of Hepialus altaicola (Lepidoptera: Hepialidae) larvae based on high-throughput sequencing |
Keywords: Hepialus altaicola; transcriptome; gene annotation; high-throughput sequencing; bioinformatics |
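Funding: National Natural Science Foundation of China (32060125, 81560614); China Postdoctoral Science Foundation (2016M602907); Project of the Collaborative Innovation Center for the Prevention and Control of High-Incidence Zoonotic Infectious Diseases in Western China |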
Authors: SUN Tao, ZHANG Shi-Yuan, WANG Yan, CHEN Chuang-Fu |
Affiliations: |
1. Medical College of Shihezi University, Shihezi 832000, Xinjiang Uygur Autonomous Region, China |
2. The Third Affiliated Hospital of Shihezi University, Shihezi 832000, Xinjiang Uygur Autonomous Region, China |
3. College of Animal Science and Technology, Shihezi 832000, Xinjiang Uygur Autonomous Region, China |
4. Animal Husbandry Post-doctoral Station of Shihezi University, Shihezi 832000, Xinjiang Uygur Autonomous Region, China |
Abstract: |
The transcriptome of Hepialus altaicola Wang larvae was sequenced on an Illumina HiSeq 2500 platform and analyzed bioinformatically. The clean reads were de novo assembled into 100 133 Unigenes with a total length of 86 319 112 bp, a mean length of 862 bp, and an N50 length of 1 628 bp. Alignment against the NR, COG/KOG, Pfam, Swiss-Prot, GO, and KEGG databases annotated a total of 38 198 Unigenes. The NR database annotated the most, 32 381 Unigenes (32.34%). In the Gene Ontology (GO) functional classification, 13 216 Unigenes were annotated in 57 branches across the three major categories: cellular component, molecular function, and biological process. In the KEGG pathway analysis, 15 058 Unigenes were annotated and assigned to 305 metabolic pathways. CDS prediction, using BLAST against the NR and Swiss-Prot protein databases, identified 54 002 coding sequences (CDS), accounting for 53.93% of all genes. Gene annotation further identified 311 metabolic regulation genes related to cold adaptation, and their expression levels were evaluated using FPKM values. The transcriptome data and analysis results obtained in this study lay a molecular foundation for further research on gene function and low-temperature ecological adaptation in H. altaicola. |
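The abstract reports a mean length, an N50, annotation percentages, and FPKM values without saying how they were computed. The following is a minimal Python sketch that assumes the standard definitions of N50 and FPKM. The function names `n50`, `fpkm`, and `percent` and the toy inputs are illustrative and not taken from the study. Only the Unigene counts and total length come from the abstract.

```python
def n50(lengths):
    """Return the N50: the largest length L such that sequences of
    length >= L together contain at least half of all assembled bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("lengths must be non-empty")


def fpkm(fragments, length_bp, total_mapped_fragments):
    """Fragments per kilobase of transcript per million mapped fragments
    (standard definition, assumed here)."""
    return fragments * 1e9 / (length_bp * total_mapped_fragments)


def percent(part, whole):
    """Share of `part` in `whole`, as a percentage rounded to 2 decimals."""
    return round(100.0 * part / whole, 2)


# Figures reported in the abstract.
N_UNIGENES = 100_133
TOTAL_BP = 86_319_112

mean_length = round(TOTAL_BP / N_UNIGENES)   # mean Unigene length in bp
nr_share = percent(32_381, N_UNIGENES)       # Unigenes annotated in NR
cds_share = percent(54_002, N_UNIGENES)      # predicted coding sequences

# Toy inputs (hypothetical, for illustration only).
toy_n50 = n50([100, 200, 300, 400, 500])
toy_fpkm = fpkm(fragments=500, length_bp=2000,
                total_mapped_fragments=10_000_000)

print(mean_length, nr_share, cds_share, toy_n50, toy_fpkm)
```

With the abstract's counts, the recomputed mean length and both percentages agree with the reported 862 bp, 32.34%, and 53.93%. The N50 of 1 628 bp cannot be rechecked without the individual Unigene lengths.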