[关键词]
[摘要]
为了研究所有曲霉来源的α-L-鼠李糖苷酶的结构特征,利用NCBI数据库收集到291条无重复的曲霉来源α-L-鼠李糖苷酶核酸序列并通过进化树筛选得到21条具有代表性的蛋白序列,通过序列比对分析得到这21条序列相互独立具有可靠的代表性,并利用生物信息学工具对其理化性质、跨膜区、信号肽进行分析发现,曲霉来源α-L-鼠李糖苷酶的等电点(pI)范围为4.66~7.17,在氨基酸数量、分子量、原子总数上波动较大,其中有10条序列拥有信号肽,1条序列为两次跨膜蛋白,21条序列都为亲水性蛋白;对其进行进化树构建、三级结构建模及结构叠合发现,这21条代表性序列的蛋白结构可以被分成两大类型,第一大类型含有1个(α/α)6桶状结构和在桶底的1个β片层结构,并根据额外含有的β片层数量的不同再被分成4个小类;第二大类型拥有1个(α/β)8结构和环绕在桶装结构域周围的β片层结构,阐明了曲霉来源的α-L-鼠李糖苷酶的蛋白结构特征,这有助于更好的明确α-L-鼠李糖苷酶的共性规律,为改造该酶提供理论指导。
[Key word]
[Abstract]
In order to study the structural characteristics of all Aspergillus-derived α-L-rhamnosidase, 300 non-repeated Aspergillus derived α-L-rhamnosidase nucleic acid sequences were collected using NCBI database and 21 of them were selected through evolutionary tree selection. Through sequence alignment analysis, the 21 sequences are independently and reliably representative. Using bioinformatics tools to analyze the physical and chemical properties, transmembrane region, and signal peptide, it was found that the isoelectric point (pI) of α-L-rhamnosidase derived from Aspergillus species was 4.66 to 7.17, and the number of amino acids, molecular weight and total number of atoms fluctuates greatly. Ten sequences have signal peptide sequences, 1 sequence is twice transmembrane protein, and 21 sequences are all hydrophilic proteins. The phylogenetic tree construction, three-level structure modeling and structure superposition show that these 21 representative sequences can be divided into two types. The first type contains one (α/α)6 barrel structure and one β sheet structure at the bottom of the barrel, and can be further divided into 4 sub-categories according to the number of additional β sheets contained; The second type has one (α/β)8 structure and β sheet structure surrounding the barrel structure domain. It elucidates the protein structure characteristics of α-L-rhamnosidase derived from Aspergillus, which helps to better clarify the common law of α-L-rhamnosidase and provide theoretical guidance for the modification of this enzyme.
[中图分类号]
[基金项目]
国家自然科学基金项目(U1805235)