生物信息学试验报告
生物信息学 实验报告 班级: 姓名: 学号: 日期: 实验一 核酸和蛋白质序列数据的使用 实验目的实验目的 了解常用的序列数据库,掌握基本的序列数据信息的查询方法。 教学基本要求教学基本要求 了解和熟悉NCBI 核酸和蛋白质序列数据库, 可以使用BLAST进行序列搜索, 解读 BLAST 搜索结果, 可以利用PHI-BLAST 等工具进行蛋白质序列的结构域搜索,解 读蛋白质序列信息,可以在蛋白质三维数据库中查询相关结构信息并进行显示。 实验内容提要实验内容提要 在序列数据库中查找某条基因序列(BRCA1) ,通过相关一系列数据库的搜索、比 对与结果解释,回答以下问题: 1. 该基因的基本功能? 2. 编码的蛋白质序列是怎样的? 3. 该蛋白质有没有保守的功能结构域 (NCBI CD-search)? 4. 该蛋白质的功能是怎样的? 5. 该蛋白质的三级结构是什么?如果没有的话,和它最相似的同源物的结 构是什么样子的?给出示意图。 实验结果及结论实验结果及结论 1.1. 该基因的基本功能?该基因的基本功能? Thisgeneencodesanuclearphosphoproteinthatplaysarolein maintaining genomic stability, and it also acts as a tumor suppressor. The encoded protein combines with other tumor suppressors, DNA damage sensors, and signal transducers to a large multi-subunit protein complex known as the BRCA1-associated genome surveillance complex (BASC). This gene product associates with RNA polymerase II, and through the C-terminal domain, also interacts with histone deacetylase compls. Thisproteinthusplaysaroleintranscription,DNArepairof double-stranded breaks, and recombination. Mutations in this gene are responsible for approximately 40% of inherited breast cancers and more than 80% of inherited breast and ovarian cancers. Alternative splicing plays a role in modulating the subcellular localization and physiological function of this gene. Many alternatively spliced transcript variants, some of which are disease-associated mutations, have been described for this gene, but the full-length natures of only some of these variants has been described. A related pseudogene, which is also located on chromosome 17, has been identified. [provided by RefSeq, May 2009] 2.2. 编码的蛋白质序列是怎样的?编码的蛋白质序列是怎样的? [Homo sapiens] 1 mdlsalrvee vqnvinamqk ilecpiclel ikepvstkcd hifckfcmlk llnqkkgpsq 61 cplcknditk rslqestrfs qlveellkii cafqldtgle yansynfakk ennspehlkd 121 evsiiqsmgy rnrakrllqs epenpslqet slsvqlsnlg tvrtlrtkqr iqpqktsvyi 181 elgsdssedt vnkatycsvg dqellqitpq gtrdeislds akkaacefse tdvtntehhq 241 psnndlntte kraaerhpek yqgssvsnlh vepcgtntha sslqhenssl lltkdrmnve 301 kaefcnkskq pglarsqhnr wagsketcnd rrtpstekkv dlnadplcer kewnkqklpc 361 senprdtedv pwitlnssiq kvnewfsrsd ellgsddshd gesesnakva dvldvlnevd 421 eysgssekid llasdpheal ickservhsk svesniedki fgktyrkkas lpnlshvten 481 liigafvtep qiiqerpltn klkrkrrpts glhpedfikk adlavqktpe minqgtnqte 541 qngqvmnitn sghenktkgd siqneknpnp ieslekesaf ktkaepisss isnmelelni 601 hnskapkknr lrrksstrhi halelvvsrn lsppnctelq idscssseei kkkkynqmpv 661 rhsrnlqlme gkepatgakk snkpneqtsk rhdsdtfpel kltnapgsft kcsntselke 721 fvnpslpree keekletvkv snnaedpkdl mlsgervlqt ersvesssis lvpgtdygtq 781 esisllevst lgkaktepnk cvsqcaafen pkglihgcsk dnrndtegfk yplghevnhs 841 retsiemees eldaqylqnt fkvskrqsfa pfsnpgnaee ecatfsahsg slkkqspkvt 901 feceqkeenq gknesnikpv qtvnitagfp vvgqkdkpvd nakcsikggs rfclssqfrg 961 netglitpnk hgllqnpyri pplfpiksfv ktkckknlle enfeehsmsp eremgnenip 1021 stvstisrnn irenvfkeas ssninevgss tnevgssine igssdeniqa elgrnrgpkl 1081 namlrlgvlq pevykqslpg snckhpeikk qeyeevvqtv ntdfspylis dnleqpmgss