公开/公告号CN101090966A
专利类型发明专利
公开/公告日2007-12-19
原文格式PDF
申请/专利权人 财团法人阪大微生物病研究会;
申请/专利号CN200580044240.0
申请日2005-12-22
分类号C12N15/09(20060101);A61K39/12(20060101);A61K39/295(20060101);A61P31/12(20060101);C12N7/00(20060101);C12N7/04(20060101);
代理机构72001 中国专利代理(香港)有限公司;
代理人权陆军;李炳爱
地址 日本大阪府
入库时间 2023-12-17 19:32:51
法律状态公告日
法律状态信息
法律状态
2015-02-11
未缴年费专利权终止 IPC(主分类):C12N15/09 授权公告日:20120704 终止日期:20131222 申请日:20051222
专利权的终止
2013-02-06
发明专利公报更正 卷:28 号:27 IPC(主分类):C12N0015090000 更正项目:说明书 误:错误 正:正确 申请日:20051222
发明专利更正
2012-07-04
授权
授权
2008-02-13
实质审查的生效
实质审查的生效
2007-12-19
公开
公开
技术领域
本发明涉及具有减毒日本脑炎病毒基因作为主链的减毒嵌合黄病毒,其作为用于预防黄病毒感染的减毒的活疫苗是有用的。
背景技术
目前,已知超过大约60种属于黄病毒科(Flaviviridae)的病毒(以下简称黄病毒),包括日本脑炎病毒、西尼罗病毒、登革1-4病毒、黄热病毒、圣路易脑炎病毒、蜱传脑炎病毒、库宁病毒(Kunjin virus)、中欧脑炎病毒、贾萨努尔森林病毒、墨累山谷脑炎病毒、鄂木斯克出血热病毒、玻瓦散病毒、俄罗斯春夏季脑炎病毒、Yokose病毒、Apoi病毒和Aroa病毒。
这些黄病毒具有单链(+)RNA的基因组,并在基因结构方面互相类似。黄病毒基因组的可读框(ORF)由其5′末端开始编码三个结构蛋白(衣壳(C)蛋白、前膜(prM)蛋白,其为膜(M)蛋白的前体、和包膜(E)蛋白)和随后的七个非结构(NS)蛋白(NS1、NS2A、NS2B、NS3、NS4A、NS4B和NS5)。
黄病毒的这些结构蛋白和非结构蛋白被翻译成单个多蛋白;翻译的多蛋白然后被蛋白酶和具有宿主细胞和病毒的蛋白酶活性的NS3蛋白加工,从而导致形成包含上面所述的三种结构蛋白的成熟病毒体。
众所周知许多上面所述的黄病毒经由昆虫例如蚊子和蜱来感染哺乳动物,包括人类和鸟类,并引起脑炎和/或发热症状。起初,这些黄病毒物种的每一种都固有于特定的区域;因此,限制了感染的地方范围。但是,近年来,由于交通/分配的发展、气候变化等,各种黄病毒感染已经扩展到病原黄病毒的最初生境以外的地方,从而引起了公共卫生的重要的问题。
在防止病毒感染的扩展中,用疫苗来预防是有效的。然而,尽管如上所述已经识别了属于黄病毒科的很多病毒,但是在实际应用中仅仅将用于黄热病毒和日本脑炎病毒感染的减毒活疫苗和用于日本脑炎病毒和蜱传脑炎病毒感染的灭活疫苗作为用于黄病毒感染的疫苗。尤其是,减毒活疫苗作为廉价的疫苗是有用的并诱导长期免疫性,但是除了如上所述的那些疫苗以外还没有批准活疫苗。
在这些情况下,作为快速开发新的用于各种黄病毒感染的减毒活疫苗的方法,最近一种应用由基因工程技术制备的嵌合黄病毒的策略已经引起大家的注意。
例如,ChimeriVaxTM-JE是通过将编码黄热病毒疫苗17D毒株的两个结构蛋白(prM-E)的基因替换为日本脑炎病毒疫苗SA14-14-2毒株的相应基因来制备的嵌合黄病毒(参见,例如,国际专利公开号No.98/37911的小册子和国际专利公开号No.01/39802的小册子)。通过存在于与日本脑炎病毒疫苗SA14-14-2毒株的E蛋白相应的病毒多蛋白中的来自野生型的大量氨基酸突变,将ChimeriVaxTM-JE减毒到允许其用作疫苗的程度(参见,例如,Arroyo等人,J.Virol.75:934-942,2001)。
此外,基于这个ChimeriVaxTM-JE的技术,也已经开发了登革1-4病毒的嵌合黄病毒(ChimeriVaxTM-DEN(1-4))(参见,例如,国际专利公开号No.98/37911的小册子和国际专利公开号No.01/39802的小册子),以及西尼罗病毒的嵌合黄病毒(ChimeriVaxTM-West Nile)(参见,例如,国际专利公开号No.2004/045529的小册子),其具有黄热病毒疫苗17D毒株的基因作为主链。
还将这些嵌合黄病毒减毒到允许其用作疫苗的程度,所述减毒主要由存在于与prM-E蛋白相应的病毒多蛋白中的来自野生型的氨基酸取代所导致。
但是,已经报道了ChimeriVaxTM-DEN1的减毒主要由在用于疫苗生产的细胞中的该嵌合黄病毒的传代中发生的E蛋白的氨基酸突变导致(参见,例如,Guirakhoo等人,J.Virol.78:9998-10008,2004)。此外,因为不同血清型的登革病毒的再感染可能是登革出血热发作的原因,所以没有发现该疫苗的实际应用。
同样,已经通过在来源于野生型高毒性的西尼罗病毒NY-99毒株的E蛋白中人工导入氨基酸突变来促进ChimeriVaxTM-West Nile减毒(参见,例如,国际专利公开号No.2004/045529的小册子)。
作为使用除黄热病毒(YF-17D)以外的基因主链的嵌合黄病毒,已经报道了通过将编码登革-4病毒prM-E蛋白的基因替换为西尼罗病毒NY99毒株的相应基因来制备的嵌合黄病毒(WN/DEN4嵌合病毒)(参见,例如,Pletnev等人,Proc.Natl.Acad.Sci.USA 99:3036-3041,2002)。
虽然WN/DEN4嵌合病毒与其亲本毒株(即:西尼罗病毒和登革4病毒)相比是减毒的,但是减毒的机理还没有被充分地阐明,并且没有发现该病毒作为疫苗的实际应用。
发明内容
当按照如上所述的常规方法来开发用于各种黄病毒感染的减毒活嵌合黄病毒疫苗时,必须研究安全性,即,对黄病毒物种的每一组合构建的嵌合病毒的脑神经毒性、从外周向中枢神经系统的感染性(脑神经侵袭力)、防止感染的作用、中和抗体产生等,并且这可能是延长减毒活疫苗实际应用时间段的主要原因。
此外,活疫苗当为了安全性进行减毒时还引起了抗体生产力降低的问题。这是因为通过如上所述修饰E蛋白来进行的病毒的减毒可能降低了其作为疫苗的免疫诱导潜力(抗体生产力),因为诱导中和抗体的抗原决定簇存在于E蛋白中。
因此,本发明的目标是提供在除E蛋白以外的部分具有减毒突变的减毒嵌合黄病毒。
本发明人进行坚持不懈地研究来解决以上所述的问题,发现用于猪的日本脑炎病毒疫苗ML-17毒株具有在除E蛋白以外的prM和NS蛋白中大量的对于ML-17毒株固有的氨基酸突变,并且获得了这些氨基酸突变可能涉及ML-17毒株减毒的暗示。根据这个发现,本发明人被启发来构建具有包含对于ML-17毒株固有的一个或多个氨基酸突变的日本脑炎病毒的除E蛋白以外的结构蛋白和非结构蛋白(即:C蛋白、prM蛋白和NS蛋白)的嵌合黄病毒,进行进一步的研究,并开发了本发明。
因此,该本发明提供了:
[1]包含编码日本脑炎病毒的衣壳蛋白、前膜蛋白和非结构蛋白的核苷酸序列,和编码第二个黄病毒的包膜蛋白的核苷酸序列的核酸分子,其中编码日本脑炎病毒的前膜蛋白和/或非结构蛋白的核苷酸序列包含产生一个或多个减毒病毒的氨基酸突变的核苷酸突变。
[2]在[1]中所述的核酸分子,其中所述日本脑炎病毒是ML-17毒株。
[3]在[1]或[2]中所述的核酸分子,其中所述第二个黄病毒选自西尼罗病毒、登革1-4病毒、黄热病毒、圣路易脑炎病毒、蜱传脑炎病毒、库宁病毒、中欧脑炎病毒、贾萨努尔森林病毒、墨累山谷脑炎病毒、鄂木斯克出血热病毒、玻瓦散病毒、俄罗斯春夏季脑炎病毒、Yokose病毒、Apoi病毒和Aroa病毒。
[4]由在[1]到[3]任何一项中所述核酸分子编码的减毒嵌合黄病毒。
[5]包含在[4]中所述的减毒嵌合黄病毒的减毒活疫苗。
[6]制备在[1]中所述核酸分子的方法,其包含下列步骤:
将在包含编码日本脑炎病毒的核苷酸序列的核酸分子中编码包膜蛋白的核苷酸序列替换为编码第二个黄病毒的包膜蛋白的核苷酸序列的步骤;和
将产生一个或多个减毒病毒的氨基酸突变的核苷酸突变导入编码日本脑炎病毒的前膜蛋白和/或非结构蛋白的核苷酸序列中的步骤。
[7]制备在[1]中所述核酸分子的方法,其包含下列步骤:
将在包含编码日本脑炎病毒的核苷酸序列的核酸分子中编码包膜蛋白的核苷酸序列替换为编码第二个黄病毒的包膜蛋白的核苷酸序列的步骤,所述日本脑炎病毒在前膜蛋白和/或非结构蛋白中具有一个或多个减毒病毒的氨基酸突变。
[8]制备减毒嵌合黄病毒的方法,其包含从在[1]到[3]任何一项中所述核酸分子表达嵌合黄病毒蛋白的步骤。
[9]包含编码日本脑炎病毒的核苷酸序列的核酸分子,所述日本脑炎病毒在前膜蛋白和/或非结构蛋白中具有一个或多个减毒病毒的氨基酸突变。
[10]包含在[9]中所述核酸分子的载体。
[11]由在[9]中所述核酸分子编码的减毒日本脑炎病毒。
[12]制备在[9]中所述核酸分子的方法,其包含将产生一个或多个减毒病毒的氨基酸突变的核苷酸突变导入在包含编码日本脑炎病毒的核苷酸序列的核酸分子中编码前膜蛋白和/或非结构蛋白的核苷酸序列中的步骤。
[13]制备减毒日本脑炎病毒的方法,其包含从在[9]中所述核酸分子表达日本脑炎病毒蛋白的步骤。
根据本发明,提供了在除E蛋白以外的部分具有减毒突变的减毒嵌合黄病毒。因为可以不修饰E蛋白来实现本发明的嵌合黄病毒的减毒,所以用于各种黄病毒感染的减毒活疫苗可以短时间内达到实际应用,而不会降低免疫诱导潜力。
附图简述
图1显示了在日本脑炎病毒L-17毒株和JaOH0566毒株之间的基因组cDNA核苷酸序列和多蛋白氨基酸序列的差异。Nt表示核苷酸;AA表示氨基酸。氨基酸位置用数字显示,从紧挨着多蛋白的起始甲硫氨酸的氨基酸开始计数。
图2显示了在日本脑炎病毒ML-17毒株的多蛋白中从亲本毒株(JaOH0566毒株)突变的氨基酸(用黑体字母标示)与在JaOH0566毒株、JaOArS982毒株、JaGAr01毒株、Nakayama毒株、Beijing毒株、SA14毒株和SA14-14-2毒株的多蛋白中相应的氨基酸相比较的结果。氨基酸位置用数字显示,从紧挨着多蛋白的起始甲硫氨酸的氨基酸开始计数。*表示在标示的位置上的氨基酸与ML-17毒株的氨基酸相同。
图3显示了用长-PCR方法制备重组日本脑炎病毒MS-14毒株和MS-15毒株的全长cDNA的方法的概述。
图4显示了用长-PCR方法制备ML-17/WN(E基因)嵌合黄病毒的cDNA的方法的概述。
实施本发明的最佳方式
本发明提供了包含编码日本脑炎病毒(第一黄病毒)的C蛋白、prM蛋白和NS蛋白的核苷酸序列,和编码第二个黄病毒的E蛋白的核苷酸序列的核酸分子。优选地,本发明提供了包含日本脑炎病毒的5′非翻译区的核苷酸序列,编码C蛋白、prM蛋白和NS蛋白的核苷酸序列,和3′未翻译区的核苷酸序列,以及编码第二个黄病毒的E蛋白的核苷酸序列的核酸分子。此外,该核酸分子在编码日本脑炎病毒的prM蛋白和/或NS蛋白的核苷酸序列中,包含产生一个或多个如下所述能减毒病毒的氨基酸突变的核苷酸突变。
本发明也提供了由这样的核酸分子编码的嵌合黄病毒。
在本说明书中,“核酸分子”表示单链或双链DNA或RNA。
在本说明书中,除非另作说明,“核苷酸序列”表示脱氧核糖核苷酸序列(用A、G、C和T显示)或核糖核苷酸序列(用A、G、C和U显示)。
在本说明书中,除非另作说明,对于单链核苷酸序列来说,左侧末端表示5′末端,且右侧末端表示3′末端;对于氨基酸序列来说,左侧末端表示N末端(氨基末端),且右侧末端表示C末端(羧基末端)。
在本说明书中,除非另作说明,对于氨基酸来说,用标准指示系统中的单字母缩写或三字母缩写来表示氨基酸。
在本说明书中,“减毒”表示病毒是低毒力的(低毒性),从而该病毒能作为疫苗安全地用于动物受试者来接种疫苗(例如,人和非人哺乳动物(例如,猴、马、牛、羊、猪、狗、猫、兔、大鼠、小鼠等)和鸟等)。
在本说明书中,“病毒能被作为疫苗(安全地)使用”表示在疫苗接种的部位观察到病毒的生长,但是最终没有明显的严重症状,而且给予了预防由在随后的用抗疫苗接种的个体的病毒进行的攻击试验中接种的高毒力病毒所引起的疾病发作的特异性免疫。
使用日本脑炎病毒作为第一黄病毒来制备本发明的嵌合黄病毒,所述日本脑炎病毒不受限制,只要本发明的嵌合黄病毒的prM蛋白和/或NS蛋白最终包含如下所述减毒病毒的一个或多个氨基酸突变。
因此,可使用日本脑炎病毒的各种毒株(例如,ML-17毒株、JaOH0566毒株、JaOArS982毒株、JaGAr01毒株、Nakayama毒株、Beijing毒株、SA14毒株、SA14-14-2毒株等)的任何毒株。
已经克隆了各种日本脑炎病毒毒株的基因组,并测定了其完整的或部分的核苷酸序列。参见,例如,对于JaOArS982毒株,Sumiyoshi等人,Virology161:497-510,1987;对于Nakayama毒株,McAda等人,Virology 158:348-360,1987;对于Beijing毒株,Hashimoto等人,Virus Genes 1:305-317,1988;对于SA14毒株和SA14-14-2毒株,Nitayaphan等人,Virology 177:541-552,1990。
关于各种日本脑炎病毒的基因组序列的信息还可以从公众可访问的基因数据库例如GenBank中获得。参见,例如,对于JaOArS982毒株,GenBank登录号:M18370;对于JaGAr01毒株,GenBank登录号:AF069076;对于Beijing-1毒株,GenBank登录号:L48961;对于SA14毒株,GenBank登录号:M55506;对于SA14-14-2毒株,GenBank登录号:AF315119。
为了制备包含编码本发明的嵌合病毒的核苷酸序列的核酸分子,制备了包含编码日本脑炎病毒的核苷酸序列的核酸分子,且其可被用作嵌合黄病毒的基因主链。作为包含编码日本脑炎病毒的核苷酸序列的核酸分子的例子,可提及基因组RNA、cDNA、合成RNA、合成DNA等。
另外,为了制备包含编码本发明的嵌合黄病毒的核苷酸序列的核酸分子,可以使用包含编码日本脑炎病毒的核苷酸序列的核酸分子的任何片段(例如,PCR扩增的DNA片段等)。
可以用一般已知的方法从用日本脑炎病毒感染的细胞(例如,MMC-LK2细胞、HeLa细胞、N2a细胞、PS细胞、BSC-1细胞、HL-CZ细胞、LLC-MK2细胞、Vero细胞、BHK细胞、来源于蚊的C6/36细胞、来源于小鼠或仓鼠的脑内细胞)、生长中的鸡蛋等中制备日本脑炎病毒的基因组RNA。用于制备基因组RNA的日本脑炎病毒毒株不受限制;例如,可使用如上所列的毒株。
可以根据一般的已知方法(例如,在Sumiyoshi等人,Virology 161:497-510,1987中所述的方法)由基因组RNA构建日本脑炎病毒的cDNA。
另外,也可以根据日本脑炎病毒的基因组序列信息,用一般的已知方法来化学合成日本脑炎病毒的基因组RNA或cDNA或其任意片段。
可以用聚合酶链反应(缩写为“PCR方法”),反转录酶-聚合酶链反应(缩写为“RT-PCR方法”);长聚合酶链反应(缩写为“长-PCR方法”);和/或长-反转录酶-聚合酶链反应(缩写为“长-RT-PCR方法”)以基因组RNA或cDNA作为模板来扩增日本脑炎病毒的基因组RNA或cDNA或其任意片段。
另外,还可以将日本脑炎病毒的基因组RNA或cDNA或其任意片段插入到载体中,并克隆。
作为使用的载体的例子,可以提及质粒例如pBR322、pBR325、pBR327、pBR328、pUC7、pUC8、pUC9、pCU18、pUC19、pHSG298、pHSG299、pSC101、pGBM5和pCRII。作为克隆载体,还可以使用噬菌体、粘粒、噬菌粒等。这些克隆载体是从,例如,日本基因有限公司(NIPPON GENE CO.,LTD.)等在商业上可购买到的。
作为用于本发明的第二个黄病毒,可以提及除日本脑炎病毒以外的黄病毒,例如,西尼罗病毒、登革1-4病毒、黄热病毒、圣路易脑炎病毒、蜱传脑炎病毒、库宁病毒、中欧脑炎病毒、贾萨努尔森林病毒、墨累山谷脑炎病毒、鄂木斯克出血热病毒、玻瓦散病毒、俄罗斯春夏季脑炎病毒、Yokose病毒、Apoi病毒、Aroa病毒等。对于这些黄病毒物种,也可使用其各种突变毒株的任意毒株。
编码如上所述的各种黄病毒的E蛋白的核苷酸序列和氨基酸序列是通常已知的,而且,对于许多这些黄病毒物种来说,已经报道了其基因组的整个核苷酸序列。参见,例如,以下:西尼罗病毒(例如,Wengler等人,Virology147:264-274,1985);登革-1病毒(例如,Mason等人,Virology 161:262-267,1987);登革-2病毒(例如,Deubel等人,Virology 155:365-377,1986;Gruenberg等人,J.Gen.Virol.69:1391-1398,1988;Hahn等人,Virology 162:167-180,1988);登革-3病毒(例如,Osatomi等人,Virus Genes 2:99-108,1988);登革-4病毒(例如,Mackow等人,Virology 159:217-228,1987;Zhao等人,Virology 155:77-88,1986);黄热病毒(例如,Rice等人,Science 229:726-733,1985);圣路易脑炎病毒(例如,Trent等人,Virology 156:293-304,1987);蜱传脑炎病毒(例如,Mandl等人,Virology 166:197-205,1988);库宁病毒(例如,Coia等人,J.Gen.Virol.69(Pt1):1-21,1988);贾萨努尔森林病毒(例如,Venugopal等人,Journal of General Virology 75:227-232,1994;Kuno等人,Journal of Virology72:73-83,1998);墨累山谷脑炎病毒(例如,Dalgarno等人,J.Mol.Biol.187:309-323,1986);鄂木斯克出血热病毒(例如,Lin等人,Virology 313:81-90,2003;Li等人,Journal ofGeneral Virology 85:1619-1624,2004;Gritsun等人,Journal of General Virology 74:287-291,1993);玻瓦散病毒(例如,Kuno等人,Am.J.Trop.Med.Hyg.65:671-676,2001;Mandl等人,Virology 194:173-184,1993);俄罗斯春夏季脑炎病毒(例如,Kuno等人,Journal of Virology 72:73-83,1998);Apoi病毒(例如,Billoir等人,Journal of General Virology 81:781-790,2000);Aroa病毒(例如,Gaunt等人,Journal of General Virology 82:1867-1976,2001)。
关于各种黄病毒基因组的核苷酸序列的信息还可以从公众可访问的基因数据库例如GenBank中获得。参见,例如,以下:西尼罗病毒(例如,GenBank登录号:M12294;NC_001563);登革-1病毒(例如,GenBank登录号:M23027);登革-2病毒(例如,GenBank登录号:M19197;NC_001474);登革-3病毒(例如,GenBank登录号:M93130);登革-4病毒(例如,GenBank登录号:M14931);黄热病毒(例如,GenBank登录号:X03700;NC_002031);圣路易脑炎病毒(例如,GenBank登录号:M16614);蜱传脑炎病毒(例如,GenBank登录号:U27495;NC_001672);库宁病毒(例如,GenBank登录号:AY274504;AY274505);贾萨努尔森林病毒(例如,GenBank登录号:X74111);墨累山谷脑炎病毒(例如,GenBank登录号:AF161266;NC_000943);鄂木斯克出血热病毒(例如,GenBank登录号:AY193805;AY438626;X66694;NC_005062);玻瓦散病毒(例如,GenBank登录号:AF310922;AF310920;AF310912;L06436;NC_003687);Yokose病毒(例如,GenBank登录号:AB114858;NC_005039);Apoi病毒(例如,GenBank登录号:AF160193;NC_003676);Aroa病毒(例如,GenBank登录号:AF372413)。
为了制备包含编码本发明的嵌合病毒的核苷酸序列的核酸分子,制备了包含编码第二个黄病毒的核苷酸序列的核酸分子,并可使用编码其E蛋白的区域。作为包含编码第二个黄病毒的核苷酸序列的核酸分子的例子,可提及基因组RNA、cDNA、合成RNA、合成DNA等。
另外,为了制备包含编码本发明的嵌合黄病毒的核苷酸序列的核酸分子,可以使用包含编码第二个黄病毒的核苷酸序列的核酸分子的任何片段(例如,PCR扩增的DNA片段等)。
第二个黄病毒的基因组RNA可以通过与日本脑炎病毒相同的方法来制备。
第二个黄病毒的cDNA还可以可以通过与日本脑炎病毒相同的技术由基因组RNA来构建。
另外,第二个黄病毒的基因组RNA或cDNA或其任意片段还可以基于通常已知的关于用作第二个黄病毒的病毒的基因组序列信息用通常已知的方法来化学合成。
可以如在日本脑炎病毒中那样,用PCR方法、RT-PCR方法、长-PCR方法和/或长-RT-PCR方法,以基因组RNA或cDNA为模板,来扩增第二个黄病毒的基因组RNA或cDNA或其任意片段。
另外,如在日本脑炎病毒中那样,第二个黄病毒的基因组RNA或cDNA或其任意片段还可以插入到如上所列的适当的载体中,并克隆。
通过将编码E蛋白的核苷酸序列替换为编码第二个黄病毒的包膜蛋白的核苷酸序列,来制备包含编码本发明的嵌合黄病毒的核苷酸序列的核酸分子(DNA或RNA),所述的编码E蛋白的核苷酸序列是在包含编码日本脑炎病毒的核苷酸序列的核酸分子中的。
可以通过通常已知的重组技术来进行在包含编码日本脑炎病毒的核苷酸序列的核酸分子中的编码E蛋白的区域的置换(例如,利用Morita等人,Virology287:417-426,2001等所描述的长-PCR的方法)。
另外,包含编码本发明的嵌合黄病毒的核苷酸序列的核酸分子(DNA或RNA)还可以通过化学合成来制备,其通过基于有关日本脑炎病毒和第二个黄病毒的基因组序列信息,设计编码本发明的嵌合黄病毒的整个核苷酸序列来实现。
当以DNA的形式制备了包含编码本发明的嵌合黄病毒的核苷酸序列的核酸分子时,将用于体外转录的启动子序列导入DNA的5′末端。在表达嵌合黄病毒蛋白之前的任意阶段,将编码本发明的嵌合黄病毒的DNA用与导入的启动子相应的RNA聚合酶转录为RNA。作为使用的启动子序列的例子,可以提及T7 RNA聚合酶启动子、SP6 RNA聚合酶启动子等。
用本领域通常已知的基因导入技术,例如转染、电穿孔或显微注射,将编码本发明的嵌合黄病毒的RNA导入适于蛋白表达的细胞中(例如,C6/36细胞、Vero细胞、BHK细胞、MMC-LK2细胞、HeLa细胞、N2a细胞、PS细胞等),并在这些细胞中表达嵌合黄病毒蛋白。
另外,对于本发明的嵌合黄病毒,可以通过利用不使用细胞体系的蛋白质生产方法来获得嵌合黄病毒蛋白,其中通过向细胞匀浆物或提取液中加入底物、酶等来在试管中提供生物的遗传信息翻译系统(也称为无细胞体系表达;参见,例如,US专利号:5,478,730;Madin等人,Proc.Natl.Acad.Sci.USA97:559-564,2000;Sawasaki等人,Proc.Natl.Acad.Sci.USA 99:14652-14657,2002)。
包含编码本发明的嵌合黄病毒的核苷酸序列的核酸分子在该核酸分子中编码来源于日本脑炎病毒的prM蛋白和/或NS蛋白的核苷酸序列中包含产生一个或多个减毒病毒的氨基酸突变的核苷酸突变。
详细来说,当用日本脑炎病毒JaOArS982毒株的单独的结构蛋白和非结构蛋白的氨基酸序列表达(参见H.Sumiyoshi等人,Virology 161:497-510,1987)作为对照时,减毒病毒的氨基酸突变是下列氨基酸的取代:prM蛋白的第1个甲硫氨酸被异亮氨酸取代;prM蛋白的第148个(M蛋白的第56个)天冬酰胺被苏氨酸取代;NS2A蛋白的第4个丙氨酸被丝氨酸取代;NS4B蛋白的第51个天冬酰胺被赖氨酸取代;NS4B蛋白的第52个缬氨酸被异亮氨酸取代;NS4B蛋白的第68个苏氨酸被丝氨酸取代;NS5蛋白的第126个亮氨酸被甲硫氨酸取代;和/或NS5蛋白的第854个丝氨酸被天冬酰胺取代。
另外,还可以利用对于在各自氨基酸位置导入的氨基酸的保守氨基酸来进行以上所述的氨基酸取代。
在本说明书中,“保守氨基酸”表示在物理化学的性质方面彼此相类似的氨基酸;其例子包括分类在相同组中的氨基酸,例如芳香族氨基酸(Phe、Trp、Tyr),脂族氨基酸(Ala、Leu、Ile、Val),极性氨基酸(Gln、Asn),碱性氨基酸(Lys、Arg、His),酸性氨基酸(Glu、Asp),具有羟基的氨基酸(Ser、Thr)和具有小侧链的氨基酸(Gly、Ala、Ser、Thr、Met)。
优选地,在所述的氨基酸取代中,本发明的嵌合黄病毒至少包含prM蛋白的第1个甲硫氨酸被异亮氨酸所取代和/或prM蛋白的第148个(M蛋白的第56个)天冬酰胺被苏氨酸所取代。
可以在任一步骤中将以上所述的氨基酸突变导入,来制备包含编码本发明的嵌合黄病毒的核苷酸序列的核酸分子。
可以利用,例如Morita等人,Virology 287:417-426,2001所描述的利用长-PCR方法的定点诱变方法,或通过将通常已知的例如Kunkel法或带缺口的双链体法的方法或利用这些方法的诱变试剂盒(可以从例如,Takara Bio Inc.处获得)等应用到克隆到质粒的cDNA,来导入以上所述的氨基酸突变。
另外,可以通过使用包含编码在prM蛋白和/或NS蛋白中具有一个或多个减毒病毒的氨基酸突变的日本脑炎病毒的核苷酸序列的核酸分子,来制备包含编码本发明的嵌合黄病毒的核苷酸序列的核酸分子,来将这些突变导入本发明的嵌合黄病毒中。
使用上述的通常已知的定点诱变方法,可以通过向不包含这些突变的日本脑炎病毒毒株(例如,JaOH0566毒株、JaOArS982毒株、JaGAr01毒株、Nakayama毒株、Beijing毒株、SA14毒株SA14-14-2毒株等)的基因组RNA或cDNA中人工导入这些突变,来制备包含编码在prM蛋白和/或NS蛋白中具有一个或多个减毒病毒的氨基酸突变的日本脑炎病毒的核苷酸序列的核酸分子。
可以通过从包含编码在prM蛋白和/或NS蛋白中具有一个或多个减毒病毒的氨基酸突变的日本脑炎病毒的核苷酸序列的核酸分子表达病毒蛋白,来制备重组日本脑炎病毒,所述核酸分子是通过如上所述通常已知的方法人工制备的。
通过用该重组日本脑炎病毒感染如上所述的适当细胞,并培养该细胞,可以从培养细胞中制备大量的基因组RNA。
此外,该重组日本脑炎病毒本身是减毒的日本脑炎病毒,并可以是日本脑炎减毒活疫苗的有希望的候选物。
还可以通过获得来自天然包含一个或多个这样的氨基酸突变的日本脑炎病毒突变毒株的基因组RNA或cDNA,来制备包含编码在prM蛋白和/或NS蛋白中具有一个或多个减毒病毒的氨基酸突变的日本脑炎病毒的核苷酸序列的核酸分子。
作为在prM蛋白和/或NS蛋白中天然包含一个或多个减毒病毒的氨基酸突变的日本脑炎病毒突变毒株的例子,可以提及日本脑炎病毒ML-17毒株。ML-17毒株是猪的日本脑炎病毒疫苗毒株(参见,例如,Yoshida等人,BIKENJOURNAL 24:47-67,1981),并由The Research Foundation for Microbial Diseasesof Osaka University(在Osaka University建立,3-1,Yamadaoka,Suita-shi,Osaka)得到。
另外,可通过通常已知的方法,从细胞等中的传代得到日本脑炎病毒突变毒株之中选择在prM蛋白和/或NS蛋白中具有一个或多个减毒病毒的氨基酸突变的毒株,并且其基因组RNA或cDNA还可以用来制备本发明的嵌合黄病毒。
这样制备的包含编码在prM蛋白和/或NS蛋白中具有一个或多个减毒病毒的氨基酸突变的日本脑炎病毒的核苷酸序列的核酸分子(基因组RNA或者cDNA),可在进行所需断裂之后插入到如上所述的克隆载体中。例如,利用这样的重组载体,可以安全而方便地制备本发明的嵌合黄病毒。
可以通过例如,获得来自制备的嵌合病毒的基因,确定其碱基序列的全部或部分或相应的氨基酸序列(例如,已经导入突变的部分、两个病毒的连接处等),并确认该序列与预期的序列相配,来确认本发明的嵌合黄病毒的构建。
本发明也提供了包含本发明的嵌合黄病毒的减毒活疫苗(在下文中也称为本发明的疫苗)。因为病毒的E蛋白包含诱导中和抗体的抗原决定簇,所以本发明的嵌合黄病毒作为疫苗来接种导致产生抗用作E蛋白的第二个黄病毒物种的抗体。
为了防止各种黄病毒感染,本发明的疫苗可以接种到,例如,动物,例如人和非人哺乳动物(例如:猴、马、牛、羊、猪、狗、猫、兔、大鼠、小鼠等),以及鸟类。
本发明的疫苗可以以悬浮液或冻干制剂的形式生产。除了本发明的嵌合黄病毒之外,本发明的疫苗可以包含,药学上可接受的稳定剂(例如,蔗糖、乳糖、葡萄糖、明胶、明胶水解产物、L-谷氨酸钠、血清白蛋白等)、安抚剂(例如,葡萄糖等)等共同用于疫苗制剂中。
在接种后,本发明的疫苗一般可以溶解或悬浮在药学上可接受的载体中。作为载体的例子,可以提及液体载体,例如水、盐水(包括生理盐水)和缓冲溶液(例如,磷酸盐缓冲液)。
本发明的疫苗代表性地开处方为每0.1到1.0ml剂量包含102到106 PFU的本发明的嵌合黄病毒的无菌水溶液,而且可以通过例如皮下、皮内、肌内等进行接种。
另外,因为已知一些黄病毒经由粘膜感染,所以本发明的疫苗可根据选择的第二个黄病毒物种经口或经鼻施用。
此外,包含编码本发明的嵌合黄病毒的核苷酸序列的核酸分子(RNA或DNA)本身还可以用作核酸疫苗制剂。
构造的嵌合病毒的功效和安全性可以用本领域通常已知的评价方法,通过例如,评价在动物(例如,小鼠、猴等)中的脑神经毒性、从外周向中枢神经系统的感染性(脑神经侵袭力)、防止感染的作用、病毒血症(viremia)的存在或不存在、中和抗体产生等证实。
在下文中通过以下实施例更详细地描述本发明,但是其仅仅用于说明的目的,而不限制本发明的范围。
实施例
(实施例1)
(日本脑炎病毒疫苗ML-17毒株减毒突变位点的确定)
为了确定日本脑炎病毒疫苗ML-17毒株的减毒突变位点,首先,比较在ML-17毒株和其亲本毒株,即,野生型毒性的日本脑炎病毒JaOH0566毒株之间的全长基因组cDNA的核苷酸序列和多蛋白的氨基酸序列。
根据Sumiyoshi等人,Virology 161:497-510,1987公开的方法,确定ML-17毒株的全长基因组cDNA的核苷酸序列(获得自The Research Foundation forMicrobial Diseases of Osaka University)(SEQ ID NO:1)和JaOH0566毒株的全长基因组cDNA的核苷酸序列(SEQ ID NO:3)。对于两个毒株,使用表1A所示的PCR引物来构建cDNA,并使用表1B所示的测序引物来确定构建的cDNA的核苷酸序列。此外,ML-17毒株的多蛋白的氨基酸序列(SEQ ID NO:2)和JaOH0566毒株的多蛋白的氨基酸序列(SEQ ID NO:4)是由各自的全长基因组cDNA的核苷酸序列所推导出来的。
(表1A)
日本脑炎病毒(JEV)的PCR引物
(表1B)
用于日本脑炎病毒(JEV)测序的引物
*:F表示正向引物;R表示反向引物。
作为基因组cDNA的核苷酸序列比较的结果,发现如图1所示,ML-17毒株具有25个核苷酸取代,由于10个这些核苷酸取代而在自紧挨着多蛋白的起始甲硫氨酸的氨基酸开始计数的第127位、第274位、第1209位、第2462位、第2463位、第2479位、第2652位、第2751位、第2896位和第3380位产生了10个氨基酸取代。
此外,将在ML-17毒株的多蛋白中的如上所述的氨基酸突变与其它野生型毒性的日本脑炎病毒JaOArS982毒株、JaGAr01毒株、Nakayama毒株、Beijing毒株和SA14毒株,以及另一日本脑炎病毒疫苗SA14-14-2毒株的多蛋白相应位点的氨基酸序列相比较。
以下毒株的多蛋白的推定的氨基酸序列获得自GenBank,并用于这个实施例:JaOArS982毒株的多蛋白的氨基酸序列(GenBank登录号:M18370);JaGAr01毒株的多蛋白的氨基酸序列(GenBank登录号:AF069076);Beijing-1毒株的多蛋白的氨基酸序列(GenBank登录号:L48961);SA14毒株的多蛋白的氨基酸序列(GenBank登录号:M55506);SA14-14-2毒株的多蛋白的氨基酸序列(GenBank登录号:AF315119)。
对于Nakayama毒株的多蛋白的氨基酸序列,比较所需的区域的氨基酸序列获得自GenBank登录号:AF112297;McAda等人,Virology.158(2):348-60,1987等所描述的基因和/或氨基酸部分序列的信息,并用于这个实施例。
如图2所示,作为多蛋白的氨基酸序列比较的结果,发现在如上所述的在ML-17毒株和JaOH0566毒株之间的10个氨基酸取代之中,在自紧挨着多蛋白的起始甲硫氨酸的氨基酸开始计数的第127位、第274位、第1209位、第2462位、第2463位、第2479位、第2652位和第3380位的总共八个氨基酸取代是ML-17毒株固有的氨基酸突变。
通过与在Sumiyoshi等人,Virology 161:497-510,1987所描述的日本脑炎病毒(JaOArS982毒株)的多蛋白中的单独的结构蛋白和非结构蛋白的位置进行比较,发现在多蛋白中这八个氨基酸取代的位置分别对应于prM蛋白的第1位和第148位氨基酸(M蛋白的第56位);NS2A蛋白的第4位氨基酸;NS4B蛋白的第51位、第52位和第68位氨基酸;NS5蛋白的第126位和第854位氨基酸。
(实施例2)
(重组日本脑炎病毒MS-14毒株和MS-15毒株的制备)
为了证实ML-17毒株固有的氨基酸突变对于病毒减毒的作用,根据Morita等人,Virology 287:417-426,2001所描述的利用长-PCR法进行定点诱变的方法,使用野生型毒性的日本脑炎病毒JaOArS982毒株基因作为主链,制备了MS-14毒株和MS-15毒株,其中MS-14毒株为整合了产生prM蛋白的第1位甲硫氨酸被异亮氨酸取代(在基因组的479位上G被A所取代)的核苷酸突变的重组日本脑炎病毒,而MS-15毒株为整合了产生prM蛋白的第148位(M蛋白的第56位)天冬酰胺被苏氨酸取代(在基因组的919位上A被C所取代)的核苷酸突变的重组日本脑炎病毒。
(MS-14毒株的制备)
如图3所示,以日本脑炎病毒(JaOArS982毒株)的基因RNA为模板,使用一组引物1和引物2、一组引物3和引物4以及一组引物5和引物6,通过长-RT-PCR法,制备了对应于各自的引物组的日本脑炎病毒基因片段(片段1、2和3)。
此外,在用琼脂糖电泳将每个片段纯化之后,首先,以片段1和片段2为模板,使用一组引物1和引物4,再一次通过长-PCR法,制备基因片段4。以相同的方式纯化该基因片段;然后,以基因片段3和片段4为模板,使用一组引物1和引物6,进行长-PCR法来制备在其5′末端具有T7启动子序列的全长日本脑炎病毒cDNA(片段5)。
以该全长的日本脑炎病毒cDNA(片段5)作为模板,进行体外RNA合成反应来制备人造的全长日本脑炎病毒基因RNA。将该RNA用电穿孔法导入蚊子细胞(C6/36细胞)中,并将细胞培养5天,之后回收在培养物上清液中出现的重组病毒(MS-14毒株)。
(MS-15毒株的制备)
对于MS-15毒株,根据如上所述的方法回收重组病毒,只是除了使用引物7代替引物2,并用引物8代替引物3来制备重组病毒。
此外,对于对照,使用引物9代替引物2,并用引物10代替引物3来获得无突变的病毒。
用于该实施例的引物如表2所示。
在上表中,S表示有义序列;R表示互补序列。下划线表示T7启动子序列;双下划线表示核苷酸突变。
(实施例3)
(重组日本脑炎病毒MS-14毒株和MS-15毒株在BHK细胞中的生长潜力的评价)
评价了在实施例2中制备的重组日本脑炎病毒MS-14毒株和MS-15毒株在BHK细胞中的生长潜力。对于对照,使用了野生型毒性的日本脑炎病毒JaOArS982毒株。
用每个MS-14毒株、MS-15毒株和JaOArS982毒株感染BHK细胞;在感染后24、48和72小时,收集细胞培养物的上清液,并检查病毒的出现。每次通过噬斑法来测定在培养物的上清液中的病毒含量。对于所有毒株,证实了其在BHK细胞中的生长。
(实施例4)
(重组日本脑炎病毒MS-14毒株和MS-15毒株在正常成熟小鼠中的神经毒性的评价)
使用实施例2所制备的重组日本脑炎病毒MS-14毒株和MS-15毒株,通过腹膜内接种的方法来评价每个毒株在正常成熟小鼠中的神经毒性。对于毒性对照,使用了野生型毒性的日本脑炎病毒JaOArS982毒株;对于减毒对照,使用了日本脑炎病毒疫苗ML-17毒株。
通过脑内接种,将101到106蚀斑形成单位的MS-14毒株、MS-15毒株、ML-17毒株或JaOArS982毒株给给予十只小鼠(4周龄雄性和雌性ICR小鼠,每组5只),并持续3周每日观察一次,并通过Read-Muench法计算LD50。该试验的结果如表3所示。
(表3)
*:MS-14rev表示使用基于长-PCR的定点诱变,由MS-14制备的MS-14的回复体。
如表3所示,MS-14毒株和MS-15毒株与毒性的JaOArS982毒株相比明显减毒。尤其是,MS-14毒株具有与作为日本脑炎病毒疫苗毒株的ML-17毒株相似的毒力。
这证明可以通过在prM蛋白中的氨基酸取代来减少在小鼠中的神经毒性。
从以上的结果,说明导入重组日本脑炎病毒MS-14毒株和MS-15毒株的氨基酸突变可能对制备日本脑炎活疫苗有用。此外,因为在高剂量接种中MS-14显示出程度低的致病性,说明在NS中的突变也可能是减毒所必需的。
(实施例5)
(日本脑炎病毒和西尼罗病毒的嵌合黄病毒(ML-17/WN(E基因))的cDNA的构建)
为了制备其中日本脑炎病毒疫苗ML-17毒株的E蛋白被西尼罗病毒的E蛋白所取代的嵌合黄病毒ML-17/WN(E基因),根据在Morita等人,Virology287:417-426,2001所描述的利用长-PCR法制备重组日本脑炎病毒的方法来构建ML-17/WN(E基因)的cDNA。
如图4所示,以日本脑炎病毒活疫苗毒株(ML-17毒株)的基因RNA作为模板,使用一组引物1A和引物2A和一组引物5A和引物6A,通过长-RT-PCR法,制备对应于各自引物组的日本脑炎病毒基因片段(片段1A和3A)。
同样,以西尼罗病毒(NY99-35262-11毒株)的基因RNA作为模板,使用一组引物3A和引物4A,通过长-RT-PCR法,制备西尼罗病毒基因片段2A。
在通过琼脂糖电泳纯化每个片段之后,首先,以片段1A和2A作为模板,使用一组引物1A和引物4A,再一次通过长-RT-PCR法,制备基因片段4A。
以同样方式纯化该基因片段;然后,以基因片段3A和4A作为模板,使用一组引物1A和引物6A,进行长-RT-PCR法来制备ML-17/WN(E基因)的嵌合病毒cDNA(片段5A),所述cDNA在其5′末端具有T7启动子序列,并具有西尼罗病毒E蛋白基因。
用于该实施例的引物如表4所示。
*:JE表示日本脑炎病毒JaOAr82毒株;ML17表示日本脑炎病毒ML-17毒株的序列;WN表示西尼罗病毒的序列。S或N表示有义序列;R表示互补序列。下划线表示T7启动子序列。
(实施例6)
(来自ML-17/WN(E基因)的cDNA的嵌合黄病毒的制备)
以在实施例5中构建的日本脑炎病毒和西尼罗病毒的嵌合黄病毒的cDNA(在图4中的片段5A)作为模板,进行体外RNA合成反应来制备人工的全长嵌合病毒基因RNA。通过电穿孔的方法将该RNA导入蚊子细胞(C6/36细胞)中,并将细胞培养5天,之后回收在培养物上清液中出现的嵌合病毒。
(实施例7)
(ML-17/WN(E基因)嵌合黄病毒的生长潜力的评价)
使用的嵌合病毒是在实施例6中获得的ML-17/WN(E基因)。在C6/36细胞和BHK细胞中培养ML-17/WN(E基因)毒株和ML-17毒株,并分配感染的培养液,且在-70℃下储存。对于两种类型的细胞,使用补加有非必需氨基酸,并在根据需要的浓度(2到10%)下添加胎牛血清的Eagle MEM培养液。在培养实验中培养液的感染性效价如表5所示。ML-17/WN(E基因)具有相当于ML-17的生长潜力。
表5在C6/36细胞和BHK细胞中ML-17/WN(E基因)嵌合黄病毒和ML-17的生长
(实施例8)
(ML-17/WN(E基因)嵌合病毒的神经毒性的评价)
使用在实施例6中制备的重组嵌合病毒ML-17/WN(E基因),通过脑内接种法在正常成熟的小鼠中评价神经毒性。也通过腹膜内施用来检查从外周向中枢神经系统的感染性(脑神经侵袭力)。对于对照,使用了野生型高毒性的毒株日本脑炎病毒JaOH0566毒株、西尼罗病毒NY99-3562-11毒株和减毒毒株日本脑炎病毒疫苗ML-17毒株。
通过脑内接种向出生后4周的C57BL/B6小鼠给予101到107的每一病毒,并观察28天。通过Read-Muench法计算LD50。
表6显示从实验中获得的ML-17/WN(E基因)的LD50。与高毒性的JaOH0566毒株相比,ML-17/WN(E基因)明显减毒。尤其是,ML-17/WN(E基因)具有类似于日本脑炎病毒疫苗毒株ML-17毒株的毒力。
表6在C57BL/B6小鼠中ML-17/WN(E基因)的LD50
(实施例9)
(ML-17/WN(E基因)嵌合病毒的性质)
研究了ML-17/WN(E基因)嵌合病毒对于蚊子的感染性。用跨膜法使三带喙库蚊(Culex tritaeniorhynchus)OK7能吮吸在兔血(抗体阴性)中的病毒混合物。通过评价PFU(蚀斑形成单位)和在乳鼠中的发病率,来计算感染率。
在将用跨膜法使其能吮吸嵌合病毒的蚊子饲养10天和21天之后,使其乳化,并通过在乳鼠中脑内接种和PFU评价来尝试验证病毒的存在。即使当使每只蚊子吮吸至少102 PFU ML-17/WN(E基因)病毒时,也没有绝对的证据证明病毒的存在。
ML-17/WN(E基因)嵌合病毒不显示出对三带喙库蚊的易感性。
(实施例10)
(包含ML-17/WN(E基因)嵌合病毒的活疫苗的防御效力的评价)
使用2周龄的C57BL/B6和C3H/He小鼠进行免疫实验。
使用两个自交系的小鼠来进行研究。用西尼罗病毒(NY99-35262-11毒株)通过脑内攻击来致死性感染非免疫的对照组,而通过腹膜内接种给予ML-17/WN(E基因)的组显示出对于感染的明显防御。
工业实用性
根据本发明,提供了在除了E蛋白之外的部分具有减毒突变的减毒嵌合黄病毒。因为不修饰E蛋白而实现本发明的嵌合黄病毒的减毒,所以短时间就可以将用于各种黄病毒感染的减毒活疫苗进行实际应用,而不降低免疫诱导的潜力。
本申请是根据在日本递交的专利申请No.2004-374630的,其内容全部引入此处作为参考。
序列表
<110>The Research Foundation for Microbial Diseases of OsakaUniversity
<120>具有减毒日本脑炎病毒基因作为主链的减毒嵌合黄病毒
<130>09839
<150>JP2004-374630
<151>2004-12-24
<160>63
<170>PatentIn version 3.3
<210>1
<211>10976
<212>DNA
<213>日本脑炎病毒
<220>
<221>CDS
<222>(96)..(10391)
<400>1
agaagtttat ctgtgtgaac ttcttggctt agtatcgttg agaagaatcg agagattagt 60
gcagtttaaa cagtttttta gaacggaaga taacc atg act aaa aaa cca gga 113
Met Thr Lys Lys Pro Gly
1 5
ggg ccc ggt aaa aac cgg gct atc aat atg ctg aaa cgc ggc cta ccc 161
Gly Pro Gly Lys Asn Arg Ala Ile Asn Met Leu Lys Arg Gly Leu Pro
10 15 20
cgc gta ttc cca cta gtg gga gtg aag agg gta gta atg agc ttg ttg 209
Arg Val Phe Pro Leu Val Gly Val Lys Arg Val Val Met Ser Leu Leu
25 30 35
gac ggc aga ggg cca gta cgt ttc gtg ctg gct ctt atc acg ttc ttc 257
Asp Gly Arg Gly Pro Val Arg Phe Val Leu Ala Leu Ile Thr Phe Phe
40 45 50
aag ttt aca gca tta gcc ccg acc aag gcg ctt tta ggc cga tgg aaa 305
Lys Phe Thr Ala Leu Ala Pro Thr Lys Ala Leu Leu Gly Arg Trp Lys
55 60 65 70
gca gtg gaa aag agt gta gca atg aaa cat ctc act agt ttc aaa cga 353
Ala Val Glu Lys Ser Val Ala Met Lys His Leu Thr Ser Phe Lys Arg
75 80 85
gaa ctt gga aca ctc att gac gcc gtg aac aag cgg ggc aga aag caa 401
Glu Leu Gly Thr Leu Ile Asp Ala Val Asn Lys Arg Gly Arg Lys Gln
90 95 100
aac aaa aga gga gga aat gaa ggc tca atc atg tgg ctt gcg agc ttg 449
Asn Lys Arg Gly Gly Asn Glu Gly Ser Ile Met Trp Leu Ala Ser Leu
105 110 115
gca gtt gtc ata gct tgt gca gga gcc ata aag ttg tca aat ttc cag 497
Ala Val Val Ile Ala Cys Ala Gly Ala Ile Lys Leu Ser Asn Phe Gln
120 125 130
ggg aag ctt ttg atg acc att aac aac acg gac att gca gac gtt atc 545
Gly Lys Leu Leu Met Thr Ile Asn Asn Thr Asp Ile Ala Asp Val Ile
135 140 145 150
gta att ccc acc tca aaa gga gag aac aga tgc tgg gtc cgg gca atc 593
Val Ile Pro Thr Ser Lys Gly Glu Asn Arg Cys Trp Val Arg Ala Ile
155 160 165
gac gtc ggc tac atg tgt gag gac act atc acg tac gaa tgt cct aag 641
Asp Val Gly Tyr Met Cys Glu Asp Thr Ile Thr Tyr Glu Cys Pro Lys
170 175 180
ctt gcc atg ggc aat gat cca gag gat gtg gac tgc tgg tgt gac aac 689
Leu Ala Met Gly Asn Asp Pro Glu Asp Val Asp Cys Trp Cys Asp Asn
185 190 195
caa gaa gtc tac gtc caa tat gga cgg tgc acg cgg acc agg cat tcc 737
Gln Glu Val Tyr Val Gln Tyr Gly Arg Cys Thr Arg Thr Arg His Ser
200 205 210
aag cga agc agg aga tcc gtg tcg gtc caa aca cat ggg gag agt tca 785
Lys Arg Ser Arg Arg Ser Val Ser Val Gln Thr His Gly Glu Ser Ser
215 220 225 230
cta gtg aat aaa aaa gag gct tgg ctg gat tca acg aaa gcc aca cga 833
Leu Val Asn Lys Lys Glu Ala Trp Leu Asp Ser Thr Lys Ala Thr Arg
235 240 245
tat ctc atg aaa act gag aac tgg atc ata agg aat cct ggc tat gct 881
Tyr Leu Met Lys Thr Glu Asn Trp Ile Ile Arg Asn Pro Gly Tyr Ala
250 255 260
ttc ctg gcg gcg gta ctc ggc tgg atg ctt ggc agt acc aac ggt caa 929
Phe Leu Ala Ala Val Leu Gly Trp Met Leu Gly Ser Thr Asn Gly Gln
265 270 275
cgc gtg gta ttc acc atc ctc ctg ctg ctg gtc gct ccg gct tac agt 977
Arg Val Val Phe Thr Ile Leu Leu Leu Leu Val Ala Pro Ala Tyr Ser
280 285 290
ttt aat tgt ctg gga atg ggc aat cgt gac ttc ata gaa gga gcc agt 1025
Phe Asn Cys Leu Gly Met Gly Asn Arg Asp Phe Ile Glu Gly Ala Ser
295 300 305 310
gga gcc act tgg gtg gac ttg gtg cta gaa gga gat agc tgc ttg aca 1073
Gly Ala Thr Trp Val Asp Leu Val Leu Glu Gly Asp Ser Cys Leu Thr
315 320 325
att atg gca aac gac aaa cca aca ttg gac gtc cgc atg atc aac atc 1121
Ile Met Ala Asn Asp Lys Pro Thr Leu Asp Val Arg Met Ile Asn Ile
330 335 340
gaa gct agc caa ctt gcc gag gtt aga agt tac tgt tat cat gct tca 1169
Glu Ala Ser Gln Leu Ala Glu Val Arg Ser Tyr Cys Tyr His Ala Ser
345 350 355
gtc act gac atc tcg acg gtg gct cgg tgc ccc acg act gga gaa gcc 1217
Val Thr AspIle Ser Thr Val Ala Arg Cys Pro Thr Thr Gly Glu Ala
360 365 370
cac aac gag aag cga gct gat agt agc tat gtg tgc aaa caa ggc ttc 1265
His Asn Glu Lys Arg Ala Asp Ser Ser Tyr Val Cys Lys Gln Gly Phe
375 380 385 390
act gat cgt ggg tgg ggc aac gga tgt gga ctt ttc ggg aag gga agc 1313
Thr Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
395 400 405
att gac aca tgt gca aaa ttc tcc tgc acc agc aaa gcg att ggg aga 1361
Ile Asp Thr Cys Ala Lys Phe Ser Cys Thr Ser Lys Ala Ile Gly Arg
410 415 420
aca atc cag cca gaa aac atc aaa tac aaa gtt ggc att ttt gtg cat 1409
Thr Ile Gln Pro Glu Asn Ile Lys Tyr Lys Val Gly Ile Phe Val His
425 430 435
gga gcc act act tcg gaa aac cat ggg aat tat tca gcg caa gtt ggg 1457
Gly Ala Thr Thr Ser Glu Asn His Gly Asn Tyr Ser Ala Gln Val Gly
440 445 450
gcg tcc cag gcg gca aag ttc aca gta aca ccc aat gct cct tcg ata 1505
Ala Ser Gln Ala Ala Lys Phe Thr Val Thr Pro Asn Ala Pro Ser Ile
455 460 465 470
acc ctc aaa ctt ggt gac tac gga gaa gtc aca ctg gac tgt gag cca 1553
Thr Leu Lys Leu Gly Asp Tyr Gly Glu Val Thr Leu Asp Cys Glu Pro
475 480 485
agg agt gga ctg aac act gaa gcg ttt tac gtc atg acc gtg ggg tca 1601
Arg Ser Gly Leu Asn Thr Glu Ala Phe Tyr Val Met Thr Val Gly Ser
490 495 500
aag tca ttt ctg gtc cat agg gaa tgg ttt cat gac ctc gct ctc ccc 1649
Lys Ser Phe Leu Val His Arg Glu Trp Phe His Asp Leu Ala Leu Pro
505 510 515
tgg acg tcc cct tcg agc aca gcg tgg aga aac aga gaa ctc ctc atg 1697
Trp Thr Ser Pro Ser Ser Thr Ala Trp Arg Asn Arg Glu Leu Leu Met
520 525 530
gag ttt gaa gag gcg cac gcc aca aaa cag tcc gtt gtt gct ctt ggg 1745
Glu Phe Glu Glu Ala His Ala Thr Lys Gln Ser Val Val Ala Leu Gly
535 540 545 550
tca cag gaa gga ggc ctc cat cag gcg ttg gca gga gcc atc gtg gtg 1793
Ser Gln Glu Gly Gly Leu His Gln Ala Leu Ala Gly Ala Ile Val Val
555 560 565
gag tac tca agt tca gtg aag tta aca tca ggc cac ctg aaa tgt agg 1841
Glu Tyr Ser Ser Ser Val Lys Leu Thr Ser Gly His Leu Lys Cys Arg
570 575 580
ctg aaa atg gac aaa ctg gct ctg aaa ggc aca acc tat ggc atg tgc 1889
Leu Lys Met Asp Lys Leu Ala Leu Lys Gly Thr Thr Tyr Gly Met Cys
585 590 595
aca gaa aaa ttc tcc ttc gcg aaa aat ccg gcg gac act ggt cac ggg 1937
Thr Glu Lys Phe Ser Phe Ala Lys Asn Pro Ala Asp Thr Gly His Gly
600 605 610
aca gtt gtc att gaa ctc tcc tac tct ggg agt gat ggc ccc tgc aaa 1985
Thr Val Val Ile Glu Leu Ser Tyr Ser Gly Ser Asp Gly Pro Cys Lys
615 620 625 630
att ccg att gtc tcc gtt gcg agc ctc aat gac atg acc ccc gtc ggg 2033
Ile Pro Ile Val Ser Val Ala Ser Leu Asn Asp Met Thr Pro Val Gly
635 640 645
cgg ctg gtg aca gtg aac ccc ttc gtc gcg act tcc agt gcc aat tca 2081
Arg Leu Val Thr Val Asn Pro Phe Val Ala Thr Ser Ser Ala Asn Ser
650 655 660
aag gtg ctg gtc gag atg gaa ccc ccc ttc gga gac tcc tac atc gta 2129
Lys Val Leu Val Glu Met Glu Pro Pro Phe Gly Asp Ser Tyr Ile Val
665 670 675
gtt gga cgg gga gac aag cag atc aac cac cat tgg cat aaa gct gga 2177
Val Gly Arg Gly Asp Lys Gln Ile Asn His His Trp His Lys Ala Gly
680 685 690
agc acg ctg ggc aaa gcc ttt tca aca act ttg aag gga gct cag aga 2225
Ser Thr Leu Gly Lys Ala Phe Ser Thr Thr Leu Lys Gly Ala Gln Arg
695 700 705 710
ctg gca gcg ctg ggt gac aca gcc tgg gac ttt ggc tcc att gga ggg 2273
Leu Ala Ala Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Ile Gly Gly
715 720 725
gtc ttc aac tcc ata gga aaa gcc gtt cac caa gtg ttt ggt ggt gcc 2321
Val Phe Asn Ser Ile Gly Lys Ala Val His Gln Val Phe Gly Gly Ala
730 735 740
ttc aga aca ctc ttc ggg gga atg tct tgg atc aca caa ggg cta atg 2369
Phe Arg Thr Leu Phe Gly Gly Met Ser Trp Ile Thr Gln Gly Leu Met
745 750 755
ggt gcc cta cta ctc tgg atg ggc gtc aac gca cga gac cga tca att 2417
Gly Ala Leu Leu Leu Trp Met Gly Val Asn Ala Arg Asp Arg Ser Ile
760 765 770
gct ttg gcc ttc tta gcc aca gga ggt gtg ctc gtg ttt tta gcg acc 2465
Ala Leu Ala Phe Leu Ala Thr Gly Gly Val Leu Val Phe Leu Ala Thr
775 780 785 790
aat gtg cat gct gac act gga tgt gcc att gac atc aca aga aaa gag 2513
Asn Val His Ala Asp Thr Gly Cys Ala Ile Asp Ile Thr Arg Lys Glu
795 800 805
atg agg tgt gga agt ggc atc ttc gtg cac aac gac gtg gaa gcc tgg 2561
Met Arg Cys Gly Ser Gly Ile Phe Val His Asn Asp Val Glu Ala Trp
810 815 820
gtg gat agg tat aaa tat ttg cca gaa acg ccc aga tcc cta gca aag 2609
Val Asp Arg Tyr Lys Tyr Leu Pro Glu Thr Pro Arg Ser Leu Ala Lys
825 830 835
atc gtc cac aaa gcg cac aag gaa ggc gtg tgc gga gtc aga tct gtc 2657
Ile Val His Lys Ala His Lys Glu Gly Val Cys Gly Val Arg Ser Val
840 845 850
act aga ctg gag cat caa atg tgg gaa gcc gta cgg gat gaa ttg aac 2705
Thr Arg Leu Glu His Gln Met Trp Glu Ala Val Arg Asp Glu Leu Asn
855 860 865 870
gtc ctg ctc aaa gag aat gca gtg gac ctc agt gtg gtt gtg aac aag 2753
Val Leu Leu Lys Glu Asn Ala Val Asp Leu Ser Val Val Val Asn Lys
875 880 885
ccc gtg ggg aga tat cgc tca gcc cct aaa cgc ctg tcc atg acg caa 2801
Pro Val Gly Arg Tyr Arg Ser Ala Pro Lys Arg Leu Ser Met Thr Gln
890 895 900
gag aag ttt gaa atg ggc tgg aaa gca tgg gga aaa agc att ctc ttt 2849
Glu Lys Phe Glu Met Gly Trp Lys Ala Trp Gly Lys Ser Ile Leu Phe
905 910 915
gcc ccg gaa ttg gcc aac tcc aca ttt gtc gta gat gga cct gag aca 2897
Ala Pro Glu Leu Ala Asn Ser Thr Phe Val Val Asp Gly Pro Glu Thr
920 925 930
aag gaa tgc cct gat gag cac aga gct tgg aac agc atg caa atc gaa 2945
Lys Glu Cys Pro Asp Glu His Arg Ala Trp Asn Ser Met Gln Ile Glu
935 940 945 950
gac ttc ggc ttt ggc atc aca tca acc cgt gtg tgg ctg aag att aga 2993
Asp Phe Gly Phe Gly Ile Thr Ser Thr Arg Val Trp Leu Lys Ile Arg
955 960 965
gag gag agc act gac gag tgt gat gga gcg atc ata ggt acg gct gtc 3041
Glu Glu Ser Thr Asp Glu Cys Asp Gly Ala Ile Ile Gly Thr Ala Val
970 975 980
aaa gga cat gtg gca gtc cat agt gac ttg tcg tac tgg att gag agt 3089
Lys Gly His Val Ala Val His Ser Asp Leu Ser Tyr Trp Ile Glu Ser
985 990 995
cgc tac aac gac aca tgg aaa ctt gag agg gca gtc ttt gga gaa 3134
Arg Tyr Asn Asp Thr Trp Lys Leu Glu Arg Ala Val Phe Gly Glu
1000 1005 1010
gtt aaa tcc tgc act tgg cca gag aca cac acc cta tgg gga gat 3179
Val Lys Ser Cys Thr Trp Pro Glu Thr His Thr Leu Trp Gly Asp
1015 1020 1025
ggt gtt gag gaa agt gaa ctc atc atc ccg cac acc ata gcc gga 3224
Gly Val Glu Glu Ser Glu Leu Ile Ile Pro His Thr Ile Ala Gly
1030 1035 1040
cca aaa agc aag cat aat cgg agg gaa gga tat aag aca caa aac 3269
Pro Lys Ser Lys His Asn Arg Arg Glu Gly Tyr Lys Thr Gln Asn
1045 1050 1055
cag gga cct tgg gac gag aat ggc ata gtc ttg gac ttt gac tat 3314
Gln Gly Pro Trp Asp Glu Asn Gly Ile Val Leu Asp Phe Asp Tyr
1060 1065 1070
tgc cca ggg aca aaa gtc acc att aca gag gat tgt ggc aag aga 3359
Cys Pro Gly Thr Lys Val Thr Ile Thr Glu Asp Cys Gly Lys Arg
1075 1080 1085
ggc cct tcg gtc aga acc act act gac agt gga aag ttg atc act 3404
Gly Pro Ser Val Arg Thr Thr Thr Asp Ser Gly Lys Leu Ile Thr
1090 1095 1100
gac tgg tgc tgt cgc agt tgc tcc ctt ccg ccc cta cga ttc cgg 3449
Asp Trp Cys Cys Arg Ser Cys Ser Leu Pro Pro Leu Arg Phe Arg
1105 1110 1115
aca gaa aat ggc tgc tgg tac gga atg gaa atc aga cct gtc agg 3494
Thr Glu Asn Gly Cys Trp Tyr Gly Met Glu Ile Arg Pro Val Arg
1120 1125 1130
cat gat gaa aca aca ctc gtc aga tcg cag gtt gat gct ttt aat 3539
His Asp Glu Thr Thr Leu Val Arg Ser Gln Val Asp Ala Phe Asn
1135 1140 1145
ggt gaa atg gtt gac cct ttt cag ctg ggc ctt ctg gtg atg ttt 3584
Gly Glu Met Val Asp Pro Phe Gln Leu Gly Leu Leu Val Met Phe
1150 1155 1160
ctg gcc acc cag gag gtc ctt cgc aag agg tgg acg gcc aga ttg 3629
Leu Ala Thr Gln Glu Val Leu Arg Lys Arg Trp Thr Ala Arg Leu
1165 1170 1175
acc att cct gcg gtt ttg ggg gcc cta ctt gtg ctg atg ctt ggg 3674
Thr Ile Pro Ala Val Leu Gly Ala Leu Leu Val Leu Met Leu Gly
1180 1185 1190
ggc atc act tac act gat ttg gcg agg tat gtg gtg cta gtc gct 3719
Gly Ile Thr Tyr Thr Asp Leu Ala Arg Tyr Val Val Leu Val Ala
1195 1200 1205
gcc tct ttc gca gag gcc aac agt gga gga gat gtc ctg cac ctt 3764
Ala Ser Phe Ala Glu Ala Asn Ser Gly Gly Asp Val Leu His Leu
1210 1215 1220
gct ttg att gcc gtt ttc aag atc caa cca gca ttt tta gtg atg 3809
Ala Leu Ile Ala Val Phe Lys Ile Gln Pro Ala Phe Leu Val Met
1225 1230 1235
aac atg ctt agc acg aga tgg acg aac caa gaa aac gtg gtt ctg 3854
Asn Met Leu Ser Thr Arg Trp Thr Asn Gln Glu Asn Val Val Leu
1240 1245 1250
gtc cta ggg gct gcc ttt ttc caa ttg gcc tca gta gat ctg caa 3899
Val Leu Gly Ala Ala Phe Phe Gln Leu Ala Ser Val Asp Leu Gln
1255 1260 1265
ata gga gtt cac gga atc ctg aat gcc gcc gct ata gca tgg atg 3944
Ile Gly Val His Gly Ile Leu Asn Ala Ala Ala Ile Ala Trp Met
1270 1275 1280
att gtc cgg gcg atc acc ttc ccc aca acc tcc tcc gtc acc atg 3989
Ile Val Arg Ala Ile Thr Phe Pro Thr Thr Ser Ser Val Thr Met
1285 1290 1295
cca gtc tta gcg ctt cta act ccg gga atg agg gct cta tac cta 4034
Pro Val Leu Ala Leu Leu Thr Pro Gly Met Arg Ala Leu Tyr Leu
1300 1305 1310
gat act tac aga atc atc ctc ctc gtc ata ggg att tgc tct ctg 4079
Asp Thr Tyr Arg Ile Ile Leu Leu Val Ile Gly Ile Cys Ser Leu
1315 1320 1325
ctg caa gag agg aaa aag acc atg gca aaa aag aaa gga gct gta 4124
Leu Gln Glu Arg Lys Lys Thr Met Ala Lys Lys Lys Gly Ala Val
1330 1335 1340
ctc ttg ggc tta gcg ctc aca tcc act gga tgg ttt tcg ccc acc 4169
Leu Leu Gly Leu Ala Leu Thr Ser Thr Gly Trp Phe Ser Pro Thr
1345 1350 1355
act ata gct gcc gga cta atg gtc tgc aac cca aac aag aag aga 4214
Thr Ile Ala Ala Gly Leu Met Val Cys Asn Pro Asn Lys Lys Arg
1360 1365 1370
ggg tgg cca gct act gag ttt ttg tcg gca gtt gga ttg atg ttt 4259
Gly Trp Pro Ala Thr Glu Phe Leu Ser Ala Val Gly Leu Met Phe
1375 1380 1385
gcc atc gta ggt ggt ttg gcg gag ttg gat att gaa tcc atg tca 4304
Ala Ile Val Gly Gly Leu Ala Glu Leu Asp Ile Glu Ser Met Ser
1390 1395 1400
ata ccc ttc atg ctg gca ggt ctc atg gca gtg tcc tac gtg gtg 4349
Ile Pro Phe Met Leu Ala Gly Leu Met Ala Val Ser Tyr Val Val
1405 1410 1415
tca gga aaa gca aca gat atg tgg ctt gaa cgg gct gcc gac atc 4394
Ser Gly Lys Ala Thr Asp Met Trp Leu Glu Arg Ala Ala Asp Ile
1420 1425 1430
agc tgg gag atg gat gct gca atc aca gga agc agt cgg agg ctg 4439
Ser Trp Glu Met Asp Ala Ala Ile Thr Gly Ser Ser Arg Arg Leu
1435 1440 1445
gat gtg aag cta gat gat gac gga gat ttt cac ttg att gac gat 4484
Asp Val Lys Leu Asp Asp Asp Gly Asp Phe His Leu Ile Asp Asp
1450 1455 1460
ccc ggt gtt cca tgg aag gtc tgg gtc ctg cgc atg tct tgc att 4529
Pro Gly Val Pro Trp Lys Val Trp Val Leu Arg Met Ser Cys Ile
1465 1470 1475
ggg tta gcc gcc ctc acg cct tgg gcc att gtt ccc gcc gct ttt 4574
Gly Leu Ala Ala Leu Thr Pro Trp Ala Ile Val Pro Ala Ala Phe
1480 1485 1490
ggt tat tgg ctc act tta aaa aca aca aaa aga ggg ggc gtg ttt 4619
Gly Tyr Trp Leu Thr Leu Lys Thr Thr Lys Arg Gly Gly Val Phe
1495 1500 1505
tgg gac acg cca tcc cca aaa cct tgc tca aaa gga gac acc act 4664
Trp Asp Thr Pro Ser Pro Lys Pro Cys Ser Lys Gly Asp Thr Thr
1510 1515 1520
aca gga gtt tac cgc att atg gct aga ggg att ctt ggc act tac 4709
Thr Gly Val Tyr Arg Ile Met Ala Arg Gly Ile Leu Gly Thr Tyr
1525 1530 1535
cag gcc ggc gtc gga gtc atg tac gag aat gtt ttc cac aca cta 4754
Gln Ala Gly Val Gly Val Met Tyr Glu Asn Val Phe His Thr Leu
1540 1545 1550
tgg cac aca act aga gga gca gct att atg agt gga gaa gga aaa 4799
Trp His Thr Thr Arg Gly Ala Ala Ile Met Ser Gly Glu Gly Lys
1555 1560 1565
ttg acg cca tac tgg ggt agt gtg aaa gaa gac cgc ata gct tac 4844
Leu Thr Pro Tyr Trp Gly Ser Val Lys Glu Asp Arg Ile Ala Tyr
1570 1575 1580
gga ggc cca tgg agg ttt gat cga aaa tgg aat gga act gat gac 4889
Gly Gly Pro Trp Arg Phe Asp Arg Lys Trp Asn Gly Thr Asp Asp
1585 1590 1595
gtg caa gtg atc gtg gta gaa ccg ggg aag gct gca gta aac atc 4934
Val Gln Val Ile Val Val Glu Pro Gly Lys Ala Ala Val Asn Ile
1600 1605 1610
cag aca aaa cca gga gtg ttt cgg act ccc ttc ggg gag gtt ggg 4979
Gln Thr Lys Pro Gly Val Phe Arg Thr Pro Phe Gly Glu Val Gly
1615 1620 1625
gct gtt agt ctg gat tat ccg cga gga aca tcc ggc tca ccc att 5024
Ala Val Ser Leu Asp Tyr Pro Arg Gly Thr Ser Gly Ser Pro Ile
1630 1635 1640
ctg gat tcc aat gga gac atc ata ggc ctg tac ggc aat gga gtt 5069
Leu Asp Ser Asn Gly Asp Ile Ile Gly Leu Tyr Gly Asn Gly Val
1645 1650 1655
gag ctt ggc gat ggc tca tac gtc agc gcc atc gtg cag ggt gac 5114
Glu Leu Gly Asp Gly Ser Tyr Val Ser Ala Ile Val Gln Gly Asp
1660 1665 1670
cgt cag gag gaa cca gtc cca gaa gct tac acc cca aac atg ttg 5159
Arg Gln Glu Glu Pro Val Pro Glu Ala Tyr Thr Pro Asn Met Leu
1675 1680 1685
aga aag aga cag atg acc gta cta gat ttg cac cct ggt tca ggg 5204
Arg Lys Arg Gln Met Thr Val Leu Asp Leu His Pro Gly Ser Gly
1690 1695 1700
aaa acc aag aaa att ctg cca caa ata att aag gac gct att cag 5249
Lys Thr Lys Lys Ile Leu Pro Gln Ile Ile Lys Asp Ala Ile Gln
1705 1710 1715
cag cgc cta aga aca gct gtg ttg gca ccg acg cgg gtg gta gca 5294
Gln Arg Leu Arg Thr Ala Val Leu Ala Pro Thr Arg Val Val Ala
1720 1725 1730
gca gaa atg gca gaa gct ttg aga ggg ctc cca gta cga tat caa 5339
Ala Glu Met Ala Glu Ala Leu Arg Gly Leu Pro Val Arg Tyr Gln
1735 1740 1745
act tca gca gtg cag aga gag cac caa ggg aat gaa ata gtg gat 5384
Thr Ser Ala Val Gln Arg Glu His Gln Gly Asn Glu Ile Val Asp
1750 1755 1760
gtg atg tgc cac gcc act ctg acc cat aga ctg atg tca ccg aac 5429
Val Met Cys His Ala Thr Leu Thr His Arg Leu Met Ser Pro Asn
1765 1770 1775
aga gtg ccc aac tac aac cta ttt gtc atg gat gaa gct cat ttc 5474
Arg Val Pro Asn Tyr Asn Leu Phe Val Met Asp Glu Ala His Phe
1780 1785 1790
acc gac cca gcc agt ata gcc gca cga gga tac att gct acc aag 5519
Thr Asp Pro Ala Ser Ile Ala Ala Arg Gly Tyr Ile Ala Thr Lys
1795 1800 1805
gtg gaa tta ggg gag gca gca gcc atc ttt atg aca gcg acc ccg 5564
Val Glu Leu Gly Glu Ala Ala Ala Ile Phe Met Thr Ala Thr Pro
1810 1815 1820
cct gga acc acg gat cct ttt cct gac tca aat gcc cca atc cat 5609
Pro Gly Thr Thr Asp Pro Phe Pro Asp Ser Asn Ala Pro Ile His
1825 1830 1835
gat ttg caa gat gag ata cca gac agg gcg tgg agc agt gga tac 5654
Asp Leu Gln Asp Glu Ile Pro Asp Arg Ala Trp Ser Ser Gly Tyr
1840 1845 1850
gaa tgg atc aca gaa tat gcg gga aaa acc gtg tgg ttt gtg gca 5699
Glu Trp Ile Thr Glu Tyr Ala Gly Lys Thr Val Trp Phe Val Ala
1855 1860 1865
agc gtg aaa atg ggg aac gag att gca atg tgc ctc caa aga gcg 5744
Ser Val Lys Met Gly Asn Glu Ile Ala Met Cys Leu Gln Arg Ala
1870 1875 1880
ggg aaa aag gtc atc caa ctc aac cgc aag tcc tat gac aca gaa 5789
Gly Lys Lys Val Ile Gln Leu Asn Arg Lys Ser Tyr Asp Thr Glu
1885 1890 1895
tac cca aaa tgt aag aat gga gac tgg gat ttt gtc atc acc act 5834
Tyr Pro Lys Cys Lys Asn Gly Asp Trp Asp Phe Val Ile Thr Thr
1900 1905 1910
gac att tct gaa atg ggg gcc aac ttc ggt gcg agc agg gtc atc 5879
Asp Ile Ser Glu Met Gly Ala Asn Phe Gly Ala Ser Arg Val Ile
1915 1920 1925
gac tgt aga aag agc gtg aag ccc acc atc tta gaa gag gga gaa 5924
Asp Cys Arg Lys Ser Val Lys Pro Thr Ile Leu Glu Glu Gly Glu
1930 1935 1940
ggc aga gtc atc ctc gga aac cca tcg ccc ata acc agt gca agc 5969
Gly Arg Val Ile Leu Gly Asn Pro Ser Pro Ile Thr Ser Ala Ser
1945 1950 1955
gca gct caa cgg agg ggc aga gta ggc aga aac cct aac cag gtt 6014
Ala Ala Gln Arg Arg Gly Arg Val Gly Arg Asn Pro Asn Gln Val
1960 1965 1970
gga gat gaa tac cac tat ggg ggg gcc acc agt gaa gat gac agt 6059
Gly Asp Glu Tyr His Tyr Gly Gly Ala Thr Ser Glu Asp Asp Ser
1975 1980 1985
aac cta gcc cat tgg aca gag gca aag atc atg tta gat aac ata 6104
Asn Leu Ala His Trp Thr Glu Ala Lys Ile Met Leu Asp Asn Ile
1990 1995 2000
cac atg ccc aat gga ctg gtg gcc cag ctc tat gga cca gag agg 6149
His Met Pro Asn Gly Leu Val Ala Gln Leu Tyr Gly Pro Glu Arg
2005 2010 2015
gaa aag gcc ttc aca atg gat ggc gaa tac cgt ctc aga ggt gaa 6194
Glu Lys Ala Phe Thr Met Asp Gly Glu Tyr Arg Leu Arg Gly Glu
2020 2025 2030
gaa aag aaa aac ttc tta gag ctg ctt agg acg gct gac ctc ccg 6239
Glu Lys Lys Asn Phe Leu Glu Leu Leu Arg Thr Ala Asp Leu Pro
2035 2040 2045
gtg tgg ctg gcc tac aag gtg gcg tcc aat ggc atc cag tac acc 6284
Val Trp Leu Ala Tyr Lys Val Ala Ser Asn Gly Ile Gln Tyr Thr
2050 2055 2060
gat aga aag tgg tgt ttt gat ggg ccg cgt acg aat gcc ata ctg 6329
Asp Arg Lys Trp Cys Phe Asp Gly Pro Arg Thr Asn Ala Ile Leu
2065 2070 2075
gag gac aac acc gag gta gag ata gtc acc cgg atg ggt gag agg 6374
Glu Asp Asn Thr Glu Val Glu Ile Val Thr Arg Met Gly Glu Arg
2080 2085 2090
aaa atc ctc aag ccg aga tgg ctt gat gca aga gtt tat gca gat 6419
Lys Ile Leu Lys Pro Arg Trp Leu Asp Ala Arg Val Tyr Ala Asp
2095 2100 2105
cac caa gct ctc aag tgg ttc aaa gac ttc gca gca gga aag aga 6464
His Gln Ala Leu Lys Trp Phe Lys Asp Phe Ala Ala Gly Lys Arg
2110 2115 2120
tca gcc gtt agc ttc ata gag gtg ctc ggt cgt atg cct gag cat 6509
Ser Ala Val Ser Phe Ile Glu Val Leu Gly Arg Met Pro Glu His
2125 2130 2135
ttc atg gga aag acg cgg gaa gct tta gac acc atg tac ttg gtt 6554
Phe Met Gly Lys Thr Arg Glu Ala Leu Asp Thr Met Tyr Leu Val
2140 2145 2150
gca acg gct gag aaa ggt ggg aaa gca cac cga atg gct ctc gaa 6599
Ala Thr Ala Glu Lys Gly Gly Lys Ala His Arg Met Ala Leu Glu
2155 2160 2165
gag ctg cca gat gca ctg gaa acc att aca ctt att gtt gct atc 6644
Glu Leu Pro Asp Ala Leu Glu Thr Ile Thr Leu Ile Val Ala Ile
2170 2175 2180
act gtg atg aca gga gga ttc ttt cta ctc atg atg cag cga aag 6689
Thr Val Met Thr Gly Gly Phe Phe Leu Leu Met Met Gln Arg Lys
2185 2190 2195
ggt ata ggg aag atg ggt ctt gga gct cta gtg ctc acg cta gct 6734
Gly Ile Gly Lys Met Gly Leu Gly Ala Leu Val Leu Thr Leu Ala
2200 2205 2210
acc ttc ttc ctg tgg gcg gca gag gtt ccc gga aca aaa ata gca 6779
Thr Phe Phe Leu Trp Ala Ala Glu Val Pro Gly Thr Lys Ile Ala
2215 2220 2225
ggg acc ctg ctg atc gcc ctg ctg ctt atg gtg gtt ctc atc cca 6824
Gly Thr Leu Leu Ile Ala Leu Leu Leu Met Val Val Leu Ile Pro
2230 2235 2240
gaa ccg gaa aag cag agg tca caa aca gat aat caa ctg gcg gtg 6869
Glu Pro Glu Lys Gln Arg Ser Gln Thr Asp Asn Gln Leu Ala Val
2245 2250 2255
ttt ctc atc tgt gtc ttg acc gtg gtt gga gtg gtg gca gca aac 6914
Phe Leu Ile Cys Val Leu Thr Val Val Gly Val Val Ala Ala Asn
2260 2265 2270
gag tac ggg atg cta gaa aaa acc aaa gca gac ctc aag agc atg 6959
Glu Tyr Gly Met Leu Glu Lys Thr Lys Ala Asp Leu Lys Ser Met
2275 2280 2285
ttt ggc gga aag acg cag gca tca gga ctg act gga tta cca agc 7004
Phe Gly Gly Lys Thr Gln Ala Ser Gly Leu Thr Gly Leu Pro Ser
2290 2295 2300
atg gca ctg gac ctg cgt cca gcc aca gct tgg gca ctg tat ggg 7049
Met Ala Leu Asp Leu Arg Pro Ala Thr Ala Trp Ala Leu Tyr Gly
2305 2310 2315
ggg agc aca gtc gtg cta acc cct ctt ctg aag cac ctg atc acg 7094
Gly Ser Thr Val Val Leu Thr Pro Leu Leu Lys His Leu Ile Thr
2320 2325 2330
tcg gaa tac gtc acc aca tcg cta gcc tca att aac tca caa gct 7139
Ser Glu Tyr Val Thr Thr Ser Leu Ala Ser Ile Asn Ser Gln Ala
2335 2340 2345
ggc tca tta ttt gtc ttg cca cga ggc gtg cct ttt acc gac cta 7184
Gly Ser Leu Phe Val Leu Pro Arg Gly Val Pro Phe Thr Asp Leu
2350 2355 2360
gac ttg acc gtt ggc ctc gtc ttc ctt ggc tgt tgg ggt caa atc 7229
Asp Leu Thr Val Gly Leu Val Phe Leu Gly Cys Trp Gly Gln Ile
2365 2370 2375
acc ctc aca acg ttt ttg aca gcc atg gtt ctg gcg aca ctt cac 7274
Thr Leu Thr Thr Phe Leu Thr Ala Met Val Leu Ala Thr Leu His
2380 2385 2390
tat ggg tac atg ctc cct gga tgg caa gca gaa gca ctc agg gct 7319
Tyr Gly Tyr Met Leu Pro Gly Trp Gln Ala Glu Ala Leu Arg Ala
2395 2400 2405
gcc cag aga agg aca gcg gct gga ata atg aag aat gcc gtt gtt 7364
Ala Gln Arg Arg Thr Ala Ala Gly Ile Met Lys Asn Ala Val Val
2410 2415 2420
gac gga atg gtc gcc act gat gtg cct gaa ctg gaa agg acc act 7409
Asp Gly Met Val Ala Thr Asp Val Pro Glu Leu Glu Arg Thr Thr
2425 2430 2435
cct ctg atg caa aag aaa gtc gga cag gtg ctc ctc ata ggg gta 7454
Pro Leu Met Gln Lys Lys Val Gly Gln Val Leu Leu Ile Gly Val
2440 2445 2450
agc gtg gca gcg ttc ctc gtc aac ccc aaa atc acc act gtg aga 7499
Ser Val Ala Ala Phe Leu Val Asn Pro Lys Ile Thr Thr Val Arg
2455 2460 2465
gaa gca ggg gtg ttg gtg aca gcg gct acg ctc tct ttg tgg gac 7544
Glu Ala Gly Val Leu Val Thr Ala Ala Thr Leu Ser Leu Trp Asp
2470 2475 2480
aac gga gcc agt gcc gtt tgg aat tcc acc act gcc acg gga ctc 7589
Asn Gly Ala Ser Ala Val Trp Asn Ser Thr Thr Ala Thr Gly Leu
2485 2490 2495
tgc cat gta atg cga ggt agc tac ctg gct gga ggc tcc att gct 7634
Cys His Val Met Arg Gly Ser Tyr Leu Ala Gly Gly Ser Ile Ala
2500 2505 2510
tgg act ctc atc aag aac gct gac aag ccc tcc tta aaa agg gga 7679
Trp Thr Leu Ile Lys Asn Ala Asp Lys Pro Ser Leu Lys Arg Gly
2515 2520 2525
agg cct ggg ggc agg acg cta ggg gag cag tgg aag gaa aaa cta 7724
Arg Pro Gly Gly Arg Thr Leu Gly Glu Gln Trp Lys Glu Lys Leu
2530 2535 2540
aat gcc atg agc aga gaa gag ttt ttt aaa tac cgg aga gag gcc 7769
Asn Ala Met Ser Arg Glu Glu Phe Phe Lys Tyr Arg Arg Glu Ala
2545 2550 2555
ata atc gag gtg gac cgc act gaa gca cgc agg gct aga cgt gaa 7814
Ile Ile Glu Val Asp Arg Thr Glu Ala Arg Arg Ala Arg Arg Glu
2560 2565 2570
aat aac ata gtg gga gga cat ccg gtt tcg cga ggc tca gca aaa 7859
Asn Asn Ile Val Gly Gly His Pro Val Ser Arg Gly Ser Ala Lys
2575 2580 2585
ctc cgt tgg ctc gta gag aaa gga ttt gtc tcg cca ata gga aaa 7904
Leu Arg Trp Leu Val Glu Lys Gly Phe Val Ser Pro Ile Gly Lys
2590 2595 2600
gtc att gat cta ggg tgt ggg cgt gga gga tgg agc tac tac gca 7949
Val Ile Asp Leu Gly Cys Gly Arg Gly Gly Trp Ser Tyr Tyr Ala
2605 2610 2615
gca acc ctg aag aag gtc cag gaa gtc aga gga tac acg aaa ggt 7994
Ala Thr Leu Lys Lys Val Gln Glu Val Arg Gly Tyr Thr Lys Gly
2620 2625 2630
ggg gcg gga cat gaa gaa ccg atg ctc atg cag agc tac ggc tgg 8039
Gly Ala Gly His Glu Glu Pro Met Leu Met Gln Ser Tyr Gly Trp
2635 2640 2645
aac ctg gtc tcc atg aag agt gga gtg gac gtg ttt tac aaa cct 8084
Asn Leu Val Ser Met Lys Ser Gly Val Asp Val Phe Tyr Lys Pro
2650 2655 2660
tca gag ccc agt gac act ctg ttc tgc gac ata ggg gaa tcc tcc 8129
Ser Glu Pro Ser Asp Thr Leu Phe Cys Asp Ile Gly Glu Ser Ser
2665 2670 2675
ccg agt cca gaa gta gaa gaa caa cgc aca cta cgc gtc cta gag 8174
Pro Ser Pro Glu Val Glu Glu Gln Arg Thr Leu Arg Val Leu Glu
2680 2685 2690
atg aca tct gac tgg ttg cac cga gga cct aga gag ttc tgt ata 8219
Met Thr Ser Asp Trp Leu His Arg Gly Pro Arg Glu Phe Cys Ile
2695 2700 2705
aaa gtt ctt tgc ccc tac atg ccc aag gtt ata gaa aaa atg gaa 8264
Lys Val Leu Cys Pro Tyr Met Pro Lys Val Ile Glu Lys Met Glu
2710 2715 2720
gtc ctg caa cgc cgc ttc gga ggt ggg cta gtg cgt ctt ccc ctg 8309
Val Leu Gln Arg Arg Phe Gly Gly Gly Leu Val Arg Leu Pro Leu
2725 2730 2735
tcc cgc aac tcc aat cac gag atg tac tgg gtt agt gga gcc gct 8354
Ser Arg Asn Ser Asn His Glu Met Tyr Trp Val Ser Gly Ala Ala
2740 2745 2750
ggc aat gtg gtg cac gct gtg aac atg acc agc cag gta cta ctg 8399
Gly Asn Val Val His Ala Val Asn Met Thr Ser Gln Val Leu Leu
2755 2760 2765
ggg cga atg gat cgc aca gtg tgg aga ggg cca aag tat gag gaa 8444
Gly Arg Met Asp Arg Thr Val Trp Arg Gly Pro Lys Tyr Glu Glu
2770 2775 2780
gat gtc aac tta ggg agc gga aca aga gcc gtg gga aag gga gaa 8489
Asp Val Asn Leu Gly Ser Gly Thr Arg Ala Val Gly Lys Gly Glu
2785 2790 2795
gtc cat agc aat cag gag aaa atc aag aag aga atc cag aag ctt 8534
Val His Ser Asn Gln Glu Lys Ile Lys Lys Arg Ile Gln Lys Leu
2800 2805 2810
aaa gaa gaa ttc gcc aca acg tgg cac aaa gac cct gag cat cca 8579
Lys Glu Glu Phe Ala Thr Thr Trp His Lys Asp Pro Glu His Pro
2815 2820 2825
tac cgc act tgg aca tac cac gga agc tat gaa gtg aag gct act 8624
Tyr Arg Thr Trp Thr Tyr His Gly Ser Tyr Glu Val Lys Ala Thr
2830 2835 2840
ggc tca gct agt tct ctc gtc aac gga gtg gtg aag ctc atg agc 8669
Gly Ser Ala Ser Ser Leu Val Asn Gly Val Val Lys Leu Met Ser
2845 2850 2855
aaa cct tgg gac gcc att gcc aac gtc acc acc atg gcc atg act 8714
Lys Pro Trp Asp Ala Ile Ala Asn Val Thr Thr Met Ala Met Thr
2860 2865 2870
gac acc acc cct ttt gga cag caa aga gtt ttc aag gag aaa gtt 8759
Asp Thr Thr Pro Phe Gly Gln Gln Arg Val Phe Lys Glu Lys Val
2875 2880 2885
gac acg aag gct cct gag cca cca gct gga gct aag gaa gtg ctc 8804
Asp Thr Lys Ala Pro Glu Pro Pro Ala Gly Ala Lys Glu Val Leu
2890 2895 2900
aac gag acc acc aac tgg ctg tgg gcc tac ttg tca cgg gaa aaa 8849
Asn Glu Thr Thr Asn Trp Leu Trp Ala Tyr Leu Ser Arg Glu Lys
2905 2910 2915
aga ccc cgc ttg tgc acc aag gaa gaa ttc ata aag aaa gtc aat 8894
Arg Pro Arg Leu Cys Thr Lys Glu Glu Phe Ile Lys Lys Val Asn
2920 2925 2930
agc aac gcg gct ctt gga gca gtg ttc gct gaa cag aat caa tgg 8939
Ser Asn Ala Ala Leu Gly Ala Val Phe Ala Glu Gln Asn Gln Trp
2935 2940 2945
agc acg gcg cgt gag gct gtg gat gac ccg cgg ttt tgg gag atg 8984
Ser Thr Ala Arg Glu Ala Val Asp Asp Pro Arg Phe Trp Glu Met
2950 2955 2960
gtt gat gaa gag agg gaa aac cat ctg cga gga gag tgt cac aca 9029
Val Asp Glu Glu Arg Glu Asn His Leu Arg Gly Glu Cys His Thr
2965 2970 2975
tgt atc tat aac atg atg gga aaa aga gag aag aag cct gga gag 9074
Cys Ile Tyr Asn Met Met Gly Lys Arg Glu Lys Lys Pro Gly Glu
2980 2985 2990
ttt gga aaa gct aaa gga agc agg gcc att tgg ttc atg tgg ctt 9119
Phe Gly Lys Ala Lys Gly Ser Arg Ala Ile Trp Phe Met Trp Leu
2995 3000 3005
gga gca cgg tat cta gag ttt gaa gct ttg ggg ttc ctg aat gaa 9164
Gly Ala Arg Tyr Leu Glu Phe Glu Ala Leu Gly Phe Leu Asn Glu
3010 3015 3020
gat cat tgg ctg agc cga gag aat tca gga ggt gga gtg gaa ggc 9209
Asp His Trp Leu Ser Arg Glu Asn Ser Gly Gly Gly Val Glu Gly
3025 3030 3035
tca ggc gtc caa aag ctg gga tac atc ctc cgt gat ata gca gga 9254
Ser Gly Val Gln Lys Leu Gly Tyr Ile Leu Arg Asp Ile Ala Gly
3040 3045 3050
aag caa gga gga aaa atg tac gct gat gat acc gcc ggg tgg gac 9299
Lys Gln Gly Gly Lys Met Tyr Ala Asp Asp Thr Ala Gly Trp Asp
3055 3060 3065
act aga att acc aga act gat tta gaa aat gaa gcc aag gtg ctg 9344
Thr Arg Ile Thr Arg Thr Asp Leu Glu Asn Glu Ala Lys Val Leu
3070 3075 3080
gag ctt cta gac ggt gaa cac cgc atg ctc gcc cga gcc ata att 9389
Glu Leu Leu Asp Gly Glu His Arg Met Leu Ala Arg Ala Ile Ile
3085 3090 3095
gaa ttg act tac agg cac aaa gtg gtc aag gtc atg aga cct gca 9434
Glu Leu Thr Tyr Arg His Lys Val Val Lys Val Met Arg Pro Ala
3100 3105 3110
gca gaa gga aag acc gtg atg gac gtg ata tca agg gag gat caa 9479
Ala Glu Gly Lys Thr Val Met Asp Val Ile Ser Arg Glu Asp Gln
3115 3120 3125
agg ggg agt gga cag gtg gtc act tat gct ctt aac act ttc acg 9524
Arg Gly Ser Gly Gln Val Val Thr Tyr Ala Leu Asn Thr Phe Thr
3130 3135 3140
aac atc gct gtc cag ctc gtc agg ctg atg gag gct gag ggg gtc 9569
Asn Ile Ala Val Gln Leu Val Arg Leu Met Glu Ala Glu Gly Val
3145 3150 3155
att gga cca caa cac ttg gaa cag cta cct aga aaa aac aag ata 9614
Ile Gly Pro Gln His Leu Glu Gln Leu Pro Arg Lys Asn LysIle
3160 3165 3170
gct gtc agg acc tgg ctc ttt gag aat gga gag gag aga gtg tcc 9659
Ala Val Arg Thr Trp Leu Phe Glu Asn Gly Glu Glu Arg Val Ser
3175 3180 3185
agg atg gct atc agc gga gac gac tgt gtc gtc aag ccg ctg gac 9704
Arg Met Ala Ile Ser Gly Asp Asp Cys Val Val Lys Pro Leu Asp
3190 3195 3200
gac aga ttc gcc acg gcc ctc cac ttc ctc aac gca atg tca aag 9749
Asp Arg Phe Ala Thr Ala Leu His Phe Leu Asn Ala Met Ser Lys
3205 3210 3215
gtc aga aaa gac atc cag gaa tgg aag cct tca cat ggc tgg cac 9794
Val Arg Lys Asp Ile Gln Glu Trp Lys Pro Ser His Gly Trp His
3220 3225 3230
gat tgg cag caa gtt ccc ttc tgc tct aac cat ttt cag gag att 9839
Asp Trp Gln Gln Val Pro Phe Cys Ser Asn His Phe Gln Glu Ile
3235 3240 3245
gtg atg aaa gat gga agg agt ata gtt gtc ccg tgc aga gga cag 9884
Val Met Lys Asp Gly Arg Ser Ile Val Val Pro Cys Arg Gly Gln
3250 3255 3260
gat gag ctg ata ggc agg gct cgc atc tcc cca gga gct gga tgg 9929
Asp Glu Leu Ile Gly Arg Ala Arg Ile Ser Pro Gly Ala Gly Trp
3265 3270 3275
aat gtg aag gac aca gct tgt ctg gcc aaa gca tat gca cag atg 9974
Asn Val Lys Asp Thr Ala Cys Leu Ala Lys Ala Tyr Ala Gln Met
3280 3285 3290
tgg cta ctc cta tac ttc cat cgt agg gac ttg cgt ctc atg gca 10019
Trp Leu Leu Leu Tyr Phe His Arg Arg Asp Leu Arg Leu Met Ala
3295 3300 3305
aat gcg att tgc tca gca gtg cca gtg gat tgg gtg ccc acg ggc 10064
Asn Ala Ile Cys Ser Ala Val Pro Val Asp Trp Val Pro Thr Gly
3310 3315 3320
agg aca tcc tgg tcg ata cac tcg aaa gga gag tgg atg acc aca 10109
Arg Thr Ser Trp Ser Ile His Ser Lys Gly Glu Trp Met Thr Thr
3325 3330 3335
gaa gac atg ctg cag gtc tgg aac aga gtc tgg att gaa gaa aat 10154
Glu Asp Met Leu Gln Val Trp Asn Arg Val Trp Ile Glu Glu Asn
3340 3345 3350
gaa tgg atg gtg gac aag act cca ata aca agc tgg aca gac gtt 10199
Glu Trp Met Val Asp Lys Thr Pro Ile Thr Ser Trp Thr Asp Val
3355 3360 3365
ccg tat gtg gga aag cgg gag gac atc tgg tgt ggc aac ctc atc 10244
Pro Tyr Val Gly Lys Arg Glu Asp Ile Trp Cys Gly Asn Leu Ile
3370 3375 3380
gga acg cga tcc aga gca acc tgg gct gag aac atc tac gcg gcg 10289
Gly Thr Arg Ser Arg Ala Thr Trp Ala Glu Asn Ile Tyr Ala Ala
3385 3390 3395
ata aac cag gtt aga gct gtc att ggg aaa gaa aat tat gtt gac 10334
Ile Asn Gln Val Arg Ala Val Ile Gly Lys Glu Asn Tyr Val Asp
3400 3405 3410
tac atg acc tca ctc agg aga tac gaa gat gtc ttg atc cag gaa 10379
Tyr Met Thr Ser Leu Arg Arg Tyr Glu Asp Val Leu Ile Gln Glu
3415 3420 3425
gac agg gtc atc tagtgtgatt taaggtggaa aagcagatta tgtaaataat 10431
Asp Arg Val Ile
3430
gtaaatgaga aaatgcatgc atatggagtc aggccagcaa aagctgccac cggatactgg 10491
gtagacggtg ctgtctgcgt cccagtccca ggaggactgg gttaacaaat ctgacaacag 10551
aaagtgagaa aaccctcaga accgtctcgg aagcaggtcc ctgctcactg gaagttgaag 10611
gaccaacgtc aggccacaaa tttgtgccac tccgctgagg agtgcggcct gcgcagcccc 10671
aggaggactg ggttaccaaa gccgttgagc ccccacggcc caagcctcgt ctaggatgca 10731
atagacgagg tgtaaggact agaggttaga ggagaccccg tggaaacaac aacatgcggc 10791
ccaagccccc tccaagctgt agaggaggtg gaaggactag aggttagagg agaccccgca 10851
tttgcatcaa acagcatatt gacacctggg aatagactgg gagatcttct gctctatctc 10911
aacatcagct actaggcaca gagcgccgaa gtatgtagct ggtggtgagg aagaacacag 10971
gatct 10976
<210>2
<211>3432
<212>PRT
<213>日本脑炎病毒
<400>2
Met Thr Lys Lys Pro Gly Gly Pro Gly Lys Asn Arg Ala Ile Asn Met
1 5 10 15
Leu Lys Arg Gly Leu Pro Arg Val Phe Pro Leu Val Gly Val Lys Arg
20 25 30
Val Val Met Ser Leu Leu Asp Gly Arg Gly Pro Val Arg Phe Val Leu
35 40 45
Ala Leu Ile Thr Phe Phe Lys Phe Thr Ala Leu Ala Pro Thr Lys Ala
50 55 60
Leu Leu Gly Arg Trp Lys Ala Val Glu Lys Ser Val Ala Met Lys His
65 70 75 80
Leu Thr Ser Phe Lys Arg Glu Leu Gly Thr Leu Ile Asp Ala Val Asn
85 90 95
Lys Arg Gly Arg Lys Gln Asn Lys Arg Gly Gly Asn Glu Gly Ser Ile
100 105 110
Met Trp Leu Ala Ser Leu Ala Val Val Ile Ala Cys Ala Gly Ala Ile
115 120 125
Lys Leu Ser Asn Phe Gln Gly Lys Leu Leu Met Thr Ile Asn Asn Thr
130 135 140
Asp Ile Ala Asp Val Ile Val Ile Pro Thr Ser Lys Gly Glu Asn Arg
145 150 155 160
Cys Trp Val Arg Ala Ile Asp Val Gly Tyr Met Cys Glu Asp Thr Ile
165 170 175
Thr Tyr Glu Cys Pro Lys Leu Ala Met Gly Asn Asp Pro Glu Asp Val
180 185 190
Asp Cys Trp Cys Asp Asn Gln Glu Val Tyr Val Gln Tyr Gly Arg Cys
195 200 205
Thr Arg Thr Arg His Ser Lys Arg Ser Arg Arg Ser Val Ser Val Gln
210 215 220
Thr His Gly Glu Ser Ser Leu Val Asn Lys Lys Glu Ala Trp Leu Asp
225 230 235 240
Ser Thr Lys Ala Thr Arg Tyr Leu Met Lys Thr Glu Asn Trp Ile Ile
245 250 255
Arg Asn Pro Gly Tyr Ala Phe Leu Ala Ala Val Leu Gly Trp Met Leu
260 265 270
Gly Ser Thr Asn Gly Gln Arg Val Val Phe Thr Ile Leu Leu Leu Leu
275 280 285
Val Ala Pro Ala Tyr Ser Phe Asn Cys Leu Gly Met Gly Asn Arg Asp
290 295 300
Phe Ile Glu Gly Ala Ser Gly Ala Thr Trp Val Asp Leu Val Leu Glu
305 310 315 320
Gly Asp Ser Cys Leu Thr Ile Met Ala Asn Asp Lys Pro Thr Leu Asp
325 330 335
Val Arg Met Ile Asn Ile Glu Ala Ser Gln Leu Ala Glu Val Arg Ser
340 345 350
Tyr Cys Tyr His Ala Ser Val Thr Asp Ile Ser Thr Val Ala Arg Cys
355 360 365
Pro Thr Thr Gly Glu Ala His Asn Glu Lys Arg Ala Asp Ser Ser Tyr
370 375 380
Val Cys Lys Gln Gly Phe Thr Asp Arg Gly Trp Gly Asn Gly Cys Gly
385 390 395 400
Leu Phe Gly Lys Gly Ser Ile Asp Thr Cys Ala Lys Phe Ser Cys Thr
405 410 415
Ser Lys Ala Ile Gly Arg Thr Ile Gln Pro Glu Asn Ile Lys Tyr Lys
420 425 430
Val Gly Ile Phe Val His Gly Ala Thr Thr Ser Glu Asn His Gly Asn
435 440 445
Tyr Ser Ala Gln Val Gly Ala Ser Gln Ala Ala Lys Phe Thr Val Thr
450 455 460
Pro Asn Ala Pro Ser Ile Thr Leu Lys Leu Gly Asp Tyr Gly Glu Val
465 470 475 480
Thr Leu Asp Cys Glu Pro Arg Ser Gly Leu Asn Thr Glu Ala Phe Tyr
485 490 495
Val Met Thr Val Gly Ser Lys Ser Phe Leu Val His Arg Glu Trp Phe
500 505 510
His Asp Leu Ala Leu Pro Trp Thr Ser Pro Ser Ser Thr Ala Trp Arg
515 520 525
Asn Arg Glu Leu Leu Met Glu Phe Glu Glu Ala His Ala Thr Lys Gln
530 535 540
Ser Val Val Ala Leu Gly Ser Gln Glu Gly Gly Leu His Gln Ala Leu
545 550 555 560
Ala Gly Ala Ile Val Val Glu Tyr Ser Ser Ser Val Lys Leu Thr Ser
565 570 575
Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Ala Leu Lys Gly
580 585 590
Thr Thr Tyr Gly Met Cys Thr Glu Lys Phe Ser Phe Ala Lys Asn Pro
595 600 605
Ala Asp Thr Gly His Gly Thr Val Val Ile Glu Leu Ser Tyr Ser Gly
610 615 620
Ser Asp Gly Pro Cys Lys Ile Pro Ile Val Ser Val Ala Ser Leu Asn
625 630 635 640
Asp Met Thr Pro Val Gly Arg Leu Val Thr Val Asn Pro Phe Val Ala
645 650 655
Thr Ser Ser Ala Asn Ser Lys Val Leu Val Glu Met Glu Pro Pro Phe
660 665 670
Gly Asp Ser Tyr Ile Val Val Gly Arg Gly Asp Lys Gln Ile Asn His
675 680 685
His Trp His Lys Ala Gly Ser Thr Leu Gly Lys Ala Phe Ser Thr Thr
690 695 700
Leu Lys Gly Ala Gln Arg Leu Ala Ala Leu Gly Asp Thr Ala Trp Asp
705 710 715 720
Phe Gly Ser Ile Gly Gly Val Phe Asn Ser Ile Gly Lys Ala Val His
725 730 735
Gln Val Phe Gly Gly Ala Phe Arg Thr Leu Phe Gly Gly Met Ser Trp
740 745 750
Ile Thr Gln Gly Leu Met Gly Ala Leu Leu Leu Trp Met Gly Val Asn
755 760 765
Ala Arg Asp Arg Ser Ile Ala Leu Ala Phe Leu Ala Thr Gly Gly Val
770 775 780
Leu Val Phe Leu Ala Thr Asn Val His Ala Asp Thr Gly Cys Ala Ile
785 790 795 800
Asp Ile Thr Arg Lys Glu Met Arg Cys Gly Ser Gly Ile Phe Val His
805 810 815
Asn Asp Val Glu Ala Trp Val Asp Arg Tyr Lys Tyr Leu Pro Glu Thr
820 825 830
Pro Arg Ser Leu Ala Lys Ile Val His Lys Ala His Lys Glu Gly Val
835 840 845
Cys Gly Val Arg Ser Val Thr Arg Leu Glu His Gln Met Trp Glu Ala
850 855 860
Val Arg Asp Glu Leu Asn Val Leu Leu Lys Glu Asn Ala Val Asp Leu
865 870 875 880
Ser Val Val Val Asn Lys Pro Val Gly Arg Tyr Arg Ser Ala Pro Lys
885 890 895
Arg Leu Ser Met Thr Gln Glu Lys Phe Glu Met Gly Trp Lys Ala Trp
900 905 910
Gly Lys Ser Ile Leu Phe Ala Pro Glu Leu Ala Asn Ser Thr Phe Val
915 920 925
Val Asp Gly Pro Glu Thr Lys Glu Cys Pro Asp Glu His Arg Ala Trp
930 935 940
Asn Ser Met Gln Ile Glu Asp Phe Gly Phe Gly Ile Thr Ser Thr Arg
945 950 955 960
Val Trp Leu Lys Ile Arg Glu Glu Ser Thr Asp Glu Cys Asp Gly Ala
965 970 975
Ile Ile Gly Thr Ala Val Lys Gly His Val Ala Val His Ser Asp Leu
980 985 990
Ser Tyr Trp Ile Glu Ser Arg Tyr Asn Asp Thr Trp Lys Leu Glu Arg
995 1000 1005
Ala Val Phe Gly Glu Val Lys Ser Cys Thr Trp Pro Glu Thr His
1010 1015 1020
Thr Leu Trp Gly Asp Gly Val Glu Glu Ser Glu Leu Ile Ile Pro
1025 1030 1035
His Thr Ile Ala Gly Pro Lys Ser Lys His Asn Arg Arg Glu Gly
1040 1045 1050
Tyr Lys Thr Gln Asn Gln Gly Pro Trp Asp Glu Asn Gly Ile Val
1055 1060 1065
Leu Asp Phe Asp Tyr Cys Pro Gly Thr Lys Val Thr Ile Thr Glu
1070 1075 1080
Asp Cys Gly Lys Arg Gly Pro Ser Val Arg Thr Thr Thr Asp Ser
1085 1090 1095
Gly Lys Leu Ile Thr Asp Trp Cys Cys Arg Ser Cys Ser Leu Pro
1100 1105 1110
Pro Leu Arg Phe Arg Thr Glu Asn Gly Cys Trp Tyr Gly Met Glu
1115 1120 1125
Ile Arg Pro Val Arg His Asp Glu Thr Thr Leu Val Arg Ser Gln
1130 1135 1140
Val Asp Ala Phe Asn Gly Glu Met Val Asp Pro Phe Gln Leu Gly
1145 1150 1155
Leu Leu Val Met Phe Leu Ala Thr Gln Glu Val Leu Arg Lys Arg
1160 1165 1170
Trp Thr Ala Arg Leu Thr Ile Pro Ala Val Leu Gly Ala Leu Leu
1175 1180 1185
Val Leu Met Leu Gly Gly Ile Thr Tyr Thr Asp Leu Ala Arg Tyr
1190 1195 1200
Val Val Leu Val Ala Ala Ser Phe Ala Glu Ala Asn Ser Gly Gly
1205 1210 1215
Asp Val Leu His Leu Ala Leu Ile Ala Val Phe Lys Ile Gln Pro
1220 1225 1230
Ala Phe Leu Val Met Asn Met Leu Ser Thr Arg Trp Thr Asn Gln
1235 1240 1245
Glu Asn Val Val Leu Val Leu Gly Ala Ala Phe Phe Gln Leu Ala
1250 1255 1260
Ser Val Asp Leu Gln Ile Gly Val His Gly Ile Leu Asn Ala Ala
1265 1270 1275
Ala Ile Ala Trp Met Ile Val Arg Ala Ile Thr Phe Pro Thr Thr
1280 1285 1290
Ser Ser Val Thr Met Pro Val Leu Ala Leu Leu Thr Pro Gly Met
1295 1300 1305
Arg Ala Leu Tyr Leu Asp Thr Tyr Arg Ile Ile Leu Leu Val Ile
1310 1315 1320
Gly Ile Cys Ser Leu Leu Gln Glu Arg Lys Lys Thr Met Ala Lys
1325 1330 1335
Lys Lys Gly Ala Val Leu Leu Gly Leu Ala Leu Thr Ser Thr Gly
1340 1345 1350
Trp Phe Ser Pro Thr Thr Ile Ala Ala Gly Leu Met Val Cys Asn
1355 1360 1365
Pro Asn Lys Lys Arg Gly Trp Pro Ala Thr Glu Phe Leu Ser Ala
1370 1375 1380
Val Gly Leu Met Phe Ala Ile Val Gly Gly Leu Ala Glu Leu Asp
1385 1390 1395
Ile Glu Ser Met Ser Ile Pro Phe Met Leu Ala Gly Leu Met Ala
1400 1405 1410
Val Ser Tyr Val Val Ser Gly Lys Ala Thr Asp Met Trp Leu Glu
1415 1420 1425
Arg Ala Ala Asp Ile Ser Trp Glu Met Asp Ala Ala Ile Thr Gly
1430 1435 1440
Ser Ser Arg Arg Leu Asp Val Lys Leu Asp Asp Asp Gly Asp Phe
1445 1450 1455
His Leu Ile Asp Asp Pro Gly Val Pro Trp Lys Val Trp Val Leu
1460 1465 1470
Arg Met Ser Cys Ile Gly Leu Ala Ala Leu Thr Pro Trp Ala Ile
1475 1480 1485
Val Pro Ala Ala Phe Gly Tyr Trp Leu Thr Leu Lys Thr Thr Lys
1490 1495 1500
Arg Gly Gly Val Phe Trp Asp Thr Pro Ser Pro Lys Pro Cys Ser
1505 1510 1515
Lys Gly Asp Thr Thr Thr Gly Val Tyr Arg Ile Met Ala Arg Gly
1520 1525 1530
Ile Leu Gly Thr Tyr Gln Ala Gly Val Gly Val Met Tyr Glu Asn
1535 1540 1545
Val Phe His Thr Leu Trp His Thr Thr Arg Gly Ala Ala Ile Met
1550 1555 1560
Ser Gly Glu Gly Lys Leu Thr Pro Tyr Trp Gly Ser Val Lys Glu
1565 1570 1575
Asp Arg Ile Ala Tyr Gly Gly Pro Trp Arg Phe Asp Arg Lys Trp
1580 1585 1590
Asn Gly Thr Asp Asp Val Gln Val Ile Val Val Glu Pro Gly Lys
1595 1600 1605
Ala Ala Val Asn Ile Gln Thr Lys Pro Gly Val Phe Arg Thr Pro
1610 1615 1620
Phe Gly Glu Val Gly Ala Val Ser Leu Asp Tyr Pro Arg Gly Thr
1625 1630 1635
Ser Gly Ser Pro Ile Leu Asp Ser Asn Gly Asp Ile Ile Gly Leu
1640 1645 1650
Tyr Gly Asn Gly Val Glu Leu Gly Asp Gly Ser Tyr Val Ser Ala
1655 1660 1665
Ile Val Gln Gly Asp Arg Gln Glu Glu Pro Val Pro Glu Ala Tyr
1670 1675 1680
Thr Pro Asn Met Leu Arg Lys Arg Gln Met Thr Val Leu Asp Leu
1685 1690 1695
His Pro Gly Ser Gly Lys Thr Lys Lys Ile Leu Pro Gln Ile Ile
1700 1705 1710
Lys Asp Ala Ile Gln Gln Arg Leu Arg Thr Ala Val Leu Ala Pro
1715 1720 1725
Thr Arg Val Val Ala Ala Glu Met Ala Glu Ala Leu Arg Gly Leu
1730 1735 1740
Pro Val Arg Tyr Gln Thr Ser Ala Val Gln Arg Glu His Gln Gly
1745 1750 1755
Asn Glu Ile Val Asp Val Met Cys His Ala Thr Leu Thr His Arg
1760 1765 1770
Leu Met Ser Pro Asn Arg Val Pro Asn Tyr Asn Leu Phe Val Met
1775 1780 1785
Asp Glu Ala His Phe Thr Asp Pro Ala Ser Ile Ala Ala Arg Gly
1790 1795 1800
Tyr Ile Ala Thr Lys Val Glu Leu Gly Glu Ala Ala Ala Ile Phe
1805 1810 1815
Met Thr Ala Thr Pro Pro Gly Thr Thr Asp Pro Phe Pro Asp Ser
1820 1825 1830
Asn Ala Pro Ile His Asp Leu Gln Asp Glu Ile Pro Asp Arg Ala
1835 1840 1845
Trp Ser Ser Gly Tyr Glu Trp Ile Thr Glu Tyr Ala Gly Lys Thr
1850 1855 1860
Val Trp Phe Val Ala Ser Val Lys Met Gly Asn Glu Ile Ala Met
1865 1870 1875
Cys Leu Gln Arg Ala Gly Lys Lys Val Ile Gln Leu Asn Arg Lys
1880 1885 1890
Ser Tyr Asp Thr Glu Tyr Pro Lys Cys Lys Asn Gly Asp Trp Asp
1895 1900 1905
Phe Val Ile Thr Thr Asp Ile Ser Glu Met Gly Ala Asn Phe Gly
1910 1915 1920
Ala Ser Arg Val Ile Asp Cys Arg Lys Ser Val Lys Pro Thr Ile
1925 1930 1935
Leu Glu Glu Gly Glu Gly Arg Val Ile Leu Gly Asn Pro Ser Pro
1940 1945 1950
Ile Thr Ser Ala Ser Ala Ala Gln Arg Arg Gly Arg Val Gly Arg
1955 1960 1965
Asn Pro Asn Gln Val Gly Asp Glu Tyr His Tyr Gly Gly Ala Thr
1970 1975 1980
Ser Glu Asp Asp Ser Asn Leu Ala His Trp Thr Glu Ala Lys Ile
1985 1990 1995
Met Leu Asp Asn Ile His Met Pro Asn Gly Leu Val Ala Gln Leu
2000 2005 2010
Tyr Gly Pro Glu Arg Glu Lys Ala Phe Thr Met Asp Gly Glu Tyr
2015 2020 2025
Arg Leu Arg Gly Glu Glu Lys Lys Asn Phe Leu Glu Leu Leu Arg
2030 2035 2040
Thr Ala Asp Leu Pro Val Trp Leu Ala Tyr Lys Val Ala Ser Asn
2045 2050 2055
Gly Ile Gln Tyr Thr Asp Arg Lys Trp Cys Phe Asp Gly Pro Arg
2060 2065 2070
Thr Asn Ala Ile Leu Glu Asp Asn Thr Glu Val Glu Ile Val Thr
2075 2080 2085
Arg Met Gly Glu Arg Lys Ile Leu Lys Pro Arg Trp Leu Asp Ala
2090 2095 2100
Arg Val Tyr Ala Asp His Gln Ala Leu Lys Trp Phe Lys Asp Phe
2105 2110 2115
Ala Ala Gly Lys Arg Ser Ala Val Ser Phe Ile Glu Val Leu Gly
2120 2125 2130
Arg Met Pro Glu His Phe Met Gly Lys Thr Arg Glu Ala Leu Asp
2135 2140 2145
Thr Met Tyr Leu Val Ala Thr Ala Glu Lys Gly Gly Lys Ala His
2150 2155 2160
Arg Met Ala Leu Glu Glu Leu Pro Asp Ala Leu Glu Thr Ile Thr
2165 2170 2175
Leu Ile Val Ala Ile Thr Val Met Thr Gly Gly Phe Phe Leu Leu
2180 2185 2190
Met Met Gln Arg Lys Gly Ile Gly Lys Met Gly Leu Gly Ala Leu
2195 2200 2205
Val Leu Thr Leu Ala Thr Phe Phe Leu Trp Ala Ala Glu Val Pro
2210 2215 2220
Gly Thr Lys Ile Ala Gly Thr Leu Leu Ile Ala Leu Leu Leu Met
2225 2230 2235
Val Val Leu Ile Pro Glu Pro Glu Lys Gln Arg Ser Gln Thr Asp
2240 2245 2250
Asn Gln Leu Ala Val Phe Leu Ile Cys Val Leu Thr Val Val Gly
2255 2260 2265
Val Val Ala Ala Asn Glu Tyr Gly Met Leu Glu Lys Thr Lys Ala
2270 2275 2280
Asp Leu Lys Ser Met Phe Gly Gly Lys Thr Gln Ala Ser Gly Leu
2285 2290 2295
Thr Gly Leu Pro Ser Met Ala Leu Asp Leu Arg Pro Ala Thr Ala
2300 2305 2310
Trp Ala Leu Tyr Gly Gly Ser Thr Val Val Leu Thr Pro Leu Leu
2315 2320 2325
Lys His Leu Ile Thr Ser Glu Tyr Val Thr Thr Ser Leu Ala Ser
2330 2335 2340
Ile Asn Ser Gln Ala Gly Ser Leu Phe Val Leu Pro Arg Gly Val
2345 2350 2355
Pro Phe Thr Asp Leu Asp Leu Thr Val Gly Leu Val Phe Leu Gly
2360 2365 2370
Cys Trp Gly Gln Ile Thr Leu Thr Thr Phe Leu Thr Ala Met Val
2375 2380 2385
Leu Ala Thr Leu His Tyr Gly Tyr Met Leu Pro Gly Trp Gln Ala
2390 2395 2400
Glu Ala Leu Arg Ala Ala Gln Arg Arg Thr Ala Ala Gly Ile Met
2405 2410 2415
Lys Asn Ala Val Val Asp Gly Met Val Ala Thr Asp Val Pro Glu
2420 2425 2430
Leu Glu Arg Thr Thr Pro Leu Met Gln Lys Lys Val Gly Gln Val
2435 2440 2445
Leu Leu Ile Gly Val Ser Val Ala Ala Phe Leu Val Asn Pro Lys
2450 2455 2460
Ile Thr Thr Val Arg Glu Ala Gly Val Leu Val Thr Ala Ala Thr
2465 2470 2475
Leu Ser Leu Trp Asp Asn Gly Ala Ser Ala Val Trp Asn Ser Thr
2480 2485 2490
Thr Ala Thr Gly Leu Cys His Val Met Arg Gly Ser Tyr Leu Ala
2495 2500 2505
Gly Gly Ser Ile Ala Trp Thr Leu Ile Lys Asn Ala Asp Lys Pro
2510 2515 2520
Ser Leu Lys Arg Gly Arg Pro Gly Gly Arg Thr Leu Gly Glu Gln
2525 2530 2535
Trp Lys Glu Lys Leu Asn Ala Met Ser Arg Glu Glu Phe Phe Lys
2540 2545 2550
Tyr Arg Arg Glu Ala Ile Ile Glu Val Asp Arg Thr Glu Ala Arg
2555 2560 2565
Arg Ala Arg Arg Glu Asn Asn Ile Val Gly Gly His Pro Val Ser
2570 2575 2580
Arg Gly Ser Ala Lys Leu Arg Trp Leu Val Glu Lys Gly Phe Val
2585 2590 2595
Ser Pro Ile Gly Lys Val Ile Asp Leu Gly Cys Gly Arg Gly Gly
2600 2605 2610
Trp Ser Tyr Tyr Ala Ala Thr Leu Lys Lys Val Gln Glu Val Arg
2615 2620 2625
Gly Tyr Thr Lys Gly Gly Ala Gly His Glu Glu Pro Met Leu Met
2630 2635 2640
Gln Ser Tyr Gly Trp Asn Leu Val Ser Met Lys Ser Gly Val Asp
2645 2650 2655
Val Phe Tyr Lys Pro Ser Glu Pro Ser Asp Thr Leu Phe Cys Asp
2660 2665 2670
Ile Gly Glu Ser Ser Pro Ser Pro Glu Val Glu Glu Gln Arg Thr
2675 2680 2685
Leu Arg Val Leu Glu Met Thr Ser Asp Trp Leu His Arg Gly Pro
2690 2695 2700
Arg Glu Phe Cys Ile Lys Val Leu Cys Pro Tyr Met Pro Lys Val
2705 2710 2715
Ile Glu Lys Met Glu Val Leu Gln Arg Arg Phe Gly Gly Gly Leu
2720 2725 2730
Val Arg Leu Pro Leu Ser Arg Asn Ser Asn His Glu Met Tyr Trp
2735 2740 2745
Val Ser Gly Ala Ala Gly Asn Val Val Hi s Ala Val Asn Met Thr
2750 2755 2760
Ser Gln Val Leu Leu Gly Arg Met Asp Arg Thr Val Trp Arg Gly
2765 2770 2775
Pro Lys Tyr Glu Glu Asp Val Asn Leu Gly Ser Gly Thr Arg Ala
2780 2785 2790
Val Gly Lys Gly Glu Val His Ser Asn Gln Glu Lys Ile Lys Lys
2795 2800 2805
Arg Ile Gln Lys Leu Lys Glu Glu Phe Ala Thr Thr Trp His Lys
2810 2815 2820
Asp Pro Glu His Pro Tyr Arg Thr Trp Thr Tyr His Gly Ser Tyr
2825 2830 2835
Glu Val Lys Ala Thr Gly Ser Ala Ser Ser Leu Val Asn Gly Val
2840 2845 2850
Val Lys Leu Met Ser Lys Pro Trp Asp Ala Ile Ala Asn Val Thr
2855 2860 2865
Thr Met Ala Met Thr Asp Thr Thr Pro Phe Gly Gln Gln Arg Val
2870 2875 2880
Phe Lys Glu Lys Val Asp Thr Lys Ala Pro Glu Pro Pro Ala Gly
2885 2890 2895
Ala Lys Glu Val Leu Asn Glu Thr Thr Asn Trp Leu Trp Ala Tyr
2900 2905 2910
Leu Ser Arg Glu Lys Arg Pro Arg Leu Cys Thr Lys Glu Glu Phe
2915 2920 2925
Ile Lys Lys Val Asn Ser Asn Ala Ala Leu Gly Ala Val Phe Ala
2930 2935 2940
Glu Gln Asn Gln Trp Ser Thr Ala Arg Glu Ala Val Asp Asp Pro
2945 2950 2955
Arg Phe Trp Glu Met Val Asp Glu Glu Arg Glu Asn His Leu Arg
2960 2965 2970
Gly Glu Cys His Thr Cys Ile Tyr Asn Met Met Gly Lys Arg Glu
2975 2980 2985
Lys Lys Pro Gly Glu Phe Gly Lys Ala Lys Gly Ser Arg Ala Ile
2990 2995 3000
Trp Phe Met Trp Leu Gly Ala Arg Tyr Leu Glu Phe Glu Ala Leu
3005 3010 3015
Gly Phe Leu Asn Glu Asp His Trp Leu Ser Arg Glu Asn Ser Gly
3020 3025 3030
Gly Gly Val Glu Gly Ser Gly Val Gln Lys Leu Gly Tyr Ile Leu
3035 3040 3045
Arg Asp Ile Ala Gly Lys Gln Gly Gly Lys Met Tyr Ala Asp Asp
3050 3055 3060
Thr Ala Gly Trp Asp Thr Arg Ile Thr Arg Thr Asp Leu Glu Asn
3065 3070 3075
Glu Ala Lys Val Leu Glu Leu Leu Asp Gly Glu His Arg Met Leu
3080 3085 3090
Ala Arg Ala Ile Ile Glu Leu Thr Tyr Arg His Lys Val Val Lys
3095 3100 3105
Val Met Arg Pro Ala Ala Glu Gly Lys Thr Val Met Asp Val Ile
3110 3115 3120
Ser Arg Glu Asp Gln Arg Gly Ser Gly Gln Val Val Thr Tyr Ala
3125 3130 3135
Leu Asn Thr Phe Thr Asn Ile Ala Val Gln Leu Val Arg Leu Met
3140 3145 3150
Glu Ala Glu Gly Val Ile Gly Pro Gln His Leu Glu Gln Leu Pro
3155 3160 3165
Arg Lys Asn Lys Ile Ala Val Arg Thr Trp Leu Phe Glu Asn Gly
3170 3175 3180
Glu Glu Arg Val Ser Arg Met Ala Ile Ser Gly Asp Asp Cys Val
3185 3190 3195
Val Lys Pro Leu Asp Asp Arg Phe Ala Thr Ala Leu His Phe Leu
3200 3205 3210
Asn Ala Met Ser Lys Val Arg Lys Asp Ile Gln Glu Trp Lys Pro
3215 3220 3225
Ser His Gly Trp His Asp Trp Gln Gln Val Pro Phe Cys Ser Asn
3230 3235 3240
His Phe Gln Glu Ile Val Met Lys Asp Gly Arg Ser Ile Val Val
3245 3250 3255
Pro Cys Arg Gly Gln Asp Glu Leu Ile Gly Arg Ala Arg Ile Ser
3260 3265 3270
Pro Gly Ala Gly Trp Asn Val Lys Asp Thr Ala Cys Leu Ala Lys
3275 3280 3285
Ala Tyr Ala Gln Met Trp Leu Leu Leu Tyr Phe His Arg Arg Asp
3290 3295 3300
Leu Arg Leu Met Ala Asn Ala Ile Cys Ser Ala Val Pro Val Asp
3305 3310 3315
Trp Val Pro Thr Gly Arg Thr Ser Trp Ser Ile His Ser Lys Gly
3320 3325 3330
Glu Trp Met Thr Thr Glu Asp Met Leu Gln Val Trp Asn Arg Val
3335 3340 3345
Trp Ile Glu Glu Asn Glu Trp Met Val Asp Lys Thr Pro Ile Thr
3350 3355 3360
Ser Trp Thr Asp Val Pro Tyr Val Gly Lys Arg Glu Asp Ile Trp
3365 3370 3375
Cys Gly Asn Leu Ile Gly Thr Arg Ser Arg Ala Thr Trp Ala Glu
3380 3385 3390
Asn Ile Tyr Ala Ala Ile Asn Gln Val Arg Ala Val Ile Gly Lys
3395 3400 3405
Glu Asn Tyr Val Asp Tyr Met Thr Ser Leu Arg Arg Tyr Glu Asp
3410 3415 3420
Val Leu Ile Gln Glu Asp Arg Val Ile
3425 3430
<210>3
<211>10976
<212>DNA
<213>日本脑炎病毒
<220>
<221>CDS
<222>(96)..(10391)
<400>3
agaagtttat ctgtgtgaac ttcttggctt agtatcgttg agaagaatcg agagattagt 60
gcagtttaaa cagtttttta gaacggaaga taacc atg act aaa aaa cca gga 113
Met Thr Lys Lys Pro Gly
1 5
ggg ccc ggt aaa aac cgg gct atc aat atg ctg aaa cgc ggc cta ccc 161
Gly Pro Gly Lys Asn Arg Ala Ile Asn Met Leu Lys Arg Gly Leu Pro
10 15 20
cgc gta ttc cca cta gtg gga gtg aag agg gta gta atg agc ttg ttg 209
Arg Val Phe Pro Leu Val Gly Val Lys Arg Val Val Met Ser Leu Leu
25 30 35
gac ggc aga ggg cca gta cgt ttc gtg ctg gct ctt atc acg ttc ttc 257
Asp Gly Arg Gly Pro Val Arg Phe Val Leu Ala Leu Ile Thr Phe Phe
40 45 50
aag ttt aca gca tta gcc ccg acc aag gcg ctt tta ggc cga tgg aaa 305
Lys Phe Thr Ala Leu Ala Pro Thr Lys Ala Leu Leu Gly Arg Trp Lys
55 60 65 70
gca gtg gaa aag agt gta gca atg aaa cat ctc act agt ttc aaa cga 353
Ala Val Glu Lys Ser Val Ala Met Lys His Leu Thr Ser Phe Lys Arg
75 80 85
gaa ctt gga aca ctc att gac gcc gtg aac aag cgg ggc aga aag caa 401
Glu Leu Gly Thr Leu Ile Asp Ala Val Asn Lys Arg Gly Arg Lys Gln
90 95 100
aac aaa aga gga gga aat gaa ggc tca atc atg tgg ctt gcg agc ttg 449
Asn Lys Arg Gly Gly Asn Glu Gly Ser Ile Met Trp Leu Ala Ser Leu
105 110 115
gca gtt gtc ata gct tgt gca gga gcc atg aag ttg tca aat ttc cag 497
Ala Val Val Ile Ala Cys Ala Gly Ala Met Lys Leu Ser Asn Phe Gln
120 125 130
ggg aag ctt ttg atg acc att aac aac acg gac att gca gac gtt atc 545
Gly Lys Leu Leu Met Thr Ile Asn Asn Thr Asp Ile Ala Asp Val Ile
135 140 145 150
gta att ccc acc tca aaa gga gag aac aga tgc tgg gtc cgg gca atc 593
Val Ile Pro Thr Ser Lys Gly Glu Asn Arg Cys Trp Val Arg Ala Ile
155 160 165
gac gtc ggc tac atg tgt gag gac act atc acg tac gaa tgt cct aag 641
Asp Val Gly Tyr Met Cys Glu Asp Thr Ile Thr Tyr Glu Cys Pro Lys
170 175 180
ctt gcc atg ggc aat gat cca gag gat gtg gac tgc tgg tgt gac aac 689
Leu Ala Met Gly Asn Asp Pro Glu Asp Val Asp Cys Trp Cys Asp Asn
185 190 195
caa gaa gtc tac gtc caa tat gga cgg tgc acg cgg acc agg cat tcc 737
Gln Glu Val Tyr Val Gln Tyr Gly Arg Cys Thr Arg Thr Arg His Ser
200 205 210
aag cga agc agg aga tcc gtg tcg gtc caa aca cat ggg gag agt tca 785
Lys Arg Ser Arg Arg Ser Val Ser Val Gln Thr His Gly Glu Ser Ser
215 220 225 230
cta gtg aat aaa aaa gag gct tgg ctg gat tca acg aaa gcc aca cga 833
Leu Val Asn Lys Lys Glu Ala Trp Leu Asp Ser Thr Lys Ala Thr Arg
235 240 245
tat ctc atg aaa act gag aac tgg atc ata agg aat cct ggc tat gct 881
Tyr Leu Met Lys Thr Glu Asn Trp Ile Ile Arg Asn Pro Gly Tyr Ala
250 255 260
ttc ctg gcg gcg gta ctc ggc tgg atg ctt ggc agt aac aac ggt caa 929
Phe Leu Ala Ala Val Leu Gly Trp Met Leu Gly Ser Asn Asn Gly Gln
265 270 275
cgc gtg gta ttc acc atc ctc ctg ctg ctg gtc gct ccg gct tac agt 977
Arg Val Val Phe Thr Ile Leu Leu Leu Leu Val Ala Pro Ala Tyr Ser
280 285 290
ttt aat tgt ctg gga atg ggc aat cgt gac ttc ata gaa gga gcc agt 1025
Phe Asn Cys Leu Gly Met Gly Asn Arg Asp Phe Ile Glu Gly Ala Ser
295 300 305 310
gga gcc act tgg gtg gac ttg gtg cta gaa gga gat agc tgc ttg aca 1073
Gly Ala Thr Trp Val Asp Leu Val Leu Glu Gly Asp Ser Cys Leu Thr
315 320 325
att atg gca aac gac aaa cca aca ttg gac gtc cgc atg atc aac atc 1121
Ile Met Ala Asn Asp Lys Pro Thr Leu Asp Val Arg Met Ile Asn Ile
330 335 340
gaa gct agc caa ctt gct gag gtt aga agt tac tgt tat cat gct tca 1169
Glu Ala Ser Gln Leu Ala Glu Val Arg Ser Tyr Cys Tyr His Ala Ser
345 350 355
gtc act gac atc tcg acg gtg gct cgg tgc ccc acg act gga gaa gcc 1217
Val Thr Asp Ile Ser Thr Val Ala Arg Cys Pro Thr Thr Gly Glu Ala
360 365 370
cac aac gag aag cga gct gat agt agc tat gtg tgc aaa caa ggc ttc 1265
His Asn Glu Lys Arg Ala Asp Ser Ser Tyr Val Cys Lys Gln Gly Phe
375 380 385 390
act gat cgt ggg tgg ggc aac gga tgt gga ctt ttc ggg aag gga agc 1313
Thr Asp Arg Gly Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser
395 400 405
att gac aca tgt gca aaa ttc tcc tgc acc agc aaa gcg att ggg aga 1361
Ile Asp Thr Cys Ala Lys Phe Ser Cys Thr Ser Lys Ala Ile Gly Arg
410 415 420
aca atc cag cca gaa aac atc aaa tac aaa gtt ggc att ttt gtg cat 1409
Thr Ile Gln Pro Glu Asn Ile Lys Tyr Lys Val Gly Ile Phe Val His
425 430 435
gga gcc act act tcg gaa aac cat ggg aat tat tca gcg caa gtt ggg 1457
Gly Ala Thr Thr Ser Glu Asn His Gly Asn Tyr Ser Ala Gln Val Gly
440 445 450
gcg tcc cag gcg gca aag ttc aca gta aca ccc aat gct cct tcg ata 1505
Ala Ser Gln Ala Ala Lys Phe Thr Val Thr Pro Asn Ala Pro Ser Ile
455 460 465 470
acc ctc aaa ctt ggt gac tac gga gaa gtc aca ctg gac tgt gag cca 1553
Thr Leu Lys Leu Gly Asp Tyr Gly Glu Val Thr Leu Asp Cys Glu Pro
475 480 485
agg agt gga ctg aac act gaa gcg ttt tac gtc atg acc gtg ggg tca 1601
Arg Ser Gly Leu Asn Thr Glu Ala Phe Tyr Val Met Thr Val Gly Ser
490 495 500
aag tca ttt ctg gtc cat agg gaa tgg ttt cat gac ctc gct ctc ccc 1649
Lys Ser Phe Leu Val His Arg Glu Trp Phe His Asp Leu Ala Leu Pro
505 510 515
tgg acg tcc cct tcg agc aca gcg tgg aga aac aga gaa ctc ctc atg 1697
Trp Thr Ser Pro Ser Ser Thr Ala Trp Arg Asn Arg Glu Leu Leu Met
520 525 530
gag ttt gaa gag gcg cac gcc aca aaa cag tcc gtt gtt gct ctt ggg 1745
Glu Phe Glu Glu Ala His Ala Thr Lys Gln Ser Val Val Ala Leu Gly
535 540 545 550
tca cag gaa gga ggc ctc cat cag gcg ttg gca gga gcc atc gtg gtg 1793
Ser Gln Glu Gly Gly Leu His Gln Ala Leu Ala Gly Ala Ile Val Val
555 560 565
gag tac tca agt tca gtg aag tta aca tca ggc cac ctg aaa tgt agg 1841
Glu Tyr Ser Ser Ser Val Lys Leu Thr Ser Gly His Leu Lys Cys Arg
570 575 580
ctg aaa atg gac aaa ctg gct ctg aaa ggc aca acc tat ggc atg tgc 1889
Leu Lys Met Asp Lys Leu Ala Leu Lys Gly Thr Thr Tyr Gly Met Cys
585 590 595
aca gaa aaa ttc tcc ttc gcg aaa aat ccg gcg gac act ggt cac ggg 1937
Thr Glu Lys Phe Ser Phe Ala Lys Asn Pro Ala Asp Thr Gly His Gly
600 605 610
aca gtt gtc att gaa ctc tcc tac tct ggg agt gat ggc ccc tgc aaa 1985
Thr Val Val Ile Glu Leu Ser Tyr Ser Gly Ser Asp Gly Pro Cys Lys
615 620 625 630
att ccg att gtc tcc gtt gcg agc ctc aat gac atg acc ccc gtc ggg 2033
Ile Pro Ile Val Ser Val Ala Ser Leu Asn Asp Met Thr Pro Val Gly
635 640 645
cgg ctg gtg aca gtg aac ccc ttc gtc gcg act tcc agt gcc aat tca 2081
Arg Leu Val Thr Val Asn Pro Phe Val Ala Thr Ser Ser Ala Asn Ser
650 655 660
aag gtg ctg gtc gag atg gaa ccc ccc ttc gga gac tcc tac atc gta 2129
Lys Val Leu Val Glu Met Glu Pro Pro Phe Gly Asp Ser Tyr Ile Val
665 670 675
gtt gga cgg gga gac aag cag atc aac cac cat tgg cat aaa gct gga 2177
Val Gly Arg Gly Asp Lys Gln Ile Asn His His Trp His Lys Ala Gly
680 685 690
agc acg ctg ggc aaa gcc ttt tca aca act ttg aag gga gct cag aga 2225
Ser Thr Leu Gly Lys Ala Phe Ser Thr Thr Leu Lys Gly Ala Gln Arg
695 700 705 710
ctg gca gcg ttg ggt gac aca gcc tgg gac ttt ggc tcc att gga ggg 2273
Leu Ala Ala Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Ile Gly Gly
715 720 725
gtc ttc aac tcc ata gga aaa gcc gtt cac caa gtg ttt ggt ggt gcc 2321
Val Phe Asn Ser Ile Gly Lys Ala Val His Gln Val Phe Gly Gly Ala
730 735 740
ttc aga aca ctc ttc ggg gga atg tct tgg atc aca caa ggg cta atg 2369
Phe Arg Thr Leu Phe Gly Gly Met Ser Trp Ile Thr Gln Gly Leu Met
745 750 755
ggt gcc cta cta ctc tgg atg ggc gtc aac gca cga gac cga tca att 2417
Gly Ala Leu Leu Leu Trp Met Gly Val Asn Ala Arg Asp Arg Ser Ile
760 765 770
gct ttg gcc ttc tta gcc aca gga ggt gtg ctc gtg ttc tta gcg acc 2465
Ala Leu Ala Phe Leu Ala Thr Gly Gly Val Leu Val Phe Leu Ala Thr
775 780 785 790
aat gtg cat gct gac act gga tgt gcc att gac atc aca aga aaa gag 2513
Asn Val His Ala Asp Thr Gly Cys Ala Ile Asp Ile Thr Arg Lys Glu
795 800 805
atg agg tgt gga agt ggc atc ttc gtg cac aac gac gtg gaa gcc tgg 2561
Met Arg Cys Gly Ser Gly Ile Phe Val His Asn Asp Val Glu Ala Trp
810 815 820
gtg gat agg tat aaa tat ttg cca gaa acg ccc aga tcc cta gca aag 2609
Val Asp Arg Tyr Lys Tyr Leu Pro Glu Thr Pro Arg Ser Leu Ala Lys
825 830 835
atc gtc cac aaa gcg cac aag gaa ggc gtg tgc gga gtc aga tct gtc 2657
Ile Val His Lys Ala His Lys Glu Gly Val Cys Gly Val Arg Ser Val
840 845 850
act aga ctg gag cat caa atg tgg gaa gcc gta cgg gat gaa ttg aac 2705
Thr Arg Leu Glu His Gln Met Trp Glu Ala Val Arg Asp Glu Leu Asn
855 860 865 870
gtc ctg ctc aaa gag aat gca gtg gac ctc agt gtg gtt gtg aac aag 2753
Val Leu Leu Lys Glu Asn Ala Val Asp Leu Ser Val Val Val Asn Lys
875 880 885
ccc gtg ggg aga tat cgc tca gcc cct aaa cgc ctg tcc atg acg caa 2801
Pro Val Gly Arg Tyr Arg Ser Ala Pro Lys Arg Leu Ser Met Thr Gln
890 895 900
gag aag ttt gaa atg ggc tgg aaa gca tgg gga aaa agc att ctc ttt 2849
Glu Lys Phe Glu Met Gly Trp Lys Ala Trp Gly Lys Ser Ile Leu Phe
905 910 915
gcc ccg gaa ttg gcc aac tcc aca ttt gtc gta gat gga cct gag aca 2897
Ala Pro Glu Leu Ala Asn Ser Thr Phe Val Val Asp Gly Pro Glu Thr
920 925 930
aag gaa tgc cct gat gag cac aga gct tgg aac agc atg caa atc gaa 2945
Lys Glu Cys Pro Asp Glu His Arg Ala Trp Asn Ser Met Gln Ile Glu
935 940 945 950
gac ttc ggc ttt ggc atc aca tca acc cgt gtg tgg ctg aag att aga 2993
Asp Phe Gly Phe Gly Ile Thr Ser Thr Arg Val Trp Leu Lys Ile Arg
955 960 965
gag gag agc act gac gag tgt gat gga gcg atc ata ggt acg gct gtc 3041
Glu Glu Ser Thr Asp Glu Cys Asp Gly Ala Ile Ile Gly Thr Ala Val
970 975 980
aaa gga cat gtg gca gtc cat agt gac ttg tcg tac tgg att gag agt 3089
Lys Gly His Val Ala Val His Ser Asp Leu Ser Tyr Trp Ile Glu Ser
985 990 995
cgc tac aac gac aca tgg aaa ctt gag agg gca gtc ttt gga gaa 3134
Arg Tyr Asn Asp Thr Trp Lys Leu Glu Arg Ala Val Phe Gly Glu
1000 1005 1010
gtt aaa tcc tgc act tgg cca gag aca cac acc cta tgg gga gat 3179
Val Lys Ser Cys Thr Trp Pro Glu Thr His Thr Leu Trp Gly Asp
1015 1020 1025
ggt gtt gag gaa agt gaa ctc atc atc ccg cac acc ata gcc gga 3224
Gly Val Glu Glu Ser Glu Leu Ile Ile Pro His Thr Ile Ala Gly
1030 1035 1040
cca aaa agc aag cat aat cgg agg gaa ggg tat aag aca caa aac 3269
Pro Lys Ser Lys His Asn Arg Arg Glu Gly Tyr Lys Thr Gln Asn
1045 1050 1055
cag gga cct tgg gac gag aat ggc ata gtc ttg gac ttt gac tat 3314
Gln Gly Pro Trp Asp Glu Asn Gly Ile Val Leu Asp Phe Asp Tyr
1060 1065 1070
tgc cca ggg aca aaa gtc acc att aca gag gat tgt ggc aag aga 3359
Cys Pro Gly Thr Lys Val Thr Ile Thr Glu Asp Cys Gly Lys Arg
1075 1080 1085
ggc cct tcg gtc aga acc act act gac agt gga aag ttg atc act 3404
Gly Pro Ser Val Arg Thr Thr Thr Asp Ser Gly Lys Leu Ile Thr
1090 1095 1100
gac tgg tgc tgt cgc agt tgc tcc ctt ccg ccc cta cga ttc cgg 3449
Asp Trp Cys Cys Arg Ser Cys Ser Leu Pro Pro Leu Arg Phe Arg
1105 1110 1115
aca gaa aat ggc tgc tgg tac gga atg gaa atc aga cct gtc agg 3494
Thr Glu Asn Gly Cys Trp Tyr Gly Met Glu Ile Arg Pro Val Arg
1120 1125 1130
cat gat gaa aca aca ctc gtc aga tcg cag gtt gat gct ttt aat 3539
His Asp Glu Thr Thr Leu Val Arg Ser Gln Val Asp Ala Phe Asn
1135 1140 1145
ggt gaa atg gtt gac cct ttt cag ctg ggc ctt ctg gtg atg ttt 3584
Gly Glu Met Val Asp Pro Phe Gln Leu Gly Leu Leu Val Met Phe
1150 1155 1160
ctg gcc acc cag gag gtc ctt cgc aag agg tgg acg gcc aga ttg 3629
Leu Ala Thr Gln Glu Val Leu Arg Lys Arg Trp Thr Ala Arg Leu
1165 1170 1175
acc att cct gcg gtt ttg ggg gcc cta ctt gtg ctg atg ctt ggg 3674
Thr Ile Pro Ala Val Leu Gly Ala Leu Leu Val Leu Met Leu Gly
1180 1185 1190
ggc atc act tac act gat ttg gcg agg tat gtg gtg cta gtc gct 3719
Gly Ile Thr Tyr Thr Asp Leu Ala Arg Tyr Val Val Leu Val Ala
1195 1200 1205
gcc gct ttc gca gag gcc aac agt gga gga gat gtc ctg cac ctt 3764
Ala Ala Phe Ala Glu Ala Asn Ser Gly Gly Asp Val Leu His Leu
1210 1215 1220
gct ttg att gcc gtt ttc aag atc caa cca gca ttt tta gtg atg 3809
Ala Leu Ile Ala Val Phe Lys Ile Gln Pro Ala Phe Leu Val Met
1225 1230 1235
aac atg ctt agc acg aga tgg acg aac caa gaa aat gtg gtt ctg 3854
Asn Met Leu Ser Thr Arg Trp Thr Asn Gln Glu Asn Val Val Leu
1240 1245 1250
gtc cta ggg gct gcc ttt ttc caa ttg gcc tca gta gat ctg caa 3899
Val Leu Gly Ala Ala Phe Phe Gln Leu Ala Ser Val Asp Leu Gln
1255 1260 1265
ata gga gtt cac gga atc ctg aat gcc gcc gct ata gca tgg atg 3944
Ile Gly Val His Gly Ile Leu Asn Ala Ala Ala Ile Ala Trp Met
1270 1275 1280
att gtc cgg gcg atc acc ttc ccc aca acc tcc tcc gtc acc atg 3989
Ile Val Arg Ala Ile Thr Phe Pro Thr Thr Ser Ser Val Thr Met
1285 1290 1295
cea gtc tta gcg ctt cta act ccg gga atg agg gct cta tac cta 4034
Pro Val Leu Ala Leu Leu Thr Pro Gly Met Arg Ala Leu Tyr Leu
1300 1305 1310
gat act tac aga atc atc ctc ctc gtc ata ggg att tgc tct ctg 4079
Asp Thr Tyr Arg Ile Ile Leu Leu Val Ile Gly Ile Cys Ser Leu
1315 1320 1325
ttg caa gag agg aaa aag acc atg gca aaa aag aaa gga gct gta 4124
Leu Gln Glu Arg Lys Lys Thr Met Ala Lys Lys Lys Gly Ala Val
1330 1335 1340
ctc ttg ggc tta gcg ctc aca tcc act gga tgg ttt tcg ccc acc 4169
Leu Leu Gly Leu Ala Leu Thr Ser Thr Gly Trp Phe Ser Pro Thr
1345 1350 1355
act ata gct gcc gga cta atg gtc tgc aac cca aac aag aag aga 4214
Thr Ile Ala Ala Gly Leu Met Val Cys Asn Pro Asn Lys Lys Arg
1360 1365 1370
ggg tgg cca gct act gag ttt ttg tcg gca gtt gga ttg atg ttt 4259
Gly Trp Pro Ala Thr Glu Phe Leu Ser Ala Val Gly Leu Met Phe
1375 1380 1385
gcc atc gta ggt ggt ttg gcg gag ttg gat att gaa tcc atg tca 4304
Ala Ile Val Gly Gly Leu Ala Glu Leu Asp Ile Glu Ser Met Ser
1390 1395 1400
ata ccc ttc atg ctg gca ggt ctc atg gca gtg tcc tac gtg gtg 4349
Ile Pro Phe Met Leu Ala Gly Leu Met Ala Val Ser Tyr Val Val
1405 1410 1415
tca gga aaa gca aca gat atg tgg ctt gaa cgg gct gcc gac atc 4394
Ser Gly Lys Ala Thr Asp Met Trp Leu Glu Arg Ala Ala Asp Ile
1420 1425 1430
agc tgg gag atg gat gct gca atc aca gga agc agt cgg agg ctg 4439
Ser Trp Glu Met Asp Ala Ala Ile Thr Gly Ser Ser Arg Arg Leu
1435 1440 1445
gat gtg aag cta gat gat gac gga gat ttt cac ttg att gac gat 4484
Asp Val Lys Leu Asp Asp Asp Gly Asp Phe His Leu Ile Asp Asp
1450 1455 1460
ccc ggt gtt cca tgg aag gtc tgg gtc ctg cgc atg tct tgc att 4529
Pro Gly Val Pro Trp Lys Val Trp Val Leu Arg Met Ser Cys Ile
1465 1470 1475
ggg tta gcc gcc ctc acg cct tgg gcc att gtt ccc gcc gct ttt 4574
Gly Leu Ala Ala Leu Thr Pro Trp Ala Ile Val Pro Ala Ala Phe
1480 1485 1490
ggt tat tgg ctc act tta aaa aca aca aaa aga ggg ggc gtg ttt 4619
Gly Tyr Trp Leu Thr Leu Lys Thr Thr Lys Arg Gly Gly Val Phe
1495 1500 1505
tgg gac acg cca tcc cca aaa cct tgc tca aaa gga gac acc act 4664
Trp Asp Thr Pro Ser Pro Lys Pro Cys Ser Lys Gly Asp Thr Thr
1510 1515 1520
aca gga gtt tac cgc att atg gct aga ggg att ctt ggc act tac 4709
Thr Gly Val Tyr Arg Ile Met Ala Arg Gly Ile Leu Gly Thr Tyr
1525 1530 1535
cag gcc ggc gtc gga gtc atg tac gag aat gtt ttc cac aca cta 4754
Gln Ala Gly Val Gly Val Met Tyr Glu Asn Val Phe His Thr Leu
1540 1545 1550
tgg cac aca act aga gga gca gct att atg agt gga gaa gga aaa 4799
Trp His Thr Thr Arg Gly Ala Ala Ile Met Ser Gly Glu Gly Lys
1555 1560 1565
ttg acg cca tac tgg ggt agt gtg aaa gaa gac cgc ata gct tac 4844
Leu Thr Pro Tyr Trp Gly Ser Val Lys Glu Asp Arg Ile Ala Tyr
1570 1575 1580
gga ggc cca tgg agg ttt gat cga aaa tgg aat gga act gat gac 4889
Gly Gly Pro Trp Arg Phe Asp Arg Lys Trp Asn Gly Thr Asp Asp
1585 1590 1595
gtg caa gtg atc gtg gta gaa ccg ggg aag gct gca gta aac atc 4934
Val Gln Val Ile Val Val Glu Pro Gly Lys Ala Ala Val Asn Ile
1600 1605 1610
cag aca aaa cca gga gtg ttt cgg act ccc ttc ggg gag gtt ggg 4979
Gln Thr Lys Pro Gly Val Phe Arg Thr Pro Phe Gly Glu Val Gly
1615 1620 1625
gct gtt agt ctg gat tac ccg cga gga aca tcc ggc tca ccc att 5024
Ala Val Ser Leu Asp Tyr Pro Arg Gly Thr Ser Gly Ser Pro Ile
1630 1635 1640
ctg gat tcc aat gga gac atc ata ggc ctg tac ggc aat gga gtt 5069
Leu Asp Ser Asn Gly Asp Ile Ile Gly Leu Tyr Gly Asn Gly Val
1645 1650 1655
gag ctt ggc gat ggc tca tac gtc agc gcc atc gtg cag ggt gac 5114
Glu Leu Gly Asp Gly Ser Tyr Val Ser Ala Ile Val Gln Gly Asp
1660 1665 1670
cgt cag gag gaa cca gtc cca gaa gct tac acc cca aac atg ttg 5159
Arg Gln Glu Glu Pro Val Pro Glu Ala Tyr Thr Pro Asn Met Leu
1675 1680 1685
aga aag aga cag atg acc gta cta gat ttg cac cct ggt tca ggg 5204
Arg Lys Arg Gln Met Thr Val Leu Asp Leu His Pro Gly Ser Gly
1690 1695 1700
aaa acc aag aaa att ctg cca caa ata att aag gac gct att cag 5249
Lys Thr Lys Lys Ile Leu Pro Gln Ile Ile Lys Asp Ala Ile Gln
1705 1710 1715
cag cgc cta aga aca gct gtg ttg gca ccg acg cgg gtg gta gca 5294
Gln Arg Leu Arg Thr Ala Val Leu Ala Pro Thr Arg Val Val Ala
1720 1725 1730
gca gaa atg gca gaa gct ttg aga ggg ctc cca gta cga tat caa 5339
Ala Glu Met Ala Glu Ala Leu Arg Gly Leu Pro Val Arg Tyr Gln
1735 1740 1745
act tca gca gtg cag aga gag cac caa ggg aat gaa ata gtg gat 5384
Thr Ser Ala Val Gln Arg Glu His Gln Gly Asn Glu Ile Val Asp
1750 1755 1760
gtg atg tgc cac gcc act ctg acc cat aga ctg atg tca ccg aac 5429
Val Met Cys His Ala Thr Leu Thr His Arg Leu Met Ser Pro Asn
1765 1770 1775
aga gtg ccc aac tac aac cta ttt gtc atg gat gaa gct cat ttc 5474
Arg Val Pro Asn Tyr Asn Leu Phe Val Met Asp Glu Ala His Phe
1780 1785 1790
acc gac cca gcc agt ata gcc gca cga gga tac att gct acc aag 5519
Thr Asp Pro Ala Ser Ile Ala Ala Arg Gly Tyr Ile Ala Thr Lys
1795 1800 1805
gtg gaa tta ggg gag gca gca gcc atc ttt atg aca gcg acc ccg 5564
Val Glu Leu Gly Glu Ala Ala Ala Ile Phe Met Thr Ala Thr Pro
1810 1815 1820
cct gga acc acg gat cct ttt cct gac tca aat gcc cca atc cat 5609
Pro Gly Thr Thr Asp Pro Phe Pro Asp Ser Asn Ala Pro Ile His
1825 1830 1835
gat ttg caa gat gag ata cca gac agg gcg tgg agc agt gga tac 5654
Asp Leu Gln Asp Glu Ile Pro Asp Arg Ala Trp Ser Ser Gly Tyr
1840 1845 1850
gaa tgg atc aca gaa tat gcg gga aaa acc gtg tgg ttt gtg gca 5699
Glu Trp Ile Thr Glu Tyr Ala Gly Lys Thr Val Trp Phe Val Ala
1855 1860 1865
agc gtg aaa atg ggg aac gag att gca atg tgc ctc caa aga gcg 5744
Ser Val Lys Met Gly Asn Glu Ile Ala Met Cys Leu Gln Arg Ala
1870 1875 1880
ggg aaa aag gtc atc caa ctc aac cgc aag tcc tat gac aca gaa 5789
Gly Lys Lys Val Ile Gln Leu Asn Arg Lys Ser Tyr Asp Thr Glu
1885 1890 1895
tac cca aaa tgt aag aat gga gac tgg gat ttt gtc atc acc act 5834
Tyr Pro Lys Cys Lys Asn Gly Asp Trp Asp Phe Val Ile Thr Thr
1900 1905 1910
gac att tct gaa atg ggg gcc aac ttc ggt gcg agc agg gtc atc 5879
Asp Ile Ser Glu Met Gly Ala Asn Phe Gly Ala Ser Arg Val Ile
1915 1920 1925
gac tgt aga aag agc gtg aag ccc acc atc tta gaa gag gga gaa 5924
Asp Cys Arg Lys Ser Val Lys Pro Thr Ile Leu Glu Glu Gly Glu
1930 1935 1940
ggc aga gtc atc ctc gga aac cca tcg ccc ata acc agt gca agc 5969
Gly Arg Val Ile Leu Gly Asn Pro Ser Pro Ile Thr Ser Ala Ser
1945 1950 1955
gca gct caa cgg agg ggc aga gta ggc aga aac cct aac cag gtt 6014
Ala Ala Gln Arg Arg Gly Arg Val Gly Arg Asn Pro Asn Gln Val
1960 1965 1970
gga gat gaa tac cac tat ggg ggg gcc acc agt gaa gat gac agt 6059
Gly Asp Glu Tyr His Tyr Gly Gly Ala Thr Ser Glu Asp Asp Ser
1975 1980 1985
aac cta gcc cat tgg aca gag gca aag atc atg tta gat aac ata 6104
Asn Leu Ala His Trp Thr Glu Ala Lys Ile Met Leu Asp Asn Ile
1990 1995 2000
cac atg ccc aat gga ctg gtg gcc cag ctc tat gga cca gag agg 6149
His Met Pro Asn Gly Leu Val Ala Gln Leu Tyr Gly Pro Glu Arg
2005 2010 2015
gaa aag gcc ttc aca atg gat ggc gaa tac cgt ctc aga ggt gaa 6194
Glu Lys Ala Phe Thr Met Asp Gly Glu Tyr Arg Leu Arg Gly Glu
2020 2025 2030
gaa aag aaa aac ttc tta gag ctg ctt agg acg gct gac ctc ccg 6239
Glu Lys Lys Asn Phe Leu Glu Leu Leu Arg Thr Ala Asp Leu Pro
2035 2040 2045
gtg tgg ctg gcc tac aag gtg gcg tcc aat ggc atc cag tac acc 6284
Val Trp Leu Ala Tyr Lys Val Ala Ser Asn Gly Ile Gln Tyr Thr
2050 2055 2060
gat aga aag tgg tgt ttt gat ggg ccg cgt acg aat gcc ata ctg 6329
Asp Arg Lys Trp Cys Phe Asp Gly Pro Arg Thr Asn Ala Ile Leu
2065 2070 2075
gag gac aac acc gag gta gag ata gtc acc cgg atg ggt gag agg 6374
Glu Asp Asn Thr Glu Val Glu Ile Val Thr Arg Met Gly Glu Arg
2080 2085 2090
aaa atc ctc aag ccg aga tgg ctt gat gca aga gtt tat gca gat 6419
Lys Ile Leu Lys Pro Arg Trp Leu Asp Ala Arg Val Tyr Ala Asp
2095 2100 2105
cac caa gct ctc aag tgg ttc aaa gac ttc gca gca gga aag aga 6464
His Gln Ala Leu Lys Trp Phe Lys Asp Phe Ala Ala Gly Lys Arg
2110 2115 2120
tca gcc gtt agc ttc ata gag gtg ctc ggt cgt atg cct gag cat 6509
Ser Ala Val Ser Phe Ile Glu Val Leu Gly Arg Met Pro Glu His
2125 2130 2135
ttc atg gga aag acg cgg gaa gct tta gac acc atg tac ttg gtt 6554
Phe Met Gly Lys Thr Arg Glu Ala Leu Asp Thr Met Tyr Leu Val
2140 2145 2150
gca acg gct gag aaa ggt ggg aaa gca cac cga atg gct ctc gaa 6599
Ala Thr Ala Glu Lys Gly Gly Lys Ala His Arg Met Ala Leu Glu
2155 2160 2165
gag ctg cca gat gca ctg gaa acc att aca ctt att gtt gct atc 6644
Glu Leu Pro Asp Ala Leu Glu Thr Ile Thr Leu Ile Val Ala Ile
2170 2175 2180
act gtg atg aca gga gga ttc ttt cta ctc atg atg cag cga aag 6689
Thr Val Met Thr Gly Gly Phe Phe Leu Leu Met Met Gln Arg Lys
2185 2190 2195
ggt ata ggg aag atg ggt ctt gga gct cta gtg ctc acg cta gct 6734
Gly Ile Gly Lys Met Gly Leu Gly Ala Leu Val Leu Thr Leu Ala
2200 2205 2210
acc ttc ttc ctg tgg gcg gca gag gtt ccc gga aca aaa ata gca 6779
Thr Phe Phe Leu Trp Ala Ala Glu Val Pro Gly Thr Lys Ile Ala
2215 2220 2225
ggg acc ctg ctg atc gcc ctg ctg ctt atg gtg gtt ctc atc cca 6824
Gly Thr Leu Leu Ile Ala Leu Leu Leu Met Val Val Leu Ile Pro
2230 2235 2240
gaa ccg gaa aag cag agg tca caa aca gat aat caa ctg gcg gtg 6869
Glu Pro Glu Lys Gln Arg Ser Gln Thr Asp Asn Gln Leu Ala Val
2245 2250 2255
ttt ctc atc tgt gtc ttg acc gtg gtt gga gtg gtg gca gca aac 6914
Phe Leu Ile Cys Val Leu Thr Val Val Gly Val Val Ala Ala Asn
2260 2265 2270
gag tac ggg atg cta gaa aaa acc aaa gca gac ctc aag agc atg 6959
Glu Tyr Gly Met Leu Glu Lys Thr Lys Ala Asp Leu Lys Ser Met
2275 2280 2285
ttt ggc gga aag acg cag gca tca gga ctg act gga tta cca agc 7004
Phe Gly Gly Lys Thr Gln Ala Ser Gly Leu Thr Gly Leu Pro Ser
2290 2295 2300
atg gca ctg gac ctg cgt cca gcc aca gct tgg gca ctg tat ggg 7049
Met Ala Leu Asp Leu Arg Pro Ala Thr Ala Trp Ala Leu Tyr Gly
2305 2310 2315
ggg agc aca gtc gtg cta acc cct ctt ctg aag cac ctg atc acg 7094
Gly Ser Thr Val Val Leu Thr Pro Leu Leu Lys His LeuIle Thr
2320 2325 2330
tcg gaa tac gtc acc aca tcg cta gcc tca att aac tca caa gct 7139
Ser Glu Tyr Val Thr Thr Ser Leu Ala Ser Ile Asn Ser Gln Ala
2335 2340 2345
ggc tca tta ttt gtc ttg cca cga ggc gtg cct ttt acc gac cta 7184
Gly Ser Leu Phe Val Leu Pro Arg Gly Val Pro Phe Thr Asp Leu
2350 2355 2360
gac ttg acc gtt ggc ctc gtc ttc ctt ggc tgt tgg ggt caa atc 7229
Asp Leu Thr Val Gly Leu Val Phe Leu Gly Cys Trp Gly Gln Ile
2365 2370 2375
acc ctc aca acg ttt ttg aca gcc atg gtt ctg gcg aca ctt cac 7274
Thr Leu Thr Thr Phe Leu Thr Ala Met Val Leu Ala Thr Leu His
2380 2385 2390
tat ggg tac atg ctc cct gga tgg caa gca gaa gca ctc agg gca 7319
Tyr Gly Tyr Met Leu Pro Gly Trp Gln Ala Glu Ala Leu Arg Ala
2395 2400 2405
gcc cag aga agg aca gcg gct gga ata atg aag aat gcc gtt gtt 7364
Ala Gln Arg Arg Thr Ala Ala Gly Ile Met Lys Asn Ala Val Val
2410 2415 2420
gac gga atg gtc gcc act gat gtg cct gaa ctg gaa agg acc act 7409
Asp Gly Met Val Ala Thr Asp Val Pro Glu Leu Glu Arg Thr Thr
2425 2430 2435
cct ctg atg caa aag aaa gtc gga cag gtg ctc ctc ata ggg gta 7454
Pro Leu Met Gln Lys Lys Val Gly Gln Val Leu Leu Ile Gly Val
2440 2445 2450
agc gtg gca gcg ttc ctc gtc aac cct aat gtc acc act gtg aga 7499
Ser Val Ala Ala Phe Leu Val Asn Pro Asn Val Thr Thr Val Arg
2455 2460 2465
gaa gca ggg gtg ttg gtg aca gcg gct acg ctc act ttg tgg gac 7544
Glu Ala Gly Val Leu Val Thr Ala Ala Thr Leu Thr Leu Trp Asp
2470 2475 2480
aac gga gcc agt gcc gtt tgg aat tcc acc act gcc acg gga ctc 7589
Asn Gly Ala Ser Ala Val Trp Asn Ser Thr Thr Ala Thr Gly Leu
2485 2490 2495
tgc cat gta atg cga ggt agc tac ctg gct gga ggc tcc att gct 7634
Cys His Val Met Arg Gly Ser Tyr Leu Ala Gly Gly Ser Ile Ala
2500 2505 2510
tgg act ctc atc aag aac gct gac aag ccc tcc tta aaa agg gga 7679
Trp Thr Leu Ile Lys Asn Ala Asp Lys Pro Ser Leu Lys Arg Gly
2515 2520 2525
agg cct ggg ggc agg acg cta ggg gag cag tgg aag gaa aaa cta 7724
Arg Pro Gly Gly Arg Thr Leu Gly Glu Gln Trp Lys Glu Lys Leu
2530 2535 2540
aat gcc atg agc aga gaa gag ttt ttt aaa tac cgg aga gag gcc 7769
Asn Ala Met Ser Arg Glu Glu Phe Phe Lys Tyr Arg Arg Glu Ala
2545 2550 2555
ata atc gag gtg gac cgc act gaa gca cgc agg gct aga cgt gaa 7814
Ile Ile Glu Val Asp Arg Thr Glu Ala Arg Arg Ala Arg Arg Glu
2560 2565 2570
aat aac ata gtg gga gga cat ccg gtt tcg cga ggc tca gca aaa 7859
Asn Asn Ile Val Gly Gly His Pro Val Ser Arg Gly Ser Ala Lys
2575 2580 2585
ctc cgt tgg ctc gta gag aaa gga ttt gtc tcg cca ata gga aaa 7904
Leu Arg Trp Leu Val Glu Lys Gly Phe Val Ser Pro Ile Gly Lys
2590 2595 2600
gtc att gat cta ggg tgt ggg cgt gga gga tgg agc tac tac gca 7949
Val Ile Asp Leu Gly Cys Gly Arg Gly Gly Trp Ser Tyr Tyr Ala
2605 2610 2615
gca acc ctg aag aag gtc cag gaa gtc aga gga tac acg aaa ggt 7994
Ala Thr Leu Lys Lys Val Gln Glu Val Arg Gly Tyr Thr Lys Gly
2620 2625 2630
ggg gcg gga cat gaa gaa ccg atg ctc atg cag agc tac ggc tgg 8039
Gly Ala Gly His Glu Glu Pro Met Leu Met Gln Ser Tyr Gly Trp
2635 2640 2645
aac ctg gtc tcc ctg aag agt gga gtg gac gtt ttt tac aaa cct 8084
Asn Leu Val Ser Leu Lys Ser Gly Val Asp Val Phe Tyr Lys Pro
2650 2655 2660
tca gag ccc agt gac act ctg ttc tgc gac ata ggg gaa tcc tcc 8129
Ser Glu Pro Ser Asp Thr Leu Phe Cys Asp Ile Gly Glu Ser Ser
2665 2670 2675
cca agt cca gaa gta gaa gaa caa cgc aca cta cgc gtc cta gag 8174
Pro Ser Pro Glu Val Glu Glu Gln Arg Thr Leu Arg Val Leu Glu
2680 2685 2690
atg aca tct gac tgg ttg cac cga gga cct aga gag ttc tgt ata 8219
Met Thr Ser Asp Trp Leu His Arg Gly Pro Arg Glu Phe Cys Ile
2695 2700 2705
aaa gtt ctt tgc ccc tac atg ccc aag gtt ata gaa aaa atg gaa 8264
Lys Val Leu Cys Pro Tyr Met Pro Lys Val Ile Glu Lys Met Glu
2710 2715 2720
gtc ctg cag cgc cgc ttc gga ggt ggg cta gtg cgt ctt ccc ctg 8309
Val Leu Gln Arg Arg Phe Gly Gly Gly Leu Val Arg Leu Pro Leu
2725 2730 2735
tcc cgc aac tcc aat cac gag atg tac tgg gtt agt gga ccg gct 8354
Ser Arg Asn Ser Asn His Glu Met Tyr Trp Val Ser Gly Pro Ala
2740 2745 2750
ggc aat gtg gtg cac gct gtg aac atg acc agc cag gta cta ctg 8399
Gly Asn Val Val His Ala Val Asn Met Thr Ser Gln Val Leu Leu
2755 2760 2765
ggg cga atg gat cgc aca gtg tgg aga ggg cca aag tat gag gaa 8444
Gly Arg Met Asp Arg Thr Val Trp Arg Gly Pro Lys Tyr Glu Glu
2770 2775 2780
gat gtc aac tta ggg agc gga aca aga gcc gtg gga aag gga gaa 8489
Asp Val Asn Leu Gly Ser Gly Thr Arg Ala Val Gly Lys Gly Glu
2785 2790 2795
gtc cat agc aat cag gag aaa atc aag aag aga atc cag aag ctt 8534
Val His Ser Asn Gln Glu Lys Ile Lys Lys Arg Ile Gln Lys Leu
2800 2805 2810
aaa gaa gaa ttc gcc aca acg tgg cac aaa gac cct gag cat cca 8579
Lys Glu Glu Phe Ala Thr Thr Trp His Lys Asp Pro Glu His Pro
2815 2820 2825
tac cgc act tgg aca tac cac gga agc tat gaa gtg aag gct act 8624
Tyr Arg Thr Trp Thr Tyr His Gly Ser Tyr Glu Val Lys Ala Thr
2830 2835 2840
ggc tca gct agt tct ctc gtc aac gga gtg gtg aag ctc atg agc 8669
Gly Ser Ala Ser Ser Leu Val Asn Gly Val Val Lys Leu Met Ser
2845 2850 2855
aaa cct tgg gac gcc att gcc aac gtc acc acc atg gcc atg act 8714
Lys Pro Trp Asp Ala Ile Ala Asn Val Thr Thr Met Ala Met Thr
2860 2865 2870
gac acc acc cct ttt gga cag caa aga gtt ttc aag gag aaa gtt 8759
Asp Thr Thr Pro Phe Gly Gln Gln Arg Val Phe Lys Glu Lys Val
2875 2880 2885
gac acg aag gct cct gag cca cca act gga gct aag gaa gtg ctc 8804
Asp Thr Lys Ala Pro Glu Pro Pro Thr Gly Ala Lys Glu Val Leu
2890 2895 2900
aac gag acc acc aac tgg ctg tgg gcc tac ttg tca cgg gaa aaa 8849
Asn Glu Thr Thr Asn Trp Leu Trp Ala Tyr Leu Ser Arg Glu Lys
2905 2910 2915
aga ccc cgc ttg tgc acc aag gaa gaa ttc ata aag aaa gtc aat 8894
Arg Pro Arg Leu Cys Thr Lys Glu Glu Phe Ile Lys Lys Val Asn
2920 2925 2930
agc aac gcg gct ctt gga gca gtg ttc gct gaa cag aat caa tgg 8939
Ser Asn Ala Ala Leu Gly Ala Val Phe Ala Glu Gln Asn Gln Trp
2935 2940 2945
agc acg gcg cgt gag gct gtg gat gac ccg cgg ttt tgg gag atg 8984
Ser Thr Ala Arg Glu Ala Val Asp Asp Pro Arg Phe Trp Glu Met
2950 2955 2960
gtt gat gaa gag agg gaa aac cat ctg cga gga gag tgt cac aca 9029
Val Asp Glu Glu Arg Glu Asn His Leu Arg Gly Glu Cys His Thr
2965 2970 2975
tgt atc tat aac atg atg gga aaa aga gag aag aag cct gga gag 9074
Cys Ile Tyr Asn Met Met Gly Lys Arg Glu Lys Lys Pro Gly Glu
2980 2985 2990
ttt gga aaa gct aaa gga agc agg gcc att tgg ttc atg tgg ctt 9119
Phe Gly Lys Ala Lys Gly Ser Arg Ala Ile Trp Phe Met Trp Leu
2995 3000 3005
gga gca cgg tat cta gag ttt gaa gct ttg ggg ttc ctg aat gaa 9164
Gly Ala Arg Tyr Leu Glu Phe Glu Ala Leu Gly Phe Leu Asn Glu
3010 3015 3020
gat cat tgg ctg agc cga gag aat tca gga ggt gga gtg gaa ggc 9209
Asp His Trp Leu Ser Arg Glu Asn Ser Gly Gly Gly Val Glu Gly
3025 3030 3035
tca ggc gtc caa aag ctg gga tac atc ctc cgt gat ata gca gga 9254
Ser Gly Val Gln Lys Leu Gly Tyr Ile Leu Arg Asp Ile Ala Gly
3040 3045 3050
aag caa gga gga aaa atg tac gct gat gat acc gcc ggg tgg gac 9299
Lys Gln Gly Gly Lys Met Tyr Ala Asp Asp Thr Ala Gly Trp Asp
3055 3060 3065
act aga att acc aga act gat tta gaa aat gaa gcc aag gtg ctg 9344
Thr Arg Ile Thr Arg Thr Asp Leu Glu Asn Glu Ala Lys Val Leu
3070 3075 3080
gag ctt cta gac ggt gaa cac cgc atg ctc gcc cga gcc ata att 9389
Glu Leu Leu Asp Gly Glu His Arg Met Leu Ala Arg Ala Ile Ile
3085 3090 3095
gaa ttg act tac agg cac aaa gtg gtc aag gtc atg aga cct gca 9434
Glu Leu Thr Tyr Arg His Lys Val Val Lys Val Met Arg Pro Ala
3100 3105 3110
gca gaa gga aag acc gtg atg gac gtg ata tca agg gag gat caa 9479
Ala Glu Gly Lys Thr Val Met Asp Val Ile Ser Arg Glu Asp Gln
3115 3120 3125
agg ggg agt gga cag gtg gtc act tat gct ctt aac act ttc acg 9524
Arg Gly Ser Gly Gln Val Val Thr Tyr Ala Leu Asn Thr Phe Thr
3130 3135 3140
aac atc gct gtc cag ctc gtc agg ctg atg gag gct gag ggg gtc 9569
Asn Ile Ala Val Gln Leu Val Arg Leu Met Glu Ala Glu Gly Val
3145 3150 3155
att gga cca caa cac ttg gaa cag cta cct aga aaa aac aag ata 9614
Ile Gly Pro Gln His Leu Glu Gln Leu Pro Arg Lys Asn Lys Ile
3160 3165 3170
gct gtc agg acc tgg ctc ttt gag aat gga gag gag aga gtg tcc 9659
Ala Val Arg Thr Trp Leu Phe Glu Asn Gly Glu Glu Arg Val Ser
3175 3180 3185
agg atg gct atc agc gga gac gac tgt gtc gtc aag ccg ctg gac 9704
Arg Met Ala Ile Ser Gly Asp Asp Cys Val Val Lys Pro Leu Asp
3190 3195 3200
gac aga ttc gcc acg gcc ctc cac ttc ctc aac gca atg tca aag 9749
Asp Arg Phe Ala Thr Ala Leu His Phe Leu Asn Ala Met Ser Lys
3205 3210 3215
gtc aga aaa gac atc cag gaa tgg aag cct tca cat ggc tgg cac 9794
Val Arg Lys Asp Ile Gln Glu Trp Lys Pro Ser His Gly Trp His
3220 3225 3230
gat tgg cag caa gtt ccc ttc tgc tct aac cat ttt cag gag att 9839
Asp Trp Gln Gln Val Pro Phe Cys Ser Asn His Phe Gln Glu Ile
3235 3240 3245
gtg atg aaa gat gga agg agt ata gtt gtc ccg tgc aga gga cag 9884
Val Met Lys Asp Gly Arg Ser Ile Val Val Pro Cys Arg Gly Gln
3250 3255 3260
gat gag ctg ata ggc agg gct cgc atc tcc cca gga gct gga tgg 9929
Asp Glu Leu Ile Gly Arg Ala Arg Ile Ser Pro Gly Ala Gly Trp
3265 3270 3275
aat gtg aag gac aca gct tgt ctg gcc aaa gca tat gca cag atg 9974
Asn Val Lys Asp Thr Ala Cys Leu Ala Lys Ala Tyr Ala Gln Met
3280 3285 3290
tgg cta ctc cta tac ttc cat cgt agg gac ttg cgt ctc atg gca 10019
Trp Leu Leu Leu Tyr Phe His Arg Arg Asp Leu Arg Leu Met Ala
3295 3300 3305
aat gcg att tgc tca gca gtg cca gtg gat tgg gtg ccc acg ggc 10064
Asn Ala Ile Cys Ser Ala Val Pro Val Asp Trp Val Pro Thr Gly
3310 3315 3320
agg aca tcc tgg tcg ata cac tcg aaa gga gag tgg atg acc aca 10109
Arg Thr Ser Trp Ser Ile His Ser Lys Gly Glu Trp Met Thr Thr
3325 3330 3335
gaa gac atg ctg cag gtc tgg aac aga gtc tgg att gaa gaa aat 10154
Glu Asp Met Leu Gln Val Trp Asn Arg Val Trp Ile Glu Glu Asn
3340 3345 3350
gaa tgg atg gtg gac aag act cca ata aca agc tgg aca gac gtt 10199
Glu Trp Met Val Asp Lys Thr Pro Ile Thr Ser Trp Thr Asp Val
3355 3360 3365
ccg tat gtg gga aag cgg gag gac atc tgg tgt ggc agc ctc atc 10244
Pro Tyr Val Gly Lys Arg Glu Asp Ile Trp Cys Gly Ser Leu Ile
3370 3375 3380
gga acg cga tcc aga gca acc tgg gct gag aac atc tac gcg gcg 10289
Gly Thr Arg Ser Arg Ala Thr Trp Ala Glu Asn Ile Tyr Ala Ala
3385 3390 3395
ata aac cag gtt aga gct gtc att ggg aaa gaa aat tat gtt gac 10334
Ile Asn Gln Val Arg Ala Val Ile Gly Lys Glu Asn Tyr Val Asp
3400 3405 3410
tac atg acc tca ctc agg aga tac gaa gat gtc ttg atc cag gaa 10379
Tyr Met Thr Ser Leu Arg Arg Tyr Glu Asp Val Leu Ile Gln Glu
3415 3420 3425
gac agg gtc atc tagtgtgatt taaggtggaa aagcagatta tgtaaataat 10431
Asp Arg Val Ile
3430
gtaaatgaga aaatgcatgc atatggagtc aggccagcaa aagctgccac cggatactgg 10491
gtagacggtg ctgtctgcgt cccagtccca ggaggactgg gttaacaaat ctgacaacag 10551
aaagtgagaa agccctcaga accgtctcgg aagcaggtcc ctgctcactg gaagttgaag 10611
gaccaacgtc aggccacaaa tttgtgccac tccgctgagg agtgcggcct gcgcagcccc 10671
aggaggactg ggttaccaaa gccgttgagc ccccacggcc caagcctcgt ctaggatgca 10731
atagacgagg tgtaaggact agaggttaga ggagaccccg tggaaacaac aacatgcggc 1079l
ccaagccccc tcgaagctgt agaggaggtg gaaggactag aggttagagg agaccccgca 10851
tttgcatcaa acagcatatt gacacctggg aatagactgg gagatcttct gctctatctc 10911
aacatcagct actaggcaca gagcgccgaa gtatgtagct ggtggtgagg aagaacacag 10971
gatct 10976
<210>4
<211>3432
<212>PRT
<213>日本脑炎病毒
<400>4
Met Thr Lys Lys Pro Gly Gly Pro Gly Lys Asn Arg Ala Ile Asn Met
1 5 10 15
Leu Lys Arg Gly Leu Pro Arg Val Phe Pro Leu Val Gly Val Lys Arg
20 25 30
Val Val Met Ser Leu Leu Asp Gly Arg Gly Pro Val Arg Phe Val Leu
35 40 45
Ala Leu Ile Thr Phe Phe Lys Phe Thr Ala Leu Ala Pro Thr Lys Ala
50 55 60
Leu Leu Gly Arg Trp Lys Ala Val Glu Lys Ser Val Ala Met Lys His
65 70 75 80
Leu Thr Ser Phe Lys Arg Glu Leu Gly Thr Leu Ile Asp Ala Val Asn
85 90 95
Lys Arg Gly Arg Lys Gln Asn Lys Arg Gly Gly Asn Glu Gly SerIle
100 105 110
Met Trp Leu Ala Ser Leu Ala Val Val Ile Ala Cys Ala Gly Ala Met
115 120 125
Lys Leu Ser Asn Phe Gln Gly Lys Leu Leu Met Thr Ile Asn Asn Thr
130 135 140
Asp Ile Ala Asp Val Ile Val Ile Pro Thr Ser Lys Gly Glu Asn Arg
145 150 155 160
Cys Trp Val Arg Ala Ile Asp Val Gly Tyr Met Cys Glu Asp Thr Ile
165 170 175
Thr Tyr Glu Cys Pro Lys Leu Ala Met Gly Asn Asp Pro Glu Asp Val
180 185 190
Asp Cys Trp Cys Asp Asn Gln Glu Val Tyr Val Gln Tyr Gly Arg Cys
195 200 205
Thr Arg Thr Arg His Ser Lys Arg Ser Arg Arg Ser Val Ser Val Gln
210 215 220
Thr His Gly Glu Ser Ser Leu Val Asn Lys Lys Glu Ala Trp Leu Asp
225 230 235 240
Ser Thr Lys Ala Thr Arg Tyr Leu Met Lys Thr Glu Asn Trp Ile Ile
245 250 255
Arg Asn Pro Gly Tyr Ala Phe Leu Ala Ala Val Leu Gly Trp Met Leu
260 265 270
Gly Ser Asn Asn Gly Gln Arg Val Val Phe Thr Ile Leu Leu Leu Leu
275 280 285
Val Ala Pro Ala Tyr Ser Phe Asn Cys Leu Gly Met Gly Asn Arg Asp
290 295 300
Phe Ile Glu Gly Ala Ser Gly Ala Thr Trp Val Asp Leu Val Leu Glu
305 310 315 320
Gly Asp Ser Cys Leu Thr Ile Met Ala Asn Asp Lys Pro Thr Leu Asp
325 330 335
Val Arg Met Ile Asn Ile Glu Ala Ser Gln Leu Ala Glu Val Arg Ser
340 345 350
Tyr Cys Tyr His Ala Ser Val Thr Asp Ile Ser Thr Val Ala Arg Cys
355 360 365
Pro Thr Thr Gly Glu Ala His Asn Glu Lys Arg Ala Asp Ser Ser Tyr
370 375 380
Val Cys Lys Gln Gly Phe Thr Asp Arg Gly Trp Gly Asn Gly Cys Gly
385 390 395 400
Leu Phe Gly Lys Gly Ser Ile Asp Thr Cys Ala Lys Phe Ser Cys Thr
405 410 415
Ser Lys Ala Ile Gly Arg Thr Ile Gln Pro Glu Asn Ile Lys Tyr Lys
420 425 430
Val Gly Ile Phe Val His Gly Ala Thr Thr Ser Glu Asn His Gly Asn
435 440 445
Tyr Ser Ala Gln Val Gly Ala Ser Gln Ala Ala Lys Phe Thr Val Thr
450 455 460
Pro Asn Ala Pro Ser Ile Thr Leu Lys Leu Gly Asp Tyr Gly Glu Val
465 470 475 480
Thr Leu Asp Cys Glu Pro Arg Ser Gly Leu Asn Thr Glu Ala Phe Tyr
485 490 495
Val Met Thr Val Gly Ser Lys Ser Phe Leu Val His Arg Glu Trp Phe
500 505 510
His Asp Leu Ala Leu Pro Trp Thr Ser Pro Ser Ser Thr Ala Trp Arg
515 520 525
Asn Arg Glu Leu Leu Met Glu Phe Glu Glu Ala His Ala Thr Lys Gln
530 535 540
Ser Val Val Ala Leu Gly Ser Gln Glu Gly Gly Leu His Gln Ala Leu
545 550 555 560
Ala Gly Ala Ile Val Val Glu Tyr Ser Ser Ser Val Lys Leu Thr Ser
565 570 575
Gly His Leu Lys Cys Arg Leu Lys Met Asp Lys Leu Ala Leu Lys Gly
580 585 590
Thr Thr Tyr Gly Met Cys Thr Glu Lys Phe Ser Phe Ala Lys Asn Pro
595 600 605
Ala Asp Thr Gly His Gly Thr Val Val Ile Glu Leu Ser Tyr Ser Gly
610 615 620
Ser Asp Gly Pro Cys Lys Ile Pro Ile Val Ser Val Ala Ser Leu Asn
625 630 635 640
Asp Met Thr Pro Val Gly Arg Leu Val Thr Val Asn Pro Phe Val Ala
645 650 655
Thr Ser Ser Ala Asn Ser Lys Val Leu Val Glu Met Glu Pro Pro Phe
660 665 670
Gly Asp Ser Tyr Ile Val Val Gly Arg Gly Asp Lys Gln Ile Asn His
675 680 685
His Trp His Lys Ala Gly Ser Thr Leu Gly Lys Ala Phe Ser Thr Thr
690 695 700
Leu Lys Gly Ala Gln Arg Leu Ala Ala Leu Gly Asp Thr Ala Trp Asp
705 710 715 720
Phe Gly Ser Ile Gly Gly Val Phe Asn Ser Ile Gly Lys Ala Val His
725 730 735
Gln Val Phe Gly Gly Ala Phe Arg Thr Leu Phe Gly Gly Met Ser Trp
740 745 750
Ile Thr Gln Gly Leu Met Gly Ala Leu Leu Leu Trp Met Gly Val Asn
755 760 765
Ala Arg Asp Arg Ser Ile Ala Leu Ala Phe Leu Ala Thr Gly Gly Val
770 775 780
Leu Val Phe Leu Ala Thr Asn Val His Ala Asp Thr Gly Cys Ala Ile
785 790 795 800
Asp Ile Thr Arg Lys Glu Met Arg Cys Gly Ser Gly Ile Phe Val His
805 810 815
Asn Asp Val Glu Ala Trp Val Asp Arg Tyr Lys Tyr Leu Pro Glu Thr
820 825 830
Pro Arg Ser Leu Ala Lys Ile Val His Lys Ala His Lys Glu Gly Val
835 840 845
Cys Gly Val Arg Ser Val Thr Arg Leu Glu His Gln Met Trp Glu Ala
850 855 860
Val Arg Asp Glu Leu Asn Val Leu Leu Lys Glu Asn Ala Val Asp Leu
865 870 875 880
Ser Val Val Val Asn Lys Pro Val Gly Arg Tyr Arg Ser Ala Pro Lys
885 890 895
Arg Leu Ser Met Thr Gln Glu Lys Phe Glu Met Gly Trp Lys Ala Trp
900 905 910
Gly Lys Ser Ile Leu Phe Ala Pro Glu Leu Ala Asn Ser Thr Phe Val
915 920 925
Val Asp Gly Pro Glu Thr Lys Glu Cys Pro Asp Glu His Arg Ala Trp
930 935 940
Asn Ser Met Gln Ile Glu Asp Phe Gly Phe Gly Ile Thr Ser Thr Arg
945 950 955 960
Val Trp Leu Lys Ile Arg Glu Glu Ser Thr Asp Glu Cys Asp Gly Ala
965 970 975
Ile Ile Gly Thr Ala Val Lys Gly His Val Ala Val His Ser Asp Leu
980 985 990
Ser Tyr Trp Ile Glu Ser Arg Tyr Asn Asp Thr Trp Lys Leu Glu Arg
995 1000 1005
Ala Val Phe Gly Glu Val Lys Ser Cys Thr Trp Pro Glu Thr His
1010 1015 1020
Thr Leu Trp Gly Asp Gly Val Glu Glu Ser Glu Leu Ile Ile Pro
1025 1030 1035
His Thr Ile Ala Gly Pro Lys Ser Lys His Asn Arg Arg Glu Gly
1040 1045 1050
Tyr Lys Thr Gln Asn Gln Gly Pro Trp Asp Glu Asn Gly Ile Val
1055 1060 1065
Leu Asp Phe Asp Tyr Cys Pro Gly Thr Lys Val Thr Ile Thr Glu
1070 1075 1080
Asp Cys Gly Lys Arg Gly Pro Ser Val Arg Thr Thr Thr Asp Ser
1085 1090 1095
Gly Lys Leu Ile Thr Asp Trp Cys Cys Arg Ser Cys Ser Leu Pro
1100 1105 1110
Pro Leu Arg Phe Arg Thr Glu Asn Gly Cys Trp Tyr Gly Met Glu
1115 1120 1125
Ile Arg Pro Val Arg His Asp Glu Thr Thr Leu Val Arg Ser Gln
1130 1135 1140
Val Asp Ala Phe Asn Gly Glu Met Val Asp Pro Phe Gln Leu Gly
1145 1150 1155
Leu Leu Val Met Phe Leu Ala Thr Gln Glu Val Leu Arg Lys Arg
1160 1165 1170
Trp Thr Ala Arg Leu Thr Ile Pro Ala Val Leu Gly Ala Leu Leu
1175 1180 1185
Val Leu Met Leu Gly Gly Ile Thr Tyr Thr Asp Leu Ala Arg Tyr
1190 1195 1200
Val Val Leu Val Ala Ala Ala Phe Ala Glu Ala Asn Ser Gly Gly
1205 1210 1215
Asp Val Leu His Leu Ala Leu Ile Ala Val Phe Lys Ile Gln Pro
1220 1225 1230
Ala Phe Leu Val Met Asn Met Leu Ser Thr Arg Trp Thr Asn Gln
1235 1240 1245
Glu Asn Val Val Leu Val Leu Gly Ala Ala Phe Phe Gln Leu Ala
1250 1255 1260
Ser Val Asp Leu Gln Ile Gly Val His Gly Ile Leu Asn Ala Ala
1265 1270 1275
Ala Ile Ala Trp Met Ile Val Arg Ala Ile Thr Phe Pro Thr Thr
1280 1285 1290
Ser Ser Val Thr Met Pro Val Leu Ala Leu Leu Thr Pro Gly Met
1295 1300 1305
Arg Ala Leu Tyr Leu Asp Thr Tyr Arg Ile Ile Leu Leu Val Ile
1310 1315 1320
Gly Ile Cys Ser Leu Leu Gln Glu Arg Lys Lys Thr Met Ala Lys
1325 1330 1335
Lys Lys Gly Ala Val Leu Leu Gly Leu Ala Leu Thr Ser Thr Gly
1340 1345 1350
Trp Phe Ser Pro Thr Thr Ile Ala Ala Gly Leu Met Val Cys Asn
1355 1360 1365
Pro Asn Lys Lys Arg Gly Trp Pro Ala Thr Glu Phe Leu Ser Ala
1370 1375 1380
Val Gly Leu Met Phe Ala Ile Val Gly Gly Leu Ala Glu Leu Asp
1385 1390 1395
Ile Glu Ser Met Ser Ile Pro Phe Met Leu Ala Gly Leu Met Ala
1400 1405 1410
Val Ser Tyr Val Val Ser Gly Lys Ala Thr Asp Met Trp Leu Glu
1415 1420 1425
Arg Ala Ala Asp Ile Ser Trp Glu Met Asp Ala Ala Ile Thr Gly
1430 1435 1440
Ser Ser Arg Arg Leu Asp Val Lys Leu Asp Asp Asp Gly Asp Phe
1445 1450 1455
His Leu Ile Asp Asp Pro Gly Val Pro Trp Lys Val Trp Val Leu
1460 1465 1470
Arg Met Ser Cys Ile Gly Leu Ala Ala Leu Thr Pro Trp Ala Ile
1475 1480 1485
Val Pro Ala Ala Phe Gly Tyr Trp Leu Thr Leu Lys Thr Thr Lys
1490 1495 1500
Arg Gly Gly Val Phe Trp Asp Thr Pro Ser Pro Lys Pro Cys Ser
1505 1510 1515
Lys Gly Asp Thr Thr Thr Gly Val Tyr Arg Ile Met Ala Arg Gly
1520 1525 1530
Ile Leu Gly Thr Tyr Gln Ala Gly Val GlyVal Met Tyr Glu Asn
1535 1540 1545
Val Phe His Thr Leu Trp His Thr Thr Arg Gly Ala Ala Ile Met
1550 1555 1560
Ser Gly Glu Gly Lys Leu Thr Pro Tyr Trp Gly Ser Val Lys Glu
1565 1570 1575
Asp Arg Ile Ala Tyr Gly Gly Pro Trp Arg Phe Asp Arg Lys Trp
1580 1585 1590
Asn Gly Thr Asp Asp Val Gln Val Ile Val Val Glu Pro Gly Lys
1595 1600 1605
Ala Ala Val Asn Ile Gln Thr Lys Pro Gly Val Phe Arg Thr Pro
1610 1615 1620
Phe Gly Glu Val Gly Ala Val Ser Leu Asp Tyr Pro Arg Gly Thr
1625 1630 1635
Ser Gly Ser Pro Ile Leu Asp Ser Asn Gly Asp Ile Ile Gly Leu
1640 1645 1650
Tyr Gly Asn Gly Val Glu Leu Gly Asp Gly Ser Tyr Val Ser Ala
1655 1660 1665
Ile Val Gln Gly Asp Arg Gln Glu Glu Pro Val Pro Glu Ala Tyr
1670 1675 1680
Thr Pro Asn Met Leu Arg Lys Arg Gln Met Thr Val Leu Asp Leu
1685 1690 1695
His Pro Gly Ser Gly Lys Thr Lys Lys Ile Leu Pro Gln Ile Ile
1700 1705 1710
Lys Asp Ala Ile Gln Gln Arg Leu Arg Thr Ala Val Leu Ala Pro
1715 1720 1725
Thr Arg Val Val Ala Ala Glu Met Ala Glu Ala Leu Arg Gly Leu
1730 1735 1740
Pro Val Arg Tyr Gln Thr Ser Ala Val Gln Arg Glu His Gln Gly
1745 1750 1755
Asn Glu Ile Val Asp Val Met Cys His Ala Thr Leu Thr His Arg
1760 1765 1770
Leu Met Ser Pro Asn Arg Val Pro Asn Tyr Asn Leu Phe Val Met
1775 1780 1785
Asp Glu Ala His Phe Thr Asp Pro Ala Ser Ile Ala Ala Arg Gly
1790 1795 1800
Tyr Ile Ala Thr Lys Val Glu Leu Gly Glu Ala Ala Ala Ile Phe
1805 1810 1815
Met Thr Ala Thr Pro Pro Gly Thr Thr Asp Pro Phe Pro Asp Ser
1820 1825 1830
Asn Ala Pro Ile His Asp Leu Gln Asp Glu Ile Pro Asp Arg Ala
1835 1840 1845
Trp Ser Ser Gly Tyr Glu Trp Ile Thr Glu Tyr Ala Gly Lys Thr
1850 1855 1860
Val Trp Phe Val Ala Ser Val Lys Met Gly Asn Glu Ile Ala Met
1865 1870 1875
Cys Leu Gln Arg Ala Gly Lys Lys Val Ile Gln Leu Asn Arg Lys
1880 1885 1890
Ser Tyr Asp Thr Glu Tyr Pro Lys Cys Lys Asn Gly Asp Trp Asp
1895 1900 1905
Phe Val Ile Thr Thr Asp Ile Ser Glu Met Gly Ala Asn Phe Gly
1910 1915 1920
Ala Ser Arg Val Ile Asp Cys Arg Lys Ser Val Lys Pro Thr Ile
1925 1930 1935
Leu Glu Glu Gly Glu Gly Arg Val Ile Leu Gly Asn Pro Ser Pro
1940 1945 1950
Ile Thr Ser Ala Ser Ala Ala Gln Arg Arg Gly Arg Val Gly Arg
1955 1960 1965
Asn Pro Asn Gln Val Gly Asp Glu Tyr His Tyr Gly Gly Ala Thr
1970 1975 1980
Ser Glu Asp Asp Ser Asn Leu Ala His Trp Thr Glu Ala Lys Ile
1985 1990 1995
Met Leu Asp Asn Ile His Met Pro Asn Gly Leu Val Ala Gln Leu
2000 2005 2010
Tyr Gly Pro Glu Arg Glu Lys Ala Phe Thr Met Asp Gly Glu Tyr
2015 2020 2025
Arg Leu Arg Gly Glu Glu Lys Lys Asn Phe Leu Glu Leu Leu Arg
2030 2035 2040
Thr Ala Asp Leu Pro Val Trp Leu Ala Tyr Lys Val Ala Ser Asn
2045 2050 2055
Gly Ile Gln Tyr Thr Asp Arg Lys Trp Cys Phe Asp Gly Pro Arg
2060 2065 2070
Thr Asn Ala Ile Leu Glu Asp Asn Thr Glu Val Glu Ile Val Thr
2075 2080 2085
Arg Met Gly Glu Arg Lys Ile Leu Lys Pro Arg Trp Leu Asp Ala
2090 2095 2100
Arg Val Tyr Ala Asp His Gln Ala Leu Lys Trp Phe Lys Asp Phe
2105 2110 2115
Ala Ala Gly Lys Arg Ser Ala Val Ser Phe Ile Glu Val Leu Gly
2120 2125 2130
Arg Met Pro Glu His Phe Met Gly Lys Thr Arg Glu Ala Leu Asp
2135 2140 2145
Thr Met Tyr Leu Val Ala Thr Ala Glu Lys Gly Gly Lys Ala His
2150 2155 2160
Arg Met Ala Leu Glu Glu Leu Pro Asp Ala Leu Glu Thr Ile Thr
2165 2170 2175
Leu Ile Val Ala Ile Thr Val Met Thr Gly Gly Phe Phe Leu Leu
2180 2185 2190
Met Met Gln Arg Lys Gly Ile Gly Lys Met Gly Leu Gly Ala Leu
2195 2200 2205
Val Leu Thr Leu Ala Thr Phe Phe Leu Trp Ala Ala Glu Val Pro
2210 2215 2220
Gly Thr Lys Ile Ala Gly Thr Leu Leu Ile Ala Leu Leu Leu Met
2225 2230 2235
Val Val Leu Ile Pro Glu Pro Glu Lys Gln Arg Ser Gln Thr Asp
2240 2245 2250
Asn Gln Leu Ala Val Phe Leu Ile Cys Val Leu Thr Val Val Gly
2255 2260 2265
Val Val Ala Ala Asn Glu Tyr Gly Met Leu Glu Lys Thr Lys Ala
2270 2275 2280
Asp Leu Lys Ser Met Phe Gly Gly Lys Thr Gln Ala Ser Gly Leu
2285 2290 2295
Thr Gly Leu Pro Ser Met Ala Leu Asp Leu Arg Pro Ala Thr Ala
2300 2305 2310
Trp Ala Leu Tyr Gly Gly Ser Thr Val Val Leu Thr Pro Leu Leu
2315 2320 2325
Lys His Leu Ile Thr Ser Glu Tyr Val Thr Thr Ser Leu Ala Ser
2330 2335 2340
Ile Asn Ser Gln Ala Gly Ser Leu Phe Val Leu Pro Arg Gly Val
2345 2350 2355
Pro Phe Thr Asp Leu Asp Leu Thr Val Gly Leu Val Phe Leu Gly
2360 2365 2370
Cys Trp Gly Gln Ile Thr Leu Thr Thr Phe Leu Thr Ala Met Val
2375 2380 2385
Leu Ala Thr Leu His Tyr Gly Tyr Met Leu Pro Gly Trp Gln Ala
2390 2395 2400
Glu Ala Leu Arg Ala Ala Gln Arg Arg Thr Ala Ala Gly Ile Met
2405 2410 2415
Lys Asn Ala Val Val Asp Gly Met Val Ala Thr Asp Val Pro Glu
2420 2425 2430
Leu Glu Arg Thr Thr Pro Leu Met Gln Lys Lys Val Gly Gln Val
2435 2440 2445
Leu Leu Ile Gly Val Ser Val Ala Ala Phe Leu Val Asn Pro Asn
2450 2455 2460
Val Thr Thr Val Arg Glu Ala Gly Val Leu Val Thr Ala Ala Thr
2465 2470 2475
Leu Thr Leu Trp Asp Asn Gly Ala Ser Ala Val Trp Asn Ser Thr
2480 2485 2490
Thr Ala Thr Gly Leu Cys His Val Met Arg Gly Ser Tyr Leu Ala
2495 2500 2505
Gly Gly Ser Ile Ala Trp Thr LeuIle Lys Asn Ala Asp Lys Pro
2510 2515 2520
Ser Leu Lys Arg Gly Arg Pro Gly Gly Arg Thr Leu Gly Glu Gln
2525 2530 2535
Trp Lys Glu Lys Leu Asn Ala Met Ser Arg Glu Glu Phe Phe Lys
2540 2545 2550
Tyr Arg Arg Glu Ala Ile Ile Glu Val Asp Arg Thr Glu Ala Arg
2555 2560 2565
Arg Ala Arg Arg Glu Asn Asn Ile Val Gly Gly His Pro Val Ser
2570 2575 2580
Arg Gly Ser Ala Lys Leu Arg Trp Leu Val Glu Lys Gly Phe Val
2585 2590 2595
Ser Pro Ile Gly Lys Val Ile Asp Leu Gly Cys Gly Arg Gly Gly
2600 2605 2610
Trp Ser Tyr Tyr Ala Ala Thr Leu Lys Lys Val Gln Glu Val Arg
2615 2620 2625
Gly Tyr Thr Lys Gly Gly Ala Gly His Glu Glu Pro Met Leu Met
2630 2635 2640
Gln Ser Tyr Gly Trp Asn Leu Val Ser Leu Lys Ser Gly Val Asp
2645 2650 2655
Val Phe Tyr Lys Pro Ser Glu Pro Ser Asp Thr Leu Phe Cys Asp
2660 2665 2670
Ile Gly Glu Ser Ser Pro Ser Pro Glu Val Glu Glu Gln Arg Thr
2675 2680 2685
Leu Arg Val Leu Glu Met Thr Ser Asp Trp Leu His Arg Gly Pro
2690 2695 2700
Arg Glu Phe Cys Ile Lys Val Leu Cys Pro Tyr Met Pro Lys Val
2705 2710 2715
Ile Glu Lys Met Glu Val Leu Gln Arg Arg Phe Gly Gly Gly Leu
2720 2725 2730
Val Arg Leu Pro Leu Ser Arg Asn Ser Asn His Glu Met Tyr Trp
2735 2740 2745
Val Ser Gly Pro Ala Gly Asn Val Val His Ala Val Asn Met Thr
2750 2755 2760
Ser Gln Val Leu Leu Gly Arg Met Asp Arg Thr Val Trp Arg Gly
2765 2770 2775
Pro Lys Tyr Glu Glu Asp Val Asn Leu Gly Ser Gly Thr Arg Ala
2780 2785 2790
Val Gly Lys Gly Glu Val His Ser Asn Gln Glu Lys Ile Lys Lys
2795 2800 2805
Arg Ile Gln Lys Leu Lys Glu Glu Phe Ala Thr Thr Trp His Lys
2810 2815 2820
Asp Pro Glu His Pro Tyr Arg Thr Trp Thr Tyr His Gly Ser Tyr
2825 2830 2835
Glu Val Lys Ala Thr Gly Ser Ala Ser Ser Leu Val Asn Gly Val
2840 2845 2850
Val Lys Leu Met Ser Lys Pro Trp Asp Ala Ile Ala Asn Val Thr
2855 2860 2865
Thr Met Ala Met Thr Asp Thr Thr Pro Phe Gly Gln Gln Arg Val
2870 2875 2880
Phe Lys Glu Lys Val Asp Thr Lys Ala Pro Glu Pro Pro Thr Gly
2885 2890 2895
Ala Lys Glu Val Leu Asn Glu Thr Thr Asn Trp Leu Trp Ala Tyr
2900 2905 2910
Leu Ser Arg Glu Lys Arg Pro Arg Leu Cys Thr Lys Glu Glu Phe
2915 2920 2925
Ile Lys Lys Val Asn Ser Asn Ala Ala Leu Gly Ala Val Phe Ala
2930 2935 2940
Glu Gln Asn Gln Trp Ser Thr Ala Arg Glu Ala Val Asp Asp Pro
2945 2950 2955
Arg Phe Trp Glu Met Val Asp Glu Glu Arg Glu Asn His Leu Arg
2960 2965 2970
Gly Glu Cys His Thr Cys Ile Tyr Asn Met Met Gly Lys Arg Glu
2975 2980 2985
Lys Lys Pro Gly Glu Phe Gly Lys Ala Lys Gly Ser Arg Ala Ile
2990 2995 3000
Trp Phe Met Trp Leu Gly Ala Arg Tyr Leu Glu Phe Glu Ala Leu
3005 3010 3015
Gly Phe Leu Asn Glu Asp His Trp Leu Ser Arg Glu Asn Ser Gly
3020 3025 3030
Gly Gly Val Glu Gly Ser Gly Val Gln Lys Leu Gly Tyr Ile Leu
3035 3040 3045
Arg Asp Ile Ala Gly Lys Gln Gly Gly Lys Met Tyr Ala Asp Asp
3050 3055 3060
Thr Ala Gly Trp Asp Thr Arg Ile Thr Arg Thr Asp Leu Glu Asn
3065 3070 3075
Glu Ala Lys Val Leu Glu Leu Leu Asp Gly Glu His Arg Met Leu
3080 3085 3090
Ala Arg Ala IleIle Glu Leu Thr Tyr Arg His Lys Val Val Lys
3095 3100 3105
Val Met Arg Pro Ala Ala Glu Gly Lys Thr Val Met Asp Val Ile
3110 3115 3120
Ser Arg Glu Asp Gln Arg Gly Ser Gly Gln Val Val Thr Tyr Ala
3125 3130 3135
Leu Asn Thr Phe Thr Asn Ile Ala Val Gln Leu Val Arg Leu Met
3140 3145 3150
Glu Ala Glu Gly Val Ile Gly Pro Gln His Leu Glu Gln Leu Pro
3155 3160 3165
Arg Lys Asn Lys Ile Ala Val Arg Thr Trp Leu Phe Glu Asn Gly
3170 3175 3180
Glu Glu Arg Val Ser Arg Met Ala Ile Ser Gly Asp Asp Cys Val
3185 3190 3195
Val Lys Pro Leu Asp Asp Arg Phe Ala Thr Ala Leu His Phe Leu
3200 3205 3210
Asn Ala Met Ser Lys Val Arg Lys Asp Ile Gln Glu Trp Lys Pro
3215 3220 3225
Ser His Gly Trp His Asp Trp Gln Gln Val Pro Phe Cys Ser Asn
3230 3235 3240
His Phe Gln Glu Ile Val Met Lys Asp Gly Arg Ser Ile Val Val
3245 3250 3255
Pro Cys Arg Gly Gln Asp Glu Leu Ile Gly Arg Ala Arg Ile Ser
3260 3265 3270
Pro Gly Ala Gly Trp Asn Val Lys Asp Thr Ala Cys Leu Ala Lys
3275 3280 3285
Ala Tyr Ala Gln Met Trp Leu Leu Leu Tyr Phe His Arg Arg Asp
3290 3295 3300
Leu Arg Leu Met Ala Asn Ala Ile Cys Ser Ala Val Pro Val Asp
3305 3310 3315
Trp Val Pro Thr Gly Arg Thr Ser Trp Ser Ile His Ser Lys Gly
3320 3325 3330
Glu Trp Met Thr Thr Glu Asp Met Leu Gln Val Trp Asn Arg Val
3335 3340 3345
Trp Ile Glu Glu Asn Glu Trp Met Val Asp Lys Thr Pro Ile Thr
3350 3355 3360
Ser Trp Thr Asp Val Pro Tyr Val Gly Lys Arg Glu Asp Ile Trp
3365 3370 3375
Cys Gly Ser Leu Ile Gly Thr Arg Ser Arg Ala Thr Trp Ala Glu
3380 3385 3390
Asn Ile Tyr Ala Ala Ile Asn Gln Val Arg Ala Val Ile Gly Lys
3395 3400 3405
Glu Asn Tyr Val Asp Tyr Met Thr Set Leu Arg Arg Tyr Glu Asp
3410 3415 3420
Val Leu Ile Gln Glu Asp Arg Val Ile
3425 3430
<210>5
<211>34
<212>DNA
<213>日本脑炎病毒
<400>5
agaagtttat ctgtgtgaac ttcttggctt agta 34
<210>6
<211>35
<212>DNA
<213>日本脑炎病毒
<400>6
gcaaagagaa tgctttttcc ccatgctttc cagcc 35
<210>7
<211>35
<212>DNA
<213>日本脑炎病毒
<400>7
tcctgctcaa agagaatgca gtggacctca gtgtg 35
<210>8
<211>35
<212>DNA
<213>日本脑炎病毒
<400>8
gtcatgaaga tggctgctgc ctcccctaat tccac 35
<210>9
<211>36
<212>DNA
<213>日本脑炎病毒
<400>9
actgatgtca ccgaacagag tgcccaacta caacct 36
<210>10
<211>35
<212>DNA
<213>日本脑炎病毒
<400>10
ttttctataa ccttgggcat gtaagggcag agaac 35
<210>11
<211>35
<212>DNA
<213>日本脑炎病毒
<400>11
gggaatcctc tccaagtcca gaagtagaag aacaa 35
<210>12
<211>34
<212>DNA
<213>日本脑炎病毒
<400>12
agatcctgtg ttcttcctca ccaccagcta cata 34
<210>13
<211>20
<212>DNA
<213>日本脑炎病毒
<400>13
agaagtttat ctgtgtgaac 20
<210>14
<211>20
<212>DNA
<213>日本脑炎病毒
<400>14
aaggctcaat catgtggctc 20
<210>15
<211>20
<212>DNA
<213>日本脑炎病毒
<400>15
gaaagccaca cggtatctca 20
<210>16
<211>20
<212>DNA
<213>日本脑炎病毒
<400>16
ctgacatctc gacggtggct 20
<210>17
<211>20
<212>DNA
<213>日本脑炎病毒
<400>17
tggactgaac actgaagcgt 20
<210>18
<211>20
<212>DNA
<213>日本脑炎病毒
<400>18
ttgtcattga actatcctac 20
<210>19
<211>20
<212>DNA
<213>日本脑炎病毒
<400>19
gaacactctt tgggggaatg 20
<210>20
<211>20
<212>DNA
<213>日本脑炎病毒
<400>20
tgtggaagtg gcatctttgt 20
<210>21
<211>20
<212>DNA
<213>日本脑炎病毒
<400>21
tgaacaagcc cgtgggaaga 20
<210>22
<211>20
<212>DNA
<213>日本脑炎病毒
<400>22
acttggccag agacacacac 20
<210>23
<211>20
<212>DNA
<213>日本脑炎病毒
<400>23
atggtgaaat ggttgaccct 20
<210>24
<211>20
<212>DNA
<213>日本脑炎病毒
<400>24
agcatggatg attgtccgag 20
<210>25
<211>20
<212>DNA
<213>日本脑炎病毒
<400>25
agaaaggagc tgtactcttg 20
<210>26
<211>20
<212>DNA
<213>日本脑炎病毒
<400>26
cctacgtggt gtcaggaaaa 20
<210>27
<211>20
<212>DNA
<213>日本脑炎病毒
<400>27
ttttccacac actatggcac 20
<210>28
<211>20
<212>DNA
<213>日本脑炎病毒
<400>28
gaccgtcagg aggaaccagt 20
<210>29
<211>19
<212>DNA
<213>日本脑炎病毒
<400>29
aaggtggaat taggggagg 19
<210>30
<211>19
<212>DNA
<213>日本脑炎病毒
<400>30
aagagggaga aggcagagt 19
<210>31
<211>20
<212>DNA
<213>日本脑炎病毒
<400>31
atgccatact ggaggacaac 20
<210>32
<211>20
<212>DNA
<213>日本脑炎病毒
<400>32
gaaagggtat agggaagatg 20
<210>33
<211>20
<212>DNA
<213>日本脑炎病毒
<400>33
caccacatcg ctagcctcaa 20
<210>34
<211>20
<212>DNA
<213>日本脑炎病毒
<400>34
ggtattggtg acggcggcta 20
<210>35
<211>20
<212>DNA
<213>日本脑炎病毒
<400>35
tgtgggcgtg gaggatggag 20
<210>36
<211>20
<212>DNA
<213>日本脑炎病毒
<400>36
ctagtgcgtc tccccctgtc 20
<210>37
<211>20
<212>DNA
<213>日本脑炎病毒
<400>37
ctgacaccac cccttttgga 20
<210>38
<211>19
<212>DNA
<213>日本脑炎病毒
<400>38
gctttggggt tcctgaatg 19
<210>39
<211>19
<212>DNA
<213>日本脑炎病毒
<400>39
agaagatcaa agggggagt 19
<210>40
<211>20
<212>DNA
<213>日本脑炎病毒
<400>40
ccgtgcagag gacaggatga 20
<210>41
<211>18
<212>DNA
<213>日本脑炎病毒
<400>41
tatgcggcga taaaccag 18
<210>42
<211>18
<212>DNA
<213>日本脑炎病毒
<400>42
gcgcagcccc aggaggac 18
<210>43
<211>20
<212>DNA
<213>日本脑炎病毒
<400>43
cgtcaatgag tgttccaagt 20
<210>44
<211>20
<212>DNA
<213>日本脑炎病毒
<400>44
caatccagta cgacaagtca 20
<210>45
<211>20
<212>DNA
<213>日本脑炎病毒
<400>45
tgaccttttt ccccgctctt 20
<210>46
<211>20
<212>DNA
<213>日本脑炎病毒
<400>46
ttcttgattt tctcctgatt 20
<210>47
<211>22
<212>DNA
<213>日本脑炎病毒
<400>47
agatcctgtg ttcttcctca cc 22
<210>48
<211>62
<212>DNA
<213>日本脑炎病毒
<400>48
aaatttaata cgactcacta taagaagttt atctgtgtga acttcttggc ttagtatcgt 60
tg 62
<210>49
<211>45
<212>DNA
<213>日本脑炎病毒
<400>49
cgtcatcaaa agcttcccct ggaaattcga caactttatt gctcc 45
<210>50
<211>46
<212>DNA
<213>日本脑炎病毒
<400>50
cgcaagcttg gcagttgtca tagcttacgc aggagcaata aagttg 46
<210>51
<211>45
<212>DNA
<213>日本脑炎病毒
<400>51
gctccaatct agtgacagat ctgactccgc acacgccttc cttgt 45
<210>52
<211>47
<212>DNA
<213>日本脑炎病毒
<400>52
cacaagaaaa gagatgagat gtggaagtgg catctttgtg cacaacg 47
<210>53
<211>40
<212>DNA
<213>日本脑炎病毒
<400>53
agatcctgtg ttcttcctca ccaccagcta catacttcgg 40
<210>54
<211>45
<212>DNA
<213>日本脑炎病毒
<400>54
ggtaaatacc acgcgttgac cgttggtact gccaaccatc cagcc 45
<210>55
<211>45
<212>DNA
<213>日本脑炎病毒
<400>55
ttcctggcgg cgacacttgg ctggatgctt ggcagtacca acggt 45
<210>56
<211>46
<212>DNA
<213>日本脑炎病毒
<400>56
aagccggagc gaccaacagc aggaggatgg taaataccac gcgttg 46
<210>57
<211>46
<212>DNA
<213>日本脑炎病毒
<400>57
caacgcgtgg tatttaccat cctcctgctg ttggtcgctc cggctt 46
<210>58
<211>62
<212>DNA
<213>日本脑炎病毒
<400>58
aaatttaata cgactcacta taagaagttt atctgtgtga acttcttggc ttagtatcgt 60
tg 62
<210>59
<211>70
<212>DNA
<213>人工的
<220>
<223>源自日本脑炎病毒和西尼罗病毒
<400>59
ccaagaagtc tctgttgctc attccaaggc agttgaaact gtaagccgga gcgaccagca 60
gcaggaggat 70
<210>60
<211>70
<212>DNA
<213>人工的
<220>
<223>源自日本脑炎病毒和西尼罗病毒
<400>60
caccatcctc ctgctgctgg tcgctccggc ttacagtttc aactgccttg gaatgagcaa 60
cagagacttc 70
<210>61
<211>73
<212>DNA
<213>人工的
<220>
<223>源自日本脑炎病毒和西尼罗病毒
<400>61
ctcttttctt gtgatgtcaa tggcacatcc agtgtcagcg tgcacgttca cggagaggaa 60
gagcagaact cct 73
<210>62
<211>75
<212>DNA
<213>人工的
<220>
<223>源自日本脑炎病毒和西尼罗病毒
<400>62
ggaggagttc tgctcttcct ctccgtgaac gtgcacgctg acactggatg tgccattgac 60
atcacaagaa aagag 75
<210>63
<211>40
<212>DNA
<213>日本脑炎病毒
<400>63
agatcctgtg ttcttcctca ccaccagcta catacttcgg 40
机译: 以减毒日本脑炎病毒基因为骨架的减毒嵌合黄病毒
机译: 以减毒日本脑炎病毒基因为骨架的减毒嵌合黄病毒
机译: 以减毒的日本脑炎病毒基因为骨架的减毒嵌合黄病毒