Homology
BLAST of HG10019700 vs. NCBI nr
Match:
XP_038903869.1 (thyroid adenoma-associated protein homolog [Benincasa hispida])
HSP 1 Score: 4065.4 bits (10542), Expect = 0.0e+00
Identity = 2044/2196 (93.08%), Postives = 2107/2196 (95.95%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNSFQYPSKFFTELLQLVSLNSVYAQVNHAKK 60
MSAKWRALQHRHRYTYSAIVFP S+VDSLNSFQ SKFFTELLQLVSLNSVYAQVNHAKK
Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPHSFVDSLNSFQSSSKFFTELLQLVSLNSVYAQVNHAKK 60
Query: 61 VASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDRLGECFRN 120
VASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQD LGECFRN
Sbjct: 61 VASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDSLGECFRN 120
Query: 121 LCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDTVSSLDSVVKET 180
LCEEHSG+ QGGEKRFCVSRVALSVMGMPKLGYLVD+I+DCAILVARD VSSLDSVVKET
Sbjct: 121 LCEEHSGLQQGGEKRFCVSRVALSVMGMPKLGYLVDVIKDCAILVARDIVSSLDSVVKET 180
Query: 181 NESARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDPSVMGMIVRTILSILKSLAFSRDCY 240
NESARPSPI+MEQCQEALSCLYYLLQRFPSKFQED SVMGMIV TILSILKSLAFSRDCY
Sbjct: 181 NESARPSPIIMEQCQEALSCLYYLLQRFPSKFQEDSSVMGMIVSTILSILKSLAFSRDCY 240
Query: 241 VAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFRNAIAKVPHQANVCTE 300
VAAGVSFCASLQVCLNSAELG+LIFYGIFEQTNHI+FLKYESEF+NA+ KVPHQANVC+E
Sbjct: 241 VAAGVSFCASLQVCLNSAELGVLIFYGIFEQTNHISFLKYESEFKNAVLKVPHQANVCSE 300
Query: 301 IQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNSGNSVKTILYDGILPE 360
IQTFSVLSRLCLIRGILTAIPRPVLNIPF MMEG+LNGHPGCLNSG+SVKT+LYDGILPE
Sbjct: 301 IQTFSVLSRLCLIRGILTAIPRPVLNIPFYMMEGDLNGHPGCLNSGDSVKTVLYDGILPE 360
Query: 361 LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILRIMWT 420
LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILRIMWT
Sbjct: 361 LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILRIMWT 420
Query: 421 NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGSRCKGRYVP 480
NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIK YL+KIAFDLLRLGSRCKGRYVP
Sbjct: 421 NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKSYLRKIAFDLLRLGSRCKGRYVP 480
Query: 481 LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG 540
LASLTKRLGAKALLDMSPSLL ETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG
Sbjct: 481 LASLTKRLGAKALLDMSPSLLLETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG 540
Query: 541 GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPSSRDN 600
GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPSS DN
Sbjct: 541 GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPSSSDN 600
Query: 601 GVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSVNEIEYFS 660
GVLYPG+ QGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSL+++SV+EIEYFS
Sbjct: 601 GVLYPGNNQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLDEKSVHEIEYFS 660
Query: 661 CHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLLKKAV 720
CHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLLKKA+
Sbjct: 661 CHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLLKKAI 720
Query: 721 PLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDLPNGSEQII 780
PLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLAS CNSE+ LPNGSE II
Sbjct: 721 PLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASSCNSENYLPNGSEHII 780
Query: 781 TGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSSETLLL 840
GRADDL HFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLN+WS+VPSKEK +E LLL
Sbjct: 781 AGRADDLFHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNIWSIVPSKEKFNENLLL 840
Query: 841 PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITWAKML 900
PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITWAKML
Sbjct: 841 PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITWAKML 900
Query: 901 VCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLNKLTNVGEELKKSNHPV 960
VCSSRVRESDAGALALRLVFRKYVLDLGW+V+ASG VVCLDSL KL NV EE+KKSNHPV
Sbjct: 901 VCSSRVRESDAGALALRLVFRKYVLDLGWMVKASGVVVCLDSLKKLPNVDEEIKKSNHPV 960
Query: 961 AEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNSDVVLSSILEMR 1020
AEYLKSLIDWL+ISV+EGERNL EACKNSFVHGVLLTLRY FEELDWNSDVVLSSI EMR
Sbjct: 961 AEYLKSLIDWLNISVTEGERNLSEACKNSFVHGVLLTLRYTFEELDWNSDVVLSSISEMR 1020
Query: 1021 SLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPDEADVSTSLSELED 1080
SLLEKLLEL MRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVP+EADVS SLSE+E
Sbjct: 1021 SLLEKLLELVMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPNEADVSMSLSEMEY 1080
Query: 1081 SKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPIASIISRQQ 1140
S+EKTTVN RTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDP ASIISRQ+
Sbjct: 1081 SEEKTTVNLRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPSASIISRQE 1140
Query: 1141 EVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQ 1200
EVLDL+QLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQ
Sbjct: 1141 EVLDLRQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQ 1200
Query: 1201 LMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP 1260
LMERT AKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP
Sbjct: 1201 LMERTIAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP 1260
Query: 1261 IDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRAAFND 1320
IDTNHKNSNFSKLPST LGQ+TESV HETYPSEKASKIRDEGVIPTVHAFNVLRAAFND
Sbjct: 1261 IDTNHKNSNFSKLPSTGLGQETESVLPHETYPSEKASKIRDEGVIPTVHAFNVLRAAFND 1320
Query: 1321 TNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL 1380
TNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL
Sbjct: 1321 TNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL 1380
Query: 1381 TGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSRLKPS 1440
TGLEFFHRYPALHRFLL+EL+VATESLDDG SGNSKFNLANVVHPSLCPMLILLSRL+PS
Sbjct: 1381 TGLEFFHRYPALHRFLLEELEVATESLDDGYSGNSKFNLANVVHPSLCPMLILLSRLRPS 1440
Query: 1441 TIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIASGLPV 1500
TIASE GDDLDPFL MPFIRKCSSQSNLRIRILASRALTGLVSNENL VILNIA+GLPV
Sbjct: 1441 TIASEAGDDLDPFLFMPFIRKCSSQSNLRIRILASRALTGLVSNENLPSVILNIAAGLPV 1500
Query: 1501 DDNAMMAHE-----DATATTQHASYNRIHGILLQLISLLDTNCRNLADILKKSQVLNDLA 1560
DD MMA E DAT T QHASYNRIHGILLQLISLLDTNCRNLADI KKSQ+L+DLA
Sbjct: 1501 DDITMMARESSISLDATETPQHASYNRIHGILLQLISLLDTNCRNLADISKKSQILSDLA 1560
Query: 1561 EVLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLD 1620
EVLARCSWMARRR CSCPILSTS+LRVLG MLSIVRTCPRSKSFYIIRNLLLDLSTECLD
Sbjct: 1561 EVLARCSWMARRRCCSCPILSTSVLRVLGDMLSIVRTCPRSKSFYIIRNLLLDLSTECLD 1620
Query: 1621 VEASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAALQKSQRSQSDEDVPATLMDY 1680
VEASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDA LQ SQRS+ EDVPATLMDY
Sbjct: 1621 VEASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDADLQTSQRSRFAEDVPATLMDY 1680
Query: 1681 PFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGLYDLSCHEIRTVDHWIKTNL 1740
PFSQLQERLIRSLQDPCYEVRLSTMKWLFKFL STEYS GLYDLSCHEIRTV HWIKTNL
Sbjct: 1681 PFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLNSTEYSAGLYDLSCHEIRTVYHWIKTNL 1740
Query: 1741 QALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECTEEVVYIGKMDCGSVLQFWD 1800
Q LLTELLSLEKNYRCLYYILKN+FAWN+SQFQKFGNEECTEEVVYIGKMDCGSVLQFWD
Sbjct: 1741 QTLLTELLSLEKNYRCLYYILKNIFAWNISQFQKFGNEECTEEVVYIGKMDCGSVLQFWD 1800
Query: 1801 KLISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDATTTESPNDRISNNLDEFHS 1860
KLISLYKLT+HAKTRENTIRCMGTCIKRLA+QYSACI+SDAT TES NDRIS+NL +FHS
Sbjct: 1801 KLISLYKLTKHAKTRENTIRCMGTCIKRLAMQYSACIVSDATVTESSNDRISDNLAKFHS 1860
Query: 1861 CITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDFVFDNQIPRDTSKSHFE 1920
CITLFTDLIRQHSAASEPVNMR AAADSIIASGLLEQAEIFGDFVF+NQIP +T+ S FE
Sbjct: 1861 CITLFTDLIRQHSAASEPVNMRMAAADSIIASGLLEQAEIFGDFVFNNQIPLETANSRFE 1920
Query: 1921 QREYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFSSERTTTSSDVPNQVEQV 1980
QREYVNMYAHQILN+WSTCIMLLEDEDDDIRKRLAADVQKCFSSERT PNQVEQV
Sbjct: 1921 QREYVNMYAHQILNIWSTCIMLLEDEDDDIRKRLAADVQKCFSSERT------PNQVEQV 1980
Query: 1981 IGSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISPVDPVRRVFDKEIDNHHEEKLLISQ 2040
IGSSFEYLSSIFGHWVLYF+YLANWVLN+ NYT+S DPVRRVFDKEIDNHHEEKLLISQ
Sbjct: 1981 IGSSFEYLSSIFGHWVLYFNYLANWVLNTANYTVSAADPVRRVFDKEIDNHHEEKLLISQ 2040
Query: 2041 TCCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMNKHGGFDWIGGAGN 2100
TCCLHMEKLSRSKLVAL DT+WFINYLV LRKRFF QLIKFSDEHMNKHGGFDWIGGAGN
Sbjct: 2041 TCCLHMEKLSRSKLVALLDTEWFINYLVSLRKRFFRQLIKFSDEHMNKHGGFDWIGGAGN 2100
Query: 2101 HKDAFLLLYSNLLGFYALSNCIVNGKTKVTMQPLVTEVVEIGKIINPFLRNPLISNLYLL 2160
HKDAFL LY NLLGF+ALSNCIVNGKT+VTMQPLV EVVEIGKII PFLRNPLISNLY L
Sbjct: 2101 HKDAFLPLYLNLLGFFALSNCIVNGKTEVTMQPLVAEVVEIGKIIIPFLRNPLISNLYWL 2160
Query: 2161 VIRIHKEAIDINRDHKIPELGHEAIWEGFDPYFLLR 2192
VIRIHKEA+ +NRDH+I ELGHEAIWEGFDPYFLLR
Sbjct: 2161 VIRIHKEALYVNRDHRILELGHEAIWEGFDPYFLLR 2190
BLAST of HG10019700 vs. NCBI nr
Match:
XP_008443417.1 (PREDICTED: uncharacterized protein LOC103487009 [Cucumis melo] >KAA0053755.1 thyroid adenoma-associated protein-like protein isoform X1 [Cucumis melo var. makuwa] >TYK25648.1 thyroid adenoma-associated protein-like protein isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 4000.3 bits (10373), Expect = 0.0e+00
Identity = 2000/2196 (91.07%), Postives = 2086/2196 (94.99%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNSFQYPSKFFTELLQLVSLNSVYAQVNHAKK 60
MSAKWRALQHRHRYTYSAIVFP S+VDSLNSF+ SKFFTELLQLVSLNSVYAQVNHAKK
Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPNSFVDSLNSFRSSSKFFTELLQLVSLNSVYAQVNHAKK 60
Query: 61 VASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDRLGECFRN 120
VASAFSELLANGDEDSV KAARFYLEVLF ENSQPLHRTLVSTLAKSRKFQD LGECFR+
Sbjct: 61 VASAFSELLANGDEDSVFKAARFYLEVLFFENSQPLHRTLVSTLAKSRKFQDPLGECFRD 120
Query: 121 LCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDTVSSLDSVVKET 180
LCE+HSG+LQGGEKRFCVSRVALSVMGMPKLGYLVD+I+DCA+LVARD VSSLD VVKET
Sbjct: 121 LCEKHSGVLQGGEKRFCVSRVALSVMGMPKLGYLVDVIKDCALLVARDIVSSLDYVVKET 180
Query: 181 NESARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDPSVMGMIVRTILSILKSLAFSRDCY 240
NESARPSPI+MEQCQEALSCLYYLLQRFP+KFQED V+ MI+ +ILSILKSLAFSRDCY
Sbjct: 181 NESARPSPIIMEQCQEALSCLYYLLQRFPAKFQEDSDVLRMILSSILSILKSLAFSRDCY 240
Query: 241 VAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFRNAIAKVPHQANVCTE 300
VAAGVSFCASLQVCLNS ELG+LIFYGI EQTNHI+FLKY+SEFRN + KVPHQANVC E
Sbjct: 241 VAAGVSFCASLQVCLNSEELGVLIFYGILEQTNHISFLKYDSEFRNTVGKVPHQANVCAE 300
Query: 301 IQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNSGNSVKTILYDGILPE 360
I+TFSVLSRLCLIRGILTAIPRPVLNI FSM+EG+ NGHPGCLNSGNSVKTILYDGILPE
Sbjct: 301 IRTFSVLSRLCLIRGILTAIPRPVLNIQFSMVEGDSNGHPGCLNSGNSVKTILYDGILPE 360
Query: 361 LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILRIMWT 420
LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRIL IMWT
Sbjct: 361 LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILSIMWT 420
Query: 421 NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGSRCKGRYVP 480
NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYL+KIAFD+LRLGSRCKGRYVP
Sbjct: 421 NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLRKIAFDILRLGSRCKGRYVP 480
Query: 481 LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG 540
LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG
Sbjct: 481 LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG 540
Query: 541 GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPSSRDN 600
GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFE+DLDSIFPMLAFISVWPSSRDN
Sbjct: 541 GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEVDLDSIFPMLAFISVWPSSRDN 600
Query: 601 GVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSVNEIEYFS 660
G+LYP QGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQS +EIEYFS
Sbjct: 601 GILYPSINQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSFHEIEYFS 660
Query: 661 CHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLLKKAV 720
+ALV VKGVKVEILVEWLLLALTHVDE+LRVDAAEFLFLNPKTSSLPSHLELTLLKKA+
Sbjct: 661 RYALVSVKGVKVEILVEWLLLALTHVDETLRVDAAEFLFLNPKTSSLPSHLELTLLKKAI 720
Query: 721 PLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDLPNGSEQII 780
PLNMRC+STAFQMKWSSLFRKFFSRVRTALER+FKLGNWIPLASCCNSES +PNGSEQI+
Sbjct: 721 PLNMRCTSTAFQMKWSSLFRKFFSRVRTALERKFKLGNWIPLASCCNSESYMPNGSEQIV 780
Query: 781 TGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSSETLLL 840
GRADDL FMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWS+VPSKEKS+ETLLL
Sbjct: 781 AGRADDLFQFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSIVPSKEKSNETLLL 840
Query: 841 PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITWAKML 900
PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVI WAK+L
Sbjct: 841 PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVIKWAKVL 900
Query: 901 VCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLNKLTNVGEELKKSNHPV 960
VCSSRVRESDAGALALRLVFRKYVLDLGWIVRAS VVCLDSLNKL NV EE+ KSNHPV
Sbjct: 901 VCSSRVRESDAGALALRLVFRKYVLDLGWIVRASDAVVCLDSLNKLPNVREEICKSNHPV 960
Query: 961 AEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNSDVVLSSILEMR 1020
+EYLKSLIDWL++SV+EGE NL EACKNSFVHGVLLTLRY+FEELDWNSDVVLSSI EMR
Sbjct: 961 SEYLKSLIDWLNVSVTEGEMNLSEACKNSFVHGVLLTLRYSFEELDWNSDVVLSSISEMR 1020
Query: 1021 SLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPDEADVSTSLSELED 1080
SLLEKLLEL MRITSLALWVVSADAW+LPEDM DMVDDDAF+LDVPDE +VSTSLSELED
Sbjct: 1021 SLLEKLLELVMRITSLALWVVSADAWHLPEDMGDMVDDDAFVLDVPDETNVSTSLSELED 1080
Query: 1081 SKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPIASIISRQQ 1140
SKEKTT NSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDS+E DP SI+ RQ+
Sbjct: 1081 SKEKTTDNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSIEFDPNDSIMPRQE 1140
Query: 1141 EVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQ 1200
EVLD+KQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCS+DQRLCKLTESWMDQ
Sbjct: 1141 EVLDVKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSDDQRLCKLTESWMDQ 1200
Query: 1201 LMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP 1260
LMERT+A+GQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP
Sbjct: 1201 LMERTTARGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP 1260
Query: 1261 IDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRAAFND 1320
I+ + KNSNFSKLPST L QDT+ +S HE YPSEKASKIRDEGVIPTVHAFNVLRAAFND
Sbjct: 1261 IERDCKNSNFSKLPSTGLSQDTKPISTHENYPSEKASKIRDEGVIPTVHAFNVLRAAFND 1320
Query: 1321 TNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL 1380
TNLATDTSGFSAQAIIV IRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL
Sbjct: 1321 TNLATDTSGFSAQAIIVCIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL 1380
Query: 1381 TGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSRLKPS 1440
TGLEFFHRYPALHRFLL+EL+VATESLDDGCSG+SKFNLA +VHPSLCPMLILLSRLKPS
Sbjct: 1381 TGLEFFHRYPALHRFLLEELEVATESLDDGCSGDSKFNLAKIVHPSLCPMLILLSRLKPS 1440
Query: 1441 TIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIASGLPV 1500
TIASE GDDLDPFL MPFIRKCSSQSNLRIRILASRALTGLVSNENL VILNIASGLPV
Sbjct: 1441 TIASEAGDDLDPFLFMPFIRKCSSQSNLRIRILASRALTGLVSNENLPSVILNIASGLPV 1500
Query: 1501 DDNAMMAHED----ATATTQHASYNRIHGILLQLISLLDTNCRNLADILKKSQVLNDLAE 1560
DDN M E ATATTQH SYNRIHGILLQLISLLDTNCRNL DI KK ++LNDL E
Sbjct: 1501 DDNTTMGCESSILLATATTQHTSYNRIHGILLQLISLLDTNCRNLGDISKKRRILNDLVE 1560
Query: 1561 VLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLDV 1620
VLA CSWMAR CSCPILSTS+L+VLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLDV
Sbjct: 1561 VLAHCSWMARTSHCSCPILSTSMLQVLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLDV 1620
Query: 1621 EASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAALQKSQRSQSDEDVPATLMDYP 1680
E SH+LSYYDPT+AELRQQA+ICYFNCV QPFDEEDDAALQKSQRSQSDEDVP TL DY
Sbjct: 1621 ETSHKLSYYDPTLAELRQQAAICYFNCVLQPFDEEDDAALQKSQRSQSDEDVPGTLRDYS 1680
Query: 1681 FSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGLYDLSCHEIRTVDHWIKTNLQ 1740
FSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYS G YDLSCHEIRTVD WIKTNLQ
Sbjct: 1681 FSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSAGSYDLSCHEIRTVDQWIKTNLQ 1740
Query: 1741 ALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECTEEVVYIGKMDCGSVLQFWDK 1800
+LLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEEC E+VVYIGKMDC SVLQFWDK
Sbjct: 1741 SLLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECAEDVVYIGKMDCESVLQFWDK 1800
Query: 1801 LISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDATTTESPNDRISNNLDEFHSC 1860
LISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACI+SDATT ESPN RISNNLD++HSC
Sbjct: 1801 LISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIVSDATTIESPNGRISNNLDKYHSC 1860
Query: 1861 ITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDFVFDNQIPRDTSKSHFEQ 1920
ITLFTDLI+QHSAASEPVNMRTAAADSIIASGLLEQAEIFGD+VFDNQIP+ T+ SH E
Sbjct: 1861 ITLFTDLIKQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDYVFDNQIPQATANSHCEH 1920
Query: 1921 REYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFSSERTTTSSDVPNQVEQVI 1980
REY NMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCF ERTTTSSDVPNQVEQVI
Sbjct: 1921 REYANMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFRLERTTTSSDVPNQVEQVI 1980
Query: 1981 GSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISPVDPVRRVFDKEIDNHHEEKLLISQT 2040
GSSFEYLSSIFGHWVLYFDYLANWVLN+ NYTISP DPVRRVFDKEIDNHHEEKLLI QT
Sbjct: 1981 GSSFEYLSSIFGHWVLYFDYLANWVLNTANYTISPADPVRRVFDKEIDNHHEEKLLICQT 2040
Query: 2041 CCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMNKHGGFDWIGGAGNH 2100
CC HMEKLS S+L+ALWDTQWF+NYLV LRKRFFHQLI+FSDE+M+KH GFDWIGGAGNH
Sbjct: 2041 CCFHMEKLSSSRLIALWDTQWFMNYLVSLRKRFFHQLIRFSDEYMSKHSGFDWIGGAGNH 2100
Query: 2101 KDAFLLLYSNLLGFYALSNCIVNGKTK-VTMQPLVTEVVEIGKIINPFLRNPLISNLYLL 2160
KDAFL LY+NLLGF A+SNCIVNGK+K VTMQP VTEVVEIGKIINPFLRNPLISNLYLL
Sbjct: 2101 KDAFLPLYTNLLGFCAISNCIVNGKSKVVTMQPFVTEVVEIGKIINPFLRNPLISNLYLL 2160
Query: 2161 VIRIHKEAIDINRDHKIPELGHEAIWEGFDPYFLLR 2192
VIRIHKEAID+NRDH IPE GHEAIWEGFDPYFLLR
Sbjct: 2161 VIRIHKEAIDVNRDHNIPEHGHEAIWEGFDPYFLLR 2196
BLAST of HG10019700 vs. NCBI nr
Match:
XP_031737854.1 (LOW QUALITY PROTEIN: thyroid adenoma-associated protein homolog [Cucumis sativus])
HSP 1 Score: 3998.0 bits (10367), Expect = 0.0e+00
Identity = 1999/2197 (90.99%), Postives = 2084/2197 (94.86%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNSFQYPSKFFTELLQLVSLNSVYAQVNHAKK 60
MSAKWRALQHRHRYTYSAIVFP SYVDSLNSFQ SKFFTELLQLVSLNSVYAQVNHAKK
Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPNSYVDSLNSFQSSSKFFTELLQLVSLNSVYAQVNHAKK 60
Query: 61 VASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDRLGECFRN 120
VASAFSELLANGDEDSVSKAARFYLEVLF ENSQPLHRTLVSTLAKSRKF D LGECFR+
Sbjct: 61 VASAFSELLANGDEDSVSKAARFYLEVLFFENSQPLHRTLVSTLAKSRKFHDPLGECFRD 120
Query: 121 LCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDTVSSLDSVVKET 180
LCEEHSG+LQGGEKRFCVSRVALSVMGMPKLGYLVD+I+DCA+LVARD VSSLD VVKET
Sbjct: 121 LCEEHSGVLQGGEKRFCVSRVALSVMGMPKLGYLVDVIKDCALLVARDIVSSLDYVVKET 180
Query: 181 NESARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDPSVMGMIVRTILSILKSLAFSRDCY 240
NESARPSPI+MEQCQEALSCLYYLLQRFPSKFQED V+GMIV +ILSILKSLAFSRDCY
Sbjct: 181 NESARPSPIIMEQCQEALSCLYYLLQRFPSKFQEDFGVLGMIVSSILSILKSLAFSRDCY 240
Query: 241 VAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFRNAIAKVPHQANVCTE 300
VAAGVSFCASLQVCLNS ELG+LIFYGI EQTNHI FLKY+SEFRN + KVPHQANVC E
Sbjct: 241 VAAGVSFCASLQVCLNSEELGVLIFYGILEQTNHIPFLKYDSEFRNTVGKVPHQANVCAE 300
Query: 301 IQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNSGNSVKTILYDGILPE 360
I+TFSVLSRLCLIRGILTAIPRPVLNIPFSM+EG+ NGHPGCLNSGNSVKTILYDGILPE
Sbjct: 301 IRTFSVLSRLCLIRGILTAIPRPVLNIPFSMVEGDSNGHPGCLNSGNSVKTILYDGILPE 360
Query: 361 LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILRIMWT 420
LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRIL IMWT
Sbjct: 361 LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILSIMWT 420
Query: 421 NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGSRCKGRYVP 480
NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKL+L+KIAFDLLRLGSRCKGRYVP
Sbjct: 421 NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLFLRKIAFDLLRLGSRCKGRYVP 480
Query: 481 LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG 540
LASLTKRLGAK LLDMSPSLLSETVQAYIDDDVCCA TSFLKCFLEHLRDECWSSDGIEG
Sbjct: 481 LASLTKRLGAKTLLDMSPSLLSETVQAYIDDDVCCAVTSFLKCFLEHLRDECWSSDGIEG 540
Query: 541 GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPSSRDN 600
GYA+YRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFE+DLDSIFPMLAFISVWPSSRDN
Sbjct: 541 GYAIYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEVDLDSIFPMLAFISVWPSSRDN 600
Query: 601 GVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSVNEIEYFS 660
G+LYPG+ QGSMELRVEQ+VAIFISLLKVSRSLALIEGDIDWLEKPSLEQQS++EIEYFS
Sbjct: 601 GILYPGNNQGSMELRVEQRVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSIHEIEYFS 660
Query: 661 CHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLLKKAV 720
+ALVFVKGVKVEILVEWLLLALTHVDE+LRVDAAEFLFLNPKTSSLPSHLELTLLKKA+
Sbjct: 661 RYALVFVKGVKVEILVEWLLLALTHVDETLRVDAAEFLFLNPKTSSLPSHLELTLLKKAI 720
Query: 721 PLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDLPNGSEQII 780
PLNMRC+STAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCN ES +PNG+EQI+
Sbjct: 721 PLNMRCTSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNRESYMPNGNEQIV 780
Query: 781 TGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSSETLLL 840
GRADDL FMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWS+VPSKEK +ETLLL
Sbjct: 781 AGRADDLFQFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSIVPSKEKCNETLLL 840
Query: 841 PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITWAKML 900
PYNEGITLPDSVLLLV SIIDSWDRLRENSFRILLHFPTPLPGIS EYMVGK+I WAK+L
Sbjct: 841 PYNEGITLPDSVLLLVVSIIDSWDRLRENSFRILLHFPTPLPGISGEYMVGKIIKWAKVL 900
Query: 901 VCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLNKLTNVGEELKKSNHPV 960
VCSSRVRESDAGALALRLVFRKYVLDLGWIVRAS VVCLDS+NKL NVG+E+ KSNHPV
Sbjct: 901 VCSSRVRESDAGALALRLVFRKYVLDLGWIVRASDAVVCLDSVNKLPNVGKEMCKSNHPV 960
Query: 961 AEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNSDVVLSSILEMR 1020
AEYLKSLIDWL++SV+EGE NL EACKNSFVHGVLLTLRY+FEELDWNSDVVLSSI EMR
Sbjct: 961 AEYLKSLIDWLNVSVTEGEINLSEACKNSFVHGVLLTLRYSFEELDWNSDVVLSSISEMR 1020
Query: 1021 SLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPDEADVSTSLSELED 1080
SLLEKLLEL MRITSLALWVVSADAW+LPEDMDDMVDDDAF+LDVPDE ++STS SELED
Sbjct: 1021 SLLEKLLELVMRITSLALWVVSADAWHLPEDMDDMVDDDAFVLDVPDETNMSTSFSELED 1080
Query: 1081 SKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAA-SDSVESDPIASIISRQ 1140
SKEKTT NSRTSEQ VMVGCWLAMKEVSLLLGTITRKVPLPAA SDS ESDP SI+ RQ
Sbjct: 1081 SKEKTTDNSRTSEQTVMVGCWLAMKEVSLLLGTITRKVPLPAAXSDSFESDPNDSIMPRQ 1140
Query: 1141 QEVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMD 1200
+EVLD+KQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCS+DQRLCKLTESWMD
Sbjct: 1141 EEVLDVKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSDDQRLCKLTESWMD 1200
Query: 1201 QLMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQN 1260
QLMERT+AKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQN
Sbjct: 1201 QLMERTTAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQN 1260
Query: 1261 PIDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRAAFN 1320
PI+T+ KNSNFSKLPST L QDTE +S HETYPSEKASKIRDEGVIPTVHAFNVLRAAFN
Sbjct: 1261 PIETDCKNSNFSKLPSTGLSQDTEPISTHETYPSEKASKIRDEGVIPTVHAFNVLRAAFN 1320
Query: 1321 DTNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRA 1380
DTNLATDTSGFSAQAIIV IRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRA
Sbjct: 1321 DTNLATDTSGFSAQAIIVCIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRA 1380
Query: 1381 LTGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSRLKP 1440
LTGLEFFHRYPALHRFLL EL VATESLDDGCSG+SK NLA VVHPSLCPMLILLSRLKP
Sbjct: 1381 LTGLEFFHRYPALHRFLLQELDVATESLDDGCSGDSKSNLAKVVHPSLCPMLILLSRLKP 1440
Query: 1441 STIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIASGLP 1500
STI SE GDDLDPFL MPFIRKCSSQSNLRIRILASRALTGLVSNENL VILNIASGLP
Sbjct: 1441 STIGSEAGDDLDPFLFMPFIRKCSSQSNLRIRILASRALTGLVSNENLPSVILNIASGLP 1500
Query: 1501 VDDNAMMAHED----ATATTQHASYNRIHGILLQLISLLDTNCRNLADILKKSQVLNDLA 1560
VDD+ M E AT TTQ+ SYNRIHGILLQLISLLD NCRNL DILKKSQ+LNDL
Sbjct: 1501 VDDSTTMGRESSILLATTTTQYTSYNRIHGILLQLISLLDINCRNLGDILKKSQILNDLV 1560
Query: 1561 EVLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLD 1620
EVLA CSWMAR CSCPILSTS+L+VLGHMLSIVR CPRSKSFY+IRNLLLDLST CLD
Sbjct: 1561 EVLAHCSWMARSSHCSCPILSTSMLQVLGHMLSIVRKCPRSKSFYVIRNLLLDLSTGCLD 1620
Query: 1621 VEASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAALQKSQRSQSDEDVPATLMDY 1680
VE SH+L YYDPT+AELRQQA+ICYFNCV QPFDEEDDA LQKSQRSQSDEDVPATLMDY
Sbjct: 1621 VETSHKLPYYDPTLAELRQQAAICYFNCVLQPFDEEDDATLQKSQRSQSDEDVPATLMDY 1680
Query: 1681 PFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGLYDLSCHEIRTVDHWIKTNL 1740
PFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYS GLYDLSCHEIRTVD WIKTNL
Sbjct: 1681 PFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSAGLYDLSCHEIRTVDQWIKTNL 1740
Query: 1741 QALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECTEEVVYIGKMDCGSVLQFWD 1800
QALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGN ECTE+VVYIGKMDCGSVLQFWD
Sbjct: 1741 QALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNGECTEDVVYIGKMDCGSVLQFWD 1800
Query: 1801 KLISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDATTTESPNDRISNNLDEFHS 1860
KLISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACI+SDATTTESPN +ISN+LD+FHS
Sbjct: 1801 KLISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIVSDATTTESPNGKISNDLDKFHS 1860
Query: 1861 CITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDFVFDNQIPRDTSKSHFE 1920
CITLFTDLI+QHSAASEPVNMRTAAADSIIASGLLEQAEIFGD+VFDNQIP T SH E
Sbjct: 1861 CITLFTDLIKQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDYVFDNQIPEATVNSHSE 1920
Query: 1921 QREYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFSSERTTTSSDVPNQVEQV 1980
REY NMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQK FS ERT TSSDVPNQVEQV
Sbjct: 1921 LREYANMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKYFSLERTATSSDVPNQVEQV 1980
Query: 1981 IGSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISPVDPVRRVFDKEIDNHHEEKLLISQ 2040
IGSSFEYLSSIFGHWVLYFDYLANWVLN+ +YT+SP DPVRRVFDKEIDNHHEEKLLISQ
Sbjct: 1981 IGSSFEYLSSIFGHWVLYFDYLANWVLNTADYTVSPADPVRRVFDKEIDNHHEEKLLISQ 2040
Query: 2041 TCCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMNKHGGFDWIGGAGN 2100
TCC HMEKLSRSKL+ALWDTQWF+NYLVGLRKRFF QLI+F+DE+M+KH GFDWIGGAGN
Sbjct: 2041 TCCFHMEKLSRSKLIALWDTQWFMNYLVGLRKRFFLQLIRFADEYMSKHSGFDWIGGAGN 2100
Query: 2101 HKDAFLLLYSNLLGFYALSNCIVNGKTK-VTMQPLVTEVVEIGKIINPFLRNPLISNLYL 2160
HKDAFL LY+NLLGFYA+SNCIVNGK+K VTMQPL+TEVVE GKIINPFLRNPLISNLYL
Sbjct: 2101 HKDAFLPLYTNLLGFYAISNCIVNGKSKVVTMQPLITEVVETGKIINPFLRNPLISNLYL 2160
Query: 2161 LVIRIHKEAIDINRDHKIPELGHEAIWEGFDPYFLLR 2192
LV RIH+EAID+NRDH IPE GHEAIWEGFDPYFLLR
Sbjct: 2161 LVTRIHEEAIDVNRDHNIPERGHEAIWEGFDPYFLLR 2197
BLAST of HG10019700 vs. NCBI nr
Match:
XP_022934862.1 (thyroid adenoma-associated protein homolog [Cucurbita moschata])
HSP 1 Score: 3910.1 bits (10139), Expect = 0.0e+00
Identity = 1961/2202 (89.06%), Postives = 2062/2202 (93.64%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNSF----QYPSKFFTELLQLVSLNSVYAQVN 60
MSAKWRALQHRHRYTYSAIVFP SY+DSLNSF Q SKFFTELL+LVSLNSVYAQVN
Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPHSYIDSLNSFQSQHQSSSKFFTELLELVSLNSVYAQVN 60
Query: 61 HAKKVASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDRLGE 120
HAKKVASAFSELLANGDEDSVS+AARFYLEVLFCENSQPLHRTLVSTLAKSRKFQD LGE
Sbjct: 61 HAKKVASAFSELLANGDEDSVSRAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDSLGE 120
Query: 121 CFRNLCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDTVSSLDSV 180
CFRNLCEEHSGM QGG+KRFCVSRVALSVMGMPKLGYLVD+IRDCAILV+RD VSSLDSV
Sbjct: 121 CFRNLCEEHSGMQQGGDKRFCVSRVALSVMGMPKLGYLVDVIRDCAILVSRDIVSSLDSV 180
Query: 181 VKETNESARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDPSVMGMIVRTILSILKSLAFS 240
VKETN+ ARPSPIV+EQCQEALSCLYYLLQRFPSKF ED SVMGMIV TILSILKSLAFS
Sbjct: 181 VKETNDLARPSPIVIEQCQEALSCLYYLLQRFPSKFLEDSSVMGMIVSTILSILKSLAFS 240
Query: 241 RDCYVAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFRNAIAKVPHQAN 300
RDCYV+AGVSFCASLQVCLNS ELG+LIFYGIFEQTNHI+ LKYE+EFRNA+AKVP+QAN
Sbjct: 241 RDCYVSAGVSFCASLQVCLNSEELGVLIFYGIFEQTNHISCLKYENEFRNAVAKVPYQAN 300
Query: 301 VCTEIQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNSGNSVKTILYDG 360
VC EIQTFSVLSRLCLIRGILTAIPRPVLNIPFSM+EG+LNGHPGCL SGNSVKTILYD
Sbjct: 301 VCAEIQTFSVLSRLCLIRGILTAIPRPVLNIPFSMIEGDLNGHPGCLYSGNSVKTILYDA 360
Query: 361 ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR 420
ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR
Sbjct: 361 ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR 420
Query: 421 IMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGSRCKG 480
IMWTNL+DPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEK K YL+KIAFDLL LGSRCKG
Sbjct: 421 IMWTNLEDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKTKSYLRKIAFDLLHLGSRCKG 480
Query: 481 RYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD 540
RYVPLASLTKRLGAKALLDMSPSLLS+TVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD
Sbjct: 481 RYVPLASLTKRLGAKALLDMSPSLLSDTVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD 540
Query: 541 GIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPS 600
GIEGGYALYRG CLPP+L GLGSGISKLRSNLNTYALPVLFEID+DSIFPMLAFISVWPS
Sbjct: 541 GIEGGYALYRGRCLPPILRGLGSGISKLRSNLNTYALPVLFEIDIDSIFPMLAFISVWPS 600
Query: 601 SRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSVNEI 660
+ DNGVLYPG+ +GSMELRVEQKVAIFISL KVSRSLALIEGDIDWLEK SLEQ+ +EI
Sbjct: 601 ACDNGVLYPGNNEGSMELRVEQKVAIFISLFKVSRSLALIEGDIDWLEKRSLEQRFSHEI 660
Query: 661 EYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLL 720
EYF CHALVF+KGVKVEILVEWL+LALTHVDESLRVDAAEF+FLNPKTSSLPSHLELTLL
Sbjct: 661 EYFGCHALVFIKGVKVEILVEWLVLALTHVDESLRVDAAEFIFLNPKTSSLPSHLELTLL 720
Query: 721 KKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDLPNGS 780
KKA+PLNMRCSSTAFQMKW+SLFRKFFSRVRTALERQFK G+WIP AS + ES LPNG+
Sbjct: 721 KKAIPLNMRCSSTAFQMKWTSLFRKFFSRVRTALERQFKQGSWIPRASSSSRESYLPNGN 780
Query: 781 EQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSSE 840
EQ I GRADDL FMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVW+VVPSKEKS+E
Sbjct: 781 EQTIAGRADDLFRFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWAVVPSKEKSNE 840
Query: 841 TLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITW 900
TLLLPYNEGITLPDSVLLLVGSIIDSWD LRE+SFRILLHFPTPLPGISSE+MVG+VI W
Sbjct: 841 TLLLPYNEGITLPDSVLLLVGSIIDSWDSLRESSFRILLHFPTPLPGISSEHMVGEVIAW 900
Query: 901 AKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCL--DSLNKLTNVGEELK 960
AK LVCSSRVRESDAGAL LRLVFRKYVLDLGWIVRASG VVCL DS++KL NVGEE+
Sbjct: 901 AKKLVCSSRVRESDAGALTLRLVFRKYVLDLGWIVRASGDVVCLDSDSVHKLPNVGEEIC 960
Query: 961 KSNHPVAEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNSDVVLS 1020
+SNHPVAEYLKSLIDWL+ISV+EGERNL EACKNSFVHGVLL LRY FEELDW+SD+VLS
Sbjct: 961 RSNHPVAEYLKSLIDWLNISVTEGERNLAEACKNSFVHGVLLALRYTFEELDWSSDIVLS 1020
Query: 1021 SILEMRSLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPDEADVSTS 1080
S+ E+RSLLEKLLEL MRITSLAL VVSADAWYLPEDMDDM DDDAF+LDVPDEAD STS
Sbjct: 1021 SLSEIRSLLEKLLELVMRITSLALGVVSADAWYLPEDMDDMDDDDAFLLDVPDEADASTS 1080
Query: 1081 LSELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPIAS 1140
LSELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDP AS
Sbjct: 1081 LSELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPNAS 1140
Query: 1141 IISRQQEVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLT 1200
II + EVLDL+QLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLT
Sbjct: 1141 IILKHDEVLDLRQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLT 1200
Query: 1201 ESWMDQLMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAE 1260
ESWMDQLMER +A GQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAE
Sbjct: 1201 ESWMDQLMERMTANGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAE 1260
Query: 1261 RLLQNPIDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVL 1320
RLL NPID++ KN NFS ELGQDTESVS HETY SEKASKIRDEGVIPTVHAFNVL
Sbjct: 1261 RLLLNPIDSDCKNRNFS-----ELGQDTESVSPHETYASEKASKIRDEGVIPTVHAFNVL 1320
Query: 1321 RAAFNDTNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRE 1380
RA+FND NLATDTSGFSAQAIIV+IR+FSS YWEVRNSACLAYTALVRRM+GFLNVHKRE
Sbjct: 1321 RASFNDANLATDTSGFSAQAIIVSIRAFSSSYWEVRNSACLAYTALVRRMVGFLNVHKRE 1380
Query: 1381 SARRALTGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILL 1440
SARRALTGLEFFHRYPALHRFLLDELKVAT+SLDDGCSGN++ NLA VVHPSLCP+LILL
Sbjct: 1381 SARRALTGLEFFHRYPALHRFLLDELKVATDSLDDGCSGNAESNLAKVVHPSLCPVLILL 1440
Query: 1441 SRLKPSTIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNI 1500
SRLKPSTI SE GDDLDPFL MPFIRKCSSQSNLRIR+LASRALTGLVSNENL VILNI
Sbjct: 1441 SRLKPSTIVSEAGDDLDPFLFMPFIRKCSSQSNLRIRVLASRALTGLVSNENLPSVILNI 1500
Query: 1501 ASGLPVDDNAMMAHE-----DATATTQHASYNRIHGILLQLISLLDTNCRNLADILKKSQ 1560
ASGLPVDD MMA E D TATT+ +SYN+IHGILLQLISLLDTNCRNLADI KKSQ
Sbjct: 1501 ASGLPVDDTTMMAPESSTVLDVTATTRRSSYNKIHGILLQLISLLDTNCRNLADISKKSQ 1560
Query: 1561 VLNDLAEVLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRSKSFYIIRNLLLDL 1620
+LNDL EVL RCSWMA+RR CSCPIL TS LRVLGHMLSIVRTCPRSKS YIIRNLLLD+
Sbjct: 1561 ILNDLVEVLGRCSWMAKRRHCSCPILGTSFLRVLGHMLSIVRTCPRSKSLYIIRNLLLDV 1620
Query: 1621 STECLDVEASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAALQKSQRSQSDEDVP 1680
STECLD+E H+LS+YDPT+AELRQQA+ICYFNCV QPFDEED AA+QKSQRS+SDEDVP
Sbjct: 1621 STECLDMETYHKLSFYDPTLAELRQQAAICYFNCVLQPFDEEDYAAIQKSQRSESDEDVP 1680
Query: 1681 ATLMDYPFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGLYDLSCHEIRTVDH 1740
ATL++YPF QLQERLIRSLQDPCYEVRLST+KWLFKFLKSTEYSDG DLS HEI TVDH
Sbjct: 1681 ATLINYPFPQLQERLIRSLQDPCYEVRLSTLKWLFKFLKSTEYSDGFNDLSIHEITTVDH 1740
Query: 1741 WIKTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECTEEVVYIGKMDCGS 1800
W KTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGN ECTEEVVYIGKMDCGS
Sbjct: 1741 WTKTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNVECTEEVVYIGKMDCGS 1800
Query: 1801 VLQFWDKLISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDATTTESPNDRISNN 1860
VLQFWDKLISLYKLT+HAKTRE T+RCMGTCIKR AV YSA I+SDA ESP DR SNN
Sbjct: 1801 VLQFWDKLISLYKLTKHAKTRETTLRCMGTCIKRCAVLYSASIVSDAMMGESPKDRTSNN 1860
Query: 1861 LDEFHSCITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDFVFDNQIPRDT 1920
L+EF SCITLFTDLI QHSAASEPVNMRTAAADSIIASGLLEQAEIFGD++FDNQIP++T
Sbjct: 1861 LEEFQSCITLFTDLISQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDYMFDNQIPQET 1920
Query: 1921 SKSHFEQREYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFSSERTTTSSDVP 1980
S SHFEQR+YVNMYAHQILN+WSTCIMLLEDEDD+IRK LAADVQKCFSSERTTTSSD
Sbjct: 1921 SNSHFEQRDYVNMYAHQILNIWSTCIMLLEDEDDEIRKSLAADVQKCFSSERTTTSSDAR 1980
Query: 1981 NQVEQVIGSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISPVDPVRRVFDKEIDNHHEE 2040
QVEQVIGSSFEYLSSIFGHWV YFDYLANWVLN+ NY SP DPVRRVFDKEIDNHHEE
Sbjct: 1981 TQVEQVIGSSFEYLSSIFGHWVRYFDYLANWVLNTANYAASPADPVRRVFDKEIDNHHEE 2040
Query: 2041 KLLISQTCCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMNKHGGFDW 2100
KLLISQTCCLH+EKLSRSKLVALWDTQWFINYLVGLRKRFF QLIKFSDEHM+KHGGFDW
Sbjct: 2041 KLLISQTCCLHLEKLSRSKLVALWDTQWFINYLVGLRKRFFRQLIKFSDEHMSKHGGFDW 2100
Query: 2101 IGGAGNHKDAFLLLYSNLLGFYALSNCIVNGKTKVTMQPLVTEVVEIGKIINPFLRNPLI 2160
IGGAGNHKDAFL LY NLLGFY++SNC++NGKT++ QPL TEVVEIGKIINPFLRNPLI
Sbjct: 2101 IGGAGNHKDAFLPLYGNLLGFYSISNCMINGKTQIITQPLDTEVVEIGKIINPFLRNPLI 2160
Query: 2161 SNLYLLVIRIHKEAIDINRDHKIPELGHEAIWEGFDPYFLLR 2192
SNLYLLVIRIHKEAID+NRDHKIPEL HE IWEGFDPYFLLR
Sbjct: 2161 SNLYLLVIRIHKEAIDVNRDHKIPELEHEGIWEGFDPYFLLR 2197
BLAST of HG10019700 vs. NCBI nr
Match:
KAG6580971.1 (Thyroid adenoma-associated protein-like protein, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 3908.6 bits (10135), Expect = 0.0e+00
Identity = 1961/2202 (89.06%), Postives = 2060/2202 (93.55%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNSF----QYPSKFFTELLQLVSLNSVYAQVN 60
MSAKWRALQHRHRYTYSAIVFP SY+DSLNSF Q SKFFTELL+LVSLNSVYAQVN
Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPHSYIDSLNSFQSQHQSSSKFFTELLELVSLNSVYAQVN 60
Query: 61 HAKKVASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDRLGE 120
HAKKVASAFSELLANGDEDSVS+AARFYLEVLFCENSQPLHRTLVSTLAKSRKFQD LGE
Sbjct: 61 HAKKVASAFSELLANGDEDSVSRAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDSLGE 120
Query: 121 CFRNLCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDTVSSLDSV 180
CFRNLCEEHSGM QGG+KRFCVSRVALSV+GMPKLGYLVD+IRDCAILV+RD VSSLDSV
Sbjct: 121 CFRNLCEEHSGMQQGGDKRFCVSRVALSVLGMPKLGYLVDVIRDCAILVSRDIVSSLDSV 180
Query: 181 VKETNESARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDPSVMGMIVRTILSILKSLAFS 240
VKETN+ ARPSPIV+EQCQEALSCLYYLLQRFPSKF ED SVMGMIV TILSILKSLAFS
Sbjct: 181 VKETNDLARPSPIVIEQCQEALSCLYYLLQRFPSKFLEDSSVMGMIVSTILSILKSLAFS 240
Query: 241 RDCYVAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFRNAIAKVPHQAN 300
RDCYV+AGVSFCASLQVCLNS ELG+LIFYGIFEQTNHI+ LKYE+EFRNA+AKVP+QAN
Sbjct: 241 RDCYVSAGVSFCASLQVCLNSEELGVLIFYGIFEQTNHISCLKYENEFRNAVAKVPYQAN 300
Query: 301 VCTEIQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNSGNSVKTILYDG 360
VC EIQTFSVLSRLCLIRGILTAIPRPVLNIPFSM+EG+LNGHPGCL SGNSVKTILYD
Sbjct: 301 VCAEIQTFSVLSRLCLIRGILTAIPRPVLNIPFSMIEGDLNGHPGCLYSGNSVKTILYDA 360
Query: 361 ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR 420
ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR
Sbjct: 361 ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR 420
Query: 421 IMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGSRCKG 480
IMWTNL+DPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEK K YL+KIAFDLL LGSRCKG
Sbjct: 421 IMWTNLEDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKTKSYLRKIAFDLLHLGSRCKG 480
Query: 481 RYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD 540
RYVPLASLTKRLGAKALLDMSPSLLS+TVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD
Sbjct: 481 RYVPLASLTKRLGAKALLDMSPSLLSDTVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD 540
Query: 541 GIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPS 600
GIEGGYALYRG CLPPVL GLGSGISKLRSNLNTYALPVLFEID+DSIFPMLAFISVWPS
Sbjct: 541 GIEGGYALYRGRCLPPVLRGLGSGISKLRSNLNTYALPVLFEIDIDSIFPMLAFISVWPS 600
Query: 601 SRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSVNEI 660
+ DNGVLYPG+ +GSMELRVEQKVAIFISL KVSRSLALIEGDIDWLEK SLEQ+ +EI
Sbjct: 601 ACDNGVLYPGNNEGSMELRVEQKVAIFISLFKVSRSLALIEGDIDWLEKRSLEQRFSHEI 660
Query: 661 EYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLL 720
EYF CHALVF+KGVKVEILVEWL+LALTHVDESLRVDAAEF+FLNPKTSSLPSHLELTLL
Sbjct: 661 EYFGCHALVFIKGVKVEILVEWLVLALTHVDESLRVDAAEFIFLNPKTSSLPSHLELTLL 720
Query: 721 KKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDLPNGS 780
KKA+PLNMRCSSTAFQMKW+SLFRKFFSRVRTALERQFK G+WIP AS N ES LPNG+
Sbjct: 721 KKAIPLNMRCSSTAFQMKWTSLFRKFFSRVRTALERQFKQGSWIPRASSSNRESYLPNGN 780
Query: 781 EQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSSE 840
EQ I GRADDL FMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKS+E
Sbjct: 781 EQTIAGRADDLFRFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSNE 840
Query: 841 TLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITW 900
TLLLPYNEGITLPDSVLLLVGSIIDSWD LRE+SFRILLHFPTPLPGISSE+MVG+VI W
Sbjct: 841 TLLLPYNEGITLPDSVLLLVGSIIDSWDSLRESSFRILLHFPTPLPGISSEHMVGEVIAW 900
Query: 901 AKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCL--DSLNKLTNVGEELK 960
AK LVCSSRVRESDAGAL LRLVFRKYVLDLGWIVRASG VVCL DS++KL NVGEE+
Sbjct: 901 AKKLVCSSRVRESDAGALTLRLVFRKYVLDLGWIVRASGDVVCLDSDSVHKLPNVGEEIC 960
Query: 961 KSNHPVAEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNSDVVLS 1020
+SNHPVAEYLKSLIDWL+ISV+EGERNL EACKNSFVHGVLL LRY FEELDW+SD+VLS
Sbjct: 961 RSNHPVAEYLKSLIDWLNISVTEGERNLAEACKNSFVHGVLLALRYTFEELDWSSDIVLS 1020
Query: 1021 SILEMRSLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPDEADVSTS 1080
S+ E+RSLLEKLLEL MRITSLAL VVSADAWYLPEDMDDM DDDAF+LDVPDEAD STS
Sbjct: 1021 SLSEIRSLLEKLLELVMRITSLALGVVSADAWYLPEDMDDMDDDDAFLLDVPDEADASTS 1080
Query: 1081 LSELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPIAS 1140
LSELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDP AS
Sbjct: 1081 LSELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPNAS 1140
Query: 1141 IISRQQEVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLT 1200
II + EVLDL+QLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLT
Sbjct: 1141 IILKHDEVLDLRQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLT 1200
Query: 1201 ESWMDQLMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAE 1260
ESWMDQLMER +A GQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAE
Sbjct: 1201 ESWMDQLMERMTANGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAE 1260
Query: 1261 RLLQNPIDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVL 1320
RLL NPID++ KN NFS ELGQDTESVS HETY SEKASKIRDEGVIPTVHAFNVL
Sbjct: 1261 RLLLNPIDSDCKNRNFS-----ELGQDTESVSPHETYASEKASKIRDEGVIPTVHAFNVL 1320
Query: 1321 RAAFNDTNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRE 1380
RA+FND NLATDTSGFSAQAIIV+IR+FSS YWEVRNSACLAYTALVRRM+GFLNVHKRE
Sbjct: 1321 RASFNDANLATDTSGFSAQAIIVSIRAFSSSYWEVRNSACLAYTALVRRMVGFLNVHKRE 1380
Query: 1381 SARRALTGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILL 1440
SARRALTGLEFFHRYPALHRFLLDELKVAT+SLDDGCSGN++ NLA VVHPSLCP+LILL
Sbjct: 1381 SARRALTGLEFFHRYPALHRFLLDELKVATDSLDDGCSGNAESNLAKVVHPSLCPVLILL 1440
Query: 1441 SRLKPSTIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNI 1500
SRLKPSTI SE GDDLDPFL MPFIRKCSSQSNLRIR+LASRALTGLVSNENL VILNI
Sbjct: 1441 SRLKPSTIVSEAGDDLDPFLFMPFIRKCSSQSNLRIRVLASRALTGLVSNENLPSVILNI 1500
Query: 1501 ASGLPVDDNAMMAHE-----DATATTQHASYNRIHGILLQLISLLDTNCRNLADILKKSQ 1560
ASGLPVDD MMA E D TATTQ +SYN+IHGILLQLISLLDTNCRNLADI KKSQ
Sbjct: 1501 ASGLPVDDTTMMAPESSTVLDVTATTQRSSYNKIHGILLQLISLLDTNCRNLADISKKSQ 1560
Query: 1561 VLNDLAEVLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRSKSFYIIRNLLLDL 1620
+LNDL EVL RCSWMA+RR CSCPIL TS LRVLGHMLSIVRTCPRSKS YIIRNLLLDL
Sbjct: 1561 ILNDLVEVLGRCSWMAKRRHCSCPILGTSFLRVLGHMLSIVRTCPRSKSLYIIRNLLLDL 1620
Query: 1621 STECLDVEASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAALQKSQRSQSDEDVP 1680
STECLD+E H+LSYYDPT+AELRQQA+ICYFNCV QPFDEED AA+QKSQRS+ DEDVP
Sbjct: 1621 STECLDMETYHKLSYYDPTLAELRQQAAICYFNCVLQPFDEEDYAAIQKSQRSEPDEDVP 1680
Query: 1681 ATLMDYPFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGLYDLSCHEIRTVDH 1740
ATL++YPF QLQERLIRSLQDPCYEVRLST+KWLFKFLKSTEYSDG DLS HEIRTVDH
Sbjct: 1681 ATLINYPFPQLQERLIRSLQDPCYEVRLSTLKWLFKFLKSTEYSDGFNDLSSHEIRTVDH 1740
Query: 1741 WIKTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECTEEVVYIGKMDCGS 1800
W KTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGN ECTEEVVYIGKMDCGS
Sbjct: 1741 WTKTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNVECTEEVVYIGKMDCGS 1800
Query: 1801 VLQFWDKLISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDATTTESPNDRISNN 1860
VL FWDKLISLYKLT+HAKTRE T+RCMGTCIKR AV YSA I+SDA ESP DR SNN
Sbjct: 1801 VLLFWDKLISLYKLTKHAKTRETTLRCMGTCIKRCAVLYSASIVSDAMMGESPKDRTSNN 1860
Query: 1861 LDEFHSCITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDFVFDNQIPRDT 1920
L+EF SCITLFTDLI QHSAASEPVN+RTAAADSIIASGLLEQAE+F D++FDNQIP++T
Sbjct: 1861 LEEFQSCITLFTDLISQHSAASEPVNIRTAAADSIIASGLLEQAEVFDDYMFDNQIPQET 1920
Query: 1921 SKSHFEQREYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFSSERTTTSSDVP 1980
S SHFEQR+YVNMYAHQILN+WSTCIMLLEDEDD+IRK LAADVQKCFS+ERTTTSSD
Sbjct: 1921 SNSHFEQRDYVNMYAHQILNIWSTCIMLLEDEDDEIRKSLAADVQKCFSTERTTTSSDAR 1980
Query: 1981 NQVEQVIGSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISPVDPVRRVFDKEIDNHHEE 2040
QVEQVIGSSFEYLSSIFGHWV YFDYLANWVLN+ NY SP DPVRRVFDKEIDNHHEE
Sbjct: 1981 TQVEQVIGSSFEYLSSIFGHWVRYFDYLANWVLNTANYAASPADPVRRVFDKEIDNHHEE 2040
Query: 2041 KLLISQTCCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMNKHGGFDW 2100
KLLISQTCCLH+EKLSRSKLVALWDTQWFINYLVGLRKRFF QLIKFSDEHM+KHGGFDW
Sbjct: 2041 KLLISQTCCLHLEKLSRSKLVALWDTQWFINYLVGLRKRFFRQLIKFSDEHMSKHGGFDW 2100
Query: 2101 IGGAGNHKDAFLLLYSNLLGFYALSNCIVNGKTKVTMQPLVTEVVEIGKIINPFLRNPLI 2160
IGGAGNHKDAFL LY NLLGFY++SNC++NGKT++ QPL TEVVEIGKIINPFLRNPLI
Sbjct: 2101 IGGAGNHKDAFLPLYGNLLGFYSISNCMINGKTQIITQPLDTEVVEIGKIINPFLRNPLI 2160
Query: 2161 SNLYLLVIRIHKEAIDINRDHKIPELGHEAIWEGFDPYFLLR 2192
SNLYLLVIRIHKEAID+NRDHKIPEL HE IWEGFDPYFLLR
Sbjct: 2161 SNLYLLVIRIHKEAIDVNRDHKIPELEHEGIWEGFDPYFLLR 2197
BLAST of HG10019700 vs. ExPASy Swiss-Prot
Match:
A8C754 (Thyroid adenoma-associated protein homolog OS=Gallus gallus OX=9031 GN=THADA PE=2 SV=1)
HSP 1 Score: 240.4 bits (612), Expect = 2.0e-61
Identity = 284/1208 (23.51%), Postives = 501/1208 (41.47%), Query Frame = 0
Query: 409 EMGSRILRIMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLL 468
E+ ++L ++T+ + PL Q LIF L+I ++ + EK + ++ LL
Sbjct: 389 EVVGKLLEYVYTHWEHPLDAVRHQSKLIFRNLLQIHRTII-AASDEKSDPFFARLTRRLL 448
Query: 469 RLGSRCKGRYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHL 528
L KG+Y LA L + LG + +L + S+ + + D + A+ L+ +
Sbjct: 449 SLEWHVKGKYASLACLVECLGTENILQLDRSIPVQILNVMNDQSLAPYASDLLETMFTNH 508
Query: 529 RDECWSSDGIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPML 588
+ + S + + P+L L G + + Y LP L DS+ M
Sbjct: 509 KVQFTSGSQKSTWIDQWHDVWVSPLLQILCEGNHDQTTYIIDYYLPKLLRCSPDSLSYM- 568
Query: 589 AFISVWPSSRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSL 648
I + +S D + GS R + ++ L+ +R+ G +
Sbjct: 569 --IRILQASADANL-------GSWSTR--GALGALMACLRTARA----HGHL-------- 628
Query: 649 EQQSVNEIEYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLP 708
E+ LV + +K L H + +DA L +++ +
Sbjct: 629 ------ELSNIMSRGLVSTESIK---------QGLVHQHNQVCIDALGLLCETHRSTEIV 688
Query: 709 SHLELTLLKKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNS 768
S E+ L+ + N+ S + + + SL RK F R+R + + +K W +
Sbjct: 689 SVEEMQLILFFITYNLNSQSPSVRQQICSLLRKLFCRIRESSQVLYK---WEQNKTKQEL 748
Query: 769 ESDLPNGSEQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVV 828
D P + I + D F+ L LF + +P + + + A+ + + ++SV
Sbjct: 749 FEDSPKRNPLGILQKYQD---FLSSLCDRLFEALFPGSSHPTRFSALSILGSVAEIFSVQ 808
Query: 829 PSKEKSSETLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEY 888
+E+ + ++ I V L+ +++ ++ +F +L+ + +
Sbjct: 809 KGQEQ-----VFRLDQEIN-SARVRTLIQCFASTFEEVKVLAFELLMKLRDVVFXLQDSE 868
Query: 889 MVGKVITWAKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLNKLTN 948
+ + A L S+ + D + L F Y DL I + +N+ T+
Sbjct: 869 SLDLLFQAAMDL--STSTKPYDCVTASYLLNFLAYHEDLQHICLGK-WIKHNPQMNEDTS 928
Query: 949 VGEELKKSNHPVAEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWN 1008
VG K + +K L+ ++ + + +++L +A + ++G + + A ++L N
Sbjct: 929 VGTVEKN----ILAVIKLLLVNVEEEIFQAKKSLLQAAASFPMYGRVHCINGALQQLPLN 988
Query: 1009 SDVVLSSILEMRSLLEKLLELGMRITSLALWVV--SADAWYLPEDMDDMVDDDAFM-LDV 1068
+ L + E + ++ +L+ + ++++ VV S+ +P D+D D M L
Sbjct: 989 N---LMFVTEWKQIVARLILMSYELSAVVSPVVQSSSPEGLIPMDIDSETADRLHMILKE 1048
Query: 1069 PDEADVSTSLSELEDSKEKTTVNSR------------------------TSEQIVMVGCW 1128
D + + + KE + S + Q+V+V CW
Sbjct: 1049 IQPQDTNDYFMQAKMLKEHCKIQSEKLAEHKPMENICTEMRGKESQICDVTAQMVLVCCW 1108
Query: 1129 LAMKEVSLLLGTITRKVPLPAASDSVESDPIASIISRQQEVLDLKQLKVIGDHFLEVLLK 1188
+MKEVSLLLGT+ + +P A+S+ ++ ++Q+K IGD+F L++
Sbjct: 1109 RSMKEVSLLLGTLCKLLPTQASSEP------------SHGLITVEQVKNIGDYFKHHLMQ 1168
Query: 1189 MKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQLMER-TSAKGQTVDDLLRRSA 1248
+H GA + AGF L L N + L K+ E W+ ++E S + RRSA
Sbjct: 1169 SRHRGAFELAYAGFVQLTETLSRCNSESLRKMPEQWLRCVLEEIKSCDPSSTLCATRRSA 1228
Query: 1249 GIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNPIDTNHKNSNFSKLPSTELGQ 1308
GIP AL +EP+ LL +K L+ +A PS+E
Sbjct: 1229 GIPFYIQALLASEPKKGKMDLLKMTIKELMSLAS-------------------PSSE--- 1288
Query: 1309 DTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRAAFNDTNLATDTSGFSAQAIIVAIR 1368
IP VHA N+LRA F DT L + + A I AI
Sbjct: 1289 --------------------PPSAIPQVHALNILRALFRDTRLGENIMPYVADGIQAAIL 1348
Query: 1369 SFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRALTGLEFFHRYPALHRFLLDEL 1428
F+SP W VRNS+ L ++AL+ R+ G S + +TG EFF R+P+L+ FLL +L
Sbjct: 1349 GFTSPIWAVRNSSTLLFSALITRIFGVKRGKDENSKKNRMTGAEFFSRFPSLYPFLLKQL 1408
Query: 1429 KVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSRLKPSTI-ASEDGDDLDPFLLMPFI 1488
+V +L+ S + + +HPSL +L++L +L PS + + + PF PFI
Sbjct: 1409 EVVANTLN---SEDEELK----IHPSLFLLLLILGKLYPSPMDGTYSALSMAPF--XPFI 1455
Query: 1489 RKCSSQSNLRIRILASRALTGLVSNENLQLVILNIASGLPVDDNAMMAHEDATATTQHAS 1548
+C R R ++ RAL V + +L++ GLP D+A +
Sbjct: 1469 IRCGHSPVYRSREMSGRALVPFVMINEVPHTVLSLLKGLP--DSASLC----------IR 1455
Query: 1549 YNRIHGILLQ----LISLLDTNCRNLADILKKSQVLNDLAEVLARCSWMARRRRCSCPIL 1584
N IHG LLQ L S LD+ +D Q L+D+ + W+A+R C +
Sbjct: 1529 QNNIHGTLLQVSHLLQSYLDSKQLGNSDF---EQGLSDIVTCIGSKLWLAKRPN-PCLVT 1455
BLAST of HG10019700 vs. ExPASy Swiss-Prot
Match:
A8C756 (Thyroid adenoma-associated protein homolog OS=Mus musculus OX=10090 GN=Thada PE=1 SV=1)
HSP 1 Score: 238.8 bits (608), Expect = 5.8e-61
Identity = 288/1195 (24.10%), Postives = 493/1195 (41.26%), Query Frame = 0
Query: 413 RILRIMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGS 472
R+L ++T+ + PL Q ++F L++ L + ++ LL+L
Sbjct: 391 RLLEYVYTHWEHPLDALRHQTKVMFRNLLQMH-RLTMEGADLATDPFCLELTKSLLQLEW 450
Query: 473 RCKGRYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDEC 532
KG+Y L L + LG + +L + ++ S+ ++ D + A+ L+ ++ +
Sbjct: 451 HIKGKYACLGCLVETLGIEHILAIDKTIPSQILEVMGDQSLVPYASDLLETMFKNHKSHL 510
Query: 533 WSSDGIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFIS 592
S + + PVL L G RS + Y LP + +S+ M +
Sbjct: 511 KSQTVTNTWMDKWHETWVFPVLSVLCGGNLDQRSYVIDYYLPRILNYSPESLHYM---VH 570
Query: 593 VWPSSRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQS 652
+ +S D G GS R + ++ L+ +R+ G + Q +
Sbjct: 571 ILQASTDTGT-------GSCNHR--GALGALMACLRTARA----HGHL---------QSA 630
Query: 653 VNEIEYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLE 712
E C A V L H +R+D L + +++ + S E
Sbjct: 631 TQAWENLVCSARV--------------KQGLIHQHCQVRIDTLGLLCESNRSTEVVSTEE 690
Query: 713 LTLLKKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDL 772
+ ++ + N+ S + + SL +K F R++ + + +KL S D
Sbjct: 691 MQWVQFFITYNLNSQSPGVRQQICSLLKKLFCRIQESSQVLYKLEQ-------RKSTPDS 750
Query: 773 PNGS-EQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSK 832
NGS + + +FM + LF + +P + Y + A+ ++ +V V P
Sbjct: 751 ENGSIREQPSVTLQQYKNFMSSVCNILFEALFPGSSYSTRFSAL---TILGSVAEVFPDP 810
Query: 833 EKSSETLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPG-ISSEYMV 892
E + +T+ ++ +L+ +++ ++ +F +L+ + G +
Sbjct: 811 EGNIQTVYQLSHD--IDAGRYQILMECFTSTFEEVKTLAFDLLMKLSSVTAGQFQDSEKL 870
Query: 893 GKVITWAKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLNKLT-NV 952
+ A L S++ + + L L+ R+ L V+ S +LT
Sbjct: 871 QDLFQAALELSTSTKPYDCVTASYLLNLLIRQDALP---------AVLSASSPQQLTRGA 930
Query: 953 GEELKKSNHPVAEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNS 1012
GE +K L++ L+ +S+ E +L +A + ++G + + AF+ L N
Sbjct: 931 GETSAVLERNTLVVIKCLMENLEDEISQAENSLLQAASSFPMYGRVHCITRAFQRLPLN- 990
Query: 1013 DVVLSSILEMRSLLEKLLELGMRITSLALWVV--------------SADAWYL------- 1072
D+ L+S E R LL +LL L R++++ V+ SA A L
Sbjct: 991 DLRLAS--EWRPLLGRLLLLSYRLSTVVAPVIQSSSPEGLIPVDTDSASASRLQLILNEI 1050
Query: 1073 -PEDMDDMVDDDAFM--LDVPDEADVSTSLSELEDSKE---KTTVNSRTSEQIVMVGCWL 1132
P D +D + + D D D+STS+S ++ S E K + Q+V+ CW
Sbjct: 1051 QPRDTNDYFNHTKILKECDSFDLEDLSTSVSNIDSSAEVKGKEEKACDVTAQMVLACCWR 1110
Query: 1133 AMKEVSLLLGTITRKVPLPAASDSVESDPIASIISRQQEVLDLKQLKVIGDHFLEVLLKM 1192
+MKEV+LLLGT+ + +P+ +S L ++Q+K IGD+F + LL+
Sbjct: 1111 SMKEVALLLGTLCQLLPVQPGPES------------SNVFLTVQQVKEIGDYFKQHLLQS 1170
Query: 1193 KHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQLMERTSAKGQTVDDLL---RRS 1252
+H GA + GF L L ++ L KL E W+ ++E KG L RRS
Sbjct: 1171 RHRGAFELAYTGFVKLTEILNRCSNVSLQKLPEQWLRSVLE--EIKGSDPSSKLCATRRS 1230
Query: 1253 AGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNPIDTNHKNSNFSKLPSTELG 1312
AGIP AL +EP+ S LL ++ LI +A
Sbjct: 1231 AGIPFYIQALLASEPKKSRMDLLKITMRELISLA-------------------------- 1290
Query: 1313 QDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRAAFNDTNLATDTSGFSAQAIIVAI 1372
S +G +P VHA N+LRA F DT L + + A AI
Sbjct: 1291 ----------------LSADDSKGRVPQVHALNILRALFRDTRLGENIIPYVAGGAKAAI 1350
Query: 1373 RSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRALTGLEFFHRYPALHRFLLDE 1432
F+SP W VRNS+ L +++L+ R+ G S +TG EFF R+P L+ FLL +
Sbjct: 1351 LGFTSPVWAVRNSSTLLFSSLITRVFGVKRGKDEVSKTNRMTGREFFSRFPELYPFLLKQ 1410
Query: 1433 LKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSRLKPSTI-ASEDGDDLDPFLLMPF 1492
L+ ++D + HP + +L++L RL PS + + L PF +PF
Sbjct: 1411 LETVASTVDSELGEPDR-------HPGMFLLLLVLERLYPSPMDGTSSALSLAPF--VPF 1441
Query: 1493 IRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIASGLPVDDNAMMAHEDATATTQHA 1552
I +C R R +A+RAL + + + + + + LP +T Q
Sbjct: 1471 IIRCGRSPIYRSREMAARALVPFIMIDQIPSTLCALLNSLP------------NSTDQCF 1441
Query: 1553 SYNRIHGILLQLISLLD---TNCRNL--ADILKKSQVLNDLAEVLARCSWMARRR 1569
N IHG LLQ+ LL T+CR+ AD L Q L+D+ W+A R+
Sbjct: 1531 RQNHIHGTLLQVFHLLQAYITDCRHRTNADFL---QELSDVTACTKAKLWLAMRQ 1441
BLAST of HG10019700 vs. ExPASy Swiss-Prot
Match:
A8C750 (Thyroid adenoma-associated protein homolog OS=Canis lupus familiaris OX=9615 GN=THADA PE=2 SV=1)
HSP 1 Score: 226.5 bits (576), Expect = 3.0e-57
Identity = 273/1204 (22.67%), Postives = 487/1204 (40.45%), Query Frame = 0
Query: 413 RILRIMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKL----YLQKIAFDLL 472
R+L ++T+ + PL Q +IF L++ SE L ++ + LL
Sbjct: 391 RLLEYVYTHWEHPLDALRHQTKIIFRNILQMHQLTKEKSNSEVSGLAADHFICDLTEGLL 450
Query: 473 RLGSRCKGRYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFL--- 532
RL KG+Y L L +G +L ++ ++ S+ ++ D + A+ L+
Sbjct: 451 RLEWHVKGKYTCLGCLVDYIGIGHILALAKTIPSQILEVMGDQSLVPYASDLLETMFRSH 510
Query: 533 -EHLRDECWSSDGIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSI 592
HL+ + S I+ + + + P+L L G +S + Y LP L +S+
Sbjct: 511 KNHLKSQALDSTWIDEWHETW----VSPLLFILCEGNLDQKSYVIDYYLPKLLNCSPESL 570
Query: 593 FPMLAFISVWPSSRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLE 652
M+ + ++ G Y L ++ L+ +R+ ++ D
Sbjct: 571 SYMVKILQTSADAK------TGSYNSRGAL------GALMACLRTARAHGHLQSATDTWR 630
Query: 653 KPSLEQQSVNEIEYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKT 712
V +++ L H +R+D L + ++
Sbjct: 631 N--------------------LVSSARIK-------QGLIHQHCQVRIDTLGLLCESNRS 690
Query: 713 SSLPSHLELTLLKKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLAS 772
+ + S E+ ++ + N+ S + + SL +K F R++ + + +K S
Sbjct: 691 TEIVSTEEMQWIQFFITYNLNSQSPGVRQQICSLLKKLFCRIQESSQVLYKQEQ---SRS 750
Query: 773 CCNSESDLPNGSEQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNV 832
E++L ++Q + +FM + LF + +P + Y + A+ + + V
Sbjct: 751 KHEPENEL---TKQHPSVSLQQYKNFMSSICSHLFEALFPGSSYPTRFSALTILGSIAEV 810
Query: 833 WSVVPSKEKSSETLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGI 892
+ V + ++ L + G L+ +++ ++ +F +L+ P +
Sbjct: 811 FPVTEGQVQAVYQLSHDIDVG-----RFQTLMECFTSTFEEVKILAFDLLMKLPKTVVQF 870
Query: 893 SSEYMVGKVITWAKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLN 952
+ + A L S++ + + L + + VL DSL
Sbjct: 871 QDSEKLQGLFQAALELSSSTKPYDCVTASYLLNFLIWQDVLPSS----------LFDSLK 930
Query: 953 KLTNVGEELKKSNHPVAE----YLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRY 1012
E+ KS V +K L++ L+ VS+ E +L +A + ++G + +
Sbjct: 931 TQQTACEDGDKSAIVVERNTLMVIKCLLENLEEEVSQAENSLLQAAASFPLYGRVHCVTG 990
Query: 1013 AFEELDWNSDVVLSSILEMRSLLEKLLELGMRITSLALWVV--SADAWYLPEDMDD---- 1072
A + L N+ L + E R ++EKLL + R++++ V+ S+ +P D D
Sbjct: 991 ALQRLSLNN---LQLVSEWRPVIEKLLLMSYRLSAVVSPVIQSSSPEGLIPMDTDSESAS 1050
Query: 1073 -----------------------MVDDDAFMLDVPDEADVSTSLSELEDSKEKTTVNSRT 1132
+ + D+F L+ D++ S+ + S E +T
Sbjct: 1051 RLQTILNEIQPRDTNDYFTQAKILKEHDSFDLE-----DLNVSVQNIGASAEVKGKERKT 1110
Query: 1133 SE---QIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPIASIISRQQEVLDLKQL 1192
+ Q+V+V CW +MKEV+LLLGT+ + +P+ + +S +L +Q+
Sbjct: 1111 CDVTAQMVLVCCWRSMKEVALLLGTLCQLLPMQSVPES------------SNGLLTEEQV 1170
Query: 1193 KVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQLMERTSAK 1252
K IGD+F + LL+ +H GA + GF L L + L KL E W+ ++E
Sbjct: 1171 KEIGDYFKQHLLQSRHRGAFELAYTGFVKLTEILNRCPNVSLQKLPEQWLWNVLEEIKCS 1230
Query: 1253 GQTVD-DLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNPIDTNHKN 1312
+ RRSAGIP AL +EP+ LL +K LI +A P D +
Sbjct: 1231 DPSSKLCATRRSAGIPFYIQALLASEPKKGKMDLLKITMKELISLA-----GPTDDS--- 1290
Query: 1313 SNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRAAFNDTNLATDT 1372
+ +P VHA N+LRA F DT L +
Sbjct: 1291 ----------------------------------QSTVPQVHALNILRALFRDTRLGENI 1350
Query: 1373 SGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRALTGLEFFH 1432
+ A AI F+SP W VRNS+ L ++ L+ R+ G S + +TG EFF
Sbjct: 1351 IPYVADGAKAAILGFTSPVWAVRNSSTLLFSTLITRIFGVKRGKDELSKKNRMTGSEFFS 1410
Query: 1433 RYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSRLKPSTI-ASED 1492
R+P L+ FLL +L+ ++D S + N HPS+ +L++L RL PS + +
Sbjct: 1411 RFPELYPFLLQQLEAVANTVD---SDTGELNR----HPSMFLLLLVLGRLYPSPMDGTYS 1446
Query: 1493 GDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIASGLPVDDNAMM 1552
+ PF+ PFI +C + R R +A+RAL V + + I + + LP
Sbjct: 1471 ALSMAPFI--PFIMRCGRSPDYRSREMAARALVPFVMVDEIPTTIRTLLAKLP------- 1446
Query: 1553 AHEDATATTQHASYNRIHGILLQLISLLD--TNCRNLADILKKSQVLNDLAEVLARCSWM 1569
T Q N IHG LLQ+ LL T+ + + + Q L D+A W+
Sbjct: 1531 -----NCTDQRFRQNHIHGTLLQVFHLLQAFTDSKYRLNTYFQ-QELADVAVCTRAKLWL 1446
BLAST of HG10019700 vs. ExPASy Swiss-Prot
Match:
A8C752 (Thyroid adenoma-associated protein homolog OS=Chlorocebus aethiops OX=9534 GN=THADA PE=2 SV=1)
HSP 1 Score: 224.2 bits (570), Expect = 1.5e-56
Identity = 269/1194 (22.53%), Postives = 487/1194 (40.79%), Query Frame = 0
Query: 413 RILRIMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKI-KLYLQKIAFDLLRLG 472
R+L ++T+ + PL Q ++F L++ EG+ + + K+ LLRL
Sbjct: 391 RLLEYVYTHWEHPLDALRHQTKIMFKNLLQMHRLTV--EGAVLVPDPFFVKLTESLLRLE 450
Query: 473 SRCKGRYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLE----H 532
KG+Y+ L L + +G + +L + ++ S+ ++ D + A+ L+ + H
Sbjct: 451 WHIKGKYMCLGCLVECIGIEHILAIDKTIPSQILEVMGDQSLVPYASDLLETMFKNHKSH 510
Query: 533 LRDECWSSDGIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPM 592
L+ + S I+ + + + P+L L G +S + Y LP L +S+ M
Sbjct: 511 LKSQTAESSWIDQWHETW----VSPLLFILCEGNLDQKSYVIDYYLPKLLSYSPESLQYM 570
Query: 593 LAFISVWPSSRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPS 652
+ + +S D + + ++ L+++R+ ++ D E
Sbjct: 571 ---VKILQTSIDAKTGQEQSFPSLGSCNSRGALGALMACLRIARAHGHLQSATDTWEN-- 630
Query: 653 LEQQSVNEIEYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSL 712
V G +++ L H +R+D L + +++ +
Sbjct: 631 ------------------LVSGARIK-------QGLIHQHCQVRIDTLGLLCESNRSTEI 690
Query: 713 PSHLELTLLKKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCN 772
S E+ ++ + N+ S + + SL +K F R++ + + +KL S
Sbjct: 691 VSMEEMQWIQFFITYNLNSQSPGVRQQICSLLKKLFCRIQESSQVLYKLEQ---NKSKHE 750
Query: 773 SESDLPNGSEQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSV 832
E +L ++Q + +FM + LF + +P + Y + A+ + + V+ V
Sbjct: 751 PEKEL---TKQHPSVSLQQYKNFMSSICNSLFEALFPGSSYSTRFSALTILGSIAEVFHV 810
Query: 833 VPSKEKSSETLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSE 892
+ + L N I + L+ +++ ++ +F +L+
Sbjct: 811 PEGRIYTVYQL----NHDIDV-GRFQALMECFTSTFEDVKMLAFDLLMKLSKTAVHFQDS 870
Query: 893 YMVGKVITWAKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLNKLT 952
+ + A L S++ + + L + + L V + V D +
Sbjct: 871 EKLQGLFQAALALSTSTKPYDCVTASYLLNFLIWQDALPSSLSVYLTQQVARGDGDRPAS 930
Query: 953 NVGEELKKSNHPVAEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDW 1012
V +K L++ L+ V + E +L +A + ++G + + A ++L
Sbjct: 931 VV-------ERNTLMVIKCLMENLEEEVYQAENSLLQAAASFPMYGRVHCITGALQKLSL 990
Query: 1013 NSDVVLSSILEMRSLLEKLLELGMRITSLALWVV--SADAWYLPEDMDDMVDDDAFML-- 1072
NS L + E R ++EKLL + R++++ V+ S+ +P D D M+
Sbjct: 991 NS---LQLVSEWRPVVEKLLLMSYRLSTVVSPVIQSSSPEGLIPMDTDSESASRLQMILN 1050
Query: 1073 --------------------DVPDEADVSTSLSELEDSKE---KTTVNSRTSEQIVMVGC 1132
D D D++ S+ ++ S E K + Q+V+V C
Sbjct: 1051 EIQPRDTNDYFNQAKILKEHDSFDMKDLNASVVNIDISTEIKGKEVKTCDVTAQMVLVCC 1110
Query: 1133 WLAMKEVSLLLGTITRKVPLPAASDSVESDPIASIISRQQEVLDLKQLKVIGDHFLEVLL 1192
W +MKEV+LLLGT+ + +P+ +S + +L ++Q+K IGD+F + LL
Sbjct: 1111 WRSMKEVALLLGTLCQLLPMQPVPESSDG------------LLTVEQVKEIGDYFKQHLL 1170
Query: 1193 KMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQLMERTSAKGQTVD-DLLRRS 1252
+ +H GA + GF L L + L KL E W+ ++E + RRS
Sbjct: 1171 QSRHRGAFELAYTGFVKLTEVLNRCPNVSLQKLPEQWLWSVLEEIKCSDPSSKLCATRRS 1230
Query: 1253 AGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNPIDTNHKNSNFSKLPSTELG 1312
AGIP AL +EP+ LL +K LI +A P+ +L
Sbjct: 1231 AGIPFYIQALLASEPKKGKMDLLKITMKELISLAG-------------------PTDDL- 1290
Query: 1313 QDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRAAFNDTNLATDTSGFSAQAIIVAI 1372
+ +P VHA N+LRA F DT L + + A AI
Sbjct: 1291 ----------------------QSTVPQVHALNILRALFRDTRLGENIIPYVADGAKAAI 1350
Query: 1373 RSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRALTGLEFFHRYPALHRFLLDE 1432
F+SP W VRNS+ L ++AL+ R+ G S +TG EFF R+P L+ FLL +
Sbjct: 1351 LGFTSPVWAVRNSSTLLFSALITRIFGVKRAKDELSKTNRMTGREFFSRFPELYPFLLKQ 1410
Query: 1433 LKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSRLKPSTI-ASEDGDDLDPFLLMPF 1492
L+ ++D ++ HPS+ +L++L RL PS + + + PF +PF
Sbjct: 1411 LETVANAVDSDMGEPNR-------HPSMFLLLLVLERLYPSPMDGTSSALSMGPF--VPF 1449
Query: 1493 IRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIASGLPVDDNAMMAHEDATATTQHA 1552
I +C R +A+RAL V +++ I + + LP + T Q
Sbjct: 1471 IMRCGHSPVYHSREMAARALVPFVMIDHIPNTIRTLLATLP------------SCTDQCF 1449
Query: 1553 SYNRIHGILLQLISLL----DTNCRNLADILKKSQVLNDLAEVLARCSWMARRR 1569
NRIHG LLQ+ LL D+ R +D L D+ W+A+R+
Sbjct: 1531 RQNRIHGTLLQVFHLLQAYSDSKHRTNSDF---QHELTDITVCTKAKLWLAKRQ 1449
BLAST of HG10019700 vs. ExPASy Swiss-Prot
Match:
Q6YHU6 (Thyroid adenoma-associated protein OS=Homo sapiens OX=9606 GN=THADA PE=1 SV=1)
HSP 1 Score: 218.8 bits (556), Expect = 6.2e-55
Identity = 282/1263 (22.33%), Postives = 508/1263 (40.22%), Query Frame = 0
Query: 346 GNSVKTILYD--GILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSY 405
G S + +L D +L L + + PT F L Q+ S +LTD+
Sbjct: 326 GRSGEALLLDTAHVLFTLSSQIKEPTLEMFLSRILASWTNSAIQVLESSSPSLTDSLNGN 385
Query: 406 DPLPEEMGSRILRIMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKI-KLYLQK 465
+ R+L ++T+ + PL Q ++F L++ EG++ + + +
Sbjct: 386 SSIV----GRLLEYVYTHWEHPLDALRHQTKIMFKNLLQMHRLTV--EGADFVPDPFFVE 445
Query: 466 IAFDLLRLGSRCKGRYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLK 525
+ LLRL KG+Y L L + +G + +L + ++ S+ ++ D + A+ L+
Sbjct: 446 LTESLLRLEWHIKGKYTCLGCLVECIGVEHILAIDKTIPSQILEVMGDQSLVPYASDLLE 505
Query: 526 CFL----EHLRDECWSSDGIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFE 585
HL+ + S I+ + + + P+L L G +S + Y LP L
Sbjct: 506 TMFRNHKSHLKSQTAESSWIDQWHETW----VSPLLFILCEGNLDQKSYVIDYYLPKLLS 565
Query: 586 IDLDSIFPMLAFISVWPSSRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEG 645
+S+ M + + +S D + + ++ L+++R+ ++
Sbjct: 566 YSPESLQYM---VKILQTSIDAKTGQEQSFPSLGSCNSRGALGALMACLRIARAHGHLQS 625
Query: 646 DIDWLEKPSLEQQSVNEIEYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFL 705
D E V +++ L H +R+D L
Sbjct: 626 ATDTWEN--------------------LVSDARIK-------QGLIHQHCQVRIDTLGLL 685
Query: 706 FLNPKTSSLPSHLELTLLKKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGN 765
+ +++ + S E+ ++ + N+ S + + SL +K F R++ + + +KL
Sbjct: 686 CESNRSTEIVSMEEMQWIQFFITYNLNSQSPGVRQQICSLLKKLFCRIQESSQVLYKLEQ 745
Query: 766 WIPLASCCNSESDLPNGSEQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLF 825
S E++L ++Q + +FM + LF + +P + Y + A+ +
Sbjct: 746 ---SKSKREPENEL---TKQHPSVSLQQYKNFMSSICNSLFEALFPGSSYSTRFSALTIL 805
Query: 826 LVMLNVWSVVPSKEKSSETLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFP 885
+ V+ V + + L + G L+ +++ ++ +F +L+
Sbjct: 806 GSIAEVFHVPEGRIYTVYQLSHDIDVG-----RFQTLMECFTSTFEDVKILAFDLLMKLS 865
Query: 886 TPLPGISSEYMVGKVITWAKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVV 945
+ + A L S++ + + L + + L + V
Sbjct: 866 KTAVHFQDSGKLQGLFQAALELSTSTKPYDCVTASYLLNFLIWQDALPSSLSAYLTQQVA 925
Query: 946 CLDSLNKLTNVGEELKKSNHPVAEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTL 1005
C D+ ++ V E +K L++ L+ VS+ E +L +A ++G + +
Sbjct: 926 C-DNGDRPAAVVE------RNTLMVIKCLMENLEEEVSQAENSLLQAAAAFPMYGRVHCI 985
Query: 1006 RYAFEELDWNSDVVLSSILEMRSLLEKLLELGMRITSLALWVV--SADAWYLPEDMDDMV 1065
A ++L NS L + E R ++EKLL + R++++ V+ S+ +P D D
Sbjct: 986 TGALQKLSLNS---LQLVSEWRPVVEKLLLMSYRLSTVVSPVIQSSSPEGLIPMDTDSES 1045
Query: 1066 DDDAFML----------------------DVPDEADVSTSLSELEDSKE---KTTVNSRT 1125
M+ D D D++ S+ ++ S E K
Sbjct: 1046 ASRLQMILNEIQPRDTNDYFNQAKILKEHDSFDMKDLNASVVNIDTSTEIKGKEVKTCDV 1105
Query: 1126 SEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPIASIISRQQEVLDLKQLKVI 1185
+ Q+V+V CW +MKEV+LLLG + + +P+ +S + +L ++Q+K I
Sbjct: 1106 TAQMVLVCCWRSMKEVALLLGMLCQLLPMQPVPESSDG------------LLTVEQVKEI 1165
Query: 1186 GDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQLMERTSAKGQT 1245
GD+F + LL+ +H GA + GF L L + L KL E W+ ++E +
Sbjct: 1166 GDYFKQHLLQSRHRGAFELAYTGFVKLTEVLNRCPNVSLQKLPEQWLWSVLEEIKCSDPS 1225
Query: 1246 VD-DLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNPIDTNHKNSNF 1305
RRSAGIP AL +EP+ LL +K LI +A P D
Sbjct: 1226 SKLCATRRSAGIPFYIQALLASEPKKGRMDLLKITMKELISLA-----GPTD-------- 1285
Query: 1306 SKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRAAFNDTNLATDTSGF 1365
D +S +P VHA N+LRA F DT L + +
Sbjct: 1286 ----------DIQS-------------------TVPQVHALNILRALFRDTRLGENIIPY 1345
Query: 1366 SAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRALTGLEFFHRYP 1425
A AI F+SP W VRNS+ L ++AL+ R+ G S +TG EFF R+P
Sbjct: 1346 VADGAKAAILGFTSPVWAVRNSSTLLFSALITRIFGVKRAKDEHSKTNRMTGREFFSRFP 1405
Query: 1426 ALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSRLKPSTI-ASEDGDD 1485
L+ FLL +L+ ++D ++ HPS+ +L++L RL S + +
Sbjct: 1406 ELYPFLLKQLETVANTVDSDMGEPNR-------HPSMFLLLLVLERLYASPMDGTSSALS 1449
Query: 1486 LDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIASGLPVDDNAMMAHE 1545
+ PF +PFI +C R +A+RAL V +++ I + S LP
Sbjct: 1466 MGPF--VPFIMRCGHSPVYHSREMAARALVPFVMIDHIPNTIRTLLSTLP---------- 1449
Query: 1546 DATATTQHASYNRIHGILLQLISLL----DTNCRNLADILKKSQVLNDLAEVLARCSWMA 1569
+ T Q N IHG LLQ+ LL D+ +D L D+ W+A
Sbjct: 1526 --SCTDQCFRQNHIHGTLLQVFHLLQAYSDSKHGTNSDF---QHELTDITVCTKAKLWLA 1449
BLAST of HG10019700 vs. ExPASy TrEMBL
Match:
A0A5A7UJ45 (Thyroid adenoma-associated protein-like protein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G008110 PE=4 SV=1)
HSP 1 Score: 4000.3 bits (10373), Expect = 0.0e+00
Identity = 2000/2196 (91.07%), Postives = 2086/2196 (94.99%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNSFQYPSKFFTELLQLVSLNSVYAQVNHAKK 60
MSAKWRALQHRHRYTYSAIVFP S+VDSLNSF+ SKFFTELLQLVSLNSVYAQVNHAKK
Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPNSFVDSLNSFRSSSKFFTELLQLVSLNSVYAQVNHAKK 60
Query: 61 VASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDRLGECFRN 120
VASAFSELLANGDEDSV KAARFYLEVLF ENSQPLHRTLVSTLAKSRKFQD LGECFR+
Sbjct: 61 VASAFSELLANGDEDSVFKAARFYLEVLFFENSQPLHRTLVSTLAKSRKFQDPLGECFRD 120
Query: 121 LCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDTVSSLDSVVKET 180
LCE+HSG+LQGGEKRFCVSRVALSVMGMPKLGYLVD+I+DCA+LVARD VSSLD VVKET
Sbjct: 121 LCEKHSGVLQGGEKRFCVSRVALSVMGMPKLGYLVDVIKDCALLVARDIVSSLDYVVKET 180
Query: 181 NESARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDPSVMGMIVRTILSILKSLAFSRDCY 240
NESARPSPI+MEQCQEALSCLYYLLQRFP+KFQED V+ MI+ +ILSILKSLAFSRDCY
Sbjct: 181 NESARPSPIIMEQCQEALSCLYYLLQRFPAKFQEDSDVLRMILSSILSILKSLAFSRDCY 240
Query: 241 VAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFRNAIAKVPHQANVCTE 300
VAAGVSFCASLQVCLNS ELG+LIFYGI EQTNHI+FLKY+SEFRN + KVPHQANVC E
Sbjct: 241 VAAGVSFCASLQVCLNSEELGVLIFYGILEQTNHISFLKYDSEFRNTVGKVPHQANVCAE 300
Query: 301 IQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNSGNSVKTILYDGILPE 360
I+TFSVLSRLCLIRGILTAIPRPVLNI FSM+EG+ NGHPGCLNSGNSVKTILYDGILPE
Sbjct: 301 IRTFSVLSRLCLIRGILTAIPRPVLNIQFSMVEGDSNGHPGCLNSGNSVKTILYDGILPE 360
Query: 361 LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILRIMWT 420
LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRIL IMWT
Sbjct: 361 LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILSIMWT 420
Query: 421 NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGSRCKGRYVP 480
NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYL+KIAFD+LRLGSRCKGRYVP
Sbjct: 421 NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLRKIAFDILRLGSRCKGRYVP 480
Query: 481 LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG 540
LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG
Sbjct: 481 LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG 540
Query: 541 GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPSSRDN 600
GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFE+DLDSIFPMLAFISVWPSSRDN
Sbjct: 541 GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEVDLDSIFPMLAFISVWPSSRDN 600
Query: 601 GVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSVNEIEYFS 660
G+LYP QGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQS +EIEYFS
Sbjct: 601 GILYPSINQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSFHEIEYFS 660
Query: 661 CHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLLKKAV 720
+ALV VKGVKVEILVEWLLLALTHVDE+LRVDAAEFLFLNPKTSSLPSHLELTLLKKA+
Sbjct: 661 RYALVSVKGVKVEILVEWLLLALTHVDETLRVDAAEFLFLNPKTSSLPSHLELTLLKKAI 720
Query: 721 PLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDLPNGSEQII 780
PLNMRC+STAFQMKWSSLFRKFFSRVRTALER+FKLGNWIPLASCCNSES +PNGSEQI+
Sbjct: 721 PLNMRCTSTAFQMKWSSLFRKFFSRVRTALERKFKLGNWIPLASCCNSESYMPNGSEQIV 780
Query: 781 TGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSSETLLL 840
GRADDL FMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWS+VPSKEKS+ETLLL
Sbjct: 781 AGRADDLFQFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSIVPSKEKSNETLLL 840
Query: 841 PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITWAKML 900
PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVI WAK+L
Sbjct: 841 PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVIKWAKVL 900
Query: 901 VCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLNKLTNVGEELKKSNHPV 960
VCSSRVRESDAGALALRLVFRKYVLDLGWIVRAS VVCLDSLNKL NV EE+ KSNHPV
Sbjct: 901 VCSSRVRESDAGALALRLVFRKYVLDLGWIVRASDAVVCLDSLNKLPNVREEICKSNHPV 960
Query: 961 AEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNSDVVLSSILEMR 1020
+EYLKSLIDWL++SV+EGE NL EACKNSFVHGVLLTLRY+FEELDWNSDVVLSSI EMR
Sbjct: 961 SEYLKSLIDWLNVSVTEGEMNLSEACKNSFVHGVLLTLRYSFEELDWNSDVVLSSISEMR 1020
Query: 1021 SLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPDEADVSTSLSELED 1080
SLLEKLLEL MRITSLALWVVSADAW+LPEDM DMVDDDAF+LDVPDE +VSTSLSELED
Sbjct: 1021 SLLEKLLELVMRITSLALWVVSADAWHLPEDMGDMVDDDAFVLDVPDETNVSTSLSELED 1080
Query: 1081 SKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPIASIISRQQ 1140
SKEKTT NSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDS+E DP SI+ RQ+
Sbjct: 1081 SKEKTTDNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSIEFDPNDSIMPRQE 1140
Query: 1141 EVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQ 1200
EVLD+KQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCS+DQRLCKLTESWMDQ
Sbjct: 1141 EVLDVKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSDDQRLCKLTESWMDQ 1200
Query: 1201 LMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP 1260
LMERT+A+GQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP
Sbjct: 1201 LMERTTARGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP 1260
Query: 1261 IDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRAAFND 1320
I+ + KNSNFSKLPST L QDT+ +S HE YPSEKASKIRDEGVIPTVHAFNVLRAAFND
Sbjct: 1261 IERDCKNSNFSKLPSTGLSQDTKPISTHENYPSEKASKIRDEGVIPTVHAFNVLRAAFND 1320
Query: 1321 TNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL 1380
TNLATDTSGFSAQAIIV IRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL
Sbjct: 1321 TNLATDTSGFSAQAIIVCIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL 1380
Query: 1381 TGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSRLKPS 1440
TGLEFFHRYPALHRFLL+EL+VATESLDDGCSG+SKFNLA +VHPSLCPMLILLSRLKPS
Sbjct: 1381 TGLEFFHRYPALHRFLLEELEVATESLDDGCSGDSKFNLAKIVHPSLCPMLILLSRLKPS 1440
Query: 1441 TIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIASGLPV 1500
TIASE GDDLDPFL MPFIRKCSSQSNLRIRILASRALTGLVSNENL VILNIASGLPV
Sbjct: 1441 TIASEAGDDLDPFLFMPFIRKCSSQSNLRIRILASRALTGLVSNENLPSVILNIASGLPV 1500
Query: 1501 DDNAMMAHED----ATATTQHASYNRIHGILLQLISLLDTNCRNLADILKKSQVLNDLAE 1560
DDN M E ATATTQH SYNRIHGILLQLISLLDTNCRNL DI KK ++LNDL E
Sbjct: 1501 DDNTTMGCESSILLATATTQHTSYNRIHGILLQLISLLDTNCRNLGDISKKRRILNDLVE 1560
Query: 1561 VLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLDV 1620
VLA CSWMAR CSCPILSTS+L+VLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLDV
Sbjct: 1561 VLAHCSWMARTSHCSCPILSTSMLQVLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLDV 1620
Query: 1621 EASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAALQKSQRSQSDEDVPATLMDYP 1680
E SH+LSYYDPT+AELRQQA+ICYFNCV QPFDEEDDAALQKSQRSQSDEDVP TL DY
Sbjct: 1621 ETSHKLSYYDPTLAELRQQAAICYFNCVLQPFDEEDDAALQKSQRSQSDEDVPGTLRDYS 1680
Query: 1681 FSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGLYDLSCHEIRTVDHWIKTNLQ 1740
FSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYS G YDLSCHEIRTVD WIKTNLQ
Sbjct: 1681 FSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSAGSYDLSCHEIRTVDQWIKTNLQ 1740
Query: 1741 ALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECTEEVVYIGKMDCGSVLQFWDK 1800
+LLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEEC E+VVYIGKMDC SVLQFWDK
Sbjct: 1741 SLLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECAEDVVYIGKMDCESVLQFWDK 1800
Query: 1801 LISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDATTTESPNDRISNNLDEFHSC 1860
LISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACI+SDATT ESPN RISNNLD++HSC
Sbjct: 1801 LISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIVSDATTIESPNGRISNNLDKYHSC 1860
Query: 1861 ITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDFVFDNQIPRDTSKSHFEQ 1920
ITLFTDLI+QHSAASEPVNMRTAAADSIIASGLLEQAEIFGD+VFDNQIP+ T+ SH E
Sbjct: 1861 ITLFTDLIKQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDYVFDNQIPQATANSHCEH 1920
Query: 1921 REYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFSSERTTTSSDVPNQVEQVI 1980
REY NMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCF ERTTTSSDVPNQVEQVI
Sbjct: 1921 REYANMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFRLERTTTSSDVPNQVEQVI 1980
Query: 1981 GSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISPVDPVRRVFDKEIDNHHEEKLLISQT 2040
GSSFEYLSSIFGHWVLYFDYLANWVLN+ NYTISP DPVRRVFDKEIDNHHEEKLLI QT
Sbjct: 1981 GSSFEYLSSIFGHWVLYFDYLANWVLNTANYTISPADPVRRVFDKEIDNHHEEKLLICQT 2040
Query: 2041 CCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMNKHGGFDWIGGAGNH 2100
CC HMEKLS S+L+ALWDTQWF+NYLV LRKRFFHQLI+FSDE+M+KH GFDWIGGAGNH
Sbjct: 2041 CCFHMEKLSSSRLIALWDTQWFMNYLVSLRKRFFHQLIRFSDEYMSKHSGFDWIGGAGNH 2100
Query: 2101 KDAFLLLYSNLLGFYALSNCIVNGKTK-VTMQPLVTEVVEIGKIINPFLRNPLISNLYLL 2160
KDAFL LY+NLLGF A+SNCIVNGK+K VTMQP VTEVVEIGKIINPFLRNPLISNLYLL
Sbjct: 2101 KDAFLPLYTNLLGFCAISNCIVNGKSKVVTMQPFVTEVVEIGKIINPFLRNPLISNLYLL 2160
Query: 2161 VIRIHKEAIDINRDHKIPELGHEAIWEGFDPYFLLR 2192
VIRIHKEAID+NRDH IPE GHEAIWEGFDPYFLLR
Sbjct: 2161 VIRIHKEAIDVNRDHNIPEHGHEAIWEGFDPYFLLR 2196
BLAST of HG10019700 vs. ExPASy TrEMBL
Match:
A0A1S3B8Q1 (uncharacterized protein LOC103487009 OS=Cucumis melo OX=3656 GN=LOC103487009 PE=4 SV=1)
HSP 1 Score: 4000.3 bits (10373), Expect = 0.0e+00
Identity = 2000/2196 (91.07%), Postives = 2086/2196 (94.99%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNSFQYPSKFFTELLQLVSLNSVYAQVNHAKK 60
MSAKWRALQHRHRYTYSAIVFP S+VDSLNSF+ SKFFTELLQLVSLNSVYAQVNHAKK
Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPNSFVDSLNSFRSSSKFFTELLQLVSLNSVYAQVNHAKK 60
Query: 61 VASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDRLGECFRN 120
VASAFSELLANGDEDSV KAARFYLEVLF ENSQPLHRTLVSTLAKSRKFQD LGECFR+
Sbjct: 61 VASAFSELLANGDEDSVFKAARFYLEVLFFENSQPLHRTLVSTLAKSRKFQDPLGECFRD 120
Query: 121 LCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDTVSSLDSVVKET 180
LCE+HSG+LQGGEKRFCVSRVALSVMGMPKLGYLVD+I+DCA+LVARD VSSLD VVKET
Sbjct: 121 LCEKHSGVLQGGEKRFCVSRVALSVMGMPKLGYLVDVIKDCALLVARDIVSSLDYVVKET 180
Query: 181 NESARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDPSVMGMIVRTILSILKSLAFSRDCY 240
NESARPSPI+MEQCQEALSCLYYLLQRFP+KFQED V+ MI+ +ILSILKSLAFSRDCY
Sbjct: 181 NESARPSPIIMEQCQEALSCLYYLLQRFPAKFQEDSDVLRMILSSILSILKSLAFSRDCY 240
Query: 241 VAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFRNAIAKVPHQANVCTE 300
VAAGVSFCASLQVCLNS ELG+LIFYGI EQTNHI+FLKY+SEFRN + KVPHQANVC E
Sbjct: 241 VAAGVSFCASLQVCLNSEELGVLIFYGILEQTNHISFLKYDSEFRNTVGKVPHQANVCAE 300
Query: 301 IQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNSGNSVKTILYDGILPE 360
I+TFSVLSRLCLIRGILTAIPRPVLNI FSM+EG+ NGHPGCLNSGNSVKTILYDGILPE
Sbjct: 301 IRTFSVLSRLCLIRGILTAIPRPVLNIQFSMVEGDSNGHPGCLNSGNSVKTILYDGILPE 360
Query: 361 LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILRIMWT 420
LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRIL IMWT
Sbjct: 361 LCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILSIMWT 420
Query: 421 NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGSRCKGRYVP 480
NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYL+KIAFD+LRLGSRCKGRYVP
Sbjct: 421 NLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLRKIAFDILRLGSRCKGRYVP 480
Query: 481 LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG 540
LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG
Sbjct: 481 LASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSDGIEG 540
Query: 541 GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPSSRDN 600
GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFE+DLDSIFPMLAFISVWPSSRDN
Sbjct: 541 GYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEVDLDSIFPMLAFISVWPSSRDN 600
Query: 601 GVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSVNEIEYFS 660
G+LYP QGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQS +EIEYFS
Sbjct: 601 GILYPSINQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSFHEIEYFS 660
Query: 661 CHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLLKKAV 720
+ALV VKGVKVEILVEWLLLALTHVDE+LRVDAAEFLFLNPKTSSLPSHLELTLLKKA+
Sbjct: 661 RYALVSVKGVKVEILVEWLLLALTHVDETLRVDAAEFLFLNPKTSSLPSHLELTLLKKAI 720
Query: 721 PLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDLPNGSEQII 780
PLNMRC+STAFQMKWSSLFRKFFSRVRTALER+FKLGNWIPLASCCNSES +PNGSEQI+
Sbjct: 721 PLNMRCTSTAFQMKWSSLFRKFFSRVRTALERKFKLGNWIPLASCCNSESYMPNGSEQIV 780
Query: 781 TGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSSETLLL 840
GRADDL FMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWS+VPSKEKS+ETLLL
Sbjct: 781 AGRADDLFQFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSIVPSKEKSNETLLL 840
Query: 841 PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITWAKML 900
PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVI WAK+L
Sbjct: 841 PYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVIKWAKVL 900
Query: 901 VCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLNKLTNVGEELKKSNHPV 960
VCSSRVRESDAGALALRLVFRKYVLDLGWIVRAS VVCLDSLNKL NV EE+ KSNHPV
Sbjct: 901 VCSSRVRESDAGALALRLVFRKYVLDLGWIVRASDAVVCLDSLNKLPNVREEICKSNHPV 960
Query: 961 AEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNSDVVLSSILEMR 1020
+EYLKSLIDWL++SV+EGE NL EACKNSFVHGVLLTLRY+FEELDWNSDVVLSSI EMR
Sbjct: 961 SEYLKSLIDWLNVSVTEGEMNLSEACKNSFVHGVLLTLRYSFEELDWNSDVVLSSISEMR 1020
Query: 1021 SLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPDEADVSTSLSELED 1080
SLLEKLLEL MRITSLALWVVSADAW+LPEDM DMVDDDAF+LDVPDE +VSTSLSELED
Sbjct: 1021 SLLEKLLELVMRITSLALWVVSADAWHLPEDMGDMVDDDAFVLDVPDETNVSTSLSELED 1080
Query: 1081 SKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPIASIISRQQ 1140
SKEKTT NSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDS+E DP SI+ RQ+
Sbjct: 1081 SKEKTTDNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSIEFDPNDSIMPRQE 1140
Query: 1141 EVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTESWMDQ 1200
EVLD+KQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCS+DQRLCKLTESWMDQ
Sbjct: 1141 EVLDVKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSDDQRLCKLTESWMDQ 1200
Query: 1201 LMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP 1260
LMERT+A+GQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP
Sbjct: 1201 LMERTTARGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERLLQNP 1260
Query: 1261 IDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRAAFND 1320
I+ + KNSNFSKLPST L QDT+ +S HE YPSEKASKIRDEGVIPTVHAFNVLRAAFND
Sbjct: 1261 IERDCKNSNFSKLPSTGLSQDTKPISTHENYPSEKASKIRDEGVIPTVHAFNVLRAAFND 1320
Query: 1321 TNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL 1380
TNLATDTSGFSAQAIIV IRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL
Sbjct: 1321 TNLATDTSGFSAQAIIVCIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESARRAL 1380
Query: 1381 TGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSRLKPS 1440
TGLEFFHRYPALHRFLL+EL+VATESLDDGCSG+SKFNLA +VHPSLCPMLILLSRLKPS
Sbjct: 1381 TGLEFFHRYPALHRFLLEELEVATESLDDGCSGDSKFNLAKIVHPSLCPMLILLSRLKPS 1440
Query: 1441 TIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIASGLPV 1500
TIASE GDDLDPFL MPFIRKCSSQSNLRIRILASRALTGLVSNENL VILNIASGLPV
Sbjct: 1441 TIASEAGDDLDPFLFMPFIRKCSSQSNLRIRILASRALTGLVSNENLPSVILNIASGLPV 1500
Query: 1501 DDNAMMAHED----ATATTQHASYNRIHGILLQLISLLDTNCRNLADILKKSQVLNDLAE 1560
DDN M E ATATTQH SYNRIHGILLQLISLLDTNCRNL DI KK ++LNDL E
Sbjct: 1501 DDNTTMGCESSILLATATTQHTSYNRIHGILLQLISLLDTNCRNLGDISKKRRILNDLVE 1560
Query: 1561 VLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLDV 1620
VLA CSWMAR CSCPILSTS+L+VLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLDV
Sbjct: 1561 VLAHCSWMARTSHCSCPILSTSMLQVLGHMLSIVRTCPRSKSFYIIRNLLLDLSTECLDV 1620
Query: 1621 EASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAALQKSQRSQSDEDVPATLMDYP 1680
E SH+LSYYDPT+AELRQQA+ICYFNCV QPFDEEDDAALQKSQRSQSDEDVP TL DY
Sbjct: 1621 ETSHKLSYYDPTLAELRQQAAICYFNCVLQPFDEEDDAALQKSQRSQSDEDVPGTLRDYS 1680
Query: 1681 FSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGLYDLSCHEIRTVDHWIKTNLQ 1740
FSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYS G YDLSCHEIRTVD WIKTNLQ
Sbjct: 1681 FSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSAGSYDLSCHEIRTVDQWIKTNLQ 1740
Query: 1741 ALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECTEEVVYIGKMDCGSVLQFWDK 1800
+LLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEEC E+VVYIGKMDC SVLQFWDK
Sbjct: 1741 SLLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECAEDVVYIGKMDCESVLQFWDK 1800
Query: 1801 LISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDATTTESPNDRISNNLDEFHSC 1860
LISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACI+SDATT ESPN RISNNLD++HSC
Sbjct: 1801 LISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIVSDATTIESPNGRISNNLDKYHSC 1860
Query: 1861 ITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDFVFDNQIPRDTSKSHFEQ 1920
ITLFTDLI+QHSAASEPVNMRTAAADSIIASGLLEQAEIFGD+VFDNQIP+ T+ SH E
Sbjct: 1861 ITLFTDLIKQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDYVFDNQIPQATANSHCEH 1920
Query: 1921 REYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFSSERTTTSSDVPNQVEQVI 1980
REY NMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCF ERTTTSSDVPNQVEQVI
Sbjct: 1921 REYANMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFRLERTTTSSDVPNQVEQVI 1980
Query: 1981 GSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISPVDPVRRVFDKEIDNHHEEKLLISQT 2040
GSSFEYLSSIFGHWVLYFDYLANWVLN+ NYTISP DPVRRVFDKEIDNHHEEKLLI QT
Sbjct: 1981 GSSFEYLSSIFGHWVLYFDYLANWVLNTANYTISPADPVRRVFDKEIDNHHEEKLLICQT 2040
Query: 2041 CCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMNKHGGFDWIGGAGNH 2100
CC HMEKLS S+L+ALWDTQWF+NYLV LRKRFFHQLI+FSDE+M+KH GFDWIGGAGNH
Sbjct: 2041 CCFHMEKLSSSRLIALWDTQWFMNYLVSLRKRFFHQLIRFSDEYMSKHSGFDWIGGAGNH 2100
Query: 2101 KDAFLLLYSNLLGFYALSNCIVNGKTK-VTMQPLVTEVVEIGKIINPFLRNPLISNLYLL 2160
KDAFL LY+NLLGF A+SNCIVNGK+K VTMQP VTEVVEIGKIINPFLRNPLISNLYLL
Sbjct: 2101 KDAFLPLYTNLLGFCAISNCIVNGKSKVVTMQPFVTEVVEIGKIINPFLRNPLISNLYLL 2160
Query: 2161 VIRIHKEAIDINRDHKIPELGHEAIWEGFDPYFLLR 2192
VIRIHKEAID+NRDH IPE GHEAIWEGFDPYFLLR
Sbjct: 2161 VIRIHKEAIDVNRDHNIPEHGHEAIWEGFDPYFLLR 2196
BLAST of HG10019700 vs. ExPASy TrEMBL
Match:
A0A6J1F3Z3 (thyroid adenoma-associated protein homolog OS=Cucurbita moschata OX=3662 GN=LOC111441902 PE=4 SV=1)
HSP 1 Score: 3910.1 bits (10139), Expect = 0.0e+00
Identity = 1961/2202 (89.06%), Postives = 2062/2202 (93.64%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNSF----QYPSKFFTELLQLVSLNSVYAQVN 60
MSAKWRALQHRHRYTYSAIVFP SY+DSLNSF Q SKFFTELL+LVSLNSVYAQVN
Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPHSYIDSLNSFQSQHQSSSKFFTELLELVSLNSVYAQVN 60
Query: 61 HAKKVASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDRLGE 120
HAKKVASAFSELLANGDEDSVS+AARFYLEVLFCENSQPLHRTLVSTLAKSRKFQD LGE
Sbjct: 61 HAKKVASAFSELLANGDEDSVSRAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDSLGE 120
Query: 121 CFRNLCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDTVSSLDSV 180
CFRNLCEEHSGM QGG+KRFCVSRVALSVMGMPKLGYLVD+IRDCAILV+RD VSSLDSV
Sbjct: 121 CFRNLCEEHSGMQQGGDKRFCVSRVALSVMGMPKLGYLVDVIRDCAILVSRDIVSSLDSV 180
Query: 181 VKETNESARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDPSVMGMIVRTILSILKSLAFS 240
VKETN+ ARPSPIV+EQCQEALSCLYYLLQRFPSKF ED SVMGMIV TILSILKSLAFS
Sbjct: 181 VKETNDLARPSPIVIEQCQEALSCLYYLLQRFPSKFLEDSSVMGMIVSTILSILKSLAFS 240
Query: 241 RDCYVAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFRNAIAKVPHQAN 300
RDCYV+AGVSFCASLQVCLNS ELG+LIFYGIFEQTNHI+ LKYE+EFRNA+AKVP+QAN
Sbjct: 241 RDCYVSAGVSFCASLQVCLNSEELGVLIFYGIFEQTNHISCLKYENEFRNAVAKVPYQAN 300
Query: 301 VCTEIQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNSGNSVKTILYDG 360
VC EIQTFSVLSRLCLIRGILTAIPRPVLNIPFSM+EG+LNGHPGCL SGNSVKTILYD
Sbjct: 301 VCAEIQTFSVLSRLCLIRGILTAIPRPVLNIPFSMIEGDLNGHPGCLYSGNSVKTILYDA 360
Query: 361 ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR 420
ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR
Sbjct: 361 ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR 420
Query: 421 IMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGSRCKG 480
IMWTNL+DPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEK K YL+KIAFDLL LGSRCKG
Sbjct: 421 IMWTNLEDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKTKSYLRKIAFDLLHLGSRCKG 480
Query: 481 RYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD 540
RYVPLASLTKRLGAKALLDMSPSLLS+TVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD
Sbjct: 481 RYVPLASLTKRLGAKALLDMSPSLLSDTVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD 540
Query: 541 GIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPS 600
GIEGGYALYRG CLPP+L GLGSGISKLRSNLNTYALPVLFEID+DSIFPMLAFISVWPS
Sbjct: 541 GIEGGYALYRGRCLPPILRGLGSGISKLRSNLNTYALPVLFEIDIDSIFPMLAFISVWPS 600
Query: 601 SRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSVNEI 660
+ DNGVLYPG+ +GSMELRVEQKVAIFISL KVSRSLALIEGDIDWLEK SLEQ+ +EI
Sbjct: 601 ACDNGVLYPGNNEGSMELRVEQKVAIFISLFKVSRSLALIEGDIDWLEKRSLEQRFSHEI 660
Query: 661 EYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLL 720
EYF CHALVF+KGVKVEILVEWL+LALTHVDESLRVDAAEF+FLNPKTSSLPSHLELTLL
Sbjct: 661 EYFGCHALVFIKGVKVEILVEWLVLALTHVDESLRVDAAEFIFLNPKTSSLPSHLELTLL 720
Query: 721 KKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDLPNGS 780
KKA+PLNMRCSSTAFQMKW+SLFRKFFSRVRTALERQFK G+WIP AS + ES LPNG+
Sbjct: 721 KKAIPLNMRCSSTAFQMKWTSLFRKFFSRVRTALERQFKQGSWIPRASSSSRESYLPNGN 780
Query: 781 EQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSSE 840
EQ I GRADDL FMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVW+VVPSKEKS+E
Sbjct: 781 EQTIAGRADDLFRFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWAVVPSKEKSNE 840
Query: 841 TLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITW 900
TLLLPYNEGITLPDSVLLLVGSIIDSWD LRE+SFRILLHFPTPLPGISSE+MVG+VI W
Sbjct: 841 TLLLPYNEGITLPDSVLLLVGSIIDSWDSLRESSFRILLHFPTPLPGISSEHMVGEVIAW 900
Query: 901 AKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCL--DSLNKLTNVGEELK 960
AK LVCSSRVRESDAGAL LRLVFRKYVLDLGWIVRASG VVCL DS++KL NVGEE+
Sbjct: 901 AKKLVCSSRVRESDAGALTLRLVFRKYVLDLGWIVRASGDVVCLDSDSVHKLPNVGEEIC 960
Query: 961 KSNHPVAEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNSDVVLS 1020
+SNHPVAEYLKSLIDWL+ISV+EGERNL EACKNSFVHGVLL LRY FEELDW+SD+VLS
Sbjct: 961 RSNHPVAEYLKSLIDWLNISVTEGERNLAEACKNSFVHGVLLALRYTFEELDWSSDIVLS 1020
Query: 1021 SILEMRSLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPDEADVSTS 1080
S+ E+RSLLEKLLEL MRITSLAL VVSADAWYLPEDMDDM DDDAF+LDVPDEAD STS
Sbjct: 1021 SLSEIRSLLEKLLELVMRITSLALGVVSADAWYLPEDMDDMDDDDAFLLDVPDEADASTS 1080
Query: 1081 LSELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPIAS 1140
LSELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDP AS
Sbjct: 1081 LSELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPNAS 1140
Query: 1141 IISRQQEVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLT 1200
II + EVLDL+QLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLT
Sbjct: 1141 IILKHDEVLDLRQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLT 1200
Query: 1201 ESWMDQLMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAE 1260
ESWMDQLMER +A GQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAE
Sbjct: 1201 ESWMDQLMERMTANGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAE 1260
Query: 1261 RLLQNPIDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVL 1320
RLL NPID++ KN NFS ELGQDTESVS HETY SEKASKIRDEGVIPTVHAFNVL
Sbjct: 1261 RLLLNPIDSDCKNRNFS-----ELGQDTESVSPHETYASEKASKIRDEGVIPTVHAFNVL 1320
Query: 1321 RAAFNDTNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRE 1380
RA+FND NLATDTSGFSAQAIIV+IR+FSS YWEVRNSACLAYTALVRRM+GFLNVHKRE
Sbjct: 1321 RASFNDANLATDTSGFSAQAIIVSIRAFSSSYWEVRNSACLAYTALVRRMVGFLNVHKRE 1380
Query: 1381 SARRALTGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILL 1440
SARRALTGLEFFHRYPALHRFLLDELKVAT+SLDDGCSGN++ NLA VVHPSLCP+LILL
Sbjct: 1381 SARRALTGLEFFHRYPALHRFLLDELKVATDSLDDGCSGNAESNLAKVVHPSLCPVLILL 1440
Query: 1441 SRLKPSTIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNI 1500
SRLKPSTI SE GDDLDPFL MPFIRKCSSQSNLRIR+LASRALTGLVSNENL VILNI
Sbjct: 1441 SRLKPSTIVSEAGDDLDPFLFMPFIRKCSSQSNLRIRVLASRALTGLVSNENLPSVILNI 1500
Query: 1501 ASGLPVDDNAMMAHE-----DATATTQHASYNRIHGILLQLISLLDTNCRNLADILKKSQ 1560
ASGLPVDD MMA E D TATT+ +SYN+IHGILLQLISLLDTNCRNLADI KKSQ
Sbjct: 1501 ASGLPVDDTTMMAPESSTVLDVTATTRRSSYNKIHGILLQLISLLDTNCRNLADISKKSQ 1560
Query: 1561 VLNDLAEVLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRSKSFYIIRNLLLDL 1620
+LNDL EVL RCSWMA+RR CSCPIL TS LRVLGHMLSIVRTCPRSKS YIIRNLLLD+
Sbjct: 1561 ILNDLVEVLGRCSWMAKRRHCSCPILGTSFLRVLGHMLSIVRTCPRSKSLYIIRNLLLDV 1620
Query: 1621 STECLDVEASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAALQKSQRSQSDEDVP 1680
STECLD+E H+LS+YDPT+AELRQQA+ICYFNCV QPFDEED AA+QKSQRS+SDEDVP
Sbjct: 1621 STECLDMETYHKLSFYDPTLAELRQQAAICYFNCVLQPFDEEDYAAIQKSQRSESDEDVP 1680
Query: 1681 ATLMDYPFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGLYDLSCHEIRTVDH 1740
ATL++YPF QLQERLIRSLQDPCYEVRLST+KWLFKFLKSTEYSDG DLS HEI TVDH
Sbjct: 1681 ATLINYPFPQLQERLIRSLQDPCYEVRLSTLKWLFKFLKSTEYSDGFNDLSIHEITTVDH 1740
Query: 1741 WIKTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECTEEVVYIGKMDCGS 1800
W KTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGN ECTEEVVYIGKMDCGS
Sbjct: 1741 WTKTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNVECTEEVVYIGKMDCGS 1800
Query: 1801 VLQFWDKLISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDATTTESPNDRISNN 1860
VLQFWDKLISLYKLT+HAKTRE T+RCMGTCIKR AV YSA I+SDA ESP DR SNN
Sbjct: 1801 VLQFWDKLISLYKLTKHAKTRETTLRCMGTCIKRCAVLYSASIVSDAMMGESPKDRTSNN 1860
Query: 1861 LDEFHSCITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDFVFDNQIPRDT 1920
L+EF SCITLFTDLI QHSAASEPVNMRTAAADSIIASGLLEQAEIFGD++FDNQIP++T
Sbjct: 1861 LEEFQSCITLFTDLISQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDYMFDNQIPQET 1920
Query: 1921 SKSHFEQREYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFSSERTTTSSDVP 1980
S SHFEQR+YVNMYAHQILN+WSTCIMLLEDEDD+IRK LAADVQKCFSSERTTTSSD
Sbjct: 1921 SNSHFEQRDYVNMYAHQILNIWSTCIMLLEDEDDEIRKSLAADVQKCFSSERTTTSSDAR 1980
Query: 1981 NQVEQVIGSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISPVDPVRRVFDKEIDNHHEE 2040
QVEQVIGSSFEYLSSIFGHWV YFDYLANWVLN+ NY SP DPVRRVFDKEIDNHHEE
Sbjct: 1981 TQVEQVIGSSFEYLSSIFGHWVRYFDYLANWVLNTANYAASPADPVRRVFDKEIDNHHEE 2040
Query: 2041 KLLISQTCCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMNKHGGFDW 2100
KLLISQTCCLH+EKLSRSKLVALWDTQWFINYLVGLRKRFF QLIKFSDEHM+KHGGFDW
Sbjct: 2041 KLLISQTCCLHLEKLSRSKLVALWDTQWFINYLVGLRKRFFRQLIKFSDEHMSKHGGFDW 2100
Query: 2101 IGGAGNHKDAFLLLYSNLLGFYALSNCIVNGKTKVTMQPLVTEVVEIGKIINPFLRNPLI 2160
IGGAGNHKDAFL LY NLLGFY++SNC++NGKT++ QPL TEVVEIGKIINPFLRNPLI
Sbjct: 2101 IGGAGNHKDAFLPLYGNLLGFYSISNCMINGKTQIITQPLDTEVVEIGKIINPFLRNPLI 2160
Query: 2161 SNLYLLVIRIHKEAIDINRDHKIPELGHEAIWEGFDPYFLLR 2192
SNLYLLVIRIHKEAID+NRDHKIPEL HE IWEGFDPYFLLR
Sbjct: 2161 SNLYLLVIRIHKEAIDVNRDHKIPELEHEGIWEGFDPYFLLR 2197
BLAST of HG10019700 vs. ExPASy TrEMBL
Match:
A0A6J1J6K6 (thyroid adenoma-associated protein homolog OS=Cucurbita maxima OX=3661 GN=LOC111482224 PE=4 SV=1)
HSP 1 Score: 3905.5 bits (10127), Expect = 0.0e+00
Identity = 1954/2201 (88.78%), Postives = 2060/2201 (93.59%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNSF----QYPSKFFTELLQLVSLNSVYAQVN 60
MSAKWRALQHRHRYTYSAIVFP SY+DSLNSF Q SKFFTELL+LVSLNSVYAQVN
Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPRSYIDSLNSFQSQHQSSSKFFTELLELVSLNSVYAQVN 60
Query: 61 HAKKVASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDRLGE 120
HAKKVASAFSELLANGDEDSVS+AARFYLEVLFCENSQPLHRTLVSTLAKSRKFQD LGE
Sbjct: 61 HAKKVASAFSELLANGDEDSVSRAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDSLGE 120
Query: 121 CFRNLCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDTVSSLDSV 180
CFRNLCEEHSGM Q G+KRFCVSRVALS+MGMPKLGYLVD+IRDCAILV+RD VSSLDSV
Sbjct: 121 CFRNLCEEHSGMQQDGDKRFCVSRVALSIMGMPKLGYLVDVIRDCAILVSRDIVSSLDSV 180
Query: 181 VKETNESARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDPSVMGMIVRTILSILKSLAFS 240
VKETN+ ARPSPIV+EQCQEALSCLYYLLQRFPSKF ED SVMGMIV TILSILKSLAFS
Sbjct: 181 VKETNDLARPSPIVIEQCQEALSCLYYLLQRFPSKFLEDSSVMGMIVSTILSILKSLAFS 240
Query: 241 RDCYVAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFRNAIAKVPHQAN 300
RDCYV+AGVSFCASLQVCLNS ELG+LIFYGIFEQTNHI+ LKYE EFRNA+AKVP+QAN
Sbjct: 241 RDCYVSAGVSFCASLQVCLNSEELGVLIFYGIFEQTNHISCLKYEDEFRNAVAKVPYQAN 300
Query: 301 VCTEIQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNSGNSVKTILYDG 360
VC EIQTFSVLSRLCLIRGILTAIPRPVLNIPFSM+EG+LNGHP CLNSGNSVKTILYD
Sbjct: 301 VCAEIQTFSVLSRLCLIRGILTAIPRPVLNIPFSMIEGDLNGHPDCLNSGNSVKTILYDA 360
Query: 361 ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR 420
ILPELCNY ENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR
Sbjct: 361 ILPELCNYSENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR 420
Query: 421 IMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGSRCKG 480
IMWTNL+DPLSQTVKQVHLIFDLFLEIQSSLCWSE SEK YLQKIAFDLL LGSRCKG
Sbjct: 421 IMWTNLEDPLSQTVKQVHLIFDLFLEIQSSLCWSESSEKTTSYLQKIAFDLLHLGSRCKG 480
Query: 481 RYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD 540
RYVPLASLTKRLGAKA+LDMSPSLLS+TVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD
Sbjct: 481 RYVPLASLTKRLGAKAMLDMSPSLLSDTVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD 540
Query: 541 GIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPS 600
GIEGGYALYRG CLPPVL GLGSGISKLRSNLNTYALPVLFEID+DSIFPMLAFISVWPS
Sbjct: 541 GIEGGYALYRGRCLPPVLRGLGSGISKLRSNLNTYALPVLFEIDIDSIFPMLAFISVWPS 600
Query: 601 SRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSVNEI 660
+ DNGVLYPG+ +GSMELRVEQKVAIFISL KVSRSLALIEGDI+WLEK SLEQ+ +EI
Sbjct: 601 ACDNGVLYPGNNEGSMELRVEQKVAIFISLFKVSRSLALIEGDINWLEKRSLEQRFAHEI 660
Query: 661 EYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLL 720
EYF CHA VF+KGVKVEILVEWL+LALTHVDESLRVDAAEF+FLNPKTSSLPSHLELTLL
Sbjct: 661 EYFGCHAFVFIKGVKVEILVEWLVLALTHVDESLRVDAAEFIFLNPKTSSLPSHLELTLL 720
Query: 721 KKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDLPNGS 780
KKA+PLNMRCSSTAFQMKW+SLFRKFFSRVRTALERQFK G+WIP AS N ES LPNG+
Sbjct: 721 KKAIPLNMRCSSTAFQMKWTSLFRKFFSRVRTALERQFKQGSWIPRASSSNRESYLPNGN 780
Query: 781 EQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSSE 840
EQ I GRA+DL FMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSV PSKEKS+E
Sbjct: 781 EQTIAGRANDLFSFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVFPSKEKSNE 840
Query: 841 TLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITW 900
TLLLPYNEGITLPDSVLLLVGSIIDSWD LRE+SFRILLHFPTPLPGISSE+MVG+VITW
Sbjct: 841 TLLLPYNEGITLPDSVLLLVGSIIDSWDSLRESSFRILLHFPTPLPGISSEHMVGEVITW 900
Query: 901 AKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLNKLTNVGEELKKS 960
AK LVCSSRVRESDAGAL LRLVFRKYVLDLGWIVRASG VVCLDS++KL NVGEE+ +S
Sbjct: 901 AKKLVCSSRVRESDAGALTLRLVFRKYVLDLGWIVRASGDVVCLDSVHKLPNVGEEICRS 960
Query: 961 NHPVAEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNSDVVLSSI 1020
NHPVAEYLKSLIDWL+ISV+EGERNL EACKNSFVHGVLL LRY FEELDW+SD+VLSS+
Sbjct: 961 NHPVAEYLKSLIDWLNISVTEGERNLAEACKNSFVHGVLLALRYTFEELDWSSDIVLSSL 1020
Query: 1021 LEMRSLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPDEADVSTSLS 1080
EMRSLLEKLLEL MRITSLAL VVSADAWYLPEDMDDM DDDAF+LDVPDEAD STSLS
Sbjct: 1021 SEMRSLLEKLLELVMRITSLALCVVSADAWYLPEDMDDMDDDDAFLLDVPDEADASTSLS 1080
Query: 1081 ELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLP-AASDSVESDPIASI 1140
ELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLP AASDSVESDP ASI
Sbjct: 1081 ELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAAASDSVESDPNASI 1140
Query: 1141 ISRQQEVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTE 1200
I + EVLDL+QLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTE
Sbjct: 1141 ILKHDEVLDLRQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTE 1200
Query: 1201 SWMDQLMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAER 1260
SWMDQLMER +A GQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAER
Sbjct: 1201 SWMDQLMERMTANGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAER 1260
Query: 1261 LLQNPIDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLR 1320
LL NPID++ KN NF +LPSTE+GQDT+SVS HET SEKASKIRDEGVIPTVHAFNVLR
Sbjct: 1261 LLLNPIDSDCKNRNFPELPSTEIGQDTQSVSPHETNASEKASKIRDEGVIPTVHAFNVLR 1320
Query: 1321 AAFNDTNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRES 1380
A+FND NLATDTSGFSAQAIIV+IR+FSS YWEVRNSACLAYTALVRRMIGFLNVHKRES
Sbjct: 1321 ASFNDANLATDTSGFSAQAIIVSIRAFSSSYWEVRNSACLAYTALVRRMIGFLNVHKRES 1380
Query: 1381 ARRALTGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLS 1440
ARRALTGLEFFHRYPALHRFLLDELKVAT+SLDDGCSGN++ LA VVHPSLCP+LILLS
Sbjct: 1381 ARRALTGLEFFHRYPALHRFLLDELKVATDSLDDGCSGNAESTLAKVVHPSLCPVLILLS 1440
Query: 1441 RLKPSTIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIA 1500
RLKPSTI SE GDDLDPFL MPFIRKCSSQSNLRIR+LASRALTGLVSNENL VILNIA
Sbjct: 1441 RLKPSTIVSEAGDDLDPFLFMPFIRKCSSQSNLRIRVLASRALTGLVSNENLPSVILNIA 1500
Query: 1501 SGLPVDDNAMMAHE-----DATATTQHASYNRIHGILLQLISLLDTNCRNLADILKKSQV 1560
SGLP+DDN +MA E D TATTQ +SYN+IHGILLQLISLLDTNCRNLADI KKSQ+
Sbjct: 1501 SGLPIDDNTIMAPESSTVVDVTATTQRSSYNKIHGILLQLISLLDTNCRNLADISKKSQI 1560
Query: 1561 LNDLAEVLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRSKSFYIIRNLLLDLS 1620
LNDL E L RCSWMA+RR CSCPIL TS LRVLGHMLSIVRTCPRSKS YIIRNLLLDLS
Sbjct: 1561 LNDLVEFLGRCSWMAKRRHCSCPILGTSFLRVLGHMLSIVRTCPRSKSLYIIRNLLLDLS 1620
Query: 1621 TECLDVEASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAALQKSQRSQSDEDVPA 1680
TECLD+E H+LSYYDPT+AELRQQA+ICYFNCV QPFDEED AA+QKSQRS+SDEDVPA
Sbjct: 1621 TECLDMETYHKLSYYDPTLAELRQQAAICYFNCVLQPFDEEDYAAIQKSQRSESDEDVPA 1680
Query: 1681 TLMDYPFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGLYDLSCHEIRTVDHW 1740
TL++YPF QLQERLIRSLQDPCYEVRLST+KWLFKFLKSTEYSDG DLS HEI+TVDHW
Sbjct: 1681 TLINYPFPQLQERLIRSLQDPCYEVRLSTLKWLFKFLKSTEYSDGFNDLSSHEIKTVDHW 1740
Query: 1741 IKTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECTEEVVYIGKMDCGSV 1800
KTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGN ECTEEVVYIGKM+CGSV
Sbjct: 1741 TKTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNVECTEEVVYIGKMNCGSV 1800
Query: 1801 LQFWDKLISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDATTTESPNDRISNNL 1860
LQFWDKLISLYKLT+HAKTRE T+RCMGTCIKR AV YS+ I+SDA ESP DR SNNL
Sbjct: 1801 LQFWDKLISLYKLTKHAKTRETTLRCMGTCIKRCAVLYSSSIVSDAMMGESPKDRTSNNL 1860
Query: 1861 DEFHSCITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDFVFDNQIPRDTS 1920
+EF SCI LFTDLI QHSAASEPVNMRTAAADSIIASGLLE+AEIFGD++FDNQIP++TS
Sbjct: 1861 EEFQSCIILFTDLISQHSAASEPVNMRTAAADSIIASGLLEEAEIFGDYMFDNQIPQETS 1920
Query: 1921 KSHFEQREYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFSSERTTTSSDVPN 1980
SHFEQR+YVNMYAHQILN+WSTCIMLLEDEDD+IRK LAADVQKCFSSERTTTSSD
Sbjct: 1921 NSHFEQRDYVNMYAHQILNIWSTCIMLLEDEDDEIRKSLAADVQKCFSSERTTTSSDART 1980
Query: 1981 QVEQVIGSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISPVDPVRRVFDKEIDNHHEEK 2040
QVEQVIGSSFEYLSSIFGHWV YFDYLANWVLN+ NY SP DPVRRVFDKEIDNHHEEK
Sbjct: 1981 QVEQVIGSSFEYLSSIFGHWVRYFDYLANWVLNTANYAASPADPVRRVFDKEIDNHHEEK 2040
Query: 2041 LLISQTCCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMNKHGGFDWI 2100
LLISQTCCLH+EKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHM+KHGGFDWI
Sbjct: 2041 LLISQTCCLHLEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMSKHGGFDWI 2100
Query: 2101 GGAGNHKDAFLLLYSNLLGFYALSNCIVNGKTKVTMQPLVTEVVEIGKIINPFLRNPLIS 2160
GGAGNHKDAFL LY NLLGFY++SNC++NGKT+++ PL TEVVEIGKIINPFLRNPLIS
Sbjct: 2101 GGAGNHKDAFLPLYGNLLGFYSISNCMINGKTQISTLPLDTEVVEIGKIINPFLRNPLIS 2160
Query: 2161 NLYLLVIRIHKEAIDINRDHKIPELGHEAIWEGFDPYFLLR 2192
NLYLLV+RIHKEAID+NRDHKIPELGHE IWEGFDPYFLLR
Sbjct: 2161 NLYLLVVRIHKEAIDVNRDHKIPELGHEGIWEGFDPYFLLR 2201
BLAST of HG10019700 vs. ExPASy TrEMBL
Match:
A0A6J1BVK0 (thyroid adenoma-associated protein homolog OS=Momordica charantia OX=3673 GN=LOC111005108 PE=4 SV=1)
HSP 1 Score: 3784.6 bits (9813), Expect = 0.0e+00
Identity = 1904/2202 (86.47%), Postives = 2028/2202 (92.10%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNSF----QYPSKFFTELLQLVSLNSVYAQVN 60
MSAKWRALQHRHRYTYSAIVFP SYVDSL SF Q SKFF+EL++LVSLNSVYAQVN
Sbjct: 1 MSAKWRALQHRHRYTYSAIVFPHSYVDSLISFQSHHQSSSKFFSELIELVSLNSVYAQVN 60
Query: 61 HAKKVASAFSELLANGDEDSVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRKFQDRLGE 120
HAKKVASAF+ELLANGDED VSKA RF+LEVLFCENSQPLHRTLVSTLAKSR F D LG
Sbjct: 61 HAKKVASAFAELLANGDEDLVSKAERFFLEVLFCENSQPLHRTLVSTLAKSRSFHDSLGG 120
Query: 121 CFRNLCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDTVSSLDSV 180
CFR+LCEEHSG+ QG KRFCVSRVALSVMGMPKLGYLVD+IR+CAILVARD V LDSV
Sbjct: 121 CFRDLCEEHSGLQQGQGKRFCVSRVALSVMGMPKLGYLVDVIRECAILVARDIVFGLDSV 180
Query: 181 VKETNESARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDPSVMGMIVRTILSILKSLAFS 240
VKETNE ARPSPIVMEQCQEALSCLYYLLQRFPSKFQED SVM IV TILSILKS AF+
Sbjct: 181 VKETNEWARPSPIVMEQCQEALSCLYYLLQRFPSKFQEDSSVMETIVSTILSILKSSAFT 240
Query: 241 RDCYVAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFRNAIAKVPHQAN 300
RDC+VAAGVSFCASLQVCL S ELG+LIFYGIFEQ+ HI+F K+ESEFRNA++K+P+Q N
Sbjct: 241 RDCFVAAGVSFCASLQVCLTSQELGVLIFYGIFEQSTHISFSKFESEFRNAVSKIPYQGN 300
Query: 301 VCTEIQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNSGNSVKTILYDG 360
VC EIQTF+VLSRLCLIRGILTAIPR VLNIPFSM+EG+L+ HPGC+NSGN VKTILYDG
Sbjct: 301 VCAEIQTFAVLSRLCLIRGILTAIPRAVLNIPFSMIEGDLDDHPGCINSGNFVKTILYDG 360
Query: 361 ILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDPLPEEMGSRILR 420
ILPELC YCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLT SC+YDPLPEEMGSRILR
Sbjct: 361 ILPELCTYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTIISCNYDPLPEEMGSRILR 420
Query: 421 IMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAFDLLRLGSRCKG 480
IMWTNL+DPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIK YLQKIA DLL LGSRCKG
Sbjct: 421 IMWTNLEDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKSYLQKIALDLLHLGSRCKG 480
Query: 481 RYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLEHLRDECWSSD 540
RYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLE+LRDECWSSD
Sbjct: 481 RYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFLENLRDECWSSD 540
Query: 541 GIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIFPMLAFISVWPS 600
GIEGGYALYRGHCLPP+L+GL SGISKLRSNLNTYALPVLFEIDLDSIFPMLA ISVW S
Sbjct: 541 GIEGGYALYRGHCLPPILYGLASGISKLRSNLNTYALPVLFEIDLDSIFPMLASISVWSS 600
Query: 601 SRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQQSVNEI 660
S +NGVLYPG QGSMELRV+QKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQ+SV+EI
Sbjct: 601 SGENGVLYPGINQGSMELRVQQKVAIFISLLKVSRSLALIEGDIDWLEKPSLEQESVHEI 660
Query: 661 EYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLL 720
EYFSCHALVF+KGVKVEILVEWL+LALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLL
Sbjct: 661 EYFSCHALVFIKGVKVEILVEWLVLALTHVDESLRVDAAEFLFLNPKTSSLPSHLELTLL 720
Query: 721 KKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASCCNSESDLPNGS 780
KKA+PLNMRCSSTAFQMKW+SLFRKFFSRVRTALERQFK GNWIPLA+ CNS+ LPNGS
Sbjct: 721 KKAIPLNMRCSSTAFQMKWTSLFRKFFSRVRTALERQFKQGNWIPLAASCNSKCYLPNGS 780
Query: 781 EQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVWSVVPSKEKSSE 840
EQI GRADDL +FMKWLSC+LFFSCYPSAPY+RKIMAMDLFLVMLNVWS+VPSKEKS+E
Sbjct: 781 EQIELGRADDLFYFMKWLSCYLFFSCYPSAPYKRKIMAMDLFLVMLNVWSIVPSKEKSNE 840
Query: 841 TLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGISSEYMVGKVITW 900
TLL PYNEGITLPDSVLLLVGSIIDSWDRLRE+SFRILLHFPTPLPGISSEYMV KVITW
Sbjct: 841 TLLHPYNEGITLPDSVLLLVGSIIDSWDRLRESSFRILLHFPTPLPGISSEYMVSKVITW 900
Query: 901 AKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLNKLTNVGEELKKS 960
AK LVCSSRVRESDAGAL LRL+FRKYVLDLGWIVRAS VVCLDS KL VGE KS
Sbjct: 901 AKKLVCSSRVRESDAGALTLRLLFRKYVLDLGWIVRASVDVVCLDSQEKLPKVGE--CKS 960
Query: 961 NHPVAEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEELDWNSDVVLSSI 1020
NHPVAEYL+SLIDWL++SV+EGERNL EAC+NSFVHGVLLTLRY FEELDWNSD+VLSSI
Sbjct: 961 NHPVAEYLRSLIDWLNVSVTEGERNLSEACRNSFVHGVLLTLRYTFEELDWNSDLVLSSI 1020
Query: 1021 LEMRSLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLDVPDEADVSTSLS 1080
EMRSLLEKLLEL MRITSLALWVVSADAW+LPEDMDDMV+DDAF+LDVPDEADVSTSLS
Sbjct: 1021 TEMRSLLEKLLELVMRITSLALWVVSADAWHLPEDMDDMVEDDAFLLDVPDEADVSTSLS 1080
Query: 1081 ELEDSKEKTTVNSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPAASDSVESDPIASII 1140
+LEDSK+KTTV+SRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLP ASDSVESD +SII
Sbjct: 1081 KLEDSKDKTTVSSRTSEQIVMVGCWLAMKEVSLLLGTITRKVPLPTASDSVESDLNSSII 1140
Query: 1141 SRQQEVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDQRLCKLTES 1200
+Q EVLDL+QLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSND RLCKLTES
Sbjct: 1141 LKQDEVLDLRQLKVIGDHFLEVLLKMKHNGAIDKTRAGFTALCNRLLCSNDPRLCKLTES 1200
Query: 1201 WMDQLMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEGSPKKLLPRALKWLIDVAERL 1260
WMDQLMER +A GQTVDDLLRRSAGIPAAF+ALFLAEPEGSPK LLPRALKWLIDVAERL
Sbjct: 1201 WMDQLMERMTANGQTVDDLLRRSAGIPAAFVALFLAEPEGSPKNLLPRALKWLIDVAERL 1260
Query: 1261 LQNPIDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKASKIRDEGVIPTVHAFNVLRA 1320
LQNP++ + +N NFSKLPSTELGQDTESV HETY S+KASKIRDEGVIPTVHAFNVLRA
Sbjct: 1261 LQNPVEIDCENGNFSKLPSTELGQDTESVLPHETYASDKASKIRDEGVIPTVHAFNVLRA 1320
Query: 1321 AFNDTNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESA 1380
AFNDTNLATDTSGFSAQAIIV+IRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESA
Sbjct: 1321 AFNDTNLATDTSGFSAQAIIVSIRSFSSPYWEVRNSACLAYTALVRRMIGFLNVHKRESA 1380
Query: 1381 RRALTGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSKFNLANVVHPSLCPMLILLSR 1440
RRALTGLEFFHRYPALHRFLLDELKVATE LDDGCSGNS+ +LA VVHPSLCPMLILLSR
Sbjct: 1381 RRALTGLEFFHRYPALHRFLLDELKVATEYLDDGCSGNSESSLAKVVHPSLCPMLILLSR 1440
Query: 1441 LKPSTIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASRALTGLVSNENLQLVILNIAS 1500
LKP TIASE GDDLDPFL MPF+R+CSSQSNLRIRILASRALTGLVSNE L VILNIAS
Sbjct: 1441 LKPFTIASETGDDLDPFLFMPFLRRCSSQSNLRIRILASRALTGLVSNEKLPSVILNIAS 1500
Query: 1501 GLPVDDNAMMAHE----DATATTQHASYNRIHGILLQLISLLDTNCRNLADILKKSQVLN 1560
LPVDDN M+A E +AT TTQH SYNRIHGILLQLISLLDTNCRNLADILKKSQ+LN
Sbjct: 1501 ELPVDDNTMLASESISLEATKTTQHTSYNRIHGILLQLISLLDTNCRNLADILKKSQLLN 1560
Query: 1561 DLAEVLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRSKSFYIIRNLLLDLSTE 1620
DL +V+A CSW+AR+RR SCPILSTS LRVLGHML I TCPRSKSFYIIRNLLLDLSTE
Sbjct: 1561 DLVDVIACCSWIARQRRSSCPILSTSFLRVLGHMLGISITCPRSKSFYIIRNLLLDLSTE 1620
Query: 1621 CLDVEASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAALQKSQRSQSDEDVPATL 1680
CLDVE S+ELSYYDPT+ ELRQQA+ICYFNCV QPFDEEDDA LQ SQRSQSD DVPA L
Sbjct: 1621 CLDVETSYELSYYDPTLVELRQQAAICYFNCVLQPFDEEDDAVLQTSQRSQSDADVPAAL 1680
Query: 1681 MDYPFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGLYDLSCHEIRTVDHWIK 1740
+DYPF QLQERLIRSLQDPCYEVRLST+KW+FKFLKSTEYS G YDLS +EIRT+D+WIK
Sbjct: 1681 IDYPFPQLQERLIRSLQDPCYEVRLSTLKWMFKFLKSTEYSAGFYDLSSYEIRTIDYWIK 1740
Query: 1741 TNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFG-NEECTEEVVYIGKMDCGSVL 1800
TNLQ LLTELLS EKN+RCLYYILKNLF WNMSQFQK G ++C EEVVYIG+MDCGSVL
Sbjct: 1741 TNLQTLLTELLSFEKNHRCLYYILKNLFNWNMSQFQKLGKGKKCAEEVVYIGEMDCGSVL 1800
Query: 1801 QFWDKLISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDATTTESPNDRISNNLD 1860
QFWDKLISLYKLTRHAKTRE +RCMGTCIKR +V YS I+SDAT TESPN + NNL+
Sbjct: 1801 QFWDKLISLYKLTRHAKTREIVVRCMGTCIKRFSVIYSISIVSDATKTESPNYGMLNNLE 1860
Query: 1861 EFHSCITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIFGDFVFDNQIPRDTSK 1920
EF C+ LFTDLIRQHSAASEP NMR AAADSIIASGLLEQAEIF +FVFDN+IP T
Sbjct: 1861 EFRDCLALFTDLIRQHSAASEPANMRLAAADSIIASGLLEQAEIFVNFVFDNRIPDGT-- 1920
Query: 1921 SHFEQREYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKCFSSERTTTSSDVPNQ 1980
SH EQREYVN YAHQILN W TCIMLLEDEDD+IR+RLA DVQKCFSSER TTSSDVPNQ
Sbjct: 1921 SHSEQREYVNRYAHQILNTWFTCIMLLEDEDDEIRRRLAVDVQKCFSSERITTSSDVPNQ 1980
Query: 1981 VEQVIGSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISPVDPVRRVFDKEIDNHHEEKL 2040
VEQVIGSSF+YLSSIFGHWV+YFDYL+ WVLN+ N+ +S DPVRRVFDKEIDNHHEEKL
Sbjct: 1981 VEQVIGSSFDYLSSIFGHWVMYFDYLSQWVLNTANHAVSQADPVRRVFDKEIDNHHEEKL 2040
Query: 2041 LISQTCCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQLIKFSDEHMNKHGGFDWIG 2100
LISQTCCLHMEKLS+SKLVALWDTQWF+NYLVGLRKRF HQ I+FSDEHM K GGF+WIG
Sbjct: 2041 LISQTCCLHMEKLSKSKLVALWDTQWFVNYLVGLRKRFLHQFIQFSDEHMGKDGGFNWIG 2100
Query: 2101 GAGNHKDAFLLLYSNLLGFYALSNCIVNGKTKVTMQPLVTEVVEIGKIINPFLRNPLISN 2160
GAGNHKDAFL +Y+NLLGFYALSNCI+NGK++V+ QPL+ EV+EIGKII+PFLRNPLISN
Sbjct: 2101 GAGNHKDAFLPVYANLLGFYALSNCIINGKSQVSTQPLIAEVIEIGKIISPFLRNPLISN 2160
Query: 2161 LYLLVIRIHKEAIDINRDHKIPELGH--EAIWEGFDPYFLLR 2192
LYLLVI+IHKE ID+N DHKIPEL H EAIWE FD YFLLR
Sbjct: 2161 LYLLVIKIHKEVIDVNIDHKIPELEHDDEAIWESFDAYFLLR 2198
BLAST of HG10019700 vs. TAIR 10
Match:
AT3G55160.1 (unknown protein; EXPRESSED IN: 11 plant structures; EXPRESSED DURING: 4 anthesis, F mature embryo stage, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF2428, death-receptor-like (InterPro:IPR019442); Has 357 Blast hits to 330 proteins in 163 species: Archae - 0; Bacteria - 0; Metazoa - 144; Fungi - 118; Plants - 50; Viruses - 0; Other Eukaryotes - 45 (source: NCBI BLink). )
HSP 1 Score: 2340.8 bits (6065), Expect = 0.0e+00
Identity = 1280/2218 (57.71%), Postives = 1567/2218 (70.65%), Query Frame = 0
Query: 1 MSAKWRALQHRHRYTYSAIVFPPSYVDSLNS---FQYPSKFFTELLQLVSLNSVYAQVNH 60
MSAKWRALQHRHRYTYSA++FP S+ SL+ Q KF++ + +LVSLNS+YAQVNH
Sbjct: 1 MSAKWRALQHRHRYTYSAVLFPSSFTASLSQSSLSQSCPKFYSNIEELVSLNSIYAQVNH 60
Query: 61 AKKVASAFSELLANGDED--------SVSKAARFYLEVLFCENSQPLHRTLVSTLAKSRK 120
AKKV ++F E LA +E+ SV +A RFYLE+LF ENS PLH+TLVS LAK+ K
Sbjct: 61 AKKVVASFGEFLAKANENEGGERETVSVREAIRFYLEILFMENSLPLHKTLVSALAKTTK 120
Query: 121 FQDRLGECFRNLCEEHSGMLQGGEKRFCVSRVALSVMGMPKLGYLVDIIRDCAILVARDT 180
F + CF+ LC+E+ G GG RFCVSRVALSVMGMPKLGYLVDII DCA+LV D
Sbjct: 121 FHSVISSCFKELCDEYGGFEDGG-NRFCVSRVALSVMGMPKLGYLVDIIEDCALLVGYDI 180
Query: 181 VSSLDSVVKETNESARPSPIVMEQCQEALSCLYYLLQRFPSKFQ----EDPSVMGMIVRT 240
VS L+ +V +T RP P VMEQCQEALSC YYL QRFP KF+ ED S M ++
Sbjct: 181 VSGLNGIVLDTEACDRPPPTVMEQCQEALSCSYYLFQRFPLKFKGLVGEDASFMESVLAV 240
Query: 241 ILSILKSLAFSRDCYVAAGVSFCASLQVCLNSAELGMLIFYGIFEQTNHIAFLKYESEFR 300
+SILKSLAFSRDCYVAAGVSFCA+LQVCL ELG+ I IF ++ +
Sbjct: 241 QVSILKSLAFSRDCYVAAGVSFCAALQVCLKDEELGLFIAQCIFCWSSVV-------RLA 300
Query: 301 NAIAKVPHQANVCTEIQTFSVLSRLCLIRGILTAIPRPVLNIPFSMMEGELNGHPGCLNS 360
+ ++K+P ++C+EI +FS LSRLCLIRGILT + R +L F+ + NS
Sbjct: 301 DIVSKIPFAGDICSEICSFSSLSRLCLIRGILTTVSRGILVSSFARLS----------NS 360
Query: 361 GNSVKTILYDGILPELCNYCENPTDSHFNFHSLTVLQICLQQIKTSLVSNLTDTSCSYDP 420
KTILYDGIL ELC+ CENP DSH NFH LTV+QIC+QQIKTS+ LTD S YDP
Sbjct: 361 DCDHKTILYDGILLELCDLCENPIDSHLNFHVLTVMQICMQQIKTSM---LTDLSEGYDP 420
Query: 421 LPEEMGSRILRIMWTNLDDPLSQTVKQVHLIFDLFLEIQSSLCWSEGSEKIKLYLQKIAF 480
+P+ M +R+LRI+W NL+DPLSQTVKQVHL+FDL L+IQ+++ ++ ++ L KI
Sbjct: 421 MPDSMAARVLRIIWNNLEDPLSQTVKQVHLMFDLLLDIQTTVHQTDDKVGMRESLLKIVN 480
Query: 481 DLLRLGSRCKGRYVPLASLTKRLGAKALLDMSPSLLSETVQAYIDDDVCCAATSFLKCFL 540
LLRLGSRCKGRYVPLASLT+RLGAK L+DMSP+LL E AYIDDDVC A TSF+KCFL
Sbjct: 481 YLLRLGSRCKGRYVPLASLTRRLGAKTLMDMSPNLLFEMANAYIDDDVCYAVTSFIKCFL 540
Query: 541 EHLRDECWSSDGIEGGYALYRGHCLPPVLHGLGSGISKLRSNLNTYALPVLFEIDLDSIF 600
E LRDE W S+G++ GYA YR HCLPP L+GL SG SKLRSNLNTYA+ VL E+D+DSIF
Sbjct: 541 ELLRDESWGSEGVDQGYARYREHCLPPFLYGLASGKSKLRSNLNTYAVQVLLELDVDSIF 600
Query: 601 PMLAFISVWPSSRDNGVLYPGDYQGSMELRVEQKVAIFISLLKVSRSLALIEGDIDWLEK 660
+LA+IS+ PS + + Y SMEL VEQKV + +SLLKV R+LA +EGDI+ +K
Sbjct: 601 LLLAYISIGPSEEETKLNYTELSNMSMELTVEQKVVVLVSLLKVCRTLAFLEGDIE--QK 660
Query: 661 PSLEQQSVNEIEYFSCHALVFVKGVKVEILVEWLLLALTHVDESLRVDAAEFLFLNPKTS 720
S + A+V +KG++++I +EWL +ALTHVDES+RVDAAE LFLNPKTS
Sbjct: 661 RSTD-----------AFAVVQIKGIELKIPIEWLKMALTHVDESVRVDAAETLFLNPKTS 720
Query: 721 SLPSHLELTLLKKAVPLNMRCSSTAFQMKWSSLFRKFFSRVRTALERQFKLGNWIPLASC 780
SLPS LEL L+K+AVPLNMR SST FQMKW+SLFRKFF RVRT+LE+Q+KLG+ PL S
Sbjct: 721 SLPSPLELYLMKEAVPLNMRSSSTGFQMKWTSLFRKFFLRVRTSLEKQYKLGSLQPLKSD 780
Query: 781 CNSESDLPNGSEQIITGRADDLLHFMKWLSCFLFFSCYPSAPYRRKIMAMDLFLVMLNVW 840
N+ RA+ L FM+WLS FL+ SCYPSAPYRRKIMA +L +M+ VW
Sbjct: 781 KNA------------VLRAESLFKFMRWLSSFLYLSCYPSAPYRRKIMATELIQIMIEVW 840
Query: 841 SVVPSKEKSS-ETLLLPYNEGITLPDSVLLLVGSIIDSWDRLRENSFRILLHFPTPLPGI 900
VV SK+ +S + L PY + +T DS LLLVGSI+DSWDRLRENSFRILLHFPTP GI
Sbjct: 841 PVVASKDPTSHQGHLYPYCDIVTSHDSTLLLVGSIVDSWDRLRENSFRILLHFPTPFTGI 900
Query: 901 SSEYMVGKVITWAKMLVCSSRVRESDAGALALRLVFRKYVLDLGWIVRASGTVVCLDSLN 960
SSE MV +I WAK LVCS RVRESDAGAL LRL+FRKYVLDLGWIV+ S TV C +
Sbjct: 901 SSEDMVQIIIPWAKQLVCSPRVRESDAGALTLRLIFRKYVLDLGWIVKVSTTVFCCEREC 960
Query: 961 KLTNVGEELKKSNHPVAEYLKSLIDWLDISVSEGERNLPEACKNSFVHGVLLTLRYAFEE 1020
+ + + K +PV EY+KSLI WLD SV+EGER+L EACKNSFVHGVLL LRY FEE
Sbjct: 961 ENIDCRNQNSKPKYPVVEYIKSLIQWLDASVTEGERDLSEACKNSFVHGVLLALRYTFEE 1020
Query: 1021 LDWNSDVVLSSILEMRSLLEKLLELGMRITSLALWVVSADAWYLPEDMDDMVDDDAFMLD 1080
LDWNS+ VL SI EMR LEKLL+L RIT+LALWVVSADA LPEDMDD++DDD+F +
Sbjct: 1021 LDWNSNAVL-SISEMRKELEKLLKLVTRITTLALWVVSADALCLPEDMDDIIDDDSFFSN 1080
Query: 1081 VPDEADVSTSLSELEDSKEKTTVNSRT-SEQIVMVGCWLAMKEVSLLLGTITRKVPLPAA 1140
V D D + LSE S V+ SEQ+VMVGCWLAMKEVSLLLGTI RK+PLP +
Sbjct: 1081 VQD--DSAAVLSEEHTSTYPKHVHETVQSEQVVMVGCWLAMKEVSLLLGTIIRKIPLPTS 1140
Query: 1141 S-------DSVESDPIASIISRQQEVLDLKQLKVIGDHFLEVLLKMKHNGAIDKTRAGFT 1200
S D+ S P +I + +LDLKQL+ IGDHFLEVLLKMKHNGAIDKTRAGFT
Sbjct: 1141 SLRPLENGDTASSVPNDLVIGNSESLLDLKQLEKIGDHFLEVLLKMKHNGAIDKTRAGFT 1200
Query: 1201 ALCNRLLCSNDQRLCKLTESWMDQLMERTSAKGQTVDDLLRRSAGIPAAFIALFLAEPEG 1260
ALC+RLLCSND RLCKLTESWM+QLMERT AKGQTVDD+LRRSAGIPAAFIALFL+EPEG
Sbjct: 1201 ALCHRLLCSNDPRLCKLTESWMEQLMERTVAKGQTVDDVLRRSAGIPAAFIALFLSEPEG 1260
Query: 1261 SPKKLLPRALKWLIDVAERLLQNPIDTNHKNSNFSKLPSTELGQDTESVSLHETYPSEKA 1320
SPKKLLPRAL+WLI +AE+ L P++ K S E ++ + + +EK
Sbjct: 1261 SPKKLLPRALRWLIGLAEKPLMEPLE--QKGSKHM----------VEEINSSDMHSNEKL 1320
Query: 1321 SKIRDEGVIPTVHAFNVLRAAFNDTNLATDTSGFSAQAIIVAIRSFSSPYWEVRNSACLA 1380
SK+RDEGV+PTVHAFNVL+A FNDTNL+TDTSGFSA+A+IV+IRSFSSPYWEVRNSA LA
Sbjct: 1321 SKVRDEGVVPTVHAFNVLKATFNDTNLSTDTSGFSAEAMIVSIRSFSSPYWEVRNSATLA 1380
Query: 1381 YTALVRRMIGFLNVHKRESARRALTGLEFFHRYPALHRFLLDELKVATESLDDGCSGNSK 1440
YTALVRRMIGFLNV KR S RRALTGLEFFHRYP LH F+ ELK AT+ LD SG+S
Sbjct: 1381 YTALVRRMIGFLNVQKRGSTRRALTGLEFFHRYPLLHPFIYSELKAATDLLD--TSGSSD 1440
Query: 1441 FNLANVVHPSLCPMLILLSRLKPSTIASEDGDDLDPFLLMPFIRKCSSQSNLRIRILASR 1500
NLAN+VHPSL P+LILLSRLKPS IASE GDDLDPF+ MPFI KCS+QSNLR+R+LASR
Sbjct: 1441 SNLANLVHPSLWPILILLSRLKPSPIASESGDDLDPFVFMPFIMKCSTQSNLRVRVLASR 1500
Query: 1501 ALTGLVSNENLQLVILNIASGLPVDDNAMMAHEDATATTQHASYNRIHGILLQLISLLDT 1560
AL GLVSNE LQ V+L IAS LP + Q S+N +HGILLQL +LLDT
Sbjct: 1501 ALVGLVSNEKLQSVLLRIASTLPSNG------------AQGGSFNYLHGILLQLGNLLDT 1560
Query: 1561 NCRNLADILKKSQVLNDLAEVLARCSWMARRRRCSCPILSTSILRVLGHMLSIVRTCPRS 1620
NCR+LAD KK Q++ L VLA CSW+A C CPIL TS LRVL HM I TC S
Sbjct: 1561 NCRDLADNSKKDQIIGKLINVLANCSWLASPLTCPCPILCTSFLRVLDHMRVIEWTCSES 1620
Query: 1621 KSFYIIRNLLLDLSTECLDVEASHELSYYDPTVAELRQQASICYFNCVFQPFDEEDDAAL 1680
K+ I L LDLST CLD +AS+ SYYDP++AELR+QA++ YF CVFQP DE +
Sbjct: 1621 KNLRDIYKLHLDLSTNCLDADASYGFSYYDPSIAELREQAAVSYFGCVFQPSDEAAE-VF 1680
Query: 1681 QKSQRSQSDEDVPATLMDYPFSQLQERLIRSLQDPCYEVRLSTMKWLFKFLKSTEYSDGL 1740
Q +QR +D+P L ERL+R + D YEVRL+T+KW +FLKS
Sbjct: 1681 QITQRPNLQSQKVPEALDFP--HLNERLLRCISDQSYEVRLATLKWFLRFLKSE------ 1740
Query: 1741 YDLSCHEIRTVDHWIKTNLQALLTELLSLEKNYRCLYYILKNLFAWNMSQFQKFGNEECT 1800
D S E ++ +W K LQ +L ELL EKN++C YIL+ LF WN+ F+K N+E
Sbjct: 1741 -DSSFSESSSIWNWAKNGLQVILLELLDKEKNHKCENYILRILFQWNLLMFKKSCNKESV 1800
Query: 1801 EEVVYIGKMDCGSVLQFWDKLISLYKLTRHAKTRENTIRCMGTCIKRLAVQYSACIISDA 1860
E +Y+G ++ SV W +L SLY+ TR AKTR + C+ C+K L + I
Sbjct: 1801 EG-IYVGSLNYDSVFHLWGRLTSLYESTRRAKTRGTLMCCLAICVKHL----TGLFIHKN 1860
Query: 1861 TTTESPNDRISNNLDEFHSCITLFTDLIRQHSAASEPVNMRTAAADSIIASGLLEQAEIF 1920
+ + R S D C++ F +LI+Q S SE VN+R A+A++IIASG+LEQA++
Sbjct: 1861 ESEKEEEPRWSCITD----CVSYFVNLIKQKSLPSEQVNVRHASAEAIIASGILEQAKLI 1920
Query: 1921 GDFVFDNQIPRDTSKSHFEQREYVNMYAHQILNMWSTCIMLLEDEDDDIRKRLAADVQKC 1980
G V ++QI +T+ S F++ ++YA+QIL MW TCI LLEDEDD IR +LA DVQKC
Sbjct: 1921 GPLVSNHQISSETTPSKFQKA--CDVYAYQILEMWFTCIKLLEDEDDVIRSKLATDVQKC 1980
Query: 1981 FSSERTTTSSDVPNQVEQVIGSSFEYLSSIFGHWVLYFDYLANWVLNSVNYTISP---VD 2040
F T+ +VP QV++V+ SF +LSSI GHW Y YL+ WV N+ +YT P D
Sbjct: 1981 F-----FTAVEVPTQVDKVLELSFNHLSSILGHWNEYSQYLSRWVFNTADYTSPPKGGSD 2040
Query: 2041 PVRRVFDKEIDNHHEEKLLISQTCCLHMEKLSRSKLVALWDTQWFINYLVGLRKRFFHQL 2100
VRRVFDKEIDNHHEEKLLI Q CC H++KL + + + L+ R +F +QL
Sbjct: 2041 LVRRVFDKEIDNHHEEKLLILQFCCYHLQKLP--------NRDFSLAQLLDWRSKFHNQL 2100
Query: 2101 IKFSDEHMNKHGGFDWIGGAGNHKDAFLLLYSNLLGFYALSNCIVNGKTKVT-MQPLVTE 2160
+ F+ +H++K W+GG GNHKD FL LY NLLG Y S+CI T + L ++
Sbjct: 2101 LAFAKDHVSKQRE-SWVGGVGNHKDVFLPLYGNLLGLYVFSDCIFRFSTDSNDKKTLFSD 2107
Query: 2161 VVEIGKIINPFLRNPLISNLYLLVIRIHKEAIDINRDHKIPELGHEAIWEGFDPYFLL 2191
++E+G+ + PFLRNPL+SN++ +V+R+H++ ++ + L E IWEGFDPYFLL
Sbjct: 2161 IIELGEALKPFLRNPLVSNMFRVVVRLHEKLLNDSLMDLSTVLSGE-IWEGFDPYFLL 2107
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038903869.1 | 0.0e+00 | 93.08 | thyroid adenoma-associated protein homolog [Benincasa hispida] | [more] |
XP_008443417.1 | 0.0e+00 | 91.07 | PREDICTED: uncharacterized protein LOC103487009 [Cucumis melo] >KAA0053755.1 thy... | [more] |
XP_031737854.1 | 0.0e+00 | 90.99 | LOW QUALITY PROTEIN: thyroid adenoma-associated protein homolog [Cucumis sativus... | [more] |
XP_022934862.1 | 0.0e+00 | 89.06 | thyroid adenoma-associated protein homolog [Cucurbita moschata] | [more] |
KAG6580971.1 | 0.0e+00 | 89.06 | Thyroid adenoma-associated protein-like protein, partial [Cucurbita argyrosperma... | [more] |
Match Name | E-value | Identity | Description | |
A8C754 | 2.0e-61 | 23.51 | Thyroid adenoma-associated protein homolog OS=Gallus gallus OX=9031 GN=THADA PE=... | [more] |
A8C756 | 5.8e-61 | 24.10 | Thyroid adenoma-associated protein homolog OS=Mus musculus OX=10090 GN=Thada PE=... | [more] |
A8C750 | 3.0e-57 | 22.67 | Thyroid adenoma-associated protein homolog OS=Canis lupus familiaris OX=9615 GN=... | [more] |
A8C752 | 1.5e-56 | 22.53 | Thyroid adenoma-associated protein homolog OS=Chlorocebus aethiops OX=9534 GN=TH... | [more] |
Q6YHU6 | 6.2e-55 | 22.33 | Thyroid adenoma-associated protein OS=Homo sapiens OX=9606 GN=THADA PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A5A7UJ45 | 0.0e+00 | 91.07 | Thyroid adenoma-associated protein-like protein isoform X1 OS=Cucumis melo var. ... | [more] |
A0A1S3B8Q1 | 0.0e+00 | 91.07 | uncharacterized protein LOC103487009 OS=Cucumis melo OX=3656 GN=LOC103487009 PE=... | [more] |
A0A6J1F3Z3 | 0.0e+00 | 89.06 | thyroid adenoma-associated protein homolog OS=Cucurbita moschata OX=3662 GN=LOC1... | [more] |
A0A6J1J6K6 | 0.0e+00 | 88.78 | thyroid adenoma-associated protein homolog OS=Cucurbita maxima OX=3661 GN=LOC111... | [more] |
A0A6J1BVK0 | 0.0e+00 | 86.47 | thyroid adenoma-associated protein homolog OS=Momordica charantia OX=3673 GN=LOC... | [more] |
Match Name | E-value | Identity | Description | |
AT3G55160.1 | 0.0e+00 | 57.71 | unknown protein; EXPRESSED IN: 11 plant structures; EXPRESSED DURING: 4 anthesis... | [more] |