Homology
BLAST of HG10022266 vs. NCBI nr
Match:
XP_038890797.1 (regulator of nonsense transcripts UPF2 [Benincasa hispida])
HSP 1 Score: 2152.1 bits (5575), Expect = 0.0e+00
Identity = 1139/1195 (95.31%), Positives = 1153/1195 (96.49%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDHHEDDGR GGESQPKRDDEETVARQEEIKK FEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1 MDHHEDDGRQGGESQPKRDDEETVARQEEIKKIFEAKMALRQSNLNPERPDSGFLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ
Sbjct: 61 SIKRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED
Sbjct: 121 SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
+AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+IT
Sbjct: 181 SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLSPTGQDHEEFFKSLNIT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA
Sbjct: 301 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESE DQGQQ +LEAIE+STDCSLQ
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEVDQGQQASLEAIEISTDCSLQ 420
Query: 496 EGKIN---EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
+GKIN EKGK++EEKDKEKNKDTDKEKGKEKDADRK+ENEKEKLKNIEGTNLDALLQR
Sbjct: 421 DGKINEKGEKGKDREEKDKEKNKDTDKEKGKEKDADRKMENEKEKLKNIEGTNLDALLQR 480
Query: 556 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
LPGCVSRDLIDQLTVEFCYLNSK+NRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD
Sbjct: 481 LPGCVSRDLIDQLTVEFCYLNSKSNRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 540
Query: 616 VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF
Sbjct: 541 VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 600
Query: 676 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC
Sbjct: 601 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 660
Query: 736 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH
Sbjct: 661 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 720
Query: 796 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE
Sbjct: 721 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 780
Query: 856 LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
LYNYELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS
Sbjct: 781 LYNYELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 840
Query: 916 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPN+TRYSSIEEIN AFVELEEHE
Sbjct: 841 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNVTRYSSIEEINTAFVELEEHE 900
Query: 976 RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
RSVSNDKPNTEKHLDAEKPSRATSN+ SANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI
Sbjct: 901 RSVSNDKPNTEKHLDAEKPSRATSNSTSANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 960
Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ
Sbjct: 961 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1020
Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGS 1155
ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGS RDHHGRGVGGESGDEGLDEDAGGS
Sbjct: 1021 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGS-RDHHGRGVGGESGDEGLDEDAGGS 1080
Query: 1156 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1215
KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE
Sbjct: 1081 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1140
Query: 1216 LNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
NGLGSQTMNWMQTGGNR VPTRGNNWEASGGRSGG RHPHHRYPGSG+HYSRKK
Sbjct: 1141 HNGLGSQTMNWMQTGGNR-VPTRGNNWEASGGRSGGLRHPHHRYPGSGMHYSRKK 1193
BLAST of HG10022266 vs. NCBI nr
Match:
XP_008463566.1 (PREDICTED: LOW QUALITY PROTEIN: regulator of nonsense transcripts UPF2 [Cucumis melo])
HSP 1 Score: 2150.9 bits (5572), Expect = 0.0e+00
Identity = 1136/1195 (95.06%), Positives = 1148/1195 (96.07%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ
Sbjct: 61 SIKRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED
Sbjct: 121 SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
+AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PT QDHEEFFKSL+IT
Sbjct: 181 SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTAQDHEEFFKSLNIT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQ+EQENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQLEQENAKILNAKGELNDENVSSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRNVSS AEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA
Sbjct: 301 YDHLYRNVSSFAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
FYECLPDLRAFVPAVLLGEAEPKANEQSAKP ENLAESEA+QGQQT+LEAIEVSTDC LQ
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPTENLAESEAEQGQQTSLEAIEVSTDCPLQ 420
Query: 496 EGKIN---EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
+GKIN EKGK++EEKDKEKN DTDKEKGKEKD DRK+ENEK KLKNIEGTNLDALLQR
Sbjct: 421 DGKINEKGEKGKDREEKDKEKNNDTDKEKGKEKDGDRKMENEKXKLKNIEGTNLDALLQR 480
Query: 556 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD
Sbjct: 481 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 540
Query: 616 VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF
Sbjct: 541 VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 600
Query: 676 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC
Sbjct: 601 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 660
Query: 736 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH
Sbjct: 661 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 720
Query: 796 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE
Sbjct: 721 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 780
Query: 856 LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
LYNYELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS
Sbjct: 781 LYNYELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 840
Query: 916 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE
Sbjct: 841 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 900
Query: 976 RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
RSVSNDKPNTEKHLDAEKPSRATS T SANGRD VNGSKEN GAHEDG DSDSDTGSGTI
Sbjct: 901 RSVSNDKPNTEKHLDAEKPSRATSTTTSANGRDRVNGSKENSGAHEDGADSDSDTGSGTI 960
Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
EAEGRDDEESDLENHEDGCDTEDDEDDEE GGPASDEDDEVHVRQKVPEVDPREEANFEQ
Sbjct: 961 EAEGRDDEESDLENHEDGCDTEDDEDDEEPGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1020
Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGS 1155
ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDEGLDEDAGGS
Sbjct: 1021 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEGLDEDAGGS 1080
Query: 1156 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1215
KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE
Sbjct: 1081 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1140
Query: 1216 LNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
LNGLGSQTMNWMQTGGNR VPTRGNNWEASGGRSGGSRHPHHRYPGSG+HYSRKK
Sbjct: 1141 LNGLGSQTMNWMQTGGNR-VPTRGNNWEASGGRSGGSRHPHHRYPGSGMHYSRKK 1194
BLAST of HG10022266 vs. NCBI nr
Match:
XP_004143811.1 (regulator of nonsense transcripts UPF2 [Cucumis sativus] >KGN51237.1 hypothetical protein Csa_008908 [Cucumis sativus])
HSP 1 Score: 2150.6 bits (5571), Expect = 0.0e+00
Identity = 1138/1196 (95.15%), Positives = 1151/1196 (96.24%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDHHEDDGRPGGESQPKRDDEE+VARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1 MDHHEDDGRPGGESQPKRDDEESVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNTTVIKKLKQINEEQREGLMD+LRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ
Sbjct: 61 SIKRNTTVIKKLKQINEEQREGLMDDLRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED
Sbjct: 121 SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
+AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PT QDHEEFFKSL+IT
Sbjct: 181 SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTAQDHEEFFKSLNIT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRNVSS AEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA
Sbjct: 301 YDHLYRNVSSFAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEA+QGQQT+LEAIEVSTDC LQ
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEAEQGQQTSLEAIEVSTDCLLQ 420
Query: 496 EGKIN---EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
+GKIN EKGK++EEKDKEKN DTDKEKGKEKD DRK+ENEKEKLKNIEGTNLDALLQR
Sbjct: 421 DGKINEKGEKGKDREEKDKEKNNDTDKEKGKEKDGDRKMENEKEKLKNIEGTNLDALLQR 480
Query: 556 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD
Sbjct: 481 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 540
Query: 616 VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
VSVILLQMLEEEF+FLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF
Sbjct: 541 VSVILLQMLEEEFSFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 600
Query: 676 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC
Sbjct: 601 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 660
Query: 736 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH
Sbjct: 661 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 720
Query: 796 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE
Sbjct: 721 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 780
Query: 856 LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
LYNYELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS
Sbjct: 781 LYNYELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 840
Query: 916 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE
Sbjct: 841 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 900
Query: 976 RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
RSVSNDKPNTEKHLDAEKPSRATSN SANGRDTVNGSKENGGAHEDG DSDSDTGSGTI
Sbjct: 901 RSVSNDKPNTEKHLDAEKPSRATSNITSANGRDTVNGSKENGGAHEDGADSDSDTGSGTI 960
Query: 1036 EAEGRDDEESDLE-NHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFE 1095
EAEGRDDEESDLE NHEDGCDTEDDEDDEE GGPASDEDDEVHVRQKVPEVDPREEANFE
Sbjct: 961 EAEGRDDEESDLENNHEDGCDTEDDEDDEEPGGPASDEDDEVHVRQKVPEVDPREEANFE 1020
Query: 1096 QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGG 1155
QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDEGLDEDAGG
Sbjct: 1021 QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEGLDEDAGG 1080
Query: 1156 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1215
SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE
Sbjct: 1081 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1140
Query: 1216 ELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
ELNGLGSQTMNWMQTGGNR VPTRGNNWE SGGRSGGSRHPHHRYPGSGVHYSRKK
Sbjct: 1141 ELNGLGSQTMNWMQTGGNR-VPTRGNNWEGSGGRSGGSRHPHHRYPGSGVHYSRKK 1195
BLAST of HG10022266 vs. NCBI nr
Match:
KAG6578740.1 (Regulator of nonsense transcripts UPF2, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2127.4 bits (5511), Expect = 0.0e+00
Identity = 1128/1229 (91.78%), Positives = 1158/1229 (94.22%), Query Frame = 0
Query: 43 SSQFTILPS---HRFSVSVSAPLPTITFSSL-SGGTDMDHHEDDGRPGGESQPKRDDEET 102
S+ T+ PS R S+S+ P + L + TDMDHHEDDGRP ESQPKRDDEET
Sbjct: 28 STIHTVSPSVLHCRRSLSLHLPAELLVPKLLQNNSTDMDHHEDDGRPVSESQPKRDDEET 87
Query: 103 VARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDSSIRRNTTVIKKLKQINEEQREGL 162
ARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDSSI+RNTT+IKKLKQIN+EQREGL
Sbjct: 88 AARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDSSIKRNTTIIKKLKQINDEQREGL 147
Query: 163 MDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ------------------------- 222
MDELRNVNMSKFVSEAVSAICDAKLR SDIQAAVQ
Sbjct: 148 MDELRNVNMSKFVSEAVSAICDAKLRASDIQAAVQICSLLHQRYKDFSPCLIQGLLKVFF 207
Query: 223 ---SGDELDADRNLKAMKKRSTLKLLMELFFVGVVEDTAIFNNIIKDLTSIEHLRDRDTT 282
SGDELDADRNLKAMKKRSTLKLL+ELFFVGVVED+AIFNNIIKDLTSIEHLRDRDTT
Sbjct: 208 PGKSGDELDADRNLKAMKKRSTLKLLLELFFVGVVEDSAIFNNIIKDLTSIEHLRDRDTT 267
Query: 283 LTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSITADQKKFFRKAFHTYYDAAAELLQ 342
LTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+ITADQKKFFRKAFHTYYDAAAELLQ
Sbjct: 268 LTNLTLLASFARQGRILLGLPPTGQDHEEFFKSLNITADQKKFFRKAFHTYYDAAAELLQ 327
Query: 343 SEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEALDMQPPVMP 402
SEHTSLRQMEQENAK+LNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEA+DMQPPVMP
Sbjct: 328 SEHTSLRQMEQENAKLLNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEAVDMQPPVMP 387
Query: 403 EDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPK 462
EDGHTTRVSAGEDVSSPAAGKDSSV+EAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPK
Sbjct: 388 EDGHTTRVSAGEDVSSPAAGKDSSVLEAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPK 447
Query: 463 ANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTD 522
ANEQSAKPAENLAESEADQGQQT+LEA+E+STD LQ+GKINEKGK+KEEKDKEKNKDTD
Sbjct: 448 ANEQSAKPAENLAESEADQGQQTSLEAVEISTDSLLQDGKINEKGKDKEEKDKEKNKDTD 507
Query: 523 KEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCYLNSKANR 582
KEKGKEKDADRK+ENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCYLNSKANR
Sbjct: 508 KEKGKEKDADRKMENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCYLNSKANR 567
Query: 583 KKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIE 642
KKLVRALFNVPRTSLELLPYYSR+VATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIE
Sbjct: 568 KKLVRALFNVPRTSLELLPYYSRLVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIE 627
Query: 643 TKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETT 702
TKI+NIRFIGELCKFKIASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETT
Sbjct: 628 TKIKNIRFIGELCKFKIASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETT 687
Query: 703 VRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFS 762
VRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFS
Sbjct: 688 VRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFS 747
Query: 763 DLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSV 822
DLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSV
Sbjct: 748 DLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSV 807
Query: 823 AVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLIIVFGHGT 882
AVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLI+VFGHGT
Sbjct: 808 AVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLILVFGHGT 867
Query: 883 SEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLDIE 942
SEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLD+E
Sbjct: 868 SEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLDVE 927
Query: 943 FDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSVSNDKPNTEKHLDAEKPSRATSNT 1002
FDLQDLFAELQPNMTRYSSIEEINAAFVEL+EHERSVSN KPNTEKHLDA+KPSRATSNT
Sbjct: 928 FDLQDLFAELQPNMTRYSSIEEINAAFVELDEHERSVSNVKPNTEKHLDAKKPSRATSNT 987
Query: 1003 ASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAEGRDDEESDLENHEDGCDTEDDED 1062
SANGRDTVNGSKENG AHEDGVDSDSDTGSGTIEAEG DDEESDLENHEDGCDTEDDED
Sbjct: 988 TSANGRDTVNGSKENGAAHEDGVDSDSDTGSGTIEAEGHDDEESDLENHEDGCDTEDDED 1047
Query: 1063 DEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMM 1122
DEEAGGPASDEDDEVHVR KV EVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMM
Sbjct: 1048 DEEAGGPASDEDDEVHVRPKVAEVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMM 1107
Query: 1123 IPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEVQVKVLVKRGNKQQTKKMYIPRDC 1182
IPMNLFEGSTRDHHGRG GGESGDE L+EDAGGSKEVQVKVLVKRGNKQQTKKMYIPRD
Sbjct: 1108 IPMNLFEGSTRDHHGRGAGGESGDEALEEDAGGSKEVQVKVLVKRGNKQQTKKMYIPRDS 1167
Query: 1183 TLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTGGNRGVPTRGNN 1240
LLQSTKQKEA ELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTG NR PTRGNN
Sbjct: 1168 ALLQSTKQKEAEELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTGNNR-APTRGNN 1227
BLAST of HG10022266 vs. NCBI nr
Match:
XP_023550316.1 (regulator of nonsense transcripts UPF2-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2121.7 bits (5496), Expect = 0.0e+00
Identity = 1116/1192 (93.62%), Positives = 1140/1192 (95.64%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDHHEDDGRP ESQPKRDDEET ARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1 MDHHEDDGRPVSESQPKRDDEETAARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNTT+IKKLKQIN+EQREGLMDELRNVNMSKFVSEAVSAICDAKLR SDIQAAVQ
Sbjct: 61 SIKRNTTIIKKLKQINDEQREGLMDELRNVNMSKFVSEAVSAICDAKLRASDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
SGD+LDADRNLKAMKKRSTLKLL+ELFFVGVVED
Sbjct: 121 SLLHQRYKDFSPCLIQGLLKVFFPGKSGDDLDADRNLKAMKKRSTLKLLLELFFVGVVED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
+AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+IT
Sbjct: 181 SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTGQDHEEFFKSLNIT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAK+LNAKGELNDENVSSYEKLRKS
Sbjct: 241 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKLLNAKGELNDENVSSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRNVSSLAEA+DMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSV+EAIWDDEDTRA
Sbjct: 301 YDHLYRNVSSLAEAVDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVLEAIWDDEDTRA 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQT+LE +E+STD LQ
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTSLEPVEISTDSLLQ 420
Query: 496 EGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPG 555
+GKINEKGK+KEEKDKEKNKDTDKEKGKEKDADRK+ENEKEKLKNIEGTNLDALLQRLPG
Sbjct: 421 DGKINEKGKDKEEKDKEKNKDTDKEKGKEKDADRKMENEKEKLKNIEGTNLDALLQRLPG 480
Query: 556 CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV 615
CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSR+VATLSTCMKDVSV
Sbjct: 481 CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRLVATLSTCMKDVSV 540
Query: 616 ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 675
ILLQMLEEEFNFLLNKKDQMNIETKI+NIRFIGELCKFKIASAGLVFSCLKACLDDFTHH
Sbjct: 541 ILLQMLEEEFNFLLNKKDQMNIETKIKNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 600
Query: 676 NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 735
NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP
Sbjct: 601 NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 660
Query: 736 ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 795
ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK
Sbjct: 661 ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 720
Query: 796 YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 855
YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN
Sbjct: 721 YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 780
Query: 856 YELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 915
YELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK
Sbjct: 781 YELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 840
Query: 916 LDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSV 975
LDRFFIHFQKYILSKGALPLD+EFDLQDLFAELQPNMTRYSSIEEINAAFVEL+EHERSV
Sbjct: 841 LDRFFIHFQKYILSKGALPLDVEFDLQDLFAELQPNMTRYSSIEEINAAFVELDEHERSV 900
Query: 976 SNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAE 1035
SN KPNTEKHLDA+KPSRATSNT SANGRDTVNGSKENG AHEDGVDSDSDTGSGTIEAE
Sbjct: 901 SNVKPNTEKHLDAKKPSRATSNTTSANGRDTVNGSKENGAAHEDGVDSDSDTGSGTIEAE 960
Query: 1036 GRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQELR 1095
G DDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVR KV EVDPREEANFEQELR
Sbjct: 961 GHDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRPKVAEVDPREEANFEQELR 1020
Query: 1096 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEV 1155
AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDE L+EDAGGSKEV
Sbjct: 1021 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEALEEDAGGSKEV 1080
Query: 1156 QVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNG 1215
QVKVLVKRGNKQQTKKMYIPRD LLQSTKQKEA ELEEKQDIKRLILEYNDREEEELNG
Sbjct: 1081 QVKVLVKRGNKQQTKKMYIPRDSALLQSTKQKEAEELEEKQDIKRLILEYNDREEEELNG 1140
Query: 1216 LGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
LGSQTMNWMQTG NR PTRGNNW+ASGGRSGGS HPHHRYPG GVHYSRKK
Sbjct: 1141 LGSQTMNWMQTGNNR-APTRGNNWDASGGRSGGSHHPHHRYPGGGVHYSRKK 1191
BLAST of HG10022266 vs. ExPASy Swiss-Prot
Match:
F4IUX6 (Regulator of nonsense transcripts UPF2 OS=Arabidopsis thaliana OX=3702 GN=UPF2 PE=2 SV=1)
HSP 1 Score: 1538.5 bits (3982), Expect = 0.0e+00
Identity = 831/1196 (69.48%), Positives = 992/1196 (82.94%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDH ED+ K+DDEE +AR EEIKKS EAK+ LRQ+NLNPERPDS +LRTLDS
Sbjct: 1 MDHPEDESH-----SEKQDDEEALARLEEIKKSIEAKLTLRQNNLNPERPDSAYLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNT VIKKLKQINEEQREGLMD+LR VN+SKFVSEAV+AIC+AKL++SDIQAAVQ
Sbjct: 61 SIKRNTAVIKKLKQINEEQREGLMDDLRGVNLSKFVSEAVTAICEAKLKSSDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
S ++L+AD+N KAMKKRSTLKLL+EL++VGV+ED
Sbjct: 121 SLLHQRYKEFSASLTQGLLKVFFPGKSAEDLEADKNSKAMKKRSTLKLLLELYYVGVIED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
+ IF NIIKDLTS+E L+DRDTT TNLTLL SFARQGRI LGL +GQD E+FFK L +T
Sbjct: 181 SNIFINIIKDLTSVEQLKDRDTTQTNLTLLTSFARQGRIFLGLPISGQD-EDFFKGLDVT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKK F+KAF+TYYDA A+LLQSEH L QME+ENAK++NAKGEL++++ SSYEKLRKS
Sbjct: 241 ADQKKSFKKAFNTYYDALADLLQSEHKLLLQMEKENAKLVNAKGELSEDSASSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRN+SSLAEALDMQPPVMPEDG TTR++AG++ S KD+SV E IWDDEDT+
Sbjct: 301 YDHLYRNISSLAEALDMQPPVMPEDG-TTRLTAGDEASPSGTVKDTSVPEPIWDDEDTKT 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEAD--QGQQTALEAIEVSTDCS 495
FYECLPDLRAFVPAVLLGEAEPK+NEQSAK E L+ES ++ + QQT + EVS D +
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKSNEQSAKAKEKLSESSSEVVENQQTTEDTTEVSADSA 420
Query: 496 LQEGKIN-EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
+ + N E+ KEKEE +KEK KDT KEKGKEKD+++K+E+EKEK K+++ N + LLQR
Sbjct: 421 SMDDRSNAEQPKEKEEVEKEKAKDTKKEKGKEKDSEKKMEHEKEKGKSLDVANFERLLQR 480
Query: 556 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
LPGCVSRDLIDQLTVE+CYLNSK NRKKLV+ALFNVPRTSLELL YYSRMVATL++CMKD
Sbjct: 481 LPGCVSRDLIDQLTVEYCYLNSKTNRKKLVKALFNVPRTSLELLAYYSRMVATLASCMKD 540
Query: 616 VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
+ +L+QMLE+EFN L++KKDQMNIETKIRNIRFIGELCKFKI AGLVFSCLKACLD+F
Sbjct: 541 IPSMLVQMLEDEFNSLVHKKDQMNIETKIRNIRFIGELCKFKIVPAGLVFSCLKACLDEF 600
Query: 676 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
THHNIDVACNLLETCGRFLYRSPETT+RM NML+ILMRLKNVKNLDPR STLVENAYYLC
Sbjct: 601 THHNIDVACNLLETCGRFLYRSPETTLRMTNMLDILMRLKNVKNLDPRQSTLVENAYYLC 660
Query: 736 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
KPPERSAR+SKVRPPLHQY+RKLLFSDLDK +I NVL+QLRKLPWSECEQY+LKCFMKVH
Sbjct: 661 KPPERSARISKVRPPLHQYVRKLLFSDLDKDSIANVLKQLRKLPWSECEQYILKCFMKVH 720
Query: 796 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
KGKYGQIHLIASLTSGLSR+HDEF VAVVDEVLEEIR+GLE+N+YG QQKR+AHMRFLGE
Sbjct: 721 KGKYGQIHLIASLTSGLSRHHDEFVVAVVDEVLEEIRVGLELNEYGAQQKRLAHMRFLGE 780
Query: 856 LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
LYNYE VDSSV+F+TLYL +++GH TSEQ+VLDPPED FR+RM+I LL+TCGHYFDRGSS
Sbjct: 781 LYNYEHVDSSVIFETLYLTLLYGHDTSEQEVLDPPEDFFRVRMVIILLETCGHYFDRGSS 840
Query: 916 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
K++LD+F IHFQ+YILSKG LPLDIEFDLQDLFA L+PNMTRYS+I+E+NAA ++LEE E
Sbjct: 841 KKRLDQFLIHFQRYILSKGHLPLDIEFDLQDLFANLRPNMTRYSTIDEVNAAILQLEERE 900
Query: 976 RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
+ S DK + E+H D + ++++S+ S+NG+ T +ENG AH G +SDSD+GSG++
Sbjct: 901 HASSGDKVSIERHSDTKPSNKSSSDVISSNGKSTAKDIRENGEAH--GEESDSDSGSGSV 960
Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
+G+ +EE D NHE G ++ D +D ++ GP SD DD+ VRQKV VD E+A+F+Q
Sbjct: 961 VRDGQ-NEELDDGNHERGSESGDGDDYDDGDGPGSD-DDKFRVRQKVVTVDLEEQADFDQ 1020
Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG-VGGESGDEGLDEDAGG 1155
EL+A++QESM+QR+ ELRGRP LNM IPM++FEGS +DHH G V GE+G+E LDE+ G
Sbjct: 1021 ELKALLQESMEQRKLELRGRPALNMTIPMSVFEGSGKDHHHFGRVVGENGEEVLDEENGE 1080
Query: 1156 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1215
+EVQVKVLVKRGNKQQT++M IP DC L+QSTKQKEAAELEEKQDIKRL+LEYN+R+EE
Sbjct: 1081 QREVQVKVLVKRGNKQQTRQMLIPSDCALVQSTKQKEAAELEEKQDIKRLVLEYNERDEE 1140
Query: 1216 ELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
E NGLG+Q +NW +GG+RG G G+SGGSRH + + G G Y ++
Sbjct: 1141 EANGLGTQILNW-TSGGSRGSTRTGE----GSGKSGGSRHRFYYHQGGGGSYHARR 1180
BLAST of HG10022266 vs. ExPASy Swiss-Prot
Match:
A2AT37 (Regulator of nonsense transcripts 2 OS=Mus musculus OX=10090 GN=Upf2 PE=1 SV=1)
HSP 1 Score: 589.3 bits (1518), Expect = 9.8e-167
Identity = 430/1212 (35.48%), Positives = 658/1212 (54.29%), Query Frame = 0
Query: 80 EDDGRPGGESQPKRDDEETVARQEEIKKSF----------EAKMALRQSNLN--PERPDS 139
E++ R E Q KR EE A+ +E ++S + + LR N N RP+
Sbjct: 98 EEEERKKQEEQAKRQQEEAAAQLKEKEESLQLHQEAWERHQLRKELRSKNQNAPDNRPEE 157
Query: 140 GFLRTLDSSIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSD 199
F LDSS+++NT +KKLK I E+QR+ L + +N+SK+++EAV++I +AKL+ SD
Sbjct: 158 NFFSRLDSSLKKNTAFVKKLKTITEQQRDSLSHDFNGLNLSKYIAEAVASIVEAKLKLSD 217
Query: 200 IQAAV----------------------QSGDELDADRNLKAMKKRSTLKLLMELFFVGVV 259
+ A + + ++ K R+ L+ + EL VG+
Sbjct: 218 VNCAAHLCSLFHQRYSDFAPSLLQVWKKHFEARKEEKTPNITKLRTDLRFIAELTIVGIF 277
Query: 260 EDTAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQ-GRILLGLLP------TGQDHE 319
D + I + L SI + DR++ T+++++ SF R G + GL+P + +
Sbjct: 278 TDKEGLSLIYEQLKSIIN-ADRESH-THVSVVISFCRHCGDDIAGLVPRKVKSAAEKFNL 337
Query: 320 EFFKSLSITADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENV 379
F S I+ ++++ F+ Y+ + + L+ +H L+ E++N +IL++KGEL+++
Sbjct: 338 SFPPSEIISPEKQQPFQNLLKEYFTSLTKHLKRDHRELQNTERQNRRILHSKGELSEDRH 397
Query: 380 SSYEKLRKSYDHLYRNVSSLAEALDMQPPVMPEDGHTTRV-SAGEDVSSPAAGKDSSVIE 439
YE+ SY L N SLA+ LD P +P+D T G D+ +P + +
Sbjct: 398 KQYEEFAMSYQKLLANSQSLADLLDENMPDLPQDKPTPEEHGPGIDIFTPGKPGEYDLEG 457
Query: 440 AIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEA 499
IW+DED R FYE L DL+AFVPA+L + E N+ S K A+ D + ++ +
Sbjct: 458 GIWEDEDARNFYENLIDLKAFVPAILFKDNEKSQNKDSNKDDSKEAKEPKDNKEASSPDD 517
Query: 500 IEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTN 559
+E+ L+ +IN+ E E D+ +D K+ E++ + + + LK I
Sbjct: 518 LEL----ELENLEINDDTLELEGADEA--EDLTKKLLDEQEQEDEEASTGSHLKLI---- 577
Query: 560 LDALLQRLPGCVSRDLIDQLTVEFCY-LNSKANRKKLVRALFNVPRTSLELLPYYSRMVA 619
+DA LQ+LP CV+RDLID+ ++FC +N+KANRKKLVRALF VPR L+LLP+Y+R+VA
Sbjct: 578 VDAFLQQLPNCVNRDLIDKAAMDFCMNMNTKANRKKLVRALFIVPRQRLDLLPFYARLVA 637
Query: 620 TLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSC 679
TL CM DV+ L ML +F F + KKDQ+NIETK + +RFIGEL KFK+ + C
Sbjct: 638 TLHPCMSDVAEDLCSMLRGDFRFHVRKKDQINIETKNKTVRFIGELTKFKMFTKNDTLHC 697
Query: 680 LKACLDDFTHHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTL 739
LK L DF+HH+I++AC LLETCGRFL+RSPE+ +R + +LE +MR K +LD R+ T+
Sbjct: 698 LKMLLSDFSHHHIEMACTLLETCGRFLFRSPESHLRTSVLLEQMMRKKQAMHLDARYVTM 757
Query: 740 VENAYYLCKPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPW--SECEQ 799
VENAYY C PP V K RPPL +Y+RKLL+ DL K E VLRQ+RKLPW E +
Sbjct: 758 VENAYYYCNPPPAEKTVRKKRPPLQEYVRKLLYKDLSKVTTEKVLRQMRKLPWQDQEVKD 817
Query: 800 YLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQK 859
Y++ C + + KY IH +A+L +GL Y ++ + VVD VLE+IRLG+EVN Q+
Sbjct: 818 YVICCMINIWNVKYNSIHCVANLLAGLVLYQEDVGIHVVDGVLEDIRLGMEVNQPKFNQR 877
Query: 860 RIAHMRFLGELYNYELVDSSVVFDTLYLIIVFG-HGTSEQDVLDPPEDTFRIRMIITLLQ 919
RI+ +FLGELYNY +V+S+V+F TLY FG + LDPPE FRIR++ T+L
Sbjct: 878 RISSAKFLGELYNYRMVESAVIFRTLYSFTSFGVNPDGSPSSLDPPEHLFRIRLVCTILD 937
Query: 920 TCGHYFDRGSSKRKLDRFFIHFQKYILSKGAL---------PLDIEFDLQDLFAELQPNM 979
TCG YFDRGSSKRKLD F ++FQ+Y+ K +L P+DI++ + D L+P +
Sbjct: 938 TCGQYFDRGSSKRKLDCFLVYFQRYVWWKKSLEVWTKDHPFPIDIDYMISDTLELLRPKI 997
Query: 980 TRYSSIEEINAAFVELEEH---ERSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNG 1039
+S+EE +LE + + NDK + + + E +
Sbjct: 998 KLCNSLEESIRQVQDLEREFLIKLGLVNDKESKDSMTEGENLEE--------------DE 1057
Query: 1040 SKENGGAHEDGVDSDSDTGSGTIEAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDE 1099
+E GGA + + + E EG ++EE E E+ D D + E +E
Sbjct: 1058 EEEEGGAETEEQSGNESEVNEPEEEEGSEEEEEGEEEEEENTDYLTDSNKE---NETDEE 1117
Query: 1100 DDEVHVR----QKVPEVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMMIPMNLFE 1159
+ EV ++ + VP V E+ +F Q L +M E++ QR E L++ IP++L
Sbjct: 1118 NAEVMIKGGGLKHVPCV---EDEDFIQALDKMMLENLQQRSGESVKVHQLDVAIPLHL-- 1177
Query: 1160 GSTRDHHGRGVGGESGDEGLDEDAGGSKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTK 1219
++ G +GG G + + + +L ++GNKQQ K + +P L +
Sbjct: 1178 -KSQLRKGPPLGGGEG------ETESADTMPFVMLTRKGNKQQFKILNVPMSSQLAANHW 1237
Query: 1220 QKEAAELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGR 1230
++ AE EE+ +K+L L+ N+R+E+E +Q+ R P N
Sbjct: 1238 NQQQAEQEERMRMKKLTLDINERQEQE------DYQEMLQSLAQRPAPANTNR------- 1252
BLAST of HG10022266 vs. ExPASy Swiss-Prot
Match:
Q9HAU5 (Regulator of nonsense transcripts 2 OS=Homo sapiens OX=9606 GN=UPF2 PE=1 SV=1)
HSP 1 Score: 588.2 bits (1515), Expect = 2.2e-166
Identity = 434/1224 (35.46%), Positives = 662/1224 (54.08%), Query Frame = 0
Query: 79 HEDDGRPGGESQPKRDDEETVA-----RQEEIKKSFEA------KMALRQSNLN--PERP 138
H+++ R E Q KR EE A ++E I+ EA + LR N N RP
Sbjct: 97 HQEEERKKQEEQAKRQQEEEAAAQMKEKEESIQLHQEAWERHHLRKELRSKNQNAPDSRP 156
Query: 139 DSGFLRTLDSSIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRT 198
+ F LDSS+++NT +KKLK I E+QR+ L + +N+SK+++EAV++I +AKL+
Sbjct: 157 EENFFSRLDSSLKKNTAFVKKLKTITEQQRDSLSHDFNGLNLSKYIAEAVASIVEAKLKI 216
Query: 199 SDIQAAV----------------------QSGDELDADRNLKAMKKRSTLKLLMELFFVG 258
SD+ AV + + ++ K R+ L+ + EL VG
Sbjct: 217 SDVNCAVHLCSLFHQRYADFAPSLLQVWKKHFEARKEEKTPNITKLRTDLRFIAELTIVG 276
Query: 259 VVEDTAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQ-GRILLGLLP------TGQD 318
+ D + I + L +I + DR++ T+++++ SF R G + GL+P +
Sbjct: 277 IFTDKEGLSLIYEQLKNIIN-ADRESH-THVSVVISFCRHCGDDIAGLVPRKVKSAAEKF 336
Query: 319 HEEFFKSLSITADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDE 378
+ F S I+ ++++ F+ Y+ + + L+ +H L+ E++N +IL++KGEL+++
Sbjct: 337 NLSFPPSEIISPEKQQPFQNLLKEYFTSLTKHLKRDHRELQNTERQNRRILHSKGELSED 396
Query: 379 NVSSYEKLRKSYDHLYRNVSSLAEALDMQPPVMPEDGHTTRV-SAGEDVSSPAAGKDSSV 438
YE+ SY L N SLA+ LD P +P+D T G D+ +P + +
Sbjct: 397 RHKQYEEFAMSYQKLLANSQSLADLLDENMPDLPQDKPTPEEHGPGIDIFTPGKPGEYDL 456
Query: 439 IEAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEP--------KANEQSAKPAENLAESEA 498
IW+DED R FYE L DL+AFVPA+L + E K + + AK ++ E +
Sbjct: 457 EGGIWEDEDARNFYENLIDLKAFVPAILFKDNEKSCQNKESNKDDTKEAKESKENKEVSS 516
Query: 499 DQGQQTALEAIEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEK 558
+ LE +E++ D EG + K+ D+++ +D + G
Sbjct: 517 PDDLELELENLEINDDTLELEGGDEAEDLTKKLLDEQEQEDEEASTGSH----------- 576
Query: 559 EKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCY-LNSKANRKKLVRALFNVPRTSLE 618
LK I +DA LQ+LP CV+RDLID+ ++FC +N+KANRKKLVRALF VPR L+
Sbjct: 577 --LKLI----VDAFLQQLPNCVNRDLIDKAAMDFCMNMNTKANRKKLVRALFIVPRQRLD 636
Query: 619 LLPYYSRMVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFK 678
LLP+Y+R+VATL CM DV+ L ML +F F + KKDQ+NIETK + +RFIGEL KFK
Sbjct: 637 LLPFYARLVATLHPCMSDVAEDLCSMLRGDFRFHVRKKDQINIETKNKTVRFIGELTKFK 696
Query: 679 IASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNV 738
+ + CLK L DF+HH+I++AC LLETCGRFL+RSPE+ +R + +LE +MR K
Sbjct: 697 MFTKNDTLHCLKMLLSDFSHHHIEMACTLLETCGRFLFRSPESHLRTSVLLEQMMRKKQA 756
Query: 739 KNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRK 798
+LD R+ T+VENAYY C PP V K RPPL +Y+RKLL+ DL K E VLRQ+RK
Sbjct: 757 MHLDARYVTMVENAYYYCNPPPAEKTVKKKRPPLQEYVRKLLYKDLSKVTTEKVLRQMRK 816
Query: 799 LPW--SECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGL 858
LPW E + Y++ C + + KY IH +A+L +GL Y ++ + VVD VLE+IRLG+
Sbjct: 817 LPWQDQEVKDYVICCMINIWNVKYNSIHCVANLLAGLVLYQEDVGIHVVDGVLEDIRLGM 876
Query: 859 EVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLIIVFG-HGTSEQDVLDPPEDTF 918
EVN Q+RI+ +FLGELYNY +V+S+V+F TLY FG + LDPPE F
Sbjct: 877 EVNQPKFNQRRISSAKFLGELYNYRMVESAVIFRTLYSFTSFGVNPDGSPSSLDPPEHLF 936
Query: 919 RIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGAL---------PLDIEFDLQ 978
RIR++ T+L TCG YFDRGSSKRKLD F ++FQ+Y+ K +L P+DI++ +
Sbjct: 937 RIRLVCTILDTCGQYFDRGSSKRKLDCFLVYFQRYVWWKKSLEVWTKDHPFPIDIDYMIS 996
Query: 979 DLFAELQPNMTRYSSIEEINAAFVELEEH---ERSVSNDKPNTEKHLDAE--KPSRATSN 1038
D L+P + +S+EE +LE + + NDK + + + E +
Sbjct: 997 DTLELLRPKIKLCNSLEESIRQVQDLEREFLIKLGLVNDKDSKDSMTEGENLEEDEEEEE 1056
Query: 1039 TASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAEGRDDEESDLENHEDGCDTEDDE 1098
+ + N S+ N E+G D+D D EG ++EE + + D +++E
Sbjct: 1057 GGAETEEQSGNESEVNEPEEEEGSDNDDD--------EGEEEEEENTDYLTD--SNKENE 1116
Query: 1099 DDEEAGGPASDEDDEVHVR----QKVPEVDPREEANFEQELRAVMQESMDQRRQELRGRP 1158
DE E+ EV ++ + VP V E+ +F Q L +M E++ QR E
Sbjct: 1117 TDE--------ENTEVMIKGGGLKHVPCV---EDEDFIQALDKMMLENLQQRSGESVKVH 1176
Query: 1159 TLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEVQVKVLVKRGNKQQTKKMY 1218
L++ IP++L ++ G +GG G +A + + +L ++GNKQQ K +
Sbjct: 1177 QLDVAIPLHL---KSQLRKGPPLGGGEG------EAESADTMPFVMLTRKGNKQQFKILN 1236
Query: 1219 IPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNGLGSQTMNWMQTGGNRGVP 1230
+P L + ++ AE EE+ +K+L L+ N+R+E+E +Q+ R P
Sbjct: 1237 VPMSSQLAANHWNQQQAEQEERMRMKKLTLDINERQEQE------DYQEMLQSLAQRPAP 1255
BLAST of HG10022266 vs. ExPASy Swiss-Prot
Match:
O13824 (Nonsense-mediated mRNA decay protein 2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=upf2 PE=1 SV=1)
HSP 1 Score: 247.3 bits (630), Expect = 9.2e-64
Identity = 282/1165 (24.21%), Positives = 508/1165 (43.61%), Query Frame = 0
Query: 99 VARQEEIKKSFEAKMALRQSNLNPERPDSGFLRT---LDSSIRRNTTVIKKLK-QINEEQ 158
++R+E+IKK + R+ + D T LDSS+++NT +K+ K + E
Sbjct: 1 MSREEQIKK-LNQYLDNRELAFRAKDGDKNIFHTESQLDSSLKKNTAFMKRCKSSLTSEN 60
Query: 159 REGLMDELRNVNMSKFVSEAVSAICDAKLR---TSDIQAAV------------------- 218
+ + E++ +++ KF+ E +AI + ++ T DI ++V
Sbjct: 61 YDSFIKEIKTLSLKKFIPEITAAIVEGMMKCKATKDILSSVKIVWALNLRFSTAFTGPML 120
Query: 219 ------------------------QSGDEL-DADRNLKAMKKRSTLKLLMELFFVGVV-- 278
Q+ +E+ + DR+ +K R L+ L+E + GVV
Sbjct: 121 ANLYCALYPNPGYSLCHESYFELKQNENEVSEKDRSSHLLKVRPLLRFLIEFWLNGVVGT 180
Query: 279 -EDTAIF------------------NNIIKDLTSI--EHLRDRDTTLTNLTLLASFARQG 338
ED + N+ K L + L D L +L S R
Sbjct: 181 PEDFVSYLPSTDSNDKKFRKPWFEEQNLKKPLVVLLFNDLMDTRFGFLLLPVLTSLVRTF 240
Query: 339 RILLGLLPTGQDHE--EFFKSLSITADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQE 398
L +D E E L+ + + RK+ ++Y D Q + ++ ++
Sbjct: 241 SCELFTTEDFEDKETLELVNRLNPVV-WRTYLRKSLNSYVDKLEVYCQKRKSLFEELNKQ 300
Query: 399 NAKILNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEALDMQPPV-MPEDGHTTRVSAG 458
+ + + N+E KS + + + +SL+E L+ + + E + S+G
Sbjct: 301 YQEQSIIRADPNNEKFQRLANFSKSIESEFSSYASLSEVLNRKASEDLLELNFMEKASSG 360
Query: 459 EDVSSPAAGK--DSSVIEA--IWDDEDTRAFYECLPDLRAFVPAVLLGEAEPKANEQSAK 518
+ A+G+ +S+ +E +WDD + FYE P+ NE S
Sbjct: 361 TNSVFNASGERSESANVETAQVWDDREQYFFYEVFPNF----------------NEGS-- 420
Query: 519 PAENLAESEADQGQQTALEAIEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTDKEKGKEK 578
+AE + I E +E E NK D K
Sbjct: 421 ----IAE----------------------MKSSIYESSQEGIRSSSENNKKEDDLKDSTG 480
Query: 579 DADRKVENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEFCYLNSKANRKKLVRAL 638
D + + + +D L +LP VS +L +++ +EF LN+KA+R +L++AL
Sbjct: 481 DLNTTQVSSR----------VDNFLLKLPSMVSLELTNEMALEFYDLNTKASRNRLIKAL 540
Query: 639 FNVPRTSLELLPYYSRMVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIETKIRNIR 698
+PRTS L+PYY R+ LS + S L+ F ++++K + +T++ +R
Sbjct: 541 CTIPRTSSFLVPYYVRLARILSQLSSEFSTSLVDHARHSFKRMIHRKAKHEYDTRLLIVR 600
Query: 699 FIGELCKFKIASAGLVFSCLKACLDDFTHHNIDVACNLLETCGRFLYRSPETTVRMANML 758
+I EL KF++ +VF C K C+++FT +++V LLE+CGRFL R PET ++M + L
Sbjct: 601 YISELTKFQLMPFHMVFECYKLCINEFTPFDLEVLALLLESCGRFLLRYPETKLQMQSFL 660
Query: 759 EILMRLKNVKNLDPRHSTLVENAYYLCKPPERSARVSKVRPPLHQYIRKLLFSDLDKSAI 818
E + + K L + ++ENA + PP+R VSK + +++ L+ L +
Sbjct: 661 EAIQKKKLASALASQDQLVLENALHFVNPPKRGIIVSKKKSLKEEFLYDLIQIRLKDDNV 720
Query: 819 ENVLRQLRKLPWSECEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVL 878
L LRK W + Q L M+V KY ++ +A L S L ++H EF + V+D+ L
Sbjct: 721 FPTLLLLRKFDWKDDYQILYNTIMEVWNIKYNSLNALARLLSALYKFHPEFCIHVIDDTL 780
Query: 879 EEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLIIVFGHGTS-----E 938
E + + +D+ +QKR+A RF+ EL ++D + + L+ ++ S
Sbjct: 781 ESLFSAVNNSDHVEKQKRLAQARFISELCVIHMLDVRAITNFLFHLLPLEKFESFLTMKA 840
Query: 939 QDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLDIEFD 998
+ + D FR+R+I+ +LQTCG R +K+ + + + +Q Y L + +PLD+ ++
Sbjct: 841 STLTNINNDMFRLRLIVVVLQTCGPSIIRSKTKKTMLTYLLAYQCYFLIQPEMPLDMLYE 900
Query: 999 LQDLFAELQPNMTRYSSIEEINAAFVELEEHERSVSNDKPNTEKHLDAEKPSRATSNTAS 1058
+D+ ++P+M Y EE A L E +++S+D D +P +N
Sbjct: 901 FEDVIGYVRPSMKVYMHYEEARNA---LTERLQAISDDWEE-----DDTRPVFQGANDGD 960
Query: 1059 ANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAEGRDDEESDLENHEDGCDTEDDEDDE 1118
++ ++E+ ED I E DEES D D+ED+
Sbjct: 961 ------ISSNEESVYLPED------------ISDESETDEESSGLEESDLLDSEDE---- 1020
Query: 1119 EAGGPASDEDDEVHVRQKVPEVDPREEANFEQELRAVMQESMDQRRQELRGRPTLNMMIP 1178
D D+E+ + +++ ++E + ES+ R E P ++ +P
Sbjct: 1021 -------DIDNEMQLSREL-----------DEEFERLTNESLLTRMHE--KNPGFDVPLP 1048
BLAST of HG10022266 vs. ExPASy Swiss-Prot
Match:
P38798 (Nonsense-mediated mRNA decay protein 2 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=NMD2 PE=1 SV=2)
HSP 1 Score: 209.9 bits (533), Expect = 1.6e-52
Identity = 267/1175 (22.72%), Postives = 481/1175 (40.94%), Query Frame = 0
Query: 131 RTLDSSIRRNTTVIKKLKQ-INEEQREGLMDELRNVNMSKFVSEAVSAICDAKL----RT 190
+ LDSSI+RNT IKKLK+ + L+ +L ++ K++SE + + + L +
Sbjct: 28 KKLDSSIKRNTGFIKKLKKGFVKGSESSLLKDLSEASLEKYLSEIIVTVTECLLNVLNKN 87
Query: 191 SDIQAAVQ--SG---------------------------DELDADRNLKAMKKRSTLKLL 250
D+ AAV+ SG E + D + + + L++
Sbjct: 88 DDVIAAVEIISGLHQRFNGRFTSPLLGAFLQAFENPSVDIESERDELQRITRVKGNLRVF 147
Query: 251 MELFFVGV------VEDTAIFNNII------KDLTSIEHLRDRDTTLTNLTLLASFARQG 310
EL+ VGV +E N + KD LR+ L + A
Sbjct: 148 TELYLVGVFRTLDDIESKDAIPNFLQKKTGRKDPLLFSILREILNYKFKLGFTTTIAT-- 207
Query: 311 RILLGLLPTGQDHEEFFKSLSITADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENA 370
+ P +D + + L + K + F + DA H + ++++E+
Sbjct: 208 AFIKKFAPLFRDDDNSWDDLIYDSKLKGALQSLFKNFIDATFARATELHKKVNKLQREHQ 267
Query: 371 KILNAKGELNDENVSSYEKLRKSYDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDV 430
K G+L DE V Y+KL + + +L E ++ P + ++ +D+
Sbjct: 268 KCQIRTGKLRDEYVEEYDKLLPIFIRFKTSAITLGEFFKLEIPEL-------EGASNDDL 327
Query: 431 SSPAAGKDSSVI----EAIWDDEDTRAFYECLPDLRAFVPAVLLGEAEPKANEQSAKPAE 490
A+ ++ I + +W++EDTR FYE LPD+
Sbjct: 328 KETASPMITNQILPPNQRLWENEDTRKFYEILPDI------------------------- 387
Query: 491 NLAESEADQGQQTALEAIEVSTDCSLQEGKINEKGKEKEEKDKEKNKDTDKEKGKEKDAD 550
K EE K
Sbjct: 388 ----------------------------------SKTVEESQSSKT-------------- 447
Query: 551 RKVENEKEKLKNIEGTNLDALLQRLPGCVSRDLIDQLTVEF--CYLNSKANRKKLVRALF 610
EK N+ N++ L +D+ID L+ + YL++KA R ++++ F
Sbjct: 448 -------EKDSNVNSKNINLFFTDLEMADCKDIIDDLSNRYWSSYLDNKATRNRILK--F 507
Query: 611 NVPRTSLELLPYYSRMVATLSTCMKDVSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRF 670
+ LP YSR +AT S M ++ + L+ F L+ + ++NI F
Sbjct: 508 FMETQDWSKLPVYSRFIATNSKYMPEIVSEFINYLDNGFRSQLHSN-----KINVKNIIF 567
Query: 671 IGELCKFKIASAGLVFSCLKACLDDF-THHNIDVACNLLETCGRFLYRSPETTVRMANML 730
E+ KF++ + ++F ++ + +N+++ LLE G+FL PE M M+
Sbjct: 568 FSEMIKFQLIPSFMIFHKIRTLIMYMQVPNNVEILTVLLEHSGKFLLNKPEYKELMEKMV 627
Query: 731 EILMRLKNVKNLDPRHSTLVENAYYLCKPPE-RSARVS-KVRPPLHQYIRKLLFSDLDKS 790
+++ KN + L+ + +EN L PP +S V+ K P Q+ R L+ S+L
Sbjct: 628 QLIKDKKNDRQLNMNMKSALENIITLLYPPSVKSLNVTVKTITPEQQFYRILIRSELSSL 687
Query: 791 AIENVLRQLRKLPWSE--CEQYLLKCFMKVHKGKYGQIHLIASLTSGLSRYHDEFSVAVV 850
+++++ +RK W + ++ L F K HK Y I L+ + GL Y +F + +
Sbjct: 688 DFKHIVKLVRKAHWDDVAIQKVLFSLFSKPHKISYQNIPLLTKVLGGLYSYRRDFVIRCI 747
Query: 851 DEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYNYELVDSSVVFDTLYLIIVFGHGTSEQ 910
D+VLE I GLE+NDYG RI+++R+L E++N+E++ S V+ DT+Y II FGH ++
Sbjct: 748 DQVLENIERGLEINDYGQNMHRISNVRYLTEIFNFEMIKSDVLLDTIYHIIRFGHINNQP 807
Query: 911 DVL-----DPPEDTFRIRMIITLLQTCGHYFDRGSSKRKLDRFFIHFQKYILSKGALPLD 970
+ DPP++ FRI+++ T+L + K KL F + +I + LP +
Sbjct: 808 NPFYLNYSDPPDNYFRIQLVTTILLNINRTPAAFTKKCKLLLRFFEYYTFI-KEQPLPKE 867
Query: 971 IEFDLQDLFAELQP--NMTRYSSIEEINAAFVELEEHERSVSNDKPNTEKHLDAEKPSRA 1030
EF + F + + T++ E + + LE +S++ K K R
Sbjct: 868 TEFRVSSTFKKYENIFGNTKFERSENLVESASRLESLLKSLNAIK---------SKDDRV 927
Query: 1031 TSNTASA-NGRDT-------VNGSKENGGAHEDGVD----------SDSDTGSGTIEAEG 1090
++AS NG+++ ++ ++DGVD S +T S + +
Sbjct: 928 KGSSASIHNGKESAVPIESITEDDEDEDDENDDGVDLLGEDEDAEISTPNTESAPGKHQA 987
Query: 1091 RDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDD------------------------- 1150
+ DE D ++ +D D +DD+DD++ G DEDD
Sbjct: 988 KQDESEDEDDEDDDEDDDDDDDDDDDDGEEGDEDDDEDDDDEDDDDEEEEDSDSDLEYGG 1047
Query: 1151 --------------EVHVRQKVPEVDPREEANFEQELRAVMQESMDQRRQE--------L 1177
E + R+ E + + E E++ + +MQES+D R+ E +
Sbjct: 1048 DLDADRDIEMKRMYEEYERKLKDEEERKAEEELERQFQKMMQESIDARKSEKVVASKIPV 1085
BLAST of HG10022266 vs. ExPASy TrEMBL
Match:
A0A1S3CJK4 (LOW QUALITY PROTEIN: regulator of nonsense transcripts UPF2 OS=Cucumis melo OX=3656 GN=LOC103501680 PE=4 SV=1)
HSP 1 Score: 2150.9 bits (5572), Expect = 0.0e+00
Identity = 1136/1195 (95.06%), Positives = 1148/1195 (96.07%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ
Sbjct: 61 SIKRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED
Sbjct: 121 SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
+AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PT QDHEEFFKSL+IT
Sbjct: 181 SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTAQDHEEFFKSLNIT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQ+EQENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQLEQENAKILNAKGELNDENVSSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRNVSS AEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA
Sbjct: 301 YDHLYRNVSSFAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
FYECLPDLRAFVPAVLLGEAEPKANEQSAKP ENLAESEA+QGQQT+LEAIEVSTDC LQ
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPTENLAESEAEQGQQTSLEAIEVSTDCPLQ 420
Query: 496 EGKIN---EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
+GKIN EKGK++EEKDKEKN DTDKEKGKEKD DRK+ENEK KLKNIEGTNLDALLQR
Sbjct: 421 DGKINEKGEKGKDREEKDKEKNNDTDKEKGKEKDGDRKMENEKXKLKNIEGTNLDALLQR 480
Query: 556 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD
Sbjct: 481 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 540
Query: 616 VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF
Sbjct: 541 VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 600
Query: 676 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC
Sbjct: 601 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 660
Query: 736 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH
Sbjct: 661 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 720
Query: 796 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE
Sbjct: 721 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 780
Query: 856 LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
LYNYELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS
Sbjct: 781 LYNYELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 840
Query: 916 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE
Sbjct: 841 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 900
Query: 976 RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
RSVSNDKPNTEKHLDAEKPSRATS T SANGRD VNGSKEN GAHEDG DSDSDTGSGTI
Sbjct: 901 RSVSNDKPNTEKHLDAEKPSRATSTTTSANGRDRVNGSKENSGAHEDGADSDSDTGSGTI 960
Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
EAEGRDDEESDLENHEDGCDTEDDEDDEE GGPASDEDDEVHVRQKVPEVDPREEANFEQ
Sbjct: 961 EAEGRDDEESDLENHEDGCDTEDDEDDEEPGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1020
Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGS 1155
ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDEGLDEDAGGS
Sbjct: 1021 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEGLDEDAGGS 1080
Query: 1156 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1215
KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE
Sbjct: 1081 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1140
Query: 1216 LNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
LNGLGSQTMNWMQTGGNR VPTRGNNWEASGGRSGGSRHPHHRYPGSG+HYSRKK
Sbjct: 1141 LNGLGSQTMNWMQTGGNR-VPTRGNNWEASGGRSGGSRHPHHRYPGSGMHYSRKK 1194
BLAST of HG10022266 vs. ExPASy TrEMBL
Match:
A0A0A0KS34 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G497020 PE=4 SV=1)
HSP 1 Score: 2150.6 bits (5571), Expect = 0.0e+00
Identity = 1138/1196 (95.15%), Positives = 1151/1196 (96.24%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDHHEDDGRPGGESQPKRDDEE+VARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1 MDHHEDDGRPGGESQPKRDDEESVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNTTVIKKLKQINEEQREGLMD+LRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ
Sbjct: 61 SIKRNTTVIKKLKQINEEQREGLMDDLRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED
Sbjct: 121 SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
+AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PT QDHEEFFKSL+IT
Sbjct: 181 SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTAQDHEEFFKSLNIT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRNVSS AEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA
Sbjct: 301 YDHLYRNVSSFAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEA+QGQQT+LEAIEVSTDC LQ
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEAEQGQQTSLEAIEVSTDCLLQ 420
Query: 496 EGKIN---EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
+GKIN EKGK++EEKDKEKN DTDKEKGKEKD DRK+ENEKEKLKNIEGTNLDALLQR
Sbjct: 421 DGKINEKGEKGKDREEKDKEKNNDTDKEKGKEKDGDRKMENEKEKLKNIEGTNLDALLQR 480
Query: 556 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD
Sbjct: 481 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 540
Query: 616 VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
VSVILLQMLEEEF+FLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF
Sbjct: 541 VSVILLQMLEEEFSFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 600
Query: 676 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC
Sbjct: 601 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 660
Query: 736 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH
Sbjct: 661 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 720
Query: 796 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE
Sbjct: 721 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 780
Query: 856 LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
LYNYELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS
Sbjct: 781 LYNYELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 840
Query: 916 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE
Sbjct: 841 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 900
Query: 976 RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
RSVSNDKPNTEKHLDAEKPSRATSN SANGRDTVNGSKENGGAHEDG DSDSDTGSGTI
Sbjct: 901 RSVSNDKPNTEKHLDAEKPSRATSNITSANGRDTVNGSKENGGAHEDGADSDSDTGSGTI 960
Query: 1036 EAEGRDDEESDLE-NHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFE 1095
EAEGRDDEESDLE NHEDGCDTEDDEDDEE GGPASDEDDEVHVRQKVPEVDPREEANFE
Sbjct: 961 EAEGRDDEESDLENNHEDGCDTEDDEDDEEPGGPASDEDDEVHVRQKVPEVDPREEANFE 1020
Query: 1096 QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGG 1155
QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDEGLDEDAGG
Sbjct: 1021 QELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEGLDEDAGG 1080
Query: 1156 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1215
SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE
Sbjct: 1081 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1140
Query: 1216 ELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
ELNGLGSQTMNWMQTGGNR VPTRGNNWE SGGRSGGSRHPHHRYPGSGVHYSRKK
Sbjct: 1141 ELNGLGSQTMNWMQTGGNR-VPTRGNNWEGSGGRSGGSRHPHHRYPGSGVHYSRKK 1195
BLAST of HG10022266 vs. ExPASy TrEMBL
Match:
A0A6J1FEV2 (regulator of nonsense transcripts UPF2-like OS=Cucurbita moschata OX=3662 GN=LOC111445073 PE=4 SV=1)
HSP 1 Score: 2118.2 bits (5487), Expect = 0.0e+00
Identity = 1114/1192 (93.46%), Positives = 1139/1192 (95.55%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDHHEDDGRP ESQPKRDDEET ARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1 MDHHEDDGRPVSESQPKRDDEETAARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNTT+IKKLKQIN+EQREGLMDELRNVNMSKFVSEAVSAICDAKLR SDIQAAVQ
Sbjct: 61 SIKRNTTIIKKLKQINDEQREGLMDELRNVNMSKFVSEAVSAICDAKLRASDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
SGDELDADRNLKAMKKRSTLKLL+ELFFVGVVED
Sbjct: 121 SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLLELFFVGVVED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
+AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+IT
Sbjct: 181 SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTGQDHEEFFKSLNIT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAK+LNAKGELNDENVSSYEKLRKS
Sbjct: 241 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKLLNAKGELNDENVSSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRNVSSLAEA+DMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSV+EAIWDDEDTRA
Sbjct: 301 YDHLYRNVSSLAEAVDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVLEAIWDDEDTRA 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQT+LEA+E+STD LQ
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTSLEAVEISTDSLLQ 420
Query: 496 EGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPG 555
+GKINEKGK+KEEKDKEKNKDTDKEKGKEKDADRK+ENEKEKLKNIEGTNLDALLQRLPG
Sbjct: 421 DGKINEKGKDKEEKDKEKNKDTDKEKGKEKDADRKMENEKEKLKNIEGTNLDALLQRLPG 480
Query: 556 CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV 615
CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSR+VATLSTCMKDVSV
Sbjct: 481 CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRLVATLSTCMKDVSV 540
Query: 616 ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 675
ILLQMLEEEFNFLLNKKDQMNIETKI+NIRFIGELCKFKIASAGLVFSCLKACLDDFTHH
Sbjct: 541 ILLQMLEEEFNFLLNKKDQMNIETKIKNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 600
Query: 676 NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 735
NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP
Sbjct: 601 NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 660
Query: 736 ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 795
ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK
Sbjct: 661 ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 720
Query: 796 YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 855
YG IHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN
Sbjct: 721 YGHIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 780
Query: 856 YELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 915
YELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK
Sbjct: 781 YELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 840
Query: 916 LDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSV 975
LDRFFIHFQKYILSKGALPLD+EFDLQDLFAELQPNMTRYSSIEEINAAFVEL+EHERSV
Sbjct: 841 LDRFFIHFQKYILSKGALPLDVEFDLQDLFAELQPNMTRYSSIEEINAAFVELDEHERSV 900
Query: 976 SNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAE 1035
SN KPN EKHLDA+KPSRATSNT SANGRDT+NGSKENG AHEDGVDSDSDTGSGTIEAE
Sbjct: 901 SNVKPNIEKHLDAKKPSRATSNTTSANGRDTMNGSKENGAAHEDGVDSDSDTGSGTIEAE 960
Query: 1036 GRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQELR 1095
G DDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVR KV EVDPREEANFEQELR
Sbjct: 961 GHDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRPKVAEVDPREEANFEQELR 1020
Query: 1096 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEV 1155
AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDE L+EDAGGSKEV
Sbjct: 1021 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEALEEDAGGSKEV 1080
Query: 1156 QVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNG 1215
QVKVLVKRGNKQQTKKMYIPRD LLQSTKQKEA ELEEKQDIKRLILEYNDREEEELNG
Sbjct: 1081 QVKVLVKRGNKQQTKKMYIPRDSALLQSTKQKEAEELEEKQDIKRLILEYNDREEEELNG 1140
Query: 1216 LGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
LGSQTMNWMQTG NR PTRGNNW+ASGGRSGGS HPHHRYPG G+HYSRKK
Sbjct: 1141 LGSQTMNWMQTGNNR-APTRGNNWDASGGRSGGSHHPHHRYPGGGMHYSRKK 1191
BLAST of HG10022266 vs. ExPASy TrEMBL
Match:
A0A6J1JX56 (regulator of nonsense transcripts UPF2-like OS=Cucurbita maxima OX=3661 GN=LOC111489130 PE=4 SV=1)
HSP 1 Score: 2117.4 bits (5485), Expect = 0.0e+00
Identity = 1114/1192 (93.46%), Positives = 1139/1192 (95.55%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDHHEDDGR ESQPKRDDEET ARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS
Sbjct: 1 MDHHEDDGRSVSESQPKRDDEETAARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNTT+IKKLKQIN+EQREGLMDELRNVNMSKFVSEAVSAICDAKLR SDIQAAVQ
Sbjct: 61 SIKRNTTIIKKLKQINDEQREGLMDELRNVNMSKFVSEAVSAICDAKLRASDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
SGDELDADRNLKAMKKRSTLKLL+ELFFVGVVED
Sbjct: 121 SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLLELFFVGVVED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
+AIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+IT
Sbjct: 181 SAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLPPTGQDHEEFFKSLNIT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAK+LNAKGELNDENVSSYEKLRKS
Sbjct: 241 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKLLNAKGELNDENVSSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRNVSSLAEA+DMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSV+EAIWDDEDTRA
Sbjct: 301 YDHLYRNVSSLAEAVDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVLEAIWDDEDTRA 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQT+LEA+E+STD L+
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTSLEAVEISTDSLLE 420
Query: 496 EGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPG 555
+GKINEKGK+KEEKDKEKNKDTDKEKGKEKDADRK+ENEKEKLKNIEGTNLDALLQRLPG
Sbjct: 421 DGKINEKGKDKEEKDKEKNKDTDKEKGKEKDADRKMENEKEKLKNIEGTNLDALLQRLPG 480
Query: 556 CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV 615
CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSR+VATLSTCMKDVSV
Sbjct: 481 CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRLVATLSTCMKDVSV 540
Query: 616 ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 675
ILLQMLEEEFNFLLNKKDQMNIETKI+NIRFIGELCKFKIASAGLVFSCLKACLDDFTHH
Sbjct: 541 ILLQMLEEEFNFLLNKKDQMNIETKIKNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 600
Query: 676 NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 735
NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP
Sbjct: 601 NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 660
Query: 736 ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 795
ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK
Sbjct: 661 ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 720
Query: 796 YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 855
YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN
Sbjct: 721 YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 780
Query: 856 YELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 915
YELVDSSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRM+ITLLQTCGHYFDRGSSKRK
Sbjct: 781 YELVDSSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMVITLLQTCGHYFDRGSSKRK 840
Query: 916 LDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSV 975
LDRFFIHFQKYILSKGALPLD+EFDLQDLFAELQPNMTRYSSIEEINAAFVEL+EHERSV
Sbjct: 841 LDRFFIHFQKYILSKGALPLDVEFDLQDLFAELQPNMTRYSSIEEINAAFVELDEHERSV 900
Query: 976 SNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTIEAE 1035
SN KPNTEKHLDA+KPSRATSNT SANGRDTVNGSKENG AHEDGVDSDSDTGSGTIEAE
Sbjct: 901 SNVKPNTEKHLDAKKPSRATSNTTSANGRDTVNGSKENGAAHEDGVDSDSDTGSGTIEAE 960
Query: 1036 GRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQELR 1095
G DEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVR KV EVDPREEANFEQELR
Sbjct: 961 GHGDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRPKVAEVDPREEANFEQELR 1020
Query: 1096 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGSKEV 1155
AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG GGESGDE L+EDAGGSKEV
Sbjct: 1021 AVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGAGGESGDEALEEDAGGSKEV 1080
Query: 1156 QVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEELNG 1215
QVKVLVKRGNKQQTKKMYIPRD LLQSTKQKEA ELEEKQDIKRLILEYNDREEEELNG
Sbjct: 1081 QVKVLVKRGNKQQTKKMYIPRDSALLQSTKQKEAEELEEKQDIKRLILEYNDREEEELNG 1140
Query: 1216 LGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
LGSQTMNWMQTG NR PTRGNNW+ASGGRSGGS HPHHRYPG GVHYSRKK
Sbjct: 1141 LGSQTMNWMQTGNNR-APTRGNNWDASGGRSGGSHHPHHRYPGGGVHYSRKK 1191
BLAST of HG10022266 vs. ExPASy TrEMBL
Match:
A0A6J1BVY0 (regulator of nonsense transcripts UPF2 OS=Momordica charantia OX=3673 GN=LOC111006232 PE=4 SV=1)
HSP 1 Score: 2092.8 bits (5421), Expect = 0.0e+00
Identity = 1109/1195 (92.80%), Positives = 1140/1195 (95.40%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDHHEDDGR GGESQPKRDDEETVARQEEIKKSFEAK+ALRQSNLNPERPDSGFLRTLDS
Sbjct: 1 MDHHEDDGRLGGESQPKRDDEETVARQEEIKKSFEAKIALRQSNLNPERPDSGFLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNTT+IKKLKQINEEQREGLMDELRNVNMSKFVSEAV+AICDAKLR SDIQAAVQ
Sbjct: 61 SIKRNTTIIKKLKQINEEQREGLMDELRNVNMSKFVSEAVAAICDAKLRASDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
SGDELDADRNLKAMKKRSTLKLL+ELFFVGV+ED
Sbjct: 121 SLLHQRYKDFSPCLIQGLLKVFFPGKSGDELDADRNLKAMKKRSTLKLLLELFFVGVIED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
AIFNNIIKDLTSIEHLRDRD TLTNLTLLASFARQGRILLGL PTGQDHEEFFKSL+IT
Sbjct: 181 CAIFNNIIKDLTSIEHLRDRDATLTNLTLLASFARQGRILLGLPPTGQDHEEFFKSLNIT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKKFFRK FHTYYDAA+ELLQSEHTSLRQME ENAKILNAKGELNDENVSSYEKLRKS
Sbjct: 241 ADQKKFFRKVFHTYYDAASELLQSEHTSLRQMEHENAKILNAKGELNDENVSSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSV+EAIWDDEDTRA
Sbjct: 301 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVLEAIWDDEDTRA 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEADQGQQTALEAIEVSTDCSLQ 495
FYECLPDLRAFVPAVLLGEAEPKAN+QS KPAEN+AESEADQGQQT+LEA+E+STDCSLQ
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKANDQSTKPAENMAESEADQGQQTSLEAVEISTDCSLQ 420
Query: 496 EGKINEKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQRLPG 555
+GKINEKGK+KEEKDKEK+KDTDKEKGKEKDADRK+ENEKEKLKN+EGTNLDALLQRLPG
Sbjct: 421 DGKINEKGKDKEEKDKEKSKDTDKEKGKEKDADRKMENEKEKLKNVEGTNLDALLQRLPG 480
Query: 556 CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV 615
CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV
Sbjct: 481 CVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKDVSV 540
Query: 616 ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDFTHH 675
ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKI+SAGLVFSCLK+CLDDFTHH
Sbjct: 541 ILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKISSAGLVFSCLKSCLDDFTHH 600
Query: 676 NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 735
NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP
Sbjct: 601 NIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLCKPP 660
Query: 736 ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 795
ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK
Sbjct: 661 ERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVHKGK 720
Query: 796 YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 855
YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN
Sbjct: 721 YGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGELYN 780
Query: 856 YELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 915
YELV+SSVVFDTLYLI+VFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK
Sbjct: 781 YELVESSVVFDTLYLILVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSSKRK 840
Query: 916 LDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHERSV 975
LDRFFIHFQKYILSKGALPLD+EFDLQDLFAELQPNMTRYSSIEEINAAFVEL+EHERSV
Sbjct: 841 LDRFFIHFQKYILSKGALPLDVEFDLQDLFAELQPNMTRYSSIEEINAAFVELDEHERSV 900
Query: 976 -SNDKPNTEKHLDAEK-PSRATSNTASANGRDTVNGSKENG-GAHEDGVDSDSDTGSGTI 1035
S+DKPNTEKHLDAEK PSR TSNT SANGRDTVNGS+ENG AHED DSDSDTGSGTI
Sbjct: 901 SSSDKPNTEKHLDAEKTPSRTTSNTTSANGRDTVNGSRENGAAAHEDVADSDSDTGSGTI 960
Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
EAEGRDDEESDLENHEDG D+EDDEDDEE GGPASDEDDEVHVRQKV EVDPREEANFEQ
Sbjct: 961 EAEGRDDEESDLENHEDG-DSEDDEDDEEGGGPASDEDDEVHVRQKVAEVDPREEANFEQ 1020
Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRGVGGESGDEGLDEDAGGS 1155
ELRAVMQESMDQRRQE+RGRPTLNMMIPMNLFEG TRDHHGRGVGGESGDE LDEDAGG+
Sbjct: 1021 ELRAVMQESMDQRRQEIRGRPTLNMMIPMNLFEG-TRDHHGRGVGGESGDEALDEDAGGT 1080
Query: 1156 KEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1215
KEVQVKVLVKRGNKQQTKKM+IPRDC LLQSTKQKEAAELEEKQDIKRLILEYNDREEEE
Sbjct: 1081 KEVQVKVLVKRGNKQQTKKMFIPRDCALLQSTKQKEAAELEEKQDIKRLILEYNDREEEE 1140
Query: 1216 LNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
LNGLGSQTMNWMQTGGNR VPTRGNNWE SGGRSGG RHPHHRY G GVHYSRKK
Sbjct: 1141 LNGLGSQTMNWMQTGGNR-VPTRGNNWEGSGGRSGGPRHPHHRYIGGGVHYSRKK 1192
BLAST of HG10022266 vs. TAIR 10
Match:
AT2G39260.1 (binding;RNA binding)
HSP 1 Score: 1538.5 bits (3982), Expect = 0.0e+00
Identity = 831/1196 (69.48%), Positives = 992/1196 (82.94%), Query Frame = 0
Query: 76 MDHHEDDGRPGGESQPKRDDEETVARQEEIKKSFEAKMALRQSNLNPERPDSGFLRTLDS 135
MDH ED+ K+DDEE +AR EEIKKS EAK+ LRQ+NLNPERPDS +LRTLDS
Sbjct: 1 MDHPEDESH-----SEKQDDEEALARLEEIKKSIEAKLTLRQNNLNPERPDSAYLRTLDS 60
Query: 136 SIRRNTTVIKKLKQINEEQREGLMDELRNVNMSKFVSEAVSAICDAKLRTSDIQAAVQ-- 195
SI+RNT VIKKLKQINEEQREGLMD+LR VN+SKFVSEAV+AIC+AKL++SDIQAAVQ
Sbjct: 61 SIKRNTAVIKKLKQINEEQREGLMDDLRGVNLSKFVSEAVTAICEAKLKSSDIQAAVQIC 120
Query: 196 --------------------------SGDELDADRNLKAMKKRSTLKLLMELFFVGVVED 255
S ++L+AD+N KAMKKRSTLKLL+EL++VGV+ED
Sbjct: 121 SLLHQRYKEFSASLTQGLLKVFFPGKSAEDLEADKNSKAMKKRSTLKLLLELYYVGVIED 180
Query: 256 TAIFNNIIKDLTSIEHLRDRDTTLTNLTLLASFARQGRILLGLLPTGQDHEEFFKSLSIT 315
+ IF NIIKDLTS+E L+DRDTT TNLTLL SFARQGRI LGL +GQD E+FFK L +T
Sbjct: 181 SNIFINIIKDLTSVEQLKDRDTTQTNLTLLTSFARQGRIFLGLPISGQD-EDFFKGLDVT 240
Query: 316 ADQKKFFRKAFHTYYDAAAELLQSEHTSLRQMEQENAKILNAKGELNDENVSSYEKLRKS 375
ADQKK F+KAF+TYYDA A+LLQSEH L QME+ENAK++NAKGEL++++ SSYEKLRKS
Sbjct: 241 ADQKKSFKKAFNTYYDALADLLQSEHKLLLQMEKENAKLVNAKGELSEDSASSYEKLRKS 300
Query: 376 YDHLYRNVSSLAEALDMQPPVMPEDGHTTRVSAGEDVSSPAAGKDSSVIEAIWDDEDTRA 435
YDHLYRN+SSLAEALDMQPPVMPEDG TTR++AG++ S KD+SV E IWDDEDT+
Sbjct: 301 YDHLYRNISSLAEALDMQPPVMPEDG-TTRLTAGDEASPSGTVKDTSVPEPIWDDEDTKT 360
Query: 436 FYECLPDLRAFVPAVLLGEAEPKANEQSAKPAENLAESEAD--QGQQTALEAIEVSTDCS 495
FYECLPDLRAFVPAVLLGEAEPK+NEQSAK E L+ES ++ + QQT + EVS D +
Sbjct: 361 FYECLPDLRAFVPAVLLGEAEPKSNEQSAKAKEKLSESSSEVVENQQTTEDTTEVSADSA 420
Query: 496 LQEGKIN-EKGKEKEEKDKEKNKDTDKEKGKEKDADRKVENEKEKLKNIEGTNLDALLQR 555
+ + N E+ KEKEE +KEK KDT KEKGKEKD+++K+E+EKEK K+++ N + LLQR
Sbjct: 421 SMDDRSNAEQPKEKEEVEKEKAKDTKKEKGKEKDSEKKMEHEKEKGKSLDVANFERLLQR 480
Query: 556 LPGCVSRDLIDQLTVEFCYLNSKANRKKLVRALFNVPRTSLELLPYYSRMVATLSTCMKD 615
LPGCVSRDLIDQLTVE+CYLNSK NRKKLV+ALFNVPRTSLELL YYSRMVATL++CMKD
Sbjct: 481 LPGCVSRDLIDQLTVEYCYLNSKTNRKKLVKALFNVPRTSLELLAYYSRMVATLASCMKD 540
Query: 616 VSVILLQMLEEEFNFLLNKKDQMNIETKIRNIRFIGELCKFKIASAGLVFSCLKACLDDF 675
+ +L+QMLE+EFN L++KKDQMNIETKIRNIRFIGELCKFKI AGLVFSCLKACLD+F
Sbjct: 541 IPSMLVQMLEDEFNSLVHKKDQMNIETKIRNIRFIGELCKFKIVPAGLVFSCLKACLDEF 600
Query: 676 THHNIDVACNLLETCGRFLYRSPETTVRMANMLEILMRLKNVKNLDPRHSTLVENAYYLC 735
THHNIDVACNLLETCGRFLYRSPETT+RM NML+ILMRLKNVKNLDPR STLVENAYYLC
Sbjct: 601 THHNIDVACNLLETCGRFLYRSPETTLRMTNMLDILMRLKNVKNLDPRQSTLVENAYYLC 660
Query: 736 KPPERSARVSKVRPPLHQYIRKLLFSDLDKSAIENVLRQLRKLPWSECEQYLLKCFMKVH 795
KPPERSAR+SKVRPPLHQY+RKLLFSDLDK +I NVL+QLRKLPWSECEQY+LKCFMKVH
Sbjct: 661 KPPERSARISKVRPPLHQYVRKLLFSDLDKDSIANVLKQLRKLPWSECEQYILKCFMKVH 720
Query: 796 KGKYGQIHLIASLTSGLSRYHDEFSVAVVDEVLEEIRLGLEVNDYGMQQKRIAHMRFLGE 855
KGKYGQIHLIASLTSGLSR+HDEF VAVVDEVLEEIR+GLE+N+YG QQKR+AHMRFLGE
Sbjct: 721 KGKYGQIHLIASLTSGLSRHHDEFVVAVVDEVLEEIRVGLELNEYGAQQKRLAHMRFLGE 780
Query: 856 LYNYELVDSSVVFDTLYLIIVFGHGTSEQDVLDPPEDTFRIRMIITLLQTCGHYFDRGSS 915
LYNYE VDSSV+F+TLYL +++GH TSEQ+VLDPPED FR+RM+I LL+TCGHYFDRGSS
Sbjct: 781 LYNYEHVDSSVIFETLYLTLLYGHDTSEQEVLDPPEDFFRVRMVIILLETCGHYFDRGSS 840
Query: 916 KRKLDRFFIHFQKYILSKGALPLDIEFDLQDLFAELQPNMTRYSSIEEINAAFVELEEHE 975
K++LD+F IHFQ+YILSKG LPLDIEFDLQDLFA L+PNMTRYS+I+E+NAA ++LEE E
Sbjct: 841 KKRLDQFLIHFQRYILSKGHLPLDIEFDLQDLFANLRPNMTRYSTIDEVNAAILQLEERE 900
Query: 976 RSVSNDKPNTEKHLDAEKPSRATSNTASANGRDTVNGSKENGGAHEDGVDSDSDTGSGTI 1035
+ S DK + E+H D + ++++S+ S+NG+ T +ENG AH G +SDSD+GSG++
Sbjct: 901 HASSGDKVSIERHSDTKPSNKSSSDVISSNGKSTAKDIRENGEAH--GEESDSDSGSGSV 960
Query: 1036 EAEGRDDEESDLENHEDGCDTEDDEDDEEAGGPASDEDDEVHVRQKVPEVDPREEANFEQ 1095
+G+ +EE D NHE G ++ D +D ++ GP SD DD+ VRQKV VD E+A+F+Q
Sbjct: 961 VRDGQ-NEELDDGNHERGSESGDGDDYDDGDGPGSD-DDKFRVRQKVVTVDLEEQADFDQ 1020
Query: 1096 ELRAVMQESMDQRRQELRGRPTLNMMIPMNLFEGSTRDHHGRG-VGGESGDEGLDEDAGG 1155
EL+A++QESM+QR+ ELRGRP LNM IPM++FEGS +DHH G V GE+G+E LDE+ G
Sbjct: 1021 ELKALLQESMEQRKLELRGRPALNMTIPMSVFEGSGKDHHHFGRVVGENGEEVLDEENGE 1080
Query: 1156 SKEVQVKVLVKRGNKQQTKKMYIPRDCTLLQSTKQKEAAELEEKQDIKRLILEYNDREEE 1215
+EVQVKVLVKRGNKQQT++M IP DC L+QSTKQKEAAELEEKQDIKRL+LEYN+R+EE
Sbjct: 1081 QREVQVKVLVKRGNKQQTRQMLIPSDCALVQSTKQKEAAELEEKQDIKRLVLEYNERDEE 1140
Query: 1216 ELNGLGSQTMNWMQTGGNRGVPTRGNNWEASGGRSGGSRHPHHRYPGSGVHYSRKK 1240
E NGLG+Q +NW +GG+RG G G+SGGSRH + + G G Y ++
Sbjct: 1141 EANGLGTQILNW-TSGGSRGSTRTGE----GSGKSGGSRHRFYYHQGGGGSYHARR 1180
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038890797.1 | 0.0e+00 | 95.31 | regulator of nonsense transcripts UPF2 [Benincasa hispida]
XP_008463566.1 | 0.0e+00 | 95.06 | PREDICTED: LOW QUALITY PROTEIN: regulator of nonsense transcripts UPF2 [Cucumis ...
XP_004143811.1 | 0.0e+00 | 95.15 | regulator of nonsense transcripts UPF2 [Cucumis sativus] >KGN51237.1 hypothetica...
KAG6578740.1 | 0.0e+00 | 91.78 | Regulator of nonsense transcripts UPF2, partial [Cucurbita argyrosperma subsp. s...
XP_023550316.1 | 0.0e+00 | 93.62 | regulator of nonsense transcripts UPF2-like [Cucurbita pepo subsp. pepo]
Match Name | E-value | Identity | Description
F4IUX6 | 0.0e+00 | 69.48 | Regulator of nonsense transcripts UPF2 OS=Arabidopsis thaliana OX=3702 GN=UPF2 P...
A2AT37 | 9.8e-167 | 35.48 | Regulator of nonsense transcripts 2 OS=Mus musculus OX=10090 GN=Upf2 PE=1 SV=1
Q9HAU5 | 2.2e-166 | 35.46 | Regulator of nonsense transcripts 2 OS=Homo sapiens OX=9606 GN=UPF2 PE=1 SV=1
O13824 | 9.2e-64 | 24.21 | Nonsense-mediated mRNA decay protein 2 OS=Schizosaccharomyces pombe (strain 972 ...
P38798 | 1.6e-52 | 22.72 | Nonsense-mediated mRNA decay protein 2 OS=Saccharomyces cerevisiae (strain ATCC ...
Match Name | E-value | Identity | Description
A0A1S3CJK4 | 0.0e+00 | 95.06 | LOW QUALITY PROTEIN: regulator of nonsense transcripts UPF2 OS=Cucumis melo OX=3...
A0A0A0KS34 | 0.0e+00 | 95.15 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G497020 PE=4 SV=1
A0A6J1FEV2 | 0.0e+00 | 93.46 | regulator of nonsense transcripts UPF2-like OS=Cucurbita moschata OX=3662 GN=LOC...
A0A6J1JX56 | 0.0e+00 | 93.46 | regulator of nonsense transcripts UPF2-like OS=Cucurbita maxima OX=3661 GN=LOC11...
A0A6J1BVY0 | 0.0e+00 | 92.80 | regulator of nonsense transcripts UPF2 OS=Momordica charantia OX=3673 GN=LOC1110...
Match Name | E-value | Identity | Description
AT2G39260.1 | 0.0e+00 | 69.48 | binding;RNA binding
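
The hit summaries above are pipe-delimited rows (match name, E-value, percent identity, description), so they can be read programmatically. The following is a minimal Python sketch under that assumption. The function name parse_hit_rows and the 30% identity cutoff are illustrative choices, not part of the database or of BLAST itself; the sample rows are copied from the tables above.

```python
def parse_hit_rows(text):
    """Parse 'name | e-value | identity | description' rows into dicts.

    Header rows and any line whose numeric columns fail to parse are skipped.
    Extra trailing columns, if present, are ignored.
    """
    hits = []
    for line in text.splitlines():
        cells = [c.strip() for c in line.split("|")]
        if len(cells) < 4 or cells[0] == "Match Name":
            continue
        try:
            evalue = float(cells[1])    # e.g. "9.8e-167"
            identity = float(cells[2])  # percent identity, e.g. "35.48"
        except ValueError:
            continue
        hits.append({
            "name": cells[0],
            "evalue": evalue,
            "identity": identity,
            "description": cells[3],
        })
    return hits


rows = """Match Name | E-value | Identity | Description
F4IUX6 | 0.0e+00 | 69.48 | Regulator of nonsense transcripts UPF2 OS=Arabidopsis thaliana OX=3702 GN=UPF2 P...
A2AT37 | 9.8e-167 | 35.48 | Regulator of nonsense transcripts 2 OS=Mus musculus OX=10090 GN=Upf2 PE=1 SV=1
P38798 | 1.6e-52 | 22.72 | Nonsense-mediated mRNA decay protein 2 OS=Saccharomyces cerevisiae (strain ATCC ...
"""

hits = parse_hit_rows(rows)

# Hypothetical filter: keep hits at or above 30% identity.
strong = [h["name"] for h in hits if h["identity"] >= 30.0]
print(strong)
```

The identity threshold is only an example; an E-value cutoff could be applied the same way by filtering on h["evalue"].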