Homology
BLAST of HG10020750 vs. NCBI nr
Match:
XP_038894150.1 (nuclear exosome regulator NRDE2 isoform X2 [Benincasa hispida])
HSP 1 Score: 2169.8 bits (5621), Expect = 0.0e+00
Identity = 1109/1165 (95.19%), Positives = 1130/1165 (97.00%), Query Frame = 0
Query: 1 MEAPAEEESSP-EEQNPKTSLFPLSFVANNPQSLSSPPNSSVPQWLCNSSFTTDLSVIND 60
MEAP EE+ SP EEQNPKTSLFPLSFVANNPQS SSPP SSVPQWLCNSSFTTDLSVIND
Sbjct: 1 MEAPTEEKDSPHEEQNPKTSLFPLSFVANNPQSQSSPPTSSVPQWLCNSSFTTDLSVIND 60
Query: 61 ALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRK 120
ALSSQNN + S GDQEEAV DEGGPSDRREVQKPSRSYELLESSASDDDSEH KR+K
Sbjct: 61 ALSSQNNAYPSFPDDGDQEEAVADEGGPSDRREVQKPSRSYELLESSASDDDSEHEKRKK 120
Query: 121 RKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
RKK+RRRRR NESE+RGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNL FGSLY
Sbjct: 121 RKKRRRRRR-NESEDRGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLVFGSLY 180
Query: 181 RMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
RMDVARYRPLNRGE PGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI
Sbjct: 181 RMDVARYRPLNRGETPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
Query: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
ERHKNFKRVRIGFSRKTPDTLLDDFIPLS VQTSNNIEESWEDEVLRKTREFNKLTRE P
Sbjct: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSGVQTSNNIEESWEDEVLRKTREFNKLTREQP 300
Query: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKA+ELNPENEELLLYLLKTYQN
Sbjct: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKASELNPENEELLLYLLKTYQN 360
Query: 361 RDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACN 420
RDNIDVVISRWEKILMQNSGSY+LWREFLHLIQGEFSRFKV+DMRQ+YAHAIQALSAACN
Sbjct: 361 RDNIDVVISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVTDMRQMYAHAIQALSAACN 420
Query: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
QHIRQ +Q AKPSVEHDLI+LELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP
Sbjct: 421 QHIRQTDQTAKPSVEHDLIKLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
Query: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
ALHLNDRSKQRLFEHFWNTDAERVGEEGA+GWSTWLEKEEENRQKAMREEALEADEKGGW
Sbjct: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGALGWSTWLEKEEENRQKAMREEALEADEKGGW 540
Query: 541 TGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADAG 600
TGWSDP PKEKKNSD TETT EMGVAAEETMEEYVEEEDI+REDSTEALLKILGINADAG
Sbjct: 541 TGWSDPAPKEKKNSDATETTIEMGVAAEETMEEYVEEEDIDREDSTEALLKILGINADAG 600
Query: 601 VDEEVKDVSTWARWSKEESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYEDV 660
VDEEVKD STWARWSKEES RDCEQWMPIRGKTADVIHDE M DGET+EQ LRVILYEDV
Sbjct: 601 VDEEVKDASTWARWSKEESLRDCEQWMPIRGKTADVIHDERMGDGETSEQLLRVILYEDV 660
Query: 661 KEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRSV 720
KE+LFSLISSEARLSLIYQLIEFFSGKIYSR+SSNSSSWMERI+SLEVLPDDIL HLRSV
Sbjct: 661 KEYLFSLISSEARLSLIYQLIEFFSGKIYSRASSNSSSWMERIISLEVLPDDILHHLRSV 720
Query: 721 HDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
VLNKRQSSSSSSTLEVLVGGS+NLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA
Sbjct: 721 LHVLNKRQSSSSSSTLEVLVGGSDNLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
Query: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL
Sbjct: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
Query: 841 ASVESLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQP 900
ASVESLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQP
Sbjct: 841 ASVESLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQP 900
Query: 901 SSLELLRAHQGFKEKIREVRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQAF 960
SSL+LLRAHQGFKEKIREVRSTWLHGVIDDSS ALISSAALFEELTTGY AGLEVLDQAF
Sbjct: 901 SSLQLLRAHQGFKEKIREVRSTWLHGVIDDSSVALISSAALFEELTTGYLAGLEVLDQAF 960
Query: 961 SMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFLE 1020
SMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLS LKVR+SISHGLQFYPLNPELYSAFLE
Sbjct: 961 SMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSHLKVRESISHGLQFYPLNPELYSAFLE 1020
Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALGNENLRH 1080
ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKAL NENLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALENENLRH 1080
Query: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLN+VLSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNTVLSAKELSDLQ 1140
Query: 1141 EVMRDKELNLRTDIYEILLQDELVS 1165
EVMRDKELNLRTDIYEILLQDELVS
Sbjct: 1141 EVMRDKELNLRTDIYEILLQDELVS 1164
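The percentages on each hit's statistics line are the identity and positive counts divided by the alignment length. The sketch below recomputes them from the raw counts as a sanity check. It is a minimal illustration, not part of BLAST or of this report: the function name parse_stats and the regular expression are assumptions, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Hypothetical helper (not a BLAST API): matches a statistics line such as
# "Identity = 1109/1165 (95.19%), Positives = 1130/1165 (97.00%), Query Frame = 0".
# "Posi?tives" tolerates the misspelled label "Postives" as well.
STAT_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_stats(line):
    """Return counts and recomputed percentages for one statistics line."""
    match = STAT_RE.search(line)
    if match is None:
        raise ValueError("not a BLAST statistics line: %r" % line)
    identities, ident_len, positives, pos_len = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": ident_len,
        # Percentages are recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * identities / ident_len, 2),
        "positive_pct": round(100 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    line = "Identity = 1109/1165 (95.19%), Positives = 1130/1165 (97.00%), Query Frame = 0"
    print(parse_stats(line))
```

Applied to the statistics line of the first hit, the recomputed values agree with the printed 95.19% identity and 97.00% positives.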
BLAST of HG10020750 vs. NCBI nr
Match:
XP_038894149.1 (nuclear exosome regulator NRDE2 isoform X1 [Benincasa hispida])
HSP 1 Score: 2165.2 bits (5609), Expect = 0.0e+00
Identity = 1109/1166 (95.11%), Positives = 1130/1166 (96.91%), Query Frame = 0
Query: 1 MEAPAEEESSP-EEQNPKTSLFPLSFVANNPQSLSSPPNSSVPQWLCNSSFTTDLSVIND 60
MEAP EE+ SP EEQNPKTSLFPLSFVANNPQS SSPP SSVPQWLCNSSFTTDLSVIND
Sbjct: 1 MEAPTEEKDSPHEEQNPKTSLFPLSFVANNPQSQSSPPTSSVPQWLCNSSFTTDLSVIND 60
Query: 61 ALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRK 120
ALSSQNN + S GDQEEAV DEGGPSDRREVQKPSRSYELLESSASDDDSEH KR+K
Sbjct: 61 ALSSQNNAYPSFPDDGDQEEAVADEGGPSDRREVQKPSRSYELLESSASDDDSEHEKRKK 120
Query: 121 RKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
RKK+RRRRR NESE+RGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNL FGSLY
Sbjct: 121 RKKRRRRRR-NESEDRGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLVFGSLY 180
Query: 181 RMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
RMDVARYRPLNRGE PGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI
Sbjct: 181 RMDVARYRPLNRGETPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
Query: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
ERHKNFKRVRIGFSRKTPDTLLDDFIPLS VQTSNNIEESWEDEVLRKTREFNKLTRE P
Sbjct: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSGVQTSNNIEESWEDEVLRKTREFNKLTREQP 300
Query: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKA+ELNPENEELLLYLLKTYQN
Sbjct: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKASELNPENEELLLYLLKTYQN 360
Query: 361 RDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACN 420
RDNIDVVISRWEKILMQNSGSY+LWREFLHLIQGEFSRFKV+DMRQ+YAHAIQALSAACN
Sbjct: 361 RDNIDVVISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVTDMRQMYAHAIQALSAACN 420
Query: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
QHIRQ +Q AKPSVEHDLI+LELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP
Sbjct: 421 QHIRQTDQTAKPSVEHDLIKLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
Query: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
ALHLNDRSKQRLFEHFWNTDAERVGEEGA+GWSTWLEKEEENRQKAMREEALEADEKGGW
Sbjct: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGALGWSTWLEKEEENRQKAMREEALEADEKGGW 540
Query: 541 TGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADAG 600
TGWSDP PKEKKNSD TETT EMGVAAEETMEEYVEEEDI+REDSTEALLKILGINADAG
Sbjct: 541 TGWSDPAPKEKKNSDATETTIEMGVAAEETMEEYVEEEDIDREDSTEALLKILGINADAG 600
Query: 601 VDEEVKDVSTWARWSKEESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYEDV 660
VDEEVKD STWARWSKEES RDCEQWMPIRGKTADVIHDE M DGET+EQ LRVILYEDV
Sbjct: 601 VDEEVKDASTWARWSKEESLRDCEQWMPIRGKTADVIHDERMGDGETSEQLLRVILYEDV 660
Query: 661 KEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRSV 720
KE+LFSLISSEARLSLIYQLIEFFSGKIYSR+SSNSSSWMERI+SLEVLPDDIL HLRSV
Sbjct: 661 KEYLFSLISSEARLSLIYQLIEFFSGKIYSRASSNSSSWMERIISLEVLPDDILHHLRSV 720
Query: 721 HDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
VLNKRQSSSSSSTLEVLVGGS+NLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA
Sbjct: 721 LHVLNKRQSSSSSSTLEVLVGGSDNLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
Query: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL
Sbjct: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
Query: 841 ASVESLPV-DQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQ 900
ASVESLPV DQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQ
Sbjct: 841 ASVESLPVQDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQ 900
Query: 901 PSSLELLRAHQGFKEKIREVRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQA 960
PSSL+LLRAHQGFKEKIREVRSTWLHGVIDDSS ALISSAALFEELTTGY AGLEVLDQA
Sbjct: 901 PSSLQLLRAHQGFKEKIREVRSTWLHGVIDDSSVALISSAALFEELTTGYLAGLEVLDQA 960
Query: 961 FSMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFL 1020
FSMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLS LKVR+SISHGLQFYPLNPELYSAFL
Sbjct: 961 FSMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSHLKVRESISHGLQFYPLNPELYSAFL 1020
Query: 1021 EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALGNENLR 1080
EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKAL NENLR
Sbjct: 1021 EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALENENLR 1080
Query: 1081 HSVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDL 1140
HSVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLN+VLSAKELSDL
Sbjct: 1081 HSVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNTVLSAKELSDL 1140
Query: 1141 QEVMRDKELNLRTDIYEILLQDELVS 1165
QEVMRDKELNLRTDIYEILLQDELVS
Sbjct: 1141 QEVMRDKELNLRTDIYEILLQDELVS 1165
BLAST of HG10020750 vs. NCBI nr
Match:
XP_038894151.1 (nuclear exosome regulator NRDE2 isoform X3 [Benincasa hispida])
HSP 1 Score: 2159.0 bits (5593), Expect = 0.0e+00
Identity = 1108/1166 (95.03%), Positives = 1129/1166 (96.83%), Query Frame = 0
Query: 1 MEAPAEEESSP-EEQNPKTSLFPLSFVANNPQSLSSPPNSSVPQWLCNSSFTTDLSVIND 60
MEAP EE+ SP EEQNPKTSLFPLSFVANNPQS SSPP SSVPQWLCNSSFTTDLSVIND
Sbjct: 1 MEAPTEEKDSPHEEQNPKTSLFPLSFVANNPQSQSSPPTSSVPQWLCNSSFTTDLSVIND 60
Query: 61 ALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRK 120
ALSSQNN + S GDQEEAV DEGGPSDRREVQKPSRSYELLESSASDDDSEH KR+K
Sbjct: 61 ALSSQNNAYPSFPDDGDQEEAVADEGGPSDRREVQKPSRSYELLESSASDDDSEHEKRKK 120
Query: 121 RKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
RKK+RRRRR NESE+RGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNL FGSLY
Sbjct: 121 RKKRRRRRR-NESEDRGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLVFGSLY 180
Query: 181 RMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
RMDVARYRPLNRGE PGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI
Sbjct: 181 RMDVARYRPLNRGETPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
Query: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
ERHKNFKRVRIGFSRKTPDTLLDDFIPLS VQTSNNIEESWEDEVLRKTREFNKLTRE P
Sbjct: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSGVQTSNNIEESWEDEVLRKTREFNKLTREQP 300
Query: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKA+ELNPENEELLLYLLKTYQN
Sbjct: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKASELNPENEELLLYLLKTYQN 360
Query: 361 RDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACN 420
RDNIDVVISRWEKILMQNSGSY+LWREFLHLIQGEFSRFKV+DMRQ+YAHAIQALSAACN
Sbjct: 361 RDNIDVVISRWEKILMQNSGSYKLWREFLHLIQGEFSRFKVTDMRQMYAHAIQALSAACN 420
Query: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
QHIRQ +Q AKPSVEHDLI+LELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP
Sbjct: 421 QHIRQTDQTAKPSVEHDLIKLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
Query: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
ALHLNDRSKQRLFEHFWNTDAERVGEEGA+GWSTWLEKEEENRQKAMREEALEADEKGGW
Sbjct: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGALGWSTWLEKEEENRQKAMREEALEADEKGGW 540
Query: 541 TGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADAG 600
TGWSDP PKEKKNSD TETT EMGVAAEETMEEYVEEEDI+REDSTEALLKILGINADAG
Sbjct: 541 TGWSDPAPKEKKNSDATETTIEMGVAAEETMEEYVEEEDIDREDSTEALLKILGINADAG 600
Query: 601 VDEEVKDVSTWARWSKEESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYEDV 660
VDEEVKD STWARWSKEES RDCEQWMPIRGKT DVIHDE M DGET+EQ LRVILYEDV
Sbjct: 601 VDEEVKDASTWARWSKEESLRDCEQWMPIRGKT-DVIHDERMGDGETSEQLLRVILYEDV 660
Query: 661 KEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRSV 720
KE+LFSLISSEARLSLIYQLIEFFSGKIYSR+SSNSSSWMERI+SLEVLPDDIL HLRSV
Sbjct: 661 KEYLFSLISSEARLSLIYQLIEFFSGKIYSRASSNSSSWMERIISLEVLPDDILHHLRSV 720
Query: 721 HDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
VLNKRQSSSSSSTLEVLVGGS+NLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA
Sbjct: 721 LHVLNKRQSSSSSSTLEVLVGGSDNLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
Query: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL
Sbjct: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
Query: 841 ASVESLPV-DQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQ 900
ASVESLPV DQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQ
Sbjct: 841 ASVESLPVQDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQ 900
Query: 901 PSSLELLRAHQGFKEKIREVRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQA 960
PSSL+LLRAHQGFKEKIREVRSTWLHGVIDDSS ALISSAALFEELTTGY AGLEVLDQA
Sbjct: 901 PSSLQLLRAHQGFKEKIREVRSTWLHGVIDDSSVALISSAALFEELTTGYLAGLEVLDQA 960
Query: 961 FSMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFL 1020
FSMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLS LKVR+SISHGLQFYPLNPELYSAFL
Sbjct: 961 FSMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSHLKVRESISHGLQFYPLNPELYSAFL 1020
Query: 1021 EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALGNENLR 1080
EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKAL NENLR
Sbjct: 1021 EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALENENLR 1080
Query: 1081 HSVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDL 1140
HSVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLN+VLSAKELSDL
Sbjct: 1081 HSVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNTVLSAKELSDL 1140
Query: 1141 QEVMRDKELNLRTDIYEILLQDELVS 1165
QEVMRDKELNLRTDIYEILLQDELVS
Sbjct: 1141 QEVMRDKELNLRTDIYEILLQDELVS 1164
BLAST of HG10020750 vs. NCBI nr
Match:
XP_011650955.1 (nuclear exosome regulator NRDE2 isoform X2 [Cucumis sativus] >KGN64201.1 hypothetical protein Csa_014370 [Cucumis sativus])
HSP 1 Score: 2140.2 bits (5544), Expect = 0.0e+00
Identity = 1089/1165 (93.48%), Positives = 1123/1165 (96.39%), Query Frame = 0
Query: 1 MEAPAEE-ESSPEEQNPKTSLFPLSFVANNPQSLSSPPNSSVPQWLCNSSFTTDLSVIND 60
MEAP EE ES PEEQNPK SLFPLSFVANNPQ+ S+P SSVPQWLCNSSFTTDL+VIND
Sbjct: 1 MEAPPEEKESPPEEQNPKPSLFPLSFVANNPQTQSNPSTSSVPQWLCNSSFTTDLTVIND 60
Query: 61 ALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRK 120
ALSSQNNVH S SA +QEEAVEDEGGPS RREVQKPSRSYELLESSAS+DDSEH KR+K
Sbjct: 61 ALSSQNNVHPSCSADSEQEEAVEDEGGPSGRREVQKPSRSYELLESSASEDDSEHEKRKK 120
Query: 121 RKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
RKKK+RRRR NESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY
Sbjct: 121 RKKKKRRRRRNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
Query: 181 RMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
RMDVARYRPLNRGE+ G NFHGFSQWNKSSSALDRDADADVLD+KVKSGGRYWSAKNAAI
Sbjct: 181 RMDVARYRPLNRGERHGQNFHGFSQWNKSSSALDRDADADVLDNKVKSGGRYWSAKNAAI 240
Query: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
ERHKNFKRVRIGFS T DTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP
Sbjct: 241 ERHKNFKRVRIGFSSNTSDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
Query: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN
Sbjct: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
Query: 361 RDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACN 420
RDNIDVVI+RWEKIL+QNSGSYRLWREFLHL+QGEFSRFKVSDMRQ+YAHAIQALSAACN
Sbjct: 361 RDNIDVVINRWEKILLQNSGSYRLWREFLHLMQGEFSRFKVSDMRQMYAHAIQALSAACN 420
Query: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
QHIRQANQI KPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP
Sbjct: 421 QHIRQANQIGKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
Query: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
ALHLNDR+KQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREE LEADEKGGW
Sbjct: 481 ALHLNDRNKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEVLEADEKGGW 540
Query: 541 TGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADAG 600
TGW +P PKE KNSDGT TTAEM VAAEETMEEYV EEDIEREDSTEALLKILGIN DAG
Sbjct: 541 TGWFNPAPKENKNSDGTGTTAEMDVAAEETMEEYV-EEDIEREDSTEALLKILGINTDAG 600
Query: 601 VDEEVKDVSTWARWSKEESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYEDV 660
VDEEVKD STWARWSKEESSRD EQWMP+R +T DVIHDEGMPDGETNEQ LRVILYEDV
Sbjct: 601 VDEEVKDASTWARWSKEESSRDSEQWMPVRERTVDVIHDEGMPDGETNEQLLRVILYEDV 660
Query: 661 KEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRSV 720
KE+LFSL+SSEARLSLIYQLIEFFSGKIYSR+SSN+SSWMERILSLEVLPDDI+ HLRSV
Sbjct: 661 KEYLFSLVSSEARLSLIYQLIEFFSGKIYSRASSNNSSWMERILSLEVLPDDIVHHLRSV 720
Query: 721 HDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
HDVLNKRQSSSSSS++EVL+G S+NLSQMS+MMKFLRN ILLCLTAFPRNYILEEAALIA
Sbjct: 721 HDVLNKRQSSSSSSSMEVLIGSSDNLSQMSEMMKFLRNTILLCLTAFPRNYILEEAALIA 780
Query: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
EELFVTKMNSCSSSVTPCRSLAK+LLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL
Sbjct: 781 EELFVTKMNSCSSSVTPCRSLAKSLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
Query: 841 ASVESLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQP 900
ASVESLPVDQKSNAPLLYFWYAELEL NDH+NGHNS NRAVHILSCLGSGT YSPFKCQP
Sbjct: 841 ASVESLPVDQKSNAPLLYFWYAELELVNDHNNGHNSSNRAVHILSCLGSGTTYSPFKCQP 900
Query: 901 SSLELLRAHQGFKEKIREVRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQAF 960
SSL+LLRAHQGFKEKIREVRSTWLHGVIDDSS ALISSAALFEELTTGYNAGLEVLDQAF
Sbjct: 901 SSLQLLRAHQGFKEKIREVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960
Query: 961 SMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFLE 1020
SMVLPERRKQSYQLE+LFNYYVKML RHHKQLSQLKVR+SI+HGLQFYPLNPELYSAFLE
Sbjct: 961 SMVLPERRKQSYQLEHLFNYYVKMLQRHHKQLSQLKVRESITHGLQFYPLNPELYSAFLE 1020
Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALGNENLRH 1080
ISYIYSVPSKLRWTFDD+CQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKAL NENLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDFCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALENENLRH 1080
Query: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
Query: 1141 EVMRDKELNLRTDIYEILLQDELVS 1165
EVMRDKELNLRTDIYEILLQDELVS
Sbjct: 1141 EVMRDKELNLRTDIYEILLQDELVS 1164
BLAST of HG10020750 vs. NCBI nr
Match:
XP_008467185.1 (PREDICTED: protein NRDE2 homolog isoform X1 [Cucumis melo] >TYJ99059.1 protein NRDE2-like protein isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 2139.8 bits (5543), Expect = 0.0e+00
Identity = 1090/1165 (93.56%), Positives = 1126/1165 (96.65%), Query Frame = 0
Query: 1 MEAPAEE-ESSPEEQNPKTSLFPLSFVANNPQSLSSPPNSSVPQWLCNSSFTTDLSVIND 60
MEAPAEE ES P EQNPK SLFPLSFV+NNPQ+ S+P NSSVPQWLCNSSFT+DLSVIND
Sbjct: 1 MEAPAEEKESPPAEQNPKPSLFPLSFVSNNPQTQSNPSNSSVPQWLCNSSFTSDLSVIND 60
Query: 61 ALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRK 120
ALSSQ+NV+ S SA +QEEAVEDEGGPSDRREVQKPSRSYELLESSAS+DDSEH KR+K
Sbjct: 61 ALSSQSNVYPSCSADSEQEEAVEDEGGPSDRREVQKPSRSYELLESSASEDDSEHEKRKK 120
Query: 121 RKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
RKKK++RRR NESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY
Sbjct: 121 RKKKKKRRRRNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
Query: 181 RMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
RMDVARYRPLNRGE+ G NFHGFSQWNKSSSALDRDADADVLD+KVKSGGRYWSAKNAAI
Sbjct: 181 RMDVARYRPLNRGERRGQNFHGFSQWNKSSSALDRDADADVLDNKVKSGGRYWSAKNAAI 240
Query: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
ERHKNFKRVRIGFSR T DTLLDDFIPLSDVQTS+NIEESWEDEVLRKTREFNKLTREHP
Sbjct: 241 ERHKNFKRVRIGFSRNTSDTLLDDFIPLSDVQTSSNIEESWEDEVLRKTREFNKLTREHP 300
Query: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
HDEKAWLAFAEFQDKVAAM+PQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN
Sbjct: 301 HDEKAWLAFAEFQDKVAAMEPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
Query: 361 RDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACN 420
RDNIDVVISRWEKIL+QNSGSYRLWREFLHL+QGEFS+FKVSDMRQ+YAHAIQALSAACN
Sbjct: 361 RDNIDVVISRWEKILLQNSGSYRLWREFLHLMQGEFSKFKVSDMRQMYAHAIQALSAACN 420
Query: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP
Sbjct: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
Query: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
ALHLNDRSKQRLFEHFWNT+AERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW
Sbjct: 481 ALHLNDRSKQRLFEHFWNTNAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
Query: 541 TGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADAG 600
+GW DP PKE KNSDGT TTAEM VAAEET+E YV EEDIEREDSTEALLKILGIN DAG
Sbjct: 541 SGWFDPAPKENKNSDGTGTTAEMDVAAEETVEGYV-EEDIEREDSTEALLKILGINTDAG 600
Query: 601 VDEEVKDVSTWARWSKEESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYEDV 660
VDEEVKD STWARWSKEESSRD EQWMP+R +TADVIHDEGMPDGETNEQ LRVILYEDV
Sbjct: 601 VDEEVKDASTWARWSKEESSRDSEQWMPVRERTADVIHDEGMPDGETNEQLLRVILYEDV 660
Query: 661 KEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRSV 720
KE+LFSL+SSEARLSLIYQLIEFFSGKIYSR+SSNSSSWMERILSLEVLPDDIL HLRSV
Sbjct: 661 KEYLFSLVSSEARLSLIYQLIEFFSGKIYSRASSNSSSWMERILSLEVLPDDILHHLRSV 720
Query: 721 HDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
HDVLNKRQ SSSSST+EVL+G S+NLSQMSDMMKFLRN ILLCLTAFPRNYILEEAALIA
Sbjct: 721 HDVLNKRQISSSSSTMEVLIGSSDNLSQMSDMMKFLRNTILLCLTAFPRNYILEEAALIA 780
Query: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL
Sbjct: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
Query: 841 ASVESLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQP 900
ASVESLPVDQKSNAPLLYFWYAELEL NDH+NGHNS NRAVHILSCLGSGT YSPFKCQP
Sbjct: 841 ASVESLPVDQKSNAPLLYFWYAELELVNDHNNGHNSSNRAVHILSCLGSGTTYSPFKCQP 900
Query: 901 SSLELLRAHQGFKEKIREVRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQAF 960
SSL+LLRAHQGFK+KIREVRSTWLHGVIDDSS ALISSAALFEELTTGYNAGLEVL QAF
Sbjct: 901 SSLQLLRAHQGFKDKIREVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLHQAF 960
Query: 961 SMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFLE 1020
SMVLPERRKQSYQLE+LFNYYVKML RHHKQLSQLKVR+SI+HGLQFYPLNPELYSAFLE
Sbjct: 961 SMVLPERRKQSYQLEHLFNYYVKMLRRHHKQLSQLKVRESITHGLQFYPLNPELYSAFLE 1020
Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALGNENLRH 1080
ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKAL NENLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALENENLRH 1080
Query: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
Query: 1141 EVMRDKELNLRTDIYEILLQDELVS 1165
EVMRDKELNLRTDIYEILLQDELVS
Sbjct: 1141 EVMRDKELNLRTDIYEILLQDELVS 1164
BLAST of HG10020750 vs. ExPASy Swiss-Prot
Match:
Q80XC6 (Nuclear exosome regulator NRDE2 OS=Mus musculus OX=10090 GN=Nrde2 PE=1 SV=3)
HSP 1 Score: 250.8 bits (639), Expect = 7.8e-65
Identity = 296/1200 (24.67%), Positives = 490/1200 (40.83%), Query Frame = 0
Query: 55 SVINDALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEH 114
S + LS ++N ++ +++ +++ + +K R +E L SS S+ D+E
Sbjct: 64 SPLKSELSGESNTSEKLAQTSRKKK--KEKKKRRKHQHHRKTKRRHEQLSSSGSESDTEA 123
Query: 115 GKRRKRKKKRRRRRGNESEERG--GFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDN 174
GK R + R ++ E +G + W + + + D D N
Sbjct: 124 GKDRASRSIRDDQKEAEKPCQGSNAAAAVAAAAGHRSIWLEDIHDLTDVFRTDKKPDPAN 183
Query: 175 LAFGSLYRMDVARYRPLN------RGEKPGLNFHGFSQWNKSSSA-LDRDADADVLDSKV 234
+ SLYR D+ARY+ +K +++ G S K S L+R +
Sbjct: 184 WEYKSLYRGDIARYKRKGDSCLGINPKKQCISWEGASAAKKHSHRHLERYFTKKNVGLMR 243
Query: 235 KSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSN----------- 294
G S A F V+ TP T + + + D T+
Sbjct: 244 TEGIAVCSNPEPASSEPVTFIPVKDSAEAATPVTSWLNPLGIYDQSTTQWLQGQGPAEQE 303
Query: 295 ----NIEESWEDEVLR-KTREFNKLTREHPHDEKAWLAFAEFQDKV-----------AAM 354
+ ++ E+ L+ + EFN+ RE+P D + W+AF FQD+V
Sbjct: 304 SKQPDSQQDRENAALKARVEEFNRRVRENPWDTQLWMAFVAFQDEVMRSPGIYALGEGEQ 363
Query: 355 QPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQNRDNIDVVISRWEKILMQNS 414
+ + + LEKK+++LE+A E NP + EL L L+ + W+K+L +
Sbjct: 364 EKHRKSLKLLLEKKLAVLERAIESNPGSVELKLAKLQLCSEFWEPSALAKEWQKLLFLHP 423
Query: 415 GSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACNQHIRQANQIAKPSVEHDLI 474
+ LW+ +L Q +F F VS + LY + LSA ++ + ++ P L
Sbjct: 424 NNTSLWQRYLSFCQSQFGTFSVSKLHSLYGKCLSTLSA-----VKDGSMLSHPV----LP 483
Query: 475 QLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP--ALHLNDRSKQRLFEHFW 534
E + +F+ C F QAG+ E +LFQA ++F+ F P L + + FE FW
Sbjct: 484 GTEEAMFGLFLQQCHFLRQAGHSEKVISLFQAMVDFTFFKPDSVKELPTKVQVEFFEPFW 543
Query: 535 NTDAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGWTGWSDPPPKEKKNSDGT 594
++ RVGE+GA GW W+ ++ E+GGW
Sbjct: 544 DSGEPRVGEKGARGWRAWMHQQ----------------ERGGWV---------------- 603
Query: 595 ETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADAGVDEEVKDVSTWARWSKE 654
+ + +E EEED E +D T + W W
Sbjct: 604 -------LITPDEDDEEPEEEDQEIKDKT---------------------LPRWQIWLAV 663
Query: 655 ESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYEDVKEFLFSLISSEARLSLI 714
E SRD W P R +E D E R +L++D+ + L L S + + LI
Sbjct: 664 ERSRDQRHWRPWRPDKTKKQTEEDCEDPE------RQVLFDDIGQSLIRLSSPDLQFQLI 723
Query: 715 YQLIEFF---SGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRSVHDVLNKRQSSSSSS 774
++F SG + S + I E+ + L + + S
Sbjct: 724 QAFLQFLGVPSGFLPPASCLYLAMDESSIFESELYDEKPLTYFNPSFSGI------SCVG 783
Query: 775 TLEVLVGGSENLSQMSDMMKFLRNAILLCL--------TAFPRNYILEEAALIAEELFVT 834
++E L + +F+RN L L + +++ E A + L T
Sbjct: 784 SMEQLGRPRWTKGHNREGEEFVRNVFHLVLPLLAGKQKSQLSLSWLRYEIAKVIWCLH-T 843
Query: 835 KMNSCSSSVTPCRSLAKNLLK--SDRQDMLLCGVYARREATYGNIDHARKVFDMALASVE 894
K S C+ LAKNLLK +R + L YA E GN + ARKVFD AL+
Sbjct: 844 KKKRLKSQGKSCKKLAKNLLKEPENRNNFCLWKQYAHLEWLLGNTEDARKVFDTALSMAG 903
Query: 895 SLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQPSSLE 954
S + + L YAELE+ + + RAVHIL+ L + Y P+ Q SS +
Sbjct: 904 SSELKDRELCELSLL-YAELEMELSPDSRGATTGRAVHILTRLTESSPYGPYTGQVSSTQ 963
Query: 955 LLRAHQGFKEKIRE-----VRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQA 1014
+L+A + ++ +++ S+ D +L+ LF+ LT G +A +++ +
Sbjct: 964 VLKARKAYELALQDCLGQSCASSPAPAEALDCLGSLVRCFMLFQYLTVGIDAAVQIYGRV 1023
Query: 1015 FSMVL---------PERRKQSYQLEYLFNYYVKM---LLRHHKQL---SQLKVRQSISHG 1074
F+ + PE S L + M LLR H + +R+++S
Sbjct: 1024 FAKLKGSARLEDPGPEDSTSSQSLTNVLEAVSMMHTSLLRFHMNVCVYPLAPLRETLSDA 1083
Query: 1075 LQFYPLNPELYSAFLEISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFE--------- 1134
L+ YP N L+ A+++I +K R FD + L W+FA+ E
Sbjct: 1084 LKLYPGNQVLWRAYVQIQNKSHSANKTRRFFDTVTRSAKHLEPWLFAIEAEKLRKKLVES 1143
Query: 1135 ------------MGYGGSLHRIRRLFEKALGNENLRHSVLLWRCYISYELNTACDPSSAR 1161
+ G HRIR LFE A+ ++ LLWR Y+++ L + + ++
Sbjct: 1144 VQRVGGREVHATIPETGLTHRIRALFENAIRSDKGNQCPLLWRMYLNF-LVSLGNKERSK 1172
BLAST of HG10020750 vs. ExPASy Swiss-Prot
Match:
Q9H7Z3 (Nuclear exosome regulator NRDE2 OS=Homo sapiens OX=9606 GN=NRDE2 PE=1 SV=3)
HSP 1 Score: 243.0 bits (619), Expect = 1.6e-62
Identity = 307/1241 (24.74%), Positives = 510/1241 (41.10%), Query Frame = 0
Query: 59 DALSSQNNVHSSISAGGDQEEAVE---DEGGPSDRREVQKPSR----SYELLESSASDDD 118
D LS+ + SI++ Q EA EG P R ++ S + + L+ ++
Sbjct: 24 DWLSNPSFCVGSITSLSQQTEAAPAHVSEGLPLTRSHLKSESSDESDTNKKLKQTSRKKK 83
Query: 119 SEHGKRRKRK--KKRRRRRGNESEERG-----------GFGEYGSRKSDV------RAWA 178
E K+RK + KK +R+ G S R G GS+K A A
Sbjct: 84 KEKKKKRKHQHHKKTKRKHGPSSSSRSETDTDSEKDKPSRGVGGSKKESEEPNQGNNAAA 143
Query: 179 DADGR----------PSKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNRGEKPGLNFHGF 238
D R + + D D N + SLYR D+ARY+ + G + G
Sbjct: 144 DTGHRFVWLEDIQAVTGETFRTDKKPDPANWEYKSLYRGDIARYK------RKGDSCLGI 203
Query: 239 SQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDTLLD 298
N + + + K RY++ K+ + N V I + P +
Sbjct: 204 ---NPKKQCISWEGTSTEKKHSRKQVERYFTKKSVGL---MNIDGVAISSKTEPPSSEPI 263
Query: 299 DFIPLSDVQTSNNI-------------------------EESWEDE---------VLRKT 358
FIP+ D++ + + +ES + + + K
Sbjct: 264 SFIPVKDLEDAAPVTTWLNPLGIYDQSTTHWLQGQGPPEQESKQPDAQPDSESAALKAKV 323
Query: 359 REFNKLTREHPHDEKAWLAFAEFQDKV-----------AAMQPQKGARLQTLEKKISILE 418
EFN+ RE+P D + W+AF FQD+V + +K + LEKK++ILE
Sbjct: 324 EEFNRRVRENPRDTQLWMAFVAFQDEVMKSPGLYAIEEGEQEKRKRSLKLILEKKLAILE 383
Query: 419 KAAELNPENEELLLYLLKTYQNRDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSR 478
+A E N + +L L LK ++ W+K++ + + LW+++L Q +FS
Sbjct: 384 RAIESNQSSVDLKLAKLKLCTEFWEPSTLVKEWQKLIFLHPNNTALWQKYLLFCQSQFST 443
Query: 479 FKVSDMRQLYAHAIQALSAACNQHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQ 538
F +S + LY + LSA ++ + ++ P+ L E + +F+ C F Q
Sbjct: 444 FSISKIHSLYGKCLSTLSA-----VKDGSILSHPA----LPGTEEAMFALFLQQCHFLRQ 503
Query: 539 AGYQELATALFQAEIEFSLFCP--ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWL 598
AG+ E A +LFQA ++F+ F P L + + FE FW++ R GE+GA GW W+
Sbjct: 504 AGHSEKAISLFQAMVDFTFFKPDSVKDLPTKGQVEFFEPFWDSGEPRAGEKGARGWKAWM 563
Query: 599 EKEEENRQKAMREEALEADEKGGWTGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVE 658
++ E+GGW N D E
Sbjct: 564 HQQ----------------ERGGWV---------VINPD--------------------E 623
Query: 659 EEDIEREDSTEALLKILGINADAGVDEEVKD--VSTWARWSKEESSRDCEQWMPIRGKTA 718
++D ED D+E+KD + W W E SRD W P R
Sbjct: 624 DDDEPEED-----------------DQEIKDKTLPRWQIWLAAERSRDQRHWRPWRPDKT 683
Query: 719 DVIHDEGMPDGETNEQFLRVILYEDVKEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSS 778
+E D E R +L++D+ + L L S + + L+ ++F +
Sbjct: 684 KKQTEEDCEDPE------RQVLFDDIGQSLIRLSSHDLQFQLVEAFLQFLG---VPSGFT 743
Query: 779 NSSSWMERILSLEVLPDDILRHLRSVHDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMK 838
+S + + + D+ L + + +S ++ L Q + +
Sbjct: 744 PPASCLYLAMDENSIFDNGLYDEKPLTFFNPLFSGASCVGRMDRLGYPRWTRGQNREGEE 803
Query: 839 FLRNAILLCLTAFPR--------NYILEEAALIAEELFVTKMNSCSSSVTPCRSLAKNLL 898
F+RN L + F +++ E A + L S C+ LAKNLL
Sbjct: 804 FIRNVFHLVMPLFSGKEKSQLCFSWLQYEIAKVIWCLHTKNKKRLKSQGKNCKKLAKNLL 863
Query: 899 KSDR--QDMLLCGVYARREATYGNIDHARKVFDMALASVESLPVDQKSNAPLLYFWYAEL 958
K + L YA E GN + ARKVFD AL S + + S+ L YAEL
Sbjct: 864 KEPENCNNFCLWKQYAHLEWLLGNTEDARKVFDTALGMAGSREL-KDSDLCELSLLYAEL 923
Query: 959 ELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQPSSLELLRAHQGFKEKIREV--RST 1018
E+ + RAVHIL+ L + Y P+ Q ++ +L+A + ++ +++ S
Sbjct: 924 EVELSPEVRRAATARAVHILTKLTESSPYGPYTGQVLAVHILKARKAYEHALQDCLGDSC 983
Query: 1019 WLHGVIDDSSAALISSA---ALFEELTTGYNAGLEVLDQAF----SMVLPE-------RR 1078
+ DS + LIS A LF+ LT G +A +++ +Q F S V PE
Sbjct: 984 VSNPAPTDSCSRLISLAKCFMLFQYLTIGIDAAVQIYEQVFAKLNSSVFPEGSGEGDSAS 1043
Query: 1079 KQSYQ--LEYLFNYYVKMLLRHHKQLS---QLKVRQSISHGLQFYPLNPELYSAFLEISY 1138
QS+ LE + + LLR H ++S +R+++S L+ YP N L+ ++++I
Sbjct: 1044 SQSWTSVLEAITLMHTS-LLRFHMKVSVYPLAPLREALSQALKLYPGNQVLWRSYVQIQN 1103
Query: 1139 IYSVPSKLRWTFDDYCQKQPSLILWIFALSFE---------------------MGYGGSL 1161
SK R FD + L W+FA+ E + G +
Sbjct: 1104 KSHSASKTRRFFDTITRSAKPLEPWLFAIEAEKLRKRLVETVQRLDGREIHATIPETGLM 1163
BLAST of HG10020750 vs. ExPASy Swiss-Prot
Match:
Q54QP0 (Nuclear exosome regulator NRDE2 OS=Dictyostelium discoideum OX=44689 GN=nrde2 PE=3 SV=1)
HSP 1 Score: 143.7 bits (361), Expect = 1.3e-32
Identity = 255/1273 (20.03%), Positives = 504/1273 (39.59%), Query Frame = 0
Query: 27 ANNPQSLSSPPNSSVPQWLCNSSFTTD--------LSVINDALSSQNNVHSSISAGGDQE 86
++N S PP+SS P + D S I+ + ++ + S+ DQ+
Sbjct: 77 SDNTFSSPPPPSSSPPLYSKTEKKNNDKVKIIMKKRSFIDSSSDDNSDDDDNDSSSSDQD 136
Query: 87 EAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRKRKKKRRRRRGNESEERGGF 146
+ +D GG + R+ K + + + ++++E R+K KKK+R+ + +++++
Sbjct: 137 SSDDDSGGFTYNRKKYKKEQQQ---QENEENEENERKNRKKEKKKKRKDKKFKNDDKSMM 196
Query: 147 ---GEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNRGEKP 206
E SD ++ + D F S N + + + + ++ Y+ + +K
Sbjct: 197 IISNENSENYSDNSSYFI---EKTGDKVFSSRTSTPNYNYDNSFILGMSDYK-IGFSKKE 256
Query: 207 GLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAIERH--KNFKRVRIGFS 266
G S + + ++ S S +R + ++V+ +
Sbjct: 257 GYQIEPISLTSFNKQQINNRYFTKPSSSSSSSSQSQQQLITVITKRKEIEEIEKVKPISN 316
Query: 267 RKTPDTLLDDFIPL----------SDVQTSNNIEESWEDEVLRKTREFNKLTREHPHDEK 326
K P DD I L D ++ E+ E + L+K E NKL ++P++ +
Sbjct: 317 IKDPSKSNDDEIKLIVLNENNHDNDDDDNDDDDNETLERKTLKKNSELNKLVEQYPNNIE 376
Query: 327 AWLAFAEFQDKVAAMQPQ-KGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQNRDN 386
W+ +FQ+ ++ EK++SI + NP++E L + LK +
Sbjct: 377 YWIDLVKFQENFQQFSRNVNKSKTSMYEKQLSIYRNSLLHNPDSEILTIEYLKLASKLWD 436
Query: 387 IDVVISRWEKILMQNSG-------SYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALS 446
V+ W K+L +S S +LW+E++ F+ FK+ +++ I+ +
Sbjct: 437 QQKVLDLWNKVLSSSSSSSSSSIISEKLWKEYIEFCLSNFNDFKIEKIKETIITIIRKML 496
Query: 447 AACNQHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFS 506
R++ ++ + ++ LE ++ L + QAG+ E ++Q+ IEF+
Sbjct: 497 VK-----RRSFKVKDYNFMENISNLEESILQFISQLSKLLNQAGFSERVIGIYQSLIEFN 556
Query: 507 LFCPALHLNDRSKQRL--FEHFWNT-DAERVGEEGAVGWSTWL------------EKEEE 566
F P N+ L F+ +W++ D ++G ++GWS K
Sbjct: 557 CFEPIQLSNETQATLLKEFKSYWSSLDYPKIGNPNSIGWSKSFTILLNNSINNNNNKNNN 616
Query: 567 NRQKAMREEALEADEKGGWTGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIE 626
N + ++ N++ + + E +E+ ++E++ +
Sbjct: 617 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNMDLDNLDNLSIEEIEKLLKEQEDQ 676
Query: 627 REDSTEALLKILGINADAGVD----------EEVKDVST-----------WARWSKEESS 686
E + I + D D EE +D + + W K+E
Sbjct: 677 ENQDNENIFNITHKSKDLNEDDDNENNNNNQEEQEDNDSNSNDNDNNNNKFNTWGKKEIE 736
Query: 687 RDCEQWMPIRGKTADVIHDEGMPDGETNEQFL-RVILYEDVKEFLFSLISSEARLSLIYQ 746
D +W P+ I++ + E NE RV+L+ D E LF + E +L L++Q
Sbjct: 737 LDELKWKPLD------INNNLEVNKEVNENDTERVVLFNDFYELLFRFVKEENKLELVFQ 796
Query: 747 LIEFFSGKI----------YS-----RSSSNSSSWMERILSL-------EVLPDDILRHL 806
+EF I YS R S +S E I+SL + P
Sbjct: 797 FLEFLGVPISLLDDKIQPRYSFYHPQRRDSINSIHNENIISLLFKDLKQQPSPPSPSPEY 856
Query: 807 RSVHDVLNKRQSSSSSSTLEVLVGGSENLSQMS-DMMKFLRNAILLCLTAFPRNYILEEA 866
+ +K +++++ S+NL +S D +KF+ + L L
Sbjct: 857 PNWFKTFDKFSNNNNN---------SQNLLGLSDDKIKFIDSIYKLILEN-------SNG 916
Query: 867 ALIAEELFVTK-MNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKV 926
+ E+L+V+ M S + + K+L + + +++ ++A E G AR +
Sbjct: 917 IKLKEKLYVSYIMFKASIDINDAKVYTKSLCEKFK-NLIYFDIFASLELKSGKTQQARTI 976
Query: 927 FDMA------LASVESLPVDQKSNAPLLYFWYAELEL-----------------ANDHHN 986
+ L + ++ Q+ L+Y Y +EL +H
Sbjct: 977 YQTTCFYINQLINQQAQQQQQQLQIDLVYREYLFMELNLIYQTIEKDPQILKRFIKSNHK 1036
Query: 987 GHNSLNRAVHILSCLGSGT----AYSPFKCQPSSLELLRAHQGFKEKIREVRSTWLHGVI 1046
+HIL C G + S F + L + + F +K+++ +
Sbjct: 1037 PIELFFTPLHILQCYLDGNYKQYSSSTFNLNTINQFLNQLNLKFLQKLQQQQQQQQQNSS 1096
Query: 1047 DDSSAALISSAA---------------LFEELTTGYNAGLEVLDQAFSMVLPERRK-QSY 1106
SS++ SS++ +FE L+ G++ L + + S + K S
Sbjct: 1097 SSSSSSSSSSSSSSSSSSSVDFLLCYCIFELLSNGFDGFLILFKRITSSSTNDYLKIFSI 1156
Query: 1107 QLEYLFNYYVKMLLRHHKQL--SQLKVRQSISHGLQFYPLNPELYSAFLEISYIYSVPSK 1151
Q E L + M+ + + +++ I L Y +P+L S FL + ++
Sbjct: 1157 QHELLTIRCIDMVTKIAPLIGTDPKRIKNLIIDSLNQYYDHPKLLSLFLNWESKNQLINR 1216
BLAST of HG10020750 vs. ExPASy Swiss-Prot
Match:
O42975 (Protein NRDE2 homolog OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=SPBC20F10.05 PE=1 SV=1)
HSP 1 Score: 75.1 bits (183), Expect = 5.9e-12
Identity = 80/363 (22.04%), Positives = 164/363 (45.18%), Query Frame = 0
Query: 191 RGEKP----GLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAIERHKNFK 250
+GEK G+N ++++SSS++ A + + K G K+ I+ +
Sbjct: 63 KGEKQNLLYGINKRPVPKYHRSSSSVYGSAPLLRIVKESKEGITLNKKKSLEIK----YD 122
Query: 251 RVRIGFSRKTPDTLLDD----FIPLSDVQTSNNIEES-WEDEVLRKTREFNKLTREHPHD 310
R ++ ++ +D FIPL + S+ E+S + +L+ +E ++ +++P
Sbjct: 123 EERSFDEKENDESEFEDGQQGFIPLLVNRNSDPSEKSTFSLNILKAIKETDEEIKKNPGK 182
Query: 311 EKAWLAFAEFQDKVAAMQPQKG----------ARLQTLEKKISILEKAAE--LNPENEEL 370
+ W+ E+Q+++ + ++ + K+SILEKA + ++E L
Sbjct: 183 ARLWIKMCEYQERLLFDEFRRSNSDDIKGKLKIENNSRSVKLSILEKALKEVKGCDHEIL 242
Query: 371 LLYLLKTYQNRDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAH 430
+ Y L+ + + ++E++L+++ G LW ++ G S F +D +++
Sbjct: 243 VSYYLQLGSEEWSKEETNQKFEEVLIEHPGYLNLWMKYAEYFTG-ISEFTFNDCLNMFSK 302
Query: 431 AIQALSAACNQHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQ 490
+ L + R++ + + + ++E ++ + + LC F GY ELA ++FQ
Sbjct: 303 CFKFLKQKLSD--RKSCKERESTDVTSNFEVEEAILHLLIRLCDFLKNCGYYELAWSIFQ 362
Query: 491 AEIEFSLFCPALHLNDRSKQRLFE---HFWNTDAERVGEEGAVGWSTWLEKEEENRQKAM 530
A +E F P +L + FE FWN+D + EE A GW L+ E + +
Sbjct: 363 ANMELCYFYPR-YLEKKLDSTFFESFSKFWNSDTPKFSEENARGWCNVLDDESSQQNQNF 417
BLAST of HG10020750 vs. ExPASy TrEMBL
Match:
A0A0A0LVY4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043050 PE=3 SV=1)
HSP 1 Score: 2140.2 bits (5544), Expect = 0.0e+00
Identity = 1089/1165 (93.48%), Positives = 1123/1165 (96.39%), Query Frame = 0
Query: 1 MEAPAEE-ESSPEEQNPKTSLFPLSFVANNPQSLSSPPNSSVPQWLCNSSFTTDLSVIND 60
MEAP EE ES PEEQNPK SLFPLSFVANNPQ+ S+P SSVPQWLCNSSFTTDL+VIND
Sbjct: 1 MEAPPEEKESPPEEQNPKPSLFPLSFVANNPQTQSNPSTSSVPQWLCNSSFTTDLTVIND 60
Query: 61 ALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRK 120
ALSSQNNVH S SA +QEEAVEDEGGPS RREVQKPSRSYELLESSAS+DDSEH KR+K
Sbjct: 61 ALSSQNNVHPSCSADSEQEEAVEDEGGPSGRREVQKPSRSYELLESSASEDDSEHEKRKK 120
Query: 121 RKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
RKKK+RRRR NESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY
Sbjct: 121 RKKKKRRRRRNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
Query: 181 RMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
RMDVARYRPLNRGE+ G NFHGFSQWNKSSSALDRDADADVLD+KVKSGGRYWSAKNAAI
Sbjct: 181 RMDVARYRPLNRGERHGQNFHGFSQWNKSSSALDRDADADVLDNKVKSGGRYWSAKNAAI 240
Query: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
ERHKNFKRVRIGFS T DTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP
Sbjct: 241 ERHKNFKRVRIGFSSNTSDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
Query: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN
Sbjct: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
Query: 361 RDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACN 420
RDNIDVVI+RWEKIL+QNSGSYRLWREFLHL+QGEFSRFKVSDMRQ+YAHAIQALSAACN
Sbjct: 361 RDNIDVVINRWEKILLQNSGSYRLWREFLHLMQGEFSRFKVSDMRQMYAHAIQALSAACN 420
Query: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
QHIRQANQI KPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP
Sbjct: 421 QHIRQANQIGKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
Query: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
ALHLNDR+KQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREE LEADEKGGW
Sbjct: 481 ALHLNDRNKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEVLEADEKGGW 540
Query: 541 TGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADAG 600
TGW +P PKE KNSDGT TTAEM VAAEETMEEYV EEDIEREDSTEALLKILGIN DAG
Sbjct: 541 TGWFNPAPKENKNSDGTGTTAEMDVAAEETMEEYV-EEDIEREDSTEALLKILGINTDAG 600
Query: 601 VDEEVKDVSTWARWSKEESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYEDV 660
VDEEVKD STWARWSKEESSRD EQWMP+R +T DVIHDEGMPDGETNEQ LRVILYEDV
Sbjct: 601 VDEEVKDASTWARWSKEESSRDSEQWMPVRERTVDVIHDEGMPDGETNEQLLRVILYEDV 660
Query: 661 KEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRSV 720
KE+LFSL+SSEARLSLIYQLIEFFSGKIYSR+SSN+SSWMERILSLEVLPDDI+ HLRSV
Sbjct: 661 KEYLFSLVSSEARLSLIYQLIEFFSGKIYSRASSNNSSWMERILSLEVLPDDIVHHLRSV 720
Query: 721 HDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
HDVLNKRQSSSSSS++EVL+G S+NLSQMS+MMKFLRN ILLCLTAFPRNYILEEAALIA
Sbjct: 721 HDVLNKRQSSSSSSSMEVLIGSSDNLSQMSEMMKFLRNTILLCLTAFPRNYILEEAALIA 780
Query: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
EELFVTKMNSCSSSVTPCRSLAK+LLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL
Sbjct: 781 EELFVTKMNSCSSSVTPCRSLAKSLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
Query: 841 ASVESLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQP 900
ASVESLPVDQKSNAPLLYFWYAELEL NDH+NGHNS NRAVHILSCLGSGT YSPFKCQP
Sbjct: 841 ASVESLPVDQKSNAPLLYFWYAELELVNDHNNGHNSSNRAVHILSCLGSGTTYSPFKCQP 900
Query: 901 SSLELLRAHQGFKEKIREVRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQAF 960
SSL+LLRAHQGFKEKIREVRSTWLHGVIDDSS ALISSAALFEELTTGYNAGLEVLDQAF
Sbjct: 901 SSLQLLRAHQGFKEKIREVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQAF 960
Query: 961 SMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFLE 1020
SMVLPERRKQSYQLE+LFNYYVKML RHHKQLSQLKVR+SI+HGLQFYPLNPELYSAFLE
Sbjct: 961 SMVLPERRKQSYQLEHLFNYYVKMLQRHHKQLSQLKVRESITHGLQFYPLNPELYSAFLE 1020
Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALGNENLRH 1080
ISYIYSVPSKLRWTFDD+CQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKAL NENLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDFCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALENENLRH 1080
Query: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
Query: 1141 EVMRDKELNLRTDIYEILLQDELVS 1165
EVMRDKELNLRTDIYEILLQDELVS
Sbjct: 1141 EVMRDKELNLRTDIYEILLQDELVS 1164
BLAST of HG10020750 vs. ExPASy TrEMBL
Match:
A0A5D3BJ75 (Protein NRDE2-like protein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G002630 PE=3 SV=1)
HSP 1 Score: 2139.8 bits (5543), Expect = 0.0e+00
Identity = 1090/1165 (93.56%), Positives = 1126/1165 (96.65%), Query Frame = 0
Query: 1 MEAPAEE-ESSPEEQNPKTSLFPLSFVANNPQSLSSPPNSSVPQWLCNSSFTTDLSVIND 60
MEAPAEE ES P EQNPK SLFPLSFV+NNPQ+ S+P NSSVPQWLCNSSFT+DLSVIND
Sbjct: 1 MEAPAEEKESPPAEQNPKPSLFPLSFVSNNPQTQSNPSNSSVPQWLCNSSFTSDLSVIND 60
Query: 61 ALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRK 120
ALSSQ+NV+ S SA +QEEAVEDEGGPSDRREVQKPSRSYELLESSAS+DDSEH KR+K
Sbjct: 61 ALSSQSNVYPSCSADSEQEEAVEDEGGPSDRREVQKPSRSYELLESSASEDDSEHEKRKK 120
Query: 121 RKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
RKKK++RRR NESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY
Sbjct: 121 RKKKKKRRRRNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
Query: 181 RMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
RMDVARYRPLNRGE+ G NFHGFSQWNKSSSALDRDADADVLD+KVKSGGRYWSAKNAAI
Sbjct: 181 RMDVARYRPLNRGERRGQNFHGFSQWNKSSSALDRDADADVLDNKVKSGGRYWSAKNAAI 240
Query: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
ERHKNFKRVRIGFSR T DTLLDDFIPLSDVQTS+NIEESWEDEVLRKTREFNKLTREHP
Sbjct: 241 ERHKNFKRVRIGFSRNTSDTLLDDFIPLSDVQTSSNIEESWEDEVLRKTREFNKLTREHP 300
Query: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
HDEKAWLAFAEFQDKVAAM+PQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN
Sbjct: 301 HDEKAWLAFAEFQDKVAAMEPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
Query: 361 RDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACN 420
RDNIDVVISRWEKIL+QNSGSYRLWREFLHL+QGEFS+FKVSDMRQ+YAHAIQALSAACN
Sbjct: 361 RDNIDVVISRWEKILLQNSGSYRLWREFLHLMQGEFSKFKVSDMRQMYAHAIQALSAACN 420
Query: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP
Sbjct: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
Query: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
ALHLNDRSKQRLFEHFWNT+AERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW
Sbjct: 481 ALHLNDRSKQRLFEHFWNTNAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
Query: 541 TGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADAG 600
+GW DP PKE KNSDGT TTAEM VAAEET+E YV EEDIEREDSTEALLKILGIN DAG
Sbjct: 541 SGWFDPAPKENKNSDGTGTTAEMDVAAEETVEGYV-EEDIEREDSTEALLKILGINTDAG 600
Query: 601 VDEEVKDVSTWARWSKEESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYEDV 660
VDEEVKD STWARWSKEESSRD EQWMP+R +TADVIHDEGMPDGETNEQ LRVILYEDV
Sbjct: 601 VDEEVKDASTWARWSKEESSRDSEQWMPVRERTADVIHDEGMPDGETNEQLLRVILYEDV 660
Query: 661 KEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRSV 720
KE+LFSL+SSEARLSLIYQLIEFFSGKIYSR+SSNSSSWMERILSLEVLPDDIL HLRSV
Sbjct: 661 KEYLFSLVSSEARLSLIYQLIEFFSGKIYSRASSNSSSWMERILSLEVLPDDILHHLRSV 720
Query: 721 HDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
HDVLNKRQ SSSSST+EVL+G S+NLSQMSDMMKFLRN ILLCLTAFPRNYILEEAALIA
Sbjct: 721 HDVLNKRQISSSSSTMEVLIGSSDNLSQMSDMMKFLRNTILLCLTAFPRNYILEEAALIA 780
Query: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL
Sbjct: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
Query: 841 ASVESLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQP 900
ASVESLPVDQKSNAPLLYFWYAELEL NDH+NGHNS NRAVHILSCLGSGT YSPFKCQP
Sbjct: 841 ASVESLPVDQKSNAPLLYFWYAELELVNDHNNGHNSSNRAVHILSCLGSGTTYSPFKCQP 900
Query: 901 SSLELLRAHQGFKEKIREVRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQAF 960
SSL+LLRAHQGFK+KIREVRSTWLHGVIDDSS ALISSAALFEELTTGYNAGLEVL QAF
Sbjct: 901 SSLQLLRAHQGFKDKIREVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLHQAF 960
Query: 961 SMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFLE 1020
SMVLPERRKQSYQLE+LFNYYVKML RHHKQLSQLKVR+SI+HGLQFYPLNPELYSAFLE
Sbjct: 961 SMVLPERRKQSYQLEHLFNYYVKMLRRHHKQLSQLKVRESITHGLQFYPLNPELYSAFLE 1020
Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALGNENLRH 1080
ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKAL NENLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALENENLRH 1080
Query: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
Query: 1141 EVMRDKELNLRTDIYEILLQDELVS 1165
EVMRDKELNLRTDIYEILLQDELVS
Sbjct: 1141 EVMRDKELNLRTDIYEILLQDELVS 1164
BLAST of HG10020750 vs. ExPASy TrEMBL
Match:
A0A1S3CSX9 (protein NRDE2 homolog isoform X1 OS=Cucumis melo OX=3656 GN=LOC103504593 PE=3 SV=1)
HSP 1 Score: 2139.8 bits (5543), Expect = 0.0e+00
Identity = 1090/1165 (93.56%), Positives = 1126/1165 (96.65%), Query Frame = 0
Query: 1 MEAPAEE-ESSPEEQNPKTSLFPLSFVANNPQSLSSPPNSSVPQWLCNSSFTTDLSVIND 60
MEAPAEE ES P EQNPK SLFPLSFV+NNPQ+ S+P NSSVPQWLCNSSFT+DLSVIND
Sbjct: 1 MEAPAEEKESPPAEQNPKPSLFPLSFVSNNPQTQSNPSNSSVPQWLCNSSFTSDLSVIND 60
Query: 61 ALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRK 120
ALSSQ+NV+ S SA +QEEAVEDEGGPSDRREVQKPSRSYELLESSAS+DDSEH KR+K
Sbjct: 61 ALSSQSNVYPSCSADSEQEEAVEDEGGPSDRREVQKPSRSYELLESSASEDDSEHEKRKK 120
Query: 121 RKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
RKKK++RRR NESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY
Sbjct: 121 RKKKKKRRRRNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
Query: 181 RMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
RMDVARYRPLNRGE+ G NFHGFSQWNKSSSALDRDADADVLD+KVKSGGRYWSAKNAAI
Sbjct: 181 RMDVARYRPLNRGERRGQNFHGFSQWNKSSSALDRDADADVLDNKVKSGGRYWSAKNAAI 240
Query: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
ERHKNFKRVRIGFSR T DTLLDDFIPLSDVQTS+NIEESWEDEVLRKTREFNKLTREHP
Sbjct: 241 ERHKNFKRVRIGFSRNTSDTLLDDFIPLSDVQTSSNIEESWEDEVLRKTREFNKLTREHP 300
Query: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
HDEKAWLAFAEFQDKVAAM+PQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN
Sbjct: 301 HDEKAWLAFAEFQDKVAAMEPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
Query: 361 RDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACN 420
RDNIDVVISRWEKIL+QNSGSYRLWREFLHL+QGEFS+FKVSDMRQ+YAHAIQALSAACN
Sbjct: 361 RDNIDVVISRWEKILLQNSGSYRLWREFLHLMQGEFSKFKVSDMRQMYAHAIQALSAACN 420
Query: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP
Sbjct: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
Query: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
ALHLNDRSKQRLFEHFWNT+AERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW
Sbjct: 481 ALHLNDRSKQRLFEHFWNTNAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
Query: 541 TGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADAG 600
+GW DP PKE KNSDGT TTAEM VAAEET+E YV EEDIEREDSTEALLKILGIN DAG
Sbjct: 541 SGWFDPAPKENKNSDGTGTTAEMDVAAEETVEGYV-EEDIEREDSTEALLKILGINTDAG 600
Query: 601 VDEEVKDVSTWARWSKEESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYEDV 660
VDEEVKD STWARWSKEESSRD EQWMP+R +TADVIHDEGMPDGETNEQ LRVILYEDV
Sbjct: 601 VDEEVKDASTWARWSKEESSRDSEQWMPVRERTADVIHDEGMPDGETNEQLLRVILYEDV 660
Query: 661 KEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRSV 720
KE+LFSL+SSEARLSLIYQLIEFFSGKIYSR+SSNSSSWMERILSLEVLPDDIL HLRSV
Sbjct: 661 KEYLFSLVSSEARLSLIYQLIEFFSGKIYSRASSNSSSWMERILSLEVLPDDILHHLRSV 720
Query: 721 HDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
HDVLNKRQ SSSSST+EVL+G S+NLSQMSDMMKFLRN ILLCLTAFPRNYILEEAALIA
Sbjct: 721 HDVLNKRQISSSSSTMEVLIGSSDNLSQMSDMMKFLRNTILLCLTAFPRNYILEEAALIA 780
Query: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL
Sbjct: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
Query: 841 ASVESLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQP 900
ASVESLPVDQKSNAPLLYFWYAELEL NDH+NGHNS NRAVHILSCLGSGT YSPFKCQP
Sbjct: 841 ASVESLPVDQKSNAPLLYFWYAELELVNDHNNGHNSSNRAVHILSCLGSGTTYSPFKCQP 900
Query: 901 SSLELLRAHQGFKEKIREVRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQAF 960
SSL+LLRAHQGFK+KIREVRSTWLHGVIDDSS ALISSAALFEELTTGYNAGLEVL QAF
Sbjct: 901 SSLQLLRAHQGFKDKIREVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLHQAF 960
Query: 961 SMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFLE 1020
SMVLPERRKQSYQLE+LFNYYVKML RHHKQLSQLKVR+SI+HGLQFYPLNPELYSAFLE
Sbjct: 961 SMVLPERRKQSYQLEHLFNYYVKMLRRHHKQLSQLKVRESITHGLQFYPLNPELYSAFLE 1020
Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALGNENLRH 1080
ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKAL NENLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALENENLRH 1080
Query: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
Query: 1141 EVMRDKELNLRTDIYEILLQDELVS 1165
EVMRDKELNLRTDIYEILLQDELVS
Sbjct: 1141 EVMRDKELNLRTDIYEILLQDELVS 1164
BLAST of HG10020750 vs. ExPASy TrEMBL
Match:
A0A1S3CT61 (protein NRDE2 homolog isoform X2 OS=Cucumis melo OX=3656 GN=LOC103504593 PE=3 SV=1)
HSP 1 Score: 2133.2 bits (5526), Expect = 0.0e+00
Identity = 1089/1165 (93.48%), Positives = 1125/1165 (96.57%), Query Frame = 0
Query: 1 MEAPAEE-ESSPEEQNPKTSLFPLSFVANNPQSLSSPPNSSVPQWLCNSSFTTDLSVIND 60
MEAPAEE ES P EQNPK SLFPLSFV+NNPQ+ S+P NSSVPQWLCNSSFT+DLSVIND
Sbjct: 1 MEAPAEEKESPPAEQNPKPSLFPLSFVSNNPQTQSNPSNSSVPQWLCNSSFTSDLSVIND 60
Query: 61 ALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRK 120
ALSSQ+NV+ S SA +QEEAVEDEGGPSDRREVQKPSRSYELLESSAS+DDSEH KR+K
Sbjct: 61 ALSSQSNVYPSCSADSEQEEAVEDEGGPSDRREVQKPSRSYELLESSASEDDSEHEKRKK 120
Query: 121 RKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
RKKK++RRR NESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY
Sbjct: 121 RKKKKKRRRRNESEERGGFGEYGSRKSDVRAWADADGRPSKDYYFDSNGDRDNLAFGSLY 180
Query: 181 RMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
RMDVARYRPLNRGE+ G NFHGFSQWNKSSSALDRDADADVLD+KVKSGGRYWSAKNAAI
Sbjct: 181 RMDVARYRPLNRGERRGQNFHGFSQWNKSSSALDRDADADVLDNKVKSGGRYWSAKNAAI 240
Query: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
ERHKNFKRVRIGFSR T DTLLDDFIPLSDVQTS+NIEESWEDEVLRKTREFNKLTREHP
Sbjct: 241 ERHKNFKRVRIGFSRNTSDTLLDDFIPLSDVQTSSNIEESWEDEVLRKTREFNKLTREHP 300
Query: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
HDEKAWLAFAEFQDKVAAM+PQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN
Sbjct: 301 HDEKAWLAFAEFQDKVAAMEPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
Query: 361 RDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACN 420
RDNIDVVISRWEKIL+QNSGSYRLWREFLHL+QGEFS+FKVSDMRQ+YAHAIQALSAACN
Sbjct: 361 RDNIDVVISRWEKILLQNSGSYRLWREFLHLMQGEFSKFKVSDMRQMYAHAIQALSAACN 420
Query: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP
Sbjct: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
Query: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
ALHLNDRSKQRLFEHFWNT+AERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW
Sbjct: 481 ALHLNDRSKQRLFEHFWNTNAERVGEEGAVGWSTWLEKEEENRQKAMREEALEADEKGGW 540
Query: 541 TGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADAG 600
+GW DP PKE KNSDGT TTAEM VAAEET+E YV EEDIEREDSTEALLKILGIN DAG
Sbjct: 541 SGWFDPAPKENKNSDGTGTTAEMDVAAEETVEGYV-EEDIEREDSTEALLKILGINTDAG 600
Query: 601 VDEEVKDVSTWARWSKEESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYEDV 660
VDEEVKD STWARWSKEESSRD EQWMP+R +T DVIHDEGMPDGETNEQ LRVILYEDV
Sbjct: 601 VDEEVKDASTWARWSKEESSRDSEQWMPVRERT-DVIHDEGMPDGETNEQLLRVILYEDV 660
Query: 661 KEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRSV 720
KE+LFSL+SSEARLSLIYQLIEFFSGKIYSR+SSNSSSWMERILSLEVLPDDIL HLRSV
Sbjct: 661 KEYLFSLVSSEARLSLIYQLIEFFSGKIYSRASSNSSSWMERILSLEVLPDDILHHLRSV 720
Query: 721 HDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALIA 780
HDVLNKRQ SSSSST+EVL+G S+NLSQMSDMMKFLRN ILLCLTAFPRNYILEEAALIA
Sbjct: 721 HDVLNKRQISSSSSTMEVLIGSSDNLSQMSDMMKFLRNTILLCLTAFPRNYILEEAALIA 780
Query: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL
Sbjct: 781 EELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMAL 840
Query: 841 ASVESLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQP 900
ASVESLPVDQKSNAPLLYFWYAELEL NDH+NGHNS NRAVHILSCLGSGT YSPFKCQP
Sbjct: 841 ASVESLPVDQKSNAPLLYFWYAELELVNDHNNGHNSSNRAVHILSCLGSGTTYSPFKCQP 900
Query: 901 SSLELLRAHQGFKEKIREVRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQAF 960
SSL+LLRAHQGFK+KIREVRSTWLHGVIDDSS ALISSAALFEELTTGYNAGLEVL QAF
Sbjct: 901 SSLQLLRAHQGFKDKIREVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLHQAF 960
Query: 961 SMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFLE 1020
SMVLPERRKQSYQLE+LFNYYVKML RHHKQLSQLKVR+SI+HGLQFYPLNPELYSAFLE
Sbjct: 961 SMVLPERRKQSYQLEHLFNYYVKMLRRHHKQLSQLKVRESITHGLQFYPLNPELYSAFLE 1020
Query: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALGNENLRH 1080
ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKAL NENLRH
Sbjct: 1021 ISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALENENLRH 1080
Query: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ
Sbjct: 1081 SVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQ 1140
Query: 1141 EVMRDKELNLRTDIYEILLQDELVS 1165
EVMRDKELNLRTDIYEILLQDELVS
Sbjct: 1141 EVMRDKELNLRTDIYEILLQDELVS 1163
BLAST of HG10020750 vs. ExPASy TrEMBL
Match:
A0A6J1E7N7 (protein NRDE2 homolog isoform X5 OS=Cucurbita moschata OX=3662 GN=LOC111431485 PE=3 SV=1)
HSP 1 Score: 2091.2 bits (5417), Expect = 0.0e+00
Identity = 1070/1166 (91.77%), Positives = 1108/1166 (95.03%), Query Frame = 0
Query: 1 MEAPAEEESSPEEQNPKTSLFPLSFVANNPQSLSSPPNSSVPQWLCNSSFTTDLSVINDA 60
MEAPAEEE PEEQ PKTSLFPL FVANNPQS SPPNSSVPQWLCNSSFTTDLSVINDA
Sbjct: 1 MEAPAEEELPPEEQKPKTSLFPLPFVANNPQSQISPPNSSVPQWLCNSSFTTDLSVINDA 60
Query: 61 LSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSRSYELLESSASDDDSEHGKRRKR 120
LSSQNNV+ S+S GDQEEAVEDEGGPS R EVQK SRSYELLESSASDDDS+H KR+KR
Sbjct: 61 LSSQNNVYPSLSTDGDQEEAVEDEGGPSVRPEVQKSSRSYELLESSASDDDSDHEKRKKR 120
Query: 121 KKKRRRRRGNESEERGGFGEYGSRKSDVRAWAD-ADGRPSKDYYFDSNGDRDNLAFGSLY 180
KKK+RRRR NE EE+ GFGEYGSRKSDVRAWAD ADGRPSKDYYFDSNGDRDNLAFGSLY
Sbjct: 121 KKKKRRRR-NEYEEKKGFGEYGSRKSDVRAWADAADGRPSKDYYFDSNGDRDNLAFGSLY 180
Query: 181 RMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADADVLDSKVKSGGRYWSAKNAAI 240
RMDVARYRPLN GE+PGLNF+GFSQWNKSSSALD+DADA+VLDSK+KSGGRYWSAKNAAI
Sbjct: 181 RMDVARYRPLNHGERPGLNFNGFSQWNKSSSALDKDADAEVLDSKLKSGGRYWSAKNAAI 240
Query: 241 ERHKNFKRVRIGFSRKTPDTLLDDFIPLSDVQTSNNIEESWEDEVLRKTREFNKLTREHP 300
ERHKNFKRVRIGFSRKTPD LLDDFIP SD QTSNNIEESWEDEVLRKTREFNKLTREHP
Sbjct: 241 ERHKNFKRVRIGFSRKTPDKLLDDFIPFSDSQTSNNIEESWEDEVLRKTREFNKLTREHP 300
Query: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKTYQN 360
HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLK YQ
Sbjct: 301 HDEKAWLAFAEFQDKVAAMQPQKGARLQTLEKKISILEKAAELNPENEELLLYLLKNYQK 360
Query: 361 RDNIDVVISRWEKILMQNSGSYRLWREFLHLIQGEFSRFKVSDMRQLYAHAIQALSAACN 420
RD IDVVIS WEKILMQNSGSY+LWREFLHLIQGEFSRFKVSDMRQ+YAHAIQALSAACN
Sbjct: 361 RDTIDVVISTWEKILMQNSGSYKLWREFLHLIQGEFSRFKVSDMRQMYAHAIQALSAACN 420
Query: 421 QHIRQANQIAKPSVEHDLIQLELGLVDIFMSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
QHIRQANQ AKPSVEHDLIQLELGLVDIF+SLCRFEWQAGYQELATALFQAEIEFSLFCP
Sbjct: 421 QHIRQANQTAKPSVEHDLIQLELGLVDIFLSLCRFEWQAGYQELATALFQAEIEFSLFCP 480
Query: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGAVGWSTWLEKEEENRQKAMR-EEALEADEKGG 540
ALHLNDRSKQRLFEHFWNTDAERVGEEGA+GWSTWLEKEEENRQK MR EEALEADEKGG
Sbjct: 481 ALHLNDRSKQRLFEHFWNTDAERVGEEGALGWSTWLEKEEENRQKVMREEEALEADEKGG 540
Query: 541 WTGWSDPPPKEKKNSDGTETTAEMGVAAEETMEEYVEEEDIEREDSTEALLKILGINADA 600
WTGWSDP PKEKKN+D ETTAE+GVAAEE ME+ VEEED EREDSTEALLKILGIN DA
Sbjct: 541 WTGWSDPAPKEKKNNDDAETTAEVGVAAEEAMEQDVEEEDTEREDSTEALLKILGINPDA 600
Query: 601 GVDEEVKDVSTWARWSKEESSRDCEQWMPIRGKTADVIHDEGMPDGETNEQFLRVILYED 660
GVDEEVKD STWARWSKEES RDCEQWMPIR ADVIHDEGMPDGETNEQF RVILYED
Sbjct: 601 GVDEEVKDTSTWARWSKEESLRDCEQWMPIRENYADVIHDEGMPDGETNEQFQRVILYED 660
Query: 661 VKEFLFSLISSEARLSLIYQLIEFFSGKIYSRSSSNSSSWMERILSLEVLPDDILRHLRS 720
VKE+LFSLISSEARLSLIYQLIEFFSGKI SR +SNSSSWMERILSLEVLPDDIL HLRS
Sbjct: 661 VKEYLFSLISSEARLSLIYQLIEFFSGKICSRVASNSSSWMERILSLEVLPDDILHHLRS 720
Query: 721 VHDVLNKRQSSSSSSTLEVLVGGSENLSQMSDMMKFLRNAILLCLTAFPRNYILEEAALI 780
VHDVLNKRQSSSSS TLEVLVGGS+NL+QMSDMMKFLRN ILLCLTAFPRN+ILEEAALI
Sbjct: 721 VHDVLNKRQSSSSSFTLEVLVGGSDNLTQMSDMMKFLRNVILLCLTAFPRNFILEEAALI 780
Query: 781 AEELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATYGNIDHARKVFDMA 840
AEELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREAT+GNIDHARKVFDMA
Sbjct: 781 AEELFVTKMNSCSSSVTPCRSLAKNLLKSDRQDMLLCGVYARREATHGNIDHARKVFDMA 840
Query: 841 LASVESLPVDQKSNAPLLYFWYAELELANDHHNGHNSLNRAVHILSCLGSGTAYSPFKCQ 900
LASVESLPVDQKSNAPLLYFWYAELELA D HNGH+S+NRAVHILSCLG+G +YSPFKCQ
Sbjct: 841 LASVESLPVDQKSNAPLLYFWYAELELAKDPHNGHDSVNRAVHILSCLGNGDSYSPFKCQ 900
Query: 901 PSSLELLRAHQGFKEKIREVRSTWLHGVIDDSSAALISSAALFEELTTGYNAGLEVLDQA 960
PSSL+LLRAHQGFKEKIR VRSTWLHGVIDDSS ALISSAALFEELTTGYNAGLEVLDQA
Sbjct: 901 PSSLQLLRAHQGFKEKIRAVRSTWLHGVIDDSSVALISSAALFEELTTGYNAGLEVLDQA 960
Query: 961 FSMVLPERRKQSYQLEYLFNYYVKMLLRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFL 1020
F+MVLPERRK SYQLE LFNYYVKMLLRHHKQLSQLKVR+SIS GLQFYPLNPELY+AFL
Sbjct: 961 FNMVLPERRKHSYQLECLFNYYVKMLLRHHKQLSQLKVRESISQGLQFYPLNPELYTAFL 1020
Query: 1021 EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYGGSLHRIRRLFEKALGNENLR 1080
EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGY GS HRIRRLFEKAL N+NLR
Sbjct: 1021 EISYIYSVPSKLRWTFDDYCQKQPSLILWIFALSFEMGYAGSPHRIRRLFEKALENDNLR 1080
Query: 1081 HSVLLWRCYISYELNTACDPSSARRVFFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDL 1140
HSVLLWRCYISYELNTACDPSSA+RVFFRAIHSCPWSKKLWLDGF+KLNS+LSAKELSDL
Sbjct: 1081 HSVLLWRCYISYELNTACDPSSAKRVFFRAIHSCPWSKKLWLDGFIKLNSILSAKELSDL 1140
Query: 1141 QEVMRDKELNLRTDIYEILLQDELVS 1165
QEVM DKELNLRTDIYEILLQ+EL+S
Sbjct: 1141 QEVMHDKELNLRTDIYEILLQEELIS 1165
BLAST of HG10020750 vs. TAIR 10
Match:
AT3G17740.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1740 (InterPro:IPR013633); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G17712.1); Has 409 Blast hits to 335 proteins in 133 species: Archae - 1; Bacteria - 0; Metazoa - 140; Fungi - 188; Plants - 42; Viruses - 0; Other Eukaryotes - 38 (source: NCBI BLink). )
HSP 1 Score: 1179.1 bits (3049), Expect = 0.0e+00
Identity = 638/1134 (56.26%), Positives = 818/1134 (72.13%), Query Frame = 0
Query: 39 SSVPQWLCNSSFTTDLSVINDALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSR 98
S+ PQWL N+SFTTDLSVIN A S+ + S + AG D++E EGG + +R
Sbjct: 33 SNAPQWLRNASFTTDLSVINAAASTAPS-SSEVEAGDDEDE----EGGADGNIGLANQAR 92
Query: 99 SYELLESSASDDDSEHGKRRKRKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRP 158
Y L+E S + + +RKR+KK++R+ N S+E SRKSD + +P
Sbjct: 93 VYNLVEEEGSLESDDDKVKRKREKKKKRKSDNASDES------RSRKSD-----EYYSKP 152
Query: 159 SKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADA 218
KDYY D+ D DNLA+GS+YRM+V RY+ N PG F N+ SS LD + D
Sbjct: 153 VKDYYLDTRPDPDNLAYGSIYRMNVPRYKLDNSQRVPGSGSLRFYLRNRRSSMLDTEIDI 212
Query: 219 DVLDSKVKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDTLLDDFIPL-SDVQTSNNIE 278
D L+ + KS RYW AK+AA+ER+KNFKR+R+ + + D+ D+FIPL DV + E
Sbjct: 213 DSLEGRAKSDTRYWYAKHAAMERNKNFKRIRLSAASEAVDSSFDNFIPLEEDVTVPESDE 272
Query: 279 E-----------SWEDEVLRKTREFNKLTREHPHDEKAWLAFAEFQDKVAAMQPQKGARL 338
E SWEDEVL KTREFN++TRE PHD KAWLAFA+FQDKV++MQ QKG RL
Sbjct: 273 EDVLSKDSMIGASWEDEVLNKTREFNRVTRERPHDAKAWLAFADFQDKVSSMQSQKGVRL 332
Query: 339 QTLEKKISILEKAAELNPENEELLLYLLKTYQNRDNIDVVISRWEKILMQNSGSYRLWRE 398
QTLEKKISILEKA ELNP++EELLL LLK Y++RDN DV+ISRWEK LMQNS SY+LWRE
Sbjct: 333 QTLEKKISILEKAFELNPDSEELLLALLKAYRSRDNADVLISRWEKALMQNSASYKLWRE 392
Query: 399 FLHLIQGEFSRFKVSDMRQLYAHAIQALSAACNQHIRQANQIAKPSVEHDLIQLELGLVD 458
FL ++QGEFSRFKVS++R+LY++AIQALS+AC++ RQ + ++P ++ IQ EL LVD
Sbjct: 393 FLCVVQGEFSRFKVSEVRRLYSYAIQALSSACSKRHRQVDTTSEP-LDSAAIQQELVLVD 452
Query: 459 IFMSLCRFEWQAGYQELATALFQAEIEFSLFCPALHLNDRSKQRLFEHFWNTDAERVGEE 518
+ +SLCRFEWQAGYQELATAL QAE+EFS+F P+L L ++SK RLFEHFW+++ RVGEE
Sbjct: 453 MLVSLCRFEWQAGYQELATALLQAEVEFSIFSPSLLLTEQSKLRLFEHFWSSNGARVGEE 512
Query: 519 GAVGWSTWLEKEEENRQKAMREEALEADEKGGWTGWSDPPPKEKKNSDGTETTAEMGVAA 578
GA GW WLEKEEENRQK ++EE+ + +E GGWTGW++ + + T E+ V
Sbjct: 513 GAFGWLLWLEKEEENRQKILKEESSDDNEVGGWTGWTEQVSGRNGDDIASANTGEVDV-D 572
Query: 579 EETMEEYVEEEDIEREDSTEALLKILGINADAGVDEEVKDVSTWARWSKEESSRDCEQWM 638
+ ++E +E+E+ + ED TEA+LK+LGI+ + +EVKD STW +W +EE SRD QWM
Sbjct: 573 RKGLDEEMEDENSKPEDDTEAMLKLLGIDVNTAASDEVKDTSTWVKWFEEEVSRDHSQWM 632
Query: 639 PIRGKTADVIHDEGMPDGETNEQFLRVILYEDVKEFLFSLISSEARLSLIYQLIEFFSGK 698
P R K + EGM +GE EQ V+LYED+ +LFSL S EARLSL+YQ I+FF
Sbjct: 633 PTR-KAGEFSSVEGMGEGEDEEQLSSVVLYEDINGYLFSLRSKEARLSLVYQFIDFFGAH 692
Query: 699 IYSRSSSNSSSWMERILSLEVLPDDILRHLRSVHDVLNKRQSSSSSSTLEVLVGGSENLS 758
I +SSNS SW E+I SLE D +L +LRSVH+ L+K S++ S L L+GGS +LS
Sbjct: 693 ISPWTSSNSLSWSEKISSLETFSDSMLENLRSVHECLSKSDSANCFS-LGSLLGGSCDLS 752
Query: 759 QMSDMMKFLRNAILLCLTAFPRNYILEEAALIAEELFVTKMNSCSSSVTPCRSLAKNLLK 818
++MMKFLRNAILLCL FPRNYILEEA L+AEELFVT M +C + PC++LAK LLK
Sbjct: 753 MRTEMMKFLRNAILLCLNVFPRNYILEEAVLVAEELFVTNMKTCEVATMPCQALAKRLLK 812
Query: 819 SDRQDMLLCGVYARREATYGNIDHARKVFDMALASVESLPVDQKSNAPLLYFWYAELELA 878
SDRQD+LLCGVYA+REA GN+ HAR+VFDMAL S+ LP + + N PLL WYAE E+A
Sbjct: 813 SDRQDLLLCGVYAQREAASGNMKHARRVFDMALTSICGLPKELQCNTPLLCLWYAESEVA 872
Query: 879 NDHHNGHN--SLNRAVHILSCLGSGTAYSPFKCQPSSLELLRAHQGFKEKIREVRSTWLH 938
N +G + S +RA+HIL LGSG AYSP+ Q SS+++LRA QGF+EK+++++STW H
Sbjct: 873 NSSGSGRDTESSSRAMHILCYLGSGLAYSPYTSQSSSMQILRARQGFREKLKKIQSTWSH 932
Query: 939 GVIDDSSAALISSAALFEELTTGYNAGLEVLDQAFSMVLPERRKQSYQLEYLFNYYVKML 998
GV DD SAAL+ SAALFEELT LE+L+ FS VLP R+ QS+QLE LFNYYV+ML
Sbjct: 933 GVTDDQSAALVCSAALFEELTNDLPGALEILEHMFSSVLPGRKSQSHQLELLFNYYVRML 992
Query: 999 LRHHKQLSQLKVRQSISHGLQFYPLNPELYSAFLEISYIYSVPSKLRWTFDDYCQKQPSL 1058
RH L+ ++ + IS GLQ YPLNPELY A ++I KLR FDDY +K S+
Sbjct: 993 QRHQDDLTLSQLWKPISEGLQLYPLNPELYRALVDICNHRMTSHKLRMMFDDYSRKNSSV 1052
Query: 1059 ILWIFALSFEMGYGGSLHRIRRLFEKALGNENLRHSVLLWRCYISYELNTACDPSSARRV 1118
++W+FALS+E+ GGS HRIR LFE+AL + +SV+LWRCYI+YE++ A +PS+ARR+
Sbjct: 1053 VVWLFALSYELSKGGSSHRIRGLFERALAQDTQNNSVILWRCYIAYEIDIADNPSAARRI 1112
Query: 1119 FFRAIHSCPWSKKLWLDGFLKLNSVLSAKELSDLQEVMRDKELNLRTDIYEILL 1159
+FRAI++CPWSKKLWLDGF KL SVL+AKE+SDLQEVMRDKELN+RTDIYEILL
Sbjct: 1113 YFRAINACPWSKKLWLDGFGKLGSVLTAKEMSDLQEVMRDKELNIRTDIYEILL 1146
BLAST of HG10020750 vs. TAIR 10
Match:
AT3G17712.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1740 (InterPro:IPR013633); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G17740.1); Has 265 Blast hits to 264 proteins in 123 species: Archae - 1; Bacteria - 0; Metazoa - 116; Fungi - 89; Plants - 33; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )
HSP 1 Score: 716.5 bits (1848), Expect = 3.6e-206
Identity = 408/764 (53.40%), Positives = 527/764 (68.98%), Query Frame = 0
Query: 39 SSVPQWLCNSSFTTDLSVINDALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSR 98
S+ PQWL N+SFTTDLSVIN A S+ + S + AG D++E EGG + +R
Sbjct: 33 SNAPQWLRNASFTTDLSVINAAASTAPS-SSEVEAGDDEDE----EGGADGNIGLANQAR 92
Query: 99 SYELLESSASDDDSEHGKRRKRKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRP 158
Y L+E S + + +RKR+KK++R+ N S+E SRKSD + +P
Sbjct: 93 VYNLVEEEGSLESDDDKVKRKREKKKKRKSDNASDES------RSRKSD-----EYYSKP 152
Query: 159 SKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADA 218
KDYY D+ D DNLA+GS+YRM+V RY+ N PG F N+ SS LD + D
Sbjct: 153 VKDYYLDTRPDPDNLAYGSIYRMNVPRYKLDNSQRVPGSGSLRFYLRNRRSSMLDTEIDI 212
Query: 219 DVLDSKVKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDTLLDDFIPL-SDVQTSNNIE 278
D L+ + KS RYW AK+AA+ER+KNFKR+R+ + + D+ D+FIPL DV + E
Sbjct: 213 DSLEGRAKSDTRYWYAKHAAMERNKNFKRIRLSAASEAVDSSFDNFIPLEEDVTVPESDE 272
Query: 279 E-----------SWEDEVLRKTREFNKLTREHPHDEKAWLAFAEFQDKVAAMQPQKGARL 338
E SWEDEVL KTREFN++TRE PHD KAWLAFA+FQDKV++MQ QKG RL
Sbjct: 273 EDVLSKDSMIGASWEDEVLNKTREFNRVTRERPHDAKAWLAFADFQDKVSSMQSQKGVRL 332
Query: 339 QTLEKKISILEKAAELNPENEELLLYLLKTYQNRDNIDVVISRWEKILMQNSGSYRLWRE 398
QTLEKKISILEKA ELNP++EELLL LLK Y++RDN DV+ISRWEK LMQNS SY+LWRE
Sbjct: 333 QTLEKKISILEKAFELNPDSEELLLALLKAYRSRDNADVLISRWEKALMQNSASYKLWRE 392
Query: 399 FLHLIQGEFSRFKVSDMRQLYAHAIQALSAACNQHIRQANQIAKPSVEHDLIQLELGLVD 458
FL ++QGEFSRFKVS++R+LY++AIQALS+AC++ RQ + ++P ++ IQ EL LVD
Sbjct: 393 FLCVVQGEFSRFKVSEVRRLYSYAIQALSSACSKRHRQVDTTSEP-LDSAAIQQELVLVD 452
Query: 459 IFMSLCRFEWQAGYQELATALFQAEIEFSLFCPALHLNDRSKQRLFEHFWNTDAERVGEE 518
+ +SLCRFEWQAGYQELATAL QAE+EFS+F P+L L ++SK RLFEHFW+++ RVGEE
Sbjct: 453 MLVSLCRFEWQAGYQELATALLQAEVEFSIFSPSLLLTEQSKLRLFEHFWSSNGARVGEE 512
Query: 519 GAVGWSTWLEKEEENRQKAMREEALEADEKGGWTGWSDPPPKEKKNSDGTETTAEMGVAA 578
GA GW WLEKEEENRQK ++EE+ + +E GGWTGW++ + + T E+ V
Sbjct: 513 GAFGWLLWLEKEEENRQKILKEESSDDNEVGGWTGWTEQVSGRNGDDLASANTGEVDV-D 572
Query: 579 EETMEEYVEEEDIEREDSTEALLKILGINADAGVDEEVKDVSTWARWSKEESSRDCEQWM 638
+ ++E +E+E+ + ED TEA+LK+LGI+ + +EVKD STW W +EE SRD QWM
Sbjct: 573 RKGLDEEMEDENSKPEDDTEAMLKLLGIDVNTAASDEVKDTSTWVEWFEEEVSRDHSQWM 632
Query: 639 PIRGKTADVIHDEGMPDGETNEQFLRVILYEDVKEFLFSLISSEARLSLIYQLIEFFSGK 698
P R K + EGM +GE EQ V+LYED+ +LFSL S EARLSL+YQ I+FF
Sbjct: 633 PTR-KAGEFSSVEGMGEGEDEEQLSSVVLYEDINGYLFSLRSKEARLSLVYQFIDFFGAH 692
Query: 699 IYSRSSSNSSSWMERILSLEVLPDDILRHLRSVHDVLNKRQSSSSSSTLEVLVGGSENLS 758
I SW E+I SLE L D +L +LRSVH+ L+K S++ S L L+GGS +LS
Sbjct: 693 ISPMDFQQQLSWSEKISSLETLSDSMLENLRSVHECLSKSDSANCFS-LGSLLGGSCDLS 752
Query: 759 QMSDMMKFLRNAILLCLTAFPRNYILEEAALIAEELFVTKMNSC 791
++MMKFLRNAILLCL FP+NYI EEA L+ EELFVT M +C
Sbjct: 753 MRTEMMKFLRNAILLCLNVFPQNYIPEEAVLVTEELFVTNMKTC 776
BLAST of HG10020750 vs. TAIR 10
Match:
AT3G17712.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1740 (InterPro:IPR013633); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G17740.1). )
HSP 1 Score: 701.0 bits (1808), Expect = 1.5e-201
Identity = 413/820 (50.37%), Positives = 535/820 (65.24%), Query Frame = 0
Query: 39 SSVPQWLCNSSFTTDLSVINDALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSR 98
S+ PQWL N+SFTTDLSVIN A S+ + S + AG D++E EGG + +R
Sbjct: 64 SNAPQWLRNASFTTDLSVINAAASTAPS-SSEVEAGDDEDE----EGGADGNIGLANQAR 123
Query: 99 SYELLESSASDDDSEHGKRRKRKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRP 158
Y L+E S + + +RKR+KK++R+ N S+E SRKSD + +P
Sbjct: 124 VYNLVEEEGSLESDDDKVKRKREKKKKRKSDNASDES------RSRKSD-----EYYSKP 183
Query: 159 SKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADA 218
KDYY D+ D DNLA+GS+YRM+V RY+ N PG F N+ SS LD + D
Sbjct: 184 VKDYYLDTRPDPDNLAYGSIYRMNVPRYKLDNSQRVPGSGSLRFYLRNRRSSMLDTEIDI 243
Query: 219 DVLDSKVKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDTLLDDFIPL-SDVQTSNNIE 278
D L+ + KS RYW AK+AA+ER+KNFKR+R+ + + D+ D+FIPL DV + E
Sbjct: 244 DSLEGRAKSDTRYWYAKHAAMERNKNFKRIRLSAASEAVDSSFDNFIPLEEDVTVPESDE 303
Query: 279 E-----------SWEDEVLRKTREFNKLTREHPHDEKAWLAFAEFQDKVAAMQPQKGARL 338
E SWEDEVL KTREFN++TRE PHD KAWLAFA+FQDKV++MQ QKG RL
Sbjct: 304 EDVLSKDSMIGASWEDEVLNKTREFNRVTRERPHDAKAWLAFADFQDKVSSMQSQKGVRL 363
Query: 339 QTLEKKISILEKAAELNPENEELLLYLLKTYQNRDNIDVVISRWEKILMQNSGSYRLWRE 398
QTLEKKISILEKA ELNP++EELLL LLK Y++RDN DV+I
Sbjct: 364 QTLEKKISILEKAFELNPDSEELLLALLKAYRSRDNADVLIR------------------ 423
Query: 399 FLHLIQGEFSRFKVSDMRQLYAHAIQALSAACNQHIRQANQIAKPSVEHDLIQLELGLVD 458
EFSRFKVS++R+LY++AIQALS+AC++ RQ + ++P ++ IQ EL LVD
Sbjct: 424 -------EFSRFKVSEVRRLYSYAIQALSSACSKRHRQVDTTSEP-LDSAAIQQELVLVD 483
Query: 459 IFMSLCRFEWQAGYQELATALFQAEIEFSLFCPALHLNDRSKQRLFEHFWNTDAERVGEE 518
+ +SLCRFEWQAGYQELATAL QAE+EFS+F P+L L ++SK RLFEHFW+++ RVGEE
Sbjct: 484 MLVSLCRFEWQAGYQELATALLQAEVEFSIFSPSLLLTEQSKLRLFEHFWSSNGARVGEE 543
Query: 519 GAVGWSTWLEKEEENRQKAMREEALEADEKGGWTGWSDPPPKEKKNSDGTETTAEMGVAA 578
GA GW WLEKEEENRQK ++EE+ + +E GGWTGW++ + + T E+ V
Sbjct: 544 GAFGWLLWLEKEEENRQKILKEESSDDNEVGGWTGWTEQVSGRNGDDLASANTGEVDV-D 603
Query: 579 EETMEEYVEEEDIEREDSTEALLKILGINADAGVDEEVKDVSTWARWSKEESSRDCEQWM 638
+ ++E +E+E+ + ED TEA+LK+LGI+ + +EVKD STW W +EE SRD QWM
Sbjct: 604 RKGLDEEMEDENSKPEDDTEAMLKLLGIDVNTAASDEVKDTSTWVEWFEEEVSRDHSQWM 663
Query: 639 PIRGKTADVIHDEGMPDGETNEQFLRVILYEDVKEFLFSLISSEARLSLIYQLIEFFSGK 698
P R K + EGM +GE EQ V+LYED+ +LFSL S EARLSL+YQ I+FF
Sbjct: 664 PTR-KAGEFSSVEGMGEGEDEEQLSSVVLYEDINGYLFSLRSKEARLSLVYQFIDFFGAH 723
Query: 699 IYSRSSSNSSSWMERILSLEVLPDDILRHLRSVHDVLNKRQSSSSSSTLEVLVGGSENLS 758
I SW E+I SLE L D +L +LRSVH+ L+K S++ S L L+GGS +LS
Sbjct: 724 ISPMDFQQQLSWSEKISSLETLSDSMLENLRSVHECLSKSDSANCFS-LGSLLGGSCDLS 783
Query: 759 QMSDMMKFLRNAILLCLTAFPRNYILEEAALIAEELFVTKMNSCSSSVTPCRSLAKNLLK 818
++MMKFLRNAILLCL FP+NYI EEA L+ EELFVT M +C
Sbjct: 784 MRTEMMKFLRNAILLCLNVFPQNYIPEEAVLVTEELFVTNMKTC---------------- 819
Query: 819 SDRQDMLLCGVYARREATYGNIDHARKVFDMALASVESLP 847
+D+LLCGVYA+REA GN+ HAR+VFDMAL S+ LP
Sbjct: 844 ---EDLLLCGVYAQREAASGNMKHARRVFDMALTSICGLP 819
BLAST of HG10020750 vs. TAIR 10
Match:
AT3G17712.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1740 (InterPro:IPR013633); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G17740.1). )
HSP 1 Score: 659.1 bits (1699), Expect = 6.7e-189
Identity = 388/764 (50.79%), Positives = 504/764 (65.97%), Query Frame = 0
Query: 39 SSVPQWLCNSSFTTDLSVINDALSSQNNVHSSISAGGDQEEAVEDEGGPSDRREVQKPSR 98
S+ PQWL N+SFTTDLSVIN A S+ + S + AG D++E EGG + +R
Sbjct: 33 SNAPQWLRNASFTTDLSVINAAASTAPS-SSEVEAGDDEDE----EGGADGNIGLANQAR 92
Query: 99 SYELLESSASDDDSEHGKRRKRKKKRRRRRGNESEERGGFGEYGSRKSDVRAWADADGRP 158
Y L+E S + + +RKR+KK++R+ N S+E SRKSD + +P
Sbjct: 93 VYNLVEEEGSLESDDDKVKRKREKKKKRKSDNASDES------RSRKSD-----EYYSKP 152
Query: 159 SKDYYFDSNGDRDNLAFGSLYRMDVARYRPLNRGEKPGLNFHGFSQWNKSSSALDRDADA 218
KDYY D+ D DNLA+GS+YRM+V RY+ N PG F N+ SS LD + D
Sbjct: 153 VKDYYLDTRPDPDNLAYGSIYRMNVPRYKLDNSQRVPGSGSLRFYLRNRRSSMLDTEIDI 212
Query: 219 DVLDSKVKSGGRYWSAKNAAIERHKNFKRVRIGFSRKTPDTLLDDFIPL-SDVQTSNNIE 278
D L+ + KS RYW AK+AA+ER+KNFKR+R+ + + D+ D+FIPL DV + E
Sbjct: 213 DSLEGRAKSDTRYWYAKHAAMERNKNFKRIRLSAASEAVDSSFDNFIPLEEDVTVPESDE 272
Query: 279 E-----------SWEDEVLRKTREFNKLTREHPHDEKAWLAFAEFQDKVAAMQPQKGARL 338
E SWEDEVL KTREFN++TRE PHD KAWLAFA+FQDKV++MQ QKG RL
Sbjct: 273 EDVLSKDSMIGASWEDEVLNKTREFNRVTRERPHDAKAWLAFADFQDKVSSMQSQKGVRL 332
Query: 339 QTLEKKISILEKAAELNPENEELLLYLLKTYQNRDNIDVVISRWEKILMQNSGSYRLWRE 398
QTLEKKISILEKA ELNP++EELLL LLK Y++RDN DV+I
Sbjct: 333 QTLEKKISILEKAFELNPDSEELLLALLKAYRSRDNADVLIR------------------ 392
Query: 399 FLHLIQGEFSRFKVSDMRQLYAHAIQALSAACNQHIRQANQIAKPSVEHDLIQLELGLVD 458
EFSRFKVS++R+LY++AIQALS+AC++ RQ + ++P ++ IQ EL LVD
Sbjct: 393 -------EFSRFKVSEVRRLYSYAIQALSSACSKRHRQVDTTSEP-LDSAAIQQELVLVD 452
Query: 459 IFMSLCRFEWQAGYQELATALFQAEIEFSLFCPALHLNDRSKQRLFEHFWNTDAERVGEE 518
+ +SLCRFEWQAGYQELATAL QAE+EFS+F P+L L ++SK RLFEHFW+++ RVGEE
Sbjct: 453 MLVSLCRFEWQAGYQELATALLQAEVEFSIFSPSLLLTEQSKLRLFEHFWSSNGARVGEE 512
Query: 519 GAVGWSTWLEKEEENRQKAMREEALEADEKGGWTGWSDPPPKEKKNSDGTETTAEMGVAA 578
GA GW WLEKEEENRQK ++EE+ + +E GGWTGW++ + + T E+ V
Sbjct: 513 GAFGWLLWLEKEEENRQKILKEESSDDNEVGGWTGWTEQVSGRNGDDLASANTGEVDV-D 572
Query: 579 EETMEEYVEEEDIEREDSTEALLKILGINADAGVDEEVKDVSTWARWSKEESSRDCEQWM 638
+ ++E +E+E+ + ED TEA+LK+LGI+ + +EVKD STW W +EE SRD QWM
Sbjct: 573 RKGLDEEMEDENSKPEDDTEAMLKLLGIDVNTAASDEVKDTSTWVEWFEEEVSRDHSQWM 632
Query: 639 PIRGKTADVIHDEGMPDGETNEQFLRVILYEDVKEFLFSLISSEARLSLIYQLIEFFSGK 698
P R K + EGM +GE EQ V+LYED+ +LFSL S EARLSL+YQ I+FF
Sbjct: 633 PTR-KAGEFSSVEGMGEGEDEEQLSSVVLYEDINGYLFSLRSKEARLSLVYQFIDFFGAH 692
Query: 699 IYSRSSSNSSSWMERILSLEVLPDDILRHLRSVHDVLNKRQSSSSSSTLEVLVGGSENLS 758
I SW E+I SLE L D +L +LRSVH+ L+K S++ S L L+GGS +LS
Sbjct: 693 ISPMDFQQQLSWSEKISSLETLSDSMLENLRSVHECLSKSDSANCFS-LGSLLGGSCDLS 751
Query: 759 QMSDMMKFLRNAILLCLTAFPRNYILEEAALIAEELFVTKMNSC 791
++MMKFLRNAILLCL FP+NYI EEA L+ EELFVT M +C
Sbjct: 753 MRTEMMKFLRNAILLCLNVFPQNYIPEEAVLVTEELFVTNMKTC 751
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038894150.1 | 0.0e+00 | 95.19 | nuclear exosome regulator NRDE2 isoform X2 [Benincasa hispida]
XP_038894149.1 | 0.0e+00 | 95.11 | nuclear exosome regulator NRDE2 isoform X1 [Benincasa hispida]
XP_038894151.1 | 0.0e+00 | 95.03 | nuclear exosome regulator NRDE2 isoform X3 [Benincasa hispida]
XP_011650955.1 | 0.0e+00 | 93.48 | nuclear exosome regulator NRDE2 isoform X2 [Cucumis sativus] >KGN64201.1 hypothe...
XP_008467185.1 | 0.0e+00 | 93.56 | PREDICTED: protein NRDE2 homolog isoform X1 [Cucumis melo] >TYJ99059.1 protein N...
Match Name | E-value | Identity | Description
Q80XC6 | 7.8e-65 | 24.67 | Nuclear exosome regulator NRDE2 OS=Mus musculus OX=10090 GN=Nrde2 PE=1 SV=3
Q9H7Z3 | 1.6e-62 | 24.74 | Nuclear exosome regulator NRDE2 OS=Homo sapiens OX=9606 GN=NRDE2 PE=1 SV=3
Q54QP0 | 1.3e-32 | 20.03 | Nuclear exosome regulator NRDE2 OS=Dictyostelium discoideum OX=44689 GN=nrde2 PE...
O42975 | 5.9e-12 | 22.04 | Protein NRDE2 homolog OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=...
Match Name | E-value | Identity | Description
A0A0A0LVY4 | 0.0e+00 | 93.48 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G043050 PE=3 SV=1
A0A5D3BJ75 | 0.0e+00 | 93.56 | Protein NRDE2-like protein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=...
A0A1S3CSX9 | 0.0e+00 | 93.56 | protein NRDE2 homolog isoform X1 OS=Cucumis melo OX=3656 GN=LOC103504593 PE=3 SV...
A0A1S3CT61 | 0.0e+00 | 93.48 | protein NRDE2 homolog isoform X2 OS=Cucumis melo OX=3656 GN=LOC103504593 PE=3 SV...
A0A6J1E7N7 | 0.0e+00 | 91.77 | protein NRDE2 homolog isoform X5 OS=Cucurbita moschata OX=3662 GN=LOC111431485 P...
Match Name | E-value | Identity | Description
AT3G17740.1 | 0.0e+00 | 56.26 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G17712.1 | 3.6e-206 | 53.40 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G17712.2 | 1.5e-201 | 50.37 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G17712.3 | 6.7e-189 | 50.79 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
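The Identity column in the summary tables matches the percentage printed in each HSP statistics line, which is the number of identical positions divided by the alignment length. The sketch below is a minimal illustration of that arithmetic, not part of the database's own tooling: the function names `parse_hsp_stats` and `recomputed_pct` are hypothetical, and the regular expression assumes the statistics line has the layout shown in the alignments above.

```python
import re

# Matches an HSP statistics line such as:
#   Identity = 408/764 (53.40%), Postives = 527/764 (68.98%), Query Frame = 0
# The label is accepted as either "Positives" or "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, ident_pct, pos, _pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "align_length": int(length),
        "identity_pct": float(ident_pct),
        "positives": int(pos),
        "positive_pct": float(pos_pct),
    }


def recomputed_pct(count, length):
    """Recompute a percentage from raw counts, rounded to two decimals."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    # Statistics line of the AT3G17712.1 hit, copied from the report above.
    stats = parse_hsp_stats(
        "Identity = 408/764 (53.40%), Postives = 527/764 (68.98%), Query Frame = 0"
    )
    print(stats["identity_pct"],
          recomputed_pct(stats["identities"], stats["align_length"]))
    print(stats["positive_pct"],
          recomputed_pct(stats["positives"], stats["align_length"]))
```

Running the same check on the other two TAIR hits (413/820 and 388/764 identities) is a quick way to confirm that a table row and its alignment refer to the same HSP.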