Homology
BLAST of MS008891 vs. NCBI nr
Match:
XP_022150078.1 (DNA repair protein UVH3 isoform X1 [Momordica charantia])
HSP 1 Score: 2922.9 bits (7576), Expect = 0.0e+00
Identity = 1546/1602 (96.50%), Postives = 1551/1602 (96.82%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESIL----- 240
RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESIL
Sbjct: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILPLMNE 240
Query: 241 -------------------KQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD 300
KQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD
Sbjct: 241 VDPDVFSTLPSSIQYELLQKQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD 300
Query: 301 EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ 360
EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ
Sbjct: 301 EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ 360
Query: 361 LDLLVQ------------------DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG 420
LDLLVQ DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG
Sbjct: 361 LDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG 420
Query: 421 GVQTSRIASEANREFIFSSSFTGDKQVLASRTEKSGDEDLQAPRGQQPLSSLNNTEVPST 480
GVQTSRIASEANREFIFSSSFTGDKQVLASR EKSGDEDLQAPRGQQPLSSLNNTEVPST
Sbjct: 421 GVQTSRIASEANREFIFSSSFTGDKQVLASRIEKSGDEDLQAPRGQQPLSSLNNTEVPST 480
Query: 481 SNALAQSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS 540
SNALA+STPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS
Sbjct: 481 SNALARSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS 540
Query: 541 ANEVVNHEPVQSSEICNPRSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL 600
ANEVVNHEPVQ+SEICNP+SHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL
Sbjct: 541 ANEVVNHEPVQNSEICNPKSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL 600
Query: 601 EDEGDKSFDGDDDLFTHLAAENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNV 660
EDEGDKSFDGDDDLFTHLAAENPIQVASFD SSQKLSFDGTTDSGWEEAVEGKTYSPKNV
Sbjct: 601 EDEGDKSFDGDDDLFTHLAAENPIQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPKNV 660
Query: 661 EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS 720
EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS
Sbjct: 661 EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS 720
Query: 721 LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR 780
LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR
Sbjct: 721 LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR 780
Query: 781 KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA 840
KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA
Sbjct: 781 KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA 840
Query: 841 VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA 900
VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA
Sbjct: 841 VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA 900
Query: 901 QKPAVEVISNAEIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELL 960
QKPAVEVISNAEIEFTEDELTNRI ILEQERLNLGDEQKRLERNAESV SEMFAECQELL
Sbjct: 901 QKPAVEVISNAEIEFTEDELTNRIXILEQERLNLGDEQKRLERNAESVXSEMFAECQELL 960
Query: 961 QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYF 1020
QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDV LFGARSVYKNIFDDRKYVETYF
Sbjct: 961 QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYF 1020
Query: 1021 MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES 1080
MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES
Sbjct: 1021 MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES 1080
Query: 1081 PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF 1140
PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF
Sbjct: 1081 PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF 1140
Query: 1141 MKKHRNVSKNWHIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDN 1200
MKKHRNVSKNWHIPSEFPSE VISAYTCPQVDKSAESFSWGKPD FVLRRLCWEKFGWDN
Sbjct: 1141 MKKHRNVSKNWHIPSEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLRRLCWEKFGWDN 1200
Query: 1201 SKADELLLPVLKEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDA 1260
SKADELLLPVLKEYSKHETQLRLE FYTF+ERFAKIRSKRIKKAVRGITGSKSAVLMDDA
Sbjct: 1201 SKADELLLPVLKEYSKHETQLRLETFYTFDERFAKIRSKRIKKAVRGITGSKSAVLMDDA 1260
Query: 1261 VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG 1320
VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG
Sbjct: 1261 VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG 1320
Query: 1321 QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESE 1380
QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDE+E
Sbjct: 1321 QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDENE 1380
Query: 1381 FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS 1440
FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS
Sbjct: 1381 FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS 1440
Query: 1441 EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD 1500
EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD
Sbjct: 1441 EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD 1500
Query: 1501 NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSPVQNKEGTDARVDSLLDTGNP 1560
NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTS VQNKEGTDARVDSLLDTGNP
Sbjct: 1501 NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSSVQNKEGTDARVDSLLDTGNP 1560
BLAST of MS008891 vs. NCBI nr
Match:
XP_022150080.1 (DNA repair protein UVH3 isoform X2 [Momordica charantia])
HSP 1 Score: 2921.3 bits (7572), Expect = 0.0e+00
Identity = 1545/1600 (96.56%), Postives = 1550/1600 (96.88%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILK---- 240
RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESIL
Sbjct: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILPLMNE 240
Query: 241 ------------------QKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEM 300
QKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEM
Sbjct: 241 VDPDVFSTLPSSIQYELLQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEM 300
Query: 301 LAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLD 360
LAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLD
Sbjct: 301 LAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLD 360
Query: 361 LLVQ------------------DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGV 420
LLVQ DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGV
Sbjct: 361 LLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGV 420
Query: 421 QTSRIASEANREFIFSSSFTGDKQVLASRTEKSGDEDLQAPRGQQPLSSLNNTEVPSTSN 480
QTSRIASEANREFIFSSSFTGDKQVLASR EKSGDEDLQAPRGQQPLSSLNNTEVPSTSN
Sbjct: 421 QTSRIASEANREFIFSSSFTGDKQVLASRIEKSGDEDLQAPRGQQPLSSLNNTEVPSTSN 480
Query: 481 ALAQSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASAN 540
ALA+STPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASAN
Sbjct: 481 ALARSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASAN 540
Query: 541 EVVNHEPVQSSEICNPRSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILLED 600
EVVNHEPVQ+SEICNP+SHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILLED
Sbjct: 541 EVVNHEPVQNSEICNPKSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILLED 600
Query: 601 EGDKSFDGDDDLFTHLAAENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEV 660
EGDKSFDGDDDLFTHLAAENPIQVASFD SSQKLSFDGTTDSGWEEAVEGKTYSPKNVEV
Sbjct: 601 EGDKSFDGDDDLFTHLAAENPIQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEV 660
Query: 661 DDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLK 720
DDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLK
Sbjct: 661 DDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLK 720
Query: 721 DVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKE 780
DVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKE
Sbjct: 721 DVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKE 780
Query: 781 TTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGAVL 840
TTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGAVL
Sbjct: 781 TTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGAVL 840
Query: 841 VGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQK 900
VGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQK
Sbjct: 841 VGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQK 900
Query: 901 PAVEVISNAEIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQM 960
PAVEVISNAEIEFTEDELTNRI ILEQERLNLGDEQKRLERNAESV SEMFAECQELLQM
Sbjct: 901 PAVEVISNAEIEFTEDELTNRIXILEQERLNLGDEQKRLERNAESVXSEMFAECQELLQM 960
Query: 961 FGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFMK 1020
FGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDV LFGARSVYKNIFDDRKYVETYFMK
Sbjct: 961 FGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMK 1020
Query: 1021 DVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPD 1080
DVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPD
Sbjct: 1021 DVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPD 1080
Query: 1081 PSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMK 1140
PSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMK
Sbjct: 1081 PSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMK 1140
Query: 1141 KHRNVSKNWHIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSK 1200
KHRNVSKNWHIPSEFPSE VISAYTCPQVDKSAESFSWGKPD FVLRRLCWEKFGWDNSK
Sbjct: 1141 KHRNVSKNWHIPSEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLRRLCWEKFGWDNSK 1200
Query: 1201 ADELLLPVLKEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDAVR 1260
ADELLLPVLKEYSKHETQLRLE FYTF+ERFAKIRSKRIKKAVRGITGSKSAVLMDDAVR
Sbjct: 1201 ADELLLPVLKEYSKHETQLRLETFYTFDERFAKIRSKRIKKAVRGITGSKSAVLMDDAVR 1260
Query: 1261 AVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQK 1320
AVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQK
Sbjct: 1261 AVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQK 1320
Query: 1321 LTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESEFD 1380
LTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDE+EFD
Sbjct: 1321 LTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDENEFD 1380
Query: 1381 DQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQSEK 1440
DQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQSEK
Sbjct: 1381 DQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQSEK 1440
Query: 1441 TERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDDNE 1500
TERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDDNE
Sbjct: 1441 TERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDDNE 1500
Query: 1501 CIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSPVQNKEGTDARVDSLLDTGNPNR 1560
CIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTS VQNKEGTDARVDSLLDTGNPNR
Sbjct: 1501 CIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSSVQNKEGTDARVDSLLDTGNPNR 1560
BLAST of MS008891 vs. NCBI nr
Match:
XP_022150081.1 (DNA repair protein UVH3 isoform X3 [Momordica charantia])
HSP 1 Score: 2862.0 bits (7418), Expect = 0.0e+00
Identity = 1521/1602 (94.94%), Postives = 1526/1602 (95.26%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLWELLAPVGRRVSVETLAGKKLAI DASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAI-------------------------DASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESIL----- 240
RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESIL
Sbjct: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILPLMNE 240
Query: 241 -------------------KQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD 300
KQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD
Sbjct: 241 VDPDVFSTLPSSIQYELLQKQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD 300
Query: 301 EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ 360
EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ
Sbjct: 301 EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ 360
Query: 361 LDLLVQ------------------DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG 420
LDLLVQ DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG
Sbjct: 361 LDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG 420
Query: 421 GVQTSRIASEANREFIFSSSFTGDKQVLASRTEKSGDEDLQAPRGQQPLSSLNNTEVPST 480
GVQTSRIASEANREFIFSSSFTGDKQVLASR EKSGDEDLQAPRGQQPLSSLNNTEVPST
Sbjct: 421 GVQTSRIASEANREFIFSSSFTGDKQVLASRIEKSGDEDLQAPRGQQPLSSLNNTEVPST 480
Query: 481 SNALAQSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS 540
SNALA+STPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS
Sbjct: 481 SNALARSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS 540
Query: 541 ANEVVNHEPVQSSEICNPRSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL 600
ANEVVNHEPVQ+SEICNP+SHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL
Sbjct: 541 ANEVVNHEPVQNSEICNPKSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL 600
Query: 601 EDEGDKSFDGDDDLFTHLAAENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNV 660
EDEGDKSFDGDDDLFTHLAAENPIQVASFD SSQKLSFDGTTDSGWEEAVEGKTYSPKNV
Sbjct: 601 EDEGDKSFDGDDDLFTHLAAENPIQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPKNV 660
Query: 661 EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS 720
EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS
Sbjct: 661 EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS 720
Query: 721 LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR 780
LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR
Sbjct: 721 LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR 780
Query: 781 KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA 840
KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA
Sbjct: 781 KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA 840
Query: 841 VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA 900
VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA
Sbjct: 841 VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA 900
Query: 901 QKPAVEVISNAEIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELL 960
QKPAVEVISNAEIEFTEDELTNRI ILEQERLNLGDEQKRLERNAESV SEMFAECQELL
Sbjct: 901 QKPAVEVISNAEIEFTEDELTNRIXILEQERLNLGDEQKRLERNAESVXSEMFAECQELL 960
Query: 961 QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYF 1020
QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDV LFGARSVYKNIFDDRKYVETYF
Sbjct: 961 QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYF 1020
Query: 1021 MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES 1080
MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES
Sbjct: 1021 MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES 1080
Query: 1081 PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF 1140
PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF
Sbjct: 1081 PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF 1140
Query: 1141 MKKHRNVSKNWHIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDN 1200
MKKHRNVSKNWHIPSEFPSE VISAYTCPQVDKSAESFSWGKPD FVLRRLCWEKFGWDN
Sbjct: 1141 MKKHRNVSKNWHIPSEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLRRLCWEKFGWDN 1200
Query: 1201 SKADELLLPVLKEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDA 1260
SKADELLLPVLKEYSKHETQLRLE FYTF+ERFAKIRSKRIKKAVRGITGSKSAVLMDDA
Sbjct: 1201 SKADELLLPVLKEYSKHETQLRLETFYTFDERFAKIRSKRIKKAVRGITGSKSAVLMDDA 1260
Query: 1261 VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG 1320
VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG
Sbjct: 1261 VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG 1320
Query: 1321 QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESE 1380
QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDE+E
Sbjct: 1321 QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDENE 1380
Query: 1381 FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS 1440
FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS
Sbjct: 1381 FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS 1440
Query: 1441 EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD 1500
EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD
Sbjct: 1441 EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD 1500
Query: 1501 NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSPVQNKEGTDARVDSLLDTGNP 1560
NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTS VQNKEGTDARVDSLLDTGNP
Sbjct: 1501 NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSSVQNKEGTDARVDSLLDTGNP 1560
BLAST of MS008891 vs. NCBI nr
Match:
XP_038903931.1 (DNA repair protein UVH3 isoform X1 [Benincasa hispida])
HSP 1 Score: 2123.2 bits (5500), Expect = 0.0e+00
Identity = 1204/1611 (74.74%), Postives = 1318/1611 (81.81%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSL------LSNHTSAIFKMFCLFLSFE 60
MGV GLWELLAPVGRRVSVETLAGKKLAIG SL L N S F ++ + LS E
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIGTNLSLRTSLNYLKNFMSMRFSLW-VCLSIE 60
Query: 61 DASIWMVQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLI 120
DASIWMVQFIKAMRD+RGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLI
Sbjct: 61 DASIWMVQFIKAMRDDRGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLI 120
Query: 121 ARRRQRENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRT 180
ARRRQRENAQAKVRKTAEKLLLNH+K MRLKELAED+QNQKQQR+Q + KK LP+ +
Sbjct: 121 ARRRQRENAQAKVRKTAEKLLLNHIKVMRLKELAEDIQNQKQQRKQKLSKKSTLPSRDKN 180
Query: 181 ADGTSGRNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESI 240
+GTS + G E LD MLAASIMAEENG F SS+SSFSGA LAK+ GE SI
Sbjct: 181 FNGTSTSESCEGIPNRGSLENLDEMLAASIMAEENGLFLSSASSFSGATLAKEGGGEGSI 240
Query: 241 LKQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEMLAASIAAEEAGSLNEN 300
L QKY N+SKGK+ILSDE + VGSD+ERMEVASR HQQNLDEMLAASIAAEEA SLNEN
Sbjct: 241 LNQKYNNESKGKEILSDETYIVGSDSERMEVASRSVHQQNLDEMLAASIAAEEARSLNEN 300
Query: 301 ASVSAAANLD-EDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLDLLVQ----------- 360
ASVSA NLD EDTDDEDEEMILPEM GVVDPSVLAALPPSVQLDLLVQ
Sbjct: 301 ASVSAVTNLDGEDTDDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQ 360
Query: 361 -------DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIF 420
DPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGVGGVQTSRIASEANREFIF
Sbjct: 361 KYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSRIASEANREFIF 420
Query: 421 SSSFTGDKQVLASR-TEKSGDEDLQAPRGQQPLSSLNNTEVPSTSNALAQSTPDKTGVFE 480
SSSFTGDKQVLAS EK+GD+DLQAP QQPLSSL NTE+PSTSN LAQSTPDK+G FE
Sbjct: 421 SSSFTGDKQVLASTIVEKNGDKDLQAPTVQQPLSSLKNTEIPSTSNPLAQSTPDKSGGFE 480
Query: 481 ENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASANEVVNHEPVQSSEIC 540
+NIETFLDERGRVRVSRV+AMGM MTRDLERNLDLMKEIEKN SAN+ N EP+Q+ EIC
Sbjct: 481 DNIETFLDERGRVRVSRVKAMGMHMTRDLERNLDLMKEIEKNTSANKAANPEPIQNIEIC 540
Query: 541 NPRSHSSQSQDLDTPYEGVSESV-QLSLRSRGSMLDEDTAIEILLEDEGDKSFDGDDDLF 600
NP + S QSQ LDT EGV S+ +L+ R ML+E+TAIEILLEDEG KSFDGDDDLF
Sbjct: 541 NPENFSFQSQVLDTSDEGVGGSINKLNERGGEPMLNEETAIEILLEDEGGKSFDGDDDLF 600
Query: 601 THLAAENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEVDDHPFVEGRVSDE 660
T+LAAENPI +ASFD+S+QKLS DGTTDSGWE+AVEGKTYSPKNVEVDDH F EG VSDE
Sbjct: 601 TNLAAENPIGMASFDISTQKLSLDGTTDSGWEDAVEGKTYSPKNVEVDDHSFKEGTVSDE 660
Query: 661 SEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLKDVGDRKPGSVLSE 720
S+V+WE+G CDHVNPVPF A+ +SVSKG LEEEADLQEAIRRSL+D G K G+V SE
Sbjct: 661 SDVDWEDGACDHVNPVPF-EADLAQSVSKGFLEEEADLQEAIRRSLEDRGYTKSGTVSSE 720
Query: 721 HQKPES--AGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKETTESSSQEKQC 780
Q+P+ GK EQCTSVQNE++IGL +D ADGM+C NDST + TESSSQEKQC
Sbjct: 721 LQQPQPVIVGKRAEQCTSVQNESMIGLDKLDSADGMNCLNFNDSTRTEGMTESSSQEKQC 780
Query: 781 SECIVLLDTTTHTVTEKLDASYK--DVSHKDSNENDDTLKPLSRDASGAVLVGDRINNKL 840
SE ++ LDT THT+ E+LDASY S K+SNEN+DTL+PLSRD GAV VGDRINN +
Sbjct: 781 SEPVMSLDTKTHTIAEQLDASYNVAKFSPKESNENNDTLEPLSRDTFGAVQVGDRINNTV 840
Query: 841 TEPPCHMVGMEDSYTPEVDSSPKVVASENH--QNFPVDELSSDILLEENDAQKPAVEVIS 900
+PPC MV ME Y P SS K A EN+ QN PVDE S+D+ L+ DA+ +VE S
Sbjct: 841 IDPPCRMVEMEGIYPPGNGSSRKPFACENNFKQNLPVDEHSNDLSLDIKDAKILSVEETS 900
Query: 901 NAEIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYII 960
NAEIE T+DEL NR S+LEQERLNLGDEQKRLERNAESV+SEMFAECQELLQMFGLPYII
Sbjct: 901 NAEIEVTDDELKNRFSVLEQERLNLGDEQKRLERNAESVNSEMFAECQELLQMFGLPYII 960
Query: 961 APMEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFMKDVENELG 1020
APMEAEAQCAYMELANLVDGVVTDDSDV LFGARSVYKNIFDDRKYVETYFMKDVENELG
Sbjct: 961 APMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELG 1020
Query: 1021 LNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTL 1080
LN+DK+IRMALLLGSDYTEGISGIGIVNA+EVMNAFPEEDGL KFKEWIESPDPSILGTL
Sbjct: 1021 LNQDKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTL 1080
Query: 1081 SAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMKKHRNVSK 1140
A+TGL++R+RGSKASE D TCSN+ GSAS E+IS+DL+ENI VKQSFM KHRNVSK
Sbjct: 1081 GAKTGLTARRRGSKASENDMTCSNTG---GSASEENISKDLEENIAVKQSFMDKHRNVSK 1140
Query: 1141 NWHIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLP 1200
NWHIPSEFPSE VISAY CPQVDKSAE FSWGKPDHFVLRRLCWEKFGW+NSKADELLLP
Sbjct: 1141 NWHIPSEFPSEAVISAYICPQVDKSAEPFSWGKPDHFVLRRLCWEKFGWENSKADELLLP 1200
Query: 1201 VLKEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDAVRAVSANKQ 1260
VLKEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAV+ ITGS+SAVLMDDAVR VS N Q
Sbjct: 1201 VLKEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSRSAVLMDDAVRDVSVNNQ 1260
Query: 1261 RELSVEPQEK-SEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQKLTMKEK 1320
RELSVEP+E SEKCSSE Q +CSN DD R KPSRKRQL GEQ+QP K +KLT KEK
Sbjct: 1261 RELSVEPKENMSEKCSSERQDACSNEDD---RHRKPSRKRQLDGEQAQPGKDRKLTKKEK 1320
Query: 1321 GNRNRNEGSH-KNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESEFDDQKCD 1380
G +RNEGSH + GRGRG KGRGRGRL KGK +P TEL+ETSSSDDESEFD+QK D
Sbjct: 1321 GKPSRNEGSHSERGRGRGRGKGRGRGRLVSKGK---APITELIETSSSDDESEFDNQKFD 1380
Query: 1381 FVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIH---------- 1440
N +EPQE+RRSSRIRKS SYT+ +ADQ SD+ GD FSND+A+E VI
Sbjct: 1381 LENFQEPQEKRRSSRIRKSASYTIDNADQQSDHTGDEFSNDKAEEDRVIQGQYAHPETVM 1440
Query: 1441 DQSEKTERDLGTPKRPPQEDYFETGGGFCPVEDEMS-----QDIDPSLEANNSEDYLRMG 1500
QSE E G+PKR PQ DY +TGGGFC VEDEMS Q+ DP+LEANNSEDYL MG
Sbjct: 1441 SQSENMESGSGSPKRSPQNDYLKTGGGFCLVEDEMSRQEMCQNKDPALEANNSEDYLTMG 1500
Query: 1501 GGFCLDDDNECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSPVQNKEGTDARVD 1560
GGFCLDDD+E IDP A+P +ATV E +D E+DP QST PEK E TDARV+
Sbjct: 1501 GGFCLDDDDERIDPVAHPNQATVLEVPKDGFENDPGQSTVSPEKHV----GVEDTDARVE 1560
BLAST of MS008891 vs. NCBI nr
Match:
XP_038903932.1 (DNA repair protein UVH3 isoform X2 [Benincasa hispida])
HSP 1 Score: 2108.2 bits (5461), Expect = 0.0e+00
Identity = 1194/1605 (74.39%), Postives = 1305/1605 (81.31%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGV GLWELLAPVGRRVSVETLAGKKLAI DASIWM
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAI-------------------------DASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRD+RGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDDRGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQAKVRKTAEKLLLNH+K MRLKELAED+QNQKQQR+Q + KK LP+ + +GTS
Sbjct: 121 ENAQAKVRKTAEKLLLNHIKVMRLKELAEDIQNQKQQRKQKLSKKSTLPSRDKNFNGTST 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILKQKYK 240
+ G E LD MLAASIMAEENG F SS+SSFSGA LAK+ GE SIL QKY
Sbjct: 181 SESCEGIPNRGSLENLDEMLAASIMAEENGLFLSSASSFSGATLAKEGGGEGSILNQKYN 240
Query: 241 NDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEMLAASIAAEEAGSLNENASVSAA 300
N+SKGK+ILSDE + VGSD+ERMEVASR HQQNLDEMLAASIAAEEA SLNENASVSA
Sbjct: 241 NESKGKEILSDETYIVGSDSERMEVASRSVHQQNLDEMLAASIAAEEARSLNENASVSAV 300
Query: 301 ANLD-EDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLDLLVQ----------------- 360
NLD EDTDDEDEEMILPEM GVVDPSVLAALPPSVQLDLLVQ
Sbjct: 301 TNLDGEDTDDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVK 360
Query: 361 -DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTG 420
DPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGVGGVQTSRIASEANREFIFSSSFTG
Sbjct: 361 KDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSRIASEANREFIFSSSFTG 420
Query: 421 DKQVLASR-TEKSGDEDLQAPRGQQPLSSLNNTEVPSTSNALAQSTPDKTGVFEENIETF 480
DKQVLAS EK+GD+DLQAP QQPLSSL NTE+PSTSN LAQSTPDK+G FE+NIETF
Sbjct: 421 DKQVLASTIVEKNGDKDLQAPTVQQPLSSLKNTEIPSTSNPLAQSTPDKSGGFEDNIETF 480
Query: 481 LDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASANEVVNHEPVQSSEICNPRSHS 540
LDERGRVRVSRV+AMGM MTRDLERNLDLMKEIEKN SAN+ N EP+Q+ EICNP + S
Sbjct: 481 LDERGRVRVSRVKAMGMHMTRDLERNLDLMKEIEKNTSANKAANPEPIQNIEICNPENFS 540
Query: 541 SQSQDLDTPYEGVSESV-QLSLRSRGSMLDEDTAIEILLEDEGDKSFDGDDDLFTHLAAE 600
QSQ LDT EGV S+ +L+ R ML+E+TAIEILLEDEG KSFDGDDDLFT+LAAE
Sbjct: 541 FQSQVLDTSDEGVGGSINKLNERGGEPMLNEETAIEILLEDEGGKSFDGDDDLFTNLAAE 600
Query: 601 NPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEVDDHPFVEGRVSDESEVEWE 660
NPI +ASFD+S+QKLS DGTTDSGWE+AVEGKTYSPKNVEVDDH F EG VSDES+V+WE
Sbjct: 601 NPIGMASFDISTQKLSLDGTTDSGWEDAVEGKTYSPKNVEVDDHSFKEGTVSDESDVDWE 660
Query: 661 EGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLKDVGDRKPGSVLSEHQKPES 720
+G CDHVNPVPF A+ +SVSKG LEEEADLQEAIRRSL+D G K G+V SE Q+P+
Sbjct: 661 DGACDHVNPVPF-EADLAQSVSKGFLEEEADLQEAIRRSLEDRGYTKSGTVSSELQQPQP 720
Query: 721 --AGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKETTESSSQEKQCSECIVL 780
GK EQCTSVQNE++IGL +D ADGM+C NDST + TESSSQEKQCSE ++
Sbjct: 721 VIVGKRAEQCTSVQNESMIGLDKLDSADGMNCLNFNDSTRTEGMTESSSQEKQCSEPVMS 780
Query: 781 LDTTTHTVTEKLDASYK--DVSHKDSNENDDTLKPLSRDASGAVLVGDRINNKLTEPPCH 840
LDT THT+ E+LDASY S K+SNEN+DTL+PLSRD GAV VGDRINN + +PPC
Sbjct: 781 LDTKTHTIAEQLDASYNVAKFSPKESNENNDTLEPLSRDTFGAVQVGDRINNTVIDPPCR 840
Query: 841 MVGMEDSYTPEVDSSPKVVASENH--QNFPVDELSSDILLEENDAQKPAVEVISNAEIEF 900
MV ME Y P SS K A EN+ QN PVDE S+D+ L+ DA+ +VE SNAEIE
Sbjct: 841 MVEMEGIYPPGNGSSRKPFACENNFKQNLPVDEHSNDLSLDIKDAKILSVEETSNAEIEV 900
Query: 901 TEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAE 960
T+DEL NR S+LEQERLNLGDEQKRLERNAESV+SEMFAECQELLQMFGLPYIIAPMEAE
Sbjct: 901 TDDELKNRFSVLEQERLNLGDEQKRLERNAESVNSEMFAECQELLQMFGLPYIIAPMEAE 960
Query: 961 AQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKI 1020
AQCAYMELANLVDGVVTDDSDV LFGARSVYKNIFDDRKYVETYFMKDVENELGLN+DK+
Sbjct: 961 AQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNQDKL 1020
Query: 1021 IRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTLSAQTGL 1080
IRMALLLGSDYTEGISGIGIVNA+EVMNAFPEEDGL KFKEWIESPDPSILGTL A+TGL
Sbjct: 1021 IRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGL 1080
Query: 1081 SSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMKKHRNVSKNWHIPS 1140
++R+RGSKASE D TCSN+ GSAS E+IS+DL+ENI VKQSFM KHRNVSKNWHIPS
Sbjct: 1081 TARRRGSKASENDMTCSNTG---GSASEENISKDLEENIAVKQSFMDKHRNVSKNWHIPS 1140
Query: 1141 EFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYS 1200
EFPSE VISAY CPQVDKSAE FSWGKPDHFVLRRLCWEKFGW+NSKADELLLPVLKEYS
Sbjct: 1141 EFPSEAVISAYICPQVDKSAEPFSWGKPDHFVLRRLCWEKFGWENSKADELLLPVLKEYS 1200
Query: 1201 KHETQLRLEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDAVRAVSANKQRELSVE 1260
KHETQLRLEAFYTFNERFAKIRSKRIKKAV+ ITGS+SAVLMDDAVR VS N QRELSVE
Sbjct: 1201 KHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSRSAVLMDDAVRDVSVNNQRELSVE 1260
Query: 1261 PQEK-SEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQKLTMKEKGNRNRN 1320
P+E SEKCSSE Q +CSN DD R KPSRKRQL GEQ+QP K +KLT KEKG +RN
Sbjct: 1261 PKENMSEKCSSERQDACSNEDD---RHRKPSRKRQLDGEQAQPGKDRKLTKKEKGKPSRN 1320
Query: 1321 EGSH-KNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESEFDDQKCDFVNLEE 1380
EGSH + GRGRG KGRGRGRL KGK +P TEL+ETSSSDDESEFD+QK D N +E
Sbjct: 1321 EGSHSERGRGRGRGKGRGRGRLVSKGK---APITELIETSSSDDESEFDNQKFDLENFQE 1380
Query: 1381 PQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIH----------DQSEKT 1440
PQE+RRSSRIRKS SYT+ +ADQ SD+ GD FSND+A+E VI QSE
Sbjct: 1381 PQEKRRSSRIRKSASYTIDNADQQSDHTGDEFSNDKAEEDRVIQGQYAHPETVMSQSENM 1440
Query: 1441 ERDLGTPKRPPQEDYFETGGGFCPVEDEMS-----QDIDPSLEANNSEDYLRMGGGFCLD 1500
E G+PKR PQ DY +TGGGFC VEDEMS Q+ DP+LEANNSEDYL MGGGFCLD
Sbjct: 1441 ESGSGSPKRSPQNDYLKTGGGFCLVEDEMSRQEMCQNKDPALEANNSEDYLTMGGGFCLD 1500
Query: 1501 DDNECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSPVQNKEGTDARVDSLLDTG 1560
DD+E IDP A+P +ATV E +D E+DP QST PEK E TDARV+S+LD G
Sbjct: 1501 DDDERIDPVAHPNQATVLEVPKDGFENDPGQSTVSPEKHV----GVEDTDARVESVLDVG 1560
BLAST of MS008891 vs. ExPASy Swiss-Prot
Match:
Q9ATY5 (DNA repair protein UVH3 OS=Arabidopsis thaliana OX=3702 GN=UVH3 PE=2 SV=1)
HSP 1 Score: 977.2 bits (2525), Expect = 2.1e-283
Identity = 700/1643 (42.60%), Postives = 930/1643 (56.60%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLWELLAPVGRRVSVETLA K+LAI DASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLANKRLAI-------------------------DASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRDE+G+MV+NAHL+GFFRRICKLLFLRTKP+FVFDGATPALKRRT+IARRRQR
Sbjct: 61 VQFIKAMRDEKGDMVQNAHLIGFFRRICKLLFLRTKPIFVFDGATPALKRRTVIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQ K+RKTAEKLLLN LK +RLKE A+D++NQ+ ++ KK + + +
Sbjct: 121 ENAQTKIRKTAEKLLLNRLKDIRLKEQAKDIKNQRLKQDDSDRVKKRVSSDSVEDNLRVP 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILKQKYK 240
+ S EKLD + AS++ E +G + ++K+ K
Sbjct: 181 VEEDDVGASFFQEEKLDEVSQASLVGE---------------------TGVDDVVKESVK 240
Query: 241 NDSKGKKILSDEIHAVGSDTERM--EVASRGA-HQQNLDEMLAASIAAEEAGSLNENASV 300
+D KGK +L D G D + + + + +G +Q+ LDEMLAAS+AAEE + AS
Sbjct: 241 DDPKGKGVLLD-----GDDLDNLVQDSSVQGKDYQEKLDEMLAASLAAEEERNFTSKAST 300
Query: 301 SAAANLDEDTDDE----DEEMILPEMDGVVDPSVLAALPPSVQLDLLVQ----------- 360
SAAA E+ ++E DEE++LP MDG +DP+VLA+LPPS+QLDLL Q
Sbjct: 301 SAAAIPSEEDEEEDSDGDEEILLPVMDGNIDPAVLASLPPSMQLDLLAQMREKLMAENRQ 360
Query: 361 -------DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIF 420
P KFSELQI+AYLKTVAFRR+I++VQ++A GR VGGVQTSRIASEANREFIF
Sbjct: 361 KYQKVKKAPEKFSELQIEAYLKTVAFRREINEVQRSAGGRAVGGVQTSRIASEANREFIF 420
Query: 421 SSSFTGDKQVLASRTEKSGDEDLQAPRGQQPLSSLNNTEVPSTSNALAQSTPDKTGVFEE 480
SSSF GDK+VLAS E DE+ + Q S+ N S+A + D+ +E
Sbjct: 421 SSSFAGDKEVLASAREGRNDENQKKTSQQSLPVSVKNASPLKKSDATIELDRDEPKNPDE 480
Query: 481 NIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASANEVVNHEPVQSSEICN 540
NIE ++DERGR R+ R R MG++MTRD++RNL LMKE E+ AS + N E + E
Sbjct: 481 NIEVYIDERGRFRI-RNRHMGIQMTRDIQRNLHLMKEKERTASGSMAKNDETFSAWE--- 540
Query: 541 PRSHSSQSQDLD-TPYEGVSESVQLSLRSRGSMLDEDTAIEILLE-DEGDKSFDGDDDLF 600
+ ++ Q L+ +P E + V L +++ SML ++IEI + D G K + +DD+F
Sbjct: 541 --NFPTEDQFLEKSPVE--KDVVDLEIQNDDSMLHPPSSIEISFDHDGGGKDLNDEDDMF 600
Query: 601 THLAAENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEVD-DHPFVEGRVSD 660
LAA P+ ++S + ++ + +DS WEE + S +E + + + +S
Sbjct: 601 LQLAAGGPVTISSTENDPKEDTSPWASDSDWEEVPVEQNTSVSKLEANLSNQHIPKDISI 660
Query: 661 ESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLKDVGDRKPGSVLS 720
V WEE C + N ++ ++KG LEEEADLQEAI++SL ++ D++ G VL
Sbjct: 661 AEGVAWEEYSCKNANNSV--ENDTVTKITKGYLEEEADLQEAIKKSLLELHDKESGDVLE 720
Query: 721 EHQ---------KP--------ESAG-----KMLEQCTSVQNENVIGLK-------NVDG 780
E+Q KP E+ G + L++ T ++ I + N DG
Sbjct: 721 ENQSVRVNLVVDKPSEDSLCSRETVGEAEEERFLDEITILKTSGAISEQSNTSVAGNADG 780
Query: 781 ADGMSCS-KANDSTGRKETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSN 840
G++ + S+G + + S + + ++ + + ++ S H +
Sbjct: 781 QKGITKQFGTHPSSGSNNVSHAVSNKLSKVKSVISPEKALNVASQNRMLSTMAKQHNE-- 840
Query: 841 ENDDTLKPLSRDASGAVLVGDRINNKLTEPPC----HMVGMEDS-----------YTPEV 900
E ++ S S + + I L E + M+D T
Sbjct: 841 EGSESFGGESVKVSAMPIADEEITGFLDEKDNADGESSIMMDDKRDYSRRKIQSLVTESR 900
Query: 901 DSSPKVVASENHQNFPVDELSSDILLEENDAQKPAVEVISNAE-------IEFTEDELTN 960
D S VV S D S + EEN++ + + S+ + +EF+E +
Sbjct: 901 DPSRNVVRSRIGILHDTD--SQNERREENNSNEHTFNIDSSTDFEEKGVPVEFSEANIEE 960
Query: 961 RISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYME 1020
I +L+QE ++LGDEQ++LERNAESVSSEMFAECQELLQ+FG+PYIIAPMEAEAQCA+ME
Sbjct: 961 EIRVLDQEFVSLGDEQRKLERNAESVSSEMFAECQELLQIFGIPYIIAPMEAEAQCAFME 1020
Query: 1021 LANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKIIRMALLL 1080
+NLVDG+VTDDSDV LFGARSVYKNIFDDRKYVETYFMKD+E ELGL+RDKIIRMA+LL
Sbjct: 1021 QSNLVDGIVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDIEKELGLSRDKIIRMAMLL 1080
Query: 1081 GSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTLSAQTGLSSRKRGS 1140
GSDYTEGISGIGIVNAIEV+ AFPEEDGLQKF+EW+ESPDP+ILG A+TG +KRGS
Sbjct: 1081 GSDYTEGISGIGIVNAIEVVTAFPEEDGLQKFREWVESPDPTILGKTDAKTGSKVKKRGS 1140
Query: 1141 KASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMKKHRNVSKNWHIPSEFPSETV 1200
+ + S +S D ++KQ FM +HR VSKNWHIP FPSE V
Sbjct: 1141 ASVDNKGIISGASTDDTE--------------EIKQIFMDQHRKVSKNWHIPLTFPSEAV 1200
Query: 1201 ISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYSKHETQLR 1260
ISAY PQVD S E FSWGKPD VLR+LCWEKF W+ K DELLLPVLKEY K ETQLR
Sbjct: 1201 ISAYLNPQVDLSTEKFSWGKPDLSVLRKLCWEKFNWNGKKTDELLLPVLKEYEKRETQLR 1260
Query: 1261 LEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDAVRAVSANKQRELSVEPQEKSEK 1320
+EAFY+FNERFAKIRSKRI KAV+GI G S+ + D ++ K+ + V P E +
Sbjct: 1261 IEAFYSFNERFAKIRSKRINKAVKGIGGGLSSDVADHTLQE-GPRKRNKKKVAPHETEDN 1320
Query: 1321 CSSEIQGSCSN--VDDVEKRLGKPSRKRQLHGEQSQPAKGQKLTMKEKGNRNRNEGSHKN 1380
+S+ +N V + KRL KPS R
Sbjct: 1321 NTSDKDSPIANEKVKNKRKRLEKPSSSR-------------------------------- 1380
Query: 1381 GRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESEFDDQKCDFVNLE-EPQERRR 1440
GRGR +++GRGRGR+Q +L+E S + + DD K V LE +P ++
Sbjct: 1381 GRGRAQKRGRGRGRVQ----------KDLLELSDGSSDDDDDDDK--VVELEAKPANLQK 1440
Query: 1441 SSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQSEKTERDLGTPKRPPQEDYF 1500
S+R R V Y+ + D+ + + S E E +V + + + P EDY
Sbjct: 1441 STRSRNPVMYSAKEDDELDESRSNEGSPSENFE-EVDEGRIGNDDSVDASINDCPSEDYI 1477
Query: 1501 ETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDDNECIDPDAYPGRATVSEDL 1560
+TGGGFC DE + D LE ++DY +GGGFC+D+D T E+
Sbjct: 1501 QTGGGFC--ADEADEIGDAHLEDKATDDYRVIGGGFCVDEDE------------TAEENT 1477
BLAST of MS008891 vs. ExPASy Swiss-Prot
Match:
P14629 (DNA excision repair protein ERCC-5 homolog OS=Xenopus laevis OX=8355 GN=ercc5 PE=2 SV=1)
HSP 1 Score: 257.7 bits (657), Expect = 8.5e-67
Identity = 358/1400 (25.57%), Postives = 586/1400 (41.86%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLW+LL GR ++ TL GK LA+ D SIW+
Sbjct: 1 MGVQGLWKLLECSGRPINPGTLEGKILAV-------------------------DISIWL 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Q +K RD +G ++NAHLL F R+CKLLF R +P+FVFDG P LKR+TL RR++
Sbjct: 61 NQAVKGARDRQGNAIQNAHLLTLFHRLCKLLFFRIRPIFVFDGEAPLLKRQTLAKRRQRT 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
+ A RKT EKLL LK +K + +Q +++P +P K T D
Sbjct: 121 DKASNDARKTNEKLLRTFLKRQAIKAAL----SGNKQSNEELPSFSQVP-RKETED---- 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILKQKYK 240
L E+N +N+ E +++++
Sbjct: 181 -------------------LYILPPLEDN-----------------ENNSSEEEEEREWE 240
Query: 241 NDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEMLAASIAAEEAGSLNENASVSAA 300
K+ L ++ A S + I +EE SL
Sbjct: 241 ERMNQKQRLQEDFFANPSSVD---------------------IESEEFKSLPPEVKHEIL 300
Query: 301 ANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLDLLVQDPAKFSELQIQAYLKTVA 360
++ + T ++ A+P +D + FS+ Q++ LK
Sbjct: 301 TDMKDFTKRR--------------RTLFEAMP---------EDSSDFSQYQLKGLLKKND 360
Query: 361 FRRDIDQVQKAASGRGVGGVQ--------------TSRIASEANREFIFSSSFTGDKQVL 420
+ ID V+K + + G VQ T R+ SE + +I K +
Sbjct: 361 LNKCIDNVRKELNQQYSGEVQAQFESEGGFLKEVETRRLVSEDDSHYILI------KGIQ 420
Query: 421 ASRTEKSGDEDLQAPRGQQPLSSLNNTEVPSTSNALAQSTPDKTGVFEENIETFLDERGR 480
+ + EK D Q+ + N+++ P T L ++ KT + + R
Sbjct: 421 SKQEEKKVDSPPQS-------ITFNSSQTPKTYLDLKLASAHKTKPLQTSSAEAAPPSPR 480
Query: 481 VRVSRVRAMGMRMTRDLERNLDLMKEIEKNASANEVVNHEPVQSSEICNPRSHSSQSQDL 540
+ AM + D K + + S E + +PR+ + Q L
Sbjct: 481 TLFAIQEAMA--------ESWDHEKHEKPSVSGCEAEGN--------VSPRTLQAIYQVL 540
Query: 541 DTPYEGVSESVQLSLRSRGSMLDEDTAIEILLEDEGDKSFDGDDDLFTHLAAENPIQVAS 600
G S +++ L+S + + I DE D + D T L A ++
Sbjct: 541 AEDEAGESNKIKVVLQSDEERKPKTKVLVISSSDEEDDCLNYQDGTKTTLGASLIKSISP 600
Query: 601 FDMSSQKLSFDGTTD-------SGWEEAV---------------EGKTYSPKNVEVDDHP 660
M Q+ + D + S EE + + K P ++ D P
Sbjct: 601 SSMQCQESTADSLPNYTRSKPVSQIEEPMADHNLQGDNCNVPNAKDKLIVPPSLGNVDKP 660
Query: 661 FVEGRVSDESEVEWEEGVCDHVNPVPFGAAES----GKSVSKGSLEEEADLQEAIRRSLK 720
+ +S+ V E + + PV E+ + S GS + ++A ++
Sbjct: 661 II---LSNTIPVNSEFRI--PLLPVNMSMRETVIIPNNTGSLGSSRYISLERDATKQGFS 720
Query: 721 D--VGD--RKP-------GSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSC 780
D VGD R P S LS+ + + + + V G N +
Sbjct: 721 DNPVGDLVRSPDEPALNASSALSDRKTSATQSLLCNNIECTEQSMVQGCSNT-----LDV 780
Query: 781 SKANDSTGRKETTESSSQEKQCSECIVLLDTTTHTV---TEKLDASYKD-VSHKDSNEND 840
++ +G E + + Q + D++ V E + S ++ V+ K+ +++D
Sbjct: 781 TQTTQPSGGSEVNKPAEYNPQDKKVFGSNDSSAMYVPMTPESIIVSDEEFVNEKEDSDSD 840
Query: 841 DTLKPLSRDASGAVLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASEN--HQNFP 900
D+ + + S T H+V E T E ++ + V N Q+ P
Sbjct: 841 DSFIEVDSEFS-------------TSNSQHVVFKEPGDTRETATNFQAVEEGNSGSQDIP 900
Query: 901 VDELSSDILLEENDAQKPAVEVISNAEIEFTEDELTNRISILEQERLNLGDEQKRLERNA 960
++ S + + N + ++ +SN + + +EL + + L ++ +L +Q++ ER A
Sbjct: 901 LEHDSGEPHEQSNSEESKDLDDVSNEWQDISVEELESLENNLYVQQTSLQAQQQQQERIA 960
Query: 961 ESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVILFGARSV 1020
+V+ +M E QELLQ+FG+PYI+APMEAEAQCA ++L + G +TDDSD+ LFGAR V
Sbjct: 961 ATVTGQMCLESQELLQLFGIPYIVAPMEAEAQCAILDLTDQTSGTITDDSDIWLFGARHV 1020
Query: 1021 YKNIFDDRKYVETYFMKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAF 1080
YKN F K+VE Y D+ N+LGL+R K+I +A LLGSDYTEGI +G V+A+E++N F
Sbjct: 1021 YKNFFSQNKHVEYYQYADIHNQLGLDRSKLINLAYLLGSDYTEGIPTVGYVSAMEILNEF 1080
Query: 1081 PEE--DGLQKFKEWIESPDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASG 1140
P + + L KFKEW + + + + + DT
Sbjct: 1081 PGQGLEPLVKFKEWWSE---------------AQKDKKMRPNPNDT-------------- 1140
Query: 1141 EDISEDLKENIDVKQSFMKKHRNVSKNWHIPSEFPSETVISAYTCPQVDKSAESFSWGKP 1200
+ + L+ +D++QS FP+ V SAY P VD+S +FSWG+P
Sbjct: 1141 -KVKKKLR-LLDLQQS-----------------FPNPAVASAYLKPVVDESKSAFSWGRP 1167
Query: 1201 DHFVLRRLCWEKFGWDNSKADELLLPVLKEYSKHETQLRLEAFYTFNE-RFAKIRSKRIK 1260
D +R C +FGW K DE+LLPVLK+ + +TQLR+++F+ + A ++S+R++
Sbjct: 1201 DLEQIREFCESRFGWYRLKTDEVLLPVLKQLNAQQTQLRIDSFFRLEQHEAAGLKSQRLR 1167
Query: 1261 KAVRGITGSKSAVLMDDAVRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLG 1320
+AV + + V ++ V A A +RE + Q K +K +++ QG+ +
Sbjct: 1261 RAVTCMKRKERDVEAEE-VEAAVAVMERECT--NQRKGQKTNTKSQGT---------KRR 1167
Query: 1321 KPSRKRQLHGEQSQPAKG----QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPK 1337
KP+ Q E P G + T+ K + S G + + +
Sbjct: 1321 KPTECSQ---EDQDPGGGFIGIELKTLSSKAYSSDGSSSDAEDLPSGLIDKQSQSGI--V 1167
BLAST of MS008891 vs. ExPASy Swiss-Prot
Match:
P28706 (DNA repair protein rad13 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=rad13 PE=2 SV=2)
HSP 1 Score: 235.7 bits (600), Expect = 3.5e-60
Identity = 317/1234 (25.69%), Postives = 507/1234 (41.09%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGV GLW++L PV R V +ETL K+LAI DASIW+
Sbjct: 1 MGVSGLWDILEPVKRPVKLETLVNKRLAI-------------------------DASIWI 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
QF+KA+RD+ G ++++H++GFFRRICKLLF KPVFVFDG P+LKR+T+ R+ +R
Sbjct: 61 YQFLKAVRDKEGNQLKSSHVVGFFRRICKLLFFGIKPVFVFDGGAPSLKRQTIQKRQARR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
+ + TA KLL Q R Q + ++N N K TA
Sbjct: 121 LDREENATVTANKLL------------------ALQMRHQAMLLEEN--NKKATA----- 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILKQKYK 240
+ AS+ E +++ DNS + +L Q+ K
Sbjct: 181 ------------------LANASVQNERQ----------MPSSMTLDNSEIKPVLNQR-K 240
Query: 241 NDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEMLAASIAAEEAGSLNENASVSAA 300
N K E+ V D R Q L + +++ E+ +L + +++
Sbjct: 241 NYLKPDPYQLPEMD-VSFDKLGSSYDPRIMSQDELTQYVSSFTKIEDI-NLFDFSNIDFD 300
Query: 301 ANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLDLLVQDPAKFSELQIQAYLKTVA 360
+ L + D D+ IL S L S QL + + FS QI+
Sbjct: 301 SELFQSLPDTDKYSIL---SAARLRSRLRMGLSSEQLSEMFPNRMDFSRFQIE------R 360
Query: 361 FRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQVLASRTEKSGDEDLQA 420
+ D Q+ G+ SR+ SE NRE+I L G L
Sbjct: 361 LKERNDLTQRLMDFTGMNEFGPSRVVSEKNREYI-----------LVKNEGAEGGWALGV 420
Query: 421 PRGQQPLSSLNNTEVPSTSNALAQSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMT 480
G S NN + A S+ + E+ E F D SR +M R
Sbjct: 421 ISG-----STNNEPIIIDDEATKLSS---NLIDEDEDEAFYDV---PLPSRSHSMNPR-- 480
Query: 481 RDLERNLDLMKEIEKNASANEVVNHEPVQSSEICNPRSHSSQSQDLDTPYEGVSESVQLS 540
E +KEI++N+ S++Q D V++ + L
Sbjct: 481 ---ELVAAKLKEIKENS----------------------FSENQQSDEADYNVTDDLILQ 540
Query: 541 LRSRGSMLDEDTAIEI--LLEDEGDKSFDGDDDLFTHLAAENPIQ--VASFDMSSQKLSF 600
L ++ S+ + + E+ L E DK + + F L+ + P + + S + L
Sbjct: 541 LATQQSLEENKKSKELFSLSASEFDK-LNSEKKTFEILSTDIPAEDSMNSLLNDEENLKL 600
Query: 601 DGTTDSGWEEAVEGKTYSPKNVEVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAES 660
+ D + + P+N F++ S E + D ++P+PF +
Sbjct: 601 EHVGDVSNDSLAFAEKKHPEN---GTSIFMDALPSASREKKTN----DLIDPLPFQPMDW 660
Query: 661 GKSV--------SKGSLEEEADLQEAIRRSLKDVGDRKPGSVLS--------EHQKPESA 720
GKS+ ++ ++ + D+ + K V D ++ + + KP
Sbjct: 661 GKSIFFEKLKKPTETFMDSKTDIPSEAPDNSKLVEDTNLHTINATVNIESDLDAAKPGIE 720
Query: 721 GKMLEQCTSVQ-NENVIGLKNVDGADGMSCSKANDSTGRKETTESSSQEKQCSECIVLLD 780
++ V+ +E + L+ ++ + K G + S +K S + LL
Sbjct: 721 NPIISPLLPVKDDEKDLDLRELNPLEPFENMKEQADDGTVTNPLNVSSDKAMS--VYLLS 780
Query: 781 TTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGAVLVGDRINNKLTEPPCHMVGM 840
+ + KD S D L L + + D K P +
Sbjct: 781 S----------ENAKDTGDIKSESIDAVLPTLETSSPSLSIPTD--FQKEASPNKGAAAL 840
Query: 841 EDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQKPAVEVISNAEIEFTEDELT 900
PEV +DE ++++ + +K +S +E
Sbjct: 841 SSKVEPEVVEK------------LLDEEEEEMIIRMAEEEKEYDRFVSELNQRHETEEWN 900
Query: 901 NRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYM 960
E+ L ++++ +R+A+ V+ M ECQELL++FGLPYI+AP EAEAQC+ +
Sbjct: 901 Q--EAFEKRLKELKNQKRSEKRDADEVTQVMIKECQELLRLFGLPYIVAPQEAEAQCSKL 960
Query: 961 ELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKIIRMALL 1020
LVDG+VTDDSDV LFG VY+N+F+ K+VE Y M D++ E +N+ +I++A L
Sbjct: 961 LELKLVDGIVTDDSDVFLFGGTRVYRNMFNQNKFVELYLMDDMKREFNVNQMDLIKLAHL 1014
Query: 1021 LGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTLSAQTGLSSRKRG 1080
LGSDYT G+S +G V A+E+++ FP + GL +FK+W +
Sbjct: 1021 LGSDYTMGLSRVGPVLALEILHEFPGDTGLFEFKKWFQ---------------------- 1014
Query: 1081 SKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMKKHRNVSKNWHIPSEFPSET 1140
+ G AS D++ +K+ I+ + +PSEFP+
Sbjct: 1081 -------------RLSTGHASKNDVNTPVKKRIN----------KLVGKIILPSEFPNPL 1014
Query: 1141 VISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYSKHE--- 1200
V AY P VD S +SF WG PD LR+ GW + +E+LLPV+++ K +
Sbjct: 1141 VDEAYLHPAVDDSKQSFQWGIPDLDELRQFLMATVGWSKQRTNEVLLPVIQDMHKKQFVG 1014
Query: 1201 TQLRLEAFY------TFNERFA-KIRSKRIKKAV 1204
TQ L F+ + R A +SKR++ A+
Sbjct: 1201 TQSNLTQFFEGGNTNVYAPRVAYHFKSKRLENAL 1014
BLAST of MS008891 vs. ExPASy Swiss-Prot
Match:
P35689 (DNA excision repair protein ERCC-5 OS=Mus musculus OX=10090 GN=Ercc5 PE=1 SV=4)
HSP 1 Score: 231.9 bits (590), Expect = 5.0e-59
Identity = 316/1250 (25.28%), Postives = 502/1250 (40.16%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLW+LL G RVS E L GK LA+ D SIW+
Sbjct: 1 MGVQGLWKLLECSGHRVSPEALEGKVLAV-------------------------DISIWL 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Q +K +RD G ++ NAHLL F R+CKLLF R +P+FVFDG P LK++TL RR+++
Sbjct: 61 NQALKGVRDSHGNVIENAHLLTLFHRLCKLLFFRIRPIFVFDGDAPLLKKQTLAKRRQRK 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQ------KQQRRQDVPKKKNLP----- 180
++A RKT EKLL LK LK +++ + QR+ D+ LP
Sbjct: 121 DSASIDSRKTTEKLLKTFLKRQALKTAFRSSRHEAPPSLTQVQRQDDIYVLPPLPEEEKH 180
Query: 181 --------------NHKRTADGTSGRNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSS 240
+ K+ N S D L + I+ + F
Sbjct: 181 SSEEEDEKQWQARMDQKQALQEEFFHNPQAIDIESEDFSSLPPEVKHEILTDMKEFTKRR 240
Query: 241 SSSFSGAALAKDNSGE---ESILKQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQ 300
+ F ++ + + +LK+ Y N + + + E++ S + + G
Sbjct: 241 RTLFEAMPEESNDFSQYQLKGLLKKNYLN--QHIENVQKEMNQQHSGQIQRQYQDEGGF- 300
Query: 301 QNLDEMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALP 360
L E+ + + +E+ + +D D++ + + + S +
Sbjct: 301 --LKEVESRRVVSEDTSHYILIKGIQGKKVMDVDSESLPSSSNVHSVSSNLKSSPHEKVK 360
Query: 361 PSVQLDLLVQDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANRE 420
P + + P R + +Q A G SR ++
Sbjct: 361 PEREPEAAPPSP------------------RTLLAIQAAMLGSSSEDEPESREGRQSKER 420
Query: 421 FIFSSSFTGDKQVLASRTEKSGDEDLQAPRGQQPLSSLNNTEVPSTSNALAQSTPDKTGV 480
+S T D ++ RT Q+ L N+ +V +S+ LA+ +G+
Sbjct: 421 ---NSGATADAGSISPRT---------CAAIQKALDDDNDEKVSGSSDDLAEKMLLGSGL 480
Query: 481 FEENIETFLDERGRVRVSRVRAMGMRM-TRDLERNLDLMKE-IEKNASANEVVNHEPVQS 540
+E ERG G+ T L ++ +KE + +SAN
Sbjct: 481 EQEEHADETAERGG---------GVPFDTAPLTPSVTEVKECVTSGSSAN---------- 540
Query: 541 SEICNPRSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILLEDEGDKSFDGDD 600
+ + S ++ S DTP E VS + A++ + + +G
Sbjct: 541 GQTDSAHSFTTASHRCDTPKETVSLA---------------RAVKEASQISSECEVEGRP 600
Query: 601 DLFTHLAAENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEVDD--HPFV-- 660
+ P S +S ++ + T + +G P++ E+ + +P
Sbjct: 601 AALSPAFIGTPSSHVSGVLSEREPTLAPPTTRTHSD--QGIDIHPEDPELQNGLYPLETK 660
Query: 661 --EGRVSDESEVEWEEG----VCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLK 720
R+S + E E + C V+ VP A + ++ + EE D QE I+ L+
Sbjct: 661 CNSSRLSSDDETEGGQNPAPKACSTVH-VPAEAMSNLENALPSNAEERGDFQETIQ--LR 720
Query: 721 DVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKE 780
+V + ++S P+ G M + S ++E +DG + + +
Sbjct: 721 EVPEAAARELIS---APKPMGPM--EMESEESE----------SDGSFIEVQSVVSNSEL 780
Query: 781 TTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKP-LSRDASGAV 840
TESS + +TH +S KD+ E +TL+ SRD
Sbjct: 781 QTESS-------------EASTH------------LSEKDAEEPRETLEEGTSRDTECL- 840
Query: 841 LVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQ 900
+ S + A E H+ +D
Sbjct: 841 ---------------------------LQDSSDIEAMEGHREADID-------------- 900
Query: 901 KPAVEVISNAEIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQ 960
E + N + +EL S L E+ +L ++++ +R A SV+ +MF E QELL+
Sbjct: 901 ---AEDMPNEWQDINLEELDALESNLLAEQNSLKAQKQQQDRIAASVTGQMFLESQELLR 960
Query: 961 MFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFM 1020
+FG+PYI APMEAEAQCA ++L + G +TDDSD+ LFGAR VYKN F+ K+VE Y
Sbjct: 961 LFGVPYIQAPMEAEAQCAMLDLTDQTSGTITDDSDIWLFGARHVYKNFFNKNKFVEYYQY 1018
Query: 1021 KDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEE--DGLQKFKEWIE 1080
D ++LGL+R+K+I +A LLGSDYTEGI +G V A+E++N FP D L KF EW
Sbjct: 1021 VDFYSQLGLDRNKLINLAYLLGSDYTEGIPTVGCVTAMEILNEFPGRGLDPLLKFSEWWH 1018
Query: 1081 SPDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQS 1140
+N V +
Sbjct: 1081 EAQ-----------------------------NNKKVAE-------------------NP 1018
Query: 1141 FMKKHRNVSKNWHIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWD 1200
+ K + + + FP+ V AY P VD S SF WGKPD + C FGW+
Sbjct: 1141 YDTKVKKKLRKLQLTPGFPNPAVADAYLRPVVDDSRGSFLWGKPDVDKISTFCQRYFGWN 1018
Query: 1201 NSKADELLLPVLKEYSKHETQLRLEAFYTFNERFAK----IRSKRIKKAV 1204
K DE L PVLK + H+TQLR+++F+ ++ + I+S R+ +AV
Sbjct: 1201 RMKTDESLYPVLKHLNAHQTQLRIDSFFRLAQQEKQDAKLIKSHRLNRAV 1018
BLAST of MS008891 vs. ExPASy Swiss-Prot
Match:
P28715 (DNA excision repair protein ERCC-5 OS=Homo sapiens OX=9606 GN=ERCC5 PE=1 SV=3)
HSP 1 Score: 216.9 bits (551), Expect = 1.7e-54
Identity = 339/1381 (24.55%), Postives = 555/1381 (40.19%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLW+LL GR+VS E L GK LA+ D SIW+
Sbjct: 1 MGVQGLWKLLECSGRQVSPEALEGKILAV-------------------------DISIWL 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Q +K +RD G + N HLL F R+CKLLF R +P+FVFDG P LK++TL+ RR+++
Sbjct: 61 NQALKGVRDRHGNSIENPHLLTLFHRLCKLLFFRIRPIFVFDGDAPLLKKQTLVKRRQRK 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
+ A + RKT EKLL LK +K +++ V ++ +L
Sbjct: 121 DLASSDSRKTTEKLLKTFLKRQAIKTAFRSKRDEALPSLTQVRRENDL------------ 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILKQKYK 240
+ + EE K +S EE +++++
Sbjct: 181 ------------------YVLPPLQEEE-----------------KHSSEEED--EKEWQ 240
Query: 241 NDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEMLAASIAAEEAGSLNENASVSAA 300
K+ L +E A I +E+ SL
Sbjct: 241 ERMNQKQALQEEF---------------------FHNPQAIDIESEDFSSLPPEVKHEIL 300
Query: 301 ANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLDLLVQDPAKFSELQIQAYLKTVA 360
++ E T ++ A+P ++ FS+ Q++ LK
Sbjct: 301 TDMKEFTKRR--------------RTLFEAMP---------EESDDFSQYQLKGLLKKNY 360
Query: 361 FRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQVLASRTEKSGDEDLQA 420
+ I+ VQK + + G ++ R++ F K+V + R
Sbjct: 361 LNQHIEHVQKEMNQQHSGHIR---------RQYEDEGGFL--KEVESRRVVSEDTSHYIL 420
Query: 421 PRGQQ--PLSSLNNTEVPSTSNALAQSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMR 480
+G Q ++ +++ +PS+S S K+ E+ ++T + R
Sbjct: 421 IKGIQAKTVAEVDSESLPSSSKMHGMSFDVKSSPCEK-LKTEKEPDATPPSPRTLLAMQA 480
Query: 481 MTRDLERNLDLMKEIEKNASANEVVNHEPVQSSEICNPRSHSSQSQDLDTPYEGVSESVQ 540
+L E + A N +PR+ S+ + LD E V
Sbjct: 481 ALLGSSSEEELESENRRQARGR---NAPAAVDEGSISPRTLSAIKRALDDD-EDVKVCAG 540
Query: 541 LSLRSRGSMLDE---DTAIEILLEDEGDKSFDGDDDLFTHLAAENPIQVASFDMSSQKLS 600
+++ G +E +++ E DEG K DG FT A + + A ++S
Sbjct: 541 DDVQTGGPGAEEMRINSSTE--NSDEGLKVRDGKGIPFTATLASSSVNSAEEHVASTNEG 600
Query: 601 FDGTTDSGWEEAVEGKTYSPKNVEVDDHPFVEGRVSDESEVEWEEGVCDHVNP--VPFG- 660
+ TDS +E + + + D ++ R + + E V H + +P G
Sbjct: 601 RE-PTDSVPKEQMSLVHVGTEAFPISDESMIKDR---KDRLPLESAVVRHSDAPGLPNGR 660
Query: 661 -----AAESGKSVSKGSLEEEADLQEAIRRSLKDVGDRKPGSVLSEHQ----KPESAGKM 720
+ SVSK E A++ E + L + S+LS KP SA ++
Sbjct: 661 ELTPASPTCTNSVSKN--ETHAEVLEQ-QNELCPYESKFDSSLLSSDDETKCKPNSASEV 720
Query: 721 L--------EQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKETTESSSQEK-QCSEC 780
+ SV +E V ++NV + ++ ++TTES+ Q+ +
Sbjct: 721 IGPVSLQETSSIVSVPSEAVDNVENVVSFNAKEHENFLETIQEQQTTESAGQDLISIPKA 780
Query: 781 IVLLDTTTHTVTEKLDASYKDVSHKDSNEN-----DDTLKPLSRDASGAVLVGDRINNKL 840
+ ++ + + D S+ +V S+E +T KP S
Sbjct: 781 VEPMEIDSE--ESESDGSFIEVQSVISDEELQAEFPETSKPPSEQGEE------------ 840
Query: 841 TEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQKPAVEVISNA 900
+VG + P S + +N + VD E +A+K A + +
Sbjct: 841 -----ELVGTREGEAPAESES---LLRDNSERDDVDG-------EPQEAEKDAEDSLHEW 900
Query: 901 EIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAP 960
+ + +EL S L ++ +L ++++ ER A +V+ +MF E QELL++FG+PYI AP
Sbjct: 901 Q-DINLEELETLESNLLAQQNSLKAQKQQQERIAATVTGQMFLESQELLRLFGIPYIQAP 960
Query: 961 MEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFMKDVENELGLN 1020
MEAEAQCA ++L + G +TDDSD+ LFGAR VY+N F+ K+VE Y D N+LGL+
Sbjct: 961 MEAEAQCAILDLTDQTSGTITDDSDIWLFGARHVYRNFFNKNKFVEYYQYVDFHNQLGLD 1020
Query: 1021 RDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEE--DGLQKFKEWIESPDPSILGTL 1080
R+K+I +A LLGSDYTEGI +G V A+E++N FP + L KF EW
Sbjct: 1021 RNKLINLAYLLGSDYTEGIPTVGCVTAMEILNEFPGHGLEPLLKFSEWWHE--------- 1080
Query: 1081 SAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMKKHRNVSK 1140
+ + + + DT K + +
Sbjct: 1081 ------AQKNPKIRPNPHDT---------------------------------KVKKKLR 1140
Query: 1141 NWHIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLP 1200
+ FP+ V AY P VD S SF WGKPD +R C FGW+ +K DE L P
Sbjct: 1141 TLQLTPGFPNPAVAEAYLKPVVDDSKGSFLWGKPDLDKIREFCQRYFGWNRTKTDESLFP 1157
Query: 1201 VLKEYSKHETQLRLEAFYTF----NERFAKIRSKRIKKAVRGITGSKSAVLMDDAVRAVS 1260
VLK+ +TQLR+++F+ E +I+S+R+ +AV + + + + AVS
Sbjct: 1201 VLKQLDAQQTQLRIDSFFRLAQQEKEDAKRIKSQRLNRAVTCMLRKEKEAAASE-IEAVS 1157
Query: 1261 ANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVE-KRLGKPSRKRQLHGEQSQPAKGQKLT 1320
++E + + K + I + ++ KRL K G + + +
Sbjct: 1261 VAMEKEFELLDKAKGKTQKRGITNTLEESSSLKRKRLSDSKGKNTCGGFLGETCLSE--S 1157
Query: 1321 MKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESEFDDQ 1344
+ + S N + R K + +K +P T+SS +S+ D
Sbjct: 1321 SDGSSSEDAESSSLMNVQRRTAAKEPKTSASDSQNSVKEAPVKNGGATTSSSSDSDDDGG 1157
BLAST of MS008891 vs. ExPASy TrEMBL
Match:
A0A6J1D8H5 (DNA repair protein UVH3 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018345 PE=4 SV=1)
HSP 1 Score: 2922.9 bits (7576), Expect = 0.0e+00
Identity = 1546/1602 (96.50%), Postives = 1551/1602 (96.82%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESIL----- 240
RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESIL
Sbjct: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILPLMNE 240
Query: 241 -------------------KQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD 300
KQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD
Sbjct: 241 VDPDVFSTLPSSIQYELLQKQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD 300
Query: 301 EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ 360
EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ
Sbjct: 301 EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ 360
Query: 361 LDLLVQ------------------DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG 420
LDLLVQ DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG
Sbjct: 361 LDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG 420
Query: 421 GVQTSRIASEANREFIFSSSFTGDKQVLASRTEKSGDEDLQAPRGQQPLSSLNNTEVPST 480
GVQTSRIASEANREFIFSSSFTGDKQVLASR EKSGDEDLQAPRGQQPLSSLNNTEVPST
Sbjct: 421 GVQTSRIASEANREFIFSSSFTGDKQVLASRIEKSGDEDLQAPRGQQPLSSLNNTEVPST 480
Query: 481 SNALAQSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS 540
SNALA+STPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS
Sbjct: 481 SNALARSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS 540
Query: 541 ANEVVNHEPVQSSEICNPRSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL 600
ANEVVNHEPVQ+SEICNP+SHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL
Sbjct: 541 ANEVVNHEPVQNSEICNPKSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL 600
Query: 601 EDEGDKSFDGDDDLFTHLAAENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNV 660
EDEGDKSFDGDDDLFTHLAAENPIQVASFD SSQKLSFDGTTDSGWEEAVEGKTYSPKNV
Sbjct: 601 EDEGDKSFDGDDDLFTHLAAENPIQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPKNV 660
Query: 661 EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS 720
EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS
Sbjct: 661 EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS 720
Query: 721 LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR 780
LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR
Sbjct: 721 LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR 780
Query: 781 KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA 840
KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA
Sbjct: 781 KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA 840
Query: 841 VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA 900
VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA
Sbjct: 841 VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA 900
Query: 901 QKPAVEVISNAEIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELL 960
QKPAVEVISNAEIEFTEDELTNRI ILEQERLNLGDEQKRLERNAESV SEMFAECQELL
Sbjct: 901 QKPAVEVISNAEIEFTEDELTNRIXILEQERLNLGDEQKRLERNAESVXSEMFAECQELL 960
Query: 961 QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYF 1020
QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDV LFGARSVYKNIFDDRKYVETYF
Sbjct: 961 QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYF 1020
Query: 1021 MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES 1080
MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES
Sbjct: 1021 MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES 1080
Query: 1081 PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF 1140
PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF
Sbjct: 1081 PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF 1140
Query: 1141 MKKHRNVSKNWHIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDN 1200
MKKHRNVSKNWHIPSEFPSE VISAYTCPQVDKSAESFSWGKPD FVLRRLCWEKFGWDN
Sbjct: 1141 MKKHRNVSKNWHIPSEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLRRLCWEKFGWDN 1200
Query: 1201 SKADELLLPVLKEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDA 1260
SKADELLLPVLKEYSKHETQLRLE FYTF+ERFAKIRSKRIKKAVRGITGSKSAVLMDDA
Sbjct: 1201 SKADELLLPVLKEYSKHETQLRLETFYTFDERFAKIRSKRIKKAVRGITGSKSAVLMDDA 1260
Query: 1261 VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG 1320
VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG
Sbjct: 1261 VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG 1320
Query: 1321 QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESE 1380
QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDE+E
Sbjct: 1321 QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDENE 1380
Query: 1381 FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS 1440
FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS
Sbjct: 1381 FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS 1440
Query: 1441 EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD 1500
EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD
Sbjct: 1441 EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD 1500
Query: 1501 NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSPVQNKEGTDARVDSLLDTGNP 1560
NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTS VQNKEGTDARVDSLLDTGNP
Sbjct: 1501 NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSSVQNKEGTDARVDSLLDTGNP 1560
BLAST of MS008891 vs. ExPASy TrEMBL
Match:
A0A6J1D7H7 (DNA repair protein UVH3 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111018345 PE=4 SV=1)
HSP 1 Score: 2921.3 bits (7572), Expect = 0.0e+00
Identity = 1545/1600 (96.56%), Postives = 1550/1600 (96.88%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILK---- 240
RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESIL
Sbjct: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILPLMNE 240
Query: 241 ------------------QKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEM 300
QKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEM
Sbjct: 241 VDPDVFSTLPSSIQYELLQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEM 300
Query: 301 LAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLD 360
LAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLD
Sbjct: 301 LAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLD 360
Query: 361 LLVQ------------------DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGV 420
LLVQ DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGV
Sbjct: 361 LLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGV 420
Query: 421 QTSRIASEANREFIFSSSFTGDKQVLASRTEKSGDEDLQAPRGQQPLSSLNNTEVPSTSN 480
QTSRIASEANREFIFSSSFTGDKQVLASR EKSGDEDLQAPRGQQPLSSLNNTEVPSTSN
Sbjct: 421 QTSRIASEANREFIFSSSFTGDKQVLASRIEKSGDEDLQAPRGQQPLSSLNNTEVPSTSN 480
Query: 481 ALAQSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASAN 540
ALA+STPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASAN
Sbjct: 481 ALARSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASAN 540
Query: 541 EVVNHEPVQSSEICNPRSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILLED 600
EVVNHEPVQ+SEICNP+SHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILLED
Sbjct: 541 EVVNHEPVQNSEICNPKSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILLED 600
Query: 601 EGDKSFDGDDDLFTHLAAENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEV 660
EGDKSFDGDDDLFTHLAAENPIQVASFD SSQKLSFDGTTDSGWEEAVEGKTYSPKNVEV
Sbjct: 601 EGDKSFDGDDDLFTHLAAENPIQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEV 660
Query: 661 DDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLK 720
DDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLK
Sbjct: 661 DDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLK 720
Query: 721 DVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKE 780
DVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKE
Sbjct: 721 DVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKE 780
Query: 781 TTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGAVL 840
TTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGAVL
Sbjct: 781 TTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGAVL 840
Query: 841 VGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQK 900
VGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQK
Sbjct: 841 VGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQK 900
Query: 901 PAVEVISNAEIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQM 960
PAVEVISNAEIEFTEDELTNRI ILEQERLNLGDEQKRLERNAESV SEMFAECQELLQM
Sbjct: 901 PAVEVISNAEIEFTEDELTNRIXILEQERLNLGDEQKRLERNAESVXSEMFAECQELLQM 960
Query: 961 FGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFMK 1020
FGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDV LFGARSVYKNIFDDRKYVETYFMK
Sbjct: 961 FGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMK 1020
Query: 1021 DVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPD 1080
DVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPD
Sbjct: 1021 DVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPD 1080
Query: 1081 PSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMK 1140
PSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMK
Sbjct: 1081 PSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMK 1140
Query: 1141 KHRNVSKNWHIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSK 1200
KHRNVSKNWHIPSEFPSE VISAYTCPQVDKSAESFSWGKPD FVLRRLCWEKFGWDNSK
Sbjct: 1141 KHRNVSKNWHIPSEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLRRLCWEKFGWDNSK 1200
Query: 1201 ADELLLPVLKEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDAVR 1260
ADELLLPVLKEYSKHETQLRLE FYTF+ERFAKIRSKRIKKAVRGITGSKSAVLMDDAVR
Sbjct: 1201 ADELLLPVLKEYSKHETQLRLETFYTFDERFAKIRSKRIKKAVRGITGSKSAVLMDDAVR 1260
Query: 1261 AVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQK 1320
AVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQK
Sbjct: 1261 AVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQK 1320
Query: 1321 LTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESEFD 1380
LTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDE+EFD
Sbjct: 1321 LTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDENEFD 1380
Query: 1381 DQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQSEK 1440
DQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQSEK
Sbjct: 1381 DQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQSEK 1440
Query: 1441 TERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDDNE 1500
TERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDDNE
Sbjct: 1441 TERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDDNE 1500
Query: 1501 CIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSPVQNKEGTDARVDSLLDTGNPNR 1560
CIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTS VQNKEGTDARVDSLLDTGNPNR
Sbjct: 1501 CIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSSVQNKEGTDARVDSLLDTGNPNR 1560
BLAST of MS008891 vs. ExPASy TrEMBL
Match:
A0A6J1DAD7 (DNA repair protein UVH3 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111018345 PE=3 SV=1)
HSP 1 Score: 2862.0 bits (7418), Expect = 0.0e+00
Identity = 1521/1602 (94.94%), Postives = 1526/1602 (95.26%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLWELLAPVGRRVSVETLAGKKLAI DASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAI-------------------------DASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESIL----- 240
RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESIL
Sbjct: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILPLMNE 240
Query: 241 -------------------KQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD 300
KQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD
Sbjct: 241 VDPDVFSTLPSSIQYELLQKQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD 300
Query: 301 EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ 360
EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ
Sbjct: 301 EMLAASIAAEEAGSLNENASVSAAANLDEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ 360
Query: 361 LDLLVQ------------------DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG 420
LDLLVQ DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG
Sbjct: 361 LDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG 420
Query: 421 GVQTSRIASEANREFIFSSSFTGDKQVLASRTEKSGDEDLQAPRGQQPLSSLNNTEVPST 480
GVQTSRIASEANREFIFSSSFTGDKQVLASR EKSGDEDLQAPRGQQPLSSLNNTEVPST
Sbjct: 421 GVQTSRIASEANREFIFSSSFTGDKQVLASRIEKSGDEDLQAPRGQQPLSSLNNTEVPST 480
Query: 481 SNALAQSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS 540
SNALA+STPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS
Sbjct: 481 SNALARSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNAS 540
Query: 541 ANEVVNHEPVQSSEICNPRSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL 600
ANEVVNHEPVQ+SEICNP+SHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL
Sbjct: 541 ANEVVNHEPVQNSEICNPKSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILL 600
Query: 601 EDEGDKSFDGDDDLFTHLAAENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNV 660
EDEGDKSFDGDDDLFTHLAAENPIQVASFD SSQKLSFDGTTDSGWEEAVEGKTYSPKNV
Sbjct: 601 EDEGDKSFDGDDDLFTHLAAENPIQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPKNV 660
Query: 661 EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS 720
EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS
Sbjct: 661 EVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRS 720
Query: 721 LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR 780
LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR
Sbjct: 721 LKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGR 780
Query: 781 KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA 840
KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA
Sbjct: 781 KETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSNENDDTLKPLSRDASGA 840
Query: 841 VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA 900
VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA
Sbjct: 841 VLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDA 900
Query: 901 QKPAVEVISNAEIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELL 960
QKPAVEVISNAEIEFTEDELTNRI ILEQERLNLGDEQKRLERNAESV SEMFAECQELL
Sbjct: 901 QKPAVEVISNAEIEFTEDELTNRIXILEQERLNLGDEQKRLERNAESVXSEMFAECQELL 960
Query: 961 QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYF 1020
QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDV LFGARSVYKNIFDDRKYVETYF
Sbjct: 961 QMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYF 1020
Query: 1021 MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES 1080
MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES
Sbjct: 1021 MKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIES 1080
Query: 1081 PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF 1140
PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF
Sbjct: 1081 PDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSF 1140
Query: 1141 MKKHRNVSKNWHIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDN 1200
MKKHRNVSKNWHIPSEFPSE VISAYTCPQVDKSAESFSWGKPD FVLRRLCWEKFGWDN
Sbjct: 1141 MKKHRNVSKNWHIPSEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLRRLCWEKFGWDN 1200
Query: 1201 SKADELLLPVLKEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDA 1260
SKADELLLPVLKEYSKHETQLRLE FYTF+ERFAKIRSKRIKKAVRGITGSKSAVLMDDA
Sbjct: 1201 SKADELLLPVLKEYSKHETQLRLETFYTFDERFAKIRSKRIKKAVRGITGSKSAVLMDDA 1260
Query: 1261 VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG 1320
VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG
Sbjct: 1261 VRAVSANKQRELSVEPQEKSEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKG 1320
Query: 1321 QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESE 1380
QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDE+E
Sbjct: 1321 QKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDENE 1380
Query: 1381 FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS 1440
FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS
Sbjct: 1381 FDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQS 1440
Query: 1441 EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD 1500
EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD
Sbjct: 1441 EKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDD 1500
Query: 1501 NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSPVQNKEGTDARVDSLLDTGNP 1560
NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTS VQNKEGTDARVDSLLDTGNP
Sbjct: 1501 NECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSSVQNKEGTDARVDSLLDTGNP 1560
BLAST of MS008891 vs. ExPASy TrEMBL
Match:
A0A6J1ERW5 (DNA repair protein UVH3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435307 PE=3 SV=1)
HSP 1 Score: 2095.9 bits (5429), Expect = 0.0e+00
Identity = 1192/1610 (74.04%), Postives = 1297/1610 (80.56%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGV GLWELLAPVGRRVSVETLAGKKLAI DASIWM
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAI-------------------------DASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATP+LKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPSLKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQAKVRKTAEKLLLNHLK MRL+ELAE ++NQKQQR+QDVPKKK L NH DGTS
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKEMRLRELAEGIKNQKQQRKQDVPKKKTLLNHNEIVDGTSV 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILKQKYK 240
+S + +SG+HE LDGM+AASIM EENGFF+SS+ SFSG L K + GE+SIL QKYK
Sbjct: 181 SERSKSVPNSGNHENLDGMVAASIMIEENGFFSSSAPSFSGVTLPKKDRGEQSILNQKYK 240
Query: 241 NDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEMLAASIAAEEAGSLNENASVSAA 300
NDSKGKKILSDEIH VGSD+ERMEVASR AHQQNLDEMLAASIAAEEA LNEN SVS+A
Sbjct: 241 NDSKGKKILSDEIHVVGSDSERMEVASRSAHQQNLDEMLAASIAAEEARGLNENISVSSA 300
Query: 301 ANL---DEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLDLLVQ--------------- 360
ANL D D +DEDEEMILPEM GVVDPSVLAALPPSVQLDLLVQ
Sbjct: 301 ANLAGEDMDDEDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQR 360
Query: 361 ---DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSF 420
DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSF
Sbjct: 361 VKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSF 420
Query: 421 TGDKQVLAS-RTEKSGDEDLQAPRGQQPLSSLNNTEVPSTSNALAQSTPDKTGVFEENIE 480
TGDKQVL S R EK+GD++LQ PR QQ LSSLNNT++PSTSN LAQSTPDK+GVFE+NIE
Sbjct: 421 TGDKQVLTSTRAEKNGDKNLQEPRVQQSLSSLNNTDIPSTSNGLAQSTPDKSGVFEDNIE 480
Query: 481 TFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASANEVVNHEPVQSSEICNPRS 540
TFLDERGRVRVSRVRAMGM MTRDLERNLDLMKEIEKN +AN+V N EP+Q+ EICNP S
Sbjct: 481 TFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNINANKVANPEPMQNIEICNPES 540
Query: 541 HSSQSQDLDTPYEGVSESV-QLSLRSRGSMLDEDTAIEILLEDEGDKSFDGDDDLFTHLA 600
S +SQ LD EG+ ES+ +L R SML+EDTAIEILLE EG KSFDGDDDLFTHLA
Sbjct: 541 SSLRSQVLDVSNEGIDESINKLDERGADSMLNEDTAIEILLEGEGGKSFDGDDDLFTHLA 600
Query: 601 AENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEVDDHPFVEGRVSDESEVE 660
AENPIQ+ASFD+SSQKLS DGTTDSGW+EA +EG +SDESEV+
Sbjct: 601 AENPIQMASFDISSQKLSQDGTTDSGWKEA------------------LEGTISDESEVD 660
Query: 661 WEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLKDVGDRKPGSVLSEHQKP 720
WE+GVCDHVNPVPF ESGKSVSKGSLEEEADLQEAIRRSL+DVGD K G V EH++P
Sbjct: 661 WEDGVCDHVNPVPF-EDESGKSVSKGSLEEEADLQEAIRRSLEDVGDGKSGPVSLEHEQP 720
Query: 721 ES----AGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKETTESSSQEKQCSE 780
+S GKM EQCTSV+NENVIGL+ +D DGM+ S A DS +K TESSSQEKQCSE
Sbjct: 721 QSQPSIVGKMAEQCTSVENENVIGLEKMDSVDGMNWSNAKDSILKKGMTESSSQEKQCSE 780
Query: 781 CIVLLDTTTHTVTEKLDASYKDVSH--KDSNENDDTLKPLSRDASGAVLVGDRINNKLTE 840
+VLLDT T+ E+LDASYKD S ++SNE+ DTLK LSRDA A VGD IN+ + E
Sbjct: 781 PVVLLDT---TIAEQLDASYKDTSFSLQESNESSDTLKSLSRDAPRATQVGDMINSTMIE 840
Query: 841 PPCHMVGMEDSYTPEVDSSPKVVASENH--QNFPVDELSSDILLEENDAQKPAVEVISNA 900
P C MV M+ TP+VDSS K A ENH QN PV++ SSD+LLEE + VE IS A
Sbjct: 841 PACRMVEMDGVNTPDVDSSTKDSAFENHFKQNLPVEKHSSDLLLEEEVGKGHTVE-ISKA 900
Query: 901 EIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAP 960
E E TEDEL +RISILEQERLNLGDEQKRLERNAE+VSSEMFAECQELLQMFGLPYIIAP
Sbjct: 901 ETEVTEDELKSRISILEQERLNLGDEQKRLERNAEAVSSEMFAECQELLQMFGLPYIIAP 960
Query: 961 MEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFMKDVENELGLN 1020
MEAEAQCAYMELANLVDGVVTDDSDV LFGARSVYKNIFDDRKYVETYFMKDVENELGL+
Sbjct: 961 MEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLD 1020
Query: 1021 RDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTLSA 1080
R+K+IRMALLLGSDYTEGISGIGIVNA+EVMNAFPEEDGL KFKEWIESPDPSILGTL A
Sbjct: 1021 RNKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTLGA 1080
Query: 1081 QTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMKKHRNVSKNW 1140
+TGLS+RKRG KASE D CSNSSV DGSAS E+I +DLKENIDVKQ+FM KHRNVSKNW
Sbjct: 1081 KTGLSARKRGQKASENDAPCSNSSVRDGSASEENIDKDLKENIDVKQNFMVKHRNVSKNW 1140
Query: 1141 HIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVL 1200
HIPSEFPSE VISAY PQVDKSAE FSWGKPDHFVLRRLC EKFGW+NSKADELLLPVL
Sbjct: 1141 HIPSEFPSEAVISAYISPQVDKSAEPFSWGKPDHFVLRRLCLEKFGWENSKADELLLPVL 1200
Query: 1201 KEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDAVRAVSANKQRE 1260
KEY KHETQLRLEAFYTFNERFAKIRSKRIKKAV+ ITGSKSA LMD+ V VS N Q
Sbjct: 1201 KEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSKSASLMDETVPNVSVNNQIN 1260
Query: 1261 LSVEPQEK-SEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQKLTMKEKGN 1320
LS E Q+ SEKCSSEIQG+CSN D+V+ RL KPSRKRQL EQSQPAK +KLTMKEKG
Sbjct: 1261 LSGETQKNMSEKCSSEIQGACSNEDNVDNRLRKPSRKRQLDREQSQPAKDRKLTMKEKGK 1320
Query: 1321 RNRNEGSH-KNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESEFDDQKCDFV 1380
R+RNEGSH + GRGRG KGRGRGRL KGK SP TE V TSSSDDESEFDDQK D
Sbjct: 1321 RSRNEGSHSERGRGRGRGKGRGRGRLALKGK---SPITEFVGTSSSDDESEFDDQKIDLE 1380
Query: 1381 NLEEPQERRRSSRIRKSVSYTMGDA--DQPSDYNGDRFSNDEAKEHDVIHD--------- 1440
N++EPQERR+SSR+RKS SY M DA DQPSD++G R SNDEA + +V+
Sbjct: 1381 NVQEPQERRKSSRVRKSASYKMDDADQDQPSDHSGYRLSNDEANDDNVVQGGYTGPETVM 1440
Query: 1441 -QSEKTERDLGTPKRPPQEDYFETGGGFCPVEDEMS-----QDIDPSLEANNSEDYLRMG 1500
SE TE D PKR P DY TGGGFCP EDEMS Q+ DP+LEA+NSEDYL +G
Sbjct: 1441 IHSENTECDYEIPKRSPLRDYLGTGGGFCPTEDEMSREAMCQNKDPALEASNSEDYLTLG 1500
Query: 1501 GGFCLDDDNECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSPVQNKEGTDARVD 1560
GGFCLDDDNEC+DP A+ +AT SE +D SE DPDQSTFHPEK Q E T +
Sbjct: 1501 GGFCLDDDNECVDPVAHLDQATASEVPKDGSEDDPDQSTFHPEKDIGGNQLNEDTYPHGE 1559
BLAST of MS008891 vs. ExPASy TrEMBL
Match:
A0A6J1JJE1 (DNA repair protein UVH3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111486346 PE=3 SV=1)
HSP 1 Score: 2080.4 bits (5389), Expect = 0.0e+00
Identity = 1185/1614 (73.42%), Postives = 1296/1614 (80.30%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGV GLWELLAPVGRRVSVETLAGKKLAI DASIWM
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAI-------------------------DASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRD+RGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATP+LKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDDRGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPSLKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQAKVRKTAEKLLLNHLK MRL+ELAE ++NQKQQR+QDVPKKK L NH DGTS
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKEMRLRELAEGIKNQKQQRKQDVPKKKTLLNHNAIVDGTSI 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILKQKYK 240
+S + +SG+HE LDGM+AASIM EENGFF+SS+ SF G L K++ GE+S QKYK
Sbjct: 181 SERSKSVPNSGNHENLDGMVAASIMIEENGFFSSSAPSFVGVTLPKEDRGEQSTWNQKYK 240
Query: 241 NDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEMLAASIAAEEAGSLNENASVSAA 300
NDSKGKKILSDEIH VGSD+ERMEVASR AHQQNLDEMLAASIAAEEA LNEN SVS+A
Sbjct: 241 NDSKGKKILSDEIHVVGSDSERMEVASRSAHQQNLDEMLAASIAAEEARGLNENVSVSSA 300
Query: 301 ANL---DEDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLDLLVQ--------------- 360
+ L D D +DEDEEMILPEM GVVDPSVLAALPPSVQLDLLVQ
Sbjct: 301 SYLAGEDMDDEDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQR 360
Query: 361 ---DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSF 420
DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSF
Sbjct: 361 VKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSF 420
Query: 421 TGDKQVLAS-RTEKSGDEDLQAPRGQQPLSSLNNTEVPSTSNALAQSTPDKTGVFEENIE 480
TGDKQVL S R EK+GD+DLQAPR QQ LSSLNNT++PSTSN LAQSTPDK+GVFE+NIE
Sbjct: 421 TGDKQVLTSTRAEKNGDKDLQAPRVQQSLSSLNNTDIPSTSNGLAQSTPDKSGVFEDNIE 480
Query: 481 TFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASANEVVNHEPVQSSEICNPRS 540
TFLDERG VRVSRVRAMGM MTRDLERNLDLMKEIEKN +AN+ N EP+Q+ EICNP+S
Sbjct: 481 TFLDERGCVRVSRVRAMGMHMTRDLERNLDLMKEIEKNINANKAANPEPMQNIEICNPKS 540
Query: 541 HSSQSQDLDTPYEGVSESV-QLSLRSRGSMLDEDTAIEILLEDEGDKSFDGDDDLFTHLA 600
S +SQ LD EGV ES+ +L R SML+EDTAIEI+LE EG KSFDGDDDLFTHLA
Sbjct: 541 SSLRSQVLDVSNEGVDESINKLDERGADSMLNEDTAIEIVLEGEGGKSFDGDDDLFTHLA 600
Query: 601 AENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEVDDHPFVEGRVSDESEVE 660
AENPIQ+ASFD+SSQKLS DGTTDSGW+EA +EG VSDESEV+
Sbjct: 601 AENPIQMASFDISSQKLSQDGTTDSGWKEA------------------LEGTVSDESEVD 660
Query: 661 WEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLKDVGDRKPGSVLSEHQKP 720
WE+GVCDHVNPVPF ESGKSVSKGSLEEEADLQEAIRRSL+DVGD K G V EH++P
Sbjct: 661 WEDGVCDHVNPVPF-EDESGKSVSKGSLEEEADLQEAIRRSLEDVGDGKSGPVSLEHEQP 720
Query: 721 ES----AGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKETTESSSQEKQCSE 780
+S GKM E+C S +NENVIGL+ +D DGM+ S A DS +K TESSSQEKQCSE
Sbjct: 721 QSQPSIVGKMAERCMSFENENVIGLEKMDSVDGMNWSNAKDSILKKGMTESSSQEKQCSE 780
Query: 781 CIVLLDTTTHTVTEKLDASYKDVSH--KDSNENDDTLKPLSRDASGAVLVGDRINNKLTE 840
+VLLDT T+ E+LDASYKD S + SNEN DTLK LSRDA A VGD IN+ + E
Sbjct: 781 PVVLLDT---TIAEQLDASYKDTSFSLQVSNENSDTLKSLSRDAPRATQVGDMINSTVIE 840
Query: 841 PPCHMVGMEDSYTPEVDSSPKVVASENH--QNFPVDELSSDILLEENDAQKPAVEVISNA 900
P C MV M+ TP+VDSS K A ENH QNFPV++ SSD+LLEE + V+ IS A
Sbjct: 841 PACRMVEMDGVNTPDVDSSTKDSAFENHFKQNFPVEKHSSDLLLEEEVGKGHTVK-ISKA 900
Query: 901 EIEFTEDELTNRISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAP 960
E E TEDEL +RISILEQERL+LGDEQKRLERNAE+VSSEMFAECQELLQMFGLPYIIAP
Sbjct: 901 EAEVTEDELKSRISILEQERLSLGDEQKRLERNAEAVSSEMFAECQELLQMFGLPYIIAP 960
Query: 961 MEAEAQCAYMELANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFMKDVENELGLN 1020
MEAEAQCAYMELANLVDGVVTDDSDV LFGARSVYKNIFDDRKYVETYFMKD+ENELGL+
Sbjct: 961 MEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDIENELGLD 1020
Query: 1021 RDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTLSA 1080
R+K+IRMALLLGSDYTEGISGIGIVNA+EVMNAF EEDGL KFKEWIESPDPSILGTL A
Sbjct: 1021 RNKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFSEEDGLHKFKEWIESPDPSILGTLGA 1080
Query: 1081 QTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMKKHRNVSKNW 1140
+TGLS+RKRG KASE D TCSNSSV DGSAS E+I +DLKENIDVKQ+FM KHRNVSKNW
Sbjct: 1081 KTGLSARKRGQKASENDATCSNSSVRDGSASEENIDKDLKENIDVKQNFMVKHRNVSKNW 1140
Query: 1141 HIPSEFPSETVISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVL 1200
HIPSEFPSE VISAY PQVDKSAE FSWGKPDHFVLRRLC EKFGW+NSKADELLLPVL
Sbjct: 1141 HIPSEFPSEAVISAYISPQVDKSAEPFSWGKPDHFVLRRLCLEKFGWENSKADELLLPVL 1200
Query: 1201 KEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDAVRAVSANKQRE 1260
KEY KHETQLRLEAFYTFNERFAKIRSKRIKKAV+ ITGSKSA LMD+ V VS N Q
Sbjct: 1201 KEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSKSASLMDETVPNVSVNNQIN 1260
Query: 1261 LSVEPQEK-SEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQKLTMKEKGN 1320
LS E Q+ SEKCSSEIQG+CSN D+V+ RL KPSRKRQL EQSQPAK +KLTMKEKG
Sbjct: 1261 LSGETQKNMSEKCSSEIQGACSNEDNVDNRLRKPSRKRQLDREQSQPAKDRKLTMKEKGK 1320
Query: 1321 RNRNEGSHK---NGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESEFDDQKCD 1380
R+RNEGSH GRGRG KGRGRGRL KGK SP TE VETSSSDDESE DD+K D
Sbjct: 1321 RSRNEGSHSERGRGRGRGGGKGRGRGRLASKGK---SPITEFVETSSSDDESESDDKKLD 1380
Query: 1381 FVNLEEPQERRRSSRIRKSVSYTMGDA----DQPSDYNGDRFSNDEAKEHDVIHDQ---- 1440
N++EPQERR+SSR+RKS SY M DA DQPSDY+G R SNDEA + +V+ +
Sbjct: 1381 LENVQEPQERRKSSRVRKSASYKMDDADPDQDQPSDYSGYRLSNDEANDDNVVQGRYTGP 1440
Query: 1441 ------SEKTERDLGTPKRPPQEDYFETGGGFCPVEDEMSQDI-----DPSLEANNSEDY 1500
SE TE D PKR P DY TGGGFCP EDEMSQ+ DP+LEA+NSEDY
Sbjct: 1441 ETVMIHSENTECDYEIPKRSPLRDYLGTGGGFCPTEDEMSQEAMCRNKDPALEASNSEDY 1500
Query: 1501 LRMGGGFCLDDDNECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSPVQNKEGTD 1560
L +GGGFCLDDDNEC+DP A+ +ATVSE L+D SE DP QSTFHPEK Q +E T
Sbjct: 1501 LTLGGGFCLDDDNECVDPVAHLDQATVSEALKDGSEDDPGQSTFHPEKDIGGDQLEEDTY 1560
BLAST of MS008891 vs. TAIR 10
Match:
AT3G28030.1 (5'-3' exonuclease family protein )
HSP 1 Score: 977.2 bits (2525), Expect = 1.5e-284
Identity = 700/1643 (42.60%), Postives = 930/1643 (56.60%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
MGVQGLWELLAPVGRRVSVETLA K+LAI DASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLANKRLAI-------------------------DASIWM 60
Query: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRDE+G+MV+NAHL+GFFRRICKLLFLRTKP+FVFDGATPALKRRT+IARRRQR
Sbjct: 61 VQFIKAMRDEKGDMVQNAHLIGFFRRICKLLFLRTKPIFVFDGATPALKRRTVIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
ENAQ K+RKTAEKLLLN LK +RLKE A+D++NQ+ ++ KK + + +
Sbjct: 121 ENAQTKIRKTAEKLLLNRLKDIRLKEQAKDIKNQRLKQDDSDRVKKRVSSDSVEDNLRVP 180
Query: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILKQKYK 240
+ S EKLD + AS++ E +G + ++K+ K
Sbjct: 181 VEEDDVGASFFQEEKLDEVSQASLVGE---------------------TGVDDVVKESVK 240
Query: 241 NDSKGKKILSDEIHAVGSDTERM--EVASRGA-HQQNLDEMLAASIAAEEAGSLNENASV 300
+D KGK +L D G D + + + + +G +Q+ LDEMLAAS+AAEE + AS
Sbjct: 241 DDPKGKGVLLD-----GDDLDNLVQDSSVQGKDYQEKLDEMLAASLAAEEERNFTSKAST 300
Query: 301 SAAANLDEDTDDE----DEEMILPEMDGVVDPSVLAALPPSVQLDLLVQ----------- 360
SAAA E+ ++E DEE++LP MDG +DP+VLA+LPPS+QLDLL Q
Sbjct: 301 SAAAIPSEEDEEEDSDGDEEILLPVMDGNIDPAVLASLPPSMQLDLLAQMREKLMAENRQ 360
Query: 361 -------DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIF 420
P KFSELQI+AYLKTVAFRR+I++VQ++A GR VGGVQTSRIASEANREFIF
Sbjct: 361 KYQKVKKAPEKFSELQIEAYLKTVAFRREINEVQRSAGGRAVGGVQTSRIASEANREFIF 420
Query: 421 SSSFTGDKQVLASRTEKSGDEDLQAPRGQQPLSSLNNTEVPSTSNALAQSTPDKTGVFEE 480
SSSF GDK+VLAS E DE+ + Q S+ N S+A + D+ +E
Sbjct: 421 SSSFAGDKEVLASAREGRNDENQKKTSQQSLPVSVKNASPLKKSDATIELDRDEPKNPDE 480
Query: 481 NIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASANEVVNHEPVQSSEICN 540
NIE ++DERGR R+ R R MG++MTRD++RNL LMKE E+ AS + N E + E
Sbjct: 481 NIEVYIDERGRFRI-RNRHMGIQMTRDIQRNLHLMKEKERTASGSMAKNDETFSAWE--- 540
Query: 541 PRSHSSQSQDLD-TPYEGVSESVQLSLRSRGSMLDEDTAIEILLE-DEGDKSFDGDDDLF 600
+ ++ Q L+ +P E + V L +++ SML ++IEI + D G K + +DD+F
Sbjct: 541 --NFPTEDQFLEKSPVE--KDVVDLEIQNDDSMLHPPSSIEISFDHDGGGKDLNDEDDMF 600
Query: 601 THLAAENPIQVASFDMSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEVD-DHPFVEGRVSD 660
LAA P+ ++S + ++ + +DS WEE + S +E + + + +S
Sbjct: 601 LQLAAGGPVTISSTENDPKEDTSPWASDSDWEEVPVEQNTSVSKLEANLSNQHIPKDISI 660
Query: 661 ESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLKDVGDRKPGSVLS 720
V WEE C + N ++ ++KG LEEEADLQEAI++SL ++ D++ G VL
Sbjct: 661 AEGVAWEEYSCKNANNSV--ENDTVTKITKGYLEEEADLQEAIKKSLLELHDKESGDVLE 720
Query: 721 EHQ---------KP--------ESAG-----KMLEQCTSVQNENVIGLK-------NVDG 780
E+Q KP E+ G + L++ T ++ I + N DG
Sbjct: 721 ENQSVRVNLVVDKPSEDSLCSRETVGEAEEERFLDEITILKTSGAISEQSNTSVAGNADG 780
Query: 781 ADGMSCS-KANDSTGRKETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDVSHKDSN 840
G++ + S+G + + S + + ++ + + ++ S H +
Sbjct: 781 QKGITKQFGTHPSSGSNNVSHAVSNKLSKVKSVISPEKALNVASQNRMLSTMAKQHNE-- 840
Query: 841 ENDDTLKPLSRDASGAVLVGDRINNKLTEPPC----HMVGMEDS-----------YTPEV 900
E ++ S S + + I L E + M+D T
Sbjct: 841 EGSESFGGESVKVSAMPIADEEITGFLDEKDNADGESSIMMDDKRDYSRRKIQSLVTESR 900
Query: 901 DSSPKVVASENHQNFPVDELSSDILLEENDAQKPAVEVISNAE-------IEFTEDELTN 960
D S VV S D S + EEN++ + + S+ + +EF+E +
Sbjct: 901 DPSRNVVRSRIGILHDTD--SQNERREENNSNEHTFNIDSSTDFEEKGVPVEFSEANIEE 960
Query: 961 RISILEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYME 1020
I +L+QE ++LGDEQ++LERNAESVSSEMFAECQELLQ+FG+PYIIAPMEAEAQCA+ME
Sbjct: 961 EIRVLDQEFVSLGDEQRKLERNAESVSSEMFAECQELLQIFGIPYIIAPMEAEAQCAFME 1020
Query: 1021 LANLVDGVVTDDSDVILFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKIIRMALLL 1080
+NLVDG+VTDDSDV LFGARSVYKNIFDDRKYVETYFMKD+E ELGL+RDKIIRMA+LL
Sbjct: 1021 QSNLVDGIVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDIEKELGLSRDKIIRMAMLL 1080
Query: 1081 GSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTLSAQTGLSSRKRGS 1140
GSDYTEGISGIGIVNAIEV+ AFPEEDGLQKF+EW+ESPDP+ILG A+TG +KRGS
Sbjct: 1081 GSDYTEGISGIGIVNAIEVVTAFPEEDGLQKFREWVESPDPTILGKTDAKTGSKVKKRGS 1140
Query: 1141 KASEKDTTCSNSSVGDGSASGEDISEDLKENIDVKQSFMKKHRNVSKNWHIPSEFPSETV 1200
+ + S +S D ++KQ FM +HR VSKNWHIP FPSE V
Sbjct: 1141 ASVDNKGIISGASTDDTE--------------EIKQIFMDQHRKVSKNWHIPLTFPSEAV 1200
Query: 1201 ISAYTCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYSKHETQLR 1260
ISAY PQVD S E FSWGKPD VLR+LCWEKF W+ K DELLLPVLKEY K ETQLR
Sbjct: 1201 ISAYLNPQVDLSTEKFSWGKPDLSVLRKLCWEKFNWNGKKTDELLLPVLKEYEKRETQLR 1260
Query: 1261 LEAFYTFNERFAKIRSKRIKKAVRGITGSKSAVLMDDAVRAVSANKQRELSVEPQEKSEK 1320
+EAFY+FNERFAKIRSKRI KAV+GI G S+ + D ++ K+ + V P E +
Sbjct: 1261 IEAFYSFNERFAKIRSKRINKAVKGIGGGLSSDVADHTLQE-GPRKRNKKKVAPHETEDN 1320
Query: 1321 CSSEIQGSCSN--VDDVEKRLGKPSRKRQLHGEQSQPAKGQKLTMKEKGNRNRNEGSHKN 1380
+S+ +N V + KRL KPS R
Sbjct: 1321 NTSDKDSPIANEKVKNKRKRLEKPSSSR-------------------------------- 1380
Query: 1381 GRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDESEFDDQKCDFVNLE-EPQERRR 1440
GRGR +++GRGRGR+Q +L+E S + + DD K V LE +P ++
Sbjct: 1381 GRGRAQKRGRGRGRVQ----------KDLLELSDGSSDDDDDDDK--VVELEAKPANLQK 1440
Query: 1441 SSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHDQSEKTERDLGTPKRPPQEDYF 1500
S+R R V Y+ + D+ + + S E E +V + + + P EDY
Sbjct: 1441 STRSRNPVMYSAKEDDELDESRSNEGSPSENFE-EVDEGRIGNDDSVDASINDCPSEDYI 1477
Query: 1501 ETGGGFCPVEDEMSQDIDPSLEANNSEDYLRMGGGFCLDDDNECIDPDAYPGRATVSEDL 1560
+TGGGFC DE + D LE ++DY +GGGFC+D+D T E+
Sbjct: 1501 QTGGGFC--ADEADEIGDAHLEDKATDDYRVIGGGFCVDEDE------------TAEENT 1477
BLAST of MS008891 vs. TAIR 10
Match:
AT1G01880.1 (5'-3' exonuclease family protein )
HSP 1 Score: 114.4 bits (285), Expect = 8.3e-25
Identity = 95/295 (32.20%), Postives = 145/295 (49.15%), Query Frame = 0
Query: 899 LERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVILF 958
+ERN + SE EC ELL++ G+P + A EAEA CA + VD +T DSD LF
Sbjct: 110 VERN--KLFSEWVRECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLF 169
Query: 959 GARSVYKNIF-DDRKYVETYFMKDVENELGLNRDKIIRMALLLGSDY-TEGISGIGIVNA 1018
GA V K+I + R+ E Y M +E+ LGL R +I ++LL+G+DY + G+ GIG+ A
Sbjct: 170 GAMCVIKDIKPNSREPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKA 229
Query: 1019 IEVMNAFPEEDGLQKFKEWIESPDPSILGTL-SAQTGLSSRKRGSKASEKDTTCSN--SS 1078
+ ++ F E+ L++ ++ P++ G + S G R K S + C + S
Sbjct: 230 LRIVREFSEDQVLERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSK 289
Query: 1079 VGDGSASGEDISED---LKENIDVK--QSFMKKHRNV-----SKNWHI--------PSEF 1138
+S E D +K+ + + SF K R++ + +W I EF
Sbjct: 290 RTHFKSSCEHCGCDSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEF 349
Query: 1139 PSETVISAYTCP-QVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVL 1170
P+ +I Y + S SWG PD +L L K WD S ++LLP+L
Sbjct: 350 PNRKIIELYLSDGLMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPML 402
BLAST of MS008891 vs. TAIR 10
Match:
AT1G01880.2 (5'-3' exonuclease family protein )
HSP 1 Score: 110.5 bits (275), Expect = 1.2e-23
Identity = 95/295 (32.20%), Postives = 145/295 (49.15%), Query Frame = 0
Query: 899 LERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVILF 958
+ERN + SE EC ELL++ G+P + A EAEA CA + VD +T DSD LF
Sbjct: 110 VERN--KLFSEWVREC-ELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLF 169
Query: 959 GARSVYKNIF-DDRKYVETYFMKDVENELGLNRDKIIRMALLLGSDY-TEGISGIGIVNA 1018
GA V K+I + R+ E Y M +E+ LGL R +I ++LL+G+DY + G+ GIG+ A
Sbjct: 170 GAMCVIKDIKPNSREPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKA 229
Query: 1019 IEVMNAFPEEDGLQKFKEWIESPDPSILGTL-SAQTGLSSRKRGSKASEKDTTCSN--SS 1078
+ ++ F E+ L++ ++ P++ G + S G R K S + C + S
Sbjct: 230 LRIVREFSEDQVLERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSK 289
Query: 1079 VGDGSASGEDISED---LKENIDVK--QSFMKKHRNV-----SKNWHI--------PSEF 1138
+S E D +K+ + + SF K R++ + +W I EF
Sbjct: 290 RTHFKSSCEHCGCDSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEF 349
Query: 1139 PSETVISAYTCP-QVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVL 1170
P+ +I Y + S SWG PD +L L K WD S ++LLP+L
Sbjct: 350 PNRKIIELYLSDGLMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPML 401
BLAST of MS008891 vs. TAIR 10
Match:
AT3G48900.1 (single-stranded DNA endonuclease family protein )
HSP 1 Score: 92.0 bits (227), Expect = 4.4e-18
Identity = 83/287 (28.92%), Postives = 134/287 (46.69%), Query Frame = 0
Query: 895 EQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSD 954
++ L+RN S S + E + + G+ + EAEAQCA + +L D + DSD
Sbjct: 42 KETSLKRNMGSEFSCIIKEAKVIASTLGILCLDGIEEAEAQCALLNSESLCDACFSFDSD 101
Query: 955 VILFGARSVYKNI-FDDRKYVETYFMKDVENELGLNRDKIIRMALLLGSDYTEGISGIGI 1014
+ LFGA++VY+ I + YV Y M D++ +LGL R+ +I +ALLLGSDY++G+ G+
Sbjct: 102 IFLFGAKTVYREICLGEGGYVVCYEMDDIKKKLGLGRNSLIALALLLGSDYSQGVRGLRQ 161
Query: 1015 VNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTLSAQTGLS--SRKRGSKASEKDTTCSN 1074
A E++ + + L+K A GLS + R SK + + CS
Sbjct: 162 EKACELVRSIGDNVILEKV----------------ASEGLSFAEKPRKSKKQVRPSVCSK 221
Query: 1075 SS-----VGDGSASGEDISEDLKENIDVKQSFM--KKHRNVSKNWHIPSEFPSETVISAY 1134
V +G+ + E++K+ ID +FM K H+ S
Sbjct: 222 KGTLPLVVINGNNRDPERLEEIKQVID---AFMNPKCHQADSNT---------------- 281
Query: 1135 TCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKE 1172
V ++ FS+ + L+ +C + F W K DE +LP + E
Sbjct: 282 ----VSRALAEFSFQRTK---LQEICHQFFEWPPEKTDEYILPKVAE 286
BLAST of MS008891 vs. TAIR 10
Match:
AT3G48900.2 (single-stranded DNA endonuclease family protein )
HSP 1 Score: 92.0 bits (227), Expect = 4.4e-18
Identity = 83/287 (28.92%), Postives = 134/287 (46.69%), Query Frame = 0
Query: 895 EQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSD 954
++ L+RN S S + E + + G+ + EAEAQCA + +L D + DSD
Sbjct: 106 KETSLKRNMGSEFSCIIKEAKVIASTLGILCLDGIEEAEAQCALLNSESLCDACFSFDSD 165
Query: 955 VILFGARSVYKNI-FDDRKYVETYFMKDVENELGLNRDKIIRMALLLGSDYTEGISGIGI 1014
+ LFGA++VY+ I + YV Y M D++ +LGL R+ +I +ALLLGSDY++G+ G+
Sbjct: 166 IFLFGAKTVYREICLGEGGYVVCYEMDDIKKKLGLGRNSLIALALLLGSDYSQGVRGLRQ 225
Query: 1015 VNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTLSAQTGLS--SRKRGSKASEKDTTCSN 1074
A E++ + + L+K A GLS + R SK + + CS
Sbjct: 226 EKACELVRSIGDNVILEKV----------------ASEGLSFAEKPRKSKKQVRPSVCSK 285
Query: 1075 SS-----VGDGSASGEDISEDLKENIDVKQSFM--KKHRNVSKNWHIPSEFPSETVISAY 1134
V +G+ + E++K+ ID +FM K H+ S
Sbjct: 286 KGTLPLVVINGNNRDPERLEEIKQVID---AFMNPKCHQADSNT---------------- 345
Query: 1135 TCPQVDKSAESFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKE 1172
V ++ FS+ + L+ +C + F W K DE +LP + E
Sbjct: 346 ----VSRALAEFSFQRTK---LQEICHQFFEWPPEKTDEYILPKVAE 350
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9ATY5 | 2.1e-283 | 42.60 | DNA repair protein UVH3 OS=Arabidopsis thaliana OX=3702 GN=UVH3 PE=2 SV=1 | [more] |
P14629 | 8.5e-67 | 25.57 | DNA excision repair protein ERCC-5 homolog OS=Xenopus laevis OX=8355 GN=ercc5 PE... | [more] |
P28706 | 3.5e-60 | 25.69 | DNA repair protein rad13 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) ... | [more] |
P35689 | 5.0e-59 | 25.28 | DNA excision repair protein ERCC-5 OS=Mus musculus OX=10090 GN=Ercc5 PE=1 SV=4 | [more] |
P28715 | 1.7e-54 | 24.55 | DNA excision repair protein ERCC-5 OS=Homo sapiens OX=9606 GN=ERCC5 PE=1 SV=3 | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1D8H5 | 0.0e+00 | 96.50 | DNA repair protein UVH3 isoform X1 OS=Momordica charantia OX=3673 GN=LOC11101834... | [more] |
A0A6J1D7H7 | 0.0e+00 | 96.56 | DNA repair protein UVH3 isoform X2 OS=Momordica charantia OX=3673 GN=LOC11101834... | [more] |
A0A6J1DAD7 | 0.0e+00 | 94.94 | DNA repair protein UVH3 isoform X3 OS=Momordica charantia OX=3673 GN=LOC11101834... | [more] |
A0A6J1ERW5 | 0.0e+00 | 74.04 | DNA repair protein UVH3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435307... | [more] |
A0A6J1JJE1 | 0.0e+00 | 73.42 | DNA repair protein UVH3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111486346 P... | [more] |