Homology
BLAST of Sgr021433 vs. NCBI nr
Match:
XP_022150081.1 (DNA repair protein UVH3 isoform X3 [Momordica charantia])
HSP 1 Score: 2427.5 bits (6290), Expect = 0.0e+00
Identity = 1324/1605 (82.49%), Postives = 1411/1605 (87.91%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
MGVQGLWELLAPVGRRVSVETLAGK+LAIDASIWMVQFIKAMRD+RGEMVRNAHLLGFFR
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDERGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRL+
Sbjct: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLK 120
Query: 121 ELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGTSERNKGV--PSSGSHENLDEMLAASIM 180
ELAEDLQNQKQQRRQDV KKK LPNH ADGTS RNK + SSG HE LD MLAASIM
Sbjct: 121 ELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSGRNKSITTTSSGDHEKLDGMLAASIM 180
Query: 181 AEENGFLMSSASSFAGTTLAKEDSGEESILPLMHEVDPDVLSTLPSSIRHE-LQKQKYKN 240
AEENGF SS+SSF+G LAK++SGEESILPLM+EVDPDV STLPSSI++E LQKQKYKN
Sbjct: 181 AEENGFFTSSSSSFSGAALAKDNSGEESILPLMNEVDPDVFSTLPSSIQYELLQKQKYKN 240
Query: 241 DSKDKKILSDEIHVVGSDSERMEVVSRSAYQKNLDEMLAASIAAEEAQSLNENASVSATA 300
DSK KKILSDEIH VGSD+ERMEV SR A+Q+NLDEMLAASIAAEEA SLNENASVSA A
Sbjct: 241 DSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEMLAASIAAEEAGSLNENASVSAAA 300
Query: 301 NWDGEDTDDEDEEMILPEMHGIVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKK 360
N D EDTDDEDEEMILPEM G+VDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKK
Sbjct: 301 NLD-EDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKK 360
Query: 361 DPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSKIASEANREFIFSSSFTGD 420
DPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGVGGVQTS+IASEANREFIFSSSFTGD
Sbjct: 361 DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGD 420
Query: 421 KQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPSTSNALARSTPDKTAVFEDNIETFL 480
KQVLAS R EK+GD+ L+APRGQQPLSSLNNTEVPSTSNALARSTPDKT VFE+NIETFL
Sbjct: 421 KQVLAS-RIEKSGDEDLQAPRGQQPLSSLNNTEVPSTSNALARSTPDKTGVFEENIETFL 480
Query: 481 DERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANEVVNPEPVQNIEICNPKSFSS 540
DERGRVRVSRVRAMGM MTRDLERNLDLMKEIEKNASANEVVN EPVQN EICNPKS SS
Sbjct: 481 DERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASANEVVNHEPVQNSEICNPKSHSS 540
Query: 541 QSQVLDTPYEGVGESIKLNESSRGSMLNEDTAIEILLEDKGDKSFDGDDDVFTHLAAENP 600
QSQ LDTPYEGV ES++L+ SRGSML+EDTAIEILLED+GDKSFDGDDD+FTHLAAENP
Sbjct: 541 QSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILLEDEGDKSFDGDDDLFTHLAAENP 600
Query: 601 IQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPKNVEVDDHSFKEGRVSDDSEVEWEDG 660
IQ+ASFD SSQKLS DGTTDSGWEE VEGKTYSPKNVEVDDH F EGRVSD+SEVEWE+G
Sbjct: 601 IQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEVDDHPFVEGRVSDESEVEWEEG 660
Query: 661 VCDQVNPVPFG-VESGKSVSKGSLEEEADLQEAIRRSLADIGDRKSGPVFSEHQQPVIVG 720
VCD VNPVPFG ESGKSVSKGSLEEEADLQEAIRRSL D+GDRK G V SEHQ+P G
Sbjct: 661 VCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLKDVGDRKPGSVLSEHQKPESAG 720
Query: 721 KMVEQCMVVENENVIELDKPDSADGMNCLKADDSTGRKETTESSSQEKQCSKSVVLLDTK 780
KM+EQC V+NENVI L D ADGM+C KA+DSTGRKETTESSSQEKQCS+ +VLLDT
Sbjct: 721 KMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKETTESSSQEKQCSECIVLLDTT 780
Query: 781 TDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRDASGAAQVVDRINNTVIEPPCRMVEM 840
T T+ E+LDA +K SHK+SNENDDTLKPLSRDASGA V DRINN + EPPC MV M
Sbjct: 781 THTVTEKLDASYKDV--SHKDSNENDDTLKPLSRDASGAVLVGDRINNKLTEPPCHMVGM 840
Query: 841 EGIY--NVDSSPKAVACENHQNFPVEKHTSDLLLEENDAKKPAVEVISNAEIEFAEIEFT 900
E Y VDSSPK VA ENHQNFPV++ +SD+LLEENDA+KPAVEVISN AEIEFT
Sbjct: 841 EDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQKPAVEVISN-----AEIEFT 900
Query: 901 EDELTNRISILEQERLNLGNEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEA 960
EDELTNRI ILEQERLNLG+EQKRLERNAESV SEMFAECQELLQMFGLPYIIAPMEAEA
Sbjct: 901 EDELTNRIXILEQERLNLGDEQKRLERNAESVXSEMFAECQELLQMFGLPYIIAPMEAEA 960
Query: 961 QCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLI 1020
QCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDK+I
Sbjct: 961 QCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKII 1020
Query: 1021 RMALLLGSDYTEGISGIGIVNAIEVMNAFPEEGGLHKFKEWIESPDPSILGTLGAQTGLS 1080
RMALLLGSDYTEGISGIGIVNAIEVMNAFPEE GL KFKEWIESPDPSILGTL AQTGLS
Sbjct: 1021 RMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTLSAQTGLS 1080
Query: 1081 ARKRGSKASENDMTCSNGSVRDGSASGESIFKAPKEKGSIDVKQSFMDKHRNVSKNWHIP 1140
+RKRGSKASE D TCSN SV DGSASGE I + KE +IDVKQSFM KHRNVSKNWHIP
Sbjct: 1081 SRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKE--NIDVKQSFMKKHRNVSKNWHIP 1140
Query: 1141 SAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLRRLCWEKFGWENSKADELLLPVLKEY 1200
S FPSE VISAYTCPQVDKSAE+FSWGKPD FVLRRLCWEKFGW+NSKADELLLPVLKEY
Sbjct: 1141 SEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLRRLCWEKFGWDNSKADELLLPVLKEY 1200
Query: 1201 SKHQTQLRLEAFYTFNERFAKIRSKRIKKAVRSITGSKSAVLMDDAVQGVSVNKQRELSV 1260
SKH+TQLRLE FYTF+ERFAKIRSKRIKKAVR ITGSKSAVLMDDAV+ VS NKQRELSV
Sbjct: 1201 SKHETQLRLETFYTFDERFAKIRSKRIKKAVRGITGSKSAVLMDDAVRAVSANKQRELSV 1260
Query: 1261 EPQENISEKCSSEIQGTCSNEDEVENRRRKPSRKRQLHGEPSQPAK-DKLTMKERGKRSR 1320
EPQE SEKCSSEIQG+CSN D+VE R KPSRKRQLHGE SQPAK KLTMKE+G R+R
Sbjct: 1261 EPQEK-SEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQKLTMKEKGNRNR 1320
Query: 1321 NEGSHKNERGRGRGKGRGRGRLASKGKTKGTPITELVGTSSSDDESEFDEQKFDLENLQE 1380
NEGSHKN RGRG KGRGRGRL KGK KG+P TELV TSSSDDE+EFD+QK D NL+E
Sbjct: 1321 NEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDENEFDDQKCDFVNLEE 1380
Query: 1381 PRERRRSARIQKSASSTMNDVDQPSGHSRDRFSDDEAKEHDVVRDRHALPETVISQSENT 1440
P+ERRRS+RI+KS S TM D DQPS ++ DRFS+DEAKEHDV+ D QSE T
Sbjct: 1381 PQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHD----------QSEKT 1440
Query: 1441 ECDFKTRKRSPQKDHLETGGGFCPVEDEMSQQGMCQNKDPSLEANIGEDYLSMGGGFCSD 1500
E D T KR PQ+D+ ETGGGFCPVEDEMS Q+ DPSLEAN EDYL MGGGFC D
Sbjct: 1441 ERDLGTPKRPPQEDYFETGGGFCPVEDEMS-----QDIDPSLEANNSEDYLRMGGGFCLD 1500
Query: 1501 DGNECVDPNSYPDQATFSEDPKDGSEDHPIQSTFHPE-YIGRVQNEEGTDAHVDSPPNVG 1560
D NEC+DP++YP +AT SED +D SE P QSTFHPE VQN+EGTDA VDS + G
Sbjct: 1501 DDNECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSSVQNKEGTDARVDSLLDTG 1560
Query: 1561 DSNPVSNPNSSQVVEGVQEEAKEHSVGAFGGALSAMPNLRRKKRR 1598
+ N V NPNSSQ EGVQEE K+HSV AFGGALSAMPNLRRK+R+
Sbjct: 1561 NPNRVCNPNSSQGGEGVQEEEKDHSVSAFGGALSAMPNLRRKRRK 1578
BLAST of Sgr021433 vs. NCBI nr
Match:
XP_022150078.1 (DNA repair protein UVH3 isoform X1 [Momordica charantia])
HSP 1 Score: 2413.6 bits (6254), Expect = 0.0e+00
Identity = 1324/1630 (81.23%), Postives = 1411/1630 (86.56%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAI-------------------------DASIWM 60
MGVQGLWELLAPVGRRVSVETLAGK+LAI DASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
Query: 61 VQFIKAMRDDRGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRD+RGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLRELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGTSE 180
ENAQAKVRKTAEKLLLNHLKAMRL+ELAEDLQNQKQQRRQDV KKK LPNH ADGTS
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
Query: 181 RNKGV--PSSGSHENLDEMLAASIMAEENGFLMSSASSFAGTTLAKEDSGEESILPLMHE 240
RNK + SSG HE LD MLAASIMAEENGF SS+SSF+G LAK++SGEESILPLM+E
Sbjct: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILPLMNE 240
Query: 241 VDPDVLSTLPSSIRHE-LQKQKYKNDSKDKKILSDEIHVVGSDSERMEVVSRSAYQKNLD 300
VDPDV STLPSSI++E LQKQKYKNDSK KKILSDEIH VGSD+ERMEV SR A+Q+NLD
Sbjct: 241 VDPDVFSTLPSSIQYELLQKQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD 300
Query: 301 EMLAASIAAEEAQSLNENASVSATANWDGEDTDDEDEEMILPEMHGIVDPSVLAALPPSV 360
EMLAASIAAEEA SLNENASVSA AN D EDTDDEDEEMILPEM G+VDPSVLAALPPSV
Sbjct: 301 EMLAASIAAEEAGSLNENASVSAAANLD-EDTDDEDEEMILPEMDGVVDPSVLAALPPSV 360
Query: 361 QLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGV 420
QLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGV
Sbjct: 361 QLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGV 420
Query: 421 GGVQTSKIASEANREFIFSSSFTGDKQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVP 480
GGVQTS+IASEANREFIFSSSFTGDKQVLAS R EK+GD+ L+APRGQQPLSSLNNTEVP
Sbjct: 421 GGVQTSRIASEANREFIFSSSFTGDKQVLAS-RIEKSGDEDLQAPRGQQPLSSLNNTEVP 480
Query: 481 STSNALARSTPDKTAVFEDNIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKN 540
STSNALARSTPDKT VFE+NIETFLDERGRVRVSRVRAMGM MTRDLERNLDLMKEIEKN
Sbjct: 481 STSNALARSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKN 540
Query: 541 ASANEVVNPEPVQNIEICNPKSFSSQSQVLDTPYEGVGESIKLNESSRGSMLNEDTAIEI 600
ASANEVVN EPVQN EICNPKS SSQSQ LDTPYEGV ES++L+ SRGSML+EDTAIEI
Sbjct: 541 ASANEVVNHEPVQNSEICNPKSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEI 600
Query: 601 LLEDKGDKSFDGDDDVFTHLAAENPIQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPK 660
LLED+GDKSFDGDDD+FTHLAAENPIQ+ASFD SSQKLS DGTTDSGWEE VEGKTYSPK
Sbjct: 601 LLEDEGDKSFDGDDDLFTHLAAENPIQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPK 660
Query: 661 NVEVDDHSFKEGRVSDDSEVEWEDGVCDQVNPVPFG-VESGKSVSKGSLEEEADLQEAIR 720
NVEVDDH F EGRVSD+SEVEWE+GVCD VNPVPFG ESGKSVSKGSLEEEADLQEAIR
Sbjct: 661 NVEVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIR 720
Query: 721 RSLADIGDRKSGPVFSEHQQPVIVGKMVEQCMVVENENVIELDKPDSADGMNCLKADDST 780
RSL D+GDRK G V SEHQ+P GKM+EQC V+NENVI L D ADGM+C KA+DST
Sbjct: 721 RSLKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDST 780
Query: 781 GRKETTESSSQEKQCSKSVVLLDTKTDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRD 840
GRKETTESSSQEKQCS+ +VLLDT T T+ E+LDA +K SHK+SNENDDTLKPLSRD
Sbjct: 781 GRKETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDV--SHKDSNENDDTLKPLSRD 840
Query: 841 ASGAAQVVDRINNTVIEPPCRMVEMEGIY--NVDSSPKAVACENHQNFPVEKHTSDLLLE 900
ASGA V DRINN + EPPC MV ME Y VDSSPK VA ENHQNFPV++ +SD+LLE
Sbjct: 841 ASGAVLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLE 900
Query: 901 ENDAKKPAVEVISNAEIEFAEIEFTEDELTNRISILEQERLNLGNEQKRLERNAESVSSE 960
ENDA+KPAVEVISN AEIEFTEDELTNRI ILEQERLNLG+EQKRLERNAESV SE
Sbjct: 901 ENDAQKPAVEVISN-----AEIEFTEDELTNRIXILEQERLNLGDEQKRLERNAESVXSE 960
Query: 961 MFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFD 1020
MFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFD
Sbjct: 961 MFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFD 1020
Query: 1021 DRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEGGL 1080
DRKYVETYFMKDVENELGLNRDK+IRMALLLGSDYTEGISGIGIVNAIEVMNAFPEE GL
Sbjct: 1021 DRKYVETYFMKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGL 1080
Query: 1081 HKFKEWIESPDPSILGTLGAQTGLSARKRGSKASENDMTCSNGSVRDGSASGESIFKAPK 1140
KFKEWIESPDPSILGTL AQTGLS+RKRGSKASE D TCSN SV DGSASGE I + K
Sbjct: 1081 QKFKEWIESPDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLK 1140
Query: 1141 EKGSIDVKQSFMDKHRNVSKNWHIPSAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLR 1200
E +IDVKQSFM KHRNVSKNWHIPS FPSE VISAYTCPQVDKSAE+FSWGKPD FVLR
Sbjct: 1141 E--NIDVKQSFMKKHRNVSKNWHIPSEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLR 1200
Query: 1201 RLCWEKFGWENSKADELLLPVLKEYSKHQTQLRLEAFYTFNERFAKIRSKRIKKAVRSIT 1260
RLCWEKFGW+NSKADELLLPVLKEYSKH+TQLRLE FYTF+ERFAKIRSKRIKKAVR IT
Sbjct: 1201 RLCWEKFGWDNSKADELLLPVLKEYSKHETQLRLETFYTFDERFAKIRSKRIKKAVRGIT 1260
Query: 1261 GSKSAVLMDDAVQGVSVNKQRELSVEPQENISEKCSSEIQGTCSNEDEVENRRRKPSRKR 1320
GSKSAVLMDDAV+ VS NKQRELSVEPQE SEKCSSEIQG+CSN D+VE R KPSRKR
Sbjct: 1261 GSKSAVLMDDAVRAVSANKQRELSVEPQEK-SEKCSSEIQGSCSNVDDVEKRLGKPSRKR 1320
Query: 1321 QLHGEPSQPAK-DKLTMKERGKRSRNEGSHKNERGRGRGKGRGRGRLASKGKTKGTPITE 1380
QLHGE SQPAK KLTMKE+G R+RNEGSHKN RGRG KGRGRGRL KGK KG+P TE
Sbjct: 1321 QLHGEQSQPAKGQKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTE 1380
Query: 1381 LVGTSSSDDESEFDEQKFDLENLQEPRERRRSARIQKSASSTMNDVDQPSGHSRDRFSDD 1440
LV TSSSDDE+EFD+QK D NL+EP+ERRRS+RI+KS S TM D DQPS ++ DRFS+D
Sbjct: 1381 LVETSSSDDENEFDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSND 1440
Query: 1441 EAKEHDVVRDRHALPETVISQSENTECDFKTRKRSPQKDHLETGGGFCPVEDEMSQQGMC 1500
EAKEHDV+ D QSE TE D T KR PQ+D+ ETGGGFCPVEDEMS
Sbjct: 1441 EAKEHDVIHD----------QSEKTERDLGTPKRPPQEDYFETGGGFCPVEDEMS----- 1500
Query: 1501 QNKDPSLEANIGEDYLSMGGGFCSDDGNECVDPNSYPDQATFSEDPKDGSEDHPIQSTFH 1560
Q+ DPSLEAN EDYL MGGGFC DD NEC+DP++YP +AT SED +D SE P QSTFH
Sbjct: 1501 QDIDPSLEANNSEDYLRMGGGFCLDDDNECIDPDAYPGRATVSEDLQDRSEHDPDQSTFH 1560
Query: 1561 PE-YIGRVQNEEGTDAHVDSPPNVGDSNPVSNPNSSQVVEGVQEEAKEHSVGAFGGALSA 1598
PE VQN+EGTDA VDS + G+ N V NPNSSQ EGVQEE K+HSV AFGGALSA
Sbjct: 1561 PEKCTSSVQNKEGTDARVDSLLDTGNPNRVCNPNSSQGGEGVQEEEKDHSVSAFGGALSA 1603
BLAST of Sgr021433 vs. NCBI nr
Match:
XP_022150080.1 (DNA repair protein UVH3 isoform X2 [Momordica charantia])
HSP 1 Score: 2409.0 bits (6242), Expect = 0.0e+00
Identity = 1322/1629 (81.15%), Postives = 1409/1629 (86.49%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAI-------------------------DASIWM 60
MGVQGLWELLAPVGRRVSVETLAGK+LAI DASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
Query: 61 VQFIKAMRDDRGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRD+RGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLRELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGTSE 180
ENAQAKVRKTAEKLLLNHLKAMRL+ELAEDLQNQKQQRRQDV KKK LPNH ADGTS
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
Query: 181 RNKGV--PSSGSHENLDEMLAASIMAEENGFLMSSASSFAGTTLAKEDSGEESILPLMHE 240
RNK + SSG HE LD MLAASIMAEENGF SS+SSF+G LAK++SGEESILPLM+E
Sbjct: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILPLMNE 240
Query: 241 VDPDVLSTLPSSIRHELQKQKYKNDSKDKKILSDEIHVVGSDSERMEVVSRSAYQKNLDE 300
VDPDV STLPSSI++EL QKYKNDSK KKILSDEIH VGSD+ERMEV SR A+Q+NLDE
Sbjct: 241 VDPDVFSTLPSSIQYEL-LQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDE 300
Query: 301 MLAASIAAEEAQSLNENASVSATANWDGEDTDDEDEEMILPEMHGIVDPSVLAALPPSVQ 360
MLAASIAAEEA SLNENASVSA AN D EDTDDEDEEMILPEM G+VDPSVLAALPPSVQ
Sbjct: 301 MLAASIAAEEAGSLNENASVSAAANLD-EDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ 360
Query: 361 LDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVG 420
LDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGVG
Sbjct: 361 LDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG 420
Query: 421 GVQTSKIASEANREFIFSSSFTGDKQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPS 480
GVQTS+IASEANREFIFSSSFTGDKQVLAS R EK+GD+ L+APRGQQPLSSLNNTEVPS
Sbjct: 421 GVQTSRIASEANREFIFSSSFTGDKQVLAS-RIEKSGDEDLQAPRGQQPLSSLNNTEVPS 480
Query: 481 TSNALARSTPDKTAVFEDNIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNA 540
TSNALARSTPDKT VFE+NIETFLDERGRVRVSRVRAMGM MTRDLERNLDLMKEIEKNA
Sbjct: 481 TSNALARSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNA 540
Query: 541 SANEVVNPEPVQNIEICNPKSFSSQSQVLDTPYEGVGESIKLNESSRGSMLNEDTAIEIL 600
SANEVVN EPVQN EICNPKS SSQSQ LDTPYEGV ES++L+ SRGSML+EDTAIEIL
Sbjct: 541 SANEVVNHEPVQNSEICNPKSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEIL 600
Query: 601 LEDKGDKSFDGDDDVFTHLAAENPIQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPKN 660
LED+GDKSFDGDDD+FTHLAAENPIQ+ASFD SSQKLS DGTTDSGWEE VEGKTYSPKN
Sbjct: 601 LEDEGDKSFDGDDDLFTHLAAENPIQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPKN 660
Query: 661 VEVDDHSFKEGRVSDDSEVEWEDGVCDQVNPVPFG-VESGKSVSKGSLEEEADLQEAIRR 720
VEVDDH F EGRVSD+SEVEWE+GVCD VNPVPFG ESGKSVSKGSLEEEADLQEAIRR
Sbjct: 661 VEVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRR 720
Query: 721 SLADIGDRKSGPVFSEHQQPVIVGKMVEQCMVVENENVIELDKPDSADGMNCLKADDSTG 780
SL D+GDRK G V SEHQ+P GKM+EQC V+NENVI L D ADGM+C KA+DSTG
Sbjct: 721 SLKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTG 780
Query: 781 RKETTESSSQEKQCSKSVVLLDTKTDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRDA 840
RKETTESSSQEKQCS+ +VLLDT T T+ E+LDA +K SHK+SNENDDTLKPLSRDA
Sbjct: 781 RKETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDV--SHKDSNENDDTLKPLSRDA 840
Query: 841 SGAAQVVDRINNTVIEPPCRMVEMEGIY--NVDSSPKAVACENHQNFPVEKHTSDLLLEE 900
SGA V DRINN + EPPC MV ME Y VDSSPK VA ENHQNFPV++ +SD+LLEE
Sbjct: 841 SGAVLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEE 900
Query: 901 NDAKKPAVEVISNAEIEFAEIEFTEDELTNRISILEQERLNLGNEQKRLERNAESVSSEM 960
NDA+KPAVEVISN AEIEFTEDELTNRI ILEQERLNLG+EQKRLERNAESV SEM
Sbjct: 901 NDAQKPAVEVISN-----AEIEFTEDELTNRIXILEQERLNLGDEQKRLERNAESVXSEM 960
Query: 961 FAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDD 1020
FAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDD
Sbjct: 961 FAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDD 1020
Query: 1021 RKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEGGLH 1080
RKYVETYFMKDVENELGLNRDK+IRMALLLGSDYTEGISGIGIVNAIEVMNAFPEE GL
Sbjct: 1021 RKYVETYFMKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQ 1080
Query: 1081 KFKEWIESPDPSILGTLGAQTGLSARKRGSKASENDMTCSNGSVRDGSASGESIFKAPKE 1140
KFKEWIESPDPSILGTL AQTGLS+RKRGSKASE D TCSN SV DGSASGE I + KE
Sbjct: 1081 KFKEWIESPDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKE 1140
Query: 1141 KGSIDVKQSFMDKHRNVSKNWHIPSAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLRR 1200
+IDVKQSFM KHRNVSKNWHIPS FPSE VISAYTCPQVDKSAE+FSWGKPD FVLRR
Sbjct: 1141 --NIDVKQSFMKKHRNVSKNWHIPSEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLRR 1200
Query: 1201 LCWEKFGWENSKADELLLPVLKEYSKHQTQLRLEAFYTFNERFAKIRSKRIKKAVRSITG 1260
LCWEKFGW+NSKADELLLPVLKEYSKH+TQLRLE FYTF+ERFAKIRSKRIKKAVR ITG
Sbjct: 1201 LCWEKFGWDNSKADELLLPVLKEYSKHETQLRLETFYTFDERFAKIRSKRIKKAVRGITG 1260
Query: 1261 SKSAVLMDDAVQGVSVNKQRELSVEPQENISEKCSSEIQGTCSNEDEVENRRRKPSRKRQ 1320
SKSAVLMDDAV+ VS NKQRELSVEPQE SEKCSSEIQG+CSN D+VE R KPSRKRQ
Sbjct: 1261 SKSAVLMDDAVRAVSANKQRELSVEPQEK-SEKCSSEIQGSCSNVDDVEKRLGKPSRKRQ 1320
Query: 1321 LHGEPSQPAK-DKLTMKERGKRSRNEGSHKNERGRGRGKGRGRGRLASKGKTKGTPITEL 1380
LHGE SQPAK KLTMKE+G R+RNEGSHKN RGRG KGRGRGRL KGK KG+P TEL
Sbjct: 1321 LHGEQSQPAKGQKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTEL 1380
Query: 1381 VGTSSSDDESEFDEQKFDLENLQEPRERRRSARIQKSASSTMNDVDQPSGHSRDRFSDDE 1440
V TSSSDDE+EFD+QK D NL+EP+ERRRS+RI+KS S TM D DQPS ++ DRFS+DE
Sbjct: 1381 VETSSSDDENEFDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDE 1440
Query: 1441 AKEHDVVRDRHALPETVISQSENTECDFKTRKRSPQKDHLETGGGFCPVEDEMSQQGMCQ 1500
AKEHDV+ D QSE TE D T KR PQ+D+ ETGGGFCPVEDEMS Q
Sbjct: 1441 AKEHDVIHD----------QSEKTERDLGTPKRPPQEDYFETGGGFCPVEDEMS-----Q 1500
Query: 1501 NKDPSLEANIGEDYLSMGGGFCSDDGNECVDPNSYPDQATFSEDPKDGSEDHPIQSTFHP 1560
+ DPSLEAN EDYL MGGGFC DD NEC+DP++YP +AT SED +D SE P QSTFHP
Sbjct: 1501 DIDPSLEANNSEDYLRMGGGFCLDDDNECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHP 1560
Query: 1561 E-YIGRVQNEEGTDAHVDSPPNVGDSNPVSNPNSSQVVEGVQEEAKEHSVGAFGGALSAM 1598
E VQN+EGTDA VDS + G+ N V NPNSSQ EGVQEE K+HSV AFGGALSAM
Sbjct: 1561 EKCTSSVQNKEGTDARVDSLLDTGNPNRVCNPNSSQGGEGVQEEEKDHSVSAFGGALSAM 1601
BLAST of Sgr021433 vs. NCBI nr
Match:
XP_038903932.1 (DNA repair protein UVH3 isoform X2 [Benincasa hispida])
HSP 1 Score: 2258.0 bits (5850), Expect = 0.0e+00
Identity = 1249/1609 (77.63%), Postives = 1371/1609 (85.21%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
MGV GLWELLAPVGRRVSVETLAGK+LAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNH+K MRL+
Sbjct: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHIKVMRLK 120
Query: 121 ELAEDLQNQKQQRRQDVQKKKTLPNHNEIADG--TSERNKGVPSSGSHENLDEMLAASIM 180
ELAED+QNQKQQR+Q + KK TLP+ ++ +G TSE +G+P+ GS ENLDEMLAASIM
Sbjct: 121 ELAEDIQNQKQQRKQKLSKKSTLPSRDKNFNGTSTSESCEGIPNRGSLENLDEMLAASIM 180
Query: 181 AEENGFLMSSASSFAGTTLAKEDSGEESILPLMHEVDPDVLSTLPSSIRHELQKQKYKND 240
AEENG +SSASSF+G TLAKE GE SIL QKY N+
Sbjct: 181 AEENGLFLSSASSFSGATLAKEGGGEGSIL-----------------------NQKYNNE 240
Query: 241 SKDKKILSDEIHVVGSDSERMEVVSRSAYQKNLDEMLAASIAAEEAQSLNENASVSATAN 300
SK K+ILSDE ++VGSDSERMEV SRS +Q+NLDEMLAASIAAEEA+SLNENASVSA N
Sbjct: 241 SKGKEILSDETYIVGSDSERMEVASRSVHQQNLDEMLAASIAAEEARSLNENASVSAVTN 300
Query: 301 WDGEDTDDEDEEMILPEMHGIVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKD 360
DGEDTDDEDEEMILPEMHG+VDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKD
Sbjct: 301 LDGEDTDDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKD 360
Query: 361 PAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSKIASEANREFIFSSSFTGDK 420
PAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTS+IASEANREFIFSSSFTGDK
Sbjct: 361 PAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSRIASEANREFIFSSSFTGDK 420
Query: 421 QVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPSTSNALARSTPDKTAVFEDNIETFLD 480
QVLAST EKNGDK L+AP QQPLSSL NTE+PSTSN LA+STPDK+ FEDNIETFLD
Sbjct: 421 QVLASTIVEKNGDKDLQAPTVQQPLSSLKNTEIPSTSNPLAQSTPDKSGGFEDNIETFLD 480
Query: 481 ERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANEVVNPEPVQNIEICNPKSFSSQ 540
ERGRVRVSRV+AMGMHMTRDLERNLDLMKEIEKN SAN+ NPEP+QNIEICNP++FS Q
Sbjct: 481 ERGRVRVSRVKAMGMHMTRDLERNLDLMKEIEKNTSANKAANPEPIQNIEICNPENFSFQ 540
Query: 541 SQVLDTPYEGVGESI-KLNESSRGSMLNEDTAIEILLEDKGDKSFDGDDDVFTHLAAENP 600
SQVLDT EGVG SI KLNE MLNE+TAIEILLED+G KSFDGDDD+FT+LAAENP
Sbjct: 541 SQVLDTSDEGVGGSINKLNERGGEPMLNEETAIEILLEDEGGKSFDGDDDLFTNLAAENP 600
Query: 601 IQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPKNVEVDDHSFKEGRVSDDSEVEWEDG 660
I MASFDIS+QKLSLDGTTDSGWE+ VEGKTYSPKNVEVDDHSFKEG VSD+S+V+WEDG
Sbjct: 601 IGMASFDISTQKLSLDGTTDSGWEDAVEGKTYSPKNVEVDDHSFKEGTVSDESDVDWEDG 660
Query: 661 VCDQVNPVPFGVESGKSVSKGSLEEEADLQEAIRRSLADIGDRKSGPVFSEHQ--QPVIV 720
CD VNPVPF + +SVSKG LEEEADLQEAIRRSL D G KSG V SE Q QPVIV
Sbjct: 661 ACDHVNPVPFEADLAQSVSKGFLEEEADLQEAIRRSLEDRGYTKSGTVSSELQQPQPVIV 720
Query: 721 GKMVEQCMVVENENVIELDKPDSADGMNCLKADDSTGRKETTESSSQEKQCSKSVVLLDT 780
GK EQC V+NE++I LDK DSADGMNCL +DST + TESSSQEKQCS+ V+ LDT
Sbjct: 721 GKRAEQCTSVQNESMIGLDKLDSADGMNCLNFNDSTRTEGMTESSSQEKQCSEPVMSLDT 780
Query: 781 KTDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRDASGAAQVVDRINNTVIEPPCRMVE 840
KT TIAEQLDA + A S KESNEN+DTL+PLSRD GA QV DRINNTVI+PPCRMVE
Sbjct: 781 KTHTIAEQLDASYNVAKFSPKESNENNDTLEPLSRDTFGAVQVGDRINNTVIDPPCRMVE 840
Query: 841 MEGIY--NVDSSPKAVACENH--QNFPVEKHTSDLLLEENDAKKPAVEVISNAEIEFAEI 900
MEGIY SS K ACEN+ QN PV++H++DL L+ DAK +VE SN AEI
Sbjct: 841 MEGIYPPGNGSSRKPFACENNFKQNLPVDEHSNDLSLDIKDAKILSVEETSN-----AEI 900
Query: 901 EFTEDELTNRISILEQERLNLGNEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPME 960
E T+DEL NR S+LEQERLNLG+EQKRLERNAESV+SEMFAECQELLQMFGLPYIIAPME
Sbjct: 901 EVTDDELKNRFSVLEQERLNLGDEQKRLERNAESVNSEMFAECQELLQMFGLPYIIAPME 960
Query: 961 AEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRD 1020
AEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLN+D
Sbjct: 961 AEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNQD 1020
Query: 1021 KLIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEGGLHKFKEWIESPDPSILGTLGAQT 1080
KLIRMALLLGSDYTEGISGIGIVNA+EVMNAFPEE GLHKFKEWIESPDPSILGTLGA+T
Sbjct: 1021 KLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKT 1080
Query: 1081 GLSARKRGSKASENDMTCSNGSVRDGSASGESIFKAPKEKGSIDVKQSFMDKHRNVSKNW 1140
GL+AR+RGSKASENDMTCSN GSAS E+I K +E +I VKQSFMDKHRNVSKNW
Sbjct: 1081 GLTARRRGSKASENDMTCSN---TGGSASEENISKDLEE--NIAVKQSFMDKHRNVSKNW 1140
Query: 1141 HIPSAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLRRLCWEKFGWENSKADELLLPVL 1200
HIPS FPSEAVISAY CPQVDKSAE FSWGKPDHFVLRRLCWEKFGWENSKADELLLPVL
Sbjct: 1141 HIPSEFPSEAVISAYICPQVDKSAEPFSWGKPDHFVLRRLCWEKFGWENSKADELLLPVL 1200
Query: 1201 KEYSKHQTQLRLEAFYTFNERFAKIRSKRIKKAVRSITGSKSAVLMDDAVQGVSVNKQRE 1260
KEYSKH+TQLRLEAFYTFNERFAKIRSKRIKKAV+SITGS+SAVLMDDAV+ VSVN QRE
Sbjct: 1201 KEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSRSAVLMDDAVRDVSVNNQRE 1260
Query: 1261 LSVEPQENISEKCSSEIQGTCSNEDEVENRRRKPSRKRQLHGEPSQPAKD-KLTMKERGK 1320
LSVEP+EN+SEKCSSE Q CSNED+ R RKPSRKRQL GE +QP KD KLT KE+GK
Sbjct: 1261 LSVEPKENMSEKCSSERQDACSNEDD---RHRKPSRKRQLDGEQAQPGKDRKLTKKEKGK 1320
Query: 1321 RSRNEGSHKNE-RGRGRGKGRGRGRLASKGKTKGTPITELVGTSSSDDESEFDEQKFDLE 1380
SRNEGSH RGRGRGKGRGRGRL SKGK PITEL+ TSSSDDESEFD QKFDLE
Sbjct: 1321 PSRNEGSHSERGRGRGRGKGRGRGRLVSKGK---APITELIETSSSDDESEFDNQKFDLE 1380
Query: 1381 NLQEPRERRRSARIQKSASSTMNDVDQPSGHSRDRFSDDEAKEHDVVRDRHALPETVISQ 1440
N QEP+E+RRS+RI+KSAS T+++ DQ S H+ D FS+D+A+E V++ ++A PETV+SQ
Sbjct: 1381 NFQEPQEKRRSSRIRKSASYTIDNADQQSDHTGDEFSNDKAEEDRVIQGQYAHPETVMSQ 1440
Query: 1441 SENTECDFKTRKRSPQKDHLETGGGFCPVEDEMSQQGMCQNKDPSLEANIGEDYLSMGGG 1500
SEN E + KRSPQ D+L+TGGGFC VEDEMS+Q MCQNKDP+LEAN EDYL+MGGG
Sbjct: 1441 SENMESGSGSPKRSPQNDYLKTGGGFCLVEDEMSRQEMCQNKDPALEANNSEDYLTMGGG 1500
Query: 1501 FCSDDGNECVDPNSYPDQATFSEDPKDGSEDHPIQSTFHPE-YIGRVQNEEGTDAHVDSP 1560
FC DD +E +DP ++P+QAT E PKDG E+ P QST PE ++G E TDA V+S
Sbjct: 1501 FCLDDDDERIDPVAHPNQATVLEVPKDGFENDPGQSTVSPEKHVG----VEDTDARVESV 1560
Query: 1561 PNVGDSNPVSNPNSSQVVEGVQEEAKEHSV-GAFGGALSAMPNLRRKKR 1597
+VG+ NPV+N NSSQV E VQEE K+HSV AFGGALSAMPNL+RK++
Sbjct: 1561 LDVGNPNPVNNSNSSQVGEDVQEEPKDHSVRRAFGGALSAMPNLKRKRK 1566
BLAST of Sgr021433 vs. NCBI nr
Match:
XP_022928520.1 (DNA repair protein UVH3 isoform X1 [Cucurbita moschata])
HSP 1 Score: 2244.2 bits (5814), Expect = 0.0e+00
Identity = 1252/1615 (77.52%), Postives = 1356/1615 (83.96%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
MGV GLWELLAPVGRRVSVETLAGK+LAIDASIWMVQFIKAMRD+RGEMVRNAHLLGFFR
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDERGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATP+LKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRLR
Sbjct: 61 RICKLLFLRTKPVFVFDGATPSLKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKEMRLR 120
Query: 121 ELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGT--SERNKGVPSSGSHENLDEMLAASIM 180
ELAE ++NQKQQR+QDV KKKTL NHNEI DGT SER+K VP+SG+HENLD M+AASIM
Sbjct: 121 ELAEGIKNQKQQRKQDVPKKKTLLNHNEIVDGTSVSERSKSVPNSGNHENLDGMVAASIM 180
Query: 181 AEENGFLMSSASSFAGTTLAKEDSGEESILPLMHEVDPDVLSTLPSSIRHELQKQKYKND 240
EENGF SSA SF+G TL K+D GE+SIL QKYKND
Sbjct: 181 IEENGFFSSSAPSFSGVTLPKKDRGEQSIL-----------------------NQKYKND 240
Query: 241 SKDKKILSDEIHVVGSDSERMEVVSRSAYQKNLDEMLAASIAAEEAQSLNENASVSATAN 300
SK KKILSDEIHVVGSDSERMEV SRSA+Q+NLDEMLAASIAAEEA+ LNEN SVS+ AN
Sbjct: 241 SKGKKILSDEIHVVGSDSERMEVASRSAHQQNLDEMLAASIAAEEARGLNENISVSSAAN 300
Query: 301 WDGEDTD--DEDEEMILPEMHGIVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVK 360
GED D DEDEEMILPEMHG+VDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVK
Sbjct: 301 LAGEDMDDEDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVK 360
Query: 361 KDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSKIASEANREFIFSSSFTG 420
KDPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGVGGVQTS+IASEANREFIFSSSFTG
Sbjct: 361 KDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTG 420
Query: 421 DKQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPSTSNALARSTPDKTAVFEDNIETF 480
DKQVL STRAEKNGDK L+ PR QQ LSSLNNT++PSTSN LA+STPDK+ VFEDNIETF
Sbjct: 421 DKQVLTSTRAEKNGDKNLQEPRVQQSLSSLNNTDIPSTSNGLAQSTPDKSGVFEDNIETF 480
Query: 481 LDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANEVVNPEPVQNIEICNPKSFS 540
LDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKN +AN+V NPEP+QNIEICNP+S S
Sbjct: 481 LDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNINANKVANPEPMQNIEICNPESSS 540
Query: 541 SQSQVLDTPYEGVGESI-KLNESSRGSMLNEDTAIEILLEDKGDKSFDGDDDVFTHLAAE 600
+SQVLD EG+ ESI KL+E SMLNEDTAIEILLE +G KSFDGDDD+FTHLAAE
Sbjct: 541 LRSQVLDVSNEGIDESINKLDERGADSMLNEDTAIEILLEGEGGKSFDGDDDLFTHLAAE 600
Query: 601 NPIQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPKNVEVDDHSFKEGRVSDDSEVEWE 660
NPIQMASFDISSQKLS DGTTDSGW+E + EG +SD+SEV+WE
Sbjct: 601 NPIQMASFDISSQKLSQDGTTDSGWKEAL------------------EGTISDESEVDWE 660
Query: 661 DGVCDQVNPVPFGVESGKSVSKGSLEEEADLQEAIRRSLADIGDRKSGPVFSEHQ----Q 720
DGVCD VNPVPF ESGKSVSKGSLEEEADLQEAIRRSL D+GD KSGPV EH+ Q
Sbjct: 661 DGVCDHVNPVPFEDESGKSVSKGSLEEEADLQEAIRRSLEDVGDGKSGPVSLEHEQPQSQ 720
Query: 721 PVIVGKMVEQCMVVENENVIELDKPDSADGMNCLKADDSTGRKETTESSSQEKQCSKSVV 780
P IVGKM EQC VENENVI L+K DS DGMN A DS +K TESSSQEKQCS+ VV
Sbjct: 721 PSIVGKMAEQCTSVENENVIGLEKMDSVDGMNWSNAKDSILKKGMTESSSQEKQCSEPVV 780
Query: 781 LLDTKTDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRDASGAAQVVDRINNTVIEPPC 840
LLDT TIAEQLDA +K + S +ESNE+ DTLK LSRDA A QV D IN+T+IEP C
Sbjct: 781 LLDT---TIAEQLDASYKDTSFSLQESNESSDTLKSLSRDAPRATQVGDMINSTMIEPAC 840
Query: 841 RMVEMEGIY--NVDSSPKAVACENH--QNFPVEKHTSDLLLEENDAKKPAVEVISNAEIE 900
RMVEM+G+ +VDSS K A ENH QN PVEKH+SDLLLEE K V EI
Sbjct: 841 RMVEMDGVNTPDVDSSTKDSAFENHFKQNLPVEKHSSDLLLEEEVGKGHTV------EIS 900
Query: 901 FAEIEFTEDELTNRISILEQERLNLGNEQKRLERNAESVSSEMFAECQELLQMFGLPYII 960
AE E TEDEL +RISILEQERLNLG+EQKRLERNAE+VSSEMFAECQELLQMFGLPYII
Sbjct: 901 KAETEVTEDELKSRISILEQERLNLGDEQKRLERNAEAVSSEMFAECQELLQMFGLPYII 960
Query: 961 APMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELG 1020
APMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELG
Sbjct: 961 APMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELG 1020
Query: 1021 LNRDKLIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEGGLHKFKEWIESPDPSILGTL 1080
L+R+KLIRMALLLGSDYTEGISGIGIVNA+EVMNAFPEE GLHKFKEWIESPDPSILGTL
Sbjct: 1021 LDRNKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTL 1080
Query: 1081 GAQTGLSARKRGSKASENDMTCSNGSVRDGSASGESIFKAPKEKGSIDVKQSFMDKHRNV 1140
GA+TGLSARKRG KASEND CSN SVRDGSAS E+I K KE +IDVKQ+FM KHRNV
Sbjct: 1081 GAKTGLSARKRGQKASENDAPCSNSSVRDGSASEENIDKDLKE--NIDVKQNFMVKHRNV 1140
Query: 1141 SKNWHIPSAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLRRLCWEKFGWENSKADELL 1200
SKNWHIPS FPSEAVISAY PQVDKSAE FSWGKPDHFVLRRLC EKFGWENSKADELL
Sbjct: 1141 SKNWHIPSEFPSEAVISAYISPQVDKSAEPFSWGKPDHFVLRRLCLEKFGWENSKADELL 1200
Query: 1201 LPVLKEYSKHQTQLRLEAFYTFNERFAKIRSKRIKKAVRSITGSKSAVLMDDAVQGVSVN 1260
LPVLKEY KH+TQLRLEAFYTFNERFAKIRSKRIKKAV+SITGSKSA LMD+ V VSVN
Sbjct: 1201 LPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSKSASLMDETVPNVSVN 1260
Query: 1261 KQRELSVEPQENISEKCSSEIQGTCSNEDEVENRRRKPSRKRQLHGEPSQPAKD-KLTMK 1320
Q LS E Q+N+SEKCSSEIQG CSNED V+NR RKPSRKRQL E SQPAKD KLTMK
Sbjct: 1261 NQINLSGETQKNMSEKCSSEIQGACSNEDNVDNRLRKPSRKRQLDREQSQPAKDRKLTMK 1320
Query: 1321 ERGKRSRNEGSHKNE-RGRGRGKGRGRGRLASKGKTKGTPITELVGTSSSDDESEFDEQK 1380
E+GKRSRNEGSH RGRGRGKGRGRGRLA KGK +PITE VGTSSSDDESEFD+QK
Sbjct: 1321 EKGKRSRNEGSHSERGRGRGRGKGRGRGRLALKGK---SPITEFVGTSSSDDESEFDDQK 1380
Query: 1381 FDLENLQEPRERRRSARIQKSASSTMNDV--DQPSGHSRDRFSDDEAKEHDVVRDRHALP 1440
DLEN+QEP+ERR+S+R++KSAS M+D DQPS HS R S+DEA + +VV+ + P
Sbjct: 1381 IDLENVQEPQERRKSSRVRKSASYKMDDADQDQPSDHSGYRLSNDEANDDNVVQGGYTGP 1440
Query: 1441 ETVISQSENTECDFKTRKRSPQKDHLETGGGFCPVEDEMSQQGMCQNKDPSLEANIGEDY 1500
ETV+ SENTECD++ KRSP +D+L TGGGFCP EDEMS++ MCQNKDP+LEA+ EDY
Sbjct: 1441 ETVMIHSENTECDYEIPKRSPLRDYLGTGGGFCPTEDEMSREAMCQNKDPALEASNSEDY 1500
Query: 1501 LSMGGGFCSDDGNECVDPNSYPDQATFSEDPKDGSEDHPIQSTFHPEY-IGRVQNEEGTD 1560
L++GGGFC DD NECVDP ++ DQAT SE PKDGSED P QSTFHPE IG Q E T
Sbjct: 1501 LTLGGGFCLDDDNECVDPVAHLDQATASEVPKDGSEDDPDQSTFHPEKDIGGNQLNEDTY 1560
Query: 1561 AHVDSPPNVGDSNPVSNPNSSQVVEGVQEEAKEHSVGAFGGALSAMPNLRRKKRR 1598
H +S +VGD NP S PNSS+V EGVQEE K+HSV AFGGALSAMPNLRRK++R
Sbjct: 1561 PHGESLLDVGDPNPASFPNSSRVGEGVQEEPKDHSVRAFGGALSAMPNLRRKRKR 1560
BLAST of Sgr021433 vs. ExPASy Swiss-Prot
Match:
Q9ATY5 (DNA repair protein UVH3 OS=Arabidopsis thaliana OX=3702 GN=UVH3 PE=2 SV=1)
HSP 1 Score: 1002.3 bits (2590), Expect = 7.5e-291
Identity = 720/1655 (43.50%), Postives = 944/1655 (57.04%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
MGVQGLWELLAPVGRRVSVETLA KRLAIDASIWMVQFIKAMRD++G+MV+NAHL+GFFR
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLANKRLAIDASIWMVQFIKAMRDEKGDMVQNAHLIGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKP+FVFDGATPALKRRT+IARRRQRENAQ K+RKTAEKLLLN LK +RL+
Sbjct: 61 RICKLLFLRTKPIFVFDGATPALKRRTVIARRRQRENAQTKIRKTAEKLLLNRLKDIRLK 120
Query: 121 ELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGTSERNKGVPSSGSHENLDEMLAASIMAE 180
E A+D++NQ+ ++ + KK ++ + E N VP ++ + AS E
Sbjct: 121 EQAKDIKNQRLKQDDSDRVKK------RVSSDSVEDNLRVPVE------EDDVGASFFQE 180
Query: 181 ENGFLMSSASSFAGTTLAKEDSGEESILPLMHEVDPDVLSTLPSSIRHELQKQKYKNDSK 240
E +S AS L+ E D ++ K+ K+D K
Sbjct: 181 EKLDEVSQAS-------------------LVGETGVD-----------DVVKESVKDDPK 240
Query: 241 DKKILSDEIHVVGSDSERM---EVVSRSAYQKNLDEMLAASIAAEEAQSLNENASVSATA 300
K +L D G D + + V YQ+ LDEMLAAS+AAEE ++ AS SA A
Sbjct: 241 GKGVLLD-----GDDLDNLVQDSSVQGKDYQEKLDEMLAASLAAEEERNFTSKASTSAAA 300
Query: 301 ---NWDGEDTDDEDEEMILPEMHGIVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQR 360
D E+ D DEE++LP M G +DP+VLA+LPPS+QLDLL QMRE+LMAENRQKYQ+
Sbjct: 301 IPSEEDEEEDSDGDEEILLPVMDGNIDPAVLASLPPSMQLDLLAQMREKLMAENRQKYQK 360
Query: 361 VKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSKIASEANREFIFSSSF 420
VKK P KFSELQI+AYLKTVAFRR+I++VQ++A GR VGGVQTS+IASEANREFIFSSSF
Sbjct: 361 VKKAPEKFSELQIEAYLKTVAFRREINEVQRSAGGRAVGGVQTSRIASEANREFIFSSSF 420
Query: 421 TGDKQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPSTSNALARSTPDKTAVFEDNIE 480
GDK+VLAS R +N + + + P+ S+ N S+A D+ ++NIE
Sbjct: 421 AGDKEVLASAREGRNDENQKKTSQQSLPV-SVKNASPLKKSDATIELDRDEPKNPDENIE 480
Query: 481 TFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANEVVNPEPVQNIEICNPKS 540
++DERGR R+ R R MG+ MTRD++RNL LMKE E+ AS + N E E +
Sbjct: 481 VYIDERGRFRI-RNRHMGIQMTRDIQRNLHLMKEKERTASGSMAKNDETFSAWE-----N 540
Query: 541 FSSQSQVLD-TPYEGVGESIKLNESSRGSMLNEDTAIEILLE-DKGDKSFDGDDDVFTHL 600
F ++ Q L+ +P E + + L + SML+ ++IEI + D G K + +DD+F L
Sbjct: 541 FPTEDQFLEKSPVE--KDVVDLEIQNDDSMLHPPSSIEISFDHDGGGKDLNDEDDMFLQL 600
Query: 601 AAENPIQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPKNVEV---DDHSFKEGRVSDD 660
AA P+ ++S + ++ + +DS WEE + S +E + H K+ +S
Sbjct: 601 AAGGPVTISSTENDPKEDTSPWASDSDWEEVPVEQNTSVSKLEANLSNQHIPKD--ISIA 660
Query: 661 SEVEWEDGVCDQVNPVPFGVESG--KSVSKGSLEEEADLQEAIRRSLADIGDRKSGPVFS 720
V WE+ C N VE+ ++KG LEEEADLQEAI++SL ++ D++SG V
Sbjct: 661 EGVAWEEYSCKNANN---SVENDTVTKITKGYLEEEADLQEAIKKSLLELHDKESGDVLE 720
Query: 721 EHQQ---PVIVGKMVEQCMVVENENVIELDKPDSADGMNCLK-------------ADDST 780
E+Q ++V K E + E V E ++ D + LK A ++
Sbjct: 721 ENQSVRVNLVVDKPSEDSL-CSRETVGEAEEERFLDEITILKTSGAISEQSNTSVAGNAD 780
Query: 781 GRKETTES-----SSQEKQCSKSV---------VLLDTKTDTIAEQLDAPFKGAASSHKE 840
G+K T+ SS S +V V+ K +A Q + A H E
Sbjct: 781 GQKGITKQFGTHPSSGSNNVSHAVSNKLSKVKSVISPEKALNVASQ-NRMLSTMAKQHNE 840
Query: 841 SNENDDTLKPLSRDASGAAQ-----VVDRINNTVIEPPCRMVE--------MEGIYNVDS 900
+ + A A +D +N E M + ++ +
Sbjct: 841 EGSESFGGESVKVSAMPIADEEITGFLDEKDNADGESSIMMDDKRDYSRRKIQSLVTESR 900
Query: 901 SPKAVACENHQNFPVEKHTSDLLLEENDAKKPAVEVISNAEIE--FAEIEFTEDELTNRI 960
P + + + + EEN++ + + S+ + E +EF+E + I
Sbjct: 901 DPSRNVVRSRIGILHDTDSQNERREENNSNEHTFNIDSSTDFEEKGVPVEFSEANIEEEI 960
Query: 961 SILEQERLNLGNEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELA 1020
+L+QE ++LG+EQ++LERNAESVSSEMFAECQELLQ+FG+PYIIAPMEAEAQCA+ME +
Sbjct: 961 RVLDQEFVSLGDEQRKLERNAESVSSEMFAECQELLQIFGIPYIIAPMEAEAQCAFMEQS 1020
Query: 1021 NLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGS 1080
NLVDG+VTDDSDVFLFGARSVYKNIFDDRKYVETYFMKD+E ELGL+RDK+IRMA+LLGS
Sbjct: 1021 NLVDGIVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDIEKELGLSRDKIIRMAMLLGS 1080
Query: 1081 DYTEGISGIGIVNAIEVMNAFPEEGGLHKFKEWIESPDPSILGTLGAQTGLSARKRGSKA 1140
DYTEGISGIGIVNAIEV+ AFPEE GL KF+EW+ESPDP+ILG A+TG +KRGS +
Sbjct: 1081 DYTEGISGIGIVNAIEVVTAFPEEDGLQKFREWVESPDPTILGKTDAKTGSKVKKRGSAS 1140
Query: 1141 SENDMTCSNGSVRDGSASGESIFKAPKEKGSIDVKQSFMDKHRNVSKNWHIPSAFPSEAV 1200
+N S S D + ++KQ FMD+HR VSKNWHIP FPSEAV
Sbjct: 1141 VDNKGIISGASTDD----------------TEEIKQIFMDQHRKVSKNWHIPLTFPSEAV 1200
Query: 1201 ISAYTCPQVDKSAEAFSWGKPDHFVLRRLCWEKFGWENSKADELLLPVLKEYSKHQTQLR 1260
ISAY PQVD S E FSWGKPD VLR+LCWEKF W K DELLLPVLKEY K +TQLR
Sbjct: 1201 ISAYLNPQVDLSTEKFSWGKPDLSVLRKLCWEKFNWNGKKTDELLLPVLKEYEKRETQLR 1260
Query: 1261 LEAFYTFNERFAKIRSKRIKKAVRSITGSKSAVLMDDAVQGVSVNKQRELSVEPQENISE 1320
+EAFY+FNERFAKIRSKRI KAV+ I G S+ + D +Q K+ + V P E
Sbjct: 1261 IEAFYSFNERFAKIRSKRINKAVKGIGGGLSSDVADHTLQ-EGPRKRNKKKVAPHE---- 1320
Query: 1321 KCSSEIQGTCSNEDEVENRRRKPSRKRQLHGEPSQPAKDKLTMKERGKRSRNEGSHKNER 1380
+E T + + N + K RKR E+ SR R
Sbjct: 1321 ---TEDNNTSDKDSPIANEKVKNKRKR----------------LEKPSSSRG-------R 1380
Query: 1381 GRGRGKGRGRGRLASKGKTKGTPITELVGTSSSDDESEFDEQKFDLENLQEPRERRRSAR 1440
GR + +GRGRGR+ + EL SS DD+ D++ +LE + A
Sbjct: 1381 GRAQKRGRGRGRVQK-------DLLELSDGSSDDDDD--DDKVVELE--------AKPAN 1440
Query: 1441 IQKSASSTMNDVDQPSGHSRDRFSDDEAKEHDVVRDRHALPETVISQSENTECDFKTRKR 1500
+QKS S N V S D + + E + + E I ++ +
Sbjct: 1441 LQKSTRS-RNPV-MYSAKEDDELDESRSNEGSPSENFEEVDEGRIGNDDSVDASIND--- 1478
Query: 1501 SPQKDHLETGGGFCPVEDEMSQQGMCQNKDPSLEANIGEDYLSMGGGFCSDDGNECVDPN 1560
P +D+++TGGGFC DE + G D LE +DY +GGGFC D+ +E + N
Sbjct: 1501 CPSEDYIQTGGGFC--ADEADEIG-----DAHLEDKATDDYRVIGGGFCVDE-DETAEEN 1478
Query: 1561 SYPDQATFSEDPKDGSEDHPIQSTFHPEYIGRVQNEEGTDAHVDSPPNVGDSNPVSNPNS 1598
+ D A E K SE+ + G+ +NEE DA +D
Sbjct: 1561 TMDDDA---EILKMESEEQRKK--------GKRRNEE--DASLD---------------- 1478
BLAST of Sgr021433 vs. ExPASy Swiss-Prot
Match:
F4KHA8 (WAT1-related protein At5g40230 OS=Arabidopsis thaliana OX=3702 GN=At5g40230 PE=3 SV=1)
HSP 1 Score: 285.0 bits (728), Expect = 6.1e-75
Identity = 156/305 (51.15%), Postives = 212/305 (69.51%), Query Frame = 0
Query: 1597 REFAPLAGMIAAECATVGSNTVYKAISTQEISYYVFTFYTCLAAALVLLPFAFIFRRSGV 1656
R+ P M+A EC TVGSNT++KA + + +S+YVF FYT + A LVLLP + IF RS
Sbjct: 17 RDVVPFTAMVAVECVTVGSNTLFKAATLRGLSFYVFVFYTYVVATLVLLPLSLIFGRSKR 76
Query: 1657 FPSDKLSSFLLRLIFLSAMGVACQLFAYKGLEYSSPTLASAISNLIPALTFIFAVLFGME 1716
PS K F + L+ +G + KG+EYSSPTLASAISNL PA TF AV+F ME
Sbjct: 77 LPSAKTPVF-FNIFLLALVGFMSLIVGCKGIEYSSPTLASAISNLTPAFTFTLAVIFRME 136
Query: 1717 KLALKGSSSIAKIIGSVVSISGALVVVLYKGPVILSNPFSGP--TRLNLPHHPLGSTQPN 1776
++ L+ S++ AKIIG++VSISGALVV+LYKGP +L++ P ++L H L S +
Sbjct: 137 QIVLRSSATQAKIIGTIVSISGALVVILYKGPKVLTDASLTPPSPTISLYQH-LTSFDSS 196
Query: 1777 WIMGGLCFFAQYLLNSFWYIILTQMVNMYPDELAVVCLYYVFEAIIAAPICLLVEGNLSA 1836
WI+GGL QYLL S WYI+ T+++ +YP+E+ VV LY + +I+AP+CL E +L++
Sbjct: 197 WIIGGLLLATQYLLVSVWYILQTRVMELYPEEITVVFLYNLCATLISAPVCLFAEKDLNS 256
Query: 1837 WKLKNGLELVAVLNSGCVGQSFVTAIHTWGVHVKGPVYVSSFRPLSIAIAAATGVIFLGD 1896
+ LK G+ L +V+ SG + SF + IHTWG+H+KGPVY+S F+PLSI IA A GV+FLGD
Sbjct: 257 FILKPGVSLASVMYSGGLVSSFGSVIHTWGLHLKGPVYISLFKPLSIVIAVAMGVMFLGD 316
Query: 1897 DLYLG 1900
LYLG
Sbjct: 317 ALYLG 319
BLAST of Sgr021433 vs. ExPASy Swiss-Prot
Match:
Q9FL08 (WAT1-related protein At5g40240 OS=Arabidopsis thaliana OX=3702 GN=At5g40240 PE=2 SV=1)
HSP 1 Score: 282.3 bits (721), Expect = 3.9e-74
Identity = 152/304 (50.00%), Postives = 205/304 (67.43%), Query Frame = 0
Query: 1597 REFAPLAGMIAAECATVGSNTVYKAISTQEISYYVFTFYTCLAAALVLLPFAFIFRRSGV 1656
R+ P A M A ECATVGSNT++KA + + +S+YVF FY+ + + L+LLP + IF RS
Sbjct: 16 RDVVPFAAMFAVECATVGSNTLFKAATLRGLSFYVFVFYSYIVSTLLLLPLSVIFGRSRR 75
Query: 1657 FPSDKLSSFLLRLIFLSAMGVACQLFAYKGLEYSSPTLASAISNLIPALTFIFAVLFGME 1716
P+ K S ++ L +G Q+ KG+ YSSPTLASAISNL PA TF AV+F ME
Sbjct: 76 LPAAK-SPLFFKIFLLGLVGFMSQIAGCKGIAYSSPTLASAISNLTPAFTFTLAVIFRME 135
Query: 1717 KLALKGSSSIAKIIGSVVSISGALVVVLYKGP-VILSNPFSGPTRLNLPHHPLGSTQPNW 1776
++ L+ S++ AKIIG+++SISGALVVVLYKGP V+ S F+ H L S + +W
Sbjct: 136 QVRLRSSATQAKIIGAILSISGALVVVLYKGPQVLASASFTTVLPTVTLHQQLTSIESSW 195
Query: 1777 IMGGLCFFAQYLLNSFWYIILTQMVNMYPDELAVVCLYYVFEAIIAAPICLLVEGNLSAW 1836
I+GGL +QY L S WYI+ T+++ +YP+E+ VV Y +F +I+ P+CL E NL++W
Sbjct: 196 IIGGLLLASQYFLISVWYILQTRVMEVYPEEITVVFFYNLFATLISVPVCLFAESNLTSW 255
Query: 1837 KLKNGLELVAVLNSGCVGQSFVTAIHTWGVHVKGPVYVSSFRPLSIAIAAATGVIFLGDD 1896
LK + L A++ SG F HTWG+H+KGPVY+S FRPLSIAIA A G IFLGD
Sbjct: 256 VLKPDISLAAIIYSGVFVSLFSALTHTWGLHLKGPVYISLFRPLSIAIAVAMGAIFLGDA 315
Query: 1897 LYLG 1900
L+LG
Sbjct: 316 LHLG 318
BLAST of Sgr021433 vs. ExPASy Swiss-Prot
Match:
P35689 (DNA excision repair protein ERCC-5 OS=Mus musculus OX=10090 GN=Ercc5 PE=1 SV=4)
HSP 1 Score: 272.7 bits (696), Expect = 3.1e-71
Identity = 324/1252 (25.88%), Postives = 523/1252 (41.77%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
MGVQGLW+LL G RVS E L GK LA+D SIW+ Q +K +RD G ++ NAHLL F
Sbjct: 1 MGVQGLWKLLECSGHRVSPEALEGKVLAVDISIWLNQALKGVRDSHGNVIENAHLLTLFH 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
R+CKLLF R +P+FVFDG P LK++TL RR+++++A RKT EKLL LK L+
Sbjct: 61 RLCKLLFFRIRPIFVFDGDAPLLKKQTLAKRRQRKDSASIDSRKTTEKLLKTFLKRQALK 120
Query: 121 ELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGTSERNKGVPSSGSHENLDEMLAASIMAE 180
S R++ PS
Sbjct: 121 TAFR-----------------------------SSRHEAPPSL----------------- 180
Query: 181 ENGFLMSSASSFAGTTLAKEDSGEESILPLMHEVDPDVLSTLPSSIRHELQKQKYKNDSK 240
T + ++D D VL LP +++K+ ++ +
Sbjct: 181 --------------TQVQRQD-------------DIYVLPPLP-------EEEKHSSEEE 240
Query: 241 DKKILSDEIHVVGSDSERMEVVSRSAYQKNLDEMLAASIAAEEAQSLNENASVSATANWD 300
D+K +Q +D + Q+L E N
Sbjct: 241 DEK----------------------QWQARMD----------QKQALQE----EFFHNPQ 300
Query: 301 GEDTDDEDEEMILPEMHGIVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKDPA 360
D + ED ++LPP V+ ++L M+E R ++ + ++
Sbjct: 301 AIDIESED----------------FSSLPPEVKHEILTDMKE-FTKRRRTLFEAMPEESN 360
Query: 361 KFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSKIASEANREFIFSSSFTGD--- 420
FS+ Q++ LK + I+ VQK + G +Q R++ F +
Sbjct: 361 DFSQYQLKGLLKKNYLNQHIENVQKEMNQQHSGQIQ---------RQYQDEGGFLKEVES 420
Query: 421 KQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPSTSNALARSTPDKTAVFEDNIETFL 480
++V++ + KG++ + + +++ +PS+SN + S+ K++ E
Sbjct: 421 RRVVSEDTSHYILIKGIQGKK----VMDVDSESLPSSSNVHSVSSNLKSSPHE------- 480
Query: 481 DERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANEVVNPEPVQNIEICNPKSFSS 540
+V+ R R L L + + ++S +E + E Q+ E N + +
Sbjct: 481 ----KVKPEREPEAAPPSPRTL---LAIQAAMLGSSSEDEPESREGRQSKE-RNSGATAD 540
Query: 541 QSQVLDTPYEGVGESIKLNESSRGSMLNEDTAIEILL------EDKGDKSFDGDDDVFTH 600
+ + +++ + + S ++D A ++LL E+ D++ + V
Sbjct: 541 AGSISPRTCAAIQKALDDDNDEKVSGSSDDLAEKMLLGSGLEQEEHADETAERGGGVPFD 600
Query: 601 LAAENPIQMASFDISSQKLSLDGTTDSGWE-ETVEGKTYSPKNVEVDDHSFKE-GRVSDD 660
A P + + S +G TDS T + +PK + KE ++S +
Sbjct: 601 TAPLTPSVTEVKECVTSGSSANGQTDSAHSFTTASHRCDTPKETVSLARAVKEASQISSE 660
Query: 661 SEVEWEDGVCDQVNPVPFGVESGKSVSKGSLEEEADLQEAIRRSLAD--IGDRKSGPVFS 720
EVE G ++P G S VS E E L R+ +D I P
Sbjct: 661 CEVE---GRPAALSPAFIGTPS-SHVSGVLSEREPTLAPPTTRTHSDQGIDIHPEDPELQ 720
Query: 721 EHQQPVIVGKMVEQCMVVENENVIELDKPDSADGMNCLKADDSTGRKETTESSSQEKQCS 780
P + K + ++E + A + A+ + + S+++E+
Sbjct: 721 NGLYP-LETKCNSSRLSSDDETEGGQNPAPKACSTVHVPAEAMSNLENALPSNAEERGDF 780
Query: 781 KSVVLLDTKTDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRDASGAAQVVDRINNTVI 840
+ + L + A +L + K ES E S +V ++N+ +
Sbjct: 781 QETIQLREVPEAAARELISAPKPMGPMEMESEE--------SESDGSFIEVQSVVSNSEL 840
Query: 841 EPPCRMVEMEGIYNVDSSPKAVACENHQNFPVEKHTSDLLLEENDAKKPAVEVISNAEIE 900
+ P+ E + T LL + +D + A+E A+I+
Sbjct: 841 QTESSEASTHLSEKDAEEPRETLEEG-----TSRDTECLLQDSSDIE--AMEGHREADID 900
Query: 901 FAEI-----EFTEDELTNRISILEQERLNLGNEQKRLERNAESVSSEMFAECQELLQMFG 960
++ + +EL S L E+ +L ++++ +R A SV+ +MF E QELL++FG
Sbjct: 901 AEDMPNEWQDINLEELDALESNLLAEQNSLKAQKQQQDRIAASVTGQMFLESQELLRLFG 960
Query: 961 LPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDV 1020
+PYI APMEAEAQCA ++L + G +TDDSD++LFGAR VYKN F+ K+VE Y D
Sbjct: 961 VPYIQAPMEAEAQCAMLDLTDQTSGTITDDSDIWLFGARHVYKNFFNKNKFVEYYQYVDF 1020
Query: 1021 ENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEG--GLHKFKEWIESPD 1080
++LGL+R+KLI +A LLGSDYTEGI +G V A+E++N FP G L KF EW
Sbjct: 1021 YSQLGLDRNKLINLAYLLGSDYTEGIPTVGCVTAMEILNEFPGRGLDPLLKFSEWWHE-- 1021
Query: 1081 PSILGTLGAQTGLSARKRGSKASENDMTCSNGSVRDGSASGESIFKAPKEKGSIDVKQSF 1140
+ K +EN +
Sbjct: 1081 ---------------AQNNKKVAEN---------------------------------PY 1021
Query: 1141 MDKHRNVSKNWHIPSAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLRRLCWEKFGWEN 1200
K + + + FP+ AV AY P VD S +F WGKPD + C FGW
Sbjct: 1141 DTKVKKKLRKLQLTPGFPNPAVADAYLRPVVDDSRGSFLWGKPDVDKISTFCQRYFGWNR 1021
Query: 1201 SKADELLLPVLKEYSKHQTQLRLEAFYTFNERFAK----IRSKRIKKAVRSI 1229
K DE L PVLK + HQTQLR+++F+ ++ + I+S R+ +AV I
Sbjct: 1201 MKTDESLYPVLKHLNAHQTQLRIDSFFRLAQQEKQDAKLIKSHRLNRAVTCI 1021
BLAST of Sgr021433 vs. ExPASy Swiss-Prot
Match:
P14629 (DNA excision repair protein ERCC-5 homolog OS=Xenopus laevis OX=8355 GN=ercc5 PE=2 SV=1)
HSP 1 Score: 272.7 bits (696), Expect = 3.1e-71
Identity = 343/1393 (24.62%), Postives = 595/1393 (42.71%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
MGVQGLW+LL GR ++ TL GK LA+D SIW+ Q +K RD +G ++NAHLL F
Sbjct: 1 MGVQGLWKLLECSGRPINPGTLEGKILAVDISIWLNQAVKGARDRQGNAIQNAHLLTLFH 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRL- 120
R+CKLLF R +P+FVFDG P LKR+TL RR++ + A RKT EKLL LK +
Sbjct: 61 RLCKLLFFRIRPIFVFDGEAPLLKRQTLAKRRQRTDKASNDARKTNEKLLRTFLKRQAIK 120
Query: 121 ------RELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGTSERNKGVPSSGSHENLDEML 180
++ E+L + Q R++ + LP + + +SE + +E +
Sbjct: 121 AALSGNKQSNEELPSFSQVPRKETEDLYILPPLEDNENNSSEEEE-------EREWEERM 180
Query: 181 AASIMAEENGFLMSSASSFAGTTLAKEDSGEESILPLMHEVDPDVLSTLPSSIRHELQKQ 240
+E+ F S+ +++ + +LP ++HE+
Sbjct: 181 NQKQRLQEDFFANPSSV----------------------DIESEEFKSLPPEVKHEILTD 240
Query: 241 KYKNDSKDKKILSDEIHVVGSDSERME---VVSRSAYQKNLDEMLAASIAAEEAQSLNEN 300
K+ +K ++ L + + SD + + ++ ++ K +D + + LN+
Sbjct: 241 -MKDFTKRRRTLFEAMPEDSSDFSQYQLKGLLKKNDLNKCIDNV---------RKELNQQ 300
Query: 301 ASVSATANWDGED------------TDDEDEEMILPEMHGIVDPSVLAALPPSVQLDLLV 360
S A ++ E ++D+ +++ + + + + P S+ +
Sbjct: 301 YSGEVQAQFESEGGFLKEVETRRLVSEDDSHYILIKGIQSKQEEKKVDSPPQSITFNSSQ 360
Query: 361 QMRERL-----MAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVG 420
+ L A + Q + A S + A + +A D ++ +K + V
Sbjct: 361 TPKTYLDLKLASAHKTKPLQTSSAEAAPPSPRTLFAIQEAMAESWDHEKHEKPS----VS 420
Query: 421 GVQTSKIASEANREFIFSSSFTGDKQVLASTRA-EKNGDK-GLEAPRGQQPLSSLNNTEV 480
G + S + I+ QVLA A E N K L++ ++P T+V
Sbjct: 421 GCEAEGNVSPRTLQAIY--------QVLAEDEAGESNKIKVVLQSDEERKP-----KTKV 480
Query: 481 PSTSNALARSTPDKTAVFEDNIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEK 540
S++ D ++D +T L S ++++ + E D + +
Sbjct: 481 LVISSS---DEEDDCLNYQDGTKTTLG------ASLIKSISPSSMQCQESTADSLPNYTR 540
Query: 541 NASANEVVNPEPVQNIEICNPKSFSSQSQVLDTPYEGVGESIKLNESSRGSMLNEDTAIE 600
+ +++ P N++ N +++ +++ P +G K S +N + I
Sbjct: 541 SKPVSQIEEPMADHNLQGDNCNVPNAKDKLIVPP--SLGNVDKPIILSNTIPVNSEFRIP 600
Query: 601 ILLEDKGDKSFDGDDDVFTHLAAENPIQMASFDISSQKLSLDGTTDSGWEETVEGKTYSP 660
+L + + T + N + S S L D T + V SP
Sbjct: 601 LLPVNMSMRE--------TVIIPNNTGSLGSSRYIS--LERDATKQGFSDNPVGDLVRSP 660
Query: 661 KNVEVDDHSFKEGRVSDDSEVEWEDGVCDQVNPVPFGVESG--KSVSKGSLEEEADLQEA 720
++ S +SD + +C+ + + G ++ + + E
Sbjct: 661 DEPALNASS----ALSDRKTSATQSLLCNNIECTEQSMVQGCSNTLDVTQTTQPSGGSEV 720
Query: 721 IRRSLADIGDRKSGPVFSEHQQPVIVGKMVEQCMVVENENVIELDKPDSADGMNCLKADD 780
+ + + D+K VF + + M + ++V +E +
Sbjct: 721 NKPAEYNPQDKK---VFGSNDSSAMYVPMTPESIIVSDEEFV------------------ 780
Query: 781 STGRKETTESSSQEKQCSKSVVLLDTKTDTIAEQLDAPFKGAASSHKESNENDDTLKPLS 840
EK+ D+ +D ++D+ F + S H E DT
Sbjct: 781 ------------NEKE--------DSDSDDSFIEVDSEFSTSNSQHVVFKEPGDT----- 840
Query: 841 RDASGAAQVVDRINNTVIEPPCRMVEMEGIYNVDSSPKAVACENHQNFPVEKHTSDLLLE 900
R+ + Q V+ N+ Q+ P+E + + +
Sbjct: 841 RETATNFQAVEEGNS----------------------------GSQDIPLEHDSGEPHEQ 900
Query: 901 ENDAKKPAVEVISNAEIEFAEIEFTE-DELTNRISILEQERLNLGNEQKRLERNAESVSS 960
N + ++ +SN E+ +I E + L N + + ++ +L +Q++ ER A +V+
Sbjct: 901 SNSEESKDLDDVSN---EWQDISVEELESLENNLYV---QQTSLQAQQQQQERIAATVTG 960
Query: 961 EMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIF 1020
+M E QELLQ+FG+PYI+APMEAEAQCA ++L + G +TDDSD++LFGAR VYKN F
Sbjct: 961 QMCLESQELLQLFGIPYIVAPMEAEAQCAILDLTDQTSGTITDDSDIWLFGARHVYKNFF 1020
Query: 1021 DDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEG- 1080
K+VE Y D+ N+LGL+R KLI +A LLGSDYTEGI +G V+A+E++N FP +G
Sbjct: 1021 SQNKHVEYYQYADIHNQLGLDRSKLINLAYLLGSDYTEGIPTVGYVSAMEILNEFPGQGL 1080
Query: 1081 -GLHKFKEWIESPDPSILGTLGAQTGLSARKRGSKASENDMTCSNGSVRDGSASGESIFK 1140
L KFKEW + + + + + ND K
Sbjct: 1081 EPLVKFKEWWSE---------------AQKDKKMRPNPNDT------------------K 1140
Query: 1141 APKEKGSIDVKQSFMDKHRNVSKNWHIPSAFPSEAVISAYTCPQVDKSAEAFSWGKPDHF 1200
K+ +D++QS FP+ AV SAY P VD+S AFSWG+PD
Sbjct: 1141 VKKKLRLLDLQQS-----------------FPNPAVASAYLKPVVDESKSAFSWGRPDLE 1167
Query: 1201 VLRRLCWEKFGWENSKADELLLPVLKEYSKHQTQLRLEAFYTFNE-RFAKIRSKRIKKAV 1260
+R C +FGW K DE+LLPVLK+ + QTQLR+++F+ + A ++S+R+++AV
Sbjct: 1201 QIREFCESRFGWYRLKTDEVLLPVLKQLNAQQTQLRIDSFFRLEQHEAAGLKSQRLRRAV 1167
Query: 1261 RSITGSKSAVLMDDAVQGVSVNKQRELSVEPQENISEKCSSEIQGTCSNEDEVENRRRKP 1320
+ + V ++ V+V + +C+++ +G +N +RRKP
Sbjct: 1261 TCMKRKERDVEAEEVEAAVAV-------------MERECTNQRKGQKTNTKSQGTKRRKP 1167
Query: 1321 SRKRQLHGEPSQPAKDKLTMKERGKRSRNEGSHKNERGRGRGKGRGRGRLASKGKTKGTP 1359
+ Q +P K ++GS + G + + G+ K +
Sbjct: 1321 TECSQEDQDPGGGFIGIELKTLSSKAYSSDGSSSDAEDLPSGLIDKQSQSGIVGRQKAS- 1167
BLAST of Sgr021433 vs. ExPASy TrEMBL
Match:
A0A6J1DAD7 (DNA repair protein UVH3 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111018345 PE=3 SV=1)
HSP 1 Score: 2427.5 bits (6290), Expect = 0.0e+00
Identity = 1324/1605 (82.49%), Postives = 1411/1605 (87.91%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
MGVQGLWELLAPVGRRVSVETLAGK+LAIDASIWMVQFIKAMRD+RGEMVRNAHLLGFFR
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDERGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRL+
Sbjct: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLK 120
Query: 121 ELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGTSERNKGV--PSSGSHENLDEMLAASIM 180
ELAEDLQNQKQQRRQDV KKK LPNH ADGTS RNK + SSG HE LD MLAASIM
Sbjct: 121 ELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSGRNKSITTTSSGDHEKLDGMLAASIM 180
Query: 181 AEENGFLMSSASSFAGTTLAKEDSGEESILPLMHEVDPDVLSTLPSSIRHE-LQKQKYKN 240
AEENGF SS+SSF+G LAK++SGEESILPLM+EVDPDV STLPSSI++E LQKQKYKN
Sbjct: 181 AEENGFFTSSSSSFSGAALAKDNSGEESILPLMNEVDPDVFSTLPSSIQYELLQKQKYKN 240
Query: 241 DSKDKKILSDEIHVVGSDSERMEVVSRSAYQKNLDEMLAASIAAEEAQSLNENASVSATA 300
DSK KKILSDEIH VGSD+ERMEV SR A+Q+NLDEMLAASIAAEEA SLNENASVSA A
Sbjct: 241 DSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDEMLAASIAAEEAGSLNENASVSAAA 300
Query: 301 NWDGEDTDDEDEEMILPEMHGIVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKK 360
N D EDTDDEDEEMILPEM G+VDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKK
Sbjct: 301 NLD-EDTDDEDEEMILPEMDGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKK 360
Query: 361 DPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSKIASEANREFIFSSSFTGD 420
DPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGVGGVQTS+IASEANREFIFSSSFTGD
Sbjct: 361 DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGD 420
Query: 421 KQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPSTSNALARSTPDKTAVFEDNIETFL 480
KQVLAS R EK+GD+ L+APRGQQPLSSLNNTEVPSTSNALARSTPDKT VFE+NIETFL
Sbjct: 421 KQVLAS-RIEKSGDEDLQAPRGQQPLSSLNNTEVPSTSNALARSTPDKTGVFEENIETFL 480
Query: 481 DERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANEVVNPEPVQNIEICNPKSFSS 540
DERGRVRVSRVRAMGM MTRDLERNLDLMKEIEKNASANEVVN EPVQN EICNPKS SS
Sbjct: 481 DERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNASANEVVNHEPVQNSEICNPKSHSS 540
Query: 541 QSQVLDTPYEGVGESIKLNESSRGSMLNEDTAIEILLEDKGDKSFDGDDDVFTHLAAENP 600
QSQ LDTPYEGV ES++L+ SRGSML+EDTAIEILLED+GDKSFDGDDD+FTHLAAENP
Sbjct: 541 QSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEILLEDEGDKSFDGDDDLFTHLAAENP 600
Query: 601 IQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPKNVEVDDHSFKEGRVSDDSEVEWEDG 660
IQ+ASFD SSQKLS DGTTDSGWEE VEGKTYSPKNVEVDDH F EGRVSD+SEVEWE+G
Sbjct: 601 IQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPKNVEVDDHPFVEGRVSDESEVEWEEG 660
Query: 661 VCDQVNPVPFG-VESGKSVSKGSLEEEADLQEAIRRSLADIGDRKSGPVFSEHQQPVIVG 720
VCD VNPVPFG ESGKSVSKGSLEEEADLQEAIRRSL D+GDRK G V SEHQ+P G
Sbjct: 661 VCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRRSLKDVGDRKPGSVLSEHQKPESAG 720
Query: 721 KMVEQCMVVENENVIELDKPDSADGMNCLKADDSTGRKETTESSSQEKQCSKSVVLLDTK 780
KM+EQC V+NENVI L D ADGM+C KA+DSTGRKETTESSSQEKQCS+ +VLLDT
Sbjct: 721 KMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTGRKETTESSSQEKQCSECIVLLDTT 780
Query: 781 TDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRDASGAAQVVDRINNTVIEPPCRMVEM 840
T T+ E+LDA +K SHK+SNENDDTLKPLSRDASGA V DRINN + EPPC MV M
Sbjct: 781 THTVTEKLDASYKDV--SHKDSNENDDTLKPLSRDASGAVLVGDRINNKLTEPPCHMVGM 840
Query: 841 EGIY--NVDSSPKAVACENHQNFPVEKHTSDLLLEENDAKKPAVEVISNAEIEFAEIEFT 900
E Y VDSSPK VA ENHQNFPV++ +SD+LLEENDA+KPAVEVISN AEIEFT
Sbjct: 841 EDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEENDAQKPAVEVISN-----AEIEFT 900
Query: 901 EDELTNRISILEQERLNLGNEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEA 960
EDELTNRI ILEQERLNLG+EQKRLERNAESV SEMFAECQELLQMFGLPYIIAPMEAEA
Sbjct: 901 EDELTNRIXILEQERLNLGDEQKRLERNAESVXSEMFAECQELLQMFGLPYIIAPMEAEA 960
Query: 961 QCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLI 1020
QCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDK+I
Sbjct: 961 QCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKII 1020
Query: 1021 RMALLLGSDYTEGISGIGIVNAIEVMNAFPEEGGLHKFKEWIESPDPSILGTLGAQTGLS 1080
RMALLLGSDYTEGISGIGIVNAIEVMNAFPEE GL KFKEWIESPDPSILGTL AQTGLS
Sbjct: 1021 RMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQKFKEWIESPDPSILGTLSAQTGLS 1080
Query: 1081 ARKRGSKASENDMTCSNGSVRDGSASGESIFKAPKEKGSIDVKQSFMDKHRNVSKNWHIP 1140
+RKRGSKASE D TCSN SV DGSASGE I + KE +IDVKQSFM KHRNVSKNWHIP
Sbjct: 1081 SRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKE--NIDVKQSFMKKHRNVSKNWHIP 1140
Query: 1141 SAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLRRLCWEKFGWENSKADELLLPVLKEY 1200
S FPSE VISAYTCPQVDKSAE+FSWGKPD FVLRRLCWEKFGW+NSKADELLLPVLKEY
Sbjct: 1141 SEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLRRLCWEKFGWDNSKADELLLPVLKEY 1200
Query: 1201 SKHQTQLRLEAFYTFNERFAKIRSKRIKKAVRSITGSKSAVLMDDAVQGVSVNKQRELSV 1260
SKH+TQLRLE FYTF+ERFAKIRSKRIKKAVR ITGSKSAVLMDDAV+ VS NKQRELSV
Sbjct: 1201 SKHETQLRLETFYTFDERFAKIRSKRIKKAVRGITGSKSAVLMDDAVRAVSANKQRELSV 1260
Query: 1261 EPQENISEKCSSEIQGTCSNEDEVENRRRKPSRKRQLHGEPSQPAK-DKLTMKERGKRSR 1320
EPQE SEKCSSEIQG+CSN D+VE R KPSRKRQLHGE SQPAK KLTMKE+G R+R
Sbjct: 1261 EPQEK-SEKCSSEIQGSCSNVDDVEKRLGKPSRKRQLHGEQSQPAKGQKLTMKEKGNRNR 1320
Query: 1321 NEGSHKNERGRGRGKGRGRGRLASKGKTKGTPITELVGTSSSDDESEFDEQKFDLENLQE 1380
NEGSHKN RGRG KGRGRGRL KGK KG+P TELV TSSSDDE+EFD+QK D NL+E
Sbjct: 1321 NEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTELVETSSSDDENEFDDQKCDFVNLEE 1380
Query: 1381 PRERRRSARIQKSASSTMNDVDQPSGHSRDRFSDDEAKEHDVVRDRHALPETVISQSENT 1440
P+ERRRS+RI+KS S TM D DQPS ++ DRFS+DEAKEHDV+ D QSE T
Sbjct: 1381 PQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDEAKEHDVIHD----------QSEKT 1440
Query: 1441 ECDFKTRKRSPQKDHLETGGGFCPVEDEMSQQGMCQNKDPSLEANIGEDYLSMGGGFCSD 1500
E D T KR PQ+D+ ETGGGFCPVEDEMS Q+ DPSLEAN EDYL MGGGFC D
Sbjct: 1441 ERDLGTPKRPPQEDYFETGGGFCPVEDEMS-----QDIDPSLEANNSEDYLRMGGGFCLD 1500
Query: 1501 DGNECVDPNSYPDQATFSEDPKDGSEDHPIQSTFHPE-YIGRVQNEEGTDAHVDSPPNVG 1560
D NEC+DP++YP +AT SED +D SE P QSTFHPE VQN+EGTDA VDS + G
Sbjct: 1501 DDNECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHPEKCTSSVQNKEGTDARVDSLLDTG 1560
Query: 1561 DSNPVSNPNSSQVVEGVQEEAKEHSVGAFGGALSAMPNLRRKKRR 1598
+ N V NPNSSQ EGVQEE K+HSV AFGGALSAMPNLRRK+R+
Sbjct: 1561 NPNRVCNPNSSQGGEGVQEEEKDHSVSAFGGALSAMPNLRRKRRK 1578
BLAST of Sgr021433 vs. ExPASy TrEMBL
Match:
A0A6J1D8H5 (DNA repair protein UVH3 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111018345 PE=4 SV=1)
HSP 1 Score: 2413.6 bits (6254), Expect = 0.0e+00
Identity = 1324/1630 (81.23%), Postives = 1411/1630 (86.56%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAI-------------------------DASIWM 60
MGVQGLWELLAPVGRRVSVETLAGK+LAI DASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
Query: 61 VQFIKAMRDDRGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRD+RGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLRELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGTSE 180
ENAQAKVRKTAEKLLLNHLKAMRL+ELAEDLQNQKQQRRQDV KKK LPNH ADGTS
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
Query: 181 RNKGV--PSSGSHENLDEMLAASIMAEENGFLMSSASSFAGTTLAKEDSGEESILPLMHE 240
RNK + SSG HE LD MLAASIMAEENGF SS+SSF+G LAK++SGEESILPLM+E
Sbjct: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILPLMNE 240
Query: 241 VDPDVLSTLPSSIRHE-LQKQKYKNDSKDKKILSDEIHVVGSDSERMEVVSRSAYQKNLD 300
VDPDV STLPSSI++E LQKQKYKNDSK KKILSDEIH VGSD+ERMEV SR A+Q+NLD
Sbjct: 241 VDPDVFSTLPSSIQYELLQKQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLD 300
Query: 301 EMLAASIAAEEAQSLNENASVSATANWDGEDTDDEDEEMILPEMHGIVDPSVLAALPPSV 360
EMLAASIAAEEA SLNENASVSA AN D EDTDDEDEEMILPEM G+VDPSVLAALPPSV
Sbjct: 301 EMLAASIAAEEAGSLNENASVSAAANLD-EDTDDEDEEMILPEMDGVVDPSVLAALPPSV 360
Query: 361 QLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGV 420
QLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGV
Sbjct: 361 QLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGV 420
Query: 421 GGVQTSKIASEANREFIFSSSFTGDKQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVP 480
GGVQTS+IASEANREFIFSSSFTGDKQVLAS R EK+GD+ L+APRGQQPLSSLNNTEVP
Sbjct: 421 GGVQTSRIASEANREFIFSSSFTGDKQVLAS-RIEKSGDEDLQAPRGQQPLSSLNNTEVP 480
Query: 481 STSNALARSTPDKTAVFEDNIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKN 540
STSNALARSTPDKT VFE+NIETFLDERGRVRVSRVRAMGM MTRDLERNLDLMKEIEKN
Sbjct: 481 STSNALARSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKN 540
Query: 541 ASANEVVNPEPVQNIEICNPKSFSSQSQVLDTPYEGVGESIKLNESSRGSMLNEDTAIEI 600
ASANEVVN EPVQN EICNPKS SSQSQ LDTPYEGV ES++L+ SRGSML+EDTAIEI
Sbjct: 541 ASANEVVNHEPVQNSEICNPKSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEI 600
Query: 601 LLEDKGDKSFDGDDDVFTHLAAENPIQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPK 660
LLED+GDKSFDGDDD+FTHLAAENPIQ+ASFD SSQKLS DGTTDSGWEE VEGKTYSPK
Sbjct: 601 LLEDEGDKSFDGDDDLFTHLAAENPIQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPK 660
Query: 661 NVEVDDHSFKEGRVSDDSEVEWEDGVCDQVNPVPFG-VESGKSVSKGSLEEEADLQEAIR 720
NVEVDDH F EGRVSD+SEVEWE+GVCD VNPVPFG ESGKSVSKGSLEEEADLQEAIR
Sbjct: 661 NVEVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIR 720
Query: 721 RSLADIGDRKSGPVFSEHQQPVIVGKMVEQCMVVENENVIELDKPDSADGMNCLKADDST 780
RSL D+GDRK G V SEHQ+P GKM+EQC V+NENVI L D ADGM+C KA+DST
Sbjct: 721 RSLKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDST 780
Query: 781 GRKETTESSSQEKQCSKSVVLLDTKTDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRD 840
GRKETTESSSQEKQCS+ +VLLDT T T+ E+LDA +K SHK+SNENDDTLKPLSRD
Sbjct: 781 GRKETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDV--SHKDSNENDDTLKPLSRD 840
Query: 841 ASGAAQVVDRINNTVIEPPCRMVEMEGIY--NVDSSPKAVACENHQNFPVEKHTSDLLLE 900
ASGA V DRINN + EPPC MV ME Y VDSSPK VA ENHQNFPV++ +SD+LLE
Sbjct: 841 ASGAVLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLE 900
Query: 901 ENDAKKPAVEVISNAEIEFAEIEFTEDELTNRISILEQERLNLGNEQKRLERNAESVSSE 960
ENDA+KPAVEVISN AEIEFTEDELTNRI ILEQERLNLG+EQKRLERNAESV SE
Sbjct: 901 ENDAQKPAVEVISN-----AEIEFTEDELTNRIXILEQERLNLGDEQKRLERNAESVXSE 960
Query: 961 MFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFD 1020
MFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFD
Sbjct: 961 MFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFD 1020
Query: 1021 DRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEGGL 1080
DRKYVETYFMKDVENELGLNRDK+IRMALLLGSDYTEGISGIGIVNAIEVMNAFPEE GL
Sbjct: 1021 DRKYVETYFMKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGL 1080
Query: 1081 HKFKEWIESPDPSILGTLGAQTGLSARKRGSKASENDMTCSNGSVRDGSASGESIFKAPK 1140
KFKEWIESPDPSILGTL AQTGLS+RKRGSKASE D TCSN SV DGSASGE I + K
Sbjct: 1081 QKFKEWIESPDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLK 1140
Query: 1141 EKGSIDVKQSFMDKHRNVSKNWHIPSAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLR 1200
E +IDVKQSFM KHRNVSKNWHIPS FPSE VISAYTCPQVDKSAE+FSWGKPD FVLR
Sbjct: 1141 E--NIDVKQSFMKKHRNVSKNWHIPSEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLR 1200
Query: 1201 RLCWEKFGWENSKADELLLPVLKEYSKHQTQLRLEAFYTFNERFAKIRSKRIKKAVRSIT 1260
RLCWEKFGW+NSKADELLLPVLKEYSKH+TQLRLE FYTF+ERFAKIRSKRIKKAVR IT
Sbjct: 1201 RLCWEKFGWDNSKADELLLPVLKEYSKHETQLRLETFYTFDERFAKIRSKRIKKAVRGIT 1260
Query: 1261 GSKSAVLMDDAVQGVSVNKQRELSVEPQENISEKCSSEIQGTCSNEDEVENRRRKPSRKR 1320
GSKSAVLMDDAV+ VS NKQRELSVEPQE SEKCSSEIQG+CSN D+VE R KPSRKR
Sbjct: 1261 GSKSAVLMDDAVRAVSANKQRELSVEPQEK-SEKCSSEIQGSCSNVDDVEKRLGKPSRKR 1320
Query: 1321 QLHGEPSQPAK-DKLTMKERGKRSRNEGSHKNERGRGRGKGRGRGRLASKGKTKGTPITE 1380
QLHGE SQPAK KLTMKE+G R+RNEGSHKN RGRG KGRGRGRL KGK KG+P TE
Sbjct: 1321 QLHGEQSQPAKGQKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTE 1380
Query: 1381 LVGTSSSDDESEFDEQKFDLENLQEPRERRRSARIQKSASSTMNDVDQPSGHSRDRFSDD 1440
LV TSSSDDE+EFD+QK D NL+EP+ERRRS+RI+KS S TM D DQPS ++ DRFS+D
Sbjct: 1381 LVETSSSDDENEFDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSND 1440
Query: 1441 EAKEHDVVRDRHALPETVISQSENTECDFKTRKRSPQKDHLETGGGFCPVEDEMSQQGMC 1500
EAKEHDV+ D QSE TE D T KR PQ+D+ ETGGGFCPVEDEMS
Sbjct: 1441 EAKEHDVIHD----------QSEKTERDLGTPKRPPQEDYFETGGGFCPVEDEMS----- 1500
Query: 1501 QNKDPSLEANIGEDYLSMGGGFCSDDGNECVDPNSYPDQATFSEDPKDGSEDHPIQSTFH 1560
Q+ DPSLEAN EDYL MGGGFC DD NEC+DP++YP +AT SED +D SE P QSTFH
Sbjct: 1501 QDIDPSLEANNSEDYLRMGGGFCLDDDNECIDPDAYPGRATVSEDLQDRSEHDPDQSTFH 1560
Query: 1561 PE-YIGRVQNEEGTDAHVDSPPNVGDSNPVSNPNSSQVVEGVQEEAKEHSVGAFGGALSA 1598
PE VQN+EGTDA VDS + G+ N V NPNSSQ EGVQEE K+HSV AFGGALSA
Sbjct: 1561 PEKCTSSVQNKEGTDARVDSLLDTGNPNRVCNPNSSQGGEGVQEEEKDHSVSAFGGALSA 1603
BLAST of Sgr021433 vs. ExPASy TrEMBL
Match:
A0A6J1D7H7 (DNA repair protein UVH3 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111018345 PE=4 SV=1)
HSP 1 Score: 2409.0 bits (6242), Expect = 0.0e+00
Identity = 1322/1629 (81.15%), Postives = 1409/1629 (86.49%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAI-------------------------DASIWM 60
MGVQGLWELLAPVGRRVSVETLAGK+LAI DASIWM
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIGIFKSLLSNHTSAIFKMFCLFLSFEDASIWM 60
Query: 61 VQFIKAMRDDRGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
VQFIKAMRD+RGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR
Sbjct: 61 VQFIKAMRDERGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIARRRQR 120
Query: 121 ENAQAKVRKTAEKLLLNHLKAMRLRELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGTSE 180
ENAQAKVRKTAEKLLLNHLKAMRL+ELAEDLQNQKQQRRQDV KKK LPNH ADGTS
Sbjct: 121 ENAQAKVRKTAEKLLLNHLKAMRLKELAEDLQNQKQQRRQDVPKKKNLPNHKRTADGTSG 180
Query: 181 RNKGV--PSSGSHENLDEMLAASIMAEENGFLMSSASSFAGTTLAKEDSGEESILPLMHE 240
RNK + SSG HE LD MLAASIMAEENGF SS+SSF+G LAK++SGEESILPLM+E
Sbjct: 181 RNKSITTTSSGDHEKLDGMLAASIMAEENGFFTSSSSSFSGAALAKDNSGEESILPLMNE 240
Query: 241 VDPDVLSTLPSSIRHELQKQKYKNDSKDKKILSDEIHVVGSDSERMEVVSRSAYQKNLDE 300
VDPDV STLPSSI++EL QKYKNDSK KKILSDEIH VGSD+ERMEV SR A+Q+NLDE
Sbjct: 241 VDPDVFSTLPSSIQYEL-LQKYKNDSKGKKILSDEIHAVGSDTERMEVASRGAHQQNLDE 300
Query: 301 MLAASIAAEEAQSLNENASVSATANWDGEDTDDEDEEMILPEMHGIVDPSVLAALPPSVQ 360
MLAASIAAEEA SLNENASVSA AN D EDTDDEDEEMILPEM G+VDPSVLAALPPSVQ
Sbjct: 301 MLAASIAAEEAGSLNENASVSAAANLD-EDTDDEDEEMILPEMDGVVDPSVLAALPPSVQ 360
Query: 361 LDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVG 420
LDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGVG
Sbjct: 361 LDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVG 420
Query: 421 GVQTSKIASEANREFIFSSSFTGDKQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPS 480
GVQTS+IASEANREFIFSSSFTGDKQVLAS R EK+GD+ L+APRGQQPLSSLNNTEVPS
Sbjct: 421 GVQTSRIASEANREFIFSSSFTGDKQVLAS-RIEKSGDEDLQAPRGQQPLSSLNNTEVPS 480
Query: 481 TSNALARSTPDKTAVFEDNIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNA 540
TSNALARSTPDKT VFE+NIETFLDERGRVRVSRVRAMGM MTRDLERNLDLMKEIEKNA
Sbjct: 481 TSNALARSTPDKTGVFEENIETFLDERGRVRVSRVRAMGMRMTRDLERNLDLMKEIEKNA 540
Query: 541 SANEVVNPEPVQNIEICNPKSFSSQSQVLDTPYEGVGESIKLNESSRGSMLNEDTAIEIL 600
SANEVVN EPVQN EICNPKS SSQSQ LDTPYEGV ES++L+ SRGSML+EDTAIEIL
Sbjct: 541 SANEVVNHEPVQNSEICNPKSHSSQSQDLDTPYEGVSESVQLSLRSRGSMLDEDTAIEIL 600
Query: 601 LEDKGDKSFDGDDDVFTHLAAENPIQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPKN 660
LED+GDKSFDGDDD+FTHLAAENPIQ+ASFD SSQKLS DGTTDSGWEE VEGKTYSPKN
Sbjct: 601 LEDEGDKSFDGDDDLFTHLAAENPIQVASFDKSSQKLSFDGTTDSGWEEAVEGKTYSPKN 660
Query: 661 VEVDDHSFKEGRVSDDSEVEWEDGVCDQVNPVPFG-VESGKSVSKGSLEEEADLQEAIRR 720
VEVDDH F EGRVSD+SEVEWE+GVCD VNPVPFG ESGKSVSKGSLEEEADLQEAIRR
Sbjct: 661 VEVDDHPFVEGRVSDESEVEWEEGVCDHVNPVPFGAAESGKSVSKGSLEEEADLQEAIRR 720
Query: 721 SLADIGDRKSGPVFSEHQQPVIVGKMVEQCMVVENENVIELDKPDSADGMNCLKADDSTG 780
SL D+GDRK G V SEHQ+P GKM+EQC V+NENVI L D ADGM+C KA+DSTG
Sbjct: 721 SLKDVGDRKPGSVLSEHQKPESAGKMLEQCTSVQNENVIGLKNVDGADGMSCSKANDSTG 780
Query: 781 RKETTESSSQEKQCSKSVVLLDTKTDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRDA 840
RKETTESSSQEKQCS+ +VLLDT T T+ E+LDA +K SHK+SNENDDTLKPLSRDA
Sbjct: 781 RKETTESSSQEKQCSECIVLLDTTTHTVTEKLDASYKDV--SHKDSNENDDTLKPLSRDA 840
Query: 841 SGAAQVVDRINNTVIEPPCRMVEMEGIY--NVDSSPKAVACENHQNFPVEKHTSDLLLEE 900
SGA V DRINN + EPPC MV ME Y VDSSPK VA ENHQNFPV++ +SD+LLEE
Sbjct: 841 SGAVLVGDRINNKLTEPPCHMVGMEDSYTPEVDSSPKVVASENHQNFPVDELSSDILLEE 900
Query: 901 NDAKKPAVEVISNAEIEFAEIEFTEDELTNRISILEQERLNLGNEQKRLERNAESVSSEM 960
NDA+KPAVEVISN AEIEFTEDELTNRI ILEQERLNLG+EQKRLERNAESV SEM
Sbjct: 901 NDAQKPAVEVISN-----AEIEFTEDELTNRIXILEQERLNLGDEQKRLERNAESVXSEM 960
Query: 961 FAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDD 1020
FAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDD
Sbjct: 961 FAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDD 1020
Query: 1021 RKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEGGLH 1080
RKYVETYFMKDVENELGLNRDK+IRMALLLGSDYTEGISGIGIVNAIEVMNAFPEE GL
Sbjct: 1021 RKYVETYFMKDVENELGLNRDKIIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEDGLQ 1080
Query: 1081 KFKEWIESPDPSILGTLGAQTGLSARKRGSKASENDMTCSNGSVRDGSASGESIFKAPKE 1140
KFKEWIESPDPSILGTL AQTGLS+RKRGSKASE D TCSN SV DGSASGE I + KE
Sbjct: 1081 KFKEWIESPDPSILGTLSAQTGLSSRKRGSKASEKDTTCSNSSVGDGSASGEDISEDLKE 1140
Query: 1141 KGSIDVKQSFMDKHRNVSKNWHIPSAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLRR 1200
+IDVKQSFM KHRNVSKNWHIPS FPSE VISAYTCPQVDKSAE+FSWGKPD FVLRR
Sbjct: 1141 --NIDVKQSFMKKHRNVSKNWHIPSEFPSEXVISAYTCPQVDKSAESFSWGKPDXFVLRR 1200
Query: 1201 LCWEKFGWENSKADELLLPVLKEYSKHQTQLRLEAFYTFNERFAKIRSKRIKKAVRSITG 1260
LCWEKFGW+NSKADELLLPVLKEYSKH+TQLRLE FYTF+ERFAKIRSKRIKKAVR ITG
Sbjct: 1201 LCWEKFGWDNSKADELLLPVLKEYSKHETQLRLETFYTFDERFAKIRSKRIKKAVRGITG 1260
Query: 1261 SKSAVLMDDAVQGVSVNKQRELSVEPQENISEKCSSEIQGTCSNEDEVENRRRKPSRKRQ 1320
SKSAVLMDDAV+ VS NKQRELSVEPQE SEKCSSEIQG+CSN D+VE R KPSRKRQ
Sbjct: 1261 SKSAVLMDDAVRAVSANKQRELSVEPQEK-SEKCSSEIQGSCSNVDDVEKRLGKPSRKRQ 1320
Query: 1321 LHGEPSQPAK-DKLTMKERGKRSRNEGSHKNERGRGRGKGRGRGRLASKGKTKGTPITEL 1380
LHGE SQPAK KLTMKE+G R+RNEGSHKN RGRG KGRGRGRL KGK KG+P TEL
Sbjct: 1321 LHGEQSQPAKGQKLTMKEKGNRNRNEGSHKNGRGRGERKGRGRGRLQPKGKMKGSPTTEL 1380
Query: 1381 VGTSSSDDESEFDEQKFDLENLQEPRERRRSARIQKSASSTMNDVDQPSGHSRDRFSDDE 1440
V TSSSDDE+EFD+QK D NL+EP+ERRRS+RI+KS S TM D DQPS ++ DRFS+DE
Sbjct: 1381 VETSSSDDENEFDDQKCDFVNLEEPQERRRSSRIRKSVSYTMGDADQPSDYNGDRFSNDE 1440
Query: 1441 AKEHDVVRDRHALPETVISQSENTECDFKTRKRSPQKDHLETGGGFCPVEDEMSQQGMCQ 1500
AKEHDV+ D QSE TE D T KR PQ+D+ ETGGGFCPVEDEMS Q
Sbjct: 1441 AKEHDVIHD----------QSEKTERDLGTPKRPPQEDYFETGGGFCPVEDEMS-----Q 1500
Query: 1501 NKDPSLEANIGEDYLSMGGGFCSDDGNECVDPNSYPDQATFSEDPKDGSEDHPIQSTFHP 1560
+ DPSLEAN EDYL MGGGFC DD NEC+DP++YP +AT SED +D SE P QSTFHP
Sbjct: 1501 DIDPSLEANNSEDYLRMGGGFCLDDDNECIDPDAYPGRATVSEDLQDRSEHDPDQSTFHP 1560
Query: 1561 E-YIGRVQNEEGTDAHVDSPPNVGDSNPVSNPNSSQVVEGVQEEAKEHSVGAFGGALSAM 1598
E VQN+EGTDA VDS + G+ N V NPNSSQ EGVQEE K+HSV AFGGALSAM
Sbjct: 1561 EKCTSSVQNKEGTDARVDSLLDTGNPNRVCNPNSSQGGEGVQEEEKDHSVSAFGGALSAM 1601
BLAST of Sgr021433 vs. ExPASy TrEMBL
Match:
A0A6J1ERW5 (DNA repair protein UVH3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435307 PE=3 SV=1)
HSP 1 Score: 2244.2 bits (5814), Expect = 0.0e+00
Identity = 1252/1615 (77.52%), Postives = 1356/1615 (83.96%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
MGV GLWELLAPVGRRVSVETLAGK+LAIDASIWMVQFIKAMRD+RGEMVRNAHLLGFFR
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDERGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATP+LKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRLR
Sbjct: 61 RICKLLFLRTKPVFVFDGATPSLKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKEMRLR 120
Query: 121 ELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGT--SERNKGVPSSGSHENLDEMLAASIM 180
ELAE ++NQKQQR+QDV KKKTL NHNEI DGT SER+K VP+SG+HENLD M+AASIM
Sbjct: 121 ELAEGIKNQKQQRKQDVPKKKTLLNHNEIVDGTSVSERSKSVPNSGNHENLDGMVAASIM 180
Query: 181 AEENGFLMSSASSFAGTTLAKEDSGEESILPLMHEVDPDVLSTLPSSIRHELQKQKYKND 240
EENGF SSA SF+G TL K+D GE+SIL QKYKND
Sbjct: 181 IEENGFFSSSAPSFSGVTLPKKDRGEQSIL-----------------------NQKYKND 240
Query: 241 SKDKKILSDEIHVVGSDSERMEVVSRSAYQKNLDEMLAASIAAEEAQSLNENASVSATAN 300
SK KKILSDEIHVVGSDSERMEV SRSA+Q+NLDEMLAASIAAEEA+ LNEN SVS+ AN
Sbjct: 241 SKGKKILSDEIHVVGSDSERMEVASRSAHQQNLDEMLAASIAAEEARGLNENISVSSAAN 300
Query: 301 WDGEDTD--DEDEEMILPEMHGIVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVK 360
GED D DEDEEMILPEMHG+VDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVK
Sbjct: 301 LAGEDMDDEDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVK 360
Query: 361 KDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSKIASEANREFIFSSSFTG 420
KDPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGVGGVQTS+IASEANREFIFSSSFTG
Sbjct: 361 KDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTG 420
Query: 421 DKQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPSTSNALARSTPDKTAVFEDNIETF 480
DKQVL STRAEKNGDK L+ PR QQ LSSLNNT++PSTSN LA+STPDK+ VFEDNIETF
Sbjct: 421 DKQVLTSTRAEKNGDKNLQEPRVQQSLSSLNNTDIPSTSNGLAQSTPDKSGVFEDNIETF 480
Query: 481 LDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANEVVNPEPVQNIEICNPKSFS 540
LDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKN +AN+V NPEP+QNIEICNP+S S
Sbjct: 481 LDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNINANKVANPEPMQNIEICNPESSS 540
Query: 541 SQSQVLDTPYEGVGESI-KLNESSRGSMLNEDTAIEILLEDKGDKSFDGDDDVFTHLAAE 600
+SQVLD EG+ ESI KL+E SMLNEDTAIEILLE +G KSFDGDDD+FTHLAAE
Sbjct: 541 LRSQVLDVSNEGIDESINKLDERGADSMLNEDTAIEILLEGEGGKSFDGDDDLFTHLAAE 600
Query: 601 NPIQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPKNVEVDDHSFKEGRVSDDSEVEWE 660
NPIQMASFDISSQKLS DGTTDSGW+E + EG +SD+SEV+WE
Sbjct: 601 NPIQMASFDISSQKLSQDGTTDSGWKEAL------------------EGTISDESEVDWE 660
Query: 661 DGVCDQVNPVPFGVESGKSVSKGSLEEEADLQEAIRRSLADIGDRKSGPVFSEHQ----Q 720
DGVCD VNPVPF ESGKSVSKGSLEEEADLQEAIRRSL D+GD KSGPV EH+ Q
Sbjct: 661 DGVCDHVNPVPFEDESGKSVSKGSLEEEADLQEAIRRSLEDVGDGKSGPVSLEHEQPQSQ 720
Query: 721 PVIVGKMVEQCMVVENENVIELDKPDSADGMNCLKADDSTGRKETTESSSQEKQCSKSVV 780
P IVGKM EQC VENENVI L+K DS DGMN A DS +K TESSSQEKQCS+ VV
Sbjct: 721 PSIVGKMAEQCTSVENENVIGLEKMDSVDGMNWSNAKDSILKKGMTESSSQEKQCSEPVV 780
Query: 781 LLDTKTDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRDASGAAQVVDRINNTVIEPPC 840
LLDT TIAEQLDA +K + S +ESNE+ DTLK LSRDA A QV D IN+T+IEP C
Sbjct: 781 LLDT---TIAEQLDASYKDTSFSLQESNESSDTLKSLSRDAPRATQVGDMINSTMIEPAC 840
Query: 841 RMVEMEGIY--NVDSSPKAVACENH--QNFPVEKHTSDLLLEENDAKKPAVEVISNAEIE 900
RMVEM+G+ +VDSS K A ENH QN PVEKH+SDLLLEE K V EI
Sbjct: 841 RMVEMDGVNTPDVDSSTKDSAFENHFKQNLPVEKHSSDLLLEEEVGKGHTV------EIS 900
Query: 901 FAEIEFTEDELTNRISILEQERLNLGNEQKRLERNAESVSSEMFAECQELLQMFGLPYII 960
AE E TEDEL +RISILEQERLNLG+EQKRLERNAE+VSSEMFAECQELLQMFGLPYII
Sbjct: 901 KAETEVTEDELKSRISILEQERLNLGDEQKRLERNAEAVSSEMFAECQELLQMFGLPYII 960
Query: 961 APMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELG 1020
APMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELG
Sbjct: 961 APMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELG 1020
Query: 1021 LNRDKLIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEGGLHKFKEWIESPDPSILGTL 1080
L+R+KLIRMALLLGSDYTEGISGIGIVNA+EVMNAFPEE GLHKFKEWIESPDPSILGTL
Sbjct: 1021 LDRNKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTL 1080
Query: 1081 GAQTGLSARKRGSKASENDMTCSNGSVRDGSASGESIFKAPKEKGSIDVKQSFMDKHRNV 1140
GA+TGLSARKRG KASEND CSN SVRDGSAS E+I K KE +IDVKQ+FM KHRNV
Sbjct: 1081 GAKTGLSARKRGQKASENDAPCSNSSVRDGSASEENIDKDLKE--NIDVKQNFMVKHRNV 1140
Query: 1141 SKNWHIPSAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLRRLCWEKFGWENSKADELL 1200
SKNWHIPS FPSEAVISAY PQVDKSAE FSWGKPDHFVLRRLC EKFGWENSKADELL
Sbjct: 1141 SKNWHIPSEFPSEAVISAYISPQVDKSAEPFSWGKPDHFVLRRLCLEKFGWENSKADELL 1200
Query: 1201 LPVLKEYSKHQTQLRLEAFYTFNERFAKIRSKRIKKAVRSITGSKSAVLMDDAVQGVSVN 1260
LPVLKEY KH+TQLRLEAFYTFNERFAKIRSKRIKKAV+SITGSKSA LMD+ V VSVN
Sbjct: 1201 LPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSKSASLMDETVPNVSVN 1260
Query: 1261 KQRELSVEPQENISEKCSSEIQGTCSNEDEVENRRRKPSRKRQLHGEPSQPAKD-KLTMK 1320
Q LS E Q+N+SEKCSSEIQG CSNED V+NR RKPSRKRQL E SQPAKD KLTMK
Sbjct: 1261 NQINLSGETQKNMSEKCSSEIQGACSNEDNVDNRLRKPSRKRQLDREQSQPAKDRKLTMK 1320
Query: 1321 ERGKRSRNEGSHKNE-RGRGRGKGRGRGRLASKGKTKGTPITELVGTSSSDDESEFDEQK 1380
E+GKRSRNEGSH RGRGRGKGRGRGRLA KGK +PITE VGTSSSDDESEFD+QK
Sbjct: 1321 EKGKRSRNEGSHSERGRGRGRGKGRGRGRLALKGK---SPITEFVGTSSSDDESEFDDQK 1380
Query: 1381 FDLENLQEPRERRRSARIQKSASSTMNDV--DQPSGHSRDRFSDDEAKEHDVVRDRHALP 1440
DLEN+QEP+ERR+S+R++KSAS M+D DQPS HS R S+DEA + +VV+ + P
Sbjct: 1381 IDLENVQEPQERRKSSRVRKSASYKMDDADQDQPSDHSGYRLSNDEANDDNVVQGGYTGP 1440
Query: 1441 ETVISQSENTECDFKTRKRSPQKDHLETGGGFCPVEDEMSQQGMCQNKDPSLEANIGEDY 1500
ETV+ SENTECD++ KRSP +D+L TGGGFCP EDEMS++ MCQNKDP+LEA+ EDY
Sbjct: 1441 ETVMIHSENTECDYEIPKRSPLRDYLGTGGGFCPTEDEMSREAMCQNKDPALEASNSEDY 1500
Query: 1501 LSMGGGFCSDDGNECVDPNSYPDQATFSEDPKDGSEDHPIQSTFHPEY-IGRVQNEEGTD 1560
L++GGGFC DD NECVDP ++ DQAT SE PKDGSED P QSTFHPE IG Q E T
Sbjct: 1501 LTLGGGFCLDDDNECVDPVAHLDQATASEVPKDGSEDDPDQSTFHPEKDIGGNQLNEDTY 1560
Query: 1561 AHVDSPPNVGDSNPVSNPNSSQVVEGVQEEAKEHSVGAFGGALSAMPNLRRKKRR 1598
H +S +VGD NP S PNSS+V EGVQEE K+HSV AFGGALSAMPNLRRK++R
Sbjct: 1561 PHGESLLDVGDPNPASFPNSSRVGEGVQEEPKDHSVRAFGGALSAMPNLRRKRKR 1560
BLAST of Sgr021433 vs. ExPASy TrEMBL
Match:
A0A6J1JJE1 (DNA repair protein UVH3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111486346 PE=3 SV=1)
HSP 1 Score: 2216.4 bits (5742), Expect = 0.0e+00
Identity = 1243/1618 (76.82%), Postives = 1348/1618 (83.31%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
MGV GLWELLAPVGRRVSVETLAGK+LAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATP+LKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRLR
Sbjct: 61 RICKLLFLRTKPVFVFDGATPSLKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKEMRLR 120
Query: 121 ELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGT--SERNKGVPSSGSHENLDEMLAASIM 180
ELAE ++NQKQQR+QDV KKKTL NHN I DGT SER+K VP+SG+HENLD M+AASIM
Sbjct: 121 ELAEGIKNQKQQRKQDVPKKKTLLNHNAIVDGTSISERSKSVPNSGNHENLDGMVAASIM 180
Query: 181 AEENGFLMSSASSFAGTTLAKEDSGEESILPLMHEVDPDVLSTLPSSIRHELQKQKYKND 240
EENGF SSA SF G TL KED GE+S QKYKND
Sbjct: 181 IEENGFFSSSAPSFVGVTLPKEDRGEQS-----------------------TWNQKYKND 240
Query: 241 SKDKKILSDEIHVVGSDSERMEVVSRSAYQKNLDEMLAASIAAEEAQSLNENASVSATAN 300
SK KKILSDEIHVVGSDSERMEV SRSA+Q+NLDEMLAASIAAEEA+ LNEN SVS+ +
Sbjct: 241 SKGKKILSDEIHVVGSDSERMEVASRSAHQQNLDEMLAASIAAEEARGLNENVSVSSASY 300
Query: 301 WDGEDTD--DEDEEMILPEMHGIVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVK 360
GED D DEDEEMILPEMHG+VDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVK
Sbjct: 301 LAGEDMDDEDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVK 360
Query: 361 KDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSKIASEANREFIFSSSFTG 420
KDPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGVGGVQTS+IASEANREFIFSSSFTG
Sbjct: 361 KDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTG 420
Query: 421 DKQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPSTSNALARSTPDKTAVFEDNIETF 480
DKQVL STRAEKNGDK L+APR QQ LSSLNNT++PSTSN LA+STPDK+ VFEDNIETF
Sbjct: 421 DKQVLTSTRAEKNGDKDLQAPRVQQSLSSLNNTDIPSTSNGLAQSTPDKSGVFEDNIETF 480
Query: 481 LDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANEVVNPEPVQNIEICNPKSFS 540
LDERG VRVSRVRAMGMHMTRDLERNLDLMKEIEKN +AN+ NPEP+QNIEICNPKS S
Sbjct: 481 LDERGCVRVSRVRAMGMHMTRDLERNLDLMKEIEKNINANKAANPEPMQNIEICNPKSSS 540
Query: 541 SQSQVLDTPYEGVGESI-KLNESSRGSMLNEDTAIEILLEDKGDKSFDGDDDVFTHLAAE 600
+SQVLD EGV ESI KL+E SMLNEDTAIEI+LE +G KSFDGDDD+FTHLAAE
Sbjct: 541 LRSQVLDVSNEGVDESINKLDERGADSMLNEDTAIEIVLEGEGGKSFDGDDDLFTHLAAE 600
Query: 601 NPIQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPKNVEVDDHSFKEGRVSDDSEVEWE 660
NPIQMASFDISSQKLS DGTTDSGW+E + EG VSD+SEV+WE
Sbjct: 601 NPIQMASFDISSQKLSQDGTTDSGWKEAL------------------EGTVSDESEVDWE 660
Query: 661 DGVCDQVNPVPFGVESGKSVSKGSLEEEADLQEAIRRSLADIGDRKSGPVFSEHQ----Q 720
DGVCD VNPVPF ESGKSVSKGSLEEEADLQEAIRRSL D+GD KSGPV EH+ Q
Sbjct: 661 DGVCDHVNPVPFEDESGKSVSKGSLEEEADLQEAIRRSLEDVGDGKSGPVSLEHEQPQSQ 720
Query: 721 PVIVGKMVEQCMVVENENVIELDKPDSADGMNCLKADDSTGRKETTESSSQEKQCSKSVV 780
P IVGKM E+CM ENENVI L+K DS DGMN A DS +K TESSSQEKQCS+ VV
Sbjct: 721 PSIVGKMAERCMSFENENVIGLEKMDSVDGMNWSNAKDSILKKGMTESSSQEKQCSEPVV 780
Query: 781 LLDTKTDTIAEQLDAPFKGAASSHKESNENDDTLKPLSRDASGAAQVVDRINNTVIEPPC 840
LLDT TIAEQLDA +K + S + SNEN DTLK LSRDA A QV D IN+TVIEP C
Sbjct: 781 LLDT---TIAEQLDASYKDTSFSLQVSNENSDTLKSLSRDAPRATQVGDMINSTVIEPAC 840
Query: 841 RMVEMEGIY--NVDSSPKAVACENH--QNFPVEKHTSDLLLEENDAKKPAVEVISNAEIE 900
RMVEM+G+ +VDSS K A ENH QNFPVEKH+SDLLLEE K V +I
Sbjct: 841 RMVEMDGVNTPDVDSSTKDSAFENHFKQNFPVEKHSSDLLLEEEVGKGHTV------KIS 900
Query: 901 FAEIEFTEDELTNRISILEQERLNLGNEQKRLERNAESVSSEMFAECQELLQMFGLPYII 960
AE E TEDEL +RISILEQERL+LG+EQKRLERNAE+VSSEMFAECQELLQMFGLPYII
Sbjct: 901 KAEAEVTEDELKSRISILEQERLSLGDEQKRLERNAEAVSSEMFAECQELLQMFGLPYII 960
Query: 961 APMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELG 1020
APMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKD+ENELG
Sbjct: 961 APMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDIENELG 1020
Query: 1021 LNRDKLIRMALLLGSDYTEGISGIGIVNAIEVMNAFPEEGGLHKFKEWIESPDPSILGTL 1080
L+R+KLIRMALLLGSDYTEGISGIGIVNA+EVMNAF EE GLHKFKEWIESPDPSILGTL
Sbjct: 1021 LDRNKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFSEEDGLHKFKEWIESPDPSILGTL 1080
Query: 1081 GAQTGLSARKRGSKASENDMTCSNGSVRDGSASGESIFKAPKEKGSIDVKQSFMDKHRNV 1140
GA+TGLSARKRG KASEND TCSN SVRDGSAS E+I K KE +IDVKQ+FM KHRNV
Sbjct: 1081 GAKTGLSARKRGQKASENDATCSNSSVRDGSASEENIDKDLKE--NIDVKQNFMVKHRNV 1140
Query: 1141 SKNWHIPSAFPSEAVISAYTCPQVDKSAEAFSWGKPDHFVLRRLCWEKFGWENSKADELL 1200
SKNWHIPS FPSEAVISAY PQVDKSAE FSWGKPDHFVLRRLC EKFGWENSKADELL
Sbjct: 1141 SKNWHIPSEFPSEAVISAYISPQVDKSAEPFSWGKPDHFVLRRLCLEKFGWENSKADELL 1200
Query: 1201 LPVLKEYSKHQTQLRLEAFYTFNERFAKIRSKRIKKAVRSITGSKSAVLMDDAVQGVSVN 1260
LPVLKEY KH+TQLRLEAFYTFNERFAKIRSKRIKKAV+SITGSKSA LMD+ V VSVN
Sbjct: 1201 LPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSKSASLMDETVPNVSVN 1260
Query: 1261 KQRELSVEPQENISEKCSSEIQGTCSNEDEVENRRRKPSRKRQLHGEPSQPAKD-KLTMK 1320
Q LS E Q+N+SEKCSSEIQG CSNED V+NR RKPSRKRQL E SQPAKD KLTMK
Sbjct: 1261 NQINLSGETQKNMSEKCSSEIQGACSNEDNVDNRLRKPSRKRQLDREQSQPAKDRKLTMK 1320
Query: 1321 ERGKRSRNEGSHKNE---RGRGRGKGRGRGRLASKGKTKGTPITELVGTSSSDDESEFDE 1380
E+GKRSRNEGSH RGRG GKGRGRGRLASKGK +PITE V TSSSDDESE D+
Sbjct: 1321 EKGKRSRNEGSHSERGRGRGRGGGKGRGRGRLASKGK---SPITEFVETSSSDDESESDD 1380
Query: 1381 QKFDLENLQEPRERRRSARIQKSASSTMNDV----DQPSGHSRDRFSDDEAKEHDVVRDR 1440
+K DLEN+QEP+ERR+S+R++KSAS M+D DQPS +S R S+DEA + +VV+ R
Sbjct: 1381 KKLDLENVQEPQERRKSSRVRKSASYKMDDADPDQDQPSDYSGYRLSNDEANDDNVVQGR 1440
Query: 1441 HALPETVISQSENTECDFKTRKRSPQKDHLETGGGFCPVEDEMSQQGMCQNKDPSLEANI 1500
+ PETV+ SENTECD++ KRSP +D+L TGGGFCP EDEMSQ+ MC+NKDP+LEA+
Sbjct: 1441 YTGPETVMIHSENTECDYEIPKRSPLRDYLGTGGGFCPTEDEMSQEAMCRNKDPALEASN 1500
Query: 1501 GEDYLSMGGGFCSDDGNECVDPNSYPDQATFSEDPKDGSEDHPIQSTFHPEY-IGRVQNE 1560
EDYL++GGGFC DD NECVDP ++ DQAT SE KDGSED P QSTFHPE IG Q E
Sbjct: 1501 SEDYLTLGGGFCLDDDNECVDPVAHLDQATVSEALKDGSEDDPGQSTFHPEKDIGGDQLE 1560
Query: 1561 EGTDAHVDSPPNVGDSNPVSNPNSSQVVEGVQEEAKEHSVGAFGGALSAMPNLRRKKR 1597
E T +S +VGD NPVS PNSS+V EGVQE+ K+HSV +FGGALSAMPNLRRK+R
Sbjct: 1561 EDTYPRGESLLDVGDPNPVSYPNSSEVGEGVQEKPKDHSVRSFGGALSAMPNLRRKRR 1563
BLAST of Sgr021433 vs. TAIR 10
Match:
AT3G28030.1 (5'-3' exonuclease family protein )
HSP 1 Score: 1002.3 bits (2590), Expect = 5.3e-292
Identity = 720/1655 (43.50%), Postives = 944/1655 (57.04%), Query Frame = 0
Query: 1 MGVQGLWELLAPVGRRVSVETLAGKRLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
MGVQGLWELLAPVGRRVSVETLA KRLAIDASIWMVQFIKAMRD++G+MV+NAHL+GFFR
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLANKRLAIDASIWMVQFIKAMRDEKGDMVQNAHLIGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKP+FVFDGATPALKRRT+IARRRQRENAQ K+RKTAEKLLLN LK +RL+
Sbjct: 61 RICKLLFLRTKPIFVFDGATPALKRRTVIARRRQRENAQTKIRKTAEKLLLNRLKDIRLK 120
Query: 121 ELAEDLQNQKQQRRQDVQKKKTLPNHNEIADGTSERNKGVPSSGSHENLDEMLAASIMAE 180
E A+D++NQ+ ++ + KK ++ + E N VP ++ + AS E
Sbjct: 121 EQAKDIKNQRLKQDDSDRVKK------RVSSDSVEDNLRVPVE------EDDVGASFFQE 180
Query: 181 ENGFLMSSASSFAGTTLAKEDSGEESILPLMHEVDPDVLSTLPSSIRHELQKQKYKNDSK 240
E +S AS L+ E D ++ K+ K+D K
Sbjct: 181 EKLDEVSQAS-------------------LVGETGVD-----------DVVKESVKDDPK 240
Query: 241 DKKILSDEIHVVGSDSERM---EVVSRSAYQKNLDEMLAASIAAEEAQSLNENASVSATA 300
K +L D G D + + V YQ+ LDEMLAAS+AAEE ++ AS SA A
Sbjct: 241 GKGVLLD-----GDDLDNLVQDSSVQGKDYQEKLDEMLAASLAAEEERNFTSKASTSAAA 300
Query: 301 ---NWDGEDTDDEDEEMILPEMHGIVDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQR 360
D E+ D DEE++LP M G +DP+VLA+LPPS+QLDLL QMRE+LMAENRQKYQ+
Sbjct: 301 IPSEEDEEEDSDGDEEILLPVMDGNIDPAVLASLPPSMQLDLLAQMREKLMAENRQKYQK 360
Query: 361 VKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSKIASEANREFIFSSSF 420
VKK P KFSELQI+AYLKTVAFRR+I++VQ++A GR VGGVQTS+IASEANREFIFSSSF
Sbjct: 361 VKKAPEKFSELQIEAYLKTVAFRREINEVQRSAGGRAVGGVQTSRIASEANREFIFSSSF 420
Query: 421 TGDKQVLASTRAEKNGDKGLEAPRGQQPLSSLNNTEVPSTSNALARSTPDKTAVFEDNIE 480
GDK+VLAS R +N + + + P+ S+ N S+A D+ ++NIE
Sbjct: 421 AGDKEVLASAREGRNDENQKKTSQQSLPV-SVKNASPLKKSDATIELDRDEPKNPDENIE 480
Query: 481 TFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANEVVNPEPVQNIEICNPKS 540
++DERGR R+ R R MG+ MTRD++RNL LMKE E+ AS + N E E +
Sbjct: 481 VYIDERGRFRI-RNRHMGIQMTRDIQRNLHLMKEKERTASGSMAKNDETFSAWE-----N 540
Query: 541 FSSQSQVLD-TPYEGVGESIKLNESSRGSMLNEDTAIEILLE-DKGDKSFDGDDDVFTHL 600
F ++ Q L+ +P E + + L + SML+ ++IEI + D G K + +DD+F L
Sbjct: 541 FPTEDQFLEKSPVE--KDVVDLEIQNDDSMLHPPSSIEISFDHDGGGKDLNDEDDMFLQL 600
Query: 601 AAENPIQMASFDISSQKLSLDGTTDSGWEETVEGKTYSPKNVEV---DDHSFKEGRVSDD 660
AA P+ ++S + ++ + +DS WEE + S +E + H K+ +S
Sbjct: 601 AAGGPVTISSTENDPKEDTSPWASDSDWEEVPVEQNTSVSKLEANLSNQHIPKD--ISIA 660
Query: 661 SEVEWEDGVCDQVNPVPFGVESG--KSVSKGSLEEEADLQEAIRRSLADIGDRKSGPVFS 720
V WE+ C N VE+ ++KG LEEEADLQEAI++SL ++ D++SG V
Sbjct: 661 EGVAWEEYSCKNANN---SVENDTVTKITKGYLEEEADLQEAIKKSLLELHDKESGDVLE 720
Query: 721 EHQQ---PVIVGKMVEQCMVVENENVIELDKPDSADGMNCLK-------------ADDST 780
E+Q ++V K E + E V E ++ D + LK A ++
Sbjct: 721 ENQSVRVNLVVDKPSEDSL-CSRETVGEAEEERFLDEITILKTSGAISEQSNTSVAGNAD 780
Query: 781 GRKETTES-----SSQEKQCSKSV---------VLLDTKTDTIAEQLDAPFKGAASSHKE 840
G+K T+ SS S +V V+ K +A Q + A H E
Sbjct: 781 GQKGITKQFGTHPSSGSNNVSHAVSNKLSKVKSVISPEKALNVASQ-NRMLSTMAKQHNE 840
Query: 841 SNENDDTLKPLSRDASGAAQ-----VVDRINNTVIEPPCRMVE--------MEGIYNVDS 900
+ + A A +D +N E M + ++ +
Sbjct: 841 EGSESFGGESVKVSAMPIADEEITGFLDEKDNADGESSIMMDDKRDYSRRKIQSLVTESR 900
Query: 901 SPKAVACENHQNFPVEKHTSDLLLEENDAKKPAVEVISNAEIE--FAEIEFTEDELTNRI 960
P + + + + EEN++ + + S+ + E +EF+E + I
Sbjct: 901 DPSRNVVRSRIGILHDTDSQNERREENNSNEHTFNIDSSTDFEEKGVPVEFSEANIEEEI 960
Query: 961 SILEQERLNLGNEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELA 1020
+L+QE ++LG+EQ++LERNAESVSSEMFAECQELLQ+FG+PYIIAPMEAEAQCA+ME +
Sbjct: 961 RVLDQEFVSLGDEQRKLERNAESVSSEMFAECQELLQIFGIPYIIAPMEAEAQCAFMEQS 1020
Query: 1021 NLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGS 1080
NLVDG+VTDDSDVFLFGARSVYKNIFDDRKYVETYFMKD+E ELGL+RDK+IRMA+LLGS
Sbjct: 1021 NLVDGIVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDIEKELGLSRDKIIRMAMLLGS 1080
Query: 1081 DYTEGISGIGIVNAIEVMNAFPEEGGLHKFKEWIESPDPSILGTLGAQTGLSARKRGSKA 1140
DYTEGISGIGIVNAIEV+ AFPEE GL KF+EW+ESPDP+ILG A+TG +KRGS +
Sbjct: 1081 DYTEGISGIGIVNAIEVVTAFPEEDGLQKFREWVESPDPTILGKTDAKTGSKVKKRGSAS 1140
Query: 1141 SENDMTCSNGSVRDGSASGESIFKAPKEKGSIDVKQSFMDKHRNVSKNWHIPSAFPSEAV 1200
+N S S D + ++KQ FMD+HR VSKNWHIP FPSEAV
Sbjct: 1141 VDNKGIISGASTDD----------------TEEIKQIFMDQHRKVSKNWHIPLTFPSEAV 1200
Query: 1201 ISAYTCPQVDKSAEAFSWGKPDHFVLRRLCWEKFGWENSKADELLLPVLKEYSKHQTQLR 1260
ISAY PQVD S E FSWGKPD VLR+LCWEKF W K DELLLPVLKEY K +TQLR
Sbjct: 1201 ISAYLNPQVDLSTEKFSWGKPDLSVLRKLCWEKFNWNGKKTDELLLPVLKEYEKRETQLR 1260
Query: 1261 LEAFYTFNERFAKIRSKRIKKAVRSITGSKSAVLMDDAVQGVSVNKQRELSVEPQENISE 1320
+EAFY+FNERFAKIRSKRI KAV+ I G S+ + D +Q K+ + V P E
Sbjct: 1261 IEAFYSFNERFAKIRSKRINKAVKGIGGGLSSDVADHTLQ-EGPRKRNKKKVAPHE---- 1320
Query: 1321 KCSSEIQGTCSNEDEVENRRRKPSRKRQLHGEPSQPAKDKLTMKERGKRSRNEGSHKNER 1380
+E T + + N + K RKR E+ SR R
Sbjct: 1321 ---TEDNNTSDKDSPIANEKVKNKRKR----------------LEKPSSSRG-------R 1380
Query: 1381 GRGRGKGRGRGRLASKGKTKGTPITELVGTSSSDDESEFDEQKFDLENLQEPRERRRSAR 1440
GR + +GRGRGR+ + EL SS DD+ D++ +LE + A
Sbjct: 1381 GRAQKRGRGRGRVQK-------DLLELSDGSSDDDDD--DDKVVELE--------AKPAN 1440
Query: 1441 IQKSASSTMNDVDQPSGHSRDRFSDDEAKEHDVVRDRHALPETVISQSENTECDFKTRKR 1500
+QKS S N V S D + + E + + E I ++ +
Sbjct: 1441 LQKSTRS-RNPV-MYSAKEDDELDESRSNEGSPSENFEEVDEGRIGNDDSVDASIND--- 1478
Query: 1501 SPQKDHLETGGGFCPVEDEMSQQGMCQNKDPSLEANIGEDYLSMGGGFCSDDGNECVDPN 1560
P +D+++TGGGFC DE + G D LE +DY +GGGFC D+ +E + N
Sbjct: 1501 CPSEDYIQTGGGFC--ADEADEIG-----DAHLEDKATDDYRVIGGGFCVDE-DETAEEN 1478
Query: 1561 SYPDQATFSEDPKDGSEDHPIQSTFHPEYIGRVQNEEGTDAHVDSPPNVGDSNPVSNPNS 1598
+ D A E K SE+ + G+ +NEE DA +D
Sbjct: 1561 TMDDDA---EILKMESEEQRKK--------GKRRNEE--DASLD---------------- 1478
BLAST of Sgr021433 vs. TAIR 10
Match:
AT5G40230.1 (nodulin MtN21 /EamA-like transporter family protein )
HSP 1 Score: 285.0 bits (728), Expect = 4.3e-76
Identity = 156/305 (51.15%), Postives = 212/305 (69.51%), Query Frame = 0
Query: 1597 REFAPLAGMIAAECATVGSNTVYKAISTQEISYYVFTFYTCLAAALVLLPFAFIFRRSGV 1656
R+ P M+A EC TVGSNT++KA + + +S+YVF FYT + A LVLLP + IF RS
Sbjct: 17 RDVVPFTAMVAVECVTVGSNTLFKAATLRGLSFYVFVFYTYVVATLVLLPLSLIFGRSKR 76
Query: 1657 FPSDKLSSFLLRLIFLSAMGVACQLFAYKGLEYSSPTLASAISNLIPALTFIFAVLFGME 1716
PS K F + L+ +G + KG+EYSSPTLASAISNL PA TF AV+F ME
Sbjct: 77 LPSAKTPVF-FNIFLLALVGFMSLIVGCKGIEYSSPTLASAISNLTPAFTFTLAVIFRME 136
Query: 1717 KLALKGSSSIAKIIGSVVSISGALVVVLYKGPVILSNPFSGP--TRLNLPHHPLGSTQPN 1776
++ L+ S++ AKIIG++VSISGALVV+LYKGP +L++ P ++L H L S +
Sbjct: 137 QIVLRSSATQAKIIGTIVSISGALVVILYKGPKVLTDASLTPPSPTISLYQH-LTSFDSS 196
Query: 1777 WIMGGLCFFAQYLLNSFWYIILTQMVNMYPDELAVVCLYYVFEAIIAAPICLLVEGNLSA 1836
WI+GGL QYLL S WYI+ T+++ +YP+E+ VV LY + +I+AP+CL E +L++
Sbjct: 197 WIIGGLLLATQYLLVSVWYILQTRVMELYPEEITVVFLYNLCATLISAPVCLFAEKDLNS 256
Query: 1837 WKLKNGLELVAVLNSGCVGQSFVTAIHTWGVHVKGPVYVSSFRPLSIAIAAATGVIFLGD 1896
+ LK G+ L +V+ SG + SF + IHTWG+H+KGPVY+S F+PLSI IA A GV+FLGD
Sbjct: 257 FILKPGVSLASVMYSGGLVSSFGSVIHTWGLHLKGPVYISLFKPLSIVIAVAMGVMFLGD 316
Query: 1897 DLYLG 1900
LYLG
Sbjct: 317 ALYLG 319
BLAST of Sgr021433 vs. TAIR 10
Match:
AT5G40240.1 (nodulin MtN21 /EamA-like transporter family protein )
HSP 1 Score: 282.3 bits (721), Expect = 2.8e-75
Identity = 152/304 (50.00%), Postives = 205/304 (67.43%), Query Frame = 0
Query: 1597 REFAPLAGMIAAECATVGSNTVYKAISTQEISYYVFTFYTCLAAALVLLPFAFIFRRSGV 1656
R+ P A M A ECATVGSNT++KA + + +S+YVF FY+ + + L+LLP + IF RS
Sbjct: 16 RDVVPFAAMFAVECATVGSNTLFKAATLRGLSFYVFVFYSYIVSTLLLLPLSVIFGRSRR 75
Query: 1657 FPSDKLSSFLLRLIFLSAMGVACQLFAYKGLEYSSPTLASAISNLIPALTFIFAVLFGME 1716
P+ K S ++ L +G Q+ KG+ YSSPTLASAISNL PA TF AV+F ME
Sbjct: 76 LPAAK-SPLFFKIFLLGLVGFMSQIAGCKGIAYSSPTLASAISNLTPAFTFTLAVIFRME 135
Query: 1717 KLALKGSSSIAKIIGSVVSISGALVVVLYKGP-VILSNPFSGPTRLNLPHHPLGSTQPNW 1776
++ L+ S++ AKIIG+++SISGALVVVLYKGP V+ S F+ H L S + +W
Sbjct: 136 QVRLRSSATQAKIIGAILSISGALVVVLYKGPQVLASASFTTVLPTVTLHQQLTSIESSW 195
Query: 1777 IMGGLCFFAQYLLNSFWYIILTQMVNMYPDELAVVCLYYVFEAIIAAPICLLVEGNLSAW 1836
I+GGL +QY L S WYI+ T+++ +YP+E+ VV Y +F +I+ P+CL E NL++W
Sbjct: 196 IIGGLLLASQYFLISVWYILQTRVMEVYPEEITVVFFYNLFATLISVPVCLFAESNLTSW 255
Query: 1837 KLKNGLELVAVLNSGCVGQSFVTAIHTWGVHVKGPVYVSSFRPLSIAIAAATGVIFLGDD 1896
LK + L A++ SG F HTWG+H+KGPVY+S FRPLSIAIA A G IFLGD
Sbjct: 256 VLKPDISLAAIIYSGVFVSLFSALTHTWGLHLKGPVYISLFRPLSIAIAVAMGAIFLGDA 315
Query: 1897 LYLG 1900
L+LG
Sbjct: 316 LHLG 318
BLAST of Sgr021433 vs. TAIR 10
Match:
AT5G40240.2 (nodulin MtN21 /EamA-like transporter family protein )
HSP 1 Score: 282.3 bits (721), Expect = 2.8e-75
Identity = 152/304 (50.00%), Postives = 205/304 (67.43%), Query Frame = 0
Query: 1597 REFAPLAGMIAAECATVGSNTVYKAISTQEISYYVFTFYTCLAAALVLLPFAFIFRRSGV 1656
R+ P A M A ECATVGSNT++KA + + +S+YVF FY+ + + L+LLP + IF RS
Sbjct: 30 RDVVPFAAMFAVECATVGSNTLFKAATLRGLSFYVFVFYSYIVSTLLLLPLSVIFGRSRR 89
Query: 1657 FPSDKLSSFLLRLIFLSAMGVACQLFAYKGLEYSSPTLASAISNLIPALTFIFAVLFGME 1716
P+ K S ++ L +G Q+ KG+ YSSPTLASAISNL PA TF AV+F ME
Sbjct: 90 LPAAK-SPLFFKIFLLGLVGFMSQIAGCKGIAYSSPTLASAISNLTPAFTFTLAVIFRME 149
Query: 1717 KLALKGSSSIAKIIGSVVSISGALVVVLYKGP-VILSNPFSGPTRLNLPHHPLGSTQPNW 1776
++ L+ S++ AKIIG+++SISGALVVVLYKGP V+ S F+ H L S + +W
Sbjct: 150 QVRLRSSATQAKIIGAILSISGALVVVLYKGPQVLASASFTTVLPTVTLHQQLTSIESSW 209
Query: 1777 IMGGLCFFAQYLLNSFWYIILTQMVNMYPDELAVVCLYYVFEAIIAAPICLLVEGNLSAW 1836
I+GGL +QY L S WYI+ T+++ +YP+E+ VV Y +F +I+ P+CL E NL++W
Sbjct: 210 IIGGLLLASQYFLISVWYILQTRVMEVYPEEITVVFFYNLFATLISVPVCLFAESNLTSW 269
Query: 1837 KLKNGLELVAVLNSGCVGQSFVTAIHTWGVHVKGPVYVSSFRPLSIAIAAATGVIFLGDD 1896
LK + L A++ SG F HTWG+H+KGPVY+S FRPLSIAIA A G IFLGD
Sbjct: 270 VLKPDISLAAIIYSGVFVSLFSALTHTWGLHLKGPVYISLFRPLSIAIAVAMGAIFLGDA 329
Query: 1897 LYLG 1900
L+LG
Sbjct: 330 LHLG 332
BLAST of Sgr021433 vs. TAIR 10
Match:
AT4G15540.1 (EamA-like transporter family )
HSP 1 Score: 266.5 bits (680), Expect = 1.6e-70
Identity = 147/304 (48.36%), Postives = 202/304 (66.45%), Query Frame = 0
Query: 1596 RREFAPLAGMIAAECATVGSNTVYKAISTQEISYYVFTFYTCLAAALVLLPFAFIFRRSG 1655
+R+ P MIA EC TVGS+ +YKA + + S+YVF FY + A LVLL + IF RS
Sbjct: 12 KRDVVPFTAMIAIECTTVGSSILYKAATLRGFSFYVFVFYAYVGATLVLLLLSLIFGRSR 71
Query: 1656 VFPSDKLSSFLLRLIFLSAMGVACQLFAYKGLEYSSPTLASAISNLIPALTFIFAVLFGM 1715
P+ K SS ++ L+ +G+ ++ KG+EYSSPTL+SAISNL PA TFI A+ F M
Sbjct: 72 SLPTAK-SSLFFKIFLLALLGLTSRVAGCKGIEYSSPTLSSAISNLTPAFTFILAIFFRM 131
Query: 1716 EKLALKGSSSIAKIIGSVVSISGALVVVLYKGPVILSNPFSGPTRLNLPHHPLGSTQPNW 1775
E++ L+ S++ AKIIG++VSISGALV+VLYKGP +L S + +W
Sbjct: 132 EQVMLRSSATQAKIIGTIVSISGALVIVLYKGPKLLVAA------------SFTSFESSW 191
Query: 1776 IMGGLCFFAQYLLNSFWYIILTQMVNMYPDELAVVCLYYVFEAIIAAPICLLVEGNLSAW 1835
I+GGL Q+LL S W+I+ T ++ +YP+E+AVV Y + +I+ +CLLVE +L++W
Sbjct: 192 IIGGLLLGLQFLLLSVWFILQTHIMEIYPEEIAVVFCYNLCATLISGTVCLLVEKDLNSW 251
Query: 1836 KLKNGLELVAVLNSGCVGQSFVTAIHTWGVHVKGPVYVSSFRPLSIAIAAATGVIFLGDD 1895
+LK G L +V+ SG S + IHTWG+HVKGPVY+S F+PLSIAIA A IFLGD
Sbjct: 252 QLKPGFSLASVIYSGLFDTSLGSVIHTWGLHVKGPVYISLFKPLSIAIAVAMAAIFLGDT 302
Query: 1896 LYLG 1900
L+LG
Sbjct: 312 LHLG 302
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9ATY5 | 7.5e-291 | 43.50 | DNA repair protein UVH3 OS=Arabidopsis thaliana OX=3702 GN=UVH3 PE=2 SV=1 | [more] |
F4KHA8 | 6.1e-75 | 51.15 | WAT1-related protein At5g40230 OS=Arabidopsis thaliana OX=3702 GN=At5g40230 PE=3... | [more] |
Q9FL08 | 3.9e-74 | 50.00 | WAT1-related protein At5g40240 OS=Arabidopsis thaliana OX=3702 GN=At5g40240 PE=2... | [more] |
P35689 | 3.1e-71 | 25.88 | DNA excision repair protein ERCC-5 OS=Mus musculus OX=10090 GN=Ercc5 PE=1 SV=4 | [more] |
P14629 | 3.1e-71 | 24.62 | DNA excision repair protein ERCC-5 homolog OS=Xenopus laevis OX=8355 GN=ercc5 PE... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1DAD7 | 0.0e+00 | 82.49 | DNA repair protein UVH3 isoform X3 OS=Momordica charantia OX=3673 GN=LOC11101834... | [more] |
A0A6J1D8H5 | 0.0e+00 | 81.23 | DNA repair protein UVH3 isoform X1 OS=Momordica charantia OX=3673 GN=LOC11101834... | [more] |
A0A6J1D7H7 | 0.0e+00 | 81.15 | DNA repair protein UVH3 isoform X2 OS=Momordica charantia OX=3673 GN=LOC11101834... | [more] |
A0A6J1ERW5 | 0.0e+00 | 77.52 | DNA repair protein UVH3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435307... | [more] |
A0A6J1JJE1 | 0.0e+00 | 76.82 | DNA repair protein UVH3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111486346 P... | [more] |