Homology
BLAST of Spg001975 vs. NCBI nr
Match:
XP_038903932.1 (DNA repair protein UVH3 isoform X2 [Benincasa hispida])
HSP 1 Score: 2449.5 bits (6347), Expect = 0.0e+00
Identity = 1311/1576 (83.19%), Postives = 1410/1576 (89.47%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAH+LGFFR
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNH+K MRL+
Sbjct: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHIKVMRLK 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
ELAEDI+NQKQQRKQ + KK TL + ++ +GTSTSE + +PN S ENLD MLAASIM
Sbjct: 121 ELAEDIQNQKQQRKQKLSKKSTLPSRDKNFNGTSTSESCEGIPNRGSLENLDEMLAASIM 180
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
AEENG FLSSASSF+G++LAKE GE SILNQKY ++SKGK+ILSDE +++GSDSERMEV
Sbjct: 181 AEENGLFLSSASSFSGATLAKEGGGEGSILNQKYNNESKGKEILSDETYIVGSDSERMEV 240
Query: 241 ASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTDDEDEEMILPEMHGVVD 300
ASRS HQQNLDEMLAASIAAEEARSLNENASVSA NLDGEDTDDEDEEMILPEMHGVVD
Sbjct: 241 ASRSVHQQNLDEMLAASIAAEEARSLNENASVSAVTNLDGEDTDDEDEEMILPEMHGVVD 300
Query: 301 PSVLAALPPSVQLDLLVQ------------------DPAKFSELQIQAYLKTVAFRRDID 360
PSVLAALPPSVQLDLLVQ DPAKFSELQIQAYLKTVAFRRDID
Sbjct: 301 PSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRDID 360
Query: 361 QVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDLQAPRVQQ 420
QVQKAA+GRGVGGVQTSRIASEANREFIFSSSFTGDKQ LAS+ EKNGDKDLQAP VQQ
Sbjct: 361 QVQKAAAGRGVGGVQTSRIASEANREFIFSSSFTGDKQVLASTIVEKNGDKDLQAPTVQQ 420
Query: 421 PLSSLNNTEIPSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHMTRDLER 480
PLSSL NTEIPSTSN LA+STPDKSG EDNIETFLDERGRVRVSRV+AMGMHMTRDLER
Sbjct: 421 PLSSLKNTEIPSTSNPLAQSTPDKSGGFEDNIETFLDERGRVRVSRVKAMGMHMTRDLER 480
Query: 481 NLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSINKLNEKGA 540
NLDLMKEIEKN SANKAANPE +Q+IEICNPE+FSFQSQVLDTSDEGVG SINKLNE+G
Sbjct: 481 NLDLMKEIEKNTSANKAANPEPIQNIEICNPENFSFQSQVLDTSDEGVGGSINKLNERGG 540
Query: 541 EFMLNEDTAIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDSGW 600
E MLNE+TAIEILLEDEGGKSFDGDDDLFT+LAAENPI MASFDIS QKLSLD TTDSGW
Sbjct: 541 EPMLNEETAIEILLEDEGGKSFDGDDDLFTNLAAENPIGMASFDISTQKLSLDGTTDSGW 600
Query: 601 EEKLEGKAYSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAESGKSVSKGSM 660
E+ +EGK YSPKNV+VDDHSFKEGTVSDES+VDWEDG CDHVNPVPFEA+ +SVSKG +
Sbjct: 601 EDAVEGKTYSPKNVEVDDHSFKEGTVSDESDVDWEDGACDHVNPVPFEADLAQSVSKGFL 660
Query: 661 EEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQQVIVGKMAEQCMSVQNENVIGLEKLDS 720
EEEADLQEAIRRSLED G KSG VSSE QQPQ VIVGK AEQC SVQNE++IGL+KLDS
Sbjct: 661 EEEADLQEAIRRSLEDRGYTKSGTVSSELQQPQPVIVGKRAEQCTSVQNESMIGLDKLDS 720
Query: 721 ADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATFSHKES 780
ADGMNC N NDSTRT+GMTESSSQEKQC EPV+ LDTKTHTIAEQLDAS N A FS KES
Sbjct: 721 ADGMNCLNFNDSTRTEGMTESSSQEKQCSEPVMSLDTKTHTIAEQLDASYNVAKFSPKES 780
Query: 781 NENNDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTCENHFK 840
NENNDTL+ LSRD GA+QV DRINNTVI+ PC MVEMEGIY P SS K F CEN+FK
Sbjct: 781 NENNDTLEPLSRDTFGAVQVGDRINNTVIDPPCRMVEMEGIYPPGNGSSRKPFACENNFK 840
Query: 841 QNLPVDKCNSDLLLE-KDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDEQKRL 900
QNLPVD+ ++DL L+ KDAK +VEE SNAEIE+T+DEL NR SVLEQERLNLGDEQKRL
Sbjct: 841 QNLPVDEHSNDLSLDIKDAKILSVEETSNAEIEVTDDELKNRFSVLEQERLNLGDEQKRL 900
Query: 901 ERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFG 960
ERNAESV+SEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFG
Sbjct: 901 ERNAESVNSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFG 960
Query: 961 ARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAVEV 1020
ARSVYKNIFDDRKYVETYFMKDVENELGLN+DKLIRMALLLGSDYTEGISGIGIVNAVEV
Sbjct: 961 ARSVYKNIFDDRKYVETYFMKDVENELGLNQDKLIRMALLLGSDYTEGISGIGIVNAVEV 1020
Query: 1021 MNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARDGSA 1080
MNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGL+AR+RGSKASENDMTCSN+ GSA
Sbjct: 1021 MNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLTARRRGSKASENDMTCSNTG---GSA 1080
Query: 1081 SEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWG 1140
SEENI KDL+ENI VKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWG
Sbjct: 1081 SEENISKDLEENIAVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWG 1140
Query: 1141 KPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRI 1200
KPDHFVLRRLCWEKFGW+NSKADELLLPVLKEY KHETQLRLEAFYTFNERFAKIRSKRI
Sbjct: 1141 KPDHFVLRRLCWEKFGWENSKADELLLPVLKEYSKHETQLRLEAFYTFNERFAKIRSKRI 1200
Query: 1201 KKAVKSITGSKSAVLMDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGACSNEDDIENR 1260
KKAVKSITGS+SAVLMDD VRDVSVN QRELSVEP ENMSEKCSS+ Q ACSNEDD R
Sbjct: 1201 KKAVKSITGSRSAVLMDDAVRDVSVNNQRELSVEPKENMSEKCSSERQDACSNEDD---R 1260
Query: 1261 LQKPSRKRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRSE--RGRGRGEGRGRGRLASKG 1320
+KPSRKRQLDGE++QP KD+K T K KGK SRN GS SE RGRGRG+GRGRGRL SKG
Sbjct: 1261 HRKPSRKRQLDGEQAQPGKDRKLTKKEKGKPSRNEGSHSERGRGRGRGKGRGRGRLVSKG 1320
Query: 1321 KTPITELVETSSSDDGSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDGDQPSDHSE 1380
K PITEL+ETSSSDD SEFD+QKFDLEN QEPQE+RRS+RIRKS SYT+D+ DQ SDH+
Sbjct: 1321 KAPITELIETSSSDDESEFDNQKFDLENFQEPQEKRRSSRIRKSASYTIDNADQQSDHTG 1380
Query: 1381 ERFSNDEAKDDNVARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGGGFCLAEDEM 1440
+ FSND+A++D V + QYAHPETVMSQ +N E G+PK+SPQ DYL+TGGGFCL EDEM
Sbjct: 1381 DEFSNDKAEEDRVIQGQYAHPETVMSQSENMESGSGSPKRSPQNDYLKTGGGFCLVEDEM 1440
Query: 1441 SQEAMCMNKDLPLEANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEVPKDGSGDDP 1500
S++ MC NKD LEANNSEDYLTMGGGFCLDDD+E +D VAHP+QAT+LEVPKDG +DP
Sbjct: 1441 SRQEMCQNKDPALEANNSEDYLTMGGGFCLDDDDERIDPVAHPNQATVLEVPKDGFENDP 1500
Query: 1501 GQSTFHPEKCIGGDQNKEDTDARGDSLLDMGDPNPVSNPNSSQLGEGVQEEPKDHSV-RT 1555
GQST PEK +G EDTDAR +S+LD+G+PNPV+N NSSQ+GE VQEEPKDHSV R
Sbjct: 1501 GQSTVSPEKHVG----VEDTDARVESVLDVGNPNPVNNSNSSQVGEDVQEEPKDHSVRRA 1560
BLAST of Spg001975 vs. NCBI nr
Match:
KAG6588970.1 (Potassium transporter 8, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2438.7 bits (6319), Expect = 0.0e+00
Identity = 1309/1564 (83.70%), Postives = 1395/1564 (89.19%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGVHGLWELL PVGRRVSVETLAGKKLAIDASIWMVQFIKAMRD+RGEMVRNAH+LGFFR
Sbjct: 813 MGVHGLWELLTPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDERGEMVRNAHLLGFFR 872
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATP+LKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRLR
Sbjct: 873 RICKLLFLRTKPVFVFDGATPSLKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKEMRLR 932
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
ELAE IKNQKQQRKQDVPKKKTLLNHNEIVDGTS SERSKSVPN +HENLDGM+AASIM
Sbjct: 933 ELAEGIKNQKQQRKQDVPKKKTLLNHNEIVDGTSVSERSKSVPNGGNHENLDGMVAASIM 992
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
EENGFF SSA SF+G +L K+D GEQSILNQKYK+DSKGKKILSDE+HV+GSDSERMEV
Sbjct: 993 IEENGFFSSSAPSFSGVTLPKKDRGEQSILNQKYKNDSKGKKILSDEIHVVGSDSERMEV 1052
Query: 241 ASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTD--DEDEEMILPEMHGV 300
ASRSAHQQNLDEMLAASIAAEEAR LNEN SVS+AANL GED D DEDEEMILPEMHGV
Sbjct: 1053 ASRSAHQQNLDEMLAASIAAEEARGLNENISVSSAANLAGEDMDDEDEDEEMILPEMHGV 1112
Query: 301 VDPSVLAALPPSVQLDLLVQDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQT 360
VDPSVLAALPPSVQLDLLVQDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQT
Sbjct: 1113 VDPSVLAALPPSVQLDLLVQDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQT 1172
Query: 361 SRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDLQAPRVQQPLSSLNNTEIPSTSNA 420
SRIASEANREFIFSSSFTGDKQ L S+RAEKNGDK+LQ PRVQQ LSSLNNT+IPSTSN
Sbjct: 1173 SRIASEANREFIFSSSFTGDKQVLTSTRAEKNGDKNLQEPRVQQSLSSLNNTDIPSTSNG 1232
Query: 421 LARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANK 480
LA+STPDKSGV EDNIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKN +ANK
Sbjct: 1233 LAQSTPDKSGVFEDNIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNINANK 1292
Query: 481 AANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSINKLNEKGAEFMLNEDTAIEILLED 540
ANPE MQ+IEICNPES S +SQVLD S+EG+ +SINKL+E+GA+ MLNEDTAIEILLE
Sbjct: 1293 VANPEPMQNIEICNPESSSLRSQVLDVSNEGIDESINKLDERGADSMLNEDTAIEILLEG 1352
Query: 541 EGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDSGWEEKLEGKAYSPKNVQV 600
EGGKSFDGDDDLFTHLAAENPIQMASFDIS QKLS D TTDSGW+E L
Sbjct: 1353 EGGKSFDGDDDLFTHLAAENPIQMASFDISSQKLSQDGTTDSGWKEAL------------ 1412
Query: 601 DDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAESGKSVSKGSMEEEADLQEAIRRSLED 660
EGT+SDESEVDWEDGVCDHVNPVPFE ESGKSVSKGS+EEEADLQEAIRRSLED
Sbjct: 1413 ------EGTISDESEVDWEDGVCDHVNPVPFEDESGKSVSKGSLEEEADLQEAIRRSLED 1472
Query: 661 IGDRKSGPVSSEHQQPQQ--VIVGKMAEQCMSVQNENVIGLEKLDSADGMNCSNANDSTR 720
+GD KSGPVS EH+QPQ IVGKMAEQC SV+NENVIGLEK+DS DGMN SNA DS
Sbjct: 1473 VGDGKSGPVSLEHEQPQSQPSIVGKMAEQCTSVENENVIGLEKMDSVDGMNWSNAKDSIL 1532
Query: 721 TKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATFSHKESNENNDTLKALSRDA 780
KGMTESSSQEKQC EPVVLLDT TIAEQLDAS D +FS +ESNE++DTLK+LSRDA
Sbjct: 1533 KKGMTESSSQEKQCSEPVVLLDT---TIAEQLDASYKDTSFSLQESNESSDTLKSLSRDA 1592
Query: 781 SGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTCENHFKQNLPVDKCNSDLLL 840
A QV D IN+T+IE C MVEM+G+ TP++DSS K ENHFKQNLPV+K +SDLLL
Sbjct: 1593 PRATQVGDMINSTMIEPACRMVEMDGVNTPDVDSSTKDSAFENHFKQNLPVEKHSSDLLL 1652
Query: 841 EKDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDEQKRLERNAESVSSEMFAEC 900
E++ K EIS AE E+TEDEL +RIS+LEQERLNLGDEQKRLERNAE+VSSEMFAEC
Sbjct: 1653 EEEVGKGHTVEISKAETEVTEDELKSRISILEQERLNLGDEQKRLERNAEAVSSEMFAEC 1712
Query: 901 QELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYV 960
QELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYV
Sbjct: 1713 QELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYV 1772
Query: 961 ETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKE 1020
ETYFMKDVENELGL+R+KLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKE
Sbjct: 1773 ETYFMKDVENELGLDRNKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKE 1832
Query: 1021 WIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARDGSASEENIDKDLKENIDV 1080
WIESPDPSILGTLGAKTGLSARKRG KASEND CSNSS RDGSASEENIDKDLKENIDV
Sbjct: 1833 WIESPDPSILGTLGAKTGLSARKRGQKASENDAPCSNSSVRDGSASEENIDKDLKENIDV 1892
Query: 1081 KQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWGKPDHFVLRRLCWEKF 1140
KQ+FM KHRNVSKNWHIPSEFPSEAVISAYI PQVDKSAEPFSWGKPDHFVLRRLC EKF
Sbjct: 1893 KQNFMVKHRNVSKNWHIPSEFPSEAVISAYISPQVDKSAEPFSWGKPDHFVLRRLCLEKF 1952
Query: 1141 GWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSKSAVL 1200
GW+NSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSKSA L
Sbjct: 1953 GWENSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSKSASL 2012
Query: 1201 MDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGACSNEDDIENRLQKPSRKRQLDGERS 1260
MD+TV +VSVN Q LS E +NMSEKCSS+IQGACSNED+++NRL+KPSRKRQLD E+S
Sbjct: 2013 MDETVPNVSVNNQINLSGETQKNMSEKCSSEIQGACSNEDNVDNRLRKPSRKRQLDREQS 2072
Query: 1261 QPAKDKKRTMKGKGKQSRNGGSRSE--RGRGRGEGRGRGRLASKGKTPITELVETSSSDD 1320
QPAKD+K TMK KGK+SRN GS SE RGRGRG+GRGRGRLA KGK+PITE V TSSSDD
Sbjct: 2073 QPAKDRKLTMKEKGKRSRNEGSHSERGRGRGRGKGRGRGRLALKGKSPITEFVGTSSSDD 2132
Query: 1321 GSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDG--DQPSDHSEERFSNDEAKDDNV 1380
SEFDDQK DLEN+QEPQERR+S+R+RKS SY MDD DQPSDHS R SNDEA DDNV
Sbjct: 2133 ESEFDDQKIDLENVQEPQERRKSSRVRKSASYKMDDADQDQPSDHSGYRLSNDEANDDNV 2192
Query: 1381 ARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGGGFCLAEDEMSQEAMCMNKDLPL 1440
+ Y PETVM +NTECD+ PK+SP RDYL TGGGFC EDEMS+EAMC NKD L
Sbjct: 2193 VQGGYTGPETVMIHSENTECDYEIPKRSPLRDYLGTGGGFCPTEDEMSREAMCQNKDPAL 2252
Query: 1441 EANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEVPKDGSGDDPGQSTFHPEKCIGG 1500
EA+NSEDYLT+GGGFCLDDDNECVD VAH DQAT EVPKDGS DDP QSTFHPEK IGG
Sbjct: 2253 EASNSEDYLTLGGGFCLDDDNECVDPVAHLDQATASEVPKDGSEDDPDQSTFHPEKDIGG 2312
Query: 1501 DQNKEDTDARGDSLLDMGDPNPVSNPNSSQLGEGVQEEPKDHSVRTFGGALSAMPNLRRK 1557
+Q EDT G+SLLD+GDPNP S PNSS++GEGVQEEPKDHSVR FGGALSAMPNLRRK
Sbjct: 2313 NQLNEDTYPHGESLLDVGDPNPASFPNSSRVGEGVQEEPKDHSVRAFGGALSAMPNLRRK 2355
BLAST of Spg001975 vs. NCBI nr
Match:
XP_023529571.1 (DNA repair protein UVH3 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2434.1 bits (6307), Expect = 0.0e+00
Identity = 1312/1580 (83.04%), Postives = 1393/1580 (88.16%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRD+RGEMVRNAH+LGFFR
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDERGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATP+LKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRLR
Sbjct: 61 RICKLLFLRTKPVFVFDGATPSLKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKEMRLR 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
ELAE IKNQKQQRKQDVPKKKTLLNHNE VDGTS SERSKSVPNS +HENLDGM+AASIM
Sbjct: 121 ELAEGIKNQKQQRKQDVPKKKTLLNHNETVDGTSVSERSKSVPNSGNHENLDGMVAASIM 180
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
EENG F SSA SFAG +L KED GEQS NQKYK+DSKGKKILSDE+HV+GSDSERMEV
Sbjct: 181 IEENGVFSSSAPSFAGVTLPKEDRGEQSTWNQKYKNDSKGKKILSDEIHVVGSDSERMEV 240
Query: 241 ASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTD--DEDEEMILPEMHGV 300
ASRSAHQQNLDEMLAASIAAEEAR LNEN SVS+AANL GED D DEDEEMILPEMHGV
Sbjct: 241 ASRSAHQQNLDEMLAASIAAEEARGLNENVSVSSAANLAGEDMDDEDEDEEMILPEMHGV 300
Query: 301 VDPSVLAALPPSVQLDLLVQ------------------DPAKFSELQIQAYLKTVAFRRD 360
VDPSVLAALPPSVQLDLLVQ DPAKFSELQIQAYLKTVAFRRD
Sbjct: 301 VDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRD 360
Query: 361 IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDLQAPRV 420
IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQ L S+RAEKNGDKDLQ PRV
Sbjct: 361 IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQVLTSTRAEKNGDKDLQEPRV 420
Query: 421 QQPLSSLNNTEIPSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHMTRDL 480
QQ LSSLNNT+IPSTSN LA+STPDKSG EDNIETFLDERGRVRVSRVRAMGMHMTRDL
Sbjct: 421 QQSLSSLNNTDIPSTSNGLAQSTPDKSGGFEDNIETFLDERGRVRVSRVRAMGMHMTRDL 480
Query: 481 ERNLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSINKLNEK 540
ERNLDLMKEIEKN +ANKAANPE MQ+IEICNPES S +SQVLD S+EGV +SINKL+E+
Sbjct: 481 ERNLDLMKEIEKNINANKAANPEPMQNIEICNPESSSLRSQVLDVSNEGVNESINKLDER 540
Query: 541 GAEFMLNEDTAIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDS 600
GA+ MLNEDTAIEILLE EGGKSFDGDDDLFTHLAAENPIQMASFD+S QKLS D TTDS
Sbjct: 541 GADSMLNEDTAIEILLEGEGGKSFDGDDDLFTHLAAENPIQMASFDMSSQKLSQDGTTDS 600
Query: 601 GWEEKLEGKAYSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAESGKSVSKG 660
GW+E L EGTVSDESEVDWEDGVCDHVNPVPFE ESGKSVSKG
Sbjct: 601 GWKEAL------------------EGTVSDESEVDWEDGVCDHVNPVPFEDESGKSVSKG 660
Query: 661 SMEEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQQ--VIVGKMAEQCMSVQNENVIGLE 720
S+EEEADLQEAIRRSLED+GD KS PVS EH QPQ IVGKMAE+C SV+NENVIGLE
Sbjct: 661 SLEEEADLQEAIRRSLEDVGDGKSAPVSLEHDQPQSQPSIVGKMAERCTSVENENVIGLE 720
Query: 721 KLDSADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATFS 780
K+DS DGMN SNA DS KGMTESSSQEKQC EPVVLLDT TIAEQLDAS D +FS
Sbjct: 721 KMDSVDGMNWSNAKDSILKKGMTESSSQEKQCSEPVVLLDT---TIAEQLDASYKDTSFS 780
Query: 781 HKESNENNDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTCE 840
+ESNEN+DTLK+LSRDA A QV D IN+T+IE C MVEM+G+ TP++DSS K E
Sbjct: 781 LQESNENSDTLKSLSRDAPRATQVGDMINSTMIEPACRMVEMDGVSTPDVDSSTKDSAFE 840
Query: 841 NHFKQNLPVDKCNSDLLLEKDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDEQ 900
NHFKQNLPV+K +SDLLLE++ K EIS AE E+TEDEL +RIS+LEQERLNLGDEQ
Sbjct: 841 NHFKQNLPVEKHSSDLLLEEEVGKGHTVEISKAETEVTEDELKSRISILEQERLNLGDEQ 900
Query: 901 KRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF 960
KRLERNAE+VSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF
Sbjct: 901 KRLERNAEAVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF 960
Query: 961 LFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNA 1020
LFGARSVYKNIFDDRKYVETYFMKDVENELGL+R+KLIRMALLLGSDYTEGISGIGIVNA
Sbjct: 961 LFGARSVYKNIFDDRKYVETYFMKDVENELGLDRNKLIRMALLLGSDYTEGISGIGIVNA 1020
Query: 1021 VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARD 1080
VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRG KASEND TCSNS+ RD
Sbjct: 1021 VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGQKASENDATCSNSNVRD 1080
Query: 1081 GSASEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPF 1140
GSASEENIDKDLKENIDVKQ+FM KHRNVSKNWHIPSEFPSEAVISAYI PQVDKSAEPF
Sbjct: 1081 GSASEENIDKDLKENIDVKQNFMVKHRNVSKNWHIPSEFPSEAVISAYISPQVDKSAEPF 1140
Query: 1141 SWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS 1200
SWGKPDHFVLRRLC EKFGW+NSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS
Sbjct: 1141 SWGKPDHFVLRRLCLEKFGWENSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS 1200
Query: 1201 KRIKKAVKSITGSKSAVLMDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGACSNEDDI 1260
KRIKKAVKSITGSKSA LMD+TV +VSVN Q LS E +NMSEKCSS+IQGACSNED++
Sbjct: 1201 KRIKKAVKSITGSKSASLMDETVPNVSVNNQINLSGETQKNMSEKCSSEIQGACSNEDNV 1260
Query: 1261 ENRLQKPSRKRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRSERGRGRGEGRGRGRLASK 1320
+NRL+KPSRKRQLD E+SQPAKD+K TMK KGK+SRN GS SERGRGRG+GRGRGRLASK
Sbjct: 1261 DNRLRKPSRKRQLDREQSQPAKDRKLTMKEKGKRSRNEGSHSERGRGRGKGRGRGRLASK 1320
Query: 1321 GKTPITELVETSSSDDGSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDG--DQPSD 1380
GK+PITE V TSSSDD SEFDDQKFDLEN+QEPQERR+S+R+RKS SY MDD DQPSD
Sbjct: 1321 GKSPITEFVGTSSSDDESEFDDQKFDLENVQEPQERRKSSRVRKSASYKMDDADQDQPSD 1380
Query: 1381 HSEERFSNDEAKDDNVARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGGGFCLAE 1440
HS R SNDEA DDNV + Y PETVM +NTECD+ PK+SP RDYL TGGGFC E
Sbjct: 1381 HSGYRLSNDEANDDNVVQGGYTGPETVMIHSENTECDYEIPKRSPLRDYLGTGGGFCPTE 1440
Query: 1441 DEMSQEAMCMNKDLPLEANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEVPKDGSG 1500
DEMS+EAMC NKD LEA+NSEDYLT+GGGFCLDDDNECVD VAH DQAT EVPKDGS
Sbjct: 1441 DEMSREAMCQNKDPALEASNSEDYLTLGGGFCLDDDNECVDPVAHLDQATASEVPKDGSE 1500
Query: 1501 DDPGQSTFHPEKCIGGDQNKEDTDARGDSLLDMGDPNPVSNPNSSQLGEGVQEEPKDHSV 1557
DDP QSTFHPEK IGGDQ EDT G+SLLD+GDPNP S PNSS++GEGVQEEPKDHS
Sbjct: 1501 DDPDQSTFHPEKDIGGDQLNEDTYPHGESLLDVGDPNPASFPNSSRVGEGVQEEPKDHSA 1559
BLAST of Spg001975 vs. NCBI nr
Match:
XP_038903931.1 (DNA repair protein UVH3 isoform X1 [Benincasa hispida])
HSP 1 Score: 2433.7 bits (6306), Expect = 0.0e+00
Identity = 1311/1606 (81.63%), Postives = 1410/1606 (87.80%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAI------------------------------D 60
MGVHGLWELLAPVGRRVSVETLAGKKLAI D
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIGTNLSLRTSLNYLKNFMSMRFSLWVCLSIED 60
Query: 61 ASIWMVQFIKAMRDDRGEMVRNAHILGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIA 120
ASIWMVQFIKAMRDDRGEMVRNAH+LGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIA
Sbjct: 61 ASIWMVQFIKAMRDDRGEMVRNAHLLGFFRRICKLLFLRTKPVFVFDGATPALKRRTLIA 120
Query: 121 RRRQRENAQAKVRKTAEKLLLNHLKAMRLRELAEDIKNQKQQRKQDVPKKKTLLNHNEIV 180
RRRQRENAQAKVRKTAEKLLLNH+K MRL+ELAEDI+NQKQQRKQ + KK TL + ++
Sbjct: 121 RRRQRENAQAKVRKTAEKLLLNHIKVMRLKELAEDIQNQKQQRKQKLSKKSTLPSRDKNF 180
Query: 181 DGTSTSERSKSVPNSDSHENLDGMLAASIMAEENGFFLSSASSFAGSSLAKEDSGEQSIL 240
+GTSTSE + +PN S ENLD MLAASIMAEENG FLSSASSF+G++LAKE GE SIL
Sbjct: 181 NGTSTSESCEGIPNRGSLENLDEMLAASIMAEENGLFLSSASSFSGATLAKEGGGEGSIL 240
Query: 241 NQKYKSDSKGKKILSDEVHVMGSDSERMEVASRSAHQQNLDEMLAASIAAEEARSLNENA 300
NQKY ++SKGK+ILSDE +++GSDSERMEVASRS HQQNLDEMLAASIAAEEARSLNENA
Sbjct: 241 NQKYNNESKGKEILSDETYIVGSDSERMEVASRSVHQQNLDEMLAASIAAEEARSLNENA 300
Query: 301 SVSAAANLDGEDTDDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQ------------ 360
SVSA NLDGEDTDDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQ
Sbjct: 301 SVSAVTNLDGEDTDDEDEEMILPEMHGVVDPSVLAALPPSVQLDLLVQMRERLMAENRQK 360
Query: 361 ------DPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFS 420
DPAKFSELQIQAYLKTVAFRRDIDQVQKAA+GRGVGGVQTSRIASEANREFIFS
Sbjct: 361 YQRVKKDPAKFSELQIQAYLKTVAFRRDIDQVQKAAAGRGVGGVQTSRIASEANREFIFS 420
Query: 421 SSFTGDKQALASSRAEKNGDKDLQAPRVQQPLSSLNNTEIPSTSNALARSTPDKSGVLED 480
SSFTGDKQ LAS+ EKNGDKDLQAP VQQPLSSL NTEIPSTSN LA+STPDKSG ED
Sbjct: 421 SSFTGDKQVLASTIVEKNGDKDLQAPTVQQPLSSLKNTEIPSTSNPLAQSTPDKSGGFED 480
Query: 481 NIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASANKAANPEHMQHIEICN 540
NIETFLDERGRVRVSRV+AMGMHMTRDLERNLDLMKEIEKN SANKAANPE +Q+IEICN
Sbjct: 481 NIETFLDERGRVRVSRVKAMGMHMTRDLERNLDLMKEIEKNTSANKAANPEPIQNIEICN 540
Query: 541 PESFSFQSQVLDTSDEGVGDSINKLNEKGAEFMLNEDTAIEILLEDEGGKSFDGDDDLFT 600
PE+FSFQSQVLDTSDEGVG SINKLNE+G E MLNE+TAIEILLEDEGGKSFDGDDDLFT
Sbjct: 541 PENFSFQSQVLDTSDEGVGGSINKLNERGGEPMLNEETAIEILLEDEGGKSFDGDDDLFT 600
Query: 601 HLAAENPIQMASFDISPQKLSLDSTTDSGWEEKLEGKAYSPKNVQVDDHSFKEGTVSDES 660
+LAAENPI MASFDIS QKLSLD TTDSGWE+ +EGK YSPKNV+VDDHSFKEGTVSDES
Sbjct: 601 NLAAENPIGMASFDISTQKLSLDGTTDSGWEDAVEGKTYSPKNVEVDDHSFKEGTVSDES 660
Query: 661 EVDWEDGVCDHVNPVPFEAESGKSVSKGSMEEEADLQEAIRRSLEDIGDRKSGPVSSEHQ 720
+VDWEDG CDHVNPVPFEA+ +SVSKG +EEEADLQEAIRRSLED G KSG VSSE Q
Sbjct: 661 DVDWEDGACDHVNPVPFEADLAQSVSKGFLEEEADLQEAIRRSLEDRGYTKSGTVSSELQ 720
Query: 721 QPQQVIVGKMAEQCMSVQNENVIGLEKLDSADGMNCSNANDSTRTKGMTESSSQEKQCPE 780
QPQ VIVGK AEQC SVQNE++IGL+KLDSADGMNC N NDSTRT+GMTESSSQEKQC E
Sbjct: 721 QPQPVIVGKRAEQCTSVQNESMIGLDKLDSADGMNCLNFNDSTRTEGMTESSSQEKQCSE 780
Query: 781 PVVLLDTKTHTIAEQLDASCNDATFSHKESNENNDTLKALSRDASGAIQVEDRINNTVIE 840
PV+ LDTKTHTIAEQLDAS N A FS KESNENNDTL+ LSRD GA+QV DRINNTVI+
Sbjct: 781 PVMSLDTKTHTIAEQLDASYNVAKFSPKESNENNDTLEPLSRDTFGAVQVGDRINNTVID 840
Query: 841 APCCMVEMEGIYTPNIDSSPKAFTCENHFKQNLPVDKCNSDLLLE-KDAKKPAVEEISNA 900
PC MVEMEGIY P SS K F CEN+FKQNLPVD+ ++DL L+ KDAK +VEE SNA
Sbjct: 841 PPCRMVEMEGIYPPGNGSSRKPFACENNFKQNLPVDEHSNDLSLDIKDAKILSVEETSNA 900
Query: 901 EIEITEDELTNRISVLEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAP 960
EIE+T+DEL NR SVLEQERLNLGDEQKRLERNAESV+SEMFAECQELLQMFGLPYIIAP
Sbjct: 901 EIEVTDDELKNRFSVLEQERLNLGDEQKRLERNAESVNSEMFAECQELLQMFGLPYIIAP 960
Query: 961 MEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLN 1020
MEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLN
Sbjct: 961 MEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLN 1020
Query: 1021 RDKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTLGA 1080
+DKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTLGA
Sbjct: 1021 QDKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTLGA 1080
Query: 1081 KTGLSARKRGSKASENDMTCSNSSARDGSASEENIDKDLKENIDVKQSFMDKHRNVSKNW 1140
KTGL+AR+RGSKASENDMTCSN+ GSASEENI KDL+ENI VKQSFMDKHRNVSKNW
Sbjct: 1081 KTGLTARRRGSKASENDMTCSNTG---GSASEENISKDLEENIAVKQSFMDKHRNVSKNW 1140
Query: 1141 HIPSEFPSEAVISAYICPQVDKSAEPFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVL 1200
HIPSEFPSEAVISAYICPQVDKSAEPFSWGKPDHFVLRRLCWEKFGW+NSKADELLLPVL
Sbjct: 1141 HIPSEFPSEAVISAYICPQVDKSAEPFSWGKPDHFVLRRLCWEKFGWENSKADELLLPVL 1200
Query: 1201 KEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSKSAVLMDDTVRDVSVNYQRE 1260
KEY KHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGS+SAVLMDD VRDVSVN QRE
Sbjct: 1201 KEYSKHETQLRLEAFYTFNERFAKIRSKRIKKAVKSITGSRSAVLMDDAVRDVSVNNQRE 1260
Query: 1261 LSVEPDENMSEKCSSKIQGACSNEDDIENRLQKPSRKRQLDGERSQPAKDKKRTMKGKGK 1320
LSVEP ENMSEKCSS+ Q ACSNEDD R +KPSRKRQLDGE++QP KD+K T K KGK
Sbjct: 1261 LSVEPKENMSEKCSSERQDACSNEDD---RHRKPSRKRQLDGEQAQPGKDRKLTKKEKGK 1320
Query: 1321 QSRNGGSRSE--RGRGRGEGRGRGRLASKGKTPITELVETSSSDDGSEFDDQKFDLENLQ 1380
SRN GS SE RGRGRG+GRGRGRL SKGK PITEL+ETSSSDD SEFD+QKFDLEN Q
Sbjct: 1321 PSRNEGSHSERGRGRGRGKGRGRGRLVSKGKAPITELIETSSSDDESEFDNQKFDLENFQ 1380
Query: 1381 EPQERRRSTRIRKSVSYTMDDGDQPSDHSEERFSNDEAKDDNVARDQYAHPETVMSQFKN 1440
EPQE+RRS+RIRKS SYT+D+ DQ SDH+ + FSND+A++D V + QYAHPETVMSQ +N
Sbjct: 1381 EPQEKRRSSRIRKSASYTIDNADQQSDHTGDEFSNDKAEEDRVIQGQYAHPETVMSQSEN 1440
Query: 1441 TECDFGTPKKSPQRDYLETGGGFCLAEDEMSQEAMCMNKDLPLEANNSEDYLTMGGGFCL 1500
E G+PK+SPQ DYL+TGGGFCL EDEMS++ MC NKD LEANNSEDYLTMGGGFCL
Sbjct: 1441 MESGSGSPKRSPQNDYLKTGGGFCLVEDEMSRQEMCQNKDPALEANNSEDYLTMGGGFCL 1500
Query: 1501 DDDNECVDTVAHPDQATILEVPKDGSGDDPGQSTFHPEKCIGGDQNKEDTDARGDSLLDM 1555
DDD+E +D VAHP+QAT+LEVPKDG +DPGQST PEK +G EDTDAR +S+LD+
Sbjct: 1501 DDDDERIDPVAHPNQATVLEVPKDGFENDPGQSTVSPEKHVG----VEDTDARVESVLDV 1560
BLAST of Spg001975 vs. NCBI nr
Match:
KAG7015248.1 (DNA repair protein UVH3 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 2429.1 bits (6294), Expect = 0.0e+00
Identity = 1310/1582 (82.81%), Postives = 1396/1582 (88.24%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRD+RGEMVRNAH+LGFFR
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDERGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATP+LKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRLR
Sbjct: 61 RICKLLFLRTKPVFVFDGATPSLKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKEMRLR 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
ELAE IKNQKQQRKQDVPKKKTLLNHNEIVDGTS SERSKSVPN +HENLDGM+AASIM
Sbjct: 121 ELAEGIKNQKQQRKQDVPKKKTLLNHNEIVDGTSVSERSKSVPNGGNHENLDGMVAASIM 180
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
EENGFF SSA SF+G +L K+D GEQSILNQKYK+DSKGKKILSDE+HV+GSDSERMEV
Sbjct: 181 IEENGFFSSSAPSFSGVTLPKKDRGEQSILNQKYKNDSKGKKILSDEIHVVGSDSERMEV 240
Query: 241 ASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTD--DEDEEMILPEMHGV 300
ASRSAHQQNLDEMLAASIAAEEAR LNEN SVS+AANL GED D DEDEEMILPEMHGV
Sbjct: 241 ASRSAHQQNLDEMLAASIAAEEARGLNENISVSSAANLAGEDMDDEDEDEEMILPEMHGV 300
Query: 301 VDPSVLAALPPSVQLDLLVQ------------------DPAKFSELQIQAYLKTVAFRRD 360
VDPSVLAALPPSVQLDLLVQ DPAKFSELQIQAYLKTVAFRRD
Sbjct: 301 VDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRD 360
Query: 361 IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDLQAPRV 420
IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQ L S+RAEKNGDK+LQ PRV
Sbjct: 361 IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQVLTSTRAEKNGDKNLQEPRV 420
Query: 421 QQPLSSLNNTEIPSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHMTRDL 480
QQ LSSLNNT+IPSTSN LA+STPDKSGV EDNIETFLDERGRVRVSRVRAMGMHMTRDL
Sbjct: 421 QQSLSSLNNTDIPSTSNGLAQSTPDKSGVFEDNIETFLDERGRVRVSRVRAMGMHMTRDL 480
Query: 481 ERNLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSINKLNEK 540
ERNLDLMKEIEKN +ANK ANPE MQ+IEICNPES S +SQVLD S+EG+ +SINKL+E+
Sbjct: 481 ERNLDLMKEIEKNINANKVANPEPMQNIEICNPESSSLRSQVLDVSNEGIDESINKLDER 540
Query: 541 GAEFMLNEDTAIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDS 600
GA+ MLNEDTAIEILLE EGGKSFDGDDDLFTHLAAENPIQMASFDIS QKLS D TTDS
Sbjct: 541 GADSMLNEDTAIEILLEGEGGKSFDGDDDLFTHLAAENPIQMASFDISSQKLSQDGTTDS 600
Query: 601 GWEEKLEGKAYSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAESGKSVSKG 660
GW+E L EGT+SDESEVDWEDGVCDHVNPVPFE ESGKSVSKG
Sbjct: 601 GWKEAL------------------EGTISDESEVDWEDGVCDHVNPVPFEDESGKSVSKG 660
Query: 661 SMEEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQQ--VIVGKMAEQCMSVQNENVIGLE 720
S+EEEADLQEAIRRSLED+GD KSGPVS EH+QPQ IVGKMAEQC SV+NENVIGLE
Sbjct: 661 SLEEEADLQEAIRRSLEDVGDGKSGPVSLEHEQPQSQPSIVGKMAEQCTSVENENVIGLE 720
Query: 721 KLDSADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATFS 780
K+DS DGMN SNA DS KGMTESSSQEKQC EPVVLLDT TIAEQLDAS D +FS
Sbjct: 721 KMDSVDGMNWSNAKDSILKKGMTESSSQEKQCSEPVVLLDT---TIAEQLDASYKDTSFS 780
Query: 781 HKESNENNDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTCE 840
+ESNE++DTLK+LSRDA A QV D IN+T+IE C MVEM+G+ TP++DSS K E
Sbjct: 781 LQESNESSDTLKSLSRDAPRATQVGDMINSTMIEPACRMVEMDGVNTPDVDSSTKDSAFE 840
Query: 841 NHFKQNLPVDKCNSDLLLEKDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDEQ 900
NHFKQNLPV+K +SDLLLE++ K EIS AE E+TEDEL +RIS+LEQERLNLGDEQ
Sbjct: 841 NHFKQNLPVEKHSSDLLLEEEVGKGHTVEISKAETEVTEDELKSRISILEQERLNLGDEQ 900
Query: 901 KRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF 960
KRLERNAE+VSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF
Sbjct: 901 KRLERNAEAVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF 960
Query: 961 LFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNA 1020
LFGARSVYKNIFDDRKYVETYFMKDVENELGL+R+KLIRMALLLGSDYTEGISGIGIVNA
Sbjct: 961 LFGARSVYKNIFDDRKYVETYFMKDVENELGLDRNKLIRMALLLGSDYTEGISGIGIVNA 1020
Query: 1021 VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARD 1080
VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRG KASEND CSNSS RD
Sbjct: 1021 VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGQKASENDAPCSNSSVRD 1080
Query: 1081 GSASEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPF 1140
GSASEENIDKDLKENIDVKQ+FM KHRNVSKNWHIPSEFPSEAVISAYI PQVDKSAEPF
Sbjct: 1081 GSASEENIDKDLKENIDVKQNFMVKHRNVSKNWHIPSEFPSEAVISAYISPQVDKSAEPF 1140
Query: 1141 SWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS 1200
SWGKPDHFVLRRLC EKFGW+NSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS
Sbjct: 1141 SWGKPDHFVLRRLCLEKFGWENSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS 1200
Query: 1201 KRIKKAVKSITGSKSAVLMDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGACSNEDDI 1260
KRIKKAVKSITGSKSA LMD+TV +VSVN Q LS E +NMSEKCSS+IQGACSNED++
Sbjct: 1201 KRIKKAVKSITGSKSASLMDETVPNVSVNNQINLSGETQKNMSEKCSSEIQGACSNEDNV 1260
Query: 1261 ENRLQKPSRKRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRSE--RGRGRGEGRGRGRLA 1320
+NRL+KPSRKRQLD E+SQPAKD+K TMK KGK+SRN GS SE RGRGRG+GRGRGRLA
Sbjct: 1261 DNRLRKPSRKRQLDREQSQPAKDRKLTMKEKGKRSRNEGSHSERGRGRGRGKGRGRGRLA 1320
Query: 1321 SKGKTPITELVETSSSDDGSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDG--DQP 1380
KGK+PITE V TSSSDD SEFDDQK DLEN+QEPQERR+S+R+RKS SY MDD DQP
Sbjct: 1321 LKGKSPITEFVGTSSSDDESEFDDQKIDLENVQEPQERRKSSRVRKSASYKMDDADQDQP 1380
Query: 1381 SDHSEERFSNDEAKDDNVARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGGGFCL 1440
SDHS R SNDEA DDNV + Y PETVM +NTECD+ PK+SP RDYL TGGGFC
Sbjct: 1381 SDHSGYRLSNDEANDDNVVQGGYTGPETVMIHSENTECDYEIPKRSPLRDYLGTGGGFCP 1440
Query: 1441 AEDEMSQEAMCMNKDLPLEANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEVPKDG 1500
EDEMS+EAMC NKD LEA+NSEDYLT+GGGFCLDDDNECVD VAH DQAT EVPKDG
Sbjct: 1441 TEDEMSREAMCQNKDPALEASNSEDYLTLGGGFCLDDDNECVDPVAHLDQATASEVPKDG 1500
Query: 1501 SGDDPGQSTFHPEKCIGGDQNKEDTDARGDSLLDMGDPNPVSNPNSSQLGEGVQEEPKDH 1557
S DDP QSTFHPEK IGG+Q EDT G+SLLD+GDPNP S PNSS++GEGVQEEPKDH
Sbjct: 1501 SEDDPDQSTFHPEKDIGGNQLNEDTYPHGESLLDVGDPNPASFPNSSRVGEGVQEEPKDH 1560
BLAST of Spg001975 vs. ExPASy Swiss-Prot
Match:
Q9ATY5 (DNA repair protein UVH3 OS=Arabidopsis thaliana OX=3702 GN=UVH3 PE=2 SV=1)
HSP 1 Score: 989.9 bits (2558), Expect = 3.1e-287
Identity = 695/1629 (42.66%), Postives = 929/1629 (57.03%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGV GLWELLAPVGRRVSVETLA K+LAIDASIWMVQFIKAMRD++G+MV+NAH++GFFR
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLANKRLAIDASIWMVQFIKAMRDEKGDMVQNAHLIGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKP+FVFDGATPALKRRT+IARRRQRENAQ K+RKTAEKLLLN LK +RL+
Sbjct: 61 RICKLLFLRTKPIFVFDGATPALKRRTVIARRRQRENAQTKIRKTAEKLLLNRLKDIRLK 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNS-DSHENLDGMLAASI 180
E A+DIKNQ+ ++ D + K ++ + + D V S E LD + AS+
Sbjct: 121 EQAKDIKNQR-LKQDDSDRVKKRVSSDSVEDNLRVPVEEDDVGASFFQEEKLDEVSQASL 180
Query: 181 MAEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERME 240
+ E +G ++ + K D KGK +L D G D + +
Sbjct: 181 VGE---------------------TGVDDVVKESVKDDPKGKGVLLD-----GDDLDNLV 240
Query: 241 VASR---SAHQQNLDEMLAASIAAEEARSLNENASVSAAA---NLDGEDTDDEDEEMILP 300
S +Q+ LDEMLAAS+AAEE R+ AS SAAA D E+ D DEE++LP
Sbjct: 241 QDSSVQGKDYQEKLDEMLAASLAAEEERNFTSKASTSAAAIPSEEDEEEDSDGDEEILLP 300
Query: 301 EMHGVVDPSVLAALPPSVQLDLLVQ------------------DPAKFSELQIQAYLKTV 360
M G +DP+VLA+LPPS+QLDLL Q P KFSELQI+AYLKTV
Sbjct: 301 VMDGNIDPAVLASLPPSMQLDLLAQMREKLMAENRQKYQKVKKAPEKFSELQIEAYLKTV 360
Query: 361 AFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDL 420
AFRR+I++VQ++A GR VGGVQTSRIASEANREFIFSSSF GDK+ LAS+R +N +
Sbjct: 361 AFRREINEVQRSAGGRAVGGVQTSRIASEANREFIFSSSFAGDKEVLASAREGRNDENQK 420
Query: 421 QAPRVQQPLSSLNNTEIPSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMH 480
+ + P+S N + + S+A D+ ++NIE ++DERGR R+ R R MG+
Sbjct: 421 KTSQQSLPVSVKNASPL-KKSDATIELDRDEPKNPDENIEVYIDERGRFRI-RNRHMGIQ 480
Query: 481 MTRDLERNLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSIN 540
MTRD++RNL LMKE E+ AS + A N E E+F + Q L+ S V +
Sbjct: 481 MTRDIQRNLHLMKEKERTASGSMAKNDETFSAW-----ENFPTEDQFLEKSP--VEKDVV 540
Query: 541 KLNEKGAEFMLNEDTAIEILLE-DEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSL 600
L + + ML+ ++IEI + D GGK + +DD+F LAA P+ ++S + P++ +
Sbjct: 541 DLEIQNDDSMLHPPSSIEISFDHDGGGKDLNDEDDMFLQLAAGGPVTISSTENDPKEDTS 600
Query: 601 DSTTDSGWEEKLEGKAYSPKNVQV---DDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEA 660
+DS WEE + S ++ + H K+ +S V WE+ C + N E
Sbjct: 601 PWASDSDWEEVPVEQNTSVSKLEANLSNQHIPKD--ISIAEGVAWEEYSCKNANN-SVEN 660
Query: 661 ESGKSVSKGSMEEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQ-QVIVGKMAEQCM--- 720
++ ++KG +EEEADLQEAI++SL ++ D++SG V E+Q + ++V K +E +
Sbjct: 661 DTVTKITKGYLEEEADLQEAIKKSLLELHDKESGDVLEENQSVRVNLVVDKPSEDSLCSR 720
Query: 721 ----SVQNENVIGLEKLDSADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHT 780
+ E + + G +N S + ++ P + +H
Sbjct: 721 ETVGEAEEERFLDEITILKTSGAISEQSNTSVAGNADGQKGITKQFGTHPSSGSNNVSHA 780
Query: 781 IAEQLD--------------ASCND--ATFSHKESNENNDTLKALSRDASGAIQVEDRIN 840
++ +L AS N +T + + + E +++ S S ++ I
Sbjct: 781 VSNKLSKVKSVISPEKALNVASQNRMLSTMAKQHNEEGSESFGGESVKVSAMPIADEEIT 840
Query: 841 NTVIEAPCCMVEMEGIYTPNIDSSPK---AFTCENHFKQNLPVDKCNSDLLLEKDAKKPA 900
+ E E + D S + + E+ + V + +L + D++
Sbjct: 841 GFLDEKDNADGESSIMMDDKRDYSRRKIQSLVTESR-DPSRNVVRSRIGILHDTDSQNER 900
Query: 901 VEEISNAE-----------------IEITEDELTNRISVLEQERLNLGDEQKRLERNAES 960
EE ++ E +E +E + I VL+QE ++LGDEQ++LERNAES
Sbjct: 901 REENNSNEHTFNIDSSTDFEEKGVPVEFSEANIEEEIRVLDQEFVSLGDEQRKLERNAES 960
Query: 961 VSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYK 1020
VSSEMFAECQELLQ+FG+PYIIAPMEAEAQCA+ME +NLVDG+VTDDSDVFLFGARSVYK
Sbjct: 961 VSSEMFAECQELLQIFGIPYIIAPMEAEAQCAFMEQSNLVDGIVTDDSDVFLFGARSVYK 1020
Query: 1021 NIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPE 1080
NIFDDRKYVETYFMKD+E ELGL+RDK+IRMA+LLGSDYTEGISGIGIVNA+EV+ AFPE
Sbjct: 1021 NIFDDRKYVETYFMKDIEKELGLSRDKIIRMAMLLGSDYTEGISGIGIVNAIEVVTAFPE 1080
Query: 1081 EDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARDGSASEENID 1140
EDGL KF+EW+ESPDP+ILG AKTG +KRGS + +N S +S D
Sbjct: 1081 EDGLQKFREWVESPDPTILGKTDAKTGSKVKKRGSASVDNKGIISGASTDDTE------- 1140
Query: 1141 KDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWGKPDHFV 1200
++KQ FMD+HR VSKNWHIP FPSEAVISAY+ PQVD S E FSWGKPD V
Sbjct: 1141 -------EIKQIFMDQHRKVSKNWHIPLTFPSEAVISAYLNPQVDLSTEKFSWGKPDLSV 1200
Query: 1201 LRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKS 1260
LR+LCWEKF W+ K DELLLPVLKEY K ETQLR+EAFY+FNERFAKIRSKRI KAVK
Sbjct: 1201 LRKLCWEKFNWNGKKTDELLLPVLKEYEKRETQLRIEAFYSFNERFAKIRSKRINKAVKG 1260
Query: 1261 ITGSKSAVLMDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGACSNEDDIENRLQKPSR 1320
I G S+ + D T+++ + + V P E S K + I N K R
Sbjct: 1261 IGGGLSSDVADHTLQE-GPRKRNKKKVAPHETEDNNTSDK-------DSPIANEKVKNKR 1320
Query: 1321 KRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRSERGRGRGEGRGRGRLASKGKTPITELV 1380
KR +P+ S RGRGR + RGRGR + + + EL
Sbjct: 1321 KR-----LEKPS--------------------SSRGRGRAQKRGRGR--GRVQKDLLELS 1380
Query: 1381 ETSSSDDGSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDGDQPSDHSEERFSNDEA 1440
+ SS DD + DD+ +LE +P ++STR R V Y+ + D+ +E SN+ +
Sbjct: 1381 DGSSDDD--DDDDKVVELE--AKPANLQKSTRSRNPVMYSAKEDDE----LDESRSNEGS 1440
Query: 1441 KDDNVARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGGGFCLAE-DEMSQEAMCM 1500
+N E + N + + P DY++TGGGFC E DE+
Sbjct: 1441 PSEN-------FEEVDEGRIGNDDSVDASINDCPSEDYIQTGGGFCADEADEIG------ 1478
Query: 1501 NKDLPLEANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEVPKDGSGDDPGQSTFHP 1556
D LE ++DY +GGGFC+D+D E + D A IL++ +
Sbjct: 1501 --DAHLEDKATDDYRVIGGGFCVDED-ETAEENTMDDDAEILKMESE------------E 1478
BLAST of Spg001975 vs. ExPASy Swiss-Prot
Match:
P14629 (DNA excision repair protein ERCC-5 homolog OS=Xenopus laevis OX=8355 GN=ercc5 PE=2 SV=1)
HSP 1 Score: 267.7 bits (683), Expect = 8.2e-70
Identity = 343/1357 (25.28%), Postives = 581/1357 (42.82%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGV GLW+LL GR ++ TL GK LA+D SIW+ Q +K RD +G ++NAH+L F
Sbjct: 1 MGVQGLWKLLECSGRPINPGTLEGKILAVDISIWLNQAVKGARDRQGNAIQNAHLLTLFH 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
R+CKLLF R +P+FVFDG P LKR+TL RR++ + A RKT EKLL LK ++
Sbjct: 61 RLCKLLFFRIRPIFVFDGEAPLLKRQTLAKRRQRTDKASNDARKTNEKLLRTFLKRQAIK 120
Query: 121 ELAEDIK--NQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAAS 180
K N++ VP+K+T E +P + +EN
Sbjct: 121 AALSGNKQSNEELPSFSQVPRKET--------------EDLYILPPLEDNEN-------- 180
Query: 181 IMAEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERM 240
SS +E+ + +NQK
Sbjct: 181 -----------------NSSEEEEEREWEERMNQK------------------------- 240
Query: 241 EVASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTDDEDEEMILPEM-HG 300
Q L E AN D + E+ + + PE+ H
Sbjct: 241 ---------QRLQE--------------------DFFANPSSVDIESEEFKSLPPEVKHE 300
Query: 301 VVDPSVLAALPPSVQLDLLVQDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQ 360
++ + + +D + FS+ Q++ LK + ID V+K + + G VQ
Sbjct: 301 ILTDMKDFTKRRRTLFEAMPEDSSDFSQYQLKGLLKKNDLNKCIDNVRKELNQQYSGEVQ 360
Query: 361 --------------TSRIASEANREFIFSSSF-TGDKQALASSRAEKNGDKDLQAPR--V 420
T R+ SE + +I + ++ S + Q P+ +
Sbjct: 361 AQFESEGGFLKEVETRRLVSEDDSHYILIKGIQSKQEEKKVDSPPQSITFNSSQTPKTYL 420
Query: 421 QQPLSSLNNTEIPSTSNALARSTPDKS--GVLEDNIETFLDER-GRVRVSRVRAMGMHMT 480
L+S + T+ TS+A A ++ + E E++ E+ + VS A G
Sbjct: 421 DLKLASAHKTKPLQTSSAEAAPPSPRTLFAIQEAMAESWDHEKHEKPSVSGCEAEGNVSP 480
Query: 481 RDLERNLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSINKL 540
R L+ ++ E E S NK +Q E P++ + V+ +SDE D +N
Sbjct: 481 RTLQAIYQVLAEDEAGES-NKI--KVVLQSDEERKPKT---KVLVISSSDE-EDDCLN-- 540
Query: 541 NEKGAEFMLNEDTAIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDST 600
+ G + L I + + D + P+ ++ L D+
Sbjct: 541 YQDGTKTTLGASLIKSI--SPSSMQCQESTADSLPNYTRSKPVSQIEEPMADHNLQGDNC 600
Query: 601 TDSGWEEKLEGKAYSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAESGKSV 660
++KL P VD T+ SE + +P ++V
Sbjct: 601 NVPNAKDKL---IVPPSLGNVDKPIILSNTIPVNSEF--------RIPLLPVNMSMRETV 660
Query: 661 ----SKGSMEEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQQVIVGKMAEQCMS-VQNE 720
+ GS+ + + + D G + +P ++++ S Q+
Sbjct: 661 IIPNNTGSLGSSRYISLERDATKQGFSDNPVGDLVRSPDEPALNASSALSDRKTSATQSL 720
Query: 721 NVIGLEKLDSADGMNCSNANDSTRT---KGMTESSSQEKQCPEPVVLLDTKTHT-----I 780
+E + + CSN D T+T G +E + + P+ + + + +
Sbjct: 721 LCNNIECTEQSMVQGCSNTLDVTQTTQPSGGSEVNKPAEYNPQDKKVFGSNDSSAMYVPM 780
Query: 781 AEQLDASCNDATFSHKESNENNDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIY 840
+ ++ + KE ++++D+ I+V+ + + + +V E
Sbjct: 781 TPESIIVSDEEFVNEKEDSDSDDSF----------IEVDSEFSTSNSQH---VVFKEPGD 840
Query: 841 TPNIDSSPKAFTCENHFKQNLPVDKCNSDLLLEKDAKKPA-VEEISNAEIEITEDELTNR 900
T ++ +A N Q++P++ + + + ++++ ++++SN +I+ +EL +
Sbjct: 841 TRETATNFQAVEEGNSGSQDIPLEHDSGEPHEQSNSEESKDLDDVSNEWQDISVEELESL 900
Query: 901 ISVLEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMEL 960
+ L ++ +L +Q++ ER A +V+ +M E QELLQ+FG+PYI+APMEAEAQCA ++L
Sbjct: 901 ENNLYVQQTSLQAQQQQQERIAATVTGQMCLESQELLQLFGIPYIVAPMEAEAQCAILDL 960
Query: 961 ANLVDGVVTDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLG 1020
+ G +TDDSD++LFGAR VYKN F K+VE Y D+ N+LGL+R KLI +A LLG
Sbjct: 961 TDQTSGTITDDSDIWLFGARHVYKNFFSQNKHVEYYQYADIHNQLGLDRSKLINLAYLLG 1020
Query: 1021 SDYTEGISGIGIVNAVEVMNAFPEE--DGLHKFKEWIESPDPSILGTLGAKTGLSARKRG 1080
SDYTEGI +G V+A+E++N FP + + L KFKEW S ++
Sbjct: 1021 SDYTEGIPTVGYVSAMEILNEFPGQGLEPLVKFKEW-----------------WSEAQKD 1080
Query: 1081 SKASENDMTCSNSSARDGSASEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEA 1140
K N ++ + K L+ +D++QS FP+ A
Sbjct: 1081 KKMRPN-------------PNDTKVKKKLR-LLDLQQS-----------------FPNPA 1140
Query: 1141 VISAYICPQVDKSAEPFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQL 1200
V SAY+ P VD+S FSWG+PD +R C +FGW K DE+LLPVLK+ +TQL
Sbjct: 1141 VASAYLKPVVDESKSAFSWGRPDLEQIREFCESRFGWYRLKTDEVLLPVLKQLNAQQTQL 1168
Query: 1201 RLEAFYTFNE-RFAKIRSKRIKKAVKSITGSKSAVLMDDTVRDVSVNYQRELSVEPDENM 1260
R+++F+ + A ++S+R+++AV + + V ++ V+V M
Sbjct: 1201 RIDSFFRLEQHEAAGLKSQRLRRAVTCMKRKERDVEAEEVEAAVAV-------------M 1168
Query: 1261 SEKCSSKIQGACSNEDDIENRLQKPSRKRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRS 1318
+C+++ +G +N + +KP+ Q D + + S +G S
Sbjct: 1261 ERECTNQRKGQKTNTKSQGTKRRKPTECSQEDQDPGGGFIGIELKTLSSKAYSSDGSSSD 1168
BLAST of Spg001975 vs. ExPASy Swiss-Prot
Match:
P35689 (DNA excision repair protein ERCC-5 OS=Mus musculus OX=10090 GN=Ercc5 PE=1 SV=4)
HSP 1 Score: 260.0 bits (663), Expect = 1.7e-67
Identity = 308/1207 (25.52%), Postives = 508/1207 (42.09%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGV GLW+LL G RVS E L GK LA+D SIW+ Q +K +RD G ++ NAH+L F
Sbjct: 1 MGVQGLWKLLECSGHRVSPEALEGKVLAVDISIWLNQALKGVRDSHGNVIENAHLLTLFH 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
R+CKLLF R +P+FVFDG P LK++TL RR+++++A RKT EKLL LK L+
Sbjct: 61 RLCKLLFFRIRPIFVFDGDAPLLKKQTLAKRRQRKDSASIDSRKTTEKLLKTFLKRQALK 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
+ R + P + ++I E K + + +
Sbjct: 121 TAF------RSSRHEAPPSLTQVQRQDDIYVLPPLPEEEKHSSEEEDEKQWQARMDQKQA 180
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
+E F A + + + K +K ++ L + + +D + ++
Sbjct: 181 LQEEFFHNPQAIDIESEDFSSLPPEVKHEILTDMKEFTKRRRTLFEAMPEESNDFSQYQL 240
Query: 241 AS------RSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTDDEDEEMILPE 300
+ H +N+ + + + + R + ++D +++
Sbjct: 241 KGLLKKNYLNQHIENVQKEMNQQHSGQIQRQYQDEGGFLKEVESRRVVSEDTSHYILIKG 300
Query: 301 MHG--VVDPSVLAALPPSVQLDLLVQDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRG 360
+ G V+D +LP S + + S L+ + K + + + S R
Sbjct: 301 IQGKKVMDVD-SESLPSSSNVH------SVSSNLKSSPHEKV---KPEREPEAAPPSPRT 360
Query: 361 VGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDLQAPRVQQPLSSLNNTEI 420
+ +Q + + S + E S K+ + + A+ +Q+ L N+ ++
Sbjct: 361 LLAIQAAMLGSSSEDE-PESREGRQSKERNSGATADAGSISPRTCAAIQKALDDDNDEKV 420
Query: 421 PSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHM-TRDLERNLDLMKE-I 480
+S+ LA SG+ ++ ERG G+ T L ++ +KE +
Sbjct: 421 SGSSDDLAEKMLLGSGLEQEEHADETAERGG---------GVPFDTAPLTPSVTEVKECV 480
Query: 481 EKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSINKLNEKGAEFMLNEDT 540
+SAN + H SF+ S DT E V S+ + ++ ++ ++ +
Sbjct: 481 TSGSSANGQTDSAH----------SFTTASHRCDTPKETV--SLARAVKEASQ--ISSEC 540
Query: 541 AIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDSGWEEKLEGKA 600
+E G + P S +S ++ +L T + +G
Sbjct: 541 EVE------------GRPAALSPAFIGTPSSHVSGVLSEREPTLAPPTTRTHSD--QGID 600
Query: 601 YSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAESGKSVSKGSMEEEA--DL 660
P++ ++ + + T + S + +D NP P K+ S + EA +L
Sbjct: 601 IHPEDPELQNGLYPLETKCNSSRLSSDDETEGGQNPAP------KACSTVHVPAEAMSNL 660
Query: 661 QEAIRRSLEDIGDRKSGPVSSEHQQPQQVIVGKMAEQCMSVQNENVIGLEKLDSADGMNC 720
+ A+ + E+ GD + I L ++ A
Sbjct: 661 ENALPSNAEERGD-----------------------------FQETIQLREVPEAAAREL 720
Query: 721 SNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATFSHKESNENNDT 780
+A + G E S+E + + + + Q ++S S K++ E +T
Sbjct: 721 ISA---PKPMGPMEMESEESESDGSFIEVQSVVSNSELQTESSEASTHLSEKDAEEPRET 780
Query: 781 L-KALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTCENHFKQNLPV 840
L + SRD +Q +++ IEA MEG +ID+
Sbjct: 781 LEEGTSRDTECLLQ-----DSSDIEA------MEGHREADIDA----------------- 840
Query: 841 DKCNSDLLLEKDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDEQKRLERNAES 900
E++ N +I +EL S L E+ +L ++++ +R A S
Sbjct: 841 ------------------EDMPNEWQDINLEELDALESNLLAEQNSLKAQKQQQDRIAAS 900
Query: 901 VSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYK 960
V+ +MF E QELL++FG+PYI APMEAEAQCA ++L + G +TDDSD++LFGAR VYK
Sbjct: 901 VTGQMFLESQELLRLFGVPYIQAPMEAEAQCAMLDLTDQTSGTITDDSDIWLFGARHVYK 960
Query: 961 NIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPE 1020
N F+ K+VE Y D ++LGL+R+KLI +A LLGSDYTEGI +G V A+E++N FP
Sbjct: 961 NFFNKNKFVEYYQYVDFYSQLGLDRNKLINLAYLLGSDYTEGIPTVGCVTAMEILNEFPG 1020
Query: 1021 E--DGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARDGSASEEN 1080
D L KF EW + K +EN
Sbjct: 1021 RGLDPLLKFSEWWHE-----------------AQNNKKVAENP----------------- 1021
Query: 1081 IDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWGKPDH 1140
D +K+ + + + FP+ AV AY+ P VD S F WGKPD
Sbjct: 1081 YDTKVKKKL--------------RKLQLTPGFPNPAVADAYLRPVVDDSRGSFLWGKPDV 1021
Query: 1141 FVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAK----IRSKRI 1189
+ C FGW+ K DE L PVLK H+TQLR+++F+ ++ + I+S R+
Sbjct: 1141 DKISTFCQRYFGWNRMKTDESLYPVLKHLNAHQTQLRIDSFFRLAQQEKQDAKLIKSHRL 1021
BLAST of Spg001975 vs. ExPASy Swiss-Prot
Match:
P28715 (DNA excision repair protein ERCC-5 OS=Homo sapiens OX=9606 GN=ERCC5 PE=1 SV=3)
HSP 1 Score: 249.6 bits (636), Expect = 2.3e-64
Identity = 343/1358 (25.26%), Postives = 565/1358 (41.61%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGV GLW+LL GR+VS E L GK LA+D SIW+ Q +K +RD G + N H+L F
Sbjct: 1 MGVQGLWKLLECSGRQVSPEALEGKILAVDISIWLNQALKGVRDRHGNSIENPHLLTLFH 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
R+CKLLF R +P+FVFDG P LK++TL+ RR++++ A + RKT EKLL LK
Sbjct: 61 RLCKLLFFRIRPIFVFDGDAPLLKKQTLVKRRQRKDLASSDSRKTTEKLLKTFLK----- 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
+ + S+R +++P+ + + +
Sbjct: 121 --------------------------RQAIKTAFRSKRDEALPSLTQVRRENDLYVLPPL 180
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
EE +K+ S+ + +K E
Sbjct: 181 QEE----------------------------EKHSSEEEDEK----------------EW 240
Query: 241 ASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTDDEDEEMILPEM-HGVV 300
R +Q L E N D + ED + PE+ H ++
Sbjct: 241 QERMNQKQALQE--------------------EFFHNPQAIDIESEDFSSLPPEVKHEIL 300
Query: 301 DPSVLAALPPSVQLDLLVQDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGGVQTS 360
+ + ++ FS+ Q++ LK + I+ VQK + + G ++
Sbjct: 301 TDMKEFTKRRRTLFEAMPEESDDFSQYQLKGLLKKNYLNQHIEHVQKEMNQQHSGHIR-- 360
Query: 361 RIASEANREFIFSSSFTGD---KQALASSRAEKNGDKDLQAPRVQQPLSSLNNTEIPSTS 420
R++ F + ++ ++ + K +QA V + +++ +PS+S
Sbjct: 361 -------RQYEDEGGFLKEVESRRVVSEDTSHYILIKGIQAKTV----AEVDSESLPSSS 420
Query: 421 NALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNASA 480
S KS E ++T ++ R + L + + E E A
Sbjct: 421 KMHGMSFDVKSSPCE-KLKT--EKEPDATPPSPRTLLAMQAALLGSSSEEELESENRRQA 480
Query: 481 NKAANPEHMQHIEICNPESFSFQSQVLDTSDEGV----GDSINKLNEKGAEFMLNEDTAI 540
P + I +P + S + LD DE V GD + E +N T
Sbjct: 481 RGRNAPAAVDEGSI-SPRTLSAIKRALD-DDEDVKVCAGDDVQTGGPGAEEMRINSSTE- 540
Query: 541 EILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDSGWEEKLEGKAYS 600
DEG K DG FT A + + A ++ + TDS +E++
Sbjct: 541 ---NSDEGLKVRDGKGIPFTATLASSSVNSAEEHVASTNEGRE-PTDSVPKEQMSLVHVG 600
Query: 601 PKNVQVDDHSF----KEGTVSDESEVDWED--GVCDHVNPVPFEAESGKSVSKGSMEEEA 660
+ + D S K+ + + V D G+ + P SVSK E
Sbjct: 601 TEAFPISDESMIKDRKDRLPLESAVVRHSDAPGLPNGRELTPASPTCTNSVSKNETHAEV 660
Query: 661 DLQEAIRRSLEDIGDRKSGPVSSEHQ---QPQQV--IVGKMAEQ----CMSVQNENVIGL 720
Q+ E D S +SS+ + +P ++G ++ Q +SV +E V +
Sbjct: 661 LEQQNELCPYESKFD--SSLLSSDDETKCKPNSASEVIGPVSLQETSSIVSVPSEAVDNV 720
Query: 721 EKLDSADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATF 780
E + S + N ++ + + TES+ Q+ ++ + + + S +D +F
Sbjct: 721 ENVVSFNAKEHENFLETIQEQQTTESAGQD------LISIPKAVEPMEIDSEESESDGSF 780
Query: 781 SHKESNENNDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTC 840
+S +++ L+A + S + P E E + T ++ ++
Sbjct: 781 IEVQSVISDEELQAEFPETS--------------KPPSEQGEEELVGTREGEAPAES--- 840
Query: 841 ENHFKQNLPVDKCNSDLLLEKDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDE 900
E+ + N D + + ++A+K A + + + +I +EL S L ++ +L +
Sbjct: 841 ESLLRDNSERDDVDGE---PQEAEKDAEDSLHEWQ-DINLEELETLESNLLAQQNSLKAQ 900
Query: 901 QKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDV 960
+++ ER A +V+ +MF E QELL++FG+PYI APMEAEAQCA ++L + G +TDDSD+
Sbjct: 901 KQQQERIAATVTGQMFLESQELLRLFGIPYIQAPMEAEAQCAILDLTDQTSGTITDDSDI 960
Query: 961 FLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVN 1020
+LFGAR VY+N F+ K+VE Y D N+LGL+R+KLI +A LLGSDYTEGI +G V
Sbjct: 961 WLFGARHVYRNFFNKNKFVEYYQYVDFHNQLGLDRNKLINLAYLLGSDYTEGIPTVGCVT 1020
Query: 1021 AVEVMNAFPEE--DGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSS 1080
A+E++N FP + L KF EW +A +N N
Sbjct: 1021 AMEILNEFPGHGLEPLLKFSEW-----------------------WHEAQKNPKIRPNPH 1080
Query: 1081 ARDGSASEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSA 1140
D +K+ + Q + FP+ AV AY+ P VD S
Sbjct: 1081 -----------DTKVKKKLRTLQ--------------LTPGFPNPAVAEAYLKPVVDDSK 1140
Query: 1141 EPFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTF----NE 1200
F WGKPD +R C FGW+ +K DE L PVLK+ +TQLR+++F+ E
Sbjct: 1141 GSFLWGKPDLDKIREFCQRYFGWNRTKTDESLFPVLKQLDAQQTQLRIDSFFRLAQQEKE 1160
Query: 1201 RFAKIRSKRIKKAVK-----------------SITGSKSAVLMD------------DTVR 1260
+I+S+R+ +AV S+ K L+D +T+
Sbjct: 1201 DAKRIKSQRLNRAVTCMLRKEKEAAASEIEAVSVAMEKEFELLDKAKGKTQKRGITNTLE 1160
Query: 1261 DVSVNYQRELSVEPDEN-----MSEKCSSKIQGACSNEDDIENRLQKPSRKRQLDGERSQ 1296
+ S ++ LS +N + E C S+ S+ED + L R R E
Sbjct: 1261 ESSSLKRKRLSDSKGKNTCGGFLGETCLSESSDGSSSEDAESSSLMNVQR-RTAAKEPKT 1160
BLAST of Spg001975 vs. ExPASy Swiss-Prot
Match:
P28706 (DNA repair protein rad13 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=rad13 PE=2 SV=2)
HSP 1 Score: 248.8 bits (634), Expect = 4.0e-64
Identity = 322/1212 (26.57%), Postives = 518/1212 (42.74%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGV GLW++L PV R V +ETL K+LAIDASIW+ QF+KA+RD G ++++H++GFFR
Sbjct: 1 MGVSGLWDILEPVKRPVKLETLVNKRLAIDASIWIYQFLKAVRDKEGNQLKSSHVVGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK--AMR 120
RICKLLF KPVFVFDG P+LKR+T+ R+ +R + + TA KLL ++ AM
Sbjct: 61 RICKLLFFGIKPVFVFDGGAPSLKRQTIQKRQARRLDREENATVTANKLLALQMRHQAML 120
Query: 121 LRE---LAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGML 180
L E A + N Q ++ +P T L+++EI + K+ D ++
Sbjct: 121 LEENNKKATALANASVQNERQMPSSMT-LDNSEI---KPVLNQRKNYLKPDPYQ------ 180
Query: 181 AASIMAEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDS 240
+ E + F SS+ +++++ L Q S +K + I + + DS
Sbjct: 181 ----LPEMDVSFDKLGSSYDPRIMSQDE------LTQYVSSFTKIEDINLFDFSNIDFDS 240
Query: 241 ERMEVASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTDDEDEEMILPEM 300
E Q+L + + S+ +AA L
Sbjct: 241 ELF---------QSLPD--------------TDKYSILSAARL----------------- 300
Query: 301 HGVVDPSVLAALPPSVQLDLLVQDPAKFSELQIQAYLKTVAFRRDIDQVQKAASGRGVGG 360
S L S QL + + FS QI+ + D Q+ G+
Sbjct: 301 -----RSRLRMGLSSEQLSEMFPNRMDFSRFQIE------RLKERNDLTQRLMDFTGMNE 360
Query: 361 VQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDLQAPRVQQPLSSLNNTEIPST 420
SR+ SE NRE+I + G + A V S NN I
Sbjct: 361 FGPSRVVSEKNREYIL---------------VKNEGAEGGWALGVIS--GSTNNEPIIID 420
Query: 421 SNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHMTRDLERNLDLMKEIEKNAS 480
A S+ + + ED E F D R H E +KEI++N
Sbjct: 421 DEATKLSS---NLIDEDEDEAFYDVPLPSR--------SHSMNPRELVAAKLKEIKEN-- 480
Query: 481 ANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSI-------NKLNEKGAEFMLNE 540
SFS ++Q D +D V D + L E L
Sbjct: 481 -------------------SFS-ENQQSDEADYNVTDDLILQLATQQSLEENKKSKELFS 540
Query: 541 DTAIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDSGWEEKLEG 600
+A E + K+F + L T + AE+ M S + L L+ D +
Sbjct: 541 LSASEFDKLNSEKKTF---EILSTDIPAED--SMNSLLNDEENLKLEHVGDVSNDSLAFA 600
Query: 601 KAYSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFE-AESGKSVSKGSMEEEAD 660
+ P+N F + S E D ++P+PF+ + GKS+
Sbjct: 601 EKKHPEN---GTSIFMDALPSASREKKTN----DLIDPLPFQPMDWGKSI---------- 660
Query: 661 LQEAIRRSLEDIGDRKSGPVSSEHQQPQQVIVGKMAEQCMSVQNENVIGLEK--LDSADG 720
E +++ E D K+ S + V + +V E+ + K +++
Sbjct: 661 FFEKLKKPTETFMDSKTDIPSEAPDNSKLVEDTNLHTINATVNIESDLDAAKPGIENPII 720
Query: 721 MNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATFSHKESNEN 780
D + + E + E P + T+ L+ S + A + S+EN
Sbjct: 721 SPLLPVKDDEKDLDLRELNPLE---PFENMKEQADDGTVTNPLNVSSDKAMSVYLLSSEN 780
Query: 781 NDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTCENHFKQNL 840
+ ++ A+ ++ + P + + +PN ++ + E + L
Sbjct: 781 AKDTGDIKSESIDAVLPTLETSSPSLSIP---TDFQKEASPNKGAAALSSKVEPEVVEKL 840
Query: 841 PVDKCNSDLLLEKDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDEQKRLERNA 900
+D+ ++++ ++ + + + E E N+ E+ L ++++ +R+A
Sbjct: 841 -LDEEEEEMIIRMAEEEKEYDRFVSELNQRHETEEWNQ-EAFEKRLKELKNQKRSEKRDA 900
Query: 901 ESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSV 960
+ V+ M ECQELL++FGLPYI+AP EAEAQC+ + LVDG+VTDDSDVFLFG V
Sbjct: 901 DEVTQVMIKECQELLRLFGLPYIVAPQEAEAQCSKLLELKLVDGIVTDDSDVFLFGGTRV 960
Query: 961 YKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAVEVMNAF 1020
Y+N+F+ K+VE Y M D++ E +N+ LI++A LLGSDYT G+S +G V A+E+++ F
Sbjct: 961 YRNMFNQNKFVELYLMDDMKREFNVNQMDLIKLAHLLGSDYTMGLSRVGPVLALEILHEF 1016
Query: 1021 PEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARDGSASEEN 1080
P + GL +FK+W + L T G AS+ +
Sbjct: 1021 PGDTGLFEFKKWFQR-----LST------------------------------GHASKND 1016
Query: 1081 IDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWGKPDH 1140
++ +K+ I+ + +PSEFP+ V AY+ P VD S + F WG PD
Sbjct: 1081 VNTPVKKRIN----------KLVGKIILPSEFPNPLVDEAYLHPAVDDSKQSFQWGIPDL 1016
Query: 1141 FVLRRLCWEKFGWDNSKADELLLPVLKEYGKHE---TQLRLEAFY------TFNERFA-K 1188
LR+ GW + +E+LLPV+++ K + TQ L F+ + R A
Sbjct: 1141 DELRQFLMATVGWSKQRTNEVLLPVIQDMHKKQFVGTQSNLTQFFEGGNTNVYAPRVAYH 1016
BLAST of Spg001975 vs. ExPASy TrEMBL
Match:
A0A6J1ERW5 (DNA repair protein UVH3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435307 PE=3 SV=1)
HSP 1 Score: 2427.5 bits (6290), Expect = 0.0e+00
Identity = 1310/1581 (82.86%), Postives = 1396/1581 (88.30%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRD+RGEMVRNAH+LGFFR
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDERGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATP+LKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRLR
Sbjct: 61 RICKLLFLRTKPVFVFDGATPSLKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKEMRLR 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
ELAE IKNQKQQRKQDVPKKKTLLNHNEIVDGTS SERSKSVPNS +HENLDGM+AASIM
Sbjct: 121 ELAEGIKNQKQQRKQDVPKKKTLLNHNEIVDGTSVSERSKSVPNSGNHENLDGMVAASIM 180
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
EENGFF SSA SF+G +L K+D GEQSILNQKYK+DSKGKKILSDE+HV+GSDSERMEV
Sbjct: 181 IEENGFFSSSAPSFSGVTLPKKDRGEQSILNQKYKNDSKGKKILSDEIHVVGSDSERMEV 240
Query: 241 ASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTD--DEDEEMILPEMHGV 300
ASRSAHQQNLDEMLAASIAAEEAR LNEN SVS+AANL GED D DEDEEMILPEMHGV
Sbjct: 241 ASRSAHQQNLDEMLAASIAAEEARGLNENISVSSAANLAGEDMDDEDEDEEMILPEMHGV 300
Query: 301 VDPSVLAALPPSVQLDLLVQ------------------DPAKFSELQIQAYLKTVAFRRD 360
VDPSVLAALPPSVQLDLLVQ DPAKFSELQIQAYLKTVAFRRD
Sbjct: 301 VDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRD 360
Query: 361 IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDLQAPRV 420
IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQ L S+RAEKNGDK+LQ PRV
Sbjct: 361 IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQVLTSTRAEKNGDKNLQEPRV 420
Query: 421 QQPLSSLNNTEIPSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHMTRDL 480
QQ LSSLNNT+IPSTSN LA+STPDKSGV EDNIETFLDERGRVRVSRVRAMGMHMTRDL
Sbjct: 421 QQSLSSLNNTDIPSTSNGLAQSTPDKSGVFEDNIETFLDERGRVRVSRVRAMGMHMTRDL 480
Query: 481 ERNLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSINKLNEK 540
ERNLDLMKEIEKN +ANK ANPE MQ+IEICNPES S +SQVLD S+EG+ +SINKL+E+
Sbjct: 481 ERNLDLMKEIEKNINANKVANPEPMQNIEICNPESSSLRSQVLDVSNEGIDESINKLDER 540
Query: 541 GAEFMLNEDTAIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDS 600
GA+ MLNEDTAIEILLE EGGKSFDGDDDLFTHLAAENPIQMASFDIS QKLS D TTDS
Sbjct: 541 GADSMLNEDTAIEILLEGEGGKSFDGDDDLFTHLAAENPIQMASFDISSQKLSQDGTTDS 600
Query: 601 GWEEKLEGKAYSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAESGKSVSKG 660
GW+E L EGT+SDESEVDWEDGVCDHVNPVPFE ESGKSVSKG
Sbjct: 601 GWKEAL------------------EGTISDESEVDWEDGVCDHVNPVPFEDESGKSVSKG 660
Query: 661 SMEEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQQ--VIVGKMAEQCMSVQNENVIGLE 720
S+EEEADLQEAIRRSLED+GD KSGPVS EH+QPQ IVGKMAEQC SV+NENVIGLE
Sbjct: 661 SLEEEADLQEAIRRSLEDVGDGKSGPVSLEHEQPQSQPSIVGKMAEQCTSVENENVIGLE 720
Query: 721 KLDSADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATFS 780
K+DS DGMN SNA DS KGMTESSSQEKQC EPVVLLDT TIAEQLDAS D +FS
Sbjct: 721 KMDSVDGMNWSNAKDSILKKGMTESSSQEKQCSEPVVLLDT---TIAEQLDASYKDTSFS 780
Query: 781 HKESNENNDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTCE 840
+ESNE++DTLK+LSRDA A QV D IN+T+IE C MVEM+G+ TP++DSS K E
Sbjct: 781 LQESNESSDTLKSLSRDAPRATQVGDMINSTMIEPACRMVEMDGVNTPDVDSSTKDSAFE 840
Query: 841 NHFKQNLPVDKCNSDLLLEKDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDEQ 900
NHFKQNLPV+K +SDLLLE++ K EIS AE E+TEDEL +RIS+LEQERLNLGDEQ
Sbjct: 841 NHFKQNLPVEKHSSDLLLEEEVGKGHTVEISKAETEVTEDELKSRISILEQERLNLGDEQ 900
Query: 901 KRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF 960
KRLERNAE+VSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF
Sbjct: 901 KRLERNAEAVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF 960
Query: 961 LFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNA 1020
LFGARSVYKNIFDDRKYVETYFMKDVENELGL+R+KLIRMALLLGSDYTEGISGIGIVNA
Sbjct: 961 LFGARSVYKNIFDDRKYVETYFMKDVENELGLDRNKLIRMALLLGSDYTEGISGIGIVNA 1020
Query: 1021 VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARD 1080
VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRG KASEND CSNSS RD
Sbjct: 1021 VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGQKASENDAPCSNSSVRD 1080
Query: 1081 GSASEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPF 1140
GSASEENIDKDLKENIDVKQ+FM KHRNVSKNWHIPSEFPSEAVISAYI PQVDKSAEPF
Sbjct: 1081 GSASEENIDKDLKENIDVKQNFMVKHRNVSKNWHIPSEFPSEAVISAYISPQVDKSAEPF 1140
Query: 1141 SWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS 1200
SWGKPDHFVLRRLC EKFGW+NSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS
Sbjct: 1141 SWGKPDHFVLRRLCLEKFGWENSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS 1200
Query: 1201 KRIKKAVKSITGSKSAVLMDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGACSNEDDI 1260
KRIKKAVKSITGSKSA LMD+TV +VSVN Q LS E +NMSEKCSS+IQGACSNED++
Sbjct: 1201 KRIKKAVKSITGSKSASLMDETVPNVSVNNQINLSGETQKNMSEKCSSEIQGACSNEDNV 1260
Query: 1261 ENRLQKPSRKRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRSE--RGRGRGEGRGRGRLA 1320
+NRL+KPSRKRQLD E+SQPAKD+K TMK KGK+SRN GS SE RGRGRG+GRGRGRLA
Sbjct: 1261 DNRLRKPSRKRQLDREQSQPAKDRKLTMKEKGKRSRNEGSHSERGRGRGRGKGRGRGRLA 1320
Query: 1321 SKGKTPITELVETSSSDDGSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDG--DQP 1380
KGK+PITE V TSSSDD SEFDDQK DLEN+QEPQERR+S+R+RKS SY MDD DQP
Sbjct: 1321 LKGKSPITEFVGTSSSDDESEFDDQKIDLENVQEPQERRKSSRVRKSASYKMDDADQDQP 1380
Query: 1381 SDHSEERFSNDEAKDDNVARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGGGFCL 1440
SDHS R SNDEA DDNV + Y PETVM +NTECD+ PK+SP RDYL TGGGFC
Sbjct: 1381 SDHSGYRLSNDEANDDNVVQGGYTGPETVMIHSENTECDYEIPKRSPLRDYLGTGGGFCP 1440
Query: 1441 AEDEMSQEAMCMNKDLPLEANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEVPKDG 1500
EDEMS+EAMC NKD LEA+NSEDYLT+GGGFCLDDDNECVD VAH DQAT EVPKDG
Sbjct: 1441 TEDEMSREAMCQNKDPALEASNSEDYLTLGGGFCLDDDNECVDPVAHLDQATASEVPKDG 1500
Query: 1501 SGDDPGQSTFHPEKCIGGDQNKEDTDARGDSLLDMGDPNPVSNPNSSQLGEGVQEEPKDH 1556
S DDP QSTFHPEK IGG+Q EDT G+SLLD+GDPNP S PNSS++GEGVQEEPKDH
Sbjct: 1501 SEDDPDQSTFHPEKDIGGNQLNEDTYPHGESLLDVGDPNPASFPNSSRVGEGVQEEPKDH 1560
BLAST of Spg001975 vs. ExPASy TrEMBL
Match:
A0A6J1JJE1 (DNA repair protein UVH3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111486346 PE=3 SV=1)
HSP 1 Score: 2416.7 bits (6262), Expect = 0.0e+00
Identity = 1307/1584 (82.51%), Postives = 1395/1584 (88.07%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAH+LGFFR
Sbjct: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATP+LKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRLR
Sbjct: 61 RICKLLFLRTKPVFVFDGATPSLKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKEMRLR 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
ELAE IKNQKQQRKQDVPKKKTLLNHN IVDGTS SERSKSVPNS +HENLDGM+AASIM
Sbjct: 121 ELAEGIKNQKQQRKQDVPKKKTLLNHNAIVDGTSISERSKSVPNSGNHENLDGMVAASIM 180
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
EENGFF SSA SF G +L KED GEQS NQKYK+DSKGKKILSDE+HV+GSDSERMEV
Sbjct: 181 IEENGFFSSSAPSFVGVTLPKEDRGEQSTWNQKYKNDSKGKKILSDEIHVVGSDSERMEV 240
Query: 241 ASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTD--DEDEEMILPEMHGV 300
ASRSAHQQNLDEMLAASIAAEEAR LNEN SVS+A+ L GED D DEDEEMILPEMHGV
Sbjct: 241 ASRSAHQQNLDEMLAASIAAEEARGLNENVSVSSASYLAGEDMDDEDEDEEMILPEMHGV 300
Query: 301 VDPSVLAALPPSVQLDLLVQ------------------DPAKFSELQIQAYLKTVAFRRD 360
VDPSVLAALPPSVQLDLLVQ DPAKFSELQIQAYLKTVAFRRD
Sbjct: 301 VDPSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRRD 360
Query: 361 IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDLQAPRV 420
IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQ L S+RAEKNGDKDLQAPRV
Sbjct: 361 IDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQVLTSTRAEKNGDKDLQAPRV 420
Query: 421 QQPLSSLNNTEIPSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHMTRDL 480
QQ LSSLNNT+IPSTSN LA+STPDKSGV EDNIETFLDERG VRVSRVRAMGMHMTRDL
Sbjct: 421 QQSLSSLNNTDIPSTSNGLAQSTPDKSGVFEDNIETFLDERGCVRVSRVRAMGMHMTRDL 480
Query: 481 ERNLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSINKLNEK 540
ERNLDLMKEIEKN +ANKAANPE MQ+IEICNP+S S +SQVLD S+EGV +SINKL+E+
Sbjct: 481 ERNLDLMKEIEKNINANKAANPEPMQNIEICNPKSSSLRSQVLDVSNEGVDESINKLDER 540
Query: 541 GAEFMLNEDTAIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDS 600
GA+ MLNEDTAIEI+LE EGGKSFDGDDDLFTHLAAENPIQMASFDIS QKLS D TTDS
Sbjct: 541 GADSMLNEDTAIEIVLEGEGGKSFDGDDDLFTHLAAENPIQMASFDISSQKLSQDGTTDS 600
Query: 601 GWEEKLEGKAYSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAESGKSVSKG 660
GW+E L EGTVSDESEVDWEDGVCDHVNPVPFE ESGKSVSKG
Sbjct: 601 GWKEAL------------------EGTVSDESEVDWEDGVCDHVNPVPFEDESGKSVSKG 660
Query: 661 SMEEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQQ--VIVGKMAEQCMSVQNENVIGLE 720
S+EEEADLQEAIRRSLED+GD KSGPVS EH+QPQ IVGKMAE+CMS +NENVIGLE
Sbjct: 661 SLEEEADLQEAIRRSLEDVGDGKSGPVSLEHEQPQSQPSIVGKMAERCMSFENENVIGLE 720
Query: 721 KLDSADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATFS 780
K+DS DGMN SNA DS KGMTESSSQEKQC EPVVLLDT TIAEQLDAS D +FS
Sbjct: 721 KMDSVDGMNWSNAKDSILKKGMTESSSQEKQCSEPVVLLDT---TIAEQLDASYKDTSFS 780
Query: 781 HKESNENNDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTCE 840
+ SNEN+DTLK+LSRDA A QV D IN+TVIE C MVEM+G+ TP++DSS K E
Sbjct: 781 LQVSNENSDTLKSLSRDAPRATQVGDMINSTVIEPACRMVEMDGVNTPDVDSSTKDSAFE 840
Query: 841 NHFKQNLPVDKCNSDLLLEKDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDEQ 900
NHFKQN PV+K +SDLLLE++ K +IS AE E+TEDEL +RIS+LEQERL+LGDEQ
Sbjct: 841 NHFKQNFPVEKHSSDLLLEEEVGKGHTVKISKAEAEVTEDELKSRISILEQERLSLGDEQ 900
Query: 901 KRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF 960
KRLERNAE+VSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF
Sbjct: 901 KRLERNAEAVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVF 960
Query: 961 LFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNA 1020
LFGARSVYKNIFDDRKYVETYFMKD+ENELGL+R+KLIRMALLLGSDYTEGISGIGIVNA
Sbjct: 961 LFGARSVYKNIFDDRKYVETYFMKDIENELGLDRNKLIRMALLLGSDYTEGISGIGIVNA 1020
Query: 1021 VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARD 1080
VEVMNAF EEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRG KASEND TCSNSS RD
Sbjct: 1021 VEVMNAFSEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGQKASENDATCSNSSVRD 1080
Query: 1081 GSASEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPF 1140
GSASEENIDKDLKENIDVKQ+FM KHRNVSKNWHIPSEFPSEAVISAYI PQVDKSAEPF
Sbjct: 1081 GSASEENIDKDLKENIDVKQNFMVKHRNVSKNWHIPSEFPSEAVISAYISPQVDKSAEPF 1140
Query: 1141 SWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS 1200
SWGKPDHFVLRRLC EKFGW+NSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS
Sbjct: 1141 SWGKPDHFVLRRLCLEKFGWENSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRS 1200
Query: 1201 KRIKKAVKSITGSKSAVLMDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGACSNEDDI 1260
KRIKKAVKSITGSKSA LMD+TV +VSVN Q LS E +NMSEKCSS+IQGACSNED++
Sbjct: 1201 KRIKKAVKSITGSKSASLMDETVPNVSVNNQINLSGETQKNMSEKCSSEIQGACSNEDNV 1260
Query: 1261 ENRLQKPSRKRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRSERGRGR----GEGRGRGR 1320
+NRL+KPSRKRQLD E+SQPAKD+K TMK KGK+SRN GS SERGRGR G+GRGRGR
Sbjct: 1261 DNRLRKPSRKRQLDREQSQPAKDRKLTMKEKGKRSRNEGSHSERGRGRGRGGGKGRGRGR 1320
Query: 1321 LASKGKTPITELVETSSSDDGSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDG--- 1380
LASKGK+PITE VETSSSDD SE DD+K DLEN+QEPQERR+S+R+RKS SY MDD
Sbjct: 1321 LASKGKSPITEFVETSSSDDESESDDKKLDLENVQEPQERRKSSRVRKSASYKMDDADPD 1380
Query: 1381 -DQPSDHSEERFSNDEAKDDNVARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGG 1440
DQPSD+S R SNDEA DDNV + +Y PETVM +NTECD+ PK+SP RDYL TGG
Sbjct: 1381 QDQPSDYSGYRLSNDEANDDNVVQGRYTGPETVMIHSENTECDYEIPKRSPLRDYLGTGG 1440
Query: 1441 GFCLAEDEMSQEAMCMNKDLPLEANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEV 1500
GFC EDEMSQEAMC NKD LEA+NSEDYLT+GGGFCLDDDNECVD VAH DQAT+ E
Sbjct: 1441 GFCPTEDEMSQEAMCRNKDPALEASNSEDYLTLGGGFCLDDDNECVDPVAHLDQATVSEA 1500
Query: 1501 PKDGSGDDPGQSTFHPEKCIGGDQNKEDTDARGDSLLDMGDPNPVSNPNSSQLGEGVQEE 1555
KDGS DDPGQSTFHPEK IGGDQ +EDT RG+SLLD+GDPNPVS PNSS++GEGVQE+
Sbjct: 1501 LKDGSEDDPGQSTFHPEKDIGGDQLEEDTYPRGESLLDVGDPNPVSYPNSSEVGEGVQEK 1560
BLAST of Spg001975 vs. ExPASy TrEMBL
Match:
A0A5D3CQ45 (DNA repair protein UVH3 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1119G00080 PE=3 SV=1)
HSP 1 Score: 2363.2 bits (6123), Expect = 0.0e+00
Identity = 1272/1585 (80.25%), Postives = 1380/1585 (87.07%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGV GLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAH+LGFFR
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRL+
Sbjct: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKVMRLK 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
ELAED++NQKQQRKQ +PKK TL + E DGTSTSER KSVPN SHENLDGML ASIM
Sbjct: 121 ELAEDLQNQKQQRKQKLPKKSTLPSPKEKFDGTSTSERCKSVPNRGSHENLDGMLGASIM 180
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
AEE G F SSASS AG++L KED EQS++NQKY DSKGK++LSDE HV+GSDSERM++
Sbjct: 181 AEEKGIFSSSASSVAGATLDKEDGDEQSVVNQKYNKDSKGKELLSDETHVVGSDSERMDL 240
Query: 241 ASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTDDEDEEMILPEMHGVVD 300
ASR HQQN+DEMLAASIAAEEARSLN AS SA NLDGEDTDDEDEE+ILPEMHGVVD
Sbjct: 241 ASR-RHQQNVDEMLAASIAAEEARSLNGKASASAVTNLDGEDTDDEDEELILPEMHGVVD 300
Query: 301 PSVLAALPPSVQLDLLVQ----------------------------DPAKFSELQIQAYL 360
PSVLAALPPSVQLDLLVQ DPAKFSELQIQAYL
Sbjct: 301 PSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKVGSEMVQYASDPAKFSELQIQAYL 360
Query: 361 KTVAFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGD 420
KTVAFR+DIDQVQKAA+GRGVGGVQTSR++SEANREFIFSSSFTGDKQALAS+RAEKNGD
Sbjct: 361 KTVAFRQDIDQVQKAAAGRGVGGVQTSRVSSEANREFIFSSSFTGDKQALASARAEKNGD 420
Query: 421 KDLQAPRVQQPLSSLNNTEIPSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAM 480
KD+QAP VQQPLSSL +T+IPSTSN LARSTPDKS V E+NIETFLDERGRVRVSRVRAM
Sbjct: 421 KDVQAPIVQQPLSSLKSTKIPSTSNPLARSTPDKSEVFEENIETFLDERGRVRVSRVRAM 480
Query: 481 GMHMTRDLERNLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGD 540
G+HMTRDLERNLDLMKEIEK SA KA NP+ MQ+IEICNPESFSFQSQ LDTSDEGVG
Sbjct: 481 GIHMTRDLERNLDLMKEIEKTTSAKKATNPDLMQNIEICNPESFSFQSQDLDTSDEGVGG 540
Query: 541 SINKLNEKGAEFMLNEDTAIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKL 600
SINKL+E+G EFMLNE+TAIEI++EDEGGKSFDGDDDLFT+LAAENPI M SFDIS QKL
Sbjct: 541 SINKLDERGTEFMLNEETAIEIMMEDEGGKSFDGDDDLFTNLAAENPIGMGSFDISTQKL 600
Query: 601 SLDSTTDSGWEEKLEGKAYSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAE 660
SLD+TTDS W E LEGK Y+PKNV VDDH FKEG VSDESEVDWEDGVCDHVNPVPFEA+
Sbjct: 601 SLDNTTDSAWVEALEGKIYTPKNVGVDDHCFKEGIVSDESEVDWEDGVCDHVNPVPFEAD 660
Query: 661 SGKSVSKGSMEEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQQVIVGKMAEQCMSVQNE 720
S KSVSKGS+EEEADLQEAIRRSLED G K+GP+SS+HQQPQ VIVGK AE S E
Sbjct: 661 SAKSVSKGSLEEEADLQEAIRRSLEDKGYTKTGPLSSDHQQPQPVIVGKRAEHNTSALKE 720
Query: 721 NVIGLEKLDSADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASC 780
N+IGL KLDS DGM+ N NDS+ KG TES SQEKQC EPVVLLDTKTHT++EQLDAS
Sbjct: 721 NMIGLGKLDSDDGMSSLNFNDSSGIKGTTESLSQEKQCSEPVVLLDTKTHTLSEQLDASY 780
Query: 781 NDATFSHKESNENNDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSP 840
ND TFS KE NENND L+ LS DAS A+QV D INNT++ +PC MVEMEG YTP SSP
Sbjct: 781 NDTTFSPKELNENNDALEPLSEDASSAVQVGDMINNTLVNSPCHMVEMEGFYTPGNGSSP 840
Query: 841 KAFTCENHFKQNLPVDKCNSDLLL-EKDAKKPAVEEISNAEIEITEDELTNRISVLEQER 900
K+F CENH KQNLPVDK + DLLL EKDAK P VE+ S+A EITEDEL NRISVLEQER
Sbjct: 841 KSFACENHLKQNLPVDKHSHDLLLDEKDAKIPIVEKTSSA--EITEDELMNRISVLEQER 900
Query: 901 LNLGDEQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVV 960
LNLGDEQKRLERNAESV+SEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVV
Sbjct: 901 LNLGDEQKRLERNAESVNSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVV 960
Query: 961 TDDSDVFLFGARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGIS 1020
TDDSDVFLFGA+SVYKNIFDDRKYVETYFMKD+ENELGLNRDKLIRMALLLGSDYTEG+S
Sbjct: 961 TDDSDVFLFGAKSVYKNIFDDRKYVETYFMKDIENELGLNRDKLIRMALLLGSDYTEGVS 1020
Query: 1021 GIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTC 1080
GIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILG LGAKTGL+ARKRGSKASEN+ TC
Sbjct: 1021 GIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPSILGPLGAKTGLNARKRGSKASENETTC 1080
Query: 1081 SNSSARDGSASEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQV 1140
SNSS G ASEENI KDLKEN+ VKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQV
Sbjct: 1081 SNSS---GPASEENISKDLKENMTVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQV 1140
Query: 1141 DKSAEPFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNE 1200
DKSAEPFSWGKPDHFVLRRLCWEKFGW+NSKADELLLPVLKEY KHETQLRLEAFYTFNE
Sbjct: 1141 DKSAEPFSWGKPDHFVLRRLCWEKFGWENSKADELLLPVLKEYSKHETQLRLEAFYTFNE 1200
Query: 1201 RFAKIRSKRIKKAVKSITGSKSAVLMDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGA 1260
RFAKIRSKRIKKAVKSITGS+SAVLMDD VRD SVN Q+ELSVEP EN+SEKCSS+IQGA
Sbjct: 1201 RFAKIRSKRIKKAVKSITGSRSAVLMDDAVRDASVNNQKELSVEPKENISEKCSSEIQGA 1260
Query: 1261 CSNEDDIENRLQKPSRKRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRSERGRGRGEGRG 1320
SN++DIENRLQKPSRKR LDGE+SQ K KK T K KGK+S + GS S+RGRGRG GRG
Sbjct: 1261 SSNKEDIENRLQKPSRKRHLDGEQSQFGKGKKLTKKEKGKRSVSEGSHSKRGRGRGRGRG 1320
Query: 1321 RGRLASKGKTPITELVETSSSDDGSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDG 1380
RGRLAS+GKTPIT+LVETSSSDD SEFD+QKFD ENL EPQERRRS+RIRKS SY +DD
Sbjct: 1321 RGRLASRGKTPITDLVETSSSDDESEFDNQKFDSENLPEPQERRRSSRIRKSASYAIDDS 1380
Query: 1381 DQPSDHSEERFSNDEAKDDNVARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGGG 1440
+Q SDHS +RFSND+A++D V + QYA+PET++SQ++NTE T K+S Q DYLETGGG
Sbjct: 1381 NQLSDHSADRFSNDKAEEDMVVQGQYAYPETIISQYENTESSSRTLKQSLQNDYLETGGG 1440
Query: 1441 FCLAEDEMSQEAMCMNKDLPLEANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEVP 1500
FCL EDE S++ MC NKD +EANNSEDYLT+GGGFCLDD++ECVD VAHP+QATILEV
Sbjct: 1441 FCLVEDETSRQEMCQNKDPAMEANNSEDYLTIGGGFCLDDNDECVDPVAHPNQATILEVQ 1500
Query: 1501 KDGSGDDPGQSTFHPEKCIGGDQNKEDTDARGDSLLDMGDPNPVSNPNSSQLGEGVQEEP 1557
KD P +STF P+K I +EDTD +SL ++G+ + VSN NSSQ+GE V+EEP
Sbjct: 1501 KDDL--IPDRSTFSPKKHI----VEEDTDTHIESLHNLGNSSYVSNRNSSQVGENVEEEP 1560
BLAST of Spg001975 vs. ExPASy TrEMBL
Match:
A0A5A7UJM9 (DNA repair protein UVH3 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold135G00640 PE=3 SV=1)
HSP 1 Score: 2360.5 bits (6116), Expect = 0.0e+00
Identity = 1271/1575 (80.70%), Postives = 1379/1575 (87.56%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGV GLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAH+LGFFR
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRL+
Sbjct: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKVMRLK 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
ELAED++NQKQQRKQ +PKK TL + E DGTSTSER KSVPN SHENLDGML ASIM
Sbjct: 121 ELAEDLQNQKQQRKQKLPKKSTLPSPKEKFDGTSTSERCKSVPNRGSHENLDGMLGASIM 180
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
AEE G F SSASS AG++L KED EQS++NQKY DSKGK++LSDE HV+GSDSERM++
Sbjct: 181 AEEKGIFSSSASSVAGATLDKEDGDEQSVVNQKYNKDSKGKELLSDETHVVGSDSERMDL 240
Query: 241 ASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTDDEDEEMILPEMHGVVD 300
ASR HQQN+DEMLAASIAAEEARSLN AS SA NLDGEDTDDEDEE+ILPEMHGVVD
Sbjct: 241 ASR-RHQQNVDEMLAASIAAEEARSLNGKASASAVTNLDGEDTDDEDEELILPEMHGVVD 300
Query: 301 PSVLAALPPSVQLDLLVQ------------------DPAKFSELQIQAYLKTVAFRRDID 360
PSVLAALPPSVQLDLLVQ DPAKFSELQIQAYLKTVAFR+DID
Sbjct: 301 PSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRQDID 360
Query: 361 QVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDLQAPRVQQ 420
QVQKAA+GRGVGGVQTSR++SEANREFIFSSSFTGDKQALAS+RAEKNGDKD+QAP VQQ
Sbjct: 361 QVQKAAAGRGVGGVQTSRVSSEANREFIFSSSFTGDKQALASARAEKNGDKDVQAPIVQQ 420
Query: 421 PLSSLNNTEIPSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHMTRDLER 480
PLSSL +T+IPSTSN LARSTPDKS V E+NIETFLDERGRVRVSRVRAMG+HMTRDLER
Sbjct: 421 PLSSLKSTKIPSTSNPLARSTPDKSEVFEENIETFLDERGRVRVSRVRAMGIHMTRDLER 480
Query: 481 NLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSINKLNEKGA 540
NLDLMKEIEK SA KA NP+ MQ+IEICNPESFSFQSQ LDTSDEGVG SINKL+E+G
Sbjct: 481 NLDLMKEIEKTTSAKKATNPDLMQNIEICNPESFSFQSQDLDTSDEGVGGSINKLDERGT 540
Query: 541 EFMLNEDTAIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDSGW 600
EFMLNE+TAIEI++EDEGGKSFDGDDDLFT+LAAENPI M SFDIS QKLSLD+TTDS W
Sbjct: 541 EFMLNEETAIEIMMEDEGGKSFDGDDDLFTNLAAENPIGMGSFDISTQKLSLDNTTDSAW 600
Query: 601 EEKLEGKAYSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAESGKSVSKGSM 660
E LEGK Y+PKNV VDDH FKEG VSDESEVDWEDGVCDHVNPVPFEA+S KSVSKGS+
Sbjct: 601 VEALEGKIYTPKNVGVDDHCFKEGIVSDESEVDWEDGVCDHVNPVPFEADSAKSVSKGSL 660
Query: 661 EEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQQVIVGKMAEQCMSVQNENVIGLEKLDS 720
EEEADLQEAIRRSLED G K+GP+SS+HQQPQ VIVGK AE S EN+IGL KLDS
Sbjct: 661 EEEADLQEAIRRSLEDKGYTKTGPLSSDHQQPQPVIVGKRAEHNTSALKENMIGLGKLDS 720
Query: 721 ADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATFSHKES 780
DGM+ N NDS+ KG TES SQEKQC EPVVLLDTKTHT++EQLDAS ND TFS KE
Sbjct: 721 DDGMSSLNFNDSSGIKGTTESLSQEKQCSEPVVLLDTKTHTLSEQLDASYNDTTFSPKEL 780
Query: 781 NENNDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTCENHFK 840
NENND L+ LS DAS A+QV D INNT++ +PC MVEMEG YTP SSPK+F CENH K
Sbjct: 781 NENNDALEPLSEDASSAVQVGDMINNTLVNSPCHMVEMEGFYTPGNGSSPKSFACENHLK 840
Query: 841 QNLPVDKCNSDLLL-EKDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDEQKRL 900
QNLPVDK + DLLL EKDAK P VE+ S+A EITEDEL NRISVLEQERLNLGDEQKRL
Sbjct: 841 QNLPVDKHSHDLLLDEKDAKIPIVEKTSSA--EITEDELMNRISVLEQERLNLGDEQKRL 900
Query: 901 ERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFG 960
ERNAESV+SEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFG
Sbjct: 901 ERNAESVNSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFG 960
Query: 961 ARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAVEV 1020
A+SVYKNIFDDRKYVETYFMKD+ENELGLNRDKLIRMALLLGSDYTEG+SGIGIVNAVEV
Sbjct: 961 AKSVYKNIFDDRKYVETYFMKDIENELGLNRDKLIRMALLLGSDYTEGVSGIGIVNAVEV 1020
Query: 1021 MNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARDGSA 1080
MNAFPEEDGLHKFKEWIESPDPSILG LGAKTGL+ARKRGSKASEN+ TCSNSS G A
Sbjct: 1021 MNAFPEEDGLHKFKEWIESPDPSILGPLGAKTGLNARKRGSKASENETTCSNSS---GPA 1080
Query: 1081 SEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWG 1140
SEENI KDLKEN+ VKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWG
Sbjct: 1081 SEENISKDLKENMTVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWG 1140
Query: 1141 KPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRI 1200
KPDHFVLRRLCWEKFGW+NSKADELLLPVLKEY KHETQLRLEAFYTFNERFAKIRSKRI
Sbjct: 1141 KPDHFVLRRLCWEKFGWENSKADELLLPVLKEYSKHETQLRLEAFYTFNERFAKIRSKRI 1200
Query: 1201 KKAVKSITGSKSAVLMDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGACSNEDDIENR 1260
KKAVKSITGS+SAVLMDD VRD SVN Q+ELSVEP EN+SEKCSS+IQGA SN++DIENR
Sbjct: 1201 KKAVKSITGSRSAVLMDDAVRDASVNNQKELSVEPKENISEKCSSEIQGASSNKEDIENR 1260
Query: 1261 LQKPSRKRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRSERGRGRGEGRGRGRLASKGKT 1320
LQKPSRKR LDGE+SQ K KK T K KGK+S + GS S+RGRGR GRGRGRLAS+GKT
Sbjct: 1261 LQKPSRKRHLDGEQSQFGKGKKLTKKEKGKRSVSEGSHSKRGRGR--GRGRGRLASRGKT 1320
Query: 1321 PITELVETSSSDDGSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDGDQPSDHSEER 1380
PIT+LVETSSSDD SEFD+QKFD ENL EPQERRRS+RIRKS SY +DD +Q SDHS +R
Sbjct: 1321 PITDLVETSSSDDESEFDNQKFDSENLPEPQERRRSSRIRKSASYAIDDSNQLSDHSADR 1380
Query: 1381 FSNDEAKDDNVARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGGGFCLAEDEMSQ 1440
FSND+A++D V + QYA+PET++SQ++NTE T K+S Q DYLETGGGFCL EDE S+
Sbjct: 1381 FSNDKAEEDMVVQGQYAYPETIISQYENTESSSRTLKQSLQNDYLETGGGFCLVEDETSR 1440
Query: 1441 EAMCMNKDLPLEANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEVPKDGSGDDPGQ 1500
+ MC NKD +EANNSEDYLT+GGGFCLDD++ECVD VAHP+QATILEV KD P +
Sbjct: 1441 QEMCQNKDPAMEANNSEDYLTIGGGFCLDDNDECVDPVAHPNQATILEVQKDDL--IPDR 1500
Query: 1501 STFHPEKCIGGDQNKEDTDARGDSLLDMGDPNPVSNPNSSQLGEGVQEEPKDHSVRTFGG 1557
STF P+K I +EDTD +SL ++G+ + VSN NSSQ+GE V+EEPKDH VR FGG
Sbjct: 1501 STFSPKKHI----VEEDTDTHIESLHNLGNSSYVSNRNSSQVGENVEEEPKDHCVRAFGG 1560
BLAST of Spg001975 vs. ExPASy TrEMBL
Match:
A0A1S4DUP2 (LOW QUALITY PROTEIN: DNA repair protein UVH3 OS=Cucumis melo OX=3656 GN=LOC103487179 PE=3 SV=1)
HSP 1 Score: 2349.7 bits (6088), Expect = 0.0e+00
Identity = 1270/1575 (80.63%), Postives = 1375/1575 (87.30%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGV GLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAH+LGFFR
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHLLGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLK MRL+
Sbjct: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKVMRLK 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNSDSHENLDGMLAASIM 180
ELAED++NQKQQRKQ +PKK TL + E DGTSTSER KSVPN SHENLDGML ASIM
Sbjct: 121 ELAEDLQNQKQQRKQKLPKKSTLPSPKEKFDGTSTSERCKSVPNRGSHENLDGMLGASIM 180
Query: 181 AEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERMEV 240
AEE G F SSASS AG++L KED EQS++NQKY DSKGK++LSDE HV+GSDSERM++
Sbjct: 181 AEEKGIFSSSASSVAGATLDKEDGDEQSVVNQKYNKDSKGKELLSDETHVVGSDSERMDL 240
Query: 241 ASRSAHQQNLDEMLAASIAAEEARSLNENASVSAAANLDGEDTDDEDEEMILPEMHGVVD 300
ASR HQQN+DEMLAASIAAEEARSLN AS SA NLDGEDTDDEDEE+ILPEMHGVVD
Sbjct: 241 ASR-RHQQNVDEMLAASIAAEEARSLNGKASASAVTNLDGEDTDDEDEELILPEMHGVVD 300
Query: 301 PSVLAALPPSVQLDLLVQ------------------DPAKFSELQIQAYLKTVAFRRDID 360
PSVLAALPPSVQLDLLVQ DPAKFSELQIQAYLKTVAFR+DID
Sbjct: 301 PSVLAALPPSVQLDLLVQMRERLMAENRQKYQRVKKDPAKFSELQIQAYLKTVAFRQDID 360
Query: 361 QVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDLQAPRVQQ 420
QVQKAA+GRGVGGVQTSR+ASEANREFIFSSSFTGDKQALAS+RAEKNGDKD+QAP VQQ
Sbjct: 361 QVQKAAAGRGVGGVQTSRVASEANREFIFSSSFTGDKQALASARAEKNGDKDVQAPIVQQ 420
Query: 421 PLSSLNNTEIPSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMHMTRDLER 480
PLSSL +T+IPSTSN LARSTPDKS V E+NIETFLDERGRVRVSRVRAMG+HMTRDLER
Sbjct: 421 PLSSLKSTKIPSTSNPLARSTPDKSEVFEENIETFLDERGRVRVSRVRAMGIHMTRDLER 480
Query: 481 NLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSINKLNEKGA 540
NLDLMKEIEK SA KA NP+ MQ+IEICNPESFSFQSQ LDTSDEGVG SINKL+E+G
Sbjct: 481 NLDLMKEIEKTTSAKKATNPDLMQNIEICNPESFSFQSQDLDTSDEGVGGSINKLDERGT 540
Query: 541 EFMLNEDTAIEILLEDEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSLDSTTDSGW 600
EFMLNE+TAIEI+LEDEGGKSFDGDDDLFT+LAAENPI M SFDIS QKLSLD+TTDS W
Sbjct: 541 EFMLNEETAIEIMLEDEGGKSFDGDDDLFTNLAAENPIGMGSFDISTQKLSLDNTTDSAW 600
Query: 601 EEKLEGKAYSPKNVQVDDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEAESGKSVSKGSM 660
E LE K KNV VDDH FKEG VSDESEVDWEDGVCDHVNPVPFEA+S KSVSKGS+
Sbjct: 601 VEALEEKFILQKNVGVDDHCFKEGIVSDESEVDWEDGVCDHVNPVPFEADSAKSVSKGSL 660
Query: 661 EEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQQVIVGKMAEQCMSVQNENVIGLEKLDS 720
EEEADLQEAIRRSLED G K+GP+SS+HQQPQ VIVGK AE S EN+IGL KLDS
Sbjct: 661 EEEADLQEAIRRSLEDKGYTKTGPLSSDHQQPQPVIVGKRAEHNTSALKENMIGLGKLDS 720
Query: 721 ADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHTIAEQLDASCNDATFSHKES 780
DGM+ N NDS+ KG TES SQEKQC EPVVLLDTKTHT++EQLDAS ND TFS KE
Sbjct: 721 DDGMSSLNFNDSSGIKGTTESLSQEKQCSEPVVLLDTKTHTLSEQLDASYNDTTFSPKEL 780
Query: 781 NENNDTLKALSRDASGAIQVEDRINNTVIEAPCCMVEMEGIYTPNIDSSPKAFTCENHFK 840
NENND L+ LS DAS A+QV D INNT++ +PC MVEMEG YTP SSPK+F CENH K
Sbjct: 781 NENNDALEPLSEDASSAVQVGDMINNTLVNSPCHMVEMEGFYTPGNGSSPKSFACENHLK 840
Query: 841 QNLPVDKCNSDLLL-EKDAKKPAVEEISNAEIEITEDELTNRISVLEQERLNLGDEQKRL 900
QNLPVDK + DLLL EKDAK P VE+ S+A EITEDEL NRISVLEQERLNLGDEQKRL
Sbjct: 841 QNLPVDKHSHDLLLDEKDAKIPIVEKTSSA--EITEDELMNRISVLEQERLNLGDEQKRL 900
Query: 901 ERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFG 960
ERNAESV+SEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFG
Sbjct: 901 ERNAESVNSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFG 960
Query: 961 ARSVYKNIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAVEV 1020
A+SVYKNIFDDRKYVETYFMKD+ENELGLNRDKLIRMALLLGSDYTEG+SGIGIVNAVEV
Sbjct: 961 AKSVYKNIFDDRKYVETYFMKDIENELGLNRDKLIRMALLLGSDYTEGVSGIGIVNAVEV 1020
Query: 1021 MNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARDGSA 1080
MNAF EEDGLHKFKEWIESPDPSILG LGAKTGL+ARKRGSKASEN+ TCSNSS G A
Sbjct: 1021 MNAFLEEDGLHKFKEWIESPDPSILGPLGAKTGLNARKRGSKASENETTCSNSS---GPA 1080
Query: 1081 SEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWG 1140
SEENI KDLKEN+ VKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWG
Sbjct: 1081 SEENISKDLKENMTVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWG 1140
Query: 1141 KPDHFVLRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRI 1200
KPDHFVLRRLCWEKFGW+NSKADELLLPVLKEY KHETQLRLEAFYTFNERFAKIRSKRI
Sbjct: 1141 KPDHFVLRRLCWEKFGWENSKADELLLPVLKEYSKHETQLRLEAFYTFNERFAKIRSKRI 1200
Query: 1201 KKAVKSITGSKSAVLMDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGACSNEDDIENR 1260
KKAVKSITGS+SAVLMDD VRD SVN Q+ELSVEP EN+SEKCSS+IQGA SN++DIENR
Sbjct: 1201 KKAVKSITGSRSAVLMDDAVRDASVNNQKELSVEPKENISEKCSSEIQGASSNKEDIENR 1260
Query: 1261 LQKPSRKRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRSERGRGRGEGRGRGRLASKGKT 1320
LQKPSRKRQLDGE+SQ K KK T K KGK+S + GS S+RGRGR GRGRGRLAS+GKT
Sbjct: 1261 LQKPSRKRQLDGEQSQFGKGKKLTKKEKGKRSVSEGSHSKRGRGR--GRGRGRLASRGKT 1320
Query: 1321 PITELVETSSSDDGSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDGDQPSDHSEER 1380
PIT+LVETSSSDD SEFD+QKFD ENL EPQERRRS+RIRKS SY +DD +Q SDHS +R
Sbjct: 1321 PITDLVETSSSDDESEFDNQKFDSENLPEPQERRRSSRIRKSASYAIDDSNQLSDHSADR 1380
Query: 1381 FSNDEAKDDNVARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGGGFCLAEDEMSQ 1440
FSND+A++D V + QYA+PET++SQ++NTE T K+S Q DYLETGGGFCL EDE S+
Sbjct: 1381 FSNDKAEEDMVVQGQYAYPETIISQYENTESSSRTLKQSLQNDYLETGGGFCLVEDETSR 1440
Query: 1441 EAMCMNKDLPLEANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEVPKDGSGDDPGQ 1500
+ MC NKD +EANNSEDYLT+GGGFCLDD++ECVD VAHP+QATILEV KD P +
Sbjct: 1441 QEMCQNKDPAMEANNSEDYLTIGGGFCLDDNDECVDPVAHPNQATILEVQKDDL--IPDR 1500
Query: 1501 STFHPEKCIGGDQNKEDTDARGDSLLDMGDPNPVSNPNSSQLGEGVQEEPKDHSVRTFGG 1557
STF P+K I +EDTD +SL ++G+ + VSN NSSQ+GE V+EEPKDH VR FGG
Sbjct: 1501 STFSPKKHI----VEEDTDTHIESLHNLGNSSYVSNRNSSQVGENVEEEPKDHCVRAFGG 1560
BLAST of Spg001975 vs. TAIR 10
Match:
AT3G28030.1 (5'-3' exonuclease family protein )
HSP 1 Score: 989.9 bits (2558), Expect = 2.2e-288
Identity = 695/1629 (42.66%), Postives = 929/1629 (57.03%), Query Frame = 0
Query: 1 MGVHGLWELLAPVGRRVSVETLAGKKLAIDASIWMVQFIKAMRDDRGEMVRNAHILGFFR 60
MGV GLWELLAPVGRRVSVETLA K+LAIDASIWMVQFIKAMRD++G+MV+NAH++GFFR
Sbjct: 1 MGVQGLWELLAPVGRRVSVETLANKRLAIDASIWMVQFIKAMRDEKGDMVQNAHLIGFFR 60
Query: 61 RICKLLFLRTKPVFVFDGATPALKRRTLIARRRQRENAQAKVRKTAEKLLLNHLKAMRLR 120
RICKLLFLRTKP+FVFDGATPALKRRT+IARRRQRENAQ K+RKTAEKLLLN LK +RL+
Sbjct: 61 RICKLLFLRTKPIFVFDGATPALKRRTVIARRRQRENAQTKIRKTAEKLLLNRLKDIRLK 120
Query: 121 ELAEDIKNQKQQRKQDVPKKKTLLNHNEIVDGTSTSERSKSVPNS-DSHENLDGMLAASI 180
E A+DIKNQ+ ++ D + K ++ + + D V S E LD + AS+
Sbjct: 121 EQAKDIKNQR-LKQDDSDRVKKRVSSDSVEDNLRVPVEEDDVGASFFQEEKLDEVSQASL 180
Query: 181 MAEENGFFLSSASSFAGSSLAKEDSGEQSILNQKYKSDSKGKKILSDEVHVMGSDSERME 240
+ E +G ++ + K D KGK +L D G D + +
Sbjct: 181 VGE---------------------TGVDDVVKESVKDDPKGKGVLLD-----GDDLDNLV 240
Query: 241 VASR---SAHQQNLDEMLAASIAAEEARSLNENASVSAAA---NLDGEDTDDEDEEMILP 300
S +Q+ LDEMLAAS+AAEE R+ AS SAAA D E+ D DEE++LP
Sbjct: 241 QDSSVQGKDYQEKLDEMLAASLAAEEERNFTSKASTSAAAIPSEEDEEEDSDGDEEILLP 300
Query: 301 EMHGVVDPSVLAALPPSVQLDLLVQ------------------DPAKFSELQIQAYLKTV 360
M G +DP+VLA+LPPS+QLDLL Q P KFSELQI+AYLKTV
Sbjct: 301 VMDGNIDPAVLASLPPSMQLDLLAQMREKLMAENRQKYQKVKKAPEKFSELQIEAYLKTV 360
Query: 361 AFRRDIDQVQKAASGRGVGGVQTSRIASEANREFIFSSSFTGDKQALASSRAEKNGDKDL 420
AFRR+I++VQ++A GR VGGVQTSRIASEANREFIFSSSF GDK+ LAS+R +N +
Sbjct: 361 AFRREINEVQRSAGGRAVGGVQTSRIASEANREFIFSSSFAGDKEVLASAREGRNDENQK 420
Query: 421 QAPRVQQPLSSLNNTEIPSTSNALARSTPDKSGVLEDNIETFLDERGRVRVSRVRAMGMH 480
+ + P+S N + + S+A D+ ++NIE ++DERGR R+ R R MG+
Sbjct: 421 KTSQQSLPVSVKNASPL-KKSDATIELDRDEPKNPDENIEVYIDERGRFRI-RNRHMGIQ 480
Query: 481 MTRDLERNLDLMKEIEKNASANKAANPEHMQHIEICNPESFSFQSQVLDTSDEGVGDSIN 540
MTRD++RNL LMKE E+ AS + A N E E+F + Q L+ S V +
Sbjct: 481 MTRDIQRNLHLMKEKERTASGSMAKNDETFSAW-----ENFPTEDQFLEKSP--VEKDVV 540
Query: 541 KLNEKGAEFMLNEDTAIEILLE-DEGGKSFDGDDDLFTHLAAENPIQMASFDISPQKLSL 600
L + + ML+ ++IEI + D GGK + +DD+F LAA P+ ++S + P++ +
Sbjct: 541 DLEIQNDDSMLHPPSSIEISFDHDGGGKDLNDEDDMFLQLAAGGPVTISSTENDPKEDTS 600
Query: 601 DSTTDSGWEEKLEGKAYSPKNVQV---DDHSFKEGTVSDESEVDWEDGVCDHVNPVPFEA 660
+DS WEE + S ++ + H K+ +S V WE+ C + N E
Sbjct: 601 PWASDSDWEEVPVEQNTSVSKLEANLSNQHIPKD--ISIAEGVAWEEYSCKNANN-SVEN 660
Query: 661 ESGKSVSKGSMEEEADLQEAIRRSLEDIGDRKSGPVSSEHQQPQ-QVIVGKMAEQCM--- 720
++ ++KG +EEEADLQEAI++SL ++ D++SG V E+Q + ++V K +E +
Sbjct: 661 DTVTKITKGYLEEEADLQEAIKKSLLELHDKESGDVLEENQSVRVNLVVDKPSEDSLCSR 720
Query: 721 ----SVQNENVIGLEKLDSADGMNCSNANDSTRTKGMTESSSQEKQCPEPVVLLDTKTHT 780
+ E + + G +N S + ++ P + +H
Sbjct: 721 ETVGEAEEERFLDEITILKTSGAISEQSNTSVAGNADGQKGITKQFGTHPSSGSNNVSHA 780
Query: 781 IAEQLD--------------ASCND--ATFSHKESNENNDTLKALSRDASGAIQVEDRIN 840
++ +L AS N +T + + + E +++ S S ++ I
Sbjct: 781 VSNKLSKVKSVISPEKALNVASQNRMLSTMAKQHNEEGSESFGGESVKVSAMPIADEEIT 840
Query: 841 NTVIEAPCCMVEMEGIYTPNIDSSPK---AFTCENHFKQNLPVDKCNSDLLLEKDAKKPA 900
+ E E + D S + + E+ + V + +L + D++
Sbjct: 841 GFLDEKDNADGESSIMMDDKRDYSRRKIQSLVTESR-DPSRNVVRSRIGILHDTDSQNER 900
Query: 901 VEEISNAE-----------------IEITEDELTNRISVLEQERLNLGDEQKRLERNAES 960
EE ++ E +E +E + I VL+QE ++LGDEQ++LERNAES
Sbjct: 901 REENNSNEHTFNIDSSTDFEEKGVPVEFSEANIEEEIRVLDQEFVSLGDEQRKLERNAES 960
Query: 961 VSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYK 1020
VSSEMFAECQELLQ+FG+PYIIAPMEAEAQCA+ME +NLVDG+VTDDSDVFLFGARSVYK
Sbjct: 961 VSSEMFAECQELLQIFGIPYIIAPMEAEAQCAFMEQSNLVDGIVTDDSDVFLFGARSVYK 1020
Query: 1021 NIFDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPE 1080
NIFDDRKYVETYFMKD+E ELGL+RDK+IRMA+LLGSDYTEGISGIGIVNA+EV+ AFPE
Sbjct: 1021 NIFDDRKYVETYFMKDIEKELGLSRDKIIRMAMLLGSDYTEGISGIGIVNAIEVVTAFPE 1080
Query: 1081 EDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCSNSSARDGSASEENID 1140
EDGL KF+EW+ESPDP+ILG AKTG +KRGS + +N S +S D
Sbjct: 1081 EDGLQKFREWVESPDPTILGKTDAKTGSKVKKRGSASVDNKGIISGASTDDTE------- 1140
Query: 1141 KDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQVDKSAEPFSWGKPDHFV 1200
++KQ FMD+HR VSKNWHIP FPSEAVISAY+ PQVD S E FSWGKPD V
Sbjct: 1141 -------EIKQIFMDQHRKVSKNWHIPLTFPSEAVISAYLNPQVDLSTEKFSWGKPDLSV 1200
Query: 1201 LRRLCWEKFGWDNSKADELLLPVLKEYGKHETQLRLEAFYTFNERFAKIRSKRIKKAVKS 1260
LR+LCWEKF W+ K DELLLPVLKEY K ETQLR+EAFY+FNERFAKIRSKRI KAVK
Sbjct: 1201 LRKLCWEKFNWNGKKTDELLLPVLKEYEKRETQLRIEAFYSFNERFAKIRSKRINKAVKG 1260
Query: 1261 ITGSKSAVLMDDTVRDVSVNYQRELSVEPDENMSEKCSSKIQGACSNEDDIENRLQKPSR 1320
I G S+ + D T+++ + + V P E S K + I N K R
Sbjct: 1261 IGGGLSSDVADHTLQE-GPRKRNKKKVAPHETEDNNTSDK-------DSPIANEKVKNKR 1320
Query: 1321 KRQLDGERSQPAKDKKRTMKGKGKQSRNGGSRSERGRGRGEGRGRGRLASKGKTPITELV 1380
KR +P+ S RGRGR + RGRGR + + + EL
Sbjct: 1321 KR-----LEKPS--------------------SSRGRGRAQKRGRGR--GRVQKDLLELS 1380
Query: 1381 ETSSSDDGSEFDDQKFDLENLQEPQERRRSTRIRKSVSYTMDDGDQPSDHSEERFSNDEA 1440
+ SS DD + DD+ +LE +P ++STR R V Y+ + D+ +E SN+ +
Sbjct: 1381 DGSSDDD--DDDDKVVELE--AKPANLQKSTRSRNPVMYSAKEDDE----LDESRSNEGS 1440
Query: 1441 KDDNVARDQYAHPETVMSQFKNTECDFGTPKKSPQRDYLETGGGFCLAE-DEMSQEAMCM 1500
+N E + N + + P DY++TGGGFC E DE+
Sbjct: 1441 PSEN-------FEEVDEGRIGNDDSVDASINDCPSEDYIQTGGGFCADEADEIG------ 1478
Query: 1501 NKDLPLEANNSEDYLTMGGGFCLDDDNECVDTVAHPDQATILEVPKDGSGDDPGQSTFHP 1556
D LE ++DY +GGGFC+D+D E + D A IL++ +
Sbjct: 1501 --DAHLEDKATDDYRVIGGGFCVDED-ETAEENTMDDDAEILKMESE------------E 1478
BLAST of Spg001975 vs. TAIR 10
Match:
AT1G01880.1 (5'-3' exonuclease family protein )
HSP 1 Score: 118.6 bits (296), Expect = 4.4e-26
Identity = 96/299 (32.11%), Postives = 142/299 (47.49%), Query Frame = 0
Query: 881 LERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLF 940
+ERN + SE EC ELL++ G+P + A EAEA CA + VD +T DSD FLF
Sbjct: 110 VERN--KLFSEWVRECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLF 169
Query: 941 GARSVYKNIF-DDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDY-TEGISGIGIVNA 1000
GA V K+I + R+ E Y M +E+ LGL R LI ++LL+G+DY + G+ GIG+ A
Sbjct: 170 GAMCVIKDIKPNSREPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKA 229
Query: 1001 VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCS------ 1060
+ ++ F E+ L + ++ P++ G G K+G + S+ + CS
Sbjct: 230 LRIVREFSEDQVLERLQDIGNGLQPAVPG--GIKSGDDGEEFRSEMKKRSPHCSRCGHLG 289
Query: 1061 ------NSSARDGSASEENIDKDLKENIDVKQSFMDKHRNV-----SKNWHI-------- 1120
SS I K L + SF K R++ + +W I
Sbjct: 290 SKRTHFKSSCEHCGCDSGCIKKPL--GFRCECSFCSKDRDLREQKKTNDWWIKVCDKIAL 349
Query: 1121 PSEFPSEAVISAYICP-QVDKSAEPFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVL 1152
EFP+ +I Y+ + SWG PD +L L K WD S ++LLP+L
Sbjct: 350 APEFPNRKIIELYLSDGLMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPML 402
BLAST of Spg001975 vs. TAIR 10
Match:
AT1G01880.2 (5'-3' exonuclease family protein )
HSP 1 Score: 91.7 bits (226), Expect = 5.7e-18
Identity = 125/489 (25.56%), Postives = 207/489 (42.33%), Query Frame = 0
Query: 881 LERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLF 940
+ERN + SE EC ELL++ G+P + A EAEA CA + VD +T DSD FLF
Sbjct: 110 VERN--KLFSEWVREC-ELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLF 169
Query: 941 GARSVYKNIF-DDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDY-TEGISGIGIVNA 1000
GA V K+I + R+ E Y M +E+ LGL R LI ++LL+G+DY + G+ GIG+ A
Sbjct: 170 GAMCVIKDIKPNSREPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKA 229
Query: 1001 VEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLSARKRGSKASENDMTCS------ 1060
+ ++ F E+ L + ++ P++ G G K+G + S+ + CS
Sbjct: 230 LRIVREFSEDQVLERLQDIGNGLQPAVPG--GIKSGDDGEEFRSEMKKRSPHCSRCGHLG 289
Query: 1061 ------NSSARDGSASEENIDKDLKENIDVKQSFMDKHRNV-----SKNWHI-------- 1120
SS I K L + SF K R++ + +W I
Sbjct: 290 SKRTHFKSSCEHCGCDSGCIKKPL--GFRCECSFCSKDRDLREQKKTNDWWIKVCDKIAL 349
Query: 1121 PSEFPSEAVISAYICP-QVDKSAEPFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLK 1180
EFP+ +I Y+ + SWG PD +L L K WD S ++LLP+L
Sbjct: 350 APEFPNRKIIELYLSDGLMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPMLS 409
Query: 1181 EYGKHETQLRLEAFYTFNERFAKIRSKRIK------------KAVKSITG--------SK 1240
E + +++ K IK + KS +G +
Sbjct: 410 TIYLREKARNNTGYALLCDQYEFHSIKCIKTRYGHQSFVIRWRKPKSTSGYSHSHSEPEE 469
Query: 1241 SAVLMDDTVRDVS-VNYQRELSVEPDEN----MSEKCSSKIQGACSNEDD---IENRLQK 1300
S V++++ V ++ E V+ D ++++C +Q A +E + E +L++
Sbjct: 470 SIVVLEEEEESVDPLDGLNEPQVQNDNGDCFLLTDECIGLVQSAFPDETEHFLHEKKLRE 529
Query: 1301 PSRKRQLDGERSQPAKDKKRTMKG------KGKQSRNGGSRSERGRGRGEGRGRGRLASK 1308
+K + E + P + K++ G S G + + + S
Sbjct: 530 SKKKNVSEEETATPRATTMGVQRSITDFYRSAKKAAAGQSIETGGSSKASAEKKRQATST 589
BLAST of Spg001975 vs. TAIR 10
Match:
AT3G48900.1 (single-stranded DNA endonuclease family protein )
HSP 1 Score: 90.9 bits (224), Expect = 9.8e-18
Identity = 82/288 (28.47%), Postives = 129/288 (44.79%), Query Frame = 0
Query: 877 EQKRLERNAESVSSEMFAECQELLQMFGLPYIIAPMEAEAQCAYMELANLVDGVVTDDSD 936
++ L+RN S S + E + + G+ + EAEAQCA + +L D + DSD
Sbjct: 42 KETSLKRNMGSEFSCIIKEAKVIASTLGILCLDGIEEAEAQCALLNSESLCDACFSFDSD 101
Query: 937 VFLFGARSVYKNI-FDDRKYVETYFMKDVENELGLNRDKLIRMALLLGSDYTEGISGIGI 996
+FLFGA++VY+ I + YV Y M D++ +LGL R+ LI +ALLLGSDY++G+ G+
Sbjct: 102 IFLFGAKTVYREICLGEGGYVVCYEMDDIKKKLGLGRNSLIALALLLGSDYSQGVRGLRQ 161
Query: 997 VNAVEVMNAFPEEDGLHKFKEWIESPDPSILGTLGAKTGLS--ARKRGSKASENDMTCSN 1056
A E++ + + L K A GLS + R SK CS
Sbjct: 162 EKACELVRSIGDNVILEKV----------------ASEGLSFAEKPRKSKKQVRPSVCSK 221
Query: 1057 SSARDGSASEENIDKDLKENIDVKQSFMDKHRNVSKNWHIPSEFPSEAVISAYICPQ--- 1116
N ++D + ++KQ VI A++ P+
Sbjct: 222 KGTLPLVVINGN-NRDPERLEEIKQ-----------------------VIDAFMNPKCHQ 281
Query: 1117 -----VDKSAEPFSWGKPDHFVLRRLCWEKFGWDNSKADELLLPVLKE 1154
V ++ FS+ + L+ +C + F W K DE +LP + E
Sbjct: 282 ADSNTVSRALAEFSFQRTK---LQEICHQFFEWPPEKTDEYILPKVAE 286
BLAST of Spg001975 vs. TAIR 10
Match:
AT3G48900.2 (single-stranded DNA endonuclease family protein )
HSP 1 Score: 90.9 bits (224), Expect = 9.8e-18
Identity = 97/379 (25.59%), Postives = 157/379 (41.42%), Query Frame = 0
Query: 791 IEAPCCMVEMEGIYTPNIDSSPKAFTCENHFKQNLPVDKCNSDLLLEKDA-----KKPAV 850
++ C MVE+ + + K + F + + N ++L D K P
Sbjct: 29 VDLSCWMVELHKVNKSYCATKEKVY-LRGFFHRLRALIALNCSIILVSDGAIPGIKVPTY 88
Query: 851 EEISNAEIEITEDELTNRISVLEQERLNLGDEQKRLERNAESVSSEMFAECQELLQMFGL 910
+ A EI +D + ++ L+RN S S + E + + G+
Sbjct: 89 KRRLKARFEIADDGVE-------------PSKETSLKRNMGSEFSCIIKEAKVIASTLGI 148
Query: 911 PYIIAPMEAEAQCAYMELANLVDGVVTDDSDVFLFGARSVYKNI-FDDRKYVETYFMKDV 970
+ EAEAQCA + +L D + DSD+FLFGA++VY+ I + YV Y M D+
Sbjct: 149 LCLDGIEEAEAQCALLNSESLCDACFSFDSDIFLFGAKTVYREICLGEGGYVVCYEMDDI 208
Query: 971 ENELGLNRDKLIRMALLLGSDYTEGISGIGIVNAVEVMNAFPEEDGLHKFKEWIESPDPS 1030
+ +LGL R+ LI +ALLLGSDY++G+ G+ A E++ + + L K
Sbjct: 209 KKKLGLGRNSLIALALLLGSDYSQGVRGLRQEKACELVRSIGDNVILEKV---------- 268
Query: 1031 ILGTLGAKTGLS--ARKRGSKASENDMTCSNSSARDGSASEENIDKDLKENIDVKQSFMD 1090
A GLS + R SK CS N ++D + ++KQ
Sbjct: 269 ------ASEGLSFAEKPRKSKKQVRPSVCSKKGTLPLVVINGN-NRDPERLEEIKQ---- 328
Query: 1091 KHRNVSKNWHIPSEFPSEAVISAYICPQ--------VDKSAEPFSWGKPDHFVLRRLCWE 1150
VI A++ P+ V ++ FS+ + L+ +C +
Sbjct: 329 -------------------VIDAFMNPKCHQADSNTVSRALAEFSFQRTK---LQEICHQ 350
Query: 1151 KFGWDNSKADELLLPVLKE 1154
F W K DE +LP + E
Sbjct: 389 FFEWPPEKTDEYILPKVAE 350
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038903932.1 | 0.0e+00 | 83.19 | DNA repair protein UVH3 isoform X2 [Benincasa hispida] | [more] |
KAG6588970.1 | 0.0e+00 | 83.70 | Potassium transporter 8, partial [Cucurbita argyrosperma subsp. sororia] | [more] |
XP_023529571.1 | 0.0e+00 | 83.04 | DNA repair protein UVH3 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_038903931.1 | 0.0e+00 | 81.63 | DNA repair protein UVH3 isoform X1 [Benincasa hispida] | [more] |
KAG7015248.1 | 0.0e+00 | 82.81 | DNA repair protein UVH3 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
Match Name | E-value | Identity | Description | |
Q9ATY5 | 3.1e-287 | 42.66 | DNA repair protein UVH3 OS=Arabidopsis thaliana OX=3702 GN=UVH3 PE=2 SV=1 | [more] |
P14629 | 8.2e-70 | 25.28 | DNA excision repair protein ERCC-5 homolog OS=Xenopus laevis OX=8355 GN=ercc5 PE... | [more] |
P35689 | 1.7e-67 | 25.52 | DNA excision repair protein ERCC-5 OS=Mus musculus OX=10090 GN=Ercc5 PE=1 SV=4 | [more] |
P28715 | 2.3e-64 | 25.26 | DNA excision repair protein ERCC-5 OS=Homo sapiens OX=9606 GN=ERCC5 PE=1 SV=3 | [more] |
P28706 | 4.0e-64 | 26.57 | DNA repair protein rad13 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) ... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1ERW5 | 0.0e+00 | 82.86 | DNA repair protein UVH3 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435307... | [more] |
A0A6J1JJE1 | 0.0e+00 | 82.51 | DNA repair protein UVH3 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111486346 P... | [more] |
A0A5D3CQ45 | 0.0e+00 | 80.25 | DNA repair protein UVH3 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A5A7UJM9 | 0.0e+00 | 80.70 | DNA repair protein UVH3 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... | [more] |
A0A1S4DUP2 | 0.0e+00 | 80.63 | LOW QUALITY PROTEIN: DNA repair protein UVH3 OS=Cucumis melo OX=3656 GN=LOC10348... | [more] |