Csor.00g073470 (gene) Silver-seed gourd (wild; sororia) v1

Overview
Name: Csor.00g073470
Type: gene
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: DNA glycosylase superfamily protein
Location: Csor_Chr02: 4170765 .. 4172174 (+)
RNA-Seq Expression: Csor.00g073470
Synteny: Csor.00g073470
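
The Location field above gives a start and an end coordinate on Csor_Chr02. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly), the genomic span of the gene can be computed as follows. The helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field of this record.
gene_length = span_length(4170765, 4172174)
print(gene_length)  # 1410
```

Under that assumption the gene, introns included, spans 1410 bp on the plus strand.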
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGTCGTTCCGACCAAGCCTTGGAATCCACTTCTGTCGTCCTTGATTCCAAATCCACTTCCAGTCGCCTCCTCCACCGCCGTAATTCCCTTAACAAACACCCTTCCCCCTCTCCCAACCTCACCTCCACCTCTGACAACATTCTCCTTCCGGTTGCCGCCGCTAACGGCGGCTCTCTGTCTCGCCCCCGCCCTGCCTTGGATACGAAGAAATCCAAAAGCTTCAAGCTTGGGGGAAATGGGAATGTGGTTTCTGATAATGCTGCTGAAGTCGCGTCGCCGGGGAGCATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGACGGTCTAAATCCGCCCGGTTTGAGAAAATTGTTCCCTTTGATTCTAAAATTAAAGGCGTTGTTGAAGATAGAAGATGCAGCTTCATCACTCCCAACTCAGGTACCCTTCAATCTCCCCTGTTTTTTTATTTATTTTTTAAATCTCCCCTGTTTTTTTATATTCATAAATTAAAATTTTCCTTGTCAGATCCCATTTATGTGGCCTATCATGATGAAGAATGGGGCGTTCCTGTTCATGATGACCAGTGAGCTTCTCTCTCTCTCTCTCTCTCTCTCCTTCCCTGTTTCTCTCTCCTCCATTGTTTTAATCACTGAAGCTTAAATTTGAAAACAGAGCACTGTTTGAACTGCTGGTTCTGAGTGTGGCCCAAGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGACAAGATTTCAGGTGCAAAAACAACCTTAATCGATCCGTTTAAGATTGTGGGTTGGTCTGTAATTCACTGATTTAGAGTTTTCATCTTCACCCAGAAATGCATTTTCAGATTTCGATGCAGAAGTGGTGGCGAATTTTTCCGACAGACAGATGGTTTCAATCAGCTCAGAGTATGGAATGGACATAAACAGAGTCCGAGGAGTGGTCGACAACGCAATCCGGATCCTGGAGGTAGTTTAAATGAATAATTAATATGTTCTTCCTTTTCCAAACATGAATGAATTAATTAATTAGTGGGTGATTTGTTTAATTTTTAGATTAAGAAGGAATTTGGGTCACTGGAGAAATACATTTGGGGGTTTATGAACAACAACCCATTCTCACCGCACTACAAATCCGGCCACAAAATCCCGGTCAAGACATCAAAATCAGATACCATAAGCAAAGACATGATCCGGCGAGGATTCCGGTCTGTCGGTCCGACCGTGGTCCATTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTCACCACCTGCCACAGGCACCTGCACTGCACATTAATCGCCGCCGGCCGCCGCGCTCCACCGGCGGAAGTGGAGGAGACGGCGACAGGTGCGGCAGGCTCGGAAGCTGTGTAG

mRNA sequence

ATGTGTCGTTCCGACCAAGCCTTGGAATCCACTTCTGTCGTCCTTGATTCCAAATCCACTTCCAGTCGCCTCCTCCACCGCCGTAATTCCCTTAACAAACACCCTTCCCCCTCTCCCAACCTCACCTCCACCTCTGACAACATTCTCCTTCCGGTTGCCGCCGCTAACGGCGGCTCTCTGTCTCGCCCCCGCCCTGCCTTGGATACGAAGAAATCCAAAAGCTTCAAGCTTGGGGGAAATGGGAATGTGGTTTCTGATAATGCTGCTGAAGTCGCGTCGCCGGGGAGCATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGACGGTCTAAATCCGCCCGGTTTGAGAAAATTGTTCCCTTTGATTCTAAAATTAAAGGCGTTGTTGAAGATAGAAGATGCAGCTTCATCACTCCCAACTCAGATCCCATTTATGTGGCCTATCATGATGAAGAATGGGGCGTTCCTGTTCATGATGACCAAGCACTGTTTGAACTGCTGGTTCTGAGTGTGGCCCAAGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGACAAGATTTCAGAAATGCATTTTCAGATTTCGATGCAGAAGTGGTGGCGAATTTTTCCGACAGACAGATGGTTTCAATCAGCTCAGAGTATGGAATGGACATAAACAGAGTCCGAGGAGTGGTCGACAACGCAATCCGGATCCTGGAGATTAAGAAGGAATTTGGGTCACTGGAGAAATACATTTGGGGGTTTATGAACAACAACCCATTCTCACCGCACTACAAATCCGGCCACAAAATCCCGGTCAAGACATCAAAATCAGATACCATAAGCAAAGACATGATCCGGCGAGGATTCCGGTCTGTCGGTCCGACCGTGGTCCATTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTCACCACCTGCCACAGGCACCTGCACTGCACATTAATCGCCGCCGGCCGCCGCGCTCCACCGGCGGAAGTGGAGGAGACGGCGACAGGTGCGGCAGGCTCGGAAGCTGTGTAG

Coding sequence (CDS)

ATGTGTCGTTCCGACCAAGCCTTGGAATCCACTTCTGTCGTCCTTGATTCCAAATCCACTTCCAGTCGCCTCCTCCACCGCCGTAATTCCCTTAACAAACACCCTTCCCCCTCTCCCAACCTCACCTCCACCTCTGACAACATTCTCCTTCCGGTTGCCGCCGCTAACGGCGGCTCTCTGTCTCGCCCCCGCCCTGCCTTGGATACGAAGAAATCCAAAAGCTTCAAGCTTGGGGGAAATGGGAATGTGGTTTCTGATAATGCTGCTGAAGTCGCGTCGCCGGGGAGCATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAGCAGGCGCAGAGGAAGATGAGAATTGCCCATTATGGACGGTCTAAATCCGCCCGGTTTGAGAAAATTGTTCCCTTTGATTCTAAAATTAAAGGCGTTGTTGAAGATAGAAGATGCAGCTTCATCACTCCCAACTCAGATCCCATTTATGTGGCCTATCATGATGAAGAATGGGGCGTTCCTGTTCATGATGACCAAGCACTGTTTGAACTGCTGGTTCTGAGTGTGGCCCAAGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGACAAGATTTCAGAAATGCATTTTCAGATTTCGATGCAGAAGTGGTGGCGAATTTTTCCGACAGACAGATGGTTTCAATCAGCTCAGAGTATGGAATGGACATAAACAGAGTCCGAGGAGTGGTCGACAACGCAATCCGGATCCTGGAGATTAAGAAGGAATTTGGGTCACTGGAGAAATACATTTGGGGGTTTATGAACAACAACCCATTCTCACCGCACTACAAATCCGGCCACAAAATCCCGGTCAAGACATCAAAATCAGATACCATAAGCAAAGACATGATCCGGCGAGGATTCCGGTCTGTCGGTCCGACCGTGGTCCATTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTCACCACCTGCCACAGGCACCTGCACTGCACATTAATCGCCGCCGGCCGCCGCGCTCCACCGGCGGAAGTGGAGGAGACGGCGACAGGTGCGGCAGGCTCGGAAGCTGTGTAG

Protein sequence

MCRSDQALESTSVVLDSKSTSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPVAAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
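
The protein sequence above is the conceptual translation of the CDS. The following sketch, which assumes the standard nuclear genetic code, translates the first six and the last four codons of the CDS so the result can be compared with the start and end of the protein. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGTGTCGTTCCGACCAA"  # first 18 nt of the CDS
last_codons = "GAAGCTGTGTAG"         # last 12 nt of the CDS, including the stop codon
print(translate(first_codons))  # MCRSDQ
print(translate(last_codons))   # EAV
```

Both fragments agree with the protein sequence, which begins MCRSDQ and ends EAV.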
Homology
BLAST of Csor.00g073470 vs. ExPASy Swiss-Prot
Match: Q7VG78 (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) OX=235279 GN=guaA PE=3 SV=1)

HSP 1 Score: 164.9 bits (416), Expect = 1.7e-39
Identity = 79/191 (41.36%), Positives = 118/191 (61.78%), Query Frame = 0

Query: 138 KGVVEDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDQALFELLVLSVAQVGSDWTSIL 197
           +GV E  RC++ T   +    +Y  YHD EWG P+H+D+ LFE LVL   Q G  W +IL
Sbjct: 780 EGVREKVRCAWATDKDEAARKLYEDYHDTEWGEPLHEDKKLFEHLVLEGFQAGLSWITIL 839

Query: 198 KKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINR--VRGVVDNAIRILEIKKEFG 257
           KKR+ FR AF DFD  +VAN+ + ++  +    G+  NR  +   + NA   + +++EFG
Sbjct: 840 KKREAFRVAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAIINAKAFMAVQREFG 899

Query: 258 SLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAG 317
           S +KYIWGF+   P    ++S   +P  T  SD I+KD+ +RGF+ VG T +++ MQ+ G
Sbjct: 900 SFDKYIWGFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFVGTTTMYAMMQSIG 959

Query: 318 LTNDHLTTCHR 324
           + NDHLT+C +
Sbjct: 960 MVNDHLTSCFK 970

BLAST of Csor.00g073470 vs. ExPASy Swiss-Prot
Match: P05100 (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) OX=83333 GN=tag PE=1 SV=1)

HSP 1 Score: 141.0 bits (354), Expect = 2.6e-32
Identity = 66/179 (36.87%), Positives = 106/179 (59.22%), Query Frame = 0

Query: 145 RCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAF 204
           RC ++  + DP+Y+AYHD EWGVP  D + LFE++ L   Q G  W ++LKKR+++R  F
Sbjct: 3   RCGWV--SQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRACF 62

Query: 205 SDFDAEVVANFSDRQMVSISSEYGMDINR--VRGVVDNAIRILEIKKEFGSLEKYIWGFM 264
             FD   VA   +  +  +  + G+  +R  ++ ++ NA   L++++       ++W F+
Sbjct: 63  HQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWSFV 122

Query: 265 NNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTC 322
           N+ P      +  +IP  TS SD +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Sbjct: 123 NHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVGC 179

BLAST of Csor.00g073470 vs. ExPASy Swiss-Prot
Match: P44321 (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=tag PE=3 SV=1)

HSP 1 Score: 124.8 bits (312), Expect = 2.0e-27
Identity = 63/179 (35.20%), Positives = 96/179 (53.63%), Query Frame = 0

Query: 145 RCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAF 204
           RC ++   S  IY+ YHD+EWG P  D Q LFE + L   Q G  W ++LKKR+ +R AF
Sbjct: 4   RCPWVGEQS--IYIDYHDKEWGKPEFDSQKLFEKICLEGQQAGLSWITVLKKRESYREAF 63

Query: 205 SDFDAEVVANFSDRQMVSISSEYGMDINRVR--GVVDNAIRILEIKKEFGSLEKYIWGFM 264
             FD + +A  +   + +     G+  +R +   +V NA   L ++K   +   +IW F+
Sbjct: 64  HQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAYLAMEKCGENFSDFIWSFV 123

Query: 265 NNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTC 322
           N+ P          +P KT  S  +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Sbjct: 124 NHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTCYAFMQSMGLVDDHLNDC 180

BLAST of Csor.00g073470 vs. NCBI nr
Match: KAG6605362.1 (hypothetical protein SDJN03_02679, partial [Cucurbita argyrosperma subsp. sororia] >KAG6605378.1 hypothetical protein SDJN03_02695, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 692 bits (1786), Expect = 5.35e-251
Identity = 354/354 (100.00%), Positives = 354/354 (100.00%), Query Frame = 0

Query: 1   MCRSDQALESTSVVLDSKSTSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPVAAANGGSL 60
           MCRSDQALESTSVVLDSKSTSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPVAAANGGSL
Sbjct: 1   MCRSDQALESTSVVLDSKSTSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPVAAANGGSL 60

Query: 61  SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG 120
           SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG
Sbjct: 61  SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG 120

Query: 121 RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLV 180
           RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLV
Sbjct: 121 RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLV 180

Query: 181 LSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDN 240
           LSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDN
Sbjct: 181 LSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDN 240

Query: 241 AIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG 300
           AIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG
Sbjct: 241 AIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG 300

Query: 301 PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV 354
           PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
Sbjct: 301 PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV 354

BLAST of Csor.00g073470 vs. NCBI nr
Match: XP_022947743.1 (uncharacterized protein LOC111451515 isoform X2 [Cucurbita moschata] >XP_022947919.1 uncharacterized protein LOC111451659 isoform X1 [Cucurbita moschata])

HSP 1 Score: 659 bits (1699), Expect = 6.70e-238
Identity = 340/354 (96.05%), Positives = 344/354 (97.18%), Query Frame = 0

Query: 1   MCRSDQALESTSVVLDSKSTSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPVAAANGGSL 60
           MCRSDQALESTS         +RLLHRRNSLNKHPSP+PNLTSTSD+ILLPVAA NGGSL
Sbjct: 1   MCRSDQALESTS---------NRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAA-NGGSL 60

Query: 61  SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG 120
           SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG
Sbjct: 61  SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG 120

Query: 121 RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLV 180
           RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHD+EWGVPVHDDQALFELLV
Sbjct: 121 RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLV 180

Query: 181 LSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDN 240
           LSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDN
Sbjct: 181 LSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDN 240

Query: 241 AIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG 300
           AIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG
Sbjct: 241 AIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG 300

Query: 301 PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV 354
           PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
Sbjct: 301 PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV 344

BLAST of Csor.00g073470 vs. NCBI nr
Match: XP_022947742.1 (uncharacterized protein LOC111451515 isoform X1 [Cucurbita moschata])

HSP 1 Score: 652 bits (1683), Expect = 2.22e-235
Identity = 340/359 (94.71%), Positives = 344/359 (95.82%), Query Frame = 0

Query: 1   MCRSDQALESTSVVLDSKSTSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPVAAANGGSL 60
           MCRSDQALESTS         +RLLHRRNSLNKHPSP+PNLTSTSD+ILLPVAA NGGSL
Sbjct: 1   MCRSDQALESTS---------NRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAA-NGGSL 60

Query: 61  SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG 120
           SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG
Sbjct: 61  SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG 120

Query: 121 RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNS-----DPIYVAYHDEEWGVPVHDDQAL 180
           RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNS     DPIYVAYHD+EWGVPVHDDQAL
Sbjct: 121 RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSGFSLSDPIYVAYHDQEWGVPVHDDQAL 180

Query: 181 FELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVR 240
           FELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVR
Sbjct: 181 FELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVR 240

Query: 241 GVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRG 300
           GVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRG
Sbjct: 241 GVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRG 300

Query: 301 FRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV 354
           FRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
Sbjct: 301 FRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV 349

BLAST of Csor.00g073470 vs. NCBI nr
Match: XP_023007195.1 (uncharacterized protein LOC111499758 isoform X2 [Cucurbita maxima])

HSP 1 Score: 645 bits (1664), Expect = 2.72e-232
Identity = 334/362 (92.27%), Positives = 341/362 (94.20%), Query Frame = 0

Query: 1   MCRSDQALESTSVVLDSK--------STSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPV 60
           MCRSDQALES+SVVLDSK         TS+RLLHRRNSLN+HPSPSPN+TSTSD ILLP+
Sbjct: 1   MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPL 60

Query: 61  AAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQR 120
           AA NGGSLSRPRPALD KKSKSFK GGNGNV SDN AEVASPGSIAAVRREQVALQQAQR
Sbjct: 61  AA-NGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQR 120

Query: 121 KMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD 180
           KMRIAHYGRSKSARFEKIVPFDSKIK VVE+RRCSFITPNSDPIYVAYHDEEWGVPVHDD
Sbjct: 121 KMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDD 180

Query: 181 QALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN 240
           Q LFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDA+VVANFSDRQMVSISSEYGMDIN
Sbjct: 181 QTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDIN 240

Query: 241 RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMI 300
           RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKS HKIPVKTSKSDTISKDMI
Sbjct: 241 RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMI 300

Query: 301 RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSE 354
           RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR APP EVEET TGAAGSE
Sbjct: 301 RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSE 360

BLAST of Csor.00g073470 vs. NCBI nr
Match: XP_023007198.1 (uncharacterized protein LOC111499759 isoform X2 [Cucurbita maxima])

HSP 1 Score: 643 bits (1658), Expect = 2.24e-231
Identity = 334/362 (92.27%), Positives = 340/362 (93.92%), Query Frame = 0

Query: 1   MCRSDQALESTSVVLDSK--------STSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPV 60
           MCRSDQALES+SVVLDSK         TS+RLLHRRNSLNKHPSPSPN+TSTSD ILLP+
Sbjct: 1   MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNKHPSPSPNITSTSDKILLPL 60

Query: 61  AAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQR 120
           AA NG SLSRPRPALD KKSKSFK GGNGNV  DN AEVASPGSIAAVRREQVALQQAQR
Sbjct: 61  AA-NGCSLSRPRPALDRKKSKSFKPGGNGNVGCDNVAEVASPGSIAAVRREQVALQQAQR 120

Query: 121 KMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD 180
           KMRIAHYGRSKSARFEKIVPFDSKIK VVE+RRCSFITPNSDPIYVAYHDEEWGVPVHDD
Sbjct: 121 KMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDD 180

Query: 181 QALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN 240
           Q LFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDA+VVANFSDRQMVSISSEYGMDIN
Sbjct: 181 QTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDIN 240

Query: 241 RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMI 300
           RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKS HKIPVKTSKSDTISKDMI
Sbjct: 241 RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMI 300

Query: 301 RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSE 354
           RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPP EVEET TGAAGSE
Sbjct: 301 RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPPEVEETTTGAAGSE 360

BLAST of Csor.00g073470 vs. ExPASy TrEMBL
Match: A0A6J1G8B3 (uncharacterized protein LOC111451515 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111451659 PE=4 SV=1)

HSP 1 Score: 659 bits (1699), Expect = 3.24e-238
Identity = 340/354 (96.05%), Positives = 344/354 (97.18%), Query Frame = 0

Query: 1   MCRSDQALESTSVVLDSKSTSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPVAAANGGSL 60
           MCRSDQALESTS         +RLLHRRNSLNKHPSP+PNLTSTSD+ILLPVAA NGGSL
Sbjct: 1   MCRSDQALESTS---------NRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAA-NGGSL 60

Query: 61  SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG 120
           SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG
Sbjct: 61  SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG 120

Query: 121 RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLV 180
           RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHD+EWGVPVHDDQALFELLV
Sbjct: 121 RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDQEWGVPVHDDQALFELLV 180

Query: 181 LSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDN 240
           LSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDN
Sbjct: 181 LSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVRGVVDN 240

Query: 241 AIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG 300
           AIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG
Sbjct: 241 AIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVG 300

Query: 301 PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV 354
           PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
Sbjct: 301 PTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV 344

BLAST of Csor.00g073470 vs. ExPASy TrEMBL
Match: A0A6J1G7A4 (uncharacterized protein LOC111451515 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111451515 PE=4 SV=1)

HSP 1 Score: 652 bits (1683), Expect = 1.07e-235
Identity = 340/359 (94.71%), Positives = 344/359 (95.82%), Query Frame = 0

Query: 1   MCRSDQALESTSVVLDSKSTSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPVAAANGGSL 60
           MCRSDQALESTS         +RLLHRRNSLNKHPSP+PNLTSTSD+ILLPVAA NGGSL
Sbjct: 1   MCRSDQALESTS---------NRLLHRRNSLNKHPSPTPNLTSTSDSILLPVAA-NGGSL 60

Query: 61  SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG 120
           SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG
Sbjct: 61  SRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYG 120

Query: 121 RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNS-----DPIYVAYHDEEWGVPVHDDQAL 180
           RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNS     DPIYVAYHD+EWGVPVHDDQAL
Sbjct: 121 RSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSGFSLSDPIYVAYHDQEWGVPVHDDQAL 180

Query: 181 FELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVR 240
           FELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVR
Sbjct: 181 FELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDINRVR 240

Query: 241 GVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRG 300
           GVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRG
Sbjct: 241 GVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRG 300

Query: 301 FRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV 354
           FRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV
Sbjct: 301 FRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSEAV 349

BLAST of Csor.00g073470 vs. ExPASy TrEMBL
Match: A0A6J1L2B3 (uncharacterized protein LOC111499758 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111499758 PE=4 SV=1)

HSP 1 Score: 645 bits (1664), Expect = 1.32e-232
Identity = 334/362 (92.27%), Positives = 341/362 (94.20%), Query Frame = 0

Query: 1   MCRSDQALESTSVVLDSK--------STSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPV 60
           MCRSDQALES+SVVLDSK         TS+RLLHRRNSLN+HPSPSPN+TSTSD ILLP+
Sbjct: 1   MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPL 60

Query: 61  AAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQR 120
           AA NGGSLSRPRPALD KKSKSFK GGNGNV SDN AEVASPGSIAAVRREQVALQQAQR
Sbjct: 61  AA-NGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQR 120

Query: 121 KMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD 180
           KMRIAHYGRSKSARFEKIVPFDSKIK VVE+RRCSFITPNSDPIYVAYHDEEWGVPVHDD
Sbjct: 121 KMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDD 180

Query: 181 QALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN 240
           Q LFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDA+VVANFSDRQMVSISSEYGMDIN
Sbjct: 181 QTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDIN 240

Query: 241 RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMI 300
           RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKS HKIPVKTSKSDTISKDMI
Sbjct: 241 RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMI 300

Query: 301 RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSE 354
           RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR APP EVEET TGAAGSE
Sbjct: 301 RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTGAAGSE 360

BLAST of Csor.00g073470 vs. ExPASy TrEMBL
Match: A0A6J1L721 (uncharacterized protein LOC111499759 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111499759 PE=4 SV=1)

HSP 1 Score: 643 bits (1658), Expect = 1.08e-231
Identity = 334/362 (92.27%), Positives = 340/362 (93.92%), Query Frame = 0

Query: 1   MCRSDQALESTSVVLDSK--------STSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPV 60
           MCRSDQALES+SVVLDSK         TS+RLLHRRNSLNKHPSPSPN+TSTSD ILLP+
Sbjct: 1   MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNKHPSPSPNITSTSDKILLPL 60

Query: 61  AAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQR 120
           AA NG SLSRPRPALD KKSKSFK GGNGNV  DN AEVASPGSIAAVRREQVALQQAQR
Sbjct: 61  AA-NGCSLSRPRPALDRKKSKSFKPGGNGNVGCDNVAEVASPGSIAAVRREQVALQQAQR 120

Query: 121 KMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD 180
           KMRIAHYGRSKSARFEKIVPFDSKIK VVE+RRCSFITPNSDPIYVAYHDEEWGVPVHDD
Sbjct: 121 KMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSDPIYVAYHDEEWGVPVHDD 180

Query: 181 QALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN 240
           Q LFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDA+VVANFSDRQMVSISSEYGMDIN
Sbjct: 181 QTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEYGMDIN 240

Query: 241 RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMI 300
           RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKS HKIPVKTSKSDTISKDMI
Sbjct: 241 RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTISKDMI 300

Query: 301 RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATGAAGSE 354
           RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPP EVEET TGAAGSE
Sbjct: 301 RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPPEVEETTTGAAGSE 360

BLAST of Csor.00g073470 vs. ExPASy TrEMBL
Match: A0A6J1L4A1 (uncharacterized protein LOC111499758 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111499758 PE=4 SV=1)

HSP 1 Score: 639 bits (1648), Expect = 4.35e-230
Identity = 334/367 (91.01%), Positives = 341/367 (92.92%), Query Frame = 0

Query: 1   MCRSDQALESTSVVLDSK--------STSSRLLHRRNSLNKHPSPSPNLTSTSDNILLPV 60
           MCRSDQALES+SVVLDSK         TS+RLLHRRNSLN+HPSPSPN+TSTSD ILLP+
Sbjct: 1   MCRSDQALESSSVVLDSKFNPPPLLQPTSNRLLHRRNSLNRHPSPSPNITSTSDKILLPL 60

Query: 61  AAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQR 120
           AA NGGSLSRPRPALD KKSKSFK GGNGNV SDN AEVASPGSIAAVRREQVALQQAQR
Sbjct: 61  AA-NGGSLSRPRPALDRKKSKSFKPGGNGNVGSDNVAEVASPGSIAAVRREQVALQQAQR 120

Query: 121 KMRIAHYGRSKSARFEKIVPFDSKIKGVVEDRRCSFITPNS-----DPIYVAYHDEEWGV 180
           KMRIAHYGRSKSARFEKIVPFDSKIK VVE+RRCSFITPNS     DPIYVAYHDEEWGV
Sbjct: 121 KMRIAHYGRSKSARFEKIVPFDSKIKAVVEERRCSFITPNSGFSLSDPIYVAYHDEEWGV 180

Query: 181 PVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEY 240
           PVHDDQ LFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDA+VVANFSDRQMVSISSEY
Sbjct: 181 PVHDDQTLFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAQVVANFSDRQMVSISSEY 240

Query: 241 GMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTI 300
           GMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKS HKIPVKTSKSDTI
Sbjct: 241 GMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSAHKIPVKTSKSDTI 300

Query: 301 SKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRAPPAEVEETATG 354
           SKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR APP EVEET TG
Sbjct: 301 SKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRHAPPPEVEETTTG 360

BLAST of Csor.00g073470 vs. TAIR 10
Match: AT3G12710.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 341.3 bits (874), Expect = 9.5e-94
Identity = 181/300 (60.33%), Positives = 224/300 (74.67%), Query Frame = 0

Query: 35  PSPSPNLTSTSDNILLPVAAANGGSLSRPRPALDTKKSKSFKLGGNGNVVSDNAAEVASP 94
           PS   +L   S+++       NG   ++ R +L+ KKSKSFK G +      +     +P
Sbjct: 17  PSSCNSLMDRSESLKRDSVMGNGA--AKVRGSLERKKSKSFKEGDS----YSSWLITEAP 76

Query: 95  GSIAAVRREQVALQQAQRKMRIAHYGRSKSA---RFEKIVPFDSKIKGVVEDRRCSFITP 154
           GSIAAVRREQVA QQA RK++IAHYGRSKS       K+VP  +        +RCSF+TP
Sbjct: 77  GSIAAVRREQVAAQQALRKLKIAHYGRSKSTINFTSSKVVPLLNPNPN-PHPQRCSFLTP 136

Query: 155 NSDPIYVAYHDEEWGVPVHDDQALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEV 214
            SDPIYVAYHDEEWGVPVHDD+ LFELL LS AQVGSDWTS L+KR D+R AF +F+AEV
Sbjct: 137 TSDPIYVAYHDEEWGVPVHDDKTLFELLTLSGAQVGSDWTSTLRKRHDYRKAFMEFEAEV 196

Query: 215 VANFSDRQMVSISSEYGMDINRVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHY 274
           VA  ++++M +IS EY +++++VRGVV+NA +I+EIKK F SLEKY+WGF+N+ P S +Y
Sbjct: 197 VAKLTEKEMNAISIEYKIEMSKVRGVVENAKKIVEIKKAFVSLEKYLWGFVNHKPISTNY 256

Query: 275 KSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIA 332
           K GHKIPVKTSKS++ISKDM+RRGFR VGPTVVHSFMQAAGLTNDHL TC RH  CTL+A
Sbjct: 257 KLGHKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQAAGLTNDHLITCCRHAPCTLLA 309

BLAST of Csor.00g073470 vs. TAIR 10
Match: AT5G44680.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 313.2 bits (801), Expect = 2.8e-85
Identity = 177/340 (52.06%), Positives = 227/340 (66.76%), Query Frame = 0

Query: 14  VLDSKSTSSRLLHRRNSLNKHPS----------PSPNLTSTSDNILLPVAAANGGSLSRP 73
           VL  KS     L RRNSL K P           PSP   S    ++ P  + N  SL +P
Sbjct: 22  VLQPKSNQVPTLDRRNSLKKSPPKPLNPIASKIPSPRPIS----LISPPLSPNTKSLRKP 81

Query: 74  ----RPALDTKKSKSFKL------GGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRK 133
               +  L +  +KS  +       G    V         PGSIAA RRE+VA++Q +RK
Sbjct: 82  AGSCKELLRSSSTKSKPVISPENSDGGYKEVMPMVIVQKQPGSIAAARREEVAMKQEERK 141

Query: 134 MRIAHYGRSKSARF-EKIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD 193
            +I+HYGR KS +  EK +  + + K     +RCSFIT +SDPIYVAYHD+EWGVPVHDD
Sbjct: 142 KKISHYGRIKSVKSNEKNLNVEHEKK-----KRCSFITTSSDPIYVAYHDKEWGVPVHDD 201

Query: 194 QALFELLVLSVAQVGSDWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN 253
             LFELLVL+ AQVGSDWTS+LK+R  FR AFS F+AE+VA+F+++++ SI ++YG++++
Sbjct: 202 NLLFELLVLTGAQVGSDWTSVLKRRNTFREAFSGFEAELVADFNEKKIQSIVNDYGINLS 261

Query: 254 RVRGVVDNAIRILEIKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMI 313
           +V  VVDNA +IL++K++ GS  KYIWGFM + P +  Y S  KIPVKTSKS+TISKDM+
Sbjct: 262 QVLAVVDNAKQILKVKRDLGSFNKYIWGFMKHKPVTTKYTSCQKIPVKTSKSETISKDMV 321

Query: 314 RRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA 333
           RRGFR VGPTV+HS MQAAGLTNDHL TC RHL CT +AA
Sbjct: 322 RRGFRFVGPTVIHSLMQAAGLTNDHLITCPRHLECTAMAA 352

BLAST of Csor.00g073470 vs. TAIR 10
Match: AT5G57970.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 217.2 bits (552), Expect = 2.1e-56
Identity = 98/196 (50.00%), Positives = 138/196 (70.41%), Query Frame = 0

Query: 134 DSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLVLSVAQVGSDWTSI 193
           DS   G    +RC+++TPNSDP Y+ +HDEEWGVPVHDD+ LFELLVLS A     W +I
Sbjct: 144 DSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTI 203

Query: 194 LKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN--RVRGVVDNAIRILEIKKEF 253
           L KRQ FR  F+DFD   +   ++++++   S     ++  ++R V++NA +IL++ +E+
Sbjct: 204 LSKRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEY 263

Query: 254 GSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAA 313
           GS +KYIW F+ N      ++   ++P KT K++ ISKD++RRGFRSVGPTVV+SFMQAA
Sbjct: 264 GSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAA 323

Query: 314 GLTNDHLTTCHRHLHC 328
           G+TNDHLT+C R  HC
Sbjct: 324 GITNDHLTSCFRFHHC 339

BLAST of Csor.00g073470 vs. TAIR 10
Match: AT5G57970.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 217.2 bits (552), Expect = 2.1e-56
Identity = 98/196 (50.00%), Positives = 138/196 (70.41%), Query Frame = 0

Query: 134 DSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLVLSVAQVGSDWTSI 193
           DS   G    +RC+++TPNSDP Y+ +HDEEWGVPVHDD+ LFELLVLS A     W +I
Sbjct: 144 DSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTI 203

Query: 194 LKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN--RVRGVVDNAIRILEIKKEF 253
           L KRQ FR  F+DFD   +   ++++++   S     ++  ++R V++NA +IL++ +E+
Sbjct: 204 LSKRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEY 263

Query: 254 GSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHSFMQAA 313
           GS +KYIW F+ N      ++   ++P KT K++ ISKD++RRGFRSVGPTVV+SFMQAA
Sbjct: 264 GSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAA 323

Query: 314 GLTNDHLTTCHRHLHC 328
           G+TNDHLT+C R  HC
Sbjct: 324 GITNDHLTSCFRFHHC 339

BLAST of Csor.00g073470 vs. TAIR 10
Match: AT1G75090.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 210.3 bits (534), Expect = 2.5e-54
Identity = 107/263 (40.68%), Positives = 161/263 (61.22%), Query Frame = 0

Query: 69  TKKSKSFKLGGNGNVVSDNAAEVASPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE 128
           TK   + K   N +V +D+++  +S    ++V            K        +  A   
Sbjct: 46  TKSPATKKPDSNFSVSTDDSSSSSSSSERSSVNTTNSGKVTTPSKRNGVEKLNNVVASVA 105

Query: 129 KIVPFDSKIKGVVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDQALFELLVLSVAQVGS 188
            +     KI G V  +RC +ITPNSDPIYV +HDEEWGVPV DD+ LFELLV S A    
Sbjct: 106 VVEDISPKIPGPV--KRCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEF 165

Query: 189 DWTSILKKRQDFRNAFSDFDAEVVANFSDRQMVSISSEYGMDIN--RVRGVVDNAIRILE 248
            W SIL++R DFR  F +FD   +A F++++++S+     + ++  ++R +V+NA  +L+
Sbjct: 166 SWPSILRRRDDFRKLFEEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLK 225

Query: 249 IKKEFGSLEKYIWGFMNNNPFSPHYKSGHKIPVKTSKSDTISKDMIRRGFRSVGPTVVHS 308
           +K+EFGS   Y W F+N+ P    Y+ G ++PVK+ K++ ISKDM++RGFR VGPTV++S
Sbjct: 226 VKQEFGSFSNYCWRFVNHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYS 285

Query: 309 FMQAAGLTNDHLTTCHRHLHCTL 330
           F+QA+G+ NDHLT C R+  C +
Sbjct: 286 FLQASGIVNDHLTACFRYQECNV 306

The following BLAST results are available for this feature:

vs. ExPASy Swiss-Prot
Match Name      E-value    Identity (%)  Description
Q7VG78          1.7e-39    41.36         Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ...
P05100          2.6e-32    36.87         DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) OX=83333 GN=t...
P44321          2.0e-27    35.20         DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D...

vs. NCBI nr
Match Name      E-value    Identity (%)  Description
KAG6605362.1    5.35e-251  100.00        hypothetical protein SDJN03_02679, partial [Cucurbita argyrosperma subsp. sorori...
XP_022947743.1  6.70e-238  96.05         uncharacterized protein LOC111451515 isoform X2 [Cucurbita moschata] >XP_0229479...
XP_022947742.1  2.22e-235  94.71         uncharacterized protein LOC111451515 isoform X1 [Cucurbita moschata]
XP_023007195.1  2.72e-232  92.27         uncharacterized protein LOC111499758 isoform X2 [Cucurbita maxima]
XP_023007198.1  2.24e-231  92.27         uncharacterized protein LOC111499759 isoform X2 [Cucurbita maxima]

vs. ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A6J1G8B3      3.24e-238  96.05         uncharacterized protein LOC111451515 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1G7A4      1.07e-235  94.71         uncharacterized protein LOC111451515 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1L2B3      1.32e-232  92.27         uncharacterized protein LOC111499758 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1L721      1.08e-231  92.27         uncharacterized protein LOC111499759 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1L4A1      4.35e-230  91.01         uncharacterized protein LOC111499758 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...

vs. TAIR 10
Match Name      E-value    Identity (%)  Description
AT3G12710.1     9.5e-94    60.33         DNA glycosylase superfamily protein
AT5G44680.1     2.8e-85    52.06         DNA glycosylase superfamily protein
AT5G57970.1     2.1e-56    50.00         DNA glycosylase superfamily protein
AT5G57970.2     2.1e-56    50.00         DNA glycosylase superfamily protein
AT1G75090.1     2.5e-54    40.68         DNA glycosylase superfamily protein

InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR Term   IPR Description            Source       Source Term     Source Description                   Alignment
None       No IPR available           GENE3D       1.10.340.30     Hypothetical protein; domain 2       coord: 143..325; e-value: 6.2E-64; score: 216.9
None       No IPR available           MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 55..74
None       No IPR available           MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 1..45
None       No IPR available           PANTHER      PTHR31116:SF20  DNA GLYCOSYLASE SUPERFAMILY PROTEIN  coord: 13..332
None       No IPR available           PANTHER      PTHR31116       OS04G0501200 PROTEIN                 coord: 13..332
IPR005019  Methyladenine glycosylase  PFAM         PF03352         Adenine_glyco                        coord: 152..324; e-value: 2.1E-60; score: 203.5
IPR011257  DNA glycosylase            SUPERFAMILY  48150           DNA-glycosylase                      coord: 144..327

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Csor.00g073470.m01  Csor.00g073470.m01  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006284      base-excision repair
biological_process  GO:0006281      DNA repair
molecular_function  GO:0008725      DNA-3-methyladenine glycosylase activity
molecular_function  GO:0003824      catalytic activity