CSPI06G10900 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI06G10900
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: DNA glycosylase superfamily protein
Location: Chr6: 9510827 .. 9513383 (-)
RNA-Seq Expression: CSPI06G10900
Synteny: CSPI06G10900
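
The location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr6: 9510827 .. 9513383 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.match(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Chr6: 9510827 .. 9513383 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 2557 bases on the minus strand of Chr6.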
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAGGATGCATCCAACTTCTTCTCTCATCCAATTCCTTTATAAAACCCATCTCTCTCCCATTCCACCTCTCTTCACTAATTCCCATTTCCCAATTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTAATTCACCAAAACTAAAAACCAAAAACTAAAAAAACGATGTGTCGTTCCGAGGAGACCTTGGAAGCCACTTCTGTCGTGGTCGATTCAAAATTCAATTCCCGTCCTGTCCTTCAACCCACTGGTAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACAACACCCTTCTCTCAAACCCCCTTCCGCCGCCGCCGTCTCCCCCACTTCTCCTAAATCCAAATCCCCCCGTCCTCCGGCCACCAAACGAGCCAATGACGGTAATAATCCCATGAACTCCAGCTCCGAGAAGATCCTCATTCCGGCCGCAGTGTCACGGCCCAGAGCTACGTTGGATAGAAAGAAATCGAAAAGCTTTAAACTGGGTGGAAATGGGAATGTGATTTGTGATAATGGTGGGTTTGAGGTGGCGTATGCTTCTTCTTTGATTACTGAATCGCCAGGAAGTATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAACAGGCGCAGAGGAAAATGAGAATTGCTCATTACGGAAGATCTAAATCTGCTCGTTTTGAAAAAATCGTTCCTCTTGATTCTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGGTAACTAAATCAAAAAAGGAATTACATTCTTTCTTTCTTTCTTTCTTTATATATATACATACATACACAATTTTATTTACTCTGTTCATCATATATATTCATAAATTTAACTTAAATTCTCTCTGTTTTTAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGTGTCCCTGTTCATGATGACAAGTGAGTTTCTCTCTCCATTTTTTTTTTTTTTTTTTTTTTTGAGTTCCAACTCCAAACCCATCAAACCTTCGATTCAACAAAGAGAGTTCTAAACTCTATTGAACTAAACTTAATTCATTAATTTAAACTTTTTAATTTAGTGGTAATTCATTTTTTAAAAAATGTATCTAATCTTCAGGCTCCCCTTTCCATATTGCAGCCAAGTATTTTCCATTTAAATTCTTAGTTCAATTTCAATTTAGTATGAAGTAAACTTAATTTGAGCATGAACGCAAACAGCATTTTTTAACCCTGTGATTTATCCCTCAGTTAATATTAACTTTGACTAGTTACCTTATTTTTGTAATTTTTTAAATTACAACATCAATATAATTTGAACTCACTTAAATTTTTCGAATACAGGATGCTGTTTGAATTGCTGGTTCTAAGTGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTCTGAAGAAACGTCAAGATTTCAGGTACCCAAAAACACAAACAATTTCCAATTAATTATTTCTCCCTCTCTCTTTTTTAATTAATTTCTGAATTCACTAATTTAATTTATATTGTTCAGGAATGCATTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTTTCGGACAAACAAATGGTTTCAATCAGCACAGAATATGGCATCGACATCAACAGAGTCCGAGGAGTTGTGGATAACGCAATTCGAATCCTCCAGGTAATTAAACCTAATAGTTAATTAGTAAAATCCAAATCCATTGACTTAATTAATAAATATATTTTAATCACTTCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAACCATTCTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAGACATCAAAATCAGAAACCATAAGCAAAGACATGGTCAGACGAGGTTTCCGGTCCGTCGGACCCACGGTGGTTCACTCCTTCATGCAAGCCGCCGGTCTAACCAACGACCATCTCACCACTTGCCACAGGCACCTCCATTGCACCTTAACCGCCGCCGGCCGCCGTACTCC
GGCTCCGACGACGACAACACCCGAAGTGGAGGATACGGCGGCGGTTTGTGAAACACTCTAGAATTGACTCGAGAATTTTAATTAACAGACAAAAAGAAAAAATGATAACCTTTACGAGGAGTCAATCAACCATGATTTGCTTGCTAATTAACTAGATAACTAAATATATATCTTTGGGTTTTTCTTTTGTGGGGTTTGTGTATATTATAAAAAAATAGACTTGTAAGAGAAAAAGAAAAAAAAAAGAGATTGTGGGGTTGTGAATTTGTGTGGTTTTTTTTCTTTCTTTTCTTTTTTGTGGGAATTTTAGTGAAAGTGTTTTATATAATTAGAAGAAAAAAAGAAAAAAAGAAGAAGGTTATTTGAAGTGGTAGGGTTAGAATAGGAGACAGAGAGCATGTGCTTGTGCAATTGGGAGGCAATGGCATGTGAGTTTGTGTCAGTTTGCTTTTGTAAATTCCCATGTGATCCATCTAAATTTCAAACATTATTAATCATTATTATCTTATTATTATTCTTTTATTTTCAACACCCTGTTTCCTTTGCTTTGCTTTATT

mRNA sequence

GAGGATGCATCCAACTTCTTCTCTCATCCAATTCCTTTATAAAACCCATCTCTCTCCCATTCCACCTCTCTTCACTAATTCCCATTTCCCAATTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTAATTCACCAAAACTAAAAACCAAAAACTAAAAAAACGATGTGTCGTTCCGAGGAGACCTTGGAAGCCACTTCTGTCGTGGTCGATTCAAAATTCAATTCCCGTCCTGTCCTTCAACCCACTGGTAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACAACACCCTTCTCTCAAACCCCCTTCCGCCGCCGCCGTCTCCCCCACTTCTCCTAAATCCAAATCCCCCCGTCCTCCGGCCACCAAACGAGCCAATGACGGTAATAATCCCATGAACTCCAGCTCCGAGAAGATCCTCATTCCGGCCGCAGTGTCACGGCCCAGAGCTACGTTGGATAGAAAGAAATCGAAAAGCTTTAAACTGGGTGGAAATGGGAATGTGATTTGTGATAATGGTGGGTTTGAGGTGGCGTATGCTTCTTCTTTGATTACTGAATCGCCAGGAAGTATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAACAGGCGCAGAGGAAAATGAGAATTGCTCATTACGGAAGATCTAAATCTGCTCGTTTTGAAAAAATCGTTCCTCTTGATTCTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGTGTCCCTGTTCATGATGACAAGATGCTGTTTGAATTGCTGGTTCTAAGTGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTCTGAAGAAACGTCAAGATTTCAGGAATGCATTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTTTCGGACAAACAAATGGTTTCAATCAGCACAGAATATGGCATCGACATCAACAGAGTCCGAGGAGTTGTGGATAACGCAATTCGAATCCTCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAACCATTCTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAGACATCAAAATCAGAAACCATAAGCAAAGACATGGTCAGACGAGGTTTCCGGTCCGTCGGACCCACGGTGGTTCACTCCTTCATGCAAGCCGCCGGTCTAACCAACGACCATCTCACCACTTGCCACAGGCACCTCCATTGCACCTTAACCGCCGCCGGCCGCCGTACTCCGGCTCCGACGACGACAACACCCGAAGTGGAGGATACGGCGGCGGTTTGTGAAACACTCTAGAATTGACTCGAGAATTTTAATTAACAGACAAAAAGAAAAAATGATAACCTTTACGAGGAGTCAATCAACCATGATTTGCTTGCTAATTAACTAGATAACTAAATATATATCTTTGGGTTTTTCTTTTGTGGGGTTTGTGTATATTATAAAAAAATAGACTTGTAAGAGAAAAAGAAAAAAAAAAGAGATTGTGGGGTTGTGAATTTGTGTGGTTTTTTTTCTTTCTTTTCTTTTTTGTGGGAATTTTAGTGAAAGTGTTTTATATAATTAGAAGAAAAAAAGAAAAAAAGAAGAAGGTTATTTGAAGTGGTAGGGTTAGAATAGGAGACAGAGAGCATGTGCTTGTGCAATTGGGAGGCAATGGCATGTGAGTTTGTGTCAGTTTGCTTTTGTAAATTCCCATGTGATCCATCTAAATTTCAAACATTATTAATCATTATTATCTTATTATTATTCTTTTATTTTCAACACCCTGTTTCCTTTGCTTTGCTTTATT

Coding sequence (CDS)

ATGTGTCGTTCCGAGGAGACCTTGGAAGCCACTTCTGTCGTGGTCGATTCAAAATTCAATTCCCGTCCTGTCCTTCAACCCACTGGTAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACAACACCCTTCTCTCAAACCCCCTTCCGCCGCCGCCGTCTCCCCCACTTCTCCTAAATCCAAATCCCCCCGTCCTCCGGCCACCAAACGAGCCAATGACGGTAATAATCCCATGAACTCCAGCTCCGAGAAGATCCTCATTCCGGCCGCAGTGTCACGGCCCAGAGCTACGTTGGATAGAAAGAAATCGAAAAGCTTTAAACTGGGTGGAAATGGGAATGTGATTTGTGATAATGGTGGGTTTGAGGTGGCGTATGCTTCTTCTTTGATTACTGAATCGCCAGGAAGTATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAACAGGCGCAGAGGAAAATGAGAATTGCTCATTACGGAAGATCTAAATCTGCTCGTTTTGAAAAAATCGTTCCTCTTGATTCTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGTGTCCCTGTTCATGATGACAAGATGCTGTTTGAATTGCTGGTTCTAAGTGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTCTGAAGAAACGTCAAGATTTCAGGAATGCATTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTTTCGGACAAACAAATGGTTTCAATCAGCACAGAATATGGCATCGACATCAACAGAGTCCGAGGAGTTGTGGATAACGCAATTCGAATCCTCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAACCATTCTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAGACATCAAAATCAGAAACCATAAGCAAAGACATGGTCAGACGAGGTTTCCGGTCCGTCGGACCCACGGTGGTTCACTCCTTCATGCAAGCCGCCGGTCTAACCAACGACCATCTCACCACTTGCCACAGGCACCTCCATTGCACCTTAACCGCCGCCGGCCGCCGTACTCCGGCTCCGACGACGACAACACCCGAAGTGGAGGATACGGCGGCGGTTTGTGAAACACTCTAG

Protein sequence

MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNGGFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLTAAGRRTPAPTTTTPEVEDTAAVCETL*
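
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the helper `translate` is hypothetical, and only the first 30 and last 12 bases of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G
# at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 bases and last 12 bases of the CDS listed above.
cds_start = "ATGTGTCGTTCCGAGGAGACCTTGGAAGCC"
cds_end = "GAAACACTCTAG"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

The trailing `*` in the protein sequence corresponds to the terminal TAG stop codon of the CDS.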
Homology
BLAST of CSPI06G10900 vs. ExPASy Swiss-Prot
Match: Q7VG78 (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) OX=235279 GN=guaA PE=3 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 3.9e-40
Identity = 82/187 (43.85%), Positives = 116/187 (62.03%), Query Frame = 0

Query: 183 EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQ 242
           E  RC++ T   +    +Y  YHD EWG P+H+DK LFE LVL   Q G  W +ILKKR+
Sbjct: 784 EKVRCAWATDKDEAARKLYEDYHDTEWGEPLHEDKKLFEHLVLEGFQAGLSWITILKKRE 843

Query: 243 DFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINR--VRGVVDNAIRILQIKKEFGSFDK 302
            FR AF  FD  IVAN+ + ++  +    GI  NR  +   + NA   + +++EFGSFDK
Sbjct: 844 AFRVAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAIINAKAFMAVQREFGSFDK 903

Query: 303 YIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTND 362
           YIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ ND
Sbjct: 904 YIWGFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFVGTTTMYAMMQSIGMVND 963

Query: 363 HLTTCHR 365
           HLT+C +
Sbjct: 964 HLTSCFK 970
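
The percentages in an HSP header are simple ratios over the alignment length. As a hedged illustration using the counts from the HSP above (82 identical and 116 positive-scoring positions over 187 aligned columns), the following sketch recomputes them; the function name `percent` is hypothetical.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the Q7VG78 HSP above.
identity_pct = percent(82, 187)
positives_pct = percent(116, 187)
print(identity_pct, positives_pct)
```

The alignment length (187) counts every aligned column, including gap columns, which is why it can exceed the number of query residues covered.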

BLAST of CSPI06G10900 vs. ExPASy Swiss-Prot
Match: P05100 (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) OX=83333 GN=tag PE=1 SV=1)

HSP 1 Score: 147.1 bits (370), Expect = 4.1e-34
Identity = 71/179 (39.66%), Positives = 110/179 (61.45%), Query Frame = 0

Query: 186 RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAF 245
           RC ++  + DP+Y+AYHD EWGVP  D K LFE++ L   Q G  W ++LKKR+++R  F
Sbjct: 3   RCGWV--SQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRACF 62

Query: 246 SSFDSEIVANFSDKQMVSISTEYGIDINR--VRGVVDNAIRILQIKKEFGSFDKYIWGFV 305
             FD   VA   ++ +  +  + GI  +R  ++ ++ NA   LQ+++    F  ++W FV
Sbjct: 63  HQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWSFV 122

Query: 306 NNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC 363
           N++P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Sbjct: 123 NHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVGC 179

BLAST of CSPI06G10900 vs. ExPASy Swiss-Prot
Match: P44321 (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=tag PE=3 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 2.6e-28
Identity = 65/179 (36.31%), Positives = 99/179 (55.31%), Query Frame = 0

Query: 186 RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAF 245
           RC ++   S  IY+ YHD+EWG P  D + LFE + L   Q G  W ++LKKR+ +R AF
Sbjct: 4   RCPWVGEQS--IYIDYHDKEWGKPEFDSQKLFEKICLEGQQAGLSWITVLKKRESYREAF 63

Query: 246 SSFDSEIVANFSDKQMVSISTEYGIDINRVR--GVVDNAIRILQIKKEFGSFDKYIWGFV 305
             FD + +A  +   + +     G+  +R +   +V NA   L ++K   +F  +IW FV
Sbjct: 64  HQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAYLAMEKCGENFSDFIWSFV 123

Query: 306 NNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC 363
           N+KP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Sbjct: 124 NHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTCYAFMQSMGLVDDHLNDC 180

BLAST of CSPI06G10900 vs. ExPASy TrEMBL
Match: A0A0A0KED6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G134890 PE=4 SV=1)

HSP 1 Score: 776.9 bits (2005), Expect = 3.9e-221
Identity = 396/397 (99.75%), Positives = 396/397 (99.75%), Query Frame = 0

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60
           MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK
Sbjct: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG 120
           SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG
Sbjct: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG 120

Query: 121 GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP 180
           GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP
Sbjct: 121 GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP 180

Query: 181 AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD 240
           AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD
Sbjct: 181 AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD 240

Query: 241 FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW 300
           FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW
Sbjct: 241 FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW 300

Query: 301 GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT 360
           GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT
Sbjct: 301 GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT 360

Query: 361 TCHRHLHCTLTAAGRRTPAPTTTTPEVEDTAAVCETL 398
           TCHRHLHCTL AAGRRTPAPTTTTPEVEDTAAVCETL
Sbjct: 361 TCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL 397

BLAST of CSPI06G10900 vs. ExPASy TrEMBL
Match: A0A5A7UM21 (Putative GMP synthase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold104G00320 PE=4 SV=1)

HSP 1 Score: 753.4 bits (1944), Expect = 4.7e-214
Identity = 388/399 (97.24%), Positives = 392/399 (98.25%), Query Frame = 0

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPS-AAAVSPTSP 60
           MCRSEE LEATSVVVDSKFNSRPVLQPT NRVLDRRNSLKKQHPSLKPPS AAAVSPTSP
Sbjct: 1   MCRSEEALEATSVVVDSKFNSRPVLQPTCNRVLDRRNSLKKQHPSLKPPSPAAAVSPTSP 60

Query: 61  KSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDN 120
           KSKSPRPPATKRANDGNNPMNSSSEKILIPAA SRPRATLDRKKSKSFKLGGNGNVICDN
Sbjct: 61  KSKSPRPPATKRANDGNNPMNSSSEKILIPAAASRPRATLDRKKSKSFKLGGNGNVICDN 120

Query: 121 GGFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIK 180
           GGFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIK
Sbjct: 121 GGFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIK 180

Query: 181 PAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQ 240
           P+VEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQ
Sbjct: 181 PSVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQ 240

Query: 241 DFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYI 300
           DFRNAFSSFDSEIVANFS+KQMVSISTEYGIDINRVRGVVDN+IRILQIKKEFGSFDKYI
Sbjct: 241 DFRNAFSSFDSEIVANFSEKQMVSISTEYGIDINRVRGVVDNSIRILQIKKEFGSFDKYI 300

Query: 301 WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHL 360
           WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHL
Sbjct: 301 WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHL 360

Query: 361 TTCHRHLHCTLTAAGRRTPAPTTTTPEV-EDTAAVCETL 398
           TTCHRHLHCTL AAGRRTPAPTTTTPEV EDTAAVC+ L
Sbjct: 361 TTCHRHLHCTLIAAGRRTPAPTTTTPEVEEDTAAVCQKL 399

BLAST of CSPI06G10900 vs. ExPASy TrEMBL
Match: A0A6J1FSP1 (uncharacterized protein LOC111448434 OS=Cucurbita moschata OX=3662 GN=LOC111448434 PE=4 SV=1)

HSP 1 Score: 651.0 bits (1678), Expect = 3.3e-183
Identity = 348/406 (85.71%), Positives = 366/406 (90.15%), Query Frame = 0

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60
           MCRSE+ LEATSVVVDSKF +RPVLQPT NRVLDRRNSLKK       P +AAVSPTSPK
Sbjct: 1   MCRSEQALEATSVVVDSKFTARPVLQPTCNRVLDRRNSLKK-------PPSAAVSPTSPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSEKILIP-AAVSRPRATLDRKKSKSFKLGGNGN-VICD 120
           SKSPRPPATKRAND  NPMNSSS+KILIP AA+SRP+A LDRKKSKSFKL GNGN VICD
Sbjct: 61  SKSPRPPATKRAND-TNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICD 120

Query: 121 N----GGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKI 180
           N    GGFEVA   YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+
Sbjct: 121 NVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKV 180

Query: 181 VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 240
           VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW
Sbjct: 181 VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 240

Query: 241 TSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKE 300
           TSILKKRQDFRNAFSSF +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL+IKKE
Sbjct: 241 TSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKE 300

Query: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 360
           FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQA
Sbjct: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVVHSFMQA 360

Query: 361 AGLTNDHLTTCHRHLHCTLTAAGRRTPAPTTTTPEVEDTAAVCETL 398
           AGLTNDHLT+CHRHLHC++TAA RR PA       VE+T    ETL
Sbjct: 361 AGLTNDHLTSCHRHLHCSITAADRRAPAVV-----VEETTTASETL 393

BLAST of CSPI06G10900 vs. ExPASy TrEMBL
Match: A0A6J1J7H3 (uncharacterized protein LOC111484173 OS=Cucurbita maxima OX=3661 GN=LOC111484173 PE=4 SV=1)

HSP 1 Score: 646.4 bits (1666), Expect = 8.0e-182
Identity = 345/406 (84.98%), Positives = 365/406 (89.90%), Query Frame = 0

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60
           MCRSE+ LEATSVVVDSKF +RPVLQPT NRVLDRRNSLKK       P +AAVSPTSPK
Sbjct: 1   MCRSEQALEATSVVVDSKFTARPVLQPTCNRVLDRRNSLKK-------PPSAAVSPTSPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSEKILIP-AAVSRPRATLDRKKSKSFKLGGNGN-VICD 120
           SKSPRPPATKRAN+  NPMNSSS+KILIP AA+SRP+A LDRKKSKSFKL GNGN VICD
Sbjct: 61  SKSPRPPATKRANE-TNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICD 120

Query: 121 N----GGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKI 180
           N    GGFEVA   YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+
Sbjct: 121 NVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKV 180

Query: 181 VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 240
           VPLDSKIKPAVE RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW
Sbjct: 181 VPLDSKIKPAVEHRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 240

Query: 241 TSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKE 300
           TSILKKRQDFRNAFSSF +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL+IKKE
Sbjct: 241 TSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKE 300

Query: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 360
           FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQ 
Sbjct: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVVHSFMQG 360

Query: 361 AGLTNDHLTTCHRHLHCTLTAAGRRTPAPTTTTPEVEDTAAVCETL 398
           AGLTNDHLT+CHRHLHC++TAAGRR PA       VE+T    E+L
Sbjct: 361 AGLTNDHLTSCHRHLHCSITAAGRRAPAVV-----VEETTTASESL 393

BLAST of CSPI06G10900 vs. ExPASy TrEMBL
Match: A0A6J1D778 (uncharacterized protein LOC111017989 OS=Momordica charantia OX=3673 GN=LOC111017989 PE=4 SV=1)

HSP 1 Score: 599.0 bits (1543), Expect = 1.5e-167
Identity = 323/391 (82.61%), Positives = 344/391 (87.98%), Query Frame = 0

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60
           MCRSE+ +EATSVV       R VLQPT NR L RRNSLKKQ PS  PP  +  SP SPK
Sbjct: 1   MCRSEQVMEATSVVA----VGRAVLQPTCNR-LHRRNSLKKQPPSPSPP-LSPPSPASPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG 120
           SKSPRPPATKRAND    MNSSS+K+++PAA +RPRA LDRKKSKSFKLGG+G    D  
Sbjct: 61  SKSPRPPATKRANDAATAMNSSSDKLVLPAA-ARPRA-LDRKKSKSFKLGGSG---ADEA 120

Query: 121 GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP 180
              ++YASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKIVP+DSK KP
Sbjct: 121 APSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKIVPIDSKTKP 180

Query: 181 AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD 240
           AVEDRRCSFITPNSDPIYVAYHDEEWGVPVH+DK+LFELLVLSVAQVGSDWTSILKKRQD
Sbjct: 181 AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDWTSILKKRQD 240

Query: 241 FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW 300
           FRNAFSSFD+E VANFSDKQMVSISTEYGIDINRVRGVVDNAIRIL+IKKEFGSFDKYIW
Sbjct: 241 FRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKEFGSFDKYIW 300

Query: 301 GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT 360
           GFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT
Sbjct: 301 GFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT 360

Query: 361 TCHRHLHCTLTAAGRRTPAPTTTTPEVEDTA 392
           +CHRHL CTL AAGRR P       EVE+T+
Sbjct: 361 SCHRHLRCTLLAAGRRAPPAV----EVEETS 376

BLAST of CSPI06G10900 vs. NCBI nr
Match: XP_004139917.2 (uncharacterized protein LOC101218536 [Cucumis sativus] >KGN46782.1 hypothetical protein Csa_020741 [Cucumis sativus])

HSP 1 Score: 776.9 bits (2005), Expect = 8.2e-221
Identity = 396/397 (99.75%), Positives = 396/397 (99.75%), Query Frame = 0

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60
           MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK
Sbjct: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG 120
           SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG
Sbjct: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG 120

Query: 121 GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP 180
           GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP
Sbjct: 121 GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP 180

Query: 181 AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD 240
           AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD
Sbjct: 181 AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD 240

Query: 241 FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW 300
           FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW
Sbjct: 241 FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW 300

Query: 301 GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT 360
           GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT
Sbjct: 301 GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT 360

Query: 361 TCHRHLHCTLTAAGRRTPAPTTTTPEVEDTAAVCETL 398
           TCHRHLHCTL AAGRRTPAPTTTTPEVEDTAAVCETL
Sbjct: 361 TCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL 397

BLAST of CSPI06G10900 vs. NCBI nr
Match: KAA0054725.1 (putative GMP synthase [Cucumis melo var. makuwa] >TYJ95615.1 putative GMP synthase [Cucumis melo var. makuwa])

HSP 1 Score: 753.4 bits (1944), Expect = 9.7e-214
Identity = 388/399 (97.24%), Positives = 392/399 (98.25%), Query Frame = 0

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPS-AAAVSPTSP 60
           MCRSEE LEATSVVVDSKFNSRPVLQPT NRVLDRRNSLKKQHPSLKPPS AAAVSPTSP
Sbjct: 1   MCRSEEALEATSVVVDSKFNSRPVLQPTCNRVLDRRNSLKKQHPSLKPPSPAAAVSPTSP 60

Query: 61  KSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDN 120
           KSKSPRPPATKRANDGNNPMNSSSEKILIPAA SRPRATLDRKKSKSFKLGGNGNVICDN
Sbjct: 61  KSKSPRPPATKRANDGNNPMNSSSEKILIPAAASRPRATLDRKKSKSFKLGGNGNVICDN 120

Query: 121 GGFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIK 180
           GGFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIK
Sbjct: 121 GGFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIK 180

Query: 181 PAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQ 240
           P+VEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQ
Sbjct: 181 PSVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQ 240

Query: 241 DFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYI 300
           DFRNAFSSFDSEIVANFS+KQMVSISTEYGIDINRVRGVVDN+IRILQIKKEFGSFDKYI
Sbjct: 241 DFRNAFSSFDSEIVANFSEKQMVSISTEYGIDINRVRGVVDNSIRILQIKKEFGSFDKYI 300

Query: 301 WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHL 360
           WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHL
Sbjct: 301 WGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHL 360

Query: 361 TTCHRHLHCTLTAAGRRTPAPTTTTPEV-EDTAAVCETL 398
           TTCHRHLHCTL AAGRRTPAPTTTTPEV EDTAAVC+ L
Sbjct: 361 TTCHRHLHCTLIAAGRRTPAPTTTTPEVEEDTAAVCQKL 399

BLAST of CSPI06G10900 vs. NCBI nr
Match: XP_038902889.1 (uncharacterized protein LOC120089476 [Benincasa hispida])

HSP 1 Score: 704.5 bits (1817), Expect = 5.1e-199
Identity = 376/403 (93.30%), Positives = 383/403 (95.04%), Query Frame = 0

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSA--AAVSPTS 60
           MCRSEE LEA++VVVDSKFN+RPVLQPT NRVLDRRNSLKKQ PSLKPPSA  AAVSPTS
Sbjct: 1   MCRSEEALEASTVVVDSKFNARPVLQPTCNRVLDRRNSLKKQ-PSLKPPSAAVAAVSPTS 60

Query: 61  PKSKSPRPPATKRANDGNNPMNSSSEKILIPAA------VSRPRATLDRKKSKSFKLGGN 120
           PKSKSPRPPATKRANDGNNPMNSSS+KILIPAA      VSRPRATLDRKKSKSFKLGGN
Sbjct: 61  PKSKSPRPPATKRANDGNNPMNSSSDKILIPAATNGGGSVSRPRATLDRKKSKSFKLGGN 120

Query: 121 GN-VICDNGGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF 180
           GN VICDNGG+EVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF
Sbjct: 121 GNVVICDNGGYEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF 180

Query: 181 EKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVG 240
           EKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVG
Sbjct: 181 EKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVG 240

Query: 241 SDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQI 300
           SDWTSILKKRQDFRNAFSSFDSEIVA FSDKQMVSIS+EYGIDINRVRGVVDNAIRILQI
Sbjct: 241 SDWTSILKKRQDFRNAFSSFDSEIVAVFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI 300

Query: 301 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSF 360
           KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSF
Sbjct: 301 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSF 360

Query: 361 MQAAGLTNDHLTTCHRHLHCTLTAAGRRTPAPTTTTPEVEDTA 392
           MQAAGLTNDHLTTCHRHLHCTL AAGRRT A TTTT EVE+TA
Sbjct: 361 MQAAGLTNDHLTTCHRHLHCTLIAAGRRTTA-TTTTTEVEETA 401

BLAST of CSPI06G10900 vs. NCBI nr
Match: XP_022943791.1 (uncharacterized protein LOC111448434 [Cucurbita moschata])

HSP 1 Score: 651.0 bits (1678), Expect = 6.7e-183
Identity = 348/406 (85.71%), Positives = 366/406 (90.15%), Query Frame = 0

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60
           MCRSE+ LEATSVVVDSKF +RPVLQPT NRVLDRRNSLKK       P +AAVSPTSPK
Sbjct: 1   MCRSEQALEATSVVVDSKFTARPVLQPTCNRVLDRRNSLKK-------PPSAAVSPTSPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSEKILIP-AAVSRPRATLDRKKSKSFKLGGNGN-VICD 120
           SKSPRPPATKRAND  NPMNSSS+KILIP AA+SRP+A LDRKKSKSFKL GNGN VICD
Sbjct: 61  SKSPRPPATKRAND-TNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICD 120

Query: 121 N----GGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKI 180
           N    GGFEVA   YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+
Sbjct: 121 NVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKV 180

Query: 181 VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 240
           VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW
Sbjct: 181 VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 240

Query: 241 TSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKE 300
           TSILKKRQDFRNAFSSF +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL+IKKE
Sbjct: 241 TSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKE 300

Query: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 360
           FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQA
Sbjct: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVVHSFMQA 360

Query: 361 AGLTNDHLTTCHRHLHCTLTAAGRRTPAPTTTTPEVEDTAAVCETL 398
           AGLTNDHLT+CHRHLHC++TAA RR PA       VE+T    ETL
Sbjct: 361 AGLTNDHLTSCHRHLHCSITAADRRAPAVV-----VEETTTASETL 393

BLAST of CSPI06G10900 vs. NCBI nr
Match: KAG6570606.1 (hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 649.8 bits (1675), Expect = 1.5e-182
Identity = 347/406 (85.47%), Positives = 366/406 (90.15%), Query Frame = 0

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60
           MCRSE+ LEAT+VVVDSKF +RPVLQPT NRVLDRRNSLKK       P +AAVSPTSPK
Sbjct: 1   MCRSEQALEATAVVVDSKFTARPVLQPTCNRVLDRRNSLKK-------PPSAAVSPTSPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSEKILIP-AAVSRPRATLDRKKSKSFKLGGNGN-VICD 120
           SKSPRPPATKRAND  NPMNSSS+KILIP AA+SRP+A LDRKKSKSFKL GNGN VICD
Sbjct: 61  SKSPRPPATKRAND-TNPMNSSSDKILIPAAALSRPKAALDRKKSKSFKLAGNGNVVICD 120

Query: 121 N----GGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKI 180
           N    GGFEVA   YASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF+K+
Sbjct: 121 NVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFDKV 180

Query: 181 VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 240
           VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW
Sbjct: 181 VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 240

Query: 241 TSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKE 300
           TSILKKRQDFRNAFSSF +E VA FSDKQM+SIS+EYGIDINRVRGVVDNAIRIL+IKKE
Sbjct: 241 TSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEIKKE 300

Query: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 360
           FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSFMQA
Sbjct: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVVHSFMQA 360

Query: 361 AGLTNDHLTTCHRHLHCTLTAAGRRTPAPTTTTPEVEDTAAVCETL 398
           AGLTNDHLT+CHRHLHC++TAA RR PA       VE+T    ETL
Sbjct: 361 AGLTNDHLTSCHRHLHCSITAADRRAPAVV-----VEETTTASETL 393

BLAST of CSPI06G10900 vs. TAIR 10
Match: AT3G12710.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 357.5 bits (916), Expect = 1.4e-98
Identity = 186/283 (65.72%), Positives = 220/283 (77.74%), Query Frame = 0

Query: 93  SRPRATLDRKKSKSFKLGGNGNVICDNGGFEVAYASSLITESPGSIAAVRREQVALQQAQ 152
           ++ R +L+RKKSKSFK G              +Y+S LITE+PGSIAAVRREQVA QQA 
Sbjct: 41  AKVRGSLERKKSKSFKEGD-------------SYSSWLITEAPGSIAAVRREQVAAQQAL 100

Query: 153 RKMRIAHYGRSKSA---RFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVP 212
           RK++IAHYGRSKS       K+VPL +   P    +RCSF+TP SDPIYVAYHDEEWGVP
Sbjct: 101 RKLKIAHYGRSKSTINFTSSKVVPLLNP-NPNPHPQRCSFLTPTSDPIYVAYHDEEWGVP 160

Query: 213 VHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYG 272
           VHDDK LFELL LS AQVGSDWTS L+KR D+R AF  F++E+VA  ++K+M +IS EY 
Sbjct: 161 VHDDKTLFELLTLSGAQVGSDWTSTLRKRHDYRKAFMEFEAEVVAKLTEKEMNAISIEYK 220

Query: 273 IDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETIS 332
           I++++VRGVV+NA +I++IKK F S +KY+WGFVN+KP S  YK GHKIPVKTSKSE+IS
Sbjct: 221 IEMSKVRGVVENAKKIVEIKKAFVSLEKYLWGFVNHKPISTNYKLGHKIPVKTSKSESIS 280

Query: 333 KDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLTA 373
           KDMVRRGFR VGPTVVHSFMQAAGLTNDHL TC RH  CTL A
Sbjct: 281 KDMVRRGFRFVGPTVVHSFMQAAGLTNDHLITCCRHAPCTLLA 309

BLAST of CSPI06G10900 vs. TAIR 10
Match: AT5G44680.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 344.4 bits (882), Expect = 1.3e-94
Identity = 192/360 (53.33%), Positives = 254/360 (70.56%), Query Frame = 0

Query: 17  SKFNSRPVLQPTGNRV--LDRRNSLKKQHPSLKPPSAAAVSPTSPKSKSPRPPATKRAND 76
           S+ N RPVLQP  N+V  LDRRNSLKK  P  KP     ++P + K  SPRP +      
Sbjct: 15  SQINGRPVLQPKSNQVPTLDRRNSLKKSPP--KP-----LNPIASKIPSPRPISLI---- 74

Query: 77  GNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNGGFEVAYASSLITES 136
            + P++ +++ +  PA   +        KSK      N      +GG++      ++ + 
Sbjct: 75  -SPPLSPNTKSLRKPAGSCKELLRSSSTKSKPVISPEN-----SDGGYKEVMPMVIVQKQ 134

Query: 137 PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPLDSKIKPAVEDRRCSFITPN 196
           PGSIAA RRE+VA++Q +RK +I+HYGR KS +  EK + ++ + K     +RCSFIT +
Sbjct: 135 PGSIAAARREEVAMKQEERKKKISHYGRIKSVKSNEKNLNVEHEKK-----KRCSFITTS 194

Query: 197 SDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIV 256
           SDPIYVAYHD+EWGVPVHDD +LFELLVL+ AQVGSDWTS+LK+R  FR AFS F++E+V
Sbjct: 195 SDPIYVAYHDKEWGVPVHDDNLLFELLVLTGAQVGSDWTSVLKRRNTFREAFSGFEAELV 254

Query: 257 ANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYK 316
           A+F++K++ SI  +YGI++++V  VVDNA +IL++K++ GSF+KYIWGF+ +KP + +Y 
Sbjct: 255 ADFNEKKIQSIVNDYGINLSQVLAVVDNAKQILKVKRDLGSFNKYIWGFMKHKPVTTKYT 314

Query: 317 SGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLTAA 374
           S  KIPVKTSKSETISKDMVRRGFR VGPTV+HS MQAAGLTNDHL TC RHL CT  AA
Sbjct: 315 SCQKIPVKTSKSETISKDMVRRGFRFVGPTVIHSLMQAAGLTNDHLITCPRHLECTAMAA 352

BLAST of CSPI06G10900 vs. TAIR 10
Match: AT5G57970.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 223.4 bits (568), Expect = 3.2e-58
Identity = 104/197 (52.79%), Positives = 141/197 (71.57%), Query Frame = 0

Query: 174 LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTS 233
           LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +
Sbjct: 143 LDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPT 202

Query: 234 ILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILQIKKE 293
           IL KRQ FR  F+ FD   +   ++K+++   +     ++  ++R V++NA +IL++ +E
Sbjct: 203 ILSKRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEE 262

Query: 294 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 353
           +GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQA
Sbjct: 263 YGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQA 322

Query: 354 AGLTNDHLTTCHRHLHC 369
           AG+TNDHLT+C R  HC
Sbjct: 323 AGITNDHLTSCFRFHHC 339

BLAST of CSPI06G10900 vs. TAIR 10
Match: AT5G57970.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 223.4 bits (568), Expect = 3.2e-58
Identity = 104/197 (52.79%), Positives = 141/197 (71.57%), Query Frame = 0

Query: 174 LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTS 233
           LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +
Sbjct: 143 LDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPT 202

Query: 234 ILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILQIKKE 293
           IL KRQ FR  F+ FD   +   ++K+++   +     ++  ++R V++NA +IL++ +E
Sbjct: 203 ILSKRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEE 262

Query: 294 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 353
           +GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQA
Sbjct: 263 YGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQA 322

Query: 354 AGLTNDHLTTCHRHLHC 369
           AG+TNDHLT+C R  HC
Sbjct: 323 AGITNDHLTSCFRFHHC 339

BLAST of CSPI06G10900 vs. TAIR 10
Match: AT1G75090.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 216.9 bits (551), Expect = 3.0e-56
Identity = 103/202 (50.99%), Positives = 142/202 (70.30%), Query Frame = 0

Query: 185 RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNA 244
           +RC +ITPNSDPIYV +HDEEWGVPV DDK LFELLV S A     W SIL++R DFR  
Sbjct: 119 KRCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEFSWPSILRRRDDFRKL 178

Query: 245 FSSFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILQIKKEFGSFDKYIWGF 304
           F  FD   +A F++K+++S+     + ++  ++R +V+NA  +L++K+EFGSF  Y W F
Sbjct: 179 FEEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLKVKQEFGSFSNYCWRF 238

Query: 305 VNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC 364
           VN+KP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT C
Sbjct: 239 VNHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYSFLQASGIVNDHLTAC 298

Query: 365 HRHLHCTLTAAGRRTPAPTTTT 385
            R+  C +    R T +  T T
Sbjct: 299 FRYQECNVETE-RETKSHETET 319
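
The Identity and Positives percentages in the HSP headers above are plain ratios of matching columns to alignment length. The following is a minimal sketch of that arithmetic, not the BLAST implementation; the helper names `count_identities` and `percent` are hypothetical, and gap columns are assumed never to count as identities.

```python
def count_identities(query: str, subject: str) -> int:
    """Count aligned columns where both rows carry the same residue.

    Assumption: '-' marks a gap and never counts as an identity.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query, subject))


def percent(matches: int, alignment_length: int) -> float:
    """Express matches / alignment_length as a percentage with two decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Last alignment row of the AT5G57970.1 hit (query 354..369 row above).
row_identities = count_identities("AGLTNDHLTTCHRHLHC", "AGITNDHLTSCFRFHHC")

# Whole-HSP figures copied from the header line: Identity = 104/197.
hsp_identity = percent(104, 197)  # 52.79, as reported in the header
```

Summing `count_identities` over every row of an HSP and dividing by the total number of aligned columns (gaps included) should reproduce the header figure, under the assumptions stated above.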

The following BLAST results are available for this feature:
Match Name  E-value  Identity (%)  Description
Q7VG78      3.9e-40  43.85         Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ...
P05100      4.1e-34  39.66         DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) OX=83333 GN=t...
P44321      2.6e-28  36.31         DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D...
Match Name  E-value   Identity (%)  Description
A0A0A0KED6  3.9e-221  99.75         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G134890 PE=4 SV=1
A0A5A7UM21  4.7e-214  97.24         Putative GMP synthase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold10...
A0A6J1FSP1  3.3e-183  85.71         uncharacterized protein LOC111448434 OS=Cucurbita moschata OX=3662 GN=LOC1114484...
A0A6J1J7H3  8.0e-182  84.98         uncharacterized protein LOC111484173 OS=Cucurbita maxima OX=3661 GN=LOC111484173...
A0A6J1D778  1.5e-167  82.61         uncharacterized protein LOC111017989 OS=Momordica charantia OX=3673 GN=LOC111017...
Match Name      E-value   Identity (%)  Description
XP_004139917.2  8.2e-221  99.75         uncharacterized protein LOC101218536 [Cucumis sativus] >KGN46782.1 hypothetical ...
KAA0054725.1    9.7e-214  97.24         putative GMP synthase [Cucumis melo var. makuwa] >TYJ95615.1 putative GMP syntha...
XP_038902889.1  5.1e-199  93.30         uncharacterized protein LOC120089476 [Benincasa hispida]
XP_022943791.1  6.7e-183  85.71         uncharacterized protein LOC111448434 [Cucurbita moschata]
KAG6570606.1    1.5e-182  85.47         hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sorori...
Match Name   E-value  Identity (%)  Description
AT3G12710.1  1.4e-98  65.72         DNA glycosylase superfamily protein
AT5G44680.1  1.3e-94  53.33         DNA glycosylase superfamily protein
AT5G57970.1  3.2e-58  52.79         DNA glycosylase superfamily protein
AT5G57970.2  3.2e-58  52.79         DNA glycosylase superfamily protein
AT1G75090.1  3.0e-56  50.99         DNA glycosylase superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description            Source       Source Term     Source Description                   Alignment
None       No IPR available           GENE3D       1.10.340.30     Hypothetical protein; domain 2       coord: 184..366; e-value: 4.3E-65; score: 220.7
None       No IPR available           MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 17..33
None       No IPR available           MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 17..102
None       No IPR available           MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 73..87
None       No IPR available           PANTHER      PTHR31116       OS04G0501200 PROTEIN                 coord: 1..373
None       No IPR available           PANTHER      PTHR31116:SF20  DNA GLYCOSYLASE SUPERFAMILY PROTEIN  coord: 1..373
IPR005019  Methyladenine glycosylase  PFAM         PF03352         Adenine_glyco                        coord: 193..365; e-value: 1.7E-61; score: 207.1
IPR011257  DNA glycosylase            SUPERFAMILY  48150           DNA-glycosylase                      coord: 185..368
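
The `coord` values in the Alignment column can be compared directly to see how the domain predictions nest. The sketch below is illustrative only: the helper names are hypothetical, and it assumes the coordinates are 1-based, inclusive residue positions on the protein, which this page does not state explicitly.

```python
import re
from typing import Tuple

Span = Tuple[int, int]


def parse_coord(text: str) -> Span:
    """Extract (start, end) from a string such as 'coord: 193..365'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate pair found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span: Span) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


def contains(outer: Span, inner: Span) -> bool:
    """True when inner lies entirely within outer."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Values copied from the InterPro table above.
pfam = parse_coord("coord: 193..365")         # PF03352 (Adenine_glyco)
superfamily = parse_coord("coord: 185..368")  # SUPERFAMILY 48150
gene3d = parse_coord("coord: 184..366")       # GENE3D 1.10.340.30

pfam_within_superfamily = contains(superfamily, pfam)
pfam_within_gene3d = contains(gene3d, pfam)
```

Under these assumptions the Pfam hit spans 173 residues and falls inside both the SUPERFAMILY and GENE3D predictions.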

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI06G10900.1  CSPI06G10900.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006284      base-excision repair
biological_process  GO:0006281      DNA repair
molecular_function  GO:0008725      DNA-3-methyladenine glycosylase activity
molecular_function  GO:0003824      catalytic activity