Cucsa.118710 (gene) Cucumber (Gy14) v1

NameCucsa.118710
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionDNA-3-methyladenine glycosylase, putative
Locationscaffold00998 : 587903 .. 590339 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTTCACTAATTCCCATTTCCCAATTCTCTCTCTCTCTCTCTCTCTTAaTtCACCAAAaCTAAAAaCCAAAAaCTAAAAAAaCGATGTGTCGTTCCGAGGAGACCTTGGAAGCCACTTCTGTCGTGGTCGATTCAAAATTCAATTCCCGTCCTGTCCTTCAACCCACTGGTAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACAACACCCTTCTCTCAAACCCCCTTCCGCCGCCGCCGTCTCCCCCACTTCTCCTAAATCCAAATCCCCCCGTCCTCCGGCCACCAAACGAGCCAATGACGGTAATAATCCCATGAACTCCAGCTCCGAGAAGATCCTCATTCCGGCCGCAGTGTCACGGCCCAGAGCTACGTTGGATAGAAAGAAATCGAAAAGCTTTAAACTGGGTGGAAATGGGAATGTGATTTGTGATAATGGTGGGTTTGAGGTGGCGTATGCTTCTTCTTTGATTACTGAATCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAACAGGCGCAGAGGAAAATGAGAATTGCTCATTACGGAAGATCTAAATCTGCTCGTTTTGAAAAAATCGTTCCTCTTGATTCTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGGTAACTAAATCAAAAAAaGAaTTACATTCTTTCTTTCTTTCTTTCTTTCTTTCTTTATATATATACATACATACACAATTTTATTTACTCTGTTCATCATATATATTCATAAATTTAACTTAAATTCTCTCTGTTTTTAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGTGTCCCTGTTCATGATGACAAGTGAGTTTCTCTCTCCATTTTTtCCTTTTTttttttttGAGTTCCAACTCCAAACCCATCAAACCTTCGATTCAACAAAGAGAGTTCTAAACTCTATTGAACTAAACTTAATTCATTAATTTAAACTTTTTAATTTAGTGGTAATTCATTTTTTAAAAAATGTATCTAATCTTCAGGCTCCCCTTTCCATATTGCAGCCAAGTATTTTCCATTTAAATTCTTAGTTCAATTTCAATTTAGTATGAAGTAAACTTAATTTGAGCATGAACGCAAACTCAGCATTTTTTAACCCTGTGATTTATCCCTCAATTAATATTAACTTTGACTAGTTACCTTATTTTTGTAATTTTTAAAATTACAACATCAATATAATTTGAACTCACTTAAATTTTTCGAATACAGGATGCTGTTTGAATTGCTGGTTCTAAGTGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTCTGAAGAAACGTCAAGATTTCAGGTACCCAAAAACACAAACAATTTCCAATTAATTATTTCTCCCTCTCTCTTTTTtAATTAATTTCTGATTTCACTAATTTAATTTATATTGTTCAGGAATGCATTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTTTCGGACAAACAAATGGTTTCAATCAGCACAGAATATGGCATCGACATCAACAGAGTCCGAGGAGTTGTGGATAACGCAATTCGAATCCTCCAGGTAATTAAACCTAATAGTTAATTAGTAAAATCCAAATCCATTGACTTAATTAATAAATATATTTTAATCACTTCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAACCATTCTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAGACATCAAAATCAGAAACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCCGTCGGACCCACGGTGGTTCACTCCTTCATGCAAGCCGCCGGTCTAACCAACGACCATCTCACCACTTGCCACAGGCACCTCCATTGCACCTTAATCGCCGCCGGCCGCCGTACTCCGGCTCCGACGACGACAACACCCGAAGTGGAGGATACGGCGGCGGTTTGTGAAACACTCTAGAATTGACTCGAGAATTTTAATTAACAGACAAAAAGAAAAAaTGATAACCTTTACGAGGAGTCAATCAACCATGATTTGCTTGCTAATTAACTAGATAACTAAATATATATCTTTGGGTTTTtCTTTTGTGGGgTTTGTGTATATTATAAAAAaTAGACTTGTAAGAGAAAAaGAAAAAAAAaGAGATTGTGGGGTTGTGAATTTGTGTGGTTTTTTTtCTTTCTTTTTTTTTtGTGGGAATTTTAGTGAAAGTGTTTTATATAATTAGAAGAGAAAAaGAAAAAAaGAAGAAGGTTATTTGAAGTGGTAGGGTTAGAATAGGAGACAGAGAGCATGTGCTTGTGCAATTGGGAGGCAATGGCATGTGAGTTTGTGTCAGTTTGCTTTTGTAAATTCCCATGTGATCCATCTAAATTTCAAACATTATTAATCATTATTATCTTATT

mRNA sequence

TCTTCACTAATTCCCATTTCCCAATtctctctctctctctctctctTAATTCACCAAAACTAAAAACCAAAAACTAAAAAAACGATGTGTCGTTCCGAGGAGACCTTGGAAGCCACTTCTGTCGTGGTCGATTCAAAATTCAATTCCCGTCCTGTCCTTCAACCCACTGGTAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACAACACCCTTCTCTCAAACCCCCTTCCGCCGCCGCCGTCTCCCCCACTTCTCCTAAATCCAAATCCCCCCGTCCTCCGGCCACCAAACGAGCCAATGACGGTAATAATCCCATGAACTCCAGCTCCGAGAAGATCCTCATTCCGGCCGCAGTGTCACGGCCCAGAGCTACGTTGGATAGAAAGAAATCGAAAAGCTTTAAACTGGGTGGAAATGGGAATGTGATTTGTGATAATGGTGGGTTTGAGGTGGCGTATGCTTCTTCTTTGATTACTGAATCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAACAGGCGCAGAGGAAAATGAGAATTGCTCATTACGGAAGATCTAAATCTGCTCGTTTTGAAAAAATCGTTCCTCTTGATTCTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGTGTCCCTGTTCATGATGACAAGATGCTGTTTGAATTGCTGGTTCTAAGTGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTCTGAAGAAACGTCAAGATTTCAGGAATGCATTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTTTCGGACAAACAAATGGTTTCAATCAGCACAGAATATGGCATCGACATCAACAGAGTCCGAGGAGTTGTGGATAACGCAATTCGAATCCTCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAACCATTCTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAGACATCAAAATCAGAAACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCCGTCGGACCCACGGTGGTTCACTCCTTCATGCAAGCCGCCGGTCTAACCAACGACCATCTCACCACTTGCCACAGGCACCTCCATTGCACCTTAATCGCCGCCGGCCGCCGTACTCCGGCTCCGACGACGACAACACCCGAAGTGGAGGATACGGCGGCGGTTTGTGAAACACTCTAGAATTGACTCGAGAATTTTAATTAACAGACAAAAAGAAAAAATGATAACCTTTACGAGGAGTCAATCAACCATGATTTGCTTGCTAATTAACTAGATAACTAAATATATATCTTTGGGTTTTTCTTTTGTGGGGTTTGTGTATATTATAAAAAATAGACTTGTAAGAGAAAAAGAAAAAAAAAGAGATTGTGGGGTTGTGAATTTGTGTGGTTTTTTTTCTTTCTTTTTTTTTTGTGGGAATTTTAGTGAAAGTGTTTTATATAATTAGAAGAGAAAAAGAAAAAAAGAAGAAGGTTATTTGAAGTGGTAGGGTTAGAATAGGAGACAGAGAGCATGTGCTTGTGCAATTGGGAGGCAATGGCATGTGAGTTTGTGTCAGTTTGCTTTTGTAAATTCCCATGTGATCCATCTAAATTTCAAACATTATTAATCATTATTATCTTATT

Coding sequence (CDS)

ATGTGTCGTTCCGAGGAGACCTTGGAAGCCACTTCTGTCGTGGTCGATTCAAAATTCAATTCCCGTCCTGTCCTTCAACCCACTGGTAACCGTGTCCTCGACCGCCGTAATTCCCTCAAAAAACAACACCCTTCTCTCAAACCCCCTTCCGCCGCCGCCGTCTCCCCCACTTCTCCTAAATCCAAATCCCCCCGTCCTCCGGCCACCAAACGAGCCAATGACGGTAATAATCCCATGAACTCCAGCTCCGAGAAGATCCTCATTCCGGCCGCAGTGTCACGGCCCAGAGCTACGTTGGATAGAAAGAAATCGAAAAGCTTTAAACTGGGTGGAAATGGGAATGTGATTTGTGATAATGGTGGGTTTGAGGTGGCGTATGCTTCTTCTTTGATTACTGAATCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAACAGGTGGCGCTGCAACAGGCGCAGAGGAAAATGAGAATTGCTCATTACGGAAGATCTAAATCTGCTCGTTTTGAAAAAATCGTTCCTCTTGATTCTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGTGTCCCTGTTCATGATGACAAGATGCTGTTTGAATTGCTGGTTCTAAGTGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTCTGAAGAAACGTCAAGATTTCAGGAATGCATTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTTTCGGACAAACAAATGGTTTCAATCAGCACAGAATATGGCATCGACATCAACAGAGTCCGAGGAGTTGTGGATAACGCAATTCGAATCCTCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAACCATTCTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAGACATCAAAATCAGAAACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCCGTCGGACCCACGGTGGTTCACTCCTTCATGCAAGCCGCCGGTCTAACCAACGACCATCTCACCACTTGCCACAGGCACCTCCATTGCACCTTAATCGCCGCCGGCCGCCGTACTCCGGCTCCGACGACGACAACACCCGAAGTGGAGGATACGGCGGCGGTTTGTGAAACACTCTAG

Protein sequence

MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNGGFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL*
BLAST of Cucsa.118710 vs. Swiss-Prot
Match: GUAA_HELHP (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) GN=guaA PE=3 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 6.4e-40
Identity = 82/187 (43.85%), Postives = 116/187 (62.03%), Query Frame = 1

Query: 183 EDRRCSFITPNSDP---IYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQ 242
           E  RC++ T   +    +Y  YHD EWG P+H+DK LFE LVL   Q G  W +ILKKR+
Sbjct: 784 EKVRCAWATDKDEAARKLYEDYHDTEWGEPLHEDKKLFEHLVLEGFQAGLSWITILKKRE 843

Query: 243 DFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINR--VRGVVDNAIRILQIKKEFGSFDK 302
            FR AF  FD  IVAN+ + ++  +    GI  NR  +   + NA   + +++EFGSFDK
Sbjct: 844 AFRVAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAIINAKAFMAVQREFGSFDK 903

Query: 303 YIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTND 362
           YIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ ND
Sbjct: 904 YIWGFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFVGTTTMYAMMQSIGMVND 963

Query: 363 HLTTCHR 365
           HLT+C +
Sbjct: 964 HLTSCFK 970

BLAST of Cucsa.118710 vs. Swiss-Prot
Match: 3MG1_ECOLI (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 6.8e-34
Identity = 71/179 (39.66%), Postives = 110/179 (61.45%), Query Frame = 1

Query: 186 RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAF 245
           RC +++   DP+Y+AYHD EWGVP  D K LFE++ L   Q G  W ++LKKR+++R  F
Sbjct: 3   RCGWVS--QDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRACF 62

Query: 246 SSFDSEIVANFSDKQMVSISTEYGIDINR--VRGVVDNAIRILQIKKEFGSFDKYIWGFV 305
             FD   VA   ++ +  +  + GI  +R  ++ ++ NA   LQ+++    F  ++W FV
Sbjct: 63  HQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWSFV 122

Query: 306 NNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC 363
           N++P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Sbjct: 123 NHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVGC 179

BLAST of Cucsa.118710 vs. Swiss-Prot
Match: 3MGA_HAEIN (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=tag PE=3 SV=1)

HSP 1 Score: 126.7 bits (317), Expect = 5.6e-28
Identity = 65/179 (36.31%), Postives = 99/179 (55.31%), Query Frame = 1

Query: 186 RCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAF 245
           RC ++   S  IY+ YHD+EWG P  D + LFE + L   Q G  W ++LKKR+ +R AF
Sbjct: 4   RCPWVGEQS--IYIDYHDKEWGKPEFDSQKLFEKICLEGQQAGLSWITVLKKRESYREAF 63

Query: 246 SSFDSEIVANFSDKQMVSISTEYGIDINRVR--GVVDNAIRILQIKKEFGSFDKYIWGFV 305
             FD + +A  +   + +     G+  +R +   +V NA   L ++K   +F  +IW FV
Sbjct: 64  HQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAYLAMEKCGENFSDFIWSFV 123

Query: 306 NNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC 363
           N+KP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Sbjct: 124 NHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTCYAFMQSMGLVDDHLNDC 180

BLAST of Cucsa.118710 vs. TrEMBL
Match: A0A0A0KED6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G134890 PE=4 SV=1)

HSP 1 Score: 793.9 bits (2049), Expect = 9.1e-227
Identity = 397/397 (100.00%), Postives = 397/397 (100.00%), Query Frame = 1

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60
           MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK
Sbjct: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG 120
           SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG
Sbjct: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG 120

Query: 121 GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP 180
           GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP
Sbjct: 121 GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP 180

Query: 181 AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD 240
           AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD
Sbjct: 181 AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD 240

Query: 241 FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW 300
           FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW
Sbjct: 241 FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW 300

Query: 301 GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT 360
           GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT
Sbjct: 301 GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT 360

Query: 361 TCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL 398
           TCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL
Sbjct: 361 TCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL 397

BLAST of Cucsa.118710 vs. TrEMBL
Match: V7BVU6_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_005G045900g PE=4 SV=1)

HSP 1 Score: 508.1 bits (1307), Expect = 1.0e-140
Identity = 276/386 (71.50%), Postives = 307/386 (79.53%), Query Frame = 1

Query: 11  TSVVVDS--KFNSRPVLQPTGNRV--LDRRNSLKKQHP--SLKPPSAAAVS------PTS 70
           TS V+ S  + N RPVLQPT NRV  L+RRNS+KK  P  SL PPS    S      P S
Sbjct: 22  TSTVMPSVARINGRPVLQPTCNRVPNLERRNSIKKVQPPKSLSPPSPPLSSKTSLTPPVS 81

Query: 71  PKSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICD 130
           PKSKSPR PA KR ND NN +N+S EKI IP + S+   TL+RKKSKSFK G      C 
Sbjct: 82  PKSKSPRLPAVKRGND-NNGLNTSYEKIAIPKSSSKA-PTLERKKSKSFKEGS-----CA 141

Query: 131 NGGFEVA--YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDS 190
               E +  YASSLIT+SPGSIAAVRREQ+ALQQAQRKM+IAHYGRSKSA+FE++VPLD 
Sbjct: 142 PASTEASFSYASSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFERVVPLDP 201

Query: 191 KI-----KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 250
                  KP  E++RCSFIT NSDPIY+AYHDEEWGVPVHDDKMLFELLVLS AQVGSDW
Sbjct: 202 STTTLTSKPTEEEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDW 261

Query: 251 TSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKE 310
           TS LKKRQDFR AFS FD+E VAN +DKQM+SIS+EYGIDI+RVRGVVDNA +IL+IKK+
Sbjct: 262 TSTLKKRQDFRAAFSDFDAETVANLTDKQMMSISSEYGIDISRVRGVVDNANQILEIKKD 321

Query: 311 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 370
           FGSFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDMVRRG+R VGPTVVHSFMQA
Sbjct: 322 FGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDMVRRGYRFVGPTVVHSFMQA 381

Query: 371 AGLTNDHLTTCHRHLHCTLIAAGRRT 378
           AGLTNDHL TCHRHL CTL+AA   T
Sbjct: 382 AGLTNDHLITCHRHLQCTLLAARPHT 400

BLAST of Cucsa.118710 vs. TrEMBL
Match: A0A151QRD8_CAJCA (Putative GMP synthase [glutamine-hydrolyzing] OS=Cajanus cajan GN=KK1_046337 PE=4 SV=1)

HSP 1 Score: 507.7 bits (1306), Expect = 1.3e-140
Identity = 274/397 (69.02%), Postives = 311/397 (78.34%), Query Frame = 1

Query: 4   SEETLEATSVVVDSKFNSRPVLQPTGNRV--LDRRNSLKKQHP--SLKPPS------AAA 63
           S  T   T+    ++ N RPVLQPT NRV  L+RRNS+KK  P  SL PPS      A+ 
Sbjct: 14  STTTTTTTTTPSVARINGRPVLQPTCNRVPSLERRNSIKKVAPPKSLSPPSPPLPSKASL 73

Query: 64  VSPTSPKSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNG 123
             P SPKSKSPR PATKR ND NN +N SSEKI+IP + S    TL+RKKSKSFK G   
Sbjct: 74  TPPVSPKSKSPRLPATKRGND-NNGLNLSSEKIVIPRS-STKAPTLERKKSKSFKEGS-- 133

Query: 124 NVICDNGGFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP 183
              C +    ++Y+SSLIT+SPGSIAAVRREQ+ALQQAQRKM+IAHYGRSKSA+FE++VP
Sbjct: 134 ---CASIEASLSYSSSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFERVVP 193

Query: 184 LDSKI-----KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVG 243
           LD        KP  E++RCSFIT NSDPIY+AYHDEEWGVPVHDDKMLFELLVLS AQVG
Sbjct: 194 LDPSTTTLTSKPTEEEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVG 253

Query: 244 SDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQI 303
           SDWTS LKKR DFR AFS FD+E VAN +DKQM+ IS+EYGID+++VRGVVDNA +IL+I
Sbjct: 254 SDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMCISSEYGIDMSKVRGVVDNANQILEI 313

Query: 304 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSF 363
           KK+FGSFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDMVRRGFR VGPTVVHSF
Sbjct: 314 KKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDMVRRGFRFVGPTVVHSF 373

Query: 364 MQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAPTTTTP 386
           MQA+GLTNDHL TCHRHL CTL+AA    P  TT  P
Sbjct: 374 MQASGLTNDHLITCHRHLQCTLLAA---KPHSTTIEP 400

BLAST of Cucsa.118710 vs. TrEMBL
Match: M5X1J5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006139mg PE=4 SV=1)

HSP 1 Score: 507.7 bits (1306), Expect = 1.3e-140
Identity = 283/425 (66.59%), Postives = 319/425 (75.06%), Query Frame = 1

Query: 1   MCRSEETL----EATSVVVDSKFNSRPVLQPTGNRV--LDRRNSLKKQHPSLKPP----- 60
           MC S+  +    E T +V  ++ N RPVLQPT NRV  LDRRNS+KK      PP     
Sbjct: 1   MCSSKAKVTIGVEVTPMV--ARINGRPVLQPTCNRVPSLDRRNSIKKISTPRAPPPPPLP 60

Query: 61  --SAAAVSPT-------------SPKSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSR 120
             SA++ SP              SPKSKSPRPPA KR ND N  +NSSSEK++ P   +R
Sbjct: 61  TSSASSTSPRISNKASSLLTPPISPKSKSPRPPAIKRGNDPNG-LNSSSEKVVTPGGTTR 120

Query: 121 PRATLDRKKSKSFKLGGNG--NVICD--------NGGFE--------VAYASSLITESPG 180
            +  L+RKKSKSFK    G      D         GGF         ++Y+SSLITE+PG
Sbjct: 121 AKI-LERKKSKSFKRASVGVDGASADLHHHGDFSAGGFSSSLNIEASLSYSSSLITEAPG 180

Query: 181 SIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDS----KIKPAVEDRRCSFITP 240
           SIAAVRREQ+ALQ AQRKMRIAHYGRSKSA FE++VP+D+    + K A E++RCSFIT 
Sbjct: 181 SIAAVRREQMALQHAQRKMRIAHYGRSKSANFERVVPVDASGNIEAKGAEEEKRCSFITA 240

Query: 241 NSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEI 300
           NSDPIYVAYHDEEWGVPVHDDKMLFELLVLS AQVGSDWTSILKKRQDFRNAFS FD+EI
Sbjct: 241 NSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRNAFSDFDAEI 300

Query: 301 VANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQY 360
           VANF+DKQMVSI +EYGIDI+RVRGVVDN+ RIL+IKKEFGSFDKYIWGFVN KP SPQY
Sbjct: 301 VANFTDKQMVSIGSEYGIDISRVRGVVDNSNRILEIKKEFGSFDKYIWGFVNQKPISPQY 360

Query: 361 KSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIA 378
           K G+KIPVKTSKSE+ISKDMVRRGFR VGPTVVHSFMQA+GLTNDHL TCHRHL CTL+A
Sbjct: 361 KLGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLITCHRHLQCTLLA 420

BLAST of Cucsa.118710 vs. TrEMBL
Match: A0A0B2QMF8_GLYSO (Putative GMP synthase [glutamine-hydrolyzing] OS=Glycine soja GN=glysoja_026565 PE=4 SV=1)

HSP 1 Score: 507.3 bits (1305), Expect = 1.7e-140
Identity = 277/395 (70.13%), Postives = 312/395 (78.99%), Query Frame = 1

Query: 1   MCRSEETLEATSVVVDSK-----FNSRPVLQPTGNRV--LDRRNSLKKQHP--SLKPPSA 60
           MC S+   + T+VV  +K      N RPVLQPT NRV  L+RRNS+KK  P  SL PPS 
Sbjct: 1   MCSSKT--KVTAVVAAAKPSVARINGRPVLQPTCNRVPNLERRNSIKKVAPPKSLSPPSP 60

Query: 61  AAVS------PTSPKSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSK 120
              S      P SPK KSPR PATKR ND NN +NSS EKI+IP + S    TL+RKKSK
Sbjct: 61  PLPSKTSLTPPVSPKLKSPRLPATKRGND-NNGLNSSYEKIVIPRS-STKTPTLERKKSK 120

Query: 121 SFKLGGNGNVICDNGGFE--VAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRS 180
           SFK G      C +   E  ++Y+SSLIT+SPGSIAAVRREQ+ALQQAQRKM+IAHYGRS
Sbjct: 121 SFKEGS-----CVSASIEASLSYSSSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRS 180

Query: 181 KSARFEKIVPLDSK-----IKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFE 240
           KSA+FE++VPLD        KP  E++RCSFITPNSDPIY+AYHDEEWGVPVHDDKMLFE
Sbjct: 181 KSAKFERVVPLDPSNTSLASKPTEEEKRCSFITPNSDPIYIAYHDEEWGVPVHDDKMLFE 240

Query: 241 LLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGV 300
           LLVLS AQVGSDWTS LKKR DFR AFS FD+E VAN +DKQM+SIS+EYGIDI+RVRGV
Sbjct: 241 LLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMSISSEYGIDISRVRGV 300

Query: 301 VDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFR 360
           VDNA +IL+IKK+FGSFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDMVRRGFR
Sbjct: 301 VDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDMVRRGFR 360

Query: 361 SVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA 374
            VGPTVVHSFMQ +GLTNDHL TCHRHL CTL+AA
Sbjct: 361 FVGPTVVHSFMQTSGLTNDHLITCHRHLQCTLLAA 386

BLAST of Cucsa.118710 vs. TAIR10
Match: AT3G12710.1 (AT3G12710.1 DNA glycosylase superfamily protein)

HSP 1 Score: 361.7 bits (927), Expect = 5.8e-100
Identity = 186/283 (65.72%), Postives = 221/283 (78.09%), Query Frame = 1

Query: 93  SRPRATLDRKKSKSFKLGGNGNVICDNGGFEVAYASSLITESPGSIAAVRREQVALQQAQ 152
           ++ R +L+RKKSKSFK G +             Y+S LITE+PGSIAAVRREQVA QQA 
Sbjct: 41  AKVRGSLERKKSKSFKEGDS-------------YSSWLITEAPGSIAAVRREQVAAQQAL 100

Query: 153 RKMRIAHYGRSKSA---RFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVP 212
           RK++IAHYGRSKS       K+VPL +   P    +RCSF+TP SDPIYVAYHDEEWGVP
Sbjct: 101 RKLKIAHYGRSKSTINFTSSKVVPLLNP-NPNPHPQRCSFLTPTSDPIYVAYHDEEWGVP 160

Query: 213 VHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYG 272
           VHDDK LFELL LS AQVGSDWTS L+KR D+R AF  F++E+VA  ++K+M +IS EY 
Sbjct: 161 VHDDKTLFELLTLSGAQVGSDWTSTLRKRHDYRKAFMEFEAEVVAKLTEKEMNAISIEYK 220

Query: 273 IDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETIS 332
           I++++VRGVV+NA +I++IKK F S +KY+WGFVN+KP S  YK GHKIPVKTSKSE+IS
Sbjct: 221 IEMSKVRGVVENAKKIVEIKKAFVSLEKYLWGFVNHKPISTNYKLGHKIPVKTSKSESIS 280

Query: 333 KDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIA 373
           KDMVRRGFR VGPTVVHSFMQAAGLTNDHL TC RH  CTL+A
Sbjct: 281 KDMVRRGFRFVGPTVVHSFMQAAGLTNDHLITCCRHAPCTLLA 309

BLAST of Cucsa.118710 vs. TAIR10
Match: AT5G44680.1 (AT5G44680.1 DNA glycosylase superfamily protein)

HSP 1 Score: 356.3 bits (913), Expect = 2.5e-98
Identity = 192/360 (53.33%), Postives = 255/360 (70.83%), Query Frame = 1

Query: 17  SKFNSRPVLQPTGNRV--LDRRNSLKKQHPSLKPPSAAAVSPTSPKSKSPRPPATKRAND 76
           S+ N RPVLQP  N+V  LDRRNSLKK  P  KP     ++P + K  SPRP +      
Sbjct: 15  SQINGRPVLQPKSNQVPTLDRRNSLKKSPP--KP-----LNPIASKIPSPRPISLI---- 74

Query: 77  GNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNGGFEVAYASSLITES 136
            + P++ +++ +  PA   +        KSK      N +     GG++      ++ + 
Sbjct: 75  -SPPLSPNTKSLRKPAGSCKELLRSSSTKSKPVISPENSD-----GGYKEVMPMVIVQKQ 134

Query: 137 PGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-EKIVPLDSKIKPAVEDRRCSFITPN 196
           PGSIAA RRE+VA++Q +RK +I+HYGR KS +  EK + ++ + K     +RCSFIT +
Sbjct: 135 PGSIAAARREEVAMKQEERKKKISHYGRIKSVKSNEKNLNVEHEKK-----KRCSFITTS 194

Query: 197 SDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIV 256
           SDPIYVAYHD+EWGVPVHDD +LFELLVL+ AQVGSDWTS+LK+R  FR AFS F++E+V
Sbjct: 195 SDPIYVAYHDKEWGVPVHDDNLLFELLVLTGAQVGSDWTSVLKRRNTFREAFSGFEAELV 254

Query: 257 ANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYK 316
           A+F++K++ SI  +YGI++++V  VVDNA +IL++K++ GSF+KYIWGF+ +KP + +Y 
Sbjct: 255 ADFNEKKIQSIVNDYGINLSQVLAVVDNAKQILKVKRDLGSFNKYIWGFMKHKPVTTKYT 314

Query: 317 SGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA 374
           S  KIPVKTSKSETISKDMVRRGFR VGPTV+HS MQAAGLTNDHL TC RHL CT +AA
Sbjct: 315 SCQKIPVKTSKSETISKDMVRRGFRFVGPTVIHSLMQAAGLTNDHLITCPRHLECTAMAA 352

BLAST of Cucsa.118710 vs. TAIR10
Match: AT5G57970.1 (AT5G57970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 222.6 bits (566), Expect = 4.2e-58
Identity = 104/197 (52.79%), Postives = 141/197 (71.57%), Query Frame = 1

Query: 174 LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTS 233
           LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDDK LFELLVLS A     W +
Sbjct: 143 LDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPT 202

Query: 234 ILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILQIKKE 293
           IL KRQ FR  F+ FD   +   ++K+++   +     ++  ++R V++NA +IL++ +E
Sbjct: 203 ILSKRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEE 262

Query: 294 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 353
           +GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQA
Sbjct: 263 YGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQA 322

Query: 354 AGLTNDHLTTCHRHLHC 369
           AG+TNDHLT+C R  HC
Sbjct: 323 AGITNDHLTSCFRFHHC 339

BLAST of Cucsa.118710 vs. TAIR10
Match: AT1G15970.1 (AT1G15970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 219.2 bits (557), Expect = 4.7e-57
Identity = 138/373 (37.00%), Postives = 194/373 (52.01%), Query Frame = 1

Query: 22  RPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPKSKSPRPPATKRANDGNNPMNS 81
           R VL PTGN++  +   +K + P ++      +     K+K P  PA+ R        +S
Sbjct: 18  RSVLGPTGNKLQRKPPGMKLEKPMMEK---TIIDSKDEKAKKPTTPASPRTT--LKQCSS 77

Query: 82  SSEKILIPAAVSRPRATLDRKKSKSF--KLGGNGNVICDNGGFEVAYASSL--ITESPGS 141
               IL             RK S S       + +  C++    VA +SS   +    GS
Sbjct: 78  LCSSIL-------------RKNSASMTASYSSDASSSCESSPLSVASSSSCKKVVRRSGS 137

Query: 142 IAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFITPNSDPI 201
           +++ R+  V  ++ +        GR                      +RC++ITP +DP 
Sbjct: 138 VSSTRKLSVGKEEEKVSGDCFADGR----------------------KRCAWITPKADPC 197

Query: 202 YVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFS 261
           YVA+HDEEWGVPVHDDK LFELL LS A     WT IL +R   R  F  FD   VA  +
Sbjct: 198 YVAFHDEEWGVPVHDDKKLFELLCLSGALAELSWTDILSRRHILREVFMDFDPVAVAELN 257

Query: 262 DKQMVSISTEYGIDIN--RVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSG 321
           DK++ +  T     ++  ++R ++DN+  + +I  E GS  KY+W FVNNKP   Q++  
Sbjct: 258 DKKLTAPGTAAISLLSEVKIRSILDNSRHVRKIIAECGSLKKYMWNFVNNKPTQSQFRYQ 317

Query: 322 HKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAAGR 381
            ++PVKTSK+E ISKD+VRRGFRSV PTV++SFMQAAGLTNDHL  C R+  C +     
Sbjct: 318 RQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRYQDCCV----- 343

Query: 382 RTPAPTTTTPEVE 389
              A TTTT + +
Sbjct: 378 --DAETTTTTKAK 343

BLAST of Cucsa.118710 vs. TAIR10
Match: AT1G75090.1 (AT1G75090.1 DNA glycosylase superfamily protein)

HSP 1 Score: 218.8 bits (556), Expect = 6.1e-57
Identity = 103/202 (50.99%), Postives = 142/202 (70.30%), Query Frame = 1

Query: 185 RRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNA 244
           +RC +ITPNSDPIYV +HDEEWGVPV DDK LFELLV S A     W SIL++R DFR  
Sbjct: 119 KRCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEFSWPSILRRRDDFRKL 178

Query: 245 FSSFDSEIVANFSDKQMVSISTEYGIDIN--RVRGVVDNAIRILQIKKEFGSFDKYIWGF 304
           F  FD   +A F++K+++S+     + ++  ++R +V+NA  +L++K+EFGSF  Y W F
Sbjct: 179 FEEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLKVKQEFGSFSNYCWRF 238

Query: 305 VNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTC 364
           VN+KP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT C
Sbjct: 239 VNHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYSFLQASGIVNDHLTAC 298

Query: 365 HRHLHCTLIAAGRRTPAPTTTT 385
            R+  C  +   R T +  T T
Sbjct: 299 FRYQECN-VETERETKSHETET 319

BLAST of Cucsa.118710 vs. NCBI nr
Match: gi|778713005|ref|XP_004139917.2| (PREDICTED: uncharacterized protein LOC101218536 [Cucumis sativus])

HSP 1 Score: 793.9 bits (2049), Expect = 1.3e-226
Identity = 397/397 (100.00%), Postives = 397/397 (100.00%), Query Frame = 1

Query: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60
           MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK
Sbjct: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG 120
           SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG
Sbjct: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICDNG 120

Query: 121 GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP 180
           GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP
Sbjct: 121 GFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKP 180

Query: 181 AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD 240
           AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD
Sbjct: 181 AVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQD 240

Query: 241 FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW 300
           FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW
Sbjct: 241 FRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIW 300

Query: 301 GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT 360
           GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT
Sbjct: 301 GFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLT 360

Query: 361 TCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL 398
           TCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL
Sbjct: 361 TCHRHLHCTLIAAGRRTPAPTTTTPEVEDTAAVCETL 397

BLAST of Cucsa.118710 vs. NCBI nr
Match: gi|593697344|ref|XP_007149154.1| (hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris])

HSP 1 Score: 508.1 bits (1307), Expect = 1.4e-140
Identity = 276/386 (71.50%), Postives = 307/386 (79.53%), Query Frame = 1

Query: 11  TSVVVDS--KFNSRPVLQPTGNRV--LDRRNSLKKQHP--SLKPPSAAAVS------PTS 70
           TS V+ S  + N RPVLQPT NRV  L+RRNS+KK  P  SL PPS    S      P S
Sbjct: 22  TSTVMPSVARINGRPVLQPTCNRVPNLERRNSIKKVQPPKSLSPPSPPLSSKTSLTPPVS 81

Query: 71  PKSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNGNVICD 130
           PKSKSPR PA KR ND NN +N+S EKI IP + S+   TL+RKKSKSFK G      C 
Sbjct: 82  PKSKSPRLPAVKRGND-NNGLNTSYEKIAIPKSSSKA-PTLERKKSKSFKEGS-----CA 141

Query: 131 NGGFEVA--YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDS 190
               E +  YASSLIT+SPGSIAAVRREQ+ALQQAQRKM+IAHYGRSKSA+FE++VPLD 
Sbjct: 142 PASTEASFSYASSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFERVVPLDP 201

Query: 191 KI-----KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDW 250
                  KP  E++RCSFIT NSDPIY+AYHDEEWGVPVHDDKMLFELLVLS AQVGSDW
Sbjct: 202 STTTLTSKPTEEEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDW 261

Query: 251 TSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKE 310
           TS LKKRQDFR AFS FD+E VAN +DKQM+SIS+EYGIDI+RVRGVVDNA +IL+IKK+
Sbjct: 262 TSTLKKRQDFRAAFSDFDAETVANLTDKQMMSISSEYGIDISRVRGVVDNANQILEIKKD 321

Query: 311 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 370
           FGSFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDMVRRG+R VGPTVVHSFMQA
Sbjct: 322 FGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDMVRRGYRFVGPTVVHSFMQA 381

Query: 371 AGLTNDHLTTCHRHLHCTLIAAGRRT 378
           AGLTNDHL TCHRHL CTL+AA   T
Sbjct: 382 AGLTNDHLITCHRHLQCTLLAARPHT 400

BLAST of Cucsa.118710 vs. NCBI nr
Match: gi|1012320007|gb|KYP32870.1| (putative GMP synthase [glutamine-hydrolyzing])

HSP 1 Score: 507.7 bits (1306), Expect = 1.9e-140
Identity = 274/397 (69.02%), Postives = 311/397 (78.34%), Query Frame = 1

Query: 4   SEETLEATSVVVDSKFNSRPVLQPTGNRV--LDRRNSLKKQHP--SLKPPS------AAA 63
           S  T   T+    ++ N RPVLQPT NRV  L+RRNS+KK  P  SL PPS      A+ 
Sbjct: 14  STTTTTTTTTPSVARINGRPVLQPTCNRVPSLERRNSIKKVAPPKSLSPPSPPLPSKASL 73

Query: 64  VSPTSPKSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSKSFKLGGNG 123
             P SPKSKSPR PATKR ND NN +N SSEKI+IP + S    TL+RKKSKSFK G   
Sbjct: 74  TPPVSPKSKSPRLPATKRGND-NNGLNLSSEKIVIPRS-STKAPTLERKKSKSFKEGS-- 133

Query: 124 NVICDNGGFEVAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVP 183
              C +    ++Y+SSLIT+SPGSIAAVRREQ+ALQQAQRKM+IAHYGRSKSA+FE++VP
Sbjct: 134 ---CASIEASLSYSSSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFERVVP 193

Query: 184 LDSKI-----KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVG 243
           LD        KP  E++RCSFIT NSDPIY+AYHDEEWGVPVHDDKMLFELLVLS AQVG
Sbjct: 194 LDPSTTTLTSKPTEEEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLVLSGAQVG 253

Query: 244 SDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQI 303
           SDWTS LKKR DFR AFS FD+E VAN +DKQM+ IS+EYGID+++VRGVVDNA +IL+I
Sbjct: 254 SDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMCISSEYGIDMSKVRGVVDNANQILEI 313

Query: 304 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSF 363
           KK+FGSFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDMVRRGFR VGPTVVHSF
Sbjct: 314 KKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDMVRRGFRFVGPTVVHSF 373

Query: 364 MQAAGLTNDHLTTCHRHLHCTLIAAGRRTPAPTTTTP 386
           MQA+GLTNDHL TCHRHL CTL+AA    P  TT  P
Sbjct: 374 MQASGLTNDHLITCHRHLQCTLLAA---KPHSTTIEP 400

BLAST of Cucsa.118710 vs. NCBI nr
Match: gi|595864201|ref|XP_007211731.1| (hypothetical protein PRUPE_ppa006139mg [Prunus persica])

HSP 1 Score: 507.7 bits (1306), Expect = 1.9e-140
Identity = 283/425 (66.59%), Postives = 319/425 (75.06%), Query Frame = 1

Query: 1   MCRSEETL----EATSVVVDSKFNSRPVLQPTGNRV--LDRRNSLKKQHPSLKPP----- 60
           MC S+  +    E T +V  ++ N RPVLQPT NRV  LDRRNS+KK      PP     
Sbjct: 1   MCSSKAKVTIGVEVTPMV--ARINGRPVLQPTCNRVPSLDRRNSIKKISTPRAPPPPPLP 60

Query: 61  --SAAAVSPT-------------SPKSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSR 120
             SA++ SP              SPKSKSPRPPA KR ND N  +NSSSEK++ P   +R
Sbjct: 61  TSSASSTSPRISNKASSLLTPPISPKSKSPRPPAIKRGNDPNG-LNSSSEKVVTPGGTTR 120

Query: 121 PRATLDRKKSKSFKLGGNG--NVICD--------NGGFE--------VAYASSLITESPG 180
            +  L+RKKSKSFK    G      D         GGF         ++Y+SSLITE+PG
Sbjct: 121 AKI-LERKKSKSFKRASVGVDGASADLHHHGDFSAGGFSSSLNIEASLSYSSSLITEAPG 180

Query: 181 SIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDS----KIKPAVEDRRCSFITP 240
           SIAAVRREQ+ALQ AQRKMRIAHYGRSKSA FE++VP+D+    + K A E++RCSFIT 
Sbjct: 181 SIAAVRREQMALQHAQRKMRIAHYGRSKSANFERVVPVDASGNIEAKGAEEEKRCSFITA 240

Query: 241 NSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEI 300
           NSDPIYVAYHDEEWGVPVHDDKMLFELLVLS AQVGSDWTSILKKRQDFRNAFS FD+EI
Sbjct: 241 NSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRNAFSDFDAEI 300

Query: 301 VANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQY 360
           VANF+DKQMVSI +EYGIDI+RVRGVVDN+ RIL+IKKEFGSFDKYIWGFVN KP SPQY
Sbjct: 301 VANFTDKQMVSIGSEYGIDISRVRGVVDNSNRILEIKKEFGSFDKYIWGFVNQKPISPQY 360

Query: 361 KSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIA 378
           K G+KIPVKTSKSE+ISKDMVRRGFR VGPTVVHSFMQA+GLTNDHL TCHRHL CTL+A
Sbjct: 361 KLGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLITCHRHLQCTLLA 420

BLAST of Cucsa.118710 vs. NCBI nr
Match: gi|734376562|gb|KHN21364.1| (Putative GMP synthase [glutamine-hydrolyzing])

HSP 1 Score: 507.3 bits (1305), Expect = 2.4e-140
Identity = 277/395 (70.13%), Postives = 312/395 (78.99%), Query Frame = 1

Query: 1   MCRSEETLEATSVVVDSK-----FNSRPVLQPTGNRV--LDRRNSLKKQHP--SLKPPSA 60
           MC S+   + T+VV  +K      N RPVLQPT NRV  L+RRNS+KK  P  SL PPS 
Sbjct: 1   MCSSKT--KVTAVVAAAKPSVARINGRPVLQPTCNRVPNLERRNSIKKVAPPKSLSPPSP 60

Query: 61  AAVS------PTSPKSKSPRPPATKRANDGNNPMNSSSEKILIPAAVSRPRATLDRKKSK 120
              S      P SPK KSPR PATKR ND NN +NSS EKI+IP + S    TL+RKKSK
Sbjct: 61  PLPSKTSLTPPVSPKLKSPRLPATKRGND-NNGLNSSYEKIVIPRS-STKTPTLERKKSK 120

Query: 121 SFKLGGNGNVICDNGGFE--VAYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRS 180
           SFK G      C +   E  ++Y+SSLIT+SPGSIAAVRREQ+ALQQAQRKM+IAHYGRS
Sbjct: 121 SFKEGS-----CVSASIEASLSYSSSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRS 180

Query: 181 KSARFEKIVPLDSK-----IKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFE 240
           KSA+FE++VPLD        KP  E++RCSFITPNSDPIY+AYHDEEWGVPVHDDKMLFE
Sbjct: 181 KSAKFERVVPLDPSNTSLASKPTEEEKRCSFITPNSDPIYIAYHDEEWGVPVHDDKMLFE 240

Query: 241 LLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGV 300
           LLVLS AQVGSDWTS LKKR DFR AFS FD+E VAN +DKQM+SIS+EYGIDI+RVRGV
Sbjct: 241 LLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMSISSEYGIDISRVRGV 300

Query: 301 VDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFR 360
           VDNA +IL+IKK+FGSFDKYIWGFVN+KP S QYK GHKIPVKTSKSE+ISKDMVRRGFR
Sbjct: 301 VDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSESISKDMVRRGFR 360

Query: 361 SVGPTVVHSFMQAAGLTNDHLTTCHRHLHCTLIAA 374
            VGPTVVHSFMQ +GLTNDHL TCHRHL CTL+AA
Sbjct: 361 FVGPTVVHSFMQTSGLTNDHLITCHRHLQCTLLAA 386

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GUAA_HELHP6.4e-4043.85Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ... [more]
3MG1_ECOLI6.8e-3439.66DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 S... [more]
3MGA_HAEIN5.6e-2836.31DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D... [more]
Match NameE-valueIdentityDescription
A0A0A0KED6_CUCSA9.1e-227100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_6G134890 PE=4 SV=1[more]
V7BVU6_PHAVU1.0e-14071.50Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_005G045900g PE=4 SV=1[more]
A0A151QRD8_CAJCA1.3e-14069.02Putative GMP synthase [glutamine-hydrolyzing] OS=Cajanus cajan GN=KK1_046337 PE=... [more]
M5X1J5_PRUPE1.3e-14066.59Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006139mg PE=4 SV=1[more]
A0A0B2QMF8_GLYSO1.7e-14070.13Putative GMP synthase [glutamine-hydrolyzing] OS=Glycine soja GN=glysoja_026565 ... [more]
Match NameE-valueIdentityDescription
AT3G12710.15.8e-10065.72 DNA glycosylase superfamily protein[more]
AT5G44680.12.5e-9853.33 DNA glycosylase superfamily protein[more]
AT5G57970.14.2e-5852.79 DNA glycosylase superfamily protein[more]
AT1G15970.14.7e-5737.00 DNA glycosylase superfamily protein[more]
AT1G75090.16.1e-5750.99 DNA glycosylase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778713005|ref|XP_004139917.2|1.3e-226100.00PREDICTED: uncharacterized protein LOC101218536 [Cucumis sativus][more]
gi|593697344|ref|XP_007149154.1|1.4e-14071.50hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris][more]
gi|1012320007|gb|KYP32870.1|1.9e-14069.02putative GMP synthase [glutamine-hydrolyzing][more]
gi|595864201|ref|XP_007211731.1|1.9e-14066.59hypothetical protein PRUPE_ppa006139mg [Prunus persica][more]
gi|734376562|gb|KHN21364.1|2.4e-14070.13Putative GMP synthase [glutamine-hydrolyzing][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005019Adenine_glyco
IPR011257DNA_glycosylase
IPR005019Adenine_glyco
IPR011257DNA_glycosylase
Vocabulary: Biological Process
TermDefinition
GO:0006284base-excision repair
GO:0006281DNA repair
GO:0006284base-excision repair
GO:0006281DNA repair
Vocabulary: Molecular Function
TermDefinition
GO:0008725DNA-3-methyladenine glycosylase activity
GO:0003824catalytic activity
GO:0008725DNA-3-methyladenine glycosylase activity
GO:0003824catalytic activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0006281 DNA repair
cellular_component GO:0005575 cellular_component
molecular_function GO:0008725 DNA-3-methyladenine glycosylase activity
molecular_function GO:0003824 catalytic activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.118710.1Cucsa.118710.1mRNA
Cucsa.118710.2Cucsa.118710.2mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005019Methyladenine glycosylasePFAMPF03352Adenine_glycocoord: 193..290
score: 2.8
IPR011257DNA glycosylaseGENE3DG3DSA:1.10.340.30coord: 185..289
score: 1.3
IPR011257DNA glycosylaseunknownSSF48150DNA-glycosylasecoord: 185..290
score: 3.92
NoneNo IPR availablePANTHERPTHR31116FAMILY NOT NAMEDcoord: 3..290
score: 4.7E
NoneNo IPR availablePANTHERPTHR31116:SF6SUBFAMILY NOT NAMEDcoord: 3..290
score: 4.7E