Cucsa.105900 (gene) Cucumber (Gy14) v1

Name: Cucsa.105900
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: DNA glycosylase
Location: scaffold00929 : 350063 .. 353897 (-)
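The location field packs a scaffold name, a coordinate range, and a strand into one string. The following is a minimal sketch of how it could be unpacked, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of annotation, but not stated on this page). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'scaffold : start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

scaffold, start, end, strand = parse_location("scaffold00929 : 350063 .. 353897 (-)")
print(scaffold, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 3835 bases on the minus strand of scaffold00929.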
The following sequences are available for this feature:

Gene sequence (with intron)

GGGTTTAATCTGTCCCCTGCTTGATTTCGACACCCCAAATTGTGGTTCTGGTGATTATTGTTTTTGTTTCTGCATTTTTTAATTGAGAAGAAGAAATTTCACTGAAATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGTACTTGGGCCTACTGGGAACAAAGCGCGAACTGTAGAGACTAGAAAACCTGGTGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAAAGGGTGCCATTGTCTCCGCCTCAATGTGTTACAGTTCCATCGGTTTTAAGGCAACAGGATCGCCACCAGGCGATTCTCAATCTGTCAATGAATGCCTCGTGTTCTTCGGATGCGTCGTCTGATTCATTTAATAGTCGGGCGTCCAGTGCAAGAGGTACGAGACAGCGTGGTCCAAATTTGAGAAGGAAGCAATGTAGTACGGTTAAGGGGGCTGACAAGGCTGTTGAAAAAGTTGGTGTCGAAAGCGTGGCCGTGGTGGTGGATACAGTTGGTTGCTTAGAGTCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGGTACTTGTACATTGAGTTTAAATTTTCTGTTCGTTTCTGTTAGCTAAAAGTGGTATTCTGATCTTGCCAGCATTGCAATTTAGTCGGCGACTAAGTTGTGAGCATTGTCAAGTTGATTATTAAATGAAGGATTGAAGCTCATTATATCAATGTGGGTTCTAGTAGTGTTCTTTTCATGATTATAAAGGACTACAGTCATAAGACGTCGAAAGTTTGAGCAAGCAACATGCAGAACATTTGCTGTTCTGATTGGCTAGGTTCACATATCTTGAGTGGTTGATTTTCATCTCATTAGTGGCTTTCTTCTTGCTATCTGATACGGCTTTCATTCCTCACTGTGCTCTTGTACACTCTAGGAGAAATCTAGCATCGACGGCCACAATGTAGTAGTTCCTACTTTCATGACTGTTAGAAATGTGTAGATAGAAAAGCATTTTTTCATCCATCAACCATGGAAATAGAGTCGACCCCTACATACTTTGCCGAACTACTCACCCTTCTCTACTTCTTTCTTCTCCATCGTTTTGGAGATGTGATGGTGAATTTCAAACTAAAGATAACAAATCGAGGGCTGATTTATATGCTATGTGTTACTGGACTGTAATGCTGTCTAAGTTAACTTGTCATTTGGCCTCAAGTTACCTTTTAATTTTTTGTTTTTCAAGAGTGATCATCTTGTTAGCATTCAGTCCGCTTTACCTTTCAAAACTTGAAATCTTTAGTTTATTAAGATTGCGGATTATTTGTTATTTTCTGGTTTACAAATTTAAGTTCGTGACTAGCGGGGGCTCAAGAGTTAAAGAAATGGGAAAATTGGAAAAAGTAGCACACTTCTGGTGGAATATTTGAAAATTTGAAAAGATAAAAAAGTTAAAATAAAATTGGGGAAGATTTGAAAAATGGTAGAATCTTGGTCTAATTGTCTACCGAGATGCACCGAGAAAACTGTAACTTGGAAAATTTTAATATGTTAGGAAAACCTAGGTGGTCGATCTCTCGTCAAGAAAAACAAAATTACTAGTTTTCCCTAAGTGTGCTATTTTTTAAATGTAAAATCAAAATGGTGCTATTTCTCAAAATTTCCCAAAGAAATGATAGAGCTAAGTATTTTATTGATGAGAGTCGTTTCACATGTTTCCCTCAAGCAGGTTCATATTATAGCTTACAAATTTGACCATCTCTTCTCTCTAATACAAGTCCATTTCCTAATGCCCCCACCTCATTAATTTTTTTCTTCTCCATCCTAGTGACCATTCCAGCTTTTCTTTCATCACAAACTTGGACATCATCTTTAAATTTTACCCTTTGTCCGTAGACTTTATGTCTAGTTTTTCATTAAGCAAATGTGGGATTGAAATGCTGAATGTAATGTGCACCACTTAGTCTGTTGCTGGTTTTCACTCAAAT
TGTTTACGCCTTCCTTGAGGCTATGGTTGCCAATCTCATGACTCATCCAGAATCGAGAATTTTTTCTCTTCTTTCTAAGATCTTTTCAAGTTTGTTTGGGAACCTAAGCATGATTGTTTTTCTCTTTTCACTAGATCCATGTTATGCTGCTTTTCATGACGAAGAATGGGGAGTACCGGTTCACGATGACAAGTGCGTAACTTTATGATGAAGTTTCATATTTTCTTTATTTCAATCTTTGCATCAAACTCCATTCACCTACTTTATCTTTAATCGCTAAAAAATCCCCCACTTGCAGAAAATTGTTTGAACTGCTTTGCCTATCGGGCGCATTGGCTGAACTTACATGGCCTGCCATCCTCAACAAAAGACATCTATTTAGGTATTACTTTACTGCTGAGTTTATTATGCTTATTTTTTAGCAGTGAGTGTAGTATTATCATTTTATTCAAGCAATGACTTCATTGGTATGCTAATGTGTACTTAAGGGAAATTTTTTTGGACTTCGACCCAACTGCCGTTTCGAAATTAAACGAGAAAAAGATGGTTGCTCCTGGAAGTGCTGCTACTTCTTTACTGTCAGAACTCAAGGTTCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTACCAACTCTTGCCTCTCCTATCGAATTGTATATTAGTTTGCTTTATTGCATAAAACAGACAAATATGTTGCCTGTGTTGAGTGAATGCGCTCTGTTTGAAAATTCTTGTCTTGTCTGGCATGCAAACACACAGTCGCTCTCCTACCCATCTAATGATTTATAACCTGTCATATCGTTTCCCTTTCCATTGCAATGCAGGTCATTGTTGGGTTTGGTCTTTTAAGTTTTTAATGTCATGTTTATAATGTCTAATCACAAACCAATCAAAGAAGATAGAGATTTGGAGAAATGTCTTTAATGGGGTGTTTGGGCCACCGACTTTATAAGTCGGTGTTATACTAATCTTCACCTACTTCAACAGCTATTGAAGTTTGCACATTTATCGAGGAGCTTCATCTTTCCTTCTCTCTTCTTTCTCTCTAGTGATCCTAACTTTAACCTAAACAACTCTACTTTAAATACAAACTTCTTAACTCCAACCGCTTAACTTCATGCTTCAAAACTCAACTTTCCGTGTCAAACAGTTTCTAATTTTTATTTTTACCTCTTCTCCATCGTGGTGCAGGTAATTGATGAATTTGGTTCCTTCAACGTGTACATGTGGAACTTTGTGAACCATAAACCAATCATCAGTCAGTTCCGGTACCCACGTCAAGTCCCGGATAAGACATCGAAAGCAGAGGTGATAAGCAAGGATCTCGTAAAGAGAGGGTTTCGAAGCGTAGGACCAACAGTCATCTATACATTCATGCAGGTGGCTGGGTTAACTAATGACCATCTCATCGGTTGCTTTAGGTTTACAGAATGTATAGAGACACAAACAGCAGAGAAAGGAGAAAGAGATGGTGGTGAAATGAAGCTTAATCCTAATGAGAAAATGCCAGAGGCTTTGAAAAACTTGGAACTATAAAAGAAACCATTGGTAGCTTTTGAACCTTTGCCTCGTTGTAATTAGCTTCCAGAGTTCTTTTTTTCTTTTCTTTTCTTTTTTGTAATGATGGCTTGTAAATTCCTTGATGGGATATTCGCCACTTCTTTCAATGGGGTAAATTTTAGCAATGATTTTGTGTATAAACTGAATTGGATACAGAAGACAGCTAGAATCAGTTCTGTTAGTTTATTACTTCAAGCAATGTGGTTGGTTATTTACATTTAGAATTTATATTTAAAACCATTATGTTGGTTCCACTTCTACTTCTAACA

mRNA sequence

GGGTTTAATCTGTCCCCTGCTTGATTTCGACACCCCAAATTGTGGTTCTGGTGATTATTGTTTTTGTTTCTGCATTTTTTAATTGAGAAGAAGAAATTTCACTGAAATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGTACTTGGGCCTACTGGGAACAAAGCGCGAACTGTAGAGACTAGAAAACCTGGTGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAAAGGGTGCCATTGTCTCCGCCTCAATGTGTTACAGTTCCATCGGTTTTAAGGCAACAGGATCGCCACCAGGCGATTCTCAATCTGTCAATGAATGCCTCGTGTTCTTCGGATGCGTCGTCTGATTCATTTAATAGTCGGGCGTCCAGTGCAAGAGGTACGAGACAGCGTGGTCCAAATTTGAGAAGGAAGCAATGTAGTACGGTTAAGGGGGCTGACAAGGCTGTTGAAAAAGTTGGTGTCGAAAGCGTGGCCGTGGTGGTGGATACAGTTGGTTGCTTAGAGTCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGATCCATGTTATGCTGCTTTTCATGACGAAGAATGGGGAGTACCGGTTCACGATGACAAAAAATTGTTTGAACTGCTTTGCCTATCGGGCGCATTGGCTGAACTTACATGGCCTGCCATCCTCAACAAAAGACATCTATTTAGGGAAATTTTTTTGGACTTCGACCCAACTGCCGTTTCGAAATTAAACGAGAAAAAGATGGTTGCTCCTGGAAGTGCTGCTACTTCTTTACTGTCAGAACTCAAGGTTCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAACGTGTACATGTGGAACTTTGTGAACCATAAACCAATCATCAGTCAGTTCCGGTACCCACGTCAAGTCCCGGATAAGACATCGAAAGCAGAGGTGATAAGCAAGGATCTCGTAAAGAGAGGGTTTCGAAGCGTAGGACCAACAGTCATCTATACATTCATGCAGGTGGCTGGGTTAACTAATGACCATCTCATCGGTTGCTTTAGGTTTACAGAATGTATAGAGACACAAACAGCAGAGAAAGGAGAAAGAGATGGTGGTGAAATGAAGCTTAATCCTAATGAGAAAATGCCAGAGGCTTTGAAAAACTTGGAACTATAAAAGAAACCATTGGTAGCTTTTGAACCTTTGCCTCGTTGTAATTAGCTTCCAGAGTTCTTTTTTTCTTTTCTTTTCTTTTTTGTAATGATGGCTTGTAAATTCCTTGATGGGATATTCGCCACTTCTTTCAATGGGGTAAATTTTAGCAATGATTTTGTGTATAAACTGAATTGGATACAGAAGACAGCTAGAATCAGTTCTGTTAGTTTATTACTTCAAGCAATGTGGTTGGTTATTTACATTTAGAATTTATATTTAAAACCATTATGTTGGTTCCACTTCTACTTCTAACA

Coding sequence (CDS)

ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGTACTTGGGCCTACTGGGAACAAAGCGCGAACTGTAGAGACTAGAAAACCTGGTGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAAAGGGTGCCATTGTCTCCGCCTCAATGTGTTACAGTTCCATCGGTTTTAAGGCAACAGGATCGCCACCAGGCGATTCTCAATCTGTCAATGAATGCCTCGTGTTCTTCGGATGCGTCGTCTGATTCATTTAATAGTCGGGCGTCCAGTGCAAGAGGTACGAGACAGCGTGGTCCAAATTTGAGAAGGAAGCAATGTAGTACGGTTAAGGGGGCTGACAAGGCTGTTGAAAAAGTTGGTGTCGAAAGCGTGGCCGTGGTGGTGGATACAGTTGGTTGCTTAGAGTCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGATCCATGTTATGCTGCTTTTCATGACGAAGAATGGGGAGTACCGGTTCACGATGACAAAAAATTGTTTGAACTGCTTTGCCTATCGGGCGCATTGGCTGAACTTACATGGCCTGCCATCCTCAACAAAAGACATCTATTTAGGGAAATTTTTTTGGACTTCGACCCAACTGCCGTTTCGAAATTAAACGAGAAAAAGATGGTTGCTCCTGGAAGTGCTGCTACTTCTTTACTGTCAGAACTCAAGGTTCGAGCTATCATTGAAAATGGTCGTCAAATGTGCAAGGTAATTGATGAATTTGGTTCCTTCAACGTGTACATGTGGAACTTTGTGAACCATAAACCAATCATCAGTCAGTTCCGGTACCCACGTCAAGTCCCGGATAAGACATCGAAAGCAGAGGTGATAAGCAAGGATCTCGTAAAGAGAGGGTTTCGAAGCGTAGGACCAACAGTCATCTATACATTCATGCAGGTGGCTGGGTTAACTAATGACCATCTCATCGGTTGCTTTAGGTTTACAGAATGTATAGAGACACAAACAGCAGAGAAAGGAGAAAGAGATGGTGGTGAAATGAAGCTTAATCCTAATGAGAAAATGCCAGAGGCTTTGAAAAACTTGGAACTATAA

Protein sequence

MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGERDGGEMKLNPNEKMPEALKNLEL*
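The protein sequence follows from the CDS by reading it three bases at a time. The sketch below illustrates this on the first 30 bases of the CDS only, assuming the standard genetic code; `translate` is a hypothetical helper written for this illustration, and `*` marks a stop codon as in the listing above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First ten codons of the CDS listed above.
print(translate("ATGTCAGGCCCTCCGAGAATCCGGTCTATG"))  # MSGPPRIRSM
```

The result matches the first ten residues of the protein sequence. Running the same function over the full CDS would be expected to end in `*`, since the CDS terminates with the stop codon TAA.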
BLAST of Cucsa.105900 vs. Swiss-Prot
Match: 3MG1_ECOLI (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 SV=1)

HSP 1 Score: 157.5 bits (397), Expect = 2.8e-37
Identity = 76/180 (42.22%), Positives = 109/180 (60.56%), Query Frame = 1

Query: 153 KRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREI 212
           +RC WV+   DP Y A+HD EWGVP  D KKLFE++CL G  A L+W  +L KR  +R  
Sbjct: 2   ERCGWVSQ--DPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRAC 61

Query: 213 FLDFDPTAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNF 272
           F  FDP  V+ + E+ +      A  +    K++AII N R   ++      F  ++W+F
Sbjct: 62  FHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWSF 121

Query: 273 VNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGC 332
           VNH+P ++Q     ++P  TS ++ +SK L KRGF+ VG T+ Y+FMQ  GL NDH++GC
Sbjct: 122 VNHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVGC 179
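The percentages in each HSP summary line are the identical (or positively scoring) alignment columns divided by the alignment length. The sketch below recomputes them from the fractions for the 3MG1_ECOLI hit above; `hsp_percentages` is a hypothetical helper and does not come from any BLAST tool.

```python
import re

def hsp_percentages(stats_line):
    """Return each 'n/total' fraction in the line as a percentage (2 d.p.)."""
    pairs = re.findall(r"(\d+)/(\d+)", stats_line)
    return [round(100 * int(n) / int(total), 2) for n, total in pairs]

line = "Identity = 76/180 (42.22%), Positives = 109/180 (60.56%), Query Frame = 1"
print(hsp_percentages(line))  # [42.22, 60.56]
```

Both values agree with the percentages reported for this hit: 76 identical and 109 positively scoring columns out of 180 aligned positions.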

BLAST of Cucsa.105900 vs. Swiss-Prot
Match: GUAA_HELHP (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) GN=guaA PE=3 SV=1)

HSP 1 Score: 157.1 bits (396), Expect = 3.6e-37
Identity = 83/197 (42.13%), Positives = 108/197 (54.82%), Query Frame = 1

Query: 149 LESKKRCAWVTPNTDPC---YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNK 208
           +  K RCAW T   +     Y  +HD EWG P+H+DKKLFE L L G  A L+W  IL K
Sbjct: 782 VREKVRCAWATDKDEAARKLYEDYHDTEWGEPLHEDKKLFEHLVLEGFQAGLSWITILKK 841

Query: 209 RHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSF 268
           R  FR  F DFDP  V+  +E K+         + +  K+ A I N +    V  EFGSF
Sbjct: 842 REAFRVAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAIINAKAFMAVQREFGSF 901

Query: 269 NVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLT 328
           + Y+W FV  KPII+ F     +P  T  ++ I+KDL KRGF+ VG T +Y  MQ  G+ 
Sbjct: 902 DKYIWGFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFVGTTTMYAMMQSIGMV 961

Query: 329 NDHLIGCFRFTECIETQ 343
           NDHL  CF+    +  Q
Sbjct: 962 NDHLTSCFKCNSSLGMQ 978

BLAST of Cucsa.105900 vs. Swiss-Prot
Match: 3MGA_HAEIN (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=tag PE=3 SV=1)

HSP 1 Score: 137.1 bits (344), Expect = 3.9e-31
Identity = 70/179 (39.11%), Positives = 101/179 (56.42%), Query Frame = 1

Query: 154 RCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIF 213
           RC WV   +   Y  +HD+EWG P  D +KLFE +CL G  A L+W  +L KR  +RE F
Sbjct: 4   RCPWVGEQS--IYIDYHDKEWGKPEFDSQKLFEKICLEGQQAGLSWITVLKKRESYREAF 63

Query: 214 LDFDPTAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFV 273
             FDP  ++K+    + A    +  +    K+ AI++N +    +     +F+ ++W+FV
Sbjct: 64  HQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAYLAMEKCGENFSDFIWSFV 123

Query: 274 NHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGC 333
           NHKPI++     R VP KT  ++ +SK L KRGF  +G T  Y FMQ  GL +DHL  C
Sbjct: 124 NHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTCYAFMQSMGLVDDHLNDC 180

BLAST of Cucsa.105900 vs. TrEMBL
Match: A0A0A0K8L6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_7G432000 PE=4 SV=1)

HSP 1 Score: 749.6 bits (1934), Expect = 1.8e-213
Identity = 371/371 (100.00%), Positives = 371/371 (100.00%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60
           MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL
Sbjct: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60

Query: 61  SPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQ 120
           SPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQ
Sbjct: 61  SPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQ 120

Query: 121 CSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHD 180
           CSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHD
Sbjct: 121 CSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHD 180

Query: 181 DKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLL 240
           DKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLL
Sbjct: 181 DKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLL 240

Query: 241 SELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK 300
           SELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Sbjct: 241 SELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK 300

Query: 301 DLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGERDGGEMKLNPNE 360
           DLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGERDGGEMKLNPNE
Sbjct: 301 DLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGERDGGEMKLNPNE 360

Query: 361 KMPEALKNLEL 372
           KMPEALKNLEL
Sbjct: 361 KMPEALKNLEL 371

BLAST of Cucsa.105900 vs. TrEMBL
Match: W9R0J8_9ROSA (Putative Glutamine amidotransferase OS=Morus notabilis GN=L484_013516 PE=4 SV=1)

HSP 1 Score: 455.7 bits (1171), Expect = 5.5e-125
Identity = 245/406 (60.34%), Positives = 282/406 (69.46%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESK----DK 60
           MSGPPR+RSMN+AD++ RPVLGP GNKAR  +TRK   K LKK EKP QE E K      
Sbjct: 1   MSGPPRLRSMNIADTEPRPVLGPAGNKARPADTRKSASKSLKKSEKPSQETEKKAVAHSP 60

Query: 61  RVPLSPPQCVTVPSVLRQ--QDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGP 120
            +  SP Q V VP+VLRQ  Q  H A+L  SM+ASCSSDASS   +    + R  R    
Sbjct: 61  SLSPSPRQRVKVPAVLRQPQQHHHHALLGSSMSASCSSDASSSDSSHSGRAVR--RSVVA 120

Query: 121 NLRRKQCSTVKGADKAVEKVGVESVA----------VVVDTVGCLESKKRCAWVTPNT-- 180
            +RR+QC     A+K VEK+  ES++          V  D+  CL+SKKRC+W+TPN   
Sbjct: 121 PMRRRQCGLK--AEKKVEKIETESISMNKVGGGGNVVTADSDDCLDSKKRCSWITPNAYL 180

Query: 181 ----------------------DPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWP 240
                                 D CY  FHDE WG+PVHDDKKLFELL LSGALAEL+WP
Sbjct: 181 KDFISTQKSLIRFLTASHFVQKDQCYITFHDEVWGLPVHDDKKLFELLSLSGALAELSWP 240

Query: 241 AILNKRHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVID 300
           AILNKR +FRE+FLDFDP A+SKLNEKK+ APGS ATSLLSELK+RA+IEN RQMCKVI+
Sbjct: 241 AILNKRDIFREVFLDFDPVAISKLNEKKVTAPGSPATSLLSELKLRAMIENARQMCKVIE 300

Query: 301 EFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQ 360
           EFGSF+ Y+W+FVNHKPI+SQFRYPRQVP KT KAEVISKDLV+RGFRSVGPTVIY+FMQ
Sbjct: 301 EFGSFDEYIWSFVNHKPIVSQFRYPRQVPVKTPKAEVISKDLVRRGFRSVGPTVIYSFMQ 360

Query: 361 VAGLTNDHLIGCFRFTECIETQTAEKGERDGGEMKLNPNEKMPEAL 367
           VAGLTNDHLI CFRF EC+   TAE  ERDGG     P E     L
Sbjct: 361 VAGLTNDHLISCFRFQECL--ATAEASERDGGHNTETPREPTDRVL 400

BLAST of Cucsa.105900 vs. TrEMBL
Match: M5W9T8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa026720mg PE=4 SV=1)

HSP 1 Score: 442.6 bits (1137), Expect = 4.8e-121
Identity = 232/367 (63.22%), Positives = 276/367 (75.20%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60
           MSG PR+RS+NVADS+SRPVLGP GNKA T   RKP  KPL+K EK  ++V S +++   
Sbjct: 1   MSGAPRVRSINVADSESRPVLGPAGNKAGTFSARKPVSKPLRKAEKLAEKVASAEEKKTR 60

Query: 61  SPPQCVT--------VPSVLRQQDRHQAIL--NLSMNASCSSDASSDSFNSRASSARGTR 120
                 T        VPSVLR   RH+ +L  N S+NASCSSDAS+DSF+SRAS+ R TR
Sbjct: 61  QSSMLTTSPQLHSPSVPSVLR---RHEQLLHSNFSLNASCSSDASTDSFHSRASTGRLTR 120

Query: 121 QRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFH 180
                 RRKQ   V      V   G++S           +SKKRCAWVTPNTDPCYAAFH
Sbjct: 121 SNSAGSRRKQY--VSKPRSVVSDGGLDSPP------DGSQSKKRCAWVTPNTDPCYAAFH 180

Query: 181 DEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMV 240
           DEEWG+PVHDDKKLFELL LSGALAEL+WPAIL+K+H+FRE+F DFDP A+SKLNEKK++
Sbjct: 181 DEEWGLPVHDDKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPVAISKLNEKKLI 240

Query: 241 APGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPD 300
           APGS A+SLLSELK+RAIIEN RQM KVI+EFGSF+ Y+W+FVN+KPI+S+FRYPRQVP 
Sbjct: 241 APGSNASSLLSELKLRAIIENARQMTKVIEEFGSFDKYIWSFVNNKPIVSRFRYPRQVPA 300

Query: 301 KTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTA--EKGE 356
           KT KA+VISKDL++RGFRSVGPTVIY+FMQVAG+TNDHL+ CFRF EC+       E G 
Sbjct: 301 KTPKADVISKDLMRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECLNAAEGKEEYGI 356

BLAST of Cucsa.105900 vs. TrEMBL
Match: A0A151QNJ1_CAJCA (Putative GMP synthase [glutamine-hydrolyzing] OS=Cajanus cajan GN=KK1_047590 PE=4 SV=1)

HSP 1 Score: 438.7 bits (1127), Expect = 6.9e-120
Identity = 232/371 (62.53%), Positives = 276/371 (74.39%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVP- 60
           MSG PR+RSMNV DS+ RPVLGP GNK+ ++ +RK   KPL+K+EK   EV S  ++ P 
Sbjct: 1   MSGAPRLRSMNVGDSEVRPVLGPAGNKSGSLGSRKAASKPLRKVEKLLDEVASVKEKKPH 60

Query: 61  -------LSPPQC--VTVPSVLRQQDRHQAIL--NLSMNASCSSDASSDSFNSRASSARG 120
                   S P     +V SVLR   RH+ +L  NLS+NASCSSDAS+DSF+SRAS+ R 
Sbjct: 61  QVLSSVVTSSPHSHSASVSSVLR---RHEQLLHSNLSLNASCSSDASTDSFHSRASTGRL 120

Query: 121 TRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAA 180
           TR      RRK C +        +   V S  V+     C +SKKRCAWVTPNT+PCYA 
Sbjct: 121 TRSYSLGSRRKSCVS--------KARSVASDGVLESPPDCSQSKKRCAWVTPNTEPCYAT 180

Query: 181 FHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKK 240
           FHDEEWGVPVHDDKKLFELL LS  LAELTWPAIL+KRH FRE+F+DFDP AVSKL+EKK
Sbjct: 181 FHDEEWGVPVHDDKKLFELLVLSSVLAELTWPAILSKRHTFREVFVDFDPVAVSKLSEKK 240

Query: 241 MVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQV 300
           M+APG+ A+SLLSE+K+RAIIEN RQ+ KVIDEFGSF+ Y+W+FVNHKP++S+FRYPRQV
Sbjct: 241 MMAPGTIASSLLSEVKLRAIIENARQISKVIDEFGSFDKYIWSFVNHKPVVSRFRYPRQV 300

Query: 301 PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGE 360
           P KT KA+VISKDLV+RGFR VGPTV+Y+FMQVAGLTNDHLI CFRF ECI    AE  E
Sbjct: 301 PVKTPKADVISKDLVRRGFRGVGPTVVYSFMQVAGLTNDHLISCFRFDECI--AVAEGKE 358

BLAST of Cucsa.105900 vs. TrEMBL
Match: A0A0D2VII3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G216100 PE=4 SV=1)

HSP 1 Score: 438.7 bits (1127), Expect = 6.9e-120
Identity = 239/375 (63.73%), Positives = 282/375 (75.20%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKD--KRV 60
           MSG PR+RSMN  DS++RPVLGP GNKA ++  RKP  KPL+K+EK   EV + +  K +
Sbjct: 1   MSGAPRLRSMNAPDSEARPVLGPAGNKAGSLSARKPASKPLRKVEKSPVEVTATEEKKSL 60

Query: 61  P------LSPPQ-CVTVPSVLRQQDRHQAIL--NLSMNASCSSDASSDSFNSRASSARGT 120
           P      LSP +  V+VPSVLR   RH+ +L  NLS+NASCSSDAS+DSF+SRAS+ R  
Sbjct: 61  PSSIVSSLSPKKHSVSVPSVLR---RHEKLLHSNLSLNASCSSDASTDSFHSRASTGRLI 120

Query: 121 RQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAF 180
           R      RRK    V      V   G +S +           KKRCAWVTPNTDP YA F
Sbjct: 121 RSNSVGSRRKPY--VSKPRSFVSDSGSDSPS------DGSHQKKRCAWVTPNTDPSYATF 180

Query: 181 HDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKM 240
           HDEEWGVPVHDDKKLFELL LSGAL+ELTWPAIL+KR +FRE+F+DFDP AVSKLNEKK+
Sbjct: 181 HDEEWGVPVHDDKKLFELLVLSGALSELTWPAILSKRQMFREVFMDFDPAAVSKLNEKKL 240

Query: 241 VAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVP 300
           +APGS ++SLLSELK+RAIIEN RQ+ KVIDEFGSF+ Y+W+FVNHKPIIS+FRYPRQVP
Sbjct: 241 IAPGSVSSSLLSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIISKFRYPRQVP 300

Query: 301 DKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGER 360
            KT KA+VISKDLV+RGFRSVGPTVIY+FMQVAG+TNDHL GCFRF ECI   TA +G+ 
Sbjct: 301 VKTPKADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTGCFRFQECI---TAAEGKE 359

Query: 361 DGGEMKLNPNEKMPE 365
              E+K    EK P+
Sbjct: 361 --VEIKERAEEKKPD 359

BLAST of Cucsa.105900 vs. TAIR10
Match: AT5G57970.1 (AT5G57970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 364.4 bits (934), Expect = 8.4e-101
Identity = 192/351 (54.70%), Positives = 239/351 (68.09%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60
           MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   
Sbjct: 1   MSGAPRVQSMNVAEAETRSTLGSTAKKASPFITHKAVSKSLRKLERSSSGRTGSDEKTSY 60

Query: 61  SPP----------QCVTVPSVLRQQDRHQAILN--LSMNASCSSDASSDSFNSRASSARG 120
           + P            +   S+LR   RH+  LN  LS+NAS SSDAS DSF+SRAS+ R 
Sbjct: 61  ATPTETVSSSSQKHTLNAASILR---RHEQNLNSNLSLNASFSSDASMDSFHSRASTGRL 120

Query: 121 TRQRGPNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAA 180
            R      R K   +        +   V S   +       E+KKRC WVTPN+DPCY  
Sbjct: 121 IRSYSVGSRSKSYPS--------KPRSVVSEGALDSPPNGSETKKRCTWVTPNSDPCYIV 180

Query: 181 FHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKK 240
           FHDEEWGVPVHDDK+LFELL LSGALAE TWP IL+KR  FRE+F DFDP A+ K+NEKK
Sbjct: 181 FHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTILSKRQAFREVFADFDPNAIVKINEKK 240

Query: 241 MVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQV 300
           ++ PGS A++LLS+LK+RA+IEN RQ+ KVI+E+GSF+ Y+W+FV +K I+S+FRY RQV
Sbjct: 241 IIGPGSPASTLLSDLKLRAVIENARQILKVIEEYGSFDKYIWSFVKNKAIVSKFRYQRQV 300

Query: 301 PDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECI 340
           P KT KAEVISKDLV+RGFRSVGPTV+Y+FMQ AG+TNDHL  CFRF  CI
Sbjct: 301 PAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAAGITNDHLTSCFRFHHCI 340

BLAST of Cucsa.105900 vs. TAIR10
Match: AT1G15970.1 (AT1G15970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 352.4 bits (903), Expect = 3.3e-97
Identity = 199/367 (54.22%), Positives = 246/367 (67.03%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---VESKDKR 60
           MS PPR RS+N  + + R VLGPTGNK +    RKP   P  KLEKP  E   ++SKD++
Sbjct: 1   MSVPPRFRSVNSDEREFRSVLGPTGNKLQ----RKP---PGMKLEKPMMEKTIIDSKDEK 60

Query: 61  V-----PLSP----PQCVTV-PSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSAR 120
                 P SP     QC ++  S+LR+        + SM AS SSDASS   +S  S A 
Sbjct: 61  AKKPTTPASPRTTLKQCSSLCSSILRKN-------SASMTASYSSDASSSCESSPLSVAS 120

Query: 121 GTRQRGPNLRRKQCSTVK--GADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPC 180
            +  +    R    S+ +     K  EKV  +  A         + +KRCAW+TP  DPC
Sbjct: 121 SSSCKKVVRRSGSVSSTRKLSVGKEEEKVSGDCFA---------DGRKRCAWITPKADPC 180

Query: 181 YAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLN 240
           Y AFHDEEWGVPVHDDKKLFELLCLSGALAEL+W  IL++RH+ RE+F+DFDP AV++LN
Sbjct: 181 YVAFHDEEWGVPVHDDKKLFELLCLSGALAELSWTDILSRRHILREVFMDFDPVAVAELN 240

Query: 241 EKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYP 300
           +KK+ APG+AA SLLSE+K+R+I++N R + K+I E GS   YMWNFVN+KP  SQFRY 
Sbjct: 241 DKKLTAPGTAAISLLSEVKIRSILDNSRHVRKIIAECGSLKKYMWNFVNNKPTQSQFRYQ 300

Query: 301 RQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTEC---IETQ 350
           RQVP KTSKAE ISKDLV+RGFRSV PTVIY+FMQ AGLTNDHLIGCFR+ +C    ET 
Sbjct: 301 RQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAAGLTNDHLIGCFRYQDCCVDAETT 344

BLAST of Cucsa.105900 vs. TAIR10
Match: AT1G80850.1 (AT1G80850.1 DNA glycosylase superfamily protein)

HSP 1 Score: 335.9 bits (860), Expect = 3.2e-92
Identity = 183/339 (53.98%), Positives = 224/339 (66.08%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60
           MS PPR+RS++ +D + R VLGP GNK +     KP  KP+ +  K     E   +  PL
Sbjct: 1   MSAPPRVRSVDSSDREFRSVLGPAGNKLQQKPLSKPVKKPVAEKTKNLTFTEKMPQCSPL 60

Query: 61  SPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQ 120
           SPP       +LR+         +SM AS SSDASS S  S   S   T      LRR  
Sbjct: 61  SPP-------ILRRN-------GISMTASYSSDASS-SCESSPLSMTSTSSGKRVLRRS- 120

Query: 121 CSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHD 180
             +V  +      +  E      D     + +KRCAW+TP +D CY AFHDEEWGVPVHD
Sbjct: 121 -GSVSSSSSLRRNLTEERDEKASDCF--CDGRKRCAWITPKSDQCYIAFHDEEWGVPVHD 180

Query: 181 DKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLL 240
           DK+LFELL LSGALAEL+W  IL+KR LFRE+F+DFDP A+S+L  KK+ +P  AAT+LL
Sbjct: 181 DKRLFELLSLSGALAELSWKDILSKRQLFREVFMDFDPIAISELTNKKITSPEIAATTLL 240

Query: 241 SELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK 300
           SE K+R+I+EN  Q+CK+I  FGSF+ Y+WNFVN KP  SQFRYPRQVP KTSKAE+ISK
Sbjct: 241 SEQKLRSILENANQVCKIIGAFGSFDKYIWNFVNQKPTQSQFRYPRQVPVKTSKAELISK 300

Query: 301 DLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECI 340
           DLV+RGFRSV PTVIY+FMQ AGLTNDHL  CFR  +C+
Sbjct: 301 DLVRRGFRSVSPTVIYSFMQTAGLTNDHLTCCFRHHDCM 320

BLAST of Cucsa.105900 vs. TAIR10
Match: AT1G75090.1 (AT1G75090.1 DNA glycosylase superfamily protein)

HSP 1 Score: 260.4 bits (664), Expect = 1.7e-69
Identity = 141/322 (43.79%), Positives = 197/322 (61.18%), Query Frame = 1

Query: 40  PLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSF 99
           P+K +++ R  + S   R  ++  +    P +  +  +  A      N S S+D SS S 
Sbjct: 10  PVKPIDESRAILCSTGNRFKVTKTEMTKKPQLNPRVTKSPATKKPDSNFSVSTDDSSSSS 69

Query: 100 NSRASSARGTRQRGPNLRRKQCSTVKGADKAVEKVG--VESVAVVVDTVGCLESK-KRCA 159
           +S   S+  T   G         T       VEK+   V SVAVV D    +    KRC 
Sbjct: 70  SSSERSSVNTTNSGK-------VTTPSKRNGVEKLNNVVASVAVVEDISPKIPGPVKRCH 129

Query: 160 WVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDF 219
           W+TPN+DP Y  FHDEEWGVPV DDKKLFELL  S ALAE +WP+IL +R  FR++F +F
Sbjct: 130 WITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEFSWPSILRRRDDFRKLFEEF 189

Query: 220 DPTAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHK 279
           DP+A+++  EK++++       +LSE K+RAI+EN + + KV  EFGSF+ Y W FVNHK
Sbjct: 190 DPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLKVKQEFGSFSNYCWRFVNHK 249

Query: 280 PIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFT 339
           P+ + +RY RQVP K+ KAE ISKD+++RGFR VGPTV+Y+F+Q +G+ NDHL  CFR+ 
Sbjct: 250 PLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYSFLQASGIVNDHLTACFRYQ 309

Query: 340 EC-IETQTAEKGERDGGEMKLN 358
           EC +ET+   K      ++ L+
Sbjct: 310 ECNVETERETKSHETETKLDLH 324

BLAST of Cucsa.105900 vs. TAIR10
Match: AT1G13635.1 (AT1G13635.1 DNA glycosylase superfamily protein)

HSP 1 Score: 216.1 bits (549), Expect = 3.7e-56
Identity = 92/190 (48.42%), Positives = 136/190 (71.58%), Query Frame = 1

Query: 150 ESKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLF 209
           +  KRC W+T  +D  Y  FHD++WGVPV+DD  LFE L +SG L +  W  IL ++  F
Sbjct: 112 DEPKRCNWITKKSDEVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHF 171

Query: 210 REIFLDFDPTAVSKLNEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYM 269
           RE F +FDP  V+K+ EK++    S    +L E +VR I++N + + KV++EFGSF+ ++
Sbjct: 172 REAFCEFDPNRVAKMGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFV 231

Query: 270 WNFVNHKPIISQFRYPRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHL 329
           W F+++KPII++F+Y R VP ++ KAE+ISKD++KRGFR VGP ++++FMQ AGLT DHL
Sbjct: 232 WGFMDYKPIINKFKYSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHL 291

Query: 330 IGCFRFTECI 340
           + CFR  +C+
Sbjct: 292 VDCFRHGDCV 301

BLAST of Cucsa.105900 vs. NCBI nr
Match: gi|778728928|ref|XP_004136097.2| (PREDICTED: uncharacterized protein LOC101205558 [Cucumis sativus])

HSP 1 Score: 749.6 bits (1934), Expect = 2.6e-213
Identity = 371/371 (100.00%), Positives = 371/371 (100.00%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60
           MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL
Sbjct: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60

Query: 61  SPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQ 120
           SPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQ
Sbjct: 61  SPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQ 120

Query: 121 CSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHD 180
           CSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHD
Sbjct: 121 CSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHD 180

Query: 181 DKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLL 240
           DKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLL
Sbjct: 181 DKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLL 240

Query: 241 SELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK 300
           SELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Sbjct: 241 SELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK 300

Query: 301 DLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGERDGGEMKLNPNE 360
           DLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGERDGGEMKLNPNE
Sbjct: 301 DLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGERDGGEMKLNPNE 360

Query: 361 KMPEALKNLEL 372
           KMPEALKNLEL
Sbjct: 361 KMPEALKNLEL 371

BLAST of Cucsa.105900 vs. NCBI nr
Match: gi|659122505|ref|XP_008461179.1| (PREDICTED: uncharacterized protein LOC103499838 [Cucumis melo])

HSP 1 Score: 736.9 bits (1901), Expect = 1.8e-209
Identity = 366/371 (98.65%), Positives = 367/371 (98.92%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60
           MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL
Sbjct: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60

Query: 61  SPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQ 120
           SPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQ
Sbjct: 61  SPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASSARGTRQRGPNLRRKQ 120

Query: 121 CSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHD 180
           CSTVKGADKAVEKVGVESVAVV DTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHD
Sbjct: 121 CSTVKGADKAVEKVGVESVAVVADTVGCLESKKRCAWVTPNTDPCYAAFHDEEWGVPVHD 180

Query: 181 DKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMVAPGSAATSLL 240
           DKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPT VSKLNEKKMVAPGSAATSLL
Sbjct: 181 DKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTVVSKLNEKKMVAPGSAATSLL 240

Query: 241 SELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK 300
           SELK+RAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK
Sbjct: 241 SELKIRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTSKAEVISK 300

Query: 301 DLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGERDGGEMKLNPNE 360
           DLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRFTECIETQTAEKGERD GEMKLNPNE
Sbjct: 301 DLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFTECIETQTAEKGERD-GEMKLNPNE 360

Query: 361 KMPEALKNLEL 372
           KMPEALKNLEL
Sbjct: 361 KMPEALKNLEL 370

BLAST of Cucsa.105900 vs. NCBI nr
Match: gi|1009176825|ref|XP_015869637.1| (PREDICTED: probable GMP synthase [glutamine-hydrolyzing])

HSP 1 Score: 464.5 bits (1194), Expect = 1.7e-127
Identity = 238/366 (65.03%), Positives = 276/366 (75.41%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60
           MSGPPR+RS N+AD++SRPVLGP GNKA   + RKP  KPLKK EKP QE E K      
Sbjct: 1   MSGPPRLRSQNIADTESRPVLGPAGNKATPTDNRKPASKPLKKAEKPSQETEKKAGVHHH 60

Query: 61  SPPQCVTVPSVLR-----QQDRHQ---AILNLSMNASCSSDASSDSFNSRASSARGTRQR 120
           SPPQ  TVP +LR     QQ+ HQ    +LN SMNASCSSDASS + +S + S R +R+ 
Sbjct: 61  SPPQRFTVPMILRRQKQQQQEHHQYQTMLLNSSMNASCSSDASSSTTDS-SHSWRASRRS 120

Query: 121 GPNLRRKQCSTV--------KGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDP 180
            P LR+K   +          G    + K  V +  V  D+   +++K+RCAW+TPNTD 
Sbjct: 121 VPPLRKKHFGSKAEKVERVGSGTGSVLVKKSVGNEVVAEDSTEVVDTKRRCAWITPNTDQ 180

Query: 181 CYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKL 240
           CY AFHDEEWGVPVHDDK+LFELL LSGALAEL WPAIL+KRH+FREI+LDFDP+AVSKL
Sbjct: 181 CYVAFHDEEWGVPVHDDKELFELLSLSGALAELPWPAILSKRHIFREIYLDFDPSAVSKL 240

Query: 241 NEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRY 300
           NEKK+ APGS A  LLSELK+R+IIEN RQ+CKV++EFGSF+ Y+WNFVNHKPII QFRY
Sbjct: 241 NEKKIAAPGSVAIPLLSELKLRSIIENARQVCKVVEEFGSFDKYIWNFVNHKPIIGQFRY 300

Query: 301 PRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTA 351
           PRQVP KT KAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRF EC+ T   
Sbjct: 301 PRQVPVKTPKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECLATGGE 360

BLAST of Cucsa.105900 vs. NCBI nr
Match: gi|1009177582|ref|XP_015870050.1| (PREDICTED: probable GMP synthase [glutamine-hydrolyzing])

HSP 1 Score: 464.5 bits (1194), Expect = 1.7e-127
Identity = 238/366 (65.03%), Positives = 276/366 (75.41%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPL 60
           MSGPPR+RS N+AD++SRPVLGP GNKA   + RKP  KPLKK EKP QE E K      
Sbjct: 1   MSGPPRLRSQNIADTESRPVLGPAGNKATPTDNRKPASKPLKKAEKPSQETEKKAGVHHH 60

Query: 61  SPPQCVTVPSVLR-----QQDRHQ---AILNLSMNASCSSDASSDSFNSRASSARGTRQR 120
           SPPQ  TVP +LR     QQ+ HQ    +LN SMNASCSSDASS + +S + S R +R+ 
Sbjct: 61  SPPQRFTVPMILRRQKQQQQEHHQYQTMLLNSSMNASCSSDASSSTTDS-SHSWRASRRS 120

Query: 121 GPNLRRKQCSTV--------KGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDP 180
            P LR+K   +          G    + K  V +  V  D+   +++K+RCAW+TPNTD 
Sbjct: 121 VPPLRKKHFGSKAEKVERVGSGTGSVLVKKSVGNEVVAEDSTEVVDTKRRCAWITPNTDQ 180

Query: 181 CYAAFHDEEWGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKL 240
           CY AFHDEEWGVPVHDDK+LFELL LSGALAEL WPAIL+KRH+FREI+LDFDP+AVSKL
Sbjct: 181 CYVAFHDEEWGVPVHDDKELFELLSLSGALAELPWPAILSKRHIFREIYLDFDPSAVSKL 240

Query: 241 NEKKMVAPGSAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRY 300
           NEKK+ APGS A  LLSELK+R+IIEN RQ+CKV++EFGSF+ Y+WNFVNHKPII QFRY
Sbjct: 241 NEKKIAAPGSVAIPLLSELKLRSIIENARQVCKVVEEFGSFDKYIWNFVNHKPIIGQFRY 300

Query: 301 PRQVPDKTSKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTA 351
           PRQVP KT KAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLI CFRF EC+ T   
Sbjct: 301 PRQVPVKTPKAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLISCFRFQECLATGGE 360
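
The percentages in an HSP header are the identity and positive counts divided by the alignment length (gapped columns included). The following is a minimal Python sketch, not part of the original record, that recomputes them from a header line as a consistency check. The function name `recompute` and the regular expression are illustrative assumptions; the pattern accepts both "Positives" and the "Postives" spelling that some exports of this page use.

```python
import re

# Pattern for the HSP summary line as printed in this record.
# "Posi?tives" tolerates the misspelling "Postives" seen in some exports.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute(line):
    """Return recomputed and reported percentages for one HSP header line."""
    m = HSP_LINE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        # count / alignment length, as a percentage with two decimals
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # values printed in the record, for comparison
        "reported": (float(ident_pct), float(pos_pct)),
    }


if __name__ == "__main__":
    header = "Identity = 238/366 (65.03%), Postives = 276/366 (75.41%), Query Frame = 1"
    print(recompute(header))
```

If a recomputed value and the reported value disagree, the header line was probably damaged during export.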

BLAST of Cucsa.105900 vs. NCBI nr
Match: gi|720064030|ref|XP_010275821.1| (PREDICTED: uncharacterized protein LOC104610746 [Nelumbo nucifera])

HSP 1 Score: 458.4 bits (1178), Expect = 1.2e-125
Identity = 239/374 (63.90%), Positives = 288/374 (77.01%), Query Frame = 1

Query: 1   MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKR--- 60
           MSG PR+RS+NVADS++RPVLGP GNK R++ TRKP  KPL+K+EK  + V+ + K    
Sbjct: 1   MSGAPRVRSINVADSEARPVLGPAGNKTRSLVTRKPASKPLRKVEKTPEAVDEEKKAPSS 60

Query: 61  -VPLSPP--QCVTVPSVLRQQDRHQAI-LNLSMNASCSSDASSDSFNSRASSARGTRQRG 120
            V  SPP  Q V+VPS+LR   RH+ +  NLS+NASCSSDASSDS  SRAS+ R  R R 
Sbjct: 61  PVAASPPKLQPVSVPSILR---RHEFLHSNLSLNASCSSDASSDSVYSRASTGRLIRTRS 120

Query: 121 PNLRRKQCSTVKGADKAVEKVGVESVAVVVDTVGCLESKKRCAWVTPNTDPCYAAFHDEE 180
              RRK   ++   +K V     +S      +   +E+KKRCAWVTPNTDPCYAAFHDEE
Sbjct: 121 TPSRRKY--SISRPEKVVPDSASDS------SPDSIETKKRCAWVTPNTDPCYAAFHDEE 180

Query: 181 WGVPVHDDKKLFELLCLSGALAELTWPAILNKRHLFREIFLDFDPTAVSKLNEKKMVAPG 240
           WGVPVHDDKKLFELL LSGALAELTWP IL+KRH+FRE+F DFDP AVSKLNEKK+ APG
Sbjct: 181 WGVPVHDDKKLFELLVLSGALAELTWPTILSKRHIFREVFSDFDPVAVSKLNEKKITAPG 240

Query: 241 SAATSLLSELKVRAIIENGRQMCKVIDEFGSFNVYMWNFVNHKPIISQFRYPRQVPDKTS 300
           S A+SLLSELK+RAIIEN RQ+CKVIDEFGSF+ Y+W+FVNHKPIIS+FRYPRQVP K  
Sbjct: 241 STASSLLSELKLRAIIENARQICKVIDEFGSFDNYIWSFVNHKPIISKFRYPRQVPVKIP 300

Query: 301 KAEVISKDLVKRGFRSVGPTVIYTFMQVAGLTNDHLIGCFRFTECIETQTAEKGERDGGE 360
           KA+VISKDLV+RGFRSVGPTV+Y+FMQVAG+TNDHLI CFRF  C++T T  +G+    +
Sbjct: 301 KADVISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLINCFRFQVCMDTPTVSEGD---DK 360

Query: 361 MKLNPNEKMPEALK 368
           +++   E+ P   K
Sbjct: 361 LRIGKAEETPTGSK 360

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
3MG1_ECOLI  2.8e-37  42.22     DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 S...
GUAA_HELHP  3.6e-37  42.13     Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ...
3MGA_HAEIN  3.9e-31  39.11     DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D...
Match Name        E-value   Identity  Description
A0A0A0K8L6_CUCSA  1.8e-213  100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_7G432000 PE=4 SV=1
W9R0J8_9ROSA      5.5e-125  60.34     Putative Glutamine amidotransferase OS=Morus notabilis GN=L484_013516 PE=4 SV=1
M5W9T8_PRUPE      4.8e-121  63.22     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa026720mg PE=4 SV=1
A0A151QNJ1_CAJCA  6.9e-120  62.53     Putative GMP synthase [glutamine-hydrolyzing] OS=Cajanus cajan GN=KK1_047590 PE=...
A0A0D2VII3_GOSRA  6.9e-120  63.73     Uncharacterized protein OS=Gossypium raimondii GN=B456_013G216100 PE=4 SV=1
Match Name   E-value   Identity  Description
AT5G57970.1  8.4e-101  54.70     DNA glycosylase superfamily protein
AT1G15970.1  3.3e-97   54.22     DNA glycosylase superfamily protein
AT1G80850.1  3.2e-92   53.98     DNA glycosylase superfamily protein
AT1G75090.1  1.7e-69   43.79     DNA glycosylase superfamily protein
AT1G13635.1  3.7e-56   48.42     DNA glycosylase superfamily protein
Match Name                         E-value   Identity  Description
gi|778728928|ref|XP_004136097.2|   2.6e-213  100.00    PREDICTED: uncharacterized protein LOC101205558 [Cucumis sativus]
gi|659122505|ref|XP_008461179.1|   1.8e-209  98.65     PREDICTED: uncharacterized protein LOC103499838 [Cucumis melo]
gi|1009176825|ref|XP_015869637.1|  1.7e-127  65.03     PREDICTED: probable GMP synthase [glutamine-hydrolyzing]
gi|1009177582|ref|XP_015870050.1|  1.7e-127  65.03     PREDICTED: probable GMP synthase [glutamine-hydrolyzing]
gi|720064030|ref|XP_010275821.1|   1.2e-125  63.90     PREDICTED: uncharacterized protein LOC104610746 [Nelumbo nucifera]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR005019  Adenine_glyco
IPR011257  DNA_glycosylase
Vocabulary: Biological Process
Term        Definition
GO:0006284  base-excision repair
GO:0006281  DNA repair
Vocabulary: Molecular Function
Term        Definition
GO:0008725  DNA-3-methyladenine glycosylase activity
GO:0003824  catalytic activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0006281 DNA repair
cellular_component GO:0005575 cellular_component
molecular_function GO:0008725 DNA-3-methyladenine glycosylase activity
molecular_function GO:0003824 catalytic activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.105900.1  Cucsa.105900.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description            Source   Source Term        Source Description   Alignment
IPR005019  Methyladenine glycosylase  PFAM     PF03352            Adenine_glyco        coord: 161..334, score: 3.3
IPR011257  DNA glycosylase            GENE3D   G3DSA:1.10.340.30                       coord: 153..335, score: 2.0
IPR011257  DNA glycosylase            unknown  SSF48150           DNA-glycosylase      coord: 153..339, score: 2.12
None       No IPR available           PANTHER  PTHR31116          FAMILY NOT NAMED     coord: 2..371, score: 3.5E
None       No IPR available           PANTHER  PTHR31116:SF1      SUBFAMILY NOT NAMED  coord: 2..371, score: 3.5E
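
The three glycosylase domain hits above (PF03352, G3DSA:1.10.340.30, SSF48150) cover overlapping protein coordinates. The following is a minimal Python sketch, not part of the original record, that extracts the `coord: start..end` spans from annotation lines and merges overlapping ones into a single region. The helper names `spans` and `merge` are illustrative assumptions.

```python
import re

# Matches the "coord: start..end" fragment used in the InterPro table.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def spans(lines):
    """Collect (start, end) pairs from lines that contain a coord field."""
    found = []
    for line in lines:
        m = COORD.search(line)
        if m:
            found.append((int(m.group(1)), int(m.group(2))))
    return found


def merge(intervals):
    """Merge overlapping or directly adjacent closed intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous interval instead of opening a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


if __name__ == "__main__":
    domain_lines = [
        "PF03352 Adenine_glyco coord: 161..334",
        "G3DSA:1.10.340.30 coord: 153..335",
        "SSF48150 DNA-glycosylase coord: 153..339",
    ]
    print(merge(spans(domain_lines)))
```

The PANTHER family hits span nearly the whole protein (2..371), so they are left out of the example input; including them would collapse everything into one interval.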

The following gene(s) are paralogous to this gene:

None