Cucsa.284320 (gene) Cucumber (Gy14) v1

NameCucsa.284320
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionDNA-3-methyladenine glycosylase, putative
Locationscaffold02653 : 1146007 .. 1147855 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TAAACCTTATCTTTTTTtATTTTtCTCCAATATGTCATCAAAAGCCACTGTTAGAAGACATATTTTGGAGAGGCAAGCATGTCCTAAAGAGAAAGATAGGACAAGCCAAAACATTTTGTCCAAACACCTTAAGAAGATTTACCCAATTGGGTTACAAAGAACCACTTCTTCGCTATCTTTATCTTCAATGTCATTGTCTTTGTCTCAAAACTCAAATGACTCTTCTCTTACGGACTCCTCGATCCAACTAGATCAGAAAATTTCATACGCAATTCGCCTTATTACGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATTCAACAGCAAAGTCAAGAACTTAGTGATGGAGAATTGAGACGGTGCAACTGGATCACTCATACTAGTGGTAAGATTATGCGTATGCTTACACAGTTGAAATGCTTCCTAGCAAGTGATAATATAACATTAAATAAAAAAATAAGTTAGATACTAAATAACGTTTGTTTGAAAGTTCATATTATTGATGTGATAAATAATGTCATCAACTTAGAACAACAGTTGCTAACACTGTCATATTTCTTCATATAATGAGATTGTGACTTCTAATAATAATAGTTCTGATATGGATTAAGAAACACTATATCTTATTGGAAAAGAAAGAAAAAGTAGTTTTCCTCTATAGAATACGTTCCCCACATGCTAGACTACTTCATGACTTTACTGATGTTTTTCTCTTAATGTAATTGACATAGATAAAGCCTATGTATCTTTTCACGACGAGTGTTGGGGCGTTCCAGTGTACGATGACAAGTAGGTTGTCTATGTAACTTTTTGTTTACTTTACATGTCAATTCAATTAATAGATATGTAGTTAACTTTGGAAAAAAAAAAAaTGTTCAGCCGACTTTTCGAGCTACTCGCACTATCTGGGATGTTAATGGACTACAACTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGTAATATTACTCTGCCAGTAACACACCCTTTATGTACTTGAGTTAAATTTCAATCCCATGATAAATATACATGCACATTTGGTTGGTAAAAGGGAAGCTTTTGCTGGATTTGAGCCAAGTGTTGTTGCCAACATGGGGGAGAAAGAGATAACAGATGTAGCATCTGACAAGGCCATTATGCTGGTGGAAAGCAGAGTGAGGTGCATAGTTGACAATGCCAAATGCATATTGAAGGCAAGCCTAGATCAACTGAGTTTTTAACAAGTTGTTTCATAGATTGCATTATTAATCAGAATTACTGGTTTTTAATTCAACTGCAGATAGCTAGAGATTTTGGATCATTCAGTAACTATATGTGGAGCTATGTGAACTTCAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCTAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGTGGTTTTCGGTTTGTTGGGCCAGTGATTGTTTATTCATTCATGCAAGCGGCTGGGTTGACAATCGATCATCTTATCGATTGTTTTCGACATGGTGAATGTGTGAATCTTGCAGAAAGACCATGGAGACATATCTGAAAACAAGCTTTCCTAATTTCCCTTTAATGTTGTGGTTATTTTGTGAGCATTAAGTTAAAAAATATGAATAATTATATATAGAGAAAGAGAAAAGAACCAAGATGGAATTTATCAACTCTTTATTTATCAGTTTCTGCTAGAATGGCCAATTCTGCCACTGAAATTAAAGCCTTGAGAATTGATCAGTTCATATCATCAACAAATGCAGAGCTCACCATTACAACAGTTGTGGCGTAAAGGTATTCGACAAGTTGGAGCTGGTGGAGAATGAGAATGG

mRNA sequence

TAAACCTTATCTTTTTTTATTTTTCTCCAATATGTCATCAAAAGCCACTGTTAGAAGACATATTTTGGAGAGGCAAGCATGTCCTAAAGAGAAAGATAGGACAAGCCAAAACATTTTGTCCAAACACCTTAAGAAGATTTACCCAATTGGGTTACAAAGAACCACTTCTTCGCTATCTTTATCTTCAATGTCATTGTCTTTGTCTCAAAACTCAAATGACTCTTCTCTTACGGACTCCTCGATCCAACTAGATCAGAAAATTTCATACGCAATTCGCCTTATTACGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATTCAACAGCAAAGTCAAGAACTTAGTGATGGAGAATTGAGACGGTGCAACTGGATCACTCATACTAGTGATAAAGCCTATGTATCTTTTCACGACGAGTGTTGGGGCGTTCCAGTGTACGATGACAACCGACTTTTCGAGCTACTCGCACTATCTGGGATGTTAATGGACTACAACTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTGTTGTTGCCAACATGGGGGAGAAAGAGATAACAGATGTAGCATCTGACAAGGCCATTATGCTGGTGGAAAGCAGAGTGAGGTGCATAGTTGACAATGCCAAATGCATATTGAAGATAGCTAGAGATTTTGGATCATTCAGTAACTATATGTGGAGCTATGTGAACTTCAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCTAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGTGGTTTTCGGTTTGTTGGGCCAGTGATTGTTTATTCATTCATGCAAGCGGCTGGGTTGACAATCGATCATCTTATCGATTGTTTTCGACATGGTGAATGTGTGAATCTTGCAGAAAGACCATGGAGACATATCTGAAAACAAGCTTTCCTAATTTCCCTTTAATGTTGTGGTTATTTTGTGAGCATTAAGTTAAAAAATATGAATAATTATATATAGAGAAAGAGAAAAGAACCAAGATGGAATTTATCAACTCTTTATTTATCAGTTTCTGCTAGAATGGCCAATTCTGCCACTGAAATTAAAGCCTTGAGAATTGATCAGTTCATATCATCAACAAATGCAGAGCTCACCATTACAACAGTTGTGGCGTAAAGGTATTCGACAAGTTGGAGCTGGTGGAGAATGAGAATGG

Coding sequence (CDS)

ATGTCATCAAAAGCCACTGTTAGAAGACATATTTTGGAGAGGCAAGCATGTCCTAAAGAGAAAGATAGGACAAGCCAAAACATTTTGTCCAAACACCTTAAGAAGATTTACCCAATTGGGTTACAAAGAACCACTTCTTCGCTATCTTTATCTTCAATGTCATTGTCTTTGTCTCAAAACTCAAATGACTCTTCTCTTACGGACTCCTCGATCCAACTAGATCAGAAAATTTCATACGCAATTCGCCTTATTACGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATTCAACAGCAAAGTCAAGAACTTAGTGATGGAGAATTGAGACGGTGCAACTGGATCACTCATACTAGTGATAAAGCCTATGTATCTTTTCACGACGAGTGTTGGGGCGTTCCAGTGTACGATGACAACCGACTTTTCGAGCTACTCGCACTATCTGGGATGTTAATGGACTACAACTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTGTTGTTGCCAACATGGGGGAGAAAGAGATAACAGATGTAGCATCTGACAAGGCCATTATGCTGGTGGAAAGCAGAGTGAGGTGCATAGTTGACAATGCCAAATGCATATTGAAGATAGCTAGAGATTTTGGATCATTCAGTAACTATATGTGGAGCTATGTGAACTTCAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCTAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGTGGTTTTCGGTTTGTTGGGCCAGTGATTGTTTATTCATTCATGCAAGCGGCTGGGTTGACAATCGATCATCTTATCGATTGTTTTCGACATGGTGAATGTGTGAATCTTGCAGAAAGACCATGGAGACATATCTGA

Protein sequence

MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNLAERPWRHI*
BLAST of Cucsa.284320 vs. Swiss-Prot
Match: GUAA_HELHP (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) GN=guaA PE=3 SV=1)

HSP 1 Score: 164.1 bits (414), Expect = 2.5e-39
Identity = 78/204 (38.24%), Postives = 123/204 (60.29%), Query Frame = 1

Query: 95  LPKSIQQQSQELSDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLAL 154
           L KS+  ++Q+ ++G  E  RC W T   +   K Y  +HD  WG P+++D +LFE L L
Sbjct: 767 LQKSLGLEAQDSNEGVREKVRCAWATDKDEAARKLYEDYHDTEWGEPLHEDKKLFEHLVL 826

Query: 155 SGMLMDYNWTEIVKRRELFREAFAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVD 214
            G     +W  I+K+RE FR AF  F+P +VAN  E +I ++  ++ I+   +++   + 
Sbjct: 827 EGFQAGLSWITILKKREAFRVAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAII 886

Query: 215 NAKCILKIARDFGSFSNYMWSYVNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFV 274
           NAK  + + R+FGSF  Y+W +V  KP IN F    ++P  +P ++ I+KD+ KRGF+FV
Sbjct: 887 NAKAFMAVQREFGSFDKYIWGFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFV 946

Query: 275 GPVIVYSFMQAAGLTIDHLIDCFR 294
           G   +Y+ MQ+ G+  DHL  CF+
Sbjct: 947 GTTTMYAMMQSIGMVNDHLTSCFK 970

BLAST of Cucsa.284320 vs. Swiss-Prot
Match: 3MG1_ECOLI (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 SV=1)

HSP 1 Score: 144.8 bits (364), Expect = 1.5e-33
Identity = 64/181 (35.36%), Postives = 109/181 (60.22%), Query Frame = 1

Query: 111 LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFRE 170
           + RC W++   D  Y+++HD  WGVP  D  +LFE++ L G     +W  ++K+RE +R 
Sbjct: 1   MERCGWVSQ--DPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRA 60

Query: 171 AFAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWS 230
            F  F+P  VA M E+++  +  D  I+    +++ I+ NA+  L++ ++   F +++WS
Sbjct: 61  CFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWS 120

Query: 231 YVNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLID 290
           +VN +P + +      +P  +  ++A+SK + KRGF+FVG  I YSFMQA GL  DH++ 
Sbjct: 121 FVNHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVG 179

Query: 291 C 292
           C
Sbjct: 181 C 179

BLAST of Cucsa.284320 vs. Swiss-Prot
Match: 3MGA_HAEIN (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=tag PE=3 SV=1)

HSP 1 Score: 141.4 bits (355), Expect = 1.7e-32
Identity = 67/179 (37.43%), Postives = 106/179 (59.22%), Query Frame = 1

Query: 113 RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAF 172
           RC W+   S   Y+ +HD+ WG P +D  +LFE + L G     +W  ++K+RE +REAF
Sbjct: 4   RCPWVGEQS--IYIDYHDKEWGKPEFDSQKLFEKICLEGQQAGLSWITVLKKRESYREAF 63

Query: 173 AGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYV 232
             F+P  +A M   +I     +  ++   +++  IV NAK  L + +   +FS+++WS+V
Sbjct: 64  HQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAYLAMEKCGENFSDFIWSFV 123

Query: 233 NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDC 292
           N KP +N     R+VP ++  ++A+SK + KRGF F+G    Y+FMQ+ GL  DHL DC
Sbjct: 124 NHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTCYAFMQSMGLVDDHLNDC 180

BLAST of Cucsa.284320 vs. TrEMBL
Match: A0A0A0KUC5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G606920 PE=4 SV=1)

HSP 1 Score: 622.9 bits (1605), Expect = 2.2e-175
Identity = 308/308 (100.00%), Postives = 308/308 (100.00%), Query Frame = 1

Query: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN 60
           MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN
Sbjct: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHT 120
           SNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHT
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHT 120

Query: 121 SDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSVV 180
           SDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSVV
Sbjct: 121 SDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSVV 180

Query: 181 ANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINR 240
           ANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINR
Sbjct: 181 ANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINR 240

Query: 241 FRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNL 300
           FRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNL
Sbjct: 241 FRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNL 300

Query: 301 AERPWRHI 309
           AERPWRHI
Sbjct: 301 AERPWRHI 308

BLAST of Cucsa.284320 vs. TrEMBL
Match: A0A061E9D7_THECC (DNA glycosylase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_011427 PE=4 SV=1)

HSP 1 Score: 483.0 bits (1242), Expect = 2.7e-133
Identity = 237/317 (74.76%), Postives = 276/317 (87.07%), Query Frame = 1

Query: 3   SKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQNSN 62
           SKA VRRHILE+   PKEK++ +Q++LSKHLKKIYPIGLQR+TSSLSLSS+SLSLSQNSN
Sbjct: 7   SKANVRRHILEKNRSPKEKEKPAQSVLSKHLKKIYPIGLQRSTSSLSLSSLSLSLSQNSN 66

Query: 63  DSSLTD-SSIQLDQKISYAIRLITPPPERRE--VPLPKSIQ--------QQSQELSDGEL 122
           DSSLTD SS  L+QKIS A+ LI P  ERRE  VP+ KS+Q        Q SQ+   GEL
Sbjct: 67  DSSLTDHSSTPLEQKISLALSLIAPHHERREFVVPVVKSVQHHHHQQQQQPSQDPGSGEL 126

Query: 123 RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREA 182
           RRCNW+T  SDK YVSFHDE WGVPVYDDN+LFELLALSGMLMDYNWTEI+KR+EL+REA
Sbjct: 127 RRCNWVTKNSDKVYVSFHDEQWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELYREA 186

Query: 183 FAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSY 242
           F+GF+P +VA MG+KEI +++SDKAIML ESRVRCIVDNAKCILKI R++GSFS++MW Y
Sbjct: 187 FSGFDPEIVAKMGDKEINEISSDKAIMLAESRVRCIVDNAKCILKIVREYGSFSSFMWGY 246

Query: 243 VNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDC 302
           VN+KPTINR+++PRNVPLR+PKAEAIS+D++KRGFRFVGPVIV SFMQAAGLTIDHL+DC
Sbjct: 247 VNYKPTINRYKYPRNVPLRTPKAEAISRDLLKRGFRFVGPVIVCSFMQAAGLTIDHLVDC 306

Query: 303 FRHGECVNLAERPWRHI 309
           FR+ ECV LAERPWRHI
Sbjct: 307 FRYSECVGLAERPWRHI 323

BLAST of Cucsa.284320 vs. TrEMBL
Match: B9RLG1_RICCO (DNA-3-methyladenine glycosylase, putative OS=Ricinus communis GN=RCOM_1466540 PE=4 SV=1)

HSP 1 Score: 477.6 bits (1228), Expect = 1.1e-131
Identity = 239/319 (74.92%), Postives = 282/319 (88.40%), Query Frame = 1

Query: 3   SKATVRRHILERQAC-PKEKDRTSQNIL---SKHLKKIYPIGLQRTTSSLSLSSMSLSLS 62
           SKATVR+ +LE+++    EK+RT+ N L   SK+LKK+YPIGL R+ SSLSLSS+SLSLS
Sbjct: 2   SKATVRKQVLEKKSIFTNEKERTTSNQLGFFSKNLKKVYPIGLHRSNSSLSLSSVSLSLS 61

Query: 63  QNSNDSSLTD-SSIQLDQKISYAIRLITPPPERREVP-LPKSIQQQ-------SQELSDG 122
           +NSNDSSLTD S+  LDQKIS A+RLITP  ERREVP L +++QQQ       SQE + G
Sbjct: 62  ENSNDSSLTDYSNTPLDQKISLALRLITPL-ERREVPALSRNVQQQQQQQQQQSQESNGG 121

Query: 123 ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFR 182
           E+RRCNWIT  SDK YV+FHDECWGVPVYDDN+LFELLALSGMLMDYNWTEI+KR++LFR
Sbjct: 122 EIRRCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKQLFR 181

Query: 183 EAFAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMW 242
           EAFAGF+P++VANMGEKEI D+AS+KAIML +SRVRCIVDNAKCI KIAR+FGSFS++MW
Sbjct: 182 EAFAGFDPNIVANMGEKEILDIASNKAIMLADSRVRCIVDNAKCIAKIAREFGSFSSFMW 241

Query: 243 SYVNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLI 302
            +VN+KPTIN++++PRNVPLR+PKAEAISKD++KRGFRFVGPVIVYSFMQAAGLTIDHL+
Sbjct: 242 GHVNYKPTINKYKYPRNVPLRTPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGLTIDHLV 301

Query: 303 DCFRHGECVNLAERPWRHI 309
           DCFRHGECV LAERPWRHI
Sbjct: 302 DCFRHGECVGLAERPWRHI 319

BLAST of Cucsa.284320 vs. TrEMBL
Match: B9HWT9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s14710g PE=4 SV=2)

HSP 1 Score: 474.2 bits (1219), Expect = 1.2e-130
Identity = 235/316 (74.37%), Postives = 277/316 (87.66%), Query Frame = 1

Query: 4   KATVRRHILERQ-ACPKEKDR---TSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQ 63
           KA VR+ ILE+     KEK++    +Q + SKHLK++YPIGL R+TSSLSLSS+SLSLSQ
Sbjct: 3   KANVRKQILEKNNILIKEKEKPISNTQGLFSKHLKRVYPIGLHRSTSSLSLSSVSLSLSQ 62

Query: 64  NSNDSSLTDSS-IQLDQKISYAIRLITPPPERREVPLPKSIQ------QQSQELSDGELR 123
           NSNDSSLTDSS + L+QKIS A+RLI+P  ERREVP+ ++ Q      QQ+Q+ +DGE++
Sbjct: 63  NSNDSSLTDSSAVPLEQKISLALRLISPL-ERREVPVARNFQPQQQQQQQNQDSNDGEVK 122

Query: 124 RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAF 183
           RCNWIT  SDK YV+FHDECWGVPVYDDN+LFELLALSGMLMDYNWTEI+KR+ELFREAF
Sbjct: 123 RCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFREAF 182

Query: 184 AGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYV 243
            GF+P++VA MGEKEI ++AS+KAIML ESRVRCIVDN+KCILKIAR+FGSFSNYMW  V
Sbjct: 183 EGFDPNIVAKMGEKEIMEIASNKAIMLAESRVRCIVDNSKCILKIAREFGSFSNYMWGNV 242

Query: 244 NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCF 303
           NFKPTINR+++PRNVPLRSPKAEAISKD++KRGFRF GPVIVYSFMQAAGLTIDHL+DCF
Sbjct: 243 NFKPTINRYKYPRNVPLRSPKAEAISKDLLKRGFRFAGPVIVYSFMQAAGLTIDHLVDCF 302

Query: 304 RHGECVNLAERPWRHI 309
           R+ ECV+LAERPWRHI
Sbjct: 303 RYSECVSLAERPWRHI 317

BLAST of Cucsa.284320 vs. TrEMBL
Match: A0A067FVV1_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g021082mg PE=4 SV=1)

HSP 1 Score: 471.1 bits (1211), Expect = 1.0e-129
Identity = 233/317 (73.50%), Postives = 273/317 (86.12%), Query Frame = 1

Query: 3   SKATVRRHILERQACPKEKD-RTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQNS 62
           SKA VRRHILE+   PKEK+ + +Q++LSKHLKK+YPIGL R++SSLSLSS+SLSLSQNS
Sbjct: 2   SKANVRRHILEKNRSPKEKEPKPTQSLLSKHLKKVYPIGLHRSSSSLSLSSLSLSLSQNS 61

Query: 63  NDSSLTDSSIQ-LDQKISYAIRLITPPPERREVPLPKSIQ---------QQSQELSDGEL 122
           NDSS+TD+S   L+Q+IS A+RLITPP ERREV + K+ Q         QQSQ+   GEL
Sbjct: 62  NDSSVTDNSNSPLEQRISLALRLITPP-ERREVTVAKNAQPQQQQQQQQQQSQDSCCGEL 121

Query: 123 RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREA 182
           +RCNWIT  SDK YV+FHDECWGVPVYDDN+LFELLALSGMLMDYNWTEI+KR+ELFREA
Sbjct: 122 KRCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFREA 181

Query: 183 FAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSY 242
           F GF+P  VA MGEKEI +++S+ AIML E RVRCIVDNAKCI+KI  +FGSFS++MW Y
Sbjct: 182 FGGFDPKSVAKMGEKEILEISSNTAIMLAECRVRCIVDNAKCIVKILNEFGSFSSFMWGY 241

Query: 243 VNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDC 302
           VNFKP IN+FR+PRNVPLRSPKAEAIS+D++KRGFR VGPVIVYSFMQAAGLTIDHL+DC
Sbjct: 242 VNFKPMINKFRYPRNVPLRSPKAEAISRDLLKRGFRLVGPVIVYSFMQAAGLTIDHLVDC 301

Query: 303 FRHGECVNLAERPWRHI 309
           FR+ ECV+LAERPWRHI
Sbjct: 302 FRYSECVSLAERPWRHI 317

BLAST of Cucsa.284320 vs. TAIR10
Match: AT1G13635.1 (AT1G13635.1 DNA glycosylase superfamily protein)

HSP 1 Score: 412.1 bits (1058), Expect = 2.9e-115
Identity = 206/305 (67.54%), Postives = 252/305 (82.62%), Query Frame = 1

Query: 8   RRHILERQACPKEKD-RTSQNILSKHLKKIYPIGLQRTTSS-LSLSSMSLSLSQNSNDSS 67
           R+ I+E+    +EK+ + + N  +KHLK+IYPI LQR+TSS  SLSS+SLSLSQNS DS 
Sbjct: 8   RKEIVEKSKSVREKEIKQNSNFFAKHLKRIYPITLQRSTSSSFSLSSISLSLSQNSTDSV 67

Query: 68  LTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQ-SQEL-SDGELRRCNWITHTSDK 127
            TDS+  L+QKIS A+ LI+ P  RRE+ +PKSI QQ  Q+  S  E +RCNWIT  SD+
Sbjct: 68  STDSNSTLEQKISLALGLISSP-HRREIFVPKSIPQQLCQDFNSSDEPKRCNWITKKSDE 127

Query: 128 AYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSVVANM 187
            YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA M
Sbjct: 128 VYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREAFCEFDPNRVAKM 187

Query: 188 GEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINRFRH 247
           GEKEI ++AS+KAIML ESRVRCIVDNAKCI K+  +FGSFS+++W ++++KP IN+F++
Sbjct: 188 GEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWGFMDYKPIINKFKY 247

Query: 248 PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNLAER 307
            RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHL+DCFRHG+CV+LAER
Sbjct: 248 SRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLAER 307

Query: 308 PWRHI 309
           PWRHI
Sbjct: 308 PWRHI 311

BLAST of Cucsa.284320 vs. TAIR10
Match: AT1G75090.1 (AT1G75090.1 DNA glycosylase superfamily protein)

HSP 1 Score: 227.3 bits (578), Expect = 1.3e-59
Identity = 115/267 (43.07%), Postives = 166/267 (62.17%), Query Frame = 1

Query: 43  RTTSSLSLSSMSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQ 102
           R T S +      + S +++DSS + SS +     +     +T P +R  V    ++   
Sbjct: 44  RVTKSPATKKPDSNFSVSTDDSSSSSSSSERSSVNTTNSGKVTTPSKRNGVEKLNNVVAS 103

Query: 103 SQELSD------GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDY 162
              + D      G ++RC+WIT  SD  YV FHDE WGVPV DD +LFELL  S  L ++
Sbjct: 104 VAVVEDISPKIPGPVKRCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEF 163

Query: 163 NWTEIVKRRELFREAFAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILK 222
           +W  I++RR+ FR+ F  F+PS +A   EK +  +  +  ++L E ++R IV+NAK +LK
Sbjct: 164 SWPSILRRRDDFRKLFEEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLK 223

Query: 223 IARDFGSFSNYMWSYVNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYS 282
           + ++FGSFSNY W +VN KP  N +R+ R VP++SPKAE ISKDM++RGFR VGP ++YS
Sbjct: 224 VKQEFGSFSNYCWRFVNHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYS 283

Query: 283 FMQAAGLTIDHLIDCFRHGECVNLAER 304
           F+QA+G+  DHL  CFR+ EC    ER
Sbjct: 284 FLQASGIVNDHLTACFRYQECNVETER 310

BLAST of Cucsa.284320 vs. TAIR10
Match: AT5G57970.1 (AT5G57970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 221.5 bits (563), Expect = 7.3e-58
Identity = 112/262 (42.75%), Postives = 163/262 (62.21%), Query Frame = 1

Query: 42  QRTTSSLSLSSMSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQ 101
           Q   S+LSL++ S S   + +      S+ +L +  S   R  + P + R V    ++  
Sbjct: 87  QNLNSNLSLNA-SFSSDASMDSFHSRASTGRLIRSYSVGSRSKSYPSKPRSVVSEGALD- 146

Query: 102 QSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEI 161
            S        +RC W+T  SD  Y+ FHDE WGVPV+DD RLFELL LSG L ++ W  I
Sbjct: 147 -SPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTI 206

Query: 162 VKRRELFREAFAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDF 221
           + +R+ FRE FA F+P+ +  + EK+I    S  + +L + ++R +++NA+ ILK+  ++
Sbjct: 207 LSKRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEY 266

Query: 222 GSFSNYMWSYVNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAA 281
           GSF  Y+WS+V  K  +++FR+ R VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAA
Sbjct: 267 GSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAA 326

Query: 282 GLTIDHLIDCFRHGECVNLAER 304
           G+T DHL  CFR   C+   ER
Sbjct: 327 GITNDHLTSCFRFHHCIFEHER 345

BLAST of Cucsa.284320 vs. TAIR10
Match: AT1G15970.1 (AT1G15970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 221.5 bits (563), Expect = 7.3e-58
Identity = 113/261 (43.30%), Postives = 166/261 (63.60%), Query Frame = 1

Query: 45  TSSLSLSSMSLSLSQNSNDSSLTDSS---IQLDQKISYAIRLITPPPERREVPLPKSIQQ 104
           +S L  +S S++ S +S+ SS  +SS   +         +R        R++ + K  ++
Sbjct: 75  SSILRKNSASMTASYSSDASSSCESSPLSVASSSSCKKVVRRSGSVSSTRKLSVGKEEEK 134

Query: 105 QSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEI 164
            S +      +RC WIT  +D  YV+FHDE WGVPV+DD +LFELL LSG L + +WT+I
Sbjct: 135 VSGDCFADGRKRCAWITPKADPCYVAFHDEEWGVPVHDDKKLFELLCLSGALAELSWTDI 194

Query: 165 VKRRELFREAFAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDF 224
           + RR + RE F  F+P  VA + +K++T   +    +L E ++R I+DN++ + KI  + 
Sbjct: 195 LSRRHILREVFMDFDPVAVAELNDKKLTAPGTAAISLLSEVKIRSILDNSRHVRKIIAEC 254

Query: 225 GSFSNYMWSYVNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAA 284
           GS   YMW++VN KPT ++FR+ R VP+++ KAE ISKD+V+RGFR V P ++YSFMQAA
Sbjct: 255 GSLKKYMWNFVNNKPTQSQFRYQRQVPVKTSKAEFISKDLVRRGFRSVSPTVIYSFMQAA 314

Query: 285 GLTIDHLIDCFRHGECVNLAE 303
           GLT DHLI CFR+ +C   AE
Sbjct: 315 GLTNDHLIGCFRYQDCCVDAE 335

BLAST of Cucsa.284320 vs. TAIR10
Match: AT1G80850.1 (AT1G80850.1 DNA glycosylase superfamily protein)

HSP 1 Score: 220.7 bits (561), Expect = 1.2e-57
Identity = 112/262 (42.75%), Postives = 165/262 (62.98%), Query Frame = 1

Query: 41  LQRTTSSLSLSSMSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQ 100
           L+R   S++ +S S   S +   S L+ +S    +++      ++     R     +  +
Sbjct: 65  LRRNGISMT-ASYSSDASSSCESSPLSMTSTSSGKRVLRRSGSVSSSSSLRRNLTEERDE 124

Query: 101 QQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTE 160
           + S    DG  +RC WIT  SD+ Y++FHDE WGVPV+DD RLFELL+LSG L + +W +
Sbjct: 125 KASDCFCDGR-KRCAWITPKSDQCYIAFHDEEWGVPVHDDKRLFELLSLSGALAELSWKD 184

Query: 161 IVKRRELFREAFAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARD 220
           I+ +R+LFRE F  F+P  ++ +  K+IT        +L E ++R I++NA  + KI   
Sbjct: 185 ILSKRQLFREVFMDFDPIAISELTNKKITSPEIAATTLLSEQKLRSILENANQVCKIIGA 244

Query: 221 FGSFSNYMWSYVNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQA 280
           FGSF  Y+W++VN KPT ++FR+PR VP+++ KAE ISKD+V+RGFR V P ++YSFMQ 
Sbjct: 245 FGSFDKYIWNFVNQKPTQSQFRYPRQVPVKTSKAELISKDLVRRGFRSVSPTVIYSFMQT 304

Query: 281 AGLTIDHLIDCFRHGECVNLAE 303
           AGLT DHL  CFRH +C+   E
Sbjct: 305 AGLTNDHLTCCFRHHDCMTKDE 324

BLAST of Cucsa.284320 vs. NCBI nr
Match: gi|449435284|ref|XP_004135425.1| (PREDICTED: uncharacterized protein LOC101218195 [Cucumis sativus])

HSP 1 Score: 622.9 bits (1605), Expect = 3.1e-175
Identity = 308/308 (100.00%), Postives = 308/308 (100.00%), Query Frame = 1

Query: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN 60
           MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN
Sbjct: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHT 120
           SNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHT
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHT 120

Query: 121 SDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSVV 180
           SDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSVV
Sbjct: 121 SDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSVV 180

Query: 181 ANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINR 240
           ANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINR
Sbjct: 181 ANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINR 240

Query: 241 FRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNL 300
           FRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNL
Sbjct: 241 FRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNL 300

Query: 301 AERPWRHI 309
           AERPWRHI
Sbjct: 301 AERPWRHI 308

BLAST of Cucsa.284320 vs. NCBI nr
Match: gi|659091306|ref|XP_008446481.1| (PREDICTED: uncharacterized protein LOC103489204 [Cucumis melo])

HSP 1 Score: 617.5 bits (1591), Expect = 1.3e-173
Identity = 304/308 (98.70%), Postives = 307/308 (99.68%), Query Frame = 1

Query: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN 60
           MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSS+SLSLSQN
Sbjct: 63  MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 122

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHT 120
           SNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHT
Sbjct: 123 SNDSSLTDSSIQLDQKISYAIRLITPPPERREVPLPKSIQQQSQELSDGELRRCNWITHT 182

Query: 121 SDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSVV 180
           SDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS+V
Sbjct: 183 SDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSIV 242

Query: 181 ANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTINR 240
           ANMGEKEITD+ASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWS VNFKPTINR
Sbjct: 243 ANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSSVNFKPTINR 302

Query: 241 FRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNL 300
           FRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNL
Sbjct: 303 FRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVNL 362

Query: 301 AERPWRHI 309
           AERPWRHI
Sbjct: 363 AERPWRHI 370

BLAST of Cucsa.284320 vs. NCBI nr
Match: gi|590698505|ref|XP_007045734.1| (DNA glycosylase superfamily protein isoform 1 [Theobroma cacao])

HSP 1 Score: 483.0 bits (1242), Expect = 3.8e-133
Identity = 237/317 (74.76%), Postives = 276/317 (87.07%), Query Frame = 1

Query: 3   SKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQNSN 62
           SKA VRRHILE+   PKEK++ +Q++LSKHLKKIYPIGLQR+TSSLSLSS+SLSLSQNSN
Sbjct: 7   SKANVRRHILEKNRSPKEKEKPAQSVLSKHLKKIYPIGLQRSTSSLSLSSLSLSLSQNSN 66

Query: 63  DSSLTD-SSIQLDQKISYAIRLITPPPERRE--VPLPKSIQ--------QQSQELSDGEL 122
           DSSLTD SS  L+QKIS A+ LI P  ERRE  VP+ KS+Q        Q SQ+   GEL
Sbjct: 67  DSSLTDHSSTPLEQKISLALSLIAPHHERREFVVPVVKSVQHHHHQQQQQPSQDPGSGEL 126

Query: 123 RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREA 182
           RRCNW+T  SDK YVSFHDE WGVPVYDDN+LFELLALSGMLMDYNWTEI+KR+EL+REA
Sbjct: 127 RRCNWVTKNSDKVYVSFHDEQWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELYREA 186

Query: 183 FAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSY 242
           F+GF+P +VA MG+KEI +++SDKAIML ESRVRCIVDNAKCILKI R++GSFS++MW Y
Sbjct: 187 FSGFDPEIVAKMGDKEINEISSDKAIMLAESRVRCIVDNAKCILKIVREYGSFSSFMWGY 246

Query: 243 VNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDC 302
           VN+KPTINR+++PRNVPLR+PKAEAIS+D++KRGFRFVGPVIV SFMQAAGLTIDHL+DC
Sbjct: 247 VNYKPTINRYKYPRNVPLRTPKAEAISRDLLKRGFRFVGPVIVCSFMQAAGLTIDHLVDC 306

Query: 303 FRHGECVNLAERPWRHI 309
           FR+ ECV LAERPWRHI
Sbjct: 307 FRYSECVGLAERPWRHI 323

BLAST of Cucsa.284320 vs. NCBI nr
Match: gi|255547045|ref|XP_002514580.1| (PREDICTED: DNA-3-methyladenine glycosylase [Ricinus communis])

HSP 1 Score: 477.6 bits (1228), Expect = 1.6e-131
Identity = 239/319 (74.92%), Postives = 282/319 (88.40%), Query Frame = 1

Query: 3   SKATVRRHILERQAC-PKEKDRTSQNIL---SKHLKKIYPIGLQRTTSSLSLSSMSLSLS 62
           SKATVR+ +LE+++    EK+RT+ N L   SK+LKK+YPIGL R+ SSLSLSS+SLSLS
Sbjct: 2   SKATVRKQVLEKKSIFTNEKERTTSNQLGFFSKNLKKVYPIGLHRSNSSLSLSSVSLSLS 61

Query: 63  QNSNDSSLTD-SSIQLDQKISYAIRLITPPPERREVP-LPKSIQQQ-------SQELSDG 122
           +NSNDSSLTD S+  LDQKIS A+RLITP  ERREVP L +++QQQ       SQE + G
Sbjct: 62  ENSNDSSLTDYSNTPLDQKISLALRLITPL-ERREVPALSRNVQQQQQQQQQQSQESNGG 121

Query: 123 ELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFR 182
           E+RRCNWIT  SDK YV+FHDECWGVPVYDDN+LFELLALSGMLMDYNWTEI+KR++LFR
Sbjct: 122 EIRRCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKQLFR 181

Query: 183 EAFAGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMW 242
           EAFAGF+P++VANMGEKEI D+AS+KAIML +SRVRCIVDNAKCI KIAR+FGSFS++MW
Sbjct: 182 EAFAGFDPNIVANMGEKEILDIASNKAIMLADSRVRCIVDNAKCIAKIAREFGSFSSFMW 241

Query: 243 SYVNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLI 302
            +VN+KPTIN++++PRNVPLR+PKAEAISKD++KRGFRFVGPVIVYSFMQAAGLTIDHL+
Sbjct: 242 GHVNYKPTINKYKYPRNVPLRTPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGLTIDHLV 301

Query: 303 DCFRHGECVNLAERPWRHI 309
           DCFRHGECV LAERPWRHI
Sbjct: 302 DCFRHGECVGLAERPWRHI 319

BLAST of Cucsa.284320 vs. NCBI nr
Match: gi|743797932|ref|XP_011009373.1| (PREDICTED: uncharacterized protein LOC105114510 [Populus euphratica])

HSP 1 Score: 474.9 bits (1221), Expect = 1.0e-130
Identity = 235/316 (74.37%), Postives = 279/316 (88.29%), Query Frame = 1

Query: 4   KATVRRHILERQACP-KEKDR---TSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQ 63
           KA VR+ ILE+     KEK++   ++Q + SKHLK++YPIGL R+TSSLSLSS+SLSLSQ
Sbjct: 3   KANVRKQILEKNNISIKEKEKPISSTQGLFSKHLKRVYPIGLHRSTSSLSLSSVSLSLSQ 62

Query: 64  NSNDSSLTDSS-IQLDQKISYAIRLITPPPERREVPLPKSIQ------QQSQELSDGELR 123
           NSNDSSLTDSS + L+QKIS A+RLI+P  ERREVP+ ++ Q      QQ+Q+ +DGE++
Sbjct: 63  NSNDSSLTDSSAVPLEQKISLALRLISPL-ERREVPVARNFQPQQQQQQQNQDSNDGEVK 122

Query: 124 RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAF 183
           RCNWIT  SDK YV+FHDECWGVPVYDDN+LFELLALSGMLMDYNWTEI+KR+ELFREAF
Sbjct: 123 RCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFREAF 182

Query: 184 AGFEPSVVANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYV 243
            GF+P++VA MGEKEI ++AS+KAI+L ESRVRCIVDN+KCILKIAR+FGSFSNYMW  V
Sbjct: 183 EGFDPNIVAKMGEKEIMEIASNKAIILAESRVRCIVDNSKCILKIAREFGSFSNYMWGNV 242

Query: 244 NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCF 303
           NFKPTINR+++PRNVPLRSPKAEAISKD++KRGFRFVGPVIVYSFMQAAGLTIDHL+DCF
Sbjct: 243 NFKPTINRYKYPRNVPLRSPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGLTIDHLVDCF 302

Query: 304 RHGECVNLAERPWRHI 309
           R+ ECV+LAERPWRHI
Sbjct: 303 RYSECVSLAERPWRHI 317

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GUAA_HELHP2.5e-3938.24Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ... [more]
3MG1_ECOLI1.5e-3335.36DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 S... [more]
3MGA_HAEIN1.7e-3237.43DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D... [more]
Match NameE-valueIdentityDescription
A0A0A0KUC5_CUCSA2.2e-175100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_5G606920 PE=4 SV=1[more]
A0A061E9D7_THECC2.7e-13374.76DNA glycosylase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_011427 P... [more]
B9RLG1_RICCO1.1e-13174.92DNA-3-methyladenine glycosylase, putative OS=Ricinus communis GN=RCOM_1466540 PE... [more]
B9HWT9_POPTR1.2e-13074.37Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s14710g PE=4 SV=2[more]
A0A067FVV1_CITSI1.0e-12973.50Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g021082mg PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G13635.12.9e-11567.54 DNA glycosylase superfamily protein[more]
AT1G75090.11.3e-5943.07 DNA glycosylase superfamily protein[more]
AT5G57970.17.3e-5842.75 DNA glycosylase superfamily protein[more]
AT1G15970.17.3e-5843.30 DNA glycosylase superfamily protein[more]
AT1G80850.11.2e-5742.75 DNA glycosylase superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449435284|ref|XP_004135425.1|3.1e-175100.00PREDICTED: uncharacterized protein LOC101218195 [Cucumis sativus][more]
gi|659091306|ref|XP_008446481.1|1.3e-17398.70PREDICTED: uncharacterized protein LOC103489204 [Cucumis melo][more]
gi|590698505|ref|XP_007045734.1|3.8e-13374.76DNA glycosylase superfamily protein isoform 1 [Theobroma cacao][more]
gi|255547045|ref|XP_002514580.1|1.6e-13174.92PREDICTED: DNA-3-methyladenine glycosylase [Ricinus communis][more]
gi|743797932|ref|XP_011009373.1|1.0e-13074.37PREDICTED: uncharacterized protein LOC105114510 [Populus euphratica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR005019Adenine_glyco
IPR011257DNA_glycosylase
Vocabulary: Biological Process
TermDefinition
GO:0006284base-excision repair
GO:0006281DNA repair
Vocabulary: Molecular Function
TermDefinition
GO:0008725DNA-3-methyladenine glycosylase activity
GO:0003824catalytic activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006284 base-excision repair
biological_process GO:0006281 DNA repair
cellular_component GO:0005575 cellular_component
molecular_function GO:0008725 DNA-3-methyladenine glycosylase activity
molecular_function GO:0003824 catalytic activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.284320.1Cucsa.284320.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR005019Methyladenine glycosylasePFAMPF03352Adenine_glycocoord: 120..294
score: 1.0
IPR011257DNA glycosylaseGENE3DG3DSA:1.10.340.30coord: 112..294
score: 5.7
IPR011257DNA glycosylaseunknownSSF48150DNA-glycosylasecoord: 112..298
score: 4.32
NoneNo IPR availablePANTHERPTHR31116FAMILY NOT NAMEDcoord: 1..308
score: 7.2E
NoneNo IPR availablePANTHERPTHR31116:SF23-METHYLADENINE GLYCOSYLASE I-RELATEDcoord: 1..308
score: 7.2E

The following gene(s) are paralogous to this gene:

None