Lsi04G003420 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi04G003420
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: DNA-3-methyladenine glycosylase, putative
Location: chr04 : 3422632 .. 3424259 (+)
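The location string above can be split into its parts programmatically. The following is a minimal sketch, not part of the database: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse a location such as 'chr04 : 3422632 .. 3424259 (+)'.

    Returns (chromosome, start, end, strand, span_length).
    ASSUMPTION: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return chrom, start, end, strand, end - start + 1

print(parse_location("chr04 : 3422632 .. 3424259 (+)"))
```

Under the inclusive-coordinate assumption, the span covers 1628 positions on the plus strand of chr04.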
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
CAAACCTTTATAATCTCAATCCATTTTTGTATCAAAATTGTCCCATTAAGTCTTATCTTATTTTCCAATATGTCATCCAAAGCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAACACATTTTGTCCAAACACCTTAAGAAGATTTACCCAATTGGGTTACAAAGAACCACTTCATCACTATCTTTATCTTCACTATCATTGTCTTTGTCTCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAATTGGATCAGAAAATTTCGTACGCAATTCGCCTTATTACGCCGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATCCAACAACAAAGTCAAGAACTTAGTGATGGGGAATTGAGGAGGTGCAACTGGATCACCCATACCAGTGGTAAGATTATACTTACACAGTTATAAAATTTTTCGCTCTCGTAACCAGATAAATGCTTCCTAGCAAGTGATAATATAGCATTGTCCAAGAAAAATAAGTCAACAAAGTTTTTTAGGAAGTTCAAACTATTGATATGATTAATAATGTCATTAACTCAGTATAGCAGTTGCTACCACTGTCATATTTCTTAAGAAAATGAGATTGTGGCTTCTAACAAAAACTACAGAATAGTATAGTTCTGATATGGACTTAAAACACTATATCTTACTGAACAAAGAAAAAAGGGAAAAAGTAGTTTCCCTTTATAGGATACTTTCCCCACATGCTGGATTACTTCGTGATTGATGTTTTTCGTAATTGATACAGATAAAGCCTATGTATCTTTTCATGACGAGTGTTGGGGTGTCCCAGTATACGATGACAAGTAGGTTATCATGTAACTTTTCATTTATTTAACCTATCAATTAATAGATTTGTATGTTAACTTCGAAGAAAAATTACCATACAGCCGACTTTTCGAGCTACTCGCACTATCTGGGATGTTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGTAATATTTATTTTTTTTCATAACTCACACTTTATGTACTTAAGATAAATGTCAATCACATATAAATATATATACACATATGGTTGGTCAATAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACAGTTGCCAATATGGGGGAGAAAGAGATAACAGATATAGCTTCTGACAAGGCCATTATGCTGGTGGAGAGCAGAGTGAGGTGCATAGTAGATAATGCCAAATGCATATTAAAGGCAAGCTAGTTCAACTCATCTTTTAACTAGTTATTTCACAGATTGTATTATTAATTAGAATTATTGGTTTTTAATTCAACTGCAGATAGCTAGAGATTTTGGATCGTTTAGTAACTATATGTGGAGCTATATGAACTTTAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCATTCATGCAAGCGGCTGGGTTGACCATCGATCATCTTGTCGATTGTTTTCGACACGGTGAATGTGTAAATCTTGCAGAAAGGCCATGGAGACATATCTGA

mRNA sequence

CAAACCTTTATAATCTCAATCCATTTTTGTATCAAAATTGTCCCATTAAGTCTTATCTTATTTTCCAATATGTCATCCAAAGCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAACACATTTTGTCCAAACACCTTAAGAAGATTTACCCAATTGGGTTACAAAGAACCACTTCATCACTATCTTTATCTTCACTATCATTGTCTTTGTCTCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAATTGGATCAGAAAATTTCGTACGCAATTCGCCTTATTACGCCGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATCCAACAACAAAGTCAAGAACTTAGTGATGGGGAATTGAGGAGGTGCAACTGGATCACCCATACCAGTGATAAAGCCTATGTATCTTTTCATGACGAGTGTTGGGGTGTCCCAGTATACGATGACAACCGACTTTTCGAGCTACTCGCACTATCTGGGATGTTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACAGTTGCCAATATGGGGGAGAAAGAGATAACAGATATAGCTTCTGACAAGGCCATTATGCTGGTGGAGAGCAGAATAGCTAGAGATTTTGGATCGTTTAGTAACTATATGTGGAGCTATATGAACTTTAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCATTCATGCAAGCGGCTGGGTTGACCATCGATCATCTTGTCGATTGTTTTCGACACGGTGAATGTGTAAATCTTGCAGAAAGGCCATGGAGACATATCTGA

Coding sequence (CDS)

ATGTCATCCAAAGCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAACACATTTTGTCCAAACACCTTAAGAAGATTTACCCAATTGGGTTACAAAGAACCACTTCATCACTATCTTTATCTTCACTATCATTGTCTTTGTCTCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAATTGGATCAGAAAATTTCGTACGCAATTCGCCTTATTACGCCGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATCCAACAACAAAGTCAAGAACTTAGTGATGGGGAATTGAGGAGGTGCAACTGGATCACCCATACCAGTGATAAAGCCTATGTATCTTTTCATGACGAGTGTTGGGGTGTCCCAGTATACGATGACAACCGACTTTTCGAGCTACTCGCACTATCTGGGATGTTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACAGTTGCCAATATGGGGGAGAAAGAGATAACAGATATAGCTTCTGACAAGGCCATTATGCTGGTGGAGAGCAGAATAGCTAGAGATTTTGGATCGTTTAGTAACTATATGTGGAGCTATATGAACTTTAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCATTCATGCAAGCGGCTGGGTTGACCATCGATCATCTTGTCGATTGTTTTCGACACGGTGAATGTGTAAATCTTGCAGAAAGGCCATGGAGACATATCTGA

Protein sequence

MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI
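The protein sequence above corresponds to a translation of the CDS with the standard genetic code. The sketch below illustrates that relationship on the first ten codons of the CDS; the helper name `translate` is hypothetical, and the use of the standard code (rather than an organelle-specific table) is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGTCATCCAAAGCCACTGTTAGAAGACAG"))  # MSSKATVRRQ
```

Passing the full CDS string to `translate` in the same way should reproduce the listed protein, with the terminal TGA codon ending translation.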
BLAST of Lsi04G003420 vs. Swiss-Prot
Match: GUAA_HELHP (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) GN=guaA PE=3 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 6.0e-35
Identity = 75/204 (36.76%), Positives = 116/204 (56.86%), Query Frame = 1

Query: 96  LPKSIQQQSQELSDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLAL 155
           L KS+  ++Q+ ++G  E  RC W T   +   K Y  +HD  WG P+++D +LFE L L
Sbjct: 767 LQKSLGLEAQDSNEGVREKVRCAWATDKDEAARKLYEDYHDTEWGEPLHEDKKLFEHLVL 826

Query: 156 SGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRI----- 215
            G     +W  I+K+RE FR AF  F+P  VAN  E +I ++  ++ I+   ++I     
Sbjct: 827 EGFQAGLSWITILKKREAFRVAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAII 886

Query: 216 --------ARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFV 275
                    R+FGSF  Y+W ++  KP IN F    ++P  +P ++ I+KD+ KRGF+FV
Sbjct: 887 NAKAFMAVQREFGSFDKYIWGFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFV 946

Query: 276 GPVIVYSFMQAAGLTIDHLVDCFR 282
           G   +Y+ MQ+ G+  DHL  CF+
Sbjct: 947 GTTTMYAMMQSIGMVNDHLTSCFK 970
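The identity and positives percentages reported in an HSP header are ratios over the alignment length (gaps included). A minimal sketch of that arithmetic, using the counts from the HSP above; the function name `percent` is hypothetical.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)

# Counts taken from the GUAA_HELHP HSP: 75 identical and 116 positive
# (identical or conservatively substituted) columns out of 204 aligned columns.
print(percent(75, 204))   # 36.76
print(percent(116, 204))  # 56.86
```

Because the denominator is the alignment length rather than the full query length, a short local alignment can show a respectable identity even when most of the query is not covered.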

BLAST of Lsi04G003420 vs. Swiss-Prot
Match: 3MG1_ECOLI (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 5.8e-30
Identity = 62/181 (34.25%), Positives = 100/181 (55.25%), Query Frame = 1

Query: 112 LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFRE 171
           + RC W++   D  Y+++HD  WGVP  D  +LFE++ L G     +W  ++K+RE +R 
Sbjct: 1   MERCGWVSQ--DPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRA 60

Query: 172 AFAGFEPSTVANMGEKEITDIASDKAIMLVESRIARDFGS-------------FSNYMWS 231
            F  F+P  VA M E+++  +  D  I+    +I    G+             F +++WS
Sbjct: 61  CFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWS 120

Query: 232 YMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVD 280
           ++N +P + +      +P  +  ++A+SK + KRGF+FVG  I YSFMQA GL  DH+V 
Sbjct: 121 FVNHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVG 179

BLAST of Lsi04G003420 vs. Swiss-Prot
Match: 3MGA_HAEIN (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=tag PE=3 SV=1)

HSP 1 Score: 125.9 bits (315), Expect = 7.1e-28
Identity = 61/179 (34.08%), Positives = 101/179 (56.42%), Query Frame = 1

Query: 114 RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAF 173
           RC W+   S   Y+ +HD+ WG P +D  +LFE + L G     +W  ++K+RE +REAF
Sbjct: 4   RCPWVGEQS--IYIDYHDKEWGKPEFDSQKLFEKICLEGQQAGLSWITVLKKRESYREAF 63

Query: 174 AGFEPSTVANMGEKEITDIASDKAIMLVESRI------ARDF-------GSFSNYMWSYM 233
             F+P  +A M   +I     +  ++   +++      A+ +        +FS+++WS++
Sbjct: 64  HQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAYLAMEKCGENFSDFIWSFV 123

Query: 234 NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 280
           N KP +N     R+VP ++  ++A+SK + KRGF F+G    Y+FMQ+ GL  DHL DC
Sbjct: 124 NHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTCYAFMQSMGLVDDHLNDC 180

BLAST of Lsi04G003420 vs. TrEMBL
Match: A0A0A0KUC5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G606920 PE=4 SV=1)

HSP 1 Score: 565.8 bits (1457), Expect = 3.0e-158
Identity = 287/309 (92.88%), Positives = 292/309 (94.50%), Query Frame = 1

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR ILERQ CPKEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSS+SLSLSQN
Sbjct: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSIQLDQKISYAIRLITPPP ERREVPLPKSIQQQSQELSDGELRRCNWITH
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLITPPP-ERREVPLPKSIQQQSQELSDGELRRCNWITH 120

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS 
Sbjct: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSV 180

Query: 181 VANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTIN 240
           VANMGEKEITD+ASDKAIMLVESR             IARDFGSFSNYMWSY+NFKPTIN
Sbjct: 181 VANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTIN 240

Query: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 297
           RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Sbjct: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVN 300

BLAST of Lsi04G003420 vs. TrEMBL
Match: W9QWE1_9ROSA (Putative Glutamine amidotransferase OS=Morus notabilis GN=L484_026493 PE=4 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 1.1e-120
Identity = 220/298 (73.83%), Positives = 256/298 (85.91%), Query Frame = 1

Query: 4   KATVRRQILERQTCPKE---KDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 63
           KA VRR +LER    KE   KD+TS  +LSKHLK+IYPIGLQ++ SS SLSSLSLSLS+N
Sbjct: 3   KANVRRPVLERNGSLKENEKKDKTSPGLLSKHLKRIYPIGLQKSNSSPSLSSLSLSLSEN 62

Query: 64  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDG--ELRRCNWI 123
           SNDSSL D    LD KIS A+RL+   PP R+E P PK++QQQ  + ++   ELRRCNWI
Sbjct: 63  SNDSSLADFGSPLDHKISLALRLVA--PPRRKESPAPKNVQQQQSQDANNPEELRRCNWI 122

Query: 124 THTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEP 183
           T  SDK YV+FHDECWGVPVYDDN+LFELLA+SGMLMDYNWTEI+KRRELFREAF+GF+P
Sbjct: 123 TKNSDKVYVAFHDECWGVPVYDDNQLFELLAMSGMLMDYNWTEILKRRELFREAFSGFDP 182

Query: 184 STVANMGEKEITDIASDKAIMLVESRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLR 243
           S VA MGEKEIT+I+S+KAIML ESR+ R+FGSFSNYMWSY++ KP INR+R+PRNVPLR
Sbjct: 183 SKVAKMGEKEITEISSNKAIMLAESRVVREFGSFSNYMWSYVDHKPVINRYRYPRNVPLR 242

Query: 244 SPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI 297
           SPKAEAISKD++KRGFRFVGPVIV+SFMQAAGLTIDHLV+C+RH ECV+LAERPWRHI
Sbjct: 243 SPKAEAISKDLLKRGFRFVGPVIVHSFMQAAGLTIDHLVNCYRHYECVSLAERPWRHI 298

BLAST of Lsi04G003420 vs. TrEMBL
Match: B9RLG1_RICCO (DNA-3-methyladenine glycosylase, putative OS=Ricinus communis GN=RCOM_1466540 PE=4 SV=1)

HSP 1 Score: 436.4 bits (1121), Expect = 2.7e-119
Identity = 228/320 (71.25%), Positives = 270/320 (84.38%), Query Frame = 1

Query: 3   SKATVRRQILERQTC-PKEKDRTSQHIL---SKHLKKIYPIGLQRTTSSLSLSSLSLSLS 62
           SKATVR+Q+LE+++    EK+RT+ + L   SK+LKK+YPIGL R+ SSLSLSS+SLSLS
Sbjct: 2   SKATVRKQVLEKKSIFTNEKERTTSNQLGFFSKNLKKVYPIGLHRSNSSLSLSSVSLSLS 61

Query: 63  QNSNDSSLTD-SSIQLDQKISYAIRLITPPPPERREVP-LPKSIQQQ-------SQELSD 122
           +NSNDSSLTD S+  LDQKIS A+RLITP   ERREVP L +++QQQ       SQE + 
Sbjct: 62  ENSNDSSLTDYSNTPLDQKISLALRLITPL--ERREVPALSRNVQQQQQQQQQQSQESNG 121

Query: 123 GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF 182
           GE+RRCNWIT  SDK YV+FHDECWGVPVYDDN+LFELLALSGMLMDYNWTEI+KR++LF
Sbjct: 122 GEIRRCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKQLF 181

Query: 183 REAFAGFEPSTVANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYM 242
           REAFAGF+P+ VANMGEKEI DIAS+KAIML +SR             IAR+FGSFS++M
Sbjct: 182 REAFAGFDPNIVANMGEKEILDIASNKAIMLADSRVRCIVDNAKCIAKIAREFGSFSSFM 241

Query: 243 WSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL 297
           W ++N+KPTIN++++PRNVPLR+PKAEAISKD++KRGFRFVGPVIVYSFMQAAGLTIDHL
Sbjct: 242 WGHVNYKPTINKYKYPRNVPLRTPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGLTIDHL 301

BLAST of Lsi04G003420 vs. TrEMBL
Match: B9HWT9_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s14710g PE=4 SV=2)

HSP 1 Score: 433.7 bits (1114), Expect = 1.8e-118
Identity = 225/317 (70.98%), Positives = 264/317 (83.28%), Query Frame = 1

Query: 4   KATVRRQILERQTCP-KEKDR---TSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQ 63
           KA VR+QILE+     KEK++    +Q + SKHLK++YPIGL R+TSSLSLSS+SLSLSQ
Sbjct: 3   KANVRKQILEKNNILIKEKEKPISNTQGLFSKHLKRVYPIGLHRSTSSLSLSSVSLSLSQ 62

Query: 64  NSNDSSLTDSS-IQLDQKISYAIRLITPPPPERREVPLPKSIQ------QQSQELSDGEL 123
           NSNDSSLTDSS + L+QKIS A+RLI+P   ERREVP+ ++ Q      QQ+Q+ +DGE+
Sbjct: 63  NSNDSSLTDSSAVPLEQKISLALRLISPL--ERREVPVARNFQPQQQQQQQNQDSNDGEV 122

Query: 124 RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREA 183
           +RCNWIT  SDK YV+FHDECWGVPVYDDN+LFELLALSGMLMDYNWTEI+KR+ELFREA
Sbjct: 123 KRCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFREA 182

Query: 184 FAGFEPSTVANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSY 243
           F GF+P+ VA MGEKEI +IAS+KAIML ESR             IAR+FGSFSNYMW  
Sbjct: 183 FEGFDPNIVAKMGEKEIMEIASNKAIMLAESRVRCIVDNSKCILKIAREFGSFSNYMWGN 242

Query: 244 MNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 297
           +NFKPTINR+++PRNVPLRSPKAEAISKD++KRGFRF GPVIVYSFMQAAGLTIDHLVDC
Sbjct: 243 VNFKPTINRYKYPRNVPLRSPKAEAISKDLLKRGFRFAGPVIVYSFMQAAGLTIDHLVDC 302

BLAST of Lsi04G003420 vs. TrEMBL
Match: A0A061E9D7_THECC (DNA glycosylase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_011427 PE=4 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 5.2e-118
Identity = 225/318 (70.75%), Positives = 260/318 (81.76%), Query Frame = 1

Query: 3   SKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSN 62
           SKA VRR ILE+   PKEK++ +Q +LSKHLKKIYPIGLQR+TSSLSLSSLSLSLSQNSN
Sbjct: 7   SKANVRRHILEKNRSPKEKEKPAQSVLSKHLKKIYPIGLQRSTSSLSLSSLSLSLSQNSN 66

Query: 63  DSSLTD-SSIQLDQKISYAIRLITPPPPERRE--VPLPKSIQ--------QQSQELSDGE 122
           DSSLTD SS  L+QKIS A+ LI P   ERRE  VP+ KS+Q        Q SQ+   GE
Sbjct: 67  DSSLTDHSSTPLEQKISLALSLIAPHH-ERREFVVPVVKSVQHHHHQQQQQPSQDPGSGE 126

Query: 123 LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFRE 182
           LRRCNW+T  SDK YVSFHDE WGVPVYDDN+LFELLALSGMLMDYNWTEI+KR+EL+RE
Sbjct: 127 LRRCNWVTKNSDKVYVSFHDEQWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELYRE 186

Query: 183 AFAGFEPSTVANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWS 242
           AF+GF+P  VA MG+KEI +I+SDKAIML ESR             I R++GSFS++MW 
Sbjct: 187 AFSGFDPEIVAKMGDKEINEISSDKAIMLAESRVRCIVDNAKCILKIVREYGSFSSFMWG 246

Query: 243 YMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVD 297
           Y+N+KPTINR+++PRNVPLR+PKAEAIS+D++KRGFRFVGPVIV SFMQAAGLTIDHLVD
Sbjct: 247 YVNYKPTINRYKYPRNVPLRTPKAEAISRDLLKRGFRFVGPVIVCSFMQAAGLTIDHLVD 306

BLAST of Lsi04G003420 vs. TAIR10
Match: AT1G13635.1 (AT1G13635.1 DNA glycosylase superfamily protein)

HSP 1 Score: 375.2 bits (962), Expect = 3.8e-104
Identity = 196/306 (64.05%), Positives = 241/306 (78.76%), Query Frame = 1

Query: 8   RRQILERQTCPKEKD-RTSQHILSKHLKKIYPIGLQRTTSS-LSLSSLSLSLSQNSNDSS 67
           R++I+E+    +EK+ + + +  +KHLK+IYPI LQR+TSS  SLSS+SLSLSQNS DS 
Sbjct: 8   RKEIVEKSKSVREKEIKQNSNFFAKHLKRIYPITLQRSTSSSFSLSSISLSLSQNSTDSV 67

Query: 68  LTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-SQEL-SDGELRRCNWITHTSD 127
            TDS+  L+QKIS A+ LI+   P RRE+ +PKSI QQ  Q+  S  E +RCNWIT  SD
Sbjct: 68  STDSNSTLEQKISLALGLIS--SPHRREIFVPKSIPQQLCQDFNSSDEPKRCNWITKKSD 127

Query: 128 KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVAN 187
           + YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA 
Sbjct: 128 EVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREAFCEFDPNRVAK 187

Query: 188 MGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTINRFR 247
           MGEKEI +IAS+KAIML ESR             +  +FGSFS+++W +M++KP IN+F+
Sbjct: 188 MGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWGFMDYKPIINKFK 247

Query: 248 HPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE 297
           + RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+CV+LAE
Sbjct: 248 YSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLAE 307

BLAST of Lsi04G003420 vs. TAIR10
Match: AT5G57970.1 (AT5G57970.1 DNA glycosylase superfamily protein)

HSP 1 Score: 200.7 bits (509), Expect = 1.3e-51
Identity = 90/192 (46.88%), Positives = 125/192 (65.10%), Query Frame = 1

Query: 113 RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREA 172
           +RC W+T  SD  Y+ FHDE WGVPV+DD RLFELL LSG L ++ W  I+ +R+ FRE 
Sbjct: 154 KRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTILSKRQAFREV 213

Query: 173 FAGFEPSTVANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSY 232
           FA F+P+ +  + EK+I    S  + +L + +             +  ++GSF  Y+WS+
Sbjct: 214 FADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEYGSFDKYIWSF 273

Query: 233 MNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 292
           +  K  +++FR+ R VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  C
Sbjct: 274 VKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAAGITNDHLTSC 333

BLAST of Lsi04G003420 vs. TAIR10
Match: AT1G80850.1 (AT1G80850.1 DNA glycosylase superfamily protein)

HSP 1 Score: 199.9 bits (507), Expect = 2.2e-51
Identity = 107/261 (41.00%), Positives = 161/261 (61.69%), Query Frame = 1

Query: 48  LSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQEL 107
           L  + +S++ S +S+ SS  +SS       S   R++           L +++ ++  E 
Sbjct: 65  LRRNGISMTASYSSDASSSCESSPLSMTSTSSGKRVLRRSGSVSSSSSLRRNLTEERDEK 124

Query: 108 S-----DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEI 167
           +     DG  +RC WIT  SD+ Y++FHDE WGVPV+DD RLFELL+LSG L + +W +I
Sbjct: 125 ASDCFCDGR-KRCAWITPKSDQCYIAFHDEEWGVPVHDDKRLFELLSLSGALAELSWKDI 184

Query: 168 VKRRELFREAFAGFEPSTVANMGEKEITD--------IASDKAIMLVES-----RIARDF 227
           + +R+LFRE F  F+P  ++ +  K+IT         ++  K   ++E+     +I   F
Sbjct: 185 LSKRQLFREVFMDFDPIAISELTNKKITSPEIAATTLLSEQKLRSILENANQVCKIIGAF 244

Query: 228 GSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAA 287
           GSF  Y+W+++N KPT ++FR+PR VP+++ KAE ISKD+V+RGFR V P ++YSFMQ A
Sbjct: 245 GSFDKYIWNFVNQKPTQSQFRYPRQVPVKTSKAELISKDLVRRGFRSVSPTVIYSFMQTA 304

Query: 288 GLTIDHLVDCFRHGECVNLAE 291
           GLT DHL  CFRH +C+   E
Sbjct: 305 GLTNDHLTCCFRHHDCMTKDE 324

BLAST of Lsi04G003420 vs. TAIR10
Match: AT5G44680.1 (AT5G44680.1 DNA glycosylase superfamily protein)

HSP 1 Score: 199.1 bits (505), Expect = 3.7e-51
Identity = 94/205 (45.85%), Positives = 136/205 (66.34%), Query Frame = 1

Query: 98  KSIQQQSQELS--DGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLM 157
           KS++   + L+    + +RC++IT +SD  YV++HD+ WGVPV+DDN LFELL L+G  +
Sbjct: 147 KSVKSNEKNLNVEHEKKKRCSFITTSSDPIYVAYHDKEWGVPVHDDNLLFELLVLTGAQV 206

Query: 158 DYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVES-----------R 217
             +WT ++KRR  FREAF+GFE   VA+  EK+I  I +D  I L +            +
Sbjct: 207 GSDWTSVLKRRNTFREAFSGFEAELVADFNEKKIQSIVNDYGINLSQVLAVVDNAKQILK 266

Query: 218 IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYS 277
           + RD GSF+ Y+W +M  KP   ++   + +P+++ K+E ISKDMV+RGFRFVGP +++S
Sbjct: 267 VKRDLGSFNKYIWGFMKHKPVTTKYTSCQKIPVKTSKSETISKDMVRRGFRFVGPTVIHS 326

Query: 278 FMQAAGLTIDHLVDCFRHGECVNLA 290
            MQAAGLT DHL+ C RH EC  +A
Sbjct: 327 LMQAAGLTNDHLITCPRHLECTAMA 351

BLAST of Lsi04G003420 vs. TAIR10
Match: AT1G75090.1 (AT1G75090.1 DNA glycosylase superfamily protein)

HSP 1 Score: 193.4 bits (490), Expect = 2.0e-49
Identity = 104/267 (38.95%), Positives = 152/267 (56.93%), Query Frame = 1

Query: 43  RTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPER-----REVPLP 102
           R T S +      + S +++DSS + SS +     +     +T P           V   
Sbjct: 44  RVTKSPATKKPDSNFSVSTDDSSSSSSSSERSSVNTTNSGKVTTPSKRNGVEKLNNVVAS 103

Query: 103 KSIQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDY 162
            ++ +       G ++RC+WIT  SD  YV FHDE WGVPV DD +LFELL  S  L ++
Sbjct: 104 VAVVEDISPKIPGPVKRCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEF 163

Query: 163 NWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESR------------- 222
           +W  I++RR+ FR+ F  F+PS +A   EK +  +  +  ++L E +             
Sbjct: 164 SWPSILRRRDDFRKLFEEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLK 223

Query: 223 IARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYS 282
           + ++FGSFSNY W ++N KP  N +R+ R VP++SPKAE ISKDM++RGFR VGP ++YS
Sbjct: 224 VKQEFGSFSNYCWRFVNHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYS 283

Query: 283 FMQAAGLTIDHLVDCFRHGECVNLAER 292
           F+QA+G+  DHL  CFR+ EC    ER
Sbjct: 284 FLQASGIVNDHLTACFRYQECNVETER 310

BLAST of Lsi04G003420 vs. NCBI nr
Match: gi|449435284|ref|XP_004135425.1| (PREDICTED: uncharacterized protein LOC101218195 [Cucumis sativus])

HSP 1 Score: 565.8 bits (1457), Expect = 4.3e-158
Identity = 287/309 (92.88%), Positives = 292/309 (94.50%), Query Frame = 1

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR ILERQ CPKEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSS+SLSLSQN
Sbjct: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSIQLDQKISYAIRLITPPP ERREVPLPKSIQQQSQELSDGELRRCNWITH
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLITPPP-ERREVPLPKSIQQQSQELSDGELRRCNWITH 120

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS 
Sbjct: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSV 180

Query: 181 VANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTIN 240
           VANMGEKEITD+ASDKAIMLVESR             IARDFGSFSNYMWSY+NFKPTIN
Sbjct: 181 VANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTIN 240

Query: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 297
           RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Sbjct: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVN 300

BLAST of Lsi04G003420 vs. NCBI nr
Match: gi|659091306|ref|XP_008446481.1| (PREDICTED: uncharacterized protein LOC103489204 [Cucumis melo])

HSP 1 Score: 563.1 bits (1450), Expect = 2.8e-157
Identity = 288/309 (93.20%), Positives = 291/309 (94.17%), Query Frame = 1

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR ILERQ CPKEKDRTSQ+ILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN
Sbjct: 63  MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 122

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSIQLDQKISYAIRLITPPP ERREVPLPKSIQQQSQELSDGELRRCNWITH
Sbjct: 123 SNDSSLTDSSIQLDQKISYAIRLITPPP-ERREVPLPKSIQQQSQELSDGELRRCNWITH 182

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS 
Sbjct: 183 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSI 242

Query: 181 VANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTIN 240
           VANMGEKEITDIASDKAIMLVESR             IARDFGSFSNYMWS +NFKPTIN
Sbjct: 243 VANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSSVNFKPTIN 302

Query: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 297
           RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Sbjct: 303 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVN 362

BLAST of Lsi04G003420 vs. NCBI nr
Match: gi|703081259|ref|XP_010091642.1| (Putative Glutamine amidotransferase [Morus notabilis])

HSP 1 Score: 441.0 bits (1133), Expect = 1.6e-120
Identity = 220/298 (73.83%), Positives = 256/298 (85.91%), Query Frame = 1

Query: 4   KATVRRQILERQTCPKE---KDRTSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 63
           KA VRR +LER    KE   KD+TS  +LSKHLK+IYPIGLQ++ SS SLSSLSLSLS+N
Sbjct: 3   KANVRRPVLERNGSLKENEKKDKTSPGLLSKHLKRIYPIGLQKSNSSPSLSSLSLSLSEN 62

Query: 64  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDG--ELRRCNWI 123
           SNDSSL D    LD KIS A+RL+   PP R+E P PK++QQQ  + ++   ELRRCNWI
Sbjct: 63  SNDSSLADFGSPLDHKISLALRLVA--PPRRKESPAPKNVQQQQSQDANNPEELRRCNWI 122

Query: 124 THTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEP 183
           T  SDK YV+FHDECWGVPVYDDN+LFELLA+SGMLMDYNWTEI+KRRELFREAF+GF+P
Sbjct: 123 TKNSDKVYVAFHDECWGVPVYDDNQLFELLAMSGMLMDYNWTEILKRRELFREAFSGFDP 182

Query: 184 STVANMGEKEITDIASDKAIMLVESRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLR 243
           S VA MGEKEIT+I+S+KAIML ESR+ R+FGSFSNYMWSY++ KP INR+R+PRNVPLR
Sbjct: 183 SKVAKMGEKEITEISSNKAIMLAESRVVREFGSFSNYMWSYVDHKPVINRYRYPRNVPLR 242

Query: 244 SPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI 297
           SPKAEAISKD++KRGFRFVGPVIV+SFMQAAGLTIDHLV+C+RH ECV+LAERPWRHI
Sbjct: 243 SPKAEAISKDLLKRGFRFVGPVIVHSFMQAAGLTIDHLVNCYRHYECVSLAERPWRHI 298

BLAST of Lsi04G003420 vs. NCBI nr
Match: gi|255547045|ref|XP_002514580.1| (PREDICTED: DNA-3-methyladenine glycosylase [Ricinus communis])

HSP 1 Score: 436.4 bits (1121), Expect = 3.9e-119
Identity = 228/320 (71.25%), Positives = 270/320 (84.38%), Query Frame = 1

Query: 3   SKATVRRQILERQTC-PKEKDRTSQHIL---SKHLKKIYPIGLQRTTSSLSLSSLSLSLS 62
           SKATVR+Q+LE+++    EK+RT+ + L   SK+LKK+YPIGL R+ SSLSLSS+SLSLS
Sbjct: 2   SKATVRKQVLEKKSIFTNEKERTTSNQLGFFSKNLKKVYPIGLHRSNSSLSLSSVSLSLS 61

Query: 63  QNSNDSSLTD-SSIQLDQKISYAIRLITPPPPERREVP-LPKSIQQQ-------SQELSD 122
           +NSNDSSLTD S+  LDQKIS A+RLITP   ERREVP L +++QQQ       SQE + 
Sbjct: 62  ENSNDSSLTDYSNTPLDQKISLALRLITPL--ERREVPALSRNVQQQQQQQQQQSQESNG 121

Query: 123 GELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELF 182
           GE+RRCNWIT  SDK YV+FHDECWGVPVYDDN+LFELLALSGMLMDYNWTEI+KR++LF
Sbjct: 122 GEIRRCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKQLF 181

Query: 183 REAFAGFEPSTVANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYM 242
           REAFAGF+P+ VANMGEKEI DIAS+KAIML +SR             IAR+FGSFS++M
Sbjct: 182 REAFAGFDPNIVANMGEKEILDIASNKAIMLADSRVRCIVDNAKCIAKIAREFGSFSSFM 241

Query: 243 WSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL 297
           W ++N+KPTIN++++PRNVPLR+PKAEAISKD++KRGFRFVGPVIVYSFMQAAGLTIDHL
Sbjct: 242 WGHVNYKPTINKYKYPRNVPLRTPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGLTIDHL 301

BLAST of Lsi04G003420 vs. NCBI nr
Match: gi|743797932|ref|XP_011009373.1| (PREDICTED: uncharacterized protein LOC105114510 [Populus euphratica])

HSP 1 Score: 434.9 bits (1117), Expect = 1.1e-118
Identity = 225/317 (70.98%), Positives = 266/317 (83.91%), Query Frame = 1

Query: 4   KATVRRQILERQTCP-KEKDR---TSQHILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQ 63
           KA VR+QILE+     KEK++   ++Q + SKHLK++YPIGL R+TSSLSLSS+SLSLSQ
Sbjct: 3   KANVRKQILEKNNISIKEKEKPISSTQGLFSKHLKRVYPIGLHRSTSSLSLSSVSLSLSQ 62

Query: 64  NSNDSSLTDSS-IQLDQKISYAIRLITPPPPERREVPLPKSIQ------QQSQELSDGEL 123
           NSNDSSLTDSS + L+QKIS A+RLI+P   ERREVP+ ++ Q      QQ+Q+ +DGE+
Sbjct: 63  NSNDSSLTDSSAVPLEQKISLALRLISPL--ERREVPVARNFQPQQQQQQQNQDSNDGEV 122

Query: 124 RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREA 183
           +RCNWIT  SDK YV+FHDECWGVPVYDDN+LFELLALSGMLMDYNWTEI+KR+ELFREA
Sbjct: 123 KRCNWITKNSDKVYVAFHDECWGVPVYDDNQLFELLALSGMLMDYNWTEILKRKELFREA 182

Query: 184 FAGFEPSTVANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSY 243
           F GF+P+ VA MGEKEI +IAS+KAI+L ESR             IAR+FGSFSNYMW  
Sbjct: 183 FEGFDPNIVAKMGEKEIMEIASNKAIILAESRVRCIVDNSKCILKIAREFGSFSNYMWGN 242

Query: 244 MNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 297
           +NFKPTINR+++PRNVPLRSPKAEAISKD++KRGFRFVGPVIVYSFMQAAGLTIDHLVDC
Sbjct: 243 VNFKPTINRYKYPRNVPLRSPKAEAISKDLLKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 302

The following BLAST results are available for this feature:
vs. Swiss-Prot
Match Name  E-value  Identity (%)  Description
GUAA_HELHP  6.0e-35  36.76         Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ...
3MG1_ECOLI  5.8e-30  34.25         DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) GN=tag PE=1 S...
3MGA_HAEIN  7.1e-28  34.08         DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D...

vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0KUC5_CUCSA  3.0e-158  92.88         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G606920 PE=4 SV=1
W9QWE1_9ROSA      1.1e-120  73.83         Putative Glutamine amidotransferase OS=Morus notabilis GN=L484_026493 PE=4 SV=1
B9RLG1_RICCO      2.7e-119  71.25         DNA-3-methyladenine glycosylase, putative OS=Ricinus communis GN=RCOM_1466540 PE...
B9HWT9_POPTR      1.8e-118  70.98         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s14710g PE=4 SV=2
A0A061E9D7_THECC  5.2e-118  70.75         DNA glycosylase superfamily protein isoform 1 OS=Theobroma cacao GN=TCM_011427 P...

vs. TAIR10
Match Name   E-value   Identity (%)  Description
AT1G13635.1  3.8e-104  64.05         DNA glycosylase superfamily protein
AT5G57970.1  1.3e-51   46.88         DNA glycosylase superfamily protein
AT1G80850.1  2.2e-51   41.00         DNA glycosylase superfamily protein
AT5G44680.1  3.7e-51   45.85         DNA glycosylase superfamily protein
AT1G75090.1  2.0e-49   38.95         DNA glycosylase superfamily protein

vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|449435284|ref|XP_004135425.1|  4.3e-158  92.88         PREDICTED: uncharacterized protein LOC101218195 [Cucumis sativus]
gi|659091306|ref|XP_008446481.1|  2.8e-157  93.20         PREDICTED: uncharacterized protein LOC103489204 [Cucumis melo]
gi|703081259|ref|XP_010091642.1|  1.6e-120  73.83         Putative Glutamine amidotransferase [Morus notabilis]
gi|255547045|ref|XP_002514580.1|  3.9e-119  71.25         PREDICTED: DNA-3-methyladenine glycosylase [Ricinus communis]
gi|743797932|ref|XP_011009373.1|  1.1e-118  70.98         PREDICTED: uncharacterized protein LOC105114510 [Populus euphratica]

The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006281  DNA repair
GO:0006284  base-excision repair
Vocabulary: Molecular Function
Term        Definition
GO:0003824  catalytic activity
GO:0008725  DNA-3-methyladenine glycosylase activity
Vocabulary: INTERPRO
Term       Definition
IPR011257  DNA_glycosylase
IPR005019  Adenine_glyco
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006284      base-excision repair
biological_process  GO:0006541      glutamine metabolic process
biological_process  GO:0006281      DNA repair
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0008725      DNA-3-methyladenine glycosylase activity
molecular_function  GO:0016740      transferase activity
molecular_function  GO:0003824      catalytic activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi04G003420.1  Lsi04G003420.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description            Source   Source Term        Source Description                     Alignment
IPR005019  Methyladenine glycosylase  PFAM     PF03352            Adenine_glyco                          coord: 121..282; score: 7.6
IPR011257  DNA glycosylase            GENE3D   G3DSA:1.10.340.30  -                                      coord: 113..282; score: 1.6
IPR011257  DNA glycosylase            unknown  SSF48150           DNA-glycosylase                        coord: 113..286; score: 2.59
None       No IPR available           PANTHER  PTHR31116          FAMILY NOT NAMED                       coord: 1..296; score: 1.4E
None       No IPR available           PANTHER  PTHR31116:SF2      3-METHYLADENINE GLYCOSYLASE I-RELATED  coord: 1..296; score: 1.4E

The following gene(s) are paralogous to this gene:

None