HG10015431 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10015431
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DNA glycosylase superfamily protein
Location: Chr02: 26557782 .. 26559340 (-)
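The location field gives a chromosome, a start and end coordinate, and a strand. As a rough sketch, assuming the coordinates are 1-based and inclusive (a common convention for records of this kind, but not stated on this page), the genomic span can be derived as below. The helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr02: 26557782 .. 26559340 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr02: 26557782 .. 26559340 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 1559 bp on the minus strand of Chr02.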
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCATCCAAAGCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAAAACATTTTGTCCAAACACCTTAAGAAGATTTACCCAATTGGGTTACAAAGAACCACTTCATCACTATCTTTATCTTCACTATCATTGTCTTTGTCTCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAATTGGATCAGAAAATTTCGTACGCAATTCGCCTTATTACGCCGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATCCAACAACAAAGTCAAGAACTTAGTGATGGGGAATTGAGGAGGTGCAACTGGATCACCCATACCAGTGGTAAGATTATACTTACACAGTTATAAAATTTTTCGCTCTCGTAACCAGATAAATGCTTCCTAGCAAGTGATAATATAGCATTGTCCAAGAAAAATAAGTCAACAAAGTTTTTTAGGAAGTTCAAACTATTGATATGATTAATAATGTCATTAACTCAGTATAGCAGTTGCTACCACTGTCATATTTCTTAAGAAAATGAGATTGTGGCTTCTAACAAAAACTACAGAATAGTATAGTTCTGATATGGACTTAAAACACTATATCTTACTGAACAAAGAAAAAAGGGAAAAAGTAGTTTCCCTTTATAGGATACTTTCCCCACATGCTGGATTACTTCGTGATTGATGTTTTTCGTAATTGATACAGATAAAGCCTATGTATCTTTTCATGACGAGTGTTGGGGTGTCCCAGTATACGATGACAAGTAGGTTATCATGTAACTTTTCATTTATTTAACCTATCAATTAATAGATTTGTATGTTAACTTCGAAGAAAAATTACCATACAGCCGACTTTTCGAGCTACTCGCACTATCTGGGATGTTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGTAATATTTATTTTTTTTCATAACTCACACTTTATGTACTTAAGATAAATGTCAATCACATATAAATATATATACACATATGGTTGGTCAATAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACAGTTGCCAATATGGGGGAGAAAGAGATAACAGATATAGCTTCTGACAAGGCCATTATGCTGGTGGAGAGCAGAGTGAGGTGCATAGTAGATAATGCCAAATGCATATTAAAGGCAAGCTAGTTCAACTCATCTTTTAACTAGTTATTTCACAGATTGTATTATTAATTAGAATTATTGGTTTTTAATTCAACTGCAGATAGCTAGAGATTTTGGATCGTTTAGTAACTATATGTGGAGCTATATGAACTTTAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCATTCATGCAAGCGGCTGGGTTGACCATCGATCATCTTGTCGATTGTTTTCGACACGGTGAATGTGTAAATCTTGCAGAAAGGCCATGGAGACATATCTGA

mRNA sequence

ATGTCATCCAAAGCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAAAACATTTTGTCCAAACACCTTAAGAAGATTTACCCAATTGGGTTACAAAGAACCACTTCATCACTATCTTTATCTTCACTATCATTGTCTTTGTCTCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAATTGGATCAGAAAATTTCGTACGCAATTCGCCTTATTACGCCGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATCCAACAACAAAGTCAAGAACTTAGTGATGGGGAATTGAGGAGGTGCAACTGGATCACCCATACCAGTGATAAAGCCTATGTATCTTTTCATGACGAGTGTTGGGGTGTCCCAGTATACGATGACAACCGACTTTTCGAGCTACTCGCACTATCTGGGATGTTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACAGTTGCCAATATGGGGGAGAAAGAGATAACAGATATAGCTTCTGACAAGGCCATTATGCTGGTGGAGAGCAGAATAGCTAGAGATTTTGGATCGTTTAGTAACTATATGTGGAGCTATATGAACTTTAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCATTCATGCAAGCGGCTGGGTTGACCATCGATCATCTTGTCGATTGTTTTCGACACGGTGAATGTGTAAATCTTGCAGAAAGGCCATGGAGACATATCTGA

Coding sequence (CDS)

ATGTCATCCAAAGCCACTGTTAGAAGACAGATTCTGGAGAGGCAAACATGTCCTAAAGAGAAAGATAGGACAAGCCAAAACATTTTGTCCAAACACCTTAAGAAGATTTACCCAATTGGGTTACAAAGAACCACTTCATCACTATCTTTATCTTCACTATCATTGTCTTTGTCTCAAAATTCAAATGACTCTTCTCTTACAGACTCCTCAATCCAATTGGATCAGAAAATTTCGTACGCAATTCGCCTTATTACGCCGCCGCCTCCTGAAAGAAGAGAAGTCCCATTGCCTAAAAGTATCCAACAACAAAGTCAAGAACTTAGTGATGGGGAATTGAGGAGGTGCAACTGGATCACCCATACCAGTGATAAAGCCTATGTATCTTTTCATGACGAGTGTTGGGGTGTCCCAGTATACGATGACAACCGACTTTTCGAGCTACTCGCACTATCTGGGATGTTGATGGACTACAATTGGACTGAAATTGTGAAAAGAAGGGAACTATTCAGGGAAGCTTTTGCTGGATTTGAGCCAAGTACAGTTGCCAATATGGGGGAGAAAGAGATAACAGATATAGCTTCTGACAAGGCCATTATGCTGGTGGAGAGCAGAATAGCTAGAGATTTTGGATCGTTTAGTAACTATATGTGGAGCTATATGAACTTTAAACCTACAATAAACAGATTTAGACATCCAAGAAATGTTCCCTTGAGAAGTCCCAAAGCAGAAGCCATTAGCAAGGACATGGTGAAGCGCGGTTTTCGGTTTGTTGGGCCAGTGATTGTCTATTCATTCATGCAAGCGGCTGGGTTGACCATCGATCATCTTGTCGATTGTTTTCGACACGGTGAATGTGTAAATCTTGCAGAAAGGCCATGGAGACATATCTGA

Protein sequence

MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRIARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI
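The protein sequence is the conceptual translation of the coding sequence. The sketch below illustrates this on the first 30 nucleotides of the CDS, assuming the standard genetic code (the page does not say which translation table was used). The function `translate` is a hypothetical helper, not part of any pipeline referenced here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS shown above.
print(translate("ATGTCATCCAAAGCCACTGTTAGAAGACAG"))  # MSSKATVRRQ
```

The output matches the first ten residues of the protein sequence. Passing the full CDS to the same function should reproduce the complete protein, ending at the terminal TGA stop codon.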
Homology
BLAST of HG10015431 vs. NCBI nr
Match: XP_004135425.1 (uncharacterized protein LOC101218195 [Cucumis sativus] >KGN52002.1 hypothetical protein Csa_009065 [Cucumis sativus])

HSP 1 Score: 567.8 bits (1462), Expect = 5.6e-158
Identity = 288/309 (93.20%), Positives = 292/309 (94.50%), Query Frame = 0

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSS+SLSLSQN
Sbjct: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSIQQQSQELSDGELRRCNWITH
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLIT-PPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS 
Sbjct: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSV 180

Query: 181 VANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTIN 240
           VANMGEKEITD+ASDKAIMLVESR             IARDFGSFSNYMWSY+NFKPTIN
Sbjct: 181 VANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTIN 240

Query: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 297
           RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Sbjct: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVN 300
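
The identity figure in each HSP header is the number of identical aligned positions divided by the alignment length, and the alignment length includes gap columns. A minimal sketch of that arithmetic follows, using the values from the hit above. The function name `parse_identity` is hypothetical.

```python
import re

def parse_identity(line):
    """Extract (matches, alignment_length, percent) from a BLAST 'Identity = a/b' field."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError("no Identity field found")
    matches, length = int(m.group(1)), int(m.group(2))
    return matches, length, round(100.0 * matches / length, 2)

matches, length, percent = parse_identity("Identity = 288/309 (93.20%)")
print(f"{matches}/{length} = {percent:.2f}%")  # 288/309 = 93.20%
```

The denominator (309) is larger than the 297-residue query because the alignment contains gap columns.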

BLAST of HG10015431 vs. NCBI nr
Match: XP_022956507.1 (uncharacterized protein LOC111458228 [Cucurbita moschata] >XP_022956508.1 uncharacterized protein LOC111458228 [Cucurbita moschata])

HSP 1 Score: 566.6 bits (1459), Expect = 1.2e-157
Identity = 287/309 (92.88%), Positives = 291/309 (94.17%), Query Frame = 0

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR+ILERQTC KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN
Sbjct: 1   MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSIQLDQKISYAIRLITPPPPERRE PLPKS+QQQ QEL DGELRRCNWITH
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREAPLPKSVQQQCQELGDGELRRCNWITH 120

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE ST
Sbjct: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEAST 180

Query: 181 VANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTIN 240
           VANMGEKEI+DIASDKAIMLVESR             IARDFGSFSNYMWSYMNFKPTIN
Sbjct: 181 VANMGEKEISDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTIN 240

Query: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 297
           RFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
Sbjct: 241 RFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 300

BLAST of HG10015431 vs. NCBI nr
Match: XP_008446481.2 (PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo] >KAA0034490.1 DNA-3-methyladenine glycosylase 1 [Cucumis melo var. makuwa] >TYK09044.1 DNA-3-methyladenine glycosylase 1 [Cucumis melo var. makuwa])

HSP 1 Score: 565.1 bits (1455), Expect = 3.6e-157
Identity = 289/309 (93.53%), Positives = 291/309 (94.17%), Query Frame = 0

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN
Sbjct: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSIQQQSQELSDGELRRCNWITH
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLIT-PPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS 
Sbjct: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSI 180

Query: 181 VANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTIN 240
           VANMGEKEITDIASDKAIMLVESR             IARDFGSFSNYMWS +NFKPTIN
Sbjct: 181 VANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSSVNFKPTIN 240

Query: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 297
           RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Sbjct: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVN 300

BLAST of HG10015431 vs. NCBI nr
Match: KAG7032142.1 (guaA [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 564.3 bits (1453), Expect = 6.2e-157
Identity = 285/300 (95.00%), Positives = 289/300 (96.33%), Query Frame = 0

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR+ILERQTC KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN
Sbjct: 1   MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSI LDQKISYAIRLITP PPERRE PLPKS+QQQ QEL DGELRRCNWITH
Sbjct: 61  SNDSSLTDSSIHLDQKISYAIRLITPLPPERREAPLPKSVQQQCQELGDGELRRCNWITH 120

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE ST
Sbjct: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEAST 180

Query: 181 VANMGEKEITDIASDKAIMLVESR----IARDFGSFSNYMWSYMNFKPTINRFRHPRNVP 240
           VANMGEKEI+DIASDKAIMLVESR    IARDFGSFSNYMWSYMNFKPTINRFR+PRNVP
Sbjct: 181 VANMGEKEISDIASDKAIMLVESRVRCIIARDFGSFSNYMWSYMNFKPTINRFRYPRNVP 240

Query: 241 LRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI 297
           LRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI
Sbjct: 241 LRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAERPWRHI 300

BLAST of HG10015431 vs. NCBI nr
Match: XP_022993235.1 (uncharacterized protein LOC111489316 [Cucurbita maxima] >XP_022993244.1 uncharacterized protein LOC111489316 [Cucurbita maxima])

HSP 1 Score: 563.5 bits (1451), Expect = 1.1e-156
Identity = 287/310 (92.58%), Positives = 292/310 (94.19%), Query Frame = 0

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR+ILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN
Sbjct: 1   MSSKATVRRRILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLIT-PPPPERREVPLPKSIQQQSQELSDGELRRCNWIT 120
           SNDSSLTDSSIQLD+KISYAIRLIT PPPPERRE PLPKS+QQQ QEL DGELRRCNWIT
Sbjct: 61  SNDSSLTDSSIQLDRKISYAIRLITPPPPPERREAPLPKSVQQQCQELGDGELRRCNWIT 120

Query: 121 HTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS 180
           HTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE S
Sbjct: 121 HTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEAS 180

Query: 181 TVANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTI 240
           TVANMGEKEI+DIASDKAIMLVESR             IARDFGSFSNYMWSYMNFKPTI
Sbjct: 181 TVANMGEKEISDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTI 240

Query: 241 NRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV 297
           NRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
Sbjct: 241 NRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV 300

BLAST of HG10015431 vs. ExPASy Swiss-Prot
Match: Q7VG78 (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) OX=235279 GN=guaA PE=3 SV=1)

HSP 1 Score: 149.4 bits (376), Expect = 6.2e-35
Identity = 75/204 (36.76%), Positives = 116/204 (56.86%), Query Frame = 0

Query: 96  LPKSIQQQSQELSDG--ELRRCNWITHTSD---KAYVSFHDECWGVPVYDDNRLFELLAL 155
           L KS+  ++Q+ ++G  E  RC W T   +   K Y  +HD  WG P+++D +LFE L L
Sbjct: 767 LQKSLGLEAQDSNEGVREKVRCAWATDKDEAARKLYEDYHDTEWGEPLHEDKKLFEHLVL 826

Query: 156 SGMLMDYNWTEIVKRRELFREAFAGFEPSTVANMGEKEITDIASDKAIMLVESRI----- 215
            G     +W  I+K+RE FR AF  F+P  VAN  E +I ++  ++ I+   ++I     
Sbjct: 827 EGFQAGLSWITILKKREAFRVAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAII 886

Query: 216 --------ARDFGSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFV 275
                    R+FGSF  Y+W ++  KP IN F    ++P  +P ++ I+KD+ KRGF+FV
Sbjct: 887 NAKAFMAVQREFGSFDKYIWGFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFV 946

Query: 276 GPVIVYSFMQAAGLTIDHLVDCFR 282
           G   +Y+ MQ+ G+  DHL  CF+
Sbjct: 947 GTTTMYAMMQSIGMVNDHLTSCFK 970

BLAST of HG10015431 vs. ExPASy Swiss-Prot
Match: P05100 (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) OX=83333 GN=tag PE=1 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 6.0e-30
Identity = 62/181 (34.25%), Positives = 100/181 (55.25%), Query Frame = 0

Query: 112 LRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFRE 171
           + RC W+  + D  Y+++HD  WGVP  D  +LFE++ L G     +W  ++K+RE +R 
Sbjct: 1   MERCGWV--SQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRA 60

Query: 172 AFAGFEPSTVANMGEKEITDIASDKAIMLVESRIARDFGS-------------FSNYMWS 231
            F  F+P  VA M E+++  +  D  I+    +I    G+             F +++WS
Sbjct: 61  CFHQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWS 120

Query: 232 YMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVD 280
           ++N +P + +      +P  +  ++A+SK + KRGF+FVG  I YSFMQA GL  DH+V 
Sbjct: 121 FVNHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVG 179

BLAST of HG10015431 vs. ExPASy Swiss-Prot
Match: P44321 (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=tag PE=3 SV=1)

HSP 1 Score: 125.9 bits (315), Expect = 7.4e-28
Identity = 61/179 (34.08%), Positives = 101/179 (56.42%), Query Frame = 0

Query: 114 RCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAF 173
           RC W+   S   Y+ +HD+ WG P +D  +LFE + L G     +W  ++K+RE +REAF
Sbjct: 4   RCPWVGEQS--IYIDYHDKEWGKPEFDSQKLFEKICLEGQQAGLSWITVLKKRESYREAF 63

Query: 174 AGFEPSTVANMGEKEITDIASDKAIMLVESRI------ARDF-------GSFSNYMWSYM 233
             F+P  +A M   +I     +  ++   +++      A+ +        +FS+++WS++
Sbjct: 64  HQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAYLAMEKCGENFSDFIWSFV 123

Query: 234 NFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 280
           N KP +N     R+VP ++  ++A+SK + KRGF F+G    Y+FMQ+ GL  DHL DC
Sbjct: 124 NHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTCYAFMQSMGLVDDHLNDC 180

BLAST of HG10015431 vs. ExPASy TrEMBL
Match: A0A0A0KUC5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G606920 PE=4 SV=1)

HSP 1 Score: 567.8 bits (1462), Expect = 2.7e-158
Identity = 288/309 (93.20%), Positives = 292/309 (94.50%), Query Frame = 0

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSS+SLSLSQN
Sbjct: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSMSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSIQQQSQELSDGELRRCNWITH
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLIT-PPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS 
Sbjct: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSV 180

Query: 181 VANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTIN 240
           VANMGEKEITD+ASDKAIMLVESR             IARDFGSFSNYMWSY+NFKPTIN
Sbjct: 181 VANMGEKEITDVASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYVNFKPTIN 240

Query: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 297
           RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Sbjct: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVN 300

BLAST of HG10015431 vs. ExPASy TrEMBL
Match: A0A6J1GX19 (uncharacterized protein LOC111458228 OS=Cucurbita moschata OX=3662 GN=LOC111458228 PE=4 SV=1)

HSP 1 Score: 566.6 bits (1459), Expect = 6.0e-158
Identity = 287/309 (92.88%), Positives = 291/309 (94.17%), Query Frame = 0

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR+ILERQTC KEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN
Sbjct: 1   MSSKATVRRRILERQTCSKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSIQLDQKISYAIRLITPPPPERRE PLPKS+QQQ QEL DGELRRCNWITH
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREAPLPKSVQQQCQELGDGELRRCNWITH 120

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE ST
Sbjct: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEAST 180

Query: 181 VANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTIN 240
           VANMGEKEI+DIASDKAIMLVESR             IARDFGSFSNYMWSYMNFKPTIN
Sbjct: 181 VANMGEKEISDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTIN 240

Query: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 297
           RFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN
Sbjct: 241 RFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 300

BLAST of HG10015431 vs. ExPASy TrEMBL
Match: A0A5D3CCU6 (DNA-3-methyladenine glycosylase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G00310 PE=4 SV=1)

HSP 1 Score: 565.1 bits (1455), Expect = 1.8e-157
Identity = 289/309 (93.53%), Positives = 291/309 (94.17%), Query Frame = 0

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN
Sbjct: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSIQQQSQELSDGELRRCNWITH
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLIT-PPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS 
Sbjct: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSI 180

Query: 181 VANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTIN 240
           VANMGEKEITDIASDKAIMLVESR             IARDFGSFSNYMWS +NFKPTIN
Sbjct: 181 VANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSSVNFKPTIN 240

Query: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 297
           RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Sbjct: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVN 300

BLAST of HG10015431 vs. ExPASy TrEMBL
Match: A0A1S3BEN5 (DNA-3-methyladenine glycosylase 1 OS=Cucumis melo OX=3656 GN=LOC103489204 PE=4 SV=1)

HSP 1 Score: 565.1 bits (1455), Expect = 1.8e-157
Identity = 289/309 (93.53%), Positives = 291/309 (94.17%), Query Frame = 0

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR ILERQ CPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN
Sbjct: 1   MSSKATVRRHILERQACPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120
           SNDSSLTDSSIQLDQKISYAIRLIT PPPERREVPLPKSIQQQSQELSDGELRRCNWITH
Sbjct: 61  SNDSSLTDSSIQLDQKISYAIRLIT-PPPERREVPLPKSIQQQSQELSDGELRRCNWITH 120

Query: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPST 180
           TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS 
Sbjct: 121 TSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSI 180

Query: 181 VANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTIN 240
           VANMGEKEITDIASDKAIMLVESR             IARDFGSFSNYMWS +NFKPTIN
Sbjct: 181 VANMGEKEITDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSSVNFKPTIN 240

Query: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVN 297
           RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHL+DCFRHGECVN
Sbjct: 241 RFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLIDCFRHGECVN 300

BLAST of HG10015431 vs. ExPASy TrEMBL
Match: A0A6J1JY14 (uncharacterized protein LOC111489316 OS=Cucurbita maxima OX=3661 GN=LOC111489316 PE=4 SV=1)

HSP 1 Score: 563.5 bits (1451), Expect = 5.1e-157
Identity = 287/310 (92.58%), Positives = 292/310 (94.19%), Query Frame = 0

Query: 1   MSSKATVRRQILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60
           MSSKATVRR+ILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN
Sbjct: 1   MSSKATVRRRILERQTCPKEKDRTSQNILSKHLKKIYPIGLQRTTSSLSLSSLSLSLSQN 60

Query: 61  SNDSSLTDSSIQLDQKISYAIRLIT-PPPPERREVPLPKSIQQQSQELSDGELRRCNWIT 120
           SNDSSLTDSSIQLD+KISYAIRLIT PPPPERRE PLPKS+QQQ QEL DGELRRCNWIT
Sbjct: 61  SNDSSLTDSSIQLDRKISYAIRLITPPPPPERREAPLPKSVQQQCQELGDGELRRCNWIT 120

Query: 121 HTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPS 180
           HTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFE S
Sbjct: 121 HTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEAS 180

Query: 181 TVANMGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTI 240
           TVANMGEKEI+DIASDKAIMLVESR             IARDFGSFSNYMWSYMNFKPTI
Sbjct: 181 TVANMGEKEISDIASDKAIMLVESRVRCIVDNAKCILKIARDFGSFSNYMWSYMNFKPTI 240

Query: 241 NRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV 297
           NRFR+PRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV
Sbjct: 241 NRFRYPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECV 300

BLAST of HG10015431 vs. TAIR 10
Match: AT1G13635.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 377.1 bits (967), Expect = 1.3e-104
Identity = 197/306 (64.38%), Positives = 241/306 (78.76%), Query Frame = 0

Query: 8   RRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSS 67
           R++I+E+    +EK+ + + N  +KHLK+IYPI LQR T+SS SLSS+SLSLSQNS DS 
Sbjct: 8   RKEIVEKSKSVREKEIKQNSNFFAKHLKRIYPITLQRSTSSSFSLSSISLSLSQNSTDSV 67

Query: 68  LTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-SQEL-SDGELRRCNWITHTSD 127
            TDS+  L+QKIS A+ LI+   P RRE+ +PKSI QQ  Q+  S  E +RCNWIT  SD
Sbjct: 68  STDSNSTLEQKISLALGLIS--SPHRREIFVPKSIPQQLCQDFNSSDEPKRCNWITKKSD 127

Query: 128 KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVAN 187
           + YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA 
Sbjct: 128 EVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREAFCEFDPNRVAK 187

Query: 188 MGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTINRFR 247
           MGEKEI +IAS+KAIML ESR             +  +FGSFS+++W +M++KP IN+F+
Sbjct: 188 MGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWGFMDYKPIINKFK 247

Query: 248 HPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE 297
           + RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+CV+LAE
Sbjct: 248 YSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLAE 307

BLAST of HG10015431 vs. TAIR 10
Match: AT1G13635.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 377.1 bits (967), Expect = 1.3e-104
Identity = 197/306 (64.38%), Positives = 241/306 (78.76%), Query Frame = 0

Query: 8   RRQILERQTCPKEKD-RTSQNILSKHLKKIYPIGLQR-TTSSLSLSSLSLSLSQNSNDSS 67
           R++I+E+    +EK+ + + N  +KHLK+IYPI LQR T+SS SLSS+SLSLSQNS DS 
Sbjct: 8   RKEIVEKSKSVREKEIKQNSNFFAKHLKRIYPITLQRSTSSSFSLSSISLSLSQNSTDSV 67

Query: 68  LTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQ-SQEL-SDGELRRCNWITHTSD 127
            TDS+  L+QKIS A+ LI+   P RRE+ +PKSI QQ  Q+  S  E +RCNWIT  SD
Sbjct: 68  STDSNSTLEQKISLALGLIS--SPHRREIFVPKSIPQQLCQDFNSSDEPKRCNWITKKSD 127

Query: 128 KAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREAFAGFEPSTVAN 187
           + YV FHD+ WGVPVYDDN LFE LA+SGMLMDYNWTEI+KR+E FREAF  F+P+ VA 
Sbjct: 128 EVYVMFHDQQWGVPVYDDNLLFEFLAMSGMLMDYNWTEILKRKEHFREAFCEFDPNRVAK 187

Query: 188 MGEKEITDIASDKAIMLVESR-------------IARDFGSFSNYMWSYMNFKPTINRFR 247
           MGEKEI +IAS+KAIML ESR             +  +FGSFS+++W +M++KP IN+F+
Sbjct: 188 MGEKEIAEIASNKAIMLQESRVRCIVDNAKCITKVVNEFGSFSSFVWGFMDYKPIINKFK 247

Query: 248 HPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDCFRHGECVNLAE 297
           + RNVPLRSPKAE ISKDM+KRGFRFVGPVIV+SFMQAAGLTIDHLVDCFRHG+CV+LAE
Sbjct: 248 YSRNVPLRSPKAEIISKDMIKRGFRFVGPVIVHSFMQAAGLTIDHLVDCFRHGDCVSLAE 307

BLAST of HG10015431 vs. TAIR 10
Match: AT5G57970.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 200.7 bits (509), Expect = 1.7e-51
Identity = 90/192 (46.88%), Positives = 125/192 (65.10%), Query Frame = 0

Query: 113 RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREA 172
           +RC W+T  SD  Y+ FHDE WGVPV+DD RLFELL LSG L ++ W  I+ +R+ FRE 
Sbjct: 154 KRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTILSKRQAFREV 213

Query: 173 FAGFEPSTVANMGEKEITDIASDKAIMLVE-------------SRIARDFGSFSNYMWSY 232
           FA F+P+ +  + EK+I    S  + +L +              ++  ++GSF  Y+WS+
Sbjct: 214 FADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEYGSFDKYIWSF 273

Query: 233 MNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 292
           +  K  +++FR+ R VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  C
Sbjct: 274 VKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAAGITNDHLTSC 333

BLAST of HG10015431 vs. TAIR 10
Match: AT5G57970.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 200.7 bits (509), Expect = 1.7e-51
Identity = 90/192 (46.88%), Positives = 125/192 (65.10%), Query Frame = 0

Query: 113 RRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEIVKRRELFREA 172
           +RC W+T  SD  Y+ FHDE WGVPV+DD RLFELL LSG L ++ W  I+ +R+ FRE 
Sbjct: 154 KRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPTILSKRQAFREV 213

Query: 173 FAGFEPSTVANMGEKEITDIASDKAIMLVE-------------SRIARDFGSFSNYMWSY 232
           FA F+P+ +  + EK+I    S  + +L +              ++  ++GSF  Y+WS+
Sbjct: 214 FADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEEYGSFDKYIWSF 273

Query: 233 MNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAAGLTIDHLVDC 292
           +  K  +++FR+ R VP ++PKAE ISKD+V+RGFR VGP +VYSFMQAAG+T DHL  C
Sbjct: 274 VKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQAAGITNDHLTSC 333

BLAST of HG10015431 vs. TAIR 10
Match: AT1G80850.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 199.9 bits (507), Expect = 2.9e-51
Identity = 107/261 (41.00%), Positives = 160/261 (61.30%), Query Frame = 0

Query: 48  LSLSSLSLSLSQNSNDSSLTDSSIQLDQKISYAIRLITPPPPERREVPLPKSIQQQSQE- 107
           L  + +S++ S +S+ SS  +SS       S   R++           L +++ ++  E 
Sbjct: 65  LRRNGISMTASYSSDASSSCESSPLSMTSTSSGKRVLRRSGSVSSSSSLRRNLTEERDEK 124

Query: 108 ----LSDGELRRCNWITHTSDKAYVSFHDECWGVPVYDDNRLFELLALSGMLMDYNWTEI 167
                 DG  +RC WIT  SD+ Y++FHDE WGVPV+DD RLFELL+LSG L + +W +I
Sbjct: 125 ASDCFCDGR-KRCAWITPKSDQCYIAFHDEEWGVPVHDDKRLFELLSLSGALAELSWKDI 184

Query: 168 VKRRELFREAFAGFEPSTVANMGEKEITD--------IASDKAIMLVES-----RIARDF 227
           + +R+LFRE F  F+P  ++ +  K+IT         ++  K   ++E+     +I   F
Sbjct: 185 LSKRQLFREVFMDFDPIAISELTNKKITSPEIAATTLLSEQKLRSILENANQVCKIIGAF 244

Query: 228 GSFSNYMWSYMNFKPTINRFRHPRNVPLRSPKAEAISKDMVKRGFRFVGPVIVYSFMQAA 287
           GSF  Y+W+++N KPT ++FR+PR VP+++ KAE ISKD+V+RGFR V P ++YSFMQ A
Sbjct: 245 GSFDKYIWNFVNQKPTQSQFRYPRQVPVKTSKAELISKDLVRRGFRSVSPTVIYSFMQTA 304

Query: 288 GLTIDHLVDCFRHGECVNLAE 291
           GLT DHL  CFRH +C+   E
Sbjct: 305 GLTNDHLTCCFRHHDCMTKDE 324

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name       E-value    Identity (%)  Description
XP_004135425.1   5.6e-158   93.20         uncharacterized protein LOC101218195 [Cucumis sativus] >KGN52002.1 hypothetical ...
XP_022956507.1   1.2e-157   92.88         uncharacterized protein LOC111458228 [Cucurbita moschata] >XP_022956508.1 unchar...
XP_008446481.2   3.6e-157   93.53         PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo] >KAA0034490.1 DNA-3-...
KAG7032142.1     6.2e-157   95.00         guaA [Cucurbita argyrosperma subsp. argyrosperma]
XP_022993235.1   1.1e-156   92.58         uncharacterized protein LOC111489316 [Cucurbita maxima] >XP_022993244.1 uncharac...

vs. ExPASy Swiss-Prot
Match Name       E-value    Identity (%)  Description
Q7VG78           6.2e-35    36.76         Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ...
P05100           6.0e-30    34.25         DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) OX=83333 GN=t...
P44321           7.4e-28    34.08         DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D...

vs. ExPASy TrEMBL
Match Name       E-value    Identity (%)  Description
A0A0A0KUC5       2.7e-158   93.20         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G606920 PE=4 SV=1
A0A6J1GX19       6.0e-158   92.88         uncharacterized protein LOC111458228 OS=Cucurbita moschata OX=3662 GN=LOC1114582...
A0A5D3CCU6       1.8e-157   93.53         DNA-3-methyladenine glycosylase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3BEN5       1.8e-157   93.53         DNA-3-methyladenine glycosylase 1 OS=Cucumis melo OX=3656 GN=LOC103489204 PE=4 S...
A0A6J1JY14       5.1e-157   92.58         uncharacterized protein LOC111489316 OS=Cucurbita maxima OX=3661 GN=LOC111489316...

vs. TAIR 10
Match Name       E-value    Identity (%)  Description
AT1G13635.1      1.3e-104   64.38         DNA glycosylase superfamily protein
AT1G13635.2      1.3e-104   64.38         DNA glycosylase superfamily protein
AT5G57970.1      1.7e-51    46.88         DNA glycosylase superfamily protein
AT5G57970.2      1.7e-51    46.88         DNA glycosylase superfamily protein
AT1G80850.1      2.9e-51    41.00         DNA glycosylase superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   | IPR Description           | Source      | Source Term    | Source Description                  | Alignment
None       | No IPR available          | GENE3D      | 1.10.340.30    | Hypothetical protein; domain 2      | coord: 112..283; e-value: 2.6E-59; score: 201.8
None       | No IPR available          | PANTHER     | PTHR31116      | OS04G0501200 PROTEIN                | coord: 3..296
None       | No IPR available          | PANTHER     | PTHR31116:SF29 | DNA GLYCOSYLASE SUPERFAMILY PROTEIN | coord: 3..296
IPR005019  | Methyladenine glycosylase | PFAM        | PF03352        | Adenine_glyco                       | coord: 121..282; e-value: 7.3E-52; score: 175.7
IPR011257  | DNA glycosylase           | SUPERFAMILY | 48150          | DNA-glycosylase                     | coord: 113..286
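
The three domain-level hits (GENE3D, PFAM, SUPERFAMILY) cover nearly the same stretch of the protein. As an illustration only, the sketch below computes the region shared by all three, taking the coordinates from the table above and assuming they are 1-based, inclusive residue positions. The function name `common_interval` is hypothetical.

```python
def common_interval(intervals):
    """Return the (start, end) region covered by every interval, or None if there is none."""
    intervals = list(intervals)
    start = max(s for s, _ in intervals)
    end = min(e for _, e in intervals)
    return (start, end) if start <= end else None

# Domain-level hits from the InterPro table (family-level PANTHER hits omitted).
hits = {
    "GENE3D 1.10.340.30": (112, 283),
    "PFAM PF03352": (121, 282),
    "SUPERFAMILY 48150": (113, 286),
}

shared = common_interval(hits.values())
print(shared, shared[1] - shared[0] + 1)  # (121, 282) 162
```

The shared region coincides with the PFAM Adenine_glyco hit, which lies entirely within the other two.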

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
HG10015431.1   HG10015431.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006284      base-excision repair
biological_process  GO:0006541      glutamine metabolic process
biological_process  GO:0006281      DNA repair
molecular_function  GO:0008725      DNA-3-methyladenine glycosylase activity
molecular_function  GO:0016740      transferase activity
molecular_function  GO:0003824      catalytic activity