HG10003371 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10003371
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: DNA glycosylase
Location: Chr08: 202248 .. 203583 (-)
RNA-Seq Expression: HG10003371
Synteny: HG10003371
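
The Location field packs chromosome, start, end and strand into one string. A minimal sketch of how it could be split apart is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF), which this page does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr08: 202248 .. 203583 (-)' into its parts.

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "chrom": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Inclusive span length under the 1-based assumption above.
        "length": end - start + 1,
    }

location = parse_location("Chr08: 202248 .. 203583 (-)")
print(location)
```

Under that assumption the span covers 1336 positions on the minus strand of Chr08.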
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAAGATGATTGTTTTGAATTTGGGAGTGAGTAGTGATTTTGATCTTGAGAAAGCAGTTTGTAACCATGGGCAATTTATGATGTCACCAAACCAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTTATTTCTAATTCTAACTCTTCTGTTTTTGTCTCTATCAACCATTCTTCTTCTTTTCTTCTAACCATTCAAATCCACTCTTCTCCCCCTCTCTCTCCCCAAGATCAACAAGCTATATTGGTATATTTGCTAAATTAATATTACATTTTTATTTTATAATTTCTCTAATTAATTCATTCATTAAAATTAAATTTATACATATATGTAGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGATGAAGATGAGTTGAGGAAATTTCAAAGTTTGCATCCCAAAGCCAAACAGATGGGATTTGGTCGCCTTTTTCGATCTCCCACTCTTTTTGAAGATGCAATCAAGTCCATCCTTCTATGCAATACCACGTAATTATTAATTTAATTCATCAATTAATATTATTAATATTATTAATGCATAATGCATGGTTAATTACTTGTCAGGTGGAAAAGGACACTGGCAATGGCTGAACAGCTATGTGAGCTCCAAGCCAAAATGAGCAACCAAAGTAGGAAGAGAAAAAGGAAAGTAATTGGGAATTTTCCAAATGCAGAAGAAGTTTGTAGAATGGGGGTTGAATTGTTGAAGAAGCATAATCTTGGTTACAGAGCTGGTTACATCATTAACTTTGCTCAACGTGTTCAAAATGGCACAATTAATCTCCAAAATCCTAATCATTTACCTAAAATCAAAGGCTTTGGACCTTTTGCAACCGCTAATTTACTCATGTGCCTCGGTTTTTACCGCCAACTTCCAATTGATACTGAAACTATAAGGCACTTAAAACAGGTTTAATTTCTTTCCATGCATTCATTATTATATATATTATACTTAATTTGTGTTATGATTATTAATTATTTGAATTTACAAGCTACATGGGAGACAATTTTGCAACAAAAAGACAGTACAGGAAGACGTCAAACAAATTTACGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTACGTAAATCAATTTTTTCTTCTTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTACCTAAAATCTAATTTGCTTTAAATGCTTTATTTTGGTTTTAAATATTTCAAATTTAATTTCAGGTTGGAGCTTGTGGAGTATTATGAGAGCAAATTCGGGAAGCTAAGTGAACTGTGCTCCCTTGATTATAACAAGATCAGTGGCACCACCCTCAACCTTTGA

mRNA sequence

ATGAAGAAGATGATTGTTTTGAATTTGGGAGTGAGTAGTGATTTTGATCTTGAGAAAGCAGTTTGTAACCATGGGCAATTTATGATGTCACCAAACCAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTTATTTCTAATTCTAACTCTTCTGTTTTTGTCTCTATCAACCATTCTTCTTCTTTTCTTCTAACCATTCAAATCCACTCTTCTCCCCCTCTCTCTCCCCAAGATCAACAAGCTATATTGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGATGAAGATGAGTTGAGGAAATTTCAAAGTTTGCATCCCAAAGCCAAACAGATGGGATTTGGTCGCCTTTTTCGATCTCCCACTCTTTTTGAAGATGCAATCAAGTCCATCCTTCTATGCAATACCACGTGGAAAAGGACACTGGCAATGGCTGAACAGCTATGTGAGCTCCAAGCCAAAATGAGCAACCAAAGTAGGAAGAGAAAAAGGAAAGTAATTGGGAATTTTCCAAATGCAGAAGAAGTTTGTAGAATGGGGGTTGAATTGTTGAAGAAGCATAATCTTGGTTACAGAGCTGGTTACATCATTAACTTTGCTCAACGTGTTCAAAATGGCACAATTAATCTCCAAAATCCTAATCATTTACCTAAAATCAAAGGCTTTGGACCTTTTGCAACCGCTAATTTACTCATGTGCCTCGGTTTTTACCGCCAACTTCCAATTGATACTGAAACTATAAGGCACTTAAAACAGCTACATGGGAGACAATTTTGCAACAAAAAGACAGTACAGGAAGACGTCAAACAAATTTACGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTGGAGCTTGTGGAGTATTATGAGAGCAAATTCGGGAAGCTAAGTGAACTGTGCTCCCTTGATTATAACAAGATCAGTGGCACCACCCTCAACCTTTGA

Coding sequence (CDS)

ATGAAGAAGATGATTGTTTTGAATTTGGGAGTGAGTAGTGATTTTGATCTTGAGAAAGCAGTTTGTAACCATGGGCAATTTATGATGTCACCAAACCAATGGATTCCTTCTTCTAAAACTCTCCAACGTCCACTTCGACTTATTTCTAATTCTAACTCTTCTGTTTTTGTCTCTATCAACCATTCTTCTTCTTTTCTTCTAACCATTCAAATCCACTCTTCTCCCCCTCTCTCTCCCCAAGATCAACAAGCTATATTGGATCAAGTGGTTCGGATGCTTAGGCTTACGGAGAAAGATGAAGATGAGTTGAGGAAATTTCAAAGTTTGCATCCCAAAGCCAAACAGATGGGATTTGGTCGCCTTTTTCGATCTCCCACTCTTTTTGAAGATGCAATCAAGTCCATCCTTCTATGCAATACCACGTGGAAAAGGACACTGGCAATGGCTGAACAGCTATGTGAGCTCCAAGCCAAAATGAGCAACCAAAGTAGGAAGAGAAAAAGGAAAGTAATTGGGAATTTTCCAAATGCAGAAGAAGTTTGTAGAATGGGGGTTGAATTGTTGAAGAAGCATAATCTTGGTTACAGAGCTGGTTACATCATTAACTTTGCTCAACGTGTTCAAAATGGCACAATTAATCTCCAAAATCCTAATCATTTACCTAAAATCAAAGGCTTTGGACCTTTTGCAACCGCTAATTTACTCATGTGCCTCGGTTTTTACCGCCAACTTCCAATTGATACTGAAACTATAAGGCACTTAAAACAGCTACATGGGAGACAATTTTGCAACAAAAAGACAGTACAGGAAGACGTCAAACAAATTTACGACAAGTATGCTCCTTTCCAATGCTTGGCCTATTGGTTGGAGCTTGTGGAGTATTATGAGAGCAAATTCGGGAAGCTAAGTGAACTGTGCTCCCTTGATTATAACAAGATCAGTGGCACCACCCTCAACCTTTGA

Protein sequence

MKKMIVLNLGVSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRKVIGNFPNAEEVCRMGVELLKKHNLGYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGTTLNL
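
The protein sequence above can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code and a CDS that starts in frame; `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 54 nucleotides of the CDS listed above.
cds_start = "ATGAAGAAGATGATTGTTTTGAATTTGGGAGTGAGTAGTGATTTTGATCTTGAG"
print(translate(cds_start))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein is a quick consistency check on the annotation.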
Homology
BLAST of HG10003371 vs. NCBI nr
Match: XP_038877617.1 (uncharacterized protein LOC120069874 [Benincasa hispida])

HSP 1 Score: 494.2 bits (1271), Expect = 8.5e-136
Identity = 257/297 (86.53%), Positives = 270/297 (90.91%), Query Frame = 0

Query: 3   KMIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINH 62
           K I LNLGVS SDFDLEKAVCNHGQFMM PNQWIPSSKTLQRPLRL S+S+SSVFVSIN 
Sbjct: 2   KTIHLNLGVSVSDFDLEKAVCNHGQFMMPPNQWIPSSKTLQRPLRL-SDSHSSVFVSINQ 61

Query: 63  SSSFLLTIQIH-SSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGR 122
            SS LLTIQIH SS PLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHP+AKQMGFGR
Sbjct: 62  PSSSLLTIQIHSSSTPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPRAKQMGFGR 121

Query: 123 LFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQ-SRKRKRKV------IGN 182
           LFRSPTLFEDA+KSILLCNTTWKRTLAMA QLCELQAKM  Q +RKRKRK+      IGN
Sbjct: 122 LFRSPTLFEDALKSILLCNTTWKRTLAMAGQLCELQAKMRRQITRKRKRKLGEKEGEIGN 181

Query: 183 FPNAEEVCRMGVELLKKHNLGYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATAN 242
           FPNAEEVCRMGVELLKKH LGYRA YIINFA+ VQ+G I+LQNPN+ PKIKGFGPFATAN
Sbjct: 182 FPNAEEVCRMGVELLKKHCLGYRAAYIINFAKCVQSGKIDLQNPNYFPKIKGFGPFATAN 241

Query: 243 LLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLE 291
           +LMCLG YRQLPIDTETIRHLKQ+HGRQFCN KTV+EDVKQIYDKYAPFQCLAYWLE
Sbjct: 242 VLMCLGLYRQLPIDTETIRHLKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYWLE 297
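
The percentages in an HSP header are the count of matching alignment columns divided by the alignment length. As a rough sketch, the counts can be pulled out of the header line and the percentages recomputed; `hsp_percentages` is a hypothetical helper and the regular expression only targets the `name = matches/length` pattern seen on this page.

```python
import re

def hsp_percentages(header_line):
    """Return (field, matches, aligned_length, percent) for each 'x = a/b' field."""
    results = []
    for name, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", header_line):
        matches, length = int(matches), int(length)
        # Percentage rounded to two decimals, as displayed in the header.
        results.append((name, matches, length, round(100.0 * matches / length, 2)))
    return results

header = "Identity = 257/297 (86.53%), Positives = 270/297 (90.91%), Query Frame = 0"
for field in hsp_percentages(header):
    print(field)
```

The `Query Frame = 0` field is skipped because it has no `matches/length` fraction.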

BLAST of HG10003371 vs. NCBI nr
Match: XP_022951918.1 (uncharacterized protein LOC111454659 [Cucurbita moschata])

HSP 1 Score: 476.1 bits (1224), Expect = 2.4e-130
Identity = 241/328 (73.48%), Positives = 278/328 (84.76%), Query Frame = 0

Query: 4   MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS 63
           MI L LGV  SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN S
Sbjct: 1   MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRL-SNSDTSLLVSINQS 60

Query: 64  SSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLF 123
           SS LLT+QIHS   L P+D+ AILDQV RMLRLTEKDEDE+R+FQ+LHP AKQ+GFGR+F
Sbjct: 61  SSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEIRRFQNLHPTAKQIGFGRIF 120

Query: 124 RSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEE 183
           RSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM  +S+KRKRK     GNFPNA E
Sbjct: 121 RSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKM-RESKKRKRKGNNERGNFPNARE 180

Query: 184 VCRMGVELLKKHNLGYRAGYIINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATA 243
           VCRMGVE LK H LGYRA Y++ FAQ V++G INLQ       +P+  PKIKGFGPFATA
Sbjct: 181 VCRMGVEALKNHCLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATA 240

Query: 244 NLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV 303
           N+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYWLELV
Sbjct: 241 NIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELV 300

Query: 304 EYYESKFGKLSELCSLDYNKISGTTLNL 321
           +YYE+KFGKLSEL S DY+KISG+TL+L
Sbjct: 301 QYYETKFGKLSELSSFDYHKISGSTLHL 326

BLAST of HG10003371 vs. NCBI nr
Match: KAG6585875.1 (hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia] >KAG7020778.1 hypothetical protein SDJN02_17466, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 475.7 bits (1223), Expect = 3.1e-130
Identity = 241/328 (73.48%), Positives = 278/328 (84.76%), Query Frame = 0

Query: 4   MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS 63
           MI L LGV  SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN S
Sbjct: 1   MIELKLGVRVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRL-SNSDTSLLVSINQS 60

Query: 64  SSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLF 123
           SS LLT+QIHS   L P+D+ AILDQV RMLRLTEKDEDE+R+FQ+LHP AKQ+GFGR+F
Sbjct: 61  SSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEIRRFQNLHPTAKQIGFGRIF 120

Query: 124 RSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEE 183
           RSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM  +S+KRKRK     GNFPNA E
Sbjct: 121 RSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKM-RESKKRKRKGNNERGNFPNARE 180

Query: 184 VCRMGVELLKKHNLGYRAGYIINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATA 243
           VCRMGVE LK H LGYRA Y++ FAQ V++G INLQ       +P+  PKIKGFGPFATA
Sbjct: 181 VCRMGVEALKNHCLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATA 240

Query: 244 NLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV 303
           N+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYWLELV
Sbjct: 241 NIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELV 300

Query: 304 EYYESKFGKLSELCSLDYNKISGTTLNL 321
           +YYE+KFGKLSEL S DY+KISG+TL+L
Sbjct: 301 QYYETKFGKLSELSSFDYHKISGSTLHL 326

BLAST of HG10003371 vs. NCBI nr
Match: XP_022156993.1 (uncharacterized protein LOC111023822 [Momordica charantia])

HSP 1 Score: 442.2 bits (1136), Expect = 3.8e-120
Identity = 230/331 (69.49%), Positives = 263/331 (79.46%), Query Frame = 0

Query: 2   KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSIN 61
           ++MI LNLG  +S FDLE+AVCNHG FMM PN+WIPSSKTLQRPLRL ++S +SV VSI+
Sbjct: 5   RRMIDLNLGETTSGFDLERAVCNHGFFMMPPNKWIPSSKTLQRPLRL-ADSTTSVLVSIS 64

Query: 62  HSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGR 121
             SS LL IQIHSSP  SP D+QAILDQV RMLR+TE+DE+ +R FQ+LH KAK++GFGR
Sbjct: 65  QPSSHLLNIQIHSSPSFSPLDRQAILDQVTRMLRITERDEENIRNFQNLHAKAKEIGFGR 124

Query: 122 LFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK------- 181
           LFRSPTLFEDA+KSILLCN TW+RTLAMA QLCELQAK+        +KRKRK       
Sbjct: 125 LFRSPTLFEDAVKSILLCNATWRRTLAMAGQLCELQAKLGRGPITDGKKRKRKGKGECEL 184

Query: 182 VIGNFPNAEEVCRMGVELLKKHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGF 241
             GNFP A E+CRM V LL+KH +GYRA YII+ AQRVQNG I+LQ        PKIKGF
Sbjct: 185 EGGNFPTAAELCRMSVLLLQKHFIGYRAVYIIDLAQRVQNGKIDLQKIERALSFPKIKGF 244

Query: 242 GPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA 301
           GPF TAN+ MCLG Y +LPIDTETIRHLKQ+HGRQ CN KT +E VK +YDKYAPFQCLA
Sbjct: 245 GPFTTANVFMCLGLYDRLPIDTETIRHLKQVHGRQDCNMKTAEEAVKDVYDKYAPFQCLA 304

Query: 302 YWLELVEYYESKFGKLSELCSLDYNKISGTT 318
           YW+ELVEYYES+FGKLSEL   DY KISGTT
Sbjct: 305 YWMELVEYYESRFGKLSELGWHDYKKISGTT 334

BLAST of HG10003371 vs. NCBI nr
Match: XP_021905122.1 (uncharacterized protein LOC110820055 isoform X2 [Carica papaya])

HSP 1 Score: 341.7 bits (875), Expect = 7.0e-90
Identity = 179/332 (53.92%), Positives = 235/332 (70.78%), Query Frame = 0

Query: 7   LNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINH-SSS 66
           L+LG     F+LEKAVCNHG FMM PN W PS KTL+RPLRL SN +SSV+ SI+H S+S
Sbjct: 5   LHLGECKGSFNLEKAVCNHGFFMMPPNLWSPSKKTLERPLRL-SNVSSSVYASISHPSNS 64

Query: 67  FLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRS 126
             L IQ+H    +S  D+ AIL+QV RMLR+++KDE+ +R+FQ +H  AK  GFGR+FRS
Sbjct: 65  TFLVIQLHHIHNISSSDKHAILEQVGRMLRISKKDEEVVREFQKVHEAAKNKGFGRVFRS 124

Query: 127 PTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAK----MSNQSRKRKRKVI--------- 186
           P+LFED +KS+LLCN TW RTL MA+ LCELQ +    +S + RK++++           
Sbjct: 125 PSLFEDVVKSLLLCNCTWGRTLKMAKSLCELQYEIVRGISVEKRKKRKRTTNRSINDTMN 184

Query: 187 ------GNFPNAEEVCRMGVELLKKH-NLGYRAGYIINFAQRVQNGTINLQNPNHLPKIK 246
                 GNFPNAEE+  +  +LL++   LGYRA Y+IN AQ V++G ++L N   L KIK
Sbjct: 185 QEYFSKGNFPNAEELAGLSPDLLEERCKLGYRANYVINLAQLVKSGRLDLTNIQDLVKIK 244

Query: 247 GFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQC 306
           GFG F  AN+ MC+GFY+ +P DTET+RHLKQ+HG + C++ T+ +DVK IYDKY+PFQ 
Sbjct: 245 GFGSFVCANVSMCIGFYQNIPADTETMRHLKQVHGLETCSRSTLVKDVKAIYDKYSPFQA 304

Query: 307 LAYWLELVEYYESKFGKLSELCSLDYNKISGT 317
           LAYW EL+ YYESK GKLSEL    Y  ++G+
Sbjct: 305 LAYWFELLNYYESKCGKLSELPCSKYPSVTGS 335

BLAST of HG10003371 vs. ExPASy TrEMBL
Match: A0A6J1GJ25 (uncharacterized protein LOC111454659 OS=Cucurbita moschata OX=3662 GN=LOC111454659 PE=4 SV=1)

HSP 1 Score: 476.1 bits (1224), Expect = 1.2e-130
Identity = 241/328 (73.48%), Positives = 278/328 (84.76%), Query Frame = 0

Query: 4   MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS 63
           MI L LGV  SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN S
Sbjct: 1   MIELKLGVGVSDFNLEKAVCNHGAFMMAPNQWIPSSKTLQRPLRL-SNSDTSLLVSINQS 60

Query: 64  SSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLF 123
           SS LLT+QIHS   L P+D+ AILDQV RMLRLTEKDEDE+R+FQ+LHP AKQ+GFGR+F
Sbjct: 61  SSSLLTLQIHSPRSLPPKDEVAILDQVARMLRLTEKDEDEIRRFQNLHPTAKQIGFGRIF 120

Query: 124 RSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEE 183
           RSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM  +S+KRKRK     GNFPNA E
Sbjct: 121 RSPSLFEDVVKSILMCNTSWRRTLEMAEKLCEVQAKM-RESKKRKRKGNNERGNFPNARE 180

Query: 184 VCRMGVELLKKHNLGYRAGYIINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATA 243
           VCRMGVE LK H LGYRA Y++ FAQ V++G INLQ       +P+  PKIKGFGPFATA
Sbjct: 181 VCRMGVEALKNHCLGYRANYVVKFAQSVESGRINLQSLEKPVSSPDAFPKIKGFGPFATA 240

Query: 244 NLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV 303
           N+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYWLELV
Sbjct: 241 NIFMCLGFYHQLPIDTETIRHLKQVHGIQYCTKKTVGEDVKQIYDTYAPYQCLAYWLELV 300

Query: 304 EYYESKFGKLSELCSLDYNKISGTTLNL 321
           +YYE+KFGKLSEL S DY+KISG+TL+L
Sbjct: 301 QYYETKFGKLSELSSFDYHKISGSTLHL 326

BLAST of HG10003371 vs. ExPASy TrEMBL
Match: A0A6J1DS88 (uncharacterized protein LOC111023822 OS=Momordica charantia OX=3673 GN=LOC111023822 PE=4 SV=1)

HSP 1 Score: 442.2 bits (1136), Expect = 1.9e-120
Identity = 230/331 (69.49%), Positives = 263/331 (79.46%), Query Frame = 0

Query: 2   KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSIN 61
           ++MI LNLG  +S FDLE+AVCNHG FMM PN+WIPSSKTLQRPLRL ++S +SV VSI+
Sbjct: 5   RRMIDLNLGETTSGFDLERAVCNHGFFMMPPNKWIPSSKTLQRPLRL-ADSTTSVLVSIS 64

Query: 62  HSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGR 121
             SS LL IQIHSSP  SP D+QAILDQV RMLR+TE+DE+ +R FQ+LH KAK++GFGR
Sbjct: 65  QPSSHLLNIQIHSSPSFSPLDRQAILDQVTRMLRITERDEENIRNFQNLHAKAKEIGFGR 124

Query: 122 LFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK------- 181
           LFRSPTLFEDA+KSILLCN TW+RTLAMA QLCELQAK+        +KRKRK       
Sbjct: 125 LFRSPTLFEDAVKSILLCNATWRRTLAMAGQLCELQAKLGRGPITDGKKRKRKGKGECEL 184

Query: 182 VIGNFPNAEEVCRMGVELLKKHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGF 241
             GNFP A E+CRM V LL+KH +GYRA YII+ AQRVQNG I+LQ        PKIKGF
Sbjct: 185 EGGNFPTAAELCRMSVLLLQKHFIGYRAVYIIDLAQRVQNGKIDLQKIERALSFPKIKGF 244

Query: 242 GPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA 301
           GPF TAN+ MCLG Y +LPIDTETIRHLKQ+HGRQ CN KT +E VK +YDKYAPFQCLA
Sbjct: 245 GPFTTANVFMCLGLYDRLPIDTETIRHLKQVHGRQDCNMKTAEEAVKDVYDKYAPFQCLA 304

Query: 302 YWLELVEYYESKFGKLSELCSLDYNKISGTT 318
           YW+ELVEYYES+FGKLSEL   DY KISGTT
Sbjct: 305 YWMELVEYYESRFGKLSELGWHDYKKISGTT 334

BLAST of HG10003371 vs. ExPASy TrEMBL
Match: A0A6A1W9S6 (Uncharacterized protein OS=Morella rubra OX=262757 GN=CJ030_MR3G027886 PE=4 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 9.9e-90
Identity = 187/363 (51.52%), Positives = 248/363 (68.32%), Query Frame = 0

Query: 15  FDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSS---FLLTIQI 74
           F++EKAVCNHG FMM+PN WIPS+KTLQRPLRL +NS  SV VSI+H +S     + IQ+
Sbjct: 17  FNMEKAVCNHGFFMMAPNAWIPSTKTLQRPLRL-ANSAVSVLVSISHPASGTANYILIQV 76

Query: 75  HSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDA 134
           H +  +SPQD++AIL+QV RMLR++E+DE  LR+FQ+LHP+AK+ GFGR FRSP+LFEDA
Sbjct: 77  HDTDKVSPQDEKAILEQVARMLRISERDERNLREFQNLHPEAKEKGFGRCFRSPSLFEDA 136

Query: 135 IKSILLCNTTWKRTLAMAEQLCELQAKMSN-------------QSRKR--KRK------- 194
           IKS+LLCN TW RTL MA+ LCELQ +++N              SRKR  KRK       
Sbjct: 137 IKSLLLCNCTWTRTLDMAKALCELQWELANGLIPDKCENLARQYSRKRGLKRKQATRKQS 196

Query: 195 -----------------------VIGNFPNAEEVCRMGVELLKKH-NLGYRAGYIINFAQ 254
                                   +GNFP+++EV  +    L+ H NLGYRA YI+  A+
Sbjct: 197 KVKKCERNCSDNSQLPLKGKDCRPLGNFPSSKEVAMLNEYFLENHCNLGYRARYIVKLAK 256

Query: 255 RVQNGTINLQ--NPNH----------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH 314
           +V++G + L+  + +H          L KIKGFGPFA AN++MC+G+Y+ +P+DTET+RH
Sbjct: 257 QVESGKLKLKEFDDDHSATCEELYEKLTKIKGFGPFACANVMMCMGYYQLVPVDTETVRH 316

Query: 315 LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKI 317
           L+Q+HGR+   K+TV EDVK +YDK+APFQ LAYW EL+E+YE KFGKLSEL +  Y  +
Sbjct: 317 LRQVHGRK---KETVHEDVKDVYDKHAPFQSLAYWFELLEHYERKFGKLSELPNSSYRIV 375

BLAST of HG10003371 vs. ExPASy TrEMBL
Match: A0A2P5FT40 (DNA glycosylase OS=Trema orientale OX=63057 GN=TorRG33x02_031220 PE=4 SV=1)

HSP 1 Score: 333.2 bits (853), Expect = 1.2e-87
Identity = 187/366 (51.09%), Positives = 246/366 (67.21%), Query Frame = 0

Query: 4   MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS 63
           ++ L LG S S F++EKAVCNHG FMM+PN+W PS+KTLQRPLRL ++  SSV VSI+HS
Sbjct: 6   VLTLALGESKSSFNMEKAVCNHGFFMMAPNRWSPSAKTLQRPLRL-ADGASSVTVSISHS 65

Query: 64  --SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMG 123
              S LL I++    P   LS  D  AIL+QV RMLR+TE+DE ++R+FQ +HP+AK+ G
Sbjct: 66  PLHSHLLYIRVLLQSPSKGLSLSDSNAILEQVGRMLRITERDERDVREFQKVHPQAKERG 125

Query: 124 FGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQ---------------AKMSNQ 183
           FGR+FRSP+LFEDA+KSILLCN +W RTL MAE LC+LQ               +  SN+
Sbjct: 126 FGRVFRSPSLFEDAVKSILLCNCSWARTLKMAEALCKLQFEVTENHVHTIRRTTSSTSNK 185

Query: 184 SRKRKR------------KVIGNFPNAEEVCRM-GVELLKKHN--LGYRAGYIINFAQRV 243
             KRKR            +++GNFPNA E+  +     L+K+   LGYRA +I++ A+  
Sbjct: 186 DLKRKRAKSKASTDDDDSQIVGNFPNAREIASLDNSYFLEKYTPILGYRAKHILSLAKDF 245

Query: 244 QNGTIN--------LQNPNH-------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIR 303
           ++G +N         +   H       +  I+GFGPF  AN+LMC+  Y  +P D+ETIR
Sbjct: 246 ESGKLNGLEEAEKAAEEVLHHEEMIMIMKNIRGFGPFVCANVLMCIRIYENVPADSETIR 305

Query: 304 HLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNK 319
           HL+Q+H R+ CNKKT+Q++VK+IYDKYAPFQCLAYW+EL+EYYE KFGKLSEL    Y  
Sbjct: 306 HLQQVHARKNCNKKTIQKEVKEIYDKYAPFQCLAYWMELLEYYEDKFGKLSELPESSYKT 365

BLAST of HG10003371 vs. ExPASy TrEMBL
Match: A0A2P5ACW8 (DNA glycosylase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_344830 PE=4 SV=1)

HSP 1 Score: 332.8 bits (852), Expect = 1.6e-87
Identity = 187/365 (51.23%), Positives = 249/365 (68.22%), Query Frame = 0

Query: 4   MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS 63
           ++ L LG S S F++EKAVCNHG FMM+PN+W PS+KTLQRPLRL ++  SSV VSI+HS
Sbjct: 6   VLTLALGESKSSFNMEKAVCNHGFFMMAPNRWSPSAKTLQRPLRL-ADGASSVTVSISHS 65

Query: 64  --SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHPKAKQMG 123
              S LL I++    P   LS  D  AIL+QV RMLR+T++DE ++R+FQ +HP+AK+ G
Sbjct: 66  PLHSHLLYIRVLLQSPSKALSLSDSNAILEQVGRMLRITKRDERDVREFQKVHPQAKERG 125

Query: 124 FGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKM---------------SNQ 183
           FGR+FRSP+LFEDA+KSILLCN +W RTL MAE LC+LQ ++               SN+
Sbjct: 126 FGRVFRSPSLFEDAVKSILLCNCSWARTLKMAEALCKLQFEVTENHVHPIKKTTTSTSNK 185

Query: 184 SRKRKR-----------KVIGNFPNAEEVCRMGVE-LLKKHN--LGYRAGYIINFAQRVQ 243
             KRKR           +++GNFPNA E+  +     L+K+   LGYRA +I++ A+  +
Sbjct: 186 GLKRKRAKTKATDDDDSQIMGNFPNAREIASLDKSYFLEKYTPILGYRAKHILSLAKDFE 245

Query: 244 NGTIN---------LQNPNH------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH 303
           +G +N          +  +H      + KI+GFGPF  AN+LMC+  Y  +P D+ETIRH
Sbjct: 246 SGKLNGLEVAEKAEEEALHHEEMILIMKKIRGFGPFVCANVLMCIRIYENVPADSETIRH 305

Query: 304 LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKI 319
           L+Q+HGR+ CNKKT+ ++VK+IYDKYAPFQCLAYW+EL+EYYE KFGKLSEL    Y  I
Sbjct: 306 LQQVHGRKNCNKKTILKEVKEIYDKYAPFQCLAYWMELLEYYEDKFGKLSELPESSYKTI 365

The following BLAST results are available for this feature:

NCBI nr:
Match Name      E-value   Identity (%)  Description
XP_038877617.1  8.5e-136  86.53         uncharacterized protein LOC120069874 [Benincasa hispida]
XP_022951918.1  2.4e-130  73.48         uncharacterized protein LOC111454659 [Cucurbita moschata]
KAG6585875.1    3.1e-130  73.48         hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sorori...
XP_022156993.1  3.8e-120  69.49         uncharacterized protein LOC111023822 [Momordica charantia]
XP_021905122.1  7.0e-90   53.92         uncharacterized protein LOC110820055 isoform X2 [Carica papaya]

ExPASy TrEMBL:
Match Name      E-value   Identity (%)  Description
A0A6J1GJ25      1.2e-130  73.48         uncharacterized protein LOC111454659 OS=Cucurbita moschata OX=3662 GN=LOC1114546...
A0A6J1DS88      1.9e-120  69.49         uncharacterized protein LOC111023822 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6A1W9S6      9.9e-90   51.52         Uncharacterized protein OS=Morella rubra OX=262757 GN=CJ030_MR3G027886 PE=4 SV=1
A0A2P5FT40      1.2e-87   51.09         DNA glycosylase OS=Trema orientale OX=63057 GN=TorRG33x02_031220 PE=4 SV=1
A0A2P5ACW8      1.6e-87   51.23         DNA glycosylase OS=Parasponia andersonii OX=3476 GN=PanWU01x14_344830 PE=4 SV=1

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01

IPR Term   IPR Description   Source       Source Term    Source Description              Alignment
IPR003265  HhH-GPD domain    PFAM         PF00730        HhH-GPD                         coord: 132..274, e-value: 9.3E-7, score: 29.3
None       No IPR available  GENE3D       1.10.340.30    Hypothetical protein; domain 2  coord: 127..240, e-value: 7.6E-15, score: 57.2
None       No IPR available  PANTHER      PTHR10242      8-OXOGUANINE DNA GLYCOSYLASE    coord: 9..317
None       No IPR available  PANTHER      PTHR10242:SF7  BNAC06G12980D PROTEIN           coord: 9..317
IPR011257  DNA glycosylase   SUPERFAMILY  48150          DNA-glycosylase                 coord: 127..291

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10003371.1  HG10003371.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006284      base-excision repair
biological_process  GO:0006281      DNA repair
molecular_function  GO:0003824      catalytic activity