HG10016803 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10016803
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionShikimate kinase
LocationChr03: 8238266 .. 8242016 (+)
RNA-Seq ExpressionHG10016803
SyntenyHG10016803
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAGCTGGAAGTGGCAATTTTAACAAACTATATCTACAAAAAGGAAGGTGTGGCTGTGGCAATAGATTGCTCCATCAGCCAGGTCATCCTCCGCTGCTTGCACGGATTGATTCTCGGTTCTGATGAAGGTGGTGGTGCTCGTGATCTTCCTTCTGGTAAAGTAAATGATATCAACTCCAAGCTAGAGATTGAAGTTGTGAAGTCATATTGTCAAAGAGATCCAATTTTGGACATTGATCTTTGGAAGAAAGGAAGTCTTGTGAGAAACTGGGCAAGTGTGGCTCTCGGATGCATACAAGTGAGTTTTCTTTCAGATCTCATTGTAAATATGTTTAGTTAGTAAAAGCTAAAGCTATCTGCCTTGCACCATTATAGTAAGTGCAAAATAATCCCTTCATTTGGAATCCGGTTGTCTCCCCATTATAAATAGGGAACTCATGTTTTAATGATTATTCAAGTAAATCTAGAGATATATGCTCTGATTATTTAGCTTTACATTCTGCATTTTGGAGTAATCAGTGGCCATTTAATAAATGCAGAGAGAGAATATTGATTTTCATTCCTAATGTAAAATCTGCAGCTCCCTCTTTAACAAGTGAACGAACATTATGCATTTAGTGGTTGGTTAATATTTTCTTCACCCATCACTTATGGATTACAAATTACAATCAACTGTGTAAGATGTGTAAAGTATAGTTTTAACTTTTAAGTTTAGGAATAAGAATACAAGGTTGTCTAGATTGTTTAGTATCATAAAGTTTGCCCAAAAATGAGGAAGATTTTTTAAATCTGGGTCTTGTGACAGTGGGAATATTGAGATTTTTAATTTCGTTAGAAGTTATGTATCATTGTATCATATTTATAGATCTCTTTGATGAAGATTTTATTGAATAGTATGGTAATAGACGACATGCGAGTGAGATAAGTTGTTTTTTTAGTACAATATGTATAGGAAAATTTAAACTACGGACTTTTAGATTATTAACACATTCATATGTCAATTGAGTTATGCTCACTTTTGCGCCTTAATTTTCTGTTATATATGTTGTTATCCAAGATAATTGTTTCCTTGGACATTTGTCAAGAGAGATAGTGATGAAATAAGGCCACATCCTTACAATAGGTTTTTTTTTTTTTTTTTAAATTTTTATCCATTCGGACCAGCTTACGTGTACTTCGATTAATTTCATGGGACAACCTGCTTGATCCTACAATATTTGGATATAAAAAAAACTTGAAGGATATTAAAACACGCTAATTTACTATTGTAAGGGCATCCTTACAATAGTAAATTAACATGTTAAAAGTCAATAAAAGTGAATGTTTGCATGCATTGAAAGTAGTCTTGATTATTGAAAATTTGAATTGAAAATTTGATTCAGAAATTACTTGTCTGTTTAAAATTAACGCATTATTTTGTACGGTGTTTTATTTATTTATTTATTGCAAAATATATGTTCTAATTCGTTAGTGGTTGGAGACTTTTCAATATAGTTCACATTATTTAAAAGGTCCATCAGTTGTTAAAAGTAAAAATGCATATGCATAAATATGTATAGTGGGTTAGTAGAGATTTAGAATAAGAAAATATTAGGCGATGTTCAAATATAGTAAAATGAGTCAACTAATTATAAAATATAGCAAAATATTGCTAATTATCTGATAGATATTAATAGACGTTTATCGTTGAAAGACCGTAACATTTTGCTATATTTAAATTTTTTTTAATAATTTTTATCATTTAAAACAATTACCTAAAATATTATATGAAAATAACTTTTTAAAAAATTTAAATATTGGTAGAAGTTTTGAATTCCTTCCCCTTGAATGATCTTCTCTAAATTTTTTTCAATTCTATAATATTAGGTGATTCTTCTTTTTACTTGAACAATTGTAAGGTGAGAGATTAAACCATCGATCTTTGAGATGATAATTGATGATATCTCATCTATTTAACTATTCTTGAATGGGCATGTTTTGTGAGTTTATTATCAACTAATTTAGAAGAATTTTTTAAATGACATGGACTAAATTAAAAAAGAAAAAAAAACTAAAAAAACATTTATAGACAAAAAAAAAAGAAGAAAAATTATTCGTAGATAATAGATATTTGGGTTCTGGGCCTTGAGGACTACAAGCAGGCGGTGTTTTGGCAGGCCCAGAATCTTCACATCCTCTCGGCTATGCTCCAACTTCCGAGCTCGGAGATTGCTTTACCGAGCAACAACGGCAGAAGCAATGTTCAGAGCTTTGGAACTACCGCCACCGTGCCCGGCGGCGAAGCTTAATCTCGTTCACGCATTGCCAAATGACGTCAAAATCTGCCGACTACCTTATAATCTCGGCCTACCGAATCGTCGACTTTCTTTGCTTTCGATACGAGCACAATCGCTATCTGATCCATCGACTTCATCGCGTTATACGGAAACTATTGGACATTCCTCTCCAGCATTTCTTCAATTCTCTCAGTGCACGCTAACTCAACGCCACATCCTTGTTCTTAATGTCGTTGCCTGCGCGGTAATCCACCTATTTTAATTGCTTTCACTGAGAATCTGATTTCGAATGGACTTTTTGGATGCGCTTAATTCAGATATGTTTAGTTTCTTGCGGTCATTATTGTTGAAAGCATGAGCAATAAGTCATTCCTAGAACCAAATTTTGCGATTGCTTTCCCTTTTGACTGACAAAATTGAACTTGAAAATTTTATTTGCGTATTTCTGGCGCTAATAGTTGGAAATATGCGAGCAATTGCGTTATTTTGTTGTAATTTGCTTTACTTGCAATGCTTGTGATATTAGGGTCTTTGAATGCAAATGAAGCCGAGATATTCTGCTAAAATCCATAGTTATTGCTGAGAGTAATTCGATGTTTGGACTTCTTTGTAAAGCTGTTGAAATCTTATTTGGCTTGTACTTTGATTGGTAAATGGGAGGTGTGGCTGATAACTATGCAATTTCACATCTTTGTAGACGGCTATTTCTGCAACCTGGCTCTTTTGTTCTGCGATCCCCACTCTTCTGGTTTGTATTCTTCAACTTCGATTAAGAATAATGTTTTCAATTTGACTTGATGTCATATGGGAATGACTATATTTCCTGTTGTCTTAGCATCCATCTGCCCTTCTTTCAATGCTTGCAATGTTTTAATGTTAAAGGCATTCAAGAGAGCAGCCGAATCACTAGAGAAGCTCATGGATGTCACAAGGGAGGAACTTCCAGGCACTATGGCAGCCATTCGTTTATCTGGCATGGAAATTAGTGATCTGACCATGGAGCTCAGTGATCTTGGGTTTATATCAGATTTTGAATTCTTTTGTTTATTTTGTAAGTATTGATATATGTTGCAAAAAACAAGACACAAGTTTGATTTGGTATCTTTTGCAGCCAGGATATCACCCAAGGTGTGAGAAGTTCCACTAGAGCTGTTCGAGTAGCCGAAGAGAGATTGCGTCACTTGACAAACATGTCTCCAACAGGTTTTCTTATGATTTCTTTACCACTTATTTCTACAAAAGAGAGAGTATTGAATCTTGAGCTATGAATTCGTTTTTACTTGATTATCCTCAGTGCAGGAAATGACAATAACCAATCTGGGAGTGGAGACAGCAGAGCCGGTTCTGGCTAAAAGGGCAAGAGACATTAAGGAGGGGATTGTGAAAGGACGTTCCATCTTCCAATTGTTTCTCTCCCTTACAAGATTCTCTCGGCTGGCCTTGAATTATTTTAGCAAACGAGGTAAGAAGTAG

mRNA sequence

ATGCAGCTGGAAGTGGCAATTTTAACAAACTATATCTACAAAAAGGAAGGTGTGGCTGTGGCAATAGATTGCTCCATCAGCCAGGTCATCCTCCGCTGCTTGCACGGATTGATTCTCGGTTCTGATGAAGGTGGTGGTGCTCGTGATCTTCCTTCTGGTAAAGTAAATGATATCAACTCCAAGCTAGAGATTGAAGTTGTGAAGTCATATTGTCAAAGAGATCCAATTTTGGACATTGATCTTTGGAAGAAAGGAAGTCTTGTGAGAAACTGGGCAAGTGTGGCTCTCGGATGCATACAAACGGCTATTTCTGCAACCTGGCTCTTTTGTTCTGCGATCCCCACTCTTCTGGCATTCAAGAGAGCAGCCGAATCACTAGAGAAGCTCATGGATGTCACAAGGGAGGAACTTCCAGGCACTATGGCAGCCATTCGTTTATCTGGCATGGAAATTAGTGATCTGACCATGGAGCTCAGTGATCTTGGCCAGGATATCACCCAAGGTGTGAGAAGTTCCACTAGAGCTGTTCGAGTAGCCGAAGAGAGATTGCGTCACTTGACAAACATGTCTCCAACAGTGCAGGAAATGACAATAACCAATCTGGGAGTGGAGACAGCAGAGCCGGTTCTGGCTAAAAGGGCAAGAGACATTAAGGAGGGGATTGTGAAAGGACGTTCCATCTTCCAATTGTTTCTCTCCCTTACAAGATTCTCTCGGCTGGCCTTGAATTATTTTAGCAAACGAGGTAAGAAGTAG

Coding sequence (CDS)

ATGCAGCTGGAAGTGGCAATTTTAACAAACTATATCTACAAAAAGGAAGGTGTGGCTGTGGCAATAGATTGCTCCATCAGCCAGGTCATCCTCCGCTGCTTGCACGGATTGATTCTCGGTTCTGATGAAGGTGGTGGTGCTCGTGATCTTCCTTCTGGTAAAGTAAATGATATCAACTCCAAGCTAGAGATTGAAGTTGTGAAGTCATATTGTCAAAGAGATCCAATTTTGGACATTGATCTTTGGAAGAAAGGAAGTCTTGTGAGAAACTGGGCAAGTGTGGCTCTCGGATGCATACAAACGGCTATTTCTGCAACCTGGCTCTTTTGTTCTGCGATCCCCACTCTTCTGGCATTCAAGAGAGCAGCCGAATCACTAGAGAAGCTCATGGATGTCACAAGGGAGGAACTTCCAGGCACTATGGCAGCCATTCGTTTATCTGGCATGGAAATTAGTGATCTGACCATGGAGCTCAGTGATCTTGGCCAGGATATCACCCAAGGTGTGAGAAGTTCCACTAGAGCTGTTCGAGTAGCCGAAGAGAGATTGCGTCACTTGACAAACATGTCTCCAACAGTGCAGGAAATGACAATAACCAATCTGGGAGTGGAGACAGCAGAGCCGGTTCTGGCTAAAAGGGCAAGAGACATTAAGGAGGGGATTGTGAAAGGACGTTCCATCTTCCAATTGTTTCTCTCCCTTACAAGATTCTCTCGGCTGGCCTTGAATTATTTTAGCAAACGAGGTAAGAAGTAG

Protein sequence

MQLEVAILTNYIYKKEGVAVAIDCSISQVILRCLHGLILGSDEGGGARDLPSGKVNDINSKLEIEVVKSYCQRDPILDIDLWKKGSLVRNWASVALGCIQTAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSDLGQDITQGVRSSTRAVRVAEERLRHLTNMSPTVQEMTITNLGVETAEPVLAKRARDIKEGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK
Homology
BLAST of HG10016803 vs. NCBI nr
Match: XP_008440149.1 (PREDICTED: uncharacterized protein LOC103484701 isoform X2 [Cucumis melo])

HSP 1 Score: 273.9 bits (699), Expect = 1.4e-69
Identity = 146/151 (96.69%), Postives = 149/151 (98.68%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD
Sbjct: 137 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 196

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPTVQEMTITNLGVETAEPVLAKRARDIKEG 220
           LGQDITQGVRSSTRAVRVAEERLR LTNMSPTVQEMTITNLGV+ A+PVLAKRARDIKEG
Sbjct: 197 LGQDITQGVRSSTRAVRVAEERLRRLTNMSPTVQEMTITNLGVKGADPVLAKRARDIKEG 256

Query: 221 IVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
           IVKGRSIFQLFLS+TRFSRLALNYFSKRGKK
Sbjct: 257 IVKGRSIFQLFLSITRFSRLALNYFSKRGKK 287

BLAST of HG10016803 vs. NCBI nr
Match: XP_038880851.1 (uncharacterized protein LOC120072535 [Benincasa hispida])

HSP 1 Score: 271.2 bits (692), Expect = 9.2e-69
Identity = 146/153 (95.42%), Postives = 150/153 (98.04%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMEL+D
Sbjct: 95  TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELND 154

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPT--VQEMTITNLGVETAEPVLAKRARDIK 220
           LGQDITQGVRSSTRAVRVAE+RLR LTNM+PT  VQEMT+TNLGVETAEPVLAKRARDIK
Sbjct: 155 LGQDITQGVRSSTRAVRVAEDRLRRLTNMTPTASVQEMTVTNLGVETAEPVLAKRARDIK 214

Query: 221 EGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
           EGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK
Sbjct: 215 EGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 247

BLAST of HG10016803 vs. NCBI nr
Match: XP_008440148.1 (PREDICTED: uncharacterized protein LOC103484701 isoform X1 [Cucumis melo])

HSP 1 Score: 268.9 bits (686), Expect = 4.5e-68
Identity = 146/153 (95.42%), Postives = 149/153 (97.39%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD
Sbjct: 137 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 196

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPT--VQEMTITNLGVETAEPVLAKRARDIK 220
           LGQDITQGVRSSTRAVRVAEERLR LTNMSPT  VQEMTITNLGV+ A+PVLAKRARDIK
Sbjct: 197 LGQDITQGVRSSTRAVRVAEERLRRLTNMSPTASVQEMTITNLGVKGADPVLAKRARDIK 256

Query: 221 EGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
           EGIVKGRSIFQLFLS+TRFSRLALNYFSKRGKK
Sbjct: 257 EGIVKGRSIFQLFLSITRFSRLALNYFSKRGKK 289

BLAST of HG10016803 vs. NCBI nr
Match: XP_011657800.1 (uncharacterized protein LOC101204218 isoform X2 [Cucumis sativus])

HSP 1 Score: 266.9 bits (681), Expect = 1.7e-67
Identity = 143/151 (94.70%), Postives = 146/151 (96.69%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTMAAIRLSGMEISDLTMELSD
Sbjct: 136 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMAAIRLSGMEISDLTMELSD 195

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPTVQEMTITNLGVETAEPVLAKRARDIKEG 220
           LGQ ITQGVRSSTRAVRVAEERLR LTNMSPTVQEMTITNLGV  AEPVLAKRA+DIKEG
Sbjct: 196 LGQGITQGVRSSTRAVRVAEERLRRLTNMSPTVQEMTITNLGVRGAEPVLAKRAKDIKEG 255

Query: 221 IVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
           I+KGRSIFQLFLSLTRFS LALNYFSKRGKK
Sbjct: 256 ILKGRSIFQLFLSLTRFSGLALNYFSKRGKK 286

BLAST of HG10016803 vs. NCBI nr
Match: XP_022977945.1 (uncharacterized protein LOC111478086 isoform X1 [Cucurbita maxima])

HSP 1 Score: 264.2 bits (674), Expect = 1.1e-66
Identity = 143/153 (93.46%), Postives = 148/153 (96.73%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           TAI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD
Sbjct: 100 TAIAATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 159

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPT--VQEMTITNLGVETAEPVLAKRARDIK 220
           LGQ+ITQGVRSSTRAVRVAEERLR LTNM+PT  VQEMT+ NLGVE AEPVLAKRARDIK
Sbjct: 160 LGQNITQGVRSSTRAVRVAEERLRSLTNMTPTAKVQEMTVANLGVEAAEPVLAKRARDIK 219

Query: 221 EGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
           EGIVKGRSIFQLFLSLTRFSRLALN+FSKRGKK
Sbjct: 220 EGIVKGRSIFQLFLSLTRFSRLALNHFSKRGKK 252

BLAST of HG10016803 vs. ExPASy TrEMBL
Match: A0A1S3B011 (uncharacterized protein LOC103484701 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103484701 PE=4 SV=1)

HSP 1 Score: 273.9 bits (699), Expect = 6.8e-70
Identity = 146/151 (96.69%), Postives = 149/151 (98.68%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD
Sbjct: 137 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 196

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPTVQEMTITNLGVETAEPVLAKRARDIKEG 220
           LGQDITQGVRSSTRAVRVAEERLR LTNMSPTVQEMTITNLGV+ A+PVLAKRARDIKEG
Sbjct: 197 LGQDITQGVRSSTRAVRVAEERLRRLTNMSPTVQEMTITNLGVKGADPVLAKRARDIKEG 256

Query: 221 IVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
           IVKGRSIFQLFLS+TRFSRLALNYFSKRGKK
Sbjct: 257 IVKGRSIFQLFLSITRFSRLALNYFSKRGKK 287

BLAST of HG10016803 vs. ExPASy TrEMBL
Match: A0A1S3B164 (uncharacterized protein LOC103484701 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484701 PE=4 SV=1)

HSP 1 Score: 268.9 bits (686), Expect = 2.2e-68
Identity = 146/153 (95.42%), Postives = 149/153 (97.39%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD
Sbjct: 137 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 196

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPT--VQEMTITNLGVETAEPVLAKRARDIK 220
           LGQDITQGVRSSTRAVRVAEERLR LTNMSPT  VQEMTITNLGV+ A+PVLAKRARDIK
Sbjct: 197 LGQDITQGVRSSTRAVRVAEERLRRLTNMSPTASVQEMTITNLGVKGADPVLAKRARDIK 256

Query: 221 EGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
           EGIVKGRSIFQLFLS+TRFSRLALNYFSKRGKK
Sbjct: 257 EGIVKGRSIFQLFLSITRFSRLALNYFSKRGKK 289

BLAST of HG10016803 vs. ExPASy TrEMBL
Match: A0A6J1INQ2 (uncharacterized protein LOC111478086 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111478086 PE=4 SV=1)

HSP 1 Score: 264.2 bits (674), Expect = 5.4e-67
Identity = 143/153 (93.46%), Postives = 148/153 (96.73%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           TAI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD
Sbjct: 100 TAIAATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 159

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPT--VQEMTITNLGVETAEPVLAKRARDIK 220
           LGQ+ITQGVRSSTRAVRVAEERLR LTNM+PT  VQEMT+ NLGVE AEPVLAKRARDIK
Sbjct: 160 LGQNITQGVRSSTRAVRVAEERLRSLTNMTPTAKVQEMTVANLGVEAAEPVLAKRARDIK 219

Query: 221 EGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
           EGIVKGRSIFQLFLSLTRFSRLALN+FSKRGKK
Sbjct: 220 EGIVKGRSIFQLFLSLTRFSRLALNHFSKRGKK 252

BLAST of HG10016803 vs. ExPASy TrEMBL
Match: A0A5D3CRC7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G005520 PE=4 SV=1)

HSP 1 Score: 262.7 bits (670), Expect = 1.6e-66
Identity = 143/150 (95.33%), Postives = 146/150 (97.33%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD
Sbjct: 95  TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 154

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPT--VQEMTITNLGVETAEPVLAKRARDIK 220
           LGQDITQGVRSSTRAVRVAEERLR LTNMSPT  VQEMTITNLGV+ A+PVLAKRARDIK
Sbjct: 155 LGQDITQGVRSSTRAVRVAEERLRRLTNMSPTASVQEMTITNLGVKGADPVLAKRARDIK 214

Query: 221 EGIVKGRSIFQLFLSLTRFSRLALNYFSKR 249
           EGIVKGRSIFQLFLS+TRFSRLALNYFSKR
Sbjct: 215 EGIVKGRSIFQLFLSITRFSRLALNYFSKR 244

BLAST of HG10016803 vs. ExPASy TrEMBL
Match: A0A6J1GDG4 (uncharacterized protein LOC111452981 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111452981 PE=4 SV=1)

HSP 1 Score: 258.8 bits (660), Expect = 2.3e-65
Identity = 143/154 (92.86%), Postives = 147/154 (95.45%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           TAI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD
Sbjct: 100 TAIAATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 159

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPT--VQEMTITNLG-VETAEPVLAKRARDI 220
           LGQDITQGVRSSTRAVRVAEERLR LTNM+PT  VQEMT+ NLG VE AEPVLAKRARDI
Sbjct: 160 LGQDITQGVRSSTRAVRVAEERLRSLTNMTPTAKVQEMTVANLGVVEAAEPVLAKRARDI 219

Query: 221 KEGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
           K GIVKGRSIFQLFLSLTRFSRLALN+FSKRGKK
Sbjct: 220 KGGIVKGRSIFQLFLSLTRFSRLALNHFSKRGKK 253

BLAST of HG10016803 vs. TAIR 10
Match: AT5G09995.3 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G08530.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 202.2 bits (513), Expect = 4.9e-52
Identity = 104/156 (66.67%), Postives = 131/156 (83.97%), Query Frame = 0

Query: 96  LGCIQTAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLT 155
           + C+ TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREELP TMAA+RLSGMEISDLT
Sbjct: 102 VACV-TAISASWLFFAAIPTLLAFKKAAESLEKLLDVTREELPDTMAAVRLSGMEISDLT 161

Query: 156 MELSDLGQDITQGVRSSTRAVRVAEERLRHLTNMSPTVQEMTITNLGVETAEPVLAKRAR 215
           MELSDLGQ ITQGV+SSTRA+RVAE+RLR LTNM+P   +  +     +  EP+LAK+AR
Sbjct: 162 MELSDLGQGITQGVKSSTRAIRVAEDRLRRLTNMNPASMQEVMRQTKTDETEPMLAKQAR 221

Query: 216 DIKEGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
             +EG+VKGRS++QLF ++TRFS++  +Y +KR K+
Sbjct: 222 SFREGVVKGRSLWQLFSTITRFSKITTSYLAKRAKQ 256

BLAST of HG10016803 vs. TAIR 10
Match: AT5G09995.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G08530.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 198.4 bits (503), Expect = 7.0e-51
Identity = 104/157 (66.24%), Postives = 131/157 (83.44%), Query Frame = 0

Query: 96  LGCIQTAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLT 155
           + C+ TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREELP TMAA+RLSGMEISDLT
Sbjct: 102 VACV-TAISASWLFFAAIPTLLAFKKAAESLEKLLDVTREELPDTMAAVRLSGMEISDLT 161

Query: 156 MELSDLGQDITQGVRSSTRAVRVAEERLRHLTNMSPTV-QEMTITNLGVETAEPVLAKRA 215
           MELSDLGQ ITQGV+SSTRA+RVAE+RLR LTNM+P    +  +     +  EP+LAK+A
Sbjct: 162 MELSDLGQGITQGVKSSTRAIRVAEDRLRRLTNMNPVASMQEVMRQTKTDETEPMLAKQA 221

Query: 216 RDIKEGIVKGRSIFQLFLSLTRFSRLALNYFSKRGKK 252
           R  +EG+VKGRS++QLF ++TRFS++  +Y +KR K+
Sbjct: 222 RSFREGVVKGRSLWQLFSTITRFSKITTSYLAKRAKQ 257

BLAST of HG10016803 vs. TAIR 10
Match: AT5G09995.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G08530.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 150.2 bits (378), Expect = 2.2e-36
Identity = 79/96 (82.29%), Postives = 90/96 (93.75%), Query Frame = 0

Query: 96  LGCIQTAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLT 155
           + C+ TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREELP TMAA+RLSGMEISDLT
Sbjct: 102 VACV-TAISASWLFFAAIPTLLAFKKAAESLEKLLDVTREELPDTMAAVRLSGMEISDLT 161

Query: 156 MELSDLGQDITQGVRSSTRAVRVAEERLRHLTNMSP 192
           MELSDLGQ ITQGV+SSTRA+RVAE+RLR LTNM+P
Sbjct: 162 MELSDLGQGITQGVKSSTRAIRVAEDRLRRLTNMNP 196

BLAST of HG10016803 vs. TAIR 10
Match: AT1G08530.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G09995.2); Has 140 Blast hits to 140 proteins in 53 species: Archae - 0; Bacteria - 63; Metazoa - 0; Fungi - 0; Plants - 76; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 97.4 bits (241), Expect = 1.7e-20
Identity = 58/127 (45.67%), Postives = 81/127 (63.78%), Query Frame = 0

Query: 101 TAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREELPGTMAAIRLSGMEISDLTMELSD 160
           T+++ T L  +AIPTL+A  RAA S  KL D  R+ELP T+AA+RLSGMEISDLT+ELSD
Sbjct: 115 TSVAFTSLVITAIPTLVAMGRAATSFAKLADTARKELPSTLAAVRLSGMEISDLTLELSD 174

Query: 161 LGQDITQGVRSSTRAVRVAEERLRHLTNMSPTVQEMTIT------NLGVETAEPVLAKRA 220
           L QDIT G+  S +AV+ AE  ++ +  ++   Q+ T++      NL   + +PV+A  A
Sbjct: 175 LSQDITDGINKSAKAVQAAEAGIKQIGTLA---QQQTLSMIEERANLPEISLQPVVAGAA 234

Query: 221 RDIKEGI 222
                 I
Sbjct: 235 EKTSHAI 238

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008440149.11.4e-6996.69PREDICTED: uncharacterized protein LOC103484701 isoform X2 [Cucumis melo][more]
XP_038880851.19.2e-6995.42uncharacterized protein LOC120072535 [Benincasa hispida][more]
XP_008440148.14.5e-6895.42PREDICTED: uncharacterized protein LOC103484701 isoform X1 [Cucumis melo][more]
XP_011657800.11.7e-6794.70uncharacterized protein LOC101204218 isoform X2 [Cucumis sativus][more]
XP_022977945.11.1e-6693.46uncharacterized protein LOC111478086 isoform X1 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A1S3B0116.8e-7096.69uncharacterized protein LOC103484701 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A1S3B1642.2e-6895.42uncharacterized protein LOC103484701 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1INQ25.4e-6793.46uncharacterized protein LOC111478086 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A5D3CRC71.6e-6695.33Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A6J1GDG42.3e-6592.86uncharacterized protein LOC111452981 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
Match NameE-valueIdentityDescription
AT5G09995.34.9e-5266.67unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G09995.27.0e-5166.24unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G09995.12.2e-3682.29unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G08530.11.7e-2045.67unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 116..136
NoneNo IPR availablePANTHERPTHR33825:SF5TRANSMEMBRANE PROTEINcoord: 99..251
NoneNo IPR availablePANTHERPTHR33825CHITINASE-LIKE PROTEINcoord: 99..251

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10016803.1HG10016803.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane