HG10004466 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10004466
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Location: Chr08: 17334600 .. 17338710 (-)
RNA-Seq Expression: HG10004466
Synteny: HG10004466
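
The Location field can be turned into a genomic span with a few lines of Python. This is only a sketch: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr08: 17334600 .. 17338710 (-)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr08: 17334600 .. 17338710 (-)")
print(chrom, strand, span_length(start, end))  # Chr08 - 4111
```

Under that assumption the gene spans 4111 bp on the minus strand of Chr08.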
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTGGATTTTTTCAATTTCTTCCTCTTCTTCAACCTACCTTCGTTCCACCATCCAATCCCACCGCCACCGCCACCACCTTCGTCACCGCCATGGATCAAACCTTAACACAACTCCGAGAAGGTTGCCTTTCCCAACTACCAAAAATGCCGAAAACTCAACTTTCCATATTATCACAACTTGCAGAGATAATATGGCCATTTACAGCGGCGAAGCCGATGGACTTCCACCTTTTCCTCCTCTCCCTGACACTCCTTTGTAATTTCACTTTTCTTATATTTTATGTTTTTTAGTTGATTAGTGATTTAATAAGTCTCGTATTAAATCGATTTGCTTTAATATTTGAGATTGGATTTGAATTGCAGGCCATTATGGATAGCTGGGGTGCTATTGTCTGCAATTCTATCAATTTGGAAGACCCAGAACTATTGGGCGCCTTTTCTTATGTTAAAGGGTAAGGAGTATAAATCGGAAATATAATCTTTTTTATTTTTTATTTTTTAAAATTAAACTTATTTTCTCTATTTCTTTACCAGAATTTACATCTTCCTTAACTATATATATATATATTTTTTTTTTTTTTGTTAGTTTTCAAATTTCCATTTAGTTTTGGAAAATATTGGTACAAATTAAGCAAAAATTTTAGATGTGGAAAAAATGTATATAAGTTTAAATCTAAAAAAATAAAAACCCAAATGATTACCAAATCGGGTTTTAACTTTTGTTTTTAATTTTTGAAAATTAAGTTTATTTCCTTTCAATTTCTTATAATGATTTGCATTTTTCTTAAGTAAAAATGTTGAATTCTTAGCCAAATTCCAAAATAAAAAAATTTTAAAAACTACCGCTACATTTGGTAACCATTTTATTTTTTCGTTTTTTATTTTAAAAAATTAAGCTTATAAACCCTTCTTCCATCTCTAAATTTCTAAATTTATTATCTACTTTTTACTTATATTTCCAAAAACCAAGCCAAAATTTGAAAACTAGAAAAAGTAGGAAAATTACTATAAACAGAAAAAATATCAAATCATTTACAAATATAGAAAATTTTTACTTTCTATCAGTGATAGATCGTGATAGAATTCTATCGCTTTCTATCGCGGTCTATCACTGATAGACTGTAAAAAAATTTATATATTTATAAATAGTTTGACTTATTTTTTTATATATGAAAAGAACCCAAAAAAAGTAGGTTTTAAAAATTTGTTTTTATTTTTACAATTTGGTCTAGAATTCAACTCTTTTCAACATTATAAAAAAAATGTGAGAAAATAGACTTAATTTTTAAAAACAAAAACAAAAAAAGGAAATGATTACGACGAAAACTAATAAATTTGATTCGATTTATTTATTTATTTCTATAAATTAAACTGAGAGATATATATAATCTGGGGTTGGTTTACGTTAAGTGAATATGATGATTTTGTATAGAGAAAGTGGATCAAGTGGTGGATGTGGTGGAAGATGTGGCGGAAATGGTGGAGACGGCGGCGGAGACAGTGGATAAGGTGGCGGAAGAGATTGCAGAACATTCTCCGGAAGGCAGCAAACTCCAACAGGCTGCATTATTCGTCGAAAAATCAGCGGAAAGGATTGCTAAGGATGCTGATTTTGCAGGAGATATTATTGACAAGGTTAGCCCTTTTCTCCAAATTCTAAACTTTACCCCATCAATTCATCTTAATTTTTTTTCTTTATTTCTATGTCCATATAGTTCAACGTCAAATATGAATTGGCATTTCTCTTCTATTCAATTAAGGGCGCATGAAATGTAACATAATTTAAATTGATATCTAAGTTTGACTCAATAATAAACCAATAGAAATTTTTTCTTTTCTCTTCTTTTAAAAAAAAAGTAAAATATTTATTTGTTAATTAATTCATAATTTTCTACTTATCAATAACATTTCTATCAACATCCATATTTAGAACCTACTTTTTTTTAAAAGTTGCAATATTACCAACTCTTAGAGTAAAACTACCGTACGAAAATCGAGA
TCTAAAAAAGTGAAAGTAAAAGTTACCACAAATTTTAGGTCACAATCACACCTATTCTAGATCCTAATTAATTTTTTAAAATTTTAATGAGTTCTTAATTCAATGATAGTTTATTCTAGTTTTCAGAAATAAAAATAATATATATTTAACTAAGTGAAAACGTTCATAAATAATAAGTATATTTTTCCTTTTAATTTAGCATTAAAAAATATTCCCTTTTTCTATGCTTGTAGAACATCTTTCTCTTTAAAAGAACTCGAGCACACCTCAACCAACTTGGTATTGAATTATTTAATTATTTGATTTGTTTTAAATATATCTGTCTTATTTAAAAAATTATTAAAATATTACCATTTAATTTTCATAACCATTATAATATTATAGGAGCAGTAATTTAAAAATAAAAATTTAGATCACACACTATTCTAATTCAATGCCACACCATTTTAAGTCTCAATTTTTTTTACTGTAAATTTTAAAGAATTTATACTCTATGAATAATTTTGAAATGTTTATGAAAAGATTGAGGTAATATTGTAATTTCATAAATTAAACAAATAATAATTTATGAAATTATTATTATAGTAGTTGCTAAAAAATAATAATAATAAATTTTTTCATAAATTATGTTGAGTTTATTTATGTTTATTTGGTTTATTATAGTAGTTTGAAATAATAATAATAATAATAATAATAATAATAATATATTTGTCATAAATACAATAATAATAATATTGTAATTTAATAAGACCATGAATTAAATAAATAATAATTTCATAGCCTTACAATTGTAGTAGGTGCTCTCCTATTAGTAGAAATTAGTTGGTTGATTAAATGTTATGGATGAGACTGAAAAAAGGGGCAATAATAGTGTTTTGAGTACAATTATAATAAGAGTAGGAGGACTCAATAATAATTTATATTGTGCTAACATATCTGTTTGTTGAACTAGCAAAATAATAATAGTAATAATAATAATTAAAAAAAAAAAAAAGTGTTTTACTGTGTATGGTGGGGGGTTTGATTAATTTGTATGGCTATTCTCAATTGCGTTTGTCTGCTCCCATTTCCCATCAATTTAAACCACCGTACCTTTGGTCGGAAAAATTTGAAGTTTCAATTACTAAATTTCATATTCTTTAACAATGCATTATATTTTAATATATTATTACTTTTATCCACGTTTTTTTTTATGAGGACTCCCTACCTAACCAATTGACGTAATCGACCCTTCTCTTTTGAGAAATCTAGTTTGGCCTATTTTCAAAATCTTCCAATTCTGTCCTTTTCACTTTACAATTCATTACGTTTTTTTTTTCCTCTCTCTCTTTGTTCGAACATGAGTTTTATCTCAAAACTAATTTGACAATAAAAAAAAAATAATCAATGCATCGAACTCTCCTTTTTTTTTTTTAATGTTAAATTCTCGACATACCCCTCAAGATGACGTGTTTTTATATCTGTGAGTGTGATAGCCAACTTACACACACCTTAACTAATCTCATGGAACAACCTATGTGCCCCTATAATATTTGGGTGTTATGCCCCTTAAGATGACATAAATTTTGAATTGACCATTTTTGGATTAAGCTGTGACACCATCTAGATAATATTTAGTTTTTTTTTATGTCAGATGCCTCTTCAAGATGGTGCCTCTTGGGTACCATTCTAGGTCTGATCTCTTTTTCTTTTTTTAGACCAAATATATCTGTTGAAGTTTCATAGACTTTGATATCATATTAGATAAACATGATATTTCATCTCAAAACTAATTAGCAATGAGTGTCCTATCTATTTTATCAAGATTGGAAAGTGATTTCTCCATCTTTTTAATGTGGAGCCACCATTTAGTTTTAAGATAGAAATTCTGAGATCATCTAACAACCTTTAATTTAATCAATAAATAATATTATTTAATTTTATTAATGGAGACTTTTTGTTTTTCTAATATGATTTATTATGAATATAAACCTCAGATTGAGGAAGCAGAAGACGAGTTG
AGTACATTTATTAAGCAAAATGGAGAAAGTAAAGAAGAAGACGACGCAAAGCAAGAGAATGATCAAACTGAAGCCCGCGTCGAGGAACAAAGCAAGAAGGCAACCTCTTAA

mRNA sequence

ATGTGGATTTTTTCAATTTCTTCCTCTTCTTCAACCTACCTTCGTTCCACCATCCAATCCCACCGCCACCGCCACCACCTTCGTCACCGCCATGGATCAAACCTTAACACAACTCCGAGAAGGTTGCCTTTCCCAACTACCAAAAATGCCGAAAACTCAACTTTCCATATTATCACAACTTGCAGAGATAATATGGCCATTTACAGCGGCGAAGCCGATGGACTTCCACCTTTTCCTCCTCTCCCTGACACTCCTTTGCCATTATGGATAGCTGGGGTGCTATTGTCTGCAATTCTATCAATTTGGAAGACCCAGAACTATTGGGCGCCTTTTCTTATGTTAAAGGAGAAAGTGGATCAAGTGGTGGATGTGGTGGAAGATGTGGCGGAAATGGTGGAGACGGCGGCGGAGACAGTGGATAAGGTGGCGGAAGAGATTGCAGAACATTCTCCGGAAGGCAGCAAACTCCAACAGGCTGCATTATTCGTCGAAAAATCAGCGGAAAGGATTGCTAAGGATGCTGATTTTGCAGGAGATATTATTGACAAGATTGAGGAAGCAGAAGACGAGTTGAGTACATTTATTAAGCAAAATGGAGAAAGTAAAGAAGAAGACGACGCAAAGCAAGAGAATGATCAAACTGAAGCCCGCGTCGAGGAACAAAGCAAGAAGGCAACCTCTTAA

Coding sequence (CDS)

ATGTGGATTTTTTCAATTTCTTCCTCTTCTTCAACCTACCTTCGTTCCACCATCCAATCCCACCGCCACCGCCACCACCTTCGTCACCGCCATGGATCAAACCTTAACACAACTCCGAGAAGGTTGCCTTTCCCAACTACCAAAAATGCCGAAAACTCAACTTTCCATATTATCACAACTTGCAGAGATAATATGGCCATTTACAGCGGCGAAGCCGATGGACTTCCACCTTTTCCTCCTCTCCCTGACACTCCTTTGCCATTATGGATAGCTGGGGTGCTATTGTCTGCAATTCTATCAATTTGGAAGACCCAGAACTATTGGGCGCCTTTTCTTATGTTAAAGGAGAAAGTGGATCAAGTGGTGGATGTGGTGGAAGATGTGGCGGAAATGGTGGAGACGGCGGCGGAGACAGTGGATAAGGTGGCGGAAGAGATTGCAGAACATTCTCCGGAAGGCAGCAAACTCCAACAGGCTGCATTATTCGTCGAAAAATCAGCGGAAAGGATTGCTAAGGATGCTGATTTTGCAGGAGATATTATTGACAAGATTGAGGAAGCAGAAGACGAGTTGAGTACATTTATTAAGCAAAATGGAGAAAGTAAAGAAGAAGACGACGCAAAGCAAGAGAATGATCAAACTGAAGCCCGCGTCGAGGAACAAAGCAAGAAGGCAACCTCTTAA

Protein sequence

MWIFSISSSSSTYLRSTIQSHRHRHHLRHRHGSNLNTTPRRLPFPTTKNAENSTFHIITTCRDNMAIYSGEADGLPPFPPLPDTPLPLWIAGVLLSAILSIWKTQNYWAPFLMLKEKVDQVVDVVEDVAEMVETAAETVDKVAEEIAEHSPEGSKLQQAALFVEKSAERIAKDADFAGDIIDKIEEAEDELSTFIKQNGESKEEDDAKQENDQTEARVEEQSKKATS
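
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative. Only the first ten and last six codons of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above, written one codon per literal.
cds_start = "ATG" "TGG" "ATT" "TTT" "TCA" "ATT" "TCT" "TCC" "TCT" "TCT"
# Last six codons of the CDS, ending in the TAA stop codon.
cds_end = "AAG" "AAG" "GCA" "ACC" "TCT" "TAA"

print(translate(cds_start))  # MWIFSISSSS
print(translate(cds_end))    # KKATS
```

Both fragments agree with the start (MWIFSISSSS...) and end (...KKATS) of the protein sequence shown above.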
Homology
BLAST of HG10004466 vs. NCBI nr
Match: XP_038886505.1 (uncharacterized protein LOC120076681 isoform X1 [Benincasa hispida])

HSP 1 Score: 276.6 bits (706), Expect = 2.0e-70
Identity = 169/231 (73.16%), Positives = 193/231 (83.55%), Query Frame = 0

Query: 1   MWIFSISSSSSTYLRST-IQSHRHRHHLRHRHGSNLNTTPRRLPFPTTKNAENSTFHIIT 60
           M I SI SSSST+LRST IQSH      R RHGSNLNTT   LPFPTT N + S     T
Sbjct: 1   MSIISI-SSSSTFLRSTNIQSH------RRRHGSNLNTT-HILPFPTTTNPKYS-----T 60

Query: 61  TCRDN--MAIY-SGEADGLPPFPPLPDTPLPLWIAGVLLSAILSIWKTQNYWAPFLMLKE 120
           +CR N  MAIY SG+A+GLPP PPLPDTP PLW+AGVL+SA+LSIW+T++YW PFL LKE
Sbjct: 61  SCRHNIDMAIYSSGDANGLPPLPPLPDTPWPLWVAGVLMSAVLSIWQTKSYWGPFLTLKE 120

Query: 121 KVDQVVDVVEDVAEMVETAAETVDKVAEEIAEHSPEGSKLQQAALFVEKSAERIAKDADF 180
           KVD+VVDVVE+VAEMVETAAE VDKVAEEIAEH PEGSKLQ+AALFVE +AERIAKDAD 
Sbjct: 121 KVDKVVDVVEEVAEMVETAAERVDKVAEEIAEHLPEGSKLQKAALFVENAAERIAKDADL 180

Query: 181 AGDIIDKIEEAEDELSTFIKQNGESKEEDDAKQENDQTEARVEEQSKKATS 228
           AGDIIDK+E AE+ELS+FIKQNGESKE +D+KQ+NDQTEA ++EQS+ ATS
Sbjct: 181 AGDIIDKLEAAEEELSSFIKQNGESKEVEDSKQKNDQTEA-LDEQSQGATS 217
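
The percentages on an HSP statistics line are the matched-column counts divided by the alignment length, which may include gap columns. A minimal sketch that recomputes them from the first hit's statistics line follows; `parse_hsp_stats` is a hypothetical helper, not part of any BLAST toolkit.

```python
import re

def parse_hsp_stats(line):
    """Return {label: (matched, alignment_length, percent)} for each 'X = a/b' field."""
    stats = {}
    for label, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matched, length = int(matched), int(length)
        stats[label] = (matched, length, round(100 * matched / length, 2))
    return stats

# Statistics line of the first hit (XP_038886505.1).
line = "Identity = 169/231 (73.16%), Positives = 193/231 (83.55%), Query Frame = 0"
for label, (matched, length, percent) in parse_hsp_stats(line).items():
    print(f"{label}: {matched}/{length} = {percent}%")
# Identity: 169/231 = 73.16%
# Positives: 193/231 = 83.55%
```

The "Query Frame = 0" field is skipped because it has no a/b fraction.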

BLAST of HG10004466 vs. NCBI nr
Match: XP_038886507.1 (uncharacterized protein LOC120076681 isoform X3 [Benincasa hispida] >XP_038886572.1 uncharacterized protein LOC120076742 isoform X2 [Benincasa hispida])

HSP 1 Score: 232.6 bits (592), Expect = 3.3e-57
Identity = 138/188 (73.40%), Positives = 154/188 (81.91%), Query Frame = 0

Query: 1   MWIFSISSSSSTYLRST-IQSHRHRHHLRHRHGSNLNTTPRRLPFPTTKNAENSTFHIIT 60
           M I SI SSSST+LRST IQSH      R RHGSNLNTT   LPFPTT N + S     T
Sbjct: 1   MSIISI-SSSSTFLRSTNIQSH------RRRHGSNLNTT-HILPFPTTTNPKYS-----T 60

Query: 61  TCRDN--MAIY-SGEADGLPPFPPLPDTPLPLWIAGVLLSAILSIWKTQNYWAPFLMLKE 120
           +CR N  MAIY SG+A+GLPP PPLPDTP PLW+AGVL+SA+LSIW+T++YW PFL LKE
Sbjct: 61  SCRHNIDMAIYSSGDANGLPPLPPLPDTPWPLWVAGVLMSAVLSIWQTKSYWGPFLTLKE 120

Query: 121 KVDQVVDVVEDVAEMVETAAETVDKVAEEIAEHSPEGSKLQQAALFVEKSAERIAKDADF 180
           KVD+VVDVVE+VAEMVETAAE VDKVAEEIAEH PEGSKLQ+AALFVE +AERIAKDAD 
Sbjct: 121 KVDKVVDVVEEVAEMVETAAERVDKVAEEIAEHLPEGSKLQKAALFVENAAERIAKDADL 175

Query: 181 AGDIIDKI 185
           AGDIIDK+
Sbjct: 181 AGDIIDKV 175

BLAST of HG10004466 vs. NCBI nr
Match: XP_038886506.1 (uncharacterized protein LOC120076681 isoform X2 [Benincasa hispida] >XP_038886571.1 uncharacterized protein LOC120076742 isoform X1 [Benincasa hispida])

HSP 1 Score: 231.5 bits (589), Expect = 7.3e-57
Identity = 140/196 (71.43%), Positives = 156/196 (79.59%), Query Frame = 0

Query: 1   MWIFSISSSSSTYLRST-IQSHRHRHHLRHRHGSNLNTTPRRLPFPTTKNAENSTFHIIT 60
           M I SI SSSST+LRST IQSH      R RHGSNLNTT   LPFPTT N + S     T
Sbjct: 1   MSIISI-SSSSTFLRSTNIQSH------RRRHGSNLNTT-HILPFPTTTNPKYS-----T 60

Query: 61  TCRDN--MAIY-SGEADGLPPFPPLPDTPLPLWIAGVLLSAILSIWKTQNYWAPFLMLKE 120
           +CR N  MAIY SG+A+GLPP PPLPDTP PLW+AGVL+SA+LSIW+T++YW PFL LKE
Sbjct: 61  SCRHNIDMAIYSSGDANGLPPLPPLPDTPWPLWVAGVLMSAVLSIWQTKSYWGPFLTLKE 120

Query: 121 KVDQVVDVVEDVAEMVETAAETVDKVAEEIAEHSPEGSKLQQAALFVEKSAERIAKDADF 180
           KVD+VVDVVE+VAEMVETAAE VDKVAEEIAEH PEGSKLQ+AALFVE +AERIAKDAD 
Sbjct: 121 KVDKVVDVVEEVAEMVETAAERVDKVAEEIAEHLPEGSKLQKAALFVENAAERIAKDADL 180

Query: 181 AGDIIDKIEEAEDELS 193
           AGDIIDK +     LS
Sbjct: 181 AGDIIDKCQGLRGSLS 183

BLAST of HG10004466 vs. NCBI nr
Match: XP_011649672.1 (uncharacterized protein LOC105434653 isoform X2 [Cucumis sativus] >KAE8652133.1 hypothetical protein Csa_022545 [Cucumis sativus])

HSP 1 Score: 199.9 bits (507), Expect = 2.3e-47
Identity = 123/218 (56.42%), Positives = 151/218 (69.27%), Query Frame = 0

Query: 1   MWIFSISSSSSTYLRSTIQSHRHRHHLRHRHGSNLNTT----PRRLP-FPTTKNAENSTF 60
           M I SISSSS+ +   TI+SH       HRHGSNLN T       LP  PTTK    S  
Sbjct: 1   MSILSISSSSTYFGLPTIRSHH-----PHRHGSNLNLTHLHNHNSLPLIPTTKTNPKS-- 60

Query: 61  HIITTCRDNMAIYSGEA-DGLPPFPPLPDTPLPLWIAGVLLSAILSIWKTQNYWAPFLML 120
              +TC  NMAIYS ++  GLP  PP  DTP PLWIAGV+LSA+LSIWKT+NYW PFL L
Sbjct: 61  ---STCTRNMAIYSADSIIGLPSLPPFHDTPSPLWIAGVVLSAVLSIWKTKNYWKPFLTL 120

Query: 121 KEKVDQVVDVVEDVAEMVETAAETVDKVAEEIAEHSPEGSKLQQAALFVEKSAERIAKDA 180
           KEK+D+VV+  EDVAEM  +AA+ VDK AE+IA + P+GS+LQ+ A  V+  AE+I KDA
Sbjct: 121 KEKMDKVVEKAEDVAEMAGSAADKVDKAAEDIAAYLPDGSELQKTAESVDDVAEKIGKDA 180

Query: 181 DFAGDIIDKIEEAEDELSTFIKQNGESKEEDDAKQEND 213
           D AGD+ +K + AEDELS+ +  +GES EEDD KQ+ND
Sbjct: 181 DMAGDLFEKFKTAEDELSSLVDHSGESNEEDDLKQKND 208

BLAST of HG10004466 vs. NCBI nr
Match: XP_011649671.1 (uncharacterized protein LOC105434653 isoform X1 [Cucumis sativus])

HSP 1 Score: 194.9 bits (494), Expect = 7.5e-46
Identity = 123/220 (55.91%), Positives = 151/220 (68.64%), Query Frame = 0

Query: 1   MWIFSISSSSSTYLRSTIQSHRHRHHLRHRHGSNLNTT----PRRLP-FPTTKNAENSTF 60
           M I SISSSS+ +   TI+SH       HRHGSNLN T       LP  PTTK    S  
Sbjct: 1   MSILSISSSSTYFGLPTIRSHH-----PHRHGSNLNLTHLHNHNSLPLIPTTKTNPKS-- 60

Query: 61  HIITTCRDNMAIYSGEA-DGLPPFPPLPDTP--LPLWIAGVLLSAILSIWKTQNYWAPFL 120
              +TC  NMAIYS ++  GLP  PP  DTP   PLWIAGV+LSA+LSIWKT+NYW PFL
Sbjct: 61  ---STCTRNMAIYSADSIIGLPSLPPFHDTPSFRPLWIAGVVLSAVLSIWKTKNYWKPFL 120

Query: 121 MLKEKVDQVVDVVEDVAEMVETAAETVDKVAEEIAEHSPEGSKLQQAALFVEKSAERIAK 180
            LKEK+D+VV+  EDVAEM  +AA+ VDK AE+IA + P+GS+LQ+ A  V+  AE+I K
Sbjct: 121 TLKEKMDKVVEKAEDVAEMAGSAADKVDKAAEDIAAYLPDGSELQKTAESVDDVAEKIGK 180

Query: 181 DADFAGDIIDKIEEAEDELSTFIKQNGESKEEDDAKQEND 213
           DAD AGD+ +K + AEDELS+ +  +GES EEDD KQ+ND
Sbjct: 181 DADMAGDLFEKFKTAEDELSSLVDHSGESNEEDDLKQKND 210

BLAST of HG10004466 vs. ExPASy TrEMBL
Match: A0A5A7VBX0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G004630 PE=4 SV=1)

HSP 1 Score: 176.4 bits (446), Expect = 1.3e-40
Identity = 104/190 (54.74%), Positives = 130/190 (68.42%), Query Frame = 0

Query: 1   MWIFSISSSSSTYLRSTIQSHRHRHHLRHRHGSNLNT-----TPRRLPFPTTK-NAENST 60
           MWI SISSSS+ +   TI+SH       HR GS+LN        + L  PTTK N +NS 
Sbjct: 1   MWILSISSSSTYFGSPTIRSHH-----PHRRGSDLNLISHLHNHKSLSLPTTKPNLKNS- 60

Query: 61  FHIITTCRDNMAIYSGEADGLPPFPPLPDTPLPLWIAGVLLSAILSIWKTQNYWAPFLML 120
                TC  +MAI+S ++ GLPPF P  +TP  LW+AGV+LSAILSIWKT+NYW PFLML
Sbjct: 61  -----TCSRSMAIFSADSMGLPPFLPFSNTPSQLWMAGVVLSAILSIWKTKNYWRPFLML 120

Query: 121 KEKVDQVVDVVEDVAEMVETAAETVDKVAEEIAEHSPEGSKLQQAALFVEKSAERIAKDA 180
           KEKVD VV+  E+ AEM ET A+ VDK AE+IAEH P+GS LQ+ A  V+ +A++I KDA
Sbjct: 121 KEKVDTVVEKAEEAAEMAETTADGVDKAAEKIAEHLPDGSDLQKTAQSVDDAAKKIGKDA 179

Query: 181 DFAGDIIDKI 185
           D AGD  +K+
Sbjct: 181 DLAGDFCEKV 179

BLAST of HG10004466 vs. ExPASy TrEMBL
Match: A0A1S3BDR2 (uncharacterized protein LOC103488552 OS=Cucumis melo OX=3656 GN=LOC103488552 PE=4 SV=1)

HSP 1 Score: 176.4 bits (446), Expect = 1.3e-40
Identity = 104/190 (54.74%), Positives = 130/190 (68.42%), Query Frame = 0

Query: 1   MWIFSISSSSSTYLRSTIQSHRHRHHLRHRHGSNLNT-----TPRRLPFPTTK-NAENST 60
           MWI SISSSS+ +   TI+SH       HR GS+LN        + L  PTTK N +NS 
Sbjct: 1   MWILSISSSSTYFGSPTIRSHH-----PHRRGSDLNLISHLHNHKSLSLPTTKPNLKNS- 60

Query: 61  FHIITTCRDNMAIYSGEADGLPPFPPLPDTPLPLWIAGVLLSAILSIWKTQNYWAPFLML 120
                TC  +MAI+S ++ GLPPF P  +TP  LW+AGV+LSAILSIWKT+NYW PFLML
Sbjct: 61  -----TCSRSMAIFSADSMGLPPFLPFSNTPSQLWMAGVVLSAILSIWKTKNYWRPFLML 120

Query: 121 KEKVDQVVDVVEDVAEMVETAAETVDKVAEEIAEHSPEGSKLQQAALFVEKSAERIAKDA 180
           KEKVD VV+  E+ AEM ET A+ VDK AE+IAEH P+GS LQ+ A  V+ +A++I KDA
Sbjct: 121 KEKVDTVVEKAEEAAEMAETTADGVDKAAEKIAEHLPDGSDLQKTAQSVDDAAKKIGKDA 179

Query: 181 DFAGDIIDKI 185
           D AGD  +K+
Sbjct: 181 DLAGDFCEKV 179

BLAST of HG10004466 vs. ExPASy TrEMBL
Match: A0A6J1K717 (uncharacterized protein LOC111492223 OS=Cucurbita maxima OX=3661 GN=LOC111492223 PE=4 SV=1)

HSP 1 Score: 173.3 bits (438), Expect = 1.1e-39
Identity = 116/217 (53.46%), Positives = 145/217 (66.82%), Query Frame = 0

Query: 1   MWIFSISSSSSTYLRSTIQSHRHRHHLRHRHGSNLNTT-----PRRLPFPTTKNAENSTF 60
           M I S+S SS++ LRST+QS         RHGSNLN +        LPFP T   +N   
Sbjct: 1   MSIVSVSYSSAS-LRSTVQS--------RRHGSNLNLSHLLHKANSLPFPRT-IPKNRVT 60

Query: 61  HIITTCRDNMAIYSGEADG------LP-PFPPLPDTPLPLWIAGVLLSAILSIWKTQNYW 120
           H ITT RD MA+Y G   G      LP P PP P TP PLW+AG +LSAILSIWKT  YW
Sbjct: 61  H-ITTYRD-MAVYGGGGGGGGLGSPLPQPPPPPPLTPWPLWVAGAVLSAILSIWKTTKYW 120

Query: 121 APFLMLKEKVDQVVDVVEDVAEMVETAAETVDKVAEEIAEHSPEGSKLQQAALFVEKSAE 180
            PFLMLK++V++VV V EDV +MVETAA+ VDKV+EEI +H P+ S LQ+ A+ VE +AE
Sbjct: 121 GPFLMLKQRVEKVVHVAEDVVDMVETAAKEVDKVSEEIVDHLPKDSILQKTAVLVENTAE 180

Query: 181 RIAKDADFAGDIIDKIEEAEDELSTFIKQNGESKEED 206
            +AKDA+ A +II K+E+ +D LS+ IK+ G SKE D
Sbjct: 181 TVAKDANLAAEIIGKVEKFDDGLSSLIKKKGGSKEGD 205

BLAST of HG10004466 vs. ExPASy TrEMBL
Match: A0A6J1HCK6 (uncharacterized protein LOC111462754 OS=Cucurbita moschata OX=3662 GN=LOC111462754 PE=4 SV=1)

HSP 1 Score: 171.4 bits (433), Expect = 4.3e-39
Identity = 114/214 (53.27%), Positives = 141/214 (65.89%), Query Frame = 0

Query: 1   MWIFSISSSSSTYLRSTIQSHRHRHHLRHRHGSNLNTT-----PRRLPFPTTKNAENSTF 60
           M I S+SS S+  LRS + S         RHGSNLN +      + LP P T     +T 
Sbjct: 1   MSIISVSSRSAN-LRSAVDS--------RRHGSNLNLSHLLHRAKSLPLPRTIPKNRATH 60

Query: 61  HIITTCRDNMAIY---SGEADGLP-PFPPLPDTPLPLWIAGVLLSAILSIWKTQNYWAPF 120
             ITT RD MA+Y    G    LP P PP P TP PLW+AG +LSAILSIWKT  YW PF
Sbjct: 61  --ITTYRD-MAVYGVGGGLGSPLPLPPPPPPLTPWPLWVAGAVLSAILSIWKTTKYWKPF 120

Query: 121 LMLKEKVDQVVDVVEDVAEMVETAAETVDKVAEEIAEHSPEGSKLQQAALFVEKSAERIA 180
           LMLK++V++VV V EDV +MVETAAE VDKVAEEIA+H P+ S LQ+ A+ VE +AE +A
Sbjct: 121 LMLKQRVEKVVHVAEDVVDMVETAAEEVDKVAEEIADHLPKDSILQKTAVLVENTAETVA 180

Query: 181 KDADFAGDIIDKIEEAEDELSTFIKQNGESKEED 206
           KDA+ A DII K+E+ +D L++ IK+ G SKE D
Sbjct: 181 KDANLAADIIGKVEKFDDGLNSLIKKKGGSKEGD 202

BLAST of HG10004466 vs. ExPASy TrEMBL
Match: A0A6J1BVN6 (uncharacterized protein LOC111005138 OS=Momordica charantia OX=3673 GN=LOC111005138 PE=4 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 2.9e-35
Identity = 98/192 (51.04%), Positives = 132/192 (68.75%), Query Frame = 0

Query: 27  LRHRHGSNLNTTPRRLPFPTT----KNAENSTFHIITTCRDNMAIYSGEADGLPPFPPLP 86
           LRH    NL+   ++LPF +T    +   +    II+  R  MAIYSGE  G P  P LP
Sbjct: 14  LRHGSKINLSDHDQKLPFASTNIDLRKRSSCHLTIISQYRHRMAIYSGEGLGFPSLPDLP 73

Query: 87  -DTPLPLWIAGVLLSAILSIWKTQNYWAPFLMLKEKVDQVVDVVEDVAEMVETAAETVDK 146
              P PLW+ G+L+SAIL  +     W PFL+L + VD+VVD VE+VAEMVETAAE V+K
Sbjct: 74  RQVPWPLWMLGMLISAILP-FGGNKLW-PFLILNQNVDKVVDAVEEVAEMVETAAEGVEK 133

Query: 147 VAEEIAEHSPEGSKLQQAALFVEKSAERIAKDADFAGDIIDKIEEAEDELSTFIKQNGES 206
           VAEE+AEH P+G +LQ+AALF+E +A+ ++KDA  A  I+ KIEE ED++ +  K+NGE+
Sbjct: 134 VAEEVAEHLPQGGQLQKAALFLENAAKTLSKDAHVAEQIVHKIEEVEDKVISSFKKNGET 193

Query: 207 KEEDDAKQENDQ 214
           KEE+DA+Q+ DQ
Sbjct: 194 KEEEDAQQKKDQ 203

BLAST of HG10004466 vs. TAIR 10
Match: AT2G14095.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: mitochondrion; EXPRESSED IN: 7 plant structures; EXPRESSED DURING: 4 anthesis, C globular stage, petal differentiation and expansion stage; Has 106 Blast hits to 103 proteins in 21 species: Archae - 0; Bacteria - 5; Metazoa - 0; Fungi - 4; Plants - 87; Viruses - 0; Other Eukaryotes - 10 (source: NCBI BLink). )

HSP 1 Score: 57.8 bits (138), Expect = 1.3e-08
Identity = 48/139 (34.53%), Positives = 75/139 (53.96%), Query Frame = 0

Query: 89  WIAGVLLSAILSIWKTQNYWAPFLMLKEKVDQVVDVVEDVAEMVETAAETVDKVAEEIAE 148
           W+ G  +S +LS W  +        ++ + + VV+ VE VAEMVE  A   D++AEE+AE
Sbjct: 96  WVIGSAISLVLSFWNNERL-QKLKRIEGEAELVVEGVEAVAEMVEKVATATDEMAEEMAE 155

Query: 149 HSPEGSKLQQAALFVEKSAERIAKDADFAGDIIDKIEEAE---DELSTFIKQNGESKEED 208
             PE +KL+Q AL +E  +E  A +A    D + K+E+     D+L   IK   + K  +
Sbjct: 156 KLPEKNKLKQVALVLEHISEVAAHEAHLTQDFLHKVEKVTQDIDDLEAMIKPLIDKKVAN 215

Query: 209 -DAKQENDQTEARVEEQSK 224
            + KQ+  + EA  E  S+
Sbjct: 216 AETKQQTKEEEANHESPSR 233

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_038886505.1 | 2.0e-70 | 73.16 | uncharacterized protein LOC120076681 isoform X1 [Benincasa hispida]
XP_038886507.1 | 3.3e-57 | 73.40 | uncharacterized protein LOC120076681 isoform X3 [Benincasa hispida] >XP_03888657...
XP_038886506.1 | 7.3e-57 | 71.43 | uncharacterized protein LOC120076681 isoform X2 [Benincasa hispida] >XP_03888657...
XP_011649672.1 | 2.3e-47 | 56.42 | uncharacterized protein LOC105434653 isoform X2 [Cucumis sativus] >KAE8652133.1 ...
XP_011649671.1 | 7.5e-46 | 55.91 | uncharacterized protein LOC105434653 isoform X1 [Cucumis sativus]
A0A5A7VBX0 | 1.3e-40 | 54.74 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A1S3BDR2 | 1.3e-40 | 54.74 | uncharacterized protein LOC103488552 OS=Cucumis melo OX=3656 GN=LOC103488552 PE=...
A0A6J1K717 | 1.1e-39 | 53.46 | uncharacterized protein LOC111492223 OS=Cucurbita maxima OX=3661 GN=LOC111492223...
A0A6J1HCK6 | 4.3e-39 | 53.27 | uncharacterized protein LOC111462754 OS=Cucurbita moschata OX=3662 GN=LOC1114627...
A0A6J1BVN6 | 2.9e-35 | 51.04 | uncharacterized protein LOC111005138 OS=Momordica charantia OX=3673 GN=LOC111005...
AT2G14095.1 | 1.3e-08 | 34.53 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 184..224
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 195..227
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 25..46
None | No IPR available | PANTHER | PTHR33735:SF14 | PHAGE CAPSID SCAFFOLDING PROTEIN (GPO) SERINE PEPTIDASE | coord: 7..209
None | No IPR available | PANTHER | PTHR33735 | EXPRESSED PROTEIN | coord: 7..209
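
The Alignment coordinates can be converted into region lengths and used to slice the protein sequence. This is a sketch under one assumption: that the `coord: start..end` values are 1-based and inclusive positions on the protein, which the page does not state. The helper names are hypothetical.

```python
import re

def parse_coord(text):
    """Extract (start, end) from a string such as 'coord: 184..224'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(match.group(1)), int(match.group(2))

def region_length(start, end):
    """Number of residues covered, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

def slice_region(sequence, start, end):
    """Return the residues start..end (1-based, inclusive) as a Python slice."""
    return sequence[start - 1:end]

# Coordinates as listed in the InterPro table above.
annotations = {
    "COILS Coil": "coord: 184..224",
    "MOBIDB_LITE disorder (C-terminal)": "coord: 195..227",
    "MOBIDB_LITE disorder (N-terminal)": "coord: 25..46",
    "PANTHER PTHR33735": "coord: 7..209",
}
for name, coord in annotations.items():
    start, end = parse_coord(coord)
    print(f"{name}: {region_length(start, end)} residues")
```

Under that assumption the predicted coiled-coil region covers 41 residues and the PANTHER match covers 203. Calling `slice_region(protein, 184, 224)` on the protein sequence listed under Sequences would return the coiled-coil residues themselves.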

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10004466.1 | HG10004466.1 | mRNA