HG10023085 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023085
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Location: Chr05: 31082659 .. 31084987 (+)
RNA-Seq Expression: HG10023085
Synteny: HG10023085
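
As a small illustration of the Location field, the following sketch parses a coordinate string of the form shown in the overview and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page. The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a string like 'Chr05: 31082659 .. 31084987 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr05: 31082659 .. 31084987 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, strand, span)
```

Under the inclusive-coordinate assumption the span works out to 2329 bp.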
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCAAATCCTCAAATTCCAAATCCAAGCCTCACTTTTTCCTTTCCTGCTTTGGATTTTCCGGCAATCTTCGCCGCTTTAAGCCCCTCAAATCCGCCGCCGGTCACCGAAAACGGCCGTTTTCTTGGTTGAGGTTTCACTCAAAGCCGCCTCCGCCGGTTCACTCCTCTTTTCCGGCCCAGACGATCCGGTCCATTCCTGATTCCGACCGGCTCTCCTCGATTTCTATTGCTCCGACCGCCACTTGTGAGTCGTCGAATGAAGATTTCGCCGTCGCGGTGTCGGTGGTGACAAATCGAACCGGCGAGGTTATGATCATCGAGCCACAAGAGGTAAGTGAATTTTGCTTCAGAACTTTTTATTCTTGTCTTAAATTAATATGGAAACATGTAATTAAGAATAAAATTTTTCCTTTTTTCTTTTTGAGAAAAGAATAAAATACATATATGTATATATCAGAAAAAAAATTTAGCGTTGGAAAATTTCCTTTCCCAATTATTAAATTGTGGAATAAAATGAGTCCAGTTGCAAGAAGGAAATAATAACAATAAATGTAATTTACTTTGTATTAATTTTCACTAATGGAATAATTCATTATCTTGTTAAAGATATAAAAAATTAATGATTATATTTATTGAATAAGCATGAGAGTAATCAAACTATTGAAAAATTATTCGATGGGTGAACAAAATGTATTTATTTGGGGTGGATGGTTCGAATTTCTAGTATTTTGATATAGTGTATTTGATATTGGTTGAAATATTGCTTTGACGGGATGAAACATCGTTTATAGAATTAAACATATGGTATACGTTGATTTTTTTTTTTCAGTAAAATTGAGCATATTCTATTAAAAGCACTTGATTTGATGCAACCTAATTTCATGCTCTTTCTTTTAGATTAAAAAAAAAAAATTAATCATTTTTAAAAAGAAAATAAATTGACATATAACAATTTTTGATGGGCTGCACCCATGTTCTCTCTTCTATTAATATCTTTAAAAGAACTTAAATATGGCATGTCAACTAATGTGTCTAATTACATACTACATAATTTGTGCTACATCAAATTATTCGAATATGTCAGGTAACACTTTGAATTGAGTGGTGATTAAAGACCCAACTTGAGAAAAAATTATTTTTATTTAAACTTTTTTTATATATAAAAATAATTTAAAATATATACCTTCCAACTTTTTCTAAAATGAGTTATTTTTTAAATTAAATATTTAAAAATATATTTCAGACACCCTAAATCGGCAAAAGAAACGAATATCCATCTGAAATTGAATTTATTCAAATTATAAAATCTAATTATATTTATAAAATATGTTAAATTCCAAATTAAGAAATCCTAAACTTTAAAAGGAAAAACACATCTAATAATCTTTTTTAATTTTTTTATTTTGTTTCGGATTTTGAACTTGAAAAATATAATAAGTTACTAAATAACCTTCTTTTAAATTAACGTATTTATTGAATATAAAACTGAACTTTGTGTTTAGAATGACATTAAACTTTTAAATTTATTAGACAGAAGACAAACTTTTAATTTTTTCTCTAATAAATTTAGGAACTTTAAAAAGTGTTAGATACAAGTCAACAAATATTAGACACAAATTTAAAAAGTGTTAGATACAAGTCAACAAATATTAGACACAAAATTTAAAGTAATGAAAATTCATAGAATAAACATGTAACCTTTGAAAAAATCTATATTAATTATGTTATTCCTAAATTTCGTTATTTTTTTTCATTTTAGGTAAAAAATGACATCATTGCAGAAAAAATCATCAAAGAAAGTTGTGAACAGTCGCCAAAGAAGCCCACCGATCAGAGTCAATCAAGGTTCTCATTAACAAAAAAACTCGAGTCATTTCGATCGGTCCGGTTCACTCAACCCGCCTCACCAACGGCGAATAAGAATCTGAGATCCATCAACCTGCAGACACCCGTCATATCACACTCTTTATCTTTCCCTCCACCGAACCCGGCCCGAA
TAAACCGGGTCCAGGAACCGCCGGGGAGTAATCGAGCCGGGTTAAAGTTGAGGAACGGTGAAACGAGCCAACGGTACCGATCAGCAGCGGCGATGTGGGTTTTGATGATGACGTTGGCGATGATGGTACTGTGGGGAAGAATTTGTGCGATTTTGTGTACGGCAACTTGGATTTTTGTTGTAACGAGTTTAAGGTCAATTGTTGAAGAATATGATACAGTTGATTTTGTTGAATCCGATTCATATTCTGAAGGGTTTAAGAAGAAATTAGTGGTTTTAAAAGGGTTTCTTTGTAGAAATCACACAGAGAATCTGTTGAAGGAATTATAG

mRNA sequence

ATGGCCAAATCCTCAAATTCCAAATCCAAGCCTCACTTTTTCCTTTCCTGCTTTGGATTTTCCGGCAATCTTCGCCGCTTTAAGCCCCTCAAATCCGCCGCCGGTCACCGAAAACGGCCGTTTTCTTGGTTGAGGTTTCACTCAAAGCCGCCTCCGCCGGTTCACTCCTCTTTTCCGGCCCAGACGATCCGGTCCATTCCTGATTCCGACCGGCTCTCCTCGATTTCTATTGCTCCGACCGCCACTTGTGAGTCGTCGAATGAAGATTTCGCCGTCGCGGTGTCGGTGGTGACAAATCGAACCGGCGAGGTTATGATCATCGAGCCACAAGAGGTAAAAAATGACATCATTGCAGAAAAAATCATCAAAGAAAGTTGTGAACAGTCGCCAAAGAAGCCCACCGATCAGAGTCAATCAAGGTTCTCATTAACAAAAAAACTCGAGTCATTTCGATCGGTCCGGTTCACTCAACCCGCCTCACCAACGGCGAATAAGAATCTGAGATCCATCAACCTGCAGACACCCGTCATATCACACTCTTTATCTTTCCCTCCACCGAACCCGGCCCGAATAAACCGGGTCCAGGAACCGCCGGGGAGTAATCGAGCCGGGTTAAAGTTGAGGAACGGTGAAACGAGCCAACGGTACCGATCAGCAGCGGCGATGTGGGTTTTGATGATGACGTTGGCGATGATGGTACTGTGGGGAAGAATTTGTGCGATTTTGTGTACGGCAACTTGGATTTTTGTTGTAACGAGTTTAAGGTCAATTGTTGAAGAATATGATACAGTTGATTTTGTTGAATCCGATTCATATTCTGAAGGGTTTAAGAAGAAATTAGTGGTTTTAAAAGGGTTTCTTTGTAGAAATCACACAGAGAATCTGTTGAAGGAATTATAG

Coding sequence (CDS)

ATGGCCAAATCCTCAAATTCCAAATCCAAGCCTCACTTTTTCCTTTCCTGCTTTGGATTTTCCGGCAATCTTCGCCGCTTTAAGCCCCTCAAATCCGCCGCCGGTCACCGAAAACGGCCGTTTTCTTGGTTGAGGTTTCACTCAAAGCCGCCTCCGCCGGTTCACTCCTCTTTTCCGGCCCAGACGATCCGGTCCATTCCTGATTCCGACCGGCTCTCCTCGATTTCTATTGCTCCGACCGCCACTTGTGAGTCGTCGAATGAAGATTTCGCCGTCGCGGTGTCGGTGGTGACAAATCGAACCGGCGAGGTTATGATCATCGAGCCACAAGAGGTAAAAAATGACATCATTGCAGAAAAAATCATCAAAGAAAGTTGTGAACAGTCGCCAAAGAAGCCCACCGATCAGAGTCAATCAAGGTTCTCATTAACAAAAAAACTCGAGTCATTTCGATCGGTCCGGTTCACTCAACCCGCCTCACCAACGGCGAATAAGAATCTGAGATCCATCAACCTGCAGACACCCGTCATATCACACTCTTTATCTTTCCCTCCACCGAACCCGGCCCGAATAAACCGGGTCCAGGAACCGCCGGGGAGTAATCGAGCCGGGTTAAAGTTGAGGAACGGTGAAACGAGCCAACGGTACCGATCAGCAGCGGCGATGTGGGTTTTGATGATGACGTTGGCGATGATGGTACTGTGGGGAAGAATTTGTGCGATTTTGTGTACGGCAACTTGGATTTTTGTTGTAACGAGTTTAAGGTCAATTGTTGAAGAATATGATACAGTTGATTTTGTTGAATCCGATTCATATTCTGAAGGGTTTAAGAAGAAATTAGTGGTTTTAAAAGGGTTTCTTTGTAGAAATCACACAGAGAATCTGTTGAAGGAATTATAG

Protein sequence

MAKSSNSKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPPVHSSFPAQTIRSIPDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAEKIIKESCEQSPKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVISHSLSFPPPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGRICAILCTATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNHTENLLKEL
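
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard nuclear genetic code and reading frame 0; the helper name `translate` is hypothetical. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGGCCAAATCCTCAAATTCC"))
# Last three codons of the CDS, ending in the TAG stop codon.
print(translate("GAATTATAG"))
```

The two fragments translate to `MAKSSNS` and `EL`, which agree with the start and end of the protein sequence given above. Passing the full CDS string to `translate` would be the natural next step.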
Homology
BLAST of HG10023085 vs. NCBI nr
Match: XP_038897219.1 (uncharacterized protein LOC120085350 isoform X1 [Benincasa hispida])

HSP 1 Score: 481.5 bits (1238), Expect = 5.3e-132
Identity = 259/301 (86.05%), Positives = 271/301 (90.03%), Query Frame = 0

Query: 1   MAKSSNSKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPPVHSSFPA 60
           MAKSSNSKSKPHFFLSCFGFSG LRRFKPLKS+AG RKRPFSW+ FHSKPPPPVHSS P 
Sbjct: 1   MAKSSNSKSKPHFFLSCFGFSGKLRRFKPLKSSAGQRKRPFSWMSFHSKPPPPVHSSLPV 60

Query: 61  QTIRSIPDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAEK 120
           Q  RSIPDSDRLSSISIAPTAT +SSNEDFAVAVSVVTNRTG+ +II  QEVKNDIIAEK
Sbjct: 61  QINRSIPDSDRLSSISIAPTATSKSSNEDFAVAVSVVTNRTGDELIIGQQEVKNDIIAEK 120

Query: 121 IIKESCEQ--SPKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVIS 180
           IIKESCEQ  S KKP DQSQSRFSLTKKLESFRSVRFTQPASP   KNL+S NL+TP IS
Sbjct: 121 IIKESCEQSNSRKKPADQSQSRFSLTKKLESFRSVRFTQPASP--KKNLKSTNLKTPTIS 180

Query: 181 HSLSFPPPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGRI 240
           HSLSFPPPNPAR NR  E PGS R GLK +NGETSQRYRSAAA+ VLMMTLAMMVLWGRI
Sbjct: 181 HSLSFPPPNPARRNRNNEWPGSKRDGLKSKNGETSQRYRSAAAVSVLMMTLAMMVLWGRI 240

Query: 241 CAILCTATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNHTENLLKE 300
           CAILCTATWIF+VTSLR+IVEE DT+DFVESDSYSEGFKKKLVVLKGFLCRNH ENLLKE
Sbjct: 241 CAILCTATWIFIVTSLRTIVEECDTIDFVESDSYSEGFKKKLVVLKGFLCRNHRENLLKE 299

BLAST of HG10023085 vs. NCBI nr
Match: XP_008461362.1 (PREDICTED: uncharacterized protein At5g23160 [Cucumis melo])

HSP 1 Score: 409.1 bits (1050), Expect = 3.4e-110
Identity = 225/293 (76.79%), Positives = 242/293 (82.59%), Query Frame = 0

Query: 1   MAKSSNSKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPPVHSSFPA 60
           MAKSSNSKSKPHFFLSCFGFS  LRRFKP KS AGHRKRPFSW R +SKPP PVHSSFP+
Sbjct: 1   MAKSSNSKSKPHFFLSCFGFSDKLRRFKPPKSPAGHRKRPFSWFR-NSKPPTPVHSSFPS 60

Query: 61  QTIRSIPDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAEK 120
            T  S+ +SDRLSS+SIAPTAT +SSNED AVAV V T RTG+ +II P EVKNDI+AEK
Sbjct: 61  HTNPSVTNSDRLSSVSIAPTATSKSSNEDLAVAVPVATKRTGQEVIIGPHEVKNDIVAEK 120

Query: 121 IIKESCEQS--PKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVIS 180
            I ESCE S  P K  DQSQSRFSLTKKLESFR VRFTQ ASP   KN +S NLQ P IS
Sbjct: 121 TIHESCEHSNLPNKFIDQSQSRFSLTKKLESFRLVRFTQTASP--KKNPKSTNLQRPTIS 180

Query: 181 HSLSFPPPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGRI 240
           HSLSFPPP PA +N+V EP  S + G    N ++SQRYRSA AM VLMMTLAMMVLWGRI
Sbjct: 181 HSLSFPPPKPAPLNKVSEPSRSKQFGSMSANRKSSQRYRSAVAMSVLMMTLAMMVLWGRI 240

Query: 241 CAILCTATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNH 292
           CAILCTATWIFVVTSLRSIVEEYD +DFVESDSYSEGFKKKLVVLKGF+CRNH
Sbjct: 241 CAILCTATWIFVVTSLRSIVEEYDRIDFVESDSYSEGFKKKLVVLKGFMCRNH 290

BLAST of HG10023085 vs. NCBI nr
Match: XP_011659425.1 (uncharacterized protein LOC105436182 isoform X1 [Cucumis sativus])

HSP 1 Score: 380.6 bits (976), Expect = 1.3e-101
Identity = 213/294 (72.45%), Positives = 236/294 (80.27%), Query Frame = 0

Query: 1   MAKSSNSKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPPVH-SSFP 60
           MAKSSNSKSKPHFFL CFG+S  L RFKPLKS AGHRKR FSW R +SKPP P+H SSFP
Sbjct: 1   MAKSSNSKSKPHFFLCCFGYSDKLPRFKPLKSPAGHRKRQFSWFR-NSKPPTPLHSSSFP 60

Query: 61  AQTIRSIPDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAE 120
           + T RS+ +SDRLSS+SIAPTAT  SSNED  +AV V TNRTGE +II P EVKND +A 
Sbjct: 61  SHTNRSVTNSDRLSSVSIAPTATTNSSNED--LAVPVATNRTGEEVIISPHEVKNDTVAG 120

Query: 121 KIIKESCE--QSPKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVI 180
           K I  S E   SP KP DQSQSRFSLTK+LESFRS+RF Q A P   KN +SIN+QTP I
Sbjct: 121 KTIHGSSEHSNSPNKPIDQSQSRFSLTKRLESFRSIRFNQTAPP--KKNTKSINVQTPTI 180

Query: 181 SHSLSFPPPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGR 240
           SHSLSFPPP P   NRV E   S R   K +N ++SQ+YRS AAM VLMMTLAMMV+WGR
Sbjct: 181 SHSLSFPPPKPTPSNRVSELRESKRVCSKSKNRKSSQQYRSVAAMSVLMMTLAMMVVWGR 240

Query: 241 ICAILCTATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNH 292
           ICAILCTATWIF+VTSLRSIVEEY+ +DFVESDSYSEGFKKKLVVLKGF+CRNH
Sbjct: 241 ICAILCTATWIFIVTSLRSIVEEYEGIDFVESDSYSEGFKKKLVVLKGFVCRNH 289

BLAST of HG10023085 vs. NCBI nr
Match: XP_023549176.1 (uncharacterized protein LOC111807611 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 358.2 bits (918), Expect = 6.8e-95
Identity = 203/303 (67.00%), Positives = 231/303 (76.24%), Query Frame = 0

Query: 1   MAKSSNSKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPP---VHSS 60
           MAKS NSKS PH FL CFGFSG  RRFKPLK AA HRK P SW+RFHSK PPP   VH +
Sbjct: 1   MAKSPNSKSNPHPFLCCFGFSGKHRRFKPLKPAAAHRKGPISWMRFHSKQPPPSPSVHFN 60

Query: 61  FPAQTIRSIPDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDII 120
                  S  DSDRLSS SIAP A   +SNEDF VAV V TNR+GE ++  P+ VKNDI+
Sbjct: 61  L------SNTDSDRLSSKSIAPAAAF-NSNEDFTVAVPVATNRSGEEIVTRPENVKNDIL 120

Query: 121 AEKIIKESCEQ--SPKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTP 180
            EKII+ESCEQ  SPKKP+ +S+SRFSLT+KLESFRS RF QPASPTA KNL+S NLQ+P
Sbjct: 121 PEKIIRESCEQWNSPKKPSARSESRFSLTRKLESFRSGRFAQPASPTAKKNLKSTNLQSP 180

Query: 181 VISHSLSFPPPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLW 240
            IS        NP RIN+V+EP  S R GLK  NGET QRY+S A M +LM+TL +MV+W
Sbjct: 181 TIS-------LNPPRINQVKEPLASMRIGLKPNNGETRQRYQSMAGMSILMITLVIMVVW 240

Query: 241 GRICAILCTATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNHTENL 299
           GR+CAILCTA WI +VTSLRSIVE+YDT+ F ESDSYSEGFKKKLVVLKGFLCRN T+N 
Sbjct: 241 GRLCAILCTAAWIVMVTSLRSIVEDYDTLQFFESDSYSEGFKKKLVVLKGFLCRNQTQNR 289

BLAST of HG10023085 vs. NCBI nr
Match: XP_022954035.1 (uncharacterized protein LOC111456417 [Cucurbita moschata])

HSP 1 Score: 349.0 bits (894), Expect = 4.1e-92
Identity = 197/300 (65.67%), Positives = 226/300 (75.33%), Query Frame = 0

Query: 1   MAKSSNSKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPPVHSSFPA 60
           MAKS NSKS PH FL CFGFSG  RRFKPLK AA HRK P SW+RFHSK PPP  S    
Sbjct: 1   MAKSPNSKSNPHPFLCCFGFSGKHRRFKPLKPAAAHRKGPISWMRFHSKQPPPSPSP-SV 60

Query: 61  QTIRSIPDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAEK 120
           Q   S  DSDRLSS SI+PTA   +SNEDF VAV V TNR G  ++  P+ VKNDI+ EK
Sbjct: 61  QFNLSNTDSDRLSSKSISPTAAF-NSNEDFTVAVPVATNRAGGEIVTRPENVKNDILPEK 120

Query: 121 IIKESCEQ--SPKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVIS 180
           II+ESCEQ  SPKKP+ +S+SRFSLT+KLESFRS RF QPASPT  KNL+S +L +P IS
Sbjct: 121 IIRESCEQWNSPKKPSARSESRFSLTRKLESFRSGRFAQPASPTTKKNLKSTSLHSPTIS 180

Query: 181 HSLSFPPPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGRI 240
                   +P RIN+V+EP  S R GLK  NGET QRY+S A M +LM+TL +MV WGR+
Sbjct: 181 -------LHPPRINQVKEPLASMRVGLKPNNGETRQRYQSMAGMSILMITLVIMVAWGRL 240

Query: 241 CAILCTATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNHTENLLKE 299
           CAILCTA WI +VTSLRSI+E+YDTV F ESDSYSEGFKKKLVVLKGF+CRN T+N  K+
Sbjct: 241 CAILCTAAWIVMVTSLRSIIEDYDTVQFFESDSYSEGFKKKLVVLKGFVCRNQTQNQSKK 291

BLAST of HG10023085 vs. ExPASy Swiss-Prot
Match: Q9FMY4 (Uncharacterized protein At5g23160 OS=Arabidopsis thaliana OX=3702 GN=At5g23160 PE=2 SV=1)

HSP 1 Score: 50.1 bits (118), Expect = 5.2e-05
Identity = 31/82 (37.80%), Positives = 44/82 (53.66%), Query Frame = 0

Query: 224 VLMMTLAMMVLWGRICAILCTATWIFVVTSLRSIVEEYDTVDFVES-------------- 283
           ++M+TL +M+ WGR+CAILCT+TW ++   L+        V+   S              
Sbjct: 187 IIMLTLMIMLTWGRLCAILCTSTWCYIFPRLKEAATAVAVVNRKRSGSGKGEEGSFQGDL 246

Query: 284 DSYSEGFKKKLVVLKGFLCRNH 292
           D  S  +KKK VVL+GFL R H
Sbjct: 247 DLNSVAYKKK-VVLEGFLVRQH 267

BLAST of HG10023085 vs. ExPASy TrEMBL
Match: A0A1S3CFR9 (uncharacterized protein At5g23160 OS=Cucumis melo OX=3656 GN=LOC103499966 PE=4 SV=1)

HSP 1 Score: 409.1 bits (1050), Expect = 1.6e-110
Identity = 225/293 (76.79%), Positives = 242/293 (82.59%), Query Frame = 0

Query: 1   MAKSSNSKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPPVHSSFPA 60
           MAKSSNSKSKPHFFLSCFGFS  LRRFKP KS AGHRKRPFSW R +SKPP PVHSSFP+
Sbjct: 1   MAKSSNSKSKPHFFLSCFGFSDKLRRFKPPKSPAGHRKRPFSWFR-NSKPPTPVHSSFPS 60

Query: 61  QTIRSIPDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAEK 120
            T  S+ +SDRLSS+SIAPTAT +SSNED AVAV V T RTG+ +II P EVKNDI+AEK
Sbjct: 61  HTNPSVTNSDRLSSVSIAPTATSKSSNEDLAVAVPVATKRTGQEVIIGPHEVKNDIVAEK 120

Query: 121 IIKESCEQS--PKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVIS 180
            I ESCE S  P K  DQSQSRFSLTKKLESFR VRFTQ ASP   KN +S NLQ P IS
Sbjct: 121 TIHESCEHSNLPNKFIDQSQSRFSLTKKLESFRLVRFTQTASP--KKNPKSTNLQRPTIS 180

Query: 181 HSLSFPPPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGRI 240
           HSLSFPPP PA +N+V EP  S + G    N ++SQRYRSA AM VLMMTLAMMVLWGRI
Sbjct: 181 HSLSFPPPKPAPLNKVSEPSRSKQFGSMSANRKSSQRYRSAVAMSVLMMTLAMMVLWGRI 240

Query: 241 CAILCTATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNH 292
           CAILCTATWIFVVTSLRSIVEEYD +DFVESDSYSEGFKKKLVVLKGF+CRNH
Sbjct: 241 CAILCTATWIFVVTSLRSIVEEYDRIDFVESDSYSEGFKKKLVVLKGFMCRNH 290

BLAST of HG10023085 vs. ExPASy TrEMBL
Match: A0A0A0K6H3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G420760 PE=4 SV=1)

HSP 1 Score: 380.6 bits (976), Expect = 6.2e-102
Identity = 213/294 (72.45%), Positives = 236/294 (80.27%), Query Frame = 0

Query: 1   MAKSSNSKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPPVH-SSFP 60
           MAKSSNSKSKPHFFL CFG+S  L RFKPLKS AGHRKR FSW R +SKPP P+H SSFP
Sbjct: 1   MAKSSNSKSKPHFFLCCFGYSDKLPRFKPLKSPAGHRKRQFSWFR-NSKPPTPLHSSSFP 60

Query: 61  AQTIRSIPDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAE 120
           + T RS+ +SDRLSS+SIAPTAT  SSNED  +AV V TNRTGE +II P EVKND +A 
Sbjct: 61  SHTNRSVTNSDRLSSVSIAPTATTNSSNED--LAVPVATNRTGEEVIISPHEVKNDTVAG 120

Query: 121 KIIKESCE--QSPKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVI 180
           K I  S E   SP KP DQSQSRFSLTK+LESFRS+RF Q A P   KN +SIN+QTP I
Sbjct: 121 KTIHGSSEHSNSPNKPIDQSQSRFSLTKRLESFRSIRFNQTAPP--KKNTKSINVQTPTI 180

Query: 181 SHSLSFPPPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGR 240
           SHSLSFPPP P   NRV E   S R   K +N ++SQ+YRS AAM VLMMTLAMMV+WGR
Sbjct: 181 SHSLSFPPPKPTPSNRVSELRESKRVCSKSKNRKSSQQYRSVAAMSVLMMTLAMMVVWGR 240

Query: 241 ICAILCTATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNH 292
           ICAILCTATWIF+VTSLRSIVEEY+ +DFVESDSYSEGFKKKLVVLKGF+CRNH
Sbjct: 241 ICAILCTATWIFIVTSLRSIVEEYEGIDFVESDSYSEGFKKKLVVLKGFVCRNH 289

BLAST of HG10023085 vs. ExPASy TrEMBL
Match: A0A6J1GRB9 (uncharacterized protein LOC111456417 OS=Cucurbita moschata OX=3662 GN=LOC111456417 PE=4 SV=1)

HSP 1 Score: 349.0 bits (894), Expect = 2.0e-92
Identity = 197/300 (65.67%), Positives = 226/300 (75.33%), Query Frame = 0

Query: 1   MAKSSNSKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPPVHSSFPA 60
           MAKS NSKS PH FL CFGFSG  RRFKPLK AA HRK P SW+RFHSK PPP  S    
Sbjct: 1   MAKSPNSKSNPHPFLCCFGFSGKHRRFKPLKPAAAHRKGPISWMRFHSKQPPPSPSP-SV 60

Query: 61  QTIRSIPDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAEK 120
           Q   S  DSDRLSS SI+PTA   +SNEDF VAV V TNR G  ++  P+ VKNDI+ EK
Sbjct: 61  QFNLSNTDSDRLSSKSISPTAAF-NSNEDFTVAVPVATNRAGGEIVTRPENVKNDILPEK 120

Query: 121 IIKESCEQ--SPKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVIS 180
           II+ESCEQ  SPKKP+ +S+SRFSLT+KLESFRS RF QPASPT  KNL+S +L +P IS
Sbjct: 121 IIRESCEQWNSPKKPSARSESRFSLTRKLESFRSGRFAQPASPTTKKNLKSTSLHSPTIS 180

Query: 181 HSLSFPPPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGRI 240
                   +P RIN+V+EP  S R GLK  NGET QRY+S A M +LM+TL +MV WGR+
Sbjct: 181 -------LHPPRINQVKEPLASMRVGLKPNNGETRQRYQSMAGMSILMITLVIMVAWGRL 240

Query: 241 CAILCTATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNHTENLLKE 299
           CAILCTA WI +VTSLRSI+E+YDTV F ESDSYSEGFKKKLVVLKGF+CRN T+N  K+
Sbjct: 241 CAILCTAAWIVMVTSLRSIIEDYDTVQFFESDSYSEGFKKKLVVLKGFVCRNQTQNQSKK 291

BLAST of HG10023085 vs. ExPASy TrEMBL
Match: A0A6J1JV00 (uncharacterized protein LOC111488090 OS=Cucurbita maxima OX=3661 GN=LOC111488090 PE=4 SV=1)

HSP 1 Score: 334.0 bits (855), Expect = 6.6e-88
Identity = 190/300 (63.33%), Positives = 221/300 (73.67%), Query Frame = 0

Query: 1   MAKSSNSKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPPVHSSFPA 60
           MAKS NSKS PH FL CFGFSG  RRFKPLK AA HRK P SW+RFHSK PPP  S    
Sbjct: 1   MAKSPNSKSNPHPFLCCFGFSGKHRRFKPLKPAAAHRKGPISWMRFHSKQPPPSPS---V 60

Query: 61  QTIRSIPDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAEK 120
           Q   S  DSDRLSS SI+PTA   +SNEDF VAV V TNR G  ++  P+ VKNDI+ EK
Sbjct: 61  QFNLSNTDSDRLSSKSISPTAAF-NSNEDFTVAVPVATNRAGGEIVTRPENVKNDILPEK 120

Query: 121 IIKESCEQ--SPKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVIS 180
           II+E+CEQ  SPKK + +S+SRFS T+KLESFRS RF QP SPT  KNL+S NLQ+P IS
Sbjct: 121 IIRENCEQLNSPKKLSARSESRFSFTRKLESFRSGRFAQPVSPTTKKNLKSTNLQSPTIS 180

Query: 181 HSLSFPPPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGRI 240
            +L        RIN+V+E   S +  LK  N E  QRYRS A M +LM+TL +M++WGR+
Sbjct: 181 LNL-------PRINQVKESMASMQVRLKPNNSEMKQRYRSMAGMSILMITLVIMIVWGRL 240

Query: 241 CAILCTATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNHTENLLKE 299
           CAILCTA WI +VTSLRSIVE+YDT+ F +SDSYSEGFKKKLVVLKGFLCRN T+N  K+
Sbjct: 241 CAILCTAAWIVMVTSLRSIVEDYDTLQFFKSDSYSEGFKKKLVVLKGFLCRNQTQNRSKK 289

BLAST of HG10023085 vs. ExPASy TrEMBL
Match: A0A6J1DHG8 (uncharacterized protein LOC111020508 OS=Momordica charantia OX=3673 GN=LOC111020508 PE=4 SV=1)

HSP 1 Score: 265.0 bits (676), Expect = 3.8e-67
Identity = 162/294 (55.10%), Positives = 202/294 (68.71%), Query Frame = 0

Query: 7   SKSKPHFFLSCFGFSGNLRRFKPLKSAAGHRKRPFSWLRFHSKPPPPVHSSFPAQTIRSI 66
           +KSKP   L CFGFSG LRR KPLK AA H KRPFSW+RFH+K  PPV S+  A   RS 
Sbjct: 2   AKSKPQ-LLCCFGFSGKLRRSKPLKPAAAHTKRPFSWMRFHAK--PPVRSALLAD--RSN 61

Query: 67  PDSDRLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAEKIIKESC 126
           PDSDRL+  S+A T   +  +ED A       N+ GE     P++V N+ + E+II E+ 
Sbjct: 62  PDSDRLAPGSLASTTVYKPPSEDLAAVEPPAMNQAGEG---RPEKVGNEDVVEEIILETR 121

Query: 127 EQS--PKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVISHSLSFP 186
             S  PKKP  +SQ+RFSLT++LESFRS RFT+P SPTA K+  + N   P IS SLSFP
Sbjct: 122 GHSDTPKKPVGESQTRFSLTRRLESFRSGRFTKPVSPTARKDPITTN-PVPAISKSLSFP 181

Query: 187 PPNPARINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGRICAILCT 246
             NP R  +V+ PP S R GL     E S++YRSAAA+ + ++TL++M+LWGR+CAIL T
Sbjct: 182 ALNPTRRVQVESPPTSMRVGLD--GDEMSKKYRSAAAISIFIVTLSIMILWGRMCAILST 241

Query: 247 ATWIFVVTSLRSIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNHTENLLKE 299
           A WI +V SL SIVEE D  DF+ SDSYSEG KKKLV+LKGFL R+  EN+LK+
Sbjct: 242 AAWILIVASLESIVEEDDENDFIGSDSYSEGLKKKLVILKGFLYRSQRENMLKK 284

BLAST of HG10023085 vs. TAIR 10
Match: AT5G23160.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G08240.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archaea - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).)

HSP 1 Score: 50.1 bits (118), Expect = 3.7e-06
Identity = 31/82 (37.80%), Positives = 44/82 (53.66%), Query Frame = 0

Query: 224 VLMMTLAMMVLWGRICAILCTATWIFVVTSLRSIVEEYDTVDFVES-------------- 283
           ++M+TL +M+ WGR+CAILCT+TW ++   L+        V+   S              
Sbjct: 187 IIMLTLMIMLTWGRLCAILCTSTWCYIFPRLKEAATAVAVVNRKRSGSGKGEEGSFQGDL 246

Query: 284 DSYSEGFKKKLVVLKGFLCRNH 292
           D  S  +KKK VVL+GFL R H
Sbjct: 247 DLNSVAYKKK-VVLEGFLVRQH 267

BLAST of HG10023085 vs. TAIR 10
Match: AT5G08240.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G23160.1); Has 69 Blast hits to 69 proteins in 10 species: Archaea - 0; Bacteria - 1; Metazoa - 0; Fungi - 0; Plants - 68; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).)

HSP 1 Score: 48.9 bits (115), Expect = 8.2e-06
Identity = 73/289 (25.26%), Positives = 119/289 (41.18%), Query Frame = 0

Query: 13  FFLSCFGFSGNLRRFKPL--KSAAGHRKRPFSWLRFHSKPPPPVHSSFPAQTIRSIPDSD 72
           +FL CFGFS  +   KP+  K   G +K+    ++    P   + S F  +     P   
Sbjct: 17  YFLGCFGFSRKIYSDKPMVTKGDGGEKKK----MKKKRIPRWFLCSKFRLKNSEIKPSPI 76

Query: 73  RLSSISIAPTATCESSNEDFAVAVSVVTNRTGEVMIIEPQEVKNDIIAEKII-KESCEQS 132
             +     PT+  E   +D    +SV+   T         + KN  + +K + +E+ E  
Sbjct: 77  EETE---KPTSRVEDETDDKQKPLSVIRRIT---------DRKNIPVDDKAMNQETKETK 136

Query: 133 PKKPTDQSQSRFSLTKKLESFRSVRFTQPASPTANKNLRSINLQTPVISHSLSFPPPNPA 192
           PK   D +  R    + L SF+    T+  S                IS     P   P 
Sbjct: 137 PKDLRDITPDRSKPIEPLGSFKEDTCTERISS---------------ISSRYGKPDLKPT 196

Query: 193 RINRVQEPPGSNRAGLKLRNGETSQRYRSAAAMWVLMMTLAMMVLWGRICAILCTATWIF 252
           R                 RNG   + +     + ++++TL +M++WGR+CAILCT+ W +
Sbjct: 197 R----------------SRNGSRVKPFDPVIGISIIILTLMIMLVWGRLCAILCTSAWCY 256

Query: 253 VVTSLR---SIVEEYDTVDFVESDSYSEGFKKKLVVLKGFLCRNHTENL 296
           V+  +R   ++ +          D  SE +K+K VVL GFL R +  +L
Sbjct: 257 VLPRVRDAAALAKRKRNGSASVPDLNSESYKRK-VVLDGFLGRQNRVSL 257

The following BLAST results are available for this feature:

NCBI nr
Match Name        E-value     Identity (%)   Description
XP_038897219.1    5.3e-132    86.05          uncharacterized protein LOC120085350 isoform X1 [Benincasa hispida]
XP_008461362.1    3.4e-110    76.79          PREDICTED: uncharacterized protein At5g23160 [Cucumis melo]
XP_011659425.1    1.3e-101    72.45          uncharacterized protein LOC105436182 isoform X1 [Cucumis sativus]
XP_023549176.1    6.8e-95     67.00          uncharacterized protein LOC111807611 [Cucurbita pepo subsp. pepo]
XP_022954035.1    4.1e-92     65.67          uncharacterized protein LOC111456417 [Cucurbita moschata]

ExPASy Swiss-Prot
Match Name        E-value     Identity (%)   Description
Q9FMY4            5.2e-05     37.80          Uncharacterized protein At5g23160 OS=Arabidopsis thaliana OX=3702 GN=At5g23160 P...

ExPASy TrEMBL
Match Name        E-value     Identity (%)   Description
A0A1S3CFR9        1.6e-110    76.79          uncharacterized protein At5g23160 OS=Cucumis melo OX=3656 GN=LOC103499966 PE=4 S...
A0A0A0K6H3        6.2e-102    72.45          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G420760 PE=4 SV=1
A0A6J1GRB9        2.0e-92     65.67          uncharacterized protein LOC111456417 OS=Cucurbita moschata OX=3662 GN=LOC1114564...
A0A6J1JV00        6.6e-88     63.33          uncharacterized protein LOC111488090 OS=Cucurbita maxima OX=3661 GN=LOC111488090...
A0A6J1DHG8        3.8e-67     55.10          uncharacterized protein LOC111020508 OS=Momordica charantia OX=3673 GN=LOC111020...

TAIR 10
Match Name        E-value     Identity (%)   Description
AT5G23160.1       3.7e-06     37.80          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G08240.1       8.2e-06     25.26          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term     IPR Description                           Source     Source Term      Source Description           Alignment
IPR040411    Uncharacterized protein At5g23160-like    PANTHER    PTHR34379        OS07G0553800 PROTEIN         coord: 3..292
None         No IPR available                          PANTHER    PTHR34379:SF3    PROTEIN, PUTATIVE-RELATED    coord: 3..292

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
HG10023085.1    HG10023085.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
cellular_component    GO:0016021        integral component of membrane