HG10013217 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10013217
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUPF0565 protein C2orf69 homolog
LocationChr01: 27826406 .. 27827852 (+)
RNA-Seq ExpressionHG10013217
SyntenyHG10013217
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATCGTTGGAGTGGAATTTTGAAGGTTCCATTGCATCCCAACAGCCGCAAGTTTTATCGAGTTGCAGCTTCACTCTGCCTTTCCCCAACCTCCAAAACATTGACTGTAAGTTCTCGAATATTCTGCCATTGCATCACTATCATAAGATGAAAATCTTCCACATATTGCCCTAAACTAAACGCCTTCCTTTCAGATTGAGGGAAATATTATATGTTATTTTGCCTTTTGGAAAATGATATTCATGGGGTTTTTTCGAACTTTTATTAGGTTCCTTGTGCAAACGCCATCTTTTTTAATGGGGATCGGGTCGAAGGGACAGGCAATCCAGTGATTGAGAGATTGTCTAATCTGCAGAACATAGCCGAAATTTTGGTTTCGAAATTTGGAGAGTCCACTAATGCATGGGTTGTCGAGGCTTCTGATTTCAATGGGGCTTTTGCTATATATCAGGATTTTATACCGTCTCTCAATCGGTGGGGAGAACCAAATTCATATACTCCAAATGGGTTTCCTGCTTCACTGTCAACCATTTCACTTTTGGGAAGTTGCTACAATGAGGTATGGTCCAAATGAGGTCGCTGCTTGTGATCTTTTGAGTCAACAATATATTGATTCGAATCTTTCACCTTTTACTCTCTTAGTCGAATGGTATGCCTAAACCATATGAGTGTGCTCAAGAAAGATTTGAATCCTAACTTCTTGGTGAATGGTATATGGCTAAACCAATTGACCACCTAGTTCAGGAAAGATTCGACTCTTTGACCTCGAGATTTGAATCTTAACTAAGCTATCTTTAGGTTGACTGCTTGCTTGTGATTTTGATGTGGACTAATGCATTCAGTTATGGATTTGGCAGGTAAAGAAGATAATTTCTAGGGGAAGACCAGGATCACAGGAAACCACCATATCCACACTGGGCTGCTGCACCCCCAAAACAATCATTCTTGGATTTAGCAAGGGAGGTACTGTGGTTAACCAGCTAGTTGCTGAACTTGGCTCCAAAGACTTGATAGCTGCTGATAAGAATCTACCTCATTCCAAGCAAGAACCAGGTGTTGAATGTCCTAAACTTGATGAGGTTCAGTTCATATCAACTACAGAACAAAGCTTTTTGAAAAGCATAACAGAGATTCATTATGTGGATGTTGGGTTGAACTCCCATGGTGCATATCTAACAGATCCTGAGGTAATCAAAAGAATCTCCAGCAGCCTTGTTCAGGAGTCGAGAGGAATCCGTTTTGTTCTTCATGGTACACCAAGACAATGGTGTGACAGCAGAAGAGTTTGGATTCGAGATGAAAAGGAAAGAATGATGAGTTTTCTTGAATCCGAAGCTCCCAGAAGTGGAGGAAACTTGCAGGTTTGTGAGAAATTTTATTTTGCTGATAGGCCTGCGGACATGCAGATGCATTTTGAAATAATTGAAAAGTTGGATGTTTGCTGA

mRNA sequence

ATGGATCGTTGGAGTGGAATTTTGAAGGTTCCATTGCATCCCAACAGCCGCAAGTTTTATCGAGTTGCAGCTTCACTCTGCCTTTCCCCAACCTCCAAAACATTGACTGTTCCTTGTGCAAACGCCATCTTTTTTAATGGGGATCGGGTCGAAGGGACAGGCAATCCAGTGATTGAGAGATTGTCTAATCTGCAGAACATAGCCGAAATTTTGGTTTCGAAATTTGGAGAGTCCACTAATGCATGGGTTGTCGAGGCTTCTGATTTCAATGGGGCTTTTGCTATATATCAGGATTTTATACCGTCTCTCAATCGGTGGGGAGAACCAAATTCATATACTCCAAATGGGTTTCCTGCTTCACTGTCAACCATTTCACTTTTGGGAAGTTGCTACAATGAGGTAAAGAAGATAATTTCTAGGGGAAGACCAGGATCACAGGAAACCACCATATCCACACTGGGCTGCTGCACCCCCAAAACAATCATTCTTGGATTTAGCAAGGGAGGTACTGTGGTTAACCAGCTAGTTGCTGAACTTGGCTCCAAAGACTTGATAGCTGCTGATAAGAATCTACCTCATTCCAAGCAAGAACCAGGTGTTGAATGTCCTAAACTTGATGAGGTTCAGTTCATATCAACTACAGAACAAAGCTTTTTGAAAAGCATAACAGAGATTCATTATGTGGATGTTGGGTTGAACTCCCATGGTGCATATCTAACAGATCCTGAGGTAATCAAAAGAATCTCCAGCAGCCTTGTTCAGGAGTCGAGAGGAATCCGTTTTGTTCTTCATGGTACACCAAGACAATGGTGTGACAGCAGAAGAGTTTGGATTCGAGATGAAAAGGAAAGAATGATGAGTTTTCTTGAATCCGAAGCTCCCAGAAGTGGAGGAAACTTGCAGGTTTGTGAGAAATTTTATTTTGCTGATAGGCCTGCGGACATGCAGATGCATTTTGAAATAATTGAAAAGTTGGATGTTTGCTGA

Coding sequence (CDS)

ATGGATCGTTGGAGTGGAATTTTGAAGGTTCCATTGCATCCCAACAGCCGCAAGTTTTATCGAGTTGCAGCTTCACTCTGCCTTTCCCCAACCTCCAAAACATTGACTGTTCCTTGTGCAAACGCCATCTTTTTTAATGGGGATCGGGTCGAAGGGACAGGCAATCCAGTGATTGAGAGATTGTCTAATCTGCAGAACATAGCCGAAATTTTGGTTTCGAAATTTGGAGAGTCCACTAATGCATGGGTTGTCGAGGCTTCTGATTTCAATGGGGCTTTTGCTATATATCAGGATTTTATACCGTCTCTCAATCGGTGGGGAGAACCAAATTCATATACTCCAAATGGGTTTCCTGCTTCACTGTCAACCATTTCACTTTTGGGAAGTTGCTACAATGAGGTAAAGAAGATAATTTCTAGGGGAAGACCAGGATCACAGGAAACCACCATATCCACACTGGGCTGCTGCACCCCCAAAACAATCATTCTTGGATTTAGCAAGGGAGGTACTGTGGTTAACCAGCTAGTTGCTGAACTTGGCTCCAAAGACTTGATAGCTGCTGATAAGAATCTACCTCATTCCAAGCAAGAACCAGGTGTTGAATGTCCTAAACTTGATGAGGTTCAGTTCATATCAACTACAGAACAAAGCTTTTTGAAAAGCATAACAGAGATTCATTATGTGGATGTTGGGTTGAACTCCCATGGTGCATATCTAACAGATCCTGAGGTAATCAAAAGAATCTCCAGCAGCCTTGTTCAGGAGTCGAGAGGAATCCGTTTTGTTCTTCATGGTACACCAAGACAATGGTGTGACAGCAGAAGAGTTTGGATTCGAGATGAAAAGGAAAGAATGATGAGTTTTCTTGAATCCGAAGCTCCCAGAAGTGGAGGAAACTTGCAGGTTTGTGAGAAATTTTATTTTGCTGATAGGCCTGCGGACATGCAGATGCATTTTGAAATAATTGAAAAGTTGGATGTTTGCTGA

Protein sequence

MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIERLSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPASLSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELGSKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLTDPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNLQVCEKFYFADRPADMQMHFEIIEKLDVC
Homology
BLAST of HG10013217 vs. NCBI nr
Match: XP_011650653.1 (UPF0565 protein C2orf69 homolog [Cucumis sativus] >KGN56378.1 hypothetical protein Csa_011056 [Cucumis sativus])

HSP 1 Score: 610.9 bits (1574), Expect = 6.4e-171
Identity = 296/328 (90.24%), Postives = 310/328 (94.51%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           MDRW+GILKVPL+ NSRKFYRVA SLCLSPTSKTLTVP  NAIFFNGDRVEGTGNPVIER
Sbjct: 1   MDRWNGILKVPLNSNSRKFYRVAVSLCLSPTSKTLTVPRGNAIFFNGDRVEGTGNPVIER 60

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           LSNLQNIAEILVSKFG+STNAWVVEASDFNGAFAIYQDFIPSLNRWGEP SYTPNGFPAS
Sbjct: 61  LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
           LST+SLLGSCYNEVKKI+SRG+P SQET ISTL CCTP+TIILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPRSQETAISTLSCCTPETIILGFSKGGTVVNQLVTELG 180

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           SKDL+AAD+NLP SKQE GVEC KLDE+QF+ TT QSFLKSITEIHYVDVGLNSHGAYLT
Sbjct: 181 SKDLMAADENLPLSKQESGVECSKLDEIQFVPTTGQSFLKSITEIHYVDVGLNSHGAYLT 240

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           DPEVIKRISSSL+QESRGIRFVLHGTPRQWCD RRVWIRDEKE+M SFLESEA RSGGNL
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKEKMRSFLESEALRSGGNL 300

Query: 301 QVCEKFYFADRPADMQMHFEIIEKLDVC 329
           +V EKFYFADRPADMQMHFEIIEKLDVC
Sbjct: 301 KVNEKFYFADRPADMQMHFEIIEKLDVC 328

BLAST of HG10013217 vs. NCBI nr
Match: XP_038899615.1 (uncharacterized protein LOC120086870 [Benincasa hispida])

HSP 1 Score: 608.6 bits (1568), Expect = 3.2e-170
Identity = 296/327 (90.52%), Postives = 309/327 (94.50%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           MDRWSGILKVPLHP SRKFYRVAASLCLSPTSKTL VPCANAIFFNGDRVEGTGNPVIER
Sbjct: 1   MDRWSGILKVPLHPKSRKFYRVAASLCLSPTSKTLNVPCANAIFFNGDRVEGTGNPVIER 60

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEP SY+P+GFPAS
Sbjct: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYSPDGFPAS 120

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
           LST+SLL SCYNEVKKII   +P S+ TTISTLGCCTPKTIILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLESCYNEVKKIIPWVKPVSKGTTISTLGCCTPKTIILGFSKGGTVVNQLVTELG 180

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           SKDLIAAD+++PHSKQEPGVEC  L+E QFISTTEQSFLKSITEIHYVDVGLNSHGAYLT
Sbjct: 181 SKDLIAADEDIPHSKQEPGVECSTLEEDQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKE+MMSFL SEA RSGGNL
Sbjct: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKEKMMSFLASEALRSGGNL 300

Query: 301 QVCEKFYFADRPADMQMHFEIIEKLDV 328
           Q CE+FYF+ +P DMQMHFEIIEKLDV
Sbjct: 301 QFCERFYFSGKPGDMQMHFEIIEKLDV 327

BLAST of HG10013217 vs. NCBI nr
Match: XP_008437824.1 (PREDICTED: UPF0565 protein C2orf69 homolog [Cucumis melo])

HSP 1 Score: 607.1 bits (1564), Expect = 9.2e-170
Identity = 297/328 (90.55%), Postives = 307/328 (93.60%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           MDRW+GILKVPL  NSRKFYRVA SLCLSPTSKTLTVP ANAIFFNGDRVEGTGNPVIE 
Sbjct: 1   MDRWNGILKVPLRSNSRKFYRVAVSLCLSPTSKTLTVPRANAIFFNGDRVEGTGNPVIEG 60

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           LSNLQNIAEILVSKFG+STNAWVVEASDFNGAFAIYQDFIPSLNRWGEP SYTPNGFPAS
Sbjct: 61  LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
           LST+SLLGSCYNEVKKI+SRG+PGSQETTI TL  CTP+T+ILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPGSQETTIPTLSGCTPETVILGFSKGGTVVNQLVTELG 180

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           SKDLIAAD+NLP SKQE GVEC KLDE QFI TTE SFLKSITEIHYVDVGLN+HGAYLT
Sbjct: 181 SKDLIAADENLPLSKQESGVECSKLDENQFIPTTEHSFLKSITEIHYVDVGLNTHGAYLT 240

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           DPEVIKRISSSL+QESRGIRFVLHGTPRQWCD RRVWIRDEKE M SFLESEA RSGGNL
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKETMTSFLESEALRSGGNL 300

Query: 301 QVCEKFYFADRPADMQMHFEIIEKLDVC 329
           QV EKFYFADRPADMQMHFEIIEKLDVC
Sbjct: 301 QVYEKFYFADRPADMQMHFEIIEKLDVC 328

BLAST of HG10013217 vs. NCBI nr
Match: KAG7020705.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 593.6 bits (1529), Expect = 1.1e-165
Identity = 287/328 (87.50%), Postives = 304/328 (92.68%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           M+RW GILKVPLHP S KFYRVAASLCLSP+SKTLT+P ANAI FNGDRVEGTGNPVIER
Sbjct: 655 MERWCGILKVPLHPQSHKFYRVAASLCLSPSSKTLTMPHANAILFNGDRVEGTGNPVIER 714

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           LS+LQNIA+ILVSKFG+STNAWV+EASDFNG FAIY DFIPSLNRWGEP SYT NGFPAS
Sbjct: 715 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 774

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
           +ST+SLLGSCY+EVKKIISR   GS E TISTLGC TPKTIILGFSKGGTVVNQLVAELG
Sbjct: 775 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 834

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           SKDLIAAD+N PHSKQ PGVEC KLDEVQFI  TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 835 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 894

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           DPEV+KRIS+SLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKE+M+S LESEA RSGG L
Sbjct: 895 DPEVMKRISNSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKEKMLSLLESEARRSGGRL 954

Query: 301 QVCEKFYFADRPADMQMHFEIIEKLDVC 329
           QVCE+F+FADRPADMQMHFEIIEKLDVC
Sbjct: 955 QVCERFHFADRPADMQMHFEIIEKLDVC 982

BLAST of HG10013217 vs. NCBI nr
Match: KAG6582701.1 (hypothetical protein SDJN03_22703, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 593.6 bits (1529), Expect = 1.1e-165
Identity = 287/328 (87.50%), Postives = 304/328 (92.68%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           M+RW GILKVPLHP S KFYRVAASLCLSP+SKTLT+P ANAI FNGDRVEGTGNPVIER
Sbjct: 46  MERWCGILKVPLHPQSHKFYRVAASLCLSPSSKTLTMPHANAILFNGDRVEGTGNPVIER 105

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           LS+LQNIA+ILVSKFG+STNAWV+EASDFNG FAIY DFIPSLNRWGEP SYT NGFPAS
Sbjct: 106 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 165

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
           +ST+SLLGSCY+EVKKIISR   GS E TISTLGC TPKTIILGFSKGGTVVNQLVAELG
Sbjct: 166 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 225

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           SKDLIAAD+N PHSKQ PGVEC KLDEVQFI  TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 226 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 285

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           DPEV+KRIS+SLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKE+M+S LESEA RSGG L
Sbjct: 286 DPEVMKRISNSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKEKMLSLLESEARRSGGRL 345

Query: 301 QVCEKFYFADRPADMQMHFEIIEKLDVC 329
           QVCE+F+FADRPADMQMHFEIIEKLDVC
Sbjct: 346 QVCERFHFADRPADMQMHFEIIEKLDVC 373

BLAST of HG10013217 vs. ExPASy TrEMBL
Match: A0A0A0L3P6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G118130 PE=4 SV=1)

HSP 1 Score: 610.9 bits (1574), Expect = 3.1e-171
Identity = 296/328 (90.24%), Postives = 310/328 (94.51%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           MDRW+GILKVPL+ NSRKFYRVA SLCLSPTSKTLTVP  NAIFFNGDRVEGTGNPVIER
Sbjct: 1   MDRWNGILKVPLNSNSRKFYRVAVSLCLSPTSKTLTVPRGNAIFFNGDRVEGTGNPVIER 60

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           LSNLQNIAEILVSKFG+STNAWVVEASDFNGAFAIYQDFIPSLNRWGEP SYTPNGFPAS
Sbjct: 61  LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
           LST+SLLGSCYNEVKKI+SRG+P SQET ISTL CCTP+TIILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPRSQETAISTLSCCTPETIILGFSKGGTVVNQLVTELG 180

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           SKDL+AAD+NLP SKQE GVEC KLDE+QF+ TT QSFLKSITEIHYVDVGLNSHGAYLT
Sbjct: 181 SKDLMAADENLPLSKQESGVECSKLDEIQFVPTTGQSFLKSITEIHYVDVGLNSHGAYLT 240

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           DPEVIKRISSSL+QESRGIRFVLHGTPRQWCD RRVWIRDEKE+M SFLESEA RSGGNL
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKEKMRSFLESEALRSGGNL 300

Query: 301 QVCEKFYFADRPADMQMHFEIIEKLDVC 329
           +V EKFYFADRPADMQMHFEIIEKLDVC
Sbjct: 301 KVNEKFYFADRPADMQMHFEIIEKLDVC 328

BLAST of HG10013217 vs. ExPASy TrEMBL
Match: A0A1S3AV17 (UPF0565 protein C2orf69 homolog OS=Cucumis melo OX=3656 GN=LOC103483140 PE=4 SV=1)

HSP 1 Score: 607.1 bits (1564), Expect = 4.5e-170
Identity = 297/328 (90.55%), Postives = 307/328 (93.60%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           MDRW+GILKVPL  NSRKFYRVA SLCLSPTSKTLTVP ANAIFFNGDRVEGTGNPVIE 
Sbjct: 1   MDRWNGILKVPLRSNSRKFYRVAVSLCLSPTSKTLTVPRANAIFFNGDRVEGTGNPVIEG 60

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           LSNLQNIAEILVSKFG+STNAWVVEASDFNGAFAIYQDFIPSLNRWGEP SYTPNGFPAS
Sbjct: 61  LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
           LST+SLLGSCYNEVKKI+SRG+PGSQETTI TL  CTP+T+ILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPGSQETTIPTLSGCTPETVILGFSKGGTVVNQLVTELG 180

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           SKDLIAAD+NLP SKQE GVEC KLDE QFI TTE SFLKSITEIHYVDVGLN+HGAYLT
Sbjct: 181 SKDLIAADENLPLSKQESGVECSKLDENQFIPTTEHSFLKSITEIHYVDVGLNTHGAYLT 240

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           DPEVIKRISSSL+QESRGIRFVLHGTPRQWCD RRVWIRDEKE M SFLESEA RSGGNL
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKETMTSFLESEALRSGGNL 300

Query: 301 QVCEKFYFADRPADMQMHFEIIEKLDVC 329
           QV EKFYFADRPADMQMHFEIIEKLDVC
Sbjct: 301 QVYEKFYFADRPADMQMHFEIIEKLDVC 328

BLAST of HG10013217 vs. ExPASy TrEMBL
Match: A0A6J1E9C8 (uncharacterized protein LOC111431905 OS=Cucurbita moschata OX=3662 GN=LOC111431905 PE=4 SV=1)

HSP 1 Score: 590.1 bits (1520), Expect = 5.6e-165
Identity = 286/328 (87.20%), Postives = 302/328 (92.07%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           M+RW GILKVPLHP S KFYRVAASLCLSPTSKTLT+P ANAI FNGDRVEGTGNPVIER
Sbjct: 1   MERWCGILKVPLHPQSHKFYRVAASLCLSPTSKTLTMPHANAILFNGDRVEGTGNPVIER 60

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           LS+LQNIA+ILVSKFG+STNAWV+EASDFNG FAIY DF+ SLNRWGEP SYT NGFPAS
Sbjct: 61  LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFMHSLNRWGEPKSYTANGFPAS 120

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
           +ST+SLLGSCY+EVKKIISR   GS E TISTLGC TPKTIILGFSKGGTVVNQLVAELG
Sbjct: 121 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVVNQLVAELG 180

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           SKDLIAAD+N PHSKQ PGVEC KLDEVQFI  TEQSFLKSITEIHYVDVGLNSHGAY T
Sbjct: 181 SKDLIAADENPPHSKQAPGVECSKLDEVQFIPNTEQSFLKSITEIHYVDVGLNSHGAYFT 240

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           DPEVIKRIS+SLVQESRGIRF+LHGTPRQWCDSRRVWIRDEKE+M S LESEA RSGG L
Sbjct: 241 DPEVIKRISNSLVQESRGIRFILHGTPRQWCDSRRVWIRDEKEKMSSLLESEARRSGGRL 300

Query: 301 QVCEKFYFADRPADMQMHFEIIEKLDVC 329
           QVCE+F+FADRPADMQMHFEIIEKLDVC
Sbjct: 301 QVCERFHFADRPADMQMHFEIIEKLDVC 328

BLAST of HG10013217 vs. ExPASy TrEMBL
Match: A0A6J1IN96 (uncharacterized protein LOC111479029 OS=Cucurbita maxima OX=3661 GN=LOC111479029 PE=4 SV=1)

HSP 1 Score: 578.6 bits (1490), Expect = 1.7e-161
Identity = 281/328 (85.67%), Postives = 298/328 (90.85%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           M+RW GILKVPLHP   +FYRVAASLCLSPTSKTLT+P ANAI FNGDRVEGTGNPVIER
Sbjct: 61  MERWCGILKVPLHPQCHRFYRVAASLCLSPTSKTLTMPHANAILFNGDRVEGTGNPVIER 120

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           LS+LQNIA+ILVSKFG+STNAWV+EASDFNG FAIY DFIPSLNRWGEP SYT NGFPAS
Sbjct: 121 LSDLQNIADILVSKFGDSTNAWVIEASDFNGPFAIYHDFIPSLNRWGEPKSYTANGFPAS 180

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
           +ST+SLLGSCY+EVKKIISR   GS E TISTLGC TPKTIILGFSKGGTV NQLVAELG
Sbjct: 181 VSTLSLLGSCYSEVKKIISRRNQGSPEATISTLGCSTPKTIILGFSKGGTVANQLVAELG 240

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           SKDLIAAD+N PHSKQ PGVEC KLDE QFI  TEQSFLKSITEIHYVDVGLNS GAY T
Sbjct: 241 SKDLIAADENPPHSKQAPGVECSKLDEDQFIPNTEQSFLKSITEIHYVDVGLNSQGAYFT 300

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           DPE IKRIS+SLVQESRGIRFVLHGTPRQW DSRRVWIRDEK++M+S LESEA RSGG L
Sbjct: 301 DPEAIKRISNSLVQESRGIRFVLHGTPRQWGDSRRVWIRDEKDKMLSLLESEARRSGGRL 360

Query: 301 QVCEKFYFADRPADMQMHFEIIEKLDVC 329
           QVCE+F+FADRPADMQMHFEIIEKLDVC
Sbjct: 361 QVCERFHFADRPADMQMHFEIIEKLDVC 388

BLAST of HG10013217 vs. ExPASy TrEMBL
Match: A0A5A7U5V0 (UPF0565 protein C2orf69-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold171G00630 PE=4 SV=1)

HSP 1 Score: 554.3 bits (1427), Expect = 3.4e-154
Identity = 271/302 (89.74%), Postives = 282/302 (93.38%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           MDRW+GILKVPL  NSRKFYRVA SLCLSPTSKTLTVP ANAIFFNGDRVEGTGNPVIE 
Sbjct: 1   MDRWNGILKVPLRSNSRKFYRVAVSLCLSPTSKTLTVPRANAIFFNGDRVEGTGNPVIEG 60

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           LSNLQNIAEILVSKFG+STNAWVVEASDFNGAFAIYQDFIPSLNRWGEP SYTPNGFPAS
Sbjct: 61  LSNLQNIAEILVSKFGDSTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPKSYTPNGFPAS 120

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
           LST+SLLGSCYNEVKKI+SRG+PGSQETTI TL  CTP+T+ILGFSKGGTVVNQLV ELG
Sbjct: 121 LSTVSLLGSCYNEVKKIVSRGKPGSQETTIPTLSGCTPETVILGFSKGGTVVNQLVTELG 180

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           SKDLIAAD+NLP SKQE GVEC KLDE QFI TTE SFLKSITEIHYVDVGLN+HGAYLT
Sbjct: 181 SKDLIAADENLPLSKQESGVECSKLDENQFIPTTEHSFLKSITEIHYVDVGLNTHGAYLT 240

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           DPEVIKRISSSL+QESRGIRFVLHGTPRQWCD RRVWIRDEKE M SFLESEA RSGGNL
Sbjct: 241 DPEVIKRISSSLIQESRGIRFVLHGTPRQWCDRRRVWIRDEKETMTSFLESEALRSGGNL 300

Query: 301 QV 303
           Q+
Sbjct: 301 QI 302

BLAST of HG10013217 vs. TAIR 10
Match: AT2G44850.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 13 plant structures; EXPRESSED DURING: M germinated pollen stage, LP.04 four leaves visible, 4 anthesis, petal differentiation and expansion stage, E expanded cotyledon stage; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0565 (InterPro:IPR018881); Has 106 Blast hits to 106 proteins in 50 species: Archae - 0; Bacteria - 0; Metazoa - 73; Fungi - 0; Plants - 31; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 370.5 bits (950), Expect = 1.4e-102
Identity = 184/327 (56.27%), Postives = 240/327 (73.39%), Query Frame = 0

Query: 1   MDRWSGILKVPLHPNSRKFYRVAASLCLSPTSKTLTVPCANAIFFNGDRVEGTGNPVIER 60
           M+RWSG+LK+PL   +  +YRVAASLCLS +SKTLTVP ANAIFF+GD+V+ TGN VIER
Sbjct: 1   MERWSGVLKIPLDATTSNYYRVAASLCLS-SSKTLTVPSANAIFFHGDKVQDTGNHVIER 60

Query: 61  LSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIYQDFIPSLNRWGEPNSYTPNGFPAS 120
           L +LQ +AEI+VSKFG S NAWVVEAS FNG FAIY+DF+PS+N  G P SY+P GFPAS
Sbjct: 61  LYDLQKVAEIIVSKFGNSVNAWVVEASVFNGPFAIYKDFVPSVNHMGAPKSYSPVGFPAS 120

Query: 121 LSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCCTPKTIILGFSKGGTVVNQLVAELG 180
            S +SLL SC +EV K    G        I+++  C PKTI+LGFSKGG V+NQL++E+ 
Sbjct: 121 SSIVSLLSSCLHEVLK---EGTDVCLIDQIASVHHC-PKTIVLGFSKGGVVMNQLMSEIS 180

Query: 181 SKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQSFLKSITEIHYVDVGLNSHGAYLT 240
           S D   A  +    ++       + +++Q I  +++SFL SI+E+HY+DVGLNS GAY+T
Sbjct: 181 SLDTNFAKTSSAMVEESTS----QHEKIQIIPASKESFLNSISEVHYIDVGLNSSGAYIT 240

Query: 241 DPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRVWIRDEKERMMSFLESEAPRSGGNL 300
           D  V++RIS  L + +  +R V+HGTPRQWCD  R WIR EK+ ++  L++E   SGG L
Sbjct: 241 DHNVVQRISQRLARGADSLRIVIHGTPRQWCDELRGWIRKEKDELVRLLKAETENSGGKL 300

Query: 301 QVCEKFYFADRPADMQMHFEIIEKLDV 328
           QVCE+FYF+DR AD+QMHFEII+ +DV
Sbjct: 301 QVCERFYFSDRLADLQMHFEIIDAMDV 318

BLAST of HG10013217 vs. TAIR 10
Match: AT2G44850.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0565 (InterPro:IPR018881); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G45380.1); Has 138 Blast hits to 138 proteins in 53 species: Archae - 0; Bacteria - 0; Metazoa - 73; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 3 (source: NCBI BLink). )

HSP 1 Score: 322.8 bits (826), Expect = 3.2e-88
Identity = 160/291 (54.98%), Postives = 210/291 (72.16%), Query Frame = 0

Query: 37  VPCANAIFFNGDRVEGTGNPVIERLSNLQNIAEILVSKFGESTNAWVVEASDFNGAFAIY 96
           VP ANAIFF+GD+V+ TGN VIERL +LQ +AEI+VSKFG S NAWVVEAS FNG FAIY
Sbjct: 117 VPSANAIFFHGDKVQDTGNHVIERLYDLQKVAEIIVSKFGNSVNAWVVEASVFNGPFAIY 176

Query: 97  QDFIPSLNRWGEPNSYTPNGFPASLSTISLLGSCYNEVKKIISRGRPGSQETTISTLGCC 156
           +DF+PS+N  G P SY+P GFPAS S +SLL SC +EV K    G        I+++  C
Sbjct: 177 KDFVPSVNHMGAPKSYSPVGFPASSSIVSLLSSCLHEVLK---EGTDVCLIDQIASVHHC 236

Query: 157 TPKTIILGFSKGGTVVNQLVAELGSKDLIAADKNLPHSKQEPGVECPKLDEVQFISTTEQ 216
            PKTI+LGFSKGG V+NQL++E+ S D   A  +    ++       + +++Q I  +++
Sbjct: 237 -PKTIVLGFSKGGVVMNQLMSEISSLDTNFAKTSSAMVEESTS----QHEKIQIIPASKE 296

Query: 217 SFLKSITEIHYVDVGLNSHGAYLTDPEVIKRISSSLVQESRGIRFVLHGTPRQWCDSRRV 276
           SFL SI+E+HY+DVGLNS GAY+TD  V++RIS  L + +  +R V+HGTPRQWCD  R 
Sbjct: 297 SFLNSISEVHYIDVGLNSSGAYITDHNVVQRISQRLARGADSLRIVIHGTPRQWCDELRG 356

Query: 277 WIRDEKERMMSFLESEAPRSGGNLQVCEKFYFADRPADMQMHFEIIEKLDV 328
           WIR EK+ ++  L++E   SGG LQVCE+FYF+DR AD+QMHFEII+ +DV
Sbjct: 357 WIRKEKDELVRLLKAETENSGGKLQVCERFYFSDRLADLQMHFEIIDAMDV 399

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011650653.16.4e-17190.24UPF0565 protein C2orf69 homolog [Cucumis sativus] >KGN56378.1 hypothetical prote... [more]
XP_038899615.13.2e-17090.52uncharacterized protein LOC120086870 [Benincasa hispida][more]
XP_008437824.19.2e-17090.55PREDICTED: UPF0565 protein C2orf69 homolog [Cucumis melo][more]
KAG7020705.11.1e-16587.50Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
KAG6582701.11.1e-16587.50hypothetical protein SDJN03_22703, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0L3P63.1e-17190.24Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G118130 PE=4 SV=1[more]
A0A1S3AV174.5e-17090.55UPF0565 protein C2orf69 homolog OS=Cucumis melo OX=3656 GN=LOC103483140 PE=4 SV=... [more]
A0A6J1E9C85.6e-16587.20uncharacterized protein LOC111431905 OS=Cucurbita moschata OX=3662 GN=LOC1114319... [more]
A0A6J1IN961.7e-16185.67uncharacterized protein LOC111479029 OS=Cucurbita maxima OX=3661 GN=LOC111479029... [more]
A0A5A7U5V03.4e-15489.74UPF0565 protein C2orf69-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E... [more]
Match NameE-valueIdentityDescription
AT2G44850.21.4e-10256.27unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT2G44850.13.2e-8854.98unknown protein; CONTAINS InterPro DOMAIN/s: Uncharacterised protein family UPF0... [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR018881C2orf69PFAMPF10561UPF0565coord: 151..290
e-value: 3.5E-10
score: 39.5
IPR018881C2orf69PANTHERPTHR31296UPF0565 PROTEIN C2ORF69coord: 1..326

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10013217.1HG10013217.1mRNA