Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGGAATAACTGTTTCAAAAGCAACAAAGTGATGGCGCAAGACGAGCCTGATGATCTTTTGCCTCCGATGGAAGCTAAGAAAGTTGAGCAAAAACCGCCGTCTGGATCGGCGATGGCGAAGCCGAAGACGGCAGAGGCGAGAACCGGTGGTGCGGCCGGTAAGAAGGTAGTGAGGTTTAAGTTAGAAGAAGAAGAGGAGGAGGAGAAAATTTCCGGCGGAAGTGGCGGCGATGGTGGAGTATTGAGGATTAAAGTGGTGATGTCTCAGAAAGAGTTGAAGCAGATATTGAAGGATAGAGAGAACAATTCGTGTACATTGGAGGAATTGATTGCTGAATTGAAGGTGAAAGGCAGGACGAGGATTTCAGATGGGAGAATGGATGAAGTTGAAGATGAAAAAGGAAGCTGGAAGCCGGATCTGGAATGTATTCCTGAAGGTCTCCATTAA
mRNA sequence
ATGGGGAATAACTGTTTCAAAAGCAACAAAGTGATGGCGCAAGACGAGCCTGATGATCTTTTGCCTCCGATGGAAGCTAAGAAAGTTGAGCAAAAACCGCCGTCTGGATCGGCGATGGCGAAGCCGAAGACGGCAGAGGCGAGAACCGGTGGTGCGGCCGGTAAGAAGGTAGTGAGGTTTAAGTTAGAAGAAGAAGAGGAGGAGGAGAAAATTTCCGGCGGAAGTGGCGGCGATGGTGGAGTATTGAGGATTAAAGTGGTGATGTCTCAGAAAGAGTTGAAGCAGATATTGAAGGATAGAGAGAACAATTCGTGTACATTGGAGGAATTGATTGCTGAATTGAAGGTGAAAGGCAGGACGAGGATTTCAGATGGGAGAATGGATGAAGTTGAAGATGAAAAAGGAAGCTGGAAGCCGGATCTGGAATGTATTCCTGAAGGTCTCCATTAA
Coding sequence (CDS)
ATGGGGAATAACTGTTTCAAAAGCAACAAAGTGATGGCGCAAGACGAGCCTGATGATCTTTTGCCTCCGATGGAAGCTAAGAAAGTTGAGCAAAAACCGCCGTCTGGATCGGCGATGGCGAAGCCGAAGACGGCAGAGGCGAGAACCGGTGGTGCGGCCGGTAAGAAGGTAGTGAGGTTTAAGTTAGAAGAAGAAGAGGAGGAGGAGAAAATTTCCGGCGGAAGTGGCGGCGATGGTGGAGTATTGAGGATTAAAGTGGTGATGTCTCAGAAAGAGTTGAAGCAGATATTGAAGGATAGAGAGAACAATTCGTGTACATTGGAGGAATTGATTGCTGAATTGAAGGTGAAAGGCAGGACGAGGATTTCAGATGGGAGAATGGATGAAGTTGAAGATGAAAAAGGAAGCTGGAAGCCGGATCTGGAATGTATTCCTGAAGGTCTCCATTAA
Protein sequence
MGNNCFKSNKVMAQDEPDDLLPPMEAKKVEQKPPSGSAMAKPKTAEARTGGAAGKKVVRFKLEEEEEEEKISGGSGGDGGVLRIKVVMSQKELKQILKDRENNSCTLEELIAELKVKGRTRISDGRMDEVEDEKGSWKPDLECIPEGLH
Homology
BLAST of HG10000199 vs. NCBI nr
Match:
XP_038902397.1 (uncharacterized protein LOC120089037 [Benincasa hispida])
HSP 1 Score: 229.6 bits (584), Expect = 1.8e-56
Identity = 122/147 (82.99%), Postives = 131/147 (89.12%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLLPPMEAKKVEQKPPSGSAMAKPKTAEARTGGAAGKKVVRF 60
MGNNCFKSNKVMAQDEP+DLLPP+EAKKVE+KP GSAMAKPKTAEARTGGA+ KKVVRF
Sbjct: 1 MGNNCFKSNKVMAQDEPEDLLPPIEAKKVEEKPRPGSAMAKPKTAEARTGGAS-KKVVRF 60
Query: 61 KLEEEEEEEKISGGSGGDGGVLRIKVVMSQKELKQILKDRENNSCTLEELIAELKVKGRT 120
KL+EEEE+ + GDGGVLRIKVVMSQKELKQ+LKDRENNSCTLEELI ELKVKGRT
Sbjct: 61 KLQEEEEK------NSGDGGVLRIKVVMSQKELKQMLKDRENNSCTLEELITELKVKGRT 120
Query: 121 RISDGRMDEVEDEKGSWKPDLECIPEG 148
ISDGR+D VEDE G WKPDLE IPEG
Sbjct: 121 TISDGRIDAVEDENGRWKPDLEGIPEG 140
BLAST of HG10000199 vs. NCBI nr
Match:
KAG6570883.1 (hypothetical protein SDJN03_29798, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 204.1 bits (518), Expect = 8.2e-49
Identity = 115/155 (74.19%), Postives = 128/155 (82.58%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLL---PPMEAKKVEQKPPSGSAMAKPKTAEARTGGAAGKKV 60
MGN+CFKSNKVMAQDE L PP+EAKKVE+KP +GSAMAKPKTAE R+G AAGKKV
Sbjct: 1 MGNSCFKSNKVMAQDESCLALSNSPPVEAKKVEEKPVAGSAMAKPKTAEERSGAAAGKKV 60
Query: 61 VRFKLEEEEEEEKISGGSGGDG---GVLRIKVVMSQKELKQILKDRENNSCTLEELIAEL 120
VRFKL+EE+E SGGSGGDG GVLRIKVVMSQ+ELKQILK+ EN+S +LEELIAE
Sbjct: 61 VRFKLQEEDEN---SGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEF 120
Query: 121 KVKGRTRISDGRMDEVEDEKGSWKPDLECIPEGLH 150
KVKGRT +SD DEVEDE GS +P LECIPEGLH
Sbjct: 121 KVKGRTTVSDAITDEVEDENGSRRPALECIPEGLH 152
BLAST of HG10000199 vs. NCBI nr
Match:
KGN63254.1 (hypothetical protein Csa_022493 [Cucumis sativus])
HSP 1 Score: 168.7 bits (426), Expect = 3.8e-38
Identity = 100/151 (66.23%), Postives = 111/151 (73.51%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLLPP---MEAKKVEQKPPSGSAMAKPKTAEARTGGAAGKKV 60
MGN CFKSNKVMAQD+ D PP +E KKV+Q+P GSAMAKPK TGGAAGKKV
Sbjct: 1 MGNICFKSNKVMAQDDSYDDFPPHHLIEPKKVQQQPLPGSAMAKPKNG---TGGAAGKKV 60
Query: 61 VRFKLEEEEEEEKISGGSGGDGGVLRIKVVMSQKELKQILKDRENNSCTLEELIAELKVK 120
VRF L+EEE++E+ GVLRIKVV+SQKELKQILK RENNSC+LEELI ELKVK
Sbjct: 61 VRFNLQEEEKDEEGRNSGDSGPGVLRIKVVISQKELKQILKSRENNSCSLEELIEELKVK 120
Query: 121 GR-TRISDGRMDEVEDEKGSWKPDLECIPEG 148
GR T +S DE GSWKP LECIPEG
Sbjct: 121 GRATTVS-------ADETGSWKPALECIPEG 141
BLAST of HG10000199 vs. NCBI nr
Match:
TYK24218.1 (hypothetical protein E5676_scaffold27G00200 [Cucumis melo var. makuwa])
HSP 1 Score: 164.5 bits (415), Expect = 7.2e-37
Identity = 99/152 (65.13%), Postives = 113/152 (74.34%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLLPP---MEAKKVEQKP-PSGSAMAKPKTAEARTGGAAGKK 60
MGN CF++NKVMAQD+ D LPP +EA+KVE++P GSAMAKPK TGGAAGKK
Sbjct: 1 MGNICFRTNKVMAQDDSYDNLPPDQFIEAEKVEEQPLRPGSAMAKPKNG---TGGAAGKK 60
Query: 61 VVRFKL-EEEEEEEKISGGSGGDGGVLRIKVVMSQKELKQILKDRENNSCTLEELIAELK 120
VVRF L EEEE++E + G GVLRIKVV+SQKELK+ILK+RENNSC+LEELI ELK
Sbjct: 61 VVRFNLQEEEEDQEDRNSGDDSGAGVLRIKVVISQKELKEILKNRENNSCSLEELIEELK 120
Query: 121 VKGRTRISDGRMDEVEDEKGSWKPDLECIPEG 148
VKGR V DE GSWKP LECIPEG
Sbjct: 121 VKGRA-------TTVSDEIGSWKPALECIPEG 142
BLAST of HG10000199 vs. NCBI nr
Match:
XP_022140639.1 (uncharacterized protein LOC111011249 [Momordica charantia])
HSP 1 Score: 152.9 bits (385), Expect = 2.2e-33
Identity = 93/150 (62.00%), Postives = 106/150 (70.67%), Query Frame = 0
Query: 4 NCFKSNKVMAQDEPDDLLPPMEAK----KVEQKPPSGSAMAKPKTAEARTGGAAGKKVVR 63
NC ++N+VMAQDE P KVE KP +GSA+A+PKT EAR A KKVVR
Sbjct: 45 NCLRNNRVMAQDEACSPSPNSSLTETNYKVEDKPAAGSALARPKTEEARI-AARRKKVVR 104
Query: 64 FKLEEEEEEEKISGGSGGDGGVLRIKVVMSQKELKQILKDRENNSCTLEELIAELKVKGR 123
F+ E+E SGG GGVLRIKVV+SQKELKQILKDRE+NS TLEEL+AELK+KGR
Sbjct: 105 FQQREDEI-------SGGGGGVLRIKVVVSQKELKQILKDRESNSSTLEELLAELKMKGR 164
Query: 124 TRISDGRMDEVEDEKGSWKPDLECIPEGLH 150
T ISD R D EDE GSW+P LE IPE LH
Sbjct: 165 T-ISDARADNEEDENGSWRPALESIPEDLH 185
BLAST of HG10000199 vs. ExPASy TrEMBL
Match:
A0A0A0LQE9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G418890 PE=4 SV=1)
HSP 1 Score: 168.7 bits (426), Expect = 1.8e-38
Identity = 100/151 (66.23%), Postives = 111/151 (73.51%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLLPP---MEAKKVEQKPPSGSAMAKPKTAEARTGGAAGKKV 60
MGN CFKSNKVMAQD+ D PP +E KKV+Q+P GSAMAKPK TGGAAGKKV
Sbjct: 1 MGNICFKSNKVMAQDDSYDDFPPHHLIEPKKVQQQPLPGSAMAKPKNG---TGGAAGKKV 60
Query: 61 VRFKLEEEEEEEKISGGSGGDGGVLRIKVVMSQKELKQILKDRENNSCTLEELIAELKVK 120
VRF L+EEE++E+ GVLRIKVV+SQKELKQILK RENNSC+LEELI ELKVK
Sbjct: 61 VRFNLQEEEKDEEGRNSGDSGPGVLRIKVVISQKELKQILKSRENNSCSLEELIEELKVK 120
Query: 121 GR-TRISDGRMDEVEDEKGSWKPDLECIPEG 148
GR T +S DE GSWKP LECIPEG
Sbjct: 121 GRATTVS-------ADETGSWKPALECIPEG 141
BLAST of HG10000199 vs. ExPASy TrEMBL
Match:
A0A5D3DKZ8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold27G00200 PE=4 SV=1)
HSP 1 Score: 164.5 bits (415), Expect = 3.5e-37
Identity = 99/152 (65.13%), Postives = 113/152 (74.34%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLLPP---MEAKKVEQKP-PSGSAMAKPKTAEARTGGAAGKK 60
MGN CF++NKVMAQD+ D LPP +EA+KVE++P GSAMAKPK TGGAAGKK
Sbjct: 1 MGNICFRTNKVMAQDDSYDNLPPDQFIEAEKVEEQPLRPGSAMAKPKNG---TGGAAGKK 60
Query: 61 VVRFKL-EEEEEEEKISGGSGGDGGVLRIKVVMSQKELKQILKDRENNSCTLEELIAELK 120
VVRF L EEEE++E + G GVLRIKVV+SQKELK+ILK+RENNSC+LEELI ELK
Sbjct: 61 VVRFNLQEEEEDQEDRNSGDDSGAGVLRIKVVISQKELKEILKNRENNSCSLEELIEELK 120
Query: 121 VKGRTRISDGRMDEVEDEKGSWKPDLECIPEG 148
VKGR V DE GSWKP LECIPEG
Sbjct: 121 VKGRA-------TTVSDEIGSWKPALECIPEG 142
BLAST of HG10000199 vs. ExPASy TrEMBL
Match:
A0A6J1CG85 (uncharacterized protein LOC111011249 OS=Momordica charantia OX=3673 GN=LOC111011249 PE=4 SV=1)
HSP 1 Score: 152.9 bits (385), Expect = 1.0e-33
Identity = 93/150 (62.00%), Postives = 106/150 (70.67%), Query Frame = 0
Query: 4 NCFKSNKVMAQDEPDDLLPPMEAK----KVEQKPPSGSAMAKPKTAEARTGGAAGKKVVR 63
NC ++N+VMAQDE P KVE KP +GSA+A+PKT EAR A KKVVR
Sbjct: 45 NCLRNNRVMAQDEACSPSPNSSLTETNYKVEDKPAAGSALARPKTEEARI-AARRKKVVR 104
Query: 64 FKLEEEEEEEKISGGSGGDGGVLRIKVVMSQKELKQILKDRENNSCTLEELIAELKVKGR 123
F+ E+E SGG GGVLRIKVV+SQKELKQILKDRE+NS TLEEL+AELK+KGR
Sbjct: 105 FQQREDEI-------SGGGGGVLRIKVVVSQKELKQILKDRESNSSTLEELLAELKMKGR 164
Query: 124 TRISDGRMDEVEDEKGSWKPDLECIPEGLH 150
T ISD R D EDE GSW+P LE IPE LH
Sbjct: 165 T-ISDARADNEEDENGSWRPALESIPEDLH 185
BLAST of HG10000199 vs. ExPASy TrEMBL
Match:
A0A4V3WQ11 (Uncharacterized protein OS=Camellia sinensis var. sinensis OX=542762 GN=TEA_029836 PE=4 SV=1)
HSP 1 Score: 82.0 bits (201), Expect = 2.3e-12
Identity = 59/150 (39.33%), Postives = 85/150 (56.67%), Query Frame = 0
Query: 4 NCFKSNKVMAQDEPDDLLPPMEAKKVEQKPPSGSAMAKPKTAEARTGGAAGKKVVRFKL- 63
NC SNK++ QDE D+ P E + +E+ R KK VRFKL
Sbjct: 3 NCVTSNKILGQDEKDE--QPREERAIER--------------SVRHVDGGKKKSVRFKLH 62
Query: 64 EEEEEEEKISGGSGGD------GGVLRIKVVMSQKELKQILKDRENNSCTLEELIAELKV 123
EEEEEEE+ G+G + GG +RI+VV++Q+EL +IL + S ++E+++ E+K+
Sbjct: 63 EEEEEEEEEEDGNGEERQGCSKGGAVRIRVVVTQRELIRILNTKSKYS-SVEQMLGEMKL 122
Query: 124 KGRTRISDGRMDEVEDEKGSWKPDLECIPE 147
K R +IS R + E GSW+P LE IPE
Sbjct: 123 KSR-KISQIRSSDDEGTNGSWRPALESIPE 134
BLAST of HG10000199 vs. ExPASy TrEMBL
Match:
A0A7N2MA05 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)
HSP 1 Score: 79.0 bits (193), Expect = 1.9e-11
Identity = 63/146 (43.15%), Postives = 87/146 (59.59%), Query Frame = 0
Query: 4 NCFKSNKVMAQDEPDDLLPPMEAKKVEQKPPSGSAMAKP-KTAEARTGGAAGKKVVRFKL 63
NC SNK +AQ+E P EA+ VEQ PS ++ +P K + G KKVVRFKL
Sbjct: 3 NCL-SNKSLAQEEE----VPKEAEVVEQTKPSTASKLEPVKLVDG--GHKKKKKVVRFKL 62
Query: 64 EEEEEEEKISGGSGGDGGVLRIKVVMSQKELKQILKDREN-NSCTLEELIAELKVKGRTR 123
EE++ S GV+RI+VV++QKELKQIL +E ++E+L+ L ++GR +
Sbjct: 63 EEDDTNVGTSSEGDSRSGVVRIRVVVTQKELKQILDCKEGLKYSSVEQLVNALNLRGR-K 122
Query: 124 ISDGR-MDEVEDEKGSWKPDLECIPE 147
IS+ R DE E +W+P LE IPE
Sbjct: 123 ISEVRTSDEDEGINSNWRPALESIPE 140
BLAST of HG10000199 vs. TAIR 10
Match:
AT3G21680.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: root, flower, stamen; EXPRESSED DURING: 4 anthesis, petal differentiation and expansion stage; Has 34 Blast hits to 34 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 52.8 bits (125), Expect = 2.8e-07
Identity = 47/145 (32.41%), Postives = 71/145 (48.97%), Query Frame = 0
Query: 4 NCFKSNKVMAQDEPDDLLPPMEAKKVEQKPPSGSAMAKPKTAEARTGGAAGKKVVRFKLE 63
NC + + +A+ E DDL P K +E+ GK R + E
Sbjct: 3 NCLRHDNGVARKEKDDLDPEPLVKLLEE----------------------GKTSFRGEEE 62
Query: 64 EEEEEEKISGGSGGDGGVLRIKVVMSQKELKQILKDRENNSCTLEELIAELKVKGRTRIS 123
E E+ + V+RIKVV+++KEL+QIL +N ++++L+ LK GR IS
Sbjct: 63 SERSTEE-------ESKVVRIKVVVTKKELRQIL-GHKNGINSIQQLVHVLKDSGR-NIS 116
Query: 124 DGRMDEVEDEKG--SWKPDLECIPE 147
+E E E+G +W+P LE IPE
Sbjct: 123 MASYEEDEKEEGDENWRPTLESIPE 116
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038902397.1 | 1.8e-56 | 82.99 | uncharacterized protein LOC120089037 [Benincasa hispida] | [more] |
KAG6570883.1 | 8.2e-49 | 74.19 | hypothetical protein SDJN03_29798, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KGN63254.1 | 3.8e-38 | 66.23 | hypothetical protein Csa_022493 [Cucumis sativus] | [more] |
TYK24218.1 | 7.2e-37 | 65.13 | hypothetical protein E5676_scaffold27G00200 [Cucumis melo var. makuwa] | [more] |
XP_022140639.1 | 2.2e-33 | 62.00 | uncharacterized protein LOC111011249 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0LQE9 | 1.8e-38 | 66.23 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G418890 PE=4 SV=1 | [more] |
A0A5D3DKZ8 | 3.5e-37 | 65.13 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A6J1CG85 | 1.0e-33 | 62.00 | uncharacterized protein LOC111011249 OS=Momordica charantia OX=3673 GN=LOC111011... | [more] |
A0A4V3WQ11 | 2.3e-12 | 39.33 | Uncharacterized protein OS=Camellia sinensis var. sinensis OX=542762 GN=TEA_0298... | [more] |
A0A7N2MA05 | 1.9e-11 | 43.15 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT3G21680.1 | 2.8e-07 | 32.41 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin... | [more] |