Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSsinglestart_codonpolypeptidestop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGGAATAGCTGTTTTAAAAGCAATAAAGTTATGGCGCAAGATGAATCTTGTTTGGCTTTGTCTAATTCGCCTCCTGTGGAGGCTAAGAAAGTAGAGGAGAAACCGGTGGCCGGATCGGCTATGGCGAAGCCGAAGACGGCAGAGGAGAGAAGCGGTGCGGCTGCTGGTAAGAAGGTAGTGAGGTTTAAGCTACAAGAAGAAGACGAAAATTCTGGTGGAAGTGGCGGGGATGGAGACAGAGCCGGAGTATTGAGGATTAAAGTGGTGATGTCTCAGAGAGAGTTGAAGCAGATATTGAAGGAAAACGAGAACAGTTCACGTTCGTTGGAGGAATTGATTGCTGAATTTAAGGTGAAAGGCAGAACGACGGTTTCAGATGCGATAACCGATGAAGTTGAGGATGAAAATGGAAGCAGGAGGCCGGCTTTGGAATGTATTCCTGAAGGTCTCCATTAA
mRNA sequence
ATGGGGAATAGCTGTTTTAAAAGCAATAAAGTTATGGCGCAAGATGAATCTTGTTTGGCTTTGTCTAATTCGCCTCCTGTGGAGGCTAAGAAAGTAGAGGAGAAACCGGTGGCCGGATCGGCTATGGCGAAGCCGAAGACGGCAGAGGAGAGAAGCGGTGCGGCTGCTGGTAAGAAGGTAGTGAGGTTTAAGCTACAAGAAGAAGACGAAAATTCTGGTGGAAGTGGCGGGGATGGAGACAGAGCCGGAGTATTGAGGATTAAAGTGGTGATGTCTCAGAGAGAGTTGAAGCAGATATTGAAGGAAAACGAGAACAGTTCACGTTCGTTGGAGGAATTGATTGCTGAATTTAAGGTGAAAGGCAGAACGACGGTTTCAGATGCGATAACCGATGAAGTTGAGGATGAAAATGGAAGCAGGAGGCCGGCTTTGGAATGTATTCCTGAAGGTCTCCATTAA
Coding sequence (CDS)
ATGGGGAATAGCTGTTTTAAAAGCAATAAAGTTATGGCGCAAGATGAATCTTGTTTGGCTTTGTCTAATTCGCCTCCTGTGGAGGCTAAGAAAGTAGAGGAGAAACCGGTGGCCGGATCGGCTATGGCGAAGCCGAAGACGGCAGAGGAGAGAAGCGGTGCGGCTGCTGGTAAGAAGGTAGTGAGGTTTAAGCTACAAGAAGAAGACGAAAATTCTGGTGGAAGTGGCGGGGATGGAGACAGAGCCGGAGTATTGAGGATTAAAGTGGTGATGTCTCAGAGAGAGTTGAAGCAGATATTGAAGGAAAACGAGAACAGTTCACGTTCGTTGGAGGAATTGATTGCTGAATTTAAGGTGAAAGGCAGAACGACGGTTTCAGATGCGATAACCGATGAAGTTGAGGATGAAAATGGAAGCAGGAGGCCGGCTTTGGAATGTATTCCTGAAGGTCTCCATTAA
Protein sequence
MGNSCFKSNKVMAQDESCLALSNSPPVEAKKVEEKPVAGSAMAKPKTAEERSGAAAGKKVVRFKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEFKVKGRTTVSDAITDEVEDENGSRRPALECIPEGLH
Homology
BLAST of Csor.00g100530 vs. NCBI nr
Match:
KAG6570883.1 (hypothetical protein SDJN03_29798, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 289 bits (740), Expect = 2.80e-98
Identity = 152/152 (100.00%), Postives = 152/152 (100.00%), Query Frame = 0
Query: 1 MGNSCFKSNKVMAQDESCLALSNSPPVEAKKVEEKPVAGSAMAKPKTAEERSGAAAGKKV 60
MGNSCFKSNKVMAQDESCLALSNSPPVEAKKVEEKPVAGSAMAKPKTAEERSGAAAGKKV
Sbjct: 1 MGNSCFKSNKVMAQDESCLALSNSPPVEAKKVEEKPVAGSAMAKPKTAEERSGAAAGKKV 60
Query: 61 VRFKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEFKVK 120
VRFKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEFKVK
Sbjct: 61 VRFKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEFKVK 120
Query: 121 GRTTVSDAITDEVEDENGSRRPALECIPEGLH 152
GRTTVSDAITDEVEDENGSRRPALECIPEGLH
Sbjct: 121 GRTTVSDAITDEVEDENGSRRPALECIPEGLH 152
BLAST of Csor.00g100530 vs. NCBI nr
Match:
XP_038902397.1 (uncharacterized protein LOC120089037 [Benincasa hispida])
HSP 1 Score: 177 bits (450), Expect = 3.82e-54
Identity = 104/150 (69.33%), Postives = 117/150 (78.00%), Query Frame = 0
Query: 1 MGNSCFKSNKVMAQDESCLALSNSPPVEAKKVEEKPVAGSAMAKPKTAEERSGAAAGKKV 60
MGN+CFKSNKVMAQDE L PP+EAKKVEEKP GSAMAKPKTAE R+G A+ KKV
Sbjct: 1 MGNNCFKSNKVMAQDEPEDLL---PPIEAKKVEEKPRPGSAMAKPKTAEARTGGAS-KKV 60
Query: 61 VRFKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEFKVK 120
VRFKLQEE+E + G G GVLRIKVVMSQ+ELKQ+LK+ EN+S +LEELI E KVK
Sbjct: 61 VRFKLQEEEEKNSGDG------GVLRIKVVMSQKELKQMLKDRENNSCTLEELITELKVK 120
Query: 121 GRTTVSDAITDEVEDENGSRRPALECIPEG 150
GRTT+SD D VEDENG +P LE IPEG
Sbjct: 121 GRTTISDGRIDAVEDENGRWKPDLEGIPEG 140
BLAST of Csor.00g100530 vs. NCBI nr
Match:
KGN63254.1 (hypothetical protein Csa_022493 [Cucumis sativus])
HSP 1 Score: 152 bits (383), Expect = 5.44e-44
Identity = 92/153 (60.13%), Postives = 107/153 (69.93%), Query Frame = 0
Query: 1 MGNSCFKSNKVMAQDESCLALSNSPP---VEAKKVEEKPVAGSAMAKPKTAEERSGAAAG 60
MGN CFKSNKVMAQD+S + PP +E KKV+++P+ GSAMAKPK +G AAG
Sbjct: 1 MGNICFKSNKVMAQDDS---YDDFPPHHLIEPKKVQQQPLPGSAMAKPKNG---TGGAAG 60
Query: 61 KKVVRFKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEF 120
KKVVRF LQEE+++ G GVLRIKVV+SQ+ELKQILK EN+S SLEELI E
Sbjct: 61 KKVVRFNLQEEEKDEEGRNSGDSGPGVLRIKVVISQKELKQILKSRENNSCSLEELIEEL 120
Query: 121 KVKGRTTVSDAITDEVEDENGSRRPALECIPEG 150
KVKGR T A DE GS +PALECIPEG
Sbjct: 121 KVKGRATTVSA------DETGSWKPALECIPEG 141
BLAST of Csor.00g100530 vs. NCBI nr
Match:
XP_022140639.1 (uncharacterized protein LOC111011249 [Momordica charantia])
HSP 1 Score: 152 bits (383), Expect = 1.73e-43
Identity = 96/153 (62.75%), Postives = 113/153 (73.86%), Query Frame = 0
Query: 1 MGNSCFKSNKVMAQDESCLALSNSPPVEAK-KVEEKPVAGSAMAKPKTAEERSGAAAGKK 60
MGN C ++N+VMAQDE+C NS E KVE+KP AGSA+A+PKT E R AA KK
Sbjct: 43 MGN-CLRNNRVMAQDEACSPSPNSSLTETNYKVEDKPAAGSALARPKTEEARI-AARRKK 102
Query: 61 VVRFKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEFKV 120
VVRF+ Q EDE SGG G GVLRIKVV+SQ+ELKQILK+ E++S +LEEL+AE K+
Sbjct: 103 VVRFQ-QREDEISGGGG------GVLRIKVVVSQKELKQILKDRESNSSTLEELLAELKM 162
Query: 121 KGRTTVSDAITDEVEDENGSRRPALECIPEGLH 152
KGRT +SDA D EDENGS RPALE IPE LH
Sbjct: 163 KGRT-ISDARADNEEDENGSWRPALESIPEDLH 185
BLAST of Csor.00g100530 vs. NCBI nr
Match:
TYK24218.1 (hypothetical protein E5676_scaffold27G00200 [Cucumis melo var. makuwa])
HSP 1 Score: 149 bits (377), Expect = 4.57e-43
Identity = 95/155 (61.29%), Postives = 111/155 (71.61%), Query Frame = 0
Query: 1 MGNSCFKSNKVMAQDESCLALSNSPP---VEAKKVEEKPVA-GSAMAKPKTAEERSGAAA 60
MGN CF++NKVMAQD+S N PP +EA+KVEE+P+ GSAMAKPK +G AA
Sbjct: 1 MGNICFRTNKVMAQDDS---YDNLPPDQFIEAEKVEEQPLRPGSAMAKPKNG---TGGAA 60
Query: 61 GKKVVRFKLQEEDENSGG-SGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIA 120
GKKVVRF LQEE+E+ + GD AGVLRIKVV+SQ+ELK+ILK EN+S SLEELI
Sbjct: 61 GKKVVRFNLQEEEEDQEDRNSGDDSGAGVLRIKVVISQKELKEILKNRENNSCSLEELIE 120
Query: 121 EFKVKGRTTVSDAITDEVEDENGSRRPALECIPEG 150
E KVKGR T V DE GS +PALECIPEG
Sbjct: 121 ELKVKGRATT-------VSDEIGSWKPALECIPEG 142
BLAST of Csor.00g100530 vs. ExPASy TrEMBL
Match:
A0A0A0LQE9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G418890 PE=4 SV=1)
HSP 1 Score: 152 bits (383), Expect = 2.63e-44
Identity = 92/153 (60.13%), Postives = 107/153 (69.93%), Query Frame = 0
Query: 1 MGNSCFKSNKVMAQDESCLALSNSPP---VEAKKVEEKPVAGSAMAKPKTAEERSGAAAG 60
MGN CFKSNKVMAQD+S + PP +E KKV+++P+ GSAMAKPK +G AAG
Sbjct: 1 MGNICFKSNKVMAQDDS---YDDFPPHHLIEPKKVQQQPLPGSAMAKPKNG---TGGAAG 60
Query: 61 KKVVRFKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEF 120
KKVVRF LQEE+++ G GVLRIKVV+SQ+ELKQILK EN+S SLEELI E
Sbjct: 61 KKVVRFNLQEEEKDEEGRNSGDSGPGVLRIKVVISQKELKQILKSRENNSCSLEELIEEL 120
Query: 121 KVKGRTTVSDAITDEVEDENGSRRPALECIPEG 150
KVKGR T A DE GS +PALECIPEG
Sbjct: 121 KVKGRATTVSA------DETGSWKPALECIPEG 141
BLAST of Csor.00g100530 vs. ExPASy TrEMBL
Match:
A0A6J1CG85 (uncharacterized protein LOC111011249 OS=Momordica charantia OX=3673 GN=LOC111011249 PE=4 SV=1)
HSP 1 Score: 152 bits (383), Expect = 8.37e-44
Identity = 96/153 (62.75%), Postives = 113/153 (73.86%), Query Frame = 0
Query: 1 MGNSCFKSNKVMAQDESCLALSNSPPVEAK-KVEEKPVAGSAMAKPKTAEERSGAAAGKK 60
MGN C ++N+VMAQDE+C NS E KVE+KP AGSA+A+PKT E R AA KK
Sbjct: 43 MGN-CLRNNRVMAQDEACSPSPNSSLTETNYKVEDKPAAGSALARPKTEEARI-AARRKK 102
Query: 61 VVRFKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEFKV 120
VVRF+ Q EDE SGG G GVLRIKVV+SQ+ELKQILK+ E++S +LEEL+AE K+
Sbjct: 103 VVRFQ-QREDEISGGGG------GVLRIKVVVSQKELKQILKDRESNSSTLEELLAELKM 162
Query: 121 KGRTTVSDAITDEVEDENGSRRPALECIPEGLH 152
KGRT +SDA D EDENGS RPALE IPE LH
Sbjct: 163 KGRT-ISDARADNEEDENGSWRPALESIPEDLH 185
BLAST of Csor.00g100530 vs. ExPASy TrEMBL
Match:
A0A5D3DKZ8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold27G00200 PE=4 SV=1)
HSP 1 Score: 149 bits (377), Expect = 2.21e-43
Identity = 95/155 (61.29%), Postives = 111/155 (71.61%), Query Frame = 0
Query: 1 MGNSCFKSNKVMAQDESCLALSNSPP---VEAKKVEEKPVA-GSAMAKPKTAEERSGAAA 60
MGN CF++NKVMAQD+S N PP +EA+KVEE+P+ GSAMAKPK +G AA
Sbjct: 1 MGNICFRTNKVMAQDDS---YDNLPPDQFIEAEKVEEQPLRPGSAMAKPKNG---TGGAA 60
Query: 61 GKKVVRFKLQEEDENSGG-SGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIA 120
GKKVVRF LQEE+E+ + GD AGVLRIKVV+SQ+ELK+ILK EN+S SLEELI
Sbjct: 61 GKKVVRFNLQEEEEDQEDRNSGDDSGAGVLRIKVVISQKELKEILKNRENNSCSLEELIE 120
Query: 121 EFKVKGRTTVSDAITDEVEDENGSRRPALECIPEG 150
E KVKGR T V DE GS +PALECIPEG
Sbjct: 121 ELKVKGRATT-------VSDEIGSWKPALECIPEG 142
BLAST of Csor.00g100530 vs. ExPASy TrEMBL
Match:
A0A7N2MA05 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1)
HSP 1 Score: 82.8 bits (203), Expect = 3.50e-17
Identity = 65/151 (43.05%), Postives = 86/151 (56.95%), Query Frame = 0
Query: 1 MGNSCFKSNKVMAQDESCLALSNSPPVEAKKVEE-KPVAGSAMAKPKTAEERSGAAAGKK 60
MGN C SNK +AQ+E P EA+ VE+ KP S + K + G KK
Sbjct: 1 MGN-CL-SNKSLAQEEEV-------PKEAEVVEQTKPSTASKLEPVKLVD--GGHKKKKK 60
Query: 61 VVRFKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSS-RSLEELIAEFK 120
VVRFKL+E+D N G S R+GV+RI+VV++Q+ELKQIL E S+E+L+
Sbjct: 61 VVRFKLEEDDTNVGTSSEGDSRSGVVRIRVVVTQKELKQILDCKEGLKYSSVEQLVNALN 120
Query: 121 VKGRTTVSDAITDEVEDENGSRRPALECIPE 149
++GR +DE E N + RPALE IPE
Sbjct: 121 LRGRKISEVRTSDEDEGINSNWRPALESIPE 140
BLAST of Csor.00g100530 vs. ExPASy TrEMBL
Match:
A0A540LQ93 (Uncharacterized protein OS=Malus baccata OX=106549 GN=C1H46_025785 PE=4 SV=1)
HSP 1 Score: 76.6 bits (187), Expect = 9.18e-15
Identity = 62/160 (38.75%), Postives = 87/160 (54.37%), Query Frame = 0
Query: 1 MGNSCFKSNKVMAQD-ESCLALSNSPPVEAKKVEEKPVAGSAMAKPKTAEERSGAAAGKK 60
MGN +NK+ +QD E A + P+EA+K +AM P ++ KK
Sbjct: 1 MGNCLRNNNKIASQDYEKHEAAKEAEPLEARK--------TAMPLPSNLKQE------KK 60
Query: 61 VVRFKLQEEDENSGG--SGGDGDRAGVLRIKVVMSQRELKQILKENENSSRS-LEELIAE 120
VRF LQE+ +NSG GGD G +RI++V++Q ELKQ+L ++S+ S LEEL+
Sbjct: 61 SVRFNLQEDHQNSGKRVDGGDSKTGGAVRIRLVVTQEELKQLLNYKKDSNHSSLEELLNA 120
Query: 121 FKVKGRTTVSDAITDEVEDENGSR----RPALECIPEGLH 152
K +G T VS+ +DE+ S RP LE IPE H
Sbjct: 121 VKSRG-TRVSEINGTSSDDESISSGSCWRPTLESIPEDQH 145
BLAST of Csor.00g100530 vs. TAIR 10
Match:
AT3G21680.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: root, flower, stamen; EXPRESSED DURING: 4 anthesis, petal differentiation and expansion stage; Has 34 Blast hits to 34 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 46.6 bits (109), Expect = 2.1e-05
Identity = 35/89 (39.33%), Postives = 53/89 (59.55%), Query Frame = 0
Query: 63 FKLQEEDENSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEFKVKGR 122
F+ +EE E S + + V+RIKVV++++EL+QIL ++N S+++L+ K GR
Sbjct: 35 FRGEEESERS-----TEEESKVVRIKVVVTKKELRQIL-GHKNGINSIQQLVHVLKDSGR 94
Query: 123 TTVSDAITDEVEDENGSR--RPALECIPE 150
+S A +E E E G RP LE IPE
Sbjct: 95 -NISMASYEEDEKEEGDENWRPTLESIPE 116
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
KAG6570883.1 | 2.80e-98 | 100.00 | hypothetical protein SDJN03_29798, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_038902397.1 | 3.82e-54 | 69.33 | uncharacterized protein LOC120089037 [Benincasa hispida] | [more] |
KGN63254.1 | 5.44e-44 | 60.13 | hypothetical protein Csa_022493 [Cucumis sativus] | [more] |
XP_022140639.1 | 1.73e-43 | 62.75 | uncharacterized protein LOC111011249 [Momordica charantia] | [more] |
TYK24218.1 | 4.57e-43 | 61.29 | hypothetical protein E5676_scaffold27G00200 [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LQE9 | 2.63e-44 | 60.13 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G418890 PE=4 SV=1 | [more] |
A0A6J1CG85 | 8.37e-44 | 62.75 | uncharacterized protein LOC111011249 OS=Momordica charantia OX=3673 GN=LOC111011... | [more] |
A0A5D3DKZ8 | 2.21e-43 | 61.29 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A7N2MA05 | 3.50e-17 | 43.05 | Uncharacterized protein OS=Quercus lobata OX=97700 PE=4 SV=1 | [more] |
A0A540LQ93 | 9.18e-15 | 38.75 | Uncharacterized protein OS=Malus baccata OX=106549 GN=C1H46_025785 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT3G21680.1 | 2.1e-05 | 39.33 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin... | [more] |