Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGGAATAATTGCTTCAAAAGCAACAAAGTGATGGCGCAAGACGAGCCTGATGATCTTTTGCCTCCTACCGAAGTTAAGAAAGTTGAGGAAAAACCGCCGGCTGGATCGGCAATGGCGAAGCCGAAGACGGCAGAGGCGAGAGCCGGTGGTGCGAGTAAGAAGGTAGTGAGGTTTAAGCTACAAGAAGAAGAGGAGAAAAATTCCGGCGGAAGTGGCAGCGATGCTGGAGTACTGAGGATTAAAGTGGTGATGTCTCAGAAAGAGTTGAAACAGATGTTGACGGATAGAGAGAACAATTCGTGTACATTGGAGGAATTGATTGCTGAATTGAAGGTGAGAGGCAGAACGACGATTTCAGATGCGAGAATCGATGAAGTTGAAGATGAAAATGGAAGCTGGAAGCCGGATCTGGAATCTATTCCTGAAGGTCTCCATTAA
mRNA sequence
ATGGGGAATAATTGCTTCAAAAGCAACAAAGTGATGGCGCAAGACGAGCCTGATGATCTTTTGCCTCCTACCGAAGTTAAGAAAGTTGAGGAAAAACCGCCGGCTGGATCGGCAATGGCGAAGCCGAAGACGGCAGAGGCGAGAGCCGGTGGTGCGAGTAAGAAGGTAGTGAGGTTTAAGCTACAAGAAGAAGAGGAGAAAAATTCCGGCGGAAGTGGCAGCGATGCTGGAGTACTGAGGATTAAAGTGGTGATGTCTCAGAAAGAGTTGAAACAGATGTTGACGGATAGAGAGAACAATTCGTGTACATTGGAGGAATTGATTGCTGAATTGAAGGTGAGAGGCAGAACGACGATTTCAGATGCGAGAATCGATGAAGTTGAAGATGAAAATGGAAGCTGGAAGCCGGATCTGGAATCTATTCCTGAAGGTCTCCATTAA
Coding sequence (CDS)
ATGGGGAATAATTGCTTCAAAAGCAACAAAGTGATGGCGCAAGACGAGCCTGATGATCTTTTGCCTCCTACCGAAGTTAAGAAAGTTGAGGAAAAACCGCCGGCTGGATCGGCAATGGCGAAGCCGAAGACGGCAGAGGCGAGAGCCGGTGGTGCGAGTAAGAAGGTAGTGAGGTTTAAGCTACAAGAAGAAGAGGAGAAAAATTCCGGCGGAAGTGGCAGCGATGCTGGAGTACTGAGGATTAAAGTGGTGATGTCTCAGAAAGAGTTGAAACAGATGTTGACGGATAGAGAGAACAATTCGTGTACATTGGAGGAATTGATTGCTGAATTGAAGGTGAGAGGCAGAACGACGATTTCAGATGCGAGAATCGATGAAGTTGAAGATGAAAATGGAAGCTGGAAGCCGGATCTGGAATCTATTCCTGAAGGTCTCCATTAA
Protein sequence
MGNNCFKSNKVMAQDEPDDLLPPTEVKKVEEKPPAGSAMAKPKTAEARAGGASKKVVRFKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKVRGRTTISDARIDEVEDENGSWKPDLESIPEGLH
Homology
BLAST of CaUC02G035540.1 vs. NCBI nr
Match:
XP_038902397.1 (uncharacterized protein LOC120089037 [Benincasa hispida])
HSP 1 Score: 238.0 bits (606), Expect = 5.0e-59
Identity = 126/144 (87.50%), Postives = 128/144 (88.89%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLLPPTEVKKVEEKPPAGSAMAKPKTAEARAGGASKKVVRFK 60
MGNNCFKSNKVMAQDEP+DLLPP E KKVEEKP GSAMAKPKTAEAR GGASKKVVRFK
Sbjct: 1 MGNNCFKSNKVMAQDEPEDLLPPIEAKKVEEKPRPGSAMAKPKTAEARTGGASKKVVRFK 60
Query: 61 LQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKVRGRTTIS 120
LQEEEEKNSG D GVLRIKVVMSQKELKQML DRENNSCTLEELI ELKV+GRTTIS
Sbjct: 61 LQEEEEKNSG----DGGVLRIKVVMSQKELKQMLKDRENNSCTLEELITELKVKGRTTIS 120
Query: 121 DARIDEVEDENGSWKPDLESIPEG 145
D RID VEDENG WKPDLE IPEG
Sbjct: 121 DGRIDAVEDENGRWKPDLEGIPEG 140
BLAST of CaUC02G035540.1 vs. NCBI nr
Match:
KAG6570883.1 (hypothetical protein SDJN03_29798, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 194.1 bits (492), Expect = 8.3e-46
Identity = 114/153 (74.51%), Postives = 126/153 (82.35%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLL---PPTEVKKVEEKPPAGSAMAKPKTAEARAGGAS-KKV 60
MGN+CFKSNKVMAQDE L PP E KKVEEKP AGSAMAKPKTAE R+G A+ KKV
Sbjct: 1 MGNSCFKSNKVMAQDESCLALSNSPPVEAKKVEEKPVAGSAMAKPKTAEERSGAAAGKKV 60
Query: 61 VRFKLQEEEEKNSGGSGSD---AGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKV 120
VRFKLQEE+E NSGGSG D AGVLRIKVVMSQ+ELKQ+L + EN+S +LEELIAE KV
Sbjct: 61 VRFKLQEEDE-NSGGSGGDGDRAGVLRIKVVMSQRELKQILKENENSSRSLEELIAEFKV 120
Query: 121 RGRTTISDARIDEVEDENGSWKPDLESIPEGLH 147
+GRTT+SDA DEVEDENGS +P LE IPEGLH
Sbjct: 121 KGRTTVSDAITDEVEDENGSRRPALECIPEGLH 152
BLAST of CaUC02G035540.1 vs. NCBI nr
Match:
XP_022140639.1 (uncharacterized protein LOC111011249 [Momordica charantia])
HSP 1 Score: 158.3 bits (399), Expect = 5.0e-35
Identity = 95/148 (64.19%), Postives = 112/148 (75.68%), Query Frame = 0
Query: 4 NCFKSNKVMAQDE-----PDDLLPPTEVKKVEEKPPAGSAMAKPKTAEARAGGASKKVVR 63
NC ++N+VMAQDE P+ L T KVE+KP AGSA+A+PKT EAR KKVVR
Sbjct: 45 NCLRNNRVMAQDEACSPSPNSSLTETNY-KVEDKPAAGSALARPKTEEARIAARRKKVVR 104
Query: 64 FKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKVRGRTT 123
F Q+ E++ SGG G GVLRIKVV+SQKELKQ+L DRE+NS TLEEL+AELK++GR T
Sbjct: 105 F--QQREDEISGGGG---GVLRIKVVVSQKELKQILKDRESNSSTLEELLAELKMKGR-T 164
Query: 124 ISDARIDEVEDENGSWKPDLESIPEGLH 147
ISDAR D EDENGSW+P LESIPE LH
Sbjct: 165 ISDARADNEEDENGSWRPALESIPEDLH 185
BLAST of CaUC02G035540.1 vs. NCBI nr
Match:
KGN63254.1 (hypothetical protein Csa_022493 [Cucumis sativus])
HSP 1 Score: 153.3 bits (386), Expect = 1.6e-33
Identity = 96/151 (63.58%), Postives = 106/151 (70.20%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKPPAGSAMAKPKTAEARAGGASKKVV 60
MGN CFKSNKVMAQD+ D PP E KKV+++P GSAMAKPK G A KKVV
Sbjct: 1 MGNICFKSNKVMAQDDSYDDFPPHHLIEPKKVQQQPLPGSAMAKPK--NGTGGAAGKKVV 60
Query: 61 RFKLQEEEE----KNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKV 120
RF LQEEE+ +NSG SG GVLRIKVV+SQKELKQ+L RENNSC+LEELI ELKV
Sbjct: 61 RFNLQEEEKDEEGRNSGDSG--PGVLRIKVVISQKELKQILKSRENNSCSLEELIEELKV 120
Query: 121 RGRTTISDARIDEVEDENGSWKPDLESIPEG 145
+GR T A DE GSWKP LE IPEG
Sbjct: 121 KGRATTVSA------DETGSWKPALECIPEG 141
BLAST of CaUC02G035540.1 vs. NCBI nr
Match:
TYK24218.1 (hypothetical protein E5676_scaffold27G00200 [Cucumis melo var. makuwa])
HSP 1 Score: 149.8 bits (377), Expect = 1.8e-32
Identity = 94/151 (62.25%), Postives = 106/151 (70.20%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKP-PAGSAMAKPKTAEARAGGASKKV 60
MGN CF++NKVMAQD+ D LPP E +KVEE+P GSAMAKPK G A KKV
Sbjct: 1 MGNICFRTNKVMAQDDSYDNLPPDQFIEAEKVEEQPLRPGSAMAKPK--NGTGGAAGKKV 60
Query: 61 VRFKLQEEE---EKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKV 120
VRF LQEEE E + G S AGVLRIKVV+SQKELK++L +RENNSC+LEELI ELKV
Sbjct: 61 VRFNLQEEEEDQEDRNSGDDSGAGVLRIKVVISQKELKEILKNRENNSCSLEELIEELKV 120
Query: 121 RGRTTISDARIDEVEDENGSWKPDLESIPEG 145
+GR T V DE GSWKP LE IPEG
Sbjct: 121 KGRAT-------TVSDEIGSWKPALECIPEG 142
BLAST of CaUC02G035540.1 vs. ExPASy TrEMBL
Match:
A0A6J1CG85 (uncharacterized protein LOC111011249 OS=Momordica charantia OX=3673 GN=LOC111011249 PE=4 SV=1)
HSP 1 Score: 158.3 bits (399), Expect = 2.4e-35
Identity = 95/148 (64.19%), Postives = 112/148 (75.68%), Query Frame = 0
Query: 4 NCFKSNKVMAQDE-----PDDLLPPTEVKKVEEKPPAGSAMAKPKTAEARAGGASKKVVR 63
NC ++N+VMAQDE P+ L T KVE+KP AGSA+A+PKT EAR KKVVR
Sbjct: 45 NCLRNNRVMAQDEACSPSPNSSLTETNY-KVEDKPAAGSALARPKTEEARIAARRKKVVR 104
Query: 64 FKLQEEEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKVRGRTT 123
F Q+ E++ SGG G GVLRIKVV+SQKELKQ+L DRE+NS TLEEL+AELK++GR T
Sbjct: 105 F--QQREDEISGGGG---GVLRIKVVVSQKELKQILKDRESNSSTLEELLAELKMKGR-T 164
Query: 124 ISDARIDEVEDENGSWKPDLESIPEGLH 147
ISDAR D EDENGSW+P LESIPE LH
Sbjct: 165 ISDARADNEEDENGSWRPALESIPEDLH 185
BLAST of CaUC02G035540.1 vs. ExPASy TrEMBL
Match:
A0A0A0LQE9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G418890 PE=4 SV=1)
HSP 1 Score: 153.3 bits (386), Expect = 7.8e-34
Identity = 96/151 (63.58%), Postives = 106/151 (70.20%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKPPAGSAMAKPKTAEARAGGASKKVV 60
MGN CFKSNKVMAQD+ D PP E KKV+++P GSAMAKPK G A KKVV
Sbjct: 1 MGNICFKSNKVMAQDDSYDDFPPHHLIEPKKVQQQPLPGSAMAKPK--NGTGGAAGKKVV 60
Query: 61 RFKLQEEEE----KNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKV 120
RF LQEEE+ +NSG SG GVLRIKVV+SQKELKQ+L RENNSC+LEELI ELKV
Sbjct: 61 RFNLQEEEKDEEGRNSGDSG--PGVLRIKVVISQKELKQILKSRENNSCSLEELIEELKV 120
Query: 121 RGRTTISDARIDEVEDENGSWKPDLESIPEG 145
+GR T A DE GSWKP LE IPEG
Sbjct: 121 KGRATTVSA------DETGSWKPALECIPEG 141
BLAST of CaUC02G035540.1 vs. ExPASy TrEMBL
Match:
A0A5D3DKZ8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold27G00200 PE=4 SV=1)
HSP 1 Score: 149.8 bits (377), Expect = 8.7e-33
Identity = 94/151 (62.25%), Postives = 106/151 (70.20%), Query Frame = 0
Query: 1 MGNNCFKSNKVMAQDEPDDLLPP---TEVKKVEEKP-PAGSAMAKPKTAEARAGGASKKV 60
MGN CF++NKVMAQD+ D LPP E +KVEE+P GSAMAKPK G A KKV
Sbjct: 1 MGNICFRTNKVMAQDDSYDNLPPDQFIEAEKVEEQPLRPGSAMAKPK--NGTGGAAGKKV 60
Query: 61 VRFKLQEEE---EKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKV 120
VRF LQEEE E + G S AGVLRIKVV+SQKELK++L +RENNSC+LEELI ELKV
Sbjct: 61 VRFNLQEEEEDQEDRNSGDDSGAGVLRIKVVISQKELKEILKNRENNSCSLEELIEELKV 120
Query: 121 RGRTTISDARIDEVEDENGSWKPDLESIPEG 145
+GR T V DE GSWKP LE IPEG
Sbjct: 121 KGRAT-------TVSDEIGSWKPALECIPEG 142
BLAST of CaUC02G035540.1 vs. ExPASy TrEMBL
Match:
A0A6J1B1M6 (uncharacterized protein LOC110423293 OS=Herrania umbratica OX=108875 GN=LOC110423293 PE=4 SV=1)
HSP 1 Score: 79.0 bits (193), Expect = 1.9e-11
Identity = 57/143 (39.86%), Postives = 82/143 (57.34%), Query Frame = 0
Query: 4 NCFKSNKVMAQ-DEPDDLLPPTEVKKVEEKPPAGSAMAKPKTAEARAGGASKKVVRFKLQ 63
NC SNK++AQ D+P+ EV + K A A+ KK+VRFKL
Sbjct: 3 NCLTSNKIVAQNDQPEPQGCRAEVIEETGKVTASKLERAEVAADEGEKVKKKKMVRFKLN 62
Query: 64 EEEEKNSGGSG-SDAGVLRIKVVMSQKELKQMLTDREN-NSCTLEELIAELKVRGRTTIS 123
EE + + G G S GV+RI++V++QKELKQ+L+ RE+ +LE LI +K+RG
Sbjct: 63 EENDVDGGRQGESKDGVVRIRLVVTQKELKQILSSREDLKHTSLEGLIRVMKLRGVRISE 122
Query: 124 DARIDEVEDENGSWKPDLESIPE 144
R ++ + +G W+P LESIPE
Sbjct: 123 GGRTNDDDGFHGGWRPALESIPE 145
BLAST of CaUC02G035540.1 vs. ExPASy TrEMBL
Match:
A0A4V3WQ11 (Uncharacterized protein OS=Camellia sinensis var. sinensis OX=542762 GN=TEA_029836 PE=4 SV=1)
HSP 1 Score: 78.6 bits (192), Expect = 2.5e-11
Identity = 56/149 (37.58%), Postives = 83/149 (55.70%), Query Frame = 0
Query: 4 NCFKSNKVMAQDEPDDLLPPTEVKKVEEKPPAGSAMAKPKTAEARAGGASKKVVRFKL-- 63
NC SNK++ QDE D+ P E + +E + G KK VRFKL
Sbjct: 3 NCVTSNKILGQDEKDE--QPREERAIER-------------SVRHVDGGKKKSVRFKLHE 62
Query: 64 -QEEEEKNSGGSG------SDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKVR 123
+EEEE+ G+G S G +RI+VV++Q+EL ++L + S ++E+++ E+K++
Sbjct: 63 EEEEEEEEEDGNGEERQGCSKGGAVRIRVVVTQRELIRILNTKSKYS-SVEQMLGEMKLK 122
Query: 124 GRTTISDARIDEVEDENGSWKPDLESIPE 144
R IS R + E NGSW+P LESIPE
Sbjct: 123 SR-KISQIRSSDDEGTNGSWRPALESIPE 134
BLAST of CaUC02G035540.1 vs. TAIR 10
Match:
AT3G21680.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: cellular_component unknown; EXPRESSED IN: root, flower, stamen; EXPRESSED DURING: 4 anthesis, petal differentiation and expansion stage; Has 34 Blast hits to 34 proteins in 7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 57.4 bits (137), Expect = 1.1e-08
Identity = 50/142 (35.21%), Postives = 78/142 (54.93%), Query Frame = 0
Query: 4 NCFKSNKVMAQDEPDDLLPPTEVKKVEEKPPAGSAMAKPKTAEARAGGASKKVVRFKLQE 63
NC + + +A+ E DDL P VK +EE KT+ F+ +E
Sbjct: 3 NCLRHDNGVARKEKDDLDPEPLVKLLEE----------GKTS-------------FRGEE 62
Query: 64 EEEKNSGGSGSDAGVLRIKVVMSQKELKQMLTDRENNSCTLEELIAELKVRGRTTISDAR 123
E E++ + ++ V+RIKVV+++KEL+Q+L +N ++++L+ LK GR IS A
Sbjct: 63 ESERS---TEEESKVVRIKVVVTKKELRQIL-GHKNGINSIQQLVHVLKDSGR-NISMAS 116
Query: 124 IDEVEDENG--SWKPDLESIPE 144
+E E E G +W+P LESIPE
Sbjct: 123 YEEDEKEEGDENWRPTLESIPE 116
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038902397.1 | 5.0e-59 | 87.50 | uncharacterized protein LOC120089037 [Benincasa hispida] | [more] |
KAG6570883.1 | 8.3e-46 | 74.51 | hypothetical protein SDJN03_29798, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022140639.1 | 5.0e-35 | 64.19 | uncharacterized protein LOC111011249 [Momordica charantia] | [more] |
KGN63254.1 | 1.6e-33 | 63.58 | hypothetical protein Csa_022493 [Cucumis sativus] | [more] |
TYK24218.1 | 1.8e-32 | 62.25 | hypothetical protein E5676_scaffold27G00200 [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CG85 | 2.4e-35 | 64.19 | uncharacterized protein LOC111011249 OS=Momordica charantia OX=3673 GN=LOC111011... | [more] |
A0A0A0LQE9 | 7.8e-34 | 63.58 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G418890 PE=4 SV=1 | [more] |
A0A5D3DKZ8 | 8.7e-33 | 62.25 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A6J1B1M6 | 1.9e-11 | 39.86 | uncharacterized protein LOC110423293 OS=Herrania umbratica OX=108875 GN=LOC11042... | [more] |
A0A4V3WQ11 | 2.5e-11 | 37.58 | Uncharacterized protein OS=Camellia sinensis var. sinensis OX=542762 GN=TEA_0298... | [more] |
Match Name | E-value | Identity | Description | |
AT3G21680.1 | 1.1e-08 | 35.21 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-termin... | [more] |