Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGTGACAATTAGTTCTAATATGGCATTGAGAACTTGTGCTCTCGTACTCCTTGGGGTGCACTTTGGAATCTATCAAACACATGGTAGAAGTTTGTTTAATGTCAATGAAGACAACTCTCTTTTTGAAAGTAAATTTAAGGCGCCCATTATAATTCATCCAAAACAAAATATCCATTTTTCACTTTCGCAACCCGAAGGAATGCCTGATTCAGATACACATTTCAGTTTTGGAATATTCTCAAAAGATGTCCGTATTCCTCCATCTGGACCAAGTCAAAGATCTTCAGACTCACCGCCTCTCCCATCCATTGTCTTACATAAGGAATCTCGAATTAACTTTGGAATATTACCAAAAGGTCGACGTATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTCTCCACCTTCCCCATTCAATGTTTTACACAAGGAATCTAGACTTAACTTTGGAATATTACCGAAAGGTATGCGTATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTATCCACCTCCTCCACCACATGCTCCTTTCCTTATTTAA
mRNA sequence
ATGGTGACAATTAGTTCTAATATGGCATTGAGAACTTGTGCTCTCGTACTCCTTGGGGTGCACTTTGGAATCTATCAAACACATGGTAGAAGTTTGTTTAATGTCAATGAAGACAACTCTCTTTTTGAAAGTAAATTTAAGGCGCCCATTATAATTCATCCAAAACAAAATATCCATTTTTCACTTTCGCAACCCGAAGGAATGCCTGATTCAGATACACATTTCAGTTTTGGAATATTCTCAAAAGATGTCCGTATTCCTCCATCTGGACCAAGTCAAAGATCTTCAGACTCACCGCCTCTCCCATCCATTGTCTTACATAAGGAATCTCGAATTAACTTTGGAATATTACCAAAAGGTCGACGTATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTCTCCACCTTCCCCATTCAATGTTTTACACAAGGAATCTAGACTTAACTTTGGAATATTACCGAAAGGTATGCGTATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTATCCACCTCCTCCACCACATGCTCCTTTCCTTATTTAA
Coding sequence (CDS)
ATGGTGACAATTAGTTCTAATATGGCATTGAGAACTTGTGCTCTCGTACTCCTTGGGGTGCACTTTGGAATCTATCAAACACATGGTAGAAGTTTGTTTAATGTCAATGAAGACAACTCTCTTTTTGAAAGTAAATTTAAGGCGCCCATTATAATTCATCCAAAACAAAATATCCATTTTTCACTTTCGCAACCCGAAGGAATGCCTGATTCAGATACACATTTCAGTTTTGGAATATTCTCAAAAGATGTCCGTATTCCTCCATCTGGACCAAGTCAAAGATCTTCAGACTCACCGCCTCTCCCATCCATTGTCTTACATAAGGAATCTCGAATTAACTTTGGAATATTACCAAAAGGTCGACGTATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTCTCCACCTTCCCCATTCAATGTTTTACACAAGGAATCTAGACTTAACTTTGGAATATTACCGAAAGGTATGCGTATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTATCCACCTCCTCCACCACATGCTCCTTTCCTTATTTAA
Protein sequence
MVTISSNMALRTCALVLLGVHFGIYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIHFSLSQPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSSDSPPLPSIVLHKESRINFGILPKGRRIPPSGPSQRFSDSPPSPFNVLHKESRLNFGILPKGMRIPPSGPSQRFSDYPPPPPHAPFLI*
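The protein sequence above is the conceptual translation of the CDS, with `*` marking the stop codon. As a minimal sketch of that relationship, assuming the standard genetic code (the page does not name a translation table) and using the hypothetical helper name `translate`, the first codons of the CDS can be checked against the start of the protein:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here;
# the page does not state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons of the CDS listed above.
print(translate("ATGGTGACAATTAGTTCTAATATG"))  # MVTISSNM
```

Passing the full CDS string to `translate` should reproduce the protein sequence, including the trailing `*`.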
Homology
BLAST of CsGy4G019040 vs. NCBI nr
Match:
KAA0043502.1 (proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK01137.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa])
HSP 1 Score: 276 bits (707), Expect = 3.14e-92
Identity = 135/183 (73.77%), Positives = 155/183 (84.70%), Query Frame = 0
Query: 1 MVTISSNMALRTCALVLLGVHFGIYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIHF 60
M++ISSNM +TCA+VLLG+HFGIYQT+GR+LF V EDNS+FE++ KA IIHPKQ IHF
Sbjct: 1 MMSISSNMTFKTCAIVLLGLHFGIYQTNGRNLFEVKEDNSVFENELKANSIIHPKQYIHF 60
Query: 61 SLSQPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSSDSPPLPSIVLHKESRINFGILPKG 120
SL++ +G P+ DTHF FGIFSKDVRIPPSGPSQRSSDSPP PS++L KESRINFGILPKG
Sbjct: 61 SLTKSKGKPEFDTHFRFGIFSKDVRIPPSGPSQRSSDSPPSPSMILPKESRINFGILPKG 120
Query: 121 RRIPPSGPSQRFSDSPPSPFNVLHKESRLNFGILPKGMRIPPSGPSQRFSDYPPPPPHAP 180
RIPPSGPSQRFSDSP P LHK S + FG+LPKG IPPSGPS+R SD PPPPPH+P
Sbjct: 121 SRIPPSGPSQRFSDSPLPPSTFLHKGSNMIFGMLPKGHHIPPSGPSKRTSDNPPPPPHSP 180
Query: 181 FLI 183
LI
Sbjct: 181 SLI 183
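The Identity and Positives percentages in each HSP header are the matching (or conservatively substituted) positions divided by the alignment length. A small sketch of that arithmetic, using the counts from the hit above (the helper name `percent` is hypothetical):

```python
def percent(matches: int, aligned_length: int) -> str:
    """Format a match count as a percentage of the alignment length."""
    return f"{100 * matches / aligned_length:.2f}"


# Counts taken from the HSP header above: 135 identical and 155 positive
# positions over an alignment length of 183.
print(percent(135, 183))  # 73.77
print(percent(155, 183))  # 84.70
```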
BLAST of CsGy4G019040 vs. NCBI nr
Match:
XP_031744003.1 (abl interactor homolog [Cucumis sativus])
HSP 1 Score: 223 bits (569), Expect = 9.93e-70
Identity = 118/172 (68.60%), Positives = 136/172 (79.07%), Query Frame = 0
Query: 8 MALRTCALVLLGVHFGIYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIHFSLSQPEG 67
MA RTC LVLLG+ +YQT+G +LF VNEDNS+FE++FKA +IIHPKQ+I FS ++ +
Sbjct: 1 MAFRTCVLVLLGL---LYQTNGINLFEVNEDNSVFENEFKAHVIIHPKQHIRFSRTKSKR 60
Query: 68 MPDSDTHFSFGIFSKDVRIPPSGPSQRSSDSPPLPSIVLHKESRINFGILPKGRRIPPSG 127
+PD DTH FGI SKD+RIPP GPSQRSSDS P PSIVLHKESR+NFGIL KG R SG
Sbjct: 61 IPDFDTHSGFGILSKDIRIPPFGPSQRSSDSTPPPSIVLHKESRMNFGILLKGVRTHSSG 120
Query: 128 PSQRFSDSPPSP--FNVLHKESRLNFGILPKGMRIPPSGPSQRFSDYPPPPP 177
SQRFSDSPPSP VLHKE R+ FGILPKG+ SGPS+RFSD PPPPP
Sbjct: 121 SSQRFSDSPPSPPPSIVLHKEPRIKFGILPKGVPTHSSGPSRRFSDSPPPPP 169
BLAST of CsGy4G019040 vs. NCBI nr
Match:
XP_031744042.1 (proline-rich receptor-like protein kinase PERK9 [Cucumis sativus])
HSP 1 Score: 223 bits (567), Expect = 6.40e-69
Identity = 117/172 (68.02%), Positives = 136/172 (79.07%), Query Frame = 0
Query: 8 MALRTCALVLLGVHFGIYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIHFSLSQPEG 67
MA RTC LVLLG+ +YQT+G +LF VNE+NS+ E++FKA +IIHPKQ+I FS ++ +
Sbjct: 1 MAFRTCVLVLLGL---LYQTNGINLFEVNEENSVLENEFKAHVIIHPKQHIRFSRTKSKR 60
Query: 68 MPDSDTHFSFGIFSKDVRIPPSGPSQRSSDSPPLPSIVLHKESRINFGILPKGRRIPPSG 127
+PD DTH FGI SK +RIPPSGPSQRSSDS P PSIVLHKES +NFGILPKG SG
Sbjct: 61 IPDFDTHSGFGILSKAIRIPPSGPSQRSSDSTPPPSIVLHKESMMNFGILPKGVPTHSSG 120
Query: 128 PSQRFSDSPPSP--FNVLHKESRLNFGILPKGMRIPPSGPSQRFSDYPPPPP 177
PSQRFSDSPPSP VLHKESR+ FGILPKG+ SGPS+RFSD PPPPP
Sbjct: 121 PSQRFSDSPPSPPPSIVLHKESRIKFGILPKGVPTHSSGPSRRFSDSPPPPP 169
BLAST of CsGy4G019040 vs. NCBI nr
Match:
KAA0040932.1 (proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK10605.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa])
HSP 1 Score: 202 bits (513), Expect = 1.28e-62
Identity = 108/191 (56.54%), Positives = 136/191 (71.20%), Query Frame = 0
Query: 1 MVTISSNMALRTCALVLLGVHFGIYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIHF 60
MV++ +N+ R CAL+ LG+HF IY T+ + F V+EDNS E+ + II HPK+ HF
Sbjct: 1 MVSVGTNIFFRACALIFLGLHFEIYLTNAKRSFVVDEDNSSLENVLRGDIINHPKKVAHF 60
Query: 61 SLSQPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSSDSPPLPS---IVLHKESRINFGIL 120
S +P+ +PD FS GI SK +R PPSG SQ +S++PP P I+LHKES +NFGIL
Sbjct: 61 SWPKPKKIPDLAPPFSLGILSKGIRTPPSGLSQGTSNNPPSPHVAPIILHKESMVNFGIL 120
Query: 121 PKGRRIPPSGPSQRFSDSPPSPFNVLH--KESRLNFGILPKGMRIPPSGPSQRFSDYPPP 180
PKG RIPPSGPS+R SD PP P + H KESR+ +GILPKG+RIPPSGPS+R SDY PP
Sbjct: 121 PKGVRIPPSGPSRRTSDPPP-PLSNFHSIKESRIKYGILPKGVRIPPSGPSKRNSDYYPP 180
Query: 181 PP---HAPFLI 183
PP HAP +I
Sbjct: 181 PPMHLHAPSII 190
BLAST of CsGy4G019040 vs. NCBI nr
Match:
XP_038882352.1 (uncharacterized protein LOC120073615 [Benincasa hispida])
HSP 1 Score: 197 bits (500), Expect = 1.29e-60
Identity = 105/192 (54.69%), Positives = 136/192 (70.83%), Query Frame = 0
Query: 1 MVTIS-SNMALRTCALVLLGVHFGIYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIH 60
MV+IS SN+ CA + LG+H IYQT +++F V + +S E++ I HPK++IH
Sbjct: 1 MVSISYSNVFFGACAFIFLGLHSTIYQTDAKNVFVVEKGDSTIENELSDENIKHPKRHIH 60
Query: 61 FSLSQPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSSDSPPLP----SIVLHKESRINFG 120
FSL + + +PD F+ GI SK+ R+PPSG SQ +SD+PP P SI+LHKESRINF
Sbjct: 61 FSLGKYKRIPDLAPPFNLGILSKEERVPPSGLSQSTSDNPPPPPHVISIILHKESRINFR 120
Query: 121 ILPKGRRIPPSGPSQRFSDSPPSPFN----VLHKESRLNFGILPKGMRIPPSGPSQRFSD 180
+L KG RIPPSGPSQR S+SPP P + +LHK+ +NFGILPK M IPPSGPS+RFS+
Sbjct: 121 VLSKGNRIPPSGPSQRTSESPPPPPHALSVILHKKPGINFGILPKSMHIPPSGPSKRFSN 180
Query: 181 YPPPPPHAPFLI 183
YP PP HAP +I
Sbjct: 181 YPSPPTHAPSVI 192
BLAST of CsGy4G019040 vs. ExPASy TrEMBL
Match:
A0A5D3BP86 (Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold109G00050 PE=4 SV=1)
HSP 1 Score: 276 bits (707), Expect = 1.52e-92
Identity = 135/183 (73.77%), Positives = 155/183 (84.70%), Query Frame = 0
Query: 1 MVTISSNMALRTCALVLLGVHFGIYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIHF 60
M++ISSNM +TCA+VLLG+HFGIYQT+GR+LF V EDNS+FE++ KA IIHPKQ IHF
Sbjct: 1 MMSISSNMTFKTCAIVLLGLHFGIYQTNGRNLFEVKEDNSVFENELKANSIIHPKQYIHF 60
Query: 61 SLSQPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSSDSPPLPSIVLHKESRINFGILPKG 120
SL++ +G P+ DTHF FGIFSKDVRIPPSGPSQRSSDSPP PS++L KESRINFGILPKG
Sbjct: 61 SLTKSKGKPEFDTHFRFGIFSKDVRIPPSGPSQRSSDSPPSPSMILPKESRINFGILPKG 120
Query: 121 RRIPPSGPSQRFSDSPPSPFNVLHKESRLNFGILPKGMRIPPSGPSQRFSDYPPPPPHAP 180
RIPPSGPSQRFSDSP P LHK S + FG+LPKG IPPSGPS+R SD PPPPPH+P
Sbjct: 121 SRIPPSGPSQRFSDSPLPPSTFLHKGSNMIFGMLPKGHHIPPSGPSKRTSDNPPPPPHSP 180
Query: 181 FLI 183
LI
Sbjct: 181 SLI 183
BLAST of CsGy4G019040 vs. ExPASy TrEMBL
Match:
A0A5D3CGG1 (Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold315G00020 PE=4 SV=1)
HSP 1 Score: 202 bits (513), Expect = 6.22e-63
Identity = 108/191 (56.54%), Positives = 136/191 (71.20%), Query Frame = 0
Query: 1 MVTISSNMALRTCALVLLGVHFGIYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIHF 60
MV++ +N+ R CAL+ LG+HF IY T+ + F V+EDNS E+ + II HPK+ HF
Sbjct: 1 MVSVGTNIFFRACALIFLGLHFEIYLTNAKRSFVVDEDNSSLENVLRGDIINHPKKVAHF 60
Query: 61 SLSQPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSSDSPPLPS---IVLHKESRINFGIL 120
S +P+ +PD FS GI SK +R PPSG SQ +S++PP P I+LHKES +NFGIL
Sbjct: 61 SWPKPKKIPDLAPPFSLGILSKGIRTPPSGLSQGTSNNPPSPHVAPIILHKESMVNFGIL 120
Query: 121 PKGRRIPPSGPSQRFSDSPPSPFNVLH--KESRLNFGILPKGMRIPPSGPSQRFSDYPPP 180
PKG RIPPSGPS+R SD PP P + H KESR+ +GILPKG+RIPPSGPS+R SDY PP
Sbjct: 121 PKGVRIPPSGPSRRTSDPPP-PLSNFHSIKESRIKYGILPKGVRIPPSGPSKRNSDYYPP 180
Query: 181 PP---HAPFLI 183
PP HAP +I
Sbjct: 181 PPMHLHAPSII 190
BLAST of CsGy4G019040 vs. ExPASy TrEMBL
Match:
A0A6J1HRH7 (actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3661 GN=LOC111467141 PE=4 SV=1)
HSP 1 Score: 142 bits (358), Expect = 1.54e-36
Identity = 82/174 (47.13%), Positives = 113/174 (64.94%), Query Frame = 0
Query: 24 IYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIHFSLSQPEGMPDSDTHFS------F 83
IY+T +++ V+ D+++ E++ I+ +PK S+S + H + F
Sbjct: 185 IYETDAKNVVEVDVDDNIREAELTDLIVQNPKG---ISISSSRSSQRTLFHRTPSPSLVF 244
Query: 84 GIFSKDVRIPPSGPSQRSSDSPPLP----SIVLHKESRINFGILPKGRRIPPSGPSQRFS 143
G+ K V IPPS PSQR+SD PP P SI+L+K S+IN G+LP+G IPPSGPSQR S
Sbjct: 245 GMLPKGVPIPPSRPSQRTSDYPPPPPRASSIILNKHSKINLGMLPRGVPIPPSGPSQRTS 304
Query: 144 DSPPSPFN----VLHKESRLNFGILPKGMRIPPSGPSQRFSDYPPPPPHAPFLI 183
D PP P + +L+K+S++NFG+LPKG+ IPPSGPSQR S YPPPPP A +I
Sbjct: 305 DYPPPPPHASSVILNKQSKINFGMLPKGVPIPPSGPSQRTSXYPPPPPRASSVI 355
BLAST of CsGy4G019040 vs. ExPASy TrEMBL
Match:
A0A5A7T8R0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold36G001740 PE=4 SV=1)
HSP 1 Score: 113 bits (283), Expect = 1.19e-27
Identity = 79/231 (34.20%), Positives = 107/231 (46.32%), Query Frame = 0
Query: 1 MVTISSNMALRTCALVLLGVHFGIYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIHF 60
M++ SSN+ TCAL+LLG+HF Y RS E N + E++ +I HPK++I F
Sbjct: 8 MMSTSSNVLFGTCALILLGLHFETYHIDARSFSVFGEGNYVIENQLTDHVIEHPKRHIRF 67
Query: 61 SLSQPEGMPDSDTHFSFGIFSKDVRIPPSG-----------------------PSQRSSD 120
S +P+ +PD FS GI +KD+ PPS P +
Sbjct: 68 SWPKPKRVPDLAPPFSLGILTKDILTPPSENELGDNIVKHPKSHIRFSWVWPKPKRVPDL 127
Query: 121 SPPLP-------------------SIVLHKESRINF-----------------GILPKGR 168
+PP SI H ++ I F GIL K
Sbjct: 128 APPFSLGILNNHLRNPPSENELGHSITKHPKNHIRFSWPKPKRVPDLAPPFSLGILLKDV 187
BLAST of CsGy4G019040 vs. ExPASy TrEMBL
Match:
A0A5D3BVE5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2454G00110 PE=4 SV=1)
HSP 1 Score: 111 bits (278), Expect = 5.60e-27
Identity = 81/231 (35.06%), Positives = 104/231 (45.02%), Query Frame = 0
Query: 1 MVTISSNMALRTCALVLLGVHFGIYQTHGRSLFNVNEDNSLFESKFKAPIIIHPKQNIHF 60
M++ SSN+ CAL+LLG HF Y R V E N + E++ +I HPK++I F
Sbjct: 1 MMSTSSNVLFGACALILLGFHFETYHIDARRFSVVGEGNYVIENQLTDHVIEHPKRHIRF 60
Query: 61 SLSQPEGMPDSDTHFSFGIFSKDVRIPPSG-----------------------PSQRSSD 120
S +P+ +PD FS GI +KDV PPS P +
Sbjct: 61 SWPKPKRVPDLAPPFSLGILTKDVLTPPSENEVGDNIVKHPKNHIRFSWIWPKPKRVPDL 120
Query: 121 SPPLP-------------------SIVLHKESRINF-----------------GILPKGR 168
+PP SI H +S I F GIL K
Sbjct: 121 APPFSLGILNNHLRTPPSENELGHSITKHPKSHIRFSWPKPKRVPDLAPPFSLGILSKDV 180
BLAST of CsGy4G019040 vs. TAIR 10
Match:
AT1G68690.1 (Protein kinase superfamily protein)
HSP 1 Score: 44.3 bits (103), Expect = 1.2e-04
Identity = 34/96 (35.42%), Positives = 42/96 (43.75%), Query Frame = 0
Query: 82 KDVRIPPSGPSQRSSDSPPLPSIVLHKESRINFGILPKGRRIPPSGPSQRFSDSPPSPFN 141
+ ++ PP PS R + SPP PS PPS PS+R + SPPSP +
Sbjct: 150 RPIQSPPPPPSDRPTQSPPPPS--------------------PPSPPSERPTQSPPSPPS 209
Query: 142 VLHKESRLNFGILPKGMRIPPSGPSQRFSDYPPPPP 178
+S PPS PS R S PPPPP
Sbjct: 210 ERPTQS--------PPPPSPPSPPSDRPSQSPPPPP 217
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
KAA0043502.1 | 3.14e-92 | 73.77 | proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK0113...
XP_031744003.1 | 9.93e-70 | 68.60 | abl interactor homolog [Cucumis sativus]
XP_031744042.1 | 6.40e-69 | 68.02 | proline-rich receptor-like protein kinase PERK9 [Cucumis sativus]
KAA0040932.1 | 1.28e-62 | 56.54 | proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK1060...
XP_038882352.1 | 1.29e-60 | 54.69 | uncharacterized protein LOC120073615 [Benincasa hispida]
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3BP86 | 1.52e-92 | 73.77 | Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194...
A0A5D3CGG1 | 6.22e-63 | 56.54 | Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194...
A0A6J1HRH7 | 1.54e-36 | 47.13 | actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3...
A0A5A7T8R0 | 1.19e-27 | 34.20 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3BVE5 | 5.60e-27 | 35.06 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G68690.1 | 1.2e-04 | 35.42 | Protein kinase superfamily protein
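The summary rows above are pipe-delimited, so they can be read into records and filtered by E-value. The following is a rough sketch only: the helper name `parse_row` and the `1e-50` cutoff are illustrative assumptions, not part of the page, and just three rows are copied in for brevity.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float
    description: str


def parse_row(line: str) -> Hit:
    """Split one summary row on '|' and keep the first four fields."""
    parts = [field.strip() for field in line.split("|")]
    name, evalue, identity, description = parts[:4]
    return Hit(name, float(evalue), float(identity), description)


# Three rows copied from the summary tables above.
ROWS = [
    "KAA0043502.1 | 3.14e-92 | 73.77 | proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK0113...",
    "XP_038882352.1 | 1.29e-60 | 54.69 | uncharacterized protein LOC120073615 [Benincasa hispida]",
    "AT1G68690.1 | 1.2e-04 | 35.42 | Protein kinase superfamily protein",
]

hits = [parse_row(row) for row in ROWS]

# Hypothetical cutoff, chosen only to illustrate filtering.
strong = [hit.name for hit in hits if hit.evalue <= 1e-50]
print(strong)  # ['KAA0043502.1', 'XP_038882352.1']
```

Because only the first four fields are kept, any extra trailing columns in a row are ignored.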