Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTGACAATTATTAGTTCTAATATGGCATTGAGAACTTGTGCTCTCGTACTCCTTGGGGTGCACTTTGGAATCTATCAAACACATGGTAGAAGTTTGTTTGATGTCAATGAAGACAACTCTCTTTTTGAAAGTAAATTTAAGGCGCCCATTATAATTCATCCAAAACAACATATCCATTTTTCACTTTCGAAACCCGAAGGAATGCCTGATTCAGATACACATTTCAGTTTTGGAATATTTTCAAAAGATGTCCGTATTCCTCCATCTGGACCAAGTCAAAGATCTTTAGACTCACCGCCTCTCCCATCCATTGTCTTACATAAGGAATCTCGAATTAACTTTGGAATATTACCAAAAGGTCGACGTATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTCTCCACCTTCCCCATTCAATGTTTTACACAAGGAATCTAGGCTTAACTTTGGAATATTACCGAAAGGTATGCATATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTATCCACCTCCTCCACCACATGCTCCTTTCCTTATTTAA
mRNA sequence
ATGGTGACAATTATTAGTTCTAATATGGCATTGAGAACTTGTGCTCTCGTACTCCTTGGGGTGCACTTTGGAATCTATCAAACACATGGTAGAAGTTTGTTTGATGTCAATGAAGACAACTCTCTTTTTGAAAGTAAATTTAAGGCGCCCATTATAATTCATCCAAAACAACATATCCATTTTTCACTTTCGAAACCCGAAGGAATGCCTGATTCAGATACACATTTCAGTTTTGGAATATTTTCAAAAGATGTCCGTATTCCTCCATCTGGACCAAGTCAAAGATCTTTAGACTCACCGCCTCTCCCATCCATTGTCTTACATAAGGAATCTCGAATTAACTTTGGAATATTACCAAAAGGTCGACGTATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTCTCCACCTTCCCCATTCAATGTTTTACACAAGGAATCTAGGCTTAACTTTGGAATATTACCGAAAGGTATGCATATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTATCCACCTCCTCCACCACATGCTCCTTTCCTTATTTAA
Coding sequence (CDS)
ATGGTGACAATTATTAGTTCTAATATGGCATTGAGAACTTGTGCTCTCGTACTCCTTGGGGTGCACTTTGGAATCTATCAAACACATGGTAGAAGTTTGTTTGATGTCAATGAAGACAACTCTCTTTTTGAAAGTAAATTTAAGGCGCCCATTATAATTCATCCAAAACAACATATCCATTTTTCACTTTCGAAACCCGAAGGAATGCCTGATTCAGATACACATTTCAGTTTTGGAATATTTTCAAAAGATGTCCGTATTCCTCCATCTGGACCAAGTCAAAGATCTTTAGACTCACCGCCTCTCCCATCCATTGTCTTACATAAGGAATCTCGAATTAACTTTGGAATATTACCAAAAGGTCGACGTATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTCTCCACCTTCCCCATTCAATGTTTTACACAAGGAATCTAGGCTTAACTTTGGAATATTACCGAAAGGTATGCATATTCCTCCATCTGGGCCAAGTCAAAGATTTTCAGACTATCCACCTCCTCCACCACATGCTCCTTTCCTTATTTAA
Protein sequence
MVTIISSNMALRTCALVLLGVHFGIYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIHFSLSKPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSLDSPPLPSIVLHKESRINFGILPKGRRIPPSGPSQRFSDSPPSPFNVLHKESRLNFGILPKGMHIPPSGPSQRFSDYPPPPPHAPFLI*
Homology
BLAST of CsGy4G019020 vs. NCBI nr
Match:
KAA0043502.1 (proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK01137.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa])
HSP 1 Score: 278 bits (710), Expect = 1.14e-92
Identity = 135/180 (75.00%), Postives = 154/180 (85.56%), Query Frame = 0
Query: 5 ISSNMALRTCALVLLGVHFGIYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIHFSLS 64
ISSNM +TCA+VLLG+HFGIYQT+GR+LF+V EDNS+FE++ KA IIHPKQ+IHFSL+
Sbjct: 4 ISSNMTFKTCAIVLLGLHFGIYQTNGRNLFEVKEDNSVFENELKANSIIHPKQYIHFSLT 63
Query: 65 KPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSLDSPPLPSIVLHKESRINFGILPKGRRI 124
K +G P+ DTHF FGIFSKDVRIPPSGPSQRS DSPP PS++L KESRINFGILPKG RI
Sbjct: 64 KSKGKPEFDTHFRFGIFSKDVRIPPSGPSQRSSDSPPSPSMILPKESRINFGILPKGSRI 123
Query: 125 PPSGPSQRFSDSPPSPFNVLHKESRLNFGILPKGMHIPPSGPSQRFSDYPPPPPHAPFLI 184
PPSGPSQRFSDSP P LHK S + FG+LPKG HIPPSGPS+R SD PPPPPH+P LI
Sbjct: 124 PPSGPSQRFSDSPLPPSTFLHKGSNMIFGMLPKGHHIPPSGPSKRTSDNPPPPPHSPSLI 183
BLAST of CsGy4G019020 vs. NCBI nr
Match:
XP_031744003.1 (abl interactor homolog [Cucumis sativus])
HSP 1 Score: 224 bits (571), Expect = 5.14e-70
Identity = 119/172 (69.19%), Postives = 136/172 (79.07%), Query Frame = 0
Query: 9 MALRTCALVLLGVHFGIYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIHFSLSKPEG 68
MA RTC LVLLG+ +YQT+G +LF+VNEDNS+FE++FKA +IIHPKQHI FS +K +
Sbjct: 1 MAFRTCVLVLLGL---LYQTNGINLFEVNEDNSVFENEFKAHVIIHPKQHIRFSRTKSKR 60
Query: 69 MPDSDTHFSFGIFSKDVRIPPSGPSQRSLDSPPLPSIVLHKESRINFGILPKGRRIPPSG 128
+PD DTH FGI SKD+RIPP GPSQRS DS P PSIVLHKESR+NFGIL KG R SG
Sbjct: 61 IPDFDTHSGFGILSKDIRIPPFGPSQRSSDSTPPPSIVLHKESRMNFGILLKGVRTHSSG 120
Query: 129 PSQRFSDSPPSP--FNVLHKESRLNFGILPKGMHIPPSGPSQRFSDYPPPPP 178
SQRFSDSPPSP VLHKE R+ FGILPKG+ SGPS+RFSD PPPPP
Sbjct: 121 SSQRFSDSPPSPPPSIVLHKEPRIKFGILPKGVPTHSSGPSRRFSDSPPPPP 169
BLAST of CsGy4G019020 vs. NCBI nr
Match:
XP_031744042.1 (proline-rich receptor-like protein kinase PERK9 [Cucumis sativus])
HSP 1 Score: 223 bits (569), Expect = 3.32e-69
Identity = 118/172 (68.60%), Postives = 136/172 (79.07%), Query Frame = 0
Query: 9 MALRTCALVLLGVHFGIYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIHFSLSKPEG 68
MA RTC LVLLG+ +YQT+G +LF+VNE+NS+ E++FKA +IIHPKQHI FS +K +
Sbjct: 1 MAFRTCVLVLLGL---LYQTNGINLFEVNEENSVLENEFKAHVIIHPKQHIRFSRTKSKR 60
Query: 69 MPDSDTHFSFGIFSKDVRIPPSGPSQRSLDSPPLPSIVLHKESRINFGILPKGRRIPPSG 128
+PD DTH FGI SK +RIPPSGPSQRS DS P PSIVLHKES +NFGILPKG SG
Sbjct: 61 IPDFDTHSGFGILSKAIRIPPSGPSQRSSDSTPPPSIVLHKESMMNFGILPKGVPTHSSG 120
Query: 129 PSQRFSDSPPSP--FNVLHKESRLNFGILPKGMHIPPSGPSQRFSDYPPPPP 178
PSQRFSDSPPSP VLHKESR+ FGILPKG+ SGPS+RFSD PPPPP
Sbjct: 121 PSQRFSDSPPSPPPSIVLHKESRIKFGILPKGVPTHSSGPSRRFSDSPPPPP 169
BLAST of CsGy4G019020 vs. NCBI nr
Match:
XP_038882352.1 (uncharacterized protein LOC120073615 [Benincasa hispida])
HSP 1 Score: 201 bits (512), Expect = 2.02e-62
Identity = 106/192 (55.21%), Postives = 135/192 (70.31%), Query Frame = 0
Query: 1 MVTIISSNMALRTCALVLLGVHFGIYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIH 60
MV+I SN+ CA + LG+H IYQT +++F V + +S E++ I HPK+HIH
Sbjct: 1 MVSISYSNVFFGACAFIFLGLHSTIYQTDAKNVFVVEKGDSTIENELSDENIKHPKRHIH 60
Query: 61 FSLSKPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSLDSPPLP----SIVLHKESRINFG 120
FSL K + +PD F+ GI SK+ R+PPSG SQ + D+PP P SI+LHKESRINF
Sbjct: 61 FSLGKYKRIPDLAPPFNLGILSKEERVPPSGLSQSTSDNPPPPPHVISIILHKESRINFR 120
Query: 121 ILPKGRRIPPSGPSQRFSDSPPSPFN----VLHKESRLNFGILPKGMHIPPSGPSQRFSD 180
+L KG RIPPSGPSQR S+SPP P + +LHK+ +NFGILPK MHIPPSGPS+RFS+
Sbjct: 121 VLSKGNRIPPSGPSQRTSESPPPPPHALSVILHKKPGINFGILPKSMHIPPSGPSKRFSN 180
Query: 181 YPPPPPHAPFLI 184
YP PP HAP +I
Sbjct: 181 YPSPPTHAPSVI 192
BLAST of CsGy4G019020 vs. NCBI nr
Match:
KAA0040932.1 (proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK10605.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa])
HSP 1 Score: 194 bits (494), Expect = 1.02e-59
Identity = 105/188 (55.85%), Postives = 131/188 (69.68%), Query Frame = 0
Query: 5 ISSNMALRTCALVLLGVHFGIYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIHFSLS 64
+ +N+ R CAL+ LG+HF IY T+ + F V+EDNS E+ + II HPK+ HFS
Sbjct: 4 VGTNIFFRACALIFLGLHFEIYLTNAKRSFVVDEDNSSLENVLRGDIINHPKKVAHFSWP 63
Query: 65 KPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSLDSPPLPS---IVLHKESRINFGILPKG 124
KP+ +PD FS GI SK +R PPSG SQ + ++PP P I+LHKES +NFGILPKG
Sbjct: 64 KPKKIPDLAPPFSLGILSKGIRTPPSGLSQGTSNNPPSPHVAPIILHKESMVNFGILPKG 123
Query: 125 RRIPPSGPSQRFSDSPPSPFNVLH--KESRLNFGILPKGMHIPPSGPSQRFSDYPPPPP- 184
RIPPSGPS+R SD PP P + H KESR+ +GILPKG+ IPPSGPS+R SDY PPPP
Sbjct: 124 VRIPPSGPSRRTSDPPP-PLSNFHSIKESRIKYGILPKGVRIPPSGPSKRNSDYYPPPPM 183
BLAST of CsGy4G019020 vs. ExPASy TrEMBL
Match:
A0A5D3BP86 (Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold109G00050 PE=4 SV=1)
HSP 1 Score: 278 bits (710), Expect = 5.53e-93
Identity = 135/180 (75.00%), Postives = 154/180 (85.56%), Query Frame = 0
Query: 5 ISSNMALRTCALVLLGVHFGIYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIHFSLS 64
ISSNM +TCA+VLLG+HFGIYQT+GR+LF+V EDNS+FE++ KA IIHPKQ+IHFSL+
Sbjct: 4 ISSNMTFKTCAIVLLGLHFGIYQTNGRNLFEVKEDNSVFENELKANSIIHPKQYIHFSLT 63
Query: 65 KPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSLDSPPLPSIVLHKESRINFGILPKGRRI 124
K +G P+ DTHF FGIFSKDVRIPPSGPSQRS DSPP PS++L KESRINFGILPKG RI
Sbjct: 64 KSKGKPEFDTHFRFGIFSKDVRIPPSGPSQRSSDSPPSPSMILPKESRINFGILPKGSRI 123
Query: 125 PPSGPSQRFSDSPPSPFNVLHKESRLNFGILPKGMHIPPSGPSQRFSDYPPPPPHAPFLI 184
PPSGPSQRFSDSP P LHK S + FG+LPKG HIPPSGPS+R SD PPPPPH+P LI
Sbjct: 124 PPSGPSQRFSDSPLPPSTFLHKGSNMIFGMLPKGHHIPPSGPSKRTSDNPPPPPHSPSLI 183
BLAST of CsGy4G019020 vs. ExPASy TrEMBL
Match:
A0A5D3CGG1 (Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold315G00020 PE=4 SV=1)
HSP 1 Score: 194 bits (494), Expect = 4.95e-60
Identity = 105/188 (55.85%), Postives = 131/188 (69.68%), Query Frame = 0
Query: 5 ISSNMALRTCALVLLGVHFGIYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIHFSLS 64
+ +N+ R CAL+ LG+HF IY T+ + F V+EDNS E+ + II HPK+ HFS
Sbjct: 4 VGTNIFFRACALIFLGLHFEIYLTNAKRSFVVDEDNSSLENVLRGDIINHPKKVAHFSWP 63
Query: 65 KPEGMPDSDTHFSFGIFSKDVRIPPSGPSQRSLDSPPLPS---IVLHKESRINFGILPKG 124
KP+ +PD FS GI SK +R PPSG SQ + ++PP P I+LHKES +NFGILPKG
Sbjct: 64 KPKKIPDLAPPFSLGILSKGIRTPPSGLSQGTSNNPPSPHVAPIILHKESMVNFGILPKG 123
Query: 125 RRIPPSGPSQRFSDSPPSPFNVLH--KESRLNFGILPKGMHIPPSGPSQRFSDYPPPPP- 184
RIPPSGPS+R SD PP P + H KESR+ +GILPKG+ IPPSGPS+R SDY PPPP
Sbjct: 124 VRIPPSGPSRRTSDPPP-PLSNFHSIKESRIKYGILPKGVRIPPSGPSKRNSDYYPPPPM 183
BLAST of CsGy4G019020 vs. ExPASy TrEMBL
Match:
A0A6J1HRH7 (actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3661 GN=LOC111467141 PE=4 SV=1)
HSP 1 Score: 139 bits (351), Expect = 1.61e-35
Identity = 81/174 (46.55%), Postives = 113/174 (64.94%), Query Frame = 0
Query: 25 IYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIHFSLSKPEGMPDSDTHFS------F 84
IY+T +++ +V+ D+++ E++ I+ +PK S+S + H + F
Sbjct: 185 IYETDAKNVVEVDVDDNIREAELTDLIVQNPKG---ISISSSRSSQRTLFHRTPSPSLVF 244
Query: 85 GIFSKDVRIPPSGPSQRSLDSPPLP----SIVLHKESRINFGILPKGRRIPPSGPSQRFS 144
G+ K V IPPS PSQR+ D PP P SI+L+K S+IN G+LP+G IPPSGPSQR S
Sbjct: 245 GMLPKGVPIPPSRPSQRTSDYPPPPPRASSIILNKHSKINLGMLPRGVPIPPSGPSQRTS 304
Query: 145 DSPPSPFN----VLHKESRLNFGILPKGMHIPPSGPSQRFSDYPPPPPHAPFLI 184
D PP P + +L+K+S++NFG+LPKG+ IPPSGPSQR S YPPPPP A +I
Sbjct: 305 DYPPPPPHASSVILNKQSKINFGMLPKGVPIPPSGPSQRTSXYPPPPPRASSVI 355
BLAST of CsGy4G019020 vs. ExPASy TrEMBL
Match:
A0A5D3BVE5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2454G00110 PE=4 SV=1)
HSP 1 Score: 117 bits (293), Expect = 3.38e-29
Identity = 84/227 (37.00%), Postives = 102/227 (44.93%), Query Frame = 0
Query: 6 SSNMALRTCALVLLGVHFGIYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIHFSLSK 65
SSN+ CAL+LLG HF Y R V E N + E++ +I HPK+HI FS K
Sbjct: 5 SSNVLFGACALILLGFHFETYHIDARRFSVVGEGNYVIENQLTDHVIEHPKRHIRFSWPK 64
Query: 66 PEGMPDSDTHFSFGIFSKDVRIPPS----------------------------------- 125
P+ +PD FS GI +KDV PPS
Sbjct: 65 PKRVPDLAPPFSLGILTKDVLTPPSENEVGDNIVKHPKNHIRFSWIWPKPKRVPDLAPPF 124
Query: 126 --GPSQRSLDSPPLP-----SIVLHKESRINF-----------------GILPKGRRIPP 169
G L +PP SI H +S I F GIL K R PP
Sbjct: 125 SLGILNNHLRTPPSENELGHSITKHPKSHIRFSWPKPKRVPDLAPPFSLGILSKDVRTPP 184
BLAST of CsGy4G019020 vs. ExPASy TrEMBL
Match:
A0A5A7T8R0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold36G001740 PE=4 SV=1)
HSP 1 Score: 114 bits (285), Expect = 6.21e-28
Identity = 80/227 (35.24%), Postives = 103/227 (45.37%), Query Frame = 0
Query: 6 SSNMALRTCALVLLGVHFGIYQTHGRSLFDVNEDNSLFESKFKAPIIIHPKQHIHFSLSK 65
SSN+ TCAL+LLG+HF Y RS E N + E++ +I HPK+HI FS K
Sbjct: 12 SSNVLFGTCALILLGLHFETYHIDARSFSVFGEGNYVIENQLTDHVIEHPKRHIRFSWPK 71
Query: 66 PEGMPDSDTHFSFGIFSKDVRIPPS----------------------------------- 125
P+ +PD FS GI +KD+ PPS
Sbjct: 72 PKRVPDLAPPFSLGILTKDILTPPSENELGDNIVKHPKSHIRFSWVWPKPKRVPDLAPPF 131
Query: 126 --GPSQRSLDSPPLP-----SIVLHKESRINF-----------------GILPKGRRIPP 169
G L +PP SI H ++ I F GIL K R PP
Sbjct: 132 SLGILNNHLRNPPSENELGHSITKHPKNHIRFSWPKPKRVPDLAPPFSLGILLKDVRTPP 191
BLAST of CsGy4G019020 vs. TAIR 10
Match:
AT1G68690.1 (Protein kinase superfamily protein )
HSP 1 Score: 44.3 bits (103), Expect = 1.3e-04
Identity = 34/96 (35.42%), Postives = 41/96 (42.71%), Query Frame = 0
Query: 83 KDVRIPPSGPSQRSLDSPPLPSIVLHKESRINFGILPKGRRIPPSGPSQRFSDSPPSPFN 142
+ ++ PP PS R SPP PS PPS PS+R + SPPSP +
Sbjct: 150 RPIQSPPPPPSDRPTQSPPPPS--------------------PPSPPSERPTQSPPSPPS 209
Query: 143 VLHKESRLNFGILPKGMHIPPSGPSQRFSDYPPPPP 179
+S PPS PS R S PPPPP
Sbjct: 210 ERPTQS--------PPPPSPPSPPSDRPSQSPPPPP 217
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
KAA0043502.1 | 1.14e-92 | 75.00 | proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK0113... | [more] |
XP_031744003.1 | 5.14e-70 | 69.19 | abl interactor homolog [Cucumis sativus] | [more] |
XP_031744042.1 | 3.32e-69 | 68.60 | proline-rich receptor-like protein kinase PERK9 [Cucumis sativus] | [more] |
XP_038882352.1 | 2.02e-62 | 55.21 | uncharacterized protein LOC120073615 [Benincasa hispida] | [more] |
KAA0040932.1 | 1.02e-59 | 55.85 | proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK1060... | [more] |
Match Name | E-value | Identity | Description | |
A0A5D3BP86 | 5.53e-93 | 75.00 | Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194... | [more] |
A0A5D3CGG1 | 4.95e-60 | 55.85 | Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194... | [more] |
A0A6J1HRH7 | 1.61e-35 | 46.55 | actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3... | [more] |
A0A5D3BVE5 | 3.38e-29 | 37.00 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A5A7T8R0 | 6.21e-28 | 35.24 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... | [more] |
Match Name | E-value | Identity | Description | |
AT1G68690.1 | 1.3e-04 | 35.42 | Protein kinase superfamily protein | [more] |