Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGCAGCTTGCGCTCTCATCCTCCTTGGGTTGCACGTTGCAATCTACCAAGCAGATGCTAAAAATGTTTTTGTAGACGTCGGAGGCTTCCCTATCATAGACAACGTATTGAGCGACCCGAGTATAAAACATCCGAAAAGGCATATCCGTTTCTCAACGGGGAAACCCAAGAGGGTACCTGACTTAGATCCACCTTCCAGCGTGGGAATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGTCCAAGAACTTCGGACAATCCACCTCCCCCACCGCATGTCTCGTCCACCATTTTGCACAAGCAATCTAGCATCAACTTCGGAGTGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGCCAAAGAACTTCAGAGAGTCCACCTCCCCCGCCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTTGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCACCGCCCCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGATTAGATTTGGGATGTATCCTAAAAATAACCCCATTCCACCATCTGCTCCAAGCGGACGCGTACCATCATGA
mRNA sequence
ATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGCAGCTTGCGCTCTCATCCTCCTTGGGTTGCACGTTGCAATCTACCAAGCAGATGCTAAAAATGTTTTTGTAGACGTCGGAGGCTTCCCTATCATAGACAACGTATTGAGCGACCCGAGTATAAAACATCCGAAAAGGCATATCCGTTTCTCAACGGGGAAACCCAAGAGGGTACCTGACTTAGATCCACCTTCCAGCGTGGGAATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGTCCAAGAACTTCGGACAATCCACCTCCCCCACCGCATGTCTCGTCCACCATTTTGCACAAGCAATCTAGCATCAACTTCGGAGTGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGCCAAAGAACTTCAGAGAGTCCACCTCCCCCGCCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTTGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCACCGCCCCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGATTAGATTTGGGATGTATCCTAAAAATAACCCCATTCCACCATCTGCTCCAAGCGGACGCGTACCATCATGA
Coding sequence (CDS)
ATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGCAGCTTGCGCTCTCATCCTCCTTGGGTTGCACGTTGCAATCTACCAAGCAGATGCTAAAAATGTTTTTGTAGACGTCGGAGGCTTCCCTATCATAGACAACGTATTGAGCGACCCGAGTATAAAACATCCGAAAAGGCATATCCGTTTCTCAACGGGGAAACCCAAGAGGGTACCTGACTTAGATCCACCTTCCAGCGTGGGAATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGTCCAAGAACTTCGGACAATCCACCTCCCCCACCGCATGTCTCGTCCACCATTTTGCACAAGCAATCTAGCATCAACTTCGGAGTGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGCCAAAGAACTTCAGAGAGTCCACCTCCCCCGCCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTTGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCACCGCCCCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGATTAGATTTGGGATGTATCCTAAAAATAACCCCATTCCACCATCTGCTCCAAGCGGACGCGTACCATCATGA
Protein sequence
MVSISSCNVFFAACALILLGLHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIRFSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFGVLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPPPPHAPFVILKKESKIRFGMYPKNNPIPPSAPSGRVPS
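The protein sequence above can be checked against the coding sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` and the constants are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGGTGTCAATTAGTTCTTGTAATGTG"))  # MVSISSCNV
```

Passing the full CDS string instead of the nine-codon prefix should reproduce the protein sequence listed above, with the final TGA read as the stop codon.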
Homology
BLAST of Clc04G04910 vs. NCBI nr
Match:
XP_038882352.1 (uncharacterized protein LOC120073615 [Benincasa hispida])
HSP 1 Score: 270.0 bits (689), Expect = 1.8e-68
Identity = 142/192 (73.96%), Positives = 151/192 (78.65%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGLHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
MVSIS NVFF ACA I LGLH IYQ DAKNVFV G I+N LSD +IKHPKRHI
Sbjct: 1 MVSISYSNVFFGACAFIFLGLHSTIYQTDAKNVFVVEKGDSTIENELSDENIKHPKRHIH 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFG 120
FS GK KR+PDL PP ++GILSKEER PSG S TSDNPPPPPHV S ILHK+S INF
Sbjct: 61 FSLGKYKRIPDLAPPFNLGILSKEERVPPSGLSQSTSDNPPPPPHVISIILHKESRINFR 120
Query: 121 VLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSD 180
VL KG RIPPSGPSQRTSESPPPPPHA SVILHK+ GINFGILPK + IPPSGPS R S+
Sbjct: 121 VLSKGNRIPPSGPSQRTSESPPPPPHALSVILHKKPGINFGILPKSMHIPPSGPSKRFSN 180
Query: 181 YPPPPPHAPFVI 193
YP PP HAP VI
Sbjct: 181 YPSPPTHAPSVI 192
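The percentages on each HSP summary line are the matching-position counts divided by the alignment length. A small sketch of that arithmetic, assuming the summary line layout shown in this report; the function name `hsp_percentages` is hypothetical, and the pattern also tolerates the "Postives" misspelling that some report exports contain.

```python
import re

# Matches "Identity = 142/192" and "Positives = 151/192" (or "Postives").
STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line):
    """Recompute identity and positives percentages from an HSP summary line."""
    result = {}
    for label, matched, length in STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(length), 2)
    return result


line = "Identity = 142/192 (73.96%), Positives = 151/192 (78.65%), Query Frame = 0"
print(hsp_percentages(line))  # {'identity': 73.96, 'positives': 78.65}
```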
BLAST of Clc04G04910 vs. NCBI nr
Match:
KAG6603169.1 (hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 204.5 bits (519), Expect = 9.3e-49
Identity = 120/201 (59.70%), Positives = 139/201 (69.15%), Query Frame = 0
Query: 23 VAIYQADAKNVFVDVGGFPII-DNVLSDPSIKHPKR-HIRFSTGKPKRVPDLDPPSSV-- 82
V +Y+ DAK V V+V + I + L+D +KHPK I S + + L PP V
Sbjct: 23 VQVYETDAKKV-VEVDVYDTIREEELTDQMVKHPKGISISSSRSSQRTLFHLTPPPPVVL 82
Query: 83 GILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFGVLPKGERIPPSGPSQRTS 142
+L K +PSGPS RTSD PPPPP SS IL KQS INFG+LPKG IPPSGPSQRTS
Sbjct: 83 RMLPKGVPISPSGPSQRTSDYPPPPPRASSVILSKQSKINFGMLPKGVPIPPSGPSQRTS 142
Query: 143 ESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPPPP-HAPFVILKKES 202
+ PPPPP A SVIL+K+S INFG+LPKGV IPPSGPS R+S+YPPPPP HA VIL +S
Sbjct: 143 DYPPPPPRASSVILNKQSKINFGMLPKGVPIPPSGPSQRTSNYPPPPPLHASSVILNTQS 202
Query: 203 KIRFGMYPKNNPIPPSAPSGR 219
KI FGM PK PIPPS PS R
Sbjct: 203 KINFGMLPKGVPIPPSGPSQR 222
BLAST of Clc04G04910 vs. NCBI nr
Match:
KAA0040932.1 (proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK10605.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa])
HSP 1 Score: 201.1 bits (510), Expect = 1.0e-47
Identity = 116/195 (59.49%), Positives = 138/195 (70.77%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGLHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
MVS+ + N+FF ACALI LGLH IY +AK FV ++NVL I HPK+
Sbjct: 1 MVSVGT-NIFFRACALIFLGLHFEIYLTNAKRSFVVDEDNSSLENVLRGDIINHPKKVAH 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFG 120
FS KPK++PDL PP S+GILSK RT PSG S TS+N PP PHV+ ILHK+S +NFG
Sbjct: 61 FSWPKPKKIPDLAPPFSLGILSKGIRTPPSGLSQGTSNN-PPSPHVAPIILHKESMVNFG 120
Query: 121 VLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSD 180
+LPKG RIPPSGPS+RTS+ PPP + S+ KES I +GILPKGVRIPPSGPS R+SD
Sbjct: 121 ILPKGVRIPPSGPSRRTSDPPPPLSNFHSI---KESRIKYGILPKGVRIPPSGPSKRNSD 180
Query: 181 YPPPPP---HAPFVI 193
Y PPPP HAP +I
Sbjct: 181 YYPPPPMHLHAPSII 190
BLAST of Clc04G04910 vs. NCBI nr
Match:
KAA0043502.1 (proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK01137.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa])
HSP 1 Score: 186.0 bits (471), Expect = 3.4e-43
Identity = 106/192 (55.21%), Positives = 129/192 (67.19%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGLHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
M+SISS N+ F CA++LLGLH IYQ + +N+F + +N L SI HPK++I
Sbjct: 1 MMSISS-NMTFKTCAIVLLGLHFGIYQTNGRNLFEVKEDNSVFENELKANSIIHPKQYIH 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFG 120
FS K K P+ D GI SK+ R PSGPS R+SD+PP P S IL K+S INFG
Sbjct: 61 FSLTKSKGKPEFDTHFRFGIFSKDVRIPPSGPSQRSSDSPPSP----SMILPKESRINFG 120
Query: 121 VLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSD 180
+LPKG RIPPSGPSQR S+SP P PS LHK S + FG+LPKG IPPSGPS R+SD
Sbjct: 121 ILPKGSRIPPSGPSQRFSDSPLP----PSTFLHKGSNMIFGMLPKGHHIPPSGPSKRTSD 180
Query: 181 YPPPPPHAPFVI 193
PPPPPH+P +I
Sbjct: 181 NPPPPPHSPSLI 183
BLAST of Clc04G04910 vs. NCBI nr
Match:
XP_022967687.1 (actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima])
HSP 1 Score: 185.7 bits (470), Expect = 4.5e-43
Identity = 112/198 (56.57%), Positives = 131/198 (66.16%), Query Frame = 0
Query: 25 IYQADAKNVF-VDVGGFPIIDNVLSDPSIKHPKRHIRFSTGKPKRVPDLDPPSS---VGI 84
IY+ DAKNV VDV I + L+D +++PK S+ +R PS G+
Sbjct: 185 IYETDAKNVVEVDVDD-NIREAELTDLIVQNPKGISISSSRSSQRTLFHRTPSPSLVFGM 244
Query: 85 LSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFGVLPKGERIPPSGPSQRTSES 144
L K PS PS RTSD PPPPP SS IL+K S IN G+LP+G IPPSGPSQRTS+
Sbjct: 245 LPKGVPIPPSRPSQRTSDYPPPPPRASSIILNKHSKINLGMLPRGVPIPPSGPSQRTSDY 304
Query: 145 PPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPPPPHAPFVILKKESKIR 204
PPPPPHA SVIL+K+S INFG+LPKGV IPPSGPS R+S YPPPPP A VIL K+SKI
Sbjct: 305 PPPPPHASSVILNKQSKINFGMLPKGVPIPPSGPSQRTSXYPPPPPRASSVILNKQSKIY 364
Query: 205 FGMYPKNNPIPPSAPSGR 219
GM P+ PIPP S R
Sbjct: 365 LGMLPRGVPIPPPGLSQR 381
BLAST of Clc04G04910 vs. ExPASy TrEMBL
Match:
A0A5D3CGG1 (Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold315G00020 PE=4 SV=1)
HSP 1 Score: 201.1 bits (510), Expect = 5.0e-48
Identity = 116/195 (59.49%), Positives = 138/195 (70.77%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGLHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
MVS+ + N+FF ACALI LGLH IY +AK FV ++NVL I HPK+
Sbjct: 1 MVSVGT-NIFFRACALIFLGLHFEIYLTNAKRSFVVDEDNSSLENVLRGDIINHPKKVAH 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFG 120
FS KPK++PDL PP S+GILSK RT PSG S TS+N PP PHV+ ILHK+S +NFG
Sbjct: 61 FSWPKPKKIPDLAPPFSLGILSKGIRTPPSGLSQGTSNN-PPSPHVAPIILHKESMVNFG 120
Query: 121 VLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSD 180
+LPKG RIPPSGPS+RTS+ PPP + S+ KES I +GILPKGVRIPPSGPS R+SD
Sbjct: 121 ILPKGVRIPPSGPSRRTSDPPPPLSNFHSI---KESRIKYGILPKGVRIPPSGPSKRNSD 180
Query: 181 YPPPPP---HAPFVI 193
Y PPPP HAP +I
Sbjct: 181 YYPPPPMHLHAPSII 190
BLAST of Clc04G04910 vs. ExPASy TrEMBL
Match:
A0A5D3BP86 (Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold109G00050 PE=4 SV=1)
HSP 1 Score: 186.0 bits (471), Expect = 1.7e-43
Identity = 106/192 (55.21%), Positives = 129/192 (67.19%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGLHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
M+SISS N+ F CA++LLGLH IYQ + +N+F + +N L SI HPK++I
Sbjct: 1 MMSISS-NMTFKTCAIVLLGLHFGIYQTNGRNLFEVKEDNSVFENELKANSIIHPKQYIH 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFG 120
FS K K P+ D GI SK+ R PSGPS R+SD+PP P S IL K+S INFG
Sbjct: 61 FSLTKSKGKPEFDTHFRFGIFSKDVRIPPSGPSQRSSDSPPSP----SMILPKESRINFG 120
Query: 121 VLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSD 180
+LPKG RIPPSGPSQR S+SP P PS LHK S + FG+LPKG IPPSGPS R+SD
Sbjct: 121 ILPKGSRIPPSGPSQRFSDSPLP----PSTFLHKGSNMIFGMLPKGHHIPPSGPSKRTSD 180
Query: 181 YPPPPPHAPFVI 193
PPPPPH+P +I
Sbjct: 181 NPPPPPHSPSLI 183
BLAST of Clc04G04910 vs. ExPASy TrEMBL
Match:
A0A6J1HRH7 (actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3661 GN=LOC111467141 PE=4 SV=1)
HSP 1 Score: 185.7 bits (470), Expect = 2.2e-43
Identity = 112/198 (56.57%), Positives = 131/198 (66.16%), Query Frame = 0
Query: 25 IYQADAKNVF-VDVGGFPIIDNVLSDPSIKHPKRHIRFSTGKPKRVPDLDPPSS---VGI 84
IY+ DAKNV VDV I + L+D +++PK S+ +R PS G+
Sbjct: 185 IYETDAKNVVEVDVDD-NIREAELTDLIVQNPKGISISSSRSSQRTLFHRTPSPSLVFGM 244
Query: 85 LSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFGVLPKGERIPPSGPSQRTSES 144
L K PS PS RTSD PPPPP SS IL+K S IN G+LP+G IPPSGPSQRTS+
Sbjct: 245 LPKGVPIPPSRPSQRTSDYPPPPPRASSIILNKHSKINLGMLPRGVPIPPSGPSQRTSDY 304
Query: 145 PPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPPPPHAPFVILKKESKIR 204
PPPPPHA SVIL+K+S INFG+LPKGV IPPSGPS R+S YPPPPP A VIL K+SKI
Sbjct: 305 PPPPPHASSVILNKQSKINFGMLPKGVPIPPSGPSQRTSXYPPPPPRASSVILNKQSKIY 364
Query: 205 FGMYPKNNPIPPSAPSGR 219
GM P+ PIPP S R
Sbjct: 365 LGMLPRGVPIPPPGLSQR 381
BLAST of Clc04G04910 vs. ExPASy TrEMBL
Match:
A0A5A7T8R0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold36G001740 PE=4 SV=1)
HSP 1 Score: 147.9 bits (372), Expect = 5.0e-32
Identity = 104/233 (44.64%), Positives = 124/233 (53.22%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGLHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
M+S SS NV F CALILLGLH Y DA++ V G +I+N L+D I+HPKRHIR
Sbjct: 8 MMSTSS-NVLFGTCALILLGLHFETYHIDARSFSVFGEGNYVIENQLTDHVIEHPKRHIR 67
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPS---------------------GPSP-RTSD 120
FS KPKRVPDL PP S+GIL+K+ T PS P P R D
Sbjct: 68 FSWPKPKRVPDLAPPFSLGILTKDILTPPSENELGDNIVKHPKSHIRFSWVWPKPKRVPD 127
Query: 121 NPPP------------PPHVS----STILHKQSSINF-----------------GVLPKG 179
PP PP + S H ++ I F G+L K
Sbjct: 128 LAPPFSLGILNNHLRNPPSENELGHSITKHPKNHIRFSWPKPKRVPDLAPPFSLGILLKD 187
BLAST of Clc04G04910 vs. ExPASy TrEMBL
Match:
A0A5D3BVE5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2454G00110 PE=4 SV=1)
HSP 1 Score: 141.7 bits (356), Expect = 3.6e-30
Identity = 102/233 (43.78%), Positives = 120/233 (51.50%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGLHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
M+S SS NV F ACALILLG H Y DA+ V G +I+N L+D I+HPKRHIR
Sbjct: 1 MMSTSS-NVLFGACALILLGFHFETYHIDARRFSVVGEGNYVIENQLTDHVIEHPKRHIR 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPS---------------------GPSP-RTSD 120
FS KPKRVPDL PP S+GIL+K+ T PS P P R D
Sbjct: 61 FSWPKPKRVPDLAPPFSLGILTKDVLTPPSENEVGDNIVKHPKNHIRFSWIWPKPKRVPD 120
Query: 121 NPPP------------PPHVS----STILHKQSSINF-----------------GVLPKG 179
PP PP + S H +S I F G+L K
Sbjct: 121 LAPPFSLGILNNHLRTPPSENELGHSITKHPKSHIRFSWPKPKRVPDLAPPFSLGILSKD 180
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038882352.1 | 1.8e-68 | 73.96 | uncharacterized protein LOC120073615 [Benincasa hispida]
KAG6603169.1 | 9.3e-49 | 59.70 | hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sorori...
KAA0040932.1 | 1.0e-47 | 59.49 | proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK1060...
KAA0043502.1 | 3.4e-43 | 55.21 | proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK0113...
XP_022967687.1 | 4.5e-43 | 56.57 | actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima]
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3CGG1 | 5.0e-48 | 59.49 | Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194...
A0A5D3BP86 | 1.7e-43 | 55.21 | Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194...
A0A6J1HRH7 | 2.2e-43 | 56.57 | actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3...
A0A5A7T8R0 | 5.0e-32 | 44.64 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3BVE5 | 3.6e-30 | 43.78 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
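Rows of the summary tables can be read into records for sorting or filtering by E-value. A rough sketch, assuming the pipe-delimited layout used above with the first four fields being match name, E-value, percent identity, and description; `parse_hit` is a hypothetical helper, not a database API.

```python
def parse_hit(row):
    """Split one pipe-delimited summary row into a typed record."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]  # ignore any trailing columns
    return {
        "name": name,
        "evalue": float(evalue),      # e.g. "1.8e-68" parses as a float
        "identity": float(identity),  # percent identity
        "description": description,
    }


rows = [
    "KAA0043502.1 | 3.4e-43 | 55.21 | proline-rich protein HaeIII subfamily 1-like",
    "XP_038882352.1 | 1.8e-68 | 73.96 | uncharacterized protein LOC120073615 [Benincasa hispida]",
]
hits = sorted((parse_hit(r) for r in rows), key=lambda h: h["evalue"])
print(hits[0]["name"])  # XP_038882352.1
```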