Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGCAGCTTGCGCTCTCATCCTCCTTGGGTCGCACGTTGCAATCTACCAAGCAGATGCTAAAAATGTTTTTGTAGACGTCGGAGGCTTCCCTATCATAGACAACGTATTGAGCGACCCGAGTATAAAACATCCGAAAAGGCACATCCGTTTCTCAACGGGGAAACCCAAGAGGGTACCTGACTTAGATCCACCTTCCAGCGTGGGAATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGTCCAAGAACTTCGGACAATCCACCTCCCCCACCGCATGTCTCGTCCACCATTTTGCACAAGCAATCTAGCATCAACTTCGGAGTGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGCCAAAGAACTTCAGAGAGTCCACCTCCCCCGCCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTTGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCACCGCCCCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGATTAGATTTGGGATGTATCCTAAAAATAACCCCATTCCACCATCTGCTCCAAGCGGACGCGTACCATCATGA
mRNA sequence
ATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGCAGCTTGCGCTCTCATCCTCCTTGGGTCGCACGTTGCAATCTACCAAGCAGATGCTAAAAATGTTTTTGTAGACGTCGGAGGCTTCCCTATCATAGACAACGTATTGAGCGACCCGAGTATAAAACATCCGAAAAGGCACATCCGTTTCTCAACGGGGAAACCCAAGAGGGTACCTGACTTAGATCCACCTTCCAGCGTGGGAATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGTCCAAGAACTTCGGACAATCCACCTCCCCCACCGCATGTCTCGTCCACCATTTTGCACAAGCAATCTAGCATCAACTTCGGAGTGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGCCAAAGAACTTCAGAGAGTCCACCTCCCCCGCCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTTGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCACCGCCCCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGATTAGATTTGGGATGTATCCTAAAAATAACCCCATTCCACCATCTGCTCCAAGCGGACGCGTACCATCATGA
Coding sequence (CDS)
ATGGTGTCAATTAGTTCTTGTAATGTGTTTTTCGCAGCTTGCGCTCTCATCCTCCTTGGGTCGCACGTTGCAATCTACCAAGCAGATGCTAAAAATGTTTTTGTAGACGTCGGAGGCTTCCCTATCATAGACAACGTATTGAGCGACCCGAGTATAAAACATCCGAAAAGGCACATCCGTTTCTCAACGGGGAAACCCAAGAGGGTACCTGACTTAGATCCACCTTCCAGCGTGGGAATACTATCGAAAGAAGAGCGCACTACCCCGTCCGGACCGAGTCCAAGAACTTCGGACAATCCACCTCCCCCACCGCATGTCTCGTCCACCATTTTGCACAAGCAATCTAGCATCAACTTCGGAGTGTTACCGAAAGGTGAGCGTATTCCTCCGTCTGGGCCGAGCCAAAGAACTTCAGAGAGTCCACCTCCCCCGCCGCATGCCCCATCCGTCATTTTACACAAGGAATCTGGGATCAACTTTGGAATATTACCGAAAGGCGTGCGTATTCCACCGTCAGGGCCGAGTACAAGATCTTCGGACTATCCACCGCCCCCACCTCATGCTCCTTTCGTTATTTTAAAGAAAGAATCTAAGATTAGATTTGGGATGTATCCTAAAAATAACCCCATTCCACCATCTGCTCCAAGCGGACGCGTACCATCATGA
Protein sequence
MVSISSCNVFFAACALILLGSHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIRFSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFGVLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPPPPHAPFVILKKESKIRFGMYPKNNPIPPSAPSGRVPS
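The protein sequence above is the translation of the coding sequence under the standard genetic code. The following is a minimal sketch of that relationship, not the database's own pipeline: the `translate` helper and the `CODON_TABLE` name are hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 nucleotides of the CDS listed above.
cds_start = "ATGGTGTCAATTAGTTCTTGTAATGTGTTT"
cds_end = "GGACGCGTACCATCATGA"

print(translate(cds_start))  # MVSISSCNVF
print(translate(cds_end))    # GRVPS
```

The two fragments reproduce the first ten and last five residues of the protein sequence; passing the full CDS to `translate` would be the natural next check.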
Homology
BLAST of Cla97C04G071360 vs. NCBI nr
Match:
XP_038882352.1 (uncharacterized protein LOC120073615 [Benincasa hispida])
HSP 1 Score: 267.7 bits (683), Expect = 8.9e-68
Identity = 141/192 (73.44%), Positives = 150/192 (78.12%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGSHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
MVSIS NVFF ACA I LG H IYQ DAKNVFV G I+N LSD +IKHPKRHI
Sbjct: 1 MVSISYSNVFFGACAFIFLGLHSTIYQTDAKNVFVVEKGDSTIENELSDENIKHPKRHIH 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFG 120
FS GK KR+PDL PP ++GILSKEER PSG S TSDNPPPPPHV S ILHK+S INF
Sbjct: 61 FSLGKYKRIPDLAPPFNLGILSKEERVPPSGLSQSTSDNPPPPPHVISIILHKESRINFR 120
Query: 121 VLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSD 180
VL KG RIPPSGPSQRTSESPPPPPHA SVILHK+ GINFGILPK + IPPSGPS R S+
Sbjct: 121 VLSKGNRIPPSGPSQRTSESPPPPPHALSVILHKKPGINFGILPKSMHIPPSGPSKRFSN 180
Query: 181 YPPPPPHAPFVI 193
YP PP HAP VI
Sbjct: 181 YPSPPTHAPSVI 192
BLAST of Cla97C04G071360 vs. NCBI nr
Match:
KAG6603169.1 (hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 204.5 bits (519), Expect = 9.3e-49
Identity = 120/201 (59.70%), Positives = 139/201 (69.15%), Query Frame = 0
Query: 23 VAIYQADAKNVFVDVGGFPII-DNVLSDPSIKHPKR-HIRFSTGKPKRVPDLDPPSSV-- 82
V +Y+ DAK V V+V + I + L+D +KHPK I S + + L PP V
Sbjct: 23 VQVYETDAKKV-VEVDVYDTIREEELTDQMVKHPKGISISSSRSSQRTLFHLTPPPPVVL 82
Query: 83 GILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFGVLPKGERIPPSGPSQRTS 142
+L K +PSGPS RTSD PPPPP SS IL KQS INFG+LPKG IPPSGPSQRTS
Sbjct: 83 RMLPKGVPISPSGPSQRTSDYPPPPPRASSVILSKQSKINFGMLPKGVPIPPSGPSQRTS 142
Query: 143 ESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPPPP-HAPFVILKKES 202
+ PPPPP A SVIL+K+S INFG+LPKGV IPPSGPS R+S+YPPPPP HA VIL +S
Sbjct: 143 DYPPPPPRASSVILNKQSKINFGMLPKGVPIPPSGPSQRTSNYPPPPPLHASSVILNTQS 202
Query: 203 KIRFGMYPKNNPIPPSAPSGR 219
KI FGM PK PIPPS PS R
Sbjct: 203 KINFGMLPKGVPIPPSGPSQR 222
BLAST of Cla97C04G071360 vs. NCBI nr
Match:
KAA0040932.1 (proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK10605.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa])
HSP 1 Score: 199.1 bits (505), Expect = 3.9e-47
Identity = 115/195 (58.97%), Positives = 137/195 (70.26%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGSHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
MVS+ + N+FF ACALI LG H IY +AK FV ++NVL I HPK+
Sbjct: 1 MVSVGT-NIFFRACALIFLGLHFEIYLTNAKRSFVVDEDNSSLENVLRGDIINHPKKVAH 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFG 120
FS KPK++PDL PP S+GILSK RT PSG S TS+N PP PHV+ ILHK+S +NFG
Sbjct: 61 FSWPKPKKIPDLAPPFSLGILSKGIRTPPSGLSQGTSNN-PPSPHVAPIILHKESMVNFG 120
Query: 121 VLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSD 180
+LPKG RIPPSGPS+RTS+ PPP + S+ KES I +GILPKGVRIPPSGPS R+SD
Sbjct: 121 ILPKGVRIPPSGPSRRTSDPPPPLSNFHSI---KESRIKYGILPKGVRIPPSGPSKRNSD 180
Query: 181 YPPPPP---HAPFVI 193
Y PPPP HAP +I
Sbjct: 181 YYPPPPMHLHAPSII 190
BLAST of Cla97C04G071360 vs. NCBI nr
Match:
XP_022967687.1 (actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima])
HSP 1 Score: 184.9 bits (468), Expect = 7.6e-43
Identity = 117/212 (55.19%), Positives = 138/212 (65.09%), Query Frame = 0
Query: 11 FAACALILLGSHVAIYQADAKNVF-VDVGGFPIIDNVLSDPSIKHPKRHIRFSTGKPKRV 70
+AA LGS+ IY+ DAKNV VDV I + L+D +++PK S+ +R
Sbjct: 172 YAATDTEPLGSN-QIYETDAKNVVEVDVDD-NIREAELTDLIVQNPKGISISSSRSSQRT 231
Query: 71 PDLDPPSS---VGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFGVLPKGE 130
PS G+L K PS PS RTSD PPPPP SS IL+K S IN G+LP+G
Sbjct: 232 LFHRTPSPSLVFGMLPKGVPIPPSRPSQRTSDYPPPPPRASSIILNKHSKINLGMLPRGV 291
Query: 131 RIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPPPP 190
IPPSGPSQRTS+ PPPPPHA SVIL+K+S INFG+LPKGV IPPSGPS R+S YPPPPP
Sbjct: 292 PIPPSGPSQRTSDYPPPPPHASSVILNKQSKINFGMLPKGVPIPPSGPSQRTSXYPPPPP 351
Query: 191 HAPFVILKKESKIRFGMYPKNNPIPPSAPSGR 219
A VIL K+SKI GM P+ PIPP S R
Sbjct: 352 RASSVILNKQSKIYLGMLPRGVPIPPPGLSQR 381
BLAST of Cla97C04G071360 vs. NCBI nr
Match:
KAA0043502.1 (proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK01137.1 proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa])
HSP 1 Score: 183.7 bits (465), Expect = 1.7e-42
Identity = 105/192 (54.69%), Positives = 128/192 (66.67%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGSHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
M+SISS N+ F CA++LLG H IYQ + +N+F + +N L SI HPK++I
Sbjct: 1 MMSISS-NMTFKTCAIVLLGLHFGIYQTNGRNLFEVKEDNSVFENELKANSIIHPKQYIH 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFG 120
FS K K P+ D GI SK+ R PSGPS R+SD+PP P S IL K+S INFG
Sbjct: 61 FSLTKSKGKPEFDTHFRFGIFSKDVRIPPSGPSQRSSDSPPSP----SMILPKESRINFG 120
Query: 121 VLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSD 180
+LPKG RIPPSGPSQR S+SP P PS LHK S + FG+LPKG IPPSGPS R+SD
Sbjct: 121 ILPKGSRIPPSGPSQRFSDSPLP----PSTFLHKGSNMIFGMLPKGHHIPPSGPSKRTSD 180
Query: 181 YPPPPPHAPFVI 193
PPPPPH+P +I
Sbjct: 181 NPPPPPHSPSLI 183
BLAST of Cla97C04G071360 vs. ExPASy TrEMBL
Match:
A0A5D3CGG1 (Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold315G00020 PE=4 SV=1)
HSP 1 Score: 199.1 bits (505), Expect = 1.9e-47
Identity = 115/195 (58.97%), Positives = 137/195 (70.26%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGSHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
MVS+ + N+FF ACALI LG H IY +AK FV ++NVL I HPK+
Sbjct: 1 MVSVGT-NIFFRACALIFLGLHFEIYLTNAKRSFVVDEDNSSLENVLRGDIINHPKKVAH 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFG 120
FS KPK++PDL PP S+GILSK RT PSG S TS+N PP PHV+ ILHK+S +NFG
Sbjct: 61 FSWPKPKKIPDLAPPFSLGILSKGIRTPPSGLSQGTSNN-PPSPHVAPIILHKESMVNFG 120
Query: 121 VLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSD 180
+LPKG RIPPSGPS+RTS+ PPP + S+ KES I +GILPKGVRIPPSGPS R+SD
Sbjct: 121 ILPKGVRIPPSGPSRRTSDPPPPLSNFHSI---KESRIKYGILPKGVRIPPSGPSKRNSD 180
Query: 181 YPPPPP---HAPFVI 193
Y PPPP HAP +I
Sbjct: 181 YYPPPPMHLHAPSII 190
BLAST of Cla97C04G071360 vs. ExPASy TrEMBL
Match:
A0A6J1HRH7 (actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3661 GN=LOC111467141 PE=4 SV=1)
HSP 1 Score: 184.9 bits (468), Expect = 3.7e-43
Identity = 117/212 (55.19%), Positives = 138/212 (65.09%), Query Frame = 0
Query: 11 FAACALILLGSHVAIYQADAKNVF-VDVGGFPIIDNVLSDPSIKHPKRHIRFSTGKPKRV 70
+AA LGS+ IY+ DAKNV VDV I + L+D +++PK S+ +R
Sbjct: 172 YAATDTEPLGSN-QIYETDAKNVVEVDVDD-NIREAELTDLIVQNPKGISISSSRSSQRT 231
Query: 71 PDLDPPSS---VGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFGVLPKGE 130
PS G+L K PS PS RTSD PPPPP SS IL+K S IN G+LP+G
Sbjct: 232 LFHRTPSPSLVFGMLPKGVPIPPSRPSQRTSDYPPPPPRASSIILNKHSKINLGMLPRGV 291
Query: 131 RIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSDYPPPPP 190
IPPSGPSQRTS+ PPPPPHA SVIL+K+S INFG+LPKGV IPPSGPS R+S YPPPPP
Sbjct: 292 PIPPSGPSQRTSDYPPPPPHASSVILNKQSKINFGMLPKGVPIPPSGPSQRTSXYPPPPP 351
Query: 191 HAPFVILKKESKIRFGMYPKNNPIPPSAPSGR 219
A VIL K+SKI GM P+ PIPP S R
Sbjct: 352 RASSVILNKQSKIYLGMLPRGVPIPPPGLSQR 381
BLAST of Cla97C04G071360 vs. ExPASy TrEMBL
Match:
A0A5D3BP86 (Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold109G00050 PE=4 SV=1)
HSP 1 Score: 183.7 bits (465), Expect = 8.2e-43
Identity = 105/192 (54.69%), Positives = 128/192 (66.67%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGSHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
M+SISS N+ F CA++LLG H IYQ + +N+F + +N L SI HPK++I
Sbjct: 1 MMSISS-NMTFKTCAIVLLGLHFGIYQTNGRNLFEVKEDNSVFENELKANSIIHPKQYIH 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPSGPSPRTSDNPPPPPHVSSTILHKQSSINFG 120
FS K K P+ D GI SK+ R PSGPS R+SD+PP P S IL K+S INFG
Sbjct: 61 FSLTKSKGKPEFDTHFRFGIFSKDVRIPPSGPSQRSSDSPPSP----SMILPKESRINFG 120
Query: 121 VLPKGERIPPSGPSQRTSESPPPPPHAPSVILHKESGINFGILPKGVRIPPSGPSTRSSD 180
+LPKG RIPPSGPSQR S+SP P PS LHK S + FG+LPKG IPPSGPS R+SD
Sbjct: 121 ILPKGSRIPPSGPSQRFSDSPLP----PSTFLHKGSNMIFGMLPKGHHIPPSGPSKRTSD 180
Query: 181 YPPPPPHAPFVI 193
PPPPPH+P +I
Sbjct: 181 NPPPPPHSPSLI 183
BLAST of Cla97C04G071360 vs. ExPASy TrEMBL
Match:
A0A5A7T8R0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold36G001740 PE=4 SV=1)
HSP 1 Score: 146.0 bits (367), Expect = 1.9e-31
Identity = 103/233 (44.21%), Positives = 123/233 (52.79%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGSHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
M+S SS NV F CALILLG H Y DA++ V G +I+N L+D I+HPKRHIR
Sbjct: 8 MMSTSS-NVLFGTCALILLGLHFETYHIDARSFSVFGEGNYVIENQLTDHVIEHPKRHIR 67
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPS---------------------GPSP-RTSD 120
FS KPKRVPDL PP S+GIL+K+ T PS P P R D
Sbjct: 68 FSWPKPKRVPDLAPPFSLGILTKDILTPPSENELGDNIVKHPKSHIRFSWVWPKPKRVPD 127
Query: 121 NPPP------------PPHVS----STILHKQSSINF-----------------GVLPKG 179
PP PP + S H ++ I F G+L K
Sbjct: 128 LAPPFSLGILNNHLRNPPSENELGHSITKHPKNHIRFSWPKPKRVPDLAPPFSLGILLKD 187
BLAST of Cla97C04G071360 vs. ExPASy TrEMBL
Match:
A0A5D3BVE5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2454G00110 PE=4 SV=1)
HSP 1 Score: 141.4 bits (355), Expect = 4.7e-30
Identity = 102/233 (43.78%), Positives = 120/233 (51.50%), Query Frame = 0
Query: 1 MVSISSCNVFFAACALILLGSHVAIYQADAKNVFVDVGGFPIIDNVLSDPSIKHPKRHIR 60
M+S SS NV F ACALILLG H Y DA+ V G +I+N L+D I+HPKRHIR
Sbjct: 1 MMSTSS-NVLFGACALILLGFHFETYHIDARRFSVVGEGNYVIENQLTDHVIEHPKRHIR 60
Query: 61 FSTGKPKRVPDLDPPSSVGILSKEERTTPS---------------------GPSP-RTSD 120
FS KPKRVPDL PP S+GIL+K+ T PS P P R D
Sbjct: 61 FSWPKPKRVPDLAPPFSLGILTKDVLTPPSENEVGDNIVKHPKNHIRFSWIWPKPKRVPD 120
Query: 121 NPPP------------PPHVS----STILHKQSSINF-----------------GVLPKG 179
PP PP + S H +S I F G+L K
Sbjct: 121 LAPPFSLGILNNHLRTPPSENELGHSITKHPKSHIRFSWPKPKRVPDLAPPFSLGILSKD 180
The following BLAST results are available for this feature:
BLAST of Cla97C04G071360 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038882352.1 | 8.9e-68 | 73.44 | uncharacterized protein LOC120073615 [Benincasa hispida]
KAG6603169.1 | 9.3e-49 | 59.70 | hypothetical protein SDJN03_03778, partial [Cucurbita argyrosperma subsp. sorori...
KAA0040932.1 | 3.9e-47 | 58.97 | proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK1060...
XP_022967687.1 | 7.6e-43 | 55.19 | actin cytoskeleton-regulatory complex protein PAN1-like [Cucurbita maxima]
KAA0043502.1 | 1.7e-42 | 54.69 | proline-rich protein HaeIII subfamily 1-like [Cucumis melo var. makuwa] >TYK0113...
BLAST of Cla97C04G071360 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3CGG1 | 1.9e-47 | 58.97 | Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194...
A0A6J1HRH7 | 3.7e-43 | 55.19 | actin cytoskeleton-regulatory complex protein PAN1-like OS=Cucurbita maxima OX=3...
A0A5D3BP86 | 8.2e-43 | 54.69 | Proline-rich protein HaeIII subfamily 1-like OS=Cucumis melo var. makuwa OX=1194...
A0A5A7T8R0 | 1.9e-31 | 44.21 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3BVE5 | 4.7e-30 | 43.78 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
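The identity percentages in the summary are the matched-residue fractions from each HSP statistics line, expressed as a percentage of the alignment length. A minimal sketch of recomputing them from such a line follows; the `hsp_percentages` helper and the `FRACTION` pattern are hypothetical names introduced for illustration, not part of BLAST or of this database.

```python
import re

# Matches "<label> = <numerator>/<denominator>" pairs in an HSP statistics line.
FRACTION = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line: str) -> dict:
    """Return {label: percentage} for every 'label = n/m' pair in the line."""
    return {
        label: 100.0 * int(num) / int(den)
        for label, num, den in FRACTION.findall(line)
    }


# Statistics line of the top NCBI nr hit (XP_038882352.1).
line = "Identity = 141/192 (73.44%), Positives = 150/192 (78.12%), Query Frame = 0"
for label, pct in hsp_percentages(line).items():
    print(f"{label}: {pct:.4f}%")
```

For the top hit this gives 73.4375% identity over 192 aligned positions, which the report shows rounded to 73.44.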