Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
GGGATCCCAATCCAACGTCAGCTTCGTTTCCAGCGTCCGAATCCTGTGAACTTCTTCTTCTCAGGCTCTGCCAGTGTTTCGCTCAGTGCGGAATGAGGAACGTTCAGGTTGCTTTTCGAACCATTGGAGGTGGCATAACGAGGAAATTTAAGCAACCTTCTGCGCTCAGAAACCTTTCCACAAACGCCGCTCGAGGTAATTTTTACTTTTTACTTATACTGTGATTTCTGAAGTTCTTTTACCTAATGATTTGGCCGGAAATTTGTCAGGTTTATTGTTAAGACTGTTGCTCAAATGTGTTTCGGTTTAGATCTTGTTTGTCCGAGTTTTTGCGTGAAAAACCTCATGTTTAGTTGTTTAATTGGTTAATGTTCTTCGAGGCATGTTTTTACATCTTTCGTTTTTGATAATTGTAAACAGATTGTAGTTTCGATGATGAGGGAAAATCCAATTCGTTTGAGTCTGCTGATGACTTTGAACGTCGGATATTTGGTGGCGTATCTTTGGGCGATTCCGGAAACGATGCTTTCTTTGAGAAGCTTGATAGACTTGGTAAACCTCGTGAAAGGATCGGTTCAAGGCTGAGTGGTGGAAACAATTTTCAGGCGTTGTATGGTATTGAAGATAATCTTAACACGCTGTCGGATGGGATGGATGGCAAGTTGAAGAAAGCTTCTACGTATTTTGAGTTTGATCCTGAAGAAATAGCAAAAGATGATTATACTTTCAGAGCGGATATGTCCTTTAAACCTGGATCAACATACGAAATCAAGACCCGTAGCTGGGCAACTAATTGTTTTTATACTTGTTAA
mRNA sequence
GGGATCCCAATCCAACGTCAGCTTCGTTTCCAGCGTCCGAATCCTGTGAACTTCTTCTTCTCAGGCTCTGCCAGTGTTTCGCTCAGTGCGGAATGAGGAACGTTCAGGTTGCTTTTCGAACCATTGGAGGTGGCATAACGAGGAAATTTAAGCAACCTTCTGCGCTCAGAAACCTTTCCACAAACGCCGCTCGAGATTGTAGTTTCGATGATGAGGGAAAATCCAATTCGTTTGAGTCTGCTGATGACTTTGAACGTCGGATATTTGGTGGCGTATCTTTGGGCGATTCCGGAAACGATGCTTTCTTTGAGAAGCTTGATAGACTTGGTAAACCTCGTGAAAGGATCGGTTCAAGGCTGAGTGGTGGAAACAATTTTCAGGCGTTGTATGGTATTGAAGATAATCTTAACACGCTGTCGGATGGGATGGATGGCAAGTTGAAGAAAGCTTCTACGTATTTTGAGTTTGATCCTGAAGAAATAGCAAAAGATGATTATACTTTCAGAGCGGATATGTCCTTTAAACCTGGATCAACATACGAAATCAAGACCCGTAGCTGGGCAACTAATTGTTTTTATACTTGTTAA
Coding sequence (CDS)
ATGAGGAACGTTCAGGTTGCTTTTCGAACCATTGGAGGTGGCATAACGAGGAAATTTAAGCAACCTTCTGCGCTCAGAAACCTTTCCACAAACGCCGCTCGAGATTGTAGTTTCGATGATGAGGGAAAATCCAATTCGTTTGAGTCTGCTGATGACTTTGAACGTCGGATATTTGGTGGCGTATCTTTGGGCGATTCCGGAAACGATGCTTTCTTTGAGAAGCTTGATAGACTTGGTAAACCTCGTGAAAGGATCGGTTCAAGGCTGAGTGGTGGAAACAATTTTCAGGCGTTGTATGGTATTGAAGATAATCTTAACACGCTGTCGGATGGGATGGATGGCAAGTTGAAGAAAGCTTCTACGTATTTTGAGTTTGATCCTGAAGAAATAGCAAAAGATGATTATACTTTCAGAGCGGATATGTCCTTTAAACCTGGATCAACATACGAAATCAAGACCCGTAGCTGGGCAACTAATTGTTTTTATACTTGTTAA
Protein sequence
MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKASTYFEFDPEEIAKDDYTFRADMSFKPGSTYEIKTRSWATNCFYTC
Homology
BLAST of CmoCh02G007470 vs. ExPASy TrEMBL
Match:
A0A6J1G586 (uncharacterized protein LOC111451027 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111451027 PE=3 SV=1)
HSP 1 Score: 306.2 bits (783), Expect = 8.1e-80
Identity = 152/152 (100.00%), Postives = 152/152 (100.00%), Query Frame = 0
Query: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG
Sbjct: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
Query: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS
Sbjct: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
Query: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 153
TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK
Sbjct: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 152
BLAST of CmoCh02G007470 vs. ExPASy TrEMBL
Match:
A0A6J1G5B3 (uncharacterized protein LOC111451027 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111451027 PE=4 SV=1)
HSP 1 Score: 306.2 bits (783), Expect = 8.1e-80
Identity = 152/152 (100.00%), Postives = 152/152 (100.00%), Query Frame = 0
Query: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG
Sbjct: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
Query: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS
Sbjct: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
Query: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 153
TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK
Sbjct: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 152
BLAST of CmoCh02G007470 vs. ExPASy TrEMBL
Match:
A0A6J1KYL3 (LOW QUALITY PROTEIN: uncharacterized protein LOC111499912 OS=Cucurbita maxima OX=3661 GN=LOC111499912 PE=3 SV=1)
HSP 1 Score: 295.4 bits (755), Expect = 1.4e-76
Identity = 149/152 (98.03%), Postives = 149/152 (98.03%), Query Frame = 0
Query: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
MRNV FRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG
Sbjct: 1 MRNV---FRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
Query: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS
Sbjct: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
Query: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 153
TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK
Sbjct: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 149
BLAST of CmoCh02G007470 vs. ExPASy TrEMBL
Match:
A0A6J1JHF0 (uncharacterized protein LOC111484513 OS=Cucurbita maxima OX=3661 GN=LOC111484513 PE=4 SV=1)
HSP 1 Score: 275.8 bits (704), Expect = 1.2e-70
Identity = 135/152 (88.82%), Postives = 143/152 (94.08%), Query Frame = 0
Query: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
M VQVA RT+GGG++R+FKQPS LRNLSTNAARD FDD+GK +SFESADDFERRIFGG
Sbjct: 1 MSTVQVALRTLGGGLSRRFKQPSVLRNLSTNAARDSGFDDKGKPDSFESADDFERRIFGG 60
Query: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
VS GDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYG++DNLNTLSDGMDGKLKKAS
Sbjct: 61 VSAGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGLDDNLNTLSDGMDGKLKKAS 120
Query: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 153
TYFEFD EEIAKDDYTFRADMSFKPGSTYEIK
Sbjct: 121 TYFEFDTEEIAKDDYTFRADMSFKPGSTYEIK 152
BLAST of CmoCh02G007470 vs. ExPASy TrEMBL
Match:
A0A6J1FYC1 (uncharacterized protein LOC111448465 OS=Cucurbita moschata OX=3662 GN=LOC111448465 PE=4 SV=1)
HSP 1 Score: 275.8 bits (704), Expect = 1.2e-70
Identity = 133/152 (87.50%), Postives = 144/152 (94.74%), Query Frame = 0
Query: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
MR VQVA RT+GGG++R+FKQPS LRNLSTNAARD FDD+GK +SFESADDFERRIFGG
Sbjct: 1 MRTVQVALRTLGGGLSRRFKQPSVLRNLSTNAARDSGFDDKGKPDSFESADDFERRIFGG 60
Query: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
VS GDSGNDAFFEKLDR+GKPRERIGSRLSGGNNFQ+LYG++DNLNTLSDGMDGKLKKAS
Sbjct: 61 VSTGDSGNDAFFEKLDRIGKPRERIGSRLSGGNNFQSLYGLDDNLNTLSDGMDGKLKKAS 120
Query: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 153
TYFEFD EEIAKDDYTFRADMSFKPGSTYE+K
Sbjct: 121 TYFEFDTEEIAKDDYTFRADMSFKPGSTYEVK 152
BLAST of CmoCh02G007470 vs. NCBI nr
Match:
XP_022947027.1 (uncharacterized protein LOC111451027 isoform X2 [Cucurbita moschata])
HSP 1 Score: 306.2 bits (783), Expect = 1.7e-79
Identity = 152/152 (100.00%), Postives = 152/152 (100.00%), Query Frame = 0
Query: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG
Sbjct: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
Query: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS
Sbjct: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
Query: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 153
TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK
Sbjct: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 152
BLAST of CmoCh02G007470 vs. NCBI nr
Match:
XP_022947026.1 (uncharacterized protein LOC111451027 isoform X1 [Cucurbita moschata])
HSP 1 Score: 306.2 bits (783), Expect = 1.7e-79
Identity = 152/152 (100.00%), Postives = 152/152 (100.00%), Query Frame = 0
Query: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG
Sbjct: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
Query: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS
Sbjct: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
Query: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 153
TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK
Sbjct: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 152
BLAST of CmoCh02G007470 vs. NCBI nr
Match:
XP_023007407.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111499912 [Cucurbita maxima])
HSP 1 Score: 295.4 bits (755), Expect = 3.0e-76
Identity = 149/152 (98.03%), Postives = 149/152 (98.03%), Query Frame = 0
Query: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
MRNV FRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG
Sbjct: 1 MRNV---FRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
Query: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS
Sbjct: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
Query: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 153
TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK
Sbjct: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 149
BLAST of CmoCh02G007470 vs. NCBI nr
Match:
XP_022986919.1 (uncharacterized protein LOC111484513 [Cucurbita maxima])
HSP 1 Score: 275.8 bits (704), Expect = 2.4e-70
Identity = 135/152 (88.82%), Postives = 143/152 (94.08%), Query Frame = 0
Query: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
M VQVA RT+GGG++R+FKQPS LRNLSTNAARD FDD+GK +SFESADDFERRIFGG
Sbjct: 1 MSTVQVALRTLGGGLSRRFKQPSVLRNLSTNAARDSGFDDKGKPDSFESADDFERRIFGG 60
Query: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
VS GDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYG++DNLNTLSDGMDGKLKKAS
Sbjct: 61 VSAGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGLDDNLNTLSDGMDGKLKKAS 120
Query: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 153
TYFEFD EEIAKDDYTFRADMSFKPGSTYEIK
Sbjct: 121 TYFEFDTEEIAKDDYTFRADMSFKPGSTYEIK 152
BLAST of CmoCh02G007470 vs. NCBI nr
Match:
XP_022943865.1 (uncharacterized protein LOC111448465 [Cucurbita moschata])
HSP 1 Score: 275.8 bits (704), Expect = 2.4e-70
Identity = 133/152 (87.50%), Postives = 144/152 (94.74%), Query Frame = 0
Query: 1 MRNVQVAFRTIGGGITRKFKQPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGG 60
MR VQVA RT+GGG++R+FKQPS LRNLSTNAARD FDD+GK +SFESADDFERRIFGG
Sbjct: 1 MRTVQVALRTLGGGLSRRFKQPSVLRNLSTNAARDSGFDDKGKPDSFESADDFERRIFGG 60
Query: 61 VSLGDSGNDAFFEKLDRLGKPRERIGSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKAS 120
VS GDSGNDAFFEKLDR+GKPRERIGSRLSGGNNFQ+LYG++DNLNTLSDGMDGKLKKAS
Sbjct: 61 VSTGDSGNDAFFEKLDRIGKPRERIGSRLSGGNNFQSLYGLDDNLNTLSDGMDGKLKKAS 120
Query: 121 TYFEFDPEEIAKDDYTFRADMSFKPGSTYEIK 153
TYFEFD EEIAKDDYTFRADMSFKPGSTYE+K
Sbjct: 121 TYFEFDTEEIAKDDYTFRADMSFKPGSTYEVK 152
BLAST of CmoCh02G007470 vs. TAIR 10
Match:
AT1G07210.1 (Ribosomal protein S18 )
HSP 1 Score: 51.6 bits (122), Expect = 6.9e-07
Identity = 43/148 (29.05%), Postives = 71/148 (47.97%), Query Frame = 0
Query: 21 QPSALRNLSTNAARDCSFDDEGKSNSFESADDFERRIFGGVSLGDSGNDAFFEKLDRLGK 80
+P R+L+TNA + D +SF+S+D+ + +FG + D ++ FF+ L + K
Sbjct: 21 RPIVSRSLATNA----NEDPNQNRSSFDSSDNLD-SLFGDYTGNDEKSNDFFQHLSKAEK 80
Query: 81 PRERI-------GSRLSGGNNFQALYGIEDNLNTLSDGMDGKLKKASTYFEFDPEEIAKD 140
+ GSR SGG + ++ + SDG+DGKLK+A+ + D + K
Sbjct: 81 DKRDFNGYNRSGGSRYSGGGSMSK----DETFDPSSDGVDGKLKEAALAYNMDEGDGFK- 140
Query: 141 DYTFRADMSFKPGSTYEIKTRSWATNCF 162
+Y+FR D + SW N F
Sbjct: 141 EYSFRPDFN-----------NSWGMNNF 147
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1G586 | 8.1e-80 | 100.00 | uncharacterized protein LOC111451027 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1G5B3 | 8.1e-80 | 100.00 | uncharacterized protein LOC111451027 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1KYL3 | 1.4e-76 | 98.03 | LOW QUALITY PROTEIN: uncharacterized protein LOC111499912 OS=Cucurbita maxima OX... | [more] |
A0A6J1JHF0 | 1.2e-70 | 88.82 | uncharacterized protein LOC111484513 OS=Cucurbita maxima OX=3661 GN=LOC111484513... | [more] |
A0A6J1FYC1 | 1.2e-70 | 87.50 | uncharacterized protein LOC111448465 OS=Cucurbita moschata OX=3662 GN=LOC1114484... | [more] |
Match Name | E-value | Identity | Description | |
XP_022947027.1 | 1.7e-79 | 100.00 | uncharacterized protein LOC111451027 isoform X2 [Cucurbita moschata] | [more] |
XP_022947026.1 | 1.7e-79 | 100.00 | uncharacterized protein LOC111451027 isoform X1 [Cucurbita moschata] | [more] |
XP_023007407.1 | 3.0e-76 | 98.03 | LOW QUALITY PROTEIN: uncharacterized protein LOC111499912 [Cucurbita maxima] | [more] |
XP_022986919.1 | 2.4e-70 | 88.82 | uncharacterized protein LOC111484513 [Cucurbita maxima] | [more] |
XP_022943865.1 | 2.4e-70 | 87.50 | uncharacterized protein LOC111448465 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
AT1G07210.1 | 6.9e-07 | 29.05 | Ribosomal protein S18 | [more] |