Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGACAATGTTAGGGTTTCTTACTATCCCATACCAATGGAAGATTTCCCCATCTCTCTCTTCATCTTCATCACCTTTTTCCCTCTCAACGAGATCTTCCCTCTCCTTTCCACTCTTCAAATTCCCTCTCCATTACTCGGAATCTCAAATTCCTGGTAAACGCAAAAGATTCACGGCGTTGGCGAGTAATAACGACAATGGATTGGGCGGGAATATCAAGGAGAGAGAAGGAGAAAGAAATGGAGCGAAGGGCTCCAATGGCGGCGATGACTTGAGGAAAGAACGAGGGCCGGTTTTCAATATCAAATGGGCTGAACTCTTAATCGATCCGGATCCTGATAACATCTTGGCCGTGGCGTTGACTGGCTTGCTTGCTTGGGCAAGTGTTCAGGTTTTGTGGCAGCTATTCTTCATCTCTTTGGCTATTTTAGTGGCTGCTCTTAAGTACTCTTTTATTGCTGCGCTTCTTATTTTCATTCTAATTACATTACTATAG
mRNA sequence
ATGACAATGTTAGGGTTTCTTACTATCCCATACCAATGGAAGATTTCCCCATCTCTCTCTTCATCTTCATCACCTTTTTCCCTCTCAACGAGATCTTCCCTCTCCTTTCCACTCTTCAAATTCCCTCTCCATTACTCGGAATCTCAAATTCCTGGTAAACGCAAAAGATTCACGGCGTTGGCGAGTAATAACGACAATGGATTGGGCGGGAATATCAAGGAGAGAGAAGGAGAAAGAAATGGAGCGAAGGGCTCCAATGGCGGCGATGACTTGAGGAAAGAACGAGGGCCGGTTTTCAATATCAAATGGGCTGAACTCTTAATCGATCCGGATCCTGATAACATCTTGGCCGTGGCGTTGACTGGCTTGCTTGCTTGGGCAAGTGTTCAGGTTTTGTGGCAGCTATTCTTCATCTCTTTGGCTATTTTAGTGGCTGCTCTTAAGTACTCTTTTATTGCTGCGCTTCTTATTTTCATTCTAATTACATTACTATAG
Coding sequence (CDS)
ATGACAATGTTAGGGTTTCTTACTATCCCATACCAATGGAAGATTTCCCCATCTCTCTCTTCATCTTCATCACCTTTTTCCCTCTCAACGAGATCTTCCCTCTCCTTTCCACTCTTCAAATTCCCTCTCCATTACTCGGAATCTCAAATTCCTGGTAAACGCAAAAGATTCACGGCGTTGGCGAGTAATAACGACAATGGATTGGGCGGGAATATCAAGGAGAGAGAAGGAGAAAGAAATGGAGCGAAGGGCTCCAATGGCGGCGATGACTTGAGGAAAGAACGAGGGCCGGTTTTCAATATCAAATGGGCTGAACTCTTAATCGATCCGGATCCTGATAACATCTTGGCCGTGGCGTTGACTGGCTTGCTTGCTTGGGCAAGTGTTCAGGTTTTGTGGCAGCTATTCTTCATCTCTTTGGCTATTTTAGTGGCTGCTCTTAAGTACTCTTTTATTGCTGCGCTTCTTATTTTCATTCTAATTACATTACTATAG
Protein sequence
MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTALASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
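The protein sequence above can be reproduced from the CDS by reading it three bases at a time under the standard genetic code. The following is a minimal Python sketch of that check, not the database's own pipeline. The names `CODON_TABLE` and `translate` are illustrative assumptions, and only the first 18 bases of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGACAATGTTAGGGTTT"))
```

Passing the full CDS string instead of the prefix should yield the complete protein sequence, with the terminal TAG stop codon dropped.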
Homology
BLAST of Cp4.1LG01g14320 vs. NCBI nr
Match:
XP_023549272.1 (uncharacterized protein LOC111807677 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 304 bits (778), Expect = 1.11e-103
Identity = 164/164 (100.00%), Positives = 164/164 (100.00%), Query Frame = 0
Query: 1 MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTAL 60
MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTAL
Sbjct: 1 MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTAL 60
Query: 61 ASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
ASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAVAL
Sbjct: 61 ASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
Query: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Cp4.1LG01g14320 vs. NCBI nr
Match:
KAG7032002.1 (hypothetical protein SDJN02_06044, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 294 bits (752), Expect = 1.02e-99
Identity = 160/164 (97.56%), Positives = 160/164 (97.56%), Query Frame = 0
Query: 1 MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTAL 60
MTMLGFLTIPYQWKISPSL SSSSPFSLSTRSSLSF LFKFPLHY ESQIPGKRKRFTAL
Sbjct: 1 MTMLGFLTIPYQWKISPSLPSSSSPFSLSTRSSLSFSLFKFPLHYLESQIPGKRKRFTAL 60
Query: 61 ASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
ASNNDNGLGGNIKEREGERNGAKGSNG DDLRKERGPVFNIKWAELLIDPDPDNILAVAL
Sbjct: 61 ASNNDNGLGGNIKEREGERNGAKGSNGDDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
Query: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
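The Identity figure in an HSP header is the number of alignment columns in which query and subject carry the same residue, divided by the alignment length. As a rough sketch of that arithmetic (the function name `identity` is an assumption for illustration, and this is not BLAST's own code), the first row of the KAG7032002.1 alignment above can be scored like this:

```python
def identity(query: str, subject: str) -> tuple[int, int]:
    """Return (identical columns, alignment length) for two aligned strings."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have the same length")
    # A column counts as identical when both residues match and are not gaps.
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return matches, len(query)


# Query and Sbjct residues 1-60 from the KAG7032002.1 HSP.
query = "MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTAL"
sbjct = "MTMLGFLTIPYQWKISPSLPSSSSPFSLSTRSSLSFSLFKFPLHYLESQIPGKRKRFTAL"

m, n = identity(query, sbjct)
print(f"{m}/{n} = {100 * m / n:.2f}%")
```

Concatenating all three rows of the HSP before calling `identity` gives the whole-alignment figure, which can then be compared with the 160/164 reported in the header. The Positives count additionally includes substitutions that score above zero in the scoring matrix, which this sketch does not model.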
BLAST of Cp4.1LG01g14320 vs. NCBI nr
Match:
XP_022957096.1 (uncharacterized protein LOC111458579 [Cucurbita moschata])
HSP 1 Score: 294 bits (752), Expect = 1.02e-99
Identity = 160/164 (97.56%), Positives = 160/164 (97.56%), Query Frame = 0
Query: 1 MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTAL 60
MTMLGFLTIPYQWKISPSL SSSSPFSLSTRSSLSF LFKFPLHYSESQIPGKRKRFTAL
Sbjct: 1 MTMLGFLTIPYQWKISPSLPSSSSPFSLSTRSSLSFSLFKFPLHYSESQIPGKRKRFTAL 60
Query: 61 ASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
ASNNDNGLGGNIKEREGERNGAKGS G DDLRKERGPVFNIKWAELLIDPDPDNILAVAL
Sbjct: 61 ASNNDNGLGGNIKEREGERNGAKGSKGDDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
Query: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Cp4.1LG01g14320 vs. NCBI nr
Match:
XP_022993576.1 (uncharacterized protein LOC111489528 [Cucurbita maxima])
HSP 1 Score: 290 bits (741), Expect = 4.85e-98
Identity = 158/164 (96.34%), Positives = 158/164 (96.34%), Query Frame = 0
Query: 1 MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTAL 60
MTMLGFLTIPYQWKISPSLSSSSSPF LSTRS LSFPLFKFPLHYSESQI GKRKRF AL
Sbjct: 1 MTMLGFLTIPYQWKISPSLSSSSSPFPLSTRSFLSFPLFKFPLHYSESQISGKRKRFAAL 60
Query: 61 ASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
ASNNDNGLGGNIKEREGERNGAKGS G DDLRKERGPVFNIKWAELLIDPDPDNILAVAL
Sbjct: 61 ASNNDNGLGGNIKEREGERNGAKGSKGDDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
Query: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Cp4.1LG01g14320 vs. NCBI nr
Match:
KAG6601207.1 (hypothetical protein SDJN03_06440, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 250 bits (638), Expect = 3.86e-82
Identity = 127/130 (97.69%), Positives = 127/130 (97.69%), Query Frame = 0
Query: 1 MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTAL 60
MTMLGFLTIPYQWKISPSL SSSSPFSLSTRSS SFPLFKFPLHYSESQIPGKRKRFTAL
Sbjct: 1 MTMLGFLTIPYQWKISPSLPSSSSPFSLSTRSSFSFPLFKFPLHYSESQIPGKRKRFTAL 60
Query: 61 ASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
ASNNDNGLGGNIKEREGERNGAKGSNG DDLRKERGPVFNIKWAELLIDPDPDNILAVAL
Sbjct: 61 ASNNDNGLGGNIKEREGERNGAKGSNGDDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
Query: 121 TGLLAWASVQ 130
TGLLAWASVQ
Sbjct: 121 TGLLAWASVQ 130
BLAST of Cp4.1LG01g14320 vs. ExPASy TrEMBL
Match:
A0A6J1GYA2 (uncharacterized protein LOC111458579 OS=Cucurbita moschata OX=3662 GN=LOC111458579 PE=4 SV=1)
HSP 1 Score: 294 bits (752), Expect = 4.94e-100
Identity = 160/164 (97.56%), Positives = 160/164 (97.56%), Query Frame = 0
Query: 1 MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTAL 60
MTMLGFLTIPYQWKISPSL SSSSPFSLSTRSSLSF LFKFPLHYSESQIPGKRKRFTAL
Sbjct: 1 MTMLGFLTIPYQWKISPSLPSSSSPFSLSTRSSLSFSLFKFPLHYSESQIPGKRKRFTAL 60
Query: 61 ASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
ASNNDNGLGGNIKEREGERNGAKGS G DDLRKERGPVFNIKWAELLIDPDPDNILAVAL
Sbjct: 61 ASNNDNGLGGNIKEREGERNGAKGSKGDDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
Query: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Cp4.1LG01g14320 vs. ExPASy TrEMBL
Match:
A0A6J1JWP5 (uncharacterized protein LOC111489528 OS=Cucurbita maxima OX=3661 GN=LOC111489528 PE=4 SV=1)
HSP 1 Score: 290 bits (741), Expect = 2.35e-98
Identity = 158/164 (96.34%), Positives = 158/164 (96.34%), Query Frame = 0
Query: 1 MTMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYSESQIPGKRKRFTAL 60
MTMLGFLTIPYQWKISPSLSSSSSPF LSTRS LSFPLFKFPLHYSESQI GKRKRF AL
Sbjct: 1 MTMLGFLTIPYQWKISPSLSSSSSPFPLSTRSFLSFPLFKFPLHYSESQISGKRKRFAAL 60
Query: 61 ASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
ASNNDNGLGGNIKEREGERNGAKGS G DDLRKERGPVFNIKWAELLIDPDPDNILAVAL
Sbjct: 61 ASNNDNGLGGNIKEREGERNGAKGSKGDDDLRKERGPVFNIKWAELLIDPDPDNILAVAL 120
Query: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Cp4.1LG01g14320 vs. ExPASy TrEMBL
Match:
A0A6J1CF05 (uncharacterized protein LOC111010120 OS=Momordica charantia OX=3673 GN=LOC111010120 PE=4 SV=1)
HSP 1 Score: 223 bits (568), Expect = 6.09e-72
Identity = 130/168 (77.38%), Positives = 138/168 (82.14%), Query Frame = 0
Query: 3 MLGFLTIPYQWKISPS--LSSSSSPFSLSTRSSLSFPLFKFPLHYS----ESQIPGKRKR 62
MLGF T+P QW + LSS+ +P S S S + P FKF LHY+ SQIP R R
Sbjct: 1 MLGFRTLPCQWSSASVRLLSSTPTPSSSSKISLRTVPRFKFTLHYALLMTRSQIPRNRAR 60
Query: 63 FTALASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNIL 122
FTA + N DNGLGGNIKEREGER GAKGSNGGDDL+KERGPVFNIKWAELLIDPDPDNIL
Sbjct: 61 FTAFSGNGDNGLGGNIKEREGERTGAKGSNGGDDLKKERGPVFNIKWAELLIDPDPDNIL 120
Query: 123 AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 168
BLAST of Cp4.1LG01g14320 vs. ExPASy TrEMBL
Match:
A0A1S3BFX0 (uncharacterized protein LOC103489407 OS=Cucumis melo OX=3656 GN=LOC103489407 PE=4 SV=1)
HSP 1 Score: 222 bits (565), Expect = 3.16e-71
Identity = 132/168 (78.57%), Positives = 143/168 (85.12%), Query Frame = 0
Query: 2 TMLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHY----SESQIPGKRKRF 61
TMLGFLTIP+Q KISPSL+S S +S+ SSL PL+KFPLH+ S+ I R+RF
Sbjct: 22 TMLGFLTIPHQLKISPSLASLPS---ISSPSSLFLPLYKFPLHHTFFNSKFLISSNRRRF 81
Query: 62 TALASNNDNGLGGNIKEREGERNGAKGS-NGGDDLRKERGPVFNIKWAELLIDPDPDNIL 121
TA ASN + GG+IKEREGERNGAK S NGGDDL+KERGPVFNIKWAELLIDPDPDNIL
Sbjct: 82 TASASNKNTEFGGSIKEREGERNGAKSSSNGGDDLKKERGPVFNIKWAELLIDPDPDNIL 141
Query: 122 AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 142 AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 186
BLAST of Cp4.1LG01g14320 vs. ExPASy TrEMBL
Match:
A0A5D3CD16 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G002130 PE=4 SV=1)
HSP 1 Score: 220 bits (560), Expect = 8.80e-71
Identity = 131/167 (78.44%), Positives = 142/167 (85.03%), Query Frame = 0
Query: 3 MLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHY----SESQIPGKRKRFT 62
MLGFLTIP+Q KISPSL+S S +S+ SSL PL+KFPLH+ S+ I R+RFT
Sbjct: 1 MLGFLTIPHQLKISPSLASLPS---ISSPSSLFLPLYKFPLHHTFFNSKFLISSNRRRFT 60
Query: 63 ALASNNDNGLGGNIKEREGERNGAKGS-NGGDDLRKERGPVFNIKWAELLIDPDPDNILA 122
A ASN + GG+IKEREGERNGAK S NGGDDL+KERGPVFNIKWAELLIDPDPDNILA
Sbjct: 61 ASASNKNTEFGGSIKEREGERNGAKSSSNGGDDLKKERGPVFNIKWAELLIDPDPDNILA 120
Query: 123 VALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
VALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 VALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Cp4.1LG01g14320 vs. TAIR 10
Match:
AT4G40045.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 87.0 bits (214), Expect = 1.5e-17
Identity = 63/128 (49.22%), Positives = 81/128 (63.28%), Query Frame = 0
Query: 37 PLFKFPLHYSESQIPGKRKRFTALASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERG 96
P F + + + I + L +N NG + KE G N ++ G+ +K++
Sbjct: 15 PRFSYNIPHHHHHIRLCKSPSLILRTNAQNG-NDSAKESSGGGNRPVTNDDGNGSKKDQF 74
Query: 97 PVFNIKWAELLIDPDPDNILAVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALL 156
F+ KW ELL +PD DN +AV L G+L WAS+QVL QLFFIS AILVAALKYSFIAALL
Sbjct: 75 AGFSFKWGELL-NPDQDNFVAVGLAGVLTWASLQVLSQLFFISFAILVAALKYSFIAALL 134
Query: 157 IFILITLL 165
IFIL+TLL
Sbjct: 135 IFILVTLL 140
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023549272.1 | 1.11e-103 | 100.00 | uncharacterized protein LOC111807677 [Cucurbita pepo subsp. pepo]
KAG7032002.1 | 1.02e-99 | 97.56 | hypothetical protein SDJN02_06044, partial [Cucurbita argyrosperma subsp. argyro...
XP_022957096.1 | 1.02e-99 | 97.56 | uncharacterized protein LOC111458579 [Cucurbita moschata]
XP_022993576.1 | 4.85e-98 | 96.34 | uncharacterized protein LOC111489528 [Cucurbita maxima]
KAG6601207.1 | 3.86e-82 | 97.69 | hypothetical protein SDJN03_06440, partial [Cucurbita argyrosperma subsp. sorori...
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1GYA2 | 4.94e-100 | 97.56 | uncharacterized protein LOC111458579 OS=Cucurbita moschata OX=3662 GN=LOC1114585...
A0A6J1JWP5 | 2.35e-98 | 96.34 | uncharacterized protein LOC111489528 OS=Cucurbita maxima OX=3661 GN=LOC111489528...
A0A6J1CF05 | 6.09e-72 | 77.38 | uncharacterized protein LOC111010120 OS=Momordica charantia OX=3673 GN=LOC111010...
A0A1S3BFX0 | 3.16e-71 | 78.57 | uncharacterized protein LOC103489407 OS=Cucumis melo OX=3656 GN=LOC103489407 PE=...
A0A5D3CD16 | 8.80e-71 | 78.44 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G40045.1 | 1.5e-17 | 49.22 | unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...