Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCTTCTACGATTCCTACTACGATTCTGCTCAAATTGAGCCTCCAATTCCGCAATCCAGCTACGAACCCACTTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGGTCAGGCTTATGCTCCCTACACATCCAATTTCAATGAATTCCCCCAATTGATCGAGTATCAACCCGTTGACCATGGGGCTTATGGCTATACAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGACTTTCAGTGTCCCAAAAGTGATCGAATACGACCCTGATTTGTATAGCGATGGTTACCAAAAGGTGTCGTCCCAATTTGTGATCTCCTACTCTGTTTCAGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGTGGTGGCTACGACATTCATGAAACCTACGGTAAGCCCCTTCAACCTTCAACTGACATTTGCTACTCACCCTCCTCTTCTTCACCTCCAAAACCCCCACCCACCGCAATTCAGGAGGCACCAAAGGAAAAAATTGAAGAAAAAACAAAGCCGTCGAGCGAAATCAAGCCGACCCAGATCGAGAAAGATAACACGGCATCTGAATCTGAAGAAATTGAGGAAGTTCAAGCGATTCCCTTTGCAGATCCGGGAATAGGGTATGGAAATGGAAGGGAAGTGAACCAATTTCCAAGTGGGTATGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAGCAAACAGGTTGTAGACAACCCAACAACGGCTGTGGGCGTTGCCATGGGCATTGCTATTGCTATGGCAATTACGGCAACCAGTGGCAGACGGCGGCGGACTATCTATTCGGAAGCCATAACCCATATCCAGATGGAAGGAGTGAAGGAGACGGTGTTTATGGGTATCAAACACAGTATCAAACGGAGCCTGTCTATGGCTACGTTTGGTTGAATCAAGACGACTTCGTTCGGTCCGATGACGCTTGA
mRNA sequence
ATGGCCTTCTACGATTCCTACTACGATTCTGCTCAAATTGAGCCTCCAATTCCGCAATCCAGCTACGAACCCACTTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGGTCAGGCTTATGCTCCCTACACATCCAATTTCAATGAATTCCCCCAATTGATCGAGTATCAACCCGTTGACCATGGGGCTTATGGCTATACAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGACTTTCAGTGTCCCAAAAGTGATCGAATACGACCCTGATTTGTATAGCGATGGTTACCAAAAGGTGTCGTCCCAATTTGTGATCTCCTACTCTGTTTCAGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGTGGTGGCTACGACATTCATGAAACCTACGGTAAGCCCCTTCAACCTTCAACTGACATTTGCTACTCACCCTCCTCTTCTTCACCTCCAAAACCCCCACCCACCGCAATTCAGGAGGCACCAAAGGAAAAAATTGAAGAAAAAACAAAGCCGTCGAGCGAAATCAAGCCGACCCAGATCGAGAAAGATAACACGGCATCTGAATCTGAAGAAATTGAGGAAGTTCAAGCGATTCCCTTTGCAGATCCGGGAATAGGGTATGGAAATGGAAGGGAAGTGAACCAATTTCCAAGTGGGTATGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAGCAAACAGGTTGTAGACAACCCAACAACGGCTGTGGGCGTTGCCATGGGCATTGCTATTGCTATGGCAATTACGGCAACCAGTGGCAGACGGCGGCGGACTATCTATTCGGAAGCCATAACCCATATCCAGATGGAAGGAGTGAAGGAGACGGTGTTTATGGGTATCAAACACAGTATCAAACGGAGCCTGTCTATGGCTACGTTTGGTTGAATCAAGACGACTTCGTTCGGTCCGATGACGCTTGA
Coding sequence (CDS)
ATGGCCTTCTACGATTCCTACTACGATTCTGCTCAAATTGAGCCTCCAATTCCGCAATCCAGCTACGAACCCACTTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGGTCAGGCTTATGCTCCCTACACATCCAATTTCAATGAATTCCCCCAATTGATCGAGTATCAACCCGTTGACCATGGGGCTTATGGCTATACAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGACTTTCAGTGTCCCAAAAGTGATCGAATACGACCCTGATTTGTATAGCGATGGTTACCAAAAGGTGTCGTCCCAATTTGTGATCTCCTACTCTGTTTCAGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGTGGTGGCTACGACATTCATGAAACCTACGGTAAGCCCCTTCAACCTTCAACTGACATTTGCTACTCACCCTCCTCTTCTTCACCTCCAAAACCCCCACCCACCGCAATTCAGGAGGCACCAAAGGAAAAAATTGAAGAAAAAACAAAGCCGTCGAGCGAAATCAAGCCGACCCAGATCGAGAAAGATAACACGGCATCTGAATCTGAAGAAATTGAGGAAGTTCAAGCGATTCCCTTTGCAGATCCGGGAATAGGGTATGGAAATGGAAGGGAAGTGAACCAATTTCCAAGTGGGTATGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAGCAAACAGGTTGTAGACAACCCAACAACGGCTGTGGGCGTTGCCATGGGCATTGCTATTGCTATGGCAATTACGGCAACCAGTGGCAGACGGCGGCGGACTATCTATTCGGAAGCCATAACCCATATCCAGATGGAAGGAGTGAAGGAGACGGTGTTTATGGGTATCAAACACAGTATCAAACGGAGCCTGTCTATGGCTACGTTTGGTTGAATCAAGACGACTTCGTTCGGTCCGATGACGCTTGA
Protein sequence
MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPSSEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA
Homology
BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match:
XP_023519612.1 (uncharacterized protein At5g39570 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 696 bits (1796), Expect = 2.86e-253
Identity = 332/332 (100.00%), Postives = 332/332 (100.00%), Query Frame = 0
Query: 1 MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP
Sbjct: 1 MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
Query: 61 VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF
Sbjct: 61 VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
Query: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS
Sbjct: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
Query: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF
Sbjct: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
Query: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG
Sbjct: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
Query: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA
Sbjct: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match:
XP_022927527.1 (uncharacterized protein LOC111434325 [Cucurbita moschata])
HSP 1 Score: 681 bits (1758), Expect = 1.78e-247
Identity = 325/332 (97.89%), Postives = 328/332 (98.80%), Query Frame = 0
Query: 1 MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTS+ NEFPQLIEYQP
Sbjct: 1 MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSSSNEFPQLIEYQP 60
Query: 61 VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
VDHGAYGYTISYSANACSASTFSVPKVIEYDPD YSDGYQKVSSQFVISYSVSEFNETEF
Sbjct: 61 VDHGAYGYTISYSANACSASTFSVPKVIEYDPDFYSDGYQKVSSQFVISYSVSEFNETEF 120
Query: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS
Sbjct: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
Query: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
SEIKPTQIEKDNTASESEEIEEV+AIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF
Sbjct: 181 SEIKPTQIEKDNTASESEEIEEVKAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
Query: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
GYWPCLSRIKKQT CRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG
Sbjct: 241 GYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
Query: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
DGVYGYQTQYQTEPVYGYVWLNQ+D VRSDDA
Sbjct: 301 DGVYGYQTQYQTEPVYGYVWLNQNDLVRSDDA 332
BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match:
KAG6584049.1 (hypothetical protein SDJN03_19981, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 679 bits (1753), Expect = 1.03e-246
Identity = 323/332 (97.29%), Postives = 327/332 (98.49%), Query Frame = 0
Query: 1 MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE+QP
Sbjct: 1 MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEHQP 60
Query: 61 VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
VDHGAYGYTISYSANACS STFSVPKVIEYDPDLYSDGYQK+SSQFVISYSVSEFNETEF
Sbjct: 61 VDHGAYGYTISYSANACSTSTFSVPKVIEYDPDLYSDGYQKMSSQFVISYSVSEFNETEF 120
Query: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS
Sbjct: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
Query: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
SEIKPTQIEKDNTASESEEIEEV+AIPFADPG GYGNGREVNQFPSGYGLEAMDLCESLF
Sbjct: 181 SEIKPTQIEKDNTASESEEIEEVKAIPFADPGTGYGNGREVNQFPSGYGLEAMDLCESLF 240
Query: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
GYWPCLSRIKKQT CRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGR EG
Sbjct: 241 GYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRCEG 300
Query: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
DGVYGYQ QYQTEPVYGYVWLNQ+DFVRSDDA
Sbjct: 301 DGVYGYQRQYQTEPVYGYVWLNQNDFVRSDDA 332
BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match:
XP_023001286.1 (uncharacterized protein LOC111495462 [Cucurbita maxima])
HSP 1 Score: 656 bits (1692), Expect = 2.77e-237
Identity = 319/340 (93.82%), Postives = 323/340 (95.00%), Query Frame = 0
Query: 1 MAFYDSY---YDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60
MAFYDSY YDSAQ EPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE
Sbjct: 1 MAFYDSYDSYYDSAQTEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60
Query: 61 YQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNE 120
+QPVDHGAYGYTISYSANACSASTFSVPKVIEYD DLYSDG QKVSSQFVISYSVSEFNE
Sbjct: 61 HQPVDHGAYGYTISYSANACSASTFSVPKVIEYDSDLYSDGTQKVSSQFVISYSVSEFNE 120
Query: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAI-----QEAPKEK 180
TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAI EAPKEK
Sbjct: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIPISAIHEAPKEK 180
Query: 181 IEEKTKPSSEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEA 240
IEEKT+PSSEIKPTQIEKDNTASESEEIEEV+AIPFADPGIGYGNGREVNQFPSGYGLEA
Sbjct: 181 IEEKTEPSSEIKPTQIEKDNTASESEEIEEVKAIPFADPGIGYGNGREVNQFPSGYGLEA 240
Query: 241 MDLCESLFGYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP 300
MDLCESLFGYWPCLSRIKKQT CRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP
Sbjct: 241 MDLCESLFGYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP 300
Query: 301 YPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
YPDGRSEGDGVYGYQ QYQ EPVY YVWLNQ+DFVRSDD
Sbjct: 301 YPDGRSEGDGVYGYQRQYQAEPVYRYVWLNQNDFVRSDDV 340
BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match:
KAG7019657.1 (hypothetical protein SDJN02_18620 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 594 bits (1531), Expect = 1.86e-213
Identity = 288/332 (86.75%), Postives = 292/332 (87.95%), Query Frame = 0
Query: 1 MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE+QP
Sbjct: 1 MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEHQP 60
Query: 61 VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQK+SSQFVISYSVSEFNETEF
Sbjct: 61 VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKMSSQFVISYSVSEFNETEF 120
Query: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
EEYDPTPYGGGYDIHETY EKTKPS
Sbjct: 121 EEYDPTPYGGGYDIHETY------------------------------------EKTKPS 180
Query: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
SEIKPTQIEKDNTASESEEIEEV+AIPFADPG GYGNGREVNQFPSGYGLEAMDLCESLF
Sbjct: 181 SEIKPTQIEKDNTASESEEIEEVKAIPFADPGTGYGNGREVNQFPSGYGLEAMDLCESLF 240
Query: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
GYWPCLSRIKKQT CRQ NNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGR EG
Sbjct: 241 GYWPCLSRIKKQTACRQLNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRCEG 296
Query: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
DGVYGYQTQYQTEPVYGYVWLNQ+DFVRSDDA
Sbjct: 301 DGVYGYQTQYQTEPVYGYVWLNQNDFVRSDDA 296
BLAST of Cp4.1LG20g03950 vs. ExPASy TrEMBL
Match:
A0A6J1EHF5 (uncharacterized protein LOC111434325 OS=Cucurbita moschata OX=3662 GN=LOC111434325 PE=4 SV=1)
HSP 1 Score: 681 bits (1758), Expect = 8.62e-248
Identity = 325/332 (97.89%), Postives = 328/332 (98.80%), Query Frame = 0
Query: 1 MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTS+ NEFPQLIEYQP
Sbjct: 1 MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSSSNEFPQLIEYQP 60
Query: 61 VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
VDHGAYGYTISYSANACSASTFSVPKVIEYDPD YSDGYQKVSSQFVISYSVSEFNETEF
Sbjct: 61 VDHGAYGYTISYSANACSASTFSVPKVIEYDPDFYSDGYQKVSSQFVISYSVSEFNETEF 120
Query: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS
Sbjct: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
Query: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
SEIKPTQIEKDNTASESEEIEEV+AIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF
Sbjct: 181 SEIKPTQIEKDNTASESEEIEEVKAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
Query: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
GYWPCLSRIKKQT CRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG
Sbjct: 241 GYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
Query: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
DGVYGYQTQYQTEPVYGYVWLNQ+D VRSDDA
Sbjct: 301 DGVYGYQTQYQTEPVYGYVWLNQNDLVRSDDA 332
BLAST of Cp4.1LG20g03950 vs. ExPASy TrEMBL
Match:
A0A6J1KI70 (uncharacterized protein LOC111495462 OS=Cucurbita maxima OX=3661 GN=LOC111495462 PE=4 SV=1)
HSP 1 Score: 656 bits (1692), Expect = 1.34e-237
Identity = 319/340 (93.82%), Postives = 323/340 (95.00%), Query Frame = 0
Query: 1 MAFYDSY---YDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60
MAFYDSY YDSAQ EPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE
Sbjct: 1 MAFYDSYDSYYDSAQTEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60
Query: 61 YQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNE 120
+QPVDHGAYGYTISYSANACSASTFSVPKVIEYD DLYSDG QKVSSQFVISYSVSEFNE
Sbjct: 61 HQPVDHGAYGYTISYSANACSASTFSVPKVIEYDSDLYSDGTQKVSSQFVISYSVSEFNE 120
Query: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAI-----QEAPKEK 180
TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAI EAPKEK
Sbjct: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIPISAIHEAPKEK 180
Query: 181 IEEKTKPSSEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEA 240
IEEKT+PSSEIKPTQIEKDNTASESEEIEEV+AIPFADPGIGYGNGREVNQFPSGYGLEA
Sbjct: 181 IEEKTEPSSEIKPTQIEKDNTASESEEIEEVKAIPFADPGIGYGNGREVNQFPSGYGLEA 240
Query: 241 MDLCESLFGYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP 300
MDLCESLFGYWPCLSRIKKQT CRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP
Sbjct: 241 MDLCESLFGYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP 300
Query: 301 YPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
YPDGRSEGDGVYGYQ QYQ EPVY YVWLNQ+DFVRSDD
Sbjct: 301 YPDGRSEGDGVYGYQRQYQAEPVYRYVWLNQNDFVRSDDV 340
BLAST of Cp4.1LG20g03950 vs. ExPASy TrEMBL
Match:
A0A1S3B404 (uncharacterized protein LOC103485767 OS=Cucumis melo OX=3656 GN=LOC103485767 PE=4 SV=1)
HSP 1 Score: 538 bits (1386), Expect = 2.40e-190
Identity = 274/368 (74.46%), Postives = 294/368 (79.89%), Query Frame = 0
Query: 1 MAFYDSY-------YDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAY----------A 60
MAFYDSY Y+SAQIEPPI QSS EPTFYNLFDYPPPCYFGQAY A
Sbjct: 17 MAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVGYFAINA 76
Query: 61 PYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVS 120
Y SNF+EFPQLIE++PVDHG YGY I YSANACSAS+F++PKV YDPDLYS+ VS
Sbjct: 77 AYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSE----VS 136
Query: 121 SQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPT 180
+QFVISYSVSEFNET+FEEYDPTPY GGYDI+ETYGKPLQPST+ICY PSSSSP KPPP
Sbjct: 137 TQFVISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPPPP 196
Query: 181 A-----------IQEAPKEKIEEKTKPSSEIKPTQIEKDN--------TASESEEIEEVQ 240
I EAPK KIEE+TKPSSEIKP QIEK N T SES EIEEV+
Sbjct: 197 TATAIPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSSSSDSDTTSESGEIEEVK 256
Query: 241 AIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNNGCGR 300
AI DPGIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQP NGCGR
Sbjct: 257 AIQLGDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGCGR 316
Query: 301 CHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQD 332
CHGHCYCYGNYGNQWQTAA+YLFGSHNPY DGR EGDG YGYQ Q+Q EPVYGYVWLNQ+
Sbjct: 317 CHGHCYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRQFQEEPVYGYVWLNQN 376
BLAST of Cp4.1LG20g03950 vs. ExPASy TrEMBL
Match:
A0A5D3DRV2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold861G00010 PE=4 SV=1)
HSP 1 Score: 536 bits (1382), Expect = 5.48e-190
Identity = 273/368 (74.18%), Postives = 294/368 (79.89%), Query Frame = 0
Query: 1 MAFYDSY-------YDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAY----------A 60
MAFYDSY Y+SAQIEPPI QSS EPTFYNLFDYPPPCYFGQAY A
Sbjct: 1 MAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVGYFAINA 60
Query: 61 PYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVS 120
Y SNF+EFPQLIE++PVDHG YGY I YSANACSAS+F++PKV YDPDLYS+ VS
Sbjct: 61 AYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSE----VS 120
Query: 121 SQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPT 180
+QFVISYSVSEFNET+FEEYDPTPY GGYDI+ETYGKPLQPST+ICY PSSSSP KPPP
Sbjct: 121 TQFVISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPPPP 180
Query: 181 A-----------IQEAPKEKIEEKTKPSSEIKPTQIEKDN--------TASESEEIEEVQ 240
I EAPK KIEE+TKPSSEIKP QIEK N T SES EIEEV+
Sbjct: 181 TATAIPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSYSSDSDTTSESGEIEEVK 240
Query: 241 AIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNNGCGR 300
AI DPGIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQP NGCGR
Sbjct: 241 AIQLGDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGCGR 300
Query: 301 CHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQD 332
CHGHCYCYGNYGNQWQTAA+YLFGSHNPY DGR EGDG YGYQ ++Q EPVYGYVWLNQ+
Sbjct: 301 CHGHCYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRRFQEEPVYGYVWLNQN 360
BLAST of Cp4.1LG20g03950 vs. ExPASy TrEMBL
Match:
A0A0A0LUY1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G070580 PE=4 SV=1)
HSP 1 Score: 520 bits (1338), Expect = 3.27e-183
Identity = 268/373 (71.85%), Postives = 290/373 (77.75%), Query Frame = 0
Query: 1 MAFY-------DSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAY----------A 60
MAFY DSYY+ AQIEPPIPQSS EP FYNLFDYPPPCYFGQAY A
Sbjct: 1 MAFYNSYDFYDDSYYNYAQIEPPIPQSSNEPNFYNLFDYPPPCYFGQAYDYEVGYSANDA 60
Query: 61 PYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVS 120
PY SNFNE PQLI+++PVDHG YGY I YSANACSAS+F++PK+ EY+PDLYS+ VS
Sbjct: 61 PYRSNFNELPQLIDHEPVDHGDYGYAIRYSANACSASSFTLPKLCEYNPDLYSE----VS 120
Query: 121 SQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSP-----P 180
+QFVISYSVS+FNETEFEEYDPTPY GGYDI ETYGKPLQPS +ICY PSSSSP P
Sbjct: 121 TQFVISYSVSQFNETEFEEYDPTPYDGGYDISETYGKPLQPSIEICYPPSSSSPSKSPPP 180
Query: 181 KPPPTA-----------IQEAPKEKIEEKTKPSSEIKPTQIEKDN--------TASESEE 240
PPPTA I EAPK KIEE+TKPSSEIKPTQIEK N T SES E
Sbjct: 181 PPPPTATAIPIITTIPKIDEAPKGKIEEQTKPSSEIKPTQIEKTNNSSSSDSDTTSESGE 240
Query: 241 IEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPN 300
IEE +AI DPGIGYGN REVN+FPSG GLEAMDLCESLFGYWPCLSR K+QT RQP
Sbjct: 241 IEEDKAIQLGDPGIGYGNAREVNEFPSGCGLEAMDLCESLFGYWPCLSRAKRQTAYRQPK 300
Query: 301 NGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYV 332
NGCGRCHGHCYCYGNYGN+WQTAA+YLFGSHNPY DGR EGD VYGYQ Q+Q EPVYGYV
Sbjct: 301 NGCGRCHGHCYCYGNYGNEWQTAAEYLFGSHNPYLDGRREGDVVYGYQRQFQEEPVYGYV 360
BLAST of Cp4.1LG20g03950 vs. TAIR 10
Match:
AT1G11440.1 (BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075.1); Has 19337 Blast hits to 8589 proteins in 488 species: Archae - 26; Bacteria - 641; Metazoa - 7852; Fungi - 2167; Plants - 955; Viruses - 616; Other Eukaryotes - 7080 (source: NCBI BLink). )
HSP 1 Score: 128.3 bits (321), Expect = 1.2e-29
Identity = 126/383 (32.90%), Postives = 168/383 (43.86%), Query Frame = 0
Query: 3 FYDSY---YDSAQIEPPIPQSSY-----------EPTFYNLFDYPPPCYFGQAYAPYTSN 62
FY++Y YD Q+ Q+ Y EP YN + N
Sbjct: 4 FYENYQSPYDYNQVNNLYDQNHYHYNQQQQQLGFEPMSYNYY-----------------N 63
Query: 63 FNEFPQLIEY-------QPVDHGAYGYTISYSAN-----ACSASTFSVPKVIEYDPDLYS 122
+NE EY P+ + Y + S S A S ST S PK + YDP+LY+
Sbjct: 64 WNESESESEYVAYSGYDDPMSYNCYNWNGSESETTSAYVAYSVSTMSEPKHLFYDPNLYT 123
Query: 123 DGYQKVSSQFVISYSVS---EFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPS 182
+ QF I SV+ +FNE EF+EYDPTPYGGGYD+ TYGKPL PS + CY P
Sbjct: 124 T--YESPPQFSIYCSVASALDFNEPEFDEYDPTPYGGGYDVVATYGKPLPPSVETCY-PC 183
Query: 183 SSSP----PKPP------PTAIQEAPKEKIEEK----TKPSSEIKPTQIEK--------- 242
S++P P PP P I + ++ + +K +P E+KP + K
Sbjct: 184 STAPHAKAPSPPEIIAPVPLGIYDGGQKNVVKKRVSFAEPVEEVKPIETIKEQEQEQDED 243
Query: 243 -----------DNTASESEEIEEVQAIPFADPGIGYGNGR-------EVNQF--PSGYGL 302
D+ E EE +E D YGN EV PSGYGL
Sbjct: 244 YDEESEDEDDGDDDDEEEEEGDEEAKEEEKDHSSSYGNEEYEVVDKGEVKALYVPSGYGL 303
Query: 303 EAMDLCESLF-GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGS 308
EA DLCE +F GY+PC+ R K++ Q C N + W+T +D+LFG
Sbjct: 304 EATDLCEVIFGGYFPCVLRNKRRQEDEQDRGAAVSC-----WESNDSDPWKTTSDHLFGD 361
BLAST of Cp4.1LG20g03950 vs. TAIR 10
Match:
AT3G29075.1 (glycine-rich protein )
HSP 1 Score: 49.7 bits (117), Expect = 5.3e-06
Identity = 23/47 (48.94%), Postives = 30/47 (63.83%), Query Frame = 0
Query: 110 YSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSS 157
Y+ + + +F EYDP PY GGYDI TYG+ + PS + CY SS S
Sbjct: 4 YTNDDNDVDDFTEYDPMPYSGGYDITVTYGRSIPPSDETCYPLSSLS 50
BLAST of Cp4.1LG20g03950 vs. TAIR 10
Match:
AT5G39570.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cytosol, nucleus; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 44.3 bits (103), Expect = 2.2e-04
Identity = 56/186 (30.11%), Postives = 78/186 (41.94%), Query Frame = 0
Query: 110 YSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSS-----SPPKPPPTA 169
Y+ + + +F+E+DPTPY GGYDI YG+P+ PS + CY SS +P T
Sbjct: 4 YTRDDNDVDDFDEFDPTPYSGGYDITVIYGRPIPPSDETCYPLSSGVDDDFEYERPEFTQ 63
Query: 170 IQEAPKEKIE---------EKTKPSSEIKP-------TQIEKDNTASESEEIEEVQAIPF 229
I E E + KP +P Q E+ N SE P
Sbjct: 64 IHEPSAYGDEALNTEYSSYSRPKPRPAFRPDSGGGGHVQGERPNPGYGSE--SGYGRKPE 123
Query: 230 ADPGIGYGNGREV-------NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNNG 268
++ G GYG EV + SGYG ES +G R + + G R+P +G
Sbjct: 124 SEYGSGYGGQTEVEYGRRPEQSYGSGYG--GRTETESEYGSGGG-GRTEVEYG-RRPESG 183
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023519612.1 | 2.86e-253 | 100.00 | uncharacterized protein At5g39570 [Cucurbita pepo subsp. pepo] | [more] |
XP_022927527.1 | 1.78e-247 | 97.89 | uncharacterized protein LOC111434325 [Cucurbita moschata] | [more] |
KAG6584049.1 | 1.03e-246 | 97.29 | hypothetical protein SDJN03_19981, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_023001286.1 | 2.77e-237 | 93.82 | uncharacterized protein LOC111495462 [Cucurbita maxima] | [more] |
KAG7019657.1 | 1.86e-213 | 86.75 | hypothetical protein SDJN02_18620 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1EHF5 | 8.62e-248 | 97.89 | uncharacterized protein LOC111434325 OS=Cucurbita moschata OX=3662 GN=LOC1114343... | [more] |
A0A6J1KI70 | 1.34e-237 | 93.82 | uncharacterized protein LOC111495462 OS=Cucurbita maxima OX=3661 GN=LOC111495462... | [more] |
A0A1S3B404 | 2.40e-190 | 74.46 | uncharacterized protein LOC103485767 OS=Cucumis melo OX=3656 GN=LOC103485767 PE=... | [more] |
A0A5D3DRV2 | 5.48e-190 | 74.18 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A0A0LUY1 | 3.27e-183 | 71.85 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G070580 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT1G11440.1 | 1.2e-29 | 32.90 | BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075... | [more] |
AT3G29075.1 | 5.3e-06 | 48.94 | glycine-rich protein | [more] |
AT5G39570.1 | 2.2e-04 | 30.11 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... | [more] |