Cp4.1LG20g03950 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG20g03950
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionBEST Arabidopsis thaliana protein match is: glycine-rich protein .
LocationCp4.1LG20: 2240151 .. 2241149 (-)
RNA-Seq ExpressionCp4.1LG20g03950
SyntenyCp4.1LG20g03950
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCTTCTACGATTCCTACTACGATTCTGCTCAAATTGAGCCTCCAATTCCGCAATCCAGCTACGAACCCACTTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGGTCAGGCTTATGCTCCCTACACATCCAATTTCAATGAATTCCCCCAATTGATCGAGTATCAACCCGTTGACCATGGGGCTTATGGCTATACAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGACTTTCAGTGTCCCAAAAGTGATCGAATACGACCCTGATTTGTATAGCGATGGTTACCAAAAGGTGTCGTCCCAATTTGTGATCTCCTACTCTGTTTCAGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGTGGTGGCTACGACATTCATGAAACCTACGGTAAGCCCCTTCAACCTTCAACTGACATTTGCTACTCACCCTCCTCTTCTTCACCTCCAAAACCCCCACCCACCGCAATTCAGGAGGCACCAAAGGAAAAAATTGAAGAAAAAACAAAGCCGTCGAGCGAAATCAAGCCGACCCAGATCGAGAAAGATAACACGGCATCTGAATCTGAAGAAATTGAGGAAGTTCAAGCGATTCCCTTTGCAGATCCGGGAATAGGGTATGGAAATGGAAGGGAAGTGAACCAATTTCCAAGTGGGTATGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAGCAAACAGGTTGTAGACAACCCAACAACGGCTGTGGGCGTTGCCATGGGCATTGCTATTGCTATGGCAATTACGGCAACCAGTGGCAGACGGCGGCGGACTATCTATTCGGAAGCCATAACCCATATCCAGATGGAAGGAGTGAAGGAGACGGTGTTTATGGGTATCAAACACAGTATCAAACGGAGCCTGTCTATGGCTACGTTTGGTTGAATCAAGACGACTTCGTTCGGTCCGATGACGCTTGA

mRNA sequence

ATGGCCTTCTACGATTCCTACTACGATTCTGCTCAAATTGAGCCTCCAATTCCGCAATCCAGCTACGAACCCACTTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGGTCAGGCTTATGCTCCCTACACATCCAATTTCAATGAATTCCCCCAATTGATCGAGTATCAACCCGTTGACCATGGGGCTTATGGCTATACAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGACTTTCAGTGTCCCAAAAGTGATCGAATACGACCCTGATTTGTATAGCGATGGTTACCAAAAGGTGTCGTCCCAATTTGTGATCTCCTACTCTGTTTCAGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGTGGTGGCTACGACATTCATGAAACCTACGGTAAGCCCCTTCAACCTTCAACTGACATTTGCTACTCACCCTCCTCTTCTTCACCTCCAAAACCCCCACCCACCGCAATTCAGGAGGCACCAAAGGAAAAAATTGAAGAAAAAACAAAGCCGTCGAGCGAAATCAAGCCGACCCAGATCGAGAAAGATAACACGGCATCTGAATCTGAAGAAATTGAGGAAGTTCAAGCGATTCCCTTTGCAGATCCGGGAATAGGGTATGGAAATGGAAGGGAAGTGAACCAATTTCCAAGTGGGTATGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAGCAAACAGGTTGTAGACAACCCAACAACGGCTGTGGGCGTTGCCATGGGCATTGCTATTGCTATGGCAATTACGGCAACCAGTGGCAGACGGCGGCGGACTATCTATTCGGAAGCCATAACCCATATCCAGATGGAAGGAGTGAAGGAGACGGTGTTTATGGGTATCAAACACAGTATCAAACGGAGCCTGTCTATGGCTACGTTTGGTTGAATCAAGACGACTTCGTTCGGTCCGATGACGCTTGA

Coding sequence (CDS)

ATGGCCTTCTACGATTCCTACTACGATTCTGCTCAAATTGAGCCTCCAATTCCGCAATCCAGCTACGAACCCACTTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGGTCAGGCTTATGCTCCCTACACATCCAATTTCAATGAATTCCCCCAATTGATCGAGTATCAACCCGTTGACCATGGGGCTTATGGCTATACAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGACTTTCAGTGTCCCAAAAGTGATCGAATACGACCCTGATTTGTATAGCGATGGTTACCAAAAGGTGTCGTCCCAATTTGTGATCTCCTACTCTGTTTCAGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGTGGTGGCTACGACATTCATGAAACCTACGGTAAGCCCCTTCAACCTTCAACTGACATTTGCTACTCACCCTCCTCTTCTTCACCTCCAAAACCCCCACCCACCGCAATTCAGGAGGCACCAAAGGAAAAAATTGAAGAAAAAACAAAGCCGTCGAGCGAAATCAAGCCGACCCAGATCGAGAAAGATAACACGGCATCTGAATCTGAAGAAATTGAGGAAGTTCAAGCGATTCCCTTTGCAGATCCGGGAATAGGGTATGGAAATGGAAGGGAAGTGAACCAATTTCCAAGTGGGTATGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATTAAAAAGCAAACAGGTTGTAGACAACCCAACAACGGCTGTGGGCGTTGCCATGGGCATTGCTATTGCTATGGCAATTACGGCAACCAGTGGCAGACGGCGGCGGACTATCTATTCGGAAGCCATAACCCATATCCAGATGGAAGGAGTGAAGGAGACGGTGTTTATGGGTATCAAACACAGTATCAAACGGAGCCTGTCTATGGCTACGTTTGGTTGAATCAAGACGACTTCGTTCGGTCCGATGACGCTTGA

Protein sequence

MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPSSEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA
Homology
BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match: XP_023519612.1 (uncharacterized protein At5g39570 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 696 bits (1796), Expect = 2.86e-253
Identity = 332/332 (100.00%), Postives = 332/332 (100.00%), Query Frame = 0

Query: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
           MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP
Sbjct: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60

Query: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
           VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF
Sbjct: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120

Query: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
           EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS
Sbjct: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180

Query: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
           SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF
Sbjct: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240

Query: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
           GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG
Sbjct: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300

Query: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
           DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA
Sbjct: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332

BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match: XP_022927527.1 (uncharacterized protein LOC111434325 [Cucurbita moschata])

HSP 1 Score: 681 bits (1758), Expect = 1.78e-247
Identity = 325/332 (97.89%), Postives = 328/332 (98.80%), Query Frame = 0

Query: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
           MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTS+ NEFPQLIEYQP
Sbjct: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSSSNEFPQLIEYQP 60

Query: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
           VDHGAYGYTISYSANACSASTFSVPKVIEYDPD YSDGYQKVSSQFVISYSVSEFNETEF
Sbjct: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDFYSDGYQKVSSQFVISYSVSEFNETEF 120

Query: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
           EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS
Sbjct: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180

Query: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
           SEIKPTQIEKDNTASESEEIEEV+AIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF
Sbjct: 181 SEIKPTQIEKDNTASESEEIEEVKAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240

Query: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
           GYWPCLSRIKKQT CRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG
Sbjct: 241 GYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300

Query: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
           DGVYGYQTQYQTEPVYGYVWLNQ+D VRSDDA
Sbjct: 301 DGVYGYQTQYQTEPVYGYVWLNQNDLVRSDDA 332

BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match: KAG6584049.1 (hypothetical protein SDJN03_19981, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 679 bits (1753), Expect = 1.03e-246
Identity = 323/332 (97.29%), Postives = 327/332 (98.49%), Query Frame = 0

Query: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
           MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE+QP
Sbjct: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEHQP 60

Query: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
           VDHGAYGYTISYSANACS STFSVPKVIEYDPDLYSDGYQK+SSQFVISYSVSEFNETEF
Sbjct: 61  VDHGAYGYTISYSANACSTSTFSVPKVIEYDPDLYSDGYQKMSSQFVISYSVSEFNETEF 120

Query: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
           EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS
Sbjct: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180

Query: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
           SEIKPTQIEKDNTASESEEIEEV+AIPFADPG GYGNGREVNQFPSGYGLEAMDLCESLF
Sbjct: 181 SEIKPTQIEKDNTASESEEIEEVKAIPFADPGTGYGNGREVNQFPSGYGLEAMDLCESLF 240

Query: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
           GYWPCLSRIKKQT CRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGR EG
Sbjct: 241 GYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRCEG 300

Query: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
           DGVYGYQ QYQTEPVYGYVWLNQ+DFVRSDDA
Sbjct: 301 DGVYGYQRQYQTEPVYGYVWLNQNDFVRSDDA 332

BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match: XP_023001286.1 (uncharacterized protein LOC111495462 [Cucurbita maxima])

HSP 1 Score: 656 bits (1692), Expect = 2.77e-237
Identity = 319/340 (93.82%), Postives = 323/340 (95.00%), Query Frame = 0

Query: 1   MAFYDSY---YDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60
           MAFYDSY   YDSAQ EPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE
Sbjct: 1   MAFYDSYDSYYDSAQTEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60

Query: 61  YQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNE 120
           +QPVDHGAYGYTISYSANACSASTFSVPKVIEYD DLYSDG QKVSSQFVISYSVSEFNE
Sbjct: 61  HQPVDHGAYGYTISYSANACSASTFSVPKVIEYDSDLYSDGTQKVSSQFVISYSVSEFNE 120

Query: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAI-----QEAPKEK 180
           TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAI      EAPKEK
Sbjct: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIPISAIHEAPKEK 180

Query: 181 IEEKTKPSSEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEA 240
           IEEKT+PSSEIKPTQIEKDNTASESEEIEEV+AIPFADPGIGYGNGREVNQFPSGYGLEA
Sbjct: 181 IEEKTEPSSEIKPTQIEKDNTASESEEIEEVKAIPFADPGIGYGNGREVNQFPSGYGLEA 240

Query: 241 MDLCESLFGYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP 300
           MDLCESLFGYWPCLSRIKKQT CRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP
Sbjct: 241 MDLCESLFGYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP 300

Query: 301 YPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
           YPDGRSEGDGVYGYQ QYQ EPVY YVWLNQ+DFVRSDD 
Sbjct: 301 YPDGRSEGDGVYGYQRQYQAEPVYRYVWLNQNDFVRSDDV 340

BLAST of Cp4.1LG20g03950 vs. NCBI nr
Match: KAG7019657.1 (hypothetical protein SDJN02_18620 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 594 bits (1531), Expect = 1.86e-213
Identity = 288/332 (86.75%), Postives = 292/332 (87.95%), Query Frame = 0

Query: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
           MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE+QP
Sbjct: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEHQP 60

Query: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
           VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQK+SSQFVISYSVSEFNETEF
Sbjct: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKMSSQFVISYSVSEFNETEF 120

Query: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
           EEYDPTPYGGGYDIHETY                                    EKTKPS
Sbjct: 121 EEYDPTPYGGGYDIHETY------------------------------------EKTKPS 180

Query: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
           SEIKPTQIEKDNTASESEEIEEV+AIPFADPG GYGNGREVNQFPSGYGLEAMDLCESLF
Sbjct: 181 SEIKPTQIEKDNTASESEEIEEVKAIPFADPGTGYGNGREVNQFPSGYGLEAMDLCESLF 240

Query: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
           GYWPCLSRIKKQT CRQ NNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGR EG
Sbjct: 241 GYWPCLSRIKKQTACRQLNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRCEG 296

Query: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
           DGVYGYQTQYQTEPVYGYVWLNQ+DFVRSDDA
Sbjct: 301 DGVYGYQTQYQTEPVYGYVWLNQNDFVRSDDA 296

BLAST of Cp4.1LG20g03950 vs. ExPASy TrEMBL
Match: A0A6J1EHF5 (uncharacterized protein LOC111434325 OS=Cucurbita moschata OX=3662 GN=LOC111434325 PE=4 SV=1)

HSP 1 Score: 681 bits (1758), Expect = 8.62e-248
Identity = 325/332 (97.89%), Postives = 328/332 (98.80%), Query Frame = 0

Query: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIEYQP 60
           MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTS+ NEFPQLIEYQP
Sbjct: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSSSNEFPQLIEYQP 60

Query: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNETEF 120
           VDHGAYGYTISYSANACSASTFSVPKVIEYDPD YSDGYQKVSSQFVISYSVSEFNETEF
Sbjct: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDFYSDGYQKVSSQFVISYSVSEFNETEF 120

Query: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180
           EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS
Sbjct: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIQEAPKEKIEEKTKPS 180

Query: 181 SEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240
           SEIKPTQIEKDNTASESEEIEEV+AIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF
Sbjct: 181 SEIKPTQIEKDNTASESEEIEEVKAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLF 240

Query: 241 GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300
           GYWPCLSRIKKQT CRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG
Sbjct: 241 GYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEG 300

Query: 301 DGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
           DGVYGYQTQYQTEPVYGYVWLNQ+D VRSDDA
Sbjct: 301 DGVYGYQTQYQTEPVYGYVWLNQNDLVRSDDA 332

BLAST of Cp4.1LG20g03950 vs. ExPASy TrEMBL
Match: A0A6J1KI70 (uncharacterized protein LOC111495462 OS=Cucurbita maxima OX=3661 GN=LOC111495462 PE=4 SV=1)

HSP 1 Score: 656 bits (1692), Expect = 1.34e-237
Identity = 319/340 (93.82%), Postives = 323/340 (95.00%), Query Frame = 0

Query: 1   MAFYDSY---YDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60
           MAFYDSY   YDSAQ EPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE
Sbjct: 1   MAFYDSYDSYYDSAQTEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60

Query: 61  YQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVSSQFVISYSVSEFNE 120
           +QPVDHGAYGYTISYSANACSASTFSVPKVIEYD DLYSDG QKVSSQFVISYSVSEFNE
Sbjct: 61  HQPVDHGAYGYTISYSANACSASTFSVPKVIEYDSDLYSDGTQKVSSQFVISYSVSEFNE 120

Query: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAI-----QEAPKEK 180
           TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAI      EAPKEK
Sbjct: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIPISAIHEAPKEK 180

Query: 181 IEEKTKPSSEIKPTQIEKDNTASESEEIEEVQAIPFADPGIGYGNGREVNQFPSGYGLEA 240
           IEEKT+PSSEIKPTQIEKDNTASESEEIEEV+AIPFADPGIGYGNGREVNQFPSGYGLEA
Sbjct: 181 IEEKTEPSSEIKPTQIEKDNTASESEEIEEVKAIPFADPGIGYGNGREVNQFPSGYGLEA 240

Query: 241 MDLCESLFGYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP 300
           MDLCESLFGYWPCLSRIKKQT CRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP
Sbjct: 241 MDLCESLFGYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNP 300

Query: 301 YPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQDDFVRSDDA 332
           YPDGRSEGDGVYGYQ QYQ EPVY YVWLNQ+DFVRSDD 
Sbjct: 301 YPDGRSEGDGVYGYQRQYQAEPVYRYVWLNQNDFVRSDDV 340

BLAST of Cp4.1LG20g03950 vs. ExPASy TrEMBL
Match: A0A1S3B404 (uncharacterized protein LOC103485767 OS=Cucumis melo OX=3656 GN=LOC103485767 PE=4 SV=1)

HSP 1 Score: 538 bits (1386), Expect = 2.40e-190
Identity = 274/368 (74.46%), Postives = 294/368 (79.89%), Query Frame = 0

Query: 1   MAFYDSY-------YDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAY----------A 60
           MAFYDSY       Y+SAQIEPPI QSS EPTFYNLFDYPPPCYFGQAY          A
Sbjct: 17  MAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVGYFAINA 76

Query: 61  PYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVS 120
            Y SNF+EFPQLIE++PVDHG YGY I YSANACSAS+F++PKV  YDPDLYS+    VS
Sbjct: 77  AYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSE----VS 136

Query: 121 SQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPT 180
           +QFVISYSVSEFNET+FEEYDPTPY GGYDI+ETYGKPLQPST+ICY PSSSSP KPPP 
Sbjct: 137 TQFVISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPPPP 196

Query: 181 A-----------IQEAPKEKIEEKTKPSSEIKPTQIEKDN--------TASESEEIEEVQ 240
                       I EAPK KIEE+TKPSSEIKP QIEK N        T SES EIEEV+
Sbjct: 197 TATAIPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSSSSDSDTTSESGEIEEVK 256

Query: 241 AIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNNGCGR 300
           AI   DPGIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQP NGCGR
Sbjct: 257 AIQLGDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGCGR 316

Query: 301 CHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQD 332
           CHGHCYCYGNYGNQWQTAA+YLFGSHNPY DGR EGDG YGYQ Q+Q EPVYGYVWLNQ+
Sbjct: 317 CHGHCYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRQFQEEPVYGYVWLNQN 376

BLAST of Cp4.1LG20g03950 vs. ExPASy TrEMBL
Match: A0A5D3DRV2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold861G00010 PE=4 SV=1)

HSP 1 Score: 536 bits (1382), Expect = 5.48e-190
Identity = 273/368 (74.18%), Postives = 294/368 (79.89%), Query Frame = 0

Query: 1   MAFYDSY-------YDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAY----------A 60
           MAFYDSY       Y+SAQIEPPI QSS EPTFYNLFDYPPPCYFGQAY          A
Sbjct: 1   MAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVGYFAINA 60

Query: 61  PYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVS 120
            Y SNF+EFPQLIE++PVDHG YGY I YSANACSAS+F++PKV  YDPDLYS+    VS
Sbjct: 61  AYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSE----VS 120

Query: 121 SQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPT 180
           +QFVISYSVSEFNET+FEEYDPTPY GGYDI+ETYGKPLQPST+ICY PSSSSP KPPP 
Sbjct: 121 TQFVISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPPPP 180

Query: 181 A-----------IQEAPKEKIEEKTKPSSEIKPTQIEKDN--------TASESEEIEEVQ 240
                       I EAPK KIEE+TKPSSEIKP QIEK N        T SES EIEEV+
Sbjct: 181 TATAIPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSYSSDSDTTSESGEIEEVK 240

Query: 241 AIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNNGCGR 300
           AI   DPGIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQP NGCGR
Sbjct: 241 AIQLGDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGCGR 300

Query: 301 CHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQD 332
           CHGHCYCYGNYGNQWQTAA+YLFGSHNPY DGR EGDG YGYQ ++Q EPVYGYVWLNQ+
Sbjct: 301 CHGHCYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRRFQEEPVYGYVWLNQN 360

BLAST of Cp4.1LG20g03950 vs. ExPASy TrEMBL
Match: A0A0A0LUY1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G070580 PE=4 SV=1)

HSP 1 Score: 520 bits (1338), Expect = 3.27e-183
Identity = 268/373 (71.85%), Postives = 290/373 (77.75%), Query Frame = 0

Query: 1   MAFY-------DSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAY----------A 60
           MAFY       DSYY+ AQIEPPIPQSS EP FYNLFDYPPPCYFGQAY          A
Sbjct: 1   MAFYNSYDFYDDSYYNYAQIEPPIPQSSNEPNFYNLFDYPPPCYFGQAYDYEVGYSANDA 60

Query: 61  PYTSNFNEFPQLIEYQPVDHGAYGYTISYSANACSASTFSVPKVIEYDPDLYSDGYQKVS 120
           PY SNFNE PQLI+++PVDHG YGY I YSANACSAS+F++PK+ EY+PDLYS+    VS
Sbjct: 61  PYRSNFNELPQLIDHEPVDHGDYGYAIRYSANACSASSFTLPKLCEYNPDLYSE----VS 120

Query: 121 SQFVISYSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSP-----P 180
           +QFVISYSVS+FNETEFEEYDPTPY GGYDI ETYGKPLQPS +ICY PSSSSP     P
Sbjct: 121 TQFVISYSVSQFNETEFEEYDPTPYDGGYDISETYGKPLQPSIEICYPPSSSSPSKSPPP 180

Query: 181 KPPPTA-----------IQEAPKEKIEEKTKPSSEIKPTQIEKDN--------TASESEE 240
            PPPTA           I EAPK KIEE+TKPSSEIKPTQIEK N        T SES E
Sbjct: 181 PPPPTATAIPIITTIPKIDEAPKGKIEEQTKPSSEIKPTQIEKTNNSSSSDSDTTSESGE 240

Query: 241 IEEVQAIPFADPGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPN 300
           IEE +AI   DPGIGYGN REVN+FPSG GLEAMDLCESLFGYWPCLSR K+QT  RQP 
Sbjct: 241 IEEDKAIQLGDPGIGYGNAREVNEFPSGCGLEAMDLCESLFGYWPCLSRAKRQTAYRQPK 300

Query: 301 NGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYV 332
           NGCGRCHGHCYCYGNYGN+WQTAA+YLFGSHNPY DGR EGD VYGYQ Q+Q EPVYGYV
Sbjct: 301 NGCGRCHGHCYCYGNYGNEWQTAAEYLFGSHNPYLDGRREGDVVYGYQRQFQEEPVYGYV 360

BLAST of Cp4.1LG20g03950 vs. TAIR 10
Match: AT1G11440.1 (BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075.1); Has 19337 Blast hits to 8589 proteins in 488 species: Archae - 26; Bacteria - 641; Metazoa - 7852; Fungi - 2167; Plants - 955; Viruses - 616; Other Eukaryotes - 7080 (source: NCBI BLink). )

HSP 1 Score: 128.3 bits (321), Expect = 1.2e-29
Identity = 126/383 (32.90%), Postives = 168/383 (43.86%), Query Frame = 0

Query: 3   FYDSY---YDSAQIEPPIPQSSY-----------EPTFYNLFDYPPPCYFGQAYAPYTSN 62
           FY++Y   YD  Q+     Q+ Y           EP  YN +                 N
Sbjct: 4   FYENYQSPYDYNQVNNLYDQNHYHYNQQQQQLGFEPMSYNYY-----------------N 63

Query: 63  FNEFPQLIEY-------QPVDHGAYGYTISYSAN-----ACSASTFSVPKVIEYDPDLYS 122
           +NE     EY        P+ +  Y +  S S       A S ST S PK + YDP+LY+
Sbjct: 64  WNESESESEYVAYSGYDDPMSYNCYNWNGSESETTSAYVAYSVSTMSEPKHLFYDPNLYT 123

Query: 123 DGYQKVSSQFVISYSVS---EFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPS 182
               +   QF I  SV+   +FNE EF+EYDPTPYGGGYD+  TYGKPL PS + CY P 
Sbjct: 124 T--YESPPQFSIYCSVASALDFNEPEFDEYDPTPYGGGYDVVATYGKPLPPSVETCY-PC 183

Query: 183 SSSP----PKPP------PTAIQEAPKEKIEEK----TKPSSEIKPTQIEK--------- 242
           S++P    P PP      P  I +  ++ + +K     +P  E+KP +  K         
Sbjct: 184 STAPHAKAPSPPEIIAPVPLGIYDGGQKNVVKKRVSFAEPVEEVKPIETIKEQEQEQDED 243

Query: 243 -----------DNTASESEEIEEVQAIPFADPGIGYGNGR-------EVNQF--PSGYGL 302
                      D+   E EE +E       D    YGN         EV     PSGYGL
Sbjct: 244 YDEESEDEDDGDDDDEEEEEGDEEAKEEEKDHSSSYGNEEYEVVDKGEVKALYVPSGYGL 303

Query: 303 EAMDLCESLF-GYWPCLSRIKKQTGCRQPNNGCGRCHGHCYCYGNYGNQWQTAADYLFGS 308
           EA DLCE +F GY+PC+ R K++    Q       C        N  + W+T +D+LFG 
Sbjct: 304 EATDLCEVIFGGYFPCVLRNKRRQEDEQDRGAAVSC-----WESNDSDPWKTTSDHLFGD 361

BLAST of Cp4.1LG20g03950 vs. TAIR 10
Match: AT3G29075.1 (glycine-rich protein )

HSP 1 Score: 49.7 bits (117), Expect = 5.3e-06
Identity = 23/47 (48.94%), Postives = 30/47 (63.83%), Query Frame = 0

Query: 110 YSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSS 157
           Y+  + +  +F EYDP PY GGYDI  TYG+ + PS + CY  SS S
Sbjct: 4   YTNDDNDVDDFTEYDPMPYSGGYDITVTYGRSIPPSDETCYPLSSLS 50

BLAST of Cp4.1LG20g03950 vs. TAIR 10
Match: AT5G39570.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cytosol, nucleus; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 44.3 bits (103), Expect = 2.2e-04
Identity = 56/186 (30.11%), Postives = 78/186 (41.94%), Query Frame = 0

Query: 110 YSVSEFNETEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSS-----SPPKPPPTA 169
           Y+  + +  +F+E+DPTPY GGYDI   YG+P+ PS + CY  SS         +P  T 
Sbjct: 4   YTRDDNDVDDFDEFDPTPYSGGYDITVIYGRPIPPSDETCYPLSSGVDDDFEYERPEFTQ 63

Query: 170 IQEAPKEKIE---------EKTKPSSEIKP-------TQIEKDNTASESEEIEEVQAIPF 229
           I E      E          + KP    +P        Q E+ N    SE        P 
Sbjct: 64  IHEPSAYGDEALNTEYSSYSRPKPRPAFRPDSGGGGHVQGERPNPGYGSE--SGYGRKPE 123

Query: 230 ADPGIGYGNGREV-------NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTGCRQPNNG 268
           ++ G GYG   EV         + SGYG       ES +G      R + + G R+P +G
Sbjct: 124 SEYGSGYGGQTEVEYGRRPEQSYGSGYG--GRTETESEYGSGGG-GRTEVEYG-RRPESG 183

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023519612.12.86e-253100.00uncharacterized protein At5g39570 [Cucurbita pepo subsp. pepo][more]
XP_022927527.11.78e-24797.89uncharacterized protein LOC111434325 [Cucurbita moschata][more]
KAG6584049.11.03e-24697.29hypothetical protein SDJN03_19981, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_023001286.12.77e-23793.82uncharacterized protein LOC111495462 [Cucurbita maxima][more]
KAG7019657.11.86e-21386.75hypothetical protein SDJN02_18620 [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
A0A6J1EHF58.62e-24897.89uncharacterized protein LOC111434325 OS=Cucurbita moschata OX=3662 GN=LOC1114343... [more]
A0A6J1KI701.34e-23793.82uncharacterized protein LOC111495462 OS=Cucurbita maxima OX=3661 GN=LOC111495462... [more]
A0A1S3B4042.40e-19074.46uncharacterized protein LOC103485767 OS=Cucumis melo OX=3656 GN=LOC103485767 PE=... [more]
A0A5D3DRV25.48e-19074.18Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A0A0LUY13.27e-18371.85Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G070580 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G11440.11.2e-2932.90BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075... [more]
AT3G29075.15.3e-0648.94glycine-rich protein [more]
AT5G39570.12.2e-0430.11FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 145..200
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 168..194
NoneNo IPR availablePANTHERPTHR33971:SF3OS02G0743600 PROTEINcoord: 1..322
IPR038943PLD-regulated protein1-likePANTHERPTHR33971OS06G0232000 PROTEINcoord: 1..322

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g03950.1Cp4.1LG20g03950.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0070300 phosphatidic acid binding