Cp4.1LG07g11540 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG07g11540
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Unknown protein
Location: Cp4.1LG07: 9708179 .. 9713184 (+)
RNA-Seq Expression: Cp4.1LG07g11540
Synteny: Cp4.1LG07g11540
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TAAAGAAGGAAGAAGAAGAAAAGATGAACACGCGGTTGCGGTTTGTTTATGTTAACTGCTATTAACGGTATAAAATTACGGAGTTCCGCGGGATTCCGTAAAGTTTCAATGGAAGTCGCAATCGCAACAATATCCTATCCATTCATACCTTGTTTATCCCTTTCTATCCGGTATTAGGGATTTCGATTTCGGTTTTTCGATTTCGGTTTCGATTTCGGCATGTTTCTATGTTTTGATGTTTGTAGTTTGATTTTTTGCCGAGTATGGTGACGCCAGAGAGCTGGATTTCTGTTTGGATCGATCGATTTCTCTCTTGTTTGGGGTAAGTTCTCTTGCTCTGTGTCGATCGCCCTATTTCTGTTATCAATTTCTCTCTGGTTTTCGGAATTCTTTGATTCGAAGCATGCGGATTAGGAACTTAAGCGTTGCTCCGGATTGTCTGCTGTTGATTAAATTTGGTCGTCTGTGGGGACTTTTTATGGATTCGTGATATATTCATCTTATGATTGTCATTGTTATTTCAGAGGTTATATTTTGGATGACTGTTTTTAACTGATGGATGAATTTGTTCTACAAATATTTTCGGCTTCTCTTATGAATAGTTAGGAGCTTTGGGAGAGATTTCATTGATCTTAATCTGTAATTAAGGAATTTTTTTAGTGAAACAAATACCAGTTGGTTCTTAGTAGCAAGTCTTCATCTCTGAAGAAGGAGATGAAGATAGCTTCATTTGTCTTCCATGGATTGAACATTTACACATTGTGGCAGGCATATTTTTAATTAGGTTAATTTGTTCCCTTACTAATAAATTCTTTCATGACGAAGCATTGATCGAATTATGTCAGAAATGGACAAGTTTAAGTGAAAATTTGGGATATTATAGGAAACAGAATCCATGATAATCCAAGCAAAAATACATCAGGATATTTCTTCGTCTTGGCAATTTCGTGACCTCGTTGAAGAAGATGGACTATGGTAGGAGATCAAAGCTCTTAGATTGAGTGGATCATAATTGAACTCATCATTCAAATCGACTCTTGCTCACAAATGTATGCTCTTTAAATTAGTAGCAAGCTAACACATGTCGTATATCGTTAATTTATCAAGTGATTGATCATTTATCTTTCTTCAATTCACTAAAGTAGTTCACCTCTCTGTTACAATGTTTCATGTTTCGTTGTCATTAACTGAGTCAATGACATTGAGGGTTCATTTTTCTAAGTTAGTGATCTGCATAGTCGGTTTATTTCAAACTTCATTGAGGTCCCGAAGTTTAACCGTGTTATTGAATTCCTCTCTCGACTTAAAAGTATGATGGAGGCAATTAGAAATGGCAGTATCATGATGTTCATGCATTTGCATTTATCATTAATTCAACTGTTATGTTGTTGTCTGTTCTGCAATTATCCTAACCGCAGTTTTCAATATGATTCTAACAAAGGAAGTGTTTCTTTTTGTAGGAGCACCAAATGTGCACCTGCTATCTCCGGGAATAATCTGAATTCTAGGATGCCAAGCATGTCAGAAGATTTTTGGAGCACAAGCACGTGTGATCTCGATGAGTTGATTACGCTTCAATCTCGACAGAGCTCATTTATCAGCATAATAAATCACAACCCTAAACATGGAGGTGGCACTGACGATCTGAGCAATCATTCTGACTTTGTAAATCATGGTAAACTCAGATCTTATTTTCCACTGCTTGCTGGAAAGCAAAAAGGATGTTCCTTTGTTTGCACTTTCATTCGTCATATAGCTTTAGTATCATTAGTTCTATACCTGCTCTAGTCTGGTTTAAAGTGAATCGATTTGTGTCACACAAGCTTATATGCCAAATTGCAGGTTTGATTCTTTGGACTCAGACCAGGCTTCAATGGATCGGAAATTATGAGCCTGCGAAACGAACCAAAAAAAATCATTTTACAGGATTAAGGTGAGTCTAATTTTGACACAGCTACATACTCGTTCACGGCATCAACCTCGCTGTTAAATATGTCCAC
ATGCTTATAATACAAATGAAATCTGTCTCTTGCCGATATTCCTAGTAATTAACTGAATATTTTCTCATCGTATTCGCCAGTTGTTCGAAATTTTAGGTCATTACAAATCAAGTAGAAGTTTTTGAGTTGAAGCACACCCATTTTGGAAAGTCAAATTTCCAAATTTTGAGTCATTGAAAATTTTTGCCCCAACCATATTGCTTGGTTTCTGTTTGCAGTTGGTATATGACCAAAGACCTCTTGCTGGAAAACAAAAAACCTTTTCCTCGGCCCATACCTTTATCTGTAAGGCTGCTCTTTGCTGTTTCCTTTAATTGAGTACACATTCAAATCCTCTGCTTTGAATATGTAATCATGATACATTTGACAGGAAATGGTAGACTTTCTGATAGAAGAATGGGAAGAAGAAGGGCTGTATTATTGAGCCAATCAATGGAAATAAAACGCCATCACCTTGTACATACACCATGTTTGTGACTTTTTCAGAGTTTAATTAAACGGCCACTGCCTTAACTTGGGATCAATTCGGTATTAACAAACAAAATGACACCACCTGGTTGGCGGTACATTGAAGATCTCTAGGGGAGAAGCCAAGTGTGGAATAAACTGAAATCCAAGTTCTTCTGAACGATCGGCGAGCTAAACGGTGAGCCTCCTCCGACCATCAGCTAATATATATACTTCATACATACATATCGAGCTTTTTTCATTGTTTCTTCTACGCATGACAGTATGCATCAACTTTGTTGTGTGTTTAATACAATGTTGATGATTGACTTGTTGTAATTGATGTATAAGAATAGATATTAATCTCTAACCCACAAACTCTCTGGCTTTGAATTCTGATAATACTGAGCTTGATTTTGTGGTGATATTGCTGTCAACTTTGTGTTGGATGGTAGAGAACAAATAATAGACAAAGTCAAGTCCCCATTGTCCTTTTGTCTGGTGGTTGTGGCATTAAAAGACTATAAATTTTGAAGGGTTTATGGTTGAAGATGCAAAGATGCTTTTGAAACTCATTTTCAGCTGAGAATACTCACTGTTGTTAGCATATGGACCCTACCTCTTTAGATCAGAAATCTTATCTCCCATTCTGATAATATAGAATGTAGGTAGGAGGAAGTGGAATCATTTTTTGGTGGCATAAATGTTTTCACTCAAATTATGTACTTTATGAGTCACAATGAGAGAGAAAATGATCTTTGTTTTCTTTTTCAATGGTTCATATATATTGGAATATAATGGTACTCATCCCGAGTTTGTATGTTGCAGCAGTCAGATTCAAAGTATGATAGTGGCCTGCATAGAGATAGTTCCCGGTAAAATTTTCATTTTCTATAGTTGTTATATAATCTCTATGCATAGAAATATAATGGTATTCATTATGAGTTTGTACGTTGCAGTGGTCTGATTCGAAGTATGATAGTGGCTTGCATAGAGATAACTCCCGGTAAAATTTTCAGTGTTTAGGGTCAATAACTTATCTTATGAGATGAGTTGAAGTACTCATAGTACTCGTAGCTAGGTTTTATAAAGGTGAAAAACTACTTAGATGGTGTACGGAAAAAGATATCAATTTCTAACTATTGCATAGAGATAACTCCTGGTAAAATTTTCATTTTCTATAGTTGTTATATAATCTCTATGCATAGAAATATAATGGTATTCATTATGAGTTTGTACGTTGCAATGGTCAGATTCAAAGTATGATAGTAGCTTGCATAGAGATAACACCCGGTAAAATTTTCAGTGTTTAGGGTCAATAACTTATCATCTTATGAGACCAATTGAAGTACTCAGAGTACTCGTAGCTAGGTTTTATAAAGGTGAAATACTACTTAGATGATGTACGGAAAAAGATTTCAATTTCCGACGCTTGCATAGAGATAACTCCTGGTAAAATTTTCAGTGTTTAGGGTCAATAACTTATCTTATGTGATGAGTTGAAGTACTCATAGTACTCGTAGCTAGGTTTTATAAAAGTGAAAAACTACTTA
GATGATGTACGGAAAAAGATTTCAATTTCCGACTATTGCATAGAGATAACTCCCGGTAAAATTTTCAGTGTTTAGGGTCAATAACTTATCTTATGAGATGAGTTGAAGTACTCATAGTACTCGTAGCTAGGTTTTATAAAGGTGAAAAACTACTTAGATGGTGTACGGAAAAAGATATCAATTTCTAACTATTGCATAGAGATAACTCCTGGTAAAATTTTCATTTTCTATAGTTGTTATATAATCTCTATGCATAGAAATATAATGGTATTCATTATGAGTTTGTACGTTGCAATGGTCAGATTCAAAGTATGATAGTAGCTTGCATAGAGATAACACCCGGTAAAATTTTCAGTGTTTAGGGTCAATAACTTATCATCTTATGAGACCAATTGAAGTACTCAGAGTACTCGTAGCTAGGTTTTATAAAGGTGAAATACTACTTAGATGATGTACGGAAAAAGATTTCAATTTCCGACTATTGCATAGAGATAACTCCCGGTAAAATTTTCAGTGTTTAGGGTCAGTAACTTATCTTATGAGATGAGTTGAAGTACTCATTCATAGTACTCGTAGCTAGGTTTTATAAAGGTGAAAAAACTACTTGTACGGAAAAAGATTCAATTTCCGACTATTTATTTAAAGATTACATGAACTTGACTCGAAAAGTCTGTCGTTCAGAGTTTGACTATGGGCAATGAACTTTATCCCTTCATCTATATTCAATGAAAATTATGAAAAGCAAGCTGTCCGTCCTTCTTTGACTTTTTGAGGCCATTTAAAGAGATGAGACATTGTTGAAGCATGATGTGAACTGATTGAGAACTACTTTTTCTTCAATTAGCTTCACAAGGGACCCTTCCCAAAGAGATGAAAATCAAAATGAAATTATTTCATTACTTTATACTCCAAAGTTTGATCTCAAACAACGACTTTAAGATGATTATGTCGTGTTATTTCATATCAACTATAATTATTTCATTTACATTTTCTTAATGTTTTTGTC

mRNA sequence

TAAAGAAGGAAGAAGAAGAAAAGATGAACACGCGGTTGCGGTTTGTTTATGTTAACTGCTATTAACGGTATAAAATTACGGAGTTCCGCGGGATTCCGTAAAGTTTCAATGGAAGTCGCAATCGCAACAATATCCTATCCATTCATACCTTGTTTATCCCTTTCTATCCGGTATTAGGGATTTCGATTTCGGTTTTTCGATTTCGGTTTCGATTTCGGCATGTTTCTATGTTTTGATGTTTGTAGTTTGATTTTTTGCCGAGTATGGTGACGCCAGAGAGCTGGATTTCTGTTTGGATCGATCGATTTCTCTCTTGTTTGGGGAGCACCAAATGTGCACCTGCTATCTCCGGGAATAATCTGAATTCTAGGATGCCAAGCATGTCAGAAGATTTTTGGAGCACAAGCACGTGTGATCTCGATGAGTTGATTACGCTTCAATCTCGACAGAGCTCATTTATCAGCATAATAAATCACAACCCTAAACATGGAGGTGGCACTGACGATCTGAGCAATCATTCTGACTTTGTAAATCATGGTTTGATTCTTTGGACTCAGACCAGGCTTCAATGGATCGGAAATTATGAGCCTGCGAAACGAACCAAAAAAAATCATTTTACAGGATTAAGTTGGTATATGACCAAAGACCTCTTGCTGGAAAACAAAAAACCTTTTCCTCGGCCCATACCTTTATCTGAAATGGTAGACTTTCTGATAGAAGAATGGGAAGAAGAAGGGCTGTATTATTGAGCCAATCAATGGAAATAAAACGCCATCACCTTGTACATACACCATGTTTGTGACTTTTTCAGAGTTTAATTAAACGGCCACTGCCTTAACTTGGGATCAATTCGGTATTAACAAACAAAATGACACCACCTGGTTGGCGGTACATTGAAGATCTCTAGGGGAGAAGCCAAGTGTGGAATAAACTGAAATCCAAGTTCTTCTGAACGATCGGCGAGCTAAACGCAGTCAGATTCAAAGTATGATAGTGGCCTGCATAGAGATAGTTCCCGGTAAAATTTTCATTTTCTATAGTTGTTATATAATCTCTATGCATAGAAATATAATGGTATTCATTATGAGTTTGTACGTTGCAGTGGTCTGATTCGAAGTATGATAGTGGCTTGCATAGAGATAACTCCCGGTAAAATTTTCAGTGTTTAGGGTCAATAACTTATCTTATGAGATGAGTTGAAGTACTCATAGTACTCGTAGCTAGGTTTTATAAAGGTGAAAAACTACTTAGATGGTGTACGGAAAAAGATATCAATTTCTAACTATTGCATAGAGATAACTCCTGGTAAAATTTTCATTTTCTATAGTTGTTATATAATCTCTATGCATAGAAATATAATGGTATTCATTATGAGTTTGTACGTTGCAATGGTCAGATTCAAAGTATGATAGTAGCTTGCATAGAGATAACACCCGGTAAAATTTTCAGTGTTTAGGGTCAATAACTTATCATCTTATGAGACCAATTGAAGTACTCAGAGTACTCGTAGCTAGGTTTTATAAAGGTGAAATACTACTTAGATGATGTACGGAAAAAGATTTCAATTTCCGACGCTTGCATAGAGATAACTCCTGGTAAAATTTTCAGTGTTTAGGGTCAATAACTTATCTTATGTGATGAGTTGAAGTACTCATAGTACTCGTAGCTAGGTTTTATAAAAGTGAAAAACTACTTAGATGATGTACGGAAAAAGATTTCAATTTCCGACTATTGCATAGAGATAACTCCCGGTAAAATTTTCAGTGTTTAGGGTCAATAACTTATCTTATGAGATGAGTTGAAGTACTCATAGTACTCGTAGCTAGGTTTTATAAAGGTGAAAAACTACTTAGATGGTGTACGGAAAAAGATATCAATTTCTAACTATTGCATAGAGATAACTCCTGGTAAAATTTTCATTTTCTATAGTTGTTATATAATCTCTATGCATAGAAATATAATGGTATTCATTATGAGTTTGTACGTTGCAATGGTCAGAT
TCAAAGTATGATAGTAGCTTGCATAGAGATAACACCCGGTAAAATTTTCAGTGTTTAGGGTCAATAACTTATCATCTTATGAGACCAATTGAAGTACTCAGAGTACTCGTAGCTAGGTTTTATAAAGGTGAAATACTACTTAGATGATGTACGGAAAAAGATTTCAATTTCCGACTATTGCATAGAGATAACTCCCGGTAAAATTTTCAGTGTTTAGGGTCAGTAACTTATCTTATGAGATGAGTTGAAGTACTCATTCATAGTACTCGTAGCTAGGTTTTATAAAGGTGAAAAAACTACTTGTACGGAAAAAGATTCAATTTCCGACTATTTATTTAAAGATTACATGAACTTGACTCGAAAAGTCTGTCGTTCAGAGTTTGACTATGGGCAATGAACTTTATCCCTTCATCTATATTCAATGAAAATTATGAAAAGCAAGCTGTCCGTCCTTCTTTGACTTTTTGAGGCCATTTAAAGAGATGAGACATTGTTGAAGCATGATGTGAACTGATTGAGAACTACTTTTTCTTCAATTAGCTTCACAAGGGACCCTTCCCAAAGAGATGAAAATCAAAATGAAATTATTTCATTACTTTATACTCCAAAGTTTGATCTCAAACAACGACTTTAAGATGATTATGTCGTGTTATTTCATATCAACTATAATTATTTCATTTACATTTTCTTAATGTTTTTGTC

Coding sequence (CDS)

ATGGTGACGCCAGAGAGCTGGATTTCTGTTTGGATCGATCGATTTCTCTCTTGTTTGGGGAGCACCAAATGTGCACCTGCTATCTCCGGGAATAATCTGAATTCTAGGATGCCAAGCATGTCAGAAGATTTTTGGAGCACAAGCACGTGTGATCTCGATGAGTTGATTACGCTTCAATCTCGACAGAGCTCATTTATCAGCATAATAAATCACAACCCTAAACATGGAGGTGGCACTGACGATCTGAGCAATCATTCTGACTTTGTAAATCATGGTTTGATTCTTTGGACTCAGACCAGGCTTCAATGGATCGGAAATTATGAGCCTGCGAAACGAACCAAAAAAAATCATTTTACAGGATTAAGTTGGTATATGACCAAAGACCTCTTGCTGGAAAACAAAAAACCTTTTCCTCGGCCCATACCTTTATCTGAAATGGTAGACTTTCTGATAGAAGAATGGGAAGAAGAAGGGCTGTATTATTGA

Protein sequence

MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQSRQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTGLSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY
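
The CDS and protein sequence above can be cross-checked by translation. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene and that both fragments are read in frame. The names `translate`, `cds_start` and `cds_end` are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
# Assumption: this table is appropriate for the nuclear gene shown above.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed above.
cds_start = "ATGGTGACGCCAGAGAGCTGGATTTCTGTT"
cds_end = "GGGCTGTATTATTGA"

print(translate(cds_start))  # MVTPESWISV (first ten residues of the protein)
print(translate(cds_end))    # GLYY (last four residues, followed by the TGA stop)
```

Under the same assumption, running `translate` over the full CDS string should reproduce the protein sequence listed above.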
Homology
BLAST of Cp4.1LG07g11540 vs. NCBI nr
Match: XP_023538299.1 (uncharacterized protein LOC111799122 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 344 bits (883), Expect = 8.54e-120
Identity = 161/161 (100.00%), Positives = 161/161 (100.00%), Query Frame = 0

Query: 1   MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60
           MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS
Sbjct: 1   MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60

Query: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120
           RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG
Sbjct: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120

Query: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161
           LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY
Sbjct: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161

BLAST of Cp4.1LG07g11540 vs. NCBI nr
Match: XP_022965989.1 (uncharacterized protein LOC111465709 [Cucurbita maxima])

HSP 1 Score: 342 bits (876), Expect = 9.98e-119
Identity = 159/161 (98.76%), Positives = 160/161 (99.38%), Query Frame = 0

Query: 1   MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60
           MVTPESWISVWIDRFLSCLGSTKCAPAI+GNNLNSRMPSMSEDFWSTSTCDLDELITLQS
Sbjct: 1   MVTPESWISVWIDRFLSCLGSTKCAPAITGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60

Query: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120
           RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEP KRTKKNHFTG
Sbjct: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPTKRTKKNHFTG 120

Query: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161
           LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY
Sbjct: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161

BLAST of Cp4.1LG07g11540 vs. NCBI nr
Match: XP_022938229.1 (uncharacterized protein LOC111444373 [Cucurbita moschata])

HSP 1 Score: 340 bits (872), Expect = 1.72e-117
Identity = 159/161 (98.76%), Positives = 161/161 (100.00%), Query Frame = 0

Query: 1   MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60
           MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS
Sbjct: 42  MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 101

Query: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120
           RQSSFISIINHNPKHGGGTDDLSNHSDFVN+GLILWTQTRLQWIGNYEPAKRTKKNHFTG
Sbjct: 102 RQSSFISIINHNPKHGGGTDDLSNHSDFVNNGLILWTQTRLQWIGNYEPAKRTKKNHFTG 161

Query: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161
           LSWYMTK+LLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY
Sbjct: 162 LSWYMTKELLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 202

BLAST of Cp4.1LG07g11540 vs. NCBI nr
Match: KAG6586391.1 (hypothetical protein SDJN03_19124, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 309 bits (791), Expect = 1.97e-105
Identity = 144/146 (98.63%), Positives = 145/146 (99.32%), Query Frame = 0

Query: 1   MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60
           MVTPESWISVWIDRFLSCLGSTKC PAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS
Sbjct: 1   MVTPESWISVWIDRFLSCLGSTKCTPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60

Query: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120
           RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG
Sbjct: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120

Query: 121 LSWYMTKDLLLENKKPFPRPIPLSEM 146
           LSWYMTK+LLLENKKPFPRPIPLSEM
Sbjct: 121 LSWYMTKELLLENKKPFPRPIPLSEM 146

BLAST of Cp4.1LG07g11540 vs. NCBI nr
Match: KAG7021241.1 (hypothetical protein SDJN02_17929, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 298 bits (764), Expect = 1.25e-101
Identity = 142/151 (94.04%), Positives = 144/151 (95.36%), Query Frame = 0

Query: 15  FLSCLG----STKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQSRQSSFISIIN 74
           F+ CL     STKC PAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQSRQSSFISIIN
Sbjct: 12  FIPCLSLSIRSTKCTPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQSRQSSFISIIN 71

Query: 75  HNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTGLSWYMTKDLL 134
           HNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTGLSWYMTK+LL
Sbjct: 72  HNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTGLSWYMTKELL 131

Query: 135 LENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161
           LENKKPFPRPIPLSEMVDFLIEEWEEEGLYY
Sbjct: 132 LENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 162

BLAST of Cp4.1LG07g11540 vs. ExPASy TrEMBL
Match: A0A6J1HSF5 (uncharacterized protein LOC111465709 OS=Cucurbita maxima OX=3661 GN=LOC111465709 PE=4 SV=1)

HSP 1 Score: 342 bits (876), Expect = 4.83e-119
Identity = 159/161 (98.76%), Positives = 160/161 (99.38%), Query Frame = 0

Query: 1   MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60
           MVTPESWISVWIDRFLSCLGSTKCAPAI+GNNLNSRMPSMSEDFWSTSTCDLDELITLQS
Sbjct: 1   MVTPESWISVWIDRFLSCLGSTKCAPAITGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60

Query: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120
           RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEP KRTKKNHFTG
Sbjct: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPTKRTKKNHFTG 120

Query: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161
           LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY
Sbjct: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161

BLAST of Cp4.1LG07g11540 vs. ExPASy TrEMBL
Match: A0A6J1FDG3 (uncharacterized protein LOC111444373 OS=Cucurbita moschata OX=3662 GN=LOC111444373 PE=4 SV=1)

HSP 1 Score: 340 bits (872), Expect = 8.31e-118
Identity = 159/161 (98.76%), Positives = 161/161 (100.00%), Query Frame = 0

Query: 1   MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60
           MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS
Sbjct: 42  MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 101

Query: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120
           RQSSFISIINHNPKHGGGTDDLSNHSDFVN+GLILWTQTRLQWIGNYEPAKRTKKNHFTG
Sbjct: 102 RQSSFISIINHNPKHGGGTDDLSNHSDFVNNGLILWTQTRLQWIGNYEPAKRTKKNHFTG 161

Query: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161
           LSWYMTK+LLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY
Sbjct: 162 LSWYMTKELLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 202

BLAST of Cp4.1LG07g11540 vs. ExPASy TrEMBL
Match: A0A0A0LFZ6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G006290 PE=4 SV=1)

HSP 1 Score: 285 bits (728), Expect = 1.80e-96
Identity = 132/161 (81.99%), Positives = 143/161 (88.82%), Query Frame = 0

Query: 1   MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60
           MVT ESW SVWIDR LSCLGS K APAISGNNLNSRMPSMSEDFWSTSTCDLDEL+TLQS
Sbjct: 1   MVTRESWFSVWIDRLLSCLGSIKPAPAISGNNLNSRMPSMSEDFWSTSTCDLDELLTLQS 60

Query: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120
           RQ+SFIS  NHN  HGG  D+LSNHSDFVNHG +LWTQTRL+W+GN  PAKRTKKNH TG
Sbjct: 61  RQNSFISTTNHNSNHGGVIDNLSNHSDFVNHGFVLWTQTRLRWVGNCVPAKRTKKNHITG 120

Query: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161
           LSWYMTK+LLLE +KP+ R IPLS+MVDFL+EEWEEEGLYY
Sbjct: 121 LSWYMTKELLLETRKPYHRRIPLSDMVDFLVEEWEEEGLYY 161

BLAST of Cp4.1LG07g11540 vs. ExPASy TrEMBL
Match: A0A1S3CQL1 (uncharacterized protein LOC103503177 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103503177 PE=4 SV=1)

HSP 1 Score: 284 bits (726), Expect = 3.63e-96
Identity = 132/161 (81.99%), Positives = 143/161 (88.82%), Query Frame = 0

Query: 1   MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60
           MVT ESW SVWIDR LSCLGS K APAISGNNLNSRMPSMSEDFWSTSTCDLDEL+TLQS
Sbjct: 1   MVTRESWFSVWIDRLLSCLGSIKPAPAISGNNLNSRMPSMSEDFWSTSTCDLDELLTLQS 60

Query: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120
           RQ+SFIS  NHN  HGG  D+LSNHSDFVNHG +LWTQTRL+W+GN  PAKRTKK+H TG
Sbjct: 61  RQNSFISTTNHNSNHGGVIDNLSNHSDFVNHGFVLWTQTRLRWVGNCVPAKRTKKSHITG 120

Query: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161
           LSWYMTK+LLLE +KP+ R IPLSEMVDFL+EEWEEEGLYY
Sbjct: 121 LSWYMTKELLLETRKPYHRRIPLSEMVDFLVEEWEEEGLYY 161

BLAST of Cp4.1LG07g11540 vs. ExPASy TrEMBL
Match: A0A6J1G589 (uncharacterized protein LOC111450850 OS=Cucurbita moschata OX=3662 GN=LOC111450850 PE=4 SV=1)

HSP 1 Score: 279 bits (713), Expect = 3.48e-94
Identity = 128/161 (79.50%), Positives = 144/161 (89.44%), Query Frame = 0

Query: 1   MVTPESWISVWIDRFLSCLGSTKCAPAISGNNLNSRMPSMSEDFWSTSTCDLDELITLQS 60
           MVT ESW SVWIDRFLSCL  TK AP ISGNNLNSRM SMS+DFWSTSTCDLDE++TLQS
Sbjct: 1   MVTRESWFSVWIDRFLSCLRGTKLAPTISGNNLNSRMLSMSDDFWSTSTCDLDEMLTLQS 60

Query: 61  RQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKKNHFTG 120
           RQ+SFIS  ++NP HGG TD LSNHSDFVNHGLILWTQTRL+W+GN+E AKRTK+ H TG
Sbjct: 61  RQNSFISTTSYNPNHGGATDYLSNHSDFVNHGLILWTQTRLRWVGNHESAKRTKRKHLTG 120

Query: 121 LSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLYY 161
           LSWYMTK+L+LE+K+P+ R IPLSEMVDFL+EEWEEEGLYY
Sbjct: 121 LSWYMTKELMLESKRPYHRLIPLSEMVDFLVEEWEEEGLYY 161

BLAST of Cp4.1LG07g11540 vs. TAIR 10
Match: AT5G25360.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32342.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 118.6 bits (296), Expect = 4.5e-27
Identity = 72/165 (43.64%), Positives = 89/165 (53.94%), Query Frame = 0

Query: 6   SWISVWIDRFLSCLGSTKCAPAI-------SGNNLNSRM---PSMSEDFWSTSTCDLDEL 65
           SWI         C G     P I        G  +  R+   PS+SEDFWSTSTC++D  
Sbjct: 9   SWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDN- 68

Query: 66  ITLQSRQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKK 125
            TLQS++S       +N      T   SN ++FVNHGL LW QTR QW+ N    K+ K 
Sbjct: 69  STLQSQRSMSSISFTNNTSTSAST---SNPTEFVNHGLNLWNQTRQQWLANGTSQKKAKV 128

Query: 126 NHFTGLSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLY 161
              T +SW  T + LL   K F RPIPL EMVDFL++ WE+EGLY
Sbjct: 129 REPT-ISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLY 168

BLAST of Cp4.1LG07g11540 vs. TAIR 10
Match: AT5G25360.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32342.1). )

HSP 1 Score: 118.6 bits (296), Expect = 4.5e-27
Identity = 72/165 (43.64%), Positives = 89/165 (53.94%), Query Frame = 0

Query: 6   SWISVWIDRFLSCLGSTKCAPAI-------SGNNLNSRM---PSMSEDFWSTSTCDLDEL 65
           SWI         C G     P I        G  +  R+   PS+SEDFWSTSTC++D  
Sbjct: 9   SWIYQLFGCMGGCFGCCNKPPLIVAVDEPSKGLRIQGRLVKKPSVSEDFWSTSTCEMDN- 68

Query: 66  ITLQSRQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTRLQWIGNYEPAKRTKK 125
            TLQS++S       +N      T   SN ++FVNHGL LW QTR QW+ N    K+ K 
Sbjct: 69  STLQSQRSMSSISFTNNTSTSAST---SNPTEFVNHGLNLWNQTRQQWLANGTSQKKAKV 128

Query: 126 NHFTGLSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLY 161
              T +SW  T + LL   K F RPIPL EMVDFL++ WE+EGLY
Sbjct: 129 REPT-ISWNATYESLLGMNKRFSRPIPLPEMVDFLVDVWEQEGLY 168

BLAST of Cp4.1LG07g11540 vs. TAIR 10
Match: AT1G15350.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 105.1 bits (261), Expect = 5.2e-23
Identity = 57/126 (45.24%), Positives = 77/126 (61.11%), Query Frame = 0

Query: 36  RMPSMSEDFWSTSTCDLDELITLQSRQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLIL 95
           + PS+SEDFWSTST D+D  IT  S+ S  +S  N          + +   ++VN GL+L
Sbjct: 31  KKPSVSEDFWSTSTVDMDN-ITFPSQGS--LSSSNQTFDSQSAARNSNAPPEYVNQGLLL 90

Query: 96  WTQTRLQWIGNYEPAKRTKKNHFTGLSW-YMTKDLLLENKKPFPRPIPLSEMVDFLIEEW 155
           W QTR +W+G  +P      N    L+W   T D LL + K FP+PIPL+EMVDFL++ W
Sbjct: 91  WNQTRERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIW 150

Query: 156 EEEGLY 161
           E+EGLY
Sbjct: 151 EQEGLY 153

BLAST of Cp4.1LG07g11540 vs. TAIR 10
Match: AT1G15350.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G15770.2); Has 148 Blast hits to 148 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 141; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )

HSP 1 Score: 105.1 bits (261), Expect = 5.2e-23
Identity = 57/126 (45.24%), Positives = 77/126 (61.11%), Query Frame = 0

Query: 36  RMPSMSEDFWSTSTCDLDELITLQSRQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLIL 95
           + PS+SEDFWSTST D+D  IT  S+ S  +S  N          + +   ++VN GL+L
Sbjct: 31  KKPSVSEDFWSTSTVDMDN-ITFPSQGS--LSSSNQTFDSQSAARNSNAPPEYVNQGLLL 90

Query: 96  WTQTRLQWIGNYEPAKRTKKNHFTGLSW-YMTKDLLLENKKPFPRPIPLSEMVDFLIEEW 155
           W QTR +W+G  +P      N    L+W   T D LL + K FP+PIPL+EMVDFL++ W
Sbjct: 91  WNQTRERWVGKDKPNNPVDHNQGAKLNWNTATYDSLLGSNKLFPQPIPLTEMVDFLVDIW 150

Query: 156 EEEGLY 161
           E+EGLY
Sbjct: 151 EQEGLY 153

BLAST of Cp4.1LG07g11540 vs. TAIR 10
Match: AT4G32342.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G25360.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 104.0 bits (258), Expect = 1.2e-22
Identity = 58/120 (48.33%), Positives = 69/120 (57.50%), Query Frame = 0

Query: 41  SEDFWSTSTCDLDELITLQSRQSSFISIINHNPKHGGGTDDLSNHSDFVNHGLILWTQTR 100
           S+DFWSTSTCD+D  IT+QS+ S        NP         SN ++FVNHGLILW  TR
Sbjct: 52  SDDFWSTSTCDMDHNITIQSQSS--------NPPFDPQC-STSNSTEFVNHGLILWNHTR 111

Query: 101 LQWIGNYEPAKRTKKNHFTGLSWYMTKDLLLENKKPFPRPIPLSEMVDFLIEEWEEEGLY 160
            QW       +         +SW  T D LL   K FP+PIPL EMV FL++ WEEEGLY
Sbjct: 112 QQWRECLTRQQCLVPE--PAISWNSTYDSLLSTNKLFPQPIPLKEMVHFLVDVWEEEGLY 160
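
The percentages on each HSP summary line follow directly from the residue counts (matches divided by alignment length). The following is a small Python sketch that parses one such line and recomputes them, using the counts from the Cucurbita maxima hit above. The function name `parse_hsp_stats` and the returned keys are hypothetical, chosen for illustration; this is not the parser used by the database.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# \w* in the label tolerates spelling variants of "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract residue counts from an HSP summary line and recompute percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    return {
        "identical": int(id_n),
        "positives": int(pos_n),
        "aligned_length": int(id_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(id_n) / int(id_len), 2),
        "positives_pct": round(100 * int(pos_n) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(id_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 159/161 (98.76%), Positives = 160/161 (99.38%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a cheap consistency check when such records are collected by scraping.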

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value    Identity (%)  Description
XP_023538299.1  8.54e-120  100.00        uncharacterized protein LOC111799122 [Cucurbita pepo subsp. pepo]
XP_022965989.1  9.98e-119  98.76         uncharacterized protein LOC111465709 [Cucurbita maxima]
XP_022938229.1  1.72e-117  98.76         uncharacterized protein LOC111444373 [Cucurbita moschata]
KAG6586391.1    1.97e-105  98.63         hypothetical protein SDJN03_19124, partial [Cucurbita argyrosperma subsp. sorori...
KAG7021241.1    1.25e-101  94.04         hypothetical protein SDJN02_17929, partial [Cucurbita argyrosperma subsp. argyro...

ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A6J1HSF5      4.83e-119  98.76         uncharacterized protein LOC111465709 OS=Cucurbita maxima OX=3661 GN=LOC111465709...
A0A6J1FDG3      8.31e-118  98.76         uncharacterized protein LOC111444373 OS=Cucurbita moschata OX=3662 GN=LOC1114443...
A0A0A0LFZ6      1.80e-96   81.99         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G006290 PE=4 SV=1
A0A1S3CQL1      3.63e-96   81.99         uncharacterized protein LOC103503177 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1G589      3.48e-94   79.50         uncharacterized protein LOC111450850 OS=Cucurbita moschata OX=3662 GN=LOC1114508...

TAIR 10
Match Name      E-value    Identity (%)  Description
AT5G25360.1     4.5e-27    43.64         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G25360.2     4.5e-27    43.64         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G15350.2     5.2e-23    45.24         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G15350.1     5.2e-23    45.24         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G32342.1     1.2e-22    48.33         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source   Source Term     Source Description      Alignment
IPR025124  Domain of unknown function DUF4050  PFAM     PF13259         DUF4050                 coord: 49..116, e-value: 1.5E-6, score: 28.6
                                                                                                coord: 123..160, e-value: 4.1E-7, score: 30.5
None       No IPR available                    PANTHER  PTHR33373       OS07G0479600 PROTEIN    coord: 1..160
None       No IPR available                    PANTHER  PTHR33373:SF13  DUF4050 FAMILY PROTEIN  coord: 1..160

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG07g11540.1  Cp4.1LG07g11540.1  mRNA