Cp4.1LG14g10070.1 (mRNA) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG14g10070.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionUnknown protein
LocationCp4.1LG14: 8454502 .. 8459063 (+)
Sequence length1358
RNA-Seq ExpressionCp4.1LG14g10070.1
SyntenyCp4.1LG14g10070.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
AGATTTTGGGTTGGCATAAAGATTCAATATAGAAAATTTCCAACGCTAAAACGGCGTCGTACGGTGCCCATTGGTCGAACAATTATTGGAAAATGCCAATTTAATGGAGTGGTTGGCTACAGGCGGCCATTCTAGAACGGCGAAAAATTAGTCGAAGTTCGTGTTTGTGGCGACCACACGTCTGAACTTGCTCCTTCTCTCTTCTTCGCTGCGATTTTCCGGCGTCAATCACTCGAAGAGATTCACTTTCTTCGGTATTTTATCTCATTTCAAACGCTTCGGGTTTCTCTTTTCTTGATTTTTCCATGTTTTTCGATGTTAATTTTGTGCTTTCTTCATTTTTCAAGCTGAAGGAAAATCCAATTTGTAGGGCTTCGTATGATCAGATATTAGTGTTCCGAATTCTCCTGCTTATGAAATTTCAGTGTGTGTGTTTTTATGTCGTCGCCATGATTGTCTTGAAATTTTGTACGAATTACTCTCTCGCTGCGGGCGGTTGATGTGATTGATTTTACGGCTGGAGAAATTCGTCCGGCTTTCTTCCTGTTCGTGCCGACGGTACATGCTATTGAGTTGTTTAGATCTTGTATTCTGTGGTTGGTTTGTTATTTGAAAGGTTTAATTGGAGCATAAGGTTTTCAAAATCTTCTATCATCCAGGAGTTTGTTTGCGCGGAGTTCCTTGTTTGGTTGTAAGCTATAGATTATTTATTGGTCACCATTTATAGTTGTCAATTTCTCTTTGACTTCGTCCTTTTTTTCTGGACTCTGGTTGTCGAAGATTGACGTGACGGAGAATCGACTTTATGATGTGTTTTTCTGTTTGAACGAGTTTGATTCAGCACTGGCCTCGTGAGGGTGGGTGTACCTCGAAATTTAAAACCATGTAATTCTAATATCTAATGTATGAGGAAGAACAAACTACCTTTTGTTGGCCTTTTGGAATTGGGGCGTGTGAAATAAGTAGCGACTTGGGGAAGGGGAAATAGGATCAAACTACAAAGTAAGGGGAAGTTTAGTATGTATGTGTGAGATCCCACATCGATTGGGGAGGGGAACAAAACATTCTTCATAAGGGTGTGGAAACTTCTCCCTAGCAGACAACATTTTCAAAACCTTGAGGGAAAGCCCAAAAGGGAAAACTCAAAGAGGACAATATCTGCTAGGGGTGACTTGGGCTGTTATAAATGGTATTAGAGCCAGACATCGGGCGATGTGCCAGTGAGGAGGCTGAGCTCCGAAGGGGAGGATGTGTTGAGATATATTATTTATGGAATATATCCTAGATATTATATCTTAATATTCTTCAATTTATGGTTAGATTTATAGTATATTTTATTTATTTACCAGATTTAGTTAGATATATATTTTTTTTCTATTTTTAGGTATTAGTTGATAGTTTGTATCCTATTTAAACGTGGTAAACATGAATGAAGATCATACTTTCGATCCCAATTCTATTTCTATTTCTCATTCTTAACAGGATGGATTGGAGGGTCTCCACGTAGATCGAAGAAGGGAACGAGTGCCTGCGAGGACGCTGGGCCTTGAAGCAGGGGGGAGGGGGGTGGACACAAGGTGGTGTGCCAGCAAGGACGTTGGCCCCAAAGGAGGGATGGATTGGAGGGTCCCACATCACATCAATTGAAGAAGGGAACGAGTGAAGGGGGGGGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAAGGGTGGATTGTGAGATCCACACCAGTGGATTTATAAGGGTGTGGAAGCTTCTCCCTAGTAGATGCATTTTAAAAACCTTAGTACCACCCATTTTAACAAAGAGAACAATATCTGCTAGTAGTGGGCTTGGGCTTAAAACCCTTGAGGGGGAGCCCAACAATATCTGCTAGTAGTGGGCTTGGGCTTAAAACCCTTGAGGGGAGCCCAAAGAGGACAATATCTGCTAGTAGTGGCCTTGGGCCATTACAAGTACAAGACAAATTTCGTGTGAGATTAGTTAGGGTTCATGTAAGATAGCTCGTATACACATAGTACAAAAACAAAATGAAATTGAAGAGAAGCTATGATATGAAATGAACTGATATATTTGCAGTGTTATATAAAGTTCCCTTGAGCTTTGCTTTTGCCCTTCGTGTCTTAAAACATGGAAGTTGATAATTATTTAACCTAGAGGGAATCTTGACCTTTCTACTTTCTATCCCCTTCTGTTTCTTATCTCCGTATGCTTGCAGGAAAACTCTTACTGAGAAGCAGTAATGGACTTTATATTCAGAGGAATGAATGATGAGTCATCCGAGTGCTACTCTAATAATGAAATGGACATTCAGAGATGCCCTTTCCTCAGGAATATCAATAAACCAACCTGCTTTTCCTTCTCCACCATGAATTTCACCTTCCCGGTAACTTCCTTATCTGCTATCCCCTGCACATTTACAGTCCCAGATCCGTTTTAGAGAAGTCGTTATATCGTTTTGTATTTTCAGGTTCGGGGAGCGAAAGGGCCAATTTTTGAGGATGGTCCTAATTTTGACACAGCGTTTAAGCTCTTTCATGGTAAAGATGGGGTCGTTCCACTATCTGAGAAATCCATCTTTGAGACCTCATTGCCAGAACCAGAGACTGCTCCTCATTTCAACCCCCTGGCAGCAAAAGCTGCAACCATCAGTCTGTCGGCTTTCGGGCCGGGAGGGCCATTCAGCTTCGGGAGCTTCTCTGAGAAATGGAAGAAACAAAAGTCTGAGGCTTCCAACAAGAAAGATCATTCGTCACAGGTAGATTTGCCTTCTCTTTAAGCTTTAAACTTGACACACAAAGTTGGATCATCTGTAGTACTTTTTATAAGGGGTAGCTACTTCAATGGAGACTGAACTTCATAGTTAATACAATTTGCTTACAAAAATTTGATTTTCGTTACGTCTGTTGCACGGGGCGGGGCATAGCTTTGGACATGAAAAAAGGGCAAGAATTTGACATGCATAGAAATCTTGAATCTAGCCTGGTTATGAAGGAAAGCTAGAAAATATGGGACTTAGAACTTGTGATAATATCGCATGTGGGTGGGAGATAGTCTGATTCTATAATGTTTGGAAGACTTTTACCCAAGTTCATGTGATGTCCCACATTGGTTGGGGAGGAGAACAAACCACCCTTTATAAGGGTGTGGAAACCTTCTCCTAGTAGACGCGTTTTAAAGCCTTGAGGGGAAGCCCGAAGGGGAAAGCCCAAAGAGGACAATATCTGCTAGCGGAGGATCTAGGGACGCATGTCAGATCTTTCTTATTCCAAACAACACAACTTTAAGATCATTTCTAGATTTTCAATTTGAATTATAGTTGTTTGCTCGATTAAAGTTGGGGAGGCTTTTGCTCATGGGCCTTCTTCTTCCTGTATTAATTTCATATCTGTTTCTTGCATGCTTTCCAGAAAAAAGGAGGTTCATCGAAGCACGAGGCTCTGGGAAGTGAGTGGCTGGAAACAGGAAACTGCCCAATTGCCAAGTCTTTTAGAGCTGTTAGCGGAGTCCTCCCTCTTGTTGCAACAGCTTTTCAGCCACCACCTGGTATGAAGCTTAAGTGCCCGCCTGCTATAGTCGCTGCCCGAGCTGCCCTTGCTCGTACAGCATTCGTGAAAAATCTGCGTCCTCAACCGCTTCCTTCAAAAATGCTCGTCATTGCAGCCTTAGGCATGGCAGCAAACGTTCCTCTTGGTATATGGAGGGAGCACACTAAGAAGTTTTCATTTTCATGGTTTGTGGCAATTCATGCAGCTGTACCCTTTATTGGCATGCTTCGGAAATGTGTCTTGATGCCAAAGACAGCCATGGCAATGACCATTGCAGCTTCTATACTAGGGCAGGTAATTGGTTCGAGGGCTGAACGTATGCGACTGAAAGCTATTTCCGAGAAGGGGAAGGCAACGACAGTTATACCAACGCTAGATACTACTCCAAGCTATGAGTTAACCCAGGTCGATGCCATTGTAGGTGGTCGTTGTGGCGTTGAAAGAATGGTGTTCGATCCTCTCCAGAAGGATGGTAGGCAGACGTCAACCCCGGCAAATGTATGCTCGTAA

mRNA sequence

AGATTTTGGGTTGGCATAAAGATTCAATATAGAAAATTTCCAACGCTAAAACGGCGTCGTACGGTGCCCATTGGTCGAACAATTATTGGAAAATGCCAATTTAATGGAGTGGTTGGCTACAGGCGGCCATTCTAGAACGGCGAAAAATTAGTCGAAGTTCGTGTTTGTGGCGACCACACGTCTGAACTTGCTCCTTCTCTCTTCTTCGCTGCGATTTTCCGGCGTCAATCACTCGAAGAGATTCACTTTCTTCGGAAAACTCTTACTGAGAAGCAGTAATGGACTTTATATTCAGAGGAATGAATGATGAGTCATCCGAGTGCTACTCTAATAATGAAATGGACATTCAGAGATGCCCTTTCCTCAGGAATATCAATAAACCAACCTGCTTTTCCTTCTCCACCATGAATTTCACCTTCCCGGTTCGGGGAGCGAAAGGGCCAATTTTTGAGGATGGTCCTAATTTTGACACAGCGTTTAAGCTCTTTCATGGTAAAGATGGGGTCGTTCCACTATCTGAGAAATCCATCTTTGAGACCTCATTGCCAGAACCAGAGACTGCTCCTCATTTCAACCCCCTGGCAGCAAAAGCTGCAACCATCAGTCTGTCGGCTTTCGGGCCGGGAGGGCCATTCAGCTTCGGGAGCTTCTCTGAGAAATGGAAGAAACAAAAGTCTGAGGCTTCCAACAAGAAAGATCATTCGTCACAGAAAAAAGGAGGTTCATCGAAGCACGAGGCTCTGGGAAGTGAGTGGCTGGAAACAGGAAACTGCCCAATTGCCAAGTCTTTTAGAGCTGTTAGCGGAGTCCTCCCTCTTGTTGCAACAGCTTTTCAGCCACCACCTGGTATGAAGCTTAAGTGCCCGCCTGCTATAGTCGCTGCCCGAGCTGCCCTTGCTCGTACAGCATTCGTGAAAAATCTGCGTCCTCAACCGCTTCCTTCAAAAATGCTCGTCATTGCAGCCTTAGGCATGGCAGCAAACGTTCCTCTTGGTATATGGAGGGAGCACACTAAGAAGTTTTCATTTTCATGGTTTGTGGCAATTCATGCAGCTGTACCCTTTATTGGCATGCTTCGGAAATGTGTCTTGATGCCAAAGACAGCCATGGCAATGACCATTGCAGCTTCTATACTAGGGCAGGTAATTGGTTCGAGGGCTGAACGTATGCGACTGAAAGCTATTTCCGAGAAGGGGAAGGCAACGACAGTTATACCAACGCTAGATACTACTCCAAGCTATGAGTTAACCCAGGTCGATGCCATTGTAGGTGGTCGTTGTGGCGTTGAAAGAATGGTGTTCGATCCTCTCCAGAAGGATGGTAGGCAGACGTCAACCCCGGCAAATGTATGCTCGTAA

Coding sequence (CDS)

ATGGACTTTATATTCAGAGGAATGAATGATGAGTCATCCGAGTGCTACTCTAATAATGAAATGGACATTCAGAGATGCCCTTTCCTCAGGAATATCAATAAACCAACCTGCTTTTCCTTCTCCACCATGAATTTCACCTTCCCGGTTCGGGGAGCGAAAGGGCCAATTTTTGAGGATGGTCCTAATTTTGACACAGCGTTTAAGCTCTTTCATGGTAAAGATGGGGTCGTTCCACTATCTGAGAAATCCATCTTTGAGACCTCATTGCCAGAACCAGAGACTGCTCCTCATTTCAACCCCCTGGCAGCAAAAGCTGCAACCATCAGTCTGTCGGCTTTCGGGCCGGGAGGGCCATTCAGCTTCGGGAGCTTCTCTGAGAAATGGAAGAAACAAAAGTCTGAGGCTTCCAACAAGAAAGATCATTCGTCACAGAAAAAAGGAGGTTCATCGAAGCACGAGGCTCTGGGAAGTGAGTGGCTGGAAACAGGAAACTGCCCAATTGCCAAGTCTTTTAGAGCTGTTAGCGGAGTCCTCCCTCTTGTTGCAACAGCTTTTCAGCCACCACCTGGTATGAAGCTTAAGTGCCCGCCTGCTATAGTCGCTGCCCGAGCTGCCCTTGCTCGTACAGCATTCGTGAAAAATCTGCGTCCTCAACCGCTTCCTTCAAAAATGCTCGTCATTGCAGCCTTAGGCATGGCAGCAAACGTTCCTCTTGGTATATGGAGGGAGCACACTAAGAAGTTTTCATTTTCATGGTTTGTGGCAATTCATGCAGCTGTACCCTTTATTGGCATGCTTCGGAAATGTGTCTTGATGCCAAAGACAGCCATGGCAATGACCATTGCAGCTTCTATACTAGGGCAGGTAATTGGTTCGAGGGCTGAACGTATGCGACTGAAAGCTATTTCCGAGAAGGGGAAGGCAACGACAGTTATACCAACGCTAGATACTACTCCAAGCTATGAGTTAACCCAGGTCGATGCCATTGTAGGTGGTCGTTGTGGCGTTGAAAGAATGGTGTTCGATCCTCTCCAGAAGGATGGTAGGCAGACGTCAACCCCGGCAAATGTATGCTCGTAA

Protein sequence

MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDGPNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFSFGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPLVATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGIWREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLKAISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS
Homology
BLAST of Cp4.1LG14g10070.1 vs. NCBI nr
Match: XP_023552670.1 (uncharacterized protein LOC111810249 [Cucurbita pepo subsp. pepo] >XP_023552671.1 uncharacterized protein LOC111810249 [Cucurbita pepo subsp. pepo] >XP_023552672.1 uncharacterized protein LOC111810249 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 714 bits (1844), Expect = 1.14e-259
Identity = 359/359 (100.00%), Postives = 359/359 (100.00%), Query Frame = 0

Query: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60
           MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG
Sbjct: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60

Query: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120
           PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS
Sbjct: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120

Query: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180
           FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL
Sbjct: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180

Query: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240
           VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI
Sbjct: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240

Query: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300
           WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK
Sbjct: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300

Query: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 359
           AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS
Sbjct: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 359

BLAST of Cp4.1LG14g10070.1 vs. NCBI nr
Match: KAG7015791.1 (hypothetical protein SDJN02_23429 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 707 bits (1826), Expect = 6.08e-257
Identity = 358/359 (99.72%), Postives = 358/359 (99.72%), Query Frame = 0

Query: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60
           MDFIFRGMNDESSECYSN EMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG
Sbjct: 1   MDFIFRGMNDESSECYSN-EMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60

Query: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120
           PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS
Sbjct: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120

Query: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180
           FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL
Sbjct: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180

Query: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240
           VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI
Sbjct: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240

Query: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300
           WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK
Sbjct: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300

Query: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 359
           AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS
Sbjct: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 358

BLAST of Cp4.1LG14g10070.1 vs. NCBI nr
Match: XP_022923335.1 (uncharacterized protein LOC111431059 [Cucurbita moschata] >XP_022923336.1 uncharacterized protein LOC111431059 [Cucurbita moschata])

HSP 1 Score: 706 bits (1821), Expect = 3.52e-256
Identity = 357/359 (99.44%), Postives = 357/359 (99.44%), Query Frame = 0

Query: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60
           MDFIFRGMNDESSECYSN EMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG
Sbjct: 1   MDFIFRGMNDESSECYSN-EMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60

Query: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120
           PNFDTAFKLFHGKDGVVPL EKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS
Sbjct: 61  PNFDTAFKLFHGKDGVVPLPEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120

Query: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180
           FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL
Sbjct: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180

Query: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240
           VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI
Sbjct: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240

Query: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300
           WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK
Sbjct: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300

Query: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 359
           AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS
Sbjct: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 358

BLAST of Cp4.1LG14g10070.1 vs. NCBI nr
Match: XP_022965273.1 (uncharacterized protein LOC111465188 [Cucurbita maxima] >XP_022965274.1 uncharacterized protein LOC111465188 [Cucurbita maxima])

HSP 1 Score: 703 bits (1814), Expect = 4.26e-255
Identity = 353/359 (98.33%), Postives = 356/359 (99.16%), Query Frame = 0

Query: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60
           MDFIFRGMND+SSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG
Sbjct: 1   MDFIFRGMNDKSSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60

Query: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120
           PNFDTAFKLFHGKDGVVPLSEKS FET LPEPETAPHFNPLAAKAATISLSAFGPGGPFS
Sbjct: 61  PNFDTAFKLFHGKDGVVPLSEKSSFETLLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120

Query: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180
           FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL
Sbjct: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180

Query: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240
           VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI
Sbjct: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240

Query: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300
           WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK
Sbjct: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300

Query: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 359
           AI+EKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVER VFDPLQK+GRQTSTPANVCS
Sbjct: 301 AIAEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERRVFDPLQKNGRQTSTPANVCS 359

BLAST of Cp4.1LG14g10070.1 vs. NCBI nr
Match: XP_038904883.1 (uncharacterized protein LOC120091115 [Benincasa hispida] >XP_038904884.1 uncharacterized protein LOC120091115 [Benincasa hispida] >XP_038905444.1 uncharacterized protein LOC120091474 [Benincasa hispida])

HSP 1 Score: 649 bits (1674), Expect = 8.91e-234
Identity = 328/359 (91.36%), Postives = 343/359 (95.54%), Query Frame = 0

Query: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60
           M+FI RGMNDE SECYSN EMDIQRCPFLRNINKPTCFSFST++FT PVRGAKGPIFEDG
Sbjct: 1   MEFILRGMNDEGSECYSN-EMDIQRCPFLRNINKPTCFSFSTLSFTLPVRGAKGPIFEDG 60

Query: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120
           PNFDTAFKLFHGKDGVVPLSE++ FE    EPETAPHFNPLAAKAATISLSAFGPGGPFS
Sbjct: 61  PNFDTAFKLFHGKDGVVPLSERTGFEKISSEPETAPHFNPLAAKAATISLSAFGPGGPFS 120

Query: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180
           FG+FSEKWKKQKSEASNK DHSS+KK  SSKHEALG+EWLETGNCPIAKSFRAVSGVLPL
Sbjct: 121 FGNFSEKWKKQKSEASNKNDHSSKKKESSSKHEALGNEWLETGNCPIAKSFRAVSGVLPL 180

Query: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240
           VATAFQ PPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI
Sbjct: 181 VATAFQLPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240

Query: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300
           WREHT+KFSFSWFVAIHAAVPFI MLRK VLMPKTAMAMTIAAS+LGQVIGSRAERMRLK
Sbjct: 241 WREHTQKFSFSWFVAIHAAVPFIAMLRKSVLMPKTAMAMTIAASVLGQVIGSRAERMRLK 300

Query: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 359
           AI+EKGKATTVIP L++TPSYELTQVDAIVGGRCGVERMVFDPL+KDGRQTS+PANVCS
Sbjct: 301 AIAEKGKATTVIPALESTPSYELTQVDAIVGGRCGVERMVFDPLRKDGRQTSSPANVCS 358

BLAST of Cp4.1LG14g10070.1 vs. ExPASy TrEMBL
Match: A0A6J1E6J0 (uncharacterized protein LOC111431059 OS=Cucurbita moschata OX=3662 GN=LOC111431059 PE=4 SV=1)

HSP 1 Score: 706 bits (1821), Expect = 1.70e-256
Identity = 357/359 (99.44%), Postives = 357/359 (99.44%), Query Frame = 0

Query: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60
           MDFIFRGMNDESSECYSN EMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG
Sbjct: 1   MDFIFRGMNDESSECYSN-EMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60

Query: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120
           PNFDTAFKLFHGKDGVVPL EKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS
Sbjct: 61  PNFDTAFKLFHGKDGVVPLPEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120

Query: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180
           FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL
Sbjct: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180

Query: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240
           VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI
Sbjct: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240

Query: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300
           WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK
Sbjct: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300

Query: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 359
           AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS
Sbjct: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 358

BLAST of Cp4.1LG14g10070.1 vs. ExPASy TrEMBL
Match: A0A6J1HNE2 (uncharacterized protein LOC111465188 OS=Cucurbita maxima OX=3661 GN=LOC111465188 PE=4 SV=1)

HSP 1 Score: 703 bits (1814), Expect = 2.06e-255
Identity = 353/359 (98.33%), Postives = 356/359 (99.16%), Query Frame = 0

Query: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60
           MDFIFRGMND+SSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG
Sbjct: 1   MDFIFRGMNDKSSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60

Query: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120
           PNFDTAFKLFHGKDGVVPLSEKS FET LPEPETAPHFNPLAAKAATISLSAFGPGGPFS
Sbjct: 61  PNFDTAFKLFHGKDGVVPLSEKSSFETLLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120

Query: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180
           FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL
Sbjct: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180

Query: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240
           VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI
Sbjct: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240

Query: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300
           WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK
Sbjct: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300

Query: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 359
           AI+EKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVER VFDPLQK+GRQTSTPANVCS
Sbjct: 301 AIAEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERRVFDPLQKNGRQTSTPANVCS 359

BLAST of Cp4.1LG14g10070.1 vs. ExPASy TrEMBL
Match: A0A6J1CWI6 (uncharacterized protein LOC111014867 OS=Momordica charantia OX=3673 GN=LOC111014867 PE=4 SV=1)

HSP 1 Score: 640 bits (1650), Expect = 2.03e-230
Identity = 321/360 (89.17%), Postives = 342/360 (95.00%), Query Frame = 0

Query: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60
           M+FI +GMNDE+SEC S+NEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG
Sbjct: 1   MEFILKGMNDEASEC-SSNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60

Query: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120
           PNFDTAF LFHGKDGVVPLSE+S F++ L EPETAPHFNPLAAKAATISLSAFGPGGPFS
Sbjct: 61  PNFDTAFNLFHGKDGVVPLSERSGFQSKLAEPETAPHFNPLAAKAATISLSAFGPGGPFS 120

Query: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180
           FG+FSEKWKKQKSE+SNKKD SSQKKG SSKHEALGSEWLETGNCPIAKSFRAVSGVLPL
Sbjct: 121 FGNFSEKWKKQKSESSNKKDSSSQKKGSSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180

Query: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240
           VA+A QPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI
Sbjct: 181 VASALQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240

Query: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300
           WREHT+KFSFSWFVAIHAAVPFI MLRK VLMPKTAMAMTIAAS+LGQ+IGSRAERMRLK
Sbjct: 241 WREHTQKFSFSWFVAIHAAVPFIAMLRKSVLMPKTAMAMTIAASVLGQIIGSRAERMRLK 300

Query: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGG-RCGVERMVFDPLQKDGRQTSTPANVCS 359
           A++EKGKATTV+PT+ T+PSYELTQVD IV G RCG E M+FDPLQKDGRQTS+PA +CS
Sbjct: 301 AVAEKGKATTVLPTVGTSPSYELTQVDTIVAGSRCGTETMMFDPLQKDGRQTSSPAKICS 359

BLAST of Cp4.1LG14g10070.1 vs. ExPASy TrEMBL
Match: A0A5D3CML1 (Putative transmembrane protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G001060 PE=4 SV=1)

HSP 1 Score: 639 bits (1648), Expect = 3.95e-230
Identity = 325/359 (90.53%), Postives = 339/359 (94.43%), Query Frame = 0

Query: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60
           MDFI RGMNDE+S CYSN EMDIQRCPFLRNINKPTCFSFST+ F FPVRG KGPIFEDG
Sbjct: 1   MDFILRGMNDEASGCYSN-EMDIQRCPFLRNINKPTCFSFSTLTFNFPVRGEKGPIFEDG 60

Query: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120
           PNFDTAFKLFHGKDGVVPLSE+S F+    EPE A HFNPLAAKAATISLSAFGPGGPFS
Sbjct: 61  PNFDTAFKLFHGKDGVVPLSERSGFDKISLEPEMASHFNPLAAKAATISLSAFGPGGPFS 120

Query: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180
           FGSFSEKWKKQKSEASNKK++SSQKKG SSKHEALG+EWLETGNCPIAKSFRAVSGVLPL
Sbjct: 121 FGSFSEKWKKQKSEASNKKNNSSQKKGNSSKHEALGNEWLETGNCPIAKSFRAVSGVLPL 180

Query: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240
           VATAFQ PPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI
Sbjct: 181 VATAFQLPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240

Query: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300
           WREHT+KFSFSWFVAIHAAVPFI MLRK VLMPKTAMAMTIAAS+LGQVIGSRAERMRLK
Sbjct: 241 WREHTQKFSFSWFVAIHAAVPFIAMLRKSVLMPKTAMAMTIAASVLGQVIGSRAERMRLK 300

Query: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 359
           AI+EKGK TTVIPTL++TPSYELTQVDAIVG RCG+ERMVFDPL+K GRQTSTPANVCS
Sbjct: 301 AIAEKGKVTTVIPTLESTPSYELTQVDAIVGSRCGLERMVFDPLRKGGRQTSTPANVCS 358

BLAST of Cp4.1LG14g10070.1 vs. ExPASy TrEMBL
Match: A0A1S3BK30 (uncharacterized protein LOC103490724 OS=Cucumis melo OX=3656 GN=LOC103490724 PE=4 SV=1)

HSP 1 Score: 639 bits (1648), Expect = 3.95e-230
Identity = 325/359 (90.53%), Postives = 339/359 (94.43%), Query Frame = 0

Query: 1   MDFIFRGMNDESSECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDG 60
           MDFI RGMNDE+S CYSN EMDIQRCPFLRNINKPTCFSFST+ F FPVRG KGPIFEDG
Sbjct: 1   MDFILRGMNDEASGCYSN-EMDIQRCPFLRNINKPTCFSFSTLTFNFPVRGEKGPIFEDG 60

Query: 61  PNFDTAFKLFHGKDGVVPLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFS 120
           PNFDTAFKLFHGKDGVVPLSE+S F+    EPE A HFNPLAAKAATISLSAFGPGGPFS
Sbjct: 61  PNFDTAFKLFHGKDGVVPLSERSGFDKISLEPEMASHFNPLAAKAATISLSAFGPGGPFS 120

Query: 121 FGSFSEKWKKQKSEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPL 180
           FGSFSEKWKKQKSEASNKK++SSQKKG SSKHEALG+EWLETGNCPIAKSFRAVSGVLPL
Sbjct: 121 FGSFSEKWKKQKSEASNKKNNSSQKKGNSSKHEALGNEWLETGNCPIAKSFRAVSGVLPL 180

Query: 181 VATAFQPPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240
           VATAFQ PPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI
Sbjct: 181 VATAFQLPPGMKLKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGI 240

Query: 241 WREHTKKFSFSWFVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLK 300
           WREHT+KFSFSWFVAIHAAVPFI MLRK VLMPKTAMAMTIAAS+LGQVIGSRAERMRLK
Sbjct: 241 WREHTQKFSFSWFVAIHAAVPFIAMLRKSVLMPKTAMAMTIAASVLGQVIGSRAERMRLK 300

Query: 301 AISEKGKATTVIPTLDTTPSYELTQVDAIVGGRCGVERMVFDPLQKDGRQTSTPANVCS 359
           AI+EKGK TTVIPTL++TPSYELTQVDAIVG RCG+ERMVFDPL+K GRQTSTPANVCS
Sbjct: 301 AIAEKGKVTTVIPTLESTPSYELTQVDAIVGSRCGLERMVFDPLRKGGRQTSTPANVCS 358

BLAST of Cp4.1LG14g10070.1 vs. TAIR 10
Match: AT5G45410.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G25030.2); Has 124 Blast hits to 124 proteins in 34 species: Archae - 2; Bacteria - 31; Metazoa - 0; Fungi - 0; Plants - 91; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 401.4 bits (1030), Expect = 7.8e-112
Identity = 213/325 (65.54%), Postives = 247/325 (76.00%), Query Frame = 0

Query: 14  ECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDGPNFDTAFKLFHGK 73
           EC    E  IQ+CPFLRNINKPT  SFS+++F  PV+G KGPIFEDGP FD+AFKLFHGK
Sbjct: 4   ECPFAAESIIQKCPFLRNINKPTNLSFSSLSFPIPVQGGKGPIFEDGPGFDSAFKLFHGK 63

Query: 74  DGVVPLSEKSIFETSLPEP-ETAPHFNPLAAKAATISLSAFGPGGPFSFGSFSEKWKKQK 133
           DG+VPLS     + S  E    A  FNPLA K ATISLSAFGPGGPF FG FSEKWKKQ+
Sbjct: 64  DGIVPLS--GFADDSEDEAGRRALQFNPLAGKVATISLSAFGPGGPFGFGPFSEKWKKQQ 123

Query: 134 SEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPLVATAFQPPPGMK 193
            +    K   +Q+ G SSKHEA+G EWL+TGNCPIAKSFRA S V+PL++ A   PPGMK
Sbjct: 124 KK---PKPSKNQQSGDSSKHEAVGDEWLKTGNCPIAKSFRAASKVMPLISKALTLPPGMK 183

Query: 194 LKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGIWREHTKKFSFSW 253
            +CP  IVAARAAL++TA VK+LRPQPLP KML IA +GMAANVPLG+WREHTKKFS +W
Sbjct: 184 YRCPAPIVAARAALSKTALVKSLRPQPLPEKMLAIALMGMAANVPLGVWREHTKKFSPAW 243

Query: 254 FVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLKAISEKGKATTVI 313
           F+A+HAAVPFI MLRK VLMPKTAMA+TI ASILGQVIGSRAER RLKA++EK     ++
Sbjct: 244 FLAVHAAVPFIAMLRKSVLMPKTAMALTIGASILGQVIGSRAERYRLKAVAEK-----MV 303

Query: 314 PTLDTTPSYELTQVDA-IVGGRCGV 337
           P       Y  +  D+ I GG CG+
Sbjct: 304 PVTAMVSGYNQSPGDSGISGGHCGI 318

BLAST of Cp4.1LG14g10070.1 vs. TAIR 10
Match: AT5G45410.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G25030.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 401.4 bits (1030), Expect = 7.8e-112
Identity = 213/325 (65.54%), Postives = 247/325 (76.00%), Query Frame = 0

Query: 14  ECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDGPNFDTAFKLFHGK 73
           EC    E  IQ+CPFLRNINKPT  SFS+++F  PV+G KGPIFEDGP FD+AFKLFHGK
Sbjct: 4   ECPFAAESIIQKCPFLRNINKPTNLSFSSLSFPIPVQGGKGPIFEDGPGFDSAFKLFHGK 63

Query: 74  DGVVPLSEKSIFETSLPEP-ETAPHFNPLAAKAATISLSAFGPGGPFSFGSFSEKWKKQK 133
           DG+VPLS     + S  E    A  FNPLA K ATISLSAFGPGGPF FG FSEKWKKQ+
Sbjct: 64  DGIVPLS--GFADDSEDEAGRRALQFNPLAGKVATISLSAFGPGGPFGFGPFSEKWKKQQ 123

Query: 134 SEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPLVATAFQPPPGMK 193
            +    K   +Q+ G SSKHEA+G EWL+TGNCPIAKSFRA S V+PL++ A   PPGMK
Sbjct: 124 KK---PKPSKNQQSGDSSKHEAVGDEWLKTGNCPIAKSFRAASKVMPLISKALTLPPGMK 183

Query: 194 LKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGIWREHTKKFSFSW 253
            +CP  IVAARAAL++TA VK+LRPQPLP KML IA +GMAANVPLG+WREHTKKFS +W
Sbjct: 184 YRCPAPIVAARAALSKTALVKSLRPQPLPEKMLAIALMGMAANVPLGVWREHTKKFSPAW 243

Query: 254 FVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLKAISEKGKATTVI 313
           F+A+HAAVPFI MLRK VLMPKTAMA+TI ASILGQVIGSRAER RLKA++EK     ++
Sbjct: 244 FLAVHAAVPFIAMLRKSVLMPKTAMALTIGASILGQVIGSRAERYRLKAVAEK-----MV 303

Query: 314 PTLDTTPSYELTQVDA-IVGGRCGV 337
           P       Y  +  D+ I GG CG+
Sbjct: 304 PVTAMVSGYNQSPGDSGISGGHCGI 318

BLAST of Cp4.1LG14g10070.1 vs. TAIR 10
Match: AT5G45410.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G25030.2); Has 124 Blast hits to 124 proteins in 34 species: Archae - 2; Bacteria - 31; Metazoa - 0; Fungi - 0; Plants - 91; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 401.4 bits (1030), Expect = 7.8e-112
Identity = 213/325 (65.54%), Postives = 247/325 (76.00%), Query Frame = 0

Query: 14  ECYSNNEMDIQRCPFLRNINKPTCFSFSTMNFTFPVRGAKGPIFEDGPNFDTAFKLFHGK 73
           EC    E  IQ+CPFLRNINKPT  SFS+++F  PV+G KGPIFEDGP FD+AFKLFHGK
Sbjct: 4   ECPFAAESIIQKCPFLRNINKPTNLSFSSLSFPIPVQGGKGPIFEDGPGFDSAFKLFHGK 63

Query: 74  DGVVPLSEKSIFETSLPEP-ETAPHFNPLAAKAATISLSAFGPGGPFSFGSFSEKWKKQK 133
           DG+VPLS     + S  E    A  FNPLA K ATISLSAFGPGGPF FG FSEKWKKQ+
Sbjct: 64  DGIVPLS--GFADDSEDEAGRRALQFNPLAGKVATISLSAFGPGGPFGFGPFSEKWKKQQ 123

Query: 134 SEASNKKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPLVATAFQPPPGMK 193
            +    K   +Q+ G SSKHEA+G EWL+TGNCPIAKSFRA S V+PL++ A   PPGMK
Sbjct: 124 KK---PKPSKNQQSGDSSKHEAVGDEWLKTGNCPIAKSFRAASKVMPLISKALTLPPGMK 183

Query: 194 LKCPPAIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGIWREHTKKFSFSW 253
            +CP  IVAARAAL++TA VK+LRPQPLP KML IA +GMAANVPLG+WREHTKKFS +W
Sbjct: 184 YRCPAPIVAARAALSKTALVKSLRPQPLPEKMLAIALMGMAANVPLGVWREHTKKFSPAW 243

Query: 254 FVAIHAAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLKAISEKGKATTVI 313
           F+A+HAAVPFI MLRK VLMPKTAMA+TI ASILGQVIGSRAER RLKA++EK     ++
Sbjct: 244 FLAVHAAVPFIAMLRKSVLMPKTAMALTIGASILGQVIGSRAERYRLKAVAEK-----MV 303

Query: 314 PTLDTTPSYELTQVDA-IVGGRCGV 337
           P       Y  +  D+ I GG CG+
Sbjct: 304 PVTAMVSGYNQSPGDSGISGGHCGI 318

BLAST of Cp4.1LG14g10070.1 vs. TAIR 10
Match: AT4G25030.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G45410.3); Has 125 Blast hits to 125 proteins in 36 species: Archae - 2; Bacteria - 31; Metazoa - 0; Fungi - 4; Plants - 88; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 392.9 bits (1008), Expect = 2.8e-109
Identity = 208/331 (62.84%), Postives = 252/331 (76.13%), Query Frame = 0

Query: 19  NEMDIQRCPFLRNINKPTCFSF-STMNFTFPVRGAKGPIFEDGPNFDTAFKLFHGKDGVV 78
           ++++I RCPFLRNIN+PT  SF S++ F  P R  KGPIFEDGPNFDTAF+LFHG+DGVV
Sbjct: 12  SQLNILRCPFLRNINEPTNLSFSSSLPFPIPARAGKGPIFEDGPNFDTAFRLFHGQDGVV 71

Query: 79  PLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFSFGSFSEKWKKQKSEASN 138
           PLS+ +  E   P     P F+PLAAKAATISLS+FG GGPF F +FS+ +K QK     
Sbjct: 72  PLSDTARTEAQKP----VPVFHPLAAKAATISLSSFGSGGPFGFDAFSDMFKNQK----- 131

Query: 139 KKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPLVATAFQPPPGMKLKCPP 198
           KK  SS+ KGG+  HEA+G EWL+TGNCPIAKS+RAVSGV PLVA   QPPPGMK KCP 
Sbjct: 132 KKSDSSKNKGGN--HEAMGDEWLKTGNCPIAKSYRAVSGVAPLVAKILQPPPGMKFKCPQ 191

Query: 199 AIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGIWREHTKKFSFSWFVAIH 258
           AIV ARAA+++T F KNLRPQPLP+K+LVI  LGMA NVPLG+WREHT+KFS SWF+A+H
Sbjct: 192 AIVTARAAISKTPFAKNLRPQPLPAKVLVIGMLGMALNVPLGVWREHTEKFSASWFIALH 251

Query: 259 AAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLKAISEKGKATTVIPTLDT 318
           AAVPFIG+LRK VLMPKTAM  TIAAS+LGQVIGSRAER RLK+++EK K T  +P   +
Sbjct: 252 AAVPFIGILRKSVLMPKTAMVFTIAASVLGQVIGSRAERRRLKSVAEK-KLTLEVPNPSS 311

Query: 319 TPSYELTQVDAIVGGRCGVE-RMVFDPLQKD 348
             + ++        GRCG +  M ++P+  D
Sbjct: 312 VEADQMQFAGVSSDGRCGDKVVMKWNPMMLD 330

BLAST of Cp4.1LG14g10070.1 vs. TAIR 10
Match: AT4G25030.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G45410.3); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 392.9 bits (1008), Expect = 2.8e-109
Identity = 208/331 (62.84%), Postives = 252/331 (76.13%), Query Frame = 0

Query: 19  NEMDIQRCPFLRNINKPTCFSF-STMNFTFPVRGAKGPIFEDGPNFDTAFKLFHGKDGVV 78
           ++++I RCPFLRNIN+PT  SF S++ F  P R  KGPIFEDGPNFDTAF+LFHG+DGVV
Sbjct: 12  SQLNILRCPFLRNINEPTNLSFSSSLPFPIPARAGKGPIFEDGPNFDTAFRLFHGQDGVV 71

Query: 79  PLSEKSIFETSLPEPETAPHFNPLAAKAATISLSAFGPGGPFSFGSFSEKWKKQKSEASN 138
           PLS+ +  E   P     P F+PLAAKAATISLS+FG GGPF F +FS+ +K QK     
Sbjct: 72  PLSDTARTEAQKP----VPVFHPLAAKAATISLSSFGSGGPFGFDAFSDMFKNQK----- 131

Query: 139 KKDHSSQKKGGSSKHEALGSEWLETGNCPIAKSFRAVSGVLPLVATAFQPPPGMKLKCPP 198
           KK  SS+ KGG+  HEA+G EWL+TGNCPIAKS+RAVSGV PLVA   QPPPGMK KCP 
Sbjct: 132 KKSDSSKNKGGN--HEAMGDEWLKTGNCPIAKSYRAVSGVAPLVAKILQPPPGMKFKCPQ 191

Query: 199 AIVAARAALARTAFVKNLRPQPLPSKMLVIAALGMAANVPLGIWREHTKKFSFSWFVAIH 258
           AIV ARAA+++T F KNLRPQPLP+K+LVI  LGMA NVPLG+WREHT+KFS SWF+A+H
Sbjct: 192 AIVTARAAISKTPFAKNLRPQPLPAKVLVIGMLGMALNVPLGVWREHTEKFSASWFIALH 251

Query: 259 AAVPFIGMLRKCVLMPKTAMAMTIAASILGQVIGSRAERMRLKAISEKGKATTVIPTLDT 318
           AAVPFIG+LRK VLMPKTAM  TIAAS+LGQVIGSRAER RLK+++EK K T  +P   +
Sbjct: 252 AAVPFIGILRKSVLMPKTAMVFTIAASVLGQVIGSRAERRRLKSVAEK-KLTLEVPNPSS 311

Query: 319 TPSYELTQVDAIVGGRCGVE-RMVFDPLQKD 348
             + ++        GRCG +  M ++P+  D
Sbjct: 312 VEADQMQFAGVSSDGRCGDKVVMKWNPMMLD 330

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023552670.11.14e-259100.00uncharacterized protein LOC111810249 [Cucurbita pepo subsp. pepo] >XP_023552671.... [more]
KAG7015791.16.08e-25799.72hypothetical protein SDJN02_23429 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022923335.13.52e-25699.44uncharacterized protein LOC111431059 [Cucurbita moschata] >XP_022923336.1 unchar... [more]
XP_022965273.14.26e-25598.33uncharacterized protein LOC111465188 [Cucurbita maxima] >XP_022965274.1 uncharac... [more]
XP_038904883.18.91e-23491.36uncharacterized protein LOC120091115 [Benincasa hispida] >XP_038904884.1 unchara... [more]
Match NameE-valueIdentityDescription
A0A6J1E6J01.70e-25699.44uncharacterized protein LOC111431059 OS=Cucurbita moschata OX=3662 GN=LOC1114310... [more]
A0A6J1HNE22.06e-25598.33uncharacterized protein LOC111465188 OS=Cucurbita maxima OX=3661 GN=LOC111465188... [more]
A0A6J1CWI62.03e-23089.17uncharacterized protein LOC111014867 OS=Momordica charantia OX=3673 GN=LOC111014... [more]
A0A5D3CML13.95e-23090.53Putative transmembrane protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s... [more]
A0A1S3BK303.95e-23090.53uncharacterized protein LOC103490724 OS=Cucumis melo OX=3656 GN=LOC103490724 PE=... [more]
Match NameE-valueIdentityDescription
AT5G45410.17.8e-11265.54unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G45410.27.8e-11265.54unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G45410.37.8e-11265.54unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G25030.12.8e-10962.84unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G25030.22.8e-10962.84unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 130..155
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 130..152
NoneNo IPR availablePANTHERPTHR31033PROTEIN, PUTATIVE-RELATEDcoord: 1..350
NoneNo IPR availablePANTHERPTHR31033:SF18PROTEIN, PUTATIVE-RELATEDcoord: 1..350

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG14g10070Cp4.1LG14g10070gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG14g10070.1:exon:001Cp4.1LG14g10070.1:exon:001exon
Cp4.1LG14g10070.1:exon:002Cp4.1LG14g10070.1:exon:002exon
Cp4.1LG14g10070.1:exon:003Cp4.1LG14g10070.1:exon:003exon
Cp4.1LG14g10070.1:exon:004Cp4.1LG14g10070.1:exon:004exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG14g10070.1:five_prime_utr:001Cp4.1LG14g10070.1:five_prime_utr:001five_prime_UTR
Cp4.1LG14g10070.1:five_prime_utr:002Cp4.1LG14g10070.1:five_prime_utr:002five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG14g10070.1:cds:001Cp4.1LG14g10070.1:cds:001CDS
Cp4.1LG14g10070.1:cds:002Cp4.1LG14g10070.1:cds:002CDS
Cp4.1LG14g10070.1:cds:003Cp4.1LG14g10070.1:cds:003CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG14g10070.1Cp4.1LG14g10070.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0009507 chloroplast
cellular_component GO:0016021 integral component of membrane