Cp4.1LG02g06500 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG02g06500
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: protein PHLOEM PROTEIN 2-LIKE A10-like
Location: Cp4.1LG02: 1186402 .. 1195797 (+)
RNA-Seq Expression: Cp4.1LG02g06500
Synteny: Cp4.1LG02g06500
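
As a rough illustration of how the Location value can be read, the sketch below parses the coordinate string and computes the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts.

    Assumes 1-based, inclusive coordinates; this is an assumption,
    not something the page itself states.
    """
    m = re.match(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,  # inclusive span in base pairs
    }

loc = parse_location("Cp4.1LG02: 1186402 .. 1195797 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene spans 9396 bp on the plus strand of Cp4.1LG02.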
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exon, CDS, polypeptide, three_prime_UTR
ATGGGTATGTGGGCGCATCGCACATTATACCCAACTCCTGAAATTTTCCTCTTCTTTTCTTCCTTAAATTTTGTTTTTTTTTTGTTTTAATTTTGCCAGCTGTCACTTACAATGTGAATGGGTTGGGCCGGAATAGGAAACGAAAACGTGGGCCTTAGGCTTGTTTAGATAAGGTGGCCCATTATAATGTATTCATGGCCCAATCATAACCCAAGCCCACATTATAATATAATATAAGGTTGTATTAATAATAATAATTGGAGAAAAGTGTATCTACCAAAAAAAAAAGCGCAAGCGAAAGGGCGACCGATCAAACAGCGCCATATTAAGGGTGGGAAGTGTGAACGAACCAGTCGCTTTCTTCACCGTACGAATCAGATGCAATGTCATGGTTGGCTCTCTCCATTGCTAACACCCTTCGCCTTGACGACGAAGAAGATGAGCAGCACAACGACGTCGTATCTTCTTCTTCTTCTTCTTCCACCATTCCCCGCAACCAGATGGAATCCCAATCCCAATCCCAATCCCAATCCCAATCCCAACCCCAACTCGACGACGAAGCTTTATCTCGCCGCGTCAAAGAGGACTTGACTGAATTCAAACAAACCCTAACCCGCCAATTTTGGGGCGTGGCCACTTTCCTTGCTCCTCCTCCGCCTCCGCCGGAACCGGGGCCGACTCATCATCTCAACCCGGCGGAGGATCTGGTTGCCCCTCCCGATTGGAAGTCGTTCGAGGCGTCTAATCAGTCTGATCCATCGATCTCCGGGGACGAAGAGGACCTGACTGATCCGATTGAGGTTTTGAATATGCGTTCCAATCATGACGCCTATGCGAAATCGGGGATTTTACAGGAGGAATGCTATGAGGTGGATTGGGAAGGCGCTGTTGGGATCACTGATGAAGTGCTGACGTTTGCGACGAACATTGCAATGCACCCTGAGACTTGGATTGATTTCCCAATTGACGAGGAGGAGGACAACGGTGGTATGTGTGTTTTGCTAATTTTGCTATCCAATTCCTTAGATGCTTGTTAATTGATTTCTCTTCGTCAAGGAAACCTGTTATCTGGTCTTAATCTTATTCGTTTCTTATCGATATAGGATGAAGGTTACTGAATGGTCTGAAGGCCACTGCTAATGATCTTTAGCGTCATTGTTTCTTTTAATGAATTGTTCAACTGATGCTGGAATCGTTTATGAAATGCTTTTAGAAACTCGTTTTCATAAGTGGAGACTTCTATGAAATAAGGCATTCGACTATCCGAGTGTTCCATCCTTTTGGATCTTCTGCACCGGGCATGTTAGAAATTTATCTTGGGATGTGTTGAGTGCCACAATTAGATGTCGGCAGTTAGCTCAGAATTTTTGTAGCTGTATCTCATTCAACAACTATTTTCCAAAATGAACTAAGTATCTTTAGTTGGCACACCTGTGGTCCATAAAGTAAGAGATGGGTATTAAATTTTTTTGATAACTCAGTAAACCCTTTTGGATTTCAGGATAGTCCTCTTAGATCATGCCTCAGTTTTTATATCTACCTGGGTTCTGTCACCTCACTTGTTGACCTCCACTTGTGTATCGTTCCTCATAACTTCCTGCCTTGTCATTCCAGTTCCCAATTTTTCTTCCTTTTCTGGTTCTCGATGTCTATTACCTTTTCCATTAACGATTACCTTTGGCTTTGGGCAGAAGTTGAAATATTGTGGTTCCCTCGGTGCTATACAACACATACATTCAGTGATTCCATTCGCGTACTTTACATCTTTTGTACGTGAATTGAACTCATTCATTCAAGGTGCCATTGCACTTATGTTGTGATAGGAAAGATAAGCCCCCAATTCAGTTCACCTACACTAGGTGAAAGCATCATGAGTCTTTAAGTGATAGTTTTGTTATGCAACATGTGAATTTGTCAAGTGTTAGATAATCTCCCGCGTGTAGTGTGCATGGATCCTTTATGTGATGTGGGACCCCAATGTCTATAAATGGTCATAGTAT
ACTTTTGCGGGTCAGAAGGCGTCGGGGAAGGAGGATATTGGGTGGGATCAATGAACAACAAATCCCCCATACATCTCAAGGTGAAGGTCTTTTGGCCTATCACAATAAAGAGTGAAGTCAAGATAGCTTTAAGTGTGTGTGGGGATTTGTCATGTGATTCAGATACACTCCTAATTTGAAAAGATGAGGCACCAACCCAAGCTGAAAACTAAGTCTACACAAGCAAAACTTTATAGATAGACATTTTTATTGTGAATGCTTAAGGTTTTTATCTTTATTTTTATTTTTTATTTTTTTTAAATACCATGGGAAGCATAAATTTCTCAAGTATAAAGCTAGTTATTATATAATTGATGACAAGAGATTTTAGGGCTTGTTGGAGGTGGACAGTTGGAGGTGGACATTAAGCATAGTGAATTGAAACTAGAGACGGATCAAGGTAGCTTGCCAGCGAAGGGAGAGACTTGGGCCTTTCCAAAACCTGGGAACTTGTTTCCTTTCAATATAATTAGCATATCACATCTTGCATTAGAGGCTCTATCGATTATGTTCATCATTTAGAGTGCGAATATGATGTTTCAGTTTTATACAATCAAGTGTGACCCAAGTTATTTGATTTTCAGTCTAATGGATTATTCTGAATTTAATCATAACCCCATCTCCTATACCTGGATATGTAAGATGAACGGGAAAGGAATTTTTATAAGGTGATCCACTTAAATTACTAAATATGTTGACTTTTGAATGGTGAGTTTGGACTTTGTCGTTGTTTAACTATCTGCTTTTGTGATGTTTTTCACATTGTTGAACCCTGTGTATTTACGTCTTTAGACCCTTCTTTTTCCTGTTTTAGATTTTGAAATGTCTGATGCCCAAAAAGAGCATGCTTTCACTATTGAACATCTTGCTCCTAGATTGGCTGCTCTCAGATTTGAACTCTGTCCTTGCCATATGAGTGAGAGTTACTTTTGGAAAGTCTACTTTGTGCTCCTGCATTCAAGACTCAATAAACAGGATGCTGAGGCTTTATCAACTCCACAGGTTGACTACTCTTTCTTTTCTTTGTTTTTTAAAATTGCAATGCAACCTTTTTAGGTGGGCTACGTAATTTCCAAATTGTTATGGTCTGGCATGATGATGCATATTTGTAATTTAGGGTTCCCATGTATAGTTAAATTGTGCACAAAATGACCCAAATGCTATGATTGCAGAAAATTTTATTATATTCATATGGAGTTCAAACATGTATTATTTTACTGCTTTAAATGAGTATTAATCTGGTTCCTTTGATTGAAACGTTATGTTTCTAGTATAATTTGGTTTCACCAAACTTGTAGGTAGCAGAAGCCAGATCAATGTGGATGCAGGAATTACAAAAGAAGACCAAGCCAGAGACTTTCTGGTGTGGAAGAGACACTTTTGAATTAAAAGAGAGCTCTGATTTGTTGCAGGAAGATGATAGCTCCATGGGCCTTGAAACTCATTCTGTATCTACGCTGCCTTGGACATTTACATCTGAGCCAAGCATGTCGTCTATGTCGAGCAATTGTGAGACAGAAAAATACCAAATAGAGACTTCTGAAACACAATTCATTGATAAGTCCGTCATCGTTGAAAAACCTATAATTAAAGATGAGGATAAAAACTCGACGATTGAGTCTTCTTCAAAATTCCTTGTTCAGAACTACGATGACGAGTCAGATAATGATTGGCTAGAGGAAGACTCTGGGACCATCCTCCCTCCGGGGTACGACGAAGATATTTCTTTCAGTGATCTCGAGGATGATGATATGGTTCTGCCTGCTAAGTTCAAGATTGTTTCAAAAGAATAAGGAAGTTAAACAAAAGAATTCAAGGCGATGCTTCCAATTGTGTATGTATGCTTTGCCGATGAGGATAAAGGAATTACAAAGTCAGTCTGACAAGATTGATGGGGCTTCAGTTACCGATCTTTATGAAACCATCTGAAATTGCTTTTAGTTGATGGGGTTGGTGGGTGC
ATTTGTAAATGAATGAGAGAATGATTATTCCAGTTTGGAAGAGTGACAATGTGTCTCTGTTTGGTTTCTTGTTTGGGAACTGTGTAAAGAATGTATCAGTAAATAGACAAAAGGGGAAAAGATATAGATTCTTTTTGTTCCATGCCCAACGATGATTGTCCTCATCCCTTTGTTGGTCTGTCAGGAGTGGAGATTCATTCATCGGAGGCTGTCTGACAGAAGAAAGCGCTAAAAACAGCCAAAGGAGCCACCGAATTGAGGCAGTTGAGAAGAGGATTCTGTTGGATGGAAGTTACCCATCATCTCATTGGTATCTACAGAACCACCATAATGGACCATTCTTTCTATTTGATTGCTGTACTTTTTTACCTTTCTACTCCTTAAAGATAAGATATATAGCATATAGCTTTAGTTATCATACTCACAGATAGGCCTTTGAAATGCAAGTTGGCCCATTATTTTGTTCTTTGAGTATCAGACCAATGAGCTTTTATTTGTTTGTTTGTTCAAAATTGATACCAGAAATTGAACCGAAGCGTACTGACCACAATCACTCTGGATGAGATTTGTTTTCAAGTTAGAAAGACGACAAACAGGATAAAGAATTGAAACGGAAGACAGGGAATATTCAACAGAACCGAGAGGAAATTCACCAAGAAATTTGCTGTCAGAAATTATTATTCAACTTTAATGGATATTTATTACTTAATGTCGTATTTATTATAAGTTTAAATGCGCAACTTGAGTTATAATCATGAAAAATTGAGAACGAGAGTGTCTAGATTATGGGGAAAATTGCATACAAAATGGCCATGCAGTTGGGTTCAAATTTAATACACAAGGCTGGCGTGGTCCGTTTGTGGGAACTTGAATTCATTCTGATTGGTTATCTGATCTGCGATTTTGGTGTTTCCGTGTAGGATTGGTTTAACCGATCCGAGCATGAACTCCTCCATTCAATTCTTTCTCGTCTTATTTTTCTTTATTAAATTTTCATTCCGTTCTATCAATCATTTCAAGCCAATCACCTGCACTCGCCTCCTGTTTCCATTTGCCGCCAATTAAATTTCCCAGAAATTTTGCTGCAGAATCCTCCAGGTTTGACCCCTCAATCTTCATTGACTTACTACAATAATCCCAAGCAGGTTTCCGATTCGATTCTGTGTTATTTATCGGATTTTTCCTTTCCATTTGTTGAGAGGGAAAATCTCAACGATTTTCTCGACCCTCTCACTGTTTGTCGCGCCATTTTTTTGTACGGTTTCGTGCCTTTCAATCTTGAATCTGGGTTTCGGTTTGATTTCGGTTCTATAGACTCTCCATCTTGCTATTTGGAAATTTCTCTGTATGGATTTTGAACTTGTTAGAAGGGGCTTGGGATTCTCCCAAAGGAGGAAGAAATGGCTTGTTCTATTGGCTCTCATGGGGGTCTCTGGTTACGGAGTTTATAAGGTCTATCATTTCCCCTCCGTCGAGAGGAAGAGGAAGAGGCTGATGAAGCTCTTCGGTACCATGATTTCCGTCGCTGAAATGGTTGCGGATTCTTCTGAAGCAATCGGAGTAATTTCTAAAGATTTGAAGGAGTTTCTGAAATCTGATTCCGATCAAATTCCCAACAGCTTGAAGCAAATTTCCAAACTCGCTAAATCGGAGGAATTTTCGGAGTCTCTGGAGAAGGTCACCGAGGCATTTACGGTTGGGATGATGAGAGGGTATAAATCTGTAACAAAGAACGACGGAAATTTGGAGGCTGATTCGGTGAATTCGAGCTCTTCTTCTGACGTCGTCGAGAAGCTTTTCTCAACAGCTGGGACTGGTTTTGCTTCTGTTGTGGTTGGAAGTTTTGCAAAGAATTTGGTGATGGGGTATTACTCGATCCCTGGATCAGTCGATGATGCTTACAAATCTGGATCTGAATTTTCAGACGTGCCTAAATGGGTAACTGTGGCCTCCGATGAGAAATGCAAGAATGTTATAGCAGATTGCATACAAGTTTTTGT
TAGCACTGCAGTTTCTGTATATCTTGATAAAACAATGGATATTAATGTGTACAATGATCTCTTTAGTGGATTGACCAATCCCACTCATCAGGACAAGGTGAAGGACATGGTTGTTTCTGTTTGTAATGGCGCTGTGGAAACTCTTGTAAAAACATCTCACCAGGTCTTGACAAGCTCGCGATCGTCTTCGAATTTGAGTCCGGTTTCATCCTGTAATGGACTGTCAAAGCTAGGAGACGATGTCTTTTCAGAAGAAGCTTCTTCAAAGAAGATGGCTGCGGGCAGCTCAATTGAGAGTTGCCAAAATGGGTGGATTGACACAGTTTCATCTACTCTGGCAGTTCCTAGAAACAGGAAGTTTGTGCTTGATTTAACTGGTAGAGTGACATTTGAAACGACAAGATCTGTTGTGGATTTTTTGTTATGGAAGCTAATGGATGGTCTCAAGAGAAGTTTTGGTACAGTTCATGATGAAGTTGTGGGTAGGGGTTTGGAAGTCGTAAGCTACTTCAGTGCAAAGTCTTCTGTTATTGTTACAATTTGTGTAGCATTATATCTACATGTTTTTGTTGGCACCGGGCTTCTGTTATCTGCATAATAATAGAGCGGAAAGCAGTTCAAGCCACTTAGATTAGAAAGTATATACCAGTGTTGCATTTAGGGAAGGTTTTGGACTTCCTATCATTCAAATTTGAATGATGATCATACTTCTTGAACAAATGCATACATTGATGCTTAAGTAACTCTGATTCTCTGGCTTTTCTACAGGATGCTTTTGATATATTCTAATACAGTTGAGAGCCAAGTTCTTCATTCACCCAGATTGATGCATTTCAAATAGCTACATGACTTGGAAGCCTAATGCAATTTCATCCTTTGTTCCAAACACCCTAGTTAATAACGATTGAGTAACAACCCAAACCCAGATATTGTTCGTTTATGTATCGCCGTCAGCCTCACGGTTTTAGAACGCGTCTGTTAGGGAGAGGTTTCCACACCCTTATAAGAAATTTTTTGTTCCCCTCCCTCTCCAACCGATGTGGGAATCTCAATCTACCCCTCTTGGGGTCCCAATGTCCTCGCTGACTACTCTGTGATTGACTTTGATACCATTAGTAACAGCTCAAATTCATCGCTAGCAGATATTGTTTGCTTTAGCTTGTTACGTATTGGTGTCAGCCTCACGGTTTTAAAATGCGTCTGTTAGGGAGAGGGTTCCACACCTTTATAAGGAATGCTTCATTTCCCTCTCCAATTGATGTGAGATCTCATAGATTGATTCATGTCTCCCTTTACCTCTCTTTCTCAACTCAGCATGTCATACTACTGAAGACTTTAGAAAAGAGGCAGTTCCATTTGATTTTTACTTTCATGTCTCAACGACTTGTACAAGTTTCCTGTTCTTTTGGAGCTGTAAACATAGATGTCTTCTATGCCAATTATAAAGGTTCTATGATTCACGTAATTGACTCCGCAGGACTGATTTTCACCCATTACCCCACTCCTTGAGTTACCTTATAATTACCTTGTGCAATTTTAGCATATTTTTGCAGTTGCCTTCTGACTAATGAATTTAGTTATTTACTATATGTTTAGCCATAGATGACGGAGAACTTGAAATTCCTTTTGACAAACTCATATTACTATTCAACATTCCTTACTATGCTACAGATTTTTGTTGTTGGCTCCAGGACTACTTATCCTGTTATTTTCAATCTTTTTTGTTTCTGATCTCATAACCTTGTTCTCACAGAACCGTTTTGCAGAGCCGCTGGCTATTTTGAACTGCTTTGGATATCACAAAGTAATTTTGTTCTGAATCTTAATGGCATGTTCTTGATCCGGTGAGGCCTGCAAACACCACCATCTTGGATGTCTCGCGGGGATGTCGTTGCTGGAAAGTTGATGAACTTTCTATTTCCGTGAACTTCCTCAATATACTCTAAACAAATGCTGGATGAAGTTCTTTCTGGATGGAAAGTAAACAGTAAGCTGACTTT
TCCAGACAGGGTGCCGTAGAAACTTCTCTTTGTTAAAGAGATTATTATTTACTCAGTAGGTCTGCATATAGGCGAGGATGTGGGTGTTAACCCCATGTGATATGCATTTTTAACATTTGGGTATCATCATTACCATCATCCCATTCTTCTTTAATATCCACTTCCTTTACCTTCATTTTTGTTGATAACATTTTTTATGCCATGTAGCTCTTGATTAATCATGCCTTCTAGTAATAGAGATGTCAATGGGTTATGATTTCTTTTGCCCATCTCTTCCTGGAAGCTGCCTCCTTTTAACCTGCTGGAAATTTCTGGTATTTTGACTGATATTATCATTCATTCTTCCATCCTATTTGGATAGAACGAGAGTAAAAGCTGAAGTGAGAGGGAAGGATAATAGGGGAAGCTGCCAGCTATGAGCGGCGGCAAATACACTCATGCAGGCCCCGTAGGACCATTGCGGGGTATGGCATAATCTTCCAAGTTGCTTTCATATTTGCTTTTGTTTTTTGTTCAACTGGCCCTTAATCTTGAATATGTCATTGTAATTGTATGTAGATTGATTTATTAACAAGCCCCGAATCATGATTGGCTCTTTGCACGCTTTGTTACCAACTTTTGTTTATTTTTCCTTTATAAAGATCTTTGTTCTGAGGTTAATTTCCATTGTCTTGTCGTTAATTTTAACAGTGTCAAGGGAATGCTTCAAAATTCAACTCTTAATTATTAACTGGGTCGGTAGAAGATGCTCCCTTGGGACTTATAAGCTGCTAGTTTGGAGCTTTCATTACTCTGTCTCTGTCTCTGTCTCTGTTTCTCTCTCTCTGTTTCTGCTTATTTCCATGTCTGCCATTACTTCCACCTGCAGTAGCTTCTTCTCCATCAGATCAAATTCTATGGAGTCAAGAGTGAGAACTTCTTCATCAACCCATGGCTCTCCAGCATGTGGGAAGCTTGATGGAGTAGCAACTTGGCTCATCAATGGCTTTGTCACAGCTTTCTTTGGATCCTTGGAACGATGCTCTTGTATTCGTATTGCCACAGCTGAGGACGACGGCGATGAGTCGAATGACGCTCCTTTGATCCCAAAAGATGTAATCCATCGACAGGACGGCGATGCAGCTGGCCGGAGGAGGGCCGGGAAAGGCAAGAAGTGTCAGCCACTTGTAGCTGCAATCTAATGAACAACTCTGCTACTTATCTTGAATAACCCAAATCAAAAAGCTCCTCTTTTTCGTTGGTAAGTGATATGAAGAAGCTTCTGAATCTGGGAAACGTACTGCTTTTCTAAATTATTTACAAGTGTTTACTGTTCTTAGCAACTCAATCTCAGCTACAATTTAATCTCAATCTCAGTTAATAACTTCAATAACAACTCAAATTTTGATCTTCTCAT

mRNA sequence

ATGGCTGTCACTTACAATGTGAATGGGTTGGGCCGGAATAGGAAACGAAAACATGCAATGTCATGGTTGGCTCTCTCCATTGCTAACACCCTTCGCCTTGACGACGAAGAAGATGAGCAGCACAACGACGTCGTATCTTCTTCTTCTTCTTCTTCCACCATTCCCCGCAACCAGATGGAATCCCAATCCCAATCCCAATCCCAATCCCAATCCCAACCCCAACTCGACGACGAAGCTTTATCTCGCCGCGTCAAAGAGGACTTGACTGAATTCAAACAAACCCTAACCCGCCAATTTTGGGGCGTGGCCACTTTCCTTGCTCCTCCTCCGCCTCCGCCGGAACCGGGGCCGACTCATCATCTCAACCCGGCGGAGGATCTGGTTGCCCCTCCCGATTGGAAGTCGTTCGAGGCGTCTAATCAGTCTGATCCATCGATCTCCGGGGACGAAGAGGACCTGACTGATCCGATTGAGGTTTTGAATATGCGTTCCAATCATGACGCCTATGCGAAATCGGGGATTTTACAGGAGGAATGCTATGAGGTGGATTGGGAAGGCGCTGTTGGGATCACTGATGAAGTGCTGACGTTTGCGACGAACATTGCAATGCACCCTGAGACTTGGATTGATTTCCCAATTGACGAGGAGGAGGACAACGGTGATTTTGAAATGTCTGATGCCCAAAAAGAGCATGCTTTCACTATTGAACATCTTGCTCCTAGATTGGCTGCTCTCAGATTTGAACTCTGTCCTTGCCATATGAGTGAGAGTTACTTTTGGAAAGTCTACTTTGTGCTCCTGCATTCAAGACTCAATAAACAGGATGCTGAGGCTTTATCAACTCCACAGGTAGCAGAAGCCAGATCAATGTGGATGCAGGAATTACAAAAGAAGACCAAGCCAGAGACTTTCTGGTGTGGAAGAGACACTTTTGAATTAAAAGAGAGCTCTGATTTGTTGCAGGAAGATGATAGCTCCATGGGCCTTGAAACTCATTCTGTATCTACGCTGCCTTGGACATTTACATCTGAGCCAAGCATGTCGTCTATGTCGAGCAATTGTGAGACAGAAAAATACCAAATAGAGACTTCTGAAACACAATTCATTGATAAGTCCGTCATCGTTGAAAAACCTATAATTAAAGATGAGGATAAAAACTCGACGATTGAGTCTTCTTCAAAATTCCTTGTTCAGAACTACGATGACGAGTCAGATAATGATTGGCTAGAGGAAGACTCTGGGACCATCCTCCCTCCGGGGTACGACGAAGATATTTCTTTCAGTGATCTCGAGGATGATGATATGGTTCTGCCTGCTAAGTTCAAGATTAATCCTCCAGGTTTGACCCCTCAATCTTCATTGACTTACTACAATAATCCCAAGCAGACTCTCCATCTTGCTATTTGGAAATTTCTCTGTATGGATTTTGAACTTGTTAGAAGGGGCTTGGGATTCTCCCAAAGGAGGAAGAAATGGCTTGTTCTATTGGCTCTCATGGGGGTCTCTGGTTACGGAGTTTATAAGGTCTATCATTTCCCCTCCGTCGAGAGGAAGAGGAAGAGGCTGATGAAGCTCTTCGGTACCATGATTTCCGTCGCTGAAATGGTTGCGGATTCTTCTGAAGCAATCGGAGTAATTTCTAAAGATTTGAAGGAGTTTCTGAAATCTGATTCCGATCAAATTCCCAACAGCTTGAAGCAAATTTCCAAACTCGCTAAATCGGAGGAATTTTCGGAGTCTCTGGAGAAGGTCACCGAGGCATTTACGGTTGGGATGATGAGAGGGTATAAATCTGTAACAAAGAACGACGGAAATTTGGAGGCTGATTCGGTGAATTCGAGCTCTTCTTCTGACGTCGTCGAGAAGCTTTTCTCAACAGCTGGGACTGGTTTTGCTTCTGTTGTGGTTGGAAGTTTTGCAAAGAATTTGGTGATGGGGTATTACTCGATCCCTGGATCAGTCGATGATGCTTACAAATCTGGATCTGAATTTTCAGACGT
GCCTAAATGGGTAACTGTGGCCTCCGATGAGAAATGCAAGAATGTTATAGCAGATTGCATACAAGTTTTTGTTAGCACTGCAGTTTCTGTATATCTTGATAAAACAATGGATATTAATGTGTACAATGATCTCTTTAGTGGATTGACCAATCCCACTCATCAGGACAAGGTGAAGGACATGGTTGTTTCTGTTTGTAATGGCGCTGTGGAAACTCTTGTAAAAACATCTCACCAGGTCTTGACAAGCTCGCGATCGTCTTCGAATTTGAGTCCGGTTTCATCCTGTAATGGACTGTCAAAGCTAGGAGACGATGTCTTTTCAGAAGAAGCTTCTTCAAAGAAGATGGCTGCGGGCAGCTCAATTGAGAGTTGCCAAAATGGGTGGATTGACACAGTTTCATCTACTCTGGCAGTTCCTAGAAACAGGAAGTTTGTGCTTGATTTAACTGGTAGAGTGACATTTGAAACGACAAGATCTGTTGTGGATTTTTTGTTATGGAAGCTAATGGATGGTCTCAAGAGAAGTTTTGGTACAGTTCATGATGAAGTTGTGGGTAGGGGTTTGGAAGTCATCTTTGTTCTGAGTAGCTTCTTCTCCATCAGATCAAATTCTATGGAGTCAAGAGTGAGAACTTCTTCATCAACCCATGGCTCTCCAGCATGTGGGAAGCTTGATGGAGTAGCAACTTGGCTCATCAATGGCTTTGTCACAGCTTTCTTTGGATCCTTGGAACGATGCTCTTGTATTCGTATTGCCACAGCTGAGGACGACGGCGATGAGTCGAATGACGCTCCTTTGATCCCAAAAGATGTAATCCATCGACAGGACGGCGATGCAGCTGGCCGGAGGAGGGCCGGGAAAGGCAAGAAGTGTCAGCCACTTGTAGCTGCAATCTAATGAACAACTCTGCTACTTATCTTGAATAACCCAAATCAAAAAGCTCCTCTTTTTCGTTGGTAAGTGATATGAAGAAGCTTCTGAATCTGGGAAACGTACTGCTTTTCTAAATTATTTACAAGTGTTTACTGTTCTTAGCAACTCAATCTCAGCTACAATTTAATCTCAATCTCAGTTAATAACTTCAATAACAACTCAAATTTTGATCTTCTCAT

Coding sequence (CDS)

ATGGCTGTCACTTACAATGTGAATGGGTTGGGCCGGAATAGGAAACGAAAACATGCAATGTCATGGTTGGCTCTCTCCATTGCTAACACCCTTCGCCTTGACGACGAAGAAGATGAGCAGCACAACGACGTCGTATCTTCTTCTTCTTCTTCTTCCACCATTCCCCGCAACCAGATGGAATCCCAATCCCAATCCCAATCCCAATCCCAATCCCAACCCCAACTCGACGACGAAGCTTTATCTCGCCGCGTCAAAGAGGACTTGACTGAATTCAAACAAACCCTAACCCGCCAATTTTGGGGCGTGGCCACTTTCCTTGCTCCTCCTCCGCCTCCGCCGGAACCGGGGCCGACTCATCATCTCAACCCGGCGGAGGATCTGGTTGCCCCTCCCGATTGGAAGTCGTTCGAGGCGTCTAATCAGTCTGATCCATCGATCTCCGGGGACGAAGAGGACCTGACTGATCCGATTGAGGTTTTGAATATGCGTTCCAATCATGACGCCTATGCGAAATCGGGGATTTTACAGGAGGAATGCTATGAGGTGGATTGGGAAGGCGCTGTTGGGATCACTGATGAAGTGCTGACGTTTGCGACGAACATTGCAATGCACCCTGAGACTTGGATTGATTTCCCAATTGACGAGGAGGAGGACAACGGTGATTTTGAAATGTCTGATGCCCAAAAAGAGCATGCTTTCACTATTGAACATCTTGCTCCTAGATTGGCTGCTCTCAGATTTGAACTCTGTCCTTGCCATATGAGTGAGAGTTACTTTTGGAAAGTCTACTTTGTGCTCCTGCATTCAAGACTCAATAAACAGGATGCTGAGGCTTTATCAACTCCACAGGTAGCAGAAGCCAGATCAATGTGGATGCAGGAATTACAAAAGAAGACCAAGCCAGAGACTTTCTGGTGTGGAAGAGACACTTTTGAATTAAAAGAGAGCTCTGATTTGTTGCAGGAAGATGATAGCTCCATGGGCCTTGAAACTCATTCTGTATCTACGCTGCCTTGGACATTTACATCTGAGCCAAGCATGTCGTCTATGTCGAGCAATTGTGAGACAGAAAAATACCAAATAGAGACTTCTGAAACACAATTCATTGATAAGTCCGTCATCGTTGAAAAACCTATAATTAAAGATGAGGATAAAAACTCGACGATTGAGTCTTCTTCAAAATTCCTTGTTCAGAACTACGATGACGAGTCAGATAATGATTGGCTAGAGGAAGACTCTGGGACCATCCTCCCTCCGGGGTACGACGAAGATATTTCTTTCAGTGATCTCGAGGATGATGATATGGTTCTGCCTGCTAAGTTCAAGATTAATCCTCCAGGTTTGACCCCTCAATCTTCATTGACTTACTACAATAATCCCAAGCAGACTCTCCATCTTGCTATTTGGAAATTTCTCTGTATGGATTTTGAACTTGTTAGAAGGGGCTTGGGATTCTCCCAAAGGAGGAAGAAATGGCTTGTTCTATTGGCTCTCATGGGGGTCTCTGGTTACGGAGTTTATAAGGTCTATCATTTCCCCTCCGTCGAGAGGAAGAGGAAGAGGCTGATGAAGCTCTTCGGTACCATGATTTCCGTCGCTGAAATGGTTGCGGATTCTTCTGAAGCAATCGGAGTAATTTCTAAAGATTTGAAGGAGTTTCTGAAATCTGATTCCGATCAAATTCCCAACAGCTTGAAGCAAATTTCCAAACTCGCTAAATCGGAGGAATTTTCGGAGTCTCTGGAGAAGGTCACCGAGGCATTTACGGTTGGGATGATGAGAGGGTATAAATCTGTAACAAAGAACGACGGAAATTTGGAGGCTGATTCGGTGAATTCGAGCTCTTCTTCTGACGTCGTCGAGAAGCTTTTCTCAACAGCTGGGACTGGTTTTGCTTCTGTTGTGGTTGGAAGTTTTGCAAAGAATTTGGTGATGGGGTATTACTCGATCCCTGGATCAGTCGATGATGCTTACAAATCTGGATCTGAATTTTCAGACGT
GCCTAAATGGGTAACTGTGGCCTCCGATGAGAAATGCAAGAATGTTATAGCAGATTGCATACAAGTTTTTGTTAGCACTGCAGTTTCTGTATATCTTGATAAAACAATGGATATTAATGTGTACAATGATCTCTTTAGTGGATTGACCAATCCCACTCATCAGGACAAGGTGAAGGACATGGTTGTTTCTGTTTGTAATGGCGCTGTGGAAACTCTTGTAAAAACATCTCACCAGGTCTTGACAAGCTCGCGATCGTCTTCGAATTTGAGTCCGGTTTCATCCTGTAATGGACTGTCAAAGCTAGGAGACGATGTCTTTTCAGAAGAAGCTTCTTCAAAGAAGATGGCTGCGGGCAGCTCAATTGAGAGTTGCCAAAATGGGTGGATTGACACAGTTTCATCTACTCTGGCAGTTCCTAGAAACAGGAAGTTTGTGCTTGATTTAACTGGTAGAGTGACATTTGAAACGACAAGATCTGTTGTGGATTTTTTGTTATGGAAGCTAATGGATGGTCTCAAGAGAAGTTTTGGTACAGTTCATGATGAAGTTGTGGGTAGGGGTTTGGAAGTCATCTTTGTTCTGAGTAGCTTCTTCTCCATCAGATCAAATTCTATGGAGTCAAGAGTGAGAACTTCTTCATCAACCCATGGCTCTCCAGCATGTGGGAAGCTTGATGGAGTAGCAACTTGGCTCATCAATGGCTTTGTCACAGCTTTCTTTGGATCCTTGGAACGATGCTCTTGTATTCGTATTGCCACAGCTGAGGACGACGGCGATGAGTCGAATGACGCTCCTTTGATCCCAAAAGATGTAATCCATCGACAGGACGGCGATGCAGCTGGCCGGAGGAGGGCCGGGAAAGGCAAGAAGTGTCAGCCACTTGTAGCTGCAATCTAA

Protein sequence

MAVTYNVNGLGRNRKRKHAMSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEALSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEASNQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFATNIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYFWKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDLLQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPIIKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPPGYDEDISFSDLEDDDMVLPAKFKINPPGLTPQSSLTYYNNPKQTLHLAIWKFLCMDFELVRRGLGFSQRRKKWLVLLALMGVSGYGVYKVYHFPSVERKRKRLMKLFGTMISVAEMVADSSEAIGVISKDLKEFLKSDSDQIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMMRGYKSVTKNDGNLEADSVNSSSSSDVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDAYKSGSEFSDVPKWVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMDINVYNDLFSGLTNPTHQDKVKDMVVSVCNGAVETLVKTSHQVLTSSRSSSNLSPVSSCNGLSKLGDDVFSEEASSKKMAAGSSIESCQNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLLWKLMDGLKRSFGTVHDEVVGRGLEVIFVLSSFFSIRSNSMESRVRTSSSTHGSPACGKLDGVATWLINGFVTAFFGSLERCSCIRIATAEDDGDESNDAPLIPKDVIHRQDGDAAGRRRAGKGKKCQPLVAAI
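
The relationship between the coding sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration using the standard genetic code (NCBI translation table 1, which is an assumption for this nuclear gene); `translate` is a hypothetical helper, not the tool the database used. It is applied only to the first and last codons of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS and the last six (including the stop codon).
print(translate("ATGGCTGTCACTTACAATGTGAATGGG"))
print(translate("CTTGTAGCTGCAATCTAA"))
```

The two fragments correspond to the start (MAVTYNVNG) and the end (LVAAI) of the protein sequence listed above; translating the full CDS the same way should give the complete polypeptide.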
Homology
BLAST of Cp4.1LG02g06500 vs. ExPASy Swiss-Prot
Match: Q9SY57 (Protein PHLOEM PROTEIN 2-LIKE A10 OS=Arabidopsis thaliana OX=3702 GN=PP2A10 PE=2 SV=1)

HSP 1 Score: 401.7 bits (1031), Expect = 2.3e-110
Identity = 205/384 (53.39%), Positives = 280/384 (72.92%), Query Frame = 0

Query: 478 LVRRGLGFSQRRKKWLVLLALMGVSGYGVYKVYHFPSVERKRKRLMKLFGTMISVAEMVA 537
           L  +G+  SQRR+KWL+ +A+ GVSGYG YKVYH PSV RKRKRL KLFG ++SVAE+++
Sbjct: 6   LREKGIFLSQRRRKWLIFMAISGVSGYGAYKVYHLPSVARKRKRLFKLFGAIVSVAELIS 65

Query: 538 DSSEAIGVISKDLKEFLKSDSDQIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMMRGYK 597
           DS+E + ++S+D+K+FL SDSD+IPNSLKQI+K+  S EF++SL +V++A T+G  RGYK
Sbjct: 66  DSAETLSMVSRDVKDFLNSDSDEIPNSLKQIAKITTSNEFTDSLSRVSQAVTIGAFRGYK 125

Query: 598 SVTK-NDGNLEADSVNSSSSSDVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDD 657
           S +   D  +E  S +SS    V++K+FS AGTGF SVVVGSFAKNLV+G+YS  G V+ 
Sbjct: 126 SESSIGDSGIEKSS-DSSVVDRVIDKVFSEAGTGFVSVVVGSFAKNLVLGFYS--GKVES 185

Query: 658 AYK-SGSEFSDVPKWVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMDINVYNDLFSGL 717
             K  GS+ S+ P+WVT+  D+KC+ ++ADCI+ F STA+ VYLDKTMDIN Y+ +F GL
Sbjct: 186 GVKCEGSDSSETPRWVTLLGDDKCRELLADCIERFTSTAIGVYLDKTMDINTYDQIFEGL 245

Query: 718 TNPTHQDKVKDMVVSVCNGAVETLVKTSHQVLTSSRSSSNLSPVSSCNGLSKLGDDVFSE 777
           TNP HQD VKD++VSVCNGA+ET+V+TSH V TSSRS          N + ++ DD F  
Sbjct: 246 TNPKHQDSVKDVLVSVCNGALETIVRTSHDVFTSSRSK---------NVIEEIEDDDFKS 305

Query: 778 EASSK-KMAAGSSIESCQNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLLWK 837
             S++ KM + S      NGW + +++TLAVP NR+F+ D+TGRVT ETTRS++ F++ K
Sbjct: 306 NGSARSKMVSESGDGVKSNGWTEAIATTLAVPSNRRFMFDVTGRVTLETTRSIIAFIMVK 365

Query: 838 LMDGLKRSFGTVHDEVVGRGLEVI 859
              G ++S   VH+EV  RG + +
Sbjct: 366 TFQGFRKSINVVHEEVTDRGRQAV 377
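
The percentages in an HSP header follow directly from the reported counts over the alignment length. The sketch below recomputes them for this hit and counts identical positions in the final alignment block; both function names are hypothetical. Counting positives would additionally need the substitution matrix used by the search, which this page does not state, so it is left out.

```python
def percent(count, alignment_length):
    """Percentage of aligned columns, rounded to two decimals as in the header."""
    return round(100.0 * count / alignment_length, 2)

def count_identities(query, subject):
    """Count columns where query and subject carry the same residue (gaps excluded)."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")

# Counts taken from the HSP header of the Q9SY57 hit.
print(percent(205, 384))  # identity
print(percent(280, 384))  # positives

# Last alignment block of the same HSP (query 838-859 line vs. subject 366-377 line).
print(count_identities("LMDGLKRSFGTVHDEVVGRGLEVI", "TFQGFRKSINVVHEEVTDRGRQAV"))
```

Summing `count_identities` over every block of an HSP should reproduce the identity count in its header.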

BLAST of Cp4.1LG02g06500 vs. NCBI nr
Match: XP_023524215.1 (uncharacterized protein LOC111788188 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 833 bits (2152), Expect = 1.47e-295
Identity = 424/424 (100.00%), Positives = 424/424 (100.00%), Query Frame = 0

Query: 20  MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA 79
           MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA
Sbjct: 1   MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA 60

Query: 80  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 139
           LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS
Sbjct: 61  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 120

Query: 140 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 199
           NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT
Sbjct: 121 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 180

Query: 200 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 259
           NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF
Sbjct: 181 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 240

Query: 260 WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDL 319
           WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDL
Sbjct: 241 WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDL 300

Query: 320 LQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 379
           LQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI
Sbjct: 301 LQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 360

Query: 380 IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPPGYDEDISFSDLEDDDMVLPA 439
           IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPPGYDEDISFSDLEDDDMVLPA
Sbjct: 361 IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPPGYDEDISFSDLEDDDMVLPA 420

Query: 440 KFKI 443
           KFKI
Sbjct: 421 KFKI 424

BLAST of Cp4.1LG02g06500 vs. NCBI nr
Match: XP_022940623.1 (uncharacterized protein LOC111446162 [Cucurbita moschata])

HSP 1 Score: 805 bits (2079), Expect = 1.27e-284
Identity = 412/424 (97.17%), Positives = 414/424 (97.64%), Query Frame = 0

Query: 20  MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA 79
           MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQ        PQLDDEA
Sbjct: 1   MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQ--------PQLDDEA 60

Query: 80  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 139
           LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLN AEDLVAPPDWKSFEAS
Sbjct: 61  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNLAEDLVAPPDWKSFEAS 120

Query: 140 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 199
           NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT
Sbjct: 121 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 180

Query: 200 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 259
           NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF
Sbjct: 181 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 240

Query: 260 WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDL 319
           WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELK+SSDL
Sbjct: 241 WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKDSSDL 300

Query: 320 LQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 379
           LQEDD+SMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI
Sbjct: 301 LQEDDNSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 360

Query: 380 IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPPGYDEDISFSDLEDDDMVLPA 439
           IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILP GYDEDISFSDLEDDDMVLPA
Sbjct: 361 IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPLGYDEDISFSDLEDDDMVLPA 416

Query: 440 KFKI 443
           KFKI
Sbjct: 421 KFKI 416

BLAST of Cp4.1LG02g06500 vs. NCBI nr
Match: XP_022981253.1 (uncharacterized protein LOC111480445 [Cucurbita maxima])

HSP 1 Score: 796 bits (2056), Expect = 4.51e-281
Identity = 410/424 (96.70%), Positives = 414/424 (97.64%), Query Frame = 0

Query: 20  MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA 79
           MSWLALSIANTLRLDDEEDEQHNDVVSSSSS  TI RNQMESQSQSQSQSQ  PQLDDEA
Sbjct: 1   MSWLALSIANTLRLDDEEDEQHNDVVSSSSS--TITRNQMESQSQSQSQSQ--PQLDDEA 60

Query: 80  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 139
           LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDL APPDWKSFEAS
Sbjct: 61  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLAAPPDWKSFEAS 120

Query: 140 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 199
           NQSDPSISGDEEDLTD IEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT
Sbjct: 121 NQSDPSISGDEEDLTDSIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 180

Query: 200 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 259
           NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF
Sbjct: 181 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 240

Query: 260 WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDL 319
           WKVYFVLLHSRLNKQDAEALSTPQVAEARS+WMQELQKKTKPETFWCGRDTFELK+SSDL
Sbjct: 241 WKVYFVLLHSRLNKQDAEALSTPQVAEARSIWMQELQKKTKPETFWCGRDTFELKDSSDL 300

Query: 320 LQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 379
           LQEDDSS+GLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI
Sbjct: 301 LQEDDSSLGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 360

Query: 380 IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPPGYDEDISFSDLEDDDMVLPA 439
           IKDEDKNSTI SSSKFLVQNY+DESDNDWLEEDSG ILP GYDEDISFSDLEDDDMVLPA
Sbjct: 361 IKDEDKNSTIGSSSKFLVQNYNDESDNDWLEEDSGAILPLGYDEDISFSDLEDDDMVLPA 420

Query: 440 KFKI 443
           KFKI
Sbjct: 421 KFKI 420

BLAST of Cp4.1LG02g06500 vs. NCBI nr
Match: KAG7037753.1 (hypothetical protein SDJN02_01384 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 769 bits (1986), Expect = 4.17e-269
Identity = 408/478 (85.36%), Positives = 410/478 (85.77%), Query Frame = 0

Query: 20  MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA 79
           MSWLALSIANTLRLDDEEDEQHNDVVSSSSSS TIPRNQMESQSQ        PQLDDE 
Sbjct: 1   MSWLALSIANTLRLDDEEDEQHNDVVSSSSSS-TIPRNQMESQSQ--------PQLDDEP 60

Query: 80  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 139
           LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS
Sbjct: 61  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 120

Query: 140 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 199
           NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT
Sbjct: 121 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 180

Query: 200 NIAMHPETWIDFPIDEEEDNG--------------------------------------- 259
           NIAMHPETWIDFPIDEEEDNG                                       
Sbjct: 181 NIAMHPETWIDFPIDEEEDNGGVGKGGYWAGSMSNESPIHLKVKVFWPITIKSEVKIALS 240

Query: 260 ------------------DFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYFWKV 319
                             DFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYFWKV
Sbjct: 241 GSLEVDIKHSELKLETDQDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYFWKV 300

Query: 320 YFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDLLQE 379
           YFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELK+S+DLLQE
Sbjct: 301 YFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKDSTDLLQE 360

Query: 380 DDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPIIKD 439
           DDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPIIKD
Sbjct: 361 DDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPIIKD 420

BLAST of Cp4.1LG02g06500 vs. NCBI nr
Match: KAG6608414.1 (hypothetical protein SDJN03_01756, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 757 bits (1955), Expect = 1.16e-265
Identity = 388/401 (96.76%), Positives = 390/401 (97.26%), Query Frame = 0

Query: 20  MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA 79
           MSWLALSIANTLRLDDEEDEQHNDVVSSSSSS TIPRNQMESQSQ        PQLDDE 
Sbjct: 1   MSWLALSIANTLRLDDEEDEQHNDVVSSSSSS-TIPRNQMESQSQ--------PQLDDEP 60

Query: 80  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 139
           LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS
Sbjct: 61  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 120

Query: 140 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 199
           NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT
Sbjct: 121 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 180

Query: 200 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 259
           NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF
Sbjct: 181 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 240

Query: 260 WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDL 319
           WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELK+S+DL
Sbjct: 241 WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKDSTDL 300

Query: 320 LQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 379
           LQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI
Sbjct: 301 LQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 360

Query: 380 IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPPG 420
           IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILP G
Sbjct: 361 IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPLG 392

BLAST of Cp4.1LG02g06500 vs. ExPASy TrEMBL
Match: A0A6J1FPT7 (uncharacterized protein LOC111446162 OS=Cucurbita moschata OX=3662 GN=LOC111446162 PE=4 SV=1)

HSP 1 Score: 805 bits (2079), Expect = 6.14e-285
Identity = 412/424 (97.17%), Positives = 414/424 (97.64%), Query Frame = 0

Query: 20  MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA 79
           MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQ        PQLDDEA
Sbjct: 1   MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQ--------PQLDDEA 60

Query: 80  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 139
           LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLN AEDLVAPPDWKSFEAS
Sbjct: 61  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNLAEDLVAPPDWKSFEAS 120

Query: 140 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 199
           NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT
Sbjct: 121 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 180

Query: 200 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 259
           NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF
Sbjct: 181 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 240

Query: 260 WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDL 319
           WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELK+SSDL
Sbjct: 241 WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKDSSDL 300

Query: 320 LQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 379
           LQEDD+SMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI
Sbjct: 301 LQEDDNSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 360

Query: 380 IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPPGYDEDISFSDLEDDDMVLPA 439
           IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILP GYDEDISFSDLEDDDMVLPA
Sbjct: 361 IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPLGYDEDISFSDLEDDDMVLPA 416

Query: 440 KFKI 443
           KFKI
Sbjct: 421 KFKI 416

BLAST of Cp4.1LG02g06500 vs. ExPASy TrEMBL
Match: A0A6J1J1L0 (uncharacterized protein LOC111480445 OS=Cucurbita maxima OX=3661 GN=LOC111480445 PE=4 SV=1)

HSP 1 Score: 796 bits (2056), Expect = 2.18e-281
Identity = 410/424 (96.70%), Positives = 414/424 (97.64%), Query Frame = 0

Query: 20  MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA 79
           MSWLALSIANTLRLDDEEDEQHNDVVSSSSS  TI RNQMESQSQSQSQSQ  PQLDDEA
Sbjct: 1   MSWLALSIANTLRLDDEEDEQHNDVVSSSSS--TITRNQMESQSQSQSQSQ--PQLDDEA 60

Query: 80  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 139
           LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDL APPDWKSFEAS
Sbjct: 61  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLAAPPDWKSFEAS 120

Query: 140 NQSDPSISGDEEDLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 199
           NQSDPSISGDEEDLTD IEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT
Sbjct: 121 NQSDPSISGDEEDLTDSIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFAT 180

Query: 200 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 259
           NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF
Sbjct: 181 NIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYF 240

Query: 260 WKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDL 319
           WKVYFVLLHSRLNKQDAEALSTPQVAEARS+WMQELQKKTKPETFWCGRDTFELK+SSDL
Sbjct: 241 WKVYFVLLHSRLNKQDAEALSTPQVAEARSIWMQELQKKTKPETFWCGRDTFELKDSSDL 300

Query: 320 LQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 379
           LQEDDSS+GLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI
Sbjct: 301 LQEDDSSLGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPI 360

Query: 380 IKDEDKNSTIESSSKFLVQNYDDESDNDWLEEDSGTILPPGYDEDISFSDLEDDDMVLPA 439
           IKDEDKNSTI SSSKFLVQNY+DESDNDWLEEDSG ILP GYDEDISFSDLEDDDMVLPA
Sbjct: 361 IKDEDKNSTIGSSSKFLVQNYNDESDNDWLEEDSGAILPLGYDEDISFSDLEDDDMVLPA 420

Query: 440 KFKI 443
           KFKI
Sbjct: 421 KFKI 420

BLAST of Cp4.1LG02g06500 vs. ExPASy TrEMBL
Match: A0A6J1FR53 (protein PHLOEM PROTEIN 2-LIKE A10-like OS=Cucurbita moschata OX=3662 GN=LOC111446163 PE=4 SV=1)

HSP 1 Score: 718 bits (1853), Expect = 8.75e-251
Identity = 376/397 (94.71%), Positives = 386/397 (97.23%), Query Frame = 0

Query: 474 MDFELVRRGLGFSQRRKKWLVLLALMGVSGYGVYKVYHFPSVERKRKRLMKLFGTMISVA 533
           MDFELVRRGLGFS+RRKKWLVLLALMGVSGYGVYKVYHFP VERKRKRLMKLFG MISVA
Sbjct: 1   MDFELVRRGLGFSERRKKWLVLLALMGVSGYGVYKVYHFPYVERKRKRLMKLFGAMISVA 60

Query: 534 EMVADSSEAIGVISKDLKEFLKSDSDQIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMM 593
           EMVADSSEAIGVISKDLKEFLKSDSDQIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMM
Sbjct: 61  EMVADSSEAIGVISKDLKEFLKSDSDQIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMM 120

Query: 594 RGYKSVTKNDGNLEADSVNSSSSSDVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGS 653
           RGYKSVTKNDGNLEADSVNSSSSS VVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGS
Sbjct: 121 RGYKSVTKNDGNLEADSVNSSSSSGVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGS 180

Query: 654 VDDAYKSGSEFSDVPKWVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMDINVYNDLFS 713
           VDDAYKSGSEFSDVP+WVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMD+NVYNDLFS
Sbjct: 181 VDDAYKSGSEFSDVPRWVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMDVNVYNDLFS 240

Query: 714 GLTNPTHQDKVKDMVVSVCNGAVETLVKTSHQVLTSSRSSSNLSPVSSCNGLSKLGDDVF 773
           GLTNPTHQDKVKDMVVSVCNGAVETLVKTSHQVLTSSRS+SNLSPVSSCNGL KLGD++F
Sbjct: 241 GLTNPTHQDKVKDMVVSVCNGAVETLVKTSHQVLTSSRSTSNLSPVSSCNGLPKLGDNLF 300

Query: 774 SEEASSKKMAAGSSIESCQNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLLW 833
           SEEASSKKMAAGSSIES QNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLLW
Sbjct: 301 SEEASSKKMAAGSSIESSQNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLLW 360

Query: 834 KLMDGLKRSFGTVHDEVVGRGLEVIFVLSSFFSIRSN 870
           KLMDGLKRSFG VHDEVVGRGLEV+    S+FS +S+
Sbjct: 361 KLMDGLKRSFGIVHDEVVGRGLEVV----SYFSAKSS 393

BLAST of Cp4.1LG02g06500 vs. ExPASy TrEMBL
Match: A0A6J1IYZ5 (protein PHLOEM PROTEIN 2-LIKE A10-like OS=Cucurbita maxima OX=3661 GN=LOC111480446 PE=4 SV=1)

HSP 1 Score: 716 bits (1848), Expect = 4.99e-250
Identity = 376/397 (94.71%), Positives = 385/397 (96.98%), Query Frame = 0

Query: 474 MDFELVRRGLGFSQRRKKWLVLLALMGVSGYGVYKVYHFPSVERKRKRLMKLFGTMISVA 533
           MDFELVRRGLGFSQRRKKWLVLLALMGVSGYGVYKVYHFPSVE KRKRLMKLFG MISVA
Sbjct: 1   MDFELVRRGLGFSQRRKKWLVLLALMGVSGYGVYKVYHFPSVEWKRKRLMKLFGAMISVA 60

Query: 534 EMVADSSEAIGVISKDLKEFLKSDSDQIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMM 593
           EMVADSSEAIGVISKDLKEFLKSDSD+IPNSLKQISKLAKSEEFSESLEKVTEAFTVGMM
Sbjct: 61  EMVADSSEAIGVISKDLKEFLKSDSDRIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMM 120

Query: 594 RGYKSVTKNDGNLEADSVNSSSSSDVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGS 653
           RGYKSVTKNDGNLEADSVNSSSSS VVEKLF TAGTGFASVVVGSFAKNLVMGYYSIPGS
Sbjct: 121 RGYKSVTKNDGNLEADSVNSSSSSGVVEKLFLTAGTGFASVVVGSFAKNLVMGYYSIPGS 180

Query: 654 VDDAYKSGSEFSDVPKWVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMDINVYNDLFS 713
           VDDAYKSGSEFSDVP+WVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMD+NVYNDLFS
Sbjct: 181 VDDAYKSGSEFSDVPRWVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMDVNVYNDLFS 240

Query: 714 GLTNPTHQDKVKDMVVSVCNGAVETLVKTSHQVLTSSRSSSNLSPVSSCNGLSKLGDDVF 773
           GLTNPTHQDKVKDMVVSVCNGAVETLVKTSHQVLTSSRS+SNLSPVSSCNGLSKLGDD+F
Sbjct: 241 GLTNPTHQDKVKDMVVSVCNGAVETLVKTSHQVLTSSRSTSNLSPVSSCNGLSKLGDDLF 300

Query: 774 SEEASSKKMAAGSSIESCQNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLLW 833
           SEEASSKKMAA SSIES QNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLLW
Sbjct: 301 SEEASSKKMAAASSIESSQNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLLW 360

Query: 834 KLMDGLKRSFGTVHDEVVGRGLEVIFVLSSFFSIRSN 870
           KLMDGLKRSFG VHDEVVGRGLEV+    S+FS +S+
Sbjct: 361 KLMDGLKRSFGIVHDEVVGRGLEVV----SYFSAKSS 393

BLAST of Cp4.1LG02g06500 vs. ExPASy TrEMBL
Match: A5C9A8 (BSD domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_020688 PE=4 SV=1)

HSP 1 Score: 706 bits (1822), Expect = 8.10e-238
Identity = 452/964 (46.89%), Positives = 572/964 (59.34%), Query Frame = 0

Query: 20  MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA 79
           MSWLA S+AN+LRLDD+ D +  D        +TIP  + E  ++       +P+LD   
Sbjct: 3   MSWLARSLANSLRLDDDRDGEGED----DGDDTTIPETREEKNTE-------EPELDQHG 62

Query: 80  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 139
             R VKEDLTEFKQTLTRQ WGVA+FLAPPPP     P+   NP  D+ +  D +  E S
Sbjct: 63  --RGVKEDLTEFKQTLTRQLWGVASFLAPPPPAQSSAPS---NP-HDVESISDLEQCEPS 122

Query: 140 NQSDPSISGDEE--------------------DLTDPI------EVLNMRSNHDAYAKSG 199
           +QS      DEE                    ++++P       E+  M SN   +   G
Sbjct: 123 DQSVSGEISDEETFDSVGIRDDIAGIGGRFRPEISNPSNNRAVSEISEMDSNFLPFRSDG 182

Query: 200 ILQEECYEVDWEGAVGITDEVLTFATNIAMHPETWIDFPIDEEEDNGDFEMSDAQKEHAF 259
             ++   E D+EG  GIT+EVL FA NIA HPETW+DFP++EE+D  DF+MSDAQ++HA 
Sbjct: 183 --RDSVEEYDFEGIPGITEEVLAFARNIAHHPETWLDFPLEEEDDLDDFDMSDAQEDHAL 242

Query: 260 TIEHLAPRLAALRFELCPCHMSESYFWKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQ 319
            IEHLAPRLAALR ELCP HMS+ YFWK+YFVLLHSRLN+QDAE LSTPQ+  AR+MWMQ
Sbjct: 243 AIEHLAPRLAALRIELCPSHMSKGYFWKIYFVLLHSRLNRQDAELLSTPQIVAARAMWMQ 302

Query: 320 ELQKKTKPETFWCGRDTFELKESSDLLQEDDSSMGLETHSVSTLPWTFTSEPSMSSMSSN 379
           ELQKKTKPE  W GR T+  K+S+ L QED +S    +HS +    TF  EP    ++++
Sbjct: 303 ELQKKTKPEPDWSGRSTYYSKDSTSLHQEDSASTN-NSHSENMPFRTFALEPVSFPVTTD 362

Query: 380 CETEKYQIETSETQFIDKSVIVEKPIIKDEDKNSTIESSSKFLVQNYDDESDNDWLEED- 439
            ETEKY    SE+Q  DKSVI EKP ++   K+     SSK L+QNY+D+ D DW E++ 
Sbjct: 363 LETEKYTXAXSESQISDKSVIEEKPEVRT--KDFLPGPSSKVLIQNYEDDED-DWPEDEI 422

Query: 440 ------SGTILPPGYDEDISFSDLEDDD-------MVLPAKFKINPPGLTPQSSLTYYN- 499
                 S T++P G++ED+SFSDLEDDD       +VLP+ F  +P     +  L  Y  
Sbjct: 423 LELGAYSRTVIPVGHEEDVSFSDLEDDDYLILVIILVLPSVFGFDPSEKYGKLLLMIYPI 482

Query: 500 -----NPKQTLHLAIW----------------------------------KFLC------ 559
                N ++  HL+ W                                  + +C      
Sbjct: 483 VSFLFNKRR--HLSTWHLHPAGILNTRFRISDGVQKGELEIPHEFFEHTFQMVCEPENRR 542

Query: 560 --------------------------------------MDFELVRRGLGFSQRRKKWLVL 619
                                                 MD ELV++ L  ++RRKK +VL
Sbjct: 543 YESLISSKPIPAHTESYISYLFLIIHALTFQFQGLFWGMDLELVKKALDLARRRKKLVVL 602

Query: 620 LALMGVSGYGVYKVYHFPSVERKRKRLMKLFGTMISVAEMVADSSEAIGVISKDLKEFLK 679
           LA+ GVS +G YKVYH+P V RKR+RL KL G +IS+AEM +DS+E  GV+SKDLKEFL+
Sbjct: 603 LAVFGVSSFGAYKVYHWPYVVRKRQRLFKLLGALISIAEMXSDSAETXGVVSKDLKEFLQ 662

Query: 680 SDSDQIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMMRGYKSVTKNDGNLEADSVNSSS 739
           SD+DQIPNSLKQISK+ KSEEFS S+  V EA TVG++RGY+     D  +E D+    +
Sbjct: 663 SDTDQIPNSLKQISKIVKSEEFSGSMIGVMEALTVGILRGYRL----DARIENDAQTRGA 722

Query: 740 SSDVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDAYKSGSEFSDVPKWVTVAS 799
           S D                  GS   N            + A   G+E S  PKWV V  
Sbjct: 723 SHD------------------GSNENNR-----------NAAALDGAEVSTAPKWVNVVC 782

Query: 800 DEKCKNVIADCIQVFVSTAVSVYLDKTMDINVYNDLFSGLTNPTHQDKVKDMVVSVCNGA 858
            +KCK +IADCIQ+FVSTAV+VYLDKTMDIN Y++LF+GLTNP HQ KV+D++VS+CNGA
Sbjct: 783 GDKCKPLIADCIQMFVSTAVAVYLDKTMDINTYDELFAGLTNPKHQIKVRDILVSLCNGA 842

BLAST of Cp4.1LG02g06500 vs. TAIR 10
Match: AT1G10150.1 (Carbohydrate-binding protein )

HSP 1 Score: 401.7 bits (1031), Expect = 1.6e-111
Identity = 205/384 (53.39%), Positives = 280/384 (72.92%), Query Frame = 0

Query: 478 LVRRGLGFSQRRKKWLVLLALMGVSGYGVYKVYHFPSVERKRKRLMKLFGTMISVAEMVA 537
           L  +G+  SQRR+KWL+ +A+ GVSGYG YKVYH PSV RKRKRL KLFG ++SVAE+++
Sbjct: 6   LREKGIFLSQRRRKWLIFMAISGVSGYGAYKVYHLPSVARKRKRLFKLFGAIVSVAELIS 65

Query: 538 DSSEAIGVISKDLKEFLKSDSDQIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMMRGYK 597
           DS+E + ++S+D+K+FL SDSD+IPNSLKQI+K+  S EF++SL +V++A T+G  RGYK
Sbjct: 66  DSAETLSMVSRDVKDFLNSDSDEIPNSLKQIAKITTSNEFTDSLSRVSQAVTIGAFRGYK 125

Query: 598 SVTK-NDGNLEADSVNSSSSSDVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDD 657
           S +   D  +E  S +SS    V++K+FS AGTGF SVVVGSFAKNLV+G+YS  G V+ 
Sbjct: 126 SESSIGDSGIEKSS-DSSVVDRVIDKVFSEAGTGFVSVVVGSFAKNLVLGFYS--GKVES 185

Query: 658 AYK-SGSEFSDVPKWVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMDINVYNDLFSGL 717
             K  GS+ S+ P+WVT+  D+KC+ ++ADCI+ F STA+ VYLDKTMDIN Y+ +F GL
Sbjct: 186 GVKCEGSDSSETPRWVTLLGDDKCRELLADCIERFTSTAIGVYLDKTMDINTYDQIFEGL 245

Query: 718 TNPTHQDKVKDMVVSVCNGAVETLVKTSHQVLTSSRSSSNLSPVSSCNGLSKLGDDVFSE 777
           TNP HQD VKD++VSVCNGA+ET+V+TSH V TSSRS          N + ++ DD F  
Sbjct: 246 TNPKHQDSVKDVLVSVCNGALETIVRTSHDVFTSSRSK---------NVIEEIEDDDFKS 305

Query: 778 EASSK-KMAAGSSIESCQNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLLWK 837
             S++ KM + S      NGW + +++TLAVP NR+F+ D+TGRVT ETTRS++ F++ K
Sbjct: 306 NGSARSKMVSESGDGVKSNGWTEAIATTLAVPSNRRFMFDVTGRVTLETTRSIIAFIMVK 365

Query: 838 LMDGLKRSFGTVHDEVVGRGLEVI 859
              G ++S   VH+EV  RG + +
Sbjct: 366 TFQGFRKSINVVHEEVTDRGRQAV 377

BLAST of Cp4.1LG02g06500 vs. TAIR 10
Match: AT1G59510.1 (Carbohydrate-binding protein )

HSP 1 Score: 345.1 bits (884), Expect = 1.8e-94
Identity = 182/372 (48.92%), Positives = 258/372 (69.35%), Query Frame = 0

Query: 487 QRRKKWLVLLALMGVSGYGVYKVYHFPSVERKRKRLMKLFGTMISVAEMVADSSEAIGVI 546
           QRR+KWL+LLA+ GVSGYGVY+VY+   + +K KRLMKLF  ++S AEMV DS+E I ++
Sbjct: 15  QRRRKWLILLAVFGVSGYGVYRVYNSQYIAKKTKRLMKLFSGIVSFAEMVIDSAETISIV 74

Query: 547 SKDLKEFLKSDSDQIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMMRGYKSVTKNDGNL 606
           S+DLKEFL+S+S +IPNSLKQ+SK+ KS+EF++SL +V+EA  +G+ RGY S    D N+
Sbjct: 75  SRDLKEFLESNSHEIPNSLKQLSKITKSKEFTDSLARVSEAVAIGVFRGYNS----DPNV 134

Query: 607 EADSVNSSSSSDVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDDAYKSGSEFSD 666
           E +     S+  VV+++FS  G GF SVVVGSFAKNLV+G+YS  G ++     GS+ S 
Sbjct: 135 EKE-----SNLSVVDRVFSEEGAGFVSVVVGSFAKNLVLGFYS--GEIE----IGSDDSL 194

Query: 667 VPKWVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMDINVYNDLFSGLTNPTHQDKVKD 726
            P+W+ + SD+KC+ ++ADCI+ F S+AVSVY+DKT+ +N Y+ +F+GLTNP H+D  +D
Sbjct: 195 KPRWMNLLSDDKCRELLADCIERFTSSAVSVYIDKTVGVNTYDQIFAGLTNPKHRDSARD 254

Query: 727 MVVSVCNGAVETLVKTSHQVLTSSRSSSNLSPVSSCNGLSKLGDDVFSEEASSKKMAAGS 786
           ++VSVCNGA+ET ++TSH V TSS   ++ S   S                         
Sbjct: 255 VLVSVCNGALETFMRTSHDVFTSSGEKTDSSLRKS------------------------- 314

Query: 787 SIESCQNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLLWKLMDGLKRSFGTV 846
             E+ +NGW + +S+TLAVP NRKF+ D+TGRVT ET RS+++F++ K     KRS   +
Sbjct: 315 --ENRENGWAEALSTTLAVPSNRKFMFDVTGRVTLETMRSILEFVILKTSQSFKRSLDVI 344

Query: 847 HDEVVGRGLEVI 859
           H+EV  RG +V+
Sbjct: 375 HEEVTERGRQVV 344

BLAST of Cp4.1LG02g06500 vs. TAIR 10
Match: AT3G49790.1 (Carbohydrate-binding protein )

HSP 1 Score: 284.6 bits (727), Expect = 2.9e-76
Identity = 167/356 (46.91%), Positives = 233/356 (65.45%), Query Frame = 0

Query: 478 LVRRG-LGFSQRRKKWLVLLALMGVSGYGVYKVYHFPSVERKRKRLMKLFGTMISVAEMV 537
           L +RG   F+ + KKW+    L+ VSGYG ++VYH PS+ +KRKR+ KLF  ++++ E  
Sbjct: 6   LSKRGYFDFALKNKKWI----LLAVSGYGAFRVYHSPSISQKRKRISKLFTLLLNLIEAA 65

Query: 538 ADSSEAIGVISKDLKEFLKSDSDQIPNSLKQISKLAKSEEFSESLEKVTEAFTVGMMRGY 597
           +DS+E + VISKDL EFL+SDSDQIPNSLKQISK+AKS+E + SL + T+A TVG++RG 
Sbjct: 66  SDSAETVSVISKDLTEFLRSDSDQIPNSLKQISKIAKSDELNSSLIRFTQAMTVGLIRGI 125

Query: 598 KSVTKNDGNLEADSVNSSSSSDVVEKLFSTAGTGFASVVVGSFAKNLVMGYYSIPGSVDD 657
                       D   S  +  V++KLF+ +G+GFAS +VGSFA+NLV+  YS  G   +
Sbjct: 126 D-----------DGSGSGFTDRVMDKLFTKSGSGFASAIVGSFARNLVVALYSSAGDGSN 185

Query: 658 AYKSGSEFSDVPKWVTVASDEKCKNVIADCIQVFVSTAVSVYLDKTMDINVYNDLFSGLT 717
           +    + FSD             + +I DC+Q FVSTAVSVYLDKT D+NV++DLF+GLT
Sbjct: 186 SKLLDAVFSD-----------DGRRLIGDCVQRFVSTAVSVYLDKTSDVNVFDDLFAGLT 245

Query: 718 NPTHQDKVKDMVVSVCNGAVETLVKTSHQVLTSSRSSSNLSPVSSCNGLSKLGDDVFSEE 777
           NP H+ KVK  +V++CN AVET V+ S + +  +RS       SSC             +
Sbjct: 246 NPKHEGKVKQTLVTLCNSAVETFVRASRKPVQLNRS-------SSC-------------Q 305

Query: 778 ASSKKMAAGSSIESCQNGWIDTVSSTLAVPRNRKFVLDLTGRVTFETTRSVVDFLL 833
            SS+ +  GS+ ++    WID VSS+L+VP NRK+V+DLTGRVTFET RS+++ L+
Sbjct: 306 DSSQTLTVGSTKQAT---WIDRVSSSLSVPSNRKYVVDLTGRVTFETVRSLLEVLI 312

BLAST of Cp4.1LG02g06500 vs. TAIR 10
Match: AT1G10720.1 (BSD domain-containing protein )

HSP 1 Score: 235.3 bits (599), Expect = 2.0e-61
Identity = 171/432 (39.58%), Positives = 233/432 (53.94%), Query Frame = 0

Query: 34  DDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEALSRRVKEDLTEFKQ 93
           DD+ED  +++           P+   E    S++  +  P  ++EA +R VK+DLTE   
Sbjct: 20  DDDEDNNNDE---------KTPKASTERHDFSRNAVRLSP--EEEAQARGVKDDLTELGH 79

Query: 94  TLTRQFWGVATFLAPPPPPPEPGP---THHLNPAEDLVAPPDWKSFEASNQSDPSISGDE 153
           TLTRQF GVA FLAP P          ++H    +   + P      +S++ +  +  D 
Sbjct: 80  TLTRQFRGVANFLAPLPDGSSSSSSDLSNHPRFNQSRSSDPGLNQSRSSDRDESCVGSDT 139

Query: 154 E---------DLTDPIEVLNMRSNHDAYAKSGILQEECYEVDWEGAVGITDEVLTFATNI 213
                     DL + +   N     D   +     EE  E +   AV +TDEVL FA NI
Sbjct: 140 PETGIRFRSWDLEEKLAEGN--DPEDEEEEEEETDEEEEEEEEIAAVALTDEVLAFARNI 199

Query: 214 AMHPETWIDFPIDEEEDNGDFEMSDAQKEHAFTIEHLAPRLAALRFELCPCHMSESYFWK 273
           AMHPETW+DFP+D +ED  D EMSDAQ+ HA  IE LAPRLAALR ELCPCHMS  YFWK
Sbjct: 200 AMHPETWLDFPLDPDEDLDDLEMSDAQRGHALAIERLAPRLAALRIELCPCHMSVGYFWK 259

Query: 274 VYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKPETFWCGRDTFELKESSDLLQ 333
           VYFVLL SRLNK DA  LS+PQV EAR++WM+ELQ +T              KES D++ 
Sbjct: 260 VYFVLLLSRLNKHDAHLLSSPQVMEARALWMKELQNQTHSS-----------KESRDMIL 319

Query: 334 EDDSSMGLETHSVSTLPWTFTSEPSMSSMSSNCETEKYQIETSETQFIDKSVIVEKPIIK 393
           E++      ++  +  P  F S    +    +     ++      QFIDK+VI EKPI K
Sbjct: 320 EEEDITPSTSNYYNHAPPEFLSPRIYAFEPPSIMYRDFEHGFENAQFIDKAVIEEKPIQK 379

Query: 394 DEDKNSTIESSSKFLVQNYDDESDNDW-LEEDSGTILPPGY---DEDISFSDLEDDDMV- 448
           ++  ++++  +SK +V    D+ D+DW  EEDS     P +   ++D+SFSDLE DD + 
Sbjct: 380 NDKNSASLSQTSKDVV----DDDDDDWPEEEDSANSWAPMFTVNEDDVSFSDLEGDDDIS 423

BLAST of Cp4.1LG02g06500 vs. TAIR 10
Match: AT3G49800.1 (BSD domain-containing protein )

HSP 1 Score: 234.6 bits (597), Expect = 3.4e-61
Identity = 175/461 (37.96%), Positives = 242/461 (52.49%), Query Frame = 0

Query: 20  MSWLALSIANTLRLDDEEDEQHNDVVSSSSSSSTIPRNQMESQSQSQSQSQSQPQLDDEA 79
           M+WLA SIAN+L++D+EE +Q            TI  N +E+    Q  S S  Q     
Sbjct: 1   MAWLARSIANSLKIDEEEYDQ----------KETIKTNSIENSGSDQPSSPSVLQTQS-- 60

Query: 80  LSRRVKEDLTEFKQTLTRQFWGVATFLAPPPPPPEPGPTHHLNPAEDLVAPPDWKSFEAS 139
             R VKED++E  +TL  Q WGVA+FLAPPP   +           D V     KS + +
Sbjct: 61  -PRGVKEDISELTKTLRSQLWGVASFLAPPPSSSD---------TADHVDEETRKSSDLA 120

Query: 140 NQSDPSISGDEEDLTD----------------PI-EVLNMRSNHDAYAKSGILQEECYEV 199
              +  I+G   D  +                P+ +  NM SN       G+   +  +V
Sbjct: 121 EGDEDLIAGIRNDFVEIGGRFKTGISKLSGNLPVSKFTNMASNFLQLGSEGV-DSKNRDV 180

Query: 200 DWEGAVGITDEVLTFATNIAMHPETWIDFPIDEEEDN-GDFEMSDAQKEHAFTIEHLAPR 259
               A+G+T+EV+ FA ++A+HPETW+DFP  +E+DN  DFEM+DAQ EHA  +E+LA  
Sbjct: 181 AIGNAIGVTEEVVLFARDLALHPETWLDFPFPDEDDNFDDFEMTDAQYEHALAVENLASS 240

Query: 260 LAALRFELCPCHMSESYFWKVYFVLLHSRLNKQDAEALSTPQVAEARSMWMQELQKKTKP 319
           LAALR ELCP +MSE  FW++YFVL+H   +K DA  LSTPQV E+R++   EL +K   
Sbjct: 241 LAALRIELCPAYMSEYCFWRIYFVLVHPIFSKHDALTLSTPQVLESRALLSHELLRKR-- 300

Query: 320 ETFWCGRDTFELKESSD----------LLQEDDSSMGLETHSVSTLPWTFTSEPSMSSMS 379
                 +DT  + ESSD          L Q  + S   E   V T+    T E   S+  
Sbjct: 301 -----NKDTVVVPESSDRGADSENVEPLFQPTNPSPKSEPEPVKTI----TVETIHSAER 360

Query: 380 SNCETEKYQIETSETQFIDKSVIVEKPIIKDEDK--NSTIESSSKFLVQNYDDESDNDWL 439
           S  ETEK+ +ET E Q +DK VI E+P     DK   S +  SS  ++    D+  +DWL
Sbjct: 361 SEFETEKHTVETKEVQVVDKPVIEERPAPAYHDKPVQSLVTGSSPRVIDVQVDDDADDWL 420

Query: 440 --EEDSGTI------LPPGYDEDISFSDLEDDDMVLPAKFK 443
             E+++GT+      L    DED+SFSDLE+DD  +P  +K
Sbjct: 421 KDEDNAGTVSATTNHLVQDVDEDVSFSDLEEDDDDVPVSYK 427

The following BLAST results are available for this feature:
Match Name  E-value   Identity  Description
Q9SY57      2.3e-110  53.39     Protein PHLOEM PROTEIN 2-LIKE A10 OS=Arabidopsis thaliana OX=3702 GN=PP2A10 PE=2...

Match Name      E-value    Identity  Description
XP_023524215.1  1.47e-295  100.00    uncharacterized protein LOC111788188 [Cucurbita pepo subsp. pepo]
XP_022940623.1  1.27e-284  97.17     uncharacterized protein LOC111446162 [Cucurbita moschata]
XP_022981253.1  4.51e-281  96.70     uncharacterized protein LOC111480445 [Cucurbita maxima]
KAG7037753.1    4.17e-269  85.36     hypothetical protein SDJN02_01384 [Cucurbita argyrosperma subsp. argyrosperma]
KAG6608414.1    1.16e-265  96.76     hypothetical protein SDJN03_01756, partial [Cucurbita argyrosperma subsp. sorori...

Match Name  E-value    Identity  Description
A0A6J1FPT7  6.14e-285  97.17     uncharacterized protein LOC111446162 OS=Cucurbita moschata OX=3662 GN=LOC1114461...
A0A6J1J1L0  2.18e-281  96.70     uncharacterized protein LOC111480445 OS=Cucurbita maxima OX=3661 GN=LOC111480445...
A0A6J1FR53  8.75e-251  94.71     protein PHLOEM PROTEIN 2-LIKE A10-like OS=Cucurbita moschata OX=3662 GN=LOC11144...
A0A6J1IYZ5  4.99e-250  94.71     protein PHLOEM PROTEIN 2-LIKE A10-like OS=Cucurbita maxima OX=3661 GN=LOC1114804...
A5C9A8      8.10e-238  46.89     BSD domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_020688 PE=4 S...

Match Name   E-value   Identity  Description
AT1G10150.1  1.6e-111  53.39     Carbohydrate-binding protein
AT1G59510.1  1.8e-94   48.92     Carbohydrate-binding protein
AT3G49790.1  2.9e-76   46.91     Carbohydrate-binding protein
AT1G10720.1  2.0e-61   39.58     BSD domain-containing protein
AT3G49800.1  3.4e-61   37.96     BSD domain-containing protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description         Source       Source Term     Source Description             Alignment
IPR005607  BSD domain              SMART        SM00751         wurzfinal6                     coord: 219..271; e-value: 5.7E-14; score: 62.4
IPR005607  BSD domain              PFAM         PF03909         BSD                            coord: 219..275; e-value: 1.8E-12; score: 47.1
IPR005607  BSD domain              PROSITE      PS50858         BSD                            coord: 226..271; score: 11.574845
IPR035925  BSD domain superfamily  GENE3D       1.10.3970.10    BSD domain                     coord: 189..285; e-value: 1.2E-8; score: 36.9
IPR035925  BSD domain superfamily  SUPERFAMILY  140383          BSD domain-like                coord: 189..276
None       No IPR available        MOBIDB_LITE  mobidb-lite     disorder_prediction            coord: 44..77
None       No IPR available        MOBIDB_LITE  mobidb-lite     disorder_prediction            coord: 103..152
None       No IPR available        MOBIDB_LITE  mobidb-lite     disorder_prediction            coord: 38..79
None       No IPR available        MOBIDB_LITE  mobidb-lite     disorder_prediction            coord: 105..119
None       No IPR available        PANTHER      PTHR31923       BSD DOMAIN-CONTAINING PROTEIN  coord: 20..443
None       No IPR available        PANTHER      PTHR31923:SF27  BSD DOMAIN-CONTAINING PROTEIN  coord: 20..443
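
The `coord: start..end` values in the Alignment column can be compared directly to see how the domain predictions relate to one another. The following is a small Python sketch of that comparison, assuming the coordinates are 1-based, inclusive residue positions on the predicted protein. That convention is an assumption here, and the helper names `parse_coord`, `span_length`, and `contains` are hypothetical.

```python
import re

# "coord: 219..275" -> (219, 275); positions assumed 1-based and inclusive.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(alignment):
    """Extract (start, end) from an Alignment cell."""
    m = COORD_RE.search(alignment)
    if m is None:
        raise ValueError(f"no coordinates in {alignment!r}")
    return int(m.group(1)), int(m.group(2))


def span_length(span):
    """Number of residues covered by an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


# Values copied from the table above.
pfam_bsd = parse_coord("coord: 219..275")      # PFAM PF03909
gene3d_bsd = parse_coord("coord: 189..285")    # GENE3D 1.10.3970.10
panther = parse_coord("coord: 20..443")        # PANTHER PTHR31923

print(span_length(pfam_bsd))
print(contains(gene3d_bsd, pfam_bsd), contains(panther, gene3d_bsd))
```

Under that assumption, the PFAM BSD hit spans 57 residues and sits inside the broader GENE3D superfamily prediction, which in turn sits inside the PANTHER family match.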

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG02g06500.1  Cp4.1LG02g06500.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane