Cp4.1LG04g00020 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG04g00020
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Protein of unknown function (DUF1399)
Location: Cp4.1LG04 : 2241558 .. 2252318 (+)
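
The location string above can be split into its parts programmatically. The sketch below is illustrative only: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style annotations, but not stated on this page).

```python
import re


def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into a dict.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cp4.1LG04 : 2241558 .. 2252318 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the gene span is 10761 bp on the plus strand of Cp4.1LG04.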
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CTCCTTTTTCACTACCCATGTCCTCCCACCGTCCCGGTTCCGATCTTTCTGCCTCCGTGTCGGCGAGAAGCCTCGGCGACATCTCCCAACTCACTACCATCCGAATCGGCGTCGACATTATTTCGGCCGTCCGGCGGAATCTCGCCTTTCTCAGAACCGTCCTCCATTCCCACTGGCTCCACTCTGAGCCCACTCTCACTCAAGCCATCAGAAGGTCAAATCTCAACTGGAGCCACCGTTTTTTCAACTTCTTAATGCGCTGCTGTCTATTCAAGTTTTGCATTTAATGCTAAGAGGGATTTTCTTGGGGTTTTATTTTATTTTTTTCTAGGTATGAGGAGCTCTGGATGCCGTTGATTTCTGATCTAACGGTTGATGGGGCTTCTCCTCCGATGATTCTTCCTCCTTTGGATGTTGAATGGGTTTGGTTCTGTCACACGTTGAATCCGGTTAGTTTTAATTTCTGTTTGTTTCCCCAGAAAATGCCTAATGAATCACATGTCCTTTAGATTGCTTCTGATTGGTTTAATCTGTATGCACTTTGCTCTAAATTGATTTATGTTTGGTTTTAGAGTTTCAAAAGTGCTTTTATAATCAATCACTTGGCTCAAAAGTACTTACAGAAGTGATTTCTTATGGGGGCTTTTTAAGTGCTCTCTATTGAGAAATTTTATTGGTATTTTTTTTCTTTCCCTCTTTTTGTACTTCCTCTGCAGCCAAATAAGTATCAAAGGTAATGATATATTTCTTTCAGGTTCGTTACAAACATTACTGTGAGTCCAGATTCTCTAAGCTTATTGGGAAACCATCGATTTTTGATGAAGAAAATGAAGAATATGCATATATGAGGTGCCAAGAGATATGGGTCAAGAGGTATCCAACCCAGTCTTTTGAAAATGAGGAAGAATCCAGTTTCAGAGATGATATTACTGTTGAAAATCAGGAGCTTTTGGAGGAAGTGATGAGGCAAAGGCACTTATATTCTAACTTTTTGGAGCCATATCAGTCCGAGATCGTTTACCTTATTGCAGCTAGACAAAGATACATGGGGTTTCTGTATATGTTGCAAAGGTTTTCTGATGAATGTTCTTCGTTTGTGCCTTCCTCTGATATTCTCCTCATGTGGCTTACCCATCAGGTTGATGCTCTCTCAATTTTCCATCTCATAGTTGATGGGGTTTTCTTTAATTTCTTAGGGGTTGTTTGATAATGAAACAAANTTTCACAAAAAAAATAAATAAAACGATAAAAATACATAATAACAAGACATGGGTGTGTTAATGAATGAATATTTAGGTTATGGTTTTGAATCTATTTTATGTTGCATATGTTAGGAATCACGGATCTCTATAATGATATGATATTGTCCACTTTGAGCATAAGTTTTCGTGGCTTTGCTTTGGGCTTCTCCAAAAGGCCTCATCCTAATGGAGATGTATTCCTTACTCATAAACCCATGATCATTTCCTAAATTAGTCAATGTAGGACTCTCTCCCAACAATCCTCAACGGATGATCGATTAGAGGTTGAAAATTTCCTCCGAGCATGATACTACCCTTTACTTAGAATGATGCTCGCAGTGGTGTCTCTAAGCACAAGAGCCTTACTTTGAAATTCGATTACATTAGTCGAGCTTAGCTATGTCTATACAACGTATTGCGATGTATATATATTTCCTCAGCTCCAGTCATGTTCTTGTTTATTCAATGAGTTCGCTTACTAATGTATCTAGAACTGTAGTTCTTGATATCACTTTTACGATAAAGGTGGAATAAAAAATTTAATTGCGAATGAACATGAAGTTCTGTATTCATGATTGAAGATGCAAATGGCAGGTGGAAGTGTTGTTATCATACTGTGTTGTCCGCCTTTACTTCAGATGTCTTACGTTTATGTTTTTGTTTGTTTGTTATATGTAAACCAGAGTTATCCAACAGTTTATGCAGAAGATGTAAAGGAGATGCAGGGGGAGTTGGCAAAAGTGGTGAGATTTGGGGAAA
CTGTGAAGCCGAAGGAACTGGAAGAAACCAAGAAACTATGGCACAGAACTTTTGGGCAATCTTACGAGAAAGCTGGAGGAGAGGTAATCATGGAATTAGGCAGAGTTGTAACTTCTAAGCCTTTAGTTTACTGGGAAGTGTCCTTCTCGGATGTCAACTCGAAGTACAAGTCCATGACATCTCGGTTCATCCTCGAGGTTGGTTAACTCACTTGATTCTCTGTACTTTTCATGTTGGAAATAAGTACATTTTGTAAAGGAAGCTGAATTCTGTAGCAACTTTATCCTTGCTTCAAATTCAGGTTTCTGTGTTTATGCGGCTCAAAGATCAAAAACAGCCATTGCAACAAGTTGCTTCTCGAGAATTTCTTCGTCTACGCACTCTAAGATGCCATAAGGAATTTAAGCTCAAACAACCAATCTCGAGCCTCGACAATGATGTTTGGATCAAAGCTTGGCATCTCTATTGTGAATTTGGTACCAAGGGAGTTGTTATCGAGCTTCGGCATCCTGGTAGCCATTGTTTCAAAGGAAGTAGCACTAAAGACACCACCACATTCAAGTGGAATGACTTGATAAGAGCACCGTCTCTTACTTTAGAAAGACAATTCGATCAGAATCTCAAGATAGTTGCCTCAATAACTCCACCCGTTCAAGCACCGTACTTGTTAAAATGTGTGCCAGACAAAGTGACAGATGATTCGGGGGCAATGGTTTCGGATGTTGTTCTTAGAATGAACCAATACCGGCCTCAAGAAGGTCGGTGGCTATCTCGAACTGTACTCGACCATGGAGGGAGAGAGTGTTTTGTCATTCGTTTAAGGTAATAATTCAATCAATAGTTCATTGTCATTTGTCATTCCGTTTACTATTTGTTGAGTTCTGAAGGCTTACTAATGTTCGTTAAATAGAGATGCAACATAACAAATGTGAATGTAAAATGCAGAGTTGGAGCGGGATTTTGGAGGAGAGGAGGTGAAACTCCTTTGCCTGTGAAATGGGAGGACCGAATTATCGAGATTCGAGAAGGTTGTTGGTCCTATATTGCTGGCTCCATCGGCAGATGCCCGGGTATGGCGTGTCTGTACTTGAATCTCATCTCTCTAAGATTGAAAGTTATTTCAAATTTTAATTCTTTTTTTCCTTTTCATTTATTTCTAAGAAAGTATCGTGAGATCCAACAACGGTTGGAGAGGGGAACGAAACATTCTTTATAAGGGTGTGGAAATCTCTCCCTAACAGACGCGTTTAAAAACCATGAGACTAACGATGATACGTAACGGGCCAAAACGGACAATATTTGTTAGCAGTGGGCTTGGGTTATTACAAATGGTATCAGAGCCAGACACCAGGCGTTGTGCCAGCGAGGACTCTGGGCTCCCAAGGGGGGTGGATTGTGAGATTCCACATCAGTTGGAGAGGGCAACAAAACATTCCTTATAAGGGTATACAAACCTCTCCCTAACAGACGCGTTTTAAAACCGTGAGGCTAACAGCGATACGTAACGGGCCAAAGCGGACAATATTTGCTAGCGGTAGACTTGAGTTGTTACAAATGATATCAGAGCCAGACACTAGACGGTGTGCCAGCGAGGATGCTGGGTGACACAATCGCGCTTTCGAAGGCAGCGGACTAGCGATTGTGCGGCACTCACTTTCCAGCGACAAGTGAGCTTAAGCCAATTTGTTCTCGCGTCGCCTACGGCCAAGCGACGTATGATCGAGCCTCTAGCCTCGGTTTTAAAAGGTTTTAGAAAACGAGGAGAGAGAGAGATTTTAGAAAGAGAGATTTTGGAAAGAGACGTTATATAAGCAAGCAAGAGAAAGATAACAACGGCTTAGCATATATAACCTTGGGTAGGGAGAAGTTGACAACTAGTCATATCCCCCTATAGTACATGCTAAGCCATTTTAAGTCTGGCAGGCCTAGAAATGATATTTTTGAGGCGTCATACGCCCTAGAATTACAAAAGGAAAAACACTAGAATGAAAAGACATGAC
AAGGGTTTGGCTTGTCATGGGCATGCAGCCATGGGCAACATGACCTGGACGAGCATGACTGTGACAATCCTCCCCCACCTAATTGGTTGACGTCCCTGTCAACCGACTACTTTCAAACGTTGCGATGTGGGTGGCGGCGGAGTTCAGATCTTCGGCGCGTTCCCAGCTGGTTTCCTCTGTAGGGAGATTCTTCCATTTGACAAGGAATTCACGAATCGTCCGTACAGGTCTTCCTATCTTCCTAACCCTGTCGGCTAGGATCTCTTCTACTTCTTTTGTTTCTTTTTGCTTCAGATCGATGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCGTCTTCACTACGAACTGTGATCCCAAGAGATACTGTCTCCAGACTCGAAGGCAGTGGACTACAGCCAGAATTTCTTTCTCGGAGACGGTGTATCTNGTTATCCGTCTTCACTACGAACTGTGATCCCAAGAGATACTGTCTCCAGACTCGAAGGCAGTGGACTACAGCCAGAATTTCTTTCTCGGAGACGGTGTATCTTCTTTCGGCATCGTTGAGCTTTCGACTTTCGTAAGCGATGGGGTGGCCTTCCTGAATAAGGACCCCACCTAGATCAAAGTCGGAAACATCGGTTTCTATTTCAAATGACTTTGTAACATCTACCAACCCGAGGACAGGACCCCTCGTCATGGTTGTTTTCAGATTTTCAAAGGCCATTTGACAATCATTCGACCACGACCAAGAGTGGTCTTTCTTCAACAGCTTTGTCA
ATGGGGCGACTCGTCGTGAAAACCCTTCGACGAACCGCCTATAGTAGTTGGCTAATCCTAAGAAGGACCGCAACTCGGATACGGAAGTAGGAACTTTCCATTCTTGGATAGCTTTTATCTTGTCGCTATCCATACTAATTTGTCCACATCTGACGACATGTCCAAGGAAGTTGATGCATGTTTGTGCGAATGCACATTTCTCTTTCTTCACATACAGTTGGTTTTGCCGTAGCTTGTCAAACACCAGCTTCAAGTGCACTTTGTGTTCCTCGAGGGTTGTGCTATAAACAACGATGTCGTCGAGGTATACTATGACGAACTGATCCAAGTATTTGTAGAAAACCTGGTTCATCAACGTGCAGAAAGTAGCTGGGGCGTTTGTCAAGCCAAAGGGCATTACTAGGAATTCAAAGGCCCCATATCTTGTTACGCACGTCGTCTTAGGTTCGTCCCCTTCGGCAATACGTACTTGATAATACCCTGATCGTAAGTCCAACTTCGTGAAGTACTTGGCCCCGTGAAGTTGGTCGAACAAGTCGGATATTATCGGCGGAGGATATTTGTTGCGTACCGTCACCTTGTTTAAGGCTCTATAGTCTATGCACAGACGCAACGTCCCATCCTTCTTCTTCTGGAACAGTACGGGGGCTTCATAAGGTGCCTTTGCCGGGCGAATGAATCCCGCCTTCAGCAACTCATCCAGTTGTTTCCTCAATTCGGCTAGCTCGGGCGGAGCCATCCGGTATGCGTTTTTCGCTGGCGGTTTAACCCCGGGGAGGAGTTCGATTTCATGGTCAATGCCTCGACGAGGTGGTAGTGTTTGGGGTAGACTCTCTGGCATTATGTCGATATAGCTGTTTAGTACTTCCTTGATTTCCTCTGGAACAGTCTCTTCGGTGGTTGCTTCTTCCATCAGTGGTATGGCCATGAATGTAGGTTCCTCTCGTGCGAGTCCCCTCTTCAGCTGTATGGCCGAGATCATTCAAAGATTACCTGGTTGCTTGATGCTTGCAGGTATTACTGTGGGGTTATGGTCGGTGATCACCAAGCATTTTGCCAGTGGCATTGGGATGACCTTGTGTTCTAGGAGGAACTCCATCCCGAGTACCACGTCGAAGTCATCCATGCGAACCACGACAAGGTCTAGCTCTCCAGTCCAATCCCCTATTTTGAAGGGGACTCCTTTGGAANCGAGTACCACGTCGAAGTCATCCATGCGAACCACGACAAGGTCTAGCTCTCCAGTCCAATCCCCTATTTTGAAGGGGACTCCTTTGGAAACTCCCACAATAGGCAAGGCCTCGGAGTTGACAGCTTTCATTTTCCCCGGGTCCTTTCCTATGGTGAGTCCCAATCTTCTGGCTTCTTGATCAGCAATGAAGTTGTGGGTTGCTCCAGAGTCTATCAGAGTGCTCTTGCTCAGTCGAGAGTTTATGGTGGCATCTACGAACATAAGCCCTTTCTCTATTATCTCCTTCGGTTCGACTTTCCGTTGGAGGGCTGACAAGAATTTGAGTGCGCCCATTCGGGGGTTGTCGTGATCTTCTTTCTTGTCTAGCAAAGTCTCGACCCTTGCCTCATTGCTCTCTTGGATGGACACTTGGAGTGCAGTGAGAGAGGCCCGATGAGGACAGTAGGATACCTTGTGGGGGCCTTTACATAGCATGCATTGCAAAGGCGTCGTCGGGTGGTTCTTTTGGTGGTAAGGCCCTCGAGATGTCCCCGCTTGATTGTTCTGAGGAGGTCTATCTGTCCATCCATTCGATCTGTTGGGGCTTCCATTTCGATGGCCTGGTGGTCGGTATGGTTTGCCCCCATTGTTTGGGGCTGGCGTCGCTCTTCTTTGGGATCCTGCTTCGTTCCCACAGTCCACGAGTCTTTCGGCGCAGGCCATTGCGGCAGCTAGGGTTTGGACCCTGTTCTCATGTATCTTTGTTTTGGCCCACGGTTGTAGACCATTTATAAAGAAGAACACCTTGTCCTTCTCTGCTGTACCCCTG
ATATCCAGCATCAGGGTTGAGAATTGTCTGACGTAGTCCCGTATGTTTCCAGTGTGTTTCAGGGCTACTATTTTCTCCATTGCTATGTGTCCTGCGTTTTCGGGGAGGAATTGTTCCCTCAACTCTCTCTTGAGGTCCTCCCATGAGTCAATGGTGCACAATCCATCCTCGATGTCTTGCACCTTCGTACGCCACCACAGTTTGGCGTCGTCTACGAGATACATCGCAGCTACAGTCACCTTCTTGTCGTCGGTACAAGCCGTTGTGGCTTTGAAGTACTGTTCGACGTCAAAGATGAAGTTCTCCAACTCTTTGGCGTCCCGATTCCCTTTGAAGGCTCTAGGGTCTGGGAACCTTAGTTTGTTGGATCCTGCACTAGTTTGTCCAGCCGTGACACCTTCCACGGCTTTCATGGTAACTTCAATTCGAGTGCTCATGGCGGACATCCTTTCTTGGAGGTCGTCGACTGTTGATCTGAATTCATCAGCCAATCCATTGAACAAACTCATCATTGTGTTTTGTAGTACGTCGAACTCTTCGCCACGTCCCTCTTTGTGTGCGACAGAGCTATCCGGACTATCAGACGGTTTTGGGCTGCTCGTAGGAGCAACTCTTTCTTCGAGCGAGGTCACTCGAAACAACAGTTCTGCGATTGGCAACCCATCTAGGCGGTTGCCCATTGCGTCGACTTGGACGACTTTCTCACTCAGTTCGGTCACCCGTGTTTCCAGGAAACGGAGGGTGTCAGGGACTTCTTTGAGGAACAGCATTTCCTCCTCTATTAGCGTAAGCCGATCGGCTTGTGCTTTGGGTGTCGACTTTGTCGTCGACATGATTTCGGTCTGTATGACGTTACGGAGCTAACAAAGCTCTGATACCACTTGACACAATCGCACTTTCGAAGGCAGCGGACTAGCGATTGTGCGGCACTCACTTTCCAGCGACAAGTGAGTTCAGCCAATTTGTTCTTGCGTCGCCTACGGCCAAGCGACGTACGGTCGAGCCTCTAGCCTCGGTTTTAAAAGAAGGTTTTAGAAAACGAGGAGAGAGAGAGATTTTTGAAAGAGAGATTTTAGAAAGAGACGTTATATAAGCAAGCAAGAGAAAGATAACAACGGCTTAGCATATATAACCTTGGGTAGGGAGAAGTTGACAACTAGTCATATCCCCCTATAGTACATTCTGAGCTATTTTAAGTCTGGCAGACCTAGACATGATATTTTTGAGGCTTCATACGCCCTAGAATTACAAAAGGAAAAACACTAGAATGAAAAGACATGACAAGGGTTTGGCTTGTCATGGGCATGCGGCCATGGGCAACATGCCCTGGACGAGCATGACCGTGACACTGGGCTCCCAAGAGGGGTGGATTGTGAGATCCCACATCAGTTGGAGAGGGGAACAAAATATTCCTTATAAGGGTGTGAAAACCTCTCCCTAACAGACGCGTTTTAAAACCATGAGGCTGACGGCAATACGTAATGGGCCAAAGTGGACAATATTTGCTAGCGGTAGGCTTGAGTTGTTACAAATGTATTCGGGGATGTATCTGTTAAAGCCTATCATAAAATTCATTTTTGATTCCTGAAATTTGAACTTAATATATTCTTATGTTAGACTTTGCAGACGTGTTTTTTTTTGAATTAAGGACCCATTAGAAATTAGTTTCTAACTTTAGGGACCCATTAGCAATCAAATTCAAACTTCGAGAACGACATGTTACTTTCGAAACCTCGGAGTCTCCCTACTATTAAGTGATTGATTGATATGGAACAAACAGATGGATTCTTGGTTAAATTCTGATAATTGATTCATATATTGGCATCTAGAACAGAGAAATTGGTGGGAACAGCAACACCAAAACAGCCACTAGAGGAATGGAAAGCTGCTTGGAATTTTTCAACTGGAGATGAACTCATAATCCAATGGGACACTTCAACACCAAAACCAGCCCTTACCTTTTCCTTAACAAATCCAACTTCTTCAGAATCATCTGTAGG
TCGTCGTTTCGTATCGTATCTTGTTCAAACAGTCTTTTCGATTTCAGGTTTCAATATGTTTTGATGCTGTAGGTAAGGCTTCTAAAAGGTCGCCAAATGGTATATCGAGTACAGAGAGACAATGGCAGAGAGGAGGAAGAAGAGGAAGGCGGAGATGGAAATGGGTTTGTAACGGTGATTCGATATACCAATGAGGACCCAACTGGAAGAGCAACAGCTCTTCTAAACTGGAAGTTGCTGGTGATCGAGTTGTTGCCTGAGGAGGACGCTGTGTTGGCTCTCCTTCTCTGTGTCTCGATCCTCAAAAGCGTGTCGGAAATGAAGAAAGAAGATGTTGGGAAGTTGTTGATCCGAAGGAGATTGAGGGAGACAACAATTGGCCTTAGAGATTGGGGCTCCATAATGCTTCACCCATCTTCGTCGCCTTATCTTCAACCGTGGTATTGGAATGCCGAGACCGTGATGGCATCGAACAACATCGACCACCTCTCAAGACAGCGTGCATCAAGCTACTTGCCCGTGGAGGGTGGAGATAAGCTGTACAAGCAAGGAATAATATCATAGAATGATAAAAAATGAGGAAACAACATGTTTTGTTGCTCACTCATTTTTCCTTTCTTTGCTCTATAAATTAAGTACATCAAGGTGTGTTTGTACAGCTATTTTTTTAGTTTCAATTTTATTGTCTGTAAAAATGATTAATCATTAGCTCCACCCTCCCGGCTTGCTCGAACATATCAATTGATACTGACACACGTTTC

mRNA sequence

CTCCTTTTTCACTACCCATGTCCTCCCACCGTCCCGGTTCCGATCTTTCTGCCTCCGTGTCGGCGAGAAGCCTCGGCGACATCTCCCAACTCACTACCATCCGAATCGGCGTCGACATTATTTCGGCCGTCCGGCGGAATCTCGCCTTTCTCAGAACCGTCCTCCATTCCCACTGGCTCCACTCTGAGCCCACTCTCACTCAAGCCATCAGAAGGTATGAGGAGCTCTGGATGCCGTTGATTTCTGATCTAACGGTTGATGGGGCTTCTCCTCCGATGATTCTTCCTCCTTTGGATGTTGAATGGGTTTGGTTCTGTCACACGTTGAATCCGGTTCGTTACAAACATTACTGTGAGTCCAGATTCTCTAAGCTTATTGGGAAACCATCGATTTTTGATGAAGAAAATGAAGAATATGCATATATGAGGTGCCAAGAGATATGGGTCAAGAGGTATCCAACCCAGTCTTTTGAAAATGAGGAAGAATCCAGTTTCAGAGATGATATTACTGTTGAAAATCAGGAGCTTTTGGAGGAAGTGATGAGGCAAAGGCACTTATATTCTAACTTTTTGGAGCCATATCAGTCCGAGATCGTTTACCTTATTGCAGCTAGACAAAGATACATGGGGTTTCTGTATATGTTGCAAAGGTTTTCTGATGAATGTTCTTCGTTTGTGCCTTCCTCTGATATTCTCCTCATGTGGCTTACCCATCAGAGTTATCCAACAGTTTATGCAGAAGATGTAAAGGAGATGCAGGGGGAGTTGGCAAAAGTGGTGAGATTTGGGGAAACTGTGAAGCCGAAGGAACTGGAAGAAACCAAGAAACTATGGCACAGAACTTTTGGGCAATCTTACGAGAAAGCTGGAGGAGAGGTAATCATGGAATTAGGCAGAGTTGTAACTTCTAAGCCTTTAGTTTACTGGGAAGTGTCCTTCTCGGATGTCAACTCGAAGTACAAGTCCATGACATCTCGGTTCATCCTCGAGGTTTCTGTGTTTATGCGGCTCAAAGATCAAAAACAGCCATTGCAACAAGTTGCTTCTCGAGAATTTCTTCGTCTACGCACTCTAAGATGCCATAAGGAATTTAAGCTCAAACAACCAATCTCGAGCCTCGACAATGATGTTTGGATCAAAGCTTGGCATCTCTATTGTGAATTTGGTACCAAGGGAGTTGTTATCGAGCTTCGGCATCCTGGTAGCCATTGTTTCAAAGGAAGTAGCACTAAAGACACCACCACATTCAAGTGGAATGACTTGATAAGAGCACCGTCTCTTACTTTAGAAAGACAATTCGATCAGAATCTCAAGATAGTTGCCTCAATAACTCCACCCGTTCAAGCACCGTACTTGTTAAAATGTGTGCCAGACAAAGTGACAGATGATTCGGGGGCAATGGTTTCGGATGTTGTTCTTAGAATGAACCAATACCGGCCTCAAGAAGGTCGGTGGCTATCTCGAACTGTACTCGACCATGGAGGGAGAGAGTGTTTTGTCATTCGTTTAAGAGTTGGAGCGGGATTTTGGAGGAGAGGAGGTGAAACTCCTTTGCCTGTGAAATGGGAGGACCGAATTATCGAGATTCGAGAAGGTTGTTGGTCCTATATTGCTGGCTCCATCGGCAGATGCCCGGAGAAATTGGTGGGAACAGCAACACCAAAACAGCCACTAGAGGAATGGAAAGCTGCTTGGAATTTTTCAACTGGAGATGAACTCATAATCCAATGGGACACTTCAACACCAAAACCAGCCCTTACCTTTTCCTTAACAAATCCAACTTCTTCAGAATCATCTGTAAGGCTTCTAAAAGGTCGCCAAATGGTATATCGAGTACAGAGAGACAATGGCAGAGAGGAGGAAGAAGAGGAAGGCGGAGATGGAAATGGGTTTGTAACGGTGATTCGATATACCAATGAGGACCCAACTGGAAGAGCAACAGCTCTTCTAAACTGGAAGTTGCTGGTGATCGAGTTGTTGCCTGAGGAGGACGCTGTGTTG
GCTCTCCTTCTCTGTGTCTCGATCCTCAAAAGCGTGTCGGAAATGAAGAAAGAAGATGTTGGGAAGTTGTTGATCCGAAGGAGATTGAGGGAGACAACAATTGGCCTTAGAGATTGGGGCTCCATAATGCTTCACCCATCTTCGTCGCCTTATCTTCAACCGTGGTATTGGAATGCCGAGACCGTGATGGCATCGAACAACATCGACCACCTCTCAAGACAGCGTGCATCAAGCTACTTGCCCGTGGAGGGTGGAGATAAGCTGTACAAGCAAGGAATAATATCATAGAATGATAAAAAATGAGGAAACAACATGTTTTGTTGCTCACTCATTTTTCCTTTCTTTGCTCTATAAATTAAGTACATCAAGGTGTGTTTGTACAGCTATTTTTTTAGTTTCAATTTTATTGTCTGTAAAAATGATTAATCATTAGCTCCACCCTCCCGGCTTGCTCGAACATATCAATTGATACTGACACACGTTTC

Coding sequence (CDS)

ATGTCCTCCCACCGTCCCGGTTCCGATCTTTCTGCCTCCGTGTCGGCGAGAAGCCTCGGCGACATCTCCCAACTCACTACCATCCGAATCGGCGTCGACATTATTTCGGCCGTCCGGCGGAATCTCGCCTTTCTCAGAACCGTCCTCCATTCCCACTGGCTCCACTCTGAGCCCACTCTCACTCAAGCCATCAGAAGGTATGAGGAGCTCTGGATGCCGTTGATTTCTGATCTAACGGTTGATGGGGCTTCTCCTCCGATGATTCTTCCTCCTTTGGATGTTGAATGGGTTTGGTTCTGTCACACGTTGAATCCGGTTCGTTACAAACATTACTGTGAGTCCAGATTCTCTAAGCTTATTGGGAAACCATCGATTTTTGATGAAGAAAATGAAGAATATGCATATATGAGGTGCCAAGAGATATGGGTCAAGAGGTATCCAACCCAGTCTTTTGAAAATGAGGAAGAATCCAGTTTCAGAGATGATATTACTGTTGAAAATCAGGAGCTTTTGGAGGAAGTGATGAGGCAAAGGCACTTATATTCTAACTTTTTGGAGCCATATCAGTCCGAGATCGTTTACCTTATTGCAGCTAGACAAAGATACATGGGGTTTCTGTATATGTTGCAAAGGTTTTCTGATGAATGTTCTTCGTTTGTGCCTTCCTCTGATATTCTCCTCATGTGGCTTACCCATCAGAGTTATCCAACAGTTTATGCAGAAGATGTAAAGGAGATGCAGGGGGAGTTGGCAAAAGTGGTGAGATTTGGGGAAACTGTGAAGCCGAAGGAACTGGAAGAAACCAAGAAACTATGGCACAGAACTTTTGGGCAATCTTACGAGAAAGCTGGAGGAGAGGTAATCATGGAATTAGGCAGAGTTGTAACTTCTAAGCCTTTAGTTTACTGGGAAGTGTCCTTCTCGGATGTCAACTCGAAGTACAAGTCCATGACATCTCGGTTCATCCTCGAGGTTTCTGTGTTTATGCGGCTCAAAGATCAAAAACAGCCATTGCAACAAGTTGCTTCTCGAGAATTTCTTCGTCTACGCACTCTAAGATGCCATAAGGAATTTAAGCTCAAACAACCAATCTCGAGCCTCGACAATGATGTTTGGATCAAAGCTTGGCATCTCTATTGTGAATTTGGTACCAAGGGAGTTGTTATCGAGCTTCGGCATCCTGGTAGCCATTGTTTCAAAGGAAGTAGCACTAAAGACACCACCACATTCAAGTGGAATGACTTGATAAGAGCACCGTCTCTTACTTTAGAAAGACAATTCGATCAGAATCTCAAGATAGTTGCCTCAATAACTCCACCCGTTCAAGCACCGTACTTGTTAAAATGTGTGCCAGACAAAGTGACAGATGATTCGGGGGCAATGGTTTCGGATGTTGTTCTTAGAATGAACCAATACCGGCCTCAAGAAGGTCGGTGGCTATCTCGAACTGTACTCGACCATGGAGGGAGAGAGTGTTTTGTCATTCGTTTAAGAGTTGGAGCGGGATTTTGGAGGAGAGGAGGTGAAACTCCTTTGCCTGTGAAATGGGAGGACCGAATTATCGAGATTCGAGAAGGTTGTTGGTCCTATATTGCTGGCTCCATCGGCAGATGCCCGGAGAAATTGGTGGGAACAGCAACACCAAAACAGCCACTAGAGGAATGGAAAGCTGCTTGGAATTTTTCAACTGGAGATGAACTCATAATCCAATGGGACACTTCAACACCAAAACCAGCCCTTACCTTTTCCTTAACAAATCCAACTTCTTCAGAATCATCTGTAAGGCTTCTAAAAGGTCGCCAAATGGTATATCGAGTACAGAGAGACAATGGCAGAGAGGAGGAAGAAGAGGAAGGCGGAGATGGAAATGGGTTTGTAACGGTGATTCGATATACCAATGAGGACCCAACTGGAAGAGCAACAGCTCTTCTAAACTGGAAGTTGCTGGTGATCGAGTTGTTGCCTGAGGAGGACGCTGTGTTGGCTCTCCTTCTCTGTGT
CTCGATCCTCAAAAGCGTGTCGGAAATGAAGAAAGAAGATGTTGGGAAGTTGTTGATCCGAAGGAGATTGAGGGAGACAACAATTGGCCTTAGAGATTGGGGCTCCATAATGCTTCACCCATCTTCGTCGCCTTATCTTCAACCGTGGTATTGGAATGCCGAGACCGTGATGGCATCGAACAACATCGACCACCTCTCAAGACAGCGTGCATCAAGCTACTTGCCCGTGGAGGGTGGAGATAAGCTGTACAAGCAAGGAATAATATCATAG

Protein sequence

MSSHRPGSDLSASVSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRRYEELWMPLISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIFDEENEEYAYMRCQEIWVKRYPTQSFENEEESSFRDDITVENQELLEEVMRQRHLYSNFLEPYQSEIVYLIAARQRYMGFLYMLQRFSDECSSFVPSSDILLMWLTHQSYPTVYAEDVKEMQGELAKVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPLVYWEVSFSDVNSKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEFKLKQPISSLDNDVWIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWNDLIRAPSLTLERQFDQNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWLSRTVLDHGGRECFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRCPEKLVGTATPKQPLEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLKGRQMVYRVQRDNGREEEEEEGGDGNGFVTVIRYTNEDPTGRATALLNWKLLVIELLPEEDAVLALLLCVSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIMLHPSSSPYLQPWYWNAETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGIIS
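
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, assuming the standard nuclear genetic code; the function name `translate` is hypothetical, and codons containing ambiguous bases such as N are reported as X.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last five codons of the CDS listed above.
print(translate("ATGTCCTCCCACCGT"))
print(translate("GGAATAATATCATAG"))
```

The two fragments translate to MSSHR and GIIS, which match the start and end of the protein sequence shown above.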
BLAST of Cp4.1LG04g00020 vs. Swiss-Prot
Match: GRDP1_ARATH (Glycine-rich domain-containing protein 1 OS=Arabidopsis thaliana GN=GRDP1 PE=2 SV=1)

HSP 1 Score: 144.4 bits (363), Expect = 4.9e-33
Identity = 85/276 (30.80%), Positives = 145/276 (52.54%), Query Frame = 1

Query: 28  IRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRRYEELWMPLISDLTVDGA-SPP 87
           I I VD+++A +++L FL TV  + WL+  P L +AI RY   W+PL+   +   + S  
Sbjct: 17  IEISVDLLAAAKQHLLFLETVDRNRWLYDGPALEKAIYRYNACWLPLLVKYSESSSVSEG 76

Query: 88  MILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIFDEENEEYAYMRCQEIWVKRY 147
            ++PPLD EW+W CH LNPVRY   CE  + +++    +    +     ++ +++W + Y
Sbjct: 77  SLVPPLDCEWIWHCHRLNPVRYNSDCEQFYGRVLDNSGVLSSVDGN-CKLKTEDLWKRLY 136

Query: 148 PTQSFENEEESSFRDDITVENQ--------ELLEEVMRQRHLYSNFLEPYQSEIVYLIAA 207
           P + +E + ++   +DI+ ++         +L+  V RQ   Y      + +  ++L  A
Sbjct: 137 PDEPYELDLDNIDLEDISEKSSALEKCTKYDLVSAVKRQSPFYYQVSRSHVNSDIFLQEA 196

Query: 208 RQRYMGFLYMLQRFSDECSS--FVPSSDILLMWLTHQSYPTVYAEDVKEMQGELAKVVRF 267
             RY GFLY+++   +       VP+ D+ L+W THQ +P  Y +D+ ++ G   KV+  
Sbjct: 197 VARYKGFLYLIKMNRERSLKRFCVPTYDVDLIWHTHQLHPVSYCDDMVKLIG---KVLEH 256

Query: 268 GET----VKPKELE----ETKKLWHRTFGQSYEKAG 285
            +T     K K+L+    +T   W  TFG  Y KAG
Sbjct: 257 DDTDSDRGKGKKLDTGFSKTTAQWEETFGTRYWKAG 288
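
The percentages in the HSP header above are the match counts divided by the alignment length (gaps included). A minimal sketch, with the hypothetical helper name `hsp_percent`, reproduces the figures reported for the GRDP1_ARATH hit:

```python
def hsp_percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage string with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / alignment_length:.2f}%"


# Values taken from the HSP header: 85 identical and 145 positive
# positions over an alignment of 276 columns.
identity = hsp_percent(85, 276)
positives = hsp_percent(145, 276)
print(identity, positives)
```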

BLAST of Cp4.1LG04g00020 vs. Swiss-Prot
Match: GRDP2_ARATH (Glycine-rich domain-containing protein 2 OS=Arabidopsis thaliana GN=GRDP2 PE=2 SV=1)

HSP 1 Score: 133.3 bits (334), Expect = 1.1e-29
Identity = 89/291 (30.58%), Positives = 141/291 (48.45%), Query Frame = 1

Query: 28  IRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRRYEELWMPLISDLTVDGA-SPP 87
           I I VD+++A +++L FL  V  +  L+  P L +AI RY   W+PL++  T   +    
Sbjct: 17  IDISVDLLAAAKKHLLFLGAVDRNRCLYDGPALQRAIYRYNAYWLPLLAQYTESSSICQG 76

Query: 88  MILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIFDEENEEYAYMRCQEIWVKRY 147
            ++PPLD EWVW CH LNPVRYK  CE  + +++    +    N      + + +W + Y
Sbjct: 77  PLVPPLDCEWVWHCHRLNPVRYKTDCEQFYGRVLDNSGVVSSVNGN-CKSQTETLWKRLY 136

Query: 148 PTQSFENEEESSFRDDITVE------NQELLEEVMRQRHLYSNFLEPYQSEIVYLIAARQ 207
           PT+ ++ +  ++  +   V         +L+  V RQ   +      +    V+L  A  
Sbjct: 137 PTEPYDLDFANAISEPADVSALEKCTTYDLVLAVKRQSPFFYQVSRAHVDNDVFLQEAVA 196

Query: 208 RYMGFLYMLQRFSDECSSF--VPSSDILLMWLTHQSYPTVYAEDVKEMQGELAKVVRFGE 267
           RY  FLY+++   +       VP+ DI L+W THQ +   Y  D+ +M G   KV+   +
Sbjct: 197 RYKAFLYLIKGNRERSIKLFCVPTYDIDLIWHTHQLHAISYCNDLTKMIG---KVLEHDD 256

Query: 268 T----VKPKELEE----TKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPLV 302
           T     K K+L+     T   W  TFG+ Y KAG        + VT+ P V
Sbjct: 257 TDSDRSKGKKLDTGFSGTTAQWEETFGRRYWKAGAMNRGNTPKPVTTSPYV 303

BLAST of Cp4.1LG04g00020 vs. TrEMBL
Match: A0A0A0M1H5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G569190 PE=4 SV=1)

HSP 1 Score: 1296.6 bits (3354), Expect = 0.0e+00
Identity = 645/768 (83.98%), Positives = 694/768 (90.36%), Query Frame = 1

Query: 1   MSSHRPGSDLSASVSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTL 60
           MSS R GS   AS+SARSLGDIS+ +T RIG+DIISAVRRNL FLRTV  SHWLHSEPT+
Sbjct: 1   MSSDRLGS---ASISARSLGDISEFSTTRIGLDIISAVRRNLGFLRTVADSHWLHSEPTI 60

Query: 61  TQAIRRYEELWMPLISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLI 120
           T+AIRRYEELWMPLISDL V G+SPPMILPPLDVEWVWFCHTLNPV YKHYCE+RFSK+I
Sbjct: 61  TEAIRRYEELWMPLISDLMVAGSSPPMILPPLDVEWVWFCHTLNPVGYKHYCETRFSKII 120

Query: 121 GKPSIFDEENEEYAYMRCQEIWVKRYPTQSFENEEESSFRDDITVENQELLEEVMRQRHL 180
           GKPSIFDEENEEYAYMRC+EIWVK+YPTQSFE EE SS RD ITVENQELLEEV RQR+L
Sbjct: 121 GKPSIFDEENEEYAYMRCKEIWVKKYPTQSFELEESSSLRDVITVENQELLEEVKRQRNL 180

Query: 181 YSNFLEPYQSEIVYLIAARQRYMGFLYMLQRFSDECSSFVPSSDILLMWLTHQSYPTVYA 240
           YS F EP++SEIVYLIAA+QRY GFLYMLQRFSDECSSFVP+SDILLMWLTHQSYPTVYA
Sbjct: 181 YSKFSEPFRSEIVYLIAAKQRYKGFLYMLQRFSDECSSFVPASDILLMWLTHQSYPTVYA 240

Query: 241 EDVKEMQGELAKVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPL 300
           EDVKEMQG+LAKVVRFGETV  KEL+ETK+LWHRTFGQ YEKAGG +IMELGRVVTS PL
Sbjct: 241 EDVKEMQGDLAKVVRFGETVNSKELDETKQLWHRTFGQPYEKAGGGIIMELGRVVTSNPL 300

Query: 301 VYWEVSFSDVNSKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEFKL 360
           VY E S  DVN+KYKSMTSRFILEV VFM  K QK+PLQQV S+EFLRLR+LRCH+EFKL
Sbjct: 301 VYLETSHLDVNTKYKSMTSRFILEVCVFMWHKAQKRPLQQV-SQEFLRLRSLRCHREFKL 360

Query: 361 KQPISSLDNDVWIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWNDLIRAPS 420
            QPISSL+ND+W KAWHL CEFGTKGV++ELRHP  HCFKGSS K+TTTFKWNDLIRAPS
Sbjct: 361 DQPISSLNNDLWHKAWHLCCEFGTKGVILELRHPSGHCFKGSSIKETTTFKWNDLIRAPS 420

Query: 421 LTLERQFDQNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWL 480
           LTLERQ + NLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWL
Sbjct: 421 LTLERQLNHNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWL 480

Query: 481 SRTVLDHGGRECFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRCPE 540
           SRTVLDHGGRECFVIR+RVG GFWRRGGETPLPVKWEDRIIEIREG WSYIAGSIGR PE
Sbjct: 481 SRTVLDHGGRECFVIRMRVGGGFWRRGGETPLPVKWEDRIIEIREGSWSYIAGSIGRSPE 540

Query: 541 KLVGTATPKQPLEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLKGR 600
           K+VGTATPKQPLEE KAAWNFSTGDELIIQWDTST +P+L+FSLTNP +SESSVRLLKGR
Sbjct: 541 KVVGTATPKQPLEELKAAWNFSTGDELIIQWDTSTTEPSLSFSLTNP-ASESSVRLLKGR 600

Query: 601 QMVYRV-------QRDNGREEEEEEGGDGNGFVTVIRYTNEDPTGRATALLNWKLLVIEL 660
           Q +Y V       Q D   +EEE EGGD +GFVT+IRYT+EDPTGRATALLNWKLLVIEL
Sbjct: 601 QKLYHVWRKVKEPQHDGNIQEEENEGGDDDGFVTMIRYTDEDPTGRATALLNWKLLVIEL 660

Query: 661 LPEEDAVLALLLCVSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIMLHPSS---- 720
           LPEEDAVLALL+CVSIL+S+SEMKKEDVG LLIRRRLRET IGLRDWGSIMLHPS     
Sbjct: 661 LPEEDAVLALLICVSILRSISEMKKEDVGNLLIRRRLRETKIGLRDWGSIMLHPSKNSTT 720

Query: 721 -SPYLQPWYWNAETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGIIS 757
            SPYL+PWYWNAETVMASN+++HL RQ ASSYLPVEGGDKLYKQGIIS
Sbjct: 721 PSPYLRPWYWNAETVMASNSVEHLMRQPASSYLPVEGGDKLYKQGIIS 763

BLAST of Cp4.1LG04g00020 vs. TrEMBL
Match: M5W894_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001832mg PE=4 SV=1)

HSP 1 Score: 1047.0 bits (2706), Expect = 1.1e-302
Identity = 501/761 (65.83%), Positives = 612/761 (80.42%), Query Frame = 1

Query: 14  VSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRRYEELWMP 73
           +SARS  +IS++   ++G+D++SA RRN+ FLRTV  S WLH +PT+ +AIRRY ELWMP
Sbjct: 1   MSARSFSEISEVENFKVGLDLVSAARRNIGFLRTVAESQWLHQQPTVIEAIRRYNELWMP 60

Query: 74  LISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIFDEENEEY 133
           L+SDLTV+  +PP I PP+D+EWVWFCHTLNPV Y+ YCES+FSKLIGK +IFDEENEEY
Sbjct: 61  LVSDLTVESTTPPAIHPPIDIEWVWFCHTLNPVYYRQYCESKFSKLIGKATIFDEENEEY 120

Query: 134 AYMRCQEIWVKRYPTQSFENE--EESSFRDDITVENQELLEEVMRQRHLYSNFLEPYQSE 193
           A MRC+E+WV+RYP + FENE   +S  R       +ELLEEV + R L+S F EPY++E
Sbjct: 121 ALMRCRELWVRRYPNEPFENEVDSDSDVRVPEAANEEELLEEVKKNRFLHSKFSEPYRAE 180

Query: 194 IVYLIAARQRYMGFLYMLQRFSDECSSFVPSSDILLMWLTHQSYPTVYAEDVKEMQGELA 253
           IVYLIAA+QRY  FL+M+Q   D CSS VP+SDI+LMWL+HQSYPTVYAED+KEM+G+L 
Sbjct: 181 IVYLIAAKQRYKRFLFMVQSTIDLCSSLVPASDIMLMWLSHQSYPTVYAEDLKEMEGDLG 240

Query: 254 KVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPLVYWEVSFSDVN 313
           KVV    TVK KE+EETKKLW RTF Q YEKAGGE+ +EL   V+ KP VYWEVS +DVN
Sbjct: 241 KVVSMWATVKEKEVEETKKLWERTFDQPYEKAGGEIALELDGGVSFKPTVYWEVSDTDVN 300

Query: 314 SKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEFKLKQPISSLDNDV 373
           +KYK M  RF+LEV VF+RL+D+ + +Q+   R  LRLR +RCH+E KL++P+S   +  
Sbjct: 301 TKYKPMHPRFLLEVCVFVRLRDKMKEMQEDMKRNVLRLRMVRCHRELKLEKPVSDFPHSS 360

Query: 374 WIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWNDLIRAPSLTLERQFDQNL 433
           W KAWHLYCEFGTKGV+ E+R  G  CFKGSS ++T TF WNDL+RAPSLTLE++ DQ +
Sbjct: 361 WRKAWHLYCEFGTKGVIFEIRKRGGSCFKGSSVQETVTFHWNDLLRAPSLTLEKE-DQQV 420

Query: 434 KIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWLSRTVLDHGGRE 493
           KIVASITPPVQAPYLLKCVPD+VTDDSGAM+SD++LRMNQYRPQEGRWLSRTVLDH GR+
Sbjct: 421 KIVASITPPVQAPYLLKCVPDRVTDDSGAMISDLILRMNQYRPQEGRWLSRTVLDHAGRD 480

Query: 494 CFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRCPEKLVGTATPKQP 553
           CFVIR+RVGAGFWRRGGETP  VKWEDRIIEIREG WSY+AGSIGR P KLVGTA PK+P
Sbjct: 481 CFVIRIRVGAGFWRRGGETPSAVKWEDRIIEIREGSWSYVAGSIGRAPVKLVGTAIPKEP 540

Query: 554 LEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLKGRQMVYRVQR--- 613
            E+WKAAWNFSTGDEL+IQW+ S+ K  L+F L N  ++ES+V+LLKGR+M Y+V++   
Sbjct: 541 PEQWKAAWNFSTGDELMIQWELSSSKSGLSFGLKN-QAAESTVKLLKGRKMQYQVKKKKS 600

Query: 614 -------DNGRE-EEEEEGGDGNGFVTVIRYTNEDPTGRATALLNWKLLVIELLPEEDAV 673
                   N  E EEEEE  +  GF+T++RYT ++P GRATALLNWKLLV EL+PEEDAV
Sbjct: 601 VTKDEECQNEEEGEEEEEDEEEEGFLTLVRYTEDNPNGRATALLNWKLLVAELMPEEDAV 660

Query: 674 LALLLCVSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIMLHPS-----SSPYLQP 733
           L LLLC+SIL+SVSEMKKEDVG LLIRRRL+E  +G RDWGS++LHPS     SSPYLQP
Sbjct: 661 LVLLLCISILRSVSEMKKEDVGCLLIRRRLKEVKLGTRDWGSVVLHPSSSSSISSPYLQP 720

Query: 734 WYWNAETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGIIS 757
           WYWNA+ ++AS+   H++RQ + SY P EGGDK YK+GI++
Sbjct: 721 WYWNAKAIIASDGAGHITRQPSISYSPEEGGDKFYKRGILA 759

BLAST of Cp4.1LG04g00020 vs. TrEMBL
Match: V4V486_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10003191mg PE=4 SV=1)

HSP 1 Score: 1025.4 bits (2650), Expect = 3.5e-296
Identity = 489/756 (64.68%), Positives = 609/756 (80.56%), Query Frame = 1

Query: 7   GSDLSASVSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRR 66
           GS++S +++ R+L  IS++ TIR+GVD++SA R+N+ FLRTV  S WLH  PT+ +AIRR
Sbjct: 5   GSEISDNLTTRTLSGISEVDTIRLGVDLVSATRKNIGFLRTVNESQWLHERPTILEAIRR 64

Query: 67  YEELWMPLISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIF 126
           YE LWMPL+SDLTV GA PPMILPP+D+EWVWFCH+LNPVRY+ YCESRFSKLIGKP+IF
Sbjct: 65  YEGLWMPLMSDLTV-GAPPPMILPPVDIEWVWFCHSLNPVRYRQYCESRFSKLIGKPAIF 124

Query: 127 DEENEEYAYMRCQEIWVKRYPTQSFENEEESSFRDDITVENQELLEEVMRQRHLYSNFLE 186
           DEENEEYA MRC+EIW  +YP + FENE +S   + I V N+++L EV RQR LYS F E
Sbjct: 125 DEENEEYALMRCREIWEHKYPYEPFENEVDSDSENPICVTNEDILNEVKRQRFLYSKFSE 184

Query: 187 PYQSEIVYLIAARQRYMGFLYMLQRFSDECSSFVPSSDILLMWLTHQSYPTVYAEDVKEM 246
           PY  E+VYLIAARQRY GFLY+LQ+FSD CS FVP+SDI LMWLTH SYPTVYAED+K+M
Sbjct: 185 PYMCELVYLIAARQRYKGFLYILQKFSDGCSLFVPASDIQLMWLTHLSYPTVYAEDLKDM 244

Query: 247 QGELAKVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPLVYWEVS 306
             ++ KVV     VK K++EETKK+W +TF   YEKAGG + +E   + + KP ++W VS
Sbjct: 245 WDDMGKVVGVWGNVKAKDVEETKKIWEKTFDLPYEKAGGGLALEFDGIASVKPPIFWNVS 304

Query: 307 FSDVNSKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEFKLKQPISS 366
            +DVNSKYKSM  RF+LEV +F++LK   + +QQ    +FLRLR +RCH+E KL +PIS+
Sbjct: 305 DTDVNSKYKSMLPRFLLEVCIFLKLKSGMKAMQQDIKCDFLRLRMVRCHRELKLGKPISN 364

Query: 367 LDNDVWIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWNDLIRAPSLTLERQ 426
             ++ W+K WHLYCEFGTKG+++ELRHPG  CFKGS+ + T  F+WN+L+RAPSLT+ER+
Sbjct: 365 FSHNSWLKVWHLYCEFGTKGLILELRHPGGACFKGSTLQGTVEFRWNNLLRAPSLTMERE 424

Query: 427 FDQNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWLSRTVLD 486
            +Q  ++V SITPPVQA YLLKCVPD+VTDDSGAM+SDV+LR+N+YRPQEGRWLSRTVLD
Sbjct: 425 IEQ-FRVVISITPPVQAQYLLKCVPDRVTDDSGAMISDVILRLNRYRPQEGRWLSRTVLD 484

Query: 487 HGGRECFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRCPEKLVGTA 546
           H GRECFV+R+RVG GFWRRGGETP  VKWEDRIIEIREG WSY+AGSIGR PEK+VGTA
Sbjct: 485 HAGRECFVVRIRVGGGFWRRGGETPSAVKWEDRIIEIREGFWSYVAGSIGRAPEKVVGTA 544

Query: 547 TPKQPLEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLKGRQMVYRV 606
           TPK+   E +AAW+FSTGDEL+I W++S+    L F+L N  S +S + LL+GR+M Y+ 
Sbjct: 545 TPKEATAECQAAWDFSTGDELMINWESSSSTSGLKFTLKNAASPDSLLVLLRGRKMQYQ- 604

Query: 607 QRDNGREEEEEEGGDGNGFVTVIRYTNEDPTGRATALLNWKLLVIELLPEEDAVLALLLC 666
            R+    E+E E  D  GFVT+IR+T+E+PTG+ATALLNWKLLVIELLPEEDAVLALLLC
Sbjct: 605 GRELSEVEKEAEEEDDEGFVTLIRFTDENPTGKATALLNWKLLVIELLPEEDAVLALLLC 664

Query: 667 VSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIMLHPSS-------SPYLQPWYWN 726
            SIL+S+SEM+KEDVG LLIRRR++ET +G RDWGS++LHPSS       SPY+QPWYWN
Sbjct: 665 FSILRSISEMRKEDVGGLLIRRRIKETKLGHRDWGSVILHPSSLSSSSSTSPYIQPWYWN 724

Query: 727 AETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGII 756
           A+ VMA++  D++ R  A +Y P EGGDKLYK+GII
Sbjct: 725 AKAVMAAST-DNIRRPPAQNYSPAEGGDKLYKRGII 756

BLAST of Cp4.1LG04g00020 vs. TrEMBL
Match: B9I7B2_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0013s01450g PE=4 SV=1)

HSP 1 Score: 1001.5 bits (2588), Expect = 5.5e-289
Identity = 491/780 (62.95%), Positives = 612/780 (78.46%), Query Frame = 1

Query: 1   MSSHRPGSDLSASV-SARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPT 60
           MS+ R  +  S+ V S RSL +IS++ T+R+ VD++SA R+NL  LRTV  S WLH   T
Sbjct: 1   MSATRNRTHESSDVLSTRSLSEISEVETVRLSVDLVSASRKNLGLLRTVSESPWLHERAT 60

Query: 61  LTQAIRRYEELWMPLISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKL 120
           + +AIRRY+ELWMPLISDL ++G+SPPM+LPPLDVEWVWFCHTLNPV Y+ YCE RFSKL
Sbjct: 61  ILEAIRRYDELWMPLISDL-MEGSSPPMVLPPLDVEWVWFCHTLNPVSYRKYCEKRFSKL 120

Query: 121 IGKPSIFDEENEEYAYMRCQEIWVKRYPTQSFENEEE--SSFRDDITV--ENQELLEEVM 180
           IGKP+IF +ENEEY+ MRC+E+W+KRYP +SFENE +  SS   D+ V  ++++LL EV 
Sbjct: 121 IGKPAIFYKENEEYSLMRCEELWMKRYPNESFENEVDITSSNLQDLHVAQDHEDLLNEVE 180

Query: 181 RQRHLYSNFLEPYQSEIVYLIAARQRYMGFLYMLQRFSDECSS-FVPSSDILLMWLTHQS 240
           +QRH+YS F  PY SEIVYLIAARQRY GFLY+LQRF+D+CSS  +PS DILLMW+THQS
Sbjct: 181 KQRHVYSKFSWPYMSEIVYLIAARQRYKGFLYVLQRFADDCSSRLLPSLDILLMWVTHQS 240

Query: 241 YPTVYAEDVKEMQGELAKVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRV 300
           YPTVYAED+KEM+G++ K+V   ETV+ KE+EETKKLW R F Q Y KAGG +  E G V
Sbjct: 241 YPTVYAEDLKEMEGDMGKIVGLWETVRSKEVEETKKLWERAFDQPYVKAGGAI--EFGGV 300

Query: 301 VTS-KPLVYWEVSFSDVNSKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLR 360
            +  KP VYWEVS +DVN+KYKS+  RF+LEV VF+RL  + +P+QQ     FLRL+ +R
Sbjct: 301 ASIVKPPVYWEVSDTDVNTKYKSLLPRFLLEVCVFVRLNSRMKPVQQERQHNFLRLQLVR 360

Query: 361 CHKEFKLKQPISSLDNDVWIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWN 420
           CH+E K+ +PISS  +D W K  HLYCEFGT+G+++E+R  G  CFK S  +D+ TF WN
Sbjct: 361 CHRELKIDKPISSFSSDTWKKVTHLYCEFGTRGLMLEVRKHGGGCFKTSKLEDSKTFLWN 420

Query: 421 DLIRAPSLTLERQFD-QNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQY 480
           DL+RAPSLTLE   D +  + VASITPP QAPYLLKCVPDKVTDDSGAMVSDV+LRMN Y
Sbjct: 421 DLLRAPSLTLETHLDDKQARAVASITPPAQAPYLLKCVPDKVTDDSGAMVSDVILRMNNY 480

Query: 481 RPQEGRWLSRTVLDHGGRECFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIA 540
           +PQEGRWLSRTVLDH GRECFV+R+RV  GFWRRG ETP  VKWEDRIIEIREG WSY+A
Sbjct: 481 KPQEGRWLSRTVLDHAGRECFVVRMRVAGGFWRRGDETPSAVKWEDRIIEIREGSWSYVA 540

Query: 541 GSIGRCPEKLVGTATPKQPLEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSES 600
           GSIGR PEK+VGTATP++P E W+AAW FSTGDEL+I W++S     L F L N  SS+S
Sbjct: 541 GSIGRAPEKIVGTATPREPPEHWQAAWCFSTGDELLISWESSASMSDLNFCLRNQKSSDS 600

Query: 601 SVRLLKGRQMVYRVQRDNGR----------EEEEEEGGDGNGFVTVIRYTNEDPTGRATA 660
            V+LLKG++M YR ++ + +          EE +EE  D  GF+T++R+T ++P GR TA
Sbjct: 601 LVKLLKGKKMQYRARKISSKSKEHEKRENTEETDEEDEDEEGFLTLVRFTEDNPIGRPTA 660

Query: 661 LLNWKLLVIELLPEEDAVLALLLCVSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGS 720
           LLNWKLL++ELLPEEDAV  LLLC+SIL+S+SEM+KEDVG LLIRRRL+E  +G RDWGS
Sbjct: 661 LLNWKLLIVELLPEEDAVFVLLLCISILRSISEMRKEDVGSLLIRRRLKEAKLGARDWGS 720

Query: 721 IMLHPS------SSPYLQPWYWNAETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGIIS 757
           ++LHPS      SSPYLQPWYWNA++V+A +  D++++Q A S+ PVEGGDKLYK+GI++
Sbjct: 721 VILHPSSFSSTISSPYLQPWYWNAKSVIAPDGGDNVTKQPAVSHSPVEGGDKLYKKGIMA 777

BLAST of Cp4.1LG04g00020 vs. TrEMBL
Match: A0A061F3C6_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_026691 PE=4 SV=1)

HSP 1 Score: 983.8 bits (2542), Expect = 1.2e-283
Identity = 480/762 (62.99%), Positives = 587/762 (77.03%), Query Frame = 1

Query: 8   SDLSASVSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRRY 67
           S  S+S   RSL +IS+  T+ + VD++SA RRN+ FLR+V   HWLH   T+ +AIRRY
Sbjct: 11  SSSSSSSGVRSLSEISEKDTVHLSVDLVSAARRNIGFLRSVNECHWLHQRATIVEAIRRY 70

Query: 68  EELWMPLISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIFD 127
           EE+WMPLISDLTV G++PPM+LPP DVEWVWFCHTLNPV Y+ YCESRFSKLIGKP+IF+
Sbjct: 71  EEVWMPLISDLTVVGSTPPMVLPPFDVEWVWFCHTLNPVAYRKYCESRFSKLIGKPAIFN 130

Query: 128 EENEEYAYMRCQEIWVKRYPTQSFENEEESSFRDDITVENQELLEEVMRQRHLYSNFLEP 187
           EENEEYA MRC+EIWV+R+  + FENE ES  +D   + NQ+L  +V   + LYS F EP
Sbjct: 131 EENEEYALMRCREIWVQRHEFEPFENEVESDSQDPPGI-NQDLFNQVKEHKFLYSKFSEP 190

Query: 188 YQSEIVYLIAARQRYMGFLYMLQRFSDECSSFVPSSDILLMWLTHQSYPTVYAEDVKEMQ 247
           Y  E+VYLIAARQRY GFLYM+QRF D C  FVP+ DI+LM LTHQSYPTVY ED+K+  
Sbjct: 191 YFCELVYLIAARQRYRGFLYMMQRFGDGCLRFVPALDIVLMLLTHQSYPTVYVEDLKDKW 250

Query: 248 GELAKVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPLVYWEVSF 307
            ++ KVV   ETVK KE+EE+K LW RTF Q YEKAGG + +EL  +   +P +YWEVS 
Sbjct: 251 DDMGKVVGLWETVKEKEVEESKNLWERTFDQPYEKAGGGLAVELDNLKAKRP-IYWEVSD 310

Query: 308 SDVNSKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEFKLKQPISSL 367
            DVN+KYKSM  RF+LEV VF+RL D+ +        +FLRLR +RCH+E KL + IS+ 
Sbjct: 311 VDVNTKYKSMIPRFLLEVCVFVRLNDRTKVSNGDTKHKFLRLRAVRCHRELKLDELISNF 370

Query: 368 DNDVWIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWNDLIRAPSLTLERQF 427
             D W KAWHLYCEFGT+G+++ELR  G  CFKGS + D+  F WNDL+RAPS+TL R+ 
Sbjct: 371 SYDSWRKAWHLYCEFGTRGLMVELRGRGGRCFKGSKSLDSMPFYWNDLLRAPSITLSRKV 430

Query: 428 DQNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWLSRTVLDH 487
           DQ ++IVASITPPVQAPYLLKCVPD+VTDDSGAM+SDV+L++N YRPQ+GRWLSRTVLDH
Sbjct: 431 DQ-VRIVASITPPVQAPYLLKCVPDRVTDDSGAMISDVILKLNNYRPQKGRWLSRTVLDH 490

Query: 488 GGRECFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRCPEKLVGTAT 547
            GRECFV+R+RVG GFWRRG ETP  V WEDRIIEIREG WSY+AGSIGR PEK+VGTAT
Sbjct: 491 AGRECFVVRIRVGGGFWRRGAETPSAVNWEDRIIEIREGSWSYVAGSIGRAPEKVVGTAT 550

Query: 548 PKQPLEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLKGRQMVYR-- 607
           PK+  E+W+AAW FSTGDEL+I W +ST    L+F L    S +SSV LL+GR+M Y+  
Sbjct: 551 PKESPEQWQAAWEFSTGDELLINWGSSTSSSGLSFCLKTQESFDSSVMLLRGRKMQYQDK 610

Query: 608 -----VQRDNGREEEEEEGGDGNGFVTVIRYTNEDPTGRATALLNWKLLVIELLPEEDAV 667
                 +    R+EE  +  D + +VT++R+T E+PTGRATALLNWKLLV+ELLPEEDAV
Sbjct: 611 VAGCAAKETKTRQEEYAKEAD-DEYVTLVRFTEENPTGRATALLNWKLLVVELLPEEDAV 670

Query: 668 LALLLCVSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIMLHPSS------SPYLQ 727
           L LLLCVSIL++VSEM+KEDVG LLIRRRL+E  +G RDWGS++LH SS      SP LQ
Sbjct: 671 LVLLLCVSILRTVSEMRKEDVGSLLIRRRLKEAKLGARDWGSVVLHTSSLPSSIASPCLQ 730

Query: 728 PWYWNAETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGIIS 757
           PWYWNA  VMA +  + ++RQ AS+Y PVEGGD LYK+GII+
Sbjct: 731 PWYWNANKVMAQHEGNSITRQPASNYSPVEGGDMLYKRGIIT 768

BLAST of Cp4.1LG04g00020 vs. TAIR10
Match: AT1G56230.1 (AT1G56230.1 Protein of unknown function (DUF1399))

HSP 1 Score: 805.1 bits (2078), Expect = 3.8e-233
Identity = 413/761 (54.27%), Positives = 530/761 (69.65%), Query Frame = 1

Query: 8   SDLSASVSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRRY 67
           S+    V+ARSL +IS++  +RIG DIIS+ RR +A LR+V    WLH  P + +AIRRY
Sbjct: 6   SEFVDGVAARSLSEISEVDAVRIGGDIISSARRLIALLRSVGDCQWLHHPPVIAEAIRRY 65

Query: 68  EELWMPLISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIFD 127
           +ELWMPLISDLTV G  PPMILPPLDVEWVWFCH LNPV Y  YCE RFSKLIGKP+I+D
Sbjct: 66  DELWMPLISDLTV-GLKPPMILPPLDVEWVWFCHCLNPVSYSDYCERRFSKLIGKPAIYD 125

Query: 128 EENEEYAYMRCQEIWVKRYPTQSFENEEESSFRDDITVENQELLEEVMRQRHLYSNFLEP 187
           EENE+YA ++C++IW  RYP +SFEN  +    + +++ N+++   V +Q  L+  F  P
Sbjct: 126 EENEDYAVLQCEKIWSLRYPLESFENRADPDSLETVSLVNEDIKSLVKKQMFLWEKFSAP 185

Query: 188 YQSEIVYLIAARQRYMGFLYMLQRFSDECSSFVPSSDILLMWLTHQSYPTVYAEDVKEMQ 247
           Y SE VYLIAAR RY GFL +L +F DE SS +P+SDILLMWLTHQSYPTVY +DV EM 
Sbjct: 186 YMSETVYLIAARLRYKGFLLILHKFKDEVSSLIPASDILLMWLTHQSYPTVYKDDVDEML 245

Query: 248 GELA-KVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEV-IMELGRVVTSKPLVYWEV 307
            E+  KVV+ GE V+  E+E TK+LW R F Q YEKAGGE+ I+     +++  + YW V
Sbjct: 246 EEMTRKVVQVGEKVEKTEVETTKELWDRYFNQPYEKAGGELSIIANESGLSNNTMFYWPV 305

Query: 308 SFSDVNSKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEFKLKQPIS 367
           S  DVN+ YKS+  RF+LE+ +F+RL  + +  + +  R FLRLR  RCH++ +L + ++
Sbjct: 306 SDMDVNTAYKSIRPRFVLELCIFLRLNPKAEQNESI-DRSFLRLRVARCHRKLQLDKKMT 365

Query: 368 SLDNDV-WIKAWHLYCEFGTKGVVIELRHPGSH--CFKGSSTKDTTTFKWNDLIRAPSLT 427
            L ++  W KAWHLYCEFGT G ++E     S   CFK    +    F WNDL+RA SL 
Sbjct: 366 DLSSEASWQKAWHLYCEFGTLGFILESHCDRSRGICFKSGKPEGMIEFPWNDLLRAHSLA 425

Query: 428 LERQFDQNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWLSR 487
             R   + + + AS+TPPVQAPYLL+ VPD+VTDDSGAM+SD V R N +RPQEGRWL+R
Sbjct: 426 SGRFLGKQVSVFASVTPPVQAPYLLRFVPDRVTDDSGAMISDSVQRTNNFRPQEGRWLTR 485

Query: 488 TVLDHGGRECFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRCPEKL 547
           TVLDH GRECFVIR+RVG G ++RGGE P PVK E+RI E+R G WSY+ GSIG+ P K+
Sbjct: 486 TVLDHAGRECFVIRIRVGKGVFKRGGEVPSPVKSEERITEVRVGSWSYVEGSIGKAPAKV 545

Query: 548 VGTATPKQPLEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLKGRQM 607
           VGT TPK+P+E+W+AAW FSTGDEL I+WD+      L     NP    S VRLL GR+M
Sbjct: 546 VGTVTPKEPMEDWEAAWEFSTGDELCIRWDSLGTISELRLYSRNP---GSLVRLLTGRRM 605

Query: 608 VYRVQRDNGREEEEEEGGDGNGFVTVIRYTNEDPTGRATALLNWKLLVIELLPEEDAVLA 667
            Y+     G +EE++E     GFVTV+R T EDPT +ATAL++WK   +E LPEEDAV  
Sbjct: 606 QYK-----GEDEEDDE-----GFVTVVRSTEEDPTEKATALIDWKHQAVEFLPEEDAVFV 665

Query: 668 LLLCVSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIM-------LHPSSSPYLQP 727
           LLL VSIL+SV+  ++EDVGKLL+R+R+ E T G RDWGS++       +  SSSPY++P
Sbjct: 666 LLLSVSILRSVTHKRREDVGKLLVRKRITEAT-GERDWGSVIVDASSTNVSSSSSPYVEP 725

Query: 728 WYWNAETVMASNNIDHLSR--QRASSYLPVEGGDKLYKQGI 755
           WY N+  VMA      ++R      SY  V+GGD LYK  I
Sbjct: 726 WYRNSGKVMAMEEKAQVARYPYPVMSYSNVDGGDNLYKHVI 750

BLAST of Cp4.1LG04g00020 vs. TAIR10
Match: AT2G22660.2 (AT2G22660.2 Protein of unknown function (duplicated DUF1399))

HSP 1 Score: 144.4 bits (363), Expect = 2.8e-34
Identity = 85/276 (30.80%), Positives = 145/276 (52.54%), Query Frame = 1

Query: 28  IRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRRYEELWMPLISDLTVDGA-SPP 87
           I I VD+++A +++L FL TV  + WL+  P L +AI RY   W+PL+   +   + S  
Sbjct: 17  IEISVDLLAAAKQHLLFLETVDRNRWLYDGPALEKAIYRYNACWLPLLVKYSESSSVSEG 76

Query: 88  MILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIFDEENEEYAYMRCQEIWVKRY 147
            ++PPLD EW+W CH LNPVRY   CE  + +++    +    +     ++ +++W + Y
Sbjct: 77  SLVPPLDCEWIWHCHRLNPVRYNSDCEQFYGRVLDNSGVLSSVDGN-CKLKTEDLWKRLY 136

Query: 148 PTQSFENEEESSFRDDITVENQ--------ELLEEVMRQRHLYSNFLEPYQSEIVYLIAA 207
           P + +E + ++   +DI+ ++         +L+  V RQ   Y      + +  ++L  A
Sbjct: 137 PDEPYELDLDNIDLEDISEKSSALEKCTKYDLVSAVKRQSPFYYQVSRSHVNSDIFLQEA 196

Query: 208 RQRYMGFLYMLQRFSDECSS--FVPSSDILLMWLTHQSYPTVYAEDVKEMQGELAKVVRF 267
             RY GFLY+++   +       VP+ D+ L+W THQ +P  Y +D+ ++ G   KV+  
Sbjct: 197 VARYKGFLYLIKMNRERSLKRFCVPTYDVDLIWHTHQLHPVSYCDDMVKLIG---KVLEH 256

Query: 268 GET----VKPKELE----ETKKLWHRTFGQSYEKAG 285
            +T     K K+L+    +T   W  TFG  Y KAG
Sbjct: 257 DDTDSDRGKGKKLDTGFSKTTAQWEETFGTRYWKAG 288

BLAST of Cp4.1LG04g00020 vs. TAIR10
Match: AT4G37900.1 (AT4G37900.1 Protein of unknown function (duplicated DUF1399))

HSP 1 Score: 133.3 bits (334), Expect = 6.4e-31
Identity = 89/291 (30.58%), Positives = 141/291 (48.45%), Query Frame = 1

Query: 28  IRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRRYEELWMPLISDLTVDGA-SPP 87
           I I VD+++A +++L FL  V  +  L+  P L +AI RY   W+PL++  T   +    
Sbjct: 17  IDISVDLLAAAKKHLLFLGAVDRNRCLYDGPALQRAIYRYNAYWLPLLAQYTESSSICQG 76

Query: 88  MILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIFDEENEEYAYMRCQEIWVKRY 147
            ++PPLD EWVW CH LNPVRYK  CE  + +++    +    N      + + +W + Y
Sbjct: 77  PLVPPLDCEWVWHCHRLNPVRYKTDCEQFYGRVLDNSGVVSSVNGN-CKSQTETLWKRLY 136

Query: 148 PTQSFENEEESSFRDDITVE------NQELLEEVMRQRHLYSNFLEPYQSEIVYLIAARQ 207
           PT+ ++ +  ++  +   V         +L+  V RQ   +      +    V+L  A  
Sbjct: 137 PTEPYDLDFANAISEPADVSALEKCTTYDLVLAVKRQSPFFYQVSRAHVDNDVFLQEAVA 196

Query: 208 RYMGFLYMLQRFSDECSSF--VPSSDILLMWLTHQSYPTVYAEDVKEMQGELAKVVRFGE 267
           RY  FLY+++   +       VP+ DI L+W THQ +   Y  D+ +M G   KV+   +
Sbjct: 197 RYKAFLYLIKGNRERSIKLFCVPTYDIDLIWHTHQLHAISYCNDLTKMIG---KVLEHDD 256

Query: 268 T----VKPKELEE----TKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPLV 302
           T     K K+L+     T   W  TFG+ Y KAG        + VT+ P V
Sbjct: 257 TDSDRSKGKKLDTGFSGTTAQWEETFGRRYWKAGAMNRGNTPKPVTTSPYV 303

BLAST of Cp4.1LG04g00020 vs. NCBI nr
Match: gi|778662367|ref|XP_011659813.1| (PREDICTED: uncharacterized protein LOC101207151 [Cucumis sativus])

HSP 1 Score: 1296.6 bits (3354), Expect = 0.0e+00
Identity = 645/768 (83.98%), Positives = 694/768 (90.36%), Query Frame = 1

Query: 1   MSSHRPGSDLSASVSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTL 60
           MSS R GS   AS+SARSLGDIS+ +T RIG+DIISAVRRNL FLRTV  SHWLHSEPT+
Sbjct: 1   MSSDRLGS---ASISARSLGDISEFSTTRIGLDIISAVRRNLGFLRTVADSHWLHSEPTI 60

Query: 61  TQAIRRYEELWMPLISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLI 120
           T+AIRRYEELWMPLISDL V G+SPPMILPPLDVEWVWFCHTLNPV YKHYCE+RFSK+I
Sbjct: 61  TEAIRRYEELWMPLISDLMVAGSSPPMILPPLDVEWVWFCHTLNPVGYKHYCETRFSKII 120

Query: 121 GKPSIFDEENEEYAYMRCQEIWVKRYPTQSFENEEESSFRDDITVENQELLEEVMRQRHL 180
           GKPSIFDEENEEYAYMRC+EIWVK+YPTQSFE EE SS RD ITVENQELLEEV RQR+L
Sbjct: 121 GKPSIFDEENEEYAYMRCKEIWVKKYPTQSFELEESSSLRDVITVENQELLEEVKRQRNL 180

Query: 181 YSNFLEPYQSEIVYLIAARQRYMGFLYMLQRFSDECSSFVPSSDILLMWLTHQSYPTVYA 240
           YS F EP++SEIVYLIAA+QRY GFLYMLQRFSDECSSFVP+SDILLMWLTHQSYPTVYA
Sbjct: 181 YSKFSEPFRSEIVYLIAAKQRYKGFLYMLQRFSDECSSFVPASDILLMWLTHQSYPTVYA 240

Query: 241 EDVKEMQGELAKVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPL 300
           EDVKEMQG+LAKVVRFGETV  KEL+ETK+LWHRTFGQ YEKAGG +IMELGRVVTS PL
Sbjct: 241 EDVKEMQGDLAKVVRFGETVNSKELDETKQLWHRTFGQPYEKAGGGIIMELGRVVTSNPL 300

Query: 301 VYWEVSFSDVNSKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEFKL 360
           VY E S  DVN+KYKSMTSRFILEV VFM  K QK+PLQQV S+EFLRLR+LRCH+EFKL
Sbjct: 301 VYLETSHLDVNTKYKSMTSRFILEVCVFMWHKAQKRPLQQV-SQEFLRLRSLRCHREFKL 360

Query: 361 KQPISSLDNDVWIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWNDLIRAPS 420
            QPISSL+ND+W KAWHL CEFGTKGV++ELRHP  HCFKGSS K+TTTFKWNDLIRAPS
Sbjct: 361 DQPISSLNNDLWHKAWHLCCEFGTKGVILELRHPSGHCFKGSSIKETTTFKWNDLIRAPS 420

Query: 421 LTLERQFDQNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWL 480
           LTLERQ + NLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWL
Sbjct: 421 LTLERQLNHNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWL 480

Query: 481 SRTVLDHGGRECFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRCPE 540
           SRTVLDHGGRECFVIR+RVG GFWRRGGETPLPVKWEDRIIEIREG WSYIAGSIGR PE
Sbjct: 481 SRTVLDHGGRECFVIRMRVGGGFWRRGGETPLPVKWEDRIIEIREGSWSYIAGSIGRSPE 540

Query: 541 KLVGTATPKQPLEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLKGR 600
           K+VGTATPKQPLEE KAAWNFSTGDELIIQWDTST +P+L+FSLTNP +SESSVRLLKGR
Sbjct: 541 KVVGTATPKQPLEELKAAWNFSTGDELIIQWDTSTTEPSLSFSLTNP-ASESSVRLLKGR 600

Query: 601 QMVYRV-------QRDNGREEEEEEGGDGNGFVTVIRYTNEDPTGRATALLNWKLLVIEL 660
           Q +Y V       Q D   +EEE EGGD +GFVT+IRYT+EDPTGRATALLNWKLLVIEL
Sbjct: 601 QKLYHVWRKVKEPQHDGNIQEEENEGGDDDGFVTMIRYTDEDPTGRATALLNWKLLVIEL 660

Query: 661 LPEEDAVLALLLCVSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIMLHPSS---- 720
           LPEEDAVLALL+CVSIL+S+SEMKKEDVG LLIRRRLRET IGLRDWGSIMLHPS     
Sbjct: 661 LPEEDAVLALLICVSILRSISEMKKEDVGNLLIRRRLRETKIGLRDWGSIMLHPSKNSTT 720

Query: 721 -SPYLQPWYWNAETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGIIS 757
            SPYL+PWYWNAETVMASN+++HL RQ ASSYLPVEGGDKLYKQGIIS
Sbjct: 721 PSPYLRPWYWNAETVMASNSVEHLMRQPASSYLPVEGGDKLYKQGIIS 763

BLAST of Cp4.1LG04g00020 vs. NCBI nr
Match: gi|645221937|ref|XP_008246354.1| (PREDICTED: uncharacterized protein LOC103344538 [Prunus mume])

HSP 1 Score: 1051.6 bits (2718), Expect = 6.6e-304
Identity = 507/774 (65.50%), Positives = 615/774 (79.46%), Query Frame = 1

Query: 1   MSSHRPGSDLSASVSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTL 60
           MS    GS++  ++S RS  +IS++ T ++G+D++SA RRN+ FLRTV  SHWLH +PT+
Sbjct: 1   MSLSSGGSEIPGNMSGRSYSEISEVETFKVGLDLVSAARRNIGFLRTVAESHWLHQKPTV 60

Query: 61  TQAIRRYEELWMPLISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLI 120
            +AIRRY ELWMPL+SDLTV+  +PP I PP+D+EWVWFCHTLNPV Y+ YCES+FSKLI
Sbjct: 61  IEAIRRYNELWMPLVSDLTVESTTPPRIHPPIDIEWVWFCHTLNPVYYRQYCESKFSKLI 120

Query: 121 GKPSIFDEENEEYAYMRCQEIWVKRYPTQSFENE--EESSFRDDITVENQELLEEVMRQR 180
           GK +IFD+ENEEYA MRC+E+WV+RYP + FENE   +S  R       QELLEEV + R
Sbjct: 121 GKATIFDDENEEYALMRCRELWVRRYPNEPFENEVYSDSDVRLPEEANEQELLEEVKKNR 180

Query: 181 HLYSNFLEPYQSEIVYLIAARQRYMGFLYMLQRFSDECSSFVPSSDILLMWLTHQSYPTV 240
            LYS F EPY++EIVYLIAARQRY  FL+M+Q   D CSS VP+SDI+LMWL+HQSYPTV
Sbjct: 181 FLYSKFSEPYRAEIVYLIAARQRYKRFLFMVQSSIDLCSSLVPASDIMLMWLSHQSYPTV 240

Query: 241 YAEDVKEMQGELAKVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRVVTSK 300
           YA D+KEM+G+L KVV    TVK KE+EETKKLW RTF Q YEKAGGE+ +EL   V+ K
Sbjct: 241 YAADLKEMEGDLGKVVCMWATVKEKEVEETKKLWERTFDQPYEKAGGEIALELDGGVSFK 300

Query: 301 PLVYWEVSFSDVNSKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEF 360
           P VYWEVS +DVN+KYK M  RF+LEV VF+RL+D+ + +Q+   R  LRLR +RCH+E 
Sbjct: 301 PTVYWEVSDTDVNTKYKPMHPRFLLEVCVFVRLRDKMKEMQEDMKRNVLRLRMVRCHREL 360

Query: 361 KLKQPISSLDNDVWIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWNDLIRA 420
           KL++P+S      W KAWHLYCEFGTKGV+ E+R  G  CFKGSS ++T TF WNDL+RA
Sbjct: 361 KLEKPVSDFPYSSWRKAWHLYCEFGTKGVIFEIRKRGGSCFKGSSVQETVTFHWNDLLRA 420

Query: 421 PSLTLERQFDQNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGR 480
           PSLTLE++ DQ +KIVASITPPVQAPYLLKCVPD+VTDDSGAM+SD++LRMNQYRPQEGR
Sbjct: 421 PSLTLEKE-DQQVKIVASITPPVQAPYLLKCVPDRVTDDSGAMISDLILRMNQYRPQEGR 480

Query: 481 WLSRTVLDHGGRECFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRC 540
           WLSRTVLDH GRECFVIR+RVG GFWRRGGETP  VKWEDRIIEIREG WSY+AGSIGR 
Sbjct: 481 WLSRTVLDHAGRECFVIRIRVGEGFWRRGGETPSAVKWEDRIIEIREGSWSYVAGSIGRA 540

Query: 541 PEKLVGTATPKQPLEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLK 600
           P KLVGTA PK+P E+WKAAWNFSTGDEL+IQW+ S+ K  L+F L NP ++ES V+LLK
Sbjct: 541 PVKLVGTAIPKEPPEQWKAAWNFSTGDELMIQWELSSSKSGLSFGLKNP-AAESMVKLLK 600

Query: 601 GRQMVYRVQR----------DNGRE-EEEEEGGDGNGFVTVIRYTNEDPTGRATALLNWK 660
           GR+M Y+V++           N  E EEEEE  +  GF+T++RYT ++P GRATALLNWK
Sbjct: 601 GRKMQYQVKKKKSLTKDEEWQNEEEGEEEEEDEEEEGFLTLVRYTEDNPNGRATALLNWK 660

Query: 661 LLVIELLPEEDAVLALLLCVSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIMLHP 720
           LLV EL+PEEDAVL LLLC+SIL+SVSEMKKEDVG LLIRRRL+E  +G RDWGS++LHP
Sbjct: 661 LLVAELMPEEDAVLVLLLCISILRSVSEMKKEDVGCLLIRRRLKEVKLGTRDWGSVVLHP 720

Query: 721 S-----SSPYLQPWYWNAETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGIIS 757
           S     SSPYLQPWYWNA+  +AS+   H++RQ +  Y P EGGDK YK+GI++
Sbjct: 721 SSSSSISSPYLQPWYWNAKAFLASDGAGHITRQPSICYSPEEGGDKFYKRGILA 772

BLAST of Cp4.1LG04g00020 vs. NCBI nr
Match: gi|595842103|ref|XP_007208347.1| (hypothetical protein PRUPE_ppa001832mg [Prunus persica])

HSP 1 Score: 1047.0 bits (2706), Expect = 1.6e-302
Identity = 501/761 (65.83%), Positives = 612/761 (80.42%), Query Frame = 1

Query: 14  VSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRRYEELWMP 73
           +SARS  +IS++   ++G+D++SA RRN+ FLRTV  S WLH +PT+ +AIRRY ELWMP
Sbjct: 1   MSARSFSEISEVENFKVGLDLVSAARRNIGFLRTVAESQWLHQQPTVIEAIRRYNELWMP 60

Query: 74  LISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIFDEENEEY 133
           L+SDLTV+  +PP I PP+D+EWVWFCHTLNPV Y+ YCES+FSKLIGK +IFDEENEEY
Sbjct: 61  LVSDLTVESTTPPAIHPPIDIEWVWFCHTLNPVYYRQYCESKFSKLIGKATIFDEENEEY 120

Query: 134 AYMRCQEIWVKRYPTQSFENE--EESSFRDDITVENQELLEEVMRQRHLYSNFLEPYQSE 193
           A MRC+E+WV+RYP + FENE   +S  R       +ELLEEV + R L+S F EPY++E
Sbjct: 121 ALMRCRELWVRRYPNEPFENEVDSDSDVRVPEAANEEELLEEVKKNRFLHSKFSEPYRAE 180

Query: 194 IVYLIAARQRYMGFLYMLQRFSDECSSFVPSSDILLMWLTHQSYPTVYAEDVKEMQGELA 253
           IVYLIAA+QRY  FL+M+Q   D CSS VP+SDI+LMWL+HQSYPTVYAED+KEM+G+L 
Sbjct: 181 IVYLIAAKQRYKRFLFMVQSTIDLCSSLVPASDIMLMWLSHQSYPTVYAEDLKEMEGDLG 240

Query: 254 KVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPLVYWEVSFSDVN 313
           KVV    TVK KE+EETKKLW RTF Q YEKAGGE+ +EL   V+ KP VYWEVS +DVN
Sbjct: 241 KVVSMWATVKEKEVEETKKLWERTFDQPYEKAGGEIALELDGGVSFKPTVYWEVSDTDVN 300

Query: 314 SKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEFKLKQPISSLDNDV 373
           +KYK M  RF+LEV VF+RL+D+ + +Q+   R  LRLR +RCH+E KL++P+S   +  
Sbjct: 301 TKYKPMHPRFLLEVCVFVRLRDKMKEMQEDMKRNVLRLRMVRCHRELKLEKPVSDFPHSS 360

Query: 374 WIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWNDLIRAPSLTLERQFDQNL 433
           W KAWHLYCEFGTKGV+ E+R  G  CFKGSS ++T TF WNDL+RAPSLTLE++ DQ +
Sbjct: 361 WRKAWHLYCEFGTKGVIFEIRKRGGSCFKGSSVQETVTFHWNDLLRAPSLTLEKE-DQQV 420

Query: 434 KIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWLSRTVLDHGGRE 493
           KIVASITPPVQAPYLLKCVPD+VTDDSGAM+SD++LRMNQYRPQEGRWLSRTVLDH GR+
Sbjct: 421 KIVASITPPVQAPYLLKCVPDRVTDDSGAMISDLILRMNQYRPQEGRWLSRTVLDHAGRD 480

Query: 494 CFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRCPEKLVGTATPKQP 553
           CFVIR+RVGAGFWRRGGETP  VKWEDRIIEIREG WSY+AGSIGR P KLVGTA PK+P
Sbjct: 481 CFVIRIRVGAGFWRRGGETPSAVKWEDRIIEIREGSWSYVAGSIGRAPVKLVGTAIPKEP 540

Query: 554 LEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLKGRQMVYRVQR--- 613
            E+WKAAWNFSTGDEL+IQW+ S+ K  L+F L N  ++ES+V+LLKGR+M Y+V++   
Sbjct: 541 PEQWKAAWNFSTGDELMIQWELSSSKSGLSFGLKN-QAAESTVKLLKGRKMQYQVKKKKS 600

Query: 614 -------DNGRE-EEEEEGGDGNGFVTVIRYTNEDPTGRATALLNWKLLVIELLPEEDAV 673
                   N  E EEEEE  +  GF+T++RYT ++P GRATALLNWKLLV EL+PEEDAV
Sbjct: 601 VTKDEECQNEEEGEEEEEDEEEEGFLTLVRYTEDNPNGRATALLNWKLLVAELMPEEDAV 660

Query: 674 LALLLCVSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIMLHPS-----SSPYLQP 733
           L LLLC+SIL+SVSEMKKEDVG LLIRRRL+E  +G RDWGS++LHPS     SSPYLQP
Sbjct: 661 LVLLLCISILRSVSEMKKEDVGCLLIRRRLKEVKLGTRDWGSVVLHPSSSSSISSPYLQP 720

Query: 734 WYWNAETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGIIS 757
           WYWNA+ ++AS+   H++RQ + SY P EGGDK YK+GI++
Sbjct: 721 WYWNAKAIIASDGAGHITRQPSISYSPEEGGDKFYKRGILA 759

BLAST of Cp4.1LG04g00020 vs. NCBI nr
Match: gi|567881821|ref|XP_006433469.1| (hypothetical protein CICLE_v10003191mg [Citrus clementina])

HSP 1 Score: 1025.4 bits (2650), Expect = 5.1e-296
Identity = 489/756 (64.68%), Positives = 609/756 (80.56%), Query Frame = 1

Query: 7   GSDLSASVSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRR 66
           GS++S +++ R+L  IS++ TIR+GVD++SA R+N+ FLRTV  S WLH  PT+ +AIRR
Sbjct: 5   GSEISDNLTTRTLSGISEVDTIRLGVDLVSATRKNIGFLRTVNESQWLHERPTILEAIRR 64

Query: 67  YEELWMPLISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIF 126
           YE LWMPL+SDLTV GA PPMILPP+D+EWVWFCH+LNPVRY+ YCESRFSKLIGKP+IF
Sbjct: 65  YEGLWMPLMSDLTV-GAPPPMILPPVDIEWVWFCHSLNPVRYRQYCESRFSKLIGKPAIF 124

Query: 127 DEENEEYAYMRCQEIWVKRYPTQSFENEEESSFRDDITVENQELLEEVMRQRHLYSNFLE 186
           DEENEEYA MRC+EIW  +YP + FENE +S   + I V N+++L EV RQR LYS F E
Sbjct: 125 DEENEEYALMRCREIWEHKYPYEPFENEVDSDSENPICVTNEDILNEVKRQRFLYSKFSE 184

Query: 187 PYQSEIVYLIAARQRYMGFLYMLQRFSDECSSFVPSSDILLMWLTHQSYPTVYAEDVKEM 246
           PY  E+VYLIAARQRY GFLY+LQ+FSD CS FVP+SDI LMWLTH SYPTVYAED+K+M
Sbjct: 185 PYMCELVYLIAARQRYKGFLYILQKFSDGCSLFVPASDIQLMWLTHLSYPTVYAEDLKDM 244

Query: 247 QGELAKVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPLVYWEVS 306
             ++ KVV     VK K++EETKK+W +TF   YEKAGG + +E   + + KP ++W VS
Sbjct: 245 WDDMGKVVGVWGNVKAKDVEETKKIWEKTFDLPYEKAGGGLALEFDGIASVKPPIFWNVS 304

Query: 307 FSDVNSKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEFKLKQPISS 366
            +DVNSKYKSM  RF+LEV +F++LK   + +QQ    +FLRLR +RCH+E KL +PIS+
Sbjct: 305 DTDVNSKYKSMLPRFLLEVCIFLKLKSGMKAMQQDIKCDFLRLRMVRCHRELKLGKPISN 364

Query: 367 LDNDVWIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWNDLIRAPSLTLERQ 426
             ++ W+K WHLYCEFGTKG+++ELRHPG  CFKGS+ + T  F+WN+L+RAPSLT+ER+
Sbjct: 365 FSHNSWLKVWHLYCEFGTKGLILELRHPGGACFKGSTLQGTVEFRWNNLLRAPSLTMERE 424

Query: 427 FDQNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWLSRTVLD 486
            +Q  ++V SITPPVQA YLLKCVPD+VTDDSGAM+SDV+LR+N+YRPQEGRWLSRTVLD
Sbjct: 425 IEQ-FRVVISITPPVQAQYLLKCVPDRVTDDSGAMISDVILRLNRYRPQEGRWLSRTVLD 484

Query: 487 HGGRECFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRCPEKLVGTA 546
           H GRECFV+R+RVG GFWRRGGETP  VKWEDRIIEIREG WSY+AGSIGR PEK+VGTA
Sbjct: 485 HAGRECFVVRIRVGGGFWRRGGETPSAVKWEDRIIEIREGFWSYVAGSIGRAPEKVVGTA 544

Query: 547 TPKQPLEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLKGRQMVYRV 606
           TPK+   E +AAW+FSTGDEL+I W++S+    L F+L N  S +S + LL+GR+M Y+ 
Sbjct: 545 TPKEATAECQAAWDFSTGDELMINWESSSSTSGLKFTLKNAASPDSLLVLLRGRKMQYQ- 604

Query: 607 QRDNGREEEEEEGGDGNGFVTVIRYTNEDPTGRATALLNWKLLVIELLPEEDAVLALLLC 666
            R+    E+E E  D  GFVT+IR+T+E+PTG+ATALLNWKLLVIELLPEEDAVLALLLC
Sbjct: 605 GRELSEVEKEAEEEDDEGFVTLIRFTDENPTGKATALLNWKLLVIELLPEEDAVLALLLC 664

Query: 667 VSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIMLHPSS-------SPYLQPWYWN 726
            SIL+S+SEM+KEDVG LLIRRR++ET +G RDWGS++LHPSS       SPY+QPWYWN
Sbjct: 665 FSILRSISEMRKEDVGGLLIRRRIKETKLGHRDWGSVILHPSSLSSSSSTSPYIQPWYWN 724

Query: 727 AETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGII 756
           A+ VMA++  D++ R  A +Y P EGGDKLYK+GII
Sbjct: 725 AKAVMAAST-DNIRRPPAQNYSPAEGGDKLYKRGII 756

BLAST of Cp4.1LG04g00020 vs. NCBI nr
Match: gi|694420664|ref|XP_009338232.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103930600 [Pyrus x bretschneideri])

HSP 1 Score: 1017.7 bits (2630), Expect = 1.1e-293
Identity = 495/772 (64.12%), Positives = 603/772 (78.11%), Query Frame = 1

Query: 7   GSDLSASVSARSLGDISQLTTIRIGVDIISAVRRNLAFLRTVLHSHWLHSEPTLTQAIRR 66
           G  +SA   A SL +IS + T+++G+D++SA RRN+ FLRTV  S WLH + T  +A+RR
Sbjct: 6   GGSMSA---ASSLSEISDMETLKLGLDLVSAARRNIGFLRTVAESQWLHQKATAVEAVRR 65

Query: 67  YEELWMPLISDLTVDGASPPMILPPLDVEWVWFCHTLNPVRYKHYCESRFSKLIGKPSIF 126
           Y ELWMPL+SDLT + A+ P+I PP+D+EWVWFCHTLNPV Y+ YCE +FS LIGK +IF
Sbjct: 66  YNELWMPLVSDLTAESAAVPVIHPPIDIEWVWFCHTLNPVSYRQYCEQKFSNLIGKATIF 125

Query: 127 DEENEEYAYMRCQEIWVKRYPTQSFENEEESSFRDDITV-----ENQELLEEVMRQRHLY 186
           +EEN+EYA M+C+EIW++RYP + FENE +S    ++ V     E  ELLE+V + R LY
Sbjct: 126 NEENKEYALMKCREIWIRRYPNEPFENEADSDXNSEVRVLEVADEEAELLEQVKKNRFLY 185

Query: 187 SNFLEPYQSEIVYLIAARQRYMGFLYMLQRFS-DECSSFVPSSDILLMWLTHQSYPTVYA 246
           S FLEPY+SEIVYLIAARQRY  FL+M+QRFS D  S  VP+SDI+LMWLTHQSYPTVYA
Sbjct: 186 SKFLEPYRSEIVYLIAARQRYKRFLFMVQRFSIDLDSPLVPASDIMLMWLTHQSYPTVYA 245

Query: 247 EDVKEMQGELAKVVRFGETVKPKELEETKKLWHRTFGQSYEKAGGEVIMELGRVVTSKPL 306
           ED+K M+G+L KVV     VK KE+EET+KLW RTF Q YEK GGE+ ++L   V+ KPL
Sbjct: 246 EDLKGMEGDLGKVVTVWAVVKEKEVEETRKLWERTFDQPYEKGGGEIALKLDGGVSFKPL 305

Query: 307 VYWEVSFSDVNSKYKSMTSRFILEVSVFMRLKDQKQPLQQVASREFLRLRTLRCHKEFKL 366
           VYWE S +DVN+KYK M  RF+LEV V +RL+D+ + +Q+      LRLR +RCH+E KL
Sbjct: 306 VYWEASDTDVNTKYKPMHPRFLLEVCVLVRLRDKTKVMQEDTKHNILRLRMVRCHRELKL 365

Query: 367 KQPISSLDNDVWIKAWHLYCEFGTKGVVIELRHPGSHCFKGSSTKDTTTFKWNDLIRAPS 426
           ++PIS      W KAWHLYCEFGTKGV++ELR  G  CFKG+S ++T TF WNDL+RAPS
Sbjct: 366 EKPISDFPYATWQKAWHLYCEFGTKGVILELRRRGGSCFKGTSVQETVTFHWNDLLRAPS 425

Query: 427 LTLERQFDQNLKIVASITPPVQAPYLLKCVPDKVTDDSGAMVSDVVLRMNQYRPQEGRWL 486
           LTL++++ Q + IVASITPPVQAPYLLKCVPD+VTDDSGAM+SDV+LRMNQYRPQEGRWL
Sbjct: 426 LTLDKEYQQ-VDIVASITPPVQAPYLLKCVPDRVTDDSGAMISDVILRMNQYRPQEGRWL 485

Query: 487 SRTVLDHGGRECFVIRLRVGAGFWRRGGETPLPVKWEDRIIEIREGCWSYIAGSIGRCPE 546
           SRTVLDH GRECFVIR+R+G GFWRRGGE P  VKWEDRIIEIREG WSY+AGSIGR P 
Sbjct: 486 SRTVLDHAGRECFVIRIRMGEGFWRRGGEKPSAVKWEDRIIEIREGSWSYVAGSIGRAPA 545

Query: 547 KLVGTATPKQPLEEWKAAWNFSTGDELIIQWDTSTPKPALTFSLTNPTSSESSVRLLKGR 606
           KLVGTA PK+PLE+W AAWNFSTGDEL+IQWD S+    L+F + N   +ES+V+LLKGR
Sbjct: 546 KLVGTAIPKEPLEQWTAAWNFSTGDELMIQWDLSSSISGLSFGIQNQ-DAESTVKLLKGR 605

Query: 607 QMVYR------VQRDNGREEEEE----EGGDGN-GFVTVIRYTNEDPTGRATALLNWKLL 666
           +M YR      V  D  R+ EEE    EG D   GF+T++RYT ++P GRATALLNWKLL
Sbjct: 606 KMQYRGWKKRPVTEDEDRQHEEEGEDEEGEDEEEGFLTLVRYTEDNPNGRATALLNWKLL 665

Query: 667 VIELLPEEDAVLALLLCVSILKSVSEMKKEDVGKLLIRRRLRETTIGLRDWGSIMLHPS- 726
           V ELLPEEDAVL LLLC+SIL+SVSEM+KED+G LLIRRRL+E  +G RDW S++LHPS 
Sbjct: 666 VAELLPEEDAVLVLLLCISILRSVSEMRKEDLGCLLIRRRLKEVKLGTRDWASVVLHPSS 725

Query: 727 ----SSPYLQPWYWNAETVMASNNIDHLSRQRASSYLPVEGGDKLYKQGIIS 757
               SSPYLQPWYWNA+ +MAS   DH +RQ A SY P EGGDKLYK+GI++
Sbjct: 726 SSSISSPYLQPWYWNAKVIMASEGADHFTRQPAISYSPEEGGDKLYKRGILA 772
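
The per-hit summary lines in the alignments above (identity and positives counts with their percentages) can be checked mechanically. The following is a minimal sketch, not part of the original record: the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern assumes the exact line layout shown on this page. It also tolerates a missing "i" in the "Positives" label in case the label is misspelled.

```python
import re

# Pattern for the summary line printed under each "HSP 1 Score" header.
# "Posi?tives" accepts both "Positives" and the variant without the first "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity/positives counts and recompute the percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identical": ident,
        "positives": pos,
        "aligned_length": ident_len,
        # Percentages as printed in the report.
        "identity_pct_reported": float(ident_pct),
        "positives_pct_reported": float(pos_pct),
        # Percentages recomputed from the raw counts, rounded to 2 decimals.
        "identity_pct_computed": round(100.0 * ident / ident_len, 2),
        "positives_pct_computed": round(100.0 * pos / pos_len, 2),
    }


# Example using the Theobroma cacao (TrEMBL) hit from this record.
stats = parse_hsp_stats(
    "Identity = 480/762 (62.99%), Positives = 587/762 (77.03%), Query Frame = 1"
)
print(stats["identity_pct_computed"], stats["positives_pct_computed"])
```

Comparing the reported and recomputed percentages is a quick way to catch transcription damage when alignment summaries are copied out of a page like this one.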

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
GRDP1_ARATH       4.9e-33   30.80         Glycine-rich domain-containing protein 1 OS=Arabidopsis thaliana GN=GRDP1 PE=2 S...
GRDP2_ARATH       1.1e-29   30.58         Glycine-rich domain-containing protein 2 OS=Arabidopsis thaliana GN=GRDP2 PE=2 S...

Match Name        E-value   Identity (%)  Description
A0A0A0M1H5_CUCSA  0.0e+00   83.98         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G569190 PE=4 SV=1
M5W894_PRUPE      1.1e-302  65.83         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001832mg PE=4 SV=1
V4V486_9ROSI      3.5e-296  64.68         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10003191mg PE=4 SV=1
B9I7B2_POPTR      5.5e-289  62.95         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0013s01450g PE=4 SV=1
A0A061F3C6_THECC  1.2e-283  62.99         Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_026691 PE=4 SV=1

Match Name        E-value   Identity (%)  Description
AT1G56230.1       3.8e-233  54.27         Protein of unknown function (DUF1399)
AT2G22660.2       2.8e-34   30.80         Protein of unknown function (duplicated DUF1399)
AT4G37900.1       6.4e-31   30.58         Protein of unknown function (duplicated DUF1399)

Match Name                        E-value   Identity (%)  Description
gi|778662367|ref|XP_011659813.1|  0.0e+00   83.98         PREDICTED: uncharacterized protein LOC101207151 [Cucumis sativus]
gi|645221937|ref|XP_008246354.1|  6.6e-304  65.50         PREDICTED: uncharacterized protein LOC103344538 [Prunus mume]
gi|595842103|ref|XP_007208347.1|  1.6e-302  65.83         hypothetical protein PRUPE_ppa001832mg [Prunus persica]
gi|567881821|ref|XP_006433469.1|  5.1e-296  64.68         hypothetical protein CICLE_v10003191mg [Citrus clementina]
gi|694420664|ref|XP_009338232.1|  1.1e-293  64.12         PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103930600 [Pyrus x br...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR009836  GRDP-like
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005886      plasma membrane
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG04g00020.1  Cp4.1LG04g00020.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                              Source   Source Term    Source Description   Alignment
IPR009836  Glycine-rich domain-containing protein-like  PFAM     PF07173        DUF1399              coord: 29..109, score: 9.6E-7
                                                                                                     coord: 107..242, score: 2.0
None       No IPR available                             PANTHER  PTHR34365      FAMILY NOT NAMED     coord: 8..756, score: (not given)
None       No IPR available                             PANTHER  PTHR34365:SF1  SUBFAMILY NOT NAMED  coord: 8..756, score: (not given)
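
The alignment coordinates in the InterPro table can be turned into domain lengths and overlaps with a few lines of arithmetic. This is a minimal sketch under one stated assumption: the `coord` values are read as 1-based, inclusive residue positions on the protein, which this page does not state explicitly. The helper names `span_length` and `overlap` are hypothetical.

```python
def span_length(span):
    """Number of residues in an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def overlap(a, b):
    """Number of residues shared by two inclusive spans (0 if disjoint)."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


# The two PF07173 (DUF1399) matches and the PANTHER match listed for this gene.
pfam_hits = [(29, 109), (107, 242)]
panther_hit = (8, 756)

for hit in pfam_hits:
    print(hit, "length:", span_length(hit))

print("overlap between PFAM hits:", overlap(pfam_hits[0], pfam_hits[1]))
print("PANTHER match length:", span_length(panther_hit))
```

Under that assumption, the two PFAM matches are adjacent and share only a short boundary region, and both fall within the PANTHER match.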

The following gene(s) are paralogous to this gene:

None