Cp4.1LG03g15580.1 (mRNA) Cucurbita pepo (Zucchini)

Name: Cp4.1LG03g15580.1
Type: mRNA
Organism: Cucurbita pepo (Zucchini)
Description: Vacuolar protein sorting-associated protein 8-like protein
Location: Cp4.1LG03 : 13689552 .. 13707991 (-)
Sequence length: 6365
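The Location field packs the chromosome, a coordinate range, and the strand into one string. The following is a minimal Python sketch of how one might parse it, assuming the `chrom : start .. end (strand)` layout used in this record and assuming the coordinates are 1-based and inclusive. The names `Location` and `parse_location` are illustrative and not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse 'CHROM : START .. END (STRAND)' into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return Location(chrom, int(start), int(end), strand)


loc = parse_location("Cp4.1LG03 : 13689552 .. 13707991 (-)")
# Assumption: coordinates are inclusive, so the span is end - start + 1.
# The genomic span includes introns, so it exceeds the 6365-base mRNA length.
genomic_span = loc.end - loc.start + 1
```

Under the inclusive-coordinate assumption the genomic span works out to 18440 bases, which is the length to expect for the gene sequence with introns, while the reported sequence length of 6365 refers to the spliced mRNA.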
The following sequences are available for this feature:

Gene sequence (with intron)

CACTAACTATTTGGTGCCTCCCTAGTTCTCCAAGCCGAAGGCTTCAAGATCGAGAATGACCAAGGAGCTGACTGACACCGAAACACTGCCTCCAATGGAGCTGGACTTGAATGCCTTCATCCACGCACACCTCTCCAGCGGCGACGACGACCACGACCACGACGAAGACGACCTATCATTCCCTCACCGTAGCATCGACGAGATTCTAAACGAATCCAGTTCTTCCACCTCATCTTCACCATCGTCTCCTCCCAATTCACCGCCGCCTCGTGCCCGTCGCTCCATCGCCGCAAGGGACGGCAGGGCCTCTGCTTCTCGTTCTATACCACCATTCAAGTCACCGTTTGAGGAAATAATAAAGGCTTCTAAAGTTCCCAGAAGCAACCAACGGAATGAGAAGTCAGTTCAATTGAAACCAGGTTCGGTCTCTCATACTAAGGTTGGTGAATTAACGGACGATCCATTTCGAAGGGGATCTCGTGCATTGCCATCGTTGTTTGGAGGGGTTAGATCGAATGCCAAACCTGGGGCGGCGCTTGCCGCAGCTGCTGCGGCTTCTCGGTCCATGCCGGCTCCGCACGCTGCAGCAATCAAGTCGAGGAGGTCAGGACATGGGAGCGTGGTTCTTGACGATGACGAATTGGCTTCCTCTTCTGCTGTTGATTCAGAGTTTGTCTTTGATGATTTATATTCTACTATCGATCATTCAAAGGAGTCTCGCGAAAAGTCAATTTCATTGGTCGAAAGGAATGCCGATTATCAGGGTGCATCTGTAAACGTCGGTGTTGAATTTTGGGCAAGAGACAACATTCGAGACTGTGTCCTATATAACGATGAGTTTCGTATAACTAAAGATACGGAATGCGAAGCAGAACAGAGTTTTGTGGATGATGTGAATTTCGACGAGAGCTCTACAACTCTGCCGCCAGTGGAAGCTAACGGTAGAAGCTTGTCTGATTCTGCTGACAACAATGTTTGTTCAATGGATGCAGAACCAACAGTATTGGATGGCGATGAATCAAATGAAGGGGCCTTTCCGTGTTCTCCCAAACCTGATTACGACAGAAGTGCCGTGGGCTATGGGAGGCTGGAGTTGGAAACTCAAGATTTCGAGAAACGTTCTCAACCATCAAAAGATTCGGAGGTTCTAGCCATTGAGGATCTCAGTGTAGTGAACGATATCAGTGAATCGAGGGAAACAGCCAAGCAGCTGGATAACTTTCACACCGGTGAACGTGCAGAAACGATGTCCCTGTCCTCGTCTAATCCACTCGAATTGGCCGAAGAAATTGAAAAGAAGCAAGCTTTCACTGCACTGCATTGGGAAGAAGGCGTAGCTGCTCAACCAATGAGGCTTGAAGGTATTAAGGGAGGCACAACAGCATTGGGGTACTTCGACATTCAAGCTGACAATAGTATTTCAAGAACTATTTCATCACATTCGTTCAAGCGTGAACATGGTTTTCCCCAAGCCTTGGCTGTTCATGCAAATTATATTGCAGTTGGAATGTCAAAAGGAAATATTGTTGTGGTGGCCAGTAAATACTCGGCTCAAAATGGCGACAACATGGATGCGAAGGTCAACAGTGTACCTATTCANTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCGTGGTGCATCTGTTATCTTCGTTTTTTTTATGTCTACTTAGAACTTGGATGAATTTGGGAGATTTATGGAGCTGCGACCATGAGTGCATGTTTGATTGTTCAGTAGCATCAACTAATAGATGCAATTAGTGTTACCGATTGCTATTCTTTCTCCTTCTTCACGACTCGTACTCGATGGGTCTCTTGAATTCTATGTGAAGTTTTTGCACTCACTTCAACGATATGTTTGACATGTTACAAATGTTGTGCGAGACTTGAAACTCTTGGTGTCTTATACAGGAGATTTTGAAGTTGTAGTCAGAGTTTTTCTTTCCAGCAATGCCATAGGTTACCAGGCATTGAGATGTCTTATACAGGAGATT
GTGATGTTGTAGTCAGAGTTTTTCTTTCCAGCAATGCCATAGGTTACCAGTCATTTAGATGTCTTATACAGGAGATTTTGAAGTTGTAGTCAGAGTTTTTCTTTCCAGCAATGCCATAGGTTACCAGTCATTTAGAAAACTGTATTACGTCCATAGTTTAAGTTTCTTAAAGCGTACCTTTCTTTTGGTAGAGGGATTTAAGCATCAGGATATAATGAGTTAATATGTTGGCATTTTACCGGTAAAGTACTAAAAGGGCCAAATTTGGCATCATATTGGAAGTTTTGTTGACTTTTCAGAATACAACCAGTTTCCATTCCTTGAGATGTCGATATCATTTCTACTATTCAAGCTTTTAAAAAATGTATCTACCATCAGGGGAAATGACTATCACATACCGAGTGAAATTCTCGGCATTGTATTGGAAAGTACGGAAAGGGCCAATTTGGAGTTGCTTAAGAAGGGAGCAACGCGGATTAGCGGAAAAAATATAAAAATAGAAGAAAGAGGAAGTTTCAACACCTGTATGGTCTTCTTTGATAAGTTGACAGAACTTTATACTTAATAGTAGGAACAGCATGAGCGGAGAAAAGATTAGAATAGAAGAAACGGGAAATTTCAACATCTCTATGGTCTTTTTTGATCTCAAGAAGTTTGGACTGGAGATGTCTTCTCTGACCCAACATTGTGCCCTTTAAACACAAAGTTTTCTAGGAAACATCCCCTTGGGGAGAATAATCTGTGGGTGGAAAAGATTACAAACAAAAGTGAGGTTTTGCAGAGGTTATGAAACTTGAGAAAAATTGCTTTTTTGTAGTTTTAAACACAAAGTTTTATAATTTTAATGATTGTTTTTTTGTGTCTCTTGGTGGAGGCTGGAGGCTTTGTCCTTCCCTTTTTATATCCTTCATTGTCTGATGGAATCCTCTAGTTTATTTTTGGAAAAAATAAACATTTTTTCTTGTTAAAAAAAGTGATATCATCTTTTCAAACCAAGCTGAGAAACTGACTTCTATTGTATTATTTTTATATTTGATTGACAATATCCTTGATCATCCTTGTTGGCCTCAGTCTTTTTGTTACTTTCTTTTTGTTACTTTTCCATTTATTTAGATATTTAATATTATGAATTACACTATGTGGATATAGATGCTGCTGCTTGGATCACAAGGTGATAAATCAACTGCGCCAGTAACATCTCTATGCTTTAATCAGCAAGGGGACCTTCTTCTGGCTGGTTACAGTGATGGTCAGGTTACAGTCTGGGATGTGTTGAGGGCGACGGCAGCCAAGGTTATTTCCGGAGAACACACATCACCAGTTGTTCATTCATTGTTCCTTGGGCAGGAGGCTCAGGTTACTCGACAATTTAAAGCAGTTACTGGCGATAGTAAGGGTCTGGTTTTGTTACATACGTTCTCAGTGGTTCCTTTGCTAAATAGATTCTCCATTAAAACTCAGGTAGAAAAAATCTACGCCATGGAGTTAATTATTTGGGTTCTGTTGCATTAGAATTATGATAATTATATGGGTTGTTGCGATTAAATGCTTATTCTGCAAGGGTTTGTGGCACTGAACAAATAATTTGTTCTGAATTGTGATCTGTTGTTAATCTACTGAATTAGTGTCTTCTTGATGGGCAAAAAACGGGGACTGTTCTATCAGCTTCAGCACTTCTTTTGAATGAATTCTGTGGAAGTTCTTTGCCACCATCTCTTTCAAATGTCGCAGTTTCAACCAGCAGTATTGGGAGCATGATGGGAGGGGTTGTTGGAGGAGATTCAGGCTGGAAACTCTTCAATGAAGGATCATCTTTGGTTGAAGGAGTTGTCATATTTGCTACCCATCAAACTGCTCTGGTGGTATGGGACCCTTGACAATAGCACGATTTCAGTTTTATTTCATAATTGAACGTTGATGATATTTTGAGGTAAACTATTCAGAAATTTCTTGAATTCGCTTGAGATTGCTTCAGGTAAGGCTGAGTCCCACTGTGGAAG
TGTATGCTCGGCTCTCTAAGCCAGATGGAATTCAGGAAGGTTCTATGCCTTACACTGCATGGAAATGCTCACAGTCTATTGGTACGAATTTAATGCCCTTAGAATGGTCACGGCTTCAATCATTTGTGGAATGAATACAGTATTTGTTTCATATAAAAAAAATATAGTGTTTGTAAAAATTCAAAATTGATTTGATGAGTGAACTGGATCACATTTTTGAAGTGATGTTTGAATTTATGGTGTGAAAGAAATGGAAATCTTGCTAACATGGTTCTCTGTTGTTGAAAAAAAAAAAAAAAAAAATTATTTTCTTATTTGGAGCAGTGTGTTTTTTACACTTAAGACAAGTTCTAGTTGTACCTCACAAAAGATAAGTGCGGTTTAGAAAAATGTGACTAAAATTCTGAATAGTACCTTGTTGTCACTGAGTACGTACATGTGCTTGGTGATTCATCTAGGAGTGCTGGCAGCTATATAAGAAATGAATTAGACTACTTTCAAAGCGGTATCTTGATGGTTATCTCTTCACCTTTAATCAATTTTAAGCATATCAAGACAAACTTGTTTTTTATGATTTTAGATTAAGTTCTGAATTAATTAGAGTTTCCTATCTAAGTTCATCTCAGTTATTTTCAAATTTTCAACTTAGTGTGAATCAGATTAATTTTACTTGTGTTTCAATTATGGAAAAATGAGTCCTGATTACATTTGGATTGTTTTAAAAGGAGACTATATGAATTATCTTATTTTACTTTGTAACTTAATACAATCAGTCTCGTTCTGTTGGAGCATGGTTGGAGAAGATAGTGCTTTCCATAAGTCATTTGTAACCATTCCCTTTCCATGGTTTGTGTCCATAGAGAGGTTTTTGTAACTCCTTTGACTTGTAGGCTTTTACCTTATGGATTCTTTAAGTCAATGAAATTTTCTTTTTATAAATGAAGGATTTGCGTGTTTAGTTTTTACTTCTGGCCCTAAGATTTAAATTATGGCAGAAACTTCACCTTCTGAAGCAGTAGAAAGGATTTCGTTGCTTGCAATTGCCTGGGATAAAATGGTTCAGGTAGCAAAGTTGGTGAAAACAGAGCTTAATGTATGTGGCAAGTGGTCCCTTGAGAGTGCAGCCATAGGTGTGGCATGGTTAGATGACCAGGTAATTTTTAAAAAGTTGATCGTTCTTATCAAAAAAAAGAAGAAAAAAAAAAGAAAAGAAAGAAGGATGATCATTTTCTACTGTTTCTAAATTCTTGTGTCTTTAACTACTATTGTCTAATAGTTGAGTGCAGCTGTAAAAGGTTATTGGTTTCAAGTTTTAACCAAGTATTTATAGTCCAAACTTCTAACCATTTAGGTCATGTTGACGATTGGTTTCCTGAGGAATCAATGTTGGGCATGGGGCTTATTTTAAGTTTCAAGCATAGTACATTAAGCTAGAAAAGTTAGGGATTTATAGATACAACAAAAACCATCAGATTACGTTTAGTAAGATCGGTTCTTATTTAAAGTGAAAAAGTAGAATTCCAGTTTTGACCGGCGGCTTCTCCCATATTTTCTGCATGTAGTAAGGATTTACCTGTTTGCATCAAGATTTAATAAAAAGTGTTGAAAACTCTACAGGTTCTAGTCATTCTCACAGTAACAGGACAACTCTTCTTATTTGAGAAGGATGGAACCATGATTCACCAGACAAGTGTATTTGTAGATGGGTTTGATAAGGAGGATTTCATTGCACATCACACCCACTTTGTTAATGTTTTGGGCAATCCTGAGAAAGCGTATCACAATTGTGTAGCTGTTAGAGGAGCTTCAGTATATGTGTTGGGACCAAAGCATCTCGTTATTTCCCGTCTCCTTCCATGGAAGGAGCGGGTTCAGGTTCTAAGGAAAGCAGGGGACTGGATGACTGCCCTTAGCATGGCAATAACAATTTATGATGGCCATGCTCATGGTGTTATTGATCTCCCTAGGTCGTTGGAGTCCTTGCAAGAGTTGGTAATGCC
CTTTCTGATCGAGTTGCTGTTATCATACGTGGATGAAGTGTTTTCATATATTTCAGTGGCTTTTTGTAACCAAATTGAAAAGAATGAAAAATTGGATGATGTAACGAGTGGAAGCCATTCTGCACATTCTGAAATAAAAGAGCAATATAATCGTGTTGGTGGAGTTGCCGTTGAGTTTTGTGTCCATATCACGAGGACTGATATTCTCTTTGATGAAATTTTCTCCAAATTTGTGGCTGTTCAACAGAGAGGTATTGAACACATAACTTACCTTTATAGTAATTGTTTCTTTCGAAGGTTGTTGGGAGTTCAAATTAATATGCATTCAGTCTATTAATTGCTTCTTTGCGTTAGTTGCTGGAGGATGAAATTCAAATTGTTTCACATGTCTATGACTGGACCAATTATTAGAAGAGAAAGGAGTGTTTGGGCTAGTTGCGAAGTCTCTTAATGTACTGTGGGGGTTGGCTATGACTGGACCAATTATTAGAAGAGAAAGGAGTGTTTGGGCTAGTTGAGAAGTCTCTTAATGTACTGTGGGGATTGGCTACATGGGGAAACTTCATTATGTGATCACTTGAAAACTCAAATAGGGAATAACCACCAGAACTTTGCTGAAGTTGAATAGTGTAATTGAGGAATTGGTTGTCTTGGAAATCCCCCTAATGAATGGCAGGTTTACTAGGCCAAATGTTTGAATTTAGGTGAATTATTTATTTTATTTTATTTCATTTAACTNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACTAATGGCAAGATACAAACTCCACCACGGAGTGAAAAGAAAGAGAAACTAAAAGTTAAAAATTTGAAATGAATACATCCCAATGAAAATAAATATTATGTGAAGAAAAATCAGTAAAGGTCGTGGAAAGAGAATACCAAGAAGAAGCTTTCAACTCAGCTGATTCAAAACTGTCGAACCAAGGAACTTGCTTGTTGTACAAAATTCTCGAGTTTCTGTCCATCCATAATTCAGAAAGAAGAACCTTTATTGCTGTCATCCATAAAATATGACCTTTCGAGTGATAGGAATCCTGAAAATAAGCTGCTGCGAATTATCCCTCAAATTTGGGGAAAACACCCATGAAACCTTACAGATGAAAAGAAACTTGAACCAGCAATTCAAAGAAATAAGCAGCCAAAAAATAAATGGTTAAGAAAATCCCATCTTTGCAATACGAGAACAAACTGAAGGCATAAGAACAAAGAATGGCAATATCCTGTGCAAAACTTCGGAGGAGTTGAGATGAAAATACACCTGAAGGCTTTAATGACCATAGACGAGTACCATTCAAAGAACCAATCTTGAAGGATAAAAGAATACCCTATGAAGAGGATAATCCAGTGATCTCATCTTCTTTTGAAAGCCTCCTAAAGAGAATATTCCATGATAAAACAAGAGAATTCAATAAATCGGCCATGGATCCTTTAGGAGAAGGTGAGAGAATGAATAGCTGTGGAAAATGAACTCAGTGGAGAATCACAAGCTCAAGCATTACTCCTGAAAGATAAAAAAAAATCTGACTTTCAAATCCCAATATAAAGGTAGCAACAGAATCCACCTTCTGTCATAATTTAGAAATGTTAATCCAAGGACTTTTAAAGCTCATTCCGTACTGGGGGTTTGCCTGTCAAAAGAAGAATGCCCATGAATGCTTCTAATCATCAACTGGTAAAGAGAACGGGGTTCAGCCATGTACCTCCAACCCCATTTTGCAAGCAAAGTGAAATTTCCATGCTTCATTGCCCCAGTACCGAGTCCTCCATCAGCCTGATACTTCCAAGCTACCAAACCAATTGATTGGATTTCGTCCATCACTACCTTCCCAGAAGAAATTACGCAATAATTGGTCTAAAGAGAGACTGGTATGAGCCAGCATTAAAAAAAAGTGACATGTAATAAAGAGGAACGATTGAAGGTACTGAATTACAAAGAGTTAAC
CTTTCCTCACGTGATAGATTAAAATGCTTCCAACAATCAATTGTTTTTCACAATTTTGTCAATCAGGGGCTGCCAAAATGTTGAAGATTTTGGATATTCTCCAAGAGGAATACCAAGATAATTGAAAGGGAGATGGTCCGCCTTACCATTCAATCTTCAGGCCACTTGAGAAAGTTTAGAATCCTCCACACTGATTCCACAAATCGCTCATGGAGGGGATAAAAGTTTTATAGTTACCCTCCATTTACCAGCCCTTAAAAATGAAAATACTCCATTCCAGACTTTACCTTGATACAAAAGTAGCTTGGATGAAAGGGGATGCCTCTGAAAAGGTCATTTTGATTCAAGAGTAGCTTGGATGTATCCATGGTTGATTTATTCCTCATCACTATAATGACTTACAGTTAAAGAAGTATGATTTTGTTTGTTGATATTTTCATTATTTTAATGAAAAGTTTTGTTTCTTTCTCTTTAGTTGTTTATCAAGCTCTTCAATATTGTTGTATATCAAGCTATATCTCGGTGTCAACTGTCTAACTTTTCCTTGAGTGGTTGTTTATTGGTATCTGGGTGTACTTGATAAAATAAATGGATTAGTGATGACTAATTCTTAACCATGTTTTGTGGATTACTTTTTTTATTGCTTCTTGAAGTGATTTGCTTAGATATATATATATATATTGATGAGTTCGTAGAAATATGAAATTACAAAAAGAGGGACACTAAAACACTTCTATCGACATCACATCTTTTTTTGTTATAGATACATTTTTGGAGCTTCTAGAACCATATATTTTAAAGGACATGCTTGGATCATTGCCTCCGGAGGTATAGACTAACGATTTTTTTTTGGTATTTTTCCATTCAGAGAACAAAGCTTACTTGANTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGCGTTCTTTCAGATTATGCAAGCTTTGGTAGAGCATTATAGCCAAAAAGGATGGTTGCAGCGTGTAGAACAGTGTGTTCTTCACATGGATATTTCTTCCTTAGATTTTAATCAGGTTTTTACCACTTAATCTTTTCAAACGTAATTTTGTATCCATCCAACTTTCTTTTTTTATGTATTTTATCTAATTTTTACCACATACGCATCATCAAGTCACCAATTTGTGTTGTTACAAGAATGTGATATAATTAATGGTGTTATTACCCACACGAATGATAACCAAATACCAAAAAAGCACCCCACCACACACACGAACACACACACACACACACACACACGAAATGGAAAGAAAGAAAAGAAAAAACTAGAAGCACTTTCCCAAGAAATGTTTTACTTGTCAGATTATCTTCATTTAACTAAGTGAGAAGCCCTACCATCTGTCAACTTTTTTTCAGTCGTCCAAGAGAAAAGTAACAGACACACAAAAGCCCGTGGCTAATGCATACAACAATGAAAATCTAGGGTACCACCTAAATTAGATCTGAGTAGGTCATATTTCAGTCGCTAATGCCGATAATACCATTTACAAATTATCATAGGAAAGGTGGACTTGCCTCCTCTTCGTGCCCTTATTATTTTCTTAATAAGAAACAATTTCATTATAAAATAATAAAAGAGAGGAGGAAATATTCCTCAAGCCAAAGGAAGAAAATGGGTTAACAGGAACAAGGTTTATAGTTTTCCTTCCTAGATCATGGATACATGGTCATGGTTATAGGAAGGACAGGGGTTGGCAAAAGATTTTCCTGTCTAACTCTGCCAAACAATTTGAACTTACCTCTTGTCAATATTGAATAGGTTGGACTCCTCCAGTTGGGCTGTTCTTTGGAATATCCTTGTATATCTTTCATGTTTTTCAATGAAAGTTTAGTTTTTCAATTTAATATTGGAAAGGTGGAAAAGTAGAGTTGCTAGAGCTATGGGATCTCCTTGCTAAAATCCACTGGTTCTTTGAATTTAATTTTTTAGAACTATAGTTATACCTTTTACCTCTTTTCTTCTATAGGTT
GTTAGGTTATGCCGGGATCATGGATTGTATAGTGCATTGGTGTATCTTTTCAACAAAGGATTAGATGATTTCAGGACGCCTTTAGAGGAGCTGTTGGCAGTGTTACAAACCAGTAAAAGCAAACGTGCTTCTGCCATTGGGTATGATATATCAGTTACATCTGTTTATTATTATCATTATTTATTTATTTATTTATTTTGAGGAATAACCAAGCTTTTATTGAGGGAAAAAAATTAAAGAATACAAGAACATACAAAAAAGAGAGTCCACAAGGATGGAGCCACCTAAAAGAAGGACCTCTAATCAAACAAGATACAAATTAGTGAATAATTACAAAAAAAAGCTTCTTAACTGAAGCCCACAGGGAAACATAGAACCCAGCGAAGGACCCAATTACATCCTCTTAATTAATTATTATGTATGTGTTTATAACTTCTTTTATTAATGCTAAGTTCATGCTAAGTTCCTAACAATTTTTTCCATGATGAAAAAGAATTCCTGCAGTTACAAATCCTTGTCAAGTTCCTACTGAAATAATATGTTCTCCCTTCTTTCCTCCTCTCTCCTTCCCTCATGATTGGTTGATTGTTGAGAGCTTCAATGGTGGCTATAAAATTTCTCCATCTCTTTTTATCTTCACCTTAGGGGAGGGTTAGTTTATTTGAACCCCCATTGGCTCCAAGCTTTGTGATTTCTGCACAGTGGCCCTTTAGGTTCGATATCTTCTCAACTCGTACCACATATTCAGGGAGGTAAGATTCTCTATGGATTTTATGGCTGAGGGGAGTTTCACATAGCTTGGAGAAGCATTCTTGGAGCCATTGGAGAGACTCTTAAAGAAGGTGACGGAGAAACTCGTCTTTGGTGGCTTCAATGATACGAAAATTAGGTCTAATATATGCTAGTCCTTGTCTATGGTGAAGGTCTTTCTCTCGATAGTAAAAAAAGCTTGGGTGGTGGATGGGGTTCCATGGGTATGTAGGTTCTCTTGCAGGTTCAAGGCAGAGGACAAACCAGAAGGTGGGTGGTGGGGACCTGGTGGTAGAGTGAGGGGGCAGAGAGCCATTGTACATTTTAATTCAGAATTATGATTCAAGTGAAATGAATAAAATTTTGAGGGTAAGAGTAAGTATTGGTTGGAGCATTTTGATATTACTAAGCTCTAGGCTTCCTACTGTTGCTCTCTTTTTAATTTATTTGAGCCTTATTCTCCTGATGTATTAGTTCGGGAGCTTTTACTCACTTTTCTGTTCGTTTCTTGTTCAAGAAAAAAATTTCATTTATGAAGGAGCTTGGGTGTTATACTTGGTGTATGTTTAATCGATGAAGATTTTGTTCAAATCTTTGTTTTGTTCCTTTTCTTCAAAAGATTAATTTATGCATATTGATTGAAGTACAATTTTTTTCCTTTACAGGTACAAGACACTAGTTTATCTGAAATATTGCTTTTCAGGACTTGCTTTTCCACCAGGTATTACTCAAGTTGTTGTTGGTTTAGCATTTCATGGACCCCAACACATCCCAAAGATTACCAATATTTTATGCAGTTTGTTTTCATATAATTAATGATTTGAAATCTGCTTTTGCTACATCCTTTCTTTTATTTGACCTGTACAAGTATCTTTGAACTGATGAGAAGCGTTAAGCTTTATCTTTTTTCAGGCCAAGGAACTCTTGCTCATTCACGCGTGCAATCTCTTAGAGATGAACTACTACAGTTTTTGTTGGAGAATTCTGATACCGTGGATACAAGATCAATTTCAAATAAATCATCTGAAGTTGGATATTTAAATCTGTATCATCTCTTAGAGTTAGATACTGGTGCTACTTTAGATGTTTTGAGATGTGCTTTTGTTGAAGGTGAAATCGTCAAAGCTGATTCTTCTCTAGATGGTTCAGTGGATGCAAGTATGCAGGTACAAAAAGAAAAGAACTCAACTTCTGGAAGAAAGAACTTTCTAGTTCAAAATGTAGTCGATGCTCTTGTTCATATTCTTGAT
AAGGCTATCAGTCAAACATACGGGTCCCCAGGTGGCGATAATATCACATTGGTTGAAGATTGGCCTTCGAAGAAGGACTTATTTCATTTGTTCGATTTTGTTGCCAATTACGTTGCGTGTGGAAAGGCTACTGCCTCTAAGGACGTAGTTGGTCAGATTTTGGAACACTTGATATCCAATAGTGATATTCCAGAAATGGAAATTGATTTTGTGCATAGCGTTACTGCAAACAGTGTACATTCAAGAAAAAGGGAAAAGCAAGTACTTTCTCTCTTGGAAGTGATACCAGAGACCCATTGGAATCCGTCTTCTGTGTTAAGAATGTGTGAGAAAGCACAATTTTTCCAGGTACTTTTGCCATATTCATTTCTTTTACAGTCATAGGAAATAGGTTCCATTGAAATGATGAGATATGCAAAAAGGGACCGAAGAACTTTACTTTTAATCTAGATTGGAGTCTTTCAATGTGATATGATTCCCTATCAATTCACTACGTCCACAAGAAAATTCAACACAAAACTAGAGACCAATTCGGGAGGGTCTACTATCTCTCCATAAACATCCATAAACCACACAAAATACCAGGTAATCCACAGAAAAAACATATAAACAAAAACAATATAACTGACTAAAGGAGTTATAACTCCCTAATTTAAAATACCAAACAAAGAAAACAAACAGAAAAATGATAGTTTTAAACATTAGAAACAGTCACAATTCCCTTTTCGCCCTTTCTTGAGTAGACCTTTTGGATTGGAGGTCTAACACTATGTTTTTAGTTTCAATATCCATTTACATTACTTGTTCTTTTTTATGTTTTCTGTTTGTGAAATATTATTGTTGTGTTTGGTCATACTCTGACATCTTTCATGTTCTTATAGGTTTGTGGCCTAATTCATAGCATCAGTCATCAGTATTCATCAGCTTTGGATGGCTACATGAAAGATGTAGAGGAACCCATTCATGCTTTTGCCTTTATCAACAGAACATTAATGGAGCTCAGCAATTCTGAACAAACTGAATTTCGGGGAGTGGTCATTTCTCGAATTCCAGAGCTTCTTAATTTAAACAGGTTACAGAGTTATTTTTTTATGAAGCTACCTTGTATCCTCATTTTACTTAGTAGGTTTATCTGGAAAGCGTTGATATGAAAAGTAAATTAGTCTTTTGTTTCCAAAAGTGTACTACTTCAGTCAGTCATATCTTTCAATTCTCTCTCTCAAACATATCTGATTCATTGCAGAGAGGGAACATTCTTCTTGGTCATTGATCATTTCGGCAATGATGTTTCAGACATCCTCTCCCGGCTTCATAATCATCCAAGAAGCCTATTTCTGTACTTAAAAACTCTCATTGAGGTTCACCAATCTGGAAACCTGAACTTCTCTTGCTTAAAGAAAGATGATAATTTTGGGGTCAACTATTCAACTAAAGGATTGGATGATTACCTGCAAAAACTCTCTGATTTCCCTAAATTTCTGTCTAACAATCCTGTTGATGTGACTGATGATATAATTGAGCTTTATGTGGAGGTAAGCATTTTACGTATTTAACCTAGTTCTTTCCATTCAATTGGTGGTTGTTGAGGGATGCTGTCTTATGTGATACATTAGGTATTATCTATTGTTATTTCATCAAACAATATTTTTCATTGTGTTGTGAAGTACAACAATATAGGAGGAAAAACCAAGGGAGCTTCCAAGAAAATCTCTGCAACTAGTATCAACCAAAAGGAGCTAGAATTACAAAAGGAAGTAGAGAGCACACCAATGAGAGAAAAAATTGAATTTGACTTGCATCTAATTTAGAATTTCTCTTCCACTTAATTTCCCAAAGAGTTGCAACCACATCAATTTTCGTATGAAAATTTTCAGCGTGCTCATCATAAATGGATAAACAATGTGTGAGATAGGAAAAAAAATCTCTGAATGTGGATGCCTAATCTAGATGTTGTCTGCTATATTTAACTTCTCTTTCTAAACCCTACAAGCTGATCTTGA
AAGATTTGATTTGGGGAACTTTAGAAGTTAAAAGAGATAAGAGAGACAACTTTCAATGAAAAGTTCCTTGCCGGATCAGGTTGCCTATCTGATCCACCAGAGCAAGTCCTCTTGCTTGGTGGATTGGAATTGCCAAAAGTGGCCAAGAGGTTAATCCGTTATTTAGTTTTACCTTAATCCTGTCTACATCTGAGGTTTCACCCTATCACATATTCATCCCTACAATCTTTGATTGAGGAAGAGCTAAACTTACCATTGTAAACAGGTCTGAAGAACAAGTCCGCTAAAAGCTGCTGGCTTGTTATCCTGTCAAATTACCTTACAGAGTCTTGTTACCTTTACCCACTTTTAACTTAATGGCCTCCTTGATTAAGCTGCAGCTTTTGCCATAAAACTACATACTAGTTTTTTTTTCCCCCTACGCTTCAACAGCATAAAGCTTGGGAAAGGACTTGTTATTATATTGATTTTACTTTGGATTATATTGAATTTCGTGAAGAACTTAGAAATCTTGATCCTTGATCATCGTTGCAATTTGTAGCTACTTTGTCGGTATGAACGTGAATCAGTTCTCAAGTTTTTGGAGACTTTTGATAGCTATCATGTGGAGCATTGTTTGCGCCTCTGCCAACAGTATGAGGTTATTGATGCTGCAGCATTCTTGTTGGAGAGGGTTGGTGATGTTGGTAGTGCTCTTTTCTTAACACTTTCTAGCCTAGACAAAAAATTCCATGACCTAGAGGCTGCTGTGGGAGGCATTGTTACAAATGGTGCTTCAAGCGGTTCTAATGATTCACAACTTTTCAGCTCTGTTTTAAAATTGCAAGAGGTTTGTAGGTTGGACTTCTGAAATGTGCAAATAAAATTTCCTCGTGTTTTCATGTAGAAGCTTTGCTGCTGGCAACTCATGTTTTCTCTAATCATTCTTGTGCAGGTGAATGATATATATGTTTTGTTGCATGCTTGTATTGGACTGTGCCAGCGAAATACTCCTCGTTTGAACTCTGAGGAGTCTGAGACACTTTGGTTCAAATTACTCGACTCGTGAGTTCTTTGCCTTCTGGCTTGCTTTTCTTTTGTATAATAAACATGTTTTAATAAAAGATAAGGAACATTTTTCTAATGAAGGATAAGAAACGTTTCATTGATAAGATGAAATTCTAAAATGATGGAGAAAAACTCAATTCAAAGGAACTTGTGAGAAATATTTATGTTGGACTATAGGAGTAGACAGGCTATAGTGTCAGAAGTAGAAAACTTACCCCGCATAAGAGCTAAAACAGATCATGGATCAAAGAAGCTGTCAGAGCCTATCCTTGAAGACATGAGAGCTATTCCAAGTAGGACATAAGATAGCTCTAATAAAGTTAGAGCCCTAGAGTTTTCTTCTATCGAAAGGATGACGAGCAACCAAATACTGTCAACTGTGGCTAGCATGACATGCAAATCCTCTGGTAGATTAGCCAGCAAAAGCTGGGAAAATGAGCCAAAAAAAGTCACAGCATAGGTTGAAGTGATGAATAAGGGCATTTAGATACACTCTTGCAAATAGGGAATAAGTTTGGGTGGATGCACATTATTTATGTGGAAAAGGATGTGGAAAAGAGTAGAAGGAAGAGAGTAAGCATAGTGTGCAGTAAGGTAGAAGTACCAGGACGTTGTATATAGAGCCTTGAGTTGGAGTAGGTGTTAGATGGGCTGTTGGGGGAATTAAGTTTATTTTTTGCATGTGTTGTTCTGGTATAGGAGAGGTGGGAGCTTTCGAAATCTTCTTTAACTTGTATTTCTCTTTTATATTTAATAGAGAAGTTCTATCAATCTGCTTTGGAGTTTGTCATGGGTGTTAATGAACCGGTGACGAAGCTCCTAAATGAAGGAATTTACTTTCTTCGGGTCCAAAGGAATTTACTTTATATTTAATAAAGAAGTTCTATCAATCTGCTTTGGAGTTTGTCATGGGTGTTAATGAACCAGTGACGAAGCTCCTAAATGAAGGAATT
TACTTTCTTCGGGTGTTCTCCTTTCCAAATGTATAAAATAGCACAATGTTTTGATCCACAGGAAGTGAGGTCCAAAGACAGATTTTGGTGAGAAAGCACTATTTTTTCAAGTTTCACGTGTAAATATCGTCATGCAGTGAGATTTGAGAGATTTGGAAGGAGGGTAGCTAGCATGCTGATAAATCCTAAATACTTCTCTTATTGTTAACGAACTTATTGGTGAATGAAAAAGGAAAAAGAAGAAAGGGGCGTTTAACATCTTTTCGACTTTTTGTCCTATGCTTTTGGGTACATATGTATGTATGTACCTATGCGCATATGCATAAACATATCTCATGTATTTCGACCTTAAATCTTAATTGGAAAAAATATAGGTCTCAGTGTCTTACTTCGATAAATTTATCCTGCAATTGGTAGTTCCTGAGCATAACTTTAGTGTTATAGTATGCGTTGTACCTTGTATTCATAATTTAAATTTGGTTCTCAGTTTGTGATGGACTTTTTTTTTCTTTGGATTGGCAGGTTTTGTGAGCCTTTAATTGAATCATTTAATTATAGAACTGCTTCTTTTGGAGAAAATCAAGTTCAATTCTTGAACGAGTCATCAGGTTCACAAAAGGATAAAGAAGCACATATAGTTACATGGAGAATTTTGAAGTCCAATCAAACTGCTCATATATTAAGGAGTTTGTTCTCTCGATTTATCAGAGAGATAGTTGAGGGGATGATAGGATATGTTCATCTTCCAACGATCATGTCCAGACTTCTCTCTGACAACGGAAGTCAAGAATTTGGCGATTTTAAACTTACCATACTTGGGATGCTTGGGACTTTTGGCTTTGAAAGAAGAATTCTGGTATGTTTTCAGTCCCTCAGTATAACATACTAAGAAATGTCTGGGGTGCTTGATATCTGGCTTATTCTTTTTCGACAAATAAATATCATTGAATTTATGATTAAATTTCCATTATTAAGTTCTTTTTTAACCAAGTTTTAAGTATTTAAGTTCGAATACTTAAAATGGTATATGTTATATTATAAGGAACTTGATATGAAAAATGTATCCCTTCAAACTCGTTGTAGCCAAGTAGTTATTATAGGCTTTAAGGCTTGGCTACTCGTATGGATGTTATGGAACACAAGAACGTACAAGATTAATTCAAGGTTAATGCCTAGGAGACATGCACAAGTCTCTGAGGCCTGAGTTTTAGTGTTTGATCCTCTAGAATATGTGATCCCCCAGGCAGACCTCCAAGATGTGTCTGATACGCCTGTAAAATGTAAATTGAAGTCTTTTCAAACTTGTTTAGGATACTGCCAAAGCTTTAATTGAGGATGACACGTTCTATACCATGAGTTTATTGAAGAAAGGGGCATCTCACGGATATGCTCCCCGGGGTGCTGTCTGTTGCATATGTAATCGCCTTCTTGTCAAGAGCTCATCAAGTTACAGAGTTCGAGTTTTTAACTGTGGTCATGCAACTCATCTTCAGTGCGAAGTTCTTGATAATGAGGCTTCAGGTGGAGACTTCTCATGCCCAGTTTGTGTGCATAGCAATCACTCTCAACGGTCTAGAGGCAAAGCACTGACTGAGTACAGTCTAGTGAATAAATTTTCATCAAGAACTCAATCGTCGTCAGGAGCTTCTGTTTCATATCCACAAGAAACAGATTTATTGGAGCTTCCATACACTCTTCAGCAAATACCACGGGTATTGTTTTTGTCTTACTGGTGCGACGACCAATAGGTTTTGTTGCAACTGAACTAATACCTGAATTTGTGTATTGTTGCAGTTTGAGATTCTGGCTAACTTACAGAAAAATCAGAGAGTAATAGACATAGAAAATATGCCTCAATTGAGGCTTGCACCACCAGCCGTCTACCATGACAAGGTCACAAAAGGATATCATCTTTTAGTAGGAGATAGCAGCGGTGGAGTAGAAAAAGTAGAAAAACTAAACAAGAGCAGGCAACTGAAGGAGGTAAAAGTAAAGAGACC
GTCCTCCCTTCGATTCCCTTTAAAAGCAAATCTATTTGGTGAGTTATGTTTCTGATCCTCCTAATATGTTTTGTCTTTCTGATCCGTCAATTTCTGACATGAAACGAAATACATATATAATTCATGTAGGGAAAGAGAAGACGACCAAATCTTGACAGAGTATGAAGCGACTCCTTAATTTTAAACCCACGTGATAAAAGGTGCTTTTGGTTAGTCTCAGTTATGTACTGTCAAACTTCTCGAGGGGAGGATGACATTTTGTATCTGTAAATACAATTACGGAGGTTGTTAGTTTGTAGCTGATTTGTTAGGGTAAATTATATGAACTATTGATTGAATTTTTCCTGCAATCCCATACAGAGATTTCTCCACGGTAGAACGCAGTTGTTAATTACCCAACCAAGTACTTTGTTGTTCATAAAAATATGAAGCTGTTAACG

mRNA sequence

CACTAACTATTTGGTGCCTCCCTAGTTCTCCAAGCCGAAGGCTTCAAGATCGAGAATGACCAAGGAGCTGACTGACACCGAAACACTGCCTCCAATGGAGCTGGACTTGAATGCCTTCATCCACGCACACCTCTCCAGCGGCGACGACGACCACGACCACGACGAAGACGACCTATCATTCCCTCACCGTAGCATCGACGAGATTCTAAACGAATCCAGTTCTTCCACCTCATCTTCACCATCGTCTCCTCCCAATTCACCGCCGCCTCGTGCCCGTCGCTCCATCGCCGCAAGGGACGGCAGGGCCTCTGCTTCTCGTTCTATACCACCATTCAAGTCACCGTTTGAGGAAATAATAAAGGCTTCTAAAGTTCCCAGAAGCAACCAACGGAATGAGAAGTCAGTTCAATTGAAACCAGGTTCGGTCTCTCATACTAAGGTTGGTGAATTAACGGACGATCCATTTCGAAGGGGATCTCGTGCATTGCCATCGTTGTTTGGAGGGGTTAGATCGAATGCCAAACCTGGGGCGGCGCTTGCCGCAGCTGCTGCGGCTTCTCGGTCCATGCCGGCTCCGCACGCTGCAGCAATCAAGTCGAGGAGGTCAGGACATGGGAGCGTGGTTCTTGACGATGACGAATTGGCTTCCTCTTCTGCTGTTGATTCAGAGTTTGTCTTTGATGATTTATATTCTACTATCGATCATTCAAAGGAGTCTCGCGAAAAGTCAATTTCATTGGTCGAAAGGAATGCCGATTATCAGGGTGCATCTGTAAACGTCGGTGTTGAATTTTGGGCAAGAGACAACATTCGAGACTGTGTCCTATATAACGATGAGTTTCGTATAACTAAAGATACGGAATGCGAAGCAGAACAGAGTTTTGTGGATGATGTGAATTTCGACGAGAGCTCTACAACTCTGCCGCCAGTGGAAGCTAACGGTAGAAGCTTGTCTGATTCTGCTGACAACAATGTTTGTTCAATGGATGCAGAACCAACAGTATTGGATGGCGATGAATCAAATGAAGGGGCCTTTCCGTGTTCTCCCAAACCTGATTACGACAGAAGTGCCGTGGGCTATGGGAGGCTGGAGTTGGAAACTCAAGATTTCGAGAAACGTTCTCAACCATCAAAAGATTCGGAGGTTCTAGCCATTGAGGATCTCAGTGTAGTGAACGATATCAGTGAATCGAGGGAAACAGCCAAGCAGCTGGATAACTTTCACACCGGTGAACGTGCAGAAACGATGTCCCTGTCCTCGTCTAATCCACTCGAATTGGCCGAAGAAATTGAAAAGAAGCAAGCTTTCACTGCACTGCATTGGGAAGAAGGCGTAGCTGCTCAACCAATGAGGCTTGAAGGTATTAAGGGAGGCACAACAGCATTGGGGTACTTCGACATTCAAGCTGACAATAGTATTTCAAGAACTATTTCATCACATTCGTTCAAGCGTGAACATGGTTTTCCCCAAGCCTTGGCTGTTCATGCAAATTATATTGCAGTTGGAATGTCAAAAGGAAATATTGTTGTGGTGGCCAGTAAATACTCGGCTCAAAATGGCGACAACATGGATGCGAAGATGCTGCTGCTTGGATCACAAGGTGATAAATCAACTGCGCCAGTAACATCTCTATGCTTTAATCAGCAAGGGGACCTTCTTCTGGCTGGTTACAGTGATGGTCAGGTTACAGTCTGGGATGTGTTGAGGGCGACGGCAGCCAAGGTTATTTCCGGAGAACACACATCACCAGTTGTTCATTCATTGTTCCTTGGGCAGGAGGCTCAGGTTACTCGACAATTTAAAGCAGTTACTGGCGATAGTAAGGGTCTGGTTTTGTTACATACGTTCTCAGTGGTTCCTTTGCTAAATAGATTCTCCATTAAAACTCAGTGTCTTCTTGATGGGCAAAAAACGGGGACTGTTCTATCAGCTTCAGCACTTCTTTTGAATGAATTCTGTGGAAGTTCTTTGCCACCATCTCTTTCAAATGTCGCAGTTT
CAACCAGCAGTATTGGGAGCATGATGGGAGGGGTTGTTGGAGGAGATTCAGGCTGGAAACTCTTCAATGAAGGATCATCTTTGGTTGAAGGAGTTGTCATATTTGCTACCCATCAAACTGCTCTGGTGGTAAGGCTGAGTCCCACTGTGGAAGTGTATGCTCGGCTCTCTAAGCCAGATGGAATTCAGGAAGGTTCTATGCCTTACACTGCATGGAAATGCTCACAGTCTATTGAAACTTCACCTTCTGAAGCAGTAGAAAGGATTTCGTTGCTTGCAATTGCCTGGGATAAAATGGTTCAGGTAGCAAAGTTGGTGAAAACAGAGCTTAATGTATGTGGCAAGTGGTCCCTTGAGAGTGCAGCCATAGGTGTGGCATGGTTAGATGACCAGGTTCTAGTCATTCTCACAGTAACAGGACAACTCTTCTTATTTGAGAAGGATGGAACCATGATTCACCAGACAAGTGTATTTGTAGATGGGTTTGATAAGGAGGATTTCATTGCACATCACACCCACTTTGTTAATGTTTTGGGCAATCCTGAGAAAGCGTATCACAATTGTGTAGCTGTTAGAGGAGCTTCAGTATATGTGTTGGGACCAAAGCATCTCGTTATTTCCCGTCTCCTTCCATGGAAGGAGCGGGTTCAGGTTCTAAGGAAAGCAGGGGACTGGATGACTGCCCTTAGCATGGCAATAACAATTTATGATGGCCATGCTCATGGTGTTATTGATCTCCCTAGGTCGTTGGAGTCCTTGCAAGAGTTGGTAATGCCCTTTCTGATCGAGTTGCTGTTATCATACGTGGATGAAGTGTTTTCATATATTTCAGTGGCTTTTTGTAACCAAATTGAAAAGAATGAAAAATTGGATGATGTAACGAGTGGAAGCCATTCTGCACATTCTGAAATAAAAGAGCAATATAATCGTGTTGGTGGAGTTGCCGTTGAGTTTTGTGTCCATATCACGAGGACTGATATTCTCTTTGATGAAATTTTCTCCAAATTTGTGGCTGTTCAACAGAGAGATACATTTTTGGAGCTTCTAGAACCATATATTTTAAAGGACATGCTTGGATCATTGCCTCCGGAGATTATGCAAGCTTTGGTAGAGCATTATAGCCAAAAAGGATGGTTGCAGCGTGTAGAACAGTGTGTTCTTCACATGGATATTTCTTCCTTAGATTTTAATCAGGTTGTTAGGTTATGCCGGGATCATGGATTGTATAGTGCATTGGTGTATCTTTTCAACAAAGGATTAGATGATTTCAGGACGCCTTTAGAGGAGCTGTTGGCAGTGTTACAAACCAGTCTTTCTCTCGATAGTAAAAAAAGCTTGGGTGGTGGATGGGGTTCCATGGGTATGTACAAGACACTAGTTTATCTGAAATATTGCTTTTCAGGACTTGCTTTTCCACCAGGCCAAGGAACTCTTGCTCATTCACGCGTGCAATCTCTTAGAGATGAACTACTACAGTTTTTGTTGGAGAATTCTGATACCGTGGATACAAGATCAATTTCAAATAAATCATCTGAAGTTGGATATTTAAATCTGTATCATCTCTTAGAGTTAGATACTGGTGCTACTTTAGATGTTTTGAGATGTGCTTTTGTTGAAGGTGAAATCGTCAAAGCTGATTCTTCTCTAGATGGTTCAGTGGATGCAAGTATGCAGGTACAAAAAGAAAAGAACTCAACTTCTGGAAGAAAGAACTTTCTAGTTCAAAATGTAGTCGATGCTCTTGTTCATATTCTTGATAAGGCTATCAGTCAAACATACGGGTCCCCAGGTGGCGATAATATCACATTGGTTGAAGATTGGCCTTCGAAGAAGGACTTATTTCATTTGTTCGATTTTGTTGCCAATTACGTTGCGTGTGGAAAGGCTACTGCCTCTAAGGACGTAGTTGGTCAGATTTTGGAACACTTGATATCCAATAGTGATATTCCAGAAATGGAAATTGATTTTGTGCATAGCGTTACTGCAAACAGTGTACATTCA
AGAAAAAGGGAAAAGCAAGTACTTTCTCTCTTGGAAGTGATACCAGAGACCCATTGGAATCCGTCTTCTGTGTTAAGAATGTGTGAGAAAGCACAATTTTTCCAGGTTTGTGGCCTAATTCATAGCATCAGTCATCAGTATTCATCAGCTTTGGATGGCTACATGAAAGATGTAGAGGAACCCATTCATGCTTTTGCCTTTATCAACAGAACATTAATGGAGCTCAGCAATTCTGAACAAACTGAATTTCGGGGAGTGGTCATTTCTCGAATTCCAGAGCTTCTTAATTTAAACAGAGAGGGAACATTCTTCTTGGTCATTGATCATTTCGGCAATGATGTTTCAGACATCCTCTCCCGGCTTCATAATCATCCAAGAAGCCTATTTCTGTACTTAAAAACTCTCATTGAGGTTCACCAATCTGGAAACCTGAACTTCTCTTGCTTAAAGAAAGATGATAATTTTGGGGTCAACTATTCAACTAAAGGATTGGATGATTACCTGCAAAAACTCTCTGATTTCCCTAAATTTCTGTCTAACAATCCTGTTGATGTGACTGATGATATAATTGAGCTTTATGTGGAGCTACTTTGTCGGTATGAACGTGAATCAGTTCTCAAGTTTTTGGAGACTTTTGATAGCTATCATGTGGAGCATTGTTTGCGCCTCTGCCAACAGTATGAGGTTATTGATGCTGCAGCATTCTTGTTGGAGAGGGTTGGTGATGTTGGTAGTGCTCTTTTCTTAACACTTTCTAGCCTAGACAAAAAATTCCATGACCTAGAGGCTGCTGTGGGAGGCATTGTTACAAATGGTGCTTCAAGCGGTTCTAATGATTCACAACTTTTCAGCTCTGTTTTAAAATTGCAAGAGGTGAATGATATATATGTTTTGTTGCATGCTTGTATTGGACTGTGCCAGCGAAATACTCCTCGTTTGAACTCTGAGGAGTCTGAGACACTTTGGTTCAAATTACTCGACTCGTTTTGTGAGCCTTTAATTGAATCATTTAATTATAGAACTGCTTCTTTTGGAGAAAATCAAGTTCAATTCTTGAACGAGTCATCAGGTTCACAAAAGGATAAAGAAGCACATATAGTTACATGGAGAATTTTGAAGTCCAATCAAACTGCTCATATATTAAGGAGTTTGTTCTCTCGATTTATCAGAGAGATAGTTGAGGGGATGATAGGATATGTTCATCTTCCAACGATCATGTCCAGACTTCTCTCTGACAACGGAAGTCAAGAATTTGGCGATTTTAAACTTACCATACTTGGGATGCTTGGGACTTTTGGCTTTGAAAGAAGAATTCTGGATACTGCCAAAGCTTTAATTGAGGATGACACGTTCTATACCATGAGTTTATTGAAGAAAGGGGCATCTCACGGATATGCTCCCCGGGGTGCTGTCTGTTGCATATGTAATCGCCTTCTTGTCAAGAGCTCATCAAGTTACAGAGTTCGAGTTTTTAACTGTGGTCATGCAACTCATCTTCAGTGCGAAGTTCTTGATAATGAGGCTTCAGGTGGAGACTTCTCATGCCCAGTTTGTGTGCATAGCAATCACTCTCAACGGTCTAGAGGCAAAGCACTGACTGAGTACAGTCTAGTGAATAAATTTTCATCAAGAACTCAATCGTCGTCAGGAGCTTCTGTTTCATATCCACAAGAAACAGATTTATTGGAGCTTCCATACACTCTTCAGCAAATACCACGGTTTGAGATTCTGGCTAACTTACAGAAAAATCAGAGAGTAATAGACATAGAAAATATGCCTCAATTGAGGCTTGCACCACCAGCCGTCTACCATGACAAGGTCACAAAAGGATATCATCTTTTAGTAGGAGATAGCAGCGGTGGAGTAGAAAAAGTAGAAAAACTAAACAAGAGCAGGCAACTGAAGGAGGTAAAAGTAAAGAGACCGTCCTCCCTTCGATTCCCTTTAAAAGCAAATCTATTTGGTGAGTTATGTTTCTGATCCTCCTAATATGTTTTGTC
TTTCTGATCCGTCAATTTCTGACATGAAACGAAATACATATATAATTCATGTAGGGAAAGAGAAGACGACCAAATCTTGACAGAGTATGAAGCGACTCCTTAATTTTAAACCCACGTGATAAAAGGTGCTTTTGGTTAGTCTCAGTTATGTACTGTCAAACTTCTCGAGGGGAGGATGACATTTTGTATCTGTAAATACAATTACGGAGGTTGTTAGTTTGTAGCTGATTTGTTAGGGTAAATTATATGAACTATTGATTGAATTTTTCCTGCAATCCCATACAGAGATTTCTCCACGGTAGAACGCAGTTGTTAATTACCCAACCAAGTACTTTGTTGTTCATAAAAATATGAAGCTGTTAACG

Coding sequence (CDS)

ATGACCAAGGAGCTGACTGACACCGAAACACTGCCTCCAATGGAGCTGGACTTGAATGCCTTCATCCACGCACACCTCTCCAGCGGCGACGACGACCACGACCACGACGAAGACGACCTATCATTCCCTCACCGTAGCATCGACGAGATTCTAAACGAATCCAGTTCTTCCACCTCATCTTCACCATCGTCTCCTCCCAATTCACCGCCGCCTCGTGCCCGTCGCTCCATCGCCGCAAGGGACGGCAGGGCCTCTGCTTCTCGTTCTATACCACCATTCAAGTCACCGTTTGAGGAAATAATAAAGGCTTCTAAAGTTCCCAGAAGCAACCAACGGAATGAGAAGTCAGTTCAATTGAAACCAGGTTCGGTCTCTCATACTAAGGTTGGTGAATTAACGGACGATCCATTTCGAAGGGGATCTCGTGCATTGCCATCGTTGTTTGGAGGGGTTAGATCGAATGCCAAACCTGGGGCGGCGCTTGCCGCAGCTGCTGCGGCTTCTCGGTCCATGCCGGCTCCGCACGCTGCAGCAATCAAGTCGAGGAGGTCAGGACATGGGAGCGTGGTTCTTGACGATGACGAATTGGCTTCCTCTTCTGCTGTTGATTCAGAGTTTGTCTTTGATGATTTATATTCTACTATCGATCATTCAAAGGAGTCTCGCGAAAAGTCAATTTCATTGGTCGAAAGGAATGCCGATTATCAGGGTGCATCTGTAAACGTCGGTGTTGAATTTTGGGCAAGAGACAACATTCGAGACTGTGTCCTATATAACGATGAGTTTCGTATAACTAAAGATACGGAATGCGAAGCAGAACAGAGTTTTGTGGATGATGTGAATTTCGACGAGAGCTCTACAACTCTGCCGCCAGTGGAAGCTAACGGTAGAAGCTTGTCTGATTCTGCTGACAACAATGTTTGTTCAATGGATGCAGAACCAACAGTATTGGATGGCGATGAATCAAATGAAGGGGCCTTTCCGTGTTCTCCCAAACCTGATTACGACAGAAGTGCCGTGGGCTATGGGAGGCTGGAGTTGGAAACTCAAGATTTCGAGAAACGTTCTCAACCATCAAAAGATTCGGAGGTTCTAGCCATTGAGGATCTCAGTGTAGTGAACGATATCAGTGAATCGAGGGAAACAGCCAAGCAGCTGGATAACTTTCACACCGGTGAACGTGCAGAAACGATGTCCCTGTCCTCGTCTAATCCACTCGAATTGGCCGAAGAAATTGAAAAGAAGCAAGCTTTCACTGCACTGCATTGGGAAGAAGGCGTAGCTGCTCAACCAATGAGGCTTGAAGGTATTAAGGGAGGCACAACAGCATTGGGGTACTTCGACATTCAAGCTGACAATAGTATTTCAAGAACTATTTCATCACATTCGTTCAAGCGTGAACATGGTTTTCCCCAAGCCTTGGCTGTTCATGCAAATTATATTGCAGTTGGAATGTCAAAAGGAAATATTGTTGTGGTGGCCAGTAAATACTCGGCTCAAAATGGCGACAACATGGATGCGAAGATGCTGCTGCTTGGATCACAAGGTGATAAATCAACTGCGCCAGTAACATCTCTATGCTTTAATCAGCAAGGGGACCTTCTTCTGGCTGGTTACAGTGATGGTCAGGTTACAGTCTGGGATGTGTTGAGGGCGACGGCAGCCAAGGTTATTTCCGGAGAACACACATCACCAGTTGTTCATTCATTGTTCCTTGGGCAGGAGGCTCAGGTTACTCGACAATTTAAAGCAGTTACTGGCGATAGTAAGGGTCTGGTTTTGTTACATACGTTCTCAGTGGTTCCTTTGCTAAATAGATTCTCCATTAAAACTCAGTGTCTTCTTGATGGGCAAAAAACGGGGACTGTTCTATCAGCTTCAGCACTTCTTTTGAATGAATTCTGTGGAAGTTCTTTGCCACCATCTCTTTCAAATGTCGCAGTTTCAACCAGCAGTATTGGGAGCATGATGGGAGGGGTTGTTGGAGGAGATTCAGGCTG
GAAACTCTTCAATGAAGGATCATCTTTGGTTGAAGGAGTTGTCATATTTGCTACCCATCAAACTGCTCTGGTGGTAAGGCTGAGTCCCACTGTGGAAGTGTATGCTCGGCTCTCTAAGCCAGATGGAATTCAGGAAGGTTCTATGCCTTACACTGCATGGAAATGCTCACAGTCTATTGAAACTTCACCTTCTGAAGCAGTAGAAAGGATTTCGTTGCTTGCAATTGCCTGGGATAAAATGGTTCAGGTAGCAAAGTTGGTGAAAACAGAGCTTAATGTATGTGGCAAGTGGTCCCTTGAGAGTGCAGCCATAGGTGTGGCATGGTTAGATGACCAGGTTCTAGTCATTCTCACAGTAACAGGACAACTCTTCTTATTTGAGAAGGATGGAACCATGATTCACCAGACAAGTGTATTTGTAGATGGGTTTGATAAGGAGGATTTCATTGCACATCACACCCACTTTGTTAATGTTTTGGGCAATCCTGAGAAAGCGTATCACAATTGTGTAGCTGTTAGAGGAGCTTCAGTATATGTGTTGGGACCAAAGCATCTCGTTATTTCCCGTCTCCTTCCATGGAAGGAGCGGGTTCAGGTTCTAAGGAAAGCAGGGGACTGGATGACTGCCCTTAGCATGGCAATAACAATTTATGATGGCCATGCTCATGGTGTTATTGATCTCCCTAGGTCGTTGGAGTCCTTGCAAGAGTTGGTAATGCCCTTTCTGATCGAGTTGCTGTTATCATACGTGGATGAAGTGTTTTCATATATTTCAGTGGCTTTTTGTAACCAAATTGAAAAGAATGAAAAATTGGATGATGTAACGAGTGGAAGCCATTCTGCACATTCTGAAATAAAAGAGCAATATAATCGTGTTGGTGGAGTTGCCGTTGAGTTTTGTGTCCATATCACGAGGACTGATATTCTCTTTGATGAAATTTTCTCCAAATTTGTGGCTGTTCAACAGAGAGATACATTTTTGGAGCTTCTAGAACCATATATTTTAAAGGACATGCTTGGATCATTGCCTCCGGAGATTATGCAAGCTTTGGTAGAGCATTATAGCCAAAAAGGATGGTTGCAGCGTGTAGAACAGTGTGTTCTTCACATGGATATTTCTTCCTTAGATTTTAATCAGGTTGTTAGGTTATGCCGGGATCATGGATTGTATAGTGCATTGGTGTATCTTTTCAACAAAGGATTAGATGATTTCAGGACGCCTTTAGAGGAGCTGTTGGCAGTGTTACAAACCAGTCTTTCTCTCGATAGTAAAAAAAGCTTGGGTGGTGGATGGGGTTCCATGGGTATGTACAAGACACTAGTTTATCTGAAATATTGCTTTTCAGGACTTGCTTTTCCACCAGGCCAAGGAACTCTTGCTCATTCACGCGTGCAATCTCTTAGAGATGAACTACTACAGTTTTTGTTGGAGAATTCTGATACCGTGGATACAAGATCAATTTCAAATAAATCATCTGAAGTTGGATATTTAAATCTGTATCATCTCTTAGAGTTAGATACTGGTGCTACTTTAGATGTTTTGAGATGTGCTTTTGTTGAAGGTGAAATCGTCAAAGCTGATTCTTCTCTAGATGGTTCAGTGGATGCAAGTATGCAGGTACAAAAAGAAAAGAACTCAACTTCTGGAAGAAAGAACTTTCTAGTTCAAAATGTAGTCGATGCTCTTGTTCATATTCTTGATAAGGCTATCAGTCAAACATACGGGTCCCCAGGTGGCGATAATATCACATTGGTTGAAGATTGGCCTTCGAAGAAGGACTTATTTCATTTGTTCGATTTTGTTGCCAATTACGTTGCGTGTGGAAAGGCTACTGCCTCTAAGGACGTAGTTGGTCAGATTTTGGAACACTTGATATCCAATAGTGATATTCCAGAAATGGAAATTGATTTTGTGCATAGCGTTACTGCAAACAGTGTACATTCAAGAAAAAGGGAAAAGCAAGTACTTTCTCTCTTGGAAGTGATACCAGAGACCCATT
GGAATCCGTCTTCTGTGTTAAGAATGTGTGAGAAAGCACAATTTTTCCAGGTTTGTGGCCTAATTCATAGCATCAGTCATCAGTATTCATCAGCTTTGGATGGCTACATGAAAGATGTAGAGGAACCCATTCATGCTTTTGCCTTTATCAACAGAACATTAATGGAGCTCAGCAATTCTGAACAAACTGAATTTCGGGGAGTGGTCATTTCTCGAATTCCAGAGCTTCTTAATTTAAACAGAGAGGGAACATTCTTCTTGGTCATTGATCATTTCGGCAATGATGTTTCAGACATCCTCTCCCGGCTTCATAATCATCCAAGAAGCCTATTTCTGTACTTAAAAACTCTCATTGAGGTTCACCAATCTGGAAACCTGAACTTCTCTTGCTTAAAGAAAGATGATAATTTTGGGGTCAACTATTCAACTAAAGGATTGGATGATTACCTGCAAAAACTCTCTGATTTCCCTAAATTTCTGTCTAACAATCCTGTTGATGTGACTGATGATATAATTGAGCTTTATGTGGAGCTACTTTGTCGGTATGAACGTGAATCAGTTCTCAAGTTTTTGGAGACTTTTGATAGCTATCATGTGGAGCATTGTTTGCGCCTCTGCCAACAGTATGAGGTTATTGATGCTGCAGCATTCTTGTTGGAGAGGGTTGGTGATGTTGGTAGTGCTCTTTTCTTAACACTTTCTAGCCTAGACAAAAAATTCCATGACCTAGAGGCTGCTGTGGGAGGCATTGTTACAAATGGTGCTTCAAGCGGTTCTAATGATTCACAACTTTTCAGCTCTGTTTTAAAATTGCAAGAGGTGAATGATATATATGTTTTGTTGCATGCTTGTATTGGACTGTGCCAGCGAAATACTCCTCGTTTGAACTCTGAGGAGTCTGAGACACTTTGGTTCAAATTACTCGACTCGTTTTGTGAGCCTTTAATTGAATCATTTAATTATAGAACTGCTTCTTTTGGAGAAAATCAAGTTCAATTCTTGAACGAGTCATCAGGTTCACAAAAGGATAAAGAAGCACATATAGTTACATGGAGAATTTTGAAGTCCAATCAAACTGCTCATATATTAAGGAGTTTGTTCTCTCGATTTATCAGAGAGATAGTTGAGGGGATGATAGGATATGTTCATCTTCCAACGATCATGTCCAGACTTCTCTCTGACAACGGAAGTCAAGAATTTGGCGATTTTAAACTTACCATACTTGGGATGCTTGGGACTTTTGGCTTTGAAAGAAGAATTCTGGATACTGCCAAAGCTTTAATTGAGGATGACACGTTCTATACCATGAGTTTATTGAAGAAAGGGGCATCTCACGGATATGCTCCCCGGGGTGCTGTCTGTTGCATATGTAATCGCCTTCTTGTCAAGAGCTCATCAAGTTACAGAGTTCGAGTTTTTAACTGTGGTCATGCAACTCATCTTCAGTGCGAAGTTCTTGATAATGAGGCTTCAGGTGGAGACTTCTCATGCCCAGTTTGTGTGCATAGCAATCACTCTCAACGGTCTAGAGGCAAAGCACTGACTGAGTACAGTCTAGTGAATAAATTTTCATCAAGAACTCAATCGTCGTCAGGAGCTTCTGTTTCATATCCACAAGAAACAGATTTATTGGAGCTTCCATACACTCTTCAGCAAATACCACGGTTTGAGATTCTGGCTAACTTACAGAAAAATCAGAGAGTAATAGACATAGAAAATATGCCTCAATTGAGGCTTGCACCACCAGCCGTCTACCATGACAAGGTCACAAAAGGATATCATCTTTTAGTAGGAGATAGCAGCGGTGGAGTAGAAAAAGTAGAAAAACTAAACAAGAGCAGGCAACTGAAGGAGGTAAAAGTAAAGAGACCGTCCTCCCTTCGATTCCCTTTAAAAGCAAATCTATTTGGTGAGTTATGTTTCTGA

Protein sequence

MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDLSFPHRSIDEILNESSSSTSSSPSSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQLKPGSVSHTKVGELTDDPFRRGSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPHAAAIKSRRSGHGSVVLDDDELASSSAVDSEFVFDDLYSTIDHSKESREKSISLVERNADYQGASVNVGVEFWARDNIRDCVLYNDEFRITKDTECEAEQSFVDDVNFDESSTTLPPVEANGRSLSDSADNNVCSMDAEPTVLDGDESNEGAFPCSPKPDYDRSAVGYGRLELETQDFEKRSQPSKDSEVLAIEDLSVVNDISESRETAKQLDNFHTGERAETMSLSSSNPLELAEEIEKKQAFTALHWEEGVAAQPMRLEGIKGGTTALGYFDIQADNSISRTISSHSFKREHGFPQALAVHANYIAVGMSKGNIVVVASKYSAQNGDNMDAKMLLLGSQGDKSTAPVTSLCFNQQGDLLLAGYSDGQVTVWDVLRATAAKVISGEHTSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFSVVPLLNRFSIKTQCLLDGQKTGTVLSASALLLNEFCGSSLPPSLSNVAVSTSSIGSMMGGVVGGDSGWKLFNEGSSLVEGVVIFATHQTALVVRLSPTVEVYARLSKPDGIQEGSMPYTAWKCSQSIETSPSEAVERISLLAIAWDKMVQVAKLVKTELNVCGKWSLESAAIGVAWLDDQVLVILTVTGQLFLFEKDGTMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAYHNCVAVRGASVYVLGPKHLVISRLLPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVIDLPRSLESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNRVGGVAVEFCVHITRTDILFDEIFSKFVAVQQRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSQKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEELLAVLQTSLSLDSKKSLGGGWGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDELLQFLLENSDTVDTRSISNKSSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKADSSLDGSVDASMQVQKEKNSTSGRKNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVEDWPSKKDLFHLFDFVANYVACGKATASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSVHSRKREKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDVEEPIHAFAFINRTLMELSNSEQTEFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDILSRLHNHPRSLFLYLKTLIEVHQSGNLNFSCLKKDDNFGVNYSTKGLDDYLQKLSDFPKFLSNNPVDVTDDIIELYVELLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDAAAFLLERVGDVGSALFLTLSSLDKKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQEVNDIYVLLHACIGLCQRNTPRLNSEESETLWFKLLDSFCEPLIESFNYRTASFGENQVQFLNESSGSQKDKEAHIVTWRILKSNQTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLLSDNGSQEFGDFKLTILGMLGTFGFERRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGAVCCICNRLLVKSSSSYRVRVFNCGHATHLQCEVLDNEASGGDFSCPVCVHSNHSQRSRGKALTEYSLVNKFSSRTQSSSGASVSYPQETDLLELPYTLQQIPRFEILANLQKNQRVIDIENMPQLRLAPPAVYHDKVTKGYHLLVGDSSGGVEKVEKLNKSRQLKEVKVKRPSSLRFPLKANLFGELCF
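The protein sequence above is the conceptual translation of the CDS. As a rough sketch of that relationship, the Python snippet below translates a nucleotide string codon by codon, assuming the standard genetic code and that the input starts in frame. The helper name `translate` is illustrative; it is not the tool used to produce this record.

```python
from itertools import product

# Standard genetic code, laid out in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases (for example N) become 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS in this record.
start_peptide = translate("ATGACCAAGGAGCTGACTGACACCGAAACA")
end_peptide = translate("GAGTTATGTTTCTGA")
```

The two fragments translate to `MTKELTDTET` and `ELCF`, matching the first and last residues of the protein sequence; the final `TGA` codon is the stop and is not emitted.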
BLAST of Cp4.1LG03g15580.1 vs. Swiss-Prot
Match: VPS8_MOUSE (Vacuolar protein sorting-associated protein 8 homolog OS=Mus musculus GN=Vps8 PE=1 SV=1)

HSP 1 Score: 304.3 bits (778), Expect = 9.7e-81
Identity = 297/1210 (24.55%), Positives = 521/1210 (43.06%), Query Frame = 1

Query: 690  LVVRLSPTVEVYARLSKPDG-IQEGSMPYTAWK-CSQSIETSPSEAVER---ISLLAIAW 749
            LV+ L P+++V+  ++ P G +   S+P  AW   + +   +P  A  R   +  L +  
Sbjct: 325  LVIGLKPSLKVW--MTFPYGRMDPSSVPLLAWHFVAVNNSVNPMLAFCRGDMVHFLLVKR 384

Query: 750  DKMVQVAKLVKTELNVCGKWSLESAAIGVAWLDDQVLVILTVTGQLFLFEKDGTMIHQTS 809
            D+   +    +  L+      L    I   W++ + +V+L    +L + ++      Q  
Sbjct: 385  DESGAIHVTKQKHLH------LYYDLINFTWINSRTVVLLDSVEKLHVIDRQT----QEE 444

Query: 810  VFVDGFDKEDFIAHHTHFVNVL--GNP--------EKAYHNCVAVRGASVYVLGPKHLVI 869
            +      +   + + +HF ++   GN         EKA +  ++  G  ++ LG K + +
Sbjct: 445  LETMEISEVQLVYNSSHFKSLATGGNVSQALALVGEKACYQSISSYGGQIFYLGTKSVYV 504

Query: 870  SRLLPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVIDLPRSLESLQELVMPFLIELLL 929
              L  W+ER+  L K      AL++A + ++G A  V+ L   +   + +V   ++E+L 
Sbjct: 505  MMLRSWRERMDHLLKQDCLTEALALAWSFHEGKAKAVVGLSGDVSKRKAVVADRMVEILF 564

Query: 930  SYVDEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNRVGGVAVEFCVHITRTD 989
             Y D            Q+ +    D V                    V V++C+ + R D
Sbjct: 565  HYADRALKKCPDQGKIQVMEQHFQDTVP-------------------VIVDYCLLLQRKD 624

Query: 990  ILFDEIFSKFVAVQ-QRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSQKGWLQRVEQC 1049
            +LF +++ K       +  FLE LEPYIL D L  + P++M+ L+ H+  K  L+ VE  
Sbjct: 625  LLFGQMYDKLSENSVAKGVFLECLEPYILSDKLVGITPQVMKDLIVHFQDKKLLENVEAL 684

Query: 1050 VLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEELLAVLQTSLSLDSKKS 1109
            ++HMDI+SLD  QVV +C ++ LY A+VY++N+G+++F +P+E+L  V+   L+  + K+
Sbjct: 685  IVHMDITSLDIQQVVLMCWENRLYDAMVYVYNRGMNEFISPMEKLFKVIAPPLN--AGKT 744

Query: 1110 LGGGWGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDELLQFLLENSDTVDTRS 1169
            L      MG  K LVY+  C +G A+P G   +    V  +++++ +FL+         S
Sbjct: 745  LTDEQVVMGN-KLLVYISCCLAGRAYPLGD--IPEDLVPLVKNQVFEFLIR------LHS 804

Query: 1170 ISNKSSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKADSSLDGSVDASMQVQKEKNS 1229
            +   S E  Y  +  LL  DT   L+VL   F   E  K D         +++ Q     
Sbjct: 805  VEASSEEEVYPYVRTLLHFDTREFLNVLALTF---EDFKNDKQ-------AVEYQ----- 864

Query: 1230 TSGRKNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVEDWPSKKDLFHLFDFVANYVA 1289
                     Q +VD L+ ++ +    T    G                  LF F+A  +A
Sbjct: 865  ---------QRIVDILLKVMVENSDFTPSQVGC-----------------LFTFLARQLA 924

Query: 1290 CGKAT--ASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSVHSRKREKQVLSLLEVIPE 1349
                T   ++ +  Q+LE L S  D              +S HS +R++ +L LL+    
Sbjct: 925  KPDNTLFVNRTLFDQVLEFLCSPDD--------------DSRHS-ERQQVLLELLQAGGI 984

Query: 1350 THWNPSSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDVEEPIHAFAFINRTLMELS 1409
              +  S ++RM EKA+F+Q+C  ++   HQY   +D Y+ D       F +I+  L    
Sbjct: 985  VQFEESRLIRMAEKAEFYQICEFMYEREHQYDKIIDCYLHDPLREEEVFNYIHNILSIPG 1044

Query: 1410 NS--EQTEFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDILSRLHNHPRSLFLYLKT 1469
            +S  E+       ++ + EL++L       LV  HF   +  ++ +L N    LF +L++
Sbjct: 1045 HSAEEKQSVWQKAMNHMEELVSLKPCKAAELVATHFSEQIEVVIGQLQNQ-LLLFKFLRS 1104

Query: 1470 LIEVHQSGNLNFSCLKKDDNFGVNYSTKGLDDYLQKLSDFPKFLSNNPVDVTDDIIELYV 1529
            L++  +  ++N   L+                               P  +T+  IEL  
Sbjct: 1105 LLDPREGVHVNQELLQ------------------------------IPPHITEQFIELLC 1164

Query: 1530 ELLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDAAAFLLERVGDVGSALFLTLSSL 1589
            +    +  + V++ L+  + Y +E  +++ Q+Y++ +  A+LLE+ GD   A  L L  L
Sbjct: 1165 Q----FSPDQVIQTLQVLECYRLEETIQITQKYQLHEVTAYLLEKKGDAHGAFLLLLERL 1224

Query: 1590 DKKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQEVNDIYVLLHACIGLCQRNTPRLN 1649
              +  ++                 D      +L L+ V D  V     I LCQRN+  LN
Sbjct: 1225 QSRLQEMT--------------RQDENTKEDIL-LKGVEDTMV---ETIALCQRNSQNLN 1284

Query: 1650 SEESETLWFKLLDSFCEPLIESFNYRTASFGENQVQFLNESSGSQKD--KEAHIVTWRIL 1709
             ++ E LWF LL++   P                 Q L+ S+ +     +    +T ++L
Sbjct: 1285 QQQREALWFPLLEAMMTP-----------------QKLSSSAAAPHPHCEALKSLTMQVL 1342

Query: 1710 KSNQTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLLSDNGSQEFGDFKLTILGMLGTF 1769
             S      L S+  R +++ +                    G  + G+ +  ILGML TF
Sbjct: 1345 NSMAAFIALPSILQRILQDPI-------------------YGKGKLGEIQGLILGMLDTF 1342

Query: 1770 GFERRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGAVCCIC-NRLLVKSSSSYRVRVF 1829
             +E+ +L+T  +L+  D  +++  L+   S G  P+   C IC  +   +   +  + VF
Sbjct: 1405 NYEQTLLETTASLLNQDLHWSLCNLRASVSRGLNPKQDYCSICLQQYKRRQEMADEIIVF 1342

Query: 1830 NCGHATH---LQCEVLDNEASGGD-FSCPVCVHSNHSQRSRGKALTEYSLVNKFSSRTQS 1873
            +CGH  H   LQ +    E  G   ++C  C  SN +    GK L+E    NK    T S
Sbjct: 1465 SCGHLYHSFCLQSKECTLEVEGQTRWACHKCSSSNKA----GK-LSENPSENKKGRITSS 1342
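
The percentages in each HSP summary are the identical and positive-scoring column counts divided by the alignment length. The following is a minimal sketch of how the two summary lines could be parsed and the percentages recomputed. The `parse_hsp` function and its regular expressions are hypothetical helpers written for the line layout seen in this report, not part of any BLAST library.

```python
import re

SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
# "Posi?tives" also accepts the "Postives" spelling that some report exports use.
COUNT_RE = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")

def parse_hsp(score_line, counts_line):
    """Extract score, E-value, and recomputed percentages from an HSP summary."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError("not an HSP score line: " + score_line)
    hsp = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "evalue": float(m.group(3)),
    }
    for label, num, den in COUNT_RE.findall(counts_line):
        key = "identity" if label == "Identity" else "positives"
        hsp[key + "_pct"] = round(100 * int(num) / int(den), 2)
        hsp["align_len"] = int(den)  # denominator is the alignment length
    return hsp

hsp = parse_hsp(
    "HSP 1 Score: 304.3 bits (778), Expect = 9.7e-81",
    "Identity = 297/1210 (24.55%), Positives = 521/1210 (43.06%), Query Frame = 1",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

For the VPS8_MOUSE hit this reproduces the reported values: 297/1210 gives 24.55% identity and 521/1210 gives 43.06% positives.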

BLAST of Cp4.1LG03g15580.1 vs. Swiss-Prot
Match: VPS8_HUMAN (Vacuolar protein sorting-associated protein 8 homolog OS=Homo sapiens GN=VPS8 PE=1 SV=3)

HSP 1 Score: 302.8 bits (774), Expect = 2.8e-80
Identity = 280/1125 (24.89%), Positives = 487/1125 (43.29%), Query Frame = 1

Query: 771  IGVAWLDDQVLVILTVTGQLFLFEKDGTMIHQTSVFVDGFDKEDFIAHHTHFVNVL--GN 830
            I   W++ + +V+L    +L + ++      Q  +      +   + + +HF ++   GN
Sbjct: 405  INFTWINSRTVVLLDSVEKLHVIDRQT----QEELETVEISEVQLVYNSSHFKSLATGGN 464

Query: 831  P--------EKAYHNCVAVRGASVYVLGPKHLVISRLLPWKERVQVLRKAGDWMTALSMA 890
                     EKA +  ++  G  ++ LG K + +  L  W+ERV  L K      AL++A
Sbjct: 465  VSQALALVGEKACYQSISSYGGQIFYLGTKSVYVMMLRSWRERVDHLLKQDCLTEALALA 524

Query: 891  ITIYDGHAHGVIDLPRSLESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDD 950
             + ++G A  V+ L       + +V   ++E+L  Y D          C    K + ++ 
Sbjct: 525  WSFHEGKAKAVVGLSGDASKRKAIVADRMVEILFHYADRALKK-----CPDQGKIQVME- 584

Query: 951  VTSGSHSAHSEIKEQYNRVGGVAVEFCVHITRTDILFDEIFSKFVAVQ-QRDTFLELLEP 1010
                         + +  +  V V++C+ + R D+LF +++ K       +  FLE LEP
Sbjct: 585  -------------QHFQDMVPVIVDYCLLLQRKDLLFSQMYDKLSENSVAKGVFLECLEP 644

Query: 1011 YILKDMLGSLPPEIMQALVEHYSQKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSA 1070
            YIL D L  + P++M+ L+ H+  K  ++ VE  ++HMDI+SLD  QVV +C ++ LY A
Sbjct: 645  YILSDKLVGITPQVMKDLIVHFQDKKLMENVEALIVHMDITSLDIQQVVLMCWENRLYDA 704

Query: 1071 LVYLFNKGLDDFRTPLEELLAVLQTSLSLDSKKSLGGGWGSMGMYKTLVYLKYCFSGLAF 1130
            ++Y++N+G+++F +P+E+L  V+   L+  + K+L      MG  K LVY+  C +G A+
Sbjct: 705  MIYVYNRGMNEFISPMEKLFRVIAPPLN--AGKTLTDEQVVMGN-KLLVYISCCLAGRAY 764

Query: 1131 PPGQGTLAHSRVQSLRDELLQFLLENSDTVDTRSISNKSSEVGYLNLYHLLELDTGATLD 1190
            P G   +    V  +++++ +FL+         S      E  Y  +  LL  DT   L+
Sbjct: 765  PLGD--IPEDLVPLVKNQVFEFLIR------LHSAEASPEEEIYPYIRTLLHFDTREFLN 824

Query: 1191 VLRCAFVEGEIVKADSSLDGSVDASMQVQKEKNSTSGRKNFLVQNVVDALVHILDKAISQ 1250
            VL   F   E  K D         +++ Q              Q +VD L+ ++ +    
Sbjct: 825  VLALTF---EDFKNDKQ-------AVEYQ--------------QRIVDILLKVMVENSDF 884

Query: 1251 TYGSPGGDNITLVEDWPSKKDLFHLFDFVANYVACGKAT--ASKDVVGQILEHLISNSDI 1310
            T    G                  LF F+A  +A    T   ++ +  Q+LE L S  D 
Sbjct: 885  TPSQVGC-----------------LFTFLARQLAKPDNTLFVNRTLFDQVLEFLCSPDD- 944

Query: 1311 PEMEIDFVHSVTANSVHSRKREKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHS 1370
                         +S HS +R++ +L LL+      +  S ++RM EKA+F+Q+C  ++ 
Sbjct: 945  -------------DSRHS-ERQQVLLELLQAGGIVQFEESRLIRMAEKAEFYQICEFMYE 1004

Query: 1371 ISHQYSSALDGYMKDVEEPIHAFAFINRTLMELSNS--EQTEFRGVVISRIPELLNLNRE 1430
              HQY   +D Y++D       F +I+  L    +S  E+       +  I EL++L   
Sbjct: 1005 REHQYDKIIDCYLRDPLREEEVFNYIHNILSIPGHSAEEKQSVWQKAMDHIEELVSLKPC 1064

Query: 1431 GTFFLVIDHFGNDVSDILSRLHNHPRSLFLYLKTLIEVHQSGNLNFSCLKKDDNFGVNYS 1490
                LV  HF   +  ++ +L N    LF +L++L++  +  ++N               
Sbjct: 1065 KAAELVATHFSGHIETVIKKLQNQV-LLFKFLRSLLDPREGIHVN--------------- 1124

Query: 1491 TKGLDDYLQKLSDFPKFLSNNPVDVTDDIIELYVELLCRYERESVLKFLETFDSYHVEHC 1550
                           + L  +P  +T+  IEL  +    +    V++ L+  + Y +E  
Sbjct: 1125 --------------QELLQISPC-ITEQFIELLCQ----FNPTQVIETLQVLECYRLEET 1184

Query: 1551 LRLCQQYEVIDAAAFLLERVGDVGSALFLTLSSLDKKFHDLEAAVGGIVTNGASSGSNDS 1610
            +++ Q+Y++ +  A+LLE+ GD+  A  + L  L  K  +        VT+   +   D 
Sbjct: 1185 IQITQKYQLHEVTAYLLEKKGDIHGAFLIMLERLQSKLQE--------VTHQGENTKEDP 1244

Query: 1611 QLFSSVLKLQEVNDIYVLLHACIGLCQRNTPRLNSEESETLWFKLLDSFCEPLIESFNYR 1670
                    L++V D  V     I LCQRN+  LN ++ E LWF LL++   P        
Sbjct: 1245 -------SLKDVEDTMV---ETIALCQRNSHNLNQQQREALWFPLLEAMMAP-------- 1304

Query: 1671 TASFGENQVQFLNESSGSQKDKEA-HIVTWRILKSNQTAHILRSLFSRFIREIVEGMIGY 1730
                     Q L+ S+      EA   +T ++L S      L S+  R +++ V      
Sbjct: 1305 ---------QKLSSSAIPHLHSEALKSLTMQVLNSMAAFIALPSILQRILQDPV------ 1343

Query: 1731 VHLPTIMSRLLSDNGSQEFGDFKLTILGMLGTFGFERRILDTAKALIEDDTFYTMSLLKK 1790
                          G  + G+ +  ILGML TF +E+ +L+T  +L+  D  +++  L+ 
Sbjct: 1365 -------------YGKGKLGEIQGLILGMLDTFNYEQTLLETTTSLLNQDLHWSLCNLRA 1343

Query: 1791 GASHGYAPRGAVCCIC-NRLLVKSSSSYRVRVFNCGHATHLQCEVLDNEASGGDF----- 1850
              + G  P+   C IC  +   +   +  + VF+CGH  H  C  L N+    +F     
Sbjct: 1425 SVTRGLNPKQDYCSICLQQYKRRQEMADEIIVFSCGHLYHSFC--LQNKECTVEFEGQTR 1343

Query: 1851 -SCPVCVHSNHSQRSRGKALTEYSLVNKFSSRTQSSSGASVSYPQ 1873
             +C  C  SN      GK L+E S   K    T S    S SY Q
Sbjct: 1485 WTCYKCSSSN----KVGK-LSENSSEIKKGRITPSQVKMSPSYHQ 1343

BLAST of Cp4.1LG03g15580.1 vs. Swiss-Prot
Match: VPS8_YEAST (Vacuolar protein sorting-associated protein 8 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=VPS8 PE=1 SV=2)

HSP 1 Score: 71.2 bits (173), Expect = 1.4e-10
Identity = 92/497 (18.51%), Positives = 202/497 (40.64%), Query Frame = 1

Query: 735  ERISLLAIAWDKMVQVAKLVKTELNV-CGKWSLESAA--IGVAWLDDQVLVILTVTGQLF 794
            +  S +A + +  + V  +  ++ NV     S E A   + + W+D  +L +LT++ Q  
Sbjct: 312  QNCSRVAYSVNNKISVISISSSDFNVQSASHSPEFAESILSIQWIDQLLLGVLTISHQFL 371

Query: 795  LFEKDGTMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAYHNCVAVRGASVYVLGPKH 854
            +        H   + +    + DF+ H     +++  P K +     +   S Y+L    
Sbjct: 372  VLHPQ----HDFKILL----RLDFLIH-----DLMIPPNKYF----VISRRSFYLLTNYS 431

Query: 855  LVISRLLPWKERVQVLRKAGDWMTALSMAITIYDGHAH--GVIDLPRSLESLQELVMPFL 914
              I + + W +        GD++ AL    ++   +     ++ L  + E   + +M   
Sbjct: 432  FKIGKFVSWSDITLRHILKGDYLGALEFIESLLQPYCPLANLLKLDNNTEERTKQLM--- 491

Query: 915  IELLLSYVDEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNRVGGVAVEFCVH 974
                     E F  +S+A    + K +  D                YNRV  + +     
Sbjct: 492  ---------EPFYNLSLAALRFLIKKDNAD----------------YNRVYQLLMVVVRV 551

Query: 975  ITRTDILFDEIFSKFVAVQQRDTFLELLEPYILKDMLG---------SLPPEIMQALVEH 1034
            + ++    D I S  V ++Q   F EL +  +  +++          S+ P + ++++++
Sbjct: 552  LQQSSKKLDSIPSLDVFLEQGLEFFELKDNAVYFEVVANIVAQGSVTSISPVLFRSIIDY 611

Query: 1035 YSQKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEELLA 1094
            Y+++  L+ +E  ++ ++ ++LD +  V+LC+ + L+  L+Y++NK  DD++TP+ +L+ 
Sbjct: 612  YAKEENLKVIEDLIIMLNPTTLDVDLAVKLCQKYNLFDLLIYIWNKIFDDYQTPVVDLI- 671

Query: 1095 VLQTSLSLDSKKSLGGGWGSMGMYKTLV-YLKYCFSGLAFPPGQGTLAHSRVQSLRDELL 1154
                 +S  S+K +      +    T+  Y+ Y  +G  +P         +   ++ EL 
Sbjct: 672  ---YRISNQSEKCVIFNGPQVPPETTIFDYVTYILTGRQYPQNLSISPSDKCSKIQRELS 731

Query: 1155 QFLLEN------SDTVDTRSISNKSSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKA 1211
             F+         S++     I     E      +HLL     +    +     E  +   
Sbjct: 732  AFIFSGFSIKWPSNSNHKLYICENPEEEPAFPYFHLLLKSNPSRFLAMLNEVFEASLFND 759

BLAST of Cp4.1LG03g15580.1 vs. TrEMBL
Match: A0A0A0L2X7_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G116870 PE=4 SV=1)

HSP 1 Score: 3278.8 bits (8500), Expect = 0.0e+00
Identity = 1697/1973 (86.01%), Positives = 1795/1973 (90.98%), Query Frame = 1

Query: 1    MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDLSFPHRSIDEILNESSSSTSS 60
            MT+ELTDTETLPPMELDLNAFIHAHLSSG DD D  +DDLSFPHRSIDEILN+SSSSTS 
Sbjct: 1    MTEELTDTETLPPMELDLNAFIHAHLSSGGDDDD--DDDLSFPHRSIDEILNDSSSSTSP 60

Query: 61   SPSSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQLK 120
            SPSS P+ PPPR RR+I A D   SAS S  P+K         S+  R+N  NEKS QLK
Sbjct: 61   SPSSSPHFPPPRGRRNIVAGDDGVSASPSTSPYKD--------SEAARNNPWNEKSAQLK 120

Query: 121  PGSVSHTKVGELTDDPFRRGSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPHAAAIK 180
            PG+ SH+KVGELTDDPFRRGSR LPSLFG VRSNAKPGAALAAAAAASRS PAPHAAAIK
Sbjct: 121  PGTASHSKVGELTDDPFRRGSRPLPSLFGAVRSNAKPGAALAAAAAASRSTPAPHAAAIK 180

Query: 181  SRRSGHGSVVLDDDELASSSAVDSEFVFDDLYSTIDHSKESREKSISLVERNADYQGASV 240
            SRR+G+G++VLDDDELASSSAVDSEF  D LY    HSKES E SIS+V+R  DYQ AS+
Sbjct: 181  SRRAGYGNMVLDDDELASSSAVDSEFFSDSLYHANIHSKESGENSISVVDRITDYQIASM 240

Query: 241  NVGVEFWARDNIRDCVLYNDEFRITKDTECEAEQSFVDDVNFDESSTTLPPVEANGRSLS 300
            NV  E WA +NIRD V +NDEFR+T+D E EAE S VDDVNF ES +T+PPVE N RSL 
Sbjct: 241  NVSGELWATNNIRDGVPHNDEFRMTEDMEFEAETSSVDDVNFKESLSTVPPVETNDRSLL 300

Query: 301  DSADNNVCSMDAEPTVLDGDESNEGAFPCSPKPDYDRSAVGYGRLELETQDFEKRSQPSK 360
              A+ NVCS DA PT LD DESNEGA P   +PD + SAVGYG LELETQDFEK  QPSK
Sbjct: 301  GPAEKNVCSTDAHPTELDVDESNEGAIPRPTEPDDEESAVGYGSLELETQDFEKYHQPSK 360

Query: 361  DSEV-LAIEDLSVVNDISESRETAKQLDNFHTGERAETMSLSSSNPLELAEEIEKKQAFT 420
            D+EV LAIED S+VNDI ES ET +Q DN   G+R E +S+SS+NPL+LAEEIEKKQAFT
Sbjct: 361  DTEVDLAIEDPSIVNDIIESGETTEQPDNLQIGKRPEMISVSSTNPLDLAEEIEKKQAFT 420

Query: 421  ALHWEEGVAAQPMRLEGIKGGTTALGYFDIQADNSISRTISSHSFKREHGFPQALAVHAN 480
            ALHWEEGVAAQPMRLEGIKG TT LGYFDIQADNSISRTISSHSF+REHGFPQ LAVHAN
Sbjct: 421  ALHWEEGVAAQPMRLEGIKGVTTTLGYFDIQADNSISRTISSHSFRREHGFPQVLAVHAN 480

Query: 481  YIAVGMSKGNIVVVASKYSAQNGDNMDAKMLLLGSQGDKSTAPVTSLCFNQQGDLLLAGY 540
            YIAVGMSKGNIVVVASKYSAQNGDNMDAKM+LLGSQGDKSTAP TSLCF+QQGDLLLAGY
Sbjct: 481  YIAVGMSKGNIVVVASKYSAQNGDNMDAKMILLGSQGDKSTAPATSLCFSQQGDLLLAGY 540

Query: 541  SDGQVTVWDVLRATAAKVISGEHTSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFS 600
            SDG +TVWDVLRA+AAKVISGEH SPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFS
Sbjct: 541  SDGHITVWDVLRASAAKVISGEHASPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFS 600

Query: 601  VVPLLNRFSIKTQCLLDGQKTGTVLSASALLLNEFCGSSLPPSLSNVAVSTSSIGSMMGG 660
            VVPLLNRFS KTQCLLDGQKTGTVLSASALLLNEF GSSLPP+LSNVAVSTSSIGSMMGG
Sbjct: 601  VVPLLNRFSSKTQCLLDGQKTGTVLSASALLLNEFVGSSLPPTLSNVAVSTSSIGSMMGG 660

Query: 661  VVGGDSGWKLFNEGSSLVE-GVVIFATHQTALVVRLSPTVEVYARLSKPDGIQEGSMPYT 720
            VVGGDSGWKLFNEGSSLVE GVVIFATHQTALVVRLSPTVEVYA+LSKPDGI+EGSMPYT
Sbjct: 661  VVGGDSGWKLFNEGSSLVEEGVVIFATHQTALVVRLSPTVEVYAQLSKPDGIREGSMPYT 720

Query: 721  AWKCSQSIETSPSEAVERISLLAIAWDKMVQVAKLVKTELNVCGKWSLESAAIGVAWLDD 780
            AWKCSQS ETSPSEAVER+SLLAIAWDKMVQVAKLVKTEL VCGKWSLESAAIGV WLDD
Sbjct: 721  AWKCSQSFETSPSEAVERVSLLAIAWDKMVQVAKLVKTELKVCGKWSLESAAIGVVWLDD 780

Query: 781  QVLVILTVTGQLFLFEKDGTMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAYHNCVA 840
            QVLVILTVTGQLFLFEKDGTMIHQTS+FVDGF KEDFIA+HTHF N+LGNPEKAYHNCVA
Sbjct: 781  QVLVILTVTGQLFLFEKDGTMIHQTSIFVDGFVKEDFIAYHTHFANILGNPEKAYHNCVA 840

Query: 841  VRGASVYVLGPKHLVISRLLPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVIDLPRSL 900
            VRGAS+YVLGP HLVISRLLPWKERVQVLRKAGDWM+ALSMAITIYDGHAHGVIDLPRSL
Sbjct: 841  VRGASIYVLGPMHLVISRLLPWKERVQVLRKAGDWMSALSMAITIYDGHAHGVIDLPRSL 900

Query: 901  ESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNR 960
            ESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDD+T  SHSAHSEIKEQYNR
Sbjct: 901  ESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDDMTIESHSAHSEIKEQYNR 960

Query: 961  VGGVAVEFCVHITRTDILFDEIFSKFVAVQQRDTFLELLEPYILKDMLGSLPPEIMQALV 1020
            VGGVAVEFCVHI+RTDILFDEIFSKFV VQQRDTFLELLEPYILKDMLGSLPPEIMQALV
Sbjct: 961  VGGVAVEFCVHISRTDILFDEIFSKFVGVQQRDTFLELLEPYILKDMLGSLPPEIMQALV 1020

Query: 1021 EHYSQKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEEL 1080
            EHYS KGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEEL
Sbjct: 1021 EHYSHKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEEL 1080

Query: 1081 LAVLQTSLSLDSKKSLGGGWGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDEL 1140
            LAVL+TS S  +  SLG        YKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDEL
Sbjct: 1081 LAVLRTSKSKHAS-SLG--------YKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDEL 1140

Query: 1141 LQFLLENSDTVDTRSISNKSSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKADSSLD 1200
            LQFLLENSD VDTRSISNKSSEVG LNLY LLELDT ATLDVLRCAFVEGEI+KA SSLD
Sbjct: 1141 LQFLLENSDAVDTRSISNKSSEVGCLNLYPLLELDTEATLDVLRCAFVEGEILKAISSLD 1200

Query: 1201 GSVDASMQVQKEKNSTSGRKNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVEDWPSK 1260
            G VD SMQ+Q+EKNS SGRKNFL+QNVVDALVH+LDKAI +T  SP GDNITLV+DWPSK
Sbjct: 1201 GPVDTSMQLQEEKNSISGRKNFLIQNVVDALVHVLDKAICETDESPAGDNITLVDDWPSK 1260

Query: 1261 KDLFHLFDFVANYVACGKATASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSVHSRKR 1320
            K+L HLFDF+A YVACGKAT SKDVVGQILEHLISNSDIPE   DF+  VTANSV SRKR
Sbjct: 1261 KELIHLFDFIATYVACGKATVSKDVVGQILEHLISNSDIPETVSDFLPRVTANSVLSRKR 1320

Query: 1321 EKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDVEEPIH 1380
            EKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHSI+HQYSSALD YMKDV+EPIH
Sbjct: 1321 EKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHSITHQYSSALDSYMKDVDEPIH 1380

Query: 1381 AFAFINRTLMELSNSEQTEFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDILSRLHN 1440
             F FINRTL+EL NSEQTEFR VVISRIPEL NLNR  TFFLVIDHF NDVS+ILS+L N
Sbjct: 1381 TFTFINRTLLELGNSEQTEFRAVVISRIPELFNLNRGATFFLVIDHFNNDVSNILSQLRN 1440

Query: 1441 HPRSLFLYLKTLIEVHQSGNLNFSCLKKDDNFGVNYSTKGLDDYLQKLSDFPKFLSNNPV 1500
            HPRSLFLYLKTLIEVH SG+ +FSCLKKDDN GVNYSTKG+DDYLQKLSDFPK+LSNNPV
Sbjct: 1441 HPRSLFLYLKTLIEVHLSGSPDFSCLKKDDNLGVNYSTKGMDDYLQKLSDFPKYLSNNPV 1500

Query: 1501 DVTDDIIELYVELLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDAAAFLLERVGDV 1560
            DVTDDIIELYVELLC++ERESVLKFLETFDSY VEHCLRLCQQYEVIDAAAFLLERVGDV
Sbjct: 1501 DVTDDIIELYVELLCQHERESVLKFLETFDSYRVEHCLRLCQQYEVIDAAAFLLERVGDV 1560

Query: 1561 GSALFLTLSSLDKKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQEVNDIYVLLHACI 1620
            GSALFLTLSSLDKKFHDLEAAVG  V+N ASSGSNDSQ F+SVLKLQEVN + VLLHACI
Sbjct: 1561 GSALFLTLSSLDKKFHDLEAAVGATVSNTASSGSNDSQNFNSVLKLQEVNAVKVLLHACI 1620

Query: 1621 GLCQRNTPRLNSEESETLWFKLLDSFCEPLIESFNYRTASFGENQVQFLNESSGSQKDKE 1680
            GLCQRNTPRLNSEES+TLWFKLLDSFCEPLI+S+N+RTASF +NQVQFLNESS SQKDKE
Sbjct: 1621 GLCQRNTPRLNSEESQTLWFKLLDSFCEPLIDSYNHRTASFEKNQVQFLNESSCSQKDKE 1680

Query: 1681 AHIVTWRILKSNQTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLLSDNGSQEFGDFKL 1740
            A+IVTWRILKSN+ AH+LR LFS+FIREIVEGM+GYVHLPTIMSRLL DNGSQEFGDFKL
Sbjct: 1681 ANIVTWRILKSNKVAHLLRKLFSQFIREIVEGMMGYVHLPTIMSRLLYDNGSQEFGDFKL 1740

Query: 1741 TILGMLGTFGFERRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGAVCCICNRLLVKSS 1800
            TILGMLGTFGFERRILD+AKALIEDD+FYTMSLLKKGA+HGYAPR  VCCICNRLLVKSS
Sbjct: 1741 TILGMLGTFGFERRILDSAKALIEDDSFYTMSLLKKGAAHGYAPRSVVCCICNRLLVKSS 1800

Query: 1801 SSYRVRVFNCGHATHLQCEVLDNEASGGDFSCPVCVHSNHSQRSRGKALTEYSLVNKFSS 1860
            SSYRVRVFNCGHATHLQCE L+NEASGGD++CP+CVHSN SQ S+ KA TEYSLVNKFSS
Sbjct: 1801 SSYRVRVFNCGHATHLQCEDLENEASGGDYTCPICVHSNQSQGSKSKAPTEYSLVNKFSS 1860

Query: 1861 RTQSSSGASVSYPQETDLLELPYTLQQIPRFEILANLQKNQRVIDIENMPQLRLAPPAVY 1920
            RTQSSSGASVSYPQETDLLELPYTLQQIPRFEIL NLQKNQRVIDIEN+PQLRLAPPAVY
Sbjct: 1861 RTQSSSGASVSYPQETDLLELPYTLQQIPRFEILTNLQKNQRVIDIENVPQLRLAPPAVY 1920

Query: 1921 HDKVTKGYHLLVGDSSGGVEKVEKLNKSRQLKEVKVKRPSSLRFPLKANLFGE 1972
            HDKVTKGYHLLVG+SSGG EKVEKLNKSRQL  VKVKRPSSLRFPLK +LFG+
Sbjct: 1921 HDKVTKGYHLLVGESSGGREKVEKLNKSRQLTGVKVKRPSSLRFPLKTSLFGK 1954

BLAST of Cp4.1LG03g15580.1 vs. TrEMBL
Match: F6I2Y1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0048g02590 PE=4 SV=1)

HSP 1 Score: 2219.5 bits (5750), Expect = 0.0e+00
Identity = 1232/2025 (60.84%), Positives = 1491/2025 (73.63%), Query Frame = 1

Query: 1    MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDLSFPHRSIDEILNESSSSTSS 60
            MTK+L+     PPMELDL++FIH  L+S DDD D        PHR++DEILN+S SS+SS
Sbjct: 1    MTKKLS----APPMELDLDSFIH--LTSDDDDDDALN---RVPHRTVDEILNDSDSSSSS 60

Query: 61   SPSSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQLK 120
               S  +     +     A D R        P K+  +E  K+++  + N+  ++ VQ K
Sbjct: 61   LSPSDHSYLAKHSSLFEDANDSRDDVVSVSTP-KTLSDERPKSAESLKFNEIEDRLVQFK 120

Query: 121  PGSVSHTKVGELTDDPF---RRGSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPHAA 180
              S+S  + G+L+ D F   RR SR LP LFG VRSNAKPGAALAAAAAASR +P PHAA
Sbjct: 121  ANSLSRVRTGDLSGDSFSLGRRVSRPLPPLFGSVRSNAKPGAALAAAAAASRPVPTPHAA 180

Query: 181  AIKSRRSGHGSV--VLDDDELASS------------SAVDSEFVFDDLYSTIDHSKESRE 240
            AIKSRR+G G++  VLD +EL  S            +   SE    D  S  +  K    
Sbjct: 181  AIKSRRAGSGALQRVLDTEELGGSGLDKLGSSSDVLNGAGSEIASSDWKSGEEDDKFEDF 240

Query: 241  KSISL---VERNADYQGASVNVGVEFWARDN-IRDC---------VLYNDEFRITKDTEC 300
            +S ++   V+ + D + +  +  VE   RD  + D           L  DE R+    E 
Sbjct: 241  QSATIEWTVKADVDDKVSVKDEIVESSHRDGEVFDLEKVPTEVVHTLEEDESRVNDSDEI 300

Query: 301  ----EAEQSFVDDVNFDESSTTLPPVEANGRSLSDSADNNVCSMDAEPTVLDGDESNEGA 360
                 AE      ++ +E S  L    A   S  D  D N+ S + E T      SN   
Sbjct: 301  LLNSSAETGLAASLSIEEESFDLNEGSAISGSY-DVKDQNIASDNVEETA-----SNSTF 360

Query: 361  FPCSPKPDYDRSAVGYGRLELETQDFEKRSQPSKDSEV-LAIEDLSVVNDISES-RETAK 420
               +   D D        L L+TQD E    PS D EV +A +D S  +D++E   E   
Sbjct: 361  LDAANSADKDEKV--REDLTLKTQDLEPVEPPSTDGEVNIAGDDWSPKSDVTELVEERLG 420

Query: 421  QLDNFHTGERAETMSLSSSNPLELAEEIEKKQAFTALHWEEGVAAQPMRLEGIKGGTTAL 480
            QL++    +R E        PLELAEE+EK QA T LHWEEG AAQPMRLEG++ G+T L
Sbjct: 421  QLESKMGSKRTEKKP--RLKPLELAEELEKSQASTGLHWEEGAAAQPMRLEGVRRGSTTL 480

Query: 481  GYFDIQADNSISRTISSHSFKREHGFPQALAVHANYIAVGMSKGNIVVVASKYSAQNGDN 540
            GYF+I  +N+I+RTISS +FKR+HG PQ LAVH N+IAVGMS+G ++VV SKYSA N DN
Sbjct: 481  GYFEIDNNNTITRTISSPAFKRDHGSPQVLAVHLNFIAVGMSRGVVMVVPSKYSAYNADN 540

Query: 541  MDAKMLLLGSQGDKSTAPVTSLCFNQQGDLLLAGYSDGQVTVWDVLRATAAKVISGEHTS 600
            MDAK+L+LG QG++S APVTS+CFN QGDLLLAGY DG +TVWDV RATAAKVI+GEH++
Sbjct: 541  MDAKILMLGLQGERSHAPVTSMCFNHQGDLLLAGYGDGHITVWDVQRATAAKVITGEHSA 600

Query: 601  PVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFSVVPLLNRFSIKTQCLLDGQKTGTVL 660
            PV+H+LFLGQ++QVTRQFKAVTGDSKGLVLLH FSVVPLLNRFSIKTQCLLDGQ+TGTVL
Sbjct: 601  PVIHTLFLGQDSQVTRQFKAVTGDSKGLVLLHAFSVVPLLNRFSIKTQCLLDGQRTGTVL 660

Query: 661  SASALLLNEFCGSSLPPSLSNVAVSTSSIGSMMGGVVGGDSGWKLFNEGSSLVE-GVVIF 720
            SAS LLL+E  GSSL  S  N   STSSIGSMMGGVVGGD+GWKLF+EGSSLVE GVVIF
Sbjct: 661  SASPLLLDESSGSSLMSSQGNATGSTSSIGSMMGGVVGGDAGWKLFSEGSSLVEEGVVIF 720

Query: 721  ATHQTALVVRLSPTVEVYARLSKPDGIQEGSMPYTAWKCSQ------SIETSPSEAVERI 780
             THQTALVVRLSP++EVYA+L+KPDG++EGSMPYTAWKC        S E +P EA ER+
Sbjct: 721  VTHQTALVVRLSPSLEVYAQLNKPDGVREGSMPYTAWKCMTIHSRGLSTENTPVEASERV 780

Query: 781  SLLAIAWDKMVQVAKLVKTELNVCGKWSLESAAIGVAWLDDQVLVILTVTGQLFLFEKDG 840
            SLLAIAWD+ VQVAKLVK+EL + GKW+LES AIGVAWLDDQ+LV+LT TGQL LF KDG
Sbjct: 781  SLLAIAWDRKVQVAKLVKSELKIYGKWTLESTAIGVAWLDDQILVVLTSTGQLCLFAKDG 840

Query: 841  TMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAYHNCVAVRGASVYVLGPKHLVISRL 900
            T+IHQTS  VDG   +D +A+HT+F N+ GNPEKAY N +AVRGAS+Y+LGP HLV+SRL
Sbjct: 841  TVIHQTSFAVDGSGGDDPVAYHTYFTNIFGNPEKAYQNSIAVRGASIYILGPVHLVVSRL 900

Query: 901  LPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVIDLPRSLESLQELVMPFLIELLLSYV 960
            L WKER+QVLRKAGDWM AL+MA+T+YDG++HGVIDLPRSLE++QE +MP+L+ELLLSYV
Sbjct: 901  LTWKERIQVLRKAGDWMGALNMAMTLYDGNSHGVIDLPRSLEAVQEAIMPYLVELLLSYV 960

Query: 961  DEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNRVGGVAVEFCVHITRTDILF 1020
            DEVFSYISVAFCNQI K E+LDD  +   S H EIKEQ+ RVGGVAVEFCVHI RTDILF
Sbjct: 961  DEVFSYISVAFCNQIGKMEQLDDPKNRGSSVHFEIKEQFTRVGGVAVEFCVHIKRTDILF 1020

Query: 1021 DEIFSKFVAVQQRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSQKGWLQRVEQCVLHM 1080
            DEIFSKFV VQ RDTFLELLEPYILKDMLGSLPPEIMQALVEHYS KGWLQRVEQCVLHM
Sbjct: 1021 DEIFSKFVGVQHRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSSKGWLQRVEQCVLHM 1080

Query: 1081 DISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEELLAVLQTSLSLDSKKSLGGG 1140
            DISSLDFNQVVRLCR+HGLY AL+YLFN+GLDDF+ PLEELL VL  +   +S  SLG  
Sbjct: 1081 DISSLDFNQVVRLCREHGLYGALIYLFNRGLDDFKAPLEELLVVL-LNRPRESASSLG-- 1140

Query: 1141 WGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDELLQFLLENSDTVDTRSISNK 1200
                  Y+ LVYLKYCFSGLAFPPG GTL  +R+ SLR EL+QFLLE+ + ++++++S+ 
Sbjct: 1141 ------YRMLVYLKYCFSGLAFPPGHGTLPPTRLPSLRTELVQFLLEDLNALNSQAVSSL 1200

Query: 1201 SSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKADSSLDGSVDASMQVQKEKNSTSGR 1260
            SS     NLYHLLELDT ATLDVLR AFVE EI K D SL  S DA+M+  KE +     
Sbjct: 1201 SSTRALPNLYHLLELDTEATLDVLRYAFVEDEITKPDVSLHDSTDANMEAGKEIDLMGEI 1260

Query: 1261 KNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVEDWPSKKDLFHLFDFVANYVACGKA 1320
            +N LVQN V+AL+HILD  ISQ   S G  +I  +E WPSKKD+ HLF+FVA YVAC +A
Sbjct: 1261 QNLLVQNTVNALIHILD--ISQKNRSSGSSDIGSLELWPSKKDMGHLFEFVAYYVACKRA 1320

Query: 1321 TASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSVHS-RKREKQVLSLLEVIPETHWNP 1380
              SK V+ QILE+L S + +P+       S +  SV + ++REKQVL+LLEV+PE  W+ 
Sbjct: 1321 NVSKTVLSQILEYLTSENKLPQ-------SSSKESVGTLKRREKQVLALLEVVPEKDWDA 1380

Query: 1381 SSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDVEEPIHAFAFINRTLMELSNSEQT 1440
            S VL +CEKA+F+QVCGLIHSI HQY +ALD YMKDV+EP+HAF+FIN TL +LS++E  
Sbjct: 1381 SYVLHLCEKAEFYQVCGLIHSIRHQYLTALDSYMKDVDEPVHAFSFINHTLSQLSDTESA 1440

Query: 1441 EFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDILSRLHNHPRSLFLYLKTLIEVHQS 1500
             FR  VISRIPEL+NL+REGTFFL+IDHF  +   ILS L +HP+SLFLYLKT+IEVH S
Sbjct: 1441 AFRSAVISRIPELVNLSREGTFFLIIDHFNKESPHILSELRSHPKSLFLYLKTVIEVHLS 1500

Query: 1501 GNLNFSCLKKDDNFGVNYSTK------GLDDYLQKLSDFPKFLSNNPVDVTDDIIELYVE 1560
            G LNFSCL+ DD    +   +      GL+ YL+++ DFPK L NNPV VTD++IELY+E
Sbjct: 1501 GTLNFSCLQNDDTMDASCGRRVKNQLYGLEAYLERILDFPKLLLNNPVHVTDEMIELYLE 1560

Query: 1561 LLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDAAAFLLERVGDVGSALFLTLSSLD 1620
            LLC+YE  SVLKFLETF+SY VEHCLRLCQ+Y +IDAAAFLLERVGDVGSAL LTLS L+
Sbjct: 1561 LLCQYEHTSVLKFLETFESYRVEHCLRLCQEYGIIDAAAFLLERVGDVGSALLLTLSGLN 1620

Query: 1621 KKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQEVNDIYVLLHACIGLCQRNTPRLNS 1680
             KF+ LE AVG I++  ASS  +     ++VLK++EV+DIY +LH CIGLCQRNTPRL  
Sbjct: 1621 DKFNVLETAVGSILSEKASSVDH----LNTVLKMKEVSDIYDILHTCIGLCQRNTPRLVP 1680

Query: 1681 EESETLWFKLLDSFCEPLIESFNYRTASFGENQVQFLNESSGSQKDKEAHIVTWRILKSN 1740
            EESE+LWF+LLDSFCEPL++S++ +  S  E  V  L ES  +Q   EA +  W I KS+
Sbjct: 1681 EESESLWFQLLDSFCEPLMDSYDDKIVSEVEKPVGILAESLETQAGDEACLNKWSIPKSH 1740

Query: 1741 QTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLLSDNGSQEFGDFKLTILGMLGTFGFE 1800
            Q AH+LR LFS+FI+EIVEGM+G+V LP IMS+LLSDNG+QEFGDFK+TILGMLGT+GFE
Sbjct: 1741 QGAHLLRRLFSQFIKEIVEGMVGFVRLPVIMSKLLSDNGNQEFGDFKVTILGMLGTYGFE 1800

Query: 1801 RRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGAVCCICNRLLVKSSSSYRVRVFNCGH 1860
            RRILDTAK+LIEDDTFYTMSLLKKGASHGYAPR  +CCICN L  K+SSS  +RVFNCGH
Sbjct: 1801 RRILDTAKSLIEDDTFYTMSLLKKGASHGYAPRSLICCICNCLFTKNSSSSSIRVFNCGH 1860

Query: 1861 ATHLQCEVLDNEASGGDFS--CPVCVHSNHSQRSRGKA-LTEYSLVNKFSSR-TQSSSGA 1920
            ATHLQCE+L+NEAS    S  CPVC+    +QRSR K+ L E  LV+K  SR TQ + G 
Sbjct: 1861 ATHLQCELLENEASNRSSSVGCPVCLPKKKTQRSRSKSVLMENGLVSKVPSRKTQQAQGT 1920

Query: 1921 SVSYPQETDLLELPYTLQQIPRFEILANLQKNQRVIDIENMPQLRLAPPAVYHDKVTKGY 1972
             V +P E D+LE PY LQQIPRFEIL NLQK++R I IEN+PQLRLAPPAVYH+KV KG 
Sbjct: 1921 IVLHPHENDVLENPYGLQQIPRFEILNNLQKDKRAIQIENLPQLRLAPPAVYHEKVAKGI 1980

BLAST of Cp4.1LG03g15580.1 vs. TrEMBL
Match: V4U715_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10018449mg PE=4 SV=1)

HSP 1 Score: 2207.2 bits (5718), Expect = 0.0e+00
Identity = 1208/2024 (59.68%), Positives = 1487/2024 (73.47%), Query Frame = 1

Query: 1    MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDLSFPHRSIDEILNESSSSTS- 60
            MTKEL DT++L  MELD+++F+++HLSS     D D++  S PHR++DEILN+S SSTS 
Sbjct: 1    MTKELQDTKSL--MELDVDSFLNSHLSS-----DSDDEFNSVPHRTLDEILNDSESSTSP 60

Query: 61   SSPSSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQL 120
            SSP+S  +       +     DG +S  +  P                            
Sbjct: 61   SSPTSSIHHSDTSLAKPQPQGDGVSSQDKPTP---------------------------- 120

Query: 121  KPGSVSHTKVGELTDDPFRR----GSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPH 180
            KPGS    K  EL+ DP  R     SR LPSLFGGVRS AKPGAALAAAAAASRS+P PH
Sbjct: 121  KPGSFHRVKSNELSGDPIWRVPPSSSRQLPSLFGGVRSTAKPGAALAAAAAASRSVPTPH 180

Query: 181  AAAIKSRRSGHGSVVL----DDDELASSSAVDSEFVFDDLYSTIDHSKESREKSISLVER 240
            AAAIKSRR+G G+++     DD E+AS S+           + I  S E  E    L+  
Sbjct: 181  AAAIKSRRAGSGTLLKVLDGDDHEIASVSS-----------NEISVSSEKLEGDAELI-- 240

Query: 241  NADYQGASVNVGVEFWARDNIRDCVLYNDEFRITKDTECEAEQSFVDDVNFDESS----- 300
              D+Q A VNV  E  +  + RD            DT+ E+E S VDD   + SS     
Sbjct: 241  -GDFQSAQVNVSGELSSLASSRDV-----------DTKLESEVSNVDDEFLNTSSNLNTG 300

Query: 301  --------TTLPPVEANGRSLSDSAD--NNV--------CSMDAEPTVLDGDESNEGAFP 360
                      +  +    +S+  S+D  N++         + D +   L+ + S E +  
Sbjct: 301  QLIGCSPRVVVKDLNLREKSIIASSDDANDIDGNRIVAPVTADDDSMFLEVNASTESS-- 360

Query: 361  CSPKPDYDRSAVGYGRLELETQDFE---KRSQPSKDSE--VLAIEDLSVVNDISE-SRET 420
              P  + DR+ +    LE+ T + E   K    S+D E  V    D S ++DISE   E 
Sbjct: 361  VVPLNESDRTGLMEENLEIPTLEMESSDKSMSTSQDDEVGVDGSNDASSIDDISELVEER 420

Query: 421  AKQLDNFHTGERAETMSLSSSNPLELAEEIEKKQAFTALHWEEGVAAQPMRLEGIKGGTT 480
              QL++  T  RAE     S  PLELAEE+EKKQA T LHW+EG AAQPMRLEG++ G+T
Sbjct: 421  IGQLESEITSRRAEKKVQPSLKPLELAEELEKKQASTGLHWKEGAAAQPMRLEGVRRGST 480

Query: 481  ALGYFDIQADNSISRTISSHSFKREHGFPQALAVHANYIAVGMSKGNIVVVASKYSAQNG 540
             LGYFD+ A+N+I++TI+S +F+R+HG PQ LAVH ++IAVGMSKG IVVV SKYSA + 
Sbjct: 481  TLGYFDVDANNTITQTIASQAFRRDHGSPQVLAVHPSFIAVGMSKGAIVVVPSKYSAHHR 540

Query: 541  DNMDAKMLLLGSQGDKSTAPVTSLCFNQQGDLLLAGYSDGQVTVWDVLRATAAKVISGEH 600
            D+MD+KM++LG  GD+S APVT++CFNQ GDLLLAGY+DG VTVWDV RA+AAKVI+GEH
Sbjct: 541  DSMDSKMMMLGLLGDRSPAPVTAMCFNQPGDLLLAGYADGHVTVWDVQRASAAKVITGEH 600

Query: 601  TSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFSVVPLLNRFSIKTQCLLDGQKTGT 660
            TSPVVH+LFLGQ++QVTRQFKAVTGD+KGLV LH+ SVVPLLNRFSIKTQCLLDGQKTG 
Sbjct: 601  TSPVVHTLFLGQDSQVTRQFKAVTGDTKGLVQLHSLSVVPLLNRFSIKTQCLLDGQKTGI 660

Query: 661  VLSASALLLNEFCGSSLPPSLSNVAVSTSSIGSMMGGVVGGDSGWKLFNEGSSLV-EGVV 720
            VLSAS LL +E CG +   S  N   S SSIGSMMGGVVG D+GWKLFNEGSSLV EGVV
Sbjct: 661  VLSASPLLFDESCGGAPLSSQGNSTASASSIGSMMGGVVGSDTGWKLFNEGSSLVEEGVV 720

Query: 721  IFATHQTALVVRLSPTVEVYARLSKPDGIQEGSMPYTAWKC-----SQSIETSPSEAVER 780
            IF T+QTALVVRL+PT+EVYA++ +PDG++EG+MPYTAWKC     S + E+ P+EA ER
Sbjct: 721  IFVTYQTALVVRLTPTLEVYAQIPRPDGVREGAMPYTAWKCMTTCRSSTTESIPTEAAER 780

Query: 781  ISLLAIAWDKMVQVAKLVKTELNVCGKWSLESAAIGVAWLDDQVLVILTVTGQLFLFEKD 840
            +SLLAIAWD+ VQVAKLVK+EL V GKWSL+SAAIGVAWLDDQ+LV+LT+ GQL+L+ +D
Sbjct: 781  VSLLAIAWDRKVQVAKLVKSELKVYGKWSLDSAAIGVAWLDDQMLVVLTLLGQLYLYARD 840

Query: 841  GTMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAYHNCVAVRGASVYVLGPKHLVISR 900
            GT+IHQTS  VDG    D + + ++F NV GNPEK+YHNCV+VRGAS+YVLGP HLV+SR
Sbjct: 841  GTVIHQTSFAVDGSQGYDLVGYRSYFTNVFGNPEKSYHNCVSVRGASIYVLGPMHLVVSR 900

Query: 901  LLPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVIDLPRSLESLQELVMPFLIELLLSY 960
            LLPWKER+QVLRKAGDWM AL+MA+T+YDG AHGVIDLPR+L+++QE +MP+L+ELLLSY
Sbjct: 901  LLPWKERIQVLRKAGDWMGALNMAMTLYDGQAHGVIDLPRTLDAVQEAIMPYLVELLLSY 960

Query: 961  VDEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNRVGGVAVEFCVHITRTDIL 1020
            VDEVFSYISVAFCNQIEK  +L++  S S + H+EIKEQ+ RVGGVAVEFCVHI RTDIL
Sbjct: 961  VDEVFSYISVAFCNQIEKLAQLNNPQSRSSTVHAEIKEQFTRVGGVAVEFCVHINRTDIL 1020

Query: 1021 FDEIFSKFVAVQQRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSQKGWLQRVEQCVLH 1080
            FD+IFSKF AVQ RDTFLELLEPYILKDMLGSLPPEIMQALVEHYS KGWLQRVEQCVLH
Sbjct: 1021 FDDIFSKFEAVQHRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSSKGWLQRVEQCVLH 1080

Query: 1081 MDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEELLAVLQTSLSLDSKKSLGG 1140
            MDISSLDFNQVVRLCR+HGL+ ALVYLFNKGLDDFR PLEELL VL+ S   +S  +LG 
Sbjct: 1081 MDISSLDFNQVVRLCREHGLHGALVYLFNKGLDDFRAPLEELLVVLRNS-ERESAYALG- 1140

Query: 1141 GWGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDELLQFLLENSDTVDTRSISN 1200
                   Y+ LVYLKYCF GLAFPPG GTL  +R+ SLR EL+QFLLE SD  ++++ S+
Sbjct: 1141 -------YRMLVYLKYCFKGLAFPPGHGTLPSTRLPSLRAELVQFLLEESDAQNSQAASS 1200

Query: 1201 KSSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKADSSLDGSVDASMQVQKEKNSTSG 1260
               +  YLNLYHLLELDT ATLDVLRCAF+E E  K+D       D + +        + 
Sbjct: 1201 LLLKGSYLNLYHLLELDTEATLDVLRCAFIEVETPKSDFYACDMADTNAEPNNGNKMVAE 1260

Query: 1261 RKNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVEDWPSKKDLFHLFDFVANYVACGK 1320
             +N LVQN V+ALVHILD+ IS T GS   D+   VE WPS KD+ H+F+F+A YVA G+
Sbjct: 1261 YQNMLVQNTVNALVHILDEDISSTDGSASKDDSGSVEAWPSTKDIGHIFEFIACYVASGR 1320

Query: 1321 ATASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSVHSRKREKQVLSLLEVIPETHWNP 1380
            AT SK V+ QIL++L S  ++P+       S+ ++   S++REKQ+L+LLE +PET WN 
Sbjct: 1321 ATVSKSVLSQILQYLTSEKNVPQ-------SILSHIETSKRREKQLLALLEAVPETDWNA 1380

Query: 1381 SSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDVEEPIHAFAFINRTLMELSNSEQT 1440
            S VL +CE A F+QVCGLIH+I + Y +ALD YMKDV+EPI AF+FI+ TL++L+++E T
Sbjct: 1381 SEVLHLCENAHFYQVCGLIHTIRYNYLAALDSYMKDVDEPICAFSFIHDTLLQLTDNEYT 1440

Query: 1441 EFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDILSRLHNHPRSLFLYLKTLIEVHQS 1500
             F   VISRIPEL+ L+RE TFFLVID F ++ S ILS L +HP+SLFLYLKT++EVH  
Sbjct: 1441 AFHSAVISRIPELICLSREATFFLVIDQFNDEASHILSELRSHPKSLFLYLKTVVEVHLH 1500

Query: 1501 GNLNFSCLKKDDNFG------VNYSTKGLDDYLQKLSDFPKFLSNNPVDVTDDIIELYVE 1560
            G LN S L+KDD         V Y +KGL  Y++++SD PKFLS+N V VTDD+IELY+E
Sbjct: 1501 GTLNLSYLRKDDTLDVANCKWVKYQSKGLGAYIERISDLPKFLSSNAVHVTDDMIELYLE 1560

Query: 1561 LLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDAAAFLLERVGDVGSALFLTLSSLD 1620
            LLCRYER+SVLKFLETFDSY VE+CLRLCQ+Y + DAAAFLLERVGDVGSAL LTLS L+
Sbjct: 1561 LLCRYERDSVLKFLETFDSYRVEYCLRLCQEYGITDAAAFLLERVGDVGSALLLTLSELN 1620

Query: 1621 KKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQEVNDIYVLLHACIGLCQRNTPRLNS 1680
             KF  LE AVG  +    S+GS   + FS+VL ++EVND+  +L ACIGLCQRNTPRLN 
Sbjct: 1621 DKFAALETAVGSALPIAVSNGSVSVEHFSTVLNMEEVNDVNNILRACIGLCQRNTPRLNP 1680

Query: 1681 EESETLWFKLLDSFCEPLIESFNYRTASFGENQVQFLNESSGSQKDKEAHIVTWRILKSN 1740
            EESE LWFKLLDSFCEPL+ SF  R AS  EN  + L ES GSQ+D EA I+ WRI KS+
Sbjct: 1681 EESEVLWFKLLDSFCEPLMGSFVER-ASERENHSRMLEESFGSQEDAEACIIKWRISKSH 1740

Query: 1741 QTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLLSDNGSQEFGDFKLTILGMLGTFGFE 1800
            + +HILR LFS+FI+EIVEGMIGYVHLPTIMS+LLSDNGSQEFGDFKLTILGMLGT+ FE
Sbjct: 1741 RGSHILRKLFSQFIKEIVEGMIGYVHLPTIMSKLLSDNGSQEFGDFKLTILGMLGTYSFE 1800

Query: 1801 RRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGAVCCICNRLLVKSSSSYRVRVFNCGH 1860
            RRILDTAK+LIEDDTFYTMS+LKK ASHGYAPR  +CCICN LL K+SSS+++RVFNCGH
Sbjct: 1801 RRILDTAKSLIEDDTFYTMSVLKKEASHGYAPRSLLCCICNCLLTKNSSSFQIRVFNCGH 1860

Query: 1861 ATHLQCEVLDNEASGGD--FSCPVCVHSNHSQRSRGK-ALTEYSLVNKFSSRTQSSSGAS 1920
            ATH+QCE+L+NE+S       CP+C+   ++QRSR K  L E  LV+KFSSR Q S G +
Sbjct: 1861 ATHIQCELLENESSSKSNLSGCPLCMPKKNTQRSRNKTVLAESGLVSKFSSRPQQSLGTT 1920

Query: 1921 VSYPQETDLLELPYTLQQIPRFEILANLQKNQRVIDIENMPQLRLAPPAVYHDKVTKGYH 1972
            + +  E+D  +    +QQ+ RFEIL NL+K+QRV+ IENMPQLRLAPPA+YH+KV KG  
Sbjct: 1921 L-HSHESDTSDYSNGIQQLSRFEILNNLRKDQRVVQIENMPQLRLAPPAIYHEKVKKGTD 1944

BLAST of Cp4.1LG03g15580.1 vs. TrEMBL
Match: A0A067H3N5_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g000170mg PE=4 SV=1)

HSP 1 Score: 2206.8 bits (5717), Expect = 0.0e+00
Identity = 1210/2024 (59.78%), Positives = 1483/2024 (73.27%), Query Frame = 1

Query: 1    MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDLSFPHRSIDEILNESSSSTS- 60
            MTKEL DT++L  MELD+++F+++HLSS     D D++  S PHR++DEILN+S SSTS 
Sbjct: 1    MTKELQDTKSL--MELDVDSFLNSHLSS-----DSDDEFNSVPHRTLDEILNDSESSTSP 60

Query: 61   SSPSSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQL 120
            SSP+S  +       +     DG +S  +  P                            
Sbjct: 61   SSPTSSIHHSDTSLAKPQPQGDGVSSQDKPTP---------------------------- 120

Query: 121  KPGSVSHTKVGELTDDPFRR----GSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPH 180
            KPGS    K  EL+ DP  R     SR LPSLFGGVRS AKPGAALAAAAAASRS+P PH
Sbjct: 121  KPGSFHRVKSNELSGDPIWRVPPSSSRQLPSLFGGVRSTAKPGAALAAAAAASRSVPTPH 180

Query: 181  AAAIKSRRSGHGSVVL----DDDELASSSAVDSEFVFDDLYSTIDHSKESREKSISLVER 240
            AAAIKSRR+G G+++     DD E+AS S+           + I  S E  E    L+  
Sbjct: 181  AAAIKSRRAGSGTLLKVLDGDDHEIASVSS-----------NEISVSSEKLEGDAELI-- 240

Query: 241  NADYQGASVNVGVEFWARDNIRDCVLYNDEFRITKDTECEAEQSFVDD--------VNFD 300
              D+Q A VNV  E  +  + RD            DT+ E+E S VDD        +N D
Sbjct: 241  -GDFQSAQVNVSGELSSLASSRDV-----------DTKLESEVSNVDDEFLNTSSNLNTD 300

Query: 301  ESSTTLPPVEANGRSL--------SDSADNNVCSMDAEPTVLDGDE-------SNEGAFP 360
            +     P V     +L        SD A++   +    P   D D        S E +  
Sbjct: 301  QLIGCSPRVVVKDLNLREKSIIASSDDANDIDGNRIVAPVTADDDSMFLEVNASTESS-- 360

Query: 361  CSPKPDYDRSAVGYGRLELETQDFE---KRSQPSKDSE--VLAIEDLSVVNDISE-SRET 420
              P  + DR+ +    LE+ T + E   K    S+D E  V    D S ++DISE   E 
Sbjct: 361  VVPLNESDRTGLMEENLEIPTLEMESSDKSMSTSQDDEVGVDGSNDASSIDDISELVEER 420

Query: 421  AKQLDNFHTGERAETMSLSSSNPLELAEEIEKKQAFTALHWEEGVAAQPMRLEGIKGGTT 480
              QL++  T  RAE     S  PLELAEE+EKKQA T LHW+EG AAQPMRLEG++ G+T
Sbjct: 421  IGQLESEITSRRAEKKVQPSLKPLELAEELEKKQASTGLHWKEGAAAQPMRLEGVRRGST 480

Query: 481  ALGYFDIQADNSISRTISSHSFKREHGFPQALAVHANYIAVGMSKGNIVVVASKYSAQNG 540
             LGYFD+ A+N+I++TI+S +F+R+HG PQ LAVH ++IAVGMSKG IVVV  KYSA + 
Sbjct: 481  TLGYFDVDANNTITQTIASQAFRRDHGSPQVLAVHPSFIAVGMSKGAIVVVPGKYSAHHR 540

Query: 541  DNMDAKMLLLGSQGDKSTAPVTSLCFNQQGDLLLAGYSDGQVTVWDVLRATAAKVISGEH 600
            D+MD+KM++LG  GD+S APVT++CFNQ GDLLLAGY+DG VTVWDV RA+AAKVI+GEH
Sbjct: 541  DSMDSKMMMLGLLGDRSPAPVTAMCFNQPGDLLLAGYADGHVTVWDVQRASAAKVITGEH 600

Query: 601  TSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFSVVPLLNRFSIKTQCLLDGQKTGT 660
            TSPVVH+LFLGQ++QVTRQFKAVTGD+KGLV LH+ SVVPLLNRFSIKTQCLLDGQKTG 
Sbjct: 601  TSPVVHTLFLGQDSQVTRQFKAVTGDTKGLVQLHSLSVVPLLNRFSIKTQCLLDGQKTGI 660

Query: 661  VLSASALLLNEFCGSSLPPSLSNVAVSTSSIGSMMGGVVGGDSGWKLFNEGSSLV-EGVV 720
            VLSAS LL +E CG +   S  N   S SSIGSMMGGVVG D+GWKLFNEGSSLV EGVV
Sbjct: 661  VLSASPLLFDESCGGAPLSSQGNSTASASSIGSMMGGVVGSDTGWKLFNEGSSLVEEGVV 720

Query: 721  IFATHQTALVVRLSPTVEVYARLSKPDGIQEGSMPYTAWKC-----SQSIETSPSEAVER 780
            IF T+QTALVVRL+PT+EVYA++ +PDG++EG+MPYTAWKC     S + E+ P+EA ER
Sbjct: 721  IFVTYQTALVVRLTPTLEVYAQIPRPDGVREGAMPYTAWKCMTTCRSSTTESIPTEAAER 780

Query: 781  ISLLAIAWDKMVQVAKLVKTELNVCGKWSLESAAIGVAWLDDQVLVILTVTGQLFLFEKD 840
            +SLLAIAWD+ VQVAKLVK+EL V GKWSL+SAAIGVAWLDDQ+LV+LT+ GQL+L+ +D
Sbjct: 781  VSLLAIAWDRKVQVAKLVKSELKVYGKWSLDSAAIGVAWLDDQMLVVLTLLGQLYLYARD 840

Query: 841  GTMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAYHNCVAVRGASVYVLGPKHLVISR 900
            GT+IHQTS  VDG    D + + ++F NV GNPEK+YHNCV+VRGAS+YVLGP HLV+SR
Sbjct: 841  GTVIHQTSFAVDGSQGYDLVGYRSYFTNVFGNPEKSYHNCVSVRGASIYVLGPMHLVVSR 900

Query: 901  LLPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVIDLPRSLESLQELVMPFLIELLLSY 960
            LLPWKER+QVLRKAGDWM AL+MA+T+YDG AHGVIDLPR+L+++QE +MP+L+ELLLSY
Sbjct: 901  LLPWKERIQVLRKAGDWMGALNMAMTLYDGQAHGVIDLPRTLDAVQEAIMPYLVELLLSY 960

Query: 961  VDEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNRVGGVAVEFCVHITRTDIL 1020
            VDEVFSYISVAFCNQIEK  +L++  S S + H+EIKEQ+ RVGGVAVEFCVHI RTDIL
Sbjct: 961  VDEVFSYISVAFCNQIEKLAQLNNPQSRSSTVHAEIKEQFTRVGGVAVEFCVHINRTDIL 1020

Query: 1021 FDEIFSKFVAVQQRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSQKGWLQRVEQCVLH 1080
            FD+IFSKF AVQ RDTFLELLEPYILKDMLGSLPPEIMQALVEHYS KGWLQRVEQCVLH
Sbjct: 1021 FDDIFSKFEAVQHRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSSKGWLQRVEQCVLH 1080

Query: 1081 MDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEELLAVLQTSLSLDSKKSLGG 1140
            MDISSLDFNQVVRLCR+HGL+ ALVYLFNKGLDDFR PLEELL VL+ S   +S  +LG 
Sbjct: 1081 MDISSLDFNQVVRLCREHGLHGALVYLFNKGLDDFRAPLEELLVVLRNS-ERESAYALG- 1140

Query: 1141 GWGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDELLQFLLENSDTVDTRSISN 1200
                   Y+ LVYLKYCF GLAFPPG GTL  +R+ SLR EL+QFLLE SD  ++++ S+
Sbjct: 1141 -------YRMLVYLKYCFKGLAFPPGHGTLPSTRLPSLRAELVQFLLEESDAQNSQAASS 1200

Query: 1201 KSSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKADSSLDGSVDASMQVQKEKNSTSG 1260
               +  YLNLYHLLELDT ATLDVLRCAF+E E  K+D       D + +        + 
Sbjct: 1201 LLLKGSYLNLYHLLELDTEATLDVLRCAFIEVETPKSDFYACDMADTNAEPNNGNKMVAE 1260

Query: 1261 RKNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVEDWPSKKDLFHLFDFVANYVACGK 1320
             +N LVQN V+ALVHILD+ IS T GS   D+   VE WPS KD+ H+F+F+A YVA G+
Sbjct: 1261 YQNMLVQNTVNALVHILDEDISSTDGSASKDDSGSVEAWPSTKDIGHIFEFIACYVASGR 1320

Query: 1321 ATASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSVHSRKREKQVLSLLEVIPETHWNP 1380
            AT SK V+ QIL++L S  ++P+       S+ ++   S++REKQ+L+LLE +PET WN 
Sbjct: 1321 ATVSKSVLSQILQYLTSEKNVPQ-------SILSHIETSKRREKQLLALLEAVPETDWNA 1380

Query: 1381 SSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDVEEPIHAFAFINRTLMELSNSEQT 1440
            S VL +CE A F+QVCGLIH+I + Y +ALD YMKDV+EPI AF+FI+ TL++L+++E T
Sbjct: 1381 SEVLHLCENAHFYQVCGLIHTIRYNYLAALDSYMKDVDEPICAFSFIHDTLLQLTDNEYT 1440

Query: 1441 EFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDILSRLHNHPRSLFLYLKTLIEVHQS 1500
             F   VISRIPEL+ L+RE TFFLVID F ++ S ILS L +HP+SLFLYLKT++EVH  
Sbjct: 1441 AFHSAVISRIPELICLSREATFFLVIDQFNDEASHILSELRSHPKSLFLYLKTVVEVHLH 1500

Query: 1501 GNLNFSCLKKDDNFG------VNYSTKGLDDYLQKLSDFPKFLSNNPVDVTDDIIELYVE 1560
            G LN S L+KDD         V Y +KGL  Y++++SD PKFLS+N V VTDD+IELY+E
Sbjct: 1501 GTLNLSYLRKDDTLDVANCKWVKYQSKGLGAYIERISDLPKFLSSNAVHVTDDMIELYLE 1560

Query: 1561 LLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDAAAFLLERVGDVGSALFLTLSSLD 1620
            LLCRYER+SVLKFLETFDSY VE+CLRLCQ+Y + DAAAFLLERVGDVGSAL LTLS L+
Sbjct: 1561 LLCRYERDSVLKFLETFDSYRVEYCLRLCQEYGITDAAAFLLERVGDVGSALLLTLSELN 1620

Query: 1621 KKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQEVNDIYVLLHACIGLCQRNTPRLNS 1680
             KF  LE AVG  +    S+GS   + FS+VL ++EVND+  +L ACIGLCQRNTPRLN 
Sbjct: 1621 DKFAALETAVGSALPIAVSNGSVSVEHFSTVLNMEEVNDVNNILRACIGLCQRNTPRLNP 1680

Query: 1681 EESETLWFKLLDSFCEPLIESFNYRTASFGENQVQFLNESSGSQKDKEAHIVTWRILKSN 1740
            EESE LWFKLLDSFCEPL+ SF  R AS  EN  + L ES GSQ+D EA I+ WRI KS+
Sbjct: 1681 EESEVLWFKLLDSFCEPLMGSFVER-ASERENHSRMLEESFGSQEDAEACIIKWRISKSH 1740

Query: 1741 QTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLLSDNGSQEFGDFKLTILGMLGTFGFE 1800
            + +HILR LFS+FI+EIVEGMIGYVHLPTIMS+LLSDNGSQEFGDFKLTILGMLGT+ FE
Sbjct: 1741 RGSHILRKLFSQFIKEIVEGMIGYVHLPTIMSKLLSDNGSQEFGDFKLTILGMLGTYSFE 1800

Query: 1801 RRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGAVCCICNRLLVKSSSSYRVRVFNCGH 1860
            RRILDTAK+LIEDDTFYTMS+LKK ASHGYAPR  +CCICN LL K+SSS+++RVFNCGH
Sbjct: 1801 RRILDTAKSLIEDDTFYTMSVLKKEASHGYAPRSLLCCICNCLLTKNSSSFQIRVFNCGH 1860

Query: 1861 ATHLQCEVLDNEASGGD--FSCPVCVHSNHSQRSRGK-ALTEYSLVNKFSSRTQSSSGAS 1920
            ATH+QCE+L+NE+S       CP+C+   ++QRSR K  L E  LV+KFSSR Q S G +
Sbjct: 1861 ATHIQCELLENESSSKSNLSGCPLCMPKKNTQRSRNKTVLAESGLVSKFSSRPQQSLGTT 1920

Query: 1921 VSYPQETDLLELPYTLQQIPRFEILANLQKNQRVIDIENMPQLRLAPPAVYHDKVTKGYH 1972
            + +  E+D  +    +QQ+ RFEIL NL+K+QRV+ IENMPQLRLAPPA+YH+KV KG  
Sbjct: 1921 L-HSHESDTSDYSNGIQQLSRFEILNNLRKDQRVVQIENMPQLRLAPPAIYHEKVKKGTD 1944

BLAST of Cp4.1LG03g15580.1 vs. TrEMBL
Match: M5X747_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000078mg PE=4 SV=1)

HSP 1 Score: 2199.1 bits (5697), Expect = 0.0e+00
Identity = 1208/1988 (60.76%), Positives = 1481/1988 (74.50%), Query Frame = 1

Query: 1    MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDL-SFPHRSIDEILNESSSSTS 60
            MTK+LT  E    MELDL++F+++HLS  D+D   D+D+L S PHR+IDEILN+S SS S
Sbjct: 1    MTKKLTQFEPQLAMELDLDSFLNSHLSLSDED---DDDNLNSVPHRTIDEILNDSDSSAS 60

Query: 61   SSPSSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQL 120
            SSP S  +                  AS   PP        + ++K   S+Q     V+ 
Sbjct: 61   SSPPSTIHR----------------LASDPKPPHPPTDAVSVSSAKSDESSQ-----VRP 120

Query: 121  KPGSVSHTKVGELTDDPFRRGSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPHAAAI 180
            +P   +  K GEL+DDP  + S+  P L GG+R+NAKPGAALAAAAAASRSMP PHAAAI
Sbjct: 121  RPNLYTRVKSGELSDDPVGKVSKPSPWLLGGMRTNAKPGAALAAAAAASRSMPTPHAAAI 180

Query: 181  KSRRSGHGSV---VLDDDELASSSAVDSEFVFDDLYSTIDHSKESREKSISLVERNADYQ 240
            KS+RS    +   VL+  EL   S V S    D    T   S E  E + +  E   D+ 
Sbjct: 181  KSKRSAGSGIFQKVLESTELDDKSEVGSNSNND----TNVGSSEVTESNSN--EGEVDFG 240

Query: 241  GASVNVGVEFWARDNIRDCVLYNDEFRITKDTECEAEQSFVDDVNFDESSTTLPPVEANG 300
               +  G   W R+  R+    +    ++     E     V +V+FDE+ T L   +AN 
Sbjct: 241  DELLRKG-RAWERE--RELEETSQGIEVSAGNAPEE----VKNVSFDENLTNL---DAND 300

Query: 301  RSLSDSADNNVCSMDAEPTVLDGDESNEGAFPCSPKPDYDRSAVGYGRLELETQDFEKRS 360
               ++  +N     + +P + D DE++ G    S   D +   +G G       D E   
Sbjct: 301  VEDNEFNNNVEVVEECQPEIQDIDENSPG----SKHSDSEEERLGDGGGGGNDNDGEGGG 360

Query: 361  -QPSKDSEVLAIEDLSVVNDISES-RETAKQLDNFHTGERAETMSLSSSNPLELAEEIEK 420
                 +++  + +D  + + I++   E   QL++    ++AE        PLE+AEE+EK
Sbjct: 361  GDDDNNNDRDSNDDGELGSSITQLVEERIGQLESRRISKKAEK---KLQKPLEIAEELEK 420

Query: 421  KQAFTALHWEEGVAAQPMRLEGIKGGTTALGYFDIQADNSISRTISSHSFKREHGFPQAL 480
            KQA TALHWEEG AAQPMRLEG++ G+T LGYF++ A+N I+RT+S+ + +R+HG PQ L
Sbjct: 421  KQASTALHWEEGAAAQPMRLEGVRRGSTTLGYFNVDANNPITRTLSAPALRRDHGSPQVL 480

Query: 481  AVHANYIAVGMSKGNIVVVASKYSAQNGDNMDAKMLLLGSQGDKSTAPVTSLCFNQQGDL 540
            AVH+NYIA+GM++G I+V+ SKYSA N D MDAKML+LG QG++S A VTS+CFNQQGDL
Sbjct: 481  AVHSNYIAIGMARGAILVIPSKYSAHNADIMDAKMLILGLQGERSYAAVTSICFNQQGDL 540

Query: 541  LLAGYSDGQVTVWDVLRATAAKVISGEHTSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVL 600
            LLAGY+DG +TVWDV R++ AKVI+GEHT+PVVH+LFLGQ++QVTRQFKAVTGDSKGLVL
Sbjct: 541  LLAGYADGHITVWDVQRSSVAKVITGEHTAPVVHTLFLGQDSQVTRQFKAVTGDSKGLVL 600

Query: 601  LHTFSVVPLLNRFSIKTQCLLDGQKTGTVLSASALLLNEFCGSSLPPSLSNVAVSTSSIG 660
            LH+FSVVPLLNRFSIKTQCLLDGQ+TGTVLSAS LL +EF G +   +  N  V+ SSIG
Sbjct: 601  LHSFSVVPLLNRFSIKTQCLLDGQRTGTVLSASPLLFDEFSGGASQSAQGNGTVTGSSIG 660

Query: 661  SMMGGVVGGDSGWKLFNEGSSLVE-GVVIFATHQTALVVRLSPTVEVYARLSKPDGIQEG 720
             MMGGVVGGD+ WKLFNEGSSLVE GVV+F THQTALVVRL+P +EVYA+LSKP+G++EG
Sbjct: 661  GMMGGVVGGDASWKLFNEGSSLVEEGVVVFVTHQTALVVRLTPNLEVYAQLSKPEGVREG 720

Query: 721  SMPYTAWKCSQ-------SIETSPSEAVERISLLAIAWDKMVQVAKLVKTELNVCGKWSL 780
            +MP TAWKC+        + E  P+E VER+SLLAIAWD+ VQVAKLVK+EL V GKWSL
Sbjct: 721  AMPSTAWKCTTQSRRLPANTENMPAEVVERVSLLAIAWDRKVQVAKLVKSELKVYGKWSL 780

Query: 781  ESAAIGVAWLDDQVLVILTVTGQLFLFEKDGTMIHQTSVFVDGFDKEDFIAHHTHFVNVL 840
            ESAAIGVAWLDDQ+LV+L +TGQL LF KDGT+IHQTS  VDGF  +D IA+HTHFVN+ 
Sbjct: 781  ESAAIGVAWLDDQMLVVLMMTGQLCLFAKDGTVIHQTSFSVDGFGGDDLIAYHTHFVNIF 840

Query: 841  GNPEKAYHNCVAVRGASVYVLGPKHLVISRLLPWKERVQVLRKAGDWMTALSMAITIYDG 900
            GNPEKAYHNCVAVRGASVYVLGP HL++SRLLPWKER+QVLR AGDWM AL+MA+TIYDG
Sbjct: 841  GNPEKAYHNCVAVRGASVYVLGPMHLIVSRLLPWKERIQVLRSAGDWMGALNMAMTIYDG 900

Query: 901  HAHGVIDLPRSLESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDDVTSGSH 960
             AHGV+DLPR+L ++QE +M +L+ELLLSYV+EVFSYISVA  NQI   +++DD+ S S 
Sbjct: 901  QAHGVVDLPRTLVAVQEAIMSYLVELLLSYVEEVFSYISVALGNQIGIMDQVDDLNSKSS 960

Query: 961  SAHSEIKEQYNRVGGVAVEFCVHITRTDILFDEIFSKFVAVQQRDTFLELLEPYILKDML 1020
            S HSEIKEQY RVGGVAVEFCVHI RTDILFDEIFSKFVAVQQRDTFLELLEPYILKDML
Sbjct: 961  SVHSEIKEQYTRVGGVAVEFCVHIKRTDILFDEIFSKFVAVQQRDTFLELLEPYILKDML 1020

Query: 1021 GSLPPEIMQALVEHYSQKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNK 1080
            GSLPPEIMQALVEHYS+KGWLQRVEQCVLHMDISSLDFNQVVRLCR+HGLYSALVYLFNK
Sbjct: 1021 GSLPPEIMQALVEHYSRKGWLQRVEQCVLHMDISSLDFNQVVRLCREHGLYSALVYLFNK 1080

Query: 1081 GLDDFRTPLEELLAVLQTSLSLDSKKSLGGGWGSMGMYKTLVYLKYCFSGLAFPPGQGTL 1140
            GLDDFR+PLEELL VLQ     +SKK    G  ++G Y+ LVYLKYCFSGLAFPPGQGT+
Sbjct: 1081 GLDDFRSPLEELLVVLQ-----NSKKE---GATALG-YRMLVYLKYCFSGLAFPPGQGTI 1140

Query: 1141 AHSRVQSLRDELLQFLLENSDTVDTRSISNKSSEVGYLNLYHLLELDTGATLDVLRCAFV 1200
               R+ SLR ELLQFLLE SD  ++R+   +     YLNLY LLELDT ATLDVLRCAF+
Sbjct: 1141 PAPRLPSLRTELLQFLLEGSDAPNSRAGGGE-----YLNLYLLLELDTEATLDVLRCAFI 1200

Query: 1201 EGEIVKADSSLDGSVDASMQVQKEKNSTSGRKNFLVQNVVDALVHILDKAISQTYGSPGG 1260
            E EI K D S   S DA+M++    NS +  +N +VQN VD L+HI+ K ISQT GSP  
Sbjct: 1201 EDEISKPDVSSHDSADANMELPDGNNSMAQSQNSMVQNTVDTLIHIVSKGISQTDGSPSN 1260

Query: 1261 DNITLVEDWPSKKDLFHLFDFVANYVACGKATASKDVVGQILEHLISNSDIPEMEIDFVH 1320
            D      +WPSKKD+  LF+F+A YVACG+A  SK V+ QILE+L S+++ P        
Sbjct: 1261 DETASTVEWPSKKDIGDLFEFIAYYVACGRANVSKHVLSQILEYLTSDNNFPSW------ 1320

Query: 1321 SVTANSVHSRKREKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHSISHQYSSAL 1380
             V+ +++ S+KREKQVL LLEV+PET W+ S VL++CEKA+F+QVCGLIH+  HQY +AL
Sbjct: 1321 -VSGDTITSKKREKQVLGLLEVVPETDWDSSYVLQLCEKARFYQVCGLIHNSRHQYLAAL 1380

Query: 1381 DGYMKDVEEPIHAFAFINRTLMELSNSEQTEFRGVVISRIPELLNLNREGTFFLVIDHF- 1440
            D YMKDV+EPIHAF+FIN+TL++L+++E   FR  VISRIPEL +LNREGTF LVIDHF 
Sbjct: 1381 DCYMKDVDEPIHAFSFINKTLLQLTDNESAAFRSEVISRIPELFDLNREGTFVLVIDHFT 1440

Query: 1441 GNDVSDILSRLHNHPRSLFLYLKTLIEVHQSGNLNFSCLKKDDNFGVNYSTKGLDDYLQK 1500
              + S ILS L +HP+SLFLYLKT+IEVH SG L+FS L+KDD   V   +K ++ YL++
Sbjct: 1441 SEEGSHILSELRSHPKSLFLYLKTVIEVHLSGTLDFSSLRKDDLVRVKDQSKAVEAYLER 1500

Query: 1501 LSDFPKFLSNNPVDVTDDIIELYVELLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVI 1560
            + DFPK L NNPV+VTDD+IELY+ELLC+YER SVLKFLETFDSY VEHCLRLCQ+Y + 
Sbjct: 1501 ICDFPKLLRNNPVNVTDDMIELYLELLCQYERNSVLKFLETFDSYRVEHCLRLCQKYGIT 1560

Query: 1561 DAAAFLLERVGDVGSALFLTLSSLDKKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQ 1620
            DAA+FLLERVGDVGSAL LTLS+L++KF  L+ AVG +V    SSGS  ++ FS+ LKL+
Sbjct: 1561 DAASFLLERVGDVGSALLLTLSTLNEKFIKLDTAVGSLV----SSGSARTEHFSNALKLE 1620

Query: 1621 EVNDIYVLLHACIGLCQRNTPRLNSEESETLWFKLLDSFCEPLIESFNYRTASFGENQVQ 1680
            EV+DI  +LHACIGLCQRNT RLN +ESE LWF+LLDSFCEPL +S N    S G++   
Sbjct: 1621 EVSDINSILHACIGLCQRNTHRLNPDESEALWFRLLDSFCEPLTDSLNAGRVSKGDDLKT 1680

Query: 1681 FLNESSGSQKDKEAHIVTWRILKSNQTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLL 1740
             + ES  S++D+ A I+ WRI K ++ AHILR +FSRFI+EIVEGMIGYV LPTIMS+LL
Sbjct: 1681 VVAESLESEEDEVAFIIEWRISKLHKGAHILRKVFSRFIKEIVEGMIGYVRLPTIMSKLL 1740

Query: 1741 SDNGSQEFGDFKLTILGMLGTFGFERRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGA 1800
            SDNGSQEFGDFK TILGML T+GFERRILDTAK+LIEDDTFYTMS+LKKGASHGYAPR  
Sbjct: 1741 SDNGSQEFGDFKFTILGMLSTYGFERRILDTAKSLIEDDTFYTMSILKKGASHGYAPRSQ 1800

Query: 1801 VCCICNRLLVKSSSSYRVRVFNCGHATHLQCEVLDNEASGGDFS--CPVCVHSNHSQRSR 1860
            +CCIC+ LL K+SSSY +R+FNCGHATHLQCEVL+N  S    S  CPVC+    SQRSR
Sbjct: 1801 ICCICDCLLDKNSSSY-IRIFNCGHATHLQCEVLENGTSSSSSSSGCPVCMPKKKSQRSR 1860

Query: 1861 GKA-LTEYSLVNKFSSRTQSSSGASVSYPQETDLLELPYTLQQIPRFEILANLQKNQRVI 1920
             K+ L E SLV  FSSRTQ   G +V +P E++  E  Y L QI RFE+L NLQ+++ ++
Sbjct: 1861 NKSVLPEKSLVKGFSSRTQQIHGTTV-HPHESNASENTYGLHQISRFEMLTNLQRDRGLV 1913

Query: 1921 DIENMPQLRLAPPAVYHDKVTKGYHLLVGDSSGGVEKVEKLNKSRQLKEVKVKRPSSLRF 1971
            +IENMPQLRLAPPAVYH+KV KG  L   +SS  +  + K +K++QL+E+KVK  SSLRF
Sbjct: 1921 EIENMPQLRLAPPAVYHEKVQKGTVLSPAESSSDLATIGKQSKTKQLRELKVK-GSSLRF 1913

BLAST of Cp4.1LG03g15580.1 vs. TAIR10
Match: AT4G00800.1 (AT4G00800.1 transducin family protein / WD-40 repeat family protein)

HSP 1 Score: 1843.2 bits (4773), Expect = 0.0e+00
Identity = 1050/1982 (52.98%), Positives = 1353/1982 (68.26%), Query Frame = 1

Query: 14   MELDLNAFIHAHLSSGDDDHDHDEDDLSFPHRSIDEILNESSSSTSSSPSSPPNSPPPRA 73
            MELDL++F+ +     D D D D D  S PHR++DEILN SSSS++SS  SPP+SPP   
Sbjct: 1    MELDLDSFLVS-----DSDSDSDLDSSSVPHRTVDEILNASSSSSASS--SPPSSPPSIN 60

Query: 74   RRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQLKPGSVSHTKVGELT 133
            RR     + R S + +      P  E+ +     R N  +  S++  P            
Sbjct: 61   RRKQDDPNRRLSEALTNVAVLRPESELHRGFPPTRRNSTSSSSLRQLP------------ 120

Query: 134  DDPFRRGSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPHAAAIKSRRSGHGSVVL-- 193
                      LPSL  GVRSN KPGAALAAA AASR +P PHAA IKSRR+   S  L  
Sbjct: 121  ----------LPSLLAGVRSNVKPGAALAAAVAASRLVPTPHAAIIKSRRASSASSELLL 180

Query: 194  ------DDDELASSSAVDSEFVF------DDLYS----TIDHSKESREKSISLVERNADY 253
                  +DD    SS  DS  V       DD  S    ++   +++    ++ +E  A  
Sbjct: 181  QVSNQEEDDHEVLSSNGDSVGVAAGSVSADDFRSFGGESLLEDEDNGVSGVASLEDEAKV 240

Query: 254  QGASVNVGVEFWARDNIRDCVLYNDEFRITKDTECEAEQSFVDDVNFDESSTTLPPVEAN 313
                 +   E    D +     ++ E  ++ + E E      +    D++  T+      
Sbjct: 241  MEVQASDITESLNPDLVTVSSGFDSEGNVSTEKEAETTMEAGNAAIDDDTDETMLVA--- 300

Query: 314  GRSLSDSADNNVCSMDAEPTVLDGDESNEGAFPCSPKPDYDRSAVGYGRLELETQDFEKR 373
              SL +S+++   + D+E    D   SN+           + S+VG   ++ +  D    
Sbjct: 301  --SLVESSESQHLT-DSEGKCDDAKVSND-----------EESSVG--DVKSDKSDIIIP 360

Query: 374  SQPSKDSEVLAIEDLSVVNDISES-RETAKQLDNFHTGERAETMSLSSSNPLELAEEIEK 433
                +  +    +D S ++ ISE   E   +L+N    +R    S S    L LAEE EK
Sbjct: 361  ESKKEGGDAFIPDDGSSMSGISELVEERIAELENERMSKRERLKSQSFRKQLVLAEEFEK 420

Query: 434  KQAFTALHWEEGVAAQPMRLEGIKGGTTALGYFDIQADNSISRTISSHSFKREHGFPQAL 493
            KQA+T LHWEEG AAQPMRLEG+K G+T LGYFD+ ADN ISRTISS +FKR+HG PQ L
Sbjct: 421  KQAYTGLHWEEGAAAQPMRLEGVKIGSTNLGYFDVDADNVISRTISSQAFKRDHGSPQVL 480

Query: 494  AVHANYIAVGMSKGNIVVVASKYSAQNGDNMDAKMLLLGSQGDKSTAPVTSLCFNQQGDL 553
            AVH NYIAVG SKG IVVV SKYS+ + D M++KM+ LG QG++S +PVTS+CFNQ G L
Sbjct: 481  AVHLNYIAVGTSKGVIVVVPSKYSSDHADQMESKMIWLGLQGERSQSPVTSVCFNQIGSL 540

Query: 554  LLAGYSDGQVTVWDVLRATAAKVISGEHTSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVL 613
            LLAGY DG VTVWD+ RA+ AKVI+ EHT+PVV++ FLG+++Q +RQFK +T D+KG+V 
Sbjct: 541  LLAGYGDGHVTVWDMQRASIAKVIT-EHTAPVVYAFFLGRDSQGSRQFKVITSDTKGVVF 600

Query: 614  LHTFSVVPLLNRFSIKTQCLLDGQKTGTVLSASALLLNEFCGSSLPPSLSNVAVSTSSIG 673
             H+FS   LLN ++++TQCLLDGQK GTVLSAS L    F  S +     N AV +SSI 
Sbjct: 601  KHSFSYARLLNMYTVETQCLLDGQKNGTVLSASPLPDENFGSSLVSSKGGNSAVPSSSIS 660

Query: 674  SMMGGVVGGDSGWKLFNEGSSLVE-GVVIFATHQTALVVRLSPTVEVYARLSKPDGIQEG 733
            SMMGGVVG  S WKLFNE S+ VE GVVIFAT+QT LVV+L P +EVYA+L +P+G++EG
Sbjct: 661  SMMGGVVGVGSTWKLFNEDSTSVEEGVVIFATYQTGLVVKLIPNLEVYAQLPRPEGVREG 720

Query: 734  SMPYTAWKCSQSIETSPSEAVERISLLAIAWDKMVQVAKLVKTELNVCGKWSLESAAIGV 793
            SMPYTAW+  +S E    EA +R+S L IAWD+ VQVAKLVK+++    KWSL+S AIGV
Sbjct: 721  SMPYTAWR--RSTENYSKEAEDRVSFLVIAWDRRVQVAKLVKSDIKEYAKWSLDSPAIGV 780

Query: 794  AWLDDQVLVILTVTGQLFLFEKDGTMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAY 853
             WLDDQ+LVI TVTG L+LF +DG +IHQT+  V G    D I++HT+F NV GNPEKAY
Sbjct: 781  VWLDDQLLVIPTVTGHLYLFTRDGVVIHQTNFSVAGSSGNDLISYHTYFTNVFGNPEKAY 840

Query: 854  HNCVAVRGASVYVLGPKHLVISRLLPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVID 913
            HN + VRGASVY+LG  HLVISRLLPWKERV VLR+ GDWM A +MA+++++G AHGV+D
Sbjct: 841  HNSMGVRGASVYILGTAHLVISRLLPWKERVDVLRRGGDWMGAFNMAMSLFNGQAHGVVD 900

Query: 914  LPRSLESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIK 973
            LP+++++++E + P L ELLLSYVDEVFSYIS+AF NQIE N    + +SG ++ + EI+
Sbjct: 901  LPKTVDAIREAIAPSLAELLLSYVDEVFSYISIAFSNQIENNGVTHEPSSGINNVNLEIE 960

Query: 974  EQYNRVGGVAVEFCVHITRTDILFDEIFSKFVAVQQRDTFLELLEPYILKDMLGSLPPEI 1033
            EQYNRVGGVAVEFCVHI R D+LFDEIFS+FVAVQQRDTFLELLEPYIL+DMLGSLPPEI
Sbjct: 961  EQYNRVGGVAVEFCVHINRMDLLFDEIFSRFVAVQQRDTFLELLEPYILRDMLGSLPPEI 1020

Query: 1034 MQALVEHYSQKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRT 1093
            MQALVEHYS+KGWLQR+EQCVLHMDISSLDFNQVVR+CR+HGLY AL+YLFNKGLDDFR+
Sbjct: 1021 MQALVEHYSRKGWLQRIEQCVLHMDISSLDFNQVVRICREHGLYGALLYLFNKGLDDFRS 1080

Query: 1094 PLEELLAVLQTSLSLDSKKSLGGGWGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQS 1153
            PLEELL VL+ S   + +++   G      Y+ LVYLKYCF GLAFPPG GTL  +R  S
Sbjct: 1081 PLEELLIVLRNS---EKQRATAIG------YRMLVYLKYCFLGLAFPPGHGTLNPTRWPS 1140

Query: 1154 LRDELLQFLLENSDTVDTRSISNKSSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKA 1213
            LR EL+QFLLE S+  D+ +    +S + YLNLYHLLE+DT ATLDVLR AFVE E+VK 
Sbjct: 1141 LRSELIQFLLEKSNAHDSSTCV--TSRLNYLNLYHLLEMDTEATLDVLRYAFVENEMVKH 1200

Query: 1214 DSSLDGSVDASMQVQKEKNSTSGRKNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVE 1273
            +S L    + S++ + + +      + L+QN+VDALVH+ D  +S   G P        +
Sbjct: 1201 ESHLLEYGEVSVESKTDGSLPEVSNDILIQNLVDALVHVPDWGVSNESGDPIDSKSD--K 1260

Query: 1274 DWPSKKDLFHLFDFVANYVACGKATASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSV 1333
            +WPSK+D  HLF+FVA Y A G+ + SK V+ QIL++L S+           H +   +V
Sbjct: 1261 NWPSKEDTSHLFEFVAYYAARGRVSISKSVLAQILDYLTSD-----------HILPTYNV 1320

Query: 1334 HSRKREKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDV 1393
             S+ RE Q+L+LL+ +PET W+   V ++CEKA F+QVCG IH I  +Y +ALD Y+K+ 
Sbjct: 1321 SSKMRENQLLNLLKAVPETDWDADYVSQLCEKAHFYQVCGYIHIIDRRYVAALDSYVKEA 1380

Query: 1394 EEPIHAFAFINRTLMELSNSEQTEFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDIL 1453
            +EPIH F ++N+ L +LS  E T F+  +ISRIPELL+L+R+G FFL+I +  + +  I 
Sbjct: 1381 DEPIHLFCYVNKMLSQLSGDEFTAFQSAIISRIPELLDLSRQGAFFLIICNLKDTIKRIQ 1440

Query: 1454 SRLHNHPRSLFLYLKTLIEVHQSGNLNFSCLKKD---DNFGVNYS---TKGLDDYLQKLS 1513
             +LH+HPRSLFLYLKT+IEV+ SG+L+FS L+K    D+ G N      K    YL+ L+
Sbjct: 1441 EQLHSHPRSLFLYLKTVIEVYLSGSLDFSRLRKHEAVDSSGENIRRDIPKEAKIYLEGLN 1500

Query: 1514 DFPKFLSNNPVDVTDDIIELYVELLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDA 1573
            DFPKF+ +NPV+VTDD+IELYVELLC+YE +SVLKFLETFDSY VEHCLRLCQ+Y ++DA
Sbjct: 1501 DFPKFIQDNPVNVTDDMIELYVELLCKYEPKSVLKFLETFDSYRVEHCLRLCQEYGIVDA 1560

Query: 1574 AAFLLERVGDVGSALFLTLSSLDKKFHDLEAAVGGIVTN---GASSGSNDSQLFSSVLKL 1633
            AAFLLERVGD GSAL LTLS L++K+ +LE AV  +++    GAS G++  + FSS L+L
Sbjct: 1561 AAFLLERVGDAGSALSLTLSGLNEKYVELEIAVECLMSEMKLGASEGAS-LEHFSSALEL 1620

Query: 1634 QEVNDIYVLLHACIGLCQRNTPRLNSEESETLWFKLLDSFCEPLIESFNYRTASFGENQV 1693
            +EV+DI  +L ACIGLCQRNTPRLN EESE LWF+ LD+FCEPL+ES+     + G    
Sbjct: 1621 KEVHDIQGVLQACIGLCQRNTPRLNPEESEILWFRFLDTFCEPLMESYREPKNTDG---- 1680

Query: 1694 QFLNESSGSQKDKEAHI------VTWRILKSNQTA-HILRSLFSRFIREIVEGMIGYVHL 1753
              +N+ S   K  E H+      + WRI +S+  A HILR L S+FI+EIVEGMIGYV L
Sbjct: 1681 --INKGSLGVKSLERHVNESDVAIKWRIPRSDTAATHILRKLISQFIKEIVEGMIGYVRL 1740

Query: 1754 PTIMSRLLSDNGSQEFGDFKLTILGMLGTFGFERRILDTAKALIEDDTFYTMSLLKKGAS 1813
            PTIM++LLSDNG+QEFGDFKLTILGMLGT+GFERRILDTAK+LIEDDTFY+M+LLKKGAS
Sbjct: 1741 PTIMTKLLSDNGTQEFGDFKLTILGMLGTYGFERRILDTAKSLIEDDTFYSMNLLKKGAS 1800

Query: 1814 HGYAPRGAVCCICNRLLVKSSSSYRVRVFNCGHATHLQCEVLDNEASGGDFS-------C 1873
            HGYAPR  +CCIC+  L K+ S+ RVRVFNCGHATHLQCE  +NE S    S       C
Sbjct: 1801 HGYAPRSLLCCICSCPLTKTFSALRVRVFNCGHATHLQCEPSENETSTSASSIHVSSSGC 1860

Query: 1874 PVCVHSNHSQRS-RGKAL-TEYSLVNKFSSRTQSSSGASVSYPQETDLLELPYTLQQIPR 1933
            PVC+    S+ S +GK+   +Y L++  SS   SS  AS  Y  E ++ +  +  QQ+ R
Sbjct: 1861 PVCMTKKTSKSSLKGKSFYRDYGLISTVSSNAGSSQRAS-PYSHENEMSDHSHN-QQLSR 1898

Query: 1934 FEILANLQKNQRVIDIENMPQLRLAPPAVYHDKVTKGYHLLVGDSSGGVEKVEKLNKSRQ 1951
            FEIL NLQK+QR++ IE++P+LRLAPPAVYH+KV++      G+SSG   K  K  + ++
Sbjct: 1921 FEILTNLQKDQRLVQIESLPRLRLAPPAVYHEKVSRLSGFTPGESSGKDTKPVKTGQGKK 1898

BLAST of Cp4.1LG03g15580.1 vs. NCBI nr
Match: gi|778676625|ref|XP_011650623.1| (PREDICTED: vacuolar protein sorting-associated protein 8 homolog [Cucumis sativus])

HSP 1 Score: 3278.8 bits (8500), Expect = 0.0e+00
Identity = 1697/1973 (86.01%), Positives = 1795/1973 (90.98%), Query Frame = 1

Query: 1    MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDLSFPHRSIDEILNESSSSTSS 60
            MT+ELTDTETLPPMELDLNAFIHAHLSSG DD D  +DDLSFPHRSIDEILN+SSSSTS 
Sbjct: 1    MTEELTDTETLPPMELDLNAFIHAHLSSGGDDDD--DDDLSFPHRSIDEILNDSSSSTSP 60

Query: 61   SPSSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQLK 120
            SPSS P+ PPPR RR+I A D   SAS S  P+K         S+  R+N  NEKS QLK
Sbjct: 61   SPSSSPHFPPPRGRRNIVAGDDGVSASPSTSPYKD--------SEAARNNPWNEKSAQLK 120

Query: 121  PGSVSHTKVGELTDDPFRRGSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPHAAAIK 180
            PG+ SH+KVGELTDDPFRRGSR LPSLFG VRSNAKPGAALAAAAAASRS PAPHAAAIK
Sbjct: 121  PGTASHSKVGELTDDPFRRGSRPLPSLFGAVRSNAKPGAALAAAAAASRSTPAPHAAAIK 180

Query: 181  SRRSGHGSVVLDDDELASSSAVDSEFVFDDLYSTIDHSKESREKSISLVERNADYQGASV 240
            SRR+G+G++VLDDDELASSSAVDSEF  D LY    HSKES E SIS+V+R  DYQ AS+
Sbjct: 181  SRRAGYGNMVLDDDELASSSAVDSEFFSDSLYHANIHSKESGENSISVVDRITDYQIASM 240

Query: 241  NVGVEFWARDNIRDCVLYNDEFRITKDTECEAEQSFVDDVNFDESSTTLPPVEANGRSLS 300
            NV  E WA +NIRD V +NDEFR+T+D E EAE S VDDVNF ES +T+PPVE N RSL 
Sbjct: 241  NVSGELWATNNIRDGVPHNDEFRMTEDMEFEAETSSVDDVNFKESLSTVPPVETNDRSLL 300

Query: 301  DSADNNVCSMDAEPTVLDGDESNEGAFPCSPKPDYDRSAVGYGRLELETQDFEKRSQPSK 360
              A+ NVCS DA PT LD DESNEGA P   +PD + SAVGYG LELETQDFEK  QPSK
Sbjct: 301  GPAEKNVCSTDAHPTELDVDESNEGAIPRPTEPDDEESAVGYGSLELETQDFEKYHQPSK 360

Query: 361  DSEV-LAIEDLSVVNDISESRETAKQLDNFHTGERAETMSLSSSNPLELAEEIEKKQAFT 420
            D+EV LAIED S+VNDI ES ET +Q DN   G+R E +S+SS+NPL+LAEEIEKKQAFT
Sbjct: 361  DTEVDLAIEDPSIVNDIIESGETTEQPDNLQIGKRPEMISVSSTNPLDLAEEIEKKQAFT 420

Query: 421  ALHWEEGVAAQPMRLEGIKGGTTALGYFDIQADNSISRTISSHSFKREHGFPQALAVHAN 480
            ALHWEEGVAAQPMRLEGIKG TT LGYFDIQADNSISRTISSHSF+REHGFPQ LAVHAN
Sbjct: 421  ALHWEEGVAAQPMRLEGIKGVTTTLGYFDIQADNSISRTISSHSFRREHGFPQVLAVHAN 480

Query: 481  YIAVGMSKGNIVVVASKYSAQNGDNMDAKMLLLGSQGDKSTAPVTSLCFNQQGDLLLAGY 540
            YIAVGMSKGNIVVVASKYSAQNGDNMDAKM+LLGSQGDKSTAP TSLCF+QQGDLLLAGY
Sbjct: 481  YIAVGMSKGNIVVVASKYSAQNGDNMDAKMILLGSQGDKSTAPATSLCFSQQGDLLLAGY 540

Query: 541  SDGQVTVWDVLRATAAKVISGEHTSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFS 600
            SDG +TVWDVLRA+AAKVISGEH SPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFS
Sbjct: 541  SDGHITVWDVLRASAAKVISGEHASPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFS 600

Query: 601  VVPLLNRFSIKTQCLLDGQKTGTVLSASALLLNEFCGSSLPPSLSNVAVSTSSIGSMMGG 660
            VVPLLNRFS KTQCLLDGQKTGTVLSASALLLNEF GSSLPP+LSNVAVSTSSIGSMMGG
Sbjct: 601  VVPLLNRFSSKTQCLLDGQKTGTVLSASALLLNEFVGSSLPPTLSNVAVSTSSIGSMMGG 660

Query: 661  VVGGDSGWKLFNEGSSLVE-GVVIFATHQTALVVRLSPTVEVYARLSKPDGIQEGSMPYT 720
            VVGGDSGWKLFNEGSSLVE GVVIFATHQTALVVRLSPTVEVYA+LSKPDGI+EGSMPYT
Sbjct: 661  VVGGDSGWKLFNEGSSLVEEGVVIFATHQTALVVRLSPTVEVYAQLSKPDGIREGSMPYT 720

Query: 721  AWKCSQSIETSPSEAVERISLLAIAWDKMVQVAKLVKTELNVCGKWSLESAAIGVAWLDD 780
            AWKCSQS ETSPSEAVER+SLLAIAWDKMVQVAKLVKTEL VCGKWSLESAAIGV WLDD
Sbjct: 721  AWKCSQSFETSPSEAVERVSLLAIAWDKMVQVAKLVKTELKVCGKWSLESAAIGVVWLDD 780

Query: 781  QVLVILTVTGQLFLFEKDGTMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAYHNCVA 840
            QVLVILTVTGQLFLFEKDGTMIHQTS+FVDGF KEDFIA+HTHF N+LGNPEKAYHNCVA
Sbjct: 781  QVLVILTVTGQLFLFEKDGTMIHQTSIFVDGFVKEDFIAYHTHFANILGNPEKAYHNCVA 840

Query: 841  VRGASVYVLGPKHLVISRLLPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVIDLPRSL 900
            VRGAS+YVLGP HLVISRLLPWKERVQVLRKAGDWM+ALSMAITIYDGHAHGVIDLPRSL
Sbjct: 841  VRGASIYVLGPMHLVISRLLPWKERVQVLRKAGDWMSALSMAITIYDGHAHGVIDLPRSL 900

Query: 901  ESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNR 960
            ESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDD+T  SHSAHSEIKEQYNR
Sbjct: 901  ESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDDMTIESHSAHSEIKEQYNR 960

Query: 961  VGGVAVEFCVHITRTDILFDEIFSKFVAVQQRDTFLELLEPYILKDMLGSLPPEIMQALV 1020
            VGGVAVEFCVHI+RTDILFDEIFSKFV VQQRDTFLELLEPYILKDMLGSLPPEIMQALV
Sbjct: 961  VGGVAVEFCVHISRTDILFDEIFSKFVGVQQRDTFLELLEPYILKDMLGSLPPEIMQALV 1020

Query: 1021 EHYSQKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEEL 1080
            EHYS KGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEEL
Sbjct: 1021 EHYSHKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEEL 1080

Query: 1081 LAVLQTSLSLDSKKSLGGGWGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDEL 1140
            LAVL+TS S  +  SLG        YKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDEL
Sbjct: 1081 LAVLRTSKSKHAS-SLG--------YKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDEL 1140

Query: 1141 LQFLLENSDTVDTRSISNKSSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKADSSLD 1200
            LQFLLENSD VDTRSISNKSSEVG LNLY LLELDT ATLDVLRCAFVEGEI+KA SSLD
Sbjct: 1141 LQFLLENSDAVDTRSISNKSSEVGCLNLYPLLELDTEATLDVLRCAFVEGEILKAISSLD 1200

Query: 1201 GSVDASMQVQKEKNSTSGRKNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVEDWPSK 1260
            G VD SMQ+Q+EKNS SGRKNFL+QNVVDALVH+LDKAI +T  SP GDNITLV+DWPSK
Sbjct: 1201 GPVDTSMQLQEEKNSISGRKNFLIQNVVDALVHVLDKAICETDESPAGDNITLVDDWPSK 1260

Query: 1261 KDLFHLFDFVANYVACGKATASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSVHSRKR 1320
            K+L HLFDF+A YVACGKAT SKDVVGQILEHLISNSDIPE   DF+  VTANSV SRKR
Sbjct: 1261 KELIHLFDFIATYVACGKATVSKDVVGQILEHLISNSDIPETVSDFLPRVTANSVLSRKR 1320

Query: 1321 EKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDVEEPIH 1380
            EKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHSI+HQYSSALD YMKDV+EPIH
Sbjct: 1321 EKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHSITHQYSSALDSYMKDVDEPIH 1380

Query: 1381 AFAFINRTLMELSNSEQTEFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDILSRLHN 1440
             F FINRTL+EL NSEQTEFR VVISRIPEL NLNR  TFFLVIDHF NDVS+ILS+L N
Sbjct: 1381 TFTFINRTLLELGNSEQTEFRAVVISRIPELFNLNRGATFFLVIDHFNNDVSNILSQLRN 1440

Query: 1441 HPRSLFLYLKTLIEVHQSGNLNFSCLKKDDNFGVNYSTKGLDDYLQKLSDFPKFLSNNPV 1500
            HPRSLFLYLKTLIEVH SG+ +FSCLKKDDN GVNYSTKG+DDYLQKLSDFPK+LSNNPV
Sbjct: 1441 HPRSLFLYLKTLIEVHLSGSPDFSCLKKDDNLGVNYSTKGMDDYLQKLSDFPKYLSNNPV 1500

Query: 1501 DVTDDIIELYVELLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDAAAFLLERVGDV 1560
            DVTDDIIELYVELLC++ERESVLKFLETFDSY VEHCLRLCQQYEVIDAAAFLLERVGDV
Sbjct: 1501 DVTDDIIELYVELLCQHERESVLKFLETFDSYRVEHCLRLCQQYEVIDAAAFLLERVGDV 1560

Query: 1561 GSALFLTLSSLDKKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQEVNDIYVLLHACI 1620
            GSALFLTLSSLDKKFHDLEAAVG  V+N ASSGSNDSQ F+SVLKLQEVN + VLLHACI
Sbjct: 1561 GSALFLTLSSLDKKFHDLEAAVGATVSNTASSGSNDSQNFNSVLKLQEVNAVKVLLHACI 1620

Query: 1621 GLCQRNTPRLNSEESETLWFKLLDSFCEPLIESFNYRTASFGENQVQFLNESSGSQKDKE 1680
            GLCQRNTPRLNSEES+TLWFKLLDSFCEPLI+S+N+RTASF +NQVQFLNESS SQKDKE
Sbjct: 1621 GLCQRNTPRLNSEESQTLWFKLLDSFCEPLIDSYNHRTASFEKNQVQFLNESSCSQKDKE 1680

Query: 1681 AHIVTWRILKSNQTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLLSDNGSQEFGDFKL 1740
            A+IVTWRILKSN+ AH+LR LFS+FIREIVEGM+GYVHLPTIMSRLL DNGSQEFGDFKL
Sbjct: 1681 ANIVTWRILKSNKVAHLLRKLFSQFIREIVEGMMGYVHLPTIMSRLLYDNGSQEFGDFKL 1740

Query: 1741 TILGMLGTFGFERRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGAVCCICNRLLVKSS 1800
            TILGMLGTFGFERRILD+AKALIEDD+FYTMSLLKKGA+HGYAPR  VCCICNRLLVKSS
Sbjct: 1741 TILGMLGTFGFERRILDSAKALIEDDSFYTMSLLKKGAAHGYAPRSVVCCICNRLLVKSS 1800

Query: 1801 SSYRVRVFNCGHATHLQCEVLDNEASGGDFSCPVCVHSNHSQRSRGKALTEYSLVNKFSS 1860
            SSYRVRVFNCGHATHLQCE L+NEASGGD++CP+CVHSN SQ S+ KA TEYSLVNKFSS
Sbjct: 1801 SSYRVRVFNCGHATHLQCEDLENEASGGDYTCPICVHSNQSQGSKSKAPTEYSLVNKFSS 1860

Query: 1861 RTQSSSGASVSYPQETDLLELPYTLQQIPRFEILANLQKNQRVIDIENMPQLRLAPPAVY 1920
            RTQSSSGASVSYPQETDLLELPYTLQQIPRFEIL NLQKNQRVIDIEN+PQLRLAPPAVY
Sbjct: 1861 RTQSSSGASVSYPQETDLLELPYTLQQIPRFEILTNLQKNQRVIDIENVPQLRLAPPAVY 1920

Query: 1921 HDKVTKGYHLLVGDSSGGVEKVEKLNKSRQLKEVKVKRPSSLRFPLKANLFGE 1972
            HDKVTKGYHLLVG+SSGG EKVEKLNKSRQL  VKVKRPSSLRFPLK +LFG+
Sbjct: 1921 HDKVTKGYHLLVGESSGGREKVEKLNKSRQLTGVKVKRPSSLRFPLKTSLFGK 1954

BLAST of Cp4.1LG03g15580.1 vs. NCBI nr
Match: gi|659074757|ref|XP_008437780.1| (PREDICTED: vacuolar protein sorting-associated protein 8 homolog [Cucumis melo])

HSP 1 Score: 3267.2 bits (8470), Expect = 0.0e+00
Identity = 1695/1973 (85.91%), Positives = 1793/1973 (90.88%), Query Frame = 1

Query: 1    MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDLSFPHRSIDEILNESSSSTSS 60
            MT+ELTDT TLPPMELDLNAFIHAHLSSGDDD   D+DDLSFPHRSIDEILN+SSSSTSS
Sbjct: 1    MTEELTDTRTLPPMELDLNAFIHAHLSSGDDD---DDDDLSFPHRSIDEILNDSSSSTSS 60

Query: 61   SPSSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQLK 120
            SPSS P+SPP R RR+I A +G  SAS S  PFKS  EE IK S+ PR+N  NEKS Q K
Sbjct: 61   SPSSSPHSPPSRGRRNIVAGNGGVSASPSTSPFKSLLEETIKDSEAPRNNPWNEKSAQSK 120

Query: 121  PGSVSHTKVGELTDDPFRRGSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPHAAAIK 180
            PG VSH+K+GELTDDPFRRGSR LPSLFG VRSNAKPGAALAAAAAASRS PAPHAAAIK
Sbjct: 121  PGKVSHSKIGELTDDPFRRGSRPLPSLFGAVRSNAKPGAALAAAAAASRSTPAPHAAAIK 180

Query: 181  SRRSGHGSVVLDDDELASSSAVDSEFVFDDLYSTIDHSKESREKSISLVERNADYQGASV 240
            SRR+G+G++ LDDDELASSSAVDSEF+ D LY T  H KES E SIS+V+R  DYQ AS 
Sbjct: 181  SRRAGYGNMALDDDELASSSAVDSEFLSDSLYHTNIHLKESGENSISVVDRITDYQVASR 240

Query: 241  NVGVEFWARDNIRDCVLYNDEFRITKDTECEAEQSFVDDVNFDESSTTLPPVEANGRSLS 300
            +V  E W R+NIRD V +NDEFR+T+D E EAE S VDDVNF+ES TT+PP E N RSL 
Sbjct: 241  DVS-ELWDRNNIRDSVPHNDEFRMTEDMEFEAEPSSVDDVNFNESLTTVPPAETNDRSLL 300

Query: 301  DSADNNVCSMDAEPTVLDGDESNEGAFPCSPKPDYDRSAVGYGRLELETQDFEKRSQPSK 360
              A+ NVCS DA PT LD DESNEGA P S +PD + SAVGYG  ELETQDFEK  QPSK
Sbjct: 301  GPAEKNVCSTDAHPTELDVDESNEGAIPRSTEPDDEGSAVGYGSPELETQDFEKYHQPSK 360

Query: 361  DSEV-LAIEDLSVVNDISESRETAKQLDNFHTGERAETMSLSSSNPLELAEEIEKKQAFT 420
            D+EV LAIED S+VNDI ES ET +QLDN   G+  ETM +SS+NPLELAEEIEKKQAFT
Sbjct: 361  DTEVDLAIEDPSIVNDIIESGETTEQLDNLQIGKHPETMPVSSTNPLELAEEIEKKQAFT 420

Query: 421  ALHWEEGVAAQPMRLEGIKGGTTALGYFDIQADNSISRTISSHSFKREHGFPQALAVHAN 480
            ALHWEEGVAAQPMRLEGIKG TT LGYFDIQADNSISRTISSHSF+REHGFPQ LAVHAN
Sbjct: 421  ALHWEEGVAAQPMRLEGIKGVTTTLGYFDIQADNSISRTISSHSFRREHGFPQVLAVHAN 480

Query: 481  YIAVGMSKGNIVVVASKYSAQNGDNMDAKMLLLGSQGDKSTAPVTSLCFNQQGDLLLAGY 540
            YIAVGMSKG+IVVVASKYSAQNGDNMDAKM+LLGSQGDKSTAPVTSLCF+QQ DLLLAGY
Sbjct: 481  YIAVGMSKGSIVVVASKYSAQNGDNMDAKMILLGSQGDKSTAPVTSLCFSQQADLLLAGY 540

Query: 541  SDGQVTVWDVLRATAAKVISGEHTSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFS 600
            SDG +TVWDVLRA+AAKVISGEHTSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFS
Sbjct: 541  SDGHITVWDVLRASAAKVISGEHTSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFS 600

Query: 601  VVPLLNRFSIKTQCLLDGQKTGTVLSASALLLNEFCGSSLPPSLSNVAVSTSSIGSMMGG 660
            VVPLLNRFSIKTQCLLDGQKTGTVLSASALLLNEF GSSLPP+LSNVAVSTSSIGSMMGG
Sbjct: 601  VVPLLNRFSIKTQCLLDGQKTGTVLSASALLLNEFVGSSLPPTLSNVAVSTSSIGSMMGG 660

Query: 661  VVGGDSGWKLFNEGSSLVE-GVVIFATHQTALVVRLSPTVEVYARLSKPDGIQEGSMPYT 720
            VVGGDSGWKLFNEGSSLVE GVVIFATHQTALVVRLSP+VEVYA+LSKPDGI+EGSMPYT
Sbjct: 661  VVGGDSGWKLFNEGSSLVEEGVVIFATHQTALVVRLSPSVEVYAQLSKPDGIREGSMPYT 720

Query: 721  AWKCSQSIETSPSEAVERISLLAIAWDKMVQVAKLVKTELNVCGKWSLESAAIGVAWLDD 780
            AWKCSQS ETS SEAVER+SLLAIAWDKMVQVAKLVKTEL VCG WSLESAAIGV WLDD
Sbjct: 721  AWKCSQSFETSSSEAVERVSLLAIAWDKMVQVAKLVKTELKVCGNWSLESAAIGVVWLDD 780

Query: 781  QVLVILTVTGQLFLFEKDGTMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAYHNCVA 840
            QVLVILTVTGQLFLFEKDGTMIHQTSVF DGF KEDFIA+HTHF NVLG+PEKAYHNCVA
Sbjct: 781  QVLVILTVTGQLFLFEKDGTMIHQTSVFADGFVKEDFIAYHTHFANVLGHPEKAYHNCVA 840

Query: 841  VRGASVYVLGPKHLVISRLLPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVIDLPRSL 900
            VRGAS+YVLGP HLVISRLLPWKERVQVLRKAGDWM+ALSMAITIYDGHAHGVIDLPRSL
Sbjct: 841  VRGASIYVLGPTHLVISRLLPWKERVQVLRKAGDWMSALSMAITIYDGHAHGVIDLPRSL 900

Query: 901  ESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNR 960
            ESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDDVTS   SAHSEIKEQYNR
Sbjct: 901  ESLQELVMPFLIELLLSYVDEVFSYISVAFCNQIEKNEKLDDVTSERDSAHSEIKEQYNR 960

Query: 961  VGGVAVEFCVHITRTDILFDEIFSKFVAVQQRDTFLELLEPYILKDMLGSLPPEIMQALV 1020
            VGGVAVEFCVHI+RTDILFDEIFSKFV VQQRDTFLELLEPYILKDMLGSLPPEIMQALV
Sbjct: 961  VGGVAVEFCVHISRTDILFDEIFSKFVGVQQRDTFLELLEPYILKDMLGSLPPEIMQALV 1020

Query: 1021 EHYSQKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEEL 1080
            EHYS KGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEEL
Sbjct: 1021 EHYSHKGWLQRVEQCVLHMDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEEL 1080

Query: 1081 LAVLQTSLSLDSKKSLGGGWGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDEL 1140
            LAVL+TS S  +  SLG        YKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDEL
Sbjct: 1081 LAVLRTSKSKHAS-SLG--------YKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDEL 1140

Query: 1141 LQFLLENSDTVDTRSISNKSSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKADSSLD 1200
            LQFLLENSD VDTRSISNKSSEVG LNLYHLLELDT ATLDVLRCAFVE E +K +SSLD
Sbjct: 1141 LQFLLENSDAVDTRSISNKSSEVGCLNLYHLLELDTEATLDVLRCAFVEVEFLKTNSSLD 1200

Query: 1201 GSVDASMQVQKEKNSTSGRKNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVEDWPSK 1260
            G VDA M++Q EKNS SGRKNFL+QNVVDALVH+L KAI +T  SP GDNITLV+DWPSK
Sbjct: 1201 GPVDAIMELQDEKNSISGRKNFLIQNVVDALVHVLGKAICETDESPDGDNITLVDDWPSK 1260

Query: 1261 KDLFHLFDFVANYVACGKATASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSVHSRKR 1320
            K+L HLFDF+A YVACGKAT SKDVVGQILEHLISN+ IPE   DF+  VTANSVHSRKR
Sbjct: 1261 KELIHLFDFIATYVACGKATVSKDVVGQILEHLISNTHIPETS-DFLPRVTANSVHSRKR 1320

Query: 1321 EKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDVEEPIH 1380
            EKQVLSLLEV+PETHWNPSSVLRMCEKAQFFQVCGLIHSI  QYSSALD YMKDV EPIH
Sbjct: 1321 EKQVLSLLEVVPETHWNPSSVLRMCEKAQFFQVCGLIHSIGCQYSSALDSYMKDVGEPIH 1380

Query: 1381 AFAFINRTLMELSNSEQTEFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDILSRLHN 1440
            AFAFINR L++LSNSEQTEFR VVISRIPEL NLNR  TFFLVIDHF +DVS+IL +L N
Sbjct: 1381 AFAFINRALLKLSNSEQTEFRAVVISRIPELFNLNRGATFFLVIDHFNDDVSNILLQLRN 1440

Query: 1441 HPRSLFLYLKTLIEVHQSGNLNFSCLKKDDNFGVNYSTKGLDDYLQKLSDFPKFLSNNPV 1500
            HPRSLFLYLKTLIEVH SG+L+FSCLKKDDN GVNYSTKGLDDYL+KLSDFPK+LSNNPV
Sbjct: 1441 HPRSLFLYLKTLIEVHLSGSLDFSCLKKDDNLGVNYSTKGLDDYLKKLSDFPKYLSNNPV 1500

Query: 1501 DVTDDIIELYVELLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDAAAFLLERVGDV 1560
            DVTDDIIELYVELLC++ERESVLKFLETFDSY VEHCLRLCQQYEVIDAAAFLLERVGDV
Sbjct: 1501 DVTDDIIELYVELLCQHERESVLKFLETFDSYRVEHCLRLCQQYEVIDAAAFLLERVGDV 1560

Query: 1561 GSALFLTLSSLDKKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQEVNDIYVLLHACI 1620
            GSALFLTLSSLDKKFHDLEAAVG IV+NGASSGS+DSQ F SVLKLQEVN + VLLHACI
Sbjct: 1561 GSALFLTLSSLDKKFHDLEAAVGAIVSNGASSGSSDSQHFDSVLKLQEVNTVEVLLHACI 1620

Query: 1621 GLCQRNTPRLNSEESETLWFKLLDSFCEPLIESFNYRTASFGENQVQFLNESSGSQKDKE 1680
            GLCQRNTPRLN EESETLWFKLLDSFCEPLI+S+N+RTASF +NQVQFLNE S SQKDKE
Sbjct: 1621 GLCQRNTPRLNCEESETLWFKLLDSFCEPLIDSYNHRTASFEKNQVQFLNEPSSSQKDKE 1680

Query: 1681 AHIVTWRILKSNQTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLLSDNGSQEFGDFKL 1740
            A+IVTWRILKSN+ AHILR LFS+FIREIVEGM+GYVHLPTIMSRLL DNGSQEFGDFKL
Sbjct: 1681 ANIVTWRILKSNKAAHILRKLFSQFIREIVEGMMGYVHLPTIMSRLLYDNGSQEFGDFKL 1740

Query: 1741 TILGMLGTFGFERRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGAVCCICNRLLVKSS 1800
            TILGMLGTFGFERRILDTAKALIEDD+FYTM+LLKKGA+HGYAPR  VCCICNRLLVKSS
Sbjct: 1741 TILGMLGTFGFERRILDTAKALIEDDSFYTMNLLKKGAAHGYAPRSVVCCICNRLLVKSS 1800

Query: 1801 SSYRVRVFNCGHATHLQCEVLDNEASGGDFSCPVCVHSNHSQRSRGKALTEYSLVNKFSS 1860
            SSYRVRVFNCGHATHLQCE L+NEASGGD +CP+CVHSN SQ S+ KA TEYSLVNKFSS
Sbjct: 1801 SSYRVRVFNCGHATHLQCEDLENEASGGDSTCPICVHSNQSQGSKSKAPTEYSLVNKFSS 1860

Query: 1861 RTQSSSGASVSYPQETDLLELPYTLQQIPRFEILANLQKNQRVIDIENMPQLRLAPPAVY 1920
            RT SSSGASVSYPQETD+LELPYTLQQIPRFEIL NLQKNQRVIDIEN+PQLRLAPPAVY
Sbjct: 1861 RTSSSSGASVSYPQETDILELPYTLQQIPRFEILTNLQKNQRVIDIENVPQLRLAPPAVY 1920

Query: 1921 HDKVTKGYHLLVGDSSGGVEKVEKLNKSRQLKEVKVKRPSSLRFPLKANLFGE 1972
            HDKVTKGYHLLVG+SS G EKVEKLNKSRQL EVKVKRPSSLRFPLKA+LFG+
Sbjct: 1921 HDKVTKGYHLLVGESSSGREKVEKLNKSRQLTEVKVKRPSSLRFPLKASLFGK 1959
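
The summary line that precedes each alignment reports its counts as `matches/length (percent)`. The following is a minimal Python sketch, not part of the database output, that parses such a line and recomputes the percentages from the counts as a consistency check. The helper names `parse_hsp_counts` and `recomputed_percent` are hypothetical, and the field label is captured generically rather than matched against a fixed spelling.

```python
import re

# Matches fields of the form "Label = 1695/1973 (85.91%)".
# "Query Frame = 1" has no a/b count and is therefore skipped.
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_counts(line):
    """Return {label: (matches, length, reported_percent)} for each count field."""
    return {
        label: (int(num), int(den), float(pct))
        for label, num, den, pct in HSP_FIELD.findall(line)
    }


def recomputed_percent(matches, length):
    """Percentage of matching columns, rounded to two decimals as in the report."""
    return round(100.0 * matches / length, 2)


# Summary line of the Cucumis melo hit shown above.
line = "Identity = 1695/1973 (85.91%), Positives = 1793/1973 (90.88%), Query Frame = 1"
fields = parse_hsp_counts(line)
for label, (matches, length, reported) in fields.items():
    print(label, recomputed_percent(matches, length), reported)
```

For this hit the recomputed values agree with the reported ones, which suggests the percentages are taken over the full alignment length (gap columns included) rather than over the query length alone.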

BLAST of Cp4.1LG03g15580.1 vs. NCBI nr
Match: gi|731420761|ref|XP_002267626.3| (PREDICTED: vacuolar protein sorting-associated protein 8 homolog isoform X1 [Vitis vinifera])

HSP 1 Score: 2219.5 bits (5750), Expect = 0.0e+00
Identity = 1232/2025 (60.84%), Positives = 1491/2025 (73.63%), Query Frame = 1

Query: 1    MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDLSFPHRSIDEILNESSSSTSS 60
            MTK+L+     PPMELDL++FIH  L+S DDD D        PHR++DEILN+S SS+SS
Sbjct: 1    MTKKLS----APPMELDLDSFIH--LTSDDDDDDALN---RVPHRTVDEILNDSDSSSSS 60

Query: 61   SPSSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQLK 120
               S  +     +     A D R        P K+  +E  K+++  + N+  ++ VQ K
Sbjct: 61   LSPSDHSYLAKHSSLFEDANDSRDDVVSVSTP-KTLSDERPKSAESLKFNEIEDRLVQFK 120

Query: 121  PGSVSHTKVGELTDDPF---RRGSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPHAA 180
              S+S  + G+L+ D F   RR SR LP LFG VRSNAKPGAALAAAAAASR +P PHAA
Sbjct: 121  ANSLSRVRTGDLSGDSFSLGRRVSRPLPPLFGSVRSNAKPGAALAAAAAASRPVPTPHAA 180

Query: 181  AIKSRRSGHGSV--VLDDDELASS------------SAVDSEFVFDDLYSTIDHSKESRE 240
            AIKSRR+G G++  VLD +EL  S            +   SE    D  S  +  K    
Sbjct: 181  AIKSRRAGSGALQRVLDTEELGGSGLDKLGSSSDVLNGAGSEIASSDWKSGEEDDKFEDF 240

Query: 241  KSISL---VERNADYQGASVNVGVEFWARDN-IRDC---------VLYNDEFRITKDTEC 300
            +S ++   V+ + D + +  +  VE   RD  + D           L  DE R+    E 
Sbjct: 241  QSATIEWTVKADVDDKVSVKDEIVESSHRDGEVFDLEKVPTEVVHTLEEDESRVNDSDEI 300

Query: 301  ----EAEQSFVDDVNFDESSTTLPPVEANGRSLSDSADNNVCSMDAEPTVLDGDESNEGA 360
                 AE      ++ +E S  L    A   S  D  D N+ S + E T      SN   
Sbjct: 301  LLNSSAETGLAASLSIEEESFDLNEGSAISGSY-DVKDQNIASDNVEETA-----SNSTF 360

Query: 361  FPCSPKPDYDRSAVGYGRLELETQDFEKRSQPSKDSEV-LAIEDLSVVNDISES-RETAK 420
               +   D D        L L+TQD E    PS D EV +A +D S  +D++E   E   
Sbjct: 361  LDAANSADKDEKV--REDLTLKTQDLEPVEPPSTDGEVNIAGDDWSPKSDVTELVEERLG 420

Query: 421  QLDNFHTGERAETMSLSSSNPLELAEEIEKKQAFTALHWEEGVAAQPMRLEGIKGGTTAL 480
            QL++    +R E        PLELAEE+EK QA T LHWEEG AAQPMRLEG++ G+T L
Sbjct: 421  QLESKMGSKRTEKKP--RLKPLELAEELEKSQASTGLHWEEGAAAQPMRLEGVRRGSTTL 480

Query: 481  GYFDIQADNSISRTISSHSFKREHGFPQALAVHANYIAVGMSKGNIVVVASKYSAQNGDN 540
            GYF+I  +N+I+RTISS +FKR+HG PQ LAVH N+IAVGMS+G ++VV SKYSA N DN
Sbjct: 481  GYFEIDNNNTITRTISSPAFKRDHGSPQVLAVHLNFIAVGMSRGVVMVVPSKYSAYNADN 540

Query: 541  MDAKMLLLGSQGDKSTAPVTSLCFNQQGDLLLAGYSDGQVTVWDVLRATAAKVISGEHTS 600
            MDAK+L+LG QG++S APVTS+CFN QGDLLLAGY DG +TVWDV RATAAKVI+GEH++
Sbjct: 541  MDAKILMLGLQGERSHAPVTSMCFNHQGDLLLAGYGDGHITVWDVQRATAAKVITGEHSA 600

Query: 601  PVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFSVVPLLNRFSIKTQCLLDGQKTGTVL 660
            PV+H+LFLGQ++QVTRQFKAVTGDSKGLVLLH FSVVPLLNRFSIKTQCLLDGQ+TGTVL
Sbjct: 601  PVIHTLFLGQDSQVTRQFKAVTGDSKGLVLLHAFSVVPLLNRFSIKTQCLLDGQRTGTVL 660

Query: 661  SASALLLNEFCGSSLPPSLSNVAVSTSSIGSMMGGVVGGDSGWKLFNEGSSLVE-GVVIF 720
            SAS LLL+E  GSSL  S  N   STSSIGSMMGGVVGGD+GWKLF+EGSSLVE GVVIF
Sbjct: 661  SASPLLLDESSGSSLMSSQGNATGSTSSIGSMMGGVVGGDAGWKLFSEGSSLVEEGVVIF 720

Query: 721  ATHQTALVVRLSPTVEVYARLSKPDGIQEGSMPYTAWKCSQ------SIETSPSEAVERI 780
             THQTALVVRLSP++EVYA+L+KPDG++EGSMPYTAWKC        S E +P EA ER+
Sbjct: 721  VTHQTALVVRLSPSLEVYAQLNKPDGVREGSMPYTAWKCMTIHSRGLSTENTPVEASERV 780

Query: 781  SLLAIAWDKMVQVAKLVKTELNVCGKWSLESAAIGVAWLDDQVLVILTVTGQLFLFEKDG 840
            SLLAIAWD+ VQVAKLVK+EL + GKW+LES AIGVAWLDDQ+LV+LT TGQL LF KDG
Sbjct: 781  SLLAIAWDRKVQVAKLVKSELKIYGKWTLESTAIGVAWLDDQILVVLTSTGQLCLFAKDG 840

Query: 841  TMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAYHNCVAVRGASVYVLGPKHLVISRL 900
            T+IHQTS  VDG   +D +A+HT+F N+ GNPEKAY N +AVRGAS+Y+LGP HLV+SRL
Sbjct: 841  TVIHQTSFAVDGSGGDDPVAYHTYFTNIFGNPEKAYQNSIAVRGASIYILGPVHLVVSRL 900

Query: 901  LPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVIDLPRSLESLQELVMPFLIELLLSYV 960
            L WKER+QVLRKAGDWM AL+MA+T+YDG++HGVIDLPRSLE++QE +MP+L+ELLLSYV
Sbjct: 901  LTWKERIQVLRKAGDWMGALNMAMTLYDGNSHGVIDLPRSLEAVQEAIMPYLVELLLSYV 960

Query: 961  DEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNRVGGVAVEFCVHITRTDILF 1020
            DEVFSYISVAFCNQI K E+LDD  +   S H EIKEQ+ RVGGVAVEFCVHI RTDILF
Sbjct: 961  DEVFSYISVAFCNQIGKMEQLDDPKNRGSSVHFEIKEQFTRVGGVAVEFCVHIKRTDILF 1020

Query: 1021 DEIFSKFVAVQQRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSQKGWLQRVEQCVLHM 1080
            DEIFSKFV VQ RDTFLELLEPYILKDMLGSLPPEIMQALVEHYS KGWLQRVEQCVLHM
Sbjct: 1021 DEIFSKFVGVQHRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSSKGWLQRVEQCVLHM 1080

Query: 1081 DISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEELLAVLQTSLSLDSKKSLGGG 1140
            DISSLDFNQVVRLCR+HGLY AL+YLFN+GLDDF+ PLEELL VL  +   +S  SLG  
Sbjct: 1081 DISSLDFNQVVRLCREHGLYGALIYLFNRGLDDFKAPLEELLVVL-LNRPRESASSLG-- 1140

Query: 1141 WGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDELLQFLLENSDTVDTRSISNK 1200
                  Y+ LVYLKYCFSGLAFPPG GTL  +R+ SLR EL+QFLLE+ + ++++++S+ 
Sbjct: 1141 ------YRMLVYLKYCFSGLAFPPGHGTLPPTRLPSLRTELVQFLLEDLNALNSQAVSSL 1200

Query: 1201 SSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKADSSLDGSVDASMQVQKEKNSTSGR 1260
            SS     NLYHLLELDT ATLDVLR AFVE EI K D SL  S DA+M+  KE +     
Sbjct: 1201 SSTRALPNLYHLLELDTEATLDVLRYAFVEDEITKPDVSLHDSTDANMEAGKEIDLMGEI 1260

Query: 1261 KNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVEDWPSKKDLFHLFDFVANYVACGKA 1320
            +N LVQN V+AL+HILD  ISQ   S G  +I  +E WPSKKD+ HLF+FVA YVAC +A
Sbjct: 1261 QNLLVQNTVNALIHILD--ISQKNRSSGSSDIGSLELWPSKKDMGHLFEFVAYYVACKRA 1320

Query: 1321 TASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSVHS-RKREKQVLSLLEVIPETHWNP 1380
              SK V+ QILE+L S + +P+       S +  SV + ++REKQVL+LLEV+PE  W+ 
Sbjct: 1321 NVSKTVLSQILEYLTSENKLPQ-------SSSKESVGTLKRREKQVLALLEVVPEKDWDA 1380

Query: 1381 SSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDVEEPIHAFAFINRTLMELSNSEQT 1440
            S VL +CEKA+F+QVCGLIHSI HQY +ALD YMKDV+EP+HAF+FIN TL +LS++E  
Sbjct: 1381 SYVLHLCEKAEFYQVCGLIHSIRHQYLTALDSYMKDVDEPVHAFSFINHTLSQLSDTESA 1440

Query: 1441 EFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDILSRLHNHPRSLFLYLKTLIEVHQS 1500
             FR  VISRIPEL+NL+REGTFFL+IDHF  +   ILS L +HP+SLFLYLKT+IEVH S
Sbjct: 1441 AFRSAVISRIPELVNLSREGTFFLIIDHFNKESPHILSELRSHPKSLFLYLKTVIEVHLS 1500

Query: 1501 GNLNFSCLKKDDNFGVNYSTK------GLDDYLQKLSDFPKFLSNNPVDVTDDIIELYVE 1560
            G LNFSCL+ DD    +   +      GL+ YL+++ DFPK L NNPV VTD++IELY+E
Sbjct: 1501 GTLNFSCLQNDDTMDASCGRRVKNQLYGLEAYLERILDFPKLLLNNPVHVTDEMIELYLE 1560

Query: 1561 LLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDAAAFLLERVGDVGSALFLTLSSLD 1620
            LLC+YE  SVLKFLETF+SY VEHCLRLCQ+Y +IDAAAFLLERVGDVGSAL LTLS L+
Sbjct: 1561 LLCQYEHTSVLKFLETFESYRVEHCLRLCQEYGIIDAAAFLLERVGDVGSALLLTLSGLN 1620

Query: 1621 KKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQEVNDIYVLLHACIGLCQRNTPRLNS 1680
             KF+ LE AVG I++  ASS  +     ++VLK++EV+DIY +LH CIGLCQRNTPRL  
Sbjct: 1621 DKFNVLETAVGSILSEKASSVDH----LNTVLKMKEVSDIYDILHTCIGLCQRNTPRLVP 1680

Query: 1681 EESETLWFKLLDSFCEPLIESFNYRTASFGENQVQFLNESSGSQKDKEAHIVTWRILKSN 1740
            EESE+LWF+LLDSFCEPL++S++ +  S  E  V  L ES  +Q   EA +  W I KS+
Sbjct: 1681 EESESLWFQLLDSFCEPLMDSYDDKIVSEVEKPVGILAESLETQAGDEACLNKWSIPKSH 1740

Query: 1741 QTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLLSDNGSQEFGDFKLTILGMLGTFGFE 1800
            Q AH+LR LFS+FI+EIVEGM+G+V LP IMS+LLSDNG+QEFGDFK+TILGMLGT+GFE
Sbjct: 1741 QGAHLLRRLFSQFIKEIVEGMVGFVRLPVIMSKLLSDNGNQEFGDFKVTILGMLGTYGFE 1800

Query: 1801 RRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGAVCCICNRLLVKSSSSYRVRVFNCGH 1860
            RRILDTAK+LIEDDTFYTMSLLKKGASHGYAPR  +CCICN L  K+SSS  +RVFNCGH
Sbjct: 1801 RRILDTAKSLIEDDTFYTMSLLKKGASHGYAPRSLICCICNCLFTKNSSSSSIRVFNCGH 1860

Query: 1861 ATHLQCEVLDNEASGGDFS--CPVCVHSNHSQRSRGKA-LTEYSLVNKFSSR-TQSSSGA 1920
            ATHLQCE+L+NEAS    S  CPVC+    +QRSR K+ L E  LV+K  SR TQ + G 
Sbjct: 1861 ATHLQCELLENEASNRSSSVGCPVCLPKKKTQRSRSKSVLMENGLVSKVPSRKTQQAQGT 1920

Query: 1921 SVSYPQETDLLELPYTLQQIPRFEILANLQKNQRVIDIENMPQLRLAPPAVYHDKVTKGY 1972
             V +P E D+LE PY LQQIPRFEIL NLQK++R I IEN+PQLRLAPPAVYH+KV KG 
Sbjct: 1921 IVLHPHENDVLENPYGLQQIPRFEILNNLQKDKRAIQIENLPQLRLAPPAVYHEKVAKGI 1980

BLAST of Cp4.1LG03g15580.1 vs. NCBI nr
Match: gi|657969010|ref|XP_008376221.1| (PREDICTED: vacuolar protein sorting-associated protein 8 homolog [Malus domestica])

HSP 1 Score: 2208.7 bits (5722), Expect = 0.0e+00
Identity = 1218/2002 (60.84%), Positives = 1477/2002 (73.78%), Query Frame = 1

Query: 1    MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDLS-FPHRSIDEILNESSSSTS 60
            MTK+LT  E    MELDL++F+++HLS  D+D   D+DDLS  PHR+IDEILN++ SS S
Sbjct: 1    MTKKLTRFEPQLAMELDLDSFLNSHLSLSDED---DDDDLSSVPHRTIDEILNDTDSSAS 60

Query: 61   SSP--------SSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQ 120
            SSP        +S P  P P             + + S+P  KS  E +++    PRSN 
Sbjct: 61   SSPPSSTIYRLNSDPKPPEP------------PNDAVSVPVVKSEDERVVR----PRSNL 120

Query: 121  RNEKSVQLKPGSVSHTKVGELTDDPFRRGSRALPSLFGGVRSNAKPGAALAAAAAASRSM 180
                         S  K G+L++DP  R  +  P L GG+R+NAKPGAALAAAAAASRSM
Sbjct: 121  ------------YSRVKSGDLSEDPAGRVPKPSPWLLGGMRTNAKPGAALAAAAAASRSM 180

Query: 181  PAPHAAAIKSRRSGHGSV---VLDDDELASSSAVDSEFVFDDLYSTIDHSKESREKSISL 240
            P PHAAAIKSRRS    V   VL+  EL   S V S    D   +T   S   R  S  L
Sbjct: 181  PTPHAAAIKSRRSAVSGVIQKVLEGSELDEKSEVSSNSTGD---TTAIRSAVLRTNSNEL 240

Query: 241  VERNADYQGASVNVGVEFWARDNIRDCVLYNDEFRITKDT--ECEAEQSFVDDVNFDESS 300
                 D+ G  +  G    +   +         FR+ +    E  A  +  ++ N     
Sbjct: 241  ---QVDFGGELLREGGVLESESKLEQT------FRVDRSQVDEASAGNAVEEEKNVSSFD 300

Query: 301  TTLPPVEANGRSLSDSADNNVCSMDAEPTVLDGDESNEGAFPCSPKPDYDRSAVGYGRLE 360
              L  ++AN    ++ + N     + +  + D D+++    PCS + D + +  G G  +
Sbjct: 301  ENLTNLDANDVKDTEFSKNVEVVEEYKQDIQDLDDNS----PCSKESDSEDNDGGGGGKD 360

Query: 361  LETQDFEKRSQPSKDSEVLAIEDLSVVNDISESRETAKQLDNFHTGERAETMSLSSSN-- 420
                   K S+ +K      I + S + D  E   +  QL     GE  E+  +S  +  
Sbjct: 361  DGDDGGGKESEDNKGD----IGNDSDIGDDDELGSSITQLVEERIGE-LESRRISKKSEK 420

Query: 421  ----PLELAEEIEKKQAFTALHWEEGVAAQPMRLEGIKGGTTALGYFDIQADNSISRTIS 480
                PLE+AEE+EKKQA  ALHWEEG AAQPMRLEG++ G+T LGYF++ A+N I+RT+S
Sbjct: 421  KLRKPLEIAEELEKKQASNALHWEEGAAAQPMRLEGVRRGSTTLGYFNVDANNPITRTLS 480

Query: 481  SHSFKREHGFPQALAVHANYIAVGMSKGNIVVVASKYSAQNGDNMDAKMLLLGSQGDKST 540
            S + +R+HG PQ LAVH NYIA+GM +G+I+V+ SKYSA   D+MDAKML+LG QG++S 
Sbjct: 481  SPALRRDHGSPQVLAVHNNYIAIGMGRGSILVIPSKYSAHTADSMDAKMLILGLQGERSY 540

Query: 541  APVTSLCFNQQGDLLLAGYSDGQVTVWDVLRATAAKVISGEHTSPVVHSLFLGQEAQVTR 600
            A VTS+CFNQQGDLLLAGY+DG +TVWDV RA+AAK+I+GEHT+PVVH+LFLGQ++QVTR
Sbjct: 541  AAVTSMCFNQQGDLLLAGYADGHITVWDVQRASAAKIITGEHTAPVVHTLFLGQDSQVTR 600

Query: 601  QFKAVTGDSKGLVLLHTFSVVPLLNRFSIKTQCLLDGQKTGTVLSASALLLNEFCGSSLP 660
            QFKAVTGDSKGLVLLH+ SVVPLLNRFSIKTQCLLDGQ TGTVLSAS LL +EFCG +  
Sbjct: 601  QFKAVTGDSKGLVLLHSSSVVPLLNRFSIKTQCLLDGQNTGTVLSASPLLFDEFCGGASL 660

Query: 661  PSLSNVAVSTSSIGSMMGGVVGGDSGWKLFNEGSSLVE-GVVIFATHQTALVVRLSPTVE 720
             S  + AVS SSIG MMGGV GGD+GWKLFNEGSSLVE GVV+F TH T LVVRL+PT+E
Sbjct: 661  SSQGSGAVSGSSIGGMMGGVXGGDAGWKLFNEGSSLVEEGVVVFVTHHTVLVVRLTPTLE 720

Query: 721  VYARLSKPDGIQEGSMPYTAWKCS-------QSIETSPSEAVERISLLAIAWDKMVQVAK 780
            VYARLSKPDG++EGSMP TAWKC+        S E  P+EAVER+SLLA+AWD+ V VAK
Sbjct: 721  VYARLSKPDGVREGSMPCTAWKCTIQSHSSPASSENMPAEAVERVSLLALAWDRKVLVAK 780

Query: 781  LVKTELNVCGKWSLESAAIGVAWLDDQVLVILTVTGQLFLFEKDGTMIHQTSVFVDGFDK 840
            LVK+EL V GKWSLESAAIGVAWLDDQ+LV+LTVTGQL LF KDGT+IHQTS  VDGF  
Sbjct: 781  LVKSELKVYGKWSLESAAIGVAWLDDQMLVVLTVTGQLCLFAKDGTVIHQTSFSVDGFGG 840

Query: 841  EDFIAHHTHFVNVLGNPEKAYHNCVAVRGASVYVLGPKHLVISRLLPWKERVQVLRKAGD 900
            +D IA+HTHF+N+ GNPEKAYHNCVAVRGASVYVLGP HL++SRLLPWKER+QVLR AGD
Sbjct: 841  DDLIAYHTHFINIFGNPEKAYHNCVAVRGASVYVLGPMHLIVSRLLPWKERIQVLRGAGD 900

Query: 901  WMTALSMAITIYDGHAHGVIDLPRSLESLQELVMPFLIELLLSYVDEVFSYISVAFCNQI 960
            WM AL+MA+TIYDG AHGV+DLPR+L ++QE +M +L+ELLLSYV+EVFSYISVAFCNQI
Sbjct: 901  WMGALNMAMTIYDGQAHGVVDLPRTLVAVQETIMSYLVELLLSYVEEVFSYISVAFCNQI 960

Query: 961  EKNEKLDDVTSGSHSAHSEIKEQYNRVGGVAVEFCVHITRTDILFDEIFSKFVAVQQRDT 1020
             K ++ DDV S S S HSEIKEQY RVGGVAVEFCVHI RTDILFDEIFSKFVAVQQRDT
Sbjct: 961  GKRDQADDVNSKSSSMHSEIKEQYTRVGGVAVEFCVHIKRTDILFDEIFSKFVAVQQRDT 1020

Query: 1021 FLELLEPYILKDMLGSLPPEIMQALVEHYSQKGWLQRVEQCVLHMDISSLDFNQVVRLCR 1080
            FLELLEPYILKDMLGSLPPEIMQALVEHYS+ GWLQRVEQCVLHMDISSLDFNQVVRLCR
Sbjct: 1021 FLELLEPYILKDMLGSLPPEIMQALVEHYSRTGWLQRVEQCVLHMDISSLDFNQVVRLCR 1080

Query: 1081 DHGLYSALVYLFNKGLDDFRTPLEELLAVLQTSLSLDSKKSLGGGWGSMGMYKTLVYLKY 1140
            +HGLYSALVYLFNKGLDDFR+PLEELL VL+ S   +   +LG        Y+ LVYLKY
Sbjct: 1081 EHGLYSALVYLFNKGLDDFRSPLEELLVVLRNS-QREGATALG--------YRMLVYLKY 1140

Query: 1141 CFSGLAFPPGQGTLAHSRVQSLRDELLQFLLENSDTVDTRSISNKSSEVGYLNLYHLLEL 1200
            CFSGLAFPPGQGT+  SR+ SLR ELLQFLLE SD  ++RS+S+      Y+NLY LLEL
Sbjct: 1141 CFSGLAFPPGQGTIPPSRLPSLRTELLQFLLEGSDAPNSRSVSSVMPGGEYINLYLLLEL 1200

Query: 1201 DTGATLDVLRCAFVEGEIVKADSSLDGSVDASMQVQKEKNSTSGRKNFLVQNVVDALVHI 1260
            DT ATLDVLRCAFVE EI K+D S   S D+ M  Q   N  +  KN +VQN VD L+ I
Sbjct: 1201 DTEATLDVLRCAFVEDEISKSDLS---SHDSDM--QDGNNLMAQNKNSMVQNTVDTLIRI 1260

Query: 1261 LDKAISQTYGSPGGDNITLVEDWPSKKDLFHLFDFVANYVACGKATASKDVVGQILEHLI 1320
            + K  SQT GSP  D+   V  WPSKKD+ HLF+F+A YVACG+AT SK V+ QILE+L 
Sbjct: 1261 ISKDSSQTDGSPSNDDTGSVVVWPSKKDIDHLFEFIAYYVACGRATVSKSVLSQILEYLT 1320

Query: 1321 SNSDIPEMEIDFVHSVTANSVHSRKREKQVLSLLEVIPETHWNPSSVLRMCEKAQFFQVC 1380
            S+++ P         V+ +S+ S++REKQVL LLEV+PET W+ S VL++CEKAQF+QVC
Sbjct: 1321 SDNNFPP-------CVSRDSITSKRREKQVLGLLEVVPETDWDSSYVLQLCEKAQFYQVC 1380

Query: 1381 GLIHSISHQYSSALDGYMKDVEEPIHAFAFINRTLMELSNSEQTEFRGVVISRIPELLNL 1440
            GLIH+  HQY +ALD YMKDVEEPIHAF+FIN+TL++L++ E   FR  +ISRIPEL  L
Sbjct: 1381 GLIHTSRHQYLAALDCYMKDVEEPIHAFSFINKTLLQLTDKECAAFRSEIISRIPELFYL 1440

Query: 1441 NREGTFFLVIDHFG-NDVSDILSRLHNHPRSLFLYLKTLIEVHQSGNLNFSCLKKDDNFG 1500
            NREGTFFLVIDHF   + S ILS+L +HP+SLFLYLKT+IEVH SG L+FS L+KDD   
Sbjct: 1441 NREGTFFLVIDHFTIEEGSHILSKLRSHPKSLFLYLKTVIEVHLSGTLDFSSLRKDDLVR 1500

Query: 1501 VNYSTKGLDDYLQKLSDFPKFLSNNPVDVTDDIIELYVELLCRYERESVLKFLETFDSYH 1560
            V   +K ++ YL+++SDFPK L +NPV+VTDD+IELY+ELLC+YER SVLKFLETFDSY 
Sbjct: 1501 VKDQSKAVEAYLERISDFPKLLRSNPVNVTDDMIELYLELLCQYERNSVLKFLETFDSYR 1560

Query: 1561 VEHCLRLCQQYEVIDAAAFLLERVGDVGSALFLTLSSLDKKFHDLEAAVGGIVTNGASSG 1620
            VEHCLRLCQ+Y + DAA+FLLERVGDVGSAL LTLS+L  KF  L+ AV  +    ASS 
Sbjct: 1561 VEHCLRLCQKYGITDAASFLLERVGDVGSALLLTLSTLSDKFMKLDTAVASL----ASSN 1620

Query: 1621 SNDSQLFSSVLKLQEVNDIYVLLHACIGLCQRNTPRLNSEESETLWFKLLDSFCEPLIES 1680
            S  ++ FS+ LKL+EVNDI  +LHACIGLCQRNT RLN +ESE LWF+LLDSFCEPL +S
Sbjct: 1621 SARTEHFSNALKLEEVNDINSILHACIGLCQRNTHRLNPDESEALWFRLLDSFCEPLTDS 1680

Query: 1681 FNYRTASFGENQVQFLNESSGSQKDKEAHIVTWRILKSNQTAHILRSLFSRFIREIVEGM 1740
            F+  T S GE+    + +S  S++D+ A I+ WRI K ++  HILR LFSRFI+EIVEGM
Sbjct: 1681 FDAGTVSKGEDVKTTVAKSLDSEEDEMAFIIKWRISKLHKGFHILRKLFSRFIKEIVEGM 1740

Query: 1741 IGYVHLPTIMSRLLSDNGSQEFGDFKLTILGMLGTFGFERRILDTAKALIEDDTFYTMSL 1800
            IGYV LPTIMS+LLSDNG+QEFGDFK TILGML T+GFERRILDTAK+LIEDDTFYTMS+
Sbjct: 1741 IGYVRLPTIMSKLLSDNGNQEFGDFKFTILGMLSTYGFERRILDTAKSLIEDDTFYTMSI 1800

Query: 1801 LKKGASHGYAPRGAVCCICNRLLVKSSSSYRVRVFNCGHATHLQCEVLDNEASGGDFS-- 1860
            LKKGASHGYAPR  +CC+C+ LL K+SSSY +R+FNCGHATHLQCE L+N AS    S  
Sbjct: 1801 LKKGASHGYAPRSQICCLCDCLLDKNSSSY-IRIFNCGHATHLQCEALENGASSSSSSSG 1860

Query: 1861 CPVCVHSNHSQRSRGKA-LTEYSLVNKFSSRTQSSSGASVSYPQETDLLELPYTLQQIPR 1920
            CPVC+    SQRSR K+ L E SLV +F SRTQ + G + S+P E+   E  Y LQQI R
Sbjct: 1861 CPVCMPKKKSQRSRSKSVLPEKSLVKEFLSRTQQTHG-TTSHPHESSASENTYGLQQISR 1920

Query: 1921 FEILANLQKNQRVIDIENMPQLRLAPPAVYHDKVTKGYHLLVGDSSGGVEKVEKLNKSRQ 1971
            F+IL NLQ+++ +++IENMPQLRLAPPAVYH+KV KG  L   +SS  + +V + +K++Q
Sbjct: 1921 FDILTNLQRDRGLVEIENMPQLRLAPPAVYHEKVQKGTVLSPAESSTDLSRVGQQSKTKQ 1922

BLAST of Cp4.1LG03g15580.1 vs. NCBI nr
Match: gi|568876600|ref|XP_006491365.1| (PREDICTED: vacuolar protein sorting-associated protein 8 homolog isoform X1 [Citrus sinensis])

HSP 1 Score: 2207.6 bits (5719), Expect = 0.0e+00
Identity = 1209/2024 (59.73%), Positives = 1484/2024 (73.32%), Query Frame = 1

Query: 1    MTKELTDTETLPPMELDLNAFIHAHLSSGDDDHDHDEDDLSFPHRSIDEILNESSSSTS- 60
            MTKEL DT++L  MELD+++F+++HLSS     D D++  S PHR++DEILN+S SSTS 
Sbjct: 1    MTKELQDTKSL--MELDVDSFLNSHLSS-----DSDDEFNSVPHRTLDEILNDSESSTSP 60

Query: 61   SSPSSPPNSPPPRARRSIAARDGRASASRSIPPFKSPFEEIIKASKVPRSNQRNEKSVQL 120
            SSP+S  +       +     DG +S  +  P                            
Sbjct: 61   SSPTSSIHHSDTSLAKPQPQGDGVSSQDKPTP---------------------------- 120

Query: 121  KPGSVSHTKVGELTDDPFRR----GSRALPSLFGGVRSNAKPGAALAAAAAASRSMPAPH 180
            KPGS    K  EL+ DP  R     SR LPSLFGGVRS AKPGAALAAAAAASRS+P PH
Sbjct: 121  KPGSFHRVKSNELSGDPIWRVPPSSSRQLPSLFGGVRSTAKPGAALAAAAAASRSVPTPH 180

Query: 181  AAAIKSRRSGHGSVVL----DDDELASSSAVDSEFVFDDLYSTIDHSKESREKSISLVER 240
            AAAIKSRR+G G+++     DD E+AS S+           + I  S E  E    L+  
Sbjct: 181  AAAIKSRRAGSGTLLKVLDGDDHEIASVSS-----------NEISVSSEKLEGDAELI-- 240

Query: 241  NADYQGASVNVGVEFWARDNIRDCVLYNDEFRITKDTECEAEQSFVDD--------VNFD 300
              D+Q A VNV  E  +  + RD            DT+ E+E S VDD        +N D
Sbjct: 241  -GDFQSAQVNVSGELSSLASSRDV-----------DTKLESEVSNVDDEFLNTSSNLNTD 300

Query: 301  ESSTTLPPVEANGRSL--------SDSADNNVCSMDAEPTVLDGDE-------SNEGAFP 360
            +     P V     +L        SD A++   +    P   D D        S E +  
Sbjct: 301  QLIGCSPRVVVKDLNLREKSIIASSDDANDIDGNRIVAPVTADDDSMFLEVNASTESS-- 360

Query: 361  CSPKPDYDRSAVGYGRLELETQDFE---KRSQPSKDSE--VLAIEDLSVVNDISE-SRET 420
              P  + DR+ +    LE+ T + E   K    S+D E  V    D S ++DISE   E 
Sbjct: 361  VVPLNESDRTGLMEENLEIPTLEMESSDKSMSTSQDDEVGVDGSNDASSIDDISELVEER 420

Query: 421  AKQLDNFHTGERAETMSLSSSNPLELAEEIEKKQAFTALHWEEGVAAQPMRLEGIKGGTT 480
              QL++  T  RAE     S  PLELAEE+EKKQA T LHW+EG AAQPMRLEG++ G+T
Sbjct: 421  IGQLESEITSRRAEKKVQPSLKPLELAEELEKKQASTGLHWKEGAAAQPMRLEGVRRGST 480

Query: 481  ALGYFDIQADNSISRTISSHSFKREHGFPQALAVHANYIAVGMSKGNIVVVASKYSAQNG 540
             LGYFD+ A+N+I++TI+S +F+R+HG PQ LAVH ++IAVGMSKG IVVV  KYSA + 
Sbjct: 481  TLGYFDVDANNTITQTIASQAFRRDHGSPQVLAVHPSFIAVGMSKGAIVVVPGKYSAHHR 540

Query: 541  DNMDAKMLLLGSQGDKSTAPVTSLCFNQQGDLLLAGYSDGQVTVWDVLRATAAKVISGEH 600
            D+MD+KM++LG  GD+S APVT++CFNQ GDLLLAGY+DG VTVWDV RA+AAKVI+GEH
Sbjct: 541  DSMDSKMMMLGLLGDRSPAPVTAMCFNQPGDLLLAGYADGHVTVWDVQRASAAKVITGEH 600

Query: 601  TSPVVHSLFLGQEAQVTRQFKAVTGDSKGLVLLHTFSVVPLLNRFSIKTQCLLDGQKTGT 660
            TSPVVH+LFLGQ++QVTRQFKAVTGD+KGLV LH+ SVVPLLNRFSIKTQCLLDGQKTG 
Sbjct: 601  TSPVVHTLFLGQDSQVTRQFKAVTGDTKGLVQLHSLSVVPLLNRFSIKTQCLLDGQKTGI 660

Query: 661  VLSASALLLNEFCGSSLPPSLSNVAVSTSSIGSMMGGVVGGDSGWKLFNEGSSLV-EGVV 720
            VLSAS LL +E CG +   S  N   S SSIGSMMGGVVG D+GWKLFNEGSSLV EGVV
Sbjct: 661  VLSASPLLFDESCGGAPLSSQGNSTASASSIGSMMGGVVGSDTGWKLFNEGSSLVEEGVV 720

Query: 721  IFATHQTALVVRLSPTVEVYARLSKPDGIQEGSMPYTAWKC-----SQSIETSPSEAVER 780
            IF T+QTALVVRL+PT+EVYA++ +PDG++EG+MPYTAWKC     S + E+ P+EA ER
Sbjct: 721  IFVTYQTALVVRLTPTLEVYAQIPRPDGVREGAMPYTAWKCMTTCRSSTTESIPTEAAER 780

Query: 781  ISLLAIAWDKMVQVAKLVKTELNVCGKWSLESAAIGVAWLDDQVLVILTVTGQLFLFEKD 840
            +SLLAIAWD+ VQVAKLVK+EL V GKWSL+SAAIGVAWLDDQ+LV+LT+ GQL+L+ +D
Sbjct: 781  VSLLAIAWDRKVQVAKLVKSELKVYGKWSLDSAAIGVAWLDDQMLVVLTLLGQLYLYARD 840

Query: 841  GTMIHQTSVFVDGFDKEDFIAHHTHFVNVLGNPEKAYHNCVAVRGASVYVLGPKHLVISR 900
            GT+IHQTS  VDG    D + +H++F NV GNPEK+YH+C++VRGAS+YVLGP HLV+SR
Sbjct: 841  GTVIHQTSFAVDGSQGYDLVGYHSYFTNVFGNPEKSYHDCISVRGASIYVLGPMHLVVSR 900

Query: 901  LLPWKERVQVLRKAGDWMTALSMAITIYDGHAHGVIDLPRSLESLQELVMPFLIELLLSY 960
            LLPWKER+QVLRKAGDWM AL+MA+T+YDG AHGVIDLPR+L+++QE +MP+L+ELLLSY
Sbjct: 901  LLPWKERIQVLRKAGDWMGALNMAMTLYDGQAHGVIDLPRTLDAVQEAIMPYLVELLLSY 960

Query: 961  VDEVFSYISVAFCNQIEKNEKLDDVTSGSHSAHSEIKEQYNRVGGVAVEFCVHITRTDIL 1020
            VDEVFSYISVAFCNQIEK  +L++  S S + H+EIKEQ+ RVGGVAVEFCVHI RTDIL
Sbjct: 961  VDEVFSYISVAFCNQIEKLAQLNNPQSRSSTVHAEIKEQFTRVGGVAVEFCVHINRTDIL 1020

Query: 1021 FDEIFSKFVAVQQRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSQKGWLQRVEQCVLH 1080
            FD+IFSKF AVQ RDTFLELLEPYILKDMLGSLPPEIMQALVEHYS KGWLQRVEQCVLH
Sbjct: 1021 FDDIFSKFEAVQHRDTFLELLEPYILKDMLGSLPPEIMQALVEHYSSKGWLQRVEQCVLH 1080

Query: 1081 MDISSLDFNQVVRLCRDHGLYSALVYLFNKGLDDFRTPLEELLAVLQTSLSLDSKKSLGG 1140
            MDISSLDFNQVVRLCR+HGL+ ALVYLFNKGLDDFR PLEELL VL+ S   +S  +LG 
Sbjct: 1081 MDISSLDFNQVVRLCREHGLHGALVYLFNKGLDDFRAPLEELLVVLRNS-ERESAYALG- 1140

Query: 1141 GWGSMGMYKTLVYLKYCFSGLAFPPGQGTLAHSRVQSLRDELLQFLLENSDTVDTRSISN 1200
                   Y+ LVYLKYCF GLAFPPG GTL  +R+ SLR EL+QFLLE SD  ++++ S+
Sbjct: 1141 -------YRMLVYLKYCFKGLAFPPGHGTLPSTRLPSLRAELVQFLLEESDAQNSQAASS 1200

Query: 1201 KSSEVGYLNLYHLLELDTGATLDVLRCAFVEGEIVKADSSLDGSVDASMQVQKEKNSTSG 1260
               +  YLNLYHLLELDT ATLDVLRCAF+E E  K+D       D + +        + 
Sbjct: 1201 LLLKGSYLNLYHLLELDTEATLDVLRCAFIEVETPKSDFYACDMADTNAEPNNGNKMVAE 1260

Query: 1261 RKNFLVQNVVDALVHILDKAISQTYGSPGGDNITLVEDWPSKKDLFHLFDFVANYVACGK 1320
             +N LVQN V+ALVHILD+ IS T GS   D+   VE WPS KD+ H+F+F+A YVA G+
Sbjct: 1261 YQNMLVQNTVNALVHILDEDISSTDGSASKDDSGSVEAWPSTKDIGHIFEFIACYVASGR 1320

Query: 1321 ATASKDVVGQILEHLISNSDIPEMEIDFVHSVTANSVHSRKREKQVLSLLEVIPETHWNP 1380
            AT SK V+ QIL++L S  ++P+       S+ ++   S++REKQ+L+LLE +PET WN 
Sbjct: 1321 ATVSKSVLSQILQYLTSEKNVPQ-------SILSHIETSKRREKQLLALLEAVPETDWNA 1380

Query: 1381 SSVLRMCEKAQFFQVCGLIHSISHQYSSALDGYMKDVEEPIHAFAFINRTLMELSNSEQT 1440
            S VL +CE A F+QVCGLIH+I + Y +ALD YMKDV+EPI AF+FI+ TL++L+++E T
Sbjct: 1381 SEVLHLCENAHFYQVCGLIHTIRYNYLAALDSYMKDVDEPICAFSFIHDTLLQLTDNEYT 1440

Query: 1441 EFRGVVISRIPELLNLNREGTFFLVIDHFGNDVSDILSRLHNHPRSLFLYLKTLIEVHQS 1500
             F   VISRIPEL+ L+RE TFFLVID F ++ S ILS L +HP+SLFLYLKT++EVH  
Sbjct: 1441 AFHSAVISRIPELICLSREATFFLVIDQFNDEASHILSELRSHPKSLFLYLKTVVEVHLH 1500

Query: 1501 GNLNFSCLKKDDNFG------VNYSTKGLDDYLQKLSDFPKFLSNNPVDVTDDIIELYVE 1560
            G LN S L+KDD         V Y +KGL  Y++++SD PKFLS+N V VTDD+IELY+E
Sbjct: 1501 GTLNLSYLRKDDTLDVANCKWVKYQSKGLGAYIERISDLPKFLSSNAVHVTDDMIELYLE 1560

Query: 1561 LLCRYERESVLKFLETFDSYHVEHCLRLCQQYEVIDAAAFLLERVGDVGSALFLTLSSLD 1620
            LLCRYER+SVLKFLETFDSY VE+CLRLCQ+Y + DAAAFLLERVGDVGSAL LTLS L+
Sbjct: 1561 LLCRYERDSVLKFLETFDSYRVEYCLRLCQEYGITDAAAFLLERVGDVGSALLLTLSELN 1620

Query: 1621 KKFHDLEAAVGGIVTNGASSGSNDSQLFSSVLKLQEVNDIYVLLHACIGLCQRNTPRLNS 1680
             KF  LE AVG  +    S+GS   + FS+VL ++EVND+  +L ACIGLCQRNTPRLN 
Sbjct: 1621 DKFAALETAVGSALPIAVSNGSVSVEHFSTVLNMEEVNDVNNILRACIGLCQRNTPRLNP 1680

Query: 1681 EESETLWFKLLDSFCEPLIESFNYRTASFGENQVQFLNESSGSQKDKEAHIVTWRILKSN 1740
            EESE LWFKLLDSFCEPL+ SF  R AS  EN  + L ES GSQ+D EA I+ WRI KS+
Sbjct: 1681 EESEVLWFKLLDSFCEPLMGSFVER-ASERENHSRMLEESFGSQEDAEACIIKWRISKSH 1740

Query: 1741 QTAHILRSLFSRFIREIVEGMIGYVHLPTIMSRLLSDNGSQEFGDFKLTILGMLGTFGFE 1800
            + +HILR LFS+FI+EIVEGMIGYVHLPTIMS+LLSDNGSQEFGDFKLTILGMLGT+ FE
Sbjct: 1741 RGSHILRKLFSQFIKEIVEGMIGYVHLPTIMSKLLSDNGSQEFGDFKLTILGMLGTYSFE 1800

Query: 1801 RRILDTAKALIEDDTFYTMSLLKKGASHGYAPRGAVCCICNRLLVKSSSSYRVRVFNCGH 1860
            RRILDTAK+LIEDDTFYTMS+LKK ASHGYAPR  +CCICN LL K+SSS+++RVFNCGH
Sbjct: 1801 RRILDTAKSLIEDDTFYTMSVLKKEASHGYAPRSLLCCICNCLLTKNSSSFQIRVFNCGH 1860

Query: 1861 ATHLQCEVLDNEASGGD--FSCPVCVHSNHSQRSRGK-ALTEYSLVNKFSSRTQSSSGAS 1920
            ATH+QCE+L+NE+S       CP+C+   ++QRSR K  L E  LV+KFSSR Q S G +
Sbjct: 1861 ATHIQCELLENESSSKSNLSGCPLCMPKKNTQRSRNKTVLAESGLVSKFSSRPQQSLGTT 1920

Query: 1921 VSYPQETDLLELPYTLQQIPRFEILANLQKNQRVIDIENMPQLRLAPPAVYHDKVTKGYH 1972
            + +  E+D  +    +QQ+ RFEIL NL+K+QRV+ IENMPQLRLAPPA+YH+KV KG  
Sbjct: 1921 L-HSHESDTSDYSNGIQQLSRFEILNNLRKDQRVVQIENMPQLRLAPPAIYHEKVKKGTD 1944

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
VPS8_MOUSE	9.7e-81	24.55	Vacuolar protein sorting-associated protein 8 homolog OS=Mus musculus GN=Vps8 PE...
VPS8_HUMAN	2.8e-80	24.89	Vacuolar protein sorting-associated protein 8 homolog OS=Homo sapiens GN=VPS8 PE...
VPS8_YEAST	1.4e-10	18.51	Vacuolar protein sorting-associated protein 8 OS=Saccharomyces cerevisiae (strai...

Match Name	E-value	Identity	Description
A0A0A0L2X7_CUCSA	0.0e+00	86.01	Uncharacterized protein OS=Cucumis sativus GN=Csa_3G116870 PE=4 SV=1
F6I2Y1_VITVI	0.0e+00	60.84	Putative uncharacterized protein OS=Vitis vinifera GN=VIT_15s0048g02590 PE=4 SV=...
V4U715_9ROSI	0.0e+00	59.68	Uncharacterized protein OS=Citrus clementina GN=CICLE_v10018449mg PE=4 SV=1
A0A067H3N5_CITSI	0.0e+00	59.78	Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g000170mg PE=4 SV=1
M5X747_PRUPE	0.0e+00	60.76	Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa000078mg PE=4 SV=1

Match Name	E-value	Identity	Description
AT4G00800.1	0.0e+00	52.98	transducin family protein / WD-40 repeat family protein

Match Name	E-value	Identity	Description
gi|778676625|ref|XP_011650623.1|	0.0e+00	86.01	PREDICTED: vacuolar protein sorting-associated protein 8 homolog [Cucumis sativu...
gi|659074757|ref|XP_008437780.1|	0.0e+00	85.91	PREDICTED: vacuolar protein sorting-associated protein 8 homolog [Cucumis melo]
gi|731420761|ref|XP_002267626.3|	0.0e+00	60.84	PREDICTED: vacuolar protein sorting-associated protein 8 homolog isoform X1 [Vit...
gi|657969010|ref|XP_008376221.1|	0.0e+00	60.84	PREDICTED: vacuolar protein sorting-associated protein 8 homolog [Malus domestic...
gi|568876600|ref|XP_006491365.1|	0.0e+00	59.73	PREDICTED: vacuolar protein sorting-associated protein 8 homolog isoform X1 [Cit...
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
Term	Definition
GO:0008270	zinc ion binding
GO:0005515	protein binding
Vocabulary: Biological Process
Term	Definition
GO:0016192	vesicle-mediated transport
GO:0006886	intracellular protein transport
Vocabulary: INTERPRO
Term	Definition
IPR025941	Vps8_central_dom
IPR019775	WD40_repeat_CS
IPR017986	WD40_repeat_dom
IPR015943	WD40/YVTN_repeat-like_dom_sf
IPR013083	Znf_RING/FYVE/PHD
IPR001841	Znf_RING
IPR001680	WD40_repeat
IPR000547	Clathrin_H-chain/VPS_repeat
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006886 intracellular protein transport
biological_process GO:0016192 vesicle-mediated transport
cellular_component GO:0005622 intracellular
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
Cp4.1LG03g15580	Cp4.1LG03g15580	gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name	Unique Name	Type
Cp4.1LG03g15580.1	Cp4.1LG03g15580.1-protein	polypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
Cp4.1LG03g15580.1:five_prime_utr:001	Cp4.1LG03g15580.1:five_prime_utr:001	five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
Cp4.1LG03g15580.1:cds:001	Cp4.1LG03g15580.1:cds:001	CDS
Cp4.1LG03g15580.1:cds:002	Cp4.1LG03g15580.1:cds:002	CDS
Cp4.1LG03g15580.1:cds:003	Cp4.1LG03g15580.1:cds:003	CDS
Cp4.1LG03g15580.1:cds:004	Cp4.1LG03g15580.1:cds:004	CDS
Cp4.1LG03g15580.1:cds:005	Cp4.1LG03g15580.1:cds:005	CDS
Cp4.1LG03g15580.1:cds:006	Cp4.1LG03g15580.1:cds:006	CDS
Cp4.1LG03g15580.1:cds:007	Cp4.1LG03g15580.1:cds:007	CDS
Cp4.1LG03g15580.1:cds:008	Cp4.1LG03g15580.1:cds:008	CDS
Cp4.1LG03g15580.1:cds:009	Cp4.1LG03g15580.1:cds:009	CDS
Cp4.1LG03g15580.1:cds:010	Cp4.1LG03g15580.1:cds:010	CDS
Cp4.1LG03g15580.1:cds:011	Cp4.1LG03g15580.1:cds:011	CDS
Cp4.1LG03g15580.1:cds:012	Cp4.1LG03g15580.1:cds:012	CDS
Cp4.1LG03g15580.1:cds:013	Cp4.1LG03g15580.1:cds:013	CDS
Cp4.1LG03g15580.1:cds:014	Cp4.1LG03g15580.1:cds:014	CDS
Cp4.1LG03g15580.1:cds:015	Cp4.1LG03g15580.1:cds:015	CDS
Cp4.1LG03g15580.1:cds:016	Cp4.1LG03g15580.1:cds:016	CDS
Cp4.1LG03g15580.1:cds:017	Cp4.1LG03g15580.1:cds:017	CDS
Cp4.1LG03g15580.1:cds:018	Cp4.1LG03g15580.1:cds:018	CDS
Cp4.1LG03g15580.1:cds:019	Cp4.1LG03g15580.1:cds:019	CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
Cp4.1LG03g15580.1:three_prime_utr:001	Cp4.1LG03g15580.1:three_prime_utr:001	three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000547	Clathrin, heavy chain/VPS, 7-fold repeat	PFAM	PF00637	Clathrin	coord: 1503..1573, score: 1.
IPR000547	Clathrin, heavy chain/VPS, 7-fold repeat	PROFILE	PS50236	CHCR	coord: 1450..1610, score: 11
IPR001680	WD40 repeat	SMART	SM00320	WD40_4	coord: 508..548, score: 0.
IPR001680	WD40 repeat	PROFILE	PS50082	WD_REPEATS_2	coord: 516..557, score: 10
IPR001841	Zinc finger, RING-type	PROFILE	PS50089	ZF_RING_2	coord: 1787..1833, score: 9
IPR013083	Zinc finger, RING/FYVE/PHD-type	GENE3D	G3DSA:3.30.40.10		coord: 1784..1837, score: 7.
IPR015943	WD40/YVTN repeat-like-containing domain	GENE3D	G3DSA:2.130.10.10		coord: 464..571, score: 3.
IPR017986	WD40-repeat-containing domain	PROFILE	PS50294	WD_REPEATS_REGION	coord: 516..557, score: 10
IPR017986	WD40-repeat-containing domain	unknown	SSF50978	WD40 repeat-like	coord: 738..792, score: 3.66E-11; coord: 448..566, score: 3.66
IPR019775	WD40 repeat, conserved site	PROSITE	PS00678	WD_REPEATS_1	coord: 535..549, scor
IPR025941	Vacuolar protein sorting-associated protein 8, central domain	PFAM	PF12816	Vps8	coord: 992..1185, score: 1.9
None	No IPR available	PANTHER	PTHR12616	VACUOLAR PROTEIN SORTING VPS41	coord: 948..1084, score: 5.8E-94; coord: 832..922, score: 5.8E-94; coord: 1494..1571, score: 5.8E-94; coord: 1704..1816, score: 5.8E-94; coord: 1101..1235, score: 5.8E-94; coord: 1603..1648, score: 5.8E-94; coord: 1276..1456, score: 5.8
None	No IPR available	PANTHER	PTHR12616:SF6	RING ZINC FINGER-CONTAINING PROTEIN	coord: 832..922, score: 5.8E-94; coord: 1494..1571, score: 5.8E-94; coord: 1603..1648, score: 5.8E-94; coord: 1276..1456, score: 5.8E-94; coord: 948..1084, score: 5.8E-94; coord: 1704..1816, score: 5.8E-94; coord: 1101..1235, score: 5.8