Cp4.1LG20g07410 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g07410
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (Zucchini))
Description: Vacuolar fusion protein MON1 like A
Location: Cp4.1LG20 : 6183026 .. 6197579 (+)
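The location value gives a scaffold name, start and end coordinates, and a strand. A minimal sketch of parsing it and computing the gene span follows. It assumes the `scaffold : start .. end (strand)` layout seen in this record and assumes the coordinates are 1-based and inclusive; the function name `parse_location` is hypothetical and not part of any database tool.

```python
import re

def parse_location(text):
    """Parse 'scaffold : start .. end (strand)' into its four parts."""
    match = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    scaffold, start, end, strand = match.groups()
    return scaffold, int(start), int(end), strand

scaffold, start, end, strand = parse_location("Cp4.1LG20 : 6183026 .. 6197579 (+)")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(scaffold, strand, span)  # Cp4.1LG20 + 14554
```

Under that assumption the gene, introns included, spans 14,554 bp on the plus strand.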
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TAGCGATGCGCTTTCAGGTGTGATCGCATTTCCCTCTTCCTGATCATCAACATTGAGTTTTCAATCGGAGAACTGAATCGACAATTCAAGTTTTATTCCCAAAGATGTCTTCTGGGTTGAGTTCATTATCATCCAGTGATGAACTCGACAATTTCAACCCTAGAACATCACCCACTACGCCACCTAAACCTCTCGAAGAAGAATTGGCGTCTTTAGCATTGACTTTACCACCTCCTGAGCTGCTTTCCGACCAGGAGGATGTCAATGTCGTCTCAGATGGATCCGCGGCCGATGGCTTTGGGTTTGGAATTCAACGGAGTGATGAGGAATCAAGGGTCGGCGTAGTTTGTGAAGAGAATGTTGTGGGGAACTCGGCGGCGGCGGTGGTTGAGGGAGCGACCGGAAATTCGGTGCGGGAGGGTATTGATGGGACGGGGGTTGTGTGGGGGAGGACTAATTCGGAGATTGAGGTGGATAGACCGGTCAGTCCTAGCAGTAGTGGGTATGCAGGTGAAAGGGGAAGCAGCGGTGCTAGTAGCGGGAGGTCGGAGATGGATGGAATTGCTGATGACGAGATACAGGAACTGAATGATGATGCTTCTGTTGGTGATAATTCGAATTCTGTACACTCTTGGGTTCCTGGAAAGCGGCATGGTGATGAGGTACTGTAATTTGGCAACCCTTGATGACTGTTTGATTGGAGGATTTTGAACTCCATGCGTCTCATTATCTATAGGGTAGATCATGAATGAATATTGATGTTTGGGACTTGCTTGGTAACTGTTTCCAAGATATTTGTTCATAAATGCCGTGGAATCTTGTTCCGAGGACACTTATCTTTTCTTGTTTTCTCTTTAGAATGTTTCAAAATTAAAAGAGAAAGATCTGTCTTTCCTATTCGTTTTAATATGCATAATAACATAAATCATTAAGTAATTTGATCATGGTTCTTAAAAGATGTTTTAAATCAGGAATTCACGGAAGGCAATCTCGTTGGGTTCTCACGTTAGATAGGACATTGTCTCTGGATCCTGTGAGGAGGGTGGGTTGGGTTCAAGTAAATTTGAGGCATAGAAATTTGGCTTTGCTTGCTAAATGGCTATTATGGTTCCTTCTAGAGTTGAATGCTTTCCGGTGCAAGGTTATAGTGAGAAAATATGGGTGATATCTTTTTGGGTGGGTCTTGGATTGTGTGGTAAAGGCACCTTGAGGAACTTGCTGAAAGAAATCTCTTTGGGTCTCTCTTGTTTTTTCATGTTTAGGAAATTTTCTATCGAGGATGGTCACTCGTTCAATTTATGGAAGACCATTGTTAGGATGATGGTCCCTATATGGAGATTCCACTTTGTACAACTTTGATTTTAGGAGATCGGTCCTTGAATGTGAAAGTTTTGGTATATCTTCATTGATTATCTTCTTCATGGGTTAGGCCATTGCCTAGGTTTAGGAATACGTTTGTATGGAACTTTGGCCCTTGGGGTGTGTTATCTTCTGGAGATAGCTTTTGTCCTAGTCGAGTGCCCTTCACTACCATATGTTTTTTTTTTCTCTTCTCCTTTGCATTGAATGTATAAACCAAGTTAAAAGATTTGGAGGATTAACACGACTTTATCTAAGAGTCATCCTCACTAATATTAGTGAGGCTCTGAGTATTGAATTTTATGACAATGTGTTTAAAAAAGATGTGGATCACATACTGTGCAGTAGTCAGTGTGCAATTTTGATTTGTGATAAACTAAAGTGTTTCTGTCTTACAATAGGTCTTTTTTTGTTTGATGATGATGTGTTTCTTCATCTGCCATTTTAGGACAAGAGAAGAATACTTTGTCATGTTACTTTTGTTTGGACTAATTTGTGGATCATTTAACTTGAGAGAAACAAGAAGATCTTTAGAAGGGTTGAGGGATCTTTGGATAGGTGTTGTTCGGGATCATTTTGGTTATCTTTGTAGTCTCTGTTTTTCTTTTGTTGGCTGGCTCCCTTTCATTATGGGCTTCCTT
TGTTCTGCTGTCTTTGTTCTTTTATTCATTTCAGTGAGGGTTTGTTTTCTCATATTGTGGTATGCTAGTCTCTTTGCAATCTCGTGGGGCATTGAGCTTGAGAAAAATAATAGAAATTTTAAAGAGGTAAAGGAGGTTTGGGAGGTCGTTAGGTTCAATGCCTCCTCGTGGACGTTGATCACTAGTTTTTTTTTTGTTTTTTTTGTTTTGGTAATTACGACTATGAGCTTGGTTTTTTTCTTTTAGATTGGAGTCATTTTCTGAAATTGGGCCAAACTCCTTTTTGTGGAGCTTATTTGATTTTATTCATTTATTATTTGTATGCCTTGTATCTTTCATTTCTCTCCATGAAAGCACAATTTCTTACCAAAAGAATAAAGACCTAACTGATAATTACAAAAGTTCTTAGCCATTAAGGTCCAAACCGAGGTATTAAACTTAATGACCTCCCACAACTCCTCCACAAATCTCTCAATGAGGCCTACTGATAATTTATCTCCTTTTTTGGTTTGATTTCTAAGAAGCAAGAGTAGAAGATATAAGCTAGAATTCTCGTGTAAGCTGATAAAGCTTTAAGCATCTGTTGTGAAAGATGTAGGATAACTTTGAAGAGGCTATGGCAACAAGCAGGCAAAGAAAAAATCAGAATGGGGCATTTCCTCCATGTCCTTTCTACACATAACGCACCAGTTAGGGGCTAGAGAGAGAGAGAGCATTCTCTTCTGCAATCTATAGCAACGTTTTTTTTTGAATCTGAATCCATGCTTTCACTGAGAGAAAAGAAAAAAAAAAGTACAAGTACGTACTAAAAAATATCAGCCTACTAAAAGGAGCTCCAAAACAAAAAATATGACTCCAATTCAGAAAAATTAAATCTAGTTGATAATTATAGGGGGAAAAGAACTGAGAGATCGAAGACCATAGTGAGCCATTAAAACTCATGGTATACTCCACACCTCAATCTCAAATCTCTCAACCCTCACAAAAATACAAGCTTGCCAAAGGACCATTCCTTTATCCTAAAAGGAAGGATTCAAGAGTTTCCTCAATCAAAGAGCCACAATGTCTGTTATGAACCCAACAAACCCCAAAGGAAGCTCAACAATGCTCCCAAATTGAGATGAAAAATTGCAGATCGCCTAAAATATGATCCAAGTCATCCGCCCCTCAATGACAAAGGAAGCATCAATGTGGATTCAACTCAAAGGATGTTAACTTTCCCATGAAGAATCTACTCTACAAAAAACTTGACCTTCTTGGAATTCTTAACCTTTCAAATAGAGAAGAAAGCTAAGGCTTCAAGAGGAGGAGAAGTGACACAAAGAAAAGGGAAAAAAAGAACTAGAAGAAAGCCCCATGAAAGACTGGGAATCAACCAACGAGTATACATCCTCCCTTGACTCACAAAAGAGTCCCCAATTAGTGAAAGATGGGTAGCCATGTTGACAGCTTCTATATGCCAGTAGATGGCTAAAAACACAAAGAGACCAAAGAAAAAAGATGAAGGAAAATCTAAAAAGAGTAGGACAAAGACTACGGAACACAACCTCTTAGTAGACAGGTGATATAGCTGAGGAAACACAGAGCAAAGTGGCTTGTCTACACCAATCTTCCCAAAAGTAAACCTTCAAGCCATCCCCAATAGTATTTTATAAATGGAGAGAGGAGGGGGAAAATGTGACGACATAGGACTCTAGAGGTACTTAGAGGAACCTCCCACTGACTACATCACACTCAAAAGGGTGTGAACAATACCTACTCACAATTGCCTTGTCCAAAAGAGCCTTGTTCTGACATCTTGGATTCCCATTACCCAAACCCCCGAGTTTCAAAGGCCTTGAGACCACCTCCCACCCAACAAAATGCGAACCTCCCCCACATGTATGTGTTCCCAAAGAAAATCTTTCATGGCCTTCTCCAATTTAAGAAGAGGAGAATGTATAGGCTCCCTGAGTAAGGGCACAAAGGATATGGAAAAAGAGTTGCAAGAGAAACTCTTT
GAAGGCTTTGAAACCCAAAGCTTAGAATCCCTTCTCCCATGACGAATCACTTGGTCAAAAATAGTTGAAAGGAAATGTACTACCTCATCTGCATCTTGGTTAGTGAAGGGGCACAAAAATCCATTGGTTAGTGAAGGGGTAGCAAAAACCCAGAGAAAGCAAGGAATAAATATCAAAAGAGGGAAAAACAGTGACTATTCAAACCATGTGACTATTTGCATCCTCGGACAAATATGTGGAAATAAGTCACTCAGGGGTCTCTCCCTTGTTGAAAAATCTTCCCAAGAATGAGTATTCAAACCATCCCCAATAGAACATTTAACAAATTGAAAGGACAGGAAAACTCAAAGCAATAGTAGACCAGAGGTTTTTGTCAGAACCCTTTCGTCTGCTACCTGAAACTCAATCAAAAGGGTGAGGTCCATACATACAATAATCTTCCACTACGAGGCACTTGACCCCTTGAAAAATGCCACAACCACTTTGGTGAAAGAGCCTCATTACGTAATTATGAACTTTAAACTTTAAACTTTAAATCCCAACCGGTTGCCCAACATTGAAAAGGAGAATGTTTGAGTTTGCTACCCTGTAGAGTCTTGGAAACAAAATCAGACTTGAGAGTGCTATTTCCAATCTATCGATCATCCCAAAACAGATTTTTTCTTCTTGAGATTATTATTTCCAATCCAATGATCATGGCAAAACATACATCAGAATGCTGTTTGCAAATCAGGCCCAGCCCCTTCTAATTCTCACTCAGTTAGGTTTCTTCATTAATCATTTGAGGAGGGTTCAACTTCTCATCCCCCCACATCTGGTTAGCCCAGTCTAGGTAAAGATCCTGGTGAATAGTATGGGGGAAATCCTCATTCTCTTGTACTAAGGTTCTACTATTGTCCTAAATAGTAACATCCCCAACGAGTACGTACCATCCAACCCAACCTGTTTGAATGTAACTTGAAGCACTTGAATTTCAAGTCATTATGAGTTTGCTAGCGTTGTTGAATTTACTATATGTAGCATGGAGCAATGTTTTGTTTATAGTTAAGCCCTGCCATCAGTAGTTATGATGAAAATACAGACTTTAAACATATAGCTCCTTTCTGTTATGTTATTTTAGTTGCCAAGAAAAACTGCTCATCACTTTTGCTTCATTGTGGTGGCTGACCTTACAAACATTGATTACTAGCACTACACCTTCCCTGATCAGTTAATTCTTAAAGTTCAGGATGATGCTTCCATATCATGGAGAAAAAGGAAGAAGCATTTCTTTGTTCTGAGCCATTCTGGAAAGCCTATTTATTCCAGGTTTGTTCTTTTGCTGCATTCTAGTATAATAAACAGTTTTACATGTTTAAATTTTAAAGCACTAAAATTTTCAGCCTTCTACTGGGAGCTTATTTAAACGTAGTTAATGACATTCAGGTATGGAGACGAGCACAAGCTAGCAGGATTTTCAGCCAGCTTGCAGGCAATCATTTCCTTTGTGGAGGATGGGTAAGTCGCTGGACTTTGCTATGTTATCTACGAGTTTTAATATTTTTAAGCATTATGATATTCAAAATCTCAGGTAATGTTCCTGAATCACTGCTGCACAGTTCTGATTAATTAATTGCTTTTCAGGGGTGATCGTGTCAAATGGGTTAGAGCTGGAAAACACCTGGTCTGTTTTTTTATTATATATATATATATATATATATTATTTATTTATTTATGTGTGTGTCTCATTGTTACTTCCAATGTTGGTTATTGAGCTGAGTATTCTTTGAACTTTAGGTGATTTAGGGGAATGAGAATGGATCTTTGCCATGTCCCTTCTTGCTGTTCACTTCATTAAGGCCCAAGGGAATGAACTGATAGAGTGGGACCCAACTTAAATATTGAGAATAATTCTGGAATGTCATTCATTTACAAAAAGAAAAGGGGATAAATTCCATTCCCATTCTCATTCTTGGGTCCCTTACTCCCACTAAATTGTTCATTTATTTCTTTTCTTTCTA
CTTGTTTTGGTTTTCACATCAGTTTGAATTTGAGTCTCATTTTATGAACCGGTATACAAAATCAATGACATCCTCCTCCTTCCAATTTCTATTCCGTAACTTTTAGAAGTTCAGCTCCCATTCCACAATTTTTCTTTTCAGTCAATACTGTTTCCATTTCACTTTGATTCTTATTCTCATTAAGTAGATGGGTCTTTAGTTCTCTTTGTTGACTCACAAATATTCAAGCAACTCTAACTAATCTCCTAATGTACTGTATTTCTATATTTTCTAAGCCTACAAACGTAATCTAAATCATCAAGAAGGTATATAGACATTTTCTCTCCGAAGGAGCCAATGTGCCAGACGACACTCATCTTGTTAATTGTAGCAAATTTCAATAATTCCCTCTCTCAGTGCCAAGTGGATCGGAGCATAGAGAGCATTAAACAAAGATATTGAGCTCTTCTAGCAAAAATTCATATGAAAAATTGTTGAAAAAAATGAATGGATATCCCTCACAATGGGTGTGGATATAAACGTATGGGTGTGTTGATTTCTTCTTGAGTAGGGAAGGTGTGTGTGCACATTGTCACGGTCGTACTTCTTCAACTGTGCGGTGTCTTGATCTTGACACGCTCATGATAAGCCTTAAGGGTAACTCAAGCCCCATATTCATTTTTCGCTTTGCGGCTTTGTTGGACCTTGGGTGAAATTTGTCTTCACCAACCAAACATCACTTGTGCGGAATCCAAGTTTGTTTGGCCACACACGCCATTTGTTCGTGCATGTTCAATCGTCTACTAACCAACAACGTCGCATACACGCCCATGCATCGTGATGTCATCCCATGTCTCGGGACAGCTCGACCATCTTGACTTGCTTGGACACATCCGTTGGGAATTGACCCATGTGTTGATCATCTCATCAGGACATCCTAAACATCCATCCACCCGGGTTCTTATCGAGCATTAGACTCTCCCGTGTCCATGCTAGACCATTTACGGCATCCATTTTGGATAGCGGGTGCATAATCCAGCCACTGACACACAAGCTAGGCCGCAATACCGGCACACCCGTGCCGTTCGGTAGTGTCATTGAAAAACCCATTTAAGGAAGTCGCCCGAGGCACTCGTCTCAAAATGCTCGCCCTCACTGTGACACACACATGTGCCACAGGGTAGCTCACTTAGGCCACAGGGCCTAGGGGTGGTGGAGAGCTAACACCATGTCTCCCCCTATAGTACATTTTGAGACATTTCCAATTCTTCTTGAGTAGACATCATTTTGGCGAGTAGACATAGCCTCGTGCATGGTCCGTCGAGTATCCAATTGACACCGAATTTAGTGAGTTTTCATCTTTCATATCAACGAACTTAATTCCTGGGCTGTATGTCGAAATTCATATCGAAACCCCAACTTCTGAGGTTATGGCCCGGGGCCCTACCCAGCCCTCTTCAAGCTCGTCTTCGACACTCATGTCGCCTCGTTCTCTCAAACTTTGCGTATGCCTCGACTGGCTTCAATGCACACTCTTGGTGTGTTGGATGATGCACCGTCCTCCTCGGTCGCATCATGGAACTTCTCAATCTTGGCTCTCATGTGGCCGGATGGGCGCCACTTGAGGTCGCCCCTTGTCTTTCTTGACGTATGGGGGTCACAACCTACTTTCTCGTAGTGGGGTTGTGACGCGATATACAAAAGTGTTTTTCTCACTGGTTGAAGTGGCTACTTTTACTTGCTGATTTCAACCTGAGTAATTTATGTGTGTGCAAAATTTGTAGTTTCATCCGATCTTTGAAGTTATCTGTACGCAAATGTTATGACTTCACTTTCAAATGGTTATCTGTGTTTGTTAAATTTTCTTCTGGAGCTTATGATACTAGTGTACTGAAACTAGAAACTGGAGAGACAGATCCTGTTCTTTTTGCTTTCAACTTTCATATTTTTCTTCTGTAATATAAATAAAAAATGTTTATCATTTCCCCATCTTCAGGTGGTCTTCCTTGTGAAGGGTCCAA
TTTACTTGGTTTGCATCAGCTGCACAGAAGAGCCTTATGAATCATTAAGAGGCCAGTTGGAACTTATATATGGTCAGGTAACCTTGGTTCTGCATGTAACCCATTGAAATATTTCTTCTTGTCTTTAGTCTCTAGCATTCCTAAATGAATTGCTGCTTGTTTGTGCAATTGAGTGATCATCTTTCTTATTGTTTTCTTAAATTCTTATGAATGAAATATTTGTTTCTATCATGGTTTCTTCACGTATTTAATATATTGTCATATGTGCATGCAGATGATACTTATTCTAACAAAGTCTGTAAATAGATGTTTTGAGAGAAATCCCAAGTTTGATATGACTTCTCTTCTTGGAGGAACCGATGTTGTCTTCTCCTCTCTCATCCATTCATTTGGTTGGTTAGTTTCAGCGCTCTTAACCTACCTATATTTTTTTCTGGATGAGAAACACCAGCCTGTTCTTTTTCTCCGTGTAAATTGCTTGTGCTCGTTCTTGAACTGTTATTCTGCTTATGCAAAAATTTGGCACCACCCAGGCAACTCCAGTGTTATAAGACCTCTATTAACTAATGATTTTGAATAGAAAAATTCGGCATCAAATTTTTGTTAACTTGATATTGCAGTTTATTCAATAATACTTGTATATGGATCTTACGAGCATCTATTTCTGGATTGTTCTTTTCTATTACACATAACCTAGCAACTGTTCCCTTTTTTATGTTTTATTTCAATTTTCAGTTAATGTATAGTCTCTTTCCTTTTTTTTTGGTATAAAATGATGTTCAACTTTTCTTTATGATGGAGTGTGGGTGGGTGTACTTTTGGTATCATGTTCAGTGAAGGGCATTGGAATTTCATTTTTAAATTTATGATTTTGTTTTCCTTAGTTCTAATTGATTGGTCCAACTTCAGGAACCCTGCTACTTTTCTTCATGCATACACTTGTCTTCCTCTCGCTTATGGTACAAGACAAGCTGCAGGTGCGATATTACAAGATGTTGCTGATTCTGGCATTCTTTTTGCAATTTTAATGTGCAAACATAAGGTGAACTTTCATCTTCTTGGCCCCCTTCCTGTCCCTTAGTTCAGCGTATCATGTTCTCCATAGAATATATAAGAATTGAATCTAGATTGCAGGTTATCAGTCTTGTTGGTGCTCAAAAGGCTTCTCTTCATCCCGATGATATGTTACTACTTGCCAACTTTGTGATGTCATCAGAGTCCTTTAGGTTAGTTTCAAATTTTAGGTTATATTATTCTGCTTCATGTAGAACAACCAAATTTTGCATCTGGCATGAGCTCAATAAATGAACTTTGGCACAGCTGTAGGAAGTTTACTGATTTTCTTATTTTCTGGAACAATGTTCAGTTCACTGCTTCTTGGTGGTGGTTTCACACTTTGAGCAATTCTTTTGTAACTAGCTCTTGGTGGTGGTTTCGGGACTAGAAGGTGTTTTTGTATTAGCTTCTGTTTTGGGAGGGGGAGTTCTCTCGGGCCTTATATTCTTTTTGTTCAATACATCCTTTCAGGTTTCTTATAAGAAGAAAAGGGGAAGAAAAGGGGAAAAAAAGGAAAAAAAAAAAAAACTTTTGCACAAATGAAGCTGTAATAGTTAATGTTCTGTCCTTATTAGATTGCTGCCTTTACATACCATTGCTACTTCATGTGGGAGATTGTATAATTGTTTCTTATACGTATGTGTGCTTTACATGCTTATATCTATGTATGTGAATGCACATGCACATATGCTCATAATAGTTAAATCACGATTGTAGTTTCCTGATTTCTTATAGAAACCCTAAGCTGTTTTCATTACTATGAGATATTCATTTGATTTAAGAGGCAACGTCATTGTCTTTGTGAAATTCAAACAGGTTTCTTCATCATCTTCTATTGTCTGTTTTGTGTTTTACCTTTCATTCCCATGTCTTTCCAGATTCACTCTTTTATAACTTCACATTAGGCTCTTCTGATGAATGAGCTGTGGGTACTTTAGATATCT
TTGATTCGGTTGATGTGGATTATCTTAAATTGACGTCTTATCAATACTCAAGTGGTTAATTTTTCATTTTATCTTTTAACCCTTTTAATGCAGGACATCCGAGTCTTTCTCTCCAATATGCCTTCCAAGATACAATCCTATGGCATTTTTGTATGCATATGTTCATTATTTCGATGTGAGTGAGACTGTTCCTTTAAATTTTATTAGGTAAAATACACATGTGGTCTAGGATTCAATTTAGTCTCTATGTGGTTCCCCGGTCTTCATGGCTAACATTTTTATTCTTTGAAATCTATAATTAGTTTCTATTTTGTCCTCGAGGTATTGACATGAAATTAGGCTGATAGGAGGAAAATATGTTGCCGTGTAAGTTTTGTGATGCAAGTGTGGTGGTCTTTTATAACTAAGTTAACGATGGAGAAGACGGCATATCTAGACTAGAGAGTAAATAGGAATCTCTTCTGTGGATGAGAAGCAGATTCTGGCTGTAGTGAGTATGAGGTAGGGAGAGAGGAACTCTCAACTTCTCCTAGACTTGTACTTCTTTGAAACTTTTTGAATGAAGTGGACTCTAGATTCATGACTTAATGACTTTAAGATTTTCTGTTTTGTTTCAACGTAGTATCAGAGATCTTAATTGCAAAACTCCTACCGTGTTTTCTTCACCCTCATTTAATATTAATAGTCTAAACGTTAAGAGTCCAATTCTTTTCCATTGTAGACTGCTGCCATTAGTAAGAATGGAAGTATTGTGATTTTCTTTATCGGCTGAACTCTTGTAATTTAATAGGTTTATAGTGTGGGTATAAATAACTGTGATGTTGTATATATTATCCTCACTGTATATTGACGGAATAAATGTTTTGTGGCAGGCTAACACTTACTTTATGCTCCTTACAACTAATTCAGATTCCTTCTATCACTTAAAGGAATGCAGGTGAATAGATGTCATATGTCAGTACTTTAGAGTGTATTTTGGCTGTCAACTAAGGCAAAAAATGGAATATGTTTTTACCATCCATAATGTTTGTTTTCCTTACACTGAATATCCTGTCCGGCCAACCCACCTGACATACTTTTACAAATTCAGGTCGGCTACATAAATTGCATTTAAGTTTGACCAAATTACTTTCTTTCTATCAAGATTTTTTTTTCCTGACCCTCTTTATCATGTATTTTAGAAGGATGATGCACTATATAAAATGTCATGATTAAAATGCATATTGGAGCAACATAAGATGACTTTGAAAATTTTGTCAAGAGTTAGGCTGATCACAACCTCATTATGCCTTTATGATTATTTCAACCTCAAGCCAACTACACAAAGCAGTTTTTCTTGTTATTGTAAGACGTTTGTTGGGTGCTTATTCAATGATTGATAGTTGATACTTTTTTTTTCTTTGCATGTTCTGGTTGACACTTCAGGATTCGGATTGAGACTGTCCTTTTGAAGTCAAATGTTCTTAGTGAAGTTCAGAGATCTATGTTAGATGGTGGGATGCATGTTGAAGATGTGCCTGTTGATTCTTTGCCTCGCTATAGAACCATATCTCCTCATTTGGGCCAACAAAGAGTTCCATCAGAATTTACTGAAAGATTCAAGGAATCTTCTGCTGGGATGGGTGGTCCTGGTGGACTCTGGCATTTCATTTACCGCAGTATATATCTGGATCAGTACGTTGCTTCTGAATTTTCATCTCCAATTAGCAGTCGTCAACAACAGAAGAGGTATTTGTGGCAAATTTCTAAAATCCCATACAATAACTTTTAGATGGCTTTTCCTTCTTATTAATCATGCATTTTGAGATTGTAAATTTTATTTCCTTGCATATTTGTTTTGATTTATTTATCATTGCAGACTGTACAGAGCATACCAAAATATTTATGATTCTATGCATGATAAAGAAATTGGCCCTCACAAAACCCAGTTTAGAAGAGATGAAAACTATGGTACGTTTTTTCTTTTCCCTTGACATTTCCAATTGGATTAATACAATCAT
TTAATTGAGTTTTCATAATTATACTGCCTTTCACTCTTGATGGTAAGGTTTTTTCTTTTTTCTTATTGTCTTTTCATCCAAATTCACAACTTTATTGGCCATATCTGTTCTTTATTCATTATATTTCTTTTTATCAGAATCAGACGTTTTCCTTTGACTTTAGTTTCTCTTATATATATTTTTTTCCGATTATCCATCACAATCAATTTCTGCTCTACTACATCTATTTTATCCCTTTGGACTTTGTCTATTCTATATCACCAATCCACCACCCTGCTAAGGAATTTGATTTTTGTTGGAGTTTTTTCATTCCAGAGCCTTAATCAAAGTATGCCCCACGTTCAACATATTATTAGTGTGAACATAAATTTTTACAATAAGAACCAATACCTTCTAATTTGCAGAATCTTGTATAGTATGCTTAGGTTTGAATTATGCTAGAGGAGAGAATCTGTTTCAGCTGCATTAAGCTCATTATCTGCTTAGATATAAAGATGCATGAACATACAGACAATCAAGTCTTAATGTTGGTTTTTATTGGCAGTTCTACTCTGCTGGGTCACCCAAGATTTTGAGCTTTATGCAGCATTTGATCCATTGGCTGATAAGGTATTTCTTTTTGTCAGATTTATGAGTATTCTGGTTGGCTAAAAAATTATTTGTATGAAGGCATAGTGACCATATTCTTGCATGTTGGTCTTCCATGATTGTTGATATGGTGAATTTTAATTCTATTCTTTTTTTCTTACAAGAAAATCACTTTTTGTTGATGCTTGAAAAGTTACACACTTGAAACTTCTAGCTAAACAATATGAACTCTAGCTATTAGAACCTCAACTCTTCAATACATGAACGGTTTATTGGATTTAACTCCTAATGTTAAAAACAAGAGCTTGGACATGATTTAGATGCACGAAGTTTGTGGAGGAATAGTTTCTCGTACTCATAATTTGAAATGAAGGAACGTAAGAATCTTGAAAGATAGTCTCCAAAACCTTATGAGTTATTTGGAGGATGACGTTTCATGTATCTTAGCTAGTTGGCTTTTCTGCTTATAGAGATTCTAGTTTGAGTGGTCTCTTTTACGGATCCACCTTTATGCGTTTTCTTGTGGTCAATTCTGATTTCATCCTCTCCTTTGAAAACAAATAAAAAGAAAAGGAAATTTTTTGATCGTTTACTTAAATTTGATACATGTCTTTCACGGTGCTATCCATGTTCAACATTAGAGATTTGGATAAGAAAGTTATCATTATCATCTTTATCGTTTCTTTTGCATACTTATTGATGGCAACATCAAGTCTTTCCATGCTGATTTCTTGTGCCTACACTCATTTTTTTAAAATAACAAATGAAACATGTAAACGATACTTTTCATTAAATAAATGNCTTTTTTTTTTTTTTTTTTTTGATAAGATACAAGTTCCTTGCTAAAAAATTCTGCATGGAAGGGAAATAAATAAAATTACAATGGAAATAGAAAGCGTCCCAATTTAAGCATTATCTTGTAAAGAAAAATCAACAGAAGACTTGGGAAGAGAACACCATGAGGCCTTTAATCTCGTGGACTTAAAAGATCTGACCATTTAAGTTGCTTGCTTTCGAGATTCTCTGATTTCTTTCCAACCATAATTCCACTTATAGAGTTTTCAGTGCATCAAACCATATGAATCTATGCTTATTGACTTCAAAAAATTGCATTAATCCATGGTAAGTTTGGTTTTGGCGCCAAAGAAATCCCGACTAGCAGTTGAAGAACGTTTTCCATTGTTATCCTTGGAAACAACAAAAAATAACCAATGCCTACACTCTTATTTGTGCTAATCTGATTATGACCGTGCTTGTATTGGGAGGATAGAGATCTGTTACAGAAGGTTTACTGAATCCTCCTCCATAATAAAGGAAGTTTTGGCAAGCTAGTTGTTTGTGCTCTAGATGGATTGTTTAGGGTAGAAGTAATCAAATTAGAGTTTGGAGCCTCTTCCATGGTTTTCTACTCT
TCCTTTCTTTTTCTTCCTTTTTTTTCCCTTCTTTATCACTGTTCACTGGAACACATCCTCGTAAATATTTATTTTTTCCCCCTATGTTCCCATTTTGTGTCTGTGTGTGCCTGTGGCCTTGTTGTTATGTTTCCTTGGTAATGTGAATTACATTCCCTTTGGGGGAAACGAAAGCTTTTCTGATTGTAGAACATATGATTCCGCAGGCTATGGCGATAAAGATCTGCAATCGAGTTTGTCAATGGATTAAGGATGTTGAAAATGAAGTTTTTCTTCTGGGTGCGAGCCCGTTTTCATGGTAACACTCCATGTAGTATAATTTTTGTGTATTCATGTTTATACACATCCACGGTTTTGTATTTTATTTATTGTAGACTGTAGGCTAGCTCATAAAACCTGTATTCAATTTAACTCCATCTACTGGAAAACTAGGTACATTATATTAGTGAGAATCTATGAAATGAAGATCTATTTTTGATAACTGTTGCAGTAGGCAGGTTAGCTCTTATGAATTAGGCTTTGAATAATAACAATTCTTTTGATAGTTGATGTGG

mRNA sequence

TAGCGATGCGCTTTCAGGTGTGATCGCATTTCCCTCTTCCTGATCATCAACATTGAGTTTTCAATCGGAGAACTGAATCGACAATTCAAGTTTTATTCCCAAAGATGTCTTCTGGGTTGAGTTCATTATCATCCAGTGATGAACTCGACAATTTCAACCCTAGAACATCACCCACTACGCCACCTAAACCTCTCGAAGAAGAATTGGCGTCTTTAGCATTGACTTTACCACCTCCTGAGCTGCTTTCCGACCAGGAGGATGTCAATGTCGTCTCAGATGGATCCGCGGCCGATGGCTTTGGGTTTGGAATTCAACGGAGTGATGAGGAATCAAGGGTCGGCGTAGTTTGTGAAGAGAATGTTGTGGGGAACTCGGCGGCGGCGGTGGTTGAGGGAGCGACCGGAAATTCGGTGCGGGAGGGTATTGATGGGACGGGGGTTGTGTGGGGGAGGACTAATTCGGAGATTGAGGTGGATAGACCGGTCAGTCCTAGCAGTAGTGGGTATGCAGGTGAAAGGGGAAGCAGCGGTGCTAGTAGCGGGAGGTCGGAGATGGATGGAATTGCTGATGACGAGATACAGGAACTGAATGATGATGCTTCTGTTGGTGATAATTCGAATTCTGTACACTCTTGGGTTCCTGGAAAGCGGCATGGTGATGAGTTAATTCTTAAAGTTCAGGATGATGCTTCCATATCATGGAGAAAAAGGAAGAAGCATTTCTTTGTTCTGAGCCATTCTGGAAAGCCTATTTATTCCAGGTATGGAGACGAGCACAAGCTAGCAGGATTTTCAGCCAGCTTGCAGGCAATCATTTCCTTTGTGGAGGATGGGGGTGATCGTGTCAAATGGGTTAGAGCTGGAAAACACCTGGTGGTCTTCCTTGTGAAGGGTCCAATTTACTTGGTTTGCATCAGCTGCACAGAAGAGCCTTATGAATCATTAAGAGGCCAGTTGGAACTTATATATGGTCAGATGATACTTATTCTAACAAAGTCTGTAAATAGATGTTTTGAGAGAAATCCCAAGTTTGATATGACTTCTCTTCTTGGAGGAACCGATGTTGTCTTCTCCTCTCTCATCCATTCATTTGGTTGGAACCCTGCTACTTTTCTTCATGCATACACTTGTCTTCCTCTCGCTTATGGTACAAGACAAGCTGCAGGTGCGATATTACAAGATGTTGCTGATTCTGGCATTCTTTTTGCAATTTTAATGTGCAAACATAAGGTTATCAGTCTTGTTGGTGCTCAAAAGGCTTCTCTTCATCCCGATGATATGTTACTACTTGCCAACTTTGTGATGTCATCAGAGTCCTTTAGGATTCGGATTGAGACTGTCCTTTTGAAGTCAAATGTTCTTAGTGAAGTTCAGAGATCTATGTTAGATGGTGGGATGCATGTTGAAGATGTGCCTGTTGATTCTTTGCCTCGCTATAGAACCATATCTCCTCATTTGGGCCAACAAAGAGTTCCATCAGAATTTACTGAAAGATTCAAGGAATCTTCTGCTGGGATGGGTGGTCCTGGTGGACTCTGGCATTTCATTTACCGCAGTATATATCTGGATCAGTACGTTGCTTCTGAATTTTCATCTCCAATTAGCAGTCGTCAACAACAGAAGAGACTGTACAGAGCATACCAAAATATTTATGATTCTATGCATGATAAAGAAATTGGCCCTCACAAAACCCAGTTTAGAAGAGATGAAAACTATGTTCTACTCTGCTGGGTCACCCAAGATTTTGAGCTTTATGCAGCATTTGATCCATTGGCTGATAAGGCTATGGCGATAAAGATCTGCAATCGAGTTTGTCAATGGATTAAGGATGTTGAAAATGAAGTTTTTCTTCTGGGTGCGAGCCCGTTTTCATGGTAACACTCCATGTAGTATAATTTTTGTGTATTCATGTTTATACACATCCACGGTTTTGTATTTTATTTATTGTAGACTGTAGGCTAGCTCATAAAACCTGTATTCAATTTAACTCCATCTACTGGA
AAACTAGGTACATTATATTAGTGAGAATCTATGAAATGAAGATCTATTTTTGATAACTGTTGCAGTAGGCAGGTTAGCTCTTATGAATTAGGCTTTGAATAATAACAATTCTTTTGATAGTTGATGTGG

Coding sequence (CDS)

ATGTCTTCTGGGTTGAGTTCATTATCATCCAGTGATGAACTCGACAATTTCAACCCTAGAACATCACCCACTACGCCACCTAAACCTCTCGAAGAAGAATTGGCGTCTTTAGCATTGACTTTACCACCTCCTGAGCTGCTTTCCGACCAGGAGGATGTCAATGTCGTCTCAGATGGATCCGCGGCCGATGGCTTTGGGTTTGGAATTCAACGGAGTGATGAGGAATCAAGGGTCGGCGTAGTTTGTGAAGAGAATGTTGTGGGGAACTCGGCGGCGGCGGTGGTTGAGGGAGCGACCGGAAATTCGGTGCGGGAGGGTATTGATGGGACGGGGGTTGTGTGGGGGAGGACTAATTCGGAGATTGAGGTGGATAGACCGGTCAGTCCTAGCAGTAGTGGGTATGCAGGTGAAAGGGGAAGCAGCGGTGCTAGTAGCGGGAGGTCGGAGATGGATGGAATTGCTGATGACGAGATACAGGAACTGAATGATGATGCTTCTGTTGGTGATAATTCGAATTCTGTACACTCTTGGGTTCCTGGAAAGCGGCATGGTGATGAGTTAATTCTTAAAGTTCAGGATGATGCTTCCATATCATGGAGAAAAAGGAAGAAGCATTTCTTTGTTCTGAGCCATTCTGGAAAGCCTATTTATTCCAGGTATGGAGACGAGCACAAGCTAGCAGGATTTTCAGCCAGCTTGCAGGCAATCATTTCCTTTGTGGAGGATGGGGGTGATCGTGTCAAATGGGTTAGAGCTGGAAAACACCTGGTGGTCTTCCTTGTGAAGGGTCCAATTTACTTGGTTTGCATCAGCTGCACAGAAGAGCCTTATGAATCATTAAGAGGCCAGTTGGAACTTATATATGGTCAGATGATACTTATTCTAACAAAGTCTGTAAATAGATGTTTTGAGAGAAATCCCAAGTTTGATATGACTTCTCTTCTTGGAGGAACCGATGTTGTCTTCTCCTCTCTCATCCATTCATTTGGTTGGAACCCTGCTACTTTTCTTCATGCATACACTTGTCTTCCTCTCGCTTATGGTACAAGACAAGCTGCAGGTGCGATATTACAAGATGTTGCTGATTCTGGCATTCTTTTTGCAATTTTAATGTGCAAACATAAGGTTATCAGTCTTGTTGGTGCTCAAAAGGCTTCTCTTCATCCCGATGATATGTTACTACTTGCCAACTTTGTGATGTCATCAGAGTCCTTTAGGATTCGGATTGAGACTGTCCTTTTGAAGTCAAATGTTCTTAGTGAAGTTCAGAGATCTATGTTAGATGGTGGGATGCATGTTGAAGATGTGCCTGTTGATTCTTTGCCTCGCTATAGAACCATATCTCCTCATTTGGGCCAACAAAGAGTTCCATCAGAATTTACTGAAAGATTCAAGGAATCTTCTGCTGGGATGGGTGGTCCTGGTGGACTCTGGCATTTCATTTACCGCAGTATATATCTGGATCAGTACGTTGCTTCTGAATTTTCATCTCCAATTAGCAGTCGTCAACAACAGAAGAGACTGTACAGAGCATACCAAAATATTTATGATTCTATGCATGATAAAGAAATTGGCCCTCACAAAACCCAGTTTAGAAGAGATGAAAACTATGTTCTACTCTGCTGGGTCACCCAAGATTTTGAGCTTTATGCAGCATTTGATCCATTGGCTGATAAGGCTATGGCGATAAAGATCTGCAATCGAGTTTGTCAATGGATTAAGGATGTTGAAAATGAAGTTTTTCTTCTGGGTGCGAGCCCGTTTTCATGGTAA

Protein sequence

MSSGLSSLSSSDELDNFNPRTSPTTPPKPLEEELASLALTLPPPELLSDQEDVNVVSDGSAADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRTNSEIEVDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVHSWVPGKRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAIISFVEDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVNRCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAILQDVADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFRIRIETVLLKSNVLSEVQRSMLDGGMHVEDVPVDSLPRYRTISPHLGQQRVPSEFTERFKESSAGMGGPGGLWHFIYRSIYLDQYVASEFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYAAFDPLADKAMAIKICNRVCQWIKDVENEVFLLGASPFSW
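
The protein sequence above is the frame-1 translation of the CDS. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1) and using only the first 30 and last 18 nucleotides of the CDS as input; the function name `translate` is illustrative:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 18 nt of the CDS listed in this record.
print(translate("ATGTCTTCTGGGTTGAGTTCATTATCATCC"))  # MSSGLSSLSS
print(translate("AGCCCGTTTTCATGGTAA"))              # SPFSW
```

Both fragments agree with the start and end of the protein sequence. The sketch raises `KeyError` on ambiguous bases such as `N`, so it fits the CDS but not the raw gene sequence.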
BLAST of Cp4.1LG20g07410 vs. Swiss-Prot
Match: MON1A_HUMAN (Vacuolar fusion protein MON1 homolog A OS=Homo sapiens GN=MON1A PE=1 SV=2)

HSP 1 Score: 197.6 bits (501), Expect = 3.8e-49
Identity = 126/427 (29.51%), Positives = 209/427 (48.95%), Query Frame = 1

Query: 180 GKRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAIISF 239
           G   GDE      +DA+ +WR  +KH FVLS +GKP+YSRYG E  L+     + A++SF
Sbjct: 138 GTTEGDE------EDATEAWRLHQKHVFVLSEAGKPVYSRYGSEEALSSTMGVMVALVSF 197

Query: 240 VEDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILT-KS 299
           +E   + ++ + A  + VVF+ + P+ LV ++ T +  + L  +L  IY Q++ +LT   
Sbjct: 198 LEADKNAIRSIHADGYKVVFVRRSPLVLVAVARTRQSAQELAQELLYIYYQILSLLTGAQ 257

Query: 300 VNRCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAILQ 359
           ++  F++   +D+  LL G++ +  +L+     +P+  + A  CLPLA   R    A LQ
Sbjct: 258 LSHIFQQKQNYDLRRLLSGSERITDNLLQLMARDPSFLMGAARCLPLAAAVRDTVSASLQ 317

Query: 360 DVADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR---IRIETVLLK 419
                 ++F+IL+ ++++++LV  +   LHP D+ LL N + SS SFR         L K
Sbjct: 318 QARARSLVFSILLARNQLVALVRRKDQFLHPIDLHLLFNLISSSSSFREGEAWTPVCLPK 377

Query: 420 SNVLSEVQRSMLDGGMHVEDVPVDSLPRYRTISPHLGQQRVPSEFTERFKES-------- 479
            N              H+  +  D+      +S         S+   RF+E         
Sbjct: 378 FNAAGFFH-------AHISYLEPDTDLCLLLVSTDREDFFAVSDCRRRFQERLRKRGAHL 437

Query: 480 -----------SAGMGGPGGLWHFIYRSIYLDQYVASEFSSPISSRQQQKRLYRAYQNIY 539
                      S    G   L HF+Y+S     + + E  +P +S ++Q+RL   YQ ++
Sbjct: 438 ALREALRTPYYSVAQVGIPDLRHFLYKSKSSGLFTSPEIEAPYTSEEEQERLLGLYQYLH 497

Query: 540 DSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYAAFDPLADKAMAIKICNRVCQWIKDV 584
              H+    P KT +    N  LL WVT  FELY  + PL  KA A+   +++ +WI+  
Sbjct: 498 SRAHNAS-RPLKTIYYTGPNENLLAWVTGAFELYMCYSPLGTKASAVSAIHKLMRWIRKE 550
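
The percentages on the HSP summary line are the match counts divided by the alignment length, which includes gap columns. A minimal sketch that recomputes them from the summary line of the hit above; the helper name `hsp_percentages` is hypothetical:

```python
import re

def hsp_percentages(line):
    """Return {label: percent} recomputed from 'Label = matches/length' fields."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

stats = hsp_percentages("Identity = 126/427 (29.51%), Positives = 209/427 (48.95%)")
print(stats)  # {'Identity': 29.51, 'Positives': 48.95}
```

The recomputed values match the percentages printed in the report.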

BLAST of Cp4.1LG20g07410 vs. Swiss-Prot
Match: MON1A_MOUSE (Vacuolar fusion protein MON1 homolog A OS=Mus musculus GN=Mon1a PE=1 SV=3)

HSP 1 Score: 197.6 bits (501), Expect = 3.8e-49
Identity = 124/415 (29.88%), Positives = 205/415 (49.40%), Query Frame = 1

Query: 192 QDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAIISFVEDGGDRVKWVR 251
           ++DA+ +WR  +KH FVLS +GKP+YSRYG E  L+     + A++SF+E   + ++ + 
Sbjct: 145 EEDATEAWRLHQKHVFVLSEAGKPVYSRYGSEEALSSTMGVMVALVSFLEADKNAIRSIH 204

Query: 252 AGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILT-KSVNRCFERNPKFD 311
           A  + VVF+ + P+ LV ++ T +  + L  +L  IY Q++ +LT   ++  F++   +D
Sbjct: 205 ADGYKVVFVRRSPLVLVAVARTRQSAQELAQELLYIYYQILSLLTGAQLSHIFQQKQNYD 264

Query: 312 MTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAILQDVADSGILFAIL 371
           +  LL G++ +  +L+     +P+  + A  CLPLA   R    A LQ      ++F+IL
Sbjct: 265 LRRLLSGSERITDNLLQLMARDPSFLMGAARCLPLAAAVRDTVSASLQQARARSLVFSIL 324

Query: 372 MCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR---IRIETVLLKSNVLSEVQRSML 431
           +  +++++LV  +   LHP D+ LL N + SS SFR         L K N          
Sbjct: 325 LAHNQLVALVRRKDQFLHPIDLHLLFNLISSSSSFREGEAWTPVCLPKFNAAGFFH---- 384

Query: 432 DGGMHVEDVPVDSLPRYRTISPHLGQQRVPSEFTERFKES-------------------S 491
               H+  +  D+      IS         S+   RF+E                    S
Sbjct: 385 ---AHISYLEPDTDLCLLLISTDREDFFAVSDCRRRFQERLRKRGTHLALREALRTPYYS 444

Query: 492 AGMGGPGGLWHFIYRSIYLDQYVASEFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHK 551
               G   L HF+Y+S     + + E  +P SS ++Q+RL   YQ ++   H+    P K
Sbjct: 445 VAQVGIPDLRHFLYKSKSSGLFTSPEIEAPYSSEEEQERLLGLYQYLHSRAHNAS-RPLK 504

Query: 552 TQFRRDENYVLLCWVTQDFELYAAFDPLADKAMAIKICNRVCQWIKDVENEVFLL 584
           T +    N  LL WVT  FELY  + PL  KA A+   +++ +WI+  E+ +F+L
Sbjct: 505 TIYYTGPNENLLAWVTGAFELYMCYSPLGTKASAVSAIHKLMRWIRKEEDRLFIL 551

BLAST of Cp4.1LG20g07410 vs. Swiss-Prot
Match: MON1A_CHICK (Vacuolar fusion protein MON1 homolog A OS=Gallus gallus GN=MON1A PE=2 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 2.5e-48
Identity = 155/546 (28.39%), Positives = 251/546 (45.97%), Query Frame = 1

Query: 56  VSDGSAADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWG 115
           V +GS A G G   +RS+  +       E   G   A  V   +   +    DG  VV  
Sbjct: 13  VPNGSLAPGDGQHAERSESPTPGLAQGTEPGAGQEGAMFVHTRSYEDLTSPEDGGAVV-- 72

Query: 116 RTNSEIEVDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVH 175
             + E     P  P+S     E+ S   S   +++ G+A D  +E+            + 
Sbjct: 73  -RSPEERRGEPAEPTSM----EQISKDFSELSTQLTGMALDLEEEMRQS-----QEGKLE 132

Query: 176 SWVPGKRHGDELILKVQDDASI-SWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQ 235
                 RH   L  K ++D ++ +WR  +KH FVLS +GKP+YSRYG E  L+     + 
Sbjct: 133 PSPQATRHDSVLSGKEEEDVTMDTWRMHRKHVFVLSEAGKPVYSRYGSEEALSSTMGVMM 192

Query: 236 AIISFVEDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILI 295
           A++SF+E   + ++ + A  + VVF+ + P+ LV ++ T +  + +  +L  IY Q++ +
Sbjct: 193 ALVSFLEAEKNAIRSIHADGYKVVFVRRSPLVLVAVARTRQSEQEIAHELLYIYYQILSL 252

Query: 296 LT-KSVNRCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAA 355
           LT   +N  F++   +D+  LL G++ +  +L+     +P+  + A  CLPLA   R A 
Sbjct: 253 LTWTQLNHIFQQKQNYDLRRLLAGSERITDNLLDLMAHDPSFLMGAVRCLPLAASVRDAV 312

Query: 356 GAILQDVADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR-----IR 415
              LQ      ++F+IL+  ++++SLV  +   LHP D+ LL N + SS SFR       
Sbjct: 313 STSLQQAKAKSLVFSILLSGNQLVSLVRKKDQFLHPIDLHLLFNLISSSSSFREGEAWTP 372

Query: 416 IETVLLKSNVLSEVQRSMLDGGMHV---------ED--VPVDSLPRYRTISPHLGQQRVP 475
           I      S+       S L+  M +         ED     D   R++      G     
Sbjct: 373 ICLPKFNSSGFFHAHISYLEQEMDLCLLLVSTDREDFFTVSDCKRRFQERLRRRGVHHAL 432

Query: 476 SEFTERFKESSAGMGGPGGLWHFIYRSIYLDQYVASEFSSPISSRQQQKRLYRAYQNIYD 535
            E       S A +G P  L HFIY+S     + + E  +P    ++++RL   YQ ++ 
Sbjct: 433 QEALRTPFYSVAQVGIP-DLRHFIYKSKSSGLFTSPEIEAPYVREEEKERLLGLYQYLHS 492

Query: 536 SMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYAAFDPLADKAMAIKICNRVCQWIKDVE 584
             H+    P K  +       LL WVT  FELY  + PL  KA AI   N++ +WI+  E
Sbjct: 493 RAHNSSC-PLKNIYFTGPRENLLAWVTSAFELYICYSPLGTKAGAISAVNKLMKWIRKEE 544

BLAST of Cp4.1LG20g07410 vs. Swiss-Prot
Match: MON1A_BOVIN (Vacuolar fusion protein MON1 homolog A OS=Bos taurus GN=MON1A PE=2 SV=1)

HSP 1 Score: 187.6 bits (475), Expect = 4.0e-46
Identity = 147/556 (26.44%), Positives = 251/556 (45.14%), Query Frame = 1

Query: 58  DGSAADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRT 117
           DG+ A   G  ++R++  +       E   G   A  V   +   + E  DG        
Sbjct: 15  DGTLAPSDGQSVERAESPTPGLAQGMEPGAGQEGAMFVHARSYEDLTESEDGAA------ 74

Query: 118 NSEIEVDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVHSW 177
           + E   +    P        + S   S   +++ G+A D  +E+   +S     +   + 
Sbjct: 75  SGESPKEGAGGPPPLATDMRQISQDFSELSTQLTGVARDLQEEMLPGSSEDWPESPGAAR 134

Query: 178 VP-------GKRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFS 237
            P       G   GDE      ++A+ +WR+ +KH FVLS +GKP+YSRYG E  L+   
Sbjct: 135 RPATEPPRDGAGEGDE------EEAAEAWRRHQKHVFVLSEAGKPVYSRYGSEEALSSTM 194

Query: 238 ASLQAIISFVEDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQ 297
             + A++SF+E   + ++ + A  + VVF+ + P+ LV ++ T +  + L  +L  IY Q
Sbjct: 195 GVMVALVSFLEADKNAIRSIHADGYKVVFVRRSPLVLVAVARTRQSAQELAQELLYIYYQ 254

Query: 298 MILILT-KSVNRCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGT 357
           ++ +LT   ++  F++   +D+  LL G++ +  +L+     +P+  + A  CLPLA   
Sbjct: 255 ILSLLTGAQLSHIFQQKQNYDLRRLLSGSERITDNLLQLMARDPSFLMGAARCLPLAAAV 314

Query: 358 RQAAGAILQDVADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR--- 417
           R    A LQ      ++F+IL+ ++++++LV  +   LHP D+ LL N + SS SFR   
Sbjct: 315 RDVVSASLQQARARSLVFSILLARNQLVALVRRKDQFLHPIDLHLLFNLISSSSSFREGE 374

Query: 418 IRIETVLLKSNVLSEVQRSMLDGGMHVEDVPVDSLPRYRTISPHLGQQRVPSEFTERFKE 477
                 L K N              H+  +  D+      +S         S+   RF+E
Sbjct: 375 AWTPVCLPKFNAAGFFH-------AHISYLEPDTDLCLLLVSTDREDFFAVSDCRRRFQE 434

Query: 478 S-------------------SAGMGGPGGLWHFIYRSIYLDQYVASEFSSPISSRQQQKR 537
                               S    G   L HF+Y+S     + + E  +P  S ++Q+R
Sbjct: 435 RLRKRGAHLALREALRTPYYSVAQVGVPDLRHFLYKSKSSGLFTSPEIEAPYDSEEEQER 494

Query: 538 LYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYAAFDPLADKAMAIKICN 584
           L   YQ ++   H+    P KT +    N  LL WVT  FELY  + PL  KA A+   +
Sbjct: 495 LLGLYQYLHSRAHNAS-RPLKTIYYTGPNENLLAWVTGAFELYMCYSPLGTKASAVSAIH 550

BLAST of Cp4.1LG20g07410 vs. Swiss-Prot
Match: MON1A_MACFA (Vacuolar fusion protein MON1 homolog A OS=Macaca fascicularis GN=MON1A PE=2 SV=2)

HSP 1 Score: 187.2 bits (474), Expect = 5.2e-46
Identity = 150/558 (26.88%), Positives = 252/558 (45.16%), Query Frame = 1

Query: 58  DGSAADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRT 117
           DG+     G  ++R++  +       E   G   A  V   +   + E  DG     G +
Sbjct: 15  DGTLTPSDGHSVERAESPTPGLAQGMEPGAGQEGAMFVHARSYEDLTESEDGAAS--GDS 74

Query: 118 NSEIEVDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQE---------LNDDASVG 177
             E     P  P+       + S   S   +++ G+A D  +E         L+   +VG
Sbjct: 75  PKEGARGPPPLPADM----RQISQDFSELSTQLTGVARDLQEEMLPGSSEDWLDPPGAVG 134

Query: 178 DNSNSVHSWVPGKRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAG 237
             +        G   GDE      +DA+ +WR  +KH FVLS +GKP+YSRYG E  L+ 
Sbjct: 135 RPATEPPR--EGTAEGDE------EDATEAWRLHQKHVFVLSEAGKPVYSRYGSEEALSS 194

Query: 238 FSASLQAIISFVEDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIY 297
               + A++SF+E   + ++ + A  + VVF+ + P+ LV ++ T +  + L  +L  IY
Sbjct: 195 TMGVMVALVSFLEADKNAIRSIHADGYKVVFVRRSPLVLVAVARTRQSAQELAQELLYIY 254

Query: 298 GQMILILT-KSVNRCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAY 357
            Q++ +LT   ++  F++   +D+  LL G++ +  +L+     +P+  + A  CLPLA 
Sbjct: 255 YQILSLLTGAQLSHIFQQKQNYDLRRLLSGSERITDNLLQLMARDPSFLMGAARCLPLAA 314

Query: 358 GTRQAAGAILQDVADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR- 417
             R    A LQ      ++F+IL+ ++++++LV  +   LHP D+ LL N + SS SFR 
Sbjct: 315 AVRDTVSASLQQARARSLVFSILLARNQLVALVRRKDQFLHPIDLHLLFNLISSSSSFRE 374

Query: 418 --IRIETVLLKSNVLSEVQRSMLDGGMHVEDVPVDSLPRYRTISPHLGQQRVPSEFTERF 477
                   L K N              H+  +  D+      +S         S+   RF
Sbjct: 375 GEAWTPVCLPKFNAAGFFH-------AHISYLEPDTDLCLLFVSTDREDFFAVSDCRRRF 434

Query: 478 KES-------------------SAGMGGPGGLWHFIYRSIYLDQYVASEFSSPISSRQQQ 537
           +E                    S    G   L HF+Y+S     + + E  +P +S ++Q
Sbjct: 435 QERLRKRGAHLALREALRTPYYSVAQVGIPDLRHFLYKSKSSGLFTSPEIEAPYTSEEEQ 494

Query: 538 KRLYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYAAFDPLADKAMAIKI 584
           +RL   YQ ++   H+    P KT +    N  LL WVT  FELY  + PL  KA A+  
Sbjct: 495 ERLLGLYQYLHSRAHNAS-RPLKTIYYTGPNENLLAWVTGAFELYMCYSPLGTKASAVSA 550

BLAST of Cp4.1LG20g07410 vs. TrEMBL
Match: A0A059D684_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B02502 PE=4 SV=1)

HSP 1 Score: 741.5 bits (1913), Expect = 7.9e-211
Identity = 405/637 (63.58%), Positives = 456/637 (71.59%), Query Frame = 1

Query: 8   LSSSDELDNFNPRTSPTTPP-----KPLEEELASLALTLPPPELLSDQEDVNVVSDGSAA 67
           +SSSD  D+ +  +   + P     KP+E+  + L +T             ++ + G+A 
Sbjct: 1   MSSSDGGDDGSASSGAGSGPVVLGSKPIEDGFSQLRVT-------------DLANGGAAN 60

Query: 68  DGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRTNSEIE 127
           DG G          R G+  E   V +  AA    A G S  E I    V+W R+NSE E
Sbjct: 61  DGDGEAAAAEGSGGREGIAAEVGEVEDGGAASAR-AGGRSEIEEIGEARVMW-RSNSEGE 120

Query: 128 VDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVHSWVPGKR 187
            +   SPSSSGYAGERGSS   SG  E +G  +DEI+E+  DA     S+S  +W+PGKR
Sbjct: 121 GEAQGSPSSSGYAGERGSSSGGSGIGEEEGEEEDEIEEVRSDAV----SDSQAAWMPGKR 180

Query: 188 HGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAIISFVED 247
           H       +QDDASISWRKRKKHFF+LSHSGKPIYSRYGDEHKLAGFSA+LQAIISFVE+
Sbjct: 181 H------VMQDDASISWRKRKKHFFILSHSGKPIYSRYGDEHKLAGFSATLQAIISFVEN 240

Query: 248 GGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVNRC 307
           GGDRVK VRAGKH VVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVNRC
Sbjct: 241 GGDRVKLVRAGKHQVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVNRC 300

Query: 308 FERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAILQDVAD 367
           FE+NPKFDMT LLGGTDVVFSSLIHSF WNPATFLHAYTCLPL Y TRQAAGA+LQDVAD
Sbjct: 301 FEKNPKFDMTPLLGGTDVVFSSLIHSFSWNPATFLHAYTCLPLGYATRQAAGAVLQDVAD 360

Query: 368 SGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR---------------- 427
           SG+LFAILMCKHKV+SLVGAQKASLHPDDMLLLANFVMSSESFR                
Sbjct: 361 SGVLFAILMCKHKVVSLVGAQKASLHPDDMLLLANFVMSSESFRTSENFSPICLPRYNPM 420

Query: 428 ---------IRIETVLLKSN------------------------VLSEVQRSMLDGGMHV 487
                    + +ET L+                           VLSEVQRSMLDGGM V
Sbjct: 421 AFLYAYVHYLDVETYLMLLTTSSDAFYHLKDCRMRIETVLLKSNVLSEVQRSMLDGGMRV 480

Query: 488 EDVPVDSLPRYRTISPHLGQQRVPSEFTERFKESSAGMGGPGGLWHFIYRSIYLDQYVAS 547
           ED+P+D LPR  ++ P LGQ R  ++  E  KE+ AG+GGP GLWHFIYRSIYLDQYVAS
Sbjct: 481 EDLPIDPLPRSGSLLPRLGQGRPVTDSQESLKEAYAGIGGPAGLWHFIYRSIYLDQYVAS 540

Query: 548 EFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYAAF 591
           EFS P ++ +QQKRLYRAYQ +Y SMHDK IG HKTQFRRDE+YVLLCWVTQDFELYAAF
Sbjct: 541 EFSPPFNTLRQQKRLYRAYQKMYASMHDKGIGAHKTQFRRDEHYVLLCWVTQDFELYAAF 600

BLAST of Cp4.1LG20g07410 vs. TrEMBL
Match: D7SKY8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g02820 PE=4 SV=1)

HSP 1 Score: 731.5 bits (1887), Expect = 8.2e-208
Identity = 407/643 (63.30%), Positives = 460/643 (71.54%), Query Frame = 1

Query: 1   MSSGLSSLSSSDELDNFNPRTSPTTPP-KPLEEELASLALTLP--PPELLSDQEDVNVVS 60
           MSS  SS  S+D   + NP  SPT  P   L++ LAS+ALT P    E  SDQE    V+
Sbjct: 1   MSSDSSSSISNDGSTDQNPNPSPTAKPLDSLQDRLASIALTEPNGGAESPSDQEPQAGVA 60

Query: 61  DGSAADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRT 120
           +GS ++              +  V + N    S A V E      V E     GVVW R 
Sbjct: 61  NGSFSE-------------EIQEVVQNNQAAGSEAVVEE------VSESFT-HGVVW-RD 120

Query: 121 NSEIEVDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVHSW 180
           NSE EVD P SPSSSGYAGERGSS A+S     +G  +DEI E+ +D SV   S+   SW
Sbjct: 121 NSEHEVDAPSSPSSSGYAGERGSSSATSESGIGEG-GEDEILEVRNDDSVDGVSDLQQSW 180

Query: 181 VPGKRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAII 240
           VPGKRH DE      DDASISWRKRKKHFF+LSHSGKPIYSRYGDEHKLAGFSA+LQAII
Sbjct: 181 VPGKRHVDE------DDASISWRKRKKHFFILSHSGKPIYSRYGDEHKLAGFSATLQAII 240

Query: 241 SFVEDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTK 300
           SFVE+GGDRV+ +RAGKH VVFLVKGPIYLVCISCTEEPYESLR QLELIYGQM+LILTK
Sbjct: 241 SFVENGGDRVQLIRAGKHQVVFLVKGPIYLVCISCTEEPYESLRSQLELIYGQMLLILTK 300

Query: 301 SVNRCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAIL 360
           SVNRCFE+NPKFDMT LLGGTDVVFSSLIHSF WNPATFLHAYTCLPLAY TRQA+GAIL
Sbjct: 301 SVNRCFEKNPKFDMTPLLGGTDVVFSSLIHSFNWNPATFLHAYTCLPLAYATRQASGAIL 360

Query: 361 QDVADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR----------- 420
           QDVADSG+LFAILMCKHKVISLVGAQKASLHPDDMLLL+NFVMSSESFR           
Sbjct: 361 QDVADSGVLFAILMCKHKVISLVGAQKASLHPDDMLLLSNFVMSSESFRTSESFSPICLP 420

Query: 421 --------------IRIETVLLKSNVLSEVQRS------------------------MLD 480
                         + ++T L+     S+                            +LD
Sbjct: 421 RYNPMAFLYAYVHYLDVDTYLMLLTTKSDAFYHLKDCRLRIETVLLKSNVLSEVQRSLLD 480

Query: 481 GGMHVEDVPVDSLPRYRTISPHLGQQRVPSEFTERFKESSAGMGGPGGLWHFIYRSIYLD 540
           GGM VED+PVD+ PR   +S HLGQ ++P++  E  +E   G+GGP GLWHFIYRSIYLD
Sbjct: 481 GGMRVEDLPVDTSPRSGILSAHLGQHKLPTDSPETSREECIGVGGPFGLWHFIYRSIYLD 540

Query: 541 QYVASEFSSPISSRQQQKRLYRAYQNIYDSMHDKEIG-PHKTQFRRDENYVLLCWVTQDF 591
           QYV+SEFS PI+S +QQKRLYRAYQ +Y SMHD+ +G PHKTQFRRDENYVLLCWVT +F
Sbjct: 541 QYVSSEFSPPINSSRQQKRLYRAYQKLYASMHDRGVGPPHKTQFRRDENYVLLCWVTPEF 600

BLAST of Cp4.1LG20g07410 vs. TrEMBL
Match: W9RV66_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_007648 PE=4 SV=1)

HSP 1 Score: 729.9 bits (1883), Expect = 2.4e-207
Identity = 399/600 (66.50%), Positives = 461/600 (76.83%), Query Frame = 1

Query: 2   SSGLSSLSSSDELDNFNPRTSPTTPPKPLEEELASLALTLPP---PELLSDQEDV-NVVS 61
           SS  SS +S+DE  + +P +     P+ +E   +S+ALT P    P+L ++ ++V N V 
Sbjct: 4   SSDSSSSASADENADRDPGSGS---PESIEVRFSSVALTEPDAEFPDLAAEHQEVENGVD 63

Query: 62  DGSAADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRT 121
             SA +     I+ +DEE  V +  EE +  NS  A         V EG+   GVV GRT
Sbjct: 64  YASAGESDRQLIEGNDEEELV-IAAEEEIQSNSGEA--------EVSEGLARGGVVLGRT 123

Query: 122 NSEIEVDRPVSPSSSGYAGERGSSGASSGR-SEMDGIADDEIQELNDDASVGDNSNSVHS 181
            SE++V+ P SPSSSGYAGERGSSGASSG  S +D + DDEIQE+  DA V    +S  +
Sbjct: 124 ISELDVEEPSSPSSSGYAGERGSSGASSGGGSGIDEVGDDEIQEVRSDA-VDGVLDSGAT 183

Query: 182 WVPGKRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAI 241
           W PGKRH DE      DDAS+SWRKRKKHFF+LSHSGKPIYSRYGDEHKLAGFSA+LQAI
Sbjct: 184 WAPGKRHPDE------DDASVSWRKRKKHFFILSHSGKPIYSRYGDEHKLAGFSATLQAI 243

Query: 242 ISFVEDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILT 301
           ISFVE+GGDRVK VRAGKH VVFLVKGPIYLVCISCTEEPYESLRGQLEL+YGQMILILT
Sbjct: 244 ISFVENGGDRVKLVRAGKHQVVFLVKGPIYLVCISCTEEPYESLRGQLELMYGQMILILT 303

Query: 302 KSVNRCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAI 361
           KSVNRCFE+NPKFDMT LLGGTD+VFSSL+H F WNPATFLHAYTCLPLA+ TRQAAGAI
Sbjct: 304 KSVNRCFEKNPKFDMTPLLGGTDIVFSSLVHLFSWNPATFLHAYTCLPLAFATRQAAGAI 363

Query: 362 LQDVADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESF------RIRIE 421
           LQDVADSG+LFAILMCKHK  + +           MLL  N    S++F      R+ IE
Sbjct: 364 LQDVADSGVLFAILMCKHKEDTYL-----------MLLTTN----SDAFYHLKDCRMCIE 423

Query: 422 TVLLKSNVLSEVQRSMLDGGMHVEDVPVDSLPRYRTISPHLGQQRVPSEFTERFKESSAG 481
            VLLKSNVLSEVQRS L+GGMHVED+PVD LPR  ++S   G  ++P++  ERF+E+   
Sbjct: 424 KVLLKSNVLSEVQRSTLEGGMHVEDLPVDPLPRSGSLS-RWGHAKLPTDSPERFRETCDC 483

Query: 482 MGGPGGLWHFIYRSIYLDQYVASEFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHKTQ 541
           +GGP GLWHF+YRSI+LDQYV+SEFS+PISSR QQKRLYRAYQ +Y SMHD  IGPHKTQ
Sbjct: 484 VGGPAGLWHFMYRSIFLDQYVSSEFSAPISSRGQQKRLYRAYQKLYASMHDGGIGPHKTQ 543

Query: 542 FRRDENYVLLCWVTQDFELYAAFDPLADKAMAIKICNRVCQWIKDVENEVFLLGASPFSW 591
           FRRDENYVLLCWVTQDFELYAAFDPLADKA+AIK CNRVCQW+KDVENE+FLLGASPFSW
Sbjct: 544 FRRDENYVLLCWVTQDFELYAAFDPLADKALAIKTCNRVCQWVKDVENEIFLLGASPFSW 568

BLAST of Cp4.1LG20g07410 vs. TrEMBL
Match: A0A0B0MFR9_GOSAR (Protein SAND OS=Gossypium arboreum GN=F383_15283 PE=4 SV=1)

HSP 1 Score: 720.3 bits (1858), Expect = 1.9e-204
Identity = 374/530 (70.57%), Positives = 416/530 (78.49%), Query Frame = 1

Query: 124 DRPVSPSSSGYAGERGSSGASS-----GRSEMDGIADDEIQELNDDASV-GDNSNSVHSW 183
           +RP SPSSSGYAGERGSS AS+     G SE+DG   DEIQE+ +D S+ G +     +W
Sbjct: 82  ERPSSPSSSGYAGERGSSSASTASRIDGASEVDG---DEIQEVRNDCSLEGFSDTQASAW 141

Query: 184 VPGKRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAII 243
           VPGKRH DE      DD SISWRKRKKHFF+LS+SGKPIYSRYGDEHKLAGFSA+LQAII
Sbjct: 142 VPGKRHVDE------DDGSISWRKRKKHFFILSNSGKPIYSRYGDEHKLAGFSATLQAII 201

Query: 244 SFVEDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTK 303
           SFVE+GGDRVK V+AGKH VVFLVKGPIYLVCISCTEEP+ESL+GQLELIYGQMILILTK
Sbjct: 202 SFVENGGDRVKLVKAGKHQVVFLVKGPIYLVCISCTEEPFESLKGQLELIYGQMILILTK 261

Query: 304 SVNRCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAIL 363
           S+NRCFE+NPKFDMT LL GTDVVFSSLIHSF WNPATF+HAYTCLPLAY TRQAAGAIL
Sbjct: 262 SINRCFEKNPKFDMTPLLRGTDVVFSSLIHSFSWNPATFIHAYTCLPLAYATRQAAGAIL 321

Query: 364 QDVADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR----------- 423
           QDVADSG+LFAILMCKHKVISLVGAQKASLHPDDMLLL+NFVMSSESFR           
Sbjct: 322 QDVADSGVLFAILMCKHKVISLVGAQKASLHPDDMLLLSNFVMSSESFRTAESFSPICLP 381

Query: 424 --------------IRIETVLLKSNV------------------------LSEVQRSMLD 483
                         + ++T L+                            LSEVQRSM+D
Sbjct: 382 RYNPMAFLYAYVNFLDVDTYLILLTTRSDAFYHLKDCRIRIELVLSKSNVLSEVQRSMID 441

Query: 484 GGMHVEDVPVDSLPRYRTISPHLGQQRVPSEFTERF--------KESSAGMGGPGGLWHF 543
           GGMHVED+P+D LPR  + SPHLGQQR+P++  ER         +E   G+GGP GLWHF
Sbjct: 442 GGMHVEDLPLDPLPRSGS-SPHLGQQRLPTDSPERLPTDSPKRPREPFIGIGGPAGLWHF 501

Query: 544 IYRSIYLDQYVASEFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLL 591
           IYRSI+L+QYV+SEFS P+SS QQQKRLYRAYQ ++DSMHDK IGPHKTQFRRDENYVLL
Sbjct: 502 IYRSIFLEQYVSSEFSPPLSSPQQQKRLYRAYQRLHDSMHDKGIGPHKTQFRRDENYVLL 561

BLAST of Cp4.1LG20g07410 vs. TrEMBL
Match: A0A067L9X7_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_05515 PE=4 SV=1)

HSP 1 Score: 716.5 bits (1848), Expect = 2.7e-203
Identity = 395/639 (61.82%), Positives = 454/639 (71.05%), Query Frame = 1

Query: 9   SSSDELDNFNPRTSPTTPPKPLEEELASLALTLPPPELLSDQEDVNVVSDGSAADGFGFG 68
           S+SD   +     +P   PKPLE +  S+ L      + +D+  V  + + +       G
Sbjct: 3   SASDFSSSSGDDPNPIPNPKPLENQFESVTLEESNSIIQNDEVSVQPLEESN-------G 62

Query: 69  IQRSDEESRVGVVCE------ENVVGNSAAAVVEGATGNSVREGIDGT--GVVWGRTNSE 128
           I ++DE   V           ++ V +    +  G  G     G+D    G+VW RTNSE
Sbjct: 63  IIQNDEALAVPQQVPSLNGSLDDHVNHEEERIEVGKCG-----GVDSGSGGIVW-RTNSE 122

Query: 129 IEVDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVHSWVPG 188
           +EVD P SPSSSGYAGERGSS A+S  S +  +++DEIQE+ +D  V    +S  +WVPG
Sbjct: 123 VEVDGPSSPSSSGYAGERGSSSATSA-SRIGEVSEDEIQEVGNDGRVDGVLDSQAAWVPG 182

Query: 189 KRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAIISFV 248
           KRH DE      DDASISWRKRKKHFF+LSHSGKPIYSRYGDEH+LAGFSA+LQAIISFV
Sbjct: 183 KRHVDE------DDASISWRKRKKHFFILSHSGKPIYSRYGDEHRLAGFSATLQAIISFV 242

Query: 249 EDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVN 308
           E+GGDRVK VRAGKH VVFLVKGPIYLVCISCT+EPYESLRGQLELIYGQMILILTKSVN
Sbjct: 243 ENGGDRVKLVRAGKHQVVFLVKGPIYLVCISCTDEPYESLRGQLELIYGQMILILTKSVN 302

Query: 309 RCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAILQDV 368
           RCFERN KFDMT LLGGTDVVFSSLIHSF WNPATFLHAYTCLPLAY TRQAA AILQDV
Sbjct: 303 RCFERNSKFDMTPLLGGTDVVFSSLIHSFSWNPATFLHAYTCLPLAYATRQAACAILQDV 362

Query: 369 ADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR-------------- 428
           ADSG+LFAILMCKHKV+SLVGAQKASLHPDDMLLL+NF+MSSESFR              
Sbjct: 363 ADSGVLFAILMCKHKVVSLVGAQKASLHPDDMLLLSNFIMSSESFRTSESFSPICLPRYN 422

Query: 429 -----------IRIETVLLKSNVLSEVQRSMLDGGMHVEDVPVDS--LPRYR-------- 488
                      + ++T L+     S+    + D  + +E V + S  L   +        
Sbjct: 423 PMAFLYAYVHYLDVDTYLMLLTTSSDAFYHLKDCRIRIEMVLLKSSVLSEVQRSMLDGGM 482

Query: 489 --------------TISPHLGQQRVPSEFTERFKESSAGMGGPGGLWHFIYRSIYLDQYV 548
                         T SPHLGQ R+P++  ERF+ES  G+GGP GLWHFIYRSIYLDQYV
Sbjct: 483 HVEDLPGDPLPRSGTASPHLGQHRLPTDSPERFRESYVGIGGPAGLWHFIYRSIYLDQYV 542

Query: 549 ASEFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYA 591
           +SEFSSPI+S QQQKRLYRAYQ +Y SMHDK  GPHKTQFRRDENYVLLCWVT DFELYA
Sbjct: 543 SSEFSSPINSPQQQKRLYRAYQKLYASMHDKGNGPHKTQFRRDENYVLLCWVTPDFELYA 602

BLAST of Cp4.1LG20g07410 vs. TAIR10
Match: AT2G28390.1 (AT2G28390.1 SAND family protein)

HSP 1 Score: 431.0 bits (1107), Expect = 1.2e-120
Identity = 241/411 (58.64%), Positives = 289/411 (70.32%), Query Frame = 1

Query: 2   SSGLSSLSSSD-ELDNFNPRTSPTTPPKPLEEELASLALTLPPPELLSDQEDVNVVSDGS 61
           S   SS SSSD E  + NP + P T  + ++ +L S+ L+ P           + VSDGS
Sbjct: 4   SDSRSSPSSSDTEFADPNPSSDPETNSERVQSQLESMNLSQP-----------SEVSDGS 63

Query: 62  AADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGT--GVVWGRTN 121
             +  G G    DE +           GN        + G  +REG+ GT  G V  R  
Sbjct: 64  HTEFSGGGDDNDDEVASAN--------GNEGGV----SNGGLLREGVAGTSGGEVLLRAE 123

Query: 122 SEIEVD---RPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVH 181
           + +E++    P SP+SSGY GERGSSG ++   + D  ++DEI+E N D        +  
Sbjct: 124 NPVEMEAGEEPPSPTSSGYDGERGSSGGATSTYKADDGSEDEIREANVDGDTASQHEA-- 183

Query: 182 SWVPGKRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQA 241
           +W+PGKRH DE      DDAS SWRKRKKHFF+LS+SGKPIYSRYGDEHKLAGFSA+LQA
Sbjct: 184 AWLPGKRHVDE------DDASTSWRKRKKHFFILSNSGKPIYSRYGDEHKLAGFSATLQA 243

Query: 242 IISFVEDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILIL 301
           IISFVE+GGDRV  V+AG H VVFLVKGPIYLVCISCT+E YE LRGQL+L+YGQMILIL
Sbjct: 244 IISFVENGGDRVNLVKAGNHQVVFLVKGPIYLVCISCTDETYEYLRGQLDLLYGQMILIL 303

Query: 302 TKSVNRCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGA 361
           TKS++RCFE+N KFDMT LLGGTD VFSSL+HSF WNPATFLHAYTCLPL Y  RQA G 
Sbjct: 304 TKSIDRCFEKNAKFDMTPLLGGTDAVFSSLVHSFSWNPATFLHAYTCLPLPYALRQATGT 363

Query: 362 ILQDVADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR 407
           ILQ+V  SG+LF++LMC+HKV+SL GAQKASLHPDD+LLL+NFVMSSESFR
Sbjct: 364 ILQEVCASGVLFSLLMCRHKVVSLAGAQKASLHPDDLLLLSNFVMSSESFR 383

BLAST of Cp4.1LG20g07410 vs. NCBI nr
Match: gi|658028640|ref|XP_008349751.1| (PREDICTED: protein SAND-like isoform X1 [Malus domestica])

HSP 1 Score: 747.3 bits (1928), Expect = 2.1e-212
Identity = 407/639 (63.69%), Positives = 456/639 (71.36%), Query Frame = 1

Query: 1   MSSGLSSLSSSDELDNFNPRTSPTTPPKPLEEELASLALTLPPPELLSDQEDVNVVSDGS 60
           MSS   + +S DE   + P  +P   PKP E++LA+LAL+             N V+ GS
Sbjct: 1   MSSETGTSTSGDEA--WEPNPNPI--PKPPEDQLAALALS-------EADSHANGVAHGS 60

Query: 61  AADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRTNSE 120
           A       I+  +EE R   V           A VE A+    R GI G G+VW RTNSE
Sbjct: 61  AEGNNHQEIENEEEEVRANSVAP-------GLAEVEEASEGLPRGGIGG-GIVWARTNSE 120

Query: 121 IEVDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVHSWVPG 180
           +EVD P SPSSSGYAGERGSS ASSG     G   DEI E+ +D      S+S   WVPG
Sbjct: 121 LEVDGPSSPSSSGYAGERGSSSASSGA----GSGVDEILEVRNDEIADGFSDSXTPWVPG 180

Query: 181 KRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAIISFV 240
           KRH DE      DDASISWRKRKKHFF+LSHSGKPIYSRYGDEHKLAGFSA+LQAIISFV
Sbjct: 181 KRHVDE------DDASISWRKRKKHFFILSHSGKPIYSRYGDEHKLAGFSATLQAIISFV 240

Query: 241 EDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVN 300
           E+GGD VK VRAGKH V+FLVKGPIYLVCISCTEEPYESLR QLELIYGQM+LILTKSVN
Sbjct: 241 ENGGDHVKLVRAGKHQVIFLVKGPIYLVCISCTEEPYESLRVQLELIYGQMLLILTKSVN 300

Query: 301 RCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAILQDV 360
           RCFE+NPKFDMT LLGGTD+VF SLIHSF WNPATFLHAYTCLPLAY TRQAAGAIL DV
Sbjct: 301 RCFEKNPKFDMTPLLGGTDIVFXSLIHSFSWNPATFLHAYTCLPLAYATRQAAGAILHDV 360

Query: 361 ADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR-------------- 420
           ADSG+LF ILMCKHKVISLVGAQKASLHPDDMLLL+NFVM+SESFR              
Sbjct: 361 ADSGVLFTILMCKHKVISLVGAQKASLHPDDMLLLSNFVMASESFRTSESFSPICLPRYN 420

Query: 421 -----------IRIETVLLKSNVLSEVQRSMLD------------------------GGM 480
                      + ++T L+     S+    + D                        GGM
Sbjct: 421 PMAFLYAYVHFLDVDTYLMLLTTSSDAFYHLKDCRIRIESVLLKSSVLSEIQRSTVDGGM 480

Query: 481 HVEDVPVDSLPRYRTISPHLGQQRVPSEFTERFKESSAGMGGPGGLWHFIYRSIYLDQYV 540
            VE++P+D LPR  + SPHLGQ  VP++  +RF+E   G+GGP GLWHFIYRSI+LDQYV
Sbjct: 481 RVEELPLDPLPRSGSFSPHLGQHTVPTDSPDRFREPYIGVGGPAGLWHFIYRSIFLDQYV 540

Query: 541 ASEFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYA 591
           +SEFS PISS +QQKRLYRAYQ +Y SMHDK IGPHKTQFRRDENYVLLCWVTQDFELYA
Sbjct: 541 SSEFSPPISSPRQQKRLYRAYQKLYASMHDKGIGPHKTQFRRDENYVLLCWVTQDFELYA 600

BLAST of Cp4.1LG20g07410 vs. NCBI nr
Match: gi|657986383|ref|XP_008385318.1| (PREDICTED: protein SAND isoform X1 [Malus domestica])

HSP 1 Score: 746.9 bits (1927), Expect = 2.7e-212
Identity = 407/639 (63.69%), Positives = 456/639 (71.36%), Query Frame = 1

Query: 1   MSSGLSSLSSSDELDNFNPRTSPTTPPKPLEEELASLALTLPPPELLSDQEDVNVVSDGS 60
           MSS   + +S DE   + P  +P   PKP E++LA+LAL+             N V+ GS
Sbjct: 1   MSSETGTSTSGDEA--WEPNPNPI--PKPPEDQLAALALS-------EADSHANGVAHGS 60

Query: 61  AADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRTNSE 120
           A       I+  +EE R   V           A VE A+    R GI G G+VW RTNSE
Sbjct: 61  AEGNNHQEIENEEEEVRANSVAP-------GLAEVEEASEGLPRGGIGG-GIVWARTNSE 120

Query: 121 IEVDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVHSWVPG 180
           +EVD P SPSSSGYAGERGSS ASSG     G   DEI E+ +D      S+S   WVPG
Sbjct: 121 LEVDGPSSPSSSGYAGERGSSSASSGA----GSGVDEILEVRNDEIADGFSDSQTPWVPG 180

Query: 181 KRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAIISFV 240
           KRH DE      DDASISWRKRKKHFF+LSHSGKPIYSRYGDEHKLAGFSA+LQAIISFV
Sbjct: 181 KRHVDE------DDASISWRKRKKHFFILSHSGKPIYSRYGDEHKLAGFSATLQAIISFV 240

Query: 241 EDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVN 300
           E+GGD VK VRAGKH V+FLVKGPIYLVCISCTEEPYESLR QLELIYGQM+LILTKSVN
Sbjct: 241 ENGGDHVKLVRAGKHQVIFLVKGPIYLVCISCTEEPYESLRVQLELIYGQMLLILTKSVN 300

Query: 301 RCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAILQDV 360
           RCFE+NPKFDMT LLGGTD+VF SLIHSF WNPATFLHAYTCLPLAY TRQAAGAIL DV
Sbjct: 301 RCFEKNPKFDMTPLLGGTDIVFXSLIHSFSWNPATFLHAYTCLPLAYATRQAAGAILHDV 360

Query: 361 ADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR-------------- 420
           ADSG+LF ILMCKHKVISLVGAQKASLHPDDMLLL+NFVM+SESFR              
Sbjct: 361 ADSGVLFTILMCKHKVISLVGAQKASLHPDDMLLLSNFVMASESFRTSESFSPICLPRYN 420

Query: 421 -----------IRIETVLLKSNVLSEVQRSMLD------------------------GGM 480
                      + ++T L+     S+    + D                        GGM
Sbjct: 421 PMAFLYAYVHFLDVDTYLMLLTTSSDAFYHLKDCRIRIESVLLKSSVLSEIQRSTVDGGM 480

Query: 481 HVEDVPVDSLPRYRTISPHLGQQRVPSEFTERFKESSAGMGGPGGLWHFIYRSIYLDQYV 540
            VE++P+D LPR  + SPHLGQ  VP++  +RF+E   G+GGP GLWHFIYRSI+LDQYV
Sbjct: 481 RVEELPLDPLPRSGSFSPHLGQHTVPTDSPDRFREPYIGVGGPAGLWHFIYRSIFLDQYV 540

Query: 541 ASEFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYA 591
           +SEFS PISS +QQKRLYRAYQ +Y SMHDK IGPHKTQFRRDENYVLLCWVTQDFELYA
Sbjct: 541 SSEFSPPISSPRQQKRLYRAYQKLYASMHDKGIGPHKTQFRRDENYVLLCWVTQDFELYA 600

BLAST of Cp4.1LG20g07410 vs. NCBI nr
Match: gi|658039226|ref|XP_008355184.1| (PREDICTED: protein SAND-like [Malus domestica])

HSP 1 Score: 745.7 bits (1924), Expect = 6.0e-212
Identity = 408/639 (63.85%), Positives = 459/639 (71.83%), Query Frame = 1

Query: 1   MSSGLSSLSSSDELDNFNPRTSPTTPPKPLEEELASLALTLPPPELLSDQEDVNVVSDGS 60
           MSS   + +S DE     P  +P   PKP E++LA+LAL+             N V+ GS
Sbjct: 1   MSSESGASTSGDEXSE--PNXNPDHTPKPPEDQLATLALS-------EVDSHGNGVAHGS 60

Query: 61  AADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRTNSE 120
           AA+     I+  +EE       + N V    A V E + G S R G+ G GVVW RTNSE
Sbjct: 61  AAENNRQEIENDEEE------VQANSVAPGLAEVXEASEG-SPRGGVGG-GVVWARTNSE 120

Query: 121 IEVDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVHSWVPG 180
           +E+D P SPSSSGYAGERGSS +S G     GI  DEI E+ +D  V    +S   WVPG
Sbjct: 121 LEIDGPSSPSSSGYAGERGSSASSGG----SGI--DEISEVRNDEIVDGFPDSQTPWVPG 180

Query: 181 KRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAIISFV 240
           KRH DE      DDASISWRKRKKHFF+LSHSGKPIYSRYGDEHKLAGFSA+LQAIISFV
Sbjct: 181 KRHVDE------DDASISWRKRKKHFFILSHSGKPIYSRYGDEHKLAGFSATLQAIISFV 240

Query: 241 EDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVN 300
           E+GGD VK VRAGKH V+FLVKGPIYLVCISCTEEPYESLR QLELIYGQM+LILTKSVN
Sbjct: 241 ENGGDXVKLVRAGKHQVIFLVKGPIYLVCISCTEEPYESLRVQLELIYGQMLLILTKSVN 300

Query: 301 RCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAILQDV 360
           RCFE+NPKFDMT LLGGTD+VFSSLIHSF WNPATFLHAYTCLPLAY TRQAAGAILQDV
Sbjct: 301 RCFEKNPKFDMTPLLGGTDIVFSSLIHSFSWNPATFLHAYTCLPLAYATRQAAGAILQDV 360

Query: 361 ADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR-------------- 420
           ADSG+LFAILMCKHKVI LVGAQKASLHPDDMLLL+NFVM+SESFR              
Sbjct: 361 ADSGVLFAILMCKHKVICLVGAQKASLHPDDMLLLSNFVMASESFRTSESFSPICLPRYN 420

Query: 421 -----------IRIETVLLKSNVLSEVQRSMLD------------------------GGM 480
                      + ++T L+     S+    + D                        GGM
Sbjct: 421 PMAFLYAYVRYLDVDTYLMLLTTSSDAFYHLKDCRIRIESVLLKSNVLSEIQRSMLDGGM 480

Query: 481 HVEDVPVDSLPRYRTISPHLGQQRVPSEFTERFKESSAGMGGPGGLWHFIYRSIYLDQYV 540
            VED+P+D LPR  + SPHL Q  VP++  +RFKE   G+GGP GLWHFIYRSI+LDQYV
Sbjct: 481 RVEDLPLDPLPRSGSFSPHLVQHTVPTDSPDRFKEPYIGVGGPAGLWHFIYRSIFLDQYV 540

Query: 541 ASEFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYA 591
           +SEFS PISS +QQKRLYRAYQ +Y SMH+K IGPHKTQFRRDENYVLLCW TQDFELYA
Sbjct: 541 SSEFSPPISSPRQQKRLYRAYQKLYASMHNKGIGPHKTQFRRDENYVLLCWATQDFELYA 600

BLAST of Cp4.1LG20g07410 vs. NCBI nr
Match: gi|658028642|ref|XP_008349752.1| (PREDICTED: protein SAND-like isoform X2 [Malus domestica])

HSP 1 Score: 743.0 bits (1917), Expect = 3.9e-211
Identity = 406/638 (63.64%), Positives = 455/638 (71.32%), Query Frame = 1

Query: 1   MSSGLSSLSSSDELDNFNPRTSPTTPPKPLEEELASLALTLPPPELLSDQEDVNVVSDGS 60
           MSS   + +S DE   + P  +P   PKP E++LA+LAL+             N V+ GS
Sbjct: 1   MSSETGTSTSGDEA--WEPNPNPI--PKPPEDQLAALALS-------EADSHANGVAHGS 60

Query: 61  AADGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRTNSE 120
           A       I+  +EE R   V           A VE A+    R GI G G+VW RTNSE
Sbjct: 61  AEGNNHQEIENEEEEVRANSVAP-------GLAEVEEASEGLPRGGIGG-GIVWARTNSE 120

Query: 121 IEVDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVHSWVPG 180
           +EVD P SPSSSGYAGERGSS ASSG     G   DEI E+ +D      S+S   WVPG
Sbjct: 121 LEVDGPSSPSSSGYAGERGSSSASSGA----GSGVDEILEVRNDEIADGFSDSXTPWVPG 180

Query: 181 KRHGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAIISFV 240
           KRH DE      DDASISWRKRKKHFF+LSHSGKPIYSRYGDEHKLAGFSA+LQAIISFV
Sbjct: 181 KRHVDE------DDASISWRKRKKHFFILSHSGKPIYSRYGDEHKLAGFSATLQAIISFV 240

Query: 241 EDGGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVN 300
           E+GGD VK VRAGKH V+FLVKGPIYLVCISCTEEPYESLR QLELIYGQM+LILTKSVN
Sbjct: 241 ENGGDHVKLVRAGKHQVIFLVKGPIYLVCISCTEEPYESLRVQLELIYGQMLLILTKSVN 300

Query: 301 RCFERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAILQDV 360
           RCFE+NPKFDMT LLGGTD+VF SLIHSF WNPATFLHAYTCLPLAY TRQAAGAIL DV
Sbjct: 301 RCFEKNPKFDMTPLLGGTDIVFXSLIHSFSWNPATFLHAYTCLPLAYATRQAAGAILHDV 360

Query: 361 ADSGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR-------------- 420
           ADSG+LF ILMCKHKVISLVGAQKASLHPDDMLLL+NFVM+SESFR              
Sbjct: 361 ADSGVLFTILMCKHKVISLVGAQKASLHPDDMLLLSNFVMASESFRTSESFSPICLPRYN 420

Query: 421 -----------IRIETVLLKSNVLSEVQRSMLD------------------------GGM 480
                      + ++T L+     S+    + D                        GGM
Sbjct: 421 PMAFLYAYVHFLDVDTYLMLLTTSSDAFYHLKDCRIRIESVLLKSSVLSEIQRSTVDGGM 480

Query: 481 HVEDVPVDSLPRYRTISPHLGQQRVPSEFTERFKESSAGMGGPGGLWHFIYRSIYLDQYV 540
            VE++P+D LPR  + SPHLGQ  VP++  +RF+E   G+GGP GLWHFIYRSI+LDQYV
Sbjct: 481 RVEELPLDPLPRSGSFSPHLGQHTVPTDSPDRFREPYIGVGGPAGLWHFIYRSIFLDQYV 540

Query: 541 ASEFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYA 590
           +SEFS PISS +QQKRLYRAYQ +Y SMHDK IGPHKTQFRRDENYVLLCWVTQDFELYA
Sbjct: 541 SSEFSPPISSPRQQKRLYRAYQKLYASMHDKGIGPHKTQFRRDENYVLLCWVTQDFELYA 600

BLAST of Cp4.1LG20g07410 vs. NCBI nr
Match: gi|702272682|ref|XP_010043751.1| (PREDICTED: protein SAND [Eucalyptus grandis])

HSP 1 Score: 742.7 bits (1916), Expect = 5.1e-211
Identity = 406/637 (63.74%), Positives = 456/637 (71.59%), Query Frame = 1

Query: 8   LSSSDELDNFNPRTSPTTPP-----KPLEEELASLALTLPPPELLSDQEDVNVVSDGSAA 67
           +SSSD  D+ +  +   + P     KP+E+  + L +T             ++ + G+A 
Sbjct: 1   MSSSDGGDDGSASSGAGSGPVVLGSKPIEDGFSQLRVT-------------DLANGGAAN 60

Query: 68  DGFGFGIQRSDEESRVGVVCEENVVGNSAAAVVEGATGNSVREGIDGTGVVWGRTNSEIE 127
           DG G          R G+  E   V +  AA    A G S  E I    V+W R+NSE E
Sbjct: 61  DGDGEAAAAEGSGGREGIAAEVGEVEDGGAASAR-AGGRSEIEEIGEARVMW-RSNSEGE 120

Query: 128 VDRPVSPSSSGYAGERGSSGASSGRSEMDGIADDEIQELNDDASVGDNSNSVHSWVPGKR 187
            +   SPSSSGYAGERGSS   SG  E +G  +DEI+E+  DA     S+S  +W+PGKR
Sbjct: 121 GEAQGSPSSSGYAGERGSSSGGSGIGEEEGEEEDEIEEVRSDAV----SDSQAAWMPGKR 180

Query: 188 HGDELILKVQDDASISWRKRKKHFFVLSHSGKPIYSRYGDEHKLAGFSASLQAIISFVED 247
           H DE      DDASISWRKRKKHFF+LSHSGKPIYSRYGDEHKLAGFSA+LQAIISFVE+
Sbjct: 181 HVDE------DDASISWRKRKKHFFILSHSGKPIYSRYGDEHKLAGFSATLQAIISFVEN 240

Query: 248 GGDRVKWVRAGKHLVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVNRC 307
           GGDRVK VRAGKH VVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVNRC
Sbjct: 241 GGDRVKLVRAGKHQVVFLVKGPIYLVCISCTEEPYESLRGQLELIYGQMILILTKSVNRC 300

Query: 308 FERNPKFDMTSLLGGTDVVFSSLIHSFGWNPATFLHAYTCLPLAYGTRQAAGAILQDVAD 367
           FE+NPKFDMT LLGGTDVVFSSLIHSF WNPATFLHAYTCLPL Y TRQAAGA+LQDVAD
Sbjct: 301 FEKNPKFDMTPLLGGTDVVFSSLIHSFSWNPATFLHAYTCLPLGYATRQAAGAVLQDVAD 360

Query: 368 SGILFAILMCKHKVISLVGAQKASLHPDDMLLLANFVMSSESFR---------------- 427
           SG+LFAILMCKHKV+SLVGAQKASLHPDDMLLLANFVMSSESFR                
Sbjct: 361 SGVLFAILMCKHKVVSLVGAQKASLHPDDMLLLANFVMSSESFRTSENFSPICLPRYNPM 420

Query: 428 ---------IRIETVLLKSN------------------------VLSEVQRSMLDGGMHV 487
                    + +ET L+                           VLSEVQRSMLDGGM V
Sbjct: 421 AFLYAYVHYLDVETYLMLLTTSSDAFYHLKDCRMRIETVLLKSNVLSEVQRSMLDGGMRV 480

Query: 488 EDVPVDSLPRYRTISPHLGQQRVPSEFTERFKESSAGMGGPGGLWHFIYRSIYLDQYVAS 547
           ED+P+D LPR  ++ P LGQ R  ++  E  KE+ AG+GGP GLWHFIYRSIYLDQYVAS
Sbjct: 481 EDLPIDPLPRSGSLLPRLGQGRPVTDSQESLKEAYAGIGGPAGLWHFIYRSIYLDQYVAS 540

Query: 548 EFSSPISSRQQQKRLYRAYQNIYDSMHDKEIGPHKTQFRRDENYVLLCWVTQDFELYAAF 591
           EFS P ++ +QQKRLYRAYQ +Y SMHDK IG HKTQFRRDE+YVLLCWVTQDFELYAAF
Sbjct: 541 EFSPPFNTLRQQKRLYRAYQKMYASMHDKGIGAHKTQFRRDEHYVLLCWVTQDFELYAAF 600
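
The identity and positives percentages in each HSP header above are the matched-column counts divided by the alignment length. The following is a minimal sketch for extracting and recomputing them, assuming the header layout shown on this page; the function name `parse_hsp_stats` and the returned field names are hypothetical, chosen only for illustration.

```python
import re

# Matches the statistics line of an HSP header as it appears on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling variant.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one header line, or None."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identical": ident,
        "positive": pos,
        "alignment_length": ident_len,
        # Recompute the percentages rather than trusting the printed values.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }


stats = parse_hsp_stats(
    "Identity = 405/637 (63.58%), Postives = 456/637 (71.59%), Query Frame = 1"
)
print(stats)
```

Recomputing the percentages from the counts gives a quick consistency check on each header line.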

The following BLAST results are available for this feature:
Match Name                        E-value   Identity (%)  Description
MON1A_HUMAN                       3.8e-49   29.51         Vacuolar fusion protein MON1 homolog A OS=Homo sapiens GN=MON1A PE=1 SV=2
MON1A_MOUSE                       3.8e-49   29.88         Vacuolar fusion protein MON1 homolog A OS=Mus musculus GN=Mon1a PE=1 SV=3
MON1A_CHICK                       2.5e-48   28.39         Vacuolar fusion protein MON1 homolog A OS=Gallus gallus GN=MON1A PE=2 SV=1
MON1A_BOVIN                       4.0e-46   26.44         Vacuolar fusion protein MON1 homolog A OS=Bos taurus GN=MON1A PE=2 SV=1
MON1A_MACFA                       5.2e-46   26.88         Vacuolar fusion protein MON1 homolog A OS=Macaca fascicularis GN=MON1A PE=2 SV=2

Match Name                        E-value   Identity (%)  Description
A0A059D684_EUCGR                  7.9e-211  63.58         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B02502 PE=4 SV=1
D7SKY8_VITVI                      8.2e-208  63.30         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g02820 PE=4 SV=...
W9RV66_9ROSA                      2.4e-207  66.50         Uncharacterized protein OS=Morus notabilis GN=L484_007648 PE=4 SV=1
A0A0B0MFR9_GOSAR                  1.9e-204  70.57         Protein SAND OS=Gossypium arboreum GN=F383_15283 PE=4 SV=1
A0A067L9X7_JATCU                  2.7e-203  61.82         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_05515 PE=4 SV=1

Match Name                        E-value   Identity (%)  Description
AT2G28390.1                       1.2e-120  58.64         SAND family protein

Match Name                        E-value   Identity (%)  Description
gi|658028640|ref|XP_008349751.1|  2.1e-212  63.69         PREDICTED: protein SAND-like isoform X1 [Malus domestica]
gi|657986383|ref|XP_008385318.1|  2.7e-212  63.69         PREDICTED: protein SAND isoform X1 [Malus domestica]
gi|658039226|ref|XP_008355184.1|  6.0e-212  63.85         PREDICTED: protein SAND-like [Malus domestica]
gi|658028642|ref|XP_008349752.1|  3.9e-211  63.64         PREDICTED: protein SAND-like isoform X2 [Malus domestica]
gi|702272682|ref|XP_010043751.1|  5.1e-211  63.74         PREDICTED: protein SAND [Eucalyptus grandis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR004353  Mon1
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006816      calcium ion transport
biological_process  GO:0006882      cellular zinc ion homeostasis
biological_process  GO:0009624      response to nematode
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005623      cell
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g07410.1  Cp4.1LG20g07410.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description               Source   Source Term    Source Description    Alignment
IPR004353  Vacuolar fusion protein MON1  PRINTS   PR01546        YEAST73DUF            coord: 539..553, score: 1.3E-43
                                                                                       coord: 222..242, score: 1.3E-43
                                                                                       coord: 307..319, score: 1.3E-43
                                                                                       coord: 256..270, score: 1.3E-43
                                                                                       coord: 336..357, score: 1.3E-43
                                                                                       coord: 360..373, score: 1.3E-43
                                                                                       coord: 204..221, score: 1.3E-43
                                                                                       coord: 555..568, score: 1.3E-43
                                                                                       coord: 384..400, score: 1.3E-43
                                                                                       coord: 568..588, score: 1.3
IPR004353  Vacuolar fusion protein MON1  PANTHER  PTHR13027      SAND PROTEIN-RELATED  coord: 473..590, score: 1.4E-221
                                                                                       coord: 116..436, score: 1.4E
IPR004353  Vacuolar fusion protein MON1  PFAM     PF03164        Mon1                  coord: 198..406, score: 2.6E-76
                                                                                       coord: 406..584, score: 1.0
None       No IPR available              PANTHER  PTHR13027:SF7  PROTEIN SAND-1        coord: 473..590, score: 1.4E-221
                                                                                       coord: 116..436, score: 1.4E