Cp4.1LG01g05840 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG01g05840
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionF23A5.27 isoform 5
LocationCp4.1LG01 : 432848 .. 442513 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTTTTTATTTTTTTTTTTTTTTAAATATTTTGATGGACGTTTTCCTTAATTTTTTTCAGTACATGCAAGAGTGCTACGTTTTCGTTTCTTTCCCGCGCGTGGTTTCTGGAAGTTCCGCCAAACCTTGGATTTGGTTCTGCTGTCTGGCTTCGATTCATGACACTGTCGTCCGTTAGGGCTCAAACGAGTTCATATTTGAGCCACGCGGCTCTTTTCTCCCAATGCACCAATGCTCGAATCTCCAATCACCGGCATGTGCAACGATCATTCGCGTTTAGGGTTTTGTTGAGGCTCTGCTTCAGTTGATCTTTCTGTAGAGGTACGCCTTCTCCTCCGACAACTTCGTCGAATTTAAGTTTAGTGGTGTTGGGCTCTGATTTATGCCTATTTTAGGGAAATTCAGCTAAATTGAGGTCTTTTCGCAAGAATCACTAATTGGTATTTTTCTCCTTTCTGAAGGTTGTTGAAAAGTAAATAATGGGCCTGTCAACTGCTACTACTGCCGTTTCCGAGGCCATTAAACTCTGCGTATTTGATTCAAGGAGAGGACAACAGGAGGGTCAAGAGCTGGATAAGATATTGTTTTTCTATCCAGCCGATTTACCCTTTACAAAACAATTGTATATTATTGGGCTCAGTGAAGGACTTGTCACATTTACAAGGTCAGTATAATTTTTTCCCCCTTTTTTCAATCGGTAACCACCGTGCAGAAACATTTGAAGAGTAGATATAAGAAATAAGAATGACAGAAATTGCTCATTGAACATCGAGAACAGGTTGTGAAATAATCGAATAGACATTTTCTCTAATAAAACTTCTTGCTTTGTTTATATTGTTAATCTGTCACATTTTTGCAACTTTTAAATTTTGTTGTTTTCAATTTTATTTGTACTCGTATGTTGTAATATATGTATGAAAGACAAACGAACCTATTATTCATGCATATTTTTCTTTTGGGTTGAAAATAATCCTCACCAAACAAGGTCCTTGTGCACTCCTCCAGTTACGTTCTTTAGTAAAATATTTAGAGAGTTACTCAGGCCTGCTTCTATTGTAGTTTTAATGAGATAACAAAGTAAAACCTAGTCGAACTGAGTTATGTTCAAAGATTCCTTAAATATATTTACATTTTCTTGTTTGTTGTTGGTAGCTAAATGAAAAATGAAAACTTAATGCAGAACATTCTCCCCAGAGGCAGCTTGTGAGGTCATTGAAGCAGAGAAGCACTCTCATGTATTTTTTGAGGCGGAGCCAGATATATGGATGGTTTTGGTACAGTTTTGTTTGAAATCGTGTATCAGTAAATGGTAGTAGTATTATTTTGTTTGGGGATCGCATTCATATGGTCTCATATTTTATGTTTATTATCCATGCAAACTTTGATGGAAACATTTTTTTATCTGGTTCAATAATTTCTTTACACATTACTTTAAAACATCGACTACTTTATTTTGGTTTTTTCCCTCCATTCTTTACTGCTAATATTACTTGTTAAATGTCCTTATTTTTACAAAATAACCAGTGTTTTATCCTAGAATTCAGATTTACTTTCAGCCTTGTGGCATTAGAGTTTTGTTTTTAATACTTCTTGATTATTTATGGTGACATCTCTGTTTGTTTTGACATATATTCAAATGAAACGTTCAACGTTCACATTTTACTCTAGGCTTCTTAGGGTTAAAGGAAGTTCAATATGGTAGAATTAAAGGCTTACCACTTATCAAGTTTCTTTTTGAGAAAAGAAAAAGTACCTTACCCTTTTTCTTTGGTCTCTTTTATTTCAACCATGAATGTAGTTGCAACTGATCTGTGCACTGTTTCCAGTCATGCTTGGTAGTTCTCTAAAAAAAACATTCTCAAGCTTTATTTTCTCTGTCTTTCTAACGAGTTATCTATGCTGGGGTAGGTAGTGGAGAAAAACAAGGAGGTACAAGCAACTTGGCGGATGGATGCATTGCAGAAGCTTCTGAAGGAAATTCACTCTCTTTTTCTAATGTTTCACGGTCCTATAAGGTTATTACTTGAGAAAGAACCCACTGGAGAAGTTTCCAGATCACATTTGTATTCCTTCATCATGGATTATTTGAGTGGTAAGTCATTTACAGCCACTTTTCATCTAAATTACCTTTTCATGAAGTCAAGTATATGTTACCGTTGTGTCAAAACGTTGCATTATAAGTTACTAATTGTCAGCATGTCAAAAACGGTCTGCATTGGATAGGTGCTGCTGTGGTAATATTGACGTTGTTGAATGTCAGATTGAGATTGAAGTTACTGTATCTCATGCCTAACTCTTGCATCTCTTTTGATGCAGATTTTCTTGTTGGGAAGAAATTGCAGTTGCCATCATTTCTTGACTGTTTAAAGGAACGTGGAACTGTACATATGCTGACCATAGGGCGGGATGCTGCCATTGAAGTTCAGGTTTGTCCTTGTAAAGAACTTCAAGTACTTGTATGCGTGCCATGCACTTAATTATTATTATTTTTATTCCCCTCCCCCCCGTTTTCTCTTTTTATTAAGAAACAGAGAATCAGAAACAAAAGAATATAGTGATTCTCTGTTTTTTCGCTAACAAGAAACAGAGATGATGTATATTAATACTCAGGAGGAAGAAACAAGAGCAGAGTCTCCCAATCATTAATAATCTTTAGGAGGCTGTAGTTGAAAAAGAATTCCTTGCGGTTGGAGCACCAATGAGATGATATATGTGGTACATTGTTACAAAAAATCCCTCCAAAACCACCTCGTCTTCAAAAGTCCAATTATTTCACTTTGAGTCATAGAGCTTTAAGGCTTTAGATCAATCTGTTTTAATTCCTCTGAAACCACCACAATGTTTGCTGTACATTGTTAAAAAGGAAGGAGCGCTCATGAGAGATCTACATTGTGCCTTTCCTTGGGTTACTTTTGGGATCTTTTTATGTTATTTCAGTTAGAGTTGTAGAGCTTTAAGTCTTGATATCAATCTGTCTTTAAATCTAAATATGAATTGTTGACAATGAATTTGAATTTTAAGAACTGTCAACTTAACGTTATGGAGAAACTATTTGGTTTCAATGCTACCCTGATACGTCAGGAAAAAAGAAAGTAGCTTAGAGTTGGAGCGTCGTTGGCATTTTTAATTTTGTTCATCGGAGTTAATTGCATTATATACTTTTGTGCAGGCACTTGTTAGAACACTTGATTCTTGCATTGGAAACGCATCATGCTGCTCTCTGATTTTATTTCAGGACTTACTGGTGTCTACATCGCTTTCACCTGTATGGTTTGATCTCCAGTATTTAATGTTCATCATAACTTATTGTACTATTCAATTTTCATATTATAAAACCTAAAGCACTTGGGGATGCTAAAGTCCTAATACATATCCATAACTGAATTTTATTTTGAAATATAGTATAGATCATTCAAAATTTTAAAGCTGTTCATTTCAAGCTCTTTTTCAGGAGGATACTACAAATTTATTTTCATATGCTGTTCTGAGGTTAACTCCAAGTGTCTTATCTTCTAGCGCAAGTTCTTGGTCCTATTTAATTAGAGGCAATACTGTTTCTCATGTCACCCAACATGGAGGAAATGTTGGTAATCATGTCATACGACCCTTGCAGCATGGAAAATGGTCAAAAGGAAAGGATGGTTTTCTAGAAACTGATATTTGGGGTATGGAGGCAAGTGGCCGGGTCAATTCTACCCCAAAAATCTGGCCTTTTCAAACAGAGAAGCAAATGTACTTGTATGTACATCAGCATAAAAGCCTGACTCTGATTCTTCTTGTTCCTGCTTCATCAATACCTAATGGGGAACAAGGCATTTCAATCATCAGGCAGTATTTTCTTGAAAATGTTAGTTTCTCATCTCTTCTAATTGATTTTATTATCTTGCTACCATTTTTTTTTCTAAATGATTCTCTCCCTATGTGGATATAGTTGTTGCTTGTGAAATGGTGATTTATGCCAAAGCAGTACATGTTGGACAGATTGAAAGGGAAGAATAAGAACAGTACGAGGACAAGAGAAGAGAGGAAATATCATTTATTTACATTGGGAGAAGAATGTAAAAAATTGGACAATAAATAGAGTTTAGAGGAAGAGAGGAGCACAGGGAGAAAATATAGGGTTCCAATATTTATTGATTCTCAATTATCATCTCGTTTTTGTCTTTTAATGGAGTCTAAACCAAGTTAAGGAAACTTAGAAGAATAAAATTTACCAAAATAAATGTAAAAACGTAGTCAGCATCGTGATAGGTTTAAACTGATTTATGAATCTTCTGTTTGAATTCCTTCCACTCATTGGATGCATTTCTTGCTATCTTCTAGGCTTCCCTGAAGATCATGACCGTTGAAGAAAAATTGTCAAAAGGCTGGGGTGGGGAAAATGCGTATCATGTTGGTGGGTATCGTTATTTACTGATAGATGGTGATAGACAAATATCCAGGGCCTCACCTCCAGGAAAAGTTACCACCCTCACAAAGGAATCTTTACTTGCCATGAGCAAACTCAGAGAAAAGGTTGATCTGGAGAAGAGTGGAGCAAAACAGGATGGTGTTGGTGAGGAATAGGAGCTGGAAGTAACTATTAGAGCAAAAAACAACGCATGGGTTATTTCTCGAATCACAAGAAGGAAAGAGCTTTATATGGTTCTGGAGAAAGCCAACGATACTGTACTCTATGCCTCTGATATGGTGGAGAAATTCAGCAACAGGTATTGTTTTATGTTTGATTTTAATTTTAATTTTCAAGGTGGATGCCTAAAATACATGACATTATGTATGATGAATGATACTAGTAGTCAACTACCATGGATTTATATTTGGGTTAAAAGGCATGTAAATAATAAAGGATTAGAGGAGATGGGTTTGGTTACCTACTTAGGTTTTAATATTACGAGTTACCTTAACAACCAAATGTATTAGGAATGTAGAATAAGACAATCGAGTCATGAGAATAATTGAAGTCATTTTCTATTGATTGACGGACGATGAAAGGGCCCTTATTTTACTAAGAGAAATGGAATATGTCATAAGATTGATGAAAATACTGGAGCCACCCCTAAGGCAAACGAGCTTGGTCAGAACATTGCAAAGAGAGTAGTTACATAAAAGTTTACTTAGCCTATGTCAAGAAGTTGCACAAAGAGGATTTTTGCCAAAAAAGATGAAAAATCTCTTTATCTTTGCCTACAAACTGAGAAGTTATTTTGTATTTGGGCTCTACAGGTACTGCAATGGAGCATTCTCCTTGGATTAGAGGCACGTGCTACCAGTGTCGTGTATACAAACCTGTATAACGAAGAGCAGAGGTTAGAGGGATGTGCGGAGTTCCTAGTTTTTCTGCTTCGTAAATGTAATCTCCCTTCTGTAAATTCCTATTAACAATAAGATGAAAATCCTCTCTCATTTTCTGTAGAGAAGAGGAAAATGTGAAATGGACATCGGCCTTCAATTATGGAGAGAAAGAACGGCTAAGTTAAAAGCATGCATTTAGTCCAATTCATTTATTTTAGGAGCTGTTTAAAACTCGAGTTTCTTCCTTTTTGGTGAACTTTCTAAAGAAAATTGTATATTTCTTTTAATACTAAAAAATTGTGAAGGATCTAAACGAGAACTCAAAGGATTTACCATTATTTTCGCTTTGGATATTGTATTCCCGGTCTTAAAAGGCGACGTGACTGTGCTACTCACTTGACCACCCATATTATGGCATTTTGCGATGTGCCAATTTTGAAGTCCCATGAGGTTCATGACTTAGGTGATGTGATGTACAATTTTTCAGTAACATTTCGGATTGTATATTTCATTTTCTGAGGATCTAGTTGCTATAGTCTACAAAATATAGATATGATCTTTAGTGTTGAAAGCCTTATACCAAAATATAGGAATGAATACGTAAACGAAACCATCGACAAAGAATTTCTATATTTATCATTTTTCAAGCAAAGTTTTAACTAATAACTTAAAAAGCGATCAAAACATTACAAATATTCAAAGAATAAAGAAAATGAACAAATATCGTGTGAGATATGTGTGCATGAGCACGCACATGCAAGAGAGATAATGAACATCAAATAAATTTAATATCTAGTGATCATAACAGAAGAGTGTCTGCCAAGCTATACAGAAAAACATATGGCAAAGGCAGGCAAGAAGCAGTTGAGAAACCACTTCTTCAAAATTGATAATATATATATATATATATAGACTGTGTGTGCATTAAAATTGTAACAATGCCCACAAGATATAATGCTTGCAAGTTGAAACAAAAACATTACCTTGAGGTTATCACCAACAAAAAGATGGCCGAAATTCCTAAACCTGTTTGAAATTCAATACTCTGCTCAAGACATTGAAATAGCAGCAGCAGCCTTATCACCATAACTTCATAGCCAACCAATGGGTACTTGGTATGGATGGTGAAGACAAAAGAAACTTATACTGATCTTTGAGATTTAAATCCATGAAGAAACATAAATCCGCCTAAAAAGCATCTAATTCTTTTAATGGCCTTGCATCTTCGGCTGTGTTTCATATAGGATTTCTTGATCAGATGCCATAAGTTCCATAAGATTAACAGTATTTGATTTCTCAATATACGATTCATCCAACCTTGGTTGCAACCCGAAATTTTCACCTTTCTCCATTTGTTTGATTAGAAGTTCGAAATCTTGCAGATTCTCTTCATGAGTTCGAGTATCAATTGTTTCCATATCCAGTTCATGGTCCTCATGGTCCATAGGCTCCCGTTCGTCGTCAACTTTTCTAACATTATCTGAATTACAAGAAACAGTGCTCGCTAAAAGAAGCTGCTCCCATGCGACATCCTGTACATCTGGTAGAACAAATTGGCTGTGATTGTCTAACTTTTCATCCATAAGTACTTTCATGAAGTCGGAATTAAGGAAAAAGTCCTTCATTCCATCAGGGAATGGATCAGATTCTTGTTGTTTCCCCAGACCCGACACTGGAGGAAGTAGAGGTGCTGGCAGTTCATCTATTGGACGTTGATACCTGACTATCAAACCATTTGAAGGTACTTGATTATCATCTGGAATTTGTTCTAACATATTGCCAGCATCAGCCATGCGCCAATTCTTCTCTTTTGGTTGAAGAAACTGAACCAAAAATCCAGGGCTTTGCACTGCCATTACTAAGAATGATAGCATCTGCTGTTGATTCTTTTCCATACCTTGAAGGCGGTCTCTCAATAGGAGCATCTTGTTCTCTGAAGTTTCCTGCTGCTGCCTAAGTTTAACCAATTCCTGCATTACAGCATTTTTATCTATTTTAAGATTCTCAACTTCCTTCCACAGCCCTGTATAATCATGTAGTTCACCTTGTCCCTCAGAATTGTCTTGGGGCTGTGACGCTTTACGCTGATCTGTGCCCTGAATGTTTTTCCTTCGGTAGATGTTCTTCAACAGATGTTTTTGTCCTTTAACGAATCCATCAGTTGCAAATTCCCAGCAATCTGTCTCAATTTTCCGAAAACCCTGAAATATAAGACGCCATTTGTTATATAACTAAGATGTGACAATTCAAAACTAAATGGAGAAAATGTAGTTGTTTCTCCTCTAGTTCGCCATTATGTTGAACAGTTTTTCCCTTTTCCACAAGGGCCTAAAAGTAAAAACATGCATTACGAACCACAACTAACTAGCTTGACTGAAAAGATTTCTCAATGATAATAGCATCAGGAACAAAACACCCATTCTAAAAAATAAACAAATAAATAAATAAAAGAGAGAAGAAATAAACAAGGAAAGAAAATGACACTTGGAGTCTATTTCTGCAGACTGCATTATGACTTAATCAAGAGTCGATAAGAAGTATTTGTAGAGAATATACCAGAAAGGGGTAAGAGAAAAGGCAATAGACAAAGGAAATATAATTTAAAGTAGGATAAAAAGTGAGATGTTTGTGAAATTTCTCCGTCGCCCCAACCCCAACCACTCAATGATCAACCTGCTCGGGGGAATGAATTTAGAAGAACAAAAATTCACTTGTTTAATTGGTCTTCGAGGACTTTATCAGAAAACCTAAGAGCTGCGTAAGGCATGCGTCCAAGTAGTGACAAGATAGAGCCATTTCTCTCTTCATCCCTATTTCAGAATGCCAAATACTAAAATCACATGTACGTTGGTGGGGTTAAATAGATCGTGTGGCAATGTTAGGATTTGGCGAACTTCAATGCTTCCTCTTCAGGCTCCTAATTCAAATTTGTTTTATAATTATTCTTCACTTCTGATTAACAAAACCCGATTAGTTCATTTCGCATGGCAAATGCCAAAATAGAACTTCTGATTAGTTTGTCTTTGTCTGGCAAGCCTTGGGGCTCATCCATGAACCAGCACCTAAGAGTGAAACACAAGAAGAATTTAAAAATGTATACTATGGGTAGTGCAACAAGTTTGGGATTCAGAAAATTCTCCTCAAGTGTTCAGCCAATCAATACAAGCAAAAGGATGATCTATCCTATTTTGATGGCCACCATCACATAGAAGATTTTCTTGAATGTACATGCAGTAGAAAATTATTGCTCGGGAATATGAACATACTAAAAGAAAAAAAAAAAAAAAGAGTTTAAATGGTTGCTCAGAAATTGTGGGTGGAGGAGACCAGCAGCCTAGTGAGCAATTTCAACTGAATAGAACTCATGAAGGAAGGAAAACCGATACACTTCTAGTTGAGGATGAAACATCAACTCAGAGCCAGGGTTCTTCCTCTACCAGAAAATTTGAATAAAAAAATTCTGAAATACCCTCAACGATATACCGAACATAATCAGACAGTAAATAGTATGTTGAAGAGTGGTTTTATCCAAGCGGTTCATTGCATTAGCAACCCTGCAAGAATCCCTTAGGAATCATTTTAATCATCCTAGAAAATGAGCTTCTTTCAATGACTACAAACCTTGCCTAATGACCCAACGCACAATGCCCTTTCTGTGGCCAAAGACAAAAACAACTTGGTGAGCATTTATGACAATGGAAAAACCCCAATACGAAACTCCTGCCCAATGGTACTCCACCCAAATCCCCTTTTGACAATAGCCAACCTTCAAAACCTCCATGGACGCCCCTTCAATCCAGTAGCATCATCACCCACGTGCTAGTCCCAACTCCTAATGACAGTACCAACCACAAGCAACCTACGCTAGTAGATGGAAATAAAGAGTCCTTTGAACTTCCAACTAATACTTTTGATGAAAAGAATTAGAATATGATGATAACGAATTTGAAGAAGAGGAAGCTGGTAACTATGTCATGATGTATGGTGCAGCTCCTCTTATTAGGCCCTGAGCAAGCTGGTTCTAATAAATAGTATATGTTATTTGGCATTTACTAAAAAAAGGATTTGCAATGTTTTTGTTGATAACACAGTACTGAACGTGTTGTTGAAAAAGAAAGGGCTAAGGTTCCCCCTGTGTGGAGGCAAAATTCTCCTCTGAGCTTACCATTAACCAAAACAGAGTAATTAAGCGACCTAACATAATTTCAATCAAGTCCCCATCGAAGCCTTTTCCAAGTCAATCTTGAATATAACTCCTCCCAGTTCGTGAATACCTATAATCCTCAATTGCGTTATTACCAATTAGGGCAACTAGAAAGAAGATGAGTCCACGACAGAGCAACTTTCCCATTTAATCAGAACTCAAGAACAACAAA

mRNA sequence

TTTTTTATTTTTTTTTTTTTTTAAATATTTTGATGGACGTTTTCCTTAATTTTTTTCAGTACATGCAAGAGTGCTACGTTTTCGTTTCTTTCCCGCGCGTGGTTTCTGGAAGTTCCGCCAAACCTTGGATTTGGTTCTGCTGTCTGGCTTCGATTCATGACACTGTCGTCCGTTAGGGCTCAAACGAGTTCATATTTGAGCCACGCGGCTCTTTTCTCCCAATGCACCAATGCTCGAATCTCCAATCACCGGCATGTGCAACGATCATTCGCGTTTAGGGTTTTGTTGAGGCTCTGCTTCAGTTGATCTTTCTGTAGAGGTTGTTGAAAAGTAAATAATGGGCCTGTCAACTGCTACTACTGCCGTTTCCGAGGCCATTAAACTCTGCGTATTTGATTCAAGGAGAGGACAACAGGAGGGTCAAGAGCTGGATAAGATATTGTTTTTCTATCCAGCCGATTTACCCTTTACAAAACAATTGTATATTATTGGGCTCAGTGAAGGACTTGTCACATTTACAAGAACATTCTCCCCAGAGGCAGCTTGTGAGGTCATTGAAGCAGAGAAGCACTCTCATGTATTTTTTGAGGCGGAGCCAGATATATGGATGGTTTTGGTAGTGGAGAAAAACAAGGAGGTACAAGCAACTTGGCGGATGGATGCATTGCAGAAGCTTCTGAAGGAAATTCACTCTCTTTTTCTAATGTTTCACGGTCCTATAAGGTTATTACTTGAGAAAGAACCCACTGGAGAAGTTTCCAGATCACATTTGTATTCCTTCATCATGGATTATTTGAGTGATTTTCTTGTTGGGAAGAAATTGCAGTTGCCATCATTTCTTGACTGTTTAAAGGAACGTGGAACTGTACATATGCTGACCATAGGGCGGGATGCTGCCATTGAAGTTCAGGCACTTGTTAGAACACTTGATTCTTGCATTGGAAACGCATCATGCTGCTCTCTGATTTTATTTCAGGACTTACTGGTGTCTACATCGCTTTCACCTGAGGATACTACAAATTTATTTTCATATGCTGTTCTGAGGTTAACTCCAAGTGTCTTATCTTCTAGCGCAAGTTCTTGGTCCTATTTAATTAGAGGCAATACTGTTTCTCATGTCACCCAACATGGAGGAAATGTTGGTAATCATGTCATACGACCCTTGCAGCATGGAAAATGGTCAAAAGGAAAGGATGGTTTTCTAGAAACTGATATTTGGGGTATGGAGGCAAGTGGCCGGGTCAATTCTACCCCAAAAATCTGGCCTTTTCAAACAGAGAAGCAAATGTACTTGTATGTACATCAGCATAAAAGCCTGACTCTGATTCTTCTTGTTCCTGCTTCATCAATACCTAATGGGGAACAAGGCATTTCAATCATCAGGCAGTATTTTCTTGAAAATGCTTCCCTGAAGATCATGACCGTTGAAGAAAAATTGTCAAAAGGCTGGGGTGGGGAAAATGCGTATCATGTTGGTGGGTATCGTTATTTACTGATAGATGGTGATAGACAAATATCCAGGGCCTCACCTCCAGGAAAAGTTACCACCCTCACAAAGGAATCTTTACTTGCCATGAGCAAACTCAGAGAAAAGGTTGATCTGGAGAAGAGTGGAGCAAAACAGGATGGTGTTGGTGAGGAATAGGAGCTGGAAGTAACTATTAGAGCAAAAAACAACGCATGGGTTATTTCTCGAATCACAAGAAGGAAAGAGCTTTATATGGTTCTGGAGAAAGCCAACGATACTGTACTCTATGCCTCTGATATGGTGGAGAAATTCAGCAACAGGTACTGCAATGGAGCATTCTCCTTGGATTAGAGGCACGTGCTACCAGTGTCGTGTATACAAACCTGTATAACGAAGAGCAGAGGTTAGAGGGATGTGCGGAGTTCCTAGTTTTTCTGCTTCGTAAATGTAATCTCCCTTCTGTAAATTCCTATTAACAATAAGATGAAAATCCTCTCTCATTTTCTGTAGAGAAGAGGAAAATGTGAAATGGACATCGGCCTTCAATTATGGAGAGAAAGAACGGCTAAGTTAAAAGCATGCATTTAGTCCAATTCATTTATTTTAGGAGCTGTTTAAAACTCGAGTTTCTTCCTTTTTGGTGAACTTTCTAAAGAAAATTGTATATTTCTTTTAATACTAAAAAATTGTGAAGGATCTAAACGAGAACTCAAAGGATTTACCATTATTTTCGCTTTGGATATTGTATTCCCGGTCTTAAAAGGCGACGTGACTGTGCTACTCACTTGACCACCCATATTATGGCATTTTGCGATGTGCCAATTTTGAAGTCCCATGAGGTTCATGACTTAGGTGATGTGATGTACAATTTTTCAGTAACATTTCGGATTGTATATTTCATTTTCTGAGGATCTAGTTGCTATAGTCTACAAAATATAGATATGATCTTTAGTGTTGAAAGCCTTATACCAAAATATAGGAATGAATACGTAAACGAAACCATCGACAAAGAATTTCTATATTTATCATTTTTCAAGCAAAGTTTTAACTAATAACTTAAAAAGCGATCAAAACATTACAAATATTCAAAGAATAAAGAAAATGAACAAATATCGTGTGAGATATGTGTGCATGAGCACGCACATGCAAGAGAGATAATGAACATCAAATAAATTTAATATCTAGTGATCATAACAGAAGAGTGTCTGCCAAGCTATACAGAAAAACATATGGCAAAGGCAGGCAAGAAGCAGTTGAGAAACCACTTCTTCAAAATTGATAATATATATATATATATATAGACTGTGTGTGCATTAAAATTGTAACAATGCCCACAAGATATAATGCTTGCAAGTTGAAACAAAAACATTACCTTGAGGTTATCACCAACAAAAAGATGGCCGAAATTCCTAAACCTGTTTGAAATTCAATACTCTGCTCAAGACATTGAAATAGCAGCAGCAGCCTTATCACCATAACTTCATAGCCAACCAATGGGTACTTGGTATGGATGGTGAAGACAAAAGAAACTTATACTGATCTTTGAGATTTAAATCCATGAAGAAACATAAATCCGCCTAAAAAGCATCTAATTCTTTTAATGGCCTTGCATCTTCGGCTGTGTTTCATATAGGATTTCTTGATCAGATGCCATAAGTTCCATAAGATTAACAGTATTTGATTTCTCAATATACGATTCATCCAACCTTGGTTGCAACCCGAAATTTTCACCTTTCTCCATTTGTTTGATTAGAAGTTCGAAATCTTGCAGATTCTCTTCATGAGTTCGAGTATCAATTGTTTCCATATCCAGTTCATGGTCCTCATGGTCCATAGGCTCCCGTTCGTCGTCAACTTTTCTAACATTATCTGAATTACAAGAAACAGTGCTCGCTAAAAGAAGCTGCTCCCATGCGACATCCTGTACATCTGGTAGAACAAATTGGCTGTGATTGTCTAACTTTTCATCCATAAGTACTTTCATGAAGTCGGAATTAAGGAAAAAGTCCTTCATTCCATCAGGGAATGGATCAGATTCTTGTTGTTTCCCCAGACCCGACACTGGAGGAAGTAGAGGTGCTGGCAGTTCATCTATTGGACGTTGATACCTGACTATCAAACCATTTGAAGGTACTTGATTATCATCTGGAATTTGTTCTAACATATTGCCAGCATCAGCCATGCGCCAATTCTTCTCTTTTGGTTGAAGAAACTGAACCAAAAATCCAGGGCTTTGCACTGCCATTACTAAGAATGATAGCATCTGCTGTTGATTCTTTTCCATACCTTGAAGGCGGTCTCTCAATAGGAGCATCTTGTTCTCTGAAGTTTCCTGCTGCTGCCTAAGTTTAACCAATTCCTGCATTACAGCATTTTTATCTATTTTAAGATTCTCAACTTCCTTCCACAGCCCTGTATAATCATGTAGTTCACCTTGTCCCTCAGAATTGTCTTGGGGCTGTGACGCTTTACGCTGATCTGTGCCCTGAATGTTTTTCCTTCGGTAGATGTTCTTCAACAGATGTTTTTGTCCTTTAACGAATCCATCAGTTGCAAATTCCCAGCAATCTGTCTCAATTTTCCGAAAACCCTGAAATATAAGACGCCATTTGTTATATAACTAAGATGTGACAATTCAAAACTAAATGGAGAAAATGTAGTTGTTTCTCCTCTAGTTCGCCATTATGTTGAACAGTTTTTCCCTTTTCCACAAGGGCCTAAAAGTAAAAACATGCATTACGAACCACAACTAACTAGCTTGACTGAAAAGATTTCTCAATGATAATAGCATCAGGAACAAAACACCCATTCTAAAAAATAAACAAATAAATAAATAAAAGAGAGAAGAAATAAACAAGGAAAGAAAATGACACTTGGAGTCTATTTCTGCAGACTGCATTATGACTTAATCAAGAGTCGATAAGAAGTATTTGTAGAGAATATACCAGAAAGGGGTAAGAGAAAAGGCAATAGACAAAGGAAATATAATTTAAAGTAGGATAAAAAGTGAGATGTTTGTGAAATTTCTCCGTCGCCCCAACCCCAACCACTCAATGATCAACCTGCTCGGGGGAATGAATTTAGAAGAACAAAAATTCACTTGTTTAATTGGTCTTCGAGGACTTTATCAGAAAACCTAAGAGCTGCGTAAGGCATGCGTCCAAGTAGTGACAAGATAGAGCCATTTCTCTCTTCATCCCTATTTCAGAATGCCAAATACTAAAATCACATGTACGTTGGTGGGGTTAAATAGATCGTGTGGCAATGTTAGGATTTGGCGAACTTCAATGCTTCCTCTTCAGGCTCCTAATTCAAATTTGTTTTATAATTATTCTTCACTTCTGATTAACAAAACCCGATTAGTTCATTTCGCATGGCAAATGCCAAAATAGAACTTCTGATTAGTTTGTCTTTGTCTGGCAAGCCTTGGGGCTCATCCATGAACCAGCACCTAAGAGTGAAACACAAGAAGAATTTAAAAATGTATACTATGGGTAGTGCAACAAGTTTGGGATTCAGAAAATTCTCCTCAAGTGTTCAGCCAATCAATACAAGCAAAAGGATGATCTATCCTATTTTGATGGCCACCATCACATAGAAGATTTTCTTGAATGTACATGCAGTAGAAAATTATTGCTCGGGAATATGAACATACTAAAAGAAAAAAAAAAAAAAAGAGTTTAAATGGTTGCTCAGAAATTGTGGGTGGAGGAGACCAGCAGCCTAGTGAGCAATTTCAACTGAATAGAACTCATGAAGGAAGGAAAACCGATACACTTCTAGTTGAGGATGAAACATCAACTCAGAGCCAGGGTTCTTCCTCTACCAGAAAATTTGAATAAAAAAATTCTGAAATACCCTCAACGATATACCGAACATAATCAGACAGTAAATAGTATGTTGAAGAGTGGTTTTATCCAAGCGGTTCATTGCATTAGCAACCCTGCAAGAATCCCTTAGGAATCATTTTAATCATCCTAGAAAATGAGCTTCTTTCAATGACTACAAACCTTGCCTAATGACCCAACGCACAATGCCCTTTCTGTGGCCAAAGACAAAAACAACTTGGTGAGCATTTATGACAATGGAAAAACCCCAATACGAAACTCCTGCCCAATGGTACTCCACCCAAATCCCCTTTTGACAATAGCCAACCTTCAAAACCTCCATGGACGCCCCTTCAATCCAGTAGCATCATCACCCACGTGCTAGTCCCAACTCCTAATGACAGTACCAACCACAAGCAACCTACGCTAGTAGATGGAAATAAAGAGTCCTTTGAACTTCCAACTAATACTTTTGATGAAAAGAATTAGAATATGATGATAACGAATTTGAAGAAGAGGAAGCTGGTAACTATGTCATGATGTATGGTGCAGCTCCTCTTATTAGGCCCTGAGCAAGCTGGTTCTAATAAATAGTATATGTTATTTGGCATTTACTAAAAAAAGGATTTGCAATGTTTTTGTTGATAACACAGTACTGAACGTGTTGTTGAAAAAGAAAGGGCTAAGGTTCCCCCTGTGTGGAGGCAAAATTCTCCTCTGAGCTTACCATTAACCAAAACAGAGTAATTAAGCGACCTAACATAATTTCAATCAAGTCCCCATCGAAGCCTTTTCCAAGTCAATCTTGAATATAACTCCTCCCAGTTCGTGAATACCTATAATCCTCAATTGCGTTATTACCAATTAGGGCAACTAGAAAGAAGATGAGTCCACGACAGAGCAACTTTCCCATTTAATCAGAACTCAAGAACAACAAA

Coding sequence (CDS)

ATGGGCCTGTCAACTGCTACTACTGCCGTTTCCGAGGCCATTAAACTCTGCGTATTTGATTCAAGGAGAGGACAACAGGAGGGTCAAGAGCTGGATAAGATATTGTTTTTCTATCCAGCCGATTTACCCTTTACAAAACAATTGTATATTATTGGGCTCAGTGAAGGACTTGTCACATTTACAAGAACATTCTCCCCAGAGGCAGCTTGTGAGGTCATTGAAGCAGAGAAGCACTCTCATGTATTTTTTGAGGCGGAGCCAGATATATGGATGGTTTTGGTAGTGGAGAAAAACAAGGAGGTACAAGCAACTTGGCGGATGGATGCATTGCAGAAGCTTCTGAAGGAAATTCACTCTCTTTTTCTAATGTTTCACGGTCCTATAAGGTTATTACTTGAGAAAGAACCCACTGGAGAAGTTTCCAGATCACATTTGTATTCCTTCATCATGGATTATTTGAGTGATTTTCTTGTTGGGAAGAAATTGCAGTTGCCATCATTTCTTGACTGTTTAAAGGAACGTGGAACTGTACATATGCTGACCATAGGGCGGGATGCTGCCATTGAAGTTCAGGCACTTGTTAGAACACTTGATTCTTGCATTGGAAACGCATCATGCTGCTCTCTGATTTTATTTCAGGACTTACTGGTGTCTACATCGCTTTCACCTGAGGATACTACAAATTTATTTTCATATGCTGTTCTGAGGTTAACTCCAAGTGTCTTATCTTCTAGCGCAAGTTCTTGGTCCTATTTAATTAGAGGCAATACTGTTTCTCATGTCACCCAACATGGAGGAAATGTTGGTAATCATGTCATACGACCCTTGCAGCATGGAAAATGGTCAAAAGGAAAGGATGGTTTTCTAGAAACTGATATTTGGGGTATGGAGGCAAGTGGCCGGGTCAATTCTACCCCAAAAATCTGGCCTTTTCAAACAGAGAAGCAAATGTACTTGTATGTACATCAGCATAAAAGCCTGACTCTGATTCTTCTTGTTCCTGCTTCATCAATACCTAATGGGGAACAAGGCATTTCAATCATCAGGCAGTATTTTCTTGAAAATGCTTCCCTGAAGATCATGACCGTTGAAGAAAAATTGTCAAAAGGCTGGGGTGGGGAAAATGCGTATCATGTTGGTGGGTATCGTTATTTACTGATAGATGGTGATAGACAAATATCCAGGGCCTCACCTCCAGGAAAAGTTACCACCCTCACAAAGGAATCTTTACTTGCCATGAGCAAACTCAGAGAAAAGGTTGATCTGGAGAAGAGTGGAGCAAAACAGGATGGTGTTGGTGAGGAATAG

Protein sequence

MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTFTRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSLFLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHMLTIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPSVLSSSASSWSYLIRGNTVSHVTQHGGNVGNHVIRPLQHGKWSKGKDGFLETDIWGMEASGRVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIPNGEQGISIIRQYFLENASLKIMTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPPGKVTTLTKESLLAMSKLREKVDLEKSGAKQDGVGEE
BLAST of Cp4.1LG01g05840 vs. Swiss-Prot
Match: CCZ1_NEMVE (Vacuolar fusion protein CCZ1 homolog OS=Nematostella vectensis GN=v1g238755 PE=3 SV=1)

HSP 1 Score: 102.4 bits (254), Expect = 1.2e-20
Identity = 68/256 (26.56%), Postives = 121/256 (47.27%), Query Frame = 1

Query: 14  IKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTFTRTFSPEAACEVI 73
           +   +F+S  G +EG+E +KI+ + P +    +++  IGL E LV FT TF+P+  CE +
Sbjct: 11  VNFFIFNSTYGPREGEEHEKIILYIPTEEDIDRKIKTIGLCEALVKFTETFAPDKPCESL 70

Query: 74  EAEKHSHVFFEAEPDIWMVLVVE--------KNKEVQATWRMD-----ALQKLLKEIHSL 133
             +K   +F++ EPD WM++ +         K+ +    +  D      L  +LK+ + +
Sbjct: 71  HTQKSRQIFYQPEPDFWMIMTISIPFSEKIAKDGKNTIEYHYDDVLDNVLDAVLKQSYKM 130

Query: 134 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 193
           F +F+GP   L E      + +   Y F + YL      + L   SF D L     +  L
Sbjct: 131 FKLFNGPFNYLSETYGREALKKRSEY-FFLSYL------QTLNFSSF-DLLDIFAGIQFL 190

Query: 194 TIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPS 253
            + ++  +++Q+ V  ++         +  L+ D LV + L  ED   L+ Y V  L P+
Sbjct: 191 PLDKNTFLKIQSFVNLIEHTFSQIK-YTAFLYSDKLVWSGLEQEDMRILYKYLVTSLFPA 250

BLAST of Cp4.1LG01g05840 vs. Swiss-Prot
Match: CCZ1_DICDI (Vacuolar fusion protein CCZ1 homolog OS=Dictyostelium discoideum GN=DDB_G0288589 PE=3 SV=1)

HSP 1 Score: 97.8 bits (242), Expect = 3.1e-19
Identity = 102/416 (24.52%), Postives = 188/416 (45.19%), Query Frame = 1

Query: 18  VFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTFTRTFSPEAACEVIEAEK 77
           ++ S+ GQ+EG E +KILFFYP  +   +Q   +G+SE  V FT+ FSP   CE I  +K
Sbjct: 15  IYCSKLGQKEGTEHEKILFFYPPTINIGEQTNSVGISEAYVLFTKQFSPGQPCEFIHTKK 74

Query: 78  HSHVFFEAEPDIWMVLVV--------EKNKEVQATWRMD--ALQKLLKEIHSLFLMFHGP 137
            +      E DIWMVL V        + NK       +D   L K +++I+  +  F+G 
Sbjct: 75  STLALLHPEEDIWMVLSVYNPTGITGKDNKREYIEDEVDDIILMKTIQQIYQTWQTFNGS 134

Query: 138 IRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHMLTIGRDAA 197
           I  L  K     V R  L SF+  Y+   +   +L L + LD +K       L + ++  
Sbjct: 135 IMSLASKTSYDNV-RKRLESFVKPYIQQ-IQFDQLDLFTSLDGIK------FLPLNKNVY 194

Query: 198 IEVQALVRTLDSCIGNASCC---SLILFQDLLVSTSLSPEDTTNLFSYAV--LRLTPSVL 257
           + +   + ++D    +        L+L++D L+ +SL   +T  L++Y +  +++ P + 
Sbjct: 195 LTIFGYINSVDLHFQSTLSSFRFGLVLYKDNLILSSLEQNETRILYNYLINMVKVGPDIN 254

Query: 258 SSSASSWSYLIRGNTVSHVTQHGGNVGNHVIRPLQHGKW-SKG-KDGFLETDIWGMEASG 317
           SS+    S +++ N+            N++  P+    W +KG + GF+           
Sbjct: 255 SSN----SMIVKNNS------------NNI--PI----WQTKGVRTGFMI---------- 314

Query: 318 RVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIPNGEQGISIIRQYFLENASLKI 377
           + +S P +W     K   + V++ K   L+ L+  S +P  +     +    ++N     
Sbjct: 315 QKDSLPMVW--LGGKPQAMIVYEQKDTFLLFLIDPSDLP--QLPFEDLSASLVQNFEFVN 374

Query: 378 MTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPPGKVTTLTKESLLAMSKL 417
           +T+E+  +K      A     Y+Y+  +      R+    K   L KE++  ++++
Sbjct: 375 LTLEQHYAK-----KANFDEQYKYIYFNQMNLAIRSPIKPKGPELNKETMKLLNEI 381

BLAST of Cp4.1LG01g05840 vs. Swiss-Prot
Match: CCZ1B_HUMAN (Vacuolar fusion protein CCZ1 homolog B OS=Homo sapiens GN=CCZ1B PE=1 SV=1)

HSP 1 Score: 82.4 bits (202), Expect = 1.3e-14
Identity = 65/261 (24.90%), Postives = 123/261 (47.13%), Query Frame = 1

Query: 18  VFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTFTRTFSPEAACEVIEAEK 77
           +++ R G +EGQE +KILF++P ++   +++  +GL E +V FTRTFSP    + +  +K
Sbjct: 29  IYNPRFGPREGQEENKILFYHPNEVEKNEKIRNVGLCEAIVQFTRTFSPSKPAKSLHTQK 88

Query: 78  HSHVFFEAEPDIWMVLVV-----EKNK-------EVQATWRMDAL-QKLLKEIHSLFLMF 137
           +   F E E + WMV+VV     EK         E Q    +D +   +L++ +S++ +F
Sbjct: 89  NRQFFNEPEENFWMVMVVRNPIIEKQSKDGKPVIEYQEEELLDKVYSSVLRQCYSMYKLF 148

Query: 138 HGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHMLTIGR 197
           +G     +E +   ++ +  L  F   YL      + L L S  D L   G +    + +
Sbjct: 149 NGTFLKAME-DGGVKLLKERLEKFFHRYL------QTLHLQS-CDLLDIFGGISFFPLDK 208

Query: 198 DAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPSVLSS 257
              +++Q+ +  ++  + N    +  L+ D L+ + L  +D   L+ Y    L P  +  
Sbjct: 209 MTYLKIQSFINRMEESL-NIVKYTAFLYNDQLIWSGLEQDDMRILYKYLTTSLFPRHIEP 268

Query: 258 SASSWSYLIRGNTVSHVTQHG 266
             +     IR     ++  +G
Sbjct: 269 ELAGRDSPIRAEMPGNLQHYG 280

BLAST of Cp4.1LG01g05840 vs. Swiss-Prot
Match: CCZ1_HUMAN (Vacuolar fusion protein CCZ1 homolog OS=Homo sapiens GN=CCZ1 PE=1 SV=1)

HSP 1 Score: 82.4 bits (202), Expect = 1.3e-14
Identity = 65/261 (24.90%), Postives = 123/261 (47.13%), Query Frame = 1

Query: 18  VFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTFTRTFSPEAACEVIEAEK 77
           +++ R G +EGQE +KILF++P ++   +++  +GL E +V FTRTFSP    + +  +K
Sbjct: 29  IYNPRFGPREGQEENKILFYHPNEVEKNEKIRNVGLCEAIVQFTRTFSPSKPAKSLHTQK 88

Query: 78  HSHVFFEAEPDIWMVLVV-----EKNK-------EVQATWRMDAL-QKLLKEIHSLFLMF 137
           +   F E E + WMV+VV     EK         E Q    +D +   +L++ +S++ +F
Sbjct: 89  NRQFFNEPEENFWMVMVVRNPIIEKQSKDGKPVIEYQEEELLDKVYSSVLRQCYSMYKLF 148

Query: 138 HGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHMLTIGR 197
           +G     +E +   ++ +  L  F   YL      + L L S  D L   G +    + +
Sbjct: 149 NGTFLKAME-DGGVKLLKERLEKFFHRYL------QTLHLQS-CDLLDIFGGISFFPLDK 208

Query: 198 DAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPSVLSS 257
              +++Q+ +  ++  + N    +  L+ D L+ + L  +D   L+ Y    L P  +  
Sbjct: 209 MTYLKIQSFINRMEESL-NIVKYTAFLYNDQLIWSGLEQDDMRILYKYLTTSLFPRHIEP 268

Query: 258 SASSWSYLIRGNTVSHVTQHG 266
             +     IR     ++  +G
Sbjct: 269 ELAGRDSPIRAEMPGNLQHYG 280

BLAST of Cp4.1LG01g05840 vs. Swiss-Prot
Match: CCZ1_BOVIN (Vacuolar fusion protein CCZ1 homolog OS=Bos taurus GN=CCZ1 PE=2 SV=1)

HSP 1 Score: 82.0 bits (201), Expect = 1.7e-14
Identity = 64/261 (24.52%), Postives = 122/261 (46.74%), Query Frame = 1

Query: 18  VFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTFTRTFSPEAACEVIEAEK 77
           +++ R G +EG+E +KILF+YP ++   +++  +GL E +V FTRTFSP    + +  +K
Sbjct: 27  IYNPRFGPREGEEENKILFYYPNEVEKNEKIRNVGLCEAIVQFTRTFSPSKPAKSLHTQK 86

Query: 78  HSHVFFEAEPDIWMVLVV-----EKNK-------EVQATWRMDAL-QKLLKEIHSLFLMF 137
           +   F E E + WMV+VV     EK         E Q    +D +   +L++ +S++ +F
Sbjct: 87  NRQFFNEPEENFWMVMVVRNPIIEKQSKDGKPVVEYQEEELLDKVYSSVLQQCYSMYKLF 146

Query: 138 HGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHMLTIGR 197
           +G     +E +   ++ +  L  F   YL      + L L S  D L   G +    + +
Sbjct: 147 NGTFLRAME-DGGVKLLKERLEKFFHRYL------QTLHLQS-CDLLDIFGGISFFPLDK 206

Query: 198 DAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPSVLSS 257
              +++Q+ +  ++  +      +  L+ D L+ + L  +D   L+ Y    L P  +  
Sbjct: 207 MTYLKIQSFINRMEESLSIVK-YTAFLYNDQLIWSGLEQDDMRILYKYLTTSLFPRHIEP 266

Query: 258 SASSWSYLIRGNTVSHVTQHG 266
             +     IR     ++  +G
Sbjct: 267 ELAGRDSPIRAEMPGNLQHYG 278

BLAST of Cp4.1LG01g05840 vs. TrEMBL
Match: A0A0A0KL91_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G025930 PE=4 SV=1)

HSP 1 Score: 780.8 bits (2015), Expect = 8.7e-223
Identity = 392/435 (90.11%), Postives = 408/435 (93.79%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MGLSTATTAVSEAI+LCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGL+TF
Sbjct: 1   MGLSTATTAVSEAIQLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TRTFSPEAACEVIEAEKHSHVFFEAE DIWMVLVVEKNKE++A WR+DALQKLLKEIHSL
Sbjct: 61  TRTFSPEAACEVIEAEKHSHVFFEAEQDIWMVLVVEKNKELEAIWRIDALQKLLKEIHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 180
           FLMFHG IRLLLEKEPTGEVSRSHLYSFIMDYL+DFLVGKKL LPSF DCLKERGTV ML
Sbjct: 121 FLMFHGSIRLLLEKEPTGEVSRSHLYSFIMDYLNDFLVGKKLHLPSFTDCLKERGTVQML 180

Query: 181 TIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPS 240
           TIGRDAA++VQALVRTLDSCIGN SC SLILFQDLLVST+LSP+DTTNLFSYAVLRL P 
Sbjct: 181 TIGRDAALDVQALVRTLDSCIGNTSCHSLILFQDLLVSTTLSPDDTTNLFSYAVLRLIPR 240

Query: 241 VLSSSASSWSYLIRGNTVSHVTQHGGNVGNHVIRPLQHGKWSKGKDGFLETDIWGMEASG 300
           VLSS ASSWSYLIRGN  SHV QHGGNVGN VIRPLQHGKWSKGKDGFLETDIWGMEASG
Sbjct: 241 VLSSGASSWSYLIRGNVASHVGQHGGNVGNRVIRPLQHGKWSKGKDGFLETDIWGMEASG 300

Query: 301 RVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIPNGEQGISIIRQYFLENASLKI 360
            V STPKIW FQTE+QM LYVHQHK+LTLILLVP SSIPNGEQG+SI+RQY LENASLKI
Sbjct: 301 WVGSTPKIWLFQTEEQMCLYVHQHKTLTLILLVPVSSIPNGEQGVSIVRQYILENASLKI 360

Query: 361 MTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPPGKVTTLTKESLLAMSKLREKV 420
           + VEEKLSKGWGGENAYHVGGYRYLL+DGDRQISRASPPGKVTTL KESLLAMSKLRE V
Sbjct: 361 VKVEEKLSKGWGGENAYHVGGYRYLLVDGDRQISRASPPGKVTTLAKESLLAMSKLRENV 420

Query: 421 DLEKSGAKQDGVGEE 436
           DLEKS AKQD  GEE
Sbjct: 421 DLEKSRAKQDSDGEE 435

BLAST of Cp4.1LG01g05840 vs. TrEMBL
Match: W9R3N9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_006765 PE=4 SV=1)

HSP 1 Score: 627.5 bits (1617), Expect = 1.2e-176
Identity = 314/456 (68.86%), Postives = 367/456 (80.48%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MGLS+A T ++E ++LCVFD RRGQQEGQELDKILFF+PADLPF+ QL +IGLSEGL+TF
Sbjct: 1   MGLSSAATTMAEGLQLCVFDLRRGQQEGQELDKILFFFPADLPFSTQLSVIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TR FSPEAACEVIEAE+HSHVF+EAEPDIWMV+VVEK+KE +A WR+DAL+K+L E+HSL
Sbjct: 61  TRIFSPEAACEVIEAERHSHVFYEAEPDIWMVMVVEKSKESEAIWRVDALRKVLMEVHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 180
           F MF+G IR LLEKEP GE+ RSHLY F+MDYL DFL GKKL LPSF DCLKERGTV ML
Sbjct: 121 FTMFNGSIRALLEKEPGGELVRSHLYPFVMDYLCDFLAGKKLLLPSFRDCLKERGTVQML 180

Query: 181 TIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPS 240
           T+GR+AAIEVQ+L R ++SC GNA C S+ILFQDLLVST+LSP+DT NLF+YAVLRLTP 
Sbjct: 181 TVGREAAIEVQSLARVIESCTGNAPCYSMILFQDLLVSTTLSPDDTMNLFTYAVLRLTPR 240

Query: 241 VLSSSASSWSYLIRGNTVSHVT-----QHGGNV----------------GNHVIRPLQHG 300
            LSS  SSWSYL +G T  HV       H   +                GN V RPL+H 
Sbjct: 241 ALSSGVSSWSYLRKG-TAPHVATASMLSHYRTISEQFYASRDISSAVDNGNRVTRPLRHN 300

Query: 301 KWSKGKDGFLETDIWGMEASGRVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIP 360
           KWSKGKDGFL TDIWGMEA   V STP +   Q+E +MYL  HQHK+LT++ LVP SS+P
Sbjct: 301 KWSKGKDGFLVTDIWGMEAGSSVASTPTVLLRQSEDRMYLCPHQHKNLTIVFLVPVSSMP 360

Query: 361 NGEQGISIIRQYFLENASLKIMTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPP 420
           NGEQG+S+++Q FLENA+LKI+ VEEKLSKGWGGENAYHV GYRYLL+DGDR +SRASPP
Sbjct: 361 NGEQGVSVMKQQFLENAALKILKVEEKLSKGWGGENAYHVSGYRYLLVDGDRNVSRASPP 420

Query: 421 GKVTTLTKESLLAMSKLREKVDLEKSGAKQDGVGEE 436
           GKV TLTKES LA++KLRE+VDL+KS A+ D  G E
Sbjct: 421 GKVATLTKESFLALNKLREEVDLDKSRAQWDNAGHE 455

BLAST of Cp4.1LG01g05840 vs. TrEMBL
Match: A0A061EAD3_THECC (F23A5.27 isoform 5 OS=Theobroma cacao GN=TCM_011272 PE=4 SV=1)

HSP 1 Score: 626.3 bits (1614), Expect = 2.7e-176
Identity = 316/456 (69.30%), Postives = 371/456 (81.36%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MGL++  TA SE ++LC+FD RRGQ EGQELDKILFF+PADLPF+ QL +IGLSEGL+TF
Sbjct: 1   MGLASIGTA-SEGMQLCIFDLRRGQHEGQELDKILFFFPADLPFSTQLSVIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TR FSPEAACEVIEAE+HSHVF+EAEPDIWMV+VVEK+KE++A WR+DAL+++LKEIHSL
Sbjct: 61  TRIFSPEAACEVIEAERHSHVFYEAEPDIWMVMVVEKSKELEAIWRIDALREVLKEIHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 180
           F+MFHG IR LL+KEP+GE++R+HLY FIMDYL DFLVGKKLQLPSF DCLKER TV ML
Sbjct: 121 FMMFHGSIRALLDKEPSGELTRAHLYPFIMDYLRDFLVGKKLQLPSFRDCLKERRTVQML 180

Query: 181 TIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPS 240
           T+GR+AAIEVQ LVR L+ C GN  C SLILFQDLLVST+LSPEDT NLF+YAVLRLTP 
Sbjct: 181 TVGREAAIEVQTLVRVLELCAGNTPCSSLILFQDLLVSTTLSPEDTINLFTYAVLRLTPH 240

Query: 241 VLSSSASSWSYLIRGNTVSHV---------------------TQHGGNVGNHVIRPLQHG 300
            LSS ASSWSYL +GN+ SHV                     T   G+    + RPLQH 
Sbjct: 241 ALSSGASSWSYLRKGNSSSHVATVSTLAPSGSVSEQFYGSRDTSPAGDNRYRITRPLQHD 300

Query: 301 KWSKGKDGFLETDIWGMEASGRVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIP 360
           KW KGKDGFL TDIWGM+A     +TP +W  QTE++MYL  +Q++SLTLILL+P SSI 
Sbjct: 301 KWFKGKDGFLSTDIWGMDAGSLNVTTPTVWLRQTEERMYLCAYQYRSLTLILLIPFSSIL 360

Query: 361 NGEQGISIIRQYFLENASLKIMTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPP 420
           NGEQG+SI++Q  LENASLKI+ VEEKLSKGWGGENAYHV GYRYLL+DG+R+ISRASPP
Sbjct: 361 NGEQGVSIVKQQLLENASLKILKVEEKLSKGWGGENAYHVSGYRYLLVDGNREISRASPP 420

Query: 421 GKVTTLTKESLLAMSKLREKVDLEKSGAKQDGVGEE 436
            KVTTLTKESLLA+++LRE+VD EKS AK D  G +
Sbjct: 421 AKVTTLTKESLLALNRLREEVDSEKSRAKWDNPGHD 455

BLAST of Cp4.1LG01g05840 vs. TrEMBL
Match: D7T8I4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g06030 PE=4 SV=1)

HSP 1 Score: 624.8 bits (1610), Expect = 8.0e-176
Identity = 315/470 (67.02%), Postives = 372/470 (79.15%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MGLS+A T V+E  + C+FD RRGQ EGQELDKILFFYPADLPF+ QL +IGLSEGL+TF
Sbjct: 1   MGLSSANTVVNEGFQFCIFDLRRGQHEGQELDKILFFYPADLPFSTQLSVIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TR FSPEAACEVIEAE+HSHVF +AEPDIWMV+VVEK+KE  A WR+DAL+++LKE+HSL
Sbjct: 61  TRIFSPEAACEVIEAERHSHVFHQAEPDIWMVMVVEKSKESDAIWRIDALRRVLKEVHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLS-------------DFLVGKKLQLPSF 180
           F+MFHG IR LL+KEP+GE+ RSHLY+FIMDYLS             DFLVGKK++LPSF
Sbjct: 121 FVMFHGSIRSLLDKEPSGELVRSHLYAFIMDYLSAFQKRSPWDICCCDFLVGKKIKLPSF 180

Query: 181 LDCLKERGTVHMLTIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTT 240
            DCLKERGTV MLT+GR+AA+EVQ+LVR L+SC GNA C SL+LFQDLLVST+LSP+DT 
Sbjct: 181 RDCLKERGTVQMLTVGREAALEVQSLVRVLESCAGNAPCYSLVLFQDLLVSTTLSPDDTI 240

Query: 241 NLFSYAVLRLTPSVLSSSASSWSYLIRGNTVSHV----------------------TQHG 300
           NLF+YAVLRL P+ L S ASSWSYL +GNT S +                      + HG
Sbjct: 241 NLFTYAVLRLAPNALLSRASSWSYLRKGNTASQIAAASVMASSGSVSEQFYGSRDTSPHG 300

Query: 301 GNVGNHVIRPLQHGKWSKGKDGFLETDIWGMEASGRVNSTPKIWPFQTEKQMYLYVHQHK 360
           G   +HV+RPLQH KW KG DGFL TDIWG E    V++TP +   QTE++MYL V+QHK
Sbjct: 301 GE-RSHVVRPLQHNKWYKGTDGFLVTDIWGPEVGSMVSATPTVLLHQTEERMYLCVYQHK 360

Query: 361 SLTLILLVPASSIPNGEQGISIIRQYFLENASLKIMTVEEKLSKGWGGENAYHVGGYRYL 420
           SLTLILL P SSI NGEQGIS+++Q  +ENASLK++ VEEKLSKGWGGENAYHVGGYRYL
Sbjct: 361 SLTLILLFPISSILNGEQGISVVKQQIVENASLKMLKVEEKLSKGWGGENAYHVGGYRYL 420

Query: 421 LIDGDRQISRASPPGKVTTLTKESLLAMSKLREKVDLEKSGAKQDGVGEE 436
           L+DGDR +SRASPPGKVTTLTKESL+++S LRE++DLEKS AK D    E
Sbjct: 421 LVDGDRNVSRASPPGKVTTLTKESLISLSNLREEIDLEKSRAKWDDPDHE 469

BLAST of Cp4.1LG01g05840 vs. TrEMBL
Match: A0A061E9N8_THECC (F23A5.27 isoform 6 OS=Theobroma cacao GN=TCM_011272 PE=4 SV=1)

HSP 1 Score: 622.5 bits (1604), Expect = 4.0e-175
Identity = 311/435 (71.49%), Postives = 365/435 (83.91%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MGL++  TA SE ++LC+FD RRGQ EGQELDKILFF+PADLPF+ QL +IGLSEGL+TF
Sbjct: 1   MGLASIGTA-SEGMQLCIFDLRRGQHEGQELDKILFFFPADLPFSTQLSVIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TR FSPEAACEVIEAE+HSHVF+EAEPDIWMV+VVEK+KE++A WR+DAL+++LKEIHSL
Sbjct: 61  TRIFSPEAACEVIEAERHSHVFYEAEPDIWMVMVVEKSKELEAIWRIDALREVLKEIHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 180
           F+MFHG IR LL+KEP+GE++R+HLY FIMDYL DFLVGKKLQLPSF DCLKER TV ML
Sbjct: 121 FMMFHGSIRALLDKEPSGELTRAHLYPFIMDYLRDFLVGKKLQLPSFRDCLKERRTVQML 180

Query: 181 TIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPS 240
           T+GR+AAIEVQ LVR L+ C GN  C SLILFQDLLVST+LSPEDT NLF+YAVLRLTP 
Sbjct: 181 TVGREAAIEVQTLVRVLELCAGNTPCSSLILFQDLLVSTTLSPEDTINLFTYAVLRLTPH 240

Query: 241 VLSSSASSWSYLIRGNTVSHVTQHGGNVGNHVIRPLQHGKWSKGKDGFLETDIWGMEASG 300
            LSS ASSWSYL +G+    +T           RPLQH KW KGKDGFL TDIWGM+A  
Sbjct: 241 ALSSGASSWSYLRKGDNRYRIT-----------RPLQHDKWFKGKDGFLSTDIWGMDAGS 300

Query: 301 RVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIPNGEQGISIIRQYFLENASLKI 360
              +TP +W  QTE++MYL  +Q++SLTLILL+P SSI NGEQG+SI++Q  LENASLKI
Sbjct: 301 LNVTTPTVWLRQTEERMYLCAYQYRSLTLILLIPFSSILNGEQGVSIVKQQLLENASLKI 360

Query: 361 MTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPPGKVTTLTKESLLAMSKLREKV 420
           + VEEKLSKGWGGENAYHV GYRYLL+DG+R+ISRASPP KVTTLTKESLLA+++LRE+V
Sbjct: 361 LKVEEKLSKGWGGENAYHVSGYRYLLVDGNREISRASPPAKVTTLTKESLLALNRLREEV 420

Query: 421 DLEKSGAKQDGVGEE 436
           D EKS AK D  G +
Sbjct: 421 DSEKSRAKWDNPGHD 423

BLAST of Cp4.1LG01g05840 vs. TAIR10
Match: AT1G80910.1 (AT1G80910.1 Protein of unknown function (DUF1712))

HSP 1 Score: 562.0 bits (1447), Expect = 3.2e-160
Identity = 282/446 (63.23%), Postives = 348/446 (78.03%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MG+++ ++  +E+++LCVFD RRGQ EGQELDKILFFYP DL F+ QL +IGLSEGL+TF
Sbjct: 1   MGMASMSSG-TESLRLCVFDLRRGQHEGQELDKILFFYPPDLTFSTQLSVIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TR FSPEAACEVIEAE+HSHVF+EAEPDIWMV++VEKNKE++A WR+DAL+++LKE+HSL
Sbjct: 61  TRLFSPEAACEVIEAERHSHVFYEAEPDIWMVMIVEKNKEIEAVWRIDALRRVLKEVHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 180
           F+MF G IR LLEKEPTG + RSHLY FI DYL+D  VGKK QLPSF D LKERGTV ML
Sbjct: 121 FVMFQGSIRALLEKEPTGGLVRSHLYPFITDYLNDLFVGKKQQLPSFRDTLKERGTVQML 180

Query: 181 TIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPS 240
           T+ RDAA+EVQ+LV  LDSC G   C S+ILF DLLVST+LSP+DT +LF+++V+RLT +
Sbjct: 181 TLARDAALEVQSLVGVLDSCAGTVRCHSVILFHDLLVSTTLSPDDTVDLFAFSVMRLTTN 240

Query: 241 VLSSSASSWSYLIRGNTVSHVTQH----------------GGNVGNHVIRPLQHGKWSKG 300
            LSS  SSWSYL +G+    ++                   G+    VIRPLQH KWSKG
Sbjct: 241 ALSSGTSSWSYLRKGSGSPQISSRSTTVPPLGSGGTLPSGNGSSTGRVIRPLQHDKWSKG 300

Query: 301 KDGFLETDIWGMEASGRVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIPNGEQG 360
           KDGFL TDIWG++A      TP I   +T++  YL  +Q+KSLTL+LLVP ++I NGE  
Sbjct: 301 KDGFLVTDIWGLDA------TPTILIQKTQESFYLLTYQYKSLTLVLLVPIAAIVNGELD 360

Query: 361 ISIIRQYFLENASLKIMTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPPGKVTT 420
           IS ++Q  +ENAS KI+ VEEKLSKGWGGENAYHV GYRYLL+D D ++SRASPPGKV T
Sbjct: 361 ISFVKQQVIENASTKILKVEEKLSKGWGGENAYHVSGYRYLLVDNDMEVSRASPPGKVAT 420

Query: 421 LTKESLLAMSKLREKVDLEKSGAKQD 431
           L KESLLA++KLRE+VD EK+ +KQ+
Sbjct: 421 LAKESLLALNKLREEVDTEKNRSKQE 439

BLAST of Cp4.1LG01g05840 vs. TAIR10
Match: AT1G16020.1 (AT1G16020.1 Protein of unknown function (DUF1712))

HSP 1 Score: 549.7 bits (1415), Expect = 1.7e-156
Identity = 274/446 (61.43%), Postives = 346/446 (77.58%), Query Frame = 1

Query: 12  EAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTFTRTFSPEAACE 71
           E+++LC+FD RRGQ EGQEL+KILFFYPADL F+ QL +IGLSEGL+TFTR FSPEAACE
Sbjct: 9   ESLRLCMFDLRRGQTEGQELEKILFFYPADLDFSTQLSVIGLSEGLITFTRLFSPEAACE 68

Query: 72  VIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSLFLMFHGPIRLL 131
           VIEAE+HSHVF+EAEPDIWMV+VVEKNKE  A WR+DAL+++LKE+HSLF+MFHG IR L
Sbjct: 69  VIEAERHSHVFYEAEPDIWMVMVVEKNKETGAIWRIDALRRVLKEVHSLFVMFHGSIRAL 128

Query: 132 LEKEPTGEVSRSHLYSFIMDYLS-------------DFLVGKKLQLPSFLDCLKERGTVH 191
           +EKEPTG ++RS LY FI DYLS             +F VGKKLQLP+F + L+ERGTV 
Sbjct: 129 IEKEPTGGLTRSLLYPFITDYLSTFQIWSLSEDCCCEFFVGKKLQLPTFRETLRERGTVQ 188

Query: 192 MLTIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLT 251
           MLT+ RD A+EVQ+LV+ LDSC G+  C S+ILFQDLLVST+LS +DT +LF++AV+RLT
Sbjct: 189 MLTLARDTAVEVQSLVQVLDSCAGSLRCHSMILFQDLLVSTTLSADDTVDLFTFAVMRLT 248

Query: 252 PSVLSSSASSWSYLIRG---------------NTVSHVTQHGGNVGNHVIRPLQHGKWSK 311
              LSS  SSWSYL +G                ++  +    GN  +HVIRPLQ+ KW+K
Sbjct: 249 SKALSSDTSSWSYLRKGPGSSEISSRSNLAPVGSIDSLHSRNGNNMHHVIRPLQNDKWTK 308

Query: 312 GKDGFLETDIWGMEASGRVNST-PKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIPNGE 371
           GKDGFL TDIWG+E  G  +S  P IW  QT+++MYL  +QHKSLTL+LL+P ++I NG+
Sbjct: 309 GKDGFLITDIWGLETGGSPDSAIPTIWLQQTQERMYLLAYQHKSLTLLLLMPTNAIVNGD 368

Query: 372 QGISIIRQYFLENASLKIMTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPPGKV 429
             IS ++Q  +E+ASL+I+ +EE +S+GWGGENAYH+ GYRYL++D D ++SR+SP GKV
Sbjct: 369 LSISAVKQQVIEDASLRILKIEENISRGWGGENAYHIKGYRYLVVDNDTKVSRSSPSGKV 428

BLAST of Cp4.1LG01g05840 vs. NCBI nr
Match: gi|449449030|ref|XP_004142268.1| (PREDICTED: vacuolar fusion protein CCZ1 homolog [Cucumis sativus])

HSP 1 Score: 780.8 bits (2015), Expect = 1.2e-222
Identity = 392/435 (90.11%), Postives = 408/435 (93.79%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MGLSTATTAVSEAI+LCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGL+TF
Sbjct: 1   MGLSTATTAVSEAIQLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TRTFSPEAACEVIEAEKHSHVFFEAE DIWMVLVVEKNKE++A WR+DALQKLLKEIHSL
Sbjct: 61  TRTFSPEAACEVIEAEKHSHVFFEAEQDIWMVLVVEKNKELEAIWRIDALQKLLKEIHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 180
           FLMFHG IRLLLEKEPTGEVSRSHLYSFIMDYL+DFLVGKKL LPSF DCLKERGTV ML
Sbjct: 121 FLMFHGSIRLLLEKEPTGEVSRSHLYSFIMDYLNDFLVGKKLHLPSFTDCLKERGTVQML 180

Query: 181 TIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPS 240
           TIGRDAA++VQALVRTLDSCIGN SC SLILFQDLLVST+LSP+DTTNLFSYAVLRL P 
Sbjct: 181 TIGRDAALDVQALVRTLDSCIGNTSCHSLILFQDLLVSTTLSPDDTTNLFSYAVLRLIPR 240

Query: 241 VLSSSASSWSYLIRGNTVSHVTQHGGNVGNHVIRPLQHGKWSKGKDGFLETDIWGMEASG 300
           VLSS ASSWSYLIRGN  SHV QHGGNVGN VIRPLQHGKWSKGKDGFLETDIWGMEASG
Sbjct: 241 VLSSGASSWSYLIRGNVASHVGQHGGNVGNRVIRPLQHGKWSKGKDGFLETDIWGMEASG 300

Query: 301 RVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIPNGEQGISIIRQYFLENASLKI 360
            V STPKIW FQTE+QM LYVHQHK+LTLILLVP SSIPNGEQG+SI+RQY LENASLKI
Sbjct: 301 WVGSTPKIWLFQTEEQMCLYVHQHKTLTLILLVPVSSIPNGEQGVSIVRQYILENASLKI 360

Query: 361 MTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPPGKVTTLTKESLLAMSKLREKV 420
           + VEEKLSKGWGGENAYHVGGYRYLL+DGDRQISRASPPGKVTTL KESLLAMSKLRE V
Sbjct: 361 VKVEEKLSKGWGGENAYHVGGYRYLLVDGDRQISRASPPGKVTTLAKESLLAMSKLRENV 420

Query: 421 DLEKSGAKQDGVGEE 436
           DLEKS AKQD  GEE
Sbjct: 421 DLEKSRAKQDSDGEE 435

BLAST of Cp4.1LG01g05840 vs. NCBI nr
Match: gi|659129454|ref|XP_008464695.1| (PREDICTED: vacuolar fusion protein CCZ1 homolog [Cucumis melo])

HSP 1 Score: 733.0 bits (1891), Expect = 3.0e-208
Identity = 365/407 (89.68%), Postives = 380/407 (93.37%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MGLSTATTAVSEAI+LCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGL+TF
Sbjct: 1   MGLSTATTAVSEAIQLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TRTFSPEAACEVIEAEKHSHVFFEAE DIWMVLVVEKNKE++A WR+ ALQKLLKEIHSL
Sbjct: 61  TRTFSPEAACEVIEAEKHSHVFFEAEQDIWMVLVVEKNKELEAIWRIGALQKLLKEIHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 180
           FLMFHG IRLLLEKEPTGEVSRSHLYSFIMDYL+DFLVGKKL LPSF DCLKERGTV ML
Sbjct: 121 FLMFHGSIRLLLEKEPTGEVSRSHLYSFIMDYLNDFLVGKKLHLPSFTDCLKERGTVQML 180

Query: 181 TIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPS 240
           TIGRDAA++VQALVR +DSCIGN SC SLILFQDLLVST+LSP+DT NLFSYAVLRL P 
Sbjct: 181 TIGRDAALDVQALVRIVDSCIGNTSCHSLILFQDLLVSTTLSPDDTANLFSYAVLRLIPR 240

Query: 241 VLSSSASSWSYLIRGNTVSHVTQHGGNVGNHVIRPLQHGKWSKGKDGFLETDIWGMEASG 300
           VLSS ASSWSYLIRGN  SHV QHGGNVGN VIRPLQHGKWSKGKDGFLETDIWGMEASG
Sbjct: 241 VLSSGASSWSYLIRGNVASHVAQHGGNVGNRVIRPLQHGKWSKGKDGFLETDIWGMEASG 300

Query: 301 RVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIPNGEQGISIIRQYFLENASLKI 360
            V STPKIW  QTE+QM LYVHQHKSLTLILLVP SSIPNGEQGISI+RQY LENASLKI
Sbjct: 301 WVGSTPKIWLSQTEEQMCLYVHQHKSLTLILLVPVSSIPNGEQGISIVRQYILENASLKI 360

Query: 361 MTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPPGKVTTLTK 408
           + VEEKLSKGWGGENAYHVGGYRYLL+DGDRQISRASPPGKVTTL K
Sbjct: 361 VKVEEKLSKGWGGENAYHVGGYRYLLVDGDRQISRASPPGKVTTLAK 407

BLAST of Cp4.1LG01g05840 vs. NCBI nr
Match: gi|225424418|ref|XP_002285048.1| (PREDICTED: vacuolar fusion protein CCZ1 homolog isoform X4 [Vitis vinifera])

HSP 1 Score: 634.0 bits (1634), Expect = 1.9e-178
Identity = 315/457 (68.93%), Postives = 372/457 (81.40%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MGLS+A T V+E  + C+FD RRGQ EGQELDKILFFYPADLPF+ QL +IGLSEGL+TF
Sbjct: 1   MGLSSANTVVNEGFQFCIFDLRRGQHEGQELDKILFFYPADLPFSTQLSVIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TR FSPEAACEVIEAE+HSHVF +AEPDIWMV+VVEK+KE  A WR+DAL+++LKE+HSL
Sbjct: 61  TRIFSPEAACEVIEAERHSHVFHQAEPDIWMVMVVEKSKESDAIWRIDALRRVLKEVHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 180
           F+MFHG IR LL+KEP+GE+ RSHLY+FIMDYLSDFLVGKK++LPSF DCLKERGTV ML
Sbjct: 121 FVMFHGSIRSLLDKEPSGELVRSHLYAFIMDYLSDFLVGKKIKLPSFRDCLKERGTVQML 180

Query: 181 TIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPS 240
           T+GR+AA+EVQ+LVR L+SC GNA C SL+LFQDLLVST+LSP+DT NLF+YAVLRL P+
Sbjct: 181 TVGREAALEVQSLVRVLESCAGNAPCYSLVLFQDLLVSTTLSPDDTINLFTYAVLRLAPN 240

Query: 241 VLSSSASSWSYLIRGNTVSHV----------------------TQHGGNVGNHVIRPLQH 300
            L S ASSWSYL +GNT S +                      + HGG   +HV+RPLQH
Sbjct: 241 ALLSRASSWSYLRKGNTASQIAAASVMASSGSVSEQFYGSRDTSPHGGE-RSHVVRPLQH 300

Query: 301 GKWSKGKDGFLETDIWGMEASGRVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSI 360
            KW KG DGFL TDIWG E    V++TP +   QTE++MYL V+QHKSLTLILL P SSI
Sbjct: 301 NKWYKGTDGFLVTDIWGPEVGSMVSATPTVLLHQTEERMYLCVYQHKSLTLILLFPISSI 360

Query: 361 PNGEQGISIIRQYFLENASLKIMTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASP 420
            NGEQGIS+++Q  +ENASLK++ VEEKLSKGWGGENAYHVGGYRYLL+DGDR +SRASP
Sbjct: 361 LNGEQGISVVKQQIVENASLKMLKVEEKLSKGWGGENAYHVGGYRYLLVDGDRNVSRASP 420

Query: 421 PGKVTTLTKESLLAMSKLREKVDLEKSGAKQDGVGEE 436
           PGKVTTLTKESL+++S LRE++DLEKS AK D    E
Sbjct: 421 PGKVTTLTKESLISLSNLREEIDLEKSRAKWDDPDHE 456

BLAST of Cp4.1LG01g05840 vs. NCBI nr
Match: gi|731369129|ref|XP_010649909.1| (PREDICTED: vacuolar fusion protein CCZ1 homolog isoform X3 [Vitis vinifera])

HSP 1 Score: 629.4 bits (1622), Expect = 4.7e-177
Identity = 315/458 (68.78%), Postives = 372/458 (81.22%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MGLS+A T V+E  + C+FD RRGQ EGQELDKILFFYPADLPF+ QL +IGLSEGL+TF
Sbjct: 1   MGLSSANTVVNEGFQFCIFDLRRGQHEGQELDKILFFYPADLPFSTQLSVIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TR FSPEAACEVIEAE+HSHVF +AEPDIWMV+VVEK+KE  A WR+DAL+++LKE+HSL
Sbjct: 61  TRIFSPEAACEVIEAERHSHVFHQAEPDIWMVMVVEKSKESDAIWRIDALRRVLKEVHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 180
           F+MFHG IR LL+KEP+GE+ RSHLY+FIMDYLSDFLVGKK++LPSF DCLKERGTV ML
Sbjct: 121 FVMFHGSIRSLLDKEPSGELVRSHLYAFIMDYLSDFLVGKKIKLPSFRDCLKERGTVQML 180

Query: 181 TIGRDAAIEV-QALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTP 240
           T+GR+AA+EV Q+LVR L+SC GNA C SL+LFQDLLVST+LSP+DT NLF+YAVLRL P
Sbjct: 181 TVGREAALEVQQSLVRVLESCAGNAPCYSLVLFQDLLVSTTLSPDDTINLFTYAVLRLAP 240

Query: 241 SVLSSSASSWSYLIRGNTVSHV----------------------TQHGGNVGNHVIRPLQ 300
           + L S ASSWSYL +GNT S +                      + HGG   +HV+RPLQ
Sbjct: 241 NALLSRASSWSYLRKGNTASQIAAASVMASSGSVSEQFYGSRDTSPHGGE-RSHVVRPLQ 300

Query: 301 HGKWSKGKDGFLETDIWGMEASGRVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASS 360
           H KW KG DGFL TDIWG E    V++TP +   QTE++MYL V+QHKSLTLILL P SS
Sbjct: 301 HNKWYKGTDGFLVTDIWGPEVGSMVSATPTVLLHQTEERMYLCVYQHKSLTLILLFPISS 360

Query: 361 IPNGEQGISIIRQYFLENASLKIMTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRAS 420
           I NGEQGIS+++Q  +ENASLK++ VEEKLSKGWGGENAYHVGGYRYLL+DGDR +SRAS
Sbjct: 361 ILNGEQGISVVKQQIVENASLKMLKVEEKLSKGWGGENAYHVGGYRYLLVDGDRNVSRAS 420

Query: 421 PPGKVTTLTKESLLAMSKLREKVDLEKSGAKQDGVGEE 436
           PPGKVTTLTKESL+++S LRE++DLEKS AK D    E
Sbjct: 421 PPGKVTTLTKESLISLSNLREEIDLEKSRAKWDDPDHE 457

BLAST of Cp4.1LG01g05840 vs. NCBI nr
Match: gi|703087572|ref|XP_010093307.1| (hypothetical protein L484_006765 [Morus notabilis])

HSP 1 Score: 627.5 bits (1617), Expect = 1.8e-176
Identity = 314/456 (68.86%), Postives = 367/456 (80.48%), Query Frame = 1

Query: 1   MGLSTATTAVSEAIKLCVFDSRRGQQEGQELDKILFFYPADLPFTKQLYIIGLSEGLVTF 60
           MGLS+A T ++E ++LCVFD RRGQQEGQELDKILFF+PADLPF+ QL +IGLSEGL+TF
Sbjct: 1   MGLSSAATTMAEGLQLCVFDLRRGQQEGQELDKILFFFPADLPFSTQLSVIGLSEGLITF 60

Query: 61  TRTFSPEAACEVIEAEKHSHVFFEAEPDIWMVLVVEKNKEVQATWRMDALQKLLKEIHSL 120
           TR FSPEAACEVIEAE+HSHVF+EAEPDIWMV+VVEK+KE +A WR+DAL+K+L E+HSL
Sbjct: 61  TRIFSPEAACEVIEAERHSHVFYEAEPDIWMVMVVEKSKESEAIWRVDALRKVLMEVHSL 120

Query: 121 FLMFHGPIRLLLEKEPTGEVSRSHLYSFIMDYLSDFLVGKKLQLPSFLDCLKERGTVHML 180
           F MF+G IR LLEKEP GE+ RSHLY F+MDYL DFL GKKL LPSF DCLKERGTV ML
Sbjct: 121 FTMFNGSIRALLEKEPGGELVRSHLYPFVMDYLCDFLAGKKLLLPSFRDCLKERGTVQML 180

Query: 181 TIGRDAAIEVQALVRTLDSCIGNASCCSLILFQDLLVSTSLSPEDTTNLFSYAVLRLTPS 240
           T+GR+AAIEVQ+L R ++SC GNA C S+ILFQDLLVST+LSP+DT NLF+YAVLRLTP 
Sbjct: 181 TVGREAAIEVQSLARVIESCTGNAPCYSMILFQDLLVSTTLSPDDTMNLFTYAVLRLTPR 240

Query: 241 VLSSSASSWSYLIRGNTVSHVT-----QHGGNV----------------GNHVIRPLQHG 300
            LSS  SSWSYL +G T  HV       H   +                GN V RPL+H 
Sbjct: 241 ALSSGVSSWSYLRKG-TAPHVATASMLSHYRTISEQFYASRDISSAVDNGNRVTRPLRHN 300

Query: 301 KWSKGKDGFLETDIWGMEASGRVNSTPKIWPFQTEKQMYLYVHQHKSLTLILLVPASSIP 360
           KWSKGKDGFL TDIWGMEA   V STP +   Q+E +MYL  HQHK+LT++ LVP SS+P
Sbjct: 301 KWSKGKDGFLVTDIWGMEAGSSVASTPTVLLRQSEDRMYLCPHQHKNLTIVFLVPVSSMP 360

Query: 361 NGEQGISIIRQYFLENASLKIMTVEEKLSKGWGGENAYHVGGYRYLLIDGDRQISRASPP 420
           NGEQG+S+++Q FLENA+LKI+ VEEKLSKGWGGENAYHV GYRYLL+DGDR +SRASPP
Sbjct: 361 NGEQGVSVMKQQFLENAALKILKVEEKLSKGWGGENAYHVSGYRYLLVDGDRNVSRASPP 420

Query: 421 GKVTTLTKESLLAMSKLREKVDLEKSGAKQDGVGEE 436
           GKV TLTKES LA++KLRE+VDL+KS A+ D  G E
Sbjct: 421 GKVATLTKESFLALNKLREEVDLDKSRAQWDNAGHE 455

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
CCZ1_NEMVE1.2e-2026.56Vacuolar fusion protein CCZ1 homolog OS=Nematostella vectensis GN=v1g238755 PE=3... [more]
CCZ1_DICDI3.1e-1924.52Vacuolar fusion protein CCZ1 homolog OS=Dictyostelium discoideum GN=DDB_G0288589... [more]
CCZ1B_HUMAN1.3e-1424.90Vacuolar fusion protein CCZ1 homolog B OS=Homo sapiens GN=CCZ1B PE=1 SV=1[more]
CCZ1_HUMAN1.3e-1424.90Vacuolar fusion protein CCZ1 homolog OS=Homo sapiens GN=CCZ1 PE=1 SV=1[more]
CCZ1_BOVIN1.7e-1424.52Vacuolar fusion protein CCZ1 homolog OS=Bos taurus GN=CCZ1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KL91_CUCSA8.7e-22390.11Uncharacterized protein OS=Cucumis sativus GN=Csa_5G025930 PE=4 SV=1[more]
W9R3N9_9ROSA1.2e-17668.86Uncharacterized protein OS=Morus notabilis GN=L484_006765 PE=4 SV=1[more]
A0A061EAD3_THECC2.7e-17669.30F23A5.27 isoform 5 OS=Theobroma cacao GN=TCM_011272 PE=4 SV=1[more]
D7T8I4_VITVI8.0e-17667.02Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g06030 PE=4 SV=... [more]
A0A061E9N8_THECC4.0e-17571.49F23A5.27 isoform 6 OS=Theobroma cacao GN=TCM_011272 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G80910.13.2e-16063.23 Protein of unknown function (DUF1712)[more]
AT1G16020.11.7e-15661.43 Protein of unknown function (DUF1712)[more]
Match NameE-valueIdentityDescription
gi|449449030|ref|XP_004142268.1|1.2e-22290.11PREDICTED: vacuolar fusion protein CCZ1 homolog [Cucumis sativus][more]
gi|659129454|ref|XP_008464695.1|3.0e-20889.68PREDICTED: vacuolar fusion protein CCZ1 homolog [Cucumis melo][more]
gi|225424418|ref|XP_002285048.1|1.9e-17868.93PREDICTED: vacuolar fusion protein CCZ1 homolog isoform X4 [Vitis vinifera][more]
gi|731369129|ref|XP_010649909.1|4.7e-17768.78PREDICTED: vacuolar fusion protein CCZ1 homolog isoform X3 [Vitis vinifera][more]
gi|703087572|ref|XP_010093307.1|1.8e-17668.86hypothetical protein L484_006765 [Morus notabilis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR013176DUF1712_fun
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0016310 phosphorylation
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0016301 kinase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g05840.1Cp4.1LG01g05840.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013176Protein of unknown function DUF1712, fungiPFAMPF08217DUF1712coord: 16..417
score: 2.8
NoneNo IPR availablePANTHERPTHR13056UNCHARACTERIZEDcoord: 16..432
score: 4.3E