Cp4.1LG07g05460 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG07g05460
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionGanglioside-induced differentiation-associated protein 2
LocationCp4.1LG07 : 4352174 .. 4374019 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GGTTTGTTTCCGGGATTTCTTTGACGAATGTAGGTCTTTGCATGGGCAATCCATTTTTGAGGTTTGCAAGGACTTGGTGGCGTTGTTAAGATGTACAGGACAGTGGCTACATCTGCGGCCACAACCACAGCAACAACAACAACAGATAGCGTTGATTATGTTGTAAATTTGGATCAAATTCCACGGTGGAGTGATGCAGAACATAGGTCTTCATTGGAGTTCGTCAACGAAGATCCTTCCTTCTCTAATTCGTATTTTCCTGATCCTCTGACGTCTCCATCTGACGCGGAGGGTGGTAATTATGGAGTGGTTTCGAGATTTCCCGTCGATCATGAAATTAATTCCAAAATATATCTGTGGCGGGGGAATCCTTGGAATCTCGAGGTAGATGCTGTCGTGAATTCTACAAACGAGGTGAGTGTTATGTTCTTCATCTCCTTCCTTCTGCAATAGAAAACCTTAATCAAATAAACAAAAGGGAAAGCAATGAGATGCTAAAAGTCCCAAGAACTCAATAATAGTTCCATTAGAACACACAGGGGACAATCTCAGCCCCGAGTTCTGTACATTTCAAAGAAGAAGACAAAACATAGACATAGTTGCAAGCACAATTCTTCAATGGGTACTAACTATCCCTCGTAATCATATGGTTTTATGTGGTTGAGCTTTAAGGTGATATCATTTGAAGTGGGATATATTAGTAGGAAAATTAGTTGAAATTGAAAATTTGTTGATTATTAAGTTTTGGTATGTCTAATCCACTACGAAACGTATGCATATTTCACATGGGATAAAGTGTCGTTTATCAACTTAACGATCACATTGCCATTCTGCAGTTGCTTCCATTGGCAAGATGGAATTAGATGGTGGTCTCCTTACCTTCAGAGGTTCTTGAAAAAATACCCCAAAACTGAAATGCCTGAGCTTAAGCAACATTTAAATTATGGGTGGATTGAAAGTTTTGTCTGATAAAGTTGAAAGGACATAAGTCTGTCCTGTACAAAAGAGATAGAAGATGAGAGCTTCATATCCTCCATTAGTGTTGACTTTGGACTGCTGGAAGACGTACTTAATTCCAAGCACCGACCAGCACTATCATATTAGAAAATCATCTTCTCTAACATTGCACCATACCAACCAAGGCAATTATTTTTACGAATTGTTGTATGAGATAATAGAGGAAGTAGGATAGTATATTGATATTTACCTTTATGAAGAGCGAGGGGAGATCATCACTTGTTCATCTATCTAATGATGAAGGATCTTTTGGTTCAAGACATGCACCTGGAAGTTGTCGGCTAGATTAATCGAGTTATTAGTGGGTGAATTGATATTGAGGAAGATGGATTAGGGAAGACAACCTTAGGTAAGGGAAAACGAACAATATAGACAAGAAGATCATGGCTCTCCCATGTGCTCTTCCTTGACTACTAGGAGAGATGATTAAAAGGAGAAAAATAGGCATCTTTAAAGAAAGTAAGACAAGGTAGTACAAAATTTAAACTAGGATAATAGTACTTATTCCCTTTTTGAACTAAAGATAACCGAAGAAAGTACATATTGAAGATTTTGGGTTTCAACTTTGTGAATTCGGAGCAAACAAGTATAGGCTAATATTCTTGTTATGGGAAATAAAGGCTTTAAAGGCTTTATAGGAACAAATCCTAGCATGAAATTTCGCCAATCTGCATTCAAGCTATGAAGGAGTTTTATGAAATAGTACAGGAATTTCAGATATTCAAGTAACTCCTCCACAAAGTCACCATGATTTATCAAACCAACAACCTGACTCTTGATTGAATTGGTTATCGTCAAGGCCAAACTGGCATAAACTTGGAACCAAGTTCTTCTAATGTCATTAGTTGATGGATAGGTCATGTCACATGTGAGTATGATGGAAACCCATTCCATAAAATTGGTGGATAAAATTAAAAACCAGAAACTCTAACCATCTTCTGAAGTGGTTCCACATGATGATCAAGCTTAGCACACGGACAAAATGAGAATGTGAGATGTTGAGACAAGAACTAGTAATGAAAGTAAACAGAAGGCGGACTGACTCACTATATTACGTTATTTGGTTGCGATCTTTATTGTATTGGGTTCGAATAACTTAAATTTTATGTACAATTGTCATGCCTCTAAAGGATGTAGAGATGCTTGATCATTGTTTATGACTTGACAGTATCTCAACCTATTCATACTAAAAGAAAATGAAGAGGTTGTCAGCACAATTGTAGACATGGTGGTAGTCTATGGCCTATCACTTTAAAACCTCTTAGCATAAAAGCCTGTTTGGTGATAGTTATGTTGTTGATACTGGTTGATCTGCAACACGGATTTGTATCTGTAATCAGAAAACTCCTTTTTCGGTTAAGTTTATATAAAATGAAATAAGCTGCTAATTTCTATATCTGCCATCTTTTCTGATCGTAAAATTAAATTATACAGAACTTGGATGAAGCACACAGTAGCCCTGGTTTGCATGCTGCAGCAGGACCTGGTCTATCAGAAGAATGTGGAACACTTGTACGCCTGTGTTTCTATTTACTTCAGTTTGGCTTTTGTTTATTGCTTTTTCCCTGATTATGAACTAAAAATATTCACAAATTAACATCTGCATATATCTTATAAGAAAAAACTGTGGATAGTTTTGGCCTTTTGGAGGTACAATGGATTTCACTATTTTCAGAAAGATTTCTTTATAGAACTTCTTTAGACAATAAGAATGTCAAAACCTGACTCATTATATACATTTCACATCCTGAAGGGTGGTTGTCGCACAGGAATGGCAAAAGTCACTAATGCTTATGACCTTCCTGCTAGGTGAGTATGCAGTTGATGTTTGGACCTAGATTAAAGACCTATACATCCATTTAATCTTTGGCATTCTTTTTTAGGAAGGTAATGCATACTGTAGGTCCAAAATATGCTGTCAAATACCATACTGCTGCAGAGAATGCTCTGAGTCATTGCTACCGTTCTTGCCTTGAACTTCTCATCGAAAATGGGCTTCAGAGGTTGTGAAAGTAGTCGATGCCACATTATTTTTTTCTCCATATCGTTTGCTCATCTCATGCATCTCTTGCCATTTTTAGCTTACATGTTAATGGCGATTTAATGCAGCATTGCTATGGGTTGTATATATACAGAGGGAAAAAACTATCCTCGTGAACCTGCTGCACACGTAGCAATAAGTGAGATATTTTTCTACATGTTCCTTTAAGATGTCTATGTTTTTTCTCTATTCTACTATGATTCAATTTTCTTGTGCTACTTATGTTCACAAATTATGGACATCCATTAACCGCCAATTTACCTAGTATGTTTAGCTTAGTAGGGGTGTTATCAGGGTGGGGCAGGGGCTCGGAAGTATTTTGGACCTAACCTACTAGGGAGTTGGGCTTGAGGACTTGCACTGATCCATGCTAGCAAGCTCCGGTATAAGAAACACAAAATTGTGAAAAGTTGTTTCTAAATGTTTATCACTTCTTTATTACTTTTCTTTGATGGATATAGGTCCATTTGGATCTAATCTTTACGATTTGTGATTGTTACCAACTTGCATTCTGTAAATCAGCACAGCGATTGCATTTAAATCATCGGAGATGATGAAATCTCCAGTAGGTAGCGGCATCCATTTCAAGTGTTTCTAAATTAGAAGTCTATGTTAGTATTAGTTAGGCATACAGCTGTATACTTGATCAACTTATTGACATGCTGCCAAAGAAGTTTCCCATTAAATGATGGATTTTGTTGGAACTCAAATTTAACATTTTTTTTTACAAAAAAAGAAAAAAATAAAAATAAAAAAAAGGAAAAGTAAAGAAAATTTAACATCTTATCCTCATTTTTATTTTGCTTCTTCTTCTGTTTAGGAACTGTGCGTCGTTTGATTGAGAAGCAGAAAGGTAAAATTAAAGCTGTTGTCTTTTGTACAACATCATCAATCGATACTGAGATATACAAAAGGTACTTCACTTTCAAGTAACTGAGGCCTGTCATTCCTAAAGGATATATTTTCCATGCCGTATAATATATAACAATCCTTGTTGCATTTATTCGATTTGCTTTTATATGTTTCTGAGCCAGGTTGCTTCCTCTGTACTTTCCCCGTGATAAACACGAAGAAGAAGTTGCATTGTCAAAGCTTCCTGCAGATGTTGGAGATGAGAATGGTGAGACTATTATAGATGAGCGGAAAATCAGGATAAACACTTTGCCCAAAAAGAATGTTCTGAAACCTCCCCAAGTTCCCGATGATCCTCCTGTCAGTCATGTAAGGTTGACACAGAGGTTAGTGTAATGATTTATGCAACGTAACAGTTCTTTTTCAAAAAATATTTCATCTTCTATCTGATGGTAGATTCATGAAAATTGCATAGCTGTATGAGTCAGAGTTGCCCTGAATTTGATGACATTGATCTGTGGCATCTACTTTATCTGTGTTCATCATGACTCCCGGGCCAGTTATATTTATTTGAAAGAAACAAAATTATATTGAAAACCTGAAAATCACGAAGGTAGATGGGCAATCCTCACTACAAGCTATGGGGATTACCAAAGTGGACCCCTATCTGCAAATGTACGAGGAAGATTGTAGTTACAAAGATCTTGATGAAGAGAAAAAGTGCAAGGTTTCTTGCATTCTACAACTCCGAAGCGAAGGTCGTGATTGCATCAGACCATTTGTTTTTTTCTTTTTTGATATTCGTTGAGTGGGTGGGCTAGCTTGACTATTCTCATAGAATAACTACTTGAATTTCTACATTTGGTTGTCAAGGTAGTTGTAAGATATTAAATCCTAGATAGGTGACCACCAAGGCTTGAACCCATTATCTCTAAGCTCTTTGTAATCTTTAAGATCAACTCTGAGAAAACTTTATGATCATGTCGATTAGATCTTCTTGGATAGAGTTTTGGGAAAGAAGGGTTTTGGGATGAAAAAAATTGGAGAGTTTGGATGTGGAATTGTATTAGGTCAATAAACTTTTCCATCTTTGTTAGAGGGAGGTCCCAAGGGAAGGGTAAAATTTTTCGCTATGAGGTTATCTGCTATTGCCTTTCCTCTTCCTCTGAGTGGTTGATGCCTTAAGTAGGTTAGTGTGACCGGAACCGAGAAGGGTGCGATATGAGTCTAGTTTTCCATGTACCTACTCATTATTCCAACAACATATGGAAGATTCGAACGAGTATGAGTCTAGTACCTTAGACTTCTGATGACATGACTATACTCAGTGGAATTCACCAAGTTTGCTTCCATGAGATACTTGGTCAGGTAGCACTCTACCATCTTAAATTACTGCAACAACTTCTTAGTATAGGCCGATTGCTTTAGCATGATACAATTTTTCCTTTGGTCCACTTTATTACTGAAGTATTACGTGAGTAATCTGAGATTTGTCATCTCAAATTCTTTCATCATTTGCTGCTTGAACTCTTTGATGCCTTCCACACTTGTACCAGTGATGATCAGGTCATCAGGTACGCACCAACTATGAGTGTTTTAGTCCCATTGTTTCTTGTATACACTGATTGTTCTTGCGAGCACTTCATGAAATTTAGACTCTTTAAGCTCTTGTCCAGATGTATGTTCCACGCCCTCGGTGCTTGTCGTAAATCGTAAAGGGCCCTTGACAACTTGTACACTTTGTGCTCTTTGGTTTTGATGACGAACCCTTCAGGTTGGGCAACATACACTTCTTCTTGGAGGTCACCATTTGGGAATGCTGATTTGACGTCCAAGTGATGGACCTCCCACTCGTCTTGAGCTGCGAGGGCAAGAATCAACCTTACAGTTCAGTCTAGCCATAGGCGCGAAAACCTCCTCAAAATCAACTCCTTGTCGTTGCACGTATCCCTTCGCCACGAGTCTTGCTTTATGCTTGATGACATTTCTTCACTATTCTTCTTCAACTTAAACACCCACTTCAAAATGATGGGCTTGTGTCTGGACGATAAGTTAGTTAGTGCCCACGTCTTGTTCTTCTCAATTGCTTCAAGCTCTTTTTGCATGGCCTCCTGCCACATCGTCTCAGTTTCTGCCTCGCGATAGGTTGTTGGCTCCTCGACCGCGAGTAACACTAACTCATCGGGATCCAATTCTTCTTCAAGTGTATCCGCATAAATTTCAGCAAGAAAACTAAACTTCTTTGGTCCATCCTCTGTTGTGGATCCTCTTGTGCTCTCATCTGAGCTCTCTTATTCTAGAGGGTCCATCACGCCCTTCGGTATCACCGGTGATGACATGTGTGGACTTGTTGGTGTTGCTGTAGGGGAACTTTCTAGGTCTATTGCATCTCCTTCTTCTTCTAGAGTAGTGAACTCTGTAACCGTTTGCTTGTTATCGCCAACACTGCACCAATCTCACTTCTCTTCTTCGAATACGACATCTCTACTTACACAAATTTTCTCATGTGCATATCATACAGCTTGTGTGCCTTGGTCTCATCCTCTATGCCAAAATACACTATCTTTTGGCTTATGTCATCAAGCTTTTTGATATATGGCTTTGCCATCTTAACATGTGCTGTACAACCTAAGACTCTTAGGTACTCGAAGTGAGGTGTCTTGCCGAACCATGCTTCATAGGGGGTGCGAGTGTCCAGCGCCTTTGTCCGTCAACGGTTTAGAAGATAAACCGCATGTTGTTTTGCCTCACTCTAGAAAATCACTGGGACTTGCATGCTTTTAAGTAAGCTTCTTGCAGTGTTCATTATGGTGCGGTTTTTTTTTCGACCACACCATTTCATGGTGAAGTGTACGGTGCGGTGAGTTGACGCTTGATACCTTCCTCTTTGCAAAACTTTGAGAAGTTATGGGATAAGAACTCTTCACCTCTGTCGGTGCACAAGGCTTTCAGCTTGTGGTTAGACCCGTTTTCAACGAGTCTTTTGAACTTCTTGAATGCATATAATGCCTCTCCTTTTTCTTTTAGCATGTACACCCACATCTAATGACTATAATCGTCAACAAGTAAGAGAAAATATTTGTTACCAGCAATCGATGATGGTGTGATCAGGCCACACAAATCTGCATGAACCAGTAGCAGTGGTCACTCAACATAGAAACTGGTTTGAGCTAGGAACGAAAATCTTGTTTGTTTGGAGACTAGGCAACTTTCACGTAGCTAGTTTGGGTGTATGATTTTGGGTAGCCTCGTAGCCATCTTCTTCTCTGCCATCATTTTGAACGTCTGAAAGTTCACGTGCCCAAGTCTTGCATGCCATAGCCAAGCTAGGTCTGCAAGTGCCGTCAATAGGGTTATGCGACCATTTTACCTTCATCAACAAGGTTCCATTTCTGTTGATCATCTTGAGGAACAAGCCAGCTAACTCCACTCTGCTTCCTTCTTCCATCATCTAACCAAGACTGATAATATTACTCTTCAAGTTTGGGATATAGTATACCTTGGTCAGTAAACGTTGGTCACCATTCTTGCACTGGAACATGGTAGATCCTTTGACTTGGATCGATTCAATTGATCCATCACCAAACTTCACGTTTCCAATGAGCTTCTCATCAAGCTCCTTAAACTTTGTTCGATCTCCAGTCATGTGGTTGCTTGCTCTGTTGTCAAGGTACCATATGTTGGTCTCTACTCGGTCTTCTCCTTTTGTGAGAAGGTTTGCCATACCTTTTCTTCGTTGAGTATCAACAGGTTGATCATCTTCTCGGCCAACATCAGTGCGGGCTCTTGATCTTGCGTGAGTGTGAGGTTTGTCTCCTCATCTCACTTCTTCTTGCAGCACTCTATCGCATAGTGCCCATATTTTCCACCAGAGTAACACTTGATCATACTTTTGTCCTTCTAGGGGTTGGCATTGTCATGGGTTTGTGAGGTATTGTCACGGCCTCTTCTGCCACCACGTCCACGACTATGTTCGCGACCACGTCCACGACCTCTATTTTCCTTGTCATGGCTGCCACGTCCCCTTGTACTTGAAAAAGAAGAGTCAGCTGCATCATTCTTTTCCGTTCGTGTGAGCCACTCTTCATGTGTGAGTAAGAGGTGTTTTTCTCATCTTTGTCTTCATAGACACGAAGTCTCTCCTCATGGACCTTAAGACGATCAACCTCCTCAATTGACATGTTCTTGAGGTCGCTGAACTGCTCTATGGAGGTAAAGATTTGCATGAATCTTGGGGGAACAGCTCGAAGGAACTTCTTGACGACGAAGATCTCCTCCACCATGTCACCTAATGGACGGATGCCGCTGACGATCGTCGTTAACTTCATGGAAAAGTCGTCTATTGACTCACCGTTCTTCATGCGGATAGCCTTGAACTCACTTTTCAAGGTCTGCGCCTTTGCTTCCTTGACACGTTCCACACCCACATGCATTGTTTGTAACGTCTCCCACGTTGCCTTTGTCGAGTCCTTCTCTGCCAACATCAGAAGAACGTCCTTCGGGACTGCTTGGTAGATGGCGGCAAGAGCCATCGTATCCTTACACTCCTCAACGTCGCCATGCTCGAGGGCGTCCCATACGCCTTGTGCCTGTAAGTTGACACGCATCTTGATCGACCCATGCTGCATAGTTACTCTTCGTGAGTCGAATACTGGAGCGTTAAACTCCTCTTCTTTCCGCTCTTTATCGTTGTCACTTCTCCTTCAGTGACTTTGGATGGTGACGACATTTTCTTCGACATTGTGGCTTTGATACCAAGGGTTGGATTTGACCATGATTGAGATATGTTGAAACAAGTAAAAGAGGCAAAAGAAAAACTTGAGGGGAGAGAGATTTCAGAACAAAGTAATCTTAAGCACTTAGCTTTTCAATTCTCTTTGTTTTTGTGTTGTGAAAAAAAAAAAAAAACCTACTTCTCATGAGTATTTATAGTGTTGAAAGCACTAACTTGCTTCCTAAACTTTCTCCTCAAGTTAAGTAGTAACAATCTTTCTCATTAAAGTACAAAACTTGGTCCACAATTTGGCAACTACTACCAATTCTCCTTTAGTTGCTCCATGAGTTAGTAACTTATTCTAATAAGTCTACTAACAATTCTCCTAAGGTTACTCCACAAGTTAGTAACTGGTTCTACTAAGTCTACTAACCATTAGGAAGTTAGTAACTGGTTCTACTAAGTCTACTAACCATTCTCCTAAAGTTACTATATAAGTTAGTAACTTGTTCAACTAGCTCAAGTTTATTAGCTCACAATGTCTTCACTTGGTAAAATGATAAGACATAGGATGAATATTTAAAAATACTTAAATAACCTGGTTTGGGAAATGCAATCATATATAACATGGTTCATGATTGGTGAGACCTATTTTTTTTTACGGTTTCTTGGGCTCTACTTTCTCTTTCGAGTAGGTTGTATGCAAAGTGCTTCTCTACACATGGTTCACACATGCATGCACGCACCTACTAGGCAAACCCATATATGTACCCTTAGGCCTAACTATGGGCGAGCACACGCTTATGCTGCTGAAAGCTAACGAAAAAGAAAGGAGAACATAAAATATTGTGGTATACACATCCGTACAGAACATCATAGCATAGAATAATATTACATAAAAAGGGTTAGAGATGATGTGATATTCATGCTTCATTCAACCTAGCATGGAGGCAGTAAGTCTACTTATAGACTCAACTTGGTTCTCTAAGCTCATAGTTTCTAGATTTAATTTCTGAAATTCCATTCATTCAAAAAACCAAACATAGAGCCTATTCATGGAAACAACATAACCTAACTATCCAAAATTAATCATAAACCGTGTTAAATATTAATACTTGTCCAAAACAATTGCTCTATGGGCCTTTTCTGGCCCTAATTACGCCAGAGATGATTCCAGGACTGTTCATAGTGGTTAAAACAAGCCAACCAAGCCAACGGAAGCCAAATCGAGCTTAGGAAGCCGACCAAAGATTTTTCTTTCCAACTATAACAGGATCTTTGGGTGGGACTTAAGCCCTTGTGTGTTTTATTTTTGCATTTATACTTTCTTTAAAACAAGAGGTTGCATTTAGTGGCCTCAAAAGTTTTCTTTCTCCCACCGTTCTTTGATTTCCTTAGATTTTGTCGTCCTCGGTATGATAGAGAGACGATTGTGGTGGTGGGTCTTCTTATATTCTTTGTGATCGACATGTGAACTCAGGGGAAGGGAGGTTAGGATATGTACCCTAGATCCTTCTAGGGGTTTCTCTTGCCATTCTTCTTTTCATATCTCGTCTATCCCTTCTCCTTCAACTGTTGCTCCTGAGTCTCAAGTGTTTTCTTCAATATGGAAGGTTAAAATTTCAAAGAAGGTTAAATTTGTTGTTCAGTACATGGGAGGATTAACCCCATGGATTGTGTCCAAAGACAATCTTCTTTCCTTTTATACCTACGATGGTGCAGTCTCTGCAAGAGGTATGAGGCATGAGGGTGATGTTTAATCATTTGCTGTGGGTGTCCACTTTGAGTCGTTGGATGAGTTGTGTTTGTGGGGCTTGCAATAGAGATAGGTGATTTATGTTGGAGGAGGTGCTTCCGAATCTTTCTTTTAAGAAGAGTGGCATTGTGTTGTGGTGTGCTTGCTTATTTGTTGTTTTGTAGGGTCTTTGGTTTGAGAGAAACCATAGGATACTCATTCTCAGATATCTTAAACAAAACCTCCAACCAAACAATTCCTTAATGCTTTCTTGTTTCTTGTAAGGTTCAAATTCCTACGCGTGGAACTCACTCCCTGCCCAAAATTAAAACTCGTGTAGTCTTCAAGCCTTGTGTCAAGTTGTTAACACACTACCCATCAATACACTCCCGTTGCATGATAATGTTCATTTGATCCCCTTAAATCTTTCTCCTTTTCTTTTGGCCAACCTAATCGTTAACAATCCATTGAGCAAGGACCAATATCTTTGTTATGGAACTTTTATCACAACAATTTTAGAAAAGCTCCCATGGAATTAAGCGACCTTACAAGCGGATTAGTCTGACTGAAATGCAGCCACCAGTTGTCTCCTGTTCTGCATTGATCTCCACCTTCGTAGAGGAAGAATTCTACAGTACTTTCATTTATTGCAAATGCATTCCTGCATACCGTTCGACATATTTATTGCTTTTCCTTGTACTCTAGATACAATTGCATCATTTATTCCCTATCCATAATAAGATTTCTTCATTCTGCAGAGAAAAAGTGAGGTTTTGACTCTTCATCTCCACCTTGCTTCCAACTCTGCCTTCATCACCTTCTATAACAGAAAATACATCTGGCTCTCCATGTAACCATTGGTTCCGTAGTTCAAATTTCTCTGGAAAGTTCTCTAGATAGATGACCAGTTCTTCAGTCTAAGATGTAAGATTCTCGTCTCTTCCTCAATTTGGAACGTCTAAGCTCGCAATCTTCCCTAGTTGCAAGCTAGTCATCTTGAAATAATCGTATGATTGCAGTAACCCGTTCTCATACAACCCTTTCAAAGTCCCTCTCAGGCACTATTCTGTTTTGTGGATCTGAGGATTATCAAGTCTGTCTACTCTGCCCCTCCTTTTCAGGTTTCAAACCCTCCCTTTCTAGTTTATTCTTGGAGCCTAAATAGAACCGTATTAATGAACTCCCTTCTAGATTTGATCCCGTCGAAATTTCTGATTTTTTTCGCTTTCATTTGGTAGAACCAGAGTGCCCTCACTGGGGGAATTTGTTTGCTTCTATTCGGTATTTTCAGACCGGTGAAGAGAATCTCTCTGGGGGTTTTGAGATAGAGGTCCTTTAAGGTGAACTTATTCCTTACTAGTTAGAAACACTTTTTCATACAGTCTTTCCAACTGACTATTCTCGGGCTTGAAAGCTTGAACGTGACGAATAGGCTGCACTTCTTTTTTATTACAATAGGTGGAGTCTCTTAAAGCCTTATACGAGCATGAGACTGAAGAAGTATGGCATTGGTGTGAGTGTTATTACCTCCAGATATTCACCCACTTTGATTAAAAACTTCCAGCATTCAACAATAATTTAGCCCTTGTTTTATTGAAAAACTTCGTTCTGAATACGTTTCCAATCGTTTTTTTCTATCATCATGTCTGCTGCCATCCTAAGATTTTTTTTTAAAAAAAAAAAAAAAAAAAAAAACTCTAGTTCATGCTATCCTCAAATTTCACTCTTCAAATTTCAGTTTCCCTGTCCTAATCCTACTGAGAAGACATTTCTGTTCTACCCCATAGTTACGTTTTTTGGTAGTTCGTGTTGCCTTCAAGGCCATTAGAAATGAAAAGAGAGATTGTACCAAAGCAAAGGCAAGAGAAATGTCATTTTACACTTGCGTTTTCTTTTAATATGAAGAAAGTGTAGTATCACTTTCGATGGGGGGGAATAGAAACCACGGGCACTACTGCATCATTACCCAGATGAGAGTGAAACCTGTAATCTGGAATTTGATCCTTTCGGTACTGTGTTGCTTCATTTCCACATTTGGTTTCATATGTTTTCTTTTCCTTTTTAGTAAAAGAAATTAGTTTCTTAAGAAGAAAAGGAAATAAGAAAAAGAACTGATTATTGCTAACGTGCACGGTGCTTTCTTCCTCATCTATTTCCATCCCCACACAAGACCTCATCTACTTTGATCATTTCAACTATCAAGAGCTAATATGGTTCCTGCACTTAGGTCTCTTCTGAAATAGTTTTGTTTCGGGATTTAGGTCTTCTCGATCCTTGTTGCATCTGTTAAACGTATTTTTACTTAACTAATGGTGCTTGGTATTTTTAGGAACCCATCATATTTGGATTCATACCTGGATCCTGCTTTCATGGCTTTAATCAAAGATCCAGATCAAAGACGCAAGGAACAATGGGAAAAAACTGCCCAGGCTCAGACTGGATGGAATTATGGTAGAATACTTGGATTTGGTGAACTTGGTGGAGCTCCTTTATCTGCTGCTGAAGAATACTCACTTCATTCAAGATACCTTGCTAAAGCAAATTCTCTTAATCTCTCTGAAATTGGGGAGATGAAAATTGTGTGAGTATCAGATTTCTTGTTTGTTTCCACATGCTTATGTATATAGTGGTCCGCTATCCGCCACCCACACAATCAAAAGGTGAGTTGGACGTGACTAAGAAAAATTAGACTACAAAATCATTAAATTTTGTATACTTGATAATAATTTATGCTTTATTTGTTGAAATCATGTATCCAAGGATAGTTTCTCTTGAAGTTCTGGATCTTCAGGGCAATTAGTACTGATTTTACTTCTCTTCGGTCTTCACTTTTCGAGGAGTTTTGTTACTTTTGCCTTTGTCTAGAACATTTCAAGGAAGTTGATTTCTTTGAGTCATTAGAGTTTATATGTGCCAGTGGTTAATCGAAGTGTATATCGAGAAATATCATGTTGTTTCCCATTACGGAATGTACGGAGTTCTAGTTAATGTCTCTCATGGGCATGTTAATTAGTCAGACGTTTGTGAAAACCAGTTGAGTCAGATGCTTATACTAAACATATTCGGTTTGTGCATTTCTGATTATATTACTAGCTTGAGATGCATATAATTATACTCTATTGCAGTTACCGTGGTGGAGTTGATAGCGAGGGTCGCCCTGTTATGGTCGTTGTGGGAGCACATTTTCTACTGCGTTGTCTCGACCTTGAGCGATTTGTGCTCTATGTTGTAAAGGTAATTTTGTTATAATCTTGAATAATATGTTCTTGTTTCTAATATTGAATTTTTATATCTTAAGACTTCTATGAAGCATGAGTGTAGTTACGGATGAAGAACACAATGAATTCTCAACAATGTTTCAAACATTGATATGAACCTTGTAGGATGATATTGATGAATTTTTGTTCTCCATCTTGCTTTGTTAAATGTTTATGCTGTGTTGTACACAATTTTCTTAAAATCATTTAAGACATAGTGAGGCAGTTGTGAGGTACCTCATGCCAACATAAAAAGAGGCTGAGACCTTGCTGATTAAGCCTTTGGTGGCGCGGTGGCTAATGGCATTACGGGGCCAAGCCATATATGAGGCTAGCCTCATTTTTCCTTGTTTATAATTTCATTCTCTTTGTTAGTTATTAACTTCTTAGTTGATTCGTATTTATTGATTTGTTGTAATATATTCCCGGAAATTGATTTCCTTATGGTGTTTCTTCTATTTAAGAAACCCCTTTATTCTCATGATAAAATAGATAAGAGTATTATTTCTCTTATATTTTGCACTGTCATAAGAGCATTAGGACCCTAAAAAACCAACCCTAAGCCTGGTCATATCTGCCGACCGTTCGCATTTACCGAACACCACCCTCACCCCGTCTAAGTTCGCATCTCCTTAGTCTCCAGAAATTCACAGTCTTCATTCATGGATTCGCATTCTTTGTTCCAGTTTCATACACGGATTTGGGTGGGTGTTTGCACATGAGTTTGTTGCTGTTTTACCGTCATCCACCACTGTTACAGCTTTCGTCGTTGACCCACCACAACTTGTACCATCGATGCCATTGCCTGCACGGTCGATAAGCTGCACCACCACTATTGGTCAGCCAAGTGTTGTGCTCTTCTGCCAAGTTTCTTGCCCTGATTTTTGGCTCTCTGGTATCTTTTTTGGCTGCTGTTTTTTTGTGGGTTTGGTTTCTGTTTCTTTTTCAGTTTTGAAATTTGTCGCATTAATACGCATCCCAGTCCTTTCCTTAGATGCATCGCTGTAGATCTCGTACCCTTCATAACTCTCGGGTACTGTGAGTACTGGGGCGGTTATCAACCTTTGTTTTAGGTCTTGGAAGCTTGCCTCACAAGCATTGTTCTACACAAACGGTACACCCTGTCCTAGCGAAGTCCTGCATCAACCTTTGATAGTATCCCCCAACCCTAGGAAACTTTGCACCTCTGTAACCGTAGTTGGGCGTTCTCACTTCGTAACCGCTTCAACTTTAGTGGGATCTACGGATATTTCGTCCTTTGACACCACGTGTCCTAAGAACGAAACTTGTTGCAGCCGGAACTAACATTTAGAGAACTTGACATACAACTTGTTCTCTCTCAGGGTGGTCAAGACTTTTCATAGGTGTTCTTGGTGTTCTAAGTTTGTTTTCGAGTATACAAGGATGTTGTGTATGAACACGATAACAAATGTGTCCAATAGTCTTTGAATACGCGGTTCATTAATTCCATAAATACAGCTGGGACATTGGTGAGACCAAACGACATCACTACGAATTCGTAGTGACTGTATCTTGTTCTAAACGCTGTCTTTGGTACGTCTTTTTCTTTAATTTTGATTTGGTGGTAACCTGACTGAAGGATCTTGAAGAATATCGTTTCCTCCCTGAGTTGATCAAATAGATTTTCTATACGAGGCAGAGGGTATTTGTTCTTTATTATTCTCTTATTTAGCTTTGTATAATTGATGCACAGACGTATCGACCCGTCTCTCTTTTTGCAAACAACACTGGTGCACCCCAAGGGGATATATTAGGTCGAATGAAACCTTTGTCCAATAGCTCTTGTAACTGCATCTTGAGTTCTTTTAACTCTGCTGGTGCCATACAATAGGGTGCTTTGAAGATGGGCCTAGTTCCCGGTTCGAGTTCTATAGCGAAGTCGACCACTCGGGAAGGGGGTATTCTTGGTAGGTCTTCTAGAAATACGTTGGGAACTCATTTACTACAGGTACGTTGTCTAAGGTCTTTTCATTTCCTCTTACATCTACGACACATGCTAATATAGCCCAATCGCCTTGTTGGACTAGTTTCTTTGCTTTCATCATCGAGACTACCTTTGGAGCAGTCCTGGAGCTTGTGCCTTTAAACTTAAAACTGGCTCTTGCCAGTGGTGAGAATACTACCTCCTTTTTGCGACAGTCTAGTAGCACGGTTTTCAGCTAACCAATTCATGCCTAGTATTATGTCGAAGTTAGTCATGTTAACTACTATCAGGTCTACACGTAGGTGATTACTTGGCCATTGTTTACTAGCCCAGATCTACCCTGTAGGGGTGCTTACAAACAATTCATGCAATAAGGGTTCTAATTCGTACCCTGCTTGTGCAACAAAAGGCATAGAGATAAATGAATGCATAGAGCCAGAATCAAATAAAGTAAAAGCGTAATGACCTAAAATGGGTAGTGTACCTGTCACCACGGTATTGGACCTCCTAGTGTCCCTGCTGGTGGATGCATACGCCTTAGCTTGAACCCGCGTTTGCACCAGTTGGTTCGCCTCTTCCACTGCTCTAGGCGGGTTAGGTTGGGTTTCTGCATCCCTGGTCATGCAGTTGATGGCGATGTGACCTTGACCACATCGATAACGGGCTCGCAATCCAGCCATGCACCTTCATCCATGCATTCTTCCACATATCCTGCACCCCGCTTCTCCCTTTCCTCTAGCCCCATCCTGATTTATGGGGTTTTGTTCAGTGCGTCAGGGAGGGTACCTCTCATTCTATCCTGGGGCAGGTCTATTGTTATAGTGGGGCCTACGTGCCGGTGGCTGGCATTCTGAGTCCCCTGAATCATGTGGACGCTTACGTCCAATTGTTACTGGTTGTTCCCGTCGCGCCTCATCCTTAGGTTTTTCTAAAGCCTTGGCAGTTCTCAAGGCTTCGTCATAGGTCTTTGGATCTATGGCTTCCACCGTGCAGCGGATCTTCAGTTCCAGACCCAAAACAAACTTCTCTGTTATCTTCTGCTCTGTGTCAACCAAGGACGGGACGAATCTCTTCAACCTCATGAACTCTCTGTTATCATATTTCTCTACGGTAAGTCCGCCCTGCACCAGATGGATGAACTCTTGTTGTCTCTTGAGCCGTTCCGCCTTAGGATAGTATTTAGGAAAGCTTTCTTAAATTGTTTCCACTCCATTGCTCTTCCGGCTGGGTTTAGACTCTCTTTGTTGTCCCGTCACCACACCTCAACATCCTTCTGTAGGACGAATGTCGCACAGGCGACCTTTTGGTTATCAGGGCACTCCATCGACTTAAATATGGTTTCTACAGATGAAATCCACATCTGGGTCTCTGTTGGATCCTTTGAGGTACCGGTGAATGGCCGAGGGTCATACCGCTTGAAGTCCTTTAGCCAGTTTGCCCCTCTAGTAGGTTGCGAAGGGTTAGCTTGGGTGTTGGCTTGTTGGTTGGCCTATTGGGTGACTATTAAGGTGTGGACTAAGGCCTGGAGCGACTCCCACATTAGTCTGGTGGTAGCGTCTACTCCAGCCACTGGGGGCGTAGCCTCAGGTGCGGCTTGCGGTGGCGGCACAGGTTGTGGTGGCAGTGCATTTTGTGGCGGCACAAATGGTTGCGGCGCAACTGATGGCGGCATACTTGTATCCGATCCAGTCTATAATCCTTCTCGAGGTGGGTTATCTTCAGCTTCACCTCGTCCTCTTTCTCTAATAGGAAGTTGTCTCCTTCCTCGTTTTCCTCTTCTGTGTCCTCTGGGCGGCATGATTCTAGAAACCAAGATAACGTTACTTAAGTCTAAGGCTCATTACTATGAACATATACTATGTCTCACATAATCTAAGTTCACATCATATTGCAATACAACACAGCAGCAAATCAATCAACATGCTACAATCACATCATACTCTTTAATTTAAAGCACATGGAATGTCAATTCAACACAACGACATTGTAAAAACAATTGCTTGCGAGTTTCTAAAAGCATTTACTTTTTCTTATCTTATGCAAGGCATACCTGAACGGTGACGGGACATCGACGTTTGGACCATAGAATGTGCCAAGCCTACATCGTAGGCTAGTCTACAGAATCTATAACCTAGAGCTTTGATACTAACTCTAACGACCCTAAATTTCTACTTACTCAGAGTCGCTACTAACGTCTCTTGCTCAACTCATATGCGGACTATAACTCCTAAAAGAAACATCATTTATACATAAATATCGAAAAATTTAAAACAACTTTATCTCTGGAAAACACAGCCATGGTCTTATATGTTTCATACGAAATACTAAAATTTAAAATAAAATAAATATCAAATAATTTAAAACATATGAAATGCAAGACAACCTATTCTAACCTAAGACTAGGAATTTAAATATGCAACTACCCTATGCATGTGCCATGGTCGCTACTCGCGAATGCTGTCGTCATCCGTACATGGGTGTCTTGCCTTTACCTGAACAATGATATGGCACACGGCCTGAGTATTTCGAAAAATACTCCGTAAGTGACCCCACTATAGGGGCCATGTTGAATGCAAACACATGCAAATATGCTTTAGGGACCTATCTCTATCTCATACTAGGGGTAGCCTCCAGTCCGGACTAACTTGGATCTGTAGGACTTTCCTACACACGACCCACACGTGCGAGTATGAATCCCCAGGTGGTTCGCACACCACTTGGTCCCATACTGGTCGAGCTATCTGTCTGAAGTTCGGAGGTCAGTAGACCCTTGGGTGTCATGCTCAACATGATCACTGTCATCATCATATTGTGCACGTCCGCACTCATCATCATAAAGGGAATGTATCCCGATGCTTATGTACACACAACATGCATGAGGTCCCTCGATTTATCGTCTTAAAACATCCTTCTGTCACAACCCTAGTCTCATCTCTATATTACATTTAGTAATACATGCTCATTAGATATTTATATTAATATCATTACTTAGGGTTGTGTCCTAGGTCGATCTCTCGACATGATGCAACATGACATTCATCATCATAAAGCATGTTTAATATGGAAAACATCATCATATTAAGGTAAACGTCATACATCATAATCATCATAATGTCATCAAATGCATCAAGCACAACAAGTACATCATATAACTCACTTATGTCAACATACATCATCACATAATTCACGTACGGTATCATACGACACAGTACATACATCATACATCATATAATTCACATACGTCATCATAAGTCATCATACAACATAGCTCATACAATTTTTCTAGCATATCCTATAGTAAGGCCACTTACTTGGTTGGCCTTAGCCAGCGTTCTCCCTAATTATTATACGTTAGCTCTCATTCCTCGAAAGCTGTTCCAAACGTCGCTGGAGTTCGTTTCCTATTTTTGGAAGAGATGAATGCCATTAAACAAAATTGCACATTGGACATAGTAGAATTACCCAATGATAAGAAACAATAGGACTCAAATGAGTGTTTTCTATAGCGTGCAAAGCCTATGGTAGCGTTGAAAGATTCAAAACTGGACTCGTTGCTTAAGGCTTCACTCAGACCTATGAAATTGGTTAGCAAGAAACGTTCTCTCTTGTTGCGATAATTAACTCTAATAGAGTTCTGCTATCTATTTCTATAAAAAAATTGGTCTCATGCTTTTCTCCATGGCGAACTTGGAGAAGAGGTATTTAAGAACTTACCACTAGGTTTTGGGGAAGATCTCTTCCTCTTGGATAAGGTGCGTAGAGACTTATAGAAACCTCTTCCTTTTGGGATGCTGTGGTGGTGGGTTCTCTTCTGAGTCCAACACCCTTTTGACATAGGATCATTGCGAATAAATACGGTCCTCAACCTTTCAAGTGGCTTAACGAACTTACAGGAGCCCTTGGAAAGACATTTCTTTCGAGCTCGCTTTCTCTTCTTTTGTCCAACAAGGTGGGGAGATGGGAAGGAAACGTACTTCGAAAGGATAATCGGGCAGGGGATAGGCCCCTCTATGCTATGTCTCCCTATTTATACCATTCATCTTTTATGGAAAATTATTATGTGGCTGAGGCTCTAGACCCTTGTGGGAGCTCTCCCTCCTCATTTGGTTTTCCTTGTCCGTTATCCGTAGGGAATCAACATACAGCTTAGCTCTCTTATCCTTGATTAGGGAGGTCATTCATAGATAGGGGAGGAGGTTTTTTTTGTTCGTGAGCCCTAACCCGACTAAGGGTTTTTCTTGTCGCGGGTTCTTTTAGTGCTTATATTGAGTCTCTCCTCAATCGGCTAGTTGATTTTTTTTTTTGTTACGAAAGGTGAAAGTTCCAAAACAGTTCAGTTCTTCGTTTTACTGATCATCCATGAAAGAAAAAATAAAGCTTGATTGACACCCTTTTCACAATCTCAATAAGAATCTTATTTTATTTTTTTTCATTTTTGACATAAAAAAATACTCCTATTCTTTCAAAAGTTACAATAATTTTGTTTAAGAGATTATAAGTTGCAATATTACCTTCACCCTTCATAAACGTTTCTAAATTGTTTGGAACGGTTTTAATTTTGTTCCGACCATAATTGAAGAATGTTAGGAAGTATTTTTGTTTGAACTATGGTCTGTGATATGTCTCCCTTCCATTCTTCACCTTTTGACTGCTCTCTTATGGATTCTAGCCAGATTGATAACTTTCTGGAGTTGATTTTGTAACTTGGGGGCTGTCCATCTTTAAGTTCTTGATTTTGCAAAGTTAGGAAAAAAATTACTCATGAAGTTGAAATTCTTCCGAATGTTGGACAGGAAATCGACATAGCAGTGACTGGAATAACGTGTCAATGACTAGAATGCTTGTTTGGATTAAGTTTATAAGACCGTCGTTTAATCTCAATGACTTTTTGTCCAACATTCTAATGGTTTTTTTTTGGTCTACATGCAATTTTGAAAGTTCAAGGATAGTACGACTACTTTCGAAAGAATAGGAGTGTTTTTGAAACAAAAGACGCAGTTACAATATATTTTTTTATAATTTAGCCTAGAATTTATTTCCGTTTGAGTTGATTTAATTCTAGTGAATGGTAGCACATGAAACTCTATTCAAAAGTTCATATTTTTGGTTCCTGTTTAACCTCTAATCTTGCCACTACTGTCTGGAACATTCTTTCTTTGCTGTTTTGTGCAGACATTTCTTAGTTTATGCACTTTTGTTGTATAAGAAGTTATGCACTTAAACTTGTTATTAAATTCCAGGAATTTGAGCCATTAATTCAGAGGCCTTACACCATTGTGTACTTCCACTCAGCAGCTTCTTTACAGCCGTAAGTGATCTTCTATATTTATTTTAAAACGAGCAAGAAACTTTGGGATGAACATGGAAAGCAGCTTAGACAATGAGAAGTTCGTAATGAAACCTAGTTGTGTGCATTTTGTTCTTGATTTTGTATCGGTTAGTAGTCTACCGGAAGTTATTCAACTTTTCTGACCGTGTTTACGGTTTCTATACCGTTAGTCGGCCCGACATGGGATGGATGAAGAGATTGCAGCAGATTCTTGGCCGAAAACACCAGCGTAACCTCCAGGTATATGAACTCCGACTTATTCCAATGCTAGATGTGTTCTGAGCAAAAATCCCCCTTTATCTTTCTCATCTATCGGTATGATTTTTGTAGGCAATATATGTTCTTCACCCGACTTTCGGATTGAAAGCTGCAGTATTTGCCATGCAATTGCTTGTGGATAACGCGGTAGGCTTAAAGTTCGTTTAAATATCTCGTTTCTAAGCATCGAATGAGATTAAAAGTGTTAGTAACCTGAAGTAACTGATATCCAGGTATGGAACAAGGTTGTGTATATCGATCGACTACTGCAGCTGTTCAAATATGTTCCACGAGAGCAGTTGACCATCCCGGACTTCGTATTTCAGTGAGTTCATGCTTCCTTTGAACTTTTATTCAACAATGATTACAAATCCTCGAATTTAGAGCATAGATGCCTTGTATGTCTCTGTATTATGCCTTTGTTCGTGAAGTTTACGATTAAGATAAATTAGACGCGTCGTTCTATGATAAATGCAGGCACGACTTAGAAGTAAATGGAGGGAAGGGCCTAATTGTGGACCCTCGAACAAAATACGTATATCATCGACCTTAACATCGAAGAAAATGTATGGATAATGATCCATTAGCGTCTCGTATTAGGTGGGTAGGTGTTGGTTATGTTTGTAGAAATGGCATACCATTTTCTTTCATCCTTATTTTTGTTGAAGTATTGGTTCTTTTGTGTAATGAAGATGAAA

mRNA sequence

GGTTTGTTTCCGGGATTTCTTTGACGAATGTAGGTTTGCAAGGACTTGGTGGCGTTGTTAAGATGTACAGGACAGTGGCTACATCTGCGGCCACAACCACAGCAACAACAACAACAGATAGCGTTGATTATGTTGTAAATTTGGATCAAATTCCACGGTGGAGTGATGCAGAACATAGGTCTTCATTGGAGTTCGTCAACGAAGATCCTTCCTTCTCTAATTCGTATTTTCCTGATCCTCTGACGTCTCCATCTGACGCGGAGGGTGGTAATTATGGAGTGGTTTCGAGATTTCCCGTCGATCATGAAATTAATTCCAAAATATATCTGTGGCGGGGGAATCCTTGGAATCTCGAGGTAGATGCTGTCGTGAATTCTACAAACGAGAACTTGGATGAAGCACACAGTAGCCCTGGTTTGCATGCTGCAGCAGGACCTGGTCTATCAGAAGAATGTGGAACACTTGGTGGTTGTCGCACAGGAATGGCAAAAGTCACTAATGCTTATGACCTTCCTGCTAGGAAGGTAATGCATACTGTAGGTCCAAAATATGCTGTCAAATACCATACTGCTGCAGAGAATGCTCTGAGTCATTGCTACCGTTCTTGCCTTGAACTTCTCATCGAAAATGGGCTTCAGAGCATTGCTATGGGTTGTATATATACAGAGGGAAAAAACTATCCTCGTGAACCTGCTGCACACGTAGCAATAAGAACTGTGCGTCGTTTGATTGAGAAGCAGAAAGGTAAAATTAAAGCTGTTGTCTTTTGTACAACATCATCAATCGATACTGAGATATACAAAAGGTTGCTTCCTCTGTACTTTCCCCGTGATAAACACGAAGAAGAAGTTGCATTGTCAAAGCTTCCTGCAGATGTTGGAGATGAGAATGGTGAGACTATTATAGATGAGCGGAAAATCAGGATAAACACTTTGCCCAAAAAGAATGTTCTGAAACCTCCCCAAGTTCCCGATGATCCTCCTGTCAGTCATGTAAGGTTGACACAGAGGAACCCATCATATTTGGATTCATACCTGGATCCTGCTTTCATGGCTTTAATCAAAGATCCAGATCAAAGACGCAAGGAACAATGGGAAAAAACTGCCCAGGCTCAGACTGGATGGAATTATGGTAGAATACTTGGATTTGGTGAACTTGGTGGAGCTCCTTTATCTGCTGCTGAAGAATACTCACTTCATTCAAGATACCTTGCTAAAGCAAATTCTCTTAATCTCTCTGAAATTGGGGAGATGAAAATTGTTTACCGTGGTGGAGTTGATAGCGAGGGTCGCCCTGTTATGGTCGTTGTGGGAGCACATTTTCTACTGCGTTGTCTCGACCTTGAGCGATTTGTGCTCTATGTTGTAAAGGAATTTGAGCCATTAATTCAGAGGCCTTACACCATTGTGTACTTCCACTCAGCAGCTTCTTTACAGCCTCGGCCCGACATGGGATGGATGAAGAGATTGCAGCAGATTCTTGGCCGAAAACACCAGCGTAACCTCCAGGCAATATATGTTCTTCACCCGACTTTCGGATTGAAAGCTGCAGTATTTGCCATGCAATTGCTTGTGGATAACGCGGTATGGAACAAGGTTGTGTATATCGATCGACTACTGCAGCTGTTCAAATATGTTCCACGAGAGCAGTTGACCATCCCGGACTTCGTATTTCAGCACGACTTAGAAGTAAATGGAGGGAAGGGCCTAATTGTGGACCCTCGAACAAAATACTATTGGTTCTTTTGTGTAATGAAGATGAAA

Coding sequence (CDS)

ATGTACAGGACAGTGGCTACATCTGCGGCCACAACCACAGCAACAACAACAACAGATAGCGTTGATTATGTTGTAAATTTGGATCAAATTCCACGGTGGAGTGATGCAGAACATAGGTCTTCATTGGAGTTCGTCAACGAAGATCCTTCCTTCTCTAATTCGTATTTTCCTGATCCTCTGACGTCTCCATCTGACGCGGAGGGTGGTAATTATGGAGTGGTTTCGAGATTTCCCGTCGATCATGAAATTAATTCCAAAATATATCTGTGGCGGGGGAATCCTTGGAATCTCGAGGTAGATGCTGTCGTGAATTCTACAAACGAGAACTTGGATGAAGCACACAGTAGCCCTGGTTTGCATGCTGCAGCAGGACCTGGTCTATCAGAAGAATGTGGAACACTTGGTGGTTGTCGCACAGGAATGGCAAAAGTCACTAATGCTTATGACCTTCCTGCTAGGAAGGTAATGCATACTGTAGGTCCAAAATATGCTGTCAAATACCATACTGCTGCAGAGAATGCTCTGAGTCATTGCTACCGTTCTTGCCTTGAACTTCTCATCGAAAATGGGCTTCAGAGCATTGCTATGGGTTGTATATATACAGAGGGAAAAAACTATCCTCGTGAACCTGCTGCACACGTAGCAATAAGAACTGTGCGTCGTTTGATTGAGAAGCAGAAAGGTAAAATTAAAGCTGTTGTCTTTTGTACAACATCATCAATCGATACTGAGATATACAAAAGGTTGCTTCCTCTGTACTTTCCCCGTGATAAACACGAAGAAGAAGTTGCATTGTCAAAGCTTCCTGCAGATGTTGGAGATGAGAATGGTGAGACTATTATAGATGAGCGGAAAATCAGGATAAACACTTTGCCCAAAAAGAATGTTCTGAAACCTCCCCAAGTTCCCGATGATCCTCCTGTCAGTCATGTAAGGTTGACACAGAGGAACCCATCATATTTGGATTCATACCTGGATCCTGCTTTCATGGCTTTAATCAAAGATCCAGATCAAAGACGCAAGGAACAATGGGAAAAAACTGCCCAGGCTCAGACTGGATGGAATTATGGTAGAATACTTGGATTTGGTGAACTTGGTGGAGCTCCTTTATCTGCTGCTGAAGAATACTCACTTCATTCAAGATACCTTGCTAAAGCAAATTCTCTTAATCTCTCTGAAATTGGGGAGATGAAAATTGTTTACCGTGGTGGAGTTGATAGCGAGGGTCGCCCTGTTATGGTCGTTGTGGGAGCACATTTTCTACTGCGTTGTCTCGACCTTGAGCGATTTGTGCTCTATGTTGTAAAGGAATTTGAGCCATTAATTCAGAGGCCTTACACCATTGTGTACTTCCACTCAGCAGCTTCTTTACAGCCTCGGCCCGACATGGGATGGATGAAGAGATTGCAGCAGATTCTTGGCCGAAAACACCAGCGTAACCTCCAGGCAATATATGTTCTTCACCCGACTTTCGGATTGAAAGCTGCAGTATTTGCCATGCAATTGCTTGTGGATAACGCGGTATGGAACAAGGTTGTGTATATCGATCGACTACTGCAGCTGTTCAAATATGTTCCACGAGAGCAGTTGACCATCCCGGACTTCGTATTTCAGCACGACTTAGAAGTAAATGGAGGGAAGGGCCTAATTGTGGACCCTCGAACAAAATACTATTGGTTCTTTTGTGTAATGAAGATGAAA

Protein sequence

MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPLTSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLHAAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYRSCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSSIDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPPQVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRILGFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHFLLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRNLQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHDLEVNGGKGLIVDPRTKYYWFFCVMKMK
BLAST of Cp4.1LG07g05460 vs. Swiss-Prot
Match: GDAP2_NEMVE (Protein GDAP2 homolog OS=Nematostella vectensis GN=gdap2 PE=3 SV=1)

HSP 1 Score: 322.8 bits (826), Expect = 7.6e-87
Identity = 181/509 (35.56%), Postives = 285/509 (55.99%), Query Frame = 1

Query: 40  SSLEFVNEDPSFSNSYFPDPLTSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEV 99
           ++L  +N  P +S +  P P   P   E  +   +S FPVD EIN+K+ LW G+   L  
Sbjct: 8   ANLVDINSLPKWSTT--PVPNYEPGSNESQSSSFLSPFPVDEEINAKVVLWNGDITKLAA 67

Query: 100 DAVVNSTNENL-DEAHSSPGLHAAAGPGLSEECGT-LGGCRTGMAKVTNAYDLPARKVMH 159
           DA+VN+TNE+L D    S  +H AAGP L +EC   L GCRTG AK++  Y+LPAR V+H
Sbjct: 68  DAIVNTTNESLSDRGALSERVHRAAGPELMQECRQQLLGCRTGEAKISEGYNLPARYVIH 127

Query: 160 TVGPKYAVKYHTAAENALSHCYRSCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIR 219
           TVGP+Y  KY TAAE+AL  CYR+ + L+ EN + +I +  + T  + YP E  AH+A+R
Sbjct: 128 TVGPRYNTKYKTAAESALFSCYRNTMRLVRENKISTIGVCVVNTTKRGYPPEDGAHIALR 187

Query: 220 TVRRLIEKQKGKIKAVVFCTTSSIDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENG 279
           TVRR +EK    +  V F    + +  +Y +++P+YFPRDK EE  AL+ +P D+G+E G
Sbjct: 188 TVRRFLEKYGSAVDTVAFVVEGA-EAVVYAKVMPIYFPRDKLEEAHALTLMPDDIGNEEG 247

Query: 280 ETIIDERKIRINTLPKKNVLKPPQVPDDPPVSHVRLTQRNPSYLDSYLDP-AFMALIKDP 339
           E II ER+IRI       V KPP +     V      + +    + ++   AF  +  D 
Sbjct: 248 EPIIPERQIRI-------VPKPPSLQHGEDVEEAEEAEGHLDMTELHVGKHAFAVMAGDH 307

Query: 340 DQRRKEQWEKTAQAQTGWNYGRILGFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGE 399
           DQ  K++  ++                      +   E+  ++ R+L +A + N ++   
Sbjct: 308 DQMTKQRAHRSDDG-------------------MKVVEQQRVYQRWLRRARTENFADFSR 367

Query: 400 MKIVYRGGVDSEGRPVMVVVGAHFLLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAAS 459
            KI+Y+ GVD  GRPV+V V  HF  +  DL + V Y +   + ++ R Y +VYFH+ ++
Sbjct: 368 QKILYQSGVDFLGRPVVVFVARHFTAQNTDLGKAVAYFISVLDRIVNRDYVVVYFHTHST 427

Query: 460 LQPRPDMGWMKRLQQILGRKHQRNLQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYID 519
            + +P M ++K L  I+  K++RNL+A Y++HPT   +   +       ++V  KV ++ 
Sbjct: 428 EENQPPMSFLKELYHIVDNKYRRNLKAFYIVHPTVWARIVTWFFTTFTASSVKEKVHFLS 487

Query: 520 RLLQLFKYVPREQLTIPDFVFQHDLEVNG 546
            +  L+ ++  +QL IP +V ++D++ NG
Sbjct: 488 GVQYLYDWINPDQLDIPAYVLEYDMKENG 487

BLAST of Cp4.1LG07g05460 vs. Swiss-Prot
Match: GDAP2_XENTR (Ganglioside-induced differentiation-associated protein 2 OS=Xenopus tropicalis PE=2 SV=1)

HSP 1 Score: 301.2 bits (770), Expect = 2.4e-80
Identity = 185/522 (35.44%), Postives = 285/522 (54.60%), Query Frame = 1

Query: 25  VNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPLTSPSDAEGGNYGVVSRFPVDHEIN 84
           V+ D +P W+D       +               P     DA  G  G+ S FP  ++IN
Sbjct: 11  VDADALPCWADVRDGEGEDV--------------PDGGRKDAPHG--GLHSPFPYRNDIN 70

Query: 85  SKIYLWRGNPWNLEVDAVVNSTNENL-DEAHSSPGLHAAAGPGLSEECGTLGGCRTGMAK 144
            K+ LWRG+   L   A+VN++NE L D+   S  +   +GP LSEE   L GCRTG AK
Sbjct: 71  KKVILWRGDVALLSCTALVNTSNETLTDKNPVSDSIFRYSGPELSEEMQKLKGCRTGEAK 130

Query: 145 VTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYRSCLELLIENGLQSIAMGCIYTEG 204
           +T  ++L AR ++HTVGPKY  KY TAAE++L  CYR+ L+L  E G+ S+    I T+ 
Sbjct: 131 LTKGFNLAARYIIHTVGPKYKTKYRTAAESSLYSCYRNVLQLAKEQGMASVGFCVIATQK 190

Query: 205 KNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSSIDTEIYKRLLPLYFPRDKHEEEV 264
           + YP E + H+A+RTVRR +E     ++ VVF  T   +   Y+RLLPLYFPR   EE+ 
Sbjct: 191 RCYPPEDSTHIALRTVRRFLEAHGAALEKVVFAVTEQ-EEGTYRRLLPLYFPRSLEEEQR 250

Query: 265 ALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPPQVPDDPPVSHVRLTQRNPSYLDS 324
           ++  LP D+G+  GE ++ ER+IRI+        KP    DD   S      ++ S + S
Sbjct: 251 SIPFLPQDIGNAEGEPVVPERQIRISE-------KPGGQDDD---SEEEGLVKDLSVIGS 310

Query: 325 YLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRILGFGELGGAPLSAAEEYSLHSRYL 384
           +   AF  +  D D++R                 R+   G+L GA +    + + ++R+L
Sbjct: 311 H---AFARMEGDVDKQR-----------------RLALQGQLSGAAMQKQHQRN-YNRWL 370

Query: 385 AKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHFLLRCLDLERFVLYVVKEFEPLIQ 444
           ++A + +LS+I  +K +Y+ GVD+ GR VMVVVG +  +  +D+E+ +LY +   + +  
Sbjct: 371 SRARTEDLSDIAALKALYQSGVDNCGRTVMVVVGRNIPVLLIDMEKALLYFIHMMDHVAA 430

Query: 445 RPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRNLQAIYVLHPTFGLKAAVFAMQLL 504
           + Y +VYFH+       PD  ++K +  I+  K+++NL+A+Y +HPTF  K + +     
Sbjct: 431 KEYVLVYFHTLTGEHNHPDSDFLKNMYDIVDVKYKKNLKALYFVHPTFRSKVSSWFFTTF 484

Query: 505 VDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHDLEVNG 546
             + + +KV  ++ L QLF  VP EQ+ IP FV  +D   NG
Sbjct: 491 TVSGLKDKVHQVESLHQLFSAVPPEQIEIPPFVLDYDARENG 484

BLAST of Cp4.1LG07g05460 vs. Swiss-Prot
Match: GDAP2_XENLA (Ganglioside-induced differentiation-associated protein 2 OS=Xenopus laevis PE=2 SV=1)

HSP 1 Score: 290.8 bits (743), Expect = 3.2e-77
Identity = 180/522 (34.48%), Postives = 285/522 (54.60%), Query Frame = 1

Query: 25  VNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPLTSPSDAEGGNYGVVSRFPVDHEIN 84
           V+ D +P W+D  +        + P       PD         GG++   S FP   +IN
Sbjct: 11  VDADILPCWADVSNEEG----EDVPDGGRKDVPD---------GGSH---SPFPYRKDIN 70

Query: 85  SKIYLWRGNPWNLEVDAVVNSTNENL-DEAHSSPGLHAAAGPGLSEECGTLGGCRTGMAK 144
            K+ LW+G+   L   A+VN++NE L D+   S  +   +GP L EE   L GCRTG AK
Sbjct: 71  EKVILWKGDVALLNCTALVNTSNETLTDKNPVSDSIFRYSGPELLEEMQKLKGCRTGEAK 130

Query: 145 VTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYRSCLELLIENGLQSIAMGCIYTEG 204
           +T  ++L AR ++HTVGPKY  KY TAAE++L  CYR+ L+L  E G+ S+    I T+ 
Sbjct: 131 LTKGFNLAARYIIHTVGPKYKTKYRTAAESSLYSCYRNVLQLAKEQGMASVGFCVITTQK 190

Query: 205 KNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSSIDTEIYKRLLPLYFPRDKHEEEV 264
           + YP + A H+A+RTVRR +E     ++ VVF  T   +   Y+RLLPLYFPR   EE+ 
Sbjct: 191 RCYPLDDATHIALRTVRRFLEVHGQALEKVVFAVTEE-EEGTYRRLLPLYFPRSLEEEQR 250

Query: 265 ALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPPQVPDDPPVSHVRLTQRNPSYLDS 324
           ++  LP D+G+ +GE ++ ER+IRI+        KP    +D   S      ++ S + S
Sbjct: 251 SILLLPQDIGNSDGEPVVPERQIRISE-------KPGVQEED---SEEEGLVKDLSVIGS 310

Query: 325 YLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRILGFGELGGAPLSAAEEYSLHSRYL 384
           +   AF  +  D D++R                 R+   G+L GA +    + + ++R+L
Sbjct: 311 H---AFARMEGDVDKQR-----------------RLALQGQLSGAAMQKQHQRN-YNRWL 370

Query: 385 AKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHFLLRCLDLERFVLYVVKEFEPLIQ 444
           ++A + +LS+I  +K +Y+ GVD+ GR VMVVVG +  +  +D+E+ +LY +   + +  
Sbjct: 371 SRARTEDLSDIAALKALYQSGVDNCGRSVMVVVGRNIPVLLIDMEKALLYFIHMMDHVTA 430

Query: 445 RPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRNLQAIYVLHPTFGLKAAVFAMQLL 504
           + Y +VYFH+        D  ++K +  I+  K+++NL+A+Y +HPTF  K + +     
Sbjct: 431 KDYVLVYFHTLTGEHNHLDSDFLKNMYDIIDVKYKKNLKALYFVHPTFRSKVSTWFFTTF 484

Query: 505 VDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHDLEVNG 546
             + + +KV  ++ L QLF  +P EQ+ IP FV  +D   NG
Sbjct: 491 TVSGLKDKVHQVESLHQLFTAIPPEQIEIPPFVLDYDARENG 484

BLAST of Cp4.1LG07g05460 vs. Swiss-Prot
Match: GDAP2_HUMAN (Ganglioside-induced differentiation-associated protein 2 OS=Homo sapiens GN=GDAP2 PE=1 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 4.2e-77
Identity = 176/522 (33.72%), Postives = 286/522 (54.79%), Query Frame = 1

Query: 25  VNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPLTSPSDAEGGNYGVVSRFPVDHEIN 84
           V++D +P W D    S  + +N   + +  +  D + SP             F  + ++N
Sbjct: 11  VDVDTLPSWGD----SCQDELNSSDTTAEIFQEDTVRSP-------------FLYNKDVN 70

Query: 85  SKIYLWRGNPWNLEVDAVVNSTNENL-DEAHSSPGLHAAAGPGLSEECGTLGGCRTGMAK 144
            K+ LW+G+   L   A+VN++NE+L D+   S  +   AGP L E+   L GCRTG AK
Sbjct: 71  GKVVLWKGDVALLNCTAIVNTSNESLTDKNPVSESIFMLAGPDLKEDLQKLKGCRTGEAK 130

Query: 145 VTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYRSCLELLIENGLQSIAMGCIYTEG 204
           +T  ++L AR ++HTVGPKY  +Y TAAE++L  CYR+ L+L  E  + S+    I +  
Sbjct: 131 LTKGFNLAARFIIHTVGPKYKSRYRTAAESSLYSCYRNVLQLAKEQSMSSVGFCVINSAK 190

Query: 205 KNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSSIDTEIYKRLLPLYFPRDKHEEEV 264
           + YP E A H+A+RTVRR +E     I+ VVF   S ++   Y++LLPLYFPR   EE  
Sbjct: 191 RGYPLEDATHIALRTVRRFLEIHGETIEKVVFAV-SDLEEGTYQKLLPLYFPRSLKEENR 250

Query: 265 ALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPPQVPDDPPVSHVRLTQRNPSYLDS 324
           +L  LPAD+G+  GE ++ ER+IRI+        + P  P+D           + S++ S
Sbjct: 251 SLPYLPADIGNAEGEPVVPERQIRIS--------EKPGAPEDNQEEEDEGLGVDLSFIGS 310

Query: 325 YLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRILGFGELGGAPLSAAEEYSLHSRYL 384
           +   AF  +  D D++RK                 ++  G+L  A L    + + ++R+L
Sbjct: 311 H---AFARMEGDIDKQRK-----------------LILQGQLSEAALQKQHQRN-YNRWL 370

Query: 385 AKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHFLLRCLDLERFVLYVVKEFEPLIQ 444
            +A S +LS+I  +K +Y+ GVD+ GR VMVVVG +  +  +D+++ +LY +   + +  
Sbjct: 371 CQARSEDLSDIASLKALYQTGVDNCGRTVMVVVGRNIPVTLIDMDKALLYFIHVMDHIAV 430

Query: 445 RPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRNLQAIYVLHPTFGLKAAVFAMQLL 504
           + Y +VYFH+  S     D  ++K+L  ++  K++RNL+A+Y +HPTF  K + +     
Sbjct: 431 KEYVLVYFHTLTSEYNHLDSDFLKKLYDVVDVKYKRNLKAVYFVHPTFRSKVSTWFFTTF 485

Query: 505 VDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHDLEVNG 546
             + + +K+ ++D L QLF  +  EQ+  P FV ++D   NG
Sbjct: 491 SVSGLKDKIHHVDSLHQLFSAISPEQIDFPPFVLEYDARENG 485

BLAST of Cp4.1LG07g05460 vs. Swiss-Prot
Match: GDAP2_DANRE (Ganglioside-induced differentiation-associated protein 2 OS=Danio rerio GN=gdap2 PE=2 SV=1)

HSP 1 Score: 289.7 bits (740), Expect = 7.1e-77
Identity = 180/482 (37.34%), Postives = 263/482 (54.56%), Query Frame = 1

Query: 73  VVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENL-DEAHSSPGLHAAAGPGLSEEC 132
           V S F    +IN+KI L+ G+   L   A+VN++NE L D+   S  +H  AGP L +E 
Sbjct: 53  VNSPFTFRQDINNKIVLFNGDVALLNCTAIVNTSNETLTDKNPISDSIHRHAGPELRDEL 112

Query: 133 GTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYRSCLELLIENGL 192
             L GCRTG AK+T  +DL AR ++HTVGPKY  KY TAAE++L  CYR+ L+L  E+ +
Sbjct: 113 LKLKGCRTGEAKMTEGFDLAARFIIHTVGPKYKAKYRTAAESSLYSCYRNVLQLAKEHAM 172

Query: 193 QSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSSIDTEIYKRLLP 252
            S+    I T  + YP E A H+A+RTVRR +E     I+ +VF   S ++  +Y++LLP
Sbjct: 173 VSVGFCVISTVKRAYPVEDATHIALRTVRRFLENHGENIETLVFAV-SDVEEPVYRKLLP 232

Query: 253 LYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPPQVPDDPPVSHV 312
           LY+PR K EE ++L  LPAD+G+  GE ++ ER+IRI         KP  + DDP     
Sbjct: 233 LYYPRSKQEERISLPLLPADIGNSEGEPVVPERQIRI-------AEKPVNLEDDP----- 292

Query: 313 RLTQRNPSYLDSYL----DPAFMALIKDPDQRRK-----EQWEKTAQAQTGWNYGRILGF 372
                    LDS L      AF  +  D D++RK     +  E   Q Q   NY      
Sbjct: 293 -----EDDSLDSDLGLVGSHAFARMEGDVDKQRKLILQGQMSEVAQQKQHQRNY------ 352

Query: 373 GELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHFLL 432
                            +R+L KA + +LS+I  +K +Y+ GVD  GR VMVVVG +  +
Sbjct: 353 -----------------NRWLCKARAEDLSDIAALKALYQTGVDLCGRTVMVVVGRNIPV 412

Query: 433 RCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRNLQ 492
             +D+E+ +LY +   + +  + Y +VYFH+        D  ++K+L  I+  K ++NL+
Sbjct: 413 MLIDMEKALLYFIHVMDHITVKEYVMVYFHTLTGEHNHLDTDFLKKLYDIVDAKFKKNLR 472

Query: 493 AIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHDLE 545
           A Y +HPTF  K + +       + + +KV +I+ L QLF  V  EQ+ IP FV ++D  
Sbjct: 473 AFYFVHPTFRSKVSTWFFTTFSVSGLKDKVHHIENLQQLFTCVLPEQIDIPPFVLEYDSR 493

BLAST of Cp4.1LG07g05460 vs. TrEMBL
Match: A0A0A0KS03_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G487740 PE=4 SV=1)

HSP 1 Score: 1060.1 bits (2740), Expect = 9.7e-307
Identity = 526/559 (94.10%), Postives = 538/559 (96.24%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MYRTVATSAAT   TTTTDS+DYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL
Sbjct: 1   MYRTVATSAAT---TTTTDSLDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
           TSPSDAEGG  GVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH
Sbjct: 61  TSPSDAEGGTNGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
           AAAGPGL +ECGTLGGCRTGMAKVTNAYDLPARKV+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 121 AAAGPGLLDECGTLGGCRTGMAKVTNAYDLPARKVIHTVGPKYAVKYHTAAENALSHCYR 180

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLIENGLQSIAMGCIYTE KNYPREPAAHVAIRTVRR +EKQ+ KIKAVVFCTTSS
Sbjct: 181 SCLELLIENGLQSIAMGCIYTEAKNYPREPAAHVAIRTVRRFLEKQRDKIKAVVFCTTSS 240

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
           +DTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRI +LPKKNV KPP
Sbjct: 241 VDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRIKSLPKKNVPKPP 300

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
           QV +D PVS VRLT+RN SYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL
Sbjct: 301 QVLNDTPVSDVRLTRRNSSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYLAKANSLNLSEI EMKIVYRGGVDSEGRPVMVVVGAHF
Sbjct: 361 GFGDLGGPPLSAAEEYSLHSRYLAKANSLNLSEIAEMKIVYRGGVDSEGRPVMVVVGAHF 420

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERFVLYVVKEFEPLIQ+PYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN
Sbjct: 421 LLRCLDLERFVLYVVKEFEPLIQKPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           L AIYVLHPTFGLKAAV AMQLLVDN VWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD
Sbjct: 481 LHAIYVLHPTFGLKAAVLAMQLLVDNVVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGLIVDPRTKY +
Sbjct: 541 LEVNGGKGLIVDPRTKYVY 556

BLAST of Cp4.1LG07g05460 vs. TrEMBL
Match: W9RFR4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_017239 PE=4 SV=1)

HSP 1 Score: 979.5 bits (2531), Expect = 1.7e-282
Identity = 480/559 (85.87%), Postives = 516/559 (92.31%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MYRT AT+A  T   + TD+VDYVV+LDQ+PRWSD+E+RSSLEF NEDPSFS SYFPDPL
Sbjct: 1   MYRTAATAATMTRGGSPTDNVDYVVSLDQVPRWSDSEYRSSLEFGNEDPSFSTSYFPDPL 60

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
           TS S AE  +  +VSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNE+LDEAHSSPGLH
Sbjct: 61  TSSSGAESSSNDMVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNESLDEAHSSPGLH 120

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
           AAAGPGL+EEC TLGGCRTGMAKVTNAYDLPAR+V+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 121 AAAGPGLAEECATLGGCRTGMAKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYR 180

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLIENGLQSIAMGCIYTE KNYPREPAAHVAIRTVRR +EKQK KIKAVVFCTT+S
Sbjct: 181 SCLELLIENGLQSIAMGCIYTETKNYPREPAAHVAIRTVRRFLEKQKDKIKAVVFCTTTS 240

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
            DTEIYKRLLPLYFPRDKHEEE+A SKLPADVGDENGETIIDERKIRI  LPKK + KP 
Sbjct: 241 SDTEIYKRLLPLYFPRDKHEEEIAFSKLPADVGDENGETIIDERKIRIKPLPKKTIPKPT 300

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
           + P D PVS V L +RN SYLD+YLDPAFM+LIKDPDQRRKEQWEKTAQA++G+N  ++L
Sbjct: 301 EAPIDLPVSDVSLVRRNSSYLDTYLDPAFMSLIKDPDQRRKEQWEKTAQARSGFNCAKLL 360

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYLAKANSLNLSEI EMKIVYRGGVDSEGRPVMVVVGAHF
Sbjct: 361 GFGDLGGPPLSAAEEYSLHSRYLAKANSLNLSEIAEMKIVYRGGVDSEGRPVMVVVGAHF 420

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERFVLYV+KEFEPLIQ+PYTIVYFHSAASLQ +PD+GWM+RLQQILGRKHQRN
Sbjct: 421 LLRCLDLERFVLYVIKEFEPLIQKPYTIVYFHSAASLQIQPDLGWMRRLQQILGRKHQRN 480

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           L AIYVLHPTFGLKAAVFA+QL VDN VW KVVY+DRLLQLF+YVPREQLTIPDFVFQHD
Sbjct: 481 LHAIYVLHPTFGLKAAVFALQLFVDNVVWKKVVYVDRLLQLFRYVPREQLTIPDFVFQHD 540

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGLIVDPRTKY +
Sbjct: 541 LEVNGGKGLIVDPRTKYVY 559

BLAST of Cp4.1LG07g05460 vs. TrEMBL
Match: F6HEZ4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00870 PE=4 SV=1)

HSP 1 Score: 967.2 bits (2499), Expect = 8.6e-279
Identity = 474/559 (84.79%), Postives = 511/559 (91.41%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MYR VAT  A T     T+S DYVV L+Q+PRWSDAEH+SSLE+ NED SF  SYFPDPL
Sbjct: 1   MYRPVAT--APTQGGIATESADYVVELNQVPRWSDAEHKSSLEYDNEDSSFPTSYFPDPL 60

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
           TS S+AE G  G++SRFPV+HEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH
Sbjct: 61  TSTSEAESGGNGMMSRFPVNHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
           AAAGPGL+EEC TLGGCRTGMAKVTNAYDLPAR+V+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 121 AAAGPGLAEECATLGGCRTGMAKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYR 180

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLIENGLQSIAMGCIYTE KNYPREPAAHVAIRTVRR +EKQK KI AVVFCTT++
Sbjct: 181 SCLELLIENGLQSIAMGCIYTEAKNYPREPAAHVAIRTVRRFLEKQKDKITAVVFCTTTA 240

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
            DTEIYKRLLPLYFPRDKHEEEVA+SKLPADVGDENGETIIDERKIRI  LPKK   KPP
Sbjct: 241 NDTEIYKRLLPLYFPRDKHEEEVAMSKLPADVGDENGETIIDERKIRIKPLPKKTAPKPP 300

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
           + P D PVS V L +RN SYLDSYLDPAFM+LIKDPDQRRKEQWEKTAQAQ+GWN  ++L
Sbjct: 301 KAPVDLPVSDVGLIRRNSSYLDSYLDPAFMSLIKDPDQRRKEQWEKTAQAQSGWNCAKLL 360

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYL+KANSLNLSEI EMKIVYRGGVDSEGRP+MVVVGAHF
Sbjct: 361 GFGDLGGPPLSAAEEYSLHSRYLSKANSLNLSEIAEMKIVYRGGVDSEGRPIMVVVGAHF 420

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERFV +VVKEFEP+IQ+PYTIVYFHSAASLQ +PD+GWM+RLQQILGRKHQRN
Sbjct: 421 LLRCLDLERFVFHVVKEFEPVIQKPYTIVYFHSAASLQIQPDLGWMRRLQQILGRKHQRN 480

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           L AIYVLHPTFGLKAAVFA+QL VDN VW KVVY+DRL+QLF+YVPREQLTIPDFVFQHD
Sbjct: 481 LHAIYVLHPTFGLKAAVFALQLFVDNVVWKKVVYVDRLMQLFRYVPREQLTIPDFVFQHD 540

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGL+VDPRTKY +
Sbjct: 541 LEVNGGKGLMVDPRTKYVY 557

BLAST of Cp4.1LG07g05460 vs. TrEMBL
Match: A0A067GTW9_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g008599mg PE=4 SV=1)

HSP 1 Score: 966.5 bits (2497), Expect = 1.5e-278
Identity = 475/559 (84.97%), Postives = 509/559 (91.06%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MYR VAT  AT      +DS D VV LDQ+PRWSDAEHR SL++ +EDPSFSNSYF DPL
Sbjct: 1   MYRPVAT--ATPRGGLPSDSGDSVVTLDQVPRWSDAEHRLSLDYESEDPSFSNSYFADPL 60

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
            S S AE    G+VSRFPVDHEINSKIYLWRGNPWNLEVD VVNSTNENLDEAHSSPGLH
Sbjct: 61  ASSSGAESSGNGMVSRFPVDHEINSKIYLWRGNPWNLEVDTVVNSTNENLDEAHSSPGLH 120

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
           AAAGPGL+EEC TLGGCRTGMAKVTNAYDLPAR+V+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 121 AAAGPGLAEECATLGGCRTGMAKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYR 180

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLIENGL+SIAMGCIYTE KNYPREPAAHVAIRTVRR +EKQK KI AVVFCTT++
Sbjct: 181 SCLELLIENGLKSIAMGCIYTEAKNYPREPAAHVAIRTVRRFLEKQKDKISAVVFCTTTA 240

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
            DTEIYKRLLPLYFPRDKHEEEVA+SKLPADVGDENGETIIDERKIRI  LPKKN+ KPP
Sbjct: 241 SDTEIYKRLLPLYFPRDKHEEEVAISKLPADVGDENGETIIDERKIRIKPLPKKNIPKPP 300

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
           + P +PPVS V L +RN SYLDSYLDPAFM+LIKDPDQRRKEQWEKTAQAQ+GWN  ++L
Sbjct: 301 KAPVEPPVSDVGLIRRNSSYLDSYLDPAFMSLIKDPDQRRKEQWEKTAQAQSGWNCAKML 360

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYLAKANSLNLSEI EMKIVYRGGVDSEGRPVMVVVGAHF
Sbjct: 361 GFGDLGGPPLSAAEEYSLHSRYLAKANSLNLSEIAEMKIVYRGGVDSEGRPVMVVVGAHF 420

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERFVLYVVKEFEPLIQ+PY+IVYFHSAASLQ +PD+GWM+RLQQ+LGRKHQRN
Sbjct: 421 LLRCLDLERFVLYVVKEFEPLIQKPYSIVYFHSAASLQLQPDLGWMRRLQQVLGRKHQRN 480

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           L AIYVLHPTF LKA +F +QLLVDN VW KVVY+DRLLQLF+YVPREQLTIPDFVFQHD
Sbjct: 481 LHAIYVLHPTFHLKATIFTLQLLVDNVVWKKVVYVDRLLQLFRYVPREQLTIPDFVFQHD 540

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGLIVDPRTKY +
Sbjct: 541 LEVNGGKGLIVDPRTKYVY 557

BLAST of Cp4.1LG07g05460 vs. TrEMBL
Match: M5XY84_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003605m1g PE=4 SV=1)

HSP 1 Score: 965.3 bits (2494), Expect = 3.2e-278
Identity = 472/559 (84.44%), Postives = 509/559 (91.06%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MYRTVAT+   T   + TDS D VV+LDQ+PRWS+A+HRSSLE+ NEDPSFSN YFPDPL
Sbjct: 1   MYRTVATATTATRGGSPTDSGDCVVSLDQVPRWSNADHRSSLEYDNEDPSFSNKYFPDPL 60

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
           TS S  E  +  +VSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNEN+DEAH SPGLH
Sbjct: 61  TSQSGGESSSSWIVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENMDEAHCSPGLH 120

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
           AAAGPGL+EEC  LGGCRTGMAKVT AYDLPAR+V+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 121 AAAGPGLAEECAALGGCRTGMAKVTKAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYR 180

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLIENGLQSIAMGCIYTE KNYPREPAAHVAIRTVRR +EKQK KI AVVFCTT+S
Sbjct: 181 SCLELLIENGLQSIAMGCIYTEAKNYPREPAAHVAIRTVRRFLEKQKDKIAAVVFCTTTS 240

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
            DTEIYKRLLPLYFPRDK EEEVA+SKLPADVGDENGETIIDERKIRI  LPKKN+ KPP
Sbjct: 241 TDTEIYKRLLPLYFPRDKLEEEVAMSKLPADVGDENGETIIDERKIRIKPLPKKNIPKPP 300

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
           + P D PVS V L +RN SYLDS+LDPAFM+LIKDPDQRRKEQWEKTAQAQ GWN  ++L
Sbjct: 301 KAPVDLPVSDVGLVRRNSSYLDSFLDPAFMSLIKDPDQRRKEQWEKTAQAQGGWNCAKML 360

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYLAKANSL+LSEI EMKIVYRGGVDSEGRPVMVVVGAHF
Sbjct: 361 GFGDLGGPPLSAAEEYSLHSRYLAKANSLHLSEIAEMKIVYRGGVDSEGRPVMVVVGAHF 420

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERF+ YVVKEFEP+IQ+PYTIVYFHSAASLQ +PD+GWM+R+QQILGRKHQRN
Sbjct: 421 LLRCLDLERFIHYVVKEFEPIIQKPYTIVYFHSAASLQLQPDLGWMRRVQQILGRKHQRN 480

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           L AIYVLHPTFGLKAA+FA+QL VDN VW KVVY+DRLLQLF+YVPREQLTIPDFVFQHD
Sbjct: 481 LHAIYVLHPTFGLKAAIFALQLFVDNLVWKKVVYVDRLLQLFRYVPREQLTIPDFVFQHD 540

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGLIVDPRTKY +
Sbjct: 541 LEVNGGKGLIVDPRTKYVY 559

BLAST of Cp4.1LG07g05460 vs. TAIR10
Match: AT1G69340.1 (AT1G69340.1 appr-1-p processing enzyme family protein)

HSP 1 Score: 902.9 bits (2332), Expect = 1.0e-262
Identity = 444/559 (79.43%), Postives = 491/559 (87.84%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MY+T+ T A T    T T+S DYVV LDQIPRWSD E RSSLE    DP  SN  + +PL
Sbjct: 2   MYQTIPT-APTIRGGTPTESGDYVVTLDQIPRWSDVEQRSSLEDETGDPEHSNPRYANPL 61

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
            S S+A     G+VS+FPVDHEINS+IYLWRG PWNLEVDAVVNSTNENLDEAHSSPGLH
Sbjct: 62  ASSSEAGSSGNGMVSKFPVDHEINSRIYLWRGEPWNLEVDAVVNSTNENLDEAHSSPGLH 121

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
            AAGPGL+E+C TLGGCRTGMAKVTNAYDLPAR+V+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 122 VAAGPGLAEQCATLGGCRTGMAKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYR 181

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLI++GLQSIA+GCIYTE KNYPREPAAHVAIRTVRR +EKQK KI AVVFCTT+S
Sbjct: 182 SCLELLIDSGLQSIALGCIYTEAKNYPREPAAHVAIRTVRRFLEKQKDKISAVVFCTTTS 241

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
            DTEIYKRLLPLYFPRD+HEEEVA+SKLPADVGDENGET+IDERKIRI  LP K   +  
Sbjct: 242 SDTEIYKRLLPLYFPRDEHEEEVAISKLPADVGDENGETVIDERKIRIQALPNKPPPRSF 301

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
             P + P + + L +RN ++LDSYLDPAFM+LIKDPD+RRKEQWEKTAQAQ+G+N+ ++L
Sbjct: 302 PTPLERPSTDLTLLRRNSNHLDSYLDPAFMSLIKDPDERRKEQWEKTAQAQSGFNFVKLL 361

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYLAKANS+NLSEI EMKIVYRGGVD+EG PVMVVVGAHF
Sbjct: 362 GFGDLGGPPLSAAEEYSLHSRYLAKANSINLSEIAEMKIVYRGGVDTEGHPVMVVVGAHF 421

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERFVLYV+KEFEP+IQ+PY+IVYFHSAASLQ +PD+GWMKRL+QILGRKHQRN
Sbjct: 422 LLRCLDLERFVLYVIKEFEPVIQKPYSIVYFHSAASLQVQPDLGWMKRLEQILGRKHQRN 481

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           LQAIYVLHPTF LKA +  MQ  VDN VW KVVY DRLLQLFKYVPREQLTIPDFVFQHD
Sbjct: 482 LQAIYVLHPTFHLKATILTMQFFVDNVVWKKVVYADRLLQLFKYVPREQLTIPDFVFQHD 541

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGLIVDPRTKY +
Sbjct: 542 LEVNGGKGLIVDPRTKYVY 559

BLAST of Cp4.1LG07g05460 vs. TAIR10
Match: AT2G40600.1 (AT2G40600.1 appr-1-p processing enzyme family protein)

HSP 1 Score: 67.8 bits (164), Expect = 2.5e-11
Identity = 43/129 (33.33%), Postives = 69/129 (53.49%), Query Frame = 1

Query: 100 DAVVNSTNENLDEAHSSPG-LHAAAGPGLSEECGTLGG------CRTGMAKVTNAYDLPA 159
           DA+VN  NE +     + G +H AAGP L   C  +        C TG A++T  ++LPA
Sbjct: 99  DAIVNPANERMLGGGGADGAIHRAAGPQLRAACYEVPEVRPGVRCPTGEARITPGFNLPA 158

Query: 160 RKVMHTVGPKYAVKYHTAAENALSHCYRSCLELLIENGLQSIAMGCIYTEGKNYPREPAA 219
            +V+HTVGP Y    +   + +L++ Y++ L +  EN ++ IA   I      YP + AA
Sbjct: 159 SRVIHTVGPIYDSDVN--PQESLTNSYKNSLRVAKENNIKYIAFPAISCGIYGYPFDEAA 218

Query: 220 HVAIRTVRR 222
            + I T+++
Sbjct: 219 AIGISTIKQ 225

BLAST of Cp4.1LG07g05460 vs. TAIR10
Match: AT4G35750.1 (AT4G35750.1 SEC14 cytosolic factor family protein / phosphoglyceride transfer family protein)

HSP 1 Score: 67.8 bits (164), Expect = 2.5e-11
Identity = 41/152 (26.97%), Postives = 80/152 (52.63%), Query Frame = 1

Query: 394 IGEMKIVYRGGVDSEGRPVMVVVGAHFLLRCLDLERFVLYVVKEFEPLIQR-PYTIVYFH 453
           I +++I    G D  GR ++ ++G  F  R L L+    Y+ ++  P + R P+ ++Y H
Sbjct: 14  IEKLEIFKIHGRDKRGRKILRIIGKFFPARFLSLDVLKKYLEEKIFPRLGRKPFAVLYVH 73

Query: 454 SAASLQPR-PDMGWMKRLQQILGRKHQRNLQAIYVLHPTFGLKAAVFAM---QLLVDNAV 513
           +        P +  ++ +   +    + NLQ +Y LHP  GL++ +F     + L    +
Sbjct: 74  TGVQRSENFPGISALRAIYDAIPVNVRDNLQEVYFLHP--GLQSRLFLATCGRFLFSGGL 133

Query: 514 WNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 541
           + K+ YI R+  L+++V R ++ +P+FV+ HD
Sbjct: 134 YGKLRYISRVDYLWEHVRRNEIEMPEFVYDHD 163

BLAST of Cp4.1LG07g05460 vs. TAIR10
Match: AT3G10210.1 (AT3G10210.1 SEC14 cytosolic factor family protein / phosphoglyceride transfer family protein)

HSP 1 Score: 60.1 bits (144), Expect = 5.2e-09
Identity = 37/154 (24.03%), Postives = 75/154 (48.70%), Query Frame = 1

Query: 390 NLSEIGEMKIVYRGGVDSEGRPVMVVVGAHFLLRCLDLERFVLYVVKEFEPLI-QRPYTI 449
           + S++  ++     G+D  G  +  +VG +F  R +  ER   Y+ ++      + P  +
Sbjct: 45  DFSDLDLLQFFTLQGLDRSGNRIFRIVGKYFPARVVSAERLKKYISQKISNQCPEGPLCL 104

Query: 450 VYFHSAASLQPR-PDMGWMKRLQQILGRKHQRNLQAIYVLHPTFGLKAAVFAM-QLLVDN 509
           VY HS        P +  ++ + + L    +  LQ +Y +HP    +  +  + +LL+  
Sbjct: 105 VYMHSTVQKDDNSPGITILRWIYEDLPSDIKDRLQLVYFIHPGLRSRLVIATLGRLLLSG 164

Query: 510 AVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 541
            ++ K+ Y+ RL  L++ + + ++ IPDFV  HD
Sbjct: 165 GLYWKIKYVSRLQYLWEDIKKGEVEIPDFVKNHD 198

BLAST of Cp4.1LG07g05460 vs. NCBI nr
Match: gi|449452092|ref|XP_004143794.1| (PREDICTED: protein GDAP2 homolog [Cucumis sativus])

HSP 1 Score: 1060.1 bits (2740), Expect = 1.4e-306
Identity = 526/559 (94.10%), Postives = 538/559 (96.24%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MYRTVATSAAT   TTTTDS+DYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL
Sbjct: 1   MYRTVATSAAT---TTTTDSLDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
           TSPSDAEGG  GVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH
Sbjct: 61  TSPSDAEGGTNGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
           AAAGPGL +ECGTLGGCRTGMAKVTNAYDLPARKV+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 121 AAAGPGLLDECGTLGGCRTGMAKVTNAYDLPARKVIHTVGPKYAVKYHTAAENALSHCYR 180

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLIENGLQSIAMGCIYTE KNYPREPAAHVAIRTVRR +EKQ+ KIKAVVFCTTSS
Sbjct: 181 SCLELLIENGLQSIAMGCIYTEAKNYPREPAAHVAIRTVRRFLEKQRDKIKAVVFCTTSS 240

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
           +DTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRI +LPKKNV KPP
Sbjct: 241 VDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRIKSLPKKNVPKPP 300

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
           QV +D PVS VRLT+RN SYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL
Sbjct: 301 QVLNDTPVSDVRLTRRNSSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYLAKANSLNLSEI EMKIVYRGGVDSEGRPVMVVVGAHF
Sbjct: 361 GFGDLGGPPLSAAEEYSLHSRYLAKANSLNLSEIAEMKIVYRGGVDSEGRPVMVVVGAHF 420

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERFVLYVVKEFEPLIQ+PYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN
Sbjct: 421 LLRCLDLERFVLYVVKEFEPLIQKPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           L AIYVLHPTFGLKAAV AMQLLVDN VWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD
Sbjct: 481 LHAIYVLHPTFGLKAAVLAMQLLVDNVVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGLIVDPRTKY +
Sbjct: 541 LEVNGGKGLIVDPRTKYVY 556

BLAST of Cp4.1LG07g05460 vs. NCBI nr
Match: gi|659131461|ref|XP_008465697.1| (PREDICTED: protein GDAP2 homolog [Cucumis melo])

HSP 1 Score: 1059.7 bits (2739), Expect = 1.8e-306
Identity = 525/559 (93.92%), Postives = 537/559 (96.06%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MYRTVATSAAT   TTTTDS+DYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL
Sbjct: 1   MYRTVATSAAT---TTTTDSLDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
           TSPSDAEGG  GVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH
Sbjct: 61  TSPSDAEGGTNGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
           AAAGPGL +ECGTLGGCRTGMAKVTNAYDLPARKV+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 121 AAAGPGLLDECGTLGGCRTGMAKVTNAYDLPARKVIHTVGPKYAVKYHTAAENALSHCYR 180

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLIENGLQSIAMGCIYTE KNYPREPAAHVAIRTVRR +EKQ+ KIKAVVFCTTSS
Sbjct: 181 SCLELLIENGLQSIAMGCIYTEAKNYPREPAAHVAIRTVRRFLEKQRDKIKAVVFCTTSS 240

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
           +DTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRI  LPKKNV KPP
Sbjct: 241 VDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRIKPLPKKNVPKPP 300

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
            VP+D PVS VRLT+RN SYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGR+L
Sbjct: 301 PVPNDTPVSDVRLTRRNSSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRML 360

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYLAKANSLNLSEI EMKIVYRGGVDSEGRPVMVVVGAHF
Sbjct: 361 GFGDLGGPPLSAAEEYSLHSRYLAKANSLNLSEIAEMKIVYRGGVDSEGRPVMVVVGAHF 420

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERFVLYVVKEFEPLIQ+PYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN
Sbjct: 421 LLRCLDLERFVLYVVKEFEPLIQKPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           L AIYVLHPTFGLKAAV AMQLLVDN VWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD
Sbjct: 481 LHAIYVLHPTFGLKAAVLAMQLLVDNVVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGLIVDPRTKY +
Sbjct: 541 LEVNGGKGLIVDPRTKYVY 556

BLAST of Cp4.1LG07g05460 vs. NCBI nr
Match: gi|703117932|ref|XP_010101490.1| (hypothetical protein L484_017239 [Morus notabilis])

HSP 1 Score: 979.5 bits (2531), Expect = 2.4e-282
Identity = 480/559 (85.87%), Postives = 516/559 (92.31%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MYRT AT+A  T   + TD+VDYVV+LDQ+PRWSD+E+RSSLEF NEDPSFS SYFPDPL
Sbjct: 1   MYRTAATAATMTRGGSPTDNVDYVVSLDQVPRWSDSEYRSSLEFGNEDPSFSTSYFPDPL 60

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
           TS S AE  +  +VSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNE+LDEAHSSPGLH
Sbjct: 61  TSSSGAESSSNDMVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNESLDEAHSSPGLH 120

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
           AAAGPGL+EEC TLGGCRTGMAKVTNAYDLPAR+V+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 121 AAAGPGLAEECATLGGCRTGMAKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYR 180

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLIENGLQSIAMGCIYTE KNYPREPAAHVAIRTVRR +EKQK KIKAVVFCTT+S
Sbjct: 181 SCLELLIENGLQSIAMGCIYTETKNYPREPAAHVAIRTVRRFLEKQKDKIKAVVFCTTTS 240

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
            DTEIYKRLLPLYFPRDKHEEE+A SKLPADVGDENGETIIDERKIRI  LPKK + KP 
Sbjct: 241 SDTEIYKRLLPLYFPRDKHEEEIAFSKLPADVGDENGETIIDERKIRIKPLPKKTIPKPT 300

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
           + P D PVS V L +RN SYLD+YLDPAFM+LIKDPDQRRKEQWEKTAQA++G+N  ++L
Sbjct: 301 EAPIDLPVSDVSLVRRNSSYLDTYLDPAFMSLIKDPDQRRKEQWEKTAQARSGFNCAKLL 360

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYLAKANSLNLSEI EMKIVYRGGVDSEGRPVMVVVGAHF
Sbjct: 361 GFGDLGGPPLSAAEEYSLHSRYLAKANSLNLSEIAEMKIVYRGGVDSEGRPVMVVVGAHF 420

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERFVLYV+KEFEPLIQ+PYTIVYFHSAASLQ +PD+GWM+RLQQILGRKHQRN
Sbjct: 421 LLRCLDLERFVLYVIKEFEPLIQKPYTIVYFHSAASLQIQPDLGWMRRLQQILGRKHQRN 480

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           L AIYVLHPTFGLKAAVFA+QL VDN VW KVVY+DRLLQLF+YVPREQLTIPDFVFQHD
Sbjct: 481 LHAIYVLHPTFGLKAAVFALQLFVDNVVWKKVVYVDRLLQLFRYVPREQLTIPDFVFQHD 540

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGLIVDPRTKY +
Sbjct: 541 LEVNGGKGLIVDPRTKYVY 559

BLAST of Cp4.1LG07g05460 vs. NCBI nr
Match: gi|1009154088|ref|XP_015894980.1| (PREDICTED: protein GDAP2 homolog isoform X2 [Ziziphus jujuba])

HSP 1 Score: 976.9 bits (2524), Expect = 1.5e-281
Identity = 477/559 (85.33%), Postives = 512/559 (91.59%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MYRTVAT+  +T     TD+ DYVV+LDQ+PRWSDAE+R+SLE+ NEDPS+SNSYFPDPL
Sbjct: 1   MYRTVATAPTSTPGGPPTDNGDYVVSLDQVPRWSDAEYRASLEYENEDPSYSNSYFPDPL 60

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
           TSP   +  + G VSRFPVDHEIN KIYLWRGNPWNLEVDAVVNSTNE+LDEAHSSPGLH
Sbjct: 61  TSPPGEDSSSNGNVSRFPVDHEINLKIYLWRGNPWNLEVDAVVNSTNESLDEAHSSPGLH 120

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
           AAAGPGL+EEC TLGGCRTGMAKVTNAYDLPAR+V+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 121 AAAGPGLAEECATLGGCRTGMAKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYR 180

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLIEN L+SIAMGCIYTE KNYPREPAAHVAIRTVRR +EKQK KI AVVFCTT+S
Sbjct: 181 SCLELLIENRLRSIAMGCIYTENKNYPREPAAHVAIRTVRRFLEKQKDKITAVVFCTTTS 240

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
            DTEIYKRLLPLYFPRDKHEEEVA+SKLPADVGDENGETIIDERKIRI  LPKK + +PP
Sbjct: 241 TDTEIYKRLLPLYFPRDKHEEEVAISKLPADVGDENGETIIDERKIRIKPLPKKTIPRPP 300

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
           Q P D PVS V L +RN SYLDSYLDPAFM+LIKDPDQRRKEQWEKTAQAQ GWN  ++L
Sbjct: 301 QAPSDLPVSDVGLARRNSSYLDSYLDPAFMSLIKDPDQRRKEQWEKTAQAQGGWNCAKLL 360

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYLAKANSLNLSEI EMKIVYRGGVDSEGRPVMVVVGAHF
Sbjct: 361 GFGDLGGPPLSAAEEYSLHSRYLAKANSLNLSEIAEMKIVYRGGVDSEGRPVMVVVGAHF 420

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERFVLYVVKEFEPLIQ+PYTIVYFHSAASLQ +PD+GWM+RLQQILGRKHQRN
Sbjct: 421 LLRCLDLERFVLYVVKEFEPLIQKPYTIVYFHSAASLQLQPDLGWMRRLQQILGRKHQRN 480

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           L AIYVLHPTFGLKAA+FA+QL VDN  W KVVY+DRLLQLF+YVPREQLTIPDFVFQHD
Sbjct: 481 LHAIYVLHPTFGLKAAIFALQLFVDNVAWKKVVYVDRLLQLFRYVPREQLTIPDFVFQHD 540

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGLIVDPRTKY +
Sbjct: 541 LEVNGGKGLIVDPRTKYVY 559

BLAST of Cp4.1LG07g05460 vs. NCBI nr
Match: gi|658009262|ref|XP_008339836.1| (PREDICTED: protein GDAP2 homolog isoform X1 [Malus domestica])

HSP 1 Score: 971.8 bits (2511), Expect = 5.0e-280
Identity = 476/559 (85.15%), Postives = 510/559 (91.23%), Query Frame = 1

Query: 1   MYRTVATSAATTTATTTTDSVDYVVNLDQIPRWSDAEHRSSLEFVNEDPSFSNSYFPDPL 60
           MYRTVAT+   T   + TDS D VV LDQ+PRWS++EHRSSLE+ NEDPSFSNS+FPDPL
Sbjct: 1   MYRTVATATTATRGGSPTDSGDCVVTLDQVPRWSNSEHRSSLEYDNEDPSFSNSFFPDPL 60

Query: 61  TSPSDAEGGNYGVVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENLDEAHSSPGLH 120
           TS S  E  + G+VSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNEN+DEAH SPGLH
Sbjct: 61  TSQSGGESSSNGIVSRFPVDHEINSKIYLWRGNPWNLEVDAVVNSTNENMDEAHCSPGLH 120

Query: 121 AAAGPGLSEECGTLGGCRTGMAKVTNAYDLPARKVMHTVGPKYAVKYHTAAENALSHCYR 180
           AAAGPGL+EEC +LGGCRTGMAKVT AYDLPAR+V+HTVGPKYAVKYHTAAENALSHCYR
Sbjct: 121 AAAGPGLAEECASLGGCRTGMAKVTKAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYR 180

Query: 181 SCLELLIENGLQSIAMGCIYTEGKNYPREPAAHVAIRTVRRLIEKQKGKIKAVVFCTTSS 240
           SCLELLIENGLQSIAMGCIYTE KNYPREPAAHVAIRTVRR +EKQK KI AVVFCTT+S
Sbjct: 181 SCLELLIENGLQSIAMGCIYTEAKNYPREPAAHVAIRTVRRFLEKQKDKIAAVVFCTTTS 240

Query: 241 IDTEIYKRLLPLYFPRDKHEEEVALSKLPADVGDENGETIIDERKIRINTLPKKNVLKPP 300
           +DTEIYKRLLPLYFPRDK EEE+ALSKLPADVGDENGETIIDERKIRI  LPKKN+ KP 
Sbjct: 241 MDTEIYKRLLPLYFPRDKLEEEIALSKLPADVGDENGETIIDERKIRIKPLPKKNIPKPX 300

Query: 301 QVPDDPPVSHVRLTQRNPSYLDSYLDPAFMALIKDPDQRRKEQWEKTAQAQTGWNYGRIL 360
           Q P + PVS V L QRN  YLDSYLDPAFM+LIKDPDQRRKEQWEKTAQAQ+GWN  +IL
Sbjct: 301 QAPVELPVSDVGLVQRNSPYLDSYLDPAFMSLIKDPDQRRKEQWEKTAQAQSGWNCAKIL 360

Query: 361 GFGELGGAPLSAAEEYSLHSRYLAKANSLNLSEIGEMKIVYRGGVDSEGRPVMVVVGAHF 420
           GFG+LGG PLSAAEEYSLHSRYLAKANSLNLSE+ EMKIVYRGGVDSEGRPVMVVVGAHF
Sbjct: 361 GFGDLGGPPLSAAEEYSLHSRYLAKANSLNLSELAEMKIVYRGGVDSEGRPVMVVVGAHF 420

Query: 421 LLRCLDLERFVLYVVKEFEPLIQRPYTIVYFHSAASLQPRPDMGWMKRLQQILGRKHQRN 480
           LLRCLDLERFV YV KEFEPLIQ+PYTIVYFHSAASLQ +PD+GWMKR+QQILGRKHQRN
Sbjct: 421 LLRCLDLERFVHYVXKEFEPLIQKPYTIVYFHSAASLQLQPDLGWMKRVQQILGRKHQRN 480

Query: 481 LQAIYVLHPTFGLKAAVFAMQLLVDNAVWNKVVYIDRLLQLFKYVPREQLTIPDFVFQHD 540
           L AIYVLHPTFGLKAA+FA+QL VDN VW KVVY+DRLLQLF+YVPREQLTIPDFVFQHD
Sbjct: 481 LHAIYVLHPTFGLKAAIFALQLFVDNVVWKKVVYVDRLLQLFRYVPREQLTIPDFVFQHD 540

Query: 541 LEVNGGKGLIVDPRTKYYW 560
           LEVNGGKGLIVDPRTKY +
Sbjct: 541 LEVNGGKGLIVDPRTKYVY 559

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
GDAP2_NEMVE7.6e-8735.56Protein GDAP2 homolog OS=Nematostella vectensis GN=gdap2 PE=3 SV=1[more]
GDAP2_XENTR2.4e-8035.44Ganglioside-induced differentiation-associated protein 2 OS=Xenopus tropicalis P... [more]
GDAP2_XENLA3.2e-7734.48Ganglioside-induced differentiation-associated protein 2 OS=Xenopus laevis PE=2 ... [more]
GDAP2_HUMAN4.2e-7733.72Ganglioside-induced differentiation-associated protein 2 OS=Homo sapiens GN=GDAP... [more]
GDAP2_DANRE7.1e-7737.34Ganglioside-induced differentiation-associated protein 2 OS=Danio rerio GN=gdap2... [more]
Match NameE-valueIdentityDescription
A0A0A0KS03_CUCSA9.7e-30794.10Uncharacterized protein OS=Cucumis sativus GN=Csa_5G487740 PE=4 SV=1[more]
W9RFR4_9ROSA1.7e-28285.87Uncharacterized protein OS=Morus notabilis GN=L484_017239 PE=4 SV=1[more]
F6HEZ4_VITVI8.6e-27984.79Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g00870 PE=4 SV=... [more]
A0A067GTW9_CITSI1.5e-27884.97Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g008599mg PE=4 SV=1[more]
M5XY84_PRUPE3.2e-27884.44Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003605m1g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G69340.11.0e-26279.43 appr-1-p processing enzyme family protein[more]
AT2G40600.12.5e-1133.33 appr-1-p processing enzyme family protein[more]
AT4G35750.12.5e-1126.97 SEC14 cytosolic factor family protein / phosphoglyceride transfer fa... [more]
AT3G10210.15.2e-0924.03 SEC14 cytosolic factor family protein / phosphoglyceride transfer fa... [more]
Match NameE-valueIdentityDescription
gi|449452092|ref|XP_004143794.1|1.4e-30694.10PREDICTED: protein GDAP2 homolog [Cucumis sativus][more]
gi|659131461|ref|XP_008465697.1|1.8e-30693.92PREDICTED: protein GDAP2 homolog [Cucumis melo][more]
gi|703117932|ref|XP_010101490.1|2.4e-28285.87hypothetical protein L484_017239 [Morus notabilis][more]
gi|1009154088|ref|XP_015894980.1|1.5e-28185.33PREDICTED: protein GDAP2 homolog isoform X2 [Ziziphus jujuba][more]
gi|658009262|ref|XP_008339836.1|5.0e-28085.15PREDICTED: protein GDAP2 homolog isoform X1 [Malus domestica][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002589Macro_dom
IPR001251CRAL-TRIO_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG07g05460.1Cp4.1LG07g05460.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001251CRAL-TRIO lipid binding domainGENE3DG3DSA:3.40.525.10coord: 394..530
score: 1.
IPR001251CRAL-TRIO lipid binding domainPFAMPF13716CRAL_TRIO_2coord: 409..541
score: 2.7
IPR001251CRAL-TRIO lipid binding domainSMARTSM00516sec14_4coord: 393..539
score: 2.
IPR001251CRAL-TRIO lipid binding domainunknownSSF52087CRAL/TRIO domaincoord: 392..530
score: 1.83
IPR002589Macro domainPFAMPF01661Macrocoord: 103..215
score: 8.3
IPR002589Macro domainSMARTSM00506YBR022w_8coord: 86..215
score: 1.5
IPR002589Macro domainPROFILEPS51154MACROcoord: 74..254
score: 20
NoneNo IPR availableGENE3DG3DSA:3.40.220.10coord: 84..261
score: 2.1
NoneNo IPR availablePANTHERPTHR11106GANGLIOSIDE INDUCED DIFFERENTIATION ASSOCIATED PROTEIN 2-RELATEDcoord: 25..557
score:
NoneNo IPR availablePANTHERPTHR11106:SF65SUBFAMILY NOT NAMEDcoord: 25..557
score:
NoneNo IPR availableunknownSSF52949Macro domain-likecoord: 76..283
score: 3.14

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG07g05460Cp4.1LG11g07550Cucurbita pepo (Zucchini)cpecpeB149
The following block(s) are covering this gene:
GeneOrganismBlock
Cp4.1LG07g05460Cucurbita moschata (Rifu)cmocpeB753