Cp4.1LG13g09640 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG13g09640
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pax6
Location: Cp4.1LG13 : 7353737 .. 7362815 (-)
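
The location field above can be split into its parts programmatically. The following is a minimal sketch, not the database's own tooling: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG13 : 7353737 .. 7362815 (-)")
print(seqid, start, end, strand, span_length(start, end))
```

Under that inclusive-coordinate assumption the feature spans 9079 bp on the minus strand of Cp4.1LG13.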
The following sequences are available for this feature:

Gene sequence (with intron)

CGTCATCTGCTCGTTGATTTTGGTTTCATAGTTGTAAACGCTTATGTGATCAGTTTCATTCTTGTGTACTCTCATCGATCTGAAAAGCTCGTAGGAATCTGATTTCTGGTATGAATTTTGCATCTCTGCCTTAAGAACACAAATTTTCCACGAATCGAACGAAGAAGATACGTTGATTTGTTAGGTTTCTGTTTTTTTTCTTTATAAATCGAAGCAGAGGTAGGGTTTTGAGTTGTTGAGCTGCGAAAAGTTGTTGGAGTTTTCTGAAAATTAAATTGGATGTGATTTCGATGTTTAGCTGGTCGAACGCGAATATATCTGACTAAGACGAACAGATTGATTGAAGATCGAAAAATATATTAAGCTTCTGAGATCTCGGAAGCTTGGTTGTACCCGAAGTGAAGAGATTTTTTCATTATTTCTTGTTGTTTCTGTAATTAGAATGTCGTACTCGTTTCTTCTTTTGTGGTTATAATTTTTCTCAGTATCTATTCATTTGGCAGGTTGGGTTTAAGCGTACTGTCATGTCAACTAGTGATTATAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTCAATGCAATCTCCTCCGCCCAAAGAAATATCTTCTCTTCCATTAGTAGATGATAACATAGCAAAAGTAGACGAGCCTTGCGTATCAGATGGTCCAACAGTTTCCAATTCTAGTACAATCACAACTTCTGAATTTTCAGAGAAGAAGATTTCATTTTCTGAGGACGGTAAACGGAAATCTGATTTATGCAATATGAATATGGTCCAAAGCATTATTGGACCTTCCAGAGTCGAGTTTCAGGAAAATGATGTGTGTTCTACTGGCTGCGTGGAAAATAAAGAAACATGTATGGTGAATGAAAATCATGCGCTTGTTCTGCATGAGAAACCAGAGTTCAAGTTACCACATTCTGATGCTAACTCTAACCCTGGACTTTGTGCCGAAAAGGAAAGTGATGAGATTGACAGAAAACAACTTGATAGATTAGAGTTTTCAACTTCTGTAGCGAAAAAAGAAGCCGAATTATCAATTGGTTCGAAGGAACATCTTGTTCCAGACTCAGTTCTTGAAGGGAGTGACTTGAAATCTCTGAAGCAGATTAATTTAGAACCAGGGTTATTGAACTTAAGTTTAAGCAAGGAAGGAAGTCTCGATCAGCCTCTCACTGTAAATGTTGGGTCTAGTTATGATGGTTCTATTCAGGAGTCAAATAGGGAAAATTGGGATCTAAATACCTCGATGGAGTTTTGGGAAGGCTGTTCAAGTGGCGATCCTCCCGAGCATGTTCCAGCAGTTCAGACAAACACGGTTGTCACTATGCACAGATTCTCAACAGAAATGGTTAATACTGATACTCTGTCAGGAAAGCTAACTCCTTTAGATGACAGTGATCATCTTCATCTAAGTCTTAGTTCATCAGATCATAGGCATGTAATAAGTCAGGAACAAAGTTCATTTGTCAAGTTAGGCTTTAGGAAAACAAGTCCTTCTTTAAGCTCAACAGGAAGAGGTTTGCAGTTTGATGATCTTAACGGTGCACTAAAAGTCGTAAAGCCAGAGCCATTTGTTGAGGCTTCCAAACTCGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAGACAGTGCTATTGTGAAGCGCGAATTTCTTCAAATTCCCAATGCTTCAGATATTTATATACCAATGAACACAGTTAAGGCCAAGTCTGTTAACTCTGAATCAAATTACGAAAGCAAACAGGTAGCACTCGAAACATTAGGTGGTAGATTAGATTTGGTAGCTAAGCAAGTTCTTCCAGAGGTAGATAGTTCTTGTCCTGCACCGATGCCTTTTGTGGCAGAGATGACTGAAGCAGCTGGAAACTCTTGTTCAACTGATTTGATCACAGATGGAGGCATGTCAAACCATTCAGAATTGCAAACTCCTACTGAAGAACATCTTAATTTGAAAGTGCACGAAGGAGCATATCGTTGTGGTGGT
GAACTCGTCGATTCAGAAATGACTGATATAAGTAAGGATCCAGGTTCCAAAGATTTCAATAGTCCTATTATAAAGCCTATAGCAATGCCTAGAAATCCTTCTCGTACAAATGATTCAATTATAGAGGCAAACATGTCAAGCCCTTCCGAATTACATATTCCAACTACAGGACCTCTTAATACGAAAGTGCACCAAGCGGGATATGGCTGTGACGGTGGACTTGTGAATTCGGTAATGACAGATGTAAGTAAGGATACATGTTCCAAAGATTCCAGTAGTTCTGTTATAAAACCAGTCATTGTTGAAGATGAAAATCAGAATAACCCGCTCTGGCGTCCTTCGACACACACGAATGAGCAGTGCTCTAGTTTGCAGGGAGGTGAGGAAAGTTCTGTAAATGATGAGGAAAAGATCAGTCTATCAGCCGATTTATTAGAAGAAGATCCATATAGTTCTGAATATGAATCAGATGGTAAGTTGGATGTAAATGAGGCCATGGATGCAGTTGATAACGATATAGAAGAAGATTATGAAGACGGAGAGGTTCGGGAACCGACATTGACGACTCAAGTAGAAAGCAGTATATGCGAGACGAAAAAAGTAAAAAATTTTGATCATGGTGATTCTAGCAATGGACTGCCTGGTTCTGATTGCTGTTCCTCCTTGGTTTCTGTTAAGCAGGAAAATAAATTAGAAATCCTCGATGTTAAACGAGAAGACAATCTTCATTCTGTAACTTCGAATCAATCGTCCGAGCAAGAACGATCGAAAGAGTTGCCTGTCGAAGAGCATACCACTAGAGTGTGTTTGAACAAGGCCAACAAGGCTAAAACATCTGCCTTAGAGGACCAGGAAACTTCCCCTGAGAAAGCCAGTAATGGAATCGAAGAATCGATTACGACAGTTTCTCAGAGCGACGCAGAGAAGGTTAAAACAGTAGATATCGTGCGAAACGACAATCCAGCTTTGCCAAATGTCGAGCCTTTAAATGATGATGATGTGACTGACGATATTACTCGTGGCAGTAAGCATAGCCGCATTGTTAGTCCCTGCAAACCTTCATCGTCTTCACTTCCTAGTAAAACGAAATCGAGTTTGGCGAGGTCGGTTTTAACACAAACTGATAGAGAACGAATACCCGACATGGGGCATGAAGGGGAAAAATTACATCCACAAGGAAGGTGATTATCTATTTCTTAGCTGTGCATTAATTTGATTACATTCTGCATTGCTTTGCTGATTTCTTTCTATTTTCTAGAGATGAACCATATAGGGACGTTTTCCAGAGATTTTATGTGAATAGACATCAGAATCTATCACCCCAAACCAATTTTAGCCGTAGAAGAGGTAGATTCACTATCCGGATTAACTCTGTCCAAGGTGAATGGGATTTTAATCCAACAATCTCTCCAGGAAATTACAATGATCAAGTACCACCACCCTATGATGCCCGTAGACGTAAATACATGCCTGCTGTTTCTGATGATGATATTGATCAAAACCATTATAAAATGAAACCCGATGGTCCATTTCGTAGCGCTGGTGATCATCGAGGTAGACAGATATTAGACGACGAAGGCCCCCTTTTTTGTCATATGGCCTCTAGGAGGAAGTCGCCTGGTCGAAGAGATGGGCCTCCTCCAGTGCGAGGTGTTAAAATGGTACACAGAATGCCTAGAAACATCAGTCCAAGTAGATGTAATCGTGAACGTGGATCGGAACTGGTTGGACCGCGACACGGTGAGAAGTTCATGAGGACATTTGAAGACGAAACTATGGATCCATTATACGCCCACCCTCAACCTTCGTTTGAAGTAGATCGGCCTCCTTTTATCCGAGACCGAAGGAACTTTCCTATTCAAAGAAAAAGTTTTCAAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTTCCCAATGGTTTCCATCCAAAAGAAAGTCTGAGAGGTTCTTTGGACATCCAGAAATGGCACGGCGAAGTCCTCCACCC
GGTTATAGGATGAGATCGCCCGATCAACCTCCTCAAATCCATGGAGATATGCCAGATCGAAGACATGGTTTCCCGTTTCCGTCACTGCCACCTAATGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCATATGAGACCAGGTCTACGAAGTCGAAACCGAACCGACAGAATGTCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAATAGGATAGAGAGTAACGAATACTTCGATGGACCTGTACATCCTGGTCAAATGAATGAACTGATTGATGATGGCAACGACGACGATCGAAGAAGGTTTTCGGACAGACACGAACATCTTCACCAATTCCGGCCACAATGTAATGATTCAGACGGTGAAAACTATCACAACGATGCAGACGAAAGAGCGAGGCCTTACAGATACTGCACGGAGGATGAAGAAGAGTTCCATGAAAGAGGTAAGATGAGGGAGAGGGAATTTGATAGACGTGTAAAGAACCAACCAGAAAATTTAGGTAGACGAACAGTGATTGAAGAACATGAAGTAGAAGAATACAGACATGGTCATGGTCGGCAGATGTGGAATGAACATCATCACCATCATCATCATCATAGCTTTGAAGACATTTCCCGAATGAAAAGAAAAAGAATTTGAAATTTCCTTTTCAATTCAGCTCAGAAGCCAGGTGAGATTCAGAAGAAGCTGCATTGTTCTTGTGCTTAGGAGGTGTTTGGATCCCTTTTATATAGAAAAATGGCCTGCTGCTCATTGGATGGATGGACTTATCCCATCCATATCTTATTAAATATTAAAACAATTCATTCACTTAAAAATATAATTCAATTCAATTTCCTAATTAGTCTCTAGGTGATTCTGATACGTTCAAGAAGAGAATTTAACCGCAAACTTCAAAATCCATACCTCCTTCCAGGTACTCATGGAGCTTGATGATCAAATAAACAATAATGTAAGATTCTCTCAGGTTTCTTCATATATAATTCAAAAGCGTTGGTTGAATCTTAAGTGCGGAAAAGAGAGAAATCCGGAAGATTCAAGAACAAGCAGATGAACAAAAGACTAAATTATTGAAGTTAACTTTTGATTTCATCTTGCGGCCAAACGCTAACCTTCTCCTTCTCGTATTTGATATACAGAGCAGCCGCCGCCGCTACGAAGTATACTTGAATCAGTATAATCGCTGGCGTGTGGTAATTGATGACGCCTTTTCCAGTGGCTAATTCCACTAGAACCAGTGCCGATAATCCGAGTATCGCAATCCATCCATTGATCCTCTCGCTGTACGGACTGAATCCGAATTCTACCTTAAGTCCGCCGGAAACCGCTTCCTTCAGCGGCACTGGCGGAAGATACACTGACGAGGCGGCGGCGGTTGATCCTTTCCTGGACTTCCGAGCCTTACGAATCTCTGCCTTCTTTCCCGTTCCGCGTCGGAGTTGGAGTCGGGCCAGTCGGTCCTCGAAATCGTCGCCGTCCGATCTAGGTTCCGAATCGGCGGCAGGAGATTCCGGATCTGCTGCTCTGGTTACGAAGGTGAGAGACTTCGGCTTGTGAGGCTTGGAATTGGAAGCGAAAGCGAGGGAGTTTGGAAATTGCAGTGAGGAAGATGGAATTGCCGCCATGGAAGGAGAATGCTGGGCTCGAGGATAAACTCGCCCCTCTGCTGTAACCGTCTGGACCTTATATTTTTTATTTTTCTTTTTATATTTTATTTTTCCGCTCTTTAAAATTAGTGGTTTATTTTTTTCCTTCTTTTTTTGAATTAAATAATTTAGCGGTTTCTATTTGTTCCGTTTTTATATTTTTGTTTATACGGTAATGACTAGAAATGATAATTATTTAAAGTTTAGGGACTAAAAAAATAAATTTAAAAATATTAAGACAAATAAATATTTTAAAAGTATTGGTGTGAAGTTTTAGAAACCAAATTCGTTAGTTCGCTCTCCAGGTCAATGCCATCGCCGATTTGGATTTCTCGTGAT
TCTTCCTTGATTCCATTCTTACTCTCTCTCGGTATCACCAGCAAAACGAGCATGTATTAGATCTGCGTTTCGGTATTGTTCGCTCTCTGCCTTTAGGATAGTAATCTCTTTGAATCGGTTGAGTGTGTTTCTTTCGAGGCAGAGCGGAGTTTTGTGGAAGTTTGAGTCGTTGTTGAGGATCAGCTGTTCAAATACTGGAATGTTTTCTACCAGTTCAATCGATTTCCTTATCAAGTGATTCTGGTGATTTGATGATCGGTTAGATGTGGAATGTGTTTGCGTAGATGGAGGTTTATGCTTCTTTTATTTTGTTCCTTTTAATTTTATCGTCAATTTAGTGTTGTTCGTTACGATTCTGGAATCTTTTGTTCTTTCATGAGATGGATGGAGAAGTTTTTGCGTGGATATTCTGCATAGTTGCTCTGAAATTTATTAACATTATGCCGAAAATTTCACTGCTGGAGTTTGATATACGGTTCATTGTGCTGTGTCTCCCCTGATCCAATCTCGAGGCGTGGATCATCTAAATTTTTGTGGTTAAAATTTGAACACAGGAACCTCTCATATGCTGGATTTAAGATTTGGATGCTGGTTTTATGATATTTTCACGATGCAACAGTGCTGCATCGCTAATATCTTTAGTTTTCGCTTGAGATTCATTGATTATTTCGACACAATGCCTGTCATATGCTATCTCGCATACTGAATCTTTTTTAAGATTACTAATGTATTGGTATCCATACGGTGCAGAAATCGGAAGTGAATTCTAAGAAGTTAGTGAGATTTTGGTGTTGGATTGAAGTGGTAAAAGATCAAGAAACTGTATTAAGCCGAGGTTTTGGAATAATGGCGGTACCAGAAAGTGAAGAGGTGCTCAACATTTTTCTTTGCTGTTTTCTGGATTTTGAAGCTCGTGCAAGTTTCTTCTTATAGGGTTTTATAATTATTCAGTAAAATGTTCCATTGGCACTATGCAGGTTGGTTTTAAGCGCATTGGATTCTCAGTTAGTGATTATGATGCAAATCTTCCTATCAAGAAAAGGAGATTTCCGGTAGTGCAGATCTCTCCATCTCCATCTGAAGGTATATCTTCATTCCATCCAGATGGAAATTTATTGAAGATTGAGCGGCCATCTCCACCTAAAAAGCTATCTTCATTTAATCCCGATGAAAATTCGTTAGAGGTCGAACAGCCGAGTCTATCTGTGACAATAGTTTCAAGTTCTAGTGCAGACACATGTTATGGGTTGTCAAACAGGAACCAGGACTGTGTTTCTAATGAGAATAAACGAAAATCTGATACTCATTCATGCTATGTGGATATGGTCCAGAACGATATTGGGATGCCAGGAGTCGAGTTTCCGGGACCCGGTTTGGGAGGACATGAAGATAAGTCCTTGGTAACTGAAAAACACTCCGTTCATCGATCACCGGAGATCTACGGTGAGTTGAAGTTATCATCAACTGGCGTCGACTCGGATCCTCTTGGTAGTAACAAAGAGGAAGAAATTGATGTAAAAATGCCTGAAGAAAAGTGCAGCTCTTCAATTTGTCAAGTTGAAGGAGGAGCTGAAGTATCAGAGAAATTGGTTTCTTACAAGAGTGACCTGAATAAGCAGAATTCTTTGGAGCCTGTGTTAATGGACTTGTCTTTAAACAAGCAAGGAAGTAGCTGCCATTGTGTCAAAGGTAACGTAGGGTCTGATTGCGATGATTGAATCAGATGGTAGCTGGAATATTGCCGAGGTTGAAGACGACGACGACGATGATAATAACATAGAAGAAGACTATGAAAATGGCGAGGTTCGGGAATCAATGCAAAAAGAGGCCCGTGCTTGTGAGAAAAGAGAAATTGAGCCATTGGATCATGCTGATTGTGATGATAAGAAGATCAATTCTGCTGGATTGCCTGATCATGAATGTTTCACATTAGGCCCTCTGGAACAGGAAACGAAACCTGAAAATCTGGACTCTAAGAGTGAAGACGATGTTCATACTAC
AACTGAAAGTACATCTTGTGAGCAAGAACATGAAGATCTTTGTGTGAAAGAACCACTTGACGTAGAGAATACTATTGGTGAGGATGTAAACAGGCCTATGAAGGCTGCAGGAAGAAGCCAATTATCTCAATATGTTAATAAGGACAAGTTAGAGGGCCACGACACCGCCGATGAAATCGAGGAACTGATTCCGAAATTTTCTCAGGGTGAGATGGAGAAAGCTATTGCTGTAGAGGTAGAGAGCAGAATAGGGATCTAACTTTGCCTACCAATATGTTGGACAAACGATCTGGGGAATGGGACTTTGGTCCCAACTTTTCTCCTGAAACATACAGTGACCAGCAGATAGATTACCATGTTCCTGATCTTGATCACGACCGATATAAAATTATTCCTGATGGTCGATTTGTCGGTGCTAACCGTCGCAGTAGGTCATTGCTGGACAATGAGGGACCTTTTTTTTTTCCATGGACCCTCAAGGAGGAGGTCACCTGGAAGAACTCATGGACCATGGTGGCAAAATGGTTAACAGAATGCCTAGAGATTATAGTCTTAGTAGATGCATGATGAAGTTGGTTCTTTTGATCAACATGTAGGAAAGTTCACTAGGAACTTTGCTGATGACACGGGATCCGATATATCGACGACCTCATCCTGCATACGAATTAGACAGACCTTTGTTCCGGGAAAGAAGGAACTTCTCATTCCAAAGAAGTGATTCTAAGTCTATAGTAAGATCCCGATCTCGCTCTCCGAGCCAATGTCTCTTTGAAAGATCTGATAGGTTTTATGGACGTCCCGACATGACACGTCGAAGATCTCCAAATTATAGGACAGACGGGACGAGATCGCCCGATCAGCATCCTATATGTGCGCATATGACAGGCCAAAGACAAGGATTCTGTTTCCTTTCACCATCTGATGATTTGAGGGATGTTGGTCCTACACCCAACCATGGCCATATGAGATCTATCATTCCTAATAGGAATCAAACTGAAAGATTACCTCTTAGAAACAGAAGTTATGATGCTATAGATCATCAAGTAAGGATAGGGAGCAATGAACTTTTTGATGATCCC

mRNA sequence

CGTCATCTGCTCGTTGATTTTGGTTTCATAGTTGTAAACGCTTATGTGATCAGTTTCATTCTTGTGTACTCTCATCGATCTGAAAAGCTCGTAGGAATCTGATTTCTGGTTGGGTTTAAGCGTACTGTCATGTCAACTAGTGATTATAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTCAATGCAATCTCCTCCGCCCAAAGAAATATCTTCTCTTCCATTAGTAGATGATAACATAGCAAAAGTAGACGAGCCTTGCGTATCAGATGGTCCAACAGTTTCCAATTCTAGTACAATCACAACTTCTGAATTTTCAGAGAAGAAGATTTCATTTTCTGAGGACGGTAAACGGAAATCTGATTTATGCAATATGAATATGGTCCAAAGCATTATTGGACCTTCCAGAGTCGAGTTTCAGGAAAATGATGTGTGTTCTACTGGCTGCGTGGAAAATAAAGAAACATGTATGGTGAATGAAAATCATGCGCTTGTTCTGCATGAGAAACCAGAGTTCAAGTTACCACATTCTGATGCTAACTCTAACCCTGGACTTTGTGCCGAAAAGGAAAGTGATGAGATTGACAGAAAACAACTTGATAGATTAGAGTTTTCAACTTCTGTAGCGAAAAAAGAAGCCGAATTATCAATTGGTTCGAAGGAACATCTTGTTCCAGACTCAGTTCTTGAAGGGAGTGACTTGAAATCTCTGAAGCAGATTAATTTAGAACCAGGGTTATTGAACTTAAGTTTAAGCAAGGAAGGAAGTCTCGATCAGCCTCTCACTGTAAATGTTGGGTCTAGTTATGATGGTTCTATTCAGGAGTCAAATAGGGAAAATTGGGATCTAAATACCTCGATGGAGTTTTGGGAAGGCTGTTCAAGTGGCGATCCTCCCGAGCATGTTCCAGCAGTTCAGACAAACACGGTTGTCACTATGCACAGATTCTCAACAGAAATGGTTAATACTGATACTCTGTCAGGAAAGCTAACTCCTTTAGATGACAGTGATCATCTTCATCTAAGTCTTAGTTCATCAGATCATAGGCATGTAATAAGTCAGGAACAAAGTTCATTTGTCAAGTTAGGCTTTAGGAAAACAAGTCCTTCTTTAAGCTCAACAGGAAGAGGTTTGCAGTTTGATGATCTTAACGGTGCACTAAAAGTCGTAAAGCCAGAGCCATTTGTTGAGGCTTCCAAACTCGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAGACAGTGCTATTGTGAAGCGCGAATTTCTTCAAATTCCCAATGCTTCAGATATTTATATACCAATGAACACAGTTAAGGCCAAGTCTGTTAACTCTGAATCAAATTACGAAAGCAAACAGGTAGCACTCGAAACATTAGGTGGTAGATTAGATTTGGTAGCTAAGCAAGTTCTTCCAGAGGTAGATAGTTCTTGTCCTGCACCGATGCCTTTTGTGGCAGAGATGACTGAAGCAGCTGGAAACTCTTGTTCAACTGATTTGATCACAGATGGAGGCATGTCAAACCATTCAGAATTGCAAACTCCTACTGAAGAACATCTTAATTTGAAAGTGCACGAAGGAGCATATCGTTGTGGTGGTGAACTCGTCGATTCAGAAATGACTGATATAAGTAAGGATCCAGGTTCCAAAGATTTCAATAGTCCTATTATAAAGCCTATAGCAATGCCTAGAAATCCTTCTCGTACAAATGATTCAATTATAGAGGCAAACATGTCAAGCCCTTCCGAATTACATATTCCAACTACAGGACCTCTTAATACGAAAGTGCACCAAGCGGGATATGGCTGTGACGGTGGACTTGTGAATTCGGTAATGACAGATGTAAGTAAGGATACATGTTCCAAAGATTCCAGTAGTTCTGTTATAAAACCAGTCATTGTTGAAGATGAAAATCAGAATAACCCGCTCTGGCGTCCTTCGACACACACGAATGAGCAGTGCTCTAGTTTGCAGGGAGGTGAGGAAAGTTCTGT
AAATGATGAGGAAAAGATCAGTCTATCAGCCGATTTATTAGAAGAAGATCCATATAGTTCTGAATATGAATCAGATGGTAAGTTGGATGTAAATGAGGCCATGGATGCAGTTGATAACGATATAGAAGAAGATTATGAAGACGGAGAGGTTCGGGAACCGACATTGACGACTCAAGTAGAAAGCAGTATATGCGAGACGAAAAAAGTAAAAAATTTTGATCATGGTGATTCTAGCAATGGACTGCCTGGTTCTGATTGCTGTTCCTCCTTGGTTTCTGTTAAGCAGGAAAATAAATTAGAAATCCTCGATGTTAAACGAGAAGACAATCTTCATTCTGTAACTTCGAATCAATCGTCCGAGCAAGAACGATCGAAAGAGTTGCCTGTCGAAGAGCATACCACTAGAGTGTGTTTGAACAAGGCCAACAAGGCTAAAACATCTGCCTTAGAGGACCAGGAAACTTCCCCTGAGAAAGCCAGTAATGGAATCGAAGAATCGATTACGACAGTTTCTCAGAGCGACGCAGAGAAGGTTAAAACAGTAGATATCGTGCGAAACGACAATCCAGCTTTGCCAAATGTCGAGCCTTTAAATGATGATGATGTGACTGACGATATTACTCGTGGCAGTAAGCATAGCCGCATTGTTAGTCCCTGCAAACCTTCATCGTCTTCACTTCCTAGTAAAACGAAATCGAGTTTGGCGAGGTCGGTTTTAACACAAACTGATAGAGAACGAATACCCGACATGGGGCATGAAGGGGAAAAATTACATCCACAAGGAAGAGATGAACCATATAGGGACGTTTTCCAGAGATTTTATGTGAATAGACATCAGAATCTATCACCCCAAACCAATTTTAGCCGTAGAAGAGGTAGATTCACTATCCGGATTAACTCTGTCCAAGGTGAATGGGATTTTAATCCAACAATCTCTCCAGGAAATTACAATGATCAAGTACCACCACCCTATGATGCCCGTAGACGTAAATACATGCCTGCTGTTTCTGATGATGATATTGATCAAAACCATTATAAAATGAAACCCGATGGTCCATTTCGTAGCGCTGGTGATCATCGAGGTAGACAGATATTAGACGACGAAGGCCCCCTTTTTTGTCATATGGCCTCTAGGAGGAAGTCGCCTGGTCGAAGAGATGGGCCTCCTCCAGTGCGAGGTGTTAAAATGGTACACAGAATGCCTAGAAACATCAGTCCAAGTAGATGTAATCGTGAACGTGGATCGGAACTGGTTGGACCGCGACACGGTGAGAAGTTCATGAGGACATTTGAAGACGAAACTATGGATCCATTATACGCCCACCCTCAACCTTCGTTTGAAGTAGATCGGCCTCCTTTTATCCGAGACCGAAGGAACTTTCCTATTCAAAGAAAAAGTTTTCAAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTTCCCAATGGTTTCCATCCAAAAGAAAGTCTGAGAGGTTCTTTGGACATCCAGAAATGGCACGGCGAAGTCCTCCACCCGGTTATAGGATGAGATCGCCCGATCAACCTCCTCAAATCCATGGAGATATGCCAGATCGAAGACATGGTTTCCCGTTTCCGTCACTGCCACCTAATGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCATATGAGACCAGGTCTACGAAGTCGAAACCGAACCGACAGAATGTCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAATAGGATAGAGAGTAACGAATACTTCGATGGACCTGTACATCCTGGTCAAATGAATGAACTGATTGATGATGGCAACGACGACGATCGAAGAAGGTTTTCGGACAGACACGAACATCTTCACCAATTCCGGCCACAATGTAATGATTCAGACGGTGAAAACTATCACAACGATGCAGACGAAAGAGCGAGGCCTTACAGATACTGCACGGAGGATGAAGAAGAGTTCCATGAAAGAGAGCAGCCGCCGCCGC
TACGAATGGCTAATTCCACTAGAACCAGTGCCGATAATCCGAGTATCGCAATCCATCCATTGATCCTCTCGCTGTACGGACTGAATCCGAATTCTACCTTAAGTCCGCCGGAAACCGCTTCCTTCAGCGGCACTGGCGGAAGATACACTGACGAGGCGGCGGCGGTTGATCCTTTCCTGGACTTCCGAGCCTTACGAATCTCTGCCTTCTTTCCCGTTCCGCGTCGGAGTTGGAGTCGGGCCAGTCGGTCCTCGAAATCGTCGCCGTCCGATCTAGGTTCCGAATCGGCGGCAGGAGATTCCGGATCTGCTGCTCTGGTTACGAAGAGCGGAGTTTTGTGGAAGTTTGAGTCGTTGTTGAGGATCAGCTGTTCAAATACTGGAATGTTTTCTACCATAAAATGTTCCATTGGCACTATGCAGGTTGGTTTTAAGCGCATTGGATTCTCAGTTAGTGATTATGATGCAAATCTTCCTATCAAGAAAAGGAGATTTCCGGTAGTGCAGATCTCTCCATCTCCATCTGAAGGTATATCTTCATTCCATCCAGATGGAAATTTATTGAAGATTGAGCGGCCATCTCCACCTAAAAAGCTATCTTCATTTAATCCCGATGAAAATTCGTTAGAGGTCGAACAGCCGAGTCTATCTGTGACAATAGTTTCAAGTTCTAGTGCAGACACATGTTATGGGTTGTCAAACAGGAACCAGGACTGTGTTTCTAATGAGAATAAACGAAAATCTGATACTCATTCATGCTATGTGGATATGGTCCAGAACGATATTGGGATGCCAGGAGTCGAGTTTCCGGGACCCGGTTTGGGAGGACATGAAGATAAGTCCTTGGTAACTGAAAAACACTCCGTTCATCGATCACCGGAGATCTACGGTGAGTTGAAGTTATCATCAACTGGCGTCGACTCGGATCCTCTTGGTAGTAACAAAGAGGAAGAAATTGATGTAAAAATGCCTGAAGAAAAGTGCAGCTCTTCAATTTGTCAAGTTGAAGGAGGAGCTGAAGTATCAGAGAAATTGGTTTCTTACAAGAGTGACCTGAATAAGCAGAATTCTTTGGAGCCTGTGTTAATGGACTTGTCTTTAAACAAGCAAGGAAGTAGCTGCCATTGTGTCAAAGGTAACGGTCTGATTGCGATGATTGAATCAGATGGTAGCTGGAATATTGCCGAGGTTGAAGACGACGACGACGATGATAATAACATAGAAGAAGACTATGAAAATGGCGAGGTTCGGGAATCAATGCAAAAAGAGGCCCGTGCTTGTGAGAAAAGAGAAATTGAGCCATTGGATCATGCTGATTGTGATGATAAGAAGATCAATTCTGCTGGATTGCCTGATCATGAATGTTTCACATTAGGCCCTCTGGAACAGGAAACGAAACCTGAAAATCTGGACTCTAAGAGTGAAGACGATGTTCATACTACAACTGAAAGTACATCTTGTGAGCAAGAACATGAAGATCTTTGTGTGAAAGAACCACTTGACGTAGAGAATACTATTGGTGAGGATGTAAACAGGCCTATGAAGGCTGCAGGAAGAAGCCAATTATCTCAATATGTTAATAAGGACAAGTTAGAGGGCCACGACACCGCCGATGAAATCGAGGAACTGATTCCGAAATTTTCTCAGGGTGAGATGGAGAAAGCTATTGCTGTAGAGAATAGGGATCTAACTTTGCCTACCAATATGTTGGACAAACGATCTGGGGAATGGGACTTTGGTCCCAACTTTTCTCCTGAAACATACAGTGACCAGCAGATAGATTACCATGTTCCTGATCTTGATCACGACCGATATAAAATTATTCCTGATGGTCGATTTGTCGGTGCTAACCGTCGCAGTAGGTCATTGCTGGACAATGAGGGACCTTTTTTTTTTCCATGGACCCTCAAGGAGGAGGTCACCTGGAAGAACTCATGGACCATGGTGGCAAAATGGTTAACAGAATGCCTAGAGATTATAGTCTTAGAAAGTTCACTAGGA
ACTTTGCTGATGACACGGGATCCGATATATCGACGACCTCATCCTGCATACGAATTAGACAGACCTTTGTTCCGGGAAAGAAGGAACTTCTCATTCCAAAGAAGTGATTCTAAGTCTATAGTAAGATCCCGATCTCGCTCTCCGAGCCAATGTCTCTTTGAAAGATCTGATAGGTTTTATGGACGTCCCGACATGACACGTCGAAGATCTCCAAATTATAGGACAGACGGGACGAGATCGCCCGATCAGCATCCTATATGTGCGCATATGACAGGCCAAAGACAAGGATTCTGTTTCCTTTCACCATCTGATGATTTGAGGGATGTTGGTCCTACACCCAACCATGGCCATATGAGATCTATCATTCCTAATAGGAATCAAACTGAAAGATTACCTCTTAGAAACAGAAGTTATGATGCTATAGATCATCAAGTAAGGATAGGGAGCAATGAACTTTTTGATGATCCC

Coding sequence (CDS)

ATGTCAACTAGTGATTATAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTCAATGCAATCTCCTCCGCCCAAAGAAATATCTTCTCTTCCATTAGTAGATGATAACATAGCAAAAGTAGACGAGCCTTGCGTATCAGATGGTCCAACAGTTTCCAATTCTAGTACAATCACAACTTCTGAATTTTCAGAGAAGAAGATTTCATTTTCTGAGGACGGTAAACGGAAATCTGATTTATGCAATATGAATATGGTCCAAAGCATTATTGGACCTTCCAGAGTCGAGTTTCAGGAAAATGATGTGTGTTCTACTGGCTGCGTGGAAAATAAAGAAACATGTATGGTGAATGAAAATCATGCGCTTGTTCTGCATGAGAAACCAGAGTTCAAGTTACCACATTCTGATGCTAACTCTAACCCTGGACTTTGTGCCGAAAAGGAAAGTGATGAGATTGACAGAAAACAACTTGATAGATTAGAGTTTTCAACTTCTGTAGCGAAAAAAGAAGCCGAATTATCAATTGGTTCGAAGGAACATCTTGTTCCAGACTCAGTTCTTGAAGGGAGTGACTTGAAATCTCTGAAGCAGATTAATTTAGAACCAGGGTTATTGAACTTAAGTTTAAGCAAGGAAGGAAGTCTCGATCAGCCTCTCACTGTAAATGTTGGGTCTAGTTATGATGGTTCTATTCAGGAGTCAAATAGGGAAAATTGGGATCTAAATACCTCGATGGAGTTTTGGGAAGGCTGTTCAAGTGGCGATCCTCCCGAGCATGTTCCAGCAGTTCAGACAAACACGGTTGTCACTATGCACAGATTCTCAACAGAAATGGTTAATACTGATACTCTGTCAGGAAAGCTAACTCCTTTAGATGACAGTGATCATCTTCATCTAAGTCTTAGTTCATCAGATCATAGGCATGTAATAAGTCAGGAACAAAGTTCATTTGTCAAGTTAGGCTTTAGGAAAACAAGTCCTTCTTTAAGCTCAACAGGAAGAGGTTTGCAGTTTGATGATCTTAACGGTGCACTAAAAGTCGTAAAGCCAGAGCCATTTGTTGAGGCTTCCAAACTCGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAGACAGTGCTATTGTGAAGCGCGAATTTCTTCAAATTCCCAATGCTTCAGATATTTATATACCAATGAACACAGTTAAGGCCAAGTCTGTTAACTCTGAATCAAATTACGAAAGCAAACAGGTAGCACTCGAAACATTAGGTGGTAGATTAGATTTGGTAGCTAAGCAAGTTCTTCCAGAGGTAGATAGTTCTTGTCCTGCACCGATGCCTTTTGTGGCAGAGATGACTGAAGCAGCTGGAAACTCTTGTTCAACTGATTTGATCACAGATGGAGGCATGTCAAACCATTCAGAATTGCAAACTCCTACTGAAGAACATCTTAATTTGAAAGTGCACGAAGGAGCATATCGTTGTGGTGGTGAACTCGTCGATTCAGAAATGACTGATATAAGTAAGGATCCAGGTTCCAAAGATTTCAATAGTCCTATTATAAAGCCTATAGCAATGCCTAGAAATCCTTCTCGTACAAATGATTCAATTATAGAGGCAAACATGTCAAGCCCTTCCGAATTACATATTCCAACTACAGGACCTCTTAATACGAAAGTGCACCAAGCGGGATATGGCTGTGACGGTGGACTTGTGAATTCGGTAATGACAGATGTAAGTAAGGATACATGTTCCAAAGATTCCAGTAGTTCTGTTATAAAACCAGTCATTGTTGAAGATGAAAATCAGAATAACCCGCTCTGGCGTCCTTCGACACACACGAATGAGCAGTGCTCTAGTTTGCAGGGAGGTGAGGAAAGTTCTGTAAATGATGAGGAAAAGATCAGTCTATCAGCCGATTTATTAGAAGAAGATCCATATAGTTCTGAATATGAATCAGATGGTAAGTTGGATGTAAATGAGGCCATGGATGCAGTTGATAACGATATAGAAGA
AGATTATGAAGACGGAGAGGTTCGGGAACCGACATTGACGACTCAAGTAGAAAGCAGTATATGCGAGACGAAAAAAGTAAAAAATTTTGATCATGGTGATTCTAGCAATGGACTGCCTGGTTCTGATTGCTGTTCCTCCTTGGTTTCTGTTAAGCAGGAAAATAAATTAGAAATCCTCGATGTTAAACGAGAAGACAATCTTCATTCTGTAACTTCGAATCAATCGTCCGAGCAAGAACGATCGAAAGAGTTGCCTGTCGAAGAGCATACCACTAGAGTGTGTTTGAACAAGGCCAACAAGGCTAAAACATCTGCCTTAGAGGACCAGGAAACTTCCCCTGAGAAAGCCAGTAATGGAATCGAAGAATCGATTACGACAGTTTCTCAGAGCGACGCAGAGAAGGTTAAAACAGTAGATATCGTGCGAAACGACAATCCAGCTTTGCCAAATGTCGAGCCTTTAAATGATGATGATGTGACTGACGATATTACTCGTGGCAGTAAGCATAGCCGCATTGTTAGTCCCTGCAAACCTTCATCGTCTTCACTTCCTAGTAAAACGAAATCGAGTTTGGCGAGGTCGGTTTTAACACAAACTGATAGAGAACGAATACCCGACATGGGGCATGAAGGGGAAAAATTACATCCACAAGGAAGAGATGAACCATATAGGGACGTTTTCCAGAGATTTTATGTGAATAGACATCAGAATCTATCACCCCAAACCAATTTTAGCCGTAGAAGAGGTAGATTCACTATCCGGATTAACTCTGTCCAAGGTGAATGGGATTTTAATCCAACAATCTCTCCAGGAAATTACAATGATCAAGTACCACCACCCTATGATGCCCGTAGACGTAAATACATGCCTGCTGTTTCTGATGATGATATTGATCAAAACCATTATAAAATGAAACCCGATGGTCCATTTCGTAGCGCTGGTGATCATCGAGGTAGACAGATATTAGACGACGAAGGCCCCCTTTTTTGTCATATGGCCTCTAGGAGGAAGTCGCCTGGTCGAAGAGATGGGCCTCCTCCAGTGCGAGGTGTTAAAATGGTACACAGAATGCCTAGAAACATCAGTCCAAGTAGATGTAATCGTGAACGTGGATCGGAACTGGTTGGACCGCGACACGGTGAGAAGTTCATGAGGACATTTGAAGACGAAACTATGGATCCATTATACGCCCACCCTCAACCTTCGTTTGAAGTAGATCGGCCTCCTTTTATCCGAGACCGAAGGAACTTTCCTATTCAAAGAAAAAGTTTTCAAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTTCCCAATGGTTTCCATCCAAAAGAAAGTCTGAGAGGTTCTTTGGACATCCAGAAATGGCACGGCGAAGTCCTCCACCCGGTTATAGGATGAGATCGCCCGATCAACCTCCTCAAATCCATGGAGATATGCCAGATCGAAGACATGGTTTCCCGTTTCCGTCACTGCCACCTAATGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCATATGAGACCAGGTCTACGAAGTCGAAACCGAACCGACAGAATGTCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAATAGGATAGAGAGTAACGAATACTTCGATGGACCTGTACATCCTGGTCAAATGAATGAACTGATTGATGATGGCAACGACGACGATCGAAGAAGGTTTTCGGACAGACACGAACATCTTCACCAATTCCGGCCACAATGTAATGATTCAGACGGTGAAAACTATCACAACGATGCAGACGAAAGAGCGAGGCCTTACAGATACTGCACGGAGGATGAAGAAGAGTTCCATGAAAGAGAGCAGCCGCCGCCGCTACGAATGGCTAATTCCACTAGAACCAGTGCCGATAATCCGAGTATCGCAATCCATCCATTGATCCTCTCGCTGTACGGACTGAATCCGAATTCTACCTTAAGTCCGCCGGAAACCGCTTCCTTCAGCG
GCACTGGCGGAAGATACACTGACGAGGCGGCGGCGGTTGATCCTTTCCTGGACTTCCGAGCCTTACGAATCTCTGCCTTCTTTCCCGTTCCGCGTCGGAGTTGGAGTCGGGCCAGTCGGTCCTCGAAATCGTCGCCGTCCGATCTAGGTTCCGAATCGGCGGCAGGAGATTCCGGATCTGCTGCTCTGGTTACGAAGAGCGGAGTTTTGTGGAAGTTTGAGTCGTTGTTGAGGATCAGCTGTTCAAATACTGGAATGTTTTCTACCATAAAATGTTCCATTGGCACTATGCAGGTTGGTTTTAAGCGCATTGGATTCTCAGTTAGTGATTATGATGCAAATCTTCCTATCAAGAAAAGGAGATTTCCGGTAGTGCAGATCTCTCCATCTCCATCTGAAGGTATATCTTCATTCCATCCAGATGGAAATTTATTGAAGATTGAGCGGCCATCTCCACCTAAAAAGCTATCTTCATTTAATCCCGATGAAAATTCGTTAGAGGTCGAACAGCCGAGTCTATCTGTGACAATAGTTTCAAGTTCTAGTGCAGACACATGTTATGGGTTGTCAAACAGGAACCAGGACTGTGTTTCTAATGAGAATAAACGAAAATCTGATACTCATTCATGCTATGTGGATATGGTCCAGAACGATATTGGGATGCCAGGAGTCGAGTTTCCGGGACCCGGTTTGGGAGGACATGAAGATAAGTCCTTGGTAACTGAAAAACACTCCGTTCATCGATCACCGGAGATCTACGGTGAGTTGAAGTTATCATCAACTGGCGTCGACTCGGATCCTCTTGGTAGTAACAAAGAGGAAGAAATTGATGTAAAAATGCCTGAAGAAAAGTGCAGCTCTTCAATTTGTCAAGTTGAAGGAGGAGCTGAAGTATCAGAGAAATTGGTTTCTTACAAGAGTGACCTGAATAAGCAGAATTCTTTGGAGCCTGTGTTAATGGACTTGTCTTTAAACAAGCAAGGAAGTAGCTGCCATTGTGTCAAAGGTAACGGTCTGATTGCGATGATTGAATCAGATGGTAGCTGGAATATTGCCGAGGTTGAAGACGACGACGACGATGATAATAACATAGAAGAAGACTATGAAAATGGCGAGGTTCGGGAATCAATGCAAAAAGAGGCCCGTGCTTGTGAGAAAAGAGAAATTGAGCCATTGGATCATGCTGATTGTGATGATAAGAAGATCAATTCTGCTGGATTGCCTGATCATGAATGTTTCACATTAGGCCCTCTGGAACAGGAAACGAAACCTGAAAATCTGGACTCTAAGAGTGAAGACGATGTTCATACTACAACTGAAAGTACATCTTGTGAGCAAGAACATGAAGATCTTTGTGTGAAAGAACCACTTGACGTAGAGAATACTATTGGTGAGGATGTAAACAGGCCTATGAAGGCTGCAGGAAGAAGCCAATTATCTCAATATGTTAATAAGGACAAGTTAGAGGGCCACGACACCGCCGATGAAATCGAGGAACTGATTCCGAAATTTTCTCAGGGTGAGATGGAGAAAGCTATTGCTGTAGAGAATAGGGATCTAACTTTGCCTACCAATATGTTGGACAAACGATCTGGGGAATGGGACTTTGGTCCCAACTTTTCTCCTGAAACATACAGTGACCAGCAGATAGATTACCATGTTCCTGATCTTGATCACGACCGATATAAAATTATTCCTGATGGTCGATTTGTCGGTGCTAACCGTCGCAGTAGGTCATTGCTGGACAATGAGGGACCTTTTTTTTTTCCATGGACCCTCAAGGAGGAGGTCACCTGGAAGAACTCATGGACCATGGTGGCAAAATGGTTAACAGAATGCCTAGAGATTATAGTCTTAGAAAGTTCACTAGGAACTTTGCTGATGACACGGGATCCGATATATCGACGACCTCATCCTGCATACGAATTAGACAGACCTTTGTTCCGGGAAAGAAGGAACTTCTCATTCCAAAGAAGTGATTCTAAGTCTATAGTAAGATCC
CGATCTCGCTCTCCGAGCCAATGTCTCTTTGAAAGATCTGATAGGTTTTATGGACGTCCCGACATGACACGTCGAAGATCTCCAAATTATAGGACAGACGGGACGAGATCGCCCGATCAGCATCCTATATGTGCGCATATGACAGGCCAAAGACAAGGATTCTGTTTCCTTTCACCATCTGATGATTTGAGGGATGTTGGTCCTACACCCAACCATGGCCATATGAGATCTATCATTCCTAATAGGAATCAAACTGAAAGATTACCTCTTAGAAACAGAAGTTATGATGCTATAGATCATCAAGTAAGGATAGGGAGCAATGAACTTTTTGATGATCCC

Protein sequence

MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITTSEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNENHALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKEHLVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLESKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGGRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLNLKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDAVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLNDDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTEDEEEFHEREQPPPLRMANSTRTSADNPSIAIHPLILSLYGLNPNSTLSPPETASFSGTGGRYTDEAAAVDPFLDFRALRISAFFPVPRRSWSRASRSSKSSPSDLGSESAAGDSGSAALVTKSGVLWKFESLLRISCSNTGMFSTIKCSIGTMQVGFKRIGFSVSDYDANLPIKKRRFPVVQISPSPSEGISSFHPDGNLLKIERPSPPKKLSSFNPDENSLEVEQPSLSVTIVSSSSADTCYGLSNRNQDCVSNENKRKSDTHSCYVDMVQNDIGMPGVEFPGPGLGGHEDKSLVTEKHSVHRSPEIYGELKLSSTGVDSDPLGSNKEEEIDVKMPEEKCSSSICQVEGGAEVSEKLVSYKSDLNKQNSLEPVLMDLSLNKQGSSCHCVKGNGLIAMIESDGSWNIAEVEDDDDDDNNIEEDYENGEVRESMQKEARACEKREIEPLDHADCDDKKINSAGLPDHECFTLGPLEQETKPENLDSKSEDDVHTTTESTSCEQEHEDLCVKEPLDVENTIGEDVNRPMKAAGRSQLSQYVNKDKLEGHDTADEIEELIPKFSQGEMEKAIAVENRDLTLPTNMLDKRSGEWDFGPNFSPETYSDQQIDYHVPDLDHDRYKIIPDGRFVGANRRSRSLLDNEGPFFFPWTLKEEVTWKNSWTMVAKWLTECLEIIVLESSLGTLLMTRDPIYRRPHPAYELDRPLFRERRNFSFQRSDSKSIVRS
RSRSPSQCLFERSDRFYGRPDMTRRRSPNYRTDGTRSPDQHPICAHMTGQRQGFCFLSPSDDLRDVGPTPNHGHMRSIIPNRNQTERLPLRNRSYDAIDHQVRIGSNELFDDP
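
The relationship between the CDS and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (the record does not name a translation table), and the helper name `translate` is hypothetical. It is applied here only to the first 30 bases of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
print(translate("ATGTCAACTAGTGATTATAATGCAATTGTT"))
```

The printed peptide can be compared with the first ten residues of the protein sequence (MSTSDYNAIV).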
BLAST of Cp4.1LG13g09640 vs. TrEMBL
Match: A0A0A0KU39_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G504130 PE=4 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 3.2e-121
Identity = 316/772 (40.93%), Positives = 435/772 (56.35%), Query Frame = 1

Query: 559  GYGCDGGLV--NSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSS- 618
            G+ CDG  +  N    D++    S +  +S   PV+     + N      + ++E   S 
Sbjct: 267  GFDCDGSFLQSNREKWDLNTSMESWEGCTSGDAPVVQISATRTNTTIETYSCSSEMVESD 326

Query: 619  -------LQGGEESSVNDEEKISLSAD------LLEEDPYSSEYESDGKLDVNEAMDAVD 678
                   L   E+   + +E + LS D      +L+EDPY SEYESDG  D+ E +D  D
Sbjct: 327  SPCGKQTLLDNEDKGDSTKEHLHLSLDSSYLKSVLDEDPYISEYESDGNWDIAETVDDND 386

Query: 679  -----------NDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGD------SSNGL 738
                       N++EEDYEDGEVRE    T+VE  + E ++++  DH        +S GL
Sbjct: 387  DNDDNDNDDNDNNVEEDYEDGEVRETMQETEVEVHVYEKREIEPLDHAGCNDKKINSVGL 446

Query: 739  PGSDCCSSLVSVKQENKLEILDVKREDN--LHSVTSNQSSEQERS----KELPVEEHTTR 798
               +  + L   KQE KLE LD + ED   + + T + S EQE      KEL   E+   
Sbjct: 447  LDHEFFT-LGPKKQETKLENLDYRSEDEDEVQTTTKSNSYEQENEDLCVKELHAVENAIG 506

Query: 799  VCLNKANKA----------KTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVR 858
              +N + KA          K    E Q T+ +K  N  EE + T SQ++ E    VD+V+
Sbjct: 507  EDVNISAKATERSQLSQYDKKGNFEGQGTA-DKILN--EEPVPTFSQNEVENAVAVDVVQ 566

Query: 859  NDNPALPNVEPLNDDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRE 918
            N +  LP V+   ++D   DI  G+++SRI++  + S+ S P K KS+ A+ VL+  DRE
Sbjct: 567  NRDLTLPTVKESVNEDDAKDINGGTRNSRIINFNRTSTDSTPCKAKSNFAKPVLSHKDRE 626

Query: 919  RIPDMGHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEW 978
             +P+M  E   + PQ RD+ Y ++ ++  +++ Q   P   FS RRGR T R+++   EW
Sbjct: 627  FVPNMVVERANMKPQERDDVYSNISKKISIDKRQGPPPLMGFSHRRGRNTNRLDNRSEEW 686

Query: 979  DFNPTISPGNYNDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQIL 1038
            DF P  SP  Y++Q    +               +DQN YK+ PDGPF  A + RGR+++
Sbjct: 687  DFGPNFSPETYSEQQIDYHVT------------GLDQNRYKIIPDGPFGGA-NRRGRELV 746

Query: 1039 DDEGPLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEK 1098
            +DE P F H  SRRKSPGRR G   VRG KMV+RMPR+ SP RC  E GS     +HGEK
Sbjct: 747  EDEEPFFFHGPSRRKSPGRRHG-HSVRGGKMVNRMPRDFSPGRCMDEGGS--FDRQHGEK 806

Query: 1099 FMRTFEDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQ 1158
            F R F D+T+D +Y  PQP ++VDR PF R+RRNF  QRK+F ++DSKSP RSR RSPSQ
Sbjct: 807  FTRNFADDTVDEMYPRPQPPYDVDR-PFFRERRNFSFQRKTFPKIDSKSPVRSRARSPSQ 866

Query: 1159 WFPSKRKSERFFGHPEMARRSPPPGY--RMRSPDQPPQIHGDMPDRRHGFPFPSLPPNDL 1218
            WF SKR S+RF   P M  R  P     RMRSPDQ   I G MP +R GF + S PP++L
Sbjct: 867  WFSSKR-SDRFCERPNMTHRRSPNYMTDRMRSPDQ-RSIRGYMPGQRQGFRYLS-PPDEL 926

Query: 1219 RDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNEL 1278
            RD+G A DHGHMRP + +RN+T R+  RNR ++ +DPR  RIE++  F GPV  GQ+   
Sbjct: 927  RDVGPAPDHGHMRPFIPNRNQTKRLPLRNRSYDAIDPR-GRIENDGLFYGPVRLGQLTGY 986

Query: 1279 IDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTEDE 1280
                 DDD RRF++RHE LH F+    DSDGE Y N  ++ +RP+R+C ED+
Sbjct: 987  NGGEPDDDERRFNERHEPLHSFKHGFRDSDGERYRNKGEDCSRPFRFCAEDD 1013
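
The percentages in an HSP header are the match counts divided by the alignment length. As a small sanity check, the sketch below recomputes them from the fractions; the function name `recompute_percentages` is hypothetical, and the sample line uses the figures from the first HSP header above.

```python
import re

def recompute_percentages(header_line):
    """Return (count, length, reported %, recomputed %) for each fraction."""
    results = []
    for count, length, reported in re.findall(
        r"(\d+)/(\d+) \((\d+\.\d+)%\)", header_line
    ):
        recomputed = round(100 * int(count) / int(length), 2)
        results.append((int(count), int(length), float(reported), recomputed))
    return results

line = "Identity = 316/772 (40.93%), Positives = 435/772 (56.35%)"
for count, length, reported, recomputed in recompute_percentages(line):
    print(f"{count}/{length}: reported {reported}%, recomputed {recomputed}%")
```

The same check applies to the other HSP headers in this record, since it reads only the `count/length (percent%)` pattern and not the field labels.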

BLAST of Cp4.1LG13g09640 vs. TrEMBL
Match: A0A0A0KNM2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G503580 PE=4 SV=1)

HSP 1 Score: 439.1 bits (1128), Expect = 3.0e-119
Identity = 309/761 (40.60%), Positives = 443/761 (58.21%), Query Frame = 1

Query: 551  LNTKVHQAGYGCDGG-----LVNSVMTDVSKDT--CSKD---SSSSVIKPVIVEDENQNN 610
            LNT + ++  GC GG      +++  T+ + +T  C  +   S S   K  +++ E++ N
Sbjct: 326  LNTSM-ESWEGCTGGDSPVVQMSATQTNTTIETHACPSEMVESDSPCGKQTLLDGEDKGN 385

Query: 611  PLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDA 670
             ++         C   +   + S+ D   +     +LEEDPY SEYESDG  D+ EA+D 
Sbjct: 386  SIY--------DCMPSKENLDLSL-DSSYLKPVQPVLEEDPYISEYESDGNWDIAEAVDD 445

Query: 671  VDND--IEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGD------SSNGLPGSDCCS 730
             DND  +EEDYEDGEVRE    ++VE    E ++++  DH        +S  LP  +   
Sbjct: 446  DDNDNHLEEDYEDGEVRETLQESEVEVLAYEKREIEPLDHAGCDDKKINSIRLPDHEL-H 505

Query: 731  SLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERS----KELPVEEHTTRVCLNKANKA 790
            +L  ++QE K E LD++ ED++ + T+++S EQE      KEL   E+T    +NKA K 
Sbjct: 506  ALGPLEQETKPENLDLRSEDDVRTTTNSKSYEQENEDLCVKELHAVENTISGDVNKAVKV 565

Query: 791  ----------KTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNV 850
                      K    E Q+T+ E      EE I T SQ + E    VD+V+N +  LP V
Sbjct: 566  TGRGQLFQFDKKHNFEAQDTADEMVD---EELIPTFSQGEVENAVAVDVVQNRDLTLPTV 625

Query: 851  EPLNDDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEG 910
            +   ++D   DI  G+++SRI++  + S  S P K KSS +RSVL+  +RE +P+M  EG
Sbjct: 626  KESVNEDDAKDINGGTRNSRIINFNRASIDSTPCKEKSSFSRSVLSHKEREFVPNMAVEG 685

Query: 911  EKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPG 970
              + PQ RD+ Y ++ ++  +++ +   P   FS RRGR + R++    EWDF P  SP 
Sbjct: 686  ANMQPQERDDAYSNITKKISIDKREGQPPLMGFSHRRGRSSNRLDHRSEEWDFGPNFSPE 745

Query: 971  NYNDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCH 1030
             Y++Q       +   ++P      +DQN YK+ PDGPF  A + RGR++L+DE P F H
Sbjct: 746  TYSEQ-------QIDYHVPG-----LDQNRYKITPDGPFGGA-NRRGRELLEDEEPFFFH 805

Query: 1031 MASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDET 1090
              SRRKS GRR GP  V G KMV+++PR+ SP RC  E GS     +HGEKF R F D+T
Sbjct: 806  GPSRRKSLGRRHGPN-VGGGKMVYKIPRDFSPGRCMDEGGS--FDRQHGEKFSRNFADDT 865

Query: 1091 MDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSE 1150
            +D +Y  PQP +++D+P F R+RRNF  QRKSF R+DSKSP RSR RSP QWF SKR S+
Sbjct: 866  VDLMYPRPQPPYDIDKP-FFRERRNFSFQRKSFPRIDSKSPVRSRARSPGQWFSSKR-SD 925

Query: 1151 RFFGHPEMARRSPPP--GYRMRSPDQPPQIHGDMPD-RRHGFPFPSLPPNDLRDMGSARD 1210
            RF    +M  R  P     RMRSPDQ P I G MP  RR GF F S   +++RD+G A D
Sbjct: 926  RFCERSDMTHRRSPNYRSERMRSPDQRP-IRGHMPPGRRQGFHFLSAS-DEMRDVGPAPD 985

Query: 1211 HGHMRPGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGP-VHPGQMNELIDDGNDD 1270
            HGHMR  +  RN+T+R+  RNR ++ +DP+  RIE++++F GP V  GQ+    D   DD
Sbjct: 986  HGHMRSIIPDRNQTERLPLRNRSYDAIDPQ-GRIENDDFFYGPPVRLGQLTGYNDGVPDD 1045

Query: 1271 DRRRFSDRHEHLHQFRPQCNDSDGENYHNDADERARPYRYC 1276
            D RRF++RHE L+ F+    DSDGE + N+ ++ +RP+R+C
Sbjct: 1046 DERRFNERHEPLYSFKHPFGDSDGERFRNNREDCSRPFRFC 1051

BLAST of Cp4.1LG13g09640 vs. TrEMBL
Match: A0A0B0PJI5_GOSAR (Putative sucrose-phosphate synthase 2 OS=Gossypium arboreum GN=F383_10325 PE=4 SV=1)

HSP 1 Score: 417.5 bits (1072), Expect = 9.4e-113
Identity = 401/1291 (31.06%), Positives = 616/1291 (47.71%), Query Frame = 1

Query: 54   NSSTITTSEFS---EKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVC----STG 113
            N+S++  S  S     +IS  E  KR SD  N++MVQ      RV+ +E        S  
Sbjct: 207  NASSVGGSGLSFPDASEISAHEKEKRSSDDTNVSMVQGNTNLLRVKLEEQSFAVQSRSLA 266

Query: 114  CVENKETCMVNENHALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLD-RLEFSTS 173
             +  K   +       ++ +  + +L   D   N  L      D   +K +D +      
Sbjct: 267  DISCKGKLVATGTSDNIMRKLAKSEL---DLVGNDSLTFSVGKDVYSQKSVDGKFGSKLP 326

Query: 174  VAKKEAELSIGSKEHLVPDSVLEGSDLKSLK-QINLEPGLLNLSLSKEGSLDQPLTVNVG 233
            +      LS+G +E+  P  +   ++ +  + Q   E   LNLSLSK     Q  +  V 
Sbjct: 327  LVSGSPGLSLGLREY--PSVMASRNNEQGFRNQEKTEHVSLNLSLSKGEGTTQLRSTAVQ 386

Query: 234  SSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEH-------VPAVQTNTVVTMHRFSTE 293
             +  GS   ++R NWDLNT+M++WEG +S D           +  V  +  +T+   ST+
Sbjct: 387  PNTKGSNMLADRTNWDLNTTMDYWEGPASDDGASKMATQMYDIKPVICSAGMTVASISTQ 446

Query: 294  MVNTDTLSGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFVKLG----FRKTSPSLSSTG 353
            +           P +  +   + +SS       S E S  ++LG    +   +P+   TG
Sbjct: 447  LQ---------IPEEIENRAKIKMSSIVSSQQYSAEDS--LRLGLTTPYLHLNPNEKPTG 506

Query: 354  RGLQFDDLNGALKVVKPEPFVEASKLES------KSDEVNVLGLSDSAIVKREFLQIPNA 413
               + D  N    V  P   V ASK         KS+ ++    SDS + K +    P  
Sbjct: 507  SSGKIDSGNVVANVSSPGEPVPASKPTMLNYKPVKSEPLDESVRSDSGVTKAK----PTG 566

Query: 414  SDIYIPMNTVKAKSVNSESNYESKQVALETLGGRLDLVAKQVLPEVDSSCPAPMPFVAEM 473
            S   + +  VK++ +   S    K   + TL   +D  + +  P  +S+   P      M
Sbjct: 567  S---LNITRVKSEIIEKCSLERLKSSTISTLKS-VDARSIKPEPACESNKEMPERMEGPM 626

Query: 474  TEAAGNSCSTDLITDGGMSNHSELQTPTEEHLNLKVHEGAYRCGGELVDSEMTDISKDPG 533
             ++     +    TD  +  H  + T  E  +  K  E +     ++    ++       
Sbjct: 627  NQSDEQMLAVPTSTDSSL--HGGVATHAEHFMQAKETEASVEA--QVASKMISSAGVTTN 686

Query: 534  SKDFNSPIIKPIAMPRNPSRTNDSIIEANMSSPSELHIPTTGPLNTK-VHQAGYGCDGG- 593
            ++ F        A    PS   +  + + M S +++       +  K    +G G     
Sbjct: 687  AEHFMQ------AKETEPS--GEGQVASQMISSADVTTHAEHFMQAKETEPSGEGLVASE 746

Query: 594  LVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVN 653
            +++S   DV++   +    +S  +  +VED +     +      + Q    +G  E S +
Sbjct: 747  MISSADHDVNESNIAGKLDNSTSQSKMVEDSDHCKLKFM-----DVQLPDSRGSVEGSAS 806

Query: 654  DEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDAV-DNDIEEDYEDGEVREPTLTTQVE 713
            DEEKI+LSAD+LEED Y S+YESD K ++  AMD   D   EED+EDGEVREP + T++E
Sbjct: 807  DEEKINLSADVLEEDSYGSDYESDDKRELATAMDIEHDRRAEEDFEDGEVREPVVNTEIE 866

Query: 714  SSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSE 773
              ICE ++  N + GD++   P S       +V ++  +   D+   + +   + N+ S 
Sbjct: 867  VPICEMQEAGNGNDGDNN---PSSSSFREKETVIKDPGITSNDINTNECI-DTSVNKDSA 926

Query: 774  QERSKELPVEEHTT------------------RVCLNKANKAKTSALEDQETSPEKASNG 833
             E +KE  ++E +                   R  L+ + K  T   ++ E +  + S+ 
Sbjct: 927  TEANKEACLQESSAVEMPSSQMDGKRHIKAIPRKSLDASEKKDTVKGQEGEQASIQFSDT 986

Query: 834  IEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLND-DDVTDDITRGSKHSRIVSPCKP 893
             + +  T+SQ   +  KT D     N  LP  E  +  DD   D+  G   SRI++  + 
Sbjct: 987  SQGTSVTISQGTDDAKKT-DSEGKGNSVLPKGEAFSSGDDAGKDVDNGGNRSRIINLSRA 1046

Query: 894  SSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVNRHQNL 953
            S+ S P +T+S   R++ +Q  RER+PD+  EG+K H +GRDE Y D   RF   RH ++
Sbjct: 1047 SNLSSPGRTRSISGRTLQSQIGRERLPDVALEGDKFHHRGRDEAYADSLHRFPRERH-HV 1106

Query: 954  SPQTN----FSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVSD 1013
             P  N    F R RGR + RI++++G+ D     +   YN   P  +   R K   AVSD
Sbjct: 1107 QPSRNNRISFMRGRGRISSRIDTLRGDQDSECNFASEFYNG--PTEFRVVRHKNASAVSD 1166

Query: 1014 DDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKMV 1073
             D + + Y    DG +   G   GR+IL+D+ P+F  +  RR+SPG RDGP   RG+ MV
Sbjct: 1167 ADPNFSSYNNGQDGAYFGTG-RGGRKILNDDPPIFSQLPPRRRSPGGRDGPAG-RGLPMV 1226

Query: 1074 HRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRDR 1133
             R+PRN+SPSRC  E GSELVG RH    MR F D+  DP++A  QPSFE    PF+R  
Sbjct: 1227 RRVPRNLSPSRCIAEDGSELVGLRH----MRGFADDHTDPMFARCQPSFEGLDGPFVRGN 1286

Query: 1134 RNF-PIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHP-EMARRSPPPGYRM-- 1193
            R F  +QR+   R  SKSP R R RSP  W   +R+S   FG P E+  R  PP YRM  
Sbjct: 1287 REFTSVQRRGIPRTRSKSPTRQRTRSPGPWSSLRRRSPDGFGGPLELPHRRSPPLYRMER 1346

Query: 1194 -RSPDQPPQIHGDMPDRRHGF-PFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFR 1253
             RSPD+ P   G+M  RRHG  P+   P NDLRD+  +RDHGH R G+ +R+ + R+  R
Sbjct: 1347 IRSPDR-PCFAGEMGVRRHGSPPYLPRPSNDLRDLDPSRDHGHPRSGISNRSPSGRILLR 1406

Query: 1254 N-RRFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCN 1286
            N RR + +DPR+ R E ++YF GP+  G+ ++L  DGN D+RRR+ DR   +  FR    
Sbjct: 1407 NSRRLDLVDPRE-RNEGDDYFGGPMPSGRFHDLGTDGNPDERRRYGDRRGPVRSFRSPYG 1440

BLAST of Cp4.1LG13g09640 vs. TrEMBL
Match: A0A0D2V2P5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_012G066900 PE=4 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 3.7e-109
Identity = 403/1296 (31.10%), Positives = 613/1296 (47.30%), Query Frame = 1

Query: 48   DGPTVSN--SSTITTSEFSEKKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVC-- 107
            + P  S+   S ++  + SEK  S  E  KR SD  N++MVQ      RV+ +E      
Sbjct: 204  ENPNASSVGGSGLSFPDASEK--SAHEKEKRSSDDTNVSMVQGNTNLLRVKLEEQSFAIQ 263

Query: 108  --STGCVENKETCMVNENHALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLD-RL 167
              S   +  K   +       ++ +  + +L   D   N  L      D   +K +D + 
Sbjct: 264  SRSLADISCKGKLVATGTSDNIMRKLAKSEL---DLVGNDSLTFSVGKDVYSQKSVDGKF 323

Query: 168  EFSTSVAKKEAELSIGSKEHLVPDSVLEGSDLKSLK-QINLEPGLLNLSLSKEGSLDQPL 227
                 +      LS+G +E+  P ++  G++ +  + Q   EP  LNLSLSK     QP 
Sbjct: 324  GSQLPLVSGSPGLSLGLREY--PSAMASGNNEQRFRNQEKTEPVSLNLSLSKGEGTTQPR 383

Query: 228  TVNVGSSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMV 287
            +  V  +  GS   ++R NWDLNT+M++WEG +S D    +     +    +   S  M 
Sbjct: 384  STAVQPNTKGSNMLADRTNWDLNTTMDYWEGPASDDGARKMATQMYDIKPVI--CSAGMT 443

Query: 288  NTDTLSGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFVKLG----FRKTSPSLSSTGRG 347
                 +    P +  +   + +SS       S E S  ++LG    +   +P+    G  
Sbjct: 444  VASMPTQLQIPEEIENRAKIKMSSIVSSQQYSAEDS--LRLGLTTPYLHLNPNEKPAGSS 503

Query: 348  LQFDDLNGALKVVKPEPFVEASKLES------KSDEVNVLGLSDSAIVKREFLQIPNASD 407
             +    +    V  P   V ASK         KS+ ++    SDS + K +         
Sbjct: 504  GKIVSGHVVANVSSPGEPVPASKPTMVNYKPVKSEPLDERVRSDSGVTKAK--------- 563

Query: 408  IYIPMNTVKAKSVNSESNYESKQVALETLGGRLDLVAKQVLPEVDSSCPAPMPFVAEMTE 467
               P   +    V SE     ++ +LE    RL       L  VD+S   P P      E
Sbjct: 564  ---PTGLLNITQVKSEI---IEKCSLE----RLKSSTISTLKSVDASSIKPEPVCESNKE 623

Query: 468  AAGNSCSTDLITDGGMSNHSE--LQTPTEEHLNLKVHEGAYRCGGELVDSEMTDISKDP- 527
                   T    +G M+   E  L  PT    +L    G    G   + ++ T+ S +  
Sbjct: 624  -------TPQRMEGPMNQSDEQMLAVPTSTDSSL---HGVTTHGEHFMQAKETEASVEAQ 683

Query: 528  -GSKDFNSPIIKPIA----MPRNPSRTNDSIIEANMSSPSELHIPTTGPLNTK-VHQAGY 587
              SK  +S  +   A      +    + +  + + M S +++       +  K    +G 
Sbjct: 684  VASKMISSAGVTTHAEHFIQAKETEPSGEGQVASQMISSADVTTHAEHFMQAKETEPSGE 743

Query: 588  GCDGG-LVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSSLQGG 647
            G     +++SV  D ++   +    +S  +  +VED +     +      + Q    +G 
Sbjct: 744  GLVASEMISSVDHDDNESNIAGKLDNSTSQSKMVEDSDHCKLKFM-----DVQLPDSRGS 803

Query: 648  EESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDAV-DNDIEEDYEDGEVREPT 707
             E S +DEEKI+LS D+LEED Y S+YESD K ++  AMD   D   EE++EDGEVREP 
Sbjct: 804  VEGSASDEEKINLSGDVLEEDSYGSDYESDDKRELATAMDIEHDRRGEEEFEDGEVREPV 863

Query: 708  LTTQVESSICETKKVKNFDHGDSSN---------------GLPGSDCCSSLVSVKQENKL 767
            + T++E  ICE ++  N + G ++                G+  +D  ++  +    NK 
Sbjct: 864  VNTEIEVLICEMQEAGNGNDGGNNPLSSSFREKETLIKDPGITSNDTNTNECTDTSVNK- 923

Query: 768  EILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKAKTSALEDQE--TSPE 827
               D   E N  +     S+ +  S ++  + H   +     + ++   ++ QE   +  
Sbjct: 924  ---DSATEANKEACLQESSAVEMPSSQMDGKRHIKAIPRKSLDASEKDTVKGQEGELASI 983

Query: 828  KASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLND-DDVTDDITRGSKHSRIV 887
            + S+  + +  T+SQ   +  KT D     N  LP  E  +  DD   D+  G   SRI+
Sbjct: 984  QFSDTSQGTSVTISQGTDDAKKT-DSEGKGNSVLPKGEAFSSGDDAGKDVDNGGNRSRII 1043

Query: 888  SPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVN 947
            +  + S+ S P +T+S   R++ +Q  RER+PD+  EG+K H +GRDE Y D   RF   
Sbjct: 1044 NLSRASNLSSPGRTRSISGRTLQSQIGRERLPDVALEGDKFHHRGRDEAYADSLHRFPRE 1103

Query: 948  RHQNLSPQTN----FSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYM 1007
            RH ++ P  N    F R RGR + RI++++G+ D     +   YN   P  Y   R K  
Sbjct: 1104 RH-HVQPSRNNRISFMRGRGRISSRIDTLRGDQDSECNFASEFYNG--PTEYRVVRHKNA 1163

Query: 1008 PAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVR 1067
             AVSD D + + Y    DG +   G   GR+IL+D+ P+F  +  RR+SPG RDGP   R
Sbjct: 1164 SAVSDADPNFSSYNNGQDGAYFGTG-RGGRKILNDDPPIFSQLPPRRRSPGGRDGPAG-R 1223

Query: 1068 GVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPP 1127
            G+ MV R+PRN+SPSRC  E GSELVG RH    MR F D+  DP++A  QPSFE    P
Sbjct: 1224 GLPMVRRVPRNLSPSRCIAEDGSELVGLRH----MRGFADDHTDPMFARCQPSFEGLDGP 1283

Query: 1128 FIRDRRNF-PIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHP-EMARRSPPPG 1187
            F+R  R F  +QR+   R  SKSP R R RSP  W   +R+S   FG P E+  R  PP 
Sbjct: 1284 FVRGNREFTSVQRRGIPRTRSKSPTRQRTRSPGPWSSLRRRSPDGFGGPLELPHRRSPPL 1343

Query: 1188 YRM---RSPDQPPQIHGDMPDRRHGF-PFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTD 1247
            YRM   RSPD+ P   G+M  RRHG  P+ S P NDLRD+  +RDHGH R G+ +R+ + 
Sbjct: 1344 YRMERIRSPDR-PCFAGEMGVRRHGSPPYLSRPSNDLRDLDPSRDHGHPRSGISNRSPSG 1403

Query: 1248 RMSFRN-RRFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQF 1286
            R+  RN RR + +DPR+ R E ++YF GP+  G+ ++L  DGN D+RRR+ DR   +  F
Sbjct: 1404 RILLRNSRRLDLVDPRE-RNEGDDYFGGPMPSGRFHDLGTDGNPDERRRYGDRRGPVRPF 1438

BLAST of Cp4.1LG13g09640 vs. TrEMBL
Match: V4VAV3_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10033332mg PE=4 SV=1)

HSP 1 Score: 378.6 bits (971), Expect = 4.8e-101
Identity = 284/740 (38.38%), Positives = 407/740 (55.00%), Query Frame = 1

Query: 569  SVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEE 628
            SV  D+++   S    S++ +  IV+D  Q       +T+         G  E S +D+E
Sbjct: 532  SVGHDINEANVSGIVDSTIAEDKIVDDPGQCR---LKNTNVGPTPPDSMGNGEGSASDDE 591

Query: 629  KISLSADLLEEDPYSSEYESDGKLDVNEAMDAVDNDI-EEDYEDGEVREPTLTTQVESSI 688
            KI+LS D+LEED Y S+YESDG LD+  AMD   + I EED+EDGEVREP   T +E   
Sbjct: 592  KINLSGDMLEEDSYGSDYESDGNLDLGTAMDTEQDGIREEDFEDGEVREPLADTTMEEPT 651

Query: 689  CETKKVKNFDHGDSSN------GLPGSDCCSSLVSVKQENKLEILDVKREDNLHSVTSNQ 748
            CE ++V+ F+  DS        GLP  D  +S     +++K E       + ++  +   
Sbjct: 652  CEKREVEPFNSDDSHKEQMSYVGLPSDDHPTSSYVENKDSKTEEPSEANYNIVNKFSETA 711

Query: 749  SSEQERSKELPVEEHTTR----VCLNKANKAKTSALEDQETSPEKASNGIEESITTVSQS 808
              E++ +++   ++H  +    V +     A     E+ E S ++A    + +  TV Q 
Sbjct: 712  HDEKKPNEDADDKDHVLQESQAVEMPTNGVANCPRSEETEQSTDQAPGSSQGNSATVVQG 771

Query: 809  DAEKVKTVDIVRNDNPALPNVEPL-NDDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKS 868
              E  K  D++  +  ALP VE   N DD T D   G + SRI++  + S SS P +T++
Sbjct: 772  SDEDTKNTDVIDKNISALPKVETSSNVDDATKDANSGGQKSRIIN-LRASISSSPGETRT 831

Query: 869  SLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQT---NFSR 928
              ARS+  +  R  +PD+  E +KL P+GRDE Y    ++   +RHQ+ S +    NF R
Sbjct: 832  ISARSLPARAGR--VPDVALEEDKLCPRGRDEIYTGDSRKLSRDRHQDQSSRNSRFNFMR 891

Query: 929  RRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKP 988
             RGR + RI++V+G WD     +P  YN   P  +   R KY    S  DI+ N Y    
Sbjct: 892  GRGRISSRIDTVRGNWDSERDFAPEFYNG--PAEFRIPRHKY---ASQTDIEFNSYNGGL 951

Query: 989  DGPFRSAGDHRG-RQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVKM--VHRMPRNISP 1048
             G F  AG  RG R+ L+D  P+F     RR+SPG R GPP VRG++M  VHR+PRNISP
Sbjct: 952  SGAF--AGTCRGGRKPLNDGAPVF---RPRRRSPGGRGGPP-VRGIEMDMVHRIPRNISP 1011

Query: 1049 SRCNRERGSELVGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPFIRDRRNF-PIQRK 1108
            SRC  E  SELVG RHGE+FMR   ++  +P+YAHPQ SFE     F+R  RNF  +QR+
Sbjct: 1012 SRCIGEGSSELVGLRHGEEFMRGLPNDNSNPIYAHPQASFEGIDSQFVRSNRNFLSVQRR 1071

Query: 1109 SFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGY---RMRSPDQPPQI 1168
               R+ SKSP  SR  +P  W P +R  + F GH E   +  PP +   RMRSPD+    
Sbjct: 1072 GLPRIRSKSPVASRTHAPRTWSPRRRSPDGFGGHSEFPNQRSPPMFRMERMRSPDR-SCF 1131

Query: 1169 HGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFEDM-DPR 1228
              +M  RRHG P+ S   N+LRDM S RD GH R  +  R+ + R+  RN R  DM DPR
Sbjct: 1132 PAEMVVRRHGSPYMSRQSNELRDMDSGRDLGHPRSVIPDRSPSGRVLLRNPRGLDMLDPR 1191

Query: 1229 DNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGENYHNDA 1286
            + R  ++++F  P+  G+  EL  DG +++RRR S+R   +  FRP  N ++GE++H +A
Sbjct: 1192 E-RTANDDFFGRPMRSGRYQELGADGTNEERRRLSERRGPVRPFRPPFNGAEGEDFHLNA 1251

BLAST of Cp4.1LG13g09640 vs. NCBI nr
Match: gi|778703228|ref|XP_011655341.1| (PREDICTED: uncharacterized protein LOC101204083 [Cucumis sativus])

HSP 1 Score: 454.1 bits (1167), Expect = 1.3e-123
Identity = 316/761 (41.52%), Positives = 435/761 (57.16%), Query Frame = 1

Query: 559  GYGCDGGLV--NSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSS- 618
            G+ CDG  +  N    D++    S +  +S   PV+     + N      + ++E   S 
Sbjct: 267  GFDCDGSFLQSNREKWDLNTSMESWEGCTSGDAPVVQISATRTNTTIETYSCSSEMVESD 326

Query: 619  -------LQGGEESSVNDEEKISLSAD------LLEEDPYSSEYESDGKLDVNEAMDAVD 678
                   L   E+   + +E + LS D      +L+EDPY SEYESDG  D+ E +D  D
Sbjct: 327  SPCGKQTLLDNEDKGDSTKEHLHLSLDSSYLKSVLDEDPYISEYESDGNWDIAETVDDND 386

Query: 679  NDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGD------SSNGLPGSDCCSSLVS 738
            N++EEDYEDGEVRE    T+VE  + E ++++  DH        +S GL   +  + L  
Sbjct: 387  NNVEEDYEDGEVRETMQETEVEVHVYEKREIEPLDHAGCNDKKINSVGLLDHEFFT-LGP 446

Query: 739  VKQENKLEILDVKREDN--LHSVTSNQSSEQERS----KELPVEEHTTRVCLNKANKA-- 798
             KQE KLE LD + ED   + + T + S EQE      KEL   E+     +N + KA  
Sbjct: 447  KKQETKLENLDYRSEDEDEVQTTTKSNSYEQENEDLCVKELHAVENAIGEDVNISAKATE 506

Query: 799  --------KTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEP 858
                    K    E Q T+ +K  N  EE + T SQ++ E    VD+V+N +  LP V+ 
Sbjct: 507  RSQLSQYDKKGNFEGQGTA-DKILN--EEPVPTFSQNEVENAVAVDVVQNRDLTLPTVKE 566

Query: 859  LNDDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEK 918
              ++D   DI  G+++SRI++  + S+ S P K KS+ A+ VL+  DRE +P+M  E   
Sbjct: 567  SVNEDDAKDINGGTRNSRIINFNRTSTDSTPCKAKSNFAKPVLSHKDREFVPNMVVERAN 626

Query: 919  LHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNY 978
            + PQ RD+ Y ++ ++  +++ Q   P   FS RRGR T R+++   EWDF P  SP  Y
Sbjct: 627  MKPQERDDVYSNISKKISIDKRQGPPPLMGFSHRRGRNTNRLDNRSEEWDFGPNFSPETY 686

Query: 979  NDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMA 1038
            ++Q    +               +DQN YK+ PDGPF  A + RGR++++DE P F H  
Sbjct: 687  SEQQIDYHVT------------GLDQNRYKIIPDGPFGGA-NRRGRELVEDEEPFFFHGP 746

Query: 1039 SRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMD 1098
            SRRKSPGRR G   VRG KMV+RMPR+ SP RC  E GS     +HGEKF R F D+T+D
Sbjct: 747  SRRKSPGRRHG-HSVRGGKMVNRMPRDFSPGRCMDEGGS--FDRQHGEKFTRNFADDTVD 806

Query: 1099 PLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERF 1158
             +Y  PQP ++VDR PF R+RRNF  QRK+F ++DSKSP RSR RSPSQWF SKR S+RF
Sbjct: 807  EMYPRPQPPYDVDR-PFFRERRNFSFQRKTFPKIDSKSPVRSRARSPSQWFSSKR-SDRF 866

Query: 1159 FGHPEMARRSPPPGY--RMRSPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGH 1218
               P M  R  P     RMRSPDQ   I G MP +R GF + S PP++LRD+G A DHGH
Sbjct: 867  CERPNMTHRRSPNYMTDRMRSPDQ-RSIRGYMPGQRQGFRYLS-PPDELRDVGPAPDHGH 926

Query: 1219 MRPGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRR 1278
            MRP + +RN+T R+  RNR ++ +DPR  RIE++  F GPV  GQ+        DDD RR
Sbjct: 927  MRPFIPNRNQTKRLPLRNRSYDAIDPR-GRIENDGLFYGPVRLGQLTGYNGGEPDDDERR 986

Query: 1279 FSDRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTEDE 1280
            F++RHE LH F+    DSDGE Y N  ++ +RP+R+C ED+
Sbjct: 987  FNERHEPLHSFKHGFRDSDGERYRNKGEDCSRPFRFCAEDD 1002

BLAST of Cp4.1LG13g09640 vs. NCBI nr
Match: gi|659127128|ref|XP_008463540.1| (PREDICTED: uncharacterized protein LOC103501669 isoform X2 [Cucumis melo])

HSP 1 Score: 452.2 bits (1162), Expect = 4.9e-123
Identity = 315/759 (41.50%), Positives = 431/759 (56.79%), Query Frame = 1

Query: 559  GYGCDGGLV--NSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSS- 618
            G+ CDG  +  N    D++    S +  +S   PV+     + N        ++E   S 
Sbjct: 264  GFDCDGSFLQSNREKWDLNTSMESWEGCTSGDDPVVQISTTRTNTTTETYACSSEMVESD 323

Query: 619  -------LQGGEESSVNDEEKISLSAD------LLEEDPYSSEYESDGKLDVNEAMDAVD 678
                   L   E+   + +E + LS D      +L+EDPY SEYESDG  D+ E +D  D
Sbjct: 324  SPCRKQTLLDSEDKVDSTKEHLHLSLDSSYLKSVLDEDPYISEYESDGNWDIAETVDDND 383

Query: 679  NDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGD------SSNGLPGSDCCSSLVS 738
            N++EEDYEDGEVRE     +VE  + E ++++  DH        +S GL   +  + L  
Sbjct: 384  NNVEEDYEDGEVRETMQENEVEVHVHEKREIEPLDHAGCNEEKINSVGLLDHEFFT-LGP 443

Query: 739  VKQENKLEILDVKREDNLHSVTSNQSSEQERS----KELPVEEHTTRVCLNKANKA---- 798
             +QE K E LD + ED + + T ++S EQE      KEL   E+     +N + KA    
Sbjct: 444  QEQETKSENLDYRSEDEVQTTTKSKSYEQENEDLCVKELHAVENAISEDVNISAKATGRI 503

Query: 799  ------KTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLN 858
                  K    E Q T+ +K  N  EE I T SQ + E    VD+V+N +  LP V    
Sbjct: 504  QLSQYDKKGNFEGQGTA-DKIIN--EEPIPTFSQDEVENAVAVDVVQNRDLTLPTVNESV 563

Query: 859  DDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLH 918
              D T DI  G+++SRI++  + S+ S P K KSS  R VL+  DRE +P+MG E   + 
Sbjct: 564  TRDDTKDINGGTRNSRIINFNRTSTDSTPCKAKSSFVRPVLSHKDREFVPNMGVEEANMK 623

Query: 919  PQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYND 978
            PQ RD+ Y ++ ++  +++ Q   P   FS RRGR+T R+++   EWDF    SP  Y++
Sbjct: 624  PQERDDVYSNITKKISIDKRQGPPPLMGFSHRRGRYTNRLDNRSEEWDFGANFSPEIYSE 683

Query: 979  QVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASR 1038
            Q    + A              D+N YK+ PDGPF  A + RGR++++DE P F H  SR
Sbjct: 684  QQIDYHVA------------GFDKNRYKIIPDGPFGGA-NRRGRELVEDEEPFFFHGPSR 743

Query: 1039 RKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPL 1098
            RKSPGRR G   VRG KMV+RMPR+ SP RC  E GS     +HGEKF R+F D+T+D +
Sbjct: 744  RKSPGRRHG-HSVRGGKMVNRMPRDFSPGRCMDEGGS--FDRQHGEKFTRSFADDTVDGM 803

Query: 1099 YAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFG 1158
            Y  PQP ++VDR PF R+RRNF  QRK+F ++DSKSP RSR RSPSQWF SKR S+RF  
Sbjct: 804  YPRPQPPYDVDR-PFFRERRNFSFQRKTFPKIDSKSPVRSRARSPSQWFSSKR-SDRFCE 863

Query: 1159 HPEMARRSPPPGY--RMRSPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMR 1218
             P M  +  P     RMRSPDQ   I G MP +R GF + S PP++LRD+GSA DHGHMR
Sbjct: 864  RPNMTHQRSPNYMTDRMRSPDQ-CSIRGYMPGQRQGFRYLS-PPDELRDVGSAPDHGHMR 923

Query: 1219 PGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFS 1278
            P + +RN+T R+  RNR ++ +DPR  RIE +  F GPV  GQ+        DDD RRF+
Sbjct: 924  PFIPNRNQTKRLPLRNRSYDAIDPR-GRIEDDGLFYGPVRLGQLTGYNGGKPDDDERRFN 983

Query: 1279 DRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTEDE 1280
            +RHE LH F+    DSDG+ Y N  ++ +RP+R+C ED+
Sbjct: 984  ERHEPLHSFKHGFRDSDGDRYRNKGEDCSRPFRFCAEDD 997

BLAST of Cp4.1LG13g09640 vs. NCBI nr
Match: gi|659127134|ref|XP_008463543.1| (PREDICTED: uncharacterized protein LOC103501669 isoform X3 [Cucumis melo])

HSP 1 Score: 452.2 bits (1162), Expect = 4.9e-123
Identity = 315/759 (41.50%), Positives = 431/759 (56.79%), Query Frame = 1

Query: 559  GYGCDGGLV--NSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSS- 618
            G+ CDG  +  N    D++    S +  +S   PV+     + N        ++E   S 
Sbjct: 242  GFDCDGSFLQSNREKWDLNTSMESWEGCTSGDDPVVQISTTRTNTTTETYACSSEMVESD 301

Query: 619  -------LQGGEESSVNDEEKISLSAD------LLEEDPYSSEYESDGKLDVNEAMDAVD 678
                   L   E+   + +E + LS D      +L+EDPY SEYESDG  D+ E +D  D
Sbjct: 302  SPCRKQTLLDSEDKVDSTKEHLHLSLDSSYLKSVLDEDPYISEYESDGNWDIAETVDDND 361

Query: 679  NDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGD------SSNGLPGSDCCSSLVS 738
            N++EEDYEDGEVRE     +VE  + E ++++  DH        +S GL   +  + L  
Sbjct: 362  NNVEEDYEDGEVRETMQENEVEVHVHEKREIEPLDHAGCNEEKINSVGLLDHEFFT-LGP 421

Query: 739  VKQENKLEILDVKREDNLHSVTSNQSSEQERS----KELPVEEHTTRVCLNKANKA---- 798
             +QE K E LD + ED + + T ++S EQE      KEL   E+     +N + KA    
Sbjct: 422  QEQETKSENLDYRSEDEVQTTTKSKSYEQENEDLCVKELHAVENAISEDVNISAKATGRI 481

Query: 799  ------KTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLN 858
                  K    E Q T+ +K  N  EE I T SQ + E    VD+V+N +  LP V    
Sbjct: 482  QLSQYDKKGNFEGQGTA-DKIIN--EEPIPTFSQDEVENAVAVDVVQNRDLTLPTVNESV 541

Query: 859  DDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLH 918
              D T DI  G+++SRI++  + S+ S P K KSS  R VL+  DRE +P+MG E   + 
Sbjct: 542  TRDDTKDINGGTRNSRIINFNRTSTDSTPCKAKSSFVRPVLSHKDREFVPNMGVEEANMK 601

Query: 919  PQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYND 978
            PQ RD+ Y ++ ++  +++ Q   P   FS RRGR+T R+++   EWDF    SP  Y++
Sbjct: 602  PQERDDVYSNITKKISIDKRQGPPPLMGFSHRRGRYTNRLDNRSEEWDFGANFSPEIYSE 661

Query: 979  QVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASR 1038
            Q    + A              D+N YK+ PDGPF  A + RGR++++DE P F H  SR
Sbjct: 662  QQIDYHVA------------GFDKNRYKIIPDGPFGGA-NRRGRELVEDEEPFFFHGPSR 721

Query: 1039 RKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPL 1098
            RKSPGRR G   VRG KMV+RMPR+ SP RC  E GS     +HGEKF R+F D+T+D +
Sbjct: 722  RKSPGRRHG-HSVRGGKMVNRMPRDFSPGRCMDEGGS--FDRQHGEKFTRSFADDTVDGM 781

Query: 1099 YAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFG 1158
            Y  PQP ++VDR PF R+RRNF  QRK+F ++DSKSP RSR RSPSQWF SKR S+RF  
Sbjct: 782  YPRPQPPYDVDR-PFFRERRNFSFQRKTFPKIDSKSPVRSRARSPSQWFSSKR-SDRFCE 841

Query: 1159 HPEMARRSPPPGY--RMRSPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMR 1218
             P M  +  P     RMRSPDQ   I G MP +R GF + S PP++LRD+GSA DHGHMR
Sbjct: 842  RPNMTHQRSPNYMTDRMRSPDQ-CSIRGYMPGQRQGFRYLS-PPDELRDVGSAPDHGHMR 901

Query: 1219 PGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFS 1278
            P + +RN+T R+  RNR ++ +DPR  RIE +  F GPV  GQ+        DDD RRF+
Sbjct: 902  PFIPNRNQTKRLPLRNRSYDAIDPR-GRIEDDGLFYGPVRLGQLTGYNGGKPDDDERRFN 961

Query: 1279 DRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTEDE 1280
            +RHE LH F+    DSDG+ Y N  ++ +RP+R+C ED+
Sbjct: 962  ERHEPLHSFKHGFRDSDGDRYRNKGEDCSRPFRFCAEDD 975

BLAST of Cp4.1LG13g09640 vs. NCBI nr
Match: gi|659127120|ref|XP_008463536.1| (PREDICTED: uncharacterized protein LOC103501669 isoform X1 [Cucumis melo])

HSP 1 Score: 452.2 bits (1162), Expect = 4.9e-123
Identity = 315/759 (41.50%), Positives = 431/759 (56.79%), Query Frame = 1

Query: 559  GYGCDGGLV--NSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSS- 618
            G+ CDG  +  N    D++    S +  +S   PV+     + N        ++E   S 
Sbjct: 271  GFDCDGSFLQSNREKWDLNTSMESWEGCTSGDDPVVQISTTRTNTTTETYACSSEMVESD 330

Query: 619  -------LQGGEESSVNDEEKISLSAD------LLEEDPYSSEYESDGKLDVNEAMDAVD 678
                   L   E+   + +E + LS D      +L+EDPY SEYESDG  D+ E +D  D
Sbjct: 331  SPCRKQTLLDSEDKVDSTKEHLHLSLDSSYLKSVLDEDPYISEYESDGNWDIAETVDDND 390

Query: 679  NDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGD------SSNGLPGSDCCSSLVS 738
            N++EEDYEDGEVRE     +VE  + E ++++  DH        +S GL   +  + L  
Sbjct: 391  NNVEEDYEDGEVRETMQENEVEVHVHEKREIEPLDHAGCNEEKINSVGLLDHEFFT-LGP 450

Query: 739  VKQENKLEILDVKREDNLHSVTSNQSSEQERS----KELPVEEHTTRVCLNKANKA---- 798
             +QE K E LD + ED + + T ++S EQE      KEL   E+     +N + KA    
Sbjct: 451  QEQETKSENLDYRSEDEVQTTTKSKSYEQENEDLCVKELHAVENAISEDVNISAKATGRI 510

Query: 799  ------KTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNVEPLN 858
                  K    E Q T+ +K  N  EE I T SQ + E    VD+V+N +  LP V    
Sbjct: 511  QLSQYDKKGNFEGQGTA-DKIIN--EEPIPTFSQDEVENAVAVDVVQNRDLTLPTVNESV 570

Query: 859  DDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLH 918
              D T DI  G+++SRI++  + S+ S P K KSS  R VL+  DRE +P+MG E   + 
Sbjct: 571  TRDDTKDINGGTRNSRIINFNRTSTDSTPCKAKSSFVRPVLSHKDREFVPNMGVEEANMK 630

Query: 919  PQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYND 978
            PQ RD+ Y ++ ++  +++ Q   P   FS RRGR+T R+++   EWDF    SP  Y++
Sbjct: 631  PQERDDVYSNITKKISIDKRQGPPPLMGFSHRRGRYTNRLDNRSEEWDFGANFSPEIYSE 690

Query: 979  QVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASR 1038
            Q    + A              D+N YK+ PDGPF  A + RGR++++DE P F H  SR
Sbjct: 691  QQIDYHVA------------GFDKNRYKIIPDGPFGGA-NRRGRELVEDEEPFFFHGPSR 750

Query: 1039 RKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTFEDETMDPL 1098
            RKSPGRR G   VRG KMV+RMPR+ SP RC  E GS     +HGEKF R+F D+T+D +
Sbjct: 751  RKSPGRRHG-HSVRGGKMVNRMPRDFSPGRCMDEGGS--FDRQHGEKFTRSFADDTVDGM 810

Query: 1099 YAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFG 1158
            Y  PQP ++VDR PF R+RRNF  QRK+F ++DSKSP RSR RSPSQWF SKR S+RF  
Sbjct: 811  YPRPQPPYDVDR-PFFRERRNFSFQRKTFPKIDSKSPVRSRARSPSQWFSSKR-SDRFCE 870

Query: 1159 HPEMARRSPPPGY--RMRSPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMR 1218
             P M  +  P     RMRSPDQ   I G MP +R GF + S PP++LRD+GSA DHGHMR
Sbjct: 871  RPNMTHQRSPNYMTDRMRSPDQ-CSIRGYMPGQRQGFRYLS-PPDELRDVGSAPDHGHMR 930

Query: 1219 PGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFS 1278
            P + +RN+T R+  RNR ++ +DPR  RIE +  F GPV  GQ+        DDD RRF+
Sbjct: 931  PFIPNRNQTKRLPLRNRSYDAIDPR-GRIEDDGLFYGPVRLGQLTGYNGGKPDDDERRFN 990

Query: 1279 DRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTEDE 1280
            +RHE LH F+    DSDG+ Y N  ++ +RP+R+C ED+
Sbjct: 991  ERHEPLHSFKHGFRDSDGDRYRNKGEDCSRPFRFCAEDD 1004

BLAST of Cp4.1LG13g09640 vs. NCBI nr
Match: gi|700196074|gb|KGN51251.1| (hypothetical protein Csa_5G504130 [Cucumis sativus])

HSP 1 Score: 445.7 bits (1145), Expect = 4.6e-121
Identity = 316/772 (40.93%), Positives = 435/772 (56.35%), Query Frame = 1

Query: 559  GYGCDGGLV--NSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQCSS- 618
            G+ CDG  +  N    D++    S +  +S   PV+     + N      + ++E   S 
Sbjct: 267  GFDCDGSFLQSNREKWDLNTSMESWEGCTSGDAPVVQISATRTNTTIETYSCSSEMVESD 326

Query: 619  -------LQGGEESSVNDEEKISLSAD------LLEEDPYSSEYESDGKLDVNEAMDAVD 678
                   L   E+   + +E + LS D      +L+EDPY SEYESDG  D+ E +D  D
Sbjct: 327  SPCGKQTLLDNEDKGDSTKEHLHLSLDSSYLKSVLDEDPYISEYESDGNWDIAETVDDND 386

Query: 679  -----------NDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGD------SSNGL 738
                       N++EEDYEDGEVRE    T+VE  + E ++++  DH        +S GL
Sbjct: 387  DNDDNDNDDNDNNVEEDYEDGEVRETMQETEVEVHVYEKREIEPLDHAGCNDKKINSVGL 446

Query: 739  PGSDCCSSLVSVKQENKLEILDVKREDN--LHSVTSNQSSEQERS----KELPVEEHTTR 798
               +  + L   KQE KLE LD + ED   + + T + S EQE      KEL   E+   
Sbjct: 447  LDHEFFT-LGPKKQETKLENLDYRSEDEDEVQTTTKSNSYEQENEDLCVKELHAVENAIG 506

Query: 799  VCLNKANKA----------KTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVR 858
              +N + KA          K    E Q T+ +K  N  EE + T SQ++ E    VD+V+
Sbjct: 507  EDVNISAKATERSQLSQYDKKGNFEGQGTA-DKILN--EEPVPTFSQNEVENAVAVDVVQ 566

Query: 859  NDNPALPNVEPLNDDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRE 918
            N +  LP V+   ++D   DI  G+++SRI++  + S+ S P K KS+ A+ VL+  DRE
Sbjct: 567  NRDLTLPTVKESVNEDDAKDINGGTRNSRIINFNRTSTDSTPCKAKSNFAKPVLSHKDRE 626

Query: 919  RIPDMGHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEW 978
             +P+M  E   + PQ RD+ Y ++ ++  +++ Q   P   FS RRGR T R+++   EW
Sbjct: 627  FVPNMVVERANMKPQERDDVYSNISKKISIDKRQGPPPLMGFSHRRGRNTNRLDNRSEEW 686

Query: 979  DFNPTISPGNYNDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQIL 1038
            DF P  SP  Y++Q    +               +DQN YK+ PDGPF  A + RGR+++
Sbjct: 687  DFGPNFSPETYSEQQIDYHVT------------GLDQNRYKIIPDGPFGGA-NRRGRELV 746

Query: 1039 DDEGPLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEK 1098
            +DE P F H  SRRKSPGRR G   VRG KMV+RMPR+ SP RC  E GS     +HGEK
Sbjct: 747  EDEEPFFFHGPSRRKSPGRRHG-HSVRGGKMVNRMPRDFSPGRCMDEGGS--FDRQHGEK 806

Query: 1099 FMRTFEDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQ 1158
            F R F D+T+D +Y  PQP ++VDR PF R+RRNF  QRK+F ++DSKSP RSR RSPSQ
Sbjct: 807  FTRNFADDTVDEMYPRPQPPYDVDR-PFFRERRNFSFQRKTFPKIDSKSPVRSRARSPSQ 866

Query: 1159 WFPSKRKSERFFGHPEMARRSPPPGY--RMRSPDQPPQIHGDMPDRRHGFPFPSLPPNDL 1218
            WF SKR S+RF   P M  R  P     RMRSPDQ   I G MP +R GF + S PP++L
Sbjct: 867  WFSSKR-SDRFCERPNMTHRRSPNYMTDRMRSPDQ-RSIRGYMPGQRQGFRYLS-PPDEL 926

Query: 1219 RDMGSARDHGHMRPGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNEL 1278
            RD+G A DHGHMRP + +RN+T R+  RNR ++ +DPR  RIE++  F GPV  GQ+   
Sbjct: 927  RDVGPAPDHGHMRPFIPNRNQTKRLPLRNRSYDAIDPR-GRIENDGLFYGPVRLGQLTGY 986

Query: 1279 IDDGNDDDRRRFSDRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTEDE 1280
                 DDD RRF++RHE LH F+    DSDGE Y N  ++ +RP+R+C ED+
Sbjct: 987  NGGEPDDDERRFNERHEPLHSFKHGFRDSDGERYRNKGEDCSRPFRFCAEDD 1013

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0KU39_CUCSA  3.2e-121  40.93     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G504130 PE=4 SV=1
A0A0A0KNM2_CUCSA  3.0e-119  40.60     Uncharacterized protein OS=Cucumis sativus GN=Csa_5G503580 PE=4 SV=1
A0A0B0PJI5_GOSAR  9.4e-113  31.06     Putative sucrose-phosphate synthase 2 OS=Gossypium arboreum GN=F383_10325 PE=4 S...
A0A0D2V2P5_GOSRA  3.7e-109  31.10     Uncharacterized protein OS=Gossypium raimondii GN=B456_012G066900 PE=4 SV=1
V4VAV3_9ROSI      4.8e-101  38.38     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10033332mg PE=4 SV=1

Match Name                        E-value   Identity  Description
gi|778703228|ref|XP_011655341.1|  1.3e-123  41.52     PREDICTED: uncharacterized protein LOC101204083 [Cucumis sativus]
gi|659127128|ref|XP_008463540.1|  4.9e-123  41.50     PREDICTED: uncharacterized protein LOC103501669 isoform X2 [Cucumis melo]
gi|659127134|ref|XP_008463543.1|  4.9e-123  41.50     PREDICTED: uncharacterized protein LOC103501669 isoform X3 [Cucumis melo]
gi|659127120|ref|XP_008463536.1|  4.9e-123  41.50     PREDICTED: uncharacterized protein LOC103501669 isoform X1 [Cucumis melo]
gi|700196074|gb|KGN51251.1|       4.6e-121  40.93     hypothetical protein Csa_5G504130 [Cucumis sativus]
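
The Identity and Positives percentages reported in the HSP headers above are the matched-residue counts divided by the alignment length (for example, 401/1291 for the Gossypium arboreum hit). The following is a minimal sketch for re-deriving them from the header text, assuming the two header lines are available as a single string; the helper name `parse_hsp_stats` and the regular expression are illustrative assumptions and are not part of BLAST or of this database.

```python
import re

# Hypothetical pattern for the two-line HSP header as printed on this page.
# "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp_stats(header: str) -> dict:
    """Extract scores and recompute percentages from one HSP header."""
    m = HSP_RE.search(header)
    if m is None:
        raise ValueError("no HSP header found")
    length = int(m["length"])
    return {
        "bit_score": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        "align_length": length,
        # Percentages are counts over alignment length, rounded to 2 places.
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


header = (
    "HSP 1 Score: 417.5 bits (1072), Expect = 9.4e-113\n"
    "Identity = 401/1291 (31.06%), Positives = 616/1291 (47.71%), Query Frame = 1"
)
stats = parse_hsp_stats(header)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing the percentages this way is a quick consistency check between the alignment headers and the Identity column of the summary tables.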
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG13g09640.1  Cp4.1LG13g09640.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PANTHER  PTHR34536      FAMILY NOT NAMED     coord: 8..1287; score: 5.1E
None      No IPR available  PANTHER  PTHR34536:SF1  SUBFAMILY NOT NAMED  coord: 8..1287; score: 5.1E

The following gene(s) are orthologous to this gene:
Gene             Orthologue      Organism                    Block
Cp4.1LG13g09640  Csa5G504130     Cucumber (Chinese Long) v2  cpecuB176
Cp4.1LG13g09640  MELO3C025488    Melon (DHL92) v3.5.1        cpemeB155
Cp4.1LG13g09640  Lsi02G016840    Bottle gourd (USVL1VR-Ls)   cpelsiB141
Cp4.1LG13g09640  MELO3C025488.2  Melon (DHL92) v3.6.1        cpemedB182
Cp4.1LG13g09640  Bhi06G000596    Wax gourd                   cpewgoB0235
Cp4.1LG13g09640  Carg04521       Silver-seed gourd           carcpeB0961
The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG13g09640  Cp4.1LG01g24120  Cucurbita pepo (Zucchini)  cpecpeB211