Lsi08G005000 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi08G005000
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Gag-pro-like protein
Location: chr08 : 12873889 .. 12884770 (+)
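
The location field can be split into chromosome, coordinates, and strand. The sketch below is one way to do that, not part of the database itself: the function name parse_location is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Parse a string such as 'chr08 : 12873889 .. 12884770 (+)'.

    Hypothetical helper. An optional leading 'Location' label (with or
    without a colon) is tolerated so the raw field can be passed as-is.
    """
    pattern = (
        r"\s*(?:Location:?\s*)?"      # optional field label
        r"(\S+)\s*:\s*"               # chromosome name
        r"(\d+)\s*\.\.\s*(\d+)\s*"    # start .. end
        r"\(([+-])\)\s*"              # strand
    )
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("chr08 : 12873889 .. 12884770 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # chr08 + 10882
```

Under the inclusive-coordinate assumption, the feature spans 10882 bases on the plus strand of chr08.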
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
TCTAGTAACAATGAGAAGTCCAGCAGAGAGGCTTATTAAGATCATAATTTCATTATCAGCAACTTTGTTTGCGGCGGCAGCGGCGGTGGAGGAGTTAAAGGACTGCGACGAATGGTGTGGGAACTTGAAGATTCCATATCCATTTGGGATGAAGGAAGGGTGTTATCTTAACAAAAATTTCTCCATTACTTGCAACAAAACTCATTATAATCCACCAAAAGCATTTCTACAAGACGGCAACATATATATCACCAATATATCCATAATCCAAGGCCAGCTCCACATCTTGCAATTTGTAGCCACAGATTGCTACACAAAAAATGGTCATCTTGAATCCTACAATACCACTCTCCAAATCCGCACATTCACAATTTCCAACACCATGAACAAGTTCACAGTCGTTGGCTGCGATACTTACGCTCTTATTTCTGGTATACTTGAGAAAGAATCCTACATAAGTGGGTGCATGGCGTTCTGTGGAAACAGTACTGAAACTATAAGAGATGGGTCGTGGTGGTGAGTGCTGTCAGTTGGAGATTTCTAAAGACCTAAAATATTTGGAGGTTGAGGTGAGAAGCGTCAACAATCATAGTGACGTACTCAGTTTAATCCATGTGGGTATGCTTTTGTAATCCAACAAGACAAATTCAATTTCTCCAAAAAATATATTCATAATTTTACACAAGAGAAAGTTTCGTTGGTGCTTGATTATGGACCATCCCAACCAATACTTCTTGTTTAAACAAAAGCAATTGCAGTATATGTGGACTAAACACCAAAATGATTAACTTTCTCGATGATGGATCTCAATATCGTCGCCAGTGTTTGGAAGGCTTTGAGGGAAATCCATATCTCCAAAAAGGTTGCCAAGGTAAAATTATTATTTCTCTTTTCCAAATTTTTTTAGTCTTTTTACTTCATGTGGTGGGGTGGTTTGATATATATCAAACCTTCTAAGTGAGGATCTCACATTGAAGAGATCTCAAACCAAGCTCATTCGATACAAAAATAAAAATTATGATCCAATCTAAGAATGGTGAACCCAAAGATACGTTTGAAGGACATGTTGAAGATCAGACATATAAAGAGAAAATCAAGGGATCTTACAATATTTATAAGATAGGTGTAGGCTATCCCTCTCATTATCAATTAAATTTGAGATGGAGCCTCATGTTAATCTAATACTTTACTCATAGTCAATTGGTTTTGAGATGAAATCAAACATATTGGGATTCGATTCAAGAATAGTGAACCTCGAAAAATAATTTCTATAAGATACGTTCATTACTCTACTCATAGCCAATTAATTTTGAGATTGAACATCATATTTATCTAATAGAGTAAAAATTCTACGTAAGCTGAACTATATGCTTAATTACTATGACCTTTTTCTTTTCTTTTTTACATATAGTTTAATGGACATTACTTTTGGCATTTTACAGATATAGATGAATGTAAGAACGGAAGCAATAAATGTAAATACAAAGAGTTGTGTGTTAACGAACCAGGAAACTATACTTGCCATTGTCCTAAGAACCATAAAGGAGATGGCAGATGTGGAGGAGAAGGCTGCACTCGAAACCCCATGTCTTCGATTCATCTCATCGTCGGTGAGTTTTTTACATATAGTATATTATTGGGATGTAGACATCGTATTTAGTCCGGCTCTTTTTCTAAAAAAATAATGTTTGATTTTGATCGAATTTTGATAATTTAATGAAACTAGAAGTTTAGTTTTTTTTTTTTTTTTTTAATAAAGGGGTAATAACAAATATTTAAAAATGAAGGGGTTGGTTATTAAATTTAAGGTTTTATTTATTTAGTTATTTATTATTTTAGAAAAATTGGTTATTTATGTATTTATTTATTTATTATTATTTTTTTTAAAAAAAGAAAGGTTTTTCATCCTCTCTCTCCATTCAAACTTACCCCTTCAACGATTTTATTTTTCTCTCCCTTTCTTTCCTCAACTTCTCAACGTTCAAATTCTATTTTTTTTA
AAAAAAAAATTAACCTATAAATATAGGAGTGAAGGGCTATGAAAAGAGGGAAAAAAAGGAAGTGAAATAAGAAGTGAAAGTAAGGAAAAAAAAAGAGGTGGAAGAAAAGGAAGTGAAATAAGAAGTGAAAGTCTGGAAAAAAAAGAGGTGAAAGAAAGAAGGAAAAGAAGTGAAAGATTGGGATATTAGAGAAGGAAAGTTGGGAAGAAAAAAAAGTAAAGAAACGAGGTGTAAAAAGGTAGGAGAAGACCCAACGGCCACCCACCGGCGGATCTCCGAGGACAATCTCCGGCGAAACTCCGTGAATAATCTCCGGCGACCGGCGGCGAATCCATTGGTAGGAAAGAGTAAAAGAAGAGAAAGTGGAGAAGGTGGAGGAGGAAAAAGCAAAGGAAGAAGAAAAGGAAGAAGAAGAGAAATCTGGGAAGAAGAAGAGGGGAAGATATTTGGTGGGTCCCACGTACTTTATTATATATATATATATATTTATTTATTTAAAAAATAGTGGGTCCTACGTACTTTATTATGTATATATATCTATTTATTTAAATCAACAGTGGGTCCCACTTATGTTTTCAAACCTTTTCTCTCCCTTTCATTTTTACCATAGTTATTCATTTATTATTATTATTATTATTTTTTATATATTTATTATCATATCTTTTCCTTTTTCTCACTATTTCATTCATCCTATTGTTCACTAAGGTTTTTTTTTTCTCATATACTCTCAAACATTTTACTTTTTGATGTTCCATGAATGTTGCTTAGTAAAATTGGATTCTCAATTTCATGCATTAAACCTTAAATTGTGAATTTTTGTTAACATAAGACTTCCATGATTTTGGTATTTTAATCAATTTAAGTGAAAATGTAACTCTTTAACTTTACTACTCTCAAGTATGTTTTTTTTTTTTTAAATTATTTTCTCTAATGTCGAAATTTCTTTTCCAAATTCTTGGATTTTAAACTTTAAACACAAACATTTTCCATATTATAATTGAGCAAAATGAATTGAATAGAATTTGGAGAGCATCTCCTAATTTAAGTTAGAATTGCAAGTGAATTTCAAAACTACTTGATGTTGAGATATTTAAATGAGTTTCAAATTAGTTATTACTTAAGTTTCCAAAGTAAAACTTTAGTTCAATTTAAATTCTGGAATTTTCAAATGAATTTCAAATGCATTTAAAGTTTTTCATAAATCTCATGATAGAGTCAATAGGGCTCTAGAGGAAAATAGTTCATAAGAGAATTCGAACATAAATTGATGTACATCATCACTCATGAATTTCAACTATACTTTATTATTAAAGTTATAGTAAAATTTAGTGAGACATTACCATTATTTTTTTTTATTTTTTATCAAATATTTCTAAAATACAATTGACTATAATAATTATGTCTAGGCTGAGGTACTTAAGTTGACCGTAAGGGAACACCTCTATCTGGGAATTACTTTAGTCAATTTTTTTTTATTATTTTTACAATTTTGGATAATAGAAATAAATGACTTCAAACGGGCGTTGTAGGGTGCTAATACCTTCCTACACTCAACGACCCCCGAACCTATCTTTGGATAATTAGACCATTTTTTTTGTTTTCTTTTTAAATGGGTGACCAATCACACCCAAAAATGATTGGTGGCGACTCCAAACCTTTCACCTCAAAAGAGACCCCTAGGGAAGGACGTCGGCCGTTCCGCGTCGCGATGACATTGCGACAGCTTGGCGACTCCACTGGGGACCTTAGAGGGTCAAAGCCGTTATGTTTTCAATTCTATTATCTTTTATTTTTATTTTTATTTTTTTGTTTTTCTTCCTTTCAAAATTAGTTTTTATTATCACCTATTTATTTATTTTAGTATTATTTACTTTCTAGTGTAAAATTATTATTTATGCCTTATTCTTATTATTTGTTTTAACTTTATACTTATTGTTATTATTTTTATCTATTTATTTATCTCATAGATTAATGTGTTACATGATCTCTTATTATATTTT
AATGCATTTCACACATTCACAAGCCTCCTACCGGGCTATACCTTTTAGGATTAAATTTGGAGGCTGAGTAGTGTGACCTTCGTGGAACCTANTATACCTTTTAGGATTAAATTTGGAGGTTGAGTAGTGTGACCTTTGTGGAACCTGGTTCCCGTGCAGGTGCATGAAAATCTCCACTCAAGTCAACTGCTGTGAAAACAGTTGGAGGGCTTGGGGACCTTATTTCCTTTAGGATATATTAGAACTTAACCTCACAATCATACATACATGGATTTGATCTTGGAGTATTGGTTTAAATTTGCTTTGTGCAGATCTTCGTCAAATTTTTTTTTCTCGTGAAATTAATCAAGATGTTATCTTTCGTTTATTTTGATTATGGTTCTCTATTCTTGCTTTTTTTTTATTATGTTGATTGTTAAATTACATTTATTTTTCTAGTATTATCATGCATTCCATGTTAAATTACATTTATTTTTCTAATATTATCATTCATTTCATGCATGTTCACCTAGTTTATTATTATATGACGTTAATTTTATTTTTTTATTTATTTATTATTTTATTTATTATTTATTATTATTATTTTGTACTATTATCATGCATTCAACGCATGTTCACCTGGTTTATTATTAAAATGTATTTATTTCTCTTTAGTATTATCATGCATTTCATCCATGTTCAACCAGTTTATGATTAAAATACATTTATTTCTCTTCAGCATTATCATGCATTTCACACATTCACAAGCCTCCTGCCCAGGCTATACCTTTTAGGGTTAGATTCGGAGGCTGAGTAGTGCGACCTTCGTGGAACCTGGTTCCCATGCAGGCGCATGAAAATCTCCACTCAAGTCAACTATGCTGTGAAAGCAGTTGAGAGGCTTGGGGACCACTTCCTGTAGGATTTGTTAGATGTTAACCTCACACTCATAATTGTTTGGCATTCTAGAGTCTTTATGTGTCTTTTCAAAATAAGTGTCATGACAACCTTTGATTTTGACTGTAGATTAAAAATGATTTCAATAAAATCGCCTCCTTTAGTACCAACCCCACAAGAATTGAGCATAGAAGATCAGTCTGTAGTGCATCAGTGGTCAGAAAGCATCCAGAGAGCTCATGGGGATTGCTTAGTATCTGGTGAGATTATAAAGATCAAAGACATTAATATTCCTGAGAATCAGTTGGATGCTCTAAAGCTAGCATGGGAAGGTTTAGCCACAGCTAGGAAGGAGAGATTCATCAGCAAGTATGGACAGATGGCCCAACTTCTCTATGTGCAGGTTAATGTTTCTGTGTTGAAGGCTTTAGTACGACATTGGGACCCAAACTATCGATGCTTCACATTTAACTCGATAGACATGACTCCAACTATTGAAGAGTATCAGTCTCTTCTAAGAACACCATCACAGGAGAAAGTGAGGGCATATTCCTACAATGGGTCCCTTACAATGAAAAGGGCGTTGTCGTCATTTTTAGGCAAAATTTGTTCAAATGAAATCGAAAAACATGTGAAGACGAAAGGGAAAAGTGTGTGCTTGCCTATGGAGTACATACTTTCTCTTCAACAGAGATTTACAAATGAAGACAAAGGATTATCACTATTGGCCTTATGTATTTTCAATGCAGTCTTATTCCCGAACGTAATTGGGTATGTCGAAGAACAGGTAGTCAAGTTGTTTTTGAAAGTAGAGGAAGGTGTAGATCCTGTCATACCAATATTAGCTGAGACATTTCGGTCGTTGAATCATTGCAGAATAGAAAAAACAGGGAATTTCATTGGTTGCGCTTCGTTGCTGTATATTTAGATTTTAGACCATGTGAATTGTCCGTCAGAGTTCAAGTGTCATCAAATTAAATTTTAAAAATCATGGAACAAACTCCAAAACCCAATCTTAGAGTTTGTGCAATCGGGTTGGAGCTCAACGTTTCCAGAAAACAATATTTGGAAAGCTTTCTTTGCTGAGTTAAAAATAGAGGATGTTATATGGAGGTCTCCGTGGA
CGTTTGTCAGGCCAATGATGTACAAATGTGGTGAATTCCAGAATTTACCCCTCCTAGGCCCGTGGGGATGTATATCCTACGCTCCTTTGATGGTATTGCAACAAATGTGGATACGCCAATTTATCCCTGCAACACATGAGCTGAATAATTCAGAATTTGCTTACAAGAAGGGGTTCTGTGAAAGCAAGATTCAGCTGATTGTAAAGGCTTGGAAGAAGATCAACAAAATTCAAAGTGGTCAAGTCCATGATGATACCACAGAGGATTACAGGACATGGCATTTAAACAGAACAAAGATAATGAACACATCTCCATCTACAAACATTAAAATACATCTCCAAGAAGAAGTAGTGATTCCCAATCAATTAGCAGAACATGAGGCGCGTGAAGAAGATTTGAGAAGGACAAATGCATCTCTAGTTCAGGAAAACGAACGATTAAAAGTTGAAATGAATCAATGCTTGATGTGAAACACTACACTTGAAAAAGAATTACGCGAATTAAAGGATAGTGTCGAAAAGTAGAGGAAATTAGAAAGAGAAGTATCTGCGCTACATACAGAAACTCGTGATCTAAACAGGAAAATGCATCGATTAAGGAAAGATAAAGAAGCCACTCAAGCAACTCTGAAGTCTCAAGATGATCAAATCTCACAATTGCAGTCCGATATATCTGTACTTTATGAATGGGTAAAAGAATTGGAGAAAGGAATTGATCTTAGAAATAAAGAAATTCATGATTTAGAAGAGAAGAACAAGACGTTGCATCAAACAGTTGATGTAAAGGAAGGGCAACTATGTGAAATCAGCAAGGAAAATACGACCTTGAGGGAATCCATCCAGTCCCTCAACATTCGATTTACCAAATATCAGGATGTTGCAGAAGGATTAATGCGGGATCATACGCAATTAAAAGAACAATATGATGGATTAAAAGTAGATTACGAATTTCTGAGACTTGATTACACTACTATGCGTAATAAGTCAGAATATATGCTTACAGAAATCATAAAAGTTACAAAGAAATCAGACGACCTGGCAGAATAAGCACGTCACATGATATCGGCTATTGCACCCACGCAACCAAACGGCAAACACATGCTTAAATTCTTGGGGAAATTGCGTACAAATTTAGAATACTGGGGGCGATTTTATTGATCTCCTGTTTTGATCAGATTTTTTTCTCTATTTTTTTCTTTTAACAATTTGTAATCCTTTTATCCATTTATTTATTTATTTTTCCTCTATTTTCACTTAATTGAATATATTTTTCTTGATTTTGATTTTGTCATTCGTATCAATAAATTCTTTTTCTTTTTCTTTTTCTTTTTCTTTTTTTTCTTCATAAAGAAAAATAAGATAGCTCGATCACCACGTATCAGTCGCACTTACGTCACCAGATACAGATTAAGGAAAATGGATGAACAAAATACAGAACTGGAAGACATTGAAGAGTTAAAGGAAAAAATGGAAGCCATACTTCTGCTGTTGGAGAAAGGCAAATTTACAGCGGATGTAGCTCAACCCAGTACCACGACTGGGGTAAATTTGCAGTCACATCCACCAATTGAGGGATTTACACCCAAATACACCACATATAACCCACTCTACAATGCTCCCGTTGATCAGTTTCCTTTTCCTTTCACAACAAAAGTTGAGCCAGTTCCTTTGCCGAATCAAGCAAGCTTCCGACCTGTTAATGAGGACCCGGCAAAAGTTACAATCACAATTCCTAATCTGGACGATCCAGAGGTCAGAAAAAATTTGATGAAAGAAGAGCCTAAAACCTCTTCTAGCGGAAAGTTCGAGAATCTAGAAGAAAAGCTGAGAGTTGTGGAGGGAACAGATGTTTTCGGTAACATAGACGCTACTCAGTTGTGCCTGGTGCCAGATGTAGTTCTTCCAGCAAAATTTATGGTTCCAGATTTTGAAAAGTATGATGGGTCTTCATGCCCTAAAAGCCATATTATCATGTACTGCAGAAAGATGGCAGCGTATGTC
CACGATAACAAGTTACTCATAAATGACAAGTTACTCATACACTGTTTCCAAGATAGTCTGACAGGTCCAGCTTCTCGATGGTATATGCAGTTGGATGGAGCTCACATCAACACATGGAGGAACTTAGCAGATTCCTTCCTAAAACAATATAAGCATAACATAGATATGGCTCCAGATCGTCTAGATCTGCAGAGGATGGAGAAGAAAAGTACAGAGAATTTCAAAGAGTATGCACAGAGATGGAGGGATACTGCCGCACAAGTACAACCCCTCTAACGGACAAGGAGTTGTCTGTCATGTTCATTAATACTCTAAAAGCTCCATTTTATGACTGAATGATTGGAAGTGCATCAAATAACTTTTCGGATCTAATGACGATTGGAGAAAGAATCGAGTATGGGATCAAGCATGGCAAGATAACCGATGTAGCAGGATCTTCAACGTCAAGTGTAAAGAAGACTAATTTTTCAAAGAAAAAAGAGGGAGAGGTACAAATGATAAGCAGAGTTAACCATGGTAACTCAAGCCAGTTGCCGCATCCAAACTCTAGAATTCCTCATTATCCACATAATTATTATGCACCACCTCATTCATATTATCAACCTTACGTAAATCATGCAGCAGTCCAATATTATCCCCCTCCTCCTCAAAATCAACGCACTTATGTGAATCAGGGGTACCAGCCTCAAGGTCAACAAAATTCTTATGCTCGAGGACAAGATAATAATAAAGGGGGTCGTAAACAAGTGCAGTTTGATCCGATTCCAATGACTTACACAGAGCTTCTGCCACGACTATTTCAGAATAATCAACTAGCACCTGTACCAATAGAACCACTGCAACCTCCTTATCCAAAGTGGTATGACCCAAATGTCCGTTGTGATTATCATGCAGGAGTCATTGGGCATTCAACTGAGAATTGTACTGCACTTAAGTACAGGGTACAGGCATTAATCAAGGCGGGATGGCTGAATTTTAAGAAAGACAATGGACCTGACGTAAATAATAACCCTTTTCTAAACCACCAAAATCCACAAGTAAATGCTGTAGATGTGGCTAAGGTTGGTTTAAGAGATGTCGTGAGAATGATAACATCCAAAGAAGAGCTTTTTCGAATTTTGTTTAACAACAGATTGATCGAATAAGGGAATTTGCAAGATGATGTCCTCGTTGGCCAATACGATGACAGTTTATTATGTTCATATCATGCTGGGGCAAAAGGTCACTCTATTGACCAGTGCCCTCATTTCTCCCAGAAGGTACAAGAATTATTAAATTCTCATTTCCTAACAGTCTCACAGAAGGTAGTTGGAGAATCAGAGCGTGAGCATGAAATAGATGTTGCAGAAGAGTTCTCAACAGGTGAATGCTCGAAGGTTTATCTAGAACGAAAGCCATTAACAATCTTTTATAAGGAAAAACCCAGTACTACCACTTCCAAACCAAAACCAATTACCATACAGGTTCCATCTCCATTTGAGTATAAAAGTTCAAAAGCAGTACCATGGAATTATGAATACAAGGTACTTGTTGAGTCTGTACCAGTTCCTATAGACAATATTAATGAAATTGGAGGTACAACACGAAGTGGGAAGTGTTATACGCCAGAAGCTTTATTGAAGTACAATAGCAAGGAAAAAGGAAAAGCCAAGCTTAGTGATGTCATTGATTGCAGGATAGAAGAGCCACTGTTGGTGAGAAATCAAGAAGTAAAGGAGCCAGCATCTGAAGATGACATCCAAGAGTTTTTAAGACTTATAAAACAAAGTGATTATAAAGTAATTGAACAGTTGGGCAAAACGCCTGCTAGAATCTCTATTTTGGCCTTATTGTTATCTTCAGACCTGCATCGCAAAACTCTAATGAATATCCTGAACCAGACTTATGTTCCATCAGACATTACAACAGACAATTTAGACAACATCGTTGGCAATATAACAGCATCAAGTTCCATAACTTTCACTGATGATGAGATACCGCCGGAAGGCATGGGTCACACA
AAGGCCTTGCATATTACAGTTAAGTGCAAGAAATTTGCTGTGGCCAAAGTTTTAGTTGACAATGGCTCTTCTTTGAACATAATGCCTATGTCTACATTGGAAAAATTGCATGTTGATATGTCACACCTCAAATCAAGCACTATGATAGTGAGGGCCTTTGATGGATCGCGCAGTGAAGTGGTTGGGGACATAGAAATCCCTATTCAAATAGGCCCTTGCACCTTTGACATAACTTTCCAGGTCATGAACATCAACTCAGCCTACAGTTTCCTGTTAGGTCGTCCATGGATTCATTCAGCAGGAGTAGTCCCTTCGTCATTGTACCAACGCCTTAAATTTGTAGTCGATCGCAAGCTGGTGATCGTATCAGGACAAGAGGATATTTTTGTGTCAAGGCCATCCTCAATGCCATATATAGAAGCAGCAGAAGAAGCTTTTGAATCCTCATTCCAATCTTTCGAGGTTGCAAATTCCACCACTATATACGGAGAAAGAGGAACAAGGAAGCCACGGTTTTCAAAGCTACCTTCAAGGGGAAATGGCAGAAGTTTGGATGACTTATTGAATGTGCAAAAAAATATGAAAAGATTTGGTTTGGGATATAAACCAAACAGGGAAGAAATGATCAGAACACAAAAGCGGGAAAACAAGAATCGAATGATGAATTTTGAGCATCATAGGCCTCGAAGCCAAATGAGTATTCCCCACCTTTACGAATCCTTCAAATTCGCTGGCACAATCCATCCAGAAAGTTTTGAAGTAATGGCTGTCACAGAAGGCAAAGACAAAGAGTATCCCCTAGTCTACTTATGCCTAGAGGATTTTGAGCTTAATAATTGGACCGTATTTGAGCTACCGTCAATTGTTAACGATTTATCAAAG

mRNA sequence

TCTAGTAACAATGAGAAGTCCAGCAGAGAGGCTTATTAAGATCATAATTTCATTATCAGCAACTTTGTTTGCGGCGGCAGCGGCGGTGGAGGAGTTAAAGGACTGCGACGAATGGTGTGGGAACTTGAAGATTCCATATCCATTTGGGATGAAGGAAGGGTGTTATCTTAACAAAAATTTCTCCATTACTTGCAACAAAACTCATTATAATCCACCAAAAGCATTTCTACAAGACGGCAACATATATATCACCAATATATCCATAATCCAAGGCCAGCTCCACATCTTGCAATTTGTAGCCACAGATTGCTACACAAAAAATGGTCATCTTGAATCCTACAATACCACTCTCCAAATCCGCACATTCACAATTTCCAACACCATGAACAAGTTCACAGTCGTTGGCTGCGATACTTACGCTCTTATTTCTGGTATACTTGAGAAAGAATCCTACATAAGTGGTATATGTGGACTAAACACCAAAATGATTAACTTTCTCGATGATGGATCTCAATATCGTCGCCAGTGTTTGGAAGGCTTTGAGGGAAATCCATATCTCCAAAAAGGTTGCCAAGATATAGATGAATGTAAGAACGGAAGCAATAAATGTAAATACAAAGAGTTGTGTGTTAACGAACCAGGAAACTATACTTGCCATTGTCCTAAGAACCATAAAGGAGATGGCAGATGTGGAGGAGAAGGCTGCACTCGAAACCCCATAGTTAACCATGGTAACTCAAGCCAGTTGCCGCATCCAAACTCTAGAATTCCTCATTATCCACATAATTATTATGCACCACCTCATTCATATTATCAACCTTACGTAAATCATGCAGCAGTCCAATATTATCCCCCTCCTCCTCAAAATCAACGCACTTATGTGAATCAGGGGTACCAGCCTCAAGTCTCACAGAAGGTAGTTGGAGAATCAGAGCGTGAGCATGAAATAGATGTTGCAGAAGAGTTCTCAACAGGTGAATGCTCGAAGGTTTATCTAGAACGAAAGCCATTAACAATCTTTTATAAGGAAAAACCCAGTACTACCACTTCCAAACCAAAACCAATTACCATACAGGTTCCATCTCCATTTGAGTATAAAAGTTCAAAAGCAGTACCATGGAATTATGAATACAAGGTACTTGTTGAGTCTGTACCAGTTCCTATAGACAATATTAATGAAATTGGAGGTACAACACGAAGTGGGAAGTGTTATACGCCAGAAGCTTTATTGAAGTACAATAGCAAGGAAAAAGGAAAAGCCAAGCTTAGTGATGTCATTGATTGCAGGATAGAAGAGCCACTGTTGGTGAGAAATCAAGAAGTAAAGGAGCCAGCATCTGAAGATGACATCCAAGAGTTTTTAAGACTTATAAAACAAAGTGATTATAAAGTAATTGAACAGTTGGGCAAAACGCCTGCTAGAATCTCTATTTTGGCCTTATTGTTATCTTCAGACCTGCATCGCAAAACTCTAATGAATATCCTGAACCAGACTTATGTTCCATCAGACATTACAACAGACAATTTAGACAACATCGTTGGCAATATAACAGCATCAAGTTCCATAACTTTCACTGATGATGAGATACCGCCGGAAGGCATGGGTCACACAAAGGCCTTGCATATTACAGTTAAGTGCAAGAAATTTGCTGTGGCCAAAGTTTTAGTTGACAATGGCTCTTCTTTGAACATAATGCCTATGTCTACATTGGAAAAATTGCATGTTGATATGTCACACCTCAAATCAAGCACTATGATAGTGAGGGCCTTTGATGGATCGCGCAGTGAAGTGGTTGGGGACATAGAAATCCCTATTCAAATAGGCCCTTGCACCTTTGACATAACTTTCCAGGTCATGAACATCAACTCAGCCTACAGTTTCCTGTTAGGTCGTCCATGGATTCATTCAGCAGGAGTAGTCCCTTCGTCATTGTACCAACGCCTTAAATTTGTAGTCGATCGCAAGCTGGTGATCGTATCAGGACAAGAGGATATTTTTG
TGTCAAGGCCATCCTCAATGCCATATATAGAAGCAGCAGAAGAAGCTTTTGAATCCTCATTCCAATCTTTCGAGGTTGCAAATTCCACCACTATATACGGAGAAAGAGGAACAAGGAAGCCACGGTTTTCAAAGCTACCTTCAAGGGGAAATGGCAGAAGTTTGGATGACTTATTGAATGTGCAAAAAAATATGAAAAGATTTGGTTTGGGATATAAACCAAACAGGGAAGAAATGATCAGAACACAAAAGCGGGAAAACAAGAATCGAATGATGAATTTTGAGCATCATAGGCCTCGAAGCCAAATGAGTATTCCCCACCTTTACGAATCCTTCAAATTCGCTGGCACAATCCATCCAGAAAGTTTTGAAGTAATGGCTGTCACAGAAGGCAAAGACAAAGAGTATCCCCTAGTCTACTTATGCCTAGAGGATTTTGAGCTTAATAATTGGACCGTATTTGAGCTACCGTCAATTGTTAACGATTTATCAAAG

Coding sequence (CDS)

ATGAGAAGTCCAGCAGAGAGGCTTATTAAGATCATAATTTCATTATCAGCAACTTTGTTTGCGGCGGCAGCGGCGGTGGAGGAGTTAAAGGACTGCGACGAATGGTGTGGGAACTTGAAGATTCCATATCCATTTGGGATGAAGGAAGGGTGTTATCTTAACAAAAATTTCTCCATTACTTGCAACAAAACTCATTATAATCCACCAAAAGCATTTCTACAAGACGGCAACATATATATCACCAATATATCCATAATCCAAGGCCAGCTCCACATCTTGCAATTTGTAGCCACAGATTGCTACACAAAAAATGGTCATCTTGAATCCTACAATACCACTCTCCAAATCCGCACATTCACAATTTCCAACACCATGAACAAGTTCACAGTCGTTGGCTGCGATACTTACGCTCTTATTTCTGGTATACTTGAGAAAGAATCCTACATAAGTGGTATATGTGGACTAAACACCAAAATGATTAACTTTCTCGATGATGGATCTCAATATCGTCGCCAGTGTTTGGAAGGCTTTGAGGGAAATCCATATCTCCAAAAAGGTTGCCAAGATATAGATGAATGTAAGAACGGAAGCAATAAATGTAAATACAAAGAGTTGTGTGTTAACGAACCAGGAAACTATACTTGCCATTGTCCTAAGAACCATAAAGGAGATGGCAGATGTGGAGGAGAAGGCTGCACTCGAAACCCCATAGTTAACCATGGTAACTCAAGCCAGTTGCCGCATCCAAACTCTAGAATTCCTCATTATCCACATAATTATTATGCACCACCTCATTCATATTATCAACCTTACGTAAATCATGCAGCAGTCCAATATTATCCCCCTCCTCCTCAAAATCAACGCACTTATGTGAATCAGGGGTACCAGCCTCAAGTCTCACAGAAGGTAGTTGGAGAATCAGAGCGTGAGCATGAAATAGATGTTGCAGAAGAGTTCTCAACAGGTGAATGCTCGAAGGTTTATCTAGAACGAAAGCCATTAACAATCTTTTATAAGGAAAAACCCAGTACTACCACTTCCAAACCAAAACCAATTACCATACAGGTTCCATCTCCATTTGAGTATAAAAGTTCAAAAGCAGTACCATGGAATTATGAATACAAGGTACTTGTTGAGTCTGTACCAGTTCCTATAGACAATATTAATGAAATTGGAGGTACAACACGAAGTGGGAAGTGTTATACGCCAGAAGCTTTATTGAAGTACAATAGCAAGGAAAAAGGAAAAGCCAAGCTTAGTGATGTCATTGATTGCAGGATAGAAGAGCCACTGTTGGTGAGAAATCAAGAAGTAAAGGAGCCAGCATCTGAAGATGACATCCAAGAGTTTTTAAGACTTATAAAACAAAGTGATTATAAAGTAATTGAACAGTTGGGCAAAACGCCTGCTAGAATCTCTATTTTGGCCTTATTGTTATCTTCAGACCTGCATCGCAAAACTCTAATGAATATCCTGAACCAGACTTATGTTCCATCAGACATTACAACAGACAATTTAGACAACATCGTTGGCAATATAACAGCATCAAGTTCCATAACTTTCACTGATGATGAGATACCGCCGGAAGGCATGGGTCACACAAAGGCCTTGCATATTACAGTTAAGTGCAAGAAATTTGCTGTGGCCAAAGTTTTAGTTGACAATGGCTCTTCTTTGAACATAATGCCTATGTCTACATTGGAAAAATTGCATGTTGATATGTCACACCTCAAATCAAGCACTATGATAGTGAGGGCCTTTGATGGATCGCGCAGTGAAGTGGTTGGGGACATAGAAATCCCTATTCAAATAGGCCCTTGCACCTTTGACATAACTTTCCAGGTCATGAACATCAACTCAGCCTACAGTTTCCTGTTAGGTCGTCCATGGATTCATTCAGCAGGAGTAGTCCCTTCGTCATTGTACCAACGCCTTAAATTTGTAGTCGATCGCAAGCTGGTGATCGTATCAGGACAAGAGGATATTTTTGTGTCAAGGCC
ATCCTCAATGCCATATATAGAAGCAGCAGAAGAAGCTTTTGAATCCTCATTCCAATCTTTCGAGGTTGCAAATTCCACCACTATATACGGAGAAAGAGGAACAAGGAAGCCACGGTTTTCAAAGCTACCTTCAAGGGGAAATGGCAGAAGTTTGGATGACTTATTGAATGTGCAAAAAAATATGAAAAGATTTGGTTTGGGATATAAACCAAACAGGGAAGAAATGATCAGAACACAAAAGCGGGAAAACAAGAATCGAATGATGAATTTTGAGCATCATAGGCCTCGAAGCCAAATGAGTATTCCCCACCTTTACGAATCCTTCAAATTCGCTGGCACAATCCATCCAGAAAGTTTTGAAGTAATGGCTGTCACAGAAGGCAAAGACAAAGAGTATCCCCTAGTCTACTTATGCCTAGAGGATTTTGAGCTTAATAATTGGACCGTATTTGAGCTACCGTCAATTGTTAACGATTTATCAAAG

Protein sequence

MRSPAERLIKIIISLSATLFAAAAAVEELKDCDEWCGNLKIPYPFGMKEGCYLNKNFSITCNKTHYNPPKAFLQDGNIYITNISIIQGQLHILQFVATDCYTKNGHLESYNTTLQIRTFTISNTMNKFTVVGCDTYALISGILEKESYISGICGLNTKMINFLDDGSQYRRQCLEGFEGNPYLQKGCQDIDECKNGSNKCKYKELCVNEPGNYTCHCPKNHKGDGRCGGEGCTRNPIVNHGNSSQLPHPNSRIPHYPHNYYAPPHSYYQPYVNHAAVQYYPPPPQNQRTYVNQGYQPQVSQKVVGESEREHEIDVAEEFSTGECSKVYLERKPLTIFYKEKPSTTTSKPKPITIQVPSPFEYKSSKAVPWNYEYKVLVESVPVPIDNINEIGGTTRSGKCYTPEALLKYNSKEKGKAKLSDVIDCRIEEPLLVRNQEVKEPASEDDIQEFLRLIKQSDYKVIEQLGKTPARISILALLLSSDLHRKTLMNILNQTYVPSDITTDNLDNIVGNITASSSITFTDDEIPPEGMGHTKALHITVKCKKFAVAKVLVDNGSSLNIMPMSTLEKLHVDMSHLKSSTMIVRAFDGSRSEVVGDIEIPIQIGPCTFDITFQVMNINSAYSFLLGRPWIHSAGVVPSSLYQRLKFVVDRKLVIVSGQEDIFVSRPSSMPYIEAAEEAFESSFQSFEVANSTTIYGERGTRKPRFSKLPSRGNGRSLDDLLNVQKNMKRFGLGYKPNREEMIRTQKRENKNRMMNFEHHRPRSQMSIPHLYESFKFAGTIHPESFEVMAVTEGKDKEYPLVYLCLEDFELNNWTVFELPSIVNDLSK
BLAST of Lsi08G005000 vs. Swiss-Prot
Match: WAK2_ARATH (Wall-associated receptor kinase 2 OS=Arabidopsis thaliana GN=WAK2 PE=1 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 1.1e-12
Identity = 36/90 (40.00%), Positives = 50/90 (55.56%), Query Frame = 1

Query: 145 KESYISGICGLNTKMINFLDDGSQYRRQCLEGFEGNPYLQKGCQDIDECKNGSNKCKYKE 204
           K+    G+CG N+   +    G+ Y  +CLEGFEGNPYL  GCQDI+EC +  + C    
Sbjct: 235 KQVEYRGVCGGNSTCFDSTG-GTGYNCKCLEGFEGNPYLPNGCQDINECISSRHNCSEHS 294

Query: 205 LCVNEPGNYTCHCPKNHKGDGRCGGEGCTR 235
            C N  G++ C+CP  ++ D       CTR
Sbjct: 295 TCENTKGSFNCNCPSGYRKDSL---NSCTR 320
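
The percentages on each HSP summary line are the ratio of matched positions to alignment length. The sketch below recomputes the identity figure from such a line as a check. The helper name hsp_fraction is hypothetical, and the regular expression only assumes the "Label = matches/length" layout seen in this report.

```python
import re

def hsp_fraction(line, label="Identity"):
    """Return (matches, alignment_length, percent) for a label in an HSP line.

    Hypothetical helper; expects text shaped like 'Identity = 36/90 (40.00%)'.
    """
    m = re.search(rf"{re.escape(label)}\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"{label!r} not found in: {line!r}")
    matches, length = int(m.group(1)), int(m.group(2))
    return matches, length, round(100.0 * matches / length, 2)

# Identity field of the WAK2_ARATH hit shown above.
print(hsp_fraction("Identity = 36/90 (40.00%), Query Frame = 1"))  # (36, 90, 40.0)
```

The same call with a different label value applies to the other "matches/length" field on the line.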

BLAST of Lsi08G005000 vs. Swiss-Prot
Match: WAK3_ARATH (Wall-associated receptor kinase 3 OS=Arabidopsis thaliana GN=WAK3 PE=2 SV=2)

HSP 1 Score: 66.6 bits (161), Expect = 1.4e-09
Identity = 42/128 (32.81%), Positives = 66/128 (51.56%), Query Frame = 1

Query: 30  KDCDEWCGNLKIPYPFGMKEGCYL--NKNFSITCNKTHYNPPKAFLQDGNIYITNISIIQ 89
           +DC   CGN+ I YPFG+  GCY   + NF++TC        +  L  G I +TNIS   
Sbjct: 29  EDCKLKCGNVTIEYPFGISTGCYYPGDDNFNLTC-----VVEEKLLLFGIIQVTNIS-HS 88

Query: 90  GQLHILQFVATDCYTKNGHLESYNTTLQIRTFTISNTMNKFTVVGCDTYALISGILEKES 149
           G + +L    ++CY +           Q+ +    ++ NKFT+VGC+  +L+S    K++
Sbjct: 89  GHVSVLFERFSECYEQKNETNGTALGYQLGSSFSLSSNNKFTLVGCNALSLLS-TFGKQN 148

Query: 150 YISGICGL 156
           Y +G   L
Sbjct: 149 YSTGCLSL 149

BLAST of Lsi08G005000 vs. Swiss-Prot
Match: WAK5_ARATH (Wall-associated receptor kinase 5 OS=Arabidopsis thaliana GN=WAK5 PE=2 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 4.2e-09
Identity = 29/67 (43.28%), Positives = 38/67 (56.72%), Query Frame = 1

Query: 152 ICGLNTKMINFLDDGSQYRRQCLEGFEGNPYLQKGCQDIDECKNGSNKCKYKELCVNEPG 211
           ICG N+   +    G  Y  +CL+GF+GNPYL  GCQDI+EC    + C     C N  G
Sbjct: 243 ICGGNSTCFDSTR-GKGYNCKCLQGFDGNPYLSDGCQDINECTTRIHNCSDTSTCENTLG 302

Query: 212 NYTCHCP 219
           ++ C CP
Sbjct: 303 SFHCQCP 308

BLAST of Lsi08G005000 vs. Swiss-Prot
Match: WAKLG_ARATH (Wall-associated receptor kinase-like 8 OS=Arabidopsis thaliana GN=WAKL8 PE=2 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 4.2e-09
Identity = 42/144 (29.17%), Positives = 73/144 (50.69%), Query Frame = 1

Query: 6   ERLIKIIISLSATLFAAAAAVE-ELKDCDEWCGNLKIPYPFGMKEGCYLNKNFSITCNKT 65
           +R + +++ L    +AAA+     L++C + CGN+ +PYPFG+ +GCY NK F I C  +
Sbjct: 6   KRFLVVMLLLRICEYAAASTFPLALRNCSDHCGNVSVPYPFGIGKGCYKNKWFEIVCKSS 65

Query: 66  HYNPPKAFLQDGNIYITNISI-------IQGQLHILQ-FVATDCYTKNGHLESYNTTLQI 125
               P   L      +T+ ++       +  + +I      + C  ++G+  S +  L+ 
Sbjct: 66  SDQQPILLLPRIRRAVTSFNLGDPFSISVYNKFYIQSPLKHSGCPNRDGY-SSSSLNLKG 125

Query: 126 RTFTISNTMNKFTVVGCDTYALIS 141
             F IS   NKFT VGC+  A ++
Sbjct: 126 SPFFISEN-NKFTAVGCNNKAFMN 147

BLAST of Lsi08G005000 vs. Swiss-Prot
Match: WAK1_ARATH (Wall-associated receptor kinase 1 OS=Arabidopsis thaliana GN=WAK1 PE=1 SV=2)

HSP 1 Score: 63.5 bits (153), Expect = 1.2e-08
Identity = 42/134 (31.34%), Positives = 67/134 (50.00%), Query Frame = 1

Query: 8   LIKIIISLSATLFAAAAAVEELKDCDEWCGNLKIPYPFGMKEGCYL--NKNFSITCNKTH 67
           L+ I  SL+ T        +  ++C   CGN+ I YPFG+  GCY   N++FSITC +  
Sbjct: 9   LVAIFFSLACTQLVKGQH-QPGENCQNKCGNITIEYPFGISSGCYYPGNESFSITCKEDR 68

Query: 68  YNPPKAFLQDGNIYITNISIIQGQLHILQFVATDCYTKNGHLESYNTTLQIRTFTISNTM 127
            +     L D  +   N S   GQL +L   ++ CY + G     +++  +   ++S   
Sbjct: 69  PH----VLSDIEVANFNHS---GQLQVLLNRSSTCYDEQGKKTEEDSSFTLENLSLS-AN 128

Query: 128 NKFTVVGCDTYALI 140
           NK T VGC+  +L+
Sbjct: 129 NKLTAVGCNALSLL 133

BLAST of Lsi08G005000 vs. TrEMBL
Match: A0A061EXR3_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_024883 PE=4 SV=1)

HSP 1 Score: 438.3 bits (1126), Expect = 2.0e-119
Identity = 245/536 (45.71%), Positives = 332/536 (61.94%), Query Frame = 1

Query: 332  KPLTIFYKEKPS-----TTTSKPKPITIQVPSPFEYKSSKAVPWNYEYKVLVESVPVP-- 391
            KPLTIFY+E  S     + T     ITI+VPSPF YK+ KAVPWNYE  +L  +   P  
Sbjct: 1263 KPLTIFYEENKSPMNDTSPTMIRNGITIEVPSPFPYKNDKAVPWNYECNILGTASSAPQA 1322

Query: 392  -IDNINEIGGTTRSGKCYTPEALLKYNSK-----EKGKAKLSDVIDCRIEEPLLVRNQEV 451
              ++I  +GG TRSG+CY+PE   +         E G  K       +++E ++  N EV
Sbjct: 1323 SFEDITGVGGITRSGRCYSPEVAERVEKGKPAQGEGGLKKADTFSKDQVDEFVVAPNNEV 1382

Query: 452  KEPASEDDIQEFLRLIKQSDYKVIEQLGKTPARISILALLLSSDLHRKTLMNILNQTYVP 511
            K P +E +  EFL+ IK S+Y V+EQL K PA IS+L+LLL+S+ H+  L+ +LNQ YV 
Sbjct: 1383 KSPVTEKEAGEFLKFIKHSEYSVVEQLTKMPAPISLLSLLLNSEAHKNALLKVLNQAYVA 1442

Query: 512  SDITTDNLDNIVGNITASSSITFTDDEIPPEGMGHTKALHITVKCKKFAVAKVLVDNGSS 571
             DI+ + LD+IVGNIT  + I F D+EIPP G G  KALHIT+KCK  AV +VLVDNGS+
Sbjct: 1443 QDISVEKLDHIVGNITVGNFIAFNDEEIPPGGRGSNKALHITIKCKDHAVPRVLVDNGSA 1502

Query: 572  LNIMPMSTLEKLHVDMSHLKSSTMIVRAFDGSRSEVVGDIEIPIQIGPCTFDITFQVMNI 631
            LN+MP STL KL VD+S+++ S M+VRAFDG+  EVVGDIE+PI+IGPC F++ FQVM+I
Sbjct: 1503 LNVMPRSTLTKLLVDVSYMRPSRMVVRAFDGTTREVVGDIELPIKIGPCIFEVQFQVMDI 1562

Query: 632  NSAYSFLLGRPWIHSAGVVPSSLYQRLKFVVDRKLVIVSGQEDIFVSRPSSMPYIEAAEE 691
              +Y+ LLGRPWIH AG +PSSL+Q++KF+ + +L+ V  +EDI   +PSS PY+EA EE
Sbjct: 1563 APSYNCLLGRPWIHMAGAIPSSLHQKVKFIAEGQLISVCAEEDILAIQPSSAPYVEATEE 1622

Query: 692  AFESSFQSFEVANSTTIYGERGTRKPRFSKLPSRG-----------------NGRSLDDL 751
              E SF+SFE  N+T +   +    PR S     G                 N + ++  
Sbjct: 1623 VPECSFRSFEFVNATYVGERKVIPTPRLSVATKMGVKQTVGKGCRVGLGLGKNLQGINRP 1682

Query: 752  LNVQKNMKRFGLGYKPNREEMIRTQKRENKNRMMNFEHHRPR-SQMSIPHLYESFKFAGT 811
            L   KN +RFGLGYKP +EE  +   ++   RM   E  +    + +IPHLYE+F+ AG 
Sbjct: 1683 LTPMKNEERFGLGYKPTKEERRKLTAQKKIKRMAQLEGKKEEFGERTIPHLYETFRSAGF 1742

Query: 812  IHPES----------FEVMAV----TEGKDKEYPLVYLCLEDFELNNWTVFELPSI 823
            IHPE+          F+ +++     E  D + P+VY  L   EL+NWT  ELP I
Sbjct: 1743 IHPEAPPKVNQVLRIFDELSIHMIRDEEPDGKIPVVYPVLPGEELSNWTATELPII 1798

BLAST of Lsi08G005000 vs. TrEMBL
Match: A0A061ESA1_THECC (Gag-pro-like protein OS=Theobroma cacao GN=TCM_022266 PE=4 SV=1)

HSP 1 Score: 438.0 bits (1125), Expect = 2.6e-119
Identity = 245/536 (45.71%), Positives = 332/536 (61.94%), Query Frame = 1

Query: 332 KPLTIFYKEKPS-----TTTSKPKPITIQVPSPFEYKSSKAVPWNYEYKVLVESVPVP-- 391
           KPLTIFY+E  S     + T     ITI+VP+PF YKS KAVPWNY+  +   +   P  
Sbjct: 414 KPLTIFYEENRSPMNDTSPTMIRSGITIEVPNPFPYKSDKAVPWNYQCNISGTASSAPQA 473

Query: 392 -IDNINEIGGTTRSGKCYTPEALLKYNSK-----EKGKAKLSDVIDCRIEEPLLVRNQEV 451
             +++  +GG TRSG+CY+PE   K   +     E G  K       +++E ++  N EV
Sbjct: 474 SFEDLTGVGGITRSGRCYSPEVAEKVGKEKLTQGEGGLKKADTFSKDQVDESVVAPNNEV 533

Query: 452 KEPASEDDIQEFLRLIKQSDYKVIEQLGKTPARISILALLLSSDLHRKTLMNILNQTYVP 511
           K P +E +  EFL+ IK S+Y V+EQL K PARIS+L+LLL+S+ HR  L+ +LNQ YV 
Sbjct: 534 KNPVTEKEAGEFLKFIKHSEYSVVEQLTKMPARISLLSLLLNSEAHRNALLKVLNQAYVA 593

Query: 512 SDITTDNLDNIVGNITASSSITFTDDEIPPEGMGHTKALHITVKCKKFAVAKVLVDNGSS 571
            DI+ + LD+IVGNIT  + I F D+EIP  G G  KALHIT+KCK  AV +VLVDNGS+
Sbjct: 594 QDISVEKLDHIVGNITVGNFIAFNDEEIPSGGRGSNKALHITIKCKDHAVPRVLVDNGSA 653

Query: 572 LNIMPMSTLEKLHVDMSHLKSSTMIVRAFDGSRSEVVGDIEIPIQIGPCTFDITFQVMNI 631
           LN+MP STL KL VD+S++++S M+VRAFDG+  EVVGDIE+PI+IGPC F++ FQVM+I
Sbjct: 654 LNVMPRSTLTKLPVDVSYMRTSRMVVRAFDGTTREVVGDIELPIKIGPCIFEVQFQVMDI 713

Query: 632 NSAYSFLLGRPWIHSAGVVPSSLYQRLKFVVDRKLVIVSGQEDIFVSRPSSMPYIEAAEE 691
             +Y+ LLGRPWIH AG VPSSL+Q++KF+   +L+ V  +EDI   +PSS PY+EA EE
Sbjct: 714 APSYNCLLGRPWIHMAGAVPSSLHQKVKFIAKGQLISVCAEEDILAIQPSSAPYVEATEE 773

Query: 692 AFESSFQSFEVANSTTIYGERGTRKPRFSKLPSRG-----------------NGRSLDDL 751
             E SF+SFE  N+T I  ++    PR S     G                 N + ++  
Sbjct: 774 VPECSFRSFEFVNATYIGEKKVIPTPRLSVATKMGVKQTVGKGCRAGLGLGKNLQGINRP 833

Query: 752 LNVQKNMKRFGLGYKPNREEMIRTQKRENKNRMMNFEHHRPR-SQMSIPHLYESFKFAGT 811
           L   KN +RFGLGYKP +EE  +   ++   RM   E       + +IPHLYE+F+ AG 
Sbjct: 834 LTPMKNEERFGLGYKPTKEERRKLTAQKKIKRMAQLEGKEEEFGERTIPHLYETFRSAGF 893

Query: 812 IHPES----------FEVMAV----TEGKDKEYPLVYLCLEDFELNNWTVFELPSI 823
           IHPE+          F+ +++     E  D + P+VY  L   EL+NWT  ELP +
Sbjct: 894 IHPEAPPKVNQVLRMFDELSIHMIRDEEPDGKIPVVYPVLPGEELSNWTATELPIV 949

BLAST of Lsi08G005000 vs. TrEMBL
Match: A0A151R2D5_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_042162 PE=4 SV=1)

HSP 1 Score: 437.2 bits (1123), Expect = 4.5e-119
Identity = 237/511 (46.38%), Positives = 332/511 (64.97%), Query Frame = 1

Query: 350 KPITIQVPSPFEYKSSKAVPWNYEYKVL-----------VESVPVPIDNINEIGGTTRSG 409
           KP+ +Q+P+PF YK +KAVPW Y+ KV            V++    I NI  +GG TRSG
Sbjct: 15  KPLVVQIPAPFHYKDTKAVPWRYDAKVKSDYLNAQQKKGVDTARTNITNITGVGGMTRSG 74

Query: 410 KCYTPEALLKYNSKEKGKAKLSDVID-----CRIEEPLLVRNQEVKEPASEDDIQEFLRL 469
           + YTPE L   +     + K + +I+      R  +   V ++  KE  S+++  EFL+ 
Sbjct: 75  RVYTPEELRVKDFTRHHEEKENTIINEGVSGVRRRDDKKVVDERKKE-VSDEEASEFLKF 134

Query: 470 IKQSDYKVIEQLGKTPARISILALLLSSDLHRKTLMNILNQTYVPSDITTDNLDNIVGNI 529
           I+QS+YK+I+QL  TPAR+S+L++L++S+ HRK LM ILN+ +V +DIT D    IVGNI
Sbjct: 135 IRQSEYKLIDQLNHTPARVSLLSVLMNSESHRKLLMKILNEAHVSNDITLDTFGGIVGNI 194

Query: 530 TASSSITFTDDEIPPEGMGHTKALHITVKCKKFAVAKVLVDNGSSLNIMPMSTLEKLHVD 589
           TA++ +TFTDDE+P EG GH KALHI+VKC    +A+VL+DNGSSLN+MP STL++L  D
Sbjct: 195 TANNHLTFTDDEVPAEGRGHNKALHISVKCANHILARVLIDNGSSLNVMPKSTLDRLPCD 254

Query: 590 MSHLKSSTMIVRAFDGSRSEVVGDIEIPIQIGPCTFDITFQVMNINSAYSFLLGRPWIHS 649
            +H+K S+MIVRAFDGSR EV+G+IEIP+QIGP TF+ITFQVM+I  AYS LLGRPWIHS
Sbjct: 255 GTHMKPSSMIVRAFDGSRREVMGEIEIPVQIGPFTFNITFQVMDIKPAYSCLLGRPWIHS 314

Query: 650 AGVVPSSLYQRLKFVVDRKLVIVSGQEDIFVSRPSSMPYIEAAEEAFESSFQSFEVANST 709
           AGVVPSSL+Q+LKF+V+ KLVIVSG+ED+ VS P+   YIEA EEA E+SFQS E+ ++ 
Sbjct: 315 AGVVPSSLHQKLKFIVEDKLVIVSGEEDMLVSCPTPTRYIEATEEALETSFQSLEIISTA 374

Query: 710 TIYGERGTRKPRFSKL------------PSRGNGRSLD---DLLNVQKNMKRFGLGYKPN 769
            +    G+ +   + +            P  G G+ L+    L+++ +N  R+GLGYKP 
Sbjct: 375 YVESPMGSPQSSSASMMVAKVMMNGGYQPGLGLGKCLEGVTKLIDLPENKNRWGLGYKPT 434

Query: 770 REEMIRTQKRENKNRMMNFEHHRPRSQ-MSIPHLYESFKFAGTIHPESFEVMAVTEGKDK 829
           + +  R  +   + R+   E+  P+ Q + I HLY+SF+  G +H +    M   +  D 
Sbjct: 435 QADKRRMAEENKEKRLARLENREPKVQKIPICHLYQSFRSGGVVHADQV-AMIKEDDDDN 494

BLAST of Lsi08G005000 vs. TrEMBL
Match: A0A061E378_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_008095 PE=4 SV=1)

HSP 1 Score: 433.7 bits (1114), Expect = 5.0e-118
Identity = 247/536 (46.08%), Positives = 330/536 (61.57%), Query Frame = 1

Query: 332  KPLTIFYKEKPS-----TTTSKPKPITIQVPSPFEYKSSKAVPWNYEYKVL--VESVP-V 391
            KPLTIFY+E  S     + T     ITI+VPSPF YKS KAVPWNYE  +L  V S P  
Sbjct: 1337 KPLTIFYEENKSPMNDTSPTMSRNGITIEVPSPFPYKSDKAVPWNYECNILGTVSSTPQA 1396

Query: 392  PIDNINEIGGTTRSGKCYTPEALLKYNSK-----EKGKAKLSDVIDCRIEEPLLVRNQEV 451
              ++I  +GG TRSG+CY+PEA  K         E G  K       +++E ++  N EV
Sbjct: 1397 SFEDITGVGGITRSGRCYSPEAAEKVGKGKPAQGEGGLKKADTFSKNQVDESVVAPNNEV 1456

Query: 452  KEPASEDDIQEFLRLIKQSDYKVIEQLGKTPARISILALLLSSDLHRKTLMNILNQTYVP 511
            K P +E +  EFL+ IK S+Y V+EQL K PARIS+L+LLL+ + HR  L+ +LNQ YV 
Sbjct: 1457 KNPVTEKEEGEFLKFIKHSEYSVVEQLTKMPARISLLSLLLNLEAHRNALLKVLNQAYVA 1516

Query: 512  SDITTDNLDNIVGNITASSSITFTDDEIPPEGMGHTKALHITVKCKKFAVAKVLVDNGSS 571
             DI+ + LD+IVGNIT  + I F D+EIP  G    KALHIT+KCK  AV +VLVDNGS+
Sbjct: 1517 QDISVEKLDHIVGNITVGNFIAFNDEEIPSGGRRGNKALHITIKCKDHAVPRVLVDNGSA 1576

Query: 572  LNIMPMSTLEKLHVDMSHLKSSTMIVRAFDGSRSEVVGDIEIPIQIGPCTFDITFQVMNI 631
            LN+MP STL KL VD+S++++S M+VRAFDG+  EVVGDIE+PI+IGPC F++ FQVM+I
Sbjct: 1577 LNVMPRSTLTKLPVDVSYMRTSRMVVRAFDGTTREVVGDIELPIKIGPCIFEVQFQVMDI 1636

Query: 632  NSAYSFLLGRPWIHSAGVVPSSLYQRLKFVVDRKLVIVSGQEDIFVSRPSSMPYIEAAEE 691
              +Y+ LLGRPWIH AG +PSSL+Q++KF+ + +L+ V  +EDI   +PSS PY+EA EE
Sbjct: 1637 APSYNCLLGRPWIHMAGAIPSSLHQKVKFIAEGQLISVCAEEDILAIQPSSAPYVEATEE 1696

Query: 692  AFESSFQSFEVANSTTIYGERGTRKPRFSKLPSRG-----------------NGRSLDDL 751
              E SF+SFE  N+T +   +    PR S     G                 N + ++  
Sbjct: 1697 VPECSFRSFEFVNATYVGERKVIPTPRLSVATKMGVKQTVGKGCRAGLGLGKNLQGINRP 1756

Query: 752  LNVQKNMKRFGLGYKPNREEMIRTQKRENKNRMMNFEHHRPR-SQMSIPHLYESFKFAGT 811
            L   KN +RFGLGYK  +EE  +   ++   RM   E       + +IP LYE+F+ AG 
Sbjct: 1757 LTPMKNEERFGLGYKHTKEERRKLTAQKKIKRMAQLEGKEEEFGERTIPRLYETFRSAGF 1816

Query: 812  IHPES----------FEVMAV----TEGKDKEYPLVYLCLEDFELNNWTVFELPSI 823
            IHPE+          F+ +++     E  D + P+VY  L   EL+NWT  ELP I
Sbjct: 1817 IHPEAPPKVNQVLRIFDELSIHMIRDEEPDGKIPMVYPVLPGEELSNWTATELPII 1872

BLAST of Lsi08G005000 vs. TrEMBL
Match: A0A061E6J4_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_010507 PE=4 SV=1)

HSP 1 Score: 426.8 bits (1096), Expect = 6.1e-116
Identity = 259/584 (44.35%), Positives = 353/584 (60.45%), Query Frame = 1

Query: 301  QKVVGESEREHEIDVAEEFSTGECSKVY---LERKPLTIFYKEKPSTTTSKPKP-ITIQV 360
            Q+++ ES+ E   + A E +    SK     ++ KPLTIFY+ K      K    + I+V
Sbjct: 619  QRMMDESKIEFYTE-ASESAVNMISKESTHPMKIKPLTIFYEPKGELVEDKNHAKMVIEV 678

Query: 361  PSPFEYKSSKAVPWNYEYKVLVESVPVPID-------NINEIGGTTRSGKCYTPEALLKY 420
            P PF YK +KAVPWNY   V V      I        NI  +GG TRSG+CY+PEA    
Sbjct: 679  PKPFPYKDNKAVPWNYNCNVQVSEAKKWIAESQDDAANITGVGGITRSGRCYSPEAFENL 738

Query: 421  NSKEKGKAKLSDVIDCRIEEPLLVRNQEVKEPASEDDIQEFLRLIKQSDYKVIEQLGKTP 480
             + EKG  K     + +++ P        K   +E +  EFL+ IK S+Y V+EQL + P
Sbjct: 739  KN-EKGGEKEQSPREEKVQPPESTDGS--KRSVTEKEAAEFLKFIKHSEYNVVEQLNRMP 798

Query: 481  ARISILALLLSSDLHRKTLMNILNQTYVPSDITTDNLDNIVGNITASSSITFTDDEIPPE 540
            ARIS+L+LLLSS+ HR +LM ILNQ YV  DI+ +NLD IVGNI+  + I+F+D+EIP  
Sbjct: 799  ARISLLSLLLSSEPHRNSLMKILNQAYVDHDISVENLDYIVGNISVGNIISFSDEEIPSG 858

Query: 541  GMGHTKALHITVKCKKFAVAKVLVDNGSSLNIMPMSTLEKLHVDMSHLKSSTMIVRAFDG 600
            G G+ KALHIT KCK   VAKVL+DNGSSLN+MPM TL +L ++MS+++ S MIVRAFDG
Sbjct: 859  GRGNYKALHITTKCKGCTVAKVLLDNGSSLNVMPMRTLARLPINMSYMRKSQMIVRAFDG 918

Query: 601  SRSEVVGDIEIPIQIGPCTFDITFQVMNINSAYSFLLGRPWIHSAGVVPSSLYQRLKFVV 660
            +R EVVGDIEIP++IGPCTF I FQVM+I  +Y++LLGRPWIH AG +PSSL+Q++KF++
Sbjct: 919  TRREVVGDIEIPVEIGPCTFTIEFQVMDIAPSYNYLLGRPWIHMAGAIPSSLHQKVKFIM 978

Query: 661  DRKLVIVSGQEDIFVSRPSSMPYIEAAEEAFESSFQSFEVANSTTIYGERGTRK-PRFSK 720
            + K+V V+G+ED+ +S+P+  PY+EAAEE  E SF+SFE  N TT  GE  T   PR SK
Sbjct: 979  EGKIVCVNGEEDLLISKPADTPYVEAAEEVPECSFRSFEFVN-TTYVGEGTTPPIPRLSK 1038

Query: 721  L--------------PSRGNGRSLDDL---LNVQKNMKRFGLGYKP---NREEMIRTQKR 780
                              G G+ L  +   ++  KN ++FGLGYKP    REEMI  +++
Sbjct: 1039 TTKMIVSQILGKGYRAGAGLGKELQGIRSPIHTTKNEEKFGLGYKPTKKEREEMIAGRRK 1098

Query: 781  ENKNRMMNFEHHRPRSQ-MSIPHLYESFKFAGTIHPES------------------FEVM 829
            E   R+  F+ H    + M+ PHLY++F+  G I PES                    + 
Sbjct: 1099 E---RLARFKGHELEIRGMTYPHLYKTFRSGGCIFPESLTVENQESVSALGGTFSDLSIC 1158

BLAST of Lsi08G005000 vs. TAIR10
Match: AT1G21270.1 (AT1G21270.1 wall-associated kinase 2)

HSP 1 Score: 77.0 bits (188), Expect = 6.0e-14
Identity = 36/90 (40.00%), Positives = 50/90 (55.56%), Query Frame = 1

Query: 145 KESYISGICGLNTKMINFLDDGSQYRRQCLEGFEGNPYLQKGCQDIDECKNGSNKCKYKE 204
           K+    G+CG N+   +    G+ Y  +CLEGFEGNPYL  GCQDI+EC +  + C    
Sbjct: 235 KQVEYRGVCGGNSTCFDSTG-GTGYNCKCLEGFEGNPYLPNGCQDINECISSRHNCSEHS 294

Query: 205 LCVNEPGNYTCHCPKNHKGDGRCGGEGCTR 235
            C N  G++ C+CP  ++ D       CTR
Sbjct: 295 TCENTKGSFNCNCPSGYRKDSL---NSCTR 320
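
The percentages in each HSP header follow directly from the match counts over the aligned length (for example, 36 identical positions out of 90). A minimal sketch of how such a header line could be parsed and re-checked is shown here; the helper name `parse_hsp_stats` and the regular expression are illustrative assumptions written against the header lines on this page, not part of any BLAST tool. The pattern also tolerates the "Postives" spelling that some exports of these pages carry.

```python
import re

# Hypothetical pattern, written against the HSP header lines on this page.
# "Posi?tives" accepts both "Positives" and the misspelt "Postives".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract match counts from an HSP header and recompute the percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the header, for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 36/90 (40.00%), Positives = 50/90 (55.56%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed values with the reported ones is a cheap sanity check when these headers are scraped from a page rather than read from structured BLAST output.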

BLAST of Lsi08G005000 vs. TAIR10
Match: AT1G21240.1 (AT1G21240.1 wall associated kinase 3)

HSP 1 Score: 66.6 bits (161), Expect = 8.1e-11
Identity = 42/128 (32.81%), Positives = 66/128 (51.56%), Query Frame = 1

Query: 30  KDCDEWCGNLKIPYPFGMKEGCYL--NKNFSITCNKTHYNPPKAFLQDGNIYITNISIIQ 89
           +DC   CGN+ I YPFG+  GCY   + NF++TC        +  L  G I +TNIS   
Sbjct: 29  EDCKLKCGNVTIEYPFGISTGCYYPGDDNFNLTC-----VVEEKLLLFGIIQVTNIS-HS 88

Query: 90  GQLHILQFVATDCYTKNGHLESYNTTLQIRTFTISNTMNKFTVVGCDTYALISGILEKES 149
           G + +L    ++CY +           Q+ +    ++ NKFT+VGC+  +L+S    K++
Sbjct: 89  GHVSVLFERFSECYEQKNETNGTALGYQLGSSFSLSSNNKFTLVGCNALSLLS-TFGKQN 148

Query: 150 YISGICGL 156
           Y +G   L
Sbjct: 149 YSTGCLSL 149

BLAST of Lsi08G005000 vs. TAIR10
Match: AT1G21230.1 (AT1G21230.1 wall associated kinase 5)

HSP 1 Score: 65.1 bits (157), Expect = 2.4e-10
Identity = 29/67 (43.28%), Positives = 38/67 (56.72%), Query Frame = 1

Query: 152 ICGLNTKMINFLDDGSQYRRQCLEGFEGNPYLQKGCQDIDECKNGSNKCKYKELCVNEPG 211
           ICG N+   +    G  Y  +CL+GF+GNPYL  GCQDI+EC    + C     C N  G
Sbjct: 243 ICGGNSTCFDSTR-GKGYNCKCLQGFDGNPYLSDGCQDINECTTRIHNCSDTSTCENTLG 302

Query: 212 NYTCHCP 219
           ++ C CP
Sbjct: 303 SFHCQCP 308

BLAST of Lsi08G005000 vs. TAIR10
Match: AT1G16260.1 (AT1G16260.1 Wall-associated kinase family protein)

HSP 1 Score: 65.1 bits (157), Expect = 2.4e-10
Identity = 42/144 (29.17%), Positives = 73/144 (50.69%), Query Frame = 1

Query: 6   ERLIKIIISLSATLFAAAAAVE-ELKDCDEWCGNLKIPYPFGMKEGCYLNKNFSITCNKT 65
           +R + +++ L    +AAA+     L++C + CGN+ +PYPFG+ +GCY NK F I C  +
Sbjct: 6   KRFLVVMLLLRICEYAAASTFPLALRNCSDHCGNVSVPYPFGIGKGCYKNKWFEIVCKSS 65

Query: 66  HYNPPKAFLQDGNIYITNISI-------IQGQLHILQ-FVATDCYTKNGHLESYNTTLQI 125
               P   L      +T+ ++       +  + +I      + C  ++G+  S +  L+ 
Sbjct: 66  SDQQPILLLPRIRRAVTSFNLGDPFSISVYNKFYIQSPLKHSGCPNRDGY-SSSSLNLKG 125

Query: 126 RTFTISNTMNKFTVVGCDTYALIS 141
             F IS   NKFT VGC+  A ++
Sbjct: 126 SPFFISEN-NKFTAVGCNNKAFMN 147

BLAST of Lsi08G005000 vs. TAIR10
Match: AT1G21250.1 (AT1G21250.1 cell wall-associated kinase)

HSP 1 Score: 63.5 bits (153), Expect = 6.8e-10
Identity = 42/134 (31.34%), Positives = 67/134 (50.00%), Query Frame = 1

Query: 8   LIKIIISLSATLFAAAAAVEELKDCDEWCGNLKIPYPFGMKEGCYL--NKNFSITCNKTH 67
           L+ I  SL+ T        +  ++C   CGN+ I YPFG+  GCY   N++FSITC +  
Sbjct: 9   LVAIFFSLACTQLVKGQH-QPGENCQNKCGNITIEYPFGISSGCYYPGNESFSITCKEDR 68

Query: 68  YNPPKAFLQDGNIYITNISIIQGQLHILQFVATDCYTKNGHLESYNTTLQIRTFTISNTM 127
            +     L D  +   N S   GQL +L   ++ CY + G     +++  +   ++S   
Sbjct: 69  PH----VLSDIEVANFNHS---GQLQVLLNRSSTCYDEQGKKTEEDSSFTLENLSLS-AN 128

Query: 128 NKFTVVGCDTYALI 140
           NK T VGC+  +L+
Sbjct: 129 NKLTAVGCNALSLL 133

BLAST of Lsi08G005000 vs. NCBI nr
Match: gi|590636870|ref|XP_007028966.1| (Uncharacterized protein TCM_024883 [Theobroma cacao])

HSP 1 Score: 438.3 bits (1126), Expect = 2.9e-119
Identity = 245/536 (45.71%), Positives = 332/536 (61.94%), Query Frame = 1

Query: 332  KPLTIFYKEKPS-----TTTSKPKPITIQVPSPFEYKSSKAVPWNYEYKVLVESVPVP-- 391
            KPLTIFY+E  S     + T     ITI+VPSPF YK+ KAVPWNYE  +L  +   P  
Sbjct: 1263 KPLTIFYEENKSPMNDTSPTMIRNGITIEVPSPFPYKNDKAVPWNYECNILGTASSAPQA 1322

Query: 392  -IDNINEIGGTTRSGKCYTPEALLKYNSK-----EKGKAKLSDVIDCRIEEPLLVRNQEV 451
              ++I  +GG TRSG+CY+PE   +         E G  K       +++E ++  N EV
Sbjct: 1323 SFEDITGVGGITRSGRCYSPEVAERVEKGKPAQGEGGLKKADTFSKDQVDEFVVAPNNEV 1382

Query: 452  KEPASEDDIQEFLRLIKQSDYKVIEQLGKTPARISILALLLSSDLHRKTLMNILNQTYVP 511
            K P +E +  EFL+ IK S+Y V+EQL K PA IS+L+LLL+S+ H+  L+ +LNQ YV 
Sbjct: 1383 KSPVTEKEAGEFLKFIKHSEYSVVEQLTKMPAPISLLSLLLNSEAHKNALLKVLNQAYVA 1442

Query: 512  SDITTDNLDNIVGNITASSSITFTDDEIPPEGMGHTKALHITVKCKKFAVAKVLVDNGSS 571
             DI+ + LD+IVGNIT  + I F D+EIPP G G  KALHIT+KCK  AV +VLVDNGS+
Sbjct: 1443 QDISVEKLDHIVGNITVGNFIAFNDEEIPPGGRGSNKALHITIKCKDHAVPRVLVDNGSA 1502

Query: 572  LNIMPMSTLEKLHVDMSHLKSSTMIVRAFDGSRSEVVGDIEIPIQIGPCTFDITFQVMNI 631
            LN+MP STL KL VD+S+++ S M+VRAFDG+  EVVGDIE+PI+IGPC F++ FQVM+I
Sbjct: 1503 LNVMPRSTLTKLLVDVSYMRPSRMVVRAFDGTTREVVGDIELPIKIGPCIFEVQFQVMDI 1562

Query: 632  NSAYSFLLGRPWIHSAGVVPSSLYQRLKFVVDRKLVIVSGQEDIFVSRPSSMPYIEAAEE 691
              +Y+ LLGRPWIH AG +PSSL+Q++KF+ + +L+ V  +EDI   +PSS PY+EA EE
Sbjct: 1563 APSYNCLLGRPWIHMAGAIPSSLHQKVKFIAEGQLISVCAEEDILAIQPSSAPYVEATEE 1622

Query: 692  AFESSFQSFEVANSTTIYGERGTRKPRFSKLPSRG-----------------NGRSLDDL 751
              E SF+SFE  N+T +   +    PR S     G                 N + ++  
Sbjct: 1623 VPECSFRSFEFVNATYVGERKVIPTPRLSVATKMGVKQTVGKGCRVGLGLGKNLQGINRP 1682

Query: 752  LNVQKNMKRFGLGYKPNREEMIRTQKRENKNRMMNFEHHRPR-SQMSIPHLYESFKFAGT 811
            L   KN +RFGLGYKP +EE  +   ++   RM   E  +    + +IPHLYE+F+ AG 
Sbjct: 1683 LTPMKNEERFGLGYKPTKEERRKLTAQKKIKRMAQLEGKKEEFGERTIPHLYETFRSAGF 1742

Query: 812  IHPES----------FEVMAV----TEGKDKEYPLVYLCLEDFELNNWTVFELPSI 823
            IHPE+          F+ +++     E  D + P+VY  L   EL+NWT  ELP I
Sbjct: 1743 IHPEAPPKVNQVLRIFDELSIHMIRDEEPDGKIPVVYPVLPGEELSNWTATELPII 1798

BLAST of Lsi08G005000 vs. NCBI nr
Match: gi|590630969|ref|XP_007027433.1| (Gag-pro-like protein [Theobroma cacao])

HSP 1 Score: 438.0 bits (1125), Expect = 3.8e-119
Identity = 245/536 (45.71%), Positives = 332/536 (61.94%), Query Frame = 1

Query: 332 KPLTIFYKEKPS-----TTTSKPKPITIQVPSPFEYKSSKAVPWNYEYKVLVESVPVP-- 391
           KPLTIFY+E  S     + T     ITI+VP+PF YKS KAVPWNY+  +   +   P  
Sbjct: 414 KPLTIFYEENRSPMNDTSPTMIRSGITIEVPNPFPYKSDKAVPWNYQCNISGTASSAPQA 473

Query: 392 -IDNINEIGGTTRSGKCYTPEALLKYNSK-----EKGKAKLSDVIDCRIEEPLLVRNQEV 451
             +++  +GG TRSG+CY+PE   K   +     E G  K       +++E ++  N EV
Sbjct: 474 SFEDLTGVGGITRSGRCYSPEVAEKVGKEKLTQGEGGLKKADTFSKDQVDESVVAPNNEV 533

Query: 452 KEPASEDDIQEFLRLIKQSDYKVIEQLGKTPARISILALLLSSDLHRKTLMNILNQTYVP 511
           K P +E +  EFL+ IK S+Y V+EQL K PARIS+L+LLL+S+ HR  L+ +LNQ YV 
Sbjct: 534 KNPVTEKEAGEFLKFIKHSEYSVVEQLTKMPARISLLSLLLNSEAHRNALLKVLNQAYVA 593

Query: 512 SDITTDNLDNIVGNITASSSITFTDDEIPPEGMGHTKALHITVKCKKFAVAKVLVDNGSS 571
            DI+ + LD+IVGNIT  + I F D+EIP  G G  KALHIT+KCK  AV +VLVDNGS+
Sbjct: 594 QDISVEKLDHIVGNITVGNFIAFNDEEIPSGGRGSNKALHITIKCKDHAVPRVLVDNGSA 653

Query: 572 LNIMPMSTLEKLHVDMSHLKSSTMIVRAFDGSRSEVVGDIEIPIQIGPCTFDITFQVMNI 631
           LN+MP STL KL VD+S++++S M+VRAFDG+  EVVGDIE+PI+IGPC F++ FQVM+I
Sbjct: 654 LNVMPRSTLTKLPVDVSYMRTSRMVVRAFDGTTREVVGDIELPIKIGPCIFEVQFQVMDI 713

Query: 632 NSAYSFLLGRPWIHSAGVVPSSLYQRLKFVVDRKLVIVSGQEDIFVSRPSSMPYIEAAEE 691
             +Y+ LLGRPWIH AG VPSSL+Q++KF+   +L+ V  +EDI   +PSS PY+EA EE
Sbjct: 714 APSYNCLLGRPWIHMAGAVPSSLHQKVKFIAKGQLISVCAEEDILAIQPSSAPYVEATEE 773

Query: 692 AFESSFQSFEVANSTTIYGERGTRKPRFSKLPSRG-----------------NGRSLDDL 751
             E SF+SFE  N+T I  ++    PR S     G                 N + ++  
Sbjct: 774 VPECSFRSFEFVNATYIGEKKVIPTPRLSVATKMGVKQTVGKGCRAGLGLGKNLQGINRP 833

Query: 752 LNVQKNMKRFGLGYKPNREEMIRTQKRENKNRMMNFEHHRPR-SQMSIPHLYESFKFAGT 811
           L   KN +RFGLGYKP +EE  +   ++   RM   E       + +IPHLYE+F+ AG 
Sbjct: 834 LTPMKNEERFGLGYKPTKEERRKLTAQKKIKRMAQLEGKEEEFGERTIPHLYETFRSAGF 893

Query: 812 IHPES----------FEVMAV----TEGKDKEYPLVYLCLEDFELNNWTVFELPSI 823
           IHPE+          F+ +++     E  D + P+VY  L   EL+NWT  ELP +
Sbjct: 894 IHPEAPPKVNQVLRMFDELSIHMIRDEEPDGKIPVVYPVLPGEELSNWTATELPIV 949

BLAST of Lsi08G005000 vs. NCBI nr
Match: gi|1012324735|gb|KYP36696.1| (hypothetical protein KK1_042162 [Cajanus cajan])

HSP 1 Score: 437.2 bits (1123), Expect = 6.5e-119
Identity = 237/511 (46.38%), Positives = 332/511 (64.97%), Query Frame = 1

Query: 350 KPITIQVPSPFEYKSSKAVPWNYEYKVL-----------VESVPVPIDNINEIGGTTRSG 409
           KP+ +Q+P+PF YK +KAVPW Y+ KV            V++    I NI  +GG TRSG
Sbjct: 15  KPLVVQIPAPFHYKDTKAVPWRYDAKVKSDYLNAQQKKGVDTARTNITNITGVGGMTRSG 74

Query: 410 KCYTPEALLKYNSKEKGKAKLSDVID-----CRIEEPLLVRNQEVKEPASEDDIQEFLRL 469
           + YTPE L   +     + K + +I+      R  +   V ++  KE  S+++  EFL+ 
Sbjct: 75  RVYTPEELRVKDFTRHHEEKENTIINEGVSGVRRRDDKKVVDERKKE-VSDEEASEFLKF 134

Query: 470 IKQSDYKVIEQLGKTPARISILALLLSSDLHRKTLMNILNQTYVPSDITTDNLDNIVGNI 529
           I+QS+YK+I+QL  TPAR+S+L++L++S+ HRK LM ILN+ +V +DIT D    IVGNI
Sbjct: 135 IRQSEYKLIDQLNHTPARVSLLSVLMNSESHRKLLMKILNEAHVSNDITLDTFGGIVGNI 194

Query: 530 TASSSITFTDDEIPPEGMGHTKALHITVKCKKFAVAKVLVDNGSSLNIMPMSTLEKLHVD 589
           TA++ +TFTDDE+P EG GH KALHI+VKC    +A+VL+DNGSSLN+MP STL++L  D
Sbjct: 195 TANNHLTFTDDEVPAEGRGHNKALHISVKCANHILARVLIDNGSSLNVMPKSTLDRLPCD 254

Query: 590 MSHLKSSTMIVRAFDGSRSEVVGDIEIPIQIGPCTFDITFQVMNINSAYSFLLGRPWIHS 649
            +H+K S+MIVRAFDGSR EV+G+IEIP+QIGP TF+ITFQVM+I  AYS LLGRPWIHS
Sbjct: 255 GTHMKPSSMIVRAFDGSRREVMGEIEIPVQIGPFTFNITFQVMDIKPAYSCLLGRPWIHS 314

Query: 650 AGVVPSSLYQRLKFVVDRKLVIVSGQEDIFVSRPSSMPYIEAAEEAFESSFQSFEVANST 709
           AGVVPSSL+Q+LKF+V+ KLVIVSG+ED+ VS P+   YIEA EEA E+SFQS E+ ++ 
Sbjct: 315 AGVVPSSLHQKLKFIVEDKLVIVSGEEDMLVSCPTPTRYIEATEEALETSFQSLEIISTA 374

Query: 710 TIYGERGTRKPRFSKL------------PSRGNGRSLD---DLLNVQKNMKRFGLGYKPN 769
            +    G+ +   + +            P  G G+ L+    L+++ +N  R+GLGYKP 
Sbjct: 375 YVESPMGSPQSSSASMMVAKVMMNGGYQPGLGLGKCLEGVTKLIDLPENKNRWGLGYKPT 434

Query: 770 REEMIRTQKRENKNRMMNFEHHRPRSQ-MSIPHLYESFKFAGTIHPESFEVMAVTEGKDK 829
           + +  R  +   + R+   E+  P+ Q + I HLY+SF+  G +H +    M   +  D 
Sbjct: 435 QADKRRMAEENKEKRLARLENREPKVQKIPICHLYQSFRSGGVVHADQV-AMIKEDDDDN 494

BLAST of Lsi08G005000 vs. NCBI nr
Match: gi|590690716|ref|XP_007043584.1| (Uncharacterized protein TCM_008095 [Theobroma cacao])

HSP 1 Score: 433.7 bits (1114), Expect = 7.1e-118
Identity = 247/536 (46.08%), Positives = 330/536 (61.57%), Query Frame = 1

Query: 332  KPLTIFYKEKPS-----TTTSKPKPITIQVPSPFEYKSSKAVPWNYEYKVL--VESVP-V 391
            KPLTIFY+E  S     + T     ITI+VPSPF YKS KAVPWNYE  +L  V S P  
Sbjct: 1337 KPLTIFYEENKSPMNDTSPTMSRNGITIEVPSPFPYKSDKAVPWNYECNILGTVSSTPQA 1396

Query: 392  PIDNINEIGGTTRSGKCYTPEALLKYNSK-----EKGKAKLSDVIDCRIEEPLLVRNQEV 451
              ++I  +GG TRSG+CY+PEA  K         E G  K       +++E ++  N EV
Sbjct: 1397 SFEDITGVGGITRSGRCYSPEAAEKVGKGKPAQGEGGLKKADTFSKNQVDESVVAPNNEV 1456

Query: 452  KEPASEDDIQEFLRLIKQSDYKVIEQLGKTPARISILALLLSSDLHRKTLMNILNQTYVP 511
            K P +E +  EFL+ IK S+Y V+EQL K PARIS+L+LLL+ + HR  L+ +LNQ YV 
Sbjct: 1457 KNPVTEKEEGEFLKFIKHSEYSVVEQLTKMPARISLLSLLLNLEAHRNALLKVLNQAYVA 1516

Query: 512  SDITTDNLDNIVGNITASSSITFTDDEIPPEGMGHTKALHITVKCKKFAVAKVLVDNGSS 571
             DI+ + LD+IVGNIT  + I F D+EIP  G    KALHIT+KCK  AV +VLVDNGS+
Sbjct: 1517 QDISVEKLDHIVGNITVGNFIAFNDEEIPSGGRRGNKALHITIKCKDHAVPRVLVDNGSA 1576

Query: 572  LNIMPMSTLEKLHVDMSHLKSSTMIVRAFDGSRSEVVGDIEIPIQIGPCTFDITFQVMNI 631
            LN+MP STL KL VD+S++++S M+VRAFDG+  EVVGDIE+PI+IGPC F++ FQVM+I
Sbjct: 1577 LNVMPRSTLTKLPVDVSYMRTSRMVVRAFDGTTREVVGDIELPIKIGPCIFEVQFQVMDI 1636

Query: 632  NSAYSFLLGRPWIHSAGVVPSSLYQRLKFVVDRKLVIVSGQEDIFVSRPSSMPYIEAAEE 691
              +Y+ LLGRPWIH AG +PSSL+Q++KF+ + +L+ V  +EDI   +PSS PY+EA EE
Sbjct: 1637 APSYNCLLGRPWIHMAGAIPSSLHQKVKFIAEGQLISVCAEEDILAIQPSSAPYVEATEE 1696

Query: 692  AFESSFQSFEVANSTTIYGERGTRKPRFSKLPSRG-----------------NGRSLDDL 751
              E SF+SFE  N+T +   +    PR S     G                 N + ++  
Sbjct: 1697 VPECSFRSFEFVNATYVGERKVIPTPRLSVATKMGVKQTVGKGCRAGLGLGKNLQGINRP 1756

Query: 752  LNVQKNMKRFGLGYKPNREEMIRTQKRENKNRMMNFEHHRPR-SQMSIPHLYESFKFAGT 811
            L   KN +RFGLGYK  +EE  +   ++   RM   E       + +IP LYE+F+ AG 
Sbjct: 1757 LTPMKNEERFGLGYKHTKEERRKLTAQKKIKRMAQLEGKEEEFGERTIPRLYETFRSAGF 1816

Query: 812  IHPES----------FEVMAV----TEGKDKEYPLVYLCLEDFELNNWTVFELPSI 823
            IHPE+          F+ +++     E  D + P+VY  L   EL+NWT  ELP I
Sbjct: 1817 IHPEAPPKVNQVLRIFDELSIHMIRDEEPDGKIPMVYPVLPGEELSNWTATELPII 1872

BLAST of Lsi08G005000 vs. NCBI nr
Match: gi|590695072|ref|XP_007044788.1| (Uncharacterized protein TCM_010507 [Theobroma cacao])

HSP 1 Score: 426.8 bits (1096), Expect = 8.7e-116
Identity = 259/584 (44.35%), Positives = 353/584 (60.45%), Query Frame = 1

Query: 301  QKVVGESEREHEIDVAEEFSTGECSKVY---LERKPLTIFYKEKPSTTTSKPKP-ITIQV 360
            Q+++ ES+ E   + A E +    SK     ++ KPLTIFY+ K      K    + I+V
Sbjct: 619  QRMMDESKIEFYTE-ASESAVNMISKESTHPMKIKPLTIFYEPKGELVEDKNHAKMVIEV 678

Query: 361  PSPFEYKSSKAVPWNYEYKVLVESVPVPID-------NINEIGGTTRSGKCYTPEALLKY 420
            P PF YK +KAVPWNY   V V      I        NI  +GG TRSG+CY+PEA    
Sbjct: 679  PKPFPYKDNKAVPWNYNCNVQVSEAKKWIAESQDDAANITGVGGITRSGRCYSPEAFENL 738

Query: 421  NSKEKGKAKLSDVIDCRIEEPLLVRNQEVKEPASEDDIQEFLRLIKQSDYKVIEQLGKTP 480
             + EKG  K     + +++ P        K   +E +  EFL+ IK S+Y V+EQL + P
Sbjct: 739  KN-EKGGEKEQSPREEKVQPPESTDGS--KRSVTEKEAAEFLKFIKHSEYNVVEQLNRMP 798

Query: 481  ARISILALLLSSDLHRKTLMNILNQTYVPSDITTDNLDNIVGNITASSSITFTDDEIPPE 540
            ARIS+L+LLLSS+ HR +LM ILNQ YV  DI+ +NLD IVGNI+  + I+F+D+EIP  
Sbjct: 799  ARISLLSLLLSSEPHRNSLMKILNQAYVDHDISVENLDYIVGNISVGNIISFSDEEIPSG 858

Query: 541  GMGHTKALHITVKCKKFAVAKVLVDNGSSLNIMPMSTLEKLHVDMSHLKSSTMIVRAFDG 600
            G G+ KALHIT KCK   VAKVL+DNGSSLN+MPM TL +L ++MS+++ S MIVRAFDG
Sbjct: 859  GRGNYKALHITTKCKGCTVAKVLLDNGSSLNVMPMRTLARLPINMSYMRKSQMIVRAFDG 918

Query: 601  SRSEVVGDIEIPIQIGPCTFDITFQVMNINSAYSFLLGRPWIHSAGVVPSSLYQRLKFVV 660
            +R EVVGDIEIP++IGPCTF I FQVM+I  +Y++LLGRPWIH AG +PSSL+Q++KF++
Sbjct: 919  TRREVVGDIEIPVEIGPCTFTIEFQVMDIAPSYNYLLGRPWIHMAGAIPSSLHQKVKFIM 978

Query: 661  DRKLVIVSGQEDIFVSRPSSMPYIEAAEEAFESSFQSFEVANSTTIYGERGTRK-PRFSK 720
            + K+V V+G+ED+ +S+P+  PY+EAAEE  E SF+SFE  N TT  GE  T   PR SK
Sbjct: 979  EGKIVCVNGEEDLLISKPADTPYVEAAEEVPECSFRSFEFVN-TTYVGEGTTPPIPRLSK 1038

Query: 721  L--------------PSRGNGRSLDDL---LNVQKNMKRFGLGYKP---NREEMIRTQKR 780
                              G G+ L  +   ++  KN ++FGLGYKP    REEMI  +++
Sbjct: 1039 TTKMIVSQILGKGYRAGAGLGKELQGIRSPIHTTKNEEKFGLGYKPTKKEREEMIAGRRK 1098

Query: 781  ENKNRMMNFEHHRPRSQ-MSIPHLYESFKFAGTIHPES------------------FEVM 829
            E   R+  F+ H    + M+ PHLY++F+  G I PES                    + 
Sbjct: 1099 E---RLARFKGHELEIRGMTYPHLYKTFRSGGCIFPESLTVENQESVSALGGTFSDLSIC 1158

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
WAK2_ARATH   1.1e-12   40.00         Wall-associated receptor kinase 2 OS=Arabidopsis thaliana GN=WAK2 PE=1 SV=1
WAK3_ARATH   1.4e-09   32.81         Wall-associated receptor kinase 3 OS=Arabidopsis thaliana GN=WAK3 PE=2 SV=2
WAK5_ARATH   4.2e-09   43.28         Wall-associated receptor kinase 5 OS=Arabidopsis thaliana GN=WAK5 PE=2 SV=1
WAKLG_ARATH  4.2e-09   29.17         Wall-associated receptor kinase-like 8 OS=Arabidopsis thaliana GN=WAKL8 PE=2 SV=...
WAK1_ARATH   1.2e-08   31.34         Wall-associated receptor kinase 1 OS=Arabidopsis thaliana GN=WAK1 PE=1 SV=2
Match Name        E-value   Identity (%)  Description
A0A061EXR3_THECC  2.0e-119  45.71         Uncharacterized protein OS=Theobroma cacao GN=TCM_024883 PE=4 SV=1
A0A061ESA1_THECC  2.6e-119  45.71         Gag-pro-like protein OS=Theobroma cacao GN=TCM_022266 PE=4 SV=1
A0A151R2D5_CAJCA  4.5e-119  46.38         Uncharacterized protein OS=Cajanus cajan GN=KK1_042162 PE=4 SV=1
A0A061E378_THECC  5.0e-118  46.08         Uncharacterized protein OS=Theobroma cacao GN=TCM_008095 PE=4 SV=1
A0A061E6J4_THECC  6.1e-116  44.35         Uncharacterized protein OS=Theobroma cacao GN=TCM_010507 PE=4 SV=1
Match Name   E-value   Identity (%)  Description
AT1G21270.1  6.0e-14   40.00         wall-associated kinase 2
AT1G21240.1  8.1e-11   32.81         wall associated kinase 3
AT1G21230.1  2.4e-10   43.28         wall associated kinase 5
AT1G16260.1  2.4e-10   29.17         Wall-associated kinase family protein
AT1G21250.1  6.8e-10   31.34         cell wall-associated kinase
Match Name                        E-value   Identity (%)  Description
gi|590636870|ref|XP_007028966.1|  2.9e-119  45.71         Uncharacterized protein TCM_024883 [Theobroma cacao]
gi|590630969|ref|XP_007027433.1|  3.8e-119  45.71         Gag-pro-like protein [Theobroma cacao]
gi|1012324735|gb|KYP36696.1|      6.5e-119  46.38         hypothetical protein KK1_042162 [Cajanus cajan]
gi|590690716|ref|XP_007043584.1|  7.1e-118  46.08         Uncharacterized protein TCM_008095 [Theobroma cacao]
gi|590695072|ref|XP_007044788.1|  8.7e-116  44.35         Uncharacterized protein TCM_010507 [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0030247  polysaccharide binding
GO:0005509  calcium ion binding
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR025287  WAK_GUB
IPR021109  Peptidase_aspartic_dom_sf
IPR018097  EGF_Ca-bd_CS
IPR001881  EGF-like_Ca-bd_dom
IPR000742  EGF-like_dom
IPR000152  EGF-type_Asp/Asn_hydroxyl_site
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006468 protein phosphorylation
biological_process GO:0009069 serine family amino acid metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0005509 calcium ion binding
molecular_function GO:0030247 polysaccharide binding
molecular_function GO:0005515 protein binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0004674 protein serine/threonine kinase activity
molecular_function GO:0004672 protein kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi08G005000.1  Lsi08G005000.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description                                               Source   Source Term       Source Description  Alignment
IPR000152  EGF-type aspartate/asparagine hydroxylation site              PROSITE  PS00010           ASX_HYDROXYL        coord: 206..217; score: ...
IPR000742  EGF-like domain                                               PROFILE  PS50026           EGF_3               coord: 189..228; score: 12
IPR001881  EGF-like calcium-binding domain                               SMART    SM00179           egfca_6             coord: 189..228; score: 6.
IPR018097  EGF-like calcium-binding, conserved site                      PROSITE  PS01187           EGF_CA              coord: 189..215; score: ...
IPR021109  Aspartic peptidase domain                                     unknown  SSF50630          Acid proteases      coord: 548..636; score: 8.8
IPR025287  Wall-associated receptor kinase, galacturonan-binding domain  PFAM     PF13947           GUB_WAK_bind        coord: 30..135; score: 6.6
None       No IPR available                                              GENE3D   G3DSA:2.10.25.10  -                   coord: 187..228; score: 2.1
None       No IPR available                                              PANTHER  PTHR27005         FAMILY NOT NAMED    coord: 7..234; score: 6.8
None       No IPR available                                              unknown  SSF57196          EGF/Laminin         coord: 188..224; score: 1.0
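
The `coord: start..end` values above give the span of each domain hit. As a minimal sketch, assuming the coordinates are 1-based and inclusive residue positions on the predicted protein (the page does not state the convention, so this is an assumption), the length of each hit can be derived as follows. The helper name `span_length` and the `hits` dictionary are illustrative only; the coordinates are copied from three of the rows above.

```python
import re

# Hypothetical pattern for the "coord: start..end" field of the Alignment column.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_length(alignment_field):
    """Return the hit length, assuming 1-based inclusive coordinates."""
    m = COORD_RE.search(alignment_field)
    if m is None:
        raise ValueError("no coord range in %r" % alignment_field)
    start, end = int(m.group(1)), int(m.group(2))
    return end - start + 1


# Coordinates copied from the annotation rows; keys are the source terms.
hits = {
    "PF13947": "coord: 30..135",    # GUB_WAK_bind
    "PS50026": "coord: 189..228",   # EGF_3
    "SSF50630": "coord: 548..636",  # Acid proteases
}
lengths = {term: span_length(field) for term, field in hits.items()}
print(lengths)
```

Under that assumption the galacturonan-binding hit spans 106 residues, the EGF-like hit 40, and the acid protease hit 89.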

The following gene(s) are orthologous to this gene:
Gene          Orthologue       Organism                   Block
Lsi08G005000  MELO3C008446.2   Melon (DHL92) v3.6.1       lsimedB494
Lsi08G005000  Cla97C08G148980  Watermelon (97103) v2      lsiwmbB445
Lsi08G005000  Cp4.1LG01g20880  Cucurbita pepo (Zucchini)  cpelsiB382
Lsi08G005000  CsGy6G017900     Cucumber (Gy14) v2         cgyblsiB447
The following gene(s) are paralogous to this gene:

None