Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGCGAAAGGATAAGGACGCTCTGCTAGTGCCCTTTGATTTTGAAATTGAAAAGACTTGCAAAAAGAACAGGAAAGAGAAGAGGGAAAGACTTGCATCAATGGCTAATCCAAATCCACAAGATGAGCAAAAGCCGATACGGGACTATTTCCAGCCTACTTTTCCTGATCAACAATCTGGGATAGTCTACGCGCCGATTAATGCAAATAATTTTGAGCTGAAAACTGGCCTCATCCAGATGGCTAGAGATAGTGCATTTAGAGGATTCCCCTCTGAGGATCCTAATTCTCATTTAAAATCTTTTCTTGACATCTGTGGGACTGTGAAGTTGAATGGTGTCTCTGAAGATGCCATACGCTTACGATTGTTTCCTTTCTCTTTGCAGGACAAAGCTAGAGACTGGCTCAGATCACTTCCATCTGGAAGCATAACTACGTGGGACGCGTTGGTTCAAGCTTTTCTTGCCAAATATTTCTCACCTGCAAAGACAGTCAAGCTTAGGACAGAGATTGGGACATTCCAGCAGCTAGGCGATGAACAACTATTTGAGGCTTGGGAGCGCTATAAGGAGTTACTGAGGAAATGCCCTCAGCATGGATACCCAGATTGGCTGCAGATCCAGTTGTTTTATAATGGTCTAAATCCAAATACTAAAACCATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACTGTAGAGAATGCAAGAACTTTACTTGAGGATATGGCCACGAACAGCTACCAGTGGCCAACTGAGCGGTCTGCACCTAAGAAAATTGCAGCTGGGATTTATGAGATCGATAATGTAAGTTCGCTTCAAGCCCAGATGACCTCCCTTGCTAATGCTTTCATGAAATTCTCAGGTACAGGGAGTGCACAATCTATTGAGTCAGCTGCTCTTGCATCTCAGTGTCAGGAGGAGACCACTACTGAACAGGTTCATTATGTTGAAAGAAATTCTAACTATAGGGGACACCATAATTCTACACCCACACACTACCACCCTAATGTTAGAAATCATGAAAATTTCTCATATGCTAACACTAAGAATATATTGAACCCTCCTGGTTATGCATCTCAAAAGGTTGAAAATAAGCCTTCTCTTGAAGATATAGTTGGAGCTTTTATTGCAGAATCGAGCAAAAGGACCAACAAGTTGGAGGAAGCAGTGATTGCAATAAATACCACTGTGACAGGCCACGGAGCAGCAATTAAAAACATCGAAACTCAATTGGGTCAACTGGTGAATGCTGTAAGCAGTATGCAGAAAGGTAAAACCACAGCTGAACCAGAGAAAACCCAAATGGAGTATTGCAAGGCCATTACCGTACACCAAGTGGAGGAAGTTCAAGTAGTTGATACACAGGAGATTCATGAGCCTGAAGTCACTAAGGAAGAAGTTGAAGAGGGTTCATCTTCAACCGAAGCTGAAAAACTCACTTCTGACCCTCTTATTCCTTCACCTACTGTTTTGGTTCCAAAGCCCAAGAAAAAGAAGAAAAAGAATTACTCAACTCAATTCAAAAAGTTTCTTGATATTTTTATGAGTTTAAATATTAATTTACCATTTGCAGAGGCTTTGGAGCAGATGCCCAAATATGTACAATTTATGAAGGAATGGCTTTCGAGGAAAAAGAAGGAAAAGAAGTTGAGACAATATTCCTTACCTCTACATGCAGTGCCCGACTTCAAAAGAATGTGCCTGACAAACTTGCTGATCCAGGGAGTTTTTCTTTTCCCTGCAATTTTGGTACTCATTCTTTTCGTGCTTTGTGTGATTTAGGCGCGAGTATTAACATAATTCCTTTATCTTTATGTAAAAAGTTAAACATAGGAGAGATTAAACCTACTGCAGTAAAACTCCAGTTAGCTGACCAATCTGTAGTTAGTCCGTATGGAGTAGTAGAGAATGTATTGATTCAAGTAGGGAAATTTTTTCTGCCAGTTGACTTCTTTGTGATGGATGTAAAGGAGAACCCTGTAGTGCCTGTGATT
TTAGGGAGACCATTCCTCGCAACTGGGCGAGTTATCATTGACATTGAGCGTAGGGAGCTGACTGTGCGAGTCTTACATGAAAAAGAAATTTTTAAAGCAGTTGGAGCCTCTACAAATTGCTCTGAAGTTATGATGGTAAGTTACAGAAAAGGTGCAAGCAAGAGCACCTCCGGTGGAGCCCGTGACAATAGACCTCCTTGAAGCATATACGGTCACCACGTCGAGCTAACGACGTTAAACAAGCGCTTTTGGGAGGCAACCCAAGTGTTTTTATCTTAGTCTTGTTTTTATTTTATTTTATTTTATTGTACTTCTATTGATTTAATTCTTTAAATTCGATCTTCTTATTTTTATTTTTATTTTTTTTTTTTTCTCACTTTTTCTTTCTTTTTGATGCAGGTGAAATTTTATTTTTGTTGAAAAAAAAAAACAGCGCAAAAACGCGCGTTTTGGGCTAAAATCGCGACAGCGTCGCGACGCTTTAGCCCTTAGCGTCGCGACGCTGTCGCGATTCAGAGCCTAAACAGCACAGAATCGAAAATTTTTTTTTCTGAAGTGAAACCCTAGCCGCCGCCCTTAGCCCCTTCGTTTCAATGCTCTTCTTCTTCGATTTTAAGCATTTTCTTCAAATTTCTTGCTCACACTCTTAGATTTACCCTTTAGTGCCCGATTTTTGCAGTTTCGAAGTGTTTGTGCCGTGGGTTGCTCGATTTTCGGCCGGTTGCAGCGTTTAGTTAGTGGGTTTTAGCGTTTTTGGCTTAATCTCATTGTGGGTTTTGATTTCTTGGTATTGTTTTCAGAGATTGTCTAGCATTTGAAGCCTCTTCTCGCCGCCAGTTGAAGTTTTTTGGTCGAGTTGCAGCCGTTGTATGCTCGAGTTCGCCGTGGGTTTCGAGTCCCTCAAGTTTGGGCGAGCTTTTGGGTTGTTTTCTTGCGTTTTGTCCGAGTTCGCCGTGGGTTGCAAGTTTCGAGATTTTCAGCAAGTTTCCGGCGATTCTTCAGCTTGTTTTAGCCGTGGGTATAAATTTTTGGTATCACCCTCCTCTTTTCAGCATAAATCTTAACATAAGAGTTAATTTCAGTTGGTTTAGAGCTTGGATTGTTTGAGTTTGAGGCTGAAAATTGATTTCTTGTTGCTGTGAGGTGCTTTGAGCTGAATATTGAAGTTGTTTATTGCTGTTTATGGGTGTTTTTAGAGAGGATTAGCTGCTGGTTTTTATGTATTTGAGCCCATATATGCTGAAATATTGATGTTTTGCTAGTTTTGAGTATTGGGTTTCATTCAAGCATCATTGTGTGCCTTCTTGGAGAAGTTTTATTAGGCTATTTGTATTGTTTAGTGTTTTTTGGACTCTACTGCTGTTGTTGTTGGGCTCTGTTGCTGCTGTTGTGACTTTATTTCTGAGGCGGACGGGCTATATTTGAAGTTTGTGGAGTGGTTGGTAGCTTGGGACTGTTGATAGAAGTATTTTTCTCCTAGCTTTTAGTACCTCTCCATTAGTTACAATGCATAAAAGATCTAGGAGACATGCTGCCTCTTCTTCCTCCTCCCCTCCACCACCTTTTGACCAAGATAGGTTCGTTAGTGCTGAAGCCTCAGCGAGGTTTGATAAATATGTGGAGGGTAGAAATTTTATTCCTGAGCGTGGTTTTAGCCCCGACCCCGAGATGCAACCAAACCTTGTTAATAACATTGTTGCGCGGGGTTGGGGCGACTTTGTTCATCATCCTGCACCTGGGGTAGCAACTATAGTGAGGGAGTTCTATGCCAACATGGAAACCTCCTCTTCATCATCCTTTGTTCGAGGACACCGTGTCCTCTACGACCCCCTCACCATAAATAGGTTTTATAGTTTGCCCAACTTTGACAGGGATGAATATAGTACCTATCTTCATGGCCACTTGGATGTAAACGAAGTCATTCAGACTATTTGTAGGCCAGGGGCAGAATGGATTATGACTGGCGCAGAGGTGGTGAGATTCAAAACCACTGATTTGTT
CGTAGATTACAGAGCGTGGCATACCTTCCTCTGTGCCAAGTTGATGCCTGTGGCGCATCTTAGCGATGTTACCAAGAGTCGTGCCATCCTGCTTTTCGCTATCGCCACAGGTCGCTCCGTCAATGTCGGTCAGGTCATCCATCAGTCTATGAACCATATTCGCCGACGTTACACGACAGTGGGGCTCGGGCATCCTTCACTGATCACAGCCCTTTGCCGAGCTGCTGGTTTCGTGTGGGACGCCCAAGAGGAGTTGGTTCATCCTGGAGCGCTAATCGACAAGAACTTCATCAGTCGCTACAGAGGACCTGGACCACAGGGCGCACAACCACCCCTCACTATCCATGCACCGCCACAGCATCATGAGCAGCACGAGCAGCCTGCAGAGCCTGAGCAGCAGGAGCAGGAGATTCCACATCCCTCCATCGAGGAGCAGCTGCAGCAGCTGCGTATGGAGTTTCAGAGCCATCGCCTAGACTTCCAGACTCTCCAGCAGGGCATCCAGAGCCAGCAGCGAGAGCACCAGAGGAGAGGCGTAGAGATCGTCGCCATTTCCTCTACTCCACGAGCATGCATGCCCACACCTATCAGTGTCAGGTAGCTATGAGTACGGGTCAGCCTTTACCGCCACCTTTACCACCGTACGAGTCGCCTGAGGACGAGGATGAGGACGCGTGATGCTCTGCCGCCTTCCCTCTGCACACGGTTGTTTTCTTTCCCCCTCTTTCTTATTTTTCATTGTATTTGTTCTCATATTGGTCTGTACATCTTATTATTTTACATTGAGGACAATGTAATTTCTAAGTTTGGGGGTGGAAAGTATTTACTTTTACTTTTGGGTCTGTAATTATTATTTGATATATCTCTTTTGACCATTCTCTTTGTTGGATTATCAGGTGTAAATTTTTTTTTATTATTCAGGTTATTATTTTTAATTATTATTTTTTTTCCTTGAAATTGAGTTGATTGTGTTATCTGTCTGAGATATATTTTCTTTGAGTGTGTTGTTTTGATTATGACAGCCGATGAGGATACCTAATTTCAGAAAACAGGTAAGTTATTCTGTGCATCAATAATTAAAAACAGGATTCGAGACAGGTTGATTTTATTTTATGAATCTCTTGCACCATTTGAGCACGTAGTCTCCTGTCTTAGTTTGGCGAGCTTAGAATAGAGTTATGTGATCTAGATCTTGTTGTCACCTCGAAAGGTTTAGGAACTGATTTGATGGTTAATCTGCATGATAGTTAGTAATGTTACACTTAGGAAAAAATGATGCCATATATTGCATGTTAATGAGAAAAAAAAAAATGAATGAATGAAAGAAAAGAGAATAATAATGAGTTGAATAAAGAAAAAAAAATGAATTAAGGTCAAAAAGAGCTACCCCTGTTGTGATCAGTTAGGCCAACCAGAGAAAACTCAGGTTAAGGGTCCCTGCAATAGGCGTGTGAAAGCTAGTGACCACTACTTTTGATTATGAAAAAAACTCAAAGTTTGGGGGTTGTATACCCCGTTTAGGCACCCTAGTGGCATATTACCATCTTGAGTGTGGAGAGGCTTGATGATTGGGACTTAGGACCAGAGTAGGGGGAAAACGAGTGAAGGGCTGCATAAAACTGTTCCGGGCTTAAAGGACTGAGATAGAGAGGACTAGTCACACACACACGTGAGAGATGAATGAAATAAATGAGGCTTATCATGATGAATGTGTTTAATCATTAATGTTTCAGTGTGATTTATCTGTGATTTGGGATTCCTGAGGCAAGAGTCATTAGCAACACTTCCTCAAGTATTTTTTTGTAGACAGATTATTTTGATTCTTACTTGGCTTGATTCTTACTTTGCTCGGGACTAGCAAATGCTTAAGTTTGGGGGTATTTGATAGCGCAATTTATTGTGCTATATTTTAGATAATTTTACCGTTAGTCTTGAGTATTATTTGGGCATTTTTATCTCTATTTGGTTTTGTTTTGTAGGAATCAAGTATTTCAATGGAA
ATTGACGTTTTTAACCACTACCGATGAATTTAGAGCTAAAGAATGAGTTCGGGCCAAGAAAATAGAAAAGGAAGATAAATTAAAGGCCTAGCGTTGAGACGCTAGCCCTTGGAGCGTCTCAACGCTGGGCGCAAGTCAGGAGGCGCGCGCATCGGGACAGCGTCGAGACGCTACCTCCACAGCGTCTCGACGCTGCGAACTTTCCAGATCTAATTAGGGCATACGCGCGACAGCGTCGCGACGCTATGTCCTCAGCGTCTCGACGCTGTCACGCAGATTCGACAGTATTTAAAGGAAACGATGCTTTCAGAATGGGGAGCTGTTTTGGAGAGAGAAAAACGACTTCTAGAGAGAGAAAAACGACCAGAACAAGGCAGAGAGCGAGGGGGCTTGATTCGAGGAGCTTGGATCGAGAGGTTCGTGACGGGAACGCACGGGAATCACGGCCAACGAGTTTAATCTTTCTCCCTTATTTTTTGTATTGATTTGTAGCATGTTGAGACATATTTCGATTGTATTTTAGAGACCATGTGTAGCTAGATTTCGTTTCTTAGGGGCTTGATGTAGCCATTAGTTTGTATGGATTAATTTACGGCTTTTTAGGACGTTTCGTAACGCGGTTTATTTATGCAATTGTATGATTAATGCTTATAATCTCTTGATCACGGATTGTATGTTGTTGTTTCTGAACTAATCGCTCGGGAGAGGGTAGTTTAGATTAGACTTTCAATTGCATGACTTAGGTAATTCGTAAATCGGGAGATTAGGAATTGCTTGAGAAACCGTCATGAGGGTTGTTCTTATTGCTTAGGCTTATTATTGAAATTAGATAGGGATATCAATTTCTTAATTTGCTTAAAGGATTAATTGCACTCGGGAGAGGCTTTTAATTATTAATAATCCTCGGTTTATAAACTTGTTACGAATTTGTTCTTAATTATTCGTTTAGTTAATTCATGCATCTAATAGGTGAAATCAATTCCCTAAGGCCTTTCTCCATTGAGTTTCCACATTTTATTGCTTTTATTAGTTTTTAATTGTTTTGCATATCTCTTGTTTATTTTTCTCTAAATAAAATTGGGCAAATATACCTTTAATAATTAGGGGAATTAGAAAACCAATCCAGAGGACGACATTCGATTTTAAATCACTATATTATAACTTGGGCCCGTACACTTGCGGTTAGCAATTTGCGTATCAAGTCCCCCATTTTTTATTGCTTTAGATTTTAAAACAAACTGGTACAAAATCCCTTGATTATCGGATACAGAATATGTTTGGCCAAATATACGGTTTACATATTTACCACCATTTCCGCAGAGGTATGCCAAGGATAAGCACGTGTCAACTAGAGCGAGCGATCGTTTGAAGGCTGCGGGAGTAACGCCAGGAAGAAAACCCCCGGAACAAACGTCCCCAATCACATTGGGGAGCGAACAGGACTCTGAAGAAGCCATGAGTACAACAGTTTCAGTCACTAAGGGATCCGACGAAAAGACGAAAGGGGTAAAAAGGGACAGAGACGATGGAGGTCCGAGCAAAAAGTAATTCCATCAAAGAAAACAAAAGTTCGCAACCGGACCAAGAAGACCAAAGATGAGGTAAACAAAGTTAAATGCCCCTCACTTTAGATAGAAAGTTAAAAATGTTCATTCTCATTTGTTCATAATTTGCAAGATTGAGAAACCCACTGAGGCACGAAGCAATAAGAAGACAAAGCGGTCGAAACAGACTAAAAACACAGATAAGGCGAGTCATGTGACACCAGAGGTAATCCCCTTGAAAATTAACAACCCCCTATATTTTACAACGCATTTTTTATTAGGGTATATATGTCTATATACGCTTTCCAACTAATTATACTGGGAAGGTTGCCCCTGAAACAAGCGAGGACACCGCTAAACATGACACCGAAGACACCGAATCTGATAGTGTGACGAATGAAAACTCCACGAGTGATGACGGGGAAGAACAAGGGAAAAAGAAGGCATCACTTGCTA
AAAAGGTATAAAGTCGAACCACGTCCAAACTTGATAACCAACAACTTTAGAAATTGTTATAAGTTGTTTCTTTTTTTGTGCATATCATTTAGGAAGCTCCTAAAAAAAAGAAGGGTGGAAAAAAGGGGAAAAAAGCTGAAGACCATGGTTGAAGAAGGTGACACCGTCCGAGTGGTATGTACTAGTATATACATATTATTTACTCGCATTTATGTTGCACTAGTGTTGTGAATCTCCATAGATATCGTTTATATATATGATAATCCCTTTCTTATTCGTTTGTGGCATTGATACATGCATGTCTGTACTGGTATACACGGGGTTTAGACTTGTTACCGATGTCCTAATACTTTGTTTCGTGACTATACTAGCAATGTAAATACATGCATGTTTGTACCGTACATGCGTGTACATGCATGTCTGTGCTAGTATACATGAGTTTCAGTTGTTCATACTGTTTCGTGTCTGTCCTAACATTTATGTTGTACTAGTGCTGTGAATTTCAGTTGTATGTTCGTGTCTGTCCTAACACTTATTTGTCTGTGGCACTGATACATGCATGTATTTAGTGATATATACTCTGCTACATGCCTGTTTCGTTATTGACTTGTACCGTGTATCCTAACGCAGGACGATGATTACTTTATGTCACCATCGAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACAGAAATAATGGACACCATCAACAACATCCTAGGAGATAGGTGCAGAGAAGCTTTCAGAAACACGTGCTTCGGCCACCTACTTGACTTCACGTTCAAAAAGACGTCTTCCCAGTTACTATTGCACTTGATCCAGCATCAGTGCAAACCCAACTTTACTTCAAGATTGGAGGGAAAATCTTAA
mRNA sequence
ATGCGAAAGGATAAGGACGCTCTGCTAGTGCCCTTTGATTTTGAAATTGAAAAGACTTGCAAAAAGAACAGGAAAGAGAAGAGGGAAAGACTTGCATCAATGGCTAATCCAAATCCACAAGATGAGCAAAAGCCGATACGGGACTATTTCCAGCCTACTTTTCCTGATCAACAATCTGGGATAGTCTACGCGCCGATTAATGCAAATAATTTTGAGCTGAAAACTGGCCTCATCCAGATGGCTAGAGATAGTGCATTTAGAGGATTCCCCTCTGAGGATCCTAATTCTCATTTAAAATCTTTTCTTGACATCTGTGGGACTGTGAAGTTGAATGGTGTCTCTGAAGATGCCATACGCTTACGATTGTTTCCTTTCTCTTTGCAGGACAAAGCTAGAGACTGGCTCAGATCACTTCCATCTGGAAGCATAACTACGTGGGACGCGTTGGTTCAAGCTTTTCTTGCCAAATATTTCTCACCTGCAAAGACAGTCAAGCTTAGGACAGAGATTGGGACATTCCAGCAGCTAGGCGATGAACAACTATTTGAGGCTTGGGAGCGCTATAAGGAGTTACTGAGGAAATGCCCTCAGCATGGATACCCAGATTGGCTGCAGATCCAGTTGTTTTATAATGGTCTAAATCCAAATACTAAAACCATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACTGTAGAGAATGCAAGAACTTTACTTGAGGATATGGCCACGAACAGCTACCAGTGGCCAACTGAGCGGTCTGCACCTAAGAAAATTGCAGCTGGGATTTATGAGATCGATAATGTAAGTTCGCTTCAAGCCCAGATGACCTCCCTTGCTAATGCTTTCATGAAATTCTCAGGTACAGGGAGTGCACAATCTATTGAGTCAGCTGCTCTTGCATCTCAGTGTCAGGAGGAGACCACTACTGAACAGGTTCATTATGTTGAAAGAAATTCTAACTATAGGGGACACCATAATTCTACACCCACACACTACCACCCTAATGTTAGAAATCATGAAAATTTCTCATATGCTAACACTAAGAATATATTGAACCCTCCTGGTTATGCATCTCAAAAGGTTGAAAATAAGCCTTCTCTTGAAGATATAGTTGGAGCTTTTATTGCAGAATCGAGCAAAAGGACCAACAAGTTGGAGGAAGCAGTGATTGCAATAAATACCACTGTGACAGGCCACGGAGCAGCAATTAAAAACATCGAAACTCAATTGGGTCAACTGGTGAATGCTGTAAGCAGTATGCAGAAAGGTAAAACCACAGCTGAACCAGAGAAAACCCAAATGGAGTATTGCAAGGCCATTACCGTACACCAAGTGGAGGAAGTTCAAGTAGTTGATACACAGGAGATTCATGAGCCTGAAGTCACTAAGGAAGAAGTTGAAGAGGGTTCATCTTCAACCGAAGCTGAAAAACTCACTTCTGACCCTCTTATTCCTTCACCTACTGTTTTGGTTCCAAAGCCCAAGAAAAAGAAGAAAAAGAATTACTCAACTCAATTCAAAAAGTTTCTTGATATTTTTATGAGTTTAAATATTAATTTACCATTTGCAGAGGCTTTGGAGCAGATGCCCAAATATGTACAATTTATGAAGGAATGGCTTTCGAGGAAAAAGAAGGAAAAGAAGTTGAGACAATATTCCTTACCTCTACATGCAGTGCCCGACTTCAAAAGAATGTGCCTGACAAACTTGCTGATCCAGGGAGTTTTTCTTTTCCCTGCAATTTTGTTAAACATAGGAGAGATTAAACCTACTGCAGTAAAACTCCAGTTAGCTGACCAATCTGTAGTTAGTCCGTATGGAGTAGTAGAGAATGTATTGATTCAAGTAGGGAAATTTTTTCTGCCAGTTGACTTCTTTGTGATGGATGTAAAGGAGAACCCTGTAGTGCCTGTGATTTTAGGGAGACCATTCCTCGCAACTGGGCGAGTTATCATTGACATTGAGCGTAGGGAGCTGACTGTGCGAGTCTT
ACATGAAAAAGAAATTTTTAAAGCAGTTGGAGCCTCTACAAATTGCTCTGAAGTTATGATGTTTCGAAGTGTTTGTGCCGTGGGTTGCTCGATTTTCGGCCGGTTGCAGCGTTTAGTTAGTGGAGATTGTCTAGCATTTGAAGCCTCTTCTCGCCGCCAGTTGAAGTTTTTTGGTCGAGTTGCAGCCGTTGTATGCTCGAGTTCGCCGTGGGTTTCGAGTCCCTCAAGTTTGGGCGAGCTTTTGGGTTGTTTTCTTGCGTTTTGTCCGAGTTCGCCGTGGGTTGCAAGTTTCGAGATTTTCAGCAAGTTTCCGGCGATTCTTCAGCTTGTTTTAGCCGTGGCTTTTAGTACCTCTCCATTAGTTACAATGCATAAAAGATCTAGGAGACATGCTGCCTCTTCTTCCTCCTCCCCTCCACCACCTTTTGACCAAGATAGGTTCGTTAGTGCTGAAGCCTCAGCGAGGTTTGATAAATATGTGGAGGGTAGAAATTTTATTCCTGAGCGTGGTTTTAGCCCCGACCCCGAGATGCAACCAAACCTTGTTAATAACATTGTTGCGCGGGGTTGGGGCGACTTTGTTCATCATCCTGCACCTGGGGTAGCAACTATAGTGAGGGAGTTCTATGCCAACATGGAAACCTCCTCTTCATCATCCTTTGTTCGAGGACACCGTGTCCTCTACGACCCCCTCACCATAAATAGGTTTTATAGTTTGCCCAACTTTGACAGGGATGAATATAGTACCTATCTTCATGGCCACTTGGATGTAAACGAAGTCATTCAGACTATTTGTAGGCCAGGGGCAGAATGGATTATGACTGGCGCAGAGGTGGTGAGATTCAAAACCACTGATTTGTTCGTAGATTACAGAGCGTGGCATACCTTCCTCTGTGCCAAGTTGATGCCTGTGGCGCATCTTAGCGATGTTACCAAGAGTCGTGCCATCCTGCTTTTCGCTATCGCCACAGGTCGCTCCGTCAATGTCGGTCAGGTCATCCATCAGTCTATGAACCATATTCGCCGACGTTACACGACAGTGGGGCTCGGGCATCCTTCACTGATCACAGCCCTTTGCCGAGCTGCTGGTTTCGTGTGGGACGCCCAAGAGGAGTTGGTTCATCCTGGAGCGCTAATCGACAAGAACTTCATCAGTCGCTACAGAGGACCTGGACCACAGGGCGCACAACCACCCCTCACTATCCATGCACCGCCACAGCATCATGAGCAGCACGAGCAGCCTGCAGAGCCTGAGCAGCAGGAGCAGGAGATTCCACATCCCTCCATCGAGGAGCAGCTGCAGCAGCTGCGTATGGAGTTTCAGAGCCATCGCCTAGACTTCCAGACTCTCCAGCAGGGCATCCAGAGCCAGCAGCGAGAGCACCAGAGGAGAGGCGTAGAGATCGTCGCCATTTCCTCTACTCCACGAGCATGCATGCCCACACCTATCAGTGTCAGGAGGCGCGCGCATCGGGACAGCGTCGAGACGCTACCTCCACAGCGTCTCGACGCTGCGAACTTTCCAGATCTAATTAGGGCATACGCGCGACAGCGTCGCGACGCTATGTCCTCAGCGTCTCGACGCTGTCACGCAGATTCGACAGTATTTAAAGGAAACGATGCTTTCAGAATGGGGAGCTGTTTTGGAGAGAGAAAAACGACTTCTAGAGAGAGAAAAACGACCAGAACAAGGCAGAGAGCGAGGGGGCTTGATTCGAGGAGCTTGGATCGAGAGAGGTATGCCAAGGATAAGCACGTGTCAACTAGAGCGAGCGATCGTTTGAAGGCTGCGGGAGTAACGCCAGGAAGAAAACCCCCGGAACAAACGTCCCCAATCACATTGGGGAGCGAACAGGACTCTGAAGAAGCCATGAGTACAACAGTTTCAGTCACTAAGGGATCCGACGAAAAGACGAAAGGGATTGAGAAACCCACTGAGGCACGAAGCAATAAGAAGACAAAGCGGTCGAAACAGACTAAAAACACAGATAAGGCGAGTC
ATGTGACACCAGAGGTTGCCCCTGAAACAAGCGAGGACACCGCTAAACATGACACCGAAGACACCGAATCTGATAGTGTGACGAATGAAAACTCCACGAGTGATGACGGGGAAGAACAAGGGAAAAAGAAGGCATCACTTGCTAAAAAGGACGATGATTACTTTATGTCACCATCGAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACAGAAATAATGGACACCATCAACAACATCCTAGGAGATAGGTGCAGAGAAGCTTTCAGAAACACGTGCTTCGGCCACCTACTTGACTTCACGTTCAAAAAGACGTCTTCCCAGTTACTATTGCACTTGATCCAGCATCAGTGCAAACCCAACTTTACTTCAAGATTGGAGGGAAAATCTTAA
Coding sequence (CDS)
ATGCGAAAGGATAAGGACGCTCTGCTAGTGCCCTTTGATTTTGAAATTGAAAAGACTTGCAAAAAGAACAGGAAAGAGAAGAGGGAAAGACTTGCATCAATGGCTAATCCAAATCCACAAGATGAGCAAAAGCCGATACGGGACTATTTCCAGCCTACTTTTCCTGATCAACAATCTGGGATAGTCTACGCGCCGATTAATGCAAATAATTTTGAGCTGAAAACTGGCCTCATCCAGATGGCTAGAGATAGTGCATTTAGAGGATTCCCCTCTGAGGATCCTAATTCTCATTTAAAATCTTTTCTTGACATCTGTGGGACTGTGAAGTTGAATGGTGTCTCTGAAGATGCCATACGCTTACGATTGTTTCCTTTCTCTTTGCAGGACAAAGCTAGAGACTGGCTCAGATCACTTCCATCTGGAAGCATAACTACGTGGGACGCGTTGGTTCAAGCTTTTCTTGCCAAATATTTCTCACCTGCAAAGACAGTCAAGCTTAGGACAGAGATTGGGACATTCCAGCAGCTAGGCGATGAACAACTATTTGAGGCTTGGGAGCGCTATAAGGAGTTACTGAGGAAATGCCCTCAGCATGGATACCCAGATTGGCTGCAGATCCAGTTGTTTTATAATGGTCTAAATCCAAATACTAAAACCATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACTGTAGAGAATGCAAGAACTTTACTTGAGGATATGGCCACGAACAGCTACCAGTGGCCAACTGAGCGGTCTGCACCTAAGAAAATTGCAGCTGGGATTTATGAGATCGATAATGTAAGTTCGCTTCAAGCCCAGATGACCTCCCTTGCTAATGCTTTCATGAAATTCTCAGGTACAGGGAGTGCACAATCTATTGAGTCAGCTGCTCTTGCATCTCAGTGTCAGGAGGAGACCACTACTGAACAGGTTCATTATGTTGAAAGAAATTCTAACTATAGGGGACACCATAATTCTACACCCACACACTACCACCCTAATGTTAGAAATCATGAAAATTTCTCATATGCTAACACTAAGAATATATTGAACCCTCCTGGTTATGCATCTCAAAAGGTTGAAAATAAGCCTTCTCTTGAAGATATAGTTGGAGCTTTTATTGCAGAATCGAGCAAAAGGACCAACAAGTTGGAGGAAGCAGTGATTGCAATAAATACCACTGTGACAGGCCACGGAGCAGCAATTAAAAACATCGAAACTCAATTGGGTCAACTGGTGAATGCTGTAAGCAGTATGCAGAAAGGTAAAACCACAGCTGAACCAGAGAAAACCCAAATGGAGTATTGCAAGGCCATTACCGTACACCAAGTGGAGGAAGTTCAAGTAGTTGATACACAGGAGATTCATGAGCCTGAAGTCACTAAGGAAGAAGTTGAAGAGGGTTCATCTTCAACCGAAGCTGAAAAACTCACTTCTGACCCTCTTATTCCTTCACCTACTGTTTTGGTTCCAAAGCCCAAGAAAAAGAAGAAAAAGAATTACTCAACTCAATTCAAAAAGTTTCTTGATATTTTTATGAGTTTAAATATTAATTTACCATTTGCAGAGGCTTTGGAGCAGATGCCCAAATATGTACAATTTATGAAGGAATGGCTTTCGAGGAAAAAGAAGGAAAAGAAGTTGAGACAATATTCCTTACCTCTACATGCAGTGCCCGACTTCAAAAGAATGTGCCTGACAAACTTGCTGATCCAGGGAGTTTTTCTTTTCCCTGCAATTTTGTTAAACATAGGAGAGATTAAACCTACTGCAGTAAAACTCCAGTTAGCTGACCAATCTGTAGTTAGTCCGTATGGAGTAGTAGAGAATGTATTGATTCAAGTAGGGAAATTTTTTCTGCCAGTTGACTTCTTTGTGATGGATGTAAAGGAGAACCCTGTAGTGCCTGTGATTTTAGGGAGACCATTCCTCGCAACTGGGCGAGTTATCATTGACATTGAGCGTAGGGAGCTGACTGTGCGAGTCTT
ACATGAAAAAGAAATTTTTAAAGCAGTTGGAGCCTCTACAAATTGCTCTGAAGTTATGATGTTTCGAAGTGTTTGTGCCGTGGGTTGCTCGATTTTCGGCCGGTTGCAGCGTTTAGTTAGTGGAGATTGTCTAGCATTTGAAGCCTCTTCTCGCCGCCAGTTGAAGTTTTTTGGTCGAGTTGCAGCCGTTGTATGCTCGAGTTCGCCGTGGGTTTCGAGTCCCTCAAGTTTGGGCGAGCTTTTGGGTTGTTTTCTTGCGTTTTGTCCGAGTTCGCCGTGGGTTGCAAGTTTCGAGATTTTCAGCAAGTTTCCGGCGATTCTTCAGCTTGTTTTAGCCGTGGCTTTTAGTACCTCTCCATTAGTTACAATGCATAAAAGATCTAGGAGACATGCTGCCTCTTCTTCCTCCTCCCCTCCACCACCTTTTGACCAAGATAGGTTCGTTAGTGCTGAAGCCTCAGCGAGGTTTGATAAATATGTGGAGGGTAGAAATTTTATTCCTGAGCGTGGTTTTAGCCCCGACCCCGAGATGCAACCAAACCTTGTTAATAACATTGTTGCGCGGGGTTGGGGCGACTTTGTTCATCATCCTGCACCTGGGGTAGCAACTATAGTGAGGGAGTTCTATGCCAACATGGAAACCTCCTCTTCATCATCCTTTGTTCGAGGACACCGTGTCCTCTACGACCCCCTCACCATAAATAGGTTTTATAGTTTGCCCAACTTTGACAGGGATGAATATAGTACCTATCTTCATGGCCACTTGGATGTAAACGAAGTCATTCAGACTATTTGTAGGCCAGGGGCAGAATGGATTATGACTGGCGCAGAGGTGGTGAGATTCAAAACCACTGATTTGTTCGTAGATTACAGAGCGTGGCATACCTTCCTCTGTGCCAAGTTGATGCCTGTGGCGCATCTTAGCGATGTTACCAAGAGTCGTGCCATCCTGCTTTTCGCTATCGCCACAGGTCGCTCCGTCAATGTCGGTCAGGTCATCCATCAGTCTATGAACCATATTCGCCGACGTTACACGACAGTGGGGCTCGGGCATCCTTCACTGATCACAGCCCTTTGCCGAGCTGCTGGTTTCGTGTGGGACGCCCAAGAGGAGTTGGTTCATCCTGGAGCGCTAATCGACAAGAACTTCATCAGTCGCTACAGAGGACCTGGACCACAGGGCGCACAACCACCCCTCACTATCCATGCACCGCCACAGCATCATGAGCAGCACGAGCAGCCTGCAGAGCCTGAGCAGCAGGAGCAGGAGATTCCACATCCCTCCATCGAGGAGCAGCTGCAGCAGCTGCGTATGGAGTTTCAGAGCCATCGCCTAGACTTCCAGACTCTCCAGCAGGGCATCCAGAGCCAGCAGCGAGAGCACCAGAGGAGAGGCGTAGAGATCGTCGCCATTTCCTCTACTCCACGAGCATGCATGCCCACACCTATCAGTGTCAGGAGGCGCGCGCATCGGGACAGCGTCGAGACGCTACCTCCACAGCGTCTCGACGCTGCGAACTTTCCAGATCTAATTAGGGCATACGCGCGACAGCGTCGCGACGCTATGTCCTCAGCGTCTCGACGCTGTCACGCAGATTCGACAGTATTTAAAGGAAACGATGCTTTCAGAATGGGGAGCTGTTTTGGAGAGAGAAAAACGACTTCTAGAGAGAGAAAAACGACCAGAACAAGGCAGAGAGCGAGGGGGCTTGATTCGAGGAGCTTGGATCGAGAGAGGTATGCCAAGGATAAGCACGTGTCAACTAGAGCGAGCGATCGTTTGAAGGCTGCGGGAGTAACGCCAGGAAGAAAACCCCCGGAACAAACGTCCCCAATCACATTGGGGAGCGAACAGGACTCTGAAGAAGCCATGAGTACAACAGTTTCAGTCACTAAGGGATCCGACGAAAAGACGAAAGGGATTGAGAAACCCACTGAGGCACGAAGCAATAAGAAGACAAAGCGGTCGAAACAGACTAAAAACACAGATAAGGCGAGTC
ATGTGACACCAGAGGTTGCCCCTGAAACAAGCGAGGACACCGCTAAACATGACACCGAAGACACCGAATCTGATAGTGTGACGAATGAAAACTCCACGAGTGATGACGGGGAAGAACAAGGGAAAAAGAAGGCATCACTTGCTAAAAAGGACGATGATTACTTTATGTCACCATCGAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTGCAGAACAGAAATAATGGACACCATCAACAACATCCTAGGAGATAGGTGCAGAGAAGCTTTCAGAAACACGTGCTTCGGCCACCTACTTGACTTCACGTTCAAAAAGACGTCTTCCCAGTTACTATTGCACTTGATCCAGCATCAGTGCAAACCCAACTTTACTTCAAGATTGGAGGGAAAATCTTAA
Protein sequence
MRKDKDALLVPFDFEIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESAALASQCQEETTTEQVHYVERNSNYRGHHNSTPTHYHPNVRNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVDTQEIHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQYSLPLHAVPDFKRMCLTNLLIQGVFLFPAILLNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIFKAVGASTNCSEVMMFRSVCAVGCSIFGRLQRLVSGDCLAFEASSRRQLKFFGRVAAVVCSSSPWVSSPSSLGELLGCFLAFCPSSPWVASFEIFSKFPAILQLVLAVAFSTSPLVTMHKRSRRHAASSSSSPPPPFDQDRFVSAEASARFDKYVEGRNFIPERGFSPDPEMQPNLVNNIVARGWGDFVHHPAPGVATIVREFYANMETSSSSSFVRGHRVLYDPLTINRFYSLPNFDRDEYSTYLHGHLDVNEVIQTICRPGAEWIMTGAEVVRFKTTDLFVDYRAWHTFLCAKLMPVAHLSDVTKSRAILLFAIATGRSVNVGQVIHQSMNHIRRRYTTVGLGHPSLITALCRAAGFVWDAQEELVHPGALIDKNFISRYRGPGPQGAQPPLTIHAPPQHHEQHEQPAEPEQQEQEIPHPSIEEQLQQLRMEFQSHRLDFQTLQQGIQSQQREHQRRGVEIVAISSTPRACMPTPISVRRRAHRDSVETLPPQRLDAANFPDLIRAYARQRRDAMSSASRRCHADSTVFKGNDAFRMGSCFGERKTTSRERKTTRTRQRARGLDSRSLDRERYAKDKHVSTRASDRLKAAGVTPGRKPPEQTSPITLGSEQDSEEAMSTTVSVTKGSDEKTKGIEKPTEARSNKKTKRSKQTKNTDKASHVTPEVAPETSEDTAKHDTEDTESDSVTNENSTSDDGEEQGKKKASLAKKDDDYFMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPNFTSRLEGKS
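The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1) applies to this feature. The function name `translate` and the table layout are illustrative and are not taken from the database.

```python
# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace (for example line breaks inside a pasted sequence) is ignored.
    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon: end of the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGCGAAAGGATAAGGAC"))  # MRKDKD
```

Applied to the last four codons of the CDS (`GGAAAATCTTAA`), the same function yields `GKS`, matching the end of the protein sequence, with `TAA` read as the stop codon.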
Homology
BLAST of Lag0029754 vs. NCBI nr
Match:
KAG7990634.1 (hypothetical protein I3843_02G035100 [Carya illinoinensis])
HSP 1 Score: 562.8 bits (1449), Expect = 8.9e-156
Identity = 321/722 (44.46%), Positives = 454/722 (62.88%), Query Frame = 0
Query: 1 MRKDKDALLVPFDFEIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDYFQPTFPDQQSG 60
MR+ + ++P D EIE+T R +R ++ +MA + + + ++DY +P S
Sbjct: 1 MRRARSRDIIPVDPEIERTL---RSLRRNKILAMAEEDREVLPRTLKDYVRPVVNGNYSS 60
Query: 61 IVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRL 120
I+ PINANNFELK LI M + + F G P +DPN HL FL+IC TVK+NGV+ED IRL
Sbjct: 61 IMRQPINANNFELKPALISMVQQAQFSGSPLDDPNIHLAMFLEICDTVKINGVTEDTIRL 120
Query: 121 RLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQ 180
RLFPFSL+DKAR WL+SL GSI +W + + FLAK+F PAKT +LR+EIG F+Q E
Sbjct: 121 RLFPFSLRDKARGWLQSLQPGSIVSWQDMAERFLAKFFPPAKTAQLRSEIGQFKQNDFES 180
Query: 181 LFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTL 240
L+EAWERYK+L+R+CPQHG PDWLQ+Q+FYNGLN T+TIVDAA+GGTL+SKT E A L
Sbjct: 181 LYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGLNGQTRTIVDAASGGTLMSKTAEGATAL 240
Query: 241 LEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQSIESA 300
LE+MA+N+YQWPTER+ KK+ AGI++++ +++L AQ+ +L++ + QS E
Sbjct: 241 LEEMASNNYQWPTERTLAKKV-AGIHDLEPIAALSAQVATLSHQISALTTQRIPQSTEYL 300
Query: 301 ALASQC--QEETTTEQVHYV-ERNSNYRGHHNSTPTHYHPNVRNHENFSYANTKNIL--- 360
A S E + EQV YV RN NYRG N P +YHP +RNHEN SY NTKN+L
Sbjct: 301 ASTSMIVPSNEASQEQVQYVNNRNYNYRG--NPMPNYYHPGLRNHENLSYGNTKNVLQPQ 360
Query: 361 NPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIETQLG 420
+PPG+ SQ E K SLED + +F+ E++ R K + + I T + GAAIKNIE Q+G
Sbjct: 361 HPPGFDSQPSERKMSLEDAMVSFVQETNARFKKTDSRLDNIETHCSNMGAAIKNIEVQIG 420
Query: 421 QLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVDTQE---------------- 480
QL +++ Q+G + E E CKAIT+ +E++ +E
Sbjct: 421 QLATTINAQQRGAFPSNTEVNPKEQCKAITLRSGKEIERSPLKESKSTPTAVNIGQSKNK 480
Query: 481 IHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKFLDIFMS 540
+ E E+ + +EE + + P++ P +P P++ +K+ QF KFLDIF
Sbjct: 481 VEEDEIVNDTLEETDFAPTISFPDNPPILAPP---LPYPQRFQKQKLDKQFSKFLDIFKK 540
Query: 541 LNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKLRQ------- 600
++IN+PFA+ALEQMP YV+F+K+ +S+K++ +KKL Q
Sbjct: 541 IHINIPFADALEQMPNYVKFLKDIISKKRRLEEFETVKLSEECSAILQKKLPQKLKDPGS 600
Query: 601 YSLPLHAVPDF--KRMCLTNLLIQGVFLFPAILLNIGEIKPTAVKLQLADQSVVSPYGVV 660
++LP F K +C I + L L + E+KPT + LQLAD+S+ P G++
Sbjct: 601 FTLPCTIGDSFFDKVLCDLGASINLMPLSVCRKLGLEEMKPTTISLQLADRSIKYPRGII 660
Query: 661 ENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEI 675
E+VL++V KF P DF V+D++E+ VP+ILGRPFLATGR +ID+++ ELT+RV E+ +
Sbjct: 661 EDVLVKVDKFIFPADFVVLDMEEDEEVPLILGRPFLATGRALIDVQKGELTLRVNKEEVL 713
BLAST of Lag0029754 vs. NCBI nr
Match:
KAG6734747.1 (hypothetical protein I3842_01G285500 [Carya illinoinensis])
HSP 1 Score: 544.3 bits (1401), Expect = 3.3e-150
Identity = 307/688 (44.62%), Positives = 432/688 (62.79%), Query Frame = 0
Query: 34 MANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSED 93
MA + + + ++DY +P S I+ PINANNFELK LI M + + F G P +D
Sbjct: 1 MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60
Query: 94 PNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAF 153
PN HL FL+IC TVK+NGV+ED IRLRLFPFSL+DKAR WL+SL GSI +W + + F
Sbjct: 61 PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120
Query: 154 LAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGL 213
LAK+F PAKT +LR+EIG F+Q E L+EAWERYK+L+R+CPQHG PDWLQ+Q+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180
Query: 214 NPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSS 273
N T+TIVDAA+GGTL+SKT E A LLE+MA+N+YQWPTER+ KK+ AGI+E++ +++
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240
Query: 274 LQAQMTSLANAFMKFSGTGSAQSIESAALASQC--QEETTTEQVHYV-ERNSNYRGHHNS 333
L AQ+ +L++ + QS E A S E + EQV YV RN NYRG N
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRNYNYRG--NP 300
Query: 334 TPTHYHPNVRNHENFSYANTKNIL---NPPGYASQKVENKPSLEDIVGAFIAESSKRTNK 393
P +YHP +RNHEN SY NTKN+L +PPG+ SQ E K SLED + +F+ E++ R K
Sbjct: 301 MPNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKK 360
Query: 394 LEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQ 453
+ + I T + GA +KN+E Q+GQL +++ Q+G + E E CKAIT+
Sbjct: 361 TDSRLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGAFPSNTEVNPKEQCKAITLRS 420
Query: 454 VEEVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPT 513
+E++ +E + E E+ + +EE + P++ P
Sbjct: 421 GKEIERAPLKESKSTPTAANNGQSKDQVEEEEIVNDTLEETDLPPTISFPDNPPILAPP- 480
Query: 514 VLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK--- 573
+P P++ +K+ QF KFLDIF ++IN+PFA+ALEQMP YV+F+K+ +S+K++
Sbjct: 481 --LPYPQRFQKQKLDKQFSKFLDIFKKIHINIPFADALEQMPNYVKFLKDIISKKRRLEE 540
Query: 574 --------------EKKLRQ-------YSLPLHAVPDF--KRMCLTNLLIQGVFLFPAIL 633
+KKL Q ++LP F + +C I + F
Sbjct: 541 FETVKLSEECSAILQKKLPQKLKDPESFTLPCTIGDSFFDRVLCDLGASINLMPFFVCRK 600
Query: 634 LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGR 674
L +GE+K T + LQLAD+S+ P G++E+VL++V KF P DF V+D++E+ VP+ILGR
Sbjct: 601 LGLGEMKHTTISLQLADRSIKYPRGIIEDVLVKVDKFIFPADFVVLDMEEDEDVPLILGR 660
BLAST of Lag0029754 vs. NCBI nr
Match:
XP_022843226.1 (uncharacterized protein LOC111366761 [Olea europaea var. sylvestris])
HSP 1 Score: 543.1 bits (1398), Expect = 7.3e-150
Identity = 317/703 (45.09%), Positives = 436/703 (62.02%), Query Frame = 0
Query: 1 MRKDKDALLVPFDFEIEKTCKKNRKEKRERLASMAN-----PNPQDEQKPIRDYFQPTFP 60
MR+ ++ L+ D E E+T + R +R +MA N ++Q+ IRDY +P
Sbjct: 94 MRRARNLDLLHVDPEPERTFRILRGIQRNEREAMAEQDVRAANEDNQQRAIRDYIRPVVN 153
Query: 61 DQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNGVSE 120
D SGI I A NFELK GLI M + + F G EDPN+HL SFL+IC TVK+NGV+E
Sbjct: 154 DNYSGIARPAIVAKNFELKPGLIDMVQQNQFGGAAVEDPNAHLGSFLEICDTVKMNGVTE 213
Query: 121 DAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQ 180
DAIRLRLF FSL+DKA+ W +SLP GSITTWD L Q FL KYF P+K+ +LR EI F+Q
Sbjct: 214 DAIRLRLFSFSLRDKAKAWFQSLPYGSITTWDDLAQKFLTKYFPPSKSAQLRGEISQFKQ 273
Query: 181 LGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVE 240
L E +EAWER+K+LLR+CPQHG+ W+QI++FYNGLN T+T+VDAAAGG L++KT E
Sbjct: 274 LDFEPFYEAWERFKDLLRRCPQHGFQKWVQIEIFYNGLNGQTRTMVDAAAGGILMAKTAE 333
Query: 241 NARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTGSAQ 300
A LL+D+ATNSYQWP+ERS KK+ AG++E+D +++L AQ+ SL N + + G+ Q
Sbjct: 334 AAYALLDDIATNSYQWPSERSGVKKV-AGLHEVDPITALAAQVASLTNQIVMLTTQGNQQ 393
Query: 301 SIESAALASQCQEET--TTEQVHYVE-RNSNYRGHHNSTPTHYHPNVRNHENFSYANTKN 360
+++S S +ET EQV Y++ RN N RG + + HYHP +RNHEN SY N +N
Sbjct: 394 NVDSVISTSSSHQETEVANEQVQYIDSRNYNQRGGYQA--NHYHPGLRNHENLSYGNNRN 453
Query: 361 ILN-PPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTVTGHGAAIKNIET 420
L PPG+ +Q + KP LEDI+G FI+E+ R NK E + I T V+ GA +KN+E
Sbjct: 454 TLQPPPGFNTQNSDGKPPLEDILGTFISETRSRFNKNELRLDNIETHVSKIGATMKNLEV 513
Query: 421 QLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVDTQEIHEPE---VTKEE 480
Q+GQL + S QKGK ++ E E+C AIT+ + V+ ++I P + +E
Sbjct: 514 QIGQLATLMKSQQKGKFPSDTEVNPREHCNAITLRSGKMVEESKPKKIMVPTPDVIVTDE 573
Query: 481 VEEGSSSTEAE-----KLTSDPLIPSPTVL---VPKPKKKKKKNYSTQFKKFLDIFMSLN 540
+ TEAE K S +P +L +P P++ KK + QF KFL++F ++
Sbjct: 574 RQSERQKTEAEGTKIYKPYSISFPDNPPILKPPLPFPQRFMKKKFDDQFAKFLEVFKKIH 633
Query: 541 INLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQYSLPLHAVPDFKRMCLTNLLIQGVFL 600
IN+PFAE L QMP Y +F+KE +S KKK ++ L D + L G F
Sbjct: 634 INIPFAETLAQMPNYAKFLKEVMSNKKKLEEFETIKL-TEGCSDILQKLPHKLKDPGSFN 693
Query: 601 FPAIL--------------------------LNIGEIKPTAVKLQLADQSVVSPYGVVEN 658
P + L +GE+KPT + LQLAD+S+ P G++E+
Sbjct: 694 IPCNIGGITFDRALCDFGASINLMPLSVFKKLGLGEVKPTTLTLQLADRSITYPKGMIED 753
BLAST of Lag0029754 vs. NCBI nr
Match:
KAG7947748.1 (hypothetical protein I3843_14G109500 [Carya illinoinensis])
HSP 1 Score: 542.3 bits (1396), Expect = 1.2e-149
Identity = 302/688 (43.90%), Positives = 426/688 (61.92%), Query Frame = 0
Query: 34 MANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSED 93
MA + + + ++DY +P S I+ PINANNFELK LI M + + F G P +D
Sbjct: 1 MAEEDREVLPRTLKDYVRPVVNGNYSSIMRQPINANNFELKPALISMVQQAQFSGSPLDD 60
Query: 94 PNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAF 153
PN HL FL+IC TVK+NGV+ED IRLRLFPFSL+DKAR WL+SL GSI +W + + F
Sbjct: 61 PNVHLAMFLEICDTVKINGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSIVSWQDMAERF 120
Query: 154 LAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGL 213
LAK+F PAKT +LR+EIG F+Q E L+EAWERYK+L+R+CPQHG PDWLQ+Q+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFKQNDFESLYEAWERYKDLIRRCPQHGLPDWLQVQMFYNGL 180
Query: 214 NPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSS 273
N T+TIVDAA+GGTL+SKT E A LLE+MA+N+YQWPTER+ KK+ AGI+E++ +++
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATALLEEMASNNYQWPTERTLAKKV-AGIHELEPIAA 240
Query: 274 LQAQMTSLANAFMKFSGTGSAQSIESAALASQC--QEETTTEQVHYV-ERNSNYRGHHNS 333
L AQ+ +L++ + QS E A S E + EQV YV RN NYRG N
Sbjct: 241 LSAQVATLSHQISALTTQRIPQSTEYVASTSMIVPSNEASQEQVQYVNNRNYNYRG--NP 300
Query: 334 TPTHYHPNVRNHENFSYANTKNIL---NPPGYASQKVENKPSLEDIVGAFIAESSKRTNK 393
P +YHP +RNHEN SY NTKN+L +PPG+ SQ E K SLED + +F+ E++ R K
Sbjct: 301 MPNYYHPGLRNHENLSYGNTKNVLQPQHPPGFDSQPSEKKMSLEDAMVSFVQETNARFKK 360
Query: 394 LEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQ 453
+ + I T + GA +KN+E Q+GQL +++ Q+G + E E CKAIT+
Sbjct: 361 TDSRLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGAFPSNTEVNPKEQCKAITLRS 420
Query: 454 VEEVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPT 513
+E++ +E + E E+ + +EE + P++ P
Sbjct: 421 GKEIERAPLKESKSTPTAANNGQSKDQVEEEEIVNDTLEETDLPPTISFPDNPPILAPP- 480
Query: 514 VLVPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKK 573
+P P++ +K+ QF KFLDIF ++IN+PFA+ALEQMP YV+F+K+ +S+K++ ++
Sbjct: 481 --LPYPQRFQKQKLDKQFSKFLDIFKKIHINIPFADALEQMPNYVKFLKDIISKKRRLEE 540
Query: 574 LRQYSLPLHAVPDFKRMCLTNLLIQGVFLFPAIL-------------------------- 633
L ++ L G F P +
Sbjct: 541 FETVKLSEECSAILQKKLPQKLKDPGSFTLPCTIGDSFFDRVLCDLGASINLMPFSVCRK 600
Query: 634 LNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGR 674
L +GE+K T + LQLAD+S+ P G++E+VL++V KF P DF V+D++E+ VP+ILGR
Sbjct: 601 LGLGEMKHTTISLQLADRSIKYPRGIIEDVLVKVDKFIFPADFVVLDMEEDEDVPLILGR 660
BLAST of Lag0029754 vs. NCBI nr
Match:
XP_023874613.1 (uncharacterized protein LOC111987139 [Quercus suber])
HSP 1 Score: 538.5 bits (1386), Expect = 1.8e-148
Identity = 305/686 (44.46%), Positives = 428/686 (62.39%), Query Frame = 0
Query: 34 MANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSED 93
MA + + ++DY +P D SGI INANNFELK LI M + + F G P +D
Sbjct: 1 MAEGEQNAQPRTLKDYVRPIVNDNYSGIRRQTINANNFELKPALISMVQQAQFSGSPLDD 60
Query: 94 PNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAF 153
PN HL FL+IC T+K+NGV+ED IRLRLFPFSL+DKAR WL+SL GSIT+W + + F
Sbjct: 61 PNIHLAMFLEICDTIKMNGVTEDTIRLRLFPFSLRDKARGWLQSLQPGSITSWQDMAEKF 120
Query: 154 LAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGL 213
LAK+F PAKT +LR+EIG F+Q E L+EAWERYK+L+R CPQHG PDWLQ+Q+FYNGL
Sbjct: 121 LAKFFPPAKTAQLRSEIGQFRQNDFESLYEAWERYKDLIRCCPQHGLPDWLQVQMFYNGL 180
Query: 214 NPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSS 273
N T+TIVDAA+GGTL+SKT E A +LLE+MA+N+YQWPTER+ KK+ AGI+E++ ++
Sbjct: 181 NGQTRTIVDAASGGTLMSKTAEGATSLLEEMASNNYQWPTERTMAKKV-AGIHELEPFAA 240
Query: 274 LQAQMTSLANAFMKFSGTGSAQSIESAALASQC--QEETTTEQVHYV-ERNSNYRGHHNS 333
L AQ+ SL++ + Q E A +S E + EQV Y+ RN NYRG N
Sbjct: 241 LSAQVASLSHQVSALTTQRIPQGAEYVAASSMTVPMNEASQEQVQYINNRNYNYRG--NP 300
Query: 334 TPTHYHPNVRNHENFSYANTKNILN-PPGYASQKVENKPSLEDIVGAFIAESSKRTNKLE 393
P +YHP +RNHENFSY NTKN+L PPG+ SQ E K SLED + +F+ E+ K +
Sbjct: 301 MPNYYHPGLRNHENFSYGNTKNVLQPPPGFDSQPSEKKMSLEDAMVSFVEETKATFKKSD 360
Query: 394 EAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVE 453
+ I T + GA +KN+E Q+GQL +++ Q+G + E E CKAIT+
Sbjct: 361 SQLDNIETHCSNMGATMKNLEVQIGQLATTINAQQRGTFPSNTEVNPKEQCKAITLRSGR 420
Query: 454 EVQVVDTQE----------------IHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVL 513
E++ ++E + E E+ ++ + E + P++ +P
Sbjct: 421 EIERSPSKETETTPTAPNNGQSKNKVEEEEIVEDTLRETDMPPSISFPDNPPILSTP--- 480
Query: 514 VPKPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK----- 573
+P P++ +K+ QF KFLDIF ++IN+PFA+ALEQMP Y +F+K+ +S+K++
Sbjct: 481 LPYPQRFQKQKLDKQFSKFLDIFKKIHINIPFADALEQMPNYAKFLKDIISKKRRLEEFE 540
Query: 574 ------------EKKLRQ-------YSLPLHAVPDF--KRMCLTNLLIQGVFLFPAILLN 633
+KKL Q ++LP F K +C I + L L
Sbjct: 541 TVKLSEECSAIIQKKLPQKLKDPGSFTLPCTIGNSFFDKVLCDLGASINLMPLSVYRKLG 600
Query: 634 IGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPF 674
+GE+K T + LQLAD+S+ P G++E+VL++V KF P DF V+D++E+ VP+ILGRPF
Sbjct: 601 LGEMKQTTISLQLADRSIKYPRGIIEDVLVKVDKFIFPADFVVLDMEEDQEVPLILGRPF 660
BLAST of Lag0029754 vs. ExPASy TrEMBL
Match:
A0A6J0ZX64 (LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica OX=108875 GN=LOC110412945 PE=4 SV=1)
HSP 1 Score: 466.8 bits (1200), Expect = 3.2e-127
Identity = 293/751 (39.01%), Positives = 423/751 (56.32%), Query Frame = 0
Query: 1 MRKDKDALLVPFDFEIEKTCKKNRKEK------RERLASMANPNPQ-------DEQKPIR 60
M++ + LVPFD +IE+T +++R+E + +A N N + + +R
Sbjct: 1 MQRRNNLNLVPFDPDIERTFRRHRRENLQVATLNQTMAEDNNNNGNNAINLVPEANRALR 60
Query: 61 DYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSA-FRGFPSEDPNSHLKSFLDICG 120
DY P I INANNFE+K IQM + S F G PS+DPNSHL +FL+IC
Sbjct: 61 DYVVPLVQGLHQSIRRPSINANNFEIKPAYIQMIQSSVQFSGLPSDDPNSHLVNFLEICD 120
Query: 121 TVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKL 180
T K NGV++DAIRLRLFPFSL+DKA+ WL SLP+GSITTW+ L Q FLAK+F PAKT K+
Sbjct: 121 TFKYNGVTDDAIRLRLFPFSLRDKAKSWLNSLPNGSITTWEDLAQKFLAKFFPPAKTAKM 180
Query: 181 RTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAG 240
R +I +F Q E L+EAWER+KELLR+CP HG PDWLQ+Q FYNGL + KTI+DAAAG
Sbjct: 181 RNDITSFIQFDGESLYEAWERFKELLRRCPHHGIPDWLQVQTFYNGLVGSIKTIIDAAAG 240
Query: 241 GTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFM 300
G L+SK +A LLE+MA+N+YQWP+ERS +K A G YEID + +L Q+ +L+
Sbjct: 241 GALMSKNAVDAYNLLEEMASNNYQWPSERSGSRK-AVGAYEIDALGTLTTQVAALSK--- 300
Query: 301 KFSGTGSAQSIESAALASQCQEETTTEQVHYVERNSNYRGHHNSTPTH-----YHPNVRN 360
K G S + C + + +Q Y + + G+ N + Y+P RN
Sbjct: 301 KLDTLGVHAVQNSLVVCEMCGDSHSYDQCPYNSESVQFVGNFNRQQNNPYSNTYNPGWRN 360
Query: 361 HENFSYANTKNILN-----PPGYASQK----VENKPSLEDIVGAFIAESSKRTNKLEEAV 420
H NFS++N N PPG+ Q E K LE+++ +I+++
Sbjct: 361 HPNFSWSNNAGPSNPKPIMPPGFQQQARPQIPEKKSQLEELLLQYISKT----------- 420
Query: 421 IAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQM-----EYCKAITVHQ 480
+ + GA+++N+ETQ+GQL N++++ +G P TQ+ E C+AIT+
Sbjct: 421 ---DAIIQSQGASLRNLETQVGQLANSINNRPQGSL---PSDTQINPKGKEQCQAITLRS 480
Query: 481 VEEVQVVDTQEIHE--PEVTKE-------EVEEGSSSTEAEKLTSDPLIPSPTVLVPKPK 540
+E++ V+ + + V KE E+++ + TS + P P P P+
Sbjct: 481 GKEIEGVNQKAVESEIEHVDKEGMCENEIEIQQKDDDKAENQGTSQVIHPPP----PFPQ 540
Query: 541 KKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQYSLP 600
+ +K+ QF+KFL++F L+IN+PFAEALEQMP YV+F+K+ LS+K+K + L
Sbjct: 541 RLQKQKLEKQFQKFLNVFKKLHINIPFAEALEQMPSYVKFLKDILSKKRKLGEFETVFLT 600
Query: 601 LHAVPDFKRMCLTNLLIQGVFLFPAIL--------------------------LNIGEIK 660
+ L G F P + L +GE K
Sbjct: 601 EECSAILQNKLPPKLKDPGSFTIPCTIGNLFFTKALSDLGASINLMPWSIFEKLGLGECK 660
Query: 661 PTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGR 684
PT+V LQLAD+S V P G++E+VL++V KF PVDF ++D++E+ +P+ILGRPFLAT
Sbjct: 661 PTSVTLQLADRSYVYPRGIIEDVLVKVDKFIFPVDFLILDMEEDRQIPIILGRPFLATAG 720
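The percentages in each HSP header are the ratio of matching columns to alignment length (for the hit above, 293/751 for identity and 423/751 for positives). A minimal sketch, assuming the header lines keep the exact layout shown on this page, that parses the two header lines and recomputes those percentages. The function name `parse_hsp_header` and the dictionary keys are illustrative only and are not part of any BLAST tool.

```python
import re

# Patterns follow the header layout shown on this page (an assumption).
# "Posi?tives" also accepts the spelling without the first "i" that some
# exports of this page contain.
HSP_SCORE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_COUNTS = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_header(score_line, counts_line):
    """Return the scores and recomputed percentages for one HSP header."""
    match = HSP_SCORE.search(score_line)
    if match is None:
        raise ValueError("not an HSP score line: %r" % score_line)
    hsp = {
        "bit_score": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": float(match.group(3)),
    }
    for label, hits, length, reported in HSP_COUNTS.findall(counts_line):
        key = "identity" if label == "Identity" else "positives"
        hsp[key] = (int(hits), int(length))
        # Percentage = matching columns / alignment length (gaps included).
        hsp[key + "_pct"] = 100.0 * int(hits) / int(length)
        hsp[key + "_reported"] = float(reported)
    return hsp


hsp = parse_hsp_header(
    "HSP 1 Score: 466.8 bits (1200), Expect = 3.2e-127",
    "Identity = 293/751 (39.01%), Positives = 423/751 (56.32%), Query Frame = 0",
)
print(hsp["identity"], round(hsp["identity_pct"], 2))
print(hsp["positives"], round(hsp["positives_pct"], 2))
```

Comparing the recomputed value with the reported one is a cheap sanity check when alignments are copied between tools.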
BLAST of Lag0029754 vs. ExPASy TrEMBL
Match:
A0A6J1DU19 (uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024361 PE=4 SV=1)
HSP 1 Score: 430.6 bits (1106), Expect = 2.6e-116
Identity = 269/666 (40.39%), Positives = 372/666 (55.86%), Query Frame = 0
Query: 40 QDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLK 99
Q+ Q IRDY QP FP+ GI+ PINANN ELK GLIQM R++ FRG +EDPN+HL
Sbjct: 21 QNNQMTIRDYCQPNFPN-HVGIINLPINANNSELKPGLIQMVRENTFRGNATEDPNNHLT 80
Query: 100 SFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFS 159
FLD+CGTVK+NGV +DAIRLRLFP SLQDK +VQAFL +F
Sbjct: 81 IFLDVCGTVKMNGVIDDAIRLRLFPLSLQDK-----------------EMVQAFLTNFFP 140
Query: 160 PAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKT 219
PAKT +LRTEI +F++ EQLFE WERYKELLRKCPQHG +WLQIQ+FYNGLN T+T
Sbjct: 141 PAKTTQLRTEIRSFRKYDYEQLFEVWERYKELLRKCPQHGNLEWLQIQMFYNGLNGQTRT 200
Query: 220 IVDAAAGGTLLSKTVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMT 279
I+DAAAGGTLLS+T ENA LL+DMA NS+QWP+ERS KK+ AG+YEID +SSL+AQ+
Sbjct: 201 ILDAAAGGTLLSRTPENAYILLKDMADNSFQWPSERSNAKKV-AGMYEIDELSSLKAQVQ 260
Query: 280 SLANAFMKFSGTGSAQSIESAALASQCQEETTTEQVHYVERNSNYRGHHNSTPTHYHPNV 339
+L NA K SG G++ S E A T ++Y P +
Sbjct: 261 ALTNAVSKLSGPGTSHSNELVAA--------------------------TDTYSYYEPTI 320
Query: 340 RNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIAINTTV 399
+ + S E K SLED++GAFI E R +++E V + +
Sbjct: 321 EQAQ---------------FTSHPAEKKSSLEDLLGAFINECRSRASRIENQVEGMEVKL 380
Query: 400 TGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVVDTQEI 459
G+ +IKN+E Q+GQ+ +++MQKGK ++ E E+CKA+T+ +E+Q + +++
Sbjct: 381 EGNTTSIKNMEVQIGQIAPTLNTMQKGKFPSDIEVKPREHCKAVTLRSGKELQEPEKKKM 440
Query: 460 HEPEVTKEE-------VEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQFKKF 519
EP +T EE V+E + + +A+K TS ++ SP +P P+
Sbjct: 441 EEPVITTEERENKEEVVKEATPALQADKPTSS-IVSSPPNSLPYPQ-------------- 500
Query: 520 LDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQYSLPLHAVPDFKRMCLTN 579
ALEQMP YV+FMK+ ++ K+K + +L +R
Sbjct: 501 --------------HALEQMPNYVRFMKDIMTGKRKLEAYETVNLTEECSAILQRKLPQK 560
Query: 580 LLIQGVFLFPAILLNIGEIKPTAVKLQLADQSVVSPYGVVENVLIQVGKFFLPVDFFVMD 639
L G F P + + K + + P GV+E+VL++V + P DF V+
Sbjct: 561 LKDPGSFTIPCTISSSSFNKALC---DICASINLMPLGVIEDVLVKVDRLIFPADFVVLX 594
Query: 640 VKENPVVPVILGRPFLATGRVIIDIERRELTVRVLHEKEIFKAVGASTNCSEVMMFRSVC 699
+E+ +P+ILGR FLATG +ID++ LT+RV E +F A EV +
Sbjct: 621 XEEDSEIPIILGRXFLATGXALIDVQLGXLTLRVNEEVVVFDISXAMKYXEEVSTCHRID 594
BLAST of Lag0029754 vs. ExPASy TrEMBL
Match:
A0A6P8DD93 (uncharacterized protein LOC116206453 OS=Punica granatum OX=22663 GN=LOC116206453 PE=4 SV=1)
HSP 1 Score: 409.8 bits (1052), Expect = 4.7e-110
Identity = 273/738 (36.99%), Positives = 401/738 (54.34%), Query Frame = 0
Query: 1 MRKDKDALLVPFDFEIEKTCKKNRKEKRER----LASMA----NPNPQDEQKPIRDYFQP 60
MR+ + A L+P D EIE+T + R+E R R + MA N Q + +RDY P
Sbjct: 1 MRRSRSAELLPLDPEIERTLHRLRRENRRREELQVVEMADDDINRQIQGAARALRDYAVP 60
Query: 61 TFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNG 120
T S I I ANNFELK LIQM + + F G+P+E P+ H+ FL C TVK+N
Sbjct: 61 TI--MGSAIRRPTIPANNFELKPALIQMVQSNQFGGYPNESPDEHIAGFLQYCNTVKMNN 120
Query: 121 VSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGT 180
V++D IRL+LFPFSL+DKAR W SLP SITTW L FL ++F PA+T +LR EI
Sbjct: 121 VTDDVIRLQLFPFSLRDKARAWFNSLPQESITTWADLSSKFLRRFFPPARTARLRNEITN 180
Query: 181 FQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSK 240
F + E L+EAWER+KE +RKCP HG PD L I++FY L+ +++VDAAAGG L+ K
Sbjct: 181 FTKFNGESLYEAWERFKEAIRKCPHHGLPDNLLIEVFYLSLDDTLRSLVDAAAGGALMGK 240
Query: 241 TVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTG 300
+ A L+E+MA++++ W ERS K A + ++D +++L Q+++L K +
Sbjct: 241 NYDEASALIEEMASSAHNWQNERS--KSRVASVNDMDTIANLTTQISALTTQVSKLTSAH 300
Query: 301 SAQSIESAALASQCQEETTT--------------EQVHYVERNSNYRGHHNSTPTHYHPN 360
S + A C +T EQV++V N+ R + Y+P
Sbjct: 301 SFNT-NQVAFCELCSGPHSTLECMSGNPSASPNGEQVNFV--NNFQRSNQGPYSNTYNPG 360
Query: 361 VRNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIA---- 420
RNH NFS+ N N L PP P + A A + +++EE +++
Sbjct: 361 WRNHPNFSWRNENNALKPP----------PGFQKQGPAQNAPPQQSQSRMEELMLSYMQK 420
Query: 421 INTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVV 480
+T + A I+N+E Q+ Q+ +S+ G + E+ + AI + +E+++V
Sbjct: 421 TDTMLQNQQATIRNLEGQISQISQQLSNRPSGSLPSNTEENP-KGVNAIMLRSGKELEIV 480
Query: 481 D-----TQEIHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQF 540
+ +E E + K++VEE + L P +P VP P++ K++ QF
Sbjct: 481 NRKAQTQEESPEKDKGKQKVEE----PRQKSLGVKPYVPP----VPFPRRLKQQQLDAQF 540
Query: 541 KKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKL 600
KFLD+F L IN+PFAEAL+QMP Y +FMK+ L++K+K +K L
Sbjct: 541 AKFLDVFKKLQINIPFAEALQQMPSYARFMKDLLTKKRKFDGSEPVMLTGECSMILQKDL 600
Query: 601 RQYSLPLHAVPDFKRMC------LTNLLIQ---GVFLFPAIL---LNIGEIKPTAVKLQL 660
F C N+LI + L P + L +GE K T V LQL
Sbjct: 601 PNLPRKQRDQGSFTVPCTIGNFHFENVLIDSGASINLMPLSIFRKLGLGECKKTHVTLQL 660
Query: 661 ADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERR 679
AD+S+ P G+VENVL++V KF PVDF V++++E+ VP+ILGRPFLATG+ +ID+E+
Sbjct: 661 ADRSIKYPKGIVENVLVKVDKFIFPVDFIVLEMEEDREVPMILGRPFLATGKALIDVEQG 712
BLAST of Lag0029754 vs. ExPASy TrEMBL
Match:
A0A6P8DKJ2 (uncharacterized protein LOC116204231 OS=Punica granatum OX=22663 GN=LOC116204231 PE=4 SV=1)
HSP 1 Score: 407.9 bits (1047), Expect = 1.8e-109
Identity = 272/738 (36.86%), Positives = 400/738 (54.20%), Query Frame = 0
Query: 1 MRKDKDALLVPFDFEIEKTCKKNRKEKRER----LASMA----NPNPQDEQKPIRDYFQP 60
MR+ + A L+P D EIE+T + R+E R R + MA N Q + +RDY P
Sbjct: 107 MRRSRSAELLPLDPEIERTLHRLRRENRRREELQVVEMADDDINRQIQGAARALRDYAVP 166
Query: 61 TFPDQQSGIVYAPINANNFELKTGLIQMARDSAFRGFPSEDPNSHLKSFLDICGTVKLNG 120
T S I I ANNFELK LIQM + + F G+P+E P+ H+ FL C TVK+N
Sbjct: 167 TI--MGSAIRRPTIPANNFELKPALIQMVQSNQFGGYPNESPDEHIAGFLQYCNTVKMNN 226
Query: 121 VSEDAIRLRLFPFSLQDKARDWLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGT 180
V++D IRL+LFPFSL+DKAR W SLP SITTW L FL ++F PA+T +LR EI
Sbjct: 227 VTDDVIRLQLFPFSLRDKARAWFNSLPQESITTWADLSSKFLRRFFPPARTARLRNEITN 286
Query: 181 FQQLGDEQLFEAWERYKELLRKCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSK 240
F + E L+EAWER+KE +RKCP HG PD L I++FY L+ +++VDAAAGG L+ K
Sbjct: 287 FTKFNGESLYEAWERFKEAIRKCPHHGLPDNLLIEVFYLSLDDTLRSLVDAAAGGALMGK 346
Query: 241 TVENARTLLEDMATNSYQWPTERSAPKKIAAGIYEIDNVSSLQAQMTSLANAFMKFSGTG 300
+ A L+E+MA++++ W ERS K A + ++D +++L Q+++L K +
Sbjct: 347 NYDEASALIEEMASSAHNWQNERS--KSRVASVNDMDTIANLTTQISALTTQVSKLTSAH 406
Query: 301 SAQSIESAALASQCQEETTT--------------EQVHYVERNSNYRGHHNSTPTHYHPN 360
S + A C +T EQV++V N+ R + Y+P
Sbjct: 407 SFNT-NQVAFCELCSGPHSTLECMSGNPSASPNGEQVNFV--NNFQRSNQGPYSNTYNPG 466
Query: 361 VRNHENFSYANTKNILNPPGYASQKVENKPSLEDIVGAFIAESSKRTNKLEEAVIA---- 420
RNH NFS+ N N L PP P + A A + +++EE +++
Sbjct: 467 WRNHPNFSWRNENNALKPP----------PGFQKQGPAQNAPPQQSQSRMEELMLSYMQK 526
Query: 421 INTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCKAITVHQVEEVQVV 480
+T + A I+N+E Q+ Q+ +S+ G + E+ + AI + +E+++V
Sbjct: 527 TDTMLQNQQATIRNLEGQISQISQQLSNRPSGSLPSNTEENP-KGVNAIMLRSGKELEIV 586
Query: 481 D-----TQEIHEPEVTKEEVEEGSSSTEAEKLTSDPLIPSPTVLVPKPKKKKKKNYSTQF 540
+ +E E + K++VEE + L P +P VP P + K++ QF
Sbjct: 587 NRKAQTQEESPEKDKGKQKVEE----PRRKSLGVKPYVPP----VPFPGRLKQQQLDAQF 646
Query: 541 KKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKK-----------------EKKL 600
KFLD+F L IN+PFAEAL+QMP Y +FMK+ L++K+K +K L
Sbjct: 647 AKFLDVFKKLQINIPFAEALQQMPSYARFMKDLLTKKRKFDGSEPVMLTGECSMILQKDL 706
Query: 601 RQYSLPLHAVPDFKRMC------LTNLLIQ---GVFLFPAIL---LNIGEIKPTAVKLQL 660
F C N+LI + L P + L +GE K T + LQL
Sbjct: 707 PNLPRKQRDQGSFTVPCTIGNFHFENVLIDSGASINLMPLSIFRKLGLGECKKTHITLQL 766
Query: 661 ADQSVVSPYGVVENVLIQVGKFFLPVDFFVMDVKENPVVPVILGRPFLATGRVIIDIERR 679
AD+S+ P G+VENVL++V KF PVDF V++++E+ VP+ILGRPFLATG+ +ID+E+
Sbjct: 767 ADRSIKYPKGIVENVLVKVDKFIFPVDFIVLEMEEDREVPMILGRPFLATGKALIDVEQG 818
BLAST of Lag0029754 vs. ExPASy TrEMBL
Match:
A0A5N6LUB5 (Retrotrans_gag domain-containing protein OS=Mikania micrantha OX=192012 GN=E3N88_38555 PE=4 SV=1)
HSP 1 Score: 405.6 bits (1041), Expect = 8.8e-109
Identity = 263/745 (35.30%), Positives = 395/745 (53.02%), Query Frame = 0
Query: 15 EIEKTCKKNRKEKRERLASMANPNPQDEQKPIRDYFQPTFPDQQSGIVYAPINANNFELK 74
E EK + E +A N E++ I DY +P+ D S IV INANNFE++
Sbjct: 35 EFEKEANHEEENTGETMAGQGN-----ERRSISDYARPSLGDLASSIVRPTINANNFEIR 94
Query: 75 TGLIQMARDS-AFRGFPSEDPNSHLKSFLDICGTVKLNGVSEDAIRLRLFPFSLQDKARD 134
IQM +++ F G EDP++H+ SF++IC T K NGVS+DAI+LR+FPFSL+D+A+
Sbjct: 95 PHFIQMIQNNLQFYGLTDEDPSAHITSFIEICDTFKANGVSDDAIKLRMFPFSLKDRAKA 154
Query: 135 WLRSLPSGSITTWDALVQAFLAKYFSPAKTVKLRTEIGTFQQLGDEQLFEAWERYKELLR 194
WL SLP GS+TTW+ L Q FL KYF P+KT +LR I +F Q E L++AWERYK+L+R
Sbjct: 155 WLSSLPPGSVTTWEDLAQKFLFKYFPPSKTARLRNNITSFVQDDGESLYDAWERYKDLMR 214
Query: 195 KCPQHGYPDWLQIQLFYNGLNPNTKTIVDAAAGGTLLSKTVENARTLLEDMATNSYQWPT 254
KCP HG W+Q+ FYNGL P + ++DA AGG KT E LLE +A N++QW
Sbjct: 215 KCPHHGLDSWMQVTTFYNGLFPQDRQMIDATAGGAFTDKTPEEGYALLEQLAANNHQWQA 274
Query: 255 ERSAPKKIAAGIYEIDNVSSLQAQ---MTSLANAFMKFSGTGSAQSIESAALASQCQE-- 314
R K G++++D+ +SL AQ MT N L+ CQ
Sbjct: 275 TRGKTPK--QGVHQVDDYTSLVAQVEAMTRKINQMQMNQVQTWCDFCGGPHLSVNCQAGN 334
Query: 315 -ETTTEQVHYVERNSNYRGHHNSTPTHYHPNVRNHENFSY-ANTKNILNPPGY------- 374
+T+ EQV ++ S R +N Y+P +NH NFS+ A+ N PPG+
Sbjct: 335 LKTSREQVDFM--GSQNRPQNNPYSNTYNPGWKNHPNFSWKASGSN--QPPGFQQRPPYQ 394
Query: 375 -------ASQKVENKP------------------------SLEDIVGAFIA-------ES 434
A Q+ +++P +LE ++ F++ +S
Sbjct: 395 QNQQPFEARQQNQSQPQNQNQYQQYQDQGSSSGSQPEKMSNLEKMMTQFLSTAEARHQKS 454
Query: 435 SKRTNKLEEAVIAINTTVTGHGAAIKNIETQLGQLVNAVSSMQKGKTTAEPEKTQMEYCK 494
R ++ + + G+AI+ IE Q+GQ+ ++ +KGK + E E+CK
Sbjct: 455 EARHDQADARHQQFENELRSQGSAIRGIENQMGQIAKLLADREKGKLPSNTETNPKEHCK 514
Query: 495 AITVHQVEEVQVVDTQEIHEPEVTKE-EVEEGSSSTEAEKLTSDPL-----IPSPTVLVP 554
A+T+ + + D +P V +E EV++ +T+ + P+ + PT +P
Sbjct: 515 AVTLRSGKTTKSDDLASTSKPIVEEEVEVQDEVKNTKQDSTGKAPVKEPLRVYKPT--IP 574
Query: 555 KPKKKKKKNYSTQFKKFLDIFMSLNINLPFAEALEQMPKYVQFMKEWLSRKKKEKKLRQY 614
P + K +N + KFLD+F L+INLPF EAL QMPKY +F+K+ L+ K+K ++L
Sbjct: 575 YPGRLKNENMEKHYGKFLDLFKQLHINLPFVEALSQMPKYAKFLKDLLTNKQKLEELSHV 634
Query: 615 SLPLHAVPDFKRMCLTNLLIQGVFLFPAIL--------------------------LNIG 674
L + + G F P ++ L++G
Sbjct: 635 ILNEECSAVLQNKLPEKMKDPGSFTIPCLIGGLSVNNALADLGASINLMPYSMFSKLDLG 694
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
KAG7990634.1 | 8.9e-156 | 44.46 | hypothetical protein I3843_02G035100 [Carya illinoinensis]
KAG6734747.1 | 3.3e-150 | 44.62 | hypothetical protein I3842_01G285500 [Carya illinoinensis]
XP_022843226.1 | 7.3e-150 | 45.09 | uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]
KAG7947748.1 | 1.2e-149 | 43.90 | hypothetical protein I3843_14G109500 [Carya illinoinensis]
XP_023874613.1 | 1.8e-148 | 44.46 | uncharacterized protein LOC111987139 [Quercus suber]
Match Name | E-value | Identity (%) | Description
A0A6J0ZX64 | 3.2e-127 | 39.01 | LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica ...
A0A6J1DU19 | 2.6e-116 | 40.39 | uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6P8DD93 | 4.7e-110 | 36.99 | uncharacterized protein LOC116206453 OS=Punica granatum OX=22663 GN=LOC116206453...
A0A6P8DKJ2 | 1.8e-109 | 36.86 | uncharacterized protein LOC116204231 OS=Punica granatum OX=22663 GN=LOC116204231...
A0A5N6LUB5 | 8.8e-109 | 35.30 | Retrotrans_gag domain-containing protein OS=Mikania micrantha OX=192012 GN=E3N88...
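The summary tables above are pipe-delimited, so they can be loaded for sorting or filtering with a few lines of code. A minimal sketch, assuming every data row carries at least four cells (match name, E-value, identity, description) and that descriptions contain no `|` character. The helper name `parse_hit_table` is hypothetical, and the sample rows are copied from the tables on this page.

```python
def parse_hit_table(text):
    """Parse pipe-delimited BLAST summary rows into a list of dicts."""
    hits = []
    for line in text.splitlines():
        cells = [cell.strip() for cell in line.split("|")]
        # Skip blank lines, header rows, and rows with too few cells.
        if len(cells) < 4 or cells[0] == "Match Name":
            continue
        name, evalue, identity, description = cells[:4]
        hits.append({
            "name": name,
            "evalue": float(evalue),      # e.g. "3.2e-127"
            "identity": float(identity),  # percent identity
            "description": description,
        })
    return hits


sample = """Match Name | E-value | Identity | Description
A0A6J1DU19 | 2.6e-116 | 40.39 | uncharacterized protein LOC111024361 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J0ZX64 | 3.2e-127 | 39.01 | LOW QUALITY PROTEIN: uncharacterized protein LOC110412945 OS=Herrania umbratica ...
KAG7990634.1 | 8.9e-156 | 44.46 | hypothetical protein I3843_02G035100 [Carya illinoinensis]
"""

hits = parse_hit_table(sample)
# Smaller E-values indicate more significant hits, so sort ascending.
best_first = sorted(hits, key=lambda hit: hit["evalue"])
for hit in best_first:
    print(hit["name"], hit["evalue"], hit["identity"])
```

Sorting on the parsed float rather than the string matters here, because E-values such as `8.9e-156` and `3.2e-127` do not order correctly as text.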