Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAAGCAACGAACGAGGAATGATGATAGTAGGGAGAGGAAAGTCGATTTGGAGGGTGGGCGCGAGTGAGGCAGAATGGTGGAGGGCGAAGAGCGGCAGCCATGGACGACAGACGACGGATGGCGGCGGCAGTGTAGGACTACGGAAAGACGAATGGCGAGGAGCGGTTGTGCTTGGGGAATGGTGATGAAAAATTGATTTGGGGGGTGGGAATGAATGGGAAAGAGAAATGGATTTCAGGATTATTTTTAAAACACCATCTCCCCACGTATTTACCAAAATCTTTAAGTTAATTTACCTTTTTTCTCTATTCTTTTTCTATGCTTTGTAATTAAATTTGAAAAAAATAATTTGTAATTTCAAATTGCAATCCTTCGTATTGAAATTATCAGCATTAATTAAATCTTTTTCGATGGAAGAAAAAAAAATCTATGAACATTTTTTGTATTAAAATATATAGATAAAATTAAAAAGTAAAAAATAAGATTTGCAATGGTGTGTAAGTTAATATTTAAATAATATTTTGACTTAAACCTATTTTCTAAAATCTATAAATTTAAGGATCTGTTTGATAATGATTTCTCCTTTTGGGTCCGTGGTTTTTCTGTTTTAAAAATCATAATTTGTATATGTTTAATAACTATTTTTCATTTTATGTTTCTAAAAGTTAAAAAATTGAAAATATATGTTACAGAAATAGAATTCTATATAGAGTATAGATGGAAATGGTATGGGGTGGGAGTGAGGATGCCCTCTCCCGTTCGTGCCTGTGGAAAAAAACTCGTGTGCACGAATATAAATACATTTTCTTGCTAAAAAACACCGCTCACCATTTCATGATGCTTTAAAAATGTGACAAATCATTATAAATTTGTTGCACACTCTTTCATGATGTTTTTTAAACGTCACTAAAGATATATGTTTCCCACTTTCCACAAAAGCCTCACCTTTTCATGACGTGTCTTAAATGTCACGATAAGGTTATATTTCCAACTTTTCATGACGTTTTTTAGACGTCGGAAAAAGACAGAAATCTGATTTTCCCGCCAACTTCCTCACTGTTCTTTCGTAACATTTAATAAAATGTCATGAAAAAGCTTTTCGTGACGTTCAAAAAATGTCACAGAAAAGTTGAATATTCGAAGAATACAATCTTTTCCCTCCAAAAACGCATCTCTACATTTTGTGACGTTTAATAAACGTCATGAAAATGTACTTACTTTTGACTTTTTGGTGATGGTTAGAAAAAATGTCACGAAAACTCAAAATCCGGAACATTCTGTCATTTCCCACCAAAAACGCGCCTCACTCTTTTGTGACATTTTTTAAACGTCACAAAAAGGTCAAAATTCTTGAAATCTCCATTTTTCCCCTCCAAAAACGTGTTTTTTTTTCTTTTCGTGACGTTTTTTTTAATTTGAACAGTATTTTAGTGTTCATGAGCATCACTAAAACCCAAAATTCTAGTAGTGCGAAATGGTCCCGGGAGGTTTGTTGTTGCGCTTCTATCGAAATGCCATAGCCAGAGGTCGCTGGAGTATGGGTTAGTTTGCAAAGATAGGAGATAATGATGTAGAAGATGAAGAAGATGATGGGTTTGTAAAGAGGGGCATAATAGAAAGGAGAAAAGAAAAAAGAAAAAGTAAAAATGTAACTATACAGGAAGATGGTGGGGTTTTAGAGAGAGAAAGAATAAAAATGGAAAAGAAAGGTGGCTTGGGTTTTATGGCCCTCTTTGGTCGCGAAAAACAACCCCAATTGTCGATTGCTTCGATGCTACCGGAGGCTCTGGGGAAGTCGGAATAGGAGAGTACTCATCTTGGGGTGGGCTTACTACTTAGATGCTTTCAGCAGTTATCCGCTCCGCACTTGGCTACCTAGCGTTTACCGTGGGCACGATAACTGGTACACCAGAGGAGCTCATATTGGCCATGGGTCAGAACGAGAATTGAATTCTATGAGACCTAATTTCCCGTTGTTCCTCAGTAGCTCAGTGGTAGAGCGGTCGGCTGTTAACCGATTGGTCGTAGGTTCGAATCCTCCTTGGGGAGATTTGATTCATTCCGAATTAAAGAATTCGGAATGAAAGGGTTCGCTTTGACCGTTAAGAGTAGGTAATCTGTTCCCTATGTCTTTGTTTCTATTGCATTCTATCTCATCGTATCAAATTCTGTTCTGCGATATTTGAGAATCACCGTCAATACCTCGGTGTAGGTCCGAGATAATCCTTTGTCCCACAGTCTGGGTCTATTTACAACTAGCCAATTAAGAATTCTCAGATGTACTAGTACTAGCAAGTGCATCTTTGATGCAGTCATCGATTCTCCCGAGAGGTTACAATTACCGTGAGCAAACATATTAATGACGAGGAACACATTTTTGCTATTCTACTAATACTTGTACTTGCTCTGCTATTCTGCCCAAGCCTGGCTGAGGAAGAATTACGGGGCGTAAAACAAAAAAATATGCTGATTCGGGTTGGGTATACTATATTTAATTTATAACGGAACCCCCGCCCTTAACCCATTAACGAAAAAAGAAAAAATAAAAAGATAAGGCCATTCCATTTCGACAAAAGACCCACACCCAAGTTCCATAGCTTTGGGTCCGCTATCCCGATCATGATTTTCCTACCCCCAGAGGGAAAAGTCCTTCCCTTTTTGGCCGGTTGTGGGCGAGGAGGGATTCGAACCCCCGACACCGTGGTTCGTAGCCACGTGCTCTAATCCTCTGAGCTACAGGCCCCACCCCGTCTCCACTGGATCTGTTCCCGGGGGTACCCTAAAAAAAGGAACCTTTCCTCTCCCCAGCCATTTGCCATTTCGGGTTAAGAAGATGTGAAAGCGCCTCTCTCTCTATAAGAACGGTGCGTTCCAAGGTGTGAAGTGAGAGAGAAAAGTGGGAGAGAAGGGGTTTCATAATTGAGGTTTTGAATAAGACGACCTTTTCATTTTTCATTTTTTTTTTCCATATTGAATATTGAAAAGTAATAAGAATTAGAGGTGTTAAACTTTTTATCATCCTTGCGTCGAGCTATTTTTCCGCAGGACCTCCCCTACAGTATCGTCACCGCAGTAGAGTTTAACCACCAAGTTCGGGATGGATTGGTGTGGTTCCTCTACGCCTAGGACACCAGAATATCGAACCATGAACGAAGAAAGGCATGAGAGAAAAGCATATTGGCTAGTGATTGTGAGGCCCCAATTCTTGACTGGAGGGGACACCAAAGGCCTCTGCCCTTCCATCCCTTGGATAGATAGAGAGGGAGGGCAGAGCTTTTTTTGGTTTTTTCATGTTGTCAAAGAGTTGAACAATGATTTTTTCGTGTTGTCAAAGAGTTGAACAATGAAAATAGATGGCGAGTGCCTAATCGAATTGATCGGGTCATGTAGGAACAAGGTTCAAGTCTACCGGTCTGTTAGGATGCCTCAGCTGCATACATCACTGCACTTCCACTTGACACCTATCGTGATGATAAACGGCTCATCTCGCCGTGACCTTCTCTTGAATTCTCAAAACTTCTGTCGCTCCATCCCCGCAGGGGCAGAGAACCCATCGCTGTCTCGGCTGTGCTACCGGAGGCTCTGGGGAAGTCGGAATAGGAGAGCACTCATCTTGGGGTGGGCTTACTACTTAGATGCTTTCAGCAGTTATCCGCTCCGTACTTGGCTACCCAGCGTTTACCGTGGGCACGATAACTGGTACACCAGAGGTGCGTCCTTCCCGATCCTCTCGTACTAGGGAAAGGTCCTCTCAATGCTCTAACGCCCGGATATGGACCGAACTGTCTCACGACGTTCTGAACCCAGCTCACGTACCGCTTTAATGGGCGAACAGCCCAACCCTTGGAACATACTACAGCCCCAGGTGGTGAAGAGCCGACATCGAGGTGCCAAACCTTCCCGTCGATGTGAGCTCTTGGGGAAGATCAGCCTGTTATCCCTAGAGTAACTTTTATCTGTTGAGCGACGACCCTTCCACTCGGCACCGTCGGATCACTAAGGCCGACTTTCGTCCCTGCTCGACGGGTGGGTCTTGCAGTCAAGCTCCCTTCTGCTTTTGCACTCGAGGGCCAATCTCCGTCTGGCCCGAGGAAACCTTTGCACGCCTCCGTTACCTTTTGGGAGGCCTACGCCCCATAGAAACTGTCTACCTGAGACTGTCCCTTGGCTCGTAGGTCCTGACACAAGGTTAGAATTCTAGCTCTTCCAGAGTGGTATCTCACTGATGGCTCGGGCCCCCCCGGAAGGGGGCCTTTTTCGCCCTCCACCTAAGCTGCGCAGGAAAGGCCCAAAGCCAATCCCACAGAACAGTGAAGCTTCATAGGGTCTTTCTGTCCAGGTGCAGGTAGTCCGCATTTTCACAGACATGTCTATTTCACCGAGCCTCTCTCTGAGTTAGTGCCCAGATCGTTACGCCTTTCGTGCGGGTCGGAACTTACCCGACAAGGAATTTCGCTACCTTAGGACCGTTATAGTTACGGTCACCGTTCACCGGAGCTTCGGTCGCCGGCTCCCCTATCATCAGGTCACCAACTTCCTTGACCTTCCGGCACTGGGCAGGCGTCAGCCCCCATACATGGTCTTACGACTTTGCGGAGACCTGTGTTTTTGGTAAACAGTCGCCCGGGCCTGGTCACTGCGACCCCCTTTGTGAGGAGGCACCCCTTCTCCCGAAGTTACGGGGCTATTTTGCCGAGTTCCTTAGAGAGAGTTGTCTCGCGCCCCTAGGTATTCTCTACCTACCCACCTGTGTCGGTTTCGGGTACAGGTACCCTTTTGTTGAAGGTCGTTCGAGCTTTTCCTGGGAGTATGGCATGGGTTACTTCAGCGCCGTAGCGCCTGGTACTCGAACATTGGCTCGAGGTATTTTCTCTACCCCTTCTTACCCTGAAAAAGCAGGGGCACCTTGCGTCCTTGAACCGATAACCATCTTTCGGCTAAACCTAGCCTCCTCCGTCCCTCGGGACTAACAAGGGGTAGTACAGGAATATTCACCTGTTGTCCATCGACTACGCCTTTCGGCCTGATCTTAGGCCCTGACTCACCCTCCGTGGACGAACCTTGCGGAGGAACCCTTACGTTTTCGGGGCATTGGATTCTCACCAATGTTTGCATTACTCAAGCCGACATTCTCGCTTCCGCTTCGTCCACCACCGCTCGCGCGGGTGCTTCCCTCTAAGGCGGAACGCTCCCACAGCTTCGGCAGATTGCTTAGCCCCGTTCATCTTCGGCTCAAGAGCGCTCGATCAGTGTGCTATTACGCACTCTTTCAAGGGTGGCTGCTTCTAGGCAAACCTCCTGGCTGTCTCTGCACCCCTACCTCCTTTATCACTGAGCGGTCATTTAGGGGCCTTAGCTGGTGATCCGGGCTGTTTCCCTCTCGACGATGAAGCTTATCCCCCATCGTCTCACTGGCCAACCTTGACCCCTGTTATTTTGAGATCATATCTAGTATTTAGAGTTTGCCTCGATTTGGTACCGCTCTCGTGGCCCGCACCGAAACAGTGCTTTACCCCTAGATGTCCAGTCAACTGCTGCGCCTCAACGCATTTCGGGGAGAACCAGCTAGCTCTGGGTTCGAGTGGCATTTCACCCCTAACCACAACTCATCCGCTGATTCTTCAACATCAGTCGGTTCGGACCTCCACTTAGTTTCACCCAAGCTTCATCCTGGTCATGGATAGATCACCCAGGTTCGGGTCCATAAGCAGTGACAATTGCCCCATGAAGACTCGCTTTCGCTACGGCTCCGGTGGGTTCCCTTAACCAAGCCACTGCCTATGAGTCGCCGGCTCATTCTTCAACAGGCACGCGGTCAGAGTCCTGGTCTCCTCCCACTGCTTGGAAGCTTACGGTTTCATGTTCTATTTCACTCCCCGATGGGGGTTCTTTTCACCCTTCCCTCACGGTACTACTTCACTATCGGTCACCCAGGAGCGTAAGCTAGTGATGCTTTCGGCTACTGGACTCTCGCCATCTAGGGTGCAGCACTCCACCGCTTCGCCTAGCAGCACGACGCTTGTATTGCTCTCCCACAACCCCGTTTTCACGGTTTAGGCTGCTCCCATTTCTCTCGCCGCTACTACGGGAATCGCTTTTGCTTTCTTTTCCTCTGGCTACTAAGATGTTTCAGTTCGCCAGGTTGTCTCTTGCCTGCCCATGGATTCAGCAGCTATTTGAAAGGTTGACCTATTCGGGAATCTCCGGATCTATGCTTATTTTCAACTCCCCGAAGCATTTCGTCGCTTACTACGCCCTTCCTCGTCTCTGGGTGCCTAGGTATCCACCGTAAGCCTTTCTTCGTTTGAACCTCGCCCTTAACTTTAAGGCTATGCCATCCTAAGGTGCTGCTAAATAGAAGGATCTTATCAACGTCCATGAATGATAAATCATAGATCGAACTGCTGAATCGGAAAAATGGAGTGCTATCATATAGCTTTGTATCGGCTAAGTTCACGAGTTGGAGATAAGCGGACTCGAACCGCTGACATCCGCCACAAGGTAAACCACCGCCTCTCAGGCCCCCGACTGATTCTACCATAGAGGCCAACGATAGACAATAACTCCCCCCCGAACACAGCTTACAACTTTCATCGTACTGTGCTCTCTAAAGAGCAACTCTTCTCAAAATCTCAAAAGGTACTGAGTTGGAATCCCATTATAACTAAGGATTCTTGTGGTTCCGGAGAATCCAGCTACAGGAGAACCAGGAACGGAGAGCTTTCCCCCCTTTTCCGCCCGCCTCTTTGGTCTTAAGAATGCTGGTTTTAAGAATGAGTGATTGCCCTTCTCCGACCCTTACTGCCCAACCGGAGAGCGGACAGCTAATGCGTTCCACTTATTGAACAGGGTTCTATGGTCGGTCTGCGACCCCTGGATACCGAAGGCATCCTTGGGGTGATCTCGTAGTTCCTACGGGGTGGAGACGATGGGGTCGGTCCATGGATTTTCCTTCCTTTTGCCGCATTTCGCTCAAAGGGTTGAAGGGAGATAGTGCATCAAGCTGTTCGCAAGGGCCAACTTGATCCTCTTCCCCAGGGATCCCAAATGAGGGAACCCTAAGAGAGCCGCCGACTCCAACTACCGTCCATGTACAATCCATACTAGATCTGACCAACTGCCCATCCTACCTCCTCTACGTTCTTGACAGCCCATCTTTGTCTCAGTAGAGTCTTTCAGTGGCATGTTTCGGTCTTCTTCCCCATTACTTAGAAAAAGTGAGCCACCGGTTCAGGTACAAGATACTATCATTACCGCCTGGACAATTAGACATCCAACCCGTAATCGCAATGACCCAATTACAAGAGCGGAGCTCTACCAACTGAGCTATATCCCCCCGAGCCAAGTGGGGCCTGCATGATGGAGTCAGATGCTTCTTCTATTCTTTTCGTTGGCGTAGCTGGGCCATCCTGGACTTGAACCAGAGACCTCGCCCGTGAAGTAAATCATCGCACCTACGGTCCAACCAATTGGGAGAGAATCAATAGATTCCTTTTCGGGAGCGATTCATCCTTCCCAACGCAGCATACAACTCTTCGTTGTACTGCGCTCTCCAAGTGTGCTTGTTCCCCCCTTCTTCCTTACCATGGCAAGTCTTTGTGAAATAACTCCGATGAGAAGAAAAAAGAAGGCGTTAAGAAACCCTCCTGGCCCAACCCTAGACACTCTAAGATCCTTTTTCAAACCTGCTCCCATTTCGAGTCAAGAGATAGATAAATAGACACATCCCATTGCACTGATCGGGGGTGTTCGTAGTGACTGAGGGGGTCGACGACCAAGAAGTGAGTTATTCATCAGCCAAACATTCTTCTTACGGCTAGATCAAATCTCCTGGTCCCTGCGGAAAAAAGGAAAAAGAATTTCAAGTTCTTCCTTTCGCTTTCGGGAAGGGAGGATTAAGAAAATCCTATTGATTGCAGCTTTCTCCAGACCTCCGGGAAAAGCATGAAAAAAAAAGGCTCGAATGGTACGATCACTCCGTCACCCCAGAATGAAAGGGGCGATCTCGTAGTTCTTGGTCTGTGAAGATACGTTGTTAGGTGCTCCGTTTTATTTTCCCATTAAGGCCGAACCTAAACCCGTGCTCGAGAGATAGCTGTCCATATACTGATAAGGGATGTATGGATTCTCGAGAAGAGAGGAGCCGAGGTGGTCCCCCCCGGACCGCCCGGATCCCACGAGTGAATAGAAAGTTGGATCTATATTGGATCTCACCTGAATCGCCCCATCTATCCTCCTGAGGAGAAGTTTGGTTTCAAACCCCGGTTCGAACAGGAGAAGTACGCCATGCTAATGTGCCTTGGATGATCCACATCTCAGGGTCAGGCGCTGATGAGCACATTGAACTATCCATGTGGCTGAGAGCCCTCACAGCCCAGGCACAACGACGCAATTATCAGGGGCGCGCTCTACCACTGAGCTAATAGCCCGTCGTGTTGGCCTCCCGCTGGGGGCCCGCTATGCCAAAAGCGAGAGAAACCCCATCCCTCTCTTTCCCTTTTTCGCCCCCATGTCGCCACACGGGAGGGACATGGGGACGTAAAAAAGGGGATCCTATCAACTTGTTTCGACCTAGGATAATAAGCTCATGAGCTTAGTCTTACTTCACCGTCGACAAACGAAAGAAGACTTCCATCTCCAAGTTTAACTCAGACGTAGCTCGCTTCTTTTTGGGTGTGAAGCAGTGTCAAACCAAAATACTCAACAAGCGTTAGCTCTCCCTGAAAAGGAGGTGATCCAGCCGCACCTTCCAGTACGGCTACCTTGTTACGACTTCACTCCAGTCACTAGCCCTGCCTTCGGCATCCCCCTCCTTGCGGTTAAGGTAACGACTTCGGGCATGGCCAGCTCCCATAGTGTGACGGACGGTGTGTACAAGGCCCGGGAATGAATTCACCGCCGTATGGCTGACCGACGATTACTAGCGATTCCGGCTTCATGCAGGCGAGTTGCAGCCTACAATCCAAACTGAGGACGGGTTTTTGGAGTTAGCTCACCCTCGCGGGATCACGACCCTTTGTCCCGGCCATTGTAGCACGTGTGTCGCCCAGGGCATAAGGGGCATGATGACTTGACGTCATCCTCACCTTCCTCCGGCTTATCACCGGCAGTCTGCTCAGGGTTCCAACCTCAACGGTTGGCAACTAAACATGAAGGTTGCGCTGCGGGACTTAACCCAACACCTTACGGCACGAGCTGACGACAGCCATGCACCACCTGTGTCCGCGTTCCCGAAGGCACCCCTCTCTTTCAAGAGGATTCACGGCATGTCAAGCCCTGGTAAGGTTCTTCGCTTTGCATAGAATTAAACCACATGCTGCACCGCTTGTGCGGGCCCCCGTCAATTCCTTTGAGTTTCATTCTTGCGAACGTACTCCCCAGGCGGGATACTTAACGCGTTAGCTACAGCACTGCACGGGTCGATACGCACAGCGCCTAGTATCCATCGTTTACAGCTAGGACTACTGGGGTATCTAATCCCATTCGCTCCCCTAGCTTTCGTCTCTCAGTGTCAGTGTCGGCCCAGTAGAGTGCTTTTGCCGTTGGTGTTCTTTCCGATCTCTACGCATTTCACCGCTCCACCGAAAATTCCCTTTGCCCCTACCGTACTCCAGCTTGGTAGTTTCCACCGCCTGTCCAGGGTTGAGCCCTGGGATTTGACGACGGACTTAAAAAGCCACCTACAGACGCTTTACGCCCAATCATTCCGGATAACGCTTGCATCCTCTGTATTACCGCGGCTGCTGGCACAGAGTTAGCTGATGCTTATTCCCCAGATACCGTCATTGCTTCTTCTCCGGGAAAAGAAGTTCACGACCCGTAGGCCTTCTACCTCCACGCGGCATTGCACCGTCAGGCTTTCGCCCATTGCGGAAAGTTCCCCACTGCTGCCTCCCGTAGGAGTCTGGGCCGTGTCTCAGTCCCAGTGTGGCTGATCATCCTCTCGGACCAGCTACTGATCATCGCCTTGGTAAGCTATTGCCTCACCAACTAGCTAATCAGACGCGAGCCCCTCCTCAGGCGGATTCCTCCTTTTGCTCCTCAGCGTACGGGGTATTAGCAGCCGTTTCCAGCTGTTGTTCCCCTCCCAAGGGCAGGTTCTTACGCGTTACTCACCCGTCCGCCACTGGAAACACCACTTCCCGTCCGACTTGCATGTGTTAAGCATGCCGCCAGCGTTCATCCTGAGCCAGAATCGAACTCTCCCTGAGATTCATAGTTGCATTACTTATAGCTTCCTTGTTCGTAGACAAAGCTAATTCGGAATTGTCTTTCATTCCAAGGCATAACTTGTATCCATGCGCTTCATATTCGCTTGGAGTTCGCTCCCAGAAATATAGCCATCCCCACCCCCTCACGTCAATCCCACGAGCCTCTTATCCATTCTCATTCGATCACGGCGGGGGAGCAAGTAAAAATAGAAAAACTCACATTGGGTTTAGGGATAATCAGGCTCGAACTGATGACTTCCGCCACGTCAAGGCAACACTCTACCGCTGA
mRNA sequence
ATGGCAAGCAACGAACGAGGAATGATGATAGTAGGGAGAGGAAAGTCGATTTGGAGGGTGGGCGCGAGTGAGGCAGAATGGTGGAGGGCGAAGAGCGGCAGCCATGGACGACAGACGACGGATGGCGGCGGCAGTGTAGGACTACGGAAAGACGAATGGCGAGGAGCGGTTGTGCTTGGGGAATGGAACAAGGTTCAAGTCTACCGGTCTGTTAGGATGCCTCAGCTGCATACATCACTGCACTTCCACTTGACACCTATCGTGATGATAAACGGCTCATCTCGCCGTGACCTTCTCTTGAATTCTCAAAACTTCTGTCGCTCCATCCCCGCAGGGGCAGAGAACCCATCGCTGTCTCGGCTGTGCTACCGGAGGCTCTGGGGAAGTCGGAATAGGAGAGCACTCATCTTGGGGTGGGCTTACTACTTAGATGCTTTCAGCAGTTATCCGCTCCGTACTTGGCTACCCAGCGTTTACCGTGGGCACGATAACTGGGCCAATCTCCGTCTGGCCCGAGGAAACCTTTGCACGCCTCCGTTACCTTTTGGGAGGCCTACGCCCCATAGAAACTGTCTACCTGAGACTGTCCCTTGGCTCGTAGGTCCTGACACAAGGTCACCAACTTCCTTGACCTTCCGGCACTGGGCAGGCGTCAGCCCCCATACATGGTCTTACGACTTTGCGGAGACCTGTGTTTTTGGTAAACAGTCGCCCGGGCCTGGTCACTGCGACCCCCTTTGTATTCTCTACCTACCCACCTGTGTCGGTTTCGGGTACAGGTACCCTTTTGTTGAAGGTCGTTCGAGCTTTTCCTGGGAGTATGGCATGGGTTACTTCAGCGCCGTAGCGCCTGGTACTCGAACATTGGCTCGAGGTATTTTCTCTACCCCTTCTTACCCTGAAAAAGCAGGGGCACCTTGCGCCCTGACTCACCCTCCGTGGACGAACCTTGCGGAGGAACCCTTACGTTTTCGGGGCATTGGATTCTCACCAATGTTTGCATTACTCAAGCCGACATTCTCGCTTCCGCTTCGTCCACCACCGCTCGCGCGGGTGCTTCCCTCTAAGGCGGAACGCTCCCACAGCTTCGGCAGATTGCTTAGCCCCGTTCATCTTCGGCTCAAGAGCGCTCGATCAGTGTGCTATTACGCACTCTTTCAAGGGTGGCTGCTTCTAGGCAAACCTCCTGGCTGTCTCTGCACCCCTACCTCCTTTATCACTGAGCGACTCGCTTTCGCTACGGCTCCGGTGGGTTCCCTTAACCAAGCCACTGCCTATGAGTCGCCGGCTCATTCTTCAACAGGCACGCGGTCAGAGTCCTGGTCTCCTCCCACTGCTTGGAAGCTTACGGTTTCATGTTCTATTTCACTCCCCGATGGGGGTTCTTTTCACCCTTCCCTCACGGTACTACTTCACTATCGGTCACCCAGGAGCCTATTTGAAAGGTTGACCTATTCGGGAATCTCCGGATCTATGCTTATTTTCAACTCCCCGAAGCATTTCGTCGCTTACTACGCCCTTCCTCGTCTCTGGGTGCCTAGGATTCTTGTGGTTCCGGAGAATCCAGCTACAGGAGAACCAGGAACGGAGAGCTTTCCCCCCTTTTCCGCCCGCCTCTTTGGTCTTAAGAATGCTGGGTTCTATGGTCGGTCTGCGACCCCTGGATACCGAAGGCATCCTTGGGGTGATCTCGTAGTTCCTACGGGGTGGAGACGATGGGGTCGGTCCATGGATTTTCCTTCCTTTTGCCGCATTTCGCTCAAAGGGTTGAAGGGAGATAGTGCATCAAGCTTGTCAAACCAAAATACTCAACAAGCGTTAGCTCTCCCTGAAAAGGAGGTGATCCAGCCGCACCTTCCAGTACGGCTACCTTGTTACGACTTCACTCCAGTCACTAGCCCTGCCTTCGGCATCCCCCTCCTTGCGGTTAAGGTAACGACTTCGGGCATGGCCAGCTCCCATAGTGTGACGGACGGCGAGTTGCAGCCTACAATCCAAACTGAGGACGGGTTTTTGGAGTTAGCTCACCCTCGCGGGATCACGACCCTTTACGCTTTACGCCCAATCATTCCGGATAACGCTTGCATCCTCTGTATTACCGCGGCTGCTGGCACAGAGTTAGCTGATGCTTATTCCCCAGATACCGTCATTGCTTCTTCTCCGGGAAAAGAAGAGTCTGGGCCGTGTCTCAGTCCCAGTGTGGCTGATCATCCTCTCGGACCAGCTACTGATCATCGCCTTGGTAAGCTATTGCCTCACCAACTAGCTAATCAGACGCGAGCCCCTCCTCAGGCGGATTCCTCCTTTTGCTCCTCAGCGTACGGGGTATTAGCAGCCGTTTCCAGCTGTTGTTCCCCTCCCAAGGGCAGGTTCTTACGCGTTACTCACCCGTCCGCCACTGGAAACACCACTTCCCACAAAGCTAATTCGGAATTGTCTTTCATTCCAAGGCATAACTTGTATCCATGCGCTTCATATTCGCTTGGAGTTCGCTCCCAGAAATATAGCCATCCCCACCCCCTCACGTCAATCCCACGAGCCTCTTATCCATTCTCATTCGATCACGGCGGGGGAGCAAGTAAAAATAGAAAAACTCACATTGGGTTTAGGGATAATCAGGCTCGAACTGATGACTTCCGCCACGTCAAGGCAACACTCTACCGCTGA
Coding sequence (CDS)
ATGGCAAGCAACGAACGAGGAATGATGATAGTAGGGAGAGGAAAGTCGATTTGGAGGGTGGGCGCGAGTGAGGCAGAATGGTGGAGGGCGAAGAGCGGCAGCCATGGACGACAGACGACGGATGGCGGCGGCAGTGTAGGACTACGGAAAGACGAATGGCGAGGAGCGGTTGTGCTTGGGGAATGGAACAAGGTTCAAGTCTACCGGTCTGTTAGGATGCCTCAGCTGCATACATCACTGCACTTCCACTTGACACCTATCGTGATGATAAACGGCTCATCTCGCCGTGACCTTCTCTTGAATTCTCAAAACTTCTGTCGCTCCATCCCCGCAGGGGCAGAGAACCCATCGCTGTCTCGGCTGTGCTACCGGAGGCTCTGGGGAAGTCGGAATAGGAGAGCACTCATCTTGGGGTGGGCTTACTACTTAGATGCTTTCAGCAGTTATCCGCTCCGTACTTGGCTACCCAGCGTTTACCGTGGGCACGATAACTGGGCCAATCTCCGTCTGGCCCGAGGAAACCTTTGCACGCCTCCGTTACCTTTTGGGAGGCCTACGCCCCATAGAAACTGTCTACCTGAGACTGTCCCTTGGCTCGTAGGTCCTGACACAAGGTCACCAACTTCCTTGACCTTCCGGCACTGGGCAGGCGTCAGCCCCCATACATGGTCTTACGACTTTGCGGAGACCTGTGTTTTTGGTAAACAGTCGCCCGGGCCTGGTCACTGCGACCCCCTTTGTATTCTCTACCTACCCACCTGTGTCGGTTTCGGGTACAGGTACCCTTTTGTTGAAGGTCGTTCGAGCTTTTCCTGGGAGTATGGCATGGGTTACTTCAGCGCCGTAGCGCCTGGTACTCGAACATTGGCTCGAGGTATTTTCTCTACCCCTTCTTACCCTGAAAAAGCAGGGGCACCTTGCGCCCTGACTCACCCTCCGTGGACGAACCTTGCGGAGGAACCCTTACGTTTTCGGGGCATTGGATTCTCACCAATGTTTGCATTACTCAAGCCGACATTCTCGCTTCCGCTTCGTCCACCACCGCTCGCGCGGGTGCTTCCCTCTAAGGCGGAACGCTCCCACAGCTTCGGCAGATTGCTTAGCCCCGTTCATCTTCGGCTCAAGAGCGCTCGATCAGTGTGCTATTACGCACTCTTTCAAGGGTGGCTGCTTCTAGGCAAACCTCCTGGCTGTCTCTGCACCCCTACCTCCTTTATCACTGAGCGACTCGCTTTCGCTACGGCTCCGGTGGGTTCCCTTAACCAAGCCACTGCCTATGAGTCGCCGGCTCATTCTTCAACAGGCACGCGGTCAGAGTCCTGGTCTCCTCCCACTGCTTGGAAGCTTACGGTTTCATGTTCTATTTCACTCCCCGATGGGGGTTCTTTTCACCCTTCCCTCACGGTACTACTTCACTATCGGTCACCCAGGAGCCTATTTGAAAGGTTGACCTATTCGGGAATCTCCGGATCTATGCTTATTTTCAACTCCCCGAAGCATTTCGTCGCTTACTACGCCCTTCCTCGTCTCTGGGTGCCTAGGATTCTTGTGGTTCCGGAGAATCCAGCTACAGGAGAACCAGGAACGGAGAGCTTTCCCCCCTTTTCCGCCCGCCTCTTTGGTCTTAAGAATGCTGGGTTCTATGGTCGGTCTGCGACCCCTGGATACCGAAGGCATCCTTGGGGTGATCTCGTAGTTCCTACGGGGTGGAGACGATGGGGTCGGTCCATGGATTTTCCTTCCTTTTGCCGCATTTCGCTCAAAGGGTTGAAGGGAGATAGTGCATCAAGCTTGTCAAACCAAAATACTCAACAAGCGTTAGCTCTCCCTGAAAAGGAGGTGATCCAGCCGCACCTTCCAGTACGGCTACCTTGTTACGACTTCACTCCAGTCACTAGCCCTGCCTTCGGCATCCCCCTCCTTGCGGTTAAGGTAACGACTTCGGGCATGGCCAGCTCCCATAGTGTGACGGACGGCGAGTTGCAGCCTACAATCCAAACTGAGGACGGGTTTTTGGAGTTAGCTCACCCTCGCGGGATCACGACCCTTTACGCTTTACGCCCAATCATTCCGGATAACGCTTGCATCCTCTGTATTACCGCGGCTGCTGGCACAGAGTTAGCTGATGCTTATTCCCCAGATACCGTCATTGCTTCTTCTCCGGGAAAAGAAGAGTCTGGGCCGTGTCTCAGTCCCAGTGTGGCTGATCATCCTCTCGGACCAGCTACTGATCATCGCCTTGGTAAGCTATTGCCTCACCAACTAGCTAATCAGACGCGAGCCCCTCCTCAGGCGGATTCCTCCTTTTGCTCCTCAGCGTACGGGGTATTAGCAGCCGTTTCCAGCTGTTGTTCCCCTCCCAAGGGCAGGTTCTTACGCGTTACTCACCCGTCCGCCACTGGAAACACCACTTCCCACAAAGCTAATTCGGAATTGTCTTTCATTCCAAGGCATAACTTGTATCCATGCGCTTCATATTCGCTTGGAGTTCGCTCCCAGAAATATAGCCATCCCCACCCCCTCACGTCAATCCCACGAGCCTCTTATCCATTCTCATTCGATCACGGCGGGGGAGCAAGTAAAAATAGAAAAACTCACATTGGGTTTAGGGATAATCAGGCTCGAACTGATGACTTCCGCCACGTCAAGGCAACACTCTACCGCTGA
Protein sequence
MASNERGMMIVGRGKSIWRVGASEAEWWRAKSGSHGRQTTDGGGSVGLRKDEWRGAVVLGEWNKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLCYRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNWANLRLARGNLCTPPLPFGRPTPHRNCLPETVPWLVGPDTRSPTSLTFRHWAGVSPHTWSYDFAETCVFGKQSPGPGHCDPLCILYLPTCVGFGYRYPFVEGRSSFSWEYGMGYFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLRFRGIGFSPMFALLKPTFSLPLRPPPLARVLPSKAERSHSFGRLLSPVHLRLKSARSVCYYALFQGWLLLGKPPGCLCTPTSFITERLAFATAPVGSLNQATAYESPAHSSTGTRSESWSPPTAWKLTVSCSISLPDGGSFHPSLTVLLHYRSPRSLFERLTYSGISGSMLIFNSPKHFVAYYALPRLWVPRILVVPENPATGEPGTESFPPFSARLFGLKNAGFYGRSATPGYRRHPWGDLVVPTGWRRWGRSMDFPSFCRISLKGLKGDSASSLSNQNTQQALALPEKEVIQPHLPVRLPCYDFTPVTSPAFGIPLLAVKVTTSGMASSHSVTDGELQPTIQTEDGFLELAHPRGITTLYALRPIIPDNACILCITAAAGTELADAYSPDTVIASSPGKEESGPCLSPSVADHPLGPATDHRLGKLLPHQLANQTRAPPQADSSFCSSAYGVLAAVSSCCSPPKGRFLRVTHPSATGNTTSHKANSELSFIPRHNLYPCASYSLGVRSQKYSHPHPLTSIPRASYPFSFDHGGGASKNRKTHIGFRDNQARTDDFRHVKATLYR
Homology
BLAST of Moc06g30340 vs. NCBI nr
Match:
KAG4154841.1 (hypothetical protein ERO13_D03G074976v2 [Gossypium hirsutum])
HSP 1 Score: 587.4 bits (1513), Expect = 2.0e-163
Identity = 370/753 (49.14%), Postives = 399/753 (52.99%), Query Frame = 0
Query: 63 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 122
NKVQVYRS+RMPQLHTSLHFHLTPI+MINGSS RDLLLNSQNFC SIPAG ENP SRLC
Sbjct: 17 NKVQVYRSIRMPQLHTSLHFHLTPIIMINGSSHRDLLLNSQNFCHSIPAGIENPLPSRLC 76
Query: 123 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNWANLRLARGNLCTPPLPF 182
YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLP+VYRGHDNW RGNLCTPPLPF
Sbjct: 77 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPNVYRGHDNW----YTRGNLCTPPLPF 136
Query: 183 GRPTPHRNCLPETVPWLVG-----PDTRSPTSLTFRHWAGVSP--HTWSYDFAETCVFGK 242
GRPTPHRNCLPETVPW V D SL+ R P T A FG+
Sbjct: 137 GRPTPHRNCLPETVPWPVRVVRIFTDMSISPSLSPRQCPDRYPLCGTVIVTAAVHRGFGR 196
Query: 243 -----QSPGPGHCDPLC--------------------------ILYLPTCVGFGYRYPFV 302
QSPGPGHCDPLC ILYLPTCVGFGYRYPFV
Sbjct: 197 RLPCHQSPGPGHCDPLCEEAPLLPKLRGYFAEFLRESCLVPLGILYLPTCVGFGYRYPFV 256
Query: 303 EGRSSFSWEYGMGYFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLRF 362
EGRSSFSWEY MGYFSAVAPG +PS E PC T
Sbjct: 257 EGRSSFSWEYSMGYFSAVAPGP--------DSPSVDE----PCGGT-------------- 316
Query: 363 RGIGFSPMFALLKPTFSLPLRPPPLARVLPSKAERSHSFGRLLSPVHLRLKSARSVCYYA 422
+GFS + L + A +L S + +P H
Sbjct: 317 --LGFSGHWILTNVCVT-------QADILASASS---------TPAH------------- 376
Query: 423 LFQGWLLLGKPPGCLCTPTSFITER----------------------------------- 482
GWLLLGKPPGCLCTPTSFITER
Sbjct: 377 --AGWLLLGKPPGCLCTPTSFITERSFRGLSWSYLVFRVCLDLVPLSRPAPKQCFTPRCP 436
Query: 483 -------------LAFATAPVGSL---------NQATAYESPA--------------HSS 542
LA ++ + L +Q+ +SP HS
Sbjct: 437 VNCCASTHFGENQLALGSSGISPLTTTHPLILQHQSARGQSPGLLPLLGSLRFHVLFHSP 496
Query: 543 TGTRSESWSPPTAWKLTVSCSISLPDGGSFHPSLTVLLHYRSP----------------- 602
TG ++ P+ W L + +P + LL +RSP
Sbjct: 497 TGV---LFTLPSRWSLLIHTGFHVPHA----TRVRALLPFRSPLLRESLLLSFPPATKMF 556
Query: 603 ------------RSLFERLTYSGISGSMLIFNSPKHFVAYYALPRLWVPRILVVPENPAT 660
+ FERLTYSGISGS LIFNSPKHFVAYYALPRLWVP V + T
Sbjct: 557 QFARLSLVCPWIQQQFERLTYSGISGSTLIFNSPKHFVAYYALPRLWVPSSRVGDKRTRT 616
BLAST of Moc06g30340 vs. NCBI nr
Match:
KAG4211951.1 (hypothetical protein ERO13_A02G134101v2 [Gossypium hirsutum])
HSP 1 Score: 500.4 bits (1287), Expect = 3.3e-137
Identity = 342/733 (46.66%), Postives = 377/733 (51.43%), Query Frame = 0
Query: 63 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 122
NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFC SIPAG ENP SRLC
Sbjct: 17 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCHSIPAGTENPLPSRLC 76
Query: 123 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNWANLRLARGNLCTPPLPF 182
YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNW RG + P+
Sbjct: 77 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNW----YTRG--ASFPVRV 136
Query: 183 GRPTPHRNCLPETVPWLVGPDTRSPTSLTFRHWAGVSPHTWSYDFAETCV--------FG 242
R + P P + P FR AG + + + T + F
Sbjct: 137 VRIFTDMSISPSLSP------RQCPDRYAFR--AGRNLPDKEFRYLRTVIVTAAVHRGFS 196
Query: 243 K-----QSPGPGHCDPLC--------------------------ILYLPTCVGFGYRYPF 302
+ QSPGPGHCDPLC ILYLPTCVGFGYRYPF
Sbjct: 197 RRLPCHQSPGPGHCDPLCEEAPLLPKLRGYFAEFLRESCLAPLGILYLPTCVGFGYRYPF 256
Query: 303 VEGRSSFSWEYGMGYFSAVAPGTRTLARGIFSTPSYPEKAGAPCALT-HPPWTNLAEE-- 362
VEGRSSFSWEY MGYFSAVAPG +PS E G + H TN+
Sbjct: 257 VEGRSSFSWEYSMGYFSAVAPGP--------DSPSVDEPCGGTLGFSGHWILTNVCANLL 316
Query: 363 -------PLRFRG-----IGFSPMFAL----LKPTFSLPLRPP----------------P 422
PL G G F L P+ P P P
Sbjct: 317 AVSAPLPPLSLSGHLGALAGDPGCFPLDDEAYPPSSHWPTSTPVILRSYLVFRVCLDLVP 376
Query: 423 LARVLPSKAE---------RSHSFGR--------------LLSPVHLRLKSAR------- 482
L+R P + S FG P+ L+ +SAR
Sbjct: 377 LSRPAPKQCFTPRCPVNCCASTHFGENQLALGSSGISPLTTTHPLILQHQSARGQSPGLL 436
Query: 483 ----SVCYYALFQGWLLLGKPPGCLCTPTSFITERLAFATAPVGSLN---QATAYESPAH 542
S+ ++ LF P G L T + R FA G + ++ + H
Sbjct: 437 PLLGSLRFHVLFH------SPTGVLFT----LPSRYYFAIGHPGVFSLARRSLLIHTGFH 496
Query: 543 SSTGTRSESWSPPTAWKLTVSCSISLPDGGSFHPSLTVLLHYRSPRSLFERLTYSGISGS 602
TR + P + L S +S P + L + FERLTYSGISGS
Sbjct: 497 VPHATRVRALLPFHSPLLRESLLLSFPPATKMFQFARLSLVCPWIQQQFERLTYSGISGS 556
Query: 603 MLIFNSPKHFVAYYALPRLWVP--------------RILVVPENPATGEPGTESFPPF-- 660
LIFNSPKHFVAYYALPRLWVP R V+ NP + S +
Sbjct: 557 TLIFNSPKHFVAYYALPRLWVPSSRVGDKRTRTADIRHRVLSWNPILTKDSCGSRGSYYR 616
BLAST of Moc06g30340 vs. NCBI nr
Match:
KAG6516207.1 (hypothetical protein ZIOFF_026657 [Zingiber officinale])
HSP 1 Score: 449.5 bits (1155), Expect = 6.7e-122
Identity = 258/462 (55.84%), Postives = 270/462 (58.44%), Query Frame = 0
Query: 112 GAENPSLSRLCYRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNWANLRLA 171
G ENPSLSRLCYR LWGSRN+RALIL WA YLDAFSSYPLRTWLPSVYRGHDNW
Sbjct: 264 GTENPSLSRLCYRGLWGSRNKRALILRWACYLDAFSSYPLRTWLPSVYRGHDNW------ 323
Query: 172 RGNLCTPPLPFGRPTPHRNCLPETVPWLVGPDTRSPTSLTFRHWAGVSPHTWSYDFAETC 231
+ GVSPHTWSYDFAETC
Sbjct: 324 ------------------------------------------YTRGVSPHTWSYDFAETC 383
Query: 232 VFGKQSPGPGHCDPLC--------------------------ILYLPTCVGFGYRYPFVE 291
VFGKQSPGPGHCDPLC ILYLPT VGFGYRYPFVE
Sbjct: 384 VFGKQSPGPGHCDPLCEEAPLLPKLRGYFAEFLRESCLAPLGILYLPTSVGFGYRYPFVE 443
Query: 292 GRSSFSWEYGMGYFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLRFR 351
GRSSFSWEYGMGYF AVAP TRTLARGI STPSYPEKA +PCALTHPPWTNLAEEPL FR
Sbjct: 444 GRSSFSWEYGMGYFRAVAPRTRTLARGISSTPSYPEKARSPCALTHPPWTNLAEEPLGFR 503
Query: 352 GIGFSPMFALLKPTFSLPLRPPPLARVLPSKAERSHSFGRLLSPVHLRLKSARSVCYYAL 411
GIGFSPMFALLKPTFSLPLRP LARVL SKAERS
Sbjct: 504 GIGFSPMFALLKPTFSLPLRPHLLARVLRSKAERS------------------------- 563
Query: 412 FQGWLLLGKPPGCLCTPTSFITERLAFATAPVGSLNQATAYESPAHSSTGTRSESWSPPT 471
P PT+ +RLA +L+Q
Sbjct: 564 ---------PTDAFLHPTA-SADRLAPFIFSARALDQ----------------------- 596
Query: 472 AWKLTVSCSISLPDGGSFHPSLTVLLHYRSPRSLFERLTYSGISGSMLIFNSPKHFVAYY 531
L+++C P + + FERLTYSGISGSMLIFNS KHFVA Y
Sbjct: 624 ---LSLAC-----------PWI---------QQQFERLTYSGISGSMLIFNSQKHFVACY 596
Query: 532 ALPRLWVPRILVVPENPATGEPGTESFPPFSARLFGLKNAGF 548
ALPRLWVPRILVVPENPATGEPGTES PPFS RLFGLKNAGF
Sbjct: 684 ALPRLWVPRILVVPENPATGEPGTESSPPFSVRLFGLKNAGF 596
BLAST of Moc06g30340 vs. NCBI nr
Match:
KAG6467616.1 (hypothetical protein ZIOFF_074572 [Zingiber officinale] >KAG6510415.1 hypothetical protein ZIOFF_028433 [Zingiber officinale])
HSP 1 Score: 427.6 bits (1098), Expect = 2.7e-115
Identity = 335/856 (39.14%), Postives = 385/856 (44.98%), Query Frame = 0
Query: 89 MINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLCYRRLWGSRNRRALILGWAYYLDAFSS 148
MINGSSRRDL+L+SQ FCRSIPAG ENPSLSRLCYR LWGSRN+RALIL WA YLDAFSS
Sbjct: 1 MINGSSRRDLILDSQYFCRSIPAGTENPSLSRLCYRGLWGSRNKRALILRWACYLDAFSS 60
Query: 149 YPLRTWLPSVYRGHDNWANLRLARGNLCTPPLPFGRPTPHRNCLPETVPWLVGPDTRSPT 208
YPLRTWLPSVYRGHDNW + GPDT
Sbjct: 61 YPLRTWLPSVYRGHDNW--------------------------------YTRGPDT---- 120
Query: 209 SLTFRHWAGVSPHTWSYDFAETCVFGKQSPGPGHCDPLCILYLPTCVGFGYRYPFVEGRS 268
+C QS G + F RYPFVEGRS
Sbjct: 121 ---------------------SCAGKAQSQSQGTVKLHRV--------FLSRYPFVEGRS 180
Query: 269 SFSWEYGMGYFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLRFRGIG 328
SFSWEYGMGYF AVAP TRTLARGI STPSYPEKA +PCALTHPPWTNLAEEPL FRGIG
Sbjct: 181 SFSWEYGMGYFRAVAPRTRTLARGISSTPSYPEKARSPCALTHPPWTNLAEEPLGFRGIG 240
Query: 329 FSPMFALLKPTFSLPLRPPPLARV--------LPSKAERSHSFGRLLSPVHLRLKS---- 388
FSPMFALLKPTFSLPLRP LAR LP + H P L
Sbjct: 241 FSPMFALLKPTFSLPLRPHLLARANLLAVSAPLPPLSLSGHLGALAGDPGCFPLDDEAYP 300
Query: 389 --------------ARSVCYYALFQGWLLLGKP-------PGC---LCTPTSFITERLAF 448
S + + + L +P P C C T F +LA
Sbjct: 301 PSSHWPTLIPLIFFGGSYLVFRVCLDLVPLSRPAPKQCFTPRCPVNCCASTHFGENQLAL 360
Query: 449 ATAPVGSL---------NQATAYESPAHSSTGTRSESWSPPTAWK-----------LTVS 508
++ + L +Q+ A P+ R SP T + L S
Sbjct: 361 GSSGISPLTTTHPLILQHQSGAAFIPS-LRLAARRLYCSPTTPFSRFRLLPFRSPLLRES 420
Query: 509 CSISLPDGGSFHPSLTVLLHYRSPRSLFERLTYSGISGSMLIFNSPKHFVAYYALPRLWV 568
+S P + L + FERLTYSGISGSMLIFNSPKHFVA YALPRLWV
Sbjct: 421 LLLSFPLATKMFQFARLSLACPWIQQQFERLTYSGISGSMLIFNSPKHFVACYALPRLWV 480
Query: 569 PRILVVPENPATGEPGTESFPPFSARLFGLKNAGFYGRSATPGYRRHPWGDLVVPTGWRR 628
Sbjct: 481 ------------------------------------------------------------ 540
Query: 629 WGRSMDFPSFCRISLKGLKGDSASSLSNQNTQQALALPEKEVIQPHLPVRLPCYDFTPVT 688
PSF R+S+ S+S Q +
Sbjct: 541 -------PSF-RLSV---------SVSAQQS----------------------------- 600
Query: 689 SPAFGIPLLAVKVTTSGMASSHSVTDGELQPTIQTEDGFLELAHPRGITTLYALRPIIPD 748
AF + +L S + + H T L P +ALRPIIPD
Sbjct: 601 --AFAVGVL------SDLYAFHRSTGNSLCP-------------------YHALRPIIPD 608
Query: 749 NACILCITAAAGTELADAYSPDTVIASSPGKEESGPCLSPSVADHPLGPATDHRLGKLLP 808
NACILC+TAAAGTELADAYS DTVIASSP KE P
Sbjct: 661 NACILCLTAAAGTELADAYSSDTVIASSPRKEVHDPW----------------------- 608
Query: 809 HQLANQTRAPPQADSSFCSSAYGVLAAVSSCCSPPKGRFLRVTHPSATGNTTSHKANSEL 868
+F A + A + C G+F P+A + + + +
Sbjct: 721 ---------------AFYLHAALLRQAFAHC-----GKF-----PTAASRRSLGRVSVPV 608
Query: 869 SFIPRHNLYPCASYSLGVRSQKYSHPHPLTSIPRASYPFSFDHGGGASKNRKTHIGFRDN 889
I + + GVRSQ+YSHP+P+TSIP+ASY FSFDHG GAS+NRKT+IGFRDN
Sbjct: 781 WLIILSDQLLIIALP-GVRSQQYSHPYPITSIPQASYAFSFDHGRGASQNRKTYIGFRDN 608
BLAST of Moc06g30340 vs. NCBI nr
Match:
TYH76817.1 (hypothetical protein ES332_D04G112800v1 [Gossypium tomentosum])
HSP 1 Score: 424.9 bits (1091), Expect = 1.8e-114
Identity = 311/726 (42.84%), Postives = 348/726 (47.93%), Query Frame = 0
Query: 63 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 122
NKVQVY+SVRMPQLHTSLHFHLTPIVMINGSS R+LLLNSQNFC SIP G ENP SRLC
Sbjct: 17 NKVQVYQSVRMPQLHTSLHFHLTPIVMINGSSHRNLLLNSQNFCHSIPTGTENPLSSRLC 76
Query: 123 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNWANLRLARGNLCTPPLPF 182
Y+RLWGSRNRRALILGWAYYLDAFSSY L TWLPSVYRGHDNW RG + P+
Sbjct: 77 YQRLWGSRNRRALILGWAYYLDAFSSYLLHTWLPSVYRGHDNW----YTRG--ASFPVRL 136
Query: 183 GRPTPHRNCLPETVPWLVGPDTRSPTSLTFRHWAGVSPHTWSYDFAETCVFGK-----QS 242
R + + P P PD H+A + T + VFG+ QS
Sbjct: 137 VRIFTNMSISPSLSPRQC-PD----------HYAFCTVTTTVHR-----VFGRRLPCHQS 196
Query: 243 PGPGHCDPLC--------------------------ILYLPTCVGFGYRYPFVEGRSSFS 302
P PGHC+PLC ILYLPTCV F YRYPF EGRSSFS
Sbjct: 197 PEPGHCEPLCEEAPLLPKLRGYFVEFLRESCLAPLGILYLPTCVSFRYRYPFFEGRSSFS 256
Query: 303 WEYGMGYFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLRFRGIGFSP 362
WEY MGYFSAVAPG +PS E +G +GFS
Sbjct: 257 WEYSMGYFSAVAPGP--------DSPSVDEPSGGT--------------------LGFSG 316
Query: 363 MFALLKPTFSLPLRPPPLARVLPSKAERSHSFGRLLSPVHLRLKSARSVCYYAL------ 422
+ L + S SFGR LSP+HL KSARS A+
Sbjct: 317 HWILTN------------VCITQVDILASTSFGRSLSPIHLWRKSARSANLLAVSTPLPP 376
Query: 423 --FQGWL-------------LLGKPPGCLC-TP---------TSFITERLAFATAPVGSL 482
G L L +P C TP T F +LA ++ + L
Sbjct: 377 LSLSGHLGALAGFRVCLDLVPLSRPTPEQCFTPRCPVNYRASTHFGENQLALGSSGISPL 436
Query: 483 ---------NQATAYESPA---------------------------HSSTGTRSESWSPP 542
+Q+ +SP H TR + P
Sbjct: 437 TTTHLLILQHQSARGQSPGLLPLLGRLCHPGVFSLARWSLLIHTGFHVPYATRVRALLPF 496
Query: 543 TAWKLTVSCSISLPDGGSFHPSLTVLLHYRSPRSLFERLTYSGISGSMLIFNSPKHFVAY 602
+ L S +S P + L + + FERLTYSGISGS LIFNSPKHFVAY
Sbjct: 497 RSPLLRESLLLSFPPATKMFQFARLSLVFPWIQQQFERLTYSGISGSKLIFNSPKHFVAY 556
Query: 603 YALPR-------------------------LWVPRILVVPENPATGEPG-----TESFPP 660
YALP W R + + G G T +
Sbjct: 557 YALPHRTNCRIGKIGYYYIALFRLSSRVGDKWT-RTADIRHRDSCGSGGSCYRKTRNGGE 616
BLAST of Moc06g30340 vs. ExPASy TrEMBL
Match:
A0A2N9FRF2 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS17704 PE=4 SV=1)
HSP 1 Score: 970.7 bits (2508), Expect = 4.2e-279
Identity = 590/1070 (55.14%), Postives = 627/1070 (58.60%), Query Frame = 0
Query: 63 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 122
NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC
Sbjct: 17 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 76
Query: 123 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDN-----------W------ 182
YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDN W
Sbjct: 77 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNCPTLGTYYSPRWRRADIE 136
Query: 183 -----------------ANLRLARGNLCTPPLPFGRPTPHRNCLPETVPW---------L 242
ANLRLARGNLCTPPLPFGRPTPHRNCLPETVPW
Sbjct: 137 VPNLPVDSSSLLPLHSRANLRLARGNLCTPPLPFGRPTPHRNCLPETVPWPVVRIFTDMS 196
Query: 243 VGPD---TRSPTSLTFRHWAGVSPHTWSYDFAETCV--------FGK-----QSPGPGHC 302
+ P + P FR AG + + + T + FG+ QSPGPGHC
Sbjct: 197 ISPSLSPRQCPDRYAFR--AGRNLPDKEFRYLRTVIVTAAVHRGFGRRLPCHQSPGPGHC 256
Query: 303 DPLC--------------------------ILYLPTCVGFGYRYPFVEGRSSFSWEYGMG 362
DPLC ILYLPTCVGFGYRYPFVEGRSSFSWEYGMG
Sbjct: 257 DPLCEEAPLLPKLRGYFAEFLRESCLAPLGILYLPTCVGFGYRYPFVEGRSSFSWEYGMG 316
Query: 363 YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLRFRGIGFSPMFALLK 422
YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPL FRGIGFSPMFALLK
Sbjct: 317 YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLGFRGIGFSPMFALLK 376
Query: 423 PTFSLPLRPPPLARVLPSKAERSHS-----------------FG-RLLSPVHLRLKSARS 482
PTFSLPLRPPPL+RVLPSKAERS + FG R L HL +
Sbjct: 377 PTFSLPLRPPPLSRVLPSKAERSPTDAFLHPTASADRLAPFIFGARALDHGHLGALAGDP 436
Query: 483 VCY-----------------------YALFQGWL---LLGKP-------PGC---LCTPT 542
C+ Y +F+ L L +P P C C T
Sbjct: 437 GCFPLDDEAYPPSSHWPTLTPVILRSYLVFRVCLDLVPLSRPAPKQCFTPRCPVNCCAST 496
Query: 543 SFITERLAFATAPVGSL---------NQATAYESPA--------------HSSTGTRSE- 602
F +LA ++ + L +Q+ +SP HS G
Sbjct: 497 HFGENQLALGSSGISPLTTTHPLILQHQSARGQSPGLLPLLGSLRFHVLFHSPMGVLFTL 556
Query: 603 ------SWSPPTAWKLTVSCSISLPDGGSFHPSLTV-----LLHYRSP------------ 662
+ P + LT + + L + T LL +RSP
Sbjct: 557 PSRYYFTIGHPGVFSLTSTPLLRLAARRLYCSPTTPFSRFRLLPFRSPLLRESLLLSFPL 616
Query: 663 -----------------RSLFERLTYSGISGSMLIFNSPKHFVAYYALPRLWVPR----- 722
+ FERLTYSGISGS LIFNSPKHFVAYYALPRLWVPR
Sbjct: 617 ATKMFQFARLSLACPWIQQQFERLTYSGISGSTLIFNSPKHFVAYYALPRLWVPRSNCRI 676
Query: 723 -------ILVVPENPATGEPGTES--------------FPPFSARLFGLKNAGFYGRSAT 782
I + + G+ T + PP SARLFGLKNAGFYGRSAT
Sbjct: 677 GKIGCYHIALYRLSSRVGDKRTRTADIRHRLQENQERKAPPLSARLFGLKNAGFYGRSAT 736
Query: 783 PGYRRHPWGDLVVPTGWRRWGRSMDFPSFCRISLKGLKGDSASSL-SNQNTQQALALPEK 806
PG RR PWGDLVVPTGWRRWGR MDFPSFCRISLKGLKGDSASSL SNQNTQQALALPEK
Sbjct: 737 PGCRRRPWGDLVVPTGWRRWGRPMDFPSFCRISLKGLKGDSASSLVSNQNTQQALALPEK 796
BLAST of Moc06g30340 vs. ExPASy TrEMBL
Match:
A0A2N9I7K2 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS26797 PE=4 SV=1)
HSP 1 Score: 816.6 bits (2108), Expect = 1.0e-232
Identity = 510/969 (52.63%), Postives = 544/969 (56.14%), Query Frame = 0
Query: 63 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 122
NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC
Sbjct: 17 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 76
Query: 123 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDN-----------W------ 182
YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDN W
Sbjct: 77 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNCPTLGTYYSPRWRRADIE 136
Query: 183 -----------------ANLRLARGNLCTPPLPFGRPTPHRNCLPETVPW---------L 242
ANLRLARGNLCTPPLPFGRPTPHRNCLPETVPW
Sbjct: 137 VPNLPVDSSSLLPLHSRANLRLARGNLCTPPLPFGRPTPHRNCLPETVPWPVVRIFTDMS 196
Query: 243 VGPD---TRSPTSLTFRHWAGVSPHTWSYDFAETCV--------FGK-----QSPGPGHC 302
+ P + P FR AG + + + T + FG+ QSPGPGHC
Sbjct: 197 ISPSLSPRQCPDRYAFR--AGRNLPDKEFRYLRTVIVTAAVHRGFGRRLPCHQSPGPGHC 256
Query: 303 DPLC--------------------------ILYLPTCVGFGYRYPFVEGRSSFSWEYGMG 362
DPLC ILYLPTCVGFGYRYPFVEGRSSFSWEYGMG
Sbjct: 257 DPLCEEAPLLPKLRGYFAEFLRESCLAPLGILYLPTCVGFGYRYPFVEGRSSFSWEYGMG 316
Query: 363 YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLRFRGIGFSPMFALLK 422
YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPL FRGIGFSPMFALLK
Sbjct: 317 YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLGFRGIGFSPMFALLK 376
Query: 423 PTFSLPLRPPPLARVLPSKAERSHS-----------------FG-RLLSPVHLRLKSARS 482
PTFSLPLRPPPL+RVLPSKAERS + FG R L HL +
Sbjct: 377 PTFSLPLRPPPLSRVLPSKAERSPTDAFLHPTASADRLAPFIFGARALDHGHLGALAGDP 436
Query: 483 VCY-----------------------YALFQGWL---LLGKP-------PGC---LCTPT 542
C+ Y +F+ L L +P P C C T
Sbjct: 437 GCFPLDDEAYPPSSHWPTLTPVILRSYLVFRVCLDLVPLSRPAPKQCFTPRCPVNCCAST 496
Query: 543 SFITERLAFATAPVGSL---------NQATAYESPA--------------HSSTGTRSE- 602
F +LA ++ + L +Q+ +SP HS G
Sbjct: 497 HFGENQLALGSSGISPLTTTHPLILQHQSARGQSPGLLPLLGSLRFHVLFHSPMGVLFTL 556
Query: 603 ------SWSPPTAWKLTVSCSISLPDGGSFHPSLTV-----LLHYRSP------------ 662
+ P + LT + + L + T LL +RSP
Sbjct: 557 PSRYYFTIGHPGVFSLTSTPLLRLAARRLYCSPTTPFSRFRLLPFRSPLLRESLLLSFPL 616
Query: 663 -----------------RSLFERLTYSGISGSMLIFNSPKHFVAYYALPRLWVPRILVVP 722
+ FERLTYSGISGS LIFNSPKHFVAYYALPRLWVPRI +
Sbjct: 617 ATKMFQFARLSLACPWIQQQFERLTYSGISGSTLIFNSPKHFVAYYALPRLWVPRIQL-- 676
Query: 723 ENPATGEPGTESFPPFSARLFGLKNAGFYGRSATPGYRRHPWGDLVVPTGWRRWGRSMDF 731
E PP SARLFGLKNAGFYGRSATPG RR PWGDLVVPTGWRRWGR MDF
Sbjct: 677 -----QENQERKAPPLSARLFGLKNAGFYGRSATPGCRRRPWGDLVVPTGWRRWGRPMDF 736
BLAST of Moc06g30340 vs. ExPASy TrEMBL
Match:
A0A2N9I5R1 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS47221 PE=4 SV=1)
HSP 1 Score: 815.8 bits (2106), Expect = 1.7e-232
Identity = 510/971 (52.52%), Postives = 544/971 (56.02%), Query Frame = 0
Query: 63 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 122
NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC
Sbjct: 17 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 76
Query: 123 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDN-----------W------ 182
YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDN W
Sbjct: 77 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNCPTLGTYYSPRWRRADIE 136
Query: 183 -----------------ANLRLARGNLCTPPLPFGRPTPHRNCLPETVPW---------L 242
ANLRLARGNLCTPPLPFGRPTPHRNCLPETVPW
Sbjct: 137 VPNLPVDSSSLLPLHSRANLRLARGNLCTPPLPFGRPTPHRNCLPETVPWPVVRIFTDMS 196
Query: 243 VGPD---TRSPTSLTFRHWAGVSPHTWSYDFAETCV----------FGK-----QSPGPG 302
+ P + P FR AG + + + T + FG+ QSPGPG
Sbjct: 197 ISPSLSPRQCPDRYAFR--AGRNLPDKEFRYLRTVIVTAARSVHRGFGRRLPCHQSPGPG 256
Query: 303 HCDPLC--------------------------ILYLPTCVGFGYRYPFVEGRSSFSWEYG 362
HCDPLC ILYLPTCVGFGYRYPFVEGRSSFSWEYG
Sbjct: 257 HCDPLCEEAPLLPKLRGYFAEFLRESCLAPLGILYLPTCVGFGYRYPFVEGRSSFSWEYG 316
Query: 363 MGYFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLRFRGIGFSPMFAL 422
MGYFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPL FRGIGFSPMFAL
Sbjct: 317 MGYFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLGFRGIGFSPMFAL 376
Query: 423 LKPTFSLPLRPPPLARVLPSKAERSHS-----------------FG-RLLSPVHLRLKSA 482
LKPTFSLPLRPPPL+RVLPSKAERS + FG R L HL +
Sbjct: 377 LKPTFSLPLRPPPLSRVLPSKAERSPTDAFLHPTASADRLAPFIFGARALDHGHLGALAG 436
Query: 483 RSVCY-----------------------YALFQGWL---LLGKP-------PGC---LCT 542
C+ Y +F+ L L +P P C C
Sbjct: 437 DPGCFPLDDEAYPPSSHWPTLTPVILRSYLVFRVCLDLVPLSRPAPKQCFTPRCPVNCCA 496
Query: 543 PTSFITERLAFATAPVGSL---------NQATAYESPA--------------HSSTGTRS 602
T F +LA ++ + L +Q+ +SP HS G
Sbjct: 497 STHFGENQLALGSSGISPLTTTHPLILQHQSARGQSPGLLPLLGSLRFHVLFHSPMGVLF 556
Query: 603 E-------SWSPPTAWKLTVSCSISLPDGGSFHPSLTV-----LLHYRSP---------- 662
+ P + LT + + L + T LL +RSP
Sbjct: 557 TLPSRYYFTIGHPGVFSLTSTPLLRLAARRLYCSPTTPFSRFRLLPFRSPLLRESLLLSF 616
Query: 663 -------------------RSLFERLTYSGISGSMLIFNSPKHFVAYYALPRLWVPRILV 722
+ FERLTYSGISGS LIFNSPKHFVAYYALPRLWVPRI +
Sbjct: 617 PLATKMFQFARLSLACPWIQQQFERLTYSGISGSTLIFNSPKHFVAYYALPRLWVPRIQL 676
Query: 723 VPENPATGEPGTESFPPFSARLFGLKNAGFYGRSATPGYRRHPWGDLVVPTGWRRWGRSM 731
E PP SARLFGLKNAGFYGRSATPG RR PWGDLVVPTGWRRWGR M
Sbjct: 677 -------QENQERKAPPLSARLFGLKNAGFYGRSATPGCRRRPWGDLVVPTGWRRWGRPM 736
BLAST of Moc06g30340 vs. ExPASy TrEMBL
Match:
A0A2N9I3B0 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS21723 PE=4 SV=1)
HSP 1 Score: 807.7 bits (2085), Expect = 4.7e-230
Identity = 511/995 (51.36%), Postives = 548/995 (55.08%), Query Frame = 0
Query: 63 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 122
NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC
Sbjct: 17 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 76
Query: 123 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDN-----------W------ 182
YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDN W
Sbjct: 77 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNCPTLGTYYSPRWRRADIE 136
Query: 183 -----------------ANLRLARGNLCTPPLPFGRPTPHRNCLPETVPW---------L 242
ANLRLARGNLCTPPLPFGRPTPHRNCLPETVPW
Sbjct: 137 VPNLPVDSSSLLPLHSRANLRLARGNLCTPPLPFGRPTPHRNCLPETVPWPVVRIFTDMS 196
Query: 243 VGPD---TRSPTSLTFRHWAGVSPHTWSYDFAETCV--------FGK-----QSPGPGHC 302
+ P + P FR AG + + + T + FG+ QSPGPGHC
Sbjct: 197 ISPSLSPRQCPDRYAFR--AGRNLPDKEFRYLRTVIVTAAVHRGFGRRLPCHQSPGPGHC 256
Query: 303 DPLC--------------------------ILYLPTCVGFGYRYPFVEGRSSFSWEYGMG 362
DPLC ILYLPTCVGFGYRYPFVEGRSSFSWEYGMG
Sbjct: 257 DPLCEEAPLLPKLRGYFAEFLRESCLAPLGILYLPTCVGFGYRYPFVEGRSSFSWEYGMG 316
Query: 363 YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLRFRGIGFSPMFALLK 422
YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPL FRGIGFSPMFALLK
Sbjct: 317 YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLGFRGIGFSPMFALLK 376
Query: 423 PTFSLPLRPPPLARVLPSKAERSHS-----------------FG-RLLSPVHLRLKSARS 482
PTFSLPLRPPPL+RVLPSKAERS + FG R L HL +
Sbjct: 377 PTFSLPLRPPPLSRVLPSKAERSPTDAFLHPTASADRLAPFIFGARALDHGHLGALAGDP 436
Query: 483 VCY-----------------------YALFQGWL---LLGKP-------PGC---LCTPT 542
C+ Y +F+ L L +P P C C T
Sbjct: 437 GCFPLDDEAYPPSSHWPTLTPVILRSYLVFRVCLDLVPLSRPAPKQCFTPRCPVNCCAST 496
Query: 543 SFITERLAFATAPVGSL---------NQATAYESPA--------------HSSTGTRSE- 602
F +LA ++ + L +Q+ +SP HS G
Sbjct: 497 HFGENQLALGSSGISPLTTTHPLILQHQSARGQSPGLLPLLGSLRFHVLFHSPMGVLFTL 556
Query: 603 ------SWSPPTAWKLTVSCSISLPDGGSFHPSLTV-----LLHYRSP------------ 662
+ P + LT + + L + T LL +RSP
Sbjct: 557 PSRYYFTIGHPGVFSLTSTPLLRLAARRLYCSPTTPFSRFRLLPFRSPLLRESLLLSFPL 616
Query: 663 -----------------RSLFERLTYSGISGSMLIFNSPKHFVAYYALPRLWVPR----- 722
+ FERLTYSGISGS LIFNSPKHFVAYYALPRLWVPR
Sbjct: 617 ATKMFQFARLSLACPWIQQQFERLTYSGISGSTLIFNSPKHFVAYYALPRLWVPRSNCRI 676
Query: 723 -------ILVVPENPATGEPGTES--------------FPPFSARLFGLKNAGFYGRSAT 731
I + + G+ T + PP SARLFGLKNAGFYGRSAT
Sbjct: 677 GKIGCYHIALYRLSSRVGDKRTRTADIRHRLQENQERKAPPLSARLFGLKNAGFYGRSAT 736
BLAST of Moc06g30340 vs. ExPASy TrEMBL
Match:
A0A2N9H1D9 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS33333 PE=4 SV=1)
HSP 1 Score: 757.3 bits (1954), Expect = 7.2e-215
Identity = 504/1087 (46.37%), Postives = 543/1087 (49.95%), Query Frame = 0
Query: 63 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 122
NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC
Sbjct: 17 NKVQVYRSVRMPQLHTSLHFHLTPIVMINGSSRRDLLLNSQNFCRSIPAGAENPSLSRLC 76
Query: 123 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDN-----------W------ 182
YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDN W
Sbjct: 77 YRRLWGSRNRRALILGWAYYLDAFSSYPLRTWLPSVYRGHDNCPTLGTYYSPRWRRADIE 136
Query: 183 -----------------ANLRLARGNLCTPPLPFGRPTPHRNCLPETVPW---------L 242
ANLRLARGNLCTPPLPFGRPTPHRNCLPETVPW
Sbjct: 137 VPNLPVDSSSLLPLHSRANLRLARGNLCTPPLPFGRPTPHRNCLPETVPWPVVRIFTDMS 196
Query: 243 VGPD---TRSPTSLTFRHWAGVSPHTWSYDFAETCV--------FGK-----QSPGPGHC 302
+ P + P FR AG + + + T + FG+ QSPGPGHC
Sbjct: 197 ISPSLSPRQCPDRYAFR--AGRNLPDKEFRYLRTVIVTAAVHRGFGRRLPCHQSPGPGHC 256
Query: 303 DPLC--------------------------ILYLPTCVGFGYRYPFVEGRSSFSWEYGMG 362
DPLC ILYLPTCVGFGYRYPFVEGRSSFSWEYGMG
Sbjct: 257 DPLCEEAPLLPKLRGYFAEFLRESCLAPLGILYLPTCVGFGYRYPFVEGRSSFSWEYGMG 316
Query: 363 YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLRFRGIGFSPMFALLK 422
YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPL FRGIGFSPMFALLK
Sbjct: 317 YFSAVAPGTRTLARGIFSTPSYPEKAGAPCALTHPPWTNLAEEPLGFRGIGFSPMFALLK 376
Query: 423 PTFSLPLRPPPLARVLPSKAERSHS-----------------FG-RLLSPVHLRLKSARS 482
PTFSLPLRPPPL+RVLPSKAERS + FG R L HL +
Sbjct: 377 PTFSLPLRPPPLSRVLPSKAERSPTDAFLHPTASADRLAPFIFGARALDHGHLGALAGDP 436
Query: 483 VCY-----------------------YALFQGWL---LLGKP-------PGC---LCTPT 542
C+ Y +F+ L L +P P C C T
Sbjct: 437 GCFPLDDEAYPPSSHWPTLTPVILRSYLVFRVCLDLVPLSRPAPKQCFTPRCPVNCCAST 496
Query: 543 SFITERLAFATAPVGSL---------NQATAYESPA--------------HSSTGTRSE- 602
F +LA ++ + L +Q+ +SP HS G
Sbjct: 497 HFGENQLALGSSGISPLTTTHPLILQHQSARGQSPGLLPLLGSLRFHVLFHSPMGVLFTL 556
Query: 603 ------SWSPPTAWKLTVSCSISLPDGGSFHPSLTV-----LLHYRSP------------ 662
+ P + LT + + L + T LL +RSP
Sbjct: 557 PSRYYFTIGHPGVFSLTSTPLLRLAARRLYCSPTTPFSRFRLLPFRSPLLRESLLLSFPL 616
Query: 663 -----------------RSLFERLTYSGISGSMLIFNSPKHFVAYYALPRLWVPR----- 722
+ FERLTYSGISGS LIFNSPKHFVAYYALPRLWVPR
Sbjct: 617 ATKMFQFARLSLACPWIQQQFERLTYSGISGSTLIFNSPKHFVAYYALPRLWVPRSNCRI 676
Query: 723 ------------------------ILVVPENPATGEPGTESFPPFSARLFGLKNAGFYGR 731
+ +PATGEPGTES PP SARLFGLKNAGFYGR
Sbjct: 677 GKIGCYHIALYRLSSRVGDKRTRTADIRHRDPATGEPGTES-PPLSARLFGLKNAGFYGR 736
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
KAG4154841.1 | 2.0e-163 | 49.14 | hypothetical protein ERO13_D03G074976v2 [Gossypium hirsutum] | [more] |
KAG4211951.1 | 3.3e-137 | 46.66 | hypothetical protein ERO13_A02G134101v2 [Gossypium hirsutum] | [more] |
KAG6516207.1 | 6.7e-122 | 55.84 | hypothetical protein ZIOFF_026657 [Zingiber officinale] | [more] |
KAG6467616.1 | 2.7e-115 | 39.14 | hypothetical protein ZIOFF_074572 [Zingiber officinale] >KAG6510415.1 hypothetic... | [more] |
TYH76817.1 | 1.8e-114 | 42.84 | hypothetical protein ES332_D04G112800v1 [Gossypium tomentosum] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A2N9FRF2 | 4.2e-279 | 55.14 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS17704 PE=4 SV=1 | [more] |
A0A2N9I7K2 | 1.0e-232 | 52.63 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS26797 PE=4 SV=1 | [more] |
A0A2N9I5R1 | 1.7e-232 | 52.52 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS47221 PE=4 SV=1 | [more] |
A0A2N9I3B0 | 4.7e-230 | 51.36 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS21723 PE=4 SV=1 | [more] |
A0A2N9H1D9 | 7.2e-215 | 46.37 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS33333 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |