Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGTTGAACACCGCATCCAATGGCTCGTTGTTAGAGAAGTCGATAAATGAGATCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCAGCTGGAATCTTTGAGTTAGACACAGTAGCTTCAATGCAAGCCCAAATGGCGGCTATGAACCAAATGTTAAAGCAGTTGACAATGGAGAAGGAAACCAAAACCGTCACTTCGGCGATACCTGAACCCTCTCCTATTTTGCAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCACTTGTATGAGAACTGTCCAGCTAATCCAGCGTCTATTTTTTATGTAGGTCAAGGTGCCCAGCAGAATTTCAACCCGTATTCGAACACTTACAACCCTGGATGGAGGCACCATCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCACAAGCACCCGCTCAACAATACAAGCAAAACTATACTCCTCCTGGTTTTCCAACTCAACCGGCGTCGCAGCCTCAACAATACAATCAGCAAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGCAACGTAAGTTTGGAGGCCATGATGAAAGAGTTCATGACAAGAACTGATGCTGCGATAAGAAGCTTGGAGATGCAAGTGGGGCAGATTGCAAATGACCAGAAATCTAGACCCCAAGGTACATTGCCTGGACACACAGAAAACCTGAAGCGAGATCGTGAGGGAAAGGACCACTGTAAGGCGGTTATCACAAGAAGCGGATTAAGTTATGAAGGACCCTCACTTCCAGACGAAGGAACTCATGTAGTTACACCTATTCCTGCATCCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACCTGTAAGTTCAGAAGAAAAAGATAAGAAGGCGGATAAAGGTAAGCAAGTAGTGCCCAACACTACTCCACAGGTAGGTAATATTAAGATTCCTCCACCTTTCCCCCAAAGGCTAGTTAAAAAGAATCAAGATGGTCATTTTAAGAAGTTCTTTGAGATTCTAAAACAGTTGCATATCAATATACGTCTCATAGATGCTTTAGAACAAATGCCTAACTACACTAAGTTTTTAAAGGACATCATATCTAGGCGTAAGAAGTTAGGTGAGCATGAGACGGTAGCCTTGACAAAGTGTAGTAGTGATGCTCTAGGGAATCCATTGCCTGTTAAATGTAACGACCCAGGTAGTTTTACCATCCCCTGCTCAATAGGAGGTAAGAACCTAGGAAGAGCATTGTGTGACTTAGGAGCAAGCATTAATCTTATGCCTCTTTCTGTCTTTAAAGAGTTAAATATAGGTGAAGCTCGTCCCACTACTGTCACTTTACAACTAGCTGATAGATCCATAAAGAAACCGGAAGGAAAAATAGAAGATGTGCTTGTTAAAGTTGATAAGTTTATTTTTCCCGTCGATTTCATAATTTTGGACTGTGAAGCAGATCTTGAGGTGCCGAACATTCTTGGGAGGCCATTTTTAGCAACTGGAGATACAGTTTTCAATGTTAGGAAAGGAGAGATCACGATGAAGGTCAATGATGAACAGGTAACCTTCAATGTCCTCGATGCGATGCGTCTCCCGGATGAAGTCGAGGAGTGCTCTACAATAGGAGCAATCATGGAGGAACTCCAGCAAATGATGGTGGAAGACTTGGAAGCAGATTTGGAGGCCGTAGAAAAAGAATCCAAAATTGCGCCTGGCGCAATTTTGCCCCAACTTGAGCGTTTTGAGTTTTTGCAGCCGACAATAGCGGATTTGAAGGCCTTGCAACCTTCCATCATTGAACCTCCAGAATTGGAAAAGAAACCCCTACCTTCTCATTTAAAATATGCTTATTTGGGTTTAAACGATACTTTGCCCGTTATTATTTCTTCATGTTTGTCTAATGAACATGAATCTTTGCTTTTGCAGGTTTTATCAAAGTACAAGAGGGCTATTGGATAGACCATAGCAGACATAAGAGGAATTAGCCCTACATTTTGCATGCACAAAATCATCCTGGAGGAAGAAGCCACCAATTCTATCGAGTGCCAAAGGAGGCTTAACCCGAAAATGAAAGAGGTGGTCAAAAAGGAAATCATAAAATGGTTAGATGCTGGAGTGATCTACCCCATTTCAGATAGCAGTTGGGTTAGCCCGGTTCAATGCGTTCCCAAGAAGGGTGGTATGACGGTGATTACAAACGACGACAACGAGTTGATCCCAACTAGAACGGTGACTGGTTGGAGAGTGTGTATGGACTACCGTCGGTTAAACAAGGTAACTCGAAAAGACCATTTTCCATTACCCTTTATTGACCAAATGCTTGATCGTCTTGCAGGGAAGGAATGTTATTGTTTCCTCGATGGGTATTCGGGTTACAACCAGATTACTATAGCTCCGGAGGACTAAGAAAAGACCACGTTCACGTGTCCCTATGGGACATTTGCATTCCGGAGAATGCCGTTCGGTCTTTGTAATGCTCCCTCTACCTTCCAAAGATGCATGACCGCTATTTTTTCAGACATGATAGAAAGCACGGTAGAGATTTTTATGGATGATTTTTCTGTGTTTGGTGAGTCGTTTGATGTTTGTTTAGAGAACTTAGAGAATGTTTTAAAAAGGTGTGAAGAGACTAACCTTGTGCTTAATTAGGAGAAGTGTCATTTTATGGTACGTGAAGGTATAGTCTTAGGTCACAGAATTTCAGAAGAAGGAATTGAGGTTGATAAGGCTAAAATAGATGTGATTGCTAAACTACCCCCTCCTACAACTGTAAAAGGTGTTAGAAGTTTCTTAGGACACGCGGGTTTCTACCGCCGTTTTATAAAAGATTTTGCGAAAATTTATAAGTCCTTGTGTCAGTTGTTGGAAGTGGATCGACCCTTTGTCTTTGATGAACATTGTTTGAAAGCTTTTGAGACACTTAAGAAAGCACTTAGCTCAGCACCTATCATAATTGAACCAGATTGGAACTTGTCATTTGAACTAATGTGTGATGCAAGTGATTTTGTTGTAGGTGTGATGTTGGGTCAACGGAAAGGTAAAATTCTTCATCCTGTTTATTATGCAAGTAAAACTTTGAATAGTAGTCAGTTGAACTATACGGTGACTGAGAAAAAACTTTTGGCTGTTGTTTTCGCATTTGATAAGTTTCGTTCATATTTGATTGGGACAAAAGTGATAGTGTTTACCGATCATTCTGCAATTAAATATTTGTTTGCTAAAAAAGAAGCTAAGCCTAGGTTGATTAGATGGATCTTGTTGTTACAAGAGTTTGACTTAGAGATAAAAGATAGAAAAGGAACAGAAAACCAAGTAGCGGATCATCTATCTAGGCTAGAGTCTTCTTTGCAGCATGATAGTTGTGAGATTAAAGAGCATTTTGTGGATGAGTATCTTTTAGCTATTTCTGAAGTCCCTTGGTATGCTAACTATGCGAACTATCTGGTGAGCAGAATAATTCCTAATGATTTGAGTAAGCACCAAATAAAGAAGTTCTTTCATGATGTCAGGTTTTATAGATGGGATGAACCTTTCCTGTTTAAACTAGGACCTGATAATATTCTTCGTAGGTGTGTTGCTGGGCATGAACAGAATGAGAGTCTAGAGGCATGCCACAAATCACCATATGGTGGGCATTTTGTTGGACGTAAGACTGCAGCAAAGGTCCTCCAAAGTGGTTTCTTTTGGCCGTCTCTATTTCGAGATGCTCATATATTCACTCAAGGATGTGACCGCTGTCAAAGATCAGGTAACTTGACAAGGAAGCATGAGATGCCCTTAAACCCAATATTGGAAGTTGAACTTTTCGATGTTTGGGGCATTGACTTCATGGGCCCGTTCCCACCTTCATTCGGTAAGAATTACATTCTCTTGGCTGTTGATTATGTCTCGAAATGGGTTGAGGCTTTGGCCACCCCTACTAACGATGCCAAGGTAGTTTCTCTCTTCTTGCAAAAGTTTATTTTTACTAGATATGGTACACCTAGATGCTTGATTAGTGATGAGGGTACGCAATTTTTGAATAGAGCAGTAAGTAATTTGCTTAAAAAGTATAACATTCACCATAAGATTGCTACTGCATATCACCCACAAACTAATGGACTAGCAGAGGTATCAAATAGGGCAATAAAATCAATTTTAGAGAAGTCTGTGAATATAAATAGGAAGGACTTGGCTATTAAACTTGATGATGCTTTGTGGGCGTATAGAACTGTTTATAAAACTCCATTAGGCATGTCCCATATCGCATTGTTTTTTGTAAAGCATGTCATCTTCCATTAGAATTAGAACATAGAACATATTGGGCTACGCGGAAACTAAATTTTGATTATGTTGCTGCAGGTGAAGCGAGATTGCTCCAGTTACATGAGTTGGATGAATTTCGTCAGTTTTCTTATGAGAATGCCAAGATGTACAAAGAACAAACTAAGAAGTGGCATGATAAAAAGATAATGGACAAGAATTTAGAACCTGAAAACTTAGTTTTGCTTTTTAACTCTAGATTAAAGTTGTTTCCAGGTAAGCTTAAGTCTAGATGGTCAGGGTCATTCGTAGTTGAACATGTTTATCCACATGGAGCGGTGGATTTAAAGGGTTCTAAACGTAATATCTTTAAGGTAAATGGGCATCGGGTCAAGAAATATTATGGCGAGTATGTTGACAGAGGTAAAACCTCTGTAATACTGAAAGACCCATAAGAAAACAGTGGAAACAATCCCTCGGACAATTTGGGGGCGTTTTTATGTAAAATATTTTTTTTGCTTTCTTTTTACAAAATTTTTACACTTTCTTTAAGGCTTATTGGTGTAGATTCTAAATCTAATTTTTGTTTTTGCATTTGAATTTTGCAGGATCAAGATCCGACCCACGTCAAAATTGCATCAAGCGCAATTTAAGAGACAATTTCGATTTTTTTGAGTTATGGATTAGGGCTTTCGTGGGCTTTATGTGGGTTATGTTTAAATGGGCTTTGGGTTGTTTTTAAATCAAAAAGAGGCCCAATTTTAAACCCAATTATGTCTTTAATAGCTTTTAAAACGTAAAACAAAAAGCCCTAGAATTTGAAAACACCCCATTCACGATTTTTACCATAATTTTCTCACTTGTTTTCAAATCTTCTCTCAAGAAATCTCAAAGATTCTCAAAGTCTCTTTCATCTCTCTTTTTCGGCACTTTTAGGGTTTATTGGAGGGATTTTCGTTGTCGATCGTGAAGCAAACTCAAGAACCATGGAAGGTTCATCTTCCTCCAAGCCACACGACAAAGAGAAGGAAAAGAAGAGAGTGTTGTTGCCTCCACCAACCAAACCGGGTATGATTCCTCTTGAACCTCCTAGGATTTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGAAGAAAATATGAAGAAGCTATAAGAATGAACCCTAGGAGAAATCAATCCTTAGATGGTACAAATTCTGAAAAAATTAATATGGAATCTAAGGATGCTAGGGTTAATAAAGAGGGTCATAGTGAGAAGAAATTAGGAGAAGTTAATAAAGTCTATCTTCGAAAAAATCAATCTTTAGAGGAAAAAGGTGTTGTTTTAGATGAATAAATAGCTAGACTTCAAGAGAGGGCGGAGATGTTCAGTAAAAATAATGAAATTAGGGACAAAGAAAATGAAAGGGTTTATGCGAAAATTTAGGAACTAAACATTAAATGGCAAGAATTCATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGTAGGATGAATCTTTCTCAAGATAACCCCGTTTCCGAGTCTTTAGAACTGTCTATCCCTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGGTCAAGAACAGGGTAGTGGGGATTCAGAACACGACACGGAGCCCTTGGAGCATTCAGATTCAGCCACGGTCGAAATTCAATGCCAAATTACGCCTGGCGCAATTATGGATGAGACTCCACCGGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTGACTAAAAAGCCCCTAGTTTTTTATGATTTAGAACAGGAAAGGACAATGTCGAAAATTGCCGAAATTTTGGTGGCATTGAATGAAGCAAGGGGAGAGGATCCATTGAAGGATGATGGAAACAGTGGGGCAGCACAAGAACAATTGAATGTTGATAGGGAAGATGAAGATTTTGGAGAATTACCCCAAGAAGTGCATGGAGACGATTTTGAGGACAAGGAAGACAATGACGATATCTTCCAATATGAAGTGAGAGTACGAACTCCGGTGCACCAATCTCAGCAAGTTGATGAGGAGCCCCCTACAAAAGAGCAAGAAGGAACATCAGAGACTTCAGATGAGGAGGTGAGCTTGACTTCGTGAAGAAAACACAAAAGAAGAAAAAAGTGGCGGAAATTGCGCCAGGCGCAATTTCTAGGCCTAGGACCCGCGCCGCTGTAGCACGTTTGGCTGCTCAAAAAGAAGCCGAGGCCGGTCCATCAAAAAAGCCAAGAGGGCTAGGGTGCAAAGTGGGGCAAAAGAGCCACTTGAGGAGGCCAATGAAGAGGATACCGATTCTACCGAGCAGACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGAAGGCCCACTTTCACAATACGTGATATCCTCCTTGAGAGAGGTTTTGATGAGGCTCAAGAGCCGGTGCCGGAATATGTTAGGAGGAGGCTTGTGGAGAATGGTTGGGAGACATTGTTTGCCCCCATTACACGTGTATCAGAGGACTTGGTGAAAGAGTTTTACGTTGCCATCAACCCACACCAAGGGGATGTAGTGAGAGTATGGGGTAAAGTGGTAAAATTCTCGCCTTCCATTATTAATACTCATTATGGTTTGTTGGATGTTTTTAATGCCATAGGTAATGAAATTTGGGTGCATCCATCGGACGAGCAAGTGGAGGAGGCGCGTAGACATATTTGTAGACCACATAAGACATGGATTATCTCAACCACGGGGAAGCTTTCCTTAAAGCCCCTTGACATTAATGAGCAAGCGATAGTTTGGATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATTCCTCCATTAAGCGCAATAGGGCGATGATGGTGTACATTCTCATGAAGGGCGTTGAGTTCAACTTTGGGGAACTCATAAGAAATGAGATACGGAGTTGCTCGGAGAAAATGGTAGGCCCACTTGTTTTTCCTGGATTAATAACTGAGTTATGCTTACAGGCGGGAGTGGAGGCCGATGATGCCAATGTTGTGATGCCCAAGAAGCCTTTCACATCTCTAAGAAGAGTTCAGGGGTATTCCATTGTTCGAGAGGAAGATTCTCCCATTACCGCCGCGGATCCCGAGACCCGAGGGGTGGTGACTAGGGAGCAGTATGATGAGCTTAGGCACAAGTATGAGCTTCTGTTAGTTACTCAACGTGCCACATGTGCTTTCCTCAAGAAGATATACGGTGATGAAGCAGCTTCTTTCCCCGATGAGCTGACGCCGATTTACCATCTTCTTCCCGTCTTCCTACCGATTCCACCGATGATGAGTCTTCCGATGATGAATAGGGGAGTATCTCACTCTCTTTGTTTTGATATTTTGATCTTGATTTTGTATGTTAGGTTGTTTAGATTAGTTTGCATGTTAGGTTGTGTAGGTTGTGTAGGTTGTTTTTGTTCTTTTAGTTTGCACGATAGGTTGTTTAGTTTATTTTTTGTTTTAGGTTAGTGTTGGGATGTTGTGCTGTTAGGAAGGTATACATGTTATGTTTTTGCATCTGTTTAGAGTTCAAGCTTGCATGCTTCCTTGTTCTAGCTACTAGCTTGTCCTTAGCTTCTTTAATGCATCTTTTCTAA
mRNA sequence
ATGATGTTGAACACCGCATCCAATGGCTCGTTGTTAGAGAAGTCGATAAATGAGATCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCAGCTGGAATCTTTGAGTTAGACACAGTAGCTTCAATGCAAGCCCAAATGGCGGCTATGAACCAAATGTTAAAGCAGTTGACAATGGAGAAGGAAACCAAAACCGTCACTTCGGCGATACCTGAACCCTCTCCTATTTTGCAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCACTTGTATGAGAACTGTCCAGCTAATCCAGCGTCTATTTTTTATGTAGGTCAAGGTGCCCAGCAGAATTTCAACCCGTATTCGAACACTTACAACCCTGGATGGAGGCACCATCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCACAAGCACCCGCTCAACAATACAAGCAAAACTATACTCCTCCTGGTTTTCCAACTCAACCGGCGTCGCAGCCTCAACAATACAATCAGCAAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGCAACGTAAGTTTGGAGGCCATGATGAAAGAGTTCATGACAAGAACTGATGCTGCGATAAGAAGCTTGGAGATGCAAGTGGGGCAGATTGCAAATGACCAGAAATCTAGACCCCAAGGTACATTGCCTGGACACACAGAAAACCTGAAGCGAGATCGTGAGGGAAAGGACCACTGTAAGGCGGTTATCACAAGAAGCGGATTAAGTTATGAAGGACCCTCACTTCCAGACGAAGGAACTCATGTAGTTACACCTATTCCTGCATCCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACCTGTAAGTTCAGAAGAAAAAGATAAGAAGGCGGATAAAGGTAAGCAAGTAGTGCCCAACACTACTCCACAGGTAGGTAATATTAAGATTCCTCCACCTTTCCCCCAAAGGCTAGTTAAAAAGAATCAAGATGGTCATTTTAAGAAGTTCTTTGAGATTCTAAAACAGTTGCATATCAATATACGTCTCATAGATGCTTTAGAACAAATGCCTAACTACACTAAGTTTTTAAAGGACATCATATCTAGGCGTAAGAAGTTAGGTGAGCATGAGACGGTAGCCTTGACAAAGTGTAGTAGTGATGCTCTAGGGAATCCATTGCCTGTTAAATGTAACGACCCAGGTAGTTTTACCATCCCCTGCTCAATAGGAGGTAAGAACCTAGGAAGAGCATTGTGTGACTTAGGAGCAAGCATTAATCTTATGCCTCTTTCTGTCAATGATGAACAGGTAACCTTCAATGTCCTCGATGCGATGCGTCTCCCGGATGAAGTCGAGGAGTGCTCTACAATAGGAGCAATCATGGAGGAACTCCAGCAAATGATGGTGGAAGACTTGGAAGCAGATTTGGAGGCCGTAGAAAAAGAATCCAAAATTGCGCCTGGCGCAATTTTGCCCCAACTTGAGCGTTTTGAGTTTTTGCAGCCGACAATAGCGGATTTGAAGGCCTTGCAACCTTCCATCATTGAACCTCCAGAATTGGAAAAGAAACCCCTACCTTCTCATTTAAAATATGCTTATTTGGGTTTAAACGATACTTTGCCCGTTATTATTTCTTCATGTTTGTCTAATGAACATGAATCTTTGCTTTTGCAGGTAAATGGGCATCGGGTCAAGAAATATTATGGCGAGTATGTTGACAGAGGGTTTATTGGAGGGATTTTCGTTGTCGATCGTGAAGCAAACTCAAGAACCATGGAAGGTTCATCTTCCTCCAAGCCACACGACAAAGAGAAGGAAAAGAAGAGAGTGTTGTTGCCTCCACCAACCAAACCGGGTATGATTCCTCTTGAACCTCCTAGGATTTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGAAGAAAATATGAAGAAGCTATAAGAATGAACCCTAGGAGAAATCAATCCTTAGATGAATTCATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGTAGGATGAATCTTTCTCAAGATAACCCCGTTTCCGAGTCTTTAGAACTGTCTATCCCTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGGTCAAGAACAGGGTAGTGGGGATTCAGAACACGACACGGAGCCCTTGGAGCATTCAGATTCAGCCACGGTCGAAATTCAATGCCAAATTACGCCTGGCGCAATTATGGATGAGACTCCACCGGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTGACTAAAAAGCCCCTAGTTTTTTATGATTTAGAACAGGAAAGGACAATGTCGAAAATTGCCGAAATTTTGGTGGCATTGAATGAAGCAAGGGGAGAGGATCCATTGAAGGATGATGGAAACAGTGGGGCAGCACAAGAACAATTGAATGTTGATAGGGAAGATGAAGATTTTGGAGAATTACCCCAAGAAGTGCATGGAGACGATTTTGAGGACAAGGAAGACAATGACGATATCTTCCAATATGAAGTGAGAGTACGAACTCCGGTGCACCAATCTCAGCAAGTTGATGAGGAGCCCCCTACAAAAGAGCAAGAAGGAACATCAGAGACTTCAGATGAGGAGGACCCGCGCCGCTGTAGCACGTTTGGCTGCTCAAAAAGAAGCCGAGGCCGGTCCATCAAAAAAGCCAAGAGGGCTAGGGTGCAAAGTGGGGCAAAAGAGCCACTTGAGGAGGCCAATGAAGAGGATACCGATTCTACCGAGCAGACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGAAGGCCCACTTTCACAATACGTGATATCCTCCTTGAGAGAGGTTTTGATGAGGCTCAAGAGCCGGTGCCGGAATATGTTAGGAGGAGGCTTGTGGAGAATGGTTGGGAGACATTGTTTGCCCCCATTACACGTGTATCAGAGGACTTGGTGAAAGAGTTTTACGTTGCCATCAACCCACACCAAGGGGATGTAGTGAGAGTATGGGGTAAAGTGGTAAAATTCTCGCCTTCCATTATTAATACTCATTATGGTTTGTTGGATGTTTTTAATGCCATAGGTAATGAAATTTGGGTGCATCCATCGGACGAGCAAGTGGAGGAGGCGCGTAGACATATTTGTAGACCACATAAGACATGGATTATCTCAACCACGGGGAAGCTTTCCTTAAAGCCCCTTGACATTAATGAGCAAGCGATAGTTTGGATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATTCCTCCATTAAGCGCAATAGGGCGATGATGGTGTACATTCTCATGAAGGGCGTTGAGTTCAACTTTGGGGAACTCATAAGAAATGAGATACGGAGTTGCTCGGAGAAAATGGTAGGCCCACTTGTTTTTCCTGGATTAATAACTGAGTTATGCTTACAGGCGGGAGTGGAGGCCGATGATGCCAATGTTGTGATGCCCAAGAAGCCTTTCACATCTCTAAGAAGAGTTCAGGGGTATTCCATTGTTCGAGAGGAAGATTCTCCCATTACCGCCGCGGATCCCGAGACCCGAGGGGTGGTGACTAGGGAGCAGTATGATGAGCTTAGGCACAAGTATGAGCTTCTGTTAGTTACTCAACGTGCCACATGTGCTTTCCTCAAGAAGATATACGGTGATGAAGCAGCTTCTTTCCCCGATGAGCTGACGCCGATTTACCATCTTCTTCCCGTCTTCCTACCGATTCCACCGATGATGAGTCTTCCGATGATGAATAGGGGAGTATCTCACTCTCTTTGTTTTGATATTTTGATCTTGATTTTGTATGTTAGGTTAGTGTTGGGATGTTGTGCTGTTAGGAAGGTATACATGTTATGTTTTTGCATCTGTTTAGAGTTCAAGCTTGCATGCTTCCTTGTTCTAGCTACTAGCTTGTCCTTAGCTTCTTTAATGCATCTTTTCTAA
Coding sequence (CDS)
ATGATGTTGAACACCGCATCCAATGGCTCGTTGTTAGAGAAGTCGATAAATGAGATCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCAGCTGGAATCTTTGAGTTAGACACAGTAGCTTCAATGCAAGCCCAAATGGCGGCTATGAACCAAATGTTAAAGCAGTTGACAATGGAGAAGGAAACCAAAACCGTCACTTCGGCGATACCTGAACCCTCTCCTATTTTGCAAATTTCAGATATATCTTGTGTCTATTGTGGTGATAACCACTTGTATGAGAACTGTCCAGCTAATCCAGCGTCTATTTTTTATGTAGGTCAAGGTGCCCAGCAGAATTTCAACCCGTATTCGAACACTTACAACCCTGGATGGAGGCACCATCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCACAAGCACCCGCTCAACAATACAAGCAAAACTATACTCCTCCTGGTTTTCCAACTCAACCGGCGTCGCAGCCTCAACAATACAATCAGCAAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGCAACGTAAGTTTGGAGGCCATGATGAAAGAGTTCATGACAAGAACTGATGCTGCGATAAGAAGCTTGGAGATGCAAGTGGGGCAGATTGCAAATGACCAGAAATCTAGACCCCAAGGTACATTGCCTGGACACACAGAAAACCTGAAGCGAGATCGTGAGGGAAAGGACCACTGTAAGGCGGTTATCACAAGAAGCGGATTAAGTTATGAAGGACCCTCACTTCCAGACGAAGGAACTCATGTAGTTACACCTATTCCTGCATCCACCTCCAATCCACAACAAGAAGAGAAAGCAGAACCTGTAAGTTCAGAAGAAAAAGATAAGAAGGCGGATAAAGGTAAGCAAGTAGTGCCCAACACTACTCCACAGGTAGGTAATATTAAGATTCCTCCACCTTTCCCCCAAAGGCTAGTTAAAAAGAATCAAGATGGTCATTTTAAGAAGTTCTTTGAGATTCTAAAACAGTTGCATATCAATATACGTCTCATAGATGCTTTAGAACAAATGCCTAACTACACTAAGTTTTTAAAGGACATCATATCTAGGCGTAAGAAGTTAGGTGAGCATGAGACGGTAGCCTTGACAAAGTGTAGTAGTGATGCTCTAGGGAATCCATTGCCTGTTAAATGTAACGACCCAGGTAGTTTTACCATCCCCTGCTCAATAGGAGGTAAGAACCTAGGAAGAGCATTGTGTGACTTAGGAGCAAGCATTAATCTTATGCCTCTTTCTGTCAATGATGAACAGGTAACCTTCAATGTCCTCGATGCGATGCGTCTCCCGGATGAAGTCGAGGAGTGCTCTACAATAGGAGCAATCATGGAGGAACTCCAGCAAATGATGGTGGAAGACTTGGAAGCAGATTTGGAGGCCGTAGAAAAAGAATCCAAAATTGCGCCTGGCGCAATTTTGCCCCAACTTGAGCGTTTTGAGTTTTTGCAGCCGACAATAGCGGATTTGAAGGCCTTGCAACCTTCCATCATTGAACCTCCAGAATTGGAAAAGAAACCCCTACCTTCTCATTTAAAATATGCTTATTTGGGTTTAAACGATACTTTGCCCGTTATTATTTCTTCATGTTTGTCTAATGAACATGAATCTTTGCTTTTGCAGGTAAATGGGCATCGGGTCAAGAAATATTATGGCGAGTATGTTGACAGAGGGTTTATTGGAGGGATTTTCGTTGTCGATCGTGAAGCAAACTCAAGAACCATGGAAGGTTCATCTTCCTCCAAGCCACACGACAAAGAGAAGGAAAAGAAGAGAGTGTTGTTGCCTCCACCAACCAAACCGGGTATGATTCCTCTTGAACCTCCTAGGATTTCTCATGAAAAGTTAGTTTTTGATCCTAGGGAACAAAGAAGAAAATATGAAGAAGCTATAAGAATGAACCCTAGGAGAAATCAATCCTTAGATGAATTCATGGAAAACTCAAAGAAAGTGAGTGAGGAGATTCAACTTGAGTTAAATAGCATGAGTATACGTCGTAGGATGAATCTTTCTCAAGATAACCCCGTTTCCGAGTCTTTAGAACTGTCTATCCCTCCCCCTCTTTCCACTACTGTTGCTGTGCATGTTGAAGGTCAAGAACAGGGTAGTGGGGATTCAGAACACGACACGGAGCCCTTGGAGCATTCAGATTCAGCCACGGTCGAAATTCAATGCCAAATTACGCCTGGCGCAATTATGGATGAGACTCCACCGGCCACTCTACAAGGTATTTTGTCTCCATCTTTTCCAGATCCTATCTTGACTAAAAAGCCCCTAGTTTTTTATGATTTAGAACAGGAAAGGACAATGTCGAAAATTGCCGAAATTTTGGTGGCATTGAATGAAGCAAGGGGAGAGGATCCATTGAAGGATGATGGAAACAGTGGGGCAGCACAAGAACAATTGAATGTTGATAGGGAAGATGAAGATTTTGGAGAATTACCCCAAGAAGTGCATGGAGACGATTTTGAGGACAAGGAAGACAATGACGATATCTTCCAATATGAAGTGAGAGTACGAACTCCGGTGCACCAATCTCAGCAAGTTGATGAGGAGCCCCCTACAAAAGAGCAAGAAGGAACATCAGAGACTTCAGATGAGGAGGACCCGCGCCGCTGTAGCACGTTTGGCTGCTCAAAAAGAAGCCGAGGCCGGTCCATCAAAAAAGCCAAGAGGGCTAGGGTGCAAAGTGGGGCAAAAGAGCCACTTGAGGAGGCCAATGAAGAGGATACCGATTCTACCGAGCAGACACCATCAAGAGTAAAAAGGGTGAGATTGGAGGTGAGAAGGCCCACTTTCACAATACGTGATATCCTCCTTGAGAGAGGTTTTGATGAGGCTCAAGAGCCGGTGCCGGAATATGTTAGGAGGAGGCTTGTGGAGAATGGTTGGGAGACATTGTTTGCCCCCATTACACGTGTATCAGAGGACTTGGTGAAAGAGTTTTACGTTGCCATCAACCCACACCAAGGGGATGTAGTGAGAGTATGGGGTAAAGTGGTAAAATTCTCGCCTTCCATTATTAATACTCATTATGGTTTGTTGGATGTTTTTAATGCCATAGGTAATGAAATTTGGGTGCATCCATCGGACGAGCAAGTGGAGGAGGCGCGTAGACATATTTGTAGACCACATAAGACATGGATTATCTCAACCACGGGGAAGCTTTCCTTAAAGCCCCTTGACATTAATGAGCAAGCGATAGTTTGGATGTATGTGGTGAAGAACCGGTTGATACCCACTTCTCACGATTCCTCCATTAAGCGCAATAGGGCGATGATGGTGTACATTCTCATGAAGGGCGTTGAGTTCAACTTTGGGGAACTCATAAGAAATGAGATACGGAGTTGCTCGGAGAAAATGGTAGGCCCACTTGTTTTTCCTGGATTAATAACTGAGTTATGCTTACAGGCGGGAGTGGAGGCCGATGATGCCAATGTTGTGATGCCCAAGAAGCCTTTCACATCTCTAAGAAGAGTTCAGGGGTATTCCATTGTTCGAGAGGAAGATTCTCCCATTACCGCCGCGGATCCCGAGACCCGAGGGGTGGTGACTAGGGAGCAGTATGATGAGCTTAGGCACAAGTATGAGCTTCTGTTAGTTACTCAACGTGCCACATGTGCTTTCCTCAAGAAGATATACGGTGATGAAGCAGCTTCTTTCCCCGATGAGCTGACGCCGATTTACCATCTTCTTCCCGTCTTCCTACCGATTCCACCGATGATGAGTCTTCCGATGATGAATAGGGGAGTATCTCACTCTCTTTGTTTTGATATTTTGATCTTGATTTTGTATGTTAGGTTAGTGTTGGGATGTTGTGCTGTTAGGAAGGTATACATGTTATGTTTTTGCATCTGTTTAGAGTTCAAGCTTGCATGCTTCCTTGTTCTAGCTACTAGCTTGTCCTTAGCTTCTTTAATGCATCTTTTCTAA
Protein sequence
MMLNTASNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDTVASMQAQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQGAQQNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPAQQYKQNYTPPGFPTQPASQPQQYNQQRGQNTTQQSGSNVSLEAMMKEFMTRTDAAIRSLEMQVGQIANDQKSRPQGTLPGHTENLKRDREGKDHCKAVITRSGLSYEGPSLPDEGTHVVTPIPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTTPQVGNIKIPPPFPQRLVKKNQDGHFKKFFEILKQLHINIRLIDALEQMPNYTKFLKDIISRRKKLGEHETVALTKCSSDALGNPLPVKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLSVNDEQVTFNVLDAMRLPDEVEECSTIGAIMEELQQMMVEDLEADLEAVEKESKIAPGAILPQLERFEFLQPTIADLKALQPSIIEPPELEKKPLPSHLKYAYLGLNDTLPVIISSCLSNEHESLLLQVNGHRVKKYYGEYVDRGFIGGIFVVDREANSRTMEGSSSSKPHDKEKEKKRVLLPPPTKPGMIPLEPPRISHEKLVFDPREQRRKYEEAIRMNPRRNQSLDEFMENSKKVSEEIQLELNSMSIRRRMNLSQDNPVSESLELSIPPPLSTTVAVHVEGQEQGSGDSEHDTEPLEHSDSATVEIQCQITPGAIMDETPPATLQGILSPSFPDPILTKKPLVFYDLEQERTMSKIAEILVALNEARGEDPLKDDGNSGAAQEQLNVDREDEDFGELPQEVHGDDFEDKEDNDDIFQYEVRVRTPVHQSQQVDEEPPTKEQEGTSETSDEEDPRRCSTFGCSKRSRGRSIKKAKRARVQSGAKEPLEEANEEDTDSTEQTPSRVKRVRLEVRRPTFTIRDILLERGFDEAQEPVPEYVRRRLVENGWETLFAPITRVSEDLVKEFYVAINPHQGDVVRVWGKVVKFSPSIINTHYGLLDVFNAIGNEIWVHPSDEQVEEARRHICRPHKTWIISTTGKLSLKPLDINEQAIVWMYVVKNRLIPTSHDSSIKRNRAMMVYILMKGVEFNFGELIRNEIRSCSEKMVGPLVFPGLITELCLQAGVEADDANVVMPKKPFTSLRRVQGYSIVREEDSPITAADPETRGVVTREQYDELRHKYELLLVTQRATCAFLKKIYGDEAASFPDELTPIYHLLPVFLPIPPMMSLPMMNRGVSHSLCFDILILILYVRLVLGCCAVRKVYMLCFCICLEFKLACFLVLATSLSLASLMHLF
Homology
BLAST of Moc08g31580 vs. NCBI nr
Match:
XP_022154847.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia])
HSP 1 Score: 612.5 bits (1578), Expect = 8.9e-171
Identity = 535/1360 (39.34%), Postives = 685/1360 (50.37%), Query Frame = 0
Query: 1 MMLNTASNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDTVASMQAQ 60
MMLNTA+NGSLLEKS+NEIVDILNKM DINDQGE GRSL KKQVSAGIFELDTVA MQAQ
Sbjct: 131 MMLNTAANGSLLEKSVNEIVDILNKMIDINDQGERGRSLSKKQVSAGIFELDTVALMQAQ 190
Query: 61 MAAMNQMLKQLTMEKETKTVTSA---IPEPSPILQISDISCVYCGDNHLYENCPANPASI 120
MAAMNQMLKQ TMEKETKTVTS IP + Q+ + + + D ++
Sbjct: 191 MAAMNQMLKQFTMEKETKTVTSLHINIPLIDALEQMPNYT-KFLKDIISRRKKLGEHETV 250
Query: 121 FYVGQGAQQNFNPYSNTYNPGWRH---HPNFSWSNQGV----ASSSAQAPAQQYKQ---- 180
N NP + + W + GV + SS P Q +
Sbjct: 251 ALTKCSRATNSIECQKRLNPKMKEVVKNEIIKWLDAGVIYPISGSSWVNPVQCVPKKGGM 310
Query: 181 ----NYTPPGFPTQPASQPQQYNQQRGQN--TTQQSGSNVSLEAMMKEFMTRTDAAIRSL 240
N PT + + R N T + ++ M+ +
Sbjct: 311 TVITNDDNKLIPTGTVTSWRVCMDCRRLNKVTRKYHFPLPFIDQMLDRLAGKECYCFLDG 370
Query: 241 EMQVGQ--IANDQKSRPQGTLPGHTENLKRDREGKDHCKAVITRSGLSYEGPSLP----- 300
Q IA + + T P T +R G + A R ++ L
Sbjct: 371 YSGYNQITIAPKDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMIAIFSDMLESTVEI 430
Query: 301 --DEGTHVVTPIPASTSNPQQE----EKAEPVSSEEK------------DKKADKGKQVV 360
D+ + N + E+ V + EK + ++KG +V
Sbjct: 431 FMDDFSVFGESFDVCLENLENVLKRCEETNLVLNWEKXHFMVGEGTVLGHRISEKGIEV- 490
Query: 361 PNTTPQVGNI-KIPPPFPQRLVKK--NQDGHFKKFFEILKQLHINI-RLIDALEQMPNYT 420
T ++ I K+P P + V+ G +++F + ++ + +L++ + Y
Sbjct: 491 --DTAKIDVIAKLPRPITVKGVRSFLGHAGFYRRFIKDFAKISKPLCQLLEVDQPFVFYE 550
Query: 421 KFLKDIISRRKKLGEHETVALTKCSSDALGNPLPVKCNDPGSFTIPCSIGGKNLGRALCD 480
LK + +K L + P + C D F + +G + G+ L
Sbjct: 551 HCLKAFETLKKALSSAPIII-----EXXWNXPXELMC-DASDFAVGVMLGQRK-GKILHP 610
Query: 481 LGASINLMPLSVNDEQVTFNVLDAMRLPDEVEECSTIGA---------------IMEELQ 540
+ + + S + VT L A+ + IG +E +
Sbjct: 611 IYYASKTLNSSQLNYTVTEKELLAVVFAFDKFRSYLIGTKVIVFTDHSALKYLFAKKEAK 670
Query: 541 QMMVEDL----EADLEAVEKESKIAPGAILPQLERFE-FLQPTIADLKALQPSIIEPPEL 600
++ + E DLE K+ K + L R E LQ ++K ++ L
Sbjct: 671 PRLIRWILLLQEFDLEI--KDRKGTXNQVXDHLSRLESSLQHDSCEIK---EHFVDEYLL 730
Query: 601 EKKPLPSHLKYAYLGLNDTLPVIISSCLSNEHESLLLQVNGHRVKKYYGEY----VDRGF 660
+P + YA ++S + N+ ++ H++KK++ + D F
Sbjct: 731 AISEVPWYADYA--------NYLVSKIIPND-------LSKHQIKKFFHDVRFYKWDEPF 790
Query: 661 IGGI--------FVVDREANSRTMEGSSSSKP-HDKEKEKKRVLLPPPTKPGMIPLEPPR 720
+ + V + E N + + K +K R K I L+
Sbjct: 791 LFRLGPDNILCRCVAEHETNGLAEVSNRAIKSILEKSVNVNR-------KDWAIKLDDAL 850
Query: 721 ISHEKLVFDPREQRRKYEEAIRMNPRRNQSLDEFMENSKKVSEEIQLELNSMSIRRRMNL 780
++ R Y+ + M+P R + K ++LE + R++N
Sbjct: 851 WAY----------RTAYKTPLGMSPYR-------IVFGKACHLPLELEHRAYWATRKLNF 910
Query: 781 ----SQDNPVSESLELSIPPPLSTTVAVHVEGQEQGSGDSEHDTEPLEHSDSATV-EIQC 840
+ + + + EL S A Q + D + + LE + + +
Sbjct: 911 DYVAAGEARLLQLHELDEFRQFSYENAKMYNEQTKKWLDKKIMDKNLEPENLVLLFNSRL 970
Query: 841 QITPGAIMDETPPATLQGILSPSFPDPILTKKPLVFYDLEQERTMSKIAEILVALNEARG 900
++ PG + + + + P + + K +F + R E +
Sbjct: 971 KLFPGKLKSRWSGSFVVEHVYPHGAVDLKSSKRNIF-KVNGHRVKKYYGEYI-------- 1030
Query: 901 EDPLKDDGNSGAAQEQLNVDREDEDFGELPQEVHGDDFEDKEDNDDIFQYEVRVRTPVHQ 960
+EQLNVDREDEDFGELPQEVHGD+FED+EDNDDI QYEV+VRTPVH+
Sbjct: 1031 ------------DREQLNVDREDEDFGELPQEVHGDEFEDEEDNDDISQYEVKVRTPVHE 1090
Query: 961 SQQVDEEPPTKEQEGTSETSD-----EEDPRRCSTFGCSKRSRGRSI------------- 1020
SQQVDEEPPTKEQEGTS D E+ S+ G R R R+
Sbjct: 1091 SQQVDEEPPTKEQEGTSGPVDVPSEAMEESSSSSSQGAVSRPRTRTAVARLAAQKEAEAG 1150
Query: 1021 --KKAKRARVQSGAKEPLEEANEEDTDSTEQTPSRVKRVRLEVRRPTFTIRDILLERGFD 1080
KKAK ARVQ A+EPLEEANEE+ DSTEQTPSRVKRVRLEVRRPTFT RDILLERGFD
Sbjct: 1151 PSKKAKTARVQRVAEEPLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFD 1210
Query: 1081 EAQEPVPEYVRRRLVENGWETLFAPITRVSEDLVKEFYVAINPHQGDVVRVWGKVVKFSP 1140
EAQEPVPEYVR+R+VENGWETLFAPITRVSE LVKEFY AINP++GD VRV
Sbjct: 1211 EAQEPVPEYVRKRIVENGWETLFAPITRVSEALVKEFYTAINPNRGDEVRV--------- 1270
Query: 1141 SIINTHYGLLDVFNAIGNEIWVHPSDEQVEEARRHICRPHKTWIISTTGKLSLKPLDINE 1200
GNEI VHPSDEQVEEARR ICRPHKTW IST GKLSLKPLDINE
Sbjct: 1271 ---------------RGNEILVHPSDEQVEEARRLICRPHKTWTISTMGKLSLKPLDINE 1330
Query: 1201 QAIVWMYVVKNRLIPTSHDSSIKRNRAMMVYILMKGVEFNFGELIRNEIRSCSEKMVGPL 1254
QA VWMYVVKNRLIPTS+DSSIKRNRAM+VYIL+KGVEFNFGELIRNEI+SCSEK+
Sbjct: 1331 QATVWMYVVKNRLIPTSYDSSIKRNRAMIVYILVKGVEFNFGELIRNEIQSCSEKL---- 1374
BLAST of Moc08g31580 vs. NCBI nr
Match:
XP_022159235.1 (uncharacterized protein LOC111025653 [Momordica charantia])
HSP 1 Score: 443.0 bits (1138), Expect = 9.4e-120
Identity = 277/652 (42.48%), Postives = 356/652 (54.60%), Query Frame = 0
Query: 1 MMLNTASNGSLLEKSINEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGIFELDTVASMQ 60
MMLN A+NG KS NEIV+IL+++++ N Q E R+ K+ AG+ LD + SMQ
Sbjct: 194 MMLNGAANGKFTSKSFNEIVEILDQLSEHNYQWCSEKSRTQSKRADPAGVLALDNMTSMQ 253
Query: 61 AQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIF 120
Q+ + QMLK + A PSP+ QI++ +C YCGD H ENCP+NP+S++
Sbjct: 254 KQIDTITQMLKNMEKNNAXAASAXATTNPSPVYQIAESTCYYCGDLHPSENCPSNPSSMY 313
Query: 121 YVGQGAQQNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPAQQYKQNYTPPGFPTQPA 180
YVGQ QQ FNPYSNTYNPGW+ HPNFSWS QG SS+ QQYK+ YTPPGFP PA
Sbjct: 314 YVGQMNQQKFNPYSNTYNPGWKQHPNFSWSGQG--SSNTTGHNQQYKEAYTPPGFPNSPA 373
Query: 181 --SQPQQYNQQRGQ-NTTQQSGSNVSL---------EAMMKEFMTRT------------- 240
P QYNQQ+ QQ+ SN+ + +A MKE MTRT
Sbjct: 374 FPPTPHQYNQQKNYVQPAQQNLSNMEILMKELITKNDATMKELMTRTDVTMKDMKDVKDY 433
Query: 241 ----DAAIRSLEMQVGQIANDQKSRPQGTLPGHTENLKRDREGKDHCKAVITRSGLSYEG 300
D +R LEMQ+GQ+ N+ ++RPQG+LP TE + R GK+HC ++ TRSGL YEG
Sbjct: 434 MGRNDVTVRKLEMQLGQLVNEVRTRPQGSLPSSTEEPR--RIGKEHCNSIATRSGLKYEG 493
Query: 301 PSLPDEGTHVVTPIPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTT----PQVGNI 360
P +PDE +H S EKD +A K V P + PQV N
Sbjct: 494 PRMPDESSH--------------------SPSREKDTQAVPDKIVEPAVSVPVAPQVSNS 553
Query: 361 KIPPPFPQRLVKKNQDGHFKKFFEILKQLHINIRLIDALEQMPNYTKFLKDIISRRKKLG 420
+ PPPFPQRLV+KNQD +F+KF +ILKQLHINI ++ALEQMP Y KF+KDII+R+KKLG
Sbjct: 554 RPPPPFPQRLVRKNQDNNFRKFLDILKQLHINIPFVEALEQMPTYAKFIKDIITRKKKLG 613
Query: 421 EHETVALTKCSSDALGNPLPVKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLS--- 480
E+ETVALT+CSS+ + +P K DPGSFTIPC IGGK++GRALCDLGASINLMPLS
Sbjct: 614 EYETVALTECSSNVFKSKMPPKLKDPGSFTIPCLIGGKDVGRALCDLGASINLMPLSIFK 673
Query: 481 ------------------------------------------------------------ 530
Sbjct: 674 KFEIGKASPTTVTLQLADRSITKPEGKIEDVLVKVDKFIFPTDFIILDCEADKDVPIILG 733
BLAST of Moc08g31580 vs. NCBI nr
Match:
XP_022158314.1 (uncharacterized protein LOC111024824 [Momordica charantia])
HSP 1 Score: 405.2 bits (1040), Expect = 2.2e-108
Identity = 212/258 (82.17%), Postives = 224/258 (86.82%), Query Frame = 0
Query: 1 MMLNTASNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDTVASMQAQ 60
MMLNTA+N SL EKSI+EI+DILNKMTD NDQGEIGRSLPKKQVSA +FELDTVASMQAQ
Sbjct: 160 MMLNTAANDSLFEKSISEIIDILNKMTDSNDQGEIGRSLPKKQVSARVFELDTVASMQAQ 219
Query: 61 MAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYV 120
MA +NQMLKQLTMEKETKT TSA+ EPS LQISDISCVYCGDN LYENCPANP S+FYV
Sbjct: 220 MATINQMLKQLTMEKETKTATSAMLEPSLCLQISDISCVYCGDNQLYENCPANPTSVFYV 279
Query: 121 GQGAQQNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPAQQYKQNYTPPGFPTQPASQ 180
GQ AQ+NFNPYSNTY+P WR+HPNFSWSNQGVASSSAQ PAQQYKQNYTPP FPTQPASQ
Sbjct: 280 GQRAQRNFNPYSNTYDPRWRNHPNFSWSNQGVASSSAQTPAQQYKQNYTPPDFPTQPASQ 339
Query: 181 PQQYNQQRGQNTTQQSGSNVSLEAM-----------MKEFMTRTDAAIRSLEMQVGQIAN 240
PQQYNQQR QNTTQQ GSN SLEAM KEFMTRTD IR LEMQVGQIAN
Sbjct: 340 PQQYNQQRAQNTTQQGGSNGSLEAMRKEFMTRSEATTKEFMTRTDTGIRKLEMQVGQIAN 399
Query: 241 DQKSRPQGTLPGHTENLK 248
D+KSRPQGTLPG+TEN K
Sbjct: 400 DKKSRPQGTLPGNTENPK 417
BLAST of Moc08g31580 vs. NCBI nr
Match:
XP_022142953.1 (uncharacterized protein LOC111012947 [Momordica charantia])
HSP 1 Score: 351.7 bits (901), Expect = 2.8e-92
Identity = 252/652 (38.65%), Postives = 319/652 (48.93%), Query Frame = 0
Query: 1 MMLNTASNGSLLEKSINEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGIFELDTVASMQ 60
MMLN A+NG KS NEIV+IL+++++ NDQ E R+ K+ A + LD + SMQ
Sbjct: 239 MMLNGAANGKFTSKSFNEIVEILDQLSEHNDQWCSEKPRTQSKRADPAIVLALDNMTSMQ 298
Query: 61 AQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIF 120
Q+ + QMLK + + A PSP+ QI++ +C
Sbjct: 299 KQIDTITQMLKNMEKNNAAAALAPATTNPSPVYQIAESTC-------------------- 358
Query: 121 YVGQGAQQNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPAQQYKQNYTPPGFPTQPA 180
Q QQ FNPYSN YNPGW+ HPNFSWS QG SSS QQYKQ YTPP FP PA
Sbjct: 359 ---QMNQQKFNPYSNIYNPGWKQHPNFSWSGQG--SSSGTGQNQQYKQAYTPPRFPNSPA 418
Query: 181 --SQPQQYNQQRGQ-NTTQQSGSNVSL---------EAMMKEFMTRTDAAI--------- 240
PQQYNQQ+ QQ+ SN+ + +A MKE MTRTDA I
Sbjct: 419 FPPTPQQYNQQKNYGQPAQQNLSNMEILMKEFITKNDATMKELMTRTDATIKDMKEVKDY 478
Query: 241 --------RSLEMQVGQIANDQKSRPQGTLPGHTENLKRDREGKDHCKAVITRSGLSYEG 300
R+LEMQ+GQ+AN+ ++RPQG+LP TE +R
Sbjct: 479 MGRNDVTVRNLEMQLGQLANEVRTRPQGSLPSSTEEPRR--------------------- 538
Query: 301 PSLPDEGTHVVTPIPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVP----NTTPQVGNI 360
+V+P P S EKD + K V P + PQV N
Sbjct: 539 ---------IVSPSP----------------SREKDTQVVPDKIVEPEVSVSVAPQVSNC 598
Query: 361 KIPPPFPQRLVKKNQDGHFKKFFEILKQLHINIRLIDALEQMPNYTKFLKDIISRRKKLG 420
+ PPPFPQRLV+KNQD +F+KF +ILKQLHINI ++ALEQMP Y KFLKDII+R+KKLG
Sbjct: 599 RSPPPFPQRLVRKNQDNNFRKFLDILKQLHINIPFVEALEQMPTYAKFLKDIITRKKKLG 658
Query: 421 EHETVALTKCSSDALGNPLPVKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLS--- 480
E+ETVALT+CSS+ + P K DPGSFTI C IGGK++GRALCDLGA INLMPLS
Sbjct: 659 EYETVALTECSSNVFKSKXPPKLKDPGSFTIXCLIGGKBVGRALCDLGAXINLMPLSIFK 718
Query: 481 ------------------------------------------------------------ 530
Sbjct: 719 KLEIGKAXPTTVTLXLADRSITKPEGKIEDVLVKVDKFIFPADFIILDCEADKDVPIILG 778
BLAST of Moc08g31580 vs. NCBI nr
Match:
XP_024028757.1 (uncharacterized protein LOC112093792 [Morus notabilis])
HSP 1 Score: 349.4 bits (895), Expect = 1.4e-91
Identity = 262/741 (35.36%), Postives = 375/741 (50.61%), Query Frame = 0
Query: 2 MLNTASNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDTVASMQAQM 61
M++ ++N +LL K+ NE +IL +M++ N Q R ++V AGI E+D V ++ AQ+
Sbjct: 15 MVDASANRALLVKTYNEAYEILERMSNNNYQWPTERVFAGRRV-AGIHEVDAVTALTAQV 74
Query: 62 AAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVG 121
++++ +LK L + T P+ ++CVYCG H +ENCP+NP S+ YV
Sbjct: 75 SSLSNILKSLNVAAPANAAT-------PVA----LTCVYCGAEHSFENCPSNPESVCYV- 134
Query: 122 QGAQQNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPAQQYKQNYTPPGFPTQPASQP 181
+N NPYSN+YN GW+ HPNFSWSNQ A K Y PPGF Q Q
Sbjct: 135 NNFNRNNNPYSNSYNQGWKQHPNFSWSNQ-----EANPMPGPSKPAY-PPGF-HQHQHQR 194
Query: 182 QQYNQQRGQNTTQQSGSNVSLEAMMKEFMTRTD--------------AAIRSLEMQVGQI 241
Q +Q Q Q+ S+ +EA++KE+M R D A++R+LE QVGQ+
Sbjct: 195 QPPQEQSNQRQPHQA-SSTPMEALLKEYMARNDSLIPGQAALLQSQAASLRTLENQVGQL 254
Query: 242 ANDQKSRPQGTLPGHTENLKRD--REGKDHCKAVITRSGLSYEGPSLPDEGTHVVTPIPA 301
AN +RPQG+LP T+N +RD K+HCKA+ ++G E + T
Sbjct: 255 ANVLSNRPQGSLPSDTKNPRRDGKEHCKEHCKAITLQNGREIEQLTRQTAAT-------- 314
Query: 302 STSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTTPQVGNIKIPPPFPQRLVKKNQDGHFK 361
S+ Q +E +P + E+D + P+ + PPPFPQR + QD F+
Sbjct: 315 EHSSIQTQEVQQPPAESEQDVVDQDATAKLKQNKPE----RPPPPFPQRFQNQKQDKQFR 374
Query: 362 KFFEILKQLHINIRLIDALEQMPNYTKFLKDIISRRKKLGEHETVALTKCSSDALGNPLP 421
+F ++LKQLHINI L++ALEQMP+Y KF+KDI++++++LGE ETVALT+ S L N LP
Sbjct: 375 RFLDVLKQLHINIPLVEALEQMPSYVKFMKDILTKKRRLGEFETVALTEECSAILKNRLP 434
Query: 422 VKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLS----------------------- 481
K DPGSFTIPCSIG + +G+ALCDLGASINLMP+S
Sbjct: 435 PKLKDPGSFTIPCSIGDQYIGKALCDLGASINLMPMSIFRKLGIGEVSPTTVTLQLADRS 494
Query: 482 ------------------------------------------------------------ 541
Sbjct: 495 YAHPEGKIEDVLVRVDKFIFPADFIVLDYEADKEVPIILGRPFLATGKTLIDVQKGELTM 554
Query: 542 -VNDEQVTFNVLDAMRLPDEVEECSTIGA----IMEELQQMMVEDLEADLEAVEKE---- 601
V+D+QVTFNV AMR DEVEECS + + E ++ E L + + ++ E
Sbjct: 555 RVHDQQVTFNVFKAMRFTDEVEECSAMNVLDSLVAAEFEKTCAEKLMTEEDLIDSEINED 614
Query: 602 ------SKI-APGAILPQLERFEFLQPTIADLKALQPSIIEPPELEKKPLPSHLKYAYLG 623
S++ A FE L + L+ +PS+ EPP LE +PLP+HL+YAYLG
Sbjct: 615 NNDKQVSRLEGRHAATKSRRHFESLDLSTEPLRQHKPSVEEPPILELRPLPAHLRYAYLG 674
BLAST of Moc08g31580 vs. ExPASy TrEMBL
Match:
A0A6J1DMT3 (LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 OS=Momordica charantia OX=3673 GN=LOC111022007 PE=4 SV=1)
HSP 1 Score: 612.5 bits (1578), Expect = 4.3e-171
Identity = 535/1360 (39.34%), Postives = 685/1360 (50.37%), Query Frame = 0
Query: 1 MMLNTASNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDTVASMQAQ 60
MMLNTA+NGSLLEKS+NEIVDILNKM DINDQGE GRSL KKQVSAGIFELDTVA MQAQ
Sbjct: 131 MMLNTAANGSLLEKSVNEIVDILNKMIDINDQGERGRSLSKKQVSAGIFELDTVALMQAQ 190
Query: 61 MAAMNQMLKQLTMEKETKTVTSA---IPEPSPILQISDISCVYCGDNHLYENCPANPASI 120
MAAMNQMLKQ TMEKETKTVTS IP + Q+ + + + D ++
Sbjct: 191 MAAMNQMLKQFTMEKETKTVTSLHINIPLIDALEQMPNYT-KFLKDIISRRKKLGEHETV 250
Query: 121 FYVGQGAQQNFNPYSNTYNPGWRH---HPNFSWSNQGV----ASSSAQAPAQQYKQ---- 180
N NP + + W + GV + SS P Q +
Sbjct: 251 ALTKCSRATNSIECQKRLNPKMKEVVKNEIIKWLDAGVIYPISGSSWVNPVQCVPKKGGM 310
Query: 181 ----NYTPPGFPTQPASQPQQYNQQRGQN--TTQQSGSNVSLEAMMKEFMTRTDAAIRSL 240
N PT + + R N T + ++ M+ +
Sbjct: 311 TVITNDDNKLIPTGTVTSWRVCMDCRRLNKVTRKYHFPLPFIDQMLDRLAGKECYCFLDG 370
Query: 241 EMQVGQ--IANDQKSRPQGTLPGHTENLKRDREGKDHCKAVITRSGLSYEGPSLP----- 300
Q IA + + T P T +R G + A R ++ L
Sbjct: 371 YSGYNQITIAPKDQEKTTFTCPYGTFAFRRMSFGLCNAPATFQRCMIAIFSDMLESTVEI 430
Query: 301 --DEGTHVVTPIPASTSNPQQE----EKAEPVSSEEK------------DKKADKGKQVV 360
D+ + N + E+ V + EK + ++KG +V
Sbjct: 431 FMDDFSVFGESFDVCLENLENVLKRCEETNLVLNWEKXHFMVGEGTVLGHRISEKGIEV- 490
Query: 361 PNTTPQVGNI-KIPPPFPQRLVKK--NQDGHFKKFFEILKQLHINI-RLIDALEQMPNYT 420
T ++ I K+P P + V+ G +++F + ++ + +L++ + Y
Sbjct: 491 --DTAKIDVIAKLPRPITVKGVRSFLGHAGFYRRFIKDFAKISKPLCQLLEVDQPFVFYE 550
Query: 421 KFLKDIISRRKKLGEHETVALTKCSSDALGNPLPVKCNDPGSFTIPCSIGGKNLGRALCD 480
LK + +K L + P + C D F + +G + G+ L
Sbjct: 551 HCLKAFETLKKALSSAPIII-----EXXWNXPXELMC-DASDFAVGVMLGQRK-GKILHP 610
Query: 481 LGASINLMPLSVNDEQVTFNVLDAMRLPDEVEECSTIGA---------------IMEELQ 540
+ + + S + VT L A+ + IG +E +
Sbjct: 611 IYYASKTLNSSQLNYTVTEKELLAVVFAFDKFRSYLIGTKVIVFTDHSALKYLFAKKEAK 670
Query: 541 QMMVEDL----EADLEAVEKESKIAPGAILPQLERFE-FLQPTIADLKALQPSIIEPPEL 600
++ + E DLE K+ K + L R E LQ ++K ++ L
Sbjct: 671 PRLIRWILLLQEFDLEI--KDRKGTXNQVXDHLSRLESSLQHDSCEIK---EHFVDEYLL 730
Query: 601 EKKPLPSHLKYAYLGLNDTLPVIISSCLSNEHESLLLQVNGHRVKKYYGEY----VDRGF 660
+P + YA ++S + N+ ++ H++KK++ + D F
Sbjct: 731 AISEVPWYADYA--------NYLVSKIIPND-------LSKHQIKKFFHDVRFYKWDEPF 790
Query: 661 IGGI--------FVVDREANSRTMEGSSSSKP-HDKEKEKKRVLLPPPTKPGMIPLEPPR 720
+ + V + E N + + K +K R K I L+
Sbjct: 791 LFRLGPDNILCRCVAEHETNGLAEVSNRAIKSILEKSVNVNR-------KDWAIKLDDAL 850
Query: 721 ISHEKLVFDPREQRRKYEEAIRMNPRRNQSLDEFMENSKKVSEEIQLELNSMSIRRRMNL 780
++ R Y+ + M+P R + K ++LE + R++N
Sbjct: 851 WAY----------RTAYKTPLGMSPYR-------IVFGKACHLPLELEHRAYWATRKLNF 910
Query: 781 ----SQDNPVSESLELSIPPPLSTTVAVHVEGQEQGSGDSEHDTEPLEHSDSATV-EIQC 840
+ + + + EL S A Q + D + + LE + + +
Sbjct: 911 DYVAAGEARLLQLHELDEFRQFSYENAKMYNEQTKKWLDKKIMDKNLEPENLVLLFNSRL 970
Query: 841 QITPGAIMDETPPATLQGILSPSFPDPILTKKPLVFYDLEQERTMSKIAEILVALNEARG 900
++ PG + + + + P + + K +F + R E +
Sbjct: 971 KLFPGKLKSRWSGSFVVEHVYPHGAVDLKSSKRNIF-KVNGHRVKKYYGEYI-------- 1030
Query: 901 EDPLKDDGNSGAAQEQLNVDREDEDFGELPQEVHGDDFEDKEDNDDIFQYEVRVRTPVHQ 960
+EQLNVDREDEDFGELPQEVHGD+FED+EDNDDI QYEV+VRTPVH+
Sbjct: 1031 ------------DREQLNVDREDEDFGELPQEVHGDEFEDEEDNDDISQYEVKVRTPVHE 1090
Query: 961 SQQVDEEPPTKEQEGTSETSD-----EEDPRRCSTFGCSKRSRGRSI------------- 1020
SQQVDEEPPTKEQEGTS D E+ S+ G R R R+
Sbjct: 1091 SQQVDEEPPTKEQEGTSGPVDVPSEAMEESSSSSSQGAVSRPRTRTAVARLAAQKEAEAG 1150
Query: 1021 --KKAKRARVQSGAKEPLEEANEEDTDSTEQTPSRVKRVRLEVRRPTFTIRDILLERGFD 1080
KKAK ARVQ A+EPLEEANEE+ DSTEQTPSRVKRVRLEVRRPTFT RDILLERGFD
Sbjct: 1151 PSKKAKTARVQRVAEEPLEEANEEEPDSTEQTPSRVKRVRLEVRRPTFTTRDILLERGFD 1210
Query: 1081 EAQEPVPEYVRRRLVENGWETLFAPITRVSEDLVKEFYVAINPHQGDVVRVWGKVVKFSP 1140
EAQEPVPEYVR+R+VENGWETLFAPITRVSE LVKEFY AINP++GD VRV
Sbjct: 1211 EAQEPVPEYVRKRIVENGWETLFAPITRVSEALVKEFYTAINPNRGDEVRV--------- 1270
Query: 1141 SIINTHYGLLDVFNAIGNEIWVHPSDEQVEEARRHICRPHKTWIISTTGKLSLKPLDINE 1200
GNEI VHPSDEQVEEARR ICRPHKTW IST GKLSLKPLDINE
Sbjct: 1271 ---------------RGNEILVHPSDEQVEEARRLICRPHKTWTISTMGKLSLKPLDINE 1330
Query: 1201 QAIVWMYVVKNRLIPTSHDSSIKRNRAMMVYILMKGVEFNFGELIRNEIRSCSEKMVGPL 1254
QA VWMYVVKNRLIPTS+DSSIKRNRAM+VYIL+KGVEFNFGELIRNEI+SCSEK+
Sbjct: 1331 QATVWMYVVKNRLIPTSYDSSIKRNRAMIVYILVKGVEFNFGELIRNEIQSCSEKL---- 1374
BLAST of Moc08g31580 vs. ExPASy TrEMBL
Match:
A0A6J1DY39 (uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025653 PE=4 SV=1)
HSP 1 Score: 443.0 bits (1138), Expect = 4.5e-120
Identity = 277/652 (42.48%), Postives = 356/652 (54.60%), Query Frame = 0
Query: 1 MMLNTASNGSLLEKSINEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGIFELDTVASMQ 60
MMLN A+NG KS NEIV+IL+++++ N Q E R+ K+ AG+ LD + SMQ
Sbjct: 194 MMLNGAANGKFTSKSFNEIVEILDQLSEHNYQWCSEKSRTQSKRADPAGVLALDNMTSMQ 253
Query: 61 AQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIF 120
Q+ + QMLK + A PSP+ QI++ +C YCGD H ENCP+NP+S++
Sbjct: 254 KQIDTITQMLKNMEKNNAXAASAXATTNPSPVYQIAESTCYYCGDLHPSENCPSNPSSMY 313
Query: 121 YVGQGAQQNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPAQQYKQNYTPPGFPTQPA 180
YVGQ QQ FNPYSNTYNPGW+ HPNFSWS QG SS+ QQYK+ YTPPGFP PA
Sbjct: 314 YVGQMNQQKFNPYSNTYNPGWKQHPNFSWSGQG--SSNTTGHNQQYKEAYTPPGFPNSPA 373
Query: 181 --SQPQQYNQQRGQ-NTTQQSGSNVSL---------EAMMKEFMTRT------------- 240
P QYNQQ+ QQ+ SN+ + +A MKE MTRT
Sbjct: 374 FPPTPHQYNQQKNYVQPAQQNLSNMEILMKELITKNDATMKELMTRTDVTMKDMKDVKDY 433
Query: 241 ----DAAIRSLEMQVGQIANDQKSRPQGTLPGHTENLKRDREGKDHCKAVITRSGLSYEG 300
D +R LEMQ+GQ+ N+ ++RPQG+LP TE + R GK+HC ++ TRSGL YEG
Sbjct: 434 MGRNDVTVRKLEMQLGQLVNEVRTRPQGSLPSSTEEPR--RIGKEHCNSIATRSGLKYEG 493
Query: 301 PSLPDEGTHVVTPIPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVPNTT----PQVGNI 360
P +PDE +H S EKD +A K V P + PQV N
Sbjct: 494 PRMPDESSH--------------------SPSREKDTQAVPDKIVEPAVSVPVAPQVSNS 553
Query: 361 KIPPPFPQRLVKKNQDGHFKKFFEILKQLHINIRLIDALEQMPNYTKFLKDIISRRKKLG 420
+ PPPFPQRLV+KNQD +F+KF +ILKQLHINI ++ALEQMP Y KF+KDII+R+KKLG
Sbjct: 554 RPPPPFPQRLVRKNQDNNFRKFLDILKQLHINIPFVEALEQMPTYAKFIKDIITRKKKLG 613
Query: 421 EHETVALTKCSSDALGNPLPVKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLS--- 480
E+ETVALT+CSS+ + +P K DPGSFTIPC IGGK++GRALCDLGASINLMPLS
Sbjct: 614 EYETVALTECSSNVFKSKMPPKLKDPGSFTIPCLIGGKDVGRALCDLGASINLMPLSIFK 673
Query: 481 ------------------------------------------------------------ 530
Sbjct: 674 KFEIGKASPTTVTLQLADRSITKPEGKIEDVLVKVDKFIFPTDFIILDCEADKDVPIILG 733
BLAST of Moc08g31580 vs. ExPASy TrEMBL
Match:
A0A6J1DZ19 (uncharacterized protein LOC111024824 OS=Momordica charantia OX=3673 GN=LOC111024824 PE=4 SV=1)
HSP 1 Score: 405.2 bits (1040), Expect = 1.0e-108
Identity = 212/258 (82.17%), Postives = 224/258 (86.82%), Query Frame = 0
Query: 1 MMLNTASNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDTVASMQAQ 60
MMLNTA+N SL EKSI+EI+DILNKMTD NDQGEIGRSLPKKQVSA +FELDTVASMQAQ
Sbjct: 160 MMLNTAANDSLFEKSISEIIDILNKMTDSNDQGEIGRSLPKKQVSARVFELDTVASMQAQ 219
Query: 61 MAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYV 120
MA +NQMLKQLTMEKETKT TSA+ EPS LQISDISCVYCGDN LYENCPANP S+FYV
Sbjct: 220 MATINQMLKQLTMEKETKTATSAMLEPSLCLQISDISCVYCGDNQLYENCPANPTSVFYV 279
Query: 121 GQGAQQNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPAQQYKQNYTPPGFPTQPASQ 180
GQ AQ+NFNPYSNTY+P WR+HPNFSWSNQGVASSSAQ PAQQYKQNYTPP FPTQPASQ
Sbjct: 280 GQRAQRNFNPYSNTYDPRWRNHPNFSWSNQGVASSSAQTPAQQYKQNYTPPDFPTQPASQ 339
Query: 181 PQQYNQQRGQNTTQQSGSNVSLEAM-----------MKEFMTRTDAAIRSLEMQVGQIAN 240
PQQYNQQR QNTTQQ GSN SLEAM KEFMTRTD IR LEMQVGQIAN
Sbjct: 340 PQQYNQQRAQNTTQQGGSNGSLEAMRKEFMTRSEATTKEFMTRTDTGIRKLEMQVGQIAN 399
Query: 241 DQKSRPQGTLPGHTENLK 248
D+KSRPQGTLPG+TEN K
Sbjct: 400 DKKSRPQGTLPGNTENPK 417
BLAST of Moc08g31580 vs. ExPASy TrEMBL
Match:
A0A6J1CPJ3 (uncharacterized protein LOC111012947 OS=Momordica charantia OX=3673 GN=LOC111012947 PE=4 SV=1)
HSP 1 Score: 350.5 bits (898), Expect = 3.1e-92
Identity = 252/652 (38.65%), Postives = 319/652 (48.93%), Query Frame = 0
Query: 1 MMLNTASNGSLLEKSINEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGIFELDTVASMQ 60
MMLN A+NG KS NEIV+IL+++++ NDQ E R+ K+ A + LD + SMQ
Sbjct: 239 MMLNGAANGKFTSKSFNEIVEILDQLSEHNDQWCSEKPRTQSKRADPAIVLALDNMTSMQ 298
Query: 61 AQMAAMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIF 120
Q+ + QMLK + + A PSP+ QI++ +C
Sbjct: 299 KQIDTITQMLKNMEKNNAAAALAPATTNPSPVYQIAESTC-------------------- 358
Query: 121 YVGQGAQQNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPAQQYKQNYTPPGFPTQPA 180
Q QQ FNPYSN YNPGW+ HPNFSWS QG SSS QQYKQ YTPP FP PA
Sbjct: 359 ---QMNQQKFNPYSNIYNPGWKQHPNFSWSGQG--SSSGTGQNQQYKQAYTPPRFPNSPA 418
Query: 181 --SQPQQYNQQRGQ-NTTQQSGSNVSL---------EAMMKEFMTRTDAAI--------- 240
PQQYNQQ+ QQ+ SN+ + +A MKE MTRTDA I
Sbjct: 419 FPPTPQQYNQQKNYGQPAQQNLSNMEILMKEFITKNDATMKELMTRTDATIKDMKEVKDY 478
Query: 241 --------RSLEMQVGQIANDQKSRPQGTLPGHTENLKRDREGKDHCKAVITRSGLSYEG 300
R+LEMQ+GQ+AN+ ++RPQG+LP TE +R
Sbjct: 479 MGRNDVTVRNLEMQLGQLANEVRTRPQGSLPSSTEEPRR--------------------- 538
Query: 301 PSLPDEGTHVVTPIPASTSNPQQEEKAEPVSSEEKDKKADKGKQVVP----NTTPQVGNI 360
+V+P P S EKD + K V P + PQV N
Sbjct: 539 ---------IVSPSP----------------SREKDTQVVPDKIVEPEVSVSVAPQVSNC 598
Query: 361 KIPPPFPQRLVKKNQDGHFKKFFEILKQLHINIRLIDALEQMPNYTKFLKDIISRRKKLG 420
+ PPPFPQRLV+KNQD +F+KF +ILKQLHINI ++ALEQMP Y KFLKDII+R+KKLG
Sbjct: 599 RSPPPFPQRLVRKNQDNNFRKFLDILKQLHINIPFVEALEQMPTYAKFLKDIITRKKKLG 658
Query: 421 EHETVALTKCSSDALGNPLPVKCNDPGSFTIPCSIGGKNLGRALCDLGASINLMPLS--- 480
E+ETVALT+CSS+ + P K DPGSFTI C IGGK++GRALCDLGA INLMPLS
Sbjct: 659 EYETVALTECSSNVFKSKXPPKLKDPGSFTIXCLIGGKDVGRALCDLGAXINLMPLSIFK 718
Query: 481 ------------------------------------------------------------ 530
Sbjct: 719 KLEIGKAXPTTVTLXLADRSITKPEGKIEDVLVKVDKFIFPADFIILDCEADKDVPIILG 778
BLAST of Moc08g31580 vs. ExPASy TrEMBL
Match:
A0A2G9GK35 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_21798 PE=4 SV=1)
HSP 1 Score: 302.0 bits (772), Expect = 1.3e-77
Identity = 228/670 (34.03%), Postives = 323/670 (48.21%), Query Frame = 0
Query: 3 LNTASNGSLLEKSINEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDTVASMQAQMA 62
L+ + S L + E ++LN + + + + R+ P K +AG+ E+D V ++ A++
Sbjct: 51 LDHLNGDSFLSGTTAECHNLLNNLVANHYEKKSERATPPK--AAGVIEVDQVTALNAKID 110
Query: 63 AMNQMLKQLTMEKETKTVTSAIPEPSPILQISDISCVYCGDNHLYENCPANPASIFYVGQ 122
+ Q +K + + +Q + ++C CG++H + CP + SI +V
Sbjct: 111 FLMQSMKNFGVNQ---------------VQHTPVTCDECGESHPSDQCPHSVESIQFVSN 170
Query: 123 GAQQNFNPYSNTYNPGWRHHPNFSW-SNQGVASSSAQAPAQQYKQNYTPPGFPTQPASQP 182
+ NPYSNTYNPGWR HPNFSW +NQG S+ P F
Sbjct: 171 ARKPQNNPYSNTYNPGWRQHPNFSWNNNQGQGSA---------------PRF-------- 230
Query: 183 QQYNQQRGQNTTQQSGSNVSLEAMMKEFMTRTDAAIRSLEMQVGQIANDQKSRPQGTLPG 242
QQ QQ+ Q Q+ SLE + +FM T A +++E Q+GQ+AN SRPQG+LP
Sbjct: 231 QQGGQQQVQQPIQE--KKPSLEETLIQFMASTAANFKTMETQIGQLANAINSRPQGSLPS 290
Query: 243 HTENLKRDREGKDHCKAVITRSGLSYEGPSLPDEGTHVVTPIPASTSNPQQEEKAEPVSS 302
+TE R ++GK C+AV R+G + + K + V S
Sbjct: 291 NTEPNPR-QDGKAQCQAVTLRNGRELQ-----------------EVVKEPTKSKEKEVIS 350
Query: 303 EEKDKKADKGKQVVPNTTPQVGNIKIPPPFPQRLVKKNQDGHFKKFFEILKQLHINIRLI 362
EEK+K+ + +V TT Q PPFPQRL K+ + F KF E+ K+LHINI
Sbjct: 351 EEKEKEVEAPLEVSKPTTLQ-------PPFPQRLQKQKLEKQFLKFLEVFKKLHINIPFA 410
Query: 363 DALEQMPNYTKFLKDIISRRKKLGEHETVALTKCSSDALGNPLPVKCNDPGSFTIPCSIG 422
+ALEQMP+Y KF+KDI+S++++LG++ETVALT+ S + N LP K DPGSFTIPC+IG
Sbjct: 411 EALEQMPSYVKFMKDILSKKRRLGDYETVALTEECSAIIQNKLPPKLKDPGSFTIPCTIG 470
Query: 423 GKNLGRALCDLGASINLMPLS--------------------------------------- 482
GRALCDLGASINLMP S
Sbjct: 471 THFSGRALCDLGASINLMPYSIYRTLGLGEAKPTSITLQLADRSLTYPKGVIEDILVKVD 530
Query: 483 ---------------------------------------------VNDEQVTFNVLDAMR 542
V D+Q+TFNV AM+
Sbjct: 531 KFIFPADFVVLDMEVDIEVPIILGRPFLATGRTLIDVQKGELTMRVQDQQITFNVFKAMK 590
Query: 543 LPDEVEECSTIG-----AIMEELQQMMVEDLEADLEAVEKESKIAPGAILPQLERFEFLQ 575
P+E +EC + A E + + ++ LE L + E ++ L+ ++ +
Sbjct: 591 FPNESDECFAVNLFDNLAGNESIAEQSLDPLERALLDLLDEENEEDYEVVKTLDASKYFK 650
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022154847.1 | 8.9e-171 | 39.34 | LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia] | [more] |
XP_022159235.1 | 9.4e-120 | 42.48 | uncharacterized protein LOC111025653 [Momordica charantia] | [more] |
XP_022158314.1 | 2.2e-108 | 82.17 | uncharacterized protein LOC111024824 [Momordica charantia] | [more] |
XP_022142953.1 | 2.8e-92 | 38.65 | uncharacterized protein LOC111012947 [Momordica charantia] | [more] |
XP_024028757.1 | 1.4e-91 | 35.36 | uncharacterized protein LOC112093792 [Morus notabilis] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1DMT3 | 4.3e-171 | 39.34 | LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 OS=Momordica charantia... | [more] |
A0A6J1DY39 | 4.5e-120 | 42.48 | uncharacterized protein LOC111025653 OS=Momordica charantia OX=3673 GN=LOC111025... | [more] |
A0A6J1DZ19 | 1.0e-108 | 82.17 | uncharacterized protein LOC111024824 OS=Momordica charantia OX=3673 GN=LOC111024... | [more] |
A0A6J1CPJ3 | 3.1e-92 | 38.65 | uncharacterized protein LOC111012947 OS=Momordica charantia OX=3673 GN=LOC111012... | [more] |
A0A2G9GK35 | 1.3e-77 | 34.03 | Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_21798 PE=... | [more] |
Match Name | E-value | Identity | Description | |