Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGCCTCCTTCAGCAGCTGCAGCCATGATGACAAATGGGGGCTGTGAGATCCAAAGTGCCTTTTTTTTTTTCCTTTTTCGCTTTCTAAACTTTGGATCAAATCAAATGATATCAGAAAGGGAAAACTCCAACCTAAAAGCTCACGTGCTGCACGTGCAGGACATGGTCATACTTTCCTCTCTAATTATTAACCTAAACTCTTCCTTTCTCTCCACGTGTCGCTTCGCTGTCATTCCAATTCATTCACGTGCCTATTTTTTTTACCTTGCCTGTACTAGGGACGGGTATTACTTTTTCTCGAAGGGAAAACGGGGTGGGAATCCCCATTTAAGCACCCCACAGAGAGAGGAGAAAATTTTCTCCGAGAATATGTTTGCTATCTTGTTCTCTATCCCCACCCCGTTTATTATTTCTATTATATATATATATATATATATATATATATATAACCTTAAATCTAAATATTTATATGTAAAATTTATTAGACATTGTCACGGTGTTTATATTTTGGGACTTATTTGACTTAGGGGTGAGCATTGGTTGGTTTTTGGCCGATCCCGACACCGACATCTCCCTTCCTATGGATTGCGCCAATCGACTGACTGATCGGTTCAGTCGGTCGGTTTGTCAGTTTAACCGACCTTTTTTTTTAATTATTACTTTATCCTTTTGATTTTTTCCCATATTCACGTTTGTAGTGCATAGGACGACGCCTATGGCAGAGGGCAAAAAAAAATAGAAACAGAAAAGGCGATGGCGGAGGGCGACGAGCGACTGAGAGAATAGAGAGTAAGGAGAGGACTCGTGAGGGACTCGGAGAAAAGGAAAACAAAAAAGAAATTGAGATGTGAAACCCTAAAAATATTTGTTTTATTTTTATTTTTTTAAAAAAGATATTAGATAAAGATTACATAGTCGGTCGGTATCGCCTAGTCGACGACAAAATTGGTTTGGGAACCCCAAAACCGATGCCGAACATACGGCATGTAAAACCGACCGACCAACTTCTTTTCGATCGGTTTCGATGGATCGGCTCAGTTTTTTGGGTTCCCATGCTCACCCCTAATTTGACTTGATAATTTTAAGGGTAAGTTTTAAATTTTAGGATTTGTCTTTTTATGCATATAAATTATGGGACCTTAAAAAATTTAAGAATTTCACTCTCGATGGAGCATTAACAAGAAAGGGAATAATTCCTTGTAGGAAACTTGTGGGAGACCCAATTCCTGCAAAATTTTGTGAAGATGGAAAATAAAATGGAGAGCAGAAACGGGGATGGCTGGATAACATTACTAGCCTACATCCTGTTGATCTAAACTAATTAAGAAAAAGAAAGAGGAATGGTTGTTATTAGTCTTGTTTGGGGTAAACTTACCCAGGACACGTGGAGGACCATTATTCGTGTTGGAAGAAATTAGATAAAAATGATTGTTTAGGTCCACCTCGGCCCCACCGAGGTGGACCGAGTTTGCCCCTTAAAAGATGAAGTATGGGCACCAAAGCTCGACCTGATTGTGGTCCGACCTGCTCGGAACCCGACAGGTCCGATATGAGAAAAGACATGTAACCGCCGGTCGCGCGTGCTGTCGGGCCCATTACCTATAAATAGAGGAGTACATTTCGCGCTCAGGTATCGAATCTGACCTCGAACTAAATAAGGAGTCCGATCTATACTAACTTGAGCGTCGGAGTGTTCGCCCTCTTGTGCAGGTCCATCTAGTGTTCAGGTCGAAACCGGAGACCGGGTTCGAGCTCGATTCGTGAAGAACCGTTGTGCAGATTCCTGCATAAACATTTGGCGCCGTCTGTGGGAACGACATCTTAAGTTATCCCGATTTAAAAAAAAAATATACGCAAAGATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAGGGGTGTGAACGCTGATCACGACCCTCAGCAAGACCTCGGTGCAAGAATAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCT
GCCCGTCATGCGAACCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAAGAGGGACATCGAGAAAAACCTCCCAAAGGGCTAACCAAGCAGCAGACCCTGAAGCTCTGTCTGCTCTTCAGCGCGAGTTGGATGATATGCACCATCGGTTGCGCACAGTGGAAGAGATGTACGCCGAGGCAACGCGCGTTAACCGAACTGCGTCTCCCTCTATGGCTCCAGGCGCACCCGGTGAAAAGGGAGTTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCGAACGATGGGGGAGTGGATTACAGCTTGCGAGACAACGATTTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTGGCACCAGAAGCTGTGATCACGAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGCGACTTGGGAGAATCGCCATTCATCGCAGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGATGGGTCTAAGGACCCTAAAGATTATGTTGAGGTTTTCGAGGGCCTCATGGACTTTCAAGCGGCGACAGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTATCGAAGACTGACGGCCAGGTCGATCTCGACCTACTCCCAGCTGAGAAAGGAGTTCATTAGTCAATTCTCCTCTCGGCATTATGATAGAAAGACAGCGACTCACCTTGCCACCATCAGGCAGAAGGAAGGAGAGACGCTGAGAGAGTATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTTCTCACCGGCCTGGCCGATGAGGCCTTAACCGTAAAACTTGGAGAGGAAGCTCCAGCCACTTTCGCCGAAGTTTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTCCGAACCAAGACTGGCCGACCTAAAAAACAGATCGATCAGAAGAAACTTAGCCAGGAGAGGATGAGGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCATCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGCGGCTCCATCCGGAGCCGACCTTATGAGCGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACGAACATTGAGGAAAATGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAGAATCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGATAACGGCCATAATACGATAAGCTGCTGGGAACTGAAGCGCCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTTTAGCTCAATCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACTCCGCCCCGCCGGAATGACCGACCTGCAGTCATCAACACTATTTTCGGAGGCCCGAACGGGGGCCAGTCCGGAAACAAGAGGAATGAGCTAGCTCGCGAAGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGGGGGGATTCATTTGCCCCACAATGACGCGCTCGTGATCGCCCCTCTCATTGATCACGTCCTGGTCCGAAGGGTATTGGTTGATGGAGGCGCATCTGCGAACATCTTGTCCCTCCCAACATATTTAGCATTGGGATGGACCTGGTCACAATTGAAGAAGAGTCCAACACCCCTGGTTGGATTCTCTGGTGAATCGGTCTCCCCAGAAGGGTGCATCAACCTGCCGGTAACTATAGGGCAAGATGCCACCCAAGTAACGCA
GATGGCCGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCTGTTCCTTCCACGTTGCATCAAGTCTTGAAGTACTCAACCCTTAATGGAGTGGGCACGGTCCGAGGTGAACAAAAAACTTCAAGAGAGTGCTACGCGTCCGAGCTCAAGGGATCGTCGGTATGTGCCCTGGAGGAACAAATCAATCATGGCAAGCAGCAGGAGTCAGGGACCGACCTGCCAAAAGAAGGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCAAGCTTGTTCCTTTACTTAGCCTCGAAAAACAAGTAAGTATAGGAACCAAGCTGGGGGCCACTGACAGGGAAGAACTGATCAACTTCCTCAGGTCTAACTCGGACGTCTTCGCATGGTCTCACGAGGACATGCCTGGTATTGACCCGAAGATTATGGTGCATCGCCTCAACATAGACCCATCATTCCGACCTGTAAAGCAAAAAAGAAGACCTGTAAATAAAGAGAGGAGTGATGTAATTGTTGAGGAAGTAAACAAACTCTTAAAAGCTGAATATATACGAGAAATTTTGTATCCTGAGTGGCTCTCTAACGTTGTATTAGTTAAAAAATCCAACGGGAAGTGGAGGATGTGCGTGGACTTTACAAATTTAAATAAGGCGTGCCCAAAAGATTGCTTCCCCCTTCCAAGGATCGACCAGTTCGTGGACGCAACAGCCGGGCACGAGCTACTCACCTTCATGGATGCCTACTCCGGATACAACCAAATTAAAATGCATATACCAGACCAAGATCACACCGCGTTTATAACAGACCAAGGTCTGTACTGCTACAAGGTGATGCCTTTTGGCCTAAAGAATACAGGTGCGACCTACCAAAGGATGGTGAACAAAATGTTTGCCAAGCAGATCGGTTGGAATATGGAAGTGTATGTGGACGACATGCTTGTCAAAAGCAAGCAGTCAAAGTCGCATCTTTCCGATCTGACTGAAGCCTTTGAAGTGCTACGGAAGTACCAAATGAAGCTCAATCCGACCAAGTGCGCCTTCGGAGTTTCCTCGGGAAAGTTCCTTGGTTTTATGATAAATAACCGCGGGATTGAGGCTAACCCAGAGAAGATAAAGGTCGTGCTCGACATGGAGGCACCCAAGACTCTAAAGCAGCTTCAGTGCCTCAATGGCAGGATTGCAGCCTTGAATAGGTTCGTGTCAAGATCGACGGACAAGTGTCTTCCGTTCTTCAAAGTTCTTAGGAAGAAGGGACCGTTCGAATGGACGGAAGAGTGTGAGCAAGTGTTGGGGCAGTTAAAAGACTATCTCTGCTCGGCACCTCTACTTGAGAAGCCATTACCAGGGGAAAAACTCCATTTGTACCTGGCGGTATCTGCTAGCGCCGTCAGCTCGGCACTGATCAAGCAGGAGGGCGCAAGCCAGAGCCCCGTTTATTATACCAGTAAGGCCATGACCGAGGCCGAGACTAGATATCCCCAAATGGAGAAGCTGGCCCTCGCTTTAGTGACCTCGGCCCGAAGACTTAGGCCATACTTCCAAGCGCACACGGTCGTAGTACTTACAAACCTGCCCCTGAAGAATATCTTCCTTAAGCCTGAAACATCAGGGTGTCTGATGAAGTGGGCATTGGAATTGAGCGAGTACGACGTCCAGTTCGAGCCCAGAACTGCGATGAAAGGACAAGTAGTGGCCGATTTTATCGCAGAGCTCACTCCACCACCTCAGTTGGTCAAATCCGACCTCCCGTGGACGATCTTCGTTGATGGGTCTTCTAACGAGAGGGGGTGCGGGGCAGGAATCCTCTTGCTCGCACCAGGAGGTGAGCGATTCGAATATGCTCTGCGATTCAACTTTCGGACCTCAAATAATGAGGCCGAGTACGAAGCACTCCTAGCAGGCCTACGCGTTGCCAAAGGACTGGGGGCCAGTCACATAAAGGTCTTTAGTGACTCCCAGCTGATTGTAA
ATCAGATCAAGGAGGAGTACCAAGCGAAGGACCCCCGGATGGAAAAATATCTGAGCAAGGTCAGATCGCACCTCGCCCAGTTCCCGACTCACGAGGTGAGTCAAGTTCCAAGATCTGAGAACTCTAATGTAGATGCCTTAGCCAAATTGGGGTCAGCATACGAGACTGACCTGGCTAGGTCGGTCCCGATCGAAATCCTAGACACTCCTTCAATCTTAGAACCAGATGTGATGAAGGTGGATACTCCGTCACCCACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAGCCCACCGCAAGAGCCGAAGGAGCAAAAGAAGATGACGAGAAGAGCAGCTCGGTTCACGCTCCGAGAAGGAGTGTTGTACCGACGTGGCTTCTCCCTGCCTCTCCTCAAGTGTGTGACTCTCGAAGAAGGCCTCTACATTCTTAGGGAAATTCATGAAGAGGTGTGTGGAAACCACTCCGGCGCCAGGTCGTTATCGGCTAAGGTAATTTGACAAGGGTACTATTGGCCAAGTGTCGAGTAGGATGCAAGGCAGTTTGTGAAAGCTTGCGACAACTGCCAGCGTTTCGCAAATATTATCCACCAACCTCCCGAACTGCTCACCCCCATCTCGGCCCTATGGCCATTTGCCCAGTGGGGGGTGGACATCATAGGTCCTTTTCCTTTGGGCAAAGGGCAAACAAAGTTCGCCGTTGTCGTTGTGGACTACTTCACTAAGTGGGCTGAAGCCGAGGCACTATCCCACATCACAGAGTCCAGAATCACGTCGTTCATATGGACAAACATTGTGTGCCGCTTCGGCATACCAAATGCTATCGTGACAGACAACGGAAAACAATTTGACAATGCAAAGTTCAAGGACTTTTGCAGAAAACTTGGTATAAGCCACCTCAGTTCGTCCCCTGCGCATCCAAAAGCGAATGGACAAGTTGAAGCAGTAAACAAGATCATAAAGCGAGGACTCAAACTAAGGTTGGACTCCAGAAAGGGAAGGTGGGCCGAGGAGCTACCTGAGGTTCTGTGGTCATATCGAACCACCCCACGGGAGTCAACTGGCGAAACTCCGTTCTCGCTAGCCTTTGGTTCCCAAGCTGTTGTACCAGTCGAGATCGGCATACCAACAGACAGGGTAGAACAGTACGAGCCAACGAAGAACGAGGAAGAGCTACTCCTTAACCTGGACTTGTTGGAAGGGAAAAGGGAAATGGCTCAGCTGCGCTTAGCAGAGTATCAGAACAGAATGGCCAGACATTACAATGCCCGAGTTCAACCTCGAAGCTTCCAAGTTGGACATTTGGTCTTAAGAAAAATTCAGAGTCGTGTTGGCACCCTTGACCCAAGTTGGGAGGGACCGTTCGAAGTCAAAGGCATAGTCCGACCTGGAACTTATATGCTGGCCGACTTGGAAGGAAGAGTGCTTGCGCATCCATGGAACGCGGAGCACTTAAAGTGCTATTACCCTTGAAATGTCAAAATCGTCCCAAAATGGACTTGTCAAAATTTTCAAGGGGCGAAATTTTCAATAAATGGAGTTATTTAATTTCATAACTCCGAGTTCGATTAGAAATTAAATGGGGGCCACGGACTCCCATAGGACTTAAAATTGATCAAATTCGATATGTCAAAACCCACGAGTTCGAGGTGCGATGTCAAGATCAATTACAAACCAAAATGCAATCCTTTGAATGCTTAAGTTAAAAATGCGATGTTGAAGGTTCAAGAAATTCAATCCTCTAAACCTACGGGTTCGAGGTGCGATGTGA
mRNA sequence
ATGCCTCCTTCAGCAGCTGCAGCCATGATGACAAATGGGGGCTGGGTGAGCATTGGTTGGTTTTTGGCCGATCCCGACACCGACATCTCCCTTCCTATGGATTGCGCCAATCGACTGACTGATCGGTTCAGTCGGTCGGTCCACCTCGGCCCCACCGAGGTGGACCGAGTTTGCCCCTTAAAAGATGAAGTATGGGCACCAAAGCTCGACCTGATTGTGGTCCGACCTGCTCGGAACCCGACAGGTCCATCTAGTGTTCAGGTCGAAACCGGAGACCGGGTTCGAGCTCGATTCGTGAAGAACCGTTGTGCAGATTCCTGCATAAACATTTGGCGCCGTCTGTGGGAACGACATCTTAAGTTATCCCGATTTAAAAAAAAAATATACGCAAAGATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAGGGGTGTGAACGCTGATCACGACCCTCAGCAAGACCTCGGTGCAAGAATAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCCGTCATGCGAACCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAAGAGGGACATCGAGAAAAACCTCCCAAAGGGCTAACCAAGCAGCAGACCCTGAAGCTCTGTCTGCTCTTCAGCGCGAGTTGGATGATATGCACCATCGGTTGCGCACAGTGGAAGAGATGTACGCCGAGGCAACGCGCGTTAACCGAACTGCGTCTCCCTCTATGGCTCCAGGCGCACCCGGTGAAAAGGGAGTTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCGAACGATGGGGGAGTGGATTACAGCTTGCGAGACAACGATTTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTGGCACCAGAAGCTGTGATCACGAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGCGACTTGGGAGAATCGCCATTCATCGCAGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGATGGGTCTAAGGACCCTAAAGATTATGTTGAGGTTTTCGAGGGCCTCATGGACTTTCAAGCGGCGACAGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTATCGAAGACTGACGGCCAGGTCGATCTCGACCTACTCCCAGCTGAGAAAGGAGTTCATTAGTCAATTCTCCTCTCGGCATTATGATAGAAAGACAGCGACTCACCTTGCCACCATCAGGCAGAAGGAAGGAGAGACGCTGAGAGAGTATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTTCTCACCGGCCTGGCCGATGAGGCCTTAACCGTAAAACTTGGAGAGGAAGCTCCAGCCACTTTCGCCGAAGTTTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTCCGAACCAAGACTGGCCGACCTAAAAAACAGATCGATCAGAAGAAACTTAGCCAGGAGAGGATGAGGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCATCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGCGGCTCCATCCGGAGCCGACCTTATGAGCGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACGAACATTGAGGAAAATGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAGAATCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGATAACGGCCATAATACGATAAGCTGCTGGGAACTGAAGCG
CCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTTTAGCTCAATCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACTCCGCCCCGCCGGAATGACCGACCTGCAGTCATCAACACTATTTTCGGAGGCCCGAACGGGGGCCAGTCCGGAAACAAGAGGAATGAGCTAGCTCGCGAAGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGGGGGGATTCATTTGCCCCACAATGACGCGCTCGTGATCGCCCCTCTCATTGATCACGTCCTGGTCCGAAGGGTATTGGTTGATGGAGGCGCATCTGCGAACATCTTGTCCCTCCCAACATATTTAGCATTGGGATGGACCTGGTCACAATTGAAGAAGAGTCCAACACCCCTGGTTGGATTCTCTGGTGAATCGGTCTCCCCAGAAGGGTGCATCAACCTGCCGGTAACTATAGGGCAAGATGCCACCCAAGTAACGCAGATGGCCGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCTGTTCCTTCCACGTTGCATCAAGTCTTGAAGTACTCAACCCTTAATGGAGTGGGCACGGTCCGAGGTGAACAAAAAACTTCAAGAGAGTGCTACGCGTCCGAGCTCAAGGGATCGTCGGTATGTGCCCTGGAGGAACAAATCAATCATGGCAAGCAGCAGGAGTCAGGGACCGACCTGCCAAAAGAAGGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCAAGCTTGTTCCTTTACTTAGCCTCGAAAAACAAGTAAGTATAGGAACCAAGCTGGGGGCCACTGACAGGGAAGAACTGATCAACTTCCTCAGGTCTAACTCGGACGTCTTCGCATGGTCTCACGAGGACATGCCTGGTATTGACCCGAAGATTATGGTCGGTCCCGATCGAAATCCTAGACACTCCTTCAATCTTAGAACCAGATGTGATGAAGGTGGATACTCCGTCACCCACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAGCCCACCGCAAGAGCCGAAGGAGCAAAAGAAGATGACGAGAAGAGCAGCTCGGTTCACGCTCCGAGAAGGAGTGTTGTACCGACGTGGCTTCTCCCTGCCTCTCCTCAAGTGTTCAAGAAATTCAATCCTCTAAACCTACGGGTTCGAGGTGCGATGTGA
Coding sequence (CDS)
ATGCCTCCTTCAGCAGCTGCAGCCATGATGACAAATGGGGGCTGGGTGAGCATTGGTTGGTTTTTGGCCGATCCCGACACCGACATCTCCCTTCCTATGGATTGCGCCAATCGACTGACTGATCGGTTCAGTCGGTCGGTCCACCTCGGCCCCACCGAGGTGGACCGAGTTTGCCCCTTAAAAGATGAAGTATGGGCACCAAAGCTCGACCTGATTGTGGTCCGACCTGCTCGGAACCCGACAGGTCCATCTAGTGTTCAGGTCGAAACCGGAGACCGGGTTCGAGCTCGATTCGTGAAGAACCGTTGTGCAGATTCCTGCATAAACATTTGGCGCCGTCTGTGGGAACGACATCTTAAGTTATCCCGATTTAAAAAAAAAATATACGCAAAGATGGTGCATCCAGCAAACTCTGCCAATACGACAGAACAGAGGGGTGTGAACGCTGATCACGACCCTCAGCAAGACCTCGGTGCAAGAATAGTCGAGGACCAGGTCCGAGCAGGGCAAGAGGGAGATCTGCCGCGCAGATCTGCCCGTCATGCGAACCAAGAGCTACCACCTGCTCACCCGAAACCCTCAAAAGCCAACAGAGGCCGAAGAGGGACATCGAGAAAAACCTCCCAAAGGGCTAACCAAGCAGCAGACCCTGAAGCTCTGTCTGCTCTTCAGCGCGAGTTGGATGATATGCACCATCGGTTGCGCACAGTGGAAGAGATGTACGCCGAGGCAACGCGCGTTAACCGAACTGCGTCTCCCTCTATGGCTCCAGGCGCACCCGGTGAAAAGGGAGTTCCATCTATCCAACCTGGCGATCGCGAGCCCATTCCGAACGATGGGGGAGTGGATTACAGCTTGCGAGACAACGATTTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTGGCACCAGAAGCTGTGATCACGAGGGAAGAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGCGACTTGGGAGAATCGCCATTCATCGCAGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGATGGGTCTAAGGACCCTAAAGATTATGTTGAGGTTTTCGAGGGCCTCATGGACTTTCAAGCGGCGACAGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTATCGAAGACTGACGGCCAGGTCGATCTCGACCTACTCCCAGCTGAGAAAGGAGTTCATTAGTCAATTCTCCTCTCGGCATTATGATAGAAAGACAGCGACTCACCTTGCCACCATCAGGCAGAAGGAAGGAGAGACGCTGAGAGAGTATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTTCTCACCGGCCTGGCCGATGAGGCCTTAACCGTAAAACTTGGAGAGGAAGCTCCAGCCACTTTCGCCGAAGTTTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTCCGAACCAAGACTGGCCGACCTAAAAAACAGATCGATCAGAAGAAACTTAGCCAGGAGAGGATGAGGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCATCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGCGGCTCCATCCGGAGCCGACCTTATGAGCGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACGAACATTGAGGAAAATGGGATGGAAAAGCTCCTCAAGCGACCTGAGAAGCTCCGAGGAGACCCAGAGAATCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGATAACGGCCATAATACGATAAGCTGCTGGGAACTGAAGCG
CCAGATTGAAGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTTTAGCTCAATCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACTCCGCCCCGCCGGAATGACCGACCTGCAGTCATCAACACTATTTTCGGAGGCCCGAACGGGGGCCAGTCCGGAAACAAGAGGAATGAGCTAGCTCGCGAAGCCAGGCGCGAGGTATGCATCATTAGGGAGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGGGGGGATTCATTTGCCCCACAATGACGCGCTCGTGATCGCCCCTCTCATTGATCACGTCCTGGTCCGAAGGGTATTGGTTGATGGAGGCGCATCTGCGAACATCTTGTCCCTCCCAACATATTTAGCATTGGGATGGACCTGGTCACAATTGAAGAAGAGTCCAACACCCCTGGTTGGATTCTCTGGTGAATCGGTCTCCCCAGAAGGGTGCATCAACCTGCCGGTAACTATAGGGCAAGATGCCACCCAAGTAACGCAGATGGCCGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCTGTTCCTTCCACGTTGCATCAAGTCTTGAAGTACTCAACCCTTAATGGAGTGGGCACGGTCCGAGGTGAACAAAAAACTTCAAGAGAGTGCTACGCGTCCGAGCTCAAGGGATCGTCGGTATGTGCCCTGGAGGAACAAATCAATCATGGCAAGCAGCAGGAGTCAGGGACCGACCTGCCAAAAGAAGGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCAAGCTTGTTCCTTTACTTAGCCTCGAAAAACAAGTAAGTATAGGAACCAAGCTGGGGGCCACTGACAGGGAAGAACTGATCAACTTCCTCAGGTCTAACTCGGACGTCTTCGCATGGTCTCACGAGGACATGCCTGGTATTGACCCGAAGATTATGGTCGGTCCCGATCGAAATCCTAGACACTCCTTCAATCTTAGAACCAGATGTGATGAAGGTGGATACTCCGTCACCCACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAGCCCACCGCAAGAGCCGAAGGAGCAAAAGAAGATGACGAGAAGAGCAGCTCGGTTCACGCTCCGAGAAGGAGTGTTGTACCGACGTGGCTTCTCCCTGCCTCTCCTCAAGTGTTCAAGAAATTCAATCCTCTAAACCTACGGGTTCGAGGTGCGATGTGA
Protein sequence
MPPSAAAAMMTNGGWVSIGWFLADPDTDISLPMDCANRLTDRFSRSVHLGPTEVDRVCPLKDEVWAPKLDLIVVRPARNPTGPSSVQVETGDRVRARFVKNRCADSCINIWRRLWERHLKLSRFKKKIYAKMVHPANSANTTEQRGVNADHDPQQDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHPKPSKANRGRRGTSRKTSQRANQAADPEALSALQRELDDMHHRLRTVEEMYAEATRVNRTASPSMAPGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEKQVSIGTKLGATDREELINFLRSNSDVFAWSHEDMPGIDPKIMVGPDRNPRHSFNLRTRCDEGGYSVTHLDGPNRGVHQRKPTARAEGAKEDDEKSSSVHAPRRSVVPTWLLPASPQVFKKFNPLNLRVRGAM
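The protein sequence above is consistent with a frame-0 translation of the CDS under the standard genetic code. The following is a minimal sketch of that check, not the database's own pipeline: the helper name `translate` and the use of NCBI translation table 1 are assumptions made for illustration, and only the first 45 nucleotides of the CDS are used as input.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the page does not state which table was used to derive the protein.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Hypothetical helper: codons containing non-ACGT characters become 'X'.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 45 nucleotides of the CDS listed above.
cds_prefix = "ATGCCTCCTTCAGCAGCTGCAGCCATGATGACAAATGGGGGCTGG"
print(translate(cds_prefix))  # MPPSAAAAMMTNGGW
```

The printed peptide matches the first 15 residues of the protein sequence. Passing the full CDS, with its line break removed, to the same function would allow a complete comparison.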
Homology
BLAST of Moc08g18100 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 945.3 bits (2442), Expect = 4.7e-271
Identity = 523/787 (66.45%), Positives = 560/787 (71.16%), Query Frame = 0
Query: 132 MVHPANSANTTEQRGVNADHDPQQDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHP 191
MV PANS NT ++R + A+H Q+++GA +VE Q + RSAR LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 192 KPSKANRGRRGTSRKTSQRANQAADPEALSALQRELDDMHHRLRTVEEMYAEATRVNRTA 251
KPS
Sbjct: 61 KPS--------------------------------------------------------- 120
Query: 252 SPSMAPGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPS 311
Sbjct: 121 ------------------------------------------------------------ 180
Query: 312 YSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDL 371
KA+S Y P+ P VITREEFD +K KFD QVEALKARCEKK+ SFDDGDL
Sbjct: 181 ----------KAESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDL 240
Query: 372 GESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA 431
GE F +DILEA IPPKFKTP MKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIA
Sbjct: 241 GELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIA 300
Query: 432 LTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVT 491
LTGSARLWYRRL AR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVT
Sbjct: 301 LTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVT 360
Query: 492 RFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLR 551
RF EEQLKV HCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLR
Sbjct: 361 RFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLR 420
Query: 552 TKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTT 611
TKTGRP+K IDQ + +++ + D KS+DKGPSSSS R +YRRS +SRPYE YTPTT
Sbjct: 421 TKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTT 480
Query: 612 IPISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIED 671
IPI EILTNIEE GMEKLLKRPEKLRGDPE RN DKYCRFHRD+GHNT + WELKRQIED
Sbjct: 481 IPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIED 540
Query: 672 LIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNEL 731
LIQDGYFKKFVGKPR +S+EKKEERKR RTPPRR+DRPAVI NK+ EL
Sbjct: 541 LIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVI-------------NKKKEL 600
Query: 732 AREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASA 791
AREARREVCIIREQ+PT SI F ADL G+HLPHNDALVIAPLID VLVRR+LVDGGASA
Sbjct: 601 AREARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASA 646
Query: 792 NILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVV 851
NILSL TYLALGWT SQLKKSPTPLVGFSGES+S EGCI+LPV+I QD TQVTQMAEFVV
Sbjct: 661 NILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVV 646
Query: 852 VDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSS 911
+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGE KTSRECYAS K SS
Sbjct: 721 IDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSS 646
Query: 912 VCALEEQ 919
VCALEEQ
Sbjct: 781 VCALEEQ 646
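The percentages in an HSP header are the match count divided by the alignment length. As a minimal sketch, the snippet below recomputes them from one header line. The function name `check_hsp_line` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches fragments such as "523/787 (66.45%)" in an HSP header line.
HSP_RATIO = re.compile(r"(\d+)/(\d+) \(([\d.]+)%\)")


def check_hsp_line(line):
    """Return (matches, length, reported %, recomputed %) for each ratio found.

    Hypothetical helper: the first ratio on the line is the identity count,
    the second is the positives count.
    """
    results = []
    for matches, length, reported in HSP_RATIO.findall(line):
        computed = round(100.0 * int(matches) / int(length), 2)
        results.append((int(matches), int(length), float(reported), computed))
    return results


header = "Identity = 523/787 (66.45%), Positives = 560/787 (71.16%), Query Frame = 0"
for matches, length, reported, computed in check_hsp_line(header):
    print(f"{matches}/{length}: reported {reported}%, recomputed {computed}%")
```

For this hit, both recomputed values agree with the reported ones: 523/787 gives 66.45% and 560/787 gives 71.16%.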
BLAST of Moc08g18100 vs. NCBI nr
Match:
XP_022155139.1 (uncharacterized protein LOC111022280 [Momordica charantia])
HSP 1 Score: 932.2 bits (2408), Expect = 4.1e-267
Identity = 511/701 (72.90%), Positives = 537/701 (76.60%), Query Frame = 0
Query: 257 PGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREF 316
PGAPGEKG PSIQPG+REPIPND GVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREF
Sbjct: 5 PGAPGEKGAPSIQPGNREPIPNDEGVDYSLRDNDLRKHLTDKKKKASWEPEDSLSYSREF 64
Query: 317 SNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPF 376
SNSNLKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALKARCEKK+ FDD DLGESPF
Sbjct: 65 SNSNLKAQSKYKPLIPEAVINREEFDLMKHRFDEQVEALKARCEKKESPFDDDDLGESPF 124
Query: 377 IADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSA 436
+DI+EAPIPPKFKTP MKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSA
Sbjct: 125 TSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCLAFQIALTGSA 184
Query: 437 RLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEE 496
RLW RRL ARSISTYSQLRKEFI QFS RHYDRKTATHLATIRQKE
Sbjct: 185 RLWCRRLPARSISTYSQLRKEFIGQFSFRHYDRKTATHLATIRQKE-------------- 244
Query: 497 QLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR 556
DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT R
Sbjct: 245 ---------------------DETLTVKLGEEAPATFAEVLQNAKKVIDGQELLRTKTDR 304
Query: 557 PKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISE 616
P+KQIDQK+LSQ++ + D KSKDKG SSS RTEYRRSE G RSRPYER
Sbjct: 305 PEKQIDQKRLSQKKRKDDSKSKDKGSSSSGSRTEYRRSESGPSRSRPYER---------- 364
Query: 617 ILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDG 676
CWELKRQIEDLIQD
Sbjct: 365 ---------------------------------------------CWELKRQIEDLIQDS 424
Query: 677 YFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREAR 736
YFKKFVGKPR +S+EKKEERKRSRTPPRR DRPAVINTIFGGP+GGQ NKR ELA EAR
Sbjct: 425 YFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQFENKRKELACEAR 484
Query: 737 REVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSL 796
R+V IIREQKPTCSITF D DL G+HLPHNDALVIAPLIDHVLVRRVLVDGGASANILSL
Sbjct: 485 RKVSIIREQKPTCSITFKDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSL 544
Query: 797 PTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKS 856
PTYLAL T SQLKKSPTPLVGFS ESVSPEGCI+LPVTIGQD+TQVTQMAEFVV+DG+
Sbjct: 545 PTYLALRGTRSQLKKSPTPLVGFSAESVSPEGCIDLPVTIGQDSTQVTQMAEFVVIDGRL 604
Query: 857 AYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALE 916
AYNAIF RPIIHSF+AVPS LHQVLKYST NGVGTVRGEQKTSRECYAS LK SSVCALE
Sbjct: 605 AYNAIFERPIIHSFQAVPSILHQVLKYSTPNGVGTVRGEQKTSRECYASALKRSSVCALE 608
Query: 917 EQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEKQ 958
E Q S DLP+E K L P L+L+ +
Sbjct: 665 E-------QTSQDDLPREAKGSSPHQQRSSSLFPCLALKNK 608
BLAST of Moc08g18100 vs. NCBI nr
Match:
XP_022158414.1 (uncharacterized protein LOC111024904 [Momordica charantia])
HSP 1 Score: 852.8 bits (2202), Expect = 3.2e-243
Identity = 446/546 (81.68%), Positives = 466/546 (85.35%), Query Frame = 0
Query: 414 MDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTAT 473
MDFQAATDAIKCRAFQIALTGSARLWYRRL ARSISTYSQLRKEFISQFSS HYDRKTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 474 HLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATF 533
HLATIRQKE ETLREYVTRFQEEQLKV HCSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 534 AEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRR 593
EVLQKAKKVIDGQELLRTKTGRP+KQIDQKKLSQE+ + D KS+DKG SSS+ RTEYRR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 594 SEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHR 653
E G RSRPYERYT +TIPISEILTNIEE+GMEKLLKRPEKLRGD E RNK+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 654 DNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVIN 713
D+GHNT SCWELKRQIEDLIQDGYFKKFVGKPR +S+EKKEERKRSRTPPRR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 714 TIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAP 773
TIFGGPNGGQSGNKR ELAREARREVCIIRE KPTCSITFGDADL G+HLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 774 LIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLP 833
LIDH LVRRVL+DG GCI+LP
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 834 VTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVR 893
VTIGQDATQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYST N VG VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 894 GEQKTSRECYASELKGSSVCALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLS 953
GEQKTSRECYAS LKGS+VCALEEQ N GK QES DLPKEGKRQF PPTEEL+LVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 506
Query: 954 LEKQVS 960
E+Q +
Sbjct: 541 PERQAN 506
BLAST of Moc08g18100 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 850.1 bits (2195), Expect = 2.1e-242
Identity = 433/530 (81.70%), Positives = 471/530 (88.87%), Query Frame = 0
Query: 321 LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADI 380
+KA+S P P VITREEFD ++ + D QVEALKA+CE+K+ +DGDLGESPF +D+
Sbjct: 2 VKAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV 61
Query: 381 LEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWY 440
LEAPIPPKFK P +KPYDGSKDPKDYVEVFE LMDFQAA+DAIKCRAF+IALTGSARLWY
Sbjct: 62 LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWY 121
Query: 441 RRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKV 500
RRL A SISTYSQLR+EF++ FSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKV
Sbjct: 122 RRLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV 181
Query: 501 VHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQ 560
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP+++
Sbjct: 182 AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERK 241
Query: 561 IDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTN 620
I + + ++ D KSKDKG S SSGR EYRR+E G RSRPYER+TPTTIPISEILTN
Sbjct: 242 IGRGRSGKDIENADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTN 301
Query: 621 IEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKK 680
IEE+GMEKLLKRPEKLRG PE R+KDKYCRFHR++GHNT WELKRQIE+LIQDGYFKK
Sbjct: 302 IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKK 361
Query: 681 FVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREARREVC 740
FVGKPR SS EKKEERKRSRTPPRR DRPAVINTIFGGP+GGQSG KR ELAR ARREVC
Sbjct: 362 FVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVC 421
Query: 741 IIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYL 800
IIREQ+PTC ITF ADL +HLPHNDALVIAPLIDHV+V RVLVDGG SANILSLPTYL
Sbjct: 422 IIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYL 481
Query: 801 ALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFV 851
ALGWT SQLKKSPTPLVGFSGESV PEG I+LPVT+GQD TQVTQMAEFV
Sbjct: 482 ALGWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc08g18100 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 823.9 bits (2127), Expect = 1.6e-234
Identity = 444/635 (69.92%), Positives = 494/635 (77.80%), Query Frame = 0
Query: 318 NSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFI 377
+SN +A+S + P P+ VITREEFD ++ K + QVEALKA+CE+K+ +DGDLGESPF
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 378 ADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSAR 437
+D+LEA P +K YDGSKDPKDYVEVFEGLMDFQAA+DAIKCRAFQIALTGSAR
Sbjct: 62 SDVLEA--------PTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 438 LWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQ 497
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 498 LKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 557
LKV SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 558 KKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEI 617
++ ID+ + ++ + D+KSKDKG S SSGR E+RR+ G RSRPYER+TPTTIPISEI
Sbjct: 242 ERGIDRGRSGKDE-KADLKSKDKG-SFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEI 301
Query: 618 LTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGY 677
LTNIEE+GMEKLLKRPEKLRG PE RNKDKYCRFHR++ HNT WELKRQIEDLIQD Y
Sbjct: 302 LTNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDY 361
Query: 678 FKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREARR 737
FKKFVGKPR SS EKKEERK SRTP RR DRPAVINTIFGGP+GGQSG+KR ELAR ARR
Sbjct: 362 FKKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARR 421
Query: 738 EVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLP 797
EVCIIREQ+PTC ITF ADL +HLPHNDALVIAPLIDHV+VRRVLVD G SANI+SL
Sbjct: 422 EVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLL 481
Query: 798 TYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSA 857
TYLALGWT SQLKKS TPLVGFS ESV PEGCI+LPVT+G D TQVTQMAEFVV+DG+SA
Sbjct: 482 TYLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSA 541
Query: 858 YNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEE 917
YNAIFGRPIIHSFRA+PSTLHQVLKYST NGVG VRGEQ SRECYAS LKGSSVCALE
Sbjct: 542 YNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALET 570
Query: 918 QINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLL 953
++ E +LP +R+F+ PTEEL+LVPLL
Sbjct: 602 LVSRDGTLEFKANLP---RREFAAPTEELELVPLL 570
BLAST of Moc08g18100 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 945.3 bits (2442), Expect = 2.3e-271
Identity = 523/787 (66.45%), Positives = 560/787 (71.16%), Query Frame = 0
Query: 132 MVHPANSANTTEQRGVNADHDPQQDLGARIVEDQVRAGQEGDLPRRSARHANQELPPAHP 191
MV PANS NT ++R + A+H Q+++GA +VE Q + RSAR LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 192 KPSKANRGRRGTSRKTSQRANQAADPEALSALQRELDDMHHRLRTVEEMYAEATRVNRTA 251
KPS
Sbjct: 61 KPS--------------------------------------------------------- 120
Query: 252 SPSMAPGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPS 311
Sbjct: 121 ------------------------------------------------------------ 180
Query: 312 YSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDL 371
KA+S Y P+ P VITREEFD +K KFD QVEALKARCEKK+ SFDDGDL
Sbjct: 181 ----------KAESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDL 240
Query: 372 GESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIA 431
GE F +DILEA IPPKFKTP MKPYDGSKDPKDYVEVFE LMDFQAATDAIKC AFQIA
Sbjct: 241 GELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIA 300
Query: 432 LTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVT 491
LTGSARLWYRRL AR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVT
Sbjct: 301 LTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVT 360
Query: 492 RFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLR 551
RF EEQLKV HCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLR
Sbjct: 361 RFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLR 420
Query: 552 TKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTT 611
TKTGRP+K IDQ + +++ + D KS+DKGPSSSS R +YRRS +SRPYE YTPTT
Sbjct: 421 TKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTT 480
Query: 612 IPISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIED 671
IPI EILTNIEE GMEKLLKRPEKLRGDPE RN DKYCRFHRD+GHNT + WELKRQIED
Sbjct: 481 IPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIED 540
Query: 672 LIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNEL 731
LIQDGYFKKFVGKPR +S+EKKEERKR RTPPRR+DRPAVI NK+ EL
Sbjct: 541 LIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVI-------------NKKKEL 600
Query: 732 AREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASA 791
AREARREVCIIREQ+PT SI F ADL G+HLPHNDALVIAPLID VLVRR+LVDGGASA
Sbjct: 601 AREARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASA 646
Query: 792 NILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVV 851
NILSL TYLALGWT SQLKKSPTPLVGFSGES+S EGCI+LPV+I QD TQVTQMAEFVV
Sbjct: 661 NILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVV 646
Query: 852 VDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSS 911
+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGE KTSRECYAS K SS
Sbjct: 721 IDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSS 646
Query: 912 VCALEEQ 919
VCALEEQ
Sbjct: 781 VCALEEQ 646
BLAST of Moc08g18100 vs. ExPASy TrEMBL
Match:
A0A6J1DPC9 (uncharacterized protein LOC111022280 OS=Momordica charantia OX=3673 GN=LOC111022280 PE=4 SV=1)
HSP 1 Score: 932.2 bits (2408), Expect = 2.0e-267
Identity = 511/701 (72.90%), Positives = 537/701 (76.60%), Query Frame = 0
Query: 257 PGAPGEKGVPSIQPGDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREF 316
PGAPGEKG PSIQPG+REPIPND GVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREF
Sbjct: 5 PGAPGEKGAPSIQPGNREPIPNDEGVDYSLRDNDLRKHLTDKKKKASWEPEDSLSYSREF 64
Query: 317 SNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPF 376
SNSNLKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALKARCEKK+ FDD DLGESPF
Sbjct: 65 SNSNLKAQSKYKPLIPEAVINREEFDLMKHRFDEQVEALKARCEKKESPFDDDDLGESPF 124
Query: 377 IADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSA 436
+DI+EAPIPPKFKTP MKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSA
Sbjct: 125 TSDIMEAPIPPKFKTPTMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCLAFQIALTGSA 184
Query: 437 RLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEE 496
RLW RRL ARSISTYSQLRKEFI QFS RHYDRKTATHLATIRQKE
Sbjct: 185 RLWCRRLPARSISTYSQLRKEFIGQFSFRHYDRKTATHLATIRQKE-------------- 244
Query: 497 QLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR 556
DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT R
Sbjct: 245 ---------------------DETLTVKLGEEAPATFAEVLQNAKKVIDGQELLRTKTDR 304
Query: 557 PKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISE 616
P+KQIDQK+LSQ++ + D KSKDKG SSS RTEYRRSE G RSRPYER
Sbjct: 305 PEKQIDQKRLSQKKRKDDSKSKDKGSSSSGSRTEYRRSESGPSRSRPYER---------- 364
Query: 617 ILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDG 676
CWELKRQIEDLIQD
Sbjct: 365 ---------------------------------------------CWELKRQIEDLIQDS 424
Query: 677 YFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREAR 736
YFKKFVGKPR +S+EKKEERKRSRTPPRR DRPAVINTIFGGP+GGQ NKR ELA EAR
Sbjct: 425 YFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVINTIFGGPSGGQFENKRKELACEAR 484
Query: 737 REVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSL 796
R+V IIREQKPTCSITF D DL G+HLPHNDALVIAPLIDHVLVRRVLVDGGASANILSL
Sbjct: 485 RKVSIIREQKPTCSITFKDTDLEGVHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSL 544
Query: 797 PTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKS 856
PTYLAL T SQLKKSPTPLVGFS ESVSPEGCI+LPVTIGQD+TQVTQMAEFVV+DG+
Sbjct: 545 PTYLALRGTRSQLKKSPTPLVGFSAESVSPEGCIDLPVTIGQDSTQVTQMAEFVVIDGRL 604
Query: 857 AYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALE 916
AYNAIF RPIIHSF+AVPS LHQVLKYST NGVGTVRGEQKTSRECYAS LK SSVCALE
Sbjct: 605 AYNAIFERPIIHSFQAVPSILHQVLKYSTPNGVGTVRGEQKTSRECYASALKRSSVCALE 608
Query: 917 EQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLSLEKQ 958
E Q S DLP+E K L P L+L+ +
Sbjct: 665 E-------QTSQDDLPREAKGSSPHQQRSSSLFPCLALKNK 608
BLAST of Moc08g18100 vs. ExPASy TrEMBL
Match:
A0A6J1DZB9 (uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024904 PE=4 SV=1)
HSP 1 Score: 852.8 bits (2202), Expect = 1.5e-243
Identity = 446/546 (81.68%), Positives = 466/546 (85.35%), Query Frame = 0
Query: 414 MDFQAATDAIKCRAFQIALTGSARLWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTAT 473
MDFQAATDAIKCRAFQIALTGSARLWYRRL ARSISTYSQLRKEFISQFSS HYDRKTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 474 HLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATF 533
HLATIRQKE ETLREYVTRFQEEQLKV HCSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 534 AEVLQKAKKVIDGQELLRTKTGRPKKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRR 593
EVLQKAKKVIDGQELLRTKTGRP+KQIDQKKLSQE+ + D KS+DKG SSS+ RTEYRR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 594 SEGGSIRSRPYERYTPTTIPISEILTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHR 653
E G RSRPYERYT +TIPISEILTNIEE+GMEKLLKRPEKLRGD E RNK+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 654 DNGHNTISCWELKRQIEDLIQDGYFKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVIN 713
D+GHNT SCWELKRQIEDLIQDGYFKKFVGKPR +S+EKKEERKRSRTPPRR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 714 TIFGGPNGGQSGNKRNELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAP 773
TIFGGPNGGQSGNKR ELAREARREVCIIRE KPTCSITFGDADL G+HLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 774 LIDHVLVRRVLVDGGASANILSLPTYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLP 833
LIDH LVRRVL+DG GCI+LP
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 834 VTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVR 893
VTIGQDATQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYST N VG VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 894 GEQKTSRECYASELKGSSVCALEEQINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLLS 953
GEQKTSRECYAS LKGS+VCALEEQ N GK QES DLPKEGKRQF PPTEEL+LVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 506
Query: 954 LEKQVS 960
E+Q +
Sbjct: 541 PERQAN 506
BLAST of Moc08g18100 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 850.1 bits (2195), Expect = 1.0e-242
Identity = 433/530 (81.70%), Positives = 471/530 (88.87%), Query Frame = 0
Query: 321 LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADI 380
+KA+S P P VITREEFD ++ + D QVEALKA+CE+K+ +DGDLGESPF +D+
Sbjct: 2 VKAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV 61
Query: 381 LEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWY 440
LEAPIPPKFK P +KPYDGSKDPKDYVEVFE LMDFQAA+DAIKCRAF+IALTGSARLWY
Sbjct: 62 LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWY 121
Query: 441 RRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQLKV 500
RRL A SISTYSQLR+EF++ FSSRHYD+KTATHLATIRQKEGETLREYVTRFQEEQLKV
Sbjct: 122 RRLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV 181
Query: 501 VHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPKKQ 560
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP+++
Sbjct: 182 AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERK 241
Query: 561 IDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTN 620
I + + ++ D KSKDKG S SSGR EYRR+E G RSRPYER+TPTTIPISEILTN
Sbjct: 242 IGRGRSGKDIENADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTN 301
Query: 621 IEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGYFKK 680
IEE+GMEKLLKRPEKLRG PE R+KDKYCRFHR++GHNT WELKRQIE+LIQDGYFKK
Sbjct: 302 IEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKK 361
Query: 681 FVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREARREVC 740
FVGKPR SS EKKEERKRSRTPPRR DRPAVINTIFGGP+GGQSG KR ELAR ARREVC
Sbjct: 362 FVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVC 421
Query: 741 IIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLPTYL 800
IIREQ+PTC ITF ADL +HLPHNDALVIAPLIDHV+V RVLVDGG SANILSLPTYL
Sbjct: 422 IIREQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYL 481
Query: 801 ALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFV 851
ALGWT SQLKKSPTPLVGFSGESV PEG I+LPVT+GQD TQVTQMAEFV
Sbjct: 482 ALGWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc08g18100 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 823.9 bits (2127), Expect = 7.7e-235
Identity = 444/635 (69.92%), Positives = 494/635 (77.80%), Query Frame = 0
Query: 318 NSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFI 377
+SN +A+S + P P+ VITREEFD ++ K + QVEALKA+CE+K+ +DGDLGESPF
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 378 ADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSAR 437
+D+LEA P +K YDGSKDPKDYVEVFEGLMDFQAA+DAIKCRAFQIALTGSAR
Sbjct: 62 SDVLEA--------PTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 438 LWYRRLTARSISTYSQLRKEFISQFSSRHYDRKTATHLATIRQKEGETLREYVTRFQEEQ 497
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 498 LKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 557
LKV SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 558 KKQIDQKKLSQERMRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEI 617
++ ID+ + ++ + D+KSKDKG S SSGR E+RR+ G RSRPYER+TPTTIPISEI
Sbjct: 242 ERGIDRGRSGKDE-KADLKSKDKG-SFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEI 301
Query: 618 LTNIEENGMEKLLKRPEKLRGDPENRNKDKYCRFHRDNGHNTISCWELKRQIEDLIQDGY 677
LTNIEE+GMEKLLKRPEKLRG PE RNKDKYCRFHR++ HNT WELKRQIEDLIQD Y
Sbjct: 302 LTNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDY 361
Query: 678 FKKFVGKPRFSSIEKKEERKRSRTPPRRNDRPAVINTIFGGPNGGQSGNKRNELAREARR 737
FKKFVGKPR SS EKKEERK SRTP RR DRPAVINTIFGGP+GGQSG+KR ELAR ARR
Sbjct: 362 FKKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARR 421
Query: 738 EVCIIREQKPTCSITFGDADLGGIHLPHNDALVIAPLIDHVLVRRVLVDGGASANILSLP 797
EVCIIREQ+PTC ITF ADL +HLPHNDALVIAPLIDHV+VRRVLVD G SANI+SL
Sbjct: 422 EVCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLL 481
Query: 798 TYLALGWTWSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSA 857
TYLALGWT SQLKKS TPLVGFS ESV PEGCI+LPVT+G D TQVTQMAEFVV+DG+SA
Sbjct: 482 TYLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSA 541
Query: 858 YNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGEQKTSRECYASELKGSSVCALEE 917
YNAIFGRPIIHSFRA+PSTLHQVLKYST NGVG VRGEQ SRECYAS LKGSSVCALE
Sbjct: 542 YNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALET 570
Query: 918 QINHGKQQESGTDLPKEGKRQFSPPTEELKLVPLL 953
++ E +LP +R+F+ PTEEL+LVPLL
Sbjct: 602 LVSRDGTLEFKANLP---RREFAAPTEELELVPLL 570
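The percentages in each HSP summary line are the identical or positive counts divided by the alignment length. The sketch below shows one way to parse such a line and recheck the arithmetic. The names `parse_hsp_summary` and `percent` are hypothetical helpers written for this illustration, not part of BLAST or of this database, and the pattern assumes the line layout shown in the reports above.

```python
import re

# Hypothetical pattern for the HSP summary lines shown above.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling
# that some report renderings emit.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Extract counts and reported percentages from one summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, identity_pct, positive, _, positive_pct = match.groups()
    return {
        "identical": int(identical),
        "positive": int(positive),
        "aligned_length": int(length),
        "identity_pct": float(identity_pct),
        "positive_pct": float(positive_pct),
    }


def percent(count, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    line = "Identity = 511/701 (72.90%), Positives = 537/701 (76.60%), Query Frame = 0"
    hsp = parse_hsp_summary(line)
    # Recompute both percentages from the raw counts and compare them
    # with the values reported in the line.
    for key, pct_key in (("identical", "identity_pct"), ("positive", "positive_pct")):
        recomputed = percent(hsp[key], hsp["aligned_length"])
        print(key, recomputed, abs(recomputed - hsp[pct_key]) < 0.005)
```

The Identity column of the summary table reports the same identity percentage, so this check can also be run against the table values.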
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A6J1DHB3 | 2.3e-271 | 66.45 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1DPC9 | 2.0e-267 | 72.90 | uncharacterized protein LOC111022280 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A6J1DZB9 | 1.5e-243 | 81.68 | uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J1C7X5 | 1.0e-242 | 81.70 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1D9E1 | 7.7e-235 | 69.92 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018...
Match Name | E-value | Identity | Description | |