Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTCCATGCGATGCTGAGATGCTCCACATATCTAACGGGAGATGGATTTGTCTTCAGAGGTGCTCGCCTGCGCCTCGCTGTTCGTCGAACTACGAGCCCTTCTTTTTGTTCGGGGCGAACGATAAGGAATTATCACTCTCACGGGTGGTGTTCTTGATTGGATGCTGAAGGTGGAGAAATGGCGGCTGAGGTCCCACTCCCTTCTGCATTACTCTTACTGGTCGATCGACACGTGATAGTTTTGATACATAAATATTTAAGGTTGTACGGCATGTCTTTTAAAATTTGTTTTAATTTCTGTCTTTAATTTTAAAATTTTGTAATTCCATTTATAAATGTTTAATATAATTTTGATTAAAACAACAATTATTAATATAAAATAGTATTAAATTTTATTGTTAATGTATTTAATAATAATGAATTACATTTTTAACAATTTATTTGGTATTAACCAATAATAACTTATTAAAATATAATTACATCAACTAAACAAATATTAATGTAGTTTTTTATTAGAAAAACAATTATTGATATAAATTATTATTAAATTTTATTGTTATGATATTTAGTAATAATTAATTATATTTTAAAATTTTAATTGATAATAATTAATAACAACAAAGTAAAATAAAAAATTAGTATGATTTTTTTATTAAAATAAAAATTATTGGTAATATAAATAAGAGATAGTAAAAGTTGTAAAATAATTGATAAAAATGTTGTAAGTCCCACTAATAGATGAGAGATAAGTGATAAATGTTGTAAAGTAATAAATGAAAACGTAGTAGGGCCATAGAATAAATAATGATAATGAGATTGATGAGATTGTAGAATATGAGTGAGATTACCCTTTTTTGGGGAAGTGATTCATAAATTGGTGGTAGGTCTCACAAATTGACTAAATTCTTGCATTTTAGTAAACATCATCAAGAGAATGAATTAATTCTCATTCCTCCATTCCAATTCATTTTAGTATTTTTTAAATTTTCATAACTACTTTTGATTTTTCAATATTATGAAAATTTTGAAATATATTTTTAAAGGGCTGAAAAGCACCTTTTACATTTCAAAAAGTATTCTCCAAACTCATTTTCTACAACAATGGCCAATTTTGAGATAGGTTCATCATTATATTATGTGAAGTGGGTAACTTTGCAATTTGTTTAAAAGAATGTGTGATAAAATTAACAGTGTGCACCTACCTCGTTACCATACATAAAGGTTCAAAATTTGATGCATTCTATTTGATACATCTCAATAATCCTTAACCTTCTATGGTAGCGTGGTTGAATAAGATGTATCCATATGCACGCTCTCCTCTAAAAAAACCTTAATTCTCTTTGAGGTTTCTACCGAATTTTTCCTCTTTGCATTGCTCTTCTGTTTATAGCTAAGGGTGTCACTTTTTCCCGTGGGGACGGGGCCTCGCGAGGACCTGCCCCTAACGGCAGGAGATTACCCGCTTTGAGCGAAGACGGGGGAAGATTTTTCCCCATTTGAATTTCGAAGACGGTGACGGGGATTGGGGTCTCCATCCCCGTCCCCAACCCCGACCCCGCCCCGTTCCCTGCTTAAATATATATATTTTTAATTTGTTTATAATAGATATATATATATATATATATATATATATATACATACACACGTATTATAACAACTATTATTTTTGTTTTTTACTTTTTCATAAGAGATTGTTGTTTATGAGAGATTGTTATGATGTTTAAGTTTTAAAATTTATTTGTATTGATAATTTGATATTTAAGACTCACGAATAAAATATTTCGACTAAAATAAAATATTTATTAGATTTTAAAAATTAAAATAATATTTTTATTTTTATTAAATTGTTTAATAATTTAGCTATTTACATGAGGATTTTCACAACAAAATTTACATAATAAGAATTGTTCATTTTTCTAAAAAAAAAATAAAAATAAAAATTGTAAAACGGGGAAAAATTCGCAATGCTATAATTAGGGGTGTCAACAGGGCAGGGCGGGGCGGGGCGGGGGGGGGGGGGCCGATGGGGATACATATTTCAAGTGAAAAGAACCTCATTTGTGCTAGTTTACTAAAGAAAGTATTTATTCGATCAAATGAGGCATCATAGTGAGTCGGTTCTAATATGTCAATAACAATAAATGCATATTCGTAACCCTACGAGACGGATGTCACTTATTCAAGAGATCTTGGTGCAAATTCCCCGATAACTCCTTGAGGTCGAACTTGTTGCGAATGCGCAAATATAGTTCCACATTAACTAAAAGTGAGAAAGACTCTAGTATATAAGTAATTAGAATATCTCAATCGATGATTCAAATATTTGTTTTATTTTATGTTCTAAAAAAAATCTTTTATTTTGATTGTTTAAAAAACTTGGATTATTTTTTCCAAAAAAAAAAAAGAGGGGAAAGATTATATTTCAGTTTTGTTGTTAGATTTGTGTACATTCCAAAAAAAAAAAAAAAGAAGAAGAATAAAAGAAAAAAAAAAAACCCCGTCAAAGACATCAAAGAAAGAGTGTGGCGGGGAGTGTATTTTTAGAGGGAAAAAGAAAAGCAACAAACAAAAATCCTAGGGTTTATGATTCAGACCGTTTGAATTGAAAGCTTTTCGATCTATCCTCCAATTGTCGAGGAGATGTCCAACAACAAGAGTTTTTCCATGGCCTCTGTTTTCGGTTCACTGTCTGAAAGCTTTTGGAGAGTGCCTCCAATGGCTGTTCCAGCTATTTTGGATTGCATTTTAGCTTCTACCGGGTTATCCCCATCCGATCTTTTTGCTTCGCTTCTTGAATCTTTTCCGAACAATATTGATGTACTACCCTGAAAAACCTCACTTTGTCAACTGTATATAATTTTGGTTTGTGAAAAATGCAGGCAAACACATACAAACTACTTGATTATGGGGGCAAATAACTTTATGTGTGTGTGCTGTGTATATTTATGTATGCATGTATGATGCTCATCATTGAAACTATATTATATATATTTATTTATATATGTCTGTAGCATCTATACATGTCGTTCTTGTTATGGGTTTTCCTATTGCTGACTTCCCTTAACTCGAACTAGAGGATGGCTGTTATTTCATAACTCTTAACTTTAGTGAGAAGGAAGGTCAGAATTGAACCAGCAAAGAGATTTGATAAGTCAATAAATTGTCTAATAATGGTGGAAAATGACATGGGCTGAGAGATAATTGAAGTCTCTGTCTAAGCCGGGGTAGTTGTTTTGGATAGGATGTCACCACGAAGGAGGGAAAGCTTGATGCTGATCAATGTAATTACATCACATCTTTGGTGTGCGCGCTGTGCCATATACTTAAAAAAAATGGTCAGTTATCATCTGCACTCATAGTTTTACAAATGGTTTTCCTTTTTAATCTGCATGCCAGGATCCTGTATTAAAAACGAGGAATGTCAATTATTTCATATGCTATGATTTATATCTTACAATTTATTAGCGACAACTCTACTGTCAAAATCCAATTGATTTATTGGTCCTCTGTCAAAATTGAAAAAAGAAAAGAAAAGAAGTTTTACATGTCTTTGTACTCGGTTCGAATGGAGGTTTGCCATATATTCTGTAAGGGTTTTCTTTTGGGGCATAAAGAGATGTTTAATCAATCCTGTTTGTTTGCCCACACGTCACAATGGATCTGAATCATTTGAATACTGTACTTTTTATGACTTGAGAAGTTTTGTTTTACTATGAAATGACTCTTGTTTCTATCCAAGCTGACTAGTGTGCCCTATTACTCCAGCAATTAGGATTAGTTGGATAGAATGTTTTAGGTTGTAATGTGCGGTTAGTCATGTAGTGAACTGATTTTATCTGCAGGTGCTGATCCTGATGCTTTGAAGTCATTTATATGGAAATGTTTCGTTCCTTTGATAAATAGGGCGATTGCATTTAATCGGGAAATGCTTAACCAGGTACTATTTTTTGAAAGCCAATTTTACTAGATGATTAGATGATAATGGGGAGGATACTTACTTTTGGGAGGATAGATGGTTGGGGGATAAACCTCTTTGCTTTTTGTTTCCTCGTTTGTACTCCCTTGCCTCTTCGAGGTTCGCTTTGGTGGCTTTTTGTCTTATCATCGACAAGTCACTTCTCTTCTTTGTCCCTTTCTTTTGGCTTTCGTCGTCCTTTGATTGGTAGGGAGACGACTGGATGTTATGACCCTTTTATCCTTGTTGGGAAACTTCGCCTTTCGTTTGGGGAGGAGGGATTTGCACATTTGGGATCCTAGTCCTTCGAAGGGTTTCTCTTGTAAGTCCTTTTTCTTGCATCTTCTGGATGCCTTGCCTCTAGCTCCCCCATCTTTTCCTCGTAATGGAAGGTTAAAATTCCCAAGAAGGTCAAATTTTTTGTTTGGTAGGTTTTGCATGGTAGAGTTACTACCCTCGATCGAGTTATGAGGCATTTGTCCTTTCTTATTGGGCCACAGTGTTGCATTCTTTGTAAAAGAGCAGCTACGGATATTGATCATTTGTTGTGGAGTTGCCAGTTTGCTTATTCGGTGTGGAATTCCTTCTTTTCGGCATTTGAGGTGAGCTTGGCGCATAATAGACCGTGCAGCTCGATGTTGGAGGATATCTTTCTACATTTGCCTTTTAGGGAGTAGGGTCATTTTTTGTGGCTTGCAACCTCCTTGGCTATTGTAAGGGGTTTATGGATGGAGAGAAATAATAGAATTTTTAGGGGGACTGAGTGTTCTGGTGAGGAGGAGTGGGATCTTGCTAGGTTTCATGCCTCCCTGTGGACTTCGGTTTCCAAGGCTTTTTGTAGTTACCCTATAGGTCTCATTTTGTTAGATTGGTGCCCTTTTTTGAAGAGCTTGGGTTTGGCTTCTTTTGTGGGTTGGTTTTTTTGGATGCCGCTTTGTATTCTTTCATTCTTCTCAATGAAAGCTAAGTTCTTCATCCAAAAATGATAATTGGACCTTCAATTTCCTAATATGTTGTTGCTTTCTGCTAATTATCTTACTGACTTCATTTCTTGTATTAGGTCGCTGAATCATTCATTGATGTCGTCATTGAGATGAACTCGTGGCCAATTGTTGAAGCAACTCTAATTCCATTCTGTATAAGTTCAGCTCTTTATTCCACTAGCGTGCTGCAAAATGAAGAGTTTGACACCTTTGAGGGCGACAGATGTTCTGTCATTTTGGGCTCAAATGGCCCAATAAATGAACCTAGAATGGATAAACAGATGATGAAAGCATACGGGTTCCTTCCACTACCATTAGCATGCCATATTTTGGCTATAATGTTAGATGCTGTCCTTTGTAATAGACATGCACCACAAATATCAGATGCAGTGGTGGCAAATGGATGTCAAAAAGCTGAAGAATTTACTGTTAAACTGATTTGGGATATTTGCAATTTATCTGAACAAATGCTTTTACAAAGCTCAGATCATCGATCTTGTACCATTCGCTATCTTCTCCCATTAATCTTTGAAGTGCTTCTTTCTCACCACTCTCTAGAGATCTCCATTCAAGGGCATGTGCATAATCTCTCCAGGTTAGGCTTACAAGTTCTCTAAACTCTATGATCCAGACATGTACCATCTCTTTGTTTTAAGAATTCTATTTCACCTCATTATTCTCTTAATGCATGTGATTATCTTACAGGAATCGTTTTCTCATGAAAATATGGAAATGTTGCAAAAAACTATTTTCATTTGGAACTTTGGAGAGAAGAGATGCCTATAAGATTTTGTCTCTTTATTTATGTTTTTTCCCTCACAATGAAGAGCTTGGAGGTGCTGGAATCTGTGACGACGCAGAAGAATTTGACATAAAGGCTGATAAAGATTTTTGGGATGCAATTAAAAGAGGCTTGGTAGTCAAGCATACCTTTGATTGTTTCTCATTGATTGAGCTTCATTTGCTTGTACAATTAGGCATTTTAGAATGACTTGAAACAATTTTATTTCATGTTCGTTTCTAGAAAAAATATATTTCACATTTGTGGTTGGGTTGATTTTGTTTGTTTCTTGAGCCTGGTTAGGTGGATAAGGAGGGCTTGGTGAGGAAGCAGTCACTACATATATTGAAGAAAGCACTATATATAAATGGAAGAGGCAATACATCTAGGGTTCCAAAGACAATTTCAAGTGGGAAAGATAATAATGCTCGAGGTATTACAAAAAGGGAAAGATGGGCCCACAAGGAAGCAAAATCACTTGGTGTAGGGCAAATTTGCAGCGAAAGTGAAATTGTTATAAATAGCCAGCAGCAACAGTGGGAAGCATTCATCCTTCTATATGAAATGCTTGAAGAATATGGTTCACACTTGGTCGAAGCTGCTTGGAATCACCAGGTTCATTCTCATTTATTTAATGTCCCCGCCCTCCCCCCCACTACGTCTTTTATTGGCATTTAGTCTTAGGAGATGTCTAGTTCTCCTTTACTTTTTATACAATGTTATAAGAAAAAAAAAACTTAATGTCTACTTCATGTTGTGTATATATGTGTGTGTGTGTGTGTGTATCGTGTATGTTTGTATGGTTGTACGTATGGTACTATTACATCCACTCCTTTTCTACGTTAATATATAAATCAAATAAAGTCTCACCTTGACAAAGCACTGTTTAGATGAGCTAATGTGGAGGAAGCAAAATTTATTATGTCAGTACAAAAGGTGGTCGACTTTGATCATTGCCGTTCTTTCCAGTGTCTCATGTTATTATCTTTCTGTTTTCAGAATTCTGGTGGAAGTTTGCAAGAAGCTGGAAAAGATTTTAACTTCTTTTGCATGATGATGCCCAAGTTTACTGTCAACATAGTATATAATCTATAATTCTATAATATATTAAAAAGGGGCTTGAGGAGAGACATTTGATTTTCCCTTTTGGCCCTACTTTAATATTTATTTCTAATTTATCTAATAAATGGTATTGATTGCTTTAATAAAACATTTCATTATGTCAATTTGCCCAGTGGCCAGGGGCAATGTCTAAAGATGACATTTTTTCGATAAGAGGATCATCCTCAAGCCATACTGTTAGGCATGAACTGGAGAACTATTTTTCTTAGCAGATAGTTGTTCCAATGAAAATAGAGGAATATTGTAATATGTTTTAATATATTGTTTTATCAAGTTTTTCTAAATTAATTAAAAATAAACTTCAAAGTATTGATTATTATTGATATGCTACCAAATAGTATAATTATTTTAAATAAATGTAATCAATTTGTATTCTTGTTGAAAAGTGAAAGTAGTTATTTAGTGTATAATTTACTAACTTAATTGTTATCTATAAAATATTATAAATCAAGATCATTTAAAAATTTAAAACCTGTGTGTTTCATGTTGCTTTATTGTTGATTCTAACTACTAATTTTTTCTGGCAGATATTCTTGTTACTACGAGATCCGACCTCAATTAATTTTGACAGCGTCACTGGTGGCATTCATCAAAACCAAATTGATATGTCTAGCGAAATCTTTAGTTGGTTATCAATCTTGTGGGTTCGGGGCTTCCACCATGATAATCCTTTAGGTACCACTGGTTTGTTCATGAGTTAATTGTATGGGTGGGAAAACTAGAACTCTTGCTGTATGTCTGGCATCTTTTTGGATTAAAATTTTACGAGATTTGCTCAATGAATTATGTCATCCCTTCTTGTTTAATGCAGTTAGATGCTTGATCATGCAGTCCTTTTTGGCCATTGACTGGAGGAATTATGTACCCTGTTTAAAGTCATTGCCAGAAACTTTGATCATTGGACCATTCATTGAAGCACTAAACGATCCTGTGCAGCACAAAGATTTCGGTCAGTCCATGATATATTTACGTCCTCAGGGCTATCTTCATGCTTTAAGTGTGCTTGAACCTGTTATGTTGAATCAAAAGATTATTCTTACATCACATTAAACAAATTGCTTAAAACATTGTCCACATGAACAAAATTTTTTCTCACATTTGGATGGGCTATAATTTGTGGCGGATAGAAGTGGGGATGTCCAAATTTCCTAGTCATTATTTTAATGGTATATGGAGTGGAGCATTATGGCTCCAACTTAAAGATGATATGATGACGTATCATATTTATTCTCTTGATTGTTGAAGGTGTAAAAGGAGTTTACTCATCTAAGACAATTGAAGGTGCAGCCCATTTTATACGCCAATATGCAAATTGTCTTGATGCAAGGTATGATATAATATGCTTTTTGGTTGAAAACAAATATTTGACGGGGTATGTCTATGTCCAACCCTTTTTTTTATGAAAAACAAAACTTTCAACAAGAGCTCACTACAAAAATGTAGTGAGATTTTTGGTTTAATTATTTTAGTTTTTTTAAATCTTTAGTTAGTTGTTAAATCCTGTATAAATAATAAAGGGTTAATCTCTTGTATTAGCATACTTTGAATCATAATAAACTCTTTGGATTAGAGTTCTTGGAGAATTCCCTCCTTTGTTTATTTAGGCTACACCAAAAAGGAGCCAGTACAAGGAAACCTGACTAAGTATTAAAAAAATGGGCAATAGTTATTAAGAATAAGATTTGGCTTGACCATCCTTCCTTAATCGAATATCTTGACAACTGCTGGAAAGAAAGTCCTTCTGTAGGGTGGGCAGGCTTTTGCTTCATGAGCAAATTGAATTGTGTAAAGAATAAACTCAAAGTGTGAAATCGGGAAGTGTTTGGCAATATCAAAGACGAGAAGCTCTATTTAACCTCAAAGATCGATAGTCTGGACCGGTTGGAAGAGTCAGCCCCCCTTAACTCAAATCAAATCGCCGAAAGATTTAAGGCTAAGGCTATTTTGAACGACTTGATTCTCAAGGAGCAACGATTGGGGATGCAAAAGGCTAAGCTTAAGTGGCTAAAAGAGGGTGATCTTTACTCCAAATTTTTTCACAGATGGGTTTCTTTCCGTGCTAGCAAGAATTCCATTTCTAATCTAATGAGTAGAGATGCAGAGTTTCTTTCTAAACAAGAGGATATTGAGAAGGAAGTGCTGGATTTCTTCTTTTCTTTATATAGAACACAGATACACAACGTTTTGTTCTTGAAGGAGTGGAGCGGCAGCCCATCAGTTCTGATCAAGTGGCTTGGCTGGAAAAACCCTTTGAAGAATTTTCTTGGCGATCAATCAGCTGGGCAATAAAAAATCCCCGGGTCCTGATGGCTTCACTAGCGAATTTTTGAAAAAAAACTTGGAACATCACTAAATCAGATTTACTTAGGGTGTTCCAAGAGTTTTTCAATAGTGGTATTATCAATAAAAGGACCAATGGGACCTATATTTGTTTGATCCCTAAGAAAAAGGGAGTGGTGAAGGTGAGTGATTTCAGACCCATTAGCCTTGTTACCTCCCTTTATAAAGTTATTGCGAAAGTGTTGGCTGAAAGGCTGAGGAAAGTCTTCTCGACATGATTGATGCCTCCCAAACAACCTTTGTCAAAGGAAGGCAGATCCTAGATGCTATTCTCCTTGCAGCCGAATCAGTTGATGATTATCTTTCTACGAAGGAAGCTGGTTTCATAATAAAGATGGACTTTGAGAAAGCTTATGACAAAGTAGATTGGGGCTATCTTGATTACATCTTGTAGCTAAAAGGCTTTGGTAAGAAATGGAGAACTTGAGTTTCGGGTTGCTCTCAAATTCTAATTTCTCTATCTTGGTAAATGGTAGATCCAAAGGTAAGCTGTGGGCTTCTAGGGGCATTAGACAAGGGGATCCCCTCTCCCCCTTCCTTTTCACTATTGTGGCTGATTCTTTTAGTTGTTTTGTCAATATTTGTAAGGATAAGGGCTTTATGAAGGGATTCAAAGTGGGAAAGGACAAGGTCGGTCCATATCTCTCATATACAGTATGCTGATACTCTTCTGTTTTGCGAGAGAAGTCCTTCCAAGGTAGAAAATTGGTGCAAACTTCTAGCTTTGTTCGAAAATGCTTCTGGGTTGTCGAATAATATGAAAAAGTTGGCCATTATTGGCATCAATGTAAAAAATTGCGAGGTTGTTCAGCTAGCTAGCCGGTTGGGGTGTAAGGTGGGATCCCTGCCTTTTTCCTACCTTGGTTTACCTTTAGGAGGGAGACCGAAATGTGCTGCGTTTTGGGACTCCATTTTAGACAAAATCAGGGGCAAGTTGGCTAAGTGGAGAGGCTTTCCCATTTCTAAAGGGGGAAGGACAACTTTGATTAATGCCGTTATTTCCAGCCTTCCGAGCTACTATCTTTCTATTTTCAGAATTCCGGTGGGGATATGCAGGAAGCTGGACAAGATTTTTAGGAATTTTTTGTGGGAAGGCACAGAGGAAAATGGTGGAAGCCATCTTGTAAAATGGTTCAAAGTAACCACCCTGCTTTCTTTAGGTGGGCTGGGTGTTGTAAATCTTAGAACTAAGAACCTAGCCTTACTCATTAAATGGTTATGGGGATTTGAAAAAGAAGAAGAAGCTCCCTAGAGGAGGGTCATCAAAAGCAAGTTTGGGATAGATGCTGGTAAGAATCCTTTAGCCCATAAGAAATGGAAAGGTCTTCATAGCCCATGGAAGGATATTGTGAAATCAATCTGCTTCTTCTGGTCAAATACGTGTTTTAAATTGGGGAATGGCAGTAAGATTTCTTTTTGGAAAGGACATTTGGTTCTCTTCTCAGCCGTTCAAGCTTGTTTTCCCCTCCATTTTCCGAGTCTCGAATGCTAAAGATAGCATTGTTAAGGATCTTTGGAATCCCTATTCTTTGACTTGGGATCTGAAGCTTAAAAGGAATCTTTTTGATAGGGAATTGGCTGAATGGGCAATCTCTCCTTGATCCTGGAAAACGTGAGGCTTAATCATCTCTCAGACACTTTATGCTGGAATTTGGAATCTTCTGGGATGTTCTCTGTTAAATCTGGTTCTTTAGCTCTGGAAGAAAATTCAAGATTGCTAGACAAGCTTGTGTGTAATCTGATTTGGGAAGGGAACAGCCCTAAAAAGGTCAAGTTGTTTTTGTGGACAGCAGCTCTTGGCAGCATTAATACCATGGATAAAGTCCAAAAAAGGAATCCCTCCATCCAGCTTTCTCCTCAAATCTGTGTCGTGTAAAAGGAATGAAGAATCTCTTTCTCATCTCTTTATTCACTGTAGCTTTGCTGCTAAAGTTTGGCATCATTTTGGTAATTTATGCGGGATGATGTGGTGTAGCCCTAAGCACTTTGAAGGTTGGCTCCTGGAGGTGCTCTCTGGTTGGTGGTTCAAAGAGAAGTCCAAAGTGTTATGGTCAAACGTTTCGATAGCTATTTTGTGGCTTATTTGGAAGGAAAGGAACTCTAGAATTTTTTTAGATAAATCTAATTTTTTTATCTACTTTTGTGACCTTGTACAGTTCACGGCCTCTAATTGGAGTGTCCGAAACAAGCTTTTTTGTAACTACTCTTCTACTATTATCAACTTAGATTGGAGGGCCCTTTTGTGATCCCCTTGTTTGGGAATAGGGACGTCTTGTCCCTATGTCCCATAGGTTGTAACTTTTCGGTTATTGCCTCTTTTGAATGAATCTTTGTTTCTTATAAAAAAAAGTTATTAAAAATAAGAGACAAATAATAGTTACAAAACTCCTTAAAAGCAGACCCCTAAAGAGAGGTGTTAAACCTTACTAACACACATGACTCTAACATATCTAAAGATTCTCTTATTTCACTCCAACCAAATATTCCACGGAATGTAGTAAAGAAGGCAATACGCCTAAGAGCATCTCCCCTTATCTTGAAATGGTAGGTGAAAGAGGATCTCCTTTATCATCGAACGACACAATCTAGTATAAGCTACACTCACTCCAAAAGTCTTTAAAAAACTATTCCAAATCAAGTACGCAAACTGTCAATTCACAAAATGTGATCCAAGCTCTCAAAAGCTGTCTTGCATAATGCCCCACTTCTGCCCTGACGACAAAGAAACATGCTTGGTGTTATCTTTCCATGAAAGACTTGTTAGTTGAAAATTTTTACCTTCTCGGGGATTCCACCCTTTTATGGTGAGGAATACATAGGCTGGACAAAAGGATGCAAGTTGCACAATAGAAAGACAAAATAACTACAGGAAAAAGAGTCCATCAGCAAACATCCCTAATCCCAAGTCGACAATAAAATCCAACCAATAGATAGAATAATAGAGAAGCATTAGAGGTTTCATTGTTGAACAATAGACATGAAAAACCAAAGTTGAACGAAGTGGAATCGCTCAAATGAGAGAGGACAAAAGCCAACAACGCAAATTTCAAAGAAGAGAGATGATAAAGGCGAGGGAAGCGAAGGTATTGATATTTGCAACATTTAGAATAGGGGAGAAAATGAAGATCTATTTCAATTCTTTGAAGAAGTAGGGATTACTCTACTTGAAGTTAAATCTCGAGGATGATCATAGTCTTTTTGGAAAAGGTGGGCCAGGTGCCAGGTGTCTAGATGTTTTCAGTATAAGATTAGAGGTTTTGCTTATATGGAAAACAACTCTTTGGTTGCCAAATTTTTAGATTCTTTTTTTCTATGGGGTGTTCTCAATGGACCCTTTTCCTTATGTCTATAGCCAAAGATATATCAAAGATGGTTCACAGTTGATTGCATTTTCCTCTAATAATGTTGATTTGTGTTCGTCTGTGAATATTTCAGGATCTGCTGGGTTTTCAATTGATTTCTCAAAGCGGATCCACTCTTTATCTCATAGTATTATTCATTCAACTAATAATGATAATACCATTAAAGCTGTTTTAGAAGAAGAATCCTCGGATTTTAGCGACAATCAAGTTGTTAGTCTTAGTAGCATCGATTCAGATTTGGAGTGGAGTTTAGATGGTCTTTCCAAAGATCTAAATGAATTAGCATTGCTTCTTCTTACTCCAGAGAGAAATCCTCCATCTTTAGGGAAGCTCCAAATGATTTTCTTGCTATTATTTGAAAACTTTTTTTTTTTTTTTTTTCCAGTTTGCGTTAGATATCCAATGACCAATCGGAGTAAAGGCTTGATTTCATTGTTCATTGGGGTGCTTATTAGCTCTTGGGAGTTGGAGCAACCTCTTTTGCTGGATTCTTCATGGCTTTTTCCCTCAGAGTTTGTCATTTAGGCAATAGTTCAGTAATTGGTTTTGCATTACGTTGATTTGGTTTCAGCTTCAGTGGTTCTTTAGTTTTTGTAGCCCTGATTCTGGGTTATTTTTGCTGTTTGTTTCTCCCTTGGGTGTCTTGTTCATCAACGTTGTATTATGGTTTTTGTAATTTTTGGGGGCTTTTTCTTAATTGGTTTGGTGGTTTGTTCTTTCACTCCCTTTGGGAGTTTGTATCCTTGAACTTTTTCCTTTCTTTTTATTTATCAATGAAAACTTTGTTTCTTGTTATAAAGGCGAGGGAAGCAAAGGTAAAGACGATTATCCCCCAACTATCCTCCCAAAAAGCCATATTCAATAGTGAATTTTAAAAACTTGGAAAAAAGGGAATCTAGCTAAGATAGCCTTCCAAGGGTCGCTGAAAGTGCCTTTAGCTCCAAAATACGACATCCACTTGTTTGGCTATAGACCATATTTGCTCATTATAAGCCCATGCCACAGGGCATCAGATTCTAACAGATTTTAAAGGTAAGCACCATAACCACTTTCGCATAGTGTCAAATTACGAAGTCTCAAGTTTCCTATTCTTGACCCACCTATATCAATTGGTCTTGAAACAATTTCCCACCTAACCAAATGAGAGCCCTCCCTTCATTTACACCATCCTAAAGTCCTTCTTGATCTTCTCCACCCTATTGTTGATAGAGAAAAAGAGGTCTTAATTAGAGAATTAAAGTAAGTTGGAATCCTGTCTTTTTCCAAGAAGTAAATCATTTTTGAACCTTTTCCACAACAAGGTTCCAAAATGACAAAAGTTTGTGGTTGTGCACGAGAGGGAAACCTAAGTGCTAGGGAGATTGCTAACTTCACACCCTATTAAGGTCGCCCATATAGCTAGCTGGGAGGAATCACAATTAATACCAATCATAAAGCTCTTGTTTCTATTGATCTTAAGATTGATAATAATCTCAAACACCTTTACAATTTTGTTAAGATTAATGAAGTAATCCTTCTTGTTTGAACAAAAAGATTGTATCTTAGCGGCAGAAGATAGTTCTTATGGATGGGCTGGCCTTGCAATCTCTTCGAAACTGAGAAAGCTGAAATTGCATCTAAAAATCATGGAATTCTTATCGGGTATGCAAAAGAAAGAACAGGAAAAGAACTTTTTGAGAGATCTTGCTGAAATTGGCACAAAGGAAGAACCTGATTCTATTTCCATGGCGGAAAGGAGTTTAAGAACCTGATTCTATTTCCAGAGAATGTGGCTTCAGAAGTGTAAGTTGATTTGGTTGAGAGAGGGGGATGAAAACTCCTATTTTTTTCATAGATGGACCTTTGCAAGGAAAAGTAAAAGTTTTATTTCTTCATTAGAAAGTGCTGATTGCAGATTCTTGACTTCGGGAGGGCAGATCGAGGAGGAGATAATAACCTTTTTCTCTAAGCTCTATGGAGAGGCAGATGGTCATAAATTCACCTTTGAGAACTTGGAGTGGTCCCCGCTTTCCCCGGCTTGGAGGGATCAATTGGAGGCTCCTTTTGAGGAGATTGAGATCTTCAAGGCCATTTTTGATCTGTGTAATTTGAAGTCTCCGGGCCCCGATGGTTTCTCGAATGAGTTTTTTAAAAAATCTTGGAACATCGTGAAAGTCGACTTAGTAGCGGTGTTCAAAGATTTTTTTGAAAAAGGTGTCATAAATAGGTGCACAAATGAGACTTATATTTGTTCATTTCCTAAGAAAATTAATGCTTCAAAGGTGAATGACTTTAGGCCCATTAGCTTGGTTTCCTCATATAAGATTATTACGAAGGTTCTGGCTGAAAGGTTGAAAAAGTCCTCCCTCTGACAGTGGATGATGAGCAAGCTGCTTTTGTGGAAGGTCGACAGATTCTTGATGCTATCTTAGTGGCCTCTGGGGTTGTAGGGGATTGGAAAAGGCGCAAGGAGAAGGGTTTCCTCTTAAAACTTGATTGTGAAAAAGGTTATGACAAAGTTGATTGGTCTTACCTAGATACTATCCTTGAGCTAAAAGGCTTTGGCCAAAGATGGAGAAGATGGATTTGGGGGTGTATCTCGACTGCAAATTTCTCCAATCCTTATCAACGGTAGGCCCAGTGTTTTTTAAAGCGCGAGGCGCACCAAAGCGCAATAGCCCTCTGGGGCTTAAACGTGCGGCGCCAAAAAAAGCGAGGGACAGAACAGCTTGTCCTCTTTTTTCTCCATTTTTTGGTATTTCATTTCCTATAAAAAAAAAAAGAACCTGGATAAAAAGTGAGATCATGCATTTATAGCTTTGTGAAGTGAGGAATCTCCATCAGAAATGCAATCTACAGTGGTTAAAAGAAGGAGATGAAAATACAGGCTTTTTCCATAGGTTTTTCGCTGCCAAGAAGAGAAAATTGCTGATTTCTGAGTTGCAAGCAGTTACTAGGGACGTCCTAGTGAATTTCGGTGCTATTGAATATAAAATTCTTCTAGTAGAGATTCTTTGGAAATTTTATTTACTAATGATGAGATTTGGAGAGCTATCAACTTGGAATGAAGACAGCCCTTGGCCACGTTGGATTCTCCATAGAAAGAAATATGGTTTGAAAGAAATCAAGAATATTTGAAGACAAGTCTAGAGTTTGGGTGGAGCGGTTTGATCTAACCATATTTAAAGCTTCGCAATGGTGTTCTCTTTCTAATTTGTTTCATCATTATTTTCCTAGTAATGTTTGTATGAATCGGGAGGCCTTCGTTGTCCCTCATTAGCAACTTTTCTTCATTTTGCTATCTTTCACTTCTTTTAGGAGTTTATTTCCCTTGAACATTTTTGTATCTTTTCATTATATGACCTACTAAGGTCATCAACCACCACTAGGAAAAGGCAATGCAATAAAAGGTCTCCTTGCCTAACACCTCTAGAAGCTATAATCCTACCCTTGGACCACAATTGTGTAATTCAATGCCTGAAGGCAATTCCAAATTGGTTATCTCCATTTGAACCCAAAACCCTTCCTTCACAATACTTTGTAAAAGAATCCCAATCAACATGGTCACAAGCTTTCTCAAGGTCGATCTTGAAAATGACGTTGAAAATGACTCCTTCTTTACAGGCCCTATACTCCTCAACTGCCTCATTAGCATTTAACAATAAAAACCTCGTCCAAAATTTGTCGTTTTGTCACCTATGTAATTACTCTTTAACTTCTATACATGCCAATTGGAGAGCCTTTTTGTAAATCTTGTGGTGCCACTTTTGTAATTCATTAATATAGGCATTTGTTATTTGTAAAGTTTTGGTAGTTTTTTTTTTCTAAAGATTTGTAGGTTGGTCTTATTTCTAGTGTGTTGGGTTCAGAGTCTTGTACTTAGTTTGTGTATTAGTTTCTCTGTGCTTTGTAATATGTTGGTACTCATGTGTAATTTTTCATTTCATCAATGAAAAATTCTGTTTCCTTTTAAAAATATTGTCTCTTATCCAAAAGAAAAAGATTTTATTGTGGTTTATCGGAATCACTATTGTCGTCTTTAGTTCTTGAGTTTTGGAGTTGCCTAAAAGGACATAAGAAGATGATTGAAGAAGTTCGAGGACATGATATGTTACCTGCCAAGAAGTTACACAGTGGTAGATGGCTATTGAGAAAAGCATCAAGGTGTGATATATGTGTTTGAGATTGTGGAACTTTGTGTAGGTACATAAAATAGCATATAAATATACCTAAAGTTGTGAGGGCCATTATATAGCTTATGAGAGGAATCTGCTCATGGGTAGTGTTTTCAAGCTTGCAGCTAATGAGCGGTGCCTTCATGGGTAGCCTGTTGTAGCATAACTATAACATTACATGGATGGAATATTGACATGGTTTGAGTCTGCTTTGGTTGGTGGCTACTTGATAAGATTTGTAATCTAGTGTTAACGCATGGTAAGTGAGGACCTTGGCACATGTTCTCTAGCTTAGCTGATTTAGCAGAGTTAGGTCTTGAGATAATGATAGATGGGGCCAACTAAGCTAGCAGACTTAACCACTTGAATGAACCTAGACAAGTTGTATCTATCTTAGGTGGGGTCTATCTCTCTCCTATGGTGGCTCCTTTGCTTGGTTTCAATCTATGTTTTTGCATGGCCTTTATGGTCTTTTATTTCTCTTCATGAAAGCAGTTCTCTTATAAAGAAAGTTTCTGTATCTTAAGGTGGTTTTGCTTGGGCATGGGTCCAAAGTAATGGTATGCTGGAATCTTGGGAAGTCTGGTAATTGCCTTTGGCTGTAGCTGTAAAAATCACCTAGATTGCGTGCAGCATCCCCTATTTGTTGTAGCCTCCACCTATCCTGGAGCTTGGAAGGTCAAAACTATTCCTCTACTGGAAGCCTGTAATTAGCCGTACTATGGTTGAAGCTTTTTTTCTAGGGGAGGTAACAATCACCTTCCAATTAATATTAAAAATATGTTTCAAAGAAAGATGTAGTGAAAATCACACTTGTGTCACCAAGGGTTCAAGTCGTACGATCATAACACAAAAGTAATGGTGGTGGCATCTGAAAAGTGTTTGTTAAATTAGTGAATTAAAATGTTTTCTTCGAAAGGACAGATCCCACCTAAAACTAAGATGAGTTAGTTGAACGAAATCGTCATAGTTAAAGACGATATTTAAATTCAAGGAAAGAAGATTTTTTAGAGCAAAATTGCCAATTTAAATTTGTATCCAAAAGCCAAATTCTAATAATTGATGTTGTACTTTATTTTGAAAATTCCCTATGATATGTCCATCATTTGTTGGTTTTGTTGCAAAATTTGTTTTAAGATTTGATGTGATTGTTACATGTTTTCTTTTCAATTTGAATGTAGGACAAGTGCTGTGTTTTTGCAGCAGCTCACATCTTTGGCTAAAAAGAAATCATTTGGTCGAGTTGGGTTGATCAGCCTATCTGAATGCATTGCTTCAGCTGCTTCAGTTGGTGGATATGATAACGACAGTGAAGGAGAGTGCTTTGAAGGTTCTTCACTGTCAGCCCAAGGGGATTTGATAACTTATTCTCTGGGATGTAAATTGGAATTGCTGGATGATCTAAGATTTGTGGTTGAGAGTAGCAAACAACACTTCAATCCTAGTTATCGCCTTCAAGGTTTATGGTTATTTTATATCGTTATTCATGTTTTTCCTTTCACCATTGAATTTTGAAACTCAAATCAGTAGATGATCAAAGAGTGAATAACCATTTCGCATGTATGGGTGTTAGTTTGTGCAAAAGCTCTGGATGCTGCTGCTTCAGTCTTGTGTACATCTGACTTGGCTCTTGAGGTTCTTTTGCATTTTATTTCATCTCTACCACGAGAGGCTACTGACTGTGGAGGTGAGAGATATATTTTTTTGTTTTATGCTTACTTTTCTTCTGCCCACGTTCTTGCACCGATTCTTTGATCTCTACTGTTATTTGGTGCTTTCTGTGTGAATGTGCTAGAAATGAAACATATTTGGTGAAATCTGTTATATTATATGACTTCCCTTTTCTGTTTATATCCTTGTATTTATTATTGCTTTTAAATTAGCATGCCTTTTTTACTTTCCTTTTCTAGGTTGCTTGCTGTTGGCTTTTTAGTCAGGGGAGCCCAAGGCCTGTTGAACTAGTACACCCGGTTTTCCTATTCAGGTAATGTGGTTGATTCATTGGGCTGGGGCTCCCTGCTCCACCTTCCGTAGTGGTTAAAGGGAGGAGGGACTGGTACCGTGTCTCTCTCTCAGGCAGTAGGTACCCTGTTTAGCCAAGGTAGTAATGCTGAGGTTTGCAAAGGAGAAAGAAGTGCTTAAAAAGTTGAGTGCTGGAGTGTCTTTTAAGCCTAATTTTCTTTTTTCCTCAACAAGCAGGTTTGGATGGAAACCACTCCGAGGTCACTGCCATTTTTTGGGCCATTCTGCTTTTGCTAACATTTCTCATCAAACCCCTTCGATCCCAAGGCTCACAATAGTATTGTTTGTTAAAGGCATTCCTTCGTGTTCTTCTACTGGAAGGAAAAATTTACCACACTATCCTTTGCACAAGTGCAACACAATAGATTCAGAGAAACATTCCCACTCTTCTCCCTTGGAAGGCGATCTGTCCAGTTCTTTGCTAGAGAAACTTTGGAGGATTGCTGTACCCTGGAACAGGCTTATTTTGCAGAAAAGGATTAAAAATACCAATACAAAGAAACTGAAGATGCTATGGTTCAATCATTTGACAATGAGGAAAATGATTTGGTTCAGAATAGCTTACTTCAGCGGGTGGATCTTCTCTATTCGAAAGTCCTAGTATTGACCTAAAAGAAGTGAAGGCTTTATCCAATGCTAAAGAAAAGGGATCCAAACCAATCATCCTATTATTCAATCAAGCTTCGCTGATTGGCTACATTTTAACAACCCTTGTGTAGCGTCCATTTTTGTGGAGCTTAAACAATGTGATTAGCGTGACTGAAAAAATGTACCAAATGGACAGGGTGTTAAAAAGTCTCGTTTCTTCTCTGAATTATGATAAATACGCTATGCAATGGCTCTTTGGAGGAGCCTCAGTTTCCGAATGATGGTTTTGAGTTGGAACTTTAGAGGCCTAGGAACCTCTGGTAAGAGGCGTTGGTCAAAGATGCTATTCATTAATTCAACTCAAAAATTTGTATCTTAGATTCTTAGTTGAAACAAAATTTAGTCATGTTTTTAAATGGTTGGTTAAATCCCTGTGGAGCTATAATAGACATTTTTCTCACAATAGCTGAAAACTAAAATTACTAGTATTTATCTATGACCTAAGTTTTTCTTTTTATGGTTTGTAACTAAAAGCAATAAAGGCTACCTATATCTATTTGTCTTTATACTTCTCTCTTTTTGTTTCTATGTCTGTAGCTGTGTGTGCATTTCCTTGTTCTTCTAAGTATTGACGTCATTATAATTCTCTTTTCATCTGTTTTTTTTAGCTTACTTCTATCTCTCCCCTTTCTTTCCTAGGTTGCTTAAGAGGGAAAATGCAAAATTGGCTCTCAGGCTGTGGTAAGAAAAGCTGCAGTGGCAGTTGCTGCAGTACTGAGACGAAGTTTATGAAGAGTCTCATTGAGTTCCCTAAAAGATTTATAAGTCATAATCATTCATCTGATGCTTCTGTTACGTATGATGACGAAGAATTGGAAGCATGGGAATTTGAGGCAAAACGTTGGGCAAGAGTGGTTTTTCTTGCAGTCAAGGAGGAACATCATTTAAGACCTATACTGACGGTTCACCAATAATTTCTTTTTTCTATTTTTTGTAATGAATGAGTTTACTTTAAGATTGATTGCTAAGGAATGTAGACATACAATACCTTTCAAGAAAGAATAGAAAATCATTTGTTAATTAAATTTATTCTAGAAAATTTAAAATAAGAACTCATCCACATTGATCATTAAAGGGGACATCCGTAAGATGTTCCAAGAATTTTTTGAAAAAGGTATTTTAAACAAATGCATCAATGAGACTTACATATGCTTGATCCCTAAGAAAATTAGAGCCTCTAAAGTTTCAGATTTTAGACCCACTAGCCTTGTGACCAGTGTTTATAAAATTGTGGCTAAGGTGATGGCCGTTCGTCTAAAGCAAGTGATGCCTCAAATCATAGCTTCTAATCAGTCTGCTTTTGTTGCTGGTAGGCAGATTGTTGACTCAATTCTAGTTCAAACGAAATGGTGGATGATTATAAGACCAGAAAGGAAAAAGGGTGGGTTGTCAAGTTAGATTTTGAGAAAGCTTATGATATGGTTGATTGGGATTTTTTGCTCAAGGTGCTTGAAAAGAAAGGATTCGGGAGGAGATGGATCTCATGGATTAGAGCTTGTGTCTCCAATATTCAATTCTCGGTTTTGATTAATGGGAGACCTAGAGGGCGTATTCAGGCCACTAGCATTAGACAAGGTGACCCCCTTGCACCTTTCCTTTTTGTAATTATTGCAGACGTGCTTAGTAGTCTTCTGAATAAAGGGATTGATGATGGATCTCTCCAAGGTTTCTTAATAGGTAAGGAACTCATCCACATTAATCATCTCCAATTTGCAGATGACACACTCCTTTTCTCTACTTGTGAGGAGGAAAAGCTTCAAAATCTGTTCAATGTGGTCAAACTTTTCGAGAAGATATCAGGCCTAAAAATTAATTATGCCAAGTCTTCAATCACTGGCATCAATTAATGAAGACTTGATCACTTCCATTGCTGCAAATTGGGGCTGTAATGTTGTCTCTTGGCCTTGCTCCTACTTGGGTATGCCGTTGGGGGTCTCTGCAAACAAAGCTGGCTTTTGGGACCTTATTGAAGAGAAAATTTAGAATAGGCTGGACAGATGGAGGAACCATAATTTTTCTAGAGGCGGGAGAGTAACTTTGATCAACTCTGTCCTTTCTAGTTTGCCTATCTATTTTTTGTCGGTCTTTAAATGCCCAGTTATGATCTTGAAAAGGTGGGAAAGGAGTATTAGAGACTTTTTGTGGAAAGGTAGCAGGGATTCTAATAGTTGTAACCTTGTCAACTGGCATACAACTGCTCTCCCCATTGGTAAAAGGAGGTCTCTGTGTAGGGAATCTCAAAGTAAAAAATGTGGCTCTTCTTGCAAAATGGGGATGGAGGTATATGAACGAAAAGAATGCCCTTTGGAGAGGTATTGTGGCTAGCAAGTATGGTACCTTAAATGAGTGGATTCCTTTCCCAAAAGTAAGTAACAGGTTTAAGGGCCATCGGGTGCAAATTTCCAAATGCTGTCTTAAGGCTGAAAAATGGTACAAATGTGAAGTTGGTAGAGGTGACAAAGTCCTGTTTGGGAAGATCAGTGGCTCTCAAATAATGCATTGAAAGTGTCTTACCCTAGGCTCTACAGAATGACCAATTCAAAAGGAAGTTTAGTATCCCAATGCTGGTCAGAAGATTATGATTGCTGGGATCTCAAATTCAGAAGGAATTTAACTGATTTGGAGACTCAGGAATGGGCGTCGCTCCTTGGCATGTTAATTCACTTCACTCCTGGAGAGATTGATGACTGCAAAAGATGGACTCTTGAAAAAGATGGTATTTTCACAGTTAAATCAATGGCCCAAGAATTGTGTGGTAGGTGCGAAGATATGGGTAAATCCTTTATAAGAATCATTTGGCACTCCAAAGTTCCAAAGAAGGTGAAAGTCTTTTTGTGGATCTACGTTTTGGGAAAGCTAAATACTCATGTCAGATTACAGAAAAGAGCTCCCAATTGGGCTTTGTCCCCTAGCTGTTGTGTGATGTGCAAAAATGATGGGGAAGATAGCCAACACCTCTTTTTCTCTTGTCCATATGCTGCTGTGTGCTGGAGTAACCTTCTTAAATGTTTTGGTCTTTTGTGGGCGTTTCCTAAGAACGGATCAGATTGTTTGCTTCAACTTATATGTGGAACGTTTTATGGTGGTCAGGCAAAAGTCCTTTGGTATAATGCAGTGGTTGGCCTTCTTTGGAAATTATGGGTAGAGAGGAATAGAAGGATTTTTCAAGGGGAAGAAATGCCCAACGAGGTCTTGTAGGATGGAGTCAAATATCACGCCACTATTTGGTGCTCCCGTTACAAAGAGTTTTGTAATTATGGTTTCTCTCAAATTTACGCCAATTGGGAGTCCTTTTTGTAATCCTTTGGATCTTTGGGGATATCTCATCTCTCTCTTTTGTACTTCCTCTCTTTTTTCAATACATCCTTTGTTTCTTATCAAAAAAAAAAAAAAAAAGAATATACAAACTGTTATTGTTTTGATAATAAATGAAGAAAATTGAAGCTTATAATGATTTGATATACAAAAATTTTAATAAAAAATTTCATTATAAATTCTTTGTCAGTCTAGTTCGAAGTAAATCTTAACTGCATAACATTGGAAAAACTTTATGGTGTTTCAAGGTAATATGAAAGACTGATACCATTACTAAATTACATTAATAAATGTTGGTTTATCTTAAAAATAAAGAGAGAAAAAAGGGGAATCTGGTCCTTGGACGGTGGCTTATTTATCTTCCTTTCTTGAAGTTGTATTTGCCTCTCATAGTTTCTGCTTTATATTTGTTTACTGGGCTTGCCTTTCGCTGTACTGCCCAGAGGCTTCTGTGTGTGTGCGCACTTTGTCCCGTACTTTTGGATTATCTTTGTTGGATGAAAGTTTTGAAGCCATCTGTATTATCTTGTGTAGTTTATTCACAATCATGGTGTAAATATCTGCAAACAAAAGAGTGATTTGGAAGGGATACGTGTGAAGTTTCTAATACTTATCTTGAGCTTGGTTCAAGAACATCAATTAGTTCAGGAGAAAATTGCTGACTACAACTACAAATGTGAAACTAAGGATGATTATACCTTGTCTCAGCCAAGTGACAATTGGAGTTATTCAGAACCAACTACTTGTATCAAAAAATTTGCAAACCTTTTTCCGTCTCTACTGGTATTGGAAATTAATTGCACTTATCTCATAGGTTTTCCTTTTATGGAGCAAAACCTATTATTTGCTTCACGGGTGGTATTTGTTGGCTTGCAGGTAGAGTTGGTTTCTTTTGCTACCGTGTCTTGTTCCATATTCTGGTCCAATGTCAAGTCGGATGAGACAGGACTACCGTGTTCTGTGAAAGGGAAACTTGGAGGCCCCAGTCAACGCCGGTTACCATCCTCTACTGCTACTTTGGTTCTGCTAGCTGTATGATCTTCTATGCTCTTTAGCTCTCTTACGTTGTGGATTTGATCCCTCCATAGATGTGGCTATCAGAATCTAATAAATAAGAATACAGGGTTTAGAGTCGTGGACATGCTGTTACTTGATTGTTGAGTCTATGGCATTTCAATTTATGTAGACTTTTTTTTTATTAAAGCTAAAATATTAATATAGTTCTTTATCTGAGCTTAGACGTTGGTTCTTTTGAGTTATTGTAGAGCATGACAGTCAAGTTCCAGACTCAAAAGAATAGTAAATAAGGTTGACGCCTATAGAAGTTGGTTTGTGGCTATCATTCTCTATTTAATACTCTTAAAATGTCGTACAATATAGTTTGTGCTAATTATTGCTTCATTCCCTACCCTTTTTATTTGGGTGAGTGATAAATGTTTCTAATTTCCTGGTTCCAAATTTTGTTATGACTTTATTGGTGCATATTTTTTCCCTTCTTTTGTATGCGTGGACAGGTAACATCAATGAAGGCTATTGCATCTATCTTGTCATGTTGCAGACAGTTTAGAATCACTGGTTCACAGAATTTTGGAGTTGAATTTTTATTGAAGTTTGTGTGGAAGACTGTTTCATCTCCAGCTTATCACTCAGAGGTACAAAGATAGTTTTATGGTTTTACTTTTTCTGTAAGACATTAGATATTTTCCTATTTATGTTATGAAGAAAAACTTAAAAAAATCTGCTGTTTATATTTTCCTATTTTTCACTTTTTCAAGTTTTCAATGTTGTATCTAGAAAGAAAAACAAAAGAATTCATTTACTCTAATGTAGGGGTTTATGGGTTCCTTATGGAGATTTCTGGGTGTCTCTGATCCCAATTCATTTACTCTAATGTAGAGTGGAGCAGAAATATGTCTTGCAACATATGAAGCGCTAGCCCCTGTTCTCCAAGTGCTTGTGTCCGAGTTTTCTTCTCAAGCTCTAAAGTTCATACGGGACGAGAATACAATCATGCATCTAGGAGTAGAAGGAAGACCACTGTTGGACTCTCTTGTTCTTACTTTTCATCAGCATGTAAATGGTATACTTGATGCGGGAGTTTTGGTTCGAAGTAGAAGGGCAGTTCTACTGAAGTGGAAGGTGGCTTTTTAAGCTTTGCATTTGTCTATTCTTTATATGTTTTCTTATCTTTAGTATTTTTACAAACCTAAAAATATTAGTAATAGCATTTGAAAATTTTCTATTTTAGTAATGAAGTGGTTTAATATGACACTATACCATAATAGTAGCTTCAATTTGACTTTATTTTAATAACTTGTTAATTTTATAACTTTCAGCTTATGTCAAATAGCAAACTGCTTCTATCCTTACATATTTTGTTTCATCCTTCATGAATTTTGCACTCAGTTTTTTGGCCACATAAATTTGCACGTTATTTCTAGATTTGTTCTTCTCAACTGCATATGTTCATGTGATGATGGTTGTATCATCATCGGACTTAAAGTGATCTGCATGTCTTATGCAGTGGCTTTGCCTAGAATCTCTTTTATCAATTCCCTATCGTGCTCTTCAAAATGGACTCAATTTAGTGGATAACAACTCTTTTTTATCAGAGGCAACTCTTGTACGGATATTTAGTGATCTTGTTGAAAGGTAAGATCTGATATTAACTTCCTGTTAGTTATGGTCTTGAGGCATGTATTTGTTAGATAATCATTTTGGGTGCTCCTGTTGTTTTTATACTTTTGACGGGAAAAGGACTTATAATGATCAAAATTCGTTAGTGCTCTGTTTCTTATCGGAAAAAAAAGATGAAGTTCGTCAGTTACCAAATGTTGCAAGCATCCGAAGTTATTTATTCCCAGTCATTAGTGAATTATAAATGACGAGACTGACAGCTGTGGAAATTATATTGTTGCTTTTTGTCAAGTTCTGGTTTCTAAATTAGTTGCTGTAACTGCTACGACTTGCTAGAGAAATTTACTTTTCTAGCCAACACTAACCTGGATGGCTTTTCTATTGAATTCTTCAAAAAGTATTGGAACACTCTTAAATCCAATATTTTAGGGGTGTTTTATGAATATTTTAAGGATGAGATTATCAGTGCTAGTCTCAATGAGACTTATATTTGTCTCATTTCAAAAATATTAGAGGCAAAATTAGTTAATAATTACTGACCAATGAATCTCACTTCTTAGTTTCTGCAAAATCATAGCTAAAGTTCTTTTAGAAAGACTCAAAATGGTCTTCCCCTTTACTATTTCAGAATTTCAACCAGCTTTTGTCTATAACTCGAAGATTCAGATTCTTGATACTTGCCTTACTGCAAATGAGCTTATTGACGAGTGGAGGAGAAAGAAAAAGAAAGATGTTGTGATTAAGCTTGCTCTTTGAAAAATCTTTTGACAAGGTTGATGGGAAATTTCTAGATGCCATTTTACTTGTTAAGGGCTTTGGTCCTTGATAGAGACTTTGGATTTGGGGTTGCATTTCTAAGTGCAAAATTTTCAATTATCTTTAATGGCAAAATCTAGGGGCAATGCAAGCGTCTAGAGGTATTAATCAAGGGAATTCTCCCTCCCCATTTCTTTTTATTATCATTGTTGATTTCTTAAGTAGACTCCTCTCTAGAGGGTTAAGTCTCGGCTTGGTGGAAGGTTTCCATGTGGGGAATAACTCTTTGTCTATCCATTAACCAATTTCGATTTTGTGGATGACACCATCCCCTTTTCTTCTTTGGAAAAGAAGCCTCGACATTATCTTCAATATCCTCAAAATCTTTTGAGGAGGCTTTTCGGTTTAATATTGGTTGTAGCAAATCGGAGATTGTAGGCATTAGTTGTAAGAGCTCTCTCATTGATCAATTTGTTCTTGGCTTTGGGAGCAAGGTCGGAAATTGGCCCCTCCCTCACTGACCTCACTGATATAGTGTGATTAAGAGTAATTTTGATTTGAACTTGTTAATTGTTAGGTTTCAGTTGTTAATTGGTTAGTAGTTGGGTTAGGTAGTTCGTTACTGTTTGGTTCACTTAGTTAGTAAGTTGACTTCCTCTATAAACAGAGGGTTACTCAACCGGTAAGATGCCGAAATTTGGAAGTTCGATTCTACTGGCTTATTTTTTGTCCAATCCTTGTGTTTTAATTATTCCTCTTCTTCTCCTACTGATTTGTTTCAAGCAATATGGAAGTCCAAAAGTCCTCAGGAAATTAGCATTCTTATTTGGATTTTGTTCTCCAGAGGGCTGAATACCTCAGATTTGCTGCAGAGGAGACTTCCTAATTTTGTGTTGCTTTCTTCGCTTTGTGTTCTTTGTTGGCAGGATATTGAAGACCAAAATCACCTATTTTTTCTTTGCTCGTATGCAAATATTGCTGGAATTTCTTATTCAGCCTATTCTCTATGACTCTATTGAGTGGGTATTTGGGCCGACTCTTTAGCCAATTTAATTTAGATTTTAATTGGTCCGTCCTTGTCTTCAACAGCCCAAATTCTTTCTTTGGGTCAATGCTGTCAAAGCAATTATTTTTGAAATTTAGTTGGAGAGAAATCAGAGGATCTTCAAAGACAATCTTTTAACTTGGAGGGAGTGGTTTGACTTAGCCAAGATCAAGTCTTCCCATTGGTGTTCTTTATCATCATTGTTTGATCATTATTCTTTTAACCATTTGTTTTAATTGGGATGCTTTAATATCCCCATGTTAGCTTCTGTTATTTACTTTTTGCTTCTTTTCTTTCCTTTTTGGGAGATTGTATCTTCGAACATTTTCTGTTTCTTTTCATCAATTCAATGAAAAGTTCGTTTCTTTTTCAAAAAAATAATAAAGACTCTTGTTGAGCATTATTTGGAGGTTACTCTTTTAGATACTAGGTTGCGTCACTCACATAGGCCACTCTCTCTAACATCCCTATTTATTGTCTCTTTTTTACTACAAGGTTTAAAATTCCATTAAAAACATTTGCATGGACTTTCTTTTGAAAGGTGCTAGAAAAAGAGGCAATCATCGGGTAAATTGGACTAAGGTCCAATTCCCCTTTTGTTCGGGTGGGATTCAAATTAGCAATATAAAATTAAGGAATTGCTTTCTTCTTGCATAATGTATTTGGAGATTTCACTTTCATAATGGGAAACTTTGGAGAAAAGTCATAATGGCAAAATATGGCATCTCCAATGATGATTGTTGGTCTAGTAATCCTCTCTTCATACTTCTCAAGGTCCCTGGCATTCCATCTTTAAACTTAAAGACTTAATGTCAACGTGCACTGTTGGTAGTGGTTTGGATAATAGCATTTGGAAAGACTTTTAGATTGGTTGTGGCTTTTTTATAAGGTTTCCTGGACTTTTCTCTTTCTTGAACTGGAAGGAGTACTCCATTAAAGAAGCTTGGAAACAAGAATGGGCATTTCATGCCAATTTACTATCTAATTTCTCTCTCATCAATGAGTTTGACAAATGCATTTGGATGGCTGATTCTTCTGGGATTTTTACCGGTAAATCCTTATTCAAGGACCTTGCAAACAAAAGGAACTCTTGTGGATTATCACCTCTGCAATAATATTTTGTAATCTAGATATCCTTAAAAAGTGAATTTCTTTTTGTTGGAGTCTGCTTATGAAGGCATTAACACCGAAAGGATTCAGGAAAGAATTTGGAAAAGGTTTCCTTTGTGCTTCTCTCCCATGGGTGGTGTTTCATTTGTGGAAGTGATGCCGAAACTCAAAGGCACTTCTTAAACTGTCTCTTTGCAAATAAATTCTGGAACATTGTGCTTCTCAATTTTAAATGGTATATGGCTCTTCCCCTTGGTTTGAATGGCATTCTCTTTCTAGTCCTTTGGAGTCATGCTCTTGAAAGGGAGAAAAAGAGACTTTGGTTGCAACTTGTTAGAGCCTTCTTTTGGAACTTATAGCTGGAAAGAAACATTAGTCTTTAACAACTTAGAAAGCTCTTTGTTGTGTGTCTTATATAAATGTGCCTTAGAGAGCTTGGATCATGCGCTTTGGAGATGCAATTTTTCTTTCTTGATTTTGGATGTGTTTATGGAGTCTTTTGGTATTTCCCAGGCCTGTAATAACTCCTGTTGCTTGATGGCTGAGGAGGTTCGATTCTATCCTTTGTTTAGAGGCAAAGGAGAAATGTTGAGGAAGATTGCCTTTCTAGCCATTTCGTACAATATCTGGCTTGAGAGAAACAAAAGGGGGAGGTTTAGTCTACAGCTAGGTTCAATACTCCCCACTAGGTGTCGGTGACGAAAGAGTTTTGTATTCTTCATTCATCTCTTATTCTTAACTTTTTTTTCTTCTTTTGAAACAAAACTCTTATTCTTAACTGTTGAAACTCTTTCTTTTAGTTGTTTGGCCAATTGTTTTGTGTTGGCTCTATATGGTTTGGGCTGGTATTTTGTATTTCCCGTTGTATATCCTTTCATTTTCCCTATGAAAGTATGGTTTTTCATCCAAAATGATAAGAAAAAAACATAGAAAGCTTTTTTGATCATTTTAGGGATAATCTCATTTTTGTGGTGTTTGGTGTAAAGCTATATGACCCTTTTTGTAACTACAATATGAGTTCTCTTTGTGCCACTTGGAAAAGTATTTTGTAATCTTCTCAGCTTAGCTTTCCTCCAACTTTTGTAAGTTCATTTATTCAATGAAATTAGTTTGTTATAAAAACCTTCTCCATTCAAATTGATTCTTGCTAAATTTGCAGCCTCGAGAATGCTGGAGAATGCTCTGTTTTACCCATGCTGAGATTGGTTAGATTGAATTTGTGGCTATTTTGCAAGGGAAAGTCTGGTCTGCTTGTTACATCGTGTAATGGCGTGAATGCAGAG
mRNA sequence
ATGCTCCATGCGATGCTGAGATGCTCCACATATCTAACGGGAGATGGATTTGTCTTCAGAGGTGCTCGCCTGCGCCTCGCTGTTCGTCGAACTACGAGCCCTTCTTTTTGTGCTGATCCTGATGCTTTGAAGTCATTTATATGGAAATGTTTCGTTCCTTTGATAAATAGGGCGATTGCATTTAATCGGGAAATGCTTAACCAGGTCGCTGAATCATTCATTGATGTCGTCATTGAGATGAACTCGTGGCCAATTGTTGAAGCAACTCTAATTCCATTCTGTATAAGTTCAGCTCTTTATTCCACTAGCGTGCTGCAAAATGAAGAGTTTGACACCTTTGAGGGCGACAGATGTTCTGTCATTTTGGGCTCAAATGGCCCAATAAATGAACCTAGAATGGATAAACAGATGATGAAAGCATACGGGTTCCTTCCACTACCATTAGCATGCCATATTTTGGCTATAATGTTAGATGCTGTCCTTTGTAATAGACATGCACCACAAATATCAGATGCAGTGGTGGCAAATGGATGTCAAAAAGCTGAAGAATTTACTGTTAAACTGATTTGGGATATTTGCAATTTATCTGAACAAATGCTTTTACAAAGCTCAGATCATCGATCTTGTACCATTCGCTATCTTCTCCCATTAATCTTTGAAGTGCTTCTTTCTCACCACTCTCTAGAGATCTCCATTCAAGGGCATGTGCATAATCTCTCCAGGAATCGTTTTCTCATGAAAATATGGAAATGTTGCAAAAAACTATTTTCATTTGGAACTTTGGAGAGAAGAGATGCCTATAAGATTTTGTCTCTTTATTTATGTTTTTTCCCTCACAATGAAGAGCTTGGAGGTGCTGGAATCTGTGACGACGCAGAAGAATTTGACATAAAGGCTGATAAAGATTTTTGGGATGCAATTAAAAGAGGCTTGGTGGATAAGGAGGGCTTGGTGAGGAAGCAGTCACTACATATATTGAAGAAAGCACTATATATAAATGGAAGAGGCAATACATCTAGGGTTCCAAAGACAATTTCAAGTGGGAAAGATAATAATGCTCGAGGTATTACAAAAAGGGAAAGATGGGCCCACAAGGAAGCAAAATCACTTGGTGTAGGGCAAATTTGCAGCGAAAGTGAAATTGTTATAAATAGCCAGCAGCAACAGTGGGAAGCATTCATCCTTCTATATGAAATGCTTGAAGAATATGGTTCACACTTGGTCGAAGCTGCTTGGAATCACCAGATATTCTTGTTACTACGAGATCCGACCTCAATTAATTTTGACAGCGTCACTGGTGGCATTCATCAAAACCAAATTGATATGTCTAGCGAAATCTTTAGTTGGTTATCAATCTTGTGGGTTCGGGGCTTCCACCATGATAATCCTTTAGTTAGATGCTTGATCATGCAGTCCTTTTTGGCCATTGACTGGAGGAATTATGTACCCTGTTTAAAGTCATTGCCAGAAACTTTGATCATTGGACCATTCATTGAAGCACTAAACGATCCTGTGCAGCACAAAGATTTCGGTGTAAAAGGAGTTTACTCATCTAAGACAATTGAAGGTGCAGCCCATTTTATACGCCAATATGCAAATTGTCTTGATGCAAGGACAAGTGCTGTGTTTTTGCAGCAGCTCACATCTTTGGCTAAAAAGAAATCATTTGGTCGAGTTGGGTTGATCAGCCTATCTGAATGCATTGCTTCAGCTGCTTCAGTTGGTGGATATGATAACGACAGTGAAGGAGAGTGCTTTGAAGGTTCTTCACTGTCAGCCCAAGGGGATTTGATAACTTATTCTCTGGGATGTAAATTGGAATTGCTGGATGATCTAAGATTTGTGGTTGAGAGTAGCAAACAACACTTCAATCCTAGTTATCGCCTTCAAGTTTGTGCAAAAGCTCTGGATGCTGCTGCTTCAGTCTTGTGTACATCTGACTTGGCTCTTGAGGTTCTTTTGCATTTTATTTCATCTCTACCACGAGAGGCTACTGACTGTGGAGGTTGCTTAAGAGGGAAAATGCAAAATTGGCTCTCAGGCTGTGGTAAGAAAAGCTGCAGTGGCAGTTGCTGCAGTACTGAGACGAAGTTTATGAAGAGTCTCATTGAGTTCCCTAAAAGATTTATAAGTCATAATCATTCATCTGATGCTTCTGTTACGTATGATGACGAAGAATTGGAAGCATGGGAATTTGAGGCAAAACGTTGGGCAAGAGTGGTTTTTCTTGCAGTCAAGGAGGAACATCATTTAAGACCTATACTGACGTTTATTCACAATCATGGTGTAAATATCTGCAAACAAAAGAGTGATTTGGAAGGGATACGTGTGAAGTTTCTAATACTTATCTTGAGCTTGGTTCAAGAACATCAATTAGTTCAGGAGAAAATTGCTGACTACAACTACAAATGTGAAACTAAGGATGATTATACCTTGTCTCAGCCAAGTGACAATTGGAGTTATTCAGAACCAACTACTTGTATCAAAAAATTTGCAAACCTTTTTCCGTCTCTACTGGTAGAGTTGGTTTCTTTTGCTACCGTGTCTTGTTCCATATTCTGGTCCAATGTCAAGTCGGATGAGACAGGACTACCGTGTTCTGTGAAAGGGAAACTTGGAGGCCCCAGTCAACGCCGGTTACCATCCTCTACTGCTACTTTGGTTCTGCTAGCTGTAACATCAATGAAGGCTATTGCATCTATCTTGTCATGTTGCAGACAGTTTAGAATCACTGGTTCACAGAATTTTGGAGTTGAATTTTTATTGAAGTTTGTGTGGAAGACTGTTTCATCTCCAGCTTATCACTCAGAGAGTGGAGCAGAAATATGTCTTGCAACATATGAAGCGCTAGCCCCTGTTCTCCAAGTGCTTGTGTCCGAGTTTTCTTCTCAAGCTCTAAAGTTCATACGGGACGAGAATACAATCATGCATCTAGGAGTAGAAGGAAGACCACTGTTGGACTCTCTTGTTCTTACTTTTCATCAGCATGTAAATGGTATACTTGATGCGGGAGTTTTGGTTCGAAGTAGAAGGGCAGTTCTACTGAAGTGGAAGTGGCTTTGCCTAGAATCTCTTTTATCAATTCCCTATCGTGCTCTTCAAAATGGACTCAATTTAGTGGATAACAACTCTTTTTTATCAGAGGCAACTCTTGTACGGATATTTAGTGATCTTGTTGAAAGCCTCGAGAATGCTGGAGAATGCTCTGTTTTACCCATGCTGAGATTGGTTAGATTGAATTTGTGGCTATTTTGCAAGGGAAAGTCTGGTCTGCTTGTTACATCGTGTAATGGCGTGAATGCAGAG
Coding sequence (CDS)
ATGCTCCATGCGATGCTGAGATGCTCCACATATCTAACGGGAGATGGATTTGTCTTCAGAGGTGCTCGCCTGCGCCTCGCTGTTCGTCGAACTACGAGCCCTTCTTTTTGTGCTGATCCTGATGCTTTGAAGTCATTTATATGGAAATGTTTCGTTCCTTTGATAAATAGGGCGATTGCATTTAATCGGGAAATGCTTAACCAGGTCGCTGAATCATTCATTGATGTCGTCATTGAGATGAACTCGTGGCCAATTGTTGAAGCAACTCTAATTCCATTCTGTATAAGTTCAGCTCTTTATTCCACTAGCGTGCTGCAAAATGAAGAGTTTGACACCTTTGAGGGCGACAGATGTTCTGTCATTTTGGGCTCAAATGGCCCAATAAATGAACCTAGAATGGATAAACAGATGATGAAAGCATACGGGTTCCTTCCACTACCATTAGCATGCCATATTTTGGCTATAATGTTAGATGCTGTCCTTTGTAATAGACATGCACCACAAATATCAGATGCAGTGGTGGCAAATGGATGTCAAAAAGCTGAAGAATTTACTGTTAAACTGATTTGGGATATTTGCAATTTATCTGAACAAATGCTTTTACAAAGCTCAGATCATCGATCTTGTACCATTCGCTATCTTCTCCCATTAATCTTTGAAGTGCTTCTTTCTCACCACTCTCTAGAGATCTCCATTCAAGGGCATGTGCATAATCTCTCCAGGAATCGTTTTCTCATGAAAATATGGAAATGTTGCAAAAAACTATTTTCATTTGGAACTTTGGAGAGAAGAGATGCCTATAAGATTTTGTCTCTTTATTTATGTTTTTTCCCTCACAATGAAGAGCTTGGAGGTGCTGGAATCTGTGACGACGCAGAAGAATTTGACATAAAGGCTGATAAAGATTTTTGGGATGCAATTAAAAGAGGCTTGGTGGATAAGGAGGGCTTGGTGAGGAAGCAGTCACTACATATATTGAAGAAAGCACTATATATAAATGGAAGAGGCAATACATCTAGGGTTCCAAAGACAATTTCAAGTGGGAAAGATAATAATGCTCGAGGTATTACAAAAAGGGAAAGATGGGCCCACAAGGAAGCAAAATCACTTGGTGTAGGGCAAATTTGCAGCGAAAGTGAAATTGTTATAAATAGCCAGCAGCAACAGTGGGAAGCATTCATCCTTCTATATGAAATGCTTGAAGAATATGGTTCACACTTGGTCGAAGCTGCTTGGAATCACCAGATATTCTTGTTACTACGAGATCCGACCTCAATTAATTTTGACAGCGTCACTGGTGGCATTCATCAAAACCAAATTGATATGTCTAGCGAAATCTTTAGTTGGTTATCAATCTTGTGGGTTCGGGGCTTCCACCATGATAATCCTTTAGTTAGATGCTTGATCATGCAGTCCTTTTTGGCCATTGACTGGAGGAATTATGTACCCTGTTTAAAGTCATTGCCAGAAACTTTGATCATTGGACCATTCATTGAAGCACTAAACGATCCTGTGCAGCACAAAGATTTCGGTGTAAAAGGAGTTTACTCATCTAAGACAATTGAAGGTGCAGCCCATTTTATACGCCAATATGCAAATTGTCTTGATGCAAGGACAAGTGCTGTGTTTTTGCAGCAGCTCACATCTTTGGCTAAAAAGAAATCATTTGGTCGAGTTGGGTTGATCAGCCTATCTGAATGCATTGCTTCAGCTGCTTCAGTTGGTGGATATGATAACGACAGTGAAGGAGAGTGCTTTGAAGGTTCTTCACTGTCAGCCCAAGGGGATTTGATAACTTATTCTCTGGGATGTAAATTGGAATTGCTGGATGATCTAAGATTTGTGGTTGAGAGTAGCAAACAACACTTCAATCCTAGTTATCGCCTTCAAGTTTGTGCAAAAGCTCTGGATGCTGCTGCTTCAGTCTTGTGTACATCTGACTTGGCTCTTGAGGTTCTTTTGCATTTTATTTCATCTCTACCACGAGAGGCTACTGACTGTGGAGGTTGCTTAAGAGGGAAAATGCAAAATTGGCTCTCAGGCTGTGGTAAGAAAAGCTGCAGTGGCAGTTGCTGCAGTACTGAGACGAAGTTTATGAAGAGTCTCATTGAGTTCCCTAAAAGATTTATAAGTCATAATCATTCATCTGATGCTTCTGTTACGTATGATGACGAAGAATTGGAAGCATGGGAATTTGAGGCAAAACGTTGGGCAAGAGTGGTTTTTCTTGCAGTCAAGGAGGAACATCATTTAAGACCTATACTGACGTTTATTCACAATCATGGTGTAAATATCTGCAAACAAAAGAGTGATTTGGAAGGGATACGTGTGAAGTTTCTAATACTTATCTTGAGCTTGGTTCAAGAACATCAATTAGTTCAGGAGAAAATTGCTGACTACAACTACAAATGTGAAACTAAGGATGATTATACCTTGTCTCAGCCAAGTGACAATTGGAGTTATTCAGAACCAACTACTTGTATCAAAAAATTTGCAAACCTTTTTCCGTCTCTACTGGTAGAGTTGGTTTCTTTTGCTACCGTGTCTTGTTCCATATTCTGGTCCAATGTCAAGTCGGATGAGACAGGACTACCGTGTTCTGTGAAAGGGAAACTTGGAGGCCCCAGTCAACGCCGGTTACCATCCTCTACTGCTACTTTGGTTCTGCTAGCTGTAACATCAATGAAGGCTATTGCATCTATCTTGTCATGTTGCAGACAGTTTAGAATCACTGGTTCACAGAATTTTGGAGTTGAATTTTTATTGAAGTTTGTGTGGAAGACTGTTTCATCTCCAGCTTATCACTCAGAGAGTGGAGCAGAAATATGTCTTGCAACATATGAAGCGCTAGCCCCTGTTCTCCAAGTGCTTGTGTCCGAGTTTTCTTCTCAAGCTCTAAAGTTCATACGGGACGAGAATACAATCATGCATCTAGGAGTAGAAGGAAGACCACTGTTGGACTCTCTTGTTCTTACTTTTCATCAGCATGTAAATGGTATACTTGATGCGGGAGTTTTGGTTCGAAGTAGAAGGGCAGTTCTACTGAAGTGGAAGTGGCTTTGCCTAGAATCTCTTTTATCAATTCCCTATCGTGCTCTTCAAAATGGACTCAATTTAGTGGATAACAACTCTTTTTTATCAGAGGCAACTCTTGTACGGATATTTAGTGATCTTGTTGAAAGCCTCGAGAATGCTGGAGAATGCTCTGTTTTACCCATGCTGAGATTGGTTAGATTGAATTTGTGGCTATTTTGCAAGGGAAAGTCTGGTCTGCTTGTTACATCGTGTAATGGCGTGAATGCAGAG
Protein sequence
MLHAMLRCSTYLTGDGFVFRGARLRLAVRRTTSPSFCADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISSALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIMLDAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPLIFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFFPHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGNTSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLYEMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYSSKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGYDNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALDAAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETKFMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTFIHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSDNWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQRRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSESGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSDLVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE
Homology
BLAST of Sgr020180 vs. NCBI nr
Match:
XP_022155532.1 (uncharacterized protein LOC111022655 isoform X1 [Momordica charantia])
HSP 1 Score: 1831.6 bits (4743), Expect = 0.0e+00
Identity = 921/1065 (86.48%), Postives = 972/1065 (91.27%), Query Frame = 0
Query: 38 ADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISS 97
ADPDALK FIWK FVPLIN+A AFNREMLNQV+ESFIDVVIE NSWPIVE TL+P CISS
Sbjct: 92 ADPDALKLFIWKSFVPLINKAAAFNREMLNQVSESFIDVVIETNSWPIVEETLVPLCISS 151
Query: 98 ALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIML 157
ALYST++LQNE+ TFEGDRCSVILGSNG ++EP+MDKQM+K YGFLPLPLACHILAIML
Sbjct: 152 ALYSTTMLQNEQLGTFEGDRCSVILGSNGSVHEPKMDKQMIKGYGFLPLPLACHILAIML 211
Query: 158 DAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPL 217
DAVLCNR APQ ++ VV+NGCQKAEEFTVKLI DICNLS+QMLLQSSDHRSC IRYLLP+
Sbjct: 212 DAVLCNRQAPQTTEVVVSNGCQKAEEFTVKLIRDICNLSDQMLLQSSDHRSCAIRYLLPV 271
Query: 218 IFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFF 277
IFE LLS H+LEISIQG+ +LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FF
Sbjct: 272 IFEALLSQHNLEISIQGYACSLSRNRFLMKIWNCCKKLFSFGTLERRDAYTILSLYLSFF 331
Query: 278 PHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGN 337
PHNEEL GAG+CDDAEEFDIKADKDFW IKRGLVDKEGLVRKQS+HILKKAL INGRGN
Sbjct: 332 PHNEELEGAGMCDDAEEFDIKADKDFWVEIKRGLVDKEGLVRKQSVHILKKALSINGRGN 391
Query: 338 TSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLY 397
TS VP TISSGKDNNARGITKRERWA+KEAKSLGV Q CS+ EIV NS QQ+WEAFILLY
Sbjct: 392 TSSVPNTISSGKDNNARGITKRERWANKEAKSLGVVQTCSQHEIVTNSLQQKWEAFILLY 451
Query: 398 EMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRG 457
EMLEEYGSHLVEAAWNHQI LLLRDPTSI FDS TGG +QNQI+MS EIFSWLSILWVRG
Sbjct: 452 EMLEEYGSHLVEAAWNHQISLLLRDPTSIKFDSFTGGFYQNQIEMSGEIFSWLSILWVRG 511
Query: 458 FHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYS 517
FHHDNPLVRCLIMQSFL IDWRNYV CL SLP+T IIGPFIEALNDPVQHKDFGVKGVYS
Sbjct: 512 FHHDNPLVRCLIMQSFLGIDWRNYVSCLMSLPQTFIIGPFIEALNDPVQHKDFGVKGVYS 571
Query: 518 SKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGY 577
SKTIEGAAHFIRQYANCLDART+ VFLQQLTSL KKKSFGRVGLISLSECIASAAS+ G+
Sbjct: 572 SKTIEGAAHFIRQYANCLDARTTVVFLQQLTSLTKKKSFGRVGLISLSECIASAASIVGF 631
Query: 578 DNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALD 637
+ND EGECF+ QG+LITYSLG K+ELLDDLRFVV+SSKQHFNPSYR QVCAKAL+
Sbjct: 632 ENDCEGECFD-----PQGNLITYSLGYKMELLDDLRFVVQSSKQHFNPSYRFQVCAKALE 691
Query: 638 AAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETK 697
AAASVLCTSDL LEVLL FIS+LPREATD GGCLRGKMQ+WL GCGKK CSGSCCSTETK
Sbjct: 692 AAASVLCTSDLDLEVLLLFISALPREATDYGGCLRGKMQSWLLGCGKKCCSGSCCSTETK 751
Query: 698 FMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTF 757
FMKSLIEFPKRFI HNHSS+ SVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLR ILTF
Sbjct: 752 FMKSLIEFPKRFICHNHSSNVSVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRSILTF 811
Query: 758 IHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSD 817
IHNHGVNI KQKSDLEGIRVKFLILILSLVQE QLVQEK D+NYKCETKD+YTL QPSD
Sbjct: 812 IHNHGVNIGKQKSDLEGIRVKFLILILSLVQELQLVQEKATDHNYKCETKDEYTLCQPSD 871
Query: 818 NWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQ 877
+ +Y+ PTT IKKF NLF SLL ELVSFAT SCSIFWSNVKSDETGLP SVKGKLGGPSQ
Sbjct: 872 DLNYAVPTTFIKKFVNLFSSLLEELVSFATASCSIFWSNVKSDETGLPSSVKGKLGGPSQ 931
Query: 878 RRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSE 937
RRLPS TATLVLLAVTSMKAIA +LSCCRQF+I GSQNFGVEFLLKF+ KTVSSPAY SE
Sbjct: 932 RRLPSYTATLVLLAVTSMKAIAFVLSCCRQFKIIGSQNFGVEFLLKFLLKTVSSPAYRSE 991
Query: 938 SGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHV 997
SG EI LATYEALA VLQVLVSEFSSQ L+FI DENTIMHLGVEGR LDSLVLTFHQHV
Sbjct: 992 SGVEIRLATYEALASVLQVLVSEFSSQVLQFIWDENTIMHLGVEGRQPLDSLVLTFHQHV 1051
Query: 998 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSD 1057
NGILDAGVLVRSRRAVLLKWKWLCLESLLSIP R Q+GL LVDNNSFLSEATLV+IF+D
Sbjct: 1052 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPQRTFQDGLCLVDNNSFLSEATLVQIFND 1111
Query: 1058 LVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE 1103
LVESLENAGECSVLPMLRLVRL LWLFCKGKSGLLVT CNGVN+E
Sbjct: 1112 LVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTFCNGVNSE 1151
BLAST of Sgr020180 vs. NCBI nr
Match:
XP_022155533.1 (uncharacterized protein LOC111022655 isoform X2 [Momordica charantia])
HSP 1 Score: 1831.6 bits (4743), Expect = 0.0e+00
Identity = 921/1065 (86.48%), Postives = 972/1065 (91.27%), Query Frame = 0
Query: 38 ADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISS 97
ADPDALK FIWK FVPLIN+A AFNREMLNQV+ESFIDVVIE NSWPIVE TL+P CISS
Sbjct: 92 ADPDALKLFIWKSFVPLINKAAAFNREMLNQVSESFIDVVIETNSWPIVEETLVPLCISS 151
Query: 98 ALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIML 157
ALYST++LQNE+ TFEGDRCSVILGSNG ++EP+MDKQM+K YGFLPLPLACHILAIML
Sbjct: 152 ALYSTTMLQNEQLGTFEGDRCSVILGSNGSVHEPKMDKQMIKGYGFLPLPLACHILAIML 211
Query: 158 DAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPL 217
DAVLCNR APQ ++ VV+NGCQKAEEFTVKLI DICNLS+QMLLQSSDHRSC IRYLLP+
Sbjct: 212 DAVLCNRQAPQTTEVVVSNGCQKAEEFTVKLIRDICNLSDQMLLQSSDHRSCAIRYLLPV 271
Query: 218 IFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFF 277
IFE LLS H+LEISIQG+ +LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FF
Sbjct: 272 IFEALLSQHNLEISIQGYACSLSRNRFLMKIWNCCKKLFSFGTLERRDAYTILSLYLSFF 331
Query: 278 PHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGN 337
PHNEEL GAG+CDDAEEFDIKADKDFW IKRGLVDKEGLVRKQS+HILKKAL INGRGN
Sbjct: 332 PHNEELEGAGMCDDAEEFDIKADKDFWVEIKRGLVDKEGLVRKQSVHILKKALSINGRGN 391
Query: 338 TSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLY 397
TS VP TISSGKDNNARGITKRERWA+KEAKSLGV Q CS+ EIV NS QQ+WEAFILLY
Sbjct: 392 TSSVPNTISSGKDNNARGITKRERWANKEAKSLGVVQTCSQHEIVTNSLQQKWEAFILLY 451
Query: 398 EMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRG 457
EMLEEYGSHLVEAAWNHQI LLLRDPTSI FDS TGG +QNQI+MS EIFSWLSILWVRG
Sbjct: 452 EMLEEYGSHLVEAAWNHQISLLLRDPTSIKFDSFTGGFYQNQIEMSGEIFSWLSILWVRG 511
Query: 458 FHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYS 517
FHHDNPLVRCLIMQSFL IDWRNYV CL SLP+T IIGPFIEALNDPVQHKDFGVKGVYS
Sbjct: 512 FHHDNPLVRCLIMQSFLGIDWRNYVSCLMSLPQTFIIGPFIEALNDPVQHKDFGVKGVYS 571
Query: 518 SKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGY 577
SKTIEGAAHFIRQYANCLDART+ VFLQQLTSL KKKSFGRVGLISLSECIASAAS+ G+
Sbjct: 572 SKTIEGAAHFIRQYANCLDARTTVVFLQQLTSLTKKKSFGRVGLISLSECIASAASIVGF 631
Query: 578 DNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALD 637
+ND EGECF+ QG+LITYSLG K+ELLDDLRFVV+SSKQHFNPSYR QVCAKAL+
Sbjct: 632 ENDCEGECFD-----PQGNLITYSLGYKMELLDDLRFVVQSSKQHFNPSYRFQVCAKALE 691
Query: 638 AAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETK 697
AAASVLCTSDL LEVLL FIS+LPREATD GGCLRGKMQ+WL GCGKK CSGSCCSTETK
Sbjct: 692 AAASVLCTSDLDLEVLLLFISALPREATDYGGCLRGKMQSWLLGCGKKCCSGSCCSTETK 751
Query: 698 FMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTF 757
FMKSLIEFPKRFI HNHSS+ SVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLR ILTF
Sbjct: 752 FMKSLIEFPKRFICHNHSSNVSVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRSILTF 811
Query: 758 IHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSD 817
IHNHGVNI KQKSDLEGIRVKFLILILSLVQE QLVQEK D+NYKCETKD+YTL QPSD
Sbjct: 812 IHNHGVNIGKQKSDLEGIRVKFLILILSLVQELQLVQEKATDHNYKCETKDEYTLCQPSD 871
Query: 818 NWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQ 877
+ +Y+ PTT IKKF NLF SLL ELVSFAT SCSIFWSNVKSDETGLP SVKGKLGGPSQ
Sbjct: 872 DLNYAVPTTFIKKFVNLFSSLLEELVSFATASCSIFWSNVKSDETGLPSSVKGKLGGPSQ 931
Query: 878 RRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSE 937
RRLPS TATLVLLAVTSMKAIA +LSCCRQF+I GSQNFGVEFLLKF+ KTVSSPAY SE
Sbjct: 932 RRLPSYTATLVLLAVTSMKAIAFVLSCCRQFKIIGSQNFGVEFLLKFLLKTVSSPAYRSE 991
Query: 938 SGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHV 997
SG EI LATYEALA VLQVLVSEFSSQ L+FI DENTIMHLGVEGR LDSLVLTFHQHV
Sbjct: 992 SGVEIRLATYEALASVLQVLVSEFSSQVLQFIWDENTIMHLGVEGRQPLDSLVLTFHQHV 1051
Query: 998 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSD 1057
NGILDAGVLVRSRRAVLLKWKWLCLESLLSIP R Q+GL LVDNNSFLSEATLV+IF+D
Sbjct: 1052 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPQRTFQDGLCLVDNNSFLSEATLVQIFND 1111
Query: 1058 LVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE 1103
LVESLENAGECSVLPMLRLVRL LWLFCKGKSGLLVT CNGVN+E
Sbjct: 1112 LVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTFCNGVNSE 1151
BLAST of Sgr020180 vs. NCBI nr
Match:
XP_022155536.1 (uncharacterized protein LOC111022655 isoform X5 [Momordica charantia] >XP_022155537.1 uncharacterized protein LOC111022655 isoform X6 [Momordica charantia])
HSP 1 Score: 1831.6 bits (4743), Expect = 0.0e+00
Identity = 921/1065 (86.48%), Postives = 972/1065 (91.27%), Query Frame = 0
Query: 38 ADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISS 97
ADPDALK FIWK FVPLIN+A AFNREMLNQV+ESFIDVVIE NSWPIVE TL+P CISS
Sbjct: 92 ADPDALKLFIWKSFVPLINKAAAFNREMLNQVSESFIDVVIETNSWPIVEETLVPLCISS 151
Query: 98 ALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIML 157
ALYST++LQNE+ TFEGDRCSVILGSNG ++EP+MDKQM+K YGFLPLPLACHILAIML
Sbjct: 152 ALYSTTMLQNEQLGTFEGDRCSVILGSNGSVHEPKMDKQMIKGYGFLPLPLACHILAIML 211
Query: 158 DAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPL 217
DAVLCNR APQ ++ VV+NGCQKAEEFTVKLI DICNLS+QMLLQSSDHRSC IRYLLP+
Sbjct: 212 DAVLCNRQAPQTTEVVVSNGCQKAEEFTVKLIRDICNLSDQMLLQSSDHRSCAIRYLLPV 271
Query: 218 IFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFF 277
IFE LLS H+LEISIQG+ +LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FF
Sbjct: 272 IFEALLSQHNLEISIQGYACSLSRNRFLMKIWNCCKKLFSFGTLERRDAYTILSLYLSFF 331
Query: 278 PHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGN 337
PHNEEL GAG+CDDAEEFDIKADKDFW IKRGLVDKEGLVRKQS+HILKKAL INGRGN
Sbjct: 332 PHNEELEGAGMCDDAEEFDIKADKDFWVEIKRGLVDKEGLVRKQSVHILKKALSINGRGN 391
Query: 338 TSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLY 397
TS VP TISSGKDNNARGITKRERWA+KEAKSLGV Q CS+ EIV NS QQ+WEAFILLY
Sbjct: 392 TSSVPNTISSGKDNNARGITKRERWANKEAKSLGVVQTCSQHEIVTNSLQQKWEAFILLY 451
Query: 398 EMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRG 457
EMLEEYGSHLVEAAWNHQI LLLRDPTSI FDS TGG +QNQI+MS EIFSWLSILWVRG
Sbjct: 452 EMLEEYGSHLVEAAWNHQISLLLRDPTSIKFDSFTGGFYQNQIEMSGEIFSWLSILWVRG 511
Query: 458 FHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYS 517
FHHDNPLVRCLIMQSFL IDWRNYV CL SLP+T IIGPFIEALNDPVQHKDFGVKGVYS
Sbjct: 512 FHHDNPLVRCLIMQSFLGIDWRNYVSCLMSLPQTFIIGPFIEALNDPVQHKDFGVKGVYS 571
Query: 518 SKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGY 577
SKTIEGAAHFIRQYANCLDART+ VFLQQLTSL KKKSFGRVGLISLSECIASAAS+ G+
Sbjct: 572 SKTIEGAAHFIRQYANCLDARTTVVFLQQLTSLTKKKSFGRVGLISLSECIASAASIVGF 631
Query: 578 DNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALD 637
+ND EGECF+ QG+LITYSLG K+ELLDDLRFVV+SSKQHFNPSYR QVCAKAL+
Sbjct: 632 ENDCEGECFD-----PQGNLITYSLGYKMELLDDLRFVVQSSKQHFNPSYRFQVCAKALE 691
Query: 638 AAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETK 697
AAASVLCTSDL LEVLL FIS+LPREATD GGCLRGKMQ+WL GCGKK CSGSCCSTETK
Sbjct: 692 AAASVLCTSDLDLEVLLLFISALPREATDYGGCLRGKMQSWLLGCGKKCCSGSCCSTETK 751
Query: 698 FMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTF 757
FMKSLIEFPKRFI HNHSS+ SVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLR ILTF
Sbjct: 752 FMKSLIEFPKRFICHNHSSNVSVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRSILTF 811
Query: 758 IHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSD 817
IHNHGVNI KQKSDLEGIRVKFLILILSLVQE QLVQEK D+NYKCETKD+YTL QPSD
Sbjct: 812 IHNHGVNIGKQKSDLEGIRVKFLILILSLVQELQLVQEKATDHNYKCETKDEYTLCQPSD 871
Query: 818 NWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQ 877
+ +Y+ PTT IKKF NLF SLL ELVSFAT SCSIFWSNVKSDETGLP SVKGKLGGPSQ
Sbjct: 872 DLNYAVPTTFIKKFVNLFSSLLEELVSFATASCSIFWSNVKSDETGLPSSVKGKLGGPSQ 931
Query: 878 RRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSE 937
RRLPS TATLVLLAVTSMKAIA +LSCCRQF+I GSQNFGVEFLLKF+ KTVSSPAY SE
Sbjct: 932 RRLPSYTATLVLLAVTSMKAIAFVLSCCRQFKIIGSQNFGVEFLLKFLLKTVSSPAYRSE 991
Query: 938 SGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHV 997
SG EI LATYEALA VLQVLVSEFSSQ L+FI DENTIMHLGVEGR LDSLVLTFHQHV
Sbjct: 992 SGVEIRLATYEALASVLQVLVSEFSSQVLQFIWDENTIMHLGVEGRQPLDSLVLTFHQHV 1051
Query: 998 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSD 1057
NGILDAGVLVRSRRAVLLKWKWLCLESLLSIP R Q+GL LVDNNSFLSEATLV+IF+D
Sbjct: 1052 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPQRTFQDGLCLVDNNSFLSEATLVQIFND 1111
Query: 1058 LVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE 1103
LVESLENAGECSVLPMLRLVRL LWLFCKGKSGLLVT CNGVN+E
Sbjct: 1112 LVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTFCNGVNSE 1151
BLAST of Sgr020180 vs. NCBI nr
Match:
XP_022155535.1 (uncharacterized protein LOC111022655 isoform X4 [Momordica charantia])
HSP 1 Score: 1831.6 bits (4743), Expect = 0.0e+00
Identity = 921/1065 (86.48%), Postives = 972/1065 (91.27%), Query Frame = 0
Query: 38 ADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISS 97
ADPDALK FIWK FVPLIN+A AFNREMLNQV+ESFIDVVIE NSWPIVE TL+P CISS
Sbjct: 92 ADPDALKLFIWKSFVPLINKAAAFNREMLNQVSESFIDVVIETNSWPIVEETLVPLCISS 151
Query: 98 ALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIML 157
ALYST++LQNE+ TFEGDRCSVILGSNG ++EP+MDKQM+K YGFLPLPLACHILAIML
Sbjct: 152 ALYSTTMLQNEQLGTFEGDRCSVILGSNGSVHEPKMDKQMIKGYGFLPLPLACHILAIML 211
Query: 158 DAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPL 217
DAVLCNR APQ ++ VV+NGCQKAEEFTVKLI DICNLS+QMLLQSSDHRSC IRYLLP+
Sbjct: 212 DAVLCNRQAPQTTEVVVSNGCQKAEEFTVKLIRDICNLSDQMLLQSSDHRSCAIRYLLPV 271
Query: 218 IFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFF 277
IFE LLS H+LEISIQG+ +LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FF
Sbjct: 272 IFEALLSQHNLEISIQGYACSLSRNRFLMKIWNCCKKLFSFGTLERRDAYTILSLYLSFF 331
Query: 278 PHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGN 337
PHNEEL GAG+CDDAEEFDIKADKDFW IKRGLVDKEGLVRKQS+HILKKAL INGRGN
Sbjct: 332 PHNEELEGAGMCDDAEEFDIKADKDFWVEIKRGLVDKEGLVRKQSVHILKKALSINGRGN 391
Query: 338 TSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLY 397
TS VP TISSGKDNNARGITKRERWA+KEAKSLGV Q CS+ EIV NS QQ+WEAFILLY
Sbjct: 392 TSSVPNTISSGKDNNARGITKRERWANKEAKSLGVVQTCSQHEIVTNSLQQKWEAFILLY 451
Query: 398 EMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRG 457
EMLEEYGSHLVEAAWNHQI LLLRDPTSI FDS TGG +QNQI+MS EIFSWLSILWVRG
Sbjct: 452 EMLEEYGSHLVEAAWNHQISLLLRDPTSIKFDSFTGGFYQNQIEMSGEIFSWLSILWVRG 511
Query: 458 FHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYS 517
FHHDNPLVRCLIMQSFL IDWRNYV CL SLP+T IIGPFIEALNDPVQHKDFGVKGVYS
Sbjct: 512 FHHDNPLVRCLIMQSFLGIDWRNYVSCLMSLPQTFIIGPFIEALNDPVQHKDFGVKGVYS 571
Query: 518 SKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGY 577
SKTIEGAAHFIRQYANCLDART+ VFLQQLTSL KKKSFGRVGLISLSECIASAAS+ G+
Sbjct: 572 SKTIEGAAHFIRQYANCLDARTTVVFLQQLTSLTKKKSFGRVGLISLSECIASAASIVGF 631
Query: 578 DNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALD 637
+ND EGECF+ QG+LITYSLG K+ELLDDLRFVV+SSKQHFNPSYR QVCAKAL+
Sbjct: 632 ENDCEGECFD-----PQGNLITYSLGYKMELLDDLRFVVQSSKQHFNPSYRFQVCAKALE 691
Query: 638 AAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETK 697
AAASVLCTSDL LEVLL FIS+LPREATD GGCLRGKMQ+WL GCGKK CSGSCCSTETK
Sbjct: 692 AAASVLCTSDLDLEVLLLFISALPREATDYGGCLRGKMQSWLLGCGKKCCSGSCCSTETK 751
Query: 698 FMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTF 757
FMKSLIEFPKRFI HNHSS+ SVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLR ILTF
Sbjct: 752 FMKSLIEFPKRFICHNHSSNVSVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRSILTF 811
Query: 758 IHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSD 817
IHNHGVNI KQKSDLEGIRVKFLILILSLVQE QLVQEK D+NYKCETKD+YTL QPSD
Sbjct: 812 IHNHGVNIGKQKSDLEGIRVKFLILILSLVQELQLVQEKATDHNYKCETKDEYTLCQPSD 871
Query: 818 NWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQ 877
+ +Y+ PTT IKKF NLF SLL ELVSFAT SCSIFWSNVKSDETGLP SVKGKLGGPSQ
Sbjct: 872 DLNYAVPTTFIKKFVNLFSSLLEELVSFATASCSIFWSNVKSDETGLPSSVKGKLGGPSQ 931
Query: 878 RRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSE 937
RRLPS TATLVLLAVTSMKAIA +LSCCRQF+I GSQNFGVEFLLKF+ KTVSSPAY SE
Sbjct: 932 RRLPSYTATLVLLAVTSMKAIAFVLSCCRQFKIIGSQNFGVEFLLKFLLKTVSSPAYRSE 991
Query: 938 SGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHV 997
SG EI LATYEALA VLQVLVSEFSSQ L+FI DENTIMHLGVEGR LDSLVLTFHQHV
Sbjct: 992 SGVEIRLATYEALASVLQVLVSEFSSQVLQFIWDENTIMHLGVEGRQPLDSLVLTFHQHV 1051
Query: 998 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSD 1057
NGILDAGVLVRSRRAVLLKWKWLCLESLLSIP R Q+GL LVDNNSFLSEATLV+IF+D
Sbjct: 1052 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPQRTFQDGLCLVDNNSFLSEATLVQIFND 1111
Query: 1058 LVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE 1103
LVESLENAGECSVLPMLRLVRL LWLFCKGKSGLLVT CNGVN+E
Sbjct: 1112 LVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTFCNGVNSE 1151
BLAST of Sgr020180 vs. NCBI nr
Match:
XP_022155534.1 (uncharacterized protein LOC111022655 isoform X3 [Momordica charantia])
HSP 1 Score: 1819.7 bits (4712), Expect = 0.0e+00
Identity = 918/1065 (86.20%), Postives = 967/1065 (90.80%), Query Frame = 0
Query: 38 ADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISS 97
ADPDALK FIWK FVPLIN+A AFNREMLNQV+ESFIDVVIE NSWPIVE TL+P CISS
Sbjct: 92 ADPDALKLFIWKSFVPLINKAAAFNREMLNQVSESFIDVVIETNSWPIVEETLVPLCISS 151
Query: 98 ALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIML 157
ALYST++LQNE+ TFEGDRCSVILGSNG ++EP+MDKQM+K YGFLPLPLACHILAIML
Sbjct: 152 ALYSTTMLQNEQLGTFEGDRCSVILGSNGSVHEPKMDKQMIKGYGFLPLPLACHILAIML 211
Query: 158 DAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPL 217
DAVLCNR APQ ++ VV+NGCQKAEEFTVKLI DICNLS+QMLLQSSDHRSC IRYLLP+
Sbjct: 212 DAVLCNRQAPQTTEVVVSNGCQKAEEFTVKLIRDICNLSDQMLLQSSDHRSCAIRYLLPV 271
Query: 218 IFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFF 277
IFE LLS H+LEISIQG NRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FF
Sbjct: 272 IFEALLSQHNLEISIQG-------NRFLMKIWNCCKKLFSFGTLERRDAYTILSLYLSFF 331
Query: 278 PHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGN 337
PHNEEL GAG+CDDAEEFDIKADKDFW IKRGLVDKEGLVRKQS+HILKKAL INGRGN
Sbjct: 332 PHNEELEGAGMCDDAEEFDIKADKDFWVEIKRGLVDKEGLVRKQSVHILKKALSINGRGN 391
Query: 338 TSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLY 397
TS VP TISSGKDNNARGITKRERWA+KEAKSLGV Q CS+ EIV NS QQ+WEAFILLY
Sbjct: 392 TSSVPNTISSGKDNNARGITKRERWANKEAKSLGVVQTCSQHEIVTNSLQQKWEAFILLY 451
Query: 398 EMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRG 457
EMLEEYGSHLVEAAWNHQI LLLRDPTSI FDS TGG +QNQI+MS EIFSWLSILWVRG
Sbjct: 452 EMLEEYGSHLVEAAWNHQISLLLRDPTSIKFDSFTGGFYQNQIEMSGEIFSWLSILWVRG 511
Query: 458 FHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYS 517
FHHDNPLVRCLIMQSFL IDWRNYV CL SLP+T IIGPFIEALNDPVQHKDFGVKGVYS
Sbjct: 512 FHHDNPLVRCLIMQSFLGIDWRNYVSCLMSLPQTFIIGPFIEALNDPVQHKDFGVKGVYS 571
Query: 518 SKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGY 577
SKTIEGAAHFIRQYANCLDART+ VFLQQLTSL KKKSFGRVGLISLSECIASAAS+ G+
Sbjct: 572 SKTIEGAAHFIRQYANCLDARTTVVFLQQLTSLTKKKSFGRVGLISLSECIASAASIVGF 631
Query: 578 DNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALD 637
+ND EGECF+ QG+LITYSLG K+ELLDDLRFVV+SSKQHFNPSYR QVCAKAL+
Sbjct: 632 ENDCEGECFD-----PQGNLITYSLGYKMELLDDLRFVVQSSKQHFNPSYRFQVCAKALE 691
Query: 638 AAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETK 697
AAASVLCTSDL LEVLL FIS+LPREATD GGCLRGKMQ+WL GCGKK CSGSCCSTETK
Sbjct: 692 AAASVLCTSDLDLEVLLLFISALPREATDYGGCLRGKMQSWLLGCGKKCCSGSCCSTETK 751
Query: 698 FMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTF 757
FMKSLIEFPKRFI HNHSS+ SVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLR ILTF
Sbjct: 752 FMKSLIEFPKRFICHNHSSNVSVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRSILTF 811
Query: 758 IHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSD 817
IHNHGVNI KQKSDLEGIRVKFLILILSLVQE QLVQEK D+NYKCETKD+YTL QPSD
Sbjct: 812 IHNHGVNIGKQKSDLEGIRVKFLILILSLVQELQLVQEKATDHNYKCETKDEYTLCQPSD 871
Query: 818 NWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQ 877
+ +Y+ PTT IKKF NLF SLL ELVSFAT SCSIFWSNVKSDETGLP SVKGKLGGPSQ
Sbjct: 872 DLNYAVPTTFIKKFVNLFSSLLEELVSFATASCSIFWSNVKSDETGLPSSVKGKLGGPSQ 931
Query: 878 RRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSE 937
RRLPS TATLVLLAVTSMKAIA +LSCCRQF+I GSQNFGVEFLLKF+ KTVSSPAY SE
Sbjct: 932 RRLPSYTATLVLLAVTSMKAIAFVLSCCRQFKIIGSQNFGVEFLLKFLLKTVSSPAYRSE 991
Query: 938 SGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHV 997
SG EI LATYEALA VLQVLVSEFSSQ L+FI DENTIMHLGVEGR LDSLVLTFHQHV
Sbjct: 992 SGVEIRLATYEALASVLQVLVSEFSSQVLQFIWDENTIMHLGVEGRQPLDSLVLTFHQHV 1051
Query: 998 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSD 1057
NGILDAGVLVRSRRAVLLKWKWLCLESLLSIP R Q+GL LVDNNSFLSEATLV+IF+D
Sbjct: 1052 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPQRTFQDGLCLVDNNSFLSEATLVQIFND 1111
Query: 1058 LVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE 1103
LVESLENAGECSVLPMLRLVRL LWLFCKGKSGLLVT CNGVN+E
Sbjct: 1112 LVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTFCNGVNSE 1144
BLAST of Sgr020180 vs. ExPASy TrEMBL
Match:
A0A6J1DPL7 (uncharacterized protein LOC111022655 isoform X5 OS=Momordica charantia OX=3673 GN=LOC111022655 PE=4 SV=1)
HSP 1 Score: 1831.6 bits (4743), Expect = 0.0e+00
Identity = 921/1065 (86.48%), Postives = 972/1065 (91.27%), Query Frame = 0
Query: 38 ADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISS 97
ADPDALK FIWK FVPLIN+A AFNREMLNQV+ESFIDVVIE NSWPIVE TL+P CISS
Sbjct: 92 ADPDALKLFIWKSFVPLINKAAAFNREMLNQVSESFIDVVIETNSWPIVEETLVPLCISS 151
Query: 98 ALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIML 157
ALYST++LQNE+ TFEGDRCSVILGSNG ++EP+MDKQM+K YGFLPLPLACHILAIML
Sbjct: 152 ALYSTTMLQNEQLGTFEGDRCSVILGSNGSVHEPKMDKQMIKGYGFLPLPLACHILAIML 211
Query: 158 DAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPL 217
DAVLCNR APQ ++ VV+NGCQKAEEFTVKLI DICNLS+QMLLQSSDHRSC IRYLLP+
Sbjct: 212 DAVLCNRQAPQTTEVVVSNGCQKAEEFTVKLIRDICNLSDQMLLQSSDHRSCAIRYLLPV 271
Query: 218 IFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFF 277
IFE LLS H+LEISIQG+ +LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FF
Sbjct: 272 IFEALLSQHNLEISIQGYACSLSRNRFLMKIWNCCKKLFSFGTLERRDAYTILSLYLSFF 331
Query: 278 PHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGN 337
PHNEEL GAG+CDDAEEFDIKADKDFW IKRGLVDKEGLVRKQS+HILKKAL INGRGN
Sbjct: 332 PHNEELEGAGMCDDAEEFDIKADKDFWVEIKRGLVDKEGLVRKQSVHILKKALSINGRGN 391
Query: 338 TSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLY 397
TS VP TISSGKDNNARGITKRERWA+KEAKSLGV Q CS+ EIV NS QQ+WEAFILLY
Sbjct: 392 TSSVPNTISSGKDNNARGITKRERWANKEAKSLGVVQTCSQHEIVTNSLQQKWEAFILLY 451
Query: 398 EMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRG 457
EMLEEYGSHLVEAAWNHQI LLLRDPTSI FDS TGG +QNQI+MS EIFSWLSILWVRG
Sbjct: 452 EMLEEYGSHLVEAAWNHQISLLLRDPTSIKFDSFTGGFYQNQIEMSGEIFSWLSILWVRG 511
Query: 458 FHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYS 517
FHHDNPLVRCLIMQSFL IDWRNYV CL SLP+T IIGPFIEALNDPVQHKDFGVKGVYS
Sbjct: 512 FHHDNPLVRCLIMQSFLGIDWRNYVSCLMSLPQTFIIGPFIEALNDPVQHKDFGVKGVYS 571
Query: 518 SKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGY 577
SKTIEGAAHFIRQYANCLDART+ VFLQQLTSL KKKSFGRVGLISLSECIASAAS+ G+
Sbjct: 572 SKTIEGAAHFIRQYANCLDARTTVVFLQQLTSLTKKKSFGRVGLISLSECIASAASIVGF 631
Query: 578 DNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALD 637
+ND EGECF+ QG+LITYSLG K+ELLDDLRFVV+SSKQHFNPSYR QVCAKAL+
Sbjct: 632 ENDCEGECFD-----PQGNLITYSLGYKMELLDDLRFVVQSSKQHFNPSYRFQVCAKALE 691
Query: 638 AAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETK 697
AAASVLCTSDL LEVLL FIS+LPREATD GGCLRGKMQ+WL GCGKK CSGSCCSTETK
Sbjct: 692 AAASVLCTSDLDLEVLLLFISALPREATDYGGCLRGKMQSWLLGCGKKCCSGSCCSTETK 751
Query: 698 FMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTF 757
FMKSLIEFPKRFI HNHSS+ SVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLR ILTF
Sbjct: 752 FMKSLIEFPKRFICHNHSSNVSVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRSILTF 811
Query: 758 IHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSD 817
IHNHGVNI KQKSDLEGIRVKFLILILSLVQE QLVQEK D+NYKCETKD+YTL QPSD
Sbjct: 812 IHNHGVNIGKQKSDLEGIRVKFLILILSLVQELQLVQEKATDHNYKCETKDEYTLCQPSD 871
Query: 818 NWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQ 877
+ +Y+ PTT IKKF NLF SLL ELVSFAT SCSIFWSNVKSDETGLP SVKGKLGGPSQ
Sbjct: 872 DLNYAVPTTFIKKFVNLFSSLLEELVSFATASCSIFWSNVKSDETGLPSSVKGKLGGPSQ 931
Query: 878 RRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSE 937
RRLPS TATLVLLAVTSMKAIA +LSCCRQF+I GSQNFGVEFLLKF+ KTVSSPAY SE
Sbjct: 932 RRLPSYTATLVLLAVTSMKAIAFVLSCCRQFKIIGSQNFGVEFLLKFLLKTVSSPAYRSE 991
Query: 938 SGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHV 997
SG EI LATYEALA VLQVLVSEFSSQ L+FI DENTIMHLGVEGR LDSLVLTFHQHV
Sbjct: 992 SGVEIRLATYEALASVLQVLVSEFSSQVLQFIWDENTIMHLGVEGRQPLDSLVLTFHQHV 1051
Query: 998 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSD 1057
NGILDAGVLVRSRRAVLLKWKWLCLESLLSIP R Q+GL LVDNNSFLSEATLV+IF+D
Sbjct: 1052 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPQRTFQDGLCLVDNNSFLSEATLVQIFND 1111
Query: 1058 LVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE 1103
LVESLENAGECSVLPMLRLVRL LWLFCKGKSGLLVT CNGVN+E
Sbjct: 1112 LVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTFCNGVNSE 1151
BLAST of Sgr020180 vs. ExPASy TrEMBL
Match:
A0A6J1DMP5 (uncharacterized protein LOC111022655 isoform X4 OS=Momordica charantia OX=3673 GN=LOC111022655 PE=4 SV=1)
HSP 1 Score: 1831.6 bits (4743), Expect = 0.0e+00
Identity = 921/1065 (86.48%), Postives = 972/1065 (91.27%), Query Frame = 0
Query: 38 ADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISS 97
ADPDALK FIWK FVPLIN+A AFNREMLNQV+ESFIDVVIE NSWPIVE TL+P CISS
Sbjct: 92 ADPDALKLFIWKSFVPLINKAAAFNREMLNQVSESFIDVVIETNSWPIVEETLVPLCISS 151
Query: 98 ALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIML 157
ALYST++LQNE+ TFEGDRCSVILGSNG ++EP+MDKQM+K YGFLPLPLACHILAIML
Sbjct: 152 ALYSTTMLQNEQLGTFEGDRCSVILGSNGSVHEPKMDKQMIKGYGFLPLPLACHILAIML 211
Query: 158 DAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPL 217
DAVLCNR APQ ++ VV+NGCQKAEEFTVKLI DICNLS+QMLLQSSDHRSC IRYLLP+
Sbjct: 212 DAVLCNRQAPQTTEVVVSNGCQKAEEFTVKLIRDICNLSDQMLLQSSDHRSCAIRYLLPV 271
Query: 218 IFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFF 277
IFE LLS H+LEISIQG+ +LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FF
Sbjct: 272 IFEALLSQHNLEISIQGYACSLSRNRFLMKIWNCCKKLFSFGTLERRDAYTILSLYLSFF 331
Query: 278 PHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGN 337
PHNEEL GAG+CDDAEEFDIKADKDFW IKRGLVDKEGLVRKQS+HILKKAL INGRGN
Sbjct: 332 PHNEELEGAGMCDDAEEFDIKADKDFWVEIKRGLVDKEGLVRKQSVHILKKALSINGRGN 391
Query: 338 TSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLY 397
TS VP TISSGKDNNARGITKRERWA+KEAKSLGV Q CS+ EIV NS QQ+WEAFILLY
Sbjct: 392 TSSVPNTISSGKDNNARGITKRERWANKEAKSLGVVQTCSQHEIVTNSLQQKWEAFILLY 451
Query: 398 EMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRG 457
EMLEEYGSHLVEAAWNHQI LLLRDPTSI FDS TGG +QNQI+MS EIFSWLSILWVRG
Sbjct: 452 EMLEEYGSHLVEAAWNHQISLLLRDPTSIKFDSFTGGFYQNQIEMSGEIFSWLSILWVRG 511
Query: 458 FHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYS 517
FHHDNPLVRCLIMQSFL IDWRNYV CL SLP+T IIGPFIEALNDPVQHKDFGVKGVYS
Sbjct: 512 FHHDNPLVRCLIMQSFLGIDWRNYVSCLMSLPQTFIIGPFIEALNDPVQHKDFGVKGVYS 571
Query: 518 SKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGY 577
SKTIEGAAHFIRQYANCLDART+ VFLQQLTSL KKKSFGRVGLISLSECIASAAS+ G+
Sbjct: 572 SKTIEGAAHFIRQYANCLDARTTVVFLQQLTSLTKKKSFGRVGLISLSECIASAASIVGF 631
Query: 578 DNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALD 637
+ND EGECF+ QG+LITYSLG K+ELLDDLRFVV+SSKQHFNPSYR QVCAKAL+
Sbjct: 632 ENDCEGECFD-----PQGNLITYSLGYKMELLDDLRFVVQSSKQHFNPSYRFQVCAKALE 691
Query: 638 AAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETK 697
AAASVLCTSDL LEVLL FIS+LPREATD GGCLRGKMQ+WL GCGKK CSGSCCSTETK
Sbjct: 692 AAASVLCTSDLDLEVLLLFISALPREATDYGGCLRGKMQSWLLGCGKKCCSGSCCSTETK 751
Query: 698 FMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTF 757
FMKSLIEFPKRFI HNHSS+ SVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLR ILTF
Sbjct: 752 FMKSLIEFPKRFICHNHSSNVSVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRSILTF 811
Query: 758 IHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSD 817
IHNHGVNI KQKSDLEGIRVKFLILILSLVQE QLVQEK D+NYKCETKD+YTL QPSD
Sbjct: 812 IHNHGVNIGKQKSDLEGIRVKFLILILSLVQELQLVQEKATDHNYKCETKDEYTLCQPSD 871
Query: 818 NWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQ 877
+ +Y+ PTT IKKF NLF SLL ELVSFAT SCSIFWSNVKSDETGLP SVKGKLGGPSQ
Sbjct: 872 DLNYAVPTTFIKKFVNLFSSLLEELVSFATASCSIFWSNVKSDETGLPSSVKGKLGGPSQ 931
Query: 878 RRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSE 937
RRLPS TATLVLLAVTSMKAIA +LSCCRQF+I GSQNFGVEFLLKF+ KTVSSPAY SE
Sbjct: 932 RRLPSYTATLVLLAVTSMKAIAFVLSCCRQFKIIGSQNFGVEFLLKFLLKTVSSPAYRSE 991
Query: 938 SGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHV 997
SG EI LATYEALA VLQVLVSEFSSQ L+FI DENTIMHLGVEGR LDSLVLTFHQHV
Sbjct: 992 SGVEIRLATYEALASVLQVLVSEFSSQVLQFIWDENTIMHLGVEGRQPLDSLVLTFHQHV 1051
Query: 998 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSD 1057
NGILDAGVLVRSRRAVLLKWKWLCLESLLSIP R Q+GL LVDNNSFLSEATLV+IF+D
Sbjct: 1052 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPQRTFQDGLCLVDNNSFLSEATLVQIFND 1111
Query: 1058 LVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE 1103
LVESLENAGECSVLPMLRLVRL LWLFCKGKSGLLVT CNGVN+E
Sbjct: 1112 LVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTFCNGVNSE 1151
BLAST of Sgr020180 vs. ExPASy TrEMBL
Match:
A0A6J1DN74 (uncharacterized protein LOC111022655 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111022655 PE=4 SV=1)
HSP 1 Score: 1831.6 bits (4743), Expect = 0.0e+00
Identity = 921/1065 (86.48%), Postives = 972/1065 (91.27%), Query Frame = 0
Query: 38 ADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISS 97
ADPDALK FIWK FVPLIN+A AFNREMLNQV+ESFIDVVIE NSWPIVE TL+P CISS
Sbjct: 92 ADPDALKLFIWKSFVPLINKAAAFNREMLNQVSESFIDVVIETNSWPIVEETLVPLCISS 151
Query: 98 ALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIML 157
ALYST++LQNE+ TFEGDRCSVILGSNG ++EP+MDKQM+K YGFLPLPLACHILAIML
Sbjct: 152 ALYSTTMLQNEQLGTFEGDRCSVILGSNGSVHEPKMDKQMIKGYGFLPLPLACHILAIML 211
Query: 158 DAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPL 217
DAVLCNR APQ ++ VV+NGCQKAEEFTVKLI DICNLS+QMLLQSSDHRSC IRYLLP+
Sbjct: 212 DAVLCNRQAPQTTEVVVSNGCQKAEEFTVKLIRDICNLSDQMLLQSSDHRSCAIRYLLPV 271
Query: 218 IFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFF 277
IFE LLS H+LEISIQG+ +LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FF
Sbjct: 272 IFEALLSQHNLEISIQGYACSLSRNRFLMKIWNCCKKLFSFGTLERRDAYTILSLYLSFF 331
Query: 278 PHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGN 337
PHNEEL GAG+CDDAEEFDIKADKDFW IKRGLVDKEGLVRKQS+HILKKAL INGRGN
Sbjct: 332 PHNEELEGAGMCDDAEEFDIKADKDFWVEIKRGLVDKEGLVRKQSVHILKKALSINGRGN 391
Query: 338 TSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLY 397
TS VP TISSGKDNNARGITKRERWA+KEAKSLGV Q CS+ EIV NS QQ+WEAFILLY
Sbjct: 392 TSSVPNTISSGKDNNARGITKRERWANKEAKSLGVVQTCSQHEIVTNSLQQKWEAFILLY 451
Query: 398 EMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRG 457
EMLEEYGSHLVEAAWNHQI LLLRDPTSI FDS TGG +QNQI+MS EIFSWLSILWVRG
Sbjct: 452 EMLEEYGSHLVEAAWNHQISLLLRDPTSIKFDSFTGGFYQNQIEMSGEIFSWLSILWVRG 511
Query: 458 FHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYS 517
FHHDNPLVRCLIMQSFL IDWRNYV CL SLP+T IIGPFIEALNDPVQHKDFGVKGVYS
Sbjct: 512 FHHDNPLVRCLIMQSFLGIDWRNYVSCLMSLPQTFIIGPFIEALNDPVQHKDFGVKGVYS 571
Query: 518 SKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGY 577
SKTIEGAAHFIRQYANCLDART+ VFLQQLTSL KKKSFGRVGLISLSECIASAAS+ G+
Sbjct: 572 SKTIEGAAHFIRQYANCLDARTTVVFLQQLTSLTKKKSFGRVGLISLSECIASAASIVGF 631
Query: 578 DNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALD 637
+ND EGECF+ QG+LITYSLG K+ELLDDLRFVV+SSKQHFNPSYR QVCAKAL+
Sbjct: 632 ENDCEGECFD-----PQGNLITYSLGYKMELLDDLRFVVQSSKQHFNPSYRFQVCAKALE 691
Query: 638 AAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETK 697
AAASVLCTSDL LEVLL FIS+LPREATD GGCLRGKMQ+WL GCGKK CSGSCCSTETK
Sbjct: 692 AAASVLCTSDLDLEVLLLFISALPREATDYGGCLRGKMQSWLLGCGKKCCSGSCCSTETK 751
Query: 698 FMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTF 757
FMKSLIEFPKRFI HNHSS+ SVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLR ILTF
Sbjct: 752 FMKSLIEFPKRFICHNHSSNVSVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRSILTF 811
Query: 758 IHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSD 817
IHNHGVNI KQKSDLEGIRVKFLILILSLVQE QLVQEK D+NYKCETKD+YTL QPSD
Sbjct: 812 IHNHGVNIGKQKSDLEGIRVKFLILILSLVQELQLVQEKATDHNYKCETKDEYTLCQPSD 871
Query: 818 NWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQ 877
+ +Y+ PTT IKKF NLF SLL ELVSFAT SCSIFWSNVKSDETGLP SVKGKLGGPSQ
Sbjct: 872 DLNYAVPTTFIKKFVNLFSSLLEELVSFATASCSIFWSNVKSDETGLPSSVKGKLGGPSQ 931
Query: 878 RRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSE 937
RRLPS TATLVLLAVTSMKAIA +LSCCRQF+I GSQNFGVEFLLKF+ KTVSSPAY SE
Sbjct: 932 RRLPSYTATLVLLAVTSMKAIAFVLSCCRQFKIIGSQNFGVEFLLKFLLKTVSSPAYRSE 991
Query: 938 SGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHV 997
SG EI LATYEALA VLQVLVSEFSSQ L+FI DENTIMHLGVEGR LDSLVLTFHQHV
Sbjct: 992 SGVEIRLATYEALASVLQVLVSEFSSQVLQFIWDENTIMHLGVEGRQPLDSLVLTFHQHV 1051
Query: 998 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSD 1057
NGILDAGVLVRSRRAVLLKWKWLCLESLLSIP R Q+GL LVDNNSFLSEATLV+IF+D
Sbjct: 1052 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPQRTFQDGLCLVDNNSFLSEATLVQIFND 1111
Query: 1058 LVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE 1103
LVESLENAGECSVLPMLRLVRL LWLFCKGKSGLLVT CNGVN+E
Sbjct: 1112 LVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTFCNGVNSE 1151
BLAST of Sgr020180 vs. ExPASy TrEMBL
Match:
A0A6J1DPL1 (uncharacterized protein LOC111022655 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022655 PE=4 SV=1)
HSP 1 Score: 1831.6 bits (4743), Expect = 0.0e+00
Identity = 921/1065 (86.48%), Postives = 972/1065 (91.27%), Query Frame = 0
Query: 38 ADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISS 97
ADPDALK FIWK FVPLIN+A AFNREMLNQV+ESFIDVVIE NSWPIVE TL+P CISS
Sbjct: 92 ADPDALKLFIWKSFVPLINKAAAFNREMLNQVSESFIDVVIETNSWPIVEETLVPLCISS 151
Query: 98 ALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIML 157
ALYST++LQNE+ TFEGDRCSVILGSNG ++EP+MDKQM+K YGFLPLPLACHILAIML
Sbjct: 152 ALYSTTMLQNEQLGTFEGDRCSVILGSNGSVHEPKMDKQMIKGYGFLPLPLACHILAIML 211
Query: 158 DAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPL 217
DAVLCNR APQ ++ VV+NGCQKAEEFTVKLI DICNLS+QMLLQSSDHRSC IRYLLP+
Sbjct: 212 DAVLCNRQAPQTTEVVVSNGCQKAEEFTVKLIRDICNLSDQMLLQSSDHRSCAIRYLLPV 271
Query: 218 IFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFF 277
IFE LLS H+LEISIQG+ +LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FF
Sbjct: 272 IFEALLSQHNLEISIQGYACSLSRNRFLMKIWNCCKKLFSFGTLERRDAYTILSLYLSFF 331
Query: 278 PHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGN 337
PHNEEL GAG+CDDAEEFDIKADKDFW IKRGLVDKEGLVRKQS+HILKKAL INGRGN
Sbjct: 332 PHNEELEGAGMCDDAEEFDIKADKDFWVEIKRGLVDKEGLVRKQSVHILKKALSINGRGN 391
Query: 338 TSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLY 397
TS VP TISSGKDNNARGITKRERWA+KEAKSLGV Q CS+ EIV NS QQ+WEAFILLY
Sbjct: 392 TSSVPNTISSGKDNNARGITKRERWANKEAKSLGVVQTCSQHEIVTNSLQQKWEAFILLY 451
Query: 398 EMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRG 457
EMLEEYGSHLVEAAWNHQI LLLRDPTSI FDS TGG +QNQI+MS EIFSWLSILWVRG
Sbjct: 452 EMLEEYGSHLVEAAWNHQISLLLRDPTSIKFDSFTGGFYQNQIEMSGEIFSWLSILWVRG 511
Query: 458 FHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYS 517
FHHDNPLVRCLIMQSFL IDWRNYV CL SLP+T IIGPFIEALNDPVQHKDFGVKGVYS
Sbjct: 512 FHHDNPLVRCLIMQSFLGIDWRNYVSCLMSLPQTFIIGPFIEALNDPVQHKDFGVKGVYS 571
Query: 518 SKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGY 577
SKTIEGAAHFIRQYANCLDART+ VFLQQLTSL KKKSFGRVGLISLSECIASAAS+ G+
Sbjct: 572 SKTIEGAAHFIRQYANCLDARTTVVFLQQLTSLTKKKSFGRVGLISLSECIASAASIVGF 631
Query: 578 DNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALD 637
+ND EGECF+ QG+LITYSLG K+ELLDDLRFVV+SSKQHFNPSYR QVCAKAL+
Sbjct: 632 ENDCEGECFD-----PQGNLITYSLGYKMELLDDLRFVVQSSKQHFNPSYRFQVCAKALE 691
Query: 638 AAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETK 697
AAASVLCTSDL LEVLL FIS+LPREATD GGCLRGKMQ+WL GCGKK CSGSCCSTETK
Sbjct: 692 AAASVLCTSDLDLEVLLLFISALPREATDYGGCLRGKMQSWLLGCGKKCCSGSCCSTETK 751
Query: 698 FMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTF 757
FMKSLIEFPKRFI HNHSS+ SVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLR ILTF
Sbjct: 752 FMKSLIEFPKRFICHNHSSNVSVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRSILTF 811
Query: 758 IHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSD 817
IHNHGVNI KQKSDLEGIRVKFLILILSLVQE QLVQEK D+NYKCETKD+YTL QPSD
Sbjct: 812 IHNHGVNIGKQKSDLEGIRVKFLILILSLVQELQLVQEKATDHNYKCETKDEYTLCQPSD 871
Query: 818 NWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQ 877
+ +Y+ PTT IKKF NLF SLL ELVSFAT SCSIFWSNVKSDETGLP SVKGKLGGPSQ
Sbjct: 872 DLNYAVPTTFIKKFVNLFSSLLEELVSFATASCSIFWSNVKSDETGLPSSVKGKLGGPSQ 931
Query: 878 RRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSE 937
RRLPS TATLVLLAVTSMKAIA +LSCCRQF+I GSQNFGVEFLLKF+ KTVSSPAY SE
Sbjct: 932 RRLPSYTATLVLLAVTSMKAIAFVLSCCRQFKIIGSQNFGVEFLLKFLLKTVSSPAYRSE 991
Query: 938 SGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHV 997
SG EI LATYEALA VLQVLVSEFSSQ L+FI DENTIMHLGVEGR LDSLVLTFHQHV
Sbjct: 992 SGVEIRLATYEALASVLQVLVSEFSSQVLQFIWDENTIMHLGVEGRQPLDSLVLTFHQHV 1051
Query: 998 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSD 1057
NGILDAGVLVRSRRAVLLKWKWLCLESLLSIP R Q+GL LVDNNSFLSEATLV+IF+D
Sbjct: 1052 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPQRTFQDGLCLVDNNSFLSEATLVQIFND 1111
Query: 1058 LVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE 1103
LVESLENAGECSVLPMLRLVRL LWLFCKGKSGLLVT CNGVN+E
Sbjct: 1112 LVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTFCNGVNSE 1151
BLAST of Sgr020180 vs. ExPASy TrEMBL
Match:
A0A6J1DQJ7 (uncharacterized protein LOC111022655 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111022655 PE=4 SV=1)
HSP 1 Score: 1819.7 bits (4712), Expect = 0.0e+00
Identity = 918/1065 (86.20%), Postives = 967/1065 (90.80%), Query Frame = 0
Query: 38 ADPDALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISS 97
ADPDALK FIWK FVPLIN+A AFNREMLNQV+ESFIDVVIE NSWPIVE TL+P CISS
Sbjct: 92 ADPDALKLFIWKSFVPLINKAAAFNREMLNQVSESFIDVVIETNSWPIVEETLVPLCISS 151
Query: 98 ALYSTSVLQNEEFDTFEGDRCSVILGSNGPINEPRMDKQMMKAYGFLPLPLACHILAIML 157
ALYST++LQNE+ TFEGDRCSVILGSNG ++EP+MDKQM+K YGFLPLPLACHILAIML
Sbjct: 152 ALYSTTMLQNEQLGTFEGDRCSVILGSNGSVHEPKMDKQMIKGYGFLPLPLACHILAIML 211
Query: 158 DAVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPL 217
DAVLCNR APQ ++ VV+NGCQKAEEFTVKLI DICNLS+QMLLQSSDHRSC IRYLLP+
Sbjct: 212 DAVLCNRQAPQTTEVVVSNGCQKAEEFTVKLIRDICNLSDQMLLQSSDHRSCAIRYLLPV 271
Query: 218 IFEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFF 277
IFE LLS H+LEISIQG NRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FF
Sbjct: 272 IFEALLSQHNLEISIQG-------NRFLMKIWNCCKKLFSFGTLERRDAYTILSLYLSFF 331
Query: 278 PHNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGN 337
PHNEEL GAG+CDDAEEFDIKADKDFW IKRGLVDKEGLVRKQS+HILKKAL INGRGN
Sbjct: 332 PHNEELEGAGMCDDAEEFDIKADKDFWVEIKRGLVDKEGLVRKQSVHILKKALSINGRGN 391
Query: 338 TSRVPKTISSGKDNNARGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFILLY 397
TS VP TISSGKDNNARGITKRERWA+KEAKSLGV Q CS+ EIV NS QQ+WEAFILLY
Sbjct: 392 TSSVPNTISSGKDNNARGITKRERWANKEAKSLGVVQTCSQHEIVTNSLQQKWEAFILLY 451
Query: 398 EMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGIHQNQIDMSSEIFSWLSILWVRG 457
EMLEEYGSHLVEAAWNHQI LLLRDPTSI FDS TGG +QNQI+MS EIFSWLSILWVRG
Sbjct: 452 EMLEEYGSHLVEAAWNHQISLLLRDPTSIKFDSFTGGFYQNQIEMSGEIFSWLSILWVRG 511
Query: 458 FHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHKDFGVKGVYS 517
FHHDNPLVRCLIMQSFL IDWRNYV CL SLP+T IIGPFIEALNDPVQHKDFGVKGVYS
Sbjct: 512 FHHDNPLVRCLIMQSFLGIDWRNYVSCLMSLPQTFIIGPFIEALNDPVQHKDFGVKGVYS 571
Query: 518 SKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECIASAASVGGY 577
SKTIEGAAHFIRQYANCLDART+ VFLQQLTSL KKKSFGRVGLISLSECIASAAS+ G+
Sbjct: 572 SKTIEGAAHFIRQYANCLDARTTVVFLQQLTSLTKKKSFGRVGLISLSECIASAASIVGF 631
Query: 578 DNDSEGECFEGSSLSAQGDLITYSLGCKLELLDDLRFVVESSKQHFNPSYRLQVCAKALD 637
+ND EGECF+ QG+LITYSLG K+ELLDDLRFVV+SSKQHFNPSYR QVCAKAL+
Sbjct: 632 ENDCEGECFD-----PQGNLITYSLGYKMELLDDLRFVVQSSKQHFNPSYRFQVCAKALE 691
Query: 638 AAASVLCTSDLALEVLLHFISSLPREATDCGGCLRGKMQNWLSGCGKKSCSGSCCSTETK 697
AAASVLCTSDL LEVLL FIS+LPREATD GGCLRGKMQ+WL GCGKK CSGSCCSTETK
Sbjct: 692 AAASVLCTSDLDLEVLLLFISALPREATDYGGCLRGKMQSWLLGCGKKCCSGSCCSTETK 751
Query: 698 FMKSLIEFPKRFISHNHSSDASVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRPILTF 757
FMKSLIEFPKRFI HNHSS+ SVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLR ILTF
Sbjct: 752 FMKSLIEFPKRFICHNHSSNVSVTYDDEELEAWEFEAKRWARVVFLAVKEEHHLRSILTF 811
Query: 758 IHNHGVNICKQKSDLEGIRVKFLILILSLVQEHQLVQEKIADYNYKCETKDDYTLSQPSD 817
IHNHGVNI KQKSDLEGIRVKFLILILSLVQE QLVQEK D+NYKCETKD+YTL QPSD
Sbjct: 812 IHNHGVNIGKQKSDLEGIRVKFLILILSLVQELQLVQEKATDHNYKCETKDEYTLCQPSD 871
Query: 818 NWSYSEPTTCIKKFANLFPSLLVELVSFATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQ 877
+ +Y+ PTT IKKF NLF SLL ELVSFAT SCSIFWSNVKSDETGLP SVKGKLGGPSQ
Sbjct: 872 DLNYAVPTTFIKKFVNLFSSLLEELVSFATASCSIFWSNVKSDETGLPSSVKGKLGGPSQ 931
Query: 878 RRLPSSTATLVLLAVTSMKAIASILSCCRQFRITGSQNFGVEFLLKFVWKTVSSPAYHSE 937
RRLPS TATLVLLAVTSMKAIA +LSCCRQF+I GSQNFGVEFLLKF+ KTVSSPAY SE
Sbjct: 932 RRLPSYTATLVLLAVTSMKAIAFVLSCCRQFKIIGSQNFGVEFLLKFLLKTVSSPAYRSE 991
Query: 938 SGAEICLATYEALAPVLQVLVSEFSSQALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHV 997
SG EI LATYEALA VLQVLVSEFSSQ L+FI DENTIMHLGVEGR LDSLVLTFHQHV
Sbjct: 992 SGVEIRLATYEALASVLQVLVSEFSSQVLQFIWDENTIMHLGVEGRQPLDSLVLTFHQHV 1051
Query: 998 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPYRALQNGLNLVDNNSFLSEATLVRIFSD 1057
NGILDAGVLVRSRRAVLLKWKWLCLESLLSIP R Q+GL LVDNNSFLSEATLV+IF+D
Sbjct: 1052 NGILDAGVLVRSRRAVLLKWKWLCLESLLSIPQRTFQDGLCLVDNNSFLSEATLVQIFND 1111
Query: 1058 LVESLENAGECSVLPMLRLVRLNLWLFCKGKSGLLVTSCNGVNAE 1103
LVESLENAGECSVLPMLRLVRL LWLFCKGKSGLLVT CNGVN+E
Sbjct: 1112 LVESLENAGECSVLPMLRLVRLTLWLFCKGKSGLLVTFCNGVNSE 1144
BLAST of Sgr020180 vs. TAIR 10
Match:
AT4G17610.1 (tRNA/rRNA methyltransferase (SpoU) family protein )
HSP 1 Score: 815.5 bits (2105), Expect = 5.3e-236
Identity = 475/1087 (43.70%), Postives = 660/1087 (60.72%), Query Frame = 0
Query: 41 DALKSFIWKCFVPLINRAIAFNREMLNQVAESFIDVVIEMNSWPIVEATLIPFCISSALY 100
+AL+ F+W+ F+PL+ A++ +MLN++ ESF DVVIE N ++ +L+PF + S +
Sbjct: 92 NALQLFVWRVFIPLMKMVRAYDLDMLNKIVESFFDVVIETNVLDMLGVSLVPFLLRSVGF 151
Query: 101 STSVLQNEEFDTFE-GDRCSVILGSNGPINEPRMDKQ-MMKAYGFLPLPLACHILAIMLD 160
S + Q+EE D + GD C +N MD+ + + G P+PL+CH+L ++L+
Sbjct: 152 SMGMRQHEESDFIKWGDLC-----LRDSLNTIDMDENYIAQLSGSFPIPLSCHLLNLILN 211
Query: 161 AVLCNRHAPQISDAVVANGCQKAEEFTVKLIWDICNLSEQMLLQSSDHRSCTIRYLLPLI 220
A + A K E F ++WD+CN +E++L QS +HRSC + +LLP I
Sbjct: 212 AAFQSHQA-----------APKVESFAAGMLWDLCNTTERLLSQSVEHRSCAVSFLLPAI 271
Query: 221 FEVLLSHHSLEISIQGHVHNLSRNRFLMKIWKCCKKLFSFGTLERRDAYKILSLYLCFFP 280
F+ S SL+IS QG++ LSRN F+ +IW+CCKKLFS G++ERRDAY +LSL L
Sbjct: 272 FKAFSSQSSLKISHQGNLFILSRNGFIKRIWECCKKLFSVGSIERRDAYSVLSLCLSSGS 331
Query: 281 HNEELGGAGICDDAEEFDIKADKDFWDAIKRGLVDKEGLVRKQSLHILKKALYINGRGNT 340
+ DA +FD++++++FWD IK GLV E LVRKQSLHILK L I
Sbjct: 332 WTDGTESFVSEKDAVQFDLRSEQEFWDEIKIGLVVDESLVRKQSLHILKSVLSI------ 391
Query: 341 SRVPKTISSGK-DNNA--RGITKRERWAHKEAKSLGVGQICSESEIVINSQQQQWEAFIL 400
V +TIS K + N+ R +T++E WA KEAKSLGVG++ + + S QQ W+AF+L
Sbjct: 392 IEVSETISEKKPEGNSVNRSMTRKETWAEKEAKSLGVGELYGSVDSGLTS-QQGWQAFLL 451
Query: 401 LYEMLEEYGSHLVEAAWNHQIFLLLRDPTSINFDSVTGGI-------HQNQIDMSSEIFS 460
LYEMLEEYG+HLVEAAW++QI LL++ +S+ +D H D ++IF+
Sbjct: 452 LYEMLEEYGTHLVEAAWSNQIDLLIK--SSLRYDGTLKSDCNNSHHGHMETPDEEAKIFN 511
Query: 461 WLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNYVPCLKSLPETLIIGPFIEALNDPVQHK 520
WL +LW RGF HDNPLVRC +M+SF I+WR Y C +S+ +T ++GPFIE LNDP HK
Sbjct: 512 WLEVLWNRGFRHDNPLVRCTVMESFFGIEWRRYKTCTQSMSQTFVLGPFIEGLNDPTHHK 571
Query: 521 DFGVKGVYSSKTIEGAAHFIRQYANCLDARTSAVFLQQLTSLAKKKSFGRVGLISLSECI 580
DFG+KG+Y+S+TIEGAA ++ Y +CL+ R FL L SLAKK+SF R G ++L +CI
Sbjct: 572 DFGLKGIYTSRTIEGAAQYVSAYTSCLNPRNRVGFLINLASLAKKQSFCRAGFMALVQCI 631
Query: 581 ASAA-SVGGYDNDSEGECFEGSSLSAQGDLITY-SLGCKLELLDDLRFVVESSKQHFNPS 640
S A VGGY + G + S +AQ + S +LD L+FV ESS+QHFN
Sbjct: 632 VSTAYVVGGYGDKEMGHLEDKFSGTAQESSCGHLSQDDMTHILDVLKFVAESSRQHFNHK 691
Query: 641 YRLQ---------------------VCAKALDAAASVLCTSDLALEVLLHFISSLPREAT 700
YR++ V K L+ AASV+ ++ L LL F+S++PRE T
Sbjct: 692 YRIRASLTILNTLKFLLLIVLFHFLVYQKVLETAASVVNPCNVPLGTLLQFVSAIPREFT 751
Query: 701 DCGGCLRGKMQNWLSGCGKKSCSGSCCSTETKFMKSLIEFPKRFISHNHSSDASVTYDDE 760
D G LR M WL GC +K+ S S C+ T+ + SL E+ K F S N S +DDE
Sbjct: 752 DHDGLLRKMMLEWLQGCNRKT-SNSLCTDGTRLLASLYEYLKGFTSDNVES-----FDDE 811
Query: 761 ELEAWEFEAKRWARVVFLAVKEEHHLRPILTFIHNHGVNICKQKSDLEGIRVKFLILILS 820
+LEAW+ + KRWARV FL + +E HL I+ F+ N+G++ ++K+ L+ KFLI ILS
Sbjct: 812 DLEAWDSQTKRWARVFFLMINKEEHLTDIIMFVQNNGLSFFQEKNHLKRAPAKFLIFILS 871
Query: 821 LVQEHQLVQEKIADYNYKCETKDDYTLSQPSDNWSYSEPTTCIKKFANLFPSLLVELVSF 880
++ E Q +Q+ I++ + ++K + + + ++ KKFA + S+L EL+ F
Sbjct: 872 MLLELQNMQDGISELSSSVKSKSGIGSDEQTGKQIVVDASSIKKKFAVVLLSILKELIPF 931
Query: 881 ATVSCSIFWSNVKSDETGLPCSVKGKLGGPSQRRLPSSTATLVLLAVTSMKAIASILSCC 940
A SCSIFWS+ + LP SV GKLGGPSQRRL T T VL AV S+K I I S C
Sbjct: 932 ADSSCSIFWSHTTVENGALPGSVIGKLGGPSQRRLSVPTTTAVLEAVLSVKTIGLISSYC 991
Query: 941 RQFRI-TGSQNFGVEFLLKFVWKTVSSPAYHSESGAEICLATYEALAPVLQVLVSEFSSQ 1000
QF G + F KF T+SS +SE+ AEI LA +EALA VL VS S+
Sbjct: 992 AQFTSGVGELKLALAFFWKFTQHTISSQICNSEAAAEIYLAAFEALASVLNAFVSLCSAG 1051
Query: 1001 ALKFIRDENTIMHLGVEGRPLLDSLVLTFHQHVNGILDAGVLVRSRRAVLLKWKWLCLES 1060
A + +++T++ + V+G L V F +++N +L AGVLVRSRRAVLL WKWLC+ES
Sbjct: 1052 AFNLLENDSTLLSM-VDGEFWLQVSVPAFVRNINHLLTAGVLVRSRRAVLLSWKWLCVES 1111
Query: 1061 LLSIPYRALQNGLNLVDNNSFLSEATLVRIFSDLVESLENAGECSVLPMLRLVRLNLWLF 1092
LLS+ + L D SF S+ T+ IF D+VESLENAGE S LPML+ VRL L +
Sbjct: 1112 LLSVMH-ILDARRIPEDRKSFFSDDTVKSIFQDIVESLENAGEGSALPMLKSVRLALGIL 1145
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022155532.1 | 0.0e+00 | 86.48 | uncharacterized protein LOC111022655 isoform X1 [Momordica charantia] | [more] |
XP_022155533.1 | 0.0e+00 | 86.48 | uncharacterized protein LOC111022655 isoform X2 [Momordica charantia] | [more] |
XP_022155536.1 | 0.0e+00 | 86.48 | uncharacterized protein LOC111022655 isoform X5 [Momordica charantia] >XP_022155... | [more] |
XP_022155535.1 | 0.0e+00 | 86.48 | uncharacterized protein LOC111022655 isoform X4 [Momordica charantia] | [more] |
XP_022155534.1 | 0.0e+00 | 86.20 | uncharacterized protein LOC111022655 isoform X3 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1DPL7 | 0.0e+00 | 86.48 | uncharacterized protein LOC111022655 isoform X5 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1DMP5 | 0.0e+00 | 86.48 | uncharacterized protein LOC111022655 isoform X4 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1DN74 | 0.0e+00 | 86.48 | uncharacterized protein LOC111022655 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1DPL1 | 0.0e+00 | 86.48 | uncharacterized protein LOC111022655 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1DQJ7 | 0.0e+00 | 86.20 | uncharacterized protein LOC111022655 isoform X3 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |
AT4G17610.1 | 5.3e-236 | 43.70 | tRNA/rRNA methyltransferase (SpoU) family protein | [more] |