Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTATAATCTAAAATCAGAGGTAGTGAAGATTAAAAGAAGAGTGAAGAATGTAGGAAGTCCAGGCACCTATGTTGCTAAAGTCGAGGCACCACCAGGAGTTTTGGTTTCGGTAGACCCAAGTACTTTGAAGTTCACTAAAACGGGTGAAGAGAAGGATTTTCAAGTTGTGTTGAGGAGGGTTCAAAGTAATCAAACTGAAGAAAAGCATGTGTTCGAAAAACTTGTGTGGCCTGATGGAAAACATCGTGTTAGTAGTCCAATTTTTGTGACATTAGAGATTTAGTAAAAGGTTGAGTTTAGCTTGTTTTAGCTTGAAAAAACAACTTGAAGGGAGAATTTTTTGGACTAAGAACCTTAACACTATGCTCTTTGATTCTCTTATGTTGTTGTAAATAGAACATACACATTAATATTTATTGATAAAACATTAACAAATTTCCTAAACTTTATCCTCAGTTAAAACAGCTAACCATTTTCCTAAATAAATCACTAGCCAGTTTCAAAAACTTGCTTAGGAACTTCATCATGTTTTAAGGTTCTTCCACCTCTTTTGCTAGGAACTAAGGTACTCCTCTGTTTTTGAACCCTATTATTATTGAATTTTTGTTTCTTTTTAGGTTTTTTTCGTTTCTTTTTACGTTTTCTTCTAGGTCTCTTTGGTTGAGACGGTTTGAAGCACTCTAAGTACTGAAGTTTTCTCTAATTTCTTCTTAGCTCTTTTCTTTGGTTCCTTCTTCTCATGACTCCGAGATTTCTTCTTCTCTTTCTTGCTATCAATGCTAATAGAAGCCTTTGAAAGATGTTGATAGTTTTTTATTAGTATCTTATACTAGCGATTGTACGACCGTTTCATTTTCATCAACAGAGCTTCGTTTTTGTCGAATATGTCAAGGAACAAGCCAATAAACTCTATTCTGTAGACTTCTTCTATTATTTTGACCCACGCTAATAATATATCTTTTTAAACTCGGGATGTAGTATTCATTGGTCAATAAACGTTGATCGTCATTCTTACATTGAAATAAGATAGATCCTTTTCATTGAATCGATACAATTATTGGTTTTAACTTTGATCAAAATATACCAAAACAAATAAATGGAAGAGAAAGAACAACTTGAGGGAAGAATTTCTTGGACTAAGAACCTTAAGCATTAAGCCTTTTAGTTCTTTTATGTCGGTTGTGAATAAAACATTTGAGTATTTATACGGGGGACAAGGCACTAACAAATTTTCTAAACTTTATCCTCAAGTTAAAACATGCTAATAATTCTCTTAAATAAATCACCAACTATTTTCCAAAACTTAGCTCTAGAATTTGGCAACTATTAACGGTTAGTTTACTACTAACAATTCTTTTACTAAGTCAAGATTATTGAATCACAAATGGATCTAAAATAAGTGATAGTTATACATAGTGTCCTATACAAAAATTAAATAATAAATTATGAGGTTTTACTATATAAATCACAACTTCTACTTCCATTATAGTTAGTGAGACATGAGTTCTAGAGCAGACTCTACATCCAACCTTTGTCTTAGAGAAGATCGAGATATGTCTTGTAATATGACATTTTATGTAACTCTTTAAACACTTTAACTCAAAGAGTTTAAATAGTCATTTTGGTCTCTAGCACTATACTTTTTCTCTGGAAATCATTCCTTATACATTGAACTTTCAAATTTCCTTATCAATTTGTATATTAATATTATCATAACAAAGAGTGGCAACCAGTGACAAACTGACAAATTAAATTGCGTTTACAATAGCAGATTATGCAATAAAAACATGTATGAAAGATAACATTAGAAATTACAATAAAATTTTATGTGATTTAAAAGAAATAAATGAACTCAGTAAGAACGTTTACAAGCACAGGTATGAGATTGTTTACAAGGTAGGAAACAAGATTGGGATACAAGAGTTATAAAATATGACATTCTATAACATGCAAACATACTAGATGGACACCCTCTATGGCTTCAAAGTTCCCGAGTAC
CTCTTGTCTCAACCTGTGACATGAGTCACTTGAGAAAAAATAAAGGAACGAAGGGGTGAGTATAAAATTATACTTAGCAAGCAACTTACTTGTGGGCTCTCATCACATCCATTGCCTAGTTGGTGTCTACAAACTTTTCTCTAGGCTCAAGGTATGCAACCTTTACTTCTCTCGGACTTAAGTAACTATCTTAGCCTAAGTAGTGGTTACTCTCGGGTTGAGACATTACCTCATAAAACTCTTTCACTAATGGAGACATTGTGGTATGTGTTATCTTGCTCTAATCTATGGGAGGTTAGTTGGCAAACCCTACCTACAGGACCATCATGTTATCCAAGGTTGGTGGAATGACCCCTTTGTGCTTGCTAGCAACTAGGACTACCTACATAATCCTACGTCCTCTCGAAGGATGGATCCGTTGTGTAGGGTAGTCCAACTAACTTGTCAGTCACACAACCCTAACAAATTTAGGCTTGACTTAACGGTATTGGTACTAGTATGGAGGAAAGTGGAAGTCTATTTGATACCATACCTACTGCTATAAGCTTGGATACTACCTTAGTCTATCTAGTTTGTTGTGGCAGTACTCTTAGGTTACTAAGGCGGGTTGGTGGCTTTGACATGGACTCTCCTCTCTATGATGATCTGTTCTGGCTCCTTGACCCTTACTTGGTTGAGTGAGTGTAGACCCATTGATCGCCACTCATGATTATAAGGTGCTCAGTCTGTTAGAGTCATTCGATCTCTACGAATCGTCTAGCCAATCTGTTTCTAATACCCTAAGCTTATCTCTTAATAACTAAATTATTTCTATTAGTGAGAATACACATTCATGCTCCTGAAGCTAATGTACAATAAGACATGCAATTCAACCACACTTACCAGGGTTATAACATCCATCAAATAACCTGAGCACATATCACAAGTTTTCATCATCTGTCTGCATATTTATGAACATATTAGTAATAAACCATCATGGACTTACGTGTTCGAAAACTTAAATTCCATCTAAAAATACAGTTATGGACTTACCCATTCTAAAATATCTTTAAAATGACGTAATAATAAACTAAATTAAACAAATAAACATTTAAATTAAAGACATCTCATTCTAACTCAAATCTAAGAAAAATAAATGTGACTACCCTACGTATGTGACATGGTCTCGAGTTGCGATTTCGTCATGAACCATATAGGAATTCCTCGCCATTACTTGAGATAAGAGTAGCATATGGCTTGAGTATTTTAAAACATACTTCGTAAGTGACCCCACTATTAAGGTCAAATGCAAACACATACAATCTCATCAATGAGACCTATCATAATTGTCTATTTTCTTACGACGGCTATCCGACCAGGCAATTGGGATGTGTACTTAATTTTCCTACACACAGTCTACACGTGCGAATACGAACCCACCAATCGGTTCGCACACCTCCTCGGCACCCTTTCTAGTCGAGCAAGCATTCTCCAGGAGTTGTTGCTGCGTAGCACCTGTTAGGAATAGCAACCACCACAGCAACATTATGATAAACATAATGAACGTTCCCCAGACTAGTAGTGTACGCGTCTGTACTATCGTCAAGGAATAGGGGTATCCCCTAATGGATGAACACACAATAATGCATGAGGTCCCAACATTTTTCTATTTTAATTATCATTAATACAACCCTGATAGCATCTCGTACTTATCATATCATTTCTCATATCATTTATCATAATTCTCATGTGTATGCCGGAGTTGTATTTTATACTAGTTCATCGTTTTGCATGTCAATCATAACATATCATTCATGTCACATATCAATCATATCATAACGTTTTTATGCTTTCAGACATAACAATCCATATCGTGCACATATTATATCATATTGTGTATCATAACAATCACATAACATCCCTAACATATCGGTTCACATTATATCATATCATAACCGTGTCAATAATTACATATACCGTACCATAACCTATATCATTTAAATCTCATCATTTTACTATGC
ATATATATTCGACAAATGGGTTAAGACTTCTAATAATAAAAGATATTTATGCATTGTTATTATTTATTCCAAGTTAAAACTTCTAATAGTAATAAAAGGGTTTAAAATGGAGAGATACCATTGCATAAAAATCTCCATTTATTTATAGCAGGTGCCATGAACAAAATTTAAGAGGATGATCTTGCAATTGGAACAGAATCTCTTCTGGTGCTGTCAAATGGGACAAAACCATTAAAACATACCATATTCATTTTGATAATATCCCTTATTGATTCAATGTTTTTAGAACAGTATGAGTTGTTAAATAATTTACATTCCTGCCATAATTATCTTAATAATTTCTTCAACAAACCATGAATTTCCACTATTCCCAACTCTATAAAGAGAAATGGAAGATGAAGCCCAAATGAAACACTGATCACCAAGGAATCATTATTCTTAAAGAGGAAAAGAAATGGAGGCCTTCAATCTTCCTCCTTTACTTCTGTCCTTTCTTTTTGTTCTTTTGCAAACATCCACCATTGCAGCCAAGAAGGTTAGTCATTCATCACTTATAAATCCTTTTTAATGTTTATACCAAATGCGTTTTAGACACTCAAGTTTTTTTCGAAGTTTTTTTCTTAATTGTGTGCAGTCTTATATTGTTTACTTGGGATCACATTCACATGGGTTGAATCCTTCCGCAATTGATCTTCAACGTGCAACACAAACCCACTACAATCTACTTGGATCCATGTTAGGAAGGTGAATCTTTTAGTTTCAAATCAGGTAGTGCAGATTTGTTGAACTCTCACTCCTAATTTACTATCGTATGTTTCTGTTTCCAAGTATAGCAACGAAGCAGCTAAGGAAGCAATCTTTTACTCATACAATAGACATATCAATGCCTTTGCAGCTGTTCTTGATCAGAAAGTTGCAGAAGATATAGCAAGTAAAATGATTCTATCTGTTTTGATAATGATTTTCTTCCTAGTAAAATTATATTTGTTTATTCCTAATTTATTGGGTTTGATAGTTTTATGGAAACATTACAAGACTCATAAAAAAAAGTTCCGAAAACCCTTTTTAGTTTCCAAAATTTTGTAGGTGGGAAGTATTCACAAGCTTAATTTCAAAAGTCAAATAGTAATCAAGGCGATTCTCGTTTCTTTAATTTAATAATGATGATGTTTTTATCCTTTTTCTTTTTTAGAGCATCATGATGTGGTATCAGTACATGAGAACAAAAAACTAAAACTGCACACTACACGATCATGGAACTTTCTTGGAGTTGAGAATGATGGTGGAATTCCTTTAGACTCACTTTGGAATCTTTCAAGATTTGGTGAATCTACAATCATTGGCAACATTGACTCTGGTTGGTCTTACTCATGTTTGAATTGTTTTTTTTTTTTTTTTGAGCTTTTGTAATAATTTTTATATGAAATTGAAATTCATACAGGTGTTTGGCCTGAATCAAAGAGTTTTAGTGACGAAGGATATGGACCTATCCCAACAAGATGGAAGGGAAGTTGTGAAGGTGGCACCCACTTTAGTTGCAATAGGTCCCTAACATTAAATCTCTTATTGTTACTTCATCTCTAATTAATTCTTTTAGAAATATGCACACTTTCAATTGTCTAATATGTTAGGCACAAATTAGACCCTTTCATTAGCACAATAGATAAATAATTGACAATAGTTGTTCAAAGTGAATATTAATCTAAAATTTGTTAGATCTCTATCGAAACATTATTAATGATACTAAAAAGAAATTTATAAATTTTTAGAGCAAAATTAAGTCAAAGTCATTTTCTTGGTTAAATTATAAGTATAGTTCATAAACTATCAAATTTATGTTTAACATATCTTTAAATTTTAAAAATTATATAATAAGGATTTGAACTTAACTCCTTTTGAATTTTCTTTTTTATATACAAATGATAATAGATTTTTCTTTGTCGCGTTTGTATTCTAGCTAATCATTGTATCAAATGCTTGTATACAATTTTAAATGTAGG
AAGCTGATTGGAGTACGATATTTCAACAAAGGTTTTGCATCCGACGTGGGACCTCTCAACTCAAGCTATGAAACAGCAAGGGACGTTGATGGGCATGGAACACACACCTTATCCACGGCTGGAGGCAATTTCGTTAAAGGAGTAAGCATTTTAGGGAATAGTTATGGCACTGCAAAAGGGGGTTCCCCTAAAGCCCTTGTTGCTGCCTATAAAGTATGTTGGTCTACAGATAGAGGTGATGGGTGTTTTATGGCCGATATTCTAGCTAGCTTTGAAGCTGCCATTAGTGATGGAGTTGATGTTCTATCGGTTTCACTCGGTGGAGGTATTCAAGAATTTTCCGACGATCTTATAGCTATAGGGTCCTTCCATGCAGTGAAGAACGGCATCACTGTTGTTTGTTCAGCTGGAAATTCAGGACCAAGTGAAGATACTGTCTTAAATGTCGCACCATGGATGATAACTGTGGGAGCTAGCACAGTTGACAGGCTTTTTACAAGTTACGTGGTGTTGGGAGACAAGAGGCAATTCAAGGTACTCTTTCTAACACTGGTTAGTAGTATATAGTATTGTCCTTATACATATTTAGGTAAGTATGGAACCAGTTTTAAATATAGTAAATTGCATTAGGCATGTGAAATTGTATGTATACGTGTCTATATTTGTATCCATTGAGTTTATGGCTCTCTACTATGTTTATGTTCTCTTCAATGATGTTTCATTATCAATACCAGGGTGAAAGTCTTTCTAGTAAATTATTGCCACCTAAGAAGTTCTATCCGTTGATCCGTGCTTTAGATGCAAAATCCAACAACACCTCTAATCACGAAGCGTGAGTATTTTTTTCTCTAAACTATACAAAATATCCCTAAATTTTAGATACATTTCAATCATGTCTTAAACTTGAATTTTAAATTAGATCCAAAAATTTTCCAACAATAAAAATTAATCATTTTTTTTATATAGTGAGTTATGATCATAATTTTTTCTAATTTGTAGAATATTTGCATCTTTCACAATTTTTGAAAGTTTTAAAATAAAATTTTTCAAAATCGTTTAAGAGTATTTTATATAATTTAATCCCGATAGTTTTACAATAATAATTAGAATAAATTGATTATAGCACATGATACTAAGCAAAATTTATTGTGATCTTGTAGCATACTATGTCGTCAAGGGTCTCTTAATCCTGAAAAGGTAAGAGGAAAAATTGTAGTTTGCCTTAGAGGGGAAAATTCAAGAGGGGAGAAAGGTTACGTGGTTGCTCAAGCAGGTGGTGTTGGGATGATTCTCGCTAATGACAAAGAAAGTGGGGATGAACTTTCGGCTAGTCCTCACTTTCTTCCTGCTTCACATATAAGCTATACCGATGGTGAATCAGTCTACCAATATATCCAATCTACTAAGTAATTACGCCATCTTATTCCATCAACTTCCATATTTATTCACGAAGTATTTTGAACTTTTTGATCTAATAAACATAAAATTAGAAGCATAGGAATCTATTAAATGTATTTTTTAAGTTAAAAAATTTATCAAACTCAAAATTTGTTACTTATATTTGATGTTTGTGGTTGAAACAGCACTCCAATAGCTTACATGACTTCCGTGACTGAGTTGGGAATCAAACCAGCACCAATTATGGCTTCATTCTCTTCGAGAGGTCCCAACATAGTTGAGCCCTCAATACTCAAGGTTGGTTAAAAACGATGTGTTGAGTACTTGTATAGTATAAAGTTCAATGTGTCATCCCCTAACAATCACAAAATTTTAAATAATGCATTTTTTTAAAAAATTCAGCCTGATATAACAGCACCAGGGGTGAATATAATAGCGGCTTTCTCTGAAGCAACATCCATATCTGGTTTACCTTATGATAAGCGTCAAGCTCAATTTATTACATTATCTGGCACTTCCATGTCATGCCCCCATATTTCTGGCATTGTCGGCCTTCTCAAAACGCTTTATCCAAAATGGAGTCCAGCAGCTATCAGATCTGC
AATCATGACGACAGGTTCATTATTCAAAATTCTTCTATCATTTTAAAAGTTAAGGGACTAATTAAACTAATAGTTTATATATAGTAAGAGATGTTATTTATGGTATTTGCATTATTTTGCATTAAGAAACTTTTGATTTAAATCTTGCTGACTAACGGTTTTGTTTGGTAGCCGAAACCGAAGCCAATGACTTAAATCCAATACTAACCTCAGAAAAGGAGAAAGCAAACCCATTGGCATATGGTGCAGGTCATGTCCAACCAAACAAAGCATCAAATCCTGGCCTTGTTTACGACCTCACCACCCAAGACTACTTGAACTTCCTATGTGCCCGTGGCTACAATAAAACACTATTGAAACTATTCACTAATGATACTTCATTTGTTTGTTCAAAGTCATTCAAAGTAACAGATTTAAACTACCCATCAATCTCAATGAATAATCTGAAATCAGAGGCAGTAGAGATCAAAAGAAGAGTAAAAAATGTGGGAAGTCCAGGCATGTATGTTGCTCAAGTCGAGGCACCACCAGGAGTTTCTGTTTCGGTAGACCCGAGTACTTTGAAGTTCACTAAAACTGATGAAGAGAAGGATTTCAAAGTTGTGCTAAGGAGGGTGCCAAATAATCAAACTGAACAGAATGTGTTCGGAAAACTTGTATGGTCTGATGGAAAACATCGTGTTAGTAGCCCGATTTTTGTGATATTAGGGATTTAGTAAAACAAGCACATGAGTTGGATTGAGTTTAGCTCAATCTTTTGAAAATTTATTTTCTGGTTAACAGTTCTTATGTCGCATAATAATGAATTTTTAGTTGTTTTTTTTTAGCTTAATTTATTTAAGTTTTTTTTAGTTTAATTTATTTAAGTTGTTTTTTATTTAAGTTTTCGTCATGTTGTAAGCTACTCCTCTTTTTCAACCCTATTATTATTGAATTTTTGTTTTTTAAGGTCGGCCTCTAGGCCTCTTTGGTTAAGACAGTAAGTTTTATGAAACACCTTGACTCGTAGATGGCTATTTCTGACTAAGGTGACCTCTAATTTCTTCTTAGCTATGTTCTTTCGTTCCTTACTCTCACGACTCCCCGATTTCTTCTTCTCTTTCTTGCCCTCGATTCCAATGGAGGTTTTTGGAAGATATTGATGGTATTTTATTAGTATCTTATACTGGCGATTTTGCGACTGTTTTGCCTTCTTCAACAGAGCTTCATTTCTGTCGAATATCTTGAAGAACGAGTCAACTAACTCCATTATTCTACCTTCTTTCTTCGGTTACTTGATCAAAGCTAATAATATTGTTTTTCAAACTTGGGATGTAATATTCCTTGGTCAATAGAAATAGACATTGATCGTGGTTCTTACACTGGAATCATATAGATCCTTTGCCTTTGATCGGTACAATTATTGGTTTTGACCTAAGATCAAAATATACCAAAACAAATAAATGAAGGAGAAAGAACAACTTGAGGAGAGAGTTTCTTGGACTAAGAACTTTAAACACTATGCTTTTTAGTTCTCTTTTGTCGTTGTGAATATAGCTTACATTTAATGAGTATTTATATATGGGAGAAAACACTAACAAATTTTCTAAACGTTTTCTTCCGGTTAAAACGTCTAATAATTCTCCTAAATAAATTATTAAATATTTTCCAAAATTTAGCTCCACAATTTGTCAACTACTAAAAATTGTTTTACAACTTACAGTTCTTTTACCAAGTCAAGTTTATTGACTCACATAACGATCTAAGGTAGGTGACAACTATAAAAAAAGTATCTTATACAAAATTTAAATAATAAATTACTGAGTTTTTATGACATAAATCCCAACAATCTTCCACGAGCATTCAACATATTTTTCTTAGGGAGGATGAGATGTCTTATTTTTATATAAATCAAAACAATTGTCATTAAATAGTCATTTTGGTCCATACCACTATATTTTCTCTGTCAAAATCATTCCGTATACATTTATCTTTCACCCTTTCCTTGAAATCAATTTGTA
CATTAATATTTTGATAAAGAGCGGCAAAGAGAGACAAATTAAATCAAATTTACTATACTGTCACGACTTGAAATTTGAGCTCGTAATAAAAACATGTATGAAAGATTACTTTGAAAATTAAAATTTTATGCGATTTAACAGAAATAA
mRNA sequence
ATGTATAATCTAAAATCAGAGGTAGTGAAGATTAAAAGAAGAGTGAAGAATGTAGGAAGTCCAGGCACCTATGTTGCTAAAGTCGAGGCACCACCAGGAGTTTTGGTTTCGGTAGACCCAAGTACTTTGAAGTTCACTAAAACGGGTGAAGAGAAGGATTTTCAAGTTGTGTTGAGGAGGGTTCAAAGTAATCAAACTGAAGAAAAGCATGTGTTCGAAAAACTTGTGTGGCCTGATGGAAAACATCGTTCTTATATTGTTTACTTGGGATCACATTCACATGGGTTGAATCCTTCCGCAATTGATCTTCAACGTGCAACACAAACCCACTACAATCTACTTGGATCCATGTTAGGAAGCAACGAAGCAGCTAAGGAAGCAATCTTTTACTCATACAATAGACATATCAATGCCTTTGCAGCTGTTCTTGATCAGAAAGTTGCAGAAGATATAGCAATACATGAGAACAAAAAACTAAAACTGCACACTACACGATCATGGAACTTTCTTGGAGTTGAGAATGATGGTGGAATTCCTTTAGACTCACTTTGGAATCTTTCAAGATTTGGTGAATCTACAATCATTGGCAACATTGACTCTGGTGTTTGGCCTGAATCAAAGAGTTTTAGTGACGAAGGATATGGACCTATCCCAACAAGATGGAAGGGAAGTTGTGAAGGTGGCACCCACTTTAGTTGCAATAGGAAGCTGATTGGAGTACGATATTTCAACAAAGGTTTTGCATCCGACGTGGGACCTCTCAACTCAAGCTATGAAACAGCAAGGGACGTTGATGGGCATGGAACACACACCTTATCCACGGCTGGAGGCAATTTCGTTAAAGGAGTAAGCATTTTAGGGAATAGTTATGGCACTGCAAAAGGGGGTTCCCCTAAAGCCCTTGTTGCTGCCTATAAAGTATGTTGGTCTACAGATAGAGGTGATGGGTGTTTTATGGCCGATATTCTAGCTAGCTTTGAAGCTGCCATTAGTGATGGAGTTGATGTTCTATCGGTTTCACTCGGTGGAGGTATTCAAGAATTTTCCGACGATCTTATAGCTATAGGGTCCTTCCATGCAGTGAAGAACGGCATCACTGTTGTTTGTTCAGCTGGAAATTCAGGACCAAGTGAAGATACTGTCTTAAATGTCGCACCATGGATGATAACTGTGGGAGCTAGCACAGTTGACAGGCTTTTTACAAGTTACGTGGGTGAAAGTCTTTCTAGTAAATTATTGCCACCTAAGAAGTTCTATCCGTTGATCCGTGCTTTAGATGCAAAATCCAACAACACCTCTAATCACGAAGCCATACTATGTCGTCAAGGGTCTCTTAATCCTGAAAAGGTAAGAGGAAAAATTGTAGTTTGCCTTAGAGGGGAAAATTCAAGAGGGGAGAAAGGTTACGTGGTTGCTCAAGCAGGTGGTGTTGGGATGATTCTCGCTAATGACAAAGAAAGTGGGGATGAACTTTCGGCTAGTCCTCACTTTCTTCCTGCTTCACATATAAGCTATACCGATGCTTACATGACTTCCGTGACTGAGTTGGGAATCAAACCAGCACCAATTATGGCTTCATTCTCTTCGAGAGGTCCCAACATAGTTGAGCCCTCAATACTCAAGCCTGATATAACAGCACCAGGGGTGAATATAATAGCGGCTTTCTCTGAAGCAACATCCATATCTGGTTTACCTTATGATAAGCGTCAAGCTCAATTTATTACATTATCTGGCACTTCCATGTCATGCCCCCATATTTCTGGCATTGTCGGCCTTCTCAAAACGCTTTATCCAAAATGGAGTCCAGCAGCTATCAGATCTGCAATCATGACGACAGCCGAAACCGAAGCCAATGACTTAAATCCAATACTAACCTCAGAAAAGGAGAAAGCAAACCCATTGGCATATGGTGCAGGTCATGTCCAACCAAACAAAGCATCAAATCCTGGCCTTGTTTACGACCTCACCACCCAAGACTACTTGAACTTCCTATGTGCCCG
TGGCTACAATAAAACACTATTGAAACTATTCACTAATGATACTTCATTTGTTTGTTCAAAGTCATTCAAAGTAACAGATTTAAACTACCCATCAATCTCAATGAATAATCTGAAATCAGAGGCAGTAGAGATCAAAAGAAGAGTAAAAAATGTGGGAAGTCCAGGCATGTATGTTGCTCAAGTCGAGGCACCACCAGGAGTTTCTGTTTCGGTAGACCCGAGTACTTTGAAGTTCACTAAAACTGATGAAGAGAAGGATTTCAAAGTTGTGCTAAGGAGGGTGCCAAATAATCAAACTGAACAGAATGTGTTCGGAAAACTTAAATAA
Coding sequence (CDS)
ATGTATAATCTAAAATCAGAGGTAGTGAAGATTAAAAGAAGAGTGAAGAATGTAGGAAGTCCAGGCACCTATGTTGCTAAAGTCGAGGCACCACCAGGAGTTTTGGTTTCGGTAGACCCAAGTACTTTGAAGTTCACTAAAACGGGTGAAGAGAAGGATTTTCAAGTTGTGTTGAGGAGGGTTCAAAGTAATCAAACTGAAGAAAAGCATGTGTTCGAAAAACTTGTGTGGCCTGATGGAAAACATCGTTCTTATATTGTTTACTTGGGATCACATTCACATGGGTTGAATCCTTCCGCAATTGATCTTCAACGTGCAACACAAACCCACTACAATCTACTTGGATCCATGTTAGGAAGCAACGAAGCAGCTAAGGAAGCAATCTTTTACTCATACAATAGACATATCAATGCCTTTGCAGCTGTTCTTGATCAGAAAGTTGCAGAAGATATAGCAATACATGAGAACAAAAAACTAAAACTGCACACTACACGATCATGGAACTTTCTTGGAGTTGAGAATGATGGTGGAATTCCTTTAGACTCACTTTGGAATCTTTCAAGATTTGGTGAATCTACAATCATTGGCAACATTGACTCTGGTGTTTGGCCTGAATCAAAGAGTTTTAGTGACGAAGGATATGGACCTATCCCAACAAGATGGAAGGGAAGTTGTGAAGGTGGCACCCACTTTAGTTGCAATAGGAAGCTGATTGGAGTACGATATTTCAACAAAGGTTTTGCATCCGACGTGGGACCTCTCAACTCAAGCTATGAAACAGCAAGGGACGTTGATGGGCATGGAACACACACCTTATCCACGGCTGGAGGCAATTTCGTTAAAGGAGTAAGCATTTTAGGGAATAGTTATGGCACTGCAAAAGGGGGTTCCCCTAAAGCCCTTGTTGCTGCCTATAAAGTATGTTGGTCTACAGATAGAGGTGATGGGTGTTTTATGGCCGATATTCTAGCTAGCTTTGAAGCTGCCATTAGTGATGGAGTTGATGTTCTATCGGTTTCACTCGGTGGAGGTATTCAAGAATTTTCCGACGATCTTATAGCTATAGGGTCCTTCCATGCAGTGAAGAACGGCATCACTGTTGTTTGTTCAGCTGGAAATTCAGGACCAAGTGAAGATACTGTCTTAAATGTCGCACCATGGATGATAACTGTGGGAGCTAGCACAGTTGACAGGCTTTTTACAAGTTACGTGGGTGAAAGTCTTTCTAGTAAATTATTGCCACCTAAGAAGTTCTATCCGTTGATCCGTGCTTTAGATGCAAAATCCAACAACACCTCTAATCACGAAGCCATACTATGTCGTCAAGGGTCTCTTAATCCTGAAAAGGTAAGAGGAAAAATTGTAGTTTGCCTTAGAGGGGAAAATTCAAGAGGGGAGAAAGGTTACGTGGTTGCTCAAGCAGGTGGTGTTGGGATGATTCTCGCTAATGACAAAGAAAGTGGGGATGAACTTTCGGCTAGTCCTCACTTTCTTCCTGCTTCACATATAAGCTATACCGATGCTTACATGACTTCCGTGACTGAGTTGGGAATCAAACCAGCACCAATTATGGCTTCATTCTCTTCGAGAGGTCCCAACATAGTTGAGCCCTCAATACTCAAGCCTGATATAACAGCACCAGGGGTGAATATAATAGCGGCTTTCTCTGAAGCAACATCCATATCTGGTTTACCTTATGATAAGCGTCAAGCTCAATTTATTACATTATCTGGCACTTCCATGTCATGCCCCCATATTTCTGGCATTGTCGGCCTTCTCAAAACGCTTTATCCAAAATGGAGTCCAGCAGCTATCAGATCTGCAATCATGACGACAGCCGAAACCGAAGCCAATGACTTAAATCCAATACTAACCTCAGAAAAGGAGAAAGCAAACCCATTGGCATATGGTGCAGGTCATGTCCAACCAAACAAAGCATCAAATCCTGGCCTTGTTTACGACCTCACCACCCAAGACTACTTGAACTTCCTATGTGCCCG
TGGCTACAATAAAACACTATTGAAACTATTCACTAATGATACTTCATTTGTTTGTTCAAAGTCATTCAAAGTAACAGATTTAAACTACCCATCAATCTCAATGAATAATCTGAAATCAGAGGCAGTAGAGATCAAAAGAAGAGTAAAAAATGTGGGAAGTCCAGGCATGTATGTTGCTCAAGTCGAGGCACCACCAGGAGTTTCTGTTTCGGTAGACCCGAGTACTTTGAAGTTCACTAAAACTGATGAAGAGAAGGATTTCAAAGTTGTGCTAAGGAGGGTGCCAAATAATCAAACTGAACAGAATGTGTTCGGAAAACTTAAATAA
Protein sequence
MYNLKSEVVKIKRRVKNVGSPGTYVAKVEAPPGVLVSVDPSTLKFTKTGEEKDFQVVLRRVQSNQTEEKHVFEKLVWPDGKHRSYIVYLGSHSHGLNPSAIDLQRATQTHYNLLGSMLGSNEAAKEAIFYSYNRHINAFAAVLDQKVAEDIAIHENKKLKLHTTRSWNFLGVENDGGIPLDSLWNLSRFGESTIIGNIDSGVWPESKSFSDEGYGPIPTRWKGSCEGGTHFSCNRKLIGVRYFNKGFASDVGPLNSSYETARDVDGHGTHTLSTAGGNFVKGVSILGNSYGTAKGGSPKALVAAYKVCWSTDRGDGCFMADILASFEAAISDGVDVLSVSLGGGIQEFSDDLIAIGSFHAVKNGITVVCSAGNSGPSEDTVLNVAPWMITVGASTVDRLFTSYVGESLSSKLLPPKKFYPLIRALDAKSNNTSNHEAILCRQGSLNPEKVRGKIVVCLRGENSRGEKGYVVAQAGGVGMILANDKESGDELSASPHFLPASHISYTDAYMTSVTELGIKPAPIMASFSSRGPNIVEPSILKPDITAPGVNIIAAFSEATSISGLPYDKRQAQFITLSGTSMSCPHISGIVGLLKTLYPKWSPAAIRSAIMTTAETEANDLNPILTSEKEKANPLAYGAGHVQPNKASNPGLVYDLTTQDYLNFLCARGYNKTLLKLFTNDTSFVCSKSFKVTDLNYPSISMNNLKSEAVEIKRRVKNVGSPGMYVAQVEAPPGVSVSVDPSTLKFTKTDEEKDFKVVLRRVPNNQTEQNVFGKLK
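The protein sequence above should correspond to a frame-0 translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the database used this table to derive the protein sequence.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate sequences wrapped over lines
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an ambiguous codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTATAATCTAAAATCAGAGGTAGTGAAG"))  # first 30 nt -> MYNLKSEVVK
print(translate("AAACTTAAATAA"))                    # last 12 nt  -> KLK
```

The two outputs match the first ten and last three residues of the protein sequence listed above. Passing the full CDS (both wrapped lines concatenated) would allow the whole protein to be compared in the same way.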
Homology
BLAST of CmaCh16G010020 vs. ExPASy Swiss-Prot
Match:
Q9ZSP5 (Subtilisin-like protease SBT5.3 OS=Arabidopsis thaliana OX=3702 GN=AIR3 PE=2 SV=1)
HSP 1 Score: 783.1 bits (2021), Expect = 2.9e-225
Identity = 404/739 (54.67%), Positives = 510/739 (69.01%), Query Frame = 0
Query: 70 HVFEKLVWPDGKHRSYIVYLGSHSHGLNPSAIDLQRATQTHYNLLGSMLGSNEAAKEAIF 129
H+ K + SY+VY G+HSH + + R +THY+ LGS GS E A +AIF
Sbjct: 17 HMSSKHILASKDSSSYVVYFGAHSHVGEITEDAMDRVKETHYDFLGSFTGSRERATDAIF 76
Query: 130 YSYNRHINAFAAVLDQKVAEDIAIH-------ENKKLKLHTTRSWNFLGVENDGGIPLDS 189
YSY +HIN FAA LD +A +I+ H NK LKLHTTRSW+FLG+E++ +P S
Sbjct: 77 YSYTKHINGFAAHLDHDLAYEISKHPEVVSVFPNKALKLHTTRSWDFLGLEHNSYVPSSS 136
Query: 190 LWNLSRFGESTIIGNIDSGVWPESKSFSDEGYGPIPTRWKGSCEG--GTHFSCNRKLIGV 249
+W +RFGE TII N+D+GVWPESKSF DEG GPIP+RWKG C+ F CNRKLIG
Sbjct: 137 IWRKARFGEDTIIANLDTGVWPESKSFRDEGLGPIPSRWKGICQNQKDATFHCNRKLIGA 196
Query: 250 RYFNKGFASDVGPLNSSYETARDVDGHGTHTLSTAGGNFVKGVSILGNSYGTAKGGSPKA 309
RYFNKG+A+ VG LNSS+++ RD+DGHG+HTLSTA G+FV GVSI G GTAKGGSP+A
Sbjct: 197 RYFNKGYAAAVGHLNSSFDSPRDLDGHGSHTLSTAAGDFVPGVSIFGQGNGTAKGGSPRA 256
Query: 310 LVAAYKVCWSTDRGDGCFMADILASFEAAISDGVDVLSVSLGGGIQEFSDDLIAIGSFHA 369
VAAYKVCW +G+ C+ AD+LA+F+AAI DG DV+SVSLGG F +D +AIGSFHA
Sbjct: 257 RVAAYKVCWPPVKGNECYDADVLAAFDAAIHDGADVISVSLGGEPTSFFNDSVAIGSFHA 316
Query: 370 VKNGITVVCSAGNSGPSEDTVLNVAPWMITVGASTVDRLFTS---------YVGESLSSK 429
K I VVCSAGNSGP++ TV NVAPW ITVGAST+DR F S Y G+SLSS
Sbjct: 317 AKKRIVVVCSAGNSGPADSTVSNVAPWQITVGASTMDREFASNLVLGNGKHYKGQSLSST 376
Query: 430 LLPPKKFYPLIRALDAKSNNTSNHEAILCRQGSLNPEKVRGKIVVCLRGENSRGEKGYVV 489
LP KFYP++ +++AK+ N S +A LC+ GSL+P K +GKI+VCLRG+N R EKG V
Sbjct: 377 ALPHAKFYPIMASVNAKAKNASALDAQLCKLGSLDPIKTKGKILVCLRGQNGRVEKGRAV 436
Query: 490 AQAGGVGMILANDKESGDELSASPHFLPASHISYTDAYMT----------------SVTE 549
A GG+GM+L N +G++L A PH LPA+ ++ D++ S T+
Sbjct: 437 ALGGGIGMVLENTYVTGNDLLADPHVLPATQLTSKDSFAVSRYISQTKKPIAHITPSRTD 496
Query: 550 LGIKPAPIMASFSSRGPNIVEPSILKPDITAPGVNIIAAFSEATSISGLPYDKRQAQFIT 609
LG+KPAP+MASFSS+GP+IV P ILKPDITAPGV++IAA++ A S + +D R+ F
Sbjct: 497 LGLKPAPVMASFSSKGPSIVAPQILKPDITAPGVSVIAAYTGAVSPTNEQFDPRRLLFNA 556
Query: 610 LSGTSMSCPHISGIVGLLKTLYPKWSPAAIRSAIMTTAETEANDLNPILTSEKEKANPLA 669
+SGTSMSCPHISGI GLLKT YP WSPAAIRSAIMTTA + PI + KA P +
Sbjct: 557 ISGTSMSCPHISGIAGLLKTRYPSWSPAAIRSAIMTTATIMDDIPGPIQNATNMKATPFS 616
Query: 670 YGAGHVQPNKASNPGLVYDLTTQDYLNFLCARGYNKTLLKLFTNDTSFVCSKSFKVTDLN 729
+GAGHVQPN A NPGLVYDL +DYLNFLC+ GYN + + +F+ + S + +LN
Sbjct: 617 FGAGHVQPNLAVNPGLVYDLGIKDYLNFLCSLGYNASQISVFSGNNFTCSSPKISLVNLN 676
Query: 730 YPSISMNNLKSEAVEIKRRVKNVGSPGMYVAQVEAPPGVSVSVDPSTLKFTKTDEEKDFK 775
YPSI++ NL S V + R VKNVG P MY +V P GV V+V P++L FTK E+K FK
Sbjct: 677 YPSITVPNLTSSKVTVSRTVKNVGRPSMYTVKVNNPQGVYVAVKPTSLNFTKVGEQKTFK 736
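The percentages reported for each HSP are the match counts divided by the alignment length (gaps included). As a small sketch, the summary line of the hit above can be re-derived as follows. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches fragments such as "404/739 (54.67%)" in a BLAST HSP summary line.
FRACTION = re.compile(r"(\d+)/(\d+) \((\d+\.\d+)%\)")


def hsp_percentages(summary: str) -> list[tuple[str, str]]:
    """Return (reported, recomputed) percentage pairs for each fraction found."""
    pairs = []
    for matches, length, reported in FRACTION.findall(summary):
        recomputed = f"{100 * int(matches) / int(length):.2f}"
        pairs.append((reported, recomputed))
    return pairs


line = "Identity = 404/739 (54.67%), Positives = 510/739 (69.01%)"
print(hsp_percentages(line))
# [('54.67', '54.67'), ('69.01', '69.01')]
```

Here 739 is the number of aligned columns in the HSP, not the length of either protein, which is why the identity is lower than a count over ungapped positions alone would suggest.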
BLAST of CmaCh16G010020 vs. ExPASy Swiss-Prot
Match:
F4JXC5 (Subtilisin-like protease SBT5.4 OS=Arabidopsis thaliana OX=3702 GN=SBT5.4 PE=1 SV=1)
HSP 1 Score: 775.8 bits (2002), Expect = 4.6e-223
Identity = 411/724 (56.77%), Positives = 503/724 (69.48%), Query Frame = 0
Query: 83 RSYIVYLGSHSHGLNPSAIDLQRATQTHYNLLGSMLGSNEAAKEAIFYSYNRHINAFAAV 142
+SYIVYLGSH+H S+ L +H L S +GS+E AKEAIFYSY RHIN FAA+
Sbjct: 40 KSYIVYLGSHAHLPQISSAHLDGVAHSHRTFLASFVGSHENAKEAIFYSYKRHINGFAAI 99
Query: 143 LDQKVAEDIAIH-------ENKKLKLHTTRSWNFLGVENDGGIPLDSLWNLSRFGESTII 202
LD+ A +IA H NK KLHTT SWNF+ + +G + SLWN + +GE TII
Sbjct: 100 LDENEAAEIAKHPDVVSVFPNKGRKLHTTHSWNFMLLAKNGVVHKSSLWNKAGYGEDTII 159
Query: 203 GNIDSGVWPESKSFSDEGYGPIPTRWKGSCEGGTHFSCNRKLIGVRYFNKGFASDVG-PL 262
N+D+GVWPESKSFSDEGYG +P RWKG C CNRKLIG RYFNKG+ + G P
Sbjct: 160 ANLDTGVWPESKSFSDEGYGAVPARWKGRCH--KDVPCNRKLIGARYFNKGYLAYTGLPS 219
Query: 263 NSSYETARDVDGHGTHTLSTAGGNFVKGVSILGNSYGTAKGGSPKALVAAYKVCWSTDRG 322
N+SYET RD DGHG+HTLSTA GNFV G ++ G GTA GGSPKA VAAYKVCW G
Sbjct: 220 NASYETCRDHDGHGSHTLSTAAGNFVPGANVFGIGNGTASGGSPKARVAAYKVCWPPVDG 279
Query: 323 DGCFMADILASFEAAISDGVDVLSVSLGGGIQEFSDDLIAIGSFHAVKNGITVVCSAGNS 382
CF ADILA+ EAAI DGVDVLS S+GG ++ D IAIGSFHAVKNG+TVVCSAGNS
Sbjct: 280 AECFDADILAAIEAAIEDGVDVLSASVGGDAGDYMSDGIAIGSFHAVKNGVTVVCSAGNS 339
Query: 383 GPSEDTVLNVAPWMITVGASTVDRLFTSYV----GESLS----SKLLPPKKFYPLIRALD 442
GP TV NVAPW+ITVGAS++DR F ++V G+S SK LP +K Y LI A D
Sbjct: 340 GPKSGTVSNVAPWVITVGASSMDREFQAFVELKNGQSFKGTSLSKPLPEEKMYSLISAAD 399
Query: 443 AKSNNTSNHEAILCRQGSLNPEKVRGKIVVCLRGENSRGEKGYVVAQAGGVGMILANDKE 502
A N + +A+LC++GSL+P+KV+GKI+VCLRG+N+R +KG A AG GM+L NDK
Sbjct: 400 ANVANGNVTDALLCKKGSLDPKKVKGKILVCLRGDNARVDKGMQAAAAGAAGMVLCNDKA 459
Query: 503 SGDELSASPHFLPASHISYTD-----AYMTSVTE-----------LGIKPAPIMASFSSR 562
SG+E+ + H LPAS I Y D +Y++S + L KPAP MASFSSR
Sbjct: 460 SGNEIISDAHVLPASQIDYKDGETLFSYLSSTKDPKGYIKAPTATLNTKPAPFMASFSSR 519
Query: 563 GPNIVEPSILKPDITAPGVNIIAAFSEATSISGLPYDKRQAQFITLSGTSMSCPHISGIV 622
GPN + P ILKPDITAPGVNIIAAF+EAT + L D R+ F T SGTSMSCPHISG+V
Sbjct: 520 GPNTITPGILKPDITAPGVNIIAAFTEATGPTDLDSDNRRTPFNTESGTSMSCPHISGVV 579
Query: 623 GLLKTLYPKWSPAAIRSAIMTTAETEANDLNPILTSEKEKANPLAYGAGHVQPNKASNPG 682
GLLKTL+P WSPAAIRSAIMTT+ T N P++ +KANP +YG+GHVQPNKA++PG
Sbjct: 580 GLLKTLHPHWSPAAIRSAIMTTSRTRNNRRKPMVDESFKKANPFSYGSGHVQPNKAAHPG 639
Query: 683 LVYDLTTQDYLNFLCARGYNKTLLKLFTNDTSFVCSKSFKVTDLNYPSISMNNLKSEAVE 742
LVYDLTT DYL+FLCA GYN T+++LF D + C + + D NYPSI++ NL + ++
Sbjct: 640 LVYDLTTGDYLDFLCAVGYNNTVVQLFAEDPQYTCRQGANLLDFNYPSITVPNL-TGSIT 699
Query: 743 IKRRVKNVGSPGMYVAQVEAPPGVSVSVDPSTLKFTKTDEEKDFKVVLRRVPNNQTEQNV 775
+ R++KNVG P Y A+ P GV VSV+P L F KT E K F++ LR +P + V
Sbjct: 700 VTRKLKNVGPPATYNARFREPLGVRVSVEPKQLTFNKTGEVKIFQMTLRPLPVTPSGY-V 759
BLAST of CmaCh16G010020 vs. ExPASy Swiss-Prot
Match:
I1N462 (Subtilisin-like protease Glyma18g48580 OS=Glycine max OX=3847 GN=Glyma18g48580 PE=1 SV=3)
HSP 1 Score: 666.0 bits (1717), Expect = 5.2e-190
Identity = 374/751 (49.80%), Positives = 475/751 (63.25%), Query Frame = 0
Query: 79 DGKHRSYIVYLGSHSHGLNPSAIDLQRATQTHYNLLGSMLGSNEAAKEAIFYSYNRHINA 138
+G + YIVY+G+HSHG +P++ DL+ AT +HY+LLGS+ GS E AKEAI YSYNRHIN
Sbjct: 26 NGSKKCYIVYMGAHSHGPSPTSADLELATDSHYDLLGSIFGSREKAKEAIIYSYNRHING 85
Query: 139 FAAVLDQKVAEDIAIHEN-------KKLKLHTTRSWNFLGVENDGGIPLDSLWNLSRFGE 198
FAA+L+++ A DIA + N K+ KLHTTRSW FLG+ G +S W RFGE
Sbjct: 86 FAALLEEEEAADIAKNPNVVSVFLSKEHKLHTTRSWEFLGLHRRG---QNSAWQKGRFGE 145
Query: 199 STIIGNIDSGVWPESKSFSDEGYGPIPTRWKGS-CE-----GGTHFSCNRKLIGVRYFNK 258
+TIIGNID+GVWPES+SFSD+GYG +P++W+G C+ G +CNRKLIG RY+NK
Sbjct: 146 NTIIGNIDTGVWPESQSFSDKGYGTVPSKWRGGLCQINKLPGSMKNTCNRKLIGARYYNK 205
Query: 259 GFASDVGPLNSSYETARDVDGHGTHTLSTAGGNFVKGVSILGNSYGTAKGGSPKALVAAY 318
F + G L+ TARD GHGTHTLSTAGGNFV G + GTAKGGSP+A VAAY
Sbjct: 206 AFEAHNGQLDPLLHTARDFVGHGTHTLSTAGGNFVPGARVFAVGNGTAKGGSPRARVAAY 265
Query: 319 KVCWSTDRGDGCFMADILASFEAAISDGVDVLSVSLGGG----IQEFSDDLIAIGSFHAV 378
KVCWS C+ AD+LA+ + AI DGVDV++VS G + D I+IG+FHA+
Sbjct: 266 KVCWSLTDPASCYGADVLAAIDQAIDDGVDVINVSFGVSYVVTAEGIFTDEISIGAFHAI 325
Query: 379 KNGITVVCSAGNSGPSEDTVLNVAPWMITVGASTVDRLFTSYV--------GESLSSKLL 438
I +V SAGN GP+ TV NVAPW+ T+ AST+DR F+S + G SL L
Sbjct: 326 SKNILLVASAGNDGPTPGTVANVAPWVFTIAASTLDRDFSSNLTINNQLIEGASLFVN-L 385
Query: 439 PPKKFYPLIRALDAKSNNTSNHEAILCRQGSLNPEKVRGKIVVCLR-GENSRGEKGYVVA 498
PP + + LI + DAK N + +A LCR+G+L+ KV GKIV+C R G+ +G
Sbjct: 386 PPNQAFSLILSTDAKLANATFRDAQLCRRGTLDRTKVNGKIVLCTREGKIKSVAEGLEAL 445
Query: 499 QAGGVGMILANDKESGDELSASPHFL------PASHISYTDAYMTSV------------- 558
AG GMIL N ++G LSA PH P S T+
Sbjct: 446 TAGARGMILNNQMQNGKTLSAEPHVFSTVNTPPRRAKSRPHGVKTTAIGDEDDPLKTGDT 505
Query: 559 -------TELGIKPAPIMASFSSRGPNIVEPSILKPDITAPGVNIIAAFSEATSISGLPY 618
T G KPAP+MASFSSRGPN ++PSILKPD+TAPGVNI+AA+SE S S L
Sbjct: 506 IKMSRARTLFGRKPAPVMASFSSRGPNKIQPSILKPDVTAPGVNILAAYSEFASASSLLV 565
Query: 619 DKRQA-QFITLSGTSMSCPHISGIVGLLKTLYPKWSPAAIRSAIMTTAETEANDLNPILT 678
D R+ +F L GTSMSCPH SGI GLLKT +P WSPAAI+SAIMTTA T N PI
Sbjct: 566 DNRRGFKFNVLQGTSMSCPHASGIAGLLKTRHPSWSPAAIKSAIMTTATTLDNTNRPIQD 625
Query: 679 S-EKEKANPLAYGAGHVQPNKASNPGLVYDLTTQDYLNFLCARGYNKTLLKLFTNDTSFV 738
+ +K A+ AYG+GHV+P+ A PGLVYDL+ DYLNFLCA GY++ L+ + +F+
Sbjct: 626 AFDKTLADAFAYGSGHVRPDLAIEPGLVYDLSLTDYLNFLCASGYDQQLISALNFNRTFI 685
Query: 739 CSKSFKVTDLNYPSISMNNLKSEAVEIKRRVKNVGSPGMYVAQVEAPPGVSVSVDPSTLK 776
CS S V DLNYPSI++ NL+ + V I R V NVG P Y +P G S++V P +L
Sbjct: 686 CSGSHSVNDLNYPSITLPNLRLKPVTIARTVTNVGPPSTYTVSTRSPNGYSIAVVPPSLT 745
BLAST of CmaCh16G010020 vs. ExPASy Swiss-Prot
Match:
O65351 (Subtilisin-like protease SBT1.7 OS=Arabidopsis thaliana OX=3702 GN=SBT1.7 PE=1 SV=1)
HSP 1 Score: 561.6 bits (1446), Expect = 1.4e-158
Identity = 327/733 (44.61%), Positives = 453/733 (61.80%), Query Frame = 0
Query: 84 SYIVYLGSHSHGLNPSAIDLQRATQTHYNLLGSMLGSNEAAKEAIFYSYNRHINAFAAVL 143
+YIV++ PS+ DL H N S L S + E + Y+Y I+ F+ L
Sbjct: 31 TYIVHMAKSQ---MPSSFDL------HSNWYDSSLRSISDSAE-LLYTYENAIHGFSTRL 90
Query: 144 DQKVAED-------IAIHENKKLKLHTTRSWNFLGVENDGGIPLDSLWNLSRFGESTIIG 203
Q+ A+ I++ + +LHTTR+ FLG++ L+ + ++G
Sbjct: 91 TQEEADSLMTQPGVISVLPEHRYELHTTRTPLFLGLDEHTA----DLFPEAGSYSDVVVG 150
Query: 204 NIDSGVWPESKSFSDEGYGPIPTRWKGSCEGGTHFS---CNRKLIGVRYFNKGFASDVGP 263
+D+GVWPESKS+SDEG+GPIP+ WKG CE GT+F+ CNRKLIG R+F +G+ S +GP
Sbjct: 151 VLDTGVWPESKSYSDEGFGPIPSSWKGGCEAGTNFTASLCNRKLIGARFFARGYESTMGP 210
Query: 264 LNSSYE--TARDVDGHGTHTLSTAGGNFVKGVSILGNSYGTAKGGSPKALVAAYKVCWST 323
++ S E + RD DGHGTHT STA G+ V+G S+LG + GTA+G +P+A VA YKVCW
Sbjct: 211 IDESKESRSPRDDDGHGTHTSSTAAGSVVEGASLLGYASGTARGMAPRARVAVYKVCWL- 270
Query: 324 DRGDGCFMADILASFEAAISDGVDVLSVSLGGGIQEFSDDLIAIGSFHAVKNGITVVCSA 383
GCF +DILA+ + AI+D V+VLS+SLGGG+ ++ D +AIG+F A++ GI V CSA
Sbjct: 271 ---GGCFSSDILAAIDKAIADNVNVLSMSLGGGMSDYYRDGVAIGAFAAMERGILVSCSA 330
Query: 384 GNSGPSEDTVLNVAPWMITVGASTVDRLF---------TSYVGESLSSKLLPPKKFYPLI 443
GN+GPS ++ NVAPW+ TVGA T+DR F ++ G SL P K P I
Sbjct: 331 GNAGPSSSSLSNVAPWITTVGAGTLDRDFPALAILGNGKNFTGVSLFKGEALPDKLLPFI 390
Query: 444 RALDAKSNNTSNHEAILCRQGSLNPEKVRGKIVVCLRGENSRGEKGYVVAQAGGVGMILA 503
A +A + N LC G+L PEKV+GKIV+C RG N+R +KG VV AGGVGMILA
Sbjct: 391 YAGNASNATNGN----LCMTGTLIPEKVKGKIVMCDRGINARVQKGDVVKAAGGVGMILA 450
Query: 504 NDKESGDELSASPHFLPAS-----------HISYTDAYMTSV-----TELGIKPAPIMAS 563
N +G+EL A H LPA+ H TD T+ T +G+KP+P++A+
Sbjct: 451 NTAANGEELVADAHLLPATTVGEKAGDIIRHYVTTDPNPTASISILGTVVGVKPSPVVAA 510
Query: 564 FSSRGPNIVEPSILKPDITAPGVNIIAAFSEATSISGLPYDKRQAQFITLSGTSMSCPHI 623
FSSRGPN + P+ILKPD+ APGVNI+AA++ A +GL D R+ +F +SGTSMSCPH+
Sbjct: 511 FSSRGPNSITPNILKPDLIAPGVNILAAWTGAAGPTGLASDSRRVEFNIISGTSMSCPHV 570
Query: 624 SGIVGLLKTLYPKWSPAAIRSAIMTTAETEANDLNPILTSEKEK-ANPLAYGAGHVQPNK 683
SG+ LLK+++P+WSPAAIRSA+MTTA D P+L K + P +GAGHV P
Sbjct: 571 SGLAALLKSVHPEWSPAAIRSALMTTAYKTYKDGKPLLDIATGKPSTPFDHGAGHVSPTT 630
Query: 684 ASNPGLVYDLTTQDYLNFLCARGYNKTLLKLFTNDTSFVC--SKSFKVTDLNYPSISMNN 743
A+NPGL+YDLTT+DYL FLCA Y ++ + ++ C SKS+ V DLNYPS ++N
Sbjct: 631 ATNPGLIYDLTTEDYLGFLCALNYTSPQIRSVSR-RNYTCDPSKSYSVADLNYPSFAVNV 690
Query: 744 LKSEAVEIKRRVKNVGSPGMYVAQVEA-PPGVSVSVDPSTLKFTKTDEEKDFKVVLRRVP 776
A + R V +VG G Y +V + GV +SV+P+ L F + +E+K + V
Sbjct: 691 DGVGAYKYTRTVTSVGGAGTYSVKVTSETTGVKISVEPAVLNFKEANEKKSYTVTFTVDS 740
BLAST of CmaCh16G010020 vs. ExPASy Swiss-Prot
Match:
Q9ZUF6 (Subtilisin-like protease SBT1.8 OS=Arabidopsis thaliana OX=3702 GN=SBT1.8 PE=1 SV=1)
HSP 1 Score: 517.7 bits (1332), Expect = 2.3e-145
Identity = 303/684 (44.30%), Positives = 410/684 (59.94%), Query Frame = 0
Query: 109 THYNLLGSMLGSNEAAKEAIFYSYNRHINAFAAVLDQKVAEDIA--------IHENKKLK 168
TH++ S L S + ++ Y+Y + F+A LD A+ + I E+
Sbjct: 45 THHDWYTSQLNS----ESSLLYTYTTSFHGFSAYLDSTEADSLLSSSNSILDIFEDPLYT 104
Query: 169 LHTTRSWNFLGVENDGGIPLDSLWNLSRFGESTIIGNIDSGVWPESKSFSDEGYGPIPTR 228
LHTTR+ FLG+ ++ G+ +L IIG +D+GVWPES+SF D IP++
Sbjct: 105 LHTTRTPEFLGLNSEFGV-----HDLGSSSNGVIIGVLDTGVWPESRSFDDTDMPEIPSK 164
Query: 229 WKGSCEGGTHFS---CNRKLIGVRYFNKGF-ASDVGPLNSSYETA--RDVDGHGTHTLST 288
WKG CE G+ F CN+KLIG R F+KGF + G +S E+ RDVDGHGTHT +T
Sbjct: 165 WKGECESGSDFDSKLCNKKLIGARSFSKGFQMASGGGFSSKRESVSPRDVDGHGTHTSTT 224
Query: 289 AGGNFVKGVSILGNSYGTAKGGSPKALVAAYKVCWSTDRGDGCFMADILASFEAAISDGV 348
A G+ V+ S LG + GTA+G + +A VA YKVCWST GCF +DILA+ + AI DGV
Sbjct: 225 AAGSAVRNASFLGYAAGTARGMATRARVATYKVCWST----GCFGSDILAAMDRAILDGV 284
Query: 349 DVLSVSLGGGIQEFSDDLIAIGSFHAVKNGITVVCSAGNSGPSEDTVLNVAPWMITVGAS 408
DVLS+SLGGG + D IAIG+F A++ G+ V CSAGNSGP+ +V NVAPW++TVGA
Sbjct: 285 DVLSLSLGGGSAPYYRDTIAIGAFSAMERGVFVSCSAGNSGPTRASVANVAPWVMTVGAG 344
Query: 409 TVDRLFTSYVGESLSSKLLPPKKFYPL---IRALDAKSNNTSNHEAILCRQGSLNPEKVR 468
T+DR F ++ +L + + + L+ N ++ + LC GSL+ VR
Sbjct: 345 TLDRDFPAFANLGNGKRLTGVSLYSGVGMGTKPLELVYNKGNSSSSNLCLPGSLDSSIVR 404
Query: 469 GKIVVCLRGENSRGEKGYVVAQAGGVGMILANDKESGDELSASPHFLPASHIS------- 528
GKIVVC RG N+R EKG VV AGG+GMI+AN SG+EL A H LPA +
Sbjct: 405 GKIVVCDRGVNARVEKGAVVRDAGGLGMIMANTAASGEELVADSHLLPAIAVGKKTGDLL 464
Query: 529 ----YTDAYMTSV-----TELGIKPAPIMASFSSRGPNIVEPSILKPDITAPGVNIIAAF 588
+D+ T++ T L +KP+P++A+FSSRGPN V P ILKPD+ PGVNI+A +
Sbjct: 465 REYVKSDSKPTALLVFKGTVLDVKPSPVVAAFSSRGPNTVTPEILKPDVIGPGVNILAGW 524
Query: 589 SEATSISGLPYDKRQAQFITLSGTSMSCPHISGIVGLLKTLYPKWSPAAIRSAIMTTAET 648
S+A +GL D R+ QF +SGTSMSCPHISG+ GLLK +P+WSP+AI+SA+MTTA
Sbjct: 525 SDAIGPTGLDKDSRRTQFNIMSGTSMSCPHISGLAGLLKAAHPEWSPSAIKSALMTTAYV 584
Query: 649 EANDLNPIL-TSEKEKANPLAYGAGHVQPNKASNPGLVYDLTTQDYLNFLCARGYNKTLL 708
N P+ ++ +NP A+G+GHV P KA +PGLVYD++T++Y+ FLC+ Y +
Sbjct: 585 LDNTNAPLHDAADNSLSNPYAHGSGHVDPQKALSPGLVYDISTEEYIRFLCSLDYTVDHI 644
Query: 709 KLFTNDTSFVCSKSFK-VTDLNYPSISMNNLKSEAVEIKRRVKNVG-SPGMYVAQVEAPP 757
S CSK F LNYPS S+ V R V NVG + +Y V P
Sbjct: 645 VAIVKRPSVNCSKKFSDPGQLNYPSFSVLFGGKRVVRYTREVTNVGAASSVYKVTVNGAP 704
BLAST of CmaCh16G010020 vs. TAIR 10
Match:
AT2G04160.1 (Subtilisin-like serine endopeptidase family protein )
HSP 1 Score: 783.1 bits (2021), Expect = 2.1e-226
Identity = 404/739 (54.67%), Postives = 510/739 (69.01%), Query Frame = 0
Query: 70 HVFEKLVWPDGKHRSYIVYLGSHSHGLNPSAIDLQRATQTHYNLLGSMLGSNEAAKEAIF 129
H+ K + SY+VY G+HSH + + R +THY+ LGS GS E A +AIF
Sbjct: 17 HMSSKHILASKDSSSYVVYFGAHSHVGEITEDAMDRVKETHYDFLGSFTGSRERATDAIF 76
Query: 130 YSYNRHINAFAAVLDQKVAEDIAIH-------ENKKLKLHTTRSWNFLGVENDGGIPLDS 189
YSY +HIN FAA LD +A +I+ H NK LKLHTTRSW+FLG+E++ +P S
Sbjct: 77 YSYTKHINGFAAHLDHDLAYEISKHPEVVSVFPNKALKLHTTRSWDFLGLEHNSYVPSSS 136
Query: 190 LWNLSRFGESTIIGNIDSGVWPESKSFSDEGYGPIPTRWKGSCEG--GTHFSCNRKLIGV 249
+W +RFGE TII N+D+GVWPESKSF DEG GPIP+RWKG C+ F CNRKLIG
Sbjct: 137 IWRKARFGEDTIIANLDTGVWPESKSFRDEGLGPIPSRWKGICQNQKDATFHCNRKLIGA 196
Query: 250 RYFNKGFASDVGPLNSSYETARDVDGHGTHTLSTAGGNFVKGVSILGNSYGTAKGGSPKA 309
RYFNKG+A+ VG LNSS+++ RD+DGHG+HTLSTA G+FV GVSI G GTAKGGSP+A
Sbjct: 197 RYFNKGYAAAVGHLNSSFDSPRDLDGHGSHTLSTAAGDFVPGVSIFGQGNGTAKGGSPRA 256
Query: 310 LVAAYKVCWSTDRGDGCFMADILASFEAAISDGVDVLSVSLGGGIQEFSDDLIAIGSFHA 369
VAAYKVCW +G+ C+ AD+LA+F+AAI DG DV+SVSLGG F +D +AIGSFHA
Sbjct: 257 RVAAYKVCWPPVKGNECYDADVLAAFDAAIHDGADVISVSLGGEPTSFFNDSVAIGSFHA 316
Query: 370 VKNGITVVCSAGNSGPSEDTVLNVAPWMITVGASTVDRLFTS---------YVGESLSSK 429
K I VVCSAGNSGP++ TV NVAPW ITVGAST+DR F S Y G+SLSS
Sbjct: 317 AKKRIVVVCSAGNSGPADSTVSNVAPWQITVGASTMDREFASNLVLGNGKHYKGQSLSST 376
Query: 430 LLPPKKFYPLIRALDAKSNNTSNHEAILCRQGSLNPEKVRGKIVVCLRGENSRGEKGYVV 489
LP KFYP++ +++AK+ N S +A LC+ GSL+P K +GKI+VCLRG+N R EKG V
Sbjct: 377 ALPHAKFYPIMASVNAKAKNASALDAQLCKLGSLDPIKTKGKILVCLRGQNGRVEKGRAV 436
Query: 490 AQAGGVGMILANDKESGDELSASPHFLPASHISYTDAYMT----------------SVTE 549
A GG+GM+L N +G++L A PH LPA+ ++ D++ S T+
Sbjct: 437 ALGGGIGMVLENTYVTGNDLLADPHVLPATQLTSKDSFAVSRYISQTKKPIAHITPSRTD 496
Query: 550 LGIKPAPIMASFSSRGPNIVEPSILKPDITAPGVNIIAAFSEATSISGLPYDKRQAQFIT 609
LG+KPAP+MASFSS+GP+IV P ILKPDITAPGV++IAA++ A S + +D R+ F
Sbjct: 497 LGLKPAPVMASFSSKGPSIVAPQILKPDITAPGVSVIAAYTGAVSPTNEQFDPRRLLFNA 556
Query: 610 LSGTSMSCPHISGIVGLLKTLYPKWSPAAIRSAIMTTAETEANDLNPILTSEKEKANPLA 669
+SGTSMSCPHISGI GLLKT YP WSPAAIRSAIMTTA + PI + KA P +
Sbjct: 557 ISGTSMSCPHISGIAGLLKTRYPSWSPAAIRSAIMTTATIMDDIPGPIQNATNMKATPFS 616
Query: 670 YGAGHVQPNKASNPGLVYDLTTQDYLNFLCARGYNKTLLKLFTNDTSFVCSKSFKVTDLN 729
+GAGHVQPN A NPGLVYDL +DYLNFLC+ GYN + + +F+ + S + +LN
Sbjct: 617 FGAGHVQPNLAVNPGLVYDLGIKDYLNFLCSLGYNASQISVFSGNNFTCSSPKISLVNLN 676
Query: 730 YPSISMNNLKSEAVEIKRRVKNVGSPGMYVAQVEAPPGVSVSVDPSTLKFTKTDEEKDFK 775
YPSI++ NL S V + R VKNVG P MY +V P GV V+V P++L FTK E+K FK
Sbjct: 677 YPSITVPNLTSSKVTVSRTVKNVGRPSMYTVKVNNPQGVYVAVKPTSLNFTKVGEQKTFK 736
BLAST of CmaCh16G010020 vs. TAIR 10
Match:
AT5G59810.1 (Subtilase family protein )
HSP 1 Score: 775.8 bits (2002), Expect = 3.3e-224
Identity = 411/724 (56.77%), Postives = 503/724 (69.48%), Query Frame = 0
Query: 83 RSYIVYLGSHSHGLNPSAIDLQRATQTHYNLLGSMLGSNEAAKEAIFYSYNRHINAFAAV 142
+SYIVYLGSH+H S+ L +H L S +GS+E AKEAIFYSY RHIN FAA+
Sbjct: 40 KSYIVYLGSHAHLPQISSAHLDGVAHSHRTFLASFVGSHENAKEAIFYSYKRHINGFAAI 99
Query: 143 LDQKVAEDIAIH-------ENKKLKLHTTRSWNFLGVENDGGIPLDSLWNLSRFGESTII 202
LD+ A +IA H NK KLHTT SWNF+ + +G + SLWN + +GE TII
Sbjct: 100 LDENEAAEIAKHPDVVSVFPNKGRKLHTTHSWNFMLLAKNGVVHKSSLWNKAGYGEDTII 159
Query: 203 GNIDSGVWPESKSFSDEGYGPIPTRWKGSCEGGTHFSCNRKLIGVRYFNKGFASDVG-PL 262
N+D+GVWPESKSFSDEGYG +P RWKG C CNRKLIG RYFNKG+ + G P
Sbjct: 160 ANLDTGVWPESKSFSDEGYGAVPARWKGRCH--KDVPCNRKLIGARYFNKGYLAYTGLPS 219
Query: 263 NSSYETARDVDGHGTHTLSTAGGNFVKGVSILGNSYGTAKGGSPKALVAAYKVCWSTDRG 322
N+SYET RD DGHG+HTLSTA GNFV G ++ G GTA GGSPKA VAAYKVCW G
Sbjct: 220 NASYETCRDHDGHGSHTLSTAAGNFVPGANVFGIGNGTASGGSPKARVAAYKVCWPPVDG 279
Query: 323 DGCFMADILASFEAAISDGVDVLSVSLGGGIQEFSDDLIAIGSFHAVKNGITVVCSAGNS 382
CF ADILA+ EAAI DGVDVLS S+GG ++ D IAIGSFHAVKNG+TVVCSAGNS
Sbjct: 280 AECFDADILAAIEAAIEDGVDVLSASVGGDAGDYMSDGIAIGSFHAVKNGVTVVCSAGNS 339
Query: 383 GPSEDTVLNVAPWMITVGASTVDRLFTSYV----GESLS----SKLLPPKKFYPLIRALD 442
GP TV NVAPW+ITVGAS++DR F ++V G+S SK LP +K Y LI A D
Sbjct: 340 GPKSGTVSNVAPWVITVGASSMDREFQAFVELKNGQSFKGTSLSKPLPEEKMYSLISAAD 399
Query: 443 AKSNNTSNHEAILCRQGSLNPEKVRGKIVVCLRGENSRGEKGYVVAQAGGVGMILANDKE 502
A N + +A+LC++GSL+P+KV+GKI+VCLRG+N+R +KG A AG GM+L NDK
Sbjct: 400 ANVANGNVTDALLCKKGSLDPKKVKGKILVCLRGDNARVDKGMQAAAAGAAGMVLCNDKA 459
Query: 503 SGDELSASPHFLPASHISYTD-----AYMTSVTE-----------LGIKPAPIMASFSSR 562
SG+E+ + H LPAS I Y D +Y++S + L KPAP MASFSSR
Sbjct: 460 SGNEIISDAHVLPASQIDYKDGETLFSYLSSTKDPKGYIKAPTATLNTKPAPFMASFSSR 519
Query: 563 GPNIVEPSILKPDITAPGVNIIAAFSEATSISGLPYDKRQAQFITLSGTSMSCPHISGIV 622
GPN + P ILKPDITAPGVNIIAAF+EAT + L D R+ F T SGTSMSCPHISG+V
Sbjct: 520 GPNTITPGILKPDITAPGVNIIAAFTEATGPTDLDSDNRRTPFNTESGTSMSCPHISGVV 579
Query: 623 GLLKTLYPKWSPAAIRSAIMTTAETEANDLNPILTSEKEKANPLAYGAGHVQPNKASNPG 682
GLLKTL+P WSPAAIRSAIMTT+ T N P++ +KANP +YG+GHVQPNKA++PG
Sbjct: 580 GLLKTLHPHWSPAAIRSAIMTTSRTRNNRRKPMVDESFKKANPFSYGSGHVQPNKAAHPG 639
Query: 683 LVYDLTTQDYLNFLCARGYNKTLLKLFTNDTSFVCSKSFKVTDLNYPSISMNNLKSEAVE 742
LVYDLTT DYL+FLCA GYN T+++LF D + C + + D NYPSI++ NL + ++
Sbjct: 640 LVYDLTTGDYLDFLCAVGYNNTVVQLFAEDPQYTCRQGANLLDFNYPSITVPNL-TGSIT 699
Query: 743 IKRRVKNVGSPGMYVAQVEAPPGVSVSVDPSTLKFTKTDEEKDFKVVLRRVPNNQTEQNV 775
+ R++KNVG P Y A+ P GV VSV+P L F KT E K F++ LR +P + V
Sbjct: 700 VTRKLKNVGPPATYNARFREPLGVRVSVEPKQLTFNKTGEVKIFQMTLRPLPVTPSGY-V 759
BLAST of CmaCh16G010020 vs. TAIR 10
Match:
AT5G67360.1 (Subtilase family protein)
HSP 1 Score: 561.6 bits (1446), Expect = 9.8e-160
Identity = 327/733 (44.61%), Positives = 453/733 (61.80%), Query Frame = 0
Query: 84 SYIVYLGSHSHGLNPSAIDLQRATQTHYNLLGSMLGSNEAAKEAIFYSYNRHINAFAAVL 143
+YIV++ PS+ DL H N S L S + E + Y+Y I+ F+ L
Sbjct: 31 TYIVHMAKSQ---MPSSFDL------HSNWYDSSLRSISDSAE-LLYTYENAIHGFSTRL 90
Query: 144 DQKVAED-------IAIHENKKLKLHTTRSWNFLGVENDGGIPLDSLWNLSRFGESTIIG 203
Q+ A+ I++ + +LHTTR+ FLG++ L+ + ++G
Sbjct: 91 TQEEADSLMTQPGVISVLPEHRYELHTTRTPLFLGLDEHTA----DLFPEAGSYSDVVVG 150
Query: 204 NIDSGVWPESKSFSDEGYGPIPTRWKGSCEGGTHFS---CNRKLIGVRYFNKGFASDVGP 263
+D+GVWPESKS+SDEG+GPIP+ WKG CE GT+F+ CNRKLIG R+F +G+ S +GP
Sbjct: 151 VLDTGVWPESKSYSDEGFGPIPSSWKGGCEAGTNFTASLCNRKLIGARFFARGYESTMGP 210
Query: 264 LNSSYE--TARDVDGHGTHTLSTAGGNFVKGVSILGNSYGTAKGGSPKALVAAYKVCWST 323
++ S E + RD DGHGTHT STA G+ V+G S+LG + GTA+G +P+A VA YKVCW
Sbjct: 211 IDESKESRSPRDDDGHGTHTSSTAAGSVVEGASLLGYASGTARGMAPRARVAVYKVCWL- 270
Query: 324 DRGDGCFMADILASFEAAISDGVDVLSVSLGGGIQEFSDDLIAIGSFHAVKNGITVVCSA 383
GCF +DILA+ + AI+D V+VLS+SLGGG+ ++ D +AIG+F A++ GI V CSA
Sbjct: 271 ---GGCFSSDILAAIDKAIADNVNVLSMSLGGGMSDYYRDGVAIGAFAAMERGILVSCSA 330
Query: 384 GNSGPSEDTVLNVAPWMITVGASTVDRLF---------TSYVGESLSSKLLPPKKFYPLI 443
GN+GPS ++ NVAPW+ TVGA T+DR F ++ G SL P K P I
Sbjct: 331 GNAGPSSSSLSNVAPWITTVGAGTLDRDFPALAILGNGKNFTGVSLFKGEALPDKLLPFI 390
Query: 444 RALDAKSNNTSNHEAILCRQGSLNPEKVRGKIVVCLRGENSRGEKGYVVAQAGGVGMILA 503
A +A + N LC G+L PEKV+GKIV+C RG N+R +KG VV AGGVGMILA
Sbjct: 391 YAGNASNATNGN----LCMTGTLIPEKVKGKIVMCDRGINARVQKGDVVKAAGGVGMILA 450
Query: 504 NDKESGDELSASPHFLPAS-----------HISYTDAYMTSV-----TELGIKPAPIMAS 563
N +G+EL A H LPA+ H TD T+ T +G+KP+P++A+
Sbjct: 451 NTAANGEELVADAHLLPATTVGEKAGDIIRHYVTTDPNPTASISILGTVVGVKPSPVVAA 510
Query: 564 FSSRGPNIVEPSILKPDITAPGVNIIAAFSEATSISGLPYDKRQAQFITLSGTSMSCPHI 623
FSSRGPN + P+ILKPD+ APGVNI+AA++ A +GL D R+ +F +SGTSMSCPH+
Sbjct: 511 FSSRGPNSITPNILKPDLIAPGVNILAAWTGAAGPTGLASDSRRVEFNIISGTSMSCPHV 570
Query: 624 SGIVGLLKTLYPKWSPAAIRSAIMTTAETEANDLNPILTSEKEK-ANPLAYGAGHVQPNK 683
SG+ LLK+++P+WSPAAIRSA+MTTA D P+L K + P +GAGHV P
Sbjct: 571 SGLAALLKSVHPEWSPAAIRSALMTTAYKTYKDGKPLLDIATGKPSTPFDHGAGHVSPTT 630
Query: 684 ASNPGLVYDLTTQDYLNFLCARGYNKTLLKLFTNDTSFVC--SKSFKVTDLNYPSISMNN 743
A+NPGL+YDLTT+DYL FLCA Y ++ + ++ C SKS+ V DLNYPS ++N
Sbjct: 631 ATNPGLIYDLTTEDYLGFLCALNYTSPQIRSVSR-RNYTCDPSKSYSVADLNYPSFAVNV 690
Query: 744 LKSEAVEIKRRVKNVGSPGMYVAQVEA-PPGVSVSVDPSTLKFTKTDEEKDFKVVLRRVP 776
A + R V +VG G Y +V + GV +SV+P+ L F + +E+K + V
Sbjct: 691 DGVGAYKYTRTVTSVGGAGTYSVKVTSETTGVKISVEPAVLNFKEANEKKSYTVTFTVDS 740
BLAST of CmaCh16G010020 vs. TAIR 10
Match:
AT2G05920.1 (Subtilase family protein)
HSP 1 Score: 517.7 bits (1332), Expect = 1.6e-146
Identity = 303/684 (44.30%), Positives = 410/684 (59.94%), Query Frame = 0
Query: 109 THYNLLGSMLGSNEAAKEAIFYSYNRHINAFAAVLDQKVAEDIA--------IHENKKLK 168
TH++ S L S + ++ Y+Y + F+A LD A+ + I E+
Sbjct: 45 THHDWYTSQLNS----ESSLLYTYTTSFHGFSAYLDSTEADSLLSSSNSILDIFEDPLYT 104
Query: 169 LHTTRSWNFLGVENDGGIPLDSLWNLSRFGESTIIGNIDSGVWPESKSFSDEGYGPIPTR 228
LHTTR+ FLG+ ++ G+ +L IIG +D+GVWPES+SF D IP++
Sbjct: 105 LHTTRTPEFLGLNSEFGV-----HDLGSSSNGVIIGVLDTGVWPESRSFDDTDMPEIPSK 164
Query: 229 WKGSCEGGTHFS---CNRKLIGVRYFNKGF-ASDVGPLNSSYETA--RDVDGHGTHTLST 288
WKG CE G+ F CN+KLIG R F+KGF + G +S E+ RDVDGHGTHT +T
Sbjct: 165 WKGECESGSDFDSKLCNKKLIGARSFSKGFQMASGGGFSSKRESVSPRDVDGHGTHTSTT 224
Query: 289 AGGNFVKGVSILGNSYGTAKGGSPKALVAAYKVCWSTDRGDGCFMADILASFEAAISDGV 348
A G+ V+ S LG + GTA+G + +A VA YKVCWST GCF +DILA+ + AI DGV
Sbjct: 225 AAGSAVRNASFLGYAAGTARGMATRARVATYKVCWST----GCFGSDILAAMDRAILDGV 284
Query: 349 DVLSVSLGGGIQEFSDDLIAIGSFHAVKNGITVVCSAGNSGPSEDTVLNVAPWMITVGAS 408
DVLS+SLGGG + D IAIG+F A++ G+ V CSAGNSGP+ +V NVAPW++TVGA
Sbjct: 285 DVLSLSLGGGSAPYYRDTIAIGAFSAMERGVFVSCSAGNSGPTRASVANVAPWVMTVGAG 344
Query: 409 TVDRLFTSYVGESLSSKLLPPKKFYPL---IRALDAKSNNTSNHEAILCRQGSLNPEKVR 468
T+DR F ++ +L + + + L+ N ++ + LC GSL+ VR
Sbjct: 345 TLDRDFPAFANLGNGKRLTGVSLYSGVGMGTKPLELVYNKGNSSSSNLCLPGSLDSSIVR 404
Query: 469 GKIVVCLRGENSRGEKGYVVAQAGGVGMILANDKESGDELSASPHFLPASHIS------- 528
GKIVVC RG N+R EKG VV AGG+GMI+AN SG+EL A H LPA +
Sbjct: 405 GKIVVCDRGVNARVEKGAVVRDAGGLGMIMANTAASGEELVADSHLLPAIAVGKKTGDLL 464
Query: 529 ----YTDAYMTSV-----TELGIKPAPIMASFSSRGPNIVEPSILKPDITAPGVNIIAAF 588
+D+ T++ T L +KP+P++A+FSSRGPN V P ILKPD+ PGVNI+A +
Sbjct: 465 REYVKSDSKPTALLVFKGTVLDVKPSPVVAAFSSRGPNTVTPEILKPDVIGPGVNILAGW 524
Query: 589 SEATSISGLPYDKRQAQFITLSGTSMSCPHISGIVGLLKTLYPKWSPAAIRSAIMTTAET 648
S+A +GL D R+ QF +SGTSMSCPHISG+ GLLK +P+WSP+AI+SA+MTTA
Sbjct: 525 SDAIGPTGLDKDSRRTQFNIMSGTSMSCPHISGLAGLLKAAHPEWSPSAIKSALMTTAYV 584
Query: 649 EANDLNPIL-TSEKEKANPLAYGAGHVQPNKASNPGLVYDLTTQDYLNFLCARGYNKTLL 708
N P+ ++ +NP A+G+GHV P KA +PGLVYD++T++Y+ FLC+ Y +
Sbjct: 585 LDNTNAPLHDAADNSLSNPYAHGSGHVDPQKALSPGLVYDISTEEYIRFLCSLDYTVDHI 644
Query: 709 KLFTNDTSFVCSKSFK-VTDLNYPSISMNNLKSEAVEIKRRVKNVG-SPGMYVAQVEAPP 757
S CSK F LNYPS S+ V R V NVG + +Y V P
Sbjct: 645 VAIVKRPSVNCSKKFSDPGQLNYPSFSVLFGGKRVVRYTREVTNVGAASSVYKVTVNGAP 704
BLAST of CmaCh16G010020 vs. TAIR 10
Match:
AT1G04110.1 (Subtilase family protein)
HSP 1 Score: 503.8 bits (1296), Expect = 2.4e-142
Identity = 299/686 (43.59%), Positives = 406/686 (59.18%), Query Frame = 0
Query: 116 SMLGSNEAAKE---AIFYSYNRHINAFAAVLDQKVA-------EDIAIHENKKLKLHTTR 175
++LG E +E + YSY I FAA L + A E +A+ + L++ TT
Sbjct: 56 AVLGVEEEEEEPSSRLLYSYGSAIEGFAAQLTESEAEILRYSPEVVAVRPDHVLQVQTTY 115
Query: 176 SWNFLGVENDGGIPLDSLWNLSRFGESTIIGNIDSGVWPESKSFSDEGYGPIPTRWKGSC 235
S+ FLG++ G +W+ SRFG+ TIIG +D+GVWPES SF D G IP +WKG C
Sbjct: 116 SYKFLGLDGFGN---SGVWSKSRFGQGTIIGVLDTGVWPESPSFDDTGMPSIPRKWKGIC 175
Query: 236 EGGTHF---SCNRKLIGVRYFNKGFASDVGPLNS-----SYETARDVDGHGTHTLSTAGG 295
+ G F SCNRKLIG R+F +G P S Y +ARD GHGTHT ST GG
Sbjct: 176 QEGESFSSSSCNRKLIGARFFIRGHRVANSPEESPNMPREYISARDSTGHGTHTASTVGG 235
Query: 296 NFVKGVSILGNSYGTAKGGSPKALVAAYKVCWSTDRGDGCFMADILASFEAAISDGVDVL 355
+ V ++LGN G A+G +P A +A YKVCW +GC+ +DILA+ + AI D VDVL
Sbjct: 236 SSVSMANVLGNGAGVARGMAPGAHIAVYKVCWF----NGCYSSDILAAIDVAIQDKVDVL 295
Query: 356 SVSLGGGIQEFSDDLIAIGSFHAVKNGITVVCSAGNSGPSEDTVLNVAPWMITVGASTVD 415
S+SLGG DD IAIG+F A++ GI+V+C+AGN+GP E +V N APW+ T+GA T+D
Sbjct: 296 SLSLGGFPIPLYDDTIAIGTFRAMERGISVICAAGNNGPIESSVANTAPWVSTIGAGTLD 355
Query: 416 RLFTSYVGESLSSKLLPPKKFYP------LIRALDAKSNNTSNHEAILCRQGSLNPEKVR 475
R F + V + KLL + YP R ++ + + C +GSL E++R
Sbjct: 356 RRFPAVV-RLANGKLLYGESLYPGKGIKNAGREVEVIYVTGGDKGSEFCLRGSLPREEIR 415
Query: 476 GKIVVCLRGENSRGEKGYVVAQAGGVGMILANDKESGDELSASPHFLPASHISYTD---- 535
GK+V+C RG N R EKG V +AGGV MILAN + + +E S H LPA+ I YT+
Sbjct: 416 GKMVICDRGVNGRSEKGEAVKEAGGVAMILANTEINQEEDSIDVHLLPATLIGYTESVLL 475
Query: 536 -AYMTSV-----------TELGIKPAPIMASFSSRGPNIVEPSILKPDITAPGVNIIAAF 595
AY+ + T +G AP +A FS+RGP++ PSILKPD+ APGVNIIAA+
Sbjct: 476 KAYVNATVKPKARIIFGGTVIGRSRAPEVAQFSARGPSLANPSILKPDMIAPGVNIIAAW 535
Query: 596 SEATSISGLPYDKRQAQFITLSGTSMSCPHISGIVGLLKTLYPKWSPAAIRSAIMTTAET 655
+ +GLPYD R+ F +SGTSMSCPH+SGI L+++ YP WSPAAI+SA+MTTA+
Sbjct: 536 PQNLGPTGLPYDSRRVNFTVMSGTSMSCPHVSGITALIRSAYPNWSPAAIKSALMTTADL 595
Query: 656 EANDLNPILTSEKEKANPLAYGAGHVQPNKASNPGLVYDLTTQDYLNFLCARGYNKTLLK 715
I K A A GAGHV P KA NPGLVY++ DY+ +LC G+ ++ +
Sbjct: 596 YDRQGKAIKDGNK-PAGVFAIGAGHVNPQKAINPGLVYNIQPVDYITYLCTLGFTRSDIL 655
Query: 716 LFTNDTSFVCSKSFKVT---DLNYPSISMNNLKSEAVE-IKRRVKNVGSP-GMYVAQVEA 757
T+ + C+ + LNYPSI++ + + E I RRV NVGSP +Y V+A
Sbjct: 656 AITH-KNVSCNGILRKNPGFSLNYPSIAVIFKRGKTTEMITRRVTNVGSPNSIYSVNVKA 715
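Each HSP header above reports identity and positives as a count over the aligned length, followed by a percentage. As a minimal sketch, the statistics line could be parsed and the percentages recomputed from the counts as shown below. The function name `parse_hsp_stats` and the returned field names are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Matches a line such as:
#   Identity = 411/724 (56.77%), Positives = 503/724 (69.48%), Query Frame = 0
# 'Posi?tives' also accepts the 'Postives' spelling that some report exports emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, length, positives, pos_length = (int(g) for g in m.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        # Percentages are recomputed from the counts rather than trusted as printed.
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / pos_length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 411/724 (56.77%), Positives = 503/724 (69.48%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing the percentages from the counts gives a quick consistency check against the values printed in the report.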
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9ZSP5 | 2.9e-225 | 54.67 | Subtilisin-like protease SBT5.3 OS=Arabidopsis thaliana OX=3702 GN=AIR3 PE=2 SV=...
F4JXC5 | 4.6e-223 | 56.77 | Subtilisin-like protease SBT5.4 OS=Arabidopsis thaliana OX=3702 GN=SBT5.4 PE=1 S...
I1N462 | 5.2e-190 | 49.80 | Subtilisin-like protease Glyma18g48580 OS=Glycine max OX=3847 GN=Glyma18g48580 P...
O65351 | 1.4e-158 | 44.61 | Subtilisin-like protease SBT1.7 OS=Arabidopsis thaliana OX=3702 GN=SBT1.7 PE=1 S...
Q9ZUF6 | 2.3e-145 | 44.30 | Subtilisin-like protease SBT1.8 OS=Arabidopsis thaliana OX=3702 GN=SBT1.8 PE=1 S...
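The pipe-delimited rows of the summary table above are straightforward to load for sorting or filtering. The following is a rough sketch under the assumption that each row carries the match name, E-value, percent identity and description in its first four columns. The helper name `parse_hit_row` is hypothetical, and any trailing columns are ignored.

```python
import re

def parse_hit_row(row):
    """Split one pipe-delimited summary row into typed fields."""
    fields = [f.strip() for f in row.split("|")]
    name, evalue, identity, description = fields[:4]
    # The UniProt-style description carries the gene name as 'GN=<name>' when present.
    gene = re.search(r"GN=(\S+)", description)
    return {
        "match": name,
        "evalue": float(evalue),
        "identity": float(identity),
        "gene": gene.group(1) if gene else None,
        "description": description,
    }

rows = [
    "Q9ZSP5 | 2.9e-225 | 54.67 | Subtilisin-like protease SBT5.3 OS=Arabidopsis thaliana OX=3702 GN=AIR3 PE=2 SV=...",
    "F4JXC5 | 4.6e-223 | 56.77 | Subtilisin-like protease SBT5.4 OS=Arabidopsis thaliana OX=3702 GN=SBT5.4 PE=1 S...",
]
hits = sorted((parse_hit_row(r) for r in rows), key=lambda h: h["evalue"])
for hit in hits:
    print(hit["match"], hit["gene"], hit["evalue"], hit["identity"])
```

Sorting by E-value rather than identity keeps the ordering used in the table, where the hit with the smallest E-value comes first even though its percent identity is not the highest.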