Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTTTTTCTACTTCTCACTTGTAAGTCAACTTTTTCTCAACCAACTTGGTTATGAGTTACTAGTAGGATCAACCACCTTACGTTTTTCGCACGCTAAAAATCCCTTTGTTCGAGTCCAATTTGCGTTGTTGAGCTGAACACATTGACAATTATGAACCCATTCATGAGGAAAGGTCAGACCAGGTTTGTTAAGGATCTCTGGTTCCTGGAAAATCTCCTAAACAATCCTCCATTAATGTCAAAATTTCCAATTCTTGACAAATTTCCAATATGAATAAAGATATTAATTATAAGCATAACTTACTCGAGTAACGTATTGACGAATTCAAATAAATTTTGAGCTTTTATTATCATGGTTTTTTAGAAAAATCATTTAGATTGATGTTTGCATTGGCATCGTCATAAAACTAAGGCTCCGTACACTTCATTCATGTTAGATGGCTGTGGTTTATGGAGTTAAGCTTACTTGAAATGAATTTTTATGCCGCTGTCTTCTAGATTTTGTCTTAGCGCAGCATTTAACTATTAAGTTCTGATTTTTGTGTATGAGAATAATTCTGATTTTAGAATATTATTTCATTATAGTATTGAATGACTTATCCCTGTCTCTTTTCCAGTGCAGGCAATGGAGACTGTTGTATTGCTCATAGAGTATTTTCGGGCATATTGCGATCTGATGTCATGTGTATGGCATGTGGTTTTACATCGACAACTTATGATCCATGTGTAGACATCTCTTTGGATTTGGAACCAAATCAAGCGGGATCTGCGAAGATGGCGGCCACAAGGAATCTTCCTTGCAATGGCGAGGCAGATTGCATGAACTCGAGCCAAAACCAGCGGCTATCTTCCCTGGTGGGTTGTTTGGATCAATTCACTAGACCCGAAAAACTCGGGTCCGACCAAAAATTCTTCTGCCAACAGTGTCAAGTGCGGCAGGAATCACTCAAACAGATGTCCATAAGAAAGCTTCCATTAGTTTCTTGTTTCCACATTAAAAGATTCGAGCATTCTTCAGTTCGAAAAATGTCAAGGAAGGTGGACCGATATTTGCAGTTTCCGTTTTCTTTAGACATGGCACCTTATCTCTCATCTTCAATCTTGAGAAGTCGATTTGGTAATAGGATTTTTCCTTTTGATGGAGATGAACAAGATGGTTGTAATGAGACATCGTCTGAGTTCGAGTTGTTTGCCGTTATCACTCACAAAGGAAAATTGGACGCTGGTCATTACGTGACCTACTTGCGGCTAAGTAATCAATGGTATAAGTGCGATGATGCTTGGATTACCCAAGTCAACGAAAACATCGTCAGGGCTGCACAAGGATACATGATGTTTTATGTGCAGAAGATGCTCTATTACAAATCTAGTGAGAAACAGAATGCTTCATAACTTGATTATGGGACGAGATCGGTTGAAACTCAATCGATTATGTTTGTTTTCCATGAAAGCCAGGAATCGATATTGTTTTCAGTGACAAGCTGAGCTCTAAGTGGATGCCTACACATTCTAGTAGAAGATCCAGGCGTGCCCAAACTTTTTGGGGAGGATCATTGTCAGATCCCTACTATTTGAAGGGAACTTCTGGTACCTTTTTCTAACTCCCTTTTCACTACTAATCAAGTAATGTTGGTGACTTGCCATGGAGTGTTTCAGTACTTGGAGTTGTGGCTCAGCAGATTACTGATCTCTCTTCACATACCATGAAGAAGAGGTATGAAAAATCTAATTTTAGAGCTCTCTCTCTCTTTAGATATCTCTAAGAATTTCGGGTTCTTCGTGTCCCGGGCTTCCGATTTTGAAATTTATCCATCATTTTTTTTGGTGGATAACCACCTATTAATTCAAACATGAAACAGTACAATGAAAGGCTGCCTCTTCAGCCTTTTTATTACACGTGATTAATACTGATGAATTTAAGTCTCCATTGATTCTGTCTCTGATTATTCTATTTATTGACATGAACATATGAACGTTTATGACATTTTCAGTCTGATTCTGTTGACATATGTTTCAAAAAAGTTACAAAAAAGTTGTGTTCAAAGTCGAGTTTTGCTGTTGATGAAGAAAGAGCTCTCTATAAACTTGTGTGTACGTTCCAAGTTCTTGTTGATCACACCCTTTGAATTACTTCTGTGCATTCTGAATATCTCTATCTGAACTTTGTTGTCAGCTATGAACCCTACAAAGATCTGAGAGAAAAAAGGGAGAATTTTACAGCATTTGAAATTAAATTGTACAATCAAGTCGAAATCGATGCTTTTTATGTGATTGGGAACGAGATGCTTTCTTTGGTAGATCGGATTGGAAGTATTCTGTCTTCGTTTTCAAAACTATTCATTAGGGAAACAGCTTCATTTTGTTCTTAAAAACAGGTTGTACATGAAAGTCGACACTCAATAGTTCGAGAGCGTGTAGTTTTGAAGTCTGATAATAATTGGTCTCTTACCCTAAAACATTTGATAAGAATAAACGTTCCTATGTTAGGCTTGTAAGTTAGACGTTTGTATCGGTCGGGAATGTGAGTTGTCAAACTCGCACTTCACAAAAACTGTTTACTTATCGATATAATTAAAAGAATTGAGAGTTTAGTATAAAGTTTTTTTTTTTTAAAAAAAATTAAATTCAAATAATAAAATGTTTAAAAAAGTATAAATGTTAGCTAAACATTTATATACTTACATTTATATATCTAAGTAGGTCCTAAACATTTATATACTTAGAATTTTTTGAATAAATGAGCAAACTTTATTATAATTATAAGGTTATGTAAAATTTAGTAAGTGACATCAATTAGTACCCTAATAATTAATCCATTGAATTAAGATATCAATAATTAATCCTTTAATTTCTTACCTTATTGAATTAAAAAATTCAATTTTGAAATTTAAATATTTTTTAATGGATTTTTCTTTAAAAAAGAAAAAAACAAAAAGTGATATTGCATTGACCTCTCTCCTTCCACCACGTGTTTTTCATTACCCTTCCAGGTATTTATCGTGTGCAGCCATGCTCATCATAATCACATGCTTACCACGTGTCGCATTTTAAAATTAAACTATCATGGTTCTTTCCGCACCTTATAGTTACCCGTTGTCCTCTCTTGTGCTTCCGTTTCATTTCAGTTTCCAAATTGTGTTTCTGTCTTATCCCTTCTCTCTCGCCTTAGCCCTCGCAATTTTGCTCGCTCCTTTGTCCTCACCGTACTCGTCAATCGGAATCGGAATCGGAATCTCAATCTGCCGGACCTTTGTTGGTCGGTTTCCGACAGGAAGTTCTGTAATTGGGTTGTGAACGTAATATTGGCTCAACTTTGGCTTGGTTCTTGGAGAAGTTGAGCTGATGCTATCGCTTCGACGATTTGTTTGATTGAATTCGGGTTGATTTCACTGGTGTTTAGGTGAGAATTTATCTCTTTCTTTCACAATTTAGCTGAATTTATTGTTCTTATTTTTACGCATTCAATGGAAAAATCTGGTGAGTATATAGTATAGTTGAACTGGATATAGGTTGCTGTTTAGGAGGTATTTTATTGTGTCGTTGAGCTTGGAAGTGCGAATAGAGGGTAGGGAAAGATAAAAGAATGAGGATTTGTGGTGATGGTTAAGGACATGCATATCTTGGAAAATGTCTGAGTCGTATCCGTTGAGTTTTTAGTAACTCTTGGTGTCTTTTTGCCTCATGATGTCTTGGATTATTAGGGTAGATCTTGTCAGATGGATTACAAGATTTTTGGGATTGAAGTTGATAATAGTGGATGTAAAGGGAAAGTGCTCGTCTATTGAAGAGAAGTGGACATAATATTATTTGGCCCCTTTGGATTAGATCTTGTGAACTCTCTCATATCGTTTCTAGTTGTGGTAGGAAAATTTTTGTGATGATTTCTTTGGGAACTTGGAAGAGATGGAAAAGGGTGGATGTAGTTTTGAACTTGATCAAAGGAAATATGTTCCGGGTTTTTACAGAGGACTTCACATTAATTCAAGGTGGATAGACTTTTGAAGCGGATTGAAGGGAAACCGGGATGTTTTAGGCACACGACTTTATTGACTTAAAAGTTCTCGTAGAGGTAGTTCAAGGAGTTAACGGGACCGCCATTTATTACTGGGGATGAGGTGGGACAAGAATCTTTAGGTGTGTTTTATTTGATGTGGTGAACTTTTGTTGGGTCTGAATGGTAGTCATTTTTACTGGGATGATCTATATTGCCGAGGAACAGAGGGCCTTTGTATGTCATCAGCAAGTGCTTTACAAGATTGGACCTGCCCAAGGTCAAGCCATTTTATGCAAATAATTTGTGTTATTTTTCTTCTCATTTGGACTTAGGAGGTGTGATCAATTGATAAGGCACATGAACCATGGTATTTTCATTTAATATTTTGCATACCTTTTTAATATTTTGCAGGCAGAATGAAGGGTGTTAGAAACTGGTTGTTTTCTCAATTAGTATCCAAGTCAGTGGTTTCATCAAGACCATTACTGGGGAGTGACAGTTTCTTTGGTGAGGAAAATAAAGAACATGTGGATGAAGACCAAGATGGCGAAGGTACATATCAGTCTAATTGATGAATTTGTCTTTGTTTTAGATTCAAAATTCTCTTGCCTCTGCTGATTGCATAGATTTTGAGAAAATTCATAGCTTGTTTGGTTTACTTATTCAACAGGATATCTCTTGTAAGGATGCTGCCACATTAAAATTGTCATTGTTGCTCTAAACTATTAAGGGATATATTTTAGACAATAGTGAAGGATTGATGTAATATGTACCTTTGCTTTTCGTAAATTTTGGAACTCTTGAAGCTCTGGTTCCCTGGGAACTTCTTGGAATTTAAACTTCCCCACCGCCACCGCCACACACACACACACACACACACAAAAGGTAGTCTTGATGGTTACGCTGGGGTCTTTAAGTTCTTGGATTCCAAGCCTTTAGATAGATAACTTTAAAATATATGGTGTCACTAGGTGCTAGCCTTGGAGTGTGTGAAGTTCCTAGGAGACTAGTATGTTGGGTCCTAAGCATAATGCCAGTGGAGTGGGGTCCTTTAATCTTCTTCTTATAAAAAAAAAATGTTCAAATACTATTTTCGTTTTTGTGCTTTCAGTTTCGGTTGGTCTCTGTACTTTCAATTTTGGTTCAGTTTGGCCCTGTACTTAAAAAATGGACCATTTTGTTCACTTAAAAAGTGAAATAAAAAATGGAAATGAATGGACCAAAATGGTCCGTTTTTTAAGTTGTTCAGGATCAAAATGGATATTTTGAAAGTACAGAGATGAAATGATCTAAAGTTGAGAGTACAAGGACCAAAATGAACCTTTCGAAAATGTAGGGACCAAAGTAGTATTTAAATAAAAAGATTGAACTAAAAACAATTTCTCGAAGGCATAAAAGATCAAGGAATACAAGAGTAAAAAAAGTACGAAACAGAAAATACACCCATATAGTGATTAGCCATGAACATGAACTTTTCATAGACCTAACAAAATGAACCCCTATTGAAAATGACTCTTTTACTGACTTTTGGTACCATCTTGAGAAAATTGAGAATATCAATAAGGGCCTTCCAAATCTGCTGCTAAAGATTTGAGTCATTTTGGCAGAGTTATTTGTTGGTCATCAATACATTTATAGTTTCTTTTTTTACCAATACTTTTAGTTGTTTGAATTTTTTCTTCCTGCCTGATTTGGTTGTTATTTGTATTTCATTGACTCATGAGTTAATTAACTATTTATGGTCACCATTTGTAAAGATTCAAGAAACATATATATTTATGTTCATCACTTGTAGAGATTCATATTCTGTTATCTCTTATCCTTGTACCCCAGCAGCCTTTGTGGTAGATTGGCTCCACTTCTGTTATTTGTGGCTAGATGTAGGCTAAACTACTGAGCTTTGTGAAATTTTTTTAATGGCCAAGTTTATGCCTGGCTTCCTTAACATTAATGTTCTTATGTCAACCAATTTGTCCCAATGAGTCACAGGGTAGAAATATAATGTTGATGGTGCTATTTGGTCACTGATCAGAGAAAATGGTTTTATTGAAGACTTCTTTACTGTTAAAGCTGCTTAGAATGAGCTACATTCTACTGTAGAATAATGTAAACTTGCTTTATAAACCATTTCTGTTTTTTTTTTCAAAGTTTGCAAATTCTATAATAACATCACATTATCTTTTGAAGTTGCACAAGCAACCACTATTGTACCGCCCACTGCCCCTCATACATCAAACTCTGGCGGTAGTTTGGAGAGTCAAGAAGATTTGCCCTTGGAACAGTCTCAACATAGCTCTAACAGGGTAAAGGTGGATGTGTTGACAATGATTGAGGACCTCCAAGTTCAGTTCTTTCGACTTCTACAGAGAATTGGGCAGACACCGAACAATTTGCTAGTGGAAAAGGTCCTGTATCGAATACATCTAGCAACTTTGATACAGGTAGGAGAATCCGATCTCAAAAGAGTAAATTTTGAAAGAGGCAAAGCCAGAGAAAAAGCAGCTGAACAAGAAGCAGCTGGGATACCAGAATCAAGCTTCACATTTAGAGTACTTGTATTGGGAAAAACAGGAGTTGGCAAGAGTGCTACAATAAATTCCCTCTTTGATCAAGCAAAAACTGCAACGGATGCATTTCAACCTGCAACCGATCGTATTCAGGAGATTGTGGGAACGATCAATGGGATTAAAGTATCCATCATTGATACCCCCGGTCTTTCCCAATCATCCTCGGGAAATATGAAGAGAAATAAGAAAATCTTGTTTTCTGTGAAGAGATATATAAGGAAATCCCCACCAGATATCGTTTTGTACTTCGAGCGCCTTGACCTCATAAACAAGAATCATGCTGACTACTTTTTAATGAAGCTAATAAATGAGGTCTTTGGTCCTGCAATTTGGTTCAACACCATCCTAGTCTTGACACATTGTTCCTCAGCTCTTCCAGAAGGACCTGATGGATATCCTGTCTCCTTCGAATCGTATGTGTCCCATTGCTCAGAGCTTTTGCAGCAAAATATACATCAGGCAGTGTCAGACCCAAGACTTGAAAATCCCGTCCTCTTGGTCGAGAACCATCCTCAGTGTAAGAAAAATATTATGGGGGAAAAAGTTCTTCCAAATGGGCAGGTCTGGAGATCACATTTCTTGTTGTTGTGCATTTGTTATAAAGTTCTGGGCAGCATTAATACCCTATTGAAATTTCAAAATTGCATTGAGCTAGGGCCATTAGCTAATACCCGGCTGCCTTCACTTCCCCACTTACTCTCATCTATTTTACGGCACCGAAATACGACAAGTCCATCAGGTGTAGACTATGACAGTGAAGCTATTATACTCAGGGACAATGAAGAAGATGAGTACGATGATCTACCTTCGATTCGCATTCTGACGAAATCCCAATTCGATAAGCTGTCGAACTCACAGAAAAAGGAATACCTGGATGAGCTGGAATACAGGGAAACTCTATATTTAAAGAAACAGCTAAGAGAAGAGTATCAAAGGAGGAAGGAAATCAAGCTTTTAAAACATAGAGACTCGGAACACAATGATAATAATGGTGATTTGCAGCCATCGCCAGAGGCGGAAGCTGTTCTGCTTCCAGATATGGCTGTTCCACCAAGTTTTGACTCAGATTGTCCTGTTCATAGATATCGTTGCATTGCAGTAGATGATCAGTGGATTGTGAGACCTGTTCTTGACCCACAAGGATGGGACCATGATGTAGGCTTCGATGGGATAAATCTCGAAACAGCAATGGAGATGAACAAAAATGTTTTTACCTCAGTCACTGGACAGGTGAGTAAAGATAAGCGTTTCTTTAACATTCAATCTGAGTGTGCTGCTTCTTACATGGATTCTAGGGGATCTTCTTATACTTTAGGTCTAGATGTTCAATCTTCTGGTACAGATAGGATATACACTGTTCATAGTAATGCTAAGCTGGGTAGCATTAAGCACAACCATCCTGGAATTGGACTTTCTCTGATATCTTTCAAGAGAAATTGCTATTATGGTGTGAAGCTTGAAGATACCATATCTATAGGTAAGCGAGTGAGGCTCGTAGCCAATGGCGGTCGTATAGAAGGAGCAGGACAAATGGCATATGGTGGAAGCATAGTAGCTACTTTAAGAGGTGATGACTACCCAGTGAGGAATGACCATCTCAGTCTAACGATGACCGTCCTCTCTTTTGACAAGGAAACCATCCTGAGCGGGAATGTAGAGTCTGAGTTTCGGGTTAGCCGAAGCATGAGACTGTCGGTTAATGCCAACTTAAATACACGTAAAATGGGTCAGATCTGCATAAAAACAAGTAGCTGTGAGCATTTGCAGATTGCTTTGATTTCTGGTTTTACAATCTTGAGAGCCCTTCTGCGTAGAAAGGAAATTGAAACGTTGTAG
mRNA sequence
ATGGTTTTTTCTACTTCTCACTTTGCAGGCAATGGAGACTGTTGTATTGCTCATAGAGTATTTTCGGGCATATTGCGATCTGATGTCATGTGTATGGCATGTGGTTTTACATCGACAACTTATGATCCATGTGTAGACATCTCTTTGGATTTGGAACCAAATCAAGCGGGATCTGCGAAGATGGCGGCCACAAGGAATCTTCCTTGCAATGGCGAGGCAGATTGCATGAACTCGAGCCAAAACCAGCGGCTATCTTCCCTGGTGGGTTGTTTGGATCAATTCACTAGACCCGAAAAACTCGGGTCCGACCAAAAATTCTTCTGCCAACAGTGTCAAGTGCGGCAGGAATCACTCAAACAGATGTCCATAAGAAAGCTTCCATTAGTTTCTTGTTTCCACATTAAAAGATTCGAGCATTCTTCAGTTCGAAAAATGTCAAGGAAGGTGGACCGATATTTGCAGTTTCCGTTTTCTTTAGACATGGCACCTTATCTCTCATCTTCAATCTTGAGAAGTCGATTTGGTAATAGGATTTTTCCTTTTGATGGAGATGAACAAGATGGTTGTAATGAGACATCGTCTGAGTTCGAGTTGTTTGCCGTTATCACTCACAAAGGAAAATTGGACGCTGGTCATTACGTGACCTACTTGCGGCTAAGTAATCAATGGTATAAGTGCGATGATGCTTGGATTACCCAAGTCAACGAAAACATCGTCAGGGCTGCACAAGGATACATGATGTTTTATGTGCAGAAGATGCTCTATTACAAATCTAGAATCGATATTGTTTTCAGTGACAAGCTGAGCTCTAAGTGGATGCCTACACATTCTAGTAGAAGATCCAGGCGTGCCCAAACTTTTTGGGGAGGATCATTGTCAGATCCCTACTATTTGAAGGGAACTTCTGTACTTGGAGTTGTGGCTCAGCAGATTACTGATCTCTCTTCACATACCATGAAGAAGAGTTTCCAAATTGTGTTTCTGTCTTATCCCTTCTCTCTCGCCTTAGCCCTCGCAATTTTGCTCGCTCCTTTGTCCTCACCGTACTCGTCAATCGGAATCGGAATCGGAATCTCAATCTGCCGGACCTTTGTTGGCAGAATGAAGGGTGTTAGAAACTGGTTGTTTTCTCAATTAGTATCCAAGTCAGTGGTTTCATCAAGACCATTACTGGGGAGTGACAGTTTCTTTGGTGAGGAAAATAAAGAACATGTGGATGAAGACCAAGATGGCGAAGTTGCACAAGCAACCACTATTGTACCGCCCACTGCCCCTCATACATCAAACTCTGGCGGTAGTTTGGAGAGTCAAGAAGATTTGCCCTTGGAACAGTCTCAACATAGCTCTAACAGGGTAAAGGTGGATGTGTTGACAATGATTGAGGACCTCCAAGTTCAGTTCTTTCGACTTCTACAGAGAATTGGGCAGACACCGAACAATTTGCTAGTGGAAAAGGTCCTGTATCGAATACATCTAGCAACTTTGATACAGGTAGGAGAATCCGATCTCAAAAGAGTAAATTTTGAAAGAGGCAAAGCCAGAGAAAAAGCAGCTGAACAAGAAGCAGCTGGGATACCAGAATCAAGCTTCACATTTAGAGTACTTGTATTGGGAAAAACAGGAGTTGGCAAGAGTGCTACAATAAATTCCCTCTTTGATCAAGCAAAAACTGCAACGGATGCATTTCAACCTGCAACCGATCGTATTCAGGAGATTGTGGGAACGATCAATGGGATTAAAGTATCCATCATTGATACCCCCGGTCTTTCCCAATCATCCTCGGGAAATATGAAGAGAAATAAGAAAATCTTGTTTTCTGTGAAGAGATATATAAGGAAATCCCCACCAGATATCGTTTTGTACTTCGAGCGCCTTGACCTCATAAACAAGAATCATGCTGACTACTTTTTAATGAAGCTAATAAATGAGGTCTTTGGTCCTGCAATTTGGTTCAACACCATCCTAGTCTTGACACATTGTTCCTCAGCTCTTCCAGAAGGACCTGATGGATATCCTGTCTCCTTCGAATCGTATGTGTCCCATTGCTCAGAGCTTTTGCAGCAAAATATACATCAGGCAGTGTCAGACCCAAGACTTGAAAATCCCGTCCTCTTGGTCGAGAACCATCCTCAGTGTAAGAAAAATATTATGGGGGAAAAAGTTCTTCCAAATGGGCAGGTCTGGAGATCACATTTCTTGTTGTTGTGCATTTGTTATAAAGTTCTGGGCAGCATTAATACCCTATTGAAATTTCAAAATTGCATTGAGCTAGGGCCATTAGCTAATACCCGGCTGCCTTCACTTCCCCACTTACTCTCATCTATTTTACGGCACCGAAATACGACAAGTCCATCAGGTGTAGACTATGACAGTGAAGCTATTATACTCAGGGACAATGAAGAAGATGAGTACGATGATCTACCTTCGATTCGCATTCTGACGAAATCCCAATTCGATAAGCTGTCGAACTCACAGAAAAAGGAATACCTGGATGAGCTGGAATACAGGGAAACTCTATATTTAAAGAAACAGCTAAGAGAAGAGTATCAAAGGAGGAAGGAAATCAAGCTTTTAAAACATAGAGACTCGGAACACAATGATAATAATGGTGATTTGCAGCCATCGCCAGAGGCGGAAGCTGTTCTGCTTCCAGATATGGCTGTTCCACCAAGTTTTGACTCAGATTGTCCTGTTCATAGATATCGTTGCATTGCAGTAGATGATCAGTGGATTGTGAGACCTGTTCTTGACCCACAAGGATGGGACCATGATGTAGGCTTCGATGGGATAAATCTCGAAACAGCAATGGAGATGAACAAAAATGTTTTTACCTCAGTCACTGGACAGGTGAGTAAAGATAAGCGTTTCTTTAACATTCAATCTGAGTGTGCTGCTTCTTACATGGATTCTAGGGGATCTTCTTATACTTTAGGTCTAGATGTTCAATCTTCTGGTACAGATAGGATATACACTGTTCATAGTAATGCTAAGCTGGGTAGCATTAAGCACAACCATCCTGGAATTGGACTTTCTCTGATATCTTTCAAGAGAAATTGCTATTATGGTGTGAAGCTTGAAGATACCATATCTATAGGTAAGCGAGTGAGGCTCGTAGCCAATGGCGGTCGTATAGAAGGAGCAGGACAAATGGCATATGGTGGAAGCATAGTAGCTACTTTAAGAGGTGATGACTACCCAGTGAGGAATGACCATCTCAGTCTAACGATGACCGTCCTCTCTTTTGACAAGGAAACCATCCTGAGCGGGAATGTAGAGTCTGAGTTTCGGGTTAGCCGAAGCATGAGACTGTCGGTTAATGCCAACTTAAATACACGTAAAATGGGTCAGATCTGCATAAAAACAAGTAGCTGTGAGCATTTGCAGATTGCTTTGATTTCTGGTTTTACAATCTTGAGAGCCCTTCTGCGTAGAAAGGAAATTGAAACGTTGTAG
Coding sequence (CDS)
ATGGTTTTTTCTACTTCTCACTTTGCAGGCAATGGAGACTGTTGTATTGCTCATAGAGTATTTTCGGGCATATTGCGATCTGATGTCATGTGTATGGCATGTGGTTTTACATCGACAACTTATGATCCATGTGTAGACATCTCTTTGGATTTGGAACCAAATCAAGCGGGATCTGCGAAGATGGCGGCCACAAGGAATCTTCCTTGCAATGGCGAGGCAGATTGCATGAACTCGAGCCAAAACCAGCGGCTATCTTCCCTGGTGGGTTGTTTGGATCAATTCACTAGACCCGAAAAACTCGGGTCCGACCAAAAATTCTTCTGCCAACAGTGTCAAGTGCGGCAGGAATCACTCAAACAGATGTCCATAAGAAAGCTTCCATTAGTTTCTTGTTTCCACATTAAAAGATTCGAGCATTCTTCAGTTCGAAAAATGTCAAGGAAGGTGGACCGATATTTGCAGTTTCCGTTTTCTTTAGACATGGCACCTTATCTCTCATCTTCAATCTTGAGAAGTCGATTTGGTAATAGGATTTTTCCTTTTGATGGAGATGAACAAGATGGTTGTAATGAGACATCGTCTGAGTTCGAGTTGTTTGCCGTTATCACTCACAAAGGAAAATTGGACGCTGGTCATTACGTGACCTACTTGCGGCTAAGTAATCAATGGTATAAGTGCGATGATGCTTGGATTACCCAAGTCAACGAAAACATCGTCAGGGCTGCACAAGGATACATGATGTTTTATGTGCAGAAGATGCTCTATTACAAATCTAGAATCGATATTGTTTTCAGTGACAAGCTGAGCTCTAAGTGGATGCCTACACATTCTAGTAGAAGATCCAGGCGTGCCCAAACTTTTTGGGGAGGATCATTGTCAGATCCCTACTATTTGAAGGGAACTTCTGTACTTGGAGTTGTGGCTCAGCAGATTACTGATCTCTCTTCACATACCATGAAGAAGAGTTTCCAAATTGTGTTTCTGTCTTATCCCTTCTCTCTCGCCTTAGCCCTCGCAATTTTGCTCGCTCCTTTGTCCTCACCGTACTCGTCAATCGGAATCGGAATCGGAATCTCAATCTGCCGGACCTTTGTTGGCAGAATGAAGGGTGTTAGAAACTGGTTGTTTTCTCAATTAGTATCCAAGTCAGTGGTTTCATCAAGACCATTACTGGGGAGTGACAGTTTCTTTGGTGAGGAAAATAAAGAACATGTGGATGAAGACCAAGATGGCGAAGTTGCACAAGCAACCACTATTGTACCGCCCACTGCCCCTCATACATCAAACTCTGGCGGTAGTTTGGAGAGTCAAGAAGATTTGCCCTTGGAACAGTCTCAACATAGCTCTAACAGGGTAAAGGTGGATGTGTTGACAATGATTGAGGACCTCCAAGTTCAGTTCTTTCGACTTCTACAGAGAATTGGGCAGACACCGAACAATTTGCTAGTGGAAAAGGTCCTGTATCGAATACATCTAGCAACTTTGATACAGGTAGGAGAATCCGATCTCAAAAGAGTAAATTTTGAAAGAGGCAAAGCCAGAGAAAAAGCAGCTGAACAAGAAGCAGCTGGGATACCAGAATCAAGCTTCACATTTAGAGTACTTGTATTGGGAAAAACAGGAGTTGGCAAGAGTGCTACAATAAATTCCCTCTTTGATCAAGCAAAAACTGCAACGGATGCATTTCAACCTGCAACCGATCGTATTCAGGAGATTGTGGGAACGATCAATGGGATTAAAGTATCCATCATTGATACCCCCGGTCTTTCCCAATCATCCTCGGGAAATATGAAGAGAAATAAGAAAATCTTGTTTTCTGTGAAGAGATATATAAGGAAATCCCCACCAGATATCGTTTTGTACTTCGAGCGCCTTGACCTCATAAACAAGAATCATGCTGACTACTTTTTAATGAAGCTAATAAATGAGGTCTTTGGTCCTGCAATTTGGTTCAACACCATCCTAGTCTTGACACATTGTTCCTCAGCTCTTCCAGAAGGACCTGATGGATATCCTGTCTCCTTCGAATCGTATGTGTCCCATTGCTCAGAGCTTTTGCAGCAAAATATACATCAGGCAGTGTCAGACCCAAGACTTGAAAATCCCGTCCTCTTGGTCGAGAACCATCCTCAGTGTAAGAAAAATATTATGGGGGAAAAAGTTCTTCCAAATGGGCAGGTCTGGAGATCACATTTCTTGTTGTTGTGCATTTGTTATAAAGTTCTGGGCAGCATTAATACCCTATTGAAATTTCAAAATTGCATTGAGCTAGGGCCATTAGCTAATACCCGGCTGCCTTCACTTCCCCACTTACTCTCATCTATTTTACGGCACCGAAATACGACAAGTCCATCAGGTGTAGACTATGACAGTGAAGCTATTATACTCAGGGACAATGAAGAAGATGAGTACGATGATCTACCTTCGATTCGCATTCTGACGAAATCCCAATTCGATAAGCTGTCGAACTCACAGAAAAAGGAATACCTGGATGAGCTGGAATACAGGGAAACTCTATATTTAAAGAAACAGCTAAGAGAAGAGTATCAAAGGAGGAAGGAAATCAAGCTTTTAAAACATAGAGACTCGGAACACAATGATAATAATGGTGATTTGCAGCCATCGCCAGAGGCGGAAGCTGTTCTGCTTCCAGATATGGCTGTTCCACCAAGTTTTGACTCAGATTGTCCTGTTCATAGATATCGTTGCATTGCAGTAGATGATCAGTGGATTGTGAGACCTGTTCTTGACCCACAAGGATGGGACCATGATGTAGGCTTCGATGGGATAAATCTCGAAACAGCAATGGAGATGAACAAAAATGTTTTTACCTCAGTCACTGGACAGGTGAGTAAAGATAAGCGTTTCTTTAACATTCAATCTGAGTGTGCTGCTTCTTACATGGATTCTAGGGGATCTTCTTATACTTTAGGTCTAGATGTTCAATCTTCTGGTACAGATAGGATATACACTGTTCATAGTAATGCTAAGCTGGGTAGCATTAAGCACAACCATCCTGGAATTGGACTTTCTCTGATATCTTTCAAGAGAAATTGCTATTATGGTGTGAAGCTTGAAGATACCATATCTATAGGTAAGCGAGTGAGGCTCGTAGCCAATGGCGGTCGTATAGAAGGAGCAGGACAAATGGCATATGGTGGAAGCATAGTAGCTACTTTAAGAGGTGATGACTACCCAGTGAGGAATGACCATCTCAGTCTAACGATGACCGTCCTCTCTTTTGACAAGGAAACCATCCTGAGCGGGAATGTAGAGTCTGAGTTTCGGGTTAGCCGAAGCATGAGACTGTCGGTTAATGCCAACTTAAATACACGTAAAATGGGTCAGATCTGCATAAAAACAAGTAGCTGTGAGCATTTGCAGATTGCTTTGATTTCTGGTTTTACAATCTTGAGAGCCCTTCTGCGTAGAAAGGAAATTGAAACGTTGTAG
Protein sequence
MVFSTSHFAGNGDCCIAHRVFSGILRSDVMCMACGFTSTTYDPCVDISLDLEPNQAGSAKMAATRNLPCNGEADCMNSSQNQRLSSLVGCLDQFTRPEKLGSDQKFFCQQCQVRQESLKQMSIRKLPLVSCFHIKRFEHSSVRKMSRKVDRYLQFPFSLDMAPYLSSSILRSRFGNRIFPFDGDEQDGCNETSSEFELFAVITHKGKLDAGHYVTYLRLSNQWYKCDDAWITQVNENIVRAAQGYMMFYVQKMLYYKSRIDIVFSDKLSSKWMPTHSSRRSRRAQTFWGGSLSDPYYLKGTSVLGVVAQQITDLSSHTMKKSFQIVFLSYPFSLALALAILLAPLSSPYSSIGIGIGISICRTFVGRMKGVRNWLFSQLVSKSVVSSRPLLGSDSFFGEENKEHVDEDQDGEVAQATTIVPPTAPHTSNSGGSLESQEDLPLEQSQHSSNRVKVDVLTMIEDLQVQFFRLLQRIGQTPNNLLVEKVLYRIHLATLIQVGESDLKRVNFERGKAREKAAEQEAAGIPESSFTFRVLVLGKTGVGKSATINSLFDQAKTATDAFQPATDRIQEIVGTINGIKVSIIDTPGLSQSSSGNMKRNKKILFSVKRYIRKSPPDIVLYFERLDLINKNHADYFLMKLINEVFGPAIWFNTILVLTHCSSALPEGPDGYPVSFESYVSHCSELLQQNIHQAVSDPRLENPVLLVENHPQCKKNIMGEKVLPNGQVWRSHFLLLCICYKVLGSINTLLKFQNCIELGPLANTRLPSLPHLLSSILRHRNTTSPSGVDYDSEAIILRDNEEDEYDDLPSIRILTKSQFDKLSNSQKKEYLDELEYRETLYLKKQLREEYQRRKEIKLLKHRDSEHNDNNGDLQPSPEAEAVLLPDMAVPPSFDSDCPVHRYRCIAVDDQWIVRPVLDPQGWDHDVGFDGINLETAMEMNKNVFTSVTGQVSKDKRFFNIQSECAASYMDSRGSSYTLGLDVQSSGTDRIYTVHSNAKLGSIKHNHPGIGLSLISFKRNCYYGVKLEDTISIGKRVRLVANGGRIEGAGQMAYGGSIVATLRGDDYPVRNDHLSLTMTVLSFDKETILSGNVESEFRVSRSMRLSVNANLNTRKMGQICIKTSSCEHLQIALISGFTILRALLRRKEIETL
Homology
BLAST of CmaCh05G012660 vs. ExPASy Swiss-Prot
Match:
Q6S5G3 (Translocase of chloroplast 90, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=TOC90 PE=1 SV=1)
HSP 1 Score: 795.0 bits (2052), Expect = 1.1e-228
Identity = 416/791 (52.59%), Postives = 562/791 (71.05%), Query Frame = 0
Query: 368 MKGVRNWLFSQLVSKSVVSSRPLLGSDSFFGEENKEHVDEDQDGEVAQATTIVPPTAPHT 427
MKG ++W+F+ +S S+ SSRPLLGSD FF + ++E + Q Q T+ P +
Sbjct: 1 MKGFKDWVFA--LSNSMASSRPLLGSDPFFRDPHQEQDNHSQAPAAPQPVTLSEPPCSTS 60
Query: 428 SNSGGSLE-----SQEDLPLEQSQHSS---NRVKVDVLTMIEDLQVQFFRLLQRIGQTPN 487
S+ LE SQ+ +PLE SS N K + L I LQVQF RL+QR GQ+ N
Sbjct: 61 SD----LEILPPLSQQQVPLESLYQSSIDLNGKKHNPLAKIGGLQVQFLRLVQRFGQSQN 120
Query: 488 NLLVEKVLYRIHLATLIQVGESDLKRVNFERGKAREKAAEQEAAGIPESSFTFRVLVLGK 547
N+LV KVLYR+HLA LI+ ES+LK V + +A+ A EQE++GIPE F+ R+LVLGK
Sbjct: 121 NILVSKVLYRVHLAMLIRAEESELKNVKLRQDRAKALAREQESSGIPELDFSLRILVLGK 180
Query: 548 TGVGKSATINSLFDQAKTATDAFQPATDRIQEIVGTINGIKVSIIDTPGLSQSSSGNMKR 607
TGVGKSATINS+F Q K+ TDAF+P TDRI+E++GT++G+KV+ IDTPG SS + ++
Sbjct: 181 TGVGKSATINSIFGQPKSETDAFRPGTDRIEEVMGTVSGVKVTFIDTPGFHPLSSSSTRK 240
Query: 608 NKKILFSVKRYIRKSPPDIVLYFERLDLINKNHADYFLMKLINEVFGPAIWFNTILVLTH 667
N+KIL S+KRY++K PPD+VLY +RLD+I+ ++D+ L++LI E+FG AIW NTILV+TH
Sbjct: 241 NRKILLSIKRYVKKRPPDVVLYLDRLDMIDMRYSDFSLLQLITEIFGAAIWLNTILVMTH 300
Query: 668 CSSALPEGPDGYPVSFESYVSHCSELLQQNIHQAVSDPRLENPVLLVENHPQCKKNIMGE 727
S+A EG +G V++ESYV +++Q IHQAVSD +LENPVLLVENHP CKKN+ GE
Sbjct: 301 -SAATTEGRNGQSVNYESYVGQRMDVVQHYIHQAVSDTKLENPVLLVENHPSCKKNLAGE 360
Query: 728 KVLPNGQVWRSHFLLLCICYKVLGSINTLLKFQNCIELGPLANTRLPSLPHLLSSILRHR 787
VLPNG VW+ F+ LC+C KVLG + +LL+F++ I LG ++TR SLPHLLS LR R
Sbjct: 361 YVLPNGVVWKPQFMFLCVCTKVLGDVQSLLRFRDSIGLGQPSSTRTASLPHLLSVFLRRR 420
Query: 788 NTTSPSGVDYDSEAIILRD-NEEDEYDDLPSIRILTKSQFDKLSNSQKKEYLDELEYRET 847
++ + + + ++ D EEDEYD LP+IRIL KS+F+KLS SQKKEYLDEL+YRET
Sbjct: 421 LSSGADETEKEIDKLLNLDLEEEDEYDQLPTIRILGKSRFEKLSKSQKKEYLDELDYRET 480
Query: 848 LYLKKQLREEYQRRKEIKLLKHRDSEHNDNNGDLQPSPEAEAVLLPDMAVPPSFDSDCPV 907
LYLKKQL+EE +RR++ KL++ + E + + AV LPDMA P SFDSD P
Sbjct: 481 LYLKKQLKEECRRRRDEKLVEEENLEDTEQR-------DQAAVPLPDMAGPDSFDSDFPA 540
Query: 908 HRYRCIAVDDQWIVRPVLDPQGWDHDVGFDGINLETAMEMNKNVFTSVTGQVSKDKRFFN 967
HRYRC++ DQW+VRPV DPQGWD DVGFDGIN+ETA ++N+N+F S TGQVS+DK+ F
Sbjct: 541 HRYRCVSAGDQWLVRPVYDPQGWDRDVGFDGINIETAAKINRNLFASATGQVSRDKQRFT 600
Query: 968 IQSECAASY-MDSRGSSYTLGLDVQSSGTDRIYTVHSNAKLGSIKHNHPGIGLSLISFKR 1027
IQSE A+Y + R ++++ +D+QSSG D +Y+ KL + KHN +G+ L SF
Sbjct: 601 IQSETNAAYTRNFREQTFSVAVDLQSSGEDLVYSFQGGTKLQTFKHNTTDVGVGLTSFGG 660
Query: 1028 NCYYGVKLEDTISIGKRVRLVANGGRIEGAGQMAYGGSIVATLRGDDYPVRNDHLSLTMT 1087
Y G KLEDT+ +GKRV+L AN G++ G+GQ A GGS A +RG DYPVRN+ + LTMT
Sbjct: 661 KYYVGGKLEDTLLVGKRVKLTANAGQMRGSGQTANGGSFEACIRGRDYPVRNEQIGLTMT 720
Query: 1088 VLSFDKETILSGNVESEFRVSRSMRLSVNANLNTRKMGQICIKTSSCEHLQIALISGFTI 1147
LSF +E +L+ ++++FR +R + VN N+N RKMG+I +K +S EH +IALIS T+
Sbjct: 721 ALSFKRELVLNYGLQTQFRPARGTNIDVNINMNNRKMGKINVKLNSSEHWEIALISALTM 777
Query: 1148 LRALLRRKEIE 1149
+AL+RR + E
Sbjct: 781 FKALVRRSKTE 777
BLAST of CmaCh05G012660 vs. ExPASy Swiss-Prot
Match:
A9SV59 (Translocase of chloroplast 101, chloroplastic OS=Physcomitrium patens OX=3218 GN=TOC101 PE=3 SV=1)
HSP 1 Score: 573.2 bits (1476), Expect = 6.8e-162
Identity = 319/744 (42.88%), Postives = 461/744 (61.96%), Query Frame = 0
Query: 416 ATTIVPPTAPHTSNSGGSLESQEDLPLEQSQHSSNRVKVDVLTMIEDLQVQFFRLLQRIG 475
A T + T G+ +Q E++ +S + +++++V+F RL R+G
Sbjct: 170 AATALDTAGRITQRPNGAPSTQLTATTEENANSDTAEGNETREKLQNIRVKFLRLAHRLG 229
Query: 476 QTPNNLLVEKVLYRIHLATLIQVGESDLKRVNFERGKAREKAAEQEAAGIPES-SFTFRV 535
Q+P N++V +VLYR+ LA ++ G + + F +A A EQEAA E F +
Sbjct: 230 QSPQNVVVAQVLYRLGLAESLRGGNTSNRAGAFSFDRANALAEEQEAANQEEELDFACTI 289
Query: 536 LVLGKTGVGKSATINSLFDQAKTATDAFQPATDRIQEIVGTINGIKVSIIDTPGLSQSSS 595
LVLGKTGVGKSATINS+FD K+ T AF+P+T+++QEIVGT++GIKV +IDTPGL S +
Sbjct: 290 LVLGKTGVGKSATINSIFDDRKSVTSAFKPSTNKVQEIVGTVHGIKVRVIDTPGLLPSVA 349
Query: 596 GNMKRNKKILFSVKRYIRKSPPDIVLYFERLDLINKNHADYFLMKLINEVFGPAIWFNTI 655
+ + N++I+ VK++I+K+ PDIVLYF+RLD+ +++ D L+K I ++FG A+WFN I
Sbjct: 350 -DQQHNERIMGQVKKHIKKASPDIVLYFDRLDMQSRDFGDLPLLKTITDLFGAAVWFNAI 409
Query: 656 LVLTHCSSALPEGPDGYPVSFESYVSHCSELLQQNIHQAVSDPRLENPVLLVENHPQCKK 715
+VLTH SSA P+GP+G P+S+E +V+ S ++QQ I QA D RL NPV LVENHP C+
Sbjct: 410 VVLTHASSAPPDGPNGVPLSYEMFVAQRSHVVQQTIRQAAGDMRLMNPVSLVENHPACRT 469
Query: 716 NIMGEKVLPNGQVWRSHFLLLCICYKVLGSINTLLKFQNCIELG-PLA-NTRLPSLPHLL 775
N G++VLPNGQ+W+ LLLC K+L N+LLK Q G P +R+P LP LL
Sbjct: 470 NRNGQRVLPNGQIWKPQLLLLCFASKILAEANSLLKLQETATPGRPFGQRSRVPPLPFLL 529
Query: 776 SSILRHR----------NTTSPSGVDYDSEAIILRDNEEDEYDDLPSIRILTKSQFDKLS 835
SS+L+ R + + S D + E D+E D+YD+LP R L+K + ++L+
Sbjct: 530 SSLLQSRAQLKLPDEQLDESDESDDDEEEE-----DSEADDYDELPPFRPLSKEELEELT 589
Query: 836 NSQKKEYLDELEYRETLYLKKQLREEYQRRKEIKLLKHRDSEHN----DNNGDLQPSPEA 895
Q+++Y+DEL RE L+ KKQ REE +RRKE+K + + S+ D D P A
Sbjct: 590 KEQRQDYMDELADRERLFQKKQYREEMRRRKEMKKRQAQMSKEELAQPDEADDEAGQPAA 649
Query: 896 EAVLLPDMAVPPSFDSDCPVHRYRCIAVDDQWIVRPVLDPQGWDHDVGFDGINLETAMEM 955
V +PDMA+PPSFDSD P HRYR + +QW+VRPVL+ GWDHD G+DG N+E +
Sbjct: 650 VPVPMPDMALPPSFDSDNPTHRYRYLETANQWLVRPVLETHGWDHDAGYDGFNVEKMFVV 709
Query: 956 NKNVFTSVTGQVSKDKRFFNIQSECAASYMDSRGSSYTLGLDVQSSGTDRIYTVHSNAKL 1015
+ S++GQV+KDK+ + E AAS G G DVQ+ G D YT+ + +
Sbjct: 710 KNKIPASISGQVTKDKKESQVNFEAAASLKHGEGKVTLTGFDVQTIGKDLAYTLRAETRF 769
Query: 1016 GSIKHNHPGIGLSLISFKRNCYYGVKLEDTISIGKRVRLVANGGRIEGAGQMAYGGSIVA 1075
+ K N G++ GVKLED I IGKRV++V NGG + G G A+GGS+ A
Sbjct: 770 NNFKRNKTTAGVTATYLNDTIAAGVKLEDRILIGKRVKMVVNGGVLTGKGDKAFGGSLEA 829
Query: 1076 TLRGDDYPVRNDHLSLTMTVLSFDKETILSGNVESEFRVSRSMRLSVNANLNTRKMGQIC 1135
TLRG +YP+ +L ++V+ + + + GN++S+F V ++M + ANLN R GQ+
Sbjct: 830 TLRGKEYPLSRTLSTLGLSVMDWHGDLAIGGNLQSQFMVGKTMMVG-RANLNNRGSGQVS 889
Query: 1136 IKTSSCEHLQIALISGFTILRALL 1143
I+ SS E LQ+ LI ILR+L+
Sbjct: 890 IRASSSEQLQMVLIGIVPILRSLI 906
BLAST of CmaCh05G012660 vs. ExPASy Swiss-Prot
Match:
A9SY65 (Translocase of chloroplast 108, chloroplastic OS=Physcomitrium patens OX=3218 GN=TOC108 PE=3 SV=1)
HSP 1 Score: 567.4 bits (1461), Expect = 3.7e-160
Identity = 315/742 (42.45%), Postives = 458/742 (61.73%), Query Frame = 0
Query: 416 ATTIVPPTAPHTSNSGGSLESQEDLPLEQSQHSSNRVKVDVLTMIEDLQVQFFRLLQRIG 475
A T +T G+L +Q ++S S + +++++V+F RL R+G
Sbjct: 246 AATASDSPGRNTQRPNGALSTQITSTTDESASSDAAEGDETREKLQNIRVKFLRLAHRLG 305
Query: 476 QTPNNLLVEKVLYRIHLATLIQVGESDLKRVNFERGKAREKAAEQEAAGIPES-SFTFRV 535
Q+P N++V +VLYR+ LA ++ G + + F +A A EQEAA E F +
Sbjct: 306 QSPQNVVVAQVLYRLGLAESLRGGSAPNRSGAFSFDRANALAEEQEAANQEEELDFACTI 365
Query: 536 LVLGKTGVGKSATINSLFDQAKTATDAFQPATDRIQEIVGTINGIKVSIIDTPGLSQSSS 595
LVLGKTGVGKS+TINS+FD+ K+ T AF+P+T+++QE++GT++GIKV +IDTPGL S +
Sbjct: 366 LVLGKTGVGKSSTINSIFDERKSVTSAFKPSTNKVQEVIGTVHGIKVRVIDTPGLLPSVA 425
Query: 596 GNMKRNKKILFSVKRYIRKSPPDIVLYFERLDLINKNHADYFLMKLINEVFGPAIWFNTI 655
+ + N++I+ VK+YI+K+ PDIVLYF+RLD+ +++ D L++ I ++FG A+WFN I
Sbjct: 426 -DQQHNERIMGQVKKYIKKASPDIVLYFDRLDMQSRDFGDLPLLRTITDLFGAAVWFNAI 485
Query: 656 LVLTHCSSALPEGPDGYPVSFESYVSHCSELLQQNIHQAVSDPRLENPVLLVENHPQCKK 715
+VLTH SSA P+GP+G P+S+E +V+ S ++QQ I QA D RL NPV LVENHP C+
Sbjct: 486 VVLTHASSAPPDGPNGVPLSYEMFVAQRSHVVQQTIRQAAGDMRLMNPVSLVENHPACRT 545
Query: 716 NIMGEKVLPNGQVWRSHFLLLCICYKVLGSINTLLKFQNCIELG-PLA-NTRLPSLPHLL 775
N G++VLPNGQ+W+ LLLC K+L N+LLK Q G P +R+P LP LL
Sbjct: 546 NRTGQRVLPNGQIWKPQLLLLCFASKILAEANSLLKLQETTAPGRPFGQRSRVPPLPFLL 605
Query: 776 SSILRHR--------NTTSPSGVDYDSEAIILRDNEEDEYDDLPSIRILTKSQFDKLSNS 835
SS+L+ R D D E D++ D+YD+LP R L+K + + L+
Sbjct: 606 SSLLQSRAQLKLPDEQAGESDESDDDEEE---EDSDADDYDELPPFRPLSKEELEDLTKE 665
Query: 836 QKKEYLDELEYRETLYLKKQLREEYQRRKEIKLLKHRDSEHN----DNNGDLQPSPEAEA 895
Q+++Y++EL RE ++ KKQ REE +RRKE K + + S+ + D + A
Sbjct: 666 QREDYMEELADRERMFQKKQYREEIRRRKEAKKRQAQMSKEELAEAEEAEDEAGNAAAVP 725
Query: 896 VLLPDMAVPPSFDSDCPVHRYRCIAVDDQWIVRPVLDPQGWDHDVGFDGINLETAMEMNK 955
V +PDMA+PPSFDSD P HRYR + +QW+VRPVL+ GWDHD G+DG N+E + +
Sbjct: 726 VPMPDMALPPSFDSDNPTHRYRYLETANQWLVRPVLETHGWDHDAGYDGFNVEKMFVVKE 785
Query: 956 NVFTSVTGQVSKDKRFFNIQSECAASYMDSRGSSYTLGLDVQSSGTDRIYTVHSNAKLGS 1015
+ SV+GQV+KDK+ + E AAS G G DVQ+ G D YTV + + +
Sbjct: 786 KIPASVSGQVTKDKKEAQVNFEAAASLRHGEGKVTLTGFDVQTIGKDLAYTVRAETRFNN 845
Query: 1016 IKHNHPGIGLSLISFKRNCYYGVKLEDTISIGKRVRLVANGGRIEGAGQMAYGGSIVATL 1075
K N G++ GVKLED + IGKRV+LV NGG + G G AYGGS+ ATL
Sbjct: 846 FKRNKTTAGVTATYLNDTIAAGVKLEDRVLIGKRVKLVVNGGVLTGKGDKAYGGSLEATL 905
Query: 1076 RGDDYPVRNDHLSLTMTVLSFDKETILSGNVESEFRVSRSMRLSVNANLNTRKMGQICIK 1135
RG +YP+ +L ++V+ + + + GN++S+F V ++M + ANLN R GQ+ I+
Sbjct: 906 RGKEYPLSRTLSTLGLSVMDWHGDLAIGGNLQSQFMVGKTMMVG-RANLNNRGSGQVSIR 965
Query: 1136 TSSCEHLQIALISGFTILRALL 1143
SS E LQ+ LI ILR+L+
Sbjct: 966 ASSSEQLQMVLIGIVPILRSLI 982
BLAST of CmaCh05G012660 vs. ExPASy Swiss-Prot
Match:
A9SY64 (Translocase of chloroplast 125, chloroplastic OS=Physcomitrium patens OX=3218 GN=TOC125 PE=2 SV=1)
HSP 1 Score: 564.3 bits (1453), Expect = 3.1e-159
Identity = 316/740 (42.70%), Postives = 448/740 (60.54%), Query Frame = 0
Query: 416 ATTIVPPTAPHTSNSGGSLESQEDLPLEQSQHSSNRVKVDVLTMIEDLQVQFFRLLQRIG 475
ATT P+T++S S + D + +N ++ +++++++F RL +R+
Sbjct: 397 ATTATGVPRPNTASSTQS-AATSDASISSESSEANEIR----EKLQNIRIKFLRLARRLN 456
Query: 476 QTPNNLLVEKVLYRIHLATLIQVGESDLKRVNFERGKAREKAAEQEAAGIPESSFTFRVL 535
Q+P N++V +VLYR+ LA ++ G S + F A A EQEAA + F +L
Sbjct: 457 QSPQNVVVAQVLYRLGLAESLRGGSSLNRTRAFSFDHANALAEEQEAAKYEDLDFACTIL 516
Query: 536 VLGKTGVGKSATINSLFDQAKTATDAFQPATDRIQEIVGTINGIKVSIIDTPGLSQSSSG 595
VLGKTGVGKSATINS+FD+ KT T A+ P+T ++ E+ GT+ G+KV IDTPGL S++
Sbjct: 517 VLGKTGVGKSATINSIFDECKTVTSAYYPSTTKVHEVSGTVLGVKVRFIDTPGLLPSTA- 576
Query: 596 NMKRNKKILFSVKRYIRKSPPDIVLYFERLDLINKNHADYFLMKLINEVFGPAIWFNTIL 655
+ + NK I+ VK+YI+K PDIVLYF+R+D+ ++ D L++ I +VFG A+WFN +
Sbjct: 577 DQRHNKNIMRQVKKYIKKVSPDIVLYFDRMDMQTRDSGDVPLLRTITDVFGAAVWFNATV 636
Query: 656 VLTHCSSALPEGPDGYPVSFESYVSHCSELLQQNIHQAVSDPRLENPVLLVENHPQCKKN 715
VLTH S A P+G +G P+S++ +V+ S +QQ I QA D RL+NPV LVENHP C+ N
Sbjct: 637 VLTHASKAPPDGSNGTPMSYDYFVAQRSHFVQQTIRQAAGDARLQNPVSLVENHPACRIN 696
Query: 716 IMGEKVLPNGQVWRSHFLLLCICYKVLGSINTLLKFQNCIELG-PLA-NTRLPSLPHLLS 775
G++VLPNGQ W+ LLLC K+L NTLLK Q G P +R+P LP+LLS
Sbjct: 697 RSGQRVLPNGQPWKQQLLLLCFASKILAEANTLLKLQEASTPGKPFGQRSRVPPLPYLLS 756
Query: 776 SILRHR------NTTSPSGVDYDSEAIILRDNEEDEYDDLPSIRILTKSQFDKLSNSQKK 835
S+L+ R + D D ++ + E DEYDDLP R L+K + + LS Q++
Sbjct: 757 SLLQSRAQLKMPDEQHGESEDSDDDSDEEDEEEGDEYDDLPPFRPLSKQELEDLSKEQRQ 816
Query: 836 EYLDELEYRETLYLKKQLREEYQRRKEIK-----LLKHRDSEHNDNNGDLQPSPEAEAVL 895
EY +EL RE L+ KKQ RE+ +RR+E K + K S D D P AV
Sbjct: 817 EYAEELADRERLFQKKQYREQIRRRRERKKQASVMSKEEPSIPGDGAEDESGQPATVAVP 876
Query: 896 LPDMAVPPSFDSDCPVHRYRCIAVDDQWIVRPVLDPQGWDHDVGFDGINLETAMEMNKNV 955
+PDMA+PPSFDSD P HRYR + +QW+VRPVL+ GWDHD G+DG N+E + + +
Sbjct: 877 MPDMALPPSFDSDNPTHRYRYLETANQWLVRPVLETHGWDHDAGYDGFNVEKMFVVKEKI 936
Query: 956 FTSVTGQVSKDKRFFNIQSECAASYMDSRGSSYTLGLDVQSSGTDRIYTVHSNAKLGSIK 1015
SV+GQV+KDK+ + E AAS G G DVQ+ G D YTV + + + K
Sbjct: 937 PASVSGQVTKDKKEAQVNFEAAASLRHGEGKVTLTGFDVQTIGKDLAYTVRAETRFNNFK 996
Query: 1016 HNHPGIGLSLISFKRNCYYGVKLEDTISIGKRVRLVANGGRIEGAGQMAYGGSIVATLRG 1075
N G++ GVKLED + IGKRV+LV NGG + G G AYGGS+ ATLRG
Sbjct: 997 RNKTTAGVTATYLNDTIAAGVKLEDRVLIGKRVKLVVNGGVLTGKGDKAYGGSLEATLRG 1056
Query: 1076 DDYPVRNDHLSLTMTVLSFDKETILSGNVESEFRVSRSMRLSVNANLNTRKMGQICIKTS 1135
+YP+ +L ++V+ + + + GN++S+F V ++M + ANLN R GQ+ I+ S
Sbjct: 1057 KEYPLSRTLSTLGLSVMDWHGDLAIGGNLQSQFMVGKTMMVG-RANLNNRGSGQVSIRAS 1116
Query: 1136 SCEHLQIALISGFTILRALL 1143
S E LQ+ LI ILR+L+
Sbjct: 1117 SSEQLQMVLIGIVPILRSLI 1129
BLAST of CmaCh05G012660 vs. ExPASy Swiss-Prot
Match:
A9SV60 (Translocase of chloroplast 126, chloroplastic OS=Physcomitrium patens OX=3218 GN=TOC126 PE=3 SV=1)
HSP 1 Score: 562.0 bits (1447), Expect = 1.6e-158
Identity = 308/695 (44.32%), Postives = 438/695 (63.02%), Query Frame = 0
Query: 460 IEDLQVQFFRLLQRIGQTPNNLLVEKVLYRIHLATLIQVGESDLKRVNFERGKAREKAAE 519
+++++V+F RL+ R+GQ+P N++V +VLYR+ LA ++ G + F+ +A A E
Sbjct: 444 LQNIRVKFLRLVHRLGQSPQNVVVAQVLYRLGLAESLRGGSTRNHTRAFDFDRANAIAEE 503
Query: 520 QEAAGIPES-SFTFRVLVLGKTGVGKSATINSLFDQAKTATDAFQPATDRIQEIVGTING 579
QEA E F +LVLGKTGVGKSATINS+FD+ K+ T+A+ P+T + E+VGT+ G
Sbjct: 504 QEADNQEEELDFACTILVLGKTGVGKSATINSIFDEHKSVTNAYNPSTTNVYEVVGTMLG 563
Query: 580 IKVSIIDTPGLSQSSSGNMKRNKKILFSVKRYIRKSPPDIVLYFERLDLINKNHADYFLM 639
+KV +DTPGL S + + N++I+ VK+YI+K+ PDIVLYF+R+D+ + D L+
Sbjct: 564 VKVRFVDTPGL-LFSVADQRHNERIMGRVKKYIKKASPDIVLYFDRMDMQTREFGDVPLL 623
Query: 640 KLINEVFGPAIWFNTILVLTHCSSALPEGPDGYPVSFESYVSHCSELLQQNIHQAVSDPR 699
+ I VFG A+WFNTI+VLTH S+A P+GP+G P+ +E +V+ S +QQ+I Q D R
Sbjct: 624 RTITNVFGTAVWFNTIVVLTHASTAPPDGPNGTPMGYELFVAQRSHSVQQSIRQVAGDMR 683
Query: 700 LENPVLLVENHPQCKKNIMGEKVLPNGQVWRSHFLLLCICYKVLGSINTLLKFQNCIELG 759
L+NPV LVENHP C+ N G++VLPNGQ+W+ H +LLC K+L NTLLK Q+ G
Sbjct: 684 LQNPVSLVENHPACRANRNGQRVLPNGQIWKPHLMLLCFASKILAEANTLLKLQDTAAPG 743
Query: 760 -PLA-NTRLPSLPHLLSSILRHRNTTS--PSGVDYDSEAIILRDNEE--DEYDDLPSIRI 819
P +R+P LP LLSS+L+ R +D E+ ++EE DEYDDLP R
Sbjct: 744 RPFGQRSRVPPLPFLLSSLLQSRAQLKLPDEQLDESDESDDDEEDEEEGDEYDDLPPFRS 803
Query: 820 LTKSQFDKLSNSQKKEYLDELEYRETLYLKKQLREEYQRRKEIK-----LLKHRDSEHND 879
L+K + ++LS Q++EY +EL RE L+ KKQ RE+ QRRKE+K + K S D
Sbjct: 804 LSKEELEELSKDQRQEYAEELAVRERLFQKKQHREQLQRRKEMKKRATAMRKEGLSHPAD 863
Query: 880 NNGDLQPSPEAEAVLLPDMAVPPSFDSDCPVHRYRCIAVDDQWIVRPVLDPQGWDHDVGF 939
D P A V +PDMA+PPSFDSD P HRYR + +QW+VRPVL+ GWDHD G+
Sbjct: 864 EADDEAGQPAAVPVPMPDMALPPSFDSDNPTHRYRYLETANQWLVRPVLETHGWDHDAGY 923
Query: 940 DGINLETAMEMNKNVFTSVTGQVSKDKRFFNIQSECAASYMDSRGSSYTLGLDVQSSGTD 999
DG N+E + + S++GQV+KDK+ + E AAS G G DVQ+ G D
Sbjct: 924 DGFNVEKMFVVKNKIPASISGQVTKDKKESQVNFEAAASLKHGEGKVTLTGFDVQTIGKD 983
Query: 1000 RIYTVHSNAKLGSIKHNHPGIGLSLISFKRNCYYGVKLEDTISIGKRVRLVANGGRIEGA 1059
YT+ + + + K N G++ GVKLED I IGKRV++V NGG + G
Sbjct: 984 LAYTLRAETRFNNFKRNKTTAGVTATYLNDTIAAGVKLEDRILIGKRVKMVVNGGVLTGK 1043
Query: 1060 GQMAYGGSIVATLRGDDYPVRNDHLSLTMTVLSFDKETILSGNVESEFRVSRSMRLSVNA 1119
G A+GGS+ ATLRG +YP+ +L ++V+ + + + GN++S+F V ++M + A
Sbjct: 1044 GDKAFGGSLEATLRGKEYPLSRTLSTLGLSVMDWHGDLAIGGNLQSQFMVGKTMMVG-RA 1103
Query: 1120 NLNTRKMGQICIKTSSCEHLQIALISGFTILRALL 1143
NLN R GQ+ I+ SS E LQ+ LI ILR+L+
Sbjct: 1104 NLNNRGSGQVSIRASSSEQLQMVLIGIVPILRSLI 1136
BLAST of CmaCh05G012660 vs. TAIR 10
Match:
AT5G20300.1 (Avirulence induced gene (AIG1) family protein )
HSP 1 Score: 795.0 bits (2052), Expect = 7.8e-230
Identity = 416/791 (52.59%), Postives = 562/791 (71.05%), Query Frame = 0
Query: 368 MKGVRNWLFSQLVSKSVVSSRPLLGSDSFFGEENKEHVDEDQDGEVAQATTIVPPTAPHT 427
MKG ++W+F+ +S S+ SSRPLLGSD FF + ++E + Q Q T+ P +
Sbjct: 1 MKGFKDWVFA--LSNSMASSRPLLGSDPFFRDPHQEQDNHSQAPAAPQPVTLSEPPCSTS 60
Query: 428 SNSGGSLE-----SQEDLPLEQSQHSS---NRVKVDVLTMIEDLQVQFFRLLQRIGQTPN 487
S+ LE SQ+ +PLE SS N K + L I LQVQF RL+QR GQ+ N
Sbjct: 61 SD----LEILPPLSQQQVPLESLYQSSIDLNGKKHNPLAKIGGLQVQFLRLVQRFGQSQN 120
Query: 488 NLLVEKVLYRIHLATLIQVGESDLKRVNFERGKAREKAAEQEAAGIPESSFTFRVLVLGK 547
N+LV KVLYR+HLA LI+ ES+LK V + +A+ A EQE++GIPE F+ R+LVLGK
Sbjct: 121 NILVSKVLYRVHLAMLIRAEESELKNVKLRQDRAKALAREQESSGIPELDFSLRILVLGK 180
Query: 548 TGVGKSATINSLFDQAKTATDAFQPATDRIQEIVGTINGIKVSIIDTPGLSQSSSGNMKR 607
TGVGKSATINS+F Q K+ TDAF+P TDRI+E++GT++G+KV+ IDTPG SS + ++
Sbjct: 181 TGVGKSATINSIFGQPKSETDAFRPGTDRIEEVMGTVSGVKVTFIDTPGFHPLSSSSTRK 240
Query: 608 NKKILFSVKRYIRKSPPDIVLYFERLDLINKNHADYFLMKLINEVFGPAIWFNTILVLTH 667
N+KIL S+KRY++K PPD+VLY +RLD+I+ ++D+ L++LI E+FG AIW NTILV+TH
Sbjct: 241 NRKILLSIKRYVKKRPPDVVLYLDRLDMIDMRYSDFSLLQLITEIFGAAIWLNTILVMTH 300
Query: 668 CSSALPEGPDGYPVSFESYVSHCSELLQQNIHQAVSDPRLENPVLLVENHPQCKKNIMGE 727
S+A EG +G V++ESYV +++Q IHQAVSD +LENPVLLVENHP CKKN+ GE
Sbjct: 301 -SAATTEGRNGQSVNYESYVGQRMDVVQHYIHQAVSDTKLENPVLLVENHPSCKKNLAGE 360
Query: 728 KVLPNGQVWRSHFLLLCICYKVLGSINTLLKFQNCIELGPLANTRLPSLPHLLSSILRHR 787
VLPNG VW+ F+ LC+C KVLG + +LL+F++ I LG ++TR SLPHLLS LR R
Sbjct: 361 YVLPNGVVWKPQFMFLCVCTKVLGDVQSLLRFRDSIGLGQPSSTRTASLPHLLSVFLRRR 420
Query: 788 NTTSPSGVDYDSEAIILRD-NEEDEYDDLPSIRILTKSQFDKLSNSQKKEYLDELEYRET 847
++ + + + ++ D EEDEYD LP+IRIL KS+F+KLS SQKKEYLDEL+YRET
Sbjct: 421 LSSGADETEKEIDKLLNLDLEEEDEYDQLPTIRILGKSRFEKLSKSQKKEYLDELDYRET 480
Query: 848 LYLKKQLREEYQRRKEIKLLKHRDSEHNDNNGDLQPSPEAEAVLLPDMAVPPSFDSDCPV 907
LYLKKQL+EE +RR++ KL++ + E + + AV LPDMA P SFDSD P
Sbjct: 481 LYLKKQLKEECRRRRDEKLVEEENLEDTEQR-------DQAAVPLPDMAGPDSFDSDFPA 540
Query: 908 HRYRCIAVDDQWIVRPVLDPQGWDHDVGFDGINLETAMEMNKNVFTSVTGQVSKDKRFFN 967
HRYRC++ DQW+VRPV DPQGWD DVGFDGIN+ETA ++N+N+F S TGQVS+DK+ F
Sbjct: 541 HRYRCVSAGDQWLVRPVYDPQGWDRDVGFDGINIETAAKINRNLFASATGQVSRDKQRFT 600
Query: 968 IQSECAASY-MDSRGSSYTLGLDVQSSGTDRIYTVHSNAKLGSIKHNHPGIGLSLISFKR 1027
IQSE A+Y + R ++++ +D+QSSG D +Y+ KL + KHN +G+ L SF
Sbjct: 601 IQSETNAAYTRNFREQTFSVAVDLQSSGEDLVYSFQGGTKLQTFKHNTTDVGVGLTSFGG 660
Query: 1028 NCYYGVKLEDTISIGKRVRLVANGGRIEGAGQMAYGGSIVATLRGDDYPVRNDHLSLTMT 1087
Y G KLEDT+ +GKRV+L AN G++ G+GQ A GGS A +RG DYPVRN+ + LTMT
Sbjct: 661 KYYVGGKLEDTLLVGKRVKLTANAGQMRGSGQTANGGSFEACIRGRDYPVRNEQIGLTMT 720
Query: 1088 VLSFDKETILSGNVESEFRVSRSMRLSVNANLNTRKMGQICIKTSSCEHLQIALISGFTI 1147
LSF +E +L+ ++++FR +R + VN N+N RKMG+I +K +S EH +IALIS T+
Sbjct: 721 ALSFKRELVLNYGLQTQFRPARGTNIDVNINMNNRKMGKINVKLNSSEHWEIALISALTM 777
Query: 1148 LRALLRRKEIE 1149
+AL+RR + E
Sbjct: 781 FKALVRRSKTE 777
BLAST of CmaCh05G012660 vs. TAIR 10
Match:
AT5G20300.2 (Avirulence induced gene (AIG1) family protein )
HSP 1 Score: 795.0 bits (2052), Expect = 7.8e-230
Identity = 416/791 (52.59%), Postives = 562/791 (71.05%), Query Frame = 0
Query: 368 MKGVRNWLFSQLVSKSVVSSRPLLGSDSFFGEENKEHVDEDQDGEVAQATTIVPPTAPHT 427
MKG ++W+F+ +S S+ SSRPLLGSD FF + ++E + Q Q T+ P +
Sbjct: 1 MKGFKDWVFA--LSNSMASSRPLLGSDPFFRDPHQEQDNHSQAPAAPQPVTLSEPPCSTS 60
Query: 428 SNSGGSLE-----SQEDLPLEQSQHSS---NRVKVDVLTMIEDLQVQFFRLLQRIGQTPN 487
S+ LE SQ+ +PLE SS N K + L I LQVQF RL+QR GQ+ N
Sbjct: 61 SD----LEILPPLSQQQVPLESLYQSSIDLNGKKHNPLAKIGGLQVQFLRLVQRFGQSQN 120
Query: 488 NLLVEKVLYRIHLATLIQVGESDLKRVNFERGKAREKAAEQEAAGIPESSFTFRVLVLGK 547
N+LV KVLYR+HLA LI+ ES+LK V + +A+ A EQE++GIPE F+ R+LVLGK
Sbjct: 121 NILVSKVLYRVHLAMLIRAEESELKNVKLRQDRAKALAREQESSGIPELDFSLRILVLGK 180
Query: 548 TGVGKSATINSLFDQAKTATDAFQPATDRIQEIVGTINGIKVSIIDTPGLSQSSSGNMKR 607
TGVGKSATINS+F Q K+ TDAF+P TDRI+E++GT++G+KV+ IDTPG SS + ++
Sbjct: 181 TGVGKSATINSIFGQPKSETDAFRPGTDRIEEVMGTVSGVKVTFIDTPGFHPLSSSSTRK 240
Query: 608 NKKILFSVKRYIRKSPPDIVLYFERLDLINKNHADYFLMKLINEVFGPAIWFNTILVLTH 667
N+KIL S+KRY++K PPD+VLY +RLD+I+ ++D+ L++LI E+FG AIW NTILV+TH
Sbjct: 241 NRKILLSIKRYVKKRPPDVVLYLDRLDMIDMRYSDFSLLQLITEIFGAAIWLNTILVMTH 300
Query: 668 CSSALPEGPDGYPVSFESYVSHCSELLQQNIHQAVSDPRLENPVLLVENHPQCKKNIMGE 727
S+A EG +G V++ESYV +++Q IHQAVSD +LENPVLLVENHP CKKN+ GE
Sbjct: 301 -SAATTEGRNGQSVNYESYVGQRMDVVQHYIHQAVSDTKLENPVLLVENHPSCKKNLAGE 360
Query: 728 KVLPNGQVWRSHFLLLCICYKVLGSINTLLKFQNCIELGPLANTRLPSLPHLLSSILRHR 787
VLPNG VW+ F+ LC+C KVLG + +LL+F++ I LG ++TR SLPHLLS LR R
Sbjct: 361 YVLPNGVVWKPQFMFLCVCTKVLGDVQSLLRFRDSIGLGQPSSTRTASLPHLLSVFLRRR 420
Query: 788 NTTSPSGVDYDSEAIILRD-NEEDEYDDLPSIRILTKSQFDKLSNSQKKEYLDELEYRET 847
++ + + + ++ D EEDEYD LP+IRIL KS+F+KLS SQKKEYLDEL+YRET
Sbjct: 421 LSSGADETEKEIDKLLNLDLEEEDEYDQLPTIRILGKSRFEKLSKSQKKEYLDELDYRET 480
Query: 848 LYLKKQLREEYQRRKEIKLLKHRDSEHNDNNGDLQPSPEAEAVLLPDMAVPPSFDSDCPV 907
LYLKKQL+EE +RR++ KL++ + E + + AV LPDMA P SFDSD P
Sbjct: 481 LYLKKQLKEECRRRRDEKLVEEENLEDTEQR-------DQAAVPLPDMAGPDSFDSDFPA 540
Query: 908 HRYRCIAVDDQWIVRPVLDPQGWDHDVGFDGINLETAMEMNKNVFTSVTGQVSKDKRFFN 967
HRYRC++ DQW+VRPV DPQGWD DVGFDGIN+ETA ++N+N+F S TGQVS+DK+ F
Sbjct: 541 HRYRCVSAGDQWLVRPVYDPQGWDRDVGFDGINIETAAKINRNLFASATGQVSRDKQRFT 600
Query: 968 IQSECAASY-MDSRGSSYTLGLDVQSSGTDRIYTVHSNAKLGSIKHNHPGIGLSLISFKR 1027
IQSE A+Y + R ++++ +D+QSSG D +Y+ KL + KHN +G+ L SF
Sbjct: 601 IQSETNAAYTRNFREQTFSVAVDLQSSGEDLVYSFQGGTKLQTFKHNTTDVGVGLTSFGG 660
Query: 1028 NCYYGVKLEDTISIGKRVRLVANGGRIEGAGQMAYGGSIVATLRGDDYPVRNDHLSLTMT 1087
Y G KLEDT+ +GKRV+L AN G++ G+GQ A GGS A +RG DYPVRN+ + LTMT
Sbjct: 661 KYYVGGKLEDTLLVGKRVKLTANAGQMRGSGQTANGGSFEACIRGRDYPVRNEQIGLTMT 720
Query: 1088 VLSFDKETILSGNVESEFRVSRSMRLSVNANLNTRKMGQICIKTSSCEHLQIALISGFTI 1147
LSF +E +L+ ++++FR +R + VN N+N RKMG+I +K +S EH +IALIS T+
Sbjct: 721 ALSFKRELVLNYGLQTQFRPARGTNIDVNINMNNRKMGKINVKLNSSEHWEIALISALTM 777
Query: 1148 LRALLRRKEIE 1149
+AL+RR + E
Sbjct: 781 FKALVRRSKTE 777
BLAST of CmaCh05G012660 vs. TAIR 10
Match:
AT5G20300.3 (Avirulence induced gene (AIG1) family protein )
HSP 1 Score: 709.5 bits (1830), Expect = 4.3e-204
Identity = 356/656 (54.27%), Postives = 482/656 (73.48%), Query Frame = 0
Query: 495 LIQVGESDLKRVNFERGKAREKAAEQEAAGIPESSFTFRVLVLGKTGVGKSATINSLFDQ 554
LI+ ES+LK V + +A+ A EQE++GIPE F+ R+LVLGKTGVGKSATINS+F Q
Sbjct: 2 LIRAEESELKNVKLRQDRAKALAREQESSGIPELDFSLRILVLGKTGVGKSATINSIFGQ 61
Query: 555 AKTATDAFQPATDRIQEIVGTINGIKVSIIDTPGLSQSSSGNMKRNKKILFSVKRYIRKS 614
K+ TDAF+P TDRI+E++GT++G+KV+ IDTPG SS + ++N+KIL S+KRY++K
Sbjct: 62 PKSETDAFRPGTDRIEEVMGTVSGVKVTFIDTPGFHPLSSSSTRKNRKILLSIKRYVKKR 121
Query: 615 PPDIVLYFERLDLINKNHADYFLMKLINEVFGPAIWFNTILVLTHCSSALPEGPDGYPVS 674
PPD+VLY +RLD+I+ ++D+ L++LI E+FG AIW NTILV+TH S+A EG +G V+
Sbjct: 122 PPDVVLYLDRLDMIDMRYSDFSLLQLITEIFGAAIWLNTILVMTH-SAATTEGRNGQSVN 181
Query: 675 FESYVSHCSELLQQNIHQAVSDPRLENPVLLVENHPQCKKNIMGEKVLPNGQVWRSHFLL 734
+ESYV +++Q IHQAVSD +LENPVLLVENHP CKKN+ GE VLPNG VW+ F+
Sbjct: 182 YESYVGQRMDVVQHYIHQAVSDTKLENPVLLVENHPSCKKNLAGEYVLPNGVVWKPQFMF 241
Query: 735 LCICYKVLGSINTLLKFQNCIELGPLANTRLPSLPHLLSSILRHRNTTSPSGVDYDSEAI 794
LC+C KVLG + +LL+F++ I LG ++TR SLPHLLS LR R ++ + + + +
Sbjct: 242 LCVCTKVLGDVQSLLRFRDSIGLGQPSSTRTASLPHLLSVFLRRRLSSGADETEKEIDKL 301
Query: 795 ILRD-NEEDEYDDLPSIRILTKSQFDKLSNSQKKEYLDELEYRETLYLKKQLREEYQRRK 854
+ D EEDEYD LP+IRIL KS+F+KLS SQKKEYLDEL+YRETLYLKKQL+EE +RR+
Sbjct: 302 LNLDLEEEDEYDQLPTIRILGKSRFEKLSKSQKKEYLDELDYRETLYLKKQLKEECRRRR 361
Query: 855 EIKLLKHRDSEHNDNNGDLQPSPEAEAVLLPDMAVPPSFDSDCPVHRYRCIAVDDQWIVR 914
+ KL++ + E + + AV LPDMA P SFDSD P HRYRC++ DQW+VR
Sbjct: 362 DEKLVEEENLEDTEQR-------DQAAVPLPDMAGPDSFDSDFPAHRYRCVSAGDQWLVR 421
Query: 915 PVLDPQGWDHDVGFDGINLETAMEMNKNVFTSVTGQVSKDKRFFNIQSECAASY-MDSRG 974
PV DPQGWD DVGFDGIN+ETA ++N+N+F S TGQVS+DK+ F IQSE A+Y + R
Sbjct: 422 PVYDPQGWDRDVGFDGINIETAAKINRNLFASATGQVSRDKQRFTIQSETNAAYTRNFRE 481
Query: 975 SSYTLGLDVQSSGTDRIYTVHSNAKLGSIKHNHPGIGLSLISFKRNCYYGVKLEDTISIG 1034
++++ +D+QSSG D +Y+ KL + KHN +G+ L SF Y G KLEDT+ +G
Sbjct: 482 QTFSVAVDLQSSGEDLVYSFQGGTKLQTFKHNTTDVGVGLTSFGGKYYVGGKLEDTLLVG 541
Query: 1035 KRVRLVANGGRIEGAGQMAYGGSIVATLRGDDYPVRNDHLSLTMTVLSFDKETILSGNVE 1094
KRV+L AN G++ G+GQ A GGS A +RG DYPVRN+ + LTMT LSF +E +L+ ++
Sbjct: 542 KRVKLTANAGQMRGSGQTANGGSFEACIRGRDYPVRNEQIGLTMTALSFKRELVLNYGLQ 601
Query: 1095 SEFRVSRSMRLSVNANLNTRKMGQICIKTSSCEHLQIALISGFTILRALLRRKEIE 1149
++FR +R + VN N+N RKMG+I +K +S EH +IALIS T+ +AL+RR + E
Sbjct: 602 TQFRPARGTNIDVNINMNNRKMGKINVKLNSSEHWEIALISALTMFKALVRRSKTE 649
BLAST of CmaCh05G012660 vs. TAIR 10
Match:
AT3G16620.1 (translocon outer complex protein 120 )
HSP 1 Score: 547.4 bits (1409), Expect = 2.8e-155
Identity = 299/719 (41.59%), Postives = 448/719 (62.31%), Query Frame = 0
Query: 435 ESQEDLPLEQSQHSSNRVKVDVLTMIEDLQVQFFRLLQRIGQTPNNLLVEKVLYRIHLAT 494
++++ E +H R K ++ ++V+F RL R+GQTP+N++V +VLYR+ LA
Sbjct: 367 QAEDSTTAETDEHDETREK------LQFIRVKFLRLSHRLGQTPHNVVVAQVLYRLGLAE 426
Query: 495 LIQVGESDLKRVNFERGKAREKAAEQEAAGIPESSFTFRVLVLGKTGVGKSATINSLFDQ 554
++ G + + F +A A + EAA F+ ++VLGK+GVGKSATINS+FD+
Sbjct: 427 QLR-GRNGSRVGAFSFDRASAMAEQLEAAAQDPLDFSCTIMVLGKSGVGKSATINSIFDE 486
Query: 555 AKTATDAFQPATDRIQEIVGTINGIKVSIIDTPGLSQSSSGNMKRNKKILFSVKRYIRKS 614
K +TDAFQ T ++Q+I G + GIKV +IDTPGL S S + +N+KIL SV+ +I+KS
Sbjct: 487 LKISTDAFQVGTKKVQDIEGFVQGIKVRVIDTPGLLPSWS-DQHKNEKILKSVRAFIKKS 546
Query: 615 PPDIVLYFERLDLINKNHADYFLMKLINEVFGPAIWFNTILVLTHCSSALPEGPDGYPVS 674
PPDIVLY +RLD+ +++ D L++ I +VFGP+IWFN I+ LTH +SA P+GP+G S
Sbjct: 547 PPDIVLYLDRLDMQSRDSGDMPLLRTITDVFGPSIWFNAIVGLTHAASAPPDGPNGTASS 606
Query: 675 FESYVSHCSELLQQNIHQAVSDPRLENPVLLVENHPQCKKNIMGEKVLPNGQVWRSHFLL 734
++ +V+ S ++QQ I QA D RL NPV LVENH C+ N G++VLPNGQVW+ H LL
Sbjct: 607 YDMFVTQRSHVIQQAIRQAAGDMRLMNPVSLVENHSACRTNRAGQRVLPNGQVWKPHLLL 666
Query: 735 LCICYKVLGSINTLLKFQNCIELGPLA-NTRLPSLPHLLSSILRHRNTTSPSGVDYDSE- 794
L K+L N LLK Q+ I G A ++ P LP LLSS+L+ R YD E
Sbjct: 667 LSFASKILAEANALLKLQDNIPGGQFATRSKAPPLPLLLSSLLQSRPQAKLPEQQYDDED 726
Query: 795 -----AIILRDNEEDEYDDLPSIRILTKSQFDKLSNSQKKEYLDELEYRETLYLKKQLRE 854
EE EYD+LP + LTK++ KLS SQKKEYLDE+EYRE L++K+Q++E
Sbjct: 727 DEDDLDESSDSEEESEYDELPPFKRLTKAEMTKLSKSQKKEYLDEMEYREKLFMKRQMKE 786
Query: 855 EYQRRKEIKL----LKHRDSEHNDNNGDLQPSPEAEAVLLPDMAVPPSFDSDCPVHRYRC 914
E +RRK +K +K + +++N + + P + V +PD+++P SFDSD P HRYR
Sbjct: 787 ERKRRKLLKKFAAEIKDMPNGYSENVEEERSEPASVPVPMPDLSLPASFDSDNPTHRYRY 846
Query: 915 IAVDDQWIVRPVLDPQGWDHDVGFDGINLETAMEMNKNVFTSVTGQVSKDKRFFNIQSEC 974
+ +QW+VRPVL+ GWDHD+G++G+N E + + S +GQV+KDK+ ++Q E
Sbjct: 847 LDTSNQWLVRPVLETHGWDHDIGYEGVNAERLFVVKDKIPVSFSGQVTKDKKDAHVQLEL 906
Query: 975 AASYMDSRGSSYTLGLDVQSSGTDRIYTVHSNAKLGSIKHNHPGIGLSLISFKRNCYYGV 1034
A+S G S +LG D+Q++G + YT+ S + + N GLS+ + G+
Sbjct: 907 ASSVKHGEGRSTSLGFDMQNAGKELAYTIRSETRFNKFRKNKAAAGLSVTLLGDSVSAGL 966
Query: 1035 KLEDTISIGKRVRLVANGGRIEGAGQMAYGGSIVATLRGDDYPVRNDHLSLTMTVLSFDK 1094
K+ED + KR R+V +GG + G +AYGG++ A R DYP+ +L ++V+ +
Sbjct: 967 KVEDKLIANKRFRMVMSGGAMTSRGDVAYGGTLEAQFRDKDYPLGRFLSTLGLSVMDWHG 1026
Query: 1095 ETILSGNVESEFRVSRSMRLSVNANLNTRKMGQICIKTSSCEHLQIALISGFTILRALL 1143
+ + GN++S+ + RS L ANLN R GQ+ I+ +S E LQ+A+++ + + LL
Sbjct: 1027 DLAIGGNIQSQVPIGRSSNLIARANLNNRGAGQVSIRVNSSEQLQLAVVALVPLFKKLL 1077
BLAST of CmaCh05G012660 vs. TAIR 10
Match:
AT2G16640.1 (multimeric translocon complex in the outer envelope membrane 132 )
HSP 1 Score: 544.3 bits (1401), Expect = 2.4e-154
Identity = 308/749 (41.12%), Postives = 454/749 (60.61%), Query Frame = 0
Query: 413 VAQATTIVPPT--APHTS--NSGGS----LESQEDLPLEQSQHSSNRVKVDVLTMIEDLQ 472
+ +A+ ++ P AP S N GS ++++ E +H R K+ + ++
Sbjct: 455 LGRASPLLEPASRAPQQSRVNGNGSHNQFQQAEDSTTTEADEHDETREKLQL------IR 514
Query: 473 VQFFRLLQRIGQTPNNLLVEKVLYRIHLATLIQVGESDLKRVNFERGKAREKAAEQEAAG 532
V+F RL R+GQTP+N++V +VLYR+ LA ++ G + + F +A A + EAAG
Sbjct: 515 VKFLRLAHRLGQTPHNVVVAQVLYRLGLAEQLR-GRNGSRVGAFSFDRASAMAEQLEAAG 574
Query: 533 IPESSFTFRVLVLGKTGVGKSATINSLFDQAKTATDAFQPATDRIQEIVGTINGIKVSII 592
F+ ++VLGK+GVGKSATINS+FD+ K TDAFQ T R+Q++ G + GIKV +I
Sbjct: 575 QDPLDFSCTIMVLGKSGVGKSATINSIFDEVKFCTDAFQMGTKRVQDVEGLVQGIKVRVI 634
Query: 593 DTPGLSQSSSGNMKRNKKILFSVKRYIRKSPPDIVLYFERLDLINKNHADYFLMKLINEV 652
DTPGL S S K N+KIL SVK +I+K+PPDIVLY +RLD+ +++ D L++ I++V
Sbjct: 635 DTPGLLPSWSDQAK-NEKILNSVKAFIKKNPPDIVLYLDRLDMQSRDSGDMPLLRTISDV 694
Query: 653 FGPAIWFNTILVLTHCSSALPEGPDGYPVSFESYVSHCSELLQQNIHQAVSDPRLENPVL 712
FGP+IWFN I+ LTH +S P+GP+G S++ +V+ S ++QQ I QA D RL NPV
Sbjct: 695 FGPSIWFNAIVGLTHAASVPPDGPNGTASSYDMFVTQRSHVIQQAIRQAAGDMRLMNPVS 754
Query: 713 LVENHPQCKKNIMGEKVLPNGQVWRSHFLLLCICYKVLGSINTLLKFQNCIELGPL-ANT 772
LVENH C+ N G++VLPNGQVW+ H LLL K+L N LLK Q+ I P A +
Sbjct: 755 LVENHSACRTNRAGQRVLPNGQVWKPHLLLLSFASKILAEANALLKLQDNIPGRPFAARS 814
Query: 773 RLPSLPHLLSSILRHRNTTSPSGVDYDSE------AIILRDNEEDEYDDLPSIRILTKSQ 832
+ P LP LLSS+L+ R Y E +EE EYD LP + LTK+Q
Sbjct: 815 KAPPLPFLLSSLLQSRPQPKLPEQQYGDEEDEDDLEESSDSDEESEYDQLPPFKSLTKAQ 874
Query: 833 FDKLSNSQKKEYLDELEYRETLYLKKQLREEYQRRKEIKL----LKHRDSEHNDNNGDLQ 892
LS SQKK+YLDE+EYRE L +KKQ++EE +RRK K +K +++N +
Sbjct: 875 MATLSKSQKKQYLDEMEYREKLLMKKQMKEERKRRKMFKKFAAEIKDLPDGYSENVEEES 934
Query: 893 PSPEAEAVLLPDMAVPPSFDSDCPVHRYRCIAVDDQWIVRPVLDPQGWDHDVGFDGINLE 952
P + V +PD+++P SFDSD P HRYR + +QW+VRPVL+ GWDHD+G++G+N E
Sbjct: 935 GGPASVPVPMPDLSLPASFDSDNPTHRYRYLDSSNQWLVRPVLETHGWDHDIGYEGVNAE 994
Query: 953 TAMEMNKNVFTSVTGQVSKDKRFFNIQSECAASYMDSRGSSYTLGLDVQSSGTDRIYTVH 1012
+ + + SV+GQV+KDK+ N+Q E A+S G S +LG D+Q+ G + YT+
Sbjct: 995 RLFVVKEKIPISVSGQVTKDKKDANVQLEMASSVKHGEGKSTSLGFDMQTVGKELAYTLR 1054
Query: 1013 SNAKLGSIKHNHPGIGLSLISFKRNCYYGVKLEDTISIGKRVRLVANGGRIEGAGQMAYG 1072
S + + + N GLS+ + G+K+ED K R+V +GG + G AYG
Sbjct: 1055 SETRFNNFRRNKAAAGLSVTHLGDSVSAGLKVEDKFIASKWFRIVMSGGAMTSRGDFAYG 1114
Query: 1073 GSIVATLRGDDYPVRNDHLSLTMTVLSFDKETILSGNVESEFRVSRSMRLSVNANLNTRK 1132
G++ A LR DYP+ +L ++V+ + + + GN++S+ + RS L ANLN R
Sbjct: 1115 GTLEAQLRDKDYPLGRFLTTLGLSVMDWHGDLAIGGNIQSQVPIGRSSNLIARANLNNRG 1174
Query: 1133 MGQICIKTSSCEHLQIALISGFTILRALL 1143
GQ+ ++ +S E LQ+A+++ + + LL
Sbjct: 1175 AGQVSVRVNSSEQLQLAMVAIVPLFKKLL 1195
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q6S5G3 | 1.1e-228 | 52.59 | Translocase of chloroplast 90, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=... | [more] |
A9SV59 | 6.8e-162 | 42.88 | Translocase of chloroplast 101, chloroplastic OS=Physcomitrium patens OX=3218 GN... | [more] |
A9SY65 | 3.7e-160 | 42.45 | Translocase of chloroplast 108, chloroplastic OS=Physcomitrium patens OX=3218 GN... | [more] |
A9SY64 | 3.1e-159 | 42.70 | Translocase of chloroplast 125, chloroplastic OS=Physcomitrium patens OX=3218 GN... | [more] |
A9SV60 | 1.6e-158 | 44.32 | Translocase of chloroplast 126, chloroplastic OS=Physcomitrium patens OX=3218 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT5G20300.1 | 7.8e-230 | 52.59 | Avirulence induced gene (AIG1) family protein | [more] |
AT5G20300.2 | 7.8e-230 | 52.59 | Avirulence induced gene (AIG1) family protein | [more] |
AT5G20300.3 | 4.3e-204 | 54.27 | Avirulence induced gene (AIG1) family protein | [more] |
AT3G16620.1 | 2.8e-155 | 41.59 | translocon outer complex protein 120 | [more] |
AT2G16640.1 | 2.4e-154 | 41.12 | multimeric translocon complex in the outer envelope membrane 132 | [more] |