Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGCGCCCCGCCAAAATCCCTCCAACGAATCGCATCAATGGCCCCAATCTTCCTTCCAACGACTCTTCACCTCGATCACAGCCGACATTTTGCCAACCCCCCTCTCTTTCTCACTCTCCTTTTTATTTCCACTTTCCCCTTCGCTCATCACCCTTCTCTCTACTTTCACTATCTATTCTTCTCTGCTCCGTGTCCATTTGAAGGCCTTTTGTGTGTGTGTGTGTGTGTGCGTGTCGGTCTAGCCGAGCCATTCTTCCGGTTTTGGGACTTGCCCACATTAGAATGTGGTGAAGTGAGTTTCAAGACCTTGAAATTGCCTCTTCTTCACATGCAATTTTGTTATCTGTATCTGGGTTTTGTGGTGATTCTGTGTTTTTTTTTTCCTTTTCTTGTAATGGGTTGGGTTTCGTGATTCTGCTGGATTTTTCACCTGGGTTTCTGTTTTGAGGTATATTTCCTCCTTTTCTCTTCCTTTTCGAGTTTGAATAGACTCAACTCTTGCAGGTTCTCCATCCTTTTGTTTCTGCTGGAATGTGAAATTTGTTTGACTTGTTTGAATACCCATTGAGGCGTTCTTCAGCTTTGAAGCTTTTTGTGGGATGGGGGTGGTTTCTCTTGTTTCTTGTGGAAAAGGGTCATGTGGGTTTTGTCTGTTTTTGTAAATTATCACTTGCTGATGGATACAGCTGAAGGTTTTGTTATGAACTTAATGGTATTTAATGATCATTGTTTCATTGGTGGCTGCAGATTTCTCTTTTTGGGGTTGAAGAAACATAGTGGATAAAGTGATGGACCAGAAGAGCTGGCTTTGGAGGAAGAAATCTTCAGAGAAGATTATCGTTTCCAGTGATAAAGTAAACCTCTCTGTGAATAAGAATGAAGAAGAGGTAATGTAATCTTCTTGTTTAGTTTCTTACTGGGATTTCATCAATTCTCTGCAAATTTCTCTTAATAATATCAGATCTCTGAAGAATCCTATTATATGATTTGCTCTGCTCAAGCACAGCTTCTGATATTGATATGTTGCTTCTTTTGTGATATGCTTTCATCAATGGAAGTAATATTGTCTTATTTGTTTAATATCCTTTTGCAGACACTTCTTATAGACAAAGCACGGTTAGAGAAAGATCTTGAGATTGCAAATGATAAGCTTTCTGTAGCTCTCTCTGATTGTAAGACAAAAGATGAACTTGTGAAGAAACTTACGAACATGGAACAAGAAGCCATTGCCAGTAATTCTCAGTTCTTTCTTTAAGCTTTTATTTGCATGGTATAAGTATAACTTTCTTGAATGCTTCAACATAGGATGGGAAAAGGCAAAATATGAAGTAGCAATCTTAAAGCAAGATCTCAATGATGCTGTACAGAAGAGGCTTGCTGGTGAAGAAAGATTGATTCATTTAGATGCAGCGCTAAAGGAATGTATGCAGCAGCTCCGGTTTGTTCGAGAAGAGCAGGAACAGAGGATTCATGATGCTGTCTCAAAGACATCGAATGAATTTGAGAAAACCCGAAAGATTTTGGAGGAGAAGTTAGCTGATACTTGTAAAAGGCTTTCGAAATTGGGTGGGGAGAACACTCAACTTAGCAAGGCTTTGTTGGTGAAGGACAAGATGATTGAAGATCTAAATAGAGAGTTGATCGGTGTGGAAACGGATCTTAATGCTTTAGTATCGAGATTAGAATCCACGGAGAAAGAAAAAGGTTCTCTAAAGTATGAAGTTAGAGTTCTTGAAAAGGAGGTCGAGATTCGGAATGAGGAGCGAGAATTTCATAGACGAACTGCTGATGCATCGCATAAGCAACATTTGGAGGGTGTGAAAAAGATTGCAAAGCTAGAATCTGAGTGTCAAAGGCTGCGTCTCCTCGTTCGGAAGAGGTTGCCAGGTCCTGCAGCCTTGGTAAAGATGAAAAATGAAGTTGAAATGCTAGGAAGGGATTCATTTGAGATCAGAAGACGGCAATCGAATCCAACAAGTTCTTTGGACTCTTCACTAGAGATCTCTCCAGAGACTCCCAACGAACATCTTAATGTTGCAACTTGTAGAGTGTCAGCTTTGGAAGAAGAGAACTGTGCCCTCAAGGAAGCTCTCAACAAAAAGAATAACGAACTTCAAGTTATAAAAATCATGCAAGCCCAAGCATCTTCGTTACAAATTGCATCACCCCATGAATTATCGAATGGTCAAAAAGTTATGGAATCAGGAAAGAGTGGTCTAACATTGTCTGAGCTTCCAGTTGCCTTGATGTCTGATGCGGGGAGCGATGATAAGGGTAGCTCTGCTGAATTATGGGCTTCTCCATTGATTTCAGAATTCGAGCACTTCAACAATGGAAAGCTAAAAGGATCGCCAACAACATGCAAAATAGTTGGATCTTCTGATTTGGAACTGATGGATGACTTTGTTGAAATGGAGAAATTGGCTATTGTCTCTGTTGAAAAATCTGCTGCAAATTCTCATTTTCTTTCAAATGAAGTTAATGGAAAACCGAAGTCCGTGGAAACTGAGCTAAAACGATGTTACCCTGAAGCAATGTCGAAAAGTAGTAATCCGGGCTCCTGTTTATCATACCCAGATGTTATATCTGGAGATATATCAACGAGTAAGGTTCCTGATTGGCTTCAAAATATCTCGAAAATGGTCCTTGACCAAAGTAGTTTCTCCAAAAGAGACCCTGAACAAATGCTGGAGGATATTCGAGCAGCGATGATACACCGGAGTCCTGAAAAACTTATTGATACAGAACTGTTTGCAAATCGTTGCGATGAACCTAATGTCCCTTGTAATAATGGCAGCATGTTGCCGAAGCCTTCGGGGATAGATTCAGTGAGTGATGCGAATGAGGTCGACATCACTCATCAGGTCGATATACGTGGTTCAGTGTCAAGACTGATTGAGCTCGTTGAAGGGATTAGCGTGTCGTCTTTGGATGATGATAAATCTTCCTACAAAAAGGATGGTAGTTTCTATTCAGAAACACCTACCGGCTATATGGTACGAGTTTTCCAATGGAAGATGTCTGAACTTAACACTACTTTGAAGCAGTTTATGCATAATTGTTATGACCTGTTGAATGGAAAGGCAAGCATTGAAAACTTTCTACAAGACCTAAATTCTACCTTGGATTGGATCATGAACCACTGTTTTTCACTTCAAGACGTTTCAAGCATGAGAGAATCCATAAAGAAGCATTTCGATTGGGATGAATCACGTAGCGACTGTGATCTGGAAACAGGGACGATGGTGCACGTTTCAGAAGTTGATAAATCACGTGTTTCAACAGGAGAGCTGCAATCTACCTTAACAGAAGAACATAGAAAATTGAAAGAAGAGCTTACAAGCGTAGAATCTGCGAAGAACGATCTTGAAGCAAAGTTTCAGTCTACGAACGGTGCAAGGGAAACACGAACAAATCAACTTCAAGAATCAGAAAAGAAGATTGTCAGCTTGGAGAAGGAATTAGAAACTCTTAGAGTATTGAAGGGAACAATTGAAGGTCAAATTGCCGATCAGCAAGTGGTGAATCATGATCTCGATGCACAGCTAACGGCAGCGAAAAACGAACTAAATGAGACTCGAAGAAAGTTTACAGCTCTTGAAGTTGAATTGGACAATAAAAACAATTTCTTTGAAGAATTAGAGGCCACATGCCTTGAACTGCAACTTCAGCTGGAAAGGTTCTGAATTCTCTATGATTTACTTAACTAAATGCATTAGTTTTAAGATTATTTGATTCTATTACCGAACTTCATTAGTGATCATAAAATTTCTTTATTCAAATATGGTATGATTTTTCTTCAGCACAAGGAAACAAAACCCGAGCATGGATCTCGTTCAGGAAGAGAAGCAACTACGCACGGTAAGTTCATTTCAACGTATAAGCGTCTTCATTACTGGACACTGATTATGCATCTTGAGATGTATAGTTTGGATGAAGAATTTGATATCTGGCTGTTCCTTAGTAGTTTTATCTTTCCTAGTTCAAAGATGTAGAAATTCTGATCAATATGTTCGTTCTACTGTCTTGTTCACCGATCAGTTGAAGAGACTTATGAACTTACAAAATTTTTAAGCTTTTATGGATCCCACGTGAGTTGGAGAGGAGAACGAAGCATTCTTTATAAGGGTGTGGAAACCTCTTCCTAGCATACGCGTTTTAAAAACCTTAAGGGGGAAGCCCGAAAAGGAAAGCCCAAAAAGGACAATATATACTAGCAGTGGGCTTAGACGGTTACAAATGGTATCAAAGCCAAACACCGGGCGATATGCCATCGAGGAGGCTAAGCCCCGAAGAAGAGTGGACATGAGGCGGTGTGCCAGCAAGGACATTGATCTCAAAGGGAGTAGATTGGGGGGTCCGACATCGATTGGAGAAGGGTTCGAGTATCAGTGACAACACAGGGCCCTAAAGGGGTGTAGATTGTGAGATCTCACGTCGGTAGTTGGGGAGGAGAACGAAACATTCGTTATAAGGGTGTGGAAACCTCTTCCTAGCAGACGCGTTTTAAAAACCTTGAGGGAAAGCCCGAAAGGAAAAACACAAATAGGACAATATATGAGCGACCATAAAAGTATATTCTACTCCAGTCTCATGTTCCAATGGTTTGATGGGTTATATTGTTTTCATTTGTAATGTATTCCACAAACTTGCAGGAATGGGAGATAACAACTGCTTCTGAAAGATTAGCCGAGTGCCAAGAGACGATTCTAAACCTCGGGAAGCAGTTGAAGGCTCTGGCTACTCCCAAGGAAGCTGCACTTTTAGACAAGGTCATTCTTAATCCAAACGACGAAACGCAAACATTGAGCGTCTCCACCACCACAACCACCCCCACCCCTATGACAGACACAGCCTCGACCCCAACTGTCTCTAATATCAAGACGACAAATAACCGGTTTTCTCTGCTAGATCAAATGCTAGCCGAGGATGATGCCTTCCCTAAAGATCATGAAATGCCGAAGCCTGTGGAAGTCGATGCCAATCACACTTCAACGTCAGACCCTGATAAGGCAATCGATCCACAGAAAGCGGTCCTCATATGGAATGGACATAAAAATGGAGTCGACAAAGATACAGTTGGTAATTTGGCTATCGTACCGAGTCGAAAGCAGGGAGATGGGGATGGGGGGTTGTGGAGAAAACTCTTATGGAGAAAGAAGAAAGGCAGGAGCCATAAAAAGGCCTTTCTATGACAAAAAAGGAAGCAAAGCACTGAAGTGCATAACAAGTTGACGACAAGATTGGCAGCTAAGTTTACTCTTCTTTGACTTCTGGTTTTTGTTTTGTTTGGTTTGGTTTGGTTTGTTGCTGCACAAATAGGAAATATAGATAATAAGGGATTGGATGCTTCCTAATTTTGCAGGTTTTGTAATGATTGAATGAATTTGTGTTGTACATATGTAAATCCATGCTTAAACATGAACATATAACCTTACCAATGAATGTTGTTTTGCTGAGTTGCTGTCATTATGTGAATCTGTAAGGATACTCATGTGACAAGGATGGTGTATACTATACATCGCC
mRNA sequence
AGCGCCCCGCCAAAATCCCTCCAACGAATCGCATCAATGGCCCCAATCTTCCTTCCAACGACTCTTCACCTCGATCACAGCCGACATTTTGCCAACCCCCCTCTCTTTCTCACTCTCCTTTTTATTTCCACTTTCCCCTTCGCTCATCACCCTTCTCTCTACTTTCACTATCTATTCTTCTCTGCTCCGTGTCCATTTGAAGGCCTTTTGTGTGTGTGTGTGTGTGTGCGTGTCGGTCTAGCCGAGCCATTCTTCCGGTTTTGGGACTTGCCCACATTAGAATGTGGTGAAGTGAGTTTCAAGACCTTGAAATTGCCTCTTCTTCACATGCAATTTTGTTATCTGTATCTGGGTTTTGTGGTGATTCTGTGTTTTTTTTTTCCTTTTCTTGTAATGGGTTGGGTTTCGTGATTCTGCTGGATTTTTCACCTGGGTTTCTGTTTTGAGATTTCTCTTTTTGGGGTTGAAGAAACATAGTGGATAAAGTGATGGACCAGAAGAGCTGGCTTTGGAGGAAGAAATCTTCAGAGAAGATTATCGTTTCCAGTGATAAAGTAAACCTCTCTGTGAATAAGAATGAAGAAGAGACACTTCTTATAGACAAAGCACGGTTAGAGAAAGATCTTGAGATTGCAAATGATAAGCTTTCTGTAGCTCTCTCTGATTGTAAGACAAAAGATGAACTTGTGAAGAAACTTACGAACATGGAACAAGAAGCCATTGCCAGATGGGAAAAGGCAAAATATGAAGTAGCAATCTTAAAGCAAGATCTCAATGATGCTGTACAGAAGAGGCTTGCTGGTGAAGAAAGATTGATTCATTTAGATGCAGCGCTAAAGGAATGTATGCAGCAGCTCCGGTTTGTTCGAGAAGAGCAGGAACAGAGGATTCATGATGCTGTCTCAAAGACATCGAATGAATTTGAGAAAACCCGAAAGATTTTGGAGGAGAAGTTAGCTGATACTTGTAAAAGGCTTTCGAAATTGGGTGGGGAGAACACTCAACTTAGCAAGGCTTTGTTGGTGAAGGACAAGATGATTGAAGATCTAAATAGAGAGTTGATCGGTGTGGAAACGGATCTTAATGCTTTAGTATCGAGATTAGAATCCACGGAGAAAGAAAAAGGTTCTCTAAAGTATGAAGTTAGAGTTCTTGAAAAGGAGGTCGAGATTCGGAATGAGGAGCGAGAATTTCATAGACGAACTGCTGATGCATCGCATAAGCAACATTTGGAGGGTGTGAAAAAGATTGCAAAGCTAGAATCTGAGTGTCAAAGGCTGCGTCTCCTCGTTCGGAAGAGGTTGCCAGGTCCTGCAGCCTTGGTAAAGATGAAAAATGAAGTTGAAATGCTAGGAAGGGATTCATTTGAGATCAGAAGACGGCAATCGAATCCAACAAGTTCTTTGGACTCTTCACTAGAGATCTCTCCAGAGACTCCCAACGAACATCTTAATGTTGCAACTTGTAGAGTGTCAGCTTTGGAAGAAGAGAACTGTGCCCTCAAGGAAGCTCTCAACAAAAAGAATAACGAACTTCAAGTTATAAAAATCATGCAAGCCCAAGCATCTTCGTTACAAATTGCATCACCCCATGAATTATCGAATGGTCAAAAAGTTATGGAATCAGGAAAGAGTGGTCTAACATTGTCTGAGCTTCCAGTTGCCTTGATGTCTGATGCGGGGAGCGATGATAAGGGTAGCTCTGCTGAATTATGGGCTTCTCCATTGATTTCAGAATTCGAGCACTTCAACAATGGAAAGCTAAAAGGATCGCCAACAACATGCAAAATAGTTGGATCTTCTGATTTGGAACTGATGGATGACTTTGTTGAAATGGAGAAATTGGCTATTGTCTCTGTTGAAAAATCTGCTGCAAATTCTCATTTTCTTTCAAATGAAGTTAATGGAAAACCGAAGTCCGTGGAAACTGAGCTAAAACGATGTTACCCTGAAGCAATGTCGAAAAGTAGTAATCCGGGCTCCTGTTTATCATACCCAGATGTTATATCTGGAGATATATCAACGAGTAAGGTTCCTGATTGGCTTCAAAATATCTCGAAAATGGTCCTTGACCAAAGTAGTTTCTCCAAAAGAGACCCTGAACAAATGCTGGAGGATATTCGAGCAGCGATGATACACCGGAGTCCTGAAAAACTTATTGATACAGAACTGTTTGCAAATCGTTGCGATGAACCTAATGTCCCTTGTAATAATGGCAGCATGTTGCCGAAGCCTTCGGGGATAGATTCAGTGAGTGATGCGAATGAGGTCGACATCACTCATCAGGTCGATATACGTGGTTCAGTGTCAAGACTGATTGAGCTCGTTGAAGGGATTAGCGTGTCGTCTTTGGATGATGATAAATCTTCCTACAAAAAGGATGGTAGTTTCTATTCAGAAACACCTACCGGCTATATGGTACGAGTTTTCCAATGGAAGATGTCTGAACTTAACACTACTTTGAAGCAGTTTATGCATAATTGTTATGACCTGTTGAATGGAAAGGCAAGCATTGAAAACTTTCTACAAGACCTAAATTCTACCTTGGATTGGATCATGAACCACTGTTTTTCACTTCAAGACGTTTCAAGCATGAGAGAATCCATAAAGAAGCATTTCGATTGGGATGAATCACGTAGCGACTGTGATCTGGAAACAGGGACGATGGTGCACGTTTCAGAAGTTGATAAATCACGTGTTTCAACAGGAGAGCTGCAATCTACCTTAACAGAAGAACATAGAAAATTGAAAGAAGAGCTTACAAGCGTAGAATCTGCGAAGAACGATCTTGAAGCAAAGTTTCAGTCTACGAACGGTGCAAGGGAAACACGAACAAATCAACTTCAAGAATCAGAAAAGAAGATTGTCAGCTTGGAGAAGGAATTAGAAACTCTTAGAGTATTGAAGGGAACAATTGAAGGTCAAATTGCCGATCAGCAAGTGGTGAATCATGATCTCGATGCACAGCTAACGGCAGCGAAAAACGAACTAAATGAGACTCGAAGAAAGTTTACAGCTCTTGAAGTTGAATTGGACAATAAAAACAATTTCTTTGAAGAATTAGAGGCCACATGCCTTGAACTGCAACTTCAGCTGGAAAGCACAAGGAAACAAAACCCGAGCATGGATCTCGTTCAGGAAGAGAAGCAACTACGCACGGAATGGGAGATAACAACTGCTTCTGAAAGATTAGCCGAGTGCCAAGAGACGATTCTAAACCTCGGGAAGCAGTTGAAGGCTCTGGCTACTCCCAAGGAAGCTGCACTTTTAGACAAGGTCATTCTTAATCCAAACGACGAAACGCAAACATTGAGCGTCTCCACCACCACAACCACCCCCACCCCTATGACAGACACAGCCTCGACCCCAACTGTCTCTAATATCAAGACGACAAATAACCGGTTTTCTCTGCTAGATCAAATGCTAGCCGAGGATGATGCCTTCCCTAAAGATCATGAAATGCCGAAGCCTGTGGAAGTCGATGCCAATCACACTTCAACGTCAGACCCTGATAAGGCAATCGATCCACAGAAAGCGGTCCTCATATGGAATGGACATAAAAATGGAGTCGACAAAGATACAGTTGGTAATTTGGCTATCGTACCGAGTCGAAAGCAGGGAGATGGGGATGGGGGGTTGTGGAGAAAACTCTTATGGAGAAAGAAGAAAGGCAGGAGCCATAAAAAGGCCTTTCTATGACAAAAAAGGAAGCAAAGCACTGAAGTGCATAACAAGTTGACGACAAGATTGGCAGCTAAGTTTACTCTTCTTTGACTTCTGGTTTTTGTTTTGTTTGGTTTGGTTTGGTTTGTTGCTGCACAAATAGGAAATATAGATAATAAGGGATTGGATGCTTCCTAATTTTGCAGGTTTTGTAATGATTGAATGAATTTGTGTTGTACATATGTAAATCCATGCTTAAACATGAACATATAACCTTACCAATGAATGTTGTTTTGCTGAGTTGCTGTCATTATGTGAATCTGTAAGGATACTCATGTGACAAGGATGGTGTATACTATACATCGCC
Coding sequence (CDS)
ATGGACCAGAAGAGCTGGCTTTGGAGGAAGAAATCTTCAGAGAAGATTATCGTTTCCAGTGATAAAGTAAACCTCTCTGTGAATAAGAATGAAGAAGAGACACTTCTTATAGACAAAGCACGGTTAGAGAAAGATCTTGAGATTGCAAATGATAAGCTTTCTGTAGCTCTCTCTGATTGTAAGACAAAAGATGAACTTGTGAAGAAACTTACGAACATGGAACAAGAAGCCATTGCCAGATGGGAAAAGGCAAAATATGAAGTAGCAATCTTAAAGCAAGATCTCAATGATGCTGTACAGAAGAGGCTTGCTGGTGAAGAAAGATTGATTCATTTAGATGCAGCGCTAAAGGAATGTATGCAGCAGCTCCGGTTTGTTCGAGAAGAGCAGGAACAGAGGATTCATGATGCTGTCTCAAAGACATCGAATGAATTTGAGAAAACCCGAAAGATTTTGGAGGAGAAGTTAGCTGATACTTGTAAAAGGCTTTCGAAATTGGGTGGGGAGAACACTCAACTTAGCAAGGCTTTGTTGGTGAAGGACAAGATGATTGAAGATCTAAATAGAGAGTTGATCGGTGTGGAAACGGATCTTAATGCTTTAGTATCGAGATTAGAATCCACGGAGAAAGAAAAAGGTTCTCTAAAGTATGAAGTTAGAGTTCTTGAAAAGGAGGTCGAGATTCGGAATGAGGAGCGAGAATTTCATAGACGAACTGCTGATGCATCGCATAAGCAACATTTGGAGGGTGTGAAAAAGATTGCAAAGCTAGAATCTGAGTGTCAAAGGCTGCGTCTCCTCGTTCGGAAGAGGTTGCCAGGTCCTGCAGCCTTGGTAAAGATGAAAAATGAAGTTGAAATGCTAGGAAGGGATTCATTTGAGATCAGAAGACGGCAATCGAATCCAACAAGTTCTTTGGACTCTTCACTAGAGATCTCTCCAGAGACTCCCAACGAACATCTTAATGTTGCAACTTGTAGAGTGTCAGCTTTGGAAGAAGAGAACTGTGCCCTCAAGGAAGCTCTCAACAAAAAGAATAACGAACTTCAAGTTATAAAAATCATGCAAGCCCAAGCATCTTCGTTACAAATTGCATCACCCCATGAATTATCGAATGGTCAAAAAGTTATGGAATCAGGAAAGAGTGGTCTAACATTGTCTGAGCTTCCAGTTGCCTTGATGTCTGATGCGGGGAGCGATGATAAGGGTAGCTCTGCTGAATTATGGGCTTCTCCATTGATTTCAGAATTCGAGCACTTCAACAATGGAAAGCTAAAAGGATCGCCAACAACATGCAAAATAGTTGGATCTTCTGATTTGGAACTGATGGATGACTTTGTTGAAATGGAGAAATTGGCTATTGTCTCTGTTGAAAAATCTGCTGCAAATTCTCATTTTCTTTCAAATGAAGTTAATGGAAAACCGAAGTCCGTGGAAACTGAGCTAAAACGATGTTACCCTGAAGCAATGTCGAAAAGTAGTAATCCGGGCTCCTGTTTATCATACCCAGATGTTATATCTGGAGATATATCAACGAGTAAGGTTCCTGATTGGCTTCAAAATATCTCGAAAATGGTCCTTGACCAAAGTAGTTTCTCCAAAAGAGACCCTGAACAAATGCTGGAGGATATTCGAGCAGCGATGATACACCGGAGTCCTGAAAAACTTATTGATACAGAACTGTTTGCAAATCGTTGCGATGAACCTAATGTCCCTTGTAATAATGGCAGCATGTTGCCGAAGCCTTCGGGGATAGATTCAGTGAGTGATGCGAATGAGGTCGACATCACTCATCAGGTCGATATACGTGGTTCAGTGTCAAGACTGATTGAGCTCGTTGAAGGGATTAGCGTGTCGTCTTTGGATGATGATAAATCTTCCTACAAAAAGGATGGTAGTTTCTATTCAGAAACACCTACCGGCTATATGGTACGAGTTTTCCAATGGAAGATGTCTGAACTTAACACTACTTTGAAGCAGTTTATGCATAATTGTTATGACCTGTTGAATGGAAAGGCAAGCATTGAAAACTTTCTACAAGACCTAAATTCTACCTTGGATTGGATCATGAACCACTGTTTTTCACTTCAAGACGTTTCAAGCATGAGAGAATCCATAAAGAAGCATTTCGATTGGGATGAATCACGTAGCGACTGTGATCTGGAAACAGGGACGATGGTGCACGTTTCAGAAGTTGATAAATCACGTGTTTCAACAGGAGAGCTGCAATCTACCTTAACAGAAGAACATAGAAAATTGAAAGAAGAGCTTACAAGCGTAGAATCTGCGAAGAACGATCTTGAAGCAAAGTTTCAGTCTACGAACGGTGCAAGGGAAACACGAACAAATCAACTTCAAGAATCAGAAAAGAAGATTGTCAGCTTGGAGAAGGAATTAGAAACTCTTAGAGTATTGAAGGGAACAATTGAAGGTCAAATTGCCGATCAGCAAGTGGTGAATCATGATCTCGATGCACAGCTAACGGCAGCGAAAAACGAACTAAATGAGACTCGAAGAAAGTTTACAGCTCTTGAAGTTGAATTGGACAATAAAAACAATTTCTTTGAAGAATTAGAGGCCACATGCCTTGAACTGCAACTTCAGCTGGAAAGCACAAGGAAACAAAACCCGAGCATGGATCTCGTTCAGGAAGAGAAGCAACTACGCACGGAATGGGAGATAACAACTGCTTCTGAAAGATTAGCCGAGTGCCAAGAGACGATTCTAAACCTCGGGAAGCAGTTGAAGGCTCTGGCTACTCCCAAGGAAGCTGCACTTTTAGACAAGGTCATTCTTAATCCAAACGACGAAACGCAAACATTGAGCGTCTCCACCACCACAACCACCCCCACCCCTATGACAGACACAGCCTCGACCCCAACTGTCTCTAATATCAAGACGACAAATAACCGGTTTTCTCTGCTAGATCAAATGCTAGCCGAGGATGATGCCTTCCCTAAAGATCATGAAATGCCGAAGCCTGTGGAAGTCGATGCCAATCACACTTCAACGTCAGACCCTGATAAGGCAATCGATCCACAGAAAGCGGTCCTCATATGGAATGGACATAAAAATGGAGTCGACAAAGATACAGTTGGTAATTTGGCTATCGTACCGAGTCGAAAGCAGGGAGATGGGGATGGGGGGTTGTGGAGAAAACTCTTATGGAGAAAGAAGAAAGGCAGGAGCCATAAAAAGGCCTTTCTATGA
Protein sequence
MDQKSWLWRKKSSEKIIVSSDKVNLSVNKNEEETLLIDKARLEKDLEIANDKLSVALSDCKTKDELVKKLTNMEQEAIARWEKAKYEVAILKQDLNDAVQKRLAGEERLIHLDAALKECMQQLRFVREEQEQRIHDAVSKTSNEFEKTRKILEEKLADTCKRLSKLGGENTQLSKALLVKDKMIEDLNRELIGVETDLNALVSRLESTEKEKGSLKYEVRVLEKEVEIRNEEREFHRRTADASHKQHLEGVKKIAKLESECQRLRLLVRKRLPGPAALVKMKNEVEMLGRDSFEIRRRQSNPTSSLDSSLEISPETPNEHLNVATCRVSALEEENCALKEALNKKNNELQVIKIMQAQASSLQIASPHELSNGQKVMESGKSGLTLSELPVALMSDAGSDDKGSSAELWASPLISEFEHFNNGKLKGSPTTCKIVGSSDLELMDDFVEMEKLAIVSVEKSAANSHFLSNEVNGKPKSVETELKRCYPEAMSKSSNPGSCLSYPDVISGDISTSKVPDWLQNISKMVLDQSSFSKRDPEQMLEDIRAAMIHRSPEKLIDTELFANRCDEPNVPCNNGSMLPKPSGIDSVSDANEVDITHQVDIRGSVSRLIELVEGISVSSLDDDKSSYKKDGSFYSETPTGYMVRVFQWKMSELNTTLKQFMHNCYDLLNGKASIENFLQDLNSTLDWIMNHCFSLQDVSSMRESIKKHFDWDESRSDCDLETGTMVHVSEVDKSRVSTGELQSTLTEEHRKLKEELTSVESAKNDLEAKFQSTNGARETRTNQLQESEKKIVSLEKELETLRVLKGTIEGQIADQQVVNHDLDAQLTAAKNELNETRRKFTALEVELDNKNNFFEELEATCLELQLQLESTRKQNPSMDLVQEEKQLRTEWEITTASERLAECQETILNLGKQLKALATPKEAALLDKVILNPNDETQTLSVSTTTTTPTPMTDTASTPTVSNIKTTNNRFSLLDQMLAEDDAFPKDHEMPKPVEVDANHTSTSDPDKAIDPQKAVLIWNGHKNGVDKDTVGNLAIVPSRKQGDGDGGLWRKLLWRKKKGRSHKKAFL
Homology
BLAST of CmaCh02G005220 vs. ExPASy Swiss-Prot
Match:
Q9SLN1 (Filament-like plant protein 7 OS=Arabidopsis thaliana OX=3702 GN=FPP7 PE=3 SV=2)
HSP 1 Score: 535.4 bits (1378), Expect = 1.5e-150
Identity = 406/1075 (37.77%), Postives = 585/1075 (54.42%), Query Frame = 0
Query: 1 MDQKSWLWRKKSSEKIIVSSDKVNLSVNKNEEETLLIDKARLEKDLEIANDKL-SVALSD 60
MD K+W W+KKS EK +V S+ ++ DK LE ++ NDKL SV
Sbjct: 1 MDHKAWPWKKKSMEKTVVESN-----------GEVVADKIELEHRVKSLNDKLNSVEAES 60
Query: 61 CKTKDELVKKLTNMEQEAIARWEKAKYEVAILKQDLNDAVQKRLAGEERLIHLDAALKEC 120
K + E QEAI WEK K EVA LK+ L++A+ ++ EER H DA LKEC
Sbjct: 61 NKHETE--------AQEAIVGWEKTKAEVASLKKKLDEALNEKHRSEERSSHTDAGLKEC 120
Query: 121 MQQLRFVREEQEQRIHDAVSKTSNEFEKTRKILEEKLADTCKRLSKLGGENTQLSKALLV 180
+QQLRFVREEQE+R+HDA++K S E+E+ +++ +LA + KRL++ GEN QLSKALL
Sbjct: 121 VQQLRFVREEQERRMHDALTKASQEYERRLIVIKTELAGSGKRLAEAEGENAQLSKALLA 180
Query: 181 KDKMIEDLNRELIGVETDLNALVSRLESTEKEKGSLKYEVRVLEKEVEIRNEEREFHRRT 240
K+K +EDLNRE +E D N+LVS LES EKE SL+YEVRVLEKE+E+RNEEREF RRT
Sbjct: 181 KNKTVEDLNRERDRIEVDFNSLVSSLESKEKENVSLRYEVRVLEKELELRNEEREFSRRT 240
Query: 241 ADASHKQHLEGVKKIAKLESECQRLRLLVRKRLPGPAALVKMKNEVEMLGRDSFEIRRRQ 300
A+ASHK HLE VKK+AKLESECQRLR+LVRKRLPGPAAL KM NEVEMLGR RR
Sbjct: 241 AEASHKLHLENVKKVAKLESECQRLRVLVRKRLPGPAALSKMSNEVEMLGR-----RRVN 300
Query: 301 SNPTSSLDSSLEISPETPNEHLNVATCRVSALEEENCALKEALNKKNNELQVIKIMQAQA 360
+P SP +E +N T ++ LEEEN L+EALNKK +ELQ + M ++
Sbjct: 301 GSPH---------SPMIDSEKINNLTEQLCLLEEENKTLREALNKKVSELQFSRNMYSRT 360
Query: 361 SS--LQIASPHELSNGQKVMESGKSGLTLSELPVALMSDAGSDDKGSSAELWASPLISEF 420
+S L+ S E S+ +E +S E+ +A +++ +DDK S A+ WAS L+SE
Sbjct: 361 ASRLLEFESHLEESSRGTNIEPSRSSNVSHEVSLASVTEFDNDDKVSCADSWASALLSEL 420
Query: 421 EHFNNGKLKGSPTTCKIVG---SSDLELMDDFVEMEKLAIV--SVEKSAANSHFLSNEVN 480
++F N K G+ +VG +++++LMDDF EMEKLA+V +++ +S S++
Sbjct: 421 DNFKNKKEMGT----SLVGTPKAAEMKLMDDFAEMEKLAMVASTIDNRPGSSPICSSDSI 480
Query: 481 GKPKSVETELKRCYPEAMSKSSNPGSCLSYPDVISGDISTSKVPDWLQNISKMVLDQSSF 540
VE E EA +K+S L+ DI + +P L + K V++
Sbjct: 481 SATGPVENESNENSSEA-TKTSGTVYSLNPDASPKDDIKSDSLPQSLHIVLKAVMEHKHI 540
Query: 541 SKRDPEQMLEDIRAAMIHRSPEKLIDTELFANRCDEPNVPCNNGSMLPKPSGIDSVSDAN 600
++R+ +++LEDIR A+ N+ S +++ +
Sbjct: 541 TQRNTDEVLEDIRKAL----------------------SSVNHSSFSTNHQETKTLTVED 600
Query: 601 EVDITHQVDIRGSVSRLIELVEGISVSSLDDDKSSYKKDGSFYSETPTGYMVRVFQWKMS 660
+D+ + +I S+ R+I+++EG+ SL D++ ++ SE +GY RV QWK +
Sbjct: 601 RLDM--ECNISKSIHRIIDVIEGV---SLKDERHVSNRE----SERLSGYTARVLQWKTT 660
Query: 661 ELNTTLKQFMHNCYDLLNGKASIENFLQDLNSTLDWIMNHCFSLQDVSSMRESIKKHFDW 720
EL++ L++F+ CYDLL+ KA ++ F Q+L+S L+W++NHCFSLQDVS+MR+ IKK F+W
Sbjct: 661 ELSSVLQRFLQACYDLLDRKADMKKFAQELSSVLEWMVNHCFSLQDVSTMRDEIKKQFEW 720
Query: 721 DESRSDCDLETGTMVHVSEVDKSRVSTGELQSTLTEEHRKLKEELTSVESAKNDLEAKFQ 780
DESRS +++ G VSE +K R E+++ + +E K
Sbjct: 721 DESRSGSEVDIGIFRQVSEAEKLRT-----------------EDVSFLACKDQLIEDKPG 780
Query: 781 STNGARETRTNQLQESEKKIVSLEKELETLRVLKGTIEGQIADQQVVNHDLDAQLTAAKN 840
+ N +R+T +E+E D +A++N
Sbjct: 781 NQNLSRKT--------------VEEEAN-----------------------DKTASASEN 840
Query: 841 ELNETRRKFTALEVELDNKNNFFEELEATCLELQLQLESTRKQNPSMDLVQEEKQLRTEW 900
EL ++E++ +RTE
Sbjct: 841 ELK----------------------------------------------LEEKQNMRTEL 891
Query: 901 EITTASERLAECQETILNLGKQLKALATPKEAALLDKVIL-NPNDETQTLSVSTTTTTPT 960
EI ASE+LAECQETILNLGKQLKAL KE ALL + ++ + D++ L
Sbjct: 901 EIAAASEKLAECQETILNLGKQLKALTNSKETALLSETLMYDVTDKSNNL---------- 891
Query: 961 PMTDTASTPTVSNIKTTNNRFSLLDQMLAED-DAFPKDHEMPKPVEVDANHTSTSDPDKA 1020
P + T + T+ R SLLDQM AED + + P+ + + ++S ++
Sbjct: 961 PDAQPSHETTKPEKRLTSQRSSLLDQMKAEDHNTGESKDQKPQAADKNGKGGNSSVYNET 891
Query: 1021 IDPQKAVLIWNGHKNGVDKDTVGNLAIVPSRKQGDGDGGLWRKLLWRKKKGRSHK 1066
I+ + +L+ + K G D + AIVP +K G G LWRKLL R KKG+S K
Sbjct: 1021 IEALEQILLSDKSK-GSDSNC---FAIVPQKKTG-GVKSLWRKLLGRNKKGKSKK 891
BLAST of CmaCh02G005220 vs. ExPASy Swiss-Prot
Match:
Q0WSY2 (Filament-like plant protein 4 OS=Arabidopsis thaliana OX=3702 GN=FPP4 PE=1 SV=1)
HSP 1 Score: 300.8 bits (769), Expect = 6.0e-80
Identity = 311/1049 (29.65%), Postives = 500/1049 (47.66%), Query Frame = 0
Query: 1 MDQKSWLWRKKSSEK------------------IIVSSDK-VNLSVNKNEEETLLIDKAR 60
MD+KSW W+KKSSEK I +S D+ NL+ K+E ++ +
Sbjct: 1 MDRKSWPWKKKSSEKTATVTEVVDQENGKKPSYIQISFDQYTNLNGLKDEVKSYEEKVTK 60
Query: 61 LEKDLEIANDKLSVALSDCKTKDELVKKLTNMEQEAIARWEKAKYEVAILKQDLNDAVQK 120
LE ++ + KLS A +D K+ LVK+ + + +EA+ WEKA+ E + LK L
Sbjct: 61 LEDQIKDLDLKLSTANADIVAKEVLVKQHSKVAEEAVTGWEKAEAEASALKTHLETITLA 120
Query: 121 RLAGEERLIHLDAALKECMQQLRFVREEQEQRIHDAVSKTSNEFEKTRKILEEKLADTCK 180
+L E+R HLD ALKECM+Q+R ++EE EQ++HD ++ +N+ + R E ++ + +
Sbjct: 121 KLTVEDRAAHLDGALKECMRQIRSLKEENEQKLHDVIATKTNQMDNLRAEFESRIGEYEE 180
Query: 181 RLSKLGGENTQLSKALLVKDKMIEDLNRELIGVETDLNALVSRLESTEKEKGSLKYEVRV 240
L + G EN LS++L + M+ ++ E E+++ L + +ES E+E +LKYE V
Sbjct: 181 ELLRCGAENDALSRSLQERSNMLMRISEEKSQAESEIEHLKNNIESCEREINTLKYETHV 240
Query: 241 LEKEVEIRNEEREFHRRTADASHKQHLEGVKKIAKLESECQRLRLLVRKRLPGPAALVKM 300
+ KE+EIRNEE+ R+A+A++KQHLEGVKKIAKLE+ECQRLR LVRK+LPGPAAL +M
Sbjct: 241 ITKELEIRNEEKNMSMRSAEAANKQHLEGVKKIAKLEAECQRLRTLVRKKLPGPAALAQM 300
Query: 301 KNEVEMLG----RDSFEIRRRQSNPTSSLDSSLEISPETPNEHL----------NVATCR 360
K EVE LG R RR P+S L S + + L ++ T R
Sbjct: 301 KMEVESLGFGDHRQDHRQRRSPVRPSSPLMSPMSHMSQVSEFSLDNMQKFHKENDLLTER 360
Query: 361 VSALEEENCALKEALNKKNNELQVIKIMQA------QASSLQIASPHELSNGQKVMESGK 420
+ A+EEE LKEAL K+N+ELQV + + A Q Q+ S G ++
Sbjct: 361 LLAMEEETKMLKEALAKRNSELQVSRNLCAKTANRLQTLEAQMMSKSPTKRGFEMPAEIF 420
Query: 421 SGLTLSELP-VALMSDAGSDDKGSSAELWASPLISEFEHFNNGKLKGSPTTCKIVGSSDL 480
S S P +A MS+ G++D S A S L + N K+K K ++ L
Sbjct: 421 SRQNASNPPSMASMSEDGNEDARSVAGSLMSELSQSNKDKANAKIK------KTESANQL 480
Query: 481 ELMDDFVEMEKLAIVSVEKSAANSHFLSNEVNGKPKSVETELKRCYPEAMSKSSNPGSCL 540
ELMDDF+EMEKLA + +A S S+ + T+LK+
Sbjct: 481 ELMDDFLEMEKLACLPNGSNANGSTDHSSADSDAEIPPATQLKK---------------- 540
Query: 541 SYPDVISGDISTSKVPDWLQNISKMVLDQSSFSKRDPEQMLEDIRAAMIHRSPEKLIDTE 600
++ + LQ++ K ++F E++L +I+ A + + KL
Sbjct: 541 -------------RISNVLQSLPK----DAAF-----EKILAEIQCA-VKDAGVKLPSKS 600
Query: 601 LFANRCDEPNVPCNNGSMLPKPSGIDSVSDANEVDITHQV--DIRGSVSRLIELVEGISV 660
AN NG K + + + +V I + ++ ++S++ + V +S
Sbjct: 601 HGANL---------NGLTEEKVIAMSNETTEEKVTIVEVITQELSDALSQIYQFVTYLS- 660
Query: 661 SSLDDDKSSYKKDGSFYSETPTGYMVRVFQWKMSELNTTLKQFMHNCYDLLNGKASIENF 720
K+ + SE R F K+ E +TT + +L + ++ +F
Sbjct: 661 -----------KEATACSEN------RTFSQKVQEFSTTFE-------GVLGKEKTLVDF 720
Query: 721 LQDLNSTL--------DWIMNHCFSLQ-------DVSSMRE--SIKKHFDWDESRSDCDL 780
L DL+ L D + H +++ D ++ E +++K + ++ C
Sbjct: 721 LFDLSRVLVEASELKIDVLGFHTSTVEIHSPDCIDKVALPENKALQKDSSGEHYQNGCSQ 780
Query: 781 ETGTMVHVSEVDKSRVSTGELQSTL------TEEHRKLKEELTSVESAKNDLEAKFQSTN 840
+ SE+ T + L TEE LK E ES EA ++T
Sbjct: 781 SSD-----SEIPDDCNGTSGYEPKLATCKFTTEEFEGLKLEKEKAESNLASCEADLEATK 840
Query: 841 GARETRTNQLQESEKKIVSLEKELETLRVLKGTIEGQIADQQVVNHDLDAQLTAAKNELN 900
+LQE+EK + ++ +LE+ + G E Q+ L+ + + + EL
Sbjct: 841 -------TKLQETEKLLAEVKSDLESAQKSNGMGETQLKCMVESYRSLETRSSELEIELT 900
Query: 901 ETRRKFTALEVELDNKNNFFEELEATCLELQLQLESTRKQNPSMDLVQEEKQLRTEWEIT 960
+ K LE EL ++ E A C EL+ QL+ + P+ +++++ + + + E+
Sbjct: 901 SLKGKIENLEDELHDEKENHREALAKCQELEEQLQRNNQNCPNCSVIEDDPKSKQDNELA 958
Query: 961 TASERLAECQETILNLGKQLKALATPKE----AALLDKVILNPNDE-------TQTLSVS 974
A+E+LAECQETIL LGKQLK++ E + ++ LNP +E Q +S
Sbjct: 961 AAAEKLAECQETILLLGKQLKSMCPQTEQVASSPSQEQQALNPEEEEYATSTNPQDSKLS 958
BLAST of CmaCh02G005220 vs. ExPASy Swiss-Prot
Match:
Q9C698 (Filament-like plant protein 6 OS=Arabidopsis thaliana OX=3702 GN=FPP6 PE=1 SV=1)
HSP 1 Score: 270.8 bits (691), Expect = 6.7e-71
Identity = 284/956 (29.71%), Postives = 473/956 (49.48%), Query Frame = 0
Query: 21 DKVNLSVNKNEEETLLIDKARLEKDLEIANDKLSVALSDCKTKDELVKKLTNMEQEAIAR 80
D + V++ EE+ +L +D+E N+KLSVA + TK+ LVK+ + + ++A++
Sbjct: 95 DAYDEKVHEYEEQV-----QKLNEDVEDLNEKLSVANEEIVTKEALVKQHSKVAEDAVSG 154
Query: 81 WEKAKYEVAILKQDLNDAVQKRLAGEERLIHLDAALKECMQQLRFVREEQEQRIHDAVSK 140
WEKA E LK L +L E+R HLD ALKECM+Q+R ++++ E ++HD
Sbjct: 155 WEKADAEALALKNTLESVTLSKLTAEDRAAHLDGALKECMRQIRNLKKDHEVKLHDVALS 214
Query: 141 TSNEFEKTRKILEEKLADTCKRLSKLGGENTQLSKALLVKDKMIEDLNRELIGVETDLNA 200
+ + EK E+++ D + L + ++ LS+ L + M+ ++ E + ++
Sbjct: 215 KTKQIEKMTMEFEKRMCDYEQELLRSAADSDALSRTLQERSNMLVKVSEEKSRADAEIET 274
Query: 201 LVSRLESTEKEKGSLKYEVRVLEKEVEIRNEEREFHRRTADASHKQHLEGVKKIAKLESE 260
L S LE E+E SLKYEV V+ KE+EIRNEE+ R+A++++KQHLEGVKKIAKLE+E
Sbjct: 275 LKSNLEMCEREIKSLKYEVHVVSKELEIRNEEKNMCIRSAESANKQHLEGVKKIAKLEAE 334
Query: 261 CQRLRLLVRKRLPGPAALVKMKNEVEMLGRDSFEIRRRQS-----------------NPT 320
CQRLR LVRK+LPGPAAL +MK EVE LGRDS + R+++S
Sbjct: 335 CQRLRSLVRKKLPGPAALAQMKLEVENLGRDSGDARQKRSPVKVSSPCKSPGGYSSTGSE 394
Query: 321 SSLDSSLEISPETPNEHLNVATCRVSALEEENCALKEALNKKNNELQVIKIMQAQASS-- 380
SLD++ + E NE L T R+ A+EEE LKEAL K+N+EL + + AQ++S
Sbjct: 395 FSLDNAQKFQKE--NEFL---TERLLAMEEETKMLKEALAKRNSELLESRNLCAQSTSKL 454
Query: 381 LQIASPHELSNGQKVMESGKSGLTLSELPVAL-MSDAGSDDKGS-SAELWASPLISEFEH 440
+ + + +N QK L S ++ +S+ G+DD GS S L +P +
Sbjct: 455 QSLEAQLQQNNSQKSSLEVCPNLNTSNPSSSISVSEDGNDDSGSCSGSLSTNPSQQIKKE 514
Query: 441 FNNGKLKGSPTTCKIVGSSDLELMDDFVEMEKLAIVSVEKSAANSHFLSNEVNGKPKSVE 500
+ L+ + +S +ELMDDF+EMEKLA + S++N S + +G KS E
Sbjct: 515 KDMAALERVESV-----NSHVELMDDFLEMEKLACLP-NLSSSNGSIDSKDGSGDQKS-E 574
Query: 501 TELKRCYPEAMSKSSNPGSCLSYPDVISGDISTSKVPDWLQNISKMVLDQSSFSKRDPEQ 560
+ + + + + + +S + S PD +I K+V D +
Sbjct: 575 MVILDAHTDLEDSDRGSPAVMKFRSRLS-KVLESVSPD--ADIQKIV--------GDIKC 634
Query: 561 MLEDIRAAMIHRSPEKL-IDTELFANRCDEPNVPCNNGSMLPKPSGIDSVSDANEVDITH 620
+L+D+ A M P ++ + E ++ C E N+ D ++ H
Sbjct: 635 ILQDVNACMDQEKPSEVHVHPEEVSDLCPEQNLV------------EDCHLAEQKLQSIH 694
Query: 621 QVDIRGSVSRLIELVEGISVSSLDDDKSSYKKDGSFYSETPTGYMVRVFQWKMSELNTTL 680
Q D++ +VSR+ + V + +S +G+ + E G+ V F +S + +L
Sbjct: 695 Q-DLKNAVSRIHDFVLLLRNEVKAGQDTSI--EGNDFVELIEGFSV-TFNHVLSG-DKSL 754
Query: 681 KQFMHNCYDLLNGKASIENFLQDL-NSTLDWIMNHCFSLQDVSSMRESIKKHFDWDESRS 740
F+ N ++ N + + L +S ++ + C D ++ ES K D D S+
Sbjct: 755 DDFVSNLANVFNEAMERKVSFRGLASSEVETLSPDCI---DKVALPES--KVVDKDSSQ- 814
Query: 741 DCDLETGTMVHVS---EVDKSRVSTGELQSTLTEEHRKLKEELTSVESAKNDLEAKFQST 800
++ VH D++RVS E S L E +E +++ E
Sbjct: 815 --EIYQNGCVHNEPGVPCDENRVSGYESDSKLQE-----------IEELRSEKEKMAVDI 874
Query: 801 NGARETRTNQLQESEKKIVSLEKELETLRVLKGTIEGQIADQQVVNHDLDAQLTAAKNEL 860
G + QLQESE+ + + + ++ + + Q+ L+++ + ++
Sbjct: 875 EGLK----CQLQESEQLLADIRSQFDSAQRSNRLADTQLRCMTESYRSLESRAADLEIDV 934
Query: 861 NETRRKFTALEVELDNKNNFFEELEATCLELQLQLESTRKQNPSMDLVQEEK---QLRTE 920
N+ + K LE EL+++ +E C EL+ ++ R +N S+ +E+ + + E
Sbjct: 935 NQLKEKIQKLENELEDEKCNHQEAILRCHELEEHIQ--RHRNTSLVAEDDEEADIKSKQE 980
Query: 921 WEITTASERLAECQETILNLGKQLKALATPKEAALLDKVILNPNDETQTLSVSTTT 948
E++ A+E+LAECQETI LGKQLK+ E + E + L +TT+
Sbjct: 995 RELSAAAEKLAECQETIFVLGKQLKSFRPQPEQMRSPQTRNESYSEEEELGTTTTS 980
BLAST of CmaCh02G005220 vs. ExPASy Swiss-Prot
Match:
O65649 (Filament-like plant protein 5 OS=Arabidopsis thaliana OX=3702 GN=FPP5 PE=2 SV=2)
HSP 1 Score: 231.9 bits (590), Expect = 3.4e-59
Identity = 273/981 (27.83%), Postives = 451/981 (45.97%), Query Frame = 0
Query: 1 MDQKSWLWRKKSSEKIIV----------------------------SSDKVNLSVNKNEE 60
M+ + W W++KSS+K +++ V ++++
Sbjct: 1 MEGRGWPWKRKSSDKATTEKPVVGIESTPVCSLSYLASLENQEKCKNTNYVQITMDSYTH 60
Query: 61 ETLLIDKARL-EKDLEIANDKLSVALSDCKTKDELVKKLTNMEQEAIARWEKAKYEVAIL 120
+ + D+ +L E ++ +KL++A S+ TK+ L+ + + +EA++ WEKA E L
Sbjct: 61 MSRMEDQVKLFEVQVKDLKEKLTLAHSEINTKESLILQHAKVAEEAVSGWEKADAETLAL 120
Query: 121 KQDLNDAVQKRLAGEERLIHLDAALKECMQQLRFVREEQEQRIHDAVSKTSNEFEKTRKI 180
K+ L +L E+R HLD ALKEC +Q+R V+EE ++++ D + +++++K +
Sbjct: 121 KRQLESVTLLKLTAEDRASHLDDALKECTRQIRIVKEESDKKLQDVILAKTSQWDKIKAE 180
Query: 181 LEEKLADTCKRLSKLGGENTQLSKALLVKDKMIEDLNRELIGVETDLNALVSRLESTEKE 240
LE K+ + + L + +N L+++L + +MI ++ E E D+ L + L+ EKE
Sbjct: 181 LEGKIDELSEGLHRAASDNAALTRSLQERSEMIVRISEERSKAEADVEKLKTNLQLAEKE 240
Query: 241 KGSLKYEVRVLEKEVEIRNEEREFHRRTADASHKQHLEGVKKIAKLESECQRLRLLVRKR 300
LKY++ V KEVEIRNEE+ ++AD ++KQHLEGVKKIAKLE+EC RLR L+RK+
Sbjct: 241 ISYLKYDLHVASKEVEIRNEEKNMSLKSADIANKQHLEGVKKIAKLEAECHRLRGLLRKK 300
Query: 301 LPGPAALVKMKNEVEMLGRDSFEIR--RRQSNPTSSLDSSLEISPETPNEHLN----VAT 360
LPGPAA+ +MK EVE LG + + R R S ++ + EIS + E T
Sbjct: 301 LPGPAAMAQMKLEVEGLGHEFTDPRAQRNMSQNHNAHIAKAEISTDHKLEECKRENVYLT 360
Query: 361 CRVSALEEENCALKEALNKKNNELQV-----------IKIMQAQASSLQIASPHELSNGQ 420
R +EEE LKE L+ +NNELQV +KI++ Q SN +
Sbjct: 361 RRTLEMEEEIQTLKEHLSARNNELQVSRNVCAKTLGKLKILEGQMHMFNNDKNAPKSNSR 420
Query: 421 KVMESGKSGLTLSELP-VALMSDAGSDDKGSSAELWASPLISEFEH-FNNGKLKGSPTTC 480
+ ES SG P V +S+ G D++GSS+E P S H + GS
Sbjct: 421 NLSESLSSGHDHHYPPSVTSVSEDGFDEEGSSSE--CGPATSLDSHKVRKVSVNGS---S 480
Query: 481 KIVGSSDLELMDDFVEMEKLAIVSVEKSAANSHFLSNEVNGKPKSVETELKRCYPEAMSK 540
K SS LELMDDF+E+EKL V + ANS S+ +SVE ++ SK
Sbjct: 481 KPRSSSRLELMDDFLEIEKL--VGSDPDGANSASKSSNSVCSRRSVE-------KQSSSK 540
Query: 541 SSNPGSCLSYPDVISGDISTSKVPDWL----QNISKMVLDQSSFSKRDPEQMLEDIRAAM 600
SS P D T+ + L I+++ Q S ++++E R ++
Sbjct: 541 SSEP------------DEDTTTLDQLLMVLRSRINRIFESQEGIS---IDKIVEAARFSI 600
Query: 601 --IHRSPEKLIDTELFANRCDEPNVPCNNGSMLPKPSGIDSVSDANEVDITHQVDIRGSV 660
+ S K + + LF DE +D + E T Q D+ +V
Sbjct: 601 QEMQGSSTKRMSSHLF-EVTDE-----------TLEKHVDIQNSEKEQKNTKQQDLEAAV 660
Query: 661 SRLIELVEGIS--VSSLDDD------KSSYKKDGSFYSETPTGYMVRVFQWKMSELNTTL 720
+ + ++ + + L D + S + S S+ PTG + +S++ L
Sbjct: 661 ANIHHFIKSTTKEATQLQDMNGNGQLRESLEDFSSSVSKYPTG------ESSLSDVMLEL 720
Query: 721 KQFMHNCYDLLNGKASIENFLQDLNSTLDWIMNHCFSLQDVSSMRESIKKHFDWDESRSD 780
+ +L NG +++ +++ V+ + + F+ +S
Sbjct: 721 SRISVLASNLNNGALTLKPHSKEI---------------PVTESNDKVTLLFEESDSNPL 780
Query: 781 CDLETGTMVHVSEVDKSRVSTGELQSTLTE-EHRKLKEELTSVESAKNDLEAKFQSTNGA 840
D T V + S+ +S L E E KL++E +VE L Q+
Sbjct: 781 GDTFAKTDHCVDNLINGDDSS--CKSLLKEVEQLKLEKENIAVE-----LSRCLQNL--- 840
Query: 841 RETRTNQLQESEKKIVSLEKELETLRVLKGTIEGQIADQQVVNHDLDAQLTAAKNELNET 900
E+ L+E E+ I L+ +L + L+ E Q+ LD + ++
Sbjct: 841 -ESTKAWLEEKEQLISKLKSQLTSSEDLQSLAETQLKCVTESYKSLDLHAKELEAKVKSL 900
Query: 901 RRKFTALEVELDNKNNFFEELEATCLELQLQLESTRKQNPSMDLVQEEKQLRTEWEITTA 919
+ LE+ + + EE A C +LQ E ++ + + Q E +I +A
Sbjct: 901 EEETKRLEMAFTTEKHGHEETLAKCRDLQ---EKMQRNETCENCSSSKLQPNQEKDIVSA 905
BLAST of CmaCh02G005220 vs. ExPASy Swiss-Prot
Match:
Q9MA92 (Filament-like plant protein 3 OS=Arabidopsis thaliana OX=3702 GN=FPP3 PE=1 SV=2)
HSP 1 Score: 137.5 bits (345), Expect = 8.8e-31
Identity = 156/555 (28.11%), Postives = 260/555 (46.85%), Query Frame = 0
Query: 1 MDQKSWLWRKKSSEKI--------IVSSDKVNLSVNKNEEETLLIDK--ARLEK---DLE 60
MD++SWLWR+KSSEK VSS S ++ + L K R E+ D++
Sbjct: 1 MDRRSWLWRRKSSEKSPGETESTGSVSSHSERFSDDQRSQSPELNSKPVTREEEATADIK 60
Query: 61 IANDKLSVALSDCKTKDELVKKLTNMEQEAIARWEKAKYEVAILKQDLNDAVQKRLAGEE 120
I ++LS AL + K++L K+ + +EA++ WEKA+ E A LKQ L+ + K A E+
Sbjct: 61 ILTERLSAALLNVSLKEDLAKQHAKVAEEAVSGWEKAENEAAALKQQLDASTSKVSALED 120
Query: 121 RLIHLDAALKECMQQLRFVREEQEQRIHDAVSKTSNEFEKTRKILEEKLADTCKRLSKLG 180
R HLD+ALKEC++QL REEQ Q+I +A++ E+E T+ LE +
Sbjct: 121 RNSHLDSALKECVRQLWQGREEQNQKIEEAINNKCKEWETTKSQLEAR------------ 180
Query: 181 GENTQLSKALLVKDKMIEDLN-RELIGVETDLNALVSRLESTEKEKGSLKYEVRVLEKEV 240
IE+L R+ + + L +LE+ EKE +LK ++ +EV
Sbjct: 181 ----------------IEELQARQDVTTSSVHEDLYPKLEALEKENSALKLQLLSKSEEV 240
Query: 241 EIRNEEREFHRRTADASHKQHLEGVKKIAKLESECQRLRLLVRKRLPGPAALVKMKNEVE 300
+IR ER+ + A+++ KQ LEG+KK+ KLE+EC++LR++VR+ + N+ +
Sbjct: 241 KIRTIERDLSTQAAESASKQQLEGIKKLTKLEAECRKLRVMVRRSDNSSDLKSSIDNQSD 300
Query: 301 MLGRDSFEIRRRQSNP-----TSSLDSSLEI----------------------SPETPNE 360
GR SF QS SS+ +S++I N+
Sbjct: 301 YSGRVSFSDNEMQSPSEKIIGKSSMATSVDIGLMDDFLEMEKLAALPHSEPGRKHSESNK 360
Query: 361 HLNVATCRVSALEEENCALKEALNKKNNELQVIKIMQAQASSLQIASPHELSNGQKVMES 420
L + V+ L+ E LK +L + + + +++++ + L++A L+ ++ +E+
Sbjct: 361 ELEKSNAHVNQLKHE---LKTSLRRISELEEKVEMVEVEKLQLEMA----LNGSKEQIEA 420
Query: 421 GKSGL-----TLSELP---------VALMSDAGSDDKGSSAEL-WASPLISEFEHFNNGK 480
+S L LSE+ L+ ++G + +L A +SE E K
Sbjct: 421 LQSRLKEIEGKLSEMKKLEAENQELELLLGESGKQMEDLQRQLNKAQVNLSELETRRAEK 480
Query: 481 LKGSPTTCKIVGSSDLELMDD--------FVEMEKLAIVSVEKSAANSHFLSNEVNGKPK 492
L+ T C LE + E++ L ++ + A L NGK +
Sbjct: 481 LE--LTMCLNGTKKQLETSQNRLKETERKLTELQTLLHLTKDAKEAAEDGL-KAANGKTE 517
BLAST of CmaCh02G005220 vs. TAIR 10
Match:
AT2G23360.1 (Plant protein of unknown function (DUF869) )
HSP 1 Score: 535.4 bits (1378), Expect = 1.0e-151
Identity = 406/1075 (37.77%), Postives = 585/1075 (54.42%), Query Frame = 0
Query: 1 MDQKSWLWRKKSSEKIIVSSDKVNLSVNKNEEETLLIDKARLEKDLEIANDKL-SVALSD 60
MD K+W W+KKS EK +V S+ ++ DK LE ++ NDKL SV
Sbjct: 1 MDHKAWPWKKKSMEKTVVESN-----------GEVVADKIELEHRVKSLNDKLNSVEAES 60
Query: 61 CKTKDELVKKLTNMEQEAIARWEKAKYEVAILKQDLNDAVQKRLAGEERLIHLDAALKEC 120
K + E QEAI WEK K EVA LK+ L++A+ ++ EER H DA LKEC
Sbjct: 61 NKHETE--------AQEAIVGWEKTKAEVASLKKKLDEALNEKHRSEERSSHTDAGLKEC 120
Query: 121 MQQLRFVREEQEQRIHDAVSKTSNEFEKTRKILEEKLADTCKRLSKLGGENTQLSKALLV 180
+QQLRFVREEQE+R+HDA++K S E+E+ +++ +LA + KRL++ GEN QLSKALL
Sbjct: 121 VQQLRFVREEQERRMHDALTKASQEYERRLIVIKTELAGSGKRLAEAEGENAQLSKALLA 180
Query: 181 KDKMIEDLNRELIGVETDLNALVSRLESTEKEKGSLKYEVRVLEKEVEIRNEEREFHRRT 240
K+K +EDLNRE +E D N+LVS LES EKE SL+YEVRVLEKE+E+RNEEREF RRT
Sbjct: 181 KNKTVEDLNRERDRIEVDFNSLVSSLESKEKENVSLRYEVRVLEKELELRNEEREFSRRT 240
Query: 241 ADASHKQHLEGVKKIAKLESECQRLRLLVRKRLPGPAALVKMKNEVEMLGRDSFEIRRRQ 300
A+ASHK HLE VKK+AKLESECQRLR+LVRKRLPGPAAL KM NEVEMLGR RR
Sbjct: 241 AEASHKLHLENVKKVAKLESECQRLRVLVRKRLPGPAALSKMSNEVEMLGR-----RRVN 300
Query: 301 SNPTSSLDSSLEISPETPNEHLNVATCRVSALEEENCALKEALNKKNNELQVIKIMQAQA 360
+P SP +E +N T ++ LEEEN L+EALNKK +ELQ + M ++
Sbjct: 301 GSPH---------SPMIDSEKINNLTEQLCLLEEENKTLREALNKKVSELQFSRNMYSRT 360
Query: 361 SS--LQIASPHELSNGQKVMESGKSGLTLSELPVALMSDAGSDDKGSSAELWASPLISEF 420
+S L+ S E S+ +E +S E+ +A +++ +DDK S A+ WAS L+SE
Sbjct: 361 ASRLLEFESHLEESSRGTNIEPSRSSNVSHEVSLASVTEFDNDDKVSCADSWASALLSEL 420
Query: 421 EHFNNGKLKGSPTTCKIVG---SSDLELMDDFVEMEKLAIV--SVEKSAANSHFLSNEVN 480
++F N K G+ +VG +++++LMDDF EMEKLA+V +++ +S S++
Sbjct: 421 DNFKNKKEMGT----SLVGTPKAAEMKLMDDFAEMEKLAMVASTIDNRPGSSPICSSDSI 480
Query: 481 GKPKSVETELKRCYPEAMSKSSNPGSCLSYPDVISGDISTSKVPDWLQNISKMVLDQSSF 540
VE E EA +K+S L+ DI + +P L + K V++
Sbjct: 481 SATGPVENESNENSSEA-TKTSGTVYSLNPDASPKDDIKSDSLPQSLHIVLKAVMEHKHI 540
Query: 541 SKRDPEQMLEDIRAAMIHRSPEKLIDTELFANRCDEPNVPCNNGSMLPKPSGIDSVSDAN 600
++R+ +++LEDIR A+ N+ S +++ +
Sbjct: 541 TQRNTDEVLEDIRKAL----------------------SSVNHSSFSTNHQETKTLTVED 600
Query: 601 EVDITHQVDIRGSVSRLIELVEGISVSSLDDDKSSYKKDGSFYSETPTGYMVRVFQWKMS 660
+D+ + +I S+ R+I+++EG+ SL D++ ++ SE +GY RV QWK +
Sbjct: 601 RLDM--ECNISKSIHRIIDVIEGV---SLKDERHVSNRE----SERLSGYTARVLQWKTT 660
Query: 661 ELNTTLKQFMHNCYDLLNGKASIENFLQDLNSTLDWIMNHCFSLQDVSSMRESIKKHFDW 720
EL++ L++F+ CYDLL+ KA ++ F Q+L+S L+W++NHCFSLQDVS+MR+ IKK F+W
Sbjct: 661 ELSSVLQRFLQACYDLLDRKADMKKFAQELSSVLEWMVNHCFSLQDVSTMRDEIKKQFEW 720
Query: 721 DESRSDCDLETGTMVHVSEVDKSRVSTGELQSTLTEEHRKLKEELTSVESAKNDLEAKFQ 780
DESRS +++ G VSE +K R E+++ + +E K
Sbjct: 721 DESRSGSEVDIGIFRQVSEAEKLRT-----------------EDVSFLACKDQLIEDKPG 780
Query: 781 STNGARETRTNQLQESEKKIVSLEKELETLRVLKGTIEGQIADQQVVNHDLDAQLTAAKN 840
+ N +R+T +E+E D +A++N
Sbjct: 781 NQNLSRKT--------------VEEEAN-----------------------DKTASASEN 840
Query: 841 ELNETRRKFTALEVELDNKNNFFEELEATCLELQLQLESTRKQNPSMDLVQEEKQLRTEW 900
EL ++E++ +RTE
Sbjct: 841 ELK----------------------------------------------LEEKQNMRTEL 891
Query: 901 EITTASERLAECQETILNLGKQLKALATPKEAALLDKVIL-NPNDETQTLSVSTTTTTPT 960
EI ASE+LAECQETILNLGKQLKAL KE ALL + ++ + D++ L
Sbjct: 901 EIAAASEKLAECQETILNLGKQLKALTNSKETALLSETLMYDVTDKSNNL---------- 891
Query: 961 PMTDTASTPTVSNIKTTNNRFSLLDQMLAED-DAFPKDHEMPKPVEVDANHTSTSDPDKA 1020
P + T + T+ R SLLDQM AED + + P+ + + ++S ++
Sbjct: 961 PDAQPSHETTKPEKRLTSQRSSLLDQMKAEDHNTGESKDQKPQAADKNGKGGNSSVYNET 891
Query: 1021 IDPQKAVLIWNGHKNGVDKDTVGNLAIVPSRKQGDGDGGLWRKLLWRKKKGRSHK 1066
I+ + +L+ + K G D + AIVP +K G G LWRKLL R KKG+S K
Sbjct: 1021 IEALEQILLSDKSK-GSDSNC---FAIVPQKKTG-GVKSLWRKLLGRNKKGKSKK 891
BLAST of CmaCh02G005220 vs. TAIR 10
Match:
AT1G19835.1 (Plant protein of unknown function (DUF869) )
HSP 1 Score: 300.8 bits (769), Expect = 4.3e-81
Identity = 311/1049 (29.65%), Postives = 500/1049 (47.66%), Query Frame = 0
Query: 1 MDQKSWLWRKKSSEK------------------IIVSSDK-VNLSVNKNEEETLLIDKAR 60
MD+KSW W+KKSSEK I +S D+ NL+ K+E ++ +
Sbjct: 1 MDRKSWPWKKKSSEKTATVTEVVDQENGKKPSYIQISFDQYTNLNGLKDEVKSYEEKVTK 60
Query: 61 LEKDLEIANDKLSVALSDCKTKDELVKKLTNMEQEAIARWEKAKYEVAILKQDLNDAVQK 120
LE ++ + KLS A +D K+ LVK+ + + +EA+ WEKA+ E + LK L
Sbjct: 61 LEDQIKDLDLKLSTANADIVAKEVLVKQHSKVAEEAVTGWEKAEAEASALKTHLETITLA 120
Query: 121 RLAGEERLIHLDAALKECMQQLRFVREEQEQRIHDAVSKTSNEFEKTRKILEEKLADTCK 180
+L E+R HLD ALKECM+Q+R ++EE EQ++HD ++ +N+ + R E ++ + +
Sbjct: 121 KLTVEDRAAHLDGALKECMRQIRSLKEENEQKLHDVIATKTNQMDNLRAEFESRIGEYEE 180
Query: 181 RLSKLGGENTQLSKALLVKDKMIEDLNRELIGVETDLNALVSRLESTEKEKGSLKYEVRV 240
L + G EN LS++L + M+ ++ E E+++ L + +ES E+E +LKYE V
Sbjct: 181 ELLRCGAENDALSRSLQERSNMLMRISEEKSQAESEIEHLKNNIESCEREINTLKYETHV 240
Query: 241 LEKEVEIRNEEREFHRRTADASHKQHLEGVKKIAKLESECQRLRLLVRKRLPGPAALVKM 300
+ KE+EIRNEE+ R+A+A++KQHLEGVKKIAKLE+ECQRLR LVRK+LPGPAAL +M
Sbjct: 241 ITKELEIRNEEKNMSMRSAEAANKQHLEGVKKIAKLEAECQRLRTLVRKKLPGPAALAQM 300
Query: 301 KNEVEMLG----RDSFEIRRRQSNPTSSLDSSLEISPETPNEHL----------NVATCR 360
K EVE LG R RR P+S L S + + L ++ T R
Sbjct: 301 KMEVESLGFGDHRQDHRQRRSPVRPSSPLMSPMSHMSQVSEFSLDNMQKFHKENDLLTER 360
Query: 361 VSALEEENCALKEALNKKNNELQVIKIMQA------QASSLQIASPHELSNGQKVMESGK 420
+ A+EEE LKEAL K+N+ELQV + + A Q Q+ S G ++
Sbjct: 361 LLAMEEETKMLKEALAKRNSELQVSRNLCAKTANRLQTLEAQMMSKSPTKRGFEMPAEIF 420
Query: 421 SGLTLSELP-VALMSDAGSDDKGSSAELWASPLISEFEHFNNGKLKGSPTTCKIVGSSDL 480
S S P +A MS+ G++D S A S L + N K+K K ++ L
Sbjct: 421 SRQNASNPPSMASMSEDGNEDARSVAGSLMSELSQSNKDKANAKIK------KTESANQL 480
Query: 481 ELMDDFVEMEKLAIVSVEKSAANSHFLSNEVNGKPKSVETELKRCYPEAMSKSSNPGSCL 540
ELMDDF+EMEKLA + +A S S+ + T+LK+
Sbjct: 481 ELMDDFLEMEKLACLPNGSNANGSTDHSSADSDAEIPPATQLKK---------------- 540
Query: 541 SYPDVISGDISTSKVPDWLQNISKMVLDQSSFSKRDPEQMLEDIRAAMIHRSPEKLIDTE 600
++ + LQ++ K ++F E++L +I+ A + + KL
Sbjct: 541 -------------RISNVLQSLPK----DAAF-----EKILAEIQCA-VKDAGVKLPSKS 600
Query: 601 LFANRCDEPNVPCNNGSMLPKPSGIDSVSDANEVDITHQV--DIRGSVSRLIELVEGISV 660
AN NG K + + + +V I + ++ ++S++ + V +S
Sbjct: 601 HGANL---------NGLTEEKVIAMSNETTEEKVTIVEVITQELSDALSQIYQFVTYLS- 660
Query: 661 SSLDDDKSSYKKDGSFYSETPTGYMVRVFQWKMSELNTTLKQFMHNCYDLLNGKASIENF 720
K+ + SE R F K+ E +TT + +L + ++ +F
Sbjct: 661 -----------KEATACSEN------RTFSQKVQEFSTTFE-------GVLGKEKTLVDF 720
Query: 721 LQDLNSTL--------DWIMNHCFSLQ-------DVSSMRE--SIKKHFDWDESRSDCDL 780
L DL+ L D + H +++ D ++ E +++K + ++ C
Sbjct: 721 LFDLSRVLVEASELKIDVLGFHTSTVEIHSPDCIDKVALPENKALQKDSSGEHYQNGCSQ 780
Query: 781 ETGTMVHVSEVDKSRVSTGELQSTL------TEEHRKLKEELTSVESAKNDLEAKFQSTN 840
+ SE+ T + L TEE LK E ES EA ++T
Sbjct: 781 SSD-----SEIPDDCNGTSGYEPKLATCKFTTEEFEGLKLEKEKAESNLASCEADLEATK 840
Query: 841 GARETRTNQLQESEKKIVSLEKELETLRVLKGTIEGQIADQQVVNHDLDAQLTAAKNELN 900
+LQE+EK + ++ +LE+ + G E Q+ L+ + + + EL
Sbjct: 841 -------TKLQETEKLLAEVKSDLESAQKSNGMGETQLKCMVESYRSLETRSSELEIELT 900
Query: 901 ETRRKFTALEVELDNKNNFFEELEATCLELQLQLESTRKQNPSMDLVQEEKQLRTEWEIT 960
+ K LE EL ++ E A C EL+ QL+ + P+ +++++ + + + E+
Sbjct: 901 SLKGKIENLEDELHDEKENHREALAKCQELEEQLQRNNQNCPNCSVIEDDPKSKQDNELA 958
Query: 961 TASERLAECQETILNLGKQLKALATPKE----AALLDKVILNPNDE-------TQTLSVS 974
A+E+LAECQETIL LGKQLK++ E + ++ LNP +E Q +S
Sbjct: 961 AAAEKLAECQETILLLGKQLKSMCPQTEQVASSPSQEQQALNPEEEEYATSTNPQDSKLS 958
BLAST of CmaCh02G005220 vs. TAIR 10
Match:
AT1G19835.2 (Plant protein of unknown function (DUF869) )
HSP 1 Score: 300.8 bits (769), Expect = 4.3e-81
Identity = 311/1049 (29.65%), Postives = 500/1049 (47.66%), Query Frame = 0
Query: 1 MDQKSWLWRKKSSEK------------------IIVSSDK-VNLSVNKNEEETLLIDKAR 60
MD+KSW W+KKSSEK I +S D+ NL+ K+E ++ +
Sbjct: 1 MDRKSWPWKKKSSEKTATVTEVVDQENGKKPSYIQISFDQYTNLNGLKDEVKSYEEKVTK 60
Query: 61 LEKDLEIANDKLSVALSDCKTKDELVKKLTNMEQEAIARWEKAKYEVAILKQDLNDAVQK 120
LE ++ + KLS A +D K+ LVK+ + + +EA+ WEKA+ E + LK L
Sbjct: 61 LEDQIKDLDLKLSTANADIVAKEVLVKQHSKVAEEAVTGWEKAEAEASALKTHLETITLA 120
Query: 121 RLAGEERLIHLDAALKECMQQLRFVREEQEQRIHDAVSKTSNEFEKTRKILEEKLADTCK 180
+L E+R HLD ALKECM+Q+R ++EE EQ++HD ++ +N+ + R E ++ + +
Sbjct: 121 KLTVEDRAAHLDGALKECMRQIRSLKEENEQKLHDVIATKTNQMDNLRAEFESRIGEYEE 180
Query: 181 RLSKLGGENTQLSKALLVKDKMIEDLNRELIGVETDLNALVSRLESTEKEKGSLKYEVRV 240
L + G EN LS++L + M+ ++ E E+++ L + +ES E+E +LKYE V
Sbjct: 181 ELLRCGAENDALSRSLQERSNMLMRISEEKSQAESEIEHLKNNIESCEREINTLKYETHV 240
Query: 241 LEKEVEIRNEEREFHRRTADASHKQHLEGVKKIAKLESECQRLRLLVRKRLPGPAALVKM 300
+ KE+EIRNEE+ R+A+A++KQHLEGVKKIAKLE+ECQRLR LVRK+LPGPAAL +M
Sbjct: 241 ITKELEIRNEEKNMSMRSAEAANKQHLEGVKKIAKLEAECQRLRTLVRKKLPGPAALAQM 300
Query: 301 KNEVEMLG----RDSFEIRRRQSNPTSSLDSSLEISPETPNEHL----------NVATCR 360
K EVE LG R RR P+S L S + + L ++ T R
Sbjct: 301 KMEVESLGFGDHRQDHRQRRSPVRPSSPLMSPMSHMSQVSEFSLDNMQKFHKENDLLTER 360
Query: 361 VSALEEENCALKEALNKKNNELQVIKIMQA------QASSLQIASPHELSNGQKVMESGK 420
+ A+EEE LKEAL K+N+ELQV + + A Q Q+ S G ++
Sbjct: 361 LLAMEEETKMLKEALAKRNSELQVSRNLCAKTANRLQTLEAQMMSKSPTKRGFEMPAEIF 420
Query: 421 SGLTLSELP-VALMSDAGSDDKGSSAELWASPLISEFEHFNNGKLKGSPTTCKIVGSSDL 480
S S P +A MS+ G++D S A S L + N K+K K ++ L
Sbjct: 421 SRQNASNPPSMASMSEDGNEDARSVAGSLMSELSQSNKDKANAKIK------KTESANQL 480
Query: 481 ELMDDFVEMEKLAIVSVEKSAANSHFLSNEVNGKPKSVETELKRCYPEAMSKSSNPGSCL 540
ELMDDF+EMEKLA + +A S S+ + T+LK+
Sbjct: 481 ELMDDFLEMEKLACLPNGSNANGSTDHSSADSDAEIPPATQLKK---------------- 540
Query: 541 SYPDVISGDISTSKVPDWLQNISKMVLDQSSFSKRDPEQMLEDIRAAMIHRSPEKLIDTE 600
++ + LQ++ K ++F E++L +I+ A + + KL
Sbjct: 541 -------------RISNVLQSLPK----DAAF-----EKILAEIQCA-VKDAGVKLPSKS 600
Query: 601 LFANRCDEPNVPCNNGSMLPKPSGIDSVSDANEVDITHQV--DIRGSVSRLIELVEGISV 660
AN NG K + + + +V I + ++ ++S++ + V +S
Sbjct: 601 HGANL---------NGLTEEKVIAMSNETTEEKVTIVEVITQELSDALSQIYQFVTYLS- 660
Query: 661 SSLDDDKSSYKKDGSFYSETPTGYMVRVFQWKMSELNTTLKQFMHNCYDLLNGKASIENF 720
K+ + SE R F K+ E +TT + +L + ++ +F
Sbjct: 661 -----------KEATACSEN------RTFSQKVQEFSTTFE-------GVLGKEKTLVDF 720
Query: 721 LQDLNSTL--------DWIMNHCFSLQ-------DVSSMRE--SIKKHFDWDESRSDCDL 780
L DL+ L D + H +++ D ++ E +++K + ++ C
Sbjct: 721 LFDLSRVLVEASELKIDVLGFHTSTVEIHSPDCIDKVALPENKALQKDSSGEHYQNGCSQ 780
Query: 781 ETGTMVHVSEVDKSRVSTGELQSTL------TEEHRKLKEELTSVESAKNDLEAKFQSTN 840
+ SE+ T + L TEE LK E ES EA ++T
Sbjct: 781 SSD-----SEIPDDCNGTSGYEPKLATCKFTTEEFEGLKLEKEKAESNLASCEADLEATK 840
Query: 841 GARETRTNQLQESEKKIVSLEKELETLRVLKGTIEGQIADQQVVNHDLDAQLTAAKNELN 900
+LQE+EK + ++ +LE+ + G E Q+ L+ + + + EL
Sbjct: 841 -------TKLQETEKLLAEVKSDLESAQKSNGMGETQLKCMVESYRSLETRSSELEIELT 900
Query: 901 ETRRKFTALEVELDNKNNFFEELEATCLELQLQLESTRKQNPSMDLVQEEKQLRTEWEIT 960
+ K LE EL ++ E A C EL+ QL+ + P+ +++++ + + + E+
Sbjct: 901 SLKGKIENLEDELHDEKENHREALAKCQELEEQLQRNNQNCPNCSVIEDDPKSKQDNELA 958
Query: 961 TASERLAECQETILNLGKQLKALATPKE----AALLDKVILNPNDE-------TQTLSVS 974
A+E+LAECQETIL LGKQLK++ E + ++ LNP +E Q +S
Sbjct: 961 AAAEKLAECQETILLLGKQLKSMCPQTEQVASSPSQEQQALNPEEEEYATSTNPQDSKLS 958
BLAST of CmaCh02G005220 vs. TAIR 10
Match:
AT1G47900.1 (Plant protein of unknown function (DUF869) )
HSP 1 Score: 270.8 bits (691), Expect = 4.7e-72
Identity = 284/956 (29.71%), Postives = 473/956 (49.48%), Query Frame = 0
Query: 21 DKVNLSVNKNEEETLLIDKARLEKDLEIANDKLSVALSDCKTKDELVKKLTNMEQEAIAR 80
D + V++ EE+ +L +D+E N+KLSVA + TK+ LVK+ + + ++A++
Sbjct: 95 DAYDEKVHEYEEQV-----QKLNEDVEDLNEKLSVANEEIVTKEALVKQHSKVAEDAVSG 154
Query: 81 WEKAKYEVAILKQDLNDAVQKRLAGEERLIHLDAALKECMQQLRFVREEQEQRIHDAVSK 140
WEKA E LK L +L E+R HLD ALKECM+Q+R ++++ E ++HD
Sbjct: 155 WEKADAEALALKNTLESVTLSKLTAEDRAAHLDGALKECMRQIRNLKKDHEVKLHDVALS 214
Query: 141 TSNEFEKTRKILEEKLADTCKRLSKLGGENTQLSKALLVKDKMIEDLNRELIGVETDLNA 200
+ + EK E+++ D + L + ++ LS+ L + M+ ++ E + ++
Sbjct: 215 KTKQIEKMTMEFEKRMCDYEQELLRSAADSDALSRTLQERSNMLVKVSEEKSRADAEIET 274
Query: 201 LVSRLESTEKEKGSLKYEVRVLEKEVEIRNEEREFHRRTADASHKQHLEGVKKIAKLESE 260
L S LE E+E SLKYEV V+ KE+EIRNEE+ R+A++++KQHLEGVKKIAKLE+E
Sbjct: 275 LKSNLEMCEREIKSLKYEVHVVSKELEIRNEEKNMCIRSAESANKQHLEGVKKIAKLEAE 334
Query: 261 CQRLRLLVRKRLPGPAALVKMKNEVEMLGRDSFEIRRRQS-----------------NPT 320
CQRLR LVRK+LPGPAAL +MK EVE LGRDS + R+++S
Sbjct: 335 CQRLRSLVRKKLPGPAALAQMKLEVENLGRDSGDARQKRSPVKVSSPCKSPGGYSSTGSE 394
Query: 321 SSLDSSLEISPETPNEHLNVATCRVSALEEENCALKEALNKKNNELQVIKIMQAQASS-- 380
SLD++ + E NE L T R+ A+EEE LKEAL K+N+EL + + AQ++S
Sbjct: 395 FSLDNAQKFQKE--NEFL---TERLLAMEEETKMLKEALAKRNSELLESRNLCAQSTSKL 454
Query: 381 LQIASPHELSNGQKVMESGKSGLTLSELPVAL-MSDAGSDDKGS-SAELWASPLISEFEH 440
+ + + +N QK L S ++ +S+ G+DD GS S L +P +
Sbjct: 455 QSLEAQLQQNNSQKSSLEVCPNLNTSNPSSSISVSEDGNDDSGSCSGSLSTNPSQQIKKE 514
Query: 441 FNNGKLKGSPTTCKIVGSSDLELMDDFVEMEKLAIVSVEKSAANSHFLSNEVNGKPKSVE 500
+ L+ + +S +ELMDDF+EMEKLA + S++N S + +G KS E
Sbjct: 515 KDMAALERVESV-----NSHVELMDDFLEMEKLACLP-NLSSSNGSIDSKDGSGDQKS-E 574
Query: 501 TELKRCYPEAMSKSSNPGSCLSYPDVISGDISTSKVPDWLQNISKMVLDQSSFSKRDPEQ 560
+ + + + + + +S + S PD +I K+V D +
Sbjct: 575 MVILDAHTDLEDSDRGSPAVMKFRSRLS-KVLESVSPD--ADIQKIV--------GDIKC 634
Query: 561 MLEDIRAAMIHRSPEKL-IDTELFANRCDEPNVPCNNGSMLPKPSGIDSVSDANEVDITH 620
+L+D+ A M P ++ + E ++ C E N+ D ++ H
Sbjct: 635 ILQDVNACMDQEKPSEVHVHPEEVSDLCPEQNLV------------EDCHLAEQKLQSIH 694
Query: 621 QVDIRGSVSRLIELVEGISVSSLDDDKSSYKKDGSFYSETPTGYMVRVFQWKMSELNTTL 680
Q D++ +VSR+ + V + +S +G+ + E G+ V F +S + +L
Sbjct: 695 Q-DLKNAVSRIHDFVLLLRNEVKAGQDTSI--EGNDFVELIEGFSV-TFNHVLSG-DKSL 754
Query: 681 KQFMHNCYDLLNGKASIENFLQDL-NSTLDWIMNHCFSLQDVSSMRESIKKHFDWDESRS 740
F+ N ++ N + + L +S ++ + C D ++ ES K D D S+
Sbjct: 755 DDFVSNLANVFNEAMERKVSFRGLASSEVETLSPDCI---DKVALPES--KVVDKDSSQ- 814
Query: 741 DCDLETGTMVHVS---EVDKSRVSTGELQSTLTEEHRKLKEELTSVESAKNDLEAKFQST 800
++ VH D++RVS E S L E +E +++ E
Sbjct: 815 --EIYQNGCVHNEPGVPCDENRVSGYESDSKLQE-----------IEELRSEKEKMAVDI 874
Query: 801 NGARETRTNQLQESEKKIVSLEKELETLRVLKGTIEGQIADQQVVNHDLDAQLTAAKNEL 860
G + QLQESE+ + + + ++ + + Q+ L+++ + ++
Sbjct: 875 EGLK----CQLQESEQLLADIRSQFDSAQRSNRLADTQLRCMTESYRSLESRAADLEIDV 934
Query: 861 NETRRKFTALEVELDNKNNFFEELEATCLELQLQLESTRKQNPSMDLVQEEK---QLRTE 920
N+ + K LE EL+++ +E C EL+ ++ R +N S+ +E+ + + E
Sbjct: 935 NQLKEKIQKLENELEDEKCNHQEAILRCHELEEHIQ--RHRNTSLVAEDDEEADIKSKQE 980
Query: 921 WEITTASERLAECQETILNLGKQLKALATPKEAALLDKVILNPNDETQTLSVSTTT 948
E++ A+E+LAECQETI LGKQLK+ E + E + L +TT+
Sbjct: 995 RELSAAAEKLAECQETIFVLGKQLKSFRPQPEQMRSPQTRNESYSEEEELGTTTTS 980
BLAST of CmaCh02G005220 vs. TAIR 10
Match:
AT1G47900.2 (Plant protein of unknown function (DUF869) )
HSP 1 Score: 268.9 bits (686), Expect = 1.8e-71
Identity = 283/956 (29.60%), Postives = 472/956 (49.37%), Query Frame = 0
Query: 21 DKVNLSVNKNEEETLLIDKARLEKDLEIANDKLSVALSDCKTKDELVKKLTNMEQEAIAR 80
D + V++ EE+ +L +D+E N+KLSVA + TK+ LVK+ + + ++A++
Sbjct: 95 DAYDEKVHEYEEQV-----QKLNEDVEDLNEKLSVANEEIVTKEALVKQHSKVAEDAVSG 154
Query: 81 WEKAKYEVAILKQDLNDAVQKRLAGEERLIHLDAALKECMQQLRFVREEQEQRIHDAVSK 140
WEKA E LK L +L E+R HLD ALKECM+Q+R ++++ E ++HD
Sbjct: 155 WEKADAEALALKNTLESVTLSKLTAEDRAAHLDGALKECMRQIRNLKKDHEVKLHDVALS 214
Query: 141 TSNEFEKTRKILEEKLADTCKRLSKLGGENTQLSKALLVKDKMIEDLNRELIGVETDLNA 200
+ + EK E+++ D + L + ++ LS+ L + M+ ++ E + ++
Sbjct: 215 KTKQIEKMTMEFEKRMCDYEQELLRSAADSDALSRTLQERSNMLVKVSEEKSRADAEIET 274
Query: 201 LVSRLESTEKEKGSLKYEVRVLEKEVEIRNEEREFHRRTADASHKQHLEGVKKIAKLESE 260
L S LE E+E SLKYEV V+ KE+EIRNEE+ R+A++++KQHLEGVKKIAKLE+E
Sbjct: 275 LKSNLEMCEREIKSLKYEVHVVSKELEIRNEEKNMCIRSAESANKQHLEGVKKIAKLEAE 334
Query: 261 CQRLRLLVRKRLPGPAALVKMKNEVEMLGRDSFEIRRRQS-----------------NPT 320
CQRLR LVRK+LPGPAAL +MK EVE LGRDS + R+++S
Sbjct: 335 CQRLRSLVRKKLPGPAALAQMKLEVENLGRDSGDARQKRSPVKVSSPCKSPGGYSSTGSE 394
Query: 321 SSLDSSLEISPETPNEHLNVATCRVSALEEENCALKEALNKKNNELQVIKIMQAQASS-- 380
SLD++ + E NE L T R+ A+EEE LKEAL K+N+EL + + AQ++S
Sbjct: 395 FSLDNAQKFQKE--NEFL---TERLLAMEEETKMLKEALAKRNSELLESRNLCAQSTSKL 454
Query: 381 LQIASPHELSNGQKVMESGKSGLTLSELPVAL-MSDAGSDDKGS-SAELWASPLISEFEH 440
+ + + +N QK L S ++ +S+ G+DD GS S L +P +
Sbjct: 455 QSLEAQLQQNNSQKSSLEVCPNLNTSNPSSSISVSEDGNDDSGSCSGSLSTNPSQQIKKE 514
Query: 441 FNNGKLKGSPTTCKIVGSSDLELMDDFVEMEKLAIVSVEKSAANSHFLSNEVNGKPKSVE 500
+ L+ + +S +ELMDDF+EMEKLA + S++N S + +G KS E
Sbjct: 515 KDMAALERVESV-----NSHVELMDDFLEMEKLACLP-NLSSSNGSIDSKDGSGDQKS-E 574
Query: 501 TELKRCYPEAMSKSSNPGSCLSYPDVISGDISTSKVPDWLQNISKMVLDQSSFSKRDPEQ 560
+ + + + + + +S + S PD +I K+V D +
Sbjct: 575 MVILDAHTDLEDSDRGSPAVMKFRSRLS-KVLESVSPD--ADIQKIV--------GDIKC 634
Query: 561 MLEDIRAAMIHRSPEKL-IDTELFANRCDEPNVPCNNGSMLPKPSGIDSVSDANEVDITH 620
+L+D+ A M P ++ + E ++ C E N+ D ++ H
Sbjct: 635 ILQDVNACMDQEKPSEVHVHPEEVSDLCPEQNLV------------EDCHLAEQKLQSIH 694
Query: 621 QVDIRGSVSRLIELVEGISVSSLDDDKSSYKKDGSFYSETPTGYMVRVFQWKMSELNTTL 680
Q D++ +VSR+ + V + +S +G+ + E G+ V F +S + +L
Sbjct: 695 Q-DLKNAVSRIHDFVLLLRNEVKAGQDTSI--EGNDFVELIEGFSV-TFNHVLSG-DKSL 754
Query: 681 KQFMHNCYDLLNGKASIENFLQDL-NSTLDWIMNHCFSLQDVSSMRESIKKHFDWDESRS 740
F+ N ++ N + + L +S ++ + C D ++ ES K D D S+
Sbjct: 755 DDFVSNLANVFNEAMERKVSFRGLASSEVETLSPDCI---DKVALPES--KVVDKDSSQ- 814
Query: 741 DCDLETGTMVHVS---EVDKSRVSTGELQSTLTEEHRKLKEELTSVESAKNDLEAKFQST 800
++ VH D++RVS E S L E +E +++ E
Sbjct: 815 --EIYQNGCVHNEPGVPCDENRVSGYESDSKLQE-----------IEELRSEKEKMAVDI 874
Query: 801 NGARETRTNQLQESEKKIVSLEKELETLRVLKGTIEGQIADQQVVNHDLDAQLTAAKNEL 860
G + QLQESE+ + + + ++ + + Q+ L+++ + ++
Sbjct: 875 EGLK----CQLQESEQLLADIRSQFDSAQRSNRLADTQLRCMTESYRSLESRAADLEIDV 934
Query: 861 NETRRKFTALEVELDNKNNFFEELEATCLELQLQLESTRKQNPSMDLVQEEK---QLRTE 920
N+ + K LE EL+++ +E C EL+ ++ +N S+ +E+ + + E
Sbjct: 935 NQLKEKIQKLENELEDEKCNHQEAILRCHELEEHIQ----RNTSLVAEDDEEADIKSKQE 978
Query: 921 WEITTASERLAECQETILNLGKQLKALATPKEAALLDKVILNPNDETQTLSVSTTT 948
E++ A+E+LAECQETI LGKQLK+ E + E + L +TT+
Sbjct: 995 RELSAAAEKLAECQETIFVLGKQLKSFRPQPEQMRSPQTRNESYSEEEELGTTTTS 978
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9SLN1 | 1.5e-150 | 37.77 | Filament-like plant protein 7 OS=Arabidopsis thaliana OX=3702 GN=FPP7 PE=3 SV=2 | [more] |
Q0WSY2 | 6.0e-80 | 29.65 | Filament-like plant protein 4 OS=Arabidopsis thaliana OX=3702 GN=FPP4 PE=1 SV=1 | [more] |
Q9C698 | 6.7e-71 | 29.71 | Filament-like plant protein 6 OS=Arabidopsis thaliana OX=3702 GN=FPP6 PE=1 SV=1 | [more] |
O65649 | 3.4e-59 | 27.83 | Filament-like plant protein 5 OS=Arabidopsis thaliana OX=3702 GN=FPP5 PE=2 SV=2 | [more] |
Q9MA92 | 8.8e-31 | 28.11 | Filament-like plant protein 3 OS=Arabidopsis thaliana OX=3702 GN=FPP3 PE=1 SV=2 | [more] |