Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTCCCACAAGCAAAAATATGCTCTCAAGCGGTCAAATCCTTCCATTGATAATCAATCACATTCAGAAATTCTTTCATATTCCTTTCTTCTTCAACCTCCCGATAAATCCTTTCATTTCCCTCTCCCTTTTTCTCTCTCTAGAAATCTCTGTGCCCTACAATGGCGACTCTCGCGGATATTGGAGTTTCCGCCTTAATCAATATCATCACTGCGTTTGTGTTCCTTCTTGCTTTTGCGATTCTGCGGATTCAACCGATCAACGACAGAGTTTATTTTCCGAAATGGTACATCAATGGCGGACGAAATAGCCCTCGCGGCTCCGGAAGTTCTGTGCGGAAATATGTTAATCTCAACATTTGGACTTACTTCACTTTCTTGAAATGGATGCCGGCGGCTTTGAAGATGAGCGAAGATGAGATTATTAGCCATGCTGGATTTGATTCTGCTGTTTTTCTCAGGATTTATACTCTCGGGTTTGTTCATCTTCTCTCCCTGTTGTTCTTGTTTGTTCATCTTCTTTTGCTGTGTGAATCTGTTCTGTTTGGGGATTTTGGCCATTTTTGTGTTTTTATAAATGGATTTTGCTCGTTTTTCACTGTGTGTTTCTGAAGATTGAGCTAGTTTGCTTGAATTTATTTATGGTGTTCTTCATCTTTGTTAAAGGATTCGTGGATTCTTTTTCCCAGATTTTGTATAGAGAAACAGAGTTCTAGAACCTACGGCTCTTTTTTTTTTTTTTTAAGATATTGTTTTTTCTTACTTTTCAATTGGTGGGCCGGAGCACCGCGTTCCAAACGGGCGGGTAATACCCATGGACGGGAATTTATTCCCGCCCCACTATTTTATATGTATATATATATTTCATTTGAGTGAGTATGTTCGAACATTTAAAAAAAAATGTTTAAATATTATAGAAATTATATTAGAAAATCAAAGTTATCTATTAAAAATTATTTCAAATTAATAAAAAAAGTTAAAAATATTAAAAAAAATGAAATTTAAATGAATTATATTTTTCTTATGAAATATTTTTGAATTTATTAAAATATTTTATATTTTTAATATAATTATTTCAGAAAATAAAAATTATCGGAAATTAACAAATAATAATAATAAATATAAATATTAAAAAATGAAGAAAAAAAATTCTAATAAAAAAAAATCGAGAATAGAAAGGAGCTATTTAAACTTTGGACAAATTTTTTTATTTATTTCTTTAAGTACGTTGAATTTTTTAAAAAAAAAATGTTGAAAACTGAGCTATATTATTAAATTTTAAATTTTTAGAAATTATAAAACTTTAGTATAATATTATATTTATTTGAATGTTTTATATTTTTAATAGAAATTATATTAAAAATGAAAACAATCTATTAAATATTCTTCAAAATTAAAAAATAAATGGAGAATATGTAGTGGTTGTTTTAGTATTTAGGATTTTGTTGGTTGAAAAAAAAGTAGTTCTGGAAGGCGAATTTTGTTGGTTTTATAAACAATTTATAAGGTATTTCTTTCTCACTATGTCTATTGGCAGGTTGAAGATATTCTTCCCAATAGCTATTGTTGCGCTACTTGTTCTCATCCCAGTCAATGTGTCTAGTGGGACACTCTCCTTCCTGCGGAAAGAATTGGTTGTAAGCAACATTGATAAGCTTTCAATCTCCAATGTTCGTCCTCATTCTATAAGGTCAGCTTATCCCCTCATCTTAAAGCACAAATACATGAACGATATCCCACATCAGTTGGAGATAAGAGTCATACATTCCTTATAAGGGTGTGGAAACCTCTACTAGTAGATGCATTTTAGAACTTTGAGGAAAAAATGCAAAAAGGAAAGCTAAAAAAGGACAATATCTGCTAGTAATGGGCTTGGGCTAGTACAAATGGTATGACTGCCAAACATTGGGCAGGGTGCCAATGAAGATATTGGCCCCTAGTGGGGTGAATTGTGAGATCTCGCATGGAGAAACAGATACCGGGTGGTGTGCCAGTGGGAACATTGGGCCCTTAATGGGGTGGATTATTGAGATCCCACATCAGTTGGAGAAACAAACACCGGATGGTTTGCCAGCGAGGACGTTGGGCCCCAGTGGGGGTGGATTAATGAGATCCCACATCAGTTGGAGAAACAAACACCGGATGGTTTGCCAGCGAGGACGTTGGGCCCCCAGTGGGGGTGTATTGTGAGATCCCACATCGATTGGAGAACCAGACTTCGAGCAGTGTGCCAGTGGGACGCTCATCCCTGCAGTGTGCCAGTGGGACGCTCATCCCTGCAGTGTGCCAGTGGGACGCTCATCCCTGCAGTGTGCCAGCGAGGATGCTGGTCCCTAATGGGAGTGGATTGTGAGGTCTCACATTGGTTGGAGAAACAGACATCGGGCGGTGTGTCAGCAAGGACGTGGGCCCCTAATGAGAGTGGATTGTGAGATCCCACATCGATGGGAGAAACAAACACTCGGCGGTGTGCCAACAAGAACGCTGACCCCCCTAATAGGAGTGGATTGTAAGATCCCACATCGATTGGGGCTTGGGCTATTATATTTTCTATCAATTTTGGTGTTAGAAAATGCTGAAATATGTGTGTTTATGCCAAATTATTGCATTCTTGTTTCAGGTTTTTTGCTCATATAGGATTGGAGTATTTGTTTACCCTATGGATTTGTTTCATGCTTTACAAAGAATATGACAATGTAGCACGAATGAGATTGAATTTCTTGGCGTCGCAACGTAGATGTGCTGATCAATTTACTGTGAGTCAATCCTTATCTTCACAAAATTCTTTTAAAAACTCAAATTTCAATATTACAATATATCTCTTATTTATATATATATATATATTAAATAAGGTGCGGTGGTGGAGAGAGAAGACGGGATGGGGAATGTCTCGCCCCATTTAACTAATGAGGAAAAGTTTCTCCCTAACTCCCTCCCCCGAACTGACATCTCTCTTTTATATATATTTTACAGGTGTTGGTTAGAAACGTACCACGTTCATCGGGTCGCTCGAACTCAGATATAGTTGATCAGTACTTCCACAAAAATCATCCCCAACACTATCTTTCTCATCAGGTAATCAATCATAATACCATAGTTAGATATATTTAAGGCATGATTCATGGCCATCTTTTCTCATTTCTTGTTTTTAGGCTGTATATAATGCCAACAAGCTTGCTAAACTTGCAAAAAAGAGAGCAAGGCTCCAGAACTGGTTGGACTACAATCTCTTGAAGTTCGAACGGCACCCCGATCAGAGACCAACTCGAAGGGTACCCAACTTCCCAGCCTTCAAACCAATCAGTTCGTTTTCATGGCCAGAGATTTGAACTATTGATCTCTTTCTTTCAGGCAGGATGGTTTGGACTTTGCGGTAAACGAGTCGACTCGATCGAGTATTACAAACGACAAATGAAGGATCTCGATGCCCGAGTAAGCTAGTTCTCCATATAGTCCTTAAATTTTTGTTCATTATAAAGCATTCTAAATTGATGTGAACCCCATGTGTTAGATGGCATTGGAGAGGCCGAAAATTGTCAAAGATCCGAAAGCAATATTGCCTGTTTCTTTTGTTTCGTTTAAGTCTCGATGGGGTGCTGCCGTTTGTGCACAGACTCAGCAGAGTAAGAATCCCACGTTATGGCTCACTAATTGGGCTCCTGAGCCTCATGATGTTTATTGGCAGAATTTGGCTATACCATTTGTTTCTGTTAGCATCAGAAAACTACTCATATCGTTGTCGGTTTTCGCTCTCGTGTTCTTCTATATGATACCGATTGCGTTCGTACAATCACTTGCCAATTTGGAAGGTCTCGAACGAGTGGCTCCTTTCCTAAGACCCGTCATAGAATTGTAAGTCATTATGACGCTGATATCTTGCATGTTGTTTTCTCTGTATTTTGTGCTTTATATAGATGTCCATAAGGCTGGGCGGAGATAGGAAACCTTTCCCATCTCTGATATATATTTTTATCTTCATATGTAAAAAAGTGAAGATCTGGTGGGACAGGGCTGGGAATTGGACAGAGACGGGGATGGGGAATACAATCCCCGTCCCTAGCCCGTATATCTAATGGGAATAAATCTCTTCCTTTCTAACAAAAAGAATTCCGTCCCTGGGAAATGCATAGTGTTATCTTTGTATTAGTGCTCCATAGAGATGTCCATAAGGCAGGGCGGAGATAGGGAAGCCTTCCCCATTCCCGATATCTATCTTTATCTTGGAGGTATAGTGGACAGGGCTGTGAATTGGGCAGAGACGGGGATCCCCATCCTTGGCCCCAAATGTCTAATGGGAAGAAATCTCTTCCTACTCCTTCCCCCATTCCTCGTTCAATCAAACTGGAAAAAATCTCTCCTCATTCTCCTTTTGGAACGAGGATTCCCGCACTACTCGAGGTGAGTCCCTGCAGGGTTAATGGACACCTGCAAAATACATATATAGGGAAGCCTTCCCCATCCCCTATATATATTTTTATCTTCATATGCTAGGAATTGGCCCCATATATCTAATGGGAAGAATCCGTCCGTCTGAATTGCACATTGTTTTCTCTGAATGTGTGCTCCATAGAGATGTCCATAGGGCAGAGATAGATAGGGAAGTCTTTCCCATCCTTGATATATATCTTTTATCTTCATATGTAAAAAATTGGAGATATGGTGGGACAGGAATGGGAATTGGGCCCCGTCCGTGGCCCCATACATATAACGGGAAGAAATCTCTTCCTACTCTCTCCCCCATTCCCGTTCAAATGGGAAAAAATCTCTCCCCATTCTCATTTTGAACAGAGATTCCCACACTACTCGGGGTTAATGGACACTTGCAATATATATATAGGGAAGCCTTCCTCATCCACGATATACATTTTTATCTTCGTATGCTAGGAATTTGACAGAAACGGGGATGGAGAATACAATCCCCGTCCTTGGCCCCGTACATCTAACGGGAAGAAATCTTTCCCCTCTCCCTCCCTCATTCTCCTTTTTTCAGGATTCCCTGCACTATTCAGTGCAGATTCCAGCAGGACAAATAGACATCTCTACTGCTCGGGTTTTGATCCATTTGATTACTTTCTTGTGTTACGCAGGAAGCTCGTAAAATCGTTTCTACAGGGCTTCCTTCCTGGTTTGGCTCTCAAAATCTTTCTATTTATACTGCCAAAAGTTCTAATGATCATGTCCAAAATTGAGGGGCATGTAGCAGTTTCTATGCTGGAAAGAAGGGCAGCGGCTAAGTACTATTACTTCATGCTGGTAAATGTGTTCTTGGGAAGTATTGTGACTGGTACAGCTTTTGAGCAACTGGATGCCTTCATTCACCAATCTCCTACACAGTAATGCCTCACCCCCTACATTCAATGATATGCATTTCACCATGATATTAAGCCTTCTTATGCACTTTTTTACAGAATTCCTCGGACAATCGGAGTTTCCATACCGATGAAGGCGACGTTCTTCATTACGTACATAATGGTCGATGGATGGGCCGGAATGGCGAGCGAGATTCTTCGATTGAAACCGTTGGTCATCTTTCATCTCAAGAATCTCTTTTTGGTGAAAACTGATAGAGATAGAGAGAAGGCAATGAACCCAAAAGGTGTGGATTTTCCTGAAACTTTACCAACCTTACAATTGTACTTTCTACTCGGAATTGTCTACGCCGTCGTCACCCCGATTCTTCTCCCGTTTATACTCGTCTTCTTCGCGTTTGCATACTTGGTTTACCGACATCAGGTTCGTTATCTAGTACTCGAGTTATTTGACTATTTAGTATTTTCATTGTACTCATTAGAAAGAGCAAAAGTACTCGAGCACACTTAGAGATGTCCGTTTACCCGCAAGGACTTGTAAGGACCTGCTGTGAACAGGGCAAGGAATCCCTATTAAAACTAAGATTAGTGCACACTCAGAGATGTCAATTTACCCTACGAGGTCCTGTAGGGACCCGCCCCGAACAGGGAAAGGAATCCCTATTAAAACTGAGAATAGTGCACACTCAAAGATGTCCATTTACCCTACGAGGTCCTGTAGGGACCCGTCCCAAACAGGGTAAGGAATCCCTATTAAAACTGAGAATAGTGCACACTCAGAGATGTCCATTTACCCTACAAGGTCCTGTAGGGACCCGCTCCGAACAGGGCAAGGAATCCCTGCTTGAACTGAGAATAGTGCACACTTAGAGATGTCTATTTACCCTACAAAGTCCTATAGGGTCTTTCCCAAGGCGAGGAATCCCTATTTAAATTGGGAACGAAGAGATAGTAGGGAGAGATTCCTTCCTATTAGATAAATTGAGCCGGAAACTAGAATTGTATTCTCCGTCCCTACCCATGGACATCTCTAGGCACACTTTACCAATCCAACCAACCCAAACTTTTGGGTTGATCCAAAAAATTTCCTCAACCCAACCCAACTTGGACCATTTTCACCCCTACCGGGAACTAGGATTGTATTCTCTGTCTACCTCCTGCCCTGTCCTTGTACCGTGGACATCTTTAGGCACAATTCAATACATAGTTTCTCTTATTATTGCGATCGTGAAGTAACCCAAAGTTTTGGGTTGGGTTACCAATCCAACCAACCCGAAGTTTTGGGTTGATCCAAAAAATTTCCTCAGCAAACATGGGATAACAAATCTAGTCTTTCACTAAGATTTCGAATAGCTGTCCCGAAACCCATCCCAACCCGGGCCATGTACAACCCTACCTAAAACTAGGATTGTATTGTCCATCTACCCCTTGCACTGCCCTTGTCCTTTCTCCATGGACACCTTTCGGCGCACTTGGATACATATTTTTTCTTATTGTTGTGATTGTGAAGTAACCCAAACTTTTGGGTTGGTCCAAAAAATTCTCTCAACCCAATCAAATCCGAACCATGTACACCTCTACCAAAAACTAGGGTTGTATTGTCCGTCTACCCCCTGTCCTGTCCCTGTCCCGTGGACATCTCTAGGCACACTTCAATACATATTTTATCTTATTGTCATGATCATGAAGTAACACGGTTGACGTACTCGGTTTTTCTCAATCCTGTAGTTTTTGTCTTGCTGAATCATGCATTGTCAATGCTGGTTTGTTTGTGCAGATCATCAATGTATATAATCAGCAATATGAGAGTGTTGGTGCCTTTTGGCCCCATGTCCATGGCCGCATCATAGCAAGCTTGTTGATATCTCAGTTACTTTTATTGGGCTTGCTCAGTACAAAAAAGGCAGCCAATTCTACTCCCTTGCTCGTCGCCTTACCGATATTGACATTTTTCTTCCACAAGTACTGCAAGAACCGGTTCGAACCCGCCTTTCGTAAATACCCTCTCGAGGTAAATTCTAAGTTCTTCAAGTCTAGAACGATTTCATACGTTTAAAAGTATCGTTAATCTTTAAACTCTTTTAAAAAGTTAAAAATTACTCTAACCGTTAGTACTTTGTTTAAAAAATATCCATGAACTTTCACGGTTTTCCCTAAAGTTACATATGCTATTGTTTTCTTTTGCTCGTTTAGGAAGCAATGGCCAAAGATACAATGGAGCGGAGCACGGAACCTGACCTCGACGTAAAAGCATTCTTATCAGATGCATACTTGCATCCAATTTTCAGGTCTGTAGAGGAAGAAGAATTAGCCGAGATTAAAGTTGAGAAACAAAAATCTCCAATACAAGAAGCAACTTGTTCATGAGAGCGAAGATCATTCTAAATAAGTTGAAAAAAAACTTATTTTTAACTATGACCAAAACCAAACCAATTCAAGAATAGTTCAGTAGATAAACTACTAACTACCGTTCCAACAATCGATACGGTTCCATCCTCCACCCGACCATTGTTGAACTAAAAAATTCAATTTCTCGACTCACACAAACCGGGTCGACTTTTTAAGCCTTTGCTGATTGAGTTTGAGATAGTTGTTTGGAGGCTATAATTTGGTGATAAATGTCTCTAATTTTCAAGCCAATGATGGTCTGAAAAAGGCCTGTTCCATTATTGCTTCCTGGAAATTCAGAATGCTCCAATATTTGTTGTGTAATTTCACCATAAAATTTGGTGGAAATGTTGCTTAGATGGCTCTCTACATATATAAGCTCATCCAAT
mRNA sequence
ATTCCCACAAGCAAAAATATGCTCTCAAGCGGTCAAATCCTTCCATTGATAATCAATCACATTCAGAAATTCTTTCATATTCCTTTCTTCTTCAACCTCCCGATAAATCCTTTCATTTCCCTCTCCCTTTTTCTCTCTCTAGAAATCTCTGTGCCCTACAATGGCGACTCTCGCGGATATTGGAGTTTCCGCCTTAATCAATATCATCACTGCGTTTGTGTTCCTTCTTGCTTTTGCGATTCTGCGGATTCAACCGATCAACGACAGAGTTTATTTTCCGAAATGGTACATCAATGGCGGACGAAATAGCCCTCGCGGCTCCGGAAGTTCTGTGCGGAAATATGTTAATCTCAACATTTGGACTTACTTCACTTTCTTGAAATGGATGCCGGCGGCTTTGAAGATGAGCGAAGATGAGATTATTAGCCATGCTGGATTTGATTCTGCTGTTTTTCTCAGGATTTATACTCTCGGGTTGAAGATATTCTTCCCAATAGCTATTGTTGCGCTACTTGTTCTCATCCCAGTCAATGTGTCTAGTGGGACACTCTCCTTCCTGCGGAAAGAATTGGTTGTAAGCAACATTGATAAGCTTTCAATCTCCAATGTTCGTCCTCATTCTATAAGGTTTTTTGCTCATATAGGATTGGAGTATTTGTTTACCCTATGGATTTGTTTCATGCTTTACAAAGAATATGACAATGTAGCACGAATGAGATTGAATTTCTTGGCGTCGCAACGTAGATGTGCTGATCAATTTACTGTGTTGGTTAGAAACGTACCACGTTCATCGGGTCGCTCGAACTCAGATATAGTTGATCAGTACTTCCACAAAAATCATCCCCAACACTATCTTTCTCATCAGGCTGTATATAATGCCAACAAGCTTGCTAAACTTGCAAAAAAGAGAGCAAGGCTCCAGAACTGGTTGGACTACAATCTCTTGAAGTTCGAACGGCACCCCGATCAGAGACCAACTCGAAGGGCAGGATGGTTTGGACTTTGCGGTAAACGAGTCGACTCGATCGAGTATTACAAACGACAAATGAAGGATCTCGATGCCCGAATGGCATTGGAGAGGCCGAAAATTGTCAAAGATCCGAAAGCAATATTGCCTGTTTCTTTTGTTTCGTTTAAGTCTCGATGGGGTGCTGCCGTTTGTGCACAGACTCAGCAGAGTAAGAATCCCACGTTATGGCTCACTAATTGGGCTCCTGAGCCTCATGATGTTTATTGGCAGAATTTGGCTATACCATTTGTTTCTGTTAGCATCAGAAAACTACTCATATCGTTGTCGGTTTTCGCTCTCGTGTTCTTCTATATGATACCGATTGCGTTCGTACAATCACTTGCCAATTTGGAAGGTCTCGAACGAGTGGCTCCTTTCCTAAGACCCGTCATAGAATTGAAGCTCGTAAAATCGTTTCTACAGGGCTTCCTTCCTGGTTTGGCTCTCAAAATCTTTCTATTTATACTGCCAAAAGTTCTAATGATCATGTCCAAAATTGAGGGGCATGTAGCAGTTTCTATGCTGGAAAGAAGGGCAGCGGCTAAGTACTATTACTTCATGCTGGTAAATGTGTTCTTGGGAAGTATTGTGACTGGTACAGCTTTTGAGCAACTGGATGCCTTCATTCACCAATCTCCTACACAAATTCCTCGGACAATCGGAGTTTCCATACCGATGAAGGCGACGTTCTTCATTACGTACATAATGGTCGATGGATGGGCCGGAATGGCGAGCGAGATTCTTCGATTGAAACCGTTGGTCATCTTTCATCTCAAGAATCTCTTTTTGGTGAAAACTGATAGAGATAGAGAGAAGGCAATGAACCCAAAAGGTGTGGATTTTCCTGAAACTTTACCAACCTTACAATTGTACTTTCTACTCGGAATTGTCTACGCCGTCGTCACCCCGATTCTTCTCCCGTTTATACTCGTCTTCTTCGCGTTTGCATACTTGGTTTACCGACATCAGATCATCAATGTATATAATCAGCAATATGAGAGTGTTGGTGCCTTTTGGCCCCATGTCCATGGCCGCATCATAGCAAGCTTGTTGATATCTCAGTTACTTTTATTGGGCTTGCTCAGTACAAAAAAGGCAGCCAATTCTACTCCCTTGCTCGTCGCCTTACCGATATTGACATTTTTCTTCCACAAGTACTGCAAGAACCGGTTCGAACCCGCCTTTCGTAAATACCCTCTCGAGGAAGCAATGGCCAAAGATACAATGGAGCGGAGCACGGAACCTGACCTCGACGTAAAAGCATTCTTATCAGATGCATACTTGCATCCAATTTTCAGGTCTGTAGAGGAAGAAGAATTAGCCGAGATTAAAGTTGAGAAACAAAAATCTCCAATACAAGAAGCAACTTGTTCATGAGAGCGAAGATCATTCTAAATAAGTTGAAAAAAAACTTATTTTTAACTATGACCAAAACCAAACCAATTCAAGAATAGTTCAGTAGATAAACTACTAACTACCGTTCCAACAATCGATACGGTTCCATCCTCCACCCGACCATTGTTGAACTAAAAAATTCAATTTCTCGACTCACACAAACCGGGTCGACTTTTTAAGCCTTTGCTGATTGAGTTTGAGATAGTTGTTTGGAGGCTATAATTTGGTGATAAATGTCTCTAATTTTCAAGCCAATGATGGTCTGAAAAAGGCCTGTTCCATTATTGCTTCCTGGAAATTCAGAATGCTCCAATATTTGTTGTGTAATTTCACCATAAAATTTGGTGGAAATGTTGCTTAGATGGCTCTCTACATATATAAGCTCATCCAAT
Coding sequence (CDS)
ATGGCGACTCTCGCGGATATTGGAGTTTCCGCCTTAATCAATATCATCACTGCGTTTGTGTTCCTTCTTGCTTTTGCGATTCTGCGGATTCAACCGATCAACGACAGAGTTTATTTTCCGAAATGGTACATCAATGGCGGACGAAATAGCCCTCGCGGCTCCGGAAGTTCTGTGCGGAAATATGTTAATCTCAACATTTGGACTTACTTCACTTTCTTGAAATGGATGCCGGCGGCTTTGAAGATGAGCGAAGATGAGATTATTAGCCATGCTGGATTTGATTCTGCTGTTTTTCTCAGGATTTATACTCTCGGGTTGAAGATATTCTTCCCAATAGCTATTGTTGCGCTACTTGTTCTCATCCCAGTCAATGTGTCTAGTGGGACACTCTCCTTCCTGCGGAAAGAATTGGTTGTAAGCAACATTGATAAGCTTTCAATCTCCAATGTTCGTCCTCATTCTATAAGGTTTTTTGCTCATATAGGATTGGAGTATTTGTTTACCCTATGGATTTGTTTCATGCTTTACAAAGAATATGACAATGTAGCACGAATGAGATTGAATTTCTTGGCGTCGCAACGTAGATGTGCTGATCAATTTACTGTGTTGGTTAGAAACGTACCACGTTCATCGGGTCGCTCGAACTCAGATATAGTTGATCAGTACTTCCACAAAAATCATCCCCAACACTATCTTTCTCATCAGGCTGTATATAATGCCAACAAGCTTGCTAAACTTGCAAAAAAGAGAGCAAGGCTCCAGAACTGGTTGGACTACAATCTCTTGAAGTTCGAACGGCACCCCGATCAGAGACCAACTCGAAGGGCAGGATGGTTTGGACTTTGCGGTAAACGAGTCGACTCGATCGAGTATTACAAACGACAAATGAAGGATCTCGATGCCCGAATGGCATTGGAGAGGCCGAAAATTGTCAAAGATCCGAAAGCAATATTGCCTGTTTCTTTTGTTTCGTTTAAGTCTCGATGGGGTGCTGCCGTTTGTGCACAGACTCAGCAGAGTAAGAATCCCACGTTATGGCTCACTAATTGGGCTCCTGAGCCTCATGATGTTTATTGGCAGAATTTGGCTATACCATTTGTTTCTGTTAGCATCAGAAAACTACTCATATCGTTGTCGGTTTTCGCTCTCGTGTTCTTCTATATGATACCGATTGCGTTCGTACAATCACTTGCCAATTTGGAAGGTCTCGAACGAGTGGCTCCTTTCCTAAGACCCGTCATAGAATTGAAGCTCGTAAAATCGTTTCTACAGGGCTTCCTTCCTGGTTTGGCTCTCAAAATCTTTCTATTTATACTGCCAAAAGTTCTAATGATCATGTCCAAAATTGAGGGGCATGTAGCAGTTTCTATGCTGGAAAGAAGGGCAGCGGCTAAGTACTATTACTTCATGCTGGTAAATGTGTTCTTGGGAAGTATTGTGACTGGTACAGCTTTTGAGCAACTGGATGCCTTCATTCACCAATCTCCTACACAAATTCCTCGGACAATCGGAGTTTCCATACCGATGAAGGCGACGTTCTTCATTACGTACATAATGGTCGATGGATGGGCCGGAATGGCGAGCGAGATTCTTCGATTGAAACCGTTGGTCATCTTTCATCTCAAGAATCTCTTTTTGGTGAAAACTGATAGAGATAGAGAGAAGGCAATGAACCCAAAAGGTGTGGATTTTCCTGAAACTTTACCAACCTTACAATTGTACTTTCTACTCGGAATTGTCTACGCCGTCGTCACCCCGATTCTTCTCCCGTTTATACTCGTCTTCTTCGCGTTTGCATACTTGGTTTACCGACATCAGATCATCAATGTATATAATCAGCAATATGAGAGTGTTGGTGCCTTTTGGCCCCATGTCCATGGCCGCATCATAGCAAGCTTGTTGATATCTCAGTTACTTTTATTGGGCTTGCTCAGTACAAAAAAGGCAGCCAATTCTACTCCCTTGCTCGTCGCCTTACCGATATTGACATTTTTCTTCCACAAGTACTGCAAGAACCGGTTCGAACCCGCCTTTCGTAAATACCCTCTCGAGGAAGCAATGGCCAAAGATACAATGGAGCGGAGCACGGAACCTGACCTCGACGTAAAAGCATTCTTATCAGATGCATACTTGCATCCAATTTTCAGGTCTGTAGAGGAAGAAGAATTAGCCGAGATTAAAGTTGAGAAACAAAAATCTCCAATACAAGAAGCAACTTGTTCATGA
Protein sequence
MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSSVRKYVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLVLIPVNVSSGTLSFLRKELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKEYDNVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVYNANKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKDLDARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVYWQNLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKLVKSFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGSIVTGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVIFHLKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAFAYLVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVALPILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIFRSVEEEELAEIKVEKQKSPIQEATCS
Homology
BLAST of CmaCh16G012020 vs. ExPASy Swiss-Prot
Match:
Q9FVQ5 (CSC1-like protein At1g32090 OS=Arabidopsis thaliana OX=3702 GN=At1g32090 PE=1 SV=1)
HSP 1 Score: 1099.0 bits (2841), Expect = 0.0e+00
Identity = 556/740 (75.14%), Postives = 640/740 (86.49%), Query Frame = 0
Query: 1 MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSS-VR 60
MATL DIGVSALIN+ AF+FL+AFA+LRIQPINDRVYFPKWY+ G RNSPR S + V
Sbjct: 1 MATLQDIGVSALINLFGAFLFLIAFAVLRIQPINDRVYFPKWYLTGERNSPRRSDRTLVG 60
Query: 61 KYVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLV 120
K+VNLN TYFTFL WMP A+KMSE EII HAG DSA+FLRIYTLGLKIF P+ ++AL+V
Sbjct: 61 KFVNLNYKTYFTFLNWMPQAMKMSESEIIRHAGLDSAIFLRIYTLGLKIFAPVMVLALVV 120
Query: 121 LIPVNVSSGTLSFLRKELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKEY 180
L+PVNVSSGTL FL+KELVVSNIDKLSISNV+P S +FF HI +EY+FT W CFMLY+EY
Sbjct: 121 LVPVNVSSGTLFFLKKELVVSNIDKLSISNVQPKSSKFFFHIAVEYIFTFWACFMLYREY 180
Query: 181 DNVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVYN 240
+NVA MRL +LASQRR +QFTV+VRNVP G S D VDQ+F NHP+HYL HQAVYN
Sbjct: 181 NNVAIMRLQYLASQRRRPEQFTVVVRNVPDMPGHSVPDTVDQFFKTNHPEHYLCHQAVYN 240
Query: 241 ANKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKDL 300
AN AKL K+RA+LQ W DY +LK +R+P ++PT R G+ GL GKRVDSIEYYK+Q+K+
Sbjct: 241 ANTYAKLVKQRAKLQRWFDYYVLKHQRNPHKQPTCRTGFLGLWGKRVDSIEYYKQQIKEF 300
Query: 301 DARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVYW 360
D M+LER K++KD K +LPV+FVSF SRWGAAVCAQTQQSKNPTLWLT+ APEP D+YW
Sbjct: 301 DHNMSLERQKVLKDSKLMLPVAFVSFDSRWGAAVCAQTQQSKNPTLWLTSSAPEPRDIYW 360
Query: 361 QNLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKLV 420
QNLAIPF+S++IRKL+I +SVFALVFFYMIPIAFVQSLANLEGL+RVAPFLRPV L +
Sbjct: 361 QNLAIPFISLTIRKLVIGVSVFALVFFYMIPIAFVQSLANLEGLDRVAPFLRPVTRLDFI 420
Query: 421 KSFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGSI 480
KSFLQGFLPGLALKIFL+ILP VL+IMSKIEG++A+S LERRAAAKYYYFMLVNVFLGSI
Sbjct: 421 KSFLQGFLPGLALKIFLWILPTVLLIMSKIEGYIALSTLERRAAAKYYYFMLVNVFLGSI 480
Query: 481 VTGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVIF 540
+ GTAFEQL +F+HQSP+QIPRTIGVSIPMKATFFITYIMVDGWAG+A EILRLKPLVIF
Sbjct: 481 IAGTAFEQLHSFLHQSPSQIPRTIGVSIPMKATFFITYIMVDGWAGIAGEILRLKPLVIF 540
Query: 541 HLKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAFA 600
HLKN+F+VKT+ DR +AM+P VDF ET+P+LQLYFLLGIVY VTPILLPFIL+FFAFA
Sbjct: 541 HLKNMFIVKTEEDRVRAMDPGFVDFKETIPSLQLYFLLGIVYTAVTPILLPFILIFFAFA 600
Query: 601 YLVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVAL 660
YLVYRHQIINVYNQQYES GAFWPHVHGRIIASLLISQLLL+GLL++KKAA+STPLL+ L
Sbjct: 601 YLVYRHQIINVYNQQYESCGAFWPHVHGRIIASLLISQLLLMGLLASKKAADSTPLLIIL 660
Query: 661 PILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIFRSVE 720
PILT FHKYCK+RFEPAFR+YPLEEAMAKD +E+ TEP+L++KA L+DAYLHPIF S E
Sbjct: 661 PILTLSFHKYCKHRFEPAFRQYPLEEAMAKDKLEKETEPELNMKADLADAYLHPIFHSFE 720
Query: 721 EEELAEIKVEKQKSPIQEAT 740
+E +K QE T
Sbjct: 721 KEVELSSSSSSEKETHQEET 740
BLAST of CmaCh16G012020 vs. ExPASy Swiss-Prot
Match:
Q9LVE4 (CSC1-like protein At3g21620 OS=Arabidopsis thaliana OX=3702 GN=At3g21620 PE=2 SV=1)
HSP 1 Score: 901.0 bits (2327), Expect = 9.1e-261
Identity = 438/734 (59.67%), Postives = 577/734 (78.61%), Query Frame = 0
Query: 1 MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSSVRK 60
MATL DIGV+A INI+TAF F +AFAILR+QP+NDRVYFPKWY+ G R+SP +G K
Sbjct: 1 MATLTDIGVAATINILTAFAFFIAFAILRLQPVNDRVYFPKWYLKGLRSSPIKTGGFASK 60
Query: 61 YVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLVL 120
+VNL+ +Y FL WMP AL+M E E+I HAG DS V+LRIY LGLKIFFPIA +A V+
Sbjct: 61 FVNLDFRSYIRFLNWMPQALRMPEPELIDHAGLDSVVYLRIYLLGLKIFFPIACIAFTVM 120
Query: 121 IPVNVSSGTLSFLRKELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKEYD 180
+PVN ++ TL L K L S+IDKLSISN+ S RF+ H+ + Y+ T W CF+L +EY
Sbjct: 121 VPVNWTNSTLDQL-KNLTFSDIDKLSISNIPTGSSRFWVHLCMAYVITFWTCFVLQREYK 180
Query: 181 NVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVYNA 240
++A MRL FLAS+ R DQFTVLVRN+P S S++V+ +F NHP +YL++QAVYNA
Sbjct: 181 HIASMRLQFLASEHRRPDQFTVLVRNIPPDPDESVSELVEHFFKVNHPDYYLTYQAVYNA 240
Query: 241 NKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKDLD 300
NKL++L +KR +LQNWLDY K R+P +RP + G+ G G+ VD+I++Y +++ L
Sbjct: 241 NKLSELVQKRMKLQNWLDYYQNKHSRNPSKRPLIKIGFLGCWGEEVDAIDHYIEKIEGLT 300
Query: 301 ARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVYWQ 360
+++ E+ ++ K+++P +FVSFK RWGA VC+QTQQS+NPT WLT WAPEP D+YW
Sbjct: 301 RKISEEKETVMSSTKSLVPAAFVSFKKRWGAVVCSQTQQSRNPTEWLTEWAPEPRDIYWD 360
Query: 361 NLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKLVK 420
NLA+P+V ++IR+L+I+++ F L FF+MIPIAFVQ+LAN+EG+E+ PFL+P+IE+K VK
Sbjct: 361 NLALPYVQLTIRRLVIAVAFFFLTFFFMIPIAFVQTLANIEGIEKAVPFLKPLIEVKTVK 420
Query: 421 SFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGSIV 480
SF+QGFLPG+ALKIFL +LP +LM+MSK EG ++ S LERR A++YY F +NVFL SI+
Sbjct: 421 SFIQGFLPGIALKIFLIVLPSILMLMSKFEGFISKSSLERRCASRYYMFQFINVFLCSII 480
Query: 481 TGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVIFH 540
GTA +QLD+F++QS T+IP+TIGVSIPMKATFFITYIMVDGWAG+A EILRLKPL+I+H
Sbjct: 481 AGTALQQLDSFLNQSATEIPKTIGVSIPMKATFFITYIMVDGWAGVAGEILRLKPLIIYH 540
Query: 541 LKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAFAY 600
LKN FLVKT++DRE+AM+P + F P +QLYF+LG+VYA V+PILLPFILVFFA AY
Sbjct: 541 LKNFFLVKTEKDREEAMDPGTIGFNTGEPQIQLYFILGLVYAAVSPILLPFILVFFALAY 600
Query: 601 LVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVALP 660
+VYRHQIINVYNQ+YES AFWP VH R++ +L++SQLLL+GLLSTKKAA STPLL LP
Sbjct: 601 VVYRHQIINVYNQEYESAAAFWPDVHRRVVIALIVSQLLLMGLLSTKKAARSTPLLFILP 660
Query: 661 ILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIFRSVEE 720
+LT FHK+C+ R++P F YPL++AM KDT+ER EP+L++K FL +AY HP+F++
Sbjct: 661 VLTIGFHKFCQGRYQPIFVTYPLQDAMVKDTLERMREPNLNLKTFLQNAYAHPVFKAA-- 720
Query: 721 EELAEIKVEKQKSP 735
+ LA V ++ +P
Sbjct: 721 DNLANEMVVEEPAP 731
BLAST of CmaCh16G012020 vs. ExPASy Swiss-Prot
Match:
F4HYR3 (CSC1-like protein At1g62320 OS=Arabidopsis thaliana OX=3702 GN=At1g62320 PE=3 SV=2)
HSP 1 Score: 898.7 bits (2321), Expect = 4.5e-260
Identity = 441/723 (61.00%), Postives = 575/723 (79.53%), Query Frame = 0
Query: 1 MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSSVRK 60
MATLADIG++A INI++A +FLL FAILRIQP NDRVYFPKWY+ G R+SP SG+ V K
Sbjct: 1 MATLADIGLAAAINILSALIFLLLFAILRIQPFNDRVYFPKWYLKGVRSSPVNSGAFVSK 60
Query: 61 YVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLVL 120
+NL+ +Y FL WMP ALKM E E+I HAG DSAV+LRIY +GLKIF PIA+++ +L
Sbjct: 61 IMNLDFRSYVRFLNWMPDALKMPEPELIDHAGLDSAVYLRIYLIGLKIFGPIALLSWSIL 120
Query: 121 IPVNVSSGTLSFLR-KELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKEY 180
+PVN +S L + + + SNIDKLSISNV S RF+AH+ + Y FT W C++L KEY
Sbjct: 121 VPVNWTSDGLQLAKLRNVTSSNIDKLSISNVERGSDRFWAHLVMAYAFTFWTCYVLMKEY 180
Query: 181 DNVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVYN 240
+ +A MRL+FL S++R ADQFTVLVRNVP S S S+ V +F NHP HYL+HQ VYN
Sbjct: 181 EKIAAMRLSFLQSEKRRADQFTVLVRNVPPDSDESISENVQHFFLVNHPDHYLTHQVVYN 240
Query: 241 ANKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKDL 300
AN+LAKL + + ++QNWLDY LK+ R+ +QRP + G+ GL GK+VD++++Y +++ L
Sbjct: 241 ANELAKLVEDKKKMQNWLDYYQLKYTRNKEQRPRVKMGFLGLWGKKVDAMDHYTAEIEKL 300
Query: 301 DARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVYW 360
++ ER +I KD K+++ +FVSFK+RWGAAVCAQTQQ+KNPT WLT WAPE ++YW
Sbjct: 301 SEQIMEERKRIKKDDKSVMQAAFVSFKTRWGAAVCAQTQQTKNPTEWLTEWAPEAREMYW 360
Query: 361 QNLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKLV 420
NLA+P+VS+++R+ ++ ++ F L FF++IPIAFVQSLA++EG+E+ APFL P+++ KL+
Sbjct: 361 PNLAMPYVSLTVRRFVMHIAFFFLTFFFIIPIAFVQSLASIEGIEKSAPFLSPIVKNKLM 420
Query: 421 KSFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGSI 480
KS +QGFLPG+ LK+FL LP +LMIMSK EG +++S LERRAA +YY F LVNVFLGS+
Sbjct: 421 KSLIQGFLPGIVLKLFLIFLPTILMIMSKFEGFISISSLERRAAFRYYIFNLVNVFLGSV 480
Query: 481 VTGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVIF 540
+TG+AFEQLD+F+ QS IPRT+GV+IP+KATFFITYIMVDGWAG+A EI RLKPLVIF
Sbjct: 481 ITGSAFEQLDSFLKQSANDIPRTVGVAIPIKATFFITYIMVDGWAGVAGEIFRLKPLVIF 540
Query: 541 HLKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAFA 600
HLKN F VKT++DRE+AM+P +DF T P +QLYFLLG+VYA VTP+LLPFI+ FF FA
Sbjct: 541 HLKNFFFVKTEKDREEAMDPGQIDFYATEPRIQLYFLLGLVYAPVTPVLLPFIIFFFGFA 600
Query: 601 YLVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVAL 660
YLV+RHQIINVYNQ+YES GAFWP VHGRII++L+ISQ+LLLGL+STK STP L+ L
Sbjct: 601 YLVFRHQIINVYNQKYESAGAFWPDVHGRIISALIISQILLLGLMSTKGKVQSTPFLLVL 660
Query: 661 PILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIFRSVE 720
ILTF FH++CK R+E AF PL+EAM KDT+ER+ EP+L++K FL +AY+HP+F+ E
Sbjct: 661 AILTFGFHRFCKGRYESAFVINPLQEAMIKDTLERAREPNLNLKGFLQNAYVHPVFKDEE 720
Query: 721 EEE 723
+ +
Sbjct: 721 DSD 723
BLAST of CmaCh16G012020 vs. ExPASy Swiss-Prot
Match:
B5TYT3 (CSC1-like protein At1g11960 OS=Arabidopsis thaliana OX=3702 GN=At1g11960 PE=2 SV=1)
HSP 1 Score: 895.2 bits (2312), Expect = 5.0e-259
Identity = 436/723 (60.30%), Postives = 574/723 (79.39%), Query Frame = 0
Query: 1 MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSSVRK 60
MATL DIGV+A INI+TA +FLLAFAILRIQP NDRVYFPKWY+ G R+SP SG+ V K
Sbjct: 1 MATLGDIGVAAAINILTAIIFLLAFAILRIQPFNDRVYFPKWYLKGIRSSPLHSGALVSK 60
Query: 61 YVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLVL 120
+VN+N+ +Y FL WMPAALKM E E+I HAG DSAV+LRIY +GLKIF PIA++A +L
Sbjct: 61 FVNVNLGSYLRFLNWMPAALKMPEPELIDHAGLDSAVYLRIYLIGLKIFVPIALLAWSIL 120
Query: 121 IPVNVSSGTLSFLR-KELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKEY 180
+PVN +S L + + + S+IDKLSISN+ S RF+ H+ + Y FT W C++L KEY
Sbjct: 121 VPVNWTSHGLQLAKLRNVTSSDIDKLSISNIENGSDRFWTHLVMAYAFTFWTCYVLMKEY 180
Query: 181 DNVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVYN 240
+ VA MRL FL +++R DQFTVLVRNVP S SD V+ +F NHP HYL+HQ VYN
Sbjct: 181 EKVAAMRLAFLQNEQRRPDQFTVLVRNVPADPDESISDSVEHFFLVNHPDHYLTHQVVYN 240
Query: 241 ANKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKDL 300
AN LA L +++ QNWLDY LK+ R+ + +P + G+ GL GK+VD+I++Y +++ L
Sbjct: 241 ANDLAALVEQKKSTQNWLDYYQLKYTRNQEHKPRIKTGFLGLWGKKVDAIDHYIAEIEKL 300
Query: 301 DARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVYW 360
+ ++ ER K+ KD +++P +FVSFK+RWGAAV AQTQQS +PT WLT WAPE +V+W
Sbjct: 301 NEQIMEERKKVKKDDTSVMPAAFVSFKTRWGAAVSAQTQQSSDPTEWLTEWAPEAREVFW 360
Query: 361 QNLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKLV 420
NLAIP+VS+++R+L++ ++ F L FF+MIPIAFVQSLA++EG+E+ APFL+ +IE L
Sbjct: 361 SNLAIPYVSLTVRRLIMHIAFFFLTFFFMIPIAFVQSLASIEGIEKNAPFLKSIIENDLF 420
Query: 421 KSFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGSI 480
KS +QGFLPG+ LK+FL LP +LM+MSK EG V++S LERRAA +YY F L+NVFLGS+
Sbjct: 421 KSVIQGFLPGIVLKLFLIFLPSILMVMSKFEGFVSLSSLERRAAFRYYIFNLINVFLGSV 480
Query: 481 VTGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVIF 540
+TG+AFEQLD+F+ QS +IP+T+GV+IP+KATFFITYIMVDGWAG+A EILRLKPL+ F
Sbjct: 481 ITGSAFEQLDSFLKQSAKEIPKTVGVAIPIKATFFITYIMVDGWAGIAGEILRLKPLIFF 540
Query: 541 HLKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAFA 600
H+KN LVKT++DRE+AMNP +++ T P +QLYFLLG+VYA VTP+LLPFI++FFA A
Sbjct: 541 HIKNSLLVKTEKDREEAMNPGQINYHATEPRIQLYFLLGLVYAPVTPVLLPFIIIFFALA 600
Query: 601 YLVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVAL 660
YLV+RHQIINVYNQ+YES FWP VHGRII++L+I+Q+LL+GLLSTK AA STP L+ L
Sbjct: 601 YLVFRHQIINVYNQEYESAARFWPDVHGRIISALIIAQILLMGLLSTKGAAQSTPFLLFL 660
Query: 661 PILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIFRSVE 720
PI+TFFFH+YCK R+EPAF ++PL+EAM KDT+ER+ EP+ ++K +L AY+HP+F+ +
Sbjct: 661 PIITFFFHRYCKGRYEPAFLRHPLKEAMVKDTLERAREPNFNLKPYLQKAYIHPVFKDND 720
Query: 721 EEE 723
E+
Sbjct: 721 YED 723
BLAST of CmaCh16G012020 vs. ExPASy Swiss-Prot
Match:
Q9SY14 (CSC1-like protein At4g02900 OS=Arabidopsis thaliana OX=3702 GN=At4g02900 PE=3 SV=1)
HSP 1 Score: 893.3 bits (2307), Expect = 1.9e-258
Identity = 446/730 (61.10%), Postives = 572/730 (78.36%), Query Frame = 0
Query: 1 MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSSVRK 60
MA++ DIG+SA IN+++AF FL AFA+LR+QP+NDRVYFPKWY+ G R SP S + +
Sbjct: 1 MASVQDIGLSAAINLLSAFAFLFAFAMLRLQPVNDRVYFPKWYLKGIRGSPTRSRGIMTR 60
Query: 61 YVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLVL 120
+VNL+ TY FL WMPAAL+M E E+I HAG DSAV++RIY LGLK+F PI ++A VL
Sbjct: 61 FVNLDWTTYVKFLNWMPAALQMPEPELIEHAGLDSAVYIRIYLLGLKMFVPITLLAFGVL 120
Query: 121 IPVNVSSGTLSFLRKELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKEYD 180
+PVN + TL + +L SN+DKLSISNV P S RF+AHI + Y+ T W C++LY EY
Sbjct: 121 VPVNWTGETLENI-DDLTFSNVDKLSISNVPPGSPRFWAHITMTYVITFWTCYILYMEYK 180
Query: 181 NVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVYNA 240
VA MRL LA++ R DQ TVLVRNVP S ++ V+ +F NHP HYL HQ VYNA
Sbjct: 181 AVANMRLRHLAAESRRPDQLTVLVRNVPPDPDESVNEHVEHFFCVNHPDHYLCHQVVYNA 240
Query: 241 NKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKDLD 300
N LAKL +R +QNWL Y KFER P RPT + G+ G G VD+I++Y +M L
Sbjct: 241 NDLAKLVAQRKAMQNWLTYYENKFERKPSSRPTTKTGYGGFWGTTVDAIDFYTSKMDILA 300
Query: 301 ARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVYWQ 360
+ A+ER KI+ DPKAI+P +FVSF+SRWG AVCAQTQQ NPT+WLT WAPEP DV+W
Sbjct: 301 EQEAVEREKIMNDPKAIMPAAFVSFRSRWGTAVCAQTQQCHNPTIWLTEWAPEPRDVFWD 360
Query: 361 NLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKLVK 420
NLAIP+V +SIR+LL ++++F L+F +MIPIAFVQSLANLEG+++V PFL+PVIE+K VK
Sbjct: 361 NLAIPYVELSIRRLLTTVALFFLIFCFMIPIAFVQSLANLEGIQKVLPFLKPVIEMKTVK 420
Query: 421 SFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGSIV 480
S +QGFLPG+ALKIFL ILP +LM MS+IEG+ ++S L+RR+A KY++F++VNVFLGSI+
Sbjct: 421 SVIQGFLPGIALKIFLIILPTILMTMSQIEGYTSLSYLDRRSAEKYFWFIIVNVFLGSII 480
Query: 481 TGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVIFH 540
TGTAF+QL +F+ Q PT+IP+T+GVSIPMKATFFITYIMVDGWAG+A+EILR+ PLVIFH
Sbjct: 481 TGTAFQQLKSFLEQPPTEIPKTVGVSIPMKATFFITYIMVDGWAGIAAEILRVVPLVIFH 540
Query: 541 LKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAFAY 600
LKN FLVKT++DR++AM+P +DF + P +Q YFLLG+VYA V PILLPFI+VFFAFAY
Sbjct: 541 LKNTFLVKTEQDRQQAMDPGHLDFATSEPRIQFYFLLGLVYAAVAPILLPFIIVFFAFAY 600
Query: 601 LVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVALP 660
+V+RHQ+INVY+Q+YES +WP VH R+I L+ISQLL++GLLSTKK A T LL+ P
Sbjct: 601 VVFRHQVINVYDQKYESGARYWPDVHRRLIICLIISQLLMMGLLSTKKFAKVTALLLPQP 660
Query: 661 ILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIF----- 720
ILTF+F++YC RFE AF K+PL+EAM KDT+E++TEP+L++K +L DAY+HP+F
Sbjct: 661 ILTFWFYRYCAGRFESAFSKFPLQEAMVKDTLEKATEPNLNLKEYLKDAYVHPVFKGNDF 720
Query: 721 ---RSVEEEE 723
R V+EEE
Sbjct: 721 DRPRVVDEEE 729
BLAST of CmaCh16G012020 vs. TAIR 10
Match:
AT1G32090.1 (early-responsive to dehydration stress protein (ERD4) )
HSP 1 Score: 1099.0 bits (2841), Expect = 0.0e+00
Identity = 556/740 (75.14%), Postives = 640/740 (86.49%), Query Frame = 0
Query: 1 MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSS-VR 60
MATL DIGVSALIN+ AF+FL+AFA+LRIQPINDRVYFPKWY+ G RNSPR S + V
Sbjct: 1 MATLQDIGVSALINLFGAFLFLIAFAVLRIQPINDRVYFPKWYLTGERNSPRRSDRTLVG 60
Query: 61 KYVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLV 120
K+VNLN TYFTFL WMP A+KMSE EII HAG DSA+FLRIYTLGLKIF P+ ++AL+V
Sbjct: 61 KFVNLNYKTYFTFLNWMPQAMKMSESEIIRHAGLDSAIFLRIYTLGLKIFAPVMVLALVV 120
Query: 121 LIPVNVSSGTLSFLRKELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKEY 180
L+PVNVSSGTL FL+KELVVSNIDKLSISNV+P S +FF HI +EY+FT W CFMLY+EY
Sbjct: 121 LVPVNVSSGTLFFLKKELVVSNIDKLSISNVQPKSSKFFFHIAVEYIFTFWACFMLYREY 180
Query: 181 DNVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVYN 240
+NVA MRL +LASQRR +QFTV+VRNVP G S D VDQ+F NHP+HYL HQAVYN
Sbjct: 181 NNVAIMRLQYLASQRRRPEQFTVVVRNVPDMPGHSVPDTVDQFFKTNHPEHYLCHQAVYN 240
Query: 241 ANKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKDL 300
AN AKL K+RA+LQ W DY +LK +R+P ++PT R G+ GL GKRVDSIEYYK+Q+K+
Sbjct: 241 ANTYAKLVKQRAKLQRWFDYYVLKHQRNPHKQPTCRTGFLGLWGKRVDSIEYYKQQIKEF 300
Query: 301 DARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVYW 360
D M+LER K++KD K +LPV+FVSF SRWGAAVCAQTQQSKNPTLWLT+ APEP D+YW
Sbjct: 301 DHNMSLERQKVLKDSKLMLPVAFVSFDSRWGAAVCAQTQQSKNPTLWLTSSAPEPRDIYW 360
Query: 361 QNLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKLV 420
QNLAIPF+S++IRKL+I +SVFALVFFYMIPIAFVQSLANLEGL+RVAPFLRPV L +
Sbjct: 361 QNLAIPFISLTIRKLVIGVSVFALVFFYMIPIAFVQSLANLEGLDRVAPFLRPVTRLDFI 420
Query: 421 KSFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGSI 480
KSFLQGFLPGLALKIFL+ILP VL+IMSKIEG++A+S LERRAAAKYYYFMLVNVFLGSI
Sbjct: 421 KSFLQGFLPGLALKIFLWILPTVLLIMSKIEGYIALSTLERRAAAKYYYFMLVNVFLGSI 480
Query: 481 VTGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVIF 540
+ GTAFEQL +F+HQSP+QIPRTIGVSIPMKATFFITYIMVDGWAG+A EILRLKPLVIF
Sbjct: 481 IAGTAFEQLHSFLHQSPSQIPRTIGVSIPMKATFFITYIMVDGWAGIAGEILRLKPLVIF 540
Query: 541 HLKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAFA 600
HLKN+F+VKT+ DR +AM+P VDF ET+P+LQLYFLLGIVY VTPILLPFIL+FFAFA
Sbjct: 541 HLKNMFIVKTEEDRVRAMDPGFVDFKETIPSLQLYFLLGIVYTAVTPILLPFILIFFAFA 600
Query: 601 YLVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVAL 660
YLVYRHQIINVYNQQYES GAFWPHVHGRIIASLLISQLLL+GLL++KKAA+STPLL+ L
Sbjct: 601 YLVYRHQIINVYNQQYESCGAFWPHVHGRIIASLLISQLLLMGLLASKKAADSTPLLIIL 660
Query: 661 PILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIFRSVE 720
PILT FHKYCK+RFEPAFR+YPLEEAMAKD +E+ TEP+L++KA L+DAYLHPIF S E
Sbjct: 661 PILTLSFHKYCKHRFEPAFRQYPLEEAMAKDKLEKETEPELNMKADLADAYLHPIFHSFE 720
Query: 721 EEELAEIKVEKQKSPIQEAT 740
+E +K QE T
Sbjct: 721 KEVELSSSSSSEKETHQEET 740
BLAST of CmaCh16G012020 vs. TAIR 10
Match:
AT3G21620.1 (ERD (early-responsive to dehydration stress) family protein )
HSP 1 Score: 901.0 bits (2327), Expect = 6.5e-262
Identity = 438/734 (59.67%), Postives = 577/734 (78.61%), Query Frame = 0
Query: 1 MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSSVRK 60
MATL DIGV+A INI+TAF F +AFAILR+QP+NDRVYFPKWY+ G R+SP +G K
Sbjct: 1 MATLTDIGVAATINILTAFAFFIAFAILRLQPVNDRVYFPKWYLKGLRSSPIKTGGFASK 60
Query: 61 YVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLVL 120
+VNL+ +Y FL WMP AL+M E E+I HAG DS V+LRIY LGLKIFFPIA +A V+
Sbjct: 61 FVNLDFRSYIRFLNWMPQALRMPEPELIDHAGLDSVVYLRIYLLGLKIFFPIACIAFTVM 120
Query: 121 IPVNVSSGTLSFLRKELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKEYD 180
+PVN ++ TL L K L S+IDKLSISN+ S RF+ H+ + Y+ T W CF+L +EY
Sbjct: 121 VPVNWTNSTLDQL-KNLTFSDIDKLSISNIPTGSSRFWVHLCMAYVITFWTCFVLQREYK 180
Query: 181 NVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVYNA 240
++A MRL FLAS+ R DQFTVLVRN+P S S++V+ +F NHP +YL++QAVYNA
Sbjct: 181 HIASMRLQFLASEHRRPDQFTVLVRNIPPDPDESVSELVEHFFKVNHPDYYLTYQAVYNA 240
Query: 241 NKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKDLD 300
NKL++L +KR +LQNWLDY K R+P +RP + G+ G G+ VD+I++Y +++ L
Sbjct: 241 NKLSELVQKRMKLQNWLDYYQNKHSRNPSKRPLIKIGFLGCWGEEVDAIDHYIEKIEGLT 300
Query: 301 ARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVYWQ 360
+++ E+ ++ K+++P +FVSFK RWGA VC+QTQQS+NPT WLT WAPEP D+YW
Sbjct: 301 RKISEEKETVMSSTKSLVPAAFVSFKKRWGAVVCSQTQQSRNPTEWLTEWAPEPRDIYWD 360
Query: 361 NLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKLVK 420
NLA+P+V ++IR+L+I+++ F L FF+MIPIAFVQ+LAN+EG+E+ PFL+P+IE+K VK
Sbjct: 361 NLALPYVQLTIRRLVIAVAFFFLTFFFMIPIAFVQTLANIEGIEKAVPFLKPLIEVKTVK 420
Query: 421 SFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGSIV 480
SF+QGFLPG+ALKIFL +LP +LM+MSK EG ++ S LERR A++YY F +NVFL SI+
Sbjct: 421 SFIQGFLPGIALKIFLIVLPSILMLMSKFEGFISKSSLERRCASRYYMFQFINVFLCSII 480
Query: 481 TGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVIFH 540
GTA +QLD+F++QS T+IP+TIGVSIPMKATFFITYIMVDGWAG+A EILRLKPL+I+H
Sbjct: 481 AGTALQQLDSFLNQSATEIPKTIGVSIPMKATFFITYIMVDGWAGVAGEILRLKPLIIYH 540
Query: 541 LKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAFAY 600
LKN FLVKT++DRE+AM+P + F P +QLYF+LG+VYA V+PILLPFILVFFA AY
Sbjct: 541 LKNFFLVKTEKDREEAMDPGTIGFNTGEPQIQLYFILGLVYAAVSPILLPFILVFFALAY 600
Query: 601 LVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVALP 660
+VYRHQIINVYNQ+YES AFWP VH R++ +L++SQLLL+GLLSTKKAA STPLL LP
Sbjct: 601 VVYRHQIINVYNQEYESAAAFWPDVHRRVVIALIVSQLLLMGLLSTKKAARSTPLLFILP 660
Query: 661 ILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIFRSVEE 720
+LT FHK+C+ R++P F YPL++AM KDT+ER EP+L++K FL +AY HP+F++
Sbjct: 661 VLTIGFHKFCQGRYQPIFVTYPLQDAMVKDTLERMREPNLNLKTFLQNAYAHPVFKAA-- 720
Query: 721 EELAEIKVEKQKSP 735
+ LA V ++ +P
Sbjct: 721 DNLANEMVVEEPAP 731
BLAST of CmaCh16G012020 vs. TAIR 10
Match:
AT1G11960.1 (ERD (early-responsive to dehydration stress) family protein )
HSP 1 Score: 895.2 bits (2312), Expect = 3.6e-260
Identity = 436/723 (60.30%), Postives = 574/723 (79.39%), Query Frame = 0
Query: 1 MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSSVRK 60
MATL DIGV+A INI+TA +FLLAFAILRIQP NDRVYFPKWY+ G R+SP SG+ V K
Sbjct: 1 MATLGDIGVAAAINILTAIIFLLAFAILRIQPFNDRVYFPKWYLKGIRSSPLHSGALVSK 60
Query: 61 YVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLVL 120
+VN+N+ +Y FL WMPAALKM E E+I HAG DSAV+LRIY +GLKIF PIA++A +L
Sbjct: 61 FVNVNLGSYLRFLNWMPAALKMPEPELIDHAGLDSAVYLRIYLIGLKIFVPIALLAWSIL 120
Query: 121 IPVNVSSGTLSFLR-KELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKEY 180
+PVN +S L + + + S+IDKLSISN+ S RF+ H+ + Y FT W C++L KEY
Sbjct: 121 VPVNWTSHGLQLAKLRNVTSSDIDKLSISNIENGSDRFWTHLVMAYAFTFWTCYVLMKEY 180
Query: 181 DNVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVYN 240
+ VA MRL FL +++R DQFTVLVRNVP S SD V+ +F NHP HYL+HQ VYN
Sbjct: 181 EKVAAMRLAFLQNEQRRPDQFTVLVRNVPADPDESISDSVEHFFLVNHPDHYLTHQVVYN 240
Query: 241 ANKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKDL 300
AN LA L +++ QNWLDY LK+ R+ + +P + G+ GL GK+VD+I++Y +++ L
Sbjct: 241 ANDLAALVEQKKSTQNWLDYYQLKYTRNQEHKPRIKTGFLGLWGKKVDAIDHYIAEIEKL 300
Query: 301 DARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVYW 360
+ ++ ER K+ KD +++P +FVSFK+RWGAAV AQTQQS +PT WLT WAPE +V+W
Sbjct: 301 NEQIMEERKKVKKDDTSVMPAAFVSFKTRWGAAVSAQTQQSSDPTEWLTEWAPEAREVFW 360
Query: 361 QNLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKLV 420
NLAIP+VS+++R+L++ ++ F L FF+MIPIAFVQSLA++EG+E+ APFL+ +IE L
Sbjct: 361 SNLAIPYVSLTVRRLIMHIAFFFLTFFFMIPIAFVQSLASIEGIEKNAPFLKSIIENDLF 420
Query: 421 KSFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGSI 480
KS +QGFLPG+ LK+FL LP +LM+MSK EG V++S LERRAA +YY F L+NVFLGS+
Sbjct: 421 KSVIQGFLPGIVLKLFLIFLPSILMVMSKFEGFVSLSSLERRAAFRYYIFNLINVFLGSV 480
Query: 481 VTGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVIF 540
+TG+AFEQLD+F+ QS +IP+T+GV+IP+KATFFITYIMVDGWAG+A EILRLKPL+ F
Sbjct: 481 ITGSAFEQLDSFLKQSAKEIPKTVGVAIPIKATFFITYIMVDGWAGIAGEILRLKPLIFF 540
Query: 541 HLKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAFA 600
H+KN LVKT++DRE+AMNP +++ T P +QLYFLLG+VYA VTP+LLPFI++FFA A
Sbjct: 541 HIKNSLLVKTEKDREEAMNPGQINYHATEPRIQLYFLLGLVYAPVTPVLLPFIIIFFALA 600
Query: 601 YLVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVAL 660
YLV+RHQIINVYNQ+YES FWP VHGRII++L+I+Q+LL+GLLSTK AA STP L+ L
Sbjct: 601 YLVFRHQIINVYNQEYESAARFWPDVHGRIISALIIAQILLMGLLSTKGAAQSTPFLLFL 660
Query: 661 PILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIFRSVE 720
PI+TFFFH+YCK R+EPAF ++PL+EAM KDT+ER+ EP+ ++K +L AY+HP+F+ +
Sbjct: 661 PIITFFFHRYCKGRYEPAFLRHPLKEAMVKDTLERAREPNFNLKPYLQKAYIHPVFKDND 720
Query: 721 EEE 723
E+
Sbjct: 721 YED 723
BLAST of CmaCh16G012020 vs. TAIR 10
Match:
AT4G02900.1 (ERD (early-responsive to dehydration stress) family protein )
HSP 1 Score: 893.3 bits (2307), Expect = 1.4e-259
Identity = 446/730 (61.10%), Postives = 572/730 (78.36%), Query Frame = 0
Query: 1 MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSSVRK 60
MA++ DIG+SA IN+++AF FL AFA+LR+QP+NDRVYFPKWY+ G R SP S + +
Sbjct: 1 MASVQDIGLSAAINLLSAFAFLFAFAMLRLQPVNDRVYFPKWYLKGIRGSPTRSRGIMTR 60
Query: 61 YVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLVL 120
+VNL+ TY FL WMPAAL+M E E+I HAG DSAV++RIY LGLK+F PI ++A VL
Sbjct: 61 FVNLDWTTYVKFLNWMPAALQMPEPELIEHAGLDSAVYIRIYLLGLKMFVPITLLAFGVL 120
Query: 121 IPVNVSSGTLSFLRKELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKEYD 180
+PVN + TL + +L SN+DKLSISNV P S RF+AHI + Y+ T W C++LY EY
Sbjct: 121 VPVNWTGETLENI-DDLTFSNVDKLSISNVPPGSPRFWAHITMTYVITFWTCYILYMEYK 180
Query: 181 NVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVYNA 240
VA MRL LA++ R DQ TVLVRNVP S ++ V+ +F NHP HYL HQ VYNA
Sbjct: 181 AVANMRLRHLAAESRRPDQLTVLVRNVPPDPDESVNEHVEHFFCVNHPDHYLCHQVVYNA 240
Query: 241 NKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKDLD 300
N LAKL +R +QNWL Y KFER P RPT + G+ G G VD+I++Y +M L
Sbjct: 241 NDLAKLVAQRKAMQNWLTYYENKFERKPSSRPTTKTGYGGFWGTTVDAIDFYTSKMDILA 300
Query: 301 ARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVYWQ 360
+ A+ER KI+ DPKAI+P +FVSF+SRWG AVCAQTQQ NPT+WLT WAPEP DV+W
Sbjct: 301 EQEAVEREKIMNDPKAIMPAAFVSFRSRWGTAVCAQTQQCHNPTIWLTEWAPEPRDVFWD 360
Query: 361 NLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKLVK 420
NLAIP+V +SIR+LL ++++F L+F +MIPIAFVQSLANLEG+++V PFL+PVIE+K VK
Sbjct: 361 NLAIPYVELSIRRLLTTVALFFLIFCFMIPIAFVQSLANLEGIQKVLPFLKPVIEMKTVK 420
Query: 421 SFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGSIV 480
S +QGFLPG+ALKIFL ILP +LM MS+IEG+ ++S L+RR+A KY++F++VNVFLGSI+
Sbjct: 421 SVIQGFLPGIALKIFLIILPTILMTMSQIEGYTSLSYLDRRSAEKYFWFIIVNVFLGSII 480
Query: 481 TGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVIFH 540
TGTAF+QL +F+ Q PT+IP+T+GVSIPMKATFFITYIMVDGWAG+A+EILR+ PLVIFH
Sbjct: 481 TGTAFQQLKSFLEQPPTEIPKTVGVSIPMKATFFITYIMVDGWAGIAAEILRVVPLVIFH 540
Query: 541 LKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAFAY 600
LKN FLVKT++DR++AM+P +DF + P +Q YFLLG+VYA V PILLPFI+VFFAFAY
Sbjct: 541 LKNTFLVKTEQDRQQAMDPGHLDFATSEPRIQFYFLLGLVYAAVAPILLPFIIVFFAFAY 600
Query: 601 LVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVALP 660
+V+RHQ+INVY+Q+YES +WP VH R+I L+ISQLL++GLLSTKK A T LL+ P
Sbjct: 601 VVFRHQVINVYDQKYESGARYWPDVHRRLIICLIISQLLMMGLLSTKKFAKVTALLLPQP 660
Query: 661 ILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIF----- 720
ILTF+F++YC RFE AF K+PL+EAM KDT+E++TEP+L++K +L DAY+HP+F
Sbjct: 661 ILTFWFYRYCAGRFESAFSKFPLQEAMVKDTLEKATEPNLNLKEYLKDAYVHPVFKGNDF 720
Query: 721 ---RSVEEEE 723
R V+EEE
Sbjct: 721 DRPRVVDEEE 729
BLAST of CmaCh16G012020 vs. TAIR 10
Match:
AT4G22120.1 (ERD (early-responsive to dehydration stress) family protein )
HSP 1 Score: 892.5 bits (2305), Expect = 2.3e-259
Identity = 437/732 (59.70%), Postives = 575/732 (78.55%), Query Frame = 0
Query: 1 MATLADIGVSALINIITAFVFLLAFAILRIQPINDRVYFPKWYINGGRNSPRGSGSSVRK 60
MATL DIGVSA INI++AFVF + FA+LR+QP NDRVYF KWY+ G R+SP G+ ++
Sbjct: 1 MATLQDIGVSAGINILSAFVFFIIFAVLRLQPFNDRVYFSKWYLKGLRSSPARGGAFAQR 60
Query: 61 YVNLNIWTYFTFLKWMPAALKMSEDEIISHAGFDSAVFLRIYTLGLKIFFPIAIVALLVL 120
+VNL+ +Y FL WMP ALKM E E+I HAG DS V+LRIY LGLKIF PIA++A VL
Sbjct: 61 FVNLDFRSYMKFLNWMPEALKMPEPELIDHAGLDSVVYLRIYWLGLKIFTPIAVLAWAVL 120
Query: 121 IPVNVSSGTLSFLR--KELVVSNIDKLSISNVRPHSIRFFAHIGLEYLFTLWICFMLYKE 180
+PVN ++ TL + + + S+IDKLS+SN+ +S+RF+ HI + Y FT+W C++L KE
Sbjct: 121 VPVNWTNNTLEMAKQLRNVTSSDIDKLSVSNIPEYSMRFWTHIVMAYAFTIWTCYVLMKE 180
Query: 181 YDNVARMRLNFLASQRRCADQFTVLVRNVPRSSGRSNSDIVDQYFHKNHPQHYLSHQAVY 240
Y+ +A MRL F+AS+ R DQFTVLVRNVP + S S++V+ +F NHP HYL+HQ V
Sbjct: 181 YETIANMRLQFVASEARRPDQFTVLVRNVPPDADESVSELVEHFFLVNHPDHYLTHQVVC 240
Query: 241 NANKLAKLAKKRARLQNWLDYNLLKFERHPDQRPTRRAGWFGLCGKRVDSIEYYKRQMKD 300
NANKLA L KK+ +LQNWLDY LK+ R+ QR + G+ GL G++VD+IE+Y ++
Sbjct: 241 NANKLADLVKKKKKLQNWLDYYQLKYARNNSQRIMVKLGFLGLWGQKVDAIEHYIAEIDK 300
Query: 301 LDARMALERPKIVKDPKAILPVSFVSFKSRWGAAVCAQTQQSKNPTLWLTNWAPEPHDVY 360
+ ++ ER ++V DPKAI+P +FVSFK+RW AAVCAQTQQ++NPT WLT WAPEP DV+
Sbjct: 301 ISKEISKEREEVVNDPKAIMPAAFVSFKTRWAAAVCAQTQQTRNPTQWLTEWAPEPRDVF 360
Query: 361 WQNLAIPFVSVSIRKLLISLSVFALVFFYMIPIAFVQSLANLEGLERVAPFLRPVIELKL 420
W NLAIP+VS+++R+L++ ++ F L FF+++PIAFVQSLA +EG+ + APFL+ +++ K
Sbjct: 361 WSNLAIPYVSLTVRRLIMHVAFFFLTFFFIVPIAFVQSLATIEGIVKAAPFLKFIVDDKF 420
Query: 421 VKSFLQGFLPGLALKIFLFILPKVLMIMSKIEGHVAVSMLERRAAAKYYYFMLVNVFLGS 480
+KS +QGFLPG+ALK+FL LP +LMIMSK EG ++S LERRAA +YY F LVNVFL S
Sbjct: 421 MKSVIQGFLPGIALKLFLAFLPSILMIMSKFEGFTSISSLERRAAFRYYIFNLVNVFLAS 480
Query: 481 IVTGTAFEQLDAFIHQSPTQIPRTIGVSIPMKATFFITYIMVDGWAGMASEILRLKPLVI 540
++ G AFEQL++F++QS QIP+TIGV+IPMKATFFITYIMVDGWAG+A EIL LKPL++
Sbjct: 481 VIAGAAFEQLNSFLNQSANQIPKTIGVAIPMKATFFITYIMVDGWAGVAGEILMLKPLIM 540
Query: 541 FHLKNLFLVKTDRDREKAMNPKGVDFPETLPTLQLYFLLGIVYAVVTPILLPFILVFFAF 600
FHLKN FLVKTD+DRE+AM+P + F P +QLYFLLG+VYA VTP+LLPFILVFFA
Sbjct: 541 FHLKNAFLVKTDKDREEAMDPGSIGFNTGEPRIQLYFLLGLVYAPVTPMLLPFILVFFAL 600
Query: 601 AYLVYRHQIINVYNQQYESVGAFWPHVHGRIIASLLISQLLLLGLLSTKKAANSTPLLVA 660
AY+VYRHQIINVYNQ+YES AFWP VHGR+IA+L+ISQLLL+GLL TK AA + P L+A
Sbjct: 601 AYIVYRHQIINVYNQEYESAAAFWPDVHGRVIAALVISQLLLMGLLGTKHAALAAPFLIA 660
Query: 661 LPILTFFFHKYCKNRFEPAFRKYPLEEAMAKDTMERSTEPDLDVKAFLSDAYLHPIFRSV 720
LP+LT FH +CK R+EPAF +YPL+EAM KDT+E + EP+L++K +L +AY+HP+F+
Sbjct: 661 LPVLTIGFHHFCKGRYEPAFIRYPLQEAMMKDTLETAREPNLNLKGYLQNAYVHPVFKGD 720
Query: 721 EEEELAEIKVEK 731
E++ + K+ K
Sbjct: 721 EDDYDIDDKLGK 732
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9FVQ5 | 0.0e+00 | 75.14 | CSC1-like protein At1g32090 OS=Arabidopsis thaliana OX=3702 GN=At1g32090 PE=1 SV... | [more] |
Q9LVE4 | 9.1e-261 | 59.67 | CSC1-like protein At3g21620 OS=Arabidopsis thaliana OX=3702 GN=At3g21620 PE=2 SV... | [more] |
F4HYR3 | 4.5e-260 | 61.00 | CSC1-like protein At1g62320 OS=Arabidopsis thaliana OX=3702 GN=At1g62320 PE=3 SV... | [more] |
B5TYT3 | 5.0e-259 | 60.30 | CSC1-like protein At1g11960 OS=Arabidopsis thaliana OX=3702 GN=At1g11960 PE=2 SV... | [more] |
Q9SY14 | 1.9e-258 | 61.10 | CSC1-like protein At4g02900 OS=Arabidopsis thaliana OX=3702 GN=At4g02900 PE=3 SV... | [more] |
Match Name | E-value | Identity | Description | |
AT1G32090.1 | 0.0e+00 | 75.14 | early-responsive to dehydration stress protein (ERD4) | [more] |
AT3G21620.1 | 6.5e-262 | 59.67 | ERD (early-responsive to dehydration stress) family protein | [more] |
AT1G11960.1 | 3.6e-260 | 60.30 | ERD (early-responsive to dehydration stress) family protein | [more] |
AT4G02900.1 | 1.4e-259 | 61.10 | ERD (early-responsive to dehydration stress) family protein | [more] |
AT4G22120.1 | 2.3e-259 | 59.70 | ERD (early-responsive to dehydration stress) family protein | [more] |