Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
TCAATCCCTCTCTTCTCTTTTATTCTCACTGTCCCTGCTCACTCTCAGATCATCTTCTTCCTTCTCTCTTCACGCCCAAACATCTCTGATTTCTTCTCCCCTCCACCGCCCTTCTTTTTCTTGTTTCTTCATTATTAGGGCTCCGTTTTTTTCCATTGCCCTCTTCTTTGCCCTCTTCTTCTTTATGTTTCCTTCCAAATCCTCTGCATTCTATGTCCGATTGAATGCCTTCTTCTTTCTAATGGGGTGTCTGTGTCTGTGTGTGGAATGCCATAACCGGCCATGGGGATGGATGGATGTTTAGTTAAACTCCTTCACTTCCCTTTTGATTTCTTGTTGTTCTATTGCGCTTAGTTGAACTAATAAGCTTTGATTTCTGGGAACGGGAAGTCAGGGATTGCTGTTTTGAATTTTGAGCCGAGTGTTTGGCATTATAGGGCTTAATTTTGCTACACCATTCGGCATTTCCCGAGCTCTAAGGCTAAGAAACTTTCTGGGGTTTCGAATTACTGCTGTTTCCCATCTGGGTTTGGGATCTGAAAGCTCTGTAATCGAATCCGCCAAGTTGTTTTGGTTTCTGTGTTGGGCTTCTCTGCTTTTTTAGCTTGGGATATTGGATTCTGCGTTGTGGGTTTTTTGACGATATGGCCATGGCTATAGCTCATCACAGAGAGAGTAGTAGTGGGAGCTTAACCAGGCATCTTGACAGTACTGGAAAGTACGTTCGATACACATCAGAGCAGGTTGAGGCTCTTGAGCGAGTCTATGCTGAATGCCCAAAGCCAAGTTCTCTTCGAAGGCAGCAGCTCATCCGGGAATGCCCAATTCTTTCAAACATAGAGCCCAAACAGATCAAAGTTTGGTTTCAGAATCGGAGGTTCTTGATCTGATTTTATGCTTCTAGTTAAATTTGTTCGTTTTTCCTGCAATTGCCATGATCCTAAATAGCTAGAATCCTATGGCGTTTAGCAACAGTATCTTGAAGCTTTTGTTTACTTAACCCAAGAAGGATGGAATAAAACCTGCTTTACTCAAGAGTTCATAAGTTCTTCTTTCTTACCAATACAGATATACTTACTTTCTGGTGGAGGGTTTTTTTTACTTTCTTTTTTTCTAATTTTCCTGTTCTGGGGATGCGGAATGGTAATGGTGTTTGGACATGCAGATGTAGGGAGAAGCAAAGGAAGGAGGCTTCTCGGCTGCAAACAGTGAACAGGAAATTGAATGCCATGAACAAATTGTTGATGGAGGAGAATGATCGGCTGCAGAAACAGGTGTCCCAATTGGTGTGTGAAAATGGGTTTATGCGGCAGCAATTACAAACGGTAAGCATAATTGTTCTTCATCCATCTGCAGGCTTATACTTGTGTGGGCTTTTACAGACATTCTTAAATTTTAAACGAATAATTATTTATCTCCGATGTTGCAGGTGCCAGCAGCAACCACTGATGCAAGCTGTGATTCTGTGGTCACCACTCCTCAACCCTCTAAAACAGATGCTAATAACCCAGCTGGGTAAACTTCTAAACGAATTACCGATCTGTTATCTTTTTCGTACGTCTAAACGAACATTATGTTTGATTGTTGTCATGTCTTATTGGTCTTCGTTGTGCTTGGTAGGCTCCTCTCAATTGCAGAGGAGACCTTGGCAGAGTTCCTTTCAAAGGCTACAGGAACTGCTGTTGATTGGGTCCAGATGCCTGGGATGAAGGTTTTAACATGTTCTGATAATTCGTTAAGTCGCCTTCTATTATCTGATCTTAAACATCGAAACTGAATTATAATTTACTGTCAGCCTGGTCCGGATTCTGTTGGGATCTTTGCCATTTCACAAAGTTGCGGCGGAGTGGCAGCAAGAGCCTGTGGTCTTGTAAGTTTGGAGCCTGCAAAGGTAAGACTGTTTATGTCGGTGAATAATGGATAAGTCTGCAATAAGAATATTTATTACAAGCTTTTCTGCAGATTGCAGAGATCCTTAAAGATCGTCCATCGTGGTTTCG
TGATTGTCGAAGCCTTGAGGTTTTTACTATGTTTCCAGCTGGAAATGGTGGAACAATTGAACTTGTCTACACACAGGTAAAATTGACTGGACTGGACCTTTTATGTTTCCTCTTTTAGAAGTAATGGTCTTATTTGGGCATAGTAGTAAAGGGAGTACGTTCTGTTGTAGGTGTACGCTCCAACTACCCTTGCTCCTGCACGTGATTTTTGGACGCTAAGATACACAATAACTTTGGAAAATGGCAGCCTTGTGGTAAGCTTGTCACGATTTCCAGTCATTTTATCTATTGGAAATGAAATATTTGTTACAATCCAACGTTATTTATGGGCGGTTGATCTCTCTTAAAAAATAGTTGGATAGGATATTATCTGTTAATTTTCGTGGTCGTCCATATATCAGGTCTGTGAGAGATCTCTCTCGGGTTCTGGTGCCGGTCCTAGTCCAGCTGCTGCTGCTCAGTTTGTTAGAGCTGAGATGCTCCCAAGTGGTTATCTGATTCGACCATGTGAGGGCGGAGGGTCCATCATACATATAGTAGATCACCTAAATCTTGAGGTTTGTCTTGCATGCTCTCCCAGATCTTTTTACTCTGGAAAAGCTCGATCCCTCGAGAGTAACATTGCTCCTCTTATAGGCGTGGAATGTTCCTGAGGTGCTGCGACCGCTTTATGAATCGTCGAAAGTCGTGGCTCAGAACATGACTATTGCAGTATGAGAATTTCCTTTCTTTAGTTTTCGAGATTTCTTGAGAAGATTTAGTTTTTAATCCAGAAAATTTAACAATTGAATATATGCATTTGTTTGTGTAGGCACTACGGTACGTTAGGCAAATAGCTCAGGAGACGAGCGGTGAAGTGGTTTATGGTTTGGGCCGACAACCTGCTGTTCTGCGAACCTTTAGCCAAAGATTGAGCAGGTATCTACACGTTTTCTCTATTATTAAATTATTTGAAATATCGTCTTTCACTACGTCTTTAGCATATAATATTTCAAACTCGTGCATTATTTTCTTTCAGGGGTTTTAATGATTTGGTGAATGGATTCAATGACAACGGATGGTCATTGATTAATTGTGAAGGTGCTGAGGATGTCGTACTTGCTGTGAATTCTACGAAAAATTTCGGCACGACATCCAACCCCGCGAACTCCTTAACGTACCCTGGGGGAGTTCTTTGTGCTAAGGCTTCCATGCTACTCCAAGTATGTTTAGTTGCTGTTTGCAGTGTAACATTAAACTTATGGTAGCTGTTTACTTCCCGTGGATTTGTCGAACGTTTTGAACGTTCGGTTTCATATTTTTTTAATGGGATACAAATAGAGGAAGATTGTATTCATTTTTCTTTTCTTTTTCTACCTTGCTAGAATGTTCCTCCTGCTGTACTAGTTCGTTTCTTGAGAGAACACCGTTCCGAATGGGCTGATTTCAACATCGATGCATACTCTGCAGCAACCCTTAAAGCCAATTCATACGCATATCCAGGGATGAGGCCGACAAGGTTTACCGGGAGTCAAATCATCATGCCACTTGGCCATACAATCGAGCACGAAGAGGTTTGTTCTAAATGGAAACTGTTTCTCGTCATCTTTGCTGTCTAGCTAGATACCATATTGGCATGAAGTAACATGATGAATGCTATTAGATTTTCTTAGAATTCAAGGACGCAGTTGTACAATCTATAGTACGAATCCTATTATACTCGATGAGTCAATCCGATGATTATATTTTCTAGTCATCAAAACTAGATCAATGTCCATTGCTTTTCTTGGATATACATTTTGTTGTTGTTCTTTGCTGATGAACCACATCAAACACGAGTATCCGCTGCTGATTACGGTGTATTCCAGTTGCTTGAAGTTATTCGTCTTGAAGGACATCCTATGGTACAAGAAGACGCTTTTGTTTCAAGGGACATTCATCTTCTTCAGGTTGATAGTCGACTATTTGATTATCGTATGAAGTTTCTTTGGATGCTCATGATCTTAACGGCTTTAATAACT
TGATTGTGTTTCAGATATGTAGTGGAATCAACGAGAACGCTGTAGGAGCCTGTTCAGAACTCATTTTTGCCCCGATCGATGAGATGTTTCCAGACGATGCCCCATTGCTGCCTTCTGGCTTTCGAATCATCCCATTGGATTCAAGAACAGTTGAGTATAATTATTTCACATGAATTTTTTTGTAGCCCACCATGCTCGAACTTGAATTTTTACTCGTGTGGCATATGGGATTTGGGTATCGTCGTATATCGAATTCGTCGTGTTGAGTGTGTTCATATGCTCATATAGATAACCCTACCACCTGATGCTTTCTTGATGCAGAGTGATGCTCAAGGTGCATTGACTACGCAACGGACTCTCGATCTAACATCAAGTCTTGAAGTGGGTTCTGGAACAAGCAACATTGCAGGAGATGCATCGTCGAGTCAGAGCCCTCGATCAGTGTTAACTATTGCGTTCCAGTTTCCTTTCGAGAGCAGCATGCAGGATAACGTAATGAACATGGCACAGCAGTACGTACGTAGCGTCATTTCCTCGGTGCAGAGAGTTGCAATGGCCATATCTCCTTCTGGGGGCAGTCCATCACTAGGTCCAAAGTTGTCTCCTGGTTCTCCAGAAGCACTCACTCTAGCACATTGGATATGCAAGAGCTATAGGTATCTTCTAATCACTACTTTTCTTTCATCAGTACAGCAAGAAAAAAAGAAATATGGAAAATTTACGAGGCTCGATTAAATTTTTGGAGTTAGGAGTGTCGTTTGAACGCCATGCATCTGTAATTTTATTGATCGTTTAAACGATTCCTCGATTCAGTTTACAATTAGGAACGGAGTTGATCAGTTCATATAGCTTGGAAAGCGATTCCCTTTTGAAAAATCTCTGGAATCATCAGGATGCAATATTGTGCTGTTCGTTGAAGGTACATTGATAAAAGTAGCATATGGAAACTCCGTTTTTCTGTAGTATGATCGAGTCGATACAAGTGAACTAAACGGAGAATTCTTGTCTGGTTTTGTTCATAGCAGTCGCTGCCTGTTTTCATGTTTGCGAACCAAGCTGGGCTCGACATGTTGGAGACAACTCTAGTAGCATTACAAGACATCACATTGGATAAAATTTTCGACGAGGCGGGTCGTAAAGCTTTGTGTTCTGATTTTCCAAAGTTAATGCAGCAGGTAAATAAATTTTCAGTCACTGCACTGCAACACAACGTTCTTGGTTTAGCGTGGTTCGAAATGCATATCGTTGAATGTGTTTTGGGTTTTATGATCAGGGATTTGCATACTTACCTGGAGGAATCTGTGCATCGACAATGGGGCGACACGTTTCGTACGAACAAGCGATTGCATGGAAAGTCCTCGAGGCAGACGAAACAACTGTCCATTGCCTTGCCTTCTCCTTCATAAACTGGTCTTTTGTTTGATTGAAAGGCTTCTCCAATGGCTCTTCCAAAAGTAATTTAATTAAATTGTTAATTAAATCGCTTTTTATTTTGATTCATGTAAATATTGGGGAGCCCAATTAGATACAAACACGTGACCCTTTAAGGAGACAGTTTTGAAGGCTATGATTCTTTTTTTATTCCCTTCATCCACTAACTCTTGTCGTGATTATATTATTATATGATAATGTTTTTCAATCATAG
mRNA sequence
TCAATCCCTCTCTTCTCTTTTATTCTCACTGTCCCTGCTCACTCTCAGATCATCTTCTTCCTTCTCTCTTCACGCCCAAACATCTCTGATTTCTTCTCCCCTCCACCGCCCTTCTTTTTCTTGTTTCTTCATTATTAGGGCTCCGTTTTTTTCCATTGCCCTCTTCTTTGCCCTCTTCTTCTTTATGTTTCCTTCCAAATCCTCTGCATTCTATGTCCGATTGAATGCCTTCTTCTTTCTAATGGGGTGTCTGTGTCTGTGTGTGGAATGCCATAACCGGCCATGGGGATGGATGGATGTTTAGTTAAACTCCTTCACTTCCCTTTTGATTTCTTGTTGTTCTATTGCGCTTAGTTGAACTAATAAGCTTTGATTTCTGGGAACGGGAAGTCAGGGATTGCTGTTTTGAATTTTGAGCCGAGTGTTTGGCATTATAGGGCTTAATTTTGCTACACCATTCGGCATTTCCCGAGCTCTAAGGCTAAGAAACTTTCTGGGGTTTCGAATTACTGCTGTTTCCCATCTGGGTTTGGGATCTGAAAGCTCTGTAATCGAATCCGCCAAGTTGTTTTGGTTTCTGTGTTGGGCTTCTCTGCTTTTTTAGCTTGGGATATTGGATTCTGCGTTGTGGGTTTTTTGACGATATGGCCATGGCTATAGCTCATCACAGAGAGAGTAGTAGTGGGAGCTTAACCAGGCATCTTGACAGTACTGGAAAGTACGTTCGATACACATCAGAGCAGGTTGAGGCTCTTGAGCGAGTCTATGCTGAATGCCCAAAGCCAAGTTCTCTTCGAAGGCAGCAGCTCATCCGGGAATGCCCAATTCTTTCAAACATAGAGCCCAAACAGATCAAAGTTTGGTTTCAGAATCGGAGATGTAGGGAGAAGCAAAGGAAGGAGGCTTCTCGGCTGCAAACAGTGAACAGGAAATTGAATGCCATGAACAAATTGTTGATGGAGGAGAATGATCGGCTGCAGAAACAGGTGTCCCAATTGGTGTGTGAAAATGGGTTTATGCGGCAGCAATTACAAACGGTGCCAGCAGCAACCACTGATGCAAGCTGTGATTCTGTGGTCACCACTCCTCAACCCTCTAAAACAGATGCTAATAACCCAGCTGGGCTCCTCTCAATTGCAGAGGAGACCTTGGCAGAGTTCCTTTCAAAGGCTACAGGAACTGCTGTTGATTGGGTCCAGATGCCTGGGATGAAGCCTGGTCCGGATTCTGTTGGGATCTTTGCCATTTCACAAAGTTGCGGCGGAGTGGCAGCAAGAGCCTGTGGTCTTGTAAGTTTGGAGCCTGCAAAGATTGCAGAGATCCTTAAAGATCGTCCATCGTGGTTTCGTGATTGTCGAAGCCTTGAGGTTTTTACTATGTTTCCAGCTGGAAATGGTGGAACAATTGAACTTGTCTACACACAGGTGTACGCTCCAACTACCCTTGCTCCTGCACGTGATTTTTGGACGCTAAGATACACAATAACTTTGGAAAATGGCAGCCTTGTGGTCTGTGAGAGATCTCTCTCGGGTTCTGGTGCCGGTCCTAGTCCAGCTGCTGCTGCTCAGTTTGTTAGAGCTGAGATGCTCCCAAGTGGTTATCTGATTCGACCATGTGAGGGCGGAGGGTCCATCATACATATAGTAGATCACCTAAATCTTGAGGCGTGGAATGTTCCTGAGGTGCTGCGACCGCTTTATGAATCGTCGAAAGTCGTGGCTCAGAACATGACTATTGCAGCACTACGGTACGTTAGGCAAATAGCTCAGGAGACGAGCGGTGAAGTGGTTTATGGTTTGGGCCGACAACCTGCTGTTCTGCGAACCTTTAGCCAAAGATTGAGCAGGGGTTTTAATGATTTGGTGAATGGATTCAATGACAACGGATGGTCATTGATTAATTGTGAAGGTGCTGAGGATGTCGTACTTGCTGTGAATTCTACGAAAAATTTCGGCACGACATCCAACCCCGCGAACTCCTTAACGTACCCTGGGGGAGTT
CTTTGTGCTAAGGCTTCCATGCTACTCCAAAATGTTCCTCCTGCTGTACTAGTTCGTTTCTTGAGAGAACACCGTTCCGAATGGGCTGATTTCAACATCGATGCATACTCTGCAGCAACCCTTAAAGCCAATTCATACGCATATCCAGGGATGAGGCCGACAAGGTTTACCGGGAGTCAAATCATCATGCCACTTGGCCATACAATCGAGCACGAAGAGTTGCTTGAAGTTATTCGTCTTGAAGGACATCCTATGGTACAAGAAGACGCTTTTGTTTCAAGGGACATTCATCTTCTTCAGATATGTAGTGGAATCAACGAGAACGCTGTAGGAGCCTGTTCAGAACTCATTTTTGCCCCGATCGATGAGATGTTTCCAGACGATGCCCCATTGCTGCCTTCTGGCTTTCGAATCATCCCATTGGATTCAAGAACAATAACCCTACCACCTGATGCTTTCTTGATGCAGAGTGATGCTCAAGGTGCATTGACTACGCAACGGACTCTCGATCTAACATCAAGTCTTGAAGTGGGTTCTGGAACAAGCAACATTGCAGGAGATGCATCGTCGAGTCAGAGCCCTCGATCAGTGTTAACTATTGCGTTCCAGTTTCCTTTCGAGAGCAGCATGCAGGATAACGTAATGAACATGGCACAGCAGTACGTACGTAGCGTCATTTCCTCGGTGCAGAGAGTTGCAATGGCCATATCTCCTTCTGGGGGCAGTCCATCACTAGGTCCAAAGTTGTCTCCTGGTTCTCCAGAAGCACTCACTCTAGCACATTGGATATGCAAGAGCTATAGTTTACAATTAGGAACGGAGTTGATCAGTTCATATAGCTTGGAAAGCGATTCCCTTTTGAAAAATCTCTGGAATCATCAGGATGCAATATTGTGCTGTTCGTTGAAGTCGCTGCCTGTTTTCATGTTTGCGAACCAAGCTGGGCTCGACATGTTGGAGACAACTCTAGTAGCATTACAAGACATCACATTGGATAAAATTTTCGACGAGGCGGGTCGTAAAGCTTTGTGTTCTGATTTTCCAAAGTTAATGCAGCAGGGATTTGCATACTTACCTGGAGGAATCTGTGCATCGACAATGGGGCGACACGTTTCGTACGAACAAGCGATTGCATGGAAAGTCCTCGAGGCAGACGAAACAACTGTCCATTGCCTTGCCTTCTCCTTCATAAACTGGTCTTTTGTTTGATTGAAAGGCTTCTCCAATGGCTCTTCCAAAAGTAATTTAATTAAATTGTTAATTAAATCGCTTTTTATTTTGATTCATGTAAATATTGGGGAGCCCAATTAGATACAAACACGTGACCCTTTAAGGAGACAGTTTTGAAGGCTATGATTCTTTTTTTATTCCCTTCATCCACTAACTCTTGTCGTGATTATATTATTATATGATAATGTTTTTCAATCATAG
Coding sequence (CDS)
ATGGCCATGGCTATAGCTCATCACAGAGAGAGTAGTAGTGGGAGCTTAACCAGGCATCTTGACAGTACTGGAAAGTACGTTCGATACACATCAGAGCAGGTTGAGGCTCTTGAGCGAGTCTATGCTGAATGCCCAAAGCCAAGTTCTCTTCGAAGGCAGCAGCTCATCCGGGAATGCCCAATTCTTTCAAACATAGAGCCCAAACAGATCAAAGTTTGGTTTCAGAATCGGAGATGTAGGGAGAAGCAAAGGAAGGAGGCTTCTCGGCTGCAAACAGTGAACAGGAAATTGAATGCCATGAACAAATTGTTGATGGAGGAGAATGATCGGCTGCAGAAACAGGTGTCCCAATTGGTGTGTGAAAATGGGTTTATGCGGCAGCAATTACAAACGGTGCCAGCAGCAACCACTGATGCAAGCTGTGATTCTGTGGTCACCACTCCTCAACCCTCTAAAACAGATGCTAATAACCCAGCTGGGCTCCTCTCAATTGCAGAGGAGACCTTGGCAGAGTTCCTTTCAAAGGCTACAGGAACTGCTGTTGATTGGGTCCAGATGCCTGGGATGAAGCCTGGTCCGGATTCTGTTGGGATCTTTGCCATTTCACAAAGTTGCGGCGGAGTGGCAGCAAGAGCCTGTGGTCTTGTAAGTTTGGAGCCTGCAAAGATTGCAGAGATCCTTAAAGATCGTCCATCGTGGTTTCGTGATTGTCGAAGCCTTGAGGTTTTTACTATGTTTCCAGCTGGAAATGGTGGAACAATTGAACTTGTCTACACACAGGTGTACGCTCCAACTACCCTTGCTCCTGCACGTGATTTTTGGACGCTAAGATACACAATAACTTTGGAAAATGGCAGCCTTGTGGTCTGTGAGAGATCTCTCTCGGGTTCTGGTGCCGGTCCTAGTCCAGCTGCTGCTGCTCAGTTTGTTAGAGCTGAGATGCTCCCAAGTGGTTATCTGATTCGACCATGTGAGGGCGGAGGGTCCATCATACATATAGTAGATCACCTAAATCTTGAGGCGTGGAATGTTCCTGAGGTGCTGCGACCGCTTTATGAATCGTCGAAAGTCGTGGCTCAGAACATGACTATTGCAGCACTACGGTACGTTAGGCAAATAGCTCAGGAGACGAGCGGTGAAGTGGTTTATGGTTTGGGCCGACAACCTGCTGTTCTGCGAACCTTTAGCCAAAGATTGAGCAGGGGTTTTAATGATTTGGTGAATGGATTCAATGACAACGGATGGTCATTGATTAATTGTGAAGGTGCTGAGGATGTCGTACTTGCTGTGAATTCTACGAAAAATTTCGGCACGACATCCAACCCCGCGAACTCCTTAACGTACCCTGGGGGAGTTCTTTGTGCTAAGGCTTCCATGCTACTCCAAAATGTTCCTCCTGCTGTACTAGTTCGTTTCTTGAGAGAACACCGTTCCGAATGGGCTGATTTCAACATCGATGCATACTCTGCAGCAACCCTTAAAGCCAATTCATACGCATATCCAGGGATGAGGCCGACAAGGTTTACCGGGAGTCAAATCATCATGCCACTTGGCCATACAATCGAGCACGAAGAGTTGCTTGAAGTTATTCGTCTTGAAGGACATCCTATGGTACAAGAAGACGCTTTTGTTTCAAGGGACATTCATCTTCTTCAGATATGTAGTGGAATCAACGAGAACGCTGTAGGAGCCTGTTCAGAACTCATTTTTGCCCCGATCGATGAGATGTTTCCAGACGATGCCCCATTGCTGCCTTCTGGCTTTCGAATCATCCCATTGGATTCAAGAACAATAACCCTACCACCTGATGCTTTCTTGATGCAGAGTGATGCTCAAGGTGCATTGACTACGCAACGGACTCTCGATCTAACATCAAGTCTTGAAGTGGGTTCTGGAACAAGCAACATTGCAGGAGATGCATCGTCGAGTCAGAGCCCTCGATCAGTGTTAACTATTGCGTTCCAGTTTCCTTTCGAGAGCAGCATGCAGGATAACGTAAT
GAACATGGCACAGCAGTACGTACGTAGCGTCATTTCCTCGGTGCAGAGAGTTGCAATGGCCATATCTCCTTCTGGGGGCAGTCCATCACTAGGTCCAAAGTTGTCTCCTGGTTCTCCAGAAGCACTCACTCTAGCACATTGGATATGCAAGAGCTATAGTTTACAATTAGGAACGGAGTTGATCAGTTCATATAGCTTGGAAAGCGATTCCCTTTTGAAAAATCTCTGGAATCATCAGGATGCAATATTGTGCTGTTCGTTGAAGTCGCTGCCTGTTTTCATGTTTGCGAACCAAGCTGGGCTCGACATGTTGGAGACAACTCTAGTAGCATTACAAGACATCACATTGGATAAAATTTTCGACGAGGCGGGTCGTAAAGCTTTGTGTTCTGATTTTCCAAAGTTAATGCAGCAGGGATTTGCATACTTACCTGGAGGAATCTGTGCATCGACAATGGGGCGACACGTTTCGTACGAACAAGCGATTGCATGGAAAGTCCTCGAGGCAGACGAAACAACTGTCCATTGCCTTGCCTTCTCCTTCATAAACTGGTCTTTTGTTTGA
Protein sequence
MAMAIAHHRESSSGSLTRHLDSTGKYVRYTSEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSQLVCENGFMRQQLQTVPAATTDASCDSVVTTPQPSKTDANNPAGLLSIAEETLAEFLSKATGTAVDWVQMPGMKPGPDSVGIFAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWFRDCRSLEVFTMFPAGNGGTIELVYTQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSLSGSGAGPSPAAAAQFVRAEMLPSGYLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYESSKVVAQNMTIAALRYVRQIAQETSGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNGWSLINCEGAEDVVLAVNSTKNFGTTSNPANSLTYPGGVLCAKASMLLQNVPPAVLVRFLREHRSEWADFNIDAYSAATLKANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRLEGHPMVQEDAFVSRDIHLLQICSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIPLDSRTITLPPDAFLMQSDAQGALTTQRTLDLTSSLEVGSGTSNIAGDASSSQSPRSVLTIAFQFPFESSMQDNVMNMAQQYVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLAHWICKSYSLQLGTELISSYSLESDSLLKNLWNHQDAILCCSLKSLPVFMFANQAGLDMLETTLVALQDITLDKIFDEAGRKALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQAIAWKVLEADETTVHCLAFSFINWSFV
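The protein sequence above appears to be the direct translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code, the relationship can be checked on the first and last codons of the CDS. The helper name `translate` is hypothetical and not part of any database tool.

```python
# Minimal sketch: translate a CDS with the standard genetic code.
# Assumption: standard nuclear codon table; `translate` is an illustrative helper.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the 64-entry codon table in TCAG order.
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 18 nt of the CDS listed above.
cds_start = "ATGGCCATGGCTATAGCTCATCAC"
cds_end = "AACTGGTCTTTTGTTTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments should match the first eight residues and the last five residues of the protein sequence listed above.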
Homology
BLAST of CmaCh02G008950 vs. ExPASy Swiss-Prot
Match:
Q9SE43 (Homeobox-leucine zipper protein REVOLUTA OS=Arabidopsis thaliana OX=3702 GN=REV PE=1 SV=2)
HSP 1 Score: 1359.7 bits (3518), Expect = 0.0e+00
Identity = 691/859 (80.44%), Positives = 767/859 (89.29%), Query Frame = 0
Query: 1 MAMAIAHHRESSSGSLTRHLDSTGKYVRYTSEQVEALERVYAECPKPSSLRRQQLIRECP 60
M MA+A+HRE SS S+ RHLDS+GKYVRYT+EQVEALERVYAECPKPSSLRRQQLIREC
Sbjct: 1 MEMAVANHRERSSDSMNRHLDSSGKYVRYTAEQVEALERVYAECPKPSSLRRQQLIRECS 60
Query: 61 ILSNIEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSQLVC 120
IL+NIEPKQIKVWFQNRRCR+KQRKEASRLQ+VNRKL+AMNKLLMEENDRLQKQVSQLVC
Sbjct: 61 ILANIEPKQIKVWFQNRRCRDKQRKEASRLQSVNRKLSAMNKLLMEENDRLQKQVSQLVC 120
Query: 121 ENGFMRQQLQTVPAATTDASCDSVVTTPQPSKTDANNPAGLLSIAEETLAEFLSKATGTA 180
ENG+M+QQL TV D SC+SVVTTPQ S DAN+PAGLLSIAEETLAEFLSKATGTA
Sbjct: 121 ENGYMKQQLTTV---VNDPSCESVVTTPQHSLRDANSPAGLLSIAEETLAEFLSKATGTA 180
Query: 181 VDWVQMPGMKPGPDSVGIFAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWFRDCRSL 240
VDWVQMPGMKPGPDSVGIFAISQ C GVAARACGLVSLEP KIAEILKDRPSWFRDCRSL
Sbjct: 181 VDWVQMPGMKPGPDSVGIFAISQRCNGVAARACGLVSLEPMKIAEILKDRPSWFRDCRSL 240
Query: 241 EVFTMFPAGNGGTIELVYTQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSLSGSGAG 300
EVFTMFPAGNGGTIELVY Q YAPTTLAPARDFWTLRYT +L+NGS VVCERSLSGSGAG
Sbjct: 241 EVFTMFPAGNGGTIELVYMQTYAPTTLAPARDFWTLRYTTSLDNGSFVVCERSLSGSGAG 300
Query: 301 PSPAAAAQFVRAEMLPSGYLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYESSKVVAQ 360
P+ A+A+QFVRAEML SGYLIRPC+GGGSIIHIVDHLNLEAW+VP+VLRPLYESSKVVAQ
Sbjct: 301 PNAASASQFVRAEMLSSGYLIRPCDGGGSIIHIVDHLNLEAWSVPDVLRPLYESSKVVAQ 360
Query: 361 NMTIAALRYVRQIAQETSGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNGWSLINC 420
MTI+ALRY+RQ+AQE++GEVVYGLGRQPAVLRTFSQRLSRGFND VNGF D+GWS ++C
Sbjct: 361 KMTISALRYIRQLAQESNGEVVYGLGRQPAVLRTFSQRLSRGFNDAVNGFGDDGWSTMHC 420
Query: 421 EGAEDVVLAVNSTKNFGTTSNPANSLTYPGGVLCAKASMLLQNVPPAVLVRFLREHRSEW 480
+GAED+++A+NSTK+ +N +NSL++ GGVLCAKASMLLQNVPPAVL+RFLREHRSEW
Sbjct: 421 DGAEDIIVAINSTKHL---NNISNSLSFLGGVLCAKASMLLQNVPPAVLIRFLREHRSEW 480
Query: 481 ADFNIDAYSAATLKANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRLEGHPMVQE 540
ADFN+DAYSAATLKA S+AYPGMRPTRFTGSQIIMPLGHTIEHEE+LEV+RLEGH + QE
Sbjct: 481 ADFNVDAYSAATLKAGSFAYPGMRPTRFTGSQIIMPLGHTIEHEEMLEVVRLEGHSLAQE 540
Query: 541 DAFVSRDIHLLQICSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIPLDSRTITL 600
DAF+SRD+HLLQIC+GI+ENAVGACSELIFAPI+EMFPDDAPL+PSGFR+IP+D++T
Sbjct: 541 DAFMSRDVHLLQICTGIDENAVGACSELIFAPINEMFPDDAPLVPSGFRVIPVDAKT--- 600
Query: 601 PPDAFLMQSDAQGALT-TQRTLDLTSSLEVGSGTSNIAGDASSSQSPRSVLTIAFQFPFE 660
D Q LT RTLDLTSSLEVG N +G++ SS S R +LTIAFQFPFE
Sbjct: 601 --------GDVQDLLTANHRTLDLTSSLEVGPSPENASGNSFSSSSSRCILTIAFQFPFE 660
Query: 661 SSMQDNVMNMAQQYVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLAHWICKSY 720
+++Q+NV MA QYVRSVISSVQRVAMAISPSG SPSLG KLSPGSPEA+TLA WI +SY
Sbjct: 661 NNLQENVAGMACQYVRSVISSVQRVAMAISPSGISPSLGSKLSPGSPEAVTLAQWISQSY 720
Query: 721 SLQLGTELISSYSLES-DSLLKNLWNHQDAILCCSLKSLPVFMFANQAGLDMLETTLVAL 780
S LG+EL++ SL S DS+LK LW+HQDAILCCSLK PVFMFANQAGLDMLETTLVAL
Sbjct: 721 SHHLGSELLTIDSLGSDDSVLKLLWDHQDAILCCSLKPQPVFMFANQAGLDMLETTLVAL 780
Query: 781 QDITLDKIFDEAGRKALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQAIAWKVLEADE 840
QDITL+KIFDE+GRKA+CSDF KLMQQGFA LP GIC STMGRHVSYEQA+AWKV A E
Sbjct: 781 QDITLEKIFDESGRKAICSDFAKLMQQGFACLPSGICVSTMGRHVSYEQAVAWKVFAASE 840
Query: 841 ---TTVHCLAFSFINWSFV 855
+HCLAFSF+NWSFV
Sbjct: 841 ENNNNLHCLAFSFVNWSFV 842
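The percentages in each HSP summary line are the match count divided by the alignment length (gaps included). A minimal sketch of that arithmetic, using the counts reported for the Q9SE43 hit; the function name `percent` is hypothetical.

```python
# Minimal sketch: reproduce the percentages in a BLAST HSP summary line.
# Assumption: percentage = matches / alignment length, rounded to two decimals.


def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length."""
    return round(100.0 * matches / alignment_length, 2)


# Counts reported for the Q9SE43 (REVOLUTA) hit above.
identity = percent(691, 859)
positives = percent(767, 859)

print(identity, positives)
```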
BLAST of CmaCh02G008950 vs. ExPASy Swiss-Prot
Match:
A2XBL9 (Homeobox-leucine zipper protein HOX10 OS=Oryza sativa subsp. indica OX=39946 GN=HOX10 PE=2 SV=2)
HSP 1 Score: 1187.6 bits (3071), Expect = 0.0e+00
Identity = 602/858 (70.16%), Positives = 712/858 (82.98%), Query Frame = 0
Query: 1 MAMAIAHHRESSSG---SLTRHLDSTGKYVRYTSEQVEALERVYAECPKPSSLRRQQLIR 60
MA A+A SS G +DS GKYVRYT EQVEALERVYA+CPKP+S RRQQL+R
Sbjct: 1 MAAAVAMRGSSSDGGGYDKVSGMDS-GKYVRYTPEQVEALERVYADCPKPTSSRRQQLLR 60
Query: 61 ECPILSNIEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSQ 120
ECPIL+NIEPKQIKVWFQNRRCR+KQRKE+SRLQ VNRKL AMNKLLMEEN+RLQKQVSQ
Sbjct: 61 ECPILANIEPKQIKVWFQNRRCRDKQRKESSRLQAVNRKLTAMNKLLMEENERLQKQVSQ 120
Query: 121 LVCENGFMRQQLQTVPAATTDASCDSVVTTPQPSKTDANNPAGLLSIAEETLAEFLSKAT 180
LV EN MRQQLQ P A D SC+S VTTPQ DA+NP+GLLSIAEETL EFLSKAT
Sbjct: 121 LVHENAHMRQQLQNTPLA-NDTSCESNVTTPQNPLRDASNPSGLLSIAEETLTEFLSKAT 180
Query: 181 GTAVDWVQMPGMKPGPDSVGIFAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWFRDC 240
GTA+DWVQMPGMKPGPDSVGI AIS C GVAARACGLV+LEP K+ EILKDRPSWFRDC
Sbjct: 181 GTAIDWVQMPGMKPGPDSVGIVAISHGCRGVAARACGLVNLEPTKVVEILKDRPSWFRDC 240
Query: 241 RSLEVFTMFPAGNGGTIELVYTQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSLSGS 300
R+LEVFTM PAGNGGT+ELVYTQ+YAPTTL PARDFWTLRYT T+E+GSLVVCERSLSGS
Sbjct: 241 RNLEVFTMIPAGNGGTVELVYTQLYAPTTLVPARDFWTLRYTTTMEDGSLVVCERSLSGS 300
Query: 301 GAGPSPAAAAQFVRAEMLPSGYLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYESSKV 360
G GPS A+A Q+VRAEMLPSGYL+RPCEGGGSI+HIVDHL+LEAW+VPEVLRPLYESS+V
Sbjct: 301 GGGPSAASAQQYVRAEMLPSGYLVRPCEGGGSIVHIVDHLDLEAWSVPEVLRPLYESSRV 360
Query: 361 VAQNMTIAALRYVRQIAQETSGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNGWSL 420
VAQ MT AALR++RQIAQETSGEVVY LGRQPAVLRTFSQRLSRGFND ++GFND+GWS+
Sbjct: 361 VAQKMTTAALRHIRQIAQETSGEVVYALGRQPAVLRTFSQRLSRGFNDAISGFNDDGWSI 420
Query: 421 INCEGAEDVVLAVNSTKNFGTTSNPANSLTYPGGVLCAKASMLLQNVPPAVLVRFLREHR 480
+ +G EDVV+A NSTK + SN + PGG++CAKASMLLQ+VPPAVLVRFLREHR
Sbjct: 421 MGGDGVEDVVIACNSTKKIRSNSNAGIAFGAPGGIICAKASMLLQSVPPAVLVRFLREHR 480
Query: 481 SEWADFNIDAYSAATLKANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRLEGHPM 540
SEWAD+NIDAY A+TLK ++ + PG+RP RF+GSQII+PL HT+E+EE+LEV+RLEG P+
Sbjct: 481 SEWADYNIDAYLASTLKTSACSLPGLRPMRFSGSQIIIPLAHTVENEEILEVVRLEGQPL 540
Query: 541 VQEDAFVSRDIHLLQICSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIPLDSRT 600
++A +SRDIHLLQ+C+GI+E +VG+ +L+FAPID+ FPD+ PL+ SGFR+IPLD +T
Sbjct: 541 THDEALLSRDIHLLQLCTGIDEKSVGSSFQLVFAPIDD-FPDETPLISSGFRVIPLDMKT 600
Query: 601 ITLPPDAFLMQSDAQGALTTQRTLDLTSSLEVGSGTSNIAGDASSSQ-SPRSVLTIAFQF 660
GA ++ RTLDL SSLEVGS T+ +GDAS+ + RSVLTIAFQF
Sbjct: 601 --------------DGA-SSGRTLDLASSLEVGSATAQASGDASADDCNLRSVLTIAFQF 660
Query: 661 PFESSMQDNVMNMAQQYVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLAHWIC 720
P+E +QD+V MA+QYVRS++S+VQRV+MAISPS + G ++ G PEA TLA W+C
Sbjct: 661 PYELHLQDSVAAMARQYVRSIVSAVQRVSMAISPSQTGLNAGQRIISGFPEAATLARWVC 720
Query: 721 KSYSLQLGTELISSYSLESDSLLKNLWNHQDAILCCSLKSLPVFMFANQAGLDMLETTLV 780
+SY LG EL+S +++ LLK LW++QDAILCCS K PVF FAN+AGLDMLET+LV
Sbjct: 721 QSYHYHLGVELLSQSDGDAEQLLKMLWHYQDAILCCSFKEKPVFTFANKAGLDMLETSLV 780
Query: 781 ALQDITLDKIFDEAGRKALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQAIAWKVLEA 840
ALQD+TLD+IFDE G++AL S+ PKLM+QG YLP G+C S MGRHVS++QA+AWKVL A
Sbjct: 781 ALQDLTLDRIFDEPGKEALFSNIPKLMEQGHVYLPSGVCMSGMGRHVSFDQAVAWKVL-A 839
Query: 841 DETTVHCLAFSFINWSFV 855
+++ VHCLAF F+NWSFV
Sbjct: 841 EDSNVHCLAFCFVNWSFV 839
BLAST of CmaCh02G008950 vs. ExPASy Swiss-Prot
Match:
A2Z8L4 (Homeobox-leucine zipper protein HOX9 OS=Oryza sativa subsp. indica OX=39946 GN=HOX9 PE=2 SV=2)
HSP 1 Score: 1187.6 bits (3071), Expect = 0.0e+00
Identity = 604/859 (70.31%), Positives = 704/859 (81.96%), Query Frame = 0
Query: 1 MAMAIAHHRESSS----GSLTRHLDSTGKYVRYTSEQVEALERVYAECPKPSSLRRQQLI 60
MA A+A S S G + +GKYVRYT EQVEALERVYAECPKPSS RRQQL+
Sbjct: 1 MAAAVAMRSGSGSDGGGGGYDKAGMDSGKYVRYTPEQVEALERVYAECPKPSSSRRQQLL 60
Query: 61 RECPILSNIEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVS 120
R+CPIL+NIEPKQIKVWFQNRRCR+KQRKEASRLQ VNRKL AMNKLLMEEN+RLQKQVS
Sbjct: 61 RDCPILANIEPKQIKVWFQNRRCRDKQRKEASRLQAVNRKLTAMNKLLMEENERLQKQVS 120
Query: 121 QLVCENGFMRQQLQTVPAATTDASCDSVVTTPQPSKTDANNPAGLLSIAEETLAEFLSKA 180
QLV EN +M+QQLQ P+ D SC+S VTTPQ DA+NP+GLL+IAEETL EFLSKA
Sbjct: 121 QLVHENAYMKQQLQN-PSLGNDTSCESNVTTPQNPLRDASNPSGLLTIAEETLTEFLSKA 180
Query: 181 TGTAVDWVQMPGMKPGPDSVGIFAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWFRD 240
TGTAVDWV MPGMKPGPDS GI A+S C GVAARACGLV+LEP KI EILKDRPSWFRD
Sbjct: 181 TGTAVDWVPMPGMKPGPDSFGIVAVSHGCRGVAARACGLVNLEPTKIVEILKDRPSWFRD 240
Query: 241 CRSLEVFTMFPAGNGGTIELVYTQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSLSG 300
CRSLEVFTMFPAGNGGTIELVY Q+YAPTTL PARDFWTLRYT T+++GSLVVCERSLSG
Sbjct: 241 CRSLEVFTMFPAGNGGTIELVYMQMYAPTTLVPARDFWTLRYTTTMDDGSLVVCERSLSG 300
Query: 301 SGAGPSPAAAAQFVRAEMLPSGYLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYESSK 360
SG GPS A+A QFVRAEMLPSGYL+RPCEGGGSI+HIVDHL+LEAW+VPEVLRPLYESS+
Sbjct: 301 SGGGPSTASAQQFVRAEMLPSGYLVRPCEGGGSIVHIVDHLDLEAWSVPEVLRPLYESSR 360
Query: 361 VVAQNMTIAALRYVRQIAQETSGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNGWS 420
VVAQ MT AALR++RQIAQETSGEVVY LGRQPAVLRTFSQRLSRGFND ++GFND+GWS
Sbjct: 361 VVAQKMTTAALRHIRQIAQETSGEVVYALGRQPAVLRTFSQRLSRGFNDAISGFNDDGWS 420
Query: 421 LINCEGAEDVVLAVNSTKNFGTTSNPANSLTYPGGVLCAKASMLLQNVPPAVLVRFLREH 480
++ +G EDV++A N+ K TS AN+ PGGV+CAKASMLLQ+VPPAVLVRFLREH
Sbjct: 421 VMGGDGIEDVIIACNA-KKVRNTSTSANAFVTPGGVICAKASMLLQSVPPAVLVRFLREH 480
Query: 481 RSEWADFNIDAYSAATLKANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRLEGHP 540
RSEWAD+N DAYSA++LK +S + PG+RP RF+GSQIIMPL HT+E+EE+LEV+RLEG
Sbjct: 481 RSEWADYNFDAYSASSLKTSSCSLPGLRPMRFSGSQIIMPLAHTVENEEILEVVRLEGQA 540
Query: 541 MVQEDAFVSRDIHLLQICSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIPLDSR 600
+ +D +SRDIHLLQ+C+GI+E ++G+C +L+FAPIDE+FPDDAPL+ SGFR+IPLD +
Sbjct: 541 LTHDDGLMSRDIHLLQLCTGIDEKSMGSCFQLVFAPIDELFPDDAPLISSGFRVIPLDMK 600
Query: 601 TITLPPDAFLMQSDAQGALTTQRTLDLTSSLEVGSGTSNIAGDASSSQ-SPRSVLTIAFQ 660
T P RTLDL SSLEVGS T+ GDAS + RSVLTIAFQ
Sbjct: 601 TDGTP---------------AGRTLDLASSLEVGS-TAQPTGDASMDDCNLRSVLTIAFQ 660
Query: 661 FPFESSMQDNVMNMAQQYVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLAHWI 720
FP+E +QD+V MA+QYVRS++SSVQRV+MAISPS + G K+ G PEA TLA WI
Sbjct: 661 FPYEMHLQDSVATMARQYVRSIVSSVQRVSMAISPSRSGLNAGQKIISGFPEAPTLARWI 720
Query: 721 CKSYSLQLGTELISSYSLESDSLLKNLWNHQDAILCCSLKSLPVFMFANQAGLDMLETTL 780
C+SY LG EL+ ++LLK LW+++DAILCCS K PVF FAN+ GL+MLET+L
Sbjct: 721 CQSYQFHLGVELLRQADDAGEALLKMLWDYEDAILCCSFKEKPVFTFANEMGLNMLETSL 780
Query: 781 VALQDITLDKIFDEAGRKALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQAIAWKVLE 840
VALQD++LDKIFDEAGRKAL ++ PKLM+QG+ YLPGG+C S MGRHVS+EQA+AWKVL
Sbjct: 781 VALQDLSLDKIFDEAGRKALYNEIPKLMEQGYVYLPGGVCLSGMGRHVSFEQAVAWKVL- 840
Query: 841 ADETTVHCLAFSFINWSFV 855
++ VHCLAF F+NWSFV
Sbjct: 841 GEDNNVHCLAFCFVNWSFV 840
BLAST of CmaCh02G008950 vs. ExPASy Swiss-Prot
Match:
Q9AV49 (Homeobox-leucine zipper protein HOX9 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX9 PE=2 SV=1)
HSP 1 Score: 1185.6 bits (3066), Expect = 0.0e+00
Identity = 604/859 (70.31%), Positives = 703/859 (81.84%), Query Frame = 0
Query: 1 MAMAIAHHRESSS----GSLTRHLDSTGKYVRYTSEQVEALERVYAECPKPSSLRRQQLI 60
MA A+A S S G + +GKYVRYT EQVEALERVYAECPKPSS RRQQL+
Sbjct: 1 MAAAVAMRSGSGSDGGGGGYDKAGMDSGKYVRYTPEQVEALERVYAECPKPSSSRRQQLL 60
Query: 61 RECPILSNIEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVS 120
R+CPIL+NIEPKQIKVWFQNRRCR+KQRKEASRLQ VNRKL AMNKLLMEEN+RLQKQVS
Sbjct: 61 RDCPILANIEPKQIKVWFQNRRCRDKQRKEASRLQAVNRKLTAMNKLLMEENERLQKQVS 120
Query: 121 QLVCENGFMRQQLQTVPAATTDASCDSVVTTPQPSKTDANNPAGLLSIAEETLAEFLSKA 180
QLV EN +M+QQLQ P+ D SC+S VTTPQ DA+NP+GLL+IAEETL EFLSKA
Sbjct: 121 QLVHENAYMKQQLQN-PSLGNDTSCESNVTTPQNPLRDASNPSGLLTIAEETLTEFLSKA 180
Query: 181 TGTAVDWVQMPGMKPGPDSVGIFAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWFRD 240
TGTAVDWV MPGMKPGPDS GI A+S C GVAARACGLV+LEP KI EILKDRPSWFRD
Sbjct: 181 TGTAVDWVPMPGMKPGPDSFGIVAVSHGCRGVAARACGLVNLEPTKIVEILKDRPSWFRD 240
Query: 241 CRSLEVFTMFPAGNGGTIELVYTQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSLSG 300
CRSLEVFTMFPAGNGGTIELVY Q+YAPTTL PARDFWTLRYT T+E+GSLVVCERSLSG
Sbjct: 241 CRSLEVFTMFPAGNGGTIELVYMQMYAPTTLVPARDFWTLRYTTTMEDGSLVVCERSLSG 300
Query: 301 SGAGPSPAAAAQFVRAEMLPSGYLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYESSK 360
SG GPS A+A QFVRAEMLPSGYL+RPCEGGGSI+HIVDHL+LEAW+VPEVLRPLYESS+
Sbjct: 301 SGGGPSTASAQQFVRAEMLPSGYLVRPCEGGGSIVHIVDHLDLEAWSVPEVLRPLYESSR 360
Query: 361 VVAQNMTIAALRYVRQIAQETSGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNGWS 420
VVAQ MT AALR++RQIAQETSGEVVY LGRQPAVLRTFSQRLSRGFND ++GFND+GWS
Sbjct: 361 VVAQKMTTAALRHIRQIAQETSGEVVYALGRQPAVLRTFSQRLSRGFNDAISGFNDDGWS 420
Query: 421 LINCEGAEDVVLAVNSTKNFGTTSNPANSLTYPGGVLCAKASMLLQNVPPAVLVRFLREH 480
++ +G EDV++A N+ K TS AN+ PGGV+CAKASMLLQ+VPPAVLVRFLREH
Sbjct: 421 VMGGDGIEDVIIACNA-KKVRNTSTSANAFVTPGGVICAKASMLLQSVPPAVLVRFLREH 480
Query: 481 RSEWADFNIDAYSAATLKANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRLEGHP 540
RSEWAD+N DAYSA++LK +S + PG+RP RF+GSQIIMPL HT+E+EE+LEV+RLEG
Sbjct: 481 RSEWADYNFDAYSASSLKTSSCSLPGLRPMRFSGSQIIMPLAHTVENEEILEVVRLEGQA 540
Query: 541 MVQEDAFVSRDIHLLQICSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIPLDSR 600
+ +D +SRDIHLLQ+C+GI+E ++G+C +L+ APIDE+FPDDAPL+ SGFR+IPLD +
Sbjct: 541 LTHDDGLMSRDIHLLQLCTGIDEKSMGSCFQLVSAPIDELFPDDAPLISSGFRVIPLDMK 600
Query: 601 TITLPPDAFLMQSDAQGALTTQRTLDLTSSLEVGSGTSNIAGDASSSQ-SPRSVLTIAFQ 660
T P RTLDL SSLEVGS T+ GDAS + RSVLTIAFQ
Sbjct: 601 TDGTP---------------AGRTLDLASSLEVGS-TAQPTGDASMDDCNLRSVLTIAFQ 660
Query: 661 FPFESSMQDNVMNMAQQYVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLAHWI 720
FP+E +QD+V MA+QYVRS++SSVQRV+MAISPS + G K+ G PEA TLA WI
Sbjct: 661 FPYEMHLQDSVATMARQYVRSIVSSVQRVSMAISPSRSGLNAGQKIISGFPEAPTLARWI 720
Query: 721 CKSYSLQLGTELISSYSLESDSLLKNLWNHQDAILCCSLKSLPVFMFANQAGLDMLETTL 780
C+SY LG EL+ ++LLK LW+++DAILCCS K PVF FAN+ GL+MLET+L
Sbjct: 721 CQSYQFHLGVELLRQADDAGEALLKMLWDYEDAILCCSFKEKPVFTFANEMGLNMLETSL 780
Query: 781 VALQDITLDKIFDEAGRKALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQAIAWKVLE 840
VALQD++LDKIFDEAGRKAL ++ PKLM+QG+ YLPGG+C S MGRHVS+EQA+AWKVL
Sbjct: 781 VALQDLSLDKIFDEAGRKALYNEIPKLMEQGYVYLPGGVCLSGMGRHVSFEQAVAWKVL- 840
Query: 841 ADETTVHCLAFSFINWSFV 855
++ VHCLAF F+NWSFV
Sbjct: 841 GEDNNVHCLAFCFVNWSFV 840
BLAST of CmaCh02G008950 vs. ExPASy Swiss-Prot
Match:
Q6TAQ6 (Homeobox-leucine zipper protein HOX10 OS=Oryza sativa subsp. japonica OX=39947 GN=HOX10 PE=2 SV=1)
HSP 1 Score: 1184.5 bits (3063), Expect = 0.0e+00
Identity = 601/858 (70.05%), Positives = 711/858 (82.87%), Query Frame = 0
Query: 1 MAMAIAHHRESSSG---SLTRHLDSTGKYVRYTSEQVEALERVYAECPKPSSLRRQQLIR 60
MA A+A SS G +DS GKYVRYT EQVEALERVYA+CPKP+S RRQQL+R
Sbjct: 1 MAAAVAMRGSSSDGGGYDKVSGMDS-GKYVRYTPEQVEALERVYADCPKPTSSRRQQLLR 60
Query: 61 ECPILSNIEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSQ 120
ECPIL+NIEPKQIKVWFQNRRCR+KQRKE+SRLQ VNRKL AMNKLLMEEN+RLQKQVSQ
Sbjct: 61 ECPILANIEPKQIKVWFQNRRCRDKQRKESSRLQAVNRKLTAMNKLLMEENERLQKQVSQ 120
Query: 121 LVCENGFMRQQLQTVPAATTDASCDSVVTTPQPSKTDANNPAGLLSIAEETLAEFLSKAT 180
LV EN MRQQLQ P A D SC+S VTTPQ DA+NP+GLLSIAEETL EFLSKAT
Sbjct: 121 LVHENAHMRQQLQNTPLA-NDTSCESNVTTPQNPLRDASNPSGLLSIAEETLTEFLSKAT 180
Query: 181 GTAVDWVQMPGMKPGPDSVGIFAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWFRDC 240
GTA+DWVQMPGMKPGPDSVGI AIS C GVAARACGLV+LEP K+ EILKDRPSWFRDC
Sbjct: 181 GTAIDWVQMPGMKPGPDSVGIVAISHGCRGVAARACGLVNLEPTKVVEILKDRPSWFRDC 240
Query: 241 RSLEVFTMFPAGNGGTIELVYTQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSLSGS 300
R+LEVFTM PAGNGGT+ELVYTQ+YAPTTL PARDFWTLRYT T+E+GSLVVCERSLSGS
Sbjct: 241 RNLEVFTMIPAGNGGTVELVYTQLYAPTTLVPARDFWTLRYTTTMEDGSLVVCERSLSGS 300
Query: 301 GAGPSPAAAAQFVRAEMLPSGYLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYESSKV 360
G GPS A+A Q+VRAEMLPSGYL+RPCEGGGSI+HIVDHL+LEAW+VPEVLRPLYESS+V
Sbjct: 301 GGGPSAASAQQYVRAEMLPSGYLVRPCEGGGSIVHIVDHLDLEAWSVPEVLRPLYESSRV 360
Query: 361 VAQNMTIAALRYVRQIAQETSGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNGWSL 420
VAQ MT AALR++RQIAQETSGEVVY LGRQPAVLRTFSQRLSRGFND ++GFND+GWS+
Sbjct: 361 VAQKMTTAALRHIRQIAQETSGEVVYALGRQPAVLRTFSQRLSRGFNDAISGFNDDGWSI 420
Query: 421 INCEGAEDVVLAVNSTKNFGTTSNPANSLTYPGGVLCAKASMLLQNVPPAVLVRFLREHR 480
+ +G EDVV+A NSTK + SN + PGG++CAKASMLLQ+VPPAVLVRFLREHR
Sbjct: 421 MGGDGVEDVVIACNSTKKIRSNSNAGIAFGAPGGIICAKASMLLQSVPPAVLVRFLREHR 480
Query: 481 SEWADFNIDAYSAATLKANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRLEGHPM 540
SEWAD+NIDAY A+TLK ++ + G+RP RF+GSQII+PL HT+E+EE+LEV+RLEG P+
Sbjct: 481 SEWADYNIDAYLASTLKTSACSLTGLRPMRFSGSQIIIPLAHTVENEEILEVVRLEGQPL 540
Query: 541 VQEDAFVSRDIHLLQICSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIPLDSRT 600
++A +SRDIHLLQ+C+GI+E +VG+ +L+FAPID+ FPD+ PL+ SGFR+IPLD +T
Sbjct: 541 THDEALLSRDIHLLQLCTGIDEKSVGSSFQLVFAPIDD-FPDETPLISSGFRVIPLDMKT 600
Query: 601 ITLPPDAFLMQSDAQGALTTQRTLDLTSSLEVGSGTSNIAGDASSSQ-SPRSVLTIAFQF 660
GA ++ RTLDL SSLEVGS T+ +GDAS+ + RSVLTIAFQF
Sbjct: 601 --------------DGA-SSGRTLDLASSLEVGSATAQASGDASADDCNLRSVLTIAFQF 660
Query: 661 PFESSMQDNVMNMAQQYVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLAHWIC 720
P+E +QD+V MA+QYVRS++S+VQRV+MAISPS + G ++ G PEA TLA W+C
Sbjct: 661 PYELHLQDSVAAMARQYVRSIVSAVQRVSMAISPSQTGLNAGQRIISGFPEAATLARWVC 720
Query: 721 KSYSLQLGTELISSYSLESDSLLKNLWNHQDAILCCSLKSLPVFMFANQAGLDMLETTLV 780
+SY LG EL+S +++ LLK LW++QDAILCCS K PVF FAN+AGLDMLET+LV
Sbjct: 721 QSYHYHLGVELLSQSDGDAEQLLKMLWHYQDAILCCSFKEKPVFTFANKAGLDMLETSLV 780
Query: 781 ALQDITLDKIFDEAGRKALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQAIAWKVLEA 840
ALQD+TLD+IFDE G++AL S+ PKLM+QG YLP G+C S MGRHVS++QA+AWKVL A
Sbjct: 781 ALQDLTLDRIFDEPGKEALFSNIPKLMEQGHVYLPSGVCMSGMGRHVSFDQAVAWKVL-A 839
Query: 841 DETTVHCLAFSFINWSFV 855
+++ VHCLAF F+NWSFV
Sbjct: 841 EDSNVHCLAFCFVNWSFV 839
BLAST of CmaCh02G008950 vs. TAIR 10
Match:
AT5G60690.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein)
HSP 1 Score: 1359.7 bits (3518), Expect = 0.0e+00
Identity = 691/859 (80.44%), Positives = 767/859 (89.29%), Query Frame = 0
Query: 1 MAMAIAHHRESSSGSLTRHLDSTGKYVRYTSEQVEALERVYAECPKPSSLRRQQLIRECP 60
M MA+A+HRE SS S+ RHLDS+GKYVRYT+EQVEALERVYAECPKPSSLRRQQLIREC
Sbjct: 1 MEMAVANHRERSSDSMNRHLDSSGKYVRYTAEQVEALERVYAECPKPSSLRRQQLIRECS 60
Query: 61 ILSNIEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSQLVC 120
IL+NIEPKQIKVWFQNRRCR+KQRKEASRLQ+VNRKL+AMNKLLMEENDRLQKQVSQLVC
Sbjct: 61 ILANIEPKQIKVWFQNRRCRDKQRKEASRLQSVNRKLSAMNKLLMEENDRLQKQVSQLVC 120
Query: 121 ENGFMRQQLQTVPAATTDASCDSVVTTPQPSKTDANNPAGLLSIAEETLAEFLSKATGTA 180
ENG+M+QQL TV D SC+SVVTTPQ S DAN+PAGLLSIAEETLAEFLSKATGTA
Sbjct: 121 ENGYMKQQLTTV---VNDPSCESVVTTPQHSLRDANSPAGLLSIAEETLAEFLSKATGTA 180
Query: 181 VDWVQMPGMKPGPDSVGIFAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWFRDCRSL 240
VDWVQMPGMKPGPDSVGIFAISQ C GVAARACGLVSLEP KIAEILKDRPSWFRDCRSL
Sbjct: 181 VDWVQMPGMKPGPDSVGIFAISQRCNGVAARACGLVSLEPMKIAEILKDRPSWFRDCRSL 240
Query: 241 EVFTMFPAGNGGTIELVYTQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSLSGSGAG 300
EVFTMFPAGNGGTIELVY Q YAPTTLAPARDFWTLRYT +L+NGS VVCERSLSGSGAG
Sbjct: 241 EVFTMFPAGNGGTIELVYMQTYAPTTLAPARDFWTLRYTTSLDNGSFVVCERSLSGSGAG 300
Query: 301 PSPAAAAQFVRAEMLPSGYLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYESSKVVAQ 360
P+ A+A+QFVRAEML SGYLIRPC+GGGSIIHIVDHLNLEAW+VP+VLRPLYESSKVVAQ
Sbjct: 301 PNAASASQFVRAEMLSSGYLIRPCDGGGSIIHIVDHLNLEAWSVPDVLRPLYESSKVVAQ 360
Query: 361 NMTIAALRYVRQIAQETSGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNGWSLINC 420
MTI+ALRY+RQ+AQE++GEVVYGLGRQPAVLRTFSQRLSRGFND VNGF D+GWS ++C
Sbjct: 361 KMTISALRYIRQLAQESNGEVVYGLGRQPAVLRTFSQRLSRGFNDAVNGFGDDGWSTMHC 420
Query: 421 EGAEDVVLAVNSTKNFGTTSNPANSLTYPGGVLCAKASMLLQNVPPAVLVRFLREHRSEW 480
+GAED+++A+NSTK+ +N +NSL++ GGVLCAKASMLLQNVPPAVL+RFLREHRSEW
Sbjct: 421 DGAEDIIVAINSTKHL---NNISNSLSFLGGVLCAKASMLLQNVPPAVLIRFLREHRSEW 480
Query: 481 ADFNIDAYSAATLKANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRLEGHPMVQE 540
ADFN+DAYSAATLKA S+AYPGMRPTRFTGSQIIMPLGHTIEHEE+LEV+RLEGH + QE
Sbjct: 481 ADFNVDAYSAATLKAGSFAYPGMRPTRFTGSQIIMPLGHTIEHEEMLEVVRLEGHSLAQE 540
Query: 541 DAFVSRDIHLLQICSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIPLDSRTITL 600
DAF+SRD+HLLQIC+GI+ENAVGACSELIFAPI+EMFPDDAPL+PSGFR+IP+D++T
Sbjct: 541 DAFMSRDVHLLQICTGIDENAVGACSELIFAPINEMFPDDAPLVPSGFRVIPVDAKT--- 600
Query: 601 PPDAFLMQSDAQGALT-TQRTLDLTSSLEVGSGTSNIAGDASSSQSPRSVLTIAFQFPFE 660
D Q LT RTLDLTSSLEVG N +G++ SS S R +LTIAFQFPFE
Sbjct: 601 --------GDVQDLLTANHRTLDLTSSLEVGPSPENASGNSFSSSSSRCILTIAFQFPFE 660
Query: 661 SSMQDNVMNMAQQYVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLAHWICKSY 720
+++Q+NV MA QYVRSVISSVQRVAMAISPSG SPSLG KLSPGSPEA+TLA WI +SY
Sbjct: 661 NNLQENVAGMACQYVRSVISSVQRVAMAISPSGISPSLGSKLSPGSPEAVTLAQWISQSY 720
Query: 721 SLQLGTELISSYSLES-DSLLKNLWNHQDAILCCSLKSLPVFMFANQAGLDMLETTLVAL 780
S LG+EL++ SL S DS+LK LW+HQDAILCCSLK PVFMFANQAGLDMLETTLVAL
Sbjct: 721 SHHLGSELLTIDSLGSDDSVLKLLWDHQDAILCCSLKPQPVFMFANQAGLDMLETTLVAL 780
Query: 781 QDITLDKIFDEAGRKALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQAIAWKVLEADE 840
QDITL+KIFDE+GRKA+CSDF KLMQQGFA LP GIC STMGRHVSYEQA+AWKV A E
Sbjct: 781 QDITLEKIFDESGRKAICSDFAKLMQQGFACLPSGICVSTMGRHVSYEQAVAWKVFAASE 840
Query: 841 ---TTVHCLAFSFINWSFV 855
+HCLAFSF+NWSFV
Sbjct: 841 ENNNNLHCLAFSFVNWSFV 842
BLAST of CmaCh02G008950 vs. TAIR 10
Match:
AT2G34710.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )
HSP 1 Score: 1110.5 bits (2871), Expect = 0.0e+00
Identity = 578/851 (67.92%), Positives = 676/851 (79.44%), Query Frame = 0
Query: 20 LDSTGKYVRYTSEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 79
LDS GKYVRYT EQVEALERVY ECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC
Sbjct: 21 LDS-GKYVRYTPEQVEALERVYTECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRC 80
Query: 80 REKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSQLVCENGFMRQQLQTVPAATTDA 139
REKQRKEA+RLQTVNRKLNAMNKLLMEENDRLQKQVS LV ENG M+ QL T TTD
Sbjct: 81 REKQRKEAARLQTVNRKLNAMNKLLMEENDRLQKQVSNLVYENGHMKHQLHTASGTTTDN 140
Query: 140 SCDSVVTT----------PQPSKTDANNPAGLLSIAEETLAEFLSKATGTAVDWVQMPGM 199
SC+SVV + PQ + DANNPAGLLSIAEE LAEFLSKATGTAVDWVQM GM
Sbjct: 141 SCESVVVSGQQHQQQNPNPQHQQRDANNPAGLLSIAEEALAEFLSKATGTAVDWVQMIGM 200
Query: 200 KPGPDSVGIFAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWFRDCRSLEVFTMFPAG 259
KPGPDS+GI AIS++C G+AARACGLVSLEP K+AEILKDRPSW RDCRS++ ++ PAG
Sbjct: 201 KPGPDSIGIVAISRNCSGIAARACGLVSLEPMKVAEILKDRPSWLRDCRSVDTLSVIPAG 260
Query: 260 NGGTIELVYTQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSLSGSGAGPSPAAAAQF 319
NGGTIEL+YTQ+YAPTTLA ARDFWTLRY+ LE+GS VVCERSL+ + GP+ ++ F
Sbjct: 261 NGGTIELIYTQMYAPTTLAAARDFWTLRYSTCLEDGSYVVCERSLTSATGGPTGPPSSNF 320
Query: 320 VRAEMLPSGYLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYESSKVVAQNMTIAALRY 379
VRAEM PSG+LIRPC+GGGSI+HIVDH++L+AW+VPEV+RPLYESSK++AQ MT+AALR+
Sbjct: 321 VRAEMKPSGFLIRPCDGGGSILHIVDHVDLDAWSVPEVMRPLYESSKILAQKMTVAALRH 380
Query: 380 VRQIAQETSGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNGWSLINCEGAEDVVLA 439
VRQIAQETSGEV YG GRQPAVLRTFSQRL RGFND VNGF D+GWS + +GAEDV +
Sbjct: 381 VRQIAQETSGEVQYGGGRQPAVLRTFSQRLCRGFNDAVNGFVDDGWSPMGSDGAEDVTVM 440
Query: 440 VNSTKNFGTTSNPANSL--TYPGGVLCAKASMLLQNVPPAVLVRFLREHRSEWADFNIDA 499
+N + S NS ++ GVLCAKASMLLQNVPPAVLVRFLREHRSEWAD+ +DA
Sbjct: 441 INLSPGKFGGSQYGNSFLPSFGSGVLCAKASMLLQNVPPAVLVRFLREHRSEWADYGVDA 500
Query: 500 YSAATLKANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRLEGHPMVQEDAFVSRD 559
Y+AA+L+A+ +A P R F +Q+I+PL T+EHEE LEV+RLEGH ED ++RD
Sbjct: 501 YAAASLRASPFAVPCARAGGFPSNQVILPLAQTVEHEESLEVVRLEGHAYSPEDMGLARD 560
Query: 560 IHLLQICSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIPLDSRTITLPPDAFLM 619
++LLQ+CSG++EN VG C++L+FAPIDE F DDAPLLPSGFRIIPL+
Sbjct: 561 MYLLQLCSGVDENVVGGCAQLVFAPIDESFADDAPLLPSGFRIIPLE------------- 620
Query: 620 QSDAQGALTTQRTLDLTSSLEVGSGTSNIAGDAS-SSQSPRSVLTIAFQFPFESSMQDNV 679
Q + RTLDL S+LE G++ AG+A + + RSVLTIAFQF F++ +D+V
Sbjct: 621 QKSTPNGASANRTLDLASALE---GSTRQAGEADPNGCNFRSVLTIAFQFTFDNHSRDSV 680
Query: 680 MNMAQQYVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLAHWICKSYSLQLGTE 739
+MA+QYVRS++ S+QRVA+AI+P GS ++ P P SPEALTL WI +SYSL G +
Sbjct: 681 ASMARQYVRSIVGSIQRVALAIAPRPGS-NISPISVPTSPEALTLVRWISRSYSLHTGAD 740
Query: 740 LISSYSLES-DSLLKNLWNHQDAILCCSLK--SLPVFMFANQAGLDMLETTLVALQDITL 799
L S S S D+LL LWNH DAILCCSLK + PVF FANQ GLDMLETTLVALQDI L
Sbjct: 741 LFGSDSQTSGDTLLHQLWNHSDAILCCSLKTNASPVFTFANQTGLDMLETTLVALQDIML 800
Query: 800 DKIFDEAGRKALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQAIAWKVLEADETTVHC 855
DK DE GRKALCS+FPK+MQQG+A+LP G+CAS+MGR VSYEQA WKVLE DE+ HC
Sbjct: 801 DKTLDEPGRKALCSEFPKIMQQGYAHLPAGVCASSMGRMVSYEQATVWKVLEDDESN-HC 852
BLAST of CmaCh02G008950 vs. TAIR 10
Match:
AT1G30490.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )
HSP 1 Score: 1070.8 bits (2768), Expect = 5.5e-313
Identity = 560/866 (64.67%), Positives = 667/866 (77.02%), Query Frame = 0
Query: 5 IAHHRESSSGSLTRHLDSTGKYVRYTSEQVEALERVYAECPKPSSLRRQQLIRECPILSN 64
+AHH S + DS GKYVRYT EQVEALERVYAECPKPSSLRRQQLIRECPIL N
Sbjct: 2 MAHHSMDDRDSPDKGFDS-GKYVRYTPEQVEALERVYAECPKPSSLRRQQLIRECPILCN 61
Query: 65 IEPKQIKVWFQNRRCREKQRKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSQLVCENGF 124
IEP+QIKVWFQNRRCREKQRKE++RLQTVNRKL+AMNKLLMEENDRLQKQVS LV ENGF
Sbjct: 62 IEPRQIKVWFQNRRCREKQRKESARLQTVNRKLSAMNKLLMEENDRLQKQVSNLVYENGF 121
Query: 125 MRQQLQTVPAATTDASCDSVVT----------TPQPSKTDANNPAGLLSIAEETLAEFLS 184
M+ ++ T TTD SC+SVV T Q + D NNPA LLSIAEETLAEFL
Sbjct: 122 MKHRIHTASGTTTDNSCESVVVSGQQRQQQNPTHQHPQRDVNNPANLLSIAEETLAEFLC 181
Query: 185 KATGTAVDWVQMPGMKPGPDSVGIFAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWF 244
KATGTAVDWVQM GMKPGPDS+GI A+S++C G+AARACGLVSLEP K+AEILKDRPSWF
Sbjct: 182 KATGTAVDWVQMIGMKPGPDSIGIVAVSRNCSGIAARACGLVSLEPMKVAEILKDRPSWF 241
Query: 245 RDCRSLEVFTMFPAGNGGTIELVYTQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSL 304
RDCR +E + P GNGGTIELV TQ+YAPTTLA ARDFWTLRY+ +LE+GS VVCERSL
Sbjct: 242 RDCRCVETLNVIPTGNGGTIELVNTQIYAPTTLAAARDFWTLRYSTSLEDGSYVVCERSL 301
Query: 305 SGSGAGPSPAAAAQFVRAEMLPSGYLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYES 364
+ + GP+ ++ FVRA+ML SG+LIRPC+GGGSIIHIVDH++L+ +VPEVLRPLYES
Sbjct: 302 TSATGGPNGPLSSSFVRAKMLSSGFLIRPCDGGGSIIHIVDHVDLDVSSVPEVLRPLYES 361
Query: 365 SKVVAQNMTIAALRYVRQIAQETSGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNG 424
SK++AQ MT+AALR+VRQIAQETSGEV Y GRQPAVLRTFSQRL RGFND VNGF D+G
Sbjct: 362 SKILAQKMTVAALRHVRQIAQETSGEVQYSGGRQPAVLRTFSQRLCRGFNDAVNGFVDDG 421
Query: 425 WSLINCEGAEDVVLAVNST--KNFGTTSNPANSLTYPGGVLCAKASMLLQNVPPAVLVRF 484
WS ++ +G ED+ + +NS+ K G+ + ++ GVLCAKASMLLQNVPP VL+RF
Sbjct: 422 WSPMSSDGGEDITIMINSSSAKFAGSQYGSSFLPSFGSGVLCAKASMLLQNVPPLVLIRF 481
Query: 485 LREHRSEWADFNIDAYSAATLKANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRL 544
LREHR+EWAD+ +DAYSAA+L+A YA P +R F +Q+I+PL T+EHEE LEV+RL
Sbjct: 482 LREHRAEWADYGVDAYSAASLRATPYAVPCVRTGGFPSNQVILPLAQTLEHEEFLEVVRL 541
Query: 545 EGHPMVQEDAFVSRDIHLLQICSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIP 604
GH ED +SRD++LLQ+CSG++EN VG C++L+FAPIDE F DDAPLLPSGFR+IP
Sbjct: 542 GGHAYSPEDMGLSRDMYLLQLCSGVDENVVGGCAQLVFAPIDESFADDAPLLPSGFRVIP 601
Query: 605 LDSRTITLPPDAFLMQSDAQGALTTQRTLDLTSSLEVGSGTSNIAGDASSSQSPRSVLTI 664
LD +T +D Q A RT DL SSL+ + T S + R VLTI
Sbjct: 602 LDQKT---------NPNDHQSA---SRTRDLASSLDGSTKT-------DSETNSRLVLTI 661
Query: 665 AFQFPFESSMQDNVMNMAQQYVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLA 724
AFQF F++ +DNV MA+QYVR+V+ S+QRVA+AI+P GS L P SPEALTL
Sbjct: 662 AFQFTFDNHSRDNVATMARQYVRNVVGSIQRVALAITPRPGSMQL-----PTSPEALTLV 721
Query: 725 HWICKSYSLQLGTELI--SSYSLESDSLLKNLWNHQDAILCCSLK--SLPVFMFANQAGL 784
WI +SYS+ G +L S S D+LLK LW+H DAILCCSLK + PVF FANQAGL
Sbjct: 722 RWITRSYSIHTGADLFGADSQSCGGDTLLKQLWDHSDAILCCSLKTNASPVFTFANQAGL 781
Query: 785 DMLETTLVALQDITLDKIFDEAGRKALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQA 844
DMLETTLVALQDI LDK D++GR+ALCS+F K+MQQG+A LP GIC S+MGR VSYEQA
Sbjct: 782 DMLETTLVALQDIMLDKTLDDSGRRALCSEFAKIMQQGYANLPAGICVSSMGRPVSYEQA 841
Query: 845 IAWKVLEADETTVHCLAFSFINWSFV 855
WKV++ +E+ HCLAF+ ++WSFV
Sbjct: 842 TVWKVVDDNESN-HCLAFTLVSWSFV 841
BLAST of CmaCh02G008950 vs. TAIR 10
Match:
AT1G52150.1 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )
HSP 1 Score: 1042.0 bits (2693), Expect = 2.7e-304
Identity = 554/842 (65.80%), Positives = 641/842 (76.13%), Query Frame = 0
Query: 24 GKYVRYTSEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCREKQ 83
GKYVRYT EQVEALER+Y +CPKPSS+RRQQLIRECPILSNIEPKQIKVWFQNRRCREKQ
Sbjct: 16 GKYVRYTPEQVEALERLYHDCPKPSSIRRQQLIRECPILSNIEPKQIKVWFQNRRCREKQ 75
Query: 84 RKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSQLVCENGFMRQQLQTVPAATTDASCDS 143
RKEASRLQ VNRKL AMNKLLMEENDRLQKQVSQLV EN + RQ D SC+S
Sbjct: 76 RKEASRLQAVNRKLTAMNKLLMEENDRLQKQVSQLVHENSYFRQHTPNPSLPAKDTSCES 135
Query: 144 VVTTPQPSKTDAN-----NPAGLLSIAEETLAEFLSKATGTAVDWVQMPGMKPGPDSVGI 203
VVT+ Q N +PAGLLSIAEETLAEFLSKATGTAV+WVQMPGMKPGPDS+GI
Sbjct: 136 VVTSGQHQLASQNPQRDASPAGLLSIAEETLAEFLSKATGTAVEWVQMPGMKPGPDSIGI 195
Query: 204 FAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWFRDCRSLEVFTMFPAGNGGTIELVY 263
AIS C GVAARACGLV LEP ++AEI+KDRPSWFR+CR++EV + P NGGT+EL+Y
Sbjct: 196 IAISHGCTGVAARACGLVGLEPTRVAEIVKDRPSWFRECRAVEVMNVLPTANGGTVELLY 255
Query: 264 TQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSLSGSGAGPSPAAAAQFVRAEMLPSG 323
Q+YAPTTLAP RDFW LRYT LE+GSLVVCERSL + GPS FVRAEML SG
Sbjct: 256 MQLYAPTTLAPPRDFWLLRYTSVLEDGSLVVCERSLKSTQNGPSMPLVQNFVRAEMLSSG 315
Query: 324 YLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYESSKVVAQNMTIAALRYVRQIAQET- 383
YLIRPC+GGGSIIHIVDH++LEA +VPEVLRPLYES KV+AQ T+AALR ++QIAQE
Sbjct: 316 YLIRPCDGGGSIIHIVDHMDLEACSVPEVLRPLYESPKVLAQKTTMAALRQLKQIAQEVT 375
Query: 384 -SGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNGWSLINCEGAEDVVLAVNST--K 443
+ V G GR+PA LR SQRLSRGFN+ VNGF D GWS+I + +DV + VNS+ K
Sbjct: 376 QTNSSVNGWGRRPAALRALSQRLSRGFNEAVNGFTDEGWSVIG-DSMDDVTITVNSSPDK 435
Query: 444 NFGTTSNPANSLT-YPGGVLCAKASMLLQNVPPAVLVRFLREHRSEWADFNIDAYSAATL 503
G AN VLCAKASMLLQNVPPA+L+RFLREHRSEWAD NIDAY AA +
Sbjct: 436 LMGLNLTFANGFAPVSNVVLCAKASMLLQNVPPAILLRFLREHRSEWADNNIDAYLAAAV 495
Query: 504 KANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRLEGHPMVQEDAFVSRDIHLLQI 563
K P G Q+I+PL HTIEHEE +EVI+LEG EDA V RDI LLQ+
Sbjct: 496 KVG----PCSARVGGFGGQVILPLAHTIEHEEFMEVIKLEGLGHSPEDAIVPRDIFLLQL 555
Query: 564 CSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIPLDSRTITLPPDAFLMQSDAQG 623
CSG++ENAVG C+ELIFAPID F DDAPLLPSGFRIIPLDS A+
Sbjct: 556 CSGMDENAVGTCAELIFAPIDASFADDAPLLPSGFRIIPLDS---------------AKE 615
Query: 624 ALTTQRTLDLTSSLEVGS-GTSNIAGDASSSQSPRSVLTIAFQFPFESSMQDNVMNMAQQ 683
+ RTLDL S+LE+GS GT + +S RSV+TIAF+F ES MQ++V +MA+Q
Sbjct: 616 VSSPNRTLDLASALEIGSAGTKASTDQSGNSTCARSVMTIAFEFGIESHMQEHVASMARQ 675
Query: 684 YVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLAHWICKSYSLQLGTELISSYS 743
YVR +ISSVQRVA+A+SPS S +G + G+PEA TLA WIC+SY +G EL+ S S
Sbjct: 676 YVRGIISSVQRVALALSPSHISSQVGLRTPLGTPEAQTLARWICQSYRGYMGVELLKSNS 735
Query: 744 LESDSLLKNLWNHQDAILCCSLKSLPVFMFANQAGLDMLETTLVALQDITLDKIFDEAGR 803
++S+LKNLW+H DAI+CCS+K+LPVF FANQAGLDMLETTLVALQDI+L+KIFD+ GR
Sbjct: 736 DGNESILKNLWHHTDAIICCSMKALPVFTFANQAGLDMLETTLVALQDISLEKIFDDNGR 795
Query: 804 KALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQAIAWKVLEADETTVHCLAFSFINWS 855
K LCS+FP++MQQGFA L GGIC S+MGR VSYE+A+AWKVL +E HC+ F FINWS
Sbjct: 796 KTLCSEFPQIMQQGFACLQGGICLSSMGRPVSYERAVAWKVLN-EEENAHCICFVFINWS 836
BLAST of CmaCh02G008950 vs. TAIR 10
Match:
AT1G52150.2 (Homeobox-leucine zipper family protein / lipid-binding START domain-containing protein )
HSP 1 Score: 1042.0 bits (2693), Expect = 2.7e-304
Identity = 554/842 (65.80%), Positives = 640/842 (76.01%), Query Frame = 0
Query: 24 GKYVRYTSEQVEALERVYAECPKPSSLRRQQLIRECPILSNIEPKQIKVWFQNRRCREKQ 83
GKYVRYT EQVEALER+Y +CPKPSS+RRQQLIRECPILSNIEPKQIKVWFQNRRCREKQ
Sbjct: 16 GKYVRYTPEQVEALERLYHDCPKPSSIRRQQLIRECPILSNIEPKQIKVWFQNRRCREKQ 75
Query: 84 RKEASRLQTVNRKLNAMNKLLMEENDRLQKQVSQLVCENGFMRQQLQTVPAATTDASCDS 143
RKEASRLQ VNRKL AMNKLLMEENDRLQKQVSQLV EN + RQ D SC+S
Sbjct: 76 RKEASRLQAVNRKLTAMNKLLMEENDRLQKQVSQLVHENSYFRQHTPNPSLPAKDTSCES 135
Query: 144 VVTTPQPSKTDAN-----NPAGLLSIAEETLAEFLSKATGTAVDWVQMPGMKPGPDSVGI 203
VVT+ Q N +PAGLLSIAEETLAEFLSKATGTAV+WVQMPGMKPGPDS+GI
Sbjct: 136 VVTSGQHQLASQNPQRDASPAGLLSIAEETLAEFLSKATGTAVEWVQMPGMKPGPDSIGI 195
Query: 204 FAISQSCGGVAARACGLVSLEPAKIAEILKDRPSWFRDCRSLEVFTMFPAGNGGTIELVY 263
AIS C GVAARACGLV LEP ++AEI+KDRPSWFR+CR++EV + P NGGT+EL+Y
Sbjct: 196 IAISHGCTGVAARACGLVGLEPTRVAEIVKDRPSWFRECRAVEVMNVLPTANGGTVELLY 255
Query: 264 TQVYAPTTLAPARDFWTLRYTITLENGSLVVCERSLSGSGAGPSPAAAAQFVRAEMLPSG 323
Q+YAPTTLAP RDFW LRYT LE+GSLVVCERSL + GPS FVRAEML SG
Sbjct: 256 MQLYAPTTLAPPRDFWLLRYTSVLEDGSLVVCERSLKSTQNGPSMPLVQNFVRAEMLSSG 315
Query: 324 YLIRPCEGGGSIIHIVDHLNLEAWNVPEVLRPLYESSKVVAQNMTIAALRYVRQIAQET- 383
YLIRPC+GGGSIIHIVDH++LEA +VPEVLRPLYES KV+AQ T+AALR ++QIAQE
Sbjct: 316 YLIRPCDGGGSIIHIVDHMDLEACSVPEVLRPLYESPKVLAQKTTMAALRQLKQIAQEVT 375
Query: 384 -SGEVVYGLGRQPAVLRTFSQRLSRGFNDLVNGFNDNGWSLINCEGAEDVVLAVNST--K 443
+ V G GR+PA LR SQRLSRGFN+ VNGF D GWS+I + +DV + VNS+ K
Sbjct: 376 QTNSSVNGWGRRPAALRALSQRLSRGFNEAVNGFTDEGWSVIG-DSMDDVTITVNSSPDK 435
Query: 444 NFGTTSNPANSLT-YPGGVLCAKASMLLQNVPPAVLVRFLREHRSEWADFNIDAYSAATL 503
G AN VLCAKASMLLQNVPPA+L+RFLREHRSEWAD NIDAY AA +
Sbjct: 436 LMGLNLTFANGFAPVSNVVLCAKASMLLQNVPPAILLRFLREHRSEWADNNIDAYLAAAV 495
Query: 504 KANSYAYPGMRPTRFTGSQIIMPLGHTIEHEELLEVIRLEGHPMVQEDAFVSRDIHLLQI 563
K P G Q+I+PL HTIEHEE +EVI+LEG EDA V RDI LLQ+
Sbjct: 496 KVG----PCSARVGGFGGQVILPLAHTIEHEEFMEVIKLEGLGHSPEDAIVPRDIFLLQL 555
Query: 564 CSGINENAVGACSELIFAPIDEMFPDDAPLLPSGFRIIPLDSRTITLPPDAFLMQSDAQG 623
CSG++ENAVG C+ELIFAPID F DDAPLLPSGFRIIPLDS Q
Sbjct: 556 CSGMDENAVGTCAELIFAPIDASFADDAPLLPSGFRIIPLDSA--------------KQE 615
Query: 624 ALTTQRTLDLTSSLEVGS-GTSNIAGDASSSQSPRSVLTIAFQFPFESSMQDNVMNMAQQ 683
+ RTLDL S+LE+GS GT + +S RSV+TIAF+F ES MQ++V +MA+Q
Sbjct: 616 VSSPNRTLDLASALEIGSAGTKASTDQSGNSTCARSVMTIAFEFGIESHMQEHVASMARQ 675
Query: 684 YVRSVISSVQRVAMAISPSGGSPSLGPKLSPGSPEALTLAHWICKSYSLQLGTELISSYS 743
YVR +ISSVQRVA+A+SPS S +G + G+PEA TLA WIC+SY +G EL+ S S
Sbjct: 676 YVRGIISSVQRVALALSPSHISSQVGLRTPLGTPEAQTLARWICQSYRGYMGVELLKSNS 735
Query: 744 LESDSLLKNLWNHQDAILCCSLKSLPVFMFANQAGLDMLETTLVALQDITLDKIFDEAGR 803
++S+LKNLW+H DAI+CCS+K+LPVF FANQAGLDMLETTLVALQDI+L+KIFD+ GR
Sbjct: 736 DGNESILKNLWHHTDAIICCSMKALPVFTFANQAGLDMLETTLVALQDISLEKIFDDNGR 795
Query: 804 KALCSDFPKLMQQGFAYLPGGICASTMGRHVSYEQAIAWKVLEADETTVHCLAFSFINWS 855
K LCS+FP++MQQGFA L GGIC S+MGR VSYE+A+AWKVL +E HC+ F FINWS
Sbjct: 796 KTLCSEFPQIMQQGFACLQGGICLSSMGRPVSYERAVAWKVLN-EEENAHCICFVFINWS 837
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9SE43 | 0.0e+00 | 80.44 | Homeobox-leucine zipper protein REVOLUTA OS=Arabidopsis thaliana OX=3702 GN=REV ...
A2XBL9 | 0.0e+00 | 70.16 | Homeobox-leucine zipper protein HOX10 OS=Oryza sativa subsp. indica OX=39946 GN=...
A2Z8L4 | 0.0e+00 | 70.31 | Homeobox-leucine zipper protein HOX9 OS=Oryza sativa subsp. indica OX=39946 GN=H...
Q9AV49 | 0.0e+00 | 70.31 | Homeobox-leucine zipper protein HOX9 OS=Oryza sativa subsp. japonica OX=39947 GN...
Q6TAQ6 | 0.0e+00 | 70.05 | Homeobox-leucine zipper protein HOX10 OS=Oryza sativa subsp. japonica OX=39947 G...
Match Name | E-value | Identity (%) | Description
AT5G60690.1 | 0.0e+00 | 80.44 | Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
AT2G34710.1 | 0.0e+00 | 67.92 | Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
AT1G30490.1 | 5.5e-313 | 64.67 | Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
AT1G52150.1 | 2.7e-304 | 65.80 | Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
AT1G52150.2 | 2.7e-304 | 65.80 | Homeobox-leucine zipper family protein / lipid-binding START domain-containing p...
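
The Identity values in the summary tables above match the identity fraction reported on each HSP summary line, expressed as a percentage (for AT2G34710.1, 578 identical positions over an alignment length of 851 gives 67.92). The following is a minimal Python sketch of that arithmetic, assuming the HSP line keeps the "Identity = matches/length (percent%)" layout shown in this report. The names `parse_identity` and `percent_identity` are hypothetical helpers introduced for illustration and are not part of any BLAST tool.

```python
import re

# Assumed layout of the identity field in an HSP summary line, e.g.
# "Identity = 578/851 (67.92%)". Other BLAST front ends may format it differently.
IDENTITY_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_identity(hsp_line):
    """Return (matches, alignment_length, reported_percent), or None if absent."""
    m = IDENTITY_RE.search(hsp_line)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2)), float(m.group(3))


def percent_identity(matches, length):
    """Identical positions as a percentage of alignment length, to 2 decimals."""
    return round(100.0 * matches / length, 2)


if __name__ == "__main__":
    line = "HSP 1 Identity = 578/851 (67.92%), Query Frame = 0"
    matches, length, reported = parse_identity(line)
    # Recompute the percentage from the raw counts and compare with the report.
    print(matches, length, reported, percent_identity(matches, length))
```

Recomputing the percentage from the raw counts is a quick consistency check between an alignment block and its row in the summary table. The same counts also show that the denominator is the alignment length including gaps, not the query length.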