Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAGCACCTACGAAAGCAAAATCACGCAGCTGAATCCGTTAAAAAAATTTCAGAGAAAAAAAAGGAAAAAAAATGCAAGTCTTCTCTTCATCTCCTTCAACCGCAAGCTTCAATCTGTTCTCTTCATCTCAAAATCCGAACAATCTCTCGAAATACAGATGCAAAAATTTGATTATACCTCTCCATTTCGCTCCGAGCTCCTTCTCCGTTTGCTGTTCTTCGAGGGAGTCCTCGACGGCGGTTCTCTCCTCCGATATTACGTTGGACGGCGAGAAAAGAGAGGCGTCGGAGGAGGTTCTTAGAGTTCGGCGGCCGGTAATGGAGTTTACCGGTGAAGATTCCGGCGACGGCGAGGAGGCGGAAGATGAGAGGAGTTCTTCGGTGATAGAACTAGGGCTAGCAGAGATTGCGAAGAAGATGCCGATATTTGAACCGGAGAATCGGGTGGATTCGAGTGCCCTAGAAAGGCCGCTGATTATCAATTTGGATTTGGCGTTGTACAAAGCGAAGATGATGGCGAGGAATTTTCTGTATGATGAAGCACGGCAAGTCCTTCAGAAGGTATTTTTCTCTCAGAAATTTAGGATTCCTTTATTAATTTTAATCCTTCGTAACTTCACAAAGTTAGTTTTTGGTTCTATTATATATGAAATTCAAAAAGTACTTTCAACTATTTAGAAGCATTTTTTTTTTATAATCAAATATTGGGGCTGAATTTAGCGCGTGATTTGGTTGGGAAAGTGTATAGATAAATGGCCGGAGGATGGGCGGGCATACGTAGCATTGGGGAAGATGTTGAGCAAGCAAATGAAAGCCGCCGACGCCAGAAATGTGTATGAGAGAGGCTGCCAAGCCACCCAAGGCGAGAACTCCTACATTTGGCAGGTCAGTTTGAATCCTGCCTTTTATGAGCCCAAAGTTATCTCTCTAGTTTTTCCTGATGGTAGTAATAGTTTGTACATACAGGTTCCAAAGCTGTTGATTCGTTATCTCTCTAGTTTTTCCTGATGGTAGTAATAGTTTGTACATACAGGTTCCAAAGCTGTTGATTCAAGGTTATAGAAATCACACAATTATATATGATATTATCCATTTTGAGTATATAAACTCTCGTAGTTTTACTTTTGGTTTCCCCCAAAAGGCATCATACCAATGGAGATGTATTTCTTACTTATAAATCCAAGATCGTCTTCTTGATTAGCCAATGTGGGACTCCCTCCCAATAATCCTCCTCAAACAAAATACACCATAGAGCCTCCCTTGAGACCTATAAAGCCCTTGAACAACCTCCTCTTAATTGAGGCTTAACTCCTATCTCTGCTCGAACAAAGTACACCCTTTGTTCGACATTTGAGGATTCTATTGACATGTTGAGTTAAGGGCACAACTAGGAATCACGGACTTCCACAATGGTATGATATTGTCCACTTTGAGCATACGCTCTCATGAATGTGCTTTTAGTGACGGACTTCCACAATGGTATGATATTGTCCACTTTGAGCATACGCTCTCATGAATGTGCTTTTAGTGTCCCCAAGAGATTTCATGCTAATGGAGATGTATTTCTTACTTATAAATCCATTGATCGTCCTCTTAATTAGCAAGAGATTTCATGCTAATGGAGATGTATTTCTTACTTATAAATCCATTGATCGTCCTCTTAATTAGCCAATGTAGAACTCCCTCTCAACAATCTTCCACCCATTATATGCTAAACTTAGGGTCTCAAAACCCTTGTTGTCTCAGTATAGCTTTCTTGAAGACGAGTAGAGTGGAGTGTGAAAAAAAAGATCTTTTACCCTTGTTGTCTCAGTATAGCTTTCTTGAAGACGAGTAGAGTGGAGTGTGAAAAAAAAGATCTTTTTGTGTAAAACTGATACACCAATTCATAAGCAGTTTTGGTCAAAGAAACCAACTGAAACCGACCCGCTTACACCCCTTGATAGACCAAAAAAAAAAGCAGTTTTGGTCAAAGAAACCAACTGAAACCGACCCGCTTACACCCCTTGATAGACCAAAAAAAACAAGCTGAATTGGGCTATCATGAAATATTTGGAAACAGGTTTATGTTTCATCGTCTCAAATTTTAGCCACTAATTGCACACTTGATATAGTGCTGGGCTGTTCTGGAGAGCAGGATGGGGAATATCAGGAAAGCAAGAGAGCTCTTTGATGCAGCCACAGTAGCCAACAAGAAGCACATTGCCGCATGGCATGGCTGGGCTGTGCTAGAGCTAAAGCAGGGGAACCTCAAGAAGGCCAGGAATCTACTAGCCAAAGGCCTCAAATACTGTGGTGGGAATGAGTATATCTATCAAACACTCGCCCTGCTCGAAGCCAAATCGAATCGCTACGAGCAGGCGTGGGAATGAGTATATCTATCAAACACTCGCCCTGCTCGAAGCCAAATCGAATCGCTACGAGCAGGCACGTTATTTGTTTAAGCAGGCTACCAAGTGCAACCCCAAAAGCTGTGCTAGTTGGCTTGTGAGTTCTTTTACATCCCATTGTCCATTGCCCTGATTCTGTTTTGAAATTTGATCATTACTTTCATCTCAGTTCTTTTACATCCCATTGTCCATTGCCCTGATTCTGTTTTGAAATTTGATCATTACTTTCATCTCCCAAATCCCCTCAGTAATTTGGACATGATCATGGTATGGCAGGCATGGGCGCAGCTGGAGATGCAGCGGGAGAACAACCTTCTTGCTAGACAACTATTTGAGGTGATTTTGATTTCCCTTCTCTTGTTTAACTGTTGATGAGTTCTTAGTAGATCCTTTCATTTCCCTCATTAACAAGTTCGTTTGTTTCTTTGTTACCTAGAAAGCCATCCAGGCGAGCCCCAAGAACAGATTTGCGTGGCACATATGGGGACTCTTTGAAGCTAATATAGGAAATATCGAGAAGGGAAGGAAACTTTTAAAGATAGGCCATGTCCTAAATCCAAGAGACCCTGTTCTTCTTCAGTCTCTTGGTTTATTGGAGTATAAGAACTCCTCTGCAAGCATGGCTCGAGTTTTGTTTAGGAGAGCATCTGAACTGGACCCCAAGCACCAACCAGTGTGGATTGTATGTTAATATCCATTCCCTTGTAGAAGGATTTTCCACTCAAAATCTGATCAGATCTTTTGTTGAGGTTCCATTTTCCTTGACAAAACTTAAACCATGGCTTATGCAGGCTTGGGGATGGATGGAATGGAAAGAAGGTAACATAGGGAAGGCAAGGGAGCTGTATCAAAGAGCATTGTTAATTGACTCAGCTAGTGAGAGTGCAGCTCGATGCCTTCAGGTAGACAAATGCTCGTTGTTGAATTATAAATTTGGTTTCAAAATTTTGATGGTTAGCCTGGGCGTCGCACTGCAACGGCTCCAAACCCACTGCTAGCCAATATTGTTCTTTTTGGGCTTTCTCTTTCAAGCTTTCCCTCAAGGTTTTTAAAATGCATCTACTATGGAGAGGATCTAGTCCTACTTCAATCAGTGCCTCGCATCATTCAGTGACTGGCTCTGATACCATTTATAATAGTTCATGTCCACCAATAGTAGATATTGTCTGCTTTGACCCGTTACGTATCGCTGTCAGTCTCAAGGTTTTAAAACGCGCATGTTAGGGAGAGGTTTCCACAATCTTATAAAGAATGCTATGTTCCTCTCTCCAACTAATGTGAGATCTCACAACTCACCTTCCTTAGGGACCTAGCAACCTCATTAGCATACCGCTTGGAACTTGGCTCTGATACCATTTGTAACCTTCCTTAGGGACCTAGCAACCTCATTAGCATACCGCTTGGAACTTGGCTCTGATACCATTTGTAACAGTGCAAGCCTACCACTAGCAGATGTTGTCCTCTTTGGGTGGTGTGTCAACGAGGACGCTAGGCCCCCAAGGGGAGTGAATTGTGAGACACGTGTTTTAAAACCGTAAGGCTAACGGTGATACGCAACATGCCAAAGTGGACAATATCTGTTAGCGGTGAGTTTGGATTGTTACAAATGGTATCAGAGCCTGTTCAATGAACAGTGTGCCAACAAGGACACTGGACCCCCAAGGAGGGTGGATTGTGAGATCCCACATCGGTTGGAGAGGGGAACGAAGCATTCCTTATAAGGGTGTGGAAATCATTTCCTAACAAACACGCTTTAAAACTGTAAGGTTGACGGGGATATGTAACGAGCCAAAATGGACTAGCGGTAAACTTGGGCTGTTACACGGTTACAAACATTGCAAGATGGATGATTGGTAGGAGGAAGTTGAGCGGTAAACTTGGGCTGTTACACGGTTACAAACATTGCAAGATGGATGATTGGTAGGAGGAAGTTGAAAATGATAGGAATACTTGTTGGAGTCATGTTCTAATTTTGTTATCTAAGTTCGAGAACCTTTAATTTCTGCTTGGAATCGAAATATTACTTACTTTGATTGCTTTAATACAGGCTTGGGGTGTTCTAGAACAGAGAGCAGGCAACCTATCAGCAGCTAGAAGATTATATAGATCCTCTCTAAGCATAAACTCTCAGAGTTATGTAACATGGATGACCTGGGCAGCACTGGAAGATGATCAAGGGAACGCAATCCGAGCGGAGGAAATTCGAAATCTATACTTCCAGCAGGTCAGAACCTTGTTTTTTGAAATTTGAGTTAGTTTTTCTGAACCACTGTTAAGTAAATGAAGGCAGTGTTGCAACAGAGAACAGAAGTTGTGGATGATGCTTCGTGGGTTATGGGGATCTTAGACGTTATTGACCCAGCACTTGATAGCATAAAGAGGCTGTTGAAGCTGGAGCAAGACCCCTTCGCCATGTCAAGAGTAGCAGATGGAGGCGCTAGAAACAGCTCTTTAGACGACTCGGGTGCCTCCTCTAGTGTGGCTGACAGTGAAACTGGGTTTGATTTGGATGCCTTTATGATGAAAAAGTTGTCGATAGACACGTCGAAACTCGAAATTCAACTCGAAACAACTCGACCCAAAAGATTTAAGTATCAAAGAAGTCAAATGAGATCAGAAAACAGACCAGAAATGGCTGTTTCACAGAGCCAAAGAACAGCATCTTCATCTACTTGAGGATCGATCTTATAGTTCATTCCAGAGAAAACAAGGTTCTTAATGGAACAAATGTTTTGTAAGTCTAAGCTTCCCTCGTTCATAAAGTTATTATTATAGCTGCAAGGATCAAGGAAGAAAACACTTCTTTGATGGAGTTCTACATGTCTAAAAGAAGCCATAATGTAAATAACTTCGTAAGTTTATAATTCTTGAGGTTCTCTGAGTCTGATCTCTCTAGTGCTTCTTAATGGTTTGAG
mRNA sequence
AGAGCACCTACGAAAGCAAAATCACGCAGCTGAATCCGTTAAAAAAATTTCAGAGAAAAAAAAGGAAAAAAAATGCAAGTCTTCTCTTCATCTCCTTCAACCGCAAGCTTCAATCTGTTCTCTTCATCTCAAAATCCGAACAATCTCTCGAAATACAGATGCAAAAATTTGATTATACCTCTCCATTTCGCTCCGAGCTCCTTCTCCGTTTGCTGTTCTTCGAGGGAGTCCTCGACGGCGGTTCTCTCCTCCGATATTACGTTGGACGGCGAGAAAAGAGAGGCGTCGGAGGAGGTTCTTAGAGTTCGGCGGCCGGTAATGGAGTTTACCGGTGAAGATTCCGGCGACGGCGAGGAGGCGGAAGATGAGAGGAGTTCTTCGGTGATAGAACTAGGGCTAGCAGAGATTGCGAAGAAGATGCCGATATTTGAACCGGAGAATCGGGTGGATTCGAGTGCCCTAGAAAGGCCGCTGATTATCAATTTGGATTTGGCGTTGTACAAAGCGAAGATGATGGCGAGGAATTTTCTGTATGATGAAGCACGGCAAGTCCTTCAGAAGTGTATAGATAAATGGCCGGAGGATGGGCGGGCATACGTAGCATTGGGGAAGATGTTGAGCAAGCAAATGAAAGCCGCCGACGCCAGAAATGTGTATGAGAGAGGCTGCCAAGCCACCCAAGGCGAGAACTCCTACATTTGGCAGTGCTGGGCTGTTCTGGAGAGCAGGATGGGGAATATCAGGAAAGCAAGAGAGCTCTTTGATGCAGCCACAGTAGCCAACAAGAAGCACATTGCCGCATGGCATGGCTGGGCTGTGCTAGAGCTAAAGCAGGGGAACCTCAAGAAGGCCAGGAATCTACTAGCCAAAGGCCTCAAATACTGTGGTGGGAATGAGCGTGGGAATGAGTATATCTATCAAACACTCGCCCTGCTCGAAGCCAAATCGAATCGCTACGAGCAGGCACGTTATTTGTTTAAGCAGGCTACCAAGTGCAACCCCAAAAGCTGTGCTAGTTGGCTTGCATGGGCGCAGCTGGAGATGCAGCGGGAGAACAACCTTCTTGCTAGACAACTATTTGAGAAAGCCATCCAGGCGAGCCCCAAGAACAGATTTGCGTGGCACATATGGGGACTCTTTGAAGCTAATATAGGAAATATCGAGAAGGGAAGGAAACTTTTAAAGATAGGCCATGTCCTAAATCCAAGAGACCCTGTTCTTCTTCAGTCTCTTGGTTTATTGGAGTATAAGAACTCCTCTGCAAGCATGGCTCGAGTTTTGTTTAGGAGAGCATCTGAACTGGACCCCAAGCACCAACCAGTGTGGATTGCTTGGGGATGGATGGAATGGAAAGAAGGTAACATAGGGAAGGCAAGGGAGCTGTATCAAAGAGCATTGTTAATTGACTCAGCTAGTGAGAGTGCAGCTCGATGCCTTCAGGCTTGGGGTGTTCTAGAACAGAGAGCAGGCAACCTATCAGCAGCTAGAAGATTATATAGATCCTCTCTAAGCATAAACTCTCAGAGTTATGTAACATGGATGACCTGGGCAGCACTGGAAGATGATCAAGGGAACGCAATCCGAGCGGAGGAAATTCGAAATCTATACTTCCAGCAGAGAACAGAAGTTGTGGATGATGCTTCGTGGGTTATGGGGATCTTAGACGTTATTGACCCAGCACTTGATAGCATAAAGAGGCTGTTGAAGCTGGAGCAAGACCCCTTCGCCATGTCAAGAGTAGCAGATGGAGGCGCTAGAAACAGCTCTTTAGACGACTCGGGTGCCTCCTCTAGTGTGGCTGACAGTGAAACTGGGTTTGATTTGGATGCCTTTATGATGAAAAAGTTGTCGATAGACACGTCGAAACTCGAAATTCAACTCGAAACAACTCGACCCAAAAGATTTAAGTATCAAAGAAGTCAAATGAGATCAGAAAACAGACCAGAAATGGCTGTTTCACAGAGCCAAAGAACAGCATCTTCATCTACTTGAGGATCGATCTTATAGTTCATTCCAGAGAAAACAAGGTTCTTAATGGAACAAATGTTTTGTAAGTCTAAGCTTCCCTCGTTCATAAAGTTATTATTATAGCTGCAAGGATCAAGGAAGAAAACACTTCTTTGATGGAGTTCTACATGTCTAAAAGAAGCCATAATGTAAATAACTTCGTAAGTTTATAATTCTTGAGGTTCTCTGAGTCTGATCTCTCTAGTGCTTCTTAATGGTTTGAG
Coding sequence (CDS)
ATGCAAGTCTTCTCTTCATCTCCTTCAACCGCAAGCTTCAATCTGTTCTCTTCATCTCAAAATCCGAACAATCTCTCGAAATACAGATGCAAAAATTTGATTATACCTCTCCATTTCGCTCCGAGCTCCTTCTCCGTTTGCTGTTCTTCGAGGGAGTCCTCGACGGCGGTTCTCTCCTCCGATATTACGTTGGACGGCGAGAAAAGAGAGGCGTCGGAGGAGGTTCTTAGAGTTCGGCGGCCGGTAATGGAGTTTACCGGTGAAGATTCCGGCGACGGCGAGGAGGCGGAAGATGAGAGGAGTTCTTCGGTGATAGAACTAGGGCTAGCAGAGATTGCGAAGAAGATGCCGATATTTGAACCGGAGAATCGGGTGGATTCGAGTGCCCTAGAAAGGCCGCTGATTATCAATTTGGATTTGGCGTTGTACAAAGCGAAGATGATGGCGAGGAATTTTCTGTATGATGAAGCACGGCAAGTCCTTCAGAAGTGTATAGATAAATGGCCGGAGGATGGGCGGGCATACGTAGCATTGGGGAAGATGTTGAGCAAGCAAATGAAAGCCGCCGACGCCAGAAATGTGTATGAGAGAGGCTGCCAAGCCACCCAAGGCGAGAACTCCTACATTTGGCAGTGCTGGGCTGTTCTGGAGAGCAGGATGGGGAATATCAGGAAAGCAAGAGAGCTCTTTGATGCAGCCACAGTAGCCAACAAGAAGCACATTGCCGCATGGCATGGCTGGGCTGTGCTAGAGCTAAAGCAGGGGAACCTCAAGAAGGCCAGGAATCTACTAGCCAAAGGCCTCAAATACTGTGGTGGGAATGAGCGTGGGAATGAGTATATCTATCAAACACTCGCCCTGCTCGAAGCCAAATCGAATCGCTACGAGCAGGCACGTTATTTGTTTAAGCAGGCTACCAAGTGCAACCCCAAAAGCTGTGCTAGTTGGCTTGCATGGGCGCAGCTGGAGATGCAGCGGGAGAACAACCTTCTTGCTAGACAACTATTTGAGAAAGCCATCCAGGCGAGCCCCAAGAACAGATTTGCGTGGCACATATGGGGACTCTTTGAAGCTAATATAGGAAATATCGAGAAGGGAAGGAAACTTTTAAAGATAGGCCATGTCCTAAATCCAAGAGACCCTGTTCTTCTTCAGTCTCTTGGTTTATTGGAGTATAAGAACTCCTCTGCAAGCATGGCTCGAGTTTTGTTTAGGAGAGCATCTGAACTGGACCCCAAGCACCAACCAGTGTGGATTGCTTGGGGATGGATGGAATGGAAAGAAGGTAACATAGGGAAGGCAAGGGAGCTGTATCAAAGAGCATTGTTAATTGACTCAGCTAGTGAGAGTGCAGCTCGATGCCTTCAGGCTTGGGGTGTTCTAGAACAGAGAGCAGGCAACCTATCAGCAGCTAGAAGATTATATAGATCCTCTCTAAGCATAAACTCTCAGAGTTATGTAACATGGATGACCTGGGCAGCACTGGAAGATGATCAAGGGAACGCAATCCGAGCGGAGGAAATTCGAAATCTATACTTCCAGCAGAGAACAGAAGTTGTGGATGATGCTTCGTGGGTTATGGGGATCTTAGACGTTATTGACCCAGCACTTGATAGCATAAAGAGGCTGTTGAAGCTGGAGCAAGACCCCTTCGCCATGTCAAGAGTAGCAGATGGAGGCGCTAGAAACAGCTCTTTAGACGACTCGGGTGCCTCCTCTAGTGTGGCTGACAGTGAAACTGGGTTTGATTTGGATGCCTTTATGATGAAAAAGTTGTCGATAGACACGTCGAAACTCGAAATTCAACTCGAAACAACTCGACCCAAAAGATTTAAGTATCAAAGAAGTCAAATGAGATCAGAAAACAGACCAGAAATGGCTGTTTCACAGAGCCAAAGAACAGCATCTTCATCTACTTGA
Protein sequence
MQVFSSSPSTASFNLFSSSQNPNNLSKYRCKNLIIPLHFAPSSFSVCCSSRESSTAVLSSDITLDGEKREASEEVLRVRRPVMEFTGEDSGDGEEAEDERSSSVIELGLAEIAKKMPIFEPENRVDSSALERPLIINLDLALYKAKMMARNFLYDEARQVLQKCIDKWPEDGRAYVALGKMLSKQMKAADARNVYERGCQATQGENSYIWQCWAVLESRMGNIRKARELFDAATVANKKHIAAWHGWAVLELKQGNLKKARNLLAKGLKYCGGNERGNEYIYQTLALLEAKSNRYEQARYLFKQATKCNPKSCASWLAWAQLEMQRENNLLARQLFEKAIQASPKNRFAWHIWGLFEANIGNIEKGRKLLKIGHVLNPRDPVLLQSLGLLEYKNSSASMARVLFRRASELDPKHQPVWIAWGWMEWKEGNIGKARELYQRALLIDSASESAARCLQAWGVLEQRAGNLSAARRLYRSSLSINSQSYVTWMTWAALEDDQGNAIRAEEIRNLYFQQRTEVVDDASWVMGILDVIDPALDSIKRLLKLEQDPFAMSRVADGGARNSSLDDSGASSSVADSETGFDLDAFMMKKLSIDTSKLEIQLETTRPKRFKYQRSQMRSENRPEMAVSQSQRTASSST
Homology
BLAST of CmaCh06G015620 vs. ExPASy Swiss-Prot
Match:
Q8RWG2 (Protein high chlorophyll fluorescent 107 OS=Arabidopsis thaliana OX=3702 GN=HCF107 PE=1 SV=1)
HSP 1 Score: 755.7 bits (1950), Expect = 4.1e-217
Identity = 402/643 (62.52%), Postives = 488/643 (75.89%), Query Frame = 0
Query: 5 SSSPS---TASFNL-FSSSQNPNNLSKYRCK------------NLIIPLHFAPSSFSVCC 64
SSSPS T+SF+L F + Q P NL K K L P + +++
Sbjct: 11 SSSPSPANTSSFSLSFLTPQIPENLCKSPTKIHIGTHGISGQSFLSHPTFSSKNTYLYAV 70
Query: 65 SSRESSTAVLSSDITLDGEKREASEE--VLRVRRPVMEFTGEDSGDGEEAEDERSSSVIE 124
R SS + +GE E++ E VL VRRP++E + ++S E E ++ + I+
Sbjct: 71 VDRSSSGVFSPQKESANGEGEESNTEEGVLVVRRPLLENSDKES---SEEEGKKYPARID 130
Query: 125 LGLAEIAKKMPIFEPENRVDSS---------ALERPLIINLDLALYKAKMMARNFLYDEA 184
GL+ IAKKMPIFEPE SS A ERPL +NLDL+LYKAK++ARNF Y +A
Sbjct: 131 AGLSNIAKKMPIFEPERSESSSSSSAAAAARAQERPLAVNLDLSLYKAKVLARNFRYKDA 190
Query: 185 RQVLQKCIDKWPEDGRAYVALGKMLSKQMKAADARNVYERGCQATQGENSYIWQCWAVLE 244
++L+KCI WPEDGR YVALGK+LSKQ K A+AR +YE+GCQ+TQGENSYIWQCWAVLE
Sbjct: 191 EKILEKCIAYWPEDGRPYVALGKILSKQSKLAEARILYEKGCQSTQGENSYIWQCWAVLE 250
Query: 245 SRMGNIRKARELFDAATVANKKHIAAWHGWAVLELKQGNLKKARNLLAKGLKYCGGNERG 304
+R+GN+R+ARELFDAATVA+KKH+AAWHGWA LE+KQGN+ KARNLLAKGLK+CG
Sbjct: 251 NRLGNVRRARELFDAATVADKKHVAAWHGWANLEIKQGNISKARNLLAKGLKFCG----R 310
Query: 305 NEYIYQTLALLEAKSNRYEQARYLFKQATKCNPKSCASWLAWAQLEMQRENNLLARQLFE 364
NEYIYQTLALLEAK+ RYEQARYLFKQAT CN +SCASWLAWAQLE+Q+E AR+LFE
Sbjct: 311 NEYIYQTLALLEAKAGRYEQARYLFKQATICNSRSCASWLAWAQLEIQQERYPAARKLFE 370
Query: 365 KAIQASPKNRFAWHIWGLFEANIGNIEKGRKLLKIGHVLNPRDPVLLQSLGLLEYKNSSA 424
KA+QASPKNRFAWH+WG+FEA +GN+E+GRKLLKIGH LNPRDPVLLQSLGLLEYK+SSA
Sbjct: 371 KAVQASPKNRFAWHVWGVFEAGVGNVERGRKLLKIGHALNPRDPVLLQSLGLLEYKHSSA 430
Query: 425 SMARVLFRRASELDPKHQPVWIAWGWMEWKEGNIGKARELYQRALLIDSASESAARCLQA 484
++AR L RRASELDP+HQPVWIAWGWMEWKEGN ARELYQRAL ID+ +ESA+RCLQA
Sbjct: 431 NLARALLRRASELDPRHQPVWIAWGWMEWKEGNTTTARELYQRALSIDANTESASRCLQA 490
Query: 485 WGVLEQRAGNLSAARRLYRSSLSINSQSYVTWMTWAALEDDQGNAIRAEEIRNLYFQQRT 544
WGVLEQRAGNLSAARRL+RSSL+INSQSYVTWMTWA LE+DQG+ RAEEIRNLYFQQRT
Sbjct: 491 WGVLEQRAGNLSAARRLFRSSLNINSQSYVTWMTWAQLEEDQGDTERAEEIRNLYFQQRT 550
Query: 545 EVVDDASWVMGILDVIDPALDSIKRLLKLEQDP---------FAMSRVADGGARNSSLDD 604
EVVDDASWV G LD+IDPALD++KRLL Q+ M+R D ++++ +
Sbjct: 551 EVVDDASWVTGFLDIIDPALDTVKRLLNFGQNNDNNRLTTTLRNMNRTKD--SQSNQQPE 610
Query: 605 SGASSSVADSETGFDLDAFMMKKLSIDTSKLEIQLETTRPKRF 612
S A ++ +GF+LD F+ KLS+D KL++ L++ R +RF
Sbjct: 611 SSAGREDIETGSGFNLDVFLRSKLSLDPLKLDVNLDSKRLERF 644
BLAST of CmaCh06G015620 vs. ExPASy Swiss-Prot
Match:
Q9FNS4 (PsbB mRNA maturation factor Mbb1, chloroplastic OS=Chlamydomonas reinhardtii OX=3055 GN=MBB1 PE=2 SV=1)
HSP 1 Score: 270.4 bits (690), Expect = 5.2e-71
Identity = 189/586 (32.25%), Postives = 282/586 (48.12%), Query Frame = 0
Query: 36 PLHFAPSSFSVCCSSR-------ESSTAVLSSDITLDGEKREASEEVLRVRRPV-----M 95
P+ A SS S SSR S T V + + A V R PV
Sbjct: 18 PVEQASSSSSSSSSSRRTWYAPARSQTGVQVAAYEPTAVLQLAPSAVSRRSTPVRSSIIA 77
Query: 96 EFTGEDSGDGEEAEDERSSSVIELGLAEIAKKMPIFEPENRVDSSALERPLIINLDLALY 155
+ + SGDGE + + S E A F ++V L IN+DL L+
Sbjct: 78 DLSSSGSGDGEGERGDATGSRDEASSA--------FAGSSKV--------LKINIDLLLW 137
Query: 156 KAK-----------MMARNFLYDEARQVLQKCIDKWPEDGRAYVALGKMLSKQMKAADAR 215
+ + + R LY A L++C+ P D RAYV LGK L +Q + +AR
Sbjct: 138 RCRTSRIRARQTLDLNERKSLYKAAEDGLRRCLALDPADPRAYVVLGKTLVQQKRYDEAR 197
Query: 216 NVYERGCQATQGENSYIWQCWAVLESRMGNIRKARELFDAATVANKKHIAAWHGWAVLEL 275
+Y+ GC T N YIW W LE+R GN+ +AR+L+DAA V + H AWH W +LE
Sbjct: 198 QLYQDGCANTGNVNPYIWSAWGWLEARTGNVERARKLYDAAVVVDGTHACAWHKWGMLEK 257
Query: 276 KQGNLKKARNLLAKGLKYCGGNERG-NEYIYQTLALLEAKSNRYEQARYLFKQATKC--N 335
QGN +AR+L +G++ C + N Y+Y L + A+ R +AR F++ T+
Sbjct: 258 GQGNFTRARDLWMQGIQRCRRKPQSQNAYLYNALGCMAAQLGRVGEARSWFEEGTRSAEG 317
Query: 336 PKSCASWLAWAQLEMQRENNLLARQLFEKAIQASPKNRFAWHIWGLFEANIGNIEKGRKL 395
S A W AWA LE ++ + + R LF KA+ A+P++R+ W L+E GN + L
Sbjct: 318 AASVALWQAWAVLEAKQGDPTVVRYLFRKALGANPRSRYVHLAWALWERRQGNPQHCLAL 377
Query: 396 LKIGHVLNPRDPVLLQSLGLLEYKNSSASMARVLFRRASELDPKHQPVWIAWGWMEWKEG 455
L+ G LNP DP L Q+ L+E + AR LF + DP +W A+G ME ++G
Sbjct: 378 LRRGCELNPTDPALYQAWALVEKQAGRIERARELFEQGLRADPSDLYMWQAYGVMEAEQG 437
Query: 456 NIGKARELYQRALLIDSASESAARCLQAWGVLEQRAGNLSAARRLYRSSLSINSQSYVTW 515
N+ +AR+L+Q + D S S AWG LE +AGN+ AR L+++++ ++ +S TW
Sbjct: 438 NMDRARQLFQEGVWADPRSPSTVYVFHAWGALEWQAGNVQTARELFKAAVRVDPKSETTW 497
Query: 516 MTWAALEDDQGNAIRAEEIRNLYFQQRTEVVDDASWVMGILDVIDPALDSIKRLLKLEQD 575
+W A+E + G R +E+R +++ E V A + PA + L +
Sbjct: 498 ASWIAMESELGEIERVDELRIRQAERQWEFVVPAGF------TTRPAPGLVDTLAR---- 557
Query: 576 PFAMSRVADGGARNSSLDDSGASSSVADSETGFDLDAFMMKKLSID 596
F +R SS + GA A SE + A L++D
Sbjct: 558 -FFSARGFGSDGNGSSSSNGGAGGQQAGSEAAAGIRAADSVDLTVD 576
BLAST of CmaCh06G015620 vs. ExPASy Swiss-Prot
Match:
Q9HF03 (Pre-mRNA-splicing factor CLF1 OS=Cryptococcus neoformans var. grubii serotype A (strain H99 / ATCC 208821 / CBS 10515 / FGSC 9487) OX=235443 GN=CLF1 PE=3 SV=1)
HSP 1 Score: 79.0 bits (193), Expect = 2.2e-13
Identity = 71/276 (25.72%), Postives = 124/276 (44.93%), Query Frame = 0
Query: 289 EAKSNRYEQARYLFKQATKCNPKSCASWLAWAQLEMQRENNLLARQLFEKAIQASPKNRF 348
EA N YE++R +F++A +P+S W+ + +E++ N AR LF++AI P+
Sbjct: 85 EASQNEYERSRSVFERALDVDPRSVDLWIKYTDMELKARNINHARNLFDRAITLLPRVDA 144
Query: 349 AWHIWGLFEANIGNIEKGRKLLKIGHVLNPRDPVLLQSLGLLEYKNSSASMARVLFRR-- 408
W+ + E + N+ R++ + P D QS LE + + A ++ R
Sbjct: 145 LWYKYVYLEELLLNVSGARQIFERWMQWEPNDKA-WQSYIKLEERYNELDRASAIYERWI 204
Query: 409 ASELDPKHQPVWIAWGWMEWKEGNIGKARELYQRALLI----DSASESAARCLQAWGVLE 468
A PK+ W+AW E G KARE++Q AL + E A A+ +E
Sbjct: 205 ACRPIPKN---WVAWAKFEEDRGQPDKAREVFQTALEFFGDEEEQVEKAQSVFAAFARME 264
Query: 469 QRAGNLSAARRLYRSSLS--INSQSYVTWMTWAALEDDQGNAIRAEEIRNLYFQQRTEVV 528
R AR +Y+ +L+ S+S + + E G+ RA + ++R +
Sbjct: 265 TRLKEFERARVIYKFALARLPRSKSASLYAQYTKFEKQHGD--RAGVELTVLGKRRIQYE 324
Query: 529 DDASWVMGILDVIDPA-LDSIKRLLKLEQDPFAMSR 556
++ ++ DP D+ L +LE+D + R
Sbjct: 325 EELAY--------DPTNYDAWFSLARLEEDAYRADR 346
BLAST of CmaCh06G015620 vs. ExPASy Swiss-Prot
Match:
P0CO11 (Pre-mRNA-splicing factor CLF1 OS=Cryptococcus neoformans var. neoformans serotype D (strain B-3501A) OX=283643 GN=CLF1 PE=3 SV=1)
HSP 1 Score: 78.2 bits (191), Expect = 3.8e-13
Identity = 83/342 (24.27%), Postives = 149/342 (43.57%), Query Frame = 0
Query: 223 IRKARELFDAATVANKKHIAAWHGWAVLELKQGNLKKARNLLAKGLKYCGGNERGNEYIY 282
+R+A+E + A A K+ + + EL + +K R ++Y R + +
Sbjct: 29 LREAQERQEPAIQAPKQRVQ-----DLEELSEFQARK-RTEFESRIRY----SRDSILAW 88
Query: 283 QTLALLEAKSNRYEQARYLFKQATKCNPKSCASWLAWAQLEMQRENNLLARQLFEKAIQA 342
A EA N YE++R +F++A +P+S W+ + +E++ N AR LF++AI
Sbjct: 89 TKYAQWEASQNEYERSRSVFERALDVDPRSVDLWIKYTDMELKARNINHARNLFDRAITL 148
Query: 343 SPKNRFAWHIWGLFEANIGNIEKGRKLLKIGHVLNPRDPVLLQSLGLLEYKNSSASMARV 402
P+ W+ + E + N+ R++ + P D QS LE + + A
Sbjct: 149 LPRVDALWYKYVYLEELLLNVSGARQIFERWMQWEPNDKA-WQSYIKLEERYNELDRASA 208
Query: 403 LFRR--ASELDPKHQPVWIAWGWMEWKEGNIGKARELYQRALLI----DSASESAARCLQ 462
++ R A PK+ W+ W E G KARE++Q AL + E A
Sbjct: 209 IYERWIACRPIPKN---WVTWAKFEEDRGQPDKAREVFQTALEFFGDEEEQVEKAQSVFA 268
Query: 463 AWGVLEQRAGNLSAARRLYRSSLS--INSQSYVTWMTWAALEDDQGNAIRAEEIRNLYFQ 522
A+ +E R AR +Y+ +L+ S+S + + E G+ RA + +
Sbjct: 269 AFARMETRLKEFERARVIYKFALARLPRSKSASLYAQYTKFEKQHGD--RAGVELTVLGK 328
Query: 523 QRTEVVDDASWVMGILDVIDPA-LDSIKRLLKLEQDPFAMSR 556
+R + ++ ++ DP D+ L +LE+D + R
Sbjct: 329 RRIQYEEELAY--------DPTNYDAWFSLARLEEDAYRADR 346
BLAST of CmaCh06G015620 vs. ExPASy Swiss-Prot
Match:
P0CO10 (Pre-mRNA-splicing factor CLF1 OS=Cryptococcus neoformans var. neoformans serotype D (strain JEC21 / ATCC MYA-565) OX=214684 GN=CLF1 PE=3 SV=1)
HSP 1 Score: 78.2 bits (191), Expect = 3.8e-13
Identity = 83/342 (24.27%), Postives = 149/342 (43.57%), Query Frame = 0
Query: 223 IRKARELFDAATVANKKHIAAWHGWAVLELKQGNLKKARNLLAKGLKYCGGNERGNEYIY 282
+R+A+E + A A K+ + + EL + +K R ++Y R + +
Sbjct: 29 LREAQERQEPAIQAPKQRVQ-----DLEELSEFQARK-RTEFESRIRY----SRDSILAW 88
Query: 283 QTLALLEAKSNRYEQARYLFKQATKCNPKSCASWLAWAQLEMQRENNLLARQLFEKAIQA 342
A EA N YE++R +F++A +P+S W+ + +E++ N AR LF++AI
Sbjct: 89 TKYAQWEASQNEYERSRSVFERALDVDPRSVDLWIKYTDMELKARNINHARNLFDRAITL 148
Query: 343 SPKNRFAWHIWGLFEANIGNIEKGRKLLKIGHVLNPRDPVLLQSLGLLEYKNSSASMARV 402
P+ W+ + E + N+ R++ + P D QS LE + + A
Sbjct: 149 LPRVDALWYKYVYLEELLLNVSGARQIFERWMQWEPNDKA-WQSYIKLEERYNELDRASA 208
Query: 403 LFRR--ASELDPKHQPVWIAWGWMEWKEGNIGKARELYQRALLI----DSASESAARCLQ 462
++ R A PK+ W+ W E G KARE++Q AL + E A
Sbjct: 209 IYERWIACRPIPKN---WVTWAKFEEDRGQPDKAREVFQTALEFFGDEEEQVEKAQSVFA 268
Query: 463 AWGVLEQRAGNLSAARRLYRSSLS--INSQSYVTWMTWAALEDDQGNAIRAEEIRNLYFQ 522
A+ +E R AR +Y+ +L+ S+S + + E G+ RA + +
Sbjct: 269 AFARMETRLKEFERARVIYKFALARLPRSKSASLYAQYTKFEKQHGD--RAGVELTVLGK 328
Query: 523 QRTEVVDDASWVMGILDVIDPA-LDSIKRLLKLEQDPFAMSR 556
+R + ++ ++ DP D+ L +LE+D + R
Sbjct: 329 RRIQYEEELAY--------DPTNYDAWFSLARLEEDAYRADR 346
BLAST of CmaCh06G015620 vs. TAIR 10
Match:
AT3G17040.2 (high chlorophyll fluorescent 107 )
HSP 1 Score: 756.1 bits (1951), Expect = 2.2e-218
Identity = 400/629 (63.59%), Postives = 482/629 (76.63%), Query Frame = 0
Query: 5 SSSPS---TASFNL-FSSSQNPNNLSKYRCKNLIIPLHFAPSSFSVCCSSRESSTAVLSS 64
SSSPS T+SF+L F + Q P NL R SS V +ES+
Sbjct: 11 SSSPSPANTSSFSLSFLTPQIPENLFVDR------------SSSGVFSPQKESANG---- 70
Query: 65 DITLDGEKREASEEVLRVRRPVMEFTGEDSGDGEEAEDERSSSVIELGLAEIAKKMPIFE 124
+GE+ E VL VRRP++E + ++S E E ++ + I+ GL+ IAKKMPIFE
Sbjct: 71 ----EGEESNTEEGVLVVRRPLLENSDKES---SEEEGKKYPARIDAGLSNIAKKMPIFE 130
Query: 125 PENRVDSS---------ALERPLIINLDLALYKAKMMARNFLYDEARQVLQKCIDKWPED 184
PE SS A ERPL +NLDL+LYKAK++ARNF Y +A ++L+KCI WPED
Sbjct: 131 PERSESSSSSSAAAAARAQERPLAVNLDLSLYKAKVLARNFRYKDAEKILEKCIAYWPED 190
Query: 185 GRAYVALGKMLSKQMKAADARNVYERGCQATQGENSYIWQCWAVLESRMGNIRKARELFD 244
GR YVALGK+LSKQ K A+AR +YE+GCQ+TQGENSYIWQCWAVLE+R+GN+R+ARELFD
Sbjct: 191 GRPYVALGKILSKQSKLAEARILYEKGCQSTQGENSYIWQCWAVLENRLGNVRRARELFD 250
Query: 245 AATVANKKHIAAWHGWAVLELKQGNLKKARNLLAKGLKYCGGNERGNEYIYQTLALLEAK 304
AATVA+KKH+AAWHGWA LE+KQGN+ KARNLLAKGLK+CG NEYIYQTLALLEAK
Sbjct: 251 AATVADKKHVAAWHGWANLEIKQGNISKARNLLAKGLKFCG----RNEYIYQTLALLEAK 310
Query: 305 SNRYEQARYLFKQATKCNPKSCASWLAWAQLEMQRENNLLARQLFEKAIQASPKNRFAWH 364
+ RYEQARYLFKQAT CN +SCASWLAWAQLE+Q+E AR+LFEKA+QASPKNRFAWH
Sbjct: 311 AGRYEQARYLFKQATICNSRSCASWLAWAQLEIQQERYPAARKLFEKAVQASPKNRFAWH 370
Query: 365 IWGLFEANIGNIEKGRKLLKIGHVLNPRDPVLLQSLGLLEYKNSSASMARVLFRRASELD 424
+WG+FEA +GN+E+GRKLLKIGH LNPRDPVLLQSLGLLEYK+SSA++AR L RRASELD
Sbjct: 371 VWGVFEAGVGNVERGRKLLKIGHALNPRDPVLLQSLGLLEYKHSSANLARALLRRASELD 430
Query: 425 PKHQPVWIAWGWMEWKEGNIGKARELYQRALLIDSASESAARCLQAWGVLEQRAGNLSAA 484
P+HQPVWIAWGWMEWKEGN ARELYQRAL ID+ +ESA+RCLQAWGVLEQRAGNLSAA
Sbjct: 431 PRHQPVWIAWGWMEWKEGNTTTARELYQRALSIDANTESASRCLQAWGVLEQRAGNLSAA 490
Query: 485 RRLYRSSLSINSQSYVTWMTWAALEDDQGNAIRAEEIRNLYFQQRTEVVDDASWVMGILD 544
RRL+RSSL+INSQSYVTWMTWA LE+DQG+ RAEEIRNLYFQQRTEVVDDASWV G LD
Sbjct: 491 RRLFRSSLNINSQSYVTWMTWAQLEEDQGDTERAEEIRNLYFQQRTEVVDDASWVTGFLD 550
Query: 545 VIDPALDSIKRLLKLEQDP---------FAMSRVADGGARNSSLDDSGASSSVADSETGF 604
+IDPALD++KRLL Q+ M+R D ++++ +S A ++ +GF
Sbjct: 551 IIDPALDTVKRLLNFGQNNDNNRLTTTLRNMNRTKD--SQSNQQPESSAGREDIETGSGF 610
Query: 605 DLDAFMMKKLSIDTSKLEIQLETTRPKRF 612
+LD F+ KLS+D KL++ L++ R +RF
Sbjct: 611 NLDVFLRSKLSLDPLKLDVNLDSKRLERF 610
BLAST of CmaCh06G015620 vs. TAIR 10
Match:
AT3G17040.1 (high chlorophyll fluorescent 107 )
HSP 1 Score: 755.7 bits (1950), Expect = 2.9e-218
Identity = 402/643 (62.52%), Postives = 488/643 (75.89%), Query Frame = 0
Query: 5 SSSPS---TASFNL-FSSSQNPNNLSKYRCK------------NLIIPLHFAPSSFSVCC 64
SSSPS T+SF+L F + Q P NL K K L P + +++
Sbjct: 11 SSSPSPANTSSFSLSFLTPQIPENLCKSPTKIHIGTHGISGQSFLSHPTFSSKNTYLYAV 70
Query: 65 SSRESSTAVLSSDITLDGEKREASEE--VLRVRRPVMEFTGEDSGDGEEAEDERSSSVIE 124
R SS + +GE E++ E VL VRRP++E + ++S E E ++ + I+
Sbjct: 71 VDRSSSGVFSPQKESANGEGEESNTEEGVLVVRRPLLENSDKES---SEEEGKKYPARID 130
Query: 125 LGLAEIAKKMPIFEPENRVDSS---------ALERPLIINLDLALYKAKMMARNFLYDEA 184
GL+ IAKKMPIFEPE SS A ERPL +NLDL+LYKAK++ARNF Y +A
Sbjct: 131 AGLSNIAKKMPIFEPERSESSSSSSAAAAARAQERPLAVNLDLSLYKAKVLARNFRYKDA 190
Query: 185 RQVLQKCIDKWPEDGRAYVALGKMLSKQMKAADARNVYERGCQATQGENSYIWQCWAVLE 244
++L+KCI WPEDGR YVALGK+LSKQ K A+AR +YE+GCQ+TQGENSYIWQCWAVLE
Sbjct: 191 EKILEKCIAYWPEDGRPYVALGKILSKQSKLAEARILYEKGCQSTQGENSYIWQCWAVLE 250
Query: 245 SRMGNIRKARELFDAATVANKKHIAAWHGWAVLELKQGNLKKARNLLAKGLKYCGGNERG 304
+R+GN+R+ARELFDAATVA+KKH+AAWHGWA LE+KQGN+ KARNLLAKGLK+CG
Sbjct: 251 NRLGNVRRARELFDAATVADKKHVAAWHGWANLEIKQGNISKARNLLAKGLKFCG----R 310
Query: 305 NEYIYQTLALLEAKSNRYEQARYLFKQATKCNPKSCASWLAWAQLEMQRENNLLARQLFE 364
NEYIYQTLALLEAK+ RYEQARYLFKQAT CN +SCASWLAWAQLE+Q+E AR+LFE
Sbjct: 311 NEYIYQTLALLEAKAGRYEQARYLFKQATICNSRSCASWLAWAQLEIQQERYPAARKLFE 370
Query: 365 KAIQASPKNRFAWHIWGLFEANIGNIEKGRKLLKIGHVLNPRDPVLLQSLGLLEYKNSSA 424
KA+QASPKNRFAWH+WG+FEA +GN+E+GRKLLKIGH LNPRDPVLLQSLGLLEYK+SSA
Sbjct: 371 KAVQASPKNRFAWHVWGVFEAGVGNVERGRKLLKIGHALNPRDPVLLQSLGLLEYKHSSA 430
Query: 425 SMARVLFRRASELDPKHQPVWIAWGWMEWKEGNIGKARELYQRALLIDSASESAARCLQA 484
++AR L RRASELDP+HQPVWIAWGWMEWKEGN ARELYQRAL ID+ +ESA+RCLQA
Sbjct: 431 NLARALLRRASELDPRHQPVWIAWGWMEWKEGNTTTARELYQRALSIDANTESASRCLQA 490
Query: 485 WGVLEQRAGNLSAARRLYRSSLSINSQSYVTWMTWAALEDDQGNAIRAEEIRNLYFQQRT 544
WGVLEQRAGNLSAARRL+RSSL+INSQSYVTWMTWA LE+DQG+ RAEEIRNLYFQQRT
Sbjct: 491 WGVLEQRAGNLSAARRLFRSSLNINSQSYVTWMTWAQLEEDQGDTERAEEIRNLYFQQRT 550
Query: 545 EVVDDASWVMGILDVIDPALDSIKRLLKLEQDP---------FAMSRVADGGARNSSLDD 604
EVVDDASWV G LD+IDPALD++KRLL Q+ M+R D ++++ +
Sbjct: 551 EVVDDASWVTGFLDIIDPALDTVKRLLNFGQNNDNNRLTTTLRNMNRTKD--SQSNQQPE 610
Query: 605 SGASSSVADSETGFDLDAFMMKKLSIDTSKLEIQLETTRPKRF 612
S A ++ +GF+LD F+ KLS+D KL++ L++ R +RF
Sbjct: 611 SSAGREDIETGSGFNLDVFLRSKLSLDPLKLDVNLDSKRLERF 644
BLAST of CmaCh06G015620 vs. TAIR 10
Match:
AT3G51110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 69.3 bits (168), Expect = 1.3e-11
Identity = 72/299 (24.08%), Postives = 137/299 (45.82%), Query Frame = 0
Query: 223 IRKARELFDAATVANKKHIAAWHGWAVLELKQGNLKKARNLLAKGLKYCGGNERGNEYIY 282
+R+ +E D A K + W +A E Q + +AR++ + L+ N ++
Sbjct: 54 LRRRKEFEDQIRGA-KTNSQVWVRYADWEESQKDHDRARSVWERALE---DESYRNHTLW 113
Query: 283 QTLALLEAKSNRYEQARYLFKQATKCNPKSCASWLAWAQLEMQRENNLLARQLFEKAIQA 342
A E ++ AR ++ +A K P+ W + +E N AR++FE+ +
Sbjct: 114 LKYAEFEMRNKSVNHARNVWDRAVKILPRVDQFWYKYIHMEEILGNIDGARKIFERWMDW 173
Query: 343 SPKNRFAWHIWGLFEANIGNIEKGRKLLKIGHVLNPRDPVLLQSLGLLEYKNSSASMARV 402
SP + AW + FE IE+ R + + + +P+ ++ E KNS S+AR+
Sbjct: 174 SPDQQ-AWLCFIKFELRYNEIERSRSIYERFVLCHPKASSFIR-YAKFEMKNSQVSLARI 233
Query: 403 LFRRASEL----DPKHQPVWIAWGWMEWKEGNIGKARELYQRALLIDSASESAARCL-QA 462
++ RA E+ + + + +++A+ E + +AR LY+ AL D + A L +
Sbjct: 234 VYERAIEMLKDVEEEAEMIFVAFAEFEELCKEVERARFLYKYAL--DHIPKGRAEDLYKK 293
Query: 463 WGVLEQRAGN-------LSAARRL-YRSSLSINSQSYVTWMTWAALEDDQGNAIRAEEI 509
+ E++ GN + R+L Y + N +Y +W + +LE+ G+ R E+
Sbjct: 294 FVAFEKQYGNKEGIDDAIVGRRKLQYEGEVRKNPLNYDSWFDYISLEETLGDKDRIREV 344
BLAST of CmaCh06G015620 vs. TAIR 10
Match:
AT5G41770.1 (crooked neck protein, putative / cell cycle protein, putative )
HSP 1 Score: 60.8 bits (146), Expect = 4.5e-09
Identity = 49/204 (24.02%), Postives = 94/204 (46.08%), Query Frame = 0
Query: 276 RGNEYIYQTLALLEAKSNRYEQARYLFKQATKCNPKSCASWLAWAQLEMQRENNLLARQL 335
R N ++ A E Y +AR ++++A + + ++ WL +A+ EM+ + AR +
Sbjct: 89 RWNIQVWVKYAQWEESQKDYARARSVWERAIEGDYRNHTLWLKYAEFEMKNKFVNSARNV 148
Query: 336 FEKAIQASPKNRFAWHIWGLFEANIGNIEKGRKLLKIGHVLNPRDPVLLQSLGLLEYKNS 395
+++A+ P+ W+ + E +GNI R++ + +P L S E + +
Sbjct: 149 WDRAVTLLPRVDQLWYKYIHMEEILGNIAGARQIFERWMDWSPDQQGWL-SFIKFELRYN 208
Query: 396 SASMARVLFRRASELDPKHQPVWIAWGWMEWKEGNIGKARELYQRALLIDSASESAARCL 455
AR ++ R PK +I + E K G + + R +Y+RA + E A
Sbjct: 209 EIERARTIYERFVLCHPK-VSAYIRYAKFEMKGGEVARCRSVYERATEKLADDEEAEILF 268
Query: 456 QAWGVLEQRAGNLSAARRLYRSSL 480
A+ E+R + AR +Y+ +L
Sbjct: 269 VAFAEFEERCKEVERARFIYKFAL 290
BLAST of CmaCh06G015620 vs. TAIR 10
Match:
AT5G45990.1 (crooked neck protein, putative / cell cycle protein, putative )
HSP 1 Score: 51.6 bits (122), Expect = 2.7e-06
Identity = 41/185 (22.16%), Postives = 82/185 (44.32%), Query Frame = 0
Query: 316 WLAWAQLEMQRENNLLARQLFEKAIQASPKNRFAWHIWGLFEANIGNIEKGRKLLKIGHV 375
W+ +A+ E + + AR ++E+A++ +N W + FE + R +
Sbjct: 81 WVKYAKWEESQMDYARARSVWERALEGEYRNHTLWVKYAEFEMKNKFVNNARNVWDRSVT 140
Query: 376 LNPRDPVLLQSLGLLEYKNSSASMARVLFRRASELDPKHQPVWIAWGWMEWKEGNIGKAR 435
L PR L + +E K + + AR +F R P Q W+ + E + I +AR
Sbjct: 141 LLPRVDQLWEKYIYMEEKLGNVTGARQIFERWMNWSP-DQKAWLCFIKFELRYNEIERAR 200
Query: 436 ELYQRALLIDSASESAARCLQAWGVLEQRAGNLSAARRLYR---SSLSINSQSYVTWMTW 495
+Y+R +L + R + +R G + AR +Y L+ + ++ + ++++
Sbjct: 201 SIYERFVLCHPKVSAFIRYAK---FEMKRGGQVKLAREVYERAVDKLANDEEAEILFVSF 260
Query: 496 AALED 498
A E+
Sbjct: 261 AEFEE 261
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q8RWG2 | 4.1e-217 | 62.52 | Protein high chlorophyll fluorescent 107 OS=Arabidopsis thaliana OX=3702 GN=HCF1... | [more] |
Q9FNS4 | 5.2e-71 | 32.25 | PsbB mRNA maturation factor Mbb1, chloroplastic OS=Chlamydomonas reinhardtii OX=... | [more] |
Q9HF03 | 2.2e-13 | 25.72 | Pre-mRNA-splicing factor CLF1 OS=Cryptococcus neoformans var. grubii serotype A ... | [more] |
P0CO11 | 3.8e-13 | 24.27 | Pre-mRNA-splicing factor CLF1 OS=Cryptococcus neoformans var. neoformans serotyp... | [more] |
P0CO10 | 3.8e-13 | 24.27 | Pre-mRNA-splicing factor CLF1 OS=Cryptococcus neoformans var. neoformans serotyp... | [more] |
Match Name | E-value | Identity | Description | |
AT3G17040.2 | 2.2e-218 | 63.59 | high chlorophyll fluorescent 107 | [more] |
AT3G17040.1 | 2.9e-218 | 62.52 | high chlorophyll fluorescent 107 | [more] |
AT3G51110.1 | 1.3e-11 | 24.08 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
AT5G41770.1 | 4.5e-09 | 24.02 | crooked neck protein, putative / cell cycle protein, putative | [more] |
AT5G45990.1 | 2.7e-06 | 22.16 | crooked neck protein, putative / cell cycle protein, putative | [more] |