Csa1G524740 (gene) Cucumber (Chinese Long) v2

Name: Csa1G524740
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Pentatricopeptide repeat-containing protein; contains IPR002885 (Pentatricopeptide repeat), IPR011990 (Tetratricopeptide-like helical)
Location: Chr1 : 18307153 .. 18309552 (+)
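The location string above can be turned into structured coordinates and a feature length. The sketch below is illustrative only: the function name `parse_location` and the returned dictionary keys are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

# Pattern for strings of the form "Chr1 : 18307153 .. 18309552 (+)".
LOCATION = re.compile(r"^(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into sequence id, start, end, strand and length.

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    match = LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Chr1 : 18307153 .. 18309552 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # Chr1 + 2400
```

Under the inclusive-coordinate assumption the gene spans 2400 bp on the plus strand of Chr1.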
The following sequences are available for this feature:

Gene sequence (with intron)

GAAATTGAAATCTTGAGATTTTGATTTTTAATAAGGTCAGAGTGTGGAGAGAGCGTCATCTCAATCACTCTCAGATATTTCACTGATTCATCCTTCAAAAATCACACAGCTTCTACTCAAATTCGCTCATGGGATGATTTGAACATAATTCACTTGAATCCTTCTGTCATGTGGGCGCTTCGTACTCCCTATTCTACCCATTACCCACCTTCCTCTCCCCGCTATTCCACTTCAAAACTCTCCGTTTCCTCCTTCTCCTTCAATCCTTCAACCCCCCCAAATTCAAATAATAATCACTTGATTCAATCTCTGTGTAAACAGGGCAATCTCAAACAAGCCCTTTACCTCCTCTCCCATGAATCCAATCCTACCCAGCAAACCTGCGAGCTTCTAATCCTCTCCGCCGCTCGCCGGAACTCTCTTTCCGATGCCCTTGACGTCCATCAGCTTCTCGTCGATGGGGGTTTTGACCAAGACCCTTTTTTGGCCACCAAGCTTATCAATATGTTTTCCGAATTGGACACTGTCGACAATGCGCGCAAGGTGTTTGACAAAACGCGTAAGAGAACTATATATGTTTGGAATGCGTTGTTTAGAGCTCTTGCGTTGGCGGGTCGTGGAAACGACGTATTGGAATTGTATCCCCGGATGAATATGATGGGAGTTTCTTCTGATAGGTTTACTTATACTTATTTGCTCAAAGCTTGTGTTGCTTCAGAGTGTTTGGTTTCGTTTCTCCAGAAGGGTAAAGAGATTCATGCGCATATTTTGAGACATGGGTATGGAGCTCATGTTCATGTTATGACTACTCTGATGGATATGTACGCAAGGTTTGGGTGTGTTTCTTATGCCAGTGCAGTGTTTGATGAAATGCCTGTGAAAAATGTGGTTTCTTGGAGTGCTATGATTGCATGCTATGCAAAGAATGGGAAGCCATACGAAGCTTTGGAACTCTTTCGTGAGATGATGCTCAATACCCACGATTCAGTGCCGAATTCTGTGACGATGGTCAGTGTACTCCAAGCTTGTGCTGCTTTTGCTGCTTTGGAGCAAGGGAAGCTTATCCACGCTTACATTCTTAGGAGGGGTCTGGATTCAATCTTGCCAGTTATAAGTGCTCTTATAACCATGTATGCAAGATGTGGTAAGCTTGAGTCAGGCCAACTAATTTTTGACCGTATGCATAAGAAAGATGTTGTCTTATGGAATTCATTGATTTCGAGTTATGGACTGCATGGATATGGAAGAAAAGCAATCAAAATTTTTGAAGAGATGATTGACCATGGATTCTCACCTAGTCACATATCATTTATAAGTGTTTTGGGTGCTTGCAGCCATACTGGGCTTGTTGAAGAGGGGAAGAAGTTGTTTGAATCCATGGTAAAAGAACATGGTATACAGCCTAGTGTAGAGCACTATGCTTGTATGGTTGATCTTCTTGGCCGTGCCAACCGGTTGGATGAAGCAGCCAAGATTATAGAAGATCTGCGTATCGAACCAGGGCCCAAAGTATGGGGTTCTCTTCTTGGTGCCTGTAGGATTCATTGCCATGTTGAGCTTGCTGAACGAGCAAGCAAACGGCTTTTCAAGCTTGAGCCTACAAATGCTGGGAATTATGTACTTCTGGCTGATATTTATGCAGAAGCTGAAATGTGGGATGAGGTAAAGAGAGTGAAAAAACTTCTTGATTCTCGTGAATTACAAAAGGTCCCTGGTAGAAGTTGGATTGAAGTACGAAGGAAAATCTATTCATTTACATCTGTTGATGAGTTCAACCCACAGGGAGAGCAACTTCATGCCCTGTTAGTGAATTTGTCAAATGAAATGAAGCAAAGAGGATATACCCCACAAACAAAACTGGTTCTGTATGACCTTGATCAGGAAGAAAAGGAAAGGATTGTGTTGGGTCATAGCGAAAAACTCGCAGTTGCTTTTGGACTAATCAATACAAGCAAGGGGGACACCATAAGAATAACTAAGAACTTGAGGCTATGTGAAGA
CTGCCATTCCGTCACAAAATTCATTTCCAAGTTTGCCGATCGAGAGATTATGGTTCGAGATTTGAATCGTTTCCACCATTTCAAAGATGGAGTTTGCTCTTGTGGAGATTATTGGTAGTATCTATACAAATCCTTATCTTGTTCCTTTTATTTTTCTAATACCTCTATGATAATAACAAGTGATCCAACGATTCTATTGGGCAGCTCAGCTACATAGTTGAACACTTCCACACACCTTGAAACTTGTTTTCTTTAGGCCTATAATTTTATTCAAATGTCTTCTAGAGAAAATGAGAAAGAAGAGGCGTCTATTGACGTCAAATTGAAAATTCCCTTCTGCTCTTATATGGGGGAAGATCAAAGCTAAGCTCCATTAAGGTATGACTTATTTAAGGTGTCC

mRNA sequence

ATGTGGGCGCTTCGTACTCCCTATTCTACCCATTACCCACCTTCCTCTCCCCGCTATTCCACTTCAAAACTCTCCGTTTCCTCCTTCTCCTTCAATCCTTCAACCCCCCCAAATTCAAATAATAATCACTTGATTCAATCTCTGTGTAAACAGGGCAATCTCAAACAAGCCCTTTACCTCCTCTCCCATGAATCCAATCCTACCCAGCAAACCTGCGAGCTTCTAATCCTCTCCGCCGCTCGCCGGAACTCTCTTTCCGATGCCCTTGACGTCCATCAGCTTCTCGTCGATGGGGGTTTTGACCAAGACCCTTTTTTGGCCACCAAGCTTATCAATATGTTTTCCGAATTGGACACTGTCGACAATGCGCGCAAGGTGTTTGACAAAACGCGTAAGAGAACTATATATGTTTGGAATGCGTTGTTTAGAGCTCTTGCGTTGGCGGGTCGTGGAAACGACGTATTGGAATTGTATCCCCGGATGAATATGATGGGAGTTTCTTCTGATAGGTTTACTTATACTTATTTGCTCAAAGCTTGTGTTGCTTCAGAGTGTTTGGTTTCGTTTCTCCAGAAGGGTAAAGAGATTCATGCGCATATTTTGAGACATGGGTATGGAGCTCATGTTCATGTTATGACTACTCTGATGGATATGTACGCAAGGTTTGGGTGTGTTTCTTATGCCAGTGCAGTGTTTGATGAAATGCCTGTGAAAAATGTGGTTTCTTGGAGTGCTATGATTGCATGCTATGCAAAGAATGGGAAGCCATACGAAGCTTTGGAACTCTTTCGTGAGATGATGCTCAATACCCACGATTCAGTGCCGAATTCTGTGACGATGGTCAGTGTACTCCAAGCTTGTGCTGCTTTTGCTGCTTTGGAGCAAGGGAAGCTTATCCACGCTTACATTCTTAGGAGGGGTCTGGATTCAATCTTGCCAGTTATAAGTGCTCTTATAACCATGTATGCAAGATGTGGTAAGCTTGAGTCAGGCCAACTAATTTTTGACCGTATGCATAAGAAAGATGTTGTCTTATGGAATTCATTGATTTCGAGTTATGGACTGCATGGATATGGAAGAAAAGCAATCAAAATTTTTGAAGAGATGATTGACCATGGATTCTCACCTAGTCACATATCATTTATAAGTGTTTTGGGTGCTTGCAGCCATACTGGGCTTGTTGAAGAGGGGAAGAAGTTGTTTGAATCCATGGTAAAAGAACATGGTATACAGCCTAGTGTAGAGCACTATGCTTGTATGGTTGATCTTCTTGGCCGTGCCAACCGGTTGGATGAAGCAGCCAAGATTATAGAAGATCTGCGTATCGAACCAGGGCCCAAAGTATGGGGTTCTCTTCTTGGTGCCTGTAGGATTCATTGCCATGTTGAGCTTGCTGAACGAGCAAGCAAACGGCTTTTCAAGCTTGAGCCTACAAATGCTGGGAATTATGTACTTCTGGCTGATATTTATGCAGAAGCTGAAATGTGGGATGAGGTAAAGAGAGTGAAAAAACTTCTTGATTCTCGTGAATTACAAAAGGTCCCTGGTAGAAGTTGGATTGAAGTACGAAGGAAAATCTATTCATTTACATCTGTTGATGAGTTCAACCCACAGGGAGAGCAACTTCATGCCCTGTTAGTGAATTTGTCAAATGAAATGAAGCAAAGAGGATATACCCCACAAACAAAACTGGTTCTGTATGACCTTGATCAGGAAGAAAAGGAAAGGATTGTGTTGGGTCATAGCGAAAAACTCGCAGTTGCTTTTGGACTAATCAATACAAGCAAGGGGGACACCATAAGAATAACTAAGAACTTGAGGCTATGTGAAGACTGCCATTCCGTCACAAAATTCATTTCCAAGTTTGCCGATCGAGAGATTATGGTTCGAGATTTGAATCGTTTCCACCATTTCAAAGATGGAGTTTGCTCTTGTGGAGATTATTGGTAG

Coding sequence (CDS)

ATGTGGGCGCTTCGTACTCCCTATTCTACCCATTACCCACCTTCCTCTCCCCGCTATTCCACTTCAAAACTCTCCGTTTCCTCCTTCTCCTTCAATCCTTCAACCCCCCCAAATTCAAATAATAATCACTTGATTCAATCTCTGTGTAAACAGGGCAATCTCAAACAAGCCCTTTACCTCCTCTCCCATGAATCCAATCCTACCCAGCAAACCTGCGAGCTTCTAATCCTCTCCGCCGCTCGCCGGAACTCTCTTTCCGATGCCCTTGACGTCCATCAGCTTCTCGTCGATGGGGGTTTTGACCAAGACCCTTTTTTGGCCACCAAGCTTATCAATATGTTTTCCGAATTGGACACTGTCGACAATGCGCGCAAGGTGTTTGACAAAACGCGTAAGAGAACTATATATGTTTGGAATGCGTTGTTTAGAGCTCTTGCGTTGGCGGGTCGTGGAAACGACGTATTGGAATTGTATCCCCGGATGAATATGATGGGAGTTTCTTCTGATAGGTTTACTTATACTTATTTGCTCAAAGCTTGTGTTGCTTCAGAGTGTTTGGTTTCGTTTCTCCAGAAGGGTAAAGAGATTCATGCGCATATTTTGAGACATGGGTATGGAGCTCATGTTCATGTTATGACTACTCTGATGGATATGTACGCAAGGTTTGGGTGTGTTTCTTATGCCAGTGCAGTGTTTGATGAAATGCCTGTGAAAAATGTGGTTTCTTGGAGTGCTATGATTGCATGCTATGCAAAGAATGGGAAGCCATACGAAGCTTTGGAACTCTTTCGTGAGATGATGCTCAATACCCACGATTCAGTGCCGAATTCTGTGACGATGGTCAGTGTACTCCAAGCTTGTGCTGCTTTTGCTGCTTTGGAGCAAGGGAAGCTTATCCACGCTTACATTCTTAGGAGGGGTCTGGATTCAATCTTGCCAGTTATAAGTGCTCTTATAACCATGTATGCAAGATGTGGTAAGCTTGAGTCAGGCCAACTAATTTTTGACCGTATGCATAAGAAAGATGTTGTCTTATGGAATTCATTGATTTCGAGTTATGGACTGCATGGATATGGAAGAAAAGCAATCAAAATTTTTGAAGAGATGATTGACCATGGATTCTCACCTAGTCACATATCATTTATAAGTGTTTTGGGTGCTTGCAGCCATACTGGGCTTGTTGAAGAGGGGAAGAAGTTGTTTGAATCCATGGTAAAAGAACATGGTATACAGCCTAGTGTAGAGCACTATGCTTGTATGGTTGATCTTCTTGGCCGTGCCAACCGGTTGGATGAAGCAGCCAAGATTATAGAAGATCTGCGTATCGAACCAGGGCCCAAAGTATGGGGTTCTCTTCTTGGTGCCTGTAGGATTCATTGCCATGTTGAGCTTGCTGAACGAGCAAGCAAACGGCTTTTCAAGCTTGAGCCTACAAATGCTGGGAATTATGTACTTCTGGCTGATATTTATGCAGAAGCTGAAATGTGGGATGAGGTAAAGAGAGTGAAAAAACTTCTTGATTCTCGTGAATTACAAAAGGTCCCTGGTAGAAGTTGGATTGAAGTACGAAGGAAAATCTATTCATTTACATCTGTTGATGAGTTCAACCCACAGGGAGAGCAACTTCATGCCCTGTTAGTGAATTTGTCAAATGAAATGAAGCAAAGAGGATATACCCCACAAACAAAACTGGTTCTGTATGACCTTGATCAGGAAGAAAAGGAAAGGATTGTGTTGGGTCATAGCGAAAAACTCGCAGTTGCTTTTGGACTAATCAATACAAGCAAGGGGGACACCATAAGAATAACTAAGAACTTGAGGCTATGTGAAGACTGCCATTCCGTCACAAAATTCATTTCCAAGTTTGCCGATCGAGAGATTATGGTTCGAGATTTGAATCGTTTCCACCATTTCAAAGATGGAGTTTGCTCTTGTGGAGATTATTGGTAG

Protein sequence

MWALRTPYSTHYPPSSPRYSTSKLSVSSFSFNPSTPPNSNNNHLIQSLCKQGNLKQALYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW*
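The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS with the standard genetic code. This is a minimal sketch, assuming the standard (NCBI table 1) code applies to this nuclear gene; the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First six codons and last four codons of the CDS shown above.
print(translate("ATGTGGGCGCTTCGTACT"))  # MWALRT
print(translate("GATTATTGGTAG"))        # DYW*
```

The two fragments reproduce the start (MWALRT) and end (DYW*) of the listed protein; the trailing `*` corresponds to the TAG stop codon.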
BLAST of Csa1G524740 vs. Swiss-Prot
Match: PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana GN=CRR2 PE=2 SV=1)

HSP 1 Score: 965.3 bits (2494), Expect = 3.3e-280
Identity = 472/645 (73.18%), Positives = 550/645 (85.27%), Query Frame = 1

Query: 6   TPYSTHYPPSSPRYSTS-KLSVSSFSFNPSTPPNSNNNHLIQSLCKQGNLKQALYLLSHE 65
           T ++ ++ P SP    S  +++++ S +       +NN LIQSLCK+G LKQA+ +LS E
Sbjct: 13  TYHTVNFLPRSPLKPPSCSVALNNPSISSGAGAKISNNQLIQSLCKEGKLKQAIRVLSQE 72

Query: 66  SNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTVDNAR 125
           S+P+QQT ELLIL    R+SLSDAL VH+ ++D G DQDPFLATKLI M+S+L +VD AR
Sbjct: 73  SSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYAR 132

Query: 126 KVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKACVASE 185
           KVFDKTRKRTIYVWNALFRAL LAG G +VL LY +MN +GV SDRFTYTY+LKACVASE
Sbjct: 133 KVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASE 192

Query: 186 CLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWS 245
           C V+ L KGKEIHAH+ R GY +HV++MTTL+DMYARFGCV YAS VF  MPV+NVVSWS
Sbjct: 193 CTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWS 252

Query: 246 AMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYIL 305
           AMIACYAKNGK +EAL  FREMM  T DS PNSVTMVSVLQACA+ AALEQGKLIH YIL
Sbjct: 253 AMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYIL 312

Query: 306 RRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIK 365
           RRGLDSILPVISAL+TMY RCGKLE GQ +FDRMH +DVV WNSLISSYG+HGYG+KAI+
Sbjct: 313 RRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQ 372

Query: 366 IFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLL 425
           IFEEM+ +G SP+ ++F+SVLGACSH GLVEEGK+LFE+M ++HGI+P +EHYACMVDLL
Sbjct: 373 IFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLL 432

Query: 426 GRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEPTNAGNYV 485
           GRANRLDEAAK+++D+R EPGPKVWGSLLG+CRIH +VELAERAS+RLF LEP NAGNYV
Sbjct: 433 GRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYV 492

Query: 486 LLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGEQLHA 545
           LLADIYAEA+MWDEVKRVKKLL+ R LQK+PGR W+EVRRK+YSF SVDEFNP  EQ+HA
Sbjct: 493 LLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHA 552

Query: 546 LLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKGDTIRITK 605
            LV L+ +MK++GY PQTK VLY+L+ EEKERIVLGHSEKLA+AFGLINTSKG+ IRITK
Sbjct: 553 FLVKLAEDMKEKGYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITK 612

Query: 606 NLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
           NLRLCEDCH  TKFISKF ++EI+VRD+NRFH FK+GVCSCGDYW
Sbjct: 613 NLRLCEDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDYW 657
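The percentages in an HSP summary line are the identical and conservatively substituted positions divided by the alignment length. The sketch below recomputes them from the summary line of the alignment above. The function name `hsp_percentages` is hypothetical, and the pattern deliberately accepts either spelling of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 472/645 (73.18%), Positives = 550/645 (85.27%)".
# "Posi?tives" tolerates the misspelling "Postives" seen in some reports.
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(stats_line):
    """Return (identity %, positives %) rounded to two decimals."""
    match = HSP_STATS.search(stats_line)
    if match is None:
        raise ValueError("no Identity/Positives fields found")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


line = "Identity = 472/645 (73.18%), Positives = 550/645 (85.27%), Query Frame = 1"
print(hsp_percentages(line))  # (73.18, 85.27)
```

The recomputed values agree with the percentages printed in the report for the CRR2 (PP265_ARATH) hit.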

BLAST of Csa1G524740 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 472.6 bits (1215), Expect = 6.8e-132
Identity = 235/559 (42.04%), Positives = 348/559 (62.25%), Query Frame = 1

Query: 91  VHQLLVDGGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGR 150
           VH + V   F ++      L++M+S+   +D+A+ VF +   R++  + ++    A  G 
Sbjct: 318 VHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGL 377

Query: 151 GNDVLELYPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVH 210
             + ++L+  M   G+S D +T T +L  C         L +GK +H  I  +  G  + 
Sbjct: 378 AGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYR----LLDEGKRVHEWIKENDLGFDIF 437

Query: 211 VMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNT 270
           V   LMDMYA+ G +  A  VF EM VK+++SW+ +I  Y+KN    EAL LF  ++L  
Sbjct: 438 VSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFN-LLLEE 497

Query: 271 HDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLES 330
               P+  T+  VL ACA+ +A ++G+ IH YI+R G  S   V ++L+ MYA+CG L  
Sbjct: 498 KRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLL 557

Query: 331 GQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSH 390
             ++FD +  KD+V W  +I+ YG+HG+G++AI +F +M   G     ISF+S+L ACSH
Sbjct: 558 AHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSH 617

Query: 391 TGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWG 450
           +GLV+EG + F  M  E  I+P+VEHYAC+VD+L R   L +A + IE++ I P   +WG
Sbjct: 618 SGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWG 677

Query: 451 SLLGACRIHCHVELAERASKRLFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRE 510
           +LL  CRIH  V+LAE+ ++++F+LEP N G YVL+A+IYAEAE W++VKR++K +  R 
Sbjct: 678 ALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRG 737

Query: 511 LQKVPGRSWIEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLD 570
           L+K PG SWIE++ ++  F + D  NP+ E + A L  +   M + GY+P TK  L D +
Sbjct: 738 LRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAE 797

Query: 571 QEEKERIVLGHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVR 630
           + EKE  + GHSEKLA+A G+I++  G  IR+TKNLR+C DCH + KF+SK   REI++R
Sbjct: 798 EMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLR 857

Query: 631 DLNRFHHFKDGVCSCGDYW 650
           D NRFH FKDG CSC  +W
Sbjct: 858 DSNRFHQFKDGHCSCRGFW 871


HSP 2 Score: 184.9 bits (468), Expect = 2.8e-45
Identity = 116/427 (27.17%), Positives = 214/427 (50.12%), Query Frame = 1

Query: 42  NHLIQSLCKQGNLKQALYLLSHESN---PTQQTCELLILSAARRNSLSDALDVHQLLVDG 101
           N  ++  C+ GNL+ A+ LL          +  C +L L A  + SL D  +V   +   
Sbjct: 65  NTQLRRFCESGNLENAVKLLCVSGKWDIDPRTLCSVLQLCADSK-SLKDGKEVDNFIRGN 124

Query: 102 GFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELY 161
           GF  D  L +KL  M++    +  A +VFD+ +      WN L   LA +G  +  + L+
Sbjct: 125 GFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGLF 184

Query: 162 PRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDM 221
            +M   GV  D +T++ + K+  +    +  +  G+++H  IL+ G+G    V  +L+  
Sbjct: 185 KKMMSSGVEMDSYTFSCVSKSFSS----LRSVHGGEQLHGFILKSGFGERNSVGNSLVAF 244

Query: 222 YARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSV 281
           Y +   V  A  VFDEM  ++V+SW+++I  Y  NG   + L +F +M+++  +   +  
Sbjct: 245 YLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEI--DLA 304

Query: 282 TMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRM 341
           T+VSV   CA    +  G+ +H+  ++          + L+ MY++CG L+S + +F  M
Sbjct: 305 TIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREM 364

Query: 342 HKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGK 401
             + VV + S+I+ Y   G   +A+K+FEEM + G SP   +  +VL  C+   L++EGK
Sbjct: 365 SDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGK 424

Query: 402 KLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRI 461
           ++ E  +KE+ +   +     ++D+  +   + EA  +  ++R++     W +++G    
Sbjct: 425 RVHE-WIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIIS-WNTIIGGYSK 482

Query: 462 HCHVELA 466
           +C+   A
Sbjct: 485 NCYANEA 482


HSP 3 Score: 174.9 bits (442), Expect = 2.9e-42
Identity = 110/404 (27.23%), Positives = 197/404 (48.76%), Query Frame = 1

Query: 42  NHLIQSLCKQGNLKQALYL----LSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVD 101
           N L+  L K G+   ++ L    +S        T   +  S +   S+     +H  ++ 
Sbjct: 164 NILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILK 223

Query: 102 GGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLEL 161
            GF +   +   L+  + +   VD+ARKVFD+  +R +  WN++       G     L +
Sbjct: 224 SGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSV 283

Query: 162 YPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMD 221
           + +M + G+  D  T   +   C  S  L+S    G+ +H+  ++  +        TL+D
Sbjct: 284 FVQMLVSGIEIDLATIVSVFAGCADSR-LISL---GRAVHSIGVKACFSREDRFCNTLLD 343

Query: 222 MYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNS 281
           MY++ G +  A AVF EM  ++VVS+++MIA YA+ G   EA++LF EM        P+ 
Sbjct: 344 MYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEM--EEEGISPDV 403

Query: 282 VTMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDR 341
            T+ +VL  CA +  L++GK +H +I    L   + V +AL+ MYA+CG ++  +L+F  
Sbjct: 404 YTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSE 463

Query: 342 MHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDH-GFSPSHISFISVLGACSHTGLVEE 401
           M  KD++ WN++I  Y  + Y  +A+ +F  +++   FSP   +   VL AC+     ++
Sbjct: 464 MRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDK 523

Query: 402 GKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDL 441
           G+++    +  +G          +VD+  +   L  A  + +D+
Sbjct: 524 GREI-HGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDI 560

BLAST of Csa1G524740 vs. Swiss-Prot
Match: PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 467.6 bits (1202), Expect = 2.2e-130
Identity = 236/609 (38.75%), Positives = 375/609 (61.58%), Query Frame = 1

Query: 42  NHLIQSLCKQGNLKQALYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFD 101
           N +I   C+ GN K+AL L +        T   L+ +       +  + +H   +  G +
Sbjct: 220 NAMISGYCQSGNAKEALTLSNGLRAMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLE 279

Query: 102 QDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRM 161
            + F++ KLI++++E   + + +KVFD+   R +  WN++ +A  L  +    + L+  M
Sbjct: 280 SELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEM 339

Query: 162 NMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAH-VHVMTTLMDMYA 221
            +  +  D  T   L  A + S+  +  ++  + +    LR G+    + +   ++ MYA
Sbjct: 340 RLSRIQPDCLTLISL--ASILSQ--LGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYA 399

Query: 222 RFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTM 281
           + G V  A AVF+ +P  +V+SW+ +I+ YA+NG   EA+E++  +M    +   N  T 
Sbjct: 400 KLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYN-IMEEEGEIAANQGTW 459

Query: 282 VSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHK 341
           VSVL AC+   AL QG  +H  +L+ GL   + V+++L  MY +CG+LE    +F ++ +
Sbjct: 460 VSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPR 519

Query: 342 KDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKL 401
            + V WN+LI+ +G HG+G KA+ +F+EM+D G  P HI+F+++L ACSH+GLV+EG+  
Sbjct: 520 VNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWC 579

Query: 402 FESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHC 461
           FE M  ++GI PS++HY CMVD+ GRA +L+ A K I+ + ++P   +WG+LL ACR+H 
Sbjct: 580 FEMMQTDYGITPSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHG 639

Query: 462 HVELAERASKRLFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWI 521
           +V+L + AS+ LF++EP + G +VLL+++YA A  W+ V  ++ +   + L+K PG S +
Sbjct: 640 NVDLGKIASEHLFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSM 699

Query: 522 EVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLG 581
           EV  K+  F + ++ +P  E+++  L  L  ++K  GY P  + VL D++ +EKE I++ 
Sbjct: 700 EVDNKVEVFYTGNQTHPMYEEMYRELTALQAKLKMIGYVPDHRFVLQDVEDDEKEHILMS 759

Query: 582 HSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKD 641
           HSE+LA+AF LI T    TIRI KNLR+C DCHSVTKFISK  +REI+VRD NRFHHFK+
Sbjct: 760 HSERLAIAFALIATPAKTTIRIFKNLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKN 819

Query: 642 GVCSCGDYW 650
           GVCSCGDYW
Sbjct: 820 GVCSCGDYW 823


HSP 2 Score: 188.0 bits (476), Expect = 3.4e-46
Identity = 125/440 (28.41%), Positives = 224/440 (50.91%), Query Frame = 1

Query: 42  NHLIQSLCKQGNLKQ-----ALYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLV 101
           N +I    + GN  +     +L++LS    P  +T    +L A R  ++ D   +H L +
Sbjct: 121 NLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPS-VLKACR--TVIDGNKIHCLAL 180

Query: 102 DGGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLE 161
             GF  D ++A  LI+++S    V NAR +FD+   R +  WNA+      +G   + L 
Sbjct: 181 KFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAKEALT 240

Query: 162 LYPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLM 221
           L   +  M    D  T   LL AC  +        +G  IH++ ++HG  + + V   L+
Sbjct: 241 LSNGLRAM----DSVTVVSLLSACTEA----GDFNRGVTIHSYSIKHGLESELFVSNKLI 300

Query: 222 DMYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPN 281
           D+YA FG +     VFD M V++++SW+++I  Y  N +P  A+ LF+EM L+     P+
Sbjct: 301 DLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRIQ--PD 360

Query: 282 SVTMVSVLQACAAFAALEQGKLIHAYILRRG--LDSILPVISALITMYARCGKLESGQLI 341
            +T++S+    +    +   + +  + LR+G  L+ I  + +A++ MYA+ G ++S + +
Sbjct: 361 CLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDI-TIGNAVVVMYAKLGLVDSARAV 420

Query: 342 FDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHG-FSPSHISFISVLGACSHTGL 401
           F+ +   DV+ WN++IS Y  +G+  +AI+++  M + G  + +  +++SVL ACS  G 
Sbjct: 421 FNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGA 480

Query: 402 VEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDL-RIEPGPKVWGSL 461
           + +G KL   ++K +G+   V     + D+ G+  RL++A  +   + R+   P  W +L
Sbjct: 481 LRQGMKLHGRLLK-NGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVP--WNTL 540

Query: 462 LGACRIHCHVELAERASKRL 473
           +     H H E A    K +
Sbjct: 541 IACHGFHGHGEKAVMLFKEM 543


HSP 3 Score: 173.3 bits (438), Expect = 8.5e-42
Identity = 104/310 (33.55%), Positives = 165/310 (53.23%), Query Frame = 1

Query: 84  SLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFR 143
           +L  A  +H  LV     Q+  ++ KL+N++  L  V  AR  FD  + R +Y WN +  
Sbjct: 66  NLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTFDHIQNRDVYAWNLMIS 125

Query: 144 ALALAGRGNDVLELYPR-MNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILR 203
               AG  ++V+  +   M   G++ D  T+  +LKAC         +  G +IH   L+
Sbjct: 126 GYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKACRT-------VIDGNKIHCLALK 185

Query: 204 HGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALEL 263
            G+   V+V  +L+ +Y+R+  V  A  +FDEMPV+++ SW+AMI+ Y ++G   EAL L
Sbjct: 186 FGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAKEALTL 245

Query: 264 FREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMY 323
              +         +SVT+VS+L AC       +G  IH+Y ++ GL+S L V + LI +Y
Sbjct: 246 SNGLR------AMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLIDLY 305

Query: 324 ARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFI 383
           A  G+L   Q +FDRM+ +D++ WNS+I +Y L+    +AI +F+EM      P  ++ I
Sbjct: 306 AEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRIQPDCLTLI 362

Query: 384 SVLGACSHTG 393
           S+    S  G
Sbjct: 366 SLASILSQLG 362


HSP 4 Score: 98.2 bits (243), Expect = 3.5e-19
Identity = 68/270 (25.19%), Positives = 130/270 (48.15%), Query Frame = 1

Query: 190 LQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWSAMIAC 249
           LQ  K +HA ++      +V +   L+++Y   G V+ A   FD +  ++V +W+ MI+ 
Sbjct: 67  LQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTFDHIQNRDVYAWNLMISG 126

Query: 250 YAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLD 309
           Y + G   E +  F   ML++    P+  T  SVL+AC     +  G  IH   L+ G  
Sbjct: 127 YGRAGNSSEVIRCFSLFMLSS-GLTPDYRTFPSVLKAC---RTVIDGNKIHCLALKFGFM 186

Query: 310 SILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEM 369
             + V ++LI +Y+R   + + +++FD M  +D+  WN++IS Y   G  ++A+ +   +
Sbjct: 187 WDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAKEALTLSNGL 246

Query: 370 IDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANR 429
                +   ++ +S+L AC+  G    G  +    +K HG++  +     ++DL     R
Sbjct: 247 ----RAMDSVTVVSLLSACTEAGDFNRGVTIHSYSIK-HGLESELFVSNKLIDLYAEFGR 306

Query: 430 LDEAAKIIEDLRIEPGPKVWGSLLGACRIH 460
           L +  K+ + + +      W S++ A  ++
Sbjct: 307 LRDCQKVFDRMYVRDLIS-WNSIIKAYELN 326

BLAST of Csa1G524740 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 464.5 bits (1194), Expect = 1.9e-129
Identity = 243/623 (39.00%), Positives = 377/623 (60.51%), Query Frame = 1

Query: 29  FSFNPSTPPNSNNNHLIQSLCKQGNLKQALYLLSHESNPTQQTCELLILSAARRNSLSDA 88
           F +N      S NNH   +L    N++ A        +P   T   L+ + +  + L   
Sbjct: 85  FPWNAIIRGYSRNNHFQDALLMYSNMQLA------RVSPDSFTFPHLLKACSGLSHLQMG 144

Query: 89  LDVHQLLVDGGFDQDPFLATKLINMFSELDTVDNARKVFD--KTRKRTIYVWNALFRALA 148
             VH  +   GFD D F+   LI ++++   + +AR VF+     +RTI  W A+  A A
Sbjct: 145 RFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYA 204

Query: 149 LAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYG 208
             G   + LE++ +M  M V  D   +  L+    A  CL   L++G+ IHA +++ G  
Sbjct: 205 QNGEPMEALEIFSQMRKMDVKPD---WVALVSVLNAFTCLQD-LKQGRSIHASVVKMGLE 264

Query: 209 AHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREM 268
               ++ +L  MYA+ G V+ A  +FD+M   N++ W+AMI+ YAKNG   EA+++F EM
Sbjct: 265 IEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEM 324

Query: 269 MLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCG 328
           +    D  P+++++ S + ACA   +LEQ + ++ Y+ R      + + SALI M+A+CG
Sbjct: 325 I--NKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCG 384

Query: 329 KLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLG 388
            +E  +L+FDR   +DVV+W+++I  YGLHG  R+AI ++  M   G  P+ ++F+ +L 
Sbjct: 385 SVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLM 444

Query: 389 ACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGP 448
           AC+H+G+V EG   F  M  +H I P  +HYAC++DLLGRA  LD+A ++I+ + ++PG 
Sbjct: 445 ACNHSGMVREGWWFFNRMA-DHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGV 504

Query: 449 KVWGSLLGACRIHCHVELAERASKRLFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLL 508
            VWG+LL AC+ H HVEL E A+++LF ++P+N G+YV L+++YA A +WD V  V+  +
Sbjct: 505 TVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRM 564

Query: 509 DSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVL 568
             + L K  G SW+EVR ++ +F   D+ +P+ E++   +  + + +K+ G+       L
Sbjct: 565 KEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASL 624

Query: 569 YDLDQEEKERIVLGHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADRE 628
           +DL+ EE E  +  HSE++A+A+GLI+T +G  +RITKNLR C +CH+ TK ISK  DRE
Sbjct: 625 HDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDRE 684

Query: 629 IMVRDLNRFHHFKDGVCSCGDYW 650
           I+VRD NRFHHFKDGVCSCGDYW
Sbjct: 685 IVVRDTNRFHHFKDGVCSCGDYW 694


HSP 2 Score: 230.7 bits (587), Expect = 4.5e-59
Identity = 131/387 (33.85%), Positives = 216/387 (55.81%), Query Frame = 1

Query: 75  LILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRT 134
           LI SA  +  L     +H  L+  G     FL TKLI+  S    +  AR+VFD   +  
Sbjct: 27  LIDSATHKAQLKQ---IHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQ 86

Query: 135 IYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGK 194
           I+ WNA+ R  +      D L +Y  M +  VS D FT+ +LLKAC      +S LQ G+
Sbjct: 87  IFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSG----LSHLQMGR 146

Query: 195 EIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPV--KNVVSWSAMIACYAK 254
            +HA + R G+ A V V   L+ +YA+   +  A  VF+ +P+  + +VSW+A+++ YA+
Sbjct: 147 FVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQ 206

Query: 255 NGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLDSIL 314
           NG+P EALE+F +M     D  P+ V +VSVL A      L+QG+ IHA +++ GL+   
Sbjct: 207 NGEPMEALEIFSQM--RKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEP 266

Query: 315 PVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDH 374
            ++ +L TMYA+CG++ + +++FD+M   +++LWN++IS Y  +GY R+AI +F EMI+ 
Sbjct: 267 DLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINK 326

Query: 375 GFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDE 434
              P  IS  S + AC+  G +E+ + ++E  V     +  V   + ++D+  +   + E
Sbjct: 327 DVRPDTISITSAISACAQVGSLEQARSMYE-YVGRSDYRDDVFISSALIDMFAKCGSV-E 386

Query: 435 AAKIIEDLRIEPGPKVWGSLLGACRIH 460
            A+++ D  ++    VW +++    +H
Sbjct: 387 GARLVFDRTLDRDVVVWSAMIVGYGLH 402

BLAST of Csa1G524740 vs. Swiss-Prot
Match: PP341_ARATH (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana GN=DYW9 PE=2 SV=1)

HSP 1 Score: 461.5 bits (1186), Expect = 1.6e-128
Identity = 241/591 (40.78%), Positives = 355/591 (60.07%), Query Frame = 1

Query: 61  LSHESNPTQQTCELLIL--SAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELD 120
           L +ES     T  LL +  + A    L   + +H L    G     ++ T  I+++S+  
Sbjct: 211 LINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCG 270

Query: 121 TVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLK 180
            +     +F + RK  I  +NA+       G     L L+  + + G      T   L+ 
Sbjct: 271 KIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVP 330

Query: 181 ACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVK 240
             V+   ++ +      IH + L+  + +H  V T L  +Y++   +  A  +FDE P K
Sbjct: 331 --VSGHLMLIYA-----IHGYCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEK 390

Query: 241 NVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKL 300
           ++ SW+AMI+ Y +NG   +A+ LFREM  +     PN VT+  +L ACA   AL  GK 
Sbjct: 391 SLPSWNAMISGYTQNGLTEDAISLFREMQKSEFS--PNPVTITCILSACAQLGALSLGKW 450

Query: 301 IHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGY 360
           +H  +     +S + V +ALI MYA+CG +   + +FD M KK+ V WN++IS YGLHG 
Sbjct: 451 VHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQ 510

Query: 361 GRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYA 420
           G++A+ IF EM++ G +P+ ++F+ VL ACSH GLV+EG ++F SM+  +G +PSV+HYA
Sbjct: 511 GQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYA 570

Query: 421 CMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEPT 480
           CMVD+LGRA  L  A + IE + IEPG  VW +LLGACRIH    LA   S++LF+L+P 
Sbjct: 571 CMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLARTVSEKLFELDPD 630

Query: 481 NAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQ 540
           N G +VLL++I++    + +   V++    R+L K PG + IE+    + FTS D+ +PQ
Sbjct: 631 NVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQ 690

Query: 541 GEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKGD 600
            ++++  L  L  +M++ GY P+T+L L+D+++EE+E +V  HSE+LA+AFGLI T  G 
Sbjct: 691 VKEIYEKLEKLEGKMREAGYQPETELALHDVEEEERELMVKVHSERLAIAFGLIATEPGT 750

Query: 601 TIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
            IRI KNLR+C DCH+VTK ISK  +R I+VRD NRFHHFKDGVCSCGDYW
Sbjct: 751 EIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGVCSCGDYW 792


HSP 2 Score: 147.9 bits (372), Expect = 3.8e-34
Identity = 101/398 (25.38%), Positives = 191/398 (47.99%), Query Frame = 1

Query: 62  SHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTVD 121
           S +  P   T    I +A+          +H   V  G D +  L + ++ M+ +   V+
Sbjct: 112 STDLKPNSSTYAFAISAASGFRDDRAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVE 171

Query: 122 NARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKACV 181
           +ARKVFD+  ++   +WN +           + ++++   +++  S  R   T LL    
Sbjct: 172 DARKVFDRMPEKDTILWNTMISGYRKNEMYVESIQVF--RDLINESCTRLDTTTLLDILP 231

Query: 182 ASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVV 241
           A   L   L+ G +IH+   + G  +H +V+T  + +Y++ G +   SA+F E    ++V
Sbjct: 232 AVAELQE-LRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIV 291

Query: 242 SWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHA 301
           +++AMI  Y  NG+   +L LF+E+ML+   +   S T+VS++        +     IH 
Sbjct: 292 AYNAMIHGYTSNGETELSLSLFKELMLS--GARLRSSTLVSLVPVSGHLMLIYA---IHG 351

Query: 302 YILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRK 361
           Y L+    S   V +AL T+Y++  ++ES + +FD   +K +  WN++IS Y  +G    
Sbjct: 352 YCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTED 411

Query: 362 AIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMV 421
           AI +F EM    FSP+ ++   +L AC+  G +  GK + + +V+    + S+     ++
Sbjct: 412 AISLFREMQKSEFSPNPVTITCILSACAQLGALSLGKWVHD-LVRSTDFESSIYVSTALI 471

Query: 422 DLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIH 460
            +  +   + EA ++  DL  +     W +++    +H
Sbjct: 472 GMYAKCGSIAEARRLF-DLMTKKNEVTWNTMISGYGLH 499


HSP 3 Score: 129.8 bits (325), Expect = 1.1e-28
Identity = 86/342 (25.15%), Positives = 162/342 (47.37%), Query Frame = 1

Query: 60  LLSHESNPTQQTCELLILSAA------RRNSLSDALDVHQLLVDGGFDQDPFLATKLINM 119
           LL   S+ T +T   LI          R  S+S     H  ++  GF  D  L TKL   
Sbjct: 2   LLRTVSSATAETTAALISKNTYLDFFKRSTSISHLAQTHAQIILHGFRNDISLLTKLTQR 61

Query: 120 FSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMM-GVSSDRFT 179
            S+L  +  AR +F   ++  ++++N L R  ++    +  L ++  +     +  +  T
Sbjct: 62  LSDLGAIYYARDIFLSVQRPDVFLFNVLMRGFSVNESPHSSLSVFAHLRKSTDLKPNSST 121

Query: 180 YTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVF 239
           Y + + A           + G+ IH   +  G  + + + + ++ MY +F  V  A  VF
Sbjct: 122 YAFAISAASGFRDD----RAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVF 181

Query: 240 DEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAA 299
           D MP K+ + W+ MI+ Y KN    E++++FR++ +N   +  ++ T++ +L A A    
Sbjct: 182 DRMPEKDTILWNTMISGYRKNEMYVESIQVFRDL-INESCTRLDTTTLLDILPAVAELQE 241

Query: 300 LEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISS 359
           L  G  IH+   + G  S   V++  I++Y++CGK++ G  +F    K D+V +N++I  
Sbjct: 242 LRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHG 301

Query: 360 YGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLV 395
           Y  +G    ++ +F+E++  G      + +S++    H  L+
Sbjct: 302 YTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVSGHLMLI 338

BLAST of Csa1G524740 vs. TrEMBL
Match: F6GUX8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g05500 PE=4 SV=1)

HSP 1 Score: 1001.5 bits (2588), Expect = 4.7e-289
Identity = 499/658 (75.84%), Positives = 559/658 (84.95%), Query Frame = 1

Query: 1   MWALRTPYSTHYP----PSSPRYSTSKLSVSSFSFNPSTPPNSN-----NNHLIQSLCKQ 60
           MWA +TP +   P    P     + S       +  PST   SN     NN LIQSLCKQ
Sbjct: 1   MWAFQTPQTIQQPHLPKPFHKPTAISPKPQCCLALRPSTTTRSNGDSNNNNPLIQSLCKQ 60

Query: 61  GNLKQALYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLI 120
           GNL QAL +LS E NPTQ T ELLILS  R+NSL   +D+H+ L+  G DQDPFLATKLI
Sbjct: 61  GNLNQALQVLSQEPNPTQHTYELLILSCTRQNSLPQGIDLHRHLIHDGSDQDPFLATKLI 120

Query: 121 NMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRF 180
           NM+SELD++DNARKVFDKTRKRTIYVWNALFRAL LAG G +VL+LY RMN +GV SDRF
Sbjct: 121 NMYSELDSIDNARKVFDKTRKRTIYVWNALFRALTLAGYGREVLDLYRRMNRIGVPSDRF 180

Query: 181 TYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAV 240
           TYTY+LKACVASE  VS L  G+EIH HILRHG+  HVH+MTTL+DMYARFGCV  AS V
Sbjct: 181 TYTYVLKACVASEAFVSLLLNGREIHGHILRHGFEGHVHIMTTLLDMYARFGCVLNASRV 240

Query: 241 FDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFA 300
           FD+MPVKNVVSWSAMIACY+KNGKP EALELFR+MML   D +PNSVTMVSVLQACAA A
Sbjct: 241 FDQMPVKNVVSWSAMIACYSKNGKPLEALELFRKMMLENQDLLPNSVTMVSVLQACAALA 300

Query: 301 ALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLIS 360
           ALEQGKL+H YILRRGLDSILPV+SAL+T+YARCG LE G  +F+RM K+DVV WNSLIS
Sbjct: 301 ALEQGKLMHGYILRRGLDSILPVVSALVTVYARCGNLELGHRVFERMEKRDVVSWNSLIS 360

Query: 361 SYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQ 420
           SYG+HG+GRKAI+IF+EMID G SPS ISF+SVLGACSH GLVEEGK LFESMV+ H I 
Sbjct: 361 SYGIHGFGRKAIQIFKEMIDQGLSPSPISFVSVLGACSHAGLVEEGKVLFESMVRGHKIF 420

Query: 421 PSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKR 480
           PSVEHYACMVDLLGRANRLDEAAKII+D+RIEPGPKVWGSLLG+CRIHC+VELAERA+ R
Sbjct: 421 PSVEHYACMVDLLGRANRLDEAAKIIDDMRIEPGPKVWGSLLGSCRIHCNVELAERATSR 480

Query: 481 LFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTS 540
           LF+LEPTNAGNYVLLADIYAEA+MW+EVKRVK LL++R LQKVPGRS IE+RRKIYSF S
Sbjct: 481 LFELEPTNAGNYVLLADIYAEAKMWNEVKRVKMLLEARGLQKVPGRSCIEIRRKIYSFMS 540

Query: 541 VDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGL 600
           VDEFNPQ EQLHALL+ LS EMK++GY P TK+VLYDLD EEKERIVLGHSEKLA+AFGL
Sbjct: 541 VDEFNPQIEQLHALLLKLSMEMKEKGYVPDTKVVLYDLDPEEKERIVLGHSEKLALAFGL 600

Query: 601 INTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
           IN+ KG+TIRITKNLRLCEDCHSVTKFISKFA+REI+VRD+NRFH F+DGVCSCGDYW
Sbjct: 601 INSKKGETIRITKNLRLCEDCHSVTKFISKFANREILVRDVNRFHLFQDGVCSCGDYW 658

BLAST of Csa1G524740 vs. TrEMBL
Match: A0A061GWW5_THECC (Tetratricopeptide repeat (TPR)-like superfamily protein OS=Theobroma cacao GN=TCM_041754 PE=4 SV=1)

HSP 1 Score: 1000.3 bits (2585), Expect = 1.0e-288
Identity = 492/651 (75.58%), Positives = 570/651 (87.56%), Query Frame = 1

Query: 1   MWALRTPYSTHYPP-SSPRYSTSKLSVSSFSFNPS-TPPNSNNNHLIQSLCKQGNLKQAL 60
           MWA  +P  T  P  S+P  ++ KL  SS + NPS +  N NNN LIQSLCK+GNLKQA 
Sbjct: 1   MWAFHSPQPTQPPSLSNPPRTSPKLPSSSLTLNPSISTSNLNNNQLIQSLCKEGNLKQAF 60

Query: 61  YLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELD 120
            LLS E NP+Q+T ELLILS A +NSLS A  +H  +   GFDQDPFL TKLI+M+S LD
Sbjct: 61  KLLSQEPNPSQRTYELLILSCAHQNSLSLAQSLHSHISQNGFDQDPFLVTKLISMYSALD 120

Query: 121 TVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLK 180
           ++D+ARK+FDKTRKRTI+VWNALFRAL LAG G +VL LY +MN  G+ SDRFTYTY+LK
Sbjct: 121 SLDDARKLFDKTRKRTIFVWNALFRALTLAGFGEEVLGLYRQMNRTGIPSDRFTYTYVLK 180

Query: 181 ACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVK 240
           ACVASECLVS L+KGKEIHA+ILRHGY AHVH+MTTL+DMYARFGCVS AS VF EMPV+
Sbjct: 181 ACVASECLVSLLKKGKEIHAYILRHGYEAHVHIMTTLVDMYARFGCVSCASFVFGEMPVR 240

Query: 241 NVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKL 300
           NVVSWSAMIACYAKNGK +EALELFREMM+ THDS PNSVTMVSVLQACAA AALEQGKL
Sbjct: 241 NVVSWSAMIACYAKNGKSFEALELFREMMVETHDSFPNSVTMVSVLQACAALAALEQGKL 300

Query: 301 IHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGY 360
           IHAYILRRGLDS+LPVISALITMY+RCGKLE GQ IFD+M K+DVV WNSLISSY +HG+
Sbjct: 301 IHAYILRRGLDSVLPVISALITMYSRCGKLELGQRIFDQMEKRDVVSWNSLISSYAVHGF 360

Query: 361 GRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYA 420
           G+KAI+IF+EMI  G SPS ++F+SVLGACSH GLVEEGK LF+SM KEHGI PSVEHYA
Sbjct: 361 GKKAIQIFQEMIHQGVSPSPVTFVSVLGACSHAGLVEEGKWLFDSMHKEHGIYPSVEHYA 420

Query: 421 CMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEPT 480
           CMVDLLGRANRL+EAA+II+++RIEPG KVWGSLLG+CRIHC+V+LAERAS RLF+LEP 
Sbjct: 421 CMVDLLGRANRLEEAARIIDEMRIEPGAKVWGSLLGSCRIHCNVDLAERASSRLFQLEPV 480

Query: 481 NAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQ 540
           +AGNYVLLADIYAEA+MWDEVKRV+KLL++R LQKVPGRSWIEV+RKIYSF SVDE NPQ
Sbjct: 481 SAGNYVLLADIYAEAKMWDEVKRVRKLLETRSLQKVPGRSWIEVKRKIYSFVSVDESNPQ 540

Query: 541 GEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKGD 600
            E++ + L+ LS EMK++GY PQTK+VLYDL++ EKERI+LGHSEKLAVAFGLINT+KG+
Sbjct: 541 IEEIQSFLIKLSAEMKEKGYVPQTKVVLYDLNEGEKERILLGHSEKLAVAFGLINTNKGE 600

Query: 601 TIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
           TIRITKNLRLCEDCH++TKFISKFA++EI+VRD+NRFHHF++GVCSC DYW
Sbjct: 601 TIRITKNLRLCEDCHTLTKFISKFANKEILVRDVNRFHHFQNGVCSCDDYW 651

BLAST of Csa1G524740 vs. TrEMBL
Match: V4S0Q9_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025096mg PE=4 SV=1)

HSP 1 Score: 1000.0 bits (2584), Expect = 1.4e-288
Identity = 501/652 (76.84%), Positives = 562/652 (86.20%), Query Frame = 1

Query: 4   LRTPYST----HYPPSSPRYSTSKLSVSSFSFNPSTPPNSNNNHLIQSLCKQGNLKQALY 63
           LR+PY T    H PP  P      +S++S S  P++  + N N LIQSLCKQGNLKQAL 
Sbjct: 13  LRSPYHTNSIAHLPPK-PSSVCCCVSLNS-STTPTSLSSRNKNELIQSLCKQGNLKQALD 72

Query: 64  LLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDT 123
           +LS E NPTQ T ELL+LS A  NSLSDAL+VH  L D GFDQDPFL TKLIN++S  D+
Sbjct: 73  VLSSEPNPTQHTYELLLLSCAHHNSLSDALNVHCHLTDNGFDQDPFLVTKLINVYSHFDS 132

Query: 124 VDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMG--VSSDRFTYTYLL 183
           VD+AR VFDKTR+RTIYVWNALFRAL LAGRG +VLELY RMN  G  + SDRFTYTY+L
Sbjct: 133 VDDARHVFDKTRRRTIYVWNALFRALTLAGRGEEVLELYRRMNGTGTGIRSDRFTYTYVL 192

Query: 184 KACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPV 243
           KACVAS C  S L+ GKEIHA +LRHGY   VH+MTTL+DMYARFGCV YA  VF +M V
Sbjct: 193 KACVASSCGFSLLKHGKEIHASVLRHGYNGIVHIMTTLIDMYARFGCVMYAGFVFSQMAV 252

Query: 244 KNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGK 303
           KNVVSWSAMIACYA+NG  +EALELFREM++ +HD  PNSVTMVSVLQACAA AALEQGK
Sbjct: 253 KNVVSWSAMIACYARNGMAFEALELFREMIMESHDLCPNSVTMVSVLQACAALAALEQGK 312

Query: 304 LIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHG 363
           +IH YILRRGLDSILPV+SAL+TMYARCGKLE GQ +FD M K+DVV WNSLISSYG+HG
Sbjct: 313 MIHGYILRRGLDSILPVVSALVTMYARCGKLELGQCVFDHMDKRDVVSWNSLISSYGVHG 372

Query: 364 YGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHY 423
           YG KAI+IF+EMI HG SPS ISF+SVLGACSH GLVEEGK LFESM KEH ++PSVEHY
Sbjct: 373 YGGKAIQIFKEMIYHGVSPSPISFVSVLGACSHAGLVEEGKMLFESMRKEHMVRPSVEHY 432

Query: 424 ACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEP 483
           ACMVDLLGRAN+L+EAAKIIEDLRIEPGPKVWGSLLG+CRIHC+VELAERASKRLF+LEP
Sbjct: 433 ACMVDLLGRANKLEEAAKIIEDLRIEPGPKVWGSLLGSCRIHCNVELAERASKRLFELEP 492

Query: 484 TNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNP 543
           TNAGNYVLLAD+YA A+MWDEVKRVK+LL++R LQKVPGRS IEV+RK+YSF SVDEFNP
Sbjct: 493 TNAGNYVLLADVYAAADMWDEVKRVKRLLEARGLQKVPGRSRIEVKRKMYSFVSVDEFNP 552

Query: 544 QGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKG 603
           Q EQLHALL+NLS EMK++GY PQTK+VLYDLD EEKERIVLGHSEKLAVAFGLINTSKG
Sbjct: 553 QFEQLHALLINLSAEMKEKGYVPQTKVVLYDLDAEEKERIVLGHSEKLAVAFGLINTSKG 612

Query: 604 DTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
           +TIRITKNLRLCEDCHS TKFISKFA++EI+VRD+NRFHHF +GVCSCGDYW
Sbjct: 613 ETIRITKNLRLCEDCHSFTKFISKFANKEILVRDVNRFHHFHNGVCSCGDYW 662

BLAST of Csa1G524740 vs. TrEMBL
Match: A0A0D2RLL5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G128500 PE=4 SV=1)

HSP 1 Score: 997.7 bits (2578), Expect = 6.8e-288
Identity = 491/652 (75.31%), Positives = 572/652 (87.73%), Query Frame = 1

Query: 1   MWALRTPYSTHYPPS--SPRYSTSKLSVSSFSFNPSTPPNS-NNNHLIQSLCKQGNLKQA 60
           MWA  TP  T  PPS  +P  +  KL  SS + NPS   ++ N+N LIQSLCKQG+LKQA
Sbjct: 1   MWAFHTPQPTQ-PPSLFNPPRTPPKLPSSSLTLNPSISNSTPNHNQLIQSLCKQGDLKQA 60

Query: 61  LYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSEL 120
             LLS E NP+Q+T E+LILS A +NSLS A  +H  + + GFDQDPFL TKLI+M++ L
Sbjct: 61  FKLLSREPNPSQRTYEVLILSCADQNSLSLAQSLHSHISENGFDQDPFLVTKLISMYAAL 120

Query: 121 DTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLL 180
           D++D+ARKVFDKTRKRTI+VWNALFRAL LAG G +VL LY +MN +G+ SDRFTYTY+L
Sbjct: 121 DSLDDARKVFDKTRKRTIFVWNALFRALTLAGFGEEVLGLYRKMNRIGLPSDRFTYTYVL 180

Query: 181 KACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPV 240
           KACVASEC+VS L KGKEIHAHILRHG   +VH+MTTL+DMYARFGCV++AS VF++MPV
Sbjct: 181 KACVASECMVSLLNKGKEIHAHILRHGLEGYVHIMTTLVDMYARFGCVAHASFVFEKMPV 240

Query: 241 KNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGK 300
           +NVVSWSAM+ACYAKNGKP+EALELFREMM+ T DS PNSVTMVSVLQACAA +ALEQGK
Sbjct: 241 RNVVSWSAMMACYAKNGKPFEALELFREMMIETQDSAPNSVTMVSVLQACAALSALEQGK 300

Query: 301 LIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHG 360
           L+HAYILRRGLDS+LPVISALITMYARCG+LE GQ IFDRM K+DVV WNSLISSYGLHG
Sbjct: 301 LVHAYILRRGLDSVLPVISALITMYARCGELELGQRIFDRMEKRDVVSWNSLISSYGLHG 360

Query: 361 YGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHY 420
           YG+KA++IF+EMI  G SPS I+F+SVLGACSH GLVEEGKKLF+SM KEHGI PSVEHY
Sbjct: 361 YGKKAMQIFQEMIHQGVSPSSITFVSVLGACSHAGLVEEGKKLFDSMRKEHGIHPSVEHY 420

Query: 421 ACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEP 480
           ACMVDLLGRANRL+EAAKII+++RIEPG KVWGSLLG+CRIHC+VELAERAS RLF+LEP
Sbjct: 421 ACMVDLLGRANRLEEAAKIIDEMRIEPGAKVWGSLLGSCRIHCNVELAERASHRLFQLEP 480

Query: 481 TNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNP 540
            +AGNYVLLADIYAEAEMWD+VKRV+KLL++R LQKV GRSWIEVRRK+YSF SVDE NP
Sbjct: 481 HSAGNYVLLADIYAEAEMWDDVKRVRKLLETRSLQKVAGRSWIEVRRKMYSFVSVDEPNP 540

Query: 541 QGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKG 600
           Q E + +LL+ L+ EMK++GY+PQTK+VLYDLD+ EKERI+LGHSEKLAVAFGLINT KG
Sbjct: 541 QIELIQSLLIKLAAEMKEKGYSPQTKVVLYDLDESEKERILLGHSEKLAVAFGLINTKKG 600

Query: 601 DTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
           +TIRITKNLRLCEDCHS TKFISKF+++EI+VRD+NRFHHF++GVCSCGDYW
Sbjct: 601 ETIRITKNLRLCEDCHSFTKFISKFSNKEILVRDVNRFHHFQNGVCSCGDYW 651

BLAST of Csa1G524740 vs. TrEMBL
Match: A0A067D6U3_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006076mg PE=4 SV=1)

HSP 1 Score: 997.3 bits (2577), Expect = 8.8e-288
Identity = 499/652 (76.53%), Positives = 562/652 (86.20%), Query Frame = 1

Query: 4   LRTPYST----HYPPSSPRYSTSKLSVSSFSFNPSTPPNSNNNHLIQSLCKQGNLKQALY 63
           LR+PY T    H PP  P      +S++S S  P++  + N N LIQSLCKQGNL+QAL 
Sbjct: 13  LRSPYHTNSIAHLPPK-PSSVCCCVSLNS-STTPTSLSSRNKNELIQSLCKQGNLRQALD 72

Query: 64  LLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDT 123
           +LS E NPTQ T ELL+LS    NSLSDAL+VH  L D GFDQDPFL TKLIN++S  D+
Sbjct: 73  VLSIEPNPTQHTYELLLLSCTHHNSLSDALNVHSHLTDNGFDQDPFLVTKLINVYSHFDS 132

Query: 124 VDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMG--VSSDRFTYTYLL 183
           VD+AR VFDKTR+RTIYVWNALFRAL LAGRG +VLELY RMN  G  + SDRFTYTY+L
Sbjct: 133 VDDARHVFDKTRRRTIYVWNALFRALTLAGRGEEVLELYRRMNGTGTGIRSDRFTYTYVL 192

Query: 184 KACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPV 243
           KACVAS C  S L+ GKEIHA +LRHGY   VH+MTTL+DMYARFGCV YA  VF +M V
Sbjct: 193 KACVASSCGFSLLKHGKEIHASVLRHGYNGIVHIMTTLIDMYARFGCVMYAGFVFSQMAV 252

Query: 244 KNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGK 303
           KNVVSWSAMIACYA+NG  +EALELFREM++ +HD  PNSVTMVSVLQACAA AALEQGK
Sbjct: 253 KNVVSWSAMIACYARNGMAFEALELFREMIMESHDLCPNSVTMVSVLQACAALAALEQGK 312

Query: 304 LIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHG 363
           +IH YILRRGLDSILPV+SAL+TMYARCGKLE GQ +FD M K+DVV WNSLISSYG+HG
Sbjct: 313 MIHGYILRRGLDSILPVVSALVTMYARCGKLELGQCVFDHMDKRDVVSWNSLISSYGVHG 372

Query: 364 YGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHY 423
           YG KAI+IF+EMI HG SPS ISF+SVLGACSH GLVEEGK LFESM KEH I+PSVEHY
Sbjct: 373 YGGKAIQIFKEMIYHGVSPSPISFVSVLGACSHAGLVEEGKMLFESMRKEHMIRPSVEHY 432

Query: 424 ACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEP 483
           ACMVDLLGRAN+L+EAAKIIEDLRIEPGPKVWGSLLG+CRIHC+VELAERASKRLF+LEP
Sbjct: 433 ACMVDLLGRANKLEEAAKIIEDLRIEPGPKVWGSLLGSCRIHCNVELAERASKRLFELEP 492

Query: 484 TNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNP 543
           TNAGNYVLLAD+YA A+MWDEVKRVK+LL++R LQKVPGRS IEV+RK+YSF SVDEF+P
Sbjct: 493 TNAGNYVLLADVYAAADMWDEVKRVKRLLEARGLQKVPGRSRIEVKRKMYSFVSVDEFHP 552

Query: 544 QGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKG 603
           Q EQLHALL+NLS EMK++GY PQTK+VLYDLD EEKERIVLGHSEKLAVAFGLINTSKG
Sbjct: 553 QFEQLHALLINLSAEMKEKGYVPQTKVVLYDLDAEEKERIVLGHSEKLAVAFGLINTSKG 612

Query: 604 DTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
           +TIRITKNLRLCEDCHS TKFISKFA++EI+VRD+NRFHHF++GVCSCGDYW
Sbjct: 613 ETIRITKNLRLCEDCHSFTKFISKFANKEILVRDVNRFHHFRNGVCSCGDYW 662

BLAST of Csa1G524740 vs. TAIR10
Match: AT3G46790.1 (AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 965.3 bits (2494), Expect = 1.9e-281
Identity = 472/645 (73.18%), Positives = 550/645 (85.27%), Query Frame = 1

Query: 6   TPYSTHYPPSSPRYSTS-KLSVSSFSFNPSTPPNSNNNHLIQSLCKQGNLKQALYLLSHE 65
           T ++ ++ P SP    S  +++++ S +       +NN LIQSLCK+G LKQA+ +LS E
Sbjct: 13  TYHTVNFLPRSPLKPPSCSVALNNPSISSGAGAKISNNQLIQSLCKEGKLKQAIRVLSQE 72

Query: 66  SNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTVDNAR 125
           S+P+QQT ELLIL    R+SLSDAL VH+ ++D G DQDPFLATKLI M+S+L +VD AR
Sbjct: 73  SSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYAR 132

Query: 126 KVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKACVASE 185
           KVFDKTRKRTIYVWNALFRAL LAG G +VL LY +MN +GV SDRFTYTY+LKACVASE
Sbjct: 133 KVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASE 192

Query: 186 CLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWS 245
           C V+ L KGKEIHAH+ R GY +HV++MTTL+DMYARFGCV YAS VF  MPV+NVVSWS
Sbjct: 193 CTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWS 252

Query: 246 AMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYIL 305
           AMIACYAKNGK +EAL  FREMM  T DS PNSVTMVSVLQACA+ AALEQGKLIH YIL
Sbjct: 253 AMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYIL 312

Query: 306 RRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIK 365
           RRGLDSILPVISAL+TMY RCGKLE GQ +FDRMH +DVV WNSLISSYG+HGYG+KAI+
Sbjct: 313 RRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQ 372

Query: 366 IFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLL 425
           IFEEM+ +G SP+ ++F+SVLGACSH GLVEEGK+LFE+M ++HGI+P +EHYACMVDLL
Sbjct: 373 IFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLL 432

Query: 426 GRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEPTNAGNYV 485
           GRANRLDEAAK+++D+R EPGPKVWGSLLG+CRIH +VELAERAS+RLF LEP NAGNYV
Sbjct: 433 GRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYV 492

Query: 486 LLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGEQLHA 545
           LLADIYAEA+MWDEVKRVKKLL+ R LQK+PGR W+EVRRK+YSF SVDEFNP  EQ+HA
Sbjct: 493 LLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHA 552

Query: 546 LLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKGDTIRITK 605
            LV L+ +MK++GY PQTK VLY+L+ EEKERIVLGHSEKLA+AFGLINTSKG+ IRITK
Sbjct: 553 FLVKLAEDMKEKGYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITK 612

Query: 606 NLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
           NLRLCEDCH  TKFISKF ++EI+VRD+NRFH FK+GVCSCGDYW
Sbjct: 613 NLRLCEDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGDYW 657

BLAST of Csa1G524740 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 472.6 bits (1215), Expect = 3.8e-133
Identity = 235/559 (42.04%), Positives = 348/559 (62.25%), Query Frame = 1

Query: 91  VHQLLVDGGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGR 150
           VH + V   F ++      L++M+S+   +D+A+ VF +   R++  + ++    A  G 
Sbjct: 318 VHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGL 377

Query: 151 GNDVLELYPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVH 210
             + ++L+  M   G+S D +T T +L  C         L +GK +H  I  +  G  + 
Sbjct: 378 AGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYR----LLDEGKRVHEWIKENDLGFDIF 437

Query: 211 VMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNT 270
           V   LMDMYA+ G +  A  VF EM VK+++SW+ +I  Y+KN    EAL LF  ++L  
Sbjct: 438 VSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFN-LLLEE 497

Query: 271 HDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLES 330
               P+  T+  VL ACA+ +A ++G+ IH YI+R G  S   V ++L+ MYA+CG L  
Sbjct: 498 KRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLL 557

Query: 331 GQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSH 390
             ++FD +  KD+V W  +I+ YG+HG+G++AI +F +M   G     ISF+S+L ACSH
Sbjct: 558 AHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSH 617

Query: 391 TGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWG 450
           +GLV+EG + F  M  E  I+P+VEHYAC+VD+L R   L +A + IE++ I P   +WG
Sbjct: 618 SGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWG 677

Query: 451 SLLGACRIHCHVELAERASKRLFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRE 510
           +LL  CRIH  V+LAE+ ++++F+LEP N G YVL+A+IYAEAE W++VKR++K +  R 
Sbjct: 678 ALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQRG 737

Query: 511 LQKVPGRSWIEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLD 570
           L+K PG SWIE++ ++  F + D  NP+ E + A L  +   M + GY+P TK  L D +
Sbjct: 738 LRKNPGCSWIEIKGRVNIFVAGDSSNPETENIEAFLRKVRARMIEEGYSPLTKYALIDAE 797

Query: 571 QEEKERIVLGHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVR 630
           + EKE  + GHSEKLA+A G+I++  G  IR+TKNLR+C DCH + KF+SK   REI++R
Sbjct: 798 EMEKEEALCGHSEKLAMALGIISSGHGKIIRVTKNLRVCGDCHEMAKFMSKLTRREIVLR 857

Query: 631 DLNRFHHFKDGVCSCGDYW 650
           D NRFH FKDG CSC  +W
Sbjct: 858 DSNRFHQFKDGHCSCRGFW 871


HSP 2 Score: 184.9 bits (468), Expect = 1.6e-46
Identity = 116/427 (27.17%), Positives = 214/427 (50.12%), Query Frame = 1

Query: 42  NHLIQSLCKQGNLKQALYLLSHESN---PTQQTCELLILSAARRNSLSDALDVHQLLVDG 101
           N  ++  C+ GNL+ A+ LL          +  C +L L A  + SL D  +V   +   
Sbjct: 65  NTQLRRFCESGNLENAVKLLCVSGKWDIDPRTLCSVLQLCADSK-SLKDGKEVDNFIRGN 124

Query: 102 GFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELY 161
           GF  D  L +KL  M++    +  A +VFD+ +      WN L   LA +G  +  + L+
Sbjct: 125 GFVIDSNLGSKLSLMYTNCGDLKEASRVFDEVKIEKALFWNILMNELAKSGDFSGSIGLF 184

Query: 162 PRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDM 221
            +M   GV  D +T++ + K+  +    +  +  G+++H  IL+ G+G    V  +L+  
Sbjct: 185 KKMMSSGVEMDSYTFSCVSKSFSS----LRSVHGGEQLHGFILKSGFGERNSVGNSLVAF 244

Query: 222 YARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSV 281
           Y +   V  A  VFDEM  ++V+SW+++I  Y  NG   + L +F +M+++  +   +  
Sbjct: 245 YLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEI--DLA 304

Query: 282 TMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRM 341
           T+VSV   CA    +  G+ +H+  ++          + L+ MY++CG L+S + +F  M
Sbjct: 305 TIVSVFAGCADSRLISLGRAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREM 364

Query: 342 HKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGK 401
             + VV + S+I+ Y   G   +A+K+FEEM + G SP   +  +VL  C+   L++EGK
Sbjct: 365 SDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDEGK 424

Query: 402 KLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRI 461
           ++ E  +KE+ +   +     ++D+  +   + EA  +  ++R++     W +++G    
Sbjct: 425 RVHE-WIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIIS-WNTIIGGYSK 482

Query: 462 HCHVELA 466
           +C+   A
Sbjct: 485 NCYANEA 482


HSP 3 Score: 174.9 bits (442), Expect = 1.7e-43
Identity = 110/404 (27.23%), Positives = 197/404 (48.76%), Query Frame = 1

Query: 42  NHLIQSLCKQGNLKQALYL----LSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVD 101
           N L+  L K G+   ++ L    +S        T   +  S +   S+     +H  ++ 
Sbjct: 164 NILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGGEQLHGFILK 223

Query: 102 GGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLEL 161
            GF +   +   L+  + +   VD+ARKVFD+  +R +  WN++       G     L +
Sbjct: 224 SGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVSNGLAEKGLSV 283

Query: 162 YPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMD 221
           + +M + G+  D  T   +   C  S  L+S    G+ +H+  ++  +        TL+D
Sbjct: 284 FVQMLVSGIEIDLATIVSVFAGCADSR-LISL---GRAVHSIGVKACFSREDRFCNTLLD 343

Query: 222 MYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNS 281
           MY++ G +  A AVF EM  ++VVS+++MIA YA+ G   EA++LF EM        P+ 
Sbjct: 344 MYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEM--EEEGISPDV 403

Query: 282 VTMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDR 341
            T+ +VL  CA +  L++GK +H +I    L   + V +AL+ MYA+CG ++  +L+F  
Sbjct: 404 YTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSE 463

Query: 342 MHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDH-GFSPSHISFISVLGACSHTGLVEE 401
           M  KD++ WN++I  Y  + Y  +A+ +F  +++   FSP   +   VL AC+     ++
Sbjct: 464 MRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASLSAFDK 523

Query: 402 GKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDL 441
           G+++    +  +G          +VD+  +   L  A  + +D+
Sbjct: 524 GREI-HGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDI 560

BLAST of Csa1G524740 vs. TAIR10
Match: AT4G33990.1 (AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 467.6 bits (1202), Expect = 1.2e-131
Identity = 236/609 (38.75%), Positives = 375/609 (61.58%), Query Frame = 1

Query: 42  NHLIQSLCKQGNLKQALYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFD 101
           N +I   C+ GN K+AL L +        T   L+ +       +  + +H   +  G +
Sbjct: 220 NAMISGYCQSGNAKEALTLSNGLRAMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLE 279

Query: 102 QDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRM 161
            + F++ KLI++++E   + + +KVFD+   R +  WN++ +A  L  +    + L+  M
Sbjct: 280 SELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEM 339

Query: 162 NMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAH-VHVMTTLMDMYA 221
            +  +  D  T   L  A + S+  +  ++  + +    LR G+    + +   ++ MYA
Sbjct: 340 RLSRIQPDCLTLISL--ASILSQ--LGDIRACRSVQGFTLRKGWFLEDITIGNAVVVMYA 399

Query: 222 RFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTM 281
           + G V  A AVF+ +P  +V+SW+ +I+ YA+NG   EA+E++  +M    +   N  T 
Sbjct: 400 KLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYN-IMEEEGEIAANQGTW 459

Query: 282 VSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHK 341
           VSVL AC+   AL QG  +H  +L+ GL   + V+++L  MY +CG+LE    +F ++ +
Sbjct: 460 VSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPR 519

Query: 342 KDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKL 401
            + V WN+LI+ +G HG+G KA+ +F+EM+D G  P HI+F+++L ACSH+GLV+EG+  
Sbjct: 520 VNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWC 579

Query: 402 FESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHC 461
           FE M  ++GI PS++HY CMVD+ GRA +L+ A K I+ + ++P   +WG+LL ACR+H 
Sbjct: 580 FEMMQTDYGITPSLKHYGCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHG 639

Query: 462 HVELAERASKRLFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWI 521
           +V+L + AS+ LF++EP + G +VLL+++YA A  W+ V  ++ +   + L+K PG S +
Sbjct: 640 NVDLGKIASEHLFEVEPEHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSM 699

Query: 522 EVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLG 581
           EV  K+  F + ++ +P  E+++  L  L  ++K  GY P  + VL D++ +EKE I++ 
Sbjct: 700 EVDNKVEVFYTGNQTHPMYEEMYRELTALQAKLKMIGYVPDHRFVLQDVEDDEKEHILMS 759

Query: 582 HSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKD 641
           HSE+LA+AF LI T    TIRI KNLR+C DCHSVTKFISK  +REI+VRD NRFHHFK+
Sbjct: 760 HSERLAIAFALIATPAKTTIRIFKNLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKN 819

Query: 642 GVCSCGDYW 650
           GVCSCGDYW
Sbjct: 820 GVCSCGDYW 823


HSP 2 Score: 188.0 bits (476), Expect = 1.9e-47
Identity = 125/440 (28.41%), Positives = 224/440 (50.91%), Query Frame = 1

Query: 42  NHLIQSLCKQGNLKQ-----ALYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLV 101
           N +I    + GN  +     +L++LS    P  +T    +L A R  ++ D   +H L +
Sbjct: 121 NLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPS-VLKACR--TVIDGNKIHCLAL 180

Query: 102 DGGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLE 161
             GF  D ++A  LI+++S    V NAR +FD+   R +  WNA+      +G   + L 
Sbjct: 181 KFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAKEALT 240

Query: 162 LYPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLM 221
           L   +  M    D  T   LL AC  +        +G  IH++ ++HG  + + V   L+
Sbjct: 241 LSNGLRAM----DSVTVVSLLSACTEA----GDFNRGVTIHSYSIKHGLESELFVSNKLI 300

Query: 222 DMYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPN 281
           D+YA FG +     VFD M V++++SW+++I  Y  N +P  A+ LF+EM L+     P+
Sbjct: 301 DLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRIQ--PD 360

Query: 282 SVTMVSVLQACAAFAALEQGKLIHAYILRRG--LDSILPVISALITMYARCGKLESGQLI 341
            +T++S+    +    +   + +  + LR+G  L+ I  + +A++ MYA+ G ++S + +
Sbjct: 361 CLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDI-TIGNAVVVMYAKLGLVDSARAV 420

Query: 342 FDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHG-FSPSHISFISVLGACSHTGL 401
           F+ +   DV+ WN++IS Y  +G+  +AI+++  M + G  + +  +++SVL ACS  G 
Sbjct: 421 FNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGA 480

Query: 402 VEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDL-RIEPGPKVWGSL 461
           + +G KL   ++K +G+   V     + D+ G+  RL++A  +   + R+   P  W +L
Sbjct: 481 LRQGMKLHGRLLK-NGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVP--WNTL 540

Query: 462 LGACRIHCHVELAERASKRL 473
           +     H H E A    K +
Sbjct: 541 IACHGFHGHGEKAVMLFKEM 543


HSP 3 Score: 173.3 bits (438), Expect = 4.8e-43
Identity = 104/310 (33.55%), Positives = 165/310 (53.23%), Query Frame = 1

Query: 84  SLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFR 143
           +L  A  +H  LV     Q+  ++ KL+N++  L  V  AR  FD  + R +Y WN +  
Sbjct: 66  NLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTFDHIQNRDVYAWNLMIS 125

Query: 144 ALALAGRGNDVLELYPR-MNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILR 203
               AG  ++V+  +   M   G++ D  T+  +LKAC         +  G +IH   L+
Sbjct: 126 GYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKACRT-------VIDGNKIHCLALK 185

Query: 204 HGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALEL 263
            G+   V+V  +L+ +Y+R+  V  A  +FDEMPV+++ SW+AMI+ Y ++G   EAL L
Sbjct: 186 FGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAKEALTL 245

Query: 264 FREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMY 323
              +         +SVT+VS+L AC       +G  IH+Y ++ GL+S L V + LI +Y
Sbjct: 246 SNGLR------AMDSVTVVSLLSACTEAGDFNRGVTIHSYSIKHGLESELFVSNKLIDLY 305

Query: 324 ARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFI 383
           A  G+L   Q +FDRM+ +D++ WNS+I +Y L+    +AI +F+EM      P  ++ I
Sbjct: 306 AEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQPLRAISLFQEMRLSRIQPDCLTLI 362

Query: 384 SVLGACSHTG 393
           S+    S  G
Sbjct: 366 SLASILSQLG 362


HSP 4 Score: 98.2 bits (243), Expect = 2.0e-20
Identity = 68/270 (25.19%), Positives = 130/270 (48.15%), Query Frame = 1

Query: 190 LQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWSAMIAC 249
           LQ  K +HA ++      +V +   L+++Y   G V+ A   FD +  ++V +W+ MI+ 
Sbjct: 67  LQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLGNVALARHTFDHIQNRDVYAWNLMISG 126

Query: 250 YAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLD 309
           Y + G   E +  F   ML++    P+  T  SVL+AC     +  G  IH   L+ G  
Sbjct: 127 YGRAGNSSEVIRCFSLFMLSS-GLTPDYRTFPSVLKAC---RTVIDGNKIHCLALKFGFM 186

Query: 310 SILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEM 369
             + V ++LI +Y+R   + + +++FD M  +D+  WN++IS Y   G  ++A+ +   +
Sbjct: 187 WDVYVAASLIHLYSRYKAVGNARILFDEMPVRDMGSWNAMISGYCQSGNAKEALTLSNGL 246

Query: 370 IDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANR 429
                +   ++ +S+L AC+  G    G  +    +K HG++  +     ++DL     R
Sbjct: 247 ----RAMDSVTVVSLLSACTEAGDFNRGVTIHSYSIK-HGLESELFVSNKLIDLYAEFGR 306

Query: 430 LDEAAKIIEDLRIEPGPKVWGSLLGACRIH 460
           L +  K+ + + +      W S++ A  ++
Sbjct: 307 LRDCQKVFDRMYVRDLIS-WNSIIKAYELN 326

BLAST of Csa1G524740 vs. TAIR10
Match: AT3G12770.1 (AT3G12770.1 mitochondrial editing factor 22)

HSP 1 Score: 464.5 bits (1194), Expect = 1.0e-130
Identity = 243/623 (39.00%), Positives = 377/623 (60.51%), Query Frame = 1

Query: 29  FSFNPSTPPNSNNNHLIQSLCKQGNLKQALYLLSHESNPTQQTCELLILSAARRNSLSDA 88
           F +N      S NNH   +L    N++ A        +P   T   L+ + +  + L   
Sbjct: 85  FPWNAIIRGYSRNNHFQDALLMYSNMQLA------RVSPDSFTFPHLLKACSGLSHLQMG 144

Query: 89  LDVHQLLVDGGFDQDPFLATKLINMFSELDTVDNARKVFD--KTRKRTIYVWNALFRALA 148
             VH  +   GFD D F+   LI ++++   + +AR VF+     +RTI  W A+  A A
Sbjct: 145 RFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYA 204

Query: 149 LAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYG 208
             G   + LE++ +M  M V  D   +  L+    A  CL   L++G+ IHA +++ G  
Sbjct: 205 QNGEPMEALEIFSQMRKMDVKPD---WVALVSVLNAFTCLQD-LKQGRSIHASVVKMGLE 264

Query: 209 AHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREM 268
               ++ +L  MYA+ G V+ A  +FD+M   N++ W+AMI+ YAKNG   EA+++F EM
Sbjct: 265 IEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEM 324

Query: 269 MLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCG 328
           +    D  P+++++ S + ACA   +LEQ + ++ Y+ R      + + SALI M+A+CG
Sbjct: 325 I--NKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCG 384

Query: 329 KLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLG 388
            +E  +L+FDR   +DVV+W+++I  YGLHG  R+AI ++  M   G  P+ ++F+ +L 
Sbjct: 385 SVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTFLGLLM 444

Query: 389 ACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGP 448
           AC+H+G+V EG   F  M  +H I P  +HYAC++DLLGRA  LD+A ++I+ + ++PG 
Sbjct: 445 ACNHSGMVREGWWFFNRMA-DHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPVQPGV 504

Query: 449 KVWGSLLGACRIHCHVELAERASKRLFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLL 508
            VWG+LL AC+ H HVEL E A+++LF ++P+N G+YV L+++YA A +WD V  V+  +
Sbjct: 505 TVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEVRVRM 564

Query: 509 DSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVL 568
             + L K  G SW+EVR ++ +F   D+ +P+ E++   +  + + +K+ G+       L
Sbjct: 565 KEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANKDASL 624

Query: 569 YDLDQEEKERIVLGHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADRE 628
           +DL+ EE E  +  HSE++A+A+GLI+T +G  +RITKNLR C +CH+ TK ISK  DRE
Sbjct: 625 HDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKLVDRE 684

Query: 629 IMVRDLNRFHHFKDGVCSCGDYW 650
           I+VRD NRFHHFKDGVCSCGDYW
Sbjct: 685 IVVRDTNRFHHFKDGVCSCGDYW 694


HSP 2 Score: 230.7 bits (587), Expect = 2.5e-60
Identity = 131/387 (33.85%), Positives = 216/387 (55.81%), Query Frame = 1

Query: 75  LILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTVDNARKVFDKTRKRT 134
           LI SA  +  L     +H  L+  G     FL TKLI+  S    +  AR+VFD   +  
Sbjct: 27  LIDSATHKAQLKQ---IHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQ 86

Query: 135 IYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGK 194
           I+ WNA+ R  +      D L +Y  M +  VS D FT+ +LLKAC      +S LQ G+
Sbjct: 87  IFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSG----LSHLQMGR 146

Query: 195 EIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPV--KNVVSWSAMIACYAK 254
            +HA + R G+ A V V   L+ +YA+   +  A  VF+ +P+  + +VSW+A+++ YA+
Sbjct: 147 FVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQ 206

Query: 255 NGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHAYILRRGLDSIL 314
           NG+P EALE+F +M     D  P+ V +VSVL A      L+QG+ IHA +++ GL+   
Sbjct: 207 NGEPMEALEIFSQM--RKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEP 266

Query: 315 PVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDH 374
            ++ +L TMYA+CG++ + +++FD+M   +++LWN++IS Y  +GY R+AI +F EMI+ 
Sbjct: 267 DLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEMINK 326

Query: 375 GFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMVDLLGRANRLDE 434
              P  IS  S + AC+  G +E+ + ++E  V     +  V   + ++D+  +   + E
Sbjct: 327 DVRPDTISITSAISACAQVGSLEQARSMYE-YVGRSDYRDDVFISSALIDMFAKCGSV-E 386

Query: 435 AAKIIEDLRIEPGPKVWGSLLGACRIH 460
            A+++ D  ++    VW +++    +H
Sbjct: 387 GARLVFDRTLDRDVVVWSAMIVGYGLH 402
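
The percentages in each HSP header follow from the counts printed beside them: the identity and positives counts are each divided by the alignment length (for example, 131/387 = 33.85%). The following is a minimal Python sketch for re-deriving them, assuming the header layout shown on this page. The names `parse_hsp_stats` and `percent` are hypothetical helpers introduced for illustration, not part of BLAST or any library, and the pattern accepts either spelling of the positives label.

```python
import re

# Hypothetical pattern for the statistics line of an HSP header as printed here.
# "Pos\w*tives" tolerates both "Positives" and the misspelt "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_count": int(ident),
        "positive_count": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


line = "Identity = 131/387 (33.85%), Postives = 216/387 (55.81%), Query Frame = 1"
stats = parse_hsp_stats(line)

# Recompute both percentages from the raw counts and compare with the report.
recomputed_identity = percent(stats["identity_count"], stats["alignment_length"])
recomputed_positive = percent(stats["positive_count"], stats["alignment_length"])
print(recomputed_identity == stats["identity_pct"],
      recomputed_positive == stats["positive_pct"])
```

The same check can be applied to any other HSP header in this listing by passing its statistics line to `parse_hsp_stats`.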

BLAST of Csa1G524740 vs. TAIR10
Match: AT4G30700.1 (AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 461.5 bits (1186), Expect = 8.8e-130
Identity = 241/591 (40.78%), Positives = 355/591 (60.07%), Query Frame = 1

Query: 61  LSHESNPTQQTCELLIL--SAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELD 120
           L +ES     T  LL +  + A    L   + +H L    G     ++ T  I+++S+  
Sbjct: 211 LINESCTRLDTTTLLDILPAVAELQELRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCG 270

Query: 121 TVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLK 180
            +     +F + RK  I  +NA+       G     L L+  + + G      T   L+ 
Sbjct: 271 KIKMGSALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVP 330

Query: 181 ACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVK 240
             V+   ++ +      IH + L+  + +H  V T L  +Y++   +  A  +FDE P K
Sbjct: 331 --VSGHLMLIYA-----IHGYCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEK 390

Query: 241 NVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKL 300
           ++ SW+AMI+ Y +NG   +A+ LFREM  +     PN VT+  +L ACA   AL  GK 
Sbjct: 391 SLPSWNAMISGYTQNGLTEDAISLFREMQKSEFS--PNPVTITCILSACAQLGALSLGKW 450

Query: 301 IHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGY 360
           +H  +     +S + V +ALI MYA+CG +   + +FD M KK+ V WN++IS YGLHG 
Sbjct: 451 VHDLVRSTDFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQ 510

Query: 361 GRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYA 420
           G++A+ IF EM++ G +P+ ++F+ VL ACSH GLV+EG ++F SM+  +G +PSV+HYA
Sbjct: 511 GQEALNIFYEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYA 570

Query: 421 CMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEPT 480
           CMVD+LGRA  L  A + IE + IEPG  VW +LLGACRIH    LA   S++LF+L+P 
Sbjct: 571 CMVDILGRAGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLARTVSEKLFELDPD 630

Query: 481 NAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQ 540
           N G +VLL++I++    + +   V++    R+L K PG + IE+    + FTS D+ +PQ
Sbjct: 631 NVGYHVLLSNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQ 690

Query: 541 GEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKGD 600
            ++++  L  L  +M++ GY P+T+L L+D+++EE+E +V  HSE+LA+AFGLI T  G 
Sbjct: 691 VKEIYEKLEKLEGKMREAGYQPETELALHDVEEEERELMVKVHSERLAIAFGLIATEPGT 750

Query: 601 TIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
            IRI KNLR+C DCH+VTK ISK  +R I+VRD NRFHHFKDGVCSCGDYW
Sbjct: 751 EIRIIKNLRVCLDCHTVTKLISKITERVIVVRDANRFHHFKDGVCSCGDYW 792


HSP 2 Score: 147.9 bits (372), Expect = 2.2e-35
Identity = 101/398 (25.38%), Positives = 191/398 (47.99%), Query Frame = 1

Query: 62  SHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTVD 121
           S +  P   T    I +A+          +H   V  G D +  L + ++ M+ +   V+
Sbjct: 112 STDLKPNSSTYAFAISAASGFRDDRAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVE 171

Query: 122 NARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKACV 181
           +ARKVFD+  ++   +WN +           + ++++   +++  S  R   T LL    
Sbjct: 172 DARKVFDRMPEKDTILWNTMISGYRKNEMYVESIQVF--RDLINESCTRLDTTTLLDILP 231

Query: 182 ASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNVV 241
           A   L   L+ G +IH+   + G  +H +V+T  + +Y++ G +   SA+F E    ++V
Sbjct: 232 AVAELQE-LRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIV 291

Query: 242 SWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIHA 301
           +++AMI  Y  NG+   +L LF+E+ML+   +   S T+VS++        +     IH 
Sbjct: 292 AYNAMIHGYTSNGETELSLSLFKELMLS--GARLRSSTLVSLVPVSGHLMLIYA---IHG 351

Query: 302 YILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGRK 361
           Y L+    S   V +AL T+Y++  ++ES + +FD   +K +  WN++IS Y  +G    
Sbjct: 352 YCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTED 411

Query: 362 AIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACMV 421
           AI +F EM    FSP+ ++   +L AC+  G +  GK + + +V+    + S+     ++
Sbjct: 412 AISLFREMQKSEFSPNPVTITCILSACAQLGALSLGKWVHD-LVRSTDFESSIYVSTALI 471

Query: 422 DLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIH 460
            +  +   + EA ++  DL  +     W +++    +H
Sbjct: 472 GMYAKCGSIAEARRLF-DLMTKKNEVTWNTMISGYGLH 499


HSP 3 Score: 129.8 bits (325), Expect = 6.1e-30
Identity = 86/342 (25.15%), Positives = 162/342 (47.37%), Query Frame = 1

Query: 60  LLSHESNPTQQTCELLILSAA------RRNSLSDALDVHQLLVDGGFDQDPFLATKLINM 119
           LL   S+ T +T   LI          R  S+S     H  ++  GF  D  L TKL   
Sbjct: 2   LLRTVSSATAETTAALISKNTYLDFFKRSTSISHLAQTHAQIILHGFRNDISLLTKLTQR 61

Query: 120 FSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMM-GVSSDRFT 179
            S+L  +  AR +F   ++  ++++N L R  ++    +  L ++  +     +  +  T
Sbjct: 62  LSDLGAIYYARDIFLSVQRPDVFLFNVLMRGFSVNESPHSSLSVFAHLRKSTDLKPNSST 121

Query: 180 YTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVF 239
           Y + + A           + G+ IH   +  G  + + + + ++ MY +F  V  A  VF
Sbjct: 122 YAFAISAASGFRDD----RAGRVIHGQAVVDGCDSELLLGSNIVKMYFKFWRVEDARKVF 181

Query: 240 DEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAA 299
           D MP K+ + W+ MI+ Y KN    E++++FR++ +N   +  ++ T++ +L A A    
Sbjct: 182 DRMPEKDTILWNTMISGYRKNEMYVESIQVFRDL-INESCTRLDTTTLLDILPAVAELQE 241

Query: 300 LEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISS 359
           L  G  IH+   + G  S   V++  I++Y++CGK++ G  +F    K D+V +N++I  
Sbjct: 242 LRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGKIKMGSALFREFRKPDIVAYNAMIHG 301

Query: 360 YGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLV 395
           Y  +G    ++ +F+E++  G      + +S++    H  L+
Sbjct: 302 YTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVSGHLMLI 338

BLAST of Csa1G524740 vs. NCBI nr
Match: gi|449474033|ref|XP_004154055.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Cucumis sativus])

HSP 1 Score: 1311.6 bits (3393), Expect = 0.0e+00
Identity = 649/649 (100.00%), Positives = 649/649 (100.00%), Query Frame = 1

Query: 1   MWALRTPYSTHYPPSSPRYSTSKLSVSSFSFNPSTPPNSNNNHLIQSLCKQGNLKQALYL 60
           MWALRTPYSTHYPPSSPRYSTSKLSVSSFSFNPSTPPNSNNNHLIQSLCKQGNLKQALYL
Sbjct: 1   MWALRTPYSTHYPPSSPRYSTSKLSVSSFSFNPSTPPNSNNNHLIQSLCKQGNLKQALYL 60

Query: 61  LSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTV 120
           LSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTV
Sbjct: 61  LSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSELDTV 120

Query: 121 DNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKAC 180
           DNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKAC
Sbjct: 121 DNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYLLKAC 180

Query: 181 VASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNV 240
           VASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNV
Sbjct: 181 VASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMPVKNV 240

Query: 241 VSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIH 300
           VSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIH
Sbjct: 241 VSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQGKLIH 300

Query: 301 AYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGR 360
           AYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGR
Sbjct: 301 AYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLHGYGR 360

Query: 361 KAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACM 420
           KAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACM
Sbjct: 361 KAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEHYACM 420

Query: 421 VDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEPTNA 480
           VDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEPTNA
Sbjct: 421 VDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLEPTNA 480

Query: 481 GNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGE 540
           GNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGE
Sbjct: 481 GNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFNPQGE 540

Query: 541 QLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKGDTI 600
           QLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKGDTI
Sbjct: 541 QLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSKGDTI 600

Query: 601 RITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
           RITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW
Sbjct: 601 RITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 649

BLAST of Csa1G524740 vs. NCBI nr
Match: gi|659115217|ref|XP_008457445.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Cucumis melo])

HSP 1 Score: 1224.9 bits (3168), Expect = 0.0e+00
Identity = 607/653 (92.96%), Positives = 628/653 (96.17%), Query Frame = 1

Query: 1   MWALRTPYSTHYPPSSPRYS----TSKLSVSSFSFNPSTPPNSNNNHLIQSLCKQGNLKQ 60
           MWALRTP+ST YPPSS R+S    TSKLSV SFS NPST  NSN + LIQSLCK+GNLKQ
Sbjct: 1   MWALRTPHSTQYPPSSRRHSSAHSTSKLSVCSFSLNPSTSANSNKDQLIQSLCKEGNLKQ 60

Query: 61  ALYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGGFDQDPFLATKLINMFSE 120
           AL LLSHE NPTQQTCELLILSA+RR SLSDALDVHQ LVDGGFDQDPFLATKLINMFSE
Sbjct: 61  ALVLLSHEPNPTQQTCELLILSASRRKSLSDALDVHQHLVDGGFDQDPFLATKLINMFSE 120

Query: 121 LDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYPRMNMMGVSSDRFTYTYL 180
           LD+VDNARKVFDKTRKRTIYVWNALFRALALAG GNDVLELYPRM+MMGV  DRFTYTYL
Sbjct: 121 LDSVDNARKVFDKTRKRTIYVWNALFRALALAGHGNDVLELYPRMDMMGVPCDRFTYTYL 180

Query: 181 LKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMYARFGCVSYASAVFDEMP 240
           LKACVAS+CLVSFLQKGKEIHAHILRHGYGAHVHVMTTL+DMYARFGCVSYASAVFDEMP
Sbjct: 181 LKACVASDCLVSFLQKGKEIHAHILRHGYGAHVHVMTTLVDMYARFGCVSYASAVFDEMP 240

Query: 241 VKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVTMVSVLQACAAFAALEQG 300
           VKNVVSWSAMIACYAKNGKPYEALELFR+MMLNTHD VPNSVTMVSVLQACAAFAALEQG
Sbjct: 241 VKNVVSWSAMIACYAKNGKPYEALELFRDMMLNTHDLVPNSVTMVSVLQACAAFAALEQG 300

Query: 301 KLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMHKKDVVLWNSLISSYGLH 360
           KLIHAYILRRGLDSILPVISAL+TMYARCGKLE GQ+IFDR+HKKDV+LWNSL SSYGLH
Sbjct: 301 KLIHAYILRRGLDSILPVISALVTMYARCGKLELGQVIFDRIHKKDVILWNSLFSSYGLH 360

Query: 361 GYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKKLFESMVKEHGIQPSVEH 420
           GYGRKAI+IFEEMID+G SPS+ISF+SVLGACSH GLVEEGKKLFESMVKEHGIQPSVEH
Sbjct: 361 GYGRKAIEIFEEMIDNGISPSYISFVSVLGACSHAGLVEEGKKLFESMVKEHGIQPSVEH 420

Query: 421 YACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFKLE 480
           YACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLF+LE
Sbjct: 421 YACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIHCHVELAERASKRLFELE 480

Query: 481 PTNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540
           PTNAGNYVLLADIYAEAEMWDEVKRV+KLL+SRELQKVPGRSWIEVRRKIYSFTSVDEFN
Sbjct: 481 PTNAGNYVLLADIYAEAEMWDEVKRVRKLLNSRELQKVPGRSWIEVRRKIYSFTSVDEFN 540

Query: 541 PQGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSK 600
           PQGEQLHALLVNLSNEMKQRGY PQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSK
Sbjct: 541 PQGEQLHALLVNLSNEMKQRGYVPQTKLVLYDLDQEEKERIVLGHSEKLAVAFGLINTSK 600

Query: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 650
           GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW
Sbjct: 601 GDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFKDGVCSCGDYW 653

BLAST of Csa1G524740 vs. NCBI nr
Match: gi|645251040|ref|XP_008231496.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Prunus mume])

HSP 1 Score: 1034.2 bits (2673), Expect = 9.4e-299
Identity = 497/610 (81.48%), Positives = 553/610 (90.66%), Query Frame = 1

Query: 40  NNNHLIQSLCKQGNLKQALYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGG 99
           N N LIQSLCKQGNL++AL  L HE NP+QQT E+LILS     SLSD LDVH+ LVDGG
Sbjct: 93  NKNKLIQSLCKQGNLREALQFLPHEPNPSQQTYEILILSCTHHKSLSDGLDVHRHLVDGG 152

Query: 100 FDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYP 159
           +DQDPFLATKLI M+SELD++DNARKVFDKT KRTIY+WNALFRAL LAG G +VL+LY 
Sbjct: 153 WDQDPFLATKLIEMYSELDSIDNARKVFDKTHKRTIYMWNALFRALTLAGHGTEVLDLYR 212

Query: 160 RMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMY 219
           RMN +GVSSDRFTYTY++KACV SECL SFLQKGKEIH HILRHGYGAHVHV+TTL+DMY
Sbjct: 213 RMNTLGVSSDRFTYTYVIKACVVSECLSSFLQKGKEIHGHILRHGYGAHVHVVTTLLDMY 272

Query: 220 ARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVT 279
           ARFGCVSYAS+VFD+M ++NVVSWSAMIACYAKNG+PYEALELFREM+L  HD +PNSVT
Sbjct: 273 ARFGCVSYASSVFDQMQIRNVVSWSAMIACYAKNGRPYEALELFREMILEAHDLLPNSVT 332

Query: 280 MVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMH 339
           MVSVLQACAA  ALEQG+ +H YILRRGLDSILPV+S LITMYARCGKL+ G+ +F  M+
Sbjct: 333 MVSVLQACAALTALEQGRFLHGYILRRGLDSILPVMSTLITMYARCGKLDLGERVFSMMN 392

Query: 340 KKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKK 399
           KKDVV WNSLISSYG+HGYG+KAI+IFE+M+ HG SPSHISF+SVLGACSH GLVEEGK 
Sbjct: 393 KKDVVSWNSLISSYGVHGYGKKAIQIFEDMVYHGVSPSHISFVSVLGACSHAGLVEEGKM 452

Query: 400 LFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIH 459
           LF SMVKEHGI PSVEHYACMVDLLGRANR DEAAK+IED+RIEPG KVWG+LLG+CRIH
Sbjct: 453 LFNSMVKEHGIYPSVEHYACMVDLLGRANRFDEAAKVIEDMRIEPGAKVWGALLGSCRIH 512

Query: 460 CHVELAERASKRLFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSW 519
           C+VELAERASKRLF+LEP NAGNYVLLADIYAEA+MWDEVKRVKKLL++RELQKVPGRSW
Sbjct: 513 CNVELAERASKRLFELEPRNAGNYVLLADIYAEAKMWDEVKRVKKLLEARELQKVPGRSW 572

Query: 520 IEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVL 579
           IEV+RKIYSF SVDEFNPQ EQLHALL  LS EMK RGY PQTK+VLYDLD+EEKERIVL
Sbjct: 573 IEVKRKIYSFISVDEFNPQMEQLHALLAELSTEMKDRGYKPQTKVVLYDLDEEEKERIVL 632

Query: 580 GHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFK 639
           GHSEKLAVAFGLINT +G+TIRI+KNLRLCEDCH VTKFISKFA+REI+VRD+NRFHHF+
Sbjct: 633 GHSEKLAVAFGLINTKRGETIRISKNLRLCEDCHYVTKFISKFANREILVRDVNRFHHFR 692

Query: 640 DGVCSCGDYW 650
           DGVCSC DYW
Sbjct: 693 DGVCSCEDYW 702

BLAST of Csa1G524740 vs. NCBI nr
Match: gi|694326907|ref|XP_009354345.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Pyrus x bretschneideri])

HSP 1 Score: 1013.4 bits (2619), Expect = 1.7e-292
Identity = 488/610 (80.00%), Positives = 550/610 (90.16%), Query Frame = 1

Query: 40  NNNHLIQSLCKQGNLKQALYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGG 99
           + N LIQSLCKQGNLKQAL  L HE NP+QQT ELL+LS     SLSD LDVH+ +VDGG
Sbjct: 298 DKNKLIQSLCKQGNLKQALQFLPHEPNPSQQTYELLLLSCTHHKSLSDGLDVHRHIVDGG 357

Query: 100 FDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYP 159
           +DQDPFLATKLI M+S LD++DNAR+VFDKTRKRTIY+WNALFRAL LAG G +VL+LY 
Sbjct: 358 WDQDPFLATKLIEMYSALDSIDNAREVFDKTRKRTIYMWNALFRALTLAGHGTEVLDLYR 417

Query: 160 RMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMY 219
           +MN +G+SSDRFTYTY+LKACV SECL S LQKGKEIH HIL++GYGAHVHVMTTL+DMY
Sbjct: 418 QMNTVGISSDRFTYTYVLKACVVSECLSSLLQKGKEIHGHILKNGYGAHVHVMTTLLDMY 477

Query: 220 ARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVT 279
           ARFGCV YAS+VFD+M ++NVVSWSAMIACYAKNG+PYEALELFREM+L   D  PN VT
Sbjct: 478 ARFGCVFYASSVFDQMQIRNVVSWSAMIACYAKNGRPYEALELFREMILEAQDLFPNPVT 537

Query: 280 MVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMH 339
           MVSVLQACAA  ALEQG+ IH YILRRGLDSILPV+SALITMYARCGKL+ G+ +F  M+
Sbjct: 538 MVSVLQACAALTALEQGRFIHGYILRRGLDSILPVMSALITMYARCGKLDLGERVFSLMN 597

Query: 340 KKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKK 399
           KKDVV WNSLISSYG+HGYG+KAI+IFE+MI+HG SPS ISF+SVLGACSH GLVEEGK 
Sbjct: 598 KKDVVSWNSLISSYGIHGYGKKAIQIFEDMINHGVSPSRISFVSVLGACSHAGLVEEGKI 657

Query: 400 LFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIH 459
           LF SMVKEHG+ PSVEHYACMVDLLGRANRLDEAAK+I+++RIEPG KVWG+LLG+CRIH
Sbjct: 658 LFNSMVKEHGLYPSVEHYACMVDLLGRANRLDEAAKVIDNMRIEPGAKVWGALLGSCRIH 717

Query: 460 CHVELAERASKRLFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSW 519
           C+VELAERAS+RLF+LEP NAGNYVLLADIYAEAE+WD+VKRVKK L++RELQKVPGRSW
Sbjct: 718 CNVELAERASRRLFELEPRNAGNYVLLADIYAEAELWDDVKRVKKHLEARELQKVPGRSW 777

Query: 520 IEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVL 579
           IEV+RKIYSF SVDEFNPQ EQLHALL  LS EMK +GY PQTK+VLYDLD+EEKERIVL
Sbjct: 778 IEVKRKIYSFISVDEFNPQMEQLHALLAELSAEMKDQGYKPQTKVVLYDLDEEEKERIVL 837

Query: 580 GHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFK 639
           GHSEKLAVAFGLINT +G+TIRI+KNLRLCEDCHSVTKFISKFADREI+VRD+NRFHHF+
Sbjct: 838 GHSEKLAVAFGLINTKRGETIRISKNLRLCEDCHSVTKFISKFADREILVRDVNRFHHFR 897

Query: 640 DGVCSCGDYW 650
            GVCSCGDYW
Sbjct: 898 GGVCSCGDYW 907

BLAST of Csa1G524740 vs. NCBI nr
Match: gi|657950783|ref|XP_008349645.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic [Malus domestica])

HSP 1 Score: 1008.4 bits (2606), Expect = 5.5e-291
Identity = 484/610 (79.34%), Positives = 551/610 (90.33%), Query Frame = 1

Query: 40  NNNHLIQSLCKQGNLKQALYLLSHESNPTQQTCELLILSAARRNSLSDALDVHQLLVDGG 99
           + N LIQSLCKQGNLKQAL  L HE NP+QQT ELL+LS     SLSD LDVH+ +VDGG
Sbjct: 258 DKNKLIQSLCKQGNLKQALQFLPHEPNPSQQTYELLLLSCTHHKSLSDGLDVHRHIVDGG 317

Query: 100 FDQDPFLATKLINMFSELDTVDNARKVFDKTRKRTIYVWNALFRALALAGRGNDVLELYP 159
           +DQDPFLATKLI M+S LD++DNAR+VFDKTRKRTIY+WNALFRAL LAG G +VL+LY 
Sbjct: 318 WDQDPFLATKLIEMYSALDSIDNAREVFDKTRKRTIYMWNALFRALTLAGHGTEVLDLYR 377

Query: 160 RMNMMGVSSDRFTYTYLLKACVASECLVSFLQKGKEIHAHILRHGYGAHVHVMTTLMDMY 219
           +MN +G+SSDRFTYTY+LKACV SECL S LQKGKEIH HIL++GYGAHVHVMTTL+DMY
Sbjct: 378 QMNTVGISSDRFTYTYVLKACVVSECLSSLLQKGKEIHGHILKNGYGAHVHVMTTLLDMY 437

Query: 220 ARFGCVSYASAVFDEMPVKNVVSWSAMIACYAKNGKPYEALELFREMMLNTHDSVPNSVT 279
           ARFGCV YAS+VFD+M ++NVVSWSAMIACYAKNG+PYEALELFREM+L+  D  PN VT
Sbjct: 438 ARFGCVFYASSVFDQMQIRNVVSWSAMIACYAKNGRPYEALELFREMILDAQDLFPNPVT 497

Query: 280 MVSVLQACAAFAALEQGKLIHAYILRRGLDSILPVISALITMYARCGKLESGQLIFDRMH 339
           MVSVLQACAA  ALEQG+ IH YILRRGL+SILPV+SALITMYARCGKL+ G+ +F  M+
Sbjct: 498 MVSVLQACAALTALEQGRFIHGYILRRGLBSILPVMSALITMYARCGKLDLGERVFSLMN 557

Query: 340 KKDVVLWNSLISSYGLHGYGRKAIKIFEEMIDHGFSPSHISFISVLGACSHTGLVEEGKK 399
           KKDVV WNSLISSYG+HG G+KAI+IFE+MI+HG SPS ISF+SVLGACSH GLVEEGK 
Sbjct: 558 KKDVVSWNSLISSYGIHGNGKKAIQIFEDMINHGVSPSRISFVSVLGACSHAGLVEEGKI 617

Query: 400 LFESMVKEHGIQPSVEHYACMVDLLGRANRLDEAAKIIEDLRIEPGPKVWGSLLGACRIH 459
           LF SMVKEHG+ PSVEHYACMVDLLGRANRLDEAAK+I+++RIEPG KVWG+LLG+CRIH
Sbjct: 618 LFNSMVKEHGLYPSVEHYACMVDLLGRANRLDEAAKVIDNMRIEPGAKVWGALLGSCRIH 677

Query: 460 CHVELAERASKRLFKLEPTNAGNYVLLADIYAEAEMWDEVKRVKKLLDSRELQKVPGRSW 519
           C+VELAERAS+RLF+LEP NAGNYVLLADIYAEA++WD+VKRVKK L++RELQK+PGRSW
Sbjct: 678 CNVELAERASRRLFELEPRNAGNYVLLADIYAEAKLWDDVKRVKKHLEARELQKIPGRSW 737

Query: 520 IEVRRKIYSFTSVDEFNPQGEQLHALLVNLSNEMKQRGYTPQTKLVLYDLDQEEKERIVL 579
           IEV+RKIYSF SVDEFNPQ EQLHALL  LS EMK +GY PQTK+VLYDLD+EEKERIVL
Sbjct: 738 IEVKRKIYSFISVDEFNPQMEQLHALLAELSAEMKDQGYKPQTKVVLYDLDEEEKERIVL 797

Query: 580 GHSEKLAVAFGLINTSKGDTIRITKNLRLCEDCHSVTKFISKFADREIMVRDLNRFHHFK 639
           GHSEKLAVAFGLINT +G+TIRI+KNLRLCEDCHSVTKFISKFA+REI+VRD+NRFHHF+
Sbjct: 798 GHSEKLAVAFGLINTKRGETIRISKNLRLCEDCHSVTKFISKFANREILVRDVNRFHHFR 857

Query: 640 DGVCSCGDYW 650
           DGVCSCGDYW
Sbjct: 858 DGVCSCGDYW 867

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PP265_ARATH  3.3e-280  73.18     Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop...
PP320_ARATH  6.8e-132  42.04     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
PP348_ARATH  2.2e-130  38.75     Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN...
PP224_ARATH  1.9e-129  39.00     Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...
PP341_ARATH  1.6e-128  40.78     Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana GN...
Match Name        E-value   Identity  Description
F6GUX8_VITVI      4.7e-289  75.84     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g05500 PE=4 SV=...
A0A061GWW5_THECC  1.0e-288  75.58     Tetratricopeptide repeat (TPR)-like superfamily protein OS=Theobroma cacao GN=TC...
V4S0Q9_9ROSI      1.4e-288  76.84     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025096mg PE=4 SV=1
A0A0D2RLL5_GOSRA  6.8e-288  75.31     Uncharacterized protein OS=Gossypium raimondii GN=B456_003G128500 PE=4 SV=1
A0A067D6U3_CITSI  8.8e-288  76.53     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006076mg PE=4 SV=1
Match Name   E-value   Identity  Description
AT3G46790.1  1.9e-281  73.18     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1  3.8e-133  42.04     Pentatricopeptide repeat (PPR) superfamily protein
AT4G33990.1  1.2e-131  38.75     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G12770.1  1.0e-130  39.00     mitochondrial editing factor 22
AT4G30700.1  8.8e-130  40.78     Pentatricopeptide repeat (PPR) superfamily protein
Match Name                        E-value   Identity  Description
gi|449474033|ref|XP_004154055.1|  0.0e+00   100.00    PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic ...
gi|659115217|ref|XP_008457445.1|  0.0e+00   92.96     PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic ...
gi|645251040|ref|XP_008231496.1|  9.4e-299  81.48     PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic ...
gi|694326907|ref|XP_009354345.1|  1.7e-292  80.00     PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic ...
gi|657950783|ref|XP_008349645.1|  5.5e-291  79.34     PREDICTED: pentatricopeptide repeat-containing protein At3g46790, chloroplastic ...
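
The summary tables report E-values in scientific notation, including 0.0e+00 for the two closest hits, which presumably means the value fell below what the report can print rather than being exactly zero. The following is a minimal Python sketch of ranking the NCBI nr hits by E-value and converting each E-value to a -log10 scale while guarding against the zero case. The hit tuples are copied from the table; `neg_log10_evalue` is a hypothetical helper name introduced for illustration.

```python
import math

# (match name, E-value as printed, percent identity), copied from the NCBI nr summary.
hits = [
    ("gi|645251040|ref|XP_008231496.1|", "9.4e-299", 81.48),
    ("gi|449474033|ref|XP_004154055.1|", "0.0e+00", 100.00),
    ("gi|657950783|ref|XP_008349645.1|", "5.5e-291", 79.34),
    ("gi|659115217|ref|XP_008457445.1|", "0.0e+00", 92.96),
    ("gi|694326907|ref|XP_009354345.1|", "1.7e-292", 80.00),
]


def neg_log10_evalue(text):
    """Convert a printed E-value to -log10(E); a printed zero maps to infinity."""
    value = float(text)
    if value == 0.0:
        # log10(0) is undefined, so treat a printed zero as "off the scale".
        return math.inf
    return -math.log10(value)


# Sort by ascending E-value; ties (both 0.0e+00) are broken by higher identity.
ranked = sorted(hits, key=lambda hit: (float(hit[1]), -hit[2]))

for name, evalue, identity in ranked:
    print(name, evalue, identity, neg_log10_evalue(evalue))
```

Sorting on the parsed float rather than on the printed string matters here, because a plain string sort would not order the exponents numerically.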
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0031425 chloroplast RNA processing
biological_process GO:0000398 mRNA splicing, via spliceosome
biological_process GO:0031426 polycistronic mRNA processing
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0043687 post-translational protein modification
biological_process GO:0035196 production of miRNAs involved in gene silencing by miRNA
biological_process GO:0030422 production of siRNA involved in RNA interference
biological_process GO:0035194 posttranscriptional gene silencing by RNA
biological_process GO:0070918 production of small RNA involved in gene silencing by RNA
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0003674 molecular_function
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU085007      cucumber EST collection version 3.0  transcribed_cluster
CU127744      cucumber EST collection version 3.0  transcribed_cluster
CU149820      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa1G524740.1  Csa1G524740.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU085007      CU085007     transcribed_cluster
CU149820      CU149820     transcribed_cluster
CU127744      CU127744     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 42..61      score: 0.88
    coord: 241..267    score: 1.0E-8
    coord: 212..239    score: 0.
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 341..388    score: 5.
IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3
    coord: 134..181    score:
IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 344..377    score: 8.7E-8
    coord: 380..413    score: 2.8E-4
    coord: 241..267    score: 4.
IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 479..513    score: 5.93
    coord: 134..168    score: 8.78
    coord: 169..207    score: 5.305
    coord: 239..273    score: 11.071
    coord: 413..443    score: 7.18
    coord: 68..102     score: 6.062
    coord: 103..133    score: 6.445
    coord: 311..341    score: 6.851
    coord: 276..310    score: 6.138
    coord: 342..376    score: 12.353
    coord: 377..412    score: 8.89
    coord: 208..238    score: 6
IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 340..498    score: 1.1E-10
    coord: 238..270    score: 1.1
IPR011990  Tetratricopeptide-like helical domain  unknown  SSF48452  TPR-like
    coord: 236..496    score: 1.7
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 29..520     score:
None  No IPR available  PANTHER  PTHR24015:SF611  SUBFAMILY NOT NAMED
    coord: 29..520     score:
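
The Alignment column lists each domain hit as a coordinate range followed by a score. The following is a minimal Python sketch of pulling those pairs out of the raw text for the PROFILE PS51375 (PPR) entry and ordering the repeats along the protein. The `raw` string is copied from the listing above in its original run-together form; `parse_hits` is a hypothetical helper name introduced for illustration. Scores are kept as strings because several scores on this page appear to be cut off.

```python
import re

# Raw Alignment text for PROFILE PS51375 (PPR), as it appears on the page.
raw = """coord: 479..513
score: 5.93coord: 134..168
score: 8.78coord: 169..207
score: 5.305coord: 239..273
score: 11.071coord: 413..443
score: 7.18coord: 68..102
score: 6.062coord: 103..133
score: 6.445coord: 311..341
score: 6.851coord: 276..310
score: 6.138coord: 342..376
score: 12.353coord: 377..412
score: 8.89coord: 208..238
score: 6"""

# One hit = "coord: start..end", optional whitespace, "score: value".
PAIR = re.compile(r"coord: (\d+)\.\.(\d+)\s*score: ([0-9.Ee+-]*)")


def parse_hits(text):
    """Return (start, end, score_text) tuples sorted by start coordinate."""
    found = [(int(s), int(e), score) for s, e, score in PAIR.findall(text)]
    return sorted(found)


hits = parse_hits(raw)

# Report each repeat with its length, and flag any overlap with the previous one.
previous_end = 0
for start, end, score in hits:
    overlaps = start <= previous_end
    print(start, end, end - start + 1, score, "overlap" if overlaps else "")
    previous_end = end

print(len(hits), "PPR profile hits between residues", hits[0][0], "and", hits[-1][1])
```

Sorting by start coordinate makes it easy to see that the repeats tile most of the region between the first and last hit, which is the arrangement expected for tandem PPR motifs.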

The following gene(s) are paralogous to this gene:

None