Cla021603.1 (mRNA) Watermelon (97103) v1

Name: Cla021603
Type: mRNA
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Pentatricopeptide repeat protein 91 (AHRD V1 ***- F5CAE1_FUNHY); contains Interpro domain(s) IPR002885 Pentatricopeptide repeat
Location: Chr5: 4617203..4619728 (+)
Sequence length: 2526
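
The sequence length follows from the location if the coordinates are read as 1-based and inclusive, which is an assumption about this record's convention rather than something the page states. A minimal sketch of the arithmetic (the function name `feature_length` is hypothetical):

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record (Chr5, + strand).
length = feature_length(4617203, 4619728)

# The gene, mRNA and CDS sequences listed for this feature are identical,
# so the whole span is coding and should divide evenly into codons.
codons, remainder = divmod(length, 3)

print(length, codons, remainder)
```

Under that assumption the span matches the reported sequence length and divides evenly into codons, the last of which is the TAA stop codon at the end of the CDS.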
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGGCAAAGCTTCATCCAACCATCTCCATCGCGCCAAATTTCATCACACCCACCACTCAGAATGACTCAAAGCATACCCCTACCCTCCATTTTCAAATTGGGTTGTTCAAAAATTGCAAAACCATGGATGAACTTGGACAATTACACTGTTACGCGTTAAAGCAGGGTCTCATTCGCAAACGACCGACTTTAACTAAGCTTATTTCCATTTGCGTGGAAATGGGCACTTCAGAAAGCTTGGATTATGCTCGAAAGGCCTTCGAGCTCTTCCATGAAGACGGGGAAGCAAATGCCACTCTTTTCATGTACAATTCGTTAATCAGGGGATACTCTGCTGCAGGGCTTTGCGATGAAGCTATTTCGCTGTATGTTCAGATGATCGAGGTTGGGTTTATGCCAGACAACTTCACGTTTCCATTTTTGTTAAGTGCGTGTGCTAAGACTGGAGCGTTTGTAGAAGGGGTTCAGCTCCATGGAGCTCTTATGAAGATTGGTTTAGAGAGAGATATGTTTGTTGCTAATTCTCTGATACATTTGTATGCGGAAGGGGAAGAATTTTTGTTTGCTCGAAAGGTGTTTGATGAAATGCTTGAGAGAAATGTTGTTTCATGGACCAGCTTGATTTGTGGCTATGCTAGGACAGATGCTTCTAGTGAGGCTGTGGCTTTGTTTTTCCAAATGATCGAGGCAGGTGTTAGACCCAATTCTGTCACAATGGTGTGTGTGATATCGGCTTGTGCCAAGTTGAAAGTTCTTGAACTAGCCAAGAGAATTCATGCTTACATTGAAGAATCAGAAATGGAGATTAATACTCATATGGTGAATGCACTTGTGGATATGTACATGAAATGTGGAGAAACTGGTGCTGCAAAGCGACTATATGATGGATGTGTTGATAAGAATTTGGTTTTGTGTAACACAATCATGTCAAATTTTGCACGCCATGGTATGGCGAAGGAGGTACTTGCTGTCTTAGCAGATATGTTCAGAGTAGATCTTCAACCGGATAGAGTTTCGTTGTTATCAGCAATCTCGGCATGTGGGCAGATGAGTGACTATCTACTTGGGATGTGCTGCCACAATTATTGTCTAAGAAATGGGTATGAAGGTTGGGATAACATTTGCAATGCAATTATTGACATGTACATGAAGTGTGGAAGACAAGAAATGGCCTACAGAGTTTTTGACAGCATGTCAAATAAGACCATTGTGTCGTGGAACTCATTACTTGTTGGTTACATTAGAAACAAAGATTTTGAGCCAGCTAGGAAGATATTCAATGAGATGCCTGTAAAGGATATAGTGTCTTGGAACACAATGGTTAATGCTTTGGTTCAAGAGAGTATGTTTGGTGAAGCAATTGAACTTTTTCGAGAAATGCAATTGAAGGAAATGAAAGCAGATAGGGTGACAATGGTAGAAGTTGCATCAGCATGTGGATATCTCGGAGCTCTGGAACTTGCCAAGTGGGTATATGCCTATATTGTAAAGAACAACATCTACTGTGATATGTTGCTTGAGACAGCATTAGTTGATATGTTTGCTAGGTGCGGTGACCCTCATAATGCGATGGAAGTGTTCAACAATATGGATAGAAAAGATGTATTGGCATGGACAGCAGCCATTGGAGCAATGGCTGTGGATGGGAATGGCAAACGAGCTTTTGAACTTTACAATGAGATGCTAAGGCAAGGGGTGAAACCAGATCAAGTAGTTTTTGTAAACATATTAACAGCTTGCAGCCATGGTGGTTTCGTGGAACAAGGGCAGTACATATTTGAGTCAATGAAGGAACATGGAATCTCTCCACAGATTGTTCATTATGGTTGCATGGTTGATCTATTAGGTCGTGCAGGTAAGTTGGAAGAAGCTCTAGATATTATAAAGAGCATGCCAATGAAACCCAATGGGATTATATGGGGATCTCTATTGGCTGCATGTCGTACCCATAAAAACATCGATATGGCAACATTTGCAGCTGAAAGGTTAGCAGAAGT
GGCCCCAGAAAGGACTGGGATTCACGTGCTTCTATCAAACATATATGCTTCAGCTGAAAAGTGGGCTGATGTTGCTAATGTGAGGCTACAGTTGAAGGAAAAAGGAGTTCAGAAAATGCCTGGTTCGAGCTCGATAGAAGTTGATGGAGTTATTCATGAGTTTACCTCAGGTGACAGATCACACCCAGAAAACTGTGGCATTGACATGATGTTAAAGGAAATTACCAACAGGCTTGGGGATGTTGGTTATGTTCCTGATGTAACCAATGTTCTTCTTGACGTAAATGAGCAGGAGAAACAATATCTACTCAATCGGCATAGTGAGAAGCTGGCAATGGCTTACGGGCTTATAAGCACAGAAAAGCATTTACCAATTCGTGTTATAAAGAATCTCCGAATGTGCTCAGACTGTCATGCATTCGCCAAATACATTTCAAAAGTGTATGTTAGGGAAATAACAGTACGAGATAATAACAGGTTTCACTTCTTTCGACAAGGGTCTTGTTCATGTGGTGATTATTGGTAA

mRNA sequence

ATGGCGGCAAAGCTTCATCCAACCATCTCCATCGCGCCAAATTTCATCACACCCACCACTCAGAATGACTCAAAGCATACCCCTACCCTCCATTTTCAAATTGGGTTGTTCAAAAATTGCAAAACCATGGATGAACTTGGACAATTACACTGTTACGCGTTAAAGCAGGGTCTCATTCGCAAACGACCGACTTTAACTAAGCTTATTTCCATTTGCGTGGAAATGGGCACTTCAGAAAGCTTGGATTATGCTCGAAAGGCCTTCGAGCTCTTCCATGAAGACGGGGAAGCAAATGCCACTCTTTTCATGTACAATTCGTTAATCAGGGGATACTCTGCTGCAGGGCTTTGCGATGAAGCTATTTCGCTGTATGTTCAGATGATCGAGGTTGGGTTTATGCCAGACAACTTCACGTTTCCATTTTTGTTAAGTGCGTGTGCTAAGACTGGAGCGTTTGTAGAAGGGGTTCAGCTCCATGGAGCTCTTATGAAGATTGGTTTAGAGAGAGATATGTTTGTTGCTAATTCTCTGATACATTTGTATGCGGAAGGGGAAGAATTTTTGTTTGCTCGAAAGGTGTTTGATGAAATGCTTGAGAGAAATGTTGTTTCATGGACCAGCTTGATTTGTGGCTATGCTAGGACAGATGCTTCTAGTGAGGCTGTGGCTTTGTTTTTCCAAATGATCGAGGCAGGTGTTAGACCCAATTCTGTCACAATGGTGTGTGTGATATCGGCTTGTGCCAAGTTGAAAGTTCTTGAACTAGCCAAGAGAATTCATGCTTACATTGAAGAATCAGAAATGGAGATTAATACTCATATGGTGAATGCACTTGTGGATATGTACATGAAATGTGGAGAAACTGGTGCTGCAAAGCGACTATATGATGGATGTGTTGATAAGAATTTGGTTTTGTGTAACACAATCATGTCAAATTTTGCACGCCATGGTATGGCGAAGGAGGTACTTGCTGTCTTAGCAGATATGTTCAGAGTAGATCTTCAACCGGATAGAGTTTCGTTGTTATCAGCAATCTCGGCATGTGGGCAGATGAGTGACTATCTACTTGGGATGTGCTGCCACAATTATTGTCTAAGAAATGGGTATGAAGGTTGGGATAACATTTGCAATGCAATTATTGACATGTACATGAAGTGTGGAAGACAAGAAATGGCCTACAGAGTTTTTGACAGCATGTCAAATAAGACCATTGTGTCGTGGAACTCATTACTTGTTGGTTACATTAGAAACAAAGATTTTGAGCCAGCTAGGAAGATATTCAATGAGATGCCTGTAAAGGATATAGTGTCTTGGAACACAATGGTTAATGCTTTGGTTCAAGAGAGTATGTTTGGTGAAGCAATTGAACTTTTTCGAGAAATGCAATTGAAGGAAATGAAAGCAGATAGGGTGACAATGGTAGAAGTTGCATCAGCATGTGGATATCTCGGAGCTCTGGAACTTGCCAAGTGGGTATATGCCTATATTGTAAAGAACAACATCTACTGTGATATGTTGCTTGAGACAGCATTAGTTGATATGTTTGCTAGGTGCGGTGACCCTCATAATGCGATGGAAGTGTTCAACAATATGGATAGAAAAGATGTATTGGCATGGACAGCAGCCATTGGAGCAATGGCTGTGGATGGGAATGGCAAACGAGCTTTTGAACTTTACAATGAGATGCTAAGGCAAGGGGTGAAACCAGATCAAGTAGTTTTTGTAAACATATTAACAGCTTGCAGCCATGGTGGTTTCGTGGAACAAGGGCAGTACATATTTGAGTCAATGAAGGAACATGGAATCTCTCCACAGATTGTTCATTATGGTTGCATGGTTGATCTATTAGGTCGTGCAGGTAAGTTGGAAGAAGCTCTAGATATTATAAAGAGCATGCCAATGAAACCCAATGGGATTATATGGGGATCTCTATTGGCTGCATGTCGTACCCATAAAAACATCGATATGGCAACATTTGCAGCTGAAAGGTTAGCAGAAGT
GGCCCCAGAAAGGACTGGGATTCACGTGCTTCTATCAAACATATATGCTTCAGCTGAAAAGTGGGCTGATGTTGCTAATGTGAGGCTACAGTTGAAGGAAAAAGGAGTTCAGAAAATGCCTGGTTCGAGCTCGATAGAAGTTGATGGAGTTATTCATGAGTTTACCTCAGGTGACAGATCACACCCAGAAAACTGTGGCATTGACATGATGTTAAAGGAAATTACCAACAGGCTTGGGGATGTTGGTTATGTTCCTGATGTAACCAATGTTCTTCTTGACGTAAATGAGCAGGAGAAACAATATCTACTCAATCGGCATAGTGAGAAGCTGGCAATGGCTTACGGGCTTATAAGCACAGAAAAGCATTTACCAATTCGTGTTATAAAGAATCTCCGAATGTGCTCAGACTGTCATGCATTCGCCAAATACATTTCAAAAGTGTATGTTAGGGAAATAACAGTACGAGATAATAACAGGTTTCACTTCTTTCGACAAGGGTCTTGTTCATGTGGTGATTATTGGTAA

Coding sequence (CDS)

ATGGCGGCAAAGCTTCATCCAACCATCTCCATCGCGCCAAATTTCATCACACCCACCACTCAGAATGACTCAAAGCATACCCCTACCCTCCATTTTCAAATTGGGTTGTTCAAAAATTGCAAAACCATGGATGAACTTGGACAATTACACTGTTACGCGTTAAAGCAGGGTCTCATTCGCAAACGACCGACTTTAACTAAGCTTATTTCCATTTGCGTGGAAATGGGCACTTCAGAAAGCTTGGATTATGCTCGAAAGGCCTTCGAGCTCTTCCATGAAGACGGGGAAGCAAATGCCACTCTTTTCATGTACAATTCGTTAATCAGGGGATACTCTGCTGCAGGGCTTTGCGATGAAGCTATTTCGCTGTATGTTCAGATGATCGAGGTTGGGTTTATGCCAGACAACTTCACGTTTCCATTTTTGTTAAGTGCGTGTGCTAAGACTGGAGCGTTTGTAGAAGGGGTTCAGCTCCATGGAGCTCTTATGAAGATTGGTTTAGAGAGAGATATGTTTGTTGCTAATTCTCTGATACATTTGTATGCGGAAGGGGAAGAATTTTTGTTTGCTCGAAAGGTGTTTGATGAAATGCTTGAGAGAAATGTTGTTTCATGGACCAGCTTGATTTGTGGCTATGCTAGGACAGATGCTTCTAGTGAGGCTGTGGCTTTGTTTTTCCAAATGATCGAGGCAGGTGTTAGACCCAATTCTGTCACAATGGTGTGTGTGATATCGGCTTGTGCCAAGTTGAAAGTTCTTGAACTAGCCAAGAGAATTCATGCTTACATTGAAGAATCAGAAATGGAGATTAATACTCATATGGTGAATGCACTTGTGGATATGTACATGAAATGTGGAGAAACTGGTGCTGCAAAGCGACTATATGATGGATGTGTTGATAAGAATTTGGTTTTGTGTAACACAATCATGTCAAATTTTGCACGCCATGGTATGGCGAAGGAGGTACTTGCTGTCTTAGCAGATATGTTCAGAGTAGATCTTCAACCGGATAGAGTTTCGTTGTTATCAGCAATCTCGGCATGTGGGCAGATGAGTGACTATCTACTTGGGATGTGCTGCCACAATTATTGTCTAAGAAATGGGTATGAAGGTTGGGATAACATTTGCAATGCAATTATTGACATGTACATGAAGTGTGGAAGACAAGAAATGGCCTACAGAGTTTTTGACAGCATGTCAAATAAGACCATTGTGTCGTGGAACTCATTACTTGTTGGTTACATTAGAAACAAAGATTTTGAGCCAGCTAGGAAGATATTCAATGAGATGCCTGTAAAGGATATAGTGTCTTGGAACACAATGGTTAATGCTTTGGTTCAAGAGAGTATGTTTGGTGAAGCAATTGAACTTTTTCGAGAAATGCAATTGAAGGAAATGAAAGCAGATAGGGTGACAATGGTAGAAGTTGCATCAGCATGTGGATATCTCGGAGCTCTGGAACTTGCCAAGTGGGTATATGCCTATATTGTAAAGAACAACATCTACTGTGATATGTTGCTTGAGACAGCATTAGTTGATATGTTTGCTAGGTGCGGTGACCCTCATAATGCGATGGAAGTGTTCAACAATATGGATAGAAAAGATGTATTGGCATGGACAGCAGCCATTGGAGCAATGGCTGTGGATGGGAATGGCAAACGAGCTTTTGAACTTTACAATGAGATGCTAAGGCAAGGGGTGAAACCAGATCAAGTAGTTTTTGTAAACATATTAACAGCTTGCAGCCATGGTGGTTTCGTGGAACAAGGGCAGTACATATTTGAGTCAATGAAGGAACATGGAATCTCTCCACAGATTGTTCATTATGGTTGCATGGTTGATCTATTAGGTCGTGCAGGTAAGTTGGAAGAAGCTCTAGATATTATAAAGAGCATGCCAATGAAACCCAATGGGATTATATGGGGATCTCTATTGGCTGCATGTCGTACCCATAAAAACATCGATATGGCAACATTTGCAGCTGAAAGGTTAGCAGAAGT
GGCCCCAGAAAGGACTGGGATTCACGTGCTTCTATCAAACATATATGCTTCAGCTGAAAAGTGGGCTGATGTTGCTAATGTGAGGCTACAGTTGAAGGAAAAAGGAGTTCAGAAAATGCCTGGTTCGAGCTCGATAGAAGTTGATGGAGTTATTCATGAGTTTACCTCAGGTGACAGATCACACCCAGAAAACTGTGGCATTGACATGATGTTAAAGGAAATTACCAACAGGCTTGGGGATGTTGGTTATGTTCCTGATGTAACCAATGTTCTTCTTGACGTAAATGAGCAGGAGAAACAATATCTACTCAATCGGCATAGTGAGAAGCTGGCAATGGCTTACGGGCTTATAAGCACAGAAAAGCATTTACCAATTCGTGTTATAAAGAATCTCCGAATGTGCTCAGACTGTCATGCATTCGCCAAATACATTTCAAAAGTGTATGTTAGGGAAATAACAGTACGAGATAATAACAGGTTTCACTTCTTTCGACAAGGGTCTTGTTCATGTGGTGATTATTGGTAA

Protein sequence

MAAKLHPTISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIRKRPTLTKLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCDEAISLYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHLYAEGEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCCHNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNKDFEPARKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVASACGYLGALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLAWTAAIGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMKEHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMATFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVIHEFTSGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDYW
BLAST of Cla021603 vs. Swiss-Prot
Match: PP249_ARATH (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN=PCMP-H56 PE=2 SV=1)

HSP 1 Score: 988.0 bits (2553), Expect = 6.2e-287
Identity = 484/840 (57.62%), Positives = 627/840 (74.64%), Query Frame = 1

Query: 5   LHPTISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIRKRPT 64
           L P +        P+  N SK T      +   KNCKT+DEL   H    KQGL     T
Sbjct: 10  LSPMVLATTTTTKPSLLNQSKCTKATPSSL---KNCKTIDELKMFHRSLTKQGLDNDVST 69

Query: 65  LTKLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCDEAISLY 124
           +TKL++   E+GT ESL +A++ FE    + E+  T FMYNSLIRGY+++GLC+EAI L+
Sbjct: 70  ITKLVARSCELGTRESLSFAKEVFE----NSESYGTCFMYNSLIRGYASSGLCNEAILLF 129

Query: 125 VQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHLYAEG 184
           ++M+  G  PD +TFPF LSACAK+ A   G+Q+HG ++K+G  +D+FV NSL+H YAE 
Sbjct: 130 LRMMNSGISPDKYTFPFGLSACAKSRAKGNGIQIHGLIVKMGYAKDLFVQNSLVHFYAEC 189

Query: 185 EEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMI-EAGVRPNSVTMVCV 244
            E   ARKVFDEM ERNVVSWTS+ICGYAR D + +AV LFF+M+ +  V PNSVTMVCV
Sbjct: 190 GELDSARKVFDEMSERNVVSWTSMICGYARRDFAKDAVDLFFRMVRDEEVTPNSVTMVCV 249

Query: 245 ISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVDKNL 304
           ISACAKL+ LE  ++++A+I  S +E+N  MV+ALVDMYMKC     AKRL+D     NL
Sbjct: 250 ISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALVDMYMKCNAIDVAKRLFDEYGASNL 309

Query: 305 VLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCCHNY 364
            LCN + SN+ R G+ +E L V   M    ++PDR+S+LSAIS+C Q+ + L G  CH Y
Sbjct: 310 DLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDRISMLSAISSCSQLRNILWGKSCHGY 369

Query: 365 CLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNKDFEPA 424
            LRNG+E WDNICNA+IDMYMKC RQ+ A+R+FD MSNKT+V+WNS++ GY+ N + + A
Sbjct: 370 VLRNGFESWDNICNALIDMYMKCHRQDTAFRIFDRMSNKTVVTWNSIVAGYVENGEVDAA 429

Query: 425 RKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKE-MKADRVTMVEVASACGY 484
            + F  MP K+IVSWNT+++ LVQ S+F EAIE+F  MQ +E + AD VTM+ +ASACG+
Sbjct: 430 WETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVFCSMQSQEGVNADGVTMMSIASACGH 489

Query: 485 LGALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLAWTAA 544
           LGAL+LAKW+Y YI KN I  D+ L T LVDMF+RCGDP +AM +FN++  +DV AWTAA
Sbjct: 490 LGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMFSRCGDPESAMSIFNSLTNRDVSAWTAA 549

Query: 545 IGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESM-KEHG 604
           IGAMA+ GN +RA EL+++M+ QG+KPD V FV  LTACSHGG V+QG+ IF SM K HG
Sbjct: 550 IGAMAMAGNAERAIELFDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHG 609

Query: 605 ISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMATFAA 664
           +SP+ VHYGCMVDLLGRAG LEEA+ +I+ MPM+PN +IW SLLAACR   N++MA +AA
Sbjct: 610 VSPEDVHYGCMVDLLGRAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAA 669

Query: 665 ERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVIHEF 724
           E++  +APERTG +VLLSN+YASA +W D+A VRL +KEKG++K PG+SSI++ G  HEF
Sbjct: 670 EKIQVLAPERTGSYVLLSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEF 729

Query: 725 TSGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAY 784
           TSGD SHPE   I+ ML E++ R   +G+VPD++NVL+DV+E+EK ++L+RHSEKLAMAY
Sbjct: 730 TSGDESHPEMPNIEAMLDEVSQRASHLGHVPDLSNVLMDVDEKEKIFMLSRHSEKLAMAY 789

Query: 785 GLISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDYW 842
           GLIS+ K   IR++KNLR+CSDCH+FAK+ SKVY REI +RDNNRFH+ RQG CSCGD+W
Sbjct: 790 GLISSNKGTTIRIVKNLRVCSDCHSFAKFASKVYNREIILRDNNRFHYIRQGKCSCGDFW 842
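
The identity and positives percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below parses such a line and recomputes both values. The regular expression and the name `parse_hsp_stats` are illustrative assumptions, not part of any BLAST tool, and the pattern also accepts the "Postives" spelling that appears in some renderings of this output.

```python
import re

# Matches e.g. "Identity = 484/840 (57.62%), Positives = 627/840 (74.64%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract match counts from an HSP summary line and recompute percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, ident_pct, pos, pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(length),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives_pct": round(100 * int(pos) / int(pos_length), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 484/840 (57.62%), Positives = 627/840 (74.64%), Query Frame = 1"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported fields is a quick consistency check when these summary lines are scraped from a page rather than read from the original BLAST output.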

BLAST of Cla021603 vs. Swiss-Prot
Match: PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 543.1 bits (1398), Expect = 5.3e-153
Identity = 275/691 (39.80%), Positives = 425/691 (61.51%), Query Frame = 1

Query: 157 QLHGALMKIGLERDMFVANSLIHLYAEGE--EFLFARKVFDEMLERNVVSWTSLICGYAR 216
           Q HG +++ G   D + A+ L  + A        +ARKVFDE+ + N  +W +LI  YA 
Sbjct: 48  QTHGHMIRTGTFSDPYSASKLFAMAALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYAS 107

Query: 217 TDASSEAVALFFQMI-EAGVRPNSVTMVCVISACAKLKVLELAKRIHAYIEESEMEINTH 276
                 ++  F  M+ E+   PN  T   +I A A++  L L + +H    +S +  +  
Sbjct: 108 GPDPVLSIWAFLDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVF 167

Query: 277 MVNALVDMYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMAKEVLAVLADMFRVD 336
           + N+L+  Y  CG+  +A +++    +K++V  N++++ F + G   + L +   M   D
Sbjct: 168 VANSLIHCYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESED 227

Query: 337 LQPDRVSLLSAISACGQMSDYLLGMCCHNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAY 396
           ++   V+++  +SAC ++ +   G    +Y   N       + NA++DMY KCG  E A 
Sbjct: 228 VKASHVTMVGVLSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAK 287

Query: 397 RVFDSMSNKTIVSWNSLLVGYIRNKDFEPARKIFNEMPVKDIVSWNTMVNALVQESMFGE 456
           R+FD+M  K  V+W ++L GY  ++D+E AR++ N MP KDIV+WN +++A  Q     E
Sbjct: 288 RLFDAMEEKDNVTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNE 347

Query: 457 AIELFREMQL-KEMKADRVTMVEVASACGYLGALELAKWVYAYIVKNNIYCDMLLETALV 516
           A+ +F E+QL K MK +++T+V   SAC  +GALEL +W+++YI K+ I  +  + +AL+
Sbjct: 348 ALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALI 407

Query: 517 DMFARCGDPHNAMEVFNNMDRKDVLAWTAAIGAMAVDGNGKRAFELYNEMLRQGVKPDQV 576
            M+++CGD   + EVFN+++++DV  W+A IG +A+ G G  A +++ +M    VKP+ V
Sbjct: 408 HMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGV 467

Query: 577 VFVNILTACSHGGFVEQGQYIFESMKE-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKS 636
            F N+  ACSH G V++ + +F  M+  +GI P+  HY C+VD+LGR+G LE+A+  I++
Sbjct: 468 TFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEA 527

Query: 637 MPMKPNGIIWGSLLAACRTHKNIDMATFAAERLAEVAPERTGIHVLLSNIYASAEKWADV 696
           MP+ P+  +WG+LL AC+ H N+++A  A  RL E+ P   G HVLLSNIYA   KW +V
Sbjct: 528 MPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENV 587

Query: 697 ANVRLQLKEKGVQKMPGSSSIEVDGVIHEFTSGDRSHPENCGIDMMLKEITNRLGDVGYV 756
           + +R  ++  G++K PG SSIE+DG+IHEF SGD +HP +  +   L E+  +L   GY 
Sbjct: 588 SELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYE 647

Query: 757 PDVTNVLLDVNEQE-KQYLLNRHSEKLAMAYGLISTEKHLPIRVIKNLRMCSDCHAFAKY 816
           P+++ VL  + E+E K+  LN HSEKLA+ YGLISTE    IRVIKNLR+C DCH+ AK 
Sbjct: 648 PEISQVLQIIEEEEMKEQSLNLHSEKLAICYGLISTEAPKVIRVIKNLRVCGDCHSVAKL 707

Query: 817 ISKVYVREITVRDNNRFHFFRQGSCSCGDYW 842
           IS++Y REI VRD  RFH FR G CSC D+W
Sbjct: 708 ISQLYDREIIVRDRYRFHHFRNGQCSCNDFW 738


HSP 2 Score: 223.4 bits (568), Expect = 9.3e-57
Identity = 166/555 (29.91%), Positives = 278/555 (50.09%), Query Frame = 1

Query: 6   HPTISIAPNFITPTTQND-SKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIRKRPT 65
           HP  S  PN   PTT N+ S+H       I L + C ++ +L Q H + ++ G      +
Sbjct: 15  HPNFS-NPN--QPTTNNERSRH-------ISLIERCVSLRQLKQTHGHMIRTGTFSDPYS 74

Query: 66  LTKLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCDEAISLY 125
            +KL ++   + +  SL+YARK F+   E  + N+  F +N+LIR Y++      +I  +
Sbjct: 75  ASKLFAMAA-LSSFASLEYARKVFD---EIPKPNS--FAWNTLIRAYASGPDPVLSIWAF 134

Query: 126 VQMI-EVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHLYAE 185
           + M+ E    P+ +TFPFL+ A A+  +   G  LHG  +K  +  D+FVANSLIH Y  
Sbjct: 135 LDMVSESQCYPNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFS 194

Query: 186 GEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSVTMVCV 245
             +   A KVF  + E++VVSW S+I G+ +  +  +A+ LF +M    V+ + VTMV V
Sbjct: 195 CGDLDSACKVFTTIKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGV 254

Query: 246 ISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVDKNL 305
           +SACAK++ LE  +++ +YIEE+ + +N  + NA++DMY KCG    AKRL+D   +K+ 
Sbjct: 255 LSACAKIRNLEFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDN 314

Query: 306 VLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCCHNY 365
           V   T++  +A     +    VL  M + D+     +L+SA    G+ ++ L+    H  
Sbjct: 315 VTWTTMLDGYAISEDYEAAREVLNSMPQKDIVAWN-ALISAYEQNGKPNEALI--VFHEL 374

Query: 366 CLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWN-----SLLVGYIRNK 425
            L+   +       + +    + G  E+  R   S   K  +  N     +L+  Y +  
Sbjct: 375 QLQKNMKLNQITLVSTLSACAQVGALELG-RWIHSYIKKHGIRMNFHVTSALIHMYSKCG 434

Query: 426 DFEPARKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVAS 485
           D E +R++FN +  +D+  W+ M+  L       EA+++F +MQ   +K + VT   V  
Sbjct: 435 DLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFC 494

Query: 486 ACGYLGALELAKWVYAYIVKN-NIYCDMLLETALVDMFARCGDPHNAMEVFNNMD-RKDV 545
           AC + G ++ A+ ++  +  N  I  +      +VD+  R G    A++    M      
Sbjct: 495 ACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPST 549

Query: 546 LAWTAAIGAMAVDGN 552
             W A +GA  +  N
Sbjct: 555 SVWGALLGACKIHAN 549

BLAST of Cla021603 vs. Swiss-Prot
Match: PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 536.2 bits (1380), Expect = 6.5e-151
Identity = 287/807 (35.56%), Positives = 456/807 (56.51%), Query Frame = 1

Query: 36  LFKNCKTMDELGQLHCYALKQGLIRKRPTLTKLISICVEMGTSESLDYARKAFELFHEDG 95
           L + C ++ EL Q+     K GL ++    TKL+S+    G   S+D A + FE    D 
Sbjct: 43  LLERCSSLKELRQILPLVFKNGLYQEHFFQTKLVSLFCRYG---SVDEAARVFEPI--DS 102

Query: 96  EANATLFMYNSLIRGYSAAGLCDEAISLYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEG 155
           + N    +Y+++++G++     D+A+  +V+M      P  + F +LL  C        G
Sbjct: 103 KLNV---LYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVG 162

Query: 156 VQLHGALMKIGLERDMFVANSLIHLYAEGEEFLFARKVFDEMLERNVVSWTSLICGYART 215
            ++HG L+K G   D+F    L ++YA+  +   ARKVFD M ER++VSW +++ GY++ 
Sbjct: 163 KEIHGLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQN 222

Query: 216 DASSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKVLELAKRIHAYIEESEMEINTHMV 275
             +  A+ +   M E  ++P+ +T+V V+ A + L+++ + K IH Y   S  +   ++ 
Sbjct: 223 GMARMALEMVKSMCEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNIS 282

Query: 276 NALVDMYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQ 335
            ALVDMY KCG    A++L+DG +++N+V  N+++  + ++   KE + +   M    ++
Sbjct: 283 TALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVK 342

Query: 336 PDRVSLLSAISACGQMSDYLLGMCCHNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRV 395
           P  VS++ A+ AC  + D   G   H   +  G +   ++ N++I MY KC   + A  +
Sbjct: 343 PTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASM 402

Query: 396 FDSMSNKTIVSWNSLLVGYIRNKDFEPARKIFNEMPVKDIVSWNTMVNALVQESMFGEAI 455
           F  + ++T+VSWN++++G+ +N      R I                          +A+
Sbjct: 403 FGKLQSRTLVSWNAMILGFAQN-----GRPI--------------------------DAL 462

Query: 456 ELFREMQLKEMKADRVTMVEVASACGYLGALELAKWVYAYIVKNNIYCDMLLETALVDMF 515
             F +M+ + +K D  T V V +A   L     AKW++  ++++ +  ++ + TALVDM+
Sbjct: 463 NYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMY 522

Query: 516 ARCGDPHNAMEVFNNMDRKDVLAWTAAIGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFV 575
           A+CG    A  +F+ M  + V  W A I      G GK A EL+ EM +  +KP+ V F+
Sbjct: 523 AKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFL 582

Query: 576 NILTACSHGGFVEQGQYIFESMKE-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPM 635
           ++++ACSH G VE G   F  MKE + I   + HYG MVDLLGRAG+L EA D I  MP+
Sbjct: 583 SVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPV 642

Query: 636 KPNGIIWGSLLAACRTHKNIDMATFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANV 695
           KP   ++G++L AC+ HKN++ A  AAERL E+ P+  G HVLL+NIY +A  W  V  V
Sbjct: 643 KPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQV 702

Query: 696 RLQLKEKGVQKMPGSSSIEVDGVIHEFTSGDRSHPENCGIDMMLKEITNRLGDVGYVPDV 755
           R+ +  +G++K PG S +E+   +H F SG  +HP++  I   L+++   + + GYVPD 
Sbjct: 703 RVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPD- 762

Query: 756 TNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHLPIRVIKNLRMCSDCHAFAKYISKV 815
           TN++L V    K+ LL+ HSEKLA+++GL++T     I V KNLR+C+DCH   KYIS V
Sbjct: 763 TNLVLGVENDVKEQLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLV 809

Query: 816 YVREITVRDNNRFHFFRQGSCSCGDYW 842
             REI VRD  RFH F+ G+CSCGDYW
Sbjct: 823 TGREIVVRDMQRFHHFKNGACSCGDYW 809

BLAST of Cla021603 vs. Swiss-Prot
Match: PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 523.1 bits (1346), Expect = 5.7e-147
Identity = 301/825 (36.48%), Positives = 460/825 (55.76%), Query Frame = 1

Query: 22  NDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIRKRPTLTKLISICVEMGTSESL 81
           N+SK    +H    LF+ C  +     LH   +    I+      KL+++   +G   ++
Sbjct: 49  NESKEIDDVHT---LFRYCTNLQSAKCLHARLVVSKQIQNVCISAKLVNLYCYLG---NV 108

Query: 82  DYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCDEAISLY-VQMIEVGFMPDNFTFP 141
             AR  F     D   N  ++ +N +I GY  AG   E I  + + M+  G  PD  TFP
Sbjct: 109 ALARHTF-----DHIQNRDVYAWNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFP 168

Query: 142 FLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHLYAEGEEFLFARKVFDEMLER 201
            +L AC      ++G ++H   +K G   D++VA SLIHLY+  +    AR +FDEM  R
Sbjct: 169 SVLKACRTV---IDGNKIHCLALKFGFMWDVYVAASLIHLYSRYKAVGNARILFDEMPVR 228

Query: 202 NVVSWTSLICGYARTDASSEAVALFFQMIEAGVRP-NSVTMVCVISACAKLKVLELAKRI 261
           ++ SW ++I GY ++  + EA+ L       G+R  +SVT+V ++SAC +         I
Sbjct: 229 DMGSWNAMISGYCQSGNAKEALTL-----SNGLRAMDSVTVVSLLSACTEAGDFNRGVTI 288

Query: 262 HAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMA 321
           H+Y  +  +E    + N L+D+Y + G     ++++D    ++L+  N+I+  +  +   
Sbjct: 289 HSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYELNEQP 348

Query: 322 KEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCCHNYCLRNGYEGWD-NICNA 381
              +++  +M    +QPD ++L+S  S   Q+ D         + LR G+   D  I NA
Sbjct: 349 LRAISLFQEMRLSRIQPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDITIGNA 408

Query: 382 IIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNKDFEPARKIFNEMPVKDIVSW 441
           ++ MY K G       + DS                        AR +FN +P  D++SW
Sbjct: 409 VVVMYAKLG-------LVDS------------------------ARAVFNWLPNTDVISW 468

Query: 442 NTMVNALVQESMFGEAIELFREMQLK-EMKADRVTMVEVASACGYLGALELAKWVYAYIV 501
           NT+++   Q     EAIE++  M+ + E+ A++ T V V  AC   GAL     ++  ++
Sbjct: 469 NTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQGMKLHGRLL 528

Query: 502 KNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLAWTAAIGAMAVDGNGKRAFE 561
           KN +Y D+ + T+L DM+ +CG   +A+ +F  + R + + W   I      G+G++A  
Sbjct: 529 KNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPRVNSVPWNTLIACHGFHGHGEKAVM 588

Query: 562 LYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMK-EHGISPQIVHYGCMVDLL 621
           L+ EML +GVKPD + FV +L+ACSH G V++GQ+ FE M+ ++GI+P + HYGCMVD+ 
Sbjct: 589 LFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLKHYGCMVDMY 648

Query: 622 GRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMATFAAERLAEVAPERTGIHV 681
           GRAG+LE AL  IKSM ++P+  IWG+LL+ACR H N+D+   A+E L EV PE  G HV
Sbjct: 649 GRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEVEPEHVGYHV 708

Query: 682 LLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVIHEFTSGDRSHPENCGIDM 741
           LLSN+YASA KW  V  +R     KG++K PG SS+EVD  +  F +G+++HP    +  
Sbjct: 709 LLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQTHPMYEEMYR 768

Query: 742 MLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHLPIRVIK 801
            L  +  +L  +GYVPD   VL DV + EK+++L  HSE+LA+A+ LI+T     IR+ K
Sbjct: 769 ELTALQAKLKMIGYVPDHRFVLQDVEDDEKEHILMSHSERLAIAFALIATPAKTTIRIFK 823

Query: 802 NLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDYW 842
           NLR+C DCH+  K+ISK+  REI VRD+NRFH F+ G CSCGDYW
Sbjct: 829 NLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKNGVCSCGDYW 823

BLAST of Cla021603 vs. Swiss-Prot
Match: PP224_ARATH (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 486.9 bits (1252), Expect = 4.5e-136
Identity = 246/687 (35.81%), Positives = 401/687 (58.37%), Query Frame = 1

Query: 157 QLHGALMKIGLERDMFVANSLIHLYAEGEEFLFARKVFDEMLERNVVSWTSLICGYARTD 216
           Q+H  L+ +GL+   F+   LIH  +   +  FAR+VFD++    +  W ++I GY+R +
Sbjct: 39  QIHARLLVLGLQFSGFLITKLIHASSSFGDITFARQVFDDLPRPQIFPWNAIIRGYSRNN 98

Query: 217 ASSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKVLELAKRIHAYIEESEMEINTHMVN 276
              +A+ ++  M  A V P+S T   ++ AC+ L  L++ + +HA +     + +  + N
Sbjct: 99  HFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSHLQMGRFVHAQVFRLGFDADVFVQN 158

Query: 277 ALVDMYMKCGETGAAKRLYDGCV--DKNLVLCNTIMSNFARHGMAKEVLAVLADMFRVDL 336
            L+ +Y KC   G+A+ +++G    ++ +V    I+S +A++G   E L + + M ++D+
Sbjct: 159 GLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDV 218

Query: 337 QPDRVSLLSAISACGQMSDYLLGMCCHNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAYR 396
           +PD V+L+S ++A   + D   G   H   ++ G E   ++  ++  MY KCG+   A  
Sbjct: 219 KPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKI 278

Query: 397 VFDSMSNKTIVSWNSLLVGYIRNKDFEPARKIFNEMPVKDIVSWNTMVNALVQESMFGEA 456
           +FD M +  ++ WN+++ GY +N                                   EA
Sbjct: 279 LFDKMKSPNLILWNAMISGYAKN-------------------------------GYAREA 338

Query: 457 IELFREMQLKEMKADRVTMVEVASACGYLGALELAKWVYAYIVKNNIYCDMLLETALVDM 516
           I++F EM  K+++ D +++    SAC  +G+LE A+ +Y Y+ +++   D+ + +AL+DM
Sbjct: 339 IDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDM 398

Query: 517 FARCGDPHNAMEVFNNMDRKDVLAWTAAIGAMAVDGNGKRAFELYNEMLRQGVKPDQVVF 576
           FA+CG    A  VF+    +DV+ W+A I    + G  + A  LY  M R GV P+ V F
Sbjct: 399 FAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTF 458

Query: 577 VNILTACSHGGFVEQGQYIFESMKEHGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPM 636
           + +L AC+H G V +G + F  M +H I+PQ  HY C++DLLGRAG L++A ++IK MP+
Sbjct: 459 LGLLMACNHSGMVREGWWFFNRMADHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCMPV 518

Query: 637 KPNGIIWGSLLAACRTHKNIDMATFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANV 696
           +P   +WG+LL+AC+ H+++++  +AA++L  + P  TG +V LSN+YA+A  W  VA V
Sbjct: 519 QPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNLYAAARLWDRVAEV 578

Query: 697 RLQLKEKGVQKMPGSSSIEVDGVIHEFTSGDRSHPENCGIDMMLKEITNRLGDVGYVPDV 756
           R+++KEKG+ K  G S +EV G +  F  GD+SHP    I+  ++ I +RL + G+V + 
Sbjct: 579 RVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWIESRLKEGGFVANK 638

Query: 757 TNVLLDVNEQEKQYLLNRHSEKLAMAYGLISTEKHLPIRVIKNLRMCSDCHAFAKYISKV 816
              L D+N++E +  L  HSE++A+AYGLIST +  P+R+ KNLR C +CHA  K ISK+
Sbjct: 639 DASLHDLNDEEAEETLCSHSERIAIAYGLISTPQGTPLRITKNLRACVNCHAATKLISKL 694

Query: 817 YVREITVRDNNRFHFFRQGSCSCGDYW 842
             REI VRD NRFH F+ G CSCGDYW
Sbjct: 699 VDREIVVRDTNRFHHFKDGVCSCGDYW 694


HSP 2 Score: 222.2 bits (565), Expect = 2.1e-56
Identity = 131/385 (34.03%), Positives = 205/385 (53.25%), Query Frame = 1

Query: 32  FQIGLFKNCKTMDELGQLHCYALKQGLIRKRPTLTKLISICVEMGTSESLDYARKAFELF 91
           F   L  +     +L Q+H   L  GL      +TKLI      G    + +AR+ F   
Sbjct: 23  FYASLIDSATHKAQLKQIHARLLVLGLQFSGFLITKLIHASSSFG---DITFARQVF--- 82

Query: 92  HEDGEANATLFMYNSLIRGYSAAGLCDEAISLYVQMIEVGFMPDNFTFPFLLSACAKTGA 151
             D      +F +N++IRGYS      +A+ +Y  M      PD+FTFP LL AC+    
Sbjct: 83  --DDLPRPQIFPWNAIIRGYSRNNHFQDALLMYSNMQLARVSPDSFTFPHLLKACSGLSH 142

Query: 152 FVEGVQLHGALMKIGLERDMFVANSLIHLYAEGEEFLFARKVFD--EMLERNVVSWTSLI 211
              G  +H  + ++G + D+FV N LI LYA+      AR VF+   + ER +VSWT+++
Sbjct: 143 LQMGRFVHAQVFRLGFDADVFVQNGLIALYAKCRRLGSARTVFEGLPLPERTIVSWTAIV 202

Query: 212 CGYARTDASSEAVALFFQMIEAGVRPNSVTMVCVISACAKLKVLELAKRIHAYIEESEME 271
             YA+     EA+ +F QM +  V+P+ V +V V++A   L+ L+  + IHA + +  +E
Sbjct: 203 SAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIHASVVKMGLE 262

Query: 272 INTHMVNALVDMYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMAKEVLAVLADM 331
           I   ++ +L  MY KCG+   AK L+D     NL+L N ++S +A++G A+E + +  +M
Sbjct: 263 IEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAREAIDMFHEM 322

Query: 332 FRVDLQPDRVSLLSAISACGQMSDYLLGMCCHNYCLRNGYEGWDNICNAIIDMYMKCGRQ 391
              D++PD +S+ SAISAC Q+         + Y  R+ Y     I +A+IDM+ KCG  
Sbjct: 323 INKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALIDMFAKCGSV 382

Query: 392 EMAYRVFDSMSNKTIVSWNSLLVGY 415
           E A  VFD   ++ +V W++++VGY
Sbjct: 383 EGARLVFDRTLDRDVVVWSAMIVGY 399


HSP 3 Score: 149.1 bits (375), Expect = 2.2e-34
Identity = 84/323 (26.01%), Positives = 165/323 (51.08%), Query Frame = 1

Query: 100 TLFMYNSLIRGYSAAGLCDEAISLYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLH 159
           T+  + +++  Y+  G   EA+ ++ QM ++   PD      +L+A        +G  +H
Sbjct: 186 TIVSWTAIVSAYAQNGEPMEALEIFSQMRKMDVKPDWVALVSVLNAFTCLQDLKQGRSIH 245

Query: 160 GALMKIGLERDMFVANSLIHLYAEGEEFLFARKVFDEMLERNVVSWTSLICGYARTDASS 219
            +++K+GLE +  +  SL  +YA+  +   A+ +FD+M   N++ W ++I GYA+   + 
Sbjct: 246 ASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMISGYAKNGYAR 305

Query: 220 EAVALFFQMIEAGVRPNSVTMVCVISACAKLKVLELAKRIHAYIEESEMEINTHMVNALV 279
           EA+ +F +MI   VRP+++++   ISACA++  LE A+ ++ Y+  S+   +  + +AL+
Sbjct: 306 EAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYRDDVFISSALI 365

Query: 280 DMYMKCGETGAAKRLYDGCVDKNLVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRV 339
           DM+ KCG    A+ ++D  +D+++V+ + ++  +  HG A+E +++   M R  + P+ V
Sbjct: 366 DMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDV 425

Query: 340 SLLSAISACGQMSDYLLGMCCHNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSM 399
           + L  + AC        G    N    +           +ID+  + G  + AY V   M
Sbjct: 426 TFLGLLMACNHSGMVREGWWFFNRMADHKINPQQQHYACVIDLLGRAGHLDQAYEVIKCM 485

Query: 400 S-NKTIVSWNSLLVGYIRNKDFE 422
                +  W +LL    +++  E
Sbjct: 486 PVQPGVTVWGALLSACKKHRHVE 508


HSP 4 Score: 39.7 bits (91), Expect = 1.9e-01
Identity = 27/76 (35.53%), Positives = 38/76 (50.00%), Query Frame = 1

Query: 80  SLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCDEAISLYVQMIEVGFMPDNFTF 139
           S++ AR  F     D   +  + +++++I GY   G   EAISLY  M   G  P++ TF
Sbjct: 373 SVEGARLVF-----DRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAMERGGVHPNDVTF 432

Query: 140 PFLLSACAKTGAFVEG 156
             LL AC  +G   EG
Sbjct: 433 LGLLMACNHSGMVREG 443

BLAST of Cla021603 vs. TrEMBL
Match: A0A0A0LA65_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G146420 PE=4 SV=1)

HSP 1 Score: 1528.8 bits (3957), Expect = 0.0e+00
Identity = 751/841 (89.30%), Positives = 791/841 (94.05%), Query Frame = 1

Query: 1   MAAKLHPTISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIR 60
           MAA L PTISI PNFI PTTQNDSKHTP  HFQIGLFK+CKT+DELGQLHCYALKQGLIR
Sbjct: 1   MAANLFPTISIPPNFIKPTTQNDSKHTPPHHFQIGLFKSCKTIDELGQLHCYALKQGLIR 60

Query: 61  KRPTLTKLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCDEA 120
           K+ T+TKLIS CVEMGTSESLD+ARKAFELFHEDGEAN TLFMYN LIRGYSAAGL DEA
Sbjct: 61  KQSTVTKLISTCVEMGTSESLDFARKAFELFHEDGEANVTLFMYNLLIRGYSAAGLYDEA 120

Query: 121 ISLYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHL 180
           ISLYVQMIE GFMPDNFTFPF+LSACAKT AFVEG+QLHGAL+KIGLE DMFVANSLIHL
Sbjct: 121 ISLYVQMIEFGFMPDNFTFPFVLSACAKTAAFVEGIQLHGALLKIGLEGDMFVANSLIHL 180

Query: 181 YAEGEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSVTM 240
           YAEG EFLFARKVFD MLERNVVSWTSLICGY+RTD   EAVALFFQMIEAGV+PNSVTM
Sbjct: 181 YAEGGEFLFARKVFDGMLERNVVSWTSLICGYSRTDFFGEAVALFFQMIEAGVKPNSVTM 240

Query: 241 VCVISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVD 300
           VCVISACAKLK LELAKR+HAYIEESEME+NTHMVNAL DM+MKCGETGAAKRLYD CVD
Sbjct: 241 VCVISACAKLKDLELAKRLHAYIEESEMELNTHMVNALADMFMKCGETGAAKRLYDECVD 300

Query: 301 KNLVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCC 360
           KNLVLCNTIMSN+ARHGM  EVLAVL DM ++DL+PDRVSLL AISACGQM DYLLG CC
Sbjct: 301 KNLVLCNTIMSNYARHGMPNEVLAVLVDMLQLDLRPDRVSLLPAISACGQMGDYLLGKCC 360

Query: 361 HNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNKDF 420
           HNY LRNGYEGWDNICNA+IDMYMKCG+QEMAYRVFD M NKTIVSWNSLLVGYIRNKD 
Sbjct: 361 HNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDGMLNKTIVSWNSLLVGYIRNKDL 420

Query: 421 EPARKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVASAC 480
           E ARKIFNEMP KDIVSWNTM+NALVQESMF EAIELFREMQLKE+K DRVTMVEVASAC
Sbjct: 421 ESARKIFNEMPEKDIVSWNTMLNALVQESMFDEAIELFREMQLKEIKVDRVTMVEVASAC 480

Query: 481 GYLGALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLAWT 540
           G LGALELAKWVY++IVKN IYCDMLLETALVDMFARCGDPH+AM VFNNM RKDV AWT
Sbjct: 481 GNLGALELAKWVYSHIVKNAIYCDMLLETALVDMFARCGDPHSAMNVFNNMHRKDVSAWT 540

Query: 541 AAIGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMKEH 600
           AAIGAMAV+GNG RA ELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQG++IFESMK+H
Sbjct: 541 AAIGAMAVNGNGDRAIELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGEHIFESMKQH 600

Query: 601 GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMATFA 660
           G+SPQIVHYGCMVDLLGRAGKLEEALDII+SMPM+PNGIIWGSLLAACRTHKNIDMATFA
Sbjct: 601 GLSPQIVHYGCMVDLLGRAGKLEEALDIIESMPMRPNGIIWGSLLAACRTHKNIDMATFA 660

Query: 661 AERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVIHE 720
           AERLAEVAPE+TGIH+LLSNIYASAEKW DVANVRLQLKEKGVQKMPGSSSI+VDGVIHE
Sbjct: 661 AERLAEVAPEKTGIHILLSNIYASAEKWDDVANVRLQLKEKGVQKMPGSSSIQVDGVIHE 720

Query: 721 FTSGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMA 780
           FTSGDRSHPEN  IDMML EIT+RLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLA+A
Sbjct: 721 FTSGDRSHPENYSIDMMLNEITSRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAVA 780

Query: 781 YGLISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDY 840
           YGLIST+KH+PIRV+KNLRMCSDCH+FAKYISKVY REITVRDNNRFH FRQGSCSCGDY
Sbjct: 781 YGLISTKKHVPIRVMKNLRMCSDCHSFAKYISKVYHREITVRDNNRFHVFRQGSCSCGDY 840

Query: 841 W 842
           W
Sbjct: 841 W 841
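
The percentages in each HSP header follow from the counts beside them: matched positions divided by alignment length (for the hit above, 751/841 for identities and 791/841 for positives). The sketch below recomputes them from a header line. The function name `hsp_fractions` and the regular expression are illustrative assumptions and are not part of the database or of any BLAST tool.

```python
import re

# Matches fields such as "Identity = 751/841 (89.30%)" in an HSP header line.
# Fields without a "matched/length" fraction (e.g. "Query Frame = 1") are skipped.
FRACTION = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def hsp_fractions(header):
    """Return (label, matched, aligned_length, recomputed_percent) per field.

    Hypothetical helper for illustration only.
    """
    results = []
    for label, matched, length, _reported in FRACTION.findall(header):
        m, n = int(matched), int(length)
        # Percent = matched positions / alignment length, rounded to 2 decimals.
        results.append((label, m, n, round(100.0 * m / n, 2)))
    return results


line = "Identity = 751/841 (89.30%), Positives = 791/841 (94.05%), Query Frame = 1"
for label, m, n, pct in hsp_fractions(line):
    print(f"{label}: {m}/{n} = {pct:.2f}%")
```

Because the pattern keys on the `matched/length (percent%)` shape rather than on the label text, it should work regardless of how the label is spelled in the report.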

BLAST of Cla021603 vs. TrEMBL
Match: M5XXM3_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001360mg PE=4 SV=1)

HSP 1 Score: 1152.9 bits (2981), Expect = 0.0e+00
Identity = 564/840 (67.14%), Positives = 683/840 (81.31%), Query Frame = 1

Query: 4   KLHPTISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIRKRP 63
           +L P +S  P+F+ PT Q +SK         GL +NCKTM+E+ QLHC   K+GL  +  
Sbjct: 6   QLSPLVSATPSFVAPTNQRESKAMAKDTSPTGLLRNCKTMNEVKQLHCQISKKGLRNRPS 65

Query: 64  TLTKLISICVEMGTSESLDYARKAFELFHEDGEANA-TLFMYNSLIRGYSAAGLCDEAIS 123
           T+T LI+ C EMGT ESLDYARKAF LF ED E     LFMYNSLIRGYS+AGL DEA+ 
Sbjct: 66  TVTNLITTCAEMGTFESLDYARKAFNLFLEDEETKGHILFMYNSLIRGYSSAGLSDEAVL 125

Query: 124 LYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHLYA 183
           LYVQM+  G +PD FTFPF+LSAC+K  AF EGVQLHGAL+K+GLE D F+ NSLIH YA
Sbjct: 126 LYVQMVVKGILPDKFTFPFVLSACSKVVAFSEGVQLHGALVKMGLEEDAFIENSLIHFYA 185

Query: 184 EGEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSVTMVC 243
           E  E  ++RKVFD M ERN+VSWTSLICGYAR     EAV+LFF+M+ AG++PNSVTMVC
Sbjct: 186 ESGELDYSRKVFDGMAERNIVSWTSLICGYARRQFPKEAVSLFFEMVAAGIKPNSVTMVC 245

Query: 244 VISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVDKN 303
           VISACAKLK LEL++R+ AYI ES +++NT +VNALVDMYMKCG T AAKRL+D C DKN
Sbjct: 246 VISACAKLKDLELSERVCAYIGESGVKVNTLVVNALVDMYMKCGATDAAKRLFDECGDKN 305

Query: 304 LVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCCHN 363
           LVL NTI+SN+ R G+A+E LAVL +M R   +PD+V+LLSAISAC Q+ D L G CCH 
Sbjct: 306 LVLYNTILSNYVRQGLAREALAVLDEMLRQGPRPDKVTLLSAISACAQLGDSLSGKCCHG 365

Query: 364 YCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNKDFEP 423
           Y +RN  EGWD ICNA+IDMYMKCG+QEMA  +FD+MSN+T+VSWNSL+ G+IR+ D   
Sbjct: 366 YVIRNRLEGWDAICNAMIDMYMKCGKQEMACGIFDNMSNRTVVSWNSLIAGFIRSGDVNS 425

Query: 424 ARKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVASACGY 483
           A ++FNEMP  D+VSWNTM+ ALVQESMF EAIELFR MQ   +K DRVTMVEVASACGY
Sbjct: 426 AWQMFNEMPKSDLVSWNTMIGALVQESMFVEAIELFRVMQADGIKGDRVTMVEVASACGY 485

Query: 484 LGALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLAWTAA 543
           LGAL+LAKW +AYI KN I CDM L TALVDMFARCGDP +AM+VF++M R+DV AWTAA
Sbjct: 486 LGALDLAKWTHAYIEKNKIDCDMRLGTALVDMFARCGDPQSAMKVFSSMARRDVSAWTAA 545

Query: 544 IGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMKE-HG 603
           IGAMA++GNG+RA EL++EM+RQGVKPD+VVFV +LTACSH GFV+QG  IF SMK  HG
Sbjct: 546 IGAMAMEGNGERALELFDEMIRQGVKPDEVVFVAVLTACSHVGFVKQGWNIFRSMKSVHG 605

Query: 604 ISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMATFAA 663
           ISP I+HYGCMVDLLGRAG L EA D++K MPM+PN +IWG+LLAACRT+KN+++A++AA
Sbjct: 606 ISPHIIHYGCMVDLLGRAGLLGEAFDLVKGMPMEPNDVIWGTLLAACRTYKNVEIASYAA 665

Query: 664 ERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVIHEF 723
           +RL+++  +RTGIHVLLSNIYASAEKWADVA VRL LKEKG+ K+PGSSSIEV+G+IHEF
Sbjct: 666 KRLSKLPTQRTGIHVLLSNIYASAEKWADVAKVRLHLKEKGIHKVPGSSSIEVNGMIHEF 725

Query: 724 TSGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAY 783
            SG  ++ E   + +ML+EI  RL + G+VPD+ NVLLDV+E+EK+YLL+RHSEKLA+A+
Sbjct: 726 ISGGDTNTEKSELTLMLQEINCRLREAGHVPDLDNVLLDVDEKEKEYLLSRHSEKLAIAF 785

Query: 784 GLISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDYW 842
           GLI T + +PIRV+KNLRMCSDCH+FAK +S++Y REI VRDNNRFHFF QG CSC DYW
Sbjct: 786 GLIGTGQGVPIRVVKNLRMCSDCHSFAKLVSRIYNREIIVRDNNRFHFFNQGLCSCSDYW 845

BLAST of Cla021603 vs. TrEMBL
Match: A0A067G4D2_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003148mg PE=4 SV=1)

HSP 1 Score: 1131.3 bits (2925), Expect = 0.0e+00
Identity = 547/844 (64.81%), Positives = 673/844 (79.74%), Query Frame = 1

Query: 1   MAAKLHPT--ISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGL 60
           MA  L+P+  +   P   T T Q+ +K TP     IG  KNCKT++EL Q HC+ LKQGL
Sbjct: 1   MALTLNPSPLVLATPTVTTLTNQHKAKTTPKDSPSIGSLKNCKTLNELKQPHCHILKQGL 60

Query: 61  IRKRPTLTKLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCD 120
             K   ++K++  C +MGT ESL YA+KAF+ + +D E +ATLFMYNSLIRGYS  GL  
Sbjct: 61  GHKPSYISKVVCTCAQMGTFESLTYAQKAFDYYIKDNETSATLFMYNSLIRGYSCIGLGV 120

Query: 121 EAISLYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLI 180
           EAISLYV++   G +PD FTFPF+L+AC K+ AF EGVQ+HGA++K+G +RD+FV N LI
Sbjct: 121 EAISLYVELAGFGILPDKFTFPFVLNACTKSSAFGEGVQVHGAIVKMGFDRDVFVENCLI 180

Query: 181 HLYAEGEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSV 240
           + Y E  + +  R+VFDEM ERNVVSWTSLIC  AR D   EAV LFF+M+E G++PNSV
Sbjct: 181 NFYGECGDIVDGRRVFDEMSERNVVSWTSLICACARRDLPKEAVYLFFEMVEEGIKPNSV 240

Query: 241 TMVCVISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGC 300
           TMVCVISACAKL+ LEL  R+ AYI+E  M+ N  MVNALVDMYMKCG    AK+L+  C
Sbjct: 241 TMVCVISACAKLQNLELGDRVCAYIDELGMKANALMVNALVDMYMKCGAVDTAKQLFGEC 300

Query: 301 VDKNLVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGM 360
            D+NLVLCNTIMSN+ R G+A+E LA+L +M     +PDRV++LSA+SA  Q+ D L G 
Sbjct: 301 KDRNLVLCNTIMSNYVRLGLAREALAILDEMLLHGPRPDRVTMLSAVSASAQLGDLLCGR 360

Query: 361 CCHNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNK 420
            CH Y LRNG EGWD+ICN +IDMYMKCG+QEMA R+FD MSNKT+VSWNSL+ G I+N 
Sbjct: 361 MCHGYVLRNGLEGWDSICNTMIDMYMKCGKQEMACRIFDHMSNKTVVSWNSLIAGLIKNG 420

Query: 421 DFEPARKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVAS 480
           D E AR++F+EMP +D +SWNTM+  L QE+MF EA+ELFR M  + +K DRVTMV VAS
Sbjct: 421 DVESAREVFSEMPGRDHISWNTMLGGLTQENMFEEAMELFRVMLSERIKVDRVTMVGVAS 480

Query: 481 ACGYLGALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLA 540
           ACGYLGAL+LAKW+YAYI KN I+CDM L TALVDMFARCGDP  AM+VF  M+++DV A
Sbjct: 481 ACGYLGALDLAKWIYAYIEKNGIHCDMQLATALVDMFARCGDPQRAMQVFRRMEKRDVSA 540

Query: 541 WTAAIGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMK 600
           WTAAIGAMA++GNG++A EL+NEMLRQG+KPD +VFV +LTACSHGG V QG ++F SM 
Sbjct: 541 WTAAIGAMAMEGNGEQAVELFNEMLRQGIKPDSIVFVGVLTACSHGGLVNQGWHLFRSMT 600

Query: 601 E-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMA 660
           + HG+SPQIVHYGCMVDLLGRAG L EALD+IKSMP++PN +IWGSLLAAC+ H+N+D+A
Sbjct: 601 DIHGVSPQIVHYGCMVDLLGRAGLLGEALDLIKSMPVEPNDVIWGSLLAACQKHQNVDIA 660

Query: 661 TFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGV 720
            +AAER+ E+ PE++G+HVLLSNIYASA KW +VA VRLQ+KE+G++K+PGSSSIEV+G 
Sbjct: 661 AYAAERITELDPEKSGVHVLLSNIYASAGKWTNVARVRLQMKEQGIRKLPGSSSIEVNGK 720

Query: 721 IHEFTSGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKL 780
           +HEFTSGD SHPE   I  ML+E+  RL D GYVPD+TNVLLDV+EQEK+YLL+ HSEKL
Sbjct: 721 VHEFTSGDESHPEMNNISSMLREMNCRLRDAGYVPDLTNVLLDVDEQEKKYLLSHHSEKL 780

Query: 781 AMAYGLISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSC 840
           AMA+GLIST K +PIRV+KNLR+C DCH+FAK +SKVY REI VRDNNRFHFFRQGSCSC
Sbjct: 781 AMAFGLISTSKTMPIRVVKNLRLCCDCHSFAKLVSKVYDREIIVRDNNRFHFFRQGSCSC 840

Query: 841 GDYW 842
            D+W
Sbjct: 841 SDFW 844

BLAST of Cla021603 vs. TrEMBL
Match: V4RH19_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004307mg PE=4 SV=1)

HSP 1 Score: 1130.9 bits (2924), Expect = 0.0e+00
Identity = 547/844 (64.81%), Positives = 673/844 (79.74%), Query Frame = 1

Query: 1   MAAKLHPT--ISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGL 60
           MA  L+P+  +   P   T T Q+ +K TP     IG  KN KT++EL QLHC+ LKQGL
Sbjct: 1   MALTLNPSPLVLATPTVTTLTNQHKAKTTPKDSPSIGSLKNYKTLNELKQLHCHILKQGL 60

Query: 61  IRKRPTLTKLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCD 120
             K   ++K++S C +MGT ESL YA+KAF+ + +D E +ATLFMYNSLIRGYS  GL  
Sbjct: 61  GHKPSYISKVVSTCAQMGTFESLTYAQKAFDYYIKDNETSATLFMYNSLIRGYSCIGLGV 120

Query: 121 EAISLYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLI 180
           EAISLYV+++  G +PD FTFPF+L+AC K+ AF E VQ+HGA++K+G +RD+FV N LI
Sbjct: 121 EAISLYVELVGFGILPDKFTFPFVLNACTKSSAFGEAVQVHGAIVKMGFDRDVFVENCLI 180

Query: 181 HLYAEGEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSV 240
           H Y E  + +  R+VFDEM ERNVVSWTSLIC  AR D   EAV LFF+M+E G++PNSV
Sbjct: 181 HFYGECGDIVDGRRVFDEMSERNVVSWTSLICACARRDLPKEAVYLFFEMVEEGIKPNSV 240

Query: 241 TMVCVISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGC 300
           TMVCVISACAKL+ LEL  R+ AYI+E  M+ N  MVNALVDMYMKCG    A++L+  C
Sbjct: 241 TMVCVISACAKLQNLELGDRVCAYIDELGMKANALMVNALVDMYMKCGAVDTARQLFGEC 300

Query: 301 VDKNLVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGM 360
            D+NLVLCNTIMSN+ R G+A+E LA+L +M     +PDRV++LSA+SA  Q+ D L G 
Sbjct: 301 KDRNLVLCNTIMSNYVRLGLAREALAILDEMLLHGPRPDRVTMLSAVSASAQLGDLLCGR 360

Query: 361 CCHNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNK 420
            CH Y LRNG EGWD+ICN +IDMYMKCG+QEMA R+FD MSNKT+VSWNSL+ G I+N 
Sbjct: 361 MCHGYVLRNGLEGWDSICNTMIDMYMKCGKQEMACRIFDHMSNKTVVSWNSLIAGLIKNG 420

Query: 421 DFEPARKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVAS 480
           D E AR++F+EMP +D +SWNTM+  L QE+MF EA+ELFR M  + +K DRVTMV VAS
Sbjct: 421 DVESAREVFSEMPGRDHISWNTMLGGLTQENMFEEAMELFRVMLSERIKVDRVTMVGVAS 480

Query: 481 ACGYLGALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLA 540
           ACGYLGAL+LAKW+YAYI KN  +CDM L TALVDMFARCGDP  AM+VF  M+++DV A
Sbjct: 481 ACGYLGALDLAKWIYAYIEKNGTHCDMQLATALVDMFARCGDPQRAMQVFRRMEKRDVSA 540

Query: 541 WTAAIGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMK 600
           WTAAIGAMA++GNG++A EL+NEMLRQGVKPD +VFV +LTACSHGG V QG ++F SM 
Sbjct: 541 WTAAIGAMAMEGNGEQAVELFNEMLRQGVKPDSIVFVGVLTACSHGGLVNQGWHLFRSMT 600

Query: 601 E-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMA 660
           + HG+SPQIVHYGCMVDLLGRAG L EALD+IKSMP++PN +IWGSLLAAC+ H+N+D+A
Sbjct: 601 DIHGVSPQIVHYGCMVDLLGRAGLLGEALDLIKSMPVEPNDVIWGSLLAACQKHQNVDIA 660

Query: 661 TFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGV 720
            +AAER+ E+ PE++G+HVLLSNIYASA KW +VA VRLQ+KE+G++K+PGSSSIEV+G 
Sbjct: 661 AYAAERITELDPEKSGVHVLLSNIYASAGKWTNVARVRLQMKEQGIRKLPGSSSIEVNGK 720

Query: 721 IHEFTSGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKL 780
           +HEFTSGD SHPE   I  ML+E+  RL D GYVPD+TNVLLDV+EQEK+YLL+ HSEKL
Sbjct: 721 VHEFTSGDESHPEMNNISSMLREMNCRLRDAGYVPDLTNVLLDVDEQEKKYLLSHHSEKL 780

Query: 781 AMAYGLISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSC 840
           AMA+GLIST K +PIRV+KNLR+C DCH+FAK +SKVY REI VRDNNRFHFFRQGSCSC
Sbjct: 781 AMAFGLISTSKTMPIRVVKNLRLCCDCHSFAKLVSKVYDREIIVRDNNRFHFFRQGSCSC 840

Query: 841 GDYW 842
            D+W
Sbjct: 841 SDFW 844

BLAST of Cla021603 vs. TrEMBL
Match: W9R192_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_009097 PE=4 SV=1)

HSP 1 Score: 1124.4 bits (2907), Expect = 0.0e+00
Identity = 557/850 (65.53%), Positives = 680/850 (80.00%), Query Frame = 1

Query: 1   MAAKLH--PTISIAPNFITP------TTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCY 60
           MAA +H  P +S +  F+TP      T  N+    P      G F NCKTMDEL QLHC 
Sbjct: 1   MAATMHLSPLVSASAIFLTPERTKPKTIVNNDTSPPN-----GSFGNCKTMDELKQLHCD 60

Query: 61  ALKQGLIRKRPTLTKLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYS 120
             K+GL  +  ++T+LI+   EMGTSESLDYAR+AFELF ED  +  TLFMYNSL+RGYS
Sbjct: 61  ITKKGLNHRISSMTELIAKGAEMGTSESLDYARRAFELFKEDEASIGTLFMYNSLMRGYS 120

Query: 121 AAGLCDEAISLYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMF 180
           +AGL  EAIS+YVQM+ +G  PD +TFPF+LS CAK  AF EG+QLHGA++++GLERD+F
Sbjct: 121 SAGLGFEAISVYVQMLVLGITPDKYTFPFVLSGCAKAEAFREGIQLHGAVVRMGLERDLF 180

Query: 181 VANSLIHLYAEGEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAG 240
           + NSLIH YAE  E   ARKVFDEM ERNVVSWTSLIC YAR +   EAV+LFF+M+ AG
Sbjct: 181 IGNSLIHFYAECGELDSARKVFDEMPERNVVSWTSLICCYARRELPKEAVSLFFKMVAAG 240

Query: 241 VRPNSVTMVCVISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAK 300
           V PN+VTMVCVISACAKL  LEL ++I AY++ES +++N  MVNALVDMY+KC     AK
Sbjct: 241 VEPNAVTMVCVISACAKLNDLELGEKIRAYVQESGVKLNAFMVNALVDMYLKCRAIDDAK 300

Query: 301 RLYDGCVDKNLVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMS 360
           RL+D C DKNLVLCNT+MSN+   G+A+E L++  +M R   QPDRV+LLS ISAC Q+ 
Sbjct: 301 RLFDQCADKNLVLCNTMMSNYVDRGLAREALSIFDEMLRGGPQPDRVTLLSVISACSQLG 360

Query: 361 DYLLGMCCHNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLV 420
           D L G CCH+Y LRNG EG+ NI NA+IDMYM+ G+QEMA ++FD M  KT+VSWNSL+ 
Sbjct: 361 DSLSGRCCHSYALRNGLEGFYNISNAMIDMYMRFGKQEMACKIFDRMPKKTVVSWNSLIS 420

Query: 421 GYIRNKDFEPARKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVT 480
           G+IRN D E A K+FNEMP +D+VSWNTM+ ALV+ESMF EAIELFR+MQ K MKADRVT
Sbjct: 421 GFIRNGDVESAWKMFNEMPERDLVSWNTMIGALVEESMFEEAIELFRDMQSKGMKADRVT 480

Query: 481 MVEVASACGYLGALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMD 540
           MVEVASACGYLGAL+LAKW +AYI KN I CDMLL TALVDMFARCG+  +AM+VFNNM 
Sbjct: 481 MVEVASACGYLGALDLAKWAHAYIKKNEIQCDMLLGTALVDMFARCGNSQSAMQVFNNMP 540

Query: 541 RKDVLAWTAAIGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQY 600
           R+DV AWTAAIGAMA++GNG+RA EL++EML QGVKPD+VVFV +LTA SHGG VEQGQ 
Sbjct: 541 RRDVSAWTAAIGAMAMEGNGERAMELFDEMLNQGVKPDRVVFVALLTAFSHGGSVEQGQK 600

Query: 601 IFESMKE-HGISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTH 660
           +F+SMKE HGI+P+IVHYGCMVDLLGRAG L+EA D+IKSMPM+PN +IWGS LAACRTH
Sbjct: 601 LFDSMKEVHGITPEIVHYGCMVDLLGRAGMLKEASDLIKSMPMEPNDVIWGSFLAACRTH 660

Query: 661 KNIDMATFAAERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSS 720
           KN++MA +AAE + E+AP+++GIH+LLSNIYASA KW DVA VRL LKEKG+ K+PG+S 
Sbjct: 661 KNVEMAAYAAESVKELAPQKSGIHILLSNIYASAGKWNDVAKVRLHLKEKGISKVPGTSL 720

Query: 721 IEVDGVIHEFTSGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLN 780
           IEVDG+I+EF   D SHP+   I  ML+EI +RL + G+VP++ NVLLDV+E EK+Y L+
Sbjct: 721 IEVDGMINEFMCSDDSHPKQSQISSMLEEINSRLRNAGHVPELGNVLLDVDEHEKEYFLS 780

Query: 781 RHSEKLAMAYGLISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFR 840
           RHSEKLA+A+GLIST + +PIR++KNLR CSDCH+FAK +S++Y REI +RDN+RFH FR
Sbjct: 781 RHSEKLAIAFGLISTGQGMPIRIVKNLRTCSDCHSFAKLVSRIYNREIIIRDNHRFHIFR 840

Query: 841 QGSCSCGDYW 842
           QG CSC DYW
Sbjct: 841 QGLCSCSDYW 845

BLAST of Cla021603 vs. NCBI nr
Match: gi|778678432|ref|XP_011650966.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g22690 [Cucumis sativus])

HSP 1 Score: 1528.8 bits (3957), Expect = 0.0e+00
Identity = 751/841 (89.30%), Positives = 791/841 (94.05%), Query Frame = 1

Query: 1   MAAKLHPTISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIR 60
           MAA L PTISI PNFI PTTQNDSKHTP  HFQIGLFK+CKT+DELGQLHCYALKQGLIR
Sbjct: 1   MAANLFPTISIPPNFIKPTTQNDSKHTPPHHFQIGLFKSCKTIDELGQLHCYALKQGLIR 60

Query: 61  KRPTLTKLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCDEA 120
           K+ T+TKLIS CVEMGTSESLD+ARKAFELFHEDGEAN TLFMYN LIRGYSAAGL DEA
Sbjct: 61  KQSTVTKLISTCVEMGTSESLDFARKAFELFHEDGEANVTLFMYNLLIRGYSAAGLYDEA 120

Query: 121 ISLYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHL 180
           ISLYVQMIE GFMPDNFTFPF+LSACAKT AFVEG+QLHGAL+KIGLE DMFVANSLIHL
Sbjct: 121 ISLYVQMIEFGFMPDNFTFPFVLSACAKTAAFVEGIQLHGALLKIGLEGDMFVANSLIHL 180

Query: 181 YAEGEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSVTM 240
           YAEG EFLFARKVFD MLERNVVSWTSLICGY+RTD   EAVALFFQMIEAGV+PNSVTM
Sbjct: 181 YAEGGEFLFARKVFDGMLERNVVSWTSLICGYSRTDFFGEAVALFFQMIEAGVKPNSVTM 240

Query: 241 VCVISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVD 300
           VCVISACAKLK LELAKR+HAYIEESEME+NTHMVNAL DM+MKCGETGAAKRLYD CVD
Sbjct: 241 VCVISACAKLKDLELAKRLHAYIEESEMELNTHMVNALADMFMKCGETGAAKRLYDECVD 300

Query: 301 KNLVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCC 360
           KNLVLCNTIMSN+ARHGM  EVLAVL DM ++DL+PDRVSLL AISACGQM DYLLG CC
Sbjct: 301 KNLVLCNTIMSNYARHGMPNEVLAVLVDMLQLDLRPDRVSLLPAISACGQMGDYLLGKCC 360

Query: 361 HNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNKDF 420
           HNY LRNGYEGWDNICNA+IDMYMKCG+QEMAYRVFD M NKTIVSWNSLLVGYIRNKD 
Sbjct: 361 HNYSLRNGYEGWDNICNAMIDMYMKCGKQEMAYRVFDGMLNKTIVSWNSLLVGYIRNKDL 420

Query: 421 EPARKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVASAC 480
           E ARKIFNEMP KDIVSWNTM+NALVQESMF EAIELFREMQLKE+K DRVTMVEVASAC
Sbjct: 421 ESARKIFNEMPEKDIVSWNTMLNALVQESMFDEAIELFREMQLKEIKVDRVTMVEVASAC 480

Query: 481 GYLGALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLAWT 540
           G LGALELAKWVY++IVKN IYCDMLLETALVDMFARCGDPH+AM VFNNM RKDV AWT
Sbjct: 481 GNLGALELAKWVYSHIVKNAIYCDMLLETALVDMFARCGDPHSAMNVFNNMHRKDVSAWT 540

Query: 541 AAIGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMKEH 600
           AAIGAMAV+GNG RA ELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQG++IFESMK+H
Sbjct: 541 AAIGAMAVNGNGDRAIELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGEHIFESMKQH 600

Query: 601 GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMATFA 660
           G+SPQIVHYGCMVDLLGRAGKLEEALDII+SMPM+PNGIIWGSLLAACRTHKNIDMATFA
Sbjct: 601 GLSPQIVHYGCMVDLLGRAGKLEEALDIIESMPMRPNGIIWGSLLAACRTHKNIDMATFA 660

Query: 661 AERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVIHE 720
           AERLAEVAPE+TGIH+LLSNIYASAEKW DVANVRLQLKEKGVQKMPGSSSI+VDGVIHE
Sbjct: 661 AERLAEVAPEKTGIHILLSNIYASAEKWDDVANVRLQLKEKGVQKMPGSSSIQVDGVIHE 720

Query: 721 FTSGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMA 780
           FTSGDRSHPEN  IDMML EIT+RLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLA+A
Sbjct: 721 FTSGDRSHPENYSIDMMLNEITSRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAVA 780

Query: 781 YGLISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDY 840
           YGLIST+KH+PIRV+KNLRMCSDCH+FAKYISKVY REITVRDNNRFH FRQGSCSCGDY
Sbjct: 781 YGLISTKKHVPIRVMKNLRMCSDCHSFAKYISKVYHREITVRDNNRFHVFRQGSCSCGDY 840

Query: 841 W 842
           W
Sbjct: 841 W 841

BLAST of Cla021603 vs. NCBI nr
Match: gi|659076373|ref|XP_008438644.1| (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At3g22690 [Cucumis melo])

HSP 1 Score: 1522.7 bits (3941), Expect = 0.0e+00
Identity = 750/841 (89.18%), Positives = 788/841 (93.70%), Query Frame = 1

Query: 1   MAAKLHPTISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIR 60
           MAA L+PTISI PNFI P T+NDSKHT   HFQIGLFK+CKTMDELGQLHCYALKQGLIR
Sbjct: 1   MAANLYPTISITPNFIKPNTRNDSKHTLPHHFQIGLFKSCKTMDELGQLHCYALKQGLIR 60

Query: 61  KRPTLTKLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCDEA 120
           K+ T+TKLIS CVEMGTSESLD+ARK FELFHEDGEAN TLFMYNSLIRGYS AGLCDEA
Sbjct: 61  KQSTVTKLISTCVEMGTSESLDFARKVFELFHEDGEANVTLFMYNSLIRGYSTAGLCDEA 120

Query: 121 ISLYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHL 180
           ISLYVQMIE GFMPDNFTFPF+LSACAKT AFVEG+QLHGALMKIGLERDMFVANSLIHL
Sbjct: 121 ISLYVQMIEFGFMPDNFTFPFVLSACAKTAAFVEGIQLHGALMKIGLERDMFVANSLIHL 180

Query: 181 YAEGEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSVTM 240
           YAEG EFLFARKVFD MLERNVVSWTSLICGY+RT+ S EAVALFFQMIEAGVRPNSVTM
Sbjct: 181 YAEGGEFLFARKVFDGMLERNVVSWTSLICGYSRTEFSREAVALFFQMIEAGVRPNSVTM 240

Query: 241 VCVISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVD 300
           VCVISACAKLK LELAKR+HAYIEESEME+NTHMVNALVDM+MKCGETGAAKRLYD CVD
Sbjct: 241 VCVISACAKLKDLELAKRLHAYIEESEMELNTHMVNALVDMFMKCGETGAAKRLYDECVD 300

Query: 301 KNLVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCC 360
           KNLVLCNTIMSN+ARHGM  EVLAVL DM ++DL+PDRVSLL AISACGQM DYLLG  C
Sbjct: 301 KNLVLCNTIMSNYARHGMPNEVLAVLVDMLQLDLRPDRVSLLPAISACGQMGDYLLGKSC 360

Query: 361 HNYCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNKDF 420
           HNY LRNGYE WDNICNA+IDMYMKCG+ EMAYRVFD M NKTIVSWNSLLVGYIRNKD 
Sbjct: 361 HNYSLRNGYERWDNICNAMIDMYMKCGKPEMAYRVFDGMLNKTIVSWNSLLVGYIRNKDL 420

Query: 421 EPARKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVASAC 480
           E ARK FNEMP KDIVSWNTM+NALVQESMF EAIELFREMQLKE+KADRVTMVEVASAC
Sbjct: 421 ESARKTFNEMPEKDIVSWNTMLNALVQESMFDEAIELFREMQLKEIKADRVTMVEVASAC 480

Query: 481 GYLGALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLAWT 540
           GYLGALELAKWVY+YIVKN I  DMLLET LVDMFARCGDPH+AMEVFNNMDRKDV AWT
Sbjct: 481 GYLGALELAKWVYSYIVKNGIDYDMLLETTLVDMFARCGDPHSAMEVFNNMDRKDVSAWT 540

Query: 541 AAIGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMKEH 600
           AAIGAMAV+GNG RA ELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQG++IFESMK+H
Sbjct: 541 AAIGAMAVNGNGNRAIELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGEHIFESMKQH 600

Query: 601 GISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMATFA 660
           GISPQIVHYGCMVDLLGRAGKLEEALDII+SMPMKPNGIIWGSLLAACRTHKNIDMATFA
Sbjct: 601 GISPQIVHYGCMVDLLGRAGKLEEALDIIESMPMKPNGIIWGSLLAACRTHKNIDMATFA 660

Query: 661 AERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVIHE 720
           AERL EVAPE+TGIH+LLSNIYASAEKW DVANVRLQLKEKGVQKMPGSSSI+VDGVIHE
Sbjct: 661 AERLEEVAPEKTGIHILLSNIYASAEKWDDVANVRLQLKEKGVQKMPGSSSIQVDGVIHE 720

Query: 721 FTSGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMA 780
           FTSGDRSHPEN G+DMML EIT+RL DVGYVP+VTNVLLDVNEQEK YLLNRHSEKLAMA
Sbjct: 721 FTSGDRSHPENYGMDMMLNEITSRLVDVGYVPEVTNVLLDVNEQEKXYLLNRHSEKLAMA 780

Query: 781 YGLISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDY 840
           YGLIST+KH+PIRV+KNLRMCSDCH+FAKYISKVY REI VRDNNRFHFFRQGSCSCGDY
Sbjct: 781 YGLISTKKHVPIRVMKNLRMCSDCHSFAKYISKVYHREIIVRDNNRFHFFRQGSCSCGDY 840

Query: 841 W 842
           W
Sbjct: 841 W 841

BLAST of Cla021603 vs. NCBI nr
Match: gi|1009141328|ref|XP_015888135.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g22690 [Ziziphus jujuba])

HSP 1 Score: 1153.3 bits (2982), Expect = 0.0e+00
Identity = 566/836 (67.70%), Positives = 674/836 (80.62%), Query Frame = 1

Query: 7   PTISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIRKRPTLT 66
           P +S  P+  +PT QN++K  PT    IGL + CKTMDEL QLHC   K+GL  +   +T
Sbjct: 9   PLVSATPSSTSPTLQNETKTIPTDSSPIGLLEKCKTMDELKQLHCSITKKGLNYRLSAVT 68

Query: 67  KLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCDEAISLYVQ 126
           KLI+ C  MG+ ESLDYARKAFELF ED E   TLFMYNSLIRGYSAAGL DEA+ LYVQ
Sbjct: 69  KLIANCSAMGSVESLDYARKAFELFREDEETGGTLFMYNSLIRGYSAAGLGDEAVLLYVQ 128

Query: 127 MIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHLYAEGEE 186
           M+  G +PD +TFPF+LS C K  AF EG QLHG ++K+GLE D+F+ NSLIH Y+E  +
Sbjct: 129 MVVNGILPDKYTFPFVLSGCVKVEAFREGAQLHGTIVKMGLEEDVFIGNSLIHYYSESGD 188

Query: 187 FLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSVTMVCVISA 246
               R+VFD M ERNVVSWTSLI GYAR +   EA++LFF+M+ +G+RPNSVTMVCVI A
Sbjct: 189 LDEGRRVFDGMPERNVVSWTSLIYGYARRELPKEAISLFFEMVASGIRPNSVTMVCVIGA 248

Query: 247 CAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVDKNLVLC 306
           CAKLK LEL +RI  YI +S +++N  MVNALVDMYMKCG T +AKRL+D CVDKNLVL 
Sbjct: 249 CAKLKDLELGERICNYIGDSGVKLNALMVNALVDMYMKCGATESAKRLFDKCVDKNLVLY 308

Query: 307 NTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCCHNYCLR 366
           NTI+SN+ R G+A E LA+L +M R   +PDRV+LLSAISAC Q+ D L G CCHNY LR
Sbjct: 309 NTILSNYVRQGLAIEALAILFEMLRQGPKPDRVTLLSAISACSQLGDILSGKCCHNYALR 368

Query: 367 NGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNKDFEPARKI 426
           NG E WD+ICNA+IDMYMKCG+ E A +VF+ M  KT+VSWNSL+ G+IRN D E AR+ 
Sbjct: 369 NGLECWDSICNALIDMYMKCGKPETACKVFNLMPKKTVVSWNSLIAGFIRNGDLESARRN 428

Query: 427 FNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVASACGYLGAL 486
           FNEMP  D+VSWNTM+ ALV ESMF EAIELFR MQ + +K DRVTMVEVASACG LGAL
Sbjct: 429 FNEMPESDLVSWNTMIGALVHESMFEEAIELFRVMQNEGIKPDRVTMVEVASACGCLGAL 488

Query: 487 ELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLAWTAAIGAM 546
           +LAKW++ YI K+ I CD+ L TALVDMFARCGD  +AM VF+NM ++DV AWT+AIGAM
Sbjct: 489 DLAKWIHTYIEKHKIDCDIRLGTALVDMFARCGDLQSAMRVFSNMSKRDVSAWTSAIGAM 548

Query: 547 AVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMKE-HGISPQ 606
           A+ GNG+RA EL+ EML QGVKPD+VVFV +LTACSHGGFVEQG+ +F SM+E H ISPQ
Sbjct: 549 AMQGNGERAIELFEEMLGQGVKPDEVVFVTLLTACSHGGFVEQGKNLFRSMEEVHRISPQ 608

Query: 607 IVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMATFAAERLA 666
           IVHYGCMVDLLGRAG L EA D+IKSMPM+PN +IWGSLLAACRTHKN++MA +AAER+ 
Sbjct: 609 IVHYGCMVDLLGRAGLLREARDLIKSMPMEPNDVIWGSLLAACRTHKNVEMAAYAAERIN 668

Query: 667 EVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVIHEFTSGD 726
           E+A ERTGIHVLLSNIYASA KW DV+ VRL LKEKG+ K+PGSSSIE+DG+IHEFTSGD
Sbjct: 669 ELASERTGIHVLLSNIYASAGKWNDVSKVRLSLKEKGICKVPGSSSIEIDGMIHEFTSGD 728

Query: 727 RSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYGLIS 786
             HP+N  I++ML+EI +RL D G+VPD+ NVLLDV+EQEK+YLL+RHSEKLA+++GLIS
Sbjct: 729 DRHPQNSHIEVMLQEINSRLRDAGHVPDLANVLLDVDEQEKEYLLSRHSEKLAISFGLIS 788

Query: 787 TEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDYW 842
           T   + IRV+KNLRMCSDCH+FAK +SK+Y REI VRDNNRFHFFRQG CSC DYW
Sbjct: 789 TSHGITIRVVKNLRMCSDCHSFAKLVSKIYDREIVVRDNNRFHFFRQGLCSCSDYW 844

BLAST of Cla021603 vs. NCBI nr
Match: gi|596281956|ref|XP_007225289.1| (hypothetical protein PRUPE_ppa001360mg [Prunus persica])

HSP 1 Score: 1152.9 bits (2981), Expect = 0.0e+00
Identity = 564/840 (67.14%), Positives = 683/840 (81.31%), Query Frame = 1

Query: 4   KLHPTISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIRKRP 63
           +L P +S  P+F+ PT Q +SK         GL +NCKTM+E+ QLHC   K+GL  +  
Sbjct: 6   QLSPLVSATPSFVAPTNQRESKAMAKDTSPTGLLRNCKTMNEVKQLHCQISKKGLRNRPS 65

Query: 64  TLTKLISICVEMGTSESLDYARKAFELFHEDGEANA-TLFMYNSLIRGYSAAGLCDEAIS 123
           T+T LI+ C EMGT ESLDYARKAF LF ED E     LFMYNSLIRGYS+AGL DEA+ 
Sbjct: 66  TVTNLITTCAEMGTFESLDYARKAFNLFLEDEETKGHILFMYNSLIRGYSSAGLSDEAVL 125

Query: 124 LYVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHLYA 183
           LYVQM+  G +PD FTFPF+LSAC+K  AF EGVQLHGAL+K+GLE D F+ NSLIH YA
Sbjct: 126 LYVQMVVKGILPDKFTFPFVLSACSKVVAFSEGVQLHGALVKMGLEEDAFIENSLIHFYA 185

Query: 184 EGEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSVTMVC 243
           E  E  ++RKVFD M ERN+VSWTSLICGYAR     EAV+LFF+M+ AG++PNSVTMVC
Sbjct: 186 ESGELDYSRKVFDGMAERNIVSWTSLICGYARRQFPKEAVSLFFEMVAAGIKPNSVTMVC 245

Query: 244 VISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVDKN 303
           VISACAKLK LEL++R+ AYI ES +++NT +VNALVDMYMKCG T AAKRL+D C DKN
Sbjct: 246 VISACAKLKDLELSERVCAYIGESGVKVNTLVVNALVDMYMKCGATDAAKRLFDECGDKN 305

Query: 304 LVLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCCHN 363
           LVL NTI+SN+ R G+A+E LAVL +M R   +PD+V+LLSAISAC Q+ D L G CCH 
Sbjct: 306 LVLYNTILSNYVRQGLAREALAVLDEMLRQGPRPDKVTLLSAISACAQLGDSLSGKCCHG 365

Query: 364 YCLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNKDFEP 423
           Y +RN  EGWD ICNA+IDMYMKCG+QEMA  +FD+MSN+T+VSWNSL+ G+IR+ D   
Sbjct: 366 YVIRNRLEGWDAICNAMIDMYMKCGKQEMACGIFDNMSNRTVVSWNSLIAGFIRSGDVNS 425

Query: 424 ARKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVASACGY 483
           A ++FNEMP  D+VSWNTM+ ALVQESMF EAIELFR MQ   +K DRVTMVEVASACGY
Sbjct: 426 AWQMFNEMPKSDLVSWNTMIGALVQESMFVEAIELFRVMQADGIKGDRVTMVEVASACGY 485

Query: 484 LGALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLAWTAA 543
           LGAL+LAKW +AYI KN I CDM L TALVDMFARCGDP +AM+VF++M R+DV AWTAA
Sbjct: 486 LGALDLAKWTHAYIEKNKIDCDMRLGTALVDMFARCGDPQSAMKVFSSMARRDVSAWTAA 545

Query: 544 IGAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMKE-HG 603
           IGAMA++GNG+RA EL++EM+RQGVKPD+VVFV +LTACSH GFV+QG  IF SMK  HG
Sbjct: 546 IGAMAMEGNGERALELFDEMIRQGVKPDEVVFVAVLTACSHVGFVKQGWNIFRSMKSVHG 605

Query: 604 ISPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMATFAA 663
           ISP I+HYGCMVDLLGRAG L EA D++K MPM+PN +IWG+LLAACRT+KN+++A++AA
Sbjct: 606 ISPHIIHYGCMVDLLGRAGLLGEAFDLVKGMPMEPNDVIWGTLLAACRTYKNVEIASYAA 665

Query: 664 ERLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVIHEF 723
           +RL+++  +RTGIHVLLSNIYASAEKWADVA VRL LKEKG+ K+PGSSSIEV+G+IHEF
Sbjct: 666 KRLSKLPTQRTGIHVLLSNIYASAEKWADVAKVRLHLKEKGIHKVPGSSSIEVNGMIHEF 725

Query: 724 TSGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAY 783
            SG  ++ E   + +ML+EI  RL + G+VPD+ NVLLDV+E+EK+YLL+RHSEKLA+A+
Sbjct: 726 ISGGDTNTEKSELTLMLQEINCRLREAGHVPDLDNVLLDVDEKEKEYLLSRHSEKLAIAF 785

Query: 784 GLISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDYW 842
           GLI T + +PIRV+KNLRMCSDCH+FAK +S++Y REI VRDNNRFHFF QG CSC DYW
Sbjct: 786 GLIGTGQGVPIRVVKNLRMCSDCHSFAKLVSRIYNREIIVRDNNRFHFFNQGLCSCSDYW 845

BLAST of Cla021603 vs. NCBI nr
Match: gi|694388321|ref|XP_009369874.1| (PREDICTED: pentatricopeptide repeat-containing protein At3g22690-like [Pyrus x bretschneideri])

HSP 1 Score: 1149.0 bits (2971), Expect = 0.0e+00
Identity = 563/839 (67.10%), Positives = 683/839 (81.41%), Query Frame = 1

Query: 4   KLHPTISIAPNFITPTTQNDSKHTPTLHFQIGLFKNCKTMDELGQLHCYALKQGLIRKRP 63
           +L P +S AP+F  PTTQN+ K T       G  +NCKTM+E+ QLHC+  K G   K  
Sbjct: 6   QLSPLVSAAPSFAAPTTQNEPKTTAMETSPTGSLRNCKTMNEVKQLHCHITKTGHGSKPS 65

Query: 64  TLTKLISICVEMGTSESLDYARKAFELFHEDGEANATLFMYNSLIRGYSAAGLCDEAISL 123
           T+T+LIS C EMGTSESL+YARKAF+LF  D E +  LFM NSLIRGY++AGL DEAI L
Sbjct: 66  TVTRLISTCAEMGTSESLEYARKAFDLFLGDQETSGVLFMCNSLIRGYASAGLSDEAILL 125

Query: 124 YVQMIEVGFMPDNFTFPFLLSACAKTGAFVEGVQLHGALMKIGLERDMFVANSLIHLYAE 183
           YVQM   G +PD FTFPF LS+C+K  AF EGVQLHGAL+K+GLE D F+ NSLIH YAE
Sbjct: 126 YVQMAVRGILPDKFTFPFALSSCSKVVAFCEGVQLHGALVKMGLEGDAFIENSLIHFYAE 185

Query: 184 GEEFLFARKVFDEMLERNVVSWTSLICGYARTDASSEAVALFFQMIEAGVRPNSVTMVCV 243
             E  + RKVFD M ERN+VSWTSLICGYAR +   EAV+LFF+M+  G  PNSVTMVCV
Sbjct: 186 CGELDYGRKVFDGMSERNIVSWTSLICGYARRNFPREAVSLFFEMVAEGFEPNSVTMVCV 245

Query: 244 ISACAKLKVLELAKRIHAYIEESEMEINTHMVNALVDMYMKCGETGAAKRLYDGCVDKNL 303
           ISACAKLK L+L++R+ AY+ ES +++NT MVNALVDMYMKCGET AAK+++D CVDKN+
Sbjct: 246 ISACAKLKDLKLSERVCAYLGESGVKVNTLMVNALVDMYMKCGETDAAKQIFDECVDKNV 305

Query: 304 VLCNTIMSNFARHGMAKEVLAVLADMFRVDLQPDRVSLLSAISACGQMSDYLLGMCCHNY 363
           VL NTI+SN+ R G+A+E L+VL +M R   + DRV+LLSAISAC Q+ D L G CCH Y
Sbjct: 306 VLYNTILSNYVRQGLAREALSVLDEMMRQGPRADRVTLLSAISACAQLGDSLSGKCCHGY 365

Query: 364 CLRNGYEGWDNICNAIIDMYMKCGRQEMAYRVFDSMSNKTIVSWNSLLVGYIRNKDFEPA 423
            +RNG EGWD ICNA+IDMYMKCG+QEMA R+FD+M N+T+VSWNS++ G++R+   + A
Sbjct: 366 VIRNGLEGWDAICNAMIDMYMKCGKQEMACRIFDNMLNRTVVSWNSVIAGFVRSGAVKSA 425

Query: 424 RKIFNEMPVKDIVSWNTMVNALVQESMFGEAIELFREMQLKEMKADRVTMVEVASACGYL 483
            ++FNEMP  D+VSWNTM+ ALVQESMF EAIELFR MQ + +K DRVTMVEVASACGYL
Sbjct: 426 WQMFNEMPTSDLVSWNTMIGALVQESMFEEAIELFRVMQAEGIKGDRVTMVEVASACGYL 485

Query: 484 GALELAKWVYAYIVKNNIYCDMLLETALVDMFARCGDPHNAMEVFNNMDRKDVLAWTAAI 543
           GAL+LAKW +AYI KN I CDM L TALVDMFARCGDP +AM++F+ M RKDV AWTAAI
Sbjct: 486 GALDLAKWTHAYIEKNKIDCDMRLGTALVDMFARCGDPQSAMKMFDKMRRKDVSAWTAAI 545

Query: 544 GAMAVDGNGKRAFELYNEMLRQGVKPDQVVFVNILTACSHGGFVEQGQYIFESMKE-HGI 603
           GAMA++GNG+RA EL++EM+RQGVKPD+VVFV +LTACSH GFVEQG  IF SMK  HGI
Sbjct: 546 GAMAMEGNGERALELFDEMIRQGVKPDEVVFVALLTACSHVGFVEQGWNIFRSMKSVHGI 605

Query: 604 SPQIVHYGCMVDLLGRAGKLEEALDIIKSMPMKPNGIIWGSLLAACRTHKNIDMATFAAE 663
           SP IVHYGCMVDLLGRA  LEEA+D++KSMPM PN +IWG+LLAACRT+KN+ +A++ AE
Sbjct: 606 SPHIVHYGCMVDLLGRARLLEEAVDLVKSMPMDPNDVIWGTLLAACRTYKNVKIASYVAE 665

Query: 664 RLAEVAPERTGIHVLLSNIYASAEKWADVANVRLQLKEKGVQKMPGSSSIEVDGVIHEFT 723
           +++ ++ +RTGIHVLLSNIYASA KW DVA VRL LKEKG+ K+PGSSSIEV+G+IHEFT
Sbjct: 666 QMSTLSTQRTGIHVLLSNIYASAGKWDDVAKVRLHLKEKGIHKVPGSSSIEVNGMIHEFT 725

Query: 724 SGDRSHPENCGIDMMLKEITNRLGDVGYVPDVTNVLLDVNEQEKQYLLNRHSEKLAMAYG 783
           SG  ++ E      ML+EI  RL +VG+VPD+ NVLLDV+E+EK+YLL+RHSEKLA+A+G
Sbjct: 726 SGGDTNTEKSQTASMLQEINFRLREVGHVPDLDNVLLDVDEKEKEYLLSRHSEKLAIAFG 785

Query: 784 LISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDYW 842
           LI T + +PIRV+KNLRMCSDCH+FAK++SK+Y REITVRDNNRFHFFRQG CSCGDYW
Sbjct: 786 LIGTGQRVPIRVVKNLRMCSDCHSFAKFVSKIYNREITVRDNNRFHFFRQGLCSCGDYW 844
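
The Identity column in the BLAST result tables is a percentage of matching alignment positions. As a minimal sketch (not the database's own calculation), the figure for a single alignment block can be recomputed from its Query and Sbjct rows. The function name `percent_identity` is hypothetical, and the two strings are the final block shown above (Query 784..842, Sbjct 786..844). A per-block value will generally differ from the whole-alignment identity reported in the tables.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of alignment columns carrying the same residue in both rows.

    Gap columns ('-') never count as matches but do count toward the
    alignment length, which is the usual BLAST-style denominator.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("alignment is empty")
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return 100.0 * matches / len(query)


# Final alignment block of this page (no gaps in either row).
query = "LISTEKHLPIRVIKNLRMCSDCHAFAKYISKVYVREITVRDNNRFHFFRQGSCSCGDYW"
subject = "LIGTGQRVPIRVVKNLRMCSDCHSFAKFVSKIYNREITVRDNNRFHFFRQGLCSCGDYW"

block_identity = percent_identity(query, subject)
print(f"{block_identity:.2f}% identity over {len(query)} columns")
```

For this block the sketch counts 47 identical positions out of 59 columns, which agrees with the letters printed on the alignment midline.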

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
PP249_ARATH  6.2e-287  57.62     Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana GN...
PP175_ARATH  5.3e-153  39.80     Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop...
PPR32_ARATH  6.5e-151  35.56     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
PP348_ARATH  5.7e-147  36.48     Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN...
PP224_ARATH  4.5e-136  35.81     Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana GN...
Match Name        E-value  Identity  Description
A0A0A0LA65_CUCSA  0.0e+00  89.30     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G146420 PE=4 SV=1
M5XXM3_PRUPE      0.0e+00  67.14     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa001360mg PE=4 SV=1
A0A067G4D2_CITSI  0.0e+00  64.81     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g003148mg PE=4 SV=1
V4RH19_9ROSI      0.0e+00  64.81     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004307mg PE=4 SV=1
W9R192_9ROSA      0.0e+00  65.53     Uncharacterized protein OS=Morus notabilis GN=L484_009097 PE=4 SV=1
Match Name                         E-value  Identity  Description
gi|778678432|ref|XP_011650966.1|   0.0e+00  89.30     PREDICTED: pentatricopeptide repeat-containing protein At3g22690 [Cucumis sativu...
gi|659076373|ref|XP_008438644.1|   0.0e+00  89.18     PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At3g...
gi|1009141328|ref|XP_015888135.1|  0.0e+00  67.70     PREDICTED: pentatricopeptide repeat-containing protein At3g22690 [Ziziphus jujub...
gi|596281956|ref|XP_007225289.1|   0.0e+00  67.14     hypothetical protein PRUPE_ppa001360mg [Prunus persica]
gi|694388321|ref|XP_009369874.1|   0.0e+00  67.10     PREDICTED: pentatricopeptide repeat-containing protein At3g22690-like [Pyrus x b...
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0005515      protein binding
molecular_function  GO:0008270      zinc ion binding

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Cla021603     Cla021603    gene


The following polypeptide feature(s) derive from this mRNA:

Feature Name  Unique Name          Type
Cla021603     Cla021603.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name      Unique Name       Type
Cla021603.1.cds1  Cla021603.1.cds1  CDS


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description   Source   Source Term   Source Description   Alignment

IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 509..535  score: 0.0055
    coord: 376..401  score: 0.12
    coord: 304..331  score: (truncated)

IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 201..249  score: 3.5E-11
    coord: 433..473  score: 9.6E-8
    coord: 102..148  score: 2. (truncated)

IPR002885  Pentatricopeptide repeat  PFAM  PF13812  PPR_3
    coord: 558..617  score: 6. (truncated)

IPR002885  Pentatricopeptide repeat  TIGRFAMs  TIGR00756  TIGR00756
    coord: 538..570  score: 5.9E-5
    coord: 304..337  score: 6.1E-4
    coord: 103..135  score: 3.5E-7
    coord: 405..436  score: 1.6E-4
    coord: 436..469  score: 3.2E-7
    coord: 608..632  score: 0.0011
    coord: 175..203  score: 0.003
    coord: 203..236  score: 1. (truncated)

IPR002885  Pentatricopeptide repeat  PROFILE  PS51375  PPR
    coord: 236..270  score: 7.037
    coord: 372..402  score: 7.443
    coord: 469..503  score: 5.579
    coord: 201..235  score: 11.455
    coord: 605..635  score: 8.068
    coord: 135..169  score: 7.925
    coord: 434..468  score: 11.159
    coord: 302..336  score: 9.427
    coord: 403..433  score: 9.24
    coord: 504..534  score: 7.859
    coord: 570..604  score: 9.843
    coord: 535..569  score: 10.534
    coord: 170..200  score: 7.892
    coord: 100..134  score: 11.838
    coord: 271..301  score: 6 (truncated)

IPR011990  Tetratricopeptide-like helical domain  GENE3D  G3DSA:1.25.40.10
    coord: 272..302  score: 9.0E-8
    coord: 204..237  score: 9.0E-8
    coord: 370..461  score: 9.0E-8
    coord: 79..152   score: 9. (truncated)

None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 373..392  score: 0.0
    coord: 11..337   score: 0.0
    coord: 433..712  score: (truncated)

None  No IPR available  PANTHER  PTHR24015:SF670  SUBFAMILY NOT NAMED
    coord: 373..392  score: 0.0
    coord: 433..712  score: 0.0
    coord: 11..337   score: (truncated)
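
The PROFILE PS51375 coordinates listed above are reported in no particular order, which makes it hard to see how much of the protein the repeats span. A minimal sketch, assuming the coordinates are inclusive residue positions, merges hits that overlap or directly abut and sums the covered residues. The names `merge_intervals` and `ps51375_hits` are hypothetical and are not part of the InterPro output.

```python
def merge_intervals(intervals):
    """Merge inclusive (start, end) residue ranges that overlap or touch."""
    merged = []
    for start, end in sorted(intervals):
        # A hit starting at or before previous end + 1 extends that block.
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# PS51375 (PPR profile) coordinates copied from the table, in page order.
ps51375_hits = [
    (236, 270), (372, 402), (469, 503), (201, 235), (605, 635),
    (135, 169), (434, 468), (302, 336), (403, 433), (504, 534),
    (570, 604), (535, 569), (170, 200), (100, 134), (271, 301),
]

blocks = merge_intervals(ps51375_hits)
covered = sum(end - start + 1 for start, end in blocks)
print(blocks, covered)
```

Under that assumption the fifteen hits collapse into two contiguous runs, 100..336 and 372..635, covering 501 residues, with a gap at 337..371.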