Csa1G009920 (gene) Cucumber (Chinese Long) v2

NameCsa1G009920
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionPentatricopeptide repeat-containing protein, putative; contains IPR002885 (Pentatricopeptide repeat)
LocationChr1 : 1552947 .. 1554996 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTATTTATGAGGTCATCAAGGGAATGAAAAAGGGGCAAAATTAGAGAAAATAAACAAGTCATCTTCTTCCCCTCTCTCACACTGCCGCCGCCCCCAACCCGGTCTCGTTTTCTGGCGAGAAACGGCAACGACCCACGCAGGTAGAAGTCGGTTGTTCTTCTCCGATGACCCGTTTACTTGCTAGCAGGTTCAGACCCACGATTTTGTGCCTGAGATTGGAGGCATTTTTACACACGCAGTCTTTTCCGTCTCCGGCGAATCTAAGAAACCACGTCGACGATTACATTACTCCGGCTATTGACGCGAGTAGACACCCACGCGATTATCTTTTCCTTGTGTTGGGTTTTCTACGTGCATAGCAATGCATGCACGCCATTTGTTCGATGAAATGTCGCTGAGAAGATGCGTTTCATTGTTGAAGTTAAAATGGGATAGCTTCATTGCGCAATCCGTCTGTACGCAACATCGGTTTTGTTCCCTTCATTCTACCGTAAACAATGGAGCTGCTGTATCAAAACTGTGTGAAGTGATTTCATGTACGATCGGTGGTTTAGATGAATTGGAATCCAGTTTGAACAAATGTACAATATCATTGACATCTTCATTAGTTACCCAAGTTATTGATTCTAGCAAAAATGAAGCTCCCACTAGAAGATTGCTTAGGTTTTTCTTGTGGTCACTCAAGAAGTTAAACCACACTTTAGAAGATGAAGATTTCAATAATGCCATTCGCTTTTTTGCTCAGAAGAAGGACTACACTGCCGTTAACATTTTACTTTCTAATCTTAAGAAAGCCGACCGGGCAATGGACGGCCAGACTTTCGGTTTTGTGGCTGAAGCTTTTGTCAAAATGGATAGAGAAGATGAAGCATTGGGTCTGTTCAAGAACCTAGAGAAGTACAAATGCCCACATGACCAATTTACTGTCGTAGCTATTATTACTGCTCTTTGTTCGAAAGGGCATGCTAAAAGAGCAGAAGGAGTTGTTTTGCACCACAAGGACAAGATTTCTAGCACAATGAGTTGCATCTATAGAAGCCTTCTATATGGATGGTCTATTAAGAAGAACACAAAAGAAGCAAGAAGAATACTGAAAGAAATGAAGTCAGATGGAACCATGCCAGATTTGTTCTGCTACAACACTTTTCTCAAGTGTCTTTGTGAGAAAAATGTTGAGAAAAATCCTTCAGGTCTTGTGCCTGAATCCTTGAATGTGATGATGGAAATGAGATCCTACAAAATTTCTCCCAACTCAATCAGTTATAATATATTGTTATCATGTCTGTGCAAAACTAGGAGAGTTAAGGAATCTTGTAAAATCCTCGAGATGATGAAAAGAACGGGTTGTCAACCCGATTGCGTAAGCTATTATCTTATGGCGAGAGTGTTGTTCTTGACTGGAAGATTTGGGAAAGGGCGTGAAATTGTGGACGAGATGATTGAAGAAGGATTGACCCCAGATCGAAAATTCTACTACGATTTGATTGGTATTCTGTGTGGTGTAGAGAGAACAAATTATGCACTTGAGCTTTTTGAGAAGATGAAGAGAAGCTCATTGGGGGGTTATGGGCCAGTTTATGATGTGCTTATACCAAAGCTTTGTAGAGGAGGTGAATTTGAAATGGGGAGGCAACTGTGGGAGGAAGCCATGGCTATGGGTGTTTCGCTTAACTGCTCAAGTGAGATTTTGGATCCTTCAATCACAAAGGTTTTCAAGCCAACAAGAAAGATTGAGAATAAAATTGTTGAAGAATTCAATAGTGCTGAGAAGCAGAACAAAGCTGCAGCTGAGAAACCAAAAGAAAAGAGAAAGAAAGGTAAGTAAGAATCTGTCTTGAAAATAGCTTCTTGAATTTTTTGGAACTTAGTCAAATTAAATCATTTCATAGGCCTCCTTGTTCAAGATTTGTATAGTAACTTAATGATACACCATAAGATCTATCTATCTATCTATACACACAGAATGTACTCTTTTAAGCAAATCAGTTCTAGGCCTAAACGGTTTTGAATGGAGATTCGGAAGATTTTTATCCTAAGATTGTGAGCCTC

mRNA sequence

ATGCATGCACGCCATTTGTTCGATGAAATGTCGCTGAGAAGATGCGTTTCATTGTTGAAGTTAAAATGGGATAGCTTCATTGCGCAATCCGTCTGTACGCAACATCGGTTTTGTTCCCTTCATTCTACCGTAAACAATGGAGCTGCTGTATCAAAACTGTGTGAAGTGATTTCATGTACGATCGGTGGTTTAGATGAATTGGAATCCAGTTTGAACAAATGTACAATATCATTGACATCTTCATTAGTTACCCAAGTTATTGATTCTAGCAAAAATGAAGCTCCCACTAGAAGATTGCTTAGGTTTTTCTTGTGGTCACTCAAGAAGTTAAACCACACTTTAGAAGATGAAGATTTCAATAATGCCATTCGCTTTTTTGCTCAGAAGAAGGACTACACTGCCGTTAACATTTTACTTTCTAATCTTAAGAAAGCCGACCGGGCAATGGACGGCCAGACTTTCGGTTTTGTGGCTGAAGCTTTTGTCAAAATGGATAGAGAAGATGAAGCATTGGGTCTGTTCAAGAACCTAGAGAAGTACAAATGCCCACATGACCAATTTACTGTCGTAGCTATTATTACTGCTCTTTGTTCGAAAGGGCATGCTAAAAGAGCAGAAGGAGTTGTTTTGCACCACAAGGACAAGATTTCTAGCACAATGAGTTGCATCTATAGAAGCCTTCTATATGGATGGTCTATTAAGAAGAACACAAAAGAAGCAAGAAGAATACTGAAAGAAATGAAGTCAGATGGAACCATGCCAGATTTGTTCTGCTACAACACTTTTCTCAAGTGTCTTTGTGAGAAAAATGTTGAGAAAAATCCTTCAGGTCTTGTGCCTGAATCCTTGAATGTGATGATGGAAATGAGATCCTACAAAATTTCTCCCAACTCAATCAGTTATAATATATTGTTATCATGTCTGTGCAAAACTAGGAGAGTTAAGGAATCTTGTAAAATCCTCGAGATGATGAAAAGAACGGGTTGTCAACCCGATTGCGTAAGCTATTATCTTATGGCGAGAGTGTTGTTCTTGACTGGAAGATTTGGGAAAGGGCGTGAAATTGTGGACGAGATGATTGAAGAAGGATTGACCCCAGATCGAAAATTCTACTACGATTTGATTGGTATTCTGTGTGGTGTAGAGAGAACAAATTATGCACTTGAGCTTTTTGAGAAGATGAAGAGAAGCTCATTGGGGGGTTATGGGCCAGTTTATGATGTGCTTATACCAAAGCTTTGTAGAGGAGGTGAATTTGAAATGGGGAGGCAACTGTGGGAGGAAGCCATGGCTATGGGTGTTTCGCTTAACTGCTCAAGTGAGATTTTGGATCCTTCAATCACAAAGGTTTTCAAGCCAACAAGAAAGATTGAGAATAAAATTGTTGAAGAATTCAATAGTGCTGAGAAGCAGAACAAAGCTGCAGCTGAGAAACCAAAAGAAAAGAGAAAGAAAGGTAAGTAA

Coding sequence (CDS)

ATGCATGCACGCCATTTGTTCGATGAAATGTCGCTGAGAAGATGCGTTTCATTGTTGAAGTTAAAATGGGATAGCTTCATTGCGCAATCCGTCTGTACGCAACATCGGTTTTGTTCCCTTCATTCTACCGTAAACAATGGAGCTGCTGTATCAAAACTGTGTGAAGTGATTTCATGTACGATCGGTGGTTTAGATGAATTGGAATCCAGTTTGAACAAATGTACAATATCATTGACATCTTCATTAGTTACCCAAGTTATTGATTCTAGCAAAAATGAAGCTCCCACTAGAAGATTGCTTAGGTTTTTCTTGTGGTCACTCAAGAAGTTAAACCACACTTTAGAAGATGAAGATTTCAATAATGCCATTCGCTTTTTTGCTCAGAAGAAGGACTACACTGCCGTTAACATTTTACTTTCTAATCTTAAGAAAGCCGACCGGGCAATGGACGGCCAGACTTTCGGTTTTGTGGCTGAAGCTTTTGTCAAAATGGATAGAGAAGATGAAGCATTGGGTCTGTTCAAGAACCTAGAGAAGTACAAATGCCCACATGACCAATTTACTGTCGTAGCTATTATTACTGCTCTTTGTTCGAAAGGGCATGCTAAAAGAGCAGAAGGAGTTGTTTTGCACCACAAGGACAAGATTTCTAGCACAATGAGTTGCATCTATAGAAGCCTTCTATATGGATGGTCTATTAAGAAGAACACAAAAGAAGCAAGAAGAATACTGAAAGAAATGAAGTCAGATGGAACCATGCCAGATTTGTTCTGCTACAACACTTTTCTCAAGTGTCTTTGTGAGAAAAATGTTGAGAAAAATCCTTCAGGTCTTGTGCCTGAATCCTTGAATGTGATGATGGAAATGAGATCCTACAAAATTTCTCCCAACTCAATCAGTTATAATATATTGTTATCATGTCTGTGCAAAACTAGGAGAGTTAAGGAATCTTGTAAAATCCTCGAGATGATGAAAAGAACGGGTTGTCAACCCGATTGCGTAAGCTATTATCTTATGGCGAGAGTGTTGTTCTTGACTGGAAGATTTGGGAAAGGGCGTGAAATTGTGGACGAGATGATTGAAGAAGGATTGACCCCAGATCGAAAATTCTACTACGATTTGATTGGTATTCTGTGTGGTGTAGAGAGAACAAATTATGCACTTGAGCTTTTTGAGAAGATGAAGAGAAGCTCATTGGGGGGTTATGGGCCAGTTTATGATGTGCTTATACCAAAGCTTTGTAGAGGAGGTGAATTTGAAATGGGGAGGCAACTGTGGGAGGAAGCCATGGCTATGGGTGTTTCGCTTAACTGCTCAAGTGAGATTTTGGATCCTTCAATCACAAAGGTTTTCAAGCCAACAAGAAAGATTGAGAATAAAATTGTTGAAGAATTCAATAGTGCTGAGAAGCAGAACAAAGCTGCAGCTGAGAAACCAAAAGAAAAGAGAAAGAAAGGTAAGTAA

Protein sequence

MHARHLFDEMSLRRCVSLLKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCTIGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPKEKRKKGK*
BLAST of Csa1G009920 vs. Swiss-Prot
Match: PP439_ARATH (Pentatricopeptide repeat-containing protein At5g61370, mitochondrial OS=Arabidopsis thaliana GN=At5g61370 PE=2 SV=1)

HSP 1 Score: 570.5 bits (1469), Expect = 1.8e-161
Identity = 270/466 (57.94%), Postives = 355/466 (76.18%), Query Frame = 1

Query: 19  LKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCTIGGLDELESSLNKCTISL 78
           ++L   +++  +      FCS H    +  A+ ++  ++S  +GGLD+LE +LN+ ++S 
Sbjct: 6   VRLNRFTYLTSTAKLTRYFCSHHLVDRSETALHEVIRIVSSPVGGLDDLEENLNQVSVSP 65

Query: 79  TSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKDYTAVNIL 138
           +S+LVTQVI+S KNE   RRLLRFF WS K L  +L D++FN  +R  A+KKD+TA+ IL
Sbjct: 66  SSNLVTQVIESCKNETSPRRLLRFFSWSCKSLGSSLHDKEFNYVLRVLAEKKDHTAMQIL 125

Query: 139 LSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVAIITALCS 198
           LS+L+K +RAMD QTF  VAE  VK+ +E++A+G+FK L+K+ CP D FTV AII+ALCS
Sbjct: 126 LSDLRKENRAMDKQTFSIVAETLVKVGKEEDAIGIFKILDKFSCPQDGFTVTAIISALCS 185

Query: 199 KGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFC 258
           +GH KRA GV+ HHKD IS     +YRSLL+GWS+++N KEARR++++MKS G  PDLFC
Sbjct: 186 RGHVKRALGVMHHHKDVISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGITPDLFC 245

Query: 259 YNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRVKESC 318
           +N+ L CLCE+NV +NPSGLVPE+LN+M+EMRSYKI P S+SYNILLSCL +TRRV+ESC
Sbjct: 246 FNSLLTCLCERNVNRNPSGLVPEALNIMLEMRSYKIQPTSMSYNILLSCLGRTRRVRESC 305

Query: 319 KILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFYYDLIGIL 378
           +ILE MKR+GC PD  SYY + RVL+LTGRFGKG +IVDEMIE G  P+RKFYYDLIG+L
Sbjct: 306 QILEQMKRSGCDPDTGSYYFVVRVLYLTGRFGKGNQIVDEMIERGFRPERKFYYDLIGVL 365

Query: 379 CGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSLNC 438
           CGVER N+AL+LFEKMKRSS+GGYG VYD+LIPKLC+GG FE GR+LWEEA+++ V+L+C
Sbjct: 366 CGVERVNFALQLFEKMKRSSVGGYGQVYDLLIPKLCKGGNFEKGRELWEEALSIDVTLSC 425

Query: 439 SSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPKEKRK 485
           S  +LDPS+T+VFKP +  E   + +  +   +  A   K K K K
Sbjct: 426 SISLLDPSVTEVFKPMKMKEEAAMVDRRALNLKIHARMNKTKPKLK 471

BLAST of Csa1G009920 vs. Swiss-Prot
Match: PP438_ARATH (Pentatricopeptide repeat-containing protein PNM1, mitochondrial OS=Arabidopsis thaliana GN=PNM1 PE=1 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 1.5e-43
Identity = 116/415 (27.95%), Postives = 201/415 (48.43%), Query Frame = 1

Query: 76  ISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKDYTAV 135
           I+    L+ Q ++ S      R  L F  W     N +  DE  +  + +F ++KD+  +
Sbjct: 105 ITPNPDLILQTLNLSPEAG--RAALGFNEWLDSNSNFSHTDETVSFFVDYFGRRKDFKGM 164

Query: 136 NILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEK-YKCPHDQFTVVAIIT 195
             ++S  K       G+T     +  V+  R  +    F+ +E  Y    D+ ++  ++ 
Sbjct: 165 LEIISKYKGI---AGGKTLESAIDRLVRAGRPKQVTDFFEKMENDYGLKRDKESLTLVVK 224

Query: 196 ALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMP 255
            LC KGHA  AE +V +  ++I    + I   L+ GW I +   EA R+  EM   G   
Sbjct: 225 KLCEKGHASIAEKMVKNTANEIFPDEN-ICDLLISGWCIAEKLDEATRLAGEMSRGGFEI 284

Query: 256 DLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRV 315
               YN  L C+C+   +K+P  L PE   V++EM    +  N+ ++N+L++ LCK RR 
Sbjct: 285 GTKAYNMMLDCVCKLCRKKDPFKLQPEVEKVLLEMEFRGVPRNTETFNVLINNLCKIRRT 344

Query: 316 KESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTP--DRKFYY 375
           +E+  +   M   GCQPD  +Y ++ R L+   R G+G E++D+M   G     ++K YY
Sbjct: 345 EEAMTLFGRMGEWGCQPDAETYLVLIRSLYQAARIGEGDEMIDKMKSAGYGELLNKKEYY 404

Query: 376 DLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAM 435
             + ILCG+ER  +A+ +F+ MK +        YD+L+ K+C   +      L++EA   
Sbjct: 405 GFLKILCGIERLEHAMSVFKSMKANGCKPGIKTYDLLMGKMCANNQLTRANGLYKEAAKK 464

Query: 436 GVSLNCSSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPKEKRKKGK 488
           G++++     +DP   K  K T+++++ +        K+ +   EK   K+K+ K
Sbjct: 465 GIAVSPKEYRVDPRFMK--KKTKEVDSNV--------KKRETLPEKTARKKKRLK 503

BLAST of Csa1G009920 vs. Swiss-Prot
Match: PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 135.6 bits (340), Expect = 1.5e-30
Identity = 90/371 (24.26%), Postives = 173/371 (46.63%), Query Frame = 1

Query: 66  ELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRF 125
           +LE +LN+  + L   L+ +V++   +        RFF+W+ K+  +    E + + ++ 
Sbjct: 99  KLELALNESGVELRPGLIERVLNRCGDAGNLG--YRFFVWAAKQPRYCHSIEVYKSMVKI 158

Query: 126 FAQKKDYTAVNILLSNLKKAD-RAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPH 185
            ++ + + AV  L+  ++K + + ++ + F  + + F   D   +A+ +   + K+    
Sbjct: 159 LSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKFGFEP 218

Query: 186 DQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRIL 245
           D++    ++ ALC  G  K A  +    + +    +   + SLLYGW       EA+ +L
Sbjct: 219 DEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLR-YFTSLLYGWCRVGKMMEAKYVL 278

Query: 246 KEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNIL 305
            +M   G  PD+  Y   L            +G + ++ +++ +MR     PN+  Y +L
Sbjct: 279 VQMNEAGFEPDIVDYTNLLSGYAN-------AGKMADAYDLLRDMRRRGFEPNANCYTVL 338

Query: 306 LSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGL 365
           +  LCK  R++E+ K+   M+R  C+ D V+Y  +       G+  K   ++D+MI++GL
Sbjct: 339 IQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGL 398

Query: 366 TPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQ 425
            P    Y  ++      E     LEL EKM++        +Y+V+I   C+ GE +   +
Sbjct: 399 MPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVR 458

Query: 426 LWEEAMAMGVS 436
           LW E    G+S
Sbjct: 459 LWNEMEENGLS 459


HSP 2 Score: 47.8 bits (112), Expect = 4.1e-04
Identity = 43/196 (21.94%), Postives = 75/196 (38.27%), Query Frame = 1

Query: 169 EALGLFKNLEKYKCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLL 228
           +A  L +++ +     +      +I ALC     + A  V +  +          Y +L+
Sbjct: 305 DAYDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALV 364

Query: 229 YGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMME 288
            G+       +   +L +M   G MP    Y   +        EK  S    E L +M +
Sbjct: 365 SGFCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVAH-----EKKES--FEECLELMEK 424

Query: 289 MRSYKISPNSISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGR 348
           MR  +  P+   YN+++   CK   VKE+ ++   M+  G  P   ++ +M   L   G 
Sbjct: 425 MRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEENGLSPGVDTFVIMINGLASQGC 484

Query: 349 FGKGREIVDEMIEEGL 365
             +  +   EM+  GL
Sbjct: 485 LLEASDHFKEMVTRGL 493

BLAST of Csa1G009920 vs. Swiss-Prot
Match: PP281_ARATH (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana GN=MEE40 PE=2 SV=1)

HSP 1 Score: 117.9 bits (294), Expect = 3.2e-25
Identity = 89/377 (23.61%), Postives = 165/377 (43.77%), Query Frame = 1

Query: 119 FNNAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLE 178
           FN  I+   +        ++L ++       D +TF  V + +++    D AL + + + 
Sbjct: 192 FNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMV 251

Query: 179 KYKCPHDQFTVVAIITALCSKGHAKRAEGVV--LHHKDKISSTMSCIYRSLLYGWSIKKN 238
           ++ C     +V  I+   C +G  + A   +  + ++D         + +L+ G     +
Sbjct: 252 EFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYT-FNTLVNGLCKAGH 311

Query: 239 TKEARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISP 298
            K A  I+  M  +G  PD++ YN+ +  LC+        G V E++ V+ +M +   SP
Sbjct: 312 VKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKL-------GEVKEAVEVLDQMITRDCSP 371

Query: 299 NSISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIV 358
           N+++YN L+S LCK  +V+E+ ++  ++   G  PD  ++  + + L LT       E+ 
Sbjct: 372 NTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELF 431

Query: 359 DEMIEEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRG 418
           +EM  +G  PD   Y  LI  LC   + + AL + ++M+ S        Y+ LI   C+ 
Sbjct: 432 EEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKA 491

Query: 419 GEFEMGRQLWEEAMAMGVSLN-----------CSS-------EILDPSITKVFKPTRKIE 476
            +     ++++E    GVS N           C S       +++D  I +  KP +   
Sbjct: 492 NKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTY 551


HSP 2 Score: 112.5 bits (280), Expect = 1.3e-23
Identity = 80/325 (24.62%), Postives = 139/325 (42.77%), Query Frame = 1

Query: 150 DGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVAIITALCSKGHAKRAEGVV 209
           D  T+  V     K+    EA+ +   +    C  +  T   +I+ LC +   + A  + 
Sbjct: 329 DVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLCKENQVEEATELA 388

Query: 210 LHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLKCLCEK 269
                K      C + SL+ G  + +N + A  + +EM+S G  PD F YN  +  LC K
Sbjct: 389 RVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSK 448

Query: 270 NVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRVKESCKILEMMKRTGC 329
                  G + E+LN++ +M     + + I+YN L+   CK  + +E+ +I + M+  G 
Sbjct: 449 -------GKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDEMEVHGV 508

Query: 330 QPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFYYDLIGILCGVERTNYALE 389
             + V+Y  +   L  + R     +++D+MI EG  PD+  Y  L+   C       A +
Sbjct: 509 SRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAAD 568

Query: 390 LFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSLNCSSEILDPSITK 449
           + + M  +        Y  LI  LC+ G  E+  +L       G+  N +    +P I  
Sbjct: 569 IVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGI--NLTPHAYNPVIQG 628

Query: 450 VFKPTRKIENKIVEEFNSAEKQNKA 475
           +F+  +  E   +  F    +QN+A
Sbjct: 629 LFRKRKTTE--AINLFREMLEQNEA 642


HSP 3 Score: 96.7 bits (239), Expect = 7.6e-19
Identity = 76/358 (21.23%), Postives = 152/358 (42.46%), Query Frame = 1

Query: 64  LDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAI 123
           L++++SS  +C +  ++ L+  +I+S         +L    W + +     +   +N  +
Sbjct: 106 LEDMKSS--RCEMGTSTFLI--LIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRML 165

Query: 124 RFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCP 183
                      V I  + +       D  TF  + +A  +  +   A+ + +++  Y   
Sbjct: 166 NLLVDGNSLKLVEISHAKMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLV 225

Query: 184 HDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRI 243
            D+ T   ++     +G    A  +     +   S  +     +++G+  +   ++A   
Sbjct: 226 PDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNF 285

Query: 244 LKEMKS-DGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYN 303
           ++EM + DG  PD + +NT +  LC+       +G V  ++ +M  M      P+  +YN
Sbjct: 286 IQEMSNQDGFFPDQYTFNTLVNGLCK-------AGHVKHAIEIMDVMLQEGYDPDVYTYN 345

Query: 304 ILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEE 363
            ++S LCK   VKE+ ++L+ M    C P+ V+Y  +   L    +  +  E+   +  +
Sbjct: 346 SVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSK 405

Query: 364 GLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFE 421
           G+ PD   +  LI  LC       A+ELFE+M+          Y++LI  LC  G+ +
Sbjct: 406 GILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLD 452


HSP 4 Score: 85.5 bits (210), Expect = 1.8e-15
Identity = 62/281 (22.06%), Postives = 119/281 (42.35%), Query Frame = 1

Query: 119 FNNAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLE 178
           FN+ I+     +++     L   ++      D  T+  + ++     + DEAL + K +E
Sbjct: 403 FNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQME 462

Query: 179 KYKCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTK 238
              C     T   +I   C     + AE +    +    S  S  Y +L+ G    +  +
Sbjct: 463 LSGCARSVITYNTLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVE 522

Query: 239 EARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNS 298
           +A +++ +M  +G  PD + YN+ L   C         G + ++ +++  M S    P+ 
Sbjct: 523 DAAQLMDQMIMEGQKPDKYTYNSLLTHFCR-------GGDIKKAADIVQAMTSNGCEPDI 582

Query: 299 ISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDE 358
           ++Y  L+S LCK  RV+ + K+L  ++  G      +Y  + + LF   +  +   +  E
Sbjct: 583 VTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYNPVIQGLFRKRKTTEAINLFRE 642

Query: 359 MIEEGLTPDRKFYYDLI--GILCG----VERTNYALELFEK 394
           M+E+   P     Y ++  G+  G     E  ++ +EL EK
Sbjct: 643 MLEQNEAPPDAVSYRIVFRGLCNGGGPIREAVDFLVELLEK 676

BLAST of Csa1G009920 vs. Swiss-Prot
Match: PP248_ARATH (Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidopsis thaliana GN=At3g22670 PE=2 SV=1)

HSP 1 Score: 117.5 bits (293), Expect = 4.2e-25
Identity = 91/400 (22.75%), Postives = 176/400 (44.00%), Query Frame = 1

Query: 50  VSKLCEVISCTIGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKK 109
           + K+C+ ++      +++   L+KC + +T SLV QV+    N     +   FF+W+  +
Sbjct: 102 IDKVCDFLNKKDTSHEDVVKELSKCDVVVTESLVLQVLRRFSNG--WNQAYGFFIWANSQ 161

Query: 110 LNHTLEDEDFNNAIRFFAQKKDYTAVNILLSNLKKADRA--MDGQTFGFVAEAFVKMDRE 169
             +      +N  +    + +++  +  L++ + K + +  +   T   V     K  + 
Sbjct: 162 TGYVHSGHTYNAMVDVLGKCRNFDLMWELVNEMNKNEESKLVTLDTMSKVMRRLAKSGKY 221

Query: 170 DEALGLFKNLEK-YKCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRS 229
           ++A+  F  +EK Y    D   + +++ AL  +   + A  V L   D I       +  
Sbjct: 222 NKAVDAFLEMEKSYGVKTDTIAMNSLMDALVKENSIEHAHEVFLKLFDTIKPDART-FNI 281

Query: 230 LLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVM 289
           L++G+   +   +AR ++  MK     PD+  Y +F++  C++   +  +        ++
Sbjct: 282 LIHGFCKARKFDDARAMMDLMKVTEFTPDVVTYTSFVEAYCKEGDFRRVN-------EML 341

Query: 290 MEMRSYKISPNSISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLT 349
            EMR    +PN ++Y I++  L K+++V E+  + E MK  GC PD   Y  +  +L  T
Sbjct: 342 EEMRENGCNPNVVTYTIVMHSLGKSKQVAEALGVYEKMKEDGCVPDAKFYSSLIHILSKT 401

Query: 350 GRFGKGREIVDEMIEEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGP-- 409
           GRF    EI ++M  +G+  D   Y  +I       R   AL L ++M+        P  
Sbjct: 402 GRFKDAAEIFEDMTNQGVRRDVLVYNTMISAALHHSRDEMALRLLKRMEDEEGESCSPNV 461

Query: 410 -VYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSLNCSSEIL 444
             Y  L+   C   + ++   L    +   VS++ S+ IL
Sbjct: 462 ETYAPLLKMCCHKKKMKLLGILLHHMVKNDVSIDVSTYIL 491

BLAST of Csa1G009920 vs. TrEMBL
Match: V4T153_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10023621mg PE=4 SV=1)

HSP 1 Score: 635.2 bits (1637), Expect = 6.6e-179
Identity = 319/481 (66.32%), Postives = 390/481 (81.08%), Query Frame = 1

Query: 18  LLKLKWDSFIAQSVCTQ---HRFCSLHSTVNNGAA---VSKLCEVISCTIGGLDELESSL 77
           LLK KW S + Q V TQ   H   SL+S V +      + +LC+V+S TIGGLD+LE SL
Sbjct: 4   LLKPKWRSLLLQRVDTQKSEHLLLSLYSMVPSNQVSHELKELCKVVSSTIGGLDDLELSL 63

Query: 78  NKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKD 137
           N+ T SL+SSLVTQVIDS K+EAPTRRLLRFFLWS K L+ +LED+D+N+AIR FA+KKD
Sbjct: 64  NQFTGSLSSSLVTQVIDSCKHEAPTRRLLRFFLWSCKNLSASLEDKDYNHAIRVFAEKKD 123

Query: 138 YTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVA 197
           + A+NIL+S+L+K  R M+ Q+FG + E  VK+ REDEALG+FKNLEK+KC  D  TV A
Sbjct: 124 HMAMNILVSDLRKEGRVMETQSFGVLVETLVKLGREDEALGIFKNLEKFKCVQDSVTVSA 183

Query: 198 IITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDG 257
           I++ALC+KGHA+RAEGVV HHKDKIS    CIYRSL+YGWS+++N K AR+I+KEMKS G
Sbjct: 184 IVSALCAKGHARRAEGVVYHHKDKISGVELCIYRSLIYGWSMQENVKAARKIIKEMKSAG 243

Query: 258 TMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKT 317
            MPDLFCYNTFL+ LCE+N+++NPSGLVPE+LNVMMEMRSY+I+P SISYNILLSCL +T
Sbjct: 244 FMPDLFCYNTFLRGLCERNLKRNPSGLVPEALNVMMEMRSYRIAPTSISYNILLSCLGRT 303

Query: 318 RRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFY 377
           RRVKESC++LE MK++GC PD VSYYL+ARVL+L+GRFGKG +IVDEMIEEGL PDRKFY
Sbjct: 304 RRVKESCRVLEQMKKSGCAPDWVSYYLVARVLYLSGRFGKGNKIVDEMIEEGLIPDRKFY 363

Query: 378 YDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMA 437
           YDLIGILCGVER N+ALELFE+MKRSSLGGYGPVYDVLIPK+C+GG+F  GR+LW+EAM 
Sbjct: 364 YDLIGILCGVERVNFALELFERMKRSSLGGYGPVYDVLIPKVCQGGDFVKGRELWDEAMV 423

Query: 438 MGVSLNCSSEILDPSITKVFKPTRK-IENKIVEEFNSAEKQNKAAA----EKPKEKRKKG 488
           MG++L+CSS +LDPSIT+VF P RK  E  +     + E Q K       +K K K+KK 
Sbjct: 424 MGLTLSCSSNVLDPSITEVFHPRRKPTEGCLGSTTPNIEAQVKKTVIEVDKKKKSKKKKN 483

BLAST of Csa1G009920 vs. TrEMBL
Match: A0A061FXY0_THECC (Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_014106 PE=4 SV=1)

HSP 1 Score: 635.2 bits (1637), Expect = 6.6e-179
Identity = 307/470 (65.32%), Postives = 384/470 (81.70%), Query Frame = 1

Query: 26  FIAQSVCTQ---HRFCSLHSTVNNGAAVSKLCEVISCTIGGLDELESSLNKCTISLTSSL 85
           F+ Q + TQ   + F S HST+       +LC+V+S ++GGLD+LESSLN+  +SL+  L
Sbjct: 13  FLTQVITTQKPKNLFHSPHSTITTPPEFEELCKVVSSSMGGLDDLESSLNRFKLSLSPLL 72

Query: 86  VTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKDYTAVNILLSNL 145
           VTQVI+S +NEAPTRRLLRFFLWS+K L+ +LED+D NN +R FA+KKD+TA+ IL+S++
Sbjct: 73  VTQVINSCENEAPTRRLLRFFLWSVKNLSSSLEDKDLNNVVRVFAKKKDHTAMGILVSDI 132

Query: 146 KKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVAIITALCSKGHA 205
           +   R M+ QTF  VAE  VK+ REDEALG+FKNLEK+KCP D F++ AI+ ALC+KGHA
Sbjct: 133 RNRGRTMESQTFSVVAEMLVKLGREDEALGIFKNLEKFKCPRDSFSLTAIVNALCAKGHA 192

Query: 206 KRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTF 265
           ++AEGVV HHKD I+    CIYR LLYGWS+++N KEARR++KEMKS G   DL+CYNTF
Sbjct: 193 RKAEGVVYHHKDTIAGVEPCIYRCLLYGWSVQENVKEARRVIKEMKSAGFELDLYCYNTF 252

Query: 266 LKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRVKESCKILE 325
           L+CLC KN ++NPSGLVPE+LNVMMEMRS +I+P S+SYNILLSCL +TRRVKESC+ILE
Sbjct: 253 LRCLCGKNAKRNPSGLVPEALNVMMEMRSQRIAPTSVSYNILLSCLGRTRRVKESCQILE 312

Query: 326 MMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFYYDLIGILCGVE 385
           +MK+ GC PD +SYYL+ARVL+LTGRFGKG +IVDEMIE+GLTPDRKFYYDLIG+LCGVE
Sbjct: 313 LMKKAGCAPDWISYYLVARVLYLTGRFGKGNKIVDEMIEQGLTPDRKFYYDLIGVLCGVE 372

Query: 386 RTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSLNCSSEI 445
           R N+ALELFE+MKRSSLGGYGPVYDVLIPKLCRGG+FE GR+LW+EA+A GVSL+CSS++
Sbjct: 373 RVNFALELFERMKRSSLGGYGPVYDVLIPKLCRGGDFEKGRELWDEAVATGVSLSCSSDV 432

Query: 446 LDPSITKVFKPTRKIENKIVEEFNSAE-----KQNKAAAEKPKEKRKKGK 488
           LDPSIT+VFKPTRK E   ++    A+     KQN    +K K+ +KK K
Sbjct: 433 LDPSITEVFKPTRKAEKVHLKGCTMAKSPVKNKQNTMKGKKYKKIKKKKK 482

BLAST of Csa1G009920 vs. TrEMBL
Match: K7MKY1_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_17G102300 PE=4 SV=1)

HSP 1 Score: 624.8 bits (1610), Expect = 9.0e-176
Identity = 305/475 (64.21%), Postives = 377/475 (79.37%), Query Frame = 1

Query: 17  SLLKLKWDSFIAQSVCTQHRFC---SLHSTVNNGAA---VSKLCEVISCTIGGLDELESS 76
           ++LK  W  F  Q++ T +      S +ST+ + +    + +LC ++  T+GGLDELE S
Sbjct: 7   AMLKPAWKRFWLQNMRTHNLQILPFSHYSTLQSMSVHPQLQELCSIVMSTVGGLDELELS 66

Query: 77  LNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKK 136
           LNK   SLTSSLV Q IDSSK+EA TRRLLRFFLWS K L+H LED+D+N+A+R FA+KK
Sbjct: 67  LNKFKDSLTSSLVAQAIDSSKHEAQTRRLLRFFLWSCKNLSHRLEDKDYNHALRVFAEKK 126

Query: 137 DYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVV 196
           DYTA++IL+ +LKK  RAMD +TF  VAE  VK+ +EDEALG+FKNL+KYKC  D+FTV 
Sbjct: 127 DYTAMDILMGDLKKEGRAMDAETFSLVAENLVKLGKEDEALGIFKNLDKYKCSIDEFTVT 186

Query: 197 AIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSD 256
           AI+ ALCSKGH KRAEGVV HH DKI+ T  CIYRSLLYGWS+++N KEARRI+KEMKS+
Sbjct: 187 AIVNALCSKGHGKRAEGVVWHHNDKITGTKPCIYRSLLYGWSVQRNVKEARRIIKEMKSN 246

Query: 257 GTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCK 316
           G +PDL CYNTFL+CLCE+N+  NPSGLVPE+LNVMMEM+S+ + P SISYNILLSCL K
Sbjct: 247 GVIPDLLCYNTFLRCLCERNLRHNPSGLVPEALNVMMEMKSHNVFPTSISYNILLSCLGK 306

Query: 317 TRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKF 376
           TRRVKESC+ILE MK +GC PD VSYYL+A+VLFL+GRFGKG+E+VD+MI +GL P+ KF
Sbjct: 307 TRRVKESCQILETMKISGCDPDWVSYYLVAKVLFLSGRFGKGKEMVDQMIGKGLVPNHKF 366

Query: 377 YYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAM 436
           YY LIGILCGVER NYALELFEKMK+SS+GGYGPVYDVLIPKLCRGG+FE GR+LW+EA 
Sbjct: 367 YYSLIGILCGVERVNYALELFEKMKKSSMGGYGPVYDVLIPKLCRGGDFEKGRELWDEAS 426

Query: 437 AMGVSLNCSSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPKEKRKK 486
            MG++L CS ++LDPSIT+V+KPTR  ++  V+   +   Q         + RKK
Sbjct: 427 GMGITLQCSEDVLDPSITEVYKPTRPEKSSHVDSSRAKSPQKLTKFSGKMKMRKK 481

BLAST of Csa1G009920 vs. TrEMBL
Match: W9QUA2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_008580 PE=4 SV=1)

HSP 1 Score: 622.5 bits (1604), Expect = 4.4e-175
Identity = 309/458 (67.47%), Postives = 370/458 (80.79%), Query Frame = 1

Query: 20  KLKWDSFIAQSVCTQ---HRFCSLHSTVNNGAAVSKLCEVISCTIGGLDELESSLNKCTI 79
           K +W  F+ +S   Q      C  +S +++ + + +LC ++S TIGGLD+LESSL+    
Sbjct: 6   KSRWHYFLLRSFTAQKFRQLSCLPNSNLSSASRLQELCTIVSRTIGGLDDLESSLSDFRG 65

Query: 80  SLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKDYTAVN 139
           SLTSSLVTQVIDS K EAPTRRLLRFFLWS K L   LED+D+N+AIR FA KKD+TA+ 
Sbjct: 66  SLTSSLVTQVIDSCKTEAPTRRLLRFFLWSHKNLKCDLEDKDYNHAIRVFAGKKDHTALE 125

Query: 140 ILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVAIITAL 199
           IL+S+LKK  RA++ QT+  VAE  VK+ REDEALG+FKN +KYKCP + FTV A++ AL
Sbjct: 126 ILVSDLKKGGRALESQTYAIVAETLVKLGREDEALGIFKNSDKYKCPQNSFTVTAVVNAL 185

Query: 200 CSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDL 259
           C++GHAKRAEGVV HHKD+IS    CIYRSLLYGWS ++N KEARRI+KEMKS G  PDL
Sbjct: 186 CAQGHAKRAEGVVGHHKDRISGMERCIYRSLLYGWSEQENVKEARRIIKEMKSAGINPDL 245

Query: 260 FCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRVKE 319
           FCYNTFL+CLCE+N+++NPSGLVPE+LNVMMEMRSY I+PNSISYNILLSCL + RRVKE
Sbjct: 246 FCYNTFLRCLCERNLKRNPSGLVPEALNVMMEMRSYMITPNSISYNILLSCLGRARRVKE 305

Query: 320 SCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFYYDLIG 379
           +C+ILE MK+ GC PD +SYYL+ RVL+LT RFGKG ++VDEMI EGL P+ KFYYDLIG
Sbjct: 306 ACQILERMKQAGCSPDWMSYYLVIRVLYLTMRFGKGNKLVDEMIGEGLVPNCKFYYDLIG 365

Query: 380 ILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSL 439
           +LCGVER  YALELFE MK+ SLGGYGPVYDVLIPKLCRGG+FE GR+LW EAM MGV  
Sbjct: 366 VLCGVERPYYALELFEHMKKRSLGGYGPVYDVLIPKLCRGGDFEKGRELWIEAMNMGVDF 425

Query: 440 NCSSEILDPSITKVFKPTRKIENKI-VEEFNSAEKQNK 474
            CSS++LDPSITKVFKPTRK E KI  EE  S+E +NK
Sbjct: 426 CCSSDVLDPSITKVFKPTRKEEEKISQEESTSSENKNK 463

BLAST of Csa1G009920 vs. TrEMBL
Match: B9RLV3_RICCO (Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCOM_1471200 PE=4 SV=1)

HSP 1 Score: 620.5 bits (1599), Expect = 1.7e-174
Identity = 301/453 (66.45%), Postives = 375/453 (82.78%), Query Frame = 1

Query: 37  FCSLHSTVNNGAA---VSKLCEVISCTIGGLDELESSLNKCTISLTSSLVTQVIDSSKNE 96
           F  L+ST+++      + ++C+ +S +IGGLD+LESSLN    +LTS +VTQVID  K+E
Sbjct: 9   FVCLYSTISHNRVPLELQEICKAVSSSIGGLDDLESSLNGFRGNLTSQIVTQVIDCCKHE 68

Query: 97  APTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKDYTAVNILLSNLKKADRAMDGQT 156
           APTRRLLRFFLWS K+L+ +++DEDFN+AIR  A+KKD+TA+ IL+S+L+K  R M+ QT
Sbjct: 69  APTRRLLRFFLWSYKRLDFSMKDEDFNHAIRVLAEKKDHTAMQILISDLRKEGRVMEPQT 128

Query: 157 FGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVAIITALCSKGHAKRAEGVVLHHK 216
           FG VAEA VK+ REDEALG+FKNL+K+KCP D  TV AIITALC++GHAK+A GVVLHHK
Sbjct: 129 FGLVAEALVKLGREDEALGIFKNLDKFKCPQDCETVTAIITALCAEGHAKKAYGVVLHHK 188

Query: 217 DKISSTMS-CIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVE 276
           DK+S  +  CIYRSL+YGWS++KN K AR +++EMK +G  PDLFCYNTFL+CLCE+NVE
Sbjct: 189 DKLSEVIRPCIYRSLIYGWSMQKNVKRAREVIQEMKRNGIKPDLFCYNTFLRCLCERNVE 248

Query: 277 KNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRVKESCKILEMMKRTGCQPD 336
           +NPSGLVPESLNVMMEMRSY+I PNSISYNILLSCL + RRV+ESCKILE+MK++ C PD
Sbjct: 249 RNPSGLVPESLNVMMEMRSYRIEPNSISYNILLSCLGRVRRVQESCKILELMKKSSCAPD 308

Query: 337 CVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFYYDLIGILCGVERTNYALELFE 396
            VSYYL+A+VL+LTGRFGKG +IVDEMIE  L PDRKFYYDLIGILCGVER N+AL+LF+
Sbjct: 309 WVSYYLVAKVLYLTGRFGKGNKIVDEMIERRLVPDRKFYYDLIGILCGVERVNFALKLFD 368

Query: 397 KMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSLNCSSEILDPSITKVFK 456
           +MKRSS GGYGPVYD+LIPKLC GG FE G++LW+EAMAMGV+++CSSE+LDPSITKVF+
Sbjct: 369 QMKRSSSGGYGPVYDLLIPKLCIGGNFEKGKELWDEAMAMGVTVHCSSEVLDPSITKVFE 428

Query: 457 PTRKIENKIVEEFNSAEKQNKAAAEKPKEKRKK 486
           PTRK+E +         K N     K +E+ +K
Sbjct: 429 PTRKVEEEEEVRLQDCIKSN---VPKTRERVRK 458

BLAST of Csa1G009920 vs. TAIR10
Match: AT5G61370.1 (AT5G61370.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 570.5 bits (1469), Expect = 1.0e-162
Identity = 270/466 (57.94%), Postives = 355/466 (76.18%), Query Frame = 1

Query: 19  LKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCTIGGLDELESSLNKCTISL 78
           ++L   +++  +      FCS H    +  A+ ++  ++S  +GGLD+LE +LN+ ++S 
Sbjct: 6   VRLNRFTYLTSTAKLTRYFCSHHLVDRSETALHEVIRIVSSPVGGLDDLEENLNQVSVSP 65

Query: 79  TSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKDYTAVNIL 138
           +S+LVTQVI+S KNE   RRLLRFF WS K L  +L D++FN  +R  A+KKD+TA+ IL
Sbjct: 66  SSNLVTQVIESCKNETSPRRLLRFFSWSCKSLGSSLHDKEFNYVLRVLAEKKDHTAMQIL 125

Query: 139 LSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVAIITALCS 198
           LS+L+K +RAMD QTF  VAE  VK+ +E++A+G+FK L+K+ CP D FTV AII+ALCS
Sbjct: 126 LSDLRKENRAMDKQTFSIVAETLVKVGKEEDAIGIFKILDKFSCPQDGFTVTAIISALCS 185

Query: 199 KGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFC 258
           +GH KRA GV+ HHKD IS     +YRSLL+GWS+++N KEARR++++MKS G  PDLFC
Sbjct: 186 RGHVKRALGVMHHHKDVISGNELSVYRSLLFGWSVQRNVKEARRVIQDMKSAGITPDLFC 245

Query: 259 YNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRVKESC 318
           +N+ L CLCE+NV +NPSGLVPE+LN+M+EMRSYKI P S+SYNILLSCL +TRRV+ESC
Sbjct: 246 FNSLLTCLCERNVNRNPSGLVPEALNIMLEMRSYKIQPTSMSYNILLSCLGRTRRVRESC 305

Query: 319 KILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFYYDLIGIL 378
           +ILE MKR+GC PD  SYY + RVL+LTGRFGKG +IVDEMIE G  P+RKFYYDLIG+L
Sbjct: 306 QILEQMKRSGCDPDTGSYYFVVRVLYLTGRFGKGNQIVDEMIERGFRPERKFYYDLIGVL 365

Query: 379 CGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSLNC 438
           CGVER N+AL+LFEKMKRSS+GGYG VYD+LIPKLC+GG FE GR+LWEEA+++ V+L+C
Sbjct: 366 CGVERVNFALQLFEKMKRSSVGGYGQVYDLLIPKLCKGGNFEKGRELWEEALSIDVTLSC 425

Query: 439 SSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPKEKRK 485
           S  +LDPS+T+VFKP +  E   + +  +   +  A   K K K K
Sbjct: 426 SISLLDPSVTEVFKPMKMKEEAAMVDRRALNLKIHARMNKTKPKLK 471

BLAST of Csa1G009920 vs. TAIR10
Match: AT5G60960.1 (AT5G60960.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 178.7 bits (452), Expect = 8.6e-45
Identity = 116/415 (27.95%), Postives = 201/415 (48.43%), Query Frame = 1

Query: 76  ISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKDYTAV 135
           I+    L+ Q ++ S      R  L F  W     N +  DE  +  + +F ++KD+  +
Sbjct: 105 ITPNPDLILQTLNLSPEAG--RAALGFNEWLDSNSNFSHTDETVSFFVDYFGRRKDFKGM 164

Query: 136 NILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEK-YKCPHDQFTVVAIIT 195
             ++S  K       G+T     +  V+  R  +    F+ +E  Y    D+ ++  ++ 
Sbjct: 165 LEIISKYKGI---AGGKTLESAIDRLVRAGRPKQVTDFFEKMENDYGLKRDKESLTLVVK 224

Query: 196 ALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMP 255
            LC KGHA  AE +V +  ++I    + I   L+ GW I +   EA R+  EM   G   
Sbjct: 225 KLCEKGHASIAEKMVKNTANEIFPDEN-ICDLLISGWCIAEKLDEATRLAGEMSRGGFEI 284

Query: 256 DLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRV 315
               YN  L C+C+   +K+P  L PE   V++EM    +  N+ ++N+L++ LCK RR 
Sbjct: 285 GTKAYNMMLDCVCKLCRKKDPFKLQPEVEKVLLEMEFRGVPRNTETFNVLINNLCKIRRT 344

Query: 316 KESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTP--DRKFYY 375
           +E+  +   M   GCQPD  +Y ++ R L+   R G+G E++D+M   G     ++K YY
Sbjct: 345 EEAMTLFGRMGEWGCQPDAETYLVLIRSLYQAARIGEGDEMIDKMKSAGYGELLNKKEYY 404

Query: 376 DLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAM 435
             + ILCG+ER  +A+ +F+ MK +        YD+L+ K+C   +      L++EA   
Sbjct: 405 GFLKILCGIERLEHAMSVFKSMKANGCKPGIKTYDLLMGKMCANNQLTRANGLYKEAAKK 464

Query: 436 GVSLNCSSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPKEKRKKGK 488
           G++++     +DP   K  K T+++++ +        K+ +   EK   K+K+ K
Sbjct: 465 GIAVSPKEYRVDPRFMK--KKTKEVDSNV--------KKRETLPEKTARKKKRLK 503

BLAST of Csa1G009920 vs. TAIR10
Match: AT5G65820.1 (AT5G65820.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 135.6 bits (340), Expect = 8.3e-32
Identity = 90/371 (24.26%), Postives = 173/371 (46.63%), Query Frame = 1

Query: 66  ELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRF 125
           +LE +LN+  + L   L+ +V++   +        RFF+W+ K+  +    E + + ++ 
Sbjct: 99  KLELALNESGVELRPGLIERVLNRCGDAGNLG--YRFFVWAAKQPRYCHSIEVYKSMVKI 158

Query: 126 FAQKKDYTAVNILLSNLKKAD-RAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPH 185
            ++ + + AV  L+  ++K + + ++ + F  + + F   D   +A+ +   + K+    
Sbjct: 159 LSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKFGFEP 218

Query: 186 DQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRIL 245
           D++    ++ ALC  G  K A  +    + +    +   + SLLYGW       EA+ +L
Sbjct: 219 DEYVFGCLLDALCKHGSVKDAAKLFEDMRMRFPVNLR-YFTSLLYGWCRVGKMMEAKYVL 278

Query: 246 KEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNIL 305
            +M   G  PD+  Y   L            +G + ++ +++ +MR     PN+  Y +L
Sbjct: 279 VQMNEAGFEPDIVDYTNLLSGYAN-------AGKMADAYDLLRDMRRRGFEPNANCYTVL 338

Query: 306 LSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGL 365
           +  LCK  R++E+ K+   M+R  C+ D V+Y  +       G+  K   ++D+MI++GL
Sbjct: 339 IQALCKVDRMEEAMKVFVEMERYECEADVVTYTALVSGFCKWGKIDKCYIVLDDMIKKGL 398

Query: 366 TPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQ 425
            P    Y  ++      E     LEL EKM++        +Y+V+I   C+ GE +   +
Sbjct: 399 MPSELTYMHIMVAHEKKESFEECLELMEKMRQIEYHPDIGIYNVVIRLACKLGEVKEAVR 458

Query: 426 LWEEAMAMGVS 436
           LW E    G+S
Sbjct: 459 LWNEMEENGLS 459


HSP 2 Score: 47.8 bits (112), Expect = 2.3e-05
Identity = 43/196 (21.94%), Postives = 75/196 (38.27%), Query Frame = 1

Query: 169 EALGLFKNLEKYKCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLL 228
           +A  L +++ +     +      +I ALC     + A  V +  +          Y +L+
Sbjct: 305 DAYDLLRDMRRRGFEPNANCYTVLIQALCKVDRMEEAMKVFVEMERYECEADVVTYTALV 364

Query: 229 YGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMME 288
            G+       +   +L +M   G MP    Y   +        EK  S    E L +M +
Sbjct: 365 SGFCKWGKIDKCYIVLDDMIKKGLMPSELTYMHIMVAH-----EKKES--FEECLELMEK 424

Query: 289 MRSYKISPNSISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGR 348
           MR  +  P+   YN+++   CK   VKE+ ++   M+  G  P   ++ +M   L   G 
Sbjct: 425 MRQIEYHPDIGIYNVVIRLACKLGEVKEAVRLWNEMEENGLSPGVDTFVIMINGLASQGC 484

Query: 349 FGKGREIVDEMIEEGL 365
             +  +   EM+  GL
Sbjct: 485 LLEASDHFKEMVTRGL 493

BLAST of Csa1G009920 vs. TAIR10
Match: AT3G53700.1 (AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 117.9 bits (294), Expect = 1.8e-26
Identity = 89/377 (23.61%), Postives = 165/377 (43.77%), Query Frame = 1

Query: 119 FNNAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLE 178
           FN  I+   +        ++L ++       D +TF  V + +++    D AL + + + 
Sbjct: 192 FNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIEEGDLDGALRIREQMV 251

Query: 179 KYKCPHDQFTVVAIITALCSKGHAKRAEGVV--LHHKDKISSTMSCIYRSLLYGWSIKKN 238
           ++ C     +V  I+   C +G  + A   +  + ++D         + +L+ G     +
Sbjct: 252 EFGCSWSNVSVNVIVHGFCKEGRVEDALNFIQEMSNQDGFFPDQYT-FNTLVNGLCKAGH 311

Query: 239 TKEARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISP 298
            K A  I+  M  +G  PD++ YN+ +  LC+        G V E++ V+ +M +   SP
Sbjct: 312 VKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKL-------GEVKEAVEVLDQMITRDCSP 371

Query: 299 NSISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIV 358
           N+++YN L+S LCK  +V+E+ ++  ++   G  PD  ++  + + L LT       E+ 
Sbjct: 372 NTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELF 431

Query: 359 DEMIEEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRG 418
           +EM  +G  PD   Y  LI  LC   + + AL + ++M+ S        Y+ LI   C+ 
Sbjct: 432 EEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGCARSVITYNTLIDGFCKA 491

Query: 419 GEFEMGRQLWEEAMAMGVSLN-----------CSS-------EILDPSITKVFKPTRKIE 476
            +     ++++E    GVS N           C S       +++D  I +  KP +   
Sbjct: 492 NKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTY 551


HSP 2 Score: 112.5 bits (280), Expect = 7.6e-25
Identity = 80/325 (24.62%), Postives = 139/325 (42.77%), Query Frame = 1

Query: 150 DGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVAIITALCSKGHAKRAEGVV 209
           D  T+  V     K+    EA+ +   +    C  +  T   +I+ LC +   + A  + 
Sbjct: 329 DVYTYNSVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLCKENQVEEATELA 388

Query: 210 LHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLKCLCEK 269
                K      C + SL+ G  + +N + A  + +EM+S G  PD F YN  +  LC K
Sbjct: 389 RVLTSKGILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSK 448

Query: 270 NVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRVKESCKILEMMKRTGC 329
                  G + E+LN++ +M     + + I+YN L+   CK  + +E+ +I + M+  G 
Sbjct: 449 -------GKLDEALNMLKQMELSGCARSVITYNTLIDGFCKANKTREAEEIFDEMEVHGV 508

Query: 330 QPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFYYDLIGILCGVERTNYALE 389
             + V+Y  +   L  + R     +++D+MI EG  PD+  Y  L+   C       A +
Sbjct: 509 SRNSVTYNTLIDGLCKSRRVEDAAQLMDQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAAD 568

Query: 390 LFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSLNCSSEILDPSITK 449
           + + M  +        Y  LI  LC+ G  E+  +L       G+  N +    +P I  
Sbjct: 569 IVQAMTSNGCEPDIVTYGTLISGLCKAGRVEVASKLLRSIQMKGI--NLTPHAYNPVIQG 628

Query: 450 VFKPTRKIENKIVEEFNSAEKQNKA 475
           +F+  +  E   +  F    +QN+A
Sbjct: 629 LFRKRKTTE--AINLFREMLEQNEA 642


HSP 3 Score: 96.7 bits (239), Expect = 4.3e-20
Identity = 76/358 (21.23%), Postives = 152/358 (42.46%), Query Frame = 1

Query: 64  LDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAI 123
           L++++SS  +C +  ++ L+  +I+S         +L    W + +     +   +N  +
Sbjct: 106 LEDMKSS--RCEMGTSTFLI--LIESYAQFELQDEILSVVDWMIDEFGLKPDTHFYNRML 165

Query: 124 RFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCP 183
                      V I  + +       D  TF  + +A  +  +   A+ + +++  Y   
Sbjct: 166 NLLVDGNSLKLVEISHAKMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLV 225

Query: 184 HDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRI 243
            D+ T   ++     +G    A  +     +   S  +     +++G+  +   ++A   
Sbjct: 226 PDEKTFTTVMQGYIEEGDLDGALRIREQMVEFGCSWSNVSVNVIVHGFCKEGRVEDALNF 285

Query: 244 LKEMKS-DGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYN 303
           ++EM + DG  PD + +NT +  LC+       +G V  ++ +M  M      P+  +YN
Sbjct: 286 IQEMSNQDGFFPDQYTFNTLVNGLCK-------AGHVKHAIEIMDVMLQEGYDPDVYTYN 345

Query: 304 ILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEE 363
            ++S LCK   VKE+ ++L+ M    C P+ V+Y  +   L    +  +  E+   +  +
Sbjct: 346 SVISGLCKLGEVKEAVEVLDQMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSK 405

Query: 364 GLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFE 421
           G+ PD   +  LI  LC       A+ELFE+M+          Y++LI  LC  G+ +
Sbjct: 406 GILPDVCTFNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLD 452


HSP 4 Score: 85.5 bits (210), Expect = 9.9e-17
Identity = 62/281 (22.06%), Postives = 119/281 (42.35%), Query Frame = 1

Query: 119 FNNAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLE 178
           FN+ I+     +++     L   ++      D  T+  + ++     + DEAL + K +E
Sbjct: 403 FNSLIQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQME 462

Query: 179 KYKCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTK 238
              C     T   +I   C     + AE +    +    S  S  Y +L+ G    +  +
Sbjct: 463 LSGCARSVITYNTLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVE 522

Query: 239 EARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNS 298
           +A +++ +M  +G  PD + YN+ L   C         G + ++ +++  M S    P+ 
Sbjct: 523 DAAQLMDQMIMEGQKPDKYTYNSLLTHFCR-------GGDIKKAADIVQAMTSNGCEPDI 582

Query: 299 ISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDE 358
           ++Y  L+S LCK  RV+ + K+L  ++  G      +Y  + + LF   +  +   +  E
Sbjct: 583 VTYGTLISGLCKAGRVEVASKLLRSIQMKGINLTPHAYNPVIQGLFRKRKTTEAINLFRE 642

Query: 359 MIEEGLTPDRKFYYDLI--GILCG----VERTNYALELFEK 394
           M+E+   P     Y ++  G+  G     E  ++ +EL EK
Sbjct: 643 MLEQNEAPPDAVSYRIVFRGLCNGGGPIREAVDFLVELLEK 676

BLAST of Csa1G009920 vs. TAIR10
Match: AT3G22670.1 (AT3G22670.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 117.5 bits (293), Expect = 2.4e-26
Identity = 91/400 (22.75%), Postives = 176/400 (44.00%), Query Frame = 1

Query: 50  VSKLCEVISCTIGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKK 109
           + K+C+ ++      +++   L+KC + +T SLV QV+    N     +   FF+W+  +
Sbjct: 102 IDKVCDFLNKKDTSHEDVVKELSKCDVVVTESLVLQVLRRFSNG--WNQAYGFFIWANSQ 161

Query: 110 LNHTLEDEDFNNAIRFFAQKKDYTAVNILLSNLKKADRA--MDGQTFGFVAEAFVKMDRE 169
             +      +N  +    + +++  +  L++ + K + +  +   T   V     K  + 
Sbjct: 162 TGYVHSGHTYNAMVDVLGKCRNFDLMWELVNEMNKNEESKLVTLDTMSKVMRRLAKSGKY 221

Query: 170 DEALGLFKNLEK-YKCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRS 229
           ++A+  F  +EK Y    D   + +++ AL  +   + A  V L   D I       +  
Sbjct: 222 NKAVDAFLEMEKSYGVKTDTIAMNSLMDALVKENSIEHAHEVFLKLFDTIKPDART-FNI 281

Query: 230 LLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVM 289
           L++G+   +   +AR ++  MK     PD+  Y +F++  C++   +  +        ++
Sbjct: 282 LIHGFCKARKFDDARAMMDLMKVTEFTPDVVTYTSFVEAYCKEGDFRRVN-------EML 341

Query: 290 MEMRSYKISPNSISYNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLT 349
            EMR    +PN ++Y I++  L K+++V E+  + E MK  GC PD   Y  +  +L  T
Sbjct: 342 EEMRENGCNPNVVTYTIVMHSLGKSKQVAEALGVYEKMKEDGCVPDAKFYSSLIHILSKT 401

Query: 350 GRFGKGREIVDEMIEEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGP-- 409
           GRF    EI ++M  +G+  D   Y  +I       R   AL L ++M+        P  
Sbjct: 402 GRFKDAAEIFEDMTNQGVRRDVLVYNTMISAALHHSRDEMALRLLKRMEDEEGESCSPNV 461

Query: 410 -VYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSLNCSSEIL 444
             Y  L+   C   + ++   L    +   VS++ S+ IL
Sbjct: 462 ETYAPLLKMCCHKKKMKLLGILLHHMVKNDVSIDVSTYIL 491

BLAST of Csa1G009920 vs. NCBI nr
Match: gi|449441065|ref|XP_004138304.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial [Cucumis sativus])

HSP 1 Score: 983.0 bits (2540), Expect = 1.9e-283
Identity = 487/487 (100.00%), Postives = 487/487 (100.00%), Query Frame = 1

Query: 1   MHARHLFDEMSLRRCVSLLKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCT 60
           MHARHLFDEMSLRRCVSLLKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCT
Sbjct: 1   MHARHLFDEMSLRRCVSLLKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCT 60

Query: 61  IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFN 120
           IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFN
Sbjct: 61  IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFN 120

Query: 121 NAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKY 180
           NAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKY
Sbjct: 121 NAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKY 180

Query: 181 KCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA 240
           KCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA
Sbjct: 181 KCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA 240

Query: 241 RRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSIS 300
           RRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSIS
Sbjct: 241 RRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSIS 300

Query: 301 YNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMI 360
           YNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMI
Sbjct: 301 YNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMI 360

Query: 361 EEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFE 420
           EEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFE
Sbjct: 361 EEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFE 420

Query: 421 MGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPK 480
           MGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPK
Sbjct: 421 MGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPK 480

Query: 481 EKRKKGK 488
           EKRKKGK
Sbjct: 481 EKRKKGK 487

BLAST of Csa1G009920 vs. NCBI nr
Match: gi|659105975|ref|XP_008453221.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial [Cucumis melo])

HSP 1 Score: 946.4 bits (2445), Expect = 1.9e-272
Identity = 464/486 (95.47%), Postives = 481/486 (98.97%), Query Frame = 1

Query: 1   MHARHLFDEMSLRRCVSLLKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCT 60
           MHARH+FDEM LRRCVSLLKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCT
Sbjct: 1   MHARHMFDEMPLRRCVSLLKLKWDSFIAQSVCTQHRFCSLHSTVNNGAAVSKLCEVISCT 60

Query: 61  IGGLDELESSLNKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFN 120
           IGGLDELESSLN+CTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFN
Sbjct: 61  IGGLDELESSLNQCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFN 120

Query: 121 NAIRFFAQKKDYTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKY 180
           NAIRFFAQKKDYTA+NILLSNLKKADRAMDGQTF FVAEAFVKM+R+DEALGLFKNLEKY
Sbjct: 121 NAIRFFAQKKDYTAINILLSNLKKADRAMDGQTFSFVAEAFVKMNRDDEALGLFKNLEKY 180

Query: 181 KCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA 240
           KCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA
Sbjct: 181 KCPHDQFTVVAIITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEA 240

Query: 241 RRILKEMKSDGTMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSIS 300
           RRILKEMKSDGTMPDLF YNTFLKCLCEKNVEKNPSGLVPE+LNVMMEMRSYKI+PNSIS
Sbjct: 241 RRILKEMKSDGTMPDLFSYNTFLKCLCEKNVEKNPSGLVPEALNVMMEMRSYKIAPNSIS 300

Query: 301 YNILLSCLCKTRRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMI 360
           YNILLSCLCKTRRVKESC+ILEMMKR+GC+PDCVSYYL+ARVLFLTGRFGKGREIVDEMI
Sbjct: 301 YNILLSCLCKTRRVKESCRILEMMKRSGCRPDCVSYYLVARVLFLTGRFGKGREIVDEMI 360

Query: 361 EEGLTPDRKFYYDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFE 420
           EEGLTPDRKFYY+LIGILCGVERTNYA+ELFEKMKRSSLGGYGPVYDVLIPK+CRGG+FE
Sbjct: 361 EEGLTPDRKFYYELIGILCGVERTNYAVELFEKMKRSSLGGYGPVYDVLIPKVCRGGDFE 420

Query: 421 MGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKIVEEFNSAEKQNKAAAEKPK 480
           MGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKIVEE N+AEKQNKAAAEKP 
Sbjct: 421 MGRQLWEEAMAMGVSLNCSSEILDPSITKVFKPTRKIENKIVEECNTAEKQNKAAAEKPN 480

Query: 481 EKRKKG 487
           +KRKKG
Sbjct: 481 KKRKKG 486

BLAST of Csa1G009920 vs. NCBI nr
Match: gi|568884695|ref|XP_006494986.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial [Citrus sinensis])

HSP 1 Score: 637.9 bits (1644), Expect = 1.5e-179
Identity = 318/481 (66.11%), Postives = 392/481 (81.50%), Query Frame = 1

Query: 18  LLKLKWDSFIAQSVCT---QHRFCSLHSTVNNGAA---VSKLCEVISCTIGGLDELESSL 77
           LLK KW + + Q V T   +H   SL+STV +      + +LC+V+S TIGGLD+LE SL
Sbjct: 9   LLKPKWRTLLLQRVDTHNFEHLLLSLYSTVPSNQVSHELKELCKVVSSTIGGLDDLELSL 68

Query: 78  NKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKD 137
           N+ T SLTSSLVTQVIDS K EAPTRRLLRFFLWS K ++ +LED+D+N+AIR FA+K+D
Sbjct: 69  NQFTGSLTSSLVTQVIDSCKQEAPTRRLLRFFLWSCKNMSASLEDKDYNHAIRVFAEKRD 128

Query: 138 YTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVA 197
           +TA+NIL+S+L+K  R M+ Q+FG + E  VK+ REDEALG+FKNLEK+KC  D  TV A
Sbjct: 129 HTAMNILVSDLRKEGRVMESQSFGVLVETLVKLGREDEALGIFKNLEKFKCVQDSVTVSA 188

Query: 198 IITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDG 257
           I++ALC+KGHA+RAEGVV HHKDKIS    CIYRSL+YGWS+++N K AR+I+KEMKS G
Sbjct: 189 IVSALCAKGHARRAEGVVYHHKDKISGVELCIYRSLIYGWSMQENVKAARKIIKEMKSAG 248

Query: 258 TMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKT 317
            MPDLFCYNTFL+ LCE+N+++NPSGLVPE+LNVMMEMRSY+I+P SISYNILLSCL +T
Sbjct: 249 IMPDLFCYNTFLRGLCERNLKRNPSGLVPEALNVMMEMRSYRIAPTSISYNILLSCLGRT 308

Query: 318 RRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFY 377
           RRVKESC++LE MK++GC PD VSYYL+ARVL+L+GRFGKG +IVDEMIEEGL PDRKFY
Sbjct: 309 RRVKESCQVLEQMKKSGCAPDWVSYYLVARVLYLSGRFGKGNKIVDEMIEEGLIPDRKFY 368

Query: 378 YDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMA 437
           YDLIGILCGVER N+ALELFE+MKRSSLGGYGPVYDVLIPK+CRGG+F  GR+LW+EAM 
Sbjct: 369 YDLIGILCGVERVNFALELFERMKRSSLGGYGPVYDVLIPKVCRGGDFVKGRELWDEAMV 428

Query: 438 MGVSLNCSSEILDPSITKVFKPTRK-IENKIVEEFNSAEKQNKAAA----EKPKEKRKKG 488
           MG++L+CSS +LDPSI +VF+P RK  E+ +     + E Q K       +K K K+KK 
Sbjct: 429 MGLTLSCSSNVLDPSIIEVFQPRRKPTESCLGSTTPNIETQVKKTVIEVDKKKKSKKKKN 488

BLAST of Csa1G009920 vs. NCBI nr
Match: gi|590668221|ref|XP_007037432.1| (Pentatricopeptide repeat superfamily protein [Theobroma cacao])

HSP 1 Score: 635.2 bits (1637), Expect = 9.5e-179
Identity = 307/470 (65.32%), Postives = 384/470 (81.70%), Query Frame = 1

Query: 26  FIAQSVCTQ---HRFCSLHSTVNNGAAVSKLCEVISCTIGGLDELESSLNKCTISLTSSL 85
           F+ Q + TQ   + F S HST+       +LC+V+S ++GGLD+LESSLN+  +SL+  L
Sbjct: 13  FLTQVITTQKPKNLFHSPHSTITTPPEFEELCKVVSSSMGGLDDLESSLNRFKLSLSPLL 72

Query: 86  VTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKDYTAVNILLSNL 145
           VTQVI+S +NEAPTRRLLRFFLWS+K L+ +LED+D NN +R FA+KKD+TA+ IL+S++
Sbjct: 73  VTQVINSCENEAPTRRLLRFFLWSVKNLSSSLEDKDLNNVVRVFAKKKDHTAMGILVSDI 132

Query: 146 KKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVAIITALCSKGHA 205
           +   R M+ QTF  VAE  VK+ REDEALG+FKNLEK+KCP D F++ AI+ ALC+KGHA
Sbjct: 133 RNRGRTMESQTFSVVAEMLVKLGREDEALGIFKNLEKFKCPRDSFSLTAIVNALCAKGHA 192

Query: 206 KRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDGTMPDLFCYNTF 265
           ++AEGVV HHKD I+    CIYR LLYGWS+++N KEARR++KEMKS G   DL+CYNTF
Sbjct: 193 RKAEGVVYHHKDTIAGVEPCIYRCLLYGWSVQENVKEARRVIKEMKSAGFELDLYCYNTF 252

Query: 266 LKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKTRRVKESCKILE 325
           L+CLC KN ++NPSGLVPE+LNVMMEMRS +I+P S+SYNILLSCL +TRRVKESC+ILE
Sbjct: 253 LRCLCGKNAKRNPSGLVPEALNVMMEMRSQRIAPTSVSYNILLSCLGRTRRVKESCQILE 312

Query: 326 MMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFYYDLIGILCGVE 385
           +MK+ GC PD +SYYL+ARVL+LTGRFGKG +IVDEMIE+GLTPDRKFYYDLIG+LCGVE
Sbjct: 313 LMKKAGCAPDWISYYLVARVLYLTGRFGKGNKIVDEMIEQGLTPDRKFYYDLIGVLCGVE 372

Query: 386 RTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMAMGVSLNCSSEI 445
           R N+ALELFE+MKRSSLGGYGPVYDVLIPKLCRGG+FE GR+LW+EA+A GVSL+CSS++
Sbjct: 373 RVNFALELFERMKRSSLGGYGPVYDVLIPKLCRGGDFEKGRELWDEAVATGVSLSCSSDV 432

Query: 446 LDPSITKVFKPTRKIENKIVEEFNSAE-----KQNKAAAEKPKEKRKKGK 488
           LDPSIT+VFKPTRK E   ++    A+     KQN    +K K+ +KK K
Sbjct: 433 LDPSITEVFKPTRKAEKVHLKGCTMAKSPVKNKQNTMKGKKYKKIKKKKK 482

BLAST of Csa1G009920 vs. NCBI nr
Match: gi|567896330|ref|XP_006440653.1| (hypothetical protein CICLE_v10023621mg [Citrus clementina])

HSP 1 Score: 635.2 bits (1637), Expect = 9.5e-179
Identity = 319/481 (66.32%), Postives = 390/481 (81.08%), Query Frame = 1

Query: 18  LLKLKWDSFIAQSVCTQ---HRFCSLHSTVNNGAA---VSKLCEVISCTIGGLDELESSL 77
           LLK KW S + Q V TQ   H   SL+S V +      + +LC+V+S TIGGLD+LE SL
Sbjct: 4   LLKPKWRSLLLQRVDTQKSEHLLLSLYSMVPSNQVSHELKELCKVVSSTIGGLDDLELSL 63

Query: 78  NKCTISLTSSLVTQVIDSSKNEAPTRRLLRFFLWSLKKLNHTLEDEDFNNAIRFFAQKKD 137
           N+ T SL+SSLVTQVIDS K+EAPTRRLLRFFLWS K L+ +LED+D+N+AIR FA+KKD
Sbjct: 64  NQFTGSLSSSLVTQVIDSCKHEAPTRRLLRFFLWSCKNLSASLEDKDYNHAIRVFAEKKD 123

Query: 138 YTAVNILLSNLKKADRAMDGQTFGFVAEAFVKMDREDEALGLFKNLEKYKCPHDQFTVVA 197
           + A+NIL+S+L+K  R M+ Q+FG + E  VK+ REDEALG+FKNLEK+KC  D  TV A
Sbjct: 124 HMAMNILVSDLRKEGRVMETQSFGVLVETLVKLGREDEALGIFKNLEKFKCVQDSVTVSA 183

Query: 198 IITALCSKGHAKRAEGVVLHHKDKISSTMSCIYRSLLYGWSIKKNTKEARRILKEMKSDG 257
           I++ALC+KGHA+RAEGVV HHKDKIS    CIYRSL+YGWS+++N K AR+I+KEMKS G
Sbjct: 184 IVSALCAKGHARRAEGVVYHHKDKISGVELCIYRSLIYGWSMQENVKAARKIIKEMKSAG 243

Query: 258 TMPDLFCYNTFLKCLCEKNVEKNPSGLVPESLNVMMEMRSYKISPNSISYNILLSCLCKT 317
            MPDLFCYNTFL+ LCE+N+++NPSGLVPE+LNVMMEMRSY+I+P SISYNILLSCL +T
Sbjct: 244 FMPDLFCYNTFLRGLCERNLKRNPSGLVPEALNVMMEMRSYRIAPTSISYNILLSCLGRT 303

Query: 318 RRVKESCKILEMMKRTGCQPDCVSYYLMARVLFLTGRFGKGREIVDEMIEEGLTPDRKFY 377
           RRVKESC++LE MK++GC PD VSYYL+ARVL+L+GRFGKG +IVDEMIEEGL PDRKFY
Sbjct: 304 RRVKESCRVLEQMKKSGCAPDWVSYYLVARVLYLSGRFGKGNKIVDEMIEEGLIPDRKFY 363

Query: 378 YDLIGILCGVERTNYALELFEKMKRSSLGGYGPVYDVLIPKLCRGGEFEMGRQLWEEAMA 437
           YDLIGILCGVER N+ALELFE+MKRSSLGGYGPVYDVLIPK+C+GG+F  GR+LW+EAM 
Sbjct: 364 YDLIGILCGVERVNFALELFERMKRSSLGGYGPVYDVLIPKVCQGGDFVKGRELWDEAMV 423

Query: 438 MGVSLNCSSEILDPSITKVFKPTRK-IENKIVEEFNSAEKQNKAAA----EKPKEKRKKG 488
           MG++L+CSS +LDPSIT+VF P RK  E  +     + E Q K       +K K K+KK 
Sbjct: 424 MGLTLSCSSNVLDPSITEVFHPRRKPTEGCLGSTTPNIEAQVKKTVIEVDKKKKSKKKKN 483

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP439_ARATH1.8e-16157.94Pentatricopeptide repeat-containing protein At5g61370, mitochondrial OS=Arabidop... [more]
PP438_ARATH1.5e-4327.95Pentatricopeptide repeat-containing protein PNM1, mitochondrial OS=Arabidopsis t... [more]
PP447_ARATH1.5e-3024.26Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
PP281_ARATH3.2e-2523.61Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop... [more]
PP248_ARATH4.2e-2522.75Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
V4T153_9ROSI6.6e-17966.32Uncharacterized protein OS=Citrus clementina GN=CICLE_v10023621mg PE=4 SV=1[more]
A0A061FXY0_THECC6.6e-17965.32Pentatricopeptide repeat superfamily protein OS=Theobroma cacao GN=TCM_014106 PE... [more]
K7MKY1_SOYBN9.0e-17664.21Uncharacterized protein OS=Glycine max GN=GLYMA_17G102300 PE=4 SV=1[more]
W9QUA2_9ROSA4.4e-17567.47Uncharacterized protein OS=Morus notabilis GN=L484_008580 PE=4 SV=1[more]
B9RLV3_RICCO1.7e-17466.45Pentatricopeptide repeat-containing protein, putative OS=Ricinus communis GN=RCO... [more]
Match NameE-valueIdentityDescription
AT5G61370.11.0e-16257.94 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G60960.18.6e-4527.95 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G65820.18.3e-3224.26 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G53700.11.8e-2623.61 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G22670.12.4e-2622.75 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449441065|ref|XP_004138304.1|1.9e-283100.00PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial ... [more]
gi|659105975|ref|XP_008453221.1|1.9e-27295.47PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial ... [more]
gi|568884695|ref|XP_006494986.1|1.5e-17966.11PREDICTED: pentatricopeptide repeat-containing protein At5g61370, mitochondrial ... [more]
gi|590668221|ref|XP_007037432.1|9.5e-17965.32Pentatricopeptide repeat superfamily protein [Theobroma cacao][more]
gi|567896330|ref|XP_006440653.1|9.5e-17966.32hypothetical protein CICLE_v10023621mg [Citrus clementina][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0008152 metabolic process
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005794 Golgi apparatus
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding
molecular_function GO:0016757 transferase activity, transferring glycosyl groups

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa1G009920.1Csa1G009920.1mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 374..397
score:
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 296..340
score: 4.1
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 224..265
score: 8.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 223..255
score: 0.0024coord: 334..367
score: 2.0E-4coord: 405..438
score: 9.8E-4coord: 299..333
score: 3.9
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 185..215
score: 5.678coord: 220..254
score: 8.484coord: 150..184
score: 7.278coord: 297..331
score: 12.803coord: 255..296
score: 6.292coord: 332..366
score: 9.482coord: 402..436
score: 7.87coord: 367..401
score: 7
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 78..464
score: 1.0E
NoneNo IPR availablePANTHERPTHR24015:SF620SUBFAMILY NOT NAMEDcoord: 78..464
score: 1.0E

The following gene(s) are paralogous to this gene:

None