CSPI04G21480 (gene) Wild cucumber (PI 183967)

NameCSPI04G21480
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionPentatricopeptide repeat-containing protein
LocationChr4 : 19961561 .. 19964171 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTTCCTCCACTTAATCCCCCATTTTTCTTATTCCTTTCCTCCCTTACTCATTTCCTTCCCCCCAAATGGCCCTTTCCAGAAAAACCCGCGTATGGTCAATGTAAGTGCGTGTTGTAACGCAATAAGAAGGGAGCGTGAGATTCGAAGAGGGTTGGATAACGAGAAGTGGAGGGTCTGGTCGAGAGAGGAGCCAAGTCATTTGAACAAAAGCTTCACAAGGAAAAGGGGTTATCTATTACAAAACCCAACCACCCAAGTACTGAGATTTCAATTTTCAGCTCTTCAAGATTCATTGCTTTTCGAAGTCGAAGGGAGGTATTCGCTTTGAGATTGATTTCTCATTCACTGCATGTTATGGTTTTTTCAATCTATTTATCTTCTGCTTATAATTTCATTTGATTATACCGAGTTTTAGACGATTGAAGAAGTTCCTATAACATGGCGTTGATTTGAGAGTTCTGGTTCTGATATTAAATGCCAAAAAGAGAGCGATTCTTGTGAAGAAGTGGGATTCTACTGAGTTTCTTTCACTCAATGGCGACTCTGCTCAATACAGTTTCTCCAATTACAAACCCGTCACCAGAAACCACAAGAAGAGGATGTGGGTTCTTTTCCCATATCCCAAATATCCAGAAGCTCTCACTTAACAAGGGATTTTCTAAAGTTTTAGCATCAACCCAGATTACCATTTCTCCAAAGGACACCATTTTCACACTTCCAAATTGGAAGATTGGGAAACTTGATCAAAAGAGTAAAGAACTTAGACTTAATGATGCTTTTTTTCATTTAGAGTTCATGGTTGAGAAGGGGCAAAAGCCTGATGTATTTCAAGCAACTCAGTTATTGTATGATCTCTGTAAGACATGTAAGATGAGAAAAGCTATTAAGGTAATGGAGATGATGATTGGGTCTGGAATCATTCCAGATGCAGCATCTTATACCTTTTTGGTTAGTTCTTTGTGTAGAAAAGGGAATGTTGGTTATGCAATGCAACTAGTGGACAAAATGGAGGAATATGGTTATCCTACTAACACTGCTACTTATAATTCACTTGTGAGAGGGCTTTGTATGCATGGAAACTTGACTCAGAGCTTGCAACTTTTAGACAGATTAATCCAGAAGGGGTTGGTTCCTAATGCTTATACATACTCTTTTTTGCTTGAAGCTGCATACAAGGAAAGAGGAGCTGATGAAGCAAGTAAGCTTTTGGATGAGATAATTGCAAAGGGTGGGAAACCTAACTTGGTTAGCTACAATGTTTTGTTGACTGGGTTGTGCAAAGAAGGTAGGACAGAGGATGCCATGCAGTTATTTAGGGAATTGCCTTCTAAGGGATTCAGTCCAAATGTTGTCAGTTATAATATCTTGCTAAGGAGTTTGTGCAATGAAGGGAGGTGGGAAGAGGCAAATGTGCTTCTAGCTGAAATGGATGGCGATGAACGATCCCCTTCAACTGTCACTTACAATATATTGATTGGTTCACTTACTCTTCATGGCAGAACAGAACATGCTCTTGAGGTTTTGGAAGAGATGATTAGGGCACGATTCAAGCCAACAGCTTCTAGCTACAATCCGATAATTGCTCGTCTTTGCAAAGATAGGAAAGTAGATCTTGTTGTAAAGTGTCTGGACCAAATGATGTATAGGCATTGCAATCCGAATGAAGGAACATACAATGCCATTGCTACACTTTGTGAAGAGGGTATGGTTCAAGAAGCATTCTCCATTATACAGAGTTTGGGCAACAAGCAACATTTCTCTACTCAAGAATTCTATAAAATTGTTATTACCAGCCTGTGTCGTAAAGGAAACACATATCCAGCATTCCAGCTTCTCTATGAAATGACAAAGTATGGGTTTACACCCGATTCTTTTACCTATTCGTCTTTGATCCGAGGGTTATGCATGGAGGGTATGTTGAATGAGGCAATTGAAATATTCAGTGTAATGGAGGAAAATATCAAGCTTGATACTGAGAATTACAATTCACTCATTCTTGGTTGCTGCAAATCTCGAAGAACTGATTTGGCCTTGGACGTATTCGAAATAATGGTTGGTAAAGGTTATCTGGCCAATGAAACGACATACACCATTCTTGTGGAAGGTATCATCCATGAAAAAGAGATGGATCTAGCAACCGAAGTACTGAGAGAGTTGCAACTGAGGGATGTTATAAATCAAAGCACAGTGGAAAGACTTGTAATGCAGTATGACTTAAACGAATTGCCATTGTGATAATTTCAGTTGACTTGCTACCAGCAATTGGAAACCTAGCAGACAGTTCATCATGAAGATCAAGTAATTGTTAGTATGACAGAGAAAAAAAAAAGTAAAGAAAAAGGTACTTCCACATCCGTCTCATCCCATATTTTGTTTGATCTTATTACTTTGTCTGACAACTGAAAGCTTGTGAATCTGACAAGAGGGTGCTGTTTTTTTTAGTTGCATTATTTACAACTGAGTACATGTTTTTTAGTTCACTAGGCTTCTTATATGTACATAAAATTACTCCATCCAGTTCTTTACTTTCTCTTTCTATTGAGTTTTTTTGCCCATCTGTGGCCAATTTCATTCTTACAAAAGTTTTGGTAGTTGAAAGGAAACCTTATG

mRNA sequence

ATGGCGACTCTGCTCAATACAGTTTCTCCAATTACAAACCCGTCACCAGAAACCACAAGAAGAGGATGTGGGTTCTTTTCCCATATCCCAAATATCCAGAAGCTCTCACTTAACAAGGGATTTTCTAAAGTTTTAGCATCAACCCAGATTACCATTTCTCCAAAGGACACCATTTTCACACTTCCAAATTGGAAGATTGGGAAACTTGATCAAAAGAGTAAAGAACTTAGACTTAATGATGCTTTTTTTCATTTAGAGTTCATGGTTGAGAAGGGGCAAAAGCCTGATGTATTTCAAGCAACTCAGTTATTGTATGATCTCTGTAAGACATGTAAGATGAGAAAAGCTATTAAGGTAATGGAGATGATGATTGGGTCTGGAATCATTCCAGATGCAGCATCTTATACCTTTTTGGTTAGTTCTTTGTGTAGAAAAGGGAATGTTGGTTATGCAATGCAACTAGTGGACAAAATGGAGGAATATGGTTATCCTACTAACACTGCTACTTATAATTCACTTGTGAGAGGGCTTTGTATGCATGGAAACTTGACTCAGAGCTTGCAACTTTTAGACAGATTAATCCAGAAGGGGTTGGTTCCTAATGCTTATACATACTCTTTTTTGCTTGAAGCTGCATACAAGGAAAGAGGAGCTGATGAAGCAAGTAAGCTTTTGGATGAGATAATTGCAAAGGGTGGGAAACCTAACTTGGTTAGCTACAATGTTTTGTTGACTGGGTTGTGCAAAGAAGGTAGGACAGAGGATGCCATGCAGTTATTTAGGGAATTGCCTTCTAAGGGATTCAGTCCAAATGTTGTCAGTTATAATATCTTGCTAAGGAGTTTGTGCAATGAAGGGAGGTGGGAAGAGGCAAATGTGCTTCTAGCTGAAATGGATGGCGATGAACGATCCCCTTCAACTGTCACTTACAATATATTGATTGGTTCACTTACTCTTCATGGCAGAACAGAACATGCTCTTGAGGTTTTGGAAGAGATGATTAGGGCACGATTCAAGCCAACAGCTTCTAGCTACAATCCGATAATTGCTCGTCTTTGCAAAGATAGGAAAGTAGATCTTGTTGTAAAGTGTCTGGACCAAATGATGTATAGGCATTGCAATCCGAATGAAGGAACATACAATGCCATTGCTACACTTTGTGAAGAGGGTATGGTTCAAGAAGCATTCTCCATTATACAGAGTTTGGGCAACAAGCAACATTTCTCTACTCAAGAATTCTATAAAATTGTTATTACCAGCCTGTGTCGTAAAGGAAACACATATCCAGCATTCCAGCTTCTCTATGAAATGACAAAGTATGGGTTTACACCCGATTCTTTTACCTATTCGTCTTTGATCCGAGGGTTATGCATGGAGGGTATGTTGAATGAGGCAATTGAAATATTCAGTGTAATGGAGGAAAATATCAAGCTTGATACTGAGAATTACAATTCACTCATTCTTGGTTGCTGCAAATCTCGAAGAACTGATTTGGCCTTGGACGTATTCGAAATAATGGTTGGTAAAGGTTATCTGGCCAATGAAACGACATACACCATTCTTGTGGAAGGTATCATCCATGAAAAAGAGATGGATCTAGCAACCGAAGTACTGAGAGAGTTGCAACTGAGGGATGTTATAAATCAAAGCACAGTGGAAAGACTTGTAATGCAGTATGACTTAAACGAATTGCCATTGTGA

Coding sequence (CDS)

ATGGCGACTCTGCTCAATACAGTTTCTCCAATTACAAACCCGTCACCAGAAACCACAAGAAGAGGATGTGGGTTCTTTTCCCATATCCCAAATATCCAGAAGCTCTCACTTAACAAGGGATTTTCTAAAGTTTTAGCATCAACCCAGATTACCATTTCTCCAAAGGACACCATTTTCACACTTCCAAATTGGAAGATTGGGAAACTTGATCAAAAGAGTAAAGAACTTAGACTTAATGATGCTTTTTTTCATTTAGAGTTCATGGTTGAGAAGGGGCAAAAGCCTGATGTATTTCAAGCAACTCAGTTATTGTATGATCTCTGTAAGACATGTAAGATGAGAAAAGCTATTAAGGTAATGGAGATGATGATTGGGTCTGGAATCATTCCAGATGCAGCATCTTATACCTTTTTGGTTAGTTCTTTGTGTAGAAAAGGGAATGTTGGTTATGCAATGCAACTAGTGGACAAAATGGAGGAATATGGTTATCCTACTAACACTGCTACTTATAATTCACTTGTGAGAGGGCTTTGTATGCATGGAAACTTGACTCAGAGCTTGCAACTTTTAGACAGATTAATCCAGAAGGGGTTGGTTCCTAATGCTTATACATACTCTTTTTTGCTTGAAGCTGCATACAAGGAAAGAGGAGCTGATGAAGCAAGTAAGCTTTTGGATGAGATAATTGCAAAGGGTGGGAAACCTAACTTGGTTAGCTACAATGTTTTGTTGACTGGGTTGTGCAAAGAAGGTAGGACAGAGGATGCCATGCAGTTATTTAGGGAATTGCCTTCTAAGGGATTCAGTCCAAATGTTGTCAGTTATAATATCTTGCTAAGGAGTTTGTGCAATGAAGGGAGGTGGGAAGAGGCAAATGTGCTTCTAGCTGAAATGGATGGCGATGAACGATCCCCTTCAACTGTCACTTACAATATATTGATTGGTTCACTTACTCTTCATGGCAGAACAGAACATGCTCTTGAGGTTTTGGAAGAGATGATTAGGGCACGATTCAAGCCAACAGCTTCTAGCTACAATCCGATAATTGCTCGTCTTTGCAAAGATAGGAAAGTAGATCTTGTTGTAAAGTGTCTGGACCAAATGATGTATAGGCATTGCAATCCGAATGAAGGAACATACAATGCCATTGCTACACTTTGTGAAGAGGGTATGGTTCAAGAAGCATTCTCCATTATACAGAGTTTGGGCAACAAGCAACATTTCTCTACTCAAGAATTCTATAAAATTGTTATTACCAGCCTGTGTCGTAAAGGAAACACATATCCAGCATTCCAGCTTCTCTATGAAATGACAAAGTATGGGTTTACACCCGATTCTTTTACCTATTCGTCTTTGATCCGAGGGTTATGCATGGAGGGTATGTTGAATGAGGCAATTGAAATATTCAGTGTAATGGAGGAAAATATCAAGCTTGATACTGAGAATTACAATTCACTCATTCTTGGTTGCTGCAAATCTCGAAGAACTGATTTGGCCTTGGACGTATTCGAAATAATGGTTGGTAAAGGTTATCTGGCCAATGAAACGACATACACCATTCTTGTGGAAGGTATCATCCATGAAAAAGAGATGGATCTAGCAACCGAAGTACTGAGAGAGTTGCAACTGAGGGATGTTATAAATCAAAGCACAGTGGAAAGACTTGTAATGCAGTATGACTTAAACGAATTGCCATTGTGA
BLAST of CSPI04G21480 vs. Swiss-Prot
Match: PP131_ARATH (Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidopsis thaliana GN=At1g79080 PE=2 SV=1)

HSP 1 Score: 740.0 bits (1909), Expect = 2.0e-212
Identity = 376/577 (65.16%), Postives = 464/577 (80.42%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPN--IQKLSLNKGFSKVLASTQITISPKDTI 60
           M+TLLN+V  + +P   + R+  GF SHIP+  +   S++KG ++VLASTQIT+SPKD+ 
Sbjct: 1   MSTLLNSVLSMASPE-SSPRKAVGFVSHIPSGFLHFSSVSKGVARVLASTQITLSPKDSA 60

Query: 61  FTLP--NWK----IGKL--DQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKT 120
           FT+   +WK     G    D +S E  L+D+F HLE +V  G KP+V  +TQLLYDLCK 
Sbjct: 61  FTITGSSWKPDLDSGSFSDDPRSDEPNLSDSFSHLESLVTGGHKPNVAHSTQLLYDLCKA 120

Query: 121 CKMRKAIKVMEMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATY 180
            +++KAI+V+E+M+ SGIIPDA++YT+LV+ LC++GNVGYAMQLV+KME++GYP+NT TY
Sbjct: 121 NRLKKAIRVIELMVSSGIIPDASAYTYLVNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTY 180

Query: 181 NSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIA 240
           N+LVRGLCM G+L QSLQ ++RL+QKGL PNA+TYSFLLEAAYKERG DEA KLLDEII 
Sbjct: 181 NALVRGLCMLGSLNQSLQFVERLMQKGLAPNAFTYSFLLEAAYKERGTDEAVKLLDEIIV 240

Query: 241 KGGKPNLVSYNVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEE 300
           KGG+PNLVSYNVLLTG CKEGRT+DAM LFRELP+KGF  NVVSYNILLR LC +GRWEE
Sbjct: 241 KGGEPNLVSYNVLLTGFCKEGRTDDAMALFRELPAKGFKANVVSYNILLRCLCCDGRWEE 300

Query: 301 ANVLLAEMDGDERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRA--RFKPTASSYNPI 360
           AN LLAEMDG +R+PS VTYNILI SL  HGRTE AL+VL+EM +   +F+ TA+SYNP+
Sbjct: 301 ANSLLAEMDGGDRAPSVVTYNILINSLAFHGRTEQALQVLKEMSKGNHQFRVTATSYNPV 360

Query: 361 IARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIATLCE-EGMVQEAFSIIQSLGNKQH 420
           IARLCK+ KVDLVVKCLD+M+YR C PNEGTYNAI +LCE    VQEAF IIQSL NKQ 
Sbjct: 361 IARLCKEGKVDLVVKCLDEMIYRRCKPNEGTYNAIGSLCEHNSKVQEAFYIIQSLSNKQK 420

Query: 421 FSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIE 480
             T +FYK VITSLCRKGNT+ AFQLLYEMT+ GF PD+ TYS+LIRGLC+EGM   A+E
Sbjct: 421 CCTHDFYKSVITSLCRKGNTFAAFQLLYEMTRCGFDPDAHTYSALIRGLCLEGMFTGAME 480

Query: 481 IFSVME--ENIKLDTENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGI 540
           + S+ME  EN K   +N+N++ILG CK RRTDLA++VFE+MV K  + NETTY ILVEGI
Sbjct: 481 VLSIMEESENCKPTVDNFNAMILGLCKIRRTDLAMEVFEMMVEKKRMPNETTYAILVEGI 540

Query: 541 IHEKEMDLATEVLRELQLRDVINQSTVERLVMQYDLN 563
            HE E++LA EVL EL+LR VI Q+ V+R+VMQ++L+
Sbjct: 541 AHEDELELAKEVLDELRLRKVIGQNAVDRIVMQFNLD 576

BLAST of CSPI04G21480 vs. Swiss-Prot
Match: PPR28_ARATH (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 375.2 bits (962), Expect = 1.3e-102
Identity = 192/489 (39.26%), Postives = 302/489 (61.76%), Query Frame = 1

Query: 69  LDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVMEMMIGSGI 128
           L Q  +   L + F  LE MV  G  PD+   T L+   C+  K RKA K++E++ GSG 
Sbjct: 109 LRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGA 168

Query: 129 IPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMHGNLTQSLQ 188
           +PD  +Y  ++S  C+ G +  A+ ++D+M       +  TYN+++R LC  G L Q+++
Sbjct: 169 VPDVITYNVMISGYCKAGEINNALSVLDRMS---VSPDVVTYNTILRSLCDSGKLKQAME 228

Query: 189 LLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSYNVLLTGLC 248
           +LDR++Q+   P+  TY+ L+EA  ++ G   A KLLDE+  +G  P++V+YNVL+ G+C
Sbjct: 229 VLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGIC 288

Query: 249 KEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDGDERSPSTV 308
           KEGR ++A++   ++PS G  PNV+++NI+LRS+C+ GRW +A  LLA+M     SPS V
Sbjct: 289 KEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVV 348

Query: 309 TYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQM 368
           T+NILI  L   G    A+++LE+M +   +P + SYNP++   CK++K+D  ++ L++M
Sbjct: 349 TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERM 408

Query: 369 MYRHCNPNEGTYNAIAT-LCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITSLCRKGNT 428
           + R C P+  TYN + T LC++G V++A  I+  L +K        Y  VI  L + G T
Sbjct: 409 VSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKT 468

Query: 429 YPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEE-NIKLDTENYNSL 488
             A +LL EM      PD+ TYSSL+ GL  EG ++EAI+ F   E   I+ +   +NS+
Sbjct: 469 GKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSI 528

Query: 489 ILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRELQLRDV 548
           +LG CKSR+TD A+D    M+ +G   NET+YTIL+EG+ +E     A E+L EL  + +
Sbjct: 529 MLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGL 588

Query: 549 INQSTVERL 556
           + +S+ E++
Sbjct: 589 MKKSSAEQV 594

BLAST of CSPI04G21480 vs. Swiss-Prot
Match: PP213_ARATH (Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidopsis thaliana GN=At3g04760 PE=2 SV=1)

HSP 1 Score: 327.4 bits (838), Expect = 3.1e-88
Identity = 174/479 (36.33%), Postives = 273/479 (56.99%), Query Frame = 1

Query: 85  LEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVMEMMIGSGIIPDAASYTFLVSSLCR 144
           LE MV KG  PDV   T+L+        + KA++VME++   G  PD  +Y  L++  C+
Sbjct: 112 LETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQ-PDVFAYNALINGFCK 171

Query: 145 KGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYT 204
              +  A +++D+M    +  +T TYN ++  LC  G L  +L++L++L+     P   T
Sbjct: 172 MNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVIT 231

Query: 205 YSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAMQLFRELP 264
           Y+ L+EA   E G DEA KL+DE++++G KP++ +YN ++ G+CKEG  + A ++ R L 
Sbjct: 232 YTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLE 291

Query: 265 SKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDGDERSPSTVTYNILIGSLTLHGRTE 324
            KG  P+V+SYNILLR+L N+G+WEE   L+ +M  ++  P+ VTY+ILI +L   G+ E
Sbjct: 292 LKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIE 351

Query: 325 HALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAI- 384
            A+ +L+ M      P A SY+P+IA  C++ ++D+ ++ L+ M+   C P+   YN + 
Sbjct: 352 EAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVL 411

Query: 385 ATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFT 444
           ATLC+ G   +A  I   LG          Y  + ++L   G+   A  ++ EM   G  
Sbjct: 412 ATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGID 471

Query: 445 PDSFTYSSLIRGLCMEGMLNEAIEIFSVMEE-NIKLDTENYNSLILGCCKSRRTDLALDV 504
           PD  TY+S+I  LC EGM++EA E+   M           YN ++LG CK+ R + A++V
Sbjct: 472 PDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINV 531

Query: 505 FEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRELQLRDVINQSTVERLVMQYDL 562
            E MVG G   NETTYT+L+EGI        A E+  +L   D I++ + +RL   + L
Sbjct: 532 LESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRLHRTFPL 589

BLAST of CSPI04G21480 vs. Swiss-Prot
Match: PPR36_ARATH (Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidopsis thaliana GN=At1g12300 PE=2 SV=1)

HSP 1 Score: 251.1 bits (640), Expect = 2.8e-65
Identity = 135/454 (29.74%), Postives = 240/454 (52.86%), Query Frame = 1

Query: 91  KGQKPDVFQATQLLYDLCKTCKMRKAIKVMEMMIGSGIIPDAASYTFLVSSLCRKGNVGY 150
           KG   +++  + ++   C+  K+  A   M  +I  G  P+  +++ L++ LC +G V  
Sbjct: 117 KGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIKLGYEPNTITFSTLINGLCLEGRVSE 176

Query: 151 AMQLVDKMEEYGYPTNTATYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLE 210
           A++LVD+M E G+  +  T N+LV GLC+ G   +++ L+D++++ G  PNA TY  +L 
Sbjct: 177 ALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEYGCQPNAVTYGPVLN 236

Query: 211 AAYKERGADEASKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAMQLFRELPSKGFSP 270
              K      A +LL ++  +  K + V Y++++ GLCK G  ++A  LF E+  KG + 
Sbjct: 237 VMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKHGSLDNAFNLFNEMEMKGITT 296

Query: 271 NVVSYNILLRSLCNEGRWEEANVLLAEMDGDERSPSTVTYNILIGSLTLHGRTEHALEVL 330
           N+++YNIL+   CN GRW++   LL +M   + +P+ VT+++LI S    G+   A E+ 
Sbjct: 297 NIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPNVVTFSVLIDSFVKEGKLREAEELH 356

Query: 331 EEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIAT-LCEE 390
           +EMI     P   +Y  +I   CK+  +D   + +D M+ + C+PN  T+N +    C+ 
Sbjct: 357 KEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVDLMVSKGCDPNIRTFNILINGYCKA 416

Query: 391 GMVQEAFSIIQSLGNKQHFSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTY 450
             + +   + + +  +   +    Y  +I   C  G    A +L  EM      P+  TY
Sbjct: 417 NRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLNVAKELFQEMVSRKVPPNIVTY 476

Query: 451 SSLIRGLCMEGMLNEAIEIFSVMEEN-IKLDTENYNSLILGCCKSRRTDLALDVFEIMVG 510
             L+ GLC  G   +A+EIF  +E++ ++LD   YN +I G C + + D A D+F  +  
Sbjct: 477 KILLDGLCDNGESEKALEIFEKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPL 536

Query: 511 KGYLANETTYTILVEGIIHEKEMDLATEVLRELQ 543
           KG      TY I++ G+  +  +  A  + R+++
Sbjct: 537 KGVKPGVKTYNIMIGGLCKKGPLSEAELLFRKME 570

BLAST of CSPI04G21480 vs. Swiss-Prot
Match: PPR39_ARATH (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana GN=At1g12775 PE=2 SV=1)

HSP 1 Score: 250.0 bits (637), Expect = 6.3e-65
Identity = 130/456 (28.51%), Postives = 241/456 (52.85%), Query Frame = 1

Query: 88  MVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVMEMMIGSGIIPDAASYTFLVSSLCRKGN 147
           M  KG    ++  + ++   C+  K+  A   M  ++  G  PD   +  L++ LC +  
Sbjct: 114 MESKGIAHSIYTLSIMINCFCRCRKLSYAFSTMGKIMKLGYEPDTVIFNTLLNGLCLECR 173

Query: 148 VGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSF 207
           V  A++LVD+M E G+     T N+LV GLC++G ++ ++ L+DR+++ G  PN  TY  
Sbjct: 174 VSEALELVDRMVEMGHKPTLITLNTLVNGLCLNGKVSDAVVLIDRMVETGFQPNEVTYGP 233

Query: 208 LLEAAYKERGADEASKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAMQLFRELPSKG 267
           +L    K      A +LL ++  +  K + V Y++++ GLCK+G  ++A  LF E+  KG
Sbjct: 234 VLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKG 293

Query: 268 FSPNVVSYNILLRSLCNEGRWEEANVLLAEMDGDERSPSTVTYNILIGSLTLHGRTEHAL 327
           F  ++++YN L+   CN GRW++   LL +M   + SP+ VT+++LI S    G+   A 
Sbjct: 294 FKADIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNVVTFSVLIDSFVKEGKLREAD 353

Query: 328 EVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIAT-L 387
           ++L+EM++    P   +YN +I   CK+ +++  ++ +D M+ + C+P+  T+N +    
Sbjct: 354 QLLKEMMQRGIAPNTITYNSLIDGFCKENRLEEAIQMVDLMISKGCDPDIMTFNILINGY 413

Query: 388 CEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDS 447
           C+   + +   + + +  +   +    Y  ++   C+ G    A +L  EM      PD 
Sbjct: 414 CKANRIDDGLELFREMSLRGVIANTVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRVRPDI 473

Query: 448 FTYSSLIRGLCMEGMLNEAIEIFSVMEEN-IKLDTENYNSLILGCCKSRRTDLALDVFEI 507
            +Y  L+ GLC  G L +A+EIF  +E++ ++LD   Y  +I G C + + D A D+F  
Sbjct: 474 VSYKILLDGLCDNGELEKALEIFGKIEKSKMELDIGIYMIIIHGMCNASKVDDAWDLFCS 533

Query: 508 MVGKGYLANETTYTILVEGIIHEKEMDLATEVLREL 542
           +  KG   +   Y I++  +  +  +  A  + R++
Sbjct: 534 LPLKGVKLDARAYNIMISELCRKDSLSKADILFRKM 569

BLAST of CSPI04G21480 vs. TrEMBL
Match: A0A0A0L2W8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G613170 PE=4 SV=1)

HSP 1 Score: 1134.4 bits (2933), Expect = 0.0e+00
Identity = 566/566 (100.00%), Postives = 566/566 (100.00%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120
           LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM
Sbjct: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120

Query: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180
           EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300

Query: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENIKLDT 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENIKLDT
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENIKLDT 480

Query: 481 ENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRE 540
           ENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRE
Sbjct: 481 ENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRE 540

Query: 541 LQLRDVINQSTVERLVMQYDLNELPL 567
           LQLRDVINQSTVERLVMQYDLNELPL
Sbjct: 541 LQLRDVINQSTVERLVMQYDLNELPL 566

BLAST of CSPI04G21480 vs. TrEMBL
Match: E5GBB3_CUCME (Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 1089.7 bits (2817), Expect = 0.0e+00
Identity = 542/566 (95.76%), Postives = 554/566 (97.88%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITN SPETTRRGCGFFSHIPN+QKLSLNKGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNTSPETTRRGCGFFSHIPNLQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120
           LPNWK GK++QKSKELRL DAFFHLEFMVEKGQKPDVFQATQLLYDLCK CKMRKAIKVM
Sbjct: 61  LPNWKTGKVEQKSKELRLTDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180
           EMMIGSGIIPDA+SYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNT TYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGG+PNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGEPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDA++LFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEM+G
Sbjct: 241 NVLLTGLCKEGRTEDAIRLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMNG 300

Query: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           DERSPSTVTYNILIGSL LHGRTEHALEVLEEMIRARFKPTASSYNPIIA LCKD K+DL
Sbjct: 301 DERSPSTVTYNILIGSLALHGRTEHALEVLEEMIRARFKPTASSYNPIIAHLCKDGKLDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQH STQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENIKLDT 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEEN K DT
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENNKPDT 480

Query: 481 ENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRE 540
           ENYNSLILGCCKSRRTDLALDVFEIMVGKGYL NETTYTILVEGIIHEKEMDLAT+VLRE
Sbjct: 481 ENYNSLILGCCKSRRTDLALDVFEIMVGKGYLPNETTYTILVEGIIHEKEMDLATKVLRE 540

Query: 541 LQLRDVINQSTVERLVMQYDLNELPL 567
           LQLRDVI+QST+ERLVMQYDLNELPL
Sbjct: 541 LQLRDVISQSTLERLVMQYDLNELPL 566

BLAST of CSPI04G21480 vs. TrEMBL
Match: A5AJ76_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0027g00450 PE=4 SV=1)

HSP 1 Score: 897.5 bits (2318), Expect = 8.3e-258
Identity = 436/567 (76.90%), Postives = 503/567 (88.71%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60
           MA L+N +SPIT+PSPE  R+ CGFFS +PN+  LSLNKGFS+VLASTQITISPKD +FT
Sbjct: 1   MAILVNAMSPITSPSPENARKVCGFFSQVPNLHTLSLNKGFSRVLASTQITISPKDNVFT 60

Query: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120
           LPNW+ GK D ++++LRLNDAF +LE+M+ KG KPD  QATQL+Y+LCK+ KMRKA KVM
Sbjct: 61  LPNWRSGKNDPRTRDLRLNDAFLYLEYMIGKGHKPDGGQATQLMYELCKSNKMRKATKVM 120

Query: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180
           E+MIGSG  PD AS TFLV++LC++GNVGYAMQLV+KMEEYGYPTNT TYNSLVRGLCMH
Sbjct: 121 ELMIGSGTTPDPASCTFLVNNLCKRGNVGYAMQLVEKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240
           GNL+QSLQ+LD+ ++KGLVPN +TYSFLLEAAYKERGADEA +LLDEI+AKGGKPNLVSY
Sbjct: 181 GNLSQSLQILDKFMKKGLVPNVFTYSFLLEAAYKERGADEAIRLLDEIVAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTE+AMQ FR+LPSKGFSPNVVSYNILLRSLC EGRWE+A  LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEEAMQFFRDLPSKGFSPNVVSYNILLRSLCYEGRWEKAKELLAEMDG 300

Query: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
            ERSPS VT+NILIGSL LHG+T+ ALEVL++M RARFK TA+SYNPIIARLCK+ KVDL
Sbjct: 301 GERSPSIVTFNILIGSLALHGQTDQALEVLDDMSRARFKATAASYNPIIARLCKEGKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420
           VVKCLDQMMYR CNPNEGTYNAIA LCEEG VQEAFSIIQSLGNKQ+ ST +FYK VI+S
Sbjct: 361 VVKCLDQMMYRRCNPNEGTYNAIAVLCEEGKVQEAFSIIQSLGNKQNSSTHDFYKGVISS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEEN-IKLD 480
           LCRKGNTYPAFQLLYEMTKYGF PDS+TYSSLIRGLC EGML+EA+EIFS+MEEN  + D
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFVPDSYTYSSLIRGLCSEGMLDEAMEIFSIMEENYCRPD 480

Query: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLR 540
            +N+N+LILG CK R+TDL+L VFE+MV KGY+ NETTYTI+VEGI H++EM+LA  VL+
Sbjct: 481 VDNFNALILGLCKCRKTDLSLMVFEMMVKKGYMPNETTYTIIVEGIAHQEEMELAAAVLK 540

Query: 541 ELQLRDVINQSTVERLVMQYDLNELPL 567
           EL LR  + +ST+ERLVMQYDL  LP+
Sbjct: 541 ELYLRQAVGRSTLERLVMQYDLEGLPI 567

BLAST of CSPI04G21480 vs. TrEMBL
Match: A0A061FH66_THECC (Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_032450 PE=4 SV=1)

HSP 1 Score: 895.2 bits (2312), Expect = 4.1e-257
Identity = 431/565 (76.28%), Postives = 506/565 (89.56%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLN++SP+TNPSPETTR+ CGFF  IPN+   SLNKGF++VLA+TQITISPKD++FT
Sbjct: 1   MATLLNSMSPMTNPSPETTRKTCGFFYQIPNLHSFSLNKGFTRVLATTQITISPKDSVFT 60

Query: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120
           LPNWK GK D KS+ELRLNDAFFH+E+MV KGQKPDV QATQLLYDLCK  KM+K+I+V+
Sbjct: 61  LPNWKTGKNDTKSRELRLNDAFFHMEYMVGKGQKPDVAQATQLLYDLCKANKMKKSIRVL 120

Query: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180
           EMM+ SGIIPDAASYTFLV+ LC++GNVG+AMQLV+KME +GYPTNT TYNSLVRGLCMH
Sbjct: 121 EMMVNSGIIPDAASYTFLVNHLCKRGNVGHAMQLVEKMEAHGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240
           GNL QSLQLLD+LIQ+GLVPNA+TYSFLLEAAYKE+G +EA KLLD+IIAKGGKPNLVSY
Sbjct: 181 GNLNQSLQLLDKLIQRGLVPNAFTYSFLLEAAYKEKGVNEAMKLLDDIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRT++A++ FR LP+KGF PNVVSYNILLR+LC EG+W+EAN LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTDEAIRFFRNLPAKGFDPNVVSYNILLRNLCYEGQWKEANELLAEMDG 300

Query: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           ++RSPS VTYNILIGSL LHGRTEHAL+VL+EMIR RFK TA+SYNPIIARLC+++KVDL
Sbjct: 301 EDRSPSVVTYNILIGSLALHGRTEHALDVLDEMIRGRFKATATSYNPIIARLCQEKKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420
           VVKCLDQM+YR C PNEGTYNA A LCE+GMVQEAFSIIQSLG+KQ   + +FYK VI+S
Sbjct: 361 VVKCLDQMIYRRCKPNEGTYNATAVLCEQGMVQEAFSIIQSLGSKQSSPSHDFYKSVISS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEE-NIKLD 480
           LCRKGNTYPAFQLLYEMTK GF PDS+TYSSLIRGLC+EGML EA+EI  VMEE N + D
Sbjct: 421 LCRKGNTYPAFQLLYEMTKSGFNPDSYTYSSLIRGLCLEGMLQEALEIVIVMEESNYRPD 480

Query: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLR 540
            +N+N+LILG CKS RTDL+L VFE+++ KGY+ NETTYTILVEGI HE +++LA EVL+
Sbjct: 481 VDNFNALILGFCKSHRTDLSLKVFEMIIEKGYMPNETTYTILVEGIAHEGKIELAAEVLK 540

Query: 541 ELQLRDVINQSTVERLVMQYDLNEL 565
           EL +R+V++Q  VERLVMQY+L+ +
Sbjct: 541 ELHVREVVSQHAVERLVMQYNLSAI 565

BLAST of CSPI04G21480 vs. TrEMBL
Match: A0A0D2U0G5_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G354800 PE=4 SV=1)

HSP 1 Score: 884.0 bits (2283), Expect = 9.5e-254
Identity = 427/565 (75.58%), Postives = 503/565 (89.03%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATL+N++S +TNPSPETTRR  GFF  I N+   SL+KGFSKVLA+TQITISPKD++FT
Sbjct: 1   MATLINSISFLTNPSPETTRRPSGFFYQIANLHYFSLSKGFSKVLATTQITISPKDSVFT 60

Query: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120
           LPNWK GK D KS+ELRL+DA+FH+E+MV KGQKPDV QATQLLYDLCK  KM+K+I+VM
Sbjct: 61  LPNWKTGKSDSKSRELRLSDAYFHMEYMVGKGQKPDVVQATQLLYDLCKVNKMKKSIRVM 120

Query: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180
           EMM+ SGIIPDAASYTFLV+ LC++GNVG+AMQLV+KME +GYPTNT TYNSLVRGLCMH
Sbjct: 121 EMMVDSGIIPDAASYTFLVNHLCKRGNVGHAMQLVEKMEAHGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240
           GNL QSLQLLDRLIQKGLVPN +TYSFLLEAAYKE+G +EA+KLLDEIIAKGGKPNLVSY
Sbjct: 181 GNLKQSLQLLDRLIQKGLVPNEFTYSFLLEAAYKEKGVNEATKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRT++A++ FR+LP+KGF+PNVVSYNI+LR+LC EGRW+EAN LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTDEAIRFFRDLPAKGFNPNVVSYNIVLRNLCYEGRWKEANELLAEMDG 300

Query: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           D+RSPS VTYNILIGSL LHGRT HA++VL+EMIR RFK +A+SYNPIIA+LC++ KVDL
Sbjct: 301 DDRSPSVVTYNILIGSLALHGRTYHAMDVLDEMIRGRFKVSATSYNPIIAQLCQEEKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420
           VVKCLDQM+YR C PNEGTYNAIA LCE+GMVQEAFSI QSL +KQ  S  +FYK VI+S
Sbjct: 361 VVKCLDQMIYRRCKPNEGTYNAIAVLCEQGMVQEAFSIFQSLASKQSSSPNDFYKSVISS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEE-NIKLD 480
           LCRKGNTYPAFQLLYEMTK GFTPDS+TYSSLIRGLC+EGML  A++IF VMEE N K D
Sbjct: 421 LCRKGNTYPAFQLLYEMTKSGFTPDSYTYSSLIRGLCLEGMLQAAMQIFIVMEESNFKPD 480

Query: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLR 540
            +N+N+LILG CKS RTDL+L VFE+++ KGY+ NETTYTILVEGI HE +M+LA +VL+
Sbjct: 481 VDNFNALILGFCKSHRTDLSLKVFEMLIEKGYMPNETTYTILVEGIAHEGQMELAAQVLK 540

Query: 541 ELQLRDVINQSTVERLVMQYDLNEL 565
           EL +R+V++Q  VERL+MQY+L+ +
Sbjct: 541 ELHMREVVSQHVVERLIMQYNLSAI 565

BLAST of CSPI04G21480 vs. TAIR10
Match: AT1G79080.1 (AT1G79080.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 740.0 bits (1909), Expect = 1.1e-213
Identity = 376/577 (65.16%), Postives = 464/577 (80.42%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPN--IQKLSLNKGFSKVLASTQITISPKDTI 60
           M+TLLN+V  + +P   + R+  GF SHIP+  +   S++KG ++VLASTQIT+SPKD+ 
Sbjct: 1   MSTLLNSVLSMASPE-SSPRKAVGFVSHIPSGFLHFSSVSKGVARVLASTQITLSPKDSA 60

Query: 61  FTLP--NWK----IGKL--DQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKT 120
           FT+   +WK     G    D +S E  L+D+F HLE +V  G KP+V  +TQLLYDLCK 
Sbjct: 61  FTITGSSWKPDLDSGSFSDDPRSDEPNLSDSFSHLESLVTGGHKPNVAHSTQLLYDLCKA 120

Query: 121 CKMRKAIKVMEMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATY 180
            +++KAI+V+E+M+ SGIIPDA++YT+LV+ LC++GNVGYAMQLV+KME++GYP+NT TY
Sbjct: 121 NRLKKAIRVIELMVSSGIIPDASAYTYLVNQLCKRGNVGYAMQLVEKMEDHGYPSNTVTY 180

Query: 181 NSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIA 240
           N+LVRGLCM G+L QSLQ ++RL+QKGL PNA+TYSFLLEAAYKERG DEA KLLDEII 
Sbjct: 181 NALVRGLCMLGSLNQSLQFVERLMQKGLAPNAFTYSFLLEAAYKERGTDEAVKLLDEIIV 240

Query: 241 KGGKPNLVSYNVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEE 300
           KGG+PNLVSYNVLLTG CKEGRT+DAM LFRELP+KGF  NVVSYNILLR LC +GRWEE
Sbjct: 241 KGGEPNLVSYNVLLTGFCKEGRTDDAMALFRELPAKGFKANVVSYNILLRCLCCDGRWEE 300

Query: 301 ANVLLAEMDGDERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRA--RFKPTASSYNPI 360
           AN LLAEMDG +R+PS VTYNILI SL  HGRTE AL+VL+EM +   +F+ TA+SYNP+
Sbjct: 301 ANSLLAEMDGGDRAPSVVTYNILINSLAFHGRTEQALQVLKEMSKGNHQFRVTATSYNPV 360

Query: 361 IARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIATLCE-EGMVQEAFSIIQSLGNKQH 420
           IARLCK+ KVDLVVKCLD+M+YR C PNEGTYNAI +LCE    VQEAF IIQSL NKQ 
Sbjct: 361 IARLCKEGKVDLVVKCLDEMIYRRCKPNEGTYNAIGSLCEHNSKVQEAFYIIQSLSNKQK 420

Query: 421 FSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIE 480
             T +FYK VITSLCRKGNT+ AFQLLYEMT+ GF PD+ TYS+LIRGLC+EGM   A+E
Sbjct: 421 CCTHDFYKSVITSLCRKGNTFAAFQLLYEMTRCGFDPDAHTYSALIRGLCLEGMFTGAME 480

Query: 481 IFSVME--ENIKLDTENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGI 540
           + S+ME  EN K   +N+N++ILG CK RRTDLA++VFE+MV K  + NETTY ILVEGI
Sbjct: 481 VLSIMEESENCKPTVDNFNAMILGLCKIRRTDLAMEVFEMMVEKKRMPNETTYAILVEGI 540

Query: 541 IHEKEMDLATEVLRELQLRDVINQSTVERLVMQYDLN 563
            HE E++LA EVL EL+LR VI Q+ V+R+VMQ++L+
Sbjct: 541 AHEDELELAKEVLDELRLRKVIGQNAVDRIVMQFNLD 576

BLAST of CSPI04G21480 vs. TAIR10
Match: AT1G09900.1 (AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 375.2 bits (962), Expect = 7.3e-104
Identity = 192/489 (39.26%), Postives = 302/489 (61.76%), Query Frame = 1

Query: 69  LDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVMEMMIGSGI 128
           L Q  +   L + F  LE MV  G  PD+   T L+   C+  K RKA K++E++ GSG 
Sbjct: 109 LRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGA 168

Query: 129 IPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMHGNLTQSLQ 188
           +PD  +Y  ++S  C+ G +  A+ ++D+M       +  TYN+++R LC  G L Q+++
Sbjct: 169 VPDVITYNVMISGYCKAGEINNALSVLDRMS---VSPDVVTYNTILRSLCDSGKLKQAME 228

Query: 189 LLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSYNVLLTGLC 248
           +LDR++Q+   P+  TY+ L+EA  ++ G   A KLLDE+  +G  P++V+YNVL+ G+C
Sbjct: 229 VLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGIC 288

Query: 249 KEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDGDERSPSTV 308
           KEGR ++A++   ++PS G  PNV+++NI+LRS+C+ GRW +A  LLA+M     SPS V
Sbjct: 289 KEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSVV 348

Query: 309 TYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQM 368
           T+NILI  L   G    A+++LE+M +   +P + SYNP++   CK++K+D  ++ L++M
Sbjct: 349 TFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLERM 408

Query: 369 MYRHCNPNEGTYNAIAT-LCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITSLCRKGNT 428
           + R C P+  TYN + T LC++G V++A  I+  L +K        Y  VI  L + G T
Sbjct: 409 VSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGKT 468

Query: 429 YPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEE-NIKLDTENYNSL 488
             A +LL EM      PD+ TYSSL+ GL  EG ++EAI+ F   E   I+ +   +NS+
Sbjct: 469 GKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNSI 528

Query: 489 ILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRELQLRDV 548
           +LG CKSR+TD A+D    M+ +G   NET+YTIL+EG+ +E     A E+L EL  + +
Sbjct: 529 MLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKGL 588

Query: 549 INQSTVERL 556
           + +S+ E++
Sbjct: 589 MKKSSAEQV 594

BLAST of CSPI04G21480 vs. TAIR10
Match: AT3G04760.1 (AT3G04760.1 Pentatricopeptide repeat (PPR-like) superfamily protein)

HSP 1 Score: 327.4 bits (838), Expect = 1.7e-89
Identity = 174/479 (36.33%), Postives = 273/479 (56.99%), Query Frame = 1

Query: 85  LEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVMEMMIGSGIIPDAASYTFLVSSLCR 144
           LE MV KG  PDV   T+L+        + KA++VME++   G  PD  +Y  L++  C+
Sbjct: 112 LETMVRKGYNPDVILCTKLIKGFFTLRNIPKAVRVMEILEKFGQ-PDVFAYNALINGFCK 171

Query: 145 KGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYT 204
              +  A +++D+M    +  +T TYN ++  LC  G L  +L++L++L+     P   T
Sbjct: 172 MNRIDDATRVLDRMRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVIT 231

Query: 205 YSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAMQLFRELP 264
           Y+ L+EA   E G DEA KL+DE++++G KP++ +YN ++ G+CKEG  + A ++ R L 
Sbjct: 232 YTILIEATMLEGGVDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLE 291

Query: 265 SKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDGDERSPSTVTYNILIGSLTLHGRTE 324
            KG  P+V+SYNILLR+L N+G+WEE   L+ +M  ++  P+ VTY+ILI +L   G+ E
Sbjct: 292 LKGCEPDVISYNILLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIE 351

Query: 325 HALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAI- 384
            A+ +L+ M      P A SY+P+IA  C++ ++D+ ++ L+ M+   C P+   YN + 
Sbjct: 352 EAMNLLKLMKEKGLTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVL 411

Query: 385 ATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFT 444
           ATLC+ G   +A  I   LG          Y  + ++L   G+   A  ++ EM   G  
Sbjct: 412 ATLCKNGKADQALEIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGID 471

Query: 445 PDSFTYSSLIRGLCMEGMLNEAIEIFSVMEE-NIKLDTENYNSLILGCCKSRRTDLALDV 504
           PD  TY+S+I  LC EGM++EA E+   M           YN ++LG CK+ R + A++V
Sbjct: 472 PDEITYNSMISCLCREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINV 531

Query: 505 FEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRELQLRDVINQSTVERLVMQYDL 562
            E MVG G   NETTYT+L+EGI        A E+  +L   D I++ + +RL   + L
Sbjct: 532 LESMVGNGCRPNETTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRLHRTFPL 589

BLAST of CSPI04G21480 vs. TAIR10
Match: AT1G12300.1 (AT1G12300.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 251.1 bits (640), Expect = 1.6e-66
Identity = 135/454 (29.74%), Postives = 240/454 (52.86%), Query Frame = 1

Query: 91  KGQKPDVFQATQLLYDLCKTCKMRKAIKVMEMMIGSGIIPDAASYTFLVSSLCRKGNVGY 150
           KG   +++  + ++   C+  K+  A   M  +I  G  P+  +++ L++ LC +G V  
Sbjct: 117 KGIAHNLYTLSIMINCFCRCRKLCLAFSAMGKIIKLGYEPNTITFSTLINGLCLEGRVSE 176

Query: 151 AMQLVDKMEEYGYPTNTATYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSFLLE 210
           A++LVD+M E G+  +  T N+LV GLC+ G   +++ L+D++++ G  PNA TY  +L 
Sbjct: 177 ALELVDRMVEMGHKPDLITINTLVNGLCLSGKEAEAMLLIDKMVEYGCQPNAVTYGPVLN 236

Query: 211 AAYKERGADEASKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAMQLFRELPSKGFSP 270
              K      A +LL ++  +  K + V Y++++ GLCK G  ++A  LF E+  KG + 
Sbjct: 237 VMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKHGSLDNAFNLFNEMEMKGITT 296

Query: 271 NVVSYNILLRSLCNEGRWEEANVLLAEMDGDERSPSTVTYNILIGSLTLHGRTEHALEVL 330
           N+++YNIL+   CN GRW++   LL +M   + +P+ VT+++LI S    G+   A E+ 
Sbjct: 297 NIITYNILIGGFCNAGRWDDGAKLLRDMIKRKINPNVVTFSVLIDSFVKEGKLREAEELH 356

Query: 331 EEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIAT-LCEE 390
           +EMI     P   +Y  +I   CK+  +D   + +D M+ + C+PN  T+N +    C+ 
Sbjct: 357 KEMIHRGIAPDTITYTSLIDGFCKENHLDKANQMVDLMVSKGCDPNIRTFNILINGYCKA 416

Query: 391 GMVQEAFSIIQSLGNKQHFSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDSFTY 450
             + +   + + +  +   +    Y  +I   C  G    A +L  EM      P+  TY
Sbjct: 417 NRIDDGLELFRKMSLRGVVADTVTYNTLIQGFCELGKLNVAKELFQEMVSRKVPPNIVTY 476

Query: 451 SSLIRGLCMEGMLNEAIEIFSVMEEN-IKLDTENYNSLILGCCKSRRTDLALDVFEIMVG 510
             L+ GLC  G   +A+EIF  +E++ ++LD   YN +I G C + + D A D+F  +  
Sbjct: 477 KILLDGLCDNGESEKALEIFEKIEKSKMELDIGIYNIIIHGMCNASKVDDAWDLFCSLPL 536

Query: 511 KGYLANETTYTILVEGIIHEKEMDLATEVLRELQ 543
           KG      TY I++ G+  +  +  A  + R+++
Sbjct: 537 KGVKPGVKTYNIMIGGLCKKGPLSEAELLFRKME 570

BLAST of CSPI04G21480 vs. TAIR10
Match: AT1G12775.1 (AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 250.0 bits (637), Expect = 3.5e-66
Identity = 130/456 (28.51%), Postives = 241/456 (52.85%), Query Frame = 1

Query: 88  MVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVMEMMIGSGIIPDAASYTFLVSSLCRKGN 147
           M  KG    ++  + ++   C+  K+  A   M  ++  G  PD   +  L++ LC +  
Sbjct: 114 MESKGIAHSIYTLSIMINCFCRCRKLSYAFSTMGKIMKLGYEPDTVIFNTLLNGLCLECR 173

Query: 148 VGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMHGNLTQSLQLLDRLIQKGLVPNAYTYSF 207
           V  A++LVD+M E G+     T N+LV GLC++G ++ ++ L+DR+++ G  PN  TY  
Sbjct: 174 VSEALELVDRMVEMGHKPTLITLNTLVNGLCLNGKVSDAVVLIDRMVETGFQPNEVTYGP 233

Query: 208 LLEAAYKERGADEASKLLDEIIAKGGKPNLVSYNVLLTGLCKEGRTEDAMQLFRELPSKG 267
           +L    K      A +LL ++  +  K + V Y++++ GLCK+G  ++A  LF E+  KG
Sbjct: 234 VLNVMCKSGQTALAMELLRKMEERNIKLDAVKYSIIIDGLCKDGSLDNAFNLFNEMEIKG 293

Query: 268 FSPNVVSYNILLRSLCNEGRWEEANVLLAEMDGDERSPSTVTYNILIGSLTLHGRTEHAL 327
           F  ++++YN L+   CN GRW++   LL +M   + SP+ VT+++LI S    G+   A 
Sbjct: 294 FKADIITYNTLIGGFCNAGRWDDGAKLLRDMIKRKISPNVVTFSVLIDSFVKEGKLREAD 353

Query: 328 EVLEEMIRARFKPTASSYNPIIARLCKDRKVDLVVKCLDQMMYRHCNPNEGTYNAIAT-L 387
           ++L+EM++    P   +YN +I   CK+ +++  ++ +D M+ + C+P+  T+N +    
Sbjct: 354 QLLKEMMQRGIAPNTITYNSLIDGFCKENRLEEAIQMVDLMISKGCDPDIMTFNILINGY 413

Query: 388 CEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITSLCRKGNTYPAFQLLYEMTKYGFTPDS 447
           C+   + +   + + +  +   +    Y  ++   C+ G    A +L  EM      PD 
Sbjct: 414 CKANRIDDGLELFREMSLRGVIANTVTYNTLVQGFCQSGKLEVAKKLFQEMVSRRVRPDI 473

Query: 448 FTYSSLIRGLCMEGMLNEAIEIFSVMEEN-IKLDTENYNSLILGCCKSRRTDLALDVFEI 507
            +Y  L+ GLC  G L +A+EIF  +E++ ++LD   Y  +I G C + + D A D+F  
Sbjct: 474 VSYKILLDGLCDNGELEKALEIFGKIEKSKMELDIGIYMIIIHGMCNASKVDDAWDLFCS 533

Query: 508 MVGKGYLANETTYTILVEGIIHEKEMDLATEVLREL 542
           +  KG   +   Y I++  +  +  +  A  + R++
Sbjct: 534 LPLKGVKLDARAYNIMISELCRKDSLSKADILFRKM 569

BLAST of CSPI04G21480 vs. NCBI nr
Match: gi|778695367|ref|XP_011653982.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Cucumis sativus])

HSP 1 Score: 1134.4 bits (2933), Expect = 0.0e+00
Identity = 566/566 (100.00%), Postives = 566/566 (100.00%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120
           LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM
Sbjct: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120

Query: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180
           EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300

Query: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL
Sbjct: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENIKLDT 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENIKLDT
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENIKLDT 480

Query: 481 ENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRE 540
           ENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRE
Sbjct: 481 ENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRE 540

Query: 541 LQLRDVINQSTVERLVMQYDLNELPL 567
           LQLRDVINQSTVERLVMQYDLNELPL
Sbjct: 541 LQLRDVINQSTVERLVMQYDLNELPL 566

BLAST of CSPI04G21480 vs. NCBI nr
Match: gi|659087133|ref|XP_008444287.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Cucumis melo])

HSP 1 Score: 1089.7 bits (2817), Expect = 0.0e+00
Identity = 542/566 (95.76%), Postives = 554/566 (97.88%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLNTVSPITN SPETTRRGCGFFSHIPN+QKLSLNKGFSKVLASTQITISPKDTIFT
Sbjct: 1   MATLLNTVSPITNTSPETTRRGCGFFSHIPNLQKLSLNKGFSKVLASTQITISPKDTIFT 60

Query: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120
           LPNWK GK++QKSKELRL DAFFHLEFMVEKGQKPDVFQATQLLYDLCK CKMRKAIKVM
Sbjct: 61  LPNWKTGKVEQKSKELRLTDAFFHLEFMVEKGQKPDVFQATQLLYDLCKACKMRKAIKVM 120

Query: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180
           EMMIGSGIIPDA+SYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNT TYNSLVRGLCMH
Sbjct: 121 EMMIGSGIIPDASSYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240
           GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGG+PNLVSY
Sbjct: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGEPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTEDA++LFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEM+G
Sbjct: 241 NVLLTGLCKEGRTEDAIRLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMNG 300

Query: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           DERSPSTVTYNILIGSL LHGRTEHALEVLEEMIRARFKPTASSYNPIIA LCKD K+DL
Sbjct: 301 DERSPSTVTYNILIGSLALHGRTEHALEVLEEMIRARFKPTASSYNPIIAHLCKDGKLDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420
           VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQH STQEFYKIVITS
Sbjct: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHSSTQEFYKIVITS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENIKLDT 480
           LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEEN K DT
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEENNKPDT 480

Query: 481 ENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLRE 540
           ENYNSLILGCCKSRRTDLALDVFEIMVGKGYL NETTYTILVEGIIHEKEMDLAT+VLRE
Sbjct: 481 ENYNSLILGCCKSRRTDLALDVFEIMVGKGYLPNETTYTILVEGIIHEKEMDLATKVLRE 540

Query: 541 LQLRDVINQSTVERLVMQYDLNELPL 567
           LQLRDVI+QST+ERLVMQYDLNELPL
Sbjct: 541 LQLRDVISQSTLERLVMQYDLNELPL 566

BLAST of CSPI04G21480 vs. NCBI nr
Match: gi|225462201|ref|XP_002269984.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Vitis vinifera])

HSP 1 Score: 897.5 bits (2318), Expect = 1.2e-257
Identity = 436/567 (76.90%), Postives = 503/567 (88.71%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60
           MA L+N +SPIT+PSPE  R+ CGFFS +PN+  LSLNKGFS+VLASTQITISPKD +FT
Sbjct: 1   MAILVNAMSPITSPSPENARKVCGFFSQVPNLHTLSLNKGFSRVLASTQITISPKDNVFT 60

Query: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120
           LPNW+ GK D ++++LRLNDAF +LE+M+ KG KPD  QATQL+Y+LCK+ KMRKA KVM
Sbjct: 61  LPNWRSGKNDPRTRDLRLNDAFLYLEYMIGKGHKPDGGQATQLMYELCKSNKMRKATKVM 120

Query: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180
           E+MIGSG  PD AS TFLV++LC++GNVGYAMQLV+KMEEYGYPTNT TYNSLVRGLCMH
Sbjct: 121 ELMIGSGTTPDPASCTFLVNNLCKRGNVGYAMQLVEKMEEYGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240
           GNL+QSLQ+LD+ ++KGLVPN +TYSFLLEAAYKERGADEA +LLDEI+AKGGKPNLVSY
Sbjct: 181 GNLSQSLQILDKFMKKGLVPNVFTYSFLLEAAYKERGADEAIRLLDEIVAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRTE+AMQ FR+LPSKGFSPNVVSYNILLRSLC EGRWE+A  LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTEEAMQFFRDLPSKGFSPNVVSYNILLRSLCYEGRWEKAKELLAEMDG 300

Query: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
            ERSPS VT+NILIGSL LHG+T+ ALEVL++M RARFK TA+SYNPIIARLCK+ KVDL
Sbjct: 301 GERSPSIVTFNILIGSLALHGQTDQALEVLDDMSRARFKATAASYNPIIARLCKEGKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420
           VVKCLDQMMYR CNPNEGTYNAIA LCEEG VQEAFSIIQSLGNKQ+ ST +FYK VI+S
Sbjct: 361 VVKCLDQMMYRRCNPNEGTYNAIAVLCEEGKVQEAFSIIQSLGNKQNSSTHDFYKGVISS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEEN-IKLD 480
           LCRKGNTYPAFQLLYEMTKYGF PDS+TYSSLIRGLC EGML+EA+EIFS+MEEN  + D
Sbjct: 421 LCRKGNTYPAFQLLYEMTKYGFVPDSYTYSSLIRGLCSEGMLDEAMEIFSIMEENYCRPD 480

Query: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLR 540
            +N+N+LILG CK R+TDL+L VFE+MV KGY+ NETTYTI+VEGI H++EM+LA  VL+
Sbjct: 481 VDNFNALILGLCKCRKTDLSLMVFEMMVKKGYMPNETTYTIIVEGIAHQEEMELAAAVLK 540

Query: 541 ELQLRDVINQSTVERLVMQYDLNELPL 567
           EL LR  + +ST+ERLVMQYDL  LP+
Sbjct: 541 ELYLRQAVGRSTLERLVMQYDLEGLPI 567

BLAST of CSPI04G21480 vs. NCBI nr
Match: gi|1009141884|ref|XP_015888425.1| (PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic [Ziziphus jujuba])

HSP 1 Score: 897.5 bits (2318), Expect = 1.2e-257
Identity = 438/567 (77.25%), Postives = 500/567 (88.18%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60
           MA LLN VSPI +PSPET+R+ CGFFSH+PN+   S+NKGF++VLAST ITISPKDT+FT
Sbjct: 1   MAILLNPVSPIAHPSPETSRKSCGFFSHVPNLHTFSVNKGFARVLASTPITISPKDTVFT 60

Query: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120
           +PNW+ GK D +SKE RLNDAF H E MVEKGQKPDV QATQLLYDLCK  K +KA++VM
Sbjct: 61  VPNWRTGKNDTRSKEFRLNDAFLHFERMVEKGQKPDVAQATQLLYDLCKVNKAKKAVRVM 120

Query: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180
           EMM+GS IIPDAASYTFLV+ LC++G++GYAMQLVDKMEEYGYPTNTATYNSL+RG C+ 
Sbjct: 121 EMMVGSCIIPDAASYTFLVNYLCKRGSIGYAMQLVDKMEEYGYPTNTATYNSLIRGTCLR 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240
           GNL QSLQLLDRL QKGLVPNA+TYS LLEAAYKERG +EA KLLDEIIA GGKPNLVSY
Sbjct: 181 GNLNQSLQLLDRLKQKGLVPNAFTYSSLLEAAYKERGVNEAMKLLDEIIANGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRT++A+  FR LPS GF+PNVVSYNILLR+LC EGRWEEAN+LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTDEAIHFFRNLPSMGFNPNVVSYNILLRNLCYEGRWEEANMLLAEMDG 300

Query: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
            ER+PS VTYNILIGSL+LHGR EHA  V++EMIR RFKPTA+SYNPIIA LCK+ KVDL
Sbjct: 301 GERTPSIVTYNILIGSLSLHGRIEHAFGVMDEMIRRRFKPTAASYNPIIASLCKEGKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420
           VVKCLDQMMYR C+PNEGTYNAIA LCEEGM+QEAFSIIQSLGNKQ  ST +F K VI+S
Sbjct: 361 VVKCLDQMMYRRCSPNEGTYNAIAVLCEEGMIQEAFSIIQSLGNKQKSSTHDFSKNVISS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEE-NIKLD 480
           LCRKGNTY AFQLLYEMTKYGFTPDS+TYSSL+RGLCMEGML+EA+EIFSVMEE N + +
Sbjct: 421 LCRKGNTYAAFQLLYEMTKYGFTPDSYTYSSLLRGLCMEGMLDEAMEIFSVMEENNYRPN 480

Query: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLR 540
           TEN+N+LILG CKS+RTDLA+DV+E+M+ KGY+ NETTY I+VEGI  E EM +A  VL+
Sbjct: 481 TENFNALILGFCKSQRTDLAVDVYEMMIEKGYMPNETTYIIIVEGIASEGEMGIAARVLK 540

Query: 541 ELQLRDVINQSTVERLVMQYDLNELPL 567
           ELQ R VI+QS+VERL MQYDL +LP+
Sbjct: 541 ELQQRQVIDQSSVERLAMQYDLEDLPV 567

BLAST of CSPI04G21480 vs. NCBI nr
Match: gi|590612064|ref|XP_007022279.1| (Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao])

HSP 1 Score: 895.2 bits (2312), Expect = 5.9e-257
Identity = 431/565 (76.28%), Postives = 506/565 (89.56%), Query Frame = 1

Query: 1   MATLLNTVSPITNPSPETTRRGCGFFSHIPNIQKLSLNKGFSKVLASTQITISPKDTIFT 60
           MATLLN++SP+TNPSPETTR+ CGFF  IPN+   SLNKGF++VLA+TQITISPKD++FT
Sbjct: 1   MATLLNSMSPMTNPSPETTRKTCGFFYQIPNLHSFSLNKGFTRVLATTQITISPKDSVFT 60

Query: 61  LPNWKIGKLDQKSKELRLNDAFFHLEFMVEKGQKPDVFQATQLLYDLCKTCKMRKAIKVM 120
           LPNWK GK D KS+ELRLNDAFFH+E+MV KGQKPDV QATQLLYDLCK  KM+K+I+V+
Sbjct: 61  LPNWKTGKNDTKSRELRLNDAFFHMEYMVGKGQKPDVAQATQLLYDLCKANKMKKSIRVL 120

Query: 121 EMMIGSGIIPDAASYTFLVSSLCRKGNVGYAMQLVDKMEEYGYPTNTATYNSLVRGLCMH 180
           EMM+ SGIIPDAASYTFLV+ LC++GNVG+AMQLV+KME +GYPTNT TYNSLVRGLCMH
Sbjct: 121 EMMVNSGIIPDAASYTFLVNHLCKRGNVGHAMQLVEKMEAHGYPTNTVTYNSLVRGLCMH 180

Query: 181 GNLTQSLQLLDRLIQKGLVPNAYTYSFLLEAAYKERGADEASKLLDEIIAKGGKPNLVSY 240
           GNL QSLQLLD+LIQ+GLVPNA+TYSFLLEAAYKE+G +EA KLLD+IIAKGGKPNLVSY
Sbjct: 181 GNLNQSLQLLDKLIQRGLVPNAFTYSFLLEAAYKEKGVNEAMKLLDDIIAKGGKPNLVSY 240

Query: 241 NVLLTGLCKEGRTEDAMQLFRELPSKGFSPNVVSYNILLRSLCNEGRWEEANVLLAEMDG 300
           NVLLTGLCKEGRT++A++ FR LP+KGF PNVVSYNILLR+LC EG+W+EAN LLAEMDG
Sbjct: 241 NVLLTGLCKEGRTDEAIRFFRNLPAKGFDPNVVSYNILLRNLCYEGQWKEANELLAEMDG 300

Query: 301 DERSPSTVTYNILIGSLTLHGRTEHALEVLEEMIRARFKPTASSYNPIIARLCKDRKVDL 360
           ++RSPS VTYNILIGSL LHGRTEHAL+VL+EMIR RFK TA+SYNPIIARLC+++KVDL
Sbjct: 301 EDRSPSVVTYNILIGSLALHGRTEHALDVLDEMIRGRFKATATSYNPIIARLCQEKKVDL 360

Query: 361 VVKCLDQMMYRHCNPNEGTYNAIATLCEEGMVQEAFSIIQSLGNKQHFSTQEFYKIVITS 420
           VVKCLDQM+YR C PNEGTYNA A LCE+GMVQEAFSIIQSLG+KQ   + +FYK VI+S
Sbjct: 361 VVKCLDQMIYRRCKPNEGTYNATAVLCEQGMVQEAFSIIQSLGSKQSSPSHDFYKSVISS 420

Query: 421 LCRKGNTYPAFQLLYEMTKYGFTPDSFTYSSLIRGLCMEGMLNEAIEIFSVMEE-NIKLD 480
           LCRKGNTYPAFQLLYEMTK GF PDS+TYSSLIRGLC+EGML EA+EI  VMEE N + D
Sbjct: 421 LCRKGNTYPAFQLLYEMTKSGFNPDSYTYSSLIRGLCLEGMLQEALEIVIVMEESNYRPD 480

Query: 481 TENYNSLILGCCKSRRTDLALDVFEIMVGKGYLANETTYTILVEGIIHEKEMDLATEVLR 540
            +N+N+LILG CKS RTDL+L VFE+++ KGY+ NETTYTILVEGI HE +++LA EVL+
Sbjct: 481 VDNFNALILGFCKSHRTDLSLKVFEMIIEKGYMPNETTYTILVEGIAHEGKIELAAEVLK 540

Query: 541 ELQLRDVINQSTVERLVMQYDLNEL 565
           EL +R+V++Q  VERLVMQY+L+ +
Sbjct: 541 ELHVREVVSQHAVERLVMQYNLSAI 565

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP131_ARATH2.0e-21265.16Pentatricopeptide repeat-containing protein At1g79080, chloroplastic OS=Arabidop... [more]
PPR28_ARATH1.3e-10239.26Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana GN... [more]
PP213_ARATH3.1e-8836.33Pentatricopeptide repeat-containing protein At3g04760, chloroplastic OS=Arabidop... [more]
PPR36_ARATH2.8e-6529.74Pentatricopeptide repeat-containing protein At1g12300, mitochondrial OS=Arabidop... [more]
PPR39_ARATH6.3e-6528.51Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0L2W8_CUCSA0.0e+00100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_4G613170 PE=4 SV=1[more]
E5GBB3_CUCME0.0e+0095.76Pentatricopeptide repeat-containing protein OS=Cucumis melo subsp. melo PE=4 SV=... [more]
A5AJ76_VITVI8.3e-25876.90Putative uncharacterized protein OS=Vitis vinifera GN=VIT_19s0027g00450 PE=4 SV=... [more]
A0A061FH66_THECC4.1e-25776.28Pentatricopeptide repeat (PPR) superfamily protein OS=Theobroma cacao GN=TCM_032... [more]
A0A0D2U0G5_GOSRA9.5e-25475.58Uncharacterized protein OS=Gossypium raimondii GN=B456_009G354800 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G79080.11.1e-21365.16 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G09900.17.3e-10439.26 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT3G04760.11.7e-8936.33 Pentatricopeptide repeat (PPR-like) superfamily protein[more]
AT1G12300.11.6e-6629.74 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G12775.13.5e-6628.51 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|778695367|ref|XP_011653982.1|0.0e+00100.00PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic ... [more]
gi|659087133|ref|XP_008444287.1|0.0e+0095.76PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic ... [more]
gi|225462201|ref|XP_002269984.1|1.2e-25776.90PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic ... [more]
gi|1009141884|ref|XP_015888425.1|1.2e-25777.25PREDICTED: pentatricopeptide repeat-containing protein At1g79080, chloroplastic ... [more]
gi|590612064|ref|XP_007022279.1|5.9e-25776.28Pentatricopeptide repeat (PPR) superfamily protein [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G21480.1CSPI04G21480.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 107..128
score:
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 441..473
score: 1.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 480..524
score: 1.0E-9coord: 130..178
score: 1.2E-10coord: 235..284
score: 1.7E-17coord: 305..346
score: 2.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 344..377
score: 1.3E-4coord: 169..202
score: 6.1E-7coord: 447..475
score: 2.4E-6coord: 107..132
score: 0.0016coord: 238..272
score: 2.5E-10coord: 134..164
score: 9.3E-5coord: 414..446
score: 0.0013coord: 483..514
score: 7.0E-6coord: 273..305
score: 1.9E-6coord: 308..341
score: 1.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 271..305
score: 11.411coord: 306..340
score: 11.729coord: 131..165
score: 10.676coord: 479..513
score: 11.203coord: 236..270
score: 14.009coord: 514..548
score: 7.783coord: 341..375
score: 9.131coord: 410..444
score: 9.493coord: 445..475
score: 11.093coord: 166..200
score: 12.266coord: 201..235
score: 9.262coord: 96..130
score: 8
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 112..373
score: 1.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 10..557
score:
NoneNo IPR availablePANTHERPTHR24015:SF765SUBFAMILY NOT NAMEDcoord: 10..557
score:
NoneNo IPR availableunknownSSF81901HCP-likecoord: 72..269
score: 1.2