Cp4.1LG20g09070 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG20g09070
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG20 : 6735551 .. 6739214 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATCTCCTCCTTCGCATTCGCATTCAGTTCTTGAGGAATTCCATTAACCAGCTCTGTAACTCGATTCCATCGAATCCACATTTCGTTTGTCTCAGAAGATTTTCATTTCATTCATGGTTGTTGAATACTGATCATGGTAACTTCAGCCATATTTTCCGTCTATCTCAAATCCAAAACTATCCAGCTCCCGTCCTCTCTTTTACGAAATTTTGTTTGAATTTCTACTCTAATAGAGCACCTTCAAGATCCTTTAGGAGGAGAGCGAGTAAGAGATTGAAATCCAGAATCAAACCTAAGCTGAACGAGGCTCAATTTCAGCACGCAATTTCTCAAATCCCTCCAAGGTTTAACTCCGAAGAACTCTATAATGTCATTTCTGTTCAGGGAGATCCTTTGGTGTGCTTTGAACTGTTCAATTGGGCCTCCCAGCAGTCTCGTTTCAGACATGATGTTTCCACTTATGAGATTACAATTAAGAAGCTGGGTGAGGCGAAAATGTACGAAGAAATGGATAATGTTGTGAACCAGGTGCTTGCTGTTCCTTCTATTGGTTCTGAGACTCTTTACAATACTATGATATATTTTTTTACTGAGGCAAGGAAGTTGACTAGAGCTATCAATATATTCAAGCATATGCAGAACAACAAAAACGTGAACTGTAGGCCTTCAATTAGAACCTATAATCTGCTTTTTACTGCATTCTTGAGTCGGGGTCGTAATGCTTATATAAATCTCATGTATATGGAGACTATTAGATGCCTCTTCAGACAGATGGTGAATGATGGCATTGAACCTGACATATTTACTTTAAATTGTATGATAAAGGGGTATGTGCTATCCCTTCATGTCAATGATGCTCTCAGGATCTTCCACCAAATGGGTGTTGTTTATAGTAGCCTCCCAAATTCATTTTCCTATGATTATTTGATTCACGGGTTATGCGCGCAAGCGCGAACGGATAATGCGAGGGAGTTGTGCAATGAAATGAAGGAAAAGGGGTTTGTGCCAAGTAGTATATCATATAATTCAATTGTTAATGCTATGGCTCTCAATGGAGAGGTTGAAGAAGCAGTGAATTATTTGTGGGAGATGATTGGTAATAGAAGGTCTCCTGATTTTATTACATATAAGACGGTATTGGATGAACTGTGCAGGCGAGGGAGGGTTGGAGAAGCCACGAGTTTGTTGAGGGAAGTGCAGGAGAAGGAACTTGTGGATGGTCATACTTATAGGAAACTTCTCTATGTGCTTGAAGATGACTATGGAAATGTACATGGTGGATGGCCATGATCTGGAATCATGGATCGTGTGGTTTTTTTGAAGGTAGGATGGACTCGTGACGTTTGATATCAGAGCCACGCTTTGTGTTTTAGTCGGATCAAGCAGATAGAAAAGCGGCCTGTCTTGATTTCTCCCAGTGGTTGTTGTGTATCAATGGAAGGTTTACCGATGCCATTTGCTGAAAAGTTAGAAACCTATGATTTATTCAGATGCTTTACTTAATTTAGTTATAGATCAAGAAGATTCTTCTGGAGAATCATCAATCAGTTGTGGCTACTCGGCTTAGAGCTGGTGCATATCTTTTGAAGCTTCAAGAGTTACAGCTGGTGCATATCTTTTGTGGCTACACGGCTTAAAAACGCTACAAATATTAGTCAATCAGCGAGGTAAAAAGTCTAGCAGCTTTGCTCCTTCACCTATTGGTAAAGTAACTTCGCTCCTCTCCTTTACAGTCTCACAATTCCATTGTCTGAACCTATCGGTAGAGCAGCTTTGCTCCTTAGACGACACTTATTTACCAAGAAAAAGAGAAAGAGAGAAGCTATTTCATTGTTTGAACTTATCGGTAAAGCAGCTTTGCTCCTTAGCTGACACCTATTTACCGAGAAAAAGAGAAAGAGATAAGCTATTCCATTGTCTGAACTTATTAGTAAAGCAGCTTTGGTTCTTAGCCGACTCCTAAAAAGAGTAAGAGATAAGCTATTCCATTGTCTGAACCTATTGGTAAAGCAGCTTTGCTTCTTAGCCGACTCCTAAAAAGAGTAAGAGATAAGCTATTCCATTGTCTGAACCTATTGGTAAAGCAGCTTTGCTTCTTAGCCAATACCTATTTACTGAGAAAAAGAGTAAGAGATAAAGCTTATTGAGACTTCTTTAATAGCTTATAAATGCTTTTCTTCTCAACAACGTAAAAACTTTTGAACAGTACTTGAAGCAGTGAAGCACTTCGACTTAGTTCATGAATCCAACTGTATACTGCTGGTACAGTCTCACATCCCTTCATTTCTTTTCATATAATAGGTGTATTCTCATGCGAAGCAGCCGTCTTTGTGGGACTTGAATTATATGATATCTTCTCCAAAAGAAGATCGTGATCCCTTTATGGCTTTTCCCCCATCAAGTTCAAGGTACTTTTTTCAAGCTGAGTTCATTTAACACTGAATTTTGTTTGCATGAGGAATCAATAAAGAAATTCATACAAGTCCAGCAGGATAGTGCTATATTTATAGTGCTCAAGACATGTTTCAATTTTTGACTAGTAGGTTGGTGTCTAGTAGAATTTGCAATTTAAGAACATGAAGAATTGAAGTTAAACAACACTTGAGCTTAATGAATATTACACTAATCTTCAGTTTGCTATTGATCTGAAAATTAATCTAATTGTGTAAATGTAAAAAAGATATATGTGGATTTATGTGTAGCCATATGCCAAAAGCAAATAATATTTAACTATTAAGAATTTGTATTTAATAATATGCAAGTTTGATATTCTAGCATCGTGGATAAGATATCTATTACTATCTTGTGAGTTTATAGATCAATTAAAATCTTGTGAGTTTATAGATCAATTAAAATCTTGTGAGTTTATAGATCAATTAAAGATAATTGTATGTGGTTGAAGTGGATTGAAAAATGAAATCAATTCCAAACCTAAGCTTTTGTGATTACATTACACTTTTATAACTTTCTACGATTTGTTGTAGAGAATAGAATTTTAGGAGATAAGAAACGTATCTTGAGATTGATTGCAATGAAGTTTGATCTTACTAATTTGTTTGTAAGTATGCTTAAATTGTAAGCTTAATCTTTAAACTTTTTGAATAATGTAATTGAATGACTCAAAGTTTTGAGTCCAAATGAAATATGACTCAATTTCTTGAATTTTATCGCCTTGCTCACTTGGTTAAAACAAAACAATTTCTCTCCCTCTCTCTTCTTCTTCCTCCCCCCTTTGTCTCACTCCTCCCTCCCACTCTCCTTCCTTCTTGTTCTTACCTTTCCTCTTTTCTTCCCTTATTCTTGTGTCCTCTCTGCTTCTCTCCACTTTCAGAATCAAGAATCCTATCAAGAGCGAGAAATAGAGCACGAGCGAGAGCAATAGAGCATGAGCGAGATTGAGAATGTGAGCGAGAGCAATAGAGCATGAGCGAGATTGAGAATGTGAGCGAGATGACAATGCACAAGAGAGACTGACAAAGCATGGATTGACGAAGACGTCTATTATACTTTCACATGTTGCTGGGGATGAGCAAAACAATATAACAGCTTTATGTTAAAAAACTGGATATATTTTGGTTTTTATTCATTGGGTTTTCAATTTACAATCAAGGATAAGGATAAAAATATTAAACAGTTTAGAACAGTTTTTTATAATAAGATT

mRNA sequence

ATGAATCTCCTCCTTCGCATTCGCATTCAGTTCTTGAGGAATTCCATTAACCAGCTCTGTAACTCGATTCCATCGAATCCACATTTCGTTTGTCTCAGAAGATTTTCATTTCATTCATGGTTGTTGAATACTGATCATGGTAACTTCAGCCATATTTTCCGTCTATCTCAAATCCAAAACTATCCAGCTCCCGTCCTCTCTTTTACGAAATTTTGTTTGAATTTCTACTCTAATAGAGCACCTTCAAGATCCTTTAGGAGGAGAGCGAGTAAGAGATTGAAATCCAGAATCAAACCTAAGCTGAACGAGGCTCAATTTCAGCACGCAATTTCTCAAATCCCTCCAAGGTTTAACTCCGAAGAACTCTATAATGTCATTTCTGTTCAGGGAGATCCTTTGGTGTGCTTTGAACTGTTCAATTGGGCCTCCCAGCAGTCTCGTTTCAGACATGATGTTTCCACTTATGAGATTACAATTAAGAAGCTGGGTGAGGCGAAAATGTACGAAGAAATGGATAATGTTGTGAACCAGGTGCTTGCTGTTCCTTCTATTGGTTCTGAGACTCTTTACAATACTATGATATATTTTTTTACTGAGGCAAGGAAGTTGACTAGAGCTATCAATATATTCAAGCATATGCAGAACAACAAAAACGTGAACTGTAGGCCTTCAATTAGAACCTATAATCTGCTTTTTACTGCATTCTTGAGTCGGGGTCGTAATGCTTATATAAATCTCATGTATATGGAGACTATTAGATGCCTCTTCAGACAGATGGTGAATGATGGCATTGAACCTGACATATTTACTTTAAATTGTATGATAAAGGGGTATGTGCTATCCCTTCATGTCAATGATGCTCTCAGGATCTTCCACCAAATGGGTGTTGTTTATAGTAGCCTCCCAAATTCATTTTCCTATGATTATTTGATTCACGGGTTATGCGCGCAAGCGCGAACGGATAATGCGAGGGAGTTGTGCAATGAAATGAAGGAAAAGGGGTTTGTGCCAAGTAGTATATCATATAATTCAATTGTTAATGCTATGGCTCTCAATGGAGAGGTTGAAGAAGCAGTGAATTATTTGTGGGAGATGATTGGTAATAGAAGGTCTCCTGATTTTATTACATATAAGACGGTATTGGATGAACTGTGCAGGCGAGGGAGGGTTGGAGAAGCCACGAGTTTGTTGAGGGAAGTGCAGGAGAAGGAACTTGTGGATGGTCATACTTATAGGAAACTTCTCTATGTGCTTGAAGATGACTATGGAAATGTACATGCTTTGCTTCTTAGCCGACTCCTAAAAAGAGTAAGAGATAAGCTATTCCATTGTCTGAACCTATTGGTAAAGCAGCTTTGCTTCTTAGCCAATACCTATTTACTGAGAAAAAGAGTGTATTCTCATGCGAAGCAGCCGTCTTTGTGGGACTTGAATTATATGATATCTTCTCCAAAAGAAGATCGTGATCCCTTTATGGCTTTTCCCCCATCAAGTTCAAGAATCAAGAATCCTATCAAGAGCGAGAAATAGAGCACGAGCGAGAGCAATAGAGCATGAGCGAGATTGAGAATGTGAGCGAGAGCAATAGAGCATGAGCGAGATTGAGAATGTGAGCGAGATGACAATGCACAAGAGAGACTGACAAAGCATGGATTGACGAAGACGTCTATTATACTTTCACATGTTGCTGGGGATGAGCAAAACAATATAACAGCTTTATGTTAAAAAACTGGATATATTTTGGTTTTTATTCATTGGGTTTTCAATTTACAATCAAGGATAAGGATAAAAATATTAAACAGTTTAGAACAGTTTTTTATAATAAGATT

Coding sequence (CDS)

ATGAATCTCCTCCTTCGCATTCGCATTCAGTTCTTGAGGAATTCCATTAACCAGCTCTGTAACTCGATTCCATCGAATCCACATTTCGTTTGTCTCAGAAGATTTTCATTTCATTCATGGTTGTTGAATACTGATCATGGTAACTTCAGCCATATTTTCCGTCTATCTCAAATCCAAAACTATCCAGCTCCCGTCCTCTCTTTTACGAAATTTTGTTTGAATTTCTACTCTAATAGAGCACCTTCAAGATCCTTTAGGAGGAGAGCGAGTAAGAGATTGAAATCCAGAATCAAACCTAAGCTGAACGAGGCTCAATTTCAGCACGCAATTTCTCAAATCCCTCCAAGGTTTAACTCCGAAGAACTCTATAATGTCATTTCTGTTCAGGGAGATCCTTTGGTGTGCTTTGAACTGTTCAATTGGGCCTCCCAGCAGTCTCGTTTCAGACATGATGTTTCCACTTATGAGATTACAATTAAGAAGCTGGGTGAGGCGAAAATGTACGAAGAAATGGATAATGTTGTGAACCAGGTGCTTGCTGTTCCTTCTATTGGTTCTGAGACTCTTTACAATACTATGATATATTTTTTTACTGAGGCAAGGAAGTTGACTAGAGCTATCAATATATTCAAGCATATGCAGAACAACAAAAACGTGAACTGTAGGCCTTCAATTAGAACCTATAATCTGCTTTTTACTGCATTCTTGAGTCGGGGTCGTAATGCTTATATAAATCTCATGTATATGGAGACTATTAGATGCCTCTTCAGACAGATGGTGAATGATGGCATTGAACCTGACATATTTACTTTAAATTGTATGATAAAGGGGTATGTGCTATCCCTTCATGTCAATGATGCTCTCAGGATCTTCCACCAAATGGGTGTTGTTTATAGTAGCCTCCCAAATTCATTTTCCTATGATTATTTGATTCACGGGTTATGCGCGCAAGCGCGAACGGATAATGCGAGGGAGTTGTGCAATGAAATGAAGGAAAAGGGGTTTGTGCCAAGTAGTATATCATATAATTCAATTGTTAATGCTATGGCTCTCAATGGAGAGGTTGAAGAAGCAGTGAATTATTTGTGGGAGATGATTGGTAATAGAAGGTCTCCTGATTTTATTACATATAAGACGGTATTGGATGAACTGTGCAGGCGAGGGAGGGTTGGAGAAGCCACGAGTTTGTTGAGGGAAGTGCAGGAGAAGGAACTTGTGGATGGTCATACTTATAGGAAACTTCTCTATGTGCTTGAAGATGACTATGGAAATGTACATGCTTTGCTTCTTAGCCGACTCCTAAAAAGAGTAAGAGATAAGCTATTCCATTGTCTGAACCTATTGGTAAAGCAGCTTTGCTTCTTAGCCAATACCTATTTACTGAGAAAAAGAGTGTATTCTCATGCGAAGCAGCCGTCTTTGTGGGACTTGAATTATATGATATCTTCTCCAAAAGAAGATCGTGATCCCTTTATGGCTTTTCCCCCATCAAGTTCAAGAATCAAGAATCCTATCAAGAGCGAGAAATAG

Protein sequence

MNLLLRIRIQFLRNSINQLCNSIPSNPHFVCLRRFSFHSWLLNTDHGNFSHIFRLSQIQNYPAPVLSFTKFCLNFYSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNSEELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVNDGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVLEDDYGNVHALLLSRLLKRVRDKLFHCLNLLVKQLCFLANTYLLRKRVYSHAKQPSLWDLNYMISSPKEDRDPFMAFPPSSSRIKNPIKSEK
BLAST of Cp4.1LG20g09070 vs. Swiss-Prot
Match: PP173_ARATH (Pentatricopeptide repeat-containing protein At2g27800, mitochondrial OS=Arabidopsis thaliana GN=At2g27800 PE=3 SV=2)

HSP 1 Score: 451.8 bits (1161), Expect = 9.7e-126
Identity = 217/346 (62.72%), Postives = 273/346 (78.90%), Query Frame = 1

Query: 76  YSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNSEELYNVISVQGDPLVC 135
           YS   P+RS RRR S R KS  KP LN ++F   IS++PPRF  EEL + I+++ DP +C
Sbjct: 96  YSTSVPTRSLRRRISNRKKSSAKPILNVSKFHETISKLPPRFTPEELADAITLEEDPFLC 155

Query: 136 FELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIY 195
           F LFNWASQQ RF H+  +Y I I+KLG AKMY+EMD++VNQVL+V  IG+E LYN++I+
Sbjct: 156 FHLFNWASQQPRFTHENCSYHIAIRKLGAAKMYQEMDDIVNQVLSVRHIGNENLYNSIIF 215

Query: 196 FFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCL 255
           +FT+A KL RA+NIF+HM  +KN+ CRP+IRTY++LF A L RG N+YIN +YMET+R L
Sbjct: 216 YFTKAGKLIRAVNIFRHMVTSKNLECRPTIRTYHILFKALLGRGNNSYINHVYMETVRSL 275

Query: 256 FRQMVNDGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLC 315
           FRQMV+ GIEPD+F LNC++KGYVLSLHVNDALRIFHQM VVY   PNSF+YDYLIHGLC
Sbjct: 276 FRQMVDSGIEPDVFALNCLVKGYVLSLHVNDALRIFHQMSVVYDCEPNSFTYDYLIHGLC 335

Query: 316 AQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFI 375
           AQ RT NAREL +EMK KGFVP+  SYNS+VNA AL+GE+++AV  LWEMI N R  DFI
Sbjct: 336 AQGRTINARELLSEMKGKGFVPNGKSYNSLVNAFALSGEIDDAVKCLWEMIENGRVVDFI 395

Query: 376 TYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVLEDD 422
           +Y+T++DE CR+G+  EAT LL  ++EK+LVD  +Y KL+ VL  D
Sbjct: 396 SYRTLVDESCRKGKYDEATRLLEMLREKQLVDRDSYDKLVNVLHKD 441

BLAST of Cp4.1LG20g09070 vs. Swiss-Prot
Match: PP254_ARATH (Pentatricopeptide repeat-containing protein At3g25210, mitochondrial OS=Arabidopsis thaliana GN=At3g25210 PE=2 SV=1)

HSP 1 Score: 192.2 bits (487), Expect = 1.4e-47
Identity = 104/324 (32.10%), Postives = 180/324 (55.56%), Query Frame = 1

Query: 95  SRIKPKLN-EAQFQHAISQIPPRFNSEELYNVISVQGDPLVCFELFNWASQQSRFRHDVS 154
           SRI+ +   E QF+  I  + P F + ++   +  Q DP +  ++F W +QQ  ++H+  
Sbjct: 50  SRIRTRTPLETQFETWIQNLKPGFTNSDVVIALRAQSDPDLALDIFRWTAQQRGYKHNHE 109

Query: 155 TYEITIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIYFFTEARKL-TRAINIFKH 214
            Y   IK+    K    ++ ++ +V+A     S  LYN +I F    + L  RA +++  
Sbjct: 110 AYHTMIKQAITGKRNNFVETLIEEVIAGACEMSVPLYNCIIRFCCGRKFLFNRAFDVYNK 169

Query: 215 MQNNKNVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVNDGIEPDIFTLN 274
           M  + +   +P + TY LL ++ L R     +  +Y+  +R L +QM ++G+ PD F LN
Sbjct: 170 MLRSDD--SKPDLETYTLLLSSLLKRFNKLNVCYVYLHAVRSLTKQMKSNGVIPDTFVLN 229

Query: 275 CMIKGYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEMKE 334
            +IK Y   L V++A+R+F +M + Y S PN+++Y YL+ G+C + R         EM+ 
Sbjct: 230 MIIKAYAKCLEVDEAIRVFKEMAL-YGSEPNAYTYSYLVKGVCEKGRVGQGLGFYKEMQV 289

Query: 335 KGFVPSSISYNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGE 394
           KG VP+   Y  ++ ++++   ++EAV  +++M+ N  SPD +TY TVL ELCR GR  E
Sbjct: 290 KGMVPNGSCYMVLICSLSMERRLDEAVEVVYDMLANSLSPDMLTYNTVLTELCRGGRGSE 349

Query: 395 ATSLLREVQEKELVDG-HTYRKLL 416
           A  ++ E ++++ V G   YR L+
Sbjct: 350 ALEMVEEWKKRDPVMGERNYRTLM 370

BLAST of Cp4.1LG20g09070 vs. Swiss-Prot
Match: PP190_ARATH (Pentatricopeptide repeat-containing protein At2g37230 OS=Arabidopsis thaliana GN=At2g37230 PE=2 SV=1)

HSP 1 Score: 131.0 bits (328), Expect = 3.8e-29
Identity = 91/316 (28.80%), Postives = 155/316 (49.05%), Query Frame = 1

Query: 105 QFQHAISQIPPRFNSEELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGE 164
           + Q++I  + P ++   +YNV+          + F W  +    RHD  T+   IK LGE
Sbjct: 103 RLQNSIRDLVPEWDHSLVYNVLHGAKKLEHALQFFRWTERSGLIRHDRDTHMKMIKMLGE 162

Query: 165 AKMYEEMDNVVNQVLAVPSIG---SETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNC 224
                ++++    +L +P  G    E ++  +I  + +A  +  ++ IF+ M   K++  
Sbjct: 163 VS---KLNHARCILLDMPEKGVPWDEDMFVVLIESYGKAGIVQESVKIFQKM---KDLGV 222

Query: 225 RPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVNDGIEPDIFTLNCMIKGYVLS 284
             +I++YN LF   L RGR       YM   R  F +MV++G+EP   T N M+ G+ LS
Sbjct: 223 ERTIKSYNSLFKVILRRGR-------YMMAKR-YFNKMVSEGVEPTRHTYNLMLWGFFLS 282

Query: 285 LHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSIS 344
           L +  ALR F  M     S P+  +++ +I+G C   + D A +L  EMK     PS +S
Sbjct: 283 LRLETALRFFEDMKTRGIS-PDDATFNTMINGFCRFKKMDEAEKLFVEMKGNKIGPSVVS 342

Query: 345 YNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQ 404
           Y +++        V++ +    EM  +   P+  TY T+L  LC  G++ EA ++L+ + 
Sbjct: 343 YTTMIKGYLAVDRVDDGLRIFEEMRSSGIEPNATTYSTLLPGLCDAGKMVEAKNILKNMM 402

Query: 405 EKELV--DGHTYRKLL 416
            K +   D   + KLL
Sbjct: 403 AKHIAPKDNSIFLKLL 403

BLAST of Cp4.1LG20g09070 vs. Swiss-Prot
Match: PP444_ARATH (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 115.9 bits (289), Expect = 1.3e-24
Identity = 67/251 (26.69%), Postives = 127/251 (50.60%), Query Frame = 1

Query: 151 DVSTYEITIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIF 210
           +V +Y I +    +    +E  NV+N++ A     +   +N +I  F +  ++  A+ IF
Sbjct: 423 NVYSYTILVDGFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIF 482

Query: 211 KHMQNNKNVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVNDGIEPDIFT 270
           + M       C+P + T+N L +               ++    L R M+++G+  +  T
Sbjct: 483 REMPRK---GCKPDVYTFNSLISGLCEVDE--------IKHALWLLRDMISEGVVANTVT 542

Query: 271 LNCMIKGYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEM 330
            N +I  ++    + +A ++ ++M V   S  +  +Y+ LI GLC     D AR L  +M
Sbjct: 543 YNTLINAFLRRGEIKEARKLVNEM-VFQGSPLDEITYNSLIKGLCRAGEVDKARSLFEKM 602

Query: 331 KEKGFVPSSISYNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFITYKTVLDELCRRGRV 390
              G  PS+IS N ++N +  +G VEEAV +  EM+    +PD +T+ ++++ LCR GR+
Sbjct: 603 LRDGHAPSNISCNILINGLCRSGMVEEAVEFQKEMVLRGSTPDIVTFNSLINGLCRAGRI 661

Query: 391 GEATSLLREVQ 402
            +  ++ R++Q
Sbjct: 663 EDGLTMFRKLQ 661

BLAST of Cp4.1LG20g09070 vs. Swiss-Prot
Match: PP298_ARATH (Pentatricopeptide repeat-containing protein At4g01400, mitochondrial OS=Arabidopsis thaliana GN=At4g01400 PE=2 SV=2)

HSP 1 Score: 115.2 bits (287), Expect = 2.2e-24
Identity = 91/337 (27.00%), Postives = 164/337 (48.66%), Query Frame = 1

Query: 125 VISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVV--NQVLAVP 184
           +I+ Q DPL+  E+F++ASQQ  FRH  S++ I I KLG  + +  +D+V+  ++    P
Sbjct: 57  LIASQSDPLLAKEIFDYASQQPNFRHSRSSHLILILKLGRGRYFNLIDDVLAKHRSSGYP 116

Query: 185 SIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLS-RGRN 244
             G   ++  +I  + EA+   + ++ F  M      N  P  +  N +    +S RG  
Sbjct: 117 LTGE--IFTYLIKVYAEAKLPEKVLSTFYKMLE---FNFTPQPKHLNRILDVLVSHRG-- 176

Query: 245 AYINLMYMETIRCLFRQMVNDGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVYSSL 304
                 Y++    LF+     G+ P+  + N +++ + L+  ++ A ++F +M +    +
Sbjct: 177 ------YLQKAFELFKSSRLHGVMPNTRSYNLLMQAFCLNDDLSIAYQLFGKM-LERDVV 236

Query: 305 PNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEAVNY 364
           P+  SY  LI G C + + + A EL ++M  KGFVP  +SY +++N++    ++ EA   
Sbjct: 237 PDVDSYKILIQGFCRKGQVNGAMELLDDMLNKGFVPDRLSYTTLLNSLCRKTQLREAYKL 296

Query: 365 LWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQEKEL-VDGHTYRKLLYVLED 424
           L  M     +PD + Y T++   CR  R  +A  +L ++       +  +YR L+  L D
Sbjct: 297 LCRMKLKGCNPDLVHYNTMILGFCREDRAMDARKVLDDMLSNGCSPNSVSYRTLIGGLCD 356

Query: 425 ----DYGNVHALLLSRLLKRVRDKLFHCLNLLVKQLC 454
               D G  +   L  ++ +     F   N LVK  C
Sbjct: 357 QGMFDEGKKY---LEEMISKGFSPHFSVSNCLVKGFC 376

BLAST of Cp4.1LG20g09070 vs. TrEMBL
Match: A0A0A0LS50_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G086930 PE=4 SV=1)

HSP 1 Score: 717.6 bits (1851), Expect = 1.1e-203
Identity = 359/428 (83.88%), Postives = 385/428 (89.95%), Query Frame = 1

Query: 1   MNLLLRIRIQFLRNSINQLCNSIPSNPHFVCLRRFSFHSWLLN-TDHGNFSHIFRLSQIQ 60
           MNLLLRIRI F  NSI+ L NS PS PHFVC RRFS HSW LN T H N  H  R+SQI 
Sbjct: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60

Query: 61  NYPAPVLSFTKFCLNFYSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNS 120
            Y  P LSFT F L FYS  APSRSFR+RA+KRLKS +KPKL+E QFQ A+S+IPPRF S
Sbjct: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120

Query: 121 EELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVL 180
           EEL NVIS+Q DPLVCFELFNWASQQ RFRHD S+YEITIKKLGEAKMYEEMD+VVNQ L
Sbjct: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180

Query: 181 AVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRG 240
           AV SIGSETLYNTMIYFFTEARKLTRA+NIFKHMQNN+N+NCRPSIRTYNLLFTAFLSRG
Sbjct: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240

Query: 241 RNAYINLMYMETIRCLFRQMVN-DGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
           RN YIN MYMETIRCLFRQMVN DGIEPDIF+LNCMIKGYVLSLHVNDALRIFHQMGVVY
Sbjct: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300

Query: 301 SSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEA 360
           S LPNS+S+DYLIHGLCAQARTDNA+ELCNEMKEKGFVPSSISYNSIVNA+ALNGEVE+A
Sbjct: 301 SCLPNSYSFDYLIHGLCAQARTDNAKELCNEMKEKGFVPSSISYNSIVNALALNGEVEDA 360

Query: 361 VNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVL 420
           VNYLWEMI NRRSPDFITYKTVLDELCR+G+V EATSLLRE+QEK+LVDGHTYRKLLYVL
Sbjct: 361 VNYLWEMIDNRRSPDFITYKTVLDELCRQGKVVEATSLLRELQEKDLVDGHTYRKLLYVL 420

Query: 421 EDDYGNVH 427
           EDDYGN++
Sbjct: 421 EDDYGNLN 428

BLAST of Cp4.1LG20g09070 vs. TrEMBL
Match: A0A061GUJ4_THECC (Tetratricopeptide repeat-like superfamily protein, putative OS=Theobroma cacao GN=TCM_041079 PE=4 SV=1)

HSP 1 Score: 561.2 bits (1445), Expect = 1.3e-156
Identity = 286/422 (67.77%), Postives = 343/422 (81.28%), Query Frame = 1

Query: 12  LRNSINQLCNSIPSNPHFVCLRRFSFHSWLLNTDHGNFS---HIFRLSQIQ--NYPAPVL 71
           LRN  ++    I +NP F     +S  S  LN    +F     +  L+QI   +  +P  
Sbjct: 9   LRNFYHKTKIFISTNP-FHNFPYYSVFSSYLNPFIKDFKPKESLLGLTQIDPLSVISPTA 68

Query: 72  SFTKFCLN----FYSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNSEEL 131
           +   FC N    FYS RAPSRSFRRR +KRLK+  KP L++ +F+ A+SQ+ PRF +EEL
Sbjct: 69  NLHPFCYNSFTCFYSTRAPSRSFRRRINKRLKASSKPVLDQPKFEKAVSQLLPRFTAEEL 128

Query: 132 YNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVLAVP 191
            NVI+++ DPLVC+ELFNWA QQ RFRHDVSTY ITIKKLG AKMYEEMD VVNQVLA+ 
Sbjct: 129 CNVITLEEDPLVCWELFNWAVQQPRFRHDVSTYHITIKKLGVAKMYEEMDVVVNQVLALR 188

Query: 192 SIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRGRNA 251
           + GSE LYNT+IYFFTEARKLTRA+NIFKHM+NN+ ++CRPSIRTYN+LFTA LSRGR++
Sbjct: 189 TFGSEPLYNTIIYFFTEARKLTRAVNIFKHMRNNRKLDCRPSIRTYNILFTAMLSRGRDS 248

Query: 252 YINLMYMETIRCLFRQMVNDGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVYSSLP 311
           YIN MYMETIRCLFRQMVNDGIEPD+F+LN MIKGYVLSLHVNDALR+FHQMGVVY  LP
Sbjct: 249 YINHMYMETIRCLFRQMVNDGIEPDVFSLNSMIKGYVLSLHVNDALRVFHQMGVVYKCLP 308

Query: 312 NSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEAVNYL 371
           NS+SYD+LI+GLCAQ RT+NARELCNEMK+ GFVPSS SYNS+VNA+AL+GEVEEA++YL
Sbjct: 309 NSYSYDFLIYGLCAQGRTNNARELCNEMKKNGFVPSSKSYNSLVNALALSGEVEEALHYL 368

Query: 372 WEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVLEDDY 425
            EMI  R+S DFITY+T+LDE+CRRGR  EAT LL+E+Q+K+LVDGHTYRKLLY +EDD+
Sbjct: 369 REMIEKRKSADFITYRTILDEICRRGRAEEATGLLKELQDKDLVDGHTYRKLLYAMEDDF 428

BLAST of Cp4.1LG20g09070 vs. TrEMBL
Match: A0A0D2RRL4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_004G012900 PE=4 SV=1)

HSP 1 Score: 550.4 bits (1417), Expect = 2.2e-153
Identity = 269/382 (70.42%), Postives = 318/382 (83.25%), Query Frame = 1

Query: 43  NTDHGNFSHIFRLSQIQNYPAPVLSFTKFCLNFYSNRAPSRSFRRRASKRLKSRIKPKLN 102
           N  H  F+  F      +  +P     +F   FYS +APSRS+RRR +KRLK+  KP L+
Sbjct: 40  NPQHSPFT--FTQINPSSITSPTSISHQFYTYFYSTKAPSRSYRRRVNKRLKASQKPVLD 99

Query: 103 EAQFQHAISQIPPRFNSEELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKL 162
           +A+FQ  ISQ+PPRF ++ELYNVI+++ DPLVC+ELFNWA+QQ RF+H+VSTY ITIKKL
Sbjct: 100 QAKFQQVISQLPPRFTADELYNVITLEDDPLVCWELFNWAAQQPRFKHNVSTYHITIKKL 159

Query: 163 GEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNCR 222
           G AKMYEEMD VVNQVLA+ S GSE LYNTMIYFF EARKLTRA+NIFKHM+NN+  +CR
Sbjct: 160 GVAKMYEEMDVVVNQVLALRSFGSEPLYNTMIYFFAEARKLTRAVNIFKHMRNNRKFDCR 219

Query: 223 PSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVNDGIEPDIFTLNCMIKGYVLSL 282
           PSIRTYN+LFTA LSRG+++YIN MYMETIRCLFRQMV+DGIEPD+FTLN MIKGYVLSL
Sbjct: 220 PSIRTYNILFTAMLSRGKDSYINHMYMETIRCLFRQMVDDGIEPDVFTLNSMIKGYVLSL 279

Query: 283 HVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSISY 342
           HVNDALR+FHQMGVVY  LPN+FSYDYLI+GLCAQ RT+NARELC+EMK  GF PS  SY
Sbjct: 280 HVNDALRVFHQMGVVYKCLPNAFSYDYLIYGLCAQGRTNNARELCDEMKRNGFTPSGKSY 339

Query: 343 NSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQE 402
           NS+VNA+A+ GEVEEAV+YL EMI  R+S D ITY+T+LDE+CRRGRV EA  LLRE+Q 
Sbjct: 340 NSLVNALAIAGEVEEAVHYLREMIEMRKSADLITYRTILDEICRRGRVEEAMGLLRELQS 399

Query: 403 KELVDGHTYRKLLYVLEDDYGN 425
           K+LVDGHTYRKLLY +ED YG+
Sbjct: 400 KDLVDGHTYRKLLYAMEDSYGD 419

BLAST of Cp4.1LG20g09070 vs. TrEMBL
Match: B9IM01_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0018s10150g PE=4 SV=2)

HSP 1 Score: 538.9 bits (1387), Expect = 6.7e-150
Identity = 257/350 (73.43%), Postives = 310/350 (88.57%), Query Frame = 1

Query: 75  FYSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNSEELYNVISVQGDPLV 134
           FYS +APSRSFR+R +KR K+  +P L+EA+FQ ++SQ+P RF +EEL N I+++ DPLV
Sbjct: 42  FYSTKAPSRSFRKRNNKRAKANSRPILDEAKFQRSVSQLPSRFTNEELCNNITLEDDPLV 101

Query: 135 CFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMI 194
           C ELFNWASQQ RFRHD STY +TIKKLG AKMY+EMD+VVNQ+LAVP IG+E LYN++I
Sbjct: 102 CLELFNWASQQHRFRHDASTYHVTIKKLGIAKMYQEMDDVVNQLLAVPHIGNEALYNSII 161

Query: 195 YFFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRC 254
           Y+FTEARKLTRA+NIFK M++++N++CRPSI+TYN+L TA LSRGRN+YIN MYMET+RC
Sbjct: 162 YYFTEARKLTRAVNIFKRMKSSRNLDCRPSIKTYNILLTAMLSRGRNSYINHMYMETMRC 221

Query: 255 LFRQMVNDGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGL 314
           LF+QMV+DG+EPDIF+LN MIKGY LSLHVNDALR+FHQMGVVY  LPNSFSYDYL+HGL
Sbjct: 222 LFKQMVDDGVEPDIFSLNSMIKGYALSLHVNDALRVFHQMGVVYKCLPNSFSYDYLVHGL 281

Query: 315 CAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDF 374
           CAQ RT+NAREL +EMKEKGFV S+ S+NS+VNA+AL GEV EAVNYLWEMI   RS D 
Sbjct: 282 CAQGRTNNARELFDEMKEKGFVLSNKSFNSLVNALALGGEVGEAVNYLWEMIDKHRSVDL 341

Query: 375 ITYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVLEDDYGN 425
           ITYKTVLDE+CR+GR+GEATSLL+E QEK+LVDG TYR+LL+VLEDD+GN
Sbjct: 342 ITYKTVLDEICRQGRIGEATSLLKEWQEKDLVDGITYRELLHVLEDDFGN 391

BLAST of Cp4.1LG20g09070 vs. TrEMBL
Match: A0A0B2P955_GLYSO (Pentatricopeptide repeat-containing protein, mitochondrial OS=Glycine soja GN=glysoja_024687 PE=4 SV=1)

HSP 1 Score: 534.6 bits (1376), Expect = 1.3e-148
Identity = 262/366 (71.58%), Postives = 309/366 (84.43%), Query Frame = 1

Query: 67  SFTKFCLNF------YSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNSE 126
           S+T+F + F      YS +APSRS++RRA KRL    KP L++AQFQ A+SQ+PPRF  E
Sbjct: 72  SYTQFPIPFAPFNSHYSTKAPSRSYQRRARKRLLKSSKPTLDQAQFQLALSQLPPRFTPE 131

Query: 127 ELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVLA 186
           EL NVI+ Q DPLVC ELF+WASQQ RFRHDVST+ ITIKKLG AKMY+EMD++VNQ+LA
Sbjct: 132 ELCNVIARQNDPLVCLELFHWASQQPRFRHDVSTFHITIKKLGAAKMYQEMDDIVNQLLA 191

Query: 187 VPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNC--RPSIRTYNLLFTAFLSR 246
           VP IGSE L+N +IY+FT+ARKLTRA+N+FKHM++ +N+NC  RPSIRTYN+LF AFL R
Sbjct: 192 VPLIGSEALFNMVIYYFTQARKLTRAVNVFKHMKSRRNLNCFFRPSIRTYNILFAAFLGR 251

Query: 247 GRNAYINLMYMETIRCLFRQMVNDGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVY 306
           G N+YIN +YMETIRCLFRQMV DGI+PDIF+LN MIKGYVLSLHVNDALRIFHQMGV+Y
Sbjct: 252 GSNSYINHVYMETIRCLFRQMVKDGIKPDIFSLNSMIKGYVLSLHVNDALRIFHQMGVIY 311

Query: 307 SSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEA 366
              PN+ +YD LIHGLCAQ RT+NA+EL +EMK KGFVPSS SYNS+VN++AL GE+EEA
Sbjct: 312 DCPPNALTYDCLIHGLCAQGRTNNAKELYSEMKTKGFVPSSKSYNSLVNSLALGGEIEEA 371

Query: 367 VNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVL 425
           VNYLWEM   +RS DFITYKTVLDE+CRRG V E T  L+E+QEK+LVDGH YRKLLYVL
Sbjct: 372 VNYLWEMTDKQRSADFITYKTVLDEICRRGTVQEGTRFLQELQEKDLVDGHAYRKLLYVL 431

BLAST of Cp4.1LG20g09070 vs. TAIR10
Match: AT2G27800.1 (AT2G27800.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 451.8 bits (1161), Expect = 5.5e-127
Identity = 217/346 (62.72%), Postives = 273/346 (78.90%), Query Frame = 1

Query: 76  YSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNSEELYNVISVQGDPLVC 135
           YS   P+RS RRR S R KS  KP LN ++F   IS++PPRF  EEL + I+++ DP +C
Sbjct: 96  YSTSVPTRSLRRRISNRKKSSAKPILNVSKFHETISKLPPRFTPEELADAITLEEDPFLC 155

Query: 136 FELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIY 195
           F LFNWASQQ RF H+  +Y I I+KLG AKMY+EMD++VNQVL+V  IG+E LYN++I+
Sbjct: 156 FHLFNWASQQPRFTHENCSYHIAIRKLGAAKMYQEMDDIVNQVLSVRHIGNENLYNSIIF 215

Query: 196 FFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCL 255
           +FT+A KL RA+NIF+HM  +KN+ CRP+IRTY++LF A L RG N+YIN +YMET+R L
Sbjct: 216 YFTKAGKLIRAVNIFRHMVTSKNLECRPTIRTYHILFKALLGRGNNSYINHVYMETVRSL 275

Query: 256 FRQMVNDGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLC 315
           FRQMV+ GIEPD+F LNC++KGYVLSLHVNDALRIFHQM VVY   PNSF+YDYLIHGLC
Sbjct: 276 FRQMVDSGIEPDVFALNCLVKGYVLSLHVNDALRIFHQMSVVYDCEPNSFTYDYLIHGLC 335

Query: 316 AQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFI 375
           AQ RT NAREL +EMK KGFVP+  SYNS+VNA AL+GE+++AV  LWEMI N R  DFI
Sbjct: 336 AQGRTINARELLSEMKGKGFVPNGKSYNSLVNAFALSGEIDDAVKCLWEMIENGRVVDFI 395

Query: 376 TYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVLEDD 422
           +Y+T++DE CR+G+  EAT LL  ++EK+LVD  +Y KL+ VL  D
Sbjct: 396 SYRTLVDESCRKGKYDEATRLLEMLREKQLVDRDSYDKLVNVLHKD 441

BLAST of Cp4.1LG20g09070 vs. TAIR10
Match: AT5G27300.1 (AT5G27300.1 pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 218.8 bits (556), Expect = 7.8e-57
Identity = 121/298 (40.60%), Postives = 173/298 (58.05%), Query Frame = 1

Query: 76  YSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNSEELYNVISVQGDPLVC 135
           YS   P+RS RRR S R KS  KP LNE++FQ  IS++PPRF  EEL + I+++ DP +C
Sbjct: 89  YSTSVPTRSLRRRISSRKKSSTKPILNESKFQETISKLPPRFTPEELADAITLEEDPFLC 148

Query: 136 FELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIY 195
           F LFNWASQQ RF H+  +Y                       +A+  +G+         
Sbjct: 149 FHLFNWASQQPRFTHENCSYH----------------------IAIRKLGA--------- 208

Query: 196 FFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCL 255
              ++ KL RA+NIF+HM N++N+ CRP++RTY++LF A L RG N++IN +YMET+R L
Sbjct: 209 --AKSGKLIRAVNIFRHMVNSRNLECRPTMRTYHILFKALLGRGNNSFINHLYMETVRSL 268

Query: 256 FRQMVNDGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLC 315
           FRQMV+ GIEPD+F LNC++KG   +++  + L      G V    PN  SY+ L++   
Sbjct: 269 FRQMVDSGIEPDVFALNCLVKG--RTINTRELLSEMKGKGFV----PNGKSYNSLVNAFA 328

Query: 316 AQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEAVNYLWEMIGNRRSPD 374
                D+A +   EM E G V   ISY ++V+     G+ +EA   L EM+  ++  D
Sbjct: 329 LSGEIDDAVKCLWEMIENGRVVDFISYRTLVDESCRKGKYDEATRLL-EMLREKQLVD 346

BLAST of Cp4.1LG20g09070 vs. TAIR10
Match: AT3G25210.1 (AT3G25210.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 192.2 bits (487), Expect = 7.8e-49
Identity = 104/324 (32.10%), Postives = 180/324 (55.56%), Query Frame = 1

Query: 95  SRIKPKLN-EAQFQHAISQIPPRFNSEELYNVISVQGDPLVCFELFNWASQQSRFRHDVS 154
           SRI+ +   E QF+  I  + P F + ++   +  Q DP +  ++F W +QQ  ++H+  
Sbjct: 50  SRIRTRTPLETQFETWIQNLKPGFTNSDVVIALRAQSDPDLALDIFRWTAQQRGYKHNHE 109

Query: 155 TYEITIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIYFFTEARKL-TRAINIFKH 214
            Y   IK+    K    ++ ++ +V+A     S  LYN +I F    + L  RA +++  
Sbjct: 110 AYHTMIKQAITGKRNNFVETLIEEVIAGACEMSVPLYNCIIRFCCGRKFLFNRAFDVYNK 169

Query: 215 MQNNKNVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVNDGIEPDIFTLN 274
           M  + +   +P + TY LL ++ L R     +  +Y+  +R L +QM ++G+ PD F LN
Sbjct: 170 MLRSDD--SKPDLETYTLLLSSLLKRFNKLNVCYVYLHAVRSLTKQMKSNGVIPDTFVLN 229

Query: 275 CMIKGYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEMKE 334
            +IK Y   L V++A+R+F +M + Y S PN+++Y YL+ G+C + R         EM+ 
Sbjct: 230 MIIKAYAKCLEVDEAIRVFKEMAL-YGSEPNAYTYSYLVKGVCEKGRVGQGLGFYKEMQV 289

Query: 335 KGFVPSSISYNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGE 394
           KG VP+   Y  ++ ++++   ++EAV  +++M+ N  SPD +TY TVL ELCR GR  E
Sbjct: 290 KGMVPNGSCYMVLICSLSMERRLDEAVEVVYDMLANSLSPDMLTYNTVLTELCRGGRGSE 349

Query: 395 ATSLLREVQEKELVDG-HTYRKLL 416
           A  ++ E ++++ V G   YR L+
Sbjct: 350 ALEMVEEWKKRDPVMGERNYRTLM 370

BLAST of Cp4.1LG20g09070 vs. TAIR10
Match: AT2G37230.1 (AT2G37230.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 131.0 bits (328), Expect = 2.1e-30
Identity = 91/316 (28.80%), Postives = 155/316 (49.05%), Query Frame = 1

Query: 105 QFQHAISQIPPRFNSEELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGE 164
           + Q++I  + P ++   +YNV+          + F W  +    RHD  T+   IK LGE
Sbjct: 103 RLQNSIRDLVPEWDHSLVYNVLHGAKKLEHALQFFRWTERSGLIRHDRDTHMKMIKMLGE 162

Query: 165 AKMYEEMDNVVNQVLAVPSIG---SETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNC 224
                ++++    +L +P  G    E ++  +I  + +A  +  ++ IF+ M   K++  
Sbjct: 163 VS---KLNHARCILLDMPEKGVPWDEDMFVVLIESYGKAGIVQESVKIFQKM---KDLGV 222

Query: 225 RPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVNDGIEPDIFTLNCMIKGYVLS 284
             +I++YN LF   L RGR       YM   R  F +MV++G+EP   T N M+ G+ LS
Sbjct: 223 ERTIKSYNSLFKVILRRGR-------YMMAKR-YFNKMVSEGVEPTRHTYNLMLWGFFLS 282

Query: 285 LHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSIS 344
           L +  ALR F  M     S P+  +++ +I+G C   + D A +L  EMK     PS +S
Sbjct: 283 LRLETALRFFEDMKTRGIS-PDDATFNTMINGFCRFKKMDEAEKLFVEMKGNKIGPSVVS 342

Query: 345 YNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQ 404
           Y +++        V++ +    EM  +   P+  TY T+L  LC  G++ EA ++L+ + 
Sbjct: 343 YTTMIKGYLAVDRVDDGLRIFEEMRSSGIEPNATTYSTLLPGLCDAGKMVEAKNILKNMM 402

Query: 405 EKELV--DGHTYRKLL 416
            K +   D   + KLL
Sbjct: 403 AKHIAPKDNSIFLKLL 403

BLAST of Cp4.1LG20g09070 vs. TAIR10
Match: AT5G64320.1 (AT5G64320.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 115.9 bits (289), Expect = 7.1e-26
Identity = 67/251 (26.69%), Postives = 127/251 (50.60%), Query Frame = 1

Query: 151 DVSTYEITIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIYFFTEARKLTRAINIF 210
           +V +Y I +    +    +E  NV+N++ A     +   +N +I  F +  ++  A+ IF
Sbjct: 423 NVYSYTILVDGFCKLGKIDEAYNVLNEMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIF 482

Query: 211 KHMQNNKNVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVNDGIEPDIFT 270
           + M       C+P + T+N L +               ++    L R M+++G+  +  T
Sbjct: 483 REMPRK---GCKPDVYTFNSLISGLCEVDE--------IKHALWLLRDMISEGVVANTVT 542

Query: 271 LNCMIKGYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNARELCNEM 330
            N +I  ++    + +A ++ ++M V   S  +  +Y+ LI GLC     D AR L  +M
Sbjct: 543 YNTLINAFLRRGEIKEARKLVNEM-VFQGSPLDEITYNSLIKGLCRAGEVDKARSLFEKM 602

Query: 331 KEKGFVPSSISYNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFITYKTVLDELCRRGRV 390
              G  PS+IS N ++N +  +G VEEAV +  EM+    +PD +T+ ++++ LCR GR+
Sbjct: 603 LRDGHAPSNISCNILINGLCRSGMVEEAVEFQKEMVLRGSTPDIVTFNSLINGLCRAGRI 661

Query: 391 GEATSLLREVQ 402
            +  ++ R++Q
Sbjct: 663 EDGLTMFRKLQ 661

BLAST of Cp4.1LG20g09070 vs. NCBI nr
Match: gi|659072149|ref|XP_008463631.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucumis melo])

HSP 1 Score: 721.5 bits (1861), Expect = 1.1e-204
Identity = 363/428 (84.81%), Postives = 386/428 (90.19%), Query Frame = 1

Query: 1   MNLLLRIRIQFLRNSINQLCNSIPSNPHFVCLRRFSFHSWLLN-TDHGNFSHIFRLSQIQ 60
           MN L RIRI F  NSIN L NS PS PHF+C RRFS HS  LN T H N +H  R+SQIQ
Sbjct: 1   MNPLFRIRIHFSTNSINHLFNSNPSYPHFICFRRFSIHSLSLNNTHHCNLTHFLRVSQIQ 60

Query: 61  NYPAPVLSFTKFCLNFYSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNS 120
            YPAP LSFTKFCLNFYS  APSRSFRRRA+KRLK+ +KP L+EAQFQ A+S+IPPRF  
Sbjct: 61  TYPAPNLSFTKFCLNFYSKTAPSRSFRRRANKRLKASLKPTLDEAQFQLAVSKIPPRFTP 120

Query: 121 EELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVL 180
           EEL NVIS+Q DPLVCFELFNWASQQ RF+HDVS+YEITIKKLGEAKMYEEMD+VVNQ L
Sbjct: 121 EELRNVISLQKDPLVCFELFNWASQQPRFKHDVSSYEITIKKLGEAKMYEEMDHVVNQAL 180

Query: 181 AVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRG 240
           AV SIGSETLYNTMIYFFTEARKLTRAINIFKHMQNN+N+NCRPSIRTYNLLFTAFLSRG
Sbjct: 181 AVSSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240

Query: 241 RNAYINLMYMETIRCLFRQMVN-DGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
           RN YIN +YMETIRCLFRQMVN DGIEPDIF LNCMIKGYVLSLHVNDALRIFHQMGVVY
Sbjct: 241 RNTYINHVYMETIRCLFRQMVNDDGIEPDIFALNCMIKGYVLSLHVNDALRIFHQMGVVY 300

Query: 301 SSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEA 360
           S LPNS+SYDYLIHGL AQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVE+A
Sbjct: 301 SCLPNSYSYDYLIHGLSAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEDA 360

Query: 361 VNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVL 420
           VNYLWEMI +RRSPDFITYKTVLDELCR GRV EATSLLRE+QEK+LVDGHTYRKLLYVL
Sbjct: 361 VNYLWEMIDHRRSPDFITYKTVLDELCRLGRVVEATSLLRELQEKDLVDGHTYRKLLYVL 420

Query: 421 EDDYGNVH 427
           EDDYGN++
Sbjct: 421 EDDYGNLN 428

BLAST of Cp4.1LG20g09070 vs. NCBI nr
Match: gi|449459126|ref|XP_004147297.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 717.6 bits (1851), Expect = 1.5e-203
Identity = 359/428 (83.88%), Postives = 385/428 (89.95%), Query Frame = 1

Query: 1   MNLLLRIRIQFLRNSINQLCNSIPSNPHFVCLRRFSFHSWLLN-TDHGNFSHIFRLSQIQ 60
           MNLLLRIRI F  NSI+ L NS PS PHFVC RRFS HSW LN T H N  H  R+SQI 
Sbjct: 1   MNLLLRIRIHFSTNSIDHLFNSNPSYPHFVCFRRFSIHSWWLNNTHHYNLPHFLRVSQIH 60

Query: 61  NYPAPVLSFTKFCLNFYSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNS 120
            Y  P LSFT F L FYS  APSRSFR+RA+KRLKS +KPKL+E QFQ A+S+IPPRF S
Sbjct: 61  PYSGPNLSFTNFLLKFYSRAAPSRSFRKRANKRLKSSLKPKLDETQFQLAVSKIPPRFTS 120

Query: 121 EELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVL 180
           EEL NVIS+Q DPLVCFELFNWASQQ RFRHD S+YEITIKKLGEAKMYEEMD+VVNQ L
Sbjct: 121 EELCNVISLQRDPLVCFELFNWASQQPRFRHDDSSYEITIKKLGEAKMYEEMDHVVNQAL 180

Query: 181 AVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRG 240
           AV SIGSETLYNTMIYFFTEARKLTRA+NIFKHMQNN+N+NCRPSIRTYNLLFTAFLSRG
Sbjct: 181 AVSSIGSETLYNTMIYFFTEARKLTRAVNIFKHMQNNRNLNCRPSIRTYNLLFTAFLSRG 240

Query: 241 RNAYINLMYMETIRCLFRQMVN-DGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVY 300
           RN YIN MYMETIRCLFRQMVN DGIEPDIF+LNCMIKGYVLSLHVNDALRIFHQMGVVY
Sbjct: 241 RNTYINHMYMETIRCLFRQMVNDDGIEPDIFSLNCMIKGYVLSLHVNDALRIFHQMGVVY 300

Query: 301 SSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEA 360
           S LPNS+S+DYLIHGLCAQARTDNA+ELCNEMKEKGFVPSSISYNSIVNA+ALNGEVE+A
Sbjct: 301 SCLPNSYSFDYLIHGLCAQARTDNAKELCNEMKEKGFVPSSISYNSIVNALALNGEVEDA 360

Query: 361 VNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVL 420
           VNYLWEMI NRRSPDFITYKTVLDELCR+G+V EATSLLRE+QEK+LVDGHTYRKLLYVL
Sbjct: 361 VNYLWEMIDNRRSPDFITYKTVLDELCRQGKVVEATSLLRELQEKDLVDGHTYRKLLYVL 420

Query: 421 EDDYGNVH 427
           EDDYGN++
Sbjct: 421 EDDYGNLN 428

BLAST of Cp4.1LG20g09070 vs. NCBI nr
Match: gi|645217145|ref|XP_008224504.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Prunus mume])

HSP 1 Score: 574.3 bits (1479), Expect = 2.1e-160
Identity = 289/428 (67.52%), Postives = 345/428 (80.61%), Query Frame = 1

Query: 8   RIQFLRNSINQLCNSIPSNPHFVCLRRFSFHSWLLNTDHGNFSHIFRLSQIQNYPAPVLS 67
           R  ++  + ++LC     + +    +R +  S+  N  +     I  LSQ Q  P P ++
Sbjct: 10  RNSYMNRNFSRLCYEHSFHQYLSNPQRMAVLSYSSNPIYSTLRPIQYLSQTQIDPVPKIA 69

Query: 68  -------FTKFCL-NFYSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNS 127
                  + +F L NFYS + PSRSFRRR S+R+KS  K  L+E QFQ AISQ+ PRF  
Sbjct: 70  SNGFLGIYERFLLYNFYSTKPPSRSFRRRESRRVKSS-KSTLDEVQFQRAISQLLPRFTP 129

Query: 128 EELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVL 187
           EEL NVI+ Q DP+VC ELFNWASQQ RF+HDVSTY IT+KK+G AKMYEEMD+VVNQVL
Sbjct: 130 EELCNVITQQDDPIVCLELFNWASQQPRFKHDVSTYHITVKKVGVAKMYEEMDDVVNQVL 189

Query: 188 AVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRG 247
           A+  IGSE LYN++IYFFTEARKLTRA+NIFKHMQN++N+NCRPSIRTYN+LFTAFLSRG
Sbjct: 190 AISYIGSEALYNSIIYFFTEARKLTRAVNIFKHMQNSRNLNCRPSIRTYNILFTAFLSRG 249

Query: 248 RNAYINLMYMETIRCLFRQMVNDGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVYS 307
            N+YIN MYMETIRCLFRQMV+DGIEPDI++LN MIKGYVLSLHVNDALRIFHQMGVVY+
Sbjct: 250 SNSYINHMYMETIRCLFRQMVDDGIEPDIYSLNSMIKGYVLSLHVNDALRIFHQMGVVYN 309

Query: 308 SLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEAV 367
            LPNSFSYDYLIHGLC+Q RT+NA++LCNEMK KGF+PSS SYNS+VN +ALNGEVEEAV
Sbjct: 310 CLPNSFSYDYLIHGLCSQGRTNNAKQLCNEMKSKGFIPSSKSYNSLVNGLALNGEVEEAV 369

Query: 368 NYLWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVLE 427
            YLWEMI  +RS +FITY+TVLDE+CR+GRVGEA  LL+E QEK+L++GHTYRKLLYVLE
Sbjct: 370 KYLWEMIEKQRSAEFITYRTVLDEICRQGRVGEAMRLLKEFQEKDLLNGHTYRKLLYVLE 429

BLAST of Cp4.1LG20g09070 vs. NCBI nr
Match: gi|802551557|ref|XP_012064863.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Jatropha curcas])

HSP 1 Score: 567.0 bits (1460), Expect = 3.3e-158
Identity = 284/428 (66.36%), Postives = 340/428 (79.44%), Query Frame = 1

Query: 8   RIQFLRN---SINQLCNSIPSNPHFVCLRRFSFHSWLLNTDHGNFSHIFRLSQIQNYPAP 67
           +I F+ N   S N   +S  + P    L  +S +S L  +   +F  +  LSQ +  P  
Sbjct: 15  KIYFINNPCFSYNSKGSSNFAFPFLTSLNHYSVNSCLFKSSINSFKFVNILSQNRIGPVS 74

Query: 68  VLS--------FTKFCLNFYSNRAPSRSFRRRASKRLKSRIKPKLNEAQFQHAISQIPPR 127
           +L         F  F  +FYS R PSRS RRR SKRLK+  KP L+E +FQ AI+++PPR
Sbjct: 75  ILPSNAKLQGFFGGFLFSFYSTRVPSRSLRRRQSKRLKASRKPILDETKFQEAIAKLPPR 134

Query: 128 FNSEELYNVISVQGDPLVCFELFNWASQQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVN 187
           FN++EL  V++++ D LVCFE+FNWASQQ RFRHD STY I IKKLG AKMY+EMD+VVN
Sbjct: 135 FNNDELCYVLTLEEDTLVCFEIFNWASQQHRFRHDTSTYHIIIKKLGVAKMYQEMDDVVN 194

Query: 188 QVLAVPSIGSETLYNTMIYFFTEARKLTRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFL 247
           QVLA+P IG+E LYNT+IYFFTEARKLTRA+NIF HM+N +N+ CRPSIRTYN+LFTA L
Sbjct: 195 QVLAIPHIGNEALYNTIIYFFTEARKLTRAVNIFNHMKNGRNLECRPSIRTYNILFTAML 254

Query: 248 SRGRNAYINLMYMETIRCLFRQMVNDGIEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGV 307
           SRG+N+YIN +YMETIRCLF+QMVNDGIEPDI++LN MIKGYVLSLHVNDALRIFHQMGV
Sbjct: 255 SRGKNSYINYVYMETIRCLFKQMVNDGIEPDIYSLNSMIKGYVLSLHVNDALRIFHQMGV 314

Query: 308 VYSSLPNSFSYDYLIHGLCAQARTDNARELCNEMKEKGFVPSSISYNSIVNAMALNGEVE 367
           VY  LPNSFSYDYLIHGLCAQ RT+NA ELC+EMK KGFVPSS SYNS+VNA+AL GEVE
Sbjct: 315 VYQCLPNSFSYDYLIHGLCAQGRTNNALELCDEMKRKGFVPSSKSYNSLVNALALIGEVE 374

Query: 368 EAVNYLWEMIGNRRSPDFITYKTVLDELCRRGRVGEATSLLREVQEKELVDGHTYRKLLY 425
           EAVNYLWEMI  ++ PDFITY+TVLDE+CR+G++GEA +LL+E +EK  VDG TYRKLLY
Sbjct: 375 EAVNYLWEMIEKQKLPDFITYRTVLDEMCRQGKLGEARNLLKEWEEKHFVDGPTYRKLLY 434

BLAST of Cp4.1LG20g09070 vs. NCBI nr
Match: gi|1009115705|ref|XP_015874374.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-like [Ziziphus jujuba])

HSP 1 Score: 562.0 bits (1447), Expect = 1.1e-156
Identity = 285/401 (71.07%), Postives = 326/401 (81.30%), Query Frame = 1

Query: 32  LRRFSFHSWLLNTDHGNFSHIFRLSQIQNYPA-PVLSFTKFCL-------NFYSNRAPSR 91
           LRRF   S   N        +  L++   YPA P  S    C+       +FYS+R  SR
Sbjct: 35  LRRFIVFSCNFNLIGNGLRQMHNLARTGTYPASPTASHGLLCVYARFSLYSFYSSRPSSR 94

Query: 92  SFRRRASKRLKSRIKPKLNEAQFQHAISQIPPRFNSEELYNVISVQGDPLVCFELFNWAS 151
           SFRRRA KRLK+   P L+EAQFQ  +SQ+ PRF  EEL NVIS Q DP++C ELFNWA+
Sbjct: 95  SFRRRARKRLKANNVPSLDEAQFQKVVSQLLPRFTPEELCNVISQQDDPILCLELFNWAT 154

Query: 152 QQSRFRHDVSTYEITIKKLGEAKMYEEMDNVVNQVLAVPSIGSETLYNTMIYFFTEARKL 211
            Q RF+HDVSTY  TIKKLG AKMY+EMD+VVNQVLAV SIGSE LYNT+IYFFTEARKL
Sbjct: 155 HQPRFKHDVSTYHTTIKKLGVAKMYQEMDDVVNQVLAVSSIGSEALYNTIIYFFTEARKL 214

Query: 212 TRAINIFKHMQNNKNVNCRPSIRTYNLLFTAFLSRGRNAYINLMYMETIRCLFRQMVNDG 271
           TRAINIFKHM++++ ++CRPSIRTYN+LF A LS G N+YIN MYMETIR LFRQMV+DG
Sbjct: 215 TRAINIFKHMRSSRKLDCRPSIRTYNILFAALLSWGSNSYINHMYMETIRRLFRQMVDDG 274

Query: 272 IEPDIFTLNCMIKGYVLSLHVNDALRIFHQMGVVYSSLPNSFSYDYLIHGLCAQARTDNA 331
           IEPDIF+LN MIKGYVLSLHVNDALRIFHQMGVVY  LPNSFSYDYLIHGLCAQ RT+NA
Sbjct: 275 IEPDIFSLNSMIKGYVLSLHVNDALRIFHQMGVVYKCLPNSFSYDYLIHGLCAQGRTNNA 334

Query: 332 RELCNEMKEKGFVPSSISYNSIVNAMALNGEVEEAVNYLWEMIGNRRSPDFITYKTVLDE 391
           RELCNEMK KGFVPSS SYNS+VNA+AL G+VEEA+ YLWEMI ++RS DFITY+TVLDE
Sbjct: 335 RELCNEMKNKGFVPSSKSYNSLVNALALGGDVEEAMKYLWEMIEHQRSADFITYRTVLDE 394

Query: 392 LCRRGRVGEATSLLREVQEKELVDGHTYRKLLYVLEDDYGN 425
           +CRRGRVG+A SLL+E QEK+LVDGHTYRKLLYVLEDD+G+
Sbjct: 395 ICRRGRVGQAMSLLKEFQEKDLVDGHTYRKLLYVLEDDFGS 435

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP173_ARATH9.7e-12662.72Pentatricopeptide repeat-containing protein At2g27800, mitochondrial OS=Arabidop... [more]
PP254_ARATH1.4e-4732.10Pentatricopeptide repeat-containing protein At3g25210, mitochondrial OS=Arabidop... [more]
PP190_ARATH3.8e-2928.80Pentatricopeptide repeat-containing protein At2g37230 OS=Arabidopsis thaliana GN... [more]
PP444_ARATH1.3e-2426.69Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop... [more]
PP298_ARATH2.2e-2427.00Pentatricopeptide repeat-containing protein At4g01400, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A0A0LS50_CUCSA1.1e-20383.88Uncharacterized protein OS=Cucumis sativus GN=Csa_1G086930 PE=4 SV=1[more]
A0A061GUJ4_THECC1.3e-15667.77Tetratricopeptide repeat-like superfamily protein, putative OS=Theobroma cacao G... [more]
A0A0D2RRL4_GOSRA2.2e-15370.42Uncharacterized protein OS=Gossypium raimondii GN=B456_004G012900 PE=4 SV=1[more]
B9IM01_POPTR6.7e-15073.43Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0018s10150g PE=4 SV=2[more]
A0A0B2P955_GLYSO1.3e-14871.58Pentatricopeptide repeat-containing protein, mitochondrial OS=Glycine soja GN=gl... [more]
Match NameE-valueIdentityDescription
AT2G27800.15.5e-12762.72 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G27300.17.8e-5740.60 pentatricopeptide (PPR) repeat-containing protein[more]
AT3G25210.17.8e-4932.10 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G37230.12.1e-3028.80 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G64320.17.1e-2626.69 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|659072149|ref|XP_008463631.1|1.1e-20484.81PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-... [more]
gi|449459126|ref|XP_004147297.1|1.5e-20383.88PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-... [more]
gi|645217145|ref|XP_008224504.1|2.1e-16067.52PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-... [more]
gi|802551557|ref|XP_012064863.1|3.3e-15866.36PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-... [more]
gi|1009115705|ref|XP_015874374.1|1.1e-15671.07PREDICTED: pentatricopeptide repeat-containing protein At2g27800, mitochondrial-... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g09070.1Cp4.1LG20g09070.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 269..294
score: 0
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 371..399
score: 5.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 302..349
score: 7.0
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 190..235
score: 9.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 305..338
score: 5.6E-7coord: 269..294
score: 0.0021coord: 189..217
score: 0.0017coord: 375..405
score: 0.0017coord: 340..373
score: 0.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 151..181
score: 5.557coord: 373..407
score: 9.635coord: 338..372
score: 9.525coord: 303..337
score: 11.871coord: 224..266
score: 5.601coord: 186..216
score: 7.783coord: 267..297
score: 7
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 58..434
score: 1.7E
NoneNo IPR availablePANTHERPTHR24015:SF658SUBFAMILY NOT NAMEDcoord: 58..434
score: 1.7E

The following gene(s) are paralogous to this gene:

None