Cp4.1LG17g10180 (gene) Cucurbita pepo (Zucchini)

NameCp4.1LG17g10180
Typegene
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCp4.1LG17 : 7655070 .. 7657652 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATAGCTTTAAGATTTAAGATTTAAGGGTGGAGGCTTATGAGCGTCCCGATGGAAACTTGTGTTCTCCATTGTTGTTACGACCCCTTCCGACTCCGATCCAAACACCCTTCTTCTTCTCCACACCCTTTTGACCTTAATAAAGGCAGCAGCTGCCTCTCCTCCAGAATCCCACTCCCAGAGCCCTACACCAAGCTTCTTCTATCCCTACCCACTCGCGCAGTTCCAACGCATAGTCCTACAAAGCATGCGACTCTATTAGTCGACACTTTTCACGAACACCATACCCTAAAAGCTTTGCTCTCCGACCTCCAGAAAAGGGATTCCTGTCCTTTGCTATTACTTAGACACCATGGAGATTGGAACAGAGACCATTTCTGGCTCGTTGTTAAATTCCTCAGAAATGCTTCAAGATCCCATGAAATCCTTAAGGTGCGGCTTGTCTCTAAGATTTTAGATTCAAATATTCTTTGGCTTCTCTTAGTTTTATATGTAACGATTTAATTCTATGCTAAAATGTTCAAATCAATCTCATAAGCGGAGTGGAAAAGAACCAGAAAACAAACAAAATCTAAAGCAATTACAAACAAAGGAGCACGCCCAAGAAACTAAAAGACCCATCAAACTAAAGCTCAACTCCGAAGCAAACTCCAACCACAACAGTTGAACATAAGAATAAAAAAGAGCCAAACAAGAGAGCTAAGACCTCCAAAAATCAATAATCCAACTGAGAAACTTCAAGAGCGAAAGCCCCAAAACCCAAACTACCAAAATAGAATGCAAAGATTCTTGTGTTAATGAAGCTTTACATTAATCTTCTGTAAACCAGGAAACGTTGATAGAAATAAGATTCTTATATCCTACTTTTTTCAAATCACAATGTATTTTCTTACGGTTTTTTCCGGGATGCTACAGGGGTTGATTGTTGATTACCATTTAAGATCTTTATAATGGCTTGGATGTAAGTGCTTTTTGCAAGTTCTAGTGCCTGTGACCAATGAAATTTTGAAACTTTGTGCAGCTATTTGATACATGGAAGAGCATCGAGGGGTCGCGCATCAGTGAGAGTAACTATGAGAAAGTAATAGTTCTATTGAGTCAAGATGGTCTTATGGAGGATGCTGTATCAGCATTTCAAGATATGAAAAGTCTTGGCCTTCGACCATCTTTGGGTACTTACAATACGCTCATCCATGGTTTTGCTGCAAGGGGTAAGTTTGAAATTGCTATGCTTTTCATTGATGAGATGAAAGAAATCAATATGACTCGAGAAACTGATACTTATGATGGCCTAATTGAAGCCTATGGAAAATATAGAATGTACGATGAGATGATCGAGTGTCTAAAACAGATGGAACTGGATGGATGTTTTCCGGACCACATAACTTACAATTTGCTTATCAGAGAGTTCTCCAAAGGTGGTCTGCTTAAAAAGATGGAAGGATTATACCGAAGCATTCTTTCAAAAAGAATGGATTTACAGTCTTCTACCTTGGTTGCCATGTTGGAAGCTTATGCCAAATTTGGTATCTTGGATAAAATGGAAATGTTCTACAGAAGGATCCTGAACTCAAAGACCAATATGAAGGAGGATCTAATCAGAACGTTGGCTCTAGTTTATATTCAGAACCATATGTATTCAAGAATAGAGACATTGGGTATCGATCTTCACATAAAAGCTGGGAAGACAGATCTTGTTTGGTGCCTGCTTCTTCTATCACATGCTTGTCTATCAAGTCGAAGGGGTATGGACTCTGTTGTTCAGGAAATGGACAAAGCTAAAGAAATTTGGAATGTGACTGTTGCAAACATTTTACTTCTAGCTTATTTGAAAATGAAAGATTTTAAACGGCTCAGAACACTGTTCTCTGAGATACGAGCAAGACATGTGAAACCTGATCTAGTAACCATTGGAATTCTATTAGATGCAAATAACAAAAGTTTTGATGGAACCAGAACTTTAGAGGCATGGAGAAGGATGAACATGCTATCCAGAGCTGTGGAAATAAACACCGACTCGCTTGTTTTAGCTGCATTTGGAAAAGGGAGGTTCCTTAAAGACTGTGAAGAGGCATACGCTTCCCTGGAACTTGTGGGTAGGGAAAGTAAAGTTTGGACTTATGAAGAACTCATTGATCTAGTTTATGAAAACCAGGGGGGAATGGTACTGAAAACCAGATAAAATAGCTTGGTGTGCATGTGCCTGTATCAATTTTTTCTAGGGATTTGACCGCCATTAACAACCAGGAGGGCTTTACAGTGGGGAGTTTTATATGTTTGTTTGTTGTAAATTTGATGTACTTTTAACGGTTATTACTTCACTTGAATCAATGGGTAGTACAGGTAAGATTAGTTGGTATCTTTCTCTGCTAAAGCCCTAGAAAAGCAACTCTATTTCGTTGGATATTATTCTTGTGTGCATCTGATTGAATCAAGTGAATAAGATACTCACAAATTTCTCCTTGATCGTCAACCACAAACACATGTTCGATTTCTCTGTCTGTTTGAGTACCCCATCATTCTTTATCGTACATTT

mRNA sequence

TTNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATAGCTTTAAGATTTAAGATTTAAGGGTGGAGGCTTATGAGCGTCCCGATGGAAACTTGTGTTCTCCATTGTTGTTACGACCCCTTCCGACTCCGATCCAAACACCCTTCTTCTTCTCCACACCCTTTTGACCTTAATAAAGGCAGCAGCTGCCTCTCCTCCAGAATCCCACTCCCAGAGCCCTACACCAAGCTTCTTCTATCCCTACCCACTCGCGCAGTTCCAACGCATAGTCCTACAAAGCATGCGACTCTATTAGTCGACACTTTTCACGAACACCATACCCTAAAAGCTTTGCTCTCCGACCTCCAGAAAAGGGATTCCTGTCCTTTGCTATTACTTAGACACCATGGAGATTGGAACAGAGACCATTTCTGGCTCGTTGTTAAATTCCTCAGAAATGCTTCAAGATCCCATGAAATCCTTAAGCTATTTGATACATGGAAGAGCATCGAGGGGTCGCGCATCAGTGAGAGTAACTATGAGAAAGTAATAGTTCTATTGAGTCAAGATGGTCTTATGGAGGATGCTGTATCAGCATTTCAAGATATGAAAAGTCTTGGCCTTCGACCATCTTTGGGTACTTACAATACGCTCATCCATGGTTTTGCTGCAAGGGGTAAGTTTGAAATTGCTATGCTTTTCATTGATGAGATGAAAGAAATCAATATGACTCGAGAAACTGATACTTATGATGGCCTAATTGAAGCCTATGGAAAATATAGAATGTACGATGAGATGATCGAGTGTCTAAAACAGATGGAACTGGATGGATGTTTTCCGGACCACATAACTTACAATTTGCTTATCAGAGAGTTCTCCAAAGGTGGTCTGCTTAAAAAGATGGAAGGATTATACCGAAGCATTCTTTCAAAAAGAATGGATTTACAGTCTTCTACCTTGGTTGCCATGTTGGAAGCTTATGCCAAATTTGGTATCTTGGATAAAATGGAAATGTTCTACAGAAGGATCCTGAACTCAAAGACCAATATGAAGGAGGATCTAATCAGAACGTTGGCTCTAGTTTATATTCAGAACCATATGTATTCAAGAATAGAGACATTGGGTATCGATCTTCACATAAAAGCTGGGAAGACAGATCTTGTTTGGTGCCTGCTTCTTCTATCACATGCTTGTCTATCAAGTCGAAGGGGTATGGACTCTGTTGTTCAGGAAATGGACAAAGCTAAAGAAATTTGGAATGTGACTGTTGCAAACATTTTACTTCTAGCTTATTTGAAAATGAAAGATTTTAAACGGCTCAGAACACTGTTCTCTGAGATACGAGCAAGACATGTGAAACCTGATCTAGTAACCATTGGAATTCTATTAGATGCAAATAACAAAAGTTTTGATGGAACCAGAACTTTAGAGGCATGGAGAAGGATGAACATGCTATCCAGAGCTGTGGAAATAAACACCGACTCGCTTGTTTTAGCTGCATTTGGAAAAGGGAGGTTCCTTAAAGACTGTGAAGAGGCATACGCTTCCCTGGAACTTGTGGGTAGGGAAAGTAAAGTTTGGACTTATGAAGAACTCATTGATCTAGTTTATGAAAACCAGGGGGGAATGGTACTGAAAACCAGATAAAATAGCTTGGTGTGCATGTGCCTGTATCAATTTTTTCTAGGGATTTGACCGCCATTAACAACCAGGAGGGCTTTACAGTGGGGAGTTTTATATGTTTGTTTGTTGTAAATTTGATGTACTTTTAACGGTTATTACTTCACTTGAATCAATGGGTAGTACAGGTAAGATTAGTTGGTATCTTTCTCTGCTAAAGCCCTAGAAAAGCAACTCTATTTCGTTGGATATTATTCTTGTGTGCATCTGATTGAATCAAGTGAATAAGATACTCACAAATTTCTCCTTGATCGTCAACCACAAACACATGTTCGATTTCTCTGTCTGTTTGAGTACCCCATCATTCTTTATCGTACATTT

Coding sequence (CDS)

ATGAGCGTCCCGATGGAAACTTGTGTTCTCCATTGTTGTTACGACCCCTTCCGACTCCGATCCAAACACCCTTCTTCTTCTCCACACCCTTTTGACCTTAATAAAGGCAGCAGCTGCCTCTCCTCCAGAATCCCACTCCCAGAGCCCTACACCAAGCTTCTTCTATCCCTACCCACTCGCGCAGTTCCAACGCATAGTCCTACAAAGCATGCGACTCTATTAGTCGACACTTTTCACGAACACCATACCCTAAAAGCTTTGCTCTCCGACCTCCAGAAAAGGGATTCCTGTCCTTTGCTATTACTTAGACACCATGGAGATTGGAACAGAGACCATTTCTGGCTCGTTGTTAAATTCCTCAGAAATGCTTCAAGATCCCATGAAATCCTTAAGCTATTTGATACATGGAAGAGCATCGAGGGGTCGCGCATCAGTGAGAGTAACTATGAGAAAGTAATAGTTCTATTGAGTCAAGATGGTCTTATGGAGGATGCTGTATCAGCATTTCAAGATATGAAAAGTCTTGGCCTTCGACCATCTTTGGGTACTTACAATACGCTCATCCATGGTTTTGCTGCAAGGGGTAAGTTTGAAATTGCTATGCTTTTCATTGATGAGATGAAAGAAATCAATATGACTCGAGAAACTGATACTTATGATGGCCTAATTGAAGCCTATGGAAAATATAGAATGTACGATGAGATGATCGAGTGTCTAAAACAGATGGAACTGGATGGATGTTTTCCGGACCACATAACTTACAATTTGCTTATCAGAGAGTTCTCCAAAGGTGGTCTGCTTAAAAAGATGGAAGGATTATACCGAAGCATTCTTTCAAAAAGAATGGATTTACAGTCTTCTACCTTGGTTGCCATGTTGGAAGCTTATGCCAAATTTGGTATCTTGGATAAAATGGAAATGTTCTACAGAAGGATCCTGAACTCAAAGACCAATATGAAGGAGGATCTAATCAGAACGTTGGCTCTAGTTTATATTCAGAACCATATGTATTCAAGAATAGAGACATTGGGTATCGATCTTCACATAAAAGCTGGGAAGACAGATCTTGTTTGGTGCCTGCTTCTTCTATCACATGCTTGTCTATCAAGTCGAAGGGGTATGGACTCTGTTGTTCAGGAAATGGACAAAGCTAAAGAAATTTGGAATGTGACTGTTGCAAACATTTTACTTCTAGCTTATTTGAAAATGAAAGATTTTAAACGGCTCAGAACACTGTTCTCTGAGATACGAGCAAGACATGTGAAACCTGATCTAGTAACCATTGGAATTCTATTAGATGCAAATAACAAAAGTTTTGATGGAACCAGAACTTTAGAGGCATGGAGAAGGATGAACATGCTATCCAGAGCTGTGGAAATAAACACCGACTCGCTTGTTTTAGCTGCATTTGGAAAAGGGAGGTTCCTTAAAGACTGTGAAGAGGCATACGCTTCCCTGGAACTTGTGGGTAGGGAAAGTAAAGTTTGGACTTATGAAGAACTCATTGATCTAGTTTATGAAAACCAGGGGGGAATGGTACTGAAAACCAGATAA

Protein sequence

MSVPMETCVLHCCYDPFRLRSKHPSSSPHPFDLNKGSSCLSSRIPLPEPYTKLLLSLPTRAVPTHSPTKHATLLVDTFHEHHTLKALLSDLQKRDSCPLLLLRHHGDWNRDHFWLVVKFLRNASRSHEILKLFDTWKSIEGSRISESNYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDEMKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGLLKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRTLALVYIQNHMYSRIETLGIDLHIKAGKTDLVWCLLLLSHACLSSRRGMDSVVQEMDKAKEIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTLEAWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELIDLVYENQGGMVLKTR
BLAST of Cp4.1LG17g10180 vs. Swiss-Prot
Match: PP310_ARATH (Pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Arabidopsis thaliana GN=At4g14190 PE=2 SV=2)

HSP 1 Score: 461.5 bits (1186), Expect = 1.2e-128
Identity = 242/485 (49.90%), Postives = 330/485 (68.04%), Query Frame = 1

Query: 28  PHPFDLNKGSSCLSSRIPLPEPYTKLLLSLPTRAVPTHSPTKHATLLVDTFHEHHTLKAL 87
           P P++LN  S   S+   +P P + L  SLP       S    AT      H H  L +L
Sbjct: 21  PPPWNLNS-SFLTSTSYSIPRP-SSLRRSLPL------SINGDATQPTSLLHHHRFLSSL 80

Query: 88  LSDLQKRDSCPLLLLRHHGDWNRDHFWLVVKFLRNASRSHEILKLFDTWKSIEGSRISES 147
              L    SCPL LL+  GDW++DHFW V++FLR +SR HEIL +FDTWK++E SRISE+
Sbjct: 81  TRRLSLSGSCPLRLLQEDGDWSKDHFWAVIRFLRQSSRLHEILPVFDTWKNLEPSRISEN 140

Query: 148 NYEKVIVLLSQDGLMEDAVSAFQDM-KSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDE 207
           NYE++I  L ++  M +A+ AF+ M     L PSL  YN++IH +A  GKFE AM +++ 
Sbjct: 141 NYERIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNH 200

Query: 208 MKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGL 267
           MKE  +   T+TYDGLIEAYGK++MYDE++ CLK+ME DGC  DH+TYNLLIREFS+GGL
Sbjct: 201 MKENGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGL 260

Query: 268 LKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRT 327
           LK+ME +Y+S++S++M L+ STL++MLEAYA+FG+++KME    +I+    ++ E L+R 
Sbjct: 261 LKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRK 320

Query: 328 LALVYIQNHMYSRIETLGIDLHI-KAGKTDLVWCLLLLSHACLSSRRGMDSVVQEMDKAK 387
           LA VYI+N M+SR++ LG  +   +  +T+L WCL LL HA L SR+G+D VV+EM++A+
Sbjct: 321 LANVYIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEAR 380

Query: 388 EIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTL 447
             WN T ANI LLAY KM DF  +  L SE+R +HVK DLVT+GI+ D +   FDGT   
Sbjct: 381 VPWNTTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDGTGVF 440

Query: 448 EAWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEA-YASLELVGRESKVWTYEELIDL 507
             W+++  L + VE+ TD LV AAFGKG+FL+ CEE    SL     ESK WTY+ L++L
Sbjct: 441 MTWKKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESKSWTYQYLMEL 497

Query: 508 VYENQ 510
           V +NQ
Sbjct: 501 VVKNQ 497

BLAST of Cp4.1LG17g10180 vs. Swiss-Prot
Match: PP362_ARATH (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 107.5 bits (267), Expect = 4.6e-22
Identity = 67/295 (22.71%), Postives = 134/295 (45.42%), Query Frame = 1

Query: 139 IEGSRISESNYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFE 198
           + G   S   Y  +I   ++DG++++A+     M   G +P + TY TL+ GF   GK E
Sbjct: 342 LNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVE 401

Query: 199 IAMLFIDEMKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLI 258
            AM   +EM+         T++  I+ YG    + EM++   ++ + G  PD +T+N L+
Sbjct: 402 SAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPDIVTWNTLL 461

Query: 259 REFSKGGLLKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTN 318
             F + G+  ++ G+++ +       +  T   ++ AY++ G  ++    YRR+L++   
Sbjct: 462 AVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVT 521

Query: 319 MKEDLIRTLALVYIQNHMYSRIETLGIDLHI-KAGKTDLVWCLLLLSHACLSSRRGMDSV 378
                  T+     +  M+ + E +  ++   +    +L +C LL ++A       M S+
Sbjct: 522 PDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYANGKEIGLMHSL 581

Query: 379 VQEMDKAKEIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILL 433
            +E+          +   L+L   K          FSE++ R   PD+ T+  ++
Sbjct: 582 AEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGFSPDITTLNSMV 636

BLAST of Cp4.1LG17g10180 vs. Swiss-Prot
Match: PPR38_ARATH (Putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial OS=Arabidopsis thaliana GN=At1g12700 PE=3 SV=1)

HSP 1 Score: 103.2 bits (256), Expect = 8.6e-21
Identity = 85/359 (23.68%), Postives = 156/359 (43.45%), Query Frame = 1

Query: 147 SNYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDE 206
           + +  +I  L  +G + +AV     M   G +P + TYN++++G    G   +A+  + +
Sbjct: 159 TTFNTLIKGLFLEGKVSEAVVLVDRMVENGCQPDVVTYNSIVNGICRSGDTSLALDLLRK 218

Query: 207 MKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGL 266
           M+E N+  +  TY  +I++  +    D  I   K+ME  G     +TYN L+R   K G 
Sbjct: 219 MEERNVKADVFTYSTIIDSLCRDGCIDAAISLFKEMETKGIKSSVVTYNSLVRGLCKAGK 278

Query: 267 LKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRT 326
                 L + ++S+ +     T   +L+ + K G L +    Y+ ++    +       T
Sbjct: 279 WNDGALLLKDMVSREIVPNVITFNVLLDVFVKEGKLQEANELYKEMITRGISPNIITYNT 338

Query: 327 LALVYIQNHMYSRIETLGIDLHIK-AGKTDLVWCLLLLSHACLSSR--RGMDSVVQEMDK 386
           L   Y   +  S    + +DL ++     D+V    L+   C+  R   GM  V + + K
Sbjct: 339 LMDGYCMQNRLSEANNM-LDLMVRNKCSPDIVTFTSLIKGYCMVKRVDDGM-KVFRNISK 398

Query: 387 AKEIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTR 446
              + N    +IL+  + +    K    LF E+ +  V PD++T GILLD    +    +
Sbjct: 399 RGLVANAVTYSILVQGFCQSGKIKLAEELFQEMVSHGVLPDVMTYGILLDGLCDNGKLEK 458

Query: 447 TLEAWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELI 503
            LE +  +      + I   + ++    KG  ++D    + SL   G +  V TY  +I
Sbjct: 459 ALEIFEDLQKSKMDLGIVMYTTIIEGMCKGGKVEDAWNLFCSLPCKGVKPNVMTYTVMI 515

BLAST of Cp4.1LG17g10180 vs. Swiss-Prot
Match: PP247_ARATH (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana GN=At3g22470 PE=2 SV=1)

HSP 1 Score: 99.4 bits (246), Expect = 1.2e-19
Identity = 86/379 (22.69%), Postives = 164/379 (43.27%), Query Frame = 1

Query: 134 DTWKSIEGSRISES--NYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGF 193
           D ++ +E   I  S   Y  VI  L +DG  +DA+S F +M+  G++  + TY++LI G 
Sbjct: 231 DLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGL 290

Query: 194 AARGKFEIAMLFIDEMKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDH 253
              GK++     + EM   N+  +  T+  LI+ + K     E  E   +M   G  PD 
Sbjct: 291 CNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDT 350

Query: 254 ITYNLLIREFSKGGLLKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRR 313
           ITYN LI  F K   L +   ++  ++SK  +    T   ++ +Y K   +D     +R 
Sbjct: 351 ITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFRE 410

Query: 314 ILNSKTNMKEDLIRTLALVYIQNHMYSRIETLGIDLHIKAGKTDLVWCLLLLSHACLSSR 373
           I +           TL L + Q+   +  + L  ++  +     +V   +LL   C +  
Sbjct: 411 ISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGE 470

Query: 374 RGMD-SVVQEMDKAKEIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGI 433
                 + ++M K++    + + NI++             +LF  +  + VKPD+VT  +
Sbjct: 471 LNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNV 530

Query: 434 LLDANNKSFDGTRTLEAWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEAYASLELVG 493
           ++    K    +     +R+M       +  T ++++ A   G  L    E    +++ G
Sbjct: 531 MIGGLCKKGSLSEADMLFRKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIEEMKVCG 590

Query: 494 RESKVWTYEELIDLVYENQ 510
             +   T + +ID++ + +
Sbjct: 591 FSADSSTIKMVIDMLSDRR 609

BLAST of Cp4.1LG17g10180 vs. Swiss-Prot
Match: PPR96_ARATH (Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidopsis thaliana GN=At1g62930 PE=2 SV=2)

HSP 1 Score: 96.3 bits (238), Expect = 1.1e-18
Identity = 77/344 (22.38%), Postives = 157/344 (45.64%), Query Frame = 1

Query: 164 DAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDEMKEINMTRETDTYDGLI 223
           +AV+    M + G +P L TY T+++G   RG  ++A+  + +M++  +  +   Y  +I
Sbjct: 203 EAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDIDLALSLLKKMEKGKIEADVVIYTTII 262

Query: 224 EAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGLLKKMEGLYRSILSKRMD 283
           +A   Y+  ++ +    +M+  G  P+ +TYN LIR     G       L   ++ ++++
Sbjct: 263 DALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLIRCLCNYGRWSDASRLLSDMIERKIN 322

Query: 284 LQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRTLALVYIQNHMYSRIETL 343
               T  A+++A+ K G L + E  Y  ++  K ++  D+    +L+     M+ R++  
Sbjct: 323 PNVVTFSALIDAFVKEGKLVEAEKLYDEMI--KRSIDPDIFTYSSLIN-GFCMHDRLDEA 382

Query: 344 GIDLHIKAGKT---DLVWCLLLLSHACLSSR--RGMDSVVQEMDKAKEIWNVTVANILLL 403
                +   K    ++V    L+   C + R   GM+ + +EM +   + N    N L+ 
Sbjct: 383 KHMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEGME-LFREMSQRGLVGNTVTYNTLIQ 442

Query: 404 AYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTLEAWRRMNMLSRAV 463
              +  D    + +F ++ +  V PD++T  ILLD   K     + L  +  +       
Sbjct: 443 GLFQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEP 502

Query: 464 EINTDSLVLAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELI 503
           +I T ++++    K   ++D  + + SL L G +  V  Y  +I
Sbjct: 503 DIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVIIYTTMI 542

BLAST of Cp4.1LG17g10180 vs. TrEMBL
Match: A0A067ECN4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g010459mg PE=4 SV=1)

HSP 1 Score: 583.6 bits (1503), Expect = 2.4e-163
Identity = 275/443 (62.08%), Postives = 356/443 (80.36%), Query Frame = 1

Query: 68  TKHATLLVDTFHEHHTLKALLSDLQKRDSCPLLLLRHHGDWNRDHFWLVVKFLRNASRSH 127
           TKH TLLV+++HEH  L AL+  L K+ SCPL +L+H GDW +DHFW V++FL+N+SRS 
Sbjct: 61  TKHTTLLVESYHEHQALNALIQRLNKKVSCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120

Query: 128 EILKLFDTWKSIEGSRISESNYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTL 187
           +I ++FD WK+IE SRI+E N +K+I +L ++GLME+AV AFQ+M+   L+PSL  YN++
Sbjct: 121 QIPQVFDMWKNIEKSRINEFNSQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180

Query: 188 IHGFAARGKFEIAMLFIDEMKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGC 247
           IHG++  GKF  A+LF++EMKE+N++ ++DTYDGLI+AYGKY+MYDE+  CLK M+LDGC
Sbjct: 181 IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAYGKYKMYDEIDMCLKMMKLDGC 240

Query: 248 FPDHITYNLLIREFSKGGLLKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEM 307
            PDHITYNLLI+EF+  GLLK+MEG Y+S+L+KRM L+SST+VA+L+AY  FG+LDKME 
Sbjct: 241 SPDHITYNLLIQEFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 300

Query: 308 FYRRILNSKTNMKEDLIRTLALVYIQNHMYSRIETLGIDLHIKAGKTDLVWCLLLLSHAC 367
           FY+R+LNS+T +KEDL+R LA VYI+N+M+SR++ LG DL  + G+T+LVWCL LLSHAC
Sbjct: 301 FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDLASRIGRTELVWCLRLLSHAC 360

Query: 368 LSSRRGMDSVVQEMDKAKEIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVT 427
           L S RG+DSVV+EM+ AK  WNVT ANI+LLAYLKMKDFK LR L SE+  RHVKPD+VT
Sbjct: 361 LLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIVT 420

Query: 428 IGILLDANNKSFDGTRTLEAWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEAYASLE 487
           IGIL DA    FDGT  LE W+R+  L + VEINTD LVLA +GKG FL+ CEE Y+SLE
Sbjct: 421 IGILYDARRIGFDGTGALEMWKRIGFLFKTVEINTDPLVLAVYGKGHFLRYCEEVYSSLE 480

Query: 488 LVGRESKVWTYEELIDLVYENQG 511
              RE K WTY+ LIDLV ++ G
Sbjct: 481 PYSREKKRWTYQNLIDLVIKHNG 503

BLAST of Cp4.1LG17g10180 vs. TrEMBL
Match: V4SBF8_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004784mg PE=4 SV=1)

HSP 1 Score: 582.4 bits (1500), Expect = 5.4e-163
Identity = 282/469 (60.13%), Postives = 369/469 (78.68%), Query Frame = 1

Query: 42  SRIPLPEPYTKLLLSLPTRAVPTHSPTKHATLLVDTFHEHHTLKALLSDLQKRDSCPLLL 101
           S+I + +P +   LS    ++  HS TKH TLLV+++HEH  L AL+  L K+ SCPL +
Sbjct: 37  SKILIRKPISCCCLS-SAPSLDYHS-TKHTTLLVESYHEHQALNALIQRLNKKVSCPLQI 96

Query: 102 LRHHGDWNRDHFWLVVKFLRNASRSHEILKLFDTWKSIEGSRISESNYEKVIVLLSQDGL 161
           L+H GDW +DHFW V++FL+N+SRS +I ++FD WK+IE SRI+E N +K+I +L ++GL
Sbjct: 97  LQHDGDWTKDHFWAVIRFLKNSSRSRQIPQVFDMWKNIEKSRINEFNSQKIIGMLCEEGL 156

Query: 162 MEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDEMKEINMTRETDTYDG 221
           ME+AV AFQ+M+   L+PSL  YN++IHG++  GKF  A+LF++EMKE+N++ ++DTYDG
Sbjct: 157 MEEAVRAFQEMEGFALKPSLEIYNSIIHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDG 216

Query: 222 LIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGLLKKMEGLYRSILSKR 281
           LI+AYGKY+MYDE+  CLK M+LDGC PDHITYNLLI+EF+  GLLK+MEG Y+S+L+KR
Sbjct: 217 LIQAYGKYKMYDEIDMCLKMMKLDGCSPDHITYNLLIQEFACAGLLKRMEGTYKSMLTKR 276

Query: 282 MDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRTLALVYIQNHMYSRIE 341
           M L+SST+VA+L+AY  FG+LDKME FY+R+LNS+T +KEDL+R LA VYI+N+M+SR++
Sbjct: 277 MHLRSSTMVAILDAYMNFGMLDKMEKFYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLD 336

Query: 342 TLGIDLHIKAGKTDLVWCLLLLSHACLSSRRGMDSVVQEMDKAKEIWNVTVANILLLAYL 401
            LG DL  + G+T+LVWCL LLSHACL S RG+DSVV+EM+ AK  WNVT ANI+LLAYL
Sbjct: 337 DLGDDLASRIGRTELVWCLRLLSHACLLSHRGIDSVVREMESAKVRWNVTTANIILLAYL 396

Query: 402 KMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTLEAWRRMNMLSRAVEIN 461
           KMKDFK LR L SE+  RHVKPD+VTIGIL DA    FDGT  LE W+R+  L + VEIN
Sbjct: 397 KMKDFKHLRVLLSELPTRHVKPDIVTIGILYDARRIGFDGTGALEMWKRIGFLFKTVEIN 456

Query: 462 TDSLVLAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELIDLVYENQG 511
           TD LVLA +GKG FL+ CEE Y+SLE   RE K WTY+ LIDLV ++ G
Sbjct: 457 TDPLVLAVYGKGHFLRYCEEVYSSLEPYSREKKRWTYQNLIDLVIKHNG 503

BLAST of Cp4.1LG17g10180 vs. TrEMBL
Match: W9S4F4_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_021763 PE=4 SV=1)

HSP 1 Score: 566.6 bits (1459), Expect = 3.1e-158
Identity = 274/464 (59.05%), Postives = 362/464 (78.02%), Query Frame = 1

Query: 47  PEPYTKLLLSLPTRAVPTHSPTKHATLLVDTFHEHHTLKALLSDLQKRDSCPLLLLRHHG 106
           P+ ++ L LS+ +     +S T+H TLLV+TFHEH   K LL  L K DSCP+ LLR  G
Sbjct: 40  PKLFSSLRLSVGSSLSGQNSSTEHTTLLVETFHEHRKFKTLLKRLSKNDSCPMRLLREDG 99

Query: 107 DWNRDHFWLVVKFLRNASRSHEILKLFDTWKSIEGSRISESNYEKVIVLLSQDGLMEDAV 166
           DW ++HFW VV+FLR+ SR+ EI+++FD WK+IE SRI+E NY K+I +L ++GLME+AV
Sbjct: 100 DWCKEHFWAVVRFLRHGSRTKEIVQVFDLWKNIEKSRINELNYCKIIKMLGEEGLMEEAV 159

Query: 167 SAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDEMKEINMTRETDTYDGLIEAY 226
            +F++MKS GL P+L  YN++IHGF+ +G F+ A+++++EM+E N+  ETDTY+GLIEAY
Sbjct: 160 LSFEEMKSCGLSPTLEVYNSMIHGFSQKGDFDDALVYLNEMREQNVVPETDTYEGLIEAY 219

Query: 227 GKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGLLKKMEGLYRSILSKRMDLQS 286
            KY MYDE+  CLK+M+L+GC PDHITYNLL+R+FSKGGLLK+ME +Y +++SKRM LQS
Sbjct: 220 AKYEMYDEIGLCLKKMKLNGCPPDHITYNLLMRKFSKGGLLKRMESVYHTMISKRMYLQS 279

Query: 287 STLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRTLALVYIQNHMYSRIETLGID 346
           STLVAMLE YA+FGILDKME FY R L +KT + EDLIR LA VYI N+++SR+ETLG+D
Sbjct: 280 STLVAMLETYARFGILDKMEKFYMRTLKTKTPLGEDLIRKLAEVYIDNYLFSRLETLGVD 339

Query: 347 LHIKAGKTDLVWCLLLLSHACLSSRRGMDSVVQEMDKAKEIWNVTVANILLLAYLKMKDF 406
           L    G+TDL+WCL LLSHA L SR+GMD V+QEM++A   WNVT ANI+LL +LKMKDF
Sbjct: 340 LSTTFGETDLLWCLRLLSHAFLFSRKGMDFVIQEMERAHIPWNVTFANIILLTHLKMKDF 399

Query: 407 KRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTLEAWRRMNMLSRAVEINTDSLV 466
             LR   S++    V+PD+VT+GIL DA    FDGTRTLE W+RM+   +AVE+NTD +V
Sbjct: 400 THLRISLSQL-THSVEPDIVTVGILFDAIGMGFDGTRTLETWKRMDFFYKAVEMNTDPVV 459

Query: 467 LAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELIDLVYENQG 511
           + AFGKG FL++CE AY+SLE   RE+K WTY  L+DLV++++G
Sbjct: 460 ITAFGKGNFLQNCERAYSSLESEVRETKSWTYNNLVDLVFKHKG 502

BLAST of Cp4.1LG17g10180 vs. TrEMBL
Match: A0A061ES07_THECC (Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_020186 PE=4 SV=1)

HSP 1 Score: 564.7 bits (1454), Expect = 1.2e-157
Identity = 278/453 (61.37%), Postives = 350/453 (77.26%), Query Frame = 1

Query: 58  PTRAVPTHSPTK-HATLLVDTFHEHHTLKALLSDLQKRDSCPLLLLRHHGDWNRDHFWLV 117
           P+   P  S  K H  LLV+T+H H  LKALL  L+K DSCPL +LR  GDW +D FW+V
Sbjct: 49  PSPPRPDGSSCKNHTALLVETYHHHRRLKALLERLEKDDSCPLQMLRDDGDWTKDIFWVV 108

Query: 118 VKFLRNASRSHEILKLFDTWKSIEGSRISESNYEKVIVLLSQDGLMEDAVSAFQDMKSLG 177
           ++FLR ASRS+EIL++F  WK+IE SRI+E NYEK+I LL ++G +  AV A ++M   G
Sbjct: 109 IRFLRRASRSNEILQVFHMWKNIEKSRINELNYEKIIGLLGEEGRVGQAVQALREMGGYG 168

Query: 178 LRPSLGTYNTLIHGFAARGKFEIAMLFIDEMKEINMTRETDTYDGLIEAYGKYRMYDEMI 237
           L+PSL  YN++IH +A  GKF+ A+ F++EMKEI +  ETDTYDGLIEAYGKY+MYDE+ 
Sbjct: 169 LKPSLEVYNSIIHAYARNGKFDDALSFLNEMKEIGLAPETDTYDGLIEAYGKYKMYDEIG 228

Query: 238 ECLKQMELDGCFPDHITYNLLIREFSKGGLLKKMEGLYRSILSKRMDLQSSTLVAMLEAY 297
            CLK MELD C PDH TYNLLIREFS+GGLL++ME +Y+ +LSK+M+LQSS+LVAMLEAY
Sbjct: 229 TCLKMMELDRCRPDHFTYNLLIREFSRGGLLQRMEQVYQILLSKQMNLQSSSLVAMLEAY 288

Query: 298 AKFGILDKMEMFYRRILNSKTNMKEDLIRTLALVYIQNHMYSRIETLGIDLHIKAGKTDL 357
           A FGILDKME  YR+++NS T +KED IR LA VYI+N+M+SR++ LGIDL  + G+ DL
Sbjct: 289 ANFGILDKMEKVYRKVVNSMT-LKEDTIRILASVYIKNYMFSRLDDLGIDLSSRTGRNDL 348

Query: 358 VWCLLLLSHACLSSRRGMDSVVQEMDKAKEIWNVTVANILLLAYLKMKDFKRLRTLFSEI 417
           VWCL LLSHACL SR+GMDSV+ EM +AK  WNVT++NI+LLAY+KMKDFKRLR L S++
Sbjct: 349 VWCLRLLSHACLLSRKGMDSVILEMCEAKASWNVTISNIILLAYMKMKDFKRLRILLSQL 408

Query: 418 RARHVKPDLVTIGILLDANNKSFDGTRTLEAWRRMNMLSRAVEINTDSLVLAAFGKGRFL 477
            +  V+PD++TIGIL DA    FDG   LE WR+M +L R VE+NTD LVL AFGKG FL
Sbjct: 409 PSHQVRPDIITIGILSDAIEIGFDGAEALETWRKMGLLYRTVEMNTDPLVLIAFGKGHFL 468

Query: 478 KDCEEAYASLELVGRESKVWTYEELIDLVYENQ 510
           +DCEE Y SLE   R+ K WTY  LIDLV +++
Sbjct: 469 RDCEEIYTSLEPKARKEKRWTYHHLIDLVIKHK 500

BLAST of Cp4.1LG17g10180 vs. TrEMBL
Match: A0A0D2NX12_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_006G225200 PE=4 SV=1)

HSP 1 Score: 560.1 bits (1442), Expect = 2.9e-156
Identity = 278/471 (59.02%), Postives = 356/471 (75.58%), Query Frame = 1

Query: 39  CLSSRIPLPEPYTKLLLSLPTRAVPTHSPTKHATLLVDTFHEHHTLKALLSDLQKRDSCP 98
           CLSS    P        S P R     S   H TLLV+T+H H  L+AL+  L+K  SCP
Sbjct: 44  CLSSLSSSP--------STPRRPPDGCSSKTHTTLLVETYHHHRRLRALIEKLEKEGSCP 103

Query: 99  LLLLRHHGDWNRDHFWLVVKFLRNASRSHEILKLFDTWKSIEGSRISESNYEKVIVLLSQ 158
           + +L   GDW ++ FW  VKFLR+A RS+EIL++F  WK+IE SRI+E NYEK+I L  +
Sbjct: 104 MQILGDDGDWTKNDFWAAVKFLRHAFRSNEILQVFRMWKNIEKSRINELNYEKIIGLFCE 163

Query: 159 DGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDEMKEINMTRETDT 218
           + ++E+AV A Q+M+  GLRPSL  YN++IH +A  GKF  A  F++EMKEI +  ETDT
Sbjct: 164 ERMVEEAVEALQEMEGYGLRPSLEIYNSIIHAYAKNGKFNDASFFLNEMKEIGLEPETDT 223

Query: 219 YDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGLLKKMEGLYRSIL 278
           YDGLIEAYGKY+ YD++  CLK MELDGC PDH TYNLLIREFS+GGLL+KME +YR ++
Sbjct: 224 YDGLIEAYGKYKRYDDIGACLKTMELDGCSPDHFTYNLLIREFSRGGLLQKMEQVYRVMI 283

Query: 279 SKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRTLALVYIQNHMYS 338
           SK+M+LQ S+LVAMLE+YA FGILDKME  YR+++NS +++KED +R LA VYI+N+M+S
Sbjct: 284 SKKMNLQPSSLVAMLESYANFGILDKMEKVYRKVVNS-SSLKEDTVRKLANVYIKNYMFS 343

Query: 339 RIETLGIDLHIKAGKTDLVWCLLLLSHACLSSRRGMDSVVQEMDKAKEIWNVTVANILLL 398
           R++ LGIDL  + G+ DLVWCL LLSHACL SR+G+DSV+QEMD+AK +WNVT+ NI+LL
Sbjct: 344 RLDDLGIDLSSRTGRNDLVWCLRLLSHACLLSRKGIDSVIQEMDEAKALWNVTIVNIILL 403

Query: 399 AYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTLEAWRRMNMLSRAV 458
           AYLKMKDF  LR+L S++ +R V+PDL T+GIL DA    FDG +TLE WR+M +L RAV
Sbjct: 404 AYLKMKDFTHLRSLLSQLPSRQVRPDLTTVGILFDAIEIGFDGAKTLETWRKM-VLYRAV 463

Query: 459 EINTDSLVLAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELIDLVYENQ 510
           E+NTD LVL AFGKG FL+DCEEAY SLE   R+ K  TY +LIDLV +++
Sbjct: 464 ELNTDPLVLTAFGKGHFLRDCEEAYTSLEPKARDKKTRTYHQLIDLVIKHK 504

BLAST of Cp4.1LG17g10180 vs. TAIR10
Match: AT4G14190.1 (AT4G14190.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 461.5 bits (1186), Expect = 7.0e-130
Identity = 242/485 (49.90%), Postives = 330/485 (68.04%), Query Frame = 1

Query: 28  PHPFDLNKGSSCLSSRIPLPEPYTKLLLSLPTRAVPTHSPTKHATLLVDTFHEHHTLKAL 87
           P P++LN  S   S+   +P P + L  SLP       S    AT      H H  L +L
Sbjct: 21  PPPWNLNS-SFLTSTSYSIPRP-SSLRRSLPL------SINGDATQPTSLLHHHRFLSSL 80

Query: 88  LSDLQKRDSCPLLLLRHHGDWNRDHFWLVVKFLRNASRSHEILKLFDTWKSIEGSRISES 147
              L    SCPL LL+  GDW++DHFW V++FLR +SR HEIL +FDTWK++E SRISE+
Sbjct: 81  TRRLSLSGSCPLRLLQEDGDWSKDHFWAVIRFLRQSSRLHEILPVFDTWKNLEPSRISEN 140

Query: 148 NYEKVIVLLSQDGLMEDAVSAFQDM-KSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDE 207
           NYE++I  L ++  M +A+ AF+ M     L PSL  YN++IH +A  GKFE AM +++ 
Sbjct: 141 NYERIIRFLCEEKSMSEAIRAFRSMIDDHELSPSLEIYNSIIHSYADDGKFEEAMFYLNH 200

Query: 208 MKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGL 267
           MKE  +   T+TYDGLIEAYGK++MYDE++ CLK+ME DGC  DH+TYNLLIREFS+GGL
Sbjct: 201 MKENGLLPITETYDGLIEAYGKWKMYDEIVLCLKRMESDGCVRDHVTYNLLIREFSRGGL 260

Query: 268 LKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRT 327
           LK+ME +Y+S++S++M L+ STL++MLEAYA+FG+++KME    +I+    ++ E L+R 
Sbjct: 261 LKRMEQMYQSLMSRKMTLEPSTLLSMLEAYAEFGLIEKMEETCNKIIRFGISLDEGLVRK 320

Query: 328 LALVYIQNHMYSRIETLGIDLHI-KAGKTDLVWCLLLLSHACLSSRRGMDSVVQEMDKAK 387
           LA VYI+N M+SR++ LG  +   +  +T+L WCL LL HA L SR+G+D VV+EM++A+
Sbjct: 321 LANVYIENLMFSRLDDLGRGISASRTRRTELAWCLRLLCHARLVSRKGLDYVVKEMEEAR 380

Query: 388 EIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTL 447
             WN T ANI LLAY KM DF  +  L SE+R +HVK DLVT+GI+ D +   FDGT   
Sbjct: 381 VPWNTTFANIALLAYSKMGDFTSIELLLSELRIKHVKLDLVTVGIVFDLSEARFDGTGVF 440

Query: 448 EAWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEA-YASLELVGRESKVWTYEELIDL 507
             W+++  L + VE+ TD LV AAFGKG+FL+ CEE    SL     ESK WTY+ L++L
Sbjct: 441 MTWKKIGFLDKPVEMKTDPLVHAAFGKGQFLRSCEEVKNQSLGTRDGESKSWTYQYLMEL 497

Query: 508 VYENQ 510
           V +NQ
Sbjct: 501 VVKNQ 497

BLAST of Cp4.1LG17g10180 vs. TAIR10
Match: AT5G02860.1 (AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 107.5 bits (267), Expect = 2.6e-23
Identity = 67/295 (22.71%), Postives = 134/295 (45.42%), Query Frame = 1

Query: 139 IEGSRISESNYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFE 198
           + G   S   Y  +I   ++DG++++A+     M   G +P + TY TL+ GF   GK E
Sbjct: 342 LNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVE 401

Query: 199 IAMLFIDEMKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLI 258
            AM   +EM+         T++  I+ YG    + EM++   ++ + G  PD +T+N L+
Sbjct: 402 SAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPDIVTWNTLL 461

Query: 259 REFSKGGLLKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTN 318
             F + G+  ++ G+++ +       +  T   ++ AY++ G  ++    YRR+L++   
Sbjct: 462 AVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVT 521

Query: 319 MKEDLIRTLALVYIQNHMYSRIETLGIDLHI-KAGKTDLVWCLLLLSHACLSSRRGMDSV 378
                  T+     +  M+ + E +  ++   +    +L +C LL ++A       M S+
Sbjct: 522 PDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYANGKEIGLMHSL 581

Query: 379 VQEMDKAKEIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILL 433
            +E+          +   L+L   K          FSE++ R   PD+ T+  ++
Sbjct: 582 AEEVYSGVIEPRAVLLKTLVLVCSKCDLLPEAERAFSELKERGFSPDITTLNSMV 636

BLAST of Cp4.1LG17g10180 vs. TAIR10
Match: AT1G12700.1 (AT1G12700.1 ATP binding;nucleic acid binding;helicases)

HSP 1 Score: 103.2 bits (256), Expect = 4.9e-22
Identity = 85/359 (23.68%), Postives = 156/359 (43.45%), Query Frame = 1

Query: 147 SNYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDE 206
           + +  +I  L  +G + +AV     M   G +P + TYN++++G    G   +A+  + +
Sbjct: 159 TTFNTLIKGLFLEGKVSEAVVLVDRMVENGCQPDVVTYNSIVNGICRSGDTSLALDLLRK 218

Query: 207 MKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGL 266
           M+E N+  +  TY  +I++  +    D  I   K+ME  G     +TYN L+R   K G 
Sbjct: 219 MEERNVKADVFTYSTIIDSLCRDGCIDAAISLFKEMETKGIKSSVVTYNSLVRGLCKAGK 278

Query: 267 LKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRT 326
                 L + ++S+ +     T   +L+ + K G L +    Y+ ++    +       T
Sbjct: 279 WNDGALLLKDMVSREIVPNVITFNVLLDVFVKEGKLQEANELYKEMITRGISPNIITYNT 338

Query: 327 LALVYIQNHMYSRIETLGIDLHIK-AGKTDLVWCLLLLSHACLSSR--RGMDSVVQEMDK 386
           L   Y   +  S    + +DL ++     D+V    L+   C+  R   GM  V + + K
Sbjct: 339 LMDGYCMQNRLSEANNM-LDLMVRNKCSPDIVTFTSLIKGYCMVKRVDDGM-KVFRNISK 398

Query: 387 AKEIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTR 446
              + N    +IL+  + +    K    LF E+ +  V PD++T GILLD    +    +
Sbjct: 399 RGLVANAVTYSILVQGFCQSGKIKLAEELFQEMVSHGVLPDVMTYGILLDGLCDNGKLEK 458

Query: 447 TLEAWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELI 503
            LE +  +      + I   + ++    KG  ++D    + SL   G +  V TY  +I
Sbjct: 459 ALEIFEDLQKSKMDLGIVMYTTIIEGMCKGGKVEDAWNLFCSLPCKGVKPNVMTYTVMI 515

BLAST of Cp4.1LG17g10180 vs. TAIR10
Match: AT3G22470.1 (AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 99.4 bits (246), Expect = 7.0e-21
Identity = 86/379 (22.69%), Postives = 164/379 (43.27%), Query Frame = 1

Query: 134 DTWKSIEGSRISES--NYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGF 193
           D ++ +E   I  S   Y  VI  L +DG  +DA+S F +M+  G++  + TY++LI G 
Sbjct: 231 DLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGL 290

Query: 194 AARGKFEIAMLFIDEMKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDH 253
              GK++     + EM   N+  +  T+  LI+ + K     E  E   +M   G  PD 
Sbjct: 291 CNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDT 350

Query: 254 ITYNLLIREFSKGGLLKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRR 313
           ITYN LI  F K   L +   ++  ++SK  +    T   ++ +Y K   +D     +R 
Sbjct: 351 ITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFRE 410

Query: 314 ILNSKTNMKEDLIRTLALVYIQNHMYSRIETLGIDLHIKAGKTDLVWCLLLLSHACLSSR 373
           I +           TL L + Q+   +  + L  ++  +     +V   +LL   C +  
Sbjct: 411 ISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGE 470

Query: 374 RGMD-SVVQEMDKAKEIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGI 433
                 + ++M K++    + + NI++             +LF  +  + VKPD+VT  +
Sbjct: 471 LNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNV 530

Query: 434 LLDANNKSFDGTRTLEAWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEAYASLELVG 493
           ++    K    +     +R+M       +  T ++++ A   G  L    E    +++ G
Sbjct: 531 MIGGLCKKGSLSEADMLFRKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIEEMKVCG 590

Query: 494 RESKVWTYEELIDLVYENQ 510
             +   T + +ID++ + +
Sbjct: 591 FSADSSTIKMVIDMLSDRR 609

BLAST of Cp4.1LG17g10180 vs. TAIR10
Match: AT1G62930.1 (AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 96.3 bits (238), Expect = 5.9e-20
Identity = 77/344 (22.38%), Postives = 157/344 (45.64%), Query Frame = 1

Query: 164 DAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDEMKEINMTRETDTYDGLI 223
           +AV+    M + G +P L TY T+++G   RG  ++A+  + +M++  +  +   Y  +I
Sbjct: 203 EAVALIDRMVARGCQPDLFTYGTVVNGLCKRGDIDLALSLLKKMEKGKIEADVVIYTTII 262

Query: 224 EAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGLLKKMEGLYRSILSKRMD 283
           +A   Y+  ++ +    +M+  G  P+ +TYN LIR     G       L   ++ ++++
Sbjct: 263 DALCNYKNVNDALNLFTEMDNKGIRPNVVTYNSLIRCLCNYGRWSDASRLLSDMIERKIN 322

Query: 284 LQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRTLALVYIQNHMYSRIETL 343
               T  A+++A+ K G L + E  Y  ++  K ++  D+    +L+     M+ R++  
Sbjct: 323 PNVVTFSALIDAFVKEGKLVEAEKLYDEMI--KRSIDPDIFTYSSLIN-GFCMHDRLDEA 382

Query: 344 GIDLHIKAGKT---DLVWCLLLLSHACLSSR--RGMDSVVQEMDKAKEIWNVTVANILLL 403
                +   K    ++V    L+   C + R   GM+ + +EM +   + N    N L+ 
Sbjct: 383 KHMFELMISKDCFPNVVTYNTLIKGFCKAKRVEEGME-LFREMSQRGLVGNTVTYNTLIQ 442

Query: 404 AYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTLEAWRRMNMLSRAV 463
              +  D    + +F ++ +  V PD++T  ILLD   K     + L  +  +       
Sbjct: 443 GLFQAGDCDMAQKIFKKMVSDGVPPDIITYSILLDGLCKYGKLEKALVVFEYLQKSKMEP 502

Query: 464 EINTDSLVLAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELI 503
           +I T ++++    K   ++D  + + SL L G +  V  Y  +I
Sbjct: 503 DIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPNVIIYTTMI 542

BLAST of Cp4.1LG17g10180 vs. NCBI nr
Match: gi|985470191|ref|XP_015380926.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic isoform X1 [Citrus sinensis])

HSP 1 Score: 590.5 bits (1521), Expect = 2.8e-165
Identity = 278/443 (62.75%), Postives = 358/443 (80.81%), Query Frame = 1

Query: 68  TKHATLLVDTFHEHHTLKALLSDLQKRDSCPLLLLRHHGDWNRDHFWLVVKFLRNASRSH 127
           TKH TLLV+++HEH  L AL+  L K+ SCPL +L+H GDW +DHFW V++FL+N+SRS 
Sbjct: 61  TKHTTLLVESYHEHQALNALIQRLNKKVSCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120

Query: 128 EILKLFDTWKSIEGSRISESNYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTL 187
           +I ++FD WK+IE SRI+E NY+K+I +L ++GLME+AV AFQ+M+   L+PSL  YN++
Sbjct: 121 QIPQVFDMWKNIEKSRINEFNYQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180

Query: 188 IHGFAARGKFEIAMLFIDEMKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGC 247
           IHG++  GKF  A+LF++EMKE+N++ ++DTYDGLI+AYGKY+MYDE+  CLK M+LDGC
Sbjct: 181 IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAYGKYKMYDEIDMCLKMMKLDGC 240

Query: 248 FPDHITYNLLIREFSKGGLLKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEM 307
            PDHITYNLLI+EF+  GLLK+MEG Y+S+L+KRM L+SST+VA+L+AY  FG+LDKME 
Sbjct: 241 SPDHITYNLLIQEFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 300

Query: 308 FYRRILNSKTNMKEDLIRTLALVYIQNHMYSRIETLGIDLHIKAGKTDLVWCLLLLSHAC 367
           FY+R+LNS+T +KEDL+R LA VYI+N+M+SR++ LG DL  + G+T+LVWCL LLSHAC
Sbjct: 301 FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDLASRIGRTELVWCLRLLSHAC 360

Query: 368 LSSRRGMDSVVQEMDKAKEIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVT 427
           L S RG+DSVV+EM+ AK  WNVT ANI+LLAYLKMKDFK LR L SE+  RHVKPD+VT
Sbjct: 361 LLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIVT 420

Query: 428 IGILLDANNKSFDGTRTLEAWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEAYASLE 487
           IGIL DA    FDGT  LE WRR+  LS+ VEINTD LVLA +GKG FL+ CEE Y+SLE
Sbjct: 421 IGILYDARRIGFDGTGALEMWRRIGFLSKTVEINTDPLVLAVYGKGHFLRYCEEVYSSLE 480

Query: 488 LVGRESKVWTYEELIDLVYENQG 511
              RE K WTY+ LIDLV ++ G
Sbjct: 481 PYSREKKRWTYQNLIDLVIKHNG 503

BLAST of Cp4.1LG17g10180 vs. NCBI nr
Match: gi|641833822|gb|KDO52828.1| (hypothetical protein CISIN_1g010459mg [Citrus sinensis])

HSP 1 Score: 583.6 bits (1503), Expect = 3.5e-163
Identity = 275/443 (62.08%), Postives = 356/443 (80.36%), Query Frame = 1

Query: 68  TKHATLLVDTFHEHHTLKALLSDLQKRDSCPLLLLRHHGDWNRDHFWLVVKFLRNASRSH 127
           TKH TLLV+++HEH  L AL+  L K+ SCPL +L+H GDW +DHFW V++FL+N+SRS 
Sbjct: 61  TKHTTLLVESYHEHQALNALIQRLNKKVSCPLQILQHDGDWTKDHFWAVIRFLKNSSRSR 120

Query: 128 EILKLFDTWKSIEGSRISESNYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTL 187
           +I ++FD WK+IE SRI+E N +K+I +L ++GLME+AV AFQ+M+   L+PSL  YN++
Sbjct: 121 QIPQVFDMWKNIEKSRINEFNSQKIIGMLCEEGLMEEAVRAFQEMEGFALKPSLEIYNSI 180

Query: 188 IHGFAARGKFEIAMLFIDEMKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGC 247
           IHG++  GKF  A+LF++EMKE+N++ ++DTYDGLI+AYGKY+MYDE+  CLK M+LDGC
Sbjct: 181 IHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDGLIQAYGKYKMYDEIDMCLKMMKLDGC 240

Query: 248 FPDHITYNLLIREFSKGGLLKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEM 307
            PDHITYNLLI+EF+  GLLK+MEG Y+S+L+KRM L+SST+VA+L+AY  FG+LDKME 
Sbjct: 241 SPDHITYNLLIQEFACAGLLKRMEGTYKSMLTKRMHLRSSTMVAILDAYMNFGMLDKMEK 300

Query: 308 FYRRILNSKTNMKEDLIRTLALVYIQNHMYSRIETLGIDLHIKAGKTDLVWCLLLLSHAC 367
           FY+R+LNS+T +KEDL+R LA VYI+N+M+SR++ LG DL  + G+T+LVWCL LLSHAC
Sbjct: 301 FYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLDDLGDDLASRIGRTELVWCLRLLSHAC 360

Query: 368 LSSRRGMDSVVQEMDKAKEIWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVT 427
           L S RG+DSVV+EM+ AK  WNVT ANI+LLAYLKMKDFK LR L SE+  RHVKPD+VT
Sbjct: 361 LLSHRGIDSVVREMESAKVRWNVTTANIILLAYLKMKDFKHLRVLLSELPTRHVKPDIVT 420

Query: 428 IGILLDANNKSFDGTRTLEAWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEAYASLE 487
           IGIL DA    FDGT  LE W+R+  L + VEINTD LVLA +GKG FL+ CEE Y+SLE
Sbjct: 421 IGILYDARRIGFDGTGALEMWKRIGFLFKTVEINTDPLVLAVYGKGHFLRYCEEVYSSLE 480

Query: 488 LVGRESKVWTYEELIDLVYENQG 511
              RE K WTY+ LIDLV ++ G
Sbjct: 481 PYSREKKRWTYQNLIDLVIKHNG 503

BLAST of Cp4.1LG17g10180 vs. NCBI nr
Match: gi|567856726|ref|XP_006421046.1| (hypothetical protein CICLE_v10004784mg [Citrus clementina])

HSP 1 Score: 582.4 bits (1500), Expect = 7.8e-163
Identity = 282/469 (60.13%), Postives = 369/469 (78.68%), Query Frame = 1

Query: 42  SRIPLPEPYTKLLLSLPTRAVPTHSPTKHATLLVDTFHEHHTLKALLSDLQKRDSCPLLL 101
           S+I + +P +   LS    ++  HS TKH TLLV+++HEH  L AL+  L K+ SCPL +
Sbjct: 37  SKILIRKPISCCCLS-SAPSLDYHS-TKHTTLLVESYHEHQALNALIQRLNKKVSCPLQI 96

Query: 102 LRHHGDWNRDHFWLVVKFLRNASRSHEILKLFDTWKSIEGSRISESNYEKVIVLLSQDGL 161
           L+H GDW +DHFW V++FL+N+SRS +I ++FD WK+IE SRI+E N +K+I +L ++GL
Sbjct: 97  LQHDGDWTKDHFWAVIRFLKNSSRSRQIPQVFDMWKNIEKSRINEFNSQKIIGMLCEEGL 156

Query: 162 MEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDEMKEINMTRETDTYDG 221
           ME+AV AFQ+M+   L+PSL  YN++IHG++  GKF  A+LF++EMKE+N++ ++DTYDG
Sbjct: 157 MEEAVRAFQEMEGFALKPSLEIYNSIIHGYSKIGKFNEALLFLNEMKEMNLSPQSDTYDG 216

Query: 222 LIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGLLKKMEGLYRSILSKR 281
           LI+AYGKY+MYDE+  CLK M+LDGC PDHITYNLLI+EF+  GLLK+MEG Y+S+L+KR
Sbjct: 217 LIQAYGKYKMYDEIDMCLKMMKLDGCSPDHITYNLLIQEFACAGLLKRMEGTYKSMLTKR 276

Query: 282 MDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRTLALVYIQNHMYSRIE 341
           M L+SST+VA+L+AY  FG+LDKME FY+R+LNS+T +KEDL+R LA VYI+N+M+SR++
Sbjct: 277 MHLRSSTMVAILDAYMNFGMLDKMEKFYKRLLNSRTPLKEDLVRKLAEVYIKNYMFSRLD 336

Query: 342 TLGIDLHIKAGKTDLVWCLLLLSHACLSSRRGMDSVVQEMDKAKEIWNVTVANILLLAYL 401
            LG DL  + G+T+LVWCL LLSHACL S RG+DSVV+EM+ AK  WNVT ANI+LLAYL
Sbjct: 337 DLGDDLASRIGRTELVWCLRLLSHACLLSHRGIDSVVREMESAKVRWNVTTANIILLAYL 396

Query: 402 KMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTLEAWRRMNMLSRAVEIN 461
           KMKDFK LR L SE+  RHVKPD+VTIGIL DA    FDGT  LE W+R+  L + VEIN
Sbjct: 397 KMKDFKHLRVLLSELPTRHVKPDIVTIGILYDARRIGFDGTGALEMWKRIGFLFKTVEIN 456

Query: 462 TDSLVLAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELIDLVYENQG 511
           TD LVLA +GKG FL+ CEE Y+SLE   RE K WTY+ LIDLV ++ G
Sbjct: 457 TDPLVLAVYGKGHFLRYCEEVYSSLEPYSREKKRWTYQNLIDLVIKHNG 503

BLAST of Cp4.1LG17g10180 vs. NCBI nr
Match: gi|694396645|ref|XP_009373597.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Pyrus x bretschneideri])

HSP 1 Score: 571.2 bits (1471), Expect = 1.8e-159
Identity = 282/469 (60.13%), Postives = 354/469 (75.48%), Query Frame = 1

Query: 45  PLPEPYTKLLLSL--------PTRAVPTHSPTKHATLLVDTFHEHHTLKALLSDLQKRDS 104
           P P+P T    SL        P  A P    TKH  LLV+TFHEH  LK +L  +   D 
Sbjct: 37  PSPKPRTPFPSSLCYSHPFPQPNVAAPRGGSTKHTILLVETFHEHQRLKDVLVKVTTEDC 96

Query: 105 CPLLLLRHHGDWNRDHFWLVVKFLRNASRSHEILKLFDTWKSIEGSRISESNYEKVIVLL 164
           CPL LL   GDW +D FW V+ FL N SRS EIL+LF+ WK IE SRI+E NY K+I LL
Sbjct: 97  CPLQLLADDGDWTKDQFWAVITFLNNVSRSKEILQLFEMWKKIEKSRINEFNYSKIIGLL 156

Query: 165 SQDGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDEMKEINMTRET 224
           S++GLME+A   FQ+MKS  LRPSL  YN++IHGFA +G F+ A+ ++ EM+E+N+  ET
Sbjct: 157 SEEGLMEEAAPCFQEMKSHDLRPSLEVYNSMIHGFARQGNFDDALFYLSEMREMNVAPET 216

Query: 225 DTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGLLKKMEGLYRS 284
           DTYDGLIEAYGKY+MYDEM  C+K+M+L+GC PDHITYNLLIREFS+GGLLK+ME +Y+S
Sbjct: 217 DTYDGLIEAYGKYKMYDEMGMCVKKMKLNGCPPDHITYNLLIREFSRGGLLKRMESVYQS 276

Query: 285 ILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRTLALVYIQNHM 344
           +LSKR+ LQSSTL+AMLE YAKFGILDKME  Y R+LNS+T +K+DLIR LA VYI+N+ 
Sbjct: 277 MLSKRIILQSSTLIAMLEVYAKFGILDKMEKVYMRLLNSRTLVKDDLIRKLAEVYIKNYK 336

Query: 345 YSRIETLGIDLHIKAGKTDLVWCLLLLSHACLSSRRGMDSVVQEMDKAKEIWNVTVANIL 404
           +SR+E LG+D+  + G+TDLVWCL LLSHA L SRRGMDS+VQEM +    WN TVAN +
Sbjct: 337 FSRLENLGVDISSRFGQTDLVWCLRLLSHAGLLSRRGMDSIVQEMKEENAPWNATVANTI 396

Query: 405 LLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTLEAWRRMNMLSR 464
           +LAYLKMKDF  LR L S++  + VKPD++T+GIL DAN   +DG+ TL+AW++  +L R
Sbjct: 397 MLAYLKMKDFTHLRILLSQLLTQGVKPDIITVGILFDANMIGYDGSGTLDAWKKKGLLQR 456

Query: 465 AVEINTDSLVLAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELIDLV 506
           +VE+NTD LVL  FGKG+FL++CE A++SLE   RE+K WTY  LIDLV
Sbjct: 457 SVEMNTDPLVLTTFGKGQFLRNCEAAFSSLEPEVRENKTWTYHHLIDLV 505

BLAST of Cp4.1LG17g10180 vs. NCBI nr
Match: gi|658004698|ref|XP_008337481.1| (PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic [Malus domestica])

HSP 1 Score: 569.3 bits (1466), Expect = 6.8e-159
Identity = 284/482 (58.92%), Postives = 357/482 (74.07%), Query Frame = 1

Query: 27  SPHPFDLNKGSSCLSSRIPLPEPYTKLLLSLPTRAVPTHSPTKHATLLVDTFHEHHTLKA 86
           SP P      S C S   P P+            A P    TKH TLLV+TFHEH  LK 
Sbjct: 38  SPKPRTPFPSSLCYSDPFPPPDV-----------AAPRGGSTKHTTLLVETFHEHQRLKD 97

Query: 87  LLSDLQKRDSCPLLLLRHHGDWNRDHFWLVVKFLRNASRSHEILKLFDTWKSIEGSRISE 146
           LL  +   D  PL LL   GDW +D FW V+ FL NASRS EIL+LF+ WK IE SRI+E
Sbjct: 98  LLVKVTTEDCLPLQLLADDGDWTKDQFWAVITFLNNASRSKEILQLFEMWKKIEKSRINE 157

Query: 147 SNYEKVIVLLSQDGLMEDAVSAFQDMKSLGLRPSLGTYNTLIHGFAARGKFEIAMLFIDE 206
            NY K+I LLS++GLME+A   FQ+MKS  LRPSL  YN++IHGFA +G F+ A+ + +E
Sbjct: 158 FNYSKIIGLLSEEGLMEEAAPCFQEMKSHDLRPSLEVYNSMIHGFARQGNFDDALFYFNE 217

Query: 207 MKEINMTRETDTYDGLIEAYGKYRMYDEMIECLKQMELDGCFPDHITYNLLIREFSKGGL 266
           M+E+N+  ETDTYDGLIEAYGKY+MYDEM  C+K+M+L+GC PDHITYNLLIREFS+GGL
Sbjct: 218 MREMNVALETDTYDGLIEAYGKYKMYDEMGMCVKKMKLNGCPPDHITYNLLIREFSRGGL 277

Query: 267 LKKMEGLYRSILSKRMDLQSSTLVAMLEAYAKFGILDKMEMFYRRILNSKTNMKEDLIRT 326
           LK+ME +Y+S+LSKRM LQSSTL+AMLE YAKFGILDKME  Y R+LNS+T +K+DLIR 
Sbjct: 278 LKRMESVYQSMLSKRMFLQSSTLIAMLEVYAKFGILDKMEKVYMRLLNSRTLVKDDLIRK 337

Query: 327 LALVYIQNHMYSRIETLGIDLHIKAGKTDLVWCLLLLSHACLSSRRGMDSVVQEMDKAKE 386
           LA VYI+N+M+SR+E LG+D+  + G+TDLVWCL LLS A L SRRGMDS+V+EM + K 
Sbjct: 338 LAEVYIENYMFSRLENLGVDISSRFGQTDLVWCLRLLSRAGLLSRRGMDSIVEEMKEQKT 397

Query: 387 IWNVTVANILLLAYLKMKDFKRLRTLFSEIRARHVKPDLVTIGILLDANNKSFDGTRTLE 446
            WN TVAN ++LAYLKMKDF  LR L S++  + V+PD++T+GIL DAN   +DG+ TL+
Sbjct: 398 PWNATVANTIMLAYLKMKDFTHLRILLSQLLTQGVEPDIITVGILFDANMIGYDGSGTLD 457

Query: 447 AWRRMNMLSRAVEINTDSLVLAAFGKGRFLKDCEEAYASLELVGRESKVWTYEELIDLVY 506
            W++  +L R+VE+NTD LVL  FGKG FL++CE A++SLE   RE+K WTY  LIDLV 
Sbjct: 458 IWKKKGLLQRSVEMNTDPLVLTTFGKGHFLRNCEAAFSSLEPDFRENKTWTYHHLIDLVL 508

Query: 507 EN 509
           ++
Sbjct: 518 KH 508

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP310_ARATH1.2e-12849.90Pentatricopeptide repeat-containing protein At4g14190, chloroplastic OS=Arabidop... [more]
PP362_ARATH4.6e-2222.71Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana GN... [more]
PPR38_ARATH8.6e-2123.68Putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial OS... [more]
PP247_ARATH1.2e-1922.69Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop... [more]
PPR96_ARATH1.1e-1822.38Pentatricopeptide repeat-containing protein At1g62930, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A067ECN4_CITSI2.4e-16362.08Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g010459mg PE=4 SV=1[more]
V4SBF8_9ROSI5.4e-16360.13Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004784mg PE=4 SV=1[more]
W9S4F4_9ROSA3.1e-15859.05Uncharacterized protein OS=Morus notabilis GN=L484_021763 PE=4 SV=1[more]
A0A061ES07_THECC1.2e-15761.37Pentatricopeptide repeat-containing protein, putative OS=Theobroma cacao GN=TCM_... [more]
A0A0D2NX12_GOSRA2.9e-15659.02Uncharacterized protein OS=Gossypium raimondii GN=B456_006G225200 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G14190.17.0e-13049.90 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G02860.12.6e-2322.71 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G12700.14.9e-2223.68 ATP binding;nucleic acid binding;helicases[more]
AT3G22470.17.0e-2122.69 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G62930.15.9e-2022.38 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
gi|985470191|ref|XP_015380926.1|2.8e-16562.75PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic ... [more]
gi|641833822|gb|KDO52828.1|3.5e-16362.08hypothetical protein CISIN_1g010459mg [Citrus sinensis][more]
gi|567856726|ref|XP_006421046.1|7.8e-16360.13hypothetical protein CICLE_v10004784mg [Citrus clementina][more]
gi|694396645|ref|XP_009373597.1|1.8e-15960.13PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic ... [more]
gi|658004698|ref|XP_008337481.1|6.8e-15958.92PREDICTED: pentatricopeptide repeat-containing protein At4g14190, chloroplastic ... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG17g10180.1Cp4.1LG17g10180.1mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 252..280
score: 0
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 169..228
score: 4.
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 218..250
score: 2.3E-7coord: 183..209
score: 8.3E-5coord: 151..180
score: 0.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 389..423
score: 8.309coord: 250..284
score: 8.769coord: 145..179
score: 9.35coord: 215..249
score: 9.997coord: 285..319
score: 6.73coord: 180..214
score: 9
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 158..337
score: 5.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 384..451
score: 2.6E-78coord: 107..314
score: 2.6
NoneNo IPR availablePANTHERPTHR24015:SF826SUBFAMILY NOT NAMEDcoord: 384..451
score: 2.6E-78coord: 107..314
score: 2.6

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
Cp4.1LG17g10180Cp4.1LG12g11100Cucurbita pepo (Zucchini)cpecpeB165