Cp4.1LG10g08110 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG10g08110
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG10: 5256808 .. 5259080 (+)
RNA-Seq Expression: Cp4.1LG10g08110
Synteny: Cp4.1LG10g08110
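
The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this page does not state explicitly). The helper name `parse_location` is hypothetical and is not part of any database API.

```python
import re

# Matches values such as "Cp4.1LG10: 5256808 .. 5259080 (+)".
LOCATION = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(value):
    """Split a location value into sequence ID, start, end, strand and span length."""
    match = LOCATION.search(value)
    if match is None:
        raise ValueError(f"unrecognised location: {value!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Assumption: both endpoints are included in the feature.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG10: 5256808 .. 5259080 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG10 + 2273
```

Under that assumption the feature spans 2273 bp on the forward strand of Cp4.1LG10.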
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCTGGTGTGCCGAAGAGGAAGAGGAGAAGGAAGATGCCCAAACACGGAAGCATCCACTGTACGTCGAGGCATCATTACTTTCGCCGGAGACACGACTCCGATTGAAAACTCCGGTCTGGAAGTAACGGCGTTATTATCGAAGTATAATTCCTAGCGTCTTTGCCATGGTTGAGCTCCCCAAAGCGCTATCCCCTACACTGGTTTTAAAACTCCTCAAAGCAGAGAAAAACCGCAATGCGGCACTCGCTCTATTCGATTCGGTGTGTCACCATCCTGGTTACGCTCACTCACCATTCGTATTCCACTACATTCTCCGGCGACTTATCGACCCGAGGCTCGTTGTTCATGTTGGTCGGATCGTGGAGTTAATACGAGCTTAAAGATGCACCAGCTCCAAGATGTCGCACTGACGGCTATTAAGGCCTATTCGAAGTGTTCAGTGCCCGATCAAGCGCTGTATTTGTTTCAACGCATGGTTGACATTTTTGGGTGTAAACCGGGAATCAAGTCATATAACTCTATGTTTAATGCGTTTTTCACGTATTGTTTTTCAGTGGCGCCGAGCTGAATTGTTTTTCACGTACTTTCAGACGGTGGGCATGTCGCCAATTTGCAAACTTATAACATTTTGATCAAGATATCGTGCAAGAAGAAGCAGTTTGAAAAGGCGAAGAGATTATTGAATTGGATGTCGGAGAAGGGTTTGAACCCTAATGTTTTAAGCTATGGTACTTTAATTAATGCACTTGCGAAGGATGGTAAGTATCGGATGCCGTGGAGGTGTTTGATGAAATGTCTGAGAGAGGGATGAACCCTGATGTTATGTGTTATAATATCTGATTGATGGGTTTTTCAGAAAAGGAGATTTTGAGAAGACTACTGAGATTTGGGAGAGATTACTGAGGGAATCTTCAGTTTATCCGAGTGTGGCAACATATAACATTATGATAAATGGTTTATGTAAGCTGGGTAAGTTCGATGAGAGTATGGAAATATGGAATAGAATGAAGAAGAACCAAAGGTCAGTCGATTTATTTACTTTTAGTTCTATGATTCACGGTTTGAGCAAAGCAGGAAACTCCGATGCTGCTGAGAAAATTTATCAGGAGATGATTGACAGTGGGTTATCGCCTGATGTGCCAACATATAATACAATGCTCAGGGGTCTATTTCGAGCTGCCAAACTAAGTAAATGCTTTGAGTTGTGGGAGGAGATGGGTAAGAATAACTGTTGCAATATCGTTAGTTATAACATATTGATTCAAGGGTTGTTTGACAACAAGAAAGTGGAAGAAGCGATTTGTAATTGGCAGCTCTTACATAAGAGGGGCTTGAGGGCAGATTCAACAACATATGGAGTGTTGATTCACGGGCTATGTAAGAATGGATACTTGAATGAGGCTTTAAGGATATTAAAAGAAGCTGAAAATGAGGATGCAGATTTGGATACTTTTGCTTACTCCTCAATGATTCATGGATTATGCATAAAAGGGAGGTTGGAGCAAGCGGCTGAGCTGATTCATCAGATGAACAAACATAAATATAAACTGAGTTCTCATATCTTCAATTCATTGATTAATGGATGTATCCGAGCTTCTAAACTTGAAGAGGCTATTTTTCTTCTAAGGGAAATGGGCAACCAAGACTGTGCTCCTACTGTAGTCTCCTACAACACTATTATCAATGGTTTGTGTAAGGCAGAAAGATTTAGCGATGCATATCTTCTTCTAAAGGAGATGCTGGAAAAGGGCTTAAAGCCTGATATGATTACATATAGCTTGTTGATCGATGGCCTGTGTCGTGGAGAAAAGTTTGACGTGGCACTCAACTTATGGCATCACTGTATTAACAAGGGTCTTAAGCCCGATGTAACCATGCACAACATCATAATTCATGGTCTTTGTACGGCCCGAAAAGTTGATGTTGCCCTGGAGATCTTTACTCAAATGGCTCAGGTCAACTGTATTCCGGATCTTGTAACACACAACACCATCATG
GAAGGTGTTTACAAAGCTGGAGACTGCCAGGAGGCTTTAAAGATTTGGGACCGTATCTTGGAAAAGGGTCTTAAGCCAGACATTATATCCTATAACATTACTTTTAAGGGACTCTGCTCTTGCGCTAGAATTTCAGATGCCATTGGGTTCCTATATGATGCCCTGCATCATGGAATTCTTCCAACTGCCACAACATGGAACATTCTTGTTAGAGCAGTTGTTGGTGATAGATCATTAATGGAATATGCTCTTATTTTCGAGTCTCGGACGTGA

mRNA sequence

ATGCCTGGTGTGCCGAAGAGGAAGAGGAGAAGGAAGATGCCCAAACACGGAAGCATCCACTACGGTGGGCATGTCGCCAATTTGCAAACTTATAACATTTTGATCAAGATATCGTGCAAGAAGAAGCAGTTTGAAAAGGCGAAGAGATTATTGAATTGGATGTCGGAGAAGGGTTTGAACCCTAATGTTTTAAGCTATGGTACTTTAATTAATGCACTTGCGAAGGATGGTAAGTATCGGATGCCGTGGAGAAAAGGAGATTTTGAGAAGACTACTGAGATTTGGGAGAGATTACTGAGGGAATCTTCAGTTTATCCGAGTGTGGCAACATATAACATTATGATAAATGGTTTATGTAAGCTGGGTAAGTTCGATGAGAGTATGGAAATATGGAATAGAATGAAGAAGAACCAAAGGTCAGTCGATTTATTTACTTTTAGTTCTATGATTCACGGTTTGAGCAAAGCAGGAAACTCCGATGCTGCTGAGAAAATTTATCAGGAGATGATTGACAGTGGGTTATCGCCTGATGTGCCAACATATAATACAATGCTCAGGGGTCTATTTCGAGCTGCCAAACTAAGTAAATGCTTTGAGTTGTGGGAGGAGATGGGTAAGAATAACTGTTGCAATATCGTTAGTTATAACATATTGATTCAAGGGTTGTTTGACAACAAGAAAGTGGAAGAAGCGATTTGTAATTGGCAGCTCTTACATAAGAGGGGCTTGAGGGCAGATTCAACAACATATGGAGTGTTGATTCACGGGCTATGTAAGAATGGATACTTGAATGAGGCTTTAAGGATATTAAAAGAAGCTGAAAATGAGGATGCAGATTTGGATACTTTTGCTTACTCCTCAATGATTCATGGATTATGCATAAAAGGGAGGTTGGAGCAAGCGGCTGAGCTGATTCATCAGATGAACAAACATAAATATAAACTGAGTTCTCATATCTTCAATTCATTGATTAATGGATGTATCCGAGCTTCTAAACTTGAAGAGGCTATTTTTCTTCTAAGGGAAATGGGCAACCAAGACTGTGCTCCTACTGTAGTCTCCTACAACACTATTATCAATGGTTTGTGTAAGGCAGAAAGATTTAGCGATGCATATCTTCTTCTAAAGGAGATGCTGGAAAAGGGCTTAAAGCCTGATATGATTACATATAGCTTGTTGATCGATGGCCTGTGTCGTGGAGAAAAGTTTGACGTGGCACTCAACTTATGGCATCACTGTATTAACAAGGGTCTTAAGCCCGATGTAACCATGCACAACATCATAATTCATGGTCTTTGTACGGCCCGAAAAGTTGATGTTGCCCTGGAGATCTTTACTCAAATGGCTCAGGTCAACTGTATTCCGGATCTTGTAACACACAACACCATCATGGAAGGTGTTTACAAAGCTGGAGACTGCCAGGAGGCTTTAAAGATTTGGGACCGTATCTTGGAAAAGGGTCTTAAGCCAGACATTATATCCTATAACATTACTTTTAAGGGACTCTGCTCTTGCGCTAGAATTTCAGATGCCATTGGGTTCCTATATGATGCCCTGCATCATGGAATTCTTCCAACTGCCACAACATGGAACATTCTTGTTAGAGCAGTTGTTGGTGATAGATCATTAATGGAATATGCTCTTATTTTCGAGTCTCGGACGTGA

Coding sequence (CDS)

ATGCCTGGTGTGCCGAAGAGGAAGAGGAGAAGGAAGATGCCCAAACACGGAAGCATCCACTACGGTGGGCATGTCGCCAATTTGCAAACTTATAACATTTTGATCAAGATATCGTGCAAGAAGAAGCAGTTTGAAAAGGCGAAGAGATTATTGAATTGGATGTCGGAGAAGGGTTTGAACCCTAATGTTTTAAGCTATGGTACTTTAATTAATGCACTTGCGAAGGATGGTAAGTATCGGATGCCGTGGAGAAAAGGAGATTTTGAGAAGACTACTGAGATTTGGGAGAGATTACTGAGGGAATCTTCAGTTTATCCGAGTGTGGCAACATATAACATTATGATAAATGGTTTATGTAAGCTGGGTAAGTTCGATGAGAGTATGGAAATATGGAATAGAATGAAGAAGAACCAAAGGTCAGTCGATTTATTTACTTTTAGTTCTATGATTCACGGTTTGAGCAAAGCAGGAAACTCCGATGCTGCTGAGAAAATTTATCAGGAGATGATTGACAGTGGGTTATCGCCTGATGTGCCAACATATAATACAATGCTCAGGGGTCTATTTCGAGCTGCCAAACTAAGTAAATGCTTTGAGTTGTGGGAGGAGATGGGTAAGAATAACTGTTGCAATATCGTTAGTTATAACATATTGATTCAAGGGTTGTTTGACAACAAGAAAGTGGAAGAAGCGATTTGTAATTGGCAGCTCTTACATAAGAGGGGCTTGAGGGCAGATTCAACAACATATGGAGTGTTGATTCACGGGCTATGTAAGAATGGATACTTGAATGAGGCTTTAAGGATATTAAAAGAAGCTGAAAATGAGGATGCAGATTTGGATACTTTTGCTTACTCCTCAATGATTCATGGATTATGCATAAAAGGGAGGTTGGAGCAAGCGGCTGAGCTGATTCATCAGATGAACAAACATAAATATAAACTGAGTTCTCATATCTTCAATTCATTGATTAATGGATGTATCCGAGCTTCTAAACTTGAAGAGGCTATTTTTCTTCTAAGGGAAATGGGCAACCAAGACTGTGCTCCTACTGTAGTCTCCTACAACACTATTATCAATGGTTTGTGTAAGGCAGAAAGATTTAGCGATGCATATCTTCTTCTAAAGGAGATGCTGGAAAAGGGCTTAAAGCCTGATATGATTACATATAGCTTGTTGATCGATGGCCTGTGTCGTGGAGAAAAGTTTGACGTGGCACTCAACTTATGGCATCACTGTATTAACAAGGGTCTTAAGCCCGATGTAACCATGCACAACATCATAATTCATGGTCTTTGTACGGCCCGAAAAGTTGATGTTGCCCTGGAGATCTTTACTCAAATGGCTCAGGTCAACTGTATTCCGGATCTTGTAACACACAACACCATCATGGAAGGTGTTTACAAAGCTGGAGACTGCCAGGAGGCTTTAAAGATTTGGGACCGTATCTTGGAAAAGGGTCTTAAGCCAGACATTATATCCTATAACATTACTTTTAAGGGACTCTGCTCTTGCGCTAGAATTTCAGATGCCATTGGGTTCCTATATGATGCCCTGCATCATGGAATTCTTCCAACTGCCACAACATGGAACATTCTTGTTAGAGCAGTTGTTGGTGATAGATCATTAATGGAATATGCTCTTATTTTCGAGTCTCGGACGTGA

Protein sequence

MPGVPKRKRRRKMPKHGSIHYGGHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGKYRMPWRKGDFEKTTEIWERLLRESSVYPSVATYNIMINGLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSPDVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQLLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKGRLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYNTIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINKGLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEALKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRAVVGDRSLMEYALIFESRT
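
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch that assumes the standard nuclear genetic code (NCBI translation table 1). The function name `translate` is illustrative, and only short excerpts from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGCCTGGTGTGCCGAAGAGG"))  # MPGVPKR
# Last four codons of the CDS, ending in the TGA stop.
print(translate("TCTCGGACGTGA"))           # SRT
```

Both excerpts agree with the first and last residues of the protein sequence shown above (MPGVPKR... and ...SRT).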
Homology
BLAST of Cp4.1LG10g08110 vs. ExPASy Swiss-Prot
Match: Q9SS81 (Pentatricopeptide repeat-containing protein At3g09060 OS=Arabidopsis thaliana OX=3702 GN=At3g09060 PE=2 SV=1)

HSP 1 Score: 570.1 bits (1468), Expect = 2.8e-161
Identity = 278/542 (51.29%), Positives = 364/542 (67.16%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYN+LIK+SCKKK+FEKA+  L+WM ++G  P+V SY T+IN LAK GK    
Sbjct: 144 GVAPNLQTYNVLIKMSCKKKEFEKARGFLDWMWKEGFKPDVFSYSTVINDLAKAGKLDDA 203

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +      ++ D +   E+W+RLL +SSVYP+V T+NIMI+
Sbjct: 204 LELFDEMSERGVAPDVTCYNILIDGFLKEKDHKTAMELWDRLLEDSSVYPNVKTHNIMIS 263

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GL K G+ D+ ++IW RMK+N+R  DL+T+SS+IHGL  AGN D AE ++ E+ +   S 
Sbjct: 264 GLSKCGRVDDCLKIWERMKQNEREKDLYTYSSLIHGLCDAGNVDKAESVFNELDERKASI 323

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYNTML G  R  K+ +  ELW  M   N  NIVSYNILI+GL +N K++EA   W+
Sbjct: 324 DVVTYNTMLGGFCRCGKIKESLELWRIMEHKNSVNIVSYNILIKGLLENGKIDEATMIWR 383

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           L+  +G  AD TTYG+ IHGLC NGY+N+AL +++E E+    LD +AY+S+I  LC K 
Sbjct: 384 LMPAKGYAADKTTYGIFIHGLCVNGYVNKALGVMQEVESSGGHLDVYAYASIIDCLCKKK 443

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RLE+A+ L+ +M+KH  +L+SH+ N+LI G IR S+L EA F LREMG   C PTVVSYN
Sbjct: 444 RLEEASNLVKEMSKHGVELNSHVCNALIGGLIRDSRLGEASFFLREMGKNGCRPTVVSYN 503

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
            +I GLCKA +F +A   +KEMLE G KPD+ TYS+L+ GLCR  K D+AL LWH  +  
Sbjct: 504 ILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILLCGLCRDRKIDLALELWHQFLQS 563

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           GL+ DV MHNI+IHGLC+  K+D A+ +   M   NC  +LVT+NT+MEG +K GD   A
Sbjct: 564 GLETDVMMHNILIHGLCSVGKLDDAMTVMANMEHRNCTANLVTYNTLMEGFFKVGDSNRA 623

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 539
             IW  + + GL+PDIISYN   KGLC C  +S A+ F  DA +HGI PT  TWNILVRA
Sbjct: 624 TVIWGYMYKMGLQPDIISYNTIMKGLCMCRGVSYAMEFFDDARNHGIFPTVYTWNILVRA 683
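
The identity and positives percentages reported in each HSP summary line are the matching counts divided by the alignment length. A minimal sketch of that arithmetic follows. The regular expression and the function name `hsp_percentages` are assumptions made for illustration, and the pattern accepts both the "Positives" and "Postives" spellings of the field label.

```python
import re

# Captures "Identity = a/b" and "Positives = c/d" from an HSP summary line.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )

summary = "Identity = 278/542 (51.29%), Positives = 364/542 (67.16%), Query Frame = 0"
print(hsp_percentages(summary))  # (51.29, 67.16)
```

For the Q9SS81 hit, 278 identical and 364 similar positions over an alignment length of 542 reproduce the reported 51.29% and 67.16%.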

BLAST of Cp4.1LG10g08110 vs. ExPASy Swiss-Prot
Match: Q9LFF1 (Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=MEE40 PE=2 SV=1)

HSP 1 Score: 278.1 bits (710), Expect = 2.2e-73
Identity = 158/529 (29.87%), Positives = 281/529 (53.12%), Query Frame = 0

Query: 16  HGSIHYGGHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAK 75
           H  +   G   ++ T+N+LIK  C+  Q   A  +L  M   GL P+  ++ T++    +
Sbjct: 177 HAKMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIE 236

Query: 76  DGKYRMPWRKGDFEKTTEIWERLLRESSVYPSVATYNIMINGLCKLGKFDESMEIWNRMK 135
           +         GD +    I E+++     + +V+  N++++G CK G+ ++++     M 
Sbjct: 237 E---------GDLDGALRIREQMVEFGCSWSNVSV-NVIVHGFCKEGRVEDALNFIQEMS 296

Query: 136 KNQRSV-DLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSPDVPTYNTMLRGLFRAAKL 195
                  D +TF+++++GL KAG+   A +I   M+  G  PDV TYN+++ GL +  ++
Sbjct: 297 NQDGFFPDQYTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEV 356

Query: 196 SKCFELWEEMGKNNCC-NIVSYNILIQGLFDNKKVEEAICNWQLLHKRGLRADSTTYGVL 255
            +  E+ ++M   +C  N V+YN LI  L    +VEEA    ++L  +G+  D  T+  L
Sbjct: 357 KEAVEVLDQMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSL 416

Query: 256 IHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKGRLEQAAELIHQMNKHKY 315
           I GLC       A+ + +E  ++  + D F Y+ +I  LC KG+L++A  ++ QM     
Sbjct: 417 IQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGC 476

Query: 316 KLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYNTIINGLCKAERFSDAYL 375
             S   +N+LI+G  +A+K  EA  +  EM     +   V+YNT+I+GLCK+ R  DA  
Sbjct: 477 ARSVITYNTLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQ 536

Query: 376 LLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINKGLKPDVTMHNIIIHGLC 435
           L+ +M+ +G KPD  TY+ L+   CRG     A ++     + G +PD+  +  +I GLC
Sbjct: 537 LMDQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLC 596

Query: 436 TARKVDVALEIF--TQMAQVNCIPDLVTHNTIMEGVYKAGDCQEALKIWDRILEKG-LKP 495
            A +V+VA ++    QM  +N  P    +N +++G+++     EA+ ++  +LE+    P
Sbjct: 597 KAGRVEVASKLLRSIQMKGINLTPH--AYNPVIQGLFRKRKTTEAINLFREMLEQNEAPP 656

Query: 496 DIISYNITFKGLCS-CARISDAIGFLYDALHHGILPTATTWNILVRAVV 539
           D +SY I F+GLC+    I +A+ FL + L  G +P  ++  +L   ++
Sbjct: 657 DAVSYRIVFRGLCNGGGPIREAVDFLVELLEKGFVPEFSSLYMLAEGLL 693

BLAST of Cp4.1LG10g08110 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 2.2e-70
Identity = 166/569 (29.17%), Positives = 281/569 (49.38%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEK---GLNPNVLSYGTLINALAKDGKY 82
           G + N+ +YNIL+K  C + + ++A  LL+ M++    G  P+V+SY T+IN   K+   
Sbjct: 153 GCIPNVFSYNILLKGLCDENRSQEALELLHMMADDRGGGSPPDVVSYTTVINGFFKE--- 212

Query: 83  RMPWRKGDFEKTTEIWERLLRESSVYPSVATYNIMINGLCKLGKFDESMEIWNRMKKNQR 142
                 GD +K    +  +L +  + P V TYN +I  LCK    D++ME+ N M KN  
Sbjct: 213 ------GDSDKAYSTYHEML-DRGILPDVVTYNSIIAALCKAQAMDKAMEVLNTMVKNGV 272

Query: 143 SVDLFTFSSMIHG-----------------------------------LSKAGNSDAAEK 202
             D  T++S++HG                                   L K G    A K
Sbjct: 273 MPDCMTYNSILHGYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCKNGRCMEARK 332

Query: 203 IYQEMIDSGLSPDVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNC-CNIVSYNILIQGLF 262
           I+  M   GL P++ TY T+L+G      L +   L + M +N    +   ++ILI    
Sbjct: 333 IFDSMTKRGLKPEITTYGTLLQGYATKGALVEMHGLLDLMVRNGIHPDHYVFSILICAYA 392

Query: 263 DNKKVEEAICNWQLLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTF 322
              KV++A+  +  + ++GL  ++ TYG +I  LCK+G + +A+   ++  +E       
Sbjct: 393 KQGKVDQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQMIDEGLSPGNI 452

Query: 323 AYSSMIHGLCIKGRLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREM 382
            Y+S+IHGLC   + E+A ELI +M      L++  FNS+I+   +  ++ E+  L   M
Sbjct: 453 VYNSLIHGLCTCNKWERAEELILEMLDRGICLNTIFFNSIIDSHCKEGRVIESEKLFELM 512

Query: 383 GNQDCAPTVVSYNTIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKF 442
                 P V++YNT+ING C A +  +A  LL  M+  GLKP+ +TYS LI+G C+  + 
Sbjct: 513 VRIGVKPNVITYNTLINGYCLAGKMDEAMKLLSGMVSVGLKPNTVTYSTLINGYCKISRM 572

Query: 443 DVALNLWHHCINKGLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTI 502
           + AL L+    + G+ PD+  +NII+ GL   R+   A E++ ++ +     +L T+N I
Sbjct: 573 EDALVLFKEMESSGVSPDIITYNIILQGLFQTRRTAAAKELYVRITESGTQIELSTYNII 632

Query: 503 MEGVYKAGDCQEALKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGI 553
           + G+ K     +AL+++  +    LK +  ++NI    L    R  +A         +G+
Sbjct: 633 LHGLCKNKLTDDALQMFQNLCLMDLKLEARTFNIMIDALLKVGRNDEAKDLFVAFSSNGL 692

BLAST of Cp4.1LG10g08110 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 259.6 bits (662), Expect = 8.0e-68
Identity = 144/522 (27.59%), Positives = 263/522 (50.38%), Query Frame = 0

Query: 31  YNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGKYRMPWRKGDFEK 90
           +N L     + KQ++        M   G+  ++ +   +IN   +  K         F  
Sbjct: 73  FNRLCSAVARTKQYDLVLGFCKGMELNGIEHDMYTMTIMINCYCRKKKLLFA-----FSV 132

Query: 91  TTEIWERLLRESSVYPSVATYNIMINGLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMI 150
               W     +    P   T++ ++NG C  G+  E++ + +RM + ++  DL T S++I
Sbjct: 133 LGRAW-----KLGYEPDTITFSTLVNGFCLEGRVSEAVALVDRMVEMKQRPDLVTVSTLI 192

Query: 151 HGLSKAGNSDAAEKIYQEMIDSGLSPDVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNC- 210
           +GL   G    A  +   M++ G  PD  TY  +L  L ++   +   +L+ +M + N  
Sbjct: 193 NGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDLFRKMEERNIK 252

Query: 211 CNIVSYNILIQGLFDNKKVEEAICNWQLLHKRGLRADSTTYGVLIHGLCKNGYLNEALRI 270
            ++V Y+I+I  L  +   ++A+  +  +  +G++AD  TY  LI GLC +G  ++  ++
Sbjct: 253 ASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGLCNDGKWDDGAKM 312

Query: 271 LKEAENEDADLDTFAYSSMIHGLCIKGRLEQAAELIHQMNKHKYKLSSHIFNSLINGCIR 330
           L+E    +   D   +S++I     +G+L +A EL ++M        +  +NSLI+G  +
Sbjct: 313 LREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDTITYNSLIDGFCK 372

Query: 331 ASKLEEAIFLLREMGNQDCAPTVVSYNTIINGLCKAERFSDAYLLLKEMLEKGLKPDMIT 390
            + L EA  +   M ++ C P +V+Y+ +IN  CKA+R  D   L +E+  KGL P+ IT
Sbjct: 373 ENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFREISSKGLIPNTIT 432

Query: 391 YSLLIDGLCRGEKFDVALNLWHHCINKGLKPDVTMHNIIIHGLCTARKVDVALEIFTQMA 450
           Y+ L+ G C+  K + A  L+   +++G+ P V  + I++ GLC   +++ ALEIF +M 
Sbjct: 433 YNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEKMQ 492

Query: 451 QVNCIPDLVTHNTIMEGVYKAGDCQEALKIWDRILEKGLKPDIISYNITFKGLCSCARIS 510
           +      +  +N I+ G+  A    +A  ++  + +KG+KPD+++YN+   GLC    +S
Sbjct: 493 KSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSLS 552

Query: 511 DAIGFLYDALHHGILPTATTWNILVRAVVGDRSLMEYALIFE 552
           +A          G  P   T+NIL+RA +G   L+    + E
Sbjct: 553 EADMLFRKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIE 584

BLAST of Cp4.1LG10g08110 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 258.8 bits (660), Expect = 1.4e-67
Identity = 130/431 (30.16%), Positives = 232/431 (53.83%), Query Frame = 0

Query: 106 PSVATYNIMINGLCKLGK-FDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEK 165
           P V +YN +++   +  +    +  ++  M ++Q S ++FT++ +I G   AGN D A  
Sbjct: 167 PGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALT 226

Query: 166 IYQEMIDSGLSPDVPTYNTMLRGLFRAAKLSKCFELWEEMG-KNNCCNIVSYNILIQGLF 225
           ++ +M   G  P+V TYNT++ G  +  K+   F+L   M  K    N++SYN++I GL 
Sbjct: 227 LFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLC 286

Query: 226 DNKKVEEAICNWQLLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTF 285
              +++E       +++RG   D  TY  LI G CK G  ++AL +  E           
Sbjct: 287 REGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVI 346

Query: 286 AYSSMIHGLCIKGRLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREM 345
            Y+S+IH +C  G + +A E + QM       +   + +L++G  +   + EA  +LREM
Sbjct: 347 TYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREM 406

Query: 346 GNQDCAPTVVSYNTIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKF 405
            +   +P+VV+YN +ING C   +  DA  +L++M EKGL PD+++YS ++ G CR    
Sbjct: 407 NDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDV 466

Query: 406 DVALNLWHHCINKGLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTI 465
           D AL +    + KG+KPD   ++ +I G C  R+   A +++ +M +V   PD  T+  +
Sbjct: 467 DEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTAL 526

Query: 466 MEGVYKAGDCQEALKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGI 525
           +      GD ++AL++ + ++EKG+ PD+++Y++   GL   +R  +A   L    +   
Sbjct: 527 INAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEES 586

Query: 526 LPTATTWNILV 535
           +P+  T++ L+
Sbjct: 587 VPSDVTYHTLI 597

BLAST of Cp4.1LG10g08110 vs. NCBI nr
Match: KAG6595403.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 912 bits (2356), Expect = 0.0
Identity = 444/558 (79.57%), Positives = 487/558 (87.28%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYNILIKISCKKKQFEKAKRLLNWMSEKGL+PNV SYGTLINALAK G     
Sbjct: 107 GMSPNLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLSPNVFSYGTLINALAKSGNLSDA 166

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +     +RKGDF K +E+WERL RESSVYPSVATYNIMIN
Sbjct: 167 LNLFDEMSERGVNPDVMCYNILIDGFFRKGDFVKASEVWERLRRESSVYPSVATYNIMIN 226

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GLCKLGKFDESMEIWNRMKKN+RS+DLFT+SSMIHGLSKAGN DAAE+++QEM+D GLSP
Sbjct: 227 GLCKLGKFDESMEIWNRMKKNKRSLDLFTYSSMIHGLSKAGNFDAAERVFQEMVDVGLSP 286

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYNTML  LF+A KLSKCFELWE M KNNCCNIVSYNI IQGLFDNKKVEEAICNWQ
Sbjct: 287 DVTTYNTMLSALFQAGKLSKCFELWELMSKNNCCNIVSYNIFIQGLFDNKKVEEAICNWQ 346

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           LLH+RG  ADSTTYG+LIHGLCKNGYLN+ALRILK+AENE ADLD FAYSSMI GLC + 
Sbjct: 347 LLHERGFTADSTTYGLLIHGLCKNGYLNKALRILKKAENEGADLDIFAYSSMIDGLCKEA 406

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RL+QA EL+HQMN HK+KL+S++FNSLING +RASKLEEA FLLREM  + C+PTVVSYN
Sbjct: 407 RLDQAVELVHQMNTHKHKLNSYVFNSLINGYVRASKLEEATFLLREMSKKGCSPTVVSYN 466

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
           T+INGLCKAERFSDAYL LKEMLEKGLKPDMITYSLLIDGLCRG+K D+ALNLWH CI+K
Sbjct: 467 TLINGLCKAERFSDAYLFLKEMLEKGLKPDMITYSLLIDGLCRGDKLDMALNLWHQCIDK 526

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           GLKPDVT+HNIIIHGLCTARKVDVAL+ FT+MAQVNC+PDLVTHNTIMEG+YK GDC EA
Sbjct: 527 GLKPDVTIHNIIIHGLCTARKVDVALKFFTEMAQVNCVPDLVTHNTIMEGLYKVGDCVEA 586

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 554
           LKIWDRILE+GL+PDI+SYNITFKGLCSCAR+SDAIGFLYDAL HG+LPTA TW+ILVRA
Sbjct: 587 LKIWDRILEEGLQPDILSYNITFKGLCSCARVSDAIGFLYDALKHGVLPTAPTWDILVRA 646

BLAST of Cp4.1LG10g08110 vs. NCBI nr
Match: XP_023518584.1 (pentatricopeptide repeat-containing protein At3g09060-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 911 bits (2355), Expect = 0.0
Identity = 444/558 (79.57%), Positives = 487/558 (87.28%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYNILIKISCKKKQFEKAKRLLNW+SEKGL+PNV SYGTLINALAK G     
Sbjct: 144 GMSPNLQTYNILIKISCKKKQFEKAKRLLNWISEKGLSPNVFSYGTLINALAKSGNLSDA 203

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +     +RKGDF K +E+WERLLRESSVYPSVATYNIMIN
Sbjct: 204 LNLFDEMSERGVNPDVMCYNILIDGFFRKGDFVKASEVWERLLRESSVYPSVATYNIMIN 263

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GLCKLGKFDESMEIWNRMKKN+RS+DLFT+ SMIHGLSKAGN DAAE+++QEM+D GLSP
Sbjct: 264 GLCKLGKFDESMEIWNRMKKNKRSLDLFTYCSMIHGLSKAGNFDAAERVFQEMVDVGLSP 323

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYNTML  LF+A KLSKCFELWE M KNNCCNIVSYNI IQGLF NKKVEEAICNWQ
Sbjct: 324 DVTTYNTMLSALFQAGKLSKCFELWELMSKNNCCNIVSYNIFIQGLFGNKKVEEAICNWQ 383

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           LLH+RG  ADSTTYG+LIHGLCKNGYLN+ALRILKEAENE ADLD FAYSSMI GLC + 
Sbjct: 384 LLHERGFTADSTTYGLLIHGLCKNGYLNKALRILKEAENEGADLDIFAYSSMIDGLCKEA 443

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RL+QA EL+HQMN HK+KL+S++FNSLING +RASKLEEAIFLLREM  + C+PTVVSYN
Sbjct: 444 RLDQAVELVHQMNTHKHKLNSYVFNSLINGYVRASKLEEAIFLLREMSKKGCSPTVVSYN 503

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
           T+INGLCKAERFSDAYL LKEMLEKGLKPDMITYSLLIDGLCRG+K D+ALNLWH CI+K
Sbjct: 504 TLINGLCKAERFSDAYLFLKEMLEKGLKPDMITYSLLIDGLCRGDKLDMALNLWHQCIDK 563

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           GLKPDVT+HNIIIHGLCTARKVDVAL+ FT+MAQVNC+PDLVTHNTIMEG+YK GDC EA
Sbjct: 564 GLKPDVTIHNIIIHGLCTARKVDVALKFFTEMAQVNCVPDLVTHNTIMEGLYKVGDCVEA 623

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 554
           LKIWDRILE+GL+PDI+SYNITFKGLCSCAR+SDAIGFLYDAL HG+LPTA TW+ILVRA
Sbjct: 624 LKIWDRILEEGLQPDILSYNITFKGLCSCARVSDAIGFLYDALKHGVLPTAPTWDILVRA 683

BLAST of Cp4.1LG10g08110 vs. NCBI nr
Match: XP_038882547.1 (pentatricopeptide repeat-containing protein At3g09060 [Benincasa hispida])

HSP 1 Score: 911 bits (2354), Expect = 0.0
Identity = 450/558 (80.65%), Positives = 482/558 (86.38%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYNILIKISCKKKQFEKAK LL WMSEKGLNP+VLSYGTLINALAK G     
Sbjct: 144 GMSPNLQTYNILIKISCKKKQFEKAKGLLKWMSEKGLNPDVLSYGTLINALAKSGNLSDA 203

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +     +RKGD  K  EIWERLLRESSVYPSVATYNIMIN
Sbjct: 204 LEVFDEMSERGVNPDVMCYNILIDGFFRKGDLVKANEIWERLLRESSVYPSVATYNIMIN 263

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GLCKLGKF+ SMEIW RMK N+RS DLFTFSSMIHGLSKAGN DAAEKI+QEMID+GLSP
Sbjct: 264 GLCKLGKFEMSMEIWTRMKNNERSFDLFTFSSMIHGLSKAGNIDAAEKIFQEMIDNGLSP 323

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYN ML GLFRA KL KCFELWE MGKNNCCNIVSYNILIQGL DNKKVE+AIC WQ
Sbjct: 324 DVTTYNAMLSGLFRAGKLGKCFELWEVMGKNNCCNIVSYNILIQGLLDNKKVEKAICYWQ 383

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           LLH+RGL+ADSTTYG+LIHGLCKNGYL++ALRILKEAENE ADLDTFAYSSMIHGLC KG
Sbjct: 384 LLHERGLKADSTTYGLLIHGLCKNGYLSKALRILKEAENEGADLDTFAYSSMIHGLCKKG 443

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RL+QA EL+HQMNKHK+KL+SH+FNSLING +RASKLEEAI LLREM N+DCAPTVVSYN
Sbjct: 444 RLDQAVELVHQMNKHKHKLNSHVFNSLINGYVRASKLEEAILLLREMKNKDCAPTVVSYN 503

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
           TIINGLCKAERFSDAYL LKEMLE+GLKPD+ITY+LLIDGLCRGEK D+AL LWHHCINK
Sbjct: 504 TIINGLCKAERFSDAYLSLKEMLEEGLKPDVITYTLLIDGLCRGEKLDMALKLWHHCINK 563

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           GLKPDVTMHNIIIHGLCTA+KVD+AL+IF QMAQVNC+PDLVTHN+IMEG+YKAGDC EA
Sbjct: 564 GLKPDVTMHNIIIHGLCTAQKVDLALDIFNQMAQVNCVPDLVTHNSIMEGLYKAGDCAEA 623

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 554
           LKIWDRILE  L+PDIISYNI FKGLCSC R+SDAIGFLYDAL HGILP A TWNILVRA
Sbjct: 624 LKIWDRILEAHLQPDIISYNIAFKGLCSCTRVSDAIGFLYDALQHGILPNAPTWNILVRA 683

BLAST of Cp4.1LG10g08110 vs. NCBI nr
Match: XP_022966568.1 (pentatricopeptide repeat-containing protein At3g09060 isoform X1 [Cucurbita maxima])

HSP 1 Score: 909 bits (2349), Expect = 0.0
Identity = 444/558 (79.57%), Positives = 485/558 (86.92%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYNILIKISCKKKQFEKAKRLLNW+SEKGL+PNV SYGTLINALAK G     
Sbjct: 144 GMSPNLQTYNILIKISCKKKQFEKAKRLLNWISEKGLSPNVFSYGTLINALAKSGNLSDA 203

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +     +RKGDF K +E+WERLLRESSVYPSVATYNIMIN
Sbjct: 204 LNLFDEMSERGVNPDVLCYNILIDGFFRKGDFVKASEVWERLLRESSVYPSVATYNIMIN 263

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GLCKLGKFDESMEIWNRMKKNQRS+DLFT+SSMIHGLSKAGN  AAE+++QEM+D GLSP
Sbjct: 264 GLCKLGKFDESMEIWNRMKKNQRSLDLFTYSSMIHGLSKAGNFHAAERVFQEMVDVGLSP 323

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYNTML  LF+A KL KCFELWE M KNNCCNIVSYNI IQGLFDNKKVEEAICNWQ
Sbjct: 324 DVTTYNTMLSALFQAGKLMKCFELWELMSKNNCCNIVSYNIFIQGLFDNKKVEEAICNWQ 383

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           LLH+RG  ADSTTYG+LIHGLCKNGYLN+ALRILKEAENE ADLD FAYSSMI+GLC + 
Sbjct: 384 LLHERGFTADSTTYGLLIHGLCKNGYLNKALRILKEAENEGADLDIFAYSSMINGLCKEA 443

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RL+QA EL+HQMN HK+KL+S++FNSLING +RASKLEEA FLLREM  + C+PTVVSYN
Sbjct: 444 RLDQAVELVHQMNTHKHKLNSYVFNSLINGYVRASKLEEATFLLREMSKKGCSPTVVSYN 503

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
           T+INGLCKAERFSDAYL LKEMLEKGLKPDMITYSLLIDGLCRG+K D+ALNLW  CINK
Sbjct: 504 TLINGLCKAERFSDAYLFLKEMLEKGLKPDMITYSLLIDGLCRGDKLDMALNLWDQCINK 563

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           GLKPDVT+HNIIIHGLC AR VDVAL+ FT+MAQVNC+PDLVTHNTIMEG+YK GDC EA
Sbjct: 564 GLKPDVTIHNIIIHGLCRARNVDVALKFFTEMAQVNCVPDLVTHNTIMEGLYKVGDCVEA 623

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 554
           LKIWDRILE+GL+PDI+SYNITFKGLCSCAR+SDAIGFLYDAL HG+LPTATTWNILVRA
Sbjct: 624 LKIWDRILEEGLQPDIMSYNITFKGLCSCARVSDAIGFLYDALKHGVLPTATTWNILVRA 683

BLAST of Cp4.1LG10g08110 vs. NCBI nr
Match: XP_022966569.1 (pentatricopeptide repeat-containing protein At3g09060 isoform X2 [Cucurbita maxima])

HSP 1 Score: 909 bits (2349), Expect = 0.0
Identity = 444/558 (79.57%), Positives = 485/558 (86.92%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYNILIKISCKKKQFEKAKRLLNW+SEKGL+PNV SYGTLINALAK G     
Sbjct: 107 GMSPNLQTYNILIKISCKKKQFEKAKRLLNWISEKGLSPNVFSYGTLINALAKSGNLSDA 166

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +     +RKGDF K +E+WERLLRESSVYPSVATYNIMIN
Sbjct: 167 LNLFDEMSERGVNPDVLCYNILIDGFFRKGDFVKASEVWERLLRESSVYPSVATYNIMIN 226

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GLCKLGKFDESMEIWNRMKKNQRS+DLFT+SSMIHGLSKAGN  AAE+++QEM+D GLSP
Sbjct: 227 GLCKLGKFDESMEIWNRMKKNQRSLDLFTYSSMIHGLSKAGNFHAAERVFQEMVDVGLSP 286

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYNTML  LF+A KL KCFELWE M KNNCCNIVSYNI IQGLFDNKKVEEAICNWQ
Sbjct: 287 DVTTYNTMLSALFQAGKLMKCFELWELMSKNNCCNIVSYNIFIQGLFDNKKVEEAICNWQ 346

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           LLH+RG  ADSTTYG+LIHGLCKNGYLN+ALRILKEAENE ADLD FAYSSMI+GLC + 
Sbjct: 347 LLHERGFTADSTTYGLLIHGLCKNGYLNKALRILKEAENEGADLDIFAYSSMINGLCKEA 406

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RL+QA EL+HQMN HK+KL+S++FNSLING +RASKLEEA FLLREM  + C+PTVVSYN
Sbjct: 407 RLDQAVELVHQMNTHKHKLNSYVFNSLINGYVRASKLEEATFLLREMSKKGCSPTVVSYN 466

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
           T+INGLCKAERFSDAYL LKEMLEKGLKPDMITYSLLIDGLCRG+K D+ALNLW  CINK
Sbjct: 467 TLINGLCKAERFSDAYLFLKEMLEKGLKPDMITYSLLIDGLCRGDKLDMALNLWDQCINK 526

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           GLKPDVT+HNIIIHGLC AR VDVAL+ FT+MAQVNC+PDLVTHNTIMEG+YK GDC EA
Sbjct: 527 GLKPDVTIHNIIIHGLCRARNVDVALKFFTEMAQVNCVPDLVTHNTIMEGLYKVGDCVEA 586

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 554
           LKIWDRILE+GL+PDI+SYNITFKGLCSCAR+SDAIGFLYDAL HG+LPTATTWNILVRA
Sbjct: 587 LKIWDRILEEGLQPDIMSYNITFKGLCSCARVSDAIGFLYDALKHGVLPTATTWNILVRA 646

BLAST of Cp4.1LG10g08110 vs. ExPASy TrEMBL
Match: A0A6J1HSI1 (pentatricopeptide repeat-containing protein At3g09060 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466212 PE=4 SV=1)

HSP 1 Score: 909 bits (2349), Expect = 0.0
Identity = 444/558 (79.57%), Positives = 485/558 (86.92%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYNILIKISCKKKQFEKAKRLLNW+SEKGL+PNV SYGTLINALAK G     
Sbjct: 144 GMSPNLQTYNILIKISCKKKQFEKAKRLLNWISEKGLSPNVFSYGTLINALAKSGNLSDA 203

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +     +RKGDF K +E+WERLLRESSVYPSVATYNIMIN
Sbjct: 204 LNLFDEMSERGVNPDVLCYNILIDGFFRKGDFVKASEVWERLLRESSVYPSVATYNIMIN 263

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GLCKLGKFDESMEIWNRMKKNQRS+DLFT+SSMIHGLSKAGN  AAE+++QEM+D GLSP
Sbjct: 264 GLCKLGKFDESMEIWNRMKKNQRSLDLFTYSSMIHGLSKAGNFHAAERVFQEMVDVGLSP 323

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYNTML  LF+A KL KCFELWE M KNNCCNIVSYNI IQGLFDNKKVEEAICNWQ
Sbjct: 324 DVTTYNTMLSALFQAGKLMKCFELWELMSKNNCCNIVSYNIFIQGLFDNKKVEEAICNWQ 383

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           LLH+RG  ADSTTYG+LIHGLCKNGYLN+ALRILKEAENE ADLD FAYSSMI+GLC + 
Sbjct: 384 LLHERGFTADSTTYGLLIHGLCKNGYLNKALRILKEAENEGADLDIFAYSSMINGLCKEA 443

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RL+QA EL+HQMN HK+KL+S++FNSLING +RASKLEEA FLLREM  + C+PTVVSYN
Sbjct: 444 RLDQAVELVHQMNTHKHKLNSYVFNSLINGYVRASKLEEATFLLREMSKKGCSPTVVSYN 503

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
           T+INGLCKAERFSDAYL LKEMLEKGLKPDMITYSLLIDGLCRG+K D+ALNLW  CINK
Sbjct: 504 TLINGLCKAERFSDAYLFLKEMLEKGLKPDMITYSLLIDGLCRGDKLDMALNLWDQCINK 563

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           GLKPDVT+HNIIIHGLC AR VDVAL+ FT+MAQVNC+PDLVTHNTIMEG+YK GDC EA
Sbjct: 564 GLKPDVTIHNIIIHGLCRARNVDVALKFFTEMAQVNCVPDLVTHNTIMEGLYKVGDCVEA 623

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 554
           LKIWDRILE+GL+PDI+SYNITFKGLCSCAR+SDAIGFLYDAL HG+LPTATTWNILVRA
Sbjct: 624 LKIWDRILEEGLQPDIMSYNITFKGLCSCARVSDAIGFLYDALKHGVLPTATTWNILVRA 683

BLAST of Cp4.1LG10g08110 vs. ExPASy TrEMBL
Match: A0A6J1HU67 (pentatricopeptide repeat-containing protein At3g09060 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111466212 PE=4 SV=1)

HSP 1 Score: 909 bits (2349), Expect = 0.0
Identity = 444/558 (79.57%), Positives = 485/558 (86.92%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYNILIKISCKKKQFEKAKRLLNW+SEKGL+PNV SYGTLINALAK G     
Sbjct: 107 GMSPNLQTYNILIKISCKKKQFEKAKRLLNWISEKGLSPNVFSYGTLINALAKSGNLSDA 166

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +     +RKGDF K +E+WERLLRESSVYPSVATYNIMIN
Sbjct: 167 LNLFDEMSERGVNPDVLCYNILIDGFFRKGDFVKASEVWERLLRESSVYPSVATYNIMIN 226

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GLCKLGKFDESMEIWNRMKKNQRS+DLFT+SSMIHGLSKAGN  AAE+++QEM+D GLSP
Sbjct: 227 GLCKLGKFDESMEIWNRMKKNQRSLDLFTYSSMIHGLSKAGNFHAAERVFQEMVDVGLSP 286

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYNTML  LF+A KL KCFELWE M KNNCCNIVSYNI IQGLFDNKKVEEAICNWQ
Sbjct: 287 DVTTYNTMLSALFQAGKLMKCFELWELMSKNNCCNIVSYNIFIQGLFDNKKVEEAICNWQ 346

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           LLH+RG  ADSTTYG+LIHGLCKNGYLN+ALRILKEAENE ADLD FAYSSMI+GLC + 
Sbjct: 347 LLHERGFTADSTTYGLLIHGLCKNGYLNKALRILKEAENEGADLDIFAYSSMINGLCKEA 406

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RL+QA EL+HQMN HK+KL+S++FNSLING +RASKLEEA FLLREM  + C+PTVVSYN
Sbjct: 407 RLDQAVELVHQMNTHKHKLNSYVFNSLINGYVRASKLEEATFLLREMSKKGCSPTVVSYN 466

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
           T+INGLCKAERFSDAYL LKEMLEKGLKPDMITYSLLIDGLCRG+K D+ALNLW  CINK
Sbjct: 467 TLINGLCKAERFSDAYLFLKEMLEKGLKPDMITYSLLIDGLCRGDKLDMALNLWDQCINK 526

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           GLKPDVT+HNIIIHGLC AR VDVAL+ FT+MAQVNC+PDLVTHNTIMEG+YK GDC EA
Sbjct: 527 GLKPDVTIHNIIIHGLCRARNVDVALKFFTEMAQVNCVPDLVTHNTIMEGLYKVGDCVEA 586

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 554
           LKIWDRILE+GL+PDI+SYNITFKGLCSCAR+SDAIGFLYDAL HG+LPTATTWNILVRA
Sbjct: 587 LKIWDRILEEGLQPDIMSYNITFKGLCSCARVSDAIGFLYDALKHGVLPTATTWNILVRA 646
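
The percentages in each HSP summary line are the match counts divided by the alignment length. The following is a minimal Python sketch that recomputes them from one such line; the helper name `hsp_percentages` and the regular expression are illustrative assumptions and are not part of the database or of any BLAST tool.

```python
import re

# Hypothetical pattern for the "Identity = a/b (p%)" style fields shown above.
# "Pos\w*" accepts any spelling of the positives label.
HSP_FIELD = re.compile(r"(Identity|Pos\w*)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def hsp_percentages(line):
    """Return {field: (matches, aligned_length, recomputed_pct, reported_pct)}."""
    out = {}
    for name, hits, length, reported in HSP_FIELD.findall(line):
        key = "identity" if name == "Identity" else "positives"
        # Percentage = matching columns / alignment length, rounded to 2 places.
        pct = round(100 * int(hits) / int(length), 2)
        out[key] = (int(hits), int(length), pct, float(reported))
    return out

line = "Identity = 444/558 (79.57%), Positives = 485/558 (86.92%), Query Frame = 0"
for field, (hits, length, pct, reported) in hsp_percentages(line).items():
    print(f"{field}: {hits}/{length} = {pct}% (reported {reported}%)")
```

For the line used here, the recomputed values agree with the reported 79.57% and 86.92%.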

BLAST of Cp4.1LG10g08110 vs. ExPASy TrEMBL
Match: A0A6J1EV00 (pentatricopeptide repeat-containing protein At3g09060-like OS=Cucurbita moschata OX=3662 GN=LOC111438211 PE=4 SV=1)

HSP 1 Score: 905 bits (2339), Expect = 0.0
Identity = 441/558 (79.03%), Positives = 484/558 (86.74%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYNILIKISCKKKQFEKAKRLLNW+SEKGL+PNV SYGTLINALAK G     
Sbjct: 144 GMSPNLQTYNILIKISCKKKQFEKAKRLLNWISEKGLSPNVFSYGTLINALAKSGNLSDA 203

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +     +RKGDF K +E+WERL RE SVYPSVATYNIMIN
Sbjct: 204 LNLFDEMSERGVNPDVMCYNILIDGFFRKGDFVKASEVWERLRREPSVYPSVATYNIMIN 263

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GLCKLGKFDESMEIWNRMKKN+RS+DLFT+ SMIHGLSKAGN DAAE+++QEM+D GLSP
Sbjct: 264 GLCKLGKFDESMEIWNRMKKNKRSLDLFTYCSMIHGLSKAGNFDAAERVFQEMVDVGLSP 323

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYNTML  LF+A KLSKCFELWE M KNNCCNIVSYNI IQGLFDNKKVEEAICNWQ
Sbjct: 324 DVTTYNTMLSALFQAGKLSKCFELWELMSKNNCCNIVSYNIFIQGLFDNKKVEEAICNWQ 383

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           LLH+RG  ADSTTYG+LIHGLCKNGYLN+ALRILKEAENE ADLD FAYSSMI GLC + 
Sbjct: 384 LLHERGFTADSTTYGLLIHGLCKNGYLNKALRILKEAENEGADLDIFAYSSMIDGLCKEA 443

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RL+QA EL+HQMN HK+KL+S++FNSLING +RASKLEEA FLLREM  + C+PTVVSYN
Sbjct: 444 RLDQAVELVHQMNAHKHKLNSYVFNSLINGYVRASKLEEATFLLREMSKKGCSPTVVSYN 503

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
           T+INGLCKAERFSDAYL LKEMLEKGLKPDMITYSLLIDGLCRG+K D+ALNLWH CI+K
Sbjct: 504 TLINGLCKAERFSDAYLFLKEMLEKGLKPDMITYSLLIDGLCRGDKLDMALNLWHQCIDK 563

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           GLKPDVT+HNIIIHGLCTARKVDVAL+ FT+MAQVNC+PDLVTHNTIMEG+YK GDC EA
Sbjct: 564 GLKPDVTIHNIIIHGLCTARKVDVALKFFTEMAQVNCVPDLVTHNTIMEGLYKVGDCVEA 623

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 554
           LKIWD ILE+GL+PDI+SYNITFKGLCSCAR+SDAIGFLYDAL HG+LPTA TW+ILVRA
Sbjct: 624 LKIWDLILEEGLQPDILSYNITFKGLCSCARVSDAIGFLYDALKHGVLPTAPTWDILVRA 683

BLAST of Cp4.1LG10g08110 vs. ExPASy TrEMBL
Match: A0A6J1DF04 (pentatricopeptide repeat-containing protein At3g09060 OS=Momordica charantia OX=3673 GN=LOC111019464 PE=4 SV=1)

HSP 1 Score: 897 bits (2319), Expect = 0.0
Identity = 444/557 (79.71%), Positives = 485/557 (87.07%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYNILIKISCKKKQFEKAK+LLNWMSEKGLNP+V SYGTLINALAK G     
Sbjct: 144 GMSPNLQTYNILIKISCKKKQFEKAKKLLNWMSEKGLNPDVFSYGTLINALAKSGNLSDA 203

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +     +RKGDF K  E WERLLRESSVYPSVATYNIMIN
Sbjct: 204 VEVFDQMSERRVDPDVMCYNILIDGFFRKGDFVKANEFWERLLRESSVYPSVATYNIMIN 263

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GLCKLGKF+ESMEIWNRMK+N+RS+DLFTFSSMIHGL KA N DAAE+I+QEM+DSGLS 
Sbjct: 264 GLCKLGKFNESMEIWNRMKENKRSLDLFTFSSMIHGLIKAENFDAAERIFQEMVDSGLSA 323

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYNTML GLFRA KL KCFELWE M KNN CNIVSYNILIQGLFDNKKVEEAIC WQ
Sbjct: 324 DVTTYNTMLNGLFRARKLCKCFELWEVMVKNNFCNIVSYNILIQGLFDNKKVEEAICYWQ 383

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           LL +RGL+ADSTTYGVLIHGLCKNGYL++ALRILKEAENE ADLDT++YSSMI GLC KG
Sbjct: 384 LLRERGLKADSTTYGVLIHGLCKNGYLSKALRILKEAENEGADLDTYSYSSMIDGLCKKG 443

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RL++A EL +QMN+H++KL+SH++NSLING +RASKLEEAIFLLREM  ++CAPTVVSYN
Sbjct: 444 RLDEALELSNQMNQHEHKLNSHVYNSLINGFVRASKLEEAIFLLREMSKKNCAPTVVSYN 503

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
           T+INGLCK ERFSDAYL LKEMLE+GLKPDMITYSLLI GLCRGEK DVALNLWH CI+K
Sbjct: 504 TLINGLCKVERFSDAYLFLKEMLEEGLKPDMITYSLLIGGLCRGEKLDVALNLWHQCIDK 563

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           G KPDVT+HNIIIHGLCTARKVDVAL+IFTQMAQVNC+PDLVTHNTIMEG++KAGDC EA
Sbjct: 564 GFKPDVTIHNIIIHGLCTARKVDVALQIFTQMAQVNCVPDLVTHNTIMEGLHKAGDCAEA 623

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 553
           LKIW+RILE+GL PDIISYNITFKGLCSCAR+SDAIGFLYDAL+HGILPTATTWNILVRA
Sbjct: 624 LKIWNRILEEGLHPDIISYNITFKGLCSCARVSDAIGFLYDALNHGILPTATTWNILVRA 683

BLAST of Cp4.1LG10g08110 vs. ExPASy TrEMBL
Match: A0A5D3BDH7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold78209G00600 PE=4 SV=1)

HSP 1 Score: 879 bits (2271), Expect = 5.31e-316
Identity = 437/558 (78.32%), Positives = 478/558 (85.66%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYNILIKISCKK+QFEKAK LL WM E GL+P+VLSYGTLINALAK G     
Sbjct: 144 GMSPNLQTYNILIKISCKKRQFEKAKGLLTWMFENGLDPDVLSYGTLINALAKSGNILDA 203

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +     +RKGDF K  EIW+RLLRESSVYPSV TYNIMIN
Sbjct: 204 VELFDEMSERGVNPDVMCYNILIDGFFRKGDFLKANEIWKRLLRESSVYPSVETYNIMIN 263

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GLCKLGKFDESME+WNRMKKN+RS+DLFTFSSMIHGL+KAGN DA+EK++QEMI+SGLSP
Sbjct: 264 GLCKLGKFDESMEMWNRMKKNERSLDLFTFSSMIHGLNKAGNFDASEKVFQEMIESGLSP 323

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYN ML GLFRA KLSKCFELW+ M KNNCCNIVSYNILIQGL DNKKVE+AIC WQ
Sbjct: 324 DVRTYNAMLSGLFRAGKLSKCFELWDVMSKNNCCNIVSYNILIQGLLDNKKVEQAICYWQ 383

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
            LH+RGL+ADSTTYG+LI+GLCKNGYLN+ALRIL+EAENE ADLDT+AYSSMIHGLC KG
Sbjct: 384 FLHERGLKADSTTYGLLINGLCKNGYLNKALRILEEAENEGADLDTYAYSSMIHGLCKKG 443

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RLEQA ELIHQMNK+K KL+SH+FNSLING +RA KLEEAI +LREM N+DCAPTVVSYN
Sbjct: 444 RLEQAVELIHQMNKNKRKLNSHVFNSLINGYVRAFKLEEAISVLREMKNKDCAPTVVSYN 503

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
           TIINGLCKAERFSDA L L+EMLE+GLKPD+ITYSLLIDGLCRGEK D+ALNLW+ CINK
Sbjct: 504 TIINGLCKAERFSDANLSLQEMLEEGLKPDIITYSLLIDGLCRGEKVDMALNLWNQCINK 563

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
            LKPDV MHNIIIHGLCTA+KVDVALEIFT+M QVNC+PDLVTHNTIMEG+YKAGDC EA
Sbjct: 564 RLKPDVKMHNIIIHGLCTAQKVDVALEIFTRMGQVNCVPDLVTHNTIMEGLYKAGDCAEA 623

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 554
           LKIWD ILE GL+PDIISYNITFKGLCSCAR+SDAI FLYDAL  GILP A TWNILVRA
Sbjct: 624 LKIWDSILEAGLQPDIISYNITFKGLCSCARVSDAIEFLYDALDRGILPNAPTWNILVRA 683

BLAST of Cp4.1LG10g08110 vs. TAIR 10
Match: AT3G09060.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 570.1 bits (1468), Expect = 2.0e-162
Identity = 278/542 (51.29%), Positives = 364/542 (67.16%), Query Frame = 0

Query: 23  GHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGK---- 82
           G   NLQTYN+LIK+SCKKK+FEKA+  L+WM ++G  P+V SY T+IN LAK GK    
Sbjct: 144 GVAPNLQTYNVLIKMSCKKKEFEKARGFLDWMWKEGFKPDVFSYSTVINDLAKAGKLDDA 203

Query: 83  ------------------YRMP----WRKGDFEKTTEIWERLLRESSVYPSVATYNIMIN 142
                             Y +      ++ D +   E+W+RLL +SSVYP+V T+NIMI+
Sbjct: 204 LELFDEMSERGVAPDVTCYNILIDGFLKEKDHKTAMELWDRLLEDSSVYPNVKTHNIMIS 263

Query: 143 GLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSP 202
           GL K G+ D+ ++IW RMK+N+R  DL+T+SS+IHGL  AGN D AE ++ E+ +   S 
Sbjct: 264 GLSKCGRVDDCLKIWERMKQNEREKDLYTYSSLIHGLCDAGNVDKAESVFNELDERKASI 323

Query: 203 DVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNCCNIVSYNILIQGLFDNKKVEEAICNWQ 262
           DV TYNTML G  R  K+ +  ELW  M   N  NIVSYNILI+GL +N K++EA   W+
Sbjct: 324 DVVTYNTMLGGFCRCGKIKESLELWRIMEHKNSVNIVSYNILIKGLLENGKIDEATMIWR 383

Query: 263 LLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKG 322
           L+  +G  AD TTYG+ IHGLC NGY+N+AL +++E E+    LD +AY+S+I  LC K 
Sbjct: 384 LMPAKGYAADKTTYGIFIHGLCVNGYVNKALGVMQEVESSGGHLDVYAYASIIDCLCKKK 443

Query: 323 RLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYN 382
           RLE+A+ L+ +M+KH  +L+SH+ N+LI G IR S+L EA F LREMG   C PTVVSYN
Sbjct: 444 RLEEASNLVKEMSKHGVELNSHVCNALIGGLIRDSRLGEASFFLREMGKNGCRPTVVSYN 503

Query: 383 TIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINK 442
            +I GLCKA +F +A   +KEMLE G KPD+ TYS+L+ GLCR  K D+AL LWH  +  
Sbjct: 504 ILICGLCKAGKFGEASAFVKEMLENGWKPDLKTYSILLCGLCRDRKIDLALELWHQFLQS 563

Query: 443 GLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYKAGDCQEA 502
           GL+ DV MHNI+IHGLC+  K+D A+ +   M   NC  +LVT+NT+MEG +K GD   A
Sbjct: 564 GLETDVMMHNILIHGLCSVGKLDDAMTVMANMEHRNCTANLVTYNTLMEGFFKVGDSNRA 623

Query: 503 LKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATTWNILVRA 539
             IW  + + GL+PDIISYN   KGLC C  +S A+ F  DA +HGI PT  TWNILVRA
Sbjct: 624 TVIWGYMYKMGLQPDIISYNTIMKGLCMCRGVSYAMEFFDDARNHGIFPTVYTWNILVRA 683

BLAST of Cp4.1LG10g08110 vs. TAIR 10
Match: AT3G53700.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 278.1 bits (710), Expect = 1.5e-74
Identity = 158/529 (29.87%), Positives = 281/529 (53.12%), Query Frame = 0

Query: 16  HGSIHYGGHVANLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAK 75
           H  +   G   ++ T+N+LIK  C+  Q   A  +L  M   GL P+  ++ T++    +
Sbjct: 177 HAKMSVWGIKPDVSTFNVLIKALCRAHQLRPAILMLEDMPSYGLVPDEKTFTTVMQGYIE 236

Query: 76  DGKYRMPWRKGDFEKTTEIWERLLRESSVYPSVATYNIMINGLCKLGKFDESMEIWNRMK 135
           +         GD +    I E+++     + +V+  N++++G CK G+ ++++     M 
Sbjct: 237 E---------GDLDGALRIREQMVEFGCSWSNVSV-NVIVHGFCKEGRVEDALNFIQEMS 296

Query: 136 KNQRSV-DLFTFSSMIHGLSKAGNSDAAEKIYQEMIDSGLSPDVPTYNTMLRGLFRAAKL 195
                  D +TF+++++GL KAG+   A +I   M+  G  PDV TYN+++ GL +  ++
Sbjct: 297 NQDGFFPDQYTFNTLVNGLCKAGHVKHAIEIMDVMLQEGYDPDVYTYNSVISGLCKLGEV 356

Query: 196 SKCFELWEEMGKNNCC-NIVSYNILIQGLFDNKKVEEAICNWQLLHKRGLRADSTTYGVL 255
            +  E+ ++M   +C  N V+YN LI  L    +VEEA    ++L  +G+  D  T+  L
Sbjct: 357 KEAVEVLDQMITRDCSPNTVTYNTLISTLCKENQVEEATELARVLTSKGILPDVCTFNSL 416

Query: 256 IHGLCKNGYLNEALRILKEAENEDADLDTFAYSSMIHGLCIKGRLEQAAELIHQMNKHKY 315
           I GLC       A+ + +E  ++  + D F Y+ +I  LC KG+L++A  ++ QM     
Sbjct: 417 IQGLCLTRNHRVAMELFEEMRSKGCEPDEFTYNMLIDSLCSKGKLDEALNMLKQMELSGC 476

Query: 316 KLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAPTVVSYNTIINGLCKAERFSDAYL 375
             S   +N+LI+G  +A+K  EA  +  EM     +   V+YNT+I+GLCK+ R  DA  
Sbjct: 477 ARSVITYNTLIDGFCKANKTREAEEIFDEMEVHGVSRNSVTYNTLIDGLCKSRRVEDAAQ 536

Query: 376 LLKEMLEKGLKPDMITYSLLIDGLCRGEKFDVALNLWHHCINKGLKPDVTMHNIIIHGLC 435
           L+ +M+ +G KPD  TY+ L+   CRG     A ++     + G +PD+  +  +I GLC
Sbjct: 537 LMDQMIMEGQKPDKYTYNSLLTHFCRGGDIKKAADIVQAMTSNGCEPDIVTYGTLISGLC 596

Query: 436 TARKVDVALEIF--TQMAQVNCIPDLVTHNTIMEGVYKAGDCQEALKIWDRILEKG-LKP 495
            A +V+VA ++    QM  +N  P    +N +++G+++     EA+ ++  +LE+    P
Sbjct: 597 KAGRVEVASKLLRSIQMKGINLTPH--AYNPVIQGLFRKRKTTEAINLFREMLEQNEAPP 656

Query: 496 DIISYNITFKGLCS-CARISDAIGFLYDALHHGILPTATTWNILVRAVV 539
           D +SY I F+GLC+    I +A+ FL + L  G +P  ++  +L   ++
Sbjct: 657 DAVSYRIVFRGLCNGGGPIREAVDFLVELLEKGFVPEFSSLYMLAEGLL 693

BLAST of Cp4.1LG10g08110 vs. TAIR 10
Match: AT3G22470.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 259.6 bits (662), Expect = 5.7e-69
Identity = 144/522 (27.59%), Positives = 263/522 (50.38%), Query Frame = 0

Query: 31  YNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGKYRMPWRKGDFEK 90
           +N L     + KQ++        M   G+  ++ +   +IN   +  K         F  
Sbjct: 73  FNRLCSAVARTKQYDLVLGFCKGMELNGIEHDMYTMTIMINCYCRKKKLLFA-----FSV 132

Query: 91  TTEIWERLLRESSVYPSVATYNIMINGLCKLGKFDESMEIWNRMKKNQRSVDLFTFSSMI 150
               W     +    P   T++ ++NG C  G+  E++ + +RM + ++  DL T S++I
Sbjct: 133 LGRAW-----KLGYEPDTITFSTLVNGFCLEGRVSEAVALVDRMVEMKQRPDLVTVSTLI 192

Query: 151 HGLSKAGNSDAAEKIYQEMIDSGLSPDVPTYNTMLRGLFRAAKLSKCFELWEEMGKNNC- 210
           +GL   G    A  +   M++ G  PD  TY  +L  L ++   +   +L+ +M + N  
Sbjct: 193 NGLCLKGRVSEALVLIDRMVEYGFQPDEVTYGPVLNRLCKSGNSALALDLFRKMEERNIK 252

Query: 211 CNIVSYNILIQGLFDNKKVEEAICNWQLLHKRGLRADSTTYGVLIHGLCKNGYLNEALRI 270
            ++V Y+I+I  L  +   ++A+  +  +  +G++AD  TY  LI GLC +G  ++  ++
Sbjct: 253 ASVVQYSIVIDSLCKDGSFDDALSLFNEMEMKGIKADVVTYSSLIGGLCNDGKWDDGAKM 312

Query: 271 LKEAENEDADLDTFAYSSMIHGLCIKGRLEQAAELIHQMNKHKYKLSSHIFNSLINGCIR 330
           L+E    +   D   +S++I     +G+L +A EL ++M        +  +NSLI+G  +
Sbjct: 313 LREMIGRNIIPDVVTFSALIDVFVKEGKLLEAKELYNEMITRGIAPDTITYNSLIDGFCK 372

Query: 331 ASKLEEAIFLLREMGNQDCAPTVVSYNTIINGLCKAERFSDAYLLLKEMLEKGLKPDMIT 390
            + L EA  +   M ++ C P +V+Y+ +IN  CKA+R  D   L +E+  KGL P+ IT
Sbjct: 373 ENCLHEANQMFDLMVSKGCEPDIVTYSILINSYCKAKRVDDGMRLFREISSKGLIPNTIT 432

Query: 391 YSLLIDGLCRGEKFDVALNLWHHCINKGLKPDVTMHNIIIHGLCTARKVDVALEIFTQMA 450
           Y+ L+ G C+  K + A  L+   +++G+ P V  + I++ GLC   +++ ALEIF +M 
Sbjct: 433 YNTLVLGFCQSGKLNAAKELFQEMVSRGVPPSVVTYGILLDGLCDNGELNKALEIFEKMQ 492

Query: 451 QVNCIPDLVTHNTIMEGVYKAGDCQEALKIWDRILEKGLKPDIISYNITFKGLCSCARIS 510
           +      +  +N I+ G+  A    +A  ++  + +KG+KPD+++YN+   GLC    +S
Sbjct: 493 KSRMTLGIGIYNIIIHGMCNASKVDDAWSLFCSLSDKGVKPDVVTYNVMIGGLCKKGSLS 552

Query: 511 DAIGFLYDALHHGILPTATTWNILVRAVVGDRSLMEYALIFE 552
           +A          G  P   T+NIL+RA +G   L+    + E
Sbjct: 553 EADMLFRKMKEDGCTPDDFTYNILIRAHLGGSGLISSVELIE 584

BLAST of Cp4.1LG10g08110 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 258.8 bits (660), Expect = 9.7e-69
Identity = 130/431 (30.16%), Positives = 232/431 (53.83%), Query Frame = 0

Query: 106 PSVATYNIMINGLCKLGK-FDESMEIWNRMKKNQRSVDLFTFSSMIHGLSKAGNSDAAEK 165
           P V +YN +++   +  +    +  ++  M ++Q S ++FT++ +I G   AGN D A  
Sbjct: 167 PGVLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALT 226

Query: 166 IYQEMIDSGLSPDVPTYNTMLRGLFRAAKLSKCFELWEEMG-KNNCCNIVSYNILIQGLF 225
           ++ +M   G  P+V TYNT++ G  +  K+   F+L   M  K    N++SYN++I GL 
Sbjct: 227 LFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLC 286

Query: 226 DNKKVEEAICNWQLLHKRGLRADSTTYGVLIHGLCKNGYLNEALRILKEAENEDADLDTF 285
              +++E       +++RG   D  TY  LI G CK G  ++AL +  E           
Sbjct: 287 REGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVI 346

Query: 286 AYSSMIHGLCIKGRLEQAAELIHQMNKHKYKLSSHIFNSLINGCIRASKLEEAIFLLREM 345
            Y+S+IH +C  G + +A E + QM       +   + +L++G  +   + EA  +LREM
Sbjct: 347 TYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREM 406

Query: 346 GNQDCAPTVVSYNTIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGEKF 405
            +   +P+VV+YN +ING C   +  DA  +L++M EKGL PD+++YS ++ G CR    
Sbjct: 407 NDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDV 466

Query: 406 DVALNLWHHCINKGLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTI 465
           D AL +    + KG+KPD   ++ +I G C  R+   A +++ +M +V   PD  T+  +
Sbjct: 467 DEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTAL 526

Query: 466 MEGVYKAGDCQEALKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGI 525
           +      GD ++AL++ + ++EKG+ PD+++Y++   GL   +R  +A   L    +   
Sbjct: 527 INAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEES 586

Query: 526 LPTATTWNILV 535
           +P+  T++ L+
Sbjct: 587 VPSDVTYHTLI 597

BLAST of Cp4.1LG10g08110 vs. TAIR 10
Match: AT5G59900.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 257.7 bits (657), Expect = 2.2e-68
Identity = 152/540 (28.15%), Positives = 263/540 (48.70%), Query Frame = 0

Query: 27  NLQTYNILIKISCKKKQFEKAKRLLNWMSEKGLNPNVLSYGTLINALAKDGKYRMPWRKG 86
           NL  YN LI   CK ++F +A+ L + M + GL PN ++Y  LI+   + GK        
Sbjct: 366 NLFVYNALIDSLCKGRKFHEAELLFDRMGKIGLRPNDVTYSILIDMFCRRGKL------- 425

Query: 87  DFEKTTEIWERLLRESSVYPSVATYNIMINGLCKLGKFDESMEIWNRMKKNQRSVDLFTF 146
               T   +   + ++ +  SV  YN +ING CK G    +      M   +    + T+
Sbjct: 426 ---DTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGDISAAEGFMAEMINKKLEPTVVTY 485

Query: 147 SSMIHGLSKAGNSDAAEKIYQEMIDSGLSPDVPTYNTMLRGLFRAAKLSKCFELWEEMGK 206
           +S++ G    G  + A ++Y EM   G++P + T+ T+L GLFRA  +    +L+ EM +
Sbjct: 486 TSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYTFTTLLSGLFRAGLIRDAVKLFNEMAE 545

Query: 207 NNC-CNIVSYNILIQGLFDNKKVEEAICNWQLLHKRGLRADSTTYGVLIHGLCKNGYLNE 266
            N   N V+YN++I+G  +   + +A    + + ++G+  D+ +Y  LIHGLC  G  +E
Sbjct: 546 WNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMTEKGIVPDTYSYRPLIHGLCLTGQASE 605

Query: 267 ALRILKEAENEDADLDTFAYSSMIHGLCIKGRLEQAAELIHQM----------------- 326
           A   +      + +L+   Y+ ++HG C +G+LE+A  +  +M                 
Sbjct: 606 AKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALSVCQEMVQRGVDLDLVCYGVLID 665

Query: 327 --NKHK----------------YKLSSHIFNSLINGCIRASKLEEAIFLLREMGNQDCAP 386
              KHK                 K    I+ S+I+   +    +EA  +   M N+ C P
Sbjct: 666 GSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAKSKTGDFKEAFGIWDLMINEGCVP 725

Query: 387 TVVSYNTIINGLCKAERFSDAYLLLKEMLEKGLKPDMITYSLLIDGLCRGE-KFDVALNL 446
             V+Y  +INGLCKA   ++A +L  +M      P+ +TY   +D L +GE     A+ L
Sbjct: 726 NEVTYTAVINGLCKAGFVNEAEVLCSKMQPVSSVPNQVTYGCFLDILTKGEVDMQKAVEL 785

Query: 447 WHHCINKGLKPDVTMHNIIIHGLCTARKVDVALEIFTQMAQVNCIPDLVTHNTIMEGVYK 506
            H+ I KGL  +   +N++I G C   +++ A E+ T+M      PD +T+ T++  + +
Sbjct: 786 -HNAILKGLLANTATYNMLIRGFCRQGRIEEASELITRMIGDGVSPDCITYTTMINELCR 845

Query: 507 AGDCQEALKIWDRILEKGLKPDIISYNITFKGLCSCARISDAIGFLYDALHHGILPTATT 530
             D ++A+++W+ + EKG++PD ++YN    G C    +  A     + L  G++P   T
Sbjct: 846 RNDVKKAIELWNSMTEKGIRPDRVAYNTLIHGCCVAGEMGKATELRNEMLRQGLIPNNKT 894

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9SS81 | 2.8e-161 | 51.29 | Pentatricopeptide repeat-containing protein At3g09060 OS=Arabidopsis thaliana OX...
Q9LFF1 | 2.2e-73 | 29.87 | Pentatricopeptide repeat-containing protein At3g53700, chloroplastic OS=Arabidop...
Q76C99 | 2.2e-70 | 29.17 | Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV...
Q6NQ83 | 8.0e-68 | 27.59 | Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop...
Q9FIX3 | 1.4e-67 | 30.16 | Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Match Name | E-value | Identity (%) | Description
KAG6595403.1 | 0.0 | 79.57 | Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_023518584.1 | 0.0 | 79.57 | pentatricopeptide repeat-containing protein At3g09060-like isoform X1 [Cucurbita...
XP_038882547.1 | 0.0 | 80.65 | pentatricopeptide repeat-containing protein At3g09060 [Benincasa hispida]
XP_022966568.1 | 0.0 | 79.57 | pentatricopeptide repeat-containing protein At3g09060 isoform X1 [Cucurbita maxi...
XP_022966569.1 | 0.0 | 79.57 | pentatricopeptide repeat-containing protein At3g09060 isoform X2 [Cucurbita maxi...
Match Name | E-value | Identity (%) | Description
A0A6J1HSI1 | 0.0 | 79.57 | pentatricopeptide repeat-containing protein At3g09060 isoform X1 OS=Cucurbita ma...
A0A6J1HU67 | 0.0 | 79.57 | pentatricopeptide repeat-containing protein At3g09060 isoform X2 OS=Cucurbita ma...
A0A6J1EV00 | 0.0 | 79.03 | pentatricopeptide repeat-containing protein At3g09060-like OS=Cucurbita moschata...
A0A6J1DF04 | 0.0 | 79.71 | pentatricopeptide repeat-containing protein At3g09060 OS=Momordica charantia OX=...
A0A5D3BDH7 | 5.31e-316 | 78.32 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
Match Name | E-value | Identity (%) | Description
AT3G09060.1 | 2.0e-162 | 51.29 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G53700.1 | 1.5e-74 | 29.87 | Pentatricopeptide repeat (PPR) superfamily protein
AT3G22470.1 | 5.7e-69 | 27.59 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1 | 9.7e-69 | 30.16 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G59900.1 | 2.2e-68 | 28.15 | Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2
    coord: 211..259, e-value: 9.0E-13, score: 48.2
    coord: 106..155, e-value: 4.4E-15, score: 55.6
    coord: 350..399, e-value: 9.0E-19, score: 67.4
    coord: 455..504, e-value: 4.8E-12, score: 45.9
    coord: 176..209, e-value: 2.1E-7, score: 31.0
    coord: 27..75, e-value: 4.2E-14, score: 52.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756
    coord: 144..178, e-value: 4.1E-10, score: 37.2
    coord: 458..492, e-value: 3.7E-6, score: 24.8
    coord: 319..351, e-value: 1.9E-6, score: 25.7
    coord: 388..422, e-value: 2.3E-5, score: 22.3
    coord: 249..276, e-value: 4.0E-5, score: 21.5
    coord: 353..386, e-value: 7.5E-11, score: 39.5
    coord: 284..315, e-value: 3.6E-5, score: 21.7
    coord: 180..210, e-value: 7.9E-7, score: 26.9
    coord: 30..63, e-value: 3.4E-6, score: 24.9
    coord: 213..246, e-value: 1.8E-4, score: 19.4
    coord: 424..456, e-value: 5.1E-6, score: 24.3
    coord: 110..137, e-value: 1.6E-9, score: 35.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF12854 | PPR_1
    coord: 280..308, e-value: 6.3E-7, score: 29.0
    coord: 416..448, e-value: 1.7E-9, score: 37.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR
    coord: 319..345, e-value: 5.3E-5, score: 23.2
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 142..176, score: 13.307076
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 316..350, score: 11.005202
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 456..490, score: 11.640958
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 491..525, score: 8.933517
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 246..280, score: 10.500983
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 351..385, score: 13.361882
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 281..315, score: 11.125777
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 27..61, score: 11.947875
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 386..420, score: 11.290196
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 177..211, score: 10.522905
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 107..141, score: 12.210946
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR
    coord: 421..455, score: 10.687325
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain
    coord: 416..552, e-value: 5.0E-28, score: 100.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain
    coord: 303..415, e-value: 2.2E-32, score: 114.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain
    coord: 6..80, e-value: 2.6E-14, score: 55.1
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain
    coord: 192..299, e-value: 3.6E-23, score: 83.8
    coord: 81..191, e-value: 1.9E-32, score: 114.1
None | No IPR available | PANTHER | PTHR47938:SF6 | PENTATRICOPEPTIDE (PPR) REPEAT PROTEIN
    coord: 24..78
    coord: 84..538
None | No IPR available | PANTHER | PTHR47938 | RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATED
    coord: 24..78
    coord: 84..538
None | No IPR available | SUPERFAMILY | 81901 | HCP-like
    coord: 290..496
None | No IPR available | SUPERFAMILY | 81901 | HCP-like
    coord: 83..314
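
Each "coord: a..b" entry above gives the residue range of one domain hit on the protein. As a rough illustration, the Python sketch below extracts those ranges and computes their lengths. It assumes both endpoints are inclusive; the record does not state this, but it is consistent with the PROSITE PPR hits each spanning 35 residues. The function name `span_lengths` is hypothetical.

```python
import re

# Matches entries such as "coord: 142..176", including when fused to preceding text.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def span_lengths(text):
    """Return (start, end, length) per coord entry, assuming inclusive endpoints."""
    return [(int(a), int(b), int(b) - int(a) + 1) for a, b in COORD.findall(text)]

sample = """coord: 142..176
score: 13.307076
coord: 316..350
score: 11.005202"""

for start, end, length in span_lengths(sample):
    print(f"{start}..{end}: {length} residues")
```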

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG10g08110.1 | Cp4.1LG10g08110.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding