Cp4.1LG07g04100 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG07g04100
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG07: 2672349 .. 2677226 (+)
RNA-Seq Expression: Cp4.1LG07g04100
Synteny: Cp4.1LG07g04100
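
The location field above can be turned into a span length with a few lines of Python. This is a minimal sketch: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF3), which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG07: 2672349 .. 2677226 (+)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG07 + 4878
```

Under that assumption the gene feature spans 4878 bp on the forward strand of Cp4.1LG07.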
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGCCTAAACCGTTTACTCCTTCGTCAAACACTGAAGAAATTTTCGAGAATCAATGGAAACATACTGCATCGCCAATCGGATGTCAAAAGCAATGTCACTCGGACATTCATCGCAACACCTTCATTTTCTTTGCTTGATCCAAAATTCAGTCGCTGTTCTTCAATTCCTGTCGAAAATCCTGAGCTCTGCAAATCCAATTCAATTTTCAGTAGGTGCATTCACTTCACTGCAACTAAGTTGAGCGATACAGCAATTGAGCCGAAACTGGAGTCATCAGACATTGAGGATCATGATGGATCAATGAATGAGTTCTTATCCAGATTTGTCTGGATCATGCGCGGGAAGATATCTGAAGCATTTCCGGACTATGATAAGCAAACAGTTGATGCAATGCTTTTGATGATTGTTGAAAAAGTGGTATCTGAAATGGAGAAGGGTAGCTTTGAGCAATCGTTAAGGACTTCAACTGGTAATCCAGATTGGGACCTAAGTGAGGATTTGTGGAAGACAGTAAGCGAAGTTAGCAACATGGTTTTGGATGATATGAAGAAGGCTACAAAGAAGGAGAAAAATGAAGGGTTTTCTGCTATCTGAGGAAGTTCAGGAAATGTGTAGGTTTGCTGGGGAAGTTGGTATTCGAGGGGATATGCTAAGGGACTTCAGGTTCAAATGGGCTCGTGAGAAAATGGAGGAAAGCGAGTTTTATGAGAGTTTGGAGCAGCTCAAGAAAGAGGCTAATGCCTCTCCCAGTGGTGCAGAGGCTGCATCCGTTGAGAAATCCGAGCCTGTTTCTATTCCCAAGAGGCGAGTGTTGAGATATATTATTTATGGAATATATCCTAGATATTATANCAGATGAGTCACAGGCCTTAGCACATGCCGTTTGAAGCACAGGTATTAAGAGGATATTTTGCGGTGATAATTAGTTAGAATATATCTCACATATTTAGGGAGTTGTTAGTATAGATTTGATTAACCGTATATTTTATTTATTTACCAGATTTAGTTAGATATATATTTTTTCTATTTTTAGGTATTAGTTGATAGTTTGTATCCTATTTAAACGTGTTAAACATGAATGAAGATCATACTTTCGATCCCAATTCTATTTCTATTTCTCATTCTTAACAGCGAGGGAAGATCAAGTACAAGATCTATGGTCTTGATTTATCTGATCCCAAGTGGAGCGAAGTAGCAGACAAAATCCACGAGGCAGAGGACAAGAACCAAAGCCAATTTCTGGGAAATGCAAATTGATCATAGAGAGAATTCTTTCACTAAACGAGAACGATGACCCATCTCCATTAATGGCTGAATGGACAGAGCTTCTTCAACCTACTAGGATTGACTGGATTACCTTACTTGATAAATTGAATGATAAGAACAGATTCTTATACTTGAAGGTAAGAAATCTCTCTTATTCTTCCCCTTTTGCCTTGTGTTCTTGGATAATTCTTGTTCTCAGTTTCATATTTAAACAGGGTAGAAGAAGCTGCATTTATAATCTCTTGGTTCCTTTTGTTTTGAATGTCATATATAGAATATGCCATAGATTTTGAGTTCTGAAATAGCAACAAGCTTTAGATTTCTGGCGCTATGCATGAGTGATGTTAAGGTGTTTATCAATGATAATACTTTTTCTACACCACATTAGATAGAACGACATAATGTTTGCAATTTGCAAGACCCTGATTTGAATTTGTTGATTATGATGAATGAAGGTAGCAGAGCTTCTTTTGAATGAAGAATCTTTCGAGACCAACATCCGTGACTACTCTAAGCTTGTCGATGTCCATGCTAAAGAGAATCGTCTAGAGGATGCTGAGAGGGTTCTTAAGATGATGAATGAGAAAGGCATTACACCAGACATTTTAACAGCCACAGTTTTGGTTCATATGTATAGCAAGGTGGGAAACCTTGATCGTGCAAAGGAAGCGTTTGATACATTGAGGAGTCACGGCTTCCAACCGGATGAGAAGGTTTATAATTCCATGATA
ATGGCATTTGTGAACACTGGACAACCAAAGTTGGGCGAATCGATGATGAGAGAAATGGAAGCAAGGGACATTAAACCAAGCAAGGATATTTACATGGCATTGCTAAGGTCATTTTCGCAACGTGGTGATATCAGTGGCGCTGGAGGAATTGCTGCGACGATGCAATTTTCTGGCATCTCGCCAAGTTTGGAGTCATGTACATTACTTGTTGAGACATATGGGCTAGCTGGTGATCCTGATCAGATCAGGCAAGGAACAATTTTGACTACATGATAAAAATCGGGCACAGGCCTGATGACAGGTGCACTGCAAGTATGGTTGCAGCCTATGGAAAGAAGAACCTGTTGGACAAGGCTCTGAATCTTTTACTACAGCTCGAAAAGGATGGGTTTGAGCCAGGGGTTGAGACTTATGCTGCTCTTGTAGATTGGTTAGGTAAGTTGCAGCTGGTTGATGAAGCTGAGCAGCTATTAGGCAAGATCGGCGCACAGGGAGATGCCGTGCCTCTTAAGGTTCATATTAGCCTCTGTGATATGTACTCGAGAGCTGGGGTTGAGAAAAAGGCGCTACAAGCGCTCGGGGTATTGGAAGCTAAAAAGGATGAGTTGGGACATGGTGAATTCGAAAGGATCATAAATGGACTTATAGCTGGTGGCTTTGTGCAGGATGCTAAAAGAATGCAGGGCCTTATGGAGGCTAAGGGTTTTACTGCATCCCAACCACTTCAAATGGCTTTGACGACATCTCAAGCTCTTCGTGGCAGGACACTGCCTTGAGAAAATTTGCATCTCAGCATTTGTTTTCAAGGCTGTGTTGTTTTCCTTCACTCGAATATAGGTTAAACCCGAGTTTTTCCTTATCCATCTCTGACCTTGGTGTATTATTTTGTTTTTTTTACTTCAGAAGGTAGATTGATTGTGAGATTCCACGTCGGTTGGAGAGGAGAACGAAATATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGTAGACGCGTTTTGAAACTGTGAGGCTGACGGTGGCACGTAACGGGCCAAAGCGGACAATATCTACTAGCGGTGAGTTTGGGCTGTTACAAATGGTATCAGAGACAGACACCGAGCGTCCTGGCCCCCTAGGGGGTGGATTGTGAAATCCTACATCGGTTGAAGAGGGAAACGAAGCATTCCTTATAAGGATGTGGAAACCTCTTCCTAGTAGACGCATTTTAAAACCGTGAGACTAACGGCGATACGTAACGAGCCAAAGCCGACAGTATCTGCTAATGGTGAGTTTTGGCTATTTATCTTTGTTTTTCTTCTCCTAAATGTTTATACCTTCGATGTCATGCATTCTATGGTTACATATATCATCTCGTGTTCTATGTCGGGTATGCCACTGCATAATGAACTAATATTTTCATATTAATCATCTCTGGTTCGTATTTTCATGTTGTTCAATGGATATATGCATGCAAAATATAAGGATTTCATCACTTGCTTCAACAATAATAGGTTCTTTTGATTTCTTTTGTTGTTTGCTTTTACAGTAGTATAGAATCATATGTTCATCAATGTTCATAATTCTCAATAGTATTCTCCTTTTATATACTCATTCAGTCGTTTGAGTTCCCCATTTTGCTGCTCTACCACTGCTCTGACAACATCTACTATAGATATCATGCCAACTAGTTTGCCATCTATCACAGGAACGTGTCGTATGCGATTCTCTGCAAATGTCGAAAACTTAGATGGTCGTTGATGCAGCTGGAGAACGAGCCGACTCCCATTTTCATGGTAATTTCAGAACATAAACGACTCTGTATTTAGTATTTACCTGTCATTAGCTGCATTGCTTTAAGGATGTTCGTGTCGGACGTTACGCTTACTAGCTTGTCCTGTTCATGAAAAATAGAAATGTATGAATGAAAAATGTTGTTTGAAATCTGGTTTTTAGCTTAGGGAAGACAGAAATTAACCTCAGAAGTCATTATTTCCCCAACTTTTGTGTATATGGGAGATCT
TCCATCTGCTATTATCTTCTTCAGGTAGTCTGCTAATGTTTGCTTATTAGTGTTCTTGATAAGTTTAAATTTTTTAATCCATGTTTTTGAAATTGTTTGGCGTTTGATTTACCTCGTTCTGTTACAATTCCAGCAATGTGTTGTCCTTCTGGCTTCAGCACCACCAACGATCCAATATTATTTCTAGCCATCTGCATCAACACAAAAAGAGTCCTCAATCTTCAGAGATAACTTTTCAAGCTTTTGGTAATGTGAACTGAACCGAACTCGCTACGTTTTGCACGGCATCAATGGCGGTGTCATTGGCTCGACAGGATAGCCAAGGGGAGCCAATACTTCCATCTCCCTTACTCGAAACGATCTCTGCCACCGAAATGTTCTCCAGACCTTTCAACATAGAAGACGAAGCTCCTAAGTCGGATCCTTCGAAAAGCTTCTCGACTTGATTCGTTTCTCTCCTGCACGAGTGCTGCGAGATAGCGATCTTTAGTGTCTCTTGCCACGATCGTACTGCTCTTACGATCCCCTGCATTTTGGATTGGGATGGGAATGTGACAGGAAGATCCGTTGATGATGAAGGGTATGTTGTTCCATGGTGATTTGAAGTAGAATAAACAAGAATAACTTGCAATCATCGTTAATCTCATTACTAAACTAATATCTGGATTAGTGTGTGCATTATTCTTTGCTTTACAAGAATGTCTTGCAATCATCGTTTATGATTAACTAAGTTTTCAATTAGAATTGCAATTAATGAACATTCTTTGCTTCACTAATTGAAATCTAGGGAAATCCTTGTTCTTGATTGATTCCAAGTCCTAAGGGTAACGACTCAAATCCACCGTTAGCAGATATTGTCTTCTTTGGGCTTTCCCTTT

mRNA sequence

ATGAGCCTAAACCGTTTACTCCTTCGTCAAACACTGAAGAAATTTTCGAGAATCAATGGAAACATACTGCATCGCCAATCGGATGTCAAAAGCAATGTCACTCGGACATTCATCGCAACACCTTCATTTTCTTTGCTTGATCCAAAATTCAGTCGCTGTTCTTCAATTCCTGTCGAAAATCCTGAGCTCTGCAAATCCAATTCAATTTTCAGTAGGTGCATTCACTTCACTGCAACTAAGTTGAGCGATACAGCAATTGAGCCGAAACTGGAGTCATCAGACATTGAGGATCATGATGGATCAATGAATGAGTTCTTATCCAGATTTGTCTGGATCATGCGCGGGAAGATATCTGAAGCATTTCCGGACTATGATAAGCAAACAGTTGATGCAATGCTTTTGATGATTGTTGAAAAAGTGGTATCTGAAATGGAGAAGGGTAGCTTTGAGCAATCGTTAAGGACTTCAACTGGTAATCCAGATTGGGACCTAAGTGAGGATTTGTGGAAGACAGAAATGTGTAGGTTTGCTGGGGAAGTTGGTATTCGAGGGGATATGCTAAGGGACTTCAGAGGACAAGAACCAAAGCCAATTTCTGGGAAATGCAAATTGATCATAGAGAGAATTCTTTCACTAAACGAGAACGATGACCCATCTCCATTAATGGCTGAATGGACAGAGCTTCTTCAACCTACTAGGATTGACTGGATTACCTTACTTGATAAATTGAATGATAAGAACAGATTCTTATACTTGAAGGTAGCAGAGCTTCTTTTGAATGAAGAATCTTTCGAGACCAACATCCGTGACTACTCTAAGCTTGTCGATGTCCATGCTAAAGAGAATCGTCTAGAGGATGCTGAGAGGGTTCTTAAGATGATGAATGAGAAAGGCATTACACCAGACATTTTAACAGCCACAGTTTTGGTTCATATGTATAGCAAGGTGGGAAACCTTGATCGTGCAAAGGAAGCGTTTGATACATTGAGGAGTCACGGCTTCCAACCGGATGAGAAGGTTTATAATTCCATGATAATGGCATTTGTGAACACTGGACAACCAAAGTTGGGCGAATCGATGATGAGAGAAATGGAAGCAAGGGACATTAAACCAAGCAAGGATATTTACATGGCATTGCTAAGGTCATTTTCGCAACGTGGTGATATCAGTGGCGCTGGAGGAATTGCTGCGACGATGCAATTTTCTGGCATCTCGCCAAGTTTGGAGTCATGTACATTACTTGTTGAGACATATGGGCTAGCTGGTGATCCTGATCAGATCAGGCAAGGAACAATTTTGACTACATGATAAAAATCGGGCACAGGCCTGATGACAGGTGCACTGCAAGTATGGTTGCAGCCTATGGAAAGAAGAACCTGTTGGACAAGGCTCTGAATCTTTTACTACAGCTCGAAAAGGATGGGTTTGAGCCAGGGGTTGAGACTTATGCTGCTCTTGTAGATTGGTTAGGTAAGTTGCAGCTGGTTGATGAAGCTGAGCAGCTATTAGGCAAGATCGGCGCACAGGGAGATGCCGTGCCTCTTAAGGTTCATATTAGCCTCTGTGATATGTACTCGAGAGCTGGGGTTGAGAAAAAGGCGCTACAAGCGCTCGGGGTATTGGAAGCTAAAAAGGATGAGTTGGGACATGGTGAATTCGAAAGGATCATAAATGGACTTATAGCTGGTGGCTTTGTGCAGGATGCTAAAAGAATGCAGGGCCTTATGGAGGCTAAGGGTTTTACTGCATCCCAACCACTTCAAATGGCTTTGACGACATCTCAAGCTCTTCGTGGCAGGACACTGCCTTGAGAAAATTTGCATCTCAGCATTTGTTTTCAAGGCTGTGTTGTTTTCCTTCACTCGAATATAGGTTAAACCCGAGTTTTTCCTTATCCATCTCTGACCTTGGTGTATTATTTTGTTTTTTTTACTTCAGAAGGTAGATTGATTGTGAGATTCCACGTCGGTTGGAGAGGAGAACGAAATATTCTTTATAAG
GGTGTGGAAACCTCTCCCTAGTAGACGCGTTTTGAAACTGTGAGGCTGACGGTGGCACGTAACGGGCCAAAGCGGACAATATCTACTAGCGGTGAGTTTGGGCTGTTACAAATGGTATCAGAGACAGACACCGAGCGTCCTGGCCCCCTAGGGGGTGGATTGTGAAATCCTACATCGGTTGAAGAGGGAAACGAAGCATTCCTTATAAGGATGTGGAAACCTCTTCCTAGTAGACGCATTTTAAAACCGTGAGACTAACGGCGATACGTAACGAGCCAAAGCCGACAGTATCTGCTAATGGAACGTGTCGTATGCGATTCTCTGCAAATGTCGAAAACTTAGATGGTCGTTGATGCAGCTGGAGAACGAGCCGACTCCCATTTTCATGCAATGTGTTGTCCTTCTGGCTTCAGCACCACCAACGATCCAATATTATTTCTAGCCATCTGCATCAACACAAAAAGAGTCCTCAATCTTCAGAGATAACTTTTCAAGCTTTTGGTAATGTGAACTGAACCGAACTCGCTACGTTTTGCACGGCATCAATGGCGGTGTCATTGGCTCGACAGGATAGCCAAGGGGAGCCAATACTTCCATCTCCCTTACTCGAAACGATCTCTGCCACCGAAATGTTCTCCAGACCTTTCAACATAGAAGACGAAGCTCCTAAGTCGGATCCTTCGAAAAGCTTCTCGACTTGATTCGTTTCTCTCCTGCACGAGTGCTGCGAGATAGCGATCTTTAGTGTCTCTTGCCACGATCGTACTGCTCTTACGATCCCCTGCATTTTGGATTGGGATGGGAATGTGACAGGAAGATCCGTTGATGATGAAGGGTATGTTGTTCCATGGTGATTTGAAGTAGAATAAACAAGAATAACTTGCAATCATCGTTAATCTCATTACTAAACTAATATCTGGATTAGTGTGTGCATTATTCTTTGCTTTACAAGAATGTCTTGCAATCATCGTTTATGATTAACTAAGTTTTCAATTAGAATTGCAATTAATGAACATTCTTTGCTTCACTAATTGAAATCTAGGGAAATCCTTGTTCTTGATTGATTCCAAGTCCTAAGGGTAACGACTCAAATCCACCGTTAGCAGATATTGTCTTCTTTGGGCTTTCCCTTT

Coding sequence (CDS)

ATGAGCCTAAACCGTTTACTCCTTCGTCAAACACTGAAGAAATTTTCGAGAATCAATGGAAACATACTGCATCGCCAATCGGATGTCAAAAGCAATGTCACTCGGACATTCATCGCAACACCTTCATTTTCTTTGCTTGATCCAAAATTCAGTCGCTGTTCTTCAATTCCTGTCGAAAATCCTGAGCTCTGCAAATCCAATTCAATTTTCAGTAGGTGCATTCACTTCACTGCAACTAAGTTGAGCGATACAGCAATTGAGCCGAAACTGGAGTCATCAGACATTGAGGATCATGATGGATCAATGAATGAGTTCTTATCCAGATTTGTCTGGATCATGCGCGGGAAGATATCTGAAGCATTTCCGGACTATGATAAGCAAACAGTTGATGCAATGCTTTTGATGATTGTTGAAAAAGTGGTATCTGAAATGGAGAAGGGTAGCTTTGAGCAATCGTTAAGGACTTCAACTGGTAATCCAGATTGGGACCTAAGTGAGGATTTGTGGAAGACAGAAATGTGTAGGTTTGCTGGGGAAGTTGGTATTCGAGGGGATATGCTAAGGGACTTCAGAGGACAAGAACCAAAGCCAATTTCTGGGAAATGCAAATTGATCATAGAGAGAATTCTTTCACTAAACGAGAACGATGACCCATCTCCATTAATGGCTGAATGGACAGAGCTTCTTCAACCTACTAGGATTGACTGGATTACCTTACTTGATAAATTGAATGATAAGAACAGATTCTTATACTTGAAGGTAGCAGAGCTTCTTTTGAATGAAGAATCTTTCGAGACCAACATCCGTGACTACTCTAAGCTTGTCGATGTCCATGCTAAAGAGAATCGTCTAGAGGATGCTGAGAGGGTTCTTAAGATGATGAATGAGAAAGGCATTACACCAGACATTTTAACAGCCACAGTTTTGGTTCATATGTATAGCAAGGTGGGAAACCTTGATCGTGCAAAGGAAGCGTTTGATACATTGAGGAGTCACGGCTTCCAACCGGATGAGAAGGTTTATAATTCCATGATAATGGCATTTGTGAACACTGGACAACCAAAGTTGGGCGAATCGATGATGAGAGAAATGGAAGCAAGGGACATTAAACCAAGCAAGGATATTTACATGGCATTGCTAAGGTCATTTTCGCAACGTGGTGATATCAGTGGCGCTGGAGGAATTGCTGCGACGATGCAATTTTCTGGCATCTCGCCAAGTTTGGAGTCATGTACATTACTTGTTGAGACATATGGGCTAGCTGGTGATCCTGATCAGATCAGGCAAGGAACAATTTTGACTACATGA

Protein sequence

MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVENPELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEAFPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKTEMCRFAGEVGIRGDMLRDFRGQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLLDKLNDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDILTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMREMEARDIKPSKDIYMALLRSFSQRGDISGAGGIAATMQFSGISPSLESCTLLVETYGLAGDPDQIRQGTILTT
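
The protein sequence above corresponds to a translation of the CDS. The sketch below shows one way to reproduce that translation, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the helper name `translate` is hypothetical, and codons containing ambiguous bases are reported as `X`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First five codons of the CDS listed above.
print(translate("ATGAGCCTAAACCGT"))  # MSLNR
```

Passing the full CDS string to `translate` should give the protein sequence shown, provided the assumptions above hold for this annotation.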
Homology
BLAST of Cp4.1LG07g04100 vs. ExPASy Swiss-Prot
Match: Q8LEZ4 (Protein NUCLEAR FUSION DEFECTIVE 5, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NFD5 PE=2 SV=1)

HSP 1 Score: 172.9 bits (437), Expect = 7.7e-42
Identity = 109/323 (33.75%), Positives = 153/323 (47.37%), Query Frame = 0

Query: 57  PVENPELCKSNSIFSRCIHFT-ATKLSDT--AIEPKLESSDIEDHDGSMNEFLSRFVWIM 116
           P +N E+ +  S F+R  HFT  ++LS++  AI+   +  + +D DG+ NEFLSRFVWIM
Sbjct: 50  PFQNVEIPRPISSFNRYFHFTRESRLSESSAAIDDSNDQEE-DDEDGTTNEFLSRFVWIM 109

Query: 117 RGKISEAFPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT-- 176
           RGK+SEA+PD DK+ +D MLL+IVEKVV E+E+G F + + ++  +P  + S+DLW T  
Sbjct: 110 RGKVSEAYPDCDKKMIDGMLLLIVEKVVEEIERGGFNK-VGSAPPSPSSEFSDDLWATIW 169

Query: 177 -----------------------------EMCRFAGEVGIRGDMLRDFR----------- 236
                                        EMCRFAGE+GIRGD+LR+ R           
Sbjct: 170 EVSNTVLKDMEKERKKEKMKQYVQSPEVMEMCRFAGEIGIRGDLLRELRFKWAREKMDDA 229

Query: 237 ------------------------------------------------------------ 255
                                                                       
Sbjct: 230 EFYESLEQQRDLDNSIRESETVDGEVEEEGFVPSDEVESRSISLPKRKGKLKYKIYGLEL 289
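
The percentages on each HSP summary line are simply the listed counts divided by the alignment length. As a hedged illustration, the sketch below parses such a line and recomputes them; `parse_hsp_ratios` is a hypothetical helper, and it relies only on the `label = matches/length` pattern rather than on the exact label spelling.

```python
import re

def parse_hsp_ratios(line):
    """Return (label, matches, length, percent) for each 'label = a/b' field."""
    results = []
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(matches), int(length)
        results.append((label, matches, length, round(100 * matches / length, 2)))
    return results

line = "Identity = 109/323 (33.75%), Positives = 153/323 (47.37%), Query Frame = 0"
for label, matches, length, percent in parse_hsp_ratios(line):
    print(f"{label}: {matches}/{length} = {percent}%")
```

The first ratio counts identical residues and the second counts identical plus similar residues, so the second percentage is never lower than the first.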

BLAST of Cp4.1LG07g04100 vs. ExPASy Swiss-Prot
Match: Q9LPC4 (Pentatricopeptide repeat-containing protein At1g01970 OS=Arabidopsis thaliana OX=3702 GN=At1g01970 PE=2 SV=1)

HSP 1 Score: 171.4 bits (433), Expect = 2.2e-41
Identity = 85/235 (36.17%), Positives = 141/235 (60.00%), Query Frame = 0

Query: 195 PKPISGKCKLIIERILSLN-ENDDPSPLMAEWTELLQPTRIDWITLLDKLNDKNRFLYLK 254
           P  +S +C+ ++ +I+  + E      L+  W   + P R DW+++L +L + +   Y+K
Sbjct: 91  PIKMSKRCQALMRQIICFSPEKGSFCDLLGAWLRRMNPIRADWLSILKELKNLDSPFYIK 150

Query: 255 VAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDILTATVLVHMY 314
           VAE  L ++SFE N RDY+K++  + K N++EDAER L  M  +G   D +T T +V +Y
Sbjct: 151 VAEFSLLQDSFEANARDYTKIIHYYGKLNQVEDAERTLLSMKNRGFLIDQVTLTAMVQLY 210

Query: 315 SKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMREMEARDIKPSK 374
           SK G    A+E F+ ++  G   D + Y SMIMA++  G P+ GES++REM++++I   +
Sbjct: 211 SKAGCHKLAEETFNEIKLLGEPLDYRSYGSMIMAYIRAGVPEKGESLLREMDSQEICAGR 270

Query: 375 DIYMALLRSFSQRGDISGAGGIAATMQFSGISPSLESCTLLVETYGLAGDPDQIR 429
           ++Y ALLR +S  GD  GA  +   +Q +GI+P ++ C LL+  Y ++G     R
Sbjct: 271 EVYKALLRDYSMGGDAEGAKRVFDAVQIAGITPDVKLCGLLINAYSVSGQSQNAR 325

BLAST of Cp4.1LG07g04100 vs. ExPASy Swiss-Prot
Match: Q940Z1 (Pentatricopeptide repeat-containing protein At1g19525 OS=Arabidopsis thaliana OX=3702 GN=At1g19525 PE=2 SV=2)

HSP 1 Score: 161.8 bits (408), Expect = 1.8e-38
Identity = 74/136 (54.41%), Positives = 108/136 (79.41%), Query Frame = 0

Query: 294 MNEKGITPDILTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQ 353
           M++ GI PDILTAT LVHMYSK GN +RA EAF+ L+S+G +PDEK+Y +MI+ +VN G+
Sbjct: 1   MSQNGIFPDILTATALVHMYSKSGNFERATEAFENLKSYGLRPDEKIYEAMILGYVNAGK 60

Query: 354 PKLGESMMREMEARDIKPSKDIYMALLRSFSQRGDISGAGGIAATMQFSGISP-SLESCT 413
           PKLGE +M+EM+A+++K S+++YMALLR+++Q GD +GA GI+++MQ++   P S E+ +
Sbjct: 61  PKLGERLMKEMQAKELKASEEVYMALLRAYAQMGDANGAAGISSSMQYASDGPLSFEAYS 120

Query: 414 LLVETYGLAGDPDQIR 429
           L VE YG AG  D+ +
Sbjct: 121 LFVEAYGKAGQVDKAK 136

BLAST of Cp4.1LG07g04100 vs. ExPASy Swiss-Prot
Match: Q9FLL3 (Pentatricopeptide repeat-containing protein At5g41170, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g41170 PE=2 SV=1)

HSP 1 Score: 86.7 bits (213), Expect = 7.3e-16
Identity = 48/166 (28.92%), Positives = 90/166 (54.22%), Query Frame = 0

Query: 264 FETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDILTATVLVHMYSKVGNLDRAK 323
           FE +I  ++ L++     NR+E+A  ++  M E GI PD++  T ++    K G+++ A 
Sbjct: 138 FEPDIVTFTSLINGFCLGNRMEEAMSMVNQMVEMGIKPDVVMYTTIIDSLCKNGHVNYAL 197

Query: 324 EAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMREMEARDIKPSKDIYMALLRSF 383
             FD + ++G +PD  +Y S++    N+G+ +  +S++R M  R IKP    + AL+ +F
Sbjct: 198 SLFDQMENYGIRPDVVMYTSLVNGLCNSGRWRDADSLLRGMTKRKIKPDVITFNALIDAF 257

Query: 384 SQRGDISGAGGIAATMQFSGISPSLESCTLLVETYGLAGDPDQIRQ 430
            + G    A  +   M    I+P++ + T L+  + + G  D+ RQ
Sbjct: 258 VKEGKFLDAEELYNEMIRMSIAPNIFTYTSLINGFCMEGCVDEARQ 303

BLAST of Cp4.1LG07g04100 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.2e-15
Identity = 59/252 (23.41%), Positives = 108/252 (42.86%), Query Frame = 0

Query: 176 FAGEVGIRGDMLRDFRGQEPKPISGKCKLIIERILSLNENDDPSPLMAEWT-ELLQPTRI 235
           FAG + +   +      +   P       +I+    L + DD   L+     + L+P  I
Sbjct: 217 FAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLI 276

Query: 236 DWITLLDKLNDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMM 295
            +  +++ L  + R   +      +N   +  +   Y+ L+  + KE     A  +   M
Sbjct: 277 SYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEM 336

Query: 296 NEKGITPDILTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQP 355
              G+TP ++T T L+H   K GN++RA E  D +R  G  P+E+ Y +++  F   G  
Sbjct: 337 LRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYM 396

Query: 356 KLGESMMREMEARDIKPSKDIYMALLRSFSQRGDISGAGGIAATMQFSGISPSLESCTLL 415
                ++REM      PS   Y AL+      G +  A  +   M+  G+SP + S + +
Sbjct: 397 NEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTV 456

Query: 416 VETYGLAGDPDQ 427
           +  +  + D D+
Sbjct: 457 LSGFCRSYDVDE 468

BLAST of Cp4.1LG07g04100 vs. NCBI nr
Match: XP_023002246.1 (putative pentatricopeptide repeat-containing protein At2g02150 [Cucurbita maxima])

HSP 1 Score: 763 bits (1971), Expect = 4.60e-272
Identity = 411/540 (76.11%), Positives = 414/540 (76.67%), Query Frame = 0

Query: 1   MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVEN 60
           M LNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTF+A PSFSLLDPKFSRCSSIPVEN
Sbjct: 1   MKLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFVAAPSFSLLDPKFSRCSSIPVEN 60

Query: 61  PELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEA 120
            ELCKSNSIFSRCIHFTATK SDTAIEPKLESSD+ED DGSMNEFLSRFVWIMRGKISEA
Sbjct: 61  LELCKSNSIFSRCIHFTATKFSDTAIEPKLESSDVEDQDGSMNEFLSRFVWIMRGKISEA 120

Query: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT--------- 180
           FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTST NPDWDLSEDLWKT         
Sbjct: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTDNPDWDLSEDLWKTVSEVSNMVL 180

Query: 181 ----------------------EMCRFAGEVGIRGDMLRDFR------------------ 240
                                 EMCRFAGEVGIRGDMLRDFR                  
Sbjct: 181 DDMKKATKKEKMKGFLLSEEVQEMCRFAGEVGIRGDMLRDFRFKWAREKMEESEFYESLE 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 QLKKEANASPSGAEAAASVEKSEPVSIPKRRGKIKYKIYGLDLSDPKWSEVADKIHEAEE 300

Query: 301 ---GQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLLDKLNDKNR 360
               QEPKPISGKCKLI ERILSLNENDDPSPLMAEWT LLQPTRIDWI LLDKLNDKNR
Sbjct: 301 VIWPQEPKPISGKCKLITERILSLNENDDPSPLMAEWTGLLQPTRIDWIALLDKLNDKNR 360

Query: 361 FLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDILTATV 420
           FLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAER+LKMM EKGITPDILTATV
Sbjct: 361 FLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERILKMMTEKGITPDILTATV 420

Query: 421 LVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMREMEARD 428
           LVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVN GQPKLGESMMREMEARD
Sbjct: 421 LVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNAGQPKLGESMMREMEARD 480

BLAST of Cp4.1LG07g04100 vs. NCBI nr
Match: XP_022144194.1 (pentatricopeptide repeat-containing protein At5g39710 [Momordica charantia])

HSP 1 Score: 676 bits (1745), Expect = 1.25e-237
Identity = 365/548 (66.61%), Positives = 392/548 (71.53%), Query Frame = 0

Query: 1   MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVEN 60
           MSL RLLLRQTLKKFS I+G++LHRQS ++ N TRTF  TPSF LLDP + R SSI  +N
Sbjct: 1   MSLKRLLLRQTLKKFSGISGSLLHRQSAIRINATRTFTTTPSFYLLDPHYGRSSSIHAQN 60

Query: 61  PELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEA 120
            ELCKS+ IFSRCIHFT TKLSDTAIEPKLES+D ED DGSMNEFLSRFVWIMRGKISEA
Sbjct: 61  LELCKSSLIFSRCIHFTVTKLSDTAIEPKLESADAEDDDGSMNEFLSRFVWIMRGKISEA 120

Query: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT--------- 180
           FPDYDKQTVDAMLLMIVE+VVSEMEKG+  Q+L  S  + DWDLSEDLWKT         
Sbjct: 121 FPDYDKQTVDAMLLMIVERVVSEMEKGNIGQTLGASADSEDWDLSEDLWKTVSEVSNMVL 180

Query: 181 ----------------------EMCRFAGEVGIRGDMLRDFR------------------ 240
                                 EMCRFAGEVGIRGDMLR+FR                  
Sbjct: 181 EDMKKATKKEKMKGFLLSEEVQEMCRFAGEVGIRGDMLREFRFKWAREKMEESEFYESLE 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 QLKKEAQENDVEGNNKDSPSGAEAGSEEKSEVVSLPKRRGKIKYKIYGLDLSDPKWTKVA 300

Query: 301 -----------GQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLL 360
                       QEPKPISGKCKL+ ERILSLNENDDPSPL+AEWTELLQPTRIDWITLL
Sbjct: 301 DKIHEAEEVLWPQEPKPISGKCKLVTERILSLNENDDPSPLLAEWTELLQPTRIDWITLL 360

Query: 361 DKLNDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGIT 420
           DKLN+KNRFLYLKVAEL+L+EESF+TNIRDYSKLVD HAKENRLEDAER+LK MNEKGIT
Sbjct: 361 DKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLVDAHAKENRLEDAERILKKMNEKGIT 420

Query: 421 PDILTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESM 428
           PDILTATVLVHMYSKVGNLDRAKEAF+TLRSHGFQPDEKVYNSMIM  VN+GQPKLGES+
Sbjct: 421 PDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEKVYNSMIMVSVNSGQPKLGESL 480

BLAST of Cp4.1LG07g04100 vs. NCBI nr
Match: XP_022961855.1 (putative pentatricopeptide repeat-containing protein At2g02150 [Cucurbita moschata])

HSP 1 Score: 674 bits (1739), Expect = 9.16e-237
Identity = 363/545 (66.61%), Positives = 393/545 (72.11%), Query Frame = 0

Query: 1   MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVEN 60
           MSLN LLLR+TLK F RINGN+L +QS V  NVTRTFI +PSFSLLDP +   SS+PV N
Sbjct: 1   MSLNHLLLRRTLKNFLRINGNLLRQQSAVNINVTRTFITSPSFSLLDPHYDCYSSVPVRN 60

Query: 61  PELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEA 120
            EL KSNSIFSRCIH T TKLSD A+EPKLES+D+E+ DGSMNEFLSRFVWIMRGKISE 
Sbjct: 61  FELGKSNSIFSRCIHSTVTKLSDAAMEPKLESADVEEDDGSMNEFLSRFVWIMRGKISET 120

Query: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT--------- 180
           FPDYDK+TVDAMLLMIVEK+VSEMEKGSFEQSL+ ST N DWDLSEDLWKT         
Sbjct: 121 FPDYDKKTVDAMLLMIVEKLVSEMEKGSFEQSLKASTENRDWDLSEDLWKTVSEVSNMVL 180

Query: 181 ----------------------EMCRFAGEVGIRGDMLRDFR------------------ 240
                                 EMCRFAGEVGIRGDMLR+FR                  
Sbjct: 181 DDMKKATKKEKMKGFLLSEEVQEMCRFAGEVGIRGDMLREFRFKWAREKMEESEFYESLE 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 QLKKEACTQEENNDSPSSVEAASEVKSEAVSLPKRRGKIKYKIYGLDLSDPKWSEVADKV 300

Query: 301 --------GQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLLDKL 360
                    QEPKPISGKCKL+ ERILSLN+N+DPSPL+AEW +LLQPTR+DWI LLDKL
Sbjct: 301 HEAEEVLWPQEPKPISGKCKLVTERILSLNDNEDPSPLLAEWKDLLQPTRVDWIALLDKL 360

Query: 361 NDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDI 420
           N+ NRFLYLKVAELLL+EESF+T+IRDYSKLVDVHAKENRLEDAER+LK MNEKGITPDI
Sbjct: 361 NESNRFLYLKVAELLLSEESFQTDIRDYSKLVDVHAKENRLEDAERILKKMNEKGITPDI 420

Query: 421 LTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMRE 428
           LTA+VLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVN GQPKLGES+MRE
Sbjct: 421 LTASVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNAGQPKLGESLMRE 480

BLAST of Cp4.1LG07g04100 vs. NCBI nr
Match: XP_022996674.1 (putative pentatricopeptide repeat-containing protein At2g02150 [Cucurbita maxima])

HSP 1 Score: 674 bits (1739), Expect = 9.16e-237
Identity = 363/545 (66.61%), Positives = 391/545 (71.74%), Query Frame = 0

Query: 1   MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVEN 60
           MSLN LLLR+TLK F RINGN+L  QS V  N TRTFI +PSFSLLDP +   SS+P+ N
Sbjct: 1   MSLNHLLLRRTLKNFLRINGNLLRHQSAVNINATRTFITSPSFSLLDPHYGCYSSVPLRN 60

Query: 61  PELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEA 120
            EL KSNSIFSRCIH T TKLSD A+EPKLES+D+E+ DG MNEFLSRFVWI+RGKISE 
Sbjct: 61  FELGKSNSIFSRCIHSTVTKLSDAAMEPKLESADVEEDDGLMNEFLSRFVWIIRGKISET 120

Query: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT--------- 180
           FPDYDK+TVDAMLLMIVEKVVSEMEKGSFEQSL+ ST N DWDLSEDLWKT         
Sbjct: 121 FPDYDKKTVDAMLLMIVEKVVSEMEKGSFEQSLKASTENRDWDLSEDLWKTVSEVSNMVL 180

Query: 181 ----------------------EMCRFAGEVGIRGDMLRDFR------------------ 240
                                 EMCRFAGEVGIRGDMLR+FR                  
Sbjct: 181 DDMKKATKKEKMKGFLLSEEVQEMCRFAGEVGIRGDMLREFRFKWAREKMEESEFYESLE 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 QLKKEACTQEENNDSPSSVEDASEVKSEAVSLPKRRGKIKYKIYGLDLSDPKWSEVADKV 300

Query: 301 --------GQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLLDKL 360
                    QEPKPISGKCKL+ ERI SLN+N+DPSPL+AEW +LLQPTR+DWITLLDKL
Sbjct: 301 HEAEEVLWPQEPKPISGKCKLVTERIFSLNDNEDPSPLLAEWKDLLQPTRVDWITLLDKL 360

Query: 361 NDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDI 420
           N+ NRFLYLKVAELLL+EESF+TNIRDYSKLVDVHAKENRLEDAER+LK MNEKGITPDI
Sbjct: 361 NESNRFLYLKVAELLLSEESFQTNIRDYSKLVDVHAKENRLEDAERILKKMNEKGITPDI 420

Query: 421 LTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMRE 428
           LTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVN+GQPKLGES+MRE
Sbjct: 421 LTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNSGQPKLGESLMRE 480

BLAST of Cp4.1LG07g04100 vs. NCBI nr
Match: KAG7029395.1 (Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 668 bits (1723), Expect = 2.45e-234
Identity = 361/545 (66.24%), Positives = 391/545 (71.74%), Query Frame = 0

Query: 1   MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVEN 60
           MSLN LLLR+TLK F RINGN+L +QS V  N TR FI +PSFSLLD  +S  SS+PV N
Sbjct: 1   MSLNHLLLRRTLKNFLRINGNLLRQQSAVNINGTRIFITSPSFSLLDRHYSCYSSVPVRN 60

Query: 61  PELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEA 120
            EL KSNSIFSRCIH T TKLSD A+EPKLES+D+E+ DGSMNEFLSRFVWIMRGKISE 
Sbjct: 61  FELGKSNSIFSRCIHSTVTKLSDAAMEPKLESADVEEDDGSMNEFLSRFVWIMRGKISET 120

Query: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT--------- 180
           FPDYDK+TVDAMLLMIVEKVVSEMEKGSFEQSL+ ST N DWDLSEDLWKT         
Sbjct: 121 FPDYDKKTVDAMLLMIVEKVVSEMEKGSFEQSLKASTENRDWDLSEDLWKTVSEVSNMVL 180

Query: 181 ----------------------EMCRFAGEVGIRGDMLRDFR------------------ 240
                                 EMCRFAGEVGIRGDMLR+FR                  
Sbjct: 181 DDMKKATKKEKMKGFLLSEEVQEMCRFAGEVGIRGDMLREFRFKWAREKMEESEFYESLE 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 QLKKEACTQEENNDSPSSVEAASEVKSEAVSLPKRRGKIKYKIYGLDLSDPKWSEVADKV 300

Query: 301 --------GQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLLDKL 360
                    QEPKPISGKCKL+ ERILSLN+N+DPSPL+AEW +LLQPTR+DWI LLDKL
Sbjct: 301 HEAEEVLWPQEPKPISGKCKLVTERILSLNDNEDPSPLLAEWKDLLQPTRVDWIALLDKL 360

Query: 361 NDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDI 420
           N+ NRFLYLKVAELLL+EESF+T+IRDYSKLVDVHAKENRLEDAER+LK MNEKGITPDI
Sbjct: 361 NESNRFLYLKVAELLLSEESFQTDIRDYSKLVDVHAKENRLEDAERILKKMNEKGITPDI 420

Query: 421 LTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMRE 428
           LTA+VLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVN GQPKLGES+MRE
Sbjct: 421 LTASVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNAGQPKLGESLMRE 480

BLAST of Cp4.1LG07g04100 vs. ExPASy TrEMBL
Match: A0A6J1KKS4 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita maxima OX=3661 GN=LOC111496154 PE=3 SV=1)

HSP 1 Score: 763 bits (1971), Expect = 2.23e-272
Identity = 411/540 (76.11%), Positives = 414/540 (76.67%), Query Frame = 0

Query: 1   MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVEN 60
           M LNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTF+A PSFSLLDPKFSRCSSIPVEN
Sbjct: 1   MKLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFVAAPSFSLLDPKFSRCSSIPVEN 60

Query: 61  PELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEA 120
            ELCKSNSIFSRCIHFTATK SDTAIEPKLESSD+ED DGSMNEFLSRFVWIMRGKISEA
Sbjct: 61  LELCKSNSIFSRCIHFTATKFSDTAIEPKLESSDVEDQDGSMNEFLSRFVWIMRGKISEA 120

Query: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT--------- 180
           FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTST NPDWDLSEDLWKT         
Sbjct: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTDNPDWDLSEDLWKTVSEVSNMVL 180

Query: 181 ----------------------EMCRFAGEVGIRGDMLRDFR------------------ 240
                                 EMCRFAGEVGIRGDMLRDFR                  
Sbjct: 181 DDMKKATKKEKMKGFLLSEEVQEMCRFAGEVGIRGDMLRDFRFKWAREKMEESEFYESLE 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 QLKKEANASPSGAEAAASVEKSEPVSIPKRRGKIKYKIYGLDLSDPKWSEVADKIHEAEE 300

Query: 301 ---GQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLLDKLNDKNR 360
               QEPKPISGKCKLI ERILSLNENDDPSPLMAEWT LLQPTRIDWI LLDKLNDKNR
Sbjct: 301 VIWPQEPKPISGKCKLITERILSLNENDDPSPLMAEWTGLLQPTRIDWIALLDKLNDKNR 360

Query: 361 FLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDILTATV 420
           FLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAER+LKMM EKGITPDILTATV
Sbjct: 361 FLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERILKMMTEKGITPDILTATV 420

Query: 421 LVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMREMEARD 428
           LVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVN GQPKLGESMMREMEARD
Sbjct: 421 LVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNAGQPKLGESMMREMEARD 480

BLAST of Cp4.1LG07g04100 vs. ExPASy TrEMBL
Match: A0A6J1CRK9 (pentatricopeptide repeat-containing protein At5g39710 OS=Momordica charantia OX=3673 GN=LOC111013946 PE=3 SV=1)

HSP 1 Score: 676 bits (1745), Expect = 6.04e-238
Identity = 365/548 (66.61%), Positives = 392/548 (71.53%), Query Frame = 0

Query: 1   MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVEN 60
           MSL RLLLRQTLKKFS I+G++LHRQS ++ N TRTF  TPSF LLDP + R SSI  +N
Sbjct: 1   MSLKRLLLRQTLKKFSGISGSLLHRQSAIRINATRTFTTTPSFYLLDPHYGRSSSIHAQN 60

Query: 61  PELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEA 120
            ELCKS+ IFSRCIHFT TKLSDTAIEPKLES+D ED DGSMNEFLSRFVWIMRGKISEA
Sbjct: 61  LELCKSSLIFSRCIHFTVTKLSDTAIEPKLESADAEDDDGSMNEFLSRFVWIMRGKISEA 120

Query: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT--------- 180
           FPDYDKQTVDAMLLMIVE+VVSEMEKG+  Q+L  S  + DWDLSEDLWKT         
Sbjct: 121 FPDYDKQTVDAMLLMIVERVVSEMEKGNIGQTLGASADSEDWDLSEDLWKTVSEVSNMVL 180

Query: 181 ----------------------EMCRFAGEVGIRGDMLRDFR------------------ 240
                                 EMCRFAGEVGIRGDMLR+FR                  
Sbjct: 181 EDMKKATKKEKMKGFLLSEEVQEMCRFAGEVGIRGDMLREFRFKWAREKMEESEFYESLE 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 QLKKEAQENDVEGNNKDSPSGAEAGSEEKSEVVSLPKRRGKIKYKIYGLDLSDPKWTKVA 300

Query: 301 -----------GQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLL 360
                       QEPKPISGKCKL+ ERILSLNENDDPSPL+AEWTELLQPTRIDWITLL
Sbjct: 301 DKIHEAEEVLWPQEPKPISGKCKLVTERILSLNENDDPSPLLAEWTELLQPTRIDWITLL 360

Query: 361 DKLNDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGIT 420
           DKLN+KNRFLYLKVAEL+L+EESF+TNIRDYSKLVD HAKENRLEDAER+LK MNEKGIT
Sbjct: 361 DKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLVDAHAKENRLEDAERILKKMNEKGIT 420

Query: 421 PDILTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESM 428
           PDILTATVLVHMYSKVGNLDRAKEAF+TLRSHGFQPDEKVYNSMIM  VN+GQPKLGES+
Sbjct: 421 PDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEKVYNSMIMVSVNSGQPKLGESL 480

BLAST of Cp4.1LG07g04100 vs. ExPASy TrEMBL
Match: A0A6J1K9D9 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita maxima OX=3661 GN=LOC111491849 PE=4 SV=1)

HSP 1 Score: 674 bits (1739), Expect = 4.44e-237
Identity = 363/545 (66.61%), Positives = 391/545 (71.74%), Query Frame = 0

Query: 1   MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVEN 60
           MSLN LLLR+TLK F RINGN+L  QS V  N TRTFI +PSFSLLDP +   SS+P+ N
Sbjct: 1   MSLNHLLLRRTLKNFLRINGNLLRHQSAVNINATRTFITSPSFSLLDPHYGCYSSVPLRN 60

Query: 61  PELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEA 120
            EL KSNSIFSRCIH T TKLSD A+EPKLES+D+E+ DG MNEFLSRFVWI+RGKISE 
Sbjct: 61  FELGKSNSIFSRCIHSTVTKLSDAAMEPKLESADVEEDDGLMNEFLSRFVWIIRGKISET 120

Query: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT--------- 180
           FPDYDK+TVDAMLLMIVEKVVSEMEKGSFEQSL+ ST N DWDLSEDLWKT         
Sbjct: 121 FPDYDKKTVDAMLLMIVEKVVSEMEKGSFEQSLKASTENRDWDLSEDLWKTVSEVSNMVL 180

Query: 181 ----------------------EMCRFAGEVGIRGDMLRDFR------------------ 240
                                 EMCRFAGEVGIRGDMLR+FR                  
Sbjct: 181 DDMKKATKKEKMKGFLLSEEVQEMCRFAGEVGIRGDMLREFRFKWAREKMEESEFYESLE 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 QLKKEACTQEENNDSPSSVEDASEVKSEAVSLPKRRGKIKYKIYGLDLSDPKWSEVADKV 300

Query: 301 --------GQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLLDKL 360
                    QEPKPISGKCKL+ ERI SLN+N+DPSPL+AEW +LLQPTR+DWITLLDKL
Sbjct: 301 HEAEEVLWPQEPKPISGKCKLVTERIFSLNDNEDPSPLLAEWKDLLQPTRVDWITLLDKL 360

Query: 361 NDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDI 420
           N+ NRFLYLKVAELLL+EESF+TNIRDYSKLVDVHAKENRLEDAER+LK MNEKGITPDI
Sbjct: 361 NESNRFLYLKVAELLLSEESFQTNIRDYSKLVDVHAKENRLEDAERILKKMNEKGITPDI 420

Query: 421 LTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMRE 428
           LTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVN+GQPKLGES+MRE
Sbjct: 421 LTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNSGQPKLGESLMRE 480

BLAST of Cp4.1LG07g04100 vs. ExPASy TrEMBL
Match: A0A6J1HDE2 (putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita moschata OX=3662 GN=LOC111462496 PE=3 SV=1)

HSP 1 Score: 674 bits (1739), Expect = 4.44e-237
Identity = 363/545 (66.61%), Positives = 393/545 (72.11%), Query Frame = 0

Query: 1   MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVEN 60
           MSLN LLLR+TLK F RINGN+L +QS V  NVTRTFI +PSFSLLDP +   SS+PV N
Sbjct: 1   MSLNHLLLRRTLKNFLRINGNLLRQQSAVNINVTRTFITSPSFSLLDPHYDCYSSVPVRN 60

Query: 61  PELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEA 120
            EL KSNSIFSRCIH T TKLSD A+EPKLES+D+E+ DGSMNEFLSRFVWIMRGKISE 
Sbjct: 61  FELGKSNSIFSRCIHSTVTKLSDAAMEPKLESADVEEDDGSMNEFLSRFVWIMRGKISET 120

Query: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT--------- 180
           FPDYDK+TVDAMLLMIVEK+VSEMEKGSFEQSL+ ST N DWDLSEDLWKT         
Sbjct: 121 FPDYDKKTVDAMLLMIVEKLVSEMEKGSFEQSLKASTENRDWDLSEDLWKTVSEVSNMVL 180

Query: 181 ----------------------EMCRFAGEVGIRGDMLRDFR------------------ 240
                                 EMCRFAGEVGIRGDMLR+FR                  
Sbjct: 181 DDMKKATKKEKMKGFLLSEEVQEMCRFAGEVGIRGDMLREFRFKWAREKMEESEFYESLE 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 QLKKEACTQEENNDSPSSVEAASEVKSEAVSLPKRRGKIKYKIYGLDLSDPKWSEVADKV 300

Query: 301 --------GQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLLDKL 360
                    QEPKPISGKCKL+ ERILSLN+N+DPSPL+AEW +LLQPTR+DWI LLDKL
Sbjct: 301 HEAEEVLWPQEPKPISGKCKLVTERILSLNDNEDPSPLLAEWKDLLQPTRVDWIALLDKL 360

Query: 361 NDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDI 420
           N+ NRFLYLKVAELLL+EESF+T+IRDYSKLVDVHAKENRLEDAER+LK MNEKGITPDI
Sbjct: 361 NESNRFLYLKVAELLLSEESFQTDIRDYSKLVDVHAKENRLEDAERILKKMNEKGITPDI 420

Query: 421 LTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMRE 428
           LTA+VLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVN GQPKLGES+MRE
Sbjct: 421 LTASVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNAGQPKLGESLMRE 480

BLAST of Cp4.1LG07g04100 vs. ExPASy TrEMBL
Match: A0A5A7VHK4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G005590 PE=4 SV=1)

HSP 1 Score: 647 bits (1668), Expect = 2.51e-226
Identity = 356/545 (65.32%), Positives = 385/545 (70.64%), Query Frame = 0

Query: 1   MSLNRLLLRQTLKKFSRINGNILHRQSDVKSNVTRTFIATPSFSLLDPKFSRCSSIPVEN 60
           MSL  LLLRQT K FS+INGN+L RQS    N T  FI  PSFSLLD      SSI   N
Sbjct: 1   MSLKHLLLRQTRKNFSKINGNLLDRQSP-SINATHIFITKPSFSLLDSHHGYYSSIAARN 60

Query: 61  PELCKSNSIFSRCIHFTATKLSDTAIEPKLESSDIEDHDGSMNEFLSRFVWIMRGKISEA 120
            EL KSNSIFSRCIHFT TKL++ AIE K ES+++ED DGSMNEFLSRFVWIMRGKISEA
Sbjct: 61  FELSKSNSIFSRCIHFTVTKLNNAAIELKPESAEVEDDDGSMNEFLSRFVWIMRGKISEA 120

Query: 121 FPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT--------- 180
           FPDYDKQTV+AMLLMIVEKVVSEMEKGSFEQ+L++ST NPDWDLSEDLWKT         
Sbjct: 121 FPDYDKQTVNAMLLMIVEKVVSEMEKGSFEQTLKSSTDNPDWDLSEDLWKTVSEVSNMVL 180

Query: 181 ----------------------EMCRFAGEVGIRGDMLRDFR------------------ 240
                                 EMCRFAGEVGIRGDMLR+FR                  
Sbjct: 181 DDMKKATKKEKMKGFLLSREVQEMCRFAGEVGIRGDMLREFRFKWAREKMEESEFYEGLE 240

Query: 241 ------------------------------------------------------------ 300
                                                                       
Sbjct: 241 QLRKKAHTQEETYDSASGTEAASEVKSEAFSLPKRRGKLKYKIYGLDLSDTKWSEVADKI 300

Query: 301 ---GQ-----EPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQPTRIDWITLLDKL 360
              GQ     EPKPISG CKL+ ERILSLNENDDPSPL+AEW ELLQPTRIDWITLLD+L
Sbjct: 301 HEAGQMLWPPEPKPISGMCKLVTERILSLNENDDPSPLLAEWKELLQPTRIDWITLLDRL 360

Query: 361 NDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDI 420
           N+KNRFLY KVAELLLNEESF+TNIRDYSKLVDV+AKE+RLEDAER+L  MNEKGITPDI
Sbjct: 361 NEKNRFLYFKVAELLLNEESFQTNIRDYSKLVDVYAKESRLEDAERILMKMNEKGITPDI 420

Query: 421 LTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMRE 428
           LTATVLVHMYSKVGNLDRAKEAFDTL+SHGFQPDEKVYNSMIMA+VN GQPKLGES+MR+
Sbjct: 421 LTATVLVHMYSKVGNLDRAKEAFDTLKSHGFQPDEKVYNSMIMAYVNAGQPKLGESLMRD 480

BLAST of Cp4.1LG07g04100 vs. TAIR 10
Match: AT1G19520.1 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 375.6 bits (963), Expect = 5.6e-104
Identity = 206/498 (41.37%), Positives = 292/498 (58.63%), Query Frame = 0

Query: 57  PVENPELCKSNSIFSRCIHFT-ATKLSDT--AIEPKLESSDIEDHDGSMNEFLSRFVWIM 116
           P +N E+ +  S F+R  HFT  ++LS++  AI+   +  + +D DG+ NEFLSRFVWIM
Sbjct: 50  PFQNVEIPRPISSFNRYFHFTRESRLSESSAAIDDSNDQEE-DDEDGTTNEFLSRFVWIM 109

Query: 117 RGKISEAFPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT-- 176
           RGK+SEA+PD DK+ +D MLL+IVEKVV E+E+G F + + ++  +P  + S+DLW T  
Sbjct: 110 RGKVSEAYPDCDKKMIDGMLLLIVEKVVEEIERGGFNK-VGSAPPSPSSEFSDDLWATIW 169

Query: 177 -----------------------------EMCRFAGEVGIRGDMLRDFR----------- 236
                                        EMCRFAGE+GIRGD+LR+ R           
Sbjct: 170 EVSNTVLKDMEKERKKEKMKQYVQSPEVMEMCRFAGEIGIRGDLLRELRFKWAREKMDDA 229

Query: 237 ------------------------------------------------------------ 296
                                                                       
Sbjct: 230 EFYESLEQQRDLDNSIRESETVDGEVEEEGFVPSDEVESRSISLPKRKGKLKYKIYGLEL 289

Query: 297 --------------------GQEPKPISGKCKLIIERILSLNENDDPSPLMAEWTELLQP 356
                                +EPKP++GKCKL++E++ SL E DDPS L+AEW ELL+P
Sbjct: 290 SDPKWVEMADKIHEAEEEADWREPKPVTGKCKLVMEKLESLQEGDDPSGLLAEWAELLEP 349

Query: 357 TRIDWITLLDKLNDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVL 416
            R+DWI L+++L + N   YLKVAE +L+E+SF  +I DYSKL+ +HAKEN +ED ER+L
Sbjct: 350 NRVDWIALINQLREGNTHAYLKVAEGVLDEKSFNASISDYSKLIHIHAKENHIEDVERIL 409

Query: 417 KMMNEKGITPDILTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNT 429
           K M++ GI PDILTAT LVHMYSK GN +RA EAF+ L+S+G +PDEK+Y +MI+ +VN 
Sbjct: 410 KKMSQNGIFPDILTATALVHMYSKSGNFERATEAFENLKSYGLRPDEKIYEAMILGYVNA 469

BLAST of Cp4.1LG07g04100 vs. TAIR 10
Match: AT1G19520.2 (pentatricopeptide (PPR) repeat-containing protein )

HSP 1 Score: 172.9 bits (437), Expect = 5.5e-43
Identity = 109/323 (33.75%), Positives = 153/323 (47.37%), Query Frame = 0

Query: 57  PVENPELCKSNSIFSRCIHFT-ATKLSDT--AIEPKLESSDIEDHDGSMNEFLSRFVWIM 116
           P +N E+ +  S F+R  HFT  ++LS++  AI+   +  + +D DG+ NEFLSRFVWIM
Sbjct: 50  PFQNVEIPRPISSFNRYFHFTRESRLSESSAAIDDSNDQEE-DDEDGTTNEFLSRFVWIM 109

Query: 117 RGKISEAFPDYDKQTVDAMLLMIVEKVVSEMEKGSFEQSLRTSTGNPDWDLSEDLWKT-- 176
           RGK+SEA+PD DK+ +D MLL+IVEKVV E+E+G F + + ++  +P  + S+DLW T  
Sbjct: 110 RGKVSEAYPDCDKKMIDGMLLLIVEKVVEEIERGGFNK-VGSAPPSPSSEFSDDLWATIW 169

Query: 177 -----------------------------EMCRFAGEVGIRGDMLRDFR----------- 236
                                        EMCRFAGE+GIRGD+LR+ R           
Sbjct: 170 EVSNTVLKDMEKERKKEKMKQYVQSPEVMEMCRFAGEIGIRGDLLRELRFKWAREKMDDA 229

Query: 237 ------------------------------------------------------------ 255
                                                                       
Sbjct: 230 EFYESLEQQRDLDNSIRESETVDGEVEEEGFVPSDEVESRSISLPKRKGKLKYKIYGLEL 289

BLAST of Cp4.1LG07g04100 vs. TAIR 10
Match: AT1G01970.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 171.4 bits (433), Expect = 1.6e-42
Identity = 85/235 (36.17%), Positives = 141/235 (60.00%), Query Frame = 0

Query: 195 PKPISGKCKLIIERILSLN-ENDDPSPLMAEWTELLQPTRIDWITLLDKLNDKNRFLYLK 254
           P  +S +C+ ++ +I+  + E      L+  W   + P R DW+++L +L + +   Y+K
Sbjct: 91  PIKMSKRCQALMRQIICFSPEKGSFCDLLGAWLRRMNPIRADWLSILKELKNLDSPFYIK 150

Query: 255 VAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDILTATVLVHMY 314
           VAE  L ++SFE N RDY+K++  + K N++EDAER L  M  +G   D +T T +V +Y
Sbjct: 151 VAEFSLLQDSFEANARDYTKIIHYYGKLNQVEDAERTLLSMKNRGFLIDQVTLTAMVQLY 210

Query: 315 SKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMREMEARDIKPSK 374
           SK G    A+E F+ ++  G   D + Y SMIMA++  G P+ GES++REM++++I   +
Sbjct: 211 SKAGCHKLAEETFNEIKLLGEPLDYRSYGSMIMAYIRAGVPEKGESLLREMDSQEICAGR 270

Query: 375 DIYMALLRSFSQRGDISGAGGIAATMQFSGISPSLESCTLLVETYGLAGDPDQIR 429
           ++Y ALLR +S  GD  GA  +   +Q +GI+P ++ C LL+  Y ++G     R
Sbjct: 271 EVYKALLRDYSMGGDAEGAKRVFDAVQIAGITPDVKLCGLLINAYSVSGQSQNAR 325

BLAST of Cp4.1LG07g04100 vs. TAIR 10
Match: AT5G41170.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 86.7 bits (213), Expect = 5.2e-17
Identity = 48/166 (28.92%), Positives = 90/166 (54.22%), Query Frame = 0

Query: 264 FETNIRDYSKLVDVHAKENRLEDAERVLKMMNEKGITPDILTATVLVHMYSKVGNLDRAK 323
           FE +I  ++ L++     NR+E+A  ++  M E GI PD++  T ++    K G+++ A 
Sbjct: 138 FEPDIVTFTSLINGFCLGNRMEEAMSMVNQMVEMGIKPDVVMYTTIIDSLCKNGHVNYAL 197

Query: 324 EAFDTLRSHGFQPDEKVYNSMIMAFVNTGQPKLGESMMREMEARDIKPSKDIYMALLRSF 383
             FD + ++G +PD  +Y S++    N+G+ +  +S++R M  R IKP    + AL+ +F
Sbjct: 198 SLFDQMENYGIRPDVVMYTSLVNGLCNSGRWRDADSLLRGMTKRKIKPDVITFNALIDAF 257

Query: 384 SQRGDISGAGGIAATMQFSGISPSLESCTLLVETYGLAGDPDQIRQ 430
            + G    A  +   M    I+P++ + T L+  + + G  D+ RQ
Sbjct: 258 VKEGKFLDAEELYNEMIRMSIAPNIFTYTSLINGFCMEGCVDEARQ 303

BLAST of Cp4.1LG07g04100 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 85.9 bits (211), Expect = 8.8e-17
Identity = 59/252 (23.41%), Positives = 108/252 (42.86%), Query Frame = 0

Query: 176 FAGEVGIRGDMLRDFRGQEPKPISGKCKLIIERILSLNENDDPSPLMAEWT-ELLQPTRI 235
           FAG + +   +      +   P       +I+    L + DD   L+     + L+P  I
Sbjct: 217 FAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLI 276

Query: 236 DWITLLDKLNDKNRFLYLKVAELLLNEESFETNIRDYSKLVDVHAKENRLEDAERVLKMM 295
            +  +++ L  + R   +      +N   +  +   Y+ L+  + KE     A  +   M
Sbjct: 277 SYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEM 336

Query: 296 NEKGITPDILTATVLVHMYSKVGNLDRAKEAFDTLRSHGFQPDEKVYNSMIMAFVNTGQP 355
              G+TP ++T T L+H   K GN++RA E  D +R  G  P+E+ Y +++  F   G  
Sbjct: 337 LRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYM 396

Query: 356 KLGESMMREMEARDIKPSKDIYMALLRSFSQRGDISGAGGIAATMQFSGISPSLESCTLL 415
                ++REM      PS   Y AL+      G +  A  +   M+  G+SP + S + +
Sbjct: 397 NEAYRVLREMNDNGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTV 456

Query: 416 VETYGLAGDPDQ 427
           +  +  + D D+
Sbjct: 457 LSGFCRSYDVDE 468
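
The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length (for example, 363/545 = 66.61%). The following is a minimal Python sketch, not part of the original page, that re-derives both percentages from one header line. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not output of any BLAST tool.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" part of an HSP header.
# The optional "i" also tolerates the "Postives" misspelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(header_line):
    """Recompute identity and positives percentages from an HSP header line.

    Returns a dict with the recomputed values (rounded to two decimals) and
    a flag saying whether they agree with the percentages printed in the line.
    """
    match = HSP_STATS.search(header_line)
    if match is None:
        raise ValueError("no Identity/Positives statistics found in line")

    id_count, id_length, id_reported, pos_count, pos_length, pos_reported = match.groups()
    identity = 100 * int(id_count) / int(id_length)
    positives = 100 * int(pos_count) / int(pos_length)

    return {
        "identity": round(identity, 2),
        "positives": round(positives, 2),
        # Reported values are rounded to two decimals, so allow 0.01 of slack.
        "consistent": abs(identity - float(id_reported)) < 0.01
        and abs(positives - float(pos_reported)) < 0.01,
    }
```

Called on a header such as `Identity = 363/545 (66.61%), Positives = 391/545 (71.74%), Query Frame = 0`, the function recomputes the two ratios and reports whether they match the printed values.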

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q8LEZ4 | 7.7e-42 | 33.75 | Protein NUCLEAR FUSION DEFECTIVE 5, mitochondrial OS=Arabidopsis thaliana OX=370...
Q9LPC4 | 2.2e-41 | 36.17 | Pentatricopeptide repeat-containing protein At1g01970 OS=Arabidopsis thaliana OX...
Q940Z1 | 1.8e-38 | 54.41 | Pentatricopeptide repeat-containing protein At1g19525 OS=Arabidopsis thaliana OX...
Q9FLL3 | 7.3e-16 | 28.92 | Pentatricopeptide repeat-containing protein At5g41170, mitochondrial OS=Arabidop...
Q9FIX3 | 1.2e-15 | 23.41 | Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity | Description
XP_023002246.1 | 4.60e-272 | 76.11 | putative pentatricopeptide repeat-containing protein At2g02150 [Cucurbita maxima...
XP_022144194.1 | 1.25e-237 | 66.61 | pentatricopeptide repeat-containing protein At5g39710 [Momordica charantia]
XP_022961855.1 | 9.16e-237 | 66.61 | putative pentatricopeptide repeat-containing protein At2g02150 [Cucurbita moscha...
XP_022996674.1 | 9.16e-237 | 66.61 | putative pentatricopeptide repeat-containing protein At2g02150 [Cucurbita maxima...
KAG7029395.1 | 2.45e-234 | 66.24 | Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyr...

Match Name | E-value | Identity | Description
A0A6J1KKS4 | 2.23e-272 | 76.11 | putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita maxi...
A0A6J1CRK9 | 6.04e-238 | 66.61 | pentatricopeptide repeat-containing protein At5g39710 OS=Momordica charantia OX=...
A0A6J1K9D9 | 4.44e-237 | 66.61 | putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita maxi...
A0A6J1HDE2 | 4.44e-237 | 66.61 | putative pentatricopeptide repeat-containing protein At2g02150 OS=Cucurbita mosc...
A0A5A7VHK4 | 2.51e-226 | 65.32 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name | E-value | Identity | Description
AT1G19520.1 | 5.6e-104 | 41.37 | pentatricopeptide (PPR) repeat-containing protein
AT1G19520.2 | 5.5e-43 | 33.75 | pentatricopeptide (PPR) repeat-containing protein
AT1G01970.1 | 1.6e-42 | 36.17 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G41170.1 | 5.2e-17 | 28.92 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G39710.1 | 8.8e-17 | 23.41 | Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 277..297
None | No IPR available | PANTHER | PTHR46862:SF2 | PROTEIN NUCLEAR FUSION DEFECTIVE 5, MITOCHONDRIAL | coord: 1..171; coord: 193..429
None | No IPR available | PANTHER | PTHR46862:SF2 | PROTEIN NUCLEAR FUSION DEFECTIVE 5, MITOCHONDRIAL | coord: 172..191
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 221..330, e-value: 4.3E-15, score: 57.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 331..427, e-value: 1.1E-15, score: 59.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 291..350, e-value: 2.2E-8, score: 34.1; coord: 360..419, e-value: 1.9E-4, score: 21.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 307..338, e-value: 1.5E-4, score: 19.7; coord: 271..303, e-value: 7.6E-6, score: 23.8; coord: 340..372, e-value: 5.8E-6, score: 24.2
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 337..371, score: 10.884628
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 267..301, score: 10.873667
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 302..336, score: 10.599635
IPR044657 | Pentatricopeptide repeat-containing protein NFD5-like | PANTHER | PTHR46862 | OS07G0661900 PROTEIN | coord: 1..171; coord: 193..429
IPR044657 | Pentatricopeptide repeat-containing protein NFD5-like | PANTHER | PTHR46862 | OS07G0661900 PROTEIN | coord: 172..191
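
The `coord: start..end` entries above give the span of each hit on the protein. Assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the three PROSITE PPR hits each spanning 35 residues, the length of a pentatricopeptide motif), the length of a span is `end - start + 1`. A small Python sketch of that arithmetic follows; the helper name `span_lengths` is hypothetical.

```python
import re

# Captures the start and end of every "coord: start..end" entry in a text.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for each coord entry found in the text.

    Assumes 1-based inclusive coordinates, so length = end - start + 1.
    """
    spans = []
    for start, end in COORD.findall(text):
        start, end = int(start), int(end)
        spans.append((start, end, end - start + 1))
    return spans
```

Applied to the PROSITE PS51375 rows, each of the spans 337..371, 267..301 and 302..336 comes out at 35 residues.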

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Cp4.1LG07g04100.1 | Cp4.1LG07g04100.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding