Cp4.1LG01g23300 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG01g23300
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG01: 19607229 .. 19609130 (-)
RNA-Seq Expression: Cp4.1LG01g23300
Synteny: Cp4.1LG01g23300
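
The Location field can be checked against the protein length reported in the homology section (633 residues). The sketch below assumes the coordinates are 1-based and inclusive, which is an assumption about this database's convention rather than something the record states; `parse_location` and `span_length` are hypothetical helper names introduced only for illustration.

```python
import re

def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG01: 19607229 .. 19609130 (-)")
length = span_length(start, end)

# 633 amino acids plus one stop codon, three bases each.
protein_length = 633
expected_cds_length = 3 * (protein_length + 1)

print(seqid, strand, length, expected_cds_length)
```

Under that assumption the span is 1902 bp, the same as 3 × (633 + 1), which is what an intron-free coding gene of this size would give.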
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCATCGAATTCCCGCTTCGAATCTCACCAAACTCTCGAAGCCCTTCTTCTTCCTCGCTTCAATTCCCAAATCAAATTTCTCTTCGGCTTCATCCTCAACTTTTTTACAATCAATTCCTCGGTCCGAAACTAAATTAATCGTCAACCCTCTTTACCATTTTCTCCCACAAAACCAAAACCCCTTCAACATCGTCGAACTCGTCTCCTCACACCTCAAAACCAGCAACACGCAGTTATCTCTTCTTCAATCCGACATTAAGGAGCTTCTTCCCCACTTGGGTCATCGTGAAATCTCCAAGATTATATTGAGGTGCCAATCTAATTTCGTCTCTGTTCTTGCTTTTTTCAATTGGGTTAAATTTGATTTGGGGATTACACTTAATTCCCAAAACTATTGTCTTATTATCCATATATTGGCATGGTCCCGACAATTTTCCATGGCGATGAAATTCTTGTCTGAACTGATTGAGTTGTCTAAGGATAATGCCTCAGGTAGTGAGGATGTTTTTCATAATCTGGTATTGTGCACTGAGCACTGTAATTGGAATCCTGTTATCTTTGAAATGCTTATGAAGGCGTATGTGAAAGTTCATATGATTCAGGAAAGTTATGAGAGCTTTAAGAAGATGGTGAAGATGGGTTTTGTTCCAAGTGTGATTGCTTGTAATTGTATTCTAAATGGACTGGCGAAGATGAAGTGTGATGCTCAATGTTGGGAGCTCTATGAAGAAATGGGAAGGATTGGAGTTCATTCAAATGCATATACTTTTAATATTTTGACTTATGTTTTGTGTAGAGCTGGGGATGTAAATAAGGTTAATGAGTTCTTAGAAAAGATGGAAGAAGAGGGATTCGATCCCGACGTCGTGACATACAATACTTTGATCGATAGCTATTGTCGAAGAGGAAGATTAGATGATGCATTCTATTTATATAGGATAATGTTTAGAAGGGGTGTGATGCCTGATCTTGTTTCATATACTTCCTTGATGAATGGTCTTTGTAAGTTAGGAAGGGTAAGAGAGGCCCATCAGCTATTTCATCGAATGATCGATCGAGAATTGGATCCTGATGTCGTATTGTATAATACGCTAATTAATGCATATTGCAAGGATGGAAGGCTGCAAGAGGCAAGATCATTGCTACACGATATGACTCGAATTGGCATTTGCCCTGATAGTTTCACTTGTAGGATTATGGTGGAAGGATATGGAAGAGGAGGTAGCTTGATCTCAGCTTTGAATTTGGTTGTAGAACTTCGGAAACTTGGAACGATTGTTACTTACGACATATATGATTATCTCATCGTCTCATTGTGTCTGGAAGATCGTCCATTTGCAGCTAAGAGTGTTCTTGAAAGAGTCATTAAAGATGGTTTCCAACCTAATGCTTGTATCTACAACAAGCTGATTGAATGTTTCTGTAGAGTTCATAATGTTTCTGAGGCACTGCTTTTGAAATCTGAAATGATAAAGAGAAATTTTAAACTTAGCATTGATTCATACAAGCCTCTTATATCCTGTTTGTGTGGAGTCAATAGAAGTGTAGATGGTGAAGGTTTAATGGTAGAAATGGTTGAATCTGGAGTGCTTCCGGATCATCAAATATGCAGAGTATTGATAAATGGATACTGCAAAGAAGGGAATGTTTATAAAGCAGAATCATTATTGGTATCATTTGCTAAAGATTTTGAGTTCTTTGACACTGAAAGTTTCAATGCCTTGGTTAAATTTCATCGCGATTTTGGTAATGAAACGGAGTTGATGCAGCTGCAAGATCGAATGCTGAAAGTCGGTTTCGTTCCGAATAGCTTAACGTGCCGATACGTTATCCATGGATTATGGAAATCTGCAAGGCTCGACAAGCGGAGAGTTCAGGCTCCTCTCTGCCAAAGAGGGTGA

mRNA sequence

ATGCATCGAATTCCCGCTTCGAATCTCACCAAACTCTCGAAGCCCTTCTTCTTCCTCGCTTCAATTCCCAAATCAAATTTCTCTTCGGCTTCATCCTCAACTTTTTTACAATCAATTCCTCGGTCCGAAACTAAATTAATCGTCAACCCTCTTTACCATTTTCTCCCACAAAACCAAAACCCCTTCAACATCGTCGAACTCGTCTCCTCACACCTCAAAACCAGCAACACGCAGTTATCTCTTCTTCAATCCGACATTAAGGAGCTTCTTCCCCACTTGGGTCATCGTGAAATCTCCAAGATTATATTGAGGTGCCAATCTAATTTCGTCTCTGTTCTTGCTTTTTTCAATTGGGTTAAATTTGATTTGGGGATTACACTTAATTCCCAAAACTATTGTCTTATTATCCATATATTGGCATGGTCCCGACAATTTTCCATGGCGATGAAATTCTTGTCTGAACTGATTGAGTTGTCTAAGGATAATGCCTCAGGTAGTGAGGATGTTTTTCATAATCTGGTATTGTGCACTGAGCACTGTAATTGGAATCCTGTTATCTTTGAAATGCTTATGAAGGCGTATGTGAAAGTTCATATGATTCAGGAAAGTTATGAGAGCTTTAAGAAGATGGTGAAGATGGGTTTTGTTCCAAGTGTGATTGCTTGTAATTGTATTCTAAATGGACTGGCGAAGATGAAGTGTGATGCTCAATGTTGGGAGCTCTATGAAGAAATGGGAAGGATTGGAGTTCATTCAAATGCATATACTTTTAATATTTTGACTTATGTTTTGTGTAGAGCTGGGGATGTAAATAAGGTTAATGAGTTCTTAGAAAAGATGGAAGAAGAGGGATTCGATCCCGACGTCGTGACATACAATACTTTGATCGATAGCTATTGTCGAAGAGGAAGATTAGATGATGCATTCTATTTATATAGGATAATGTTTAGAAGGGGTGTGATGCCTGATCTTGTTTCATATACTTCCTTGATGAATGGTCTTTGTAAGTTAGGAAGGGTAAGAGAGGCCCATCAGCTATTTCATCGAATGATCGATCGAGAATTGGATCCTGATGTCGTATTGTATAATACGCTAATTAATGCATATTGCAAGGATGGAAGGCTGCAAGAGGCAAGATCATTGCTACACGATATGACTCGAATTGGCATTTGCCCTGATAGTTTCACTTGTAGGATTATGGTGGAAGGATATGGAAGAGGAGGTAGCTTGATCTCAGCTTTGAATTTGGTTGTAGAACTTCGGAAACTTGGAACGATTGTTACTTACGACATATATGATTATCTCATCGTCTCATTGTGTCTGGAAGATCGTCCATTTGCAGCTAAGAGTGTTCTTGAAAGAGTCATTAAAGATGGTTTCCAACCTAATGCTTGTATCTACAACAAGCTGATTGAATGTTTCTGTAGAGTTCATAATGTTTCTGAGGCACTGCTTTTGAAATCTGAAATGATAAAGAGAAATTTTAAACTTAGCATTGATTCATACAAGCCTCTTATATCCTGTTTGTGTGGAGTCAATAGAAGTGTAGATGGTGAAGGTTTAATGGTAGAAATGGTTGAATCTGGAGTGCTTCCGGATCATCAAATATGCAGAGTATTGATAAATGGATACTGCAAAGAAGGGAATGTTTATAAAGCAGAATCATTATTGGTATCATTTGCTAAAGATTTTGAGTTCTTTGACACTGAAAGTTTCAATGCCTTGGTTAAATTTCATCGCGATTTTGGTAATGAAACGGAGTTGATGCAGCTGCAAGATCGAATGCTGAAAGTCGGTTTCGTTCCGAATAGCTTAACGTGCCGATACGTTATCCATGGATTATGGAAATCTGCAAGGCTCGACAAGCGGAGAGTTCAGGCTCCTCTCTGCCAAAGAGGGTGA

Coding sequence (CDS)

ATGCATCGAATTCCCGCTTCGAATCTCACCAAACTCTCGAAGCCCTTCTTCTTCCTCGCTTCAATTCCCAAATCAAATTTCTCTTCGGCTTCATCCTCAACTTTTTTACAATCAATTCCTCGGTCCGAAACTAAATTAATCGTCAACCCTCTTTACCATTTTCTCCCACAAAACCAAAACCCCTTCAACATCGTCGAACTCGTCTCCTCACACCTCAAAACCAGCAACACGCAGTTATCTCTTCTTCAATCCGACATTAAGGAGCTTCTTCCCCACTTGGGTCATCGTGAAATCTCCAAGATTATATTGAGGTGCCAATCTAATTTCGTCTCTGTTCTTGCTTTTTTCAATTGGGTTAAATTTGATTTGGGGATTACACTTAATTCCCAAAACTATTGTCTTATTATCCATATATTGGCATGGTCCCGACAATTTTCCATGGCGATGAAATTCTTGTCTGAACTGATTGAGTTGTCTAAGGATAATGCCTCAGGTAGTGAGGATGTTTTTCATAATCTGGTATTGTGCACTGAGCACTGTAATTGGAATCCTGTTATCTTTGAAATGCTTATGAAGGCGTATGTGAAAGTTCATATGATTCAGGAAAGTTATGAGAGCTTTAAGAAGATGGTGAAGATGGGTTTTGTTCCAAGTGTGATTGCTTGTAATTGTATTCTAAATGGACTGGCGAAGATGAAGTGTGATGCTCAATGTTGGGAGCTCTATGAAGAAATGGGAAGGATTGGAGTTCATTCAAATGCATATACTTTTAATATTTTGACTTATGTTTTGTGTAGAGCTGGGGATGTAAATAAGGTTAATGAGTTCTTAGAAAAGATGGAAGAAGAGGGATTCGATCCCGACGTCGTGACATACAATACTTTGATCGATAGCTATTGTCGAAGAGGAAGATTAGATGATGCATTCTATTTATATAGGATAATGTTTAGAAGGGGTGTGATGCCTGATCTTGTTTCATATACTTCCTTGATGAATGGTCTTTGTAAGTTAGGAAGGGTAAGAGAGGCCCATCAGCTATTTCATCGAATGATCGATCGAGAATTGGATCCTGATGTCGTATTGTATAATACGCTAATTAATGCATATTGCAAGGATGGAAGGCTGCAAGAGGCAAGATCATTGCTACACGATATGACTCGAATTGGCATTTGCCCTGATAGTTTCACTTGTAGGATTATGGTGGAAGGATATGGAAGAGGAGGTAGCTTGATCTCAGCTTTGAATTTGGTTGTAGAACTTCGGAAACTTGGAACGATTGTTACTTACGACATATATGATTATCTCATCGTCTCATTGTGTCTGGAAGATCGTCCATTTGCAGCTAAGAGTGTTCTTGAAAGAGTCATTAAAGATGGTTTCCAACCTAATGCTTGTATCTACAACAAGCTGATTGAATGTTTCTGTAGAGTTCATAATGTTTCTGAGGCACTGCTTTTGAAATCTGAAATGATAAAGAGAAATTTTAAACTTAGCATTGATTCATACAAGCCTCTTATATCCTGTTTGTGTGGAGTCAATAGAAGTGTAGATGGTGAAGGTTTAATGGTAGAAATGGTTGAATCTGGAGTGCTTCCGGATCATCAAATATGCAGAGTATTGATAAATGGATACTGCAAAGAAGGGAATGTTTATAAAGCAGAATCATTATTGGTATCATTTGCTAAAGATTTTGAGTTCTTTGACACTGAAAGTTTCAATGCCTTGGTTAAATTTCATCGCGATTTTGGTAATGAAACGGAGTTGATGCAGCTGCAAGATCGAATGCTGAAAGTCGGTTTCGTTCCGAATAGCTTAACGTGCCGATACGTTATCCATGGATTATGGAAATCTGCAAGGCTCGACAAGCGGAGAGTTCAGGCTCCTCTCTGCCAAAGAGGGTGA

Protein sequence

MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQNPFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVKFDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHCNWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWELYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYCRRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVVLYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVELRKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNVSEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVLINGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGFVPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG
Homology
BLAST of Cp4.1LG01g23300 vs. ExPASy Swiss-Prot
Match: Q9FND8 (Pentatricopeptide repeat-containing protein At5g40400 OS=Arabidopsis thaliana OX=3702 GN=At5g40400 PE=2 SV=1)

HSP 1 Score: 603.6 bits (1555), Expect = 2.6e-171
Identity = 301/594 (50.67%), Positives = 417/594 (70.20%), Query Frame = 0

Query: 27  FSSASSSTFLQSIPRSET--KLIVNPLYHFLPQNQNPFNIVELVSSHLKTSNTQLSL--L 86
           FSS SSS     +PR     K I+NPLY+ LPQ+QNP  IV+++ S L  S+  + L  L
Sbjct: 11  FSSYSSSI----VPRCSNIPKPILNPLYNLLPQSQNPSKIVDVICSTLNHSDYSVLLPNL 70

Query: 87  QSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVKFDLGITLNSQNYCLIIHILAWS 146
           + ++K L+PHLG+ EIS+++LR QS+    + FF WVKFDLG   N  NYCL++HIL  S
Sbjct: 71  RDEVKSLIPHLGYPEISRVLLRFQSDASRAITFFKWVKFDLGKRPNVGNYCLLLHILVSS 130

Query: 147 RQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHCNWNPVIFEMLMKAYVKVHMIQE 206
           ++F +AM+FL ELIEL+  +     DVF  LV  T+ CNW+PV+F+ML+K Y+K+ +++E
Sbjct: 131 KKFPLAMQFLCELIELT--SKKEEVDVFRVLVSATDECNWDPVVFDMLVKGYLKLGLVEE 190

Query: 207 SYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWELYEEMGRIGVHSNAYTFNILTY 266
            +  F++++  GF  SV+ CN +LNGL K+     CW++Y  M R+G+H N YTFNILT 
Sbjct: 191 GFRVFREVLDSGFSVSVVTCNHLLNGLLKLDLMEDCWQVYSVMCRVGIHPNTYTFNILTN 250

Query: 267 VLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYCRRGRLDDAFYLYRIMFRRGVMP 326
           V C   +  +V++FLEKMEEEGF+PD+VTYNTL+ SYCRRGRL +AFYLY+IM+RR V+P
Sbjct: 251 VFCNDSNFREVDDFLEKMEEEGFEPDLVTYNTLVSSYCRRGRLKEAFYLYKIMYRRRVVP 310

Query: 327 DLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVVLYNTLINAYCKDGRLQEARSLL 386
           DLV+YTSL+ GLCK GRVREAHQ FHRM+DR + PD + YNTLI AYCK+G +Q+++ LL
Sbjct: 311 DLVTYTSLIKGLCKDGRVREAHQTFHRMVDRGIKPDCMSYNTLIYAYCKEGMMQQSKKLL 370

Query: 387 HDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVELRKLGTIVTYDIYDYLIVSLCLE 446
           H+M    + PD FTC+++VEG+ R G L+SA+N VVELR+L   + +++ D+LIVSLC E
Sbjct: 371 HEMLGNSVVPDRFTCKVIVEGFVREGRLLSAVNFVVELRRLKVDIPFEVCDFLIVSLCQE 430

Query: 447 DRPFAAKSVLERVI-KDGFQPNACIYNKLIECFCRVHNVSEALLLKSEMIKRNFKLSIDS 506
            +PFAAK +L+R+I ++G +     YN LIE   R   + EAL+LK ++  +N  L   +
Sbjct: 431 GKPFAAKHLLDRIIEEEGHEAKPETYNNLIESLSRCDAIEEALVLKGKLKNQNQVLDAKT 490

Query: 507 YKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVLINGYCKEGNVYKAESLLVSFA 566
           Y+ LI CLC + R+ + E LM EM +S V PD  IC  L+ GYCKE +  KAE LL  FA
Sbjct: 491 YRALIGCLCRIGRNREAESLMAEMFDSEVKPDSFICGALVYGYCKELDFDKAERLLSLFA 550

Query: 567 KDFEFFDTESFNALVKFHRDFG-NETELMQLQDRMLKVGFVPNSLTCRYVIHGL 615
            +F  FD ES+N+LVK   + G    + ++LQ+RM ++GFVPN LTC+Y+I  L
Sbjct: 551 MEFRIFDPESYNSLVKAVCETGCGYKKALELQERMQRLGFVPNRLTCKYLIQVL 598
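
The percentages in an HSP summary line are the match counts divided by the alignment length, which in BLAST output includes gap columns. The sketch below recomputes them for the At5g40400 hit above; `parse_hsp_stats` and `recomputed_percent` are hypothetical helpers written for this illustration, and the regular expression only covers the `name = n/m (p%)` fields of the line.

```python
import re

# Matches fields such as "Identity = 301/594 (50.67%)".
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {field name: (matches, alignment length, reported percent)}."""
    stats = {}
    for name, matches, aligned, percent in HSP_FIELD.findall(line):
        stats[name] = (int(matches), int(aligned), float(percent))
    return stats

def recomputed_percent(matches, aligned):
    """Percentage of alignment columns counted as matches."""
    return 100.0 * matches / aligned

line = "Identity = 301/594 (50.67%), Positives = 417/594 (70.20%), Query Frame = 0"
stats = parse_hsp_stats(line)
for name, (matches, aligned, reported) in stats.items():
    print(name, round(recomputed_percent(matches, aligned), 2), reported)
```

The same check can be applied to the summary line of any other hit in this section by swapping in its text.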

BLAST of Cp4.1LG01g23300 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 240.4 bits (612), Expect = 5.7e-62
Identity = 153/525 (29.14%), Positives = 263/525 (50.10%), Query Frame = 0

Query: 99  SKIILRCQSNFVSVLAFFNWVKFDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIEL 158
           S ++L+ Q++   +L F NW       TL  +  C+ +HIL   + +  A     ++   
Sbjct: 52  SNLLLKSQNDQALILKFLNWANPHQFFTLRCK--CITLHILTKFKLYKTAQILAEDVAAK 111

Query: 159 SKDNASGSEDVFHNLVLCTEHCNWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPS 218
           + D+   S  VF +L    + C     +F++++K+Y ++ +I ++          GF+P 
Sbjct: 112 TLDDEYASL-VFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPG 171

Query: 219 VIACNCILNGLAKMKCDAQCWE-LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFL 278
           V++ N +L+   + K +    E +++EM    V  N +T+NIL    C AG+++      
Sbjct: 172 VLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLF 231

Query: 279 EKMEEEGFDPDVVTYNTLIDSYCRRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKL 338
           +KME +G  P+VVTYNTLID YC+  ++DD F L R M  +G+ P+L+SY  ++NGLC+ 
Sbjct: 232 DKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCRE 291

Query: 339 GRVREAHQLFHRMIDRELDPDVVLYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTC 398
           GR++E   +   M  R    D V YNTLI  YCK+G   +A  +  +M R G+ P   T 
Sbjct: 292 GRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITY 351

Query: 399 RIMVEGYGRGGSLISALNLVVELRKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIK 458
             ++    + G++  A+  + ++R  G       Y  L+     +     A  VL  +  
Sbjct: 352 TSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMND 411

Query: 459 DGFQPNACIYNKLIECFCRVHNVSEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVD 518
           +GF P+   YN LI   C    + +A+ +  +M ++     + SY  ++S  C   RS D
Sbjct: 412 NGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFC---RSYD 471

Query: 519 -GEGLMV--EMVESGVLPDHQICRVLINGYCKEGNVYKAESLLVSFAKDFEFFDTESFNA 578
             E L V  EMVE G+ PD      LI G+C++    +A  L     +     D  ++ A
Sbjct: 472 VDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTA 531

Query: 579 LVKFHRDFGNETELMQLQDRMLKVGFVPNSLTCRYVIHGLWKSAR 620
           L+  +   G+  + +QL + M++ G +P+ +T   +I+GL K +R
Sbjct: 532 LINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSR 570

BLAST of Cp4.1LG01g23300 vs. ExPASy Swiss-Prot
Match: Q9LVQ5 (Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX=3702 GN=At5g55840 PE=3 SV=2)

HSP 1 Score: 239.6 bits (610), Expect = 9.8e-62
Identity = 151/507 (29.78%), Positives = 244/507 (48.13%), Query Frame = 0

Query: 113 LAFFNWVKFDLGITLNS--QNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVF 172
           L F  WV    G+  +   Q  C+  HIL  +R +  A   L EL  +S      S  VF
Sbjct: 54  LKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMS----GKSSFVF 113

Query: 173 HNLVLCTEHCNWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLA 232
             L+     CN NP ++++L++ Y++  MIQ+S E F+ M   GF PSV  CN IL  + 
Sbjct: 114 GALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVV 173

Query: 233 KMKCDAQCWELYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVV 292
           K   D   W   +EM +  +  +  TFNIL  VLC  G   K +  ++KME+ G+ P +V
Sbjct: 174 KSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIV 233

Query: 293 TYNTLIDSYCRRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRM 352
           TYNT++  YC++GR   A  L   M  +GV  D+ +Y  L++ LC+  R+ + + L   M
Sbjct: 234 TYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDM 293

Query: 353 IDRELDPDVVLYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSL 412
             R + P+ V YNTLIN +  +G++  A  LL++M   G+ P+  T   +++G+   G+ 
Sbjct: 294 RKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNF 353

Query: 413 ISALNLVVELRKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKL 472
             AL +   +   G   +   Y  L+  LC       A+    R+ ++G       Y  +
Sbjct: 354 KEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGM 413

Query: 473 IECFCRVHNVSEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGV 532
           I+  C+   + EA++L +EM K      I +Y  LI+  C V R    + ++  +   G+
Sbjct: 414 IDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGL 473

Query: 533 LPDHQICRVLINGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQ 592
            P+  I   LI   C+ G + +A  +  +   +    D  +FN LV      G   E  +
Sbjct: 474 SPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEE 533

Query: 593 LQDRMLKVGFVPNSLTCRYVIHGLWKS 618
               M   G +PN+++   +I+G   S
Sbjct: 534 FMRCMTSDGILPNTVSFDCLINGYGNS 556

BLAST of Cp4.1LG01g23300 vs. ExPASy Swiss-Prot
Match: O04504 (Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana OX=3702 GN=At1g09820 PE=2 SV=1)

HSP 1 Score: 237.3 bits (604), Expect = 4.8e-61
Identity = 143/505 (28.32%), Positives = 256/505 (50.69%), Query Frame = 0

Query: 113 LAFFNWVKFDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHN 172
           L +++W+  +  I+++ +    ++H LA ++++S    FL   +    D+   S  +FH 
Sbjct: 85  LRYYSWLVKNSDISVSLELTFKLLHSLANAKRYSKIRSFLDGFVRNGSDHQVHS--IFHA 144

Query: 173 LVLCTEHCNWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKM 232
           + +C   C  N +I +ML+ AY      +  +E+FK+    G+  S ++C  ++  L K 
Sbjct: 145 ISMCDNVC-VNSIIADMLVLAYANNSRFELGFEAFKRSGYYGYKLSALSCKPLMIALLKE 204

Query: 233 KCDAQCWELYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTY 292
              A    +Y+EM R  +  N +TFN++   LC+ G +NK  + +E M+  G  P+VV+Y
Sbjct: 205 NRSADVEYVYKEMIRRKIQPNVFTFNVVINALCKTGKMNKARDVMEDMKVYGCSPNVVSY 264

Query: 293 NTLIDSYCR---RGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHR 352
           NTLID YC+    G++  A  + + M    V P+L ++  L++G  K   +  + ++F  
Sbjct: 265 NTLIDGYCKLGGNGKMYKADAVLKEMVENDVSPNLTTFNILIDGFWKDDNLPGSMKVFKE 324

Query: 353 MIDRELDPDVVLYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGS 412
           M+D+++ P+V+ YN+LIN  C  G++ EA S+   M   G+ P+  T   ++ G+ +   
Sbjct: 325 MLDQDVKPNVISYNSLINGLCNGGKISEAISMRDKMVSAGVQPNLITYNALINGFCKNDM 384

Query: 413 LISALNLVVELRKLGTIVTYDIYDYLIVSLC---LEDRPFAAKSVLERVIKDGFQPNACI 472
           L  AL++   ++  G + T  +Y+ LI + C     D  FA K  +ER   +G  P+   
Sbjct: 385 LKEALDMFGSVKGQGAVPTTRMYNMLIDAYCKLGKIDDGFALKEEMER---EGIVPDVGT 444

Query: 473 YNKLIECFCRVHNVSEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMV 532
           YN LI   CR  N+  A  L  ++  +     + ++  L+   C    S     L+ EM 
Sbjct: 445 YNCLIAGLCRNGNIEAAKKLFDQLTSKGLP-DLVTFHILMEGYCRKGESRKAAMLLKEMS 504

Query: 533 ESGVLPDHQICRVLINGYCKEGNVYKAESLLVSFAKDFEF-FDTESFNALVKFHRDFGNE 592
           + G+ P H    +++ GYCKEGN+  A ++     K+     +  S+N L++ +   G  
Sbjct: 505 KMGLKPRHLTYNIVMKGYCKEGNLKAATNMRTQMEKERRLRMNVASYNVLLQGYSQKGKL 564

Query: 593 TELMQLQDRMLKVGFVPNSLTCRYV 611
            +   L + ML+ G VPN +T   V
Sbjct: 565 EDANMLLNEMLEKGLVPNRITYEIV 582

BLAST of Cp4.1LG01g23300 vs. ExPASy Swiss-Prot
Match: Q9LFC5 (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX=3702 GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 1.4e-60
Identity = 161/654 (24.62%), Positives = 292/654 (44.65%), Query Frame = 0

Query: 5   PASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQNPFNI 64
           P  NLT  S P F  +S   S+ SSAS S                          +   +
Sbjct: 20  PLKNLTTSSSPVFEPSSSSSSSSSSASFSV-------------------------SDSFL 79

Query: 65  VELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVKFDL- 124
           VE +   LK  N       ++++  L  L    + +++ RC+++      F + + F   
Sbjct: 80  VEKICFSLKQGN-------NNVRNHLIRLNPLAVVEVLYRCRNDLTLGQRFVDQLGFHFP 139

Query: 125 GITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHCNWN 184
                S +   +IHIL  S + S A   L  +I  S        ++ ++L     +C  N
Sbjct: 140 NFKHTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRS---GVSRLEIVNSLDSTFSNCGSN 199

Query: 185 PVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWELYE 244
             +F++L++ YV+   ++E++E+F  +   GF  S+ ACN ++  L ++      W +Y+
Sbjct: 200 DSVFDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQ 259

Query: 245 EMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYCRRG 304
           E+ R GV  N YT NI+   LC+ G + KV  FL +++E+G  PD+VTYNTLI +Y  +G
Sbjct: 260 EISRSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKG 319

Query: 305 RLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVVLYN 364
            +++AF L   M  +G  P + +Y +++NGLCK G+   A ++F  M+   L PD   Y 
Sbjct: 320 LMEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYR 379

Query: 365 TLINAYCKDGRLQEARSLLHDM-----------------------------------TRI 424
           +L+   CK G + E   +  DM                                      
Sbjct: 380 SLLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEA 439

Query: 425 GICPDSFTCRIMVEGYGRGGSLISALNLVVELRKLGTIVTYDIYDYLIVSLCLEDRPFAA 484
           G+ PD+    I+++GY R G +  A+NL  E+ + G  +    Y+ ++  LC       A
Sbjct: 440 GLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEA 499

Query: 485 KSVLERVIKDGFQPNACIYNKLIECFCRVHNVSEALLLKSEMIKRNFKLSIDSYKPLISC 544
             +   + +    P++     LI+  C++ N+  A+ L  +M ++  +L + +Y  L+  
Sbjct: 500 DKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDG 559

Query: 545 LCGVNRSVDGEGLMVEMVESGVLPDHQICRVLINGYCKEGNVYKAESLLVSFAKDFEFFD 604
              V      + +  +MV   +LP      +L+N  C +G++ +A  +            
Sbjct: 560 FGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPT 619

Query: 605 TESFNALVKFHRDFGNETELMQLQDRMLKVGFVPNSLTCRYVIHGLWKSARLDK 623
               N+++K +   GN ++     ++M+  GFVP+ ++   +I+G  +   + K
Sbjct: 620 VMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSK 638

BLAST of Cp4.1LG01g23300 vs. NCBI nr
Match: XP_023535833.1 (pentatricopeptide repeat-containing protein At5g40400 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1281 bits (3315), Expect = 0.0
Identity = 633/633 (100.00%), Positives = 633/633 (100.00%), Query Frame = 0

Query: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN 60
           MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN
Sbjct: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN 60

Query: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120
           PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK
Sbjct: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120

Query: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180
           FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC
Sbjct: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180

Query: 181 NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240
           NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE
Sbjct: 181 NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360
           RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360

Query: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420
           LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL
Sbjct: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420

Query: 421 RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV 480
           RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV
Sbjct: 421 RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV 480

Query: 481 SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL 540
           SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL
Sbjct: 481 SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL 540

Query: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600
           INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF
Sbjct: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600

Query: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633
           VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG
Sbjct: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633

BLAST of Cp4.1LG01g23300 vs. NCBI nr
Match: XP_022922085.1 (pentatricopeptide repeat-containing protein At5g40400 [Cucurbita moschata])

HSP 1 Score: 1258 bits (3256), Expect = 0.0
Identity = 619/633 (97.79%), Positives = 627/633 (99.05%), Query Frame = 0

Query: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN 60
           MHRIPASNLTKLSKPF FLASIPKSNFSSASSSTFL SIPRSETKLIVNPLYHFLPQNQN
Sbjct: 1   MHRIPASNLTKLSKPFVFLASIPKSNFSSASSSTFLPSIPRSETKLIVNPLYHFLPQNQN 60

Query: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120
           PFNIVELVSSHLKTSNT LSLLQSDIKELLPHLGHRE+SKIILRCQSNFVSVLAFFNWVK
Sbjct: 61  PFNIVELVSSHLKTSNTNLSLLQSDIKELLPHLGHREVSKIILRCQSNFVSVLAFFNWVK 120

Query: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180
           FDLGITL+SQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC
Sbjct: 121 FDLGITLSSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180

Query: 181 NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240
           NWNPVIFEML+KAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE
Sbjct: 181 NWNPVIFEMLIKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCR GDVNKVNEFLEKMEEEGFDPDVVTYNTLI+SYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRVGDVNKVNEFLEKMEEEGFDPDVVTYNTLINSYC 300

Query: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360
           RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360

Query: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420
           LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGG LISALNLVVEL
Sbjct: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGRLISALNLVVEL 420

Query: 421 RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV 480
           RKLGTIVTY+IYDYLIVSLCLEDRPFAAKSVLER+IKDGFQPNACIYNKLIE FCRVHNV
Sbjct: 421 RKLGTIVTYEIYDYLIVSLCLEDRPFAAKSVLERIIKDGFQPNACIYNKLIESFCRVHNV 480

Query: 481 SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL 540
           SEALLLKSEM+KRNFKLSIDSYKPLISCLCG+NRSVDGEGLMVEMVESGVLPDHQICRVL
Sbjct: 481 SEALLLKSEMVKRNFKLSIDSYKPLISCLCGINRSVDGEGLMVEMVESGVLPDHQICRVL 540

Query: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600
           INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF
Sbjct: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600

Query: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633
           VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG
Sbjct: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633

BLAST of Cp4.1LG01g23300 vs. NCBI nr
Match: KAG7033098.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1257 bits (3252), Expect = 0.0
Identity = 618/633 (97.63%), Positives = 626/633 (98.89%), Query Frame = 0

Query: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN 60
           MHRIPASNLTKLSKPF FLASIPKSNFSSASSSTFL SIPRSETKL VNPLYHFLPQNQN
Sbjct: 1   MHRIPASNLTKLSKPFVFLASIPKSNFSSASSSTFLPSIPRSETKLTVNPLYHFLPQNQN 60

Query: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120
           PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK
Sbjct: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120

Query: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180
           FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC
Sbjct: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180

Query: 181 NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240
           NWNPVIFEML+KAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE
Sbjct: 181 NWNPVIFEMLIKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCR GDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRVGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360
           RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360

Query: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420
           LYNTLI+AYCKDGRLQEARSLLHDM RIGICPDSFTCRIMVEGYGRGGSL+SALNLVVEL
Sbjct: 361 LYNTLIHAYCKDGRLQEARSLLHDMIRIGICPDSFTCRIMVEGYGRGGSLVSALNLVVEL 420

Query: 421 RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV 480
           RKLGTIVTY+IYDYLIVSLCLEDRPFAAKSVLER+IKDGFQPNACIYNKLIE FCRVHNV
Sbjct: 421 RKLGTIVTYEIYDYLIVSLCLEDRPFAAKSVLERIIKDGFQPNACIYNKLIESFCRVHNV 480

Query: 481 SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL 540
           SEALLLKSEM+KRNFKLSIDSYKPLISCLCG+NRSVDGEGLMVEM ESGVLPDHQICRVL
Sbjct: 481 SEALLLKSEMVKRNFKLSIDSYKPLISCLCGINRSVDGEGLMVEMAESGVLPDHQICRVL 540

Query: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600
           INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF
Sbjct: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600

Query: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633
           VPNSLTCRYVIHGLWKSARLD+RRVQAPLCQRG
Sbjct: 601 VPNSLTCRYVIHGLWKSARLDERRVQAPLCQRG 633

BLAST of Cp4.1LG01g23300 vs. NCBI nr
Match: KAG6602420.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1254 bits (3245), Expect = 0.0
Identity = 618/633 (97.63%), Positives = 625/633 (98.74%), Query Frame = 0

Query: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN 60
           MHRIPASNLTKLSKPF FLASIPKSNFSSASSSTFL SIPRSETKL VNPLYHFLPQNQN
Sbjct: 1   MHRIPASNLTKLSKPFVFLASIPKSNFSSASSSTFLPSIPRSETKLTVNPLYHFLPQNQN 60

Query: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120
           PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK
Sbjct: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120

Query: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180
           FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC
Sbjct: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180

Query: 181 NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240
           NWNPVIFEML+KAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE
Sbjct: 181 NWNPVIFEMLIKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCR GDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRVGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360
           RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360

Query: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420
           LYNTLI+AYCKDGRLQEARSLLHDM RIGICPDSFTCRIMVEGYGRGGSL+SALNLVVEL
Sbjct: 361 LYNTLIHAYCKDGRLQEARSLLHDMIRIGICPDSFTCRIMVEGYGRGGSLVSALNLVVEL 420

Query: 421 RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV 480
           RKLGTIVTY+IYDYLIVSLCLEDRPFAAKSVLER+IKDGFQPNACIYNKLIE FCRVHNV
Sbjct: 421 RKLGTIVTYEIYDYLIVSLCLEDRPFAAKSVLERIIKDGFQPNACIYNKLIESFCRVHNV 480

Query: 481 SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL 540
           SEALLLKSEM+KRNFKLSIDSYKPLISCLCG+NRSVDGEGLMVEM ESGVLPDHQICRVL
Sbjct: 481 SEALLLKSEMVKRNFKLSIDSYKPLISCLCGINRSVDGEGLMVEMAESGVLPDHQICRVL 540

Query: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600
           INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF
Sbjct: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600

Query: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633
           VPNSLTCRYVI GLWKSARLDKRRVQAPLCQRG
Sbjct: 601 VPNSLTCRYVILGLWKSARLDKRRVQAPLCQRG 633

BLAST of Cp4.1LG01g23300 vs. NCBI nr
Match: XP_022991028.1 (pentatricopeptide repeat-containing protein At5g40400 [Cucurbita maxima])

HSP 1 Score: 1250 bits (3234), Expect = 0.0
Identity = 615/633 (97.16%), Positives = 623/633 (98.42%), Query Frame = 0

Query: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN 60
           MHRIPASNLTKLSKPF FLASIPKSNFSSASSSTFLQS+PRSETKLIVNPLYHFLPQNQN
Sbjct: 1   MHRIPASNLTKLSKPFVFLASIPKSNFSSASSSTFLQSVPRSETKLIVNPLYHFLPQNQN 60

Query: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120
           PFNIVELVSSHLKTSNT LSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK
Sbjct: 61  PFNIVELVSSHLKTSNTHLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120

Query: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180
           FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC
Sbjct: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180

Query: 181 NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240
           NWNPVIFEML+KAYVKVHMIQESYESFKKMVKMGFVPSVIACN ILNGLAKMKCDAQCWE
Sbjct: 181 NWNPVIFEMLIKAYVKVHMIQESYESFKKMVKMGFVPSVIACNRILNGLAKMKCDAQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCR GDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRVGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360
           RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCK GRVREAHQLFHRMIDRELDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKFGRVREAHQLFHRMIDRELDPDVV 360

Query: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420
           LYNTLINAYCKDGRLQEARSLLHDM RIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL
Sbjct: 361 LYNTLINAYCKDGRLQEARSLLHDMIRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420

Query: 421 RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV 480
           R+LGTIVT+DIYDYLIVSLC EDRPFAAKSVLER+IKDGFQPNACIYNKLIECFCRVHNV
Sbjct: 421 RRLGTIVTFDIYDYLIVSLCQEDRPFAAKSVLERIIKDGFQPNACIYNKLIECFCRVHNV 480

Query: 481 SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL 540
           SEALLLKSEMI+RN KLSIDSYKPLISCLCG+NRSVDGE LMVEMVESGVLPDHQICRVL
Sbjct: 481 SEALLLKSEMIRRNLKLSIDSYKPLISCLCGINRSVDGEDLMVEMVESGVLPDHQICRVL 540

Query: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600
           INGYCKEGN YKAESLLVSFAKDFEFFDTESFNAL+KFHRDFGNETELMQLQDRMLKVGF
Sbjct: 541 INGYCKEGNAYKAESLLVSFAKDFEFFDTESFNALLKFHRDFGNETELMQLQDRMLKVGF 600

Query: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633
           VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG
Sbjct: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633

BLAST of Cp4.1LG01g23300 vs. ExPASy TrEMBL
Match: A0A6J1E7L8 (pentatricopeptide repeat-containing protein At5g40400 OS=Cucurbita moschata OX=3662 GN=LOC111430138 PE=4 SV=1)

HSP 1 Score: 1258 bits (3256), Expect = 0.0
Identity = 619/633 (97.79%), Positives = 627/633 (99.05%), Query Frame = 0

Query: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN 60
           MHRIPASNLTKLSKPF FLASIPKSNFSSASSSTFL SIPRSETKLIVNPLYHFLPQNQN
Sbjct: 1   MHRIPASNLTKLSKPFVFLASIPKSNFSSASSSTFLPSIPRSETKLIVNPLYHFLPQNQN 60

Query: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120
           PFNIVELVSSHLKTSNT LSLLQSDIKELLPHLGHRE+SKIILRCQSNFVSVLAFFNWVK
Sbjct: 61  PFNIVELVSSHLKTSNTNLSLLQSDIKELLPHLGHREVSKIILRCQSNFVSVLAFFNWVK 120

Query: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180
           FDLGITL+SQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC
Sbjct: 121 FDLGITLSSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180

Query: 181 NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240
           NWNPVIFEML+KAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE
Sbjct: 181 NWNPVIFEMLIKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCR GDVNKVNEFLEKMEEEGFDPDVVTYNTLI+SYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRVGDVNKVNEFLEKMEEEGFDPDVVTYNTLINSYC 300

Query: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360
           RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360

Query: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420
           LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGG LISALNLVVEL
Sbjct: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGRLISALNLVVEL 420

Query: 421 RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV 480
           RKLGTIVTY+IYDYLIVSLCLEDRPFAAKSVLER+IKDGFQPNACIYNKLIE FCRVHNV
Sbjct: 421 RKLGTIVTYEIYDYLIVSLCLEDRPFAAKSVLERIIKDGFQPNACIYNKLIESFCRVHNV 480

Query: 481 SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL 540
           SEALLLKSEM+KRNFKLSIDSYKPLISCLCG+NRSVDGEGLMVEMVESGVLPDHQICRVL
Sbjct: 481 SEALLLKSEMVKRNFKLSIDSYKPLISCLCGINRSVDGEGLMVEMVESGVLPDHQICRVL 540

Query: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600
           INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF
Sbjct: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600

Query: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633
           VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG
Sbjct: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633
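
The Identity and Positives figures in an HSP header can be recomputed from the middle line of the alignment, where a letter marks an identical residue, "+" a conservative substitution, and a blank a mismatch or gap. The following is a minimal Python sketch, assuming the Query, middle, and Sbjct rows of all blocks of one HSP have already been concatenated into three full-length strings. The helper name hsp_stats is hypothetical and is not part of BLAST or of this database.

```python
def hsp_stats(query: str, midline: str, sbjct: str) -> dict:
    """Recompute identity and positive counts for one aligned BLAST HSP.

    Assumption: the three strings are the concatenated Query, middle, and
    Sbjct rows of a single HSP, with gap characters ('-') left in place.
    """
    if len(query) != len(sbjct):
        raise ValueError("query and subject must have the same aligned length")

    # Text dumps often strip trailing blanks from the middle line; restore them.
    midline = midline.ljust(len(query))

    length = len(query)                                       # alignment columns
    identities = sum(1 for ch in midline if ch.isalpha())     # letters = identical
    positives = identities + midline.count("+")               # '+' = conservative
    gaps = sum(1 for q, s in zip(query, sbjct) if q == "-" or s == "-")

    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "gaps": gaps,
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
    }


if __name__ == "__main__":
    # Toy five-column alignment, not taken from the hits on this page.
    print(hsp_stats("MKAYV", "M+A V", "MRAFV"))
```

For the HSP above, the middle line holds 619 letters and 8 "+" signs over 633 columns, which matches the reported 97.79% identity and 99.05% positives.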

BLAST of Cp4.1LG01g23300 vs. ExPASy TrEMBL
Match: A0A6J1JUZ7 (pentatricopeptide repeat-containing protein At5g40400 OS=Cucurbita maxima OX=3661 GN=LOC111487742 PE=4 SV=1)

HSP 1 Score: 1250 bits (3234), Expect = 0.0
Identity = 615/633 (97.16%), Positives = 623/633 (98.42%), Query Frame = 0

Query: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN 60
           MHRIPASNLTKLSKPF FLASIPKSNFSSASSSTFLQS+PRSETKLIVNPLYHFLPQNQN
Sbjct: 1   MHRIPASNLTKLSKPFVFLASIPKSNFSSASSSTFLQSVPRSETKLIVNPLYHFLPQNQN 60

Query: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120
           PFNIVELVSSHLKTSNT LSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK
Sbjct: 61  PFNIVELVSSHLKTSNTHLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120

Query: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180
           FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC
Sbjct: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180

Query: 181 NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240
           NWNPVIFEML+KAYVKVHMIQESYESFKKMVKMGFVPSVIACN ILNGLAKMKCDAQCWE
Sbjct: 181 NWNPVIFEMLIKAYVKVHMIQESYESFKKMVKMGFVPSVIACNRILNGLAKMKCDAQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCR GDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRVGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360
           RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCK GRVREAHQLFHRMIDRELDPDVV
Sbjct: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKFGRVREAHQLFHRMIDRELDPDVV 360

Query: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420
           LYNTLINAYCKDGRLQEARSLLHDM RIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL
Sbjct: 361 LYNTLINAYCKDGRLQEARSLLHDMIRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420

Query: 421 RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV 480
           R+LGTIVT+DIYDYLIVSLC EDRPFAAKSVLER+IKDGFQPNACIYNKLIECFCRVHNV
Sbjct: 421 RRLGTIVTFDIYDYLIVSLCQEDRPFAAKSVLERIIKDGFQPNACIYNKLIECFCRVHNV 480

Query: 481 SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL 540
           SEALLLKSEMI+RN KLSIDSYKPLISCLCG+NRSVDGE LMVEMVESGVLPDHQICRVL
Sbjct: 481 SEALLLKSEMIRRNLKLSIDSYKPLISCLCGINRSVDGEDLMVEMVESGVLPDHQICRVL 540

Query: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600
           INGYCKEGN YKAESLLVSFAKDFEFFDTESFNAL+KFHRDFGNETELMQLQDRMLKVGF
Sbjct: 541 INGYCKEGNAYKAESLLVSFAKDFEFFDTESFNALLKFHRDFGNETELMQLQDRMLKVGF 600

Query: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633
           VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG
Sbjct: 601 VPNSLTCRYVIHGLWKSARLDKRRVQAPLCQRG 633

BLAST of Cp4.1LG01g23300 vs. ExPASy TrEMBL
Match: A0A0A0KSF1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G523120 PE=4 SV=1)

HSP 1 Score: 1027 bits (2655), Expect = 0.0
Identity = 507/633 (80.09%), Positives = 562/633 (88.78%), Query Frame = 0

Query: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSS-------TFLQSIPRSETKLIVNPLYH 60
           M R  ASNL KL++ FFF  S  KS  SS+SSS       TFLQSIP SE KLIVNPLYH
Sbjct: 1   MLRTTASNLPKLAQSFFFHTSFSKSTLSSSSSSSSSSSSSTFLQSIPESEAKLIVNPLYH 60

Query: 61  FLPQNQNPFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVL 120
           FLPQNQNPFNIVELVSSHLKT+N +L+LLQS IKEL+PHLGHR+ISKI+LRCQSNFVS L
Sbjct: 61  FLPQNQNPFNIVELVSSHLKTNNPRLALLQSHIKELIPHLGHRQISKILLRCQSNFVSAL 120

Query: 121 AFFNWVKFDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNL 180
           AFFNWVK+DL I L+S NYCLIIHILAWSRQF +AMKFLSELIELSKD  S SEDVF NL
Sbjct: 121 AFFNWVKYDLDIRLSSHNYCLIIHILAWSRQFPLAMKFLSELIELSKD-VSSSEDVFQNL 180

Query: 181 VLCTEHCNWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMK 240
           VLCTEHCNWNPVIFEML+KAYVK+ +I ESY SFKKMVK+GFVP+VIACNCILNGLAKMK
Sbjct: 181 VLCTEHCNWNPVIFEMLIKAYVKLDLIHESYWSFKKMVKLGFVPNVIACNCILNGLAKMK 240

Query: 241 CDAQCWELYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYN 300
            DAQCWELYEEMGRIGVHSNAYTFNILTYVLCR GDVNK+N FLEKMEEEGFDPDVVTYN
Sbjct: 241 SDAQCWELYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINGFLEKMEEEGFDPDVVTYN 300

Query: 301 TLIDSYCRRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDR 360
           TLIDSY RRGRL+DAFYLY+IM+RRGVMPDLVSYTSLM GLC+LGRVREAHQLFHRMIDR
Sbjct: 301 TLIDSYVRRGRLEDAFYLYKIMYRRGVMPDLVSYTSLMRGLCRLGRVREAHQLFHRMIDR 360

Query: 361 ELDPDVVLYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISA 420
            +DPDVVLYNTLI AYCKDG LQEARSLLH+M  IGI PDSFTCRI+VEGYGR G LISA
Sbjct: 361 GMDPDVVLYNTLIGAYCKDGMLQEARSLLHEMIGIGIHPDSFTCRILVEGYGREGRLISA 420

Query: 421 LNLVVELRKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIEC 480
           LNLVVE++KLG  V +DIY YLI+SLC EDRPFAAKS+LER+++D FQP++ IYNKLIE 
Sbjct: 421 LNLVVEIQKLGVTVAHDIYKYLIISLCREDRPFAAKSLLERILEDSFQPDSDIYNKLIES 480

Query: 481 FCRVHNVSEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPD 540
           FCR +NVSEALLLK EMI RN+K + D+YK LI C+C +NRSVDGEGLMVEMVES V+PD
Sbjct: 481 FCRSNNVSEALLLKLEMINRNYKPTTDTYKSLIHCMCEINRSVDGEGLMVEMVESEVIPD 540

Query: 541 HQICRVLINGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQD 600
           H+ICR L+NGYCKEGN  KAESLLVSFAKDF+FFD+ESFN+LVK +RD GNET+LM+LQD
Sbjct: 541 HEICRALVNGYCKEGNADKAESLLVSFAKDFQFFDSESFNSLVKVYRDVGNETKLMELQD 600

Query: 601 RMLKVGFVPNSLTCRYVIHGLWKSARLDKRRVQ 626
           RMLK GF+PNSLTCRY+IHG+WKS RL+K+RVQ
Sbjct: 601 RMLKAGFLPNSLTCRYIIHGIWKSMRLNKQRVQ 632

BLAST of Cp4.1LG01g23300 vs. ExPASy TrEMBL
Match: A0A1S3C995 (pentatricopeptide repeat-containing protein At5g40400 OS=Cucumis melo OX=3656 GN=LOC103498097 PE=4 SV=1)

HSP 1 Score: 1017 bits (2630), Expect = 0.0
Identity = 501/627 (79.90%), Positives = 557/627 (88.84%), Query Frame = 0

Query: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN 60
           M R  ASNL KLS+ FFFL+S  KS  SS+SSSTFLQSIP SE K IVNPLYHFLPQNQN
Sbjct: 1   MLRTTASNLPKLSQSFFFLSSFSKSTLSSSSSSTFLQSIPESEAKSIVNPLYHFLPQNQN 60

Query: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120
           PFNIVELVS HLKT+N +L+LLQ++IK L+P+LGH +ISKI+LRCQSNFVS LAFFNWVK
Sbjct: 61  PFNIVELVSLHLKTNNPRLALLQANIKGLIPYLGHCQISKILLRCQSNFVSALAFFNWVK 120

Query: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180
           +DL I LNS NYCLIIHILAWSRQF +AMK LSELIELSKD  S SEDVF NLVLCTEHC
Sbjct: 121 YDLDIRLNSHNYCLIIHILAWSRQFPLAMKLLSELIELSKD-VSSSEDVFQNLVLCTEHC 180

Query: 181 NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240
           NWNPVIFEML+KAYVK+ +I ESY SFKKMVK+GFVPSVIACNCIL+GLAKMK D QCWE
Sbjct: 181 NWNPVIFEMLIKAYVKLDLIHESYWSFKKMVKLGFVPSVIACNCILHGLAKMKSDGQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCR GDVNK+NEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360
           RRGRLDDAFYLYRIM+RR VMPDLVSYTSLM GLC+LGRVREAHQLFHRMIDR +DPDVV
Sbjct: 301 RRGRLDDAFYLYRIMYRRSVMPDLVSYTSLMRGLCRLGRVREAHQLFHRMIDRGMDPDVV 360

Query: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420
            YNTLI AYCKDG LQEARSLLHDM  IGI PD+FTCRI+VEGYGR G LISALNLVVE+
Sbjct: 361 SYNTLIGAYCKDGMLQEARSLLHDMIGIGIHPDNFTCRILVEGYGREGRLISALNLVVEI 420

Query: 421 RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV 480
           +KLG  + +DIY YLI+SLC EDRPFAAKS+LER+++D FQP++ IYNKLIE FCR +NV
Sbjct: 421 QKLGVTIAHDIYKYLIISLCQEDRPFAAKSLLERILEDRFQPDSDIYNKLIESFCRSNNV 480

Query: 481 SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL 540
           SEALLLKSEMI RNFK +I +YK LI C+C +NRSVDGEGLM EMVES VLPDH+ICR L
Sbjct: 481 SEALLLKSEMINRNFKPTIYTYKSLIHCMCEINRSVDGEGLMEEMVESEVLPDHEICRAL 540

Query: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600
           +NGYCKEGN  KAESLLVSFAKDF+FFD+ESFN+LVK + D GNET+LM+LQ RMLK GF
Sbjct: 541 VNGYCKEGNADKAESLLVSFAKDFQFFDSESFNSLVKVYCDMGNETKLMELQTRMLKAGF 600

Query: 601 VPNSLTCRYVIHGLWKSARLDKRRVQA 627
           +PN+LTC+Y+IHGLWK  RL+++RVQA
Sbjct: 601 LPNNLTCQYIIHGLWKFTRLNEQRVQA 626

BLAST of Cp4.1LG01g23300 vs. ExPASy TrEMBL
Match: A0A5D3BTR0 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold374G00460 PE=4 SV=1)

HSP 1 Score: 1016 bits (2627), Expect = 0.0
Identity = 500/627 (79.74%), Positives = 557/627 (88.84%), Query Frame = 0

Query: 1   MHRIPASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQN 60
           M R  ASNL KLS+ FFFL+S  KS  SS+SSSTFLQSIP SE K IVNPLYHFLPQNQN
Sbjct: 1   MLRTTASNLPKLSQSFFFLSSFSKSTLSSSSSSTFLQSIPESEAKSIVNPLYHFLPQNQN 60

Query: 61  PFNIVELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVK 120
           PFNIVELVS HLKT+N +L+LLQ++IK L+P+LGH +ISKI+LRCQSNFVS LAFFNWVK
Sbjct: 61  PFNIVELVSLHLKTNNPRLALLQANIKGLIPYLGHCQISKILLRCQSNFVSALAFFNWVK 120

Query: 121 FDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHC 180
           +DL I LNS NYCLIIHILAWSRQF +AMK LSELIELSKD  S SEDVF NLVLCTEHC
Sbjct: 121 YDLDIRLNSHNYCLIIHILAWSRQFPLAMKLLSELIELSKD-VSSSEDVFQNLVLCTEHC 180

Query: 181 NWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWE 240
           NWNPVIFEML+KAYVK+ +I ESY SFK+MVK+GFVPSVIACNCIL+GLAKMK D QCWE
Sbjct: 181 NWNPVIFEMLIKAYVKLDLIHESYWSFKRMVKLGFVPSVIACNCILHGLAKMKSDGQCWE 240

Query: 241 LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYC 300
           LYEEMGRIGVHSNAYTFNILTYVLCR GDVNK+NEFLEKMEEEGFDPDVVTYNTLIDSYC
Sbjct: 241 LYEEMGRIGVHSNAYTFNILTYVLCRDGDVNKINEFLEKMEEEGFDPDVVTYNTLIDSYC 300

Query: 301 RRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVV 360
           RRGRLDDAFYLYRIM+RR VMPDLVSYTSLM GLC+LGRVREAHQLFHRMIDR +DPDVV
Sbjct: 301 RRGRLDDAFYLYRIMYRRSVMPDLVSYTSLMRGLCRLGRVREAHQLFHRMIDRGMDPDVV 360

Query: 361 LYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVEL 420
            YNTLI AYCKDG LQEARSLLHDM  IGI PD+FTCRI+VEGYGR G LISALNLVVE+
Sbjct: 361 SYNTLIGAYCKDGMLQEARSLLHDMIGIGIHPDNFTCRILVEGYGREGRLISALNLVVEI 420

Query: 421 RKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKLIECFCRVHNV 480
           +KLG  + +DIY YLI+SLC EDRPFAAKS+LER+++D FQP++ IYNKLIE FCR +NV
Sbjct: 421 QKLGVTIAHDIYKYLIISLCQEDRPFAAKSLLERILEDRFQPDSDIYNKLIESFCRSNNV 480

Query: 481 SEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVL 540
           SEALLLKSEMI RNFK +I +YK LI C+C +NRSVDGEGLM EMVES VLPDH+ICR L
Sbjct: 481 SEALLLKSEMINRNFKPTIYTYKSLIHCMCEINRSVDGEGLMEEMVESEVLPDHEICRAL 540

Query: 541 INGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQLQDRMLKVGF 600
           +NGYCKEGN  KAESLLVSFAKDF+FFD+ESFN+LVK + D GNET+LM+LQ RMLK GF
Sbjct: 541 VNGYCKEGNADKAESLLVSFAKDFQFFDSESFNSLVKVYCDMGNETKLMELQTRMLKAGF 600

Query: 601 VPNSLTCRYVIHGLWKSARLDKRRVQA 627
           +PN+LTC+Y+IHGLWK  RL+++RVQA
Sbjct: 601 LPNNLTCQYIIHGLWKFTRLNEQRVQA 626

BLAST of Cp4.1LG01g23300 vs. TAIR 10
Match: AT5G40400.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 603.6 bits (1555), Expect = 1.8e-172
Identity = 301/594 (50.67%), Positives = 417/594 (70.20%), Query Frame = 0

Query: 27  FSSASSSTFLQSIPRSET--KLIVNPLYHFLPQNQNPFNIVELVSSHLKTSNTQLSL--L 86
           FSS SSS     +PR     K I+NPLY+ LPQ+QNP  IV+++ S L  S+  + L  L
Sbjct: 11  FSSYSSSI----VPRCSNIPKPILNPLYNLLPQSQNPSKIVDVICSTLNHSDYSVLLPNL 70

Query: 87  QSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVKFDLGITLNSQNYCLIIHILAWS 146
           + ++K L+PHLG+ EIS+++LR QS+    + FF WVKFDLG   N  NYCL++HIL  S
Sbjct: 71  RDEVKSLIPHLGYPEISRVLLRFQSDASRAITFFKWVKFDLGKRPNVGNYCLLLHILVSS 130

Query: 147 RQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHCNWNPVIFEMLMKAYVKVHMIQE 206
           ++F +AM+FL ELIEL+  +     DVF  LV  T+ CNW+PV+F+ML+K Y+K+ +++E
Sbjct: 131 KKFPLAMQFLCELIELT--SKKEEVDVFRVLVSATDECNWDPVVFDMLVKGYLKLGLVEE 190

Query: 207 SYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWELYEEMGRIGVHSNAYTFNILTY 266
            +  F++++  GF  SV+ CN +LNGL K+     CW++Y  M R+G+H N YTFNILT 
Sbjct: 191 GFRVFREVLDSGFSVSVVTCNHLLNGLLKLDLMEDCWQVYSVMCRVGIHPNTYTFNILTN 250

Query: 267 VLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYCRRGRLDDAFYLYRIMFRRGVMP 326
           V C   +  +V++FLEKMEEEGF+PD+VTYNTL+ SYCRRGRL +AFYLY+IM+RR V+P
Sbjct: 251 VFCNDSNFREVDDFLEKMEEEGFEPDLVTYNTLVSSYCRRGRLKEAFYLYKIMYRRRVVP 310

Query: 327 DLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVVLYNTLINAYCKDGRLQEARSLL 386
           DLV+YTSL+ GLCK GRVREAHQ FHRM+DR + PD + YNTLI AYCK+G +Q+++ LL
Sbjct: 311 DLVTYTSLIKGLCKDGRVREAHQTFHRMVDRGIKPDCMSYNTLIYAYCKEGMMQQSKKLL 370

Query: 387 HDMTRIGICPDSFTCRIMVEGYGRGGSLISALNLVVELRKLGTIVTYDIYDYLIVSLCLE 446
           H+M    + PD FTC+++VEG+ R G L+SA+N VVELR+L   + +++ D+LIVSLC E
Sbjct: 371 HEMLGNSVVPDRFTCKVIVEGFVREGRLLSAVNFVVELRRLKVDIPFEVCDFLIVSLCQE 430

Query: 447 DRPFAAKSVLERVI-KDGFQPNACIYNKLIECFCRVHNVSEALLLKSEMIKRNFKLSIDS 506
            +PFAAK +L+R+I ++G +     YN LIE   R   + EAL+LK ++  +N  L   +
Sbjct: 431 GKPFAAKHLLDRIIEEEGHEAKPETYNNLIESLSRCDAIEEALVLKGKLKNQNQVLDAKT 490

Query: 507 YKPLISCLCGVNRSVDGEGLMVEMVESGVLPDHQICRVLINGYCKEGNVYKAESLLVSFA 566
           Y+ LI CLC + R+ + E LM EM +S V PD  IC  L+ GYCKE +  KAE LL  FA
Sbjct: 491 YRALIGCLCRIGRNREAESLMAEMFDSEVKPDSFICGALVYGYCKELDFDKAERLLSLFA 550

Query: 567 KDFEFFDTESFNALVKFHRDFG-NETELMQLQDRMLKVGFVPNSLTCRYVIHGL 615
            +F  FD ES+N+LVK   + G    + ++LQ+RM ++GFVPN LTC+Y+I  L
Sbjct: 551 MEFRIFDPESYNSLVKAVCETGCGYKKALELQERMQRLGFVPNRLTCKYLIQVL 598

BLAST of Cp4.1LG01g23300 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 240.4 bits (612), Expect = 4.1e-63
Identity = 153/525 (29.14%), Positives = 263/525 (50.10%), Query Frame = 0

Query: 99  SKIILRCQSNFVSVLAFFNWVKFDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIEL 158
           S ++L+ Q++   +L F NW       TL  +  C+ +HIL   + +  A     ++   
Sbjct: 52  SNLLLKSQNDQALILKFLNWANPHQFFTLRCK--CITLHILTKFKLYKTAQILAEDVAAK 111

Query: 159 SKDNASGSEDVFHNLVLCTEHCNWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPS 218
           + D+   S  VF +L    + C     +F++++K+Y ++ +I ++          GF+P 
Sbjct: 112 TLDDEYASL-VFKSLQETYDLCYSTSSVFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPG 171

Query: 219 VIACNCILNGLAKMKCDAQCWE-LYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFL 278
           V++ N +L+   + K +    E +++EM    V  N +T+NIL    C AG+++      
Sbjct: 172 VLSYNAVLDATIRSKRNISFAENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLF 231

Query: 279 EKMEEEGFDPDVVTYNTLIDSYCRRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKL 338
           +KME +G  P+VVTYNTLID YC+  ++DD F L R M  +G+ P+L+SY  ++NGLC+ 
Sbjct: 232 DKMETKGCLPNVVTYNTLIDGYCKLRKIDDGFKLLRSMALKGLEPNLISYNVVINGLCRE 291

Query: 339 GRVREAHQLFHRMIDRELDPDVVLYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTC 398
           GR++E   +   M  R    D V YNTLI  YCK+G   +A  +  +M R G+ P   T 
Sbjct: 292 GRMKEVSFVLTEMNRRGYSLDEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITY 351

Query: 399 RIMVEGYGRGGSLISALNLVVELRKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIK 458
             ++    + G++  A+  + ++R  G       Y  L+     +     A  VL  +  
Sbjct: 352 TSLIHSMCKAGNMNRAMEFLDQMRVRGLCPNERTYTTLVDGFSQKGYMNEAYRVLREMND 411

Query: 459 DGFQPNACIYNKLIECFCRVHNVSEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVD 518
           +GF P+   YN LI   C    + +A+ +  +M ++     + SY  ++S  C   RS D
Sbjct: 412 NGFSPSVVTYNALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFC---RSYD 471

Query: 519 -GEGLMV--EMVESGVLPDHQICRVLINGYCKEGNVYKAESLLVSFAKDFEFFDTESFNA 578
             E L V  EMVE G+ PD      LI G+C++    +A  L     +     D  ++ A
Sbjct: 472 VDEALRVKREMVEKGIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTA 531

Query: 579 LVKFHRDFGNETELMQLQDRMLKVGFVPNSLTCRYVIHGLWKSAR 620
           L+  +   G+  + +QL + M++ G +P+ +T   +I+GL K +R
Sbjct: 532 LINAYCMEGDLEKALQLHNEMVEKGVLPDVVTYSVLINGLNKQSR 570

BLAST of Cp4.1LG01g23300 vs. TAIR 10
Match: AT5G55840.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 239.6 bits (610), Expect = 6.9e-63
Identity = 151/507 (29.78%), Positives = 244/507 (48.13%), Query Frame = 0

Query: 113 LAFFNWVKFDLGITLNS--QNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVF 172
           L F  WV    G+  +   Q  C+  HIL  +R +  A   L EL  +S      S  VF
Sbjct: 94  LKFLKWVVKQPGLETDHIVQLVCITTHILVRARMYDPARHILKELSLMS----GKSSFVF 153

Query: 173 HNLVLCTEHCNWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLA 232
             L+     CN NP ++++L++ Y++  MIQ+S E F+ M   GF PSV  CN IL  + 
Sbjct: 154 GALMTTYRLCNSNPSVYDILIRVYLREGMIQDSLEIFRLMGLYGFNPSVYTCNAILGSVV 213

Query: 233 KMKCDAQCWELYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVV 292
           K   D   W   +EM +  +  +  TFNIL  VLC  G   K +  ++KME+ G+ P +V
Sbjct: 214 KSGEDVSVWSFLKEMLKRKICPDVATFNILINVLCAEGSFEKSSYLMQKMEKSGYAPTIV 273

Query: 293 TYNTLIDSYCRRGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRM 352
           TYNT++  YC++GR   A  L   M  +GV  D+ +Y  L++ LC+  R+ + + L   M
Sbjct: 274 TYNTVLHWYCKKGRFKAAIELLDHMKSKGVDADVCTYNMLIHDLCRSNRIAKGYLLLRDM 333

Query: 353 IDRELDPDVVLYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGSL 412
             R + P+ V YNTLIN +  +G++  A  LL++M   G+ P+  T   +++G+   G+ 
Sbjct: 334 RKRMIHPNEVTYNTLINGFSNEGKVLIASQLLNEMLSFGLSPNHVTFNALIDGHISEGNF 393

Query: 413 ISALNLVVELRKLGTIVTYDIYDYLIVSLCLEDRPFAAKSVLERVIKDGFQPNACIYNKL 472
             AL +   +   G   +   Y  L+  LC       A+    R+ ++G       Y  +
Sbjct: 394 KEALKMFYMMEAKGLTPSEVSYGVLLDGLCKNAEFDLARGFYMRMKRNGVCVGRITYTGM 453

Query: 473 IECFCRVHNVSEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMVESGV 532
           I+  C+   + EA++L +EM K      I +Y  LI+  C V R    + ++  +   G+
Sbjct: 454 IDGLCKNGFLDEAVVLLNEMSKDGIDPDIVTYSALINGFCKVGRFKTAKEIVCRIYRVGL 513

Query: 533 LPDHQICRVLINGYCKEGNVYKAESLLVSFAKDFEFFDTESFNALVKFHRDFGNETELMQ 592
            P+  I   LI   C+ G + +A  +  +   +    D  +FN LV      G   E  +
Sbjct: 514 SPNGIIYSTLIYNCCRMGCLKEAIRIYEAMILEGHTRDHFTFNVLVTSLCKAGKVAEAEE 573

Query: 593 LQDRMLKVGFVPNSLTCRYVIHGLWKS 618
               M   G +PN+++   +I+G   S
Sbjct: 574 FMRCMTSDGILPNTVSFDCLINGYGNS 596

BLAST of Cp4.1LG01g23300 vs. TAIR 10
Match: AT1G09820.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 237.3 bits (604), Expect = 3.4e-62
Identity = 143/505 (28.32%), Positives = 256/505 (50.69%), Query Frame = 0

Query: 113 LAFFNWVKFDLGITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHN 172
           L +++W+  +  I+++ +    ++H LA ++++S    FL   +    D+   S  +FH 
Sbjct: 85  LRYYSWLVKNSDISVSLELTFKLLHSLANAKRYSKIRSFLDGFVRNGSDHQVHS--IFHA 144

Query: 173 LVLCTEHCNWNPVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKM 232
           + +C   C  N +I +ML+ AY      +  +E+FK+    G+  S ++C  ++  L K 
Sbjct: 145 ISMCDNVC-VNSIIADMLVLAYANNSRFELGFEAFKRSGYYGYKLSALSCKPLMIALLKE 204

Query: 233 KCDAQCWELYEEMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTY 292
              A    +Y+EM R  +  N +TFN++   LC+ G +NK  + +E M+  G  P+VV+Y
Sbjct: 205 NRSADVEYVYKEMIRRKIQPNVFTFNVVINALCKTGKMNKARDVMEDMKVYGCSPNVVSY 264

Query: 293 NTLIDSYCR---RGRLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHR 352
           NTLID YC+    G++  A  + + M    V P+L ++  L++G  K   +  + ++F  
Sbjct: 265 NTLIDGYCKLGGNGKMYKADAVLKEMVENDVSPNLTTFNILIDGFWKDDNLPGSMKVFKE 324

Query: 353 MIDRELDPDVVLYNTLINAYCKDGRLQEARSLLHDMTRIGICPDSFTCRIMVEGYGRGGS 412
           M+D+++ P+V+ YN+LIN  C  G++ EA S+   M   G+ P+  T   ++ G+ +   
Sbjct: 325 MLDQDVKPNVISYNSLINGLCNGGKISEAISMRDKMVSAGVQPNLITYNALINGFCKNDM 384

Query: 413 LISALNLVVELRKLGTIVTYDIYDYLIVSLC---LEDRPFAAKSVLERVIKDGFQPNACI 472
           L  AL++   ++  G + T  +Y+ LI + C     D  FA K  +ER   +G  P+   
Sbjct: 385 LKEALDMFGSVKGQGAVPTTRMYNMLIDAYCKLGKIDDGFALKEEMER---EGIVPDVGT 444

Query: 473 YNKLIECFCRVHNVSEALLLKSEMIKRNFKLSIDSYKPLISCLCGVNRSVDGEGLMVEMV 532
           YN LI   CR  N+  A  L  ++  +     + ++  L+   C    S     L+ EM 
Sbjct: 445 YNCLIAGLCRNGNIEAAKKLFDQLTSKGLP-DLVTFHILMEGYCRKGESRKAAMLLKEMS 504

Query: 533 ESGVLPDHQICRVLINGYCKEGNVYKAESLLVSFAKDFEF-FDTESFNALVKFHRDFGNE 592
           + G+ P H    +++ GYCKEGN+  A ++     K+     +  S+N L++ +   G  
Sbjct: 505 KMGLKPRHLTYNIVMKGYCKEGNLKAATNMRTQMEKERRLRMNVASYNVLLQGYSQKGKL 564

Query: 593 TELMQLQDRMLKVGFVPNSLTCRYV 611
            +   L + ML+ G VPN +T   V
Sbjct: 565 EDANMLLNEMLEKGLVPNRITYEIV 582

BLAST of Cp4.1LG01g23300 vs. TAIR 10
Match: AT5G01110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 235.7 bits (600), Expect = 1.0e-61
Identity = 161/654 (24.62%), Positives = 292/654 (44.65%), Query Frame = 0

Query: 5   PASNLTKLSKPFFFLASIPKSNFSSASSSTFLQSIPRSETKLIVNPLYHFLPQNQNPFNI 64
           P  NLT  S P F  +S   S+ SSAS S                          +   +
Sbjct: 20  PLKNLTTSSSPVFEPSSSSSSSSSSASFSV-------------------------SDSFL 79

Query: 65  VELVSSHLKTSNTQLSLLQSDIKELLPHLGHREISKIILRCQSNFVSVLAFFNWVKFDL- 124
           VE +   LK  N       ++++  L  L    + +++ RC+++      F + + F   
Sbjct: 80  VEKICFSLKQGN-------NNVRNHLIRLNPLAVVEVLYRCRNDLTLGQRFVDQLGFHFP 139

Query: 125 GITLNSQNYCLIIHILAWSRQFSMAMKFLSELIELSKDNASGSEDVFHNLVLCTEHCNWN 184
                S +   +IHIL  S + S A   L  +I  S        ++ ++L     +C  N
Sbjct: 140 NFKHTSLSLSAMIHILVRSGRLSDAQSCLLRMIRRS---GVSRLEIVNSLDSTFSNCGSN 199

Query: 185 PVIFEMLMKAYVKVHMIQESYESFKKMVKMGFVPSVIACNCILNGLAKMKCDAQCWELYE 244
             +F++L++ YV+   ++E++E+F  +   GF  S+ ACN ++  L ++      W +Y+
Sbjct: 200 DSVFDLLIRTYVQARKLREAHEAFTLLRSKGFTVSIDACNALIGSLVRIGWVELAWGVYQ 259

Query: 245 EMGRIGVHSNAYTFNILTYVLCRAGDVNKVNEFLEKMEEEGFDPDVVTYNTLIDSYCRRG 304
           E+ R GV  N YT NI+   LC+ G + KV  FL +++E+G  PD+VTYNTLI +Y  +G
Sbjct: 260 EISRSGVGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLISAYSSKG 319

Query: 305 RLDDAFYLYRIMFRRGVMPDLVSYTSLMNGLCKLGRVREAHQLFHRMIDRELDPDVVLYN 364
            +++AF L   M  +G  P + +Y +++NGLCK G+   A ++F  M+   L PD   Y 
Sbjct: 320 LMEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSPDSTTYR 379

Query: 365 TLINAYCKDGRLQEARSLLHDM-----------------------------------TRI 424
           +L+   CK G + E   +  DM                                      
Sbjct: 380 SLLMEACKKGDVVETEKVFSDMRSRDVVPDLVCFSSMMSLFTRSGNLDKALMYFNSVKEA 439

Query: 425 GICPDSFTCRIMVEGYGRGGSLISALNLVVELRKLGTIVTYDIYDYLIVSLCLEDRPFAA 484
           G+ PD+    I+++GY R G +  A+NL  E+ + G  +    Y+ ++  LC       A
Sbjct: 440 GLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHGLCKRKMLGEA 499

Query: 485 KSVLERVIKDGFQPNACIYNKLIECFCRVHNVSEALLLKSEMIKRNFKLSIDSYKPLISC 544
             +   + +    P++     LI+  C++ N+  A+ L  +M ++  +L + +Y  L+  
Sbjct: 500 DKLFNEMTERALFPDSYTLTILIDGHCKLGNLQNAMELFQKMKEKRIRLDVVTYNTLLDG 559

Query: 545 LCGVNRSVDGEGLMVEMVESGVLPDHQICRVLINGYCKEGNVYKAESLLVSFAKDFEFFD 604
              V      + +  +MV   +LP      +L+N  C +G++ +A  +            
Sbjct: 560 FGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHLAEAFRVWDEMISKNIKPT 619

Query: 605 TESFNALVKFHRDFGNETELMQLQDRMLKVGFVPNSLTCRYVIHGLWKSARLDK 623
               N+++K +   GN ++     ++M+  GFVP+ ++   +I+G  +   + K
Sbjct: 620 VMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTLIYGFVREENMSK 638

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q9FND8          2.6e-171  50.67     Pentatricopeptide repeat-containing protein At5g40400 OS=Arabidopsis thaliana OX...
Q9FIX3          5.7e-62   29.14     Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q9LVQ5          9.8e-62   29.78     Pentatricopeptide repeat-containing protein At5g55840 OS=Arabidopsis thaliana OX...
O04504          4.8e-61   28.32     Pentatricopeptide repeat-containing protein At1g09820 OS=Arabidopsis thaliana OX...
Q9LFC5          1.4e-60   24.62     Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
XP_023535833.1  0.0       100.00    pentatricopeptide repeat-containing protein At5g40400 [Cucurbita pepo subsp. pep...
XP_022922085.1  0.0       97.79     pentatricopeptide repeat-containing protein At5g40400 [Cucurbita moschata]
KAG7033098.1    0.0       97.63     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
KAG6602420.1    0.0       97.63     Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022991028.1  0.0       97.16     pentatricopeptide repeat-containing protein At5g40400 [Cucurbita maxima]

Match Name      E-value   Identity  Description
A0A6J1E7L8      0.0       97.79     pentatricopeptide repeat-containing protein At5g40400 OS=Cucurbita moschata OX=3...
A0A6J1JUZ7      0.0       97.16     pentatricopeptide repeat-containing protein At5g40400 OS=Cucurbita maxima OX=366...
A0A0A0KSF1      0.0       80.09     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G523120 PE=4 SV=1
A0A1S3C995      0.0       79.90     pentatricopeptide repeat-containing protein At5g40400 OS=Cucumis melo OX=3656 GN...
A0A5D3BTR0      0.0       79.74     Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...

Match Name      E-value   Identity  Description
AT5G40400.1     1.8e-172  50.67     Pentatricopeptide repeat (PPR) superfamily protein
AT5G39710.1     4.1e-63   29.14     Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G55840.1     6.9e-63   29.78     Pentatricopeptide repeat (PPR) superfamily protein
AT1G09820.1     3.4e-62   28.32     Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G01110.1     1.0e-61   24.62     Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 325..359  e-value: 2.2E-10  score: 38.0
    coord: 255..289  e-value: 5.9E-7   score: 27.3
    coord: 290..323  e-value: 1.3E-10  score: 38.8
    coord: 360..393  e-value: 3.1E-9   score: 34.4
    coord: 185..219  e-value: 0.0012   score: 16.9
    coord: 466..498  e-value: 7.1E-6   score: 23.9
    coord: 501..533  e-value: 8.6E-4   score: 17.3
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 187..215  e-value: 0.31     score: 11.4
    coord: 539..557  e-value: 0.0045   score: 17.1
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 287..336  e-value: 8.6E-19  score: 67.5
    coord: 217..266  e-value: 5.8E-9   score: 36.0
    coord: 462..510  e-value: 1.7E-8   score: 34.5
    coord: 357..404  e-value: 2.9E-15  score: 56.2
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 498..532  score: 8.812943
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 358..392  score: 13.855141
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 568..602  score: 8.560833
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 253..287  score: 11.684803
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 288..322  score: 14.490896
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 463..497  score: 9.930995
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 183..217  score: 10.39137
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 323..357  score: 13.438611
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 102..280  e-value: 8.2E-28  score: 99.7
    coord: 436..563  e-value: 1.6E-23  score: 85.7
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 564..625  e-value: 3.3E-5   score: 25.5
    coord: 352..427  e-value: 6.6E-18  score: 66.9
    coord: 281..351  e-value: 3.1E-26  score: 94.1
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 138..381
None  No IPR available  PANTHER  PTHR47941  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 3, MITOCHONDRIAL
    coord: 24..621
None  No IPR available  PANTHER  PTHR47941:SF6  OS01G0546700 PROTEIN
    coord: 24..621

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g23300.1  Cp4.1LG01g23300.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding