Cp4.1LG18g05980 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG18g05980
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG18: 6315369 .. 6317654 (-)
RNA-Seq ExpressionCp4.1LG18g05980
SyntenyCp4.1LG18g05980
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTCACATTTCTTTATCTAAACCTCACTACAGCCATCTCAAGGTTCTTTCCAGTTCTTCAATTTCGAAACCAATCTCCTTCAATTCACTTCATTTCTTCAGCTCCATTCAAGATCCAACCACCACAGCTACTCAAAATGAAAGCCCTAAAGATCCATTCGTCAGTTCAGATGCCGCAGTGCCTCAGCCCGTGGAACCGGTGGCTGTTAATGGCGGCGACCAAGTTAAGCGAAGCATCCCTAGAGGTAACCGTCGTAACCCTGAAAAATTAGAGGATATTATTTGTAGAATGATGGCAAATCGTGAATGGACAACACGTTTACAGAATTCGATTAGGTCGTTGGTTCCCCAATTTGATCACTCTATTGTTTGGAATGTGTTACATGCTGCTAAAAACTCGGACCATGCCCTCCAGTTCTTCCGATGGGTAGAGCGATCCGGCTTATTCCAGCATGATCGTGGAACACATTTCAAAATAATTGAGATTTTGGGTAGGGCTTCAAAGCTTAATCATGCCCGTTGCATTCTTCTTGATATGCCCAATAAGGGCGTTGAATGGGATGAAGACTTATTCGTTATAATGATTGATAGTTATGGGAAAGCTGGGATAGTTCAGGAGGCCGTGAAAATTTTTCAAAAGATGAAGGAATTGGGTGTTGAAAGGAGTATTAAATCTTATAATGTTTTGTTTAAGGTGATTTTGAGGAGGGGGCGGTATATGATGGCGAAGAGGTACTTTAATGCTATGTTGAATGAAGGCATAGAACCAACTTGCCATACCTATAATGTAATGCTTTGGGGTTTCTTTCTGTCGTTGAGGCTTGAGACAGCCAAGAGGTTTTATGAAGACATGAAGACTAGAGGCATTGCCCCTGACGTTGTTACATATAACACTATGATTAATGGTTATTATCGGTTCAAAATGATGGAGGAGGCGGAGCAATTCTTTACTGAGATGAAGGGGAACAATCTTTTACCAACAGTGATAAGCTATACTACTATGATAAAAGGTTATGTTTCTTCCGGTCGAGTAGATGATGGATTGAGATTGTTCGAAGAGATGAAGGCTGTTGGTGTGAAGCCAAATGACTTTACTTATTCGACTCTGCTGCCTGGTCTCTGTGATGCAGAGAAAATGTCCGAGGCGCGTCAAATTTTGACAGAAATGGTGGACAAGTATATTGCTCCAAAGGACAATTCAATTTTCATGAGGTTGTTATCTTGCCAGTGCAAGCATGGTGATTTGGATGCTGCTATGCATGTGCTGAAAGCTATGATTCGGTTAAGCATTCCAACAGAGGCTGGGCACTACGGTATTTTGATCGAGAACTGTTGCAAAGCCGGAGTGTATGATCGGGCAGTTAAGTTGCTTGACAAGCTTGTAGAAAAAGAAATCATTTTGAAACCACAAAGTACTCTGGAAATGGAGGCTAGTGCGTATAATCCTATAATTCAGTATCTGTGCGACCATGGGCAGACTGGAAAAGCTGAAACGTTTTTCCGGCAGTTGTTGAAGAAGGGTATCCAGGATGAAGTTGCATTCAATAATTTAATCCGTGGCCATTCCAAAGAAGGGAATCCTGAATTGGCATTTGAAATCTTGAAAATCATGGGTAGGAAGAGTGTTTCTAGGGATGCAGAATCTTACAAGTTGCTAATCAAGAGCTACTTAAGTAAAGGTGAACCAGCTGATGCTAAGACAGCTTTGGACAGCATGATTGAAAGCGGACACTACCCTGATTCGGCGTTGTTTCGATCGGTGATGGAAAGTCTATTTGCAGATGGGAGGGTGCAGACCGCGAGCCGAGTGATGAACAGTATGTTGGATAAAGGAATAACAGAAAACATGGACTTGGTTGCCAAGATTCTGGAAGCCCTTTTCATGAGAGGTCATGTGGAAGAGGCATTGGGACGAGTTGATTTGCTTATGAGATGCAGTTGCCCACCTGATTTTGACAGTCTTTTATCTGTTCTTTGTGAAAAGGGGAAGACCATTGCTGCTCTTAAGCTTTTAGATTTTGGGTTGGAAAGAGAATGCAACATAGAAGTCTCAAGCTATGAGAAAGTACTTGATGCGCTGTTGGCAGCGGGGAAGACGCTGAACGCATACTCAATTCTATGCAAGATAATGGAGAAAGGAGGGGCCAAGGAGTGGAGCAGCTGCGATGATCTGATCAATAGCCTGAATCAGGAAGGGCACACAAAGCAAGCTGATATTCTCTCAAGAATGATAAAGGGTGGAGACAGAATTCGGCGTAAGAAAGCTTCTCCAGCTGCTGCTTGA

mRNA sequence

ATGGCTCACATTTCTTTATCTAAACCTCACTACAGCCATCTCAAGGTTCTTTCCAGTTCTTCAATTTCGAAACCAATCTCCTTCAATTCACTTCATTTCTTCAGCTCCATTCAAGATCCAACCACCACAGCTACTCAAAATGAAAGCCCTAAAGATCCATTCGTCAGTTCAGATGCCGCAGTGCCTCAGCCCGTGGAACCGGTGGCTGTTAATGGCGGCGACCAAGTTAAGCGAAGCATCCCTAGAGGTAACCGTCGTAACCCTGAAAAATTAGAGGATATTATTTGTAGAATGATGGCAAATCGTGAATGGACAACACGTTTACAGAATTCGATTAGGTCGTTGGTTCCCCAATTTGATCACTCTATTGTTTGGAATGTGTTACATGCTGCTAAAAACTCGGACCATGCCCTCCAGTTCTTCCGATGGGTAGAGCGATCCGGCTTATTCCAGCATGATCGTGGAACACATTTCAAAATAATTGAGATTTTGGGTAGGGCTTCAAAGCTTAATCATGCCCGTTGCATTCTTCTTGATATGCCCAATAAGGGCGTTGAATGGGATGAAGACTTATTCGTTATAATGATTGATAGTTATGGGAAAGCTGGGATAGTTCAGGAGGCCGTGAAAATTTTTCAAAAGATGAAGGAATTGGGTGTTGAAAGGAGTATTAAATCTTATAATGTTTTGTTTAAGGTGATTTTGAGGAGGGGGCGGTATATGATGGCGAAGAGGTACTTTAATGCTATGTTGAATGAAGGCATAGAACCAACTTGCCATACCTATAATGTAATGCTTTGGGGTTTCTTTCTGTCGTTGAGGCTTGAGACAGCCAAGAGGTTTTATGAAGACATGAAGACTAGAGGCATTGCCCCTGACGTTGTTACATATAACACTATGATTAATGGTTATTATCGGTTCAAAATGATGGAGGAGGCGGAGCAATTCTTTACTGAGATGAAGGGGAACAATCTTTTACCAACAGTGATAAGCTATACTACTATGATAAAAGGTTATGTTTCTTCCGGTCGAGTAGATGATGGATTGAGATTGTTCGAAGAGATGAAGGCTGTTGGTGTGAAGCCAAATGACTTTACTTATTCGACTCTGCTGCCTGGTCTCTGTGATGCAGAGAAAATGTCCGAGGCGCGTCAAATTTTGACAGAAATGGTGGACAAGTATATTGCTCCAAAGGACAATTCAATTTTCATGAGGTTGTTATCTTGCCAGTGCAAGCATGGTGATTTGGATGCTGCTATGCATGTGCTGAAAGCTATGATTCGGTTAAGCATTCCAACAGAGGCTGGGCACTACGGTATTTTGATCGAGAACTGTTGCAAAGCCGGAGTGTATGATCGGGCAGTTAAGTTGCTTGACAAGCTTGTAGAAAAAGAAATCATTTTGAAACCACAAAGTACTCTGGAAATGGAGGCTAGTGCGTATAATCCTATAATTCAGTATCTGTGCGACCATGGGCAGACTGGAAAAGCTGAAACGTTTTTCCGGCAGTTGTTGAAGAAGGGTATCCAGGATGAAGTTGCATTCAATAATTTAATCCGTGGCCATTCCAAAGAAGGGAATCCTGAATTGGCATTTGAAATCTTGAAAATCATGGGTAGGAAGAGTGTTTCTAGGGATGCAGAATCTTACAAGTTGCTAATCAAGAGCTACTTAAGTAAAGGTGAACCAGCTGATGCTAAGACAGCTTTGGACAGCATGATTGAAAGCGGACACTACCCTGATTCGGCGTTGTTTCGATCGGTGATGGAAAGTCTATTTGCAGATGGGAGGGTGCAGACCGCGAGCCGAGTGATGAACAGTATGTTGGATAAAGGAATAACAGAAAACATGGACTTGGTTGCCAAGATTCTGGAAGCCCTTTTCATGAGAGGTCATGTGGAAGAGGCATTGGGACGAGTTGATTTGCTTATGAGATGCAGTTGCCCACCTGATTTTGACAGTCTTTTATCTGTTCTTTGTGAAAAGGGGAAGACCATTGCTGCTCTTAAGCTTTTAGATTTTGGGTTGGAAAGAGAATGCAACATAGAAGTCTCAAGCTATGAGAAAGTACTTGATGCGCTGTTGGCAGCGGGGAAGACGCTGAACGCATACTCAATTCTATGCAAGATAATGGAGAAAGGAGGGGCCAAGGAGTGGAGCAGCTGCGATGATCTGATCAATAGCCTGAATCAGGAAGGGCACACAAAGCAAGCTGATATTCTCTCAAGAATGATAAAGGGTGGAGACAGAATTCGGCGTAAGAAAGCTTCTCCAGCTGCTGCTTGA

Coding sequence (CDS)

ATGGCTCACATTTCTTTATCTAAACCTCACTACAGCCATCTCAAGGTTCTTTCCAGTTCTTCAATTTCGAAACCAATCTCCTTCAATTCACTTCATTTCTTCAGCTCCATTCAAGATCCAACCACCACAGCTACTCAAAATGAAAGCCCTAAAGATCCATTCGTCAGTTCAGATGCCGCAGTGCCTCAGCCCGTGGAACCGGTGGCTGTTAATGGCGGCGACCAAGTTAAGCGAAGCATCCCTAGAGGTAACCGTCGTAACCCTGAAAAATTAGAGGATATTATTTGTAGAATGATGGCAAATCGTGAATGGACAACACGTTTACAGAATTCGATTAGGTCGTTGGTTCCCCAATTTGATCACTCTATTGTTTGGAATGTGTTACATGCTGCTAAAAACTCGGACCATGCCCTCCAGTTCTTCCGATGGGTAGAGCGATCCGGCTTATTCCAGCATGATCGTGGAACACATTTCAAAATAATTGAGATTTTGGGTAGGGCTTCAAAGCTTAATCATGCCCGTTGCATTCTTCTTGATATGCCCAATAAGGGCGTTGAATGGGATGAAGACTTATTCGTTATAATGATTGATAGTTATGGGAAAGCTGGGATAGTTCAGGAGGCCGTGAAAATTTTTCAAAAGATGAAGGAATTGGGTGTTGAAAGGAGTATTAAATCTTATAATGTTTTGTTTAAGGTGATTTTGAGGAGGGGGCGGTATATGATGGCGAAGAGGTACTTTAATGCTATGTTGAATGAAGGCATAGAACCAACTTGCCATACCTATAATGTAATGCTTTGGGGTTTCTTTCTGTCGTTGAGGCTTGAGACAGCCAAGAGGTTTTATGAAGACATGAAGACTAGAGGCATTGCCCCTGACGTTGTTACATATAACACTATGATTAATGGTTATTATCGGTTCAAAATGATGGAGGAGGCGGAGCAATTCTTTACTGAGATGAAGGGGAACAATCTTTTACCAACAGTGATAAGCTATACTACTATGATAAAAGGTTATGTTTCTTCCGGTCGAGTAGATGATGGATTGAGATTGTTCGAAGAGATGAAGGCTGTTGGTGTGAAGCCAAATGACTTTACTTATTCGACTCTGCTGCCTGGTCTCTGTGATGCAGAGAAAATGTCCGAGGCGCGTCAAATTTTGACAGAAATGGTGGACAAGTATATTGCTCCAAAGGACAATTCAATTTTCATGAGGTTGTTATCTTGCCAGTGCAAGCATGGTGATTTGGATGCTGCTATGCATGTGCTGAAAGCTATGATTCGGTTAAGCATTCCAACAGAGGCTGGGCACTACGGTATTTTGATCGAGAACTGTTGCAAAGCCGGAGTGTATGATCGGGCAGTTAAGTTGCTTGACAAGCTTGTAGAAAAAGAAATCATTTTGAAACCACAAAGTACTCTGGAAATGGAGGCTAGTGCGTATAATCCTATAATTCAGTATCTGTGCGACCATGGGCAGACTGGAAAAGCTGAAACGTTTTTCCGGCAGTTGTTGAAGAAGGGTATCCAGGATGAAGTTGCATTCAATAATTTAATCCGTGGCCATTCCAAAGAAGGGAATCCTGAATTGGCATTTGAAATCTTGAAAATCATGGGTAGGAAGAGTGTTTCTAGGGATGCAGAATCTTACAAGTTGCTAATCAAGAGCTACTTAAGTAAAGGTGAACCAGCTGATGCTAAGACAGCTTTGGACAGCATGATTGAAAGCGGACACTACCCTGATTCGGCGTTGTTTCGATCGGTGATGGAAAGTCTATTTGCAGATGGGAGGGTGCAGACCGCGAGCCGAGTGATGAACAGTATGTTGGATAAAGGAATAACAGAAAACATGGACTTGGTTGCCAAGATTCTGGAAGCCCTTTTCATGAGAGGTCATGTGGAAGAGGCATTGGGACGAGTTGATTTGCTTATGAGATGCAGTTGCCCACCTGATTTTGACAGTCTTTTATCTGTTCTTTGTGAAAAGGGGAAGACCATTGCTGCTCTTAAGCTTTTAGATTTTGGGTTGGAAAGAGAATGCAACATAGAAGTCTCAAGCTATGAGAAAGTACTTGATGCGCTGTTGGCAGCGGGGAAGACGCTGAACGCATACTCAATTCTATGCAAGATAATGGAGAAAGGAGGGGCCAAGGAGTGGAGCAGCTGCGATGATCTGATCAATAGCCTGAATCAGGAAGGGCACACAAAGCAAGCTGATATTCTCTCAAGAATGATAAAGGGTGGAGACAGAATTCGGCGTAAGAAAGCTTCTCCAGCTGCTGCTTGA

Protein sequence

MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAAVPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFDHSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRYMMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGVKPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASAYNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGRKSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWSSCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA
Homology
BLAST of Cp4.1LG18g05980 vs. ExPASy Swiss-Prot
Match: Q9ZUU3 (Pentatricopeptide repeat-containing protein At2g37230 OS=Arabidopsis thaliana OX=3702 GN=At2g37230 PE=2 SV=1)

HSP 1 Score: 1012.3 bits (2616), Expect = 2.9e-294
Identity = 512/757 (67.64%), Postives = 611/757 (80.71%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSL-HFFSSIQDPTTTATQNESPKDPFVSSDA 60
           MA IS SK + S  +V  S   S   S  SL   FS+I++  T A  N   + P   S+ 
Sbjct: 1   MAFISRSKRYQSKARVYLSLPRSSNSSLFSLPRLFSTIEETQTPANANPETQSPDAKSET 60

Query: 61  AVPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQF 120
                 + +       ++    RG R+N EKLED ICRMM NR WTTRLQNSIR LVP++
Sbjct: 61  K-----KNLTSTETRPLRERFQRGKRQNHEKLEDTICRMMDNRAWTTRLQNSIRDLVPEW 120

Query: 121 DHSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLD 180
           DHS+V+NVLH AK  +HALQFFRW ERSGL +HDR TH K+I++LG  SKLNHARCILLD
Sbjct: 121 DHSLVYNVLHGAKKLEHALQFFRWTERSGLIRHDRDTHMKMIKMLGEVSKLNHARCILLD 180

Query: 181 MPNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGR 240
           MP KGV WDED+FV++I+SYGKAGIVQE+VKIFQKMK+LGVER+IKSYN LFKVILRRGR
Sbjct: 181 MPEKGVPWDEDMFVVLIESYGKAGIVQESVKIFQKMKDLGVERTIKSYNSLFKVILRRGR 240

Query: 241 YMMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNT 300
           YMMAKRYFN M++EG+EPT HTYN+MLWGFFLSLRLETA RF+EDMKTRGI+PD  T+NT
Sbjct: 241 YMMAKRYFNKMVSEGVEPTRHTYNLMLWGFFLSLRLETALRFFEDMKTRGISPDDATFNT 300

Query: 301 MINGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVG 360
           MING+ RFK M+EAE+ F EMKGN + P+V+SYTTMIKGY++  RVDDGLR+FEEM++ G
Sbjct: 301 MINGFCRFKKMDEAEKLFVEMKGNKIGPSVVSYTTMIKGYLAVDRVDDGLRIFEEMRSSG 360

Query: 361 VKPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAA 420
           ++PN  TYSTLLPGLCDA KM EA+ IL  M+ K+IAPKDNSIF++LL  Q K GD+ AA
Sbjct: 361 IEPNATTYSTLLPGLCDAGKMVEAKNILKNMMAKHIAPKDNSIFLKLLVSQSKAGDMAAA 420

Query: 421 MHVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEAS 480
             VLKAM  L++P EAGHYG+LIEN CKA  Y+RA+KLLD L+EKEIIL+ Q TLEME S
Sbjct: 421 TEVLKAMATLNVPAEAGHYGVLIENQCKASAYNRAIKLLDTLIEKEIILRHQDTLEMEPS 480

Query: 481 AYNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMG 540
           AYNPII+YLC++GQT KAE  FRQL+K+G+QD+ A NNLIRGH+KEGNP+ ++EILKIM 
Sbjct: 481 AYNPIIEYLCNNGQTAKAEVLFRQLMKRGVQDQDALNNLIRGHAKEGNPDSSYEILKIMS 540

Query: 541 RKSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQ 600
           R+ V R++ +Y+LLIKSY+SKGEP DAKTALDSM+E GH PDS+LFRSV+ESLF DGRVQ
Sbjct: 541 RRGVPRESNAYELLIKSYMSKGEPGDAKTALDSMVEDGHVPDSSLFRSVIESLFEDGRVQ 600

Query: 601 TASRVMNSMLDK--GITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLS 660
           TASRVM  M+DK  GI +NMDL+AKILEAL MRGHVEEALGR+DLL +     D DSLLS
Sbjct: 601 TASRVMMIMIDKNVGIEDNMDLIAKILEALLMRGHVEEALGRIDLLNQNGHTADLDSLLS 660

Query: 661 VLCEKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAK 720
           VL EKGKTIAALKLLDFGLER+ ++E SSY+KVLDALL AGKTLNAYS+LCKIMEKG + 
Sbjct: 661 VLSEKGKTIAALKLLDFGLERDLSLEFSSYDKVLDALLGAGKTLNAYSVLCKIMEKGSST 720

Query: 721 EWSSCDDLINSLNQEGHTKQADILSRMIKGGDRIRRK 755
           +W S D+LI SLNQEG+TKQAD+LSRMIK G  I+++
Sbjct: 721 DWKSSDELIKSLNQEGNTKQADVLSRMIKKGQGIKKQ 752

BLAST of Cp4.1LG18g05980 vs. ExPASy Swiss-Prot
Match: O81908 (Pentatricopeptide repeat-containing protein At1g02060, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At1g02060 PE=2 SV=2)

HSP 1 Score: 368.2 bits (944), Expect = 2.2e-100
Identity = 223/669 (33.33%), Postives = 358/669 (53.51%), Query Frame = 0

Query: 85  RRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQ--FDHSIVWNVLHAAKNSDHALQFFR 144
           R    KL   + R + +  W+  L++S+ SL P      + V   L   K     L+FF 
Sbjct: 30  RSTKSKLARSLARAVNSNPWSDELESSLSSLHPSQTISRTTVLQTLRLIKVPADGLRFFD 89

Query: 145 WVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDMPNKG---VEWDEDLFVIMIDSY 204
           WV   G F H   + F ++E LGRA  LN AR  L  +  +    V+  +  F  +I SY
Sbjct: 90  WVSNKG-FSHKEQSFFLMLEFLGRARNLNVARNFLFSIERRSNGCVKLQDRYFNSLIRSY 149

Query: 205 GKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRYMMAKRYFNAMLNE-GIEPT 264
           G AG+ QE+VK+FQ MK++G+  S+ ++N L  ++L+RGR  MA   F+ M    G+ P 
Sbjct: 150 GNAGLFQESVKLFQTMKQMGISPSVLTFNSLLSILLKRGRTGMAHDLFDEMRRTYGVTPD 209

Query: 265 CHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTMINGYYRFKMMEEAEQFFT 324
            +T+N ++ GF  +  ++ A R ++DM+     PDVVTYNT+I+G  R   ++ A    +
Sbjct: 210 SYTFNTLINGFCKNSMVDEAFRIFKDMELYHCNPDVVTYNTIIDGLCRAGKVKIAHNVLS 269

Query: 325 EM--KGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGVKPNDFTYSTLLPGLCD 384
            M  K  ++ P V+SYTT+++GY     +D+ + +F +M + G+KPN  TY+TL+ GL +
Sbjct: 270 GMLKKATDVHPNVVSYTTLVRGYCMKQEIDEAVLVFHDMLSRGLKPNAVTYNTLIKGLSE 329

Query: 385 AEKMSEARQILTEMVDKY--IAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTE 444
           A +  E + IL    D +   AP D   F  L+   C  G LDAAM V + M+ + +  +
Sbjct: 330 AHRYDEIKDILIGGNDAFTTFAP-DACTFNILIKAHCDAGHLDAAMKVFQEMLNMKLHPD 389

Query: 445 AGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASAYNPIIQYLCDHGQT 504
           +  Y +LI   C    +DRA  L ++L EKE++L       + A+AYNP+ +YLC +G+T
Sbjct: 390 SASYSVLIRTLCMRNEFDRAETLFNELFEKEVLLGKDECKPL-AAAYNPMFEYLCANGKT 449

Query: 505 GKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGRKSVSRDAESYKLLI 564
            +AE  FRQL+K+G+QD  ++  LI GH +EG  + A+E+L +M R+    D E+Y+LLI
Sbjct: 450 KQAEKVFRQLMKRGVQDPPSYKTLITGHCREGKFKPAYELLVLMLRREFVPDLETYELLI 509

Query: 565 KSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGIT 624
              L  GE   A   L  M+ S + P +  F SV+  L        +  ++  ML+K I 
Sbjct: 510 DGLLKIGEALLAHDTLQRMLRSSYLPVATTFHSVLAELAKRKFANESFCLVTLMLEKRIR 569

Query: 625 ENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLCEKGKTIAALKLLDF 684
           +N+DL  +++  LF     E+A   V LL         + LL  LCE  K + A  L+ F
Sbjct: 570 QNIDLSTQVVRLLFSSAQKEKAFLIVRLLYDNGYLVKMEELLGYLCENRKLLDAHTLVLF 629

Query: 685 GLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWSSCDDLINSLNQEGH 744
            LE+   +++ +   V++ L    +   A+S+  +++E G  ++ S    L N+L   G 
Sbjct: 630 CLEKSQMVDIDTCNTVIEGLCKHKRHSEAFSLYNELVELGNHQQLSCHVVLRNALEAAGK 689

BLAST of Cp4.1LG18g05980 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 207.6 bits (527), Expect = 4.9e-52
Identity = 158/712 (22.19%), Postives = 320/712 (44.94%), Query Frame = 0

Query: 56  SSDAAVPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSL 115
           S+D  VP PV          + R++P     +   +   +  +++   W      S++S+
Sbjct: 28  STDVTVPSPVTRRQFCSVSPLLRNLPE-EESDSMSVPHRLLSILSKPNW--HKSPSLKSM 87

Query: 116 VPQFDHSIVWNVLHAAKNSDHALQFFRWVERSGLFQHD----------------RGTHFK 175
           V     S V ++     +   AL F  W+ ++  ++H                  G  FK
Sbjct: 88  VSAISPSHVSSLFSLDLDPKTALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVGVVFK 147

Query: 176 IIEILGRASKLNHARCILLDMPNKGVEWDE----------DLFVIMIDSYGKAGIVQEAV 235
           I  ++ ++         +LD+  K +  DE            +  +++S  + G+V E  
Sbjct: 148 IRLLMIKSCDSVGDALYVLDLCRK-MNKDERFELKYKLIIGCYNTLLNSLARFGLVDEMK 207

Query: 236 KIFQKMKELGVERSIKSYNVLFKVILRRGRYMMAKRYFNAMLNEGIEPTCHTYNVMLWGF 295
           +++ +M E  V  +I +YN +     + G    A +Y + ++  G++P   TY  ++ G+
Sbjct: 208 QVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGY 267

Query: 296 FLSLRLETAKRFYEDMKTRGIAPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGNNLLPTV 355
                L++A + + +M  +G   + V Y  +I+G    + ++EA   F +MK +   PTV
Sbjct: 268 CQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTV 327

Query: 356 ISYTTMIKGYVSSGRVDDGLRLFEEMKAVGVKPNDFTYSTLLPGLCDAEKMSEARQILTE 415
            +YT +IK    S R  + L L +EM+  G+KPN  TY+ L+  LC   K  +AR++L +
Sbjct: 328 RTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKARELLGQ 387

Query: 416 MVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAG 475
           M++K + P +   +  L++  CK G ++ A+ V++ M    +      Y  LI+  CK+ 
Sbjct: 388 MLEKGLMP-NVITYNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCKSN 447

Query: 476 VYDRAVKLLDKLVEKEIILKPQSTLEMEASAYNPIIQYLCDHGQTGKAETFFRQLLKKG- 535
           V+ +A+ +L+K++E++++         +   YN +I   C  G    A      +  +G 
Sbjct: 448 VH-KAMGVLNKMLERKVL--------PDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGL 507

Query: 536 IQDEVAFNNLIRGHSKEGNPELAFEILKIMGRKSVSRDAESYKLLIKSYLSKGEPADAKT 595
           + D+  + ++I    K    E A ++   + +K V+ +   Y  LI  Y   G+  +A  
Sbjct: 508 VPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHL 567

Query: 596 ALDSMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENMDLVAKILEALF 655
            L+ M+     P+S  F +++  L ADG+++ A+ +   M+  G+   +     ++  L 
Sbjct: 568 MLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLL 627

Query: 656 MRGHVEEALGRVDLLMRCSCPPD---FDSLLSVLCEKGKTIAALKLLDFGLERECNIEVS 715
             G  + A  R   ++     PD   + + +   C +G+ + A  ++    E   + ++ 
Sbjct: 628 KDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLF 687

Query: 716 SYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWSSCDDLINSLNQEGHTKQ 738
           +Y  ++      G+T  A+ +L ++ + G      +   LI  L +  + KQ
Sbjct: 688 TYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLIKHLLEMKYGKQ 725

BLAST of Cp4.1LG18g05980 vs. ExPASy Swiss-Prot
Match: Q9LPX2 (Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g12775 PE=2 SV=1)

HSP 1 Score: 206.1 bits (523), Expect = 1.4e-51
Identity = 127/432 (29.40%), Postives = 216/432 (50.00%), Query Frame = 0

Query: 132 KNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDMPNKGVEWDEDL 191
           K SD  +   R VE    FQ +  T+  ++ ++ ++ +   A  +L  M  + ++ D   
Sbjct: 208 KVSDAVVLIDRMVETG--FQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVK 267

Query: 192 FVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRYMMAKRYFNAML 251
           + I+ID   K G +  A  +F +M+  G +  I +YN L       GR+    +    M+
Sbjct: 268 YSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLRDMI 327

Query: 252 NEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTMINGYYRFKMME 311
              I P   T++V++  F    +L  A +  ++M  RGIAP+ +TYN++I+G+ +   +E
Sbjct: 328 KRKISPNVVTFSVLIDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDGFCKENRLE 387

Query: 312 EAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGVKPNDFTYSTLL 371
           EA Q    M      P ++++  +I GY  + R+DDGL LF EM   GV  N  TY+TL+
Sbjct: 388 EAIQMVDLMISKGCDPDIMTFNILINGYCKANRIDDGLELFREMSLRGVIANTVTYNTLV 447

Query: 372 PGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSI 431
            G C + K+  A+++  EMV + + P D   +  LL   C +G+L+ A+ +   + +  +
Sbjct: 448 QGFCQSGKLEVAKKLFQEMVSRRVRP-DIVSYKILLDGLCDNGELEKALEIFGKIEKSKM 507

Query: 432 PTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASAYNPIIQYLCDH 491
             + G Y I+I   C A   D A  L   L        P   ++++A AYN +I  LC  
Sbjct: 508 ELDIGIYMIIIHGMCNASKVDDAWDLFCSL--------PLKGVKLDARAYNIMISELCRK 567

Query: 492 GQTGKAETFFRQLLKKG-IQDEVAFNNLIRGHSKEGNPELAFEILKIMGRKSVSRDAESY 551
               KA+  FR++ ++G   DE+ +N LIR H  + +   A E+++ M       D  + 
Sbjct: 568 DSLSKADILFRKMTEEGHAPDELTYNILIRAHLGDDDATTAAELIEEMKSSGFPADVSTV 627

Query: 552 KLLIKSYLSKGE 563
           K++I + LS GE
Sbjct: 628 KMVI-NMLSSGE 627

BLAST of Cp4.1LG18g05980 vs. ExPASy Swiss-Prot
Match: Q6NQ83 (Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22470 PE=1 SV=1)

HSP 1 Score: 196.1 bits (497), Expect = 1.5e-48
Identity = 117/406 (28.82%), Postives = 206/406 (50.74%), Query Frame = 0

Query: 150 FQHDRGTHFKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVIMIDSYGKAGIVQEAV 209
           FQ D  T+  ++  L ++     A  +   M  + ++     + I+IDS  K G   +A+
Sbjct: 206 FQPDEVTYGPVLNRLCKSGNSALALDLFRKMEERNIKASVVQYSIVIDSLCKDGSFDDAL 265

Query: 210 KIFQKMKELGVERSIKSYNVLFKVILRRGRYMMAKRYFNAMLNEGIEPTCHTYNVMLWGF 269
            +F +M+  G++  + +Y+ L   +   G++    +    M+   I P   T++ ++  F
Sbjct: 266 SLFNEMEMKGIKADVVTYSSLIGGLCNDGKWDDGAKMLREMIGRNIIPDVVTFSALIDVF 325

Query: 270 FLSLRLETAKRFYEDMKTRGIAPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGNNLLPTV 329
               +L  AK  Y +M TRGIAPD +TYN++I+G+ +   + EA Q F  M      P +
Sbjct: 326 VKEGKLLEAKELYNEMITRGIAPDTITYNSLIDGFCKENCLHEANQMFDLMVSKGCEPDI 385

Query: 330 ISYTTMIKGYVSSGRVDDGLRLFEEMKAVGVKPNDFTYSTLLPGLCDAEKMSEARQILTE 389
           ++Y+ +I  Y  + RVDDG+RLF E+ + G+ PN  TY+TL+ G C + K++ A+++  E
Sbjct: 386 VTYSILINSYCKAKRVDDGMRLFREISSKGLIPNTITYNTLVLGFCQSGKLNAAKELFQE 445

Query: 390 MVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAG 449
           MV + + P   + +  LL   C +G+L+ A+ + + M +  +    G Y I+I   C A 
Sbjct: 446 MVSRGVPPSVVT-YGILLDGLCDNGELNKALEIFEKMQKSRMTLGIGIYNIIIHGMCNAS 505

Query: 450 VYDRAVKLLDKLVEKEIILKPQSTLEMEASAYNPIIQYLCDHGQTGKAETFFRQLLKKG- 509
             D A  L   L +K +  KP      +   YN +I  LC  G   +A+  FR++ + G 
Sbjct: 506 KVDDAWSLFCSLSDKGV--KP------DVVTYNVMIGGLCKKGSLSEADMLFRKMKEDGC 565

Query: 510 IQDEVAFNNLIRGHSKEGNPELAFEILKIMGRKSVSRDAESYKLLI 555
             D+  +N LIR H        + E+++ M     S D+ + K++I
Sbjct: 566 TPDDFTYNILIRAHLGGSGLISSVELIEEMKVCGFSADSSTIKMVI 602

BLAST of Cp4.1LG18g05980 vs. NCBI nr
Match: XP_023515807.1 (pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1509 bits (3906), Expect = 0.0
Identity = 761/761 (100.00%), Postives = 761/761 (100.00%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60
           MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA
Sbjct: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60

Query: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120
           VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD
Sbjct: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120

Query: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180
           HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM
Sbjct: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180

Query: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240
           PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY
Sbjct: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240

Query: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300
           MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM
Sbjct: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300

Query: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360
           INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV
Sbjct: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360

Query: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420
           KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM
Sbjct: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420

Query: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480
           HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA
Sbjct: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480

Query: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540
           YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR
Sbjct: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540

Query: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600
           KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT
Sbjct: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600

Query: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660
           ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC
Sbjct: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660

Query: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720
           EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS
Sbjct: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720

Query: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA 761
           SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA
Sbjct: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA 761

BLAST of Cp4.1LG18g05980 vs. NCBI nr
Match: KAG6589742.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1489 bits (3855), Expect = 0.0
Identity = 748/761 (98.29%), Postives = 757/761 (99.47%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60
           MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSS QDPTTTATQNESPKDPFVSSDAA
Sbjct: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSTQDPTTTATQNESPKDPFVSSDAA 60

Query: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120
           VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD
Sbjct: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120

Query: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180
           HSIVWNVLHAAKNSDHAL+FFRWVER+GLFQHDRGTHFKIIEILGRASKLNHARCILLDM
Sbjct: 121 HSIVWNVLHAAKNSDHALKFFRWVERAGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180

Query: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240
           PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY
Sbjct: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240

Query: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300
           MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM
Sbjct: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300

Query: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360
           INGYYRFKMMEEAEQFFTEMKG NL+PTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV
Sbjct: 301 INGYYRFKMMEEAEQFFTEMKGKNLVPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360

Query: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420
           KPNDFTYSTLLPGLCDAE+MSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM
Sbjct: 361 KPNDFTYSTLLPGLCDAERMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420

Query: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480
           HVLKAMIRLS+PTE+GHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA
Sbjct: 421 HVLKAMIRLSVPTESGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480

Query: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540
           YNP+IQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR
Sbjct: 481 YNPVIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540

Query: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600
           KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDS LFRSVMESLFADGRVQT
Sbjct: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSGLFRSVMESLFADGRVQT 600

Query: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660
           ASRVMNSMLDKGITEN+DLVAKILEALFMRGHVEEALGRVDLLMR SCPPDFDSLLSVLC
Sbjct: 601 ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRVDLLMRSSCPPDFDSLLSVLC 660

Query: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720
           EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS
Sbjct: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720

Query: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA 761
           SCDDLI+SLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA
Sbjct: 721 SCDDLISSLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA 761

BLAST of Cp4.1LG18g05980 vs. NCBI nr
Match: XP_022921867.1 (pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita moschata] >KAG7023417.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1486 bits (3846), Expect = 0.0
Identity = 747/761 (98.16%), Postives = 755/761 (99.21%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60
           MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSS QDPTTTATQNESPKDPFVSSDAA
Sbjct: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSTQDPTTTATQNESPKDPFVSSDAA 60

Query: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120
           VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD
Sbjct: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120

Query: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180
           HSIVWNVLHAAKNSDHAL+FFRWVER+GLFQHDRGTHFKIIEILGRASKLNHARCILLDM
Sbjct: 121 HSIVWNVLHAAKNSDHALKFFRWVERAGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180

Query: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240
           PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY
Sbjct: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240

Query: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300
           MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM
Sbjct: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300

Query: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360
           INGYYRFKMMEEAEQFFTEMKG NL+PTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV
Sbjct: 301 INGYYRFKMMEEAEQFFTEMKGKNLVPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360

Query: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420
           KPNDFTYSTLLPGLCDAE+MSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM
Sbjct: 361 KPNDFTYSTLLPGLCDAERMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420

Query: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480
           HVLKAM RLS+PTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA
Sbjct: 421 HVLKAMTRLSVPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480

Query: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540
           YNP+IQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR
Sbjct: 481 YNPVIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540

Query: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600
           KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT
Sbjct: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600

Query: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660
           ASRVMNSML KGITEN+DLVAKILEALFMRGHVEEALGRVDLLMR SCPPDFDSLLSVLC
Sbjct: 601 ASRVMNSMLHKGITENLDLVAKILEALFMRGHVEEALGRVDLLMRSSCPPDFDSLLSVLC 660

Query: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720
           EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS
Sbjct: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720

Query: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA 761
           SCDDLI+SLNQEGHTKQADILSRMIKGGDRIRRKKASP AA
Sbjct: 721 SCDDLISSLNQEGHTKQADILSRMIKGGDRIRRKKASPTAA 761

BLAST of Cp4.1LG18g05980 vs. NCBI nr
Match: XP_022988550.1 (pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita maxima])

HSP 1 Score: 1442 bits (3734), Expect = 0.0
Identity = 725/761 (95.27%), Postives = 742/761 (97.50%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60
           MAHISLSKPHYSHLKV SSSSISK ISFNSLHFFSS QDP +T TQNESP DPFVSSDAA
Sbjct: 1   MAHISLSKPHYSHLKVFSSSSISKLISFNSLHFFSSTQDPISTPTQNESPNDPFVSSDAA 60

Query: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120
           VPQ VEPVAVNGGDQVKR+IPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD
Sbjct: 61  VPQSVEPVAVNGGDQVKRTIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120

Query: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180
           HS+VWNVLHAAKNSDHAL+FFRWVER+GLFQHDRGTHFKIIEILGRASKLNHARCILLDM
Sbjct: 121 HSLVWNVLHAAKNSDHALKFFRWVERAGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180

Query: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240
           P KGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSY+ LFKVILRRGRY
Sbjct: 181 PKKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYDALFKVILRRGRY 240

Query: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300
           MMAKRYFN MLNEGIEPT HTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM
Sbjct: 241 MMAKRYFNTMLNEGIEPTRHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300

Query: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360
           INGY+RFKMMEEAEQFFTEMKG N++PTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV
Sbjct: 301 INGYHRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360

Query: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420
           KPND TYSTLLPGLCDAEKM EA QILTEMVD+YIAPKDNSIFMRLLSCQC HGDLDAAM
Sbjct: 361 KPNDVTYSTLLPGLCDAEKMFEAGQILTEMVDRYIAPKDNSIFMRLLSCQCMHGDLDAAM 420

Query: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480
           HVLKAM RLS+PTEAGHYGILIENCCKAGVYDRAVKLLDKLV+KEIILKPQSTLEMEASA
Sbjct: 421 HVLKAMNRLSVPTEAGHYGILIENCCKAGVYDRAVKLLDKLVKKEIILKPQSTLEMEASA 480

Query: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540
           YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDE+AFNNLIRGHSKEGNPELAFE+LKIMGR
Sbjct: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEIAFNNLIRGHSKEGNPELAFEMLKIMGR 540

Query: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600
           KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT
Sbjct: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600

Query: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660
           ASRVMNSMLDKGITEN+DLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC
Sbjct: 601 ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660

Query: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720
           EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS
Sbjct: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720

Query: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA 761
           SCDDLI+SLNQEGHTKQAD+LSRMIKGGD IRRK ASPAAA
Sbjct: 721 SCDDLISSLNQEGHTKQADVLSRMIKGGDGIRRKNASPAAA 761

BLAST of Cp4.1LG18g05980 vs. NCBI nr
Match: XP_038880029.1 (pentatricopeptide repeat-containing protein At2g37230 [Benincasa hispida])

HSP 1 Score: 1356 bits (3509), Expect = 0.0
Identity = 681/760 (89.61%), Postives = 715/760 (94.08%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60
           MAHIS+SK H SH +VLSSSSI KP +  SLHFFSS Q+P +TATQNESP  P  SSDAA
Sbjct: 1   MAHISVSKLHSSHYRVLSSSSIPKPTALYSLHFFSSTQEPISTATQNESPNGPSASSDAA 60

Query: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120
           VPQP E VAVNG +QVKR  PRG  RNPEKLED+IC+MMANREWTTRLQNSIRSLVPQFD
Sbjct: 61  VPQPAESVAVNGAEQVKRRTPRGKPRNPEKLEDVICKMMANREWTTRLQNSIRSLVPQFD 120

Query: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180
           HS+VWNVLHAAKNSDHAL FFRWVER+GLFQHDR TH KIIEILGRASKLNHARCILLDM
Sbjct: 121 HSLVWNVLHAAKNSDHALNFFRWVERAGLFQHDRETHLKIIEILGRASKLNHARCILLDM 180

Query: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240
            NKG+EWDEDLFVI+I+SYGKAGIVQEAVKIFQKMKELGVERS+KSY+ LFKVILRRGRY
Sbjct: 181 LNKGIEWDEDLFVILIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKVILRRGRY 240

Query: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300
           MMAKRYFNAMLNEGIEPT HTYNVMLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Sbjct: 241 MMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM 300

Query: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360
           INGYYRFKMMEEAEQFFTEMKG N++PTVISYTTMIKGYVS+GRVDDGLRLFEEMKAVGV
Sbjct: 301 INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSAGRVDDGLRLFEEMKAVGV 360

Query: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420
           KPND TYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQC HGDLDAAM
Sbjct: 361 KPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCMHGDLDAAM 420

Query: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480
           HVLKAMIRLSIPTEAGHYGILIENCCKAG+YD+AVKLLDKLVEKEIIL+PQSTLEMEASA
Sbjct: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDKAVKLLDKLVEKEIILRPQSTLEMEASA 480

Query: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540
           YN IIQYLC+HGQTGKA+TFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR
Sbjct: 481 YNLIIQYLCNHGQTGKADTFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540

Query: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600
           + VSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLF DGRVQT
Sbjct: 541 RDVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFTDGRVQT 600

Query: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660
           ASRVMNSMLDKGITEN+DLVAKILEAL MRGHVEEALGR+DLLM C+CPPDFDSLLSVLC
Sbjct: 601 ASRVMNSMLDKGITENLDLVAKILEALLMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLC 660

Query: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720
           E+GKTIAALKLLDFGLERECNIE SSYEKVLDALL AGKTLNAY+ILCKIMEKGGA +W 
Sbjct: 661 ERGKTIAALKLLDFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGANDWG 720

Query: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAA 760
           S DDLI SLNQEG+TKQADILSR +KGGDR R KK S AA
Sbjct: 721 SFDDLIKSLNQEGNTKQADILSRKMKGGDRKRCKKPSLAA 760

BLAST of Cp4.1LG18g05980 vs. ExPASy TrEMBL
Match: A0A6J1E1L0 (pentatricopeptide repeat-containing protein At2g37230-like OS=Cucurbita moschata OX=3662 GN=LOC111430005 PE=4 SV=1)

HSP 1 Score: 1486 bits (3846), Expect = 0.0
Identity = 747/761 (98.16%), Postives = 755/761 (99.21%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60
           MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSS QDPTTTATQNESPKDPFVSSDAA
Sbjct: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSTQDPTTTATQNESPKDPFVSSDAA 60

Query: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120
           VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD
Sbjct: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120

Query: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180
           HSIVWNVLHAAKNSDHAL+FFRWVER+GLFQHDRGTHFKIIEILGRASKLNHARCILLDM
Sbjct: 121 HSIVWNVLHAAKNSDHALKFFRWVERAGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180

Query: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240
           PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY
Sbjct: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240

Query: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300
           MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM
Sbjct: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300

Query: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360
           INGYYRFKMMEEAEQFFTEMKG NL+PTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV
Sbjct: 301 INGYYRFKMMEEAEQFFTEMKGKNLVPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360

Query: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420
           KPNDFTYSTLLPGLCDAE+MSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM
Sbjct: 361 KPNDFTYSTLLPGLCDAERMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420

Query: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480
           HVLKAM RLS+PTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA
Sbjct: 421 HVLKAMTRLSVPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480

Query: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540
           YNP+IQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR
Sbjct: 481 YNPVIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540

Query: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600
           KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT
Sbjct: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600

Query: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660
           ASRVMNSML KGITEN+DLVAKILEALFMRGHVEEALGRVDLLMR SCPPDFDSLLSVLC
Sbjct: 601 ASRVMNSMLHKGITENLDLVAKILEALFMRGHVEEALGRVDLLMRSSCPPDFDSLLSVLC 660

Query: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720
           EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS
Sbjct: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720

Query: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA 761
           SCDDLI+SLNQEGHTKQADILSRMIKGGDRIRRKKASP AA
Sbjct: 721 SCDDLISSLNQEGHTKQADILSRMIKGGDRIRRKKASPTAA 761

BLAST of Cp4.1LG18g05980 vs. ExPASy TrEMBL
Match: A0A6J1JJW0 (pentatricopeptide repeat-containing protein At2g37230-like OS=Cucurbita maxima OX=3661 GN=LOC111485759 PE=4 SV=1)

HSP 1 Score: 1442 bits (3734), Expect = 0.0
Identity = 725/761 (95.27%), Postives = 742/761 (97.50%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60
           MAHISLSKPHYSHLKV SSSSISK ISFNSLHFFSS QDP +T TQNESP DPFVSSDAA
Sbjct: 1   MAHISLSKPHYSHLKVFSSSSISKLISFNSLHFFSSTQDPISTPTQNESPNDPFVSSDAA 60

Query: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120
           VPQ VEPVAVNGGDQVKR+IPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD
Sbjct: 61  VPQSVEPVAVNGGDQVKRTIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120

Query: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180
           HS+VWNVLHAAKNSDHAL+FFRWVER+GLFQHDRGTHFKIIEILGRASKLNHARCILLDM
Sbjct: 121 HSLVWNVLHAAKNSDHALKFFRWVERAGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180

Query: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240
           P KGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSY+ LFKVILRRGRY
Sbjct: 181 PKKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYDALFKVILRRGRY 240

Query: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300
           MMAKRYFN MLNEGIEPT HTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM
Sbjct: 241 MMAKRYFNTMLNEGIEPTRHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300

Query: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360
           INGY+RFKMMEEAEQFFTEMKG N++PTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV
Sbjct: 301 INGYHRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360

Query: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420
           KPND TYSTLLPGLCDAEKM EA QILTEMVD+YIAPKDNSIFMRLLSCQC HGDLDAAM
Sbjct: 361 KPNDVTYSTLLPGLCDAEKMFEAGQILTEMVDRYIAPKDNSIFMRLLSCQCMHGDLDAAM 420

Query: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480
           HVLKAM RLS+PTEAGHYGILIENCCKAGVYDRAVKLLDKLV+KEIILKPQSTLEMEASA
Sbjct: 421 HVLKAMNRLSVPTEAGHYGILIENCCKAGVYDRAVKLLDKLVKKEIILKPQSTLEMEASA 480

Query: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540
           YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDE+AFNNLIRGHSKEGNPELAFE+LKIMGR
Sbjct: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEIAFNNLIRGHSKEGNPELAFEMLKIMGR 540

Query: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600
           KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT
Sbjct: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600

Query: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660
           ASRVMNSMLDKGITEN+DLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC
Sbjct: 601 ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660

Query: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720
           EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS
Sbjct: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720

Query: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAAA 761
           SCDDLI+SLNQEGHTKQAD+LSRMIKGGD IRRK ASPAAA
Sbjct: 721 SCDDLISSLNQEGHTKQADVLSRMIKGGDGIRRKNASPAAA 761

BLAST of Cp4.1LG18g05980 vs. ExPASy TrEMBL
Match: A0A6J1C431 (pentatricopeptide repeat-containing protein At2g37230 OS=Momordica charantia OX=3673 GN=LOC111007208 PE=3 SV=1)

HSP 1 Score: 1330 bits (3441), Expect = 0.0
Identity = 669/756 (88.49%), Postives = 707/756 (93.52%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60
           MAHISLS+P++   +VLSSSSIS P + N LHFFSS Q+       NESP DP  SSDAA
Sbjct: 1   MAHISLSRPNF---RVLSSSSISNPSALNLLHFFSSTQEQIP----NESPSDPSASSDAA 60

Query: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120
            PQP    AVNG +QVK+  PRG  RNPEKLED+ICRMMANREWTTRLQNSIRSLVP+FD
Sbjct: 61  APQPGGRAAVNGAEQVKQRTPRGKHRNPEKLEDVICRMMANREWTTRLQNSIRSLVPEFD 120

Query: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180
           HS+VWNVLHAAKNSDHAL+FFRWVERSGLF+HDR TH KIIEILGRASKLNHARCILLDM
Sbjct: 121 HSLVWNVLHAAKNSDHALKFFRWVERSGLFRHDRDTHLKIIEILGRASKLNHARCILLDM 180

Query: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240
           PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSY+ LFKVILRRGRY
Sbjct: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYDALFKVILRRGRY 240

Query: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300
           MMAKRYFNAMLNEGIEPT HTYNVMLWGFFLSLRLETAK+FYEDMK+RGI+PDVVTYNTM
Sbjct: 241 MMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETAKKFYEDMKSRGISPDVVTYNTM 300

Query: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360
           ING+YRFKMMEEAEQFFTEMKG N++PTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGV
Sbjct: 301 INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGV 360

Query: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420
           KPND TYSTLLPGLC+AEKMSEARQIL EMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM
Sbjct: 361 KPNDITYSTLLPGLCNAEKMSEARQILIEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420

Query: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480
           HVLKAMIRLSIPT+AGHYGILIENCCKA  YD+AVKLLDKLVEKEIIL+PQSTL+MEASA
Sbjct: 421 HVLKAMIRLSIPTDAGHYGILIENCCKAEAYDQAVKLLDKLVEKEIILRPQSTLDMEASA 480

Query: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540
           YNPIIQYLC+HGQTGKAETFFRQL+KKGIQDEVAFNNLIRGHSKEGNPEL +EILKIMGR
Sbjct: 481 YNPIIQYLCNHGQTGKAETFFRQLMKKGIQDEVAFNNLIRGHSKEGNPELGYEILKIMGR 540

Query: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600
           + VSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDS LFRSVMESLFADGRVQT
Sbjct: 541 RGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSTLFRSVMESLFADGRVQT 600

Query: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660
           ASRVM SML+KGITEN+DLVAKILEALFMRGHVEEALGR+DLLM C+CPPDF SLLSVLC
Sbjct: 601 ASRVMKSMLEKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFGSLLSVLC 660

Query: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720
           EKGKTIAALKLLDFGLEREC+IE+SSYEKVLDALL AGKTLNAYSILCKIMEKGGAK+WS
Sbjct: 661 EKGKTIAALKLLDFGLERECDIELSSYEKVLDALLGAGKTLNAYSILCKIMEKGGAKDWS 720

Query: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKA 756
           SCDDLI SLNQEG+TKQAD+LSRMIKGGDR   KKA
Sbjct: 721 SCDDLIRSLNQEGNTKQADVLSRMIKGGDRKGSKKA 749

BLAST of Cp4.1LG18g05980 vs. ExPASy TrEMBL
Match: A0A5A7T0L7 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold271G00160 PE=3 SV=1)

HSP 1 Score: 1314 bits (3400), Expect = 0.0
Identity = 658/760 (86.58%), Postives = 706/760 (92.89%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60
           MAHIS+SK H++H +VLSSSSISKP + NSLHFFSS Q+P + ATQNESP DP  SS+AA
Sbjct: 1   MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISPATQNESPNDPPASSNAA 60

Query: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120
           +PQ  E  AVNG  QVK  IPRG  RN EKLED+ICRMMA+REWTTRLQNSIRSLVPQFD
Sbjct: 61  LPQTAESAAVNGVQQVKGRIPRGRPRNTEKLEDLICRMMASREWTTRLQNSIRSLVPQFD 120

Query: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180
           H +V+NVLHAAK S+HAL FFRWVER+GLFQHDR TH KIIEILG ASKLNHARCILLDM
Sbjct: 121 HCLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHLKIIEILGGASKLNHARCILLDM 180

Query: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240
           PNKGVEWDEDLFV++IDSYGKAGIVQEAVKIF+KMKELGVERS KSY+ LFKVILRRGRY
Sbjct: 181 PNKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFRKMKELGVERSNKSYDALFKVILRRGRY 240

Query: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300
           MMAKRYFNAMLNEG+EPT HTYNVMLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Sbjct: 241 MMAKRYFNAMLNEGLEPTRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM 300

Query: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360
           INGY RFKMMEEAEQFFTEMKG N+ PTVISYTTMIKGYVS GRVDDGLRLFEEMKA G 
Sbjct: 301 INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAAGE 360

Query: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420
           KPND TYSTLLPGLCDAEK+ EAR+ILTEMV +YIAPKDNSIFMRLLSCQCKHGDLDAAM
Sbjct: 361 KPNDITYSTLLPGLCDAEKLPEARKILTEMVARYIAPKDNSIFMRLLSCQCKHGDLDAAM 420

Query: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480
           HVLKAM+RLSIPTEAGHYGILIENCCKAG+YD+AVKLLD+LVEKEIILKPQSTLEMEASA
Sbjct: 421 HVLKAMLRLSIPTEAGHYGILIENCCKAGMYDKAVKLLDQLVEKEIILKPQSTLEMEASA 480

Query: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540
           YN IIQYLC+HGQTGKAE FFRQLLKKGIQDEVAFNNLIRGH+KEGNPE AFE+LKIMGR
Sbjct: 481 YNLIIQYLCNHGQTGKAEIFFRQLLKKGIQDEVAFNNLIRGHAKEGNPEFAFEMLKIMGR 540

Query: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600
           + VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE+GH PDSALFRSVMESLFADGRVQT
Sbjct: 541 RGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT 600

Query: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660
           ASRVMNSMLDKGITEN+DLVAKILEALFMRGH EE LGR++LLM C+CPPDFDSLLSVLC
Sbjct: 601 ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEGLGRINLLMNCNCPPDFDSLLSVLC 660

Query: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720
           EKGKTIAA KLL+FGLERECNI+ SSYEKVLDAL+ AGKTLNAY+ILCKIMEKGGAK+WS
Sbjct: 661 EKGKTIAAFKLLNFGLERECNIQFSSYEKVLDALMGAGKTLNAYAILCKIMEKGGAKDWS 720

Query: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRRKKASPAA 760
           SCDDLI +LNQEG+TKQADILSRM+KGGDR R KK+S AA
Sbjct: 721 SCDDLIKTLNQEGNTKQADILSRMVKGGDRKRSKKSSLAA 760

BLAST of Cp4.1LG18g05980 vs. ExPASy TrEMBL
Match: A0A1S3B8Y6 (pentatricopeptide repeat-containing protein At2g37230 OS=Cucumis melo OX=3656 GN=LOC103487310 PE=3 SV=1)

HSP 1 Score: 1309 bits (3388), Expect = 0.0
Identity = 654/753 (86.85%), Postives = 701/753 (93.09%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSLHFFSSIQDPTTTATQNESPKDPFVSSDAA 60
           MAHIS+SK H++H +VLSSSSISKP + NSLHFFSS Q+P + ATQNESP DP  SS+AA
Sbjct: 1   MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISPATQNESPNDPPASSNAA 60

Query: 61  VPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFD 120
           +PQ  E  AVNG  QVK  IPRG  RN EKLED+ICRMMA+REWTTRLQNSIRSLVPQFD
Sbjct: 61  LPQTAESAAVNGVQQVKGRIPRGRPRNTEKLEDLICRMMASREWTTRLQNSIRSLVPQFD 120

Query: 121 HSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDM 180
           H +V+NVLHAAK S+HAL FFRWVER+GLFQHDR TH KIIEILG ASKLNHARCILLDM
Sbjct: 121 HCLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHLKIIEILGGASKLNHARCILLDM 180

Query: 181 PNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRY 240
           PNKGVEWDEDLFV++IDSYGKAGIVQEAVKIF+KMKELGVERS KSY+ LFKVILRRGRY
Sbjct: 181 PNKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFRKMKELGVERSNKSYDALFKVILRRGRY 240

Query: 241 MMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTM 300
           MMAKRYFNAMLNEG+EPT HTYNVMLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Sbjct: 241 MMAKRYFNAMLNEGLEPTRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM 300

Query: 301 INGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGV 360
           INGY RFKMMEEAEQFFTEMKG N+ PTVISYTTMIKGYVS GRVDDGLRLFEEMKA G 
Sbjct: 301 INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAAGE 360

Query: 361 KPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM 420
           KPND TYSTLLPGLCDAEK+ EAR+ILTEMV +YIAPKDNSIFMRLLSCQCKHGDLDAAM
Sbjct: 361 KPNDITYSTLLPGLCDAEKLPEARKILTEMVARYIAPKDNSIFMRLLSCQCKHGDLDAAM 420

Query: 421 HVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASA 480
           HVLKAM+RLSIPTEAGHYGILIENCCKAG+YD+AVKLLD+LVEKEIILKPQSTLEMEASA
Sbjct: 421 HVLKAMLRLSIPTEAGHYGILIENCCKAGMYDKAVKLLDQLVEKEIILKPQSTLEMEASA 480

Query: 481 YNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGR 540
           YN IIQYLC+HGQTGKAE FFRQLLKKGIQDEVAFNNLIRGH+KEGNPE AFE+LKIMGR
Sbjct: 481 YNLIIQYLCNHGQTGKAEIFFRQLLKKGIQDEVAFNNLIRGHAKEGNPEFAFEMLKIMGR 540

Query: 541 KSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQT 600
           + VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE+GH PDSALFRSVMESLFADGRVQT
Sbjct: 541 RGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT 600

Query: 601 ASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLC 660
           ASRVMNSMLDKGITEN+DLVAKILEALFMRGH EE LGR++LLM C+CPPDFDSLLSVLC
Sbjct: 601 ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEGLGRINLLMNCNCPPDFDSLLSVLC 660

Query: 661 EKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWS 720
           EKGKTIAA KLL+FGLERECNI+ SSYEKVLDAL+ AGKTLNAY+ILCKIMEKGGAK+WS
Sbjct: 661 EKGKTIAAFKLLNFGLERECNIQFSSYEKVLDALMGAGKTLNAYAILCKIMEKGGAKDWS 720

Query: 721 SCDDLINSLNQEGHTKQADILSRMIKGGDRIRR 753
           SCDDLI +LNQEG+TKQADILSRM+KGGDR RR
Sbjct: 721 SCDDLIKTLNQEGNTKQADILSRMVKGGDRKRR 753

BLAST of Cp4.1LG18g05980 vs. TAIR 10
Match: AT2G37230.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 1012.3 bits (2616), Expect = 2.1e-295
Identity = 512/757 (67.64%), Postives = 611/757 (80.71%), Query Frame = 0

Query: 1   MAHISLSKPHYSHLKVLSSSSISKPISFNSL-HFFSSIQDPTTTATQNESPKDPFVSSDA 60
           MA IS SK + S  +V  S   S   S  SL   FS+I++  T A  N   + P   S+ 
Sbjct: 1   MAFISRSKRYQSKARVYLSLPRSSNSSLFSLPRLFSTIEETQTPANANPETQSPDAKSET 60

Query: 61  AVPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQF 120
                 + +       ++    RG R+N EKLED ICRMM NR WTTRLQNSIR LVP++
Sbjct: 61  K-----KNLTSTETRPLRERFQRGKRQNHEKLEDTICRMMDNRAWTTRLQNSIRDLVPEW 120

Query: 121 DHSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLD 180
           DHS+V+NVLH AK  +HALQFFRW ERSGL +HDR TH K+I++LG  SKLNHARCILLD
Sbjct: 121 DHSLVYNVLHGAKKLEHALQFFRWTERSGLIRHDRDTHMKMIKMLGEVSKLNHARCILLD 180

Query: 181 MPNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGR 240
           MP KGV WDED+FV++I+SYGKAGIVQE+VKIFQKMK+LGVER+IKSYN LFKVILRRGR
Sbjct: 181 MPEKGVPWDEDMFVVLIESYGKAGIVQESVKIFQKMKDLGVERTIKSYNSLFKVILRRGR 240

Query: 241 YMMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNT 300
           YMMAKRYFN M++EG+EPT HTYN+MLWGFFLSLRLETA RF+EDMKTRGI+PD  T+NT
Sbjct: 241 YMMAKRYFNKMVSEGVEPTRHTYNLMLWGFFLSLRLETALRFFEDMKTRGISPDDATFNT 300

Query: 301 MINGYYRFKMMEEAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVG 360
           MING+ RFK M+EAE+ F EMKGN + P+V+SYTTMIKGY++  RVDDGLR+FEEM++ G
Sbjct: 301 MINGFCRFKKMDEAEKLFVEMKGNKIGPSVVSYTTMIKGYLAVDRVDDGLRIFEEMRSSG 360

Query: 361 VKPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAA 420
           ++PN  TYSTLLPGLCDA KM EA+ IL  M+ K+IAPKDNSIF++LL  Q K GD+ AA
Sbjct: 361 IEPNATTYSTLLPGLCDAGKMVEAKNILKNMMAKHIAPKDNSIFLKLLVSQSKAGDMAAA 420

Query: 421 MHVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEAS 480
             VLKAM  L++P EAGHYG+LIEN CKA  Y+RA+KLLD L+EKEIIL+ Q TLEME S
Sbjct: 421 TEVLKAMATLNVPAEAGHYGVLIENQCKASAYNRAIKLLDTLIEKEIILRHQDTLEMEPS 480

Query: 481 AYNPIIQYLCDHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMG 540
           AYNPII+YLC++GQT KAE  FRQL+K+G+QD+ A NNLIRGH+KEGNP+ ++EILKIM 
Sbjct: 481 AYNPIIEYLCNNGQTAKAEVLFRQLMKRGVQDQDALNNLIRGHAKEGNPDSSYEILKIMS 540

Query: 541 RKSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQ 600
           R+ V R++ +Y+LLIKSY+SKGEP DAKTALDSM+E GH PDS+LFRSV+ESLF DGRVQ
Sbjct: 541 RRGVPRESNAYELLIKSYMSKGEPGDAKTALDSMVEDGHVPDSSLFRSVIESLFEDGRVQ 600

Query: 601 TASRVMNSMLDK--GITENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLS 660
           TASRVM  M+DK  GI +NMDL+AKILEAL MRGHVEEALGR+DLL +     D DSLLS
Sbjct: 601 TASRVMMIMIDKNVGIEDNMDLIAKILEALLMRGHVEEALGRIDLLNQNGHTADLDSLLS 660

Query: 661 VLCEKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAK 720
           VL EKGKTIAALKLLDFGLER+ ++E SSY+KVLDALL AGKTLNAYS+LCKIMEKG + 
Sbjct: 661 VLSEKGKTIAALKLLDFGLERDLSLEFSSYDKVLDALLGAGKTLNAYSVLCKIMEKGSST 720

Query: 721 EWSSCDDLINSLNQEGHTKQADILSRMIKGGDRIRRK 755
           +W S D+LI SLNQEG+TKQAD+LSRMIK G  I+++
Sbjct: 721 DWKSSDELIKSLNQEGNTKQADVLSRMIKKGQGIKKQ 752

BLAST of Cp4.1LG18g05980 vs. TAIR 10
Match: AT1G02060.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 368.2 bits (944), Expect = 1.6e-101
Identity = 223/669 (33.33%), Postives = 358/669 (53.51%), Query Frame = 0

Query: 85  RRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQ--FDHSIVWNVLHAAKNSDHALQFFR 144
           R    KL   + R + +  W+  L++S+ SL P      + V   L   K     L+FF 
Sbjct: 30  RSTKSKLARSLARAVNSNPWSDELESSLSSLHPSQTISRTTVLQTLRLIKVPADGLRFFD 89

Query: 145 WVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDMPNKG---VEWDEDLFVIMIDSY 204
           WV   G F H   + F ++E LGRA  LN AR  L  +  +    V+  +  F  +I SY
Sbjct: 90  WVSNKG-FSHKEQSFFLMLEFLGRARNLNVARNFLFSIERRSNGCVKLQDRYFNSLIRSY 149

Query: 205 GKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRYMMAKRYFNAMLNE-GIEPT 264
           G AG+ QE+VK+FQ MK++G+  S+ ++N L  ++L+RGR  MA   F+ M    G+ P 
Sbjct: 150 GNAGLFQESVKLFQTMKQMGISPSVLTFNSLLSILLKRGRTGMAHDLFDEMRRTYGVTPD 209

Query: 265 CHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTMINGYYRFKMMEEAEQFFT 324
            +T+N ++ GF  +  ++ A R ++DM+     PDVVTYNT+I+G  R   ++ A    +
Sbjct: 210 SYTFNTLINGFCKNSMVDEAFRIFKDMELYHCNPDVVTYNTIIDGLCRAGKVKIAHNVLS 269

Query: 325 EM--KGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGVKPNDFTYSTLLPGLCD 384
            M  K  ++ P V+SYTT+++GY     +D+ + +F +M + G+KPN  TY+TL+ GL +
Sbjct: 270 GMLKKATDVHPNVVSYTTLVRGYCMKQEIDEAVLVFHDMLSRGLKPNAVTYNTLIKGLSE 329

Query: 385 AEKMSEARQILTEMVDKY--IAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTE 444
           A +  E + IL    D +   AP D   F  L+   C  G LDAAM V + M+ + +  +
Sbjct: 330 AHRYDEIKDILIGGNDAFTTFAP-DACTFNILIKAHCDAGHLDAAMKVFQEMLNMKLHPD 389

Query: 445 AGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASAYNPIIQYLCDHGQT 504
           +  Y +LI   C    +DRA  L ++L EKE++L       + A+AYNP+ +YLC +G+T
Sbjct: 390 SASYSVLIRTLCMRNEFDRAETLFNELFEKEVLLGKDECKPL-AAAYNPMFEYLCANGKT 449

Query: 505 GKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAFEILKIMGRKSVSRDAESYKLLI 564
            +AE  FRQL+K+G+QD  ++  LI GH +EG  + A+E+L +M R+    D E+Y+LLI
Sbjct: 450 KQAEKVFRQLMKRGVQDPPSYKTLITGHCREGKFKPAYELLVLMLRREFVPDLETYELLI 509

Query: 565 KSYLSKGEPADAKTALDSMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGIT 624
              L  GE   A   L  M+ S + P +  F SV+  L        +  ++  ML+K I 
Sbjct: 510 DGLLKIGEALLAHDTLQRMLRSSYLPVATTFHSVLAELAKRKFANESFCLVTLMLEKRIR 569

Query: 625 ENMDLVAKILEALFMRGHVEEALGRVDLLMRCSCPPDFDSLLSVLCEKGKTIAALKLLDF 684
           +N+DL  +++  LF     E+A   V LL         + LL  LCE  K + A  L+ F
Sbjct: 570 QNIDLSTQVVRLLFSSAQKEKAFLIVRLLYDNGYLVKMEELLGYLCENRKLLDAHTLVLF 629

Query: 685 GLERECNIEVSSYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWSSCDDLINSLNQEGH 744
            LE+   +++ +   V++ L    +   A+S+  +++E G  ++ S    L N+L   G 
Sbjct: 630 CLEKSQMVDIDTCNTVIEGLCKHKRHSEAFSLYNELVELGNHQQLSCHVVLRNALEAAGK 689

BLAST of Cp4.1LG18g05980 vs. TAIR 10
Match: AT1G30290.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 224.9 bits (572), Expect = 2.1e-58
Identity = 155/649 (23.88%), Postives = 300/649 (46.22%), Query Frame = 0

Query: 104 WTTRLQNSIRSLVPQFDHSIVWNVLHAAKNSDHALQFFRWVERSGLFQHDRGTHFKIIEI 163
           W  + +  +R+L+     S V  VL +  +   AL+FF W +R   ++HD   ++ ++E+
Sbjct: 157 WNPKHEGQMRNLLRSLKPSQVCAVLRSQDDERVALKFFYWADRQWRYRHDPMVYYSMLEV 216

Query: 164 LGRASKLNHARCILLDMPNKGVEWDEDLFVIMIDSYGKAGIVQEAVKIFQKMKELGVERS 223
           L +      +R +L+ M  +G+    + F  ++ SY +AG +++A+K+   M+  GVE +
Sbjct: 217 LSKTKLCQGSRRVLVLMKRRGIYRTPEAFSRVMVSYSRAGQLRDALKVLTLMQRAGVEPN 276

Query: 224 IKSYNVLFKVILRRGRYMMAKRYFNAMLNEGIEPTCHTYNVMLWGFFLSLRLETAKRFYE 283
           +   N    V +R  R   A R+   M   GI P   TYN M+ G+    R+E A    E
Sbjct: 277 LLICNTTIDVFVRANRLEKALRFLERMQVVGIVPNVVTYNCMIRGYCDLHRVEEAIELLE 336

Query: 284 DMKTRGIAPDVVTYNTMINGYYRFKMMEEAEQFFTEM-KGNNLLPTVISYTTMIKGYVSS 343
           DM ++G  PD V+Y T++    + K + E      +M K + L+P  ++Y T+I      
Sbjct: 337 DMHSKGCLPDKVSYYTIMGYLCKEKRIVEVRDLMKKMAKEHGLVPDQVTYNTLIHMLTKH 396

Query: 344 GRVDDGLRLFEEMKAVGVKPNDFTYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI 403
              D+ L   ++ +  G + +   YS ++  LC   +MSEA+ ++ EM+ K   P D   
Sbjct: 397 DHADEALWFLKDAQEKGFRIDKLGYSAIVHALCKEGRMSEAKDLINEMLSKGHCPPDVVT 456

Query: 404 FMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGVYDRAVKLLDKLV 463
           +  +++  C+ G++D A  +L+ M           Y  L+   C+ G    A ++++  +
Sbjct: 457 YTAVVNGFCRLGEVDKAKKLLQVMHTHGHKPNTVSYTALLNGMCRTGKSLEAREMMN--M 516

Query: 464 EKEIILKPQSTLEMEASAYNPIIQYLCDHGQTGKAETFFRQLLKKG-IQDEVAFNNLIRG 523
            +E    P S        Y+ I+  L   G+  +A    R+++ KG     V  N L++ 
Sbjct: 517 SEEHWWSPNSI------TYSVIMHGLRREGKLSEACDVVREMVLKGFFPGPVEINLLLQS 576

Query: 524 HSKEGNPELAFEILKIMGRKSVSRDAESYKLLIKSYLSKGEPADAKTALDSMIESGHYPD 583
             ++G    A + ++    K  + +  ++  +I  +    E   A + LD M     + D
Sbjct: 577 LCRDGRTHEARKFMEECLNKGCAINVVNFTTVIHGFCQNDELDAALSVLDDMYLINKHAD 636

Query: 584 SALFRSVMESLFADGRVQTASRVMNSMLDKGITENMDLVAKILEALFMRGHVEEALGRVD 643
              + +++++L   GR+  A+ +M  ML KGI         ++      G V++ +  ++
Sbjct: 637 VFTYTTLVDTLGKKGRIAEATELMKKMLHKGIDPTPVTYRTVIHRYCQMGKVDDLVAILE 696

Query: 644 -LLMRCSCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIEVSSYEKVLDALLAAGKT 703
            ++ R  C   ++ ++  LC  GK   A  LL   L      +  +   +++  L  G  
Sbjct: 697 KMISRQKCRTIYNQVIEKLCVLGKLEEADTLLGKVLRTASRSDAKTCYALMEGYLKKGVP 756

Query: 704 LNAYSILCKIMEKGGAKEWSSCDDLINSLNQEGHTKQAD-ILSRMIKGG 749
           L+AY + C++  +    +   C+ L   L  +G   +AD ++ R+++ G
Sbjct: 757 LSAYKVACRMFNRNLIPDVKMCEKLSKRLVLKGKVDEADKLMLRLVERG 797

BLAST of Cp4.1LG18g05980 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 207.6 bits (527), Expect = 3.5e-53
Identity = 158/712 (22.19%), Postives = 320/712 (44.94%), Query Frame = 0

Query: 56  SSDAAVPQPVEPVAVNGGDQVKRSIPRGNRRNPEKLEDIICRMMANREWTTRLQNSIRSL 115
           S+D  VP PV          + R++P     +   +   +  +++   W      S++S+
Sbjct: 28  STDVTVPSPVTRRQFCSVSPLLRNLPE-EESDSMSVPHRLLSILSKPNW--HKSPSLKSM 87

Query: 116 VPQFDHSIVWNVLHAAKNSDHALQFFRWVERSGLFQHD----------------RGTHFK 175
           V     S V ++     +   AL F  W+ ++  ++H                  G  FK
Sbjct: 88  VSAISPSHVSSLFSLDLDPKTALNFSHWISQNPRYKHSVYSYASLLTLLINNGYVGVVFK 147

Query: 176 IIEILGRASKLNHARCILLDMPNKGVEWDE----------DLFVIMIDSYGKAGIVQEAV 235
           I  ++ ++         +LD+  K +  DE            +  +++S  + G+V E  
Sbjct: 148 IRLLMIKSCDSVGDALYVLDLCRK-MNKDERFELKYKLIIGCYNTLLNSLARFGLVDEMK 207

Query: 236 KIFQKMKELGVERSIKSYNVLFKVILRRGRYMMAKRYFNAMLNEGIEPTCHTYNVMLWGF 295
           +++ +M E  V  +I +YN +     + G    A +Y + ++  G++P   TY  ++ G+
Sbjct: 208 QVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIVEAGLDPDFFTYTSLIMGY 267

Query: 296 FLSLRLETAKRFYEDMKTRGIAPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGNNLLPTV 355
                L++A + + +M  +G   + V Y  +I+G    + ++EA   F +MK +   PTV
Sbjct: 268 CQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRIDEAMDLFVKMKDDECFPTV 327

Query: 356 ISYTTMIKGYVSSGRVDDGLRLFEEMKAVGVKPNDFTYSTLLPGLCDAEKMSEARQILTE 415
            +YT +IK    S R  + L L +EM+  G+KPN  TY+ L+  LC   K  +AR++L +
Sbjct: 328 RTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLIDSLCSQCKFEKARELLGQ 387

Query: 416 MVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAG 475
           M++K + P +   +  L++  CK G ++ A+ V++ M    +      Y  LI+  CK+ 
Sbjct: 388 MLEKGLMP-NVITYNALINGYCKRGMIEDAVDVVELMESRKLSPNTRTYNELIKGYCKSN 447

Query: 476 VYDRAVKLLDKLVEKEIILKPQSTLEMEASAYNPIIQYLCDHGQTGKAETFFRQLLKKG- 535
           V+ +A+ +L+K++E++++         +   YN +I   C  G    A      +  +G 
Sbjct: 448 VH-KAMGVLNKMLERKVL--------PDVVTYNSLIDGQCRSGNFDSAYRLLSLMNDRGL 507

Query: 536 IQDEVAFNNLIRGHSKEGNPELAFEILKIMGRKSVSRDAESYKLLIKSYLSKGEPADAKT 595
           + D+  + ++I    K    E A ++   + +K V+ +   Y  LI  Y   G+  +A  
Sbjct: 508 VPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCKAGKVDEAHL 567

Query: 596 ALDSMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENMDLVAKILEALF 655
            L+ M+     P+S  F +++  L ADG+++ A+ +   M+  G+   +     ++  L 
Sbjct: 568 MLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVSTDTILIHRLL 627

Query: 656 MRGHVEEALGRVDLLMRCSCPPD---FDSLLSVLCEKGKTIAALKLLDFGLERECNIEVS 715
             G  + A  R   ++     PD   + + +   C +G+ + A  ++    E   + ++ 
Sbjct: 628 KDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMRENGVSPDLF 687

Query: 716 SYEKVLDALLAAGKTLNAYSILCKIMEKGGAKEWSSCDDLINSLNQEGHTKQ 738
           +Y  ++      G+T  A+ +L ++ + G      +   LI  L +  + KQ
Sbjct: 688 TYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLIKHLLEMKYGKQ 725

BLAST of Cp4.1LG18g05980 vs. TAIR 10
Match: AT1G12775.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 206.1 bits (523), Expect = 1.0e-52
Identity = 127/432 (29.40%), Postives = 216/432 (50.00%), Query Frame = 0

Query: 132 KNSDHALQFFRWVERSGLFQHDRGTHFKIIEILGRASKLNHARCILLDMPNKGVEWDEDL 191
           K SD  +   R VE    FQ +  T+  ++ ++ ++ +   A  +L  M  + ++ D   
Sbjct: 208 KVSDAVVLIDRMVETG--FQPNEVTYGPVLNVMCKSGQTALAMELLRKMEERNIKLDAVK 267

Query: 192 FVIMIDSYGKAGIVQEAVKIFQKMKELGVERSIKSYNVLFKVILRRGRYMMAKRYFNAML 251
           + I+ID   K G +  A  +F +M+  G +  I +YN L       GR+    +    M+
Sbjct: 268 YSIIIDGLCKDGSLDNAFNLFNEMEIKGFKADIITYNTLIGGFCNAGRWDDGAKLLRDMI 327

Query: 252 NEGIEPTCHTYNVMLWGFFLSLRLETAKRFYEDMKTRGIAPDVVTYNTMINGYYRFKMME 311
              I P   T++V++  F    +L  A +  ++M  RGIAP+ +TYN++I+G+ +   +E
Sbjct: 328 KRKISPNVVTFSVLIDSFVKEGKLREADQLLKEMMQRGIAPNTITYNSLIDGFCKENRLE 387

Query: 312 EAEQFFTEMKGNNLLPTVISYTTMIKGYVSSGRVDDGLRLFEEMKAVGVKPNDFTYSTLL 371
           EA Q    M      P ++++  +I GY  + R+DDGL LF EM   GV  N  TY+TL+
Sbjct: 388 EAIQMVDLMISKGCDPDIMTFNILINGYCKANRIDDGLELFREMSLRGVIANTVTYNTLV 447

Query: 372 PGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSI 431
            G C + K+  A+++  EMV + + P D   +  LL   C +G+L+ A+ +   + +  +
Sbjct: 448 QGFCQSGKLEVAKKLFQEMVSRRVRP-DIVSYKILLDGLCDNGELEKALEIFGKIEKSKM 507

Query: 432 PTEAGHYGILIENCCKAGVYDRAVKLLDKLVEKEIILKPQSTLEMEASAYNPIIQYLCDH 491
             + G Y I+I   C A   D A  L   L        P   ++++A AYN +I  LC  
Sbjct: 508 ELDIGIYMIIIHGMCNASKVDDAWDLFCSL--------PLKGVKLDARAYNIMISELCRK 567

Query: 492 GQTGKAETFFRQLLKKG-IQDEVAFNNLIRGHSKEGNPELAFEILKIMGRKSVSRDAESY 551
               KA+  FR++ ++G   DE+ +N LIR H  + +   A E+++ M       D  + 
Sbjct: 568 DSLSKADILFRKMTEEGHAPDELTYNILIRAHLGDDDATTAAELIEEMKSSGFPADVSTV 627

Query: 552 KLLIKSYLSKGE 563
           K++I + LS GE
Sbjct: 628 KMVI-NMLSSGE 627

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9ZUU32.9e-29467.64Pentatricopeptide repeat-containing protein At2g37230 OS=Arabidopsis thaliana OX... [more]
O819082.2e-10033.33Pentatricopeptide repeat-containing protein At1g02060, chloroplastic OS=Arabidop... [more]
Q9LSL94.9e-5222.19Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX... [more]
Q9LPX21.4e-5129.40Pentatricopeptide repeat-containing protein At1g12775, mitochondrial OS=Arabidop... [more]
Q6NQ831.5e-4828.82Pentatricopeptide repeat-containing protein At3g22470, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
XP_023515807.10.0100.00pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita pepo subsp... [more]
KAG6589742.10.098.29Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022921867.10.098.16pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita moschata] ... [more]
XP_022988550.10.095.27pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita maxima][more]
XP_038880029.10.089.61pentatricopeptide repeat-containing protein At2g37230 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A0A6J1E1L00.098.16pentatricopeptide repeat-containing protein At2g37230-like OS=Cucurbita moschata... [more]
A0A6J1JJW00.095.27pentatricopeptide repeat-containing protein At2g37230-like OS=Cucurbita maxima O... [more]
A0A6J1C4310.088.49pentatricopeptide repeat-containing protein At2g37230 OS=Momordica charantia OX=... [more]
A0A5A7T0L70.086.58Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3B8Y60.086.85pentatricopeptide repeat-containing protein At2g37230 OS=Cucumis melo OX=3656 GN... [more]
Match NameE-valueIdentityDescription
AT2G37230.12.1e-29567.64Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G02060.11.6e-10133.33Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G30290.12.1e-5823.88Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G65560.13.5e-5322.19Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G12775.11.0e-5229.40Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 405..429
e-value: 0.079
score: 13.2
coord: 480..509
e-value: 0.081
score: 13.2
coord: 438..465
e-value: 0.0079
score: 16.4
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 194..232
e-value: 2.6E-7
score: 30.7
coord: 327..376
e-value: 1.0E-16
score: 60.8
coord: 260..305
e-value: 6.0E-13
score: 48.8
coord: 511..557
e-value: 8.3E-8
score: 32.3
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 480..509
e-value: 3.8E-4
score: 18.4
coord: 295..328
e-value: 2.8E-8
score: 31.4
coord: 365..393
e-value: 0.0016
score: 16.5
coord: 513..546
e-value: 6.2E-4
score: 17.8
coord: 260..294
e-value: 4.0E-4
score: 18.4
coord: 549..582
e-value: 6.0E-5
score: 21.0
coord: 438..466
e-value: 9.0E-4
score: 17.3
coord: 226..259
e-value: 1.7E-4
score: 19.5
coord: 192..222
e-value: 2.9E-6
score: 25.1
coord: 330..363
e-value: 3.2E-10
score: 37.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 434..468
score: 8.856788
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 258..292
score: 10.040608
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 328..362
score: 13.383805
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 293..327
score: 13.08785
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 511..545
score: 9.898111
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 546..580
score: 9.996763
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 188..222
score: 10.906551
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 223..257
score: 9.985802
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 363..397
score: 10.687325
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 399..433
score: 8.681407
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 126..272
e-value: 9.6E-26
score: 92.9
coord: 475..628
e-value: 1.3E-25
score: 92.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 378..474
e-value: 4.0E-13
score: 51.1
coord: 649..754
e-value: 5.2E-6
score: 27.9
coord: 273..377
e-value: 2.6E-36
score: 126.7
NoneNo IPR availablePANTHERPTHR47932:SF41PENTATRICOPEPTIDE (PPR) REPEAT PROTEINcoord: 4..749
NoneNo IPR availablePANTHERPTHR47932ATPASE EXPRESSION PROTEIN 3coord: 4..749
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 194..359
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 379..580

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG18g05980.1Cp4.1LG18g05980.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding