Cp4.1LG01g22150 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG01g22150
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG01: 20338325 .. 20340280 (-)
RNA-Seq ExpressionCp4.1LG01g22150
SyntenyCp4.1LG01g22150
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGATATTCATGGCGTCTGCTTCGTTTGCGAGCTCATTTTCGGTGTCAGCTGCTCCTTTCATCTAACTCCTCGCATTTTCAGGTTCTTTCCAATCCGAATCTGCAATCACTTCGCTATTTTTCTTCACTGCTTCATAAGTATCCTGTTCACGACACGAGTATCGTTAATTTTAGATACAGAACGCCTCGAATGCTCTCGTACGAATCGGAGATTGGACAGAAGGACTCTGCTCACGCTGTTTTGTTTGATATTTTCTCTAAATGTCGGGATGTGGATGAAATTAGGAAAGGCTTAGAGTCGAGTGGTATTGTTATTAGTCATGATTTGGTGTTGGAGGTGTTGGGGAAGCTTGAGTCGAACCCTGATGAGGCTATCAGGTTTTTCGGTTGGGTTTCGGGGGATTATGGCGAGAAACTTAGCTCCAAGTCGTATAACTTGATGCTTGGAATCCTAGGAGTTAATGGCCGTGTTGAGGAGTTTTGGGATTTGAACTGTGATATGAAGAAAAAGGGTTATGGGATATCTAAAAGTGTACAGGATAAGGTATTGGAGAAGTTTGAGAAGGATGGATTGGAGAGTGAAGCTGAGAAGTTGAGAGACGTCTTTGCATCAGGATCTATTGACAAGTCTCCCGAGAAGATTGGTTCAATCGTTTGCAAACTTGTTAGGAAGAATGTGTGGGGAGATGATGTTGAGCAGCAATTGCGTGATATGAACATTTCATTTTCAAGTGATATGGTTAAGATGATATTGGAGAATCTTTGTACAGAACCAGCAAAAGCATTTATATTTTTCCGATGGATTGGTGAGAATGGGATGTTTAAGCATGATGAACAGACTTATAATGCCATGGCAAGGGTGTTAGGTAGGGAAGACAGTATTGATAGATTTTGGAAAGTAGTTGATGAAATGAGGAGCCACGGTTACGAAATCGAGGTGGAGACATTTTCTGAGGTGTTGAGACGATTTTGTAAAAGAAGAATGATTGAGGAAGCTGTAAACTTGTATGTGTTTGCAATGGCAGGAGGCAATAAGCCTTCGGTTGATTGTCTTACTTTTCTGTTAAAGAAAATAGCAGTTAGTAAGCACTTAGATCTTAGTCTGTTCTCAAGGGCATTGAAGATATTTACAGAGACAGGCAATGCATTGACGGATTCAATGGTTTTTGCAGTTCTCAAGTCTCTGTCTACTGTTGGTAGGATTGGAGAGTACAACGAGGTTTTAAATGCAATGAAGGAGTATGGATACGTATTTAGTGGTGGTTTGAAGAGAAAGGTAGCATACCAACTTAGTAGCACTGGAAAAAGTGATGAAGCAAATGATTTCGTGAATAGCTTAGAAGCTTCTGGCTGTAATTCAGACAACAAGACCTGGGCAGCTCTGATTGAAGGTTATTGTGTTGCTGGAGATCTTGCTAAGGCTTCTGATTGCATCCACAAAATGGTTGAAAAAGGTGTGGACTGTTGTGCTGGATATACTTTGGATTTAGTGGTCAATGCTTACTGTCAAAAGAAACGCGAAACTGATGCTAGCCGTCTTTTCTGTGATCTCGTTGATGAAAAGCAGCTAAAACCATGGCATTCTACATATAAAGCATTGATAAACAAGCTATTGGTTCGAGGGGAATTCAGAGAAGCTTTGAAATTGTTGGGGATGATGAGAAATCATGAATTCCCACCATTTATTGACCCATTTATTTTGTATGTATCAAAGTCTGGAACAGCTGATGATGCCATCGGCTTCCTGAAGGCCATGACATCGAAGAGTTTTCCTTCTACGACAGTGTTCCTCCATTTGTTTGAAGCATTTTTCCAAGCTGGAAGGCACGGAGATGCTCAAGACTTCCTTTCAAAATGTCCAGGTTACATTCGTAACCATGCTGATGTTCTGGAGCTTTTTAATTCTATGAAGCATGTAGAAGCTGCTCCTCCTCCCCCAAATCTGGCTTCTTAG

mRNA sequence

ATGAGATATTCATGGCGTCTGCTTCGTTTGCGAGCTCATTTTCGGTGTCAGCTGCTCCTTTCATCTAACTCCTCGCATTTTCAGGTTCTTTCCAATCCGAATCTGCAATCACTTCGCTATTTTTCTTCACTGCTTCATAAGTATCCTGTTCACGACACGAGTATCGTTAATTTTAGATACAGAACGCCTCGAATGCTCTCGTACGAATCGGAGATTGGACAGAAGGACTCTGCTCACGCTGTTTTGTTTGATATTTTCTCTAAATGTCGGGATGTGGATGAAATTAGGAAAGGCTTAGAGTCGAGTGGTATTGTTATTAGTCATGATTTGGTGTTGGAGGTGTTGGGGAAGCTTGAGTCGAACCCTGATGAGGCTATCAGGTTTTTCGGTTGGGTTTCGGGGGATTATGGCGAGAAACTTAGCTCCAAGTCGTATAACTTGATGCTTGGAATCCTAGGAGTTAATGGCCGTGTTGAGGAGTTTTGGGATTTGAACTGTGATATGAAGAAAAAGGGTTATGGGATATCTAAAAGTGTACAGGATAAGGTATTGGAGAAGTTTGAGAAGGATGGATTGGAGAGTGAAGCTGAGAAGTTGAGAGACGTCTTTGCATCAGGATCTATTGACAAGTCTCCCGAGAAGATTGGTTCAATCGTTTGCAAACTTGTTAGGAAGAATGTGTGGGGAGATGATGTTGAGCAGCAATTGCGTGATATGAACATTTCATTTTCAAGTGATATGGTTAAGATGATATTGGAGAATCTTTGTACAGAACCAGCAAAAGCATTTATATTTTTCCGATGGATTGGTGAGAATGGGATGTTTAAGCATGATGAACAGACTTATAATGCCATGGCAAGGGTGTTAGGTAGGGAAGACAGTATTGATAGATTTTGGAAAGTAGTTGATGAAATGAGGAGCCACGGTTACGAAATCGAGGTGGAGACATTTTCTGAGGTGTTGAGACGATTTTGTAAAAGAAGAATGATTGAGGAAGCTGTAAACTTGTATGTGTTTGCAATGGCAGGAGGCAATAAGCCTTCGGTTGATTGTCTTACTTTTCTGTTAAAGAAAATAGCAGTTAGTAAGCACTTAGATCTTAGTCTGTTCTCAAGGGCATTGAAGATATTTACAGAGACAGGCAATGCATTGACGGATTCAATGGTTTTTGCAGTTCTCAAGTCTCTGTCTACTGTTGGTAGGATTGGAGAGTACAACGAGGTTTTAAATGCAATGAAGGAGTATGGATACGTATTTAGTGGTGGTTTGAAGAGAAAGGTAGCATACCAACTTAGTAGCACTGGAAAAAGTGATGAAGCAAATGATTTCGTGAATAGCTTAGAAGCTTCTGGCTGTAATTCAGACAACAAGACCTGGGCAGCTCTGATTGAAGGTTATTGTGTTGCTGGAGATCTTGCTAAGGCTTCTGATTGCATCCACAAAATGGTTGAAAAAGGTGTGGACTGTTGTGCTGGATATACTTTGGATTTAGTGGTCAATGCTTACTGTCAAAAGAAACGCGAAACTGATGCTAGCCGTCTTTTCTGTGATCTCGTTGATGAAAAGCAGCTAAAACCATGGCATTCTACATATAAAGCATTGATAAACAAGCTATTGGTTCGAGGGGAATTCAGAGAAGCTTTGAAATTGTTGGGGATGATGAGAAATCATGAATTCCCACCATTTATTGACCCATTTATTTTGTATGTATCAAAGTCTGGAACAGCTGATGATGCCATCGGCTTCCTGAAGGCCATGACATCGAAGAGTTTTCCTTCTACGACAGTGTTCCTCCATTTGTTTGAAGCATTTTTCCAAGCTGGAAGGCACGGAGATGCTCAAGACTTCCTTTCAAAATGTCCAGGTTACATTCGTAACCATGCTGATGTTCTGGAGCTTTTTAATTCTATGAAGCATGTAGAAGCTGCTCCTCCTCCCCCAAATCTGGCTTCTTAG

Coding sequence (CDS)

ATGAGATATTCATGGCGTCTGCTTCGTTTGCGAGCTCATTTTCGGTGTCAGCTGCTCCTTTCATCTAACTCCTCGCATTTTCAGGTTCTTTCCAATCCGAATCTGCAATCACTTCGCTATTTTTCTTCACTGCTTCATAAGTATCCTGTTCACGACACGAGTATCGTTAATTTTAGATACAGAACGCCTCGAATGCTCTCGTACGAATCGGAGATTGGACAGAAGGACTCTGCTCACGCTGTTTTGTTTGATATTTTCTCTAAATGTCGGGATGTGGATGAAATTAGGAAAGGCTTAGAGTCGAGTGGTATTGTTATTAGTCATGATTTGGTGTTGGAGGTGTTGGGGAAGCTTGAGTCGAACCCTGATGAGGCTATCAGGTTTTTCGGTTGGGTTTCGGGGGATTATGGCGAGAAACTTAGCTCCAAGTCGTATAACTTGATGCTTGGAATCCTAGGAGTTAATGGCCGTGTTGAGGAGTTTTGGGATTTGAACTGTGATATGAAGAAAAAGGGTTATGGGATATCTAAAAGTGTACAGGATAAGGTATTGGAGAAGTTTGAGAAGGATGGATTGGAGAGTGAAGCTGAGAAGTTGAGAGACGTCTTTGCATCAGGATCTATTGACAAGTCTCCCGAGAAGATTGGTTCAATCGTTTGCAAACTTGTTAGGAAGAATGTGTGGGGAGATGATGTTGAGCAGCAATTGCGTGATATGAACATTTCATTTTCAAGTGATATGGTTAAGATGATATTGGAGAATCTTTGTACAGAACCAGCAAAAGCATTTATATTTTTCCGATGGATTGGTGAGAATGGGATGTTTAAGCATGATGAACAGACTTATAATGCCATGGCAAGGGTGTTAGGTAGGGAAGACAGTATTGATAGATTTTGGAAAGTAGTTGATGAAATGAGGAGCCACGGTTACGAAATCGAGGTGGAGACATTTTCTGAGGTGTTGAGACGATTTTGTAAAAGAAGAATGATTGAGGAAGCTGTAAACTTGTATGTGTTTGCAATGGCAGGAGGCAATAAGCCTTCGGTTGATTGTCTTACTTTTCTGTTAAAGAAAATAGCAGTTAGTAAGCACTTAGATCTTAGTCTGTTCTCAAGGGCATTGAAGATATTTACAGAGACAGGCAATGCATTGACGGATTCAATGGTTTTTGCAGTTCTCAAGTCTCTGTCTACTGTTGGTAGGATTGGAGAGTACAACGAGGTTTTAAATGCAATGAAGGAGTATGGATACGTATTTAGTGGTGGTTTGAAGAGAAAGGTAGCATACCAACTTAGTAGCACTGGAAAAAGTGATGAAGCAAATGATTTCGTGAATAGCTTAGAAGCTTCTGGCTGTAATTCAGACAACAAGACCTGGGCAGCTCTGATTGAAGGTTATTGTGTTGCTGGAGATCTTGCTAAGGCTTCTGATTGCATCCACAAAATGGTTGAAAAAGGTGTGGACTGTTGTGCTGGATATACTTTGGATTTAGTGGTCAATGCTTACTGTCAAAAGAAACGCGAAACTGATGCTAGCCGTCTTTTCTGTGATCTCGTTGATGAAAAGCAGCTAAAACCATGGCATTCTACATATAAAGCATTGATAAACAAGCTATTGGTTCGAGGGGAATTCAGAGAAGCTTTGAAATTGTTGGGGATGATGAGAAATCATGAATTCCCACCATTTATTGACCCATTTATTTTGTATGTATCAAAGTCTGGAACAGCTGATGATGCCATCGGCTTCCTGAAGGCCATGACATCGAAGAGTTTTCCTTCTACGACAGTGTTCCTCCATTTGTTTGAAGCATTTTTCCAAGCTGGAAGGCACGGAGATGCTCAAGACTTCCTTTCAAAATGTCCAGGTTACATTCGTAACCATGCTGATGTTCTGGAGCTTTTTAATTCTATGAAGCATGTAGAAGCTGCTCCTCCTCCCCCAAATCTGGCTTCTTAG

Protein sequence

MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDTSIVNFRYRTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLESNPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQDKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMNISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWKVVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIAVSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFSGGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHLFEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS
Homology
BLAST of Cp4.1LG01g22150 vs. ExPASy Swiss-Prot
Match: Q9M891 (Pentatricopeptide repeat-containing protein At3g02490, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g02490 PE=2 SV=1)

HSP 1 Score: 642.1 bits (1655), Expect = 6.7e-183
Identity = 342/652 (52.45%), Postives = 449/652 (68.87%), Query Frame = 0

Query: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLH-KYPVHDTSIVNFR 60
           MRY WR L  R++        S+ S FQV+SN    S R FSS LH ++ V     + F 
Sbjct: 1   MRYQWRSLLFRSYRSSPRPFLSHHSRFQVISN----STRSFSSFLHERFGVQQRQCL-FA 60

Query: 61  YRTP------RMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLE 120
            R+P      R  S ES I +K  A  V+ D+FS+    DEI K L+S+ +VISH+L L 
Sbjct: 61  LRSPLASSVSRRFSSESAIEEKLPAETVVIDVFSRLNGKDEITKELDSNDVVISHELALR 120

Query: 121 VLGKLESNPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGY 180
           VL +LES+PD A RFF W    Y +KLSSKSYN ML I GVNG V+EFW L  DMKKKG+
Sbjct: 121 VLRELESSPDVAGRFFKWGLEAYPQKLSSKSYNTMLRIFGVNGLVDEFWRLVDDMKKKGH 180

Query: 181 GISKSVQDKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVE 240
           G+S +V+D+V +KF+KDGLE++ E+L+++FASGS+D S +K+ + VCK+V K VWG DVE
Sbjct: 181 GVSANVRDRVGDKFKKDGLENDLERLKELFASGSMDNSVDKVCNRVCKIVMKEVWGADVE 240

Query: 241 QQLRDMNISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGRED 300
           +QLRD+ + F SD+VKM+LE L  +P KA +FFRWI E+G FKHDE+TYNAMARVLG+E 
Sbjct: 241 KQLRDLKLEFKSDVVKMVLEKLDVDPRKALLFFRWIDESGSFKHDEKTYNAMARVLGKEK 300

Query: 301 SIDRFWKVVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAG--GNKPSVDC 360
            +DRF  +++E+RS GYE+E+ET+  V  RFC+ +MI+EAV L+ FAMAG   N P+  C
Sbjct: 301 FLDRFQHMIEEIRSAGYEMEMETYVRVSARFCQTKMIKEAVELFEFAMAGSISNTPTPHC 360

Query: 361 LTFLLKKIAVSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNA 420
            + LLKKI  +K LD+ LF+R LK +T  GN + D M+  VLKSL +V R G+ NEVL A
Sbjct: 361 CSLLLKKIVTAKKLDMDLFTRTLKAYTGNGNVVPDVMLQHVLKSLRSVDRFGQSNEVLKA 420

Query: 421 MKEYGYVFSGGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGD 480
           M E GYV SG L+  +A  LS  GK DEAN+ VN +EASG + D+K  A+L+EG+C A D
Sbjct: 421 MNEGGYVPSGDLQSVIASGLSRKGKKDEANELVNFMEASGNHLDDKAMASLVEGHCDAKD 480

Query: 481 LAKASDCIHKMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTY 540
           L +AS+C  KM+ K     AGY  + +V AYC   +  D  +LF +LV + QLKPWHSTY
Sbjct: 481 LEEASECFKKMIGKEGVSYAGYAFEKLVLAYCNSFQARDVYKLFSELVKQNQLKPWHSTY 540

Query: 541 KALINKLLVR-----GEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAM 600
           K ++  LL++     G F EAL LL MMRNH FPPF+DPF+ Y+S SGT+ +A  FLKA+
Sbjct: 541 KIMVRNLLMKKVARDGGFEEALSLLPMMRNHGFPPFVDPFMDYLSNSGTSAEAFAFLKAV 600

Query: 601 TSKSFPSTTVFLHLFEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMK 639
           TSK FPS ++ L +FEA  ++ RH +AQD LS  P YIR +A+VLELFN+MK
Sbjct: 601 TSKKFPSNSMVLRVFEAMLKSARHSEAQDLLSMSPSYIRRNAEVLELFNTMK 647

BLAST of Cp4.1LG01g22150 vs. ExPASy Swiss-Prot
Match: Q8LPF1 (Pentatricopeptide repeat-containing protein At5g15980, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g15980 PE=2 SV=1)

HSP 1 Score: 621.7 bits (1602), Expect = 9.4e-177
Identity = 337/669 (50.37%), Postives = 448/669 (66.97%), Query Frame = 0

Query: 1   MRY-SWRLLRLRAHFR--------CQLLLSSNSSHFQVLSNPNLQSLRYFSSLLH-KYPV 60
           MRY  WRL+ LR++ R        C  + S +S  F    +P + +L+    L   + P+
Sbjct: 1   MRYQQWRLMLLRSYHRSHLPYLSPCSQVTSISSRSFSSFIHPGIGALQQSEQLCPLRSPM 60

Query: 61  HDTSIVNFRYRTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDL 120
             TS  N      R  S E  + +K SA A + DIFS+    DEIRK LESSG+VIS DL
Sbjct: 61  --TSSGNLVKSVGRSFSSEPAVEEKSSAEATVIDIFSRLSGEDEIRKELESSGVVISQDL 120

Query: 121 VLEVLGKLESNPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKK 180
            L+VL KLESNPD A  FF W+     E+LSSK+YN+ML ILG NG V+EFW L   MKK
Sbjct: 121 ALKVLRKLESNPDVAKSFFQWIKEASPEELSSKNYNMMLRILGGNGLVDEFWGLVDVMKK 180

Query: 181 KGYGISKSVQDKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGD 240
           KG+G+S +V+DKV +KF+KDGLES+  +LR +F S  +D S E +   VCK+V K  WGD
Sbjct: 181 KGHGLSANVRDKVGDKFQKDGLESDLLRLRKLFTSDCLDNSAENVCDRVCKIVMKEEWGD 240

Query: 241 DVEQQLRDMNISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLG 300
           DVE+++RD+N+ F SD+VKMI+E L  EP KA +FFRWI E+ +FKHDE+TYNAMARVLG
Sbjct: 241 DVEKRVRDLNVEFKSDLVKMIVERLDVEPRKALLFFRWIDESDLFKHDEKTYNAMARVLG 300

Query: 301 REDSIDRFWKVVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAG---GNKP 360
           +E  +DRF  +V EMRS GYE+E+ET+  V  RFC+ ++I+EAV+L+  AMAG    N P
Sbjct: 301 KEKFLDRFQNIVVEMRSAGYEVEIETYVRVSTRFCQTKLIKEAVDLFEIAMAGSSSSNNP 360

Query: 361 SVDCLTFLLKKIAVSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNE 420
           +  C   LLKKI  +K LD+ LFSRA+K++T+ GNALTDS++ +VLKSL +V R+ + NE
Sbjct: 361 TPHCFCLLLKKIVTAKILDMDLFSRAVKVYTKNGNALTDSLLKSVLKSLRSVDRVEQSNE 420

Query: 421 VLNAMKEYGYVFSGGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYC 480
           +L  MK  GYV SG ++  +A  LS  GK DEA++FV+ +E+SG N D+K  A+L+EGYC
Sbjct: 421 LLKEMKRGGYVPSGDMQSMIASSLSRKGKKDEADEFVDFMESSGNNLDDKAMASLVEGYC 480

Query: 481 VAGDLAKASDCIHKMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPW 540
            +G+L +A  C  KMV       A Y+ + +V AYC K +  DA +L    V + QLKP 
Sbjct: 481 DSGNLDEALVCFEKMVGNTGVSYADYSFEKLVLAYCNKNQVRDAYKLLSAQVTKNQLKPR 540

Query: 541 HSTYKALINKLLVR-----GEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGF 600
           HSTYK+L+  LL +     G F EAL LL +M++H FPPFIDPF+ Y S +G + +A+GF
Sbjct: 541 HSTYKSLVTNLLTKKIARDGGFEEALSLLPIMKDHGFPPFIDPFMSYFSSTGKSTEALGF 600

Query: 601 LKAMTSKSFPSTTVFLHLFEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEA 652
           LKAMTS +FP  +V L +FE   ++ RH +AQD LS CP YIRN+ DVLELFN+MK  E+
Sbjct: 601 LKAMTSNNFPYISVVLRVFETMMKSARHSEAQDLLSLCPNYIRNNPDVLELFNTMKPNES 660

BLAST of Cp4.1LG01g22150 vs. ExPASy Swiss-Prot
Match: Q9STK5 (Pentatricopeptide repeat-containing protein At3g48250, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At3g48250 PE=2 SV=1)

HSP 1 Score: 336.3 bits (861), Expect = 7.9e-91
Identity = 182/546 (33.33%), Postives = 300/546 (54.95%), Query Frame = 0

Query: 94  EIRKGLESSGIVISHDLVLEVLGKLESNPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILG 153
           E+ +GL    + ++H+  + VL KLE  P++A  F  WV  D G   S+  Y++ML IL 
Sbjct: 75  EVEEGLRKPDMSLTHETAIYVLRKLEKYPEKAYYFLDWVLRDSGLSPSTPLYSIMLRILV 134

Query: 154 VNGRVEEFWDLNCDMKKKGYGISKSVQDKVLEKFEKDGLESEAEKLRDVFASGSIDKSPE 213
               ++ FW    +MK+ G+ + +     +  +  K+  +++A  +   +     + +  
Sbjct: 135 QQRSMKRFWMTLREMKQGGFYLDEDTYKTIYGELSKEKSKADAVAVAHFYERMLKENAMS 194

Query: 214 KIGSIVCKLVRKNVWGDDVEQQLRDMNISFSSDMVKMILENLCTEPAKAFIFFRWIGENG 273
            +   V  +V K  W  +VE++L++M +  S + V  +L+ L   P KA  FF W+G  G
Sbjct: 195 VVAGEVSAVVTKGDWSCEVERELQEMKLVLSDNFVIRVLKELREHPLKALAFFHWVGGGG 254

Query: 274 M---FKHDEQTYNAMARVLGREDSIDRFWKVVDEMRSHGYEIEVETFSEVLRRFCKRRMI 333
               ++H   TYNA  RVL R +S+  FW VVDEM++ GY+++++T+ +V R+F K RM+
Sbjct: 255 SSSGYQHSTVTYNAALRVLARPNSVAEFWSVVDEMKTAGYDMDLDTYIKVSRQFQKSRMM 314

Query: 334 EEAVNLYVFAMAGGNKPSVDCLTFLLKKIAVSKHLDLSLFSRALKIFTETGNALTDSMVF 393
            E V LY + M G  KPS+   + LL+ ++ S + DL L  R  + +  TG +L+ ++  
Sbjct: 315 AETVKLYEYMMDGPFKPSIQDCSLLLRYLSGSPNPDLDLVFRVSRKYESTGKSLSKAVYD 374

Query: 394 AVLKSLSTVGRIGEYNEVLNAMKEYGYVFSGGLKRKVAYQLSSTGKSDEANDFVNSLEAS 453
            + +SL++VGR  E  E+  AM+  GY        ++ + L    + +EA   ++ +EA 
Sbjct: 375 GIHRSLTSVGRFDEAEEITKAMRNAGYEPDNITYSQLVFGLCKAKRLEEARGVLDQMEAQ 434

Query: 454 GCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVEKGVDCCAGYTLDLVVNAYCQKKRETD 513
           GC  D KTW  LI+G+C   +L KA  C   M+EKG D  +   LD++++ +    +   
Sbjct: 435 GCFPDIKTWTILIQGHCKNNELDKALACFANMLEKGFDIDSN-LLDVLIDGFVIHNKFEG 494

Query: 514 ASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEFREALKLLGMMRNHEFPPFIDPFILYV 573
           AS    ++V    +KPW STYK LI+KLL   +  EAL LL MM+   +P + + F  Y+
Sbjct: 495 ASIFLMEMVKNANVKPWQSTYKLLIDKLLKIKKSEEALDLLQMMKKQNYPAYAEAFDGYL 554

Query: 574 SKSGTADDAIGFLKAMTSKSFPSTTVFLHLFEAFFQAGRHGDAQDFLSKCPGYIRNHADV 633
           +K GT +DA  FL  ++SK  PS   + H+ EAF++ GR  DA++ L  CP + + H  +
Sbjct: 555 AKFGTLEDAKKFLDVLSSKDSPSFAAYFHVIEAFYREGRLTDAKNLLFICPHHFKTHPKI 614

Query: 634 LELFNS 637
            ELF +
Sbjct: 615 SELFGA 619

BLAST of Cp4.1LG01g22150 vs. ExPASy Swiss-Prot
Match: Q9FMF6 (Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g64320 PE=2 SV=1)

HSP 1 Score: 97.8 bits (242), Expect = 4.7e-19
Identity = 123/529 (23.25%), Postives = 212/529 (40.08%), Query Frame = 0

Query: 71  EIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLESNPDEAIRFFG 130
           EIG  DSA+   ++   K  D+D +R               L  L +L  N   ++  F 
Sbjct: 47  EIGGTDSANE--WEKLLKPFDLDSLRNSFHK-----ITPFQLYKLLELPLNVSTSMELFS 106

Query: 131 WVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQDKVLEKFEKD 190
           W     G + S   Y +++G LG NG  +    L   MK +G    +S+   ++  ++K 
Sbjct: 107 WTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQMKDEGIVFKESLFISIMRDYDKA 166

Query: 191 GLESEAEKL----RDVFASGSIDKSPEKIGSIV----CKLVRKNVWGDDVEQQLRDMNIS 250
           G   +  +L    R+V++     KS   +  I+    C  V  NV+ D + +++     +
Sbjct: 167 GFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVSGNCHKVAANVFYDMLSRKIPPTLFT 226

Query: 251 FSSDMVKMILENLC--TEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK 310
           F      ++++  C   E   A    R + ++G    +   Y  +   L + + ++   +
Sbjct: 227 FG-----VVMKAFCAVNEIDSALSLLRDMTKHGCVP-NSVIYQTLIHSLSKCNRVNEALQ 286

Query: 311 VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 370
           +++EM   G   + ETF++V+   CK   I EA  +    +  G  P      +L+  + 
Sbjct: 287 LLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLIRGFAPDDITYGYLMNGLC 346

Query: 371 VSKHLDLS--LFSRALKIFTETGNALTDSMVF--------AVLKSLST----VGRIGEYN 430
               +D +  LF R  K      N L    V         AVL  + T    V  +  YN
Sbjct: 347 KIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPDVCTYN 406

Query: 431 EVLNAMKEYGYVFSG--GLKRKVAYQLSS-------------------TGKSDEANDFVN 490
            ++     YGY   G  GL  +V + + +                    GK DEA + +N
Sbjct: 407 SLI-----YGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLN 466

Query: 491 SLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVEKGVDCCAGYTLDLVVNAYCQK 550
            + A G   +   +  LI  +C    + +A +   +M  KG      YT + +++  C+ 
Sbjct: 467 EMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKPDV-YTFNSLISGLCEV 526

Query: 551 KRETDASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEFREALKLLGMM 555
                A  L  D++ E  +     TY  LIN  L RGE +EA KL+  M
Sbjct: 527 DEIKHALWLLRDMISEGVVAN-TVTYNTLINAFLRRGEIKEARKLVNEM 555

BLAST of Cp4.1LG01g22150 vs. ExPASy Swiss-Prot
Match: Q9LUR2 (Putative pentatricopeptide repeat-containing protein At3g16710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g16710 PE=3 SV=1)

HSP 1 Score: 87.4 bits (215), Expect = 6.4e-16
Identity = 86/384 (22.40%), Postives = 165/384 (42.97%), Query Frame = 0

Query: 255 LCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWKVVDEMRSHGYEIEV 314
           L ++P +A  F   + + G F+ D  T+ ++       + I+    + D++   G++  V
Sbjct: 130 LSSQPCRASCFLGKMMKLG-FEPDLVTFTSLLNGYCHWNRIEDAIALFDQILGMGFKPNV 189

Query: 315 ETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA-VSKHLDLSLFSRA 374
            T++ ++R  CK R +  AV L+      G++P+V     L+  +  + +  D +   R 
Sbjct: 190 VTYTTLIRCLCKNRHLNHAVELFNQMGTNGSRPNVVTYNALVTGLCEIGRWGDAAWLLRD 249

Query: 375 LKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKE---YGYVFSGGLKRKVAYQ 434
           +       N +T +   A++ +   VG++ E  E+ N M +   Y  VF+ G    +   
Sbjct: 250 MMKRRIEPNVITFT---ALIDAFVKVGKLMEAKELYNVMIQMSVYPDVFTYG---SLING 309

Query: 435 LSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVEKGVDCC 494
           L   G  DEA      +E +GC  +   +  LI G+C +  +       ++M +KGV   
Sbjct: 310 LCMYGLLDEARQMFYLMERNGCYPNEVIYTTLIHGFCKSKRVEDGMKIFYEMSQKGV-VA 369

Query: 495 AGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEFREALKL 554
              T  +++  YC   R   A  +F + +  ++  P   TY  L++ L   G+  +AL +
Sbjct: 370 NTITYTVLIQGYCLVGRPDVAQEVF-NQMSSRRAPPDIRTYNVLLDGLCCNGKVEKALMI 429

Query: 555 LGMMRNHEFPPFIDPFILYVS---KSGTADDAIGFLKAMTSKSF-PSTTVFLHLFEAFFQ 614
              MR  E    I  + + +    K G  +DA     ++ SK   P+   +  +   F +
Sbjct: 430 FEYMRKREMDINIVTYTIIIQGMCKLGKVEDAFDLFCSLFSKGMKPNVITYTTMISGFCR 489

Query: 615 AGRHGDAQDFLSKC--PGYIRNHA 629
            G   +A     K    G++ N +
Sbjct: 490 RGLIHEADSLFKKMKEDGFLPNES 504

BLAST of Cp4.1LG01g22150 vs. NCBI nr
Match: XP_023511958.1 (pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1300 bits (3363), Expect = 0.0
Identity = 651/651 (100.00%), Postives = 651/651 (100.00%), Query Frame = 0

Query: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDTSIVNFRY 60
           MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDTSIVNFRY
Sbjct: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDTSIVNFRY 60

Query: 61  RTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLES 120
           RTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLES
Sbjct: 61  RTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLES 120

Query: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180
           NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ
Sbjct: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180

Query: 181 DKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240
           DKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN
Sbjct: 181 DKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240

Query: 241 ISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK 300
           ISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK
Sbjct: 241 ISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK 300

Query: 301 VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360
           VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA
Sbjct: 301 VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360

Query: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420
           VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS
Sbjct: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420

Query: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480
           GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH
Sbjct: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480

Query: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV 540
           KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV
Sbjct: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV 540

Query: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600
           RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL
Sbjct: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600

Query: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS 651
           FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS
Sbjct: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS 651

BLAST of Cp4.1LG01g22150 vs. NCBI nr
Match: KAG6602554.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1273 bits (3294), Expect = 0.0
Identity = 638/651 (98.00%), Postives = 644/651 (98.92%), Query Frame = 0

Query: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDTSIVNFRY 60
           MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKY VH TSIVN RY
Sbjct: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYSVHGTSIVNSRY 60

Query: 61  RTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLES 120
           RTPRMLSYE EIGQKDSAHAV+FDIFSKCRDVDEIRKGLESSG+VISHDLVLEVLGKLES
Sbjct: 61  RTPRMLSYEPEIGQKDSAHAVVFDIFSKCRDVDEIRKGLESSGVVISHDLVLEVLGKLES 120

Query: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180
           NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ
Sbjct: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180

Query: 181 DKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240
           +KVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN
Sbjct: 181 NKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240

Query: 241 ISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK 300
           +SFSSDMVKMILENLCTEPAKAFIFFRWIGE+GMFKHDEQTYNAMARVLG EDSIDRFWK
Sbjct: 241 VSFSSDMVKMILENLCTEPAKAFIFFRWIGESGMFKHDEQTYNAMARVLGSEDSIDRFWK 300

Query: 301 VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360
           VVDEMRSHGYE+EVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA
Sbjct: 301 VVDEMRSHGYEMEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360

Query: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420
           VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS
Sbjct: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420

Query: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480
           GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH
Sbjct: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480

Query: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV 540
           KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPW STYKALINKLLV
Sbjct: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWDSTYKALINKLLV 540

Query: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600
           RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL
Sbjct: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600

Query: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS 651
           FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAA PPPNLAS
Sbjct: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAAPPPNLAS 651

BLAST of Cp4.1LG01g22150 vs. NCBI nr
Match: XP_022921370.1 (pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 1273 bits (3293), Expect = 0.0
Identity = 638/651 (98.00%), Postives = 644/651 (98.92%), Query Frame = 0

Query: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDTSIVNFRY 60
           MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLL KYPVHDTSIVN RY
Sbjct: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLRKYPVHDTSIVNSRY 60

Query: 61  RTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLES 120
            TPRMLS E EIGQKDSAHAV+FDIFSKCRDVDEIRKGLESSG+VISHDLVLEVLGKLES
Sbjct: 61  STPRMLSSEPEIGQKDSAHAVVFDIFSKCRDVDEIRKGLESSGVVISHDLVLEVLGKLES 120

Query: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180
           NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ
Sbjct: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180

Query: 181 DKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240
           +KVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN
Sbjct: 181 NKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240

Query: 241 ISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK 300
           IS SSDMVKMILENLCTEPAKAFIFFRWIGE+GMFKHDEQTYNAMARVLGREDSIDRFWK
Sbjct: 241 ISLSSDMVKMILENLCTEPAKAFIFFRWIGESGMFKHDEQTYNAMARVLGREDSIDRFWK 300

Query: 301 VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360
           VVDEMRSHGYE+EVETFSEVLRRFCKR+MIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA
Sbjct: 301 VVDEMRSHGYEMEVETFSEVLRRFCKRKMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360

Query: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420
           VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS
Sbjct: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420

Query: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480
           GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH
Sbjct: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480

Query: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV 540
           KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV
Sbjct: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV 540

Query: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600
           RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL
Sbjct: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600

Query: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS 651
           FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAA PPPNLAS
Sbjct: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAAPPPNLAS 651

BLAST of Cp4.1LG01g22150 vs. NCBI nr
Match: KAG7033234.1 (Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1271 bits (3290), Expect = 0.0
Identity = 637/651 (97.85%), Postives = 644/651 (98.92%), Query Frame = 0

Query: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDTSIVNFRY 60
           MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKY VH TSIVN RY
Sbjct: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYSVHGTSIVNSRY 60

Query: 61  RTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLES 120
           RTPRMLSYE EIGQKDSAHAV+FDIFSKCRDVDEIRKGLESSG+VISHDLVLEVLGKLES
Sbjct: 61  RTPRMLSYEPEIGQKDSAHAVVFDIFSKCRDVDEIRKGLESSGVVISHDLVLEVLGKLES 120

Query: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180
           NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ
Sbjct: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180

Query: 181 DKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240
           +KVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN
Sbjct: 181 NKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240

Query: 241 ISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK 300
           +SFSSDMVKMILENLCTEPAKAFIFFRWIGE+GMFKHDEQTYNAMARVLG EDSI+RFWK
Sbjct: 241 VSFSSDMVKMILENLCTEPAKAFIFFRWIGESGMFKHDEQTYNAMARVLGSEDSIERFWK 300

Query: 301 VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360
           VVDEMRSHGYE+EVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA
Sbjct: 301 VVDEMRSHGYEMEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360

Query: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420
           VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS
Sbjct: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420

Query: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480
           GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH
Sbjct: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480

Query: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV 540
           KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPW STYKALINKLLV
Sbjct: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWDSTYKALINKLLV 540

Query: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600
           RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL
Sbjct: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600

Query: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS 651
           FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAA PPPNLAS
Sbjct: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAAPPPNLAS 651

BLAST of Cp4.1LG01g22150 vs. NCBI nr
Match: XP_022990729.1 (pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1235 bits (3195), Expect = 0.0
Identity = 619/651 (95.08%), Postives = 632/651 (97.08%), Query Frame = 0

Query: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDTSIVNFRY 60
           MRYSW LL LRAHFRCQLLLSSNSSHFQVLSNPNL SLRYFS LLHKYPVHDTSIVN RY
Sbjct: 1   MRYSWSLLGLRAHFRCQLLLSSNSSHFQVLSNPNLPSLRYFS-LLHKYPVHDTSIVNSRY 60

Query: 61  RTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLES 120
            TPRMLS E EIGQKDSAHAV+FDIFSKCRDVDEIRKGLESSG+VISHDLVLEVLGKLES
Sbjct: 61  CTPRMLSSEPEIGQKDSAHAVVFDIFSKCRDVDEIRKGLESSGVVISHDLVLEVLGKLES 120

Query: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180
           NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCD+KKKGYGISKSVQ
Sbjct: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDIKKKGYGISKSVQ 180

Query: 181 DKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240
           +KVLEKFEKDGLESEAE+LRDVFASGSIDKSPEK+GS+VCKLVRKNVWG+DVEQQLRDMN
Sbjct: 181 NKVLEKFEKDGLESEAERLRDVFASGSIDKSPEKMGSVVCKLVRKNVWGNDVEQQLRDMN 240

Query: 241 ISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK 300
           +SFSSDMVKMILENLCTEPAKAFIFFRWIGE+GMFKHDEQTYN MARVLGREDSIDRFWK
Sbjct: 241 VSFSSDMVKMILENLCTEPAKAFIFFRWIGESGMFKHDEQTYNGMARVLGREDSIDRFWK 300

Query: 301 VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360
           VVDEMRSHGYE+EVETFSEVLRRFC RRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKI 
Sbjct: 301 VVDEMRSHGYEMEVETFSEVLRRFCARRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIT 360

Query: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420
           VSKHLDLS+FSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS
Sbjct: 361 VSKHLDLSMFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420

Query: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480
           GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH
Sbjct: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480

Query: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV 540
           KM EKGVDCCAGY LDLVVNAYCQKKRETD SRL CDLVDEKQLKPWHSTYK LINKLLV
Sbjct: 481 KMFEKGVDCCAGYALDLVVNAYCQKKRETDGSRLLCDLVDEKQLKPWHSTYKELINKLLV 540

Query: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600
           RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL
Sbjct: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600

Query: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS 651
           FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNS + VEAA PPPNLAS
Sbjct: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSTRLVEAASPPPNLAS 650

BLAST of Cp4.1LG01g22150 vs. ExPASy TrEMBL
Match: A0A6J1E5F8 (pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111429656 PE=4 SV=1)

HSP 1 Score: 1273 bits (3293), Expect = 0.0
Identity = 638/651 (98.00%), Postives = 644/651 (98.92%), Query Frame = 0

Query: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDTSIVNFRY 60
           MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLL KYPVHDTSIVN RY
Sbjct: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLRKYPVHDTSIVNSRY 60

Query: 61  RTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLES 120
            TPRMLS E EIGQKDSAHAV+FDIFSKCRDVDEIRKGLESSG+VISHDLVLEVLGKLES
Sbjct: 61  STPRMLSSEPEIGQKDSAHAVVFDIFSKCRDVDEIRKGLESSGVVISHDLVLEVLGKLES 120

Query: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180
           NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ
Sbjct: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180

Query: 181 DKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240
           +KVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN
Sbjct: 181 NKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240

Query: 241 ISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK 300
           IS SSDMVKMILENLCTEPAKAFIFFRWIGE+GMFKHDEQTYNAMARVLGREDSIDRFWK
Sbjct: 241 ISLSSDMVKMILENLCTEPAKAFIFFRWIGESGMFKHDEQTYNAMARVLGREDSIDRFWK 300

Query: 301 VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360
           VVDEMRSHGYE+EVETFSEVLRRFCKR+MIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA
Sbjct: 301 VVDEMRSHGYEMEVETFSEVLRRFCKRKMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360

Query: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420
           VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS
Sbjct: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420

Query: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480
           GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH
Sbjct: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480

Query: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV 540
           KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV
Sbjct: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV 540

Query: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600
           RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL
Sbjct: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600

Query: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS 651
           FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAA PPPNLAS
Sbjct: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAAPPPNLAS 651

BLAST of Cp4.1LG01g22150 vs. ExPASy TrEMBL
Match: A0A6J1JSU2 (pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111487528 PE=4 SV=1)

HSP 1 Score: 1235 bits (3195), Expect = 0.0
Identity = 619/651 (95.08%), Postives = 632/651 (97.08%), Query Frame = 0

Query: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDTSIVNFRY 60
           MRYSW LL LRAHFRCQLLLSSNSSHFQVLSNPNL SLRYFS LLHKYPVHDTSIVN RY
Sbjct: 1   MRYSWSLLGLRAHFRCQLLLSSNSSHFQVLSNPNLPSLRYFS-LLHKYPVHDTSIVNSRY 60

Query: 61  RTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLES 120
            TPRMLS E EIGQKDSAHAV+FDIFSKCRDVDEIRKGLESSG+VISHDLVLEVLGKLES
Sbjct: 61  CTPRMLSSEPEIGQKDSAHAVVFDIFSKCRDVDEIRKGLESSGVVISHDLVLEVLGKLES 120

Query: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ 180
           NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCD+KKKGYGISKSVQ
Sbjct: 121 NPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDIKKKGYGISKSVQ 180

Query: 181 DKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMN 240
           +KVLEKFEKDGLESEAE+LRDVFASGSIDKSPEK+GS+VCKLVRKNVWG+DVEQQLRDMN
Sbjct: 181 NKVLEKFEKDGLESEAERLRDVFASGSIDKSPEKMGSVVCKLVRKNVWGNDVEQQLRDMN 240

Query: 241 ISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK 300
           +SFSSDMVKMILENLCTEPAKAFIFFRWIGE+GMFKHDEQTYN MARVLGREDSIDRFWK
Sbjct: 241 VSFSSDMVKMILENLCTEPAKAFIFFRWIGESGMFKHDEQTYNGMARVLGREDSIDRFWK 300

Query: 301 VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 360
           VVDEMRSHGYE+EVETFSEVLRRFC RRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKI 
Sbjct: 301 VVDEMRSHGYEMEVETFSEVLRRFCARRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIT 360

Query: 361 VSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420
           VSKHLDLS+FSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS
Sbjct: 361 VSKHLDLSMFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFS 420

Query: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480
           GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH
Sbjct: 421 GGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIH 480

Query: 481 KMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLV 540
           KM EKGVDCCAGY LDLVVNAYCQKKRETD SRL CDLVDEKQLKPWHSTYK LINKLLV
Sbjct: 481 KMFEKGVDCCAGYALDLVVNAYCQKKRETDGSRLLCDLVDEKQLKPWHSTYKELINKLLV 540

Query: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600
           RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL
Sbjct: 541 RGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHL 600

Query: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS 651
           FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNS + VEAA PPPNLAS
Sbjct: 601 FEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSTRLVEAASPPPNLAS 650

BLAST of Cp4.1LG01g22150 vs. ExPASy TrEMBL
Match: A0A6J1E0A6 (pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111429656 PE=4 SV=1)

HSP 1 Score: 1155 bits (2989), Expect = 0.0
Identity = 577/587 (98.30%), Postives = 583/587 (99.32%), Query Frame = 0

Query: 65  MLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLESNPDE 124
           MLS E EIGQKDSAHAV+FDIFSKCRDVDEIRKGLESSG+VISHDLVLEVLGKLESNPDE
Sbjct: 1   MLSSEPEIGQKDSAHAVVFDIFSKCRDVDEIRKGLESSGVVISHDLVLEVLGKLESNPDE 60

Query: 125 AIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQDKVL 184
           AIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQ+KVL
Sbjct: 61  AIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQNKVL 120

Query: 185 EKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMNISFS 244
           EKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMNIS S
Sbjct: 121 EKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMNISLS 180

Query: 245 SDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWKVVDE 304
           SDMVKMILENLCTEPAKAFIFFRWIGE+GMFKHDEQTYNAMARVLGREDSIDRFWKVVDE
Sbjct: 181 SDMVKMILENLCTEPAKAFIFFRWIGESGMFKHDEQTYNAMARVLGREDSIDRFWKVVDE 240

Query: 305 MRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIAVSKH 364
           MRSHGYE+EVETFSEVLRRFCKR+MIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIAVSKH
Sbjct: 241 MRSHGYEMEVETFSEVLRRFCKRKMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIAVSKH 300

Query: 365 LDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFSGGLK 424
           LDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFSGGLK
Sbjct: 301 LDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFSGGLK 360

Query: 425 RKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVE 484
           RKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVE
Sbjct: 361 RKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVE 420

Query: 485 KGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEF 544
           KGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEF
Sbjct: 421 KGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEF 480

Query: 545 REALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHLFEAF 604
           REALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHLFEAF
Sbjct: 481 REALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHLFEAF 540

Query: 605 FQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS 651
           FQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAA PPPNLAS
Sbjct: 541 FQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAAPPPNLAS 587

BLAST of Cp4.1LG01g22150 vs. ExPASy TrEMBL
Match: A0A6J1JQW6 (pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111487528 PE=4 SV=1)

HSP 1 Score: 1129 bits (2920), Expect = 0.0
Identity = 561/587 (95.57%), Postives = 574/587 (97.79%), Query Frame = 0

Query: 65  MLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLESNPDE 124
           MLS E EIGQKDSAHAV+FDIFSKCRDVDEIRKGLESSG+VISHDLVLEVLGKLESNPDE
Sbjct: 1   MLSSEPEIGQKDSAHAVVFDIFSKCRDVDEIRKGLESSGVVISHDLVLEVLGKLESNPDE 60

Query: 125 AIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQDKVL 184
           AIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCD+KKKGYGISKSVQ+KVL
Sbjct: 61  AIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDIKKKGYGISKSVQNKVL 120

Query: 185 EKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVEQQLRDMNISFS 244
           EKFEKDGLESEAE+LRDVFASGSIDKSPEK+GS+VCKLVRKNVWG+DVEQQLRDMN+SFS
Sbjct: 121 EKFEKDGLESEAERLRDVFASGSIDKSPEKMGSVVCKLVRKNVWGNDVEQQLRDMNVSFS 180

Query: 245 SDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWKVVDE 304
           SDMVKMILENLCTEPAKAFIFFRWIGE+GMFKHDEQTYN MARVLGREDSIDRFWKVVDE
Sbjct: 181 SDMVKMILENLCTEPAKAFIFFRWIGESGMFKHDEQTYNGMARVLGREDSIDRFWKVVDE 240

Query: 305 MRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIAVSKH 364
           MRSHGYE+EVETFSEVLRRFC RRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKI VSKH
Sbjct: 241 MRSHGYEMEVETFSEVLRRFCARRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKITVSKH 300

Query: 365 LDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFSGGLK 424
           LDLS+FSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFSGGLK
Sbjct: 301 LDLSMFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKEYGYVFSGGLK 360

Query: 425 RKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVE 484
           RKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKM E
Sbjct: 361 RKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKMFE 420

Query: 485 KGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEF 544
           KGVDCCAGY LDLVVNAYCQKKRETD SRL CDLVDEKQLKPWHSTYK LINKLLVRGEF
Sbjct: 421 KGVDCCAGYALDLVVNAYCQKKRETDGSRLLCDLVDEKQLKPWHSTYKELINKLLVRGEF 480

Query: 545 REALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHLFEAF 604
           REALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHLFEAF
Sbjct: 481 REALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSKSFPSTTVFLHLFEAF 540

Query: 605 FQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNLAS 651
           FQAGRHGDAQDFLSKCPGYIRNHADVLELFNS + VEAA PPPNLAS
Sbjct: 541 FQAGRHGDAQDFLSKCPGYIRNHADVLELFNSTRLVEAASPPPNLAS 587

BLAST of Cp4.1LG01g22150 vs. ExPASy TrEMBL
Match: A0A6J1FJW5 (pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like OS=Cucurbita moschata OX=3662 GN=LOC111444701 PE=4 SV=1)

HSP 1 Score: 1095 bits (2833), Expect = 0.0
Identity = 563/662 (85.05%), Postives = 589/662 (88.97%), Query Frame = 0

Query: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLHKYPVHDT------- 60
           MRYSWRLLRLRAHFR QL +SSN+SHFQV S+PNLQSLR+ SSLL K+PVH T       
Sbjct: 1   MRYSWRLLRLRAHFRSQLRISSNASHFQVHSDPNLQSLRFLSSLLPKHPVHGTASPPISD 60

Query: 61  ----SIVNFRYRTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHD 120
               SIV  RY TPR  S ES I QK+S HAV+FDIFSK RDVDEIRK LES+GIVISHD
Sbjct: 61  SQCTSIVKSRYGTPRTFSSESVIEQKESDHAVVFDIFSKSRDVDEIRKDLESNGIVISHD 120

Query: 121 LVLEVLGKLESNPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMK 180
           LVLEVLGKLESNPD AIRFF WVSGDYGEKLSSKSYNLMLGILGVN  V EFWDLNCDMK
Sbjct: 121 LVLEVLGKLESNPDSAIRFFNWVSGDYGEKLSSKSYNLMLGILGVNDHVGEFWDLNCDMK 180

Query: 181 KKGYGISKSVQDKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWG 240
           KKGYGISK+V+DKVLEKFEKDGLESEAEKLRDVFASGS DKSPEKIGSIVCKLVR NVWG
Sbjct: 181 KKGYGISKTVRDKVLEKFEKDGLESEAEKLRDVFASGSTDKSPEKIGSIVCKLVRNNVWG 240

Query: 241 DDVEQQLRDMNISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVL 300
           DDVEQQL DMN+SFSSDMVKMILENLCT+PAKAFIFFRWIGE+GMFKHDEQTYNAMARVL
Sbjct: 241 DDVEQQLCDMNVSFSSDMVKMILENLCTDPAKAFIFFRWIGESGMFKHDEQTYNAMARVL 300

Query: 301 GREDSIDRFWKVVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSV 360
           GRED IDRFWKVVDEMRSHGYE+ VETF++VL RFCKRRMIEEAVNLYVFAMAGGNKPSV
Sbjct: 301 GREDCIDRFWKVVDEMRSHGYEMGVETFAKVLGRFCKRRMIEEAVNLYVFAMAGGNKPSV 360

Query: 361 DCLTFLLKKIAVSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVL 420
           DCLTFLLKKIAVSKHLDL LFSR LK+FTETGN LTDSMV AVLKSLSTVGRIGEY EVL
Sbjct: 361 DCLTFLLKKIAVSKHLDLDLFSRTLKVFTETGNVLTDSMVSAVLKSLSTVGRIGEYTEVL 420

Query: 421 NAMKEYGYVFSGGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVA 480
           N MKEYGY FSG LKRKVAY+LS TGKSDEANDF+N LEASGCN DNKTWA+LIEG C A
Sbjct: 421 NVMKEYGYEFSGSLKRKVAYRLSRTGKSDEANDFMNGLEASGCNPDNKTWASLIEGLCAA 480

Query: 481 GDLAKASDCIHKMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHS 540
           GDLA ASDCIHKMVEKG    AGY LDL++NAYCQK+RETDAS L  DLVD KQLKPWHS
Sbjct: 481 GDLAMASDCIHKMVEKGDVSNAGYALDLIINAYCQKRRETDASHLLSDLVDGKQLKPWHS 540

Query: 541 TYKALINKLLVRGEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAMTSK 600
           TYKALINKLL RGEF+EALKLLGMMRNHEFPPFI+PFI YVSKSGTADDAIGFLKAMTSK
Sbjct: 541 TYKALINKLLQRGEFKEALKLLGMMRNHEFPPFIEPFISYVSKSGTADDAIGFLKAMTSK 600

Query: 601 SFPSTTVFLHLFEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEAAPPPPNL 651
            FPST+VFLHLFEAFFQAGRHGDAQDFLSKCP  IRNHADVL LF SMK VEAA  P NL
Sbjct: 601 RFPSTSVFLHLFEAFFQAGRHGDAQDFLSKCPSDIRNHADVLNLFYSMKPVEAAASP-NL 660

BLAST of Cp4.1LG01g22150 vs. TAIR 10
Match: AT3G02490.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 642.1 bits (1655), Expect = 4.8e-184
Identity = 342/652 (52.45%), Postives = 449/652 (68.87%), Query Frame = 0

Query: 1   MRYSWRLLRLRAHFRCQLLLSSNSSHFQVLSNPNLQSLRYFSSLLH-KYPVHDTSIVNFR 60
           MRY WR L  R++        S+ S FQV+SN    S R FSS LH ++ V     + F 
Sbjct: 1   MRYQWRSLLFRSYRSSPRPFLSHHSRFQVISN----STRSFSSFLHERFGVQQRQCL-FA 60

Query: 61  YRTP------RMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLE 120
            R+P      R  S ES I +K  A  V+ D+FS+    DEI K L+S+ +VISH+L L 
Sbjct: 61  LRSPLASSVSRRFSSESAIEEKLPAETVVIDVFSRLNGKDEITKELDSNDVVISHELALR 120

Query: 121 VLGKLESNPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGY 180
           VL +LES+PD A RFF W    Y +KLSSKSYN ML I GVNG V+EFW L  DMKKKG+
Sbjct: 121 VLRELESSPDVAGRFFKWGLEAYPQKLSSKSYNTMLRIFGVNGLVDEFWRLVDDMKKKGH 180

Query: 181 GISKSVQDKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGDDVE 240
           G+S +V+D+V +KF+KDGLE++ E+L+++FASGS+D S +K+ + VCK+V K VWG DVE
Sbjct: 181 GVSANVRDRVGDKFKKDGLENDLERLKELFASGSMDNSVDKVCNRVCKIVMKEVWGADVE 240

Query: 241 QQLRDMNISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGRED 300
           +QLRD+ + F SD+VKM+LE L  +P KA +FFRWI E+G FKHDE+TYNAMARVLG+E 
Sbjct: 241 KQLRDLKLEFKSDVVKMVLEKLDVDPRKALLFFRWIDESGSFKHDEKTYNAMARVLGKEK 300

Query: 301 SIDRFWKVVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAG--GNKPSVDC 360
            +DRF  +++E+RS GYE+E+ET+  V  RFC+ +MI+EAV L+ FAMAG   N P+  C
Sbjct: 301 FLDRFQHMIEEIRSAGYEMEMETYVRVSARFCQTKMIKEAVELFEFAMAGSISNTPTPHC 360

Query: 361 LTFLLKKIAVSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNA 420
            + LLKKI  +K LD+ LF+R LK +T  GN + D M+  VLKSL +V R G+ NEVL A
Sbjct: 361 CSLLLKKIVTAKKLDMDLFTRTLKAYTGNGNVVPDVMLQHVLKSLRSVDRFGQSNEVLKA 420

Query: 421 MKEYGYVFSGGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGD 480
           M E GYV SG L+  +A  LS  GK DEAN+ VN +EASG + D+K  A+L+EG+C A D
Sbjct: 421 MNEGGYVPSGDLQSVIASGLSRKGKKDEANELVNFMEASGNHLDDKAMASLVEGHCDAKD 480

Query: 481 LAKASDCIHKMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTY 540
           L +AS+C  KM+ K     AGY  + +V AYC   +  D  +LF +LV + QLKPWHSTY
Sbjct: 481 LEEASECFKKMIGKEGVSYAGYAFEKLVLAYCNSFQARDVYKLFSELVKQNQLKPWHSTY 540

Query: 541 KALINKLLVR-----GEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGFLKAM 600
           K ++  LL++     G F EAL LL MMRNH FPPF+DPF+ Y+S SGT+ +A  FLKA+
Sbjct: 541 KIMVRNLLMKKVARDGGFEEALSLLPMMRNHGFPPFVDPFMDYLSNSGTSAEAFAFLKAV 600

Query: 601 TSKSFPSTTVFLHLFEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMK 639
           TSK FPS ++ L +FEA  ++ RH +AQD LS  P YIR +A+VLELFN+MK
Sbjct: 601 TSKKFPSNSMVLRVFEAMLKSARHSEAQDLLSMSPSYIRRNAEVLELFNTMK 647

BLAST of Cp4.1LG01g22150 vs. TAIR 10
Match: AT5G15980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 621.7 bits (1602), Expect = 6.7e-178
Identity = 337/669 (50.37%), Postives = 448/669 (66.97%), Query Frame = 0

Query: 1   MRY-SWRLLRLRAHFR--------CQLLLSSNSSHFQVLSNPNLQSLRYFSSLLH-KYPV 60
           MRY  WRL+ LR++ R        C  + S +S  F    +P + +L+    L   + P+
Sbjct: 1   MRYQQWRLMLLRSYHRSHLPYLSPCSQVTSISSRSFSSFIHPGIGALQQSEQLCPLRSPM 60

Query: 61  HDTSIVNFRYRTPRMLSYESEIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDL 120
             TS  N      R  S E  + +K SA A + DIFS+    DEIRK LESSG+VIS DL
Sbjct: 61  --TSSGNLVKSVGRSFSSEPAVEEKSSAEATVIDIFSRLSGEDEIRKELESSGVVISQDL 120

Query: 121 VLEVLGKLESNPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKK 180
            L+VL KLESNPD A  FF W+     E+LSSK+YN+ML ILG NG V+EFW L   MKK
Sbjct: 121 ALKVLRKLESNPDVAKSFFQWIKEASPEELSSKNYNMMLRILGGNGLVDEFWGLVDVMKK 180

Query: 181 KGYGISKSVQDKVLEKFEKDGLESEAEKLRDVFASGSIDKSPEKIGSIVCKLVRKNVWGD 240
           KG+G+S +V+DKV +KF+KDGLES+  +LR +F S  +D S E +   VCK+V K  WGD
Sbjct: 181 KGHGLSANVRDKVGDKFQKDGLESDLLRLRKLFTSDCLDNSAENVCDRVCKIVMKEEWGD 240

Query: 241 DVEQQLRDMNISFSSDMVKMILENLCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLG 300
           DVE+++RD+N+ F SD+VKMI+E L  EP KA +FFRWI E+ +FKHDE+TYNAMARVLG
Sbjct: 241 DVEKRVRDLNVEFKSDLVKMIVERLDVEPRKALLFFRWIDESDLFKHDEKTYNAMARVLG 300

Query: 301 REDSIDRFWKVVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAG---GNKP 360
           +E  +DRF  +V EMRS GYE+E+ET+  V  RFC+ ++I+EAV+L+  AMAG    N P
Sbjct: 301 KEKFLDRFQNIVVEMRSAGYEVEIETYVRVSTRFCQTKLIKEAVDLFEIAMAGSSSSNNP 360

Query: 361 SVDCLTFLLKKIAVSKHLDLSLFSRALKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNE 420
           +  C   LLKKI  +K LD+ LFSRA+K++T+ GNALTDS++ +VLKSL +V R+ + NE
Sbjct: 361 TPHCFCLLLKKIVTAKILDMDLFSRAVKVYTKNGNALTDSLLKSVLKSLRSVDRVEQSNE 420

Query: 421 VLNAMKEYGYVFSGGLKRKVAYQLSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYC 480
           +L  MK  GYV SG ++  +A  LS  GK DEA++FV+ +E+SG N D+K  A+L+EGYC
Sbjct: 421 LLKEMKRGGYVPSGDMQSMIASSLSRKGKKDEADEFVDFMESSGNNLDDKAMASLVEGYC 480

Query: 481 VAGDLAKASDCIHKMVEKGVDCCAGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPW 540
            +G+L +A  C  KMV       A Y+ + +V AYC K +  DA +L    V + QLKP 
Sbjct: 481 DSGNLDEALVCFEKMVGNTGVSYADYSFEKLVLAYCNKNQVRDAYKLLSAQVTKNQLKPR 540

Query: 541 HSTYKALINKLLVR-----GEFREALKLLGMMRNHEFPPFIDPFILYVSKSGTADDAIGF 600
           HSTYK+L+  LL +     G F EAL LL +M++H FPPFIDPF+ Y S +G + +A+GF
Sbjct: 541 HSTYKSLVTNLLTKKIARDGGFEEALSLLPIMKDHGFPPFIDPFMSYFSSTGKSTEALGF 600

Query: 601 LKAMTSKSFPSTTVFLHLFEAFFQAGRHGDAQDFLSKCPGYIRNHADVLELFNSMKHVEA 652
           LKAMTS +FP  +V L +FE   ++ RH +AQD LS CP YIRN+ DVLELFN+MK  E+
Sbjct: 601 LKAMTSNNFPYISVVLRVFETMMKSARHSEAQDLLSLCPNYIRNNPDVLELFNTMKPNES 660

BLAST of Cp4.1LG01g22150 vs. TAIR 10
Match: AT3G48250.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 336.3 bits (861), Expect = 5.6e-92
Identity = 182/546 (33.33%), Postives = 300/546 (54.95%), Query Frame = 0

Query: 94  EIRKGLESSGIVISHDLVLEVLGKLESNPDEAIRFFGWVSGDYGEKLSSKSYNLMLGILG 153
           E+ +GL    + ++H+  + VL KLE  P++A  F  WV  D G   S+  Y++ML IL 
Sbjct: 75  EVEEGLRKPDMSLTHETAIYVLRKLEKYPEKAYYFLDWVLRDSGLSPSTPLYSIMLRILV 134

Query: 154 VNGRVEEFWDLNCDMKKKGYGISKSVQDKVLEKFEKDGLESEAEKLRDVFASGSIDKSPE 213
               ++ FW    +MK+ G+ + +     +  +  K+  +++A  +   +     + +  
Sbjct: 135 QQRSMKRFWMTLREMKQGGFYLDEDTYKTIYGELSKEKSKADAVAVAHFYERMLKENAMS 194

Query: 214 KIGSIVCKLVRKNVWGDDVEQQLRDMNISFSSDMVKMILENLCTEPAKAFIFFRWIGENG 273
            +   V  +V K  W  +VE++L++M +  S + V  +L+ L   P KA  FF W+G  G
Sbjct: 195 VVAGEVSAVVTKGDWSCEVERELQEMKLVLSDNFVIRVLKELREHPLKALAFFHWVGGGG 254

Query: 274 M---FKHDEQTYNAMARVLGREDSIDRFWKVVDEMRSHGYEIEVETFSEVLRRFCKRRMI 333
               ++H   TYNA  RVL R +S+  FW VVDEM++ GY+++++T+ +V R+F K RM+
Sbjct: 255 SSSGYQHSTVTYNAALRVLARPNSVAEFWSVVDEMKTAGYDMDLDTYIKVSRQFQKSRMM 314

Query: 334 EEAVNLYVFAMAGGNKPSVDCLTFLLKKIAVSKHLDLSLFSRALKIFTETGNALTDSMVF 393
            E V LY + M G  KPS+   + LL+ ++ S + DL L  R  + +  TG +L+ ++  
Sbjct: 315 AETVKLYEYMMDGPFKPSIQDCSLLLRYLSGSPNPDLDLVFRVSRKYESTGKSLSKAVYD 374

Query: 394 AVLKSLSTVGRIGEYNEVLNAMKEYGYVFSGGLKRKVAYQLSSTGKSDEANDFVNSLEAS 453
            + +SL++VGR  E  E+  AM+  GY        ++ + L    + +EA   ++ +EA 
Sbjct: 375 GIHRSLTSVGRFDEAEEITKAMRNAGYEPDNITYSQLVFGLCKAKRLEEARGVLDQMEAQ 434

Query: 454 GCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVEKGVDCCAGYTLDLVVNAYCQKKRETD 513
           GC  D KTW  LI+G+C   +L KA  C   M+EKG D  +   LD++++ +    +   
Sbjct: 435 GCFPDIKTWTILIQGHCKNNELDKALACFANMLEKGFDIDSN-LLDVLIDGFVIHNKFEG 494

Query: 514 ASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEFREALKLLGMMRNHEFPPFIDPFILYV 573
           AS    ++V    +KPW STYK LI+KLL   +  EAL LL MM+   +P + + F  Y+
Sbjct: 495 ASIFLMEMVKNANVKPWQSTYKLLIDKLLKIKKSEEALDLLQMMKKQNYPAYAEAFDGYL 554

Query: 574 SKSGTADDAIGFLKAMTSKSFPSTTVFLHLFEAFFQAGRHGDAQDFLSKCPGYIRNHADV 633
           +K GT +DA  FL  ++SK  PS   + H+ EAF++ GR  DA++ L  CP + + H  +
Sbjct: 555 AKFGTLEDAKKFLDVLSSKDSPSFAAYFHVIEAFYREGRLTDAKNLLFICPHHFKTHPKI 614

Query: 634 LELFNS 637
            ELF +
Sbjct: 615 SELFGA 619

BLAST of Cp4.1LG01g22150 vs. TAIR 10
Match: AT5G64320.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 97.8 bits (242), Expect = 3.4e-20
Identity = 123/529 (23.25%), Postives = 212/529 (40.08%), Query Frame = 0

Query: 71  EIGQKDSAHAVLFDIFSKCRDVDEIRKGLESSGIVISHDLVLEVLGKLESNPDEAIRFFG 130
           EIG  DSA+   ++   K  D+D +R               L  L +L  N   ++  F 
Sbjct: 47  EIGGTDSANE--WEKLLKPFDLDSLRNSFHK-----ITPFQLYKLLELPLNVSTSMELFS 106

Query: 131 WVSGDYGEKLSSKSYNLMLGILGVNGRVEEFWDLNCDMKKKGYGISKSVQDKVLEKFEKD 190
           W     G + S   Y +++G LG NG  +    L   MK +G    +S+   ++  ++K 
Sbjct: 107 WTGSQNGYRHSFDVYQVLIGKLGANGEFKTIDRLLIQMKDEGIVFKESLFISIMRDYDKA 166

Query: 191 GLESEAEKL----RDVFASGSIDKSPEKIGSIV----CKLVRKNVWGDDVEQQLRDMNIS 250
           G   +  +L    R+V++     KS   +  I+    C  V  NV+ D + +++     +
Sbjct: 167 GFPGQTTRLMLEMRNVYSCEPTFKSYNVVLEILVSGNCHKVAANVFYDMLSRKIPPTLFT 226

Query: 251 FSSDMVKMILENLC--TEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWK 310
           F      ++++  C   E   A    R + ++G    +   Y  +   L + + ++   +
Sbjct: 227 FG-----VVMKAFCAVNEIDSALSLLRDMTKHGCVP-NSVIYQTLIHSLSKCNRVNEALQ 286

Query: 311 VVDEMRSHGYEIEVETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA 370
           +++EM   G   + ETF++V+   CK   I EA  +    +  G  P      +L+  + 
Sbjct: 287 LLEEMFLMGCVPDAETFNDVILGLCKFDRINEAAKMVNRMLIRGFAPDDITYGYLMNGLC 346

Query: 371 VSKHLDLS--LFSRALKIFTETGNALTDSMVF--------AVLKSLST----VGRIGEYN 430
               +D +  LF R  K      N L    V         AVL  + T    V  +  YN
Sbjct: 347 KIGRVDAAKDLFYRIPKPEIVIFNTLIHGFVTHGRLDDAKAVLSDMVTSYGIVPDVCTYN 406

Query: 431 EVLNAMKEYGYVFSG--GLKRKVAYQLSS-------------------TGKSDEANDFVN 490
            ++     YGY   G  GL  +V + + +                    GK DEA + +N
Sbjct: 407 SLI-----YGYWKEGLVGLALEVLHDMRNKGCKPNVYSYTILVDGFCKLGKIDEAYNVLN 466

Query: 491 SLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVEKGVDCCAGYTLDLVVNAYCQK 550
            + A G   +   +  LI  +C    + +A +   +M  KG      YT + +++  C+ 
Sbjct: 467 EMSADGLKPNTVGFNCLISAFCKEHRIPEAVEIFREMPRKGCKPDV-YTFNSLISGLCEV 526

Query: 551 KRETDASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEFREALKLLGMM 555
                A  L  D++ E  +     TY  LIN  L RGE +EA KL+  M
Sbjct: 527 DEIKHALWLLRDMISEGVVAN-TVTYNTLINAFLRRGEIKEARKLVNEM 555

BLAST of Cp4.1LG01g22150 vs. TAIR 10
Match: AT3G16710.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 87.4 bits (215), Expect = 4.5e-17
Identity = 86/384 (22.40%), Postives = 165/384 (42.97%), Query Frame = 0

Query: 255 LCTEPAKAFIFFRWIGENGMFKHDEQTYNAMARVLGREDSIDRFWKVVDEMRSHGYEIEV 314
           L ++P +A  F   + + G F+ D  T+ ++       + I+    + D++   G++  V
Sbjct: 130 LSSQPCRASCFLGKMMKLG-FEPDLVTFTSLLNGYCHWNRIEDAIALFDQILGMGFKPNV 189

Query: 315 ETFSEVLRRFCKRRMIEEAVNLYVFAMAGGNKPSVDCLTFLLKKIA-VSKHLDLSLFSRA 374
            T++ ++R  CK R +  AV L+      G++P+V     L+  +  + +  D +   R 
Sbjct: 190 VTYTTLIRCLCKNRHLNHAVELFNQMGTNGSRPNVVTYNALVTGLCEIGRWGDAAWLLRD 249

Query: 375 LKIFTETGNALTDSMVFAVLKSLSTVGRIGEYNEVLNAMKE---YGYVFSGGLKRKVAYQ 434
           +       N +T +   A++ +   VG++ E  E+ N M +   Y  VF+ G    +   
Sbjct: 250 MMKRRIEPNVITFT---ALIDAFVKVGKLMEAKELYNVMIQMSVYPDVFTYG---SLING 309

Query: 435 LSSTGKSDEANDFVNSLEASGCNSDNKTWAALIEGYCVAGDLAKASDCIHKMVEKGVDCC 494
           L   G  DEA      +E +GC  +   +  LI G+C +  +       ++M +KGV   
Sbjct: 310 LCMYGLLDEARQMFYLMERNGCYPNEVIYTTLIHGFCKSKRVEDGMKIFYEMSQKGV-VA 369

Query: 495 AGYTLDLVVNAYCQKKRETDASRLFCDLVDEKQLKPWHSTYKALINKLLVRGEFREALKL 554
              T  +++  YC   R   A  +F + +  ++  P   TY  L++ L   G+  +AL +
Sbjct: 370 NTITYTVLIQGYCLVGRPDVAQEVF-NQMSSRRAPPDIRTYNVLLDGLCCNGKVEKALMI 429

Query: 555 LGMMRNHEFPPFIDPFILYVS---KSGTADDAIGFLKAMTSKSF-PSTTVFLHLFEAFFQ 614
              MR  E    I  + + +    K G  +DA     ++ SK   P+   +  +   F +
Sbjct: 430 FEYMRKREMDINIVTYTIIIQGMCKLGKVEDAFDLFCSLFSKGMKPNVITYTTMISGFCR 489

Query: 615 AGRHGDAQDFLSKC--PGYIRNHA 629
            G   +A     K    G++ N +
Sbjct: 490 RGLIHEADSLFKKMKEDGFLPNES 504

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9M8916.7e-18352.45Pentatricopeptide repeat-containing protein At3g02490, mitochondrial OS=Arabidop... [more]
Q8LPF19.4e-17750.37Pentatricopeptide repeat-containing protein At5g15980, mitochondrial OS=Arabidop... [more]
Q9STK57.9e-9133.33Pentatricopeptide repeat-containing protein At3g48250, chloroplastic OS=Arabidop... [more]
Q9FMF64.7e-1923.25Pentatricopeptide repeat-containing protein At5g64320, mitochondrial OS=Arabidop... [more]
Q9LUR26.4e-1622.40Putative pentatricopeptide repeat-containing protein At3g16710, mitochondrial OS... [more]
Match NameE-valueIdentityDescription
XP_023511958.10.0100.00pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like [Cucur... [more]
KAG6602554.10.098.00Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_022921370.10.098.00pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isofor... [more]
KAG7033234.10.097.85Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita a... [more]
XP_022990729.10.095.08pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isofor... [more]
Match NameE-valueIdentityDescription
A0A6J1E5F80.098.00pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isofor... [more]
A0A6J1JSU20.095.08pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isofor... [more]
A0A6J1E0A60.098.30pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isofor... [more]
A0A6J1JQW60.095.57pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like isofor... [more]
A0A6J1FJW50.085.05pentatricopeptide repeat-containing protein At3g02490, mitochondrial-like OS=Cuc... [more]
Match NameE-valueIdentityDescription
AT3G02490.14.8e-18452.45Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G15980.16.7e-17850.37Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G48250.15.6e-9233.33Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G64320.13.4e-2023.25Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G16710.14.5e-1722.40Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 232..367
e-value: 1.0E-15
score: 59.5
coord: 368..492
e-value: 1.8E-13
score: 52.2
coord: 493..571
e-value: 1.7E-8
score: 36.0
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 281..311
e-value: 7.0E-5
score: 20.7
coord: 530..561
e-value: 0.0018
score: 16.3
coord: 458..487
e-value: 9.7E-6
score: 23.4
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 144..172
e-value: 0.2
score: 12.0
coord: 458..487
e-value: 5.1E-5
score: 23.3
coord: 529..557
e-value: 0.007
score: 16.5
coord: 281..309
e-value: 0.061
score: 13.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 455..489
score: 10.500983
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 527..561
score: 8.845827
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 278..312
score: 9.996763
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 141..175
score: 8.527949
IPR044578Pentatricopeptide repeat-containing protein BIR6-likePANTHERPTHR47003OS01G0970900 PROTEINcoord: 17..643
NoneNo IPR availablePANTHERPTHR47003:SF3BNAC09G41880D PROTEINcoord: 17..643

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG01g22150.1Cp4.1LG01g22150.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008380 RNA splicing
molecular_function GO:0005515 protein binding