CmaCh06G003340 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh06G003340
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Pentatricopeptide repeat-containing protein
Location: Cma_Chr06: 1566133 .. 1567971 (+)
RNA-Seq Expression: CmaCh06G003340
Synteny: CmaCh06G003340
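
The Location field gives the chromosome, start, end, and strand of the gene. The short Python sketch below parses that string and derives the length of the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, which is not stated on this page); the function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re

# Location string copied from the Overview above.
LOCATION = "Cma_Chr06: 1566133 .. 1567971 (+)"


def parse_location(text):
    """Split 'chrom: start .. end (strand)' into its four parts."""
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


chrom, start, end, strand = parse_location(LOCATION)
print(chrom, strand, span_length(start, end))  # Cma_Chr06 + 1839
```

Under that assumption the gene spans 1839 bp on the forward strand of Cma_Chr06.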
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGAATGCGAAGCCCACTAGCCTTCAAATCTCAGTTCCCGCCAGCGCCCTTTATCCATGGGTTCTGCAAGCAATCCGCCGCAACGACGGGATGAACTACGGCGCCTATGGCCGCTTCATCCAGCACGCACCGGCTTCCTCTTCGTCCGCCTCGGCAACTGCTTCACGCTCGTCTTGTTCTATGTTCTGTCTCTCCCGAGAATTTCCTCGGATCGAAGGTCATGGCCTTCTAATCAAAATCTGGCAGCCTTAGAGATGCCTACAATGTGTTCGGTAACATTTCTCACGAGAGCATTTTCTCTTGGAATGCTTCTTTTTTATGAGTTACACTCTTCACAATATGCAATCTGATATGCTGAAGCTGTTTTTATCTTTGTTTAATTTGATATCGATGGATGCGAAGCCTGATAAGTTTAAGGTTACTTGTGTTTTGAAAGCGTTGGCGTCGTTGTTTTCTAATTTGATTTTGGCAATGGAAGTTCATTGTTTCATTCTTCGACGAGGGCTTGAGTCTCATCTTTTTGTGTCTATGCTTTGATACTCGAGGTGTAATTAGCTGGTTTTAGCGAAAATTGTGTTTGATAGAATGCCTGCGAGAGATATTGTCTTGGAGTGCGATGGTGGCTGAGTACTCTCAGGGTTGGTTTTATGGGGAATGCAAGGAACTATTTAAAGAGATGTTGAGTTTAGTGGAGTTGAGGCCTAATGCAGTGACTGCAGTCATTGTTTTACAAGCTTGTGCTCAGTCAAATTATCTCATTTTTGGAATGGAAGTTCGTTGATTCGTCAATGAAAGTCAGATTGAAATGGATGTTTCACTATGTAATGCTGTTATTGGATTATATGCAAAAAGTGCGGTAGCTTGGATTATGCTTGGATTATATGCAAAAGGCCAGCATTGAATTAATGAGTAGATATAATGGTTGACTAATTAAAGCGATAATTATGTATTTATAAGTAAAAAATACGTCTGGTATGAGGCCTTTTGGGAAAACTAAAAGTAAAATCATGAGAGCTTATGCTTAAAGTGGACAATATCATACCATTGTGGAGTGTCGTTGTTCCAAACAGTAATGCCGTGATTTTTTATCTGGTTCAGAGCAACCAATAAACTCGACTTGTAGATATATTTCGAGCAATGCAGTTGGATGGTTGCAGACCGAATACTATGACACTTGCGAGCGTTCTTCCCATCTTCTCACATTTTTTAGCTCTAAAGATGGGAAAGAAATTCATGCTTATGCCGTTAGAAACGGTTACGATGGGAATGTTTGTGTTGTCACTGCTATCATTGATTCTTATGCTAAGTCTGGTTACCTCCACAGGGCACGACTGGTTTGTGATCAATTTAAAGGTAGGAGTCTAATCATTTGGACAGCAATAATTTCAGCCTATGCTGCCAATGGAGGTCCTAACTTTACTCTTGAGTTTTTTTTATGAGATTCTGACAAATGGGATTCGGCCTGATTCAGTAGCCTTTACTCCAGTGTGTTCCTGTGCCCATTCAGGGGAGTTAGATGAAGCCTGGAAGATATTTAACGTCTTGTTACCAGAGTATGGAATTCAACCACTAATCGAGCACTATGCTTGCATGGTTGGAGTTCTTATTCTAGCAGAAAAGCTCTCCGATGCTGTTGACTTTATTTCTAAAATGCCAATTGAACCCACTACAAAAGTTTGGGGTGCTTTGCTCGATGGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTGCGTTTTTGATAGTCTGCTTGACACCGAGCCTGAAAATACAGGTAACAATATCATCATGGATAACTTATATTCACTGTTTGGAAGGTGGAAAAAATCTGGCTAG

mRNA sequence

ATGGCGAATGCGAAGCCCACTAGCCTTCAAATCTCAGTTCCCGCCAGCGCCCTTTATCCATGGGTTCTGCAAGCAATCCGCCGCAACGACGGGATGAACTACGGCGCCTATGGCCGCTTCATCCAGCACGCACCGGCTTCCTCTTCGTCCGCCTCGGCAACTGCTTCACGCTCGTCTTGTTCTATTTACACTCTTCACAATATGCAATCTGATATGCTGAAGCTGTTTTTATCTTTGTTTAATTTGATATCGATGGATGCGAAGCCTGATAAGTTTAAGGTTACTTGTGTTTTGAAAGCGTTGGCGTCGTTGTTTTCTAATTTGATTTTGGCAATGGAAGTTCATTGTTTCATTCTTCGACGAGGGCTTGAGTCTCATCTTTTTCGAAAATTGTGTTTGATAGAATGCCTGCGAGAGATATTGTCTTGGAGTGCGATGGTGGCTGAGTACTCTCAGGGTTGGTTTTATGGGGAATGCAAGGAACTATTTAAAGAGATGTTGAGTTTAGTGGAGTTGAGGCCTAATGCAGTGACTGCAGTCATTTTCGTTGATTCGTCAATGAAAGTCAGATTGAAATGGATGTTTCACTATGTAATGCTGTTATTGGATTATATGCAAAAAGTGCGCTCTAAAGATGGGAAAGAAATTCATGCTTATGCCGTTAGAAACGGTTACGATGGGAATGTTTGTGTTGTCACTGCTATCATTGATTCTTATGCTAAGTCTGGTTACCTCCACAGGGCACGACTGATTCTGACAAATGGGATTCGGCCTGATTCAGTAGCCTTTACTCCAGTGTGTTCCTGTGCCCATTCAGGGGAGTTAGATGAAGCCTGGAAGATATTTAACGTCTTGTTACCAGAGTATGGAATTCAACCACTAATCGAGCACTATGCTTGCATGGTTGGAGTTCTTATTCTAGCAGAAAAGCTCTCCGATGCTGTTGACTTTATTTCTAAAATGCCAATTGAACCCACTACAAAAGTTTGGGGTGCTTTGCTCGATGGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTGCGTTTTTGATAGTCTGCTTGACACCGAGCCTGAAAATACAGGTAACAATATCATCATGGATAACTTATATTCACTGTTTGGAAGGTGGAAAAAATCTGGCTAG

Coding sequence (CDS)

ATGGCGAATGCGAAGCCCACTAGCCTTCAAATCTCAGTTCCCGCCAGCGCCCTTTATCCATGGGTTCTGCAAGCAATCCGCCGCAACGACGGGATGAACTACGGCGCCTATGGCCGCTTCATCCAGCACGCACCGGCTTCCTCTTCGTCCGCCTCGGCAACTGCTTCACGCTCGTCTTGTTCTATTTACACTCTTCACAATATGCAATCTGATATGCTGAAGCTGTTTTTATCTTTGTTTAATTTGATATCGATGGATGCGAAGCCTGATAAGTTTAAGGTTACTTGTGTTTTGAAAGCGTTGGCGTCGTTGTTTTCTAATTTGATTTTGGCAATGGAAGTTCATTGTTTCATTCTTCGACGAGGGCTTGAGTCTCATCTTTTTCGAAAATTGTGTTTGATAGAATGCCTGCGAGAGATATTGTCTTGGAGTGCGATGGTGGCTGAGTACTCTCAGGGTTGGTTTTATGGGGAATGCAAGGAACTATTTAAAGAGATGTTGAGTTTAGTGGAGTTGAGGCCTAATGCAGTGACTGCAGTCATTTTCGTTGATTCGTCAATGAAAGTCAGATTGAAATGGATGTTTCACTATGTAATGCTGTTATTGGATTATATGCAAAAAGTGCGCTCTAAAGATGGGAAAGAAATTCATGCTTATGCCGTTAGAAACGGTTACGATGGGAATGTTTGTGTTGTCACTGCTATCATTGATTCTTATGCTAAGTCTGGTTACCTCCACAGGGCACGACTGATTCTGACAAATGGGATTCGGCCTGATTCAGTAGCCTTTACTCCAGTGTGTTCCTGTGCCCATTCAGGGGAGTTAGATGAAGCCTGGAAGATATTTAACGTCTTGTTACCAGAGTATGGAATTCAACCACTAATCGAGCACTATGCTTGCATGGTTGGAGTTCTTATTCTAGCAGAAAAGCTCTCCGATGCTGTTGACTTTATTTCTAAAATGCCAATTGAACCCACTACAAAAGTTTGGGGTGCTTTGCTCGATGGGGCTTCTGTTGCTGGTGATGTTGAGCTTGGAAAGTGCGTTTTTGATAGTCTGCTTGACACCGAGCCTGAAAATACAGGTAACAATATCATCATGGATAACTTATATTCACTGTTTGGAAGGTGGAAAAAATCTGGCTAG

Protein sequence

MANAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQHAPASSSSASATASRSSCSIYTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALASLFSNLILAMEVHCFILRRGLESHLFRKLCLIECLREILSWSAMVAEYSQGWFYGECKELFKEMLSLVELRPNAVTAVIFVDSSMKVRLKWMFHYVMLLLDYMQKVRSKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLILAEKLSDAVDFISKMPIEPTTKVWGALLDGASVAGDVELGKCVFDSLLDTEPENTGNNIIMDNLYSLFGRWKKSG
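
The protein sequence above is the conceptual translation of the CDS. As a minimal sketch of that relationship, the Python snippet below translates the first and last few codons of the CDS, assuming the standard nuclear genetic code (NCBI translation table 1). The helper `translate` and the two excerpt variables are illustrative names; only short excerpts of the CDS are used to keep the example readable.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 and last 24 nucleotides of the CDS listed above.
cds_start = "ATGGCGAATGCGAAGCCCACTAGCCTTCAAATCTCAGTTCCC"
cds_end = "GGAAGGTGGAAAAAATCTGGCTAG"

print(translate(cds_start))  # MANAKPTSLQISVP
print(translate(cds_end))    # GRWKKSG
```

Both outputs agree with the beginning (MANAKPTSLQISVP...) and the end (...GRWKKSG) of the protein sequence listed above.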
Homology
BLAST of CmaCh06G003340 vs. ExPASy Swiss-Prot
Match: Q9ZUT5 (Pentatricopeptide repeat-containing protein At2g37310 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E49 PE=2 SV=1)

HSP 1 Score: 200.7 bits (509), Expect = 3.0e-50
Identity = 151/491 (30.75%), Positives = 214/491 (43.58%), Query Frame = 0

Query: 63  YTLHNMQSDMLKLFLSLF--NLISMD-AKPDKFKVTCVLKALASL--FSNLILAMEVHCF 122
           YT   M  D   LFLS    +  S D A+PD   ++CVLKAL+    F    LA +VH F
Sbjct: 98  YTSREMYFDAFSLFLSWIGSSCYSSDAARPDSISISCVLKALSGCDDFWLGSLARQVHGF 157

Query: 123 ILRRGLESHLF------------------RKLCLIECLREILSWSAMVAEYSQGWFYGEC 182
           ++R G +S +F                  RK+      R+++SW++M++ YSQ   + +C
Sbjct: 158 VIRGGFDSDVFVGNGMITYYTKCDNIESARKVFDEMSERDVVSWNSMISGYSQSGSFEDC 217

Query: 183 KELFKEMLSLVELRPNAVTAVIFVDS---------SMKVRLKWMFHYVML---------- 242
           K+++K ML+  + +PN VT +    +          ++V  K + +++ +          
Sbjct: 218 KKMYKAMLACSDFKPNGVTVISVFQACGQSSDLIFGLEVHKKMIENHIQMDLSLCNAVIG 277

Query: 243 -------------LLDYMQKVRS------------------------------------- 302
                        L D M +  S                                     
Sbjct: 278 FYAKCGSLDYARALFDEMSEKDSVTYGAIISGYMAHGLVKEAMALFSEMESIGLSTWNAM 337

Query: 303 ---------------------------------------------KDGKEIHAYAVRNGY 362
                                                        K GKEIHA+A+RNG 
Sbjct: 338 ISGLMQNNHHEEVINSFREMIRCGSRPNTVTLSSLLPSLTYSSNLKGGKEIHAFAIRNGA 397

Query: 363 DGNVCVVTAIIDSYAKSGYLHRARLILTN------------------------------- 381
           D N+ V T+IID+YAK G+L  A+ +  N                               
Sbjct: 398 DNNIYVTTSIIDNYAKLGFLLGAQRVFDNCKDRSLIAWTAIITAYAVHGDSDSACSLFDQ 457
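
Each HSP summary reports identical and positive (similar) positions as a fraction of the alignment length, followed by the percentage. The Python sketch below parses such a summary line and recomputes the percentages from the raw counts, using the figures of the hit above as sample input. The function name `parse_hsp_ratios` is an illustrative assumption, and the regular expression is label-agnostic, so it does not depend on the exact spelling of the field names.

```python
import re

# Summary line for the first Swiss-Prot hit (Q9ZUT5).
HSP_LINE = (
    "Identity = 151/491 (30.75%), Positives = 214/491 (43.58%), "
    "Query Frame = 0"
)


def parse_hsp_ratios(line):
    """Return {label: (matches, alignment_length, recomputed_percent)}."""
    ratios = {}
    # Matches every 'Label = a/b' pair; 'Query Frame = 0' has no slash
    # and is therefore skipped.
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(matches), int(length)
        ratios[label] = (matches, length, round(100 * matches / length, 2))
    return ratios


for label, (matches, length, percent) in parse_hsp_ratios(HSP_LINE).items():
    print(f"{label}: {matches}/{length} = {percent:.2f}%")
# Identity: 151/491 = 30.75%
# Positives: 214/491 = 43.58%
```

The recomputed values agree with the percentages printed in the summary line.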

BLAST of CmaCh06G003340 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 7.7e-38
Identity = 105/371 (28.30%), Positives = 177/371 (47.71%), Query Frame = 0

Query: 63  YTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALASLFSNLILAMEVHCFILRRG 122
           Y    M  D L++   +  + + D KPD F ++ VL  + S + ++I   E+H +++R+G
Sbjct: 217 YAQSGMYEDALRM---VREMGTTDLKPDSFTLSSVL-PIFSEYVDVIKGKEIHGYVIRKG 276

Query: 123 LESHLFRKLCLIEC------------------LREILSWSAMVAEYSQGWFYGECKELFK 182
           ++S ++    L++                    R+ +SW+++VA Y Q   Y E   LF+
Sbjct: 277 IDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFR 336

Query: 183 EMLSLVELRPNAVTAVIFVDSSMKVRLKWMFHYVMLLLDYMQKVRSKDGKEIHAYAVRNG 242
           +M++  +++P AV     + +          H   L L          GK++H Y +R G
Sbjct: 337 QMVT-AKVKPGAVAFSSVIPACA--------HLATLHL----------GKQLHGYVLRGG 396

Query: 243 YDGNVCVVTAIIDSYAKSGYLHRARLIL-------------------------------- 302
           +  N+ + +A++D Y+K G +  AR I                                 
Sbjct: 397 FGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFE 456

Query: 303 ---TNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLILA 362
                G++P+ VAF  V  +C+H G +DEAW  FN +   YG+   +EHYA +  +L  A
Sbjct: 457 EMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRA 516

Query: 363 EKLSDAVDFISKMPIEPTTKVWGALLDGASVAGDVELGKCVFDSLLDTEPENTGNNIIMD 380
            KL +A +FISKM +EPT  VW  LL   SV  ++EL + V + +   + EN G  ++M 
Sbjct: 517 GKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMC 564

BLAST of CmaCh06G003340 vs. ExPASy Swiss-Prot
Match: Q9STF3 (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 1.8e-31
Identity = 100/372 (26.88%), Positives = 166/372 (44.62%), Query Frame = 0

Query: 64  TLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALAS---LFSNLILAMEVHCFILR 123
           TL     ++L L+  + N I +++  D+F  T VLKA  +     ++L+   E+H  + R
Sbjct: 154 TLAGHGEEVLGLYWKM-NRIGVES--DRFTYTYVLKACVASECTVNHLMKGKEIHAHLTR 213

Query: 124 RGLESHLFRKLCLIEC------------------LREILSWSAMVAEYSQGWFYGECKEL 183
           RG  SH++    L++                   +R ++SWSAM+A Y++     E    
Sbjct: 214 RGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRT 273

Query: 184 FKEML-SLVELRPNAVTAVIFVDSSMKVRLKWMFHYVMLLLDYMQKVRSKDGKEIHAYAV 243
           F+EM+    +  PN+VT V  + +   +                     + GK IH Y +
Sbjct: 274 FREMMRETKDSSPNSVTMVSVLQACASL------------------AALEQGKLIHGYIL 333

Query: 244 RNGYDGNVCVVTAIIDSYAKSGYLHRARLI------------------------------ 303
           R G D  + V++A++  Y + G L   + +                              
Sbjct: 334 RRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQ 393

Query: 304 -----LTNGIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVL 363
                L NG  P  V F  V  +C+H G ++E  ++F  +  ++GI+P IEHYACMV +L
Sbjct: 394 IFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLL 453

Query: 364 ILAEKLSDAVDFISKMPIEPTTKVWGALLDGASVAGDVELGKCVFDSLLDTEPENTGNNI 378
             A +L +A   +  M  EP  KVWG+LL    + G+VEL +     L   EP+N GN +
Sbjct: 454 GRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYV 504

BLAST of CmaCh06G003340 vs. ExPASy Swiss-Prot
Match: Q9LNU6 (Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H21 PE=2 SV=2)

HSP 1 Score: 138.3 bits (347), Expect = 1.8e-31
Identity = 84/274 (30.66%), Positives = 133/274 (48.54%), Query Frame = 0

Query: 140 ILSWSAMVAEYSQGWFYGECKELFKEMLSLVELRPNAVTAVIFVDSSMKVRLKWMFHYVM 199
           ++SW++++A  +Q     E  ELF+EM  +  ++PN VT    + +   +          
Sbjct: 353 VVSWTSIIAGCAQNGKDIEALELFREM-QVAGVKPNHVTIPSMLPACGNI---------- 412

Query: 200 LLLDYMQKVRSKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLI-------- 259
                        G+  H +AVR     NV V +A+ID YAK G ++ ++++        
Sbjct: 413 --------AALGHGRSTHGFAVRVHLLDNVHVGSALIDMYAKCGRINLSQIVFNMMPTKN 472

Query: 260 ------LTNG---------------------IRPDSVAFTPVCS-CAHSGELDEAWKIFN 319
                 L NG                     ++PD ++FT + S C   G  DE WK F 
Sbjct: 473 LVCWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEGWKYFK 532

Query: 320 VLLPEYGIQPLIEHYACMVGVLILAEKLSDAVDFISKMPIEPTTKVWGALLDGASVAGDV 378
           ++  EYGI+P +EHY+CMV +L  A KL +A D I +MP EP + VWGALL+   +  +V
Sbjct: 533 MMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSCRLQNNV 592

BLAST of CmaCh06G003340 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 137.9 bits (346), Expect = 2.4e-31
Identity = 103/386 (26.68%), Positives = 165/386 (42.75%), Query Frame = 0

Query: 49  SSASATASRSSCSIYTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALASLFSNL 108
           S  S  +  S  + Y    +  + +KLF     +      PD + VT VL   A  +  L
Sbjct: 358 SDRSVVSYTSMIAGYAREGLAGEAVKLF---EEMEEEGISPDVYTVTAVLNCCAR-YRLL 417

Query: 109 ILAMEVHCFILRRGLESHLFRKLCLIEC------------------LREILSWSAMVAEY 168
                VH +I    L   +F    L++                   +++I+SW+ ++  Y
Sbjct: 418 DEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGY 477

Query: 169 SQGWFYGECKELFKEMLSLVELRPNAVTAVIFVDSSMKVRLKWMFHYVMLLLDYMQKVRS 228
           S+  +  E   LF  +L      P+  T                   V  +L     + +
Sbjct: 478 SKNCYANEALSLFNLLLEEKRFSPDERT-------------------VACVLPACASLSA 537

Query: 229 KD-GKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLILTN--------------- 288
            D G+EIH Y +RNGY  +  V  +++D YAK G L  A ++  +               
Sbjct: 538 FDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAG 597

Query: 289 --------------------GIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQP 348
                               GI  D ++F  +  +C+HSG +DE W+ FN++  E  I+P
Sbjct: 598 YGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEP 657

Query: 349 LIEHYACMVGVLILAEKLSDAVDFISKMPIEPTTKVWGALLDGASVAGDVELGKCVFDSL 380
            +EHYAC+V +L     L  A  FI  MPI P   +WGALL G  +  DV+L + V + +
Sbjct: 658 TVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKV 717

BLAST of CmaCh06G003340 vs. ExPASy TrEMBL
Match: A0A6J1CWN9 (pentatricopeptide repeat-containing protein At2g37310 OS=Momordica charantia OX=3673 GN=LOC111015095 PE=4 SV=1)

HSP 1 Score: 382.1 bits (980), Expect = 2.7e-102
Identity = 259/586 (44.20%), Positives = 295/586 (50.34%), Query Frame = 0

Query: 10  QISVPASALYPWVLQAIRRNDGMNYGAYGRFIQH-------------------------- 69
           QIS+PA A+ PW LQAIRR DGMNY AYGR IQH                          
Sbjct: 3   QISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDN 62

Query: 70  -------APASSSSASATASRSSCSI--------------YTLHNMQSDMLKLFLSLFNL 129
                  A  S S +   A     +I              YTLHNM SDMLKLF SL N 
Sbjct: 63  FLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNS 122

Query: 130 ISMDAKPDKFKVTCVLKALASLFSNLILAMEVHCFILRRGLESHLFRKLCLIECL----- 189
            +MD KPDKF +TCVLKALAS F++ ILA EVHCF+LRRGLES +F    L+        
Sbjct: 123 NAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEE 182

Query: 190 -------------REILSWSAMVAEYSQGWFYGECKELFKEMLSLVELRPNAVTAVI--- 249
                        R+I+SW+AMVA +SQG FY ECKELFKEMLS VEL+PNA+TAV    
Sbjct: 183 VVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQ 242

Query: 250 ----------------FVDSS---MKVRL------------------------------- 309
                           FV+ S   M V L                               
Sbjct: 243 ACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT 302

Query: 310 ------KWMFH-YVMLLLDYMQKVRS---------------------------------- 369
                  +M H  V   +D  Q+++                                   
Sbjct: 303 YGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGC 362

Query: 370 --------------------KDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARL 381
                               K GKEIHAYAVRNGY+GN+ V TAIIDSYAKSGYLH A  
Sbjct: 363 RPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQ 422

BLAST of CmaCh06G003340 vs. ExPASy TrEMBL
Match: A0A6J1F110 (pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita moschata OX=3662 GN=LOC111441405 PE=4 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 1.1e-95
Identity = 232/486 (47.74%), Positives = 261/486 (53.70%), Query Frame = 0

Query: 63  YTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALASLFSNLILAMEVHCFILRRG 122
           YTLHNM +DMLKLF SL N+ S D KPDKF VTCVLKALASLF+N ILA EVHCF+LRRG
Sbjct: 79  YTLHNMHADMLKLFSSLVNVNSTDVKPDKFTVTCVLKALASLFTNSILAKEVHCFVLRRG 138

Query: 123 LESHLF------------RKLCLIECL------REILSWSAMVAEYSQGWFYGECKELFK 182
           LES +F             +L L   +      R+I+SW+AMVA YSQG FY +CKELFK
Sbjct: 139 LESDIFVVNALITFYSRCDELALARIMFDRTPERDIVSWNAMVAGYSQGGFYEDCKELFK 198

Query: 183 EMLSLVELRPNAVTAVI-------------------FVDSS---MKVRL----------- 242
            ML   E +PNA+TAV                    FV+ S   M V L           
Sbjct: 199 AMLGSGEPKPNALTAVSVLQACAHSNDLIFGMEVHKFVNESGIEMDVSLFNAVIGLYAKC 258

Query: 243 --------------------------KWMFH-YVMLLLDYMQKVRS-------------- 302
                                      +M H +V   +D  +++                
Sbjct: 259 GSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLV 318

Query: 303 ----------------------------------------KDGKEIHAYAVRNGYDGNVC 362
                                                   K GKEIHAYAVRN YDGN+ 
Sbjct: 319 QNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAVRNAYDGNIY 378

Query: 363 VVTAIIDSYAKSGYLHRARLI-----------------------------------LTNG 381
           V TAIIDSYAKSGYL  AR +                                   LTNG
Sbjct: 379 VATAIIDSYAKSGYLQGARQVFDQLKRRSLIIWTAIISAYAAHGDANATLSLFYEMLTNG 438

BLAST of CmaCh06G003340 vs. ExPASy TrEMBL
Match: A0A6J1J0S5 (pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita maxima OX=3661 GN=LOC111482423 PE=4 SV=1)

HSP 1 Score: 356.3 bits (913), Expect = 1.6e-94
Identity = 231/486 (47.53%), Positives = 258/486 (53.09%), Query Frame = 0

Query: 63  YTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALASLFSNLILAMEVHCFILRRG 122
           YTLHNM +DMLKLF SL NL S D KPDKF VTCVLKALASLF+N ILA EVHCF+LRRG
Sbjct: 79  YTLHNMHADMLKLFSSLVNLNSTDVKPDKFTVTCVLKALASLFTNSILAKEVHCFVLRRG 138

Query: 123 LESHLFRKLCLIECL------------------REILSWSAMVAEYSQGWFYGECKELFK 182
           LES +F    LI                     R+I+SW+AMVA YSQG FY +CKELFK
Sbjct: 139 LESDIFVVNALITFYSRCDELVLARIMFHRTPERDIVSWNAMVAGYSQGGFYEDCKELFK 198

Query: 183 EMLSLVELRPNAVTAVI-------------------FVDSS---MKVRL----------- 242
            ML   E +PNA+TAV                    FV+ S   M V L           
Sbjct: 199 AMLGSGEPKPNALTAVSVLQACAQSNDLIFGMEVHKFVNESGIEMDVSLFNAVIGLYAKC 258

Query: 243 --------------------------KWMFH-YVMLLLDYMQKVRS-------------- 302
                                      +M H +V   +D  +++                
Sbjct: 259 GSLDYARELFEGMPEKDEVTYGSMISGYMVHGFVNQAMDLFRELERPALSTWNAVISGLV 318

Query: 303 ----------------------------------------KDGKEIHAYAVRNGYDGNVC 362
                                                   K GKEIHAYAVRN YDGN+ 
Sbjct: 319 QNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPIFSHFSTLKGGKEIHAYAVRNAYDGNIY 378

Query: 363 VVTAIIDSYAKSGYLHRARLI-----------------------------------LTNG 381
           V TAIIDSYAKSGYL  AR +                                   LTNG
Sbjct: 379 VATAIIDSYAKSGYLQGARQVFDQSKRRSLIIWTAIISAYAAHGDANATLSLFYEMLTNG 438

BLAST of CmaCh06G003340 vs. ExPASy TrEMBL
Match: A0A0A0LFN1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G736590 PE=4 SV=1)

HSP 1 Score: 334.7 bits (857), Expect = 5.0e-88
Identity = 220/484 (45.45%), Positives = 249/484 (51.45%), Query Frame = 0

Query: 63  YTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALASLFSNLILAMEVHCFILRRG 122
           YTLHNM +D+LKLF SL N  S D KPD+F VTC LKALASLFSN  LA EVH FILRRG
Sbjct: 79  YTLHNMHTDLLKLFSSLVNSNSTDVKPDRFTVTCALKALASLFSNSGLAKEVHSFILRRG 138

Query: 123 LESHLFRKLCLIECL------------------REILSWSAMVAEYSQGWFYGECKELFK 182
           LE  +F    LI                     R+I+SW+AM+A YSQG  Y +CKELF+
Sbjct: 139 LEYDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAMLAGYSQGGSYEKCKELFR 198

Query: 183 EMLSLVELRPNAVTAVI-------------------FVDSS---MKVRL----------- 242
            MLS +E++PNA+TAV                    FV+ S   M V L           
Sbjct: 199 VMLSSLEVKPNALTAVSVLQACAQSNDLIFGIEVHRFVNESQIKMDVSLWNAVIGLYAKC 258

Query: 243 ---------------KWMFHYVMLLLDYM------------------------------- 302
                          K    Y  ++  YM                               
Sbjct: 259 GSLDYARELFEEMLEKDAITYCSMISGYMVHGFVNQAMDLFREQERPRLPTWNAVISGLV 318

Query: 303 QKVRS-----------------------------------KDGKEIHAYAVRNGYDGNVC 362
           Q  R                                    K GKEIH YA+RN YD N+ 
Sbjct: 319 QNNRQEGAVDIFRAMQSHGCRPNTVTLASILPVFSHFSTLKGGKEIHGYAIRNTYDRNIY 378

Query: 363 VVTAIIDSYAKSGYLHRARLI-----------------------------------LTNG 379
           V TAIIDSYAK GYLH A+L+                                   LTNG
Sbjct: 379 VATAIIDSYAKCGYLHGAQLVFDQIKGRSLIAWTSIISAYAVHGDANVALSLFYEMLTNG 438

BLAST of CmaCh06G003340 vs. ExPASy TrEMBL
Match: A0A5A7TRM4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G002030 PE=4 SV=1)

HSP 1 Score: 332.8 bits (852), Expect = 1.9e-87
Identity = 215/486 (44.24%), Positives = 252/486 (51.85%), Query Frame = 0

Query: 63  YTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALASLFSNLILAMEVHCFILRRG 122
           YTLHNM +D+LKLFLSL N  S D KPD+F VTCVLKALASLFSN +LA EVHCFILRR 
Sbjct: 79  YTLHNMHTDLLKLFLSLVNSNSTDVKPDRFTVTCVLKALASLFSNSVLAKEVHCFILRRE 138

Query: 123 LESHLFRKLCLIECL------------------REILSWSAMVAEYSQGWFYGECKELFK 182
           LES +F    LI                     R+I+SW+AM+A YSQG  Y +CKELF+
Sbjct: 139 LESDIFVVNALITFYSRCDELVLARIMFDRMPERDIVSWNAMLAGYSQGGSYEKCKELFR 198

Query: 183 EMLSLVELRPNAVTAVI-------------------FVDSS---MKVRL----------- 242
            M S +E++PNA+TAV                    FV+ S   M V L           
Sbjct: 199 VMSSSLEVKPNALTAVSVLQACAQSNDLIFGMEVHRFVNESQIKMDVSLWNAVIGLYAKC 258

Query: 243 --------------------------KWMFH-YVMLLLDYMQKVRS-------------- 302
                                      +M H +V   +D  +++                
Sbjct: 259 GSLDYARELFEEMPEKDGITYCSMISGYMVHGFVNQAMDLFRELERPRLPTWNAVISGLV 318

Query: 303 ----------------------------------------KDGKEIHAYAVRNGYDGNVC 362
                                                   K GKEIH YA+RN YDGN+ 
Sbjct: 319 QNNRQDGALDIFRAMQSHGCRPNTVTLASILPIFSHFSTLKGGKEIHGYAIRNTYDGNIF 378

Query: 363 VVTAIIDSYAKSGYLHRARLI-----------------------------------LTNG 381
           V TAIIDSYAK GYL  AR +                                   LT G
Sbjct: 379 VATAIIDSYAKCGYLQGARQVFDQLKGRSLIAWTSIISAYAVHGDANVALSLFYEMLTYG 438

BLAST of CmaCh06G003340 vs. NCBI nr
Match: KAG7027978.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 446.0 bits (1146), Expect = 3.2e-121
Identity = 264/448 (58.93%), Positives = 279/448 (62.28%), Query Frame = 0

Query: 1   MANAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQHAPASSSSASATASRSSC 60
           M NAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQH PASSSSASATASRSSC
Sbjct: 1   MTNAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQHVPASSSSASATASRSSC 60

Query: 61  SI------------------------------YTLHNMQSDMLKLFLSLFNLISMDAKPD 120
           S+                              YT HNMQSDMLKLFLSLFNLISMDAKPD
Sbjct: 61  SMFCRSREFPRIEGHRLLIKIWQPWRCLQCVRYTFHNMQSDMLKLFLSLFNLISMDAKPD 120

Query: 121 KFKVTCVLKALASLFSNLILAMEVHCFILRRGLESHLFRKLCLIECLREILSWSAMVA-- 180
           KFK                       FI+    E      LCL    R ++SW A +   
Sbjct: 121 KFK-----------------------FIVSFFDEGLSLSFLCLCSDTRGVISWLAKIVFD 180

Query: 181 EYSQGWFYGECKELFKEMLSLVELRPNAVTAVIFVDSSMKVRLKWMFHYVMLLLDYMQKV 240
                    EC +  + M S   +  + +T   + D+  +      F             
Sbjct: 181 RMPARDIVFECDKCLRRMRSHCAMISDYMTE--YYDTCERSSHLLTF------------C 240

Query: 241 RSKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARL------------------ 300
            +KDG         NGYDGNVCVVTAIIDSYAKSGYLHRARL                  
Sbjct: 241 NTKDG---------NGYDGNVCVVTAIIDSYAKSGYLHRARLVCDQFKGRSLIIWTAIIS 300

Query: 301 -----------------ILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQP 360
                            ILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQP
Sbjct: 301 AYAANGGANVALSFFYEILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQP 360

Query: 361 LIEHYACMVGVLILAEKLSDAVDFISKMPIEPTTKVWGALLDGASVAGDVELGKCVFDSL 382
           LIEHYACMVGVL LAEK SDAVDFISKMPIEPTTKVW ALL+GASVAGDVELGKCVFDSL
Sbjct: 361 LIEHYACMVGVLSLAEKFSDAVDFISKMPIEPTTKVWDALLNGASVAGDVELGKCVFDSL 402

BLAST of CmaCh06G003340 vs. NCBI nr
Match: XP_022145703.1 (pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia])

HSP 1 Score: 382.1 bits (980), Expect = 5.6e-102
Identity = 259/586 (44.20%), Positives = 295/586 (50.34%), Query Frame = 0

Query: 10  QISVPASALYPWVLQAIRRNDGMNYGAYGRFIQH-------------------------- 69
           QIS+PA A+ PW LQAIRR DGMNY AYGR IQH                          
Sbjct: 3   QISIPAGAVIPWALQAIRRADGMNYAAYGRLIQHCADRRFLRLGKQLHARLVLLSVTPDN 62

Query: 70  -------APASSSSASATASRSSCSI--------------YTLHNMQSDMLKLFLSLFNL 129
                  A  S S +   A     +I              YTLHNM SDMLKLF SL N 
Sbjct: 63  FLGSKLIAFYSKSGSLRDAYNVFGNISHKNIFSWNALFISYTLHNMHSDMLKLFSSLVNS 122

Query: 130 ISMDAKPDKFKVTCVLKALASLFSNLILAMEVHCFILRRGLESHLFRKLCLIECL----- 189
            +MD KPDKF +TCVLKALAS F++ ILA EVHCF+LRRGLES +F    L+        
Sbjct: 123 NAMDVKPDKFTITCVLKALASSFTDSILAKEVHCFVLRRGLESDIFVVNALVTYYSRCEE 182

Query: 190 -------------REILSWSAMVAEYSQGWFYGECKELFKEMLSLVELRPNAVTAVI--- 249
                        R+I+SW+AMVA +SQG FY ECKELFKEMLS VEL+PNA+TAV    
Sbjct: 183 VVLARIVFGRMPERDIVSWNAMVAGFSQGGFYEECKELFKEMLSSVELKPNALTAVSVLQ 242

Query: 250 ----------------FVDSS---MKVRL------------------------------- 309
                           FV+ S   M V L                               
Sbjct: 243 ACAQSNDLIFGMEVHRFVNESQIEMDVSLCNAVIGLYAKCGSLDYARELFEEMPEKDEVT 302

Query: 310 ------KWMFH-YVMLLLDYMQKVRS---------------------------------- 369
                  +M H  V   +D  Q+++                                   
Sbjct: 303 YGSMISGYMVHGSVNQAMDLFQELKKPALSTWNAVISGLVQNNQQDGVLDIFRAMQSHGC 362

Query: 370 --------------------KDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARL 381
                               K GKEIHAYAVRNGY+GN+ V TAIIDSYAKSGYLH A  
Sbjct: 363 RPNAVTLASVLPVFSHFSTLKGGKEIHAYAVRNGYNGNIYVATAIIDSYAKSGYLHGAWQ 422

BLAST of CmaCh06G003340 vs. NCBI nr
Match: KAG6580575.1 (ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 374.8 bits (961), Expect = 8.9e-100
Identity = 258/573 (45.03%), Positives = 286/573 (49.91%), Query Frame = 0

Query: 23   LQAIRRNDGMNYGAYGRFIQH---------------------------------APASSS 82
            LQ IRR+DGMNYGAYGR IQH                                 A  S S
Sbjct: 720  LQLIRRSDGMNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKS 779

Query: 83   SASATASRSSCSI--------------YTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVT 142
             +   A     SI              YTLHNM +DMLKLF SL NL S D KPDKF VT
Sbjct: 780  GSLRDAYNVFDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNLNSTDVKPDKFTVT 839

Query: 143  CVLKALASLFSNLILAMEVHCFILRRGLESHLFRKLCLIECL------------------ 202
            CVLKALASLF+N ILA EVHCF+LRRGLES +F    LI                     
Sbjct: 840  CVLKALASLFTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELVLARIMFDRTPE 899

Query: 203  REILSWSAMVAEYSQGWFYGECKELFKEMLSLVELRPNAVTAVI---------------- 262
            R+I+SW+AMVA YSQG FY +CKELFK ML   E +PNA+TAV                 
Sbjct: 900  RDIVSWNAMVAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAQSNDLIFGME 959

Query: 263  ---FVDSS---MKVRL-------------------------------------KWMFH-Y 322
               FV+ S   M V L                                      +M H +
Sbjct: 960  VHKFVNESGIEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGF 1019

Query: 323  VMLLLDYMQKVRS----------------------------------------------- 381
            V   +D  +++                                                 
Sbjct: 1020 VNQAMDLFRELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPI 1079

BLAST of CmaCh06G003340 vs. NCBI nr
Match: KAG7017327.1 (ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 371.7 bits (953), Expect = 7.6e-99
Identity = 257/573 (44.85%), Positives = 285/573 (49.74%), Query Frame = 0

Query: 23   LQAIRRNDGMNYGAYGRFIQH---------------------------------APASSS 82
            LQ IRR+DGMNYGAYGR IQH                                 A  S S
Sbjct: 739  LQLIRRSDGMNYGAYGRLIQHCTDQRFFRLGKQLHARLVLSSVAPDNFLGSKLIALYSKS 798

Query: 83   SASATASRSSCSI--------------YTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVT 142
             +   A     SI              YTLHNM +DMLKLF SL NL S D KPDKF VT
Sbjct: 799  GSLRDAYNVFDSISHKNIFSWNALFISYTLHNMHADMLKLFSSLVNLNSTDVKPDKFTVT 858

Query: 143  CVLKALASLFSNLILAMEVHCFILRRGLESHLFRKLCLIECL------------------ 202
            CVLKALASLF+N ILA EVHCF+LRRGLES +F    LI                     
Sbjct: 859  CVLKALASLFTNSILAKEVHCFVLRRGLESDIFVVNALITFYSRCDELVLARIMFDRTPE 918

Query: 203  REILSWSAMVAEYSQGWFYGECKELFKEMLSLVELRPNAVTAVI---------------- 262
            R+I+SW+AMVA YSQG FY +CKELFK ML   E +PNA+TAV                 
Sbjct: 919  RDIVSWNAMVAGYSQGGFYEDCKELFKAMLGSGEPKPNALTAVSVLQACAQSNDLIFGME 978

Query: 263  ---FVDSS---MKVRL-------------------------------------KWMFH-Y 322
               FV+ S   M V L                                      +M H +
Sbjct: 979  VHKFVNESGIEMDVSLFNAVIGLYAKCGSLDYARELFEGMPEKDEVTYGSMISGYMVHGF 1038

Query: 323  VMLLLDYMQKVRS----------------------------------------------- 381
            V   +D  +++                                                 
Sbjct: 1039 VNQAMDLFRELERPALSTWNAVISGLVQNNQQDGVVDIFRAMQLHGCRPNTVTLASVLPI 1098

BLAST of CmaCh06G003340 vs. NCBI nr
Match: XP_023539538.1 (pentatricopeptide repeat-containing protein At2g37310-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 365.9 bits (938), Expect = 4.1e-97
Identity = 223/413 (54.00%), Positives = 244/413 (59.08%), Query Frame = 0

Query: 1   MANAKPTSLQISVPASALYPWVLQAIRRNDGMNYGAYGRFIQHAPASSSSASATASRSSC 60
           M NAKPTSLQISVPASALYPW LQAIRR+DGMNYGAYGRFIQ AP SSSSASATASRSSC
Sbjct: 1   MTNAKPTSLQISVPASALYPWSLQAIRRDDGMNYGAYGRFIQLAPTSSSSASATASRSSC 60

Query: 61  SIYTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALASLFSNLILAMEVHCFILR 120
           S++                                                         
Sbjct: 61  SMF--------------------------------------------------------- 120

Query: 121 RGLESHLFRKLCLIECLREILSWSAMVAEYSQGWFYGECKELFKEMLSLVELRPNAVTAV 180
               S  F +  + ECLREILSW+AMVAE SQGWFYGECKEL+K MLSLVELRPNA+TAV
Sbjct: 121 --CRSREFPR--IEECLREILSWNAMVAECSQGWFYGECKELYKAMLSLVELRPNALTAV 180

Query: 181 IFVDSSMKVR----------LKWMFH------------YVMLLLDYMQKVRS-------- 240
           I + +  ++           L +++             Y  ++ DYM   +         
Sbjct: 181 IVLQACAQLNYLILGMECGSLDYVWELFEEMPEKDEVTYGAMISDYMVHEQPISWSCRYI 240

Query: 241 ----------------------------KDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKS 300
                                       KDGKEIHAYAVRN YDGNVCVVTAIIDSYAKS
Sbjct: 241 SSNAVHGCRPNTVTLASVLPIFSHFSTLKDGKEIHAYAVRNCYDGNVCVVTAIIDSYAKS 300

Query: 301 GYLHRARLILTNGIRPDSVAFTPVCSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMV 356
           GY HRARL               VC     GELDEAWKIFNVLLPEYGIQPL+EHYACMV
Sbjct: 301 GYFHRARL---------------VCD-QFKGELDEAWKIFNVLLPEYGIQPLVEHYACMV 336

BLAST of CmaCh06G003340 vs. TAIR 10
Match: AT2G37310.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 200.7 bits (509), Expect = 2.2e-51
Identity = 151/491 (30.75%), Positives = 214/491 (43.58%), Query Frame = 0

Query: 63  YTLHNMQSDMLKLFLSLF--NLISMD-AKPDKFKVTCVLKALASL--FSNLILAMEVHCF 122
           YT   M  D   LFLS    +  S D A+PD   ++CVLKAL+    F    LA +VH F
Sbjct: 98  YTSREMYFDAFSLFLSWIGSSCYSSDAARPDSISISCVLKALSGCDDFWLGSLARQVHGF 157

Query: 123 ILRRGLESHLF------------------RKLCLIECLREILSWSAMVAEYSQGWFYGEC 182
           ++R G +S +F                  RK+      R+++SW++M++ YSQ   + +C
Sbjct: 158 VIRGGFDSDVFVGNGMITYYTKCDNIESARKVFDEMSERDVVSWNSMISGYSQSGSFEDC 217

Query: 183 KELFKEMLSLVELRPNAVTAVIFVDS---------SMKVRLKWMFHYVML---------- 242
           K+++K ML+  + +PN VT +    +          ++V  K + +++ +          
Sbjct: 218 KKMYKAMLACSDFKPNGVTVISVFQACGQSSDLIFGLEVHKKMIENHIQMDLSLCNAVIG 277

Query: 243 -------------LLDYMQKVRS------------------------------------- 302
                        L D M +  S                                     
Sbjct: 278 FYAKCGSLDYARALFDEMSEKDSVTYGAIISGYMAHGLVKEAMALFSEMESIGLSTWNAM 337

Query: 303 ---------------------------------------------KDGKEIHAYAVRNGY 362
                                                        K GKEIHA+A+RNG 
Sbjct: 338 ISGLMQNNHHEEVINSFREMIRCGSRPNTVTLSSLLPSLTYSSNLKGGKEIHAFAIRNGA 397

Query: 363 DGNVCVVTAIIDSYAKSGYLHRARLILTN------------------------------- 381
           D N+ V T+IID+YAK G+L  A+ +  N                               
Sbjct: 398 DNNIYVTTSIIDNYAKLGFLLGAQRVFDNCKDRSLIAWTAIITAYAVHGDSDSACSLFDQ 457

BLAST of CmaCh06G003340 vs. TAIR 10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 159.5 bits (402), Expect = 5.5e-39
Identity = 105/371 (28.30%), Positives = 177/371 (47.71%), Query Frame = 0

Query: 63  YTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALASLFSNLILAMEVHCFILRRG 122
           Y    M  D L++   +  + + D KPD F ++ VL  + S + ++I   E+H +++R+G
Sbjct: 217 YAQSGMYEDALRM---VREMGTTDLKPDSFTLSSVL-PIFSEYVDVIKGKEIHGYVIRKG 276

Query: 123 LESHLFRKLCLIEC------------------LREILSWSAMVAEYSQGWFYGECKELFK 182
           ++S ++    L++                    R+ +SW+++VA Y Q   Y E   LF+
Sbjct: 277 IDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVAGYVQNGRYNEALRLFR 336

Query: 183 EMLSLVELRPNAVTAVIFVDSSMKVRLKWMFHYVMLLLDYMQKVRSKDGKEIHAYAVRNG 242
           +M++  +++P AV     + +          H   L L          GK++H Y +R G
Sbjct: 337 QMVT-AKVKPGAVAFSSVIPACA--------HLATLHL----------GKQLHGYVLRGG 396

Query: 243 YDGNVCVVTAIIDSYAKSGYLHRARLIL-------------------------------- 302
           +  N+ + +A++D Y+K G +  AR I                                 
Sbjct: 397 FGSNIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFE 456

Query: 303 ---TNGIRPDSVAFTPV-CSCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVLILA 362
                G++P+ VAF  V  +C+H G +DEAW  FN +   YG+   +EHYA +  +L  A
Sbjct: 457 EMKRQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRA 516

Query: 363 EKLSDAVDFISKMPIEPTTKVWGALLDGASVAGDVELGKCVFDSLLDTEPENTGNNIIMD 380
            KL +A +FISKM +EPT  VW  LL   SV  ++EL + V + +   + EN G  ++M 
Sbjct: 517 GKLEEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMC 564

BLAST of CmaCh06G003340 vs. TAIR 10
Match: AT1G20230.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 138.3 bits (347), Expect = 1.3e-32
Identity = 84/274 (30.66%), Positives = 133/274 (48.54%), Query Frame = 0

Query: 140 ILSWSAMVAEYSQGWFYGECKELFKEMLSLVELRPNAVTAVIFVDSSMKVRLKWMFHYVM 199
           ++SW++++A  +Q     E  ELF+EM  +  ++PN VT    + +   +          
Sbjct: 353 VVSWTSIIAGCAQNGKDIEALELFREM-QVAGVKPNHVTIPSMLPACGNI---------- 412

Query: 200 LLLDYMQKVRSKDGKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLI-------- 259
                        G+  H +AVR     NV V +A+ID YAK G ++ ++++        
Sbjct: 413 --------AALGHGRSTHGFAVRVHLLDNVHVGSALIDMYAKCGRINLSQIVFNMMPTKN 472

Query: 260 ------LTNG---------------------IRPDSVAFTPVCS-CAHSGELDEAWKIFN 319
                 L NG                     ++PD ++FT + S C   G  DE WK F 
Sbjct: 473 LVCWNSLMNGFSMHGKAKEVMSIFESLMRTRLKPDFISFTSLLSACGQVGLTDEGWKYFK 532

Query: 320 VLLPEYGIQPLIEHYACMVGVLILAEKLSDAVDFISKMPIEPTTKVWGALLDGASVAGDV 378
           ++  EYGI+P +EHY+CMV +L  A KL +A D I +MP EP + VWGALL+   +  +V
Sbjct: 533 MMSEEYGIKPRLEHYSCMVNLLGRAGKLQEAYDLIKEMPFEPDSCVWGALLNSCRLQNNV 592

BLAST of CmaCh06G003340 vs. TAIR 10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 138.3 bits (347), Expect = 1.3e-32
Identity = 100/372 (26.88%), Positives = 166/372 (44.62%), Query Frame = 0

Query: 64  TLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALAS---LFSNLILAMEVHCFILR 123
           TL     ++L L+  + N I +++  D+F  T VLKA  +     ++L+   E+H  + R
Sbjct: 154 TLAGHGEEVLGLYWKM-NRIGVES--DRFTYTYVLKACVASECTVNHLMKGKEIHAHLTR 213

Query: 124 RGLESHLFRKLCLIEC------------------LREILSWSAMVAEYSQGWFYGECKEL 183
           RG  SH++    L++                   +R ++SWSAM+A Y++     E    
Sbjct: 214 RGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRT 273

Query: 184 FKEML-SLVELRPNAVTAVIFVDSSMKVRLKWMFHYVMLLLDYMQKVRSKDGKEIHAYAV 243
           F+EM+    +  PN+VT V  + +   +                     + GK IH Y +
Sbjct: 274 FREMMRETKDSSPNSVTMVSVLQACASL------------------AALEQGKLIHGYIL 333

Query: 244 RNGYDGNVCVVTAIIDSYAKSGYLHRARLI------------------------------ 303
           R G D  + V++A++  Y + G L   + +                              
Sbjct: 334 RRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQ 393

Query: 304 -----LTNGIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQPLIEHYACMVGVL 363
                L NG  P  V F  V  +C+H G ++E  ++F  +  ++GI+P IEHYACMV +L
Sbjct: 394 IFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLL 453

Query: 364 ILAEKLSDAVDFISKMPIEPTTKVWGALLDGASVAGDVELGKCVFDSLLDTEPENTGNNI 378
             A +L +A   +  M  EP  KVWG+LL    + G+VEL +     L   EP+N GN +
Sbjct: 454 GRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYV 504

BLAST of CmaCh06G003340 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 137.9 bits (346), Expect = 1.7e-32
Identity = 103/386 (26.68%), Positives = 165/386 (42.75%), Query Frame = 0

Query: 49  SSASATASRSSCSIYTLHNMQSDMLKLFLSLFNLISMDAKPDKFKVTCVLKALASLFSNL 108
           S  S  +  S  + Y    +  + +KLF     +      PD + VT VL   A  +  L
Sbjct: 358 SDRSVVSYTSMIAGYAREGLAGEAVKLF---EEMEEEGISPDVYTVTAVLNCCAR-YRLL 417

Query: 109 ILAMEVHCFILRRGLESHLFRKLCLIEC------------------LREILSWSAMVAEY 168
                VH +I    L   +F    L++                   +++I+SW+ ++  Y
Sbjct: 418 DEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGY 477

Query: 169 SQGWFYGECKELFKEMLSLVELRPNAVTAVIFVDSSMKVRLKWMFHYVMLLLDYMQKVRS 228
           S+  +  E   LF  +L      P+  T                   V  +L     + +
Sbjct: 478 SKNCYANEALSLFNLLLEEKRFSPDERT-------------------VACVLPACASLSA 537

Query: 229 KD-GKEIHAYAVRNGYDGNVCVVTAIIDSYAKSGYLHRARLILTN--------------- 288
            D G+EIH Y +RNGY  +  V  +++D YAK G L  A ++  +               
Sbjct: 538 FDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMIAG 597

Query: 289 --------------------GIRPDSVAFTPVC-SCAHSGELDEAWKIFNVLLPEYGIQP 348
                               GI  D ++F  +  +C+HSG +DE W+ FN++  E  I+P
Sbjct: 598 YGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACSHSGLVDEGWRFFNIMRHECKIEP 657

Query: 349 LIEHYACMVGVLILAEKLSDAVDFISKMPIEPTTKVWGALLDGASVAGDVELGKCVFDSL 380
            +EHYAC+V +L     L  A  FI  MPI P   +WGALL G  +  DV+L + V + +
Sbjct: 658 TVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIWGALLCGCRIHHDVKLAEKVAEKV 717

The following BLAST results are available for this feature:
Match Name    E-value    Identity (%)    Description
Q9ZUT5        3.0e-50    30.75           Pentatricopeptide repeat-containing protein At2g37310 OS=Arabidopsis thaliana OX...
Q9LW63        7.7e-38    28.30           Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
Q9STF3        1.8e-31    26.88           Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop...
Q9LNU6        1.8e-31    30.66           Pentatricopeptide repeat-containing protein At1g20230 OS=Arabidopsis thaliana OX...
Q9SN39        2.4e-31    26.68           Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Match Name    E-value     Identity (%)    Description
A0A6J1CWN9    2.7e-102    44.20           pentatricopeptide repeat-containing protein At2g37310 OS=Momordica charantia OX=...
A0A6J1F110    1.1e-95     47.74           pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita moschata OX=3...
A0A6J1J0S5    1.6e-94     47.53           pentatricopeptide repeat-containing protein At2g37310 OS=Cucurbita maxima OX=366...
A0A0A0LFN1    5.0e-88     45.45           Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G736590 PE=4 SV=1
A0A5A7TRM4    1.9e-87     44.24           Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
Match Name        E-value     Identity (%)    Description
KAG7027978.1      3.2e-121    58.93           Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_022145703.1    5.6e-102    44.20           pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia]
KAG6580575.1      8.9e-100    45.03           ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. soror...
KAG7017327.1      7.6e-99     44.85           ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyr...
XP_023539538.1    4.1e-97     54.00           pentatricopeptide repeat-containing protein At2g37310-like [Cucurbita pepo subsp...
Match Name     E-value    Identity (%)    Description
AT2G37310.1    2.2e-51    30.75           Pentatricopeptide repeat (PPR) superfamily protein
AT3G23330.1    5.5e-39    28.30           Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G20230.1    1.3e-32    30.66           Pentatricopeptide repeat (PPR) superfamily protein
AT3G46790.1    1.3e-32    26.88           Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1    1.7e-32    26.68           Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term     IPR Description                                      Source     Source Term       Source Description                Alignment
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D     1.25.40.10        Tetratricopeptide repeat domain   coord: 264..373; e-value: 6.1E-14; score: 53.8
IPR011990    Tetratricopeptide-like helical domain superfamily    GENE3D     1.25.40.10        Tetratricopeptide repeat domain   coord: 62..252; e-value: 3.5E-8; score: 35.2
IPR002885    Pentatricopeptide repeat                             PFAM       PF01535           PPR                               coord: 142..168; e-value: 0.014; score: 15.6
None         No IPR available                                     PANTHER    PTHR47925         OS01G0913400 PROTEIN-RELATED      coord: 63..181
None         No IPR available                                     PANTHER    PTHR47925:SF76    BNAA04G21330D PROTEIN             coord: 63..181; coord: 208..252
None         No IPR available                                     PANTHER    PTHR47925         OS01G0913400 PROTEIN-RELATED      coord: 208..252
None         No IPR available                                     PANTHER    PTHR47925:SF76    BNAA04G21330D PROTEIN             coord: 251..379
None         No IPR available                                     PANTHER    PTHR47925         OS01G0913400 PROTEIN-RELATED      coord: 251..379
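
The `coord` values in the InterPro annotations can be compared as simple intervals, for example to see which domain hits nest inside or overlap one another. The sketch below assumes the coordinates are 1-based, inclusive positions on the protein (the record does not say so explicitly); the dictionary labels and the helper names `contains` and `overlap_length` are hypothetical.

```python
# Coordinates copied from the InterPro annotations above.
# Assumption: positions are 1-based and inclusive on the protein sequence.
DOMAINS = {
    "GENE3D 1.25.40.10 (first)": (62, 252),
    "GENE3D 1.25.40.10 (second)": (264, 373),
    "PFAM PF01535": (142, 168),
    "PANTHER PTHR47925 (middle)": (208, 252),
    "PANTHER PTHR47925 (last)": (251, 379),
}


def contains(outer, inner):
    """True if the interval `inner` lies entirely within `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


def overlap_length(a, b):
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


if __name__ == "__main__":
    ppr = DOMAINS["PFAM PF01535"]
    for name, span in DOMAINS.items():
        if name != "PFAM PF01535":
            print(name, "contains PF01535:", contains(span, ppr))
```

Under that assumption, the single Pfam PPR hit (142..168) falls inside the first Gene3D tetratricopeptide-like region (62..252).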

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmaCh06G003340.1    CmaCh06G003340.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
molecular_function    GO:0005515        protein binding