Cp4.1LG18g09360 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG18g09360
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPentatricopeptide repeat-containing protein
LocationCp4.1LG18: 8309400 .. 8313508 (+)
RNA-Seq ExpressionCp4.1LG18g09360
SyntenyCp4.1LG18g09360
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACGTGTCAACATTACAAATAAAGACAAATTACTAGCTTACCATGATAAATAAAAAAGAGACCTCAAAAGCGAACTTGGGCTTAATGGTGCCTAAAGCCAACCAGCCCTTGGACATGCTTAAAGGTAAGCGAGGCTTGGACTTGGTCGGATCATGAGCCAAGTTTGGGCCAGGGCATAGTCAAACCTAGGGCTAGCCGACAGACCTTGGAACAACCACCCAGGCCTAGGTGGTTCCCGGCTTATCGGTGCCGGAGTGGAAAATGAAGGCGAAGGAGTTGTGTTTTAGGTCGCTGTATCGGTGCCGGAGGGGTAAGCTGAAGATGTTGCAGAAGCAGAGATACTTGGACCGCCAAAGGATTACTTCCGCCCTTCTTGCTACCTCTGCCTATGCTTTACCAGGCCCCCTTCGAATTCCAAAGGTTCCACTTTCTTCACTCCTTCCTTTCACCTCTTCTCCCATACATGTACACTATACCTCTTGCGCTTCTCTTCCCAATCCTCTTCCACTCCTCAATTTCATGTCAATTCGCTGCATTTGTTTTCTGCCTCCGTCTCCTCCCTCTTCCATTCCCTGGAATTCTAATGAACTTGAGGACGCAAATTCTACTTCTAAGTTCACCGTAAAAGCAGGTAACGGTAGGTTAGTTCCTCTTTTCTCCTCTTCTGCCTCCACCCGGATGTGACTTTTCTTCTTCCATCTCCGTAATTAGTGGCACATATGCCTCTCTTGCCGTTGAATTTGAATGCCCTTTTTTTTTTCCTCATAAACCCTATTTCAGGTTATCAAACCCTTTATCGTGTTCTTGAAGCCTGCAGACTCTCCCCCTCGAATTCCAAAACCGCTTCTGAAACGCATGCAAGAATGATTAAATTTGGATATGGAAACTACCCAACTCTCGTCACCTCTTTAGTATCAGCTTATCAACGTGCCGATTGCCTTAATCGGGTCCATCAACTTCTTAATCTACTCTGCTCTAAACATCTTGATTTAGTTGCAATGAACTTATTTATTGACAATTTTATGAAAATAGGGGAATGCAAACTTGCTAAAAGGGTGTTTGATAAAATGCCTTACCGTGATGTGGTAACGTGGAACTCAATCATCGGAGGTTGTGTGAAGAATGCACGGTATGAGGAGGCATTTAAATTCTTTAGGCAGATGCTGAACTCAAATATCCAGCCGGATGGGTTTACGTTTGCTTCCGTGTTGAATGCATGTGCGCAGCTCGGAGCTCCAAGTAACACTCAGTGGGTTCATGCTCTGATGACTCAGAAAAAAATTGAGCTTAATTCTATATTGAGTTGTGCACTCATAGACGCCTACTCAAAGTGTGGCAGCATCCAAATTGCAAAGGAAATCTTTAGCAGCGTCCCTCGCAGTAACATCTCAGTTTGGAATGCGATGATCAAAGGGCTTGCGATTCATGGGCTTTCAATGGATGCATTATCGGTATTCTGGATGATGGAGCGTGAGAATGTTCTGCCTGATGCTGTCACCTTTTTGGGTATCTTAACAGCCTGCAACCATGGTGGCTTAATTGAACAGGGTCGCAGGTTTTTTGATTGGATGAAAAACCGTTATTCAATTCAGCCACAGCTCGAACATTATGGAGTCATGGTTGATCTGTATAGCCGAGCTGGGTTTCTCGAGGAGGCCTATTCCATAATCGTAGCAATGCCCATAGAGGCAGATGTTGTGACATGGAGGGCGCTTCTGAGTGGTTGTAGAATTTACAGAAATCAAGAACTCGCAGAAGTTGCTATTGCAAACATGTCTCATCGTGGGAGTGGAGATTACGTGTTGCTATCAAATACCTATTGTTCTCTCAACAGATGGGAGCATGCAGAGAGAGTTAGAGAGAGGATGAAAAGCAATGGAGTTCGCAAGAGTTGCGGAAAAAGCTGGATTGAGTTGGGAGGTTCCATTCAAAGCTTCAAGTCAGGTGACCGATCGCATCCAGAAAGCGATGCAGTGTACAAGGTGGTGTGGCGTTTGATGAAGAGAAGTCGGTCAGAGGGATATATGCCTGTGACAGACTTGGTTCTCATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTATCATATCACAGCGAAAAGTTGGCATTGGCGTATGCGATCCTGAAAACTAGTCCAGGGGCAAAAATCAGTATATCAAAGAACCTACGGATGTGTGATGATTGTCATAGATGGATAAAACTAGTTTCAACACTGCTGTGCAGAGTTGTAGTAGTGAGAGATCGGATCCGGTTTCATCAATTCGAAGGTGGCATGTGTTCCTGTGGTGATCGTTGGTAGTTTGGTTTGGTTTGTTGTTTTAGTCTTTTTTTAACTACATTTGGTAGGGTCGCATTGAAAGACAAGTTATATATTTTAGTTTGGTTCAGACTTTTGGACAACGAAACCATGTTTGGTTGAATGAGGGATCAATTACAATTCCTCCTTATGCTCCTCCTCAGGAAATGCAGGCCCACCTCATGATTGGAGGCGTCCCTGTTATTTGTATAGATGAGAAAGATTGTTTGCATGTTTGTATTGTATTGTATTGTATTGTTTAAAGATAGATTATAAGTAGGAAGGAAGATTGTTTTTCAAATTTTATTCATCTAGTTTAAAAAAATTCAAACAATTCATCTATTTTGATCGCATTTCCTTCAACCCTGTTTCTACCTTGAGACTACTTGTTCTCTCAGAAGCTTATCTCCCGTCGCTCTGCAAAAAACTTACCTGTGACTGCCTGATCTGGAAGTAAGGCAAGCCAGACGCCAGTGTCAGCCCCTTCCTCTGCTGAAATATTGCCAGCGAACCCTGTCATAGCAGTTTTCGCCCAACCAGGGCAGTAACAGTTCACATAAATCTTATGACCCTCGGGTCGGTCCCCCAATATTTTAGCCATTAACCTGGTGTATGCATTCACTGCGAGTTTTGATACAGAGTAGTCGGTTGATAACTGAGGCCACCCCCCAGTTTCCCAACTTCCATCTTCTACTTGTTCCAAAAATGTAGACACAATCCTATCAATTACTTCTTCTGTCAGAGTATCCAAATTGCCGAGTAGTTCTCTAAATGCTACATTTTCCACTCTCTGTAATGAAATACAGGAAAACATACTGAGCTAACCTTCTTCTTCCTCCTATTATTCACACAAAAACAGGAAGAAGGTGCACTTACATTCCGCCTGCCGTTCAGCTTGGCCAACCTTGAGCTCACATTCACGATGCGAGCACCAGCAGACGAAGGTTTCATCAAGGGGACCATTGCTTGAGTCATATTTTTGGTGCCATAATAGTTAGTTGCAATAACCATCTGGGCATACTCTACAGAATTACTAGACCCAAGATTGAAATTCACTCCAGCATTATTAATCTGCAAACATATTTGGACGTAGATGGTGCATTTGGAAGTTATGTCATCTAAATTCATATATATACATTTTTTTTTTTTTGTGTGTGTGAAGTCAATATTTAGCTGGTATTCTGCATGTTGCTAAATCTAAGCGCTCTTCTTTGTGGGCAATTAAGAAACAAAAACGAATGTAGAGAAAAGTAGAGTGTAAGATTAACTTGCCAGAATATCCAAACCACCATAGTTTTGTAGCAGCCAATCAGCAAACTGTTTGATGGATAAAGCATCCAAGACATCCAGCTGATGAAAGGCAACATTGAGGCCACCTTCCTGTAAGACCTTAGCTGCTTCAAGGCCAACACAAACATCTCTTGAAGTCAAGATGACGGTCATTCCATGCATTGCAAATTGTCTTGAAACCTCAAATCCAATTCCTCTATTGCCACCAGTAACCACGGCAATGGTTTCTGTAGACCACCACCTACAAAGTAAATGTGGGTGTTGGTTATTGAGCAGAAAACAGAATGGTGAGAAGGAAAAGAAAGAGACCTCTGGTGATCGGAGTAAGGAATGGTCCTGGTGAGGGATATCTCTTGCATTCTCTTTTCCCTTAGCTCATTGCCTCTTTCCTTCCTGCCCATCTCGGTTCGGCTGTTTCTGATTCTGAAATCGCCCAAACAAAACAAAACAAAACAAAACAAAACAAAGAATCCAGGGTTAGTTGGTTTATGGTTAGAGGCGGGAGAAGGAGGAGACTTAA

mRNA sequence

ATGACGTCGCTGTATCGGTGCCGGAGGGGTAAGCTGAAGATGTTGCAGAAGCAGAGATACTTGGACCGCCAAAGGATTACTTCCGCCCTTCTTGCTACCTCTGCCTATGCTTTACCAGGCCCCCTTCGAATTCCAAAGGTTCCACTTTCTTCACTCCTTCCTTTCACCTCTTCTCCCATACATGTACACTATACCTCTTGCGCTTCTCTTCCCAATCCTCTTCCACTCCTCAATTTCATGTCAATTCGCTGCATTTGTTTTCTGCCTCCGTCTCCTCCCTCTTCCATTCCCTGGAATTCTAATGAACTTGAGGACGCAAATTCTACTTCTAAGTTCACCGTAAAAGCAGGTTATCAAACCCTTTATCGTGTTCTTGAAGCCTGCAGACTCTCCCCCTCGAATTCCAAAACCGCTTCTGAAACGCATGCAAGAATGATTAAATTTGGATATGGAAACTACCCAACTCTCGTCACCTCTTTAGTATCAGCTTATCAACGTGCCGATTGCCTTAATCGGGTCCATCAACTTCTTAATCTACTCTGCTCTAAACATCTTGATTTAGTTGCAATGAACTTATTTATTGACAATTTTATGAAAATAGGGGAATGCAAACTTGCTAAAAGGGTGTTTGATAAAATGCCTTACCGTGATGTGGTAACGTGGAACTCAATCATCGGAGGTTGTGTGAAGAATGCACGGTATGAGGAGGCATTTAAATTCTTTAGGCAGATGCTGAACTCAAATATCCAGCCGGATGGGTTTACGTTTGCTTCCGTGTTGAATGCATGTGCGCAGCTCGGAGCTCCAAGTAACACTCAGTGGGTTCATGCTCTGATGACTCAGAAAAAAATTGAGCTTAATTCTATATTGAGTTGTGCACTCATAGACGCCTACTCAAAGTGTGGCAGCATCCAAATTGCAAAGGAAATCTTTAGCAGCGTCCCTCGCAGTAACATCTCAAGTTGCGGAAAAAGCTGGATTGAGTTGGGAGGTTCCATTCAAAGCTTCAAGTCAGGTGACCGATCGCATCCAGAAAGCGATGCAGTGTACAAGGTGGTGTGGCGTTTGATGAAGAGAAGTCGGTCAGAGGGATATATGCCTGTGACAGACTTGGTTCTCATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTATCATATCACAGCGAAAAGTTGGCATTGGCGTATGCGATCCTGAAAACTAGTCCAGGGGCAAAAATCAGTATATCAAAGAACCTACGGATGTGTGATGATTGTCATAGATGGATAAAACTAGTTTCAACACTGCTGTGCAGAGTTGTAGTAGTGAGAGATCGGATCCGGTTTCATCAATTCGAAGGTGGCATGTGTTCCTGTGGTGATCGTTGGGTCGCATTGAAAGACAAGTTATATATTTTAACTTTTGGACAACGAAACCATGTTTGGTTGAATGAGGGATCAATTACAATTCCTCCTTATGCTCCTCCTCAGGAAATGCAGGCCCACCTCATGATTGGAGGCGTCCCTGTTATTTGTATAGATGAGAAAGATTGTTTGCATGAAGAAGGTGCACTTACATTCCGCCTGCCGTTCAGCTTGGCCAACCTTGAGCTCACATTCACGATGCGAGCACCAGCAGACGAAGAAAACAGAATGGTGAGAAGGAAAAGAAAGAGACCTCTGGTGATCGGACTCATTGCCTCTTTCCTTCCTGCCCATCTCGGTTCGGCTGTTTCTGATTCTGAAATCGCCCAAACAAAACAAAACAAAACAAAACAAAACAAAGAATCCAGGAGGCGGGAGAAGGAGGAGACTTAA

Coding sequence (CDS)

ATGACGTCGCTGTATCGGTGCCGGAGGGGTAAGCTGAAGATGTTGCAGAAGCAGAGATACTTGGACCGCCAAAGGATTACTTCCGCCCTTCTTGCTACCTCTGCCTATGCTTTACCAGGCCCCCTTCGAATTCCAAAGGTTCCACTTTCTTCACTCCTTCCTTTCACCTCTTCTCCCATACATGTACACTATACCTCTTGCGCTTCTCTTCCCAATCCTCTTCCACTCCTCAATTTCATGTCAATTCGCTGCATTTGTTTTCTGCCTCCGTCTCCTCCCTCTTCCATTCCCTGGAATTCTAATGAACTTGAGGACGCAAATTCTACTTCTAAGTTCACCGTAAAAGCAGGTTATCAAACCCTTTATCGTGTTCTTGAAGCCTGCAGACTCTCCCCCTCGAATTCCAAAACCGCTTCTGAAACGCATGCAAGAATGATTAAATTTGGATATGGAAACTACCCAACTCTCGTCACCTCTTTAGTATCAGCTTATCAACGTGCCGATTGCCTTAATCGGGTCCATCAACTTCTTAATCTACTCTGCTCTAAACATCTTGATTTAGTTGCAATGAACTTATTTATTGACAATTTTATGAAAATAGGGGAATGCAAACTTGCTAAAAGGGTGTTTGATAAAATGCCTTACCGTGATGTGGTAACGTGGAACTCAATCATCGGAGGTTGTGTGAAGAATGCACGGTATGAGGAGGCATTTAAATTCTTTAGGCAGATGCTGAACTCAAATATCCAGCCGGATGGGTTTACGTTTGCTTCCGTGTTGAATGCATGTGCGCAGCTCGGAGCTCCAAGTAACACTCAGTGGGTTCATGCTCTGATGACTCAGAAAAAAATTGAGCTTAATTCTATATTGAGTTGTGCACTCATAGACGCCTACTCAAAGTGTGGCAGCATCCAAATTGCAAAGGAAATCTTTAGCAGCGTCCCTCGCAGTAACATCTCAAGTTGCGGAAAAAGCTGGATTGAGTTGGGAGGTTCCATTCAAAGCTTCAAGTCAGGTGACCGATCGCATCCAGAAAGCGATGCAGTGTACAAGGTGGTGTGGCGTTTGATGAAGAGAAGTCGGTCAGAGGGATATATGCCTGTGACAGACTTGGTTCTCATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTATCATATCACAGCGAAAAGTTGGCATTGGCGTATGCGATCCTGAAAACTAGTCCAGGGGCAAAAATCAGTATATCAAAGAACCTACGGATGTGTGATGATTGTCATAGATGGATAAAACTAGTTTCAACACTGCTGTGCAGAGTTGTAGTAGTGAGAGATCGGATCCGGTTTCATCAATTCGAAGGTGGCATGTGTTCCTGTGGTGATCGTTGGGTCGCATTGAAAGACAAGTTATATATTTTAACTTTTGGACAACGAAACCATGTTTGGTTGAATGAGGGATCAATTACAATTCCTCCTTATGCTCCTCCTCAGGAAATGCAGGCCCACCTCATGATTGGAGGCGTCCCTGTTATTTGTATAGATGAGAAAGATTGTTTGCATGAAGAAGGTGCACTTACATTCCGCCTGCCGTTCAGCTTGGCCAACCTTGAGCTCACATTCACGATGCGAGCACCAGCAGACGAAGAAAACAGAATGGTGAGAAGGAAAAGAAAGAGACCTCTGGTGATCGGACTCATTGCCTCTTTCCTTCCTGCCCATCTCGGTTCGGCTGTTTCTGATTCTGAAATCGCCCAAACAAAACAAAACAAAACAAAACAAAACAAAGAATCCAGGAGGCGGGAGAAGGAGGAGACTTAA

Protein sequence

MTSLYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHVHYTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAGYQTLYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEIFSSVPRSNISSCGKSWIELGGSIQSFKSGDRSHPESDAVYKVVWRLMKRSRSEGYMPVTDLVLMDISEEEKEENLSYHSEKLALAYAILKTSPGAKISISKNLRMCDDCHRWIKLVSTLLCRVVVVRDRIRFHQFEGGMCSCGDRWVALKDKLYILTFGQRNHVWLNEGSITIPPYAPPQEMQAHLMIGGVPVICIDEKDCLHEEGALTFRLPFSLANLELTFTMRAPADEENRMVRRKRKRPLVIGLIASFLPAHLGSAVSDSEIAQTKQNKTKQNKESRRREKEET
Homology
BLAST of Cp4.1LG18g09360 vs. ExPASy Swiss-Prot
Match: Q9FI49 (Pentatricopeptide repeat-containing protein At5g50990 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H59 PE=2 SV=2)

HSP 1 Score: 302.8 bits (774), Expect = 8.9e-81
Identity = 184/516 (35.66%), Postives = 255/516 (49.42%), Query Frame = 0

Query: 108 STSKFTVKAGYQTLYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRA 167
           S+S  +    +  L +VLE+C+ +PSNSK   + HA++ K GYG YP+L+ S V+AY+R 
Sbjct: 20  SSSSASNLTDHGMLKQVLESCK-APSNSKCVLQAHAQIFKLGYGTYPSLLVSTVAAYRRC 79

Query: 168 DCLNRVHQLLNLLCSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGG 227
           +      +LL    S    +  +NL I++ MKIGE  LAK+V      ++V+TWN +IGG
Sbjct: 80  NRSYLARRLLLWFLSLSPGVCNINLIIESLMKIGESGLAKKVLRNASDQNVITWNLMIGG 139

Query: 228 CVKNARYEEAFKFFRQMLN-SNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIEL 287
            V+N +YEEA K  + ML+ ++I+P+ F+FAS L ACA+LG   + +WVH+LM    IEL
Sbjct: 140 YVRNVQYEEALKALKNMLSFTDIKPNKFSFASSLAACARLGDLHHAKWVHSLMIDSGIEL 199

Query: 288 NSILSCALIDAYSKCGSIQIAKEIFSSVPRSNIS-------------------------- 347
           N+ILS AL+D Y+KCG I  ++E+F SV R+++S                          
Sbjct: 200 NAILSSALVDVYAKCGDIGTSREVFYSVKRNDVSIWNAMITGFATHGLATEAIRVFSEME 259

Query: 348 ----------------SC------------------------------------------ 407
                           +C                                          
Sbjct: 260 AEHVSPDSITFLGLLTTCSHCGLLEEGKEYFGLMSRRFSIQPKLEHYGAMVDLLGRAGRV 319

Query: 408 ------------------------------------------------------------ 457
                                                                       
Sbjct: 320 KEAYELIESMPIEPDVVIWRSLLSSSRTYKNPELGEIAIQNLSKAKSGDYVLLSNIYSST 379

BLAST of Cp4.1LG18g09360 vs. ExPASy Swiss-Prot
Match: O23169 (Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H5 PE=3 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 1.3e-47
Identity = 109/336 (32.44%), Postives = 157/336 (46.73%), Query Frame = 0

Query: 189 AMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQMLNSN 248
           A +  +D + K G  + AK V D  P  D+V+W S+IGGC +N + +EA K+F  +L S 
Sbjct: 356 ASSSLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFDLLLKSG 415

Query: 249 IQPDGFTFASVLNACAQLG-APSNTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIA 308
            +PD  TF +VL+AC   G      ++ +++  + ++   S     L+D  ++ G  +  
Sbjct: 416 TKPDHVTFVNVLSACTHAGLVEKGLEFFYSITEKHRLSHTSDHYTCLVDLLARSGRFEQL 475

Query: 309 KEIFSSVP--------RSNISSC------------------------------------- 368
           K + S +P         S +  C                                     
Sbjct: 476 KSVISEMPMKPSKFLWASVLGGCSTYGNIDLAEEAAQELFKIEPENPVTYVTMANIYAAA 535

Query: 369 ----------------------GKSWIELGGSIQSFKSGDRSHPESDAVYKVVWRLMKRS 428
                                 G SW E+      F + D SHP  + + + +  L K+ 
Sbjct: 536 GKWEEEGKMRKRMQEIGVTKRPGSSWTEIKRKRHVFIAADTSHPMYNQIVEFLRELRKKM 595

Query: 429 RSEGYMPVTDLVLMDISEEEKEENLSYHSEKLALAYAILKTSPGAKISISKNLRMCDDCH 457
           + EGY+P T LVL D+ +E+KEENL YHSEKLA+A+AIL T  G  I + KNLR C DCH
Sbjct: 596 KEEGYVPATSLVLHDVEDEQKEENLVYHSEKLAVAFAILSTEEGTAIKVFKNLRSCVDCH 655

BLAST of Cp4.1LG18g09360 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 3.8e-47
Identity = 132/508 (25.98%), Postives = 209/508 (41.14%), Query Frame = 0

Query: 120 TLYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNL 179
           T   +L+AC  + S  +  ++ HA++ K GY N    V SL+++Y         H L + 
Sbjct: 117 TFPSLLKACS-NLSAFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDR 176

Query: 180 LCSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFK 239
           +     D V+ N  I  ++K G+  +A  +F KM  ++ ++W ++I G V+    +EA +
Sbjct: 177 I--PEPDDVSWNSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQ 236

Query: 240 FFRQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYS 299
            F +M NS+++PD  + A+ L+ACAQLGA    +W+H+ + + +I ++S+L C LID Y+
Sbjct: 237 LFHEMQNSDVEPDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYA 296

Query: 300 KCGSIQIAKEIFSS---------------------------------------------- 359
           KCG ++ A E+F +                                              
Sbjct: 297 KCGEMEEALEVFKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTA 356

Query: 360 ------------------------------------------------------------ 419
                                                                       
Sbjct: 357 VLTACSYTGLVEEGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLK 416

Query: 420 ----------------------------------------VPRSNISSCGKSW------- 457
                                                   V ++NI +  K W       
Sbjct: 417 PNAVIWGALLKACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETR 476

BLAST of Cp4.1LG18g09360 vs. ExPASy Swiss-Prot
Match: Q683I9 (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 190.7 bits (483), Expect = 4.9e-47
Identity = 133/502 (26.49%), Postives = 209/502 (41.63%), Query Frame = 0

Query: 131 SPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSKHLDLVAM 190
           +P +      THA+++ FG    P + TSL++ Y     L    ++ +   SK  DL A 
Sbjct: 74  NPLHLPLGQRTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGSK--DLPAW 133

Query: 191 NLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQML----- 250
           N  ++ + K G    A+++FD+MP R+V++W+ +I G V   +Y+EA   FR+M      
Sbjct: 134 NSVVNAYAKAGLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPN 193

Query: 251 NSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQ 310
            + ++P+ FT ++VL+AC +LGA    +WVHA + +  +E++ +L  ALID Y+KCGS++
Sbjct: 194 EAFVRPNEFTMSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLE 253

Query: 311 IAKEIFSSV---------------------------------------PRS--------- 370
            AK +F+++                                       P S         
Sbjct: 254 RAKRVFNALGSKKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGA 313

Query: 371 ------------------------------------------------------------ 430
                                                                       
Sbjct: 314 CVHRGLINEGKSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVL 373

Query: 431 -------------NISSC------------------------------------------ 457
                        +I +C                                          
Sbjct: 374 IWGSLLSGSRMLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEME 433

BLAST of Cp4.1LG18g09360 vs. ExPASy Swiss-Prot
Match: Q9SZT8 (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ELI1 PE=3 SV=1)

HSP 1 Score: 182.6 bits (462), Expect = 1.3e-44
Identity = 124/498 (24.90%), Postives = 194/498 (38.96%), Query Frame = 0

Query: 130 LSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSKHLDLVA 189
           L   ++K+    H  ++KFG G  P + T LV  Y +   +    ++ + +  +   LV+
Sbjct: 137 LKSCSTKSGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPER--SLVS 196

Query: 190 MNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQML-NSN 249
               I  + K G  + A+ +FD M  RD+V+WN +I G  ++    +A   F+++L    
Sbjct: 197 STAMITCYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGK 256

Query: 250 IQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAK 309
            +PD  T  + L+AC+Q+GA    +W+H  +   +I LN  +   LID YSKCGS++ A 
Sbjct: 257 PKPDEITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAV 316

Query: 310 EIFSSVPRSNI------------------------------------------------- 369
            +F+  PR +I                                                 
Sbjct: 317 LVFNDTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHA 376

Query: 370 ------------------------------------------------------------ 429
                                                                       
Sbjct: 377 GLVNEGIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSS 436

Query: 430 --SSC------------------------------------------------------- 457
              SC                                                       
Sbjct: 437 VLGSCKLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGI 496

BLAST of Cp4.1LG18g09360 vs. NCBI nr
Match: XP_023516965.1 (pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 849 bits (2193), Expect = 2.41e-304
Identity = 454/620 (73.23%), Postives = 454/620 (73.23%), Query Frame = 0

Query: 3   SLYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHV 62
           SLYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHV
Sbjct: 10  SLYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHV 69

Query: 63  HYTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAGYQTLY 122
           HYTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAGYQTLY
Sbjct: 70  HYTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAGYQTLY 129

Query: 123 RVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCS 182
           RVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCS
Sbjct: 130 RVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCS 189

Query: 183 KHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFR 242
           KHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFR
Sbjct: 190 KHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFR 249

Query: 243 QMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCG 302
           QMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCG
Sbjct: 250 QMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCG 309

Query: 303 SIQIAKEIFSSVPRSNIS------------------------------------------ 362
           SIQIAKEIFSSVPRSNIS                                          
Sbjct: 310 SIQIAKEIFSSVPRSNISVWNAMIKGLAIHGLSMDALSVFWMMERENVLPDAVTFLGILT 369

Query: 363 ------------------------------------------------------------ 422
                                                                       
Sbjct: 370 ACNHGGLIEQGRRFFDWMKNRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIVAMPIEADV 429

Query: 423 ------------------------------------------------------------ 456
                                                                       
Sbjct: 430 VTWRALLSGCRIYRNQELAEVAIANMSHRGSGDYVLLSNTYCSLNRWEHAERVRERMKSN 489

BLAST of Cp4.1LG18g09360 vs. NCBI nr
Match: XP_023516963.1 (pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 844 bits (2180), Expect = 2.46e-302
Identity = 454/622 (72.99%), Postives = 454/622 (72.99%), Query Frame = 0

Query: 3   SLYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHV 62
           SLYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHV
Sbjct: 10  SLYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHV 69

Query: 63  HYTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAG--YQT 122
           HYTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAG  YQT
Sbjct: 70  HYTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAGNGYQT 129

Query: 123 LYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLL 182
           LYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLL
Sbjct: 130 LYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLL 189

Query: 183 CSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKF 242
           CSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKF
Sbjct: 190 CSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKF 249

Query: 243 FRQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSK 302
           FRQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSK
Sbjct: 250 FRQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSK 309

Query: 303 CGSIQIAKEIFSSVPRSNIS---------------------------------------- 362
           CGSIQIAKEIFSSVPRSNIS                                        
Sbjct: 310 CGSIQIAKEIFSSVPRSNISVWNAMIKGLAIHGLSMDALSVFWMMERENVLPDAVTFLGI 369

Query: 363 ------------------------------------------------------------ 422
                                                                       
Sbjct: 370 LTACNHGGLIEQGRRFFDWMKNRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIVAMPIEA 429

Query: 423 ------------------------------------------------------------ 456
                                                                       
Sbjct: 430 DVVTWRALLSGCRIYRNQELAEVAIANMSHRGSGDYVLLSNTYCSLNRWEHAERVRERMK 489

BLAST of Cp4.1LG18g09360 vs. NCBI nr
Match: XP_022922013.1 (pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucurbita moschata])

HSP 1 Score: 836 bits (2159), Expect = 3.56e-299
Identity = 447/619 (72.21%), Postives = 450/619 (72.70%), Query Frame = 0

Query: 4   LYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHVH 63
           LYRCRR KLKMLQKQRY +RQRITSALLATSAYALPGPLRIP+VPLSSLLPFTSSPIHVH
Sbjct: 11  LYRCRRRKLKMLQKQRYSERQRITSALLATSAYALPGPLRIPRVPLSSLLPFTSSPIHVH 70

Query: 64  YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAGYQTLYR 123
           YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTV+AGYQTLYR
Sbjct: 71  YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVEAGYQTLYR 130

Query: 124 VLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSK 183
           VLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSK
Sbjct: 131 VLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSK 190

Query: 184 HLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQ 243
           HLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQ
Sbjct: 191 HLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQ 250

Query: 244 MLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGS 303
           MLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGS
Sbjct: 251 MLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGS 310

Query: 304 IQIAKEIFSSVPRSNIS------------------------------------------- 363
           IQIAKEIFSSVPRSNIS                                           
Sbjct: 311 IQIAKEIFSSVPRSNISVWNAMIKGLAIHGLSMDALSVFWMMERENVLPDAVTFLGILTA 370

Query: 364 ------------------------------------------------------------ 423
                                                                       
Sbjct: 371 CNHGGLIEQGRRFFDWMKNRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIVAMPIEADVV 430

Query: 424 ------------------------------------------------------------ 456
                                                                       
Sbjct: 431 TWRALLSGCRIYRNQELAEVAIANMSHRGSGDYVLLSNIYCSLNRWEHAERVRERMKSNG 490

BLAST of Cp4.1LG18g09360 vs. NCBI nr
Match: KAG7023021.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 833 bits (2153), Expect = 2.90e-298
Identity = 446/619 (72.05%), Postives = 450/619 (72.70%), Query Frame = 0

Query: 4   LYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHVH 63
           LYRCRRGKLKMLQKQRY +RQRITSALLATSAYALPGPLRIP+VPLSSLLPFTSSPIHVH
Sbjct: 11  LYRCRRGKLKMLQKQRYSERQRITSALLATSAYALPGPLRIPRVPLSSLLPFTSSPIHVH 70

Query: 64  YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAGYQTLYR 123
           YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANS SKFTV+AGYQTLYR
Sbjct: 71  YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSISKFTVEAGYQTLYR 130

Query: 124 VLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSK 183
           VLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSK
Sbjct: 131 VLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSK 190

Query: 184 HLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQ 243
           HLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQ
Sbjct: 191 HLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQ 250

Query: 244 MLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGS 303
           MLNSNIQPDGFTFASVLNACAQLGAPS TQWVHALMTQKKIELNSILSCALIDAYSKCGS
Sbjct: 251 MLNSNIQPDGFTFASVLNACAQLGAPSITQWVHALMTQKKIELNSILSCALIDAYSKCGS 310

Query: 304 IQIAKEIFSSVPRSNIS------------------------------------------- 363
           IQIA+EIFSSVPRSNIS                                           
Sbjct: 311 IQIAREIFSSVPRSNISVWNAMIKGLAIHGLSMDALSVFWMMERENVLPDAVTFLGILTA 370

Query: 364 ------------------------------------------------------------ 423
                                                                       
Sbjct: 371 CNHGGLIEQGRRFFDWMKNRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIVAMPIEADVV 430

Query: 424 ------------------------------------------------------------ 456
                                                                       
Sbjct: 431 TWRALLSGCRIYRNQELAEVAIANMSHRGSGDYVLLSNIYCSLNRWEHAERVRERMKSNG 490

BLAST of Cp4.1LG18g09360 vs. NCBI nr
Match: XP_022922012.1 (pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucurbita moschata])

HSP 1 Score: 831 bits (2146), Expect = 3.62e-297
Identity = 447/621 (71.98%), Postives = 450/621 (72.46%), Query Frame = 0

Query: 4   LYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHVH 63
           LYRCRR KLKMLQKQRY +RQRITSALLATSAYALPGPLRIP+VPLSSLLPFTSSPIHVH
Sbjct: 11  LYRCRRRKLKMLQKQRYSERQRITSALLATSAYALPGPLRIPRVPLSSLLPFTSSPIHVH 70

Query: 64  YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAG--YQTL 123
           YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTV+AG  YQTL
Sbjct: 71  YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVEAGNGYQTL 130

Query: 124 YRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLC 183
           YRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLC
Sbjct: 131 YRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLC 190

Query: 184 SKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFF 243
           SKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFF
Sbjct: 191 SKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFF 250

Query: 244 RQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKC 303
           RQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKC
Sbjct: 251 RQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKC 310

Query: 304 GSIQIAKEIFSSVPRSNIS----------------------------------------- 363
           GSIQIAKEIFSSVPRSNIS                                         
Sbjct: 311 GSIQIAKEIFSSVPRSNISVWNAMIKGLAIHGLSMDALSVFWMMERENVLPDAVTFLGIL 370

Query: 364 ------------------------------------------------------------ 423
                                                                       
Sbjct: 371 TACNHGGLIEQGRRFFDWMKNRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIVAMPIEAD 430

Query: 424 ------------------------------------------------------------ 456
                                                                       
Sbjct: 431 VVTWRALLSGCRIYRNQELAEVAIANMSHRGSGDYVLLSNIYCSLNRWEHAERVRERMKS 490

BLAST of Cp4.1LG18g09360 vs. ExPASy TrEMBL
Match: A0A6J1E5D9 (pentatricopeptide repeat-containing protein At5g50990 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111430093 PE=3 SV=1)

HSP 1 Score: 836 bits (2159), Expect = 1.72e-299
Identity = 447/619 (72.21%), Postives = 450/619 (72.70%), Query Frame = 0

Query: 4   LYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHVH 63
           LYRCRR KLKMLQKQRY +RQRITSALLATSAYALPGPLRIP+VPLSSLLPFTSSPIHVH
Sbjct: 11  LYRCRRRKLKMLQKQRYSERQRITSALLATSAYALPGPLRIPRVPLSSLLPFTSSPIHVH 70

Query: 64  YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAGYQTLYR 123
           YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTV+AGYQTLYR
Sbjct: 71  YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVEAGYQTLYR 130

Query: 124 VLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSK 183
           VLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSK
Sbjct: 131 VLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSK 190

Query: 184 HLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQ 243
           HLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQ
Sbjct: 191 HLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQ 250

Query: 244 MLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGS 303
           MLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGS
Sbjct: 251 MLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGS 310

Query: 304 IQIAKEIFSSVPRSNIS------------------------------------------- 363
           IQIAKEIFSSVPRSNIS                                           
Sbjct: 311 IQIAKEIFSSVPRSNISVWNAMIKGLAIHGLSMDALSVFWMMERENVLPDAVTFLGILTA 370

Query: 364 ------------------------------------------------------------ 423
                                                                       
Sbjct: 371 CNHGGLIEQGRRFFDWMKNRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIVAMPIEADVV 430

Query: 424 ------------------------------------------------------------ 456
                                                                       
Sbjct: 431 TWRALLSGCRIYRNQELAEVAIANMSHRGSGDYVLLSNIYCSLNRWEHAERVRERMKSNG 490

BLAST of Cp4.1LG18g09360 vs. ExPASy TrEMBL
Match: A0A6J1E1Y0 (pentatricopeptide repeat-containing protein At5g50990 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111430093 PE=3 SV=1)

HSP 1 Score: 831 bits (2146), Expect = 1.75e-297
Identity = 447/621 (71.98%), Postives = 450/621 (72.46%), Query Frame = 0

Query: 4   LYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHVH 63
           LYRCRR KLKMLQKQRY +RQRITSALLATSAYALPGPLRIP+VPLSSLLPFTSSPIHVH
Sbjct: 11  LYRCRRRKLKMLQKQRYSERQRITSALLATSAYALPGPLRIPRVPLSSLLPFTSSPIHVH 70

Query: 64  YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAG--YQTL 123
           YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTV+AG  YQTL
Sbjct: 71  YTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVEAGNGYQTL 130

Query: 124 YRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLC 183
           YRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLC
Sbjct: 131 YRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLC 190

Query: 184 SKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFF 243
           SKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFF
Sbjct: 191 SKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFF 250

Query: 244 RQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKC 303
           RQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKC
Sbjct: 251 RQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKC 310

Query: 304 GSIQIAKEIFSSVPRSNIS----------------------------------------- 363
           GSIQIAKEIFSSVPRSNIS                                         
Sbjct: 311 GSIQIAKEIFSSVPRSNISVWNAMIKGLAIHGLSMDALSVFWMMERENVLPDAVTFLGIL 370

Query: 364 ------------------------------------------------------------ 423
                                                                       
Sbjct: 371 TACNHGGLIEQGRRFFDWMKNRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIVAMPIEAD 430

Query: 424 ------------------------------------------------------------ 456
                                                                       
Sbjct: 431 VVTWRALLSGCRIYRNQELAEVAIANMSHRGSGDYVLLSNIYCSLNRWEHAERVRERMKS 490

BLAST of Cp4.1LG18g09360 vs. ExPASy TrEMBL
Match: A0A6J1JL20 (pentatricopeptide repeat-containing protein At5g50990 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111485562 PE=3 SV=1)

HSP 1 Score: 816 bits (2108), Expect = 9.39e-292
Identity = 441/620 (71.13%), Postives = 446/620 (71.94%), Query Frame = 0

Query: 3   SLYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHV 62
           SLY  R GKLKM QKQRYLDRQRITSALLATSAYALPGPLRIP+VPLSSL PFTSSPIHV
Sbjct: 10  SLYWRRTGKLKMFQKQRYLDRQRITSALLATSAYALPGPLRIPRVPLSSL-PFTSSPIHV 69

Query: 63  HYTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAGYQTLY 122
           HYTSCASLPNPL LLNFMSIRCICFLPPSPPSSIPWNSNELE ANSTSKFTV+AGYQTLY
Sbjct: 70  HYTSCASLPNPLALLNFMSIRCICFLPPSPPSSIPWNSNELEHANSTSKFTVEAGYQTLY 129

Query: 123 RVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCS 182
           RVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCS
Sbjct: 130 RVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCS 189

Query: 183 KHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFR 242
           KHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFR
Sbjct: 190 KHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFR 249

Query: 243 QMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCG 302
           QML SNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKI+LNSILSCALIDAYSKCG
Sbjct: 250 QMLVSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIDLNSILSCALIDAYSKCG 309

Query: 303 SIQIAKEIFSSVPRSNIS------------------------------------------ 362
           SIQIAKEIFSSVPRSNIS                                          
Sbjct: 310 SIQIAKEIFSSVPRSNISVWNAMIKGLAIHGLSMDALSVFWMMERENVLPDAVTFLGILT 369

Query: 363 ------------------------------------------------------------ 422
                                                                       
Sbjct: 370 ACNHGGLIEQGRRFFDWMKNRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIVAMPIEADV 429

Query: 423 ------------------------------------------------------------ 456
                                                                       
Sbjct: 430 VTWRALLSGCRIYRNQELAEVAIANMSHRGSGDYVLLSNIYCSLNRWEHAERVRERMKSN 489

BLAST of Cp4.1LG18g09360 vs. ExPASy TrEMBL
Match: A0A6J1JLQ2 (pentatricopeptide repeat-containing protein At5g50990 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111485562 PE=3 SV=1)

HSP 1 Score: 811 bits (2095), Expect = 9.55e-290
Identity = 441/622 (70.90%), Postives = 446/622 (71.70%), Query Frame = 0

Query: 3   SLYRCRRGKLKMLQKQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHV 62
           SLY  R GKLKM QKQRYLDRQRITSALLATSAYALPGPLRIP+VPLSSL PFTSSPIHV
Sbjct: 10  SLYWRRTGKLKMFQKQRYLDRQRITSALLATSAYALPGPLRIPRVPLSSL-PFTSSPIHV 69

Query: 63  HYTSCASLPNPLPLLNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAG--YQT 122
           HYTSCASLPNPL LLNFMSIRCICFLPPSPPSSIPWNSNELE ANSTSKFTV+AG  YQT
Sbjct: 70  HYTSCASLPNPLALLNFMSIRCICFLPPSPPSSIPWNSNELEHANSTSKFTVEAGNGYQT 129

Query: 123 LYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLL 182
           LYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLL
Sbjct: 130 LYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLL 189

Query: 183 CSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKF 242
           CSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKF
Sbjct: 190 CSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKF 249

Query: 243 FRQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSK 302
           FRQML SNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKI+LNSILSCALIDAYSK
Sbjct: 250 FRQMLVSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIDLNSILSCALIDAYSK 309

Query: 303 CGSIQIAKEIFSSVPRSNIS---------------------------------------- 362
           CGSIQIAKEIFSSVPRSNIS                                        
Sbjct: 310 CGSIQIAKEIFSSVPRSNISVWNAMIKGLAIHGLSMDALSVFWMMERENVLPDAVTFLGI 369

Query: 363 ------------------------------------------------------------ 422
                                                                       
Sbjct: 370 LTACNHGGLIEQGRRFFDWMKNRYSIQPQLEHYGVMVDLYSRAGFLEEAYSIIVAMPIEA 429

Query: 423 ------------------------------------------------------------ 456
                                                                       
Sbjct: 430 DVVTWRALLSGCRIYRNQELAEVAIANMSHRGSGDYVLLSNIYCSLNRWEHAERVRERMK 489

BLAST of Cp4.1LG18g09360 vs. ExPASy TrEMBL
Match: A0A1S4DV22 (pentatricopeptide repeat-containing protein At5g50990 OS=Cucumis melo OX=3656 GN=LOC103487281 PE=3 SV=1)

HSP 1 Score: 526 bits (1354), Expect = 8.78e-179
Identity = 309/606 (50.99%), Postives = 335/606 (55.28%), Query Frame = 0

Query: 17  KQRYLDRQRITSALLATSAYALPGPLRIPKVPLSSLLPFTSSPIHVHYTSCASLPNPLPL 76
           KQRYLD +RIT ALLATSA ALP                                     
Sbjct: 3   KQRYLDCRRITCALLATSASALPA------------------------------------ 62

Query: 77  LNFMSIRCICFLPPSPPSSIPWNSNELEDANSTSKFTVKAGYQTLYRVLEACRLSPSNSK 136
                                          + S FT    YQTL+RVLEACRL P NSK
Sbjct: 63  -------------------------------APSNFT---DYQTLHRVLEACRLFPMNSK 122

Query: 137 TASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSKHLDLVAMNLFIDN 196
           T  ETHAR+IKFGYGNYPTL+ SLVS YQ   CLNRVH+LL++LCSKHLDLVAMNL I N
Sbjct: 123 TVIETHARIIKFGYGNYPTLIASLVSTYQYVGCLNRVHRLLDILCSKHLDLVAMNLLIGN 182

Query: 197 FMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQMLNSNIQPDGFTF 256
           FMKIGECK AK+VF KMP+RDVVTWNSIIGGCVKNARY+EAF+FFRQML SNIQPDGFTF
Sbjct: 183 FMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTF 242

Query: 257 ASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEIFSSVPR 316
           AS+LNACAQLGAPSNT WV A MTQKKIELNS+LSCALIDAYSKCGSIQIAKEIFS+VP 
Sbjct: 243 ASLLNACAQLGAPSNTHWVRAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPH 302

Query: 317 S----------------------------------------------------------- 376
           S                                                           
Sbjct: 303 SDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRY 362

Query: 377 ------------------------------------------------------------ 436
                                                                       
Sbjct: 363 FELMRSRYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCKIYK 422

Query: 437 ----------NISSC-------------------------------------GKSWIELG 456
                     N+S C                                     GKSWIELG
Sbjct: 423 NHKLAEVAIANMSHCKSGDYVLLSNIYCSLNRWEEAETVRKMMKINRVRKKRGKSWIELG 482

BLAST of Cp4.1LG18g09360 vs. TAIR 10
Match: AT5G50990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 302.8 bits (774), Expect = 6.3e-82
Identity = 184/516 (35.66%), Postives = 255/516 (49.42%), Query Frame = 0

Query: 108 STSKFTVKAGYQTLYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRA 167
           S+S  +    +  L +VLE+C+ +PSNSK   + HA++ K GYG YP+L+ S V+AY+R 
Sbjct: 20  SSSSASNLTDHGMLKQVLESCK-APSNSKCVLQAHAQIFKLGYGTYPSLLVSTVAAYRRC 79

Query: 168 DCLNRVHQLLNLLCSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGG 227
           +      +LL    S    +  +NL I++ MKIGE  LAK+V      ++V+TWN +IGG
Sbjct: 80  NRSYLARRLLLWFLSLSPGVCNINLIIESLMKIGESGLAKKVLRNASDQNVITWNLMIGG 139

Query: 228 CVKNARYEEAFKFFRQMLN-SNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIEL 287
            V+N +YEEA K  + ML+ ++I+P+ F+FAS L ACA+LG   + +WVH+LM    IEL
Sbjct: 140 YVRNVQYEEALKALKNMLSFTDIKPNKFSFASSLAACARLGDLHHAKWVHSLMIDSGIEL 199

Query: 288 NSILSCALIDAYSKCGSIQIAKEIFSSVPRSNIS-------------------------- 347
           N+ILS AL+D Y+KCG I  ++E+F SV R+++S                          
Sbjct: 200 NAILSSALVDVYAKCGDIGTSREVFYSVKRNDVSIWNAMITGFATHGLATEAIRVFSEME 259

Query: 348 ----------------SC------------------------------------------ 407
                           +C                                          
Sbjct: 260 AEHVSPDSITFLGLLTTCSHCGLLEEGKEYFGLMSRRFSIQPKLEHYGAMVDLLGRAGRV 319

Query: 408 ------------------------------------------------------------ 457
                                                                       
Sbjct: 320 KEAYELIESMPIEPDVVIWRSLLSSSRTYKNPELGEIAIQNLSKAKSGDYVLLSNIYSST 379

BLAST of Cp4.1LG18g09360 vs. TAIR 10
Match: AT4G37170.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 192.6 bits (488), Expect = 9.2e-49
Identity = 109/336 (32.44%), Postives = 157/336 (46.73%), Query Frame = 0

Query: 189 AMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQMLNSN 248
           A +  +D + K G  + AK V D  P  D+V+W S+IGGC +N + +EA K+F  +L S 
Sbjct: 356 ASSSLVDMYTKCGNIESAKHVVDGCPKPDLVSWTSLIGGCAQNGQPDEALKYFDLLLKSG 415

Query: 249 IQPDGFTFASVLNACAQLG-APSNTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIA 308
            +PD  TF +VL+AC   G      ++ +++  + ++   S     L+D  ++ G  +  
Sbjct: 416 TKPDHVTFVNVLSACTHAGLVEKGLEFFYSITEKHRLSHTSDHYTCLVDLLARSGRFEQL 475

Query: 309 KEIFSSVP--------RSNISSC------------------------------------- 368
           K + S +P         S +  C                                     
Sbjct: 476 KSVISEMPMKPSKFLWASVLGGCSTYGNIDLAEEAAQELFKIEPENPVTYVTMANIYAAA 535

Query: 369 ----------------------GKSWIELGGSIQSFKSGDRSHPESDAVYKVVWRLMKRS 428
                                 G SW E+      F + D SHP  + + + +  L K+ 
Sbjct: 536 GKWEEEGKMRKRMQEIGVTKRPGSSWTEIKRKRHVFIAADTSHPMYNQIVEFLRELRKKM 595

Query: 429 RSEGYMPVTDLVLMDISEEEKEENLSYHSEKLALAYAILKTSPGAKISISKNLRMCDDCH 457
           + EGY+P T LVL D+ +E+KEENL YHSEKLA+A+AIL T  G  I + KNLR C DCH
Sbjct: 596 KEEGYVPATSLVLHDVEDEQKEENLVYHSEKLAVAFAILSTEEGTAIKVFKNLRSCVDCH 655

BLAST of Cp4.1LG18g09360 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 191.0 bits (484), Expect = 2.7e-48
Identity = 132/508 (25.98%), Postives = 209/508 (41.14%), Query Frame = 0

Query: 120 TLYRVLEACRLSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNL 179
           T   +L+AC  + S  +  ++ HA++ K GY N    V SL+++Y         H L + 
Sbjct: 117 TFPSLLKACS-NLSAFEETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDR 176

Query: 180 LCSKHLDLVAMNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFK 239
           +     D V+ N  I  ++K G+  +A  +F KM  ++ ++W ++I G V+    +EA +
Sbjct: 177 I--PEPDDVSWNSVIKGYVKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQ 236

Query: 240 FFRQMLNSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYS 299
            F +M NS+++PD  + A+ L+ACAQLGA    +W+H+ + + +I ++S+L C LID Y+
Sbjct: 237 LFHEMQNSDVEPDNVSLANALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYA 296

Query: 300 KCGSIQIAKEIFSS---------------------------------------------- 359
           KCG ++ A E+F +                                              
Sbjct: 297 KCGEMEEALEVFKNIKKKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTA 356

Query: 360 ------------------------------------------------------------ 419
                                                                       
Sbjct: 357 VLTACSYTGLVEEGKLIFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLK 416

Query: 420 ----------------------------------------VPRSNISSCGKSW------- 457
                                                   V ++NI +  K W       
Sbjct: 417 PNAVIWGALLKACRIHKNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETR 476

BLAST of Cp4.1LG18g09360 vs. TAIR 10
Match: AT3G62890.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 190.7 bits (483), Expect = 3.5e-48
Identity = 133/502 (26.49%), Postives = 209/502 (41.63%), Query Frame = 0

Query: 131 SPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSKHLDLVAM 190
           +P +      THA+++ FG    P + TSL++ Y     L    ++ +   SK  DL A 
Sbjct: 74  NPLHLPLGQRTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGSK--DLPAW 133

Query: 191 NLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQML----- 250
           N  ++ + K G    A+++FD+MP R+V++W+ +I G V   +Y+EA   FR+M      
Sbjct: 134 NSVVNAYAKAGLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPN 193

Query: 251 NSNIQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQ 310
            + ++P+ FT ++VL+AC +LGA    +WVHA + +  +E++ +L  ALID Y+KCGS++
Sbjct: 194 EAFVRPNEFTMSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLE 253

Query: 311 IAKEIFSSV---------------------------------------PRS--------- 370
            AK +F+++                                       P S         
Sbjct: 254 RAKRVFNALGSKKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGA 313

Query: 371 ------------------------------------------------------------ 430
                                                                       
Sbjct: 314 CVHRGLINEGKSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVL 373

Query: 431 -------------NISSC------------------------------------------ 457
                        +I +C                                          
Sbjct: 374 IWGSLLSGSRMLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEME 433

BLAST of Cp4.1LG18g09360 vs. TAIR 10
Match: AT4G37380.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 182.6 bits (462), Expect = 9.5e-46
Identity = 124/498 (24.90%), Postives = 194/498 (38.96%), Query Frame = 0

Query: 130 LSPSNSKTASETHARMIKFGYGNYPTLVTSLVSAYQRADCLNRVHQLLNLLCSKHLDLVA 189
           L   ++K+    H  ++KFG G  P + T LV  Y +   +    ++ + +  +   LV+
Sbjct: 137 LKSCSTKSGKLIHTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPER--SLVS 196

Query: 190 MNLFIDNFMKIGECKLAKRVFDKMPYRDVVTWNSIIGGCVKNARYEEAFKFFRQML-NSN 249
               I  + K G  + A+ +FD M  RD+V+WN +I G  ++    +A   F+++L    
Sbjct: 197 STAMITCYAKQGNVEAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGK 256

Query: 250 IQPDGFTFASVLNACAQLGAPSNTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAK 309
            +PD  T  + L+AC+Q+GA    +W+H  +   +I LN  +   LID YSKCGS++ A 
Sbjct: 257 PKPDEITVVAALSACSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAV 316

Query: 310 EIFSSVPRSNI------------------------------------------------- 369
            +F+  PR +I                                                 
Sbjct: 317 LVFNDTPRKDIVAWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHA 376

Query: 370 ------------------------------------------------------------ 429
                                                                       
Sbjct: 377 GLVNEGIRIFESMGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSS 436

Query: 430 --SSC------------------------------------------------------- 457
              SC                                                       
Sbjct: 437 VLGSCKLHGDFVLGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGI 496

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FI498.9e-8135.66Pentatricopeptide repeat-containing protein At5g50990 OS=Arabidopsis thaliana OX... [more]
O231691.3e-4732.44Pentatricopeptide repeat-containing protein At4g37170 OS=Arabidopsis thaliana OX... [more]
Q9FJY73.8e-4725.98Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Q683I94.9e-4726.49Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX... [more]
Q9SZT81.3e-4424.90Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t... [more]
Match NameE-valueIdentityDescription
XP_023516965.12.41e-30473.23pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucurbita pepo... [more]
XP_023516963.12.46e-30272.99pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucurbita pepo... [more]
XP_022922013.13.56e-29972.21pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucurbita mosc... [more]
KAG7023021.12.90e-29872.05Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022922012.13.62e-29771.98pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucurbita mosc... [more]
Match NameE-valueIdentityDescription
A0A6J1E5D91.72e-29972.21pentatricopeptide repeat-containing protein At5g50990 isoform X2 OS=Cucurbita mo... [more]
A0A6J1E1Y01.75e-29771.98pentatricopeptide repeat-containing protein At5g50990 isoform X1 OS=Cucurbita mo... [more]
A0A6J1JL209.39e-29271.13pentatricopeptide repeat-containing protein At5g50990 isoform X2 OS=Cucurbita ma... [more]
A0A6J1JLQ29.55e-29070.90pentatricopeptide repeat-containing protein At5g50990 isoform X1 OS=Cucurbita ma... [more]
A0A1S4DV228.78e-17950.99pentatricopeptide repeat-containing protein At5g50990 OS=Cucumis melo OX=3656 GN... [more]
Match NameE-valueIdentityDescription
AT5G50990.16.3e-8235.66Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G37170.19.2e-4932.44Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G66520.12.7e-4825.98Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G62890.13.5e-4826.49Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G37380.19.5e-4624.90Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 102..216
e-value: 8.6E-7
score: 30.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 217..365
e-value: 1.3E-23
score: 85.9
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 219..252
e-value: 1.9E-9
score: 35.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 293..318
e-value: 0.2
score: 12.0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 216..264
e-value: 3.9E-15
score: 55.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 217..251
score: 13.449573
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 323..446
e-value: 2.8E-33
score: 114.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 573..598
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 582..598
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 111..319
NoneNo IPR availablePANTHERPTHR47926:SF208PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 111..319
NoneNo IPR availablePANTHERPTHR47926:SF208PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 320..439
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 320..439

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG18g09360.1Cp4.1LG18g09360.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding