Cp4.1LG16g08940 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG16g08940
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Pentatricopeptide repeat-containing protein
Location: Cp4.1LG16 : 8322056 .. 8325296 (-)
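The location string above packs the scaffold name, start, end, and strand into one field. A minimal sketch of how it might be parsed is shown here; the `Location` type and `parse_location` helper are hypothetical names introduced for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed feature location."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings of the form "Cp4.1LG16 : 8322056 .. 8325296 (-)".
_LOCATION_RE = re.compile(
    r"^\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$"
)


def parse_location(text: str) -> Location:
    """Split a location string into scaffold, start, end, and strand."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cp4.1LG16 : 8322056 .. 8325296 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the feature spans 3241 bases on the minus strand of Cp4.1LG16.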
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGATTTCAGCCACCCTAAGCATTCACGGACGCTCTCCGACTCCTAAACAAGCCATCAATGTCTCAAAGGACTGGAACTTGATTATAAAGCACCAAACCAAGCTTAAGAATGACCATGCCATTCTTTCTACATATACCCAGATGGAGTCTCTTGGTATTGCACCCGATTCTGCTACAATGCCTCTTGTTCTAAAGGCTTGCGGGAGGCTCAACGCCATTGAAAAAGGGGTACGAATTCATTCTTGTATTAGGGATTCGGATTTGATCAGAGATGTTCGGGTTGGGACTGCCTTGGTCGATTTCTATAGTAAATGTGGGCTTGTTGGAGAGGCCAGTAAAGTGTTCGATGAAATGCCTGAAAGAGATTTGGTTTCGTGGAATGCATTGATTTCGGGATATGTGGGCTGTTCGTGCTATAAAGAAGCAGTGTTGTTGTTTATGGAGATGCAAAAGGCAGGCCTCACACCCAATTCTCGTACTGTAGTGCCTCTGCTTTTGGCTTGTGCTGAGATGTTGGAACTGCGATTAGGACATGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTTGATATGGATGCACATGTTGGTACTGCTTTGATAGGATTTTATATGAGATTTGATGCAGCAGTTTCTCACCGAGTTTTTAGCTTGATGGAGGTGAGAAATGTAGTGAGTTGGAATGCAATGATAACCGGATATCTCAATATTGGAGATTACACAAAAGCTTTGAAGCTTTTTAGTAGTATGCTGACTGAGGGTATAAAGTTTGATGCTGTTACAATGCTGCTGGTAATTCAAGCCTGTGCAGAATCTGAGTCTCTCCAATTAGGCATGCAACTGCATCAGTTGGCTATCAAGTTCAATTTCGTTGATGACTTGTTCGTATTAAATGCACTGTTGAATATGTATAGTGATAATGGACGTCTGGAGTCATCATGTGCGTTGTTTAATGCCGTTCCCACCTCTGATGCCGCCTTATGGAATTCTATGATATCTGCATACATTGCCTTCGGATTTCATGCTGAAGCTATAGCTTTGTATATTAAAATGCGTTTGGAAGGCTTAAAAGAAGACAAAAGAACCGTTGCGATTATGTTGTCTTTATGCGAAGATCTAAACGATGGTTCTATTTGGGGTAGAGGCTTACATGCTCATGCCATGAAAAGTGGAATGGAACTAGATGTATTTCTGGGCAATGCATTGTTAAGCATGTATGTTGAGCACAATCAAATTGATGCTGCACAGAAACTTTTTGATAAGATGAGAGGTTTGGACGTCATCTCCTGGAACACAATGATATTAGCACTTGCTCAGAGTAAGTTTCGAGCCAAAGCATTTCAACTCTTTATGACGATGTGTGAATCAGAAATCAAGTTCAATTCATACACAATGATATCTCTCCTTGCATTATGTAAAGATGGAAGTGATTTGGTGTTTGGGCGATCGATCCATGGTTTTGCAATAAAAAATGGTCTTGAAATAAATACTTCTTTGAACACTTCACTGACTGAAATGTACATAAACTGTAGTGATGAAGGATCGGCTACAAATCTGTTTATTAGATGTCCTCAAAGAGATTTAATTTCATGGAATTCCCTAATTTCGAGCTATATAAAGAATGACAATGCAGGAAAAGCTCTATTACTTTTTAACCATATGATTTCTGAGCTGGAGCCTAACTCCGTGACAATCATAAGTATTCTCACATCTTGTACCCAGCTTGCCCATCTACCACTAGGACAGTGCTTGCATGCTTACACAACTAGAAGGGGAGAATCTTTTGAATTGGATGCTTCTCTAGCAAATGCTTTTATAACTATGTATGCACGATGTGGTAAAATGCAATATGCAGAAAAGATTTTTAACACCCTGCAGGCAAGAAATATTGTCTCATGGAATGCCATGATAACAGGGTATGGCATGCACGGTCGTGGACACGATGCTACTCTAGCCTTTGCACAGATGTTGGATGATGGTTTCAAGCCAAACAATAT
ATCTTTTGTATCTGTTTTATCTGCCTGCAGCCATTCTGGTCTGACTAAGACTGGTTTGCAGCTTTTTAGTTCCATGGTGCGGGACTTTGGTATTGCTCCTCAACTTGCTCACTATGGTTGTATAGTTGATCTGCTTGGTCGTGGGGGCCATTTTGCTGAAGCTATAGCTCTCATCAGCTCAATGCCCGTTGAGCCTGATGCATCAATTTGGAGAGCTTTGCTCAGTTCATGTCAGGTTAAAAGCAATAAAAAACTAGTCGAAACCATCTTTAGAAAGCTTGTTGAATTAGAACCAAGCAATCCAGGGAATTTTGTTTTGCTTTCAAATGTCTACGCAGCAGCAGGTCTTTGGTCAGAGGTTTCACAGATAAGAAAGTGGGTTAGAGATAAAGGTCTAGTGAAGCCTCCAGGAACTAGCTGGATTGTAATCGGAAGTCAGGTCCACTATTTCACTGCAACTGACGTATCACACCCTCAATCAGAAGAAATTTACGAAAATTTGAATTCTTTGACATCATTGATCCAAGATATGGGCTGAAGTTTAGGATTTTTCTGTCATCCACCCAGACTTGATTTTATGTGCGTGCATATATGTAGAAGAGGCTGTAAAGAAAATTCATATGTCGAGGTTCACGCCTTACATTATGCTCATGGGTTGCTCTGCTAATCCTAAAAGGAAGATATCTACTCGGCTAGCTCCAGGAGAGTATTCCCATGTAACAGAGCCATTACTATCCAAATATCCTTTTAACTGTTAGAACTTGTAAGCACATTTACTTTTATCTTACAAAGATTCTGTTTTTTCAGGAATCCAGTCAACTGAAATTAGTTTCCCTTTGTGAGAGAATTGTTGAGTCGCAGATCGATAGTCTCTGATCTTTCCCGAGATGTCTTACAAGAGCACCTGGTAGTGCATAAGGGCTGTACCCTGTGGACTGTTCTTGCTGCATTCTTCGCTGCAATCTCCAACTGTCTGATTTAGTCACTTGCCTGGTTCCACCAACAACAAGATCTAGAGTCCTCCAAGAACCGAACCGGTATGTATACTTTGTTTTCGTTGTCTCATTATCCTACCTTATTTATGGGCGTGTTGGTTTATCGTGTATAAACTAGCCATTCTACTTGCTGGTACAAATTATGATTTGATATATGACCGTCCATGTGAAATATGTATTCCTTAAGTTATTTCACTGAAAGAGTTCTAAATCAACAACCATGAACCAACACCCTTTACTAATGAT

mRNA sequence

ATGGAGATTTCAGCCACCCTAAGCATTCACGGACGCTCTCCGACTCCTAAACAAGCCATCAATGTCTCAAAGGACTGGAACTTGATTATAAAGCACCAAACCAAGCTTAAGAATGACCATGCCATTCTTTCTACATATACCCAGATGGAGTCTCTTGGTATTGCACCCGATTCTGCTACAATGCCTCTTGTTCTAAAGGCTTGCGGGAGGCTCAACGCCATTGAAAAAGGGGCAGGCCTCACACCCAATTCTCGTACTGTAGTGCCTCTGCTTTTGGCTTGTGCTGAGATGTTGGAACTGCGATTAGGACATGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTTGATATGGATGCACATGTTGGTACTGCTTTGATAGGATTTTATATGAGATTTGATGCAGCAGTTTCTCACCGAGTTTTTAGCTTGATGGAGGTGAGAAATGTAGTGAGTTGGAATGCAATGATAACCGGATATCTCAATATTGGAGATTACACAAAAGCTTTGAAGCTTTTTAGTAGTATGCTGACTGAGGGTATAAAGTTTGATGCTGTTACAATGCTGCTGGTAATTCAAGCCTGTGCAGAATCTGAGTCTCTCCAATTAGGCATGCAACTGCATCAGTTGGCTATCAAGTTCAATTTCGTTGATGACTTGTTCGTATTAAATGCACTGTTGAATATGTATAGTGATAATGGACGTCTGGAGTCATCATGTGCGTTGTTTAATGCCGTTCCCACCTCTGATGCCGCCTTATGGAATTCTATGATATCTGCATACATTGCCTTCGGATTTCATGCTGAAGCTATAGCTTTGTATATTAAAATGCGTTTGGAAGGCTTAAAAGAAGACAAAAGAACCGTTGCGATTATGTTGTCTTTATGCGAAGATCTAAACGATGGTTCTATTTGGGGTAGAGGCTTACATGCTCATGCCATGAAAAGTGGAATGGAACTAGATGTATTTCTGGGCAATGCATTGTTAAGCATGTATGTTGAGCACAATCAAATTGATGCTGCACAGAAACTTTTTGATAAGATGAGAGGTTTGGACGTCATCTCCTGGAACACAATGATATTAGCACTTGCTCAGAGTAAGTTTCGAGCCAAAGCATTTCAACTCTTTATGACGATGTGTGAATCAGAAATCAAGTTCAATTCATACACAATGATATCTCTCCTTGCATTATGTAAAGATGGAAGTGATTTGGTGTTTGGGCGATCGATCCATGGTTTTGCAATAAAAAATGGTCTTGAAATAAATACTTCTTTGAACACTTCACTGACTGAAATGTACATAAACTGTAGTGATGAAGGATCGGCTACAAATCTGTTTATTAGATGTCCTCAAAGAGATTTAATTTCATGGAATTCCCTAATTTCGAGCTATATAAAGAATGACAATGCAGGAAAAGCTCTATTACTTTTTAACCATATGATTTCTGAGCTGGAGCCTAACTCCGTGACAATCATAAAAGAGGCTGTAAAGAAAATTCATATGTCGAGGTTCACGCCTTACATTATGCTCATGGGTTGCTCTGCTAATCCTAAAAGGAAGATATCTACTCGGCTAGCTCCAGGAGAGTATTCCCATGAATCCAGTCAACTGAAATTAGTTTCCCTTTGTGAGAGAATTGTTGAGTCGCAGATCGATAGTCTCTGATCTTTCCCGAGATGTCTTACAAGAGCACCTGGTAGTGCATAAGGGCTGTACCCTGTGGACTGTTCTTGCTGCATTCTTCGCTGCAATCTCCAACTGTCTGATTTAGTCACTTGCCTGGTTCCACCAACAACAAGATCTAGAGTCCTCCAAGAACCGAACCGGTATGTATACTTTGTTTTCGTTGTCTCATTATCCTACCTTATTTATGGGCGTGTTGGTTTATCGTGTATAAACTAGCCATTCTACTTGCTGGTACAAATTATGATTTGATATATGACCGTCCATGTGAAATATGTATTCCTTAAGTTATTTCACTGAAAGAGTTCTAAATCAAC
AACCATGAACCAACACCCTTTACTAATGAT

Coding sequence (CDS)

ATGGAGATTTCAGCCACCCTAAGCATTCACGGACGCTCTCCGACTCCTAAACAAGCCATCAATGTCTCAAAGGACTGGAACTTGATTATAAAGCACCAAACCAAGCTTAAGAATGACCATGCCATTCTTTCTACATATACCCAGATGGAGTCTCTTGGTATTGCACCCGATTCTGCTACAATGCCTCTTGTTCTAAAGGCTTGCGGGAGGCTCAACGCCATTGAAAAAGGGGCAGGCCTCACACCCAATTCTCGTACTGTAGTGCCTCTGCTTTTGGCTTGTGCTGAGATGTTGGAACTGCGATTAGGACATGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTTGATATGGATGCACATGTTGGTACTGCTTTGATAGGATTTTATATGAGATTTGATGCAGCAGTTTCTCACCGAGTTTTTAGCTTGATGGAGGTGAGAAATGTAGTGAGTTGGAATGCAATGATAACCGGATATCTCAATATTGGAGATTACACAAAAGCTTTGAAGCTTTTTAGTAGTATGCTGACTGAGGGTATAAAGTTTGATGCTGTTACAATGCTGCTGGTAATTCAAGCCTGTGCAGAATCTGAGTCTCTCCAATTAGGCATGCAACTGCATCAGTTGGCTATCAAGTTCAATTTCGTTGATGACTTGTTCGTATTAAATGCACTGTTGAATATGTATAGTGATAATGGACGTCTGGAGTCATCATGTGCGTTGTTTAATGCCGTTCCCACCTCTGATGCCGCCTTATGGAATTCTATGATATCTGCATACATTGCCTTCGGATTTCATGCTGAAGCTATAGCTTTGTATATTAAAATGCGTTTGGAAGGCTTAAAAGAAGACAAAAGAACCGTTGCGATTATGTTGTCTTTATGCGAAGATCTAAACGATGGTTCTATTTGGGGTAGAGGCTTACATGCTCATGCCATGAAAAGTGGAATGGAACTAGATGTATTTCTGGGCAATGCATTGTTAAGCATGTATGTTGAGCACAATCAAATTGATGCTGCACAGAAACTTTTTGATAAGATGAGAGGTTTGGACGTCATCTCCTGGAACACAATGATATTAGCACTTGCTCAGAGTAAGTTTCGAGCCAAAGCATTTCAACTCTTTATGACGATGTGTGAATCAGAAATCAAGTTCAATTCATACACAATGATATCTCTCCTTGCATTATGTAAAGATGGAAGTGATTTGGTGTTTGGGCGATCGATCCATGGTTTTGCAATAAAAAATGGTCTTGAAATAAATACTTCTTTGAACACTTCACTGACTGAAATGTACATAAACTGTAGTGATGAAGGATCGGCTACAAATCTGTTTATTAGATGTCCTCAAAGAGATTTAATTTCATGGAATTCCCTAATTTCGAGCTATATAAAGAATGACAATGCAGGAAAAGCTCTATTACTTTTTAACCATATGATTTCTGAGCTGGAGCCTAACTCCGTGACAATCATAAAAGAGGCTGTAAAGAAAATTCATATGTCGAGGTTCACGCCTTACATTATGCTCATGGGTTGCTCTGCTAATCCTAAAAGGAAGATATCTACTCGGCTAGCTCCAGGAGAGTATTCCCATGAATCCAGTCAACTGAAATTAGTTTCCCTTTGTGAGAGAATTGTTGAGTCGCAGATCGATAGTCTCTGA

Protein sequence

MEISATLSIHGRSPTPKQAINVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLNAIEKGAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSHRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSDEGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIIKEAVKKIHMSRFTPYIMLMGCSANPKRKISTRLAPGEYSHESSQLKLVSLCERIVESQIDSL
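The protein sequence above should correspond to a translation of the CDS. As a rough check, the sketch below translates short excerpts copied from the start and end of the CDS. It assumes the standard nuclear genetic code applies; the `translate` helper is a hypothetical name, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First 24 and last 15 bases of the CDS listed above.
cds_start = "ATGGAGATTTCAGCCACCCTAAGC"
cds_end = "ATCGATAGTCTCTGA"

print(translate(cds_start))  # MEISATLS
print(translate(cds_end))    # IDSL
```

Both excerpts agree with the corresponding ends of the protein sequence (`MEISATLS...` and `...IDSL`); the same function could be applied to the full CDS string to check the whole translation.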
BLAST of Cp4.1LG16g08940 vs. Swiss-Prot
Match: PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 224.2 bits (570), Expect = 3.6e-57
Identity = 151/472 (31.99%), Positives = 248/472 (52.54%), Query Frame = 1

Query: 26  WNLIIKHQTKLKNDHAILSTYTQ-MESLGIAPDSATMPLVLKACGRLNAIEKGAGLTPNS 85
           WNL+I    +  N   ++  ++  M S G+ PD  T P VLKAC                
Sbjct: 120 WNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKAC---------------- 179

Query: 86  RTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSHRV-FS 145
           RTV+              G++IH   L+ G F  D +V  +LI  Y R+ A  + R+ F 
Sbjct: 180 RTVID-------------GNKIHCLALKFG-FMWDVYVAASLIHLYSRYKAVGNARILFD 239

Query: 146 LMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQL 205
            M VR++ SWNAMI+GY   G+  +AL L + +       D+VT++ ++ AC E+     
Sbjct: 240 EMPVRDMGSWNAMISGYCQSGNAKEALTLSNGLRA----MDSVTVVSLLSACTEAGDFNR 299

Query: 206 GMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIA 265
           G+ +H  +IK     +LFV N L+++Y++ GRL     +F+ +   D   WNS+I AY  
Sbjct: 300 GVTIHSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYEL 359

Query: 266 FGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGMEL-DV 325
                 AI+L+ +MRL  ++ D  T+  + S+   L D     R +    ++ G  L D+
Sbjct: 360 NEQPLRAISLFQEMRLSRIQPDCLTLISLASILSQLGDIRAC-RSVQGFTLRKGWFLEDI 419

Query: 326 FLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMTM-CE 385
            +GNA++ MY +   +D+A+ +F+ +   DVISWNT+I   AQ+ F ++A +++  M  E
Sbjct: 420 TIGNAVVVMYAKLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEE 479

Query: 386 SEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSDEGS 445
            EI  N  T +S+L  C     L  G  +HG  +KNGL ++  + TSL +MY  C     
Sbjct: 480 GEIAANQGTWVSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLED 539

Query: 446 ATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISE-LEPNSVTII 493
           A +LF + P+ + + WN+LI+ +  + +  KA++LF  M+ E ++P+ +T +
Sbjct: 540 ALSLFYQIPRVNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFV 556
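Each hit in this report carries a one-line HSP summary giving identities and positives as counts over the aligned length. The sketch below shows one way such a line might be parsed and its percentages recomputed. The `parse_hsp_stats` helper and the dictionary keys are hypothetical, and the pattern only assumes the layout seen in this report.

```python
import re

# Matches "Identity = 151/472 (31.99%), <label> = 248/472 (52.54%)".
# The second label is matched loosely so spelling variants in raw
# exports do not break the parse.
_HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity and positive counts from an HSP summary line."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, length, _, positives, _, _ = match.groups()
    identities, length, positives = int(identities), int(length), int(positives)
    return {
        "identities": identities,
        "positives": positives,
        "aligned_length": length,
        # Recomputed from the counts rather than copied from the text.
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 151/472 (31.99%), Positives = 248/472 (52.54%), Query Frame = 1"
)
print(stats)
```

For the PP348_ARATH hit the recomputed values (31.99% identity, 52.54% positives) match the percentages printed in the report.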

BLAST of Cp4.1LG16g08940 vs. Swiss-Prot
Match: PP303_ARATH (Pentatricopeptide repeat-containing protein At4g04370 OS=Arabidopsis thaliana GN=PCMP-E99 PE=3 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 3.0e-56
Identity = 144/462 (31.17%), Positives = 234/462 (50.65%), Query Frame = 1

Query: 20  INVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLNAIEKGAG 79
           +N +K +N  I H +   +   +LST++ M +  + PD+ T P +LK             
Sbjct: 8   LNSTKYFNSHINHLSSHGDHKQVLSTFSSMLANKLLPDTFTFPSLLK------------- 67

Query: 80  LTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFD-AAVS 139
                        ACA +  L  G  IH   L NG F  D ++ ++L+  Y +F   A +
Sbjct: 68  -------------ACASLQRLSFGLSIHQQVLVNG-FSSDFYISSSLVNLYAKFGLLAHA 127

Query: 140 HRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAES 199
            +VF  M  R+VV W AMI  Y   G   +A  L + M  +GIK   VT+L ++    E 
Sbjct: 128 RKVFEEMRERDVVHWTAMIGCYSRAGIVGEACSLVNEMRFQGIKPGPVTLLEMLSGVLEI 187

Query: 200 ESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMI 259
             LQ    LH  A+ + F  D+ V+N++LN+Y     +  +  LF+ +   D   WN+MI
Sbjct: 188 TQLQC---LHDFAVIYGFDCDIAVMNSMLNLYCKCDHVGDAKDLFDQMEQRDMVSWNTMI 247

Query: 260 SAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGM 319
           S Y + G  +E + L  +MR +GL+ D++T    LS+   + D  + GR LH   +K+G 
Sbjct: 248 SGYASVGNMSEILKLLYRMRGDGLRPDQQTFGASLSVSGTMCDLEM-GRMLHCQIVKTGF 307

Query: 320 ELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMT 379
           ++D+ L  AL++MY++  + +A+ ++ + +   DV+ W  MI  L +     KA  +F  
Sbjct: 308 DVDMHLKTALITMYLKCGKEEASYRVLETIPNKDVVCWTVMISGLMRLGRAEKALIVFSE 367

Query: 380 MCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSD 439
           M +S    +S  + S++A C        G S+HG+ +++G  ++T    SL  MY  C  
Sbjct: 368 MLQSGSDLSSEAIASVVASCAQLGSFDLGASVHGYVLRHGYTLDTPALNSLITMYAKCGH 427

Query: 440 EGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHM 481
              +  +F R  +RDL+SWN++IS Y +N +  KALLLF  M
Sbjct: 428 LDKSLVIFERMNERDLVSWNAIISGYAQNVDLCKALLLFEEM 438

BLAST of Cp4.1LG16g08940 vs. Swiss-Prot
Match: PP268_ARATH (Putative pentatricopeptide repeat-containing protein At3g47840 OS=Arabidopsis thaliana GN=PCMP-E43 PE=3 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 1.1e-53
Identity = 139/471 (29.51%), Positives = 228/471 (48.41%), Query Frame = 1

Query: 26  WNLIIKHQTKLKNDHAILSTYTQMESL--GIAPDSATMPLVLKACGRLNAIEKGAGLTPN 85
           W  IIK      N    L  ++ M  +   ++PD++ + +VLKACG+ + I  G  L   
Sbjct: 74  WTSIIKRYVTANNSDEALILFSAMRVVDHAVSPDTSVLSVVLKACGQSSNIAYGESL--- 133

Query: 86  SRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAA-VSHRVF 145
                                  H Y ++  L     +VG++L+  Y R      S RVF
Sbjct: 134 -----------------------HAYAVKTSLLS-SVYVGSSLLDMYKRVGKIDKSCRVF 193

Query: 146 SLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQ 205
           S M  RN V+W A+ITG ++ G Y + L  FS M       D  T  + ++ACA    ++
Sbjct: 194 SEMPFRNAVTWTAIITGLVHAGRYKEGLTYFSEMSRSEELSDTYTFAIALKACAGLRQVK 253

Query: 206 LGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYI 265
            G  +H   I   FV  L V N+L  MY++ G ++    LF  +   D   W S+I AY 
Sbjct: 254 YGKAIHTHVIVRGFVTTLCVANSLATMYTECGEMQDGLCLFENMSERDVVSWTSLIVAYK 313

Query: 266 AFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDV 325
             G   +A+  +IKMR   +  +++T A M S C  L+   +WG  LH + +  G+   +
Sbjct: 314 RIGQEVKAVETFIKMRNSQVPPNEQTFASMFSACASLS-RLVWGEQLHCNVLSLGLNDSL 373

Query: 326 FLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMTMCES 385
            + N+++ MY     + +A  LF  MR  D+ISW+T+I    Q+ F  + F+ F  M +S
Sbjct: 374 SVSNSMMKMYSTCGNLVSASVLFQGMRCRDIISWSTIIGGYCQAGFGEEGFKYFSWMRQS 433

Query: 386 EIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSDEGSA 445
             K   + + SLL++  + + +  GR +H  A+  GLE N+++ +SL  MY  C     A
Sbjct: 434 GTKPTDFALASLLSVSGNMAVIEGGRQVHALALCFGLEQNSTVRSSLINMYSKCGSIKEA 493

Query: 446 TNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMIS-ELEPNSVTII 493
           + +F    + D++S  ++I+ Y ++  + +A+ LF   +     P+SVT I
Sbjct: 494 SMIFGETDRDDIVSLTAMINGYAEHGKSKEAIDLFEKSLKVGFRPDSVTFI 516

BLAST of Cp4.1LG16g08940 vs. Swiss-Prot
Match: PP320_ARATH (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana GN=DOT4 PE=2 SV=1)

HSP 1 Score: 208.8 bits (530), Expect = 1.6e-52
Identity = 137/459 (29.85%), Positives = 227/459 (49.46%), Query Frame = 1

Query: 26  WNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLNAIEKGAGLTPNSR 85
           WN+++    K  +    +  + +M S G+  DS T   V K+   L ++  G        
Sbjct: 163 WNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGG-------- 222

Query: 86  TVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSHR-VFSL 145
                              ++HG+ L++G  + ++ VG +L+ FY++     S R VF  
Sbjct: 223 ------------------EQLHGFILKSGFGERNS-VGNSLVAFYLKNQRVDSARKVFDE 282

Query: 146 MEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLG 205
           M  R+V+SWN++I GY++ G   K L +F  ML  GI+ D  T++ V   CA+S  + LG
Sbjct: 283 MTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLG 342

Query: 206 MQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAF 265
             +H + +K  F  +    N LL+MYS  G L+S+ A+F  +       + SMI+ Y   
Sbjct: 343 RAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYARE 402

Query: 266 GFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCE--DLNDGSIWGRGLHAHAMKSGMELDV 325
           G   EA+ L+ +M  EG+  D  TV  +L+ C    L D    G+ +H    ++ +  D+
Sbjct: 403 GLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDE---GKRVHEWIKENDLGFDI 462

Query: 326 FLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLF-MTMCE 385
           F+ NAL+ MY +   +  A+ +F +MR  D+ISWNT+I   +++ +  +A  LF + + E
Sbjct: 463 FVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEE 522

Query: 386 SEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSDEGS 445
                +  T+  +L  C   S    GR IHG+ ++NG   +  +  SL +MY  C     
Sbjct: 523 KRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLL 582

Query: 446 ATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHM 481
           A  LF     +DL+SW  +I+ Y  +    +A+ LFN M
Sbjct: 583 AHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQM 591

BLAST of Cp4.1LG16g08940 vs. Swiss-Prot
Match: PP205_ARATH (Putative pentatricopeptide repeat-containing protein At3g01580 OS=Arabidopsis thaliana GN=PCMP-E87 PE=3 SV=2)

HSP 1 Score: 205.7 bits (522), Expect = 1.3e-51
Identity = 129/472 (27.33%), Positives = 237/472 (50.21%), Query Frame = 1

Query: 26  WNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLNAIEKGAGLTPNSR 85
           WN ++K  ++ K    +L  ++ M      PD+ T+P+ LKACG                
Sbjct: 28  WNTLLKSLSREKQWEEVLYHFSHMFRDEEKPDNFTLPVALKACG---------------- 87

Query: 86  TVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSH-RVFSL 145
                     E+ E+  G  IHG+  ++     D +VG++LI  Y++    +   R+F  
Sbjct: 88  ----------ELREVNYGEMIHGFVKKDVTLGSDLYVGSSLIYMYIKCGRMIEALRMFDE 147

Query: 146 MEVRNVVSWNAMITGYLNIGDYTKALKLFSSM-LTEGIKFDAVTMLLVIQACAESESLQL 205
           +E  ++V+W++M++G+   G   +A++ F  M +   +  D VT++ ++ AC +  + +L
Sbjct: 148 LEKPDIVTWSSMVSGFEKNGSPYQAVEFFRRMVMASDVTPDRVTLITLVSACTKLSNSRL 207

Query: 206 GMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIA 265
           G  +H   I+  F +DL ++N+LLN Y+ +   + +  LF  +   D   W+++I+ Y+ 
Sbjct: 208 GRCVHGFVIRRGFSNDLSLVNSLLNCYAKSRAFKEAVNLFKMIAEKDVISWSTVIACYVQ 267

Query: 266 FGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVF 325
            G  AEA+ ++  M  +G + +  TV  +L  C   +D    GR  H  A++ G+E +V 
Sbjct: 268 NGAAAEALLVFNDMMDDGTEPNVATVLCVLQACAAAHDLE-QGRKTHELAIRKGLETEVK 327

Query: 326 LGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMTM-CES 385
           +  AL+ MY++    + A  +F ++   DV+SW  +I     +    ++ + F  M  E+
Sbjct: 328 VSTALVDMYMKCFSPEEAYAVFSRIPRKDVVSWVALISGFTLNGMAHRSIEEFSIMLLEN 387

Query: 386 EIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSDEGSA 445
             + ++  M+ +L  C +   L   +  H + IK G + N  +  SL E+Y  C   G+A
Sbjct: 388 NTRPDAILMVKVLGSCSELGFLEQAKCFHSYVIKYGFDSNPFIGASLVELYSRCGSLGNA 447

Query: 446 TNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMI--SELEPNSVTII 493
           + +F     +D + W SLI+ Y  +    KAL  FNHM+  SE++PN VT +
Sbjct: 448 SKVFNGIALKDTVVWTSLITGYGIHGKGTKALETFNHMVKSSEVKPNEVTFL 472

BLAST of Cp4.1LG16g08940 vs. TrEMBL
Match: A0A061GS93_THECC (Pentatricopeptide repeat-containing protein OS=Theobroma cacao GN=TCM_040700 PE=4 SV=1)

HSP 1 Score: 509.6 bits (1311), Expect = 4.8e-141
Identity = 251/424 (59.20%), Positives = 326/424 (76.89%), Query Frame = 1

Query: 79  GLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVS 138
           G  PNSRT+V +LLAC E+ E+RLG EIHGYCLRNGLFD+D HVGTALIGFY+ F+   S
Sbjct: 154 GFRPNSRTLVAMLLACQEVAEVRLGKEIHGYCLRNGLFDLDPHVGTALIGFYLSFNVRAS 213

Query: 139 HRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAES 198
           H VF LM VRN V WNAMI GY +IG+  KALKLF  ML +G++FD+VTML +IQACAE 
Sbjct: 214 HTVFDLMAVRNTVCWNAMIKGYFDIGESLKALKLFEKMLMDGVEFDSVTMLALIQACAEF 273

Query: 199 ESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMI 258
            SL+LG Q+HQ+AIK ++ +DLF++NALLNMY+D G L+S+C LF+  P  D ALWNSMI
Sbjct: 274 GSLELGSQIHQMAIKCSYSNDLFIVNALLNMYADIGSLKSACKLFDVTPRRDVALWNSMI 333

Query: 259 SAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGM 318
           SAY  +  + EA +L++ MR EG KED RT+ IM SLC +  DG   G+ LHA+A KSGM
Sbjct: 334 SAYFEYSCNEEATSLFVHMRTEGNKEDDRTIVIMFSLCAESADGLRKGKSLHAYASKSGM 393

Query: 319 ELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMT 378
            +DV LGNA+L+MY + N ID+ QK+F +M  +DVIS+NT+ILALA++K  ++A+++F  
Sbjct: 394 RMDVNLGNAMLNMYAQQNCIDSVQKVFSEMSNVDVISFNTVILALARNKLGSEAWEVFGL 453

Query: 379 MCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSD 438
           M E +++ NSYT+IS+LA CKD + L  GRS+HGF IK G+E+N SLNT+LT+MYINC D
Sbjct: 454 MWELDVEPNSYTIISILAACKDETCLNIGRSLHGFVIKQGIEVNVSLNTALTDMYINCGD 513

Query: 439 EGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIIKEAVKK 498
           E +A NLF  CP RDLISWN+LI++Y+KN+ A +A L+F+ MISE+EPNSVTII      
Sbjct: 514 EATARNLFESCPGRDLISWNALIATYVKNNLAHEAFLVFSRMISEVEPNSVTIINILSSC 573

Query: 499 IHMS 503
            H++
Sbjct: 574 THLA 577

BLAST of Cp4.1LG16g08940 vs. TrEMBL
Match: A5BC97_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043633 PE=4 SV=1)

HSP 1 Score: 495.0 bits (1273), Expect = 1.2e-136
Identity = 255/440 (57.95%), Positives = 322/440 (73.18%), Query Frame = 1

Query: 63  LVLKACGRLNAIEKGAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHV 122
           L+++  GR N       L PNSRT+V LLLAC    ELRLG  +HGYCLRNG+FD + HV
Sbjct: 147 LLVREMGREN-------LRPNSRTMVALLLACEGASELRLGRGVHGYCLRNGMFDSNPHV 206

Query: 123 GTALIGFYMRFDAAVSHRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIK 182
            TALIGFY+RFD  V   +F LM VRN+VSWNAMI+GY ++GDY KAL+LF  ML + +K
Sbjct: 207 ATALIGFYLRFDMRVLPLLFDLMVVRNIVSWNAMISGYYDVGDYFKALELFVQMLVDEVK 266

Query: 183 FDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCAL 242
           FD VTML+ +QACAE  SL+LG Q+HQLAIKF FV+DL++LNALLNMYS+NG LESS  L
Sbjct: 267 FDCVTMLVAVQACAELGSLKLGKQIHQLAIKFEFVEDLYILNALLNMYSNNGSLESSHQL 326

Query: 243 FNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDG 302
           F +VP  DA LWNSMISAY AFG H EA+ L+I+M+ EG+K+D+RTV IMLS+CE+L  G
Sbjct: 327 FESVPNRDAPLWNSMISAYAAFGCHEEAMDLFIRMQSEGVKKDERTVVIMLSMCEELASG 386

Query: 303 SIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILA 362
            + G+ LHAH +KSGM +D  LGNALLSMY E N +++ QK+FD+M+G+D+ISWNTMILA
Sbjct: 387 LLKGKSLHAHVIKSGMRIDASLGNALLSMYTELNCVESVQKIFDRMKGVDIISWNTMILA 446

Query: 363 LAQSKFRAKAFQLFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEIN 422
           LA++  RA+A +LF  M ESEIK NSYT+IS+LA C+D + L FGRSIHG+ +K+ +EIN
Sbjct: 447 LARNTLRAQACELFERMRESEIKPNSYTIISILAACEDVTCLDFGRSIHGYVMKHSIEIN 506

Query: 423 TSLNTSLTEMYINCSDEGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMIS 482
             L T+L +MY+NC DE +A +LF  CP RDLISWN+                    MI 
Sbjct: 507 QPLRTALADMYMNCGDEATARDLFEGCPDRDLISWNA--------------------MIX 559

Query: 483 ELEPNSVTIIKEAVKKIHMS 503
           + EPNSVTII       H++
Sbjct: 567 KAEPNSVTIINVLSSFTHLA 559

BLAST of Cp4.1LG16g08940 vs. TrEMBL
Match: A0A0D2N9C6_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G138400 PE=4 SV=1)

HSP 1 Score: 486.5 bits (1251), Expect = 4.3e-134
Identity = 239/425 (56.24%), Positives = 322/425 (75.76%), Query Frame = 1

Query: 79  GLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRF-DAAV 138
           G  PNSRT+V ++L C ++ E+RLG  IHGYCLRNGLFD+DAHVGTALI FY+ F D   
Sbjct: 148 GFRPNSRTLVAMILVCDKVAEVRLGKAIHGYCLRNGLFDLDAHVGTALISFYLSFFDVRA 207

Query: 139 SHRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAE 198
           SH VF LM +RN V WNAMI GY ++G+ +KAL+LF  ML +G++FD+VT+L +IQA AE
Sbjct: 208 SHLVFDLMAIRNTVCWNAMIMGYFDVGESSKALRLFEQMLMDGVEFDSVTVLALIQASAE 267

Query: 199 SESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSM 258
             SL+LG Q+HQ+AIK ++ +DLF++NAL+NMY++ G L+S+C LF+ +PT D ALWNSM
Sbjct: 268 FGSLELGDQIHQMAIKCSYSNDLFIVNALINMYAEIGCLKSACKLFDGIPTRDVALWNSM 327

Query: 259 ISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSG 318
           ISAYI + +H EAI+L+IKMR EG KED+RT  +MLSLC +  D    GR LHAHA K+G
Sbjct: 328 ISAYIDYSYHGEAISLFIKMRTEGNKEDERTTVLMLSLCAESADALRKGRSLHAHACKTG 387

Query: 319 MELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFM 378
           M +D+ +GNA+L+MY E N +D+ +K+F +M  +DVIS+NT+IL LA++    +A++ F 
Sbjct: 388 MGMDINIGNAILNMYAEQNCMDSVRKVFGQMSNVDVISYNTLILVLARNNLGIEAWETFG 447

Query: 379 TMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCS 438
            M ES++K NSYT+IS+LA CKD + L  GRS+HGF IK G+E+N  L T+LT+MYINC 
Sbjct: 448 IMRESDVKPNSYTIISILAACKDETCLNIGRSLHGFVIKQGIEVNAPLKTALTDMYINCG 507

Query: 439 DEGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIIKEAVK 498
           DE +A  LF     RDLISWN+LIS+Y+KN+ A +A L+F+ M+SE+EPNSVTII     
Sbjct: 508 DETTAMKLFESSHGRDLISWNALISTYVKNNQAHEAFLVFSRMVSEVEPNSVTIINILSS 567

Query: 499 KIHMS 503
             H++
Sbjct: 568 CTHLA 572

BLAST of Cp4.1LG16g08940 vs. TrEMBL
Match: W9RTH5_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_015292 PE=4 SV=1)

HSP 1 Score: 484.6 bits (1246), Expect = 1.6e-133
Identity = 247/426 (57.98%), Positives = 319/426 (74.88%), Query Frame = 1

Query: 79  GLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVS 138
           GL PNSRTVV LL  C E+ ELRLG EIHGYCLRNGLFD+D HVGTALIGFY RFDA +S
Sbjct: 141 GLKPNSRTVVGLLSTCRELDELRLGQEIHGYCLRNGLFDLDLHVGTALIGFYSRFDARIS 200

Query: 139 HRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGI-KFDAVTMLLVIQACAE 198
             VF LM+V+N VSWNA+ITGY+++G+  +A  LF  +L +G+ KFD++T+L+V QACAE
Sbjct: 201 RLVFDLMDVKNTVSWNAIITGYVDMGENLEACNLFVHLLVDGVNKFDSITVLVVAQACAE 260

Query: 199 SESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSM 258
                LGMQ+HQLAIK+ + ++LF++NALLNMY D   L+ +C LF  VP  D ALWNSM
Sbjct: 261 LGFRNLGMQIHQLAIKYGYRNNLFIVNALLNMYCDCRSLDLACRLFENVPNRDVALWNSM 320

Query: 259 ISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSG 318
           I AYI +G   EA++L++ MR EG++ED+RT+AIM S C +L DG   G+ LHAHA+KSG
Sbjct: 321 IYAYIEYGICDEALSLFVSMRTEGVREDERTIAIMASSCPNLADGVRNGKSLHAHAIKSG 380

Query: 319 MELD-VFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLF 378
           ME+D V LGNA L MY E N  +AAQK+FD M G DVISWNT+I+ALA +K R +A+ LF
Sbjct: 381 MEIDDVSLGNAFLGMYAELNCTEAAQKVFDDMTGPDVISWNTLIMALACNKLRNEAWNLF 440

Query: 379 MTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINC 438
             M  +++  NS+T+IS+LA C D + L  GR++HGF IK G+EI+ S NT+LT+MY+NC
Sbjct: 441 EAMRATKMTPNSHTVISILAACDDETCLNIGRAVHGFVIKLGIEIDLSFNTALTDMYMNC 500

Query: 439 SDEGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIIKEAV 498
            DE +A NLF   P RD+ISWN+LI+SY++N+   KA LLF+ MISE+EPN VTII    
Sbjct: 501 GDEATARNLFENFPDRDVISWNALIASYVRNNQGEKAQLLFSRMISEVEPNGVTIINMLS 560

Query: 499 KKIHMS 503
              H++
Sbjct: 561 SCTHLA 566

BLAST of Cp4.1LG16g08940 vs. TrEMBL
Match: A0A067JE40_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25923 PE=4 SV=1)

HSP 1 Score: 472.2 bits (1214), Expect = 8.4e-130
Identity = 244/428 (57.01%), Positives = 317/428 (74.07%), Query Frame = 1

Query: 65  LKACGRLNAIEKGAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGT 124
           ++A G+   +++  GL PNSRT+V LLLAC  +LELRLG E+HGYCLR+G FD+  H+GT
Sbjct: 144 VEAIGQFRRMQR-EGLEPNSRTLVALLLACEGILELRLGQELHGYCLRSGYFDLYPHLGT 203

Query: 125 ALIGFYMRFDAAVSHRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFD 184
           ALIGFY+ FD  +S  VF LM V++ VSWNAMITGY   GD+ KAL+LF  ML +G+KFD
Sbjct: 204 ALIGFYLNFDVKISSLVFDLMIVKSAVSWNAMITGYFGSGDFVKALELFVQMLKDGVKFD 263

Query: 185 AVTMLLVIQACAESESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFN 244
            VT+L+ IQA AE  S +LGMQ+HQLAIK ++ ++LF++NALLNMY++ G LE +C LF+
Sbjct: 264 MVTILVSIQASAEIGSSELGMQIHQLAIKLSYGNELFIVNALLNMYAEIGNLELACRLFD 323

Query: 245 AVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSI 304
            V   D  LWNSMI+AYI  G + EA +L+  MR E  +ED+RT+A++LSL  +L DG  
Sbjct: 324 TVTVHDVPLWNSMIAAYIDHGCYEEATSLFTTMRTE-TREDERTIAVILSLSAELTDGLK 383

Query: 305 WGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALA 364
            GR LHA A K  M+++V LGNALLSMY + N ++ A K+F++M  +DV+ +NT+ILA +
Sbjct: 384 IGRSLHALAYKREMKMNVSLGNALLSMYADLNCVEDALKVFNEMSNIDVVPYNTLILAFS 443

Query: 365 QSKFRAKAFQLFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTS 424
            S    KA++LF  M ESE+  NS+TMISLLA C D   L  GRS+HGF IKN +EIN S
Sbjct: 444 VSNLSGKAWELFGMMRESEVSPNSHTMISLLASCGDEKCLNIGRSVHGFIIKNSIEINLS 503

Query: 425 LNTSLTEMYINCSDEGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISEL 484
           LNTSLTEMYINC D  +A  LF  CP RDLISWN++I++ +KND  G+A+L FN MISE+
Sbjct: 504 LNTSLTEMYINCGDGAAARYLFDTCPSRDLISWNAIIAALLKNDKTGEAILFFNRMISEV 563

Query: 485 EPNSVTII 493
           EPNSVTII
Sbjct: 564 EPNSVTII 569

BLAST of Cp4.1LG16g08940 vs. TAIR10
Match: AT4G33990.1 (AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 224.2 bits (570), Expect = 2.0e-58
Identity = 151/472 (31.99%), Positives = 248/472 (52.54%), Query Frame = 1

Query: 26  WNLIIKHQTKLKNDHAILSTYTQ-MESLGIAPDSATMPLVLKACGRLNAIEKGAGLTPNS 85
           WNL+I    +  N   ++  ++  M S G+ PD  T P VLKAC                
Sbjct: 120 WNLMISGYGRAGNSSEVIRCFSLFMLSSGLTPDYRTFPSVLKAC---------------- 179

Query: 86  RTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSHRV-FS 145
           RTV+              G++IH   L+ G F  D +V  +LI  Y R+ A  + R+ F 
Sbjct: 180 RTVID-------------GNKIHCLALKFG-FMWDVYVAASLIHLYSRYKAVGNARILFD 239

Query: 146 LMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQL 205
            M VR++ SWNAMI+GY   G+  +AL L + +       D+VT++ ++ AC E+     
Sbjct: 240 EMPVRDMGSWNAMISGYCQSGNAKEALTLSNGLRA----MDSVTVVSLLSACTEAGDFNR 299

Query: 206 GMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIA 265
           G+ +H  +IK     +LFV N L+++Y++ GRL     +F+ +   D   WNS+I AY  
Sbjct: 300 GVTIHSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYEL 359

Query: 266 FGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGMEL-DV 325
                 AI+L+ +MRL  ++ D  T+  + S+   L D     R +    ++ G  L D+
Sbjct: 360 NEQPLRAISLFQEMRLSRIQPDCLTLISLASILSQLGDIRAC-RSVQGFTLRKGWFLEDI 419

Query: 326 FLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMTM-CE 385
            +GNA++ MY +   +D+A+ +F+ +   DVISWNT+I   AQ+ F ++A +++  M  E
Sbjct: 420 TIGNAVVVMYAKLGLVDSARAVFNWLPNTDVISWNTIISGYAQNGFASEAIEMYNIMEEE 479

Query: 386 SEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSDEGS 445
            EI  N  T +S+L  C     L  G  +HG  +KNGL ++  + TSL +MY  C     
Sbjct: 480 GEIAANQGTWVSVLPACSQAGALRQGMKLHGRLLKNGLYLDVFVVTSLADMYGKCGRLED 539

Query: 446 ATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISE-LEPNSVTII 493
           A +LF + P+ + + WN+LI+ +  + +  KA++LF  M+ E ++P+ +T +
Sbjct: 540 ALSLFYQIPRVNSVPWNTLIACHGFHGHGEKAVMLFKEMLDEGVKPDHITFV 556

BLAST of Cp4.1LG16g08940 vs. TAIR10
Match: AT4G04370.1 (AT4G04370.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 221.1 bits (562), Expect = 1.7e-57
Identity = 144/462 (31.17%), Positives = 234/462 (50.65%), Query Frame = 1

Query: 20  INVSKDWNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLNAIEKGAG 79
           +N +K +N  I H +   +   +LST++ M +  + PD+ T P +LK             
Sbjct: 8   LNSTKYFNSHINHLSSHGDHKQVLSTFSSMLANKLLPDTFTFPSLLK------------- 67

Query: 80  LTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFD-AAVS 139
                        ACA +  L  G  IH   L NG F  D ++ ++L+  Y +F   A +
Sbjct: 68  -------------ACASLQRLSFGLSIHQQVLVNG-FSSDFYISSSLVNLYAKFGLLAHA 127

Query: 140 HRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAES 199
            +VF  M  R+VV W AMI  Y   G   +A  L + M  +GIK   VT+L ++    E 
Sbjct: 128 RKVFEEMRERDVVHWTAMIGCYSRAGIVGEACSLVNEMRFQGIKPGPVTLLEMLSGVLEI 187

Query: 200 ESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMI 259
             LQ    LH  A+ + F  D+ V+N++LN+Y     +  +  LF+ +   D   WN+MI
Sbjct: 188 TQLQC---LHDFAVIYGFDCDIAVMNSMLNLYCKCDHVGDAKDLFDQMEQRDMVSWNTMI 247

Query: 260 SAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGM 319
           S Y + G  +E + L  +MR +GL+ D++T    LS+   + D  + GR LH   +K+G 
Sbjct: 248 SGYASVGNMSEILKLLYRMRGDGLRPDQQTFGASLSVSGTMCDLEM-GRMLHCQIVKTGF 307

Query: 320 ELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMT 379
           ++D+ L  AL++MY++  + +A+ ++ + +   DV+ W  MI  L +     KA  +F  
Sbjct: 308 DVDMHLKTALITMYLKCGKEEASYRVLETIPNKDVVCWTVMISGLMRLGRAEKALIVFSE 367

Query: 380 MCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSD 439
           M +S    +S  + S++A C        G S+HG+ +++G  ++T    SL  MY  C  
Sbjct: 368 MLQSGSDLSSEAIASVVASCAQLGSFDLGASVHGYVLRHGYTLDTPALNSLITMYAKCGH 427

Query: 440 EGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHM 481
              +  +F R  +RDL+SWN++IS Y +N +  KALLLF  M
Sbjct: 428 LDKSLVIFERMNERDLVSWNAIISGYAQNVDLCKALLLFEEM 438

BLAST of Cp4.1LG16g08940 vs. TAIR10
Match: AT3G47840.1 (AT3G47840.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 212.6 bits (540), Expect = 6.1e-55
Identity = 139/471 (29.51%), Positives = 228/471 (48.41%), Query Frame = 1

Query: 26  WNLIIKHQTKLKNDHAILSTYTQMESL--GIAPDSATMPLVLKACGRLNAIEKGAGLTPN 85
           W  IIK      N    L  ++ M  +   ++PD++ + +VLKACG+ + I  G  L   
Sbjct: 74  WTSIIKRYVTANNSDEALILFSAMRVVDHAVSPDTSVLSVVLKACGQSSNIAYGESL--- 133

Query: 86  SRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAA-VSHRVF 145
                                  H Y ++  L     +VG++L+  Y R      S RVF
Sbjct: 134 -----------------------HAYAVKTSLLS-SVYVGSSLLDMYKRVGKIDKSCRVF 193

Query: 146 SLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQ 205
           S M  RN V+W A+ITG ++ G Y + L  FS M       D  T  + ++ACA    ++
Sbjct: 194 SEMPFRNAVTWTAIITGLVHAGRYKEGLTYFSEMSRSEELSDTYTFAIALKACAGLRQVK 253

Query: 206 LGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYI 265
            G  +H   I   FV  L V N+L  MY++ G ++    LF  +   D   W S+I AY 
Sbjct: 254 YGKAIHTHVIVRGFVTTLCVANSLATMYTECGEMQDGLCLFENMSERDVVSWTSLIVAYK 313

Query: 266 AFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDV 325
             G   +A+  +IKMR   +  +++T A M S C  L+   +WG  LH + +  G+   +
Sbjct: 314 RIGQEVKAVETFIKMRNSQVPPNEQTFASMFSACASLS-RLVWGEQLHCNVLSLGLNDSL 373

Query: 326 FLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMTMCES 385
            + N+++ MY     + +A  LF  MR  D+ISW+T+I    Q+ F  + F+ F  M +S
Sbjct: 374 SVSNSMMKMYSTCGNLVSASVLFQGMRCRDIISWSTIIGGYCQAGFGEEGFKYFSWMRQS 433

Query: 386 EIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSDEGSA 445
             K   + + SLL++  + + +  GR +H  A+  GLE N+++ +SL  MY  C     A
Sbjct: 434 GTKPTDFALASLLSVSGNMAVIEGGRQVHALALCFGLEQNSTVRSSLINMYSKCGSIKEA 493

Query: 446 TNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMIS-ELEPNSVTII 493
           + +F    + D++S  ++I+ Y ++  + +A+ LF   +     P+SVT I
Sbjct: 494 SMIFGETDRDDIVSLTAMINGYAEHGKSKEAIDLFEKSLKVGFRPDSVTFI 516

BLAST of Cp4.1LG16g08940 vs. TAIR10
Match: AT4G18750.1 (AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 208.8 bits (530), Expect = 8.8e-54
Identity = 137/459 (29.85%), Positives = 227/459 (49.46%), Query Frame = 1

Query: 26  WNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLNAIEKGAGLTPNSR 85
           WN+++    K  +    +  + +M S G+  DS T   V K+   L ++  G        
Sbjct: 163 WNILMNELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHGG-------- 222

Query: 86  TVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSHR-VFSL 145
                              ++HG+ L++G  + ++ VG +L+ FY++     S R VF  
Sbjct: 223 ------------------EQLHGFILKSGFGERNS-VGNSLVAFYLKNQRVDSARKVFDE 282

Query: 146 MEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAESESLQLG 205
           M  R+V+SWN++I GY++ G   K L +F  ML  GI+ D  T++ V   CA+S  + LG
Sbjct: 283 MTERDVISWNSIINGYVSNGLAEKGLSVFVQMLVSGIEIDLATIVSVFAGCADSRLISLG 342

Query: 206 MQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIAF 265
             +H + +K  F  +    N LL+MYS  G L+S+ A+F  +       + SMI+ Y   
Sbjct: 343 RAVHSIGVKACFSREDRFCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYARE 402

Query: 266 GFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCE--DLNDGSIWGRGLHAHAMKSGMELDV 325
           G   EA+ L+ +M  EG+  D  TV  +L+ C    L D    G+ +H    ++ +  D+
Sbjct: 403 GLAGEAVKLFEEMEEEGISPDVYTVTAVLNCCARYRLLDE---GKRVHEWIKENDLGFDI 462

Query: 326 FLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLF-MTMCE 385
           F+ NAL+ MY +   +  A+ +F +MR  D+ISWNT+I   +++ +  +A  LF + + E
Sbjct: 463 FVSNALMDMYAKCGSMQEAELVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEE 522

Query: 386 SEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSDEGS 445
                +  T+  +L  C   S    GR IHG+ ++NG   +  +  SL +MY  C     
Sbjct: 523 KRFSPDERTVACVLPACASLSAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLL 582

Query: 446 ATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHM 481
           A  LF     +DL+SW  +I+ Y  +    +A+ LFN M
Sbjct: 583 AHMLFDDIASKDLVSWTVMIAGYGMHGFGKEAIALFNQM 591

BLAST of Cp4.1LG16g08940 vs. TAIR10
Match: AT3G01580.1 (AT3G01580.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 205.7 bits (522), Expect = 7.5e-53
Identity = 129/472 (27.33%), Positives = 237/472 (50.21%), Query Frame = 1

Query: 26  WNLIIKHQTKLKNDHAILSTYTQMESLGIAPDSATMPLVLKACGRLNAIEKGAGLTPNSR 85
           WN ++K  ++ K    +L  ++ M      PD+ T+P+ LKACG                
Sbjct: 28  WNTLLKSLSREKQWEEVLYHFSHMFRDEEKPDNFTLPVALKACG---------------- 87

Query: 86  TVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVSH-RVFSL 145
                     E+ E+  G  IHG+  ++     D +VG++LI  Y++    +   R+F  
Sbjct: 88  ----------ELREVNYGEMIHGFVKKDVTLGSDLYVGSSLIYMYIKCGRMIEALRMFDE 147

Query: 146 MEVRNVVSWNAMITGYLNIGDYTKALKLFSSM-LTEGIKFDAVTMLLVIQACAESESLQL 205
           +E  ++V+W++M++G+   G   +A++ F  M +   +  D VT++ ++ AC +  + +L
Sbjct: 148 LEKPDIVTWSSMVSGFEKNGSPYQAVEFFRRMVMASDVTPDRVTLITLVSACTKLSNSRL 207

Query: 206 GMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMISAYIA 265
           G  +H   I+  F +DL ++N+LLN Y+ +   + +  LF  +   D   W+++I+ Y+ 
Sbjct: 208 GRCVHGFVIRRGFSNDLSLVNSLLNCYAKSRAFKEAVNLFKMIAEKDVISWSTVIACYVQ 267

Query: 266 FGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGMELDVF 325
            G  AEA+ ++  M  +G + +  TV  +L  C   +D    GR  H  A++ G+E +V 
Sbjct: 268 NGAAAEALLVFNDMMDDGTEPNVATVLCVLQACAAAHDLE-QGRKTHELAIRKGLETEVK 327

Query: 326 LGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMTM-CES 385
           +  AL+ MY++    + A  +F ++   DV+SW  +I     +    ++ + F  M  E+
Sbjct: 328 VSTALVDMYMKCFSPEEAYAVFSRIPRKDVVSWVALISGFTLNGMAHRSIEEFSIMLLEN 387

Query: 386 EIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSDEGSA 445
             + ++  M+ +L  C +   L   +  H + IK G + N  +  SL E+Y  C   G+A
Sbjct: 388 NTRPDAILMVKVLGSCSELGFLEQAKCFHSYVIKYGFDSNPFIGASLVELYSRCGSLGNA 447

Query: 446 TNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMI--SELEPNSVTII 493
           + +F     +D + W SLI+ Y  +    KAL  FNHM+  SE++PN VT +
Sbjct: 448 SKVFNGIALKDTVVWTSLITGYGIHGKGTKALETFNHMVKSSEVKPNEVTFL 472

BLAST of Cp4.1LG16g08940 vs. NCBI nr
Match: gi|659097605|ref|XP_008449715.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis melo])

HSP 1 Score: 718.4 bits (1853), Expect = 9.7e-204
Identity = 358/417 (85.85%), Positives = 386/417 (92.57%), Query Frame = 1

Query: 76  KGAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDA 135
           K AGLTPNSRTVV LLLAC EMLELRLG EIHGYCLRNGLFDMDA+VGTAL+GFY+RFDA
Sbjct: 153 KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYLRFDA 212

Query: 136 AVSHRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQAC 195
            +SHRVFSLM VRN+VSWNA+ITG+LN+GDYTKALKLFSSML EGIKFDAVTML+VIQAC
Sbjct: 213 VLSHRVFSLMVVRNIVSWNAIITGFLNVGDYTKALKLFSSMLIEGIKFDAVTMLVVIQAC 272

Query: 196 AESESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWN 255
           AE   L+LGMQLHQLAIKFN ++D+FVLNALLNMYSDNG LESSC LFNAVPTSDAALWN
Sbjct: 273 AEYGCLRLGMQLHQLAIKFNLINDVFVLNALLNMYSDNGSLESSCVLFNAVPTSDAALWN 332

Query: 256 SMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMK 315
           SMIS YI FGFHAEAIAL+IKMRLE +KED RT+ IMLSLC DLNDGS+WGRGLHAHAMK
Sbjct: 333 SMISCYIGFGFHAEAIALFIKMRLERIKEDVRTIVIMLSLCNDLNDGSLWGRGLHAHAMK 392

Query: 316 SGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQL 375
           SG+ELD FLGNALLSMYV+HNQI+AAQ +F+K RGLDVISWNTMI ALAQS FRAKAF+L
Sbjct: 393 SGIELDAFLGNALLSMYVKHNQINAAQNVFEKTRGLDVISWNTMISALAQSMFRAKAFEL 452

Query: 376 FMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 435
           F  MCESEIKFNSYT+ISLLALCKDG+DLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN
Sbjct: 453 FFMMCESEIKFNSYTIISLLALCKDGNDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 512

Query: 436 CSDEGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTII 493
           C DE +A ++F RCPQRDLISWNSLI SYIKNDNAGKALLLFNHMISELEPNSVTII
Sbjct: 513 CGDERAAIDMFTRCPQRDLISWNSLILSYIKNDNAGKALLLFNHMISELEPNSVTII 569

BLAST of Cp4.1LG16g08940 vs. NCBI nr
Match: gi|449448940|ref|XP_004142223.1| (PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580 [Cucumis sativus])

HSP 1 Score: 717.6 bits (1851), Expect = 1.7e-203
Identity = 358/417 (85.85%), Positives = 387/417 (92.81%), Query Frame = 1

Query: 76  KGAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDA 135
           K AGLTPNSRTVV LLLAC EMLELRLG EIHGYCLRNGLFDMDA+VGTAL+GFYMRFDA
Sbjct: 153 KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDA 212

Query: 136 AVSHRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQAC 195
            +SHRVFSLM VRN+VSWNA+ITG+LN+GD  KALKL+SSML EGIKFDAVTML+VIQAC
Sbjct: 213 VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQAC 272

Query: 196 AESESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWN 255
           AE   L+LGMQLHQLAIKFN ++DLF+LNALLNMYSDNG LESS ALFNAVPTSDAALWN
Sbjct: 273 AEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWN 332

Query: 256 SMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMK 315
           SMIS+YI FGFHAEAIAL+IKMRLE +KED RT+AIMLSLC DLNDGSIWGRGLHAHAMK
Sbjct: 333 SMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMK 392

Query: 316 SGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQL 375
           SG+ELD +LGNALLSMYV+HNQI AAQ +F+KMRGLDVISWNTMI A AQS FRAKAF+L
Sbjct: 393 SGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFEL 452

Query: 376 FMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 435
           F+ MCESEIKFNSYT++SLLA CKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN
Sbjct: 453 FLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 512

Query: 436 CSDEGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTII 493
           C DE +ATN+F RCPQRDL+SWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTII
Sbjct: 513 CGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTII 569

BLAST of Cp4.1LG16g08940 vs. NCBI nr
Match: gi|731392814|ref|XP_010651228.1| (PREDICTED: pentatricopeptide repeat-containing protein At2g33680-like [Vitis vinifera])

HSP 1 Score: 534.3 bits (1375), Expect = 2.6e-148
Identity = 269/440 (61.14%), Positives = 340/440 (77.27%), Query Frame = 1

Query: 63  LVLKACGRLNAIEKGAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHV 122
           L+++  GR N       L PNSRT+V LLLAC    ELRLG  +HGYCLRNG+FD + HV
Sbjct: 147 LLVREMGREN-------LRPNSRTMVALLLACEGASELRLGRGVHGYCLRNGMFDSNPHV 206

Query: 123 GTALIGFYMRFDAAVSHRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIK 182
            TALIGFY+RFD  V   +F LM VRN+VSWNAMI+GY ++GDY KAL+LF  ML + +K
Sbjct: 207 ATALIGFYLRFDMRVLPLLFDLMVVRNIVSWNAMISGYYDVGDYFKALELFVQMLVDEVK 266

Query: 183 FDAVTMLLVIQACAESESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCAL 242
           FD VTML+ +QACAE  SL+LG Q+HQLAIKF FV+DL++LNALLNMYS+NG LESS  L
Sbjct: 267 FDCVTMLVAVQACAELGSLKLGKQIHQLAIKFEFVEDLYILNALLNMYSNNGSLESSHQL 326

Query: 243 FNAVPTSDAALWNSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDG 302
           F +VP  DA LWNSMISAY AFG H EA+ L+I+M+ EG+K+D+RTV IMLS+CE+L  G
Sbjct: 327 FESVPNRDAPLWNSMISAYAAFGCHEEAMDLFIRMQSEGVKKDERTVVIMLSMCEELASG 386

Query: 303 SIWGRGLHAHAMKSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILA 362
            + G+ LHAH +KSGM +D  LGNALLSMY E N +++ QK+FD+M+G+D+ISWNTMILA
Sbjct: 387 LLKGKSLHAHVIKSGMRIDASLGNALLSMYTELNCVESVQKIFDRMKGVDIISWNTMILA 446

Query: 363 LAQSKFRAKAFQLFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEIN 422
           LA++  RA+A +LF  M ESEIK NSYT+IS+LA C+D + L FGRSIHG+ +K+ +EIN
Sbjct: 447 LARNTLRAQACELFERMRESEIKPNSYTIISILAACEDVTCLDFGRSIHGYVMKHSIEIN 506

Query: 423 TSLNTSLTEMYINCSDEGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMIS 482
             L T+L +MY+NC DE +A +LF  CP RDLISWN++I+SY+KN+ A KALLLF+ MIS
Sbjct: 507 QPLRTALADMYMNCGDEATARDLFEGCPDRDLISWNAMIASYVKNNQAHKALLLFHRMIS 566

Query: 483 ELEPNSVTIIKEAVKKIHMS 503
           E EPNSVTII       H++
Sbjct: 567 EAEPNSVTIINVLSSFTHLA 579

BLAST of Cp4.1LG16g08940 vs. NCBI nr
Match: gi|1009118471|ref|XP_015875878.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g27110-like [Ziziphus jujuba])

HSP 1 Score: 510.0 bits (1312), Expect = 5.2e-141
Identity = 254/414 (61.35%), Positives = 325/414 (78.50%), Query Frame = 1

Query: 79  GLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVS 138
           GL PNSRTVV LLLAC E+LELRLG EIHGYC+RNGLFD+D HVGTALIGFY+RFD  +S
Sbjct: 155 GLKPNSRTVVALLLACREILELRLGQEIHGYCVRNGLFDLDPHVGTALIGFYLRFDVRIS 214

Query: 139 HRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAES 198
           H VF LM  RN VSWNA+ITGY+  G++  A KLF  ML + +KFD+VT++ +IQACAE 
Sbjct: 215 HIVFDLMVARNTVSWNAIITGYVENGEHLTAWKLFMRMLVDRVKFDSVTVIAIIQACAEL 274

Query: 199 ESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMI 258
             L+LGMQ+HQ+AIK  + ++LFV NALLNMYS++G  E SC LF+ +P  D ALWNSMI
Sbjct: 275 GFLELGMQMHQMAIKSGYSNNLFVANALLNMYSESGSFELSCQLFDTIPKYDVALWNSMI 334

Query: 259 SAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGM 318
            AYI +GF+ EA+ L++ M++ G+++D+RT+AIMLSLC +L DG   G+ LHAHA+K GM
Sbjct: 335 YAYIGYGFYEEAMFLFLNMQVFGIRDDERTIAIMLSLCANLADGMGMGKSLHAHAIKRGM 394

Query: 319 ELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMT 378
           ELDV LGNA L MY E N I+AA+K+F +++G DVISWNT+I+ALA +K R +A+  F  
Sbjct: 395 ELDVSLGNAFLGMYAEQNCIEAARKVFTEIKGPDVISWNTLIMALACNKLRNEAWNHFEE 454

Query: 379 MCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSD 438
           +  S+IK NS+T+ISLLA C D + L  GR+IHGFA+K+ ++I+ SLNT+LT+MY+NC D
Sbjct: 455 IQASKIKPNSHTIISLLAACDDETCLNSGRAIHGFAVKHDIQIDLSLNTALTDMYMNCGD 514

Query: 439 EGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTII 493
           E +A +LF  CP RD+ISWN+LISSYIK +   KA  LFN MISE+EPNSVTII
Sbjct: 515 EATARSLFEACPNRDVISWNALISSYIKKNEGKKAQELFNRMISEVEPNSVTII 568

BLAST of Cp4.1LG16g08940 vs. NCBI nr
Match: gi|590583995|ref|XP_007015051.1| (Pentatricopeptide repeat-containing protein [Theobroma cacao])

HSP 1 Score: 509.6 bits (1311), Expect = 6.8e-141
Identity = 251/424 (59.20%), Positives = 326/424 (76.89%), Query Frame = 1

Query: 79  GLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFDAAVS 138
           G  PNSRT+V +LLAC E+ E+RLG EIHGYCLRNGLFD+D HVGTALIGFY+ F+   S
Sbjct: 154 GFRPNSRTLVAMLLACQEVAEVRLGKEIHGYCLRNGLFDLDPHVGTALIGFYLSFNVRAS 213

Query: 139 HRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQACAES 198
           H VF LM VRN V WNAMI GY +IG+  KALKLF  ML +G++FD+VTML +IQACAE 
Sbjct: 214 HTVFDLMAVRNTVCWNAMIKGYFDIGESLKALKLFEKMLMDGVEFDSVTMLALIQACAEF 273

Query: 199 ESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALWNSMI 258
            SL+LG Q+HQ+AIK ++ +DLF++NALLNMY+D G L+S+C LF+  P  D ALWNSMI
Sbjct: 274 GSLELGSQIHQMAIKCSYSNDLFIVNALLNMYADIGSLKSACKLFDVTPRRDVALWNSMI 333

Query: 259 SAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAMKSGM 318
           SAY  +  + EA +L++ MR EG KED RT+ IM SLC +  DG   G+ LHA+A KSGM
Sbjct: 334 SAYFEYSCNEEATSLFVHMRTEGNKEDDRTIVIMFSLCAESADGLRKGKSLHAYASKSGM 393

Query: 319 ELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQLFMT 378
            +DV LGNA+L+MY + N ID+ QK+F +M  +DVIS+NT+ILALA++K  ++A+++F  
Sbjct: 394 RMDVNLGNAMLNMYAQQNCIDSVQKVFSEMSNVDVISFNTVILALARNKLGSEAWEVFGL 453

Query: 379 MCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCSD 438
           M E +++ NSYT+IS+LA CKD + L  GRS+HGF IK G+E+N SLNT+LT+MYINC D
Sbjct: 454 MWELDVEPNSYTIISILAACKDETCLNIGRSLHGFVIKQGIEVNVSLNTALTDMYINCGD 513

Query: 439 EGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIIKEAVKK 498
           E +A NLF  CP RDLISWN+LI++Y+KN+ A +A L+F+ MISE+EPNSVTII      
Sbjct: 514 EATARNLFESCPGRDLISWNALIATYVKNNLAHEAFLVFSRMISEVEPNSVTIINILSSC 573

Query: 499 IHMS 503
            H++
Sbjct: 574 THLA 577

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
PP348_ARATH   3.6e-57    31.99       Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana GN...
PP303_ARATH   3.0e-56    31.17       Pentatricopeptide repeat-containing protein At4g04370 OS=Arabidopsis thaliana GN...
PP268_ARATH   1.1e-53    29.51       Putative pentatricopeptide repeat-containing protein At3g47840 OS=Arabidopsis th...
PP320_ARATH   1.6e-52    29.85       Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
PP205_ARATH   1.3e-51    27.33       Putative pentatricopeptide repeat-containing protein At3g01580 OS=Arabidopsis th...
Match Name          E-value     Identity    Description
A0A061GS93_THECC    4.8e-141    59.20       Pentatricopeptide repeat-containing protein OS=Theobroma cacao GN=TCM_040700 PE=...
A5BC97_VITVI        1.2e-136    57.95       Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_043633 PE=4 SV=1
A0A0D2N9C6_GOSRA    4.3e-134    56.24       Uncharacterized protein OS=Gossypium raimondii GN=B456_001G138400 PE=4 SV=1
W9RTH5_9ROSA        1.6e-133    57.98       Uncharacterized protein OS=Morus notabilis GN=L484_015292 PE=4 SV=1
A0A067JE40_JATCU    8.4e-130    57.01       Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25923 PE=4 SV=1
Match Name     E-value    Identity    Description
AT4G33990.1    2.0e-58    31.99       Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G04370.1    1.7e-57    31.17       Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G47840.1    6.1e-55    29.51       Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1    8.8e-54    29.85       Pentatricopeptide repeat (PPR) superfamily protein
AT3G01580.1    7.5e-53    27.33       Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name                           E-value     Identity    Description
gi|659097605|ref|XP_008449715.1|     9.7e-204    85.85       PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis m...
gi|449448940|ref|XP_004142223.1|     1.7e-203    85.85       PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580 [Cucum...
gi|731392814|ref|XP_010651228.1|     2.6e-148    61.14       PREDICTED: pentatricopeptide repeat-containing protein At2g33680-like [Vitis vin...
gi|1009118471|ref|XP_015875878.1|    5.2e-141    61.35       PREDICTED: pentatricopeptide repeat-containing protein At5g27110-like [Ziziphus ...
gi|590583995|ref|XP_007015051.1|     6.8e-141    59.20       Pentatricopeptide repeat-containing protein [Theobroma cacao]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0005515    protein binding
Vocabulary: INTERPRO
Term         Definition
IPR011990    TPR-like_helical_dom_sf
IPR002885    Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG16g08940.1    Cp4.1LG16g08940.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description                          Source     Source Term          Source Description    Alignment
IPR002885    Pentatricopeptide repeat                 PFAM       PF01535              PPR                   coord: 223..244, score: 1.3; coord: 326..351, score: 0.0061; coord: 455..482, score: 1.9E-5; coord: 254..282, score: 9.
IPR002885    Pentatricopeptide repeat                 PFAM       PF13041              PPR_2                 coord: 149..196, score: 9.1E-12; coord: 352..399, score: 2.
IPR002885    Pentatricopeptide repeat                 TIGRFAMs   TIGR00756            TIGR00756             coord: 455..482, score: 6.2E-5; coord: 151..184, score: 1.
IPR002885    Pentatricopeptide repeat                 PROFILE    PS51375              PPR                   coord: 352..386, score: 8.966; coord: 453..483, score: 9.065; coord: 22..56, score: 6.182; coord: 250..284, score: 9.767; coord: 184..218, score: 5.886; coord: 321..351, score: 7.969; coord: 219..249, score: 7.125; coord: 149..183, score: 11
IPR011990    Tetratricopeptide-like helical domain    GENE3D     G3DSA:1.25.40.10                           coord: 152..276, score: 7.
None         No IPR available                         PANTHER    PTHR24015            FAMILY NOT NAMED      coord: 322..492, score: 3.0E-134; coord: 98..285, score: 3.0E-134; coord: 26..71, score: 3.0E
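
Each alignment entry in the table above gives protein residue coordinates as `start..end`. As a minimal sketch, the PROSITE PS51375 hits can be sorted along the protein and converted to lengths. This assumes the coordinates are 1-based and inclusive, which the page does not state, and the function name `parse_coords` is purely illustrative.

```python
import re

# PS51375 (PPR profile) coordinates copied from the InterPro table above.
ps51375 = (
    "coord: 352..386 coord: 453..483 coord: 22..56 coord: 250..284 "
    "coord: 184..218 coord: 321..351 coord: 219..249 coord: 149..183"
)

def parse_coords(text):
    """Return (start, end) pairs sorted by start position."""
    pairs = [(int(a), int(b)) for a, b in re.findall(r"coord: (\d+)\.\.(\d+)", text)]
    return sorted(pairs)

repeats = parse_coords(ps51375)

# Length under the assumed inclusive convention: end - start + 1.
lengths = [end - start + 1 for start, end in repeats]

for (start, end), n in zip(repeats, lengths):
    print(f"{start:>3}..{end:<3}  {n} residues")
```

Under that assumption every hit spans 31 or 35 residues, and the hits from 149 to 284 are contiguous.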

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene             Organism                      Block
Cp4.1LG16g08940  Cucumber (Chinese Long) v3    cpecucB0333
Cp4.1LG16g08940  Cucumber (Chinese Long) v3    cpecucB0369
Cp4.1LG16g08940  Wax gourd                     cpewgoB0374
Cp4.1LG16g08940  Cucurbita pepo (Zucchini)     cpecpeB156
Cp4.1LG16g08940  Cucurbita pepo (Zucchini)     cpecpeB286
Cp4.1LG16g08940  Cucurbita pepo (Zucchini)     cpecpeB287
Cp4.1LG16g08940  Cucurbita pepo (Zucchini)     cpecpeB305
Cp4.1LG16g08940  Cucurbita pepo (Zucchini)     cpecpeB310
Cp4.1LG16g08940  Cucumber (Gy14) v1            cgycpeB0097
Cp4.1LG16g08940  Cucurbita maxima (Rimu)       cmacpeB375
Cp4.1LG16g08940  Cucurbita maxima (Rimu)       cmacpeB627
Cp4.1LG16g08940  Cucurbita maxima (Rimu)       cmacpeB913
Cp4.1LG16g08940  Cucurbita moschata (Rifu)     cmocpeB053
Cp4.1LG16g08940  Cucurbita moschata (Rifu)     cmocpeB102
Cp4.1LG16g08940  Cucurbita moschata (Rifu)     cmocpeB338
Cp4.1LG16g08940  Cucurbita moschata (Rifu)     cmocpeB576
Cp4.1LG16g08940  Cucurbita moschata (Rifu)     cmocpeB850
Cp4.1LG16g08940  Wild cucumber (PI 183967)     cpecpiB268
Cp4.1LG16g08940  Wild cucumber (PI 183967)     cpecpiB302
Cp4.1LG16g08940  Cucumber (Chinese Long) v2    cpecuB279
Cp4.1LG16g08940  Cucumber (Chinese Long) v2    cpecuB302
Cp4.1LG16g08940  Bottle gourd (USVL1VR-Ls)     cpelsiB238
Cp4.1LG16g08940  Bottle gourd (USVL1VR-Ls)     cpelsiB246
Cp4.1LG16g08940  Watermelon (Charleston Gray)  cpewcgB255
Cp4.1LG16g08940  Melon (DHL92) v3.5.1          cpemeB270
Cp4.1LG16g08940  Cucumber (Gy14) v2            cgybcpeB037
Cp4.1LG16g08940  Cucumber (Gy14) v2            cgybcpeB496
Cp4.1LG16g08940  Cucumber (Gy14) v2            cgybcpeB922
Cp4.1LG16g08940  Melon (DHL92) v3.6.1          cpemedB315
Cp4.1LG16g08940  Silver-seed gourd             carcpeB0246
Cp4.1LG16g08940  Silver-seed gourd             carcpeB0654
Cp4.1LG16g08940  Silver-seed gourd             carcpeB1273