Cucsat.G7891.T6 (mRNA) Cucumber (B10) v3

Overview
Name: Cucsat.G7891.T6
Type: mRNA
Organism: Cucumis sativus L. var. sativus cv B10 (Cucumber (B10) v3)
Description: Pentatricopeptide repeat-containing protein
Location: ctg1556: 3471064 .. 3475113 (+)
RNA-Seq Expression: Cucsat.G7891.T6
Synteny: Cucsat.G7891.T6
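
The Location field in the overview can be read programmatically. The following is a minimal sketch that splits it into contig, start, end and strand and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a location string such as 'ctg1556: 3471064 .. 3475113 (+)' into its parts."""
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
    return {"contig": contig, "start": start, "end": end,
            "strand": strand, "length": end - start + 1}

loc = parse_location("ctg1556: 3471064 .. 3475113 (+)")
print(loc)
```

Under the inclusive-coordinate assumption, the feature spans 4050 bases on the plus strand of ctg1556.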
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TAACAGCTTACATTCAAATGGGTAAGGAGGACTGTGGGCTTCAAGCATTTAAAAGAATGCGGGCAAGCAATGTGATTCCAAATGAATATACATTTTCTGCTGTTATATCTTGTTGTGCTAATTTTGCAAGGTTGAAGTGGGGGGAGCAACTACATGCGCATGTTTTATGTGTTGGGTTCGTCAATGCTTTGTCAGTTGCTAACTCTATCATGACCCTGTACTCAAAATGTGGGGAGTTAGCCTCAGTTTCAAAGGTATTTTGTTCAATGAAATTTAGAGACATCATTACTTGGAGCACTATTATTGCGGCGTATTCTCAAGTAGGCTATGGCGAAGAAGCTTTTGAGTATCTATCACGAATGAGGAGTGAAGGACCGAAACCAAATGAGTTTGCCCTGGCTAGCGTGTTGAGTGTATGTGGAAGTATGGCGATTCTCGAGCAGGGGAAGCAATTGCATGCTCATGTTTTGTCTGTTGGATTAGAACAGACATCCATGGTATGTAGTGCTCTTATTATTATGTATGCAAAATGTGGGAGCATTGCGGAAGCTTCTAAGATCTTTATGGATTCGTGGAAAGATGACATCATTTCATGGACAGCAATGATCAGCGGGTATGCTGAACATGGACACAGCCAAGAAGCCATTGAATTGTTTGAAAATATCCAAAAGGTTGGTTTGAGACCAGACTCCGTGACCTTCATAGGCGTCCTTACTGCTTGTAGCCATGCAGGAATGGTTGACCTTGGTTTCTACTACTTCAATTCAATGAGCAAAGATTATCACATCACTCCTTCAAAAGAACACTATGGATGTATGATTGATCTTCTTTGTCGAGCAGGACGATTGCATGATGCAGAGACCTTGATCAGAAGCATGCCAATTCAATGGGACGATGTTGTCTGGTCTACATTGCTGAGGGCGTGTAGAATCCATGGTGATGTTGATTGTGGACAGCGTGCTGCTGCTGAAGTTCTAAAGTTAGATCCAAATTGTGCTGGGACTCACATAACCTTAGCAAACATTTTTGCTGCTAAGGGAAAGTGGAAGGAAGCAGCAAATATAAGAATGTTAATGAAATCAAAGGGGGTGGTTAAAGAGCCAGGATGGTCTTCGGTAAAGGTCAAGGATAGTGTTTTCGCATTTGTTTCTGGAGATCGTTCACATCCACAAGGAGAAGACATATACAATATTTTGGAGGAGTTGGCTTCAGGAATGGAGATCTATATTCTTGAATTGAATCATTTAGTAACTGATGATAGTGAAGAATAATGAAAGCTTGATTTGTATAGACTCCGTTTTCGATGGATGGCTGTCTCCTTGATGCCTTTACATCATAGAAGTGAAATCACGGACAGGAAAACTGGGATCACTTTTGATTTTTTAAGAAGTGTAGAGGTTGCTCATGAAGGTGCTGGTACATTTTTGAGAATTCTGTTTAATCAATTGTGATGCTTATGCATCTGGTTTGAACTAGCGATGTCAAAGATGAAGGGTCCAGTACATAGAGGTTATTTTTTCAAATGTTGATGATATCTCATTTCTAGGAATCCGGGTTCAGTTCATATCATGCTTCATTACATGCTACTGTGATTTATATATTACCACTTGCACAACTGTTGTGGATGCTAGAATGTTGGTTTTTGTGCTTCTATTTTATAAAAAATGTGTATATTGCGAATGGCACTTAGACCTATTTAGTACTAAAATTGTCCAGACAACTTACATTCAGGCCTTTTTAATCATGTCCACTTTTCTTTATAATTCTGATATTGTTGAGCTCTTGTATGGTTCGAATGGCATGTAGTTGGATATATGTGATGGAAAGTGCACGTTTAGTTGACTCAATGCTTTTTATGGCAAAGTCTTTTTAGTTCGTTTTTTCGAAGTATCATCAAGACACTGGTTGATCATTTTGCTTATGTGGACGGTTTGTTATATAGGTAGGTGCCTTCCATTGTTGGACTTCTTCAAGACGAGGTTGATTCAATGGTGTCTGTG
ATGAAAGTTGAGAAGGCCCCTTTAGAGTCATAAGTTGATATTGGTGGATTGGATGCTCAAATTCAAGAGATTAAAGAAGCTGTTGAGTTGCTATTTACTCATCCTGAGTTATATGACACTACAGCAAATAACCCTTTTTATAGCCATTCATTATGCCTTTTATTAATGGGCTATAAAAAAGAGAAGGAAACGAAAAACAAAAAGCTGAAAGCTAAAATTTTCTTATTCTTAAAAGTGAGCATTGAAATTTCCTTAAACCCATTTTTGTCTTAACACTTTGTCTTCCCATTTCTCTCTATCTCTCTCCTTTGGCAAATACTCTACTTCCATCGCCGAAACCCAAAATGCCGTGTCACAAGACTTTTAGAATCAAGAAGACACAAGATTTTTAGAATCAAGAAGAAGCTTGTGAAGAAGATGAGGCATAATAGGTCGATCTCGCACTGAATCTGCCTGAGAACCGACAACATGATCAGATACAATGCAAAGTGCAGGCACTAGCGTCGCACCAAGCTAGGGTTCTGAGGTGCTTTCGATTCCTAACTTCAATGTTCATTTACCTTGATTTTTTTAAGTTTCAGAACTTGAATTTTCTTATGGATCTACATTTGTATTTCCGCAACTGAAATTTAGTTTATTGGATACTTTTGTTTAATCATCCATTCATCCTTACACTCAAGATTTAAGCGCTTTGGAATAGACTACTGATAGGTGACTTTTATTTCAAAAAATAAGATGTGATCTTTAAAATTTTAATATTTCTAGTCGATTTGAAAAACATAGTTTCTAGTCATTGATCTGTGATTTAGATTGAGAGAGAGAACTTCATTAAGAGATTATGATGAAGAAGAGGTTTTCTTGCCGTGACTGTTGGTTCTTAATGTTTGAGTTCATGTTAAGTTTAGTCGTGTTTTATCTTGTTCATCATTGTATTTGCACTATTTTATATCTTTATCAAATTCCCAATCATGAGAGAGTTGACTCATATTATTTTGTTGGGAACTTCTTTGGGGCAAAAAACTTGACAATGATGAATTTTTCTGGGAACTTTAACCTATTTAACCTATTAAGTACGTAAAGTTTGGTAGAAACTAAAGTAAGTAGGTTAGACATACATCAATTATACATTTTCTTGTTTAAGTAATCATGGTTTTTCTTTCTTTTTTCACAGTAACCTTCAAATGTTATGCTCTGTTTTCTTGTTTGAGGAAGGATTGCATACAGTGACCTTCAAATGTTAGAAAGCATAACCATACTGCTTCCATGTCTAAGGAGTTACCGTAGGTGTAGGGAGCAATATTTATAAACATGATTATCTCAAGACATTTTAGCTTTTGGCGCCTTAACCCTTGAGGCTTGCATTCTCAATTGAATATGGTCTCTTCCCTGCTCATTGCATTTGGATCTTAGTCTATCCCTTTATTTTCTCCATTTGAAAATAGACTTTATTTTCCTTACTGATGTTTTGACTGTCATTCTTAGGTTCTGACACCCAACATTCTTGATATTACTGTTGAACCACCTGAGAAGGATCATCTCCGCCATGTCATTGACACTATGGCTCTTTATGTTCTGGATGGAGGTTGTGTTTTTGACTAGATGAGCACAGTTGCAATCAACAATAGAGAAACAGAAACCACCAAATAGATAATGATGACAATCCGAAAGCAGGCCACAAAAAGAGCATAACAAAAGTAGCCAAACCTCATCTAGACCCAGAAGTAACAAAGCAACATAAATCCATTACCAAAATATCAAAGATCCATTAATGGTGAAAGGAATAACCAAAAGCAAAAAATGTGTAACGGGAAACACTGAAACAAAGTACTCTCCCTTGCTCGCTGCCAAATAAATTTAAGACGAAAAAACAGATGTAGTAGTATCAAAAGCCTGCCATTGTAAGTCTCTAAGCAAGAAATCCACAATGAACCGACGATCATATCATACAAATCATGTATAACACACGTCCCTTAACTTACAACGAATCTTCTCAATCAAGT
GGCTCACAATGCTACAAACAATGTGTTTATCACCACACACCACACAAATT

Coding sequence (CDS)

AACAGCTTACATTCAAATGGGTTGAAGTGGGGGGAGCAACTACATGCGCATGTTTTATGTGTTGGGTTCGTCAATGCTTTGTCAGTTGCTAACTCTATCATGACCCTGTACTCAAAATGTGGGGAGTTAGCCTCAGTTTCAAAGGTATTTTGTTCAATGAAATTTAGAGACATCATTACTTGGAGCACTATTATTGCGGCGTATTCTCAAGTAGGCTATGGCGAAGAAGCTTTTGAGTATCTATCACGAATGAGGAGTGAAGGACCGAAACCAAATGAGTTTGCCCTGGCTAGCGTGTTGAGTGTATGTGGAAGTATGGCGATTCTCGAGCAGGGGAAGCAATTGCATGCTCATGTTTTGTCTGTTGGATTAGAACAGACATCCATGGTATGTAGTGCTCTTATTATTATGTATGCAAAATGTGGGAGCATTGCGGAAGCTTCTAAGATCTTTATGGATTCGTGGAAAGATGACATCATTTCATGGACAGCAATGATCAGCGGGTATGCTGAACATGGACACAGCCAAGAAGCCATTGAATTGTTTGAAAATATCCAAAAGGTTGGTTTGAGACCAGACTCCGTGACCTTCATAGGCGTCCTTACTGCTTGTAGCCATGCAGGAATGGTTGACCTTGGTTTCTACTACTTCAATTCAATGAGCAAAGATTATCACATCACTCCTTCAAAAGAACACTATGGATGTATGATTGATCTTCTTTGTCGAGCAGGACGATTGCATGATGCAGAGACCTTGATCAGAAGCATGCCAATTCAATGGGACGATGTTGTCTGGTCTACATTGCTGAGGGCGTGTAGAATCCATGGTGATGTTGATTGTGGACAGCGTGCTGCTGCTGAAGTTCTAAAGTTAGATCCAAATTGTGCTGGGACTCACATAACCTTAGCAAACATTTTTGCTGCTAAGGGAAAGTGGAAGGAAGCAGCAAATATAAGAATGTTAATGAAATCAAAGGGGGTGGTTAAAGAGCCAGGATGGTCTTCGGTAAAGGTCAAGGATAGTGTTTTCGCATTTGTTTCTGGAGATCGTTCACATCCACAAGGAGAAGACATATACAATATTTTGGAGGAGTTGGCTTCAGGAATGGAGATCTATATTCTTGAATTGAATCATTTAGTAACTGATGATAGTGAAGAATAA

Protein sequence

NSLHSNGLKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAAYSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQTSMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQKVGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLHDAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFAAKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELASGMEIYILELNHLVTDDSEE
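
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch only: it assumes the standard genetic code and reading frame 0, and the helper name `translate` is illustrative rather than part of any database tooling. The example input is the first 36 bases of the CDS listed above, which begins with AAC rather than an ATG start codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds, stop_at_stop=True):
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)

prefix = translate("AACAGCTTACATTCAAATGGGTTGAAGTGGGGGGAG")
print(prefix)  # NSLHSNGLKWGE
```

The printed peptide matches the first twelve residues of the protein sequence given on this page. Passing the full CDS string would translate the remainder up to the terminal TAA stop codon.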
Homology
BLAST of Cucsat.G7891.T6 vs. ExPASy Swiss-Prot
Match: Q9STS9 (Putative pentatricopeptide repeat-containing protein At3g47840 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E43 PE=3 SV=1)

HSP 1 Score: 466.8 bits (1200), Expect = 2.3e-130
Identity = 226/365 (61.92%), Positives = 283/365 (77.53%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           L WGEQLH +VL +G  ++LSV+NS+M +YS CG L S S +F  M+ RDII+WSTII  
Sbjct: 326 LVWGEQLHCNVLSLGLNDSLSVSNSMMKMYSTCGNLVSASVLFQGMRCRDIISWSTIIGG 385

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           Y Q G+GEE F+Y S MR  G KP +FALAS+LSV G+MA++E G+Q+HA  L  GLEQ 
Sbjct: 386 YCQAGFGEEGFKYFSWMRQSGTKPTDFALASLLSVSGNMAVIEGGRQVHALALCFGLEQN 445

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
           S V S+LI MY+KCGSI EAS IF ++ +DDI+S TAMI+GYAEHG S+EAI+LFE   K
Sbjct: 446 STVRSSLINMYSKCGSIKEASMIFGETDRDDIVSLTAMINGYAEHGKSKEAIDLFEKSLK 505

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VG RPDSVTFI VLTAC+H+G +DLGF+YFN M + Y++ P+KEHYGCM+DLLCRAGRL 
Sbjct: 506 VGFRPDSVTFISVLTACTHSGQLDLGFHYFNMMQETYNMRPAKEHYGCMVDLLCRAGRLS 565

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAE +I  M  + DDVVW+TLL AC+  GD++ G+RAA  +L+LDP CA   +TLANI++
Sbjct: 566 DAEKMINEMSWKKDDVVWTTLLIACKAKGDIERGRRAAERILELDPTCATALVTLANIYS 625

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           + G  +EAAN+R  MK+KGV+KEPGWSS+K+KD V AFVSGDR HPQ EDIYNILE   S
Sbjct: 626 STGNLEEAANVRKNMKAKGVIKEPGWSSIKIKDCVSAFVSGDRFHPQSEDIYNILELAVS 685

Query: 368 GMEIY 373
           G E +
Sbjct: 686 GAEAH 690

BLAST of Cucsat.G7891.T6 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 313.5 bits (802), Expect = 3.2e-84
Identity = 147/383 (38.38%), Positives = 241/383 (62.92%), Query Frame = 0

Query: 5   SNGLKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTI 64
           S  ++ G Q+H  +   GF + L + N+++ LYSKCGEL +   +F  + ++D+I+W+T+
Sbjct: 279 SGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTL 338

Query: 65  IAAYSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHV--LSV 124
           I  Y+ +   +EA      M   G  PN+  + S+L  C  +  ++ G+ +H ++     
Sbjct: 339 IGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLK 398

Query: 125 GLEQTSMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELF 184
           G+   S + ++LI MYAKCG I  A ++F       + SW AMI G+A HG +  + +LF
Sbjct: 399 GVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLF 458

Query: 185 ENIQKVGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCR 244
             ++K+G++PD +TF+G+L+ACSH+GM+DLG + F +M++DY +TP  EHYGCMIDLL  
Sbjct: 459 SRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGH 518

Query: 245 AGRLHDAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITL 304
           +G   +AE +I  M ++ D V+W +LL+AC++HG+V+ G+  A  ++K++P   G+++ L
Sbjct: 519 SGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLL 578

Query: 305 ANIFAAKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNIL 364
           +NI+A+ G+W E A  R L+  KG+ K PG SS+++   V  F+ GD+ HP+  +IY +L
Sbjct: 579 SNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGML 638

Query: 365 EELASGMEIYILELNHLVTDDSE 386
           EE    ME+ +LE    V D SE
Sbjct: 639 EE----MEV-LLEKAGFVPDTSE 656

BLAST of Cucsat.G7891.T6 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 311.2 bits (796), Expect = 1.6e-83
Identity = 152/382 (39.79%), Positives = 244/382 (63.87%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           ++ GE +H+ V+  GF + + V NS++ LY+ CG++AS  KVF  M  +D++ W+++I  
Sbjct: 137 VRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVING 196

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           +++ G  EEA    + M S+G KP+ F + S+LS C  +  L  GK++H +++ VGL + 
Sbjct: 197 FAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRN 256

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
               + L+ +YA+CG + EA  +F +    + +SWT++I G A +G  +EAIELF+ ++ 
Sbjct: 257 LHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMES 316

Query: 188 V-GLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRL 247
             GL P  +TF+G+L ACSH GMV  GF YF  M ++Y I P  EH+GCM+DLL RAG++
Sbjct: 317 TEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQV 376

Query: 248 HDAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIF 307
             A   I+SMP+Q + V+W TLL AC +HGD D  + A  ++L+L+PN +G ++ L+N++
Sbjct: 377 KKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMY 436

Query: 308 AAKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELA 367
           A++ +W +   IR  M   GV K PG S V+V + V  F+ GD+SHPQ + IY  L+E+ 
Sbjct: 437 ASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMT 496

Query: 368 SGM--EIYILELNHLVTDDSEE 387
             +  E Y+ +++++  D  EE
Sbjct: 497 GRLRSEGYVPQISNVYVDVEEE 518

BLAST of Cucsat.G7891.T6 vs. ExPASy Swiss-Prot
Match: Q9SY02 (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 307.0 bits (785), Expect = 3.0e-82
Identity = 141/362 (38.95%), Positives = 230/362 (63.54%), Query Frame = 0

Query: 27  LSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAAYSQVGYGEEAFEYLSRMRS 86
           +S  N+++T Y++CG+++    +F  M  RD ++W+ +IA YSQ G+  EA     +M  
Sbjct: 343 VSTWNTMITGYAQCGKISEAKNLFDKMPKRDPVSWAAMIAGYSQSGHSFEALRLFVQMER 402

Query: 87  EGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQTSMVCSALIIMYAKCGSIAE 146
           EG + N  + +S LS C  +  LE GKQLH  ++  G E    V +AL++MY KCGSI E
Sbjct: 403 EGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGSIEE 462

Query: 147 ASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQKVGLRPDSVTFIGVLTACSH 206
           A+ +F +    DI+SW  MI+GY+ HG  + A+  FE++++ GL+PD  T + VL+ACSH
Sbjct: 463 ANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSACSH 522

Query: 207 AGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLHDAETLIRSMPIQWDDVVWS 266
            G+VD G  YF +M++DY + P+ +HY CM+DLL RAG L DA  L+++MP + D  +W 
Sbjct: 523 TGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWG 582

Query: 267 TLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFAAKGKWKEAANIRMLMKSKG 326
           TLL A R+HG+ +  + AA ++  ++P  +G ++ L+N++A+ G+W +   +R+ M+ KG
Sbjct: 583 TLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMRDKG 642

Query: 327 VVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELASGMEI--YILELNHLVTDDS 386
           V K PG+S +++++    F  GD  HP+ ++I+  LEEL   M+   Y+ + + ++ D  
Sbjct: 643 VKKVPGYSWIEIQNKTHTFSVGDEFHPEKDEIFAFLEELDLRMKKAGYVSKTSVVLHDVE 702

BLAST of Cucsat.G7891.T6 vs. ExPASy Swiss-Prot
Match: Q9SVP7 (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 303.9 bits (777), Expect = 2.6e-81
Identity = 144/358 (40.22%), Positives = 223/358 (62.29%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LK G+Q+HA     GF + L   N+++TLYS+CG++      F   +  D I W+ +++ 
Sbjct: 607 LKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSG 666

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           + Q G  EEA     RM  EG   N F   S +      A ++QGKQ+HA +   G +  
Sbjct: 667 FQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSE 726

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
           + VC+ALI MYAKCGSI++A K F++    + +SW A+I+ Y++HG   EA++ F+ +  
Sbjct: 727 TEVCNALISMYAKCGSISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIH 786

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
             +RP+ VT +GVL+ACSH G+VD G  YF SM+ +Y ++P  EHY C++D+L RAG L 
Sbjct: 787 SNVRPNHVTLVGVLSACSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLS 846

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
            A+  I+ MPI+ D +VW TLL AC +H +++ G+ AA  +L+L+P  + T++ L+N++A
Sbjct: 847 RAKEFIQEMPIKPDALVWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYA 906

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEEL 366
              KW      R  MK KGV KEPG S ++VK+S+ +F  GD++HP  ++I+   ++L
Sbjct: 907 VSKKWDARDLTRQKMKEKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDL 964

BLAST of Cucsat.G7891.T6 vs. NCBI nr
Match: XP_004142727.1 (putative pentatricopeptide repeat-containing protein At3g47840 [Cucumis sativus] >XP_011653730.1 putative pentatricopeptide repeat-containing protein At3g47840 [Cucumis sativus] >XP_031740494.1 putative pentatricopeptide repeat-containing protein At3g47840 [Cucumis sativus] >XP_031740495.1 putative pentatricopeptide repeat-containing protein At3g47840 [Cucumis sativus] >KGN54465.1 hypothetical protein Csa_012903 [Cucumis sativus])

HSP 1 Score: 767 bits (1980), Expect = 2.67e-274
Identity = 379/379 (100.00%), Positives = 379/379 (100.00%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA
Sbjct: 334 LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 393

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT
Sbjct: 394 YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 453

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
           SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK
Sbjct: 454 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 513

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH
Sbjct: 514 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 573

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA
Sbjct: 574 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 633

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS
Sbjct: 634 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 693

Query: 368 GMEIYILELNHLVTDDSEE 386
           GMEIYILELNHLVTDDSEE
Sbjct: 694 GMEIYILELNHLVTDDSEE 712

BLAST of Cucsat.G7891.T6 vs. NCBI nr
Match: XP_008447344.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At3g47840 [Cucumis melo] >XP_016900384.1 PREDICTED: putative pentatricopeptide repeat-containing protein At3g47840 [Cucumis melo] >KAA0037882.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYJ98016.1 putative pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 728 bits (1879), Expect = 5.94e-259
Identity = 360/379 (94.99%), Positives = 366/379 (96.57%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LKWGEQLHAHVL +GF+NALSV NSIMT+YSKCGELASVSKVFCSM FRDI+TWSTIIAA
Sbjct: 334 LKWGEQLHAHVLYIGFLNALSVGNSIMTMYSKCGELASVSKVFCSMNFRDIVTWSTIIAA 393

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           YSQVGY EE FEYLSRMRSEGP+PNEFALASVLS CGSMAILEQGKQLHAHVLS+GLEQT
Sbjct: 394 YSQVGYVEEVFEYLSRMRSEGPRPNEFALASVLSACGSMAILEQGKQLHAHVLSIGLEQT 453

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
            MVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK
Sbjct: 454 PMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 513

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRL 
Sbjct: 514 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLR 573

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAETLIRSMPIQ DDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA
Sbjct: 574 DAETLIRSMPIQRDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 633

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQ EDIYNILEELAS
Sbjct: 634 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQREDIYNILEELAS 693

Query: 368 GMEIYILELNHLVTDDSEE 386
            MEIYILELNHLV DD EE
Sbjct: 694 RMEIYILELNHLVNDDMEE 712

BLAST of Cucsat.G7891.T6 vs. NCBI nr
Match: XP_038887347.1 (putative pentatricopeptide repeat-containing protein At3g47840 [Benincasa hispida] >XP_038887348.1 putative pentatricopeptide repeat-containing protein At3g47840 [Benincasa hispida] >XP_038887349.1 putative pentatricopeptide repeat-containing protein At3g47840 [Benincasa hispida] >XP_038887350.1 putative pentatricopeptide repeat-containing protein At3g47840 [Benincasa hispida])

HSP 1 Score: 716 bits (1847), Expect = 4.17e-254
Identity = 352/378 (93.12%), Positives = 367/378 (97.09%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LKWGEQLHAHVL VGF NALSVANSIMT+YSKCGELASVSKVFCSM FRD+ITWSTIIAA
Sbjct: 334 LKWGEQLHAHVLRVGFRNALSVANSIMTMYSKCGELASVSKVFCSMNFRDVITWSTIIAA 393

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT
Sbjct: 394 YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 453

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
           SMVCSALIIMYAKCGSIAEASKIFMDS KDD+ISWTAMISGYAEHGHSQEAIELFENIQK
Sbjct: 454 SMVCSALIIMYAKCGSIAEASKIFMDSLKDDVISWTAMISGYAEHGHSQEAIELFENIQK 513

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VGLRPDSVTFIGVLTACSHAGMVDLGF+YFNSMSKDYHITPSKEHYGCMIDLLCRAG+L 
Sbjct: 514 VGLRPDSVTFIGVLTACSHAGMVDLGFHYFNSMSKDYHITPSKEHYGCMIDLLCRAGQLR 573

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAE+LIRSMP Q DDVVWS LLRACR+HGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA
Sbjct: 574 DAESLIRSMPFQGDDVVWSILLRACRVHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 633

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           AKGKWKEAANIRMLMKSKGVVKEPGWSS+K+KDS+FAFV+GDRSHP+GEDIY++LEELAS
Sbjct: 634 AKGKWKEAANIRMLMKSKGVVKEPGWSSIKIKDSIFAFVAGDRSHPRGEDIYSMLEELAS 693

Query: 368 GMEIYILELNHLVTDDSE 385
           G EIYILEL+HLVTD  E
Sbjct: 694 GTEIYILELDHLVTDMEE 711

BLAST of Cucsat.G7891.T6 vs. NCBI nr
Match: XP_023544314.1 (putative pentatricopeptide repeat-containing protein At3g47840 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 711 bits (1835), Expect = 2.77e-252
Identity = 351/378 (92.86%), Positives = 368/378 (97.35%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LKWGEQLHAHVL VGF+NALSVANSIMT+YSKCGELASVSKVFCSM F+D+ITWSTIIAA
Sbjct: 334 LKWGEQLHAHVLRVGFLNALSVANSIMTMYSKCGELASVSKVFCSMNFKDVITWSTIIAA 393

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           YSQVGYG+EAFEYLS+MRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT
Sbjct: 394 YSQVGYGKEAFEYLSQMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 453

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
           +MVCSALIIMYAKCGSI EASKIFMDS KDDIISWTAMISGYAEHGHSQEAIELFE+IQK
Sbjct: 454 AMVCSALIIMYAKCGSITEASKIFMDSLKDDIISWTAMISGYAEHGHSQEAIELFESIQK 513

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VGLRPDSVTFIGVLTACSHAGMVDLGF+YFNSMSKDYHITPSKEHYGCMIDLLCRAGRL+
Sbjct: 514 VGLRPDSVTFIGVLTACSHAGMVDLGFHYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLN 573

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAE+LIRSMP Q DDVVWSTLLRACRIHGDVDCGQRAAAEVLKL+PNCAGTHITLANIFA
Sbjct: 574 DAESLIRSMPFQRDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLNPNCAGTHITLANIFA 633

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           AKGKWKEAANIRM+MKSKGVVKEPGWSS+K+KDSVFAFV+GDRS PQGEDIY +LEELAS
Sbjct: 634 AKGKWKEAANIRMIMKSKGVVKEPGWSSIKLKDSVFAFVAGDRSLPQGEDIYRMLEELAS 693

Query: 368 GMEIYILELNHLVTDDSE 385
           GMEIYILELNHLVTD  E
Sbjct: 694 GMEIYILELNHLVTDMEE 711

BLAST of Cucsat.G7891.T6 vs. NCBI nr
Match: XP_023544313.1 (putative pentatricopeptide repeat-containing protein At3g47840 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 711 bits (1835), Expect = 6.54e-252
Identity = 351/378 (92.86%), Positives = 368/378 (97.35%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LKWGEQLHAHVL VGF+NALSVANSIMT+YSKCGELASVSKVFCSM F+D+ITWSTIIAA
Sbjct: 359 LKWGEQLHAHVLRVGFLNALSVANSIMTMYSKCGELASVSKVFCSMNFKDVITWSTIIAA 418

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           YSQVGYG+EAFEYLS+MRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT
Sbjct: 419 YSQVGYGKEAFEYLSQMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 478

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
           +MVCSALIIMYAKCGSI EASKIFMDS KDDIISWTAMISGYAEHGHSQEAIELFE+IQK
Sbjct: 479 AMVCSALIIMYAKCGSITEASKIFMDSLKDDIISWTAMISGYAEHGHSQEAIELFESIQK 538

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VGLRPDSVTFIGVLTACSHAGMVDLGF+YFNSMSKDYHITPSKEHYGCMIDLLCRAGRL+
Sbjct: 539 VGLRPDSVTFIGVLTACSHAGMVDLGFHYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLN 598

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAE+LIRSMP Q DDVVWSTLLRACRIHGDVDCGQRAAAEVLKL+PNCAGTHITLANIFA
Sbjct: 599 DAESLIRSMPFQRDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLNPNCAGTHITLANIFA 658

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           AKGKWKEAANIRM+MKSKGVVKEPGWSS+K+KDSVFAFV+GDRS PQGEDIY +LEELAS
Sbjct: 659 AKGKWKEAANIRMIMKSKGVVKEPGWSSIKLKDSVFAFVAGDRSLPQGEDIYRMLEELAS 718

Query: 368 GMEIYILELNHLVTDDSE 385
           GMEIYILELNHLVTD  E
Sbjct: 719 GMEIYILELNHLVTDMEE 736

BLAST of Cucsat.G7891.T6 vs. ExPASy TrEMBL
Match: A0A0A0KXW2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G335250 PE=4 SV=1)

HSP 1 Score: 767 bits (1980), Expect = 1.29e-274
Identity = 379/379 (100.00%), Positives = 379/379 (100.00%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA
Sbjct: 334 LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 393

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT
Sbjct: 394 YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 453

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
           SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK
Sbjct: 454 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 513

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH
Sbjct: 514 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 573

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA
Sbjct: 574 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 633

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS
Sbjct: 634 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 693

Query: 368 GMEIYILELNHLVTDDSEE 386
           GMEIYILELNHLVTDDSEE
Sbjct: 694 GMEIYILELNHLVTDDSEE 712

BLAST of Cucsat.G7891.T6 vs. ExPASy TrEMBL
Match: A0A1S4DWM7 (putative pentatricopeptide repeat-containing protein At3g47840 OS=Cucumis melo OX=3656 GN=LOC103489816 PE=4 SV=1)

HSP 1 Score: 728 bits (1879), Expect = 2.88e-259
Identity = 360/379 (94.99%), Positives = 366/379 (96.57%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LKWGEQLHAHVL +GF+NALSV NSIMT+YSKCGELASVSKVFCSM FRDI+TWSTIIAA
Sbjct: 334 LKWGEQLHAHVLYIGFLNALSVGNSIMTMYSKCGELASVSKVFCSMNFRDIVTWSTIIAA 393

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           YSQVGY EE FEYLSRMRSEGP+PNEFALASVLS CGSMAILEQGKQLHAHVLS+GLEQT
Sbjct: 394 YSQVGYVEEVFEYLSRMRSEGPRPNEFALASVLSACGSMAILEQGKQLHAHVLSIGLEQT 453

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
            MVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK
Sbjct: 454 PMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 513

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRL 
Sbjct: 514 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLR 573

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAETLIRSMPIQ DDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA
Sbjct: 574 DAETLIRSMPIQRDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 633

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQ EDIYNILEELAS
Sbjct: 634 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQREDIYNILEELAS 693

Query: 368 GMEIYILELNHLVTDDSEE 386
            MEIYILELNHLV DD EE
Sbjct: 694 RMEIYILELNHLVNDDMEE 712

BLAST of Cucsat.G7891.T6 vs. ExPASy TrEMBL
Match: A0A5A7T329 (Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold487G00440 PE=4 SV=1)

HSP 1 Score: 728 bits (1879), Expect = 2.88e-259
Identity = 360/379 (94.99%), Positives = 366/379 (96.57%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LKWGEQLHAHVL +GF+NALSV NSIMT+YSKCGELASVSKVFCSM FRDI+TWSTIIAA
Sbjct: 334 LKWGEQLHAHVLYIGFLNALSVGNSIMTMYSKCGELASVSKVFCSMNFRDIVTWSTIIAA 393

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           YSQVGY EE FEYLSRMRSEGP+PNEFALASVLS CGSMAILEQGKQLHAHVLS+GLEQT
Sbjct: 394 YSQVGYVEEVFEYLSRMRSEGPRPNEFALASVLSACGSMAILEQGKQLHAHVLSIGLEQT 453

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
            MVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK
Sbjct: 454 PMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 513

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRL 
Sbjct: 514 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLR 573

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAETLIRSMPIQ DDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA
Sbjct: 574 DAETLIRSMPIQRDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 633

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQ EDIYNILEELAS
Sbjct: 634 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQREDIYNILEELAS 693

Query: 368 GMEIYILELNHLVTDDSEE 386
            MEIYILELNHLV DD EE
Sbjct: 694 RMEIYILELNHLVNDDMEE 712

BLAST of Cucsat.G7891.T6 vs. ExPASy TrEMBL
Match: A0A6J1GEB4 (putative pentatricopeptide repeat-containing protein At3g47840 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111453211 PE=4 SV=1)

HSP 1 Score: 704 bits (1817), Expect = 7.26e-250
Identity = 348/378 (92.06%), Positives = 366/378 (96.83%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LKWGEQLHAHVL VGFVNALSVANSIMT+YSKCGELASVSKVFCSM F+D+ITWSTIIAA
Sbjct: 334 LKWGEQLHAHVLRVGFVNALSVANSIMTMYSKCGELASVSKVFCSMNFKDVITWSTIIAA 393

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           YSQVGYG+EAFEYLS+MRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT
Sbjct: 394 YSQVGYGKEAFEYLSQMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 453

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
           +MVCSALIIMYAKCGSI EASKIF DS K+DIISWTAMISG+AEHGHSQEAIELFE+IQK
Sbjct: 454 AMVCSALIIMYAKCGSITEASKIFTDSLKNDIISWTAMISGHAEHGHSQEAIELFESIQK 513

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VGLRPDSVTFIGVLTACSHAGMVDLGF+YFNSMSKDYHITPSKEHYGCMIDLLCRAGRL+
Sbjct: 514 VGLRPDSVTFIGVLTACSHAGMVDLGFHYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLN 573

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAE+LIRSMP Q DDVVWSTLLRACRIHGDVDCGQRAAAEVLKL+PNCAGTHITLANIFA
Sbjct: 574 DAESLIRSMPFQRDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLNPNCAGTHITLANIFA 633

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           AKGKWKEAANIRM+MKSKGVVKEPGWSS+K+KDSVFAFV+GDRS PQGEDIY +LEELA 
Sbjct: 634 AKGKWKEAANIRMIMKSKGVVKEPGWSSIKLKDSVFAFVAGDRSPPQGEDIYRMLEELAL 693

Query: 368 GMEIYILELNHLVTDDSE 385
           GMEIYILELNHLVTD  E
Sbjct: 694 GMEIYILELNHLVTDMEE 711

BLAST of Cucsat.G7891.T6 vs. ExPASy TrEMBL
Match: A0A6J1IMM7 (putative pentatricopeptide repeat-containing protein At3g47840 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111478417 PE=4 SV=1)

HSP 1 Score: 704 bits (1816), Expect = 1.03e-249
Identity = 347/378 (91.80%), Positives = 366/378 (96.83%), Query Frame = 0

Query: 8   LKWGEQLHAHVLCVGFVNALSVANSIMTLYSKCGELASVSKVFCSMKFRDIITWSTIIAA 67
           LKWGEQLHAHVL VGF+NALSVANSIMT+YSKCGELASVSK+FCSM F+D+ITWSTIIAA
Sbjct: 334 LKWGEQLHAHVLRVGFLNALSVANSIMTMYSKCGELASVSKLFCSMNFKDVITWSTIIAA 393

Query: 68  YSQVGYGEEAFEYLSRMRSEGPKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 127
           YSQVGYG+EAFEYLS+MRSEG KPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT
Sbjct: 394 YSQVGYGKEAFEYLSQMRSEGSKPNEFALASVLSVCGSMAILEQGKQLHAHVLSVGLEQT 453

Query: 128 SMVCSALIIMYAKCGSIAEASKIFMDSWKDDIISWTAMISGYAEHGHSQEAIELFENIQK 187
           +MVCSALIIMYAKCGSI EASKIFMDS KDDIISWTAMISGYAEHGHSQEAIELFE+IQK
Sbjct: 454 AMVCSALIIMYAKCGSITEASKIFMDSVKDDIISWTAMISGYAEHGHSQEAIELFESIQK 513

Query: 188 VGLRPDSVTFIGVLTACSHAGMVDLGFYYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLH 247
           VGLRPDSVTFIGVLTACSHAGM DLGF+YFNSMSKDYHITPSKEHYGCMIDLLCRAGRL+
Sbjct: 514 VGLRPDSVTFIGVLTACSHAGMADLGFHYFNSMSKDYHITPSKEHYGCMIDLLCRAGRLN 573

Query: 248 DAETLIRSMPIQWDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLDPNCAGTHITLANIFA 307
           DAE+LI+SMP Q DDVVWSTLLRACRIHGDVDCGQRAAAEVLKL+PNCAGTHITLANIFA
Sbjct: 574 DAESLIKSMPFQPDDVVWSTLLRACRIHGDVDCGQRAAAEVLKLNPNCAGTHITLANIFA 633

Query: 308 AKGKWKEAANIRMLMKSKGVVKEPGWSSVKVKDSVFAFVSGDRSHPQGEDIYNILEELAS 367
           AKGKWKEAANIRM+MKSKGVVKEPGWSS+K+KDSVFAFV+GDRS PQGEDIY +LEELAS
Sbjct: 634 AKGKWKEAANIRMIMKSKGVVKEPGWSSIKLKDSVFAFVAGDRSPPQGEDIYRMLEELAS 693

Query: 368 GMEIYILELNHLVTDDSE 385
           GMEIYILELNHLVTD  E
Sbjct: 694 GMEIYILELNHLVTDMEE 711
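
As a quick arithmetic check on the HSP statistics above: the reported percentages follow from the match counts divided by the aligned length, and that aligned length (378) equals the inclusive residue span of both the query (8..385) and the subject (334..711), which is what one would expect for an alignment without gaps. A minimal Python sketch; the helper names `percent` and `span_length` are illustrative only and are not part of any BLAST tool:

```python
def percent(matches: int, aligned: int) -> float:
    """Share of aligned positions that match, as a percentage rounded to two decimals."""
    return round(100.0 * matches / aligned, 2)


def span_length(start: int, end: int) -> int:
    """Number of residues in an inclusive, 1-based coordinate range."""
    return end - start + 1


aligned = 378                           # denominator reported for this HSP
identity = percent(347, aligned)        # identical residues
positives = percent(366, aligned)       # identical or conservatively substituted residues
query_span = span_length(8, 385)        # first and last Query coordinates of the alignment
subject_span = span_length(334, 711)    # first and last Sbjct coordinates of the alignment

print(identity, positives, query_span, subject_span)
```

Both spans coming out equal to the reported aligned length is a simple consistency check that no coordinate was mistranscribed.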

The following BLAST results are available for this feature:

Match Name  E-value   Identity  Description
Q9STS9      2.3e-130  61.92     Putative pentatricopeptide repeat-containing protein At3g47840 OS=Arabidopsis th...
Q9LN01      3.2e-84   38.38     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
A8MQA3      1.6e-83   39.79     Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX...
Q9SY02      3.0e-82   38.95     Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX...
Q9SVP7      2.6e-81   40.22     Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...

Match Name      E-value    Identity  Description
XP_004142727.1  2.67e-274  100.00    putative pentatricopeptide repeat-containing protein At3g47840 [Cucumis sativus]...
XP_008447344.1  5.94e-259  94.99     PREDICTED: putative pentatricopeptide repeat-containing protein At3g47840 [Cucum...
XP_038887347.1  4.17e-254  93.12     putative pentatricopeptide repeat-containing protein At3g47840 [Benincasa hispid...
XP_023544314.1  2.77e-252  92.86     putative pentatricopeptide repeat-containing protein At3g47840 isoform X2 [Cucur...
XP_023544313.1  6.54e-252  92.86     putative pentatricopeptide repeat-containing protein At3g47840 isoform X1 [Cucur...

Match Name  E-value    Identity  Description
A0A0A0KXW2  1.29e-274  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G335250 PE=4 SV=1
A0A1S4DWM7  2.88e-259  94.99     putative pentatricopeptide repeat-containing protein At3g47840 OS=Cucumis melo O...
A0A5A7T329  2.88e-259  94.99     Putative pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa...
A0A6J1GEB4  7.26e-250  92.06     putative pentatricopeptide repeat-containing protein At3g47840 isoform X2 OS=Cuc...
A0A6J1IMM7  1.03e-249  91.80     putative pentatricopeptide repeat-containing protein At3g47840 isoform X2 OS=Cuc...

InterPro
Analysis Name: InterPro Annotations of Cucumber (B10) v3
Date Performed: 2021-10-25
IPR Term   IPR Description                                    Source       Source Term       Source Description               Alignment
IPR002885  Pentatricopeptide repeat                           PFAM         PF12854           PPR_1                            coord: 227..256; e-value: 2.3E-6; score: 27.2
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041           PPR_2                            coord: 158..205; e-value: 7.3E-12; score: 45.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756                        coord: 233..256; e-value: 1.3E-4; score: 19.9
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756                        coord: 195..228; e-value: 0.0029; score: 15.7
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756                        coord: 160..194; e-value: 4.6E-7; score: 27.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756                        coord: 59..93; e-value: 1.9E-6; score: 25.7
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                              coord: 31..56; e-value: 0.15; score: 12.4
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                              coord: 301..327; e-value: 0.91; score: 9.9
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                              coord: 59..88; e-value: 1.8E-6; score: 27.8
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                              coord: 131..153; e-value: 0.78; score: 10.1
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                              coord: 158..192; score: 12.397287
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                              coord: 57..91; score: 11.553267
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10        Tetratricopeptide repeat domain  coord: 108..218; e-value: 4.8E-24; score: 87.3
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10        Tetratricopeptide repeat domain  coord: 219..292; e-value: 2.6E-5; score: 25.8
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10        Tetratricopeptide repeat domain  coord: 1..107; e-value: 1.6E-17; score: 65.9
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452             TPR-like                         coord: 153..317
None       No IPR available                                   PANTHER      PTHR24015:SF1799  OS05G0581300 PROTEIN             coord: 8..358
None       No IPR available                                   PANTHER      PTHR24015         OS07G0578800 PROTEIN-RELATED     coord: 8..358
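
The `coord` values in the InterPro table above can be read as inclusive residue ranges on the protein (an assumption based on the usual InterProScan convention, which this page does not state). The following Python sketch parses such a range, computes the length of a hit, and checks that the three GENE3D segments tile residues 1..292 without gaps. The helper names `parse_coord` and `hit_length` are hypothetical:

```python
import re
from typing import Tuple

# Matches ranges such as "158..205", with or without the "coord:" prefix.
COORD = re.compile(r"(\d+)\.\.(\d+)")


def parse_coord(text: str) -> Tuple[int, int]:
    """Extract (start, end) from a string like 'coord: 158..205'."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def hit_length(text: str) -> int:
    """Length in residues of an inclusive coordinate range."""
    start, end = parse_coord(text)
    return end - start + 1


# GENE3D 1.25.40.10 segments as listed in the table (unsorted there).
gene3d = ["coord: 108..218", "coord: 219..292", "coord: 1..107"]
segments = sorted(parse_coord(c) for c in gene3d)

# Contiguous if each segment starts one residue after the previous one ends.
contiguous = all(prev[1] + 1 == nxt[0] for prev, nxt in zip(segments, segments[1:]))

ppr2_length = hit_length("coord: 158..205")  # the PF13041 (PPR_2) hit
print(segments, contiguous, ppr2_length)
```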

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name   Type
Cucsat.G7891  Cucsat.G7891  gene


The following exon feature(s) are a part of this mRNA:

Feature Name        Unique Name         Type
Cucsat.G7891.T6.E1  Cucsat.G7891.T6.E1  exon
Cucsat.G7891.T6.E2  Cucsat.G7891.T6.E2  exon
Cucsat.G7891.T6.E3  Cucsat.G7891.T6.E3  exon
Cucsat.G7891.T6.E4  Cucsat.G7891.T6.E4  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name        Unique Name         Type
Cucsat.G7891.T6.C1  Cucsat.G7891.T6.C1  CDS
Cucsat.G7891.T6.C2  Cucsat.G7891.T6.C2  CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name     Unique Name              Type
Cucsat.G7891.T6  Cucsat.G7891.T6-protein  polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009451      RNA modification
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0005515      protein binding
molecular_function  GO:0003723      RNA binding
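
For downstream use, the GO assignments above can be held as simple records and grouped by category. This is only an illustrative Python sketch: the variable names are hypothetical, and the data are copied by hand from the table rather than fetched from any database API:

```python
from collections import defaultdict

# (category, accession, term name), transcribed from the GO table above.
go_terms = [
    ("biological_process", "GO:0009451", "RNA modification"),
    ("cellular_component", "GO:0043231", "intracellular membrane-bounded organelle"),
    ("molecular_function", "GO:0005515", "protein binding"),
    ("molecular_function", "GO:0003723", "RNA binding"),
]

# Group accessions and names under their GO category.
by_category = defaultdict(list)
for category, accession, name in go_terms:
    by_category[category].append((accession, name))

for category, terms in by_category.items():
    print(category, [accession for accession, _ in terms])
```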