Cucsat.G4739 (gene) Cucumber (B10) v3

Overview
NameCucsat.G4739
Typegene
OrganismCucumis sativus L. var. sativus cv B10 (Cucumber (B10) v3)
DescriptionPentatricopeptide repeat-containing protein
Locationctg1227: 2763637 .. 2766574 (+)
RNA-Seq ExpressionCucsat.G4739
SyntenyCucsat.G4739
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTTTTAATGTTTTTTTAGTTAGTATTTTGAATATACACTTTTAACAAATTTTAATTTGGTTGAGAACAACCGAAACTCTGTATTTTGTAAAAGCCACCGAAAGAGAGGGGGATTTGAATGTATATTCTACTGAATGAGGCTCTGCGCAACAATTCGTGCTTCTATGTTTGACTAAATGCAATCTCAATTCACAAAACCCAAGCTATTACGTACAATCAACAATGTTTTAGCTTCTTCTACACCTAACCCTCGTGCACCGGAGCAGAATTGTTTAGCCCTTCTTCAGGCCTGTAACGCGCTACCCAAGCTCACCCAAATCCATACTCACATTCTCAAGTTGGGTCTCCACAACAACCCACTCGTTCTCACCAAATTCGCCTCCATTTCTTCTCTTATTCATGCTACTGACTACGCTGCCTCTTTCTTGTTCTCTGCTGAAGCCGATACTCGGCTGTACGATGCATTTCTTTTCAATACCCTCATCCGAGCCTACGCTCAAACTGGTCACTCGAAGGATAAAGCCTTGGCTTTGTATGGTATAATGCTTCATGATGCCATTTTGCCTAATAAATTCACGTACCCATTTGTGTTGAAGGCTTGTGCTGGTCTCGAGGTTTTGAATTTGGGCCAAACGGTTCATGGCTCGGTGGTGAAGTTTGGGTTTGATTGTGATATTCATGTTCAGAACACTATGGTTCATATGTATTCCTGTTGCGCCGGTGGGATCAATTCTGCCCGCAAAGTGTTTGATGAAATGCCAAAGTCAGATTCTGTGACTTGGAGTGCGATGATCGGTGGGTATGCTCGAGTAGGGCGCTCCACTGAAGCAGTGGCCTTGTTTAGAGAGATGCAAATGGCGGAGGTTTGCCCAGATGAGATCACTATGGTTTCCATGCTTTCTGCTTGTACTGATTTGGGTGCCCTTGAACTTGGGAAGTGGATTGAAGCTTACATAGAGAGACACGAAATTCATAAACCAGTAGAGGTTAGCAATGCACTCATTGACATGTTTGCAAAGTGTGGTGATATTAGTAAAGCATTGAAGTTATTTAGAGCTATGAATGAGAAAACAATAGTTTCCTGGACTTCTGTTATTGTTGGCATGGCAATGCATGGCCGTGGTCAAGAGGCCACTTGTTTATTTGAGGAGATGACAAGTTCTGGTGTAGCTCCAGATGATGTCGCCTTTATTGGCTTGCTTTCTGCTTGTAGCCATTCGGGACTAGTAGAAAGAGGTAGAGAATATTTCGGTTCTATGATGAAGAAATACAAACTTGTTCCTAAGATAGAACATTATGGATGCATGGTGGACATGTATTGCAGGACTGGACTTGTGAAAGAGGCTCTTGAGTTCGTACGTAATATGCCAATCGAGCCAAATCCAGTAATCTTACGAACACTAGTCAGTGCCTGCCGTGGTCATGGTGAATTCAAGCTTGGAGAAAAGATAACCAAACTGCTAATGAAACACGAACCTTTGCATGAATCAAACTATGTGTTGCTCTCTAATATTTATGCAAAAACGCTTAGTTGGGAGAAGAAGACCAAAATTAGAGAGGTGATGGAAGTGAAAGGCATGAAAAAGGTTCCAGGGAGCACTATGATTGAGATTGATAATGAAATCTATGAATTTGTTGCTGGAGATAAGTCTCATAAACAGCACAAAGAAATCTATGAAATGGTGGATGAGATGGGTAGAGAAATGAAGAAATCTGGATACCGTCCTTCGACATCAGAGGTTTTGCTTGATATCAATGAAGAGGACAAAGAAGATAGTTTGAATAGGCATAGTGAAAAACTAGCTATTGCATTTGGTCTTCTTAGGACTCCACCAGGAACTCCAATTCGAATTGTAAAGAATTTGCGAGTTTGCAGTGATTGCCACTCGGCTTCCAAGTTCATTTCTAAAATTTATGATCGTGAAATCATAATGAGAGACCGCAACAGGTTTCACCACTTCAAGTCTGGGCAGTGCTCATGTGGAGATTTCTGGTGAAGTTATAATAGCATCAAATGAGTATGAAACAGGCAATTGGTAGTTGGTCAGGTTCATCAATGCTCTGCATCCATTATGCAGTTGAGGGCATTACATGTTAAAAATTGGAAGAATGTTTCAAAGATGCAATTGTTAATGTTATATATTTCCTCCATTTGATTCAAAGCTCAACCCCTTTGGCGCCAATATGCTTGAGAGCTTCAAATTCAACTGAAAAAACATAAAATCAGCTGCTTTTGATTGGAAGTGTTCTTTTATCAATCCTGCAAGTAAAAGGAAAGCTAGCTTCCTTGGTGGACGGCTCCTCAAAGGCAATCAAACTAACAACTCGGTAACGTTCAACTTATCTTCGACTCAGTTGTAAAAAATCTTTTAGGACATCCTAGTTTATGCTTAAAGAATCTATATTTCGATTCTTCCAAGTCATGATCTTTTTAATCCAATCTGAATTGCAGGTATGTCAAATTTTATTCTTATTATATCTAATTGTTAAGGAACTCCCAACACAAAGACAGGATGGGCTTAGCCTGGTGATGTGACTTTCAAGTTGATGGATATCTAAGGTTTCATTTGAGTTTGCTTTTGCTTTTAAAATAATTGTTCTCAGCTTTGCTTCCTAAAAAATTCACAGCTAAAGATAGTATTTGGTAAATTTAATTTCATTGGTTAGGAAATTAGAAGATTTTTAAAAAGTTGAATCTAAAAGTGAGAAGCCTACTACTTTCACAAGTTAGTTATAATTTAAAGAGTGATTTTGTTAGATTATTTAACCATTTGCAAAAATGGTTAAACACTTTTTTTCAAAATCTCCTTTTTACCATTTTAAAACACAATTCAAACACTTTTGAGATAGTTTAAGTTTGGTATGTTTGGATTGACTAAACAATATTATTAGTATATTATGTTAGCCTTAGGAAATTAAATTTGTGTAAAGTCTGGAT

Coding sequence (CDS)

ATGCAATCTCAATTCACAAAACCCAAGCTATTACGTACAATCAACAATGTTTTAGCTTCTTCTACACCTAACCCTCGTGCACCGGAGCAGAATTGTTTAGCCCTTCTTCAGGCCTGTAACGCGCTACCCAAGCTCACCCAAATCCATACTCACATTCTCAAGTTGGGTCTCCACAACAACCCACTCGTTCTCACCAAATTCGCCTCCATTTCTTCTCTTATTCATGCTACTGACTACGCTGCCTCTTTCTTGTTCTCTGCTGAAGCCGATACTCGGCTGTACGATGCATTTCTTTTCAATACCCTCATCCGAGCCTACGCTCAAACTGGTCACTCGAAGGATAAAGCCTTGGCTTTGTATGGTATAATGCTTCATGATGCCATTTTGCCTAATAAATTCACGTACCCATTTGTGTTGAAGGCTTGTGCTGGTCTCGAGGTTTTGAATTTGGGCCAAACGGTTCATGGCTCGGTGGTGAAGTTTGGGTTTGATTGTGATATTCATGTTCAGAACACTATGGTTCATATGTATTCCTGTTGCGCCGGTGGGATCAATTCTGCCCGCAAAGTGTTTGATGAAATGCCAAAGTCAGATTCTGTGACTTGGAGTGCGATGATCGGTGGGTATGCTCGAGTAGGGCGCTCCACTGAAGCAGTGGCCTTGTTTAGAGAGATGCAAATGGCGGAGGTTTGCCCAGATGAGATCACTATGGTTTCCATGCTTTCTGCTTGTACTGATTTGGGTGCCCTTGAACTTGGGAAGTGGATTGAAGCTTACATAGAGAGACACGAAATTCATAAACCAGTAGAGGTTAGCAATGCACTCATTGACATGTTTGCAAAGTGTGGTGATATTAGTAAAGCATTGAAGTTATTTAGAGCTATGAATGAGAAAACAATAGTTTCCTGGACTTCTGTTATTGTTGGCATGGCAATGCATGGCCGTGGTCAAGAGGCCACTTGTTTATTTGAGGAGATGACAAGTTCTGGTGTAGCTCCAGATGATGTCGCCTTTATTGGCTTGCTTTCTGCTTGTAGCCATTCGGGACTAGTAGAAAGAGGTAGAGAATATTTCGGTTCTATGATGAAGAAATACAAACTTGTTCCTAAGATAGAACATTATGGATGCATGGTGGACATGTATTGCAGGACTGGACTTGTGAAAGAGGCTCTTGAGTTCGTACGTAATATGCCAATCGAGCCAAATCCAGTAATCTTACGAACACTAGTCAGTGCCTGCCGTGGTCATGGTGAATTCAAGCTTGGAGAAAAGATAACCAAACTGCTAATGAAACACGAACCTTTGCATGAATCAAACTATGTGTTGCTCTCTAATATTTATGCAAAAACGCTTAGTTGGGAGAAGAAGACCAAAATTAGAGAGGTGATGGAAGTGAAAGGCATGAAAAAGGTTCCAGGGAGCACTATGATTGAGATTGATAATGAAATCTATGAATTTGTTGCTGGAGATAAGTCTCATAAACAGCACAAAGAAATCTATGAAATGGTGGATGAGATGGGTAGAGAAATGAAGAAATCTGGATACCGTCCTTCGACATCAGAGGTTTTGCTTGATATCAATGAAGAGGACAAAGAAGATAGTTTGAATAGGCATAGTGAAAAACTAGCTATTGCATTTGGTCTTCTTAGGACTCCACCAGGAACTCCAATTCGAATTGTAAAGAATTTGCGAGTTTGCAGTGATTGCCACTCGGCTTCCAAGTTCATTTCTAAAATTTATGATCGTGAAATCATAATGAGAGACCGCAACAGGTTTCACCACTTCAAGTCTGGGCAGTGCTCATGTGGAGATTTCTGGTGA

Protein sequence

MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNNPLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALYGIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCCAGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSMLSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTIVSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGSMMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSEKLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCSCGDFW
Homology
BLAST of Cucsat.G4739 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 502.3 bits (1292), Expect = 7.7e-141
Identity = 255/582 (43.81%), Postives = 384/582 (65.98%), Query Frame = 0

Query: 30  QNCLALLQ--ACNALPKLTQIHTHILKLGLHNNPLVLTKFASISSLIHATDYAASFLFSA 89
           + C+ LLQ    +++ KL QIH   ++ G+  +   L K      +   +    S+    
Sbjct: 16  EKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKV 75

Query: 90  EAD-TRLYDAFLFNTLIRAYAQTGHSKDKALALYGIM-LHDAILPNKFTYPFVLKACAGL 149
            +   +  + F++NTLIR YA+ G+S   A +LY  M +   + P+  TYPF++KA   +
Sbjct: 76  FSKIEKPINVFIWNTLIRGYAEIGNS-ISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTM 135

Query: 150 EVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCCAGGINSARKVFDEMPKSDSVTWSAM 209
             + LG+T+H  V++ GF   I+VQN+++H+Y+ C G + SA KVFD+MP+ D V W+++
Sbjct: 136 ADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANC-GDVASAYKVFDKMPEKDLVAWNSV 195

Query: 210 IGGYARVGRSTEAVALFREMQMAEVCPDEITMVSMLSACTDLGALELGKWIEAYIERHEI 269
           I G+A  G+  EA+AL+ EM    + PD  T+VS+LSAC  +GAL LGK +  Y+ +  +
Sbjct: 196 INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 255

Query: 270 HKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTIVSWTSVIVGMAMHGRGQEATCLFEE 329
            + +  SN L+D++A+CG + +A  LF  M +K  VSWTS+IVG+A++G G+EA  LF+ 
Sbjct: 256 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 315

Query: 330 MTSS-GVAPDDVAFIGLLSACSHSGLVERGREYFGSMMKKYKLVPKIEHYGCMVDMYCRT 389
           M S+ G+ P ++ F+G+L ACSH G+V+ G EYF  M ++YK+ P+IEH+GCMVD+  R 
Sbjct: 316 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 375

Query: 390 GLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMKHEPLHESNYVLLS 449
           G VK+A E++++MP++PN VI RTL+ AC  HG+  L E     +++ EP H  +YVLLS
Sbjct: 376 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 435

Query: 450 NIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQHKEIYEMVD 509
           N+YA    W    KIR+ M   G+KKVPG +++E+ N ++EF+ GDKSH Q   IY  + 
Sbjct: 436 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 495

Query: 510 EMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSEKLAIAFGLLRTPPGTPIRIVKNLR 569
           EM   ++  GY P  S V +D+ EE+KE+++  HSEK+AIAF L+ TP  +PI +VKNLR
Sbjct: 496 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 555

Query: 570 VCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCSCGDFW 607
           VC+DCH A K +SK+Y+REI++RDR+RFHHFK+G CSC D+W
Sbjct: 556 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Cucsat.G4739 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 499.6 bits (1285), Expect = 5.0e-140
Identity = 256/605 (42.31%), Postives = 390/605 (64.46%), Query Frame = 0

Query: 5   FTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNNPLV- 64
           FTK   + T+N              QN + L+  CN+L +L QI  + +K  + +   V 
Sbjct: 18  FTKHSKIDTVNT-------------QNPILLISKCNSLRELMQIQAYAIKSHIEDVSFVA 77

Query: 65  -LTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALYGI 124
            L  F + S    +  Y A  LF A ++    D  +FN++ R Y++  +  +   +L+  
Sbjct: 78  KLINFCTESPTESSMSY-ARHLFEAMSEP---DIVIFNSMARGYSRFTNPLE-VFSLFVE 137

Query: 125 MLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCCAG 184
           +L D ILP+ +T+P +LKACA  + L  G+ +H   +K G D +++V  T+++MY+ C  
Sbjct: 138 ILEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECE- 197

Query: 185 GINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSMLS 244
            ++SAR VFD + +   V ++AMI GYAR  R  EA++LFREMQ   + P+EIT++S+LS
Sbjct: 198 DVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLS 257

Query: 245 ACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTIVS 304
           +C  LG+L+LGKWI  Y ++H   K V+V+ ALIDMFAKCG +  A+ +F  M  K   +
Sbjct: 258 SCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQA 317

Query: 305 WTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGSMM 364
           W+++IV  A HG+ +++  +FE M S  V PD++ F+GLL+ACSH+G VE GR+YF  M+
Sbjct: 318 WSAMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMV 377

Query: 365 KKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFKLG 424
            K+ +VP I+HYG MVD+  R G +++A EF+  +PI P P++ R L++AC  H    L 
Sbjct: 378 SKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLA 437

Query: 425 EKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNE 484
           EK+++ + + +  H  +YV+LSN+YA+   WE    +R+VM+ +   KVPG + IE++N 
Sbjct: 438 EKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNV 497

Query: 485 IYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVL-LDINEEDKEDSLNRHSEK 544
           ++EF +GD       +++  +DEM +E+K SGY P TS V+  ++N+++KE +L  HSEK
Sbjct: 498 VHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEK 557

Query: 545 LAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCS 604
           LAI FGLL TPPGT IR+VKNLRVC DCH+A+K IS I+ R++++RD  RFHHF+ G+CS
Sbjct: 558 LAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCS 603

Query: 605 CGDFW 607
           CGDFW
Sbjct: 618 CGDFW 603

BLAST of Cucsat.G4739 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 486.5 bits (1251), Expect = 4.4e-136
Identity = 273/724 (37.71%), Postives = 381/724 (52.62%), Query Frame = 0

Query: 19  ASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNNPLVLTK---FASISSLIH 78
           +S  P         L+LL  C  L  L  IH  ++K+GLHN    L+K   F  +S    
Sbjct: 23  SSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFE 82

Query: 79  ATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALYGIMLHDAILPNKFTY 138
              YA S   + +    L    ++NT+ R +A +      AL LY  M+   +LPN +T+
Sbjct: 83  GLPYAISVFKTIQEPNLL----IWNTMFRGHALSS-DPVSALKLYVCMISLGLLPNSYTF 142

Query: 139 PFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMY------------------ 198
           PFVLK+CA  +    GQ +HG V+K G D D++V  +++ MY                  
Sbjct: 143 PFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPH 202

Query: 199 ------------SCCAGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREM 258
                           G I +A+K+FDE+P  D V+W+AMI GYA  G   EA+ LF++M
Sbjct: 203 RDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDM 262

Query: 259 QMAEVCPDE--------------------------------------------------- 318
               V PDE                                                   
Sbjct: 263 MKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGEL 322

Query: 319 --------------------------------------------------ITMVSMLSAC 378
                                                             +TM+S+L AC
Sbjct: 323 ETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPAC 382

Query: 379 TDLGALELGKWIEAYIERH--EIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTIVS 438
             LGA+++G+WI  YI++    +     +  +LIDM+AKCGDI  A ++F ++  K++ S
Sbjct: 383 AHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSS 442

Query: 439 WTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGSMM 498
           W ++I G AMHGR   +  LF  M   G+ PDD+ F+GLLSACSHSG+++ GR  F +M 
Sbjct: 443 WNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMT 502

Query: 499 KKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFKLG 558
           + YK+ PK+EHYGCM+D+   +GL KEA E +  M +EP+ VI  +L+ AC+ HG  +LG
Sbjct: 503 QDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELG 562

Query: 559 EKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNE 607
           E   + L+K EP +  +YVLLSNIYA    W +  K R ++  KGMKKVPG + IEID+ 
Sbjct: 563 ESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSV 622

BLAST of Cucsat.G4739 vs. ExPASy Swiss-Prot
Match: Q683I9 (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 474.9 bits (1221), Expect = 1.3e-132
Identity = 238/551 (43.19%), Postives = 358/551 (64.97%), Query Frame = 0

Query: 95  DAFLFNTLIRAYAQTGHS--KDKALALYGIMLHDAILPNKFTYPFVLKACAGLEVLNLGQ 154
           ++FL+N +IRA      S  +   +++Y  M +  + P+  T+PF+L +      L LGQ
Sbjct: 23  ESFLWNIIIRAIVHNVSSPQRHSPISVYLRMRNHRVSPDFHTFPFLLPSFHNPLHLPLGQ 82

Query: 155 TVHGSVVKFGFDCDIHVQNTMVHMYSCC------------------------------AG 214
             H  ++ FG D D  V+ ++++MYS C                              AG
Sbjct: 83  RTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGSKDLPAWNSVVNAYAKAG 142

Query: 215 GINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQM-----AEVCPDEITM 274
            I+ ARK+FDEMP+ + ++WS +I GY   G+  EA+ LFREMQ+     A V P+E TM
Sbjct: 143 LIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFTM 202

Query: 275 VSMLSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAM-N 334
            ++LSAC  LGALE GKW+ AYI+++ +   + +  ALIDM+AKCG + +A ++F A+ +
Sbjct: 203 STVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLERAKRVFNALGS 262

Query: 335 EKTIVSWTSVIVGMAMHGRGQEATCLFEEMTSS-GVAPDDVAFIGLLSACSHSGLVERGR 394
           +K + +++++I  +AM+G   E   LF EMT+S  + P+ V F+G+L AC H GL+  G+
Sbjct: 263 KKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGACVHRGLINEGK 322

Query: 395 EYFGSMMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRG 454
            YF  M++++ + P I+HYGCMVD+Y R+GL+KEA  F+ +MP+EP+ +I  +L+S  R 
Sbjct: 323 SYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVLIWGSLLSGSRM 382

Query: 455 HGEFKLGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGST 514
            G+ K  E   K L++ +P++   YVLLSN+YAKT  W +   IR  MEVKG+ KVPG +
Sbjct: 383 LGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEMEVKGINKVPGCS 442

Query: 515 MIEIDNEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSL 574
            +E++  ++EFV GD+S ++ + IY M+DE+ + ++++GY   T EVLLD+NE+DKE +L
Sbjct: 443 YVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVTDTKEVLLDLNEKDKEIAL 502

Query: 575 NRHSEKLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHF 607
           + HSEKLAIAF L++T PGTP+RI+KNLR+C DCH   K ISK++ REI++RD NRFHHF
Sbjct: 503 SYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMISKLFSREIVVRDCNRFHHF 562

BLAST of Cucsat.G4739 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 473.4 bits (1217), Expect = 3.8e-132
Identity = 236/606 (38.94%), Postives = 371/606 (61.22%), Query Frame = 0

Query: 33  LALLQACNALPKLTQIHTHILKLGLHNNPLVLTKFASISSLIHATDYAASFLFSAEADTR 92
           ++ LQ C+   +L QIH  +LK GL  +   +TKF S      ++D+        +   R
Sbjct: 18  MSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDR 77

Query: 93  LYDAFLFNTLIRAYAQTGHSKDKALALYGIMLHDAILPNKFTYPFVLKACAGLEVLNLGQ 152
             D FL+N +IR ++      +++L LY  ML  +   N +T+P +LKAC+ L       
Sbjct: 78  -PDTFLWNLMIRGFS-CSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETT 137

Query: 153 TVHGSVVKFGFDCDIHVQNTMVHMYSCCAGGINSARKVFDEMPKSDSVTWSAMIGGYARV 212
            +H  + K G++ D++  N++++ Y+   G    A  +FD +P+ D V+W+++I GY + 
Sbjct: 138 QIHAQITKLGYENDVYAVNSLINSYA-VTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKA 197

Query: 213 GR-------------------------------STEAVALFREMQMAEVCPDEITMVSML 272
           G+                               + EA+ LF EMQ ++V PD +++ + L
Sbjct: 198 GKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANAL 257

Query: 273 SACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTIV 332
           SAC  LGALE GKWI +Y+ +  I     +   LIDM+AKCG++ +AL++F+ + +K++ 
Sbjct: 258 SACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQ 317

Query: 333 SWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGSM 392
           +WT++I G A HG G+EA   F EM   G+ P+ + F  +L+ACS++GLVE G+  F SM
Sbjct: 318 AWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSM 377

Query: 393 MKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFKL 452
            + Y L P IEHYGC+VD+  R GL+ EA  F++ MP++PN VI   L+ ACR H   +L
Sbjct: 378 ERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIEL 437

Query: 453 GEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEIDN 512
           GE+I ++L+  +P H   YV  +NI+A    W+K  + R +M+ +G+ KVPG + I ++ 
Sbjct: 438 GEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEG 497

Query: 513 EIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSL-NRHSE 572
             +EF+AGD+SH + ++I      M R+++++GY P   E+LLD+ ++D+ +++ ++HSE
Sbjct: 498 TTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSE 557

Query: 573 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 607
           KLAI +GL++T PGT IRI+KNLRVC DCH  +K ISKIY R+I+MRDR RFHHF+ G+C
Sbjct: 558 KLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKC 617

BLAST of Cucsat.G4739 vs. NCBI nr
Match: XP_004138859.1 (pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus] >KGN62942.1 hypothetical protein Csa_021798 [Cucumis sativus])

HSP 1 Score: 1225 bits (3170), Expect = 0.0
Identity = 606/606 (100.00%), Postives = 606/606 (100.00%), Query Frame = 0

Query: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60
           MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120
           PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180
           GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC
Sbjct: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240
           AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300
           LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360
           VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS
Sbjct: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540
           NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cucsat.G4739 vs. NCBI nr
Match: KAA0064932.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1207 bits (3123), Expect = 0.0
Identity = 597/606 (98.51%), Postives = 601/606 (99.17%), Query Frame = 0

Query: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60
           MQSQFTKPKLLRTINNVLASST NPRA EQNCLALLQACNALPKLTQIHTHILKLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTTNPRAAEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120
           PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180
           GIMLHD ILPNKFTYPFVLKACAGLEVLNLGQ+VHGSVVKFGFDCDIHVQNTMVHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240
           AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300
           LSACTDLGALELGKWIEAYIERH IHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHGIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360
           VSWTSVIVGMAMHGRG+EATCLFEEM +SGVAPDDVAFIGLLSACSHSGLVERGREYFGS
Sbjct: 301 VSWTSVIVGMAMHGRGREATCLFEEMITSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLV+ACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVTACRGHGEFK 420

Query: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540
           NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cucsat.G4739 vs. NCBI nr
Match: XP_008445200.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis melo])

HSP 1 Score: 1206 bits (3120), Expect = 0.0
Identity = 596/606 (98.35%), Postives = 601/606 (99.17%), Query Frame = 0

Query: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60
           MQSQFTKPKLLRTINNVLASST NPRA EQNCLALLQACNALPKLTQIHTHI+KLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTTNPRAAEQNCLALLQACNALPKLTQIHTHIVKLGLHNN 60

Query: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120
           PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180
           GIMLHD ILPNKFTYPFVLKACAGLEVLNLGQ+VHGSVVKFGFDCDIHVQNTMVHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240
           AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300
           LSACTDLGALELGKWIEAYIERH IHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHGIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360
           VSWTSVIVGMAMHGRG+EATCLFEEM +SGVAPDDVAFIGLLSACSHSGLVERGREYFGS
Sbjct: 301 VSWTSVIVGMAMHGRGREATCLFEEMITSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLV+ACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVTACRGHGEFK 420

Query: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540
           NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cucsat.G4739 vs. NCBI nr
Match: XP_038884201.1 (pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida])

HSP 1 Score: 1154 bits (2984), Expect = 0.0
Identity = 570/606 (94.06%), Postives = 587/606 (96.86%), Query Frame = 0

Query: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60
           MQSQFTK KLLR INNV+AS+T NPRA EQNCLALLQACNALPKLTQIHTHILKLGLHNN
Sbjct: 1   MQSQFTKTKLLRAINNVVASTT-NPRAAEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120
           PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKAL+LY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALSLY 120

Query: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180
            IMLHD ILPNKFTYPFVLKACAGLEVLNLGQ+VHGSVVKFGFD DIHVQNTM+HMYSCC
Sbjct: 121 SIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDIHVQNTMIHMYSCC 180

Query: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240
           AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVS+
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSI 240

Query: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300
           LSACTDLGALELGKWIEAYIER  IHKPVEVSNALIDMFAKCGDI+KALKLFRA+NEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERQGIHKPVEVSNALIDMFAKCGDINKALKLFRALNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360
           VSWTSVIVGMAMHGRGQEA CLFEEM  SGVAPDDV+FIGLLSACSHSGLVERGREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGQEAICLFEEMIVSGVAPDDVSFIGLLSACSHSGLVERGREYFSS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKL PKIEHYGCMVDMYCRTGLVKEAL+FV NMP+EPNPVILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLAPKIEHYGCMVDMYCRTGLVKEALQFVHNMPVEPNPVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLM+HEPLHESNYVLLSNIYAK LSWEKKTKIREVMEVKGMKK+PGSTMIEID
Sbjct: 421 LGEKITKLLMRHEPLHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKIPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540
           NEIYEFVAGDKSHKQ+KEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKED+LNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQYKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHSE 540

Query: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASK+IS IY+REIIMRDRNRFHHFKSG C
Sbjct: 541 KLAIAFGLLSTPPGTPIRIVKNLRVCSDCHSASKYISNIYNREIIMRDRNRFHHFKSGLC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 605

BLAST of Cucsat.G4739 vs. NCBI nr
Match: XP_022131416.1 (pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia] >XP_022131419.1 pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia] >XP_022131420.1 pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia])

HSP 1 Score: 1104 bits (2855), Expect = 0.0
Identity = 546/606 (90.10%), Postives = 568/606 (93.73%), Query Frame = 0

Query: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60
           MQSQF+K KLL  INN    S  NPRA EQ+CLALLQACNALPKL QIH HILKLGLHNN
Sbjct: 1   MQSQFSKTKLLLAINNAPVFSRANPRAAEQDCLALLQACNALPKLAQIHAHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120
           PLVLTKFASISS+I ATDYAASFLFSA ADTRLYDAFLFNTLIRAYAQTGHSK KALALY
Sbjct: 61  PLVLTKFASISSVISATDYAASFLFSAGADTRLYDAFLFNTLIRAYAQTGHSKPKALALY 120

Query: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180
           G+ML D ILPNKFTYPFVLKACAGLEVLNLGQ+VHGSVVKFGFD D+HV+NTMVHMYSCC
Sbjct: 121 GLMLRDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDVHVRNTMVHMYSCC 180

Query: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240
           AGGIN ARKVFDEMPKSDSVTWSAMIGGYARVGR TEAV+LFREMQ+AEVCPDEITMVS+
Sbjct: 181 AGGINFARKVFDEMPKSDSVTWSAMIGGYARVGRPTEAVSLFREMQLAEVCPDEITMVSI 240

Query: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300
           LSACTDLGALELGKW+EAYIER  I KP EVSNALIDMFAKCGDISKALKLF+ M+EKTI
Sbjct: 241 LSACTDLGALELGKWLEAYIERQGIQKPEEVSNALIDMFAKCGDISKALKLFKTMSEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360
           VSWTSVIVGMAMHGRGQ+A CLFEEM  SGVAPDDVAFIGLLSACSHSG+VERGREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGQDAICLFEEMIGSGVAPDDVAFIGLLSACSHSGMVERGREYFSS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           M KKYKLVPKIEHYGCMVDM+CRTGLVKEALEFV +MPIEPN VILRTLVSACRGHGEF+
Sbjct: 361 MTKKYKLVPKIEHYGCMVDMFCRTGLVKEALEFVHSMPIEPNAVILRTLVSACRGHGEFQ 420

Query: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITK LM+HEP+HESNYVLLSNIYAK LSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKQLMRHEPMHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540
           NEIYEFVAGDKSHKQ KEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKED+LNRH E
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHGE 540

Query: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFK+G C
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKAGIC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cucsat.G4739 vs. ExPASy TrEMBL
Match: A0A0A0LQ71 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G381680 PE=3 SV=1)

HSP 1 Score: 1225 bits (3170), Expect = 0.0
Identity = 606/606 (100.00%), Postives = 606/606 (100.00%), Query Frame = 0

Query: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60
           MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120
           PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180
           GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC
Sbjct: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240
           AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300
           LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360
           VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS
Sbjct: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420

Query: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540
           NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cucsat.G4739 vs. ExPASy TrEMBL
Match: A0A5A7V9A4 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G002500 PE=3 SV=1)

HSP 1 Score: 1207 bits (3123), Expect = 0.0
Identity = 597/606 (98.51%), Postives = 601/606 (99.17%), Query Frame = 0

Query: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60
           MQSQFTKPKLLRTINNVLASST NPRA EQNCLALLQACNALPKLTQIHTHILKLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTTNPRAAEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120
           PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180
           GIMLHD ILPNKFTYPFVLKACAGLEVLNLGQ+VHGSVVKFGFDCDIHVQNTMVHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240
           AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300
           LSACTDLGALELGKWIEAYIERH IHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHGIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360
           VSWTSVIVGMAMHGRG+EATCLFEEM +SGVAPDDVAFIGLLSACSHSGLVERGREYFGS
Sbjct: 301 VSWTSVIVGMAMHGRGREATCLFEEMITSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLV+ACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVTACRGHGEFK 420

Query: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540
           NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cucsat.G4739 vs. ExPASy TrEMBL
Match: A0A1S3BC37 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=3656 GN=LOC103488302 PE=3 SV=1)

HSP 1 Score: 1206 bits (3120), Expect = 0.0
Identity = 596/606 (98.35%), Postives = 601/606 (99.17%), Query Frame = 0

Query: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60
           MQSQFTKPKLLRTINNVLASST NPRA EQNCLALLQACNALPKLTQIHTHI+KLGLHNN
Sbjct: 1   MQSQFTKPKLLRTINNVLASSTTNPRAAEQNCLALLQACNALPKLTQIHTHIVKLGLHNN 60

Query: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120
           PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY
Sbjct: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120

Query: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180
           GIMLHD ILPNKFTYPFVLKACAGLEVLNLGQ+VHGSVVKFGFDCDIHVQNTMVHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDCDIHVQNTMVHMYSCC 180

Query: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240
           AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM
Sbjct: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240

Query: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300
           LSACTDLGALELGKWIEAYIERH IHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERHGIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360
           VSWTSVIVGMAMHGRG+EATCLFEEM +SGVAPDDVAFIGLLSACSHSGLVERGREYFGS
Sbjct: 301 VSWTSVIVGMAMHGRGREATCLFEEMITSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLV+ACRGHGEFK
Sbjct: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVTACRGHGEFK 420

Query: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540
           NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE
Sbjct: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540

Query: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC
Sbjct: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cucsat.G4739 vs. ExPASy TrEMBL
Match: A0A6J1BQ70 (pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charantia OX=3673 GN=LOC111004636 PE=3 SV=1)

HSP 1 Score: 1104 bits (2855), Expect = 0.0
Identity = 546/606 (90.10%), Postives = 568/606 (93.73%), Query Frame = 0

Query: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60
           MQSQF+K KLL  INN    S  NPRA EQ+CLALLQACNALPKL QIH HILKLGLHNN
Sbjct: 1   MQSQFSKTKLLLAINNAPVFSRANPRAAEQDCLALLQACNALPKLAQIHAHILKLGLHNN 60

Query: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120
           PLVLTKFASISS+I ATDYAASFLFSA ADTRLYDAFLFNTLIRAYAQTGHSK KALALY
Sbjct: 61  PLVLTKFASISSVISATDYAASFLFSAGADTRLYDAFLFNTLIRAYAQTGHSKPKALALY 120

Query: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180
           G+ML D ILPNKFTYPFVLKACAGLEVLNLGQ+VHGSVVKFGFD D+HV+NTMVHMYSCC
Sbjct: 121 GLMLRDGILPNKFTYPFVLKACAGLEVLNLGQSVHGSVVKFGFDRDVHVRNTMVHMYSCC 180

Query: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240
           AGGIN ARKVFDEMPKSDSVTWSAMIGGYARVGR TEAV+LFREMQ+AEVCPDEITMVS+
Sbjct: 181 AGGINFARKVFDEMPKSDSVTWSAMIGGYARVGRPTEAVSLFREMQLAEVCPDEITMVSI 240

Query: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300
           LSACTDLGALELGKW+EAYIER  I KP EVSNALIDMFAKCGDISKALKLF+ M+EKTI
Sbjct: 241 LSACTDLGALELGKWLEAYIERQGIQKPEEVSNALIDMFAKCGDISKALKLFKTMSEKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGS 360
           VSWTSVIVGMAMHGRGQ+A CLFEEM  SGVAPDDVAFIGLLSACSHSG+VERGREYF S
Sbjct: 301 VSWTSVIVGMAMHGRGQDAICLFEEMIGSGVAPDDVAFIGLLSACSHSGMVERGREYFSS 360

Query: 361 MMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFK 420
           M KKYKLVPKIEHYGCMVDM+CRTGLVKEALEFV +MPIEPN VILRTLVSACRGHGEF+
Sbjct: 361 MTKKYKLVPKIEHYGCMVDMFCRTGLVKEALEFVHSMPIEPNAVILRTLVSACRGHGEFQ 420

Query: 421 LGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480
           LGEKITK LM+HEP+HESNYVLLSNIYAK LSWEKKTKIREVMEVKGMKKVPGSTMIEID
Sbjct: 421 LGEKITKQLMRHEPMHESNYVLLSNIYAKMLSWEKKTKIREVMEVKGMKKVPGSTMIEID 480

Query: 481 NEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSE 540
           NEIYEFVAGDKSHKQ KEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKED+LNRH E
Sbjct: 481 NEIYEFVAGDKSHKQFKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDTLNRHGE 540

Query: 541 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 600
           KLAIAFGLL TPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFK+G C
Sbjct: 541 KLAIAFGLLNTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKAGIC 600

Query: 601 SCGDFW 606
           SCGDFW
Sbjct: 601 SCGDFW 606

BLAST of Cucsat.G4739 vs. ExPASy TrEMBL
Match: A0A6J1GHH7 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita moschata OX=3662 GN=LOC111454235 PE=3 SV=1)

HSP 1 Score: 1096 bits (2835), Expect = 0.0
Identity = 549/607 (90.44%), Postives = 569/607 (93.74%), Query Frame = 0

Query: 1   MQSQFTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNN 60
           MQSQF    LLR I+N  AS + NPRA EQNCLALLQACN+LPKLTQIH HI KLGL NN
Sbjct: 1   MQSQF----LLRVISNAAASRS-NPRAAEQNCLALLQACNSLPKLTQIHAHIFKLGLRNN 60

Query: 61  PLVLTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALY 120
           PLVLTKFASISS+I+ATDYAASFLFSAEADTRLYDAFLFNTLIRA+AQTGHSK +AL+LY
Sbjct: 61  PLVLTKFASISSVINATDYAASFLFSAEADTRLYDAFLFNTLIRAFAQTGHSKARALSLY 120

Query: 121 GIMLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCC 180
           GIMLHD ILPNKFTYPFVLKACAGLEVL+LGQ+VHGSVVKFGFD D+HVQNTMVHMYSCC
Sbjct: 121 GIMLHDGILPNKFTYPFVLKACAGLEVLSLGQSVHGSVVKFGFDHDVHVQNTMVHMYSCC 180

Query: 181 AGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSM 240
           +GGI  ARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVS+
Sbjct: 181 SGGIIFARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSV 240

Query: 241 LSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTI 300
           LSACTDLGALELGKWIEAYIER  I KPVEVSNALIDMFAKCGDI KALKLFRAM++KTI
Sbjct: 241 LSACTDLGALELGKWIEAYIERQGIQKPVEVSNALIDMFAKCGDIGKALKLFRAMSDKTI 300

Query: 301 VSWTSVIVGMAMHGRGQEATCLFEEMT-SSGVAPDDVAFIGLLSACSHSGLVERGREYFG 360
           VSWTSVIVGMAMHGRG EA CLFEEM  SS VAPDDVAFIGLLSACSHSGLVERGREYF 
Sbjct: 301 VSWTSVIVGMAMHGRGLEAICLFEEMIGSSSVAPDDVAFIGLLSACSHSGLVERGREYFN 360

Query: 361 SMMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEF 420
           SMMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFV NMP EPNPVILRTLVSACRGHGEF
Sbjct: 361 SMMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVHNMPFEPNPVILRTLVSACRGHGEF 420

Query: 421 KLGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEI 480
           KLGEKITKLLM+HEP+HESNYVLLSNIYAK  +WEKKTKIREVMEVKGMKKVPGSTMIEI
Sbjct: 421 KLGEKITKLLMRHEPMHESNYVLLSNIYAKMFNWEKKTKIREVMEVKGMKKVPGSTMIEI 480

Query: 481 DNEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHS 540
           DNEIYEFVAGDKSHKQ KEIY MVDEMGREM KSGYRPSTSEVLLDINEEDKED+LNRHS
Sbjct: 481 DNEIYEFVAGDKSHKQFKEIYAMVDEMGREMTKSGYRPSTSEVLLDINEEDKEDTLNRHS 540

Query: 541 EKLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQ 600
           EKLAIAFGLL TPPGTPIRIVKNLRVC+DCHSASKFISKIYDREIIMRDRNRFHHFK G 
Sbjct: 541 EKLAIAFGLLNTPPGTPIRIVKNLRVCTDCHSASKFISKIYDREIIMRDRNRFHHFKGGL 600

Query: 601 CSCGDFW 606
           CSCGDFW
Sbjct: 601 CSCGDFW 602

BLAST of Cucsat.G4739 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 502.3 bits (1292), Expect = 5.5e-142
Identity = 255/582 (43.81%), Postives = 384/582 (65.98%), Query Frame = 0

Query: 30  QNCLALLQ--ACNALPKLTQIHTHILKLGLHNNPLVLTKFASISSLIHATDYAASFLFSA 89
           + C+ LLQ    +++ KL QIH   ++ G+  +   L K      +   +    S+    
Sbjct: 16  EKCINLLQTYGVSSITKLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKV 75

Query: 90  EAD-TRLYDAFLFNTLIRAYAQTGHSKDKALALYGIM-LHDAILPNKFTYPFVLKACAGL 149
            +   +  + F++NTLIR YA+ G+S   A +LY  M +   + P+  TYPF++KA   +
Sbjct: 76  FSKIEKPINVFIWNTLIRGYAEIGNS-ISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTM 135

Query: 150 EVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCCAGGINSARKVFDEMPKSDSVTWSAM 209
             + LG+T+H  V++ GF   I+VQN+++H+Y+ C G + SA KVFD+MP+ D V W+++
Sbjct: 136 ADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANC-GDVASAYKVFDKMPEKDLVAWNSV 195

Query: 210 IGGYARVGRSTEAVALFREMQMAEVCPDEITMVSMLSACTDLGALELGKWIEAYIERHEI 269
           I G+A  G+  EA+AL+ EM    + PD  T+VS+LSAC  +GAL LGK +  Y+ +  +
Sbjct: 196 INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 255

Query: 270 HKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTIVSWTSVIVGMAMHGRGQEATCLFEE 329
            + +  SN L+D++A+CG + +A  LF  M +K  VSWTS+IVG+A++G G+EA  LF+ 
Sbjct: 256 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 315

Query: 330 MTSS-GVAPDDVAFIGLLSACSHSGLVERGREYFGSMMKKYKLVPKIEHYGCMVDMYCRT 389
           M S+ G+ P ++ F+G+L ACSH G+V+ G EYF  M ++YK+ P+IEH+GCMVD+  R 
Sbjct: 316 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 375

Query: 390 GLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFKLGEKITKLLMKHEPLHESNYVLLS 449
           G VK+A E++++MP++PN VI RTL+ AC  HG+  L E     +++ EP H  +YVLLS
Sbjct: 376 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 435

Query: 450 NIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNEIYEFVAGDKSHKQHKEIYEMVD 509
           N+YA    W    KIR+ M   G+KKVPG +++E+ N ++EF+ GDKSH Q   IY  + 
Sbjct: 436 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 495

Query: 510 EMGREMKKSGYRPSTSEVLLDINEEDKEDSLNRHSEKLAIAFGLLRTPPGTPIRIVKNLR 569
           EM   ++  GY P  S V +D+ EE+KE+++  HSEK+AIAF L+ TP  +PI +VKNLR
Sbjct: 496 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 555

Query: 570 VCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCSCGDFW 607
           VC+DCH A K +SK+Y+REI++RDR+RFHHFK+G CSC D+W
Sbjct: 556 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of Cucsat.G4739 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 499.6 bits (1285), Expect = 3.6e-141
Identity = 256/605 (42.31%), Postives = 390/605 (64.46%), Query Frame = 0

Query: 5   FTKPKLLRTINNVLASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNNPLV- 64
           FTK   + T+N              QN + L+  CN+L +L QI  + +K  + +   V 
Sbjct: 18  FTKHSKIDTVNT-------------QNPILLISKCNSLRELMQIQAYAIKSHIEDVSFVA 77

Query: 65  -LTKFASISSLIHATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALYGI 124
            L  F + S    +  Y A  LF A ++    D  +FN++ R Y++  +  +   +L+  
Sbjct: 78  KLINFCTESPTESSMSY-ARHLFEAMSEP---DIVIFNSMARGYSRFTNPLE-VFSLFVE 137

Query: 125 MLHDAILPNKFTYPFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMYSCCAG 184
           +L D ILP+ +T+P +LKACA  + L  G+ +H   +K G D +++V  T+++MY+ C  
Sbjct: 138 ILEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMKLGLDDNVYVCPTLINMYTECE- 197

Query: 185 GINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQMAEVCPDEITMVSMLS 244
            ++SAR VFD + +   V ++AMI GYAR  R  EA++LFREMQ   + P+EIT++S+LS
Sbjct: 198 DVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSLFREMQGKYLKPNEITLLSVLS 257

Query: 245 ACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTIVS 304
           +C  LG+L+LGKWI  Y ++H   K V+V+ ALIDMFAKCG +  A+ +F  M  K   +
Sbjct: 258 SCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAKCGSLDDAVSIFEKMRYKDTQA 317

Query: 305 WTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGSMM 364
           W+++IV  A HG+ +++  +FE M S  V PD++ F+GLL+ACSH+G VE GR+YF  M+
Sbjct: 318 WSAMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGLLNACSHTGRVEEGRKYFSQMV 377

Query: 365 KKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFKLG 424
            K+ +VP I+HYG MVD+  R G +++A EF+  +PI P P++ R L++AC  H    L 
Sbjct: 378 SKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISPTPMLWRILLAACSSHNNLDLA 437

Query: 425 EKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNE 484
           EK+++ + + +  H  +YV+LSN+YA+   WE    +R+VM+ +   KVPG + IE++N 
Sbjct: 438 EKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRKVMKDRKAVKVPGCSSIEVNNV 497

Query: 485 IYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVL-LDINEEDKEDSLNRHSEK 544
           ++EF +GD       +++  +DEM +E+K SGY P TS V+  ++N+++KE +L  HSEK
Sbjct: 498 VHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSMVVHANMNDQEKEITLRYHSEK 557

Query: 545 LAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQCS 604
           LAI FGLL TPPGT IR+VKNLRVC DCH+A+K IS I+ R++++RD  RFHHF+ G+CS
Sbjct: 558 LAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIFGRKVVLRDVQRFHHFEDGKCS 603

Query: 605 CGDFW 607
           CGDFW
Sbjct: 618 CGDFW 603

BLAST of Cucsat.G4739 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 486.5 bits (1251), Expect = 3.1e-137
Identity = 273/724 (37.71%), Postives = 381/724 (52.62%), Query Frame = 0

Query: 19  ASSTPNPRAPEQNCLALLQACNALPKLTQIHTHILKLGLHNNPLVLTK---FASISSLIH 78
           +S  P         L+LL  C  L  L  IH  ++K+GLHN    L+K   F  +S    
Sbjct: 23  SSDPPYDSIRNHPSLSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSPHFE 82

Query: 79  ATDYAASFLFSAEADTRLYDAFLFNTLIRAYAQTGHSKDKALALYGIMLHDAILPNKFTY 138
              YA S   + +    L    ++NT+ R +A +      AL LY  M+   +LPN +T+
Sbjct: 83  GLPYAISVFKTIQEPNLL----IWNTMFRGHALSS-DPVSALKLYVCMISLGLLPNSYTF 142

Query: 139 PFVLKACAGLEVLNLGQTVHGSVVKFGFDCDIHVQNTMVHMY------------------ 198
           PFVLK+CA  +    GQ +HG V+K G D D++V  +++ MY                  
Sbjct: 143 PFVLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPH 202

Query: 199 ------------SCCAGGINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREM 258
                           G I +A+K+FDE+P  D V+W+AMI GYA  G   EA+ LF++M
Sbjct: 203 RDVVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDM 262

Query: 259 QMAEVCPDE--------------------------------------------------- 318
               V PDE                                                   
Sbjct: 263 MKTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGEL 322

Query: 319 --------------------------------------------------ITMVSMLSAC 378
                                                             +TM+S+L AC
Sbjct: 323 ETACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPAC 382

Query: 379 TDLGALELGKWIEAYIERH--EIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTIVS 438
             LGA+++G+WI  YI++    +     +  +LIDM+AKCGDI  A ++F ++  K++ S
Sbjct: 383 AHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSS 442

Query: 439 WTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGSMM 498
           W ++I G AMHGR   +  LF  M   G+ PDD+ F+GLLSACSHSG+++ GR  F +M 
Sbjct: 443 WNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMT 502

Query: 499 KKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFKLG 558
           + YK+ PK+EHYGCM+D+   +GL KEA E +  M +EP+ VI  +L+ AC+ HG  +LG
Sbjct: 503 QDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELG 562

Query: 559 EKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEIDNE 607
           E   + L+K EP +  +YVLLSNIYA    W +  K R ++  KGMKKVPG + IEID+ 
Sbjct: 563 ESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSV 622

BLAST of Cucsat.G4739 vs. TAIR 10
Match: AT3G62890.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 474.9 bits (1221), Expect = 9.4e-134
Identity = 238/551 (43.19%), Postives = 358/551 (64.97%), Query Frame = 0

Query: 95  DAFLFNTLIRAYAQTGHS--KDKALALYGIMLHDAILPNKFTYPFVLKACAGLEVLNLGQ 154
           ++FL+N +IRA      S  +   +++Y  M +  + P+  T+PF+L +      L LGQ
Sbjct: 23  ESFLWNIIIRAIVHNVSSPQRHSPISVYLRMRNHRVSPDFHTFPFLLPSFHNPLHLPLGQ 82

Query: 155 TVHGSVVKFGFDCDIHVQNTMVHMYSCC------------------------------AG 214
             H  ++ FG D D  V+ ++++MYS C                              AG
Sbjct: 83  RTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGSKDLPAWNSVVNAYAKAG 142

Query: 215 GINSARKVFDEMPKSDSVTWSAMIGGYARVGRSTEAVALFREMQM-----AEVCPDEITM 274
            I+ ARK+FDEMP+ + ++WS +I GY   G+  EA+ LFREMQ+     A V P+E TM
Sbjct: 143 LIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFTM 202

Query: 275 VSMLSACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAM-N 334
            ++LSAC  LGALE GKW+ AYI+++ +   + +  ALIDM+AKCG + +A ++F A+ +
Sbjct: 203 STVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLERAKRVFNALGS 262

Query: 335 EKTIVSWTSVIVGMAMHGRGQEATCLFEEMTSS-GVAPDDVAFIGLLSACSHSGLVERGR 394
           +K + +++++I  +AM+G   E   LF EMT+S  + P+ V F+G+L AC H GL+  G+
Sbjct: 263 KKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGACVHRGLINEGK 322

Query: 395 EYFGSMMKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRG 454
            YF  M++++ + P I+HYGCMVD+Y R+GL+KEA  F+ +MP+EP+ +I  +L+S  R 
Sbjct: 323 SYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVLIWGSLLSGSRM 382

Query: 455 HGEFKLGEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGST 514
            G+ K  E   K L++ +P++   YVLLSN+YAKT  W +   IR  MEVKG+ KVPG +
Sbjct: 383 LGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEMEVKGINKVPGCS 442

Query: 515 MIEIDNEIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSL 574
            +E++  ++EFV GD+S ++ + IY M+DE+ + ++++GY   T EVLLD+NE+DKE +L
Sbjct: 443 YVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVTDTKEVLLDLNEKDKEIAL 502

Query: 575 NRHSEKLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHF 607
           + HSEKLAIAF L++T PGTP+RI+KNLR+C DCH   K ISK++ REI++RD NRFHHF
Sbjct: 503 SYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMISKLFSREIVVRDCNRFHHF 562

BLAST of Cucsat.G4739 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 473.4 bits (1217), Expect = 2.7e-133
Identity = 236/606 (38.94%), Postives = 371/606 (61.22%), Query Frame = 0

Query: 33  LALLQACNALPKLTQIHTHILKLGLHNNPLVLTKFASISSLIHATDYAASFLFSAEADTR 92
           ++ LQ C+   +L QIH  +LK GL  +   +TKF S      ++D+        +   R
Sbjct: 18  MSCLQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDR 77

Query: 93  LYDAFLFNTLIRAYAQTGHSKDKALALYGIMLHDAILPNKFTYPFVLKACAGLEVLNLGQ 152
             D FL+N +IR ++      +++L LY  ML  +   N +T+P +LKAC+ L       
Sbjct: 78  -PDTFLWNLMIRGFS-CSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETT 137

Query: 153 TVHGSVVKFGFDCDIHVQNTMVHMYSCCAGGINSARKVFDEMPKSDSVTWSAMIGGYARV 212
            +H  + K G++ D++  N++++ Y+   G    A  +FD +P+ D V+W+++I GY + 
Sbjct: 138 QIHAQITKLGYENDVYAVNSLINSYA-VTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKA 197

Query: 213 GR-------------------------------STEAVALFREMQMAEVCPDEITMVSML 272
           G+                               + EA+ LF EMQ ++V PD +++ + L
Sbjct: 198 GKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANAL 257

Query: 273 SACTDLGALELGKWIEAYIERHEIHKPVEVSNALIDMFAKCGDISKALKLFRAMNEKTIV 332
           SAC  LGALE GKWI +Y+ +  I     +   LIDM+AKCG++ +AL++F+ + +K++ 
Sbjct: 258 SACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQ 317

Query: 333 SWTSVIVGMAMHGRGQEATCLFEEMTSSGVAPDDVAFIGLLSACSHSGLVERGREYFGSM 392
           +WT++I G A HG G+EA   F EM   G+ P+ + F  +L+ACS++GLVE G+  F SM
Sbjct: 318 AWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSM 377

Query: 393 MKKYKLVPKIEHYGCMVDMYCRTGLVKEALEFVRNMPIEPNPVILRTLVSACRGHGEFKL 452
            + Y L P IEHYGC+VD+  R GL+ EA  F++ MP++PN VI   L+ ACR H   +L
Sbjct: 378 ERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIEL 437

Query: 453 GEKITKLLMKHEPLHESNYVLLSNIYAKTLSWEKKTKIREVMEVKGMKKVPGSTMIEIDN 512
           GE+I ++L+  +P H   YV  +NI+A    W+K  + R +M+ +G+ KVPG + I ++ 
Sbjct: 438 GEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEG 497

Query: 513 EIYEFVAGDKSHKQHKEIYEMVDEMGREMKKSGYRPSTSEVLLDINEEDKEDSL-NRHSE 572
             +EF+AGD+SH + ++I      M R+++++GY P   E+LLD+ ++D+ +++ ++HSE
Sbjct: 498 TTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSE 557

Query: 573 KLAIAFGLLRTPPGTPIRIVKNLRVCSDCHSASKFISKIYDREIIMRDRNRFHHFKSGQC 607
           KLAI +GL++T PGT IRI+KNLRVC DCH  +K ISKIY R+I+MRDR RFHHF+ G+C
Sbjct: 558 KLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKC 617

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A8MQA37.7e-14143.81Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q8LK935.0e-14042.31Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
Q9LN014.4e-13637.71Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q683I91.3e-13243.19Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX... [more]
Q9FJY73.8e-13238.94Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_004138859.10.0100.00pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus] >KGN6294... [more]
KAA0064932.10.098.51pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_008445200.10.098.35PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis m... [more]
XP_038884201.10.094.06pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida][more]
XP_022131416.10.090.10pentatricopeptide repeat-containing protein At4g21065-like [Momordica charantia]... [more]
Match NameE-valueIdentityDescription
A0A0A0LQ710.0100.00DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G3816... [more]
A0A5A7V9A40.098.51Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3BC370.098.35pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=36... [more]
A0A6J1BQ700.090.10pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charanti... [more]
A0A6J1GHH70.090.44pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita moschata... [more]
Match NameE-valueIdentityDescription
AT4G21065.15.5e-14243.81Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G02980.13.6e-14142.31Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08070.13.1e-13737.71Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT3G62890.19.4e-13443.19Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G66520.12.7e-13338.94Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (B10) v3
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 258..482
e-value: 1.3E-42
score: 148.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 151..252
e-value: 1.3E-22
score: 82.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 9..150
e-value: 3.4E-7
score: 32.0
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 373..397
e-value: 0.0012
score: 18.9
coord: 273..298
e-value: 8.7E-5
score: 22.5
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 95..143
e-value: 0.004
score: 17.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 273..296
e-value: 0.001
score: 17.1
coord: 200..234
e-value: 2.8E-8
score: 31.4
coord: 301..334
e-value: 1.1E-5
score: 23.2
coord: 374..397
e-value: 0.0026
score: 15.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 299..346
e-value: 6.7E-8
score: 32.6
coord: 198..244
e-value: 1.8E-10
score: 40.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 268..298
score: 8.61564
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 334..369
score: 8.516988
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 198..232
score: 12.254791
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 95..130
score: 10.095415
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 299..333
score: 10.610596
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 472..596
e-value: 2.1E-41
score: 140.8
NoneNo IPR availablePANTHERPTHR47926:SF239SUBFAMILY NOT NAMEDcoord: 30..592
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 30..592

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsat.G4739.T1Cucsat.G4739.T1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding