Cucsa.143210 (gene) Cucumber (Gy14) v1

NameCucsa.143210
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationscaffold01079 : 1525564 .. 1528271 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
GGCGGCTCTCAATGTGATATTCTTCAAGCTTCTTCACTAAGTCACTGTCTTCAGAAAAATGGTACGATGCGACTTTTTACTCAGTTCCTGCAAATCATTTCGCCAAATCAAACAAGTTCACGCCCGATTGATCACCACCGGCCTTATTCTACACCCAATCCCCACTAATAAACTCCTTAAACAACTTTCCTCAATATTCGCTCCAATTTCCTATGCCCACATGGTGTTCGACCATTTTCCTCAACCAGATCTCTTCCTCTACAACACCATTATCAAGGTCCTGGCATTTTCAACCACTTCTTCTGCTGATTCTTTCACGAAGTTTCGTTCTTTAATCCGCGAAGAAAGGTTAGTGCCCAATCAGTATTCGTTTGCATTTGCCTTCAAGGGCTGTGGCAGTGGTGTTGGGGTTCTGGAAGGGGAGCAAGTTCGTGTTCATGCTATTAAACTTGGTCTGGAAAACAATCTGTTTGTGACGAATGCGTTGATTGGGATGTATGTGAATTTGGATTTTGTTGTGGATGCTAGAAAGGTGTTTGATTGGAGTCCCAATAGAGATATGTATTCCTGGAATATCATGCTTAGTGGGTATGCGAGATTGGGGAAAATGGATGAAGCTCGGCAACTGTTTGATGAAATGCCTGAAAAAGATGTTGTGTCGTGGACAACAATGATTTCTGGTTGTCTTCAGGTAATTACATGAAAAACTGAAGTTCTTTCATTTTTTTGTTTCAAAATTTTGGGCTCAATTATAAGTTCAGGCTTTAAACTTCGAAGTTGTGTTTTTGTAACAATGTCCGTTTATCTTTTATTTAGTCTTCACTCTTAAATTTTTCTAATCAGCCTTTCAACTGTCTATTTTTTGTCAATGATTGGGAAGTTAGTGGAAAAAATTGTAGGTCAACACCTTGTTTGGTGACCGTTTCGATTTTAGTTATTGGTTTTTGAAAGTTAGGCTTATTTTCTCTCAAATTAAGCTAAGATCTTGTTTGAGAAATATTTCAATTTTGGTTATTGGTTCTTGAAAGTTAAGCAAATGTTTTGGCAAATTCCAAAAATAAGAACTAAGTTTTAAAAACCGACTATTTTTAGTTTAAAAAATTGGCTTGGTTTATGAAAACATTGGTACAAGTTAATTACTTAGCAAAGAAATTTAAAGATGAAATGAGTATAACCAAATGGTTAACAAACGGGGTGCAAGGGATAAAGAGACAATCTTTGTAGAAGCTTAAAAACCTTAGTAGAAGCTATAATTTTTCCAAAAATTTTAACGTTATATGATTCTAGTTACTTCTTGACACAGTAATACTGTGCTGGTATACGCTATGCCATAGTGAGAAGCCATATTTACTTTCTTTGCGAGAGAGAAGTATTTTTCTGTCTCGTTTCCTCAAACTTGCCCCTGCATCCATGGGGAAATCAAATGGTAAGCAATCACTGAGTTTATTATGAATTGTGTTTAGTTTCTACATATTGCTTAAAAAAaTCTGAAACAGGATCTTCATATCTAGGTTGGTTATTTCATGGAAGCTTTGGATATCTTCCACAACATGCTGGCAAAAGGGATGAGCCCAAACGAGTACACATTAGCCAGTTCCCTTGCTGCCTGTGCTAATCTTGTGGCATTGGATCAAGGAAGATGGATGCACGTTTATATTAAAAAGAACAATATTCAGATGAATGAGCGGTTGCTGGCTGGACTGATTGACATGTATGCAAAATGTGGAGAGTTAGAGTTCGCATCAAAGCTTTTCAACAGTAATCCACGGTTGAAGCGAAAGGTTTGGCCTTGGAATGCCATGATTGGTGGGTTTGCTGTGCATGGAAAATCTAAGGAAGCAATTGAGGTTTTTGAACAAATGAAGATAGAAAAAGTTTCTCCCAACAAAGTTACATTTGTTGCTTTGTTAAATGCTTGTAGTCATGGTAATAGAGTTGAGGAAGGAAGATACTATTTTGAATCAATGGCGAGTCATTACAGAGTCAAACCTGAGTTAGAGCATTACGGCTGTTTGGTCGATCTACTAGGACGTGCTGGGCGCTTGAAGGAAGCTGAAGAGATCATATCAAGTATGCATTTGACACCGGATGTCGCTATATGGGGTGCATTGCTTAGTGCTTGCAAAATTCATAAGGATGCTGAAATGGGAGAGAGAGTTGGGAAAATTGTTAAAGAGTTGGATCCTAACCATCTGGGCTGCCATGTTCTATTAGCAAATATATATTCTTTGACTGGGAATTGGAATGAAGCAAGAACATTAAGGGAGAAAATTGCAGAAAGTGGGAAAAAGAAAACTCCGGGTTGCAGCTCCATCGAGTTGAATGGGATGTTCCATCAATTTCTCGTCGGTGATCGGTCTCATCCTCAAACAAAACAACTCTATTTGTTCTTAGACGAGATGATCACCAAGTTGAAGATTGCTGGTTACATCCCTGAATCCGGAGAAGTTTTGCTTGACATCGACGACAATGAAGACAGAGAAACAGCTTTGTTAAAGCACAGTGAGAAGTTAGCCATTGCCTTTGGGTTGATGAATACAACACCCAAAACCCCGATTCGTATTGTGAAGAACTTGAGAGTATGTAGTGACTGTCATCTAGCGATAAAGTTCATTTCAAAGGTATACGACAGAGAGATTATCGTTAGGGATCGAATTCGATATCACCATTTTAAAGATGGAACTTGTTCGTGTAACGATTATTGGTAA

mRNA sequence

GGCGGCTCTCAATGTGATATTCTTCAAGCTTCTTCACTAAGTCACTGTCTTCAGAAAAATGGTACGATGCGACTTTTTACTCAGTTCCTGCAAATCATTTCGCCAAATCAAACAAGTTCACGCCCGATTGATCACCACCGGCCTTATTCTACACCCAATCCCCACTAATAAACTCCTTAAACAACTTTCCTCAATATTCGCTCCAATTTCCTATGCCCACATGGTGTTCGACCATTTTCCTCAACCAGATCTCTTCCTCTACAACACCATTATCAAGGTCCTGGCATTTTCAACCACTTCTTCTGCTGATTCTTTCACGAAGTTTCGTTCTTTAATCCGCGAAGAAAGGTTAGTGCCCAATCAGTATTCGTTTGCATTTGCCTTCAAGGGCTGTGGCAGTGGTGTTGGGGTTCTGGAAGGGGAGCAAGTTCGTGTTCATGCTATTAAACTTGGTCTGGAAAACAATCTGTTTGTGACGAATGCGTTGATTGGGATGTATGTGAATTTGGATTTTGTTGTGGATGCTAGAAAGGTGTTTGATTGGAGTCCCAATAGAGATATGTATTCCTGGAATATCATGCTTAGTGGGTATGCGAGATTGGGGAAAATGGATGAAGCTCGGCAACTGTTTGATGAAATGCCTGAAAAAGATGTTGTGTCGTGGACAACAATGATTTCTGGTTGTCTTCAGGTTGGTTATTTCATGGAAGCTTTGGATATCTTCCACAACATGCTGGCAAAAGGGATGAGCCCAAACGAGTACACATTAGCCAGTTCCCTTGCTGCCTGTGCTAATCTTGTGGCATTGGATCAAGGAAGATGGATGCACGTTTATATTAAAAAGAACAATATTCAGATGAATGAGCGGTTGCTGGCTGGACTGATTGACATGTATGCAAAATGTGGAGAGTTAGAGTTCGCATCAAAGCTTTTCAACAGTAATCCACGGTTGAAGCGAAAGGTTTGGCCTTGGAATGCCATGATTGGTGGGTTTGCTGTGCATGGAAAATCTAAGGAAGCAATTGAGGTTTTTGAACAAATGAAGATAGAAAAAGTTTCTCCCAACAAAGTTACATTTGTTGCTTTGTTAAATGCTTGTAGTCATGGTAATAGAGTTGAGGAAGGAAGATACTATTTTGAATCAATGGCGAGTCATTACAGAGTCAAACCTGAGTTAGAGCATTACGGCTGTTTGGTCGATCTACTAGGACGTGCTGGGCGCTTGAAGGAAGCTGAAGAGATCATATCAAGTATGCATTTGACACCGGATGTCGCTATATGGGGTGCATTGCTTAGTGCTTGCAAAATTCATAAGGATGCTGAAATGGGAGAGAGAGTTGGGAAAATTGTTAAAGAGTTGGATCCTAACCATCTGGGCTGCCATGTTCTATTAGCAAATATATATTCTTTGACTGGGAATTGGAATGAAGCAAGAACATTAAGGGAGAAAATTGCAGAAAGTGGGAAAAAGAAAACTCCGGGTTGCAGCTCCATCGAGTTGAATGGGATGTTCCATCAATTTCTCGTCGGTGATCGGTCTCATCCTCAAACAAAACAACTCTATTTGTTCTTAGACGAGATGATCACCAAGTTGAAGATTGCTGGTTACATCCCTGAATCCGGAGAAGTTTTGCTTGACATCGACGACAATGAAGACAGAGAAACAGCTTTGTTAAAGCACAGTGAGAAGTTAGCCATTGCCTTTGGGTTGATGAATACAACACCCAAAACCCCGATTCGTATTGTGAAGAACTTGAGAGTATGTAGTGACTGTCATCTAGCGATAAAGTTCATTTCAAAGGTATACGACAGAGAGATTATCGTTAGGGATCGAATTCGATATCACCATTTTAAAGATGGAACTTGTTCGTGTAACGATTATTGGTAA

Coding sequence (CDS)

ATGGTACGATGCGACTTTTTACTCAGTTCCTGCAAATCATTTCGCCAAATCAAACAAGTTCACGCCCGATTGATCACCACCGGCCTTATTCTACACCCAATCCCCACTAATAAACTCCTTAAACAACTTTCCTCAATATTCGCTCCAATTTCCTATGCCCACATGGTGTTCGACCATTTTCCTCAACCAGATCTCTTCCTCTACAACACCATTATCAAGGTCCTGGCATTTTCAACCACTTCTTCTGCTGATTCTTTCACGAAGTTTCGTTCTTTAATCCGCGAAGAAAGGTTAGTGCCCAATCAGTATTCGTTTGCATTTGCCTTCAAGGGCTGTGGCAGTGGTGTTGGGGTTCTGGAAGGGGAGCAAGTTCGTGTTCATGCTATTAAACTTGGTCTGGAAAACAATCTGTTTGTGACGAATGCGTTGATTGGGATGTATGTGAATTTGGATTTTGTTGTGGATGCTAGAAAGGTGTTTGATTGGAGTCCCAATAGAGATATGTATTCCTGGAATATCATGCTTAGTGGGTATGCGAGATTGGGGAAAATGGATGAAGCTCGGCAACTGTTTGATGAAATGCCTGAAAAAGATGTTGTGTCGTGGACAACAATGATTTCTGGTTGTCTTCAGGTTGGTTATTTCATGGAAGCTTTGGATATCTTCCACAACATGCTGGCAAAAGGGATGAGCCCAAACGAGTACACATTAGCCAGTTCCCTTGCTGCCTGTGCTAATCTTGTGGCATTGGATCAAGGAAGATGGATGCACGTTTATATTAAAAAGAACAATATTCAGATGAATGAGCGGTTGCTGGCTGGACTGATTGACATGTATGCAAAATGTGGAGAGTTAGAGTTCGCATCAAAGCTTTTCAACAGTAATCCACGGTTGAAGCGAAAGGTTTGGCCTTGGAATGCCATGATTGGTGGGTTTGCTGTGCATGGAAAATCTAAGGAAGCAATTGAGGTTTTTGAACAAATGAAGATAGAAAAAGTTTCTCCCAACAAAGTTACATTTGTTGCTTTGTTAAATGCTTGTAGTCATGGTAATAGAGTTGAGGAAGGAAGATACTATTTTGAATCAATGGCGAGTCATTACAGAGTCAAACCTGAGTTAGAGCATTACGGCTGTTTGGTCGATCTACTAGGACGTGCTGGGCGCTTGAAGGAAGCTGAAGAGATCATATCAAGTATGCATTTGACACCGGATGTCGCTATATGGGGTGCATTGCTTAGTGCTTGCAAAATTCATAAGGATGCTGAAATGGGAGAGAGAGTTGGGAAAATTGTTAAAGAGTTGGATCCTAACCATCTGGGCTGCCATGTTCTATTAGCAAATATATATTCTTTGACTGGGAATTGGAATGAAGCAAGAACATTAAGGGAGAAAATTGCAGAAAGTGGGAAAAAGAAAACTCCGGGTTGCAGCTCCATCGAGTTGAATGGGATGTTCCATCAATTTCTCGTCGGTGATCGGTCTCATCCTCAAACAAAACAACTCTATTTGTTCTTAGACGAGATGATCACCAAGTTGAAGATTGCTGGTTACATCCCTGAATCCGGAGAAGTTTTGCTTGACATCGACGACAATGAAGACAGAGAAACAGCTTTGTTAAAGCACAGTGAGAAGTTAGCCATTGCCTTTGGGTTGATGAATACAACACCCAAAACCCCGATTCGTATTGTGAAGAACTTGAGAGTATGTAGTGACTGTCATCTAGCGATAAAGTTCATTTCAAAGGTATACGACAGAGAGATTATCGTTAGGGATCGAATTCGATATCACCATTTTAAAGATGGAACTTGTTCGTGTAACGATTATTGGTAA

Protein sequence

MVRCDFLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHFPQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLEGEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYARLGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASSLAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYFESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKDAEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKDGTCSCNDYW*
BLAST of Cucsa.143210 vs. Swiss-Prot
Match: PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 529.3 bits (1362), Expect = 5.8e-149
Identity = 276/611 (45.17%), Postives = 379/611 (62.03%), Query Frame = 1

Query: 8   LSSCKSFRQIKQVHARLITTGLILHPIPTNKLLK----QLSSIFAPISYAHMVFDHFPQP 67
           L  C    ++KQ+HAR++ TGL+       K L       SS F P  YA +VFD F +P
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLP--YAQIVFDGFDRP 80

Query: 68  DLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLV-----PNQYSFAFAFKGCGSGVGV 127
           D FL+N +I+   FS +   +     RSL+  +R++      N Y+F    K C +    
Sbjct: 81  DTFLWNLMIR--GFSCSDEPE-----RSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAF 140

Query: 128 LEGEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGY 187
            E  Q+     KLG EN+++  N+LI  Y        A  +FD  P  D  SWN ++ GY
Sbjct: 141 EETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGY 200

Query: 188 ARLGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLA 247
            + GKMD A  LF +M EK+ +SWTTMISG +Q     EAL +FH M    + P+  +LA
Sbjct: 201 VKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLA 260

Query: 248 SSLAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRL 307
           ++L+ACA L AL+QG+W+H Y+ K  I+M+  L   LIDMYAKCGE+E A ++F +    
Sbjct: 261 NALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIK-- 320

Query: 308 KRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRY 367
           K+ V  W A+I G+A HG  +EAI  F +M+   + PN +TF A+L ACS+   VEEG+ 
Sbjct: 321 KKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKL 380

Query: 368 YFESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIH 427
            F SM   Y +KP +EHYGC+VDLLGRAG L EA+  I  M L P+  IWGALL AC+IH
Sbjct: 381 IFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIH 440

Query: 428 KDAEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSS 487
           K+ E+GE +G+I+  +DP H G +V  ANI+++   W++A   R  + E G  K PGCS+
Sbjct: 441 KNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCST 500

Query: 488 IELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETAL 547
           I L G  H+FL GDRSHP+ +++      M  KL+  GY+PE  E+LLD+ D+++RE  +
Sbjct: 501 ISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIV 560

Query: 548 LKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHF 607
            +HSEKLAI +GL+ T P T IRI+KNLRVC DCH   K ISK+Y R+I++RDR R+HHF
Sbjct: 561 HQHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHF 620

Query: 608 KDGTCSCNDYW 610
           +DG CSC DYW
Sbjct: 621 RDGKCSCGDYW 620

BLAST of Cucsa.143210 vs. Swiss-Prot
Match: PP425_ARATH (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 509.2 bits (1310), Expect = 6.2e-143
Identity = 261/620 (42.10%), Postives = 394/620 (63.55%), Query Frame = 1

Query: 8   LSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSI---FAPISYAHMVFDHFPQPD 67
           +++C++ R + Q+HA  I +G +   +   ++L+  ++       + YAH +F+  PQ +
Sbjct: 30  INNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRN 89

Query: 68  LFLYNTIIKVLAFSTTSSA-DSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLEGEQ 127
            F +NTII+  + S    A  + T F  ++ +E + PN+++F    K C     + EG+Q
Sbjct: 90  CFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQ 149

Query: 128 VRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWS--------------PNRDMY 187
           +   A+K G   + FV + L+ MYV   F+ DAR +F  +               + ++ 
Sbjct: 150 IHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEIV 209

Query: 188 SWNIMLSGYARLGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKG 247
            WN+M+ GY RLG    AR LFD+M ++ VVSW TMISG    G+F +A+++F  M    
Sbjct: 210 LWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFREMKKGD 269

Query: 248 MSPNEYTLASSLAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFAS 307
           + PN  TL S L A + L +L+ G W+H+Y + + I++++ L + LIDMY+KCG +E A 
Sbjct: 270 IRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAI 329

Query: 308 KLFNSNPRLKRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSH 367
            +F   PR    V  W+AMI GFA+HG++ +AI+ F +M+   V P+ V ++ LL ACSH
Sbjct: 330 HVFERLPR--ENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSH 389

Query: 368 GNRVEEGRYYFESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWG 427
           G  VEEGR YF  M S   ++P +EHYGC+VDLLGR+G L EAEE I +M + PD  IW 
Sbjct: 390 GGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWK 449

Query: 428 ALLSACKIHKDAEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESG 487
           ALL AC++  + EMG+RV  I+ ++ P+  G +V L+N+Y+  GNW+E   +R ++ E  
Sbjct: 450 ALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKD 509

Query: 488 KKKTPGCSSIELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDID 547
            +K PGCS I+++G+ H+F+V D SHP+ K++   L E+  KL++AGY P + +VLL+++
Sbjct: 510 IRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLE 569

Query: 548 DNEDRETALLKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIV 607
           + ED+E  L  HSEK+A AFGL++T+P  PIRIVKNLR+C DCH +IK ISKVY R+I V
Sbjct: 570 E-EDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITV 629

Query: 608 RDRIRYHHFKDGTCSCNDYW 610
           RDR R+HHF+DG+CSC DYW
Sbjct: 630 RDRKRFHHFQDGSCSCMDYW 646

BLAST of Cucsa.143210 vs. Swiss-Prot
Match: PP367_ARATH (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 481.9 bits (1239), Expect = 1.1e-134
Identity = 245/610 (40.16%), Postives = 380/610 (62.30%), Query Frame = 1

Query: 7   LLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQL---SSIFAP---ISYAHMVFDHF 66
           LL SC SF  +K +H  L+ T LI      ++LL      S+   P   + YA+ +F   
Sbjct: 18  LLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFSQI 77

Query: 67  PQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLE 126
             P+LF++N +I+   FST +       F + + + R+ P+  +F F  K       VL 
Sbjct: 78  QNPNLFVFNLLIR--CFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLV 137

Query: 127 GEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYAR 186
           GEQ     ++ G +N+++V N+L+ MY N  F+  A ++F     RD+ SW  M++GY +
Sbjct: 138 GEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCK 197

Query: 187 LGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASS 246
            G ++ AR++FDEMP +++ +W+ MI+G  +   F +A+D+F  M  +G+  NE  + S 
Sbjct: 198 CGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSV 257

Query: 247 LAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKR 306
           +++CA+L AL+ G   + Y+ K+++ +N  L   L+DM+ +CG++E A  +F   P    
Sbjct: 258 ISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPETDS 317

Query: 307 KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYF 366
               W+++I G AVHG + +A+  F QM      P  VTF A+L+ACSHG  VE+G   +
Sbjct: 318 L--SWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIY 377

Query: 367 ESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD 426
           E+M   + ++P LEHYGC+VD+LGRAG+L EAE  I  MH+ P+  I GALL ACKI+K+
Sbjct: 378 ENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKN 437

Query: 427 AEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIE 486
            E+ ERVG ++ ++ P H G +VLL+NIY+  G W++  +LR+ + E   KK PG S IE
Sbjct: 438 TEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIE 497

Query: 487 LNGMFHQFLVG-DRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALL 546
           ++G  ++F +G D+ HP+  ++    +E++ K+++ GY   +G+   D+D+ E++E+++ 
Sbjct: 498 IDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDE-EEKESSIH 557

Query: 547 KHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFK 606
            HSEKLAIA+G+M T P T IRIVKNLRVC DCH   K IS+VY RE+IVRDR R+HHF+
Sbjct: 558 MHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHFR 617

Query: 607 DGTCSCNDYW 610
           +G CSC DYW
Sbjct: 618 NGVCSCRDYW 622

BLAST of Cucsa.143210 vs. Swiss-Prot
Match: PP354_ARATH (Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis thaliana GN=ELI1 PE=3 SV=1)

HSP 1 Score: 480.3 bits (1235), Expect = 3.1e-134
Identity = 243/607 (40.03%), Postives = 378/607 (62.27%), Query Frame = 1

Query: 7   LLSSCKSFRQIKQVHARLITTGLILHP-IPT-NKLLKQLSSIFAPISYAHMVFDHFPQPD 66
           L+   +S  ++ Q+HA ++   L+LHP  P  N  L +  +    I ++  +F     PD
Sbjct: 35  LIDKSQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSLALFHQTIDPD 94

Query: 67  LFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLEGEQV 126
           LFL+   I   + +      +F  +  L+  E + PN+++F+   K C +  G L    +
Sbjct: 95  LFLFTAAINTASINGLKD-QAFLLYVQLLSSE-INPNEFTFSSLLKSCSTKSGKL----I 154

Query: 127 RVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYARLGKM 186
             H +K GL  + +V   L+ +Y     VV A+KVFD  P R + S   M++ YA+ G +
Sbjct: 155 HTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITCYAKQGNV 214

Query: 187 DEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGM-SPNEYTLASSLAA 246
           + AR LFD M E+D+VSW  MI G  Q G+  +AL +F  +LA+G   P+E T+ ++L+A
Sbjct: 215 EAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEITVVAALSA 274

Query: 247 CANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKRKVW 306
           C+ + AL+ GRW+HV++K + I++N ++  GLIDMY+KCG LE A  +FN  PR  + + 
Sbjct: 275 CSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDTPR--KDIV 334

Query: 307 PWNAMIGGFAVHGKSKEAIEVFEQMK-IEKVSPNKVTFVALLNACSHGNRVEEGRYYFES 366
            WNAMI G+A+HG S++A+ +F +M+ I  + P  +TF+  L AC+H   V EG   FES
Sbjct: 335 AWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIRIFES 394

Query: 367 MASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKDAE 426
           M   Y +KP++EHYGCLV LLGRAG+LK A E I +M++  D  +W ++L +CK+H D  
Sbjct: 395 MGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKLHGDFV 454

Query: 427 MGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIELN 486
           +G+ + + +  L+  + G +VLL+NIY+  G++     +R  + E G  K PG S+IE+ 
Sbjct: 455 LGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGISTIEIE 514

Query: 487 GMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLKHS 546
              H+F  GDR H ++K++Y  L ++  ++K  GY+P +  VL D+++ E +E +L  HS
Sbjct: 515 NKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYVPNTNTVLQDLEETE-KEQSLQVHS 574

Query: 547 EKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKDGT 606
           E+LAIA+GL++T P +P++I KNLRVCSDCH   K ISK+  R+I++RDR R+HHF DG+
Sbjct: 575 ERLAIAYGLISTKPGSPLKIFKNLRVCSDCHTVTKLISKITGRKIVMRDRNRFHHFTDGS 632

Query: 607 CSCNDYW 610
           CSC D+W
Sbjct: 635 CSCGDFW 632

BLAST of Cucsa.143210 vs. Swiss-Prot
Match: PP295_ARATH (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 477.6 bits (1228), Expect = 2.0e-133
Identity = 240/570 (42.11%), Postives = 378/570 (66.32%), Query Frame = 1

Query: 48  APISYAHMVFD-HFPQPDLFLYNTIIKVLAFSTTS-SADSFTKFRSLIREERLVPNQYSF 107
           A I+YA+ +F     + + FL+N II+ +  + +S    S       +R  R+ P+ ++F
Sbjct: 6   AIIAYANPIFHIRHLKLESFLWNIIIRAIVHNVSSPQRHSPISVYLRMRNHRVSPDFHTF 65

Query: 108 AFAFKGCGSGVGVLEGEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPN 167
            F      + + +  G++     +  GL+ + FV  +L+ MY +   +  A++VFD S +
Sbjct: 66  PFLLPSFHNPLHLPLGQRTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGS 125

Query: 168 RDMYSWNIMLSGYARLGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNM 227
           +D+ +WN +++ YA+ G +D+AR+LFDEMPE++V+SW+ +I+G +  G + EALD+F  M
Sbjct: 126 KDLPAWNSVVNAYAKAGLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREM 185

Query: 228 -LAKG----MSPNEYTLASSLAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYA 287
            L K     + PNE+T+++ L+AC  L AL+QG+W+H YI K +++++  L   LIDMYA
Sbjct: 186 QLPKPNEAFVRPNEFTMSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYA 245

Query: 288 KCGELEFASKLFNSNPRLKRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKI-EKVSPNKVT 347
           KCG LE A ++FN+    K+ V  ++AMI   A++G + E  ++F +M   + ++PN VT
Sbjct: 246 KCGSLERAKRVFNALGS-KKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVT 305

Query: 348 FVALLNACSHGNRVEEGRYYFESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSM 407
           FV +L AC H   + EG+ YF+ M   + + P ++HYGC+VDL GR+G +KEAE  I+SM
Sbjct: 306 FVGILGACVHRGLINEGKSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASM 365

Query: 408 HLTPDVAIWGALLSACKIHKDAEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEAR 467
            + PDV IWG+LLS  ++  D +  E   K + ELDP + G +VLL+N+Y+ TG W E +
Sbjct: 366 PMEPDVLIWGSLLSGSRMLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVK 425

Query: 468 TLREKIAESGKKKTPGCSSIELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIP 527
            +R ++   G  K PGCS +E+ G+ H+F+VGD S  +++++Y  LDE++ +L+ AGY+ 
Sbjct: 426 CIRHEMEVKGINKVPGCSYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVT 485

Query: 528 ESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFI 587
           ++ EVLLD+++ +D+E AL  HSEKLAIAF LM T P TP+RI+KNLR+C DCHL +K I
Sbjct: 486 DTKEVLLDLNE-KDKEIALSYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMI 545

Query: 588 SKVYDREIIVRDRIRYHHFKDGTCSCNDYW 610
           SK++ REI+VRD  R+HHF+DG+CSC D+W
Sbjct: 546 SKLFSREIVVRDCNRFHHFRDGSCSCRDFW 573

BLAST of Cucsa.143210 vs. TrEMBL
Match: A0A0A0LX83_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G612890 PE=4 SV=1)

HSP 1 Score: 1238.0 bits (3202), Expect = 0.0e+00
Identity = 609/609 (100.00%), Postives = 609/609 (100.00%), Query Frame = 1

Query: 1   MVRCDFLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHF 60
           MVRCDFLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHF
Sbjct: 1   MVRCDFLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHF 60

Query: 61  PQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLE 120
           PQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLE
Sbjct: 61  PQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLE 120

Query: 121 GEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYAR 180
           GEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYAR
Sbjct: 121 GEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYAR 180

Query: 181 LGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASS 240
           LGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASS
Sbjct: 181 LGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASS 240

Query: 241 LAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKR 300
           LAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKR
Sbjct: 241 LAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKR 300

Query: 301 KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYF 360
           KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYF
Sbjct: 301 KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYF 360

Query: 361 ESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD 420
           ESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD
Sbjct: 361 ESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD 420

Query: 421 AEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIE 480
           AEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIE
Sbjct: 421 AEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIE 480

Query: 481 LNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLK 540
           LNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLK
Sbjct: 481 LNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLK 540

Query: 541 HSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKD 600
           HSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKD
Sbjct: 541 HSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKD 600

Query: 601 GTCSCNDYW 610
           GTCSCNDYW
Sbjct: 601 GTCSCNDYW 609

BLAST of Cucsa.143210 vs. TrEMBL
Match: F6GUA4_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g01520 PE=4 SV=1)

HSP 1 Score: 891.3 bits (2302), Expect = 6.4e-256
Identity = 424/595 (71.26%), Postives = 502/595 (84.37%), Query Frame = 1

Query: 16  QIKQVHARLITTGLILHPIPTNKLLKQL-SSIFAPISYAHMVFDHFPQPDLFLYNTIIKV 75
           QIKQ HA LITTGLILHPI  NKLLK L +S F  +SYAH +FD  P+PD+F+YNT+IK 
Sbjct: 3   QIKQTHAHLITTGLILHPITANKLLKVLIASSFGSLSYAHQLFDQIPKPDVFIYNTMIKA 62

Query: 76  LAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLEGEQVRVHAIKLGLE 135
            A   TSS +S   F S++R    +PN+Y+F F FK CG+G+GVLEGEQ+RVHAIK+GLE
Sbjct: 63  HAVIPTSSHNSMRIFLSMVRVSGFLPNRYTFVFVFKACGNGLGVLEGEQIRVHAIKIGLE 122

Query: 136 NNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYARLGKMDEARQLFDEM 195
           +NLFVTNA+I MY N   V +AR+VFDWS ++D+YSWNIM+ GY   G++  A+++FDEM
Sbjct: 123 SNLFVTNAMIRMYANWGLVDEARRVFDWSLDQDLYSWNIMIGGYVGSGEIGRAKEMFDEM 182

Query: 196 PEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASSLAACANLVALDQGR 255
            E+DVVSWTT+I+G +QVG F EALD+FH ML  G  PNE+TLAS+LAACANLVALDQGR
Sbjct: 183 SERDVVSWTTIIAGYVQVGCFKEALDLFHEMLQTGPPPNEFTLASALAACANLVALDQGR 242

Query: 256 WMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKRKVWPWNAMIGGFAV 315
           W+HVYI K+ I+MNERLLA L+DMYAKCGE++FA+K+F+    LK KVWPWNAMIGG+A+
Sbjct: 243 WIHVYIDKSEIKMNERLLASLLDMYAKCGEIDFAAKVFHDEYGLKLKVWPWNAMIGGYAM 302

Query: 316 HGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYFESMASHYRVKPELE 375
           HGKSKEAI++FEQMK+EKVSPNKVTFVALLNACSHG  VEEGR YF+SMAS Y ++PE+E
Sbjct: 303 HGKSKEAIDLFEQMKVEKVSPNKVTFVALLNACSHGKLVEEGRGYFKSMASSYGIEPEIE 362

Query: 376 HYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKDAEMGERVGKIVKEL 435
           HYGC+VDLLGR+G LKEAEE + +M + PD  IWGALL AC+IHKD E G+R+GKI+KEL
Sbjct: 363 HYGCMVDLLGRSGLLKEAEETVFNMPMAPDATIWGALLGACRIHKDIERGQRIGKIIKEL 422

Query: 436 DPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIELNGMFHQFLVGDRS 495
           D +H+GCHVLLAN+YS +G W+EA+ +R+KI  SG+KKTPGCSSIELNG+FHQFLVGDRS
Sbjct: 423 DSDHIGCHVLLANLYSASGQWDEAKAVRQKIEVSGRKKTPGCSSIELNGVFHQFLVGDRS 482

Query: 496 HPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNT 555
           HPQTKQLYLFLDEM TKLK AGY+PE GEVLLDIDD ED+ETAL KHSEKLAIAFGL+NT
Sbjct: 483 HPQTKQLYLFLDEMTTKLKNAGYVPEFGEVLLDIDDEEDKETALSKHSEKLAIAFGLINT 542

Query: 556 TPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKDGTCSCNDYW 610
            P T IRIVKNLRVC+DCH A KFISKVY REIIVRDRIRYHHFKDG CSC DYW
Sbjct: 543 PPGTAIRIVKNLRVCADCHEATKFISKVYKREIIVRDRIRYHHFKDGFCSCKDYW 597

BLAST of Cucsa.143210 vs. TrEMBL
Match: A0A0B2QXA0_GLYSO (Pentatricopeptide repeat-containing protein OS=Glycine soja GN=glysoja_014027 PE=4 SV=1)

HSP 1 Score: 831.6 bits (2147), Expect = 6.0e-238
Identity = 401/604 (66.39%), Postives = 482/604 (79.80%), Query Frame = 1

Query: 7   LLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHFPQPDLF 66
           L+ SCKS +QIKQ HA+LITT LI HP+  NKLLK  +   A +SYAH +FD  PQPDLF
Sbjct: 22  LIDSCKSMQQIKQTHAQLITTALISHPVSANKLLKLAAC--ASLSYAHKLFDQIPQPDLF 81

Query: 67  LYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLEGEQVRV 126
           +YNT+IK  + S  S  +S   FRSL ++  L PN+YSF FAF  CG+G+GV EGEQVR+
Sbjct: 82  IYNTMIKAHSLSPHSCHNSLIVFRSLTQDLGLFPNRYSFVFAFSACGNGLGVQEGEQVRI 141

Query: 127 HAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYARLGKMDE 186
           HA+K+GLENN+FV NALIGMY     V +++KVF W+ +RD+YSWN +++ Y   G M  
Sbjct: 142 HAVKVGLENNVFVVNALIGMYGKWGLVGESQKVFQWAVDRDLYSWNTLIAAYVGSGNMSL 201

Query: 187 ARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASSLAACAN 246
           A++LFD M E+DVVSW+T+I+G +QVG FMEALD FH ML  G  PNEYTL S+LAAC+N
Sbjct: 202 AKELFDGMRERDVVSWSTIIAGYVQVGCFMEALDFFHEMLQIGPKPNEYTLVSALAACSN 261

Query: 247 LVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKRKVWPWN 306
           LVALDQG+W+H YI K  I+MNERLLA +IDMYAKCGE+E AS++F  + ++K+KVWPWN
Sbjct: 262 LVALDQGKWIHAYIGKGEIKMNERLLASIIDMYAKCGEIESASRVFFEH-KVKQKVWPWN 321

Query: 307 AMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYFESMASH 366
           AMIGGFA+HG   EAI VFEQMK+EK+SPNKVTF+ALLNACSHG  VEEG+ YF  M S 
Sbjct: 322 AMIGGFAMHGMPNEAINVFEQMKVEKISPNKVTFIALLNACSHGYMVEEGKLYFRLMVSD 381

Query: 367 YRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKDAEMGER 426
           Y + PE+EHYGC+VDLL R+G LKEAE++ISSM + PDVAIWGALL+AC+I+KD E G R
Sbjct: 382 YAITPEIEHYGCMVDLLSRSGLLKEAEDMISSMPMAPDVAIWGALLNACRIYKDMERGYR 441

Query: 427 VGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAES-GKKKTPGCSSIELNGMF 486
           +G+I+K +DPNH+GCHVLL+NIYS +G WNEAR LREK   S  +KK PGCSSIEL G F
Sbjct: 442 IGRIIKGMDPNHIGCHVLLSNIYSTSGRWNEARILREKNEISRDRKKIPGCSSIELKGTF 501

Query: 487 HQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLKHSEKL 546
           HQFLVGD+SHPQ+ ++Y FLDEM TKLK AGY+PE GE+L DIDD ED+ETAL  HSEKL
Sbjct: 502 HQFLVGDQSHPQSMEIYSFLDEMTTKLKSAGYVPELGELLHDIDDEEDKETALSVHSEKL 561

Query: 547 AIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKDGTCSC 606
           AIAFGLMNT   TPIRIVKNLRVC DCH A KFISKVY+R IIVRDR RYHHF+DG CSC
Sbjct: 562 AIAFGLMNTANGTPIRIVKNLRVCGDCHQATKFISKVYNRVIIVRDRTRYHHFEDGICSC 621

Query: 607 NDYW 610
            DYW
Sbjct: 622 KDYW 622

BLAST of Cucsa.143210 vs. TrEMBL
Match: G7KNB4_MEDTR (Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_6g060510 PE=4 SV=2)

HSP 1 Score: 824.3 bits (2128), Expect = 9.6e-236
Identity = 399/603 (66.17%), Postives = 483/603 (80.10%), Query Frame = 1

Query: 7   LLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHFPQPDLF 66
           L+  CKS  QIKQ HA LITT  I  P+  NK LK ++   A ++YAH +FD  PQPDLF
Sbjct: 18  LIDLCKSINQIKQTHANLITTAQITLPVIANKFLKNVA--LASLTYAHKLFDQIPQPDLF 77

Query: 67  LYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLEGEQVRV 126
           +YNT+IK  + S  S  DS   FRSLIR+    PN+YSF FAF  CG+G+ V EGEQV  
Sbjct: 78  IYNTMIKSHSMSPHSYLDSIAVFRSLIRDSGYFPNRYSFVFAFGACGNGMCVREGEQVFT 137

Query: 127 HAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYARLGKMDE 186
           HA+K+GL+ N+FV NALIGM+     V DAR VFD + +RD YSWN M+  Y   G M  
Sbjct: 138 HAVKVGLDGNVFVVNALIGMFGKWGRVEDARNVFDSAVDRDFYSWNTMIGAYVGSGNMVL 197

Query: 187 ARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASSLAACAN 246
           A++LFDEM E+DVVSW+T+I+G +QVG FMEALD FH ML   + PNEYT+ S+LAAC+N
Sbjct: 198 AKELFDEMHERDVVSWSTIIAGYVQVGCFMEALDFFHKMLQSEVKPNEYTMVSALAACSN 257

Query: 247 LVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKRKVWPWN 306
           LVALDQG+W+HVYI+++NI+MN+RLLA LIDMYAKCGE++ AS +F+ + ++KRKVWPWN
Sbjct: 258 LVALDQGKWIHVYIRRDNIKMNDRLLASLIDMYAKCGEIDSASSVFHEH-KVKRKVWPWN 317

Query: 307 AMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYFESMASH 366
           AMIGGFA+HGK +EAI VFE+MK+EKVSPNKVTF+ALLNACSHG  V+EG+ YFE MAS 
Sbjct: 318 AMIGGFAMHGKPEEAINVFEKMKVEKVSPNKVTFIALLNACSHGYMVKEGKSYFELMASD 377

Query: 367 YRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKDAEMGER 426
           Y + PE+EHYGC+VDLL R+G LK++EE+I SM + PDVAIWGALL+AC+I+KD E G R
Sbjct: 378 YGINPEIEHYGCMVDLLSRSGHLKDSEEMILSMPMAPDVAIWGALLNACRIYKDMERGYR 437

Query: 427 VGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREK-IAESGKKKTPGCSSIELNGMF 486
           +G+I+KE+DPNH+GC+VLL NIYS +G WNEAR +REK    S +KK PG SSIELNG+F
Sbjct: 438 IGRIIKEIDPNHIGCNVLLGNIYSTSGRWNEARMVREKNEINSDRKKIPGFSSIELNGVF 497

Query: 487 HQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLKHSEKL 546
           H+FLVGDRSHPQ++++Y FLDEMI+KLKIAGY+PE GEVLLD DD ED+ETAL  HSEKL
Sbjct: 498 HEFLVGDRSHPQSREIYSFLDEMISKLKIAGYVPELGEVLLDFDDEEDKETALSVHSEKL 557

Query: 547 AIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKDGTCSC 606
           AIAFGLMNT P TPIRIVKNLRVC DCH A KFISKVYDR IIVRDR+RYHHFKDG CSC
Sbjct: 558 AIAFGLMNTAPGTPIRIVKNLRVCGDCHQATKFISKVYDRVIIVRDRMRYHHFKDGICSC 617

Query: 607 NDY 609
            DY
Sbjct: 618 KDY 617

BLAST of Cucsa.143210 vs. TrEMBL
Match: V7C2N4_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_004G127800g PE=4 SV=1)

HSP 1 Score: 823.9 bits (2127), Expect = 1.3e-235
Identity = 396/604 (65.56%), Postives = 479/604 (79.30%), Query Frame = 1

Query: 7   LLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHFPQPDLF 66
           L+ SCKS +QIKQ H++L+TT LI HP+  NKLLK  +   A +SYAH +FD  PQPDLF
Sbjct: 23  LIESCKSMQQIKQTHSQLVTTALISHPVSANKLLKLAAC--ASLSYAHKLFDQIPQPDLF 82

Query: 67  LYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLEGEQVRV 126
           +YNT+IK  +    S  +SF  FRSL R+  L PN+YSF FAF  CG+G+ + EG+QVRV
Sbjct: 83  IYNTMIKAHSLLPHSCHNSFGVFRSLTRDSDLFPNRYSFVFAFSACGNGLSMQEGQQVRV 142

Query: 127 HAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYARLGKMDE 186
           HA+K+GLENN+FV NALI MY     V +  KVF W+ +RD+YSWN M++ +   G M  
Sbjct: 143 HAVKVGLENNVFVLNALISMYGKWGLVEEGWKVFQWAVDRDLYSWNTMIAAFVGSGDMSR 202

Query: 187 ARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASSLAACAN 246
           A++LFD M EKDVVSW+T+I+G +QVG F+EALD F+ ML  G  PNEYTL S+ AAC+N
Sbjct: 203 AKELFDGMQEKDVVSWSTIIAGYVQVGCFVEALDFFNEMLQIGPRPNEYTLVSAFAACSN 262

Query: 247 LVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKRKVWPWN 306
           LVALDQG+W+H YI +  I+MNERLLA +IDMYAKCGE+E AS++F  N ++K+KVWPWN
Sbjct: 263 LVALDQGKWIHAYIGRGEIKMNERLLASIIDMYAKCGEIESASRIF-FNYKVKQKVWPWN 322

Query: 307 AMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYFESMASH 366
           AMIGGFA+HGK  EAI VFEQMK++KVSPNKVTF+ALLNACSHG  VEEG+ YF  M S 
Sbjct: 323 AMIGGFAMHGKPNEAINVFEQMKVKKVSPNKVTFIALLNACSHGYMVEEGKLYFRLMVSD 382

Query: 367 YRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKDAEMGER 426
           Y + PE+EHYGC+VDLL R+G LK+AE++ISSM + PDVAIWGALL+AC+I+KD E G R
Sbjct: 383 YAITPEIEHYGCMVDLLSRSGFLKKAEDMISSMPMAPDVAIWGALLNACRIYKDIERGYR 442

Query: 427 VGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESG-KKKTPGCSSIELNGMF 486
           +G+I+K++DPNH+GCHVLL+NIYS +G WNEAR LREK   S  +KK PGCSSIEL G F
Sbjct: 443 IGRIIKDMDPNHIGCHVLLSNIYSTSGRWNEARMLREKKELSNERKKIPGCSSIELKGTF 502

Query: 487 HQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLKHSEKL 546
           HQFLVGDRSHP+++++Y FLDEM  KLK AGY+PE GE+L DIDD ED+ETAL  HSEKL
Sbjct: 503 HQFLVGDRSHPESREIYSFLDEMTIKLKSAGYVPEFGELLRDIDDEEDKETALSVHSEKL 562

Query: 547 AIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKDGTCSC 606
           AIAFGLMNT   TPIRIVKNLRVC DCH A KFISKVYDR IIVRDR RYHHFKDG CSC
Sbjct: 563 AIAFGLMNTAYGTPIRIVKNLRVCGDCHQATKFISKVYDRVIIVRDRTRYHHFKDGICSC 622

Query: 607 NDYW 610
            DYW
Sbjct: 623 KDYW 623

BLAST of Cucsa.143210 vs. TAIR10
Match: AT5G66520.1 (AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 529.3 bits (1362), Expect = 3.2e-150
Identity = 276/611 (45.17%), Postives = 379/611 (62.03%), Query Frame = 1

Query: 8   LSSCKSFRQIKQVHARLITTGLILHPIPTNKLLK----QLSSIFAPISYAHMVFDHFPQP 67
           L  C    ++KQ+HAR++ TGL+       K L       SS F P  YA +VFD F +P
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLP--YAQIVFDGFDRP 80

Query: 68  DLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLV-----PNQYSFAFAFKGCGSGVGV 127
           D FL+N +I+   FS +   +     RSL+  +R++      N Y+F    K C +    
Sbjct: 81  DTFLWNLMIR--GFSCSDEPE-----RSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAF 140

Query: 128 LEGEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGY 187
            E  Q+     KLG EN+++  N+LI  Y        A  +FD  P  D  SWN ++ GY
Sbjct: 141 EETTQIHAQITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGY 200

Query: 188 ARLGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLA 247
            + GKMD A  LF +M EK+ +SWTTMISG +Q     EAL +FH M    + P+  +LA
Sbjct: 201 VKAGKMDIALTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLA 260

Query: 248 SSLAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRL 307
           ++L+ACA L AL+QG+W+H Y+ K  I+M+  L   LIDMYAKCGE+E A ++F +    
Sbjct: 261 NALSACAQLGALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIK-- 320

Query: 308 KRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRY 367
           K+ V  W A+I G+A HG  +EAI  F +M+   + PN +TF A+L ACS+   VEEG+ 
Sbjct: 321 KKSVQAWTALISGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKL 380

Query: 368 YFESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIH 427
            F SM   Y +KP +EHYGC+VDLLGRAG L EA+  I  M L P+  IWGALL AC+IH
Sbjct: 381 IFYSMERDYNLKPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIH 440

Query: 428 KDAEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSS 487
           K+ E+GE +G+I+  +DP H G +V  ANI+++   W++A   R  + E G  K PGCS+
Sbjct: 441 KNIELGEEIGEILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCST 500

Query: 488 IELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETAL 547
           I L G  H+FL GDRSHP+ +++      M  KL+  GY+PE  E+LLD+ D+++RE  +
Sbjct: 501 ISLEGTTHEFLAGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIV 560

Query: 548 LKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHF 607
            +HSEKLAI +GL+ T P T IRI+KNLRVC DCH   K ISK+Y R+I++RDR R+HHF
Sbjct: 561 HQHSEKLAITYGLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHF 620

Query: 608 KDGTCSCNDYW 610
           +DG CSC DYW
Sbjct: 621 RDGKCSCGDYW 620

BLAST of Cucsa.143210 vs. TAIR10
Match: AT5G48910.1 (AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 509.2 bits (1310), Expect = 3.5e-144
Identity = 261/620 (42.10%), Postives = 394/620 (63.55%), Query Frame = 1

Query: 8   LSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSI---FAPISYAHMVFDHFPQPD 67
           +++C++ R + Q+HA  I +G +   +   ++L+  ++       + YAH +F+  PQ +
Sbjct: 30  INNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRN 89

Query: 68  LFLYNTIIKVLAFSTTSSA-DSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLEGEQ 127
            F +NTII+  + S    A  + T F  ++ +E + PN+++F    K C     + EG+Q
Sbjct: 90  CFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQ 149

Query: 128 VRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWS--------------PNRDMY 187
           +   A+K G   + FV + L+ MYV   F+ DAR +F  +               + ++ 
Sbjct: 150 IHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEIV 209

Query: 188 SWNIMLSGYARLGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKG 247
            WN+M+ GY RLG    AR LFD+M ++ VVSW TMISG    G+F +A+++F  M    
Sbjct: 210 LWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFREMKKGD 269

Query: 248 MSPNEYTLASSLAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFAS 307
           + PN  TL S L A + L +L+ G W+H+Y + + I++++ L + LIDMY+KCG +E A 
Sbjct: 270 IRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAI 329

Query: 308 KLFNSNPRLKRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSH 367
            +F   PR    V  W+AMI GFA+HG++ +AI+ F +M+   V P+ V ++ LL ACSH
Sbjct: 330 HVFERLPR--ENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSH 389

Query: 368 GNRVEEGRYYFESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWG 427
           G  VEEGR YF  M S   ++P +EHYGC+VDLLGR+G L EAEE I +M + PD  IW 
Sbjct: 390 GGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWK 449

Query: 428 ALLSACKIHKDAEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESG 487
           ALL AC++  + EMG+RV  I+ ++ P+  G +V L+N+Y+  GNW+E   +R ++ E  
Sbjct: 450 ALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKD 509

Query: 488 KKKTPGCSSIELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDID 547
            +K PGCS I+++G+ H+F+V D SHP+ K++   L E+  KL++AGY P + +VLL+++
Sbjct: 510 IRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLE 569

Query: 548 DNEDRETALLKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIV 607
           + ED+E  L  HSEK+A AFGL++T+P  PIRIVKNLR+C DCH +IK ISKVY R+I V
Sbjct: 570 E-EDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITV 629

Query: 608 RDRIRYHHFKDGTCSCNDYW 610
           RDR R+HHF+DG+CSC DYW
Sbjct: 630 RDRKRFHHFQDGSCSCMDYW 646

BLAST of Cucsa.143210 vs. TAIR10
Match: AT5G06540.1 (AT5G06540.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 481.9 bits (1239), Expect = 5.9e-136
Identity = 245/610 (40.16%), Postives = 380/610 (62.30%), Query Frame = 1

Query: 7   LLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQL---SSIFAP---ISYAHMVFDHF 66
           LL SC SF  +K +H  L+ T LI      ++LL      S+   P   + YA+ +F   
Sbjct: 18  LLQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFSQI 77

Query: 67  PQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLE 126
             P+LF++N +I+   FST +       F + + + R+ P+  +F F  K       VL 
Sbjct: 78  QNPNLFVFNLLIR--CFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLV 137

Query: 127 GEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYAR 186
           GEQ     ++ G +N+++V N+L+ MY N  F+  A ++F     RD+ SW  M++GY +
Sbjct: 138 GEQTHSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCK 197

Query: 187 LGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASS 246
            G ++ AR++FDEMP +++ +W+ MI+G  +   F +A+D+F  M  +G+  NE  + S 
Sbjct: 198 CGMVENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSV 257

Query: 247 LAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKR 306
           +++CA+L AL+ G   + Y+ K+++ +N  L   L+DM+ +CG++E A  +F   P    
Sbjct: 258 ISSCAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPETDS 317

Query: 307 KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYF 366
               W+++I G AVHG + +A+  F QM      P  VTF A+L+ACSHG  VE+G   +
Sbjct: 318 L--SWSSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIY 377

Query: 367 ESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD 426
           E+M   + ++P LEHYGC+VD+LGRAG+L EAE  I  MH+ P+  I GALL ACKI+K+
Sbjct: 378 ENMKKDHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKN 437

Query: 427 AEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIE 486
            E+ ERVG ++ ++ P H G +VLL+NIY+  G W++  +LR+ + E   KK PG S IE
Sbjct: 438 TEVAERVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIE 497

Query: 487 LNGMFHQFLVG-DRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALL 546
           ++G  ++F +G D+ HP+  ++    +E++ K+++ GY   +G+   D+D+ E++E+++ 
Sbjct: 498 IDGKINKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDE-EEKESSIH 557

Query: 547 KHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFK 606
            HSEKLAIA+G+M T P T IRIVKNLRVC DCH   K IS+VY RE+IVRDR R+HHF+
Sbjct: 558 MHSEKLAIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHFR 617

Query: 607 DGTCSCNDYW 610
           +G CSC DYW
Sbjct: 618 NGVCSCRDYW 622

BLAST of Cucsa.143210 vs. TAIR10
Match: AT4G37380.1 (AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 480.3 bits (1235), Expect = 1.7e-135
Identity = 243/607 (40.03%), Postives = 378/607 (62.27%), Query Frame = 1

Query: 7   LLSSCKSFRQIKQVHARLITTGLILHP-IPT-NKLLKQLSSIFAPISYAHMVFDHFPQPD 66
           L+   +S  ++ Q+HA ++   L+LHP  P  N  L +  +    I ++  +F     PD
Sbjct: 35  LIDKSQSVDEVLQIHAAILRHNLLLHPRYPVLNLKLHRAYASHGKIRHSLALFHQTIDPD 94

Query: 67  LFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLEGEQV 126
           LFL+   I   + +      +F  +  L+  E + PN+++F+   K C +  G L    +
Sbjct: 95  LFLFTAAINTASINGLKD-QAFLLYVQLLSSE-INPNEFTFSSLLKSCSTKSGKL----I 154

Query: 127 RVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYARLGKM 186
             H +K GL  + +V   L+ +Y     VV A+KVFD  P R + S   M++ YA+ G +
Sbjct: 155 HTHVLKFGLGIDPYVATGLVDVYAKGGDVVSAQKVFDRMPERSLVSSTAMITCYAKQGNV 214

Query: 187 DEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGM-SPNEYTLASSLAA 246
           + AR LFD M E+D+VSW  MI G  Q G+  +AL +F  +LA+G   P+E T+ ++L+A
Sbjct: 215 EAARALFDSMCERDIVSWNVMIDGYAQHGFPNDALMLFQKLLAEGKPKPDEITVVAALSA 274

Query: 247 CANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKRKVW 306
           C+ + AL+ GRW+HV++K + I++N ++  GLIDMY+KCG LE A  +FN  PR  + + 
Sbjct: 275 CSQIGALETGRWIHVFVKSSRIRLNVKVCTGLIDMYSKCGSLEEAVLVFNDTPR--KDIV 334

Query: 307 PWNAMIGGFAVHGKSKEAIEVFEQMK-IEKVSPNKVTFVALLNACSHGNRVEEGRYYFES 366
            WNAMI G+A+HG S++A+ +F +M+ I  + P  +TF+  L AC+H   V EG   FES
Sbjct: 335 AWNAMIAGYAMHGYSQDALRLFNEMQGITGLQPTDITFIGTLQACAHAGLVNEGIRIFES 394

Query: 367 MASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKDAE 426
           M   Y +KP++EHYGCLV LLGRAG+LK A E I +M++  D  +W ++L +CK+H D  
Sbjct: 395 MGQEYGIKPKIEHYGCLVSLLGRAGQLKRAYETIKNMNMDADSVLWSSVLGSCKLHGDFV 454

Query: 427 MGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIELN 486
           +G+ + + +  L+  + G +VLL+NIY+  G++     +R  + E G  K PG S+IE+ 
Sbjct: 455 LGKEIAEYLIGLNIKNSGIYVLLSNIYASVGDYEGVAKVRNLMKEKGIVKEPGISTIEIE 514

Query: 487 GMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLKHS 546
              H+F  GDR H ++K++Y  L ++  ++K  GY+P +  VL D+++ E +E +L  HS
Sbjct: 515 NKVHEFRAGDREHSKSKEIYTMLRKISERIKSHGYVPNTNTVLQDLEETE-KEQSLQVHS 574

Query: 547 EKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKDGT 606
           E+LAIA+GL++T P +P++I KNLRVCSDCH   K ISK+  R+I++RDR R+HHF DG+
Sbjct: 575 ERLAIAYGLISTKPGSPLKIFKNLRVCSDCHTVTKLISKITGRKIVMRDRNRFHHFTDGS 632

Query: 607 CSCNDYW 610
           CSC D+W
Sbjct: 635 CSCGDFW 632

BLAST of Cucsa.143210 vs. TAIR10
Match: AT3G62890.1 (AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 477.6 bits (1228), Expect = 1.1e-134
Identity = 240/570 (42.11%), Postives = 378/570 (66.32%), Query Frame = 1

Query: 48  APISYAHMVFD-HFPQPDLFLYNTIIKVLAFSTTS-SADSFTKFRSLIREERLVPNQYSF 107
           A I+YA+ +F     + + FL+N II+ +  + +S    S       +R  R+ P+ ++F
Sbjct: 6   AIIAYANPIFHIRHLKLESFLWNIIIRAIVHNVSSPQRHSPISVYLRMRNHRVSPDFHTF 65

Query: 108 AFAFKGCGSGVGVLEGEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPN 167
            F      + + +  G++     +  GL+ + FV  +L+ MY +   +  A++VFD S +
Sbjct: 66  PFLLPSFHNPLHLPLGQRTHAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGS 125

Query: 168 RDMYSWNIMLSGYARLGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNM 227
           +D+ +WN +++ YA+ G +D+AR+LFDEMPE++V+SW+ +I+G +  G + EALD+F  M
Sbjct: 126 KDLPAWNSVVNAYAKAGLIDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREM 185

Query: 228 -LAKG----MSPNEYTLASSLAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYA 287
            L K     + PNE+T+++ L+AC  L AL+QG+W+H YI K +++++  L   LIDMYA
Sbjct: 186 QLPKPNEAFVRPNEFTMSTVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYA 245

Query: 288 KCGELEFASKLFNSNPRLKRKVWPWNAMIGGFAVHGKSKEAIEVFEQMKI-EKVSPNKVT 347
           KCG LE A ++FN+    K+ V  ++AMI   A++G + E  ++F +M   + ++PN VT
Sbjct: 246 KCGSLERAKRVFNALGS-KKDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVT 305

Query: 348 FVALLNACSHGNRVEEGRYYFESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSM 407
           FV +L AC H   + EG+ YF+ M   + + P ++HYGC+VDL GR+G +KEAE  I+SM
Sbjct: 306 FVGILGACVHRGLINEGKSYFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASM 365

Query: 408 HLTPDVAIWGALLSACKIHKDAEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEAR 467
            + PDV IWG+LLS  ++  D +  E   K + ELDP + G +VLL+N+Y+ TG W E +
Sbjct: 366 PMEPDVLIWGSLLSGSRMLGDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVK 425

Query: 468 TLREKIAESGKKKTPGCSSIELNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIP 527
            +R ++   G  K PGCS +E+ G+ H+F+VGD S  +++++Y  LDE++ +L+ AGY+ 
Sbjct: 426 CIRHEMEVKGINKVPGCSYVEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVT 485

Query: 528 ESGEVLLDIDDNEDRETALLKHSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFI 587
           ++ EVLLD+++ +D+E AL  HSEKLAIAF LM T P TP+RI+KNLR+C DCHL +K I
Sbjct: 486 DTKEVLLDLNE-KDKEIALSYHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMI 545

Query: 588 SKVYDREIIVRDRIRYHHFKDGTCSCNDYW 610
           SK++ REI+VRD  R+HHF+DG+CSC D+W
Sbjct: 546 SKLFSREIVVRDCNRFHHFRDGSCSCRDFW 573

BLAST of Cucsa.143210 vs. NCBI nr
Match: gi|449442683|ref|XP_004139110.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis sativus])

HSP 1 Score: 1238.0 bits (3202), Expect = 0.0e+00
Identity = 609/609 (100.00%), Postives = 609/609 (100.00%), Query Frame = 1

Query: 1   MVRCDFLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHF 60
           MVRCDFLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHF
Sbjct: 1   MVRCDFLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHF 60

Query: 61  PQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLE 120
           PQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLE
Sbjct: 61  PQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLE 120

Query: 121 GEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYAR 180
           GEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYAR
Sbjct: 121 GEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYAR 180

Query: 181 LGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASS 240
           LGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASS
Sbjct: 181 LGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASS 240

Query: 241 LAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKR 300
           LAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKR
Sbjct: 241 LAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKR 300

Query: 301 KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYF 360
           KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYF
Sbjct: 301 KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYF 360

Query: 361 ESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD 420
           ESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD
Sbjct: 361 ESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD 420

Query: 421 AEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIE 480
           AEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIE
Sbjct: 421 AEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIE 480

Query: 481 LNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLK 540
           LNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLK
Sbjct: 481 LNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLK 540

Query: 541 HSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKD 600
           HSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKD
Sbjct: 541 HSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKD 600

Query: 601 GTCSCNDYW 610
           GTCSCNDYW
Sbjct: 601 GTCSCNDYW 609

BLAST of Cucsa.143210 vs. NCBI nr
Match: gi|659099142|ref|XP_008450449.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis melo])

HSP 1 Score: 1198.0 bits (3098), Expect = 0.0e+00
Identity = 584/609 (95.89%), Postives = 601/609 (98.69%), Query Frame = 1

Query: 1   MVRCDFLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHF 60
           MVRCDFLL SCKSFRQIKQVHA+LIT+GLILHPIPTNKLLKQLSSIFAPISYAHMVFDHF
Sbjct: 1   MVRCDFLLGSCKSFRQIKQVHAQLITSGLILHPIPTNKLLKQLSSIFAPISYAHMVFDHF 60

Query: 61  PQPDLFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLE 120
           PQPDLFLYNTIIKVLAFSTTSSADSFT+FRSLIREERLVPNQYSFAFAFK CGSGVGVLE
Sbjct: 61  PQPDLFLYNTIIKVLAFSTTSSADSFTRFRSLIREERLVPNQYSFAFAFKACGSGVGVLE 120

Query: 121 GEQVRVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYAR 180
           GEQVRVHA+KLGLENNLFVTNALIGMYVNLDFVVDARKVF+WSP RDMYSWNIMLSGYAR
Sbjct: 121 GEQVRVHALKLGLENNLFVTNALIGMYVNLDFVVDARKVFEWSPYRDMYSWNIMLSGYAR 180

Query: 181 LGKMDEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASS 240
           LGKMDEARQLFDEMPE+DVVSWTTMISGCLQVG+FMEA+DIFHNMLAKGMSPNE+TLAS+
Sbjct: 181 LGKMDEARQLFDEMPERDVVSWTTMISGCLQVGHFMEAVDIFHNMLAKGMSPNEHTLASA 240

Query: 241 LAACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKR 300
           L+ACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNP+L R
Sbjct: 241 LSACANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPQLMR 300

Query: 301 KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYF 360
           KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFV+LLNACSHGNRV+EGRYYF
Sbjct: 301 KVWPWNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVSLLNACSHGNRVKEGRYYF 360

Query: 361 ESMASHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD 420
           ESMASHY VKP LEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD
Sbjct: 361 ESMASHYGVKPVLEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKD 420

Query: 421 AEMGERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIE 480
            EMGER+GKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIA SGKKKTPGCSSIE
Sbjct: 421 VEMGERIGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAVSGKKKTPGCSSIE 480

Query: 481 LNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLK 540
           LNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGY+PESGEVLLDIDDNEDRETALLK
Sbjct: 481 LNGMFHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYVPESGEVLLDIDDNEDRETALLK 540

Query: 541 HSEKLAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKD 600
           HSEKLAIAFGLMNTTPKTPIRIVKNLRVC+DCHLAIKFISKVYDREIIVRDRIRYHHFKD
Sbjct: 541 HSEKLAIAFGLMNTTPKTPIRIVKNLRVCNDCHLAIKFISKVYDREIIVRDRIRYHHFKD 600

Query: 601 GTCSCNDYW 610
           GTCSCNDYW
Sbjct: 601 GTCSCNDYW 609

BLAST of Cucsa.143210 vs. NCBI nr
Match: gi|225435554|ref|XP_002283117.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520 isoform X1 [Vitis vinifera])

HSP 1 Score: 902.9 bits (2332), Expect = 3.1e-259
Identity = 430/605 (71.07%), Postives = 508/605 (83.97%), Query Frame = 1

Query: 6   FLLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQL-SSIFAPISYAHMVFDHFPQPD 65
           F L SCKS  QIKQ HA LITTGLILHPI  NKLLK L +S F  +SYAH +FD  P+PD
Sbjct: 20  FSLESCKSMNQIKQTHAHLITTGLILHPITANKLLKVLIASSFGSLSYAHQLFDQIPKPD 79

Query: 66  LFLYNTIIKVLAFSTTSSADSFTKFRSLIREERLVPNQYSFAFAFKGCGSGVGVLEGEQV 125
           +F+YNT+IK  A   TSS +S   F S++R    +PN+Y+F F FK CG+G+GVLEGEQ+
Sbjct: 80  VFIYNTMIKAHAVIPTSSHNSMRIFLSMVRVSGFLPNRYTFVFVFKACGNGLGVLEGEQI 139

Query: 126 RVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYARLGKM 185
           RVHAIK+GLE+NLFVTNA+I MY N   V +AR+VFDWS ++D+YSWNIM+ GY   G++
Sbjct: 140 RVHAIKIGLESNLFVTNAMIRMYANWGLVDEARRVFDWSLDQDLYSWNIMIGGYVGSGEI 199

Query: 186 DEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASSLAAC 245
             A+++FDEM E+DVVSWTT+I+G +QVG F EALD+FH ML  G  PNE+TLAS+LAAC
Sbjct: 200 GRAKEMFDEMSERDVVSWTTIIAGYVQVGCFKEALDLFHEMLQTGPPPNEFTLASALAAC 259

Query: 246 ANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKRKVWP 305
           ANLVALDQGRW+HVYI K+ I+MNERLLA L+DMYAKCGE++FA+K+F+    LK KVWP
Sbjct: 260 ANLVALDQGRWIHVYIDKSEIKMNERLLASLLDMYAKCGEIDFAAKVFHDEYGLKLKVWP 319

Query: 306 WNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYFESMA 365
           WNAMIGG+A+HGKSKEAI++FEQMK+EKVSPNKVTFVALLNACSHG  VEEGR YF+SMA
Sbjct: 320 WNAMIGGYAMHGKSKEAIDLFEQMKVEKVSPNKVTFVALLNACSHGKLVEEGRGYFKSMA 379

Query: 366 SHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKDAEMG 425
           S Y ++PE+EHYGC+VDLLGR+G LKEAEE + +M + PD  IWGALL AC+IHKD E G
Sbjct: 380 SSYGIEPEIEHYGCMVDLLGRSGLLKEAEETVFNMPMAPDATIWGALLGACRIHKDIERG 439

Query: 426 ERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIELNGM 485
           +R+GKI+KELD +H+GCHVLLAN+YS +G W+EA+ +R+KI  SG+KKTPGCSSIELNG+
Sbjct: 440 QRIGKIIKELDSDHIGCHVLLANLYSASGQWDEAKAVRQKIEVSGRKKTPGCSSIELNGV 499

Query: 486 FHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLKHSEK 545
           FHQFLVGDRSHPQTKQLYLFLDEM TKLK AGY+PE GEVLLDIDD ED+ETAL KHSEK
Sbjct: 500 FHQFLVGDRSHPQTKQLYLFLDEMTTKLKNAGYVPEFGEVLLDIDDEEDKETALSKHSEK 559

Query: 546 LAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKDGTCS 605
           LAIAFGL+NT P T IRIVKNLRVC+DCH A KFISKVY REIIVRDRIRYHHFKDG CS
Sbjct: 560 LAIAFGLINTPPGTAIRIVKNLRVCADCHEATKFISKVYKREIIVRDRIRYHHFKDGFCS 619

Query: 606 CNDYW 610
           C DYW
Sbjct: 620 CKDYW 624

BLAST of Cucsa.143210 vs. NCBI nr
Match: gi|1009166612|ref|XP_015901687.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like isoform X1 [Ziziphus jujuba])

HSP 1 Score: 895.6 bits (2313), Expect = 4.9e-257
Identity = 434/605 (71.74%), Postives = 498/605 (82.31%), Query Frame = 1

Query: 7   LLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLS-SIFAPISYAHMVFDHFPQPDL 66
           LL SC S  QIKQ HA +ITTGLIL PIP NKLLK L+ S FA +SYAH VFD  P PD+
Sbjct: 96  LLESCMSLNQIKQAHAHMITTGLILRPIPANKLLKLLAFSSFASLSYAHRVFDQIPTPDI 155

Query: 67  FLYNTIIKVLAFSTTSSADSFTKFRSLIRE-ERLVPNQYSFAFAFKGCGSGVGVLEGEQV 126
           F +NT+IK    S TSS D+   FRS++ +   ++PNQY+F F  K CG+G+GVLEGEQV
Sbjct: 156 FHFNTMIKAHILSPTSSHDALMLFRSMMMQGSSILPNQYTFVFVLKACGNGLGVLEGEQV 215

Query: 127 RVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYARLGKM 186
           RVHAIK+GLE NLFVTNALIGM+V    V DA KVFDWS +RD+YSWNIM+ GY  LGKM
Sbjct: 216 RVHAIKVGLEGNLFVTNALIGMHVKWGLVEDATKVFDWSTDRDLYSWNIMVGGYVGLGKM 275

Query: 187 DEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASSLAAC 246
           + A +LF+ M E+DVVSW+T+I+G +QVG FMEALD+FH ML +G  PN++TL S+L AC
Sbjct: 276 NRAMELFNGMRERDVVSWSTIIAGYVQVGCFMEALDLFHRMLQEGPKPNQFTLVSALTAC 335

Query: 247 ANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKRKVWP 306
           ANLVALDQGRW+HVYI KN I MNERLLA LIDMY KCGE+EFASK+F++   LK KVWP
Sbjct: 336 ANLVALDQGRWIHVYIGKNKITMNERLLASLIDMYVKCGEIEFASKVFSNEHGLKHKVWP 395

Query: 307 WNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYFESMA 366
           WNAMIGGFA+HGKS EAI +FEQMKIEKVS N VTFVALLNACSHG+ VEEGR YF+ MA
Sbjct: 396 WNAMIGGFAMHGKSNEAIHLFEQMKIEKVSANNVTFVALLNACSHGDMVEEGRNYFKLMA 455

Query: 367 SHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKDAEMG 426
           S Y ++P++EHYGC+VDLLGRAG LKEAEE ISSM + PD AIWGALL AC+IHKD E G
Sbjct: 456 SSYGIEPQIEHYGCMVDLLGRAGLLKEAEETISSMPVAPDAAIWGALLGACRIHKDIERG 515

Query: 427 ERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIELNGM 486
           ER+G IVKELDPNH+GC VL+AN+YS+ G W EAR +RE I  SG+KKTPGC+SIELNG 
Sbjct: 516 ERIGNIVKELDPNHIGCQVLMANMYSVCGRWKEARIIRENIEVSGRKKTPGCTSIELNGR 575

Query: 487 FHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLKHSEK 546
           FHQFLVGDRSHP+TKQLYLFLDEM +KLKIAGY+P  GEVLLDID+ ED+ETAL +HSEK
Sbjct: 576 FHQFLVGDRSHPETKQLYLFLDEMASKLKIAGYVPRLGEVLLDIDEEEDKETALSEHSEK 635

Query: 547 LAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKDGTCS 606
           LAIAFGLMNT P TPIRIVKNLRVC DCH A K+ISKVYDREIIVRDRIRYHHFK   CS
Sbjct: 636 LAIAFGLMNTAPGTPIRIVKNLRVCGDCHQATKYISKVYDREIIVRDRIRYHHFKGENCS 695

Query: 607 CNDYW 610
           C DYW
Sbjct: 696 CKDYW 700

BLAST of Cucsa.143210 vs. NCBI nr
Match: gi|1009166614|ref|XP_015901688.1| (PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like isoform X2 [Ziziphus jujuba])

HSP 1 Score: 895.6 bits (2313), Expect = 4.9e-257
Identity = 434/605 (71.74%), Postives = 498/605 (82.31%), Query Frame = 1

Query: 7   LLSSCKSFRQIKQVHARLITTGLILHPIPTNKLLKQLS-SIFAPISYAHMVFDHFPQPDL 66
           LL SC S  QIKQ HA +ITTGLIL PIP NKLLK L+ S FA +SYAH VFD  P PD+
Sbjct: 30  LLESCMSLNQIKQAHAHMITTGLILRPIPANKLLKLLAFSSFASLSYAHRVFDQIPTPDI 89

Query: 67  FLYNTIIKVLAFSTTSSADSFTKFRSLIRE-ERLVPNQYSFAFAFKGCGSGVGVLEGEQV 126
           F +NT+IK    S TSS D+   FRS++ +   ++PNQY+F F  K CG+G+GVLEGEQV
Sbjct: 90  FHFNTMIKAHILSPTSSHDALMLFRSMMMQGSSILPNQYTFVFVLKACGNGLGVLEGEQV 149

Query: 127 RVHAIKLGLENNLFVTNALIGMYVNLDFVVDARKVFDWSPNRDMYSWNIMLSGYARLGKM 186
           RVHAIK+GLE NLFVTNALIGM+V    V DA KVFDWS +RD+YSWNIM+ GY  LGKM
Sbjct: 150 RVHAIKVGLEGNLFVTNALIGMHVKWGLVEDATKVFDWSTDRDLYSWNIMVGGYVGLGKM 209

Query: 187 DEARQLFDEMPEKDVVSWTTMISGCLQVGYFMEALDIFHNMLAKGMSPNEYTLASSLAAC 246
           + A +LF+ M E+DVVSW+T+I+G +QVG FMEALD+FH ML +G  PN++TL S+L AC
Sbjct: 210 NRAMELFNGMRERDVVSWSTIIAGYVQVGCFMEALDLFHRMLQEGPKPNQFTLVSALTAC 269

Query: 247 ANLVALDQGRWMHVYIKKNNIQMNERLLAGLIDMYAKCGELEFASKLFNSNPRLKRKVWP 306
           ANLVALDQGRW+HVYI KN I MNERLLA LIDMY KCGE+EFASK+F++   LK KVWP
Sbjct: 270 ANLVALDQGRWIHVYIGKNKITMNERLLASLIDMYVKCGEIEFASKVFSNEHGLKHKVWP 329

Query: 307 WNAMIGGFAVHGKSKEAIEVFEQMKIEKVSPNKVTFVALLNACSHGNRVEEGRYYFESMA 366
           WNAMIGGFA+HGKS EAI +FEQMKIEKVS N VTFVALLNACSHG+ VEEGR YF+ MA
Sbjct: 330 WNAMIGGFAMHGKSNEAIHLFEQMKIEKVSANNVTFVALLNACSHGDMVEEGRNYFKLMA 389

Query: 367 SHYRVKPELEHYGCLVDLLGRAGRLKEAEEIISSMHLTPDVAIWGALLSACKIHKDAEMG 426
           S Y ++P++EHYGC+VDLLGRAG LKEAEE ISSM + PD AIWGALL AC+IHKD E G
Sbjct: 390 SSYGIEPQIEHYGCMVDLLGRAGLLKEAEETISSMPVAPDAAIWGALLGACRIHKDIERG 449

Query: 427 ERVGKIVKELDPNHLGCHVLLANIYSLTGNWNEARTLREKIAESGKKKTPGCSSIELNGM 486
           ER+G IVKELDPNH+GC VL+AN+YS+ G W EAR +RE I  SG+KKTPGC+SIELNG 
Sbjct: 450 ERIGNIVKELDPNHIGCQVLMANMYSVCGRWKEARIIRENIEVSGRKKTPGCTSIELNGR 509

Query: 487 FHQFLVGDRSHPQTKQLYLFLDEMITKLKIAGYIPESGEVLLDIDDNEDRETALLKHSEK 546
           FHQFLVGDRSHP+TKQLYLFLDEM +KLKIAGY+P  GEVLLDID+ ED+ETAL +HSEK
Sbjct: 510 FHQFLVGDRSHPETKQLYLFLDEMASKLKIAGYVPRLGEVLLDIDEEEDKETALSEHSEK 569

Query: 547 LAIAFGLMNTTPKTPIRIVKNLRVCSDCHLAIKFISKVYDREIIVRDRIRYHHFKDGTCS 606
           LAIAFGLMNT P TPIRIVKNLRVC DCH A K+ISKVYDREIIVRDRIRYHHFK   CS
Sbjct: 570 LAIAFGLMNTAPGTPIRIVKNLRVCGDCHQATKYISKVYDREIIVRDRIRYHHFKGENCS 629

Query: 607 CNDYW 610
           C DYW
Sbjct: 630 CKDYW 634

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PP449_ARATH5.8e-14945.17Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana GN... [more]
PP425_ARATH6.2e-14342.10Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana GN... [more]
PP367_ARATH1.1e-13440.16Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana GN... [more]
PP354_ARATH3.1e-13440.03Pentatricopeptide repeat-containing protein ELI1, chloroplastic OS=Arabidopsis t... [more]
PP295_ARATH2.0e-13342.11Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana GN... [more]
Match NameE-valueIdentityDescription
A0A0A0LX83_CUCSA0.0e+00100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_1G612890 PE=4 SV=1[more]
F6GUA4_VITVI6.4e-25671.26Putative uncharacterized protein OS=Vitis vinifera GN=VIT_06s0004g01520 PE=4 SV=... [more]
A0A0B2QXA0_GLYSO6.0e-23866.39Pentatricopeptide repeat-containing protein OS=Glycine soja GN=glysoja_014027 PE... [more]
G7KNB4_MEDTR9.6e-23666.17Pentatricopeptide (PPR) repeat protein OS=Medicago truncatula GN=MTR_6g060510 PE... [more]
V7C2N4_PHAVU1.3e-23565.56Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_004G127800g PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G66520.13.2e-15045.17 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G48910.13.5e-14442.10 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G06540.15.9e-13640.16 Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G37380.11.7e-13540.03 Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G62890.11.1e-13442.11 Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
gi|449442683|ref|XP_004139110.1|0.0e+00100.00PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis s... [more]
gi|659099142|ref|XP_008450449.1|0.0e+0095.89PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like [Cucumis m... [more]
gi|225435554|ref|XP_002283117.1|3.1e-25971.07PREDICTED: pentatricopeptide repeat-containing protein At5g66520 isoform X1 [Vit... [more]
gi|1009166612|ref|XP_015901687.1|4.9e-25771.74PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like isoform X1... [more]
gi|1009166614|ref|XP_015901688.1|4.9e-25771.74PREDICTED: pentatricopeptide repeat-containing protein At5g66520-like isoform X2... [more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
Vocabulary: Molecular Function
TermDefinition
GO:0008270zinc ion binding
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008568 microtubule-severing ATPase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.143210.1Cucsa.143210.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 275..293
score: 0.42coord: 375..399
score: 0
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 166..194
score: 3.
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 305..348
score: 1.1E-11coord: 197..244
score: 3.0
IPR002885Pentatricopeptide repeatTIGRFAMsTIGR00756TIGR00756coord: 338..365
score: 0.0012coord: 305..336
score: 1.2E-6coord: 200..234
score: 1.2E-8coord: 169..200
score: 2.
IPR002885Pentatricopeptide repeatPROFILEPS51375PPRcoord: 136..166
score: 5.338coord: 167..201
score: 13.395coord: 233..267
score: 5.108coord: 336..366
score: 7.772coord: 64..100
score: 5.59coord: 301..335
score: 10.731coord: 438..472
score: 5.536coord: 372..402
score: 7.202coord: 202..232
score: 8.725coord: 268..298
score: 5
IPR011990Tetratricopeptide-like helical domainGENE3DG3DSA:1.25.40.10coord: 432..449
score: 0.001coord: 267..328
score: 0.001coord: 178..228
score: 8.2E-7coord: 450..466
score: 8.
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 14..479
score: 1.3E
NoneNo IPR availablePANTHERPTHR24015:SF786SUBFAMILY NOT NAMEDcoord: 14..479
score: 1.3E

The following gene(s) are paralogous to this gene:

None