Csor.00g128750 (gene) Silver-seed gourd (wild; sororia) v1

Overview
Name: Csor.00g128750
Type: gene
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: Pentatricopeptide repeat-containing protein
Location: Csor_Chr20: 683843 .. 686279 (-)
RNA-Seq Expression: Csor.00g128750
Synteny: Csor.00g128750
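
The location field can be turned into a feature length with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive, as in GFF-style annotations; this record does not state the convention. The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

# Pattern for a location string such as "Csor_Chr20: 683843 .. 686279 (-)".
LOCATION = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into (sequence id, start, end, strand)."""
    match = LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seq_id, start, end, strand = match.groups()
    return seq_id, int(start), int(end), strand


def span_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


seq_id, start, end, strand = parse_location("Csor_Chr20: 683843 .. 686279 (-)")
print(seq_id, strand, span_length(start, end))
```

Under that assumption the gene spans 2437 bp on the minus strand of Csor_Chr20.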
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTACAGTTTTGTATTCGAAGCTTTAGATTTCACTTCGCTCAAATCGCTCGATTCCAATTCAGAAACTTCGTTCGAAGAACAGAGCCAAATTCTTCCGTTCACATCAACCTCCTCACCCTGTGCTTCAACGCTCAATCGCTTCGTCAAACCAAAGAAGTCCACGCCATTTGCCTTCTCAATGGCTTGCTTCCTCATAGTGTATCACTCTGTGCTTCCCTTATTCTTAATTACGCCAAGTTTCAGCACCCAGAATCGTTCTGTACTCTGTTCCATCAAACTGTCCAGAATTGTCGTACTGCGTTCCTGTGGAATACCTTGATTCGCGCTCACTCCATTGCTGGGAATGGGACGCTTGATGGGTTGGAGACGTACAACAGGATGGTTCGATTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATGTTCTGATTCGCTTGATATTTGCAAGGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGTTTTGATTCTGATGTCTATGTTGGCAATACGCTGTTGATGCTGTATGGGAATTGTGGGTTCTTAAATGATGCTAAAAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGGGAGGCTCGTAACTATTACTTTTGGATGACTTTGAGGTCCGGAATTCAACCAAATTTGGTGAGTGTTATTAGTCTTTTACCCATTTCTGCTGGCCTTGAAGACGAGGAGATGACAAGACGAATTCATTGTTACATTGTGAAAGTTGGTTTGGATTCTTTGGTAACCTCTTGCAATGCACTTGTCGATGCGTATGGGAAATGTGGGAGTGTGAAAGCTTCATGGCAAGTTTTTGATGAGATAATTGAGAAGAATGAAGTCTCATGGAATTCAATCATCAATGGTCTAGCTTTTAAGGGTCATTTCCGGGATGCCTTGGATGTTTTTAGGATGATGATCGATGCAGGAACTAAACCGAACTCGGTCACCATTTCGAGCATTCTTCCTGTGTTTGTTGAGCTTGAATGTTTCAAAGCAGGAAAAGAAATTCATGGGTTCAGTATGAGAATGGGAACAGAAACTGATCTTTTCATTGCAAATTCCCTGATCGATATGTATGCCAAGTCTGGTCACTCAACTGAGGCATCTAGCATATTCCACAACATGGATGGAAGGAACATAGTTTCTTGGAACGCTATGATAGCTAATTATGTTCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGTCCCAATGCAGTGACTTTTACCAATGTTCTTCCTGCTTGTGCACGTTCGGGTCACCTTGGTCCTGGCAAAGAAATACATGCCATGGGCGTTCGTTTAGGACTAACATCTGATTTGTTTGTAACCAATGCTCTGACCGACATGTATGCAAAATGTGGTTGCTTTCGTTCTGCTCGAAACGTCTTTAACACTTCCCATAAAGATGAAGTTTCTTATAACATATTAATTACAGGATATTCCGAAACAAACGATTGCTTGGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTGGTAAAAAAGCCTGATGTCGTTTCCTTTATGGGGGTCATATCAGCATGTGCAAACCTAGCTGCAGTCAAGCAAGGTAAAGAGATTCATGGTGTTGCATTAAGAAATCATCTTAACCCTCATCTATTTGTCTCAAACTCCCTTTTGGACTTTTATACAAAATGTGGAAGAATTGATCTTGCTTGTAAGATCTTCAATCAAATTCTATTCAAAGATGTAGCATCTTGGAATACTATGATTTTAGGGTATGGAATGATAGGAGAGTTGGAAACTGCAATTAATATGTTTGAAGCAATGAGGGATGATAAAGTGCAATATGATTTAGTTTCATATATTGCAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTCAGCG
AGATGTTAGCTCAACATCTTGAACCCACTGAAGTGCACTATACATGTCTGGTGGATCTACTCGGGCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTTGGGGAGCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGCAAGGCAGCAGAGCAGTTATTTGAGCTAAAGCCTCAGCATTGTGAATACTATATTCTTCTTGCAAACATGCATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGATTCATGACCAGCTGCATGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATCAGGTGGTTTACTGGCAGAATTCGTTTGA

mRNA sequence

ATGTTACAGTTTTGTATTCGAAGCTTTAGATTTCACTTCGCTCAAATCGCTCGATTCCAATTCAGAAACTTCGTTCGAAGAACAGAGCCAAATTCTTCCGTTCACATCAACCTCCTCACCCTGTGCTTCAACGCTCAATCGCTTCGTCAAACCAAAGAAGTCCACGCCATTTGCCTTCTCAATGGCTTGCTTCCTCATAGTGTATCACTCTGTGCTTCCCTTATTCTTAATTACGCCAAGTTTCAGCACCCAGAATCGTTCTGTACTCTGTTCCATCAAACTGTCCAGAATTGTCGTACTGCGTTCCTGTGGAATACCTTGATTCGCGCTCACTCCATTGCTGGGAATGGGACGCTTGATGGGTTGGAGACGTACAACAGGATGGTTCGATTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATGTTCTGATTCGCTTGATATTTGCAAGGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGTTTTGATTCTGATGTCTATGTTGGCAATACGCTGTTGATGCTGTATGGGAATTGTGGGTTCTTAAATGATGCTAAAAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGGGAGGCTCGTAACTATTACTTTTGGATGACTTTGAGGTCCGGAATTCAACCAAATTTGGTGAGTGTTATTAGTCTTTTACCCATTTCTGCTGGCCTTGAAGACGAGGAGATGACAAGACGAATTCATTGTTACATTGTGAAAGTTGGTTTGGATTCTTTGGTAACCTCTTGCAATGCACTTGTCGATGCGTATGGGAAATGTGGGAGTGTGAAAGCTTCATGGCAAGTTTTTGATGAGATAATTGAGAAGAATGAAGTCTCATGGAATTCAATCATCAATGGTCTAGCTTTTAAGGGTCATTTCCGGGATGCCTTGGATGTTTTTAGGATGATGATCGATGCAGGAACTAAACCGAACTCGGTCACCATTTCGAGCATTCTTCCTGTGTTTGTTGAGCTTGAATGTTTCAAAGCAGGAAAAGAAATTCATGGGTTCAGTATGAGAATGGGAACAGAAACTGATCTTTTCATTGCAAATTCCCTGATCGATATGTATGCCAAGTCTGGTCACTCAACTGAGGCATCTAGCATATTCCACAACATGGATGGAAGGAACATAGTTTCTTGGAACGCTATGATAGCTAATTATGTTCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGTCCCAATGCAGTGACTTTTACCAATGTTCTTCCTGCTTGTGCACGTTCGGGTCACCTTGGTCCTGGCAAAGAAATACATGCCATGGGCGTTCGTTTAGGACTAACATCTGATTTGTTTGTAACCAATGCTCTGACCGACATGTATGCAAAATGTGGTTGCTTTCGTTCTGCTCGAAACGTCTTTAACACTTCCCATAAAGATGAAGTTTCTTATAACATATTAATTACAGGATATTCCGAAACAAACGATTGCTTGGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTGGGTATGGAATGATAGGAGAGTTGGAAACTGCAATTAATATGTTTGAAGCAATGAGGGATGATAAAGTGCAATATGATTTAGTTTCATATATTGCAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTCAGCGAGATGTTAGCTCAACATCTTGAACCCACTGAAGTGCACTATACATGTCTGGTGGATCTACTCGGGCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTTGGGGAGCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGCAAGGCAGCAGAGCAGTTATTTGAGCTAAAGCCTCAGCATTGTGAATACTATAT
TCTTCTTGCAAACATGCATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGATTCATGACCAGCTGCATGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATCAGGTGGTTTACTGGCAGAATTCGTTTGA

Coding sequence (CDS)

ATGTTACAGTTTTGTATTCGAAGCTTTAGATTTCACTTCGCTCAAATCGCTCGATTCCAATTCAGAAACTTCGTTCGAAGAACAGAGCCAAATTCTTCCGTTCACATCAACCTCCTCACCCTGTGCTTCAACGCTCAATCGCTTCGTCAAACCAAAGAAGTCCACGCCATTTGCCTTCTCAATGGCTTGCTTCCTCATAGTGTATCACTCTGTGCTTCCCTTATTCTTAATTACGCCAAGTTTCAGCACCCAGAATCGTTCTGTACTCTGTTCCATCAAACTGTCCAGAATTGTCGTACTGCGTTCCTGTGGAATACCTTGATTCGCGCTCACTCCATTGCTGGGAATGGGACGCTTGATGGGTTGGAGACGTACAACAGGATGGTTCGATTCGGTGTTCAACTCGATGACCATACATTTCCTTTTGTTCTCAAGATATGTTCTGATTCGCTTGATATTTGCAAGGGTATGGAGGTTCATGGGGTTGTGTTTAAGTTGGGTTTTGATTCTGATGTCTATGTTGGCAATACGCTGTTGATGCTGTATGGGAATTGTGGGTTCTTAAATGATGCTAAAAAGGTGTTCGATGAAATGTCTGAGAGAGATGTCGTCTCGTGGAATACGGTTATTGGGCTCCTTTCAGTTAATGGGGATTATAGGGAGGCTCGTAACTATTACTTTTGGATGACTTTGAGGTCCGGAATTCAACCAAATTTGGTGAGTGTTATTAGTCTTTTACCCATTTCTGCTGGCCTTGAAGACGAGGAGATGACAAGACGAATTCATTGTTACATTGTGAAAGTTGGTTTGGATTCTTTGGTAACCTCTTGCAATGCACTTGTCGATGCGTATGGGAAATGTGGGAGTGTGAAAGCTTCATGGCAAGTTTTTGATGAGATAATTGAGAAGAATGAAGTCTCATGGAATTCAATCATCAATGGTCTAGCTTTTAAGGGTCATTTCCGGGATGCCTTGGATGTTTTTAGGATGATGATCGATGCAGGAACTAAACCGAACTCGGTCACCATTTCGAGCATTCTTCCTGTGTTTGTTGAGCTTGAATGTTTCAAAGCAGGAAAAGAAATTCATGGGTTCAGTATGAGAATGGGAACAGAAACTGATCTTTTCATTGCAAATTCCCTGATCGATATGTATGCCAAGTCTGGTCACTCAACTGAGGCATCTAGCATATTCCACAACATGGATGGAAGGAACATAGTTTCTTGGAACGCTATGATAGCTAATTATGTTCTAAATGGGGTCGCGTTGGAAGCAATAAGATTTGTAATACTATTGCAAGAGAGTGGAGAACGTCCCAATGCAGTGACTTTTACCAATGTTCTTCCTGCTTGTGCACGTTCGGGTCACCTTGGTCCTGGCAAAGAAATACATGCCATGGGCGTTCGTTTAGGACTAACATCTGATTTGTTTGTAACCAATGCTCTGACCGACATGTATGCAAAATGTGGTTGCTTTCGTTCTGCTCGAAACGTCTTTAACACTTCCCATAAAGATGAAGTTTCTTATAACATATTAATTACAGGATATTCCGAAACAAACGATTGCTTGGAGTCTCTGAATTTGTTCTCAGAAATGAGGCTGCTTGGGTATGGAATGATAGGAGAGTTGGAAACTGCAATTAATATGTTTGAAGCAATGAGGGATGATAAAGTGCAATATGATTTAGTTTCATATATTGCAGTTCTGTCAGCTTGTAGTCATGGAGGACTAGTTGAACGTGGTTGGCAATACTTCAGCGAGATGTTAGCTCAACATCTTGAACCCACTGAAGTGCACTATACATGTCTGGTGGATCTACTCGGGCGTGCTGGTTTCGTAGAAGAGGCAGCAGAGCTGATTCGGCGACTACCGATAGCGCCCGATTCAAATATTTGGGGAGCTCTACTTGGTGCTTGTCGAATTTACGGAAACGTTGAACTAGGGTGCAAGGCAGCAGAGCAGTTATTTGAGCTAAAGCCTCAGCATTGTGAATACTATAT
TCTTCTTGCAAACATGCATGCAGAAACAGGAAGATGGGATGAGGTAAACAGGATTAGGGAACTTATGAAGTCTAGAGGAGCGAAAAAGAGCCCTGGCTGTAGTTGGGTTCAGATTCATGACCAGCTGCATGCTTTTGTGGTTGATGATCGAGCAGAGGGATTTGAATCAGGTGGTTTACTGGCAGAATTCGTTTGA

Protein sequence

MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEGFESGGLLAEFV
Homology
BLAST of Csor.00g128750 vs. ExPASy Swiss-Prot
Match: Q9M9E2 (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 435.6 bits (1119), Expect = 1.1e-120
Identity = 235/692 (33.96%), Positives = 387/692 (55.92%), Query Frame = 0

Query: 27  RTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPES 86
           R   +  V + L+ LC   ++  +  +V++I  L+ +    V L  + +  + +F +   
Sbjct: 89  RVAVDEDVFVALVRLCEWKRAQEEGSKVYSIA-LSSMSSLGVELGNAFLAMFVRFGNLVD 148

Query: 87  FCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLETYNRMVRF-GVQLDDHTFPFVLK 146
              +F +  +  R  F WN L+  ++  G    + +  Y+RM+   GV+ D +TFP VL+
Sbjct: 149 AWYVFGKMSE--RNLFSWNVLVGGYAKQGYFD-EAMCLYHRMLWVGGVKPDVYTFPCVLR 208

Query: 147 ICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVS 206
            C    D+ +G EVH  V + G++ D+ V N L+ +Y  CG +  A+ +FD M  RD++S
Sbjct: 209 TCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDIIS 268

Query: 207 WNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYI 266
           WN +I     NG   E    +F M   S + P+L+++ S++     L D  + R IH Y+
Sbjct: 269 WNAMISGYFENGMCHEGLELFFAMRGLS-VDPDLMTLTSVISACELLGDRRLGRDIHAYV 328

Query: 267 VKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDAL 326
           +  G    ++ CN+L   Y   GS + + ++F  +  K+ VSW ++I+G  +      A+
Sbjct: 329 ITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKAI 388

Query: 327 DVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMY 386
           D +RMM     KP+ +T++++L     L     G E+H  +++    + + +AN+LI+MY
Sbjct: 389 DTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLINMY 448

Query: 387 AKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFT 446
           +K     +A  IFHN+  +N++SW ++IA   LN    EA+ F+  ++ +  +PNA+T T
Sbjct: 449 SKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQMKMT-LQPNAITLT 508

Query: 447 NVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKD 506
             L ACAR G L  GKEIHA  +R G+  D F+ NAL DMY +CG   +A + FN+  KD
Sbjct: 509 AALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCGRMNTAWSQFNSQKKD 568

Query: 507 EVSYNILITGYSETNDCLESLNLFSEMRLLGYGMIGELETAINMFEAMRDDKVQYDLVSY 566
             S+NIL+TGYSE                      G+    + +F+ M   +V+ D +++
Sbjct: 569 VTSWNILLTGYSER---------------------GQGSMVVELFDRMVKSRVRPDEITF 628

Query: 567 IAVLSACSHGGLVERGWQYFSEMLAQHLEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPI 626
           I++L  CS   +V +G  YFS+M    + P   HY C+VDLLGRAG ++EA + I+++P+
Sbjct: 629 ISLLCGCSKSQMVRQGLMYFSKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPV 688

Query: 627 APDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRI 686
            PD  +WGALL ACRI+  ++LG  +A+ +FEL  +   YYILL N++A+ G+W EV ++
Sbjct: 689 TPDPAVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKV 748

Query: 687 RELMKSRGAKKSPGCSWVQIHDQLHAFVVDDR 718
           R +MK  G     GCSWV++  ++HAF+ DD+
Sbjct: 749 RRMMKENGLTVDAGCSWVEVKGKVHAFLSDDK 753
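
The percentages in an HSP summary line are the match counts divided by the alignment length. A minimal sketch of how they could be recomputed from the summary line is given below; the function name `hsp_percentages` is hypothetical, and the pattern is written only for the line layout used in this report.

```python
import re

# Matches the HSP summary line of this report. "Posi?tives" also tolerates
# the misspelling "Postives" that appears in some rendered copies.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, length_a, positive, length_b = (int(g) for g in match.groups())
    return (
        round(100 * identical / length_a, 2),
        round(100 * positive / length_b, 2),
    )


line = "Identity = 235/692 (33.96%), Positives = 387/692 (55.92%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Q9M9E2 hit this gives 235/692 = 33.96% identity and 387/692 = 55.92% positives, in agreement with the reported values.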

BLAST of Csor.00g128750 vs. ExPASy Swiss-Prot
Match: Q9SN39 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=DOT4 PE=2 SV=1)

HSP 1 Score: 428.7 bits (1101), Expect = 1.3e-118
Identity = 243/684 (35.53%), Positives = 381/684 (55.70%), Query Frame = 0

Query: 37  NLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQ 96
           ++L LC +++SL+  KEV      NG +  S +L + L L Y      +    +F +   
Sbjct: 99  SVLQLCADSKSLKDGKEVDNFIRGNGFVIDS-NLGSKLSLMYTNCGDLKEASRVFDEV-- 158

Query: 97  NCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQLDDHTFPFVLKICSDSLDICK 156
               A  WN L+  + +A +G   G +  + +M+  GV++D +TF  V K  S    +  
Sbjct: 159 KIEKALFWNILM--NELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHG 218

Query: 157 GMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSV 216
           G ++HG + K GF     VGN+L+  Y     ++ A+KVFDEM+ERDV+SWN++I     
Sbjct: 219 GEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVS 278

Query: 217 NGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVT 276
           NG   +  + +  M L SGI+ +L +++S+    A      + R +H   VK        
Sbjct: 279 NGLAEKGLSVFVQM-LVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDR 338

Query: 277 SCNALVDAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAG 336
            CN L+D Y KCG + ++  VF E+ +++ VS+ S+I G A +G   +A+ +F  M + G
Sbjct: 339 FCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEG 398

Query: 337 TKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEAS 396
             P+  T++++L           GK +H +        D+F++N+L+DMYAK G   EA 
Sbjct: 399 ISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAE 458

Query: 397 SIFHNMDGRNIVSWNAMIANYVLNGVALEAIR-FVILLQESGERPNAVTFTNVLPACARS 456
            +F  M  ++I+SWN +I  Y  N  A EA+  F +LL+E    P+  T   VLPACA  
Sbjct: 459 LVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASL 518

Query: 457 GHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVF-NTSHKDEVSYNILI 516
                G+EIH   +R G  SD  V N+L DMYAKCG    A  +F + + KD VS+ ++I
Sbjct: 519 SAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMI 578

Query: 517 TGYSETNDCLESLNLFSEMRLLGYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACS 576
                                 GYGM G  + AI +F  MR   ++ D +S++++L ACS
Sbjct: 579 A---------------------GYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACS 638

Query: 577 HGGLVERGWQYFSEMLAQ-HLEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIW 636
           H GLV+ GW++F+ M  +  +EPT  HY C+VD+L R G + +A   I  +PI PD+ IW
Sbjct: 639 HSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIW 698

Query: 637 GALLGACRIYGNVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSR 696
           GALL  CRI+ +V+L  K AE++FEL+P++  YY+L+AN++AE  +W++V R+R+ +  R
Sbjct: 699 GALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQR 755

Query: 697 GAKKSPGCSWVQIHDQLHAFVVDD 717
           G +K+PGCSW++I  +++ FV  D
Sbjct: 759 GLRKNPGCSWIEIKGRVNIFVAGD 755

BLAST of Csor.00g128750 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 1.5e-114
Identity = 226/669 (33.78%), Positives = 376/669 (56.20%), Query Frame = 0

Query: 47  SLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNT 106
           SL++ +++  +   NGL          L+  + ++   +    +F     + +   L++T
Sbjct: 49  SLKELRQILPLVFKNGLYQEHF-FQTKLVSLFCRYGSVDEAARVFEPI--DSKLNVLYHT 108

Query: 107 LIRAHSIAGNGTLD-GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFK 166
           +++    A    LD  L+ + RM    V+   + F ++LK+C D  ++  G E+HG++ K
Sbjct: 109 MLK--GFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVK 168

Query: 167 LGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNY 226
            GF  D++    L  +Y  C  +N+A+KVFD M ERD+VSWNT++   S NG  R A   
Sbjct: 169 SGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEM 228

Query: 227 YFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYG 286
              M     ++P+ ++++S+LP  + L    + + IH Y ++ G DSLV    ALVD Y 
Sbjct: 229 VKSM-CEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYA 288

Query: 287 KCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISS 346
           KCGS++ + Q+FD ++E+N VSWNS+I+      + ++A+ +F+ M+D G KP  V++  
Sbjct: 289 KCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMG 348

Query: 347 ILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRN 406
            L    +L   + G+ IH  S+ +G + ++ + NSLI MY K      A+S+F  +  R 
Sbjct: 349 ALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRT 408

Query: 407 IVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHA 466
           +VSWNAMI  +  NG  ++A+ +   ++    +P+  T+ +V+ A A        K IH 
Sbjct: 409 LVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHG 468

Query: 467 MGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN-TSHKDEVSYNILITGYSETNDCLE 526
           + +R  L  ++FVT AL DMYAKCG    AR +F+  S +   ++N +I           
Sbjct: 469 VVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMID---------- 528

Query: 527 SLNLFSEMRLLGYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQY 586
                      GYG  G  + A+ +FE M+   ++ + V++++V+SACSH GLVE G + 
Sbjct: 529 -----------GYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKC 588

Query: 587 FSEMLAQH-LEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYG 646
           F  M   + +E +  HY  +VDLLGRAG + EA + I ++P+ P  N++GA+LGAC+I+ 
Sbjct: 589 FYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHK 648

Query: 647 NVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWV 706
           NV    KAAE+LFEL P    Y++LLAN++     W++V ++R  M  +G +K+PGCS V
Sbjct: 649 NVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMV 690

Query: 707 QIHDQLHAF 713
           +I +++H+F
Sbjct: 709 EIKNEVHSF 690

BLAST of Csor.00g128750 vs. ExPASy Swiss-Prot
Match: Q9STE1 (Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E36 PE=3 SV=1)

HSP 1 Score: 398.7 bits (1023), Expect = 1.5e-109
Identity = 222/652 (34.05%), Positives = 356/652 (54.60%), Query Frame = 0

Query: 70  LCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRM 129
           + +SLI  Y ++   +    LF + +Q  +   +WN ++  +  A  G LD  ++ ++ M
Sbjct: 175 VASSLIKAYLEYGKIDVPSKLFDRVLQ--KDCVIWNVMLNGY--AKCGALDSVIKGFSVM 234

Query: 130 VRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFL 189
               +  +  TF  VL +C+  L I  G+++HG+V   G D +  + N+LL +Y  CG  
Sbjct: 235 RMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRF 294

Query: 190 NDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPI 249
           +DA K+F  MS  D V+WN +I     +G   E+  +++ M + SG+ P+ ++  SLLP 
Sbjct: 295 DDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEM-ISSGVLPDAITFSSLLPS 354

Query: 250 SAGLEDEEMTRRIHCYIVK--VGLDSLVTSCNALVDAYGKCGSVKASWQVFDEIIEKNEV 309
            +  E+ E  ++IHCYI++  + LD  +TS  AL+DAY KC  V  +  +F +    + V
Sbjct: 355 VSKFENLEYCKQIHCYIMRHSISLDIFLTS--ALIDAYFKCRGVSMAQNIFSQCNSVDVV 414

Query: 310 SWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFS 369
            + ++I+G    G + D+L++FR ++     PN +T+ SILPV   L   K G+E+HGF 
Sbjct: 415 VFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFI 474

Query: 370 MRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAI 429
           ++ G +    I  ++IDMYAK G    A  IF  +  R+IVSWN+MI     +     AI
Sbjct: 475 IKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAI 534

Query: 430 RFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMY 489
                +  SG   + V+ +  L ACA       GK IH   ++  L SD++  + L DMY
Sbjct: 535 DIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVYSESTLIDMY 594

Query: 490 AKCGCFRSARNVFNT-SHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGYGMIGELET 549
           AKCG  ++A NVF T   K+ VS+N +I          +SL LF EM             
Sbjct: 595 AKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEM------------- 654

Query: 550 AINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQH-LEPTEVHYTCLV 609
                  +    ++ D ++++ ++S+C H G V+ G ++F  M   + ++P + HY C+V
Sbjct: 655 -------VEKSGIRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVV 714

Query: 610 DLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCE 669
           DL GRAG + EA E ++ +P  PD+ +WG LLGACR++ NVEL   A+ +L +L P +  
Sbjct: 715 DLFGRAGRLTEAYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSG 774

Query: 670 YYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDD 717
           YY+L++N HA    W+ V ++R LMK R  +K PG SW++I+ + H FV  D
Sbjct: 775 YYVLISNAHANAREWESVTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGD 799

BLAST of Csor.00g128750 vs. ExPASy Swiss-Prot
Match: Q9LUJ2 (Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H56 PE=3 SV=1)

HSP 1 Score: 398.3 bits (1022), Expect = 1.9e-109
Identity = 228/710 (32.11%), Positives = 373/710 (52.54%), Query Frame = 0

Query: 44  NAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPE--SFCTLFHQTVQNCRTA 103
           N +++ + K  H      G L + VS    L+    +    E  SF     +  ++  T 
Sbjct: 41  NCKTIDELKMFHRSLTKQG-LDNDVSTITKLVARSCELGTRESLSFAKEVFENSESYGTC 100

Query: 104 FLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHG 163
           F++N+LIR ++ +G    + +  + RM+  G+  D +TFPF L  C+ S     G+++HG
Sbjct: 101 FMYNSLIRGYASSGLCN-EAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNGIQIHG 160

Query: 164 VVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYRE 223
           ++ K+G+  D++V N+L+  Y  CG L+ A+KVFDEMSER+VVSW ++I   +     ++
Sbjct: 161 LIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKD 220

Query: 224 ARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALV 283
           A + +F M     + PN V+++ ++   A LED E   +++ +I   G++      +ALV
Sbjct: 221 AVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALV 280

Query: 284 DAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSV 343
           D Y KC ++  + ++FDE    N    N++ +    +G  R+AL VF +M+D+G +P+ +
Sbjct: 281 DMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDRI 340

Query: 344 TISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNM 403
           ++ S +    +L     GK  HG+ +R G E+   I N+LIDMY K      A  IF  M
Sbjct: 341 SMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIFDRM 400

Query: 404 DGRNIVSWNAMIANYVLNG-------------------------------VALEAIR-FV 463
             + +V+WN+++A YV NG                               +  EAI  F 
Sbjct: 401 SNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVFC 460

Query: 464 ILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKC 523
            +  + G   + VT  ++  AC   G L   K I+    + G+  D+ +   L DM+++C
Sbjct: 461 SMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMFSRC 520

Query: 524 GCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGYGMIGELETAINM 583
           G   SA ++FN+    +VS      G                       M G  E AI +
Sbjct: 521 GDPESAMSIFNSLTNRDVSAWTAAIG--------------------AMAMAGNAERAIEL 580

Query: 584 FEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQH-LEPTEVHYTCLVDLLG 643
           F+ M +  ++ D V+++  L+ACSHGGLV++G + F  ML  H + P +VHY C+VDLLG
Sbjct: 581 FDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLG 640

Query: 644 RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCEYYIL 703
           RAG +EEA +LI  +P+ P+  IW +LL ACR+ GNVE+   AAE++  L P+    Y+L
Sbjct: 641 RAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVL 700

Query: 704 LANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRA 719
           L+N++A  GRW+++ ++R  MK +G +K PG S +QI  + H F   D +
Sbjct: 701 LSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDES 728

BLAST of Csor.00g128750 vs. NCBI nr
Match: KAG6570451.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1493 bits (3866), Expect = 0.0
Identity = 731/731 (100.00%), Positives = 731/731 (100.00%), Query Frame = 0

Query: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60
           MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL
Sbjct: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60

Query: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120
           NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD
Sbjct: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120

Query: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180
           GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM
Sbjct: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180

Query: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240
           LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV
Sbjct: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240

Query: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300
           SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI
Sbjct: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300

Query: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360
           IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK
Sbjct: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360

Query: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420
           EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG
Sbjct: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420

Query: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480
           VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN
Sbjct: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480

Query: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGYGMI 540
           ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGYGMI
Sbjct: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGYGMI 540

Query: 541 GELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEVHY 600
           GELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEVHY
Sbjct: 541 GELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEVHY 600

Query: 601 TCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKP 660
           TCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKP
Sbjct: 601 TCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKP 660

Query: 661 QHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG 720
           QHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG
Sbjct: 661 QHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG 720

Query: 721 FESGGLLAEFV 731
           FESGGLLAEFV
Sbjct: 721 FESGGLLAEFV 731

BLAST of Csor.00g128750 vs. NCBI nr
Match: XP_022944177.1 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 1449 bits (3751), Expect = 0.0
Identity = 727/811 (89.64%), Positives = 728/811 (89.77%), Query Frame = 0

Query: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60
           MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL
Sbjct: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60

Query: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120
           NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD
Sbjct: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120

Query: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180
           GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM
Sbjct: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180

Query: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240
           LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV
Sbjct: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240

Query: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300
           SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAY KCGSVKASWQVFDEI
Sbjct: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI 300

Query: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360
           IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK
Sbjct: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360

Query: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420
           EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG
Sbjct: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420

Query: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480
           VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN
Sbjct: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480

Query: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL----- 540
           ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL     
Sbjct: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPD 540

Query: 541 ------------------------------------------------------------ 600
                                                                       
Sbjct: 541 VVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFN 600

Query: 601 ---------------GYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER 660
                          GYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER
Sbjct: 601 QILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER 660

Query: 661 GWQYFSEMLAQHLEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR 720
           GWQY SEMLAQHLEPTE+HYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR
Sbjct: 661 GWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR 720

Query: 721 IYGNVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGC 731
           IYGNVELGCKAAEQLFELKPQHC YYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGC
Sbjct: 721 IYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGC 780

BLAST of Csor.00g128750 vs. NCBI nr
Match: KAG6570435.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1424 bits (3686), Expect = 0.0
Identity = 715/811 (88.16%), Positives = 719/811 (88.66%), Query Frame = 0

Query: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60
           MLQFCIRS RFHFAQIARF FRN+VR TE  SSVHINLLTLCFNAQSLRQTKEVHAICLL
Sbjct: 1   MLQFCIRSIRFHFAQIARFHFRNYVRSTEQISSVHINLLTLCFNAQSLRQTKEVHAICLL 60

Query: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120
           NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD
Sbjct: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120

Query: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180
           GLETYNRMVR GVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDS VYVGNTLLM
Sbjct: 121 GLETYNRMVRLGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSHVYVGNTLLM 180

Query: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240
           LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV
Sbjct: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240

Query: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300
           SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVK SWQVFDEI
Sbjct: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKTSWQVFDEI 300

Query: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360
           IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK
Sbjct: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360

Query: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420
           EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG
Sbjct: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420

Query: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480
           V+LEAIRFVILLQESGERPNAVTFTNVLPACAR GHLGPGKEIHAMGVRLGLTSDLFVTN
Sbjct: 421 VSLEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGPGKEIHAMGVRLGLTSDLFVTN 480

Query: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG---- 540
           ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG    
Sbjct: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGKKPD 540

Query: 541 ------------------------------------------------------------ 600
                                                                       
Sbjct: 541 VVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFN 600

Query: 601 ----------------YGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER 660
                           YGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER
Sbjct: 601 QILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER 660

Query: 661 GWQYFSEMLAQHLEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR 720
           GWQYFSEMLAQHLEPTE+HYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR
Sbjct: 661 GWQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR 720

Query: 721 IYGNVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGC 731
           IYGNVELGCKAAE LFELKPQHC YYILLAN+HAETGRWDEVNRIRELMKSRGAKKSPGC
Sbjct: 721 IYGNVELGCKAAEHLFELKPQHCGYYILLANIHAETGRWDEVNRIRELMKSRGAKKSPGC 780

BLAST of Csor.00g128750 vs. NCBI nr
Match: KAG6570443.1 (Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1421 bits (3678), Expect = 0.0
Identity = 702/736 (95.38%), Positives = 706/736 (95.92%), Query Frame = 0

Query: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60
           MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL
Sbjct: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60

Query: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120
           NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD
Sbjct: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120

Query: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180
           GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM
Sbjct: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180

Query: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240
           LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV
Sbjct: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240

Query: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300
           SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI
Sbjct: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300

Query: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360
           IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK
Sbjct: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360

Query: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420
           EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG
Sbjct: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420

Query: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480
           VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN
Sbjct: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480

Query: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG---- 540
           ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG    
Sbjct: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGKKPD 540

Query: 541 -YGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEP 600
               +G +    N+    +             VLSACSHGGLVERGWQYFSEMLAQHLEP
Sbjct: 541 VVSFMGVISACANLAAVKQ-------------VLSACSHGGLVERGWQYFSEMLAQHLEP 600

Query: 601 TEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQL 660
           TEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQL
Sbjct: 601 TEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQL 660

Query: 661 FELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVD 720
           FELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVD
Sbjct: 661 FELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVD 720

Query: 721 DRAEGFESGGLLAEFV 731
           DRAEGFESGGLLAEFV
Sbjct: 721 DRAEGFESGGLLAEFV 723

BLAST of Csor.00g128750 vs. NCBI nr
Match: XP_022985648.1 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 1417 bits (3667), Expect = 0.0
Identity = 709/811 (87.42%), Positives = 719/811 (88.66%), Query Frame = 0

Query: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60
           MLQFCIRS RFHFAQIARFQFRN+VR TEPNSSVHINLLTLCFN+QSLRQTKEVHAICLL
Sbjct: 1   MLQFCIRSIRFHFAQIARFQFRNYVRSTEPNSSVHINLLTLCFNSQSLRQTKEVHAICLL 60

Query: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120
           NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT FLWNTLIRAHSIAGNGTLD
Sbjct: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTTFLWNTLIRAHSIAGNGTLD 120

Query: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180
           GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM
Sbjct: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180

Query: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240
           LYGNCGFLN AKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV
Sbjct: 181 LYGNCGFLNGAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240

Query: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300
           SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI
Sbjct: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300

Query: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360
           IEKNEVSWNSIINGLAFKGHF DAL+VFRMMIDAGTKPNSVTISSILPVFVELECFKAGK
Sbjct: 301 IEKNEVSWNSIINGLAFKGHFWDALEVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360

Query: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420
           EIHGFSMRMGTETDLFIANSLIDMYAKSGH TEASSIFHNMDGRNIVSWNAMIANY LNG
Sbjct: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHLTEASSIFHNMDGRNIVSWNAMIANYALNG 420

Query: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480
           VALEAIRFVILLQESGERPNAVTFTNVLPACAR GHLGPGKEIHAMGVRLGLTSDLFVTN
Sbjct: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGPGKEIHAMGVRLGLTSDLFVTN 480

Query: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG---- 540
           ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG    
Sbjct: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGKKPD 540

Query: 541 ------------------------------------------------------------ 600
                                                                       
Sbjct: 541 VVSFMGVLSACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFN 600

Query: 601 ----------------YGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER 660
                           YGMIGELETAINMFEAMRDDKVQYD+VSYIAVLSACSHGGLVER
Sbjct: 601 QILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQYDVVSYIAVLSACSHGGLVER 660

Query: 661 GWQYFSEMLAQHLEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR 720
           G QYFSEMLAQHLEPTE+HYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR
Sbjct: 661 GCQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR 720

Query: 721 IYGNVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGC 731
           IYGN++LGCKAAE LFELKPQHC YYILL+NM+AETGRWD+VNRIRELMKSRGAKKSPGC
Sbjct: 721 IYGNIDLGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDDVNRIRELMKSRGAKKSPGC 780

BLAST of Csor.00g128750 vs. ExPASy TrEMBL
Match: A0A6J1FV31 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448704 PE=4 SV=1)

HSP 1 Score: 1449 bits (3751), Expect = 0.0
Identity = 727/811 (89.64%), Positives = 728/811 (89.77%), Query Frame = 0

Query: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60
           MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL
Sbjct: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60

Query: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120
           NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD
Sbjct: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120

Query: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180
           GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM
Sbjct: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180

Query: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240
           LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV
Sbjct: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240

Query: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300
           SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAY KCGSVKASWQVFDEI
Sbjct: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI 300

Query: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360
           IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK
Sbjct: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360

Query: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420
           EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG
Sbjct: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420

Query: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480
           VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN
Sbjct: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480

Query: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL----- 540
           ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL     
Sbjct: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLRKKPD 540

Query: 541 ------------------------------------------------------------ 600
                                                                       
Sbjct: 541 VVSFMGVISACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFN 600

Query: 601 ---------------GYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER 660
                          GYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER
Sbjct: 601 QILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER 660

Query: 661 GWQYFSEMLAQHLEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR 720
           GWQY SEMLAQHLEPTE+HYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR
Sbjct: 661 GWQYLSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR 720

Query: 721 IYGNVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGC 731
           IYGNVELGCKAAEQLFELKPQHC YYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGC
Sbjct: 721 IYGNVELGCKAAEQLFELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGC 780

BLAST of Csor.00g128750 vs. ExPASy TrEMBL
Match: A0A6J1JE86 (pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483644 PE=4 SV=1)

HSP 1 Score: 1417 bits (3667), Expect = 0.0
Identity = 709/811 (87.42%), Positives = 719/811 (88.66%), Query Frame = 0

Query: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60
           MLQFCIRS RFHFAQIARFQFRN+VR TEPNSSVHINLLTLCFN+QSLRQTKEVHAICLL
Sbjct: 1   MLQFCIRSIRFHFAQIARFQFRNYVRSTEPNSSVHINLLTLCFNSQSLRQTKEVHAICLL 60

Query: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120
           NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRT FLWNTLIRAHSIAGNGTLD
Sbjct: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTTFLWNTLIRAHSIAGNGTLD 120

Query: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180
           GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM
Sbjct: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180

Query: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240
           LYGNCGFLN AKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV
Sbjct: 181 LYGNCGFLNGAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240

Query: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300
           SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI
Sbjct: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300

Query: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360
           IEKNEVSWNSIINGLAFKGHF DAL+VFRMMIDAGTKPNSVTISSILPVFVELECFKAGK
Sbjct: 301 IEKNEVSWNSIINGLAFKGHFWDALEVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360

Query: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420
           EIHGFSMRMGTETDLFIANSLIDMYAKSGH TEASSIFHNMDGRNIVSWNAMIANY LNG
Sbjct: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHLTEASSIFHNMDGRNIVSWNAMIANYALNG 420

Query: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480
           VALEAIRFVILLQESGERPNAVTFTNVLPACAR GHLGPGKEIHAMGVRLGLTSDLFVTN
Sbjct: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGPGKEIHAMGVRLGLTSDLFVTN 480

Query: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG---- 540
           ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG    
Sbjct: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGKKPD 540

Query: 541 ------------------------------------------------------------ 600
                                                                       
Sbjct: 541 VVSFMGVLSACANLAAVKQGKEIHGVALRNHLNSHLFVSNSLLDFYTKCGRIDLACKIFN 600

Query: 601 ----------------YGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVER 660
                           YGMIGELETAINMFEAMRDDKVQYD+VSYIAVLSACSHGGLVER
Sbjct: 601 QILFKDVASWNTMILGYGMIGELETAINMFEAMRDDKVQYDVVSYIAVLSACSHGGLVER 660

Query: 661 GWQYFSEMLAQHLEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR 720
           G QYFSEMLAQHLEPTE+HYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR
Sbjct: 661 GCQYFSEMLAQHLEPTEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACR 720

Query: 721 IYGNVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGC 731
           IYGN++LGCKAAE LFELKPQHC YYILL+NM+AETGRWD+VNRIRELMKSRGAKKSPGC
Sbjct: 721 IYGNIDLGCKAAEHLFELKPQHCGYYILLSNMYAETGRWDDVNRIRELMKSRGAKKSPGC 780

BLAST of Csor.00g128750 vs. ExPASy TrEMBL
Match: A0A6J1FXS7 (pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111448704 PE=4 SV=1)

HSP 1 Score: 1409 bits (3647), Expect = 0.0
Identity = 697/731 (95.35%), Positives = 702/731 (96.03%), Query Frame = 0

Query: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60
           MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL
Sbjct: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60

Query: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120
           NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD
Sbjct: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120

Query: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180
           GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM
Sbjct: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180

Query: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240
           LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV
Sbjct: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240

Query: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300
           SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAY KCGSVKASWQVFDEI
Sbjct: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI 300

Query: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360
           IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK
Sbjct: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360

Query: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420
           EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG
Sbjct: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420

Query: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480
           VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN
Sbjct: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480

Query: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGYGMI 540
           ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL     
Sbjct: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLL----- 540

Query: 541 GELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEPTEVHY 600
                  ++   M       +L +   VLSACSHGGLVERGWQY SEMLAQHLEPTE+HY
Sbjct: 541 ---RKKPDVVSFMGVISACANLAAVKQVLSACSHGGLVERGWQYLSEMLAQHLEPTEMHY 600

Query: 601 TCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKP 660
           TCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKP
Sbjct: 601 TCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKP 660

Query: 661 QHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG 720
           QHC YYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG
Sbjct: 661 QHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRAEG 720

Query: 721 FESGGLLAEFV 731
           FESGGLLAEFV
Sbjct: 721 FESGGLLAEFV 723

BLAST of Csor.00g128750 vs. ExPASy TrEMBL
Match: A0A6J1FW82 (pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448703 PE=4 SV=1)

HSP 1 Score: 1399 bits (3621), Expect = 0.0
Identity = 691/736 (93.89%), Positives = 697/736 (94.70%), Query Frame = 0

Query: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60
           MLQFCIRS RFHFAQIARFQFRN+VR TEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL
Sbjct: 1   MLQFCIRSIRFHFAQIARFQFRNYVRSTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60

Query: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120
           NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD
Sbjct: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120

Query: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180
           GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM
Sbjct: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180

Query: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240
           LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV
Sbjct: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240

Query: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300
           SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVK SWQVFDEI
Sbjct: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKTSWQVFDEI 300

Query: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360
           IEKNEVSWNSIINGLAFKGHF DALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK
Sbjct: 301 IEKNEVSWNSIINGLAFKGHFWDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360

Query: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420
           EIHGFSMRMGTETDLF ANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANY LNG
Sbjct: 361 EIHGFSMRMGTETDLFTANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYALNG 420

Query: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480
           VALEAIRFVILLQESGERPNAVTFTNVLPACAR GHLGPGKEIHAMGVRLGLTSDLFVTN
Sbjct: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGPGKEIHAMGVRLGLTSDLFVTN 480

Query: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG---- 540
           ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG    
Sbjct: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGKKPD 540

Query: 541 -YGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEP 600
               +G +    N+    +             VLSACSHGGLVERGWQYFSEMLAQHLEP
Sbjct: 541 VVSFMGVISACANLAAVKQ-------------VLSACSHGGLVERGWQYFSEMLAQHLEP 600

Query: 601 TEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQL 660
           TE+HYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAE L
Sbjct: 601 TEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHL 660

Query: 661 FELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVD 720
           FELKPQHC YYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVD
Sbjct: 661 FELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVD 720

Query: 721 DRAEGFESGGLLAEFV 731
           DRAEGFESGGLLAEFV
Sbjct: 721 DRAEGFESGGLLAEFV 723

BLAST of Csor.00g128750 vs. ExPASy TrEMBL
Match: A0A6J1FYI3 (pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111448703 PE=4 SV=1)

HSP 1 Score: 1397 bits (3616), Expect = 0.0
Identity = 691/736 (93.89%), Positives = 697/736 (94.70%), Query Frame = 0

Query: 1   MLQFCIRSFRFHFAQIARFQFRNFVRRTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60
           MLQFCIRS RFHFAQIARFQFRN+VR TEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL
Sbjct: 1   MLQFCIRSIRFHFAQIARFQFRNYVRSTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLL 60

Query: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120
           NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD
Sbjct: 61  NGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLD 120

Query: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180
           GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM
Sbjct: 121 GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLM 180

Query: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240
           LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV
Sbjct: 181 LYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLV 240

Query: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEI 300
           SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAY KCGSVKASWQVFDEI
Sbjct: 241 SVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYWKCGSVKASWQVFDEI 300

Query: 301 IEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360
           IEKNEVSWNSIINGLAFKGHF DALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK
Sbjct: 301 IEKNEVSWNSIINGLAFKGHFWDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGK 360

Query: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420
           EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG
Sbjct: 361 EIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNG 420

Query: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTN 480
           VALEAIRFVILLQESGERPNAVTFTNVLPACAR GHLGPGKEIH MGVRLGLTSDLFVTN
Sbjct: 421 VALEAIRFVILLQESGERPNAVTFTNVLPACARLGHLGPGKEIHGMGVRLGLTSDLFVTN 480

Query: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG---- 540
           ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLG    
Sbjct: 481 ALTDMYAKCGCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGKKPD 540

Query: 541 -YGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQHLEP 600
               +G +    N+    +             VLSACSHGGLVERGWQY SEMLAQHLEP
Sbjct: 541 VVSFMGVISACANLAAVKQ-------------VLSACSHGGLVERGWQYLSEMLAQHLEP 600

Query: 601 TEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQL 660
           TE+HYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAE L
Sbjct: 601 TEMHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEHL 660

Query: 661 FELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVD 720
           FELKPQHC YYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVD
Sbjct: 661 FELKPQHCGYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVD 720

Query: 721 DRAEGFESGGLLAEFV 731
           DRAEGFESGGLLAEFV
Sbjct: 721 DRAEGFESGGLLAEFV 723

BLAST of Csor.00g128750 vs. TAIR 10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 435.6 bits (1119), Expect = 7.6e-122
Identity = 235/692 (33.96%), Positives = 387/692 (55.92%), Query Frame = 0

Query: 27  RTEPNSSVHINLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPES 86
           R   +  V + L+ LC   ++  +  +V++I  L+ +    V L  + +  + +F +   
Sbjct: 89  RVAVDEDVFVALVRLCEWKRAQEEGSKVYSIA-LSSMSSLGVELGNAFLAMFVRFGNLVD 148

Query: 87  FCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDGLETYNRMVRF-GVQLDDHTFPFVLK 146
              +F +  +  R  F WN L+  ++  G    + +  Y+RM+   GV+ D +TFP VL+
Sbjct: 149 AWYVFGKMSE--RNLFSWNVLVGGYAKQGYFD-EAMCLYHRMLWVGGVKPDVYTFPCVLR 208

Query: 147 ICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVS 206
            C    D+ +G EVH  V + G++ D+ V N L+ +Y  CG +  A+ +FD M  RD++S
Sbjct: 209 TCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDIIS 268

Query: 207 WNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYI 266
           WN +I     NG   E    +F M   S + P+L+++ S++     L D  + R IH Y+
Sbjct: 269 WNAMISGYFENGMCHEGLELFFAMRGLS-VDPDLMTLTSVISACELLGDRRLGRDIHAYV 328

Query: 267 VKVGLDSLVTSCNALVDAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDAL 326
           +  G    ++ CN+L   Y   GS + + ++F  +  K+ VSW ++I+G  +      A+
Sbjct: 329 ITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKAI 388

Query: 327 DVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMY 386
           D +RMM     KP+ +T++++L     L     G E+H  +++    + + +AN+LI+MY
Sbjct: 389 DTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLINMY 448

Query: 387 AKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFT 446
           +K     +A  IFHN+  +N++SW ++IA   LN    EA+ F+  ++ +  +PNA+T T
Sbjct: 449 SKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQMKMT-LQPNAITLT 508

Query: 447 NVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFNTSHKD 506
             L ACAR G L  GKEIHA  +R G+  D F+ NAL DMY +CG   +A + FN+  KD
Sbjct: 509 AALAACARIGALMCGKEIHAHVLRTGVGLDDFLPNALLDMYVRCGRMNTAWSQFNSQKKD 568

Query: 507 EVSYNILITGYSETNDCLESLNLFSEMRLLGYGMIGELETAINMFEAMRDDKVQYDLVSY 566
             S+NIL+TGYSE                      G+    + +F+ M   +V+ D +++
Sbjct: 569 VTSWNILLTGYSER---------------------GQGSMVVELFDRMVKSRVRPDEITF 628

Query: 567 IAVLSACSHGGLVERGWQYFSEMLAQHLEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPI 626
           I++L  CS   +V +G  YFS+M    + P   HY C+VDLLGRAG ++EA + I+++P+
Sbjct: 629 ISLLCGCSKSQMVRQGLMYFSKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPV 688

Query: 627 APDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRI 686
            PD  +WGALL ACRI+  ++LG  +A+ +FEL  +   YYILL N++A+ G+W EV ++
Sbjct: 689 TPDPAVWGALLNACRIHHKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKV 748

Query: 687 RELMKSRGAKKSPGCSWVQIHDQLHAFVVDDR 718
           R +MK  G     GCSWV++  ++HAF+ DD+
Sbjct: 749 RRMMKENGLTVDAGCSWVEVKGKVHAFLSDDK 753

BLAST of Csor.00g128750 vs. TAIR 10
Match: AT4G18750.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 428.7 bits (1101), Expect = 9.3e-120
Identity = 243/684 (35.53%), Positives = 381/684 (55.70%), Query Frame = 0

Query: 37  NLLTLCFNAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQ 96
           ++L LC +++SL+  KEV      NG +  S +L + L L Y      +    +F +   
Sbjct: 99  SVLQLCADSKSLKDGKEVDNFIRGNGFVIDS-NLGSKLSLMYTNCGDLKEASRVFDEV-- 158

Query: 97  NCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRMVRFGVQLDDHTFPFVLKICSDSLDICK 156
               A  WN L+  + +A +G   G +  + +M+  GV++D +TF  V K  S    +  
Sbjct: 159 KIEKALFWNILM--NELAKSGDFSGSIGLFKKMMSSGVEMDSYTFSCVSKSFSSLRSVHG 218

Query: 157 GMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSV 216
           G ++HG + K GF     VGN+L+  Y     ++ A+KVFDEM+ERDV+SWN++I     
Sbjct: 219 GEQLHGFILKSGFGERNSVGNSLVAFYLKNQRVDSARKVFDEMTERDVISWNSIINGYVS 278

Query: 217 NGDYREARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVT 276
           NG   +  + +  M L SGI+ +L +++S+    A      + R +H   VK        
Sbjct: 279 NGLAEKGLSVFVQM-LVSGIEIDLATIVSVFAGCADSRLISLGRAVHSIGVKACFSREDR 338

Query: 277 SCNALVDAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAG 336
            CN L+D Y KCG + ++  VF E+ +++ VS+ S+I G A +G   +A+ +F  M + G
Sbjct: 339 FCNTLLDMYSKCGDLDSAKAVFREMSDRSVVSYTSMIAGYAREGLAGEAVKLFEEMEEEG 398

Query: 337 TKPNSVTISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEAS 396
             P+  T++++L           GK +H +        D+F++N+L+DMYAK G   EA 
Sbjct: 399 ISPDVYTVTAVLNCCARYRLLDEGKRVHEWIKENDLGFDIFVSNALMDMYAKCGSMQEAE 458

Query: 397 SIFHNMDGRNIVSWNAMIANYVLNGVALEAIR-FVILLQESGERPNAVTFTNVLPACARS 456
            +F  M  ++I+SWN +I  Y  N  A EA+  F +LL+E    P+  T   VLPACA  
Sbjct: 459 LVFSEMRVKDIISWNTIIGGYSKNCYANEALSLFNLLLEEKRFSPDERTVACVLPACASL 518

Query: 457 GHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVF-NTSHKDEVSYNILI 516
                G+EIH   +R G  SD  V N+L DMYAKCG    A  +F + + KD VS+ ++I
Sbjct: 519 SAFDKGREIHGYIMRNGYFSDRHVANSLVDMYAKCGALLLAHMLFDDIASKDLVSWTVMI 578

Query: 517 TGYSETNDCLESLNLFSEMRLLGYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACS 576
                                 GYGM G  + AI +F  MR   ++ D +S++++L ACS
Sbjct: 579 A---------------------GYGMHGFGKEAIALFNQMRQAGIEADEISFVSLLYACS 638

Query: 577 HGGLVERGWQYFSEMLAQ-HLEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIW 636
           H GLV+ GW++F+ M  +  +EPT  HY C+VD+L R G + +A   I  +PI PD+ IW
Sbjct: 639 HSGLVDEGWRFFNIMRHECKIEPTVEHYACIVDMLARTGDLIKAYRFIENMPIPPDATIW 698

Query: 637 GALLGACRIYGNVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSR 696
           GALL  CRI+ +V+L  K AE++FEL+P++  YY+L+AN++AE  +W++V R+R+ +  R
Sbjct: 699 GALLCGCRIHHDVKLAEKVAEKVFELEPENTGYYVLMANIYAEAEKWEQVKRLRKRIGQR 755

Query: 697 GAKKSPGCSWVQIHDQLHAFVVDD 717
           G +K+PGCSW++I  +++ FV  D
Sbjct: 759 GLRKNPGCSWIEIKGRVNIFVAGD 755

BLAST of Csor.00g128750 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 415.2 bits (1066), Expect = 1.1e-115
Identity = 226/669 (33.78%), Positives = 376/669 (56.20%), Query Frame = 0

Query: 47  SLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNT 106
           SL++ +++  +   NGL          L+  + ++   +    +F     + +   L++T
Sbjct: 49  SLKELRQILPLVFKNGLYQEHF-FQTKLVSLFCRYGSVDEAARVFEPI--DSKLNVLYHT 108

Query: 107 LIRAHSIAGNGTLD-GLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFK 166
           +++    A    LD  L+ + RM    V+   + F ++LK+C D  ++  G E+HG++ K
Sbjct: 109 MLK--GFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIHGLLVK 168

Query: 167 LGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNY 226
            GF  D++    L  +Y  C  +N+A+KVFD M ERD+VSWNT++   S NG  R A   
Sbjct: 169 SGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMARMALEM 228

Query: 227 YFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALVDAYG 286
              M     ++P+ ++++S+LP  + L    + + IH Y ++ G DSLV    ALVD Y 
Sbjct: 229 VKSM-CEENLKPSFITIVSVLPAVSALRLISVGKEIHGYAMRSGFDSLVNISTALVDMYA 288

Query: 287 KCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISS 346
           KCGS++ + Q+FD ++E+N VSWNS+I+      + ++A+ +F+ M+D G KP  V++  
Sbjct: 289 KCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTDVSVMG 348

Query: 347 ILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRN 406
            L    +L   + G+ IH  S+ +G + ++ + NSLI MY K      A+S+F  +  R 
Sbjct: 349 ALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRT 408

Query: 407 IVSWNAMIANYVLNGVALEAIRFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHA 466
           +VSWNAMI  +  NG  ++A+ +   ++    +P+  T+ +V+ A A        K IH 
Sbjct: 409 LVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHG 468

Query: 467 MGVRLGLTSDLFVTNALTDMYAKCGCFRSARNVFN-TSHKDEVSYNILITGYSETNDCLE 526
           + +R  L  ++FVT AL DMYAKCG    AR +F+  S +   ++N +I           
Sbjct: 469 VVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMID---------- 528

Query: 527 SLNLFSEMRLLGYGMIGELETAINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQY 586
                      GYG  G  + A+ +FE M+   ++ + V++++V+SACSH GLVE G + 
Sbjct: 529 -----------GYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKC 588

Query: 587 FSEMLAQH-LEPTEVHYTCLVDLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYG 646
           F  M   + +E +  HY  +VDLLGRAG + EA + I ++P+ P  N++GA+LGAC+I+ 
Sbjct: 589 FYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHK 648

Query: 647 NVELGCKAAEQLFELKPQHCEYYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWV 706
           NV    KAAE+LFEL P    Y++LLAN++     W++V ++R  M  +G +K+PGCS V
Sbjct: 649 NVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMV 690

Query: 707 QIHDQLHAF 713
           +I +++H+F
Sbjct: 709 EIKNEVHSF 690

BLAST of Csor.00g128750 vs. TAIR 10
Match: AT4G21300.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 398.7 bits (1023), Expect = 1.0e-110
Identity = 222/652 (34.05%), Positives = 356/652 (54.60%), Query Frame = 0

Query: 70  LCASLILNYAKFQHPESFCTLFHQTVQNCRTAFLWNTLIRAHSIAGNGTLDG-LETYNRM 129
           + +SLI  Y ++   +    LF + +Q  +   +WN ++  +  A  G LD  ++ ++ M
Sbjct: 175 VASSLIKAYLEYGKIDVPSKLFDRVLQ--KDCVIWNVMLNGY--AKCGALDSVIKGFSVM 234

Query: 130 VRFGVQLDDHTFPFVLKICSDSLDICKGMEVHGVVFKLGFDSDVYVGNTLLMLYGNCGFL 189
               +  +  TF  VL +C+  L I  G+++HG+V   G D +  + N+LL +Y  CG  
Sbjct: 235 RMDQISPNAVTFDCVLSVCASKLLIDLGVQLHGLVVVSGVDFEGSIKNSLLSMYSKCGRF 294

Query: 190 NDAKKVFDEMSERDVVSWNTVIGLLSVNGDYREARNYYFWMTLRSGIQPNLVSVISLLPI 249
           +DA K+F  MS  D V+WN +I     +G   E+  +++ M + SG+ P+ ++  SLLP 
Sbjct: 295 DDASKLFRMMSRADTVTWNCMISGYVQSGLMEESLTFFYEM-ISSGVLPDAITFSSLLPS 354

Query: 250 SAGLEDEEMTRRIHCYIVK--VGLDSLVTSCNALVDAYGKCGSVKASWQVFDEIIEKNEV 309
            +  E+ E  ++IHCYI++  + LD  +TS  AL+DAY KC  V  +  +F +    + V
Sbjct: 355 VSKFENLEYCKQIHCYIMRHSISLDIFLTS--ALIDAYFKCRGVSMAQNIFSQCNSVDVV 414

Query: 310 SWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSVTISSILPVFVELECFKAGKEIHGFS 369
            + ++I+G    G + D+L++FR ++     PN +T+ SILPV   L   K G+E+HGF 
Sbjct: 415 VFTAMISGYLHNGLYIDSLEMFRWLVKVKISPNEITLVSILPVIGILLALKLGRELHGFI 474

Query: 370 MRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNMDGRNIVSWNAMIANYVLNGVALEAI 429
           ++ G +    I  ++IDMYAK G    A  IF  +  R+IVSWN+MI     +     AI
Sbjct: 475 IKKGFDNRCNIGCAVIDMYAKCGRMNLAYEIFERLSKRDIVSWNSMITRCAQSDNPSAAI 534

Query: 430 RFVILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMY 489
                +  SG   + V+ +  L ACA       GK IH   ++  L SD++  + L DMY
Sbjct: 535 DIFRQMGVSGICYDCVSISAALSACANLPSESFGKAIHGFMIKHSLASDVYSESTLIDMY 594

Query: 490 AKCGCFRSARNVFNT-SHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGYGMIGELET 549
           AKCG  ++A NVF T   K+ VS+N +I          +SL LF EM             
Sbjct: 595 AKCGNLKAAMNVFKTMKEKNIVSWNSIIAACGNHGKLKDSLCLFHEM------------- 654

Query: 550 AINMFEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQH-LEPTEVHYTCLV 609
                  +    ++ D ++++ ++S+C H G V+ G ++F  M   + ++P + HY C+V
Sbjct: 655 -------VEKSGIRPDQITFLEIISSCCHVGDVDEGVRFFRSMTEDYGIQPQQEHYACVV 714

Query: 610 DLLGRAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCE 669
           DL GRAG + EA E ++ +P  PD+ +WG LLGACR++ NVEL   A+ +L +L P +  
Sbjct: 715 DLFGRAGRLTEAYETVKSMPFPPDAGVWGTLLGACRLHKNVELAEVASSKLMDLDPSNSG 774

Query: 670 YYILLANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDD 717
           YY+L++N HA    W+ V ++R LMK R  +K PG SW++I+ + H FV  D
Sbjct: 775 YYVLISNAHANAREWESVTKVRSLMKEREVQKIPGYSWIEINKRTHLFVSGD 799
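
The percentages in an HSP header are simply the identical (or positive-scoring) column counts divided by the alignment length. A minimal Python sketch, assuming the two-line header layout used on this page, recomputes them for the AT4G21300.1 hit. The function name `parse_hsp_header` and the regular expressions are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Header transcribed from the AT4G21300.1 hit; the layout is assumed to match
# this page only and is not a general BLAST report format.
hsp_header = (
    "HSP 1 Score: 398.7 bits (1023), Expect = 1.0e-110\n"
    "Identity = 222/652 (34.05%), Positives = 356/652 (54.60%), Query Frame = 0"
)

def parse_hsp_header(text):
    """Extract scores and recompute identity/positive percentages (hypothetical helper)."""
    score = re.search(r"Score:\s*([\d.]+) bits \((\d+)\)", text)
    expect = re.search(r"Expect = ([\d.eE+-]+)", text)
    ident = re.search(r"Identity = (\d+)/(\d+)", text)
    # The optional "i" also accepts the "Postives" spelling seen in some exports.
    posit = re.search(r"Posi?tives = (\d+)/(\d+)", text)
    n_ident, length = int(ident.group(1)), int(ident.group(2))
    n_posit = int(posit.group(1))
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect.group(1)),
        "alignment_length": length,
        "identity_pct": round(100 * n_ident / length, 2),
        "positives_pct": round(100 * n_posit / length, 2),
    }

hsp = parse_hsp_header(hsp_header)
print(hsp["identity_pct"], hsp["positives_pct"])
```

The recomputed values agree with the percentages printed in the header, which is a quick sanity check when transcribing alignment statistics by hand.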

BLAST of Csor.00g128750 vs. TAIR 10
Match: AT3G22690.1 (CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885); BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT2G29760.1); Has 49784 Blast hits to 14716 proteins in 280 species: Archae - 2; Bacteria - 10; Metazoa - 107; Fungi - 167; Plants - 48594; Viruses - 0; Other Eukaryotes - 904 (source: NCBI BLink). )

HSP 1 Score: 398.3 bits (1022), Expect = 1.3e-110
Identity = 228/710 (32.11%), Positives = 373/710 (52.54%), Query Frame = 0

Query: 44  NAQSLRQTKEVHAICLLNGLLPHSVSLCASLILNYAKFQHPE--SFCTLFHQTVQNCRTA 103
           N +++ + K  H      G L + VS    L+    +    E  SF     +  ++  T 
Sbjct: 41  NCKTIDELKMFHRSLTKQG-LDNDVSTITKLVARSCELGTRESLSFAKEVFENSESYGTC 100

Query: 104 FLWNTLIRAHSIAGNGTLDGLETYNRMVRFGVQLDDHTFPFVLKICSDSLDICKGMEVHG 163
           F++N+LIR ++ +G    + +  + RM+  G+  D +TFPF L  C+ S     G+++HG
Sbjct: 101 FMYNSLIRGYASSGLCN-EAILLFLRMMNSGISPDKYTFPFGLSACAKSRAKGNGIQIHG 160

Query: 164 VVFKLGFDSDVYVGNTLLMLYGNCGFLNDAKKVFDEMSERDVVSWNTVIGLLSVNGDYRE 223
           ++ K+G+  D++V N+L+  Y  CG L+ A+KVFDEMSER+VVSW ++I   +     ++
Sbjct: 161 LIVKMGYAKDLFVQNSLVHFYAECGELDSARKVFDEMSERNVVSWTSMICGYARRDFAKD 220

Query: 224 ARNYYFWMTLRSGIQPNLVSVISLLPISAGLEDEEMTRRIHCYIVKVGLDSLVTSCNALV 283
           A + +F M     + PN V+++ ++   A LED E   +++ +I   G++      +ALV
Sbjct: 221 AVDLFFRMVRDEEVTPNSVTMVCVISACAKLEDLETGEKVYAFIRNSGIEVNDLMVSALV 280

Query: 284 DAYGKCGSVKASWQVFDEIIEKNEVSWNSIINGLAFKGHFRDALDVFRMMIDAGTKPNSV 343
           D Y KC ++  + ++FDE    N    N++ +    +G  R+AL VF +M+D+G +P+ +
Sbjct: 281 DMYMKCNAIDVAKRLFDEYGASNLDLCNAMASNYVRQGLTREALGVFNLMMDSGVRPDRI 340

Query: 344 TISSILPVFVELECFKAGKEIHGFSMRMGTETDLFIANSLIDMYAKSGHSTEASSIFHNM 403
           ++ S +    +L     GK  HG+ +R G E+   I N+LIDMY K      A  IF  M
Sbjct: 341 SMLSAISSCSQLRNILWGKSCHGYVLRNGFESWDNICNALIDMYMKCHRQDTAFRIFDRM 400

Query: 404 DGRNIVSWNAMIANYVLNG-------------------------------VALEAIR-FV 463
             + +V+WN+++A YV NG                               +  EAI  F 
Sbjct: 401 SNKTVVTWNSIVAGYVENGEVDAAWETFETMPEKNIVSWNTIISGLVQGSLFEEAIEVFC 460

Query: 464 ILLQESGERPNAVTFTNVLPACARSGHLGPGKEIHAMGVRLGLTSDLFVTNALTDMYAKC 523
            +  + G   + VT  ++  AC   G L   K I+    + G+  D+ +   L DM+++C
Sbjct: 461 SMQSQEGVNADGVTMMSIASACGHLGALDLAKWIYYYIEKNGIQLDVRLGTTLVDMFSRC 520

Query: 524 GCFRSARNVFNTSHKDEVSYNILITGYSETNDCLESLNLFSEMRLLGYGMIGELETAINM 583
           G   SA ++FN+    +VS      G                       M G  E AI +
Sbjct: 521 GDPESAMSIFNSLTNRDVSAWTAAIG--------------------AMAMAGNAERAIEL 580

Query: 584 FEAMRDDKVQYDLVSYIAVLSACSHGGLVERGWQYFSEMLAQH-LEPTEVHYTCLVDLLG 643
           F+ M +  ++ D V+++  L+ACSHGGLV++G + F  ML  H + P +VHY C+VDLLG
Sbjct: 581 FDDMIEQGLKPDGVAFVGALTACSHGGLVQQGKEIFYSMLKLHGVSPEDVHYGCMVDLLG 640

Query: 644 RAGFVEEAAELIRRLPIAPDSNIWGALLGACRIYGNVELGCKAAEQLFELKPQHCEYYIL 703
           RAG +EEA +LI  +P+ P+  IW +LL ACR+ GNVE+   AAE++  L P+    Y+L
Sbjct: 641 RAGLLEEAVQLIEDMPMEPNDVIWNSLLAACRVQGNVEMAAYAAEKIQVLAPERTGSYVL 700

Query: 704 LANMHAETGRWDEVNRIRELMKSRGAKKSPGCSWVQIHDQLHAFVVDDRA 719
           L+N++A  GRW+++ ++R  MK +G +K PG S +QI  + H F   D +
Sbjct: 701 LSNVYASAGRWNDMAKVRLSMKEKGLRKPPGTSSIQIRGKTHEFTSGDES 728

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Q9M9E2          1.1e-120  33.96     Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...
Q9SN39          1.3e-118  35.53     Pentatricopeptide repeat-containing protein DOT4, chloroplastic OS=Arabidopsis t...
Q3E6Q1          1.5e-114  33.78     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Q9STE1          1.5e-109  34.05     Pentatricopeptide repeat-containing protein At4g21300 OS=Arabidopsis thaliana OX...
Q9LUJ2          1.9e-109  32.11     Pentatricopeptide repeat-containing protein At3g22690 OS=Arabidopsis thaliana OX...

Match Name      E-value   Identity  Description
KAG6570451.1    0.0       100.00    Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb...
XP_022944177.1  0.0       89.64     pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita...
KAG6570435.1    0.0       88.16     Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb...
KAG6570443.1    0.0       95.38     Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurb...
XP_022985648.1  0.0       87.42     pentatricopeptide repeat-containing protein At4g14170-like isoform X1 [Cucurbita...

Match Name      E-value   Identity  Description
A0A6J1FV31      0.0       89.64     pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbi...
A0A6J1JE86      0.0       87.42     pentatricopeptide repeat-containing protein At4g14170-like isoform X1 OS=Cucurbi...
A0A6J1FXS7      0.0       95.35     pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isofor...
A0A6J1FW82      0.0       93.89     pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isofor...
A0A6J1FYI3      0.0       93.89     pentatricopeptide repeat-containing protein At1g15510, chloroplastic-like isofor...

Match Name      E-value   Identity  Description
AT1G15510.1     7.6e-122  33.96     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G18750.1     9.3e-120  35.53     Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1     1.1e-115  33.78     Pentatricopeptide repeat (PPR) superfamily protein
AT4G21300.1     1.0e-110  34.05     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G22690.1     1.3e-110  32.11     CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012...

InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 379..406; e-value: 0.001; score: 19.2
    coord: 407..436; e-value: 0.4; score: 11.0
    coord: 204..226; e-value: 0.049; score: 13.9
    coord: 507..536; e-value: 4.6E-5; score: 23.4
    coord: 176..202; e-value: 9.1E-4; score: 19.3
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 507..536; e-value: 0.0028; score: 15.7
    coord: 563..596; e-value: 1.0E-4; score: 20.2
    coord: 276..304; e-value: 1.2E-4; score: 20.0
    coord: 176..204; e-value: 0.0015; score: 16.5
    coord: 306..339; e-value: 7.4E-7; score: 27.0
    coord: 379..406; e-value: 0.0028; score: 15.7
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 303..347; e-value: 4.0E-10; score: 39.8
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 202..237; score: 9.470621
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 304..338; score: 12.101333
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 100..135; score: 8.6266
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 374..408; score: 9.29524
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 171..201; score: 8.506026
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 561..595; score: 10.205028
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 505..539; score: 9.821383
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 358..463; e-value: 7.2E-18; score: 66.6
    coord: 464..536; e-value: 8.4E-11; score: 43.5
    coord: 255..357; e-value: 1.7E-20; score: 75.1
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 5..161; e-value: 7.9E-9; score: 37.4
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  1.25.40.10  Tetratricopeptide repeat domain
    coord: 537..715; e-value: 1.3E-29; score: 105.6
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  48452  TPR-like
    coord: 186..681
IPR033443  Pentacotripeptide-repeat region of PRORP  PFAM  PF17177  PPR_long
    coord: 542..685; e-value: 1.1E-6; score: 28.1
None  No IPR available  PANTHER  PTHR47924  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 27..149
None  No IPR available  PANTHER  PTHR47924  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 534..721
None  No IPR available  PANTHER  PTHR47924:SF44  BNAC05G13700D PROTEIN
    coord: 534..721
    coord: 229..533
    coord: 121..246
None  No IPR available  PANTHER  PTHR47924  PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN
    coord: 229..533
    coord: 121..246
None  No IPR available  PANTHER  PTHR47924:SF44  BNAC05G13700D PROTEIN
    coord: 27..149
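
The `coord: start..end` entries above are residue ranges on the predicted protein, and hits from one source can abut or overlap. As a rough sketch, assuming 1-based inclusive coordinates, the following Python merges the seven PROSITE PS51375 (PPR) ranges and counts the residues they cover. The helper names `parse_coords` and `merge_ranges` are hypothetical, and treating directly adjacent ranges as contiguous is a choice made for this illustration.

```python
import re

# PROSITE PS51375 (PPR) coordinates transcribed from the InterPro table above.
prosite_ppr = """
coord: 202..237
coord: 304..338
coord: 100..135
coord: 374..408
coord: 171..201
coord: 561..595
coord: 505..539
"""

def parse_coords(text):
    """Return sorted (start, end) tuples for every 'coord: a..b' entry (hypothetical helper)."""
    pairs = re.findall(r"coord:\s*(\d+)\.\.(\d+)", text)
    return sorted((int(a), int(b)) for a, b in pairs)

def merge_ranges(ranges):
    """Merge overlapping or directly adjacent inclusive ranges."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous range instead of opening a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

ranges = parse_coords(prosite_ppr)
merged = merge_ranges(ranges)
covered = sum(end - start + 1 for start, end in merged)
print(merged)
print(covered)
```

Under these assumptions the ranges 171..201 and 202..237 collapse into a single block, leaving six blocks in total.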

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Csor.00g128750.m01  Csor.00g128750.m01  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding