Csor.00g214140 (gene) Silver-seed gourd (wild; sororia) v1

Overview
NameCsor.00g214140
Typegene
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationCsor_Chr10: 3782777 .. 3784633 (-)
RNA-Seq ExpressionCsor.00g214140
SyntenyCsor.00g214140
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSsinglepolypeptidestart_codonstop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCATCGCTTCTTGCCCTTCAGCCCAGTGCATCCCCGATCAACTCCCCAAAAGTTCACACATCACCTATCCATGGCCTTAAGTCATGCTCATCCATGTCTGAGCTCAAGCAATATCACTCCCAGATCATCCGTCTTGGTCTTTCTACTGACAATGATGCCATGGGTCGTCTTGTCAAATTCTGTGCTGTATCCAAGAATGGAGACCTTGATTATGCACTTCTTTTGTTTAAGACAATCCCCAACCCAGATGCATACATCTACAATACCTTAATAAGAGGCTACTTACAACTCCAATTCCCTAGAGCTTGCTTACTGTTGTATTTGGAAATGCTGCATAAGTGTGTCCTTCCCAATAAATTTACATTTCCTTCTCTAATTCGTGCTTGCTGCATCGATAATGCTATCGAGGAAGGGAAGCAAATTCATGGACATGTTCTTAAATTTGGCTTCAGAACAGATAGATTTTCTCAGAACAATTTGATTCATATGTATGCTAATTTTCAATCCTTGGAAGAAGCCAGAAGGGTCTTCGATGGTATTGAGCTACCCGATGTTGTGTCATGGACCACTCTGCTTACTGGGTATGCTCAGTGTGGATTTCTAGATGAAGCCTTACAAGTTTTCGAGTCGATGCCCGAGCGCAGCTCTGCTTCTTGGAATGCCATGATTTCTTCTTTTGTCCAAAACAATCGATTTCATGAAGCATTTGCTTTGTTTAATCGGATGAGGTCGGAGAAGATTGTGTTGGACAAATACATGGCTGCTAGCATGTTATCAGCTTGCACAGGATTGGGAGCACTTGAACAAGGAATGTGGATACACAGATACATCAAGAAAAGTGAGATCAAATTAGACTCGAAACTTGCAACGACGCTCATCGACATGTATTGTAAATGCGGTTGCCTGGATCGTGCTTTTGAAGTGTTCACTCAGTTGCCAGAAAAGGGCATTTCTTCATGGAATTGCATGATTGGAGGGATGGCTATGCACGGGAAAGGAGAGGCAGCCATTGAGCTTTTCAAAGACATGGAGACCAAAATGGTGACACCAGACAACATAACTTTCCTTAATGTTCTCAATGCTTGTGCTCACTCCGGGTTAGTTGAAAAGGGACGCTACTATTTCAATCATTTCACTCAAGTTTACGATATTAAACCCAGAACAGAACATTATGGATGCATGGTTGATTTATATGGACGAGCCGGGATGCTAGACGAAGCGATGAACCTCATACGTGAGATGCCAATGAGTCCCGACGCCGGAGTGTTAGGTGCCTTCGTTGGAGCATGTAAAATCCATGGGAACGTAGACATGGGGGAGGAAATAGGCAAGAGAGTAATAGAACTAGATCCAAGCAATAGCGGGCGCTACGTTCTACTCGGAAATCTGTACGCCGAGGCCGGTAGATGGGACGGTGTAGCAGAAGTACGAAAGCTAATGAATGATAGAGAAGTGAAGAAGGCTGCTGGGTTTTCCATGATTGAATTGGAGGGCGTGGTGCATGAATTCATTGCAGGAGGAAGGGGTCACCCAGAAGCCAATGAAATATATGGCAAAGTTAAGGAGATGGTAGAATGTATAAGATGTGTAGGATATGTAACCGAGGAGGAGGAGGAGGTGGAGAAGGACAACCCTGTTTACTACCATAGTGAGAAACTGGCAGTTGCTTATGGATTGCTCAAGACTCGAGCAGGGGAAACGCTTCGAATCACTAAGAATCTGCGGGTCTGTAAGGACTGTCACCAAGCTCTTAAGCTTGTTTCTAAGGTGTTTCAACGGAAGATCGTTGTGAGAGATAGAAATCGTTTCCATCATTTTGCTAATGGAGAGTGTTCTTGTAATGATTATTGGTAA

mRNA sequence

ATGTCATCGCTTCTTGCCCTTCAGCCCAGTGCATCCCCGATCAACTCCCCAAAAGTTCACACATCACCTATCCATGGCCTTAAGTCATGCTCATCCATGTCTGAGCTCAAGCAATATCACTCCCAGATCATCCGTCTTGGTCTTTCTACTGACAATGATGCCATGGGTCGTCTTGTCAAATTCTGTGCTGTATCCAAGAATGGAGACCTTGATTATGCACTTCTTTTGTTTAAGACAATCCCCAACCCAGATGCATACATCTACAATACCTTAATAAGAGGCTACTTACAACTCCAATTCCCTAGAGCTTGCTTACTGTTGTATTTGGAAATGCTGCATAAGTGTGTCCTTCCCAATAAATTTACATTTCCTTCTCTAATTCGTGCTTGCTGCATCGATAATGCTATCGAGGAAGGGAAGCAAATTCATGGACATGTTCTTAAATTTGGCTTCAGAACAGATAGATTTTCTCAGAACAATTTGATTCATATGTATGCTAATTTTCAATCCTTGGAAGAAGCCAGAAGGGTCTTCGATGGTATTGAGCTACCCGATGTTGTGTCATGGACCACTCTGCTTACTGGGTATGCTCAGTGTGGATTTCTAGATGAAGCCTTACAAGTTTTCGAGTCGATGCCCGAGCGCAGCTCTGCTTCTTGGAATGCCATGATTTCTTCTTTTGTCCAAAACAATCGATTTCATGAAGCATTTGCTTTGTTTAATCGGATGAGGTCGGAGAAGATTGTGTTGGACAAATACATGGCTGCTAGCATGTTATCAGCTTGCACAGGATTGGGAGCACTTGAACAAGGAATGTGGATACACAGATACATCAAGAAAAGTGAGATCAAATTAGACTCGAAACTTGCAACGACGCTCATCGACATGTATTGTAAATGCGGTTGCCTGGATCGTGCTTTTGAAGTGTTCACTCAGTTGCCAGAAAAGGGCATTTCTTCATGGAATTGCATGATTGGAGGGATGGCTATGCACGGGAAAGGAGAGGCAGCCATTGAGCTTTTCAAAGACATGGAGACCAAAATGGTGACACCAGACAACATAACTTTCCTTAATGTTCTCAATGCTTGTGCTCACTCCGGGTTAGTTGAAAAGGGACGCTACTATTTCAATCATTTCACTCAAGTTTACGATATTAAACCCAGAACAGAACATTATGGATGCATGGTTGATTTATATGGACGAGCCGGGATGCTAGACGAAGCGATGAACCTCATACGTGAGATGCCAATGAGTCCCGACGCCGGAGTGTTAGGTGCCTTCGTTGGAGCATGTAAAATCCATGGGAACGTAGACATGGGGGAGGAAATAGGCAAGAGAGTAATAGAACTAGATCCAAGCAATAGCGGGCGCTACGTTCTACTCGGAAATCTGTACGCCGAGGCCGGTAGATGGGACGGTGTAGCAGAAGTACGAAAGCTAATGAATGATAGAGAAGTGAAGAAGGCTGCTGGGTTTTCCATGATTGAATTGGAGGGCGTGGTGCATGAATTCATTGCAGGAGGAAGGGGTCACCCAGAAGCCAATGAAATATATGGCAAAGTTAAGGAGATGGTAGAATGTATAAGATGTGTAGGATATGTAACCGAGGAGGAGGAGGAGGTGGAGAAGGACAACCCTGTTTACTACCATAGTGAGAAACTGGCAGTTGCTTATGGATTGCTCAAGACTCGAGCAGGGGAAACGCTTCGAATCACTAAGAATCTGCGGGTCTGTAAGGACTGTCACCAAGCTCTTAAGCTTGTTTCTAAGGTGTTTCAACGGAAGATCGTTGTGAGAGATAGAAATCGTTTCCATCATTTTGCTAATGGAGAGTGTTCTTGTAATGATTATTGGTAA

Coding sequence (CDS)

ATGTCATCGCTTCTTGCCCTTCAGCCCAGTGCATCCCCGATCAACTCCCCAAAAGTTCACACATCACCTATCCATGGCCTTAAGTCATGCTCATCCATGTCTGAGCTCAAGCAATATCACTCCCAGATCATCCGTCTTGGTCTTTCTACTGACAATGATGCCATGGGTCGTCTTGTCAAATTCTGTGCTGTATCCAAGAATGGAGACCTTGATTATGCACTTCTTTTGTTTAAGACAATCCCCAACCCAGATGCATACATCTACAATACCTTAATAAGAGGCTACTTACAACTCCAATTCCCTAGAGCTTGCTTACTGTTGTATTTGGAAATGCTGCATAAGTGTGTCCTTCCCAATAAATTTACATTTCCTTCTCTAATTCGTGCTTGCTGCATCGATAATGCTATCGAGGAAGGGAAGCAAATTCATGGACATGTTCTTAAATTTGGCTTCAGAACAGATAGATTTTCTCAGAACAATTTGATTCATATGTATGCTAATTTTCAATCCTTGGAAGAAGCCAGAAGGGTCTTCGATGGTATTGAGCTACCCGATGTTGTGTCATGGACCACTCTGCTTACTGGGTATGCTCAGTGTGGATTTCTAGATGAAGCCTTACAAGTTTTCGAGTCGATGCCCGAGCGCAGCTCTGCTTCTTGGAATGCCATGATTTCTTCTTTTGTCCAAAACAATCGATTTCATGAAGCATTTGCTTTGTTTAATCGGATGAGGTCGGAGAAGATTGTGTTGGACAAATACATGGCTGCTAGCATGTTATCAGCTTGCACAGGATTGGGAGCACTTGAACAAGGAATGTGGATACACAGATACATCAAGAAAAGTGAGATCAAATTAGACTCGAAACTTGCAACGACGCTCATCGACATGTATTGTAAATGCGGTTGCCTGGATCGTGCTTTTGAAGTGTTCACTCAGTTGCCAGAAAAGGGCATTTCTTCATGGAATTGCATGATTGGAGGGATGGCTATGCACGGGAAAGGAGAGGCAGCCATTGAGCTTTTCAAAGACATGGAGACCAAAATGGTGACACCAGACAACATAACTTTCCTTAATGTTCTCAATGCTTGTGCTCACTCCGGGTTAGTTGAAAAGGGACGCTACTATTTCAATCATTTCACTCAAGTTTACGATATTAAACCCAGAACAGAACATTATGGATGCATGGTTGATTTATATGGACGAGCCGGGATGCTAGACGAAGCGATGAACCTCATACGTGAGATGCCAATGAGTCCCGACGCCGGAGTGTTAGGTGCCTTCGTTGGAGCATGTAAAATCCATGGGAACGTAGACATGGGGGAGGAAATAGGCAAGAGAGTAATAGAACTAGATCCAAGCAATAGCGGGCGCTACGTTCTACTCGGAAATCTGTACGCCGAGGCCGGTAGATGGGACGGTGTAGCAGAAGTACGAAAGCTAATGAATGATAGAGAAGTGAAGAAGGCTGCTGGGTTTTCCATGATTGAATTGGAGGGCGTGGTGCATGAATTCATTGCAGGAGGAAGGGGTCACCCAGAAGCCAATGAAATATATGGCAAAGTTAAGGAGATGGTAGAATGTATAAGATGTGTAGGATATGTAACCGAGGAGGAGGAGGAGGTGGAGAAGGACAACCCTGTTTACTACCATAGTGAGAAACTGGCAGTTGCTTATGGATTGCTCAAGACTCGAGCAGGGGAAACGCTTCGAATCACTAAGAATCTGCGGGTCTGTAAGGACTGTCACCAAGCTCTTAAGCTTGTTTCTAAGGTGTTTCAACGGAAGATCGTTGTGAGAGATAGAAATCGTTTCCATCATTTTGCTAATGGAGAGTGTTCTTGTAATGATTATTGGTAA

Protein sequence

MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALFNRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPDAGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKLMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEEVEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRDRNRFHHFANGECSCNDYW
Homology
BLAST of Csor.00g214140 vs. ExPASy Swiss-Prot
Match: Q9FJY7 (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 558.1 bits (1437), Expect = 1.2e-157
Identity = 272/600 (45.33%), Postives = 383/600 (63.83%), Query Frame = 0

Query: 27  LKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVSKNGD-LDYALLLFKTIPNPDA 86
           L+ CS   ELKQ H+++++ GL  D+ A+ + + FC  S + D L YA ++F     PD 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 87  YIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQIHGH 146
           +++N +IRG+     P   LLLY  ML      N +TFPSL++AC   +A EE  QIH  
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQ 140

Query: 147 VLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGFLDEA 206
           + K G+  D ++ N+LI+ YA   + + A  +FD I  PD VSW +++ GY + G +D A
Sbjct: 141 ITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIA 200

Query: 207 LQVFESMPERSSASWNAMISSFVQNNRFHEAFALFNRMRSEKIVLDKYMAASMLSACTGL 266
           L +F  M E+++ SW  MIS +VQ +   EA  LF+ M++  +  D    A+ LSAC  L
Sbjct: 201 LTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQL 260

Query: 267 GALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMI 326
           GALEQG WIH Y+ K+ I++DS L   LIDMY KCG ++ A EVF  + +K + +W  +I
Sbjct: 261 GALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 320

Query: 327 GGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFTQVYDI 386
            G A HG G  AI  F +M+   + P+ ITF  VL AC+++GLVE+G+  F    + Y++
Sbjct: 321 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 380

Query: 387 KPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPDAGVLGAFVGACKIHGNVDMGEEIGK 446
           KP  EHYGC+VDL GRAG+LDEA   I+EMP+ P+A + GA + AC+IH N+++GEEIG+
Sbjct: 381 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGE 440

Query: 447 RVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKLMNDREVKKAAGFSMIELEGVVHEFI 506
            +I +DP + GRYV   N++A   +WD  AE R+LM ++ V K  G S I LEG  HEF+
Sbjct: 441 ILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFL 500

Query: 507 AGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEE-------EVEKDNPVYYHSEKLAVAY 566
           AG R HPE  +I  K + M   +   GYV E EE       + E++  V+ HSEKLA+ Y
Sbjct: 501 AGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITY 560

Query: 567 GLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRDRNRFHHFANGECSCNDYW 619
           GL+KT+ G  +RI KNLRVCKDCH+  KL+SK+++R IV+RDR RFHHF +G+CSC DYW
Sbjct: 561 GLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCGDYW 620

BLAST of Csor.00g214140 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 526.6 bits (1355), Expect = 3.9e-148
Identity = 259/633 (40.92%), Postives = 389/633 (61.45%), Query Frame = 0

Query: 12  SPINSPKVHTSPIH-GLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVS--KNG 71
           SP +SP  H S +   + +C ++ +L Q H+  I+ G   D  A   +++FCA S   + 
Sbjct: 14  SPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHR 73

Query: 72  DLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACL---LLYLEMLHKCVLPNKFTFPS 131
           DLDYA  +F  +P  + + +NT+IRG+ +    +A +   L Y  M  + V PN+FTFPS
Sbjct: 74  DLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPS 133

Query: 132 LIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVF------- 191
           +++AC     I+EGKQIHG  LK+GF  D F  +NL+ MY     +++AR +F       
Sbjct: 134 VLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEK 193

Query: 192 DGIELPD-------VVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNN 251
           D + + D       +V W  ++ GY + G    A  +F+ M +RS  SWN MIS +  N 
Sbjct: 194 DMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNG 253

Query: 252 RFHEAFALFNRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLAT 311
            F +A  +F  M+   I  +     S+L A + LG+LE G W+H Y + S I++D  L +
Sbjct: 254 FFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGS 313

Query: 312 TLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTP 371
            LIDMY KCG +++A  VF +LP + + +W+ MI G A+HG+   AI+ F  M    V P
Sbjct: 314 ALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRP 373

Query: 372 DNITFLNVLNACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNL 431
            ++ ++N+L AC+H GLVE+GR YF+    V  ++PR EHYGCMVDL GR+G+LDEA   
Sbjct: 374 SDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEF 433

Query: 432 IREMPMSPDAGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRW 491
           I  MP+ PD  +  A +GAC++ GNV+MG+ +   ++++ P +SG YV L N+YA  G W
Sbjct: 434 ILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNW 493

Query: 492 DGVAEVRKLMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCV 551
             V+E+R  M +++++K  G S+I+++GV+HEF+     HP+A EI   + E+ + +R  
Sbjct: 494 SEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLA 553

Query: 552 GY------VTEEEEEVEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQAL 611
           GY      V    EE +K+N ++YHSEK+A A+GL+ T  G+ +RI KNLR+C+DCH ++
Sbjct: 554 GYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSI 613

Query: 612 KLVSKVFQRKIVVRDRNRFHHFANGECSCNDYW 619
           KL+SKV++RKI VRDR RFHHF +G CSC DYW
Sbjct: 614 KLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of Csor.00g214140 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 523.5 bits (1347), Expect = 3.3e-147
Identity = 276/722 (38.23%), Postives = 404/722 (55.96%), Query Frame = 0

Query: 9   PSAS--PINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVSK 68
           PS+S  P +S + H S +  L +C ++  L+  H+Q+I++GL   N A+ +L++FC +S 
Sbjct: 21  PSSSDPPYDSIRNHPS-LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSP 80

Query: 69  NGD-LDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPS 128
           + + L YA+ +FKTI  P+  I+NT+ RG+     P + L LY+ M+   +LPN +TFP 
Sbjct: 81  HFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPF 140

Query: 129 LIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDGIELPD 188
           ++++C    A +EG+QIHGHVLK G   D +   +LI MY     LE+A +VFD     D
Sbjct: 141 VLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRD 200

Query: 189 VVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALFNRMRS 248
           VVS+T L+ GYA  G+++ A ++F+ +P +   SWNAMIS + +   + EA  LF  M  
Sbjct: 201 VVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMK 260

Query: 249 EKI--------------------------------------------VLDKYMAA----- 308
             +                                            ++D Y        
Sbjct: 261 TNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELET 320

Query: 309 ----------------------------------------------------SMLSACTG 368
                                                               S+L AC  
Sbjct: 321 ACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAH 380

Query: 369 LGALEQGMWIHRYIKK--SEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSWN 428
           LGA++ G WIH YI K    +   S L T+LIDMY KCG ++ A +VF  +  K +SSWN
Sbjct: 381 LGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWN 440

Query: 429 CMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFTQV 488
            MI G AMHG+ +A+ +LF  M    + PD+ITF+ +L+AC+HSG+++ GR+ F   TQ 
Sbjct: 441 AMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQD 500

Query: 489 YDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPDAGVLGAFVGACKIHGNVDMGEE 548
           Y + P+ EHYGCM+DL G +G+  EA  +I  M M PD  +  + + ACK+HGNV++GE 
Sbjct: 501 YKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGES 560

Query: 549 IGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKLMNDREVKKAAGFSMIELEGVVH 608
             + +I+++P N G YVLL N+YA AGRW+ VA+ R L+ND+ +KK  G S IE++ VVH
Sbjct: 561 FAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVH 620

Query: 609 EFIAGGRGHPEANEIYGKVKEMVECIRCVGY------VTEEEEEVEKDNPVYYHSEKLAV 619
           EFI G + HP   EIYG ++EM   +   G+      V +E EE  K+  + +HSEKLA+
Sbjct: 621 EFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAI 680

BLAST of Csor.00g214140 vs. ExPASy Swiss-Prot
Match: Q9FG16 (Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H88 PE=3 SV=1)

HSP 1 Score: 516.2 bits (1328), Expect = 5.3e-145
Identity = 250/604 (41.39%), Postives = 383/604 (63.41%), Query Frame = 0

Query: 27  LKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVSKNGD-----LDYALLLFKTIP 86
           L+SCSS S+LK  H  ++R  L +D     RL+  C      +     L YA  +F  I 
Sbjct: 19  LQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFSQIQ 78

Query: 87  NPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQ 146
           NP+ +++N LIR +     P      Y +ML   + P+  TFP LI+A      +  G+Q
Sbjct: 79  NPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLVGEQ 138

Query: 147 IHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGF 206
            H  +++FGF+ D + +N+L+HMYAN   +  A R+F  +   DVVSWT+++ GY +CG 
Sbjct: 139 THSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCKCGM 198

Query: 207 LDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALFNRMRSEKIVLDKYMAASMLSA 266
           ++ A ++F+ MP R+  +W+ MI+ + +NN F +A  LF  M+ E +V ++ +  S++S+
Sbjct: 199 VENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSVISS 258

Query: 267 CTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSW 326
           C  LGALE G   + Y+ KS + ++  L T L+DM+ +CG +++A  VF  LPE    SW
Sbjct: 259 CAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPETDSLSW 318

Query: 327 NCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFTQ 386
           + +I G+A+HG    A+  F  M +    P ++TF  VL+AC+H GLVEKG   + +  +
Sbjct: 319 SSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKK 378

Query: 387 VYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPDAGVLGAFVGACKIHGNVDMGE 446
            + I+PR EHYGC+VD+ GRAG L EA N I +M + P+A +LGA +GACKI+ N ++ E
Sbjct: 379 DHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKNTEVAE 438

Query: 447 EIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKLMNDREVKKAAGFSMIELEGVV 506
            +G  +I++ P +SG YVLL N+YA AG+WD +  +R +M ++ VKK  G+S+IE++G +
Sbjct: 439 RVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIEIDGKI 498

Query: 507 HEFIAG-GRGHPEANEIYGKVKEMVECIRCVGYVTE------EEEEVEKDNPVYYHSEKL 566
           ++F  G  + HPE  +I  K +E++  IR +GY         + +E EK++ ++ HSEKL
Sbjct: 499 NKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKESSIHMHSEKL 558

Query: 567 AVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRDRNRFHHFANGECSC 619
           A+AYG++KT+ G T+RI KNLRVC+DCH   KL+S+V+ R+++VRDRNRFHHF NG CSC
Sbjct: 559 AIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHFRNGVCSC 618

BLAST of Csor.00g214140 vs. ExPASy Swiss-Prot
Match: O82380 (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 487.6 bits (1254), Expect = 2.0e-136
Identity = 259/727 (35.63%), Postives = 395/727 (54.33%), Query Frame = 0

Query: 5   LALQPSASPINSPKVH---TSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKF 64
           L   P+ S  N P  +   +  I  ++ C S+ +LKQ H  +IR G  +D  +  +L   
Sbjct: 12  LPRHPNFSNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 71

Query: 65  CAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKC-VLPNK 124
            A+S    L+YA  +F  IP P+++ +NTLIR Y     P   +  +L+M+ +    PNK
Sbjct: 72  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 131

Query: 125 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIH----------------- 184
           +TFP LI+A    +++  G+ +HG  +K    +D F  N+LIH                 
Sbjct: 132 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 191

Query: 185 ------------------------------------------------------------ 244
                                                                       
Sbjct: 192 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 251

Query: 245 ------------------------MYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQC 304
                                   MY    S+E+A+R+FD +E  D V+WTT+L GYA  
Sbjct: 252 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 311

Query: 305 GFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALFNRMRSEK-IVLDKYMAASM 364
              + A +V  SMP++   +WNA+IS++ QN + +EA  +F+ ++ +K + L++    S 
Sbjct: 312 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 371

Query: 365 LSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGI 424
           LSAC  +GALE G WIH YIKK  I+++  + + LI MY KCG L+++ EVF  + ++ +
Sbjct: 372 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 431

Query: 425 SSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNH 484
             W+ MIGG+AMHG G  A+++F  M+   V P+ +TF NV  AC+H+GLV++    F+ 
Sbjct: 432 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 491

Query: 485 FTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPDAGVLGAFVGACKIHGNVD 544
               Y I P  +HY C+VD+ GR+G L++A+  I  MP+ P   V GA +GACKIH N++
Sbjct: 492 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 551

Query: 545 MGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKLMNDREVKKAAGFSMIELE 604
           + E    R++EL+P N G +VLL N+YA+ G+W+ V+E+RK M    +KK  G S IE++
Sbjct: 552 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 611

Query: 605 GVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTE-------EEEEVEKDNPVYYHS 619
           G++HEF++G   HP + ++YGK+ E++E ++  GY  E        EEE  K+  +  HS
Sbjct: 612 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 671

BLAST of Csor.00g214140 vs. NCBI nr
Match: KAG6590058.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1262 bits (3266), Expect = 0.0
Identity = 618/618 (100.00%), Postives = 618/618 (100.00%), Query Frame = 0

Query: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK
Sbjct: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60

Query: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK
Sbjct: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180
           FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG
Sbjct: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240
           IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF
Sbjct: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC
Sbjct: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL
Sbjct: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420
           NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD
Sbjct: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420

Query: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480
           AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL
Sbjct: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540
           MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE
Sbjct: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540

Query: 541 VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRD 600
           VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRD
Sbjct: 541 VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRD 600

Query: 601 RNRFHHFANGECSCNDYW 618
           RNRFHHFANGECSCNDYW
Sbjct: 601 RNRFHHFANGECSCNDYW 618

BLAST of Csor.00g214140 vs. NCBI nr
Match: XP_022960820.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita moschata])

HSP 1 Score: 1260 bits (3260), Expect = 0.0
Identity = 616/618 (99.68%), Postives = 618/618 (100.00%), Query Frame = 0

Query: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK
Sbjct: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60

Query: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK
Sbjct: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180
           FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG
Sbjct: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240
           IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF
Sbjct: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC
Sbjct: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL
Sbjct: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420
           NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD
Sbjct: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420

Query: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480
           AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL
Sbjct: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540
           MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE
Sbjct: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540

Query: 541 VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRD 600
           VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKI+VRD
Sbjct: 541 VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIIVRD 600

Query: 601 RNRFHHFANGECSCNDYW 618
           RNRFHHFA+GECSCNDYW
Sbjct: 601 RNRFHHFADGECSCNDYW 618

BLAST of Csor.00g214140 vs. NCBI nr
Match: KAG7023724.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1250 bits (3235), Expect = 0.0
Identity = 613/618 (99.19%), Postives = 615/618 (99.51%), Query Frame = 0

Query: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK
Sbjct: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60

Query: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHK VLPNK
Sbjct: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKRVLPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180
           FTFPSLIRACCIDNAIEEGKQIH HVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG
Sbjct: 121 FTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240
           IELPDVVSWTTLLTGYA CGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF
Sbjct: 181 IELPDVVSWTTLLTGYALCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC
Sbjct: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL
Sbjct: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420
           NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD
Sbjct: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420

Query: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480
           AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL
Sbjct: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540
           MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE
Sbjct: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540

Query: 541 VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRD 600
           VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKI+VRD
Sbjct: 541 VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIIVRD 600

Query: 601 RNRFHHFANGECSCNDYW 618
           RNRFHHFA+GECSCNDYW
Sbjct: 601 RNRFHHFADGECSCNDYW 618

BLAST of Csor.00g214140 vs. NCBI nr
Match: XP_023515687.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1237 bits (3201), Expect = 0.0
Identity = 605/620 (97.58%), Postives = 613/620 (98.87%), Query Frame = 0

Query: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK
Sbjct: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60

Query: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK
Sbjct: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180
           FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG
Sbjct: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240
           IELPD VSWTTLLTGYAQCGFLDEAL+VFESMPERSSASWNAMISS VQNNRFHEAFALF
Sbjct: 181 IELPDAVSWTTLLTGYAQCGFLDEALEVFESMPERSSASWNAMISSCVQNNRFHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           NRMRSEKIVLDKYMAASMLSACTGLG+LEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC
Sbjct: 241 NRMRSEKIVLDKYMAASMLSACTGLGSLEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL
Sbjct: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420
           NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD
Sbjct: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420

Query: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480
           AGVLGAFVGACKIHGN+DMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL
Sbjct: 421 AGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540
           MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEA EIYGKV EMVECIRCVGYVTEEEEE
Sbjct: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMVECIRCVGYVTEEEEE 540

Query: 541 --VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVV 600
             VEKDNPVYYHSEKLAVA+GLLKT+AGETLRITKNLRVCKDCHQALKLVSKVF+RKI+V
Sbjct: 541 EVVEKDNPVYYHSEKLAVAFGLLKTKAGETLRITKNLRVCKDCHQALKLVSKVFERKIIV 600

Query: 601 RDRNRFHHFANGECSCNDYW 618
           RDRNRFHHF +GECSCNDYW
Sbjct: 601 RDRNRFHHFDDGECSCNDYW 620

BLAST of Csor.00g214140 vs. NCBI nr
Match: XP_022987211.1 (pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita maxima] >XP_022987212.1 pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita maxima])

HSP 1 Score: 1223 bits (3165), Expect = 0.0
Identity = 598/620 (96.45%), Postives = 608/620 (98.06%), Query Frame = 0

Query: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           MSSLLALQ SASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK
Sbjct: 1   MSSLLALQLSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60

Query: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCAV KNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK
Sbjct: 61  FCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180
           FTFPSLIRACCIDNAIEEGKQIH HVLKFGFR DRFSQNNLIHMYANFQSLEEARRVFDG
Sbjct: 121 FTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDG 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240
           IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPE SSASWNAMISSFVQNNRFHEAFALF
Sbjct: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           NRMRSEKIVLDKYMAASMLSACTGLGALEQG+WIHRYIKKSEIKLDSKLATTLIDMYCKC
Sbjct: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL
Sbjct: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420
           NACAHSGLVEKGRYYFNHF+QVYDIKPRTEHYGCMVDLYGR+GMLDEAM LIREMPMSPD
Sbjct: 361 NACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPD 420

Query: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480
           AGVLGAFVGACKIHGN+DMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWD VAEVRKL
Sbjct: 421 AGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540
           MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEA EIYGKV EM+ECIRCVGYVTEEEEE
Sbjct: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEEEE 540

Query: 541 --VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVV 600
             VEKDNPVYYHSEKLAVA+GLLKT+AGETLRITKNLRVCKDCHQALKLVSKVF+RK +V
Sbjct: 541 EEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIV 600

Query: 601 RDRNRFHHFANGECSCNDYW 618
           RDRNRFHHFA+GECSCNDYW
Sbjct: 601 RDRNRFHHFADGECSCNDYW 620

BLAST of Csor.00g214140 vs. ExPASy TrEMBL
Match: A0A6J1HA70 (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita moschata OX=3662 GN=LOC111461510 PE=3 SV=1)

HSP 1 Score: 1260 bits (3260), Expect = 0.0
Identity = 616/618 (99.68%), Postives = 618/618 (100.00%), Query Frame = 0

Query: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK
Sbjct: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60

Query: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK
Sbjct: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180
           FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG
Sbjct: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240
           IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF
Sbjct: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC
Sbjct: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL
Sbjct: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420
           NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD
Sbjct: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420

Query: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480
           AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL
Sbjct: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540
           MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE
Sbjct: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540

Query: 541 VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRD 600
           VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKI+VRD
Sbjct: 541 VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIIVRD 600

Query: 601 RNRFHHFANGECSCNDYW 618
           RNRFHHFA+GECSCNDYW
Sbjct: 601 RNRFHHFADGECSCNDYW 618

BLAST of Csor.00g214140 vs. ExPASy TrEMBL
Match: A0A6J1JDI6 (pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita maxima OX=3661 GN=LOC111484832 PE=3 SV=1)

HSP 1 Score: 1223 bits (3165), Expect = 0.0
Identity = 598/620 (96.45%), Postives = 608/620 (98.06%), Query Frame = 0

Query: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           MSSLLALQ SASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK
Sbjct: 1   MSSLLALQLSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60

Query: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCAV KNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK
Sbjct: 61  FCAVCKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180
           FTFPSLIRACCIDNAIEEGKQIH HVLKFGFR DRFSQNNLIHMYANFQSLEEARRVFDG
Sbjct: 121 FTFPSLIRACCIDNAIEEGKQIHAHVLKFGFRADRFSQNNLIHMYANFQSLEEARRVFDG 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240
           IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPE SSASWNAMISSFVQNNRFHEAFALF
Sbjct: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPEHSSASWNAMISSFVQNNRFHEAFALF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           NRMRSEKIVLDKYMAASMLSACTGLGALEQG+WIHRYIKKSEIKLDSKLATTLIDMYCKC
Sbjct: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGIWIHRYIKKSEIKLDSKLATTLIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL
Sbjct: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420
           NACAHSGLVEKGRYYFNHF+QVYDIKPRTEHYGCMVDLYGR+GMLDEAM LIREMPMSPD
Sbjct: 361 NACAHSGLVEKGRYYFNHFSQVYDIKPRTEHYGCMVDLYGRSGMLDEAMKLIREMPMSPD 420

Query: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480
           AGVLGAFVGACKIHGN+DMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWD VAEVRKL
Sbjct: 421 AGVLGAFVGACKIHGNIDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDSVAEVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540
           MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEA EIYGKV EM+ECIRCVGYVTEEEEE
Sbjct: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEAKEIYGKVNEMIECIRCVGYVTEEEEE 540

Query: 541 --VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVV 600
             VEKDNPVYYHSEKLAVA+GLLKT+AGETLRITKNLRVCKDCHQALKLVSKVF+RK +V
Sbjct: 541 EEVEKDNPVYYHSEKLAVAFGLLKTQAGETLRITKNLRVCKDCHQALKLVSKVFERKFIV 600

Query: 601 RDRNRFHHFANGECSCNDYW 618
           RDRNRFHHFA+GECSCNDYW
Sbjct: 601 RDRNRFHHFADGECSCNDYW 620

BLAST of Csor.00g214140 vs. ExPASy TrEMBL
Match: A0A6J1CAN4 (uncharacterized protein LOC111009921 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111009921 PE=3 SV=1)

HSP 1 Score: 1058 bits (2737), Expect = 0.0
Identity = 515/601 (85.69%), Postives = 560/601 (93.18%), Query Frame = 0

Query: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           M  LLALQP+AS INS +VHTSPIHGL+SCSSMSELKQ+HSQIIRLGLS DNDAMGRL+K
Sbjct: 1   MPPLLALQPTASVINSSRVHTSPIHGLQSCSSMSELKQFHSQIIRLGLSIDNDAMGRLIK 60

Query: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCAVSKNGDLDYALLLFKTIP PDA+IYNTLIRGYLQ Q  RAC+LLYL+MLHK VLPNK
Sbjct: 61  FCAVSKNGDLDYALLLFKTIPYPDAFIYNTLIRGYLQQQSSRACILLYLQMLHKVVLPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180
           FTFPSLIRACCIDNAIEEGKQ+H HVLKFGFRTD FSQNNLIHMYANFQSLE+ARRVFDG
Sbjct: 121 FTFPSLIRACCIDNAIEEGKQVHAHVLKFGFRTDIFSQNNLIHMYANFQSLEDARRVFDG 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240
           IELPD V+WTTLLTGYAQCG +DEA QVFESMPE +SASWNAMISSFVQNNRFHEAF LF
Sbjct: 181 IELPDAVTWTTLLTGYAQCGLVDEAFQVFESMPEHNSASWNAMISSFVQNNRFHEAFXLF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           NRMRSEKIVLDKY+AASMLSACTGLGALEQG WIHRYIKKS I+LDSKLATTLIDMYCKC
Sbjct: 241 NRMRSEKIVLDKYVAASMLSACTGLGALEQGKWIHRYIKKSGIELDSKLATTLIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCLD AF VFT LPEKGISSWNCMIGGMAMHG+GEAAIELFK+ME KMVTPDNITFLNVL
Sbjct: 301 GCLDCAFSVFTHLPEKGISSWNCMIGGMAMHGRGEAAIELFKEMEMKMVTPDNITFLNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420
           +ACAHSGLVE+GR+YF HF ++Y I+PRTEH+GCMVDLYGRAGML+EAM LI EMPM+PD
Sbjct: 361 SACAHSGLVEEGRHYFRHFIELYGIEPRTEHFGCMVDLYGRAGMLEEAMKLISEMPMNPD 420

Query: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480
           AGVLGA VGACKIHG+VD+GEEIG RVIEL+P+NSGRYVLLGNLYA+AGRW+ VAEVRKL
Sbjct: 421 AGVLGALVGACKIHGDVDLGEEIGLRVIELEPTNSGRYVLLGNLYAKAGRWEDVAEVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540
           MNDREVKKA GFSMIELEGVV+EFIAGGR HPEA++IY KV EM+ECIR VGYV E E +
Sbjct: 481 MNDREVKKAPGFSMIELEGVVYEFIAGGRAHPEADKIYVKVNEMLECIRYVGYVPENEID 540

Query: 541 VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRD 600
           V+KDNP+YYHSEKLAVA+GLLKT+AGETLRITKNLR+CKDCHQALKLVSKVF+RKI++  
Sbjct: 541 VDKDNPIYYHSEKLAVAFGLLKTKAGETLRITKNLRICKDCHQALKLVSKVFERKIILEH 600

BLAST of Csor.00g214140 vs. ExPASy TrEMBL
Match: A0A6J1CAX3 (uncharacterized protein LOC111009921 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111009921 PE=3 SV=1)

HSP 1 Score: 1058 bits (2737), Expect = 0.0
Identity = 515/601 (85.69%), Postives = 560/601 (93.18%), Query Frame = 0

Query: 1   MSSLLALQPSASPINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVK 60
           M  LLALQP+AS INS +VHTSPIHGL+SCSSMSELKQ+HSQIIRLGLS DNDAMGRL+K
Sbjct: 1   MPPLLALQPTASVINSSRVHTSPIHGLQSCSSMSELKQFHSQIIRLGLSIDNDAMGRLIK 60

Query: 61  FCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNK 120
           FCAVSKNGDLDYALLLFKTIP PDA+IYNTLIRGYLQ Q  RAC+LLYL+MLHK VLPNK
Sbjct: 61  FCAVSKNGDLDYALLLFKTIPYPDAFIYNTLIRGYLQQQSSRACILLYLQMLHKVVLPNK 120

Query: 121 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDG 180
           FTFPSLIRACCIDNAIEEGKQ+H HVLKFGFRTD FSQNNLIHMYANFQSLE+ARRVFDG
Sbjct: 121 FTFPSLIRACCIDNAIEEGKQVHAHVLKFGFRTDIFSQNNLIHMYANFQSLEDARRVFDG 180

Query: 181 IELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALF 240
           IELPD V+WTTLLTGYAQCG +DEA QVFESMPE +SASWNAMISSFVQNNRFHEAF LF
Sbjct: 181 IELPDAVTWTTLLTGYAQCGLVDEAFQVFESMPEHNSASWNAMISSFVQNNRFHEAFXLF 240

Query: 241 NRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKC 300
           NRMRSEKIVLDKY+AASMLSACTGLGALEQG WIHRYIKKS I+LDSKLATTLIDMYCKC
Sbjct: 241 NRMRSEKIVLDKYVAASMLSACTGLGALEQGKWIHRYIKKSGIELDSKLATTLIDMYCKC 300

Query: 301 GCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVL 360
           GCLD AF VFT LPEKGISSWNCMIGGMAMHG+GEAAIELFK+ME KMVTPDNITFLNVL
Sbjct: 301 GCLDCAFSVFTHLPEKGISSWNCMIGGMAMHGRGEAAIELFKEMEMKMVTPDNITFLNVL 360

Query: 361 NACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPD 420
           +ACAHSGLVE+GR+YF HF ++Y I+PRTEH+GCMVDLYGRAGML+EAM LI EMPM+PD
Sbjct: 361 SACAHSGLVEEGRHYFRHFIELYGIEPRTEHFGCMVDLYGRAGMLEEAMKLISEMPMNPD 420

Query: 421 AGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKL 480
           AGVLGA VGACKIHG+VD+GEEIG RVIEL+P+NSGRYVLLGNLYA+AGRW+ VAEVRKL
Sbjct: 421 AGVLGALVGACKIHGDVDLGEEIGLRVIELEPTNSGRYVLLGNLYAKAGRWEDVAEVRKL 480

Query: 481 MNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEEE 540
           MNDREVKKA GFSMIELEGVV+EFIAGGR HPEA++IY KV EM+ECIR VGYV E E +
Sbjct: 481 MNDREVKKAPGFSMIELEGVVYEFIAGGRAHPEADKIYVKVNEMLECIRYVGYVPENEID 540

Query: 541 VEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRD 600
           V+KDNP+YYHSEKLAVA+GLLKT+AGETLRITKNLR+CKDCHQALKLVSKVF+RKI++  
Sbjct: 541 VDKDNPIYYHSEKLAVAFGLLKTKAGETLRITKNLRICKDCHQALKLVSKVFERKIILEH 600

BLAST of Csor.00g214140 vs. ExPASy TrEMBL
Match: A0A5D3CGC5 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold16G003910 PE=3 SV=1)

HSP 1 Score: 1048 bits (2711), Expect = 0.0
Identity = 507/619 (81.91%), Postives = 559/619 (90.31%), Query Frame = 0

Query: 1   MSSLLALQPSASPINSPKVHTSPI-HGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLV 60
           M SLL L P  S  NSPK + SPI   L SCSSMSELKQ+HSQIIRLGLSTDN+A+GRL+
Sbjct: 1   MGSLLPLHPIPSLPNSPKFNPSPIFQALNSCSSMSELKQFHSQIIRLGLSTDNNAIGRLI 60

Query: 61  KFCAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPN 120
           KFCAVSK GDL YALLLF +IP PDA+IYNTLIR YL    P++ LLLYL+MLH  V PN
Sbjct: 61  KFCAVSKYGDLHYALLLFNSIPYPDAFIYNTLIRAYLHFNSPKSSLLLYLQMLHNSVFPN 120

Query: 121 KFTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFD 180
           KFTFPS+IRACCIDN++EEGKQIH HV+KFGF  DRF QNNLIHMYANFQSLEEARRVFD
Sbjct: 121 KFTFPSVIRACCIDNSVEEGKQIHTHVVKFGFTKDRFCQNNLIHMYANFQSLEEARRVFD 180

Query: 181 GIELPDVVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFAL 240
            IELPDVV+WTTLLTGYAQ G++DE+L+VFESMPER+SASWNAMIS FVQNNRFHEAF L
Sbjct: 181 CIELPDVVAWTTLLTGYAQLGYVDESLRVFESMPERNSASWNAMISCFVQNNRFHEAFGL 240

Query: 241 FNRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCK 300
           FNRMR EK+VL+K++AASMLSACTGLGAL+QG WIHRYI+K+ I+ DSKLATTLIDMYCK
Sbjct: 241 FNRMRLEKVVLEKFVAASMLSACTGLGALDQGKWIHRYIEKNGIEFDSKLATTLIDMYCK 300

Query: 301 CGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNV 360
           CGCLD A+EVF  LPEKGISSWNCMIGGMAMHGKGEAAIELFK+METKMV PDNITFLNV
Sbjct: 301 CGCLDCAYEVFVHLPEKGISSWNCMIGGMAMHGKGEAAIELFKEMETKMVKPDNITFLNV 360

Query: 361 LNACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSP 420
           L+ACAHSGLVEKG++YFN FTQVY I+PRTEHYGCMVDLYGRAG+L+EAM +I EMPMSP
Sbjct: 361 LSACAHSGLVEKGQHYFNRFTQVYGIEPRTEHYGCMVDLYGRAGLLEEAMKVIDEMPMSP 420

Query: 421 DAGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRK 480
           D GVLGAFVGACKIHGN+++GEE+GKRVIEL+P+NSGRYVLLGNLYAEAGRW+GVAEVRK
Sbjct: 421 DVGVLGAFVGACKIHGNIELGEEVGKRVIELEPTNSGRYVLLGNLYAEAGRWEGVAEVRK 480

Query: 481 LMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEE 540
           LMNDREVKKAAG SMIELEGVV+EFIAGGR HPEA EIY K+ EM+ECIR  GY+ E E 
Sbjct: 481 LMNDREVKKAAGVSMIELEGVVYEFIAGGRNHPEAKEIYDKLNEMLECIRNEGYIAENEI 540

Query: 541 EVEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVR 600
           E EKDNPVYYHSEKLA+A+GLLKT+AGE LRITKNLRVCKDCHQALKLVSKVFQRKI+VR
Sbjct: 541 EEEKDNPVYYHSEKLAIAFGLLKTKAGEILRITKNLRVCKDCHQALKLVSKVFQRKIIVR 600

Query: 601 DRNRFHHFANGECSCNDYW 618
           DRNRFHHF NGECSCNDYW
Sbjct: 601 DRNRFHHFGNGECSCNDYW 619

BLAST of Csor.00g214140 vs. TAIR 10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 558.1 bits (1437), Expect = 8.6e-159
Identity = 272/600 (45.33%), Postives = 383/600 (63.83%), Query Frame = 0

Query: 27  LKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVSKNGD-LDYALLLFKTIPNPDA 86
           L+ CS   ELKQ H+++++ GL  D+ A+ + + FC  S + D L YA ++F     PD 
Sbjct: 21  LQRCSKQEELKQIHARMLKTGLMQDSYAITKFLSFCISSTSSDFLPYAQIVFDGFDRPDT 80

Query: 87  YIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQIHGH 146
           +++N +IRG+     P   LLLY  ML      N +TFPSL++AC   +A EE  QIH  
Sbjct: 81  FLWNLMIRGFSCSDEPERSLLLYQRMLCSSAPHNAYTFPSLLKACSNLSAFEETTQIHAQ 140

Query: 147 VLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGFLDEA 206
           + K G+  D ++ N+LI+ YA   + + A  +FD I  PD VSW +++ GY + G +D A
Sbjct: 141 ITKLGYENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPDDVSWNSVIKGYVKAGKMDIA 200

Query: 207 LQVFESMPERSSASWNAMISSFVQNNRFHEAFALFNRMRSEKIVLDKYMAASMLSACTGL 266
           L +F  M E+++ SW  MIS +VQ +   EA  LF+ M++  +  D    A+ LSAC  L
Sbjct: 201 LTLFRKMAEKNAISWTTMISGYVQADMNKEALQLFHEMQNSDVEPDNVSLANALSACAQL 260

Query: 267 GALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMI 326
           GALEQG WIH Y+ K+ I++DS L   LIDMY KCG ++ A EVF  + +K + +W  +I
Sbjct: 261 GALEQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALI 320

Query: 327 GGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFTQVYDI 386
            G A HG G  AI  F +M+   + P+ ITF  VL AC+++GLVE+G+  F    + Y++
Sbjct: 321 SGYAYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNL 380

Query: 387 KPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPDAGVLGAFVGACKIHGNVDMGEEIGK 446
           KP  EHYGC+VDL GRAG+LDEA   I+EMP+ P+A + GA + AC+IH N+++GEEIG+
Sbjct: 381 KPTIEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGE 440

Query: 447 RVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKLMNDREVKKAAGFSMIELEGVVHEFI 506
            +I +DP + GRYV   N++A   +WD  AE R+LM ++ V K  G S I LEG  HEF+
Sbjct: 441 ILIAIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFL 500

Query: 507 AGGRGHPEANEIYGKVKEMVECIRCVGYVTEEEE-------EVEKDNPVYYHSEKLAVAY 566
           AG R HPE  +I  K + M   +   GYV E EE       + E++  V+ HSEKLA+ Y
Sbjct: 501 AGDRSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITY 560

Query: 567 GLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRDRNRFHHFANGECSCNDYW 619
           GL+KT+ G  +RI KNLRVCKDCH+  KL+SK+++R IV+RDR RFHHF +G+CSC DYW
Sbjct: 561 GLIKTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCGDYW 620

BLAST of Csor.00g214140 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 526.6 bits (1355), Expect = 2.8e-149
Identity = 259/633 (40.92%), Postives = 389/633 (61.45%), Query Frame = 0

Query: 12  SPINSPKVHTSPIH-GLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVS--KNG 71
           SP +SP  H S +   + +C ++ +L Q H+  I+ G   D  A   +++FCA S   + 
Sbjct: 14  SPASSPASHPSSLFPQINNCRTIRDLSQIHAVFIKSGQMRDTLAAAEILRFCATSDLHHR 73

Query: 72  DLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACL---LLYLEMLHKCVLPNKFTFPS 131
           DLDYA  +F  +P  + + +NT+IRG+ +    +A +   L Y  M  + V PN+FTFPS
Sbjct: 74  DLDYAHKIFNQMPQRNCFSWNTIIRGFSESDEDKALIAITLFYEMMSDEFVEPNRFTFPS 133

Query: 132 LIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVF------- 191
           +++AC     I+EGKQIHG  LK+GF  D F  +NL+ MY     +++AR +F       
Sbjct: 134 VLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEK 193

Query: 192 DGIELPD-------VVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNN 251
           D + + D       +V W  ++ GY + G    A  +F+ M +RS  SWN MIS +  N 
Sbjct: 194 DMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLFDKMRQRSVVSWNTMISGYSLNG 253

Query: 252 RFHEAFALFNRMRSEKIVLDKYMAASMLSACTGLGALEQGMWIHRYIKKSEIKLDSKLAT 311
            F +A  +F  M+   I  +     S+L A + LG+LE G W+H Y + S I++D  L +
Sbjct: 254 FFKDAVEVFREMKKGDIRPNYVTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGS 313

Query: 312 TLIDMYCKCGCLDRAFEVFTQLPEKGISSWNCMIGGMAMHGKGEAAIELFKDMETKMVTP 371
            LIDMY KCG +++A  VF +LP + + +W+ MI G A+HG+   AI+ F  M    V P
Sbjct: 314 ALIDMYSKCGIIEKAIHVFERLPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRP 373

Query: 372 DNITFLNVLNACAHSGLVEKGRYYFNHFTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNL 431
            ++ ++N+L AC+H GLVE+GR YF+    V  ++PR EHYGCMVDL GR+G+LDEA   
Sbjct: 374 SDVAYINLLTACSHGGLVEEGRRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEF 433

Query: 432 IREMPMSPDAGVLGAFVGACKIHGNVDMGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRW 491
           I  MP+ PD  +  A +GAC++ GNV+MG+ +   ++++ P +SG YV L N+YA  G W
Sbjct: 434 ILNMPIKPDDVIWKALLGACRMQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNW 493

Query: 492 DGVAEVRKLMNDREVKKAAGFSMIELEGVVHEFIAGGRGHPEANEIYGKVKEMVECIRCV 551
             V+E+R  M +++++K  G S+I+++GV+HEF+     HP+A EI   + E+ + +R  
Sbjct: 494 SEVSEMRLRMKEKDIRKDPGCSLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLA 553

Query: 552 GY------VTEEEEEVEKDNPVYYHSEKLAVAYGLLKTRAGETLRITKNLRVCKDCHQAL 611
           GY      V    EE +K+N ++YHSEK+A A+GL+ T  G+ +RI KNLR+C+DCH ++
Sbjct: 554 GYRPITTQVLLNLEEEDKENVLHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSI 613

Query: 612 KLVSKVFQRKIVVRDRNRFHHFANGECSCNDYW 619
           KL+SKV++RKI VRDR RFHHF +G CSC DYW
Sbjct: 614 KLISKVYKRKITVRDRKRFHHFQDGSCSCMDYW 646

BLAST of Csor.00g214140 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 523.5 bits (1347), Expect = 2.3e-148
Identity = 276/722 (38.23%), Postives = 404/722 (55.96%), Query Frame = 0

Query: 9   PSAS--PINSPKVHTSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVSK 68
           PS+S  P +S + H S +  L +C ++  L+  H+Q+I++GL   N A+ +L++FC +S 
Sbjct: 21  PSSSDPPYDSIRNHPS-LSLLHNCKTLQSLRIIHAQMIKIGLHNTNYALSKLIEFCILSP 80

Query: 69  NGD-LDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPS 128
           + + L YA+ +FKTI  P+  I+NT+ RG+     P + L LY+ M+   +LPN +TFP 
Sbjct: 81  HFEGLPYAISVFKTIQEPNLLIWNTMFRGHALSSDPVSALKLYVCMISLGLLPNSYTFPF 140

Query: 129 LIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDGIELPD 188
           ++++C    A +EG+QIHGHVLK G   D +   +LI MY     LE+A +VFD     D
Sbjct: 141 VLKSCAKSKAFKEGQQIHGHVLKLGCDLDLYVHTSLISMYVQNGRLEDAHKVFDKSPHRD 200

Query: 189 VVSWTTLLTGYAQCGFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALFNRMRS 248
           VVS+T L+ GYA  G+++ A ++F+ +P +   SWNAMIS + +   + EA  LF  M  
Sbjct: 201 VVSYTALIKGYASRGYIENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMK 260

Query: 249 EKI--------------------------------------------VLDKYMAA----- 308
             +                                            ++D Y        
Sbjct: 261 TNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELET 320

Query: 309 ----------------------------------------------------SMLSACTG 368
                                                               S+L AC  
Sbjct: 321 ACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAH 380

Query: 369 LGALEQGMWIHRYIKK--SEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSWN 428
           LGA++ G WIH YI K    +   S L T+LIDMY KCG ++ A +VF  +  K +SSWN
Sbjct: 381 LGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWN 440

Query: 429 CMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFTQV 488
            MI G AMHG+ +A+ +LF  M    + PD+ITF+ +L+AC+HSG+++ GR+ F   TQ 
Sbjct: 441 AMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQD 500

Query: 489 YDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPDAGVLGAFVGACKIHGNVDMGEE 548
           Y + P+ EHYGCM+DL G +G+  EA  +I  M M PD  +  + + ACK+HGNV++GE 
Sbjct: 501 YKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGES 560

Query: 549 IGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKLMNDREVKKAAGFSMIELEGVVH 608
             + +I+++P N G YVLL N+YA AGRW+ VA+ R L+ND+ +KK  G S IE++ VVH
Sbjct: 561 FAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVH 620

Query: 609 EFIAGGRGHPEANEIYGKVKEMVECIRCVGY------VTEEEEEVEKDNPVYYHSEKLAV 619
           EFI G + HP   EIYG ++EM   +   G+      V +E EE  K+  + +HSEKLA+
Sbjct: 621 EFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAI 680

BLAST of Csor.00g214140 vs. TAIR 10
Match: AT5G06540.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 516.2 bits (1328), Expect = 3.8e-146
Identity = 250/604 (41.39%), Postives = 383/604 (63.41%), Query Frame = 0

Query: 27  LKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKFCAVSKNGD-----LDYALLLFKTIP 86
           L+SCSS S+LK  H  ++R  L +D     RL+  C      +     L YA  +F  I 
Sbjct: 19  LQSCSSFSDLKIIHGFLLRTHLISDVFVASRLLALCVDDSTFNKPTNLLGYAYGIFSQIQ 78

Query: 87  NPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKCVLPNKFTFPSLIRACCIDNAIEEGKQ 146
           NP+ +++N LIR +     P      Y +ML   + P+  TFP LI+A      +  G+Q
Sbjct: 79  NPNLFVFNLLIRCFSTGAEPSKAFGFYTQMLKSRIWPDNITFPFLIKASSEMECVLVGEQ 138

Query: 147 IHGHVLKFGFRTDRFSQNNLIHMYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQCGF 206
            H  +++FGF+ D + +N+L+HMYAN   +  A R+F  +   DVVSWT+++ GY +CG 
Sbjct: 139 THSQIVRFGFQNDVYVENSLVHMYANCGFIAAAGRIFGQMGFRDVVSWTSMVAGYCKCGM 198

Query: 207 LDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALFNRMRSEKIVLDKYMAASMLSA 266
           ++ A ++F+ MP R+  +W+ MI+ + +NN F +A  LF  M+ E +V ++ +  S++S+
Sbjct: 199 VENAREMFDEMPHRNLFTWSIMINGYAKNNCFEKAIDLFEFMKREGVVANETVMVSVISS 258

Query: 267 CTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGISSW 326
           C  LGALE G   + Y+ KS + ++  L T L+DM+ +CG +++A  VF  LPE    SW
Sbjct: 259 CAHLGALEFGERAYEYVVKSHMTVNLILGTALVDMFWRCGDIEKAIHVFEGLPETDSLSW 318

Query: 327 NCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNHFTQ 386
           + +I G+A+HG    A+  F  M +    P ++TF  VL+AC+H GLVEKG   + +  +
Sbjct: 319 SSIIKGLAVHGHAHKAMHYFSQMISLGFIPRDVTFTAVLSACSHGGLVEKGLEIYENMKK 378

Query: 387 VYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPDAGVLGAFVGACKIHGNVDMGE 446
            + I+PR EHYGC+VD+ GRAG L EA N I +M + P+A +LGA +GACKI+ N ++ E
Sbjct: 379 DHGIEPRLEHYGCIVDMLGRAGKLAEAENFILKMHVKPNAPILGALLGACKIYKNTEVAE 438

Query: 447 EIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKLMNDREVKKAAGFSMIELEGVV 506
            +G  +I++ P +SG YVLL N+YA AG+WD +  +R +M ++ VKK  G+S+IE++G +
Sbjct: 439 RVGNMLIKVKPEHSGYYVLLSNIYACAGQWDKIESLRDMMKEKLVKKPPGWSLIEIDGKI 498

Query: 507 HEFIAG-GRGHPEANEIYGKVKEMVECIRCVGYVTE------EEEEVEKDNPVYYHSEKL 566
           ++F  G  + HPE  +I  K +E++  IR +GY         + +E EK++ ++ HSEKL
Sbjct: 499 NKFTMGDDQKHPEMGKIRRKWEEILGKIRLIGYKGNTGDAFFDVDEEEKESSIHMHSEKL 558

Query: 567 AVAYGLLKTRAGETLRITKNLRVCKDCHQALKLVSKVFQRKIVVRDRNRFHHFANGECSC 619
           A+AYG++KT+ G T+RI KNLRVC+DCH   KL+S+V+ R+++VRDRNRFHHF NG CSC
Sbjct: 559 AIAYGMMKTKPGTTIRIVKNLRVCEDCHTVTKLISEVYGRELIVRDRNRFHHFRNGVCSC 618

BLAST of Csor.00g214140 vs. TAIR 10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 487.6 bits (1254), Expect = 1.4e-137
Identity = 259/727 (35.63%), Postives = 395/727 (54.33%), Query Frame = 0

Query: 5   LALQPSASPINSPKVH---TSPIHGLKSCSSMSELKQYHSQIIRLGLSTDNDAMGRLVKF 64
           L   P+ S  N P  +   +  I  ++ C S+ +LKQ H  +IR G  +D  +  +L   
Sbjct: 12  LPRHPNFSNPNQPTTNNERSRHISLIERCVSLRQLKQTHGHMIRTGTFSDPYSASKLFAM 71

Query: 65  CAVSKNGDLDYALLLFKTIPNPDAYIYNTLIRGYLQLQFPRACLLLYLEMLHKC-VLPNK 124
            A+S    L+YA  +F  IP P+++ +NTLIR Y     P   +  +L+M+ +    PNK
Sbjct: 72  AALSSFASLEYARKVFDEIPKPNSFAWNTLIRAYASGPDPVLSIWAFLDMVSESQCYPNK 131

Query: 125 FTFPSLIRACCIDNAIEEGKQIHGHVLKFGFRTDRFSQNNLIH----------------- 184
           +TFP LI+A    +++  G+ +HG  +K    +D F  N+LIH                 
Sbjct: 132 YTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKVFTT 191

Query: 185 ------------------------------------------------------------ 244
                                                                       
Sbjct: 192 IKEKDVVSWNSMINGFVQKGSPDKALELFKKMESEDVKASHVTMVGVLSACAKIRNLEFG 251

Query: 245 ------------------------MYANFQSLEEARRVFDGIELPDVVSWTTLLTGYAQC 304
                                   MY    S+E+A+R+FD +E  D V+WTT+L GYA  
Sbjct: 252 RQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTTMLDGYAIS 311

Query: 305 GFLDEALQVFESMPERSSASWNAMISSFVQNNRFHEAFALFNRMRSEK-IVLDKYMAASM 364
              + A +V  SMP++   +WNA+IS++ QN + +EA  +F+ ++ +K + L++    S 
Sbjct: 312 EDYEAAREVLNSMPQKDIVAWNALISAYEQNGKPNEALIVFHELQLQKNMKLNQITLVST 371

Query: 365 LSACTGLGALEQGMWIHRYIKKSEIKLDSKLATTLIDMYCKCGCLDRAFEVFTQLPEKGI 424
           LSAC  +GALE G WIH YIKK  I+++  + + LI MY KCG L+++ EVF  + ++ +
Sbjct: 372 LSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEKRDV 431

Query: 425 SSWNCMIGGMAMHGKGEAAIELFKDMETKMVTPDNITFLNVLNACAHSGLVEKGRYYFNH 484
             W+ MIGG+AMHG G  A+++F  M+   V P+ +TF NV  AC+H+GLV++    F+ 
Sbjct: 432 FVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESLFHQ 491

Query: 485 FTQVYDIKPRTEHYGCMVDLYGRAGMLDEAMNLIREMPMSPDAGVLGAFVGACKIHGNVD 544
               Y I P  +HY C+VD+ GR+G L++A+  I  MP+ P   V GA +GACKIH N++
Sbjct: 492 MESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHANLN 551

Query: 545 MGEEIGKRVIELDPSNSGRYVLLGNLYAEAGRWDGVAEVRKLMNDREVKKAAGFSMIELE 604
           + E    R++EL+P N G +VLL N+YA+ G+W+ V+E+RK M    +KK  G S IE++
Sbjct: 552 LAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSIEID 611

Query: 605 GVVHEFIAGGRGHPEANEIYGKVKEMVECIRCVGYVTE-------EEEEVEKDNPVYYHS 619
           G++HEF++G   HP + ++YGK+ E++E ++  GY  E        EEE  K+  +  HS
Sbjct: 612 GMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLNLHS 671

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FJY71.2e-15745.33Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Q9FI803.9e-14840.92Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Q9LN013.3e-14738.23Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9FG165.3e-14541.39Pentatricopeptide repeat-containing protein At5g06540 OS=Arabidopsis thaliana OX... [more]
O823802.0e-13635.63Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
KAG6590058.10.0100.00Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022960820.10.099.68pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita moschata][more]
KAG7023724.10.099.19Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_023515687.10.097.58pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita pepo subsp... [more]
XP_022987211.10.096.45pentatricopeptide repeat-containing protein At5g66520-like [Cucurbita maxima] >X... [more]
Match NameE-valueIdentityDescription
A0A6J1HA700.099.68pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita moschata... [more]
A0A6J1JDI60.096.45pentatricopeptide repeat-containing protein At5g66520-like OS=Cucurbita maxima O... [more]
A0A6J1CAN40.085.69uncharacterized protein LOC111009921 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1CAX30.085.69uncharacterized protein LOC111009921 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A5D3CGC50.081.91Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT5G66520.18.6e-15945.33Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G48910.12.8e-14940.92Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G08070.12.3e-14838.23Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G06540.13.8e-14641.39Pentatricopeptide repeat (PPR) superfamily protein [more]
AT2G29760.11.4e-13735.63Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 83..131
e-value: 6.2E-8
score: 32.7
coord: 317..364
e-value: 9.7E-11
score: 41.7
coord: 216..262
e-value: 8.7E-8
score: 32.3
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 391..416
e-value: 6.2E-5
score: 23.0
coord: 187..215
e-value: 1.5E-7
score: 31.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 291..319
e-value: 7.3E-5
score: 20.7
coord: 187..215
e-value: 7.4E-6
score: 23.8
coord: 392..415
e-value: 5.3E-4
score: 18.0
coord: 320..350
e-value: 2.2E-5
score: 22.4
coord: 219..251
e-value: 2.2E-9
score: 34.9
coord: 86..119
e-value: 3.7E-4
score: 18.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 185..219
score: 11.553267
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 84..118
score: 9.021208
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 388..418
score: 8.527949
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 286..320
score: 10.577712
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 27..169
e-value: 4.0E-19
score: 70.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 292..539
e-value: 1.3E-38
score: 135.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 171..280
e-value: 3.7E-25
score: 91.0
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 196..471
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 491..607
e-value: 1.8E-30
score: 105.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..19
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..22
NoneNo IPR availablePANTHERPTHR47928:SF91OS06G0231400 PROTEINcoord: 24..592
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 24..592

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csor.00g214140.m01Csor.00g214140.m01mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding