CsGy3G024330.2 (mRNA) Cucumber (Gy14) v2.1

Overview
Name: CsGy3G024330.2
Type: mRNA
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Pentatricopeptide repeat-containing protein
Location: Gy14Chr3: 24356219 .. 24359120 (+)
Sequence length: 2902
RNA-Seq Expression: CsGy3G024330.2
Synteny: CsGy3G024330.2
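
The sequence length can be cross-checked against the location, assuming the coordinates are 1-based and inclusive (a common convention for genome annotations; this page does not state it). A minimal sketch, where `feature_length` is a hypothetical helper name:

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: Gy14Chr3: 24356219 .. 24359120 (+)
print(feature_length(24356219, 24359120))  # 2902, matching the Sequence length field
```
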
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
CCGAAGGAACCAAACAAAATCCATATTATCCTAAAAAAAAAAAAACCCATTTGGCAAATCCACTCTCATATTAATTTCTTTCTCTTCAATCCTCTCATTTCCAGTTCAAAAATGGAGCACATTTACTGCCACTCCTCTCTGTAAGGTAATTCTGTTCTTCATTTCTCTTTTCAACTTCTTCGGATTGTGATGGGAAGTCATCCATAAGGACTCTGGAATTCATTGCCATTTGAATTCTGGGATTATCTTATCTTTTCTCCTTCATTTCTATTGACGATGCATGCTCTTAGTAATTGGTGTCCAACTAGTTGCTCCGGCGTTGAATTAGGTTCTTATTCTGTAGTTCATAGGTCATGGAAAAGGGTAAAGAGTTTTGGTTTTTCTGATTGTCGTTGTGGAAATTGGGGTTTCTCTTTGATTTCCTTTAACTTGAGTGTTTTGAGAAGTGGCTTTTGCTACGAGAATTCGAGATTTGTGTGTAATTGTGAGTTCCGCCATGGCTGTTCTAAGCTTAGAGTTGTTCCATTAATGAAGACGAACAGGAATTCTCTTGGGGCTTTCTGTTTATCTGCTTGGGCTGTTGAACAACCAACAATTGATGATGAAATCACTAGGGTTGAATCAAATTCTAGAGATGGTTTGCCTGAGAGAGGATTAGATTGGGATGATGATGACGACGGCAAAGTTAATGGTGAAAATAGCCATGGAGGAGGGAGCTTTAAAGATGAAGGAGAATTGGAGGGGGTGGGAGATGTTAGGGTTGATGTTCGTGCACTAGCGGCTCAGTTGCAGCTTGCTCGTACGGCAGATGATGTTGACCAGGTTCTCAAGGACATGGTTGAATTGCCTCTTCAAGTTTTCTCATCCATGATTAGGGGTTTTGGGAGAGACAGAAGGTTGGAGTGTGCAGTAGCTCTTGTTGACTGGCTTAAGAGAAAGAAGATTGAAACTAATGGTCGTATTGCACCAAACTTGTTCATATACAATAGTCTTCTTGGTGCAGTTAAGCAATCGGGAGAGCTTTCGAGAATGGAAAATGTCTTGACTGATATGGCACAGGAAGGAATTGTTTCAAATGTTGTTACGTACAACACGATTATGTCCATTTACTTGGAACAAGGACTAGCAATGAAAGCTCTTGGCATTCTTGAAGAGATGCCGAAGAAAGGCCTAACTCTGTCTCCCGTATCCTACTCTACGGCCTTACGAGCATACCGAAGGATGAAAGATGGGAATGGTGCTTTAAAGTTCATGGTTGAGTTGAGAGAAAGATATCGTAATGGTGAAATAGCGAAAGATGATAATGTAGATTGGGCTAATGAGTTCTTGAAGCTCGAAAACTTTACAAGACGTGTTTGCTACCAAGTAATGAGGATTTGGCTTGTGAAGGGTGACTGTGCAAGCACGAAGGTGTTGCAACTTCTAATGGAAATGGATAAAGCAGGACTGTCACTTGATCGTGCTGAAGCGGAGCGACTTATTTGGGCTTGTACGTGTGCAGAACACTATAATGTAGCAAAAGAATTGTACTTCAGGATAAGAGAAAAGCAATGTGGTATAAGCTTATCTGTTTGTAATCATGTGATTTGGTTGATGGGGAAAGCTAAGAAATGGTGGGCAGCATTGGAGATTTATGAAGATTTATTAGAGAAAGGACCAAAGCCAAACAACATGTCATATGAACTAATTGTCTCTCACTTCAATGTTCTTCTCACAGCCGCAAAGAAAAGAGGAATTTGGAGATGGGGTGTTCGGTTACTCAACAAAATGGAAGAGAAAGGTCTTAGACCTGGAAGTCGGGAATGGAACGCTGTTCTTGTTGCATGTTCCAGAGCTGCAGAAACTTCTGCAGCTATAGATATTTTTAGGAAAATGGTTGAACAAGGTGAAAAACCCACTGTCCTTTCATATGGGGCATTACTTAGTGCTCTGGAAAAGGGAAAGCTATATGATGAAGCACGCAGTGTCTGGGATCATATGATTAGAGTTGGGGTGGAGCC
AAACATCTATGCTTATACGACTATGGCGTCAGTTTTCACTGGTCAAGGAAAATTTAATATGGTTGAAGTCACTATCAATGATATGGTTGCATCAGGCATTGAGCCAACAGTCGTCACATACAATGCAATAATCACGGGATGTGTCCGTAATGGGATGAGCAGTGTAGCTTATGAGTGGTTTCACCGCATGAAAGTTAGAAACATCTCTCCGAACGAGGTGAGTTACGAGTTACTCATTGAGGCCCTTGCAAAGGAAGGTAAACCAAGGCTGGCTTATGAGTTATACATGAGGGCTAAGGATGAGGGTCTCAATCTTTCTTCTAAAGTGTATGATGCAGTAATTGAATCCTCTCAACTTTATGGAGCCTCCGTTAATATAAAATTGTTAGGGTTGCGGCCACCTGACAGGAACAAGAGTTCATAGGTTAAAAAGAAGACTTCAAAGAATGTTTGTAAAGCTGCAGATGTTCCTAGAAAAAGCAGGAACAATTTGAAAGAAGGAAGCTAGTGGTTGATGATCCCTCCATCAGTACTCTGATCTCTTGATTTGTGCATAAAGTATTTGAAGCGACTCAGGAGGAAGCCTATGACTTCTGATTACCCGATCCGAGTGCATATGGAGCTCAGAAACTAAAAAGTCAATTGAAAGTGATGTGGCTGATCAAGGTGAAACTCCTGGGAAATCTGCAGTATCCAGTAAGATACTTTTCTCTGATATGTATGGGAGTTTCACGGTAAAGAAAAGTTAAAACACAATCTGTGATGTAGAAAGCCAAAGTAATGAAAGCATATGAGAATTTGTAGCTTTTTCCTTTTTATCTATAAAATGGTGTAAAGATGAAAGGTCTCTTGATCTAATCCCTTTGCAATGGAAACTTTTAATAAATGAGTTTTGAGTATTT

mRNA sequence

CCGAAGGAACCAAACAAAATCCATATTATCCTAAAAAAAAAAAAACCCATTTGGCAAATCCACTCTCATATTAATTTCTTTCTCTTCAATCCTCTCATTTCCAGTTCAAAAATGGAGCACATTTACTGCCACTCCTCTCTGTAAGGTAATTCTGTTCTTCATTTCTCTTTTCAACTTCTTCGGATTGTGATGGGAAGTCATCCATAAGGACTCTGGAATTCATTGCCATTTGAATTCTGGGATTATCTTATCTTTTCTCCTTCATTTCTATTGACGATGCATGCTCTTAGTAATTGGTGTCCAACTAGTTGCTCCGGCGTTGAATTAGGTTCTTATTCTGTAGTTCATAGGTCATGGAAAAGGGTAAAGAGTTTTGGTTTTTCTGATTGTCGTTGTGGAAATTGGGGTTTCTCTTTGATTTCCTTTAACTTGAGTGTTTTGAGAAGTGGCTTTTGCTACGAGAATTCGAGATTTGTGTGTAATTGTGAGTTCCGCCATGGCTGTTCTAAGCTTAGAGTTGTTCCATTAATGAAGACGAACAGGAATTCTCTTGGGGCTTTCTGTTTATCTGCTTGGGCTGTTGAACAACCAACAATTGATGATGAAATCACTAGGGTTGAATCAAATTCTAGAGATGGTTTGCCTGAGAGAGGATTAGATTGGGATGATGATGACGACGGCAAAGTTAATGGTGAAAATAGCCATGGAGGAGGGAGCTTTAAAGATGAAGGAGAATTGGAGGGGGTGGGAGATGTTAGGGTTGATGTTCGTGCACTAGCGGCTCAGTTGCAGCTTGCTCGTACGGCAGATGATGTTGACCAGGTTCTCAAGGACATGGTTGAATTGCCTCTTCAAGTTTTCTCATCCATGATTAGGGGTTTTGGGAGAGACAGAAGGTTGGAGTGTGCAGTAGCTCTTGTTGACTGGCTTAAGAGAAAGAAGATTGAAACTAATGGTCGTATTGCACCAAACTTGTTCATATACAATAGTCTTCTTGGTGCAGTTAAGCAATCGGGAGAGCTTTCGAGAATGGAAAATGTCTTGACTGATATGGCACAGGAAGGAATTGTTTCAAATGTTGTTACGTACAACACGATTATGTCCATTTACTTGGAACAAGGACTAGCAATGAAAGCTCTTGGCATTCTTGAAGAGATGCCGAAGAAAGGCCTAACTCTGTCTCCCGTATCCTACTCTACGGCCTTACGAGCATACCGAAGGATGAAAGATGGGAATGGTGCTTTAAAGTTCATGGTTGAGTTGAGAGAAAGATATCGTAATGGTGAAATAGCGAAAGATGATAATGTAGATTGGGCTAATGAGTTCTTGAAGCTCGAAAACTTTACAAGACGTGTTTGCTACCAAGTAATGAGGATTTGGCTTGTGAAGGGTGACTGTGCAAGCACGAAGGTGTTGCAACTTCTAATGGAAATGGATAAAGCAGGACTGTCACTTGATCGTGCTGAAGCGGAGCGACTTATTTGGGCTTGTACGTGTGCAGAACACTATAATGTAGCAAAAGAATTGTACTTCAGGATAAGAGAAAAGCAATGTGGTATAAGCTTATCTGTTTGTAATCATGTGATTTGGTTGATGGGGAAAGCTAAGAAATGGTGGGCAGCATTGGAGATTTATGAAGATTTATTAGAGAAAGGACCAAAGCCAAACAACATGTCATATGAACTAATTGTCTCTCACTTCAATGTTCTTCTCACAGCCGCAAAGAAAAGAGGAATTTGGAGATGGGGTGTTCGGTTACTCAACAAAATGGAAGAGAAAGGTCTTAGACCTGGAAGTCGGGAATGGAACGCTGTTCTTGTTGCATGTTCCAGAGCTGCAGAAACTTCTGCAGCTATAGATATTTTTAGGAAAATGGTTGAACAAGGTGAAAAACCCACTGTCCTTTCATATGGGGCATTACTTAGTGCTCTGGAAAAGGGAAAGCTATATGATGAAGCACGCAGTGTCTGGGATCATATGATTAGAGTTGGGGTGGAGCC
AAACATCTATGCTTATACGACTATGGCGTCAGTTTTCACTGGTCAAGGAAAATTTAATATGGTTGAAGTCACTATCAATGATATGGTTGCATCAGGCATTGAGCCAACAGTCGTCACATACAATGCAATAATCACGGGATGTGTCCGTAATGGGATGAGCAGTGTAGCTTATGAGTGGTTTCACCGCATGAAAGTTAGAAACATCTCTCCGAACGAGGTGAGTTACGAGTTACTCATTGAGGCCCTTGCAAAGGAAGGTAAACCAAGGCTGGCTTATGAGTTATACATGAGGGCTAAGGATGAGGGTCTCAATCTTTCTTCTAAAGTGTATGATGCAGTAATTGAATCCTCTCAACTTTATGGAGCCTCCGTTAATATAAAATTGTTAGGGTTGCGGCCACCTGACAGGAACAAGAGTTCATAGGTTAAAAAGAAGACTTCAAAGAATGTTTGTAAAGCTGCAGATGTTCCTAGAAAAAGCAGGAACAATTTGAAAGAAGGAAGCTAGTGGTTGATGATCCCTCCATCAGTACTCTGATCTCTTGATTTGTGCATAAAGTATTTGAAGCGACTCAGGAGGAAGCCTATGACTTCTGATTACCCGATCCGAGTGCATATGGAGCTCAGAAACTAAAAAGTCAATTGAAAGTGATGTGGCTGATCAAGGTGAAACTCCTGGGAAATCTGCAGTATCCAGTAAGATACTTTTCTCTGATATGTATGGGAGTTTCACGGTAAAGAAAAGTTAAAACACAATCTGTGATGTAGAAAGCCAAAGTAATGAAAGCATATGAGAATTTGTAGCTTTTTCCTTTTTATCTATAAAATGGTGTAAAGATGAAAGGTCTCTTGATCTAATCCCTTTGCAATGGAAACTTTTAATAAATGAGTTTTGAGTATTT

Coding sequence (CDS)

ATGCATGCTCTTAGTAATTGGTGTCCAACTAGTTGCTCCGGCGTTGAATTAGGTTCTTATTCTGTAGTTCATAGGTCATGGAAAAGGGTAAAGAGTTTTGGTTTTTCTGATTGTCGTTGTGGAAATTGGGGTTTCTCTTTGATTTCCTTTAACTTGAGTGTTTTGAGAAGTGGCTTTTGCTACGAGAATTCGAGATTTGTGTGTAATTGTGAGTTCCGCCATGGCTGTTCTAAGCTTAGAGTTGTTCCATTAATGAAGACGAACAGGAATTCTCTTGGGGCTTTCTGTTTATCTGCTTGGGCTGTTGAACAACCAACAATTGATGATGAAATCACTAGGGTTGAATCAAATTCTAGAGATGGTTTGCCTGAGAGAGGATTAGATTGGGATGATGATGACGACGGCAAAGTTAATGGTGAAAATAGCCATGGAGGAGGGAGCTTTAAAGATGAAGGAGAATTGGAGGGGGTGGGAGATGTTAGGGTTGATGTTCGTGCACTAGCGGCTCAGTTGCAGCTTGCTCGTACGGCAGATGATGTTGACCAGGTTCTCAAGGACATGGTTGAATTGCCTCTTCAAGTTTTCTCATCCATGATTAGGGGTTTTGGGAGAGACAGAAGGTTGGAGTGTGCAGTAGCTCTTGTTGACTGGCTTAAGAGAAAGAAGATTGAAACTAATGGTCGTATTGCACCAAACTTGTTCATATACAATAGTCTTCTTGGTGCAGTTAAGCAATCGGGAGAGCTTTCGAGAATGGAAAATGTCTTGACTGATATGGCACAGGAAGGAATTGTTTCAAATGTTGTTACGTACAACACGATTATGTCCATTTACTTGGAACAAGGACTAGCAATGAAAGCTCTTGGCATTCTTGAAGAGATGCCGAAGAAAGGCCTAACTCTGTCTCCCGTATCCTACTCTACGGCCTTACGAGCATACCGAAGGATGAAAGATGGGAATGGTGCTTTAAAGTTCATGGTTGAGTTGAGAGAAAGATATCGTAATGGTGAAATAGCGAAAGATGATAATGTAGATTGGGCTAATGAGTTCTTGAAGCTCGAAAACTTTACAAGACGTGTTTGCTACCAAGTAATGAGGATTTGGCTTGTGAAGGGTGACTGTGCAAGCACGAAGGTGTTGCAACTTCTAATGGAAATGGATAAAGCAGGACTGTCACTTGATCGTGCTGAAGCGGAGCGACTTATTTGGGCTTGTACGTGTGCAGAACACTATAATGTAGCAAAAGAATTGTACTTCAGGATAAGAGAAAAGCAATGTGGTATAAGCTTATCTGTTTGTAATCATGTGATTTGGTTGATGGGGAAAGCTAAGAAATGGTGGGCAGCATTGGAGATTTATGAAGATTTATTAGAGAAAGGACCAAAGCCAAACAACATGTCATATGAACTAATTGTCTCTCACTTCAATGTTCTTCTCACAGCCGCAAAGAAAAGAGGAATTTGGAGATGGGGTGTTCGGTTACTCAACAAAATGGAAGAGAAAGGTCTTAGACCTGGAAGTCGGGAATGGAACGCTGTTCTTGTTGCATGTTCCAGAGCTGCAGAAACTTCTGCAGCTATAGATATTTTTAGGAAAATGGTTGAACAAGGTGAAAAACCCACTGTCCTTTCATATGGGGCATTACTTAGTGCTCTGGAAAAGGGAAAGCTATATGATGAAGCACGCAGTGTCTGGGATCATATGATTAGAGTTGGGGTGGAGCCAAACATCTATGCTTATACGACTATGGCGTCAGTTTTCACTGGTCAAGGAAAATTTAATATGGTTGAAGTCACTATCAATGATATGGTTGCATCAGGCATTGAGCCAACAGTCGTCACATACAATGCAATAATCACGGGATGTGTCCGTAATGGGATGAGCAGTGTAGCTTATGAGTGGTTTCACCGCATGAAAGTTAGAAACATCTCTCCGAACGAGGTGAGTTACGAGTTACTCATTGAGGCCCTTGCAAAGGAAGGTAAACCAAGGCTGGCTTA
TGAGTTATACATGAGGGCTAAGGATGAGGGTCTCAATCTTTCTTCTAAAGTGTATGATGCAGTAATTGAATCCTCTCAACTTTATGGAGCCTCCGTTAATATAAAATTGTTAGGGTTGCGGCCACCTGACAGGAACAAGAGTTCATAG

Protein sequence

MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFCYENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRDGLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDVDQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLLGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTLSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRVCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKEGKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS*
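
The protein sequence above appears to be the conceptual translation of the CDS under the standard genetic code. The sketch below, which assumes the standard code (NCBI translation table 1) and uses a hypothetical `translate` helper, reproduces the first and last residues from the first 21 and last 15 nucleotides of the CDS:

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (standard genetic code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGCATGCTCTTAGTAATTGG"))  # start of the CDS: MHALSNW
print(translate("AACAAGAGTTCATAG"))        # end of the CDS:   NKSS*
```

The trailing `*` in the protein sequence corresponds to the terminal TAG stop codon.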
Homology
BLAST of CsGy3G024330.2 vs. ExPASy Swiss-Prot
Match: Q9SNB7 (Protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=LPE1 PE=1 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 4.4e-220
Identity = 385/637 (60.44%), Positives = 484/637 (75.98%), Query Frame = 0

Query: 77  SKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRDGLPERGLDWDDDDDGK 136
           S  +V+ L +  R+ LG+     WA EQ                    R L+  +++   
Sbjct: 61  SNRKVLFLCEPKRSLLGSSFGVGWATEQ--------------------RELELGEEEVST 120

Query: 137 VNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDVDQVLKDMVELPLQVFS 196
            +  +++GG             ++RVDVR LA  L+ A+TADDVD VLKD  ELPLQVF 
Sbjct: 121 EDLSSANGGEK----------NNLRVDVRELAFSLRAAKTADDVDAVLKDKGELPLQVFC 180

Query: 197 SMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLLGAVKQSGELSRMENVL 256
           +MI+GFG+D+RL+ AVA+VDWLKRKK E+ G I PNLFIYNSLLGA++  GE    E +L
Sbjct: 181 AMIKGFGKDKRLKPAVAVVDWLKRKKSESGGVIGPNLFIYNSLLGAMRGFGE---AEKIL 240

Query: 257 TDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTLSPVSYSTALRAYRRM 316
            DM +EGIV N+VTYNT+M IY+E+G  +KALGIL+   +KG   +P++YSTAL  YRRM
Sbjct: 241 KDMEEEGIVPNIVTYNTLMVIYMEEGEFLKALGILDLTKEKGFEPNPITYSTALLVYRRM 300

Query: 317 KDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRVCYQVMRIWLVKGDCAS 376
           +DG GAL+F VELRE+Y   EI  D   DW  EF+KLENF  R+CYQVMR WLVK D  +
Sbjct: 301 EDGMGALEFFVELREKYAKREIGNDVGYDWEFEFVKLENFIGRICYQVMRRWLVKDDNWT 360

Query: 377 TKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFRIREKQCGISLSVCNHV 436
           T+VL+LL  MD AG+   R E ERLIWACT  EHY V KELY RIRE+   ISLSVCNH+
Sbjct: 361 TRVLKLLNAMDSAGVRPSREEHERLIWACTREEHYIVGKELYKRIRERFSEISLSVCNHL 420

Query: 437 IWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLN 496
           IWLMGKAKKWWAALEIYEDLL++GP+PNN+SYEL+VSHFN+LL+AA KRGIWRWGVRLLN
Sbjct: 421 IWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRWGVRLLN 480

Query: 497 KMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKPTVLSYGALLSALEKGK 556
           KME+KGL+P  R WNAVLVACS+A+ET+AAI IF+ MV+ GEKPTV+SYGALLSALEKGK
Sbjct: 481 KMEDKGLKPQRRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGK 540

Query: 557 LYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTINDMVASGIEPTVVTYN 616
           LYDEA  VW+HMI+VG+EPN+YAYTTMASV TGQ KFN+++  + +M + GIEP+VVT+N
Sbjct: 541 LYDEAFRVWNHMIKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSVVTFN 600

Query: 617 AIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKEGKPRLAYELYMRAKDE 676
           A+I+GC RNG+S VAYEWFHRMK  N+ PNE++YE+LIEALA + KPRLAYEL+++A++E
Sbjct: 601 AVISGCARNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHVKAQNE 660

Query: 677 GLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNK 714
           GL LSSK YDAV++S++ YGA++++ LLG RP  +N+
Sbjct: 661 GLKLSSKPYDAVVKSAETYGATIDLNLLGPRPDKKNR 664
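
The identity and positive-match percentages reported for each HSP are the match counts divided by the alignment length (gaps included). A small sketch, with `percent` as a hypothetical helper, reproduces the figures for the Q9SNB7 hit above:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Percentage of aligned columns that match, rounded to two decimals."""
    return round(100 * matches / aligned_length, 2)


# Values from the Q9SNB7 HSP: 385 identical and 484 positive columns out of 637.
print(percent(385, 637))  # 60.44
print(percent(484, 637))  # 75.98
```
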

BLAST of CsGy3G024330.2 vs. ExPASy Swiss-Prot
Match: O64624 (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 132.9 bits (333), Expect = 1.5e-29
Identity = 103/448 (22.99%), Positives = 192/448 (42.86%), Query Frame = 0

Query: 231 PNLFIYNSLLGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGI 290
           P    YN+LL    ++G  +   +VL +M +    ++ VTYN +++ Y+  G + +A G+
Sbjct: 314 PGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFSKEAAGV 373

Query: 291 LEEMPKKGLTLSPVSYSTALRAYRRMKDGNGALKFMVELRER-------YRNGEIAKDDN 350
           +E M KKG+  + ++Y+T + AY +    + ALK    ++E          N  ++    
Sbjct: 374 IEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCVPNTCTYNAVLSLLGK 433

Query: 351 VDWANEFLK-LENFTRRVCYQVMRIWLVKGDCASTK-----VLQLLMEMDKAGLSLDRAE 410
              +NE +K L +     C      W         K     V ++  EM   G   DR  
Sbjct: 434 KSRSNEMIKMLCDMKSNGCSPNRATWNTMLALCGNKGMDKFVNRVFREMKSCGFEPDRDT 493

Query: 411 AERLIWAC-TCAEHYNVAKELYFRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDL 470
              LI A   C    + +K +Y  +        ++  N ++  + +   W +   +  D+
Sbjct: 494 FNTLISAYGRCGSEVDASK-MYGEMTRAGFNACVTTYNALLNALARKGDWRSGENVISDM 553

Query: 471 LEKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVA 530
             KG KP   SY L       +L    K G +    R+ N+++E  + P       +L+A
Sbjct: 554 KSKGFKPTETSYSL-------MLQCYAKGGNYLGIERIENRIKEGQIFPSWMLLRTLLLA 613

Query: 531 ---CSRAAETSAAIDIFRKMVEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGV 590
              C   A +  A  +F+K    G KP ++ + ++LS   +  +YD+A  + + +   G+
Sbjct: 614 NFKCRALAGSERAFTLFKK---HGYKPDMVIFNSMLSIFTRNNMYDQAEGILESIREDGL 673

Query: 591 EPNIYAYTTMASVFTGQGKFNMVEVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYE 650
            P++  Y ++  ++  +G+    E  +  +  S ++P +V+YN +I G  R G+   A  
Sbjct: 674 SPDLVTYNSLMDMYVRRGECWKAEEILKTLEKSQLKPDLVSYNTVIKGFCRRGLMQEAVR 733

Query: 651 WFHRMKVRNISPNEVSYELLIEALAKEG 662
               M  R I P   +Y   +      G
Sbjct: 734 MLSEMTERGIRPCIFTYNTFVSGYTAMG 750
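
Each HSP lists a bit score followed by the raw score in parentheses, for example 132.9 bits (333). The two are related by S' = (λS − ln K) / ln 2. The sketch below assumes the Karlin-Altschul parameters commonly quoted for gapped BLOSUM62 searches (λ = 0.267, K = 0.041); the page does not state which parameters were used, so treat these values as an assumption that happens to reproduce the Swiss-Prot scores shown here.

```python
import math

# Assumed Karlin-Altschul parameters (gapped BLOSUM62, gap open 11, extend 1).
# These are not given on the page; they are illustrative defaults.
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: float) -> float:
    """Convert a raw alignment score S to a bit score S' = (lambda*S - ln K) / ln 2."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


print(round(bit_score(1976), 1))  # 765.8 (Q9SNB7 hit)
print(round(bit_score(333), 1))   # 132.9 (O64624 hit)
```
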

BLAST of CsGy3G024330.2 vs. ExPASy Swiss-Prot
Match: Q9LFC5 (Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX=3702 GN=At5g01110 PE=2 SV=1)

HSP 1 Score: 132.5 bits (332), Expect = 1.9e-29
Identity = 112/493 (22.72%), Positives = 222/493 (45.03%), Query Frame = 0

Query: 217 WLKRKKIETNGRIAPNLFIYNSLLGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMS 276
           W   ++I  +G +  N++  N ++ A+ + G++ ++   L+ + ++G+  ++VTYNT++S
Sbjct: 220 WGVYQEISRSG-VGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLIS 279

Query: 277 IYLEQGLAMKALGILEEMPKKGLTLSPVSYSTALRA------YRRMKD----------GN 336
            Y  +GL  +A  ++  MP KG +    +Y+T +        Y R K+            
Sbjct: 280 AYSSKGLMEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSP 339

Query: 337 GALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRVCY-QVMRIWLVKGDCASTKV 396
            +  +   L E  + G++ + + V   ++    +     VC+  +M ++   G+    K 
Sbjct: 340 DSTTYRSLLMEACKKGDVVETEKV--FSDMRSRDVVPDLVCFSSMMSLFTRSGNL--DKA 399

Query: 397 LQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFRIREKQCGISLSVCNHVIWL 456
           L     + +AGL  D      LI         +VA  L   + ++ C + +   N ++  
Sbjct: 400 LMYFNSVKEAGLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHG 459

Query: 457 MGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKME 516
           + K K    A +++ ++ E+   P+  SY L      +L+    K G  +  + L  KM+
Sbjct: 460 LCKRKMLGEADKLFNEMTERALFPD--SYTL-----TILIDGHCKLGNLQNAMELFQKMK 519

Query: 517 EKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKPTVLSYGALLSAL-EKGKLY 576
           EK +R     +N +L    +  +   A +I+  MV +   PT +SY  L++AL  KG L 
Sbjct: 520 EKRIRLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHL- 579

Query: 577 DEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTINDMVASGIEPTVVTYNAI 636
            EA  VWD MI   ++P +    +M   +   G  +  E  +  M++ G  P  ++YN +
Sbjct: 580 AEAFRVWDEMISKNIKPTVMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTL 639

Query: 637 ITGCVRNGMSSVAYEWFHRMKVR--NISPNEVSYELLIEALAKEGKPRLAYELYMRAKDE 690
           I G VR    S A+    +M+     + P+  +Y  ++    ++ + + A  +  +  + 
Sbjct: 640 IYGFVREENMSKAFGLVKKMEEEQGGLVPDVFTYNSILHGFCRQNQMKEAEVVLRKMIER 699

BLAST of CsGy3G024330.2 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 131.7 bits (330), Expect = 3.2e-29
Identity = 118/479 (24.63%), Positives = 221/479 (46.14%), Query Frame = 0

Query: 194 VFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLLGA-VKQSGELSRM 253
           VF  +++ + R   ++ A+++V        + +G   P +  YN++L A ++    +S  
Sbjct: 136 VFDLVVKSYSRLSLIDKALSIV-----HLAQAHG-FMPGVLSYNAVLDATIRSKRNISFA 195

Query: 254 ENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTLSPVSYSTALRA 313
           ENV  +M +  +  NV TYN ++  +   G    AL + ++M  KG   + V+Y+T +  
Sbjct: 196 ENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDG 255

Query: 314 Y---RRMKDG-----NGALKFMVELRERYR---NGEIAKDDNVDWANEFLKLENFTRRVC 373
           Y   R++ DG     + ALK +      Y    NG + ++  +   +  L   N      
Sbjct: 256 YCKLRKIDDGFKLLRSMALKGLEPNLISYNVVING-LCREGRMKEVSFVLTEMNRRGYSL 315

Query: 374 YQVMRIWLVKGDCASTKVLQLLM---EMDKAGLSLDRAEAERLIWACTCAEHYNVAKELY 433
            +V    L+KG C      Q L+   EM + GL+        LI +   A + N A E  
Sbjct: 316 DEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFL 375

Query: 434 FRIREKQCGISLSVCNHVIWLMGKAKKWW--AALEIYEDLLEKGPKPNNMSYELIVSHFN 493
            ++R +  G+  +   +   + G ++K +   A  +  ++ + G  P+ ++Y       N
Sbjct: 376 DQMRVR--GLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTY-------N 435

Query: 494 VLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQ 553
            L+      G     + +L  M+EKGL P    ++ VL    R+ +   A+ + R+MVE+
Sbjct: 436 ALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEK 495

Query: 554 GEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMV 613
           G KP  ++Y +L+    + +   EA  +++ M+RVG+ P+ + YT + + +  +G     
Sbjct: 496 GIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKA 555

Query: 614 EVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIE 656
               N+MV  G+ P VVTY+ +I G  +   +  A     ++      P++V+Y  LIE
Sbjct: 556 LQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLIE 598

BLAST of CsGy3G024330.2 vs. ExPASy Swiss-Prot
Match: Q3EDF8 (Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX=3702 GN=At1g09900 PE=2 SV=1)

HSP 1 Score: 128.6 bits (322), Expect = 2.7e-28
Identity = 106/467 (22.70%), Positives = 203/467 (43.47%), Query Frame = 0

Query: 231 PNLFIYNSLLGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGI 290
           P++    +L+    + G+  +   +L  +   G V +V+TYN ++S Y + G    AL +
Sbjct: 135 PDIIPCTTLIRGFCRLGKTRKAAKILEILEGSGAVPDVITYNVMISGYCKAGEINNALSV 194

Query: 291 LEEMPKKGLTLSP--VSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWAN 350
           L+ M     ++SP  V+Y+T LR+       +G LK  +E+                   
Sbjct: 195 LDRM-----SVSPDVVTYNTILRSL----CDSGKLKQAMEV------------------- 254

Query: 351 EFLKLENFTRRVCYQ--VMRIWLVKGDCASTKV---LQLLMEMDKAGLSLDRAEAERLIW 410
               L+   +R CY   +    L++  C  + V   ++LL EM   G + D      L+ 
Sbjct: 255 ----LDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVN 314

Query: 411 ACTCAEHYNVAKELYFRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKP 470
                   + A +    +    C  ++   N ++  M    +W  A ++  D+L KG  P
Sbjct: 315 GICKEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSP 374

Query: 471 NNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAET 530
           +       V  FN+L+    ++G+    + +L KM + G +P S  +N +L    +  + 
Sbjct: 375 S-------VVTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKM 434

Query: 531 SAAIDIFRKMVEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTM 590
             AI+   +MV +G  P +++Y  +L+AL K    ++A  + + +   G  P +  Y T+
Sbjct: 435 DRAIEYLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTV 494

Query: 591 ASVFTGQGKFNMVEVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNI 650
                  GK       +++M A  ++P  +TY++++ G  R G    A ++FH  +   I
Sbjct: 495 IDGLAKAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGI 554

Query: 651 SPNEVSYELLIEALAKEGKPRLAYELYMRAKDEGLNLSSKVYDAVIE 691
            PN V++  ++  L K  +   A +  +   + G   +   Y  +IE
Sbjct: 555 RPNAVTFNSIMLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIE 562

BLAST of CsGy3G024330.2 vs. NCBI nr
Match: XP_011651578.1 (protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic [Cucumis sativus])

HSP 1 Score: 1436 bits (3718), Expect = 0.0
Identity = 715/715 (100.00%), Positives = 715/715 (100.00%), Query Frame = 0

Query: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60
           MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC
Sbjct: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60

Query: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120
           YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD
Sbjct: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120

Query: 121 GLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDV 180
           GLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDV
Sbjct: 121 GLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDV 180

Query: 181 DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL 240
           DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL
Sbjct: 181 DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL 240

Query: 241 GAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLT 300
           GAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLT
Sbjct: 241 GAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLT 300

Query: 301 LSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRV 360
           LSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRV
Sbjct: 301 LSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRV 360

Query: 361 CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFR 420
           CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFR
Sbjct: 361 CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFR 420

Query: 421 IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLT 480
           IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLT
Sbjct: 421 IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLT 480

Query: 481 AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKP 540
           AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKP
Sbjct: 481 AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKP 540

Query: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTI 600
           TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTI
Sbjct: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTI 600

Query: 601 NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE 660
           NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE
Sbjct: 601 NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE 660

Query: 661 GKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS 715
           GKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS
Sbjct: 661 GKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS 715

BLAST of CsGy3G024330.2 vs. NCBI nr
Match: KAA0066960.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK30239.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1369 bits (3544), Expect = 0.0
Identity = 679/715 (94.97%), Positives = 695/715 (97.20%), Query Frame = 0

Query: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60
           MH LSNWCPTSCSGV+LGSYSVVHRSWKR+K FGFSDC CGNWGFSLISFNLSVL SGFC
Sbjct: 1   MHVLSNWCPTSCSGVDLGSYSVVHRSWKRIKCFGFSDCCCGNWGFSLISFNLSVLGSGFC 60

Query: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120
           YENSRFVCNCEFRHG SKLRVVPLMK NRNSL A+CLSAW VEQPTI DE+ RVESNSRD
Sbjct: 61  YENSRFVCNCEFRHGYSKLRVVPLMKPNRNSLEAWCLSAWTVEQPTIGDELPRVESNSRD 120

Query: 121 GLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDV 180
           GLPER LDWD DDD  VNGENSHGGGSFKDEGE+EGVGDVRVDVRALAAQLQLARTADDV
Sbjct: 121 GLPERRLDWDGDDDDNVNGENSHGGGSFKDEGEMEGVGDVRVDVRALAAQLQLARTADDV 180

Query: 181 DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL 240
           DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL
Sbjct: 181 DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL 240

Query: 241 GAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLT 300
           GAVKQSGELS+MENVLT+MAQEGIVSNVVTYNTIMSIYLEQGLA KALGILEEMPKKGLT
Sbjct: 241 GAVKQSGELSKMENVLTEMAQEGIVSNVVTYNTIMSIYLEQGLATKALGILEEMPKKGLT 300

Query: 301 LSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRV 360
           LSPVSYSTALRAYR+MKDGNGAL+FMVELRERY NGEIAKDDNVDWANEFLKLENFTRRV
Sbjct: 301 LSPVSYSTALRAYRKMKDGNGALEFMVELRERYHNGEIAKDDNVDWANEFLKLENFTRRV 360

Query: 361 CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFR 420
           CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAE ERLIWACTCAEHYNVAKELY R
Sbjct: 361 CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEEERLIWACTCAEHYNVAKELYIR 420

Query: 421 IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLT 480
           IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYE+LLEKGPKPNNMSYELIVSHFNVLLT
Sbjct: 421 IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEELLEKGPKPNNMSYELIVSHFNVLLT 480

Query: 481 AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKP 540
           AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFR+MVEQGEKP
Sbjct: 481 AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRRMVEQGEKP 540

Query: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTI 600
           TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFT QGKFNMVEVTI
Sbjct: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTSQGKFNMVEVTI 600

Query: 601 NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE 660
           NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE
Sbjct: 601 NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE 660

Query: 661 GKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS 715
           GKPRLAYELY RAKDEGLNLSSK+YDAVIESSQLYGAS++I+LLGLRPPD+NKSS
Sbjct: 661 GKPRLAYELYRRAKDEGLNLSSKIYDAVIESSQLYGASIDIRLLGLRPPDKNKSS 715

BLAST of CsGy3G024330.2 vs. NCBI nr
Match: XP_038898205.1 (protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic [Benincasa hispida])

HSP 1 Score: 1335 bits (3456), Expect = 0.0
Identity = 663/716 (92.60%), Positives = 686/716 (95.81%), Query Frame = 0

Query: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60
           MH LSNWCPTS SGVELGSYSVVHRSW R+K FGFSDC CGN GFSLISFN SVLRSGFC
Sbjct: 1   MHVLSNWCPTSSSGVELGSYSVVHRSWNRIKCFGFSDCSCGNGGFSLISFNSSVLRSGFC 60

Query: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120
           YENS FVCNCEFRHGCSKL V  LMK  RNSLGA+CLSAWAVE+PTIDDE+ RVES+SRD
Sbjct: 61  YENSTFVCNCEFRHGCSKLGVASLMKPKRNSLGAWCLSAWAVEEPTIDDELARVESSSRD 120

Query: 121 GLPERGLDWDDDDD-GKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADD 180
           GLPER L+WDDD D   VNGENSHGGGSFKDE  +EG GDVRVDV ALAAQLQLARTADD
Sbjct: 121 GLPERSLEWDDDHDHDNVNGENSHGGGSFKDEEGMEGEGDVRVDVCALAAQLQLARTADD 180

Query: 181 VDQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSL 240
           VD+VLKD+ ELPLQVFSSMIRGFGRDRRLECAVALV+WLKRKKIETNGRI PNLF YNSL
Sbjct: 181 VDEVLKDVGELPLQVFSSMIRGFGRDRRLECAVALVEWLKRKKIETNGRIGPNLFTYNSL 240

Query: 241 LGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGL 300
           LGAVKQSGELS+MENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGL
Sbjct: 241 LGAVKQSGELSKMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGL 300

Query: 301 TLSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRR 360
           TLSPVSYSTALRAYRRMKDGNGALKFM+ELRERY NGEIAKDDNVDW NEFLKLENFTRR
Sbjct: 301 TLSPVSYSTALRAYRRMKDGNGALKFMIELRERYHNGEIAKDDNVDWTNEFLKLENFTRR 360

Query: 361 VCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYF 420
           VCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAE ERLIWACTCAEH+NVAKELY+
Sbjct: 361 VCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEQERLIWACTCAEHHNVAKELYY 420

Query: 421 RIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLL 480
           RIREKQCGISLSVCNHVIWLMGKAKKWWAALE+YEDLLEKGPKPNNMSYELIVSHFNVLL
Sbjct: 421 RIREKQCGISLSVCNHVIWLMGKAKKWWAALEVYEDLLEKGPKPNNMSYELIVSHFNVLL 480

Query: 481 TAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEK 540
           TAAKKRGIWRWGVRLLNKMEEKGLRPG REWNAVLVACSRAAETSAAIDIFR+MVEQGEK
Sbjct: 481 TAAKKRGIWRWGVRLLNKMEEKGLRPGRREWNAVLVACSRAAETSAAIDIFRRMVEQGEK 540

Query: 541 PTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVT 600
           PTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVT
Sbjct: 541 PTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVT 600

Query: 601 INDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAK 660
           I+DMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAK
Sbjct: 601 ISDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAK 660

Query: 661 EGKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS 715
           EGKPRLAYELYMRAKDEGLNLSSK+YDAVI+SSQLYGAS++I+LLGLRPPD+NKSS
Sbjct: 661 EGKPRLAYELYMRAKDEGLNLSSKIYDAVIQSSQLYGASIDIRLLGLRPPDKNKSS 716

BLAST of CsGy3G024330.2 vs. NCBI nr
Match: XP_022154192.1 (pentatricopeptide repeat-containing protein At3g46610 [Momordica charantia])

HSP 1 Score: 1231 bits (3185), Expect = 0.0
Identity = 618/722 (85.60%), Positives = 661/722 (91.55%), Query Frame = 0

Query: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60
           MHALSNWCPTS S VELGS  VV RS KR+K  GFSDC CGN GFSLISFNL V RSGFC
Sbjct: 1   MHALSNWCPTSSSKVELGSSCVVRRSGKRLKCVGFSDCCCGNGGFSLISFNLRVFRSGFC 60

Query: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120
           YENS+F C+CEFRHGCSKL V  LMK  RNSLGA+ LSAWAVEQPT+ +EI RVESNS D
Sbjct: 61  YENSKFDCSCEFRHGCSKLIVARLMKPKRNSLGAWFLSAWAVEQPTVGNEIVRVESNSED 120

Query: 121 GLPER-------GLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQL 180
            L ER       GLDWDD  +  VNGEN HGGG FKDE  +EG GDV VDVRALA +LQL
Sbjct: 121 DLAERSEGEGYGGLDWDDHHN--VNGENGHGGGDFKDEDGMEGEGDVWVDVRALAGRLQL 180

Query: 181 ARTADDVDQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNL 240
            RTADDV++VLKD+ ELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIET+GRIAPNL
Sbjct: 181 TRTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETDGRIAPNL 240

Query: 241 FIYNSLLGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEE 300
           FIYNSLLGAVKQS   S+ME+VL DMAQEGI SNV+TYNTIMSIYLEQGLAMKALGILEE
Sbjct: 241 FIYNSLLGAVKQSTVFSKMEDVLADMAQEGITSNVITYNTIMSIYLEQGLAMKALGILEE 300

Query: 301 MPKKGLTLSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKL 360
           MPKKGLT SPVSYST L+AYRRMKDGNGALKFM ELRE+YR+GE+AKDDNVDWA+EF+KL
Sbjct: 301 MPKKGLTPSPVSYSTGLQAYRRMKDGNGALKFMTELREKYRSGEMAKDDNVDWADEFMKL 360

Query: 361 ENFTRRVCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNV 420
           ENFT+RVCYQVMRIWLVKG  ASTKVLQLL+EMDKAGLSLDRAE ERLIWACTCAEH+NV
Sbjct: 361 ENFTKRVCYQVMRIWLVKGYSASTKVLQLLVEMDKAGLSLDRAEEERLIWACTCAEHHNV 420

Query: 421 AKELYFRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVS 480
           AKELY+RIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVS
Sbjct: 421 AKELYYRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVS 480

Query: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKM 540
           HFNVLLTAAKKRGIWRWGVRLLNKMEEKGL+PGSREWNAVLVACS+AAETSAAI+IFR+M
Sbjct: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRM 540

Query: 541 VEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKF 600
           VEQGEKPT+LSYGALLSALEKGKLYDEARSVWDHMI+VGV+PNIYAYTTMASVFTGQGKF
Sbjct: 541 VEQGEKPTILSYGALLSALEKGKLYDEARSVWDHMIKVGVKPNIYAYTTMASVFTGQGKF 600

Query: 601 NMVEVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660
           NMVEVTINDMV+SGIEPTVVTYNAIITGCVRNG+SSVAYEWFHRMKVRNISPNEVSYELL
Sbjct: 601 NMVEVTINDMVSSGIEPTVVTYNAIITGCVRNGLSSVAYEWFHRMKVRNISPNEVSYELL 660

Query: 661 IEALAKEGKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNK 715
           IEALAKEGKPRLAYELY+RAK++ LNLSSK YDAVI+SSQ+YGAS++I+ LG  PPD NK
Sbjct: 661 IEALAKEGKPRLAYELYLRAKNDSLNLSSKTYDAVIQSSQVYGASIDIRALGSPPPDTNK 720

BLAST of CsGy3G024330.2 vs. NCBI nr
Match: XP_022934968.1 (pentatricopeptide repeat-containing protein At3g46610-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 1217 bits (3149), Expect = 0.0
Identity = 607/715 (84.90%), Positives = 653/715 (91.33%), Query Frame = 0

Query: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60
           MH  SNWCP S SGVELGSYSVVH SWKR+   GFSD   GN    LISFN SVLRSGFC
Sbjct: 1   MHVHSNWCPISSSGVELGSYSVVHSSWKRINRVGFSDSCYGNGNLYLISFNFSVLRSGFC 60

Query: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120
            E SRF C  EFRHGCSKLRV PLMK  RNSLGA+ L AWAVEQPTIDDEI RVESNSRD
Sbjct: 61  CETSRFECIREFRHGCSKLRVAPLMKPKRNSLGAWFLFAWAVEQPTIDDEIARVESNSRD 120

Query: 121 GLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDV 180
            LPE  LDWD  D G VN ENSHG G+FKDE  +EG GDVRVDVRALA +LQLARTADDV
Sbjct: 121 DLPESSLDWDVYDPGNVNSENSHGRGNFKDEEGMEGEGDVRVDVRALARRLQLARTADDV 180

Query: 181 DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL 240
           +++LKD+  LPLQVFSS+IRGFGR+RRLECAVALV+WLK+KKIETNGRIAPNLFIYNSLL
Sbjct: 181 EELLKDVGVLPLQVFSSIIRGFGRNRRLECAVALVEWLKKKKIETNGRIAPNLFIYNSLL 240

Query: 241 GAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLT 300
           GAVKQSGE S+ME+VLTDMAQEGIVSNVVTYNTIMSIYLEQGLA+KALGILEEMP+KGLT
Sbjct: 241 GAVKQSGEFSKMEDVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAIKALGILEEMPRKGLT 300

Query: 301 LSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRV 360
             PVSYSTAL+AYRRM DGNGALKFM+ELRERYRNGE+ KDDNVDWA++FLKLE FTRRV
Sbjct: 301 PCPVSYSTALQAYRRMNDGNGALKFMIELRERYRNGELVKDDNVDWADKFLKLEKFTRRV 360

Query: 361 CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFR 420
           CYQVMRIWLVK D A+TKVLQLLMEMDKAGLSLDR E ERLIWACTCAEH+NVAKELY+R
Sbjct: 361 CYQVMRIWLVKDDPANTKVLQLLMEMDKAGLSLDRVEEERLIWACTCAEHHNVAKELYYR 420

Query: 421 IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLT 480
           IREKQC ISLSVCNHVIWL GKAKKWWAALEIYEDLLEKGPKPNN+S ELIVSHFNVLLT
Sbjct: 421 IREKQCSISLSVCNHVIWLTGKAKKWWAALEIYEDLLEKGPKPNNLSNELIVSHFNVLLT 480

Query: 481 AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKP 540
           AAKKRGIWRWGVRLL+KMEEKGL+PG REWNAVLVACSRAAETSAAIDIFR+MVE+GEKP
Sbjct: 481 AAKKRGIWRWGVRLLDKMEEKGLKPGIREWNAVLVACSRAAETSAAIDIFRRMVEKGEKP 540

Query: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTI 600
           TVLSYGALLSALEKGKLYDEARSVWDHMI+VGVEPNIYAYTTM S+FTGQGKFNMVEVT+
Sbjct: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIKVGVEPNIYAYTTMTSIFTGQGKFNMVEVTL 600

Query: 601 NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE 660
           NDMV SGIEPTVVTYNAIITGCVRNGMS+VAYEWFHRMK RNISP+EVSYELL+EALAKE
Sbjct: 601 NDMVTSGIEPTVVTYNAIITGCVRNGMSTVAYEWFHRMKARNISPDEVSYELLVEALAKE 660

Query: 661 GKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS 715
           GKPRLAYELY+ AKDEGLNLSSK+YDAVI+SSQ++GAS++I+LLG RP ++NKSS
Sbjct: 661 GKPRLAYELYLSAKDEGLNLSSKIYDAVIQSSQVHGASIDIRLLGPRPLEKNKSS 715

BLAST of CsGy3G024330.2 vs. ExPASy TrEMBL
Match: A0A0A0LB88 (PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G595200 PE=3 SV=1)

HSP 1 Score: 1436 bits (3718), Expect = 0.0
Identity = 715/715 (100.00%), Positives = 715/715 (100.00%), Query Frame = 0

Query: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60
           MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC
Sbjct: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60

Query: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120
           YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD
Sbjct: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120

Query: 121 GLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDV 180
           GLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDV
Sbjct: 121 GLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDV 180

Query: 181 DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL 240
           DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL
Sbjct: 181 DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL 240

Query: 241 GAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLT 300
           GAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLT
Sbjct: 241 GAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLT 300

Query: 301 LSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRV 360
           LSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRV
Sbjct: 301 LSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRV 360

Query: 361 CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFR 420
           CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFR
Sbjct: 361 CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFR 420

Query: 421 IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLT 480
           IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLT
Sbjct: 421 IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLT 480

Query: 481 AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKP 540
           AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKP
Sbjct: 481 AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKP 540

Query: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTI 600
           TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTI
Sbjct: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTI 600

Query: 601 NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE 660
           NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE
Sbjct: 601 NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE 660

Query: 661 GKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS 715
           GKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS
Sbjct: 661 GKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS 715

BLAST of CsGy3G024330.2 vs. ExPASy TrEMBL
Match: A0A5D3E368 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold595G00640 PE=3 SV=1)

HSP 1 Score: 1369 bits (3544), Expect = 0.0
Identity = 679/715 (94.97%), Positives = 695/715 (97.20%), Query Frame = 0

Query: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60
           MH LSNWCPTSCSGV+LGSYSVVHRSWKR+K FGFSDC CGNWGFSLISFNLSVL SGFC
Sbjct: 1   MHVLSNWCPTSCSGVDLGSYSVVHRSWKRIKCFGFSDCCCGNWGFSLISFNLSVLGSGFC 60

Query: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120
           YENSRFVCNCEFRHG SKLRVVPLMK NRNSL A+CLSAW VEQPTI DE+ RVESNSRD
Sbjct: 61  YENSRFVCNCEFRHGYSKLRVVPLMKPNRNSLEAWCLSAWTVEQPTIGDELPRVESNSRD 120

Query: 121 GLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDV 180
           GLPER LDWD DDD  VNGENSHGGGSFKDEGE+EGVGDVRVDVRALAAQLQLARTADDV
Sbjct: 121 GLPERRLDWDGDDDDNVNGENSHGGGSFKDEGEMEGVGDVRVDVRALAAQLQLARTADDV 180

Query: 181 DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL 240
           DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL
Sbjct: 181 DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL 240

Query: 241 GAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLT 300
           GAVKQSGELS+MENVLT+MAQEGIVSNVVTYNTIMSIYLEQGLA KALGILEEMPKKGLT
Sbjct: 241 GAVKQSGELSKMENVLTEMAQEGIVSNVVTYNTIMSIYLEQGLATKALGILEEMPKKGLT 300

Query: 301 LSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRV 360
           LSPVSYSTALRAYR+MKDGNGAL+FMVELRERY NGEIAKDDNVDWANEFLKLENFTRRV
Sbjct: 301 LSPVSYSTALRAYRKMKDGNGALEFMVELRERYHNGEIAKDDNVDWANEFLKLENFTRRV 360

Query: 361 CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFR 420
           CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAE ERLIWACTCAEHYNVAKELY R
Sbjct: 361 CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEEERLIWACTCAEHYNVAKELYIR 420

Query: 421 IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLT 480
           IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYE+LLEKGPKPNNMSYELIVSHFNVLLT
Sbjct: 421 IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEELLEKGPKPNNMSYELIVSHFNVLLT 480

Query: 481 AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKP 540
           AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFR+MVEQGEKP
Sbjct: 481 AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRRMVEQGEKP 540

Query: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTI 600
           TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFT QGKFNMVEVTI
Sbjct: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTSQGKFNMVEVTI 600

Query: 601 NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE 660
           NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE
Sbjct: 601 NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE 660

Query: 661 GKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS 715
           GKPRLAYELY RAKDEGLNLSSK+YDAVIESSQLYGAS++I+LLGLRPPD+NKSS
Sbjct: 661 GKPRLAYELYRRAKDEGLNLSSKIYDAVIESSQLYGASIDIRLLGLRPPDKNKSS 715

BLAST of CsGy3G024330.2 vs. ExPASy TrEMBL
Match: A0A6J1DL11 (pentatricopeptide repeat-containing protein At3g46610 OS=Momordica charantia OX=3673 GN=LOC111021510 PE=3 SV=1)

HSP 1 Score: 1231 bits (3185), Expect = 0.0
Identity = 618/722 (85.60%), Positives = 661/722 (91.55%), Query Frame = 0

Query: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60
           MHALSNWCPTS S VELGS  VV RS KR+K  GFSDC CGN GFSLISFNL V RSGFC
Sbjct: 1   MHALSNWCPTSSSKVELGSSCVVRRSGKRLKCVGFSDCCCGNGGFSLISFNLRVFRSGFC 60

Query: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120
           YENS+F C+CEFRHGCSKL V  LMK  RNSLGA+ LSAWAVEQPT+ +EI RVESNS D
Sbjct: 61  YENSKFDCSCEFRHGCSKLIVARLMKPKRNSLGAWFLSAWAVEQPTVGNEIVRVESNSED 120

Query: 121 GLPER-------GLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQL 180
            L ER       GLDWDD  +  VNGEN HGGG FKDE  +EG GDV VDVRALA +LQL
Sbjct: 121 DLAERSEGEGYGGLDWDDHHN--VNGENGHGGGDFKDEDGMEGEGDVWVDVRALAGRLQL 180

Query: 181 ARTADDVDQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNL 240
            RTADDV++VLKD+ ELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIET+GRIAPNL
Sbjct: 181 TRTADDVEEVLKDVGELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETDGRIAPNL 240

Query: 241 FIYNSLLGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEE 300
           FIYNSLLGAVKQS   S+ME+VL DMAQEGI SNV+TYNTIMSIYLEQGLAMKALGILEE
Sbjct: 241 FIYNSLLGAVKQSTVFSKMEDVLADMAQEGITSNVITYNTIMSIYLEQGLAMKALGILEE 300

Query: 301 MPKKGLTLSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKL 360
           MPKKGLT SPVSYST L+AYRRMKDGNGALKFM ELRE+YR+GE+AKDDNVDWA+EF+KL
Sbjct: 301 MPKKGLTPSPVSYSTGLQAYRRMKDGNGALKFMTELREKYRSGEMAKDDNVDWADEFMKL 360

Query: 361 ENFTRRVCYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNV 420
           ENFT+RVCYQVMRIWLVKG  ASTKVLQLL+EMDKAGLSLDRAE ERLIWACTCAEH+NV
Sbjct: 361 ENFTKRVCYQVMRIWLVKGYSASTKVLQLLVEMDKAGLSLDRAEEERLIWACTCAEHHNV 420

Query: 421 AKELYFRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVS 480
           AKELY+RIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVS
Sbjct: 421 AKELYYRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVS 480

Query: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKM 540
           HFNVLLTAAKKRGIWRWGVRLLNKMEEKGL+PGSREWNAVLVACS+AAETSAAI+IFR+M
Sbjct: 481 HFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSREWNAVLVACSKAAETSAAIEIFRRM 540

Query: 541 VEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKF 600
           VEQGEKPT+LSYGALLSALEKGKLYDEARSVWDHMI+VGV+PNIYAYTTMASVFTGQGKF
Sbjct: 541 VEQGEKPTILSYGALLSALEKGKLYDEARSVWDHMIKVGVKPNIYAYTTMASVFTGQGKF 600

Query: 601 NMVEVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELL 660
           NMVEVTINDMV+SGIEPTVVTYNAIITGCVRNG+SSVAYEWFHRMKVRNISPNEVSYELL
Sbjct: 601 NMVEVTINDMVSSGIEPTVVTYNAIITGCVRNGLSSVAYEWFHRMKVRNISPNEVSYELL 660

Query: 661 IEALAKEGKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNK 715
           IEALAKEGKPRLAYELY+RAK++ LNLSSK YDAVI+SSQ+YGAS++I+ LG  PPD NK
Sbjct: 661 IEALAKEGKPRLAYELYLRAKNDSLNLSSKTYDAVIQSSQVYGASIDIRALGSPPPDTNK 720

BLAST of CsGy3G024330.2 vs. ExPASy TrEMBL
Match: A0A6J1F973 (pentatricopeptide repeat-containing protein At3g46610-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111441977 PE=4 SV=1)

HSP 1 Score: 1217 bits (3149), Expect = 0.0
Identity = 607/715 (84.90%), Positives = 653/715 (91.33%), Query Frame = 0

Query: 1   MHALSNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFC 60
           MH  SNWCP S SGVELGSYSVVH SWKR+   GFSD   GN    LISFN SVLRSGFC
Sbjct: 1   MHVHSNWCPISSSGVELGSYSVVHSSWKRINRVGFSDSCYGNGNLYLISFNFSVLRSGFC 60

Query: 61  YENSRFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRD 120
            E SRF C  EFRHGCSKLRV PLMK  RNSLGA+ L AWAVEQPTIDDEI RVESNSRD
Sbjct: 61  CETSRFECIREFRHGCSKLRVAPLMKPKRNSLGAWFLFAWAVEQPTIDDEIARVESNSRD 120

Query: 121 GLPERGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDV 180
            LPE  LDWD  D G VN ENSHG G+FKDE  +EG GDVRVDVRALA +LQLARTADDV
Sbjct: 121 DLPESSLDWDVYDPGNVNSENSHGRGNFKDEEGMEGEGDVRVDVRALARRLQLARTADDV 180

Query: 181 DQVLKDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLL 240
           +++LKD+  LPLQVFSS+IRGFGR+RRLECAVALV+WLK+KKIETNGRIAPNLFIYNSLL
Sbjct: 181 EELLKDVGVLPLQVFSSIIRGFGRNRRLECAVALVEWLKKKKIETNGRIAPNLFIYNSLL 240

Query: 241 GAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLT 300
           GAVKQSGE S+ME+VLTDMAQEGIVSNVVTYNTIMSIYLEQGLA+KALGILEEMP+KGLT
Sbjct: 241 GAVKQSGEFSKMEDVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAIKALGILEEMPRKGLT 300

Query: 301 LSPVSYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRV 360
             PVSYSTAL+AYRRM DGNGALKFM+ELRERYRNGE+ KDDNVDWA++FLKLE FTRRV
Sbjct: 301 PCPVSYSTALQAYRRMNDGNGALKFMIELRERYRNGELVKDDNVDWADKFLKLEKFTRRV 360

Query: 361 CYQVMRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFR 420
           CYQVMRIWLVK D A+TKVLQLLMEMDKAGLSLDR E ERLIWACTCAEH+NVAKELY+R
Sbjct: 361 CYQVMRIWLVKDDPANTKVLQLLMEMDKAGLSLDRVEEERLIWACTCAEHHNVAKELYYR 420

Query: 421 IREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLT 480
           IREKQC ISLSVCNHVIWL GKAKKWWAALEIYEDLLEKGPKPNN+S ELIVSHFNVLLT
Sbjct: 421 IREKQCSISLSVCNHVIWLTGKAKKWWAALEIYEDLLEKGPKPNNLSNELIVSHFNVLLT 480

Query: 481 AAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKP 540
           AAKKRGIWRWGVRLL+KMEEKGL+PG REWNAVLVACSRAAETSAAIDIFR+MVE+GEKP
Sbjct: 481 AAKKRGIWRWGVRLLDKMEEKGLKPGIREWNAVLVACSRAAETSAAIDIFRRMVEKGEKP 540

Query: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTI 600
           TVLSYGALLSALEKGKLYDEARSVWDHMI+VGVEPNIYAYTTM S+FTGQGKFNMVEVT+
Sbjct: 541 TVLSYGALLSALEKGKLYDEARSVWDHMIKVGVEPNIYAYTTMTSIFTGQGKFNMVEVTL 600

Query: 601 NDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKE 660
           NDMV SGIEPTVVTYNAIITGCVRNGMS+VAYEWFHRMK RNISP+EVSYELL+EALAKE
Sbjct: 601 NDMVTSGIEPTVVTYNAIITGCVRNGMSTVAYEWFHRMKARNISPDEVSYELLVEALAKE 660

Query: 661 GKPRLAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS 715
           GKPRLAYELY+ AKDEGLNLSSK+YDAVI+SSQ++GAS++I+LLG RP ++NKSS
Sbjct: 661 GKPRLAYELYLSAKDEGLNLSSKIYDAVIQSSQVHGASIDIRLLGPRPLEKNKSS 715

BLAST of CsGy3G024330.2 vs. ExPASy TrEMBL
Match: A0A6J1J1F6 (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At3g46610-like OS=Cucurbita maxima OX=3661 GN=LOC111481858 PE=4 SV=1)

HSP 1 Score: 1206 bits (3121), Expect = 0.0
Identity = 602/711 (84.67%), Positives = 649/711 (91.28%), Query Frame = 0

Query: 5   SNWCPTSCSGVELGSYSVVHRSWKRVKSFGFSDCRCGNWGFSLISFNLSVLRSGFCYENS 64
           SNW P S SGVELGSYSVVHRSWKR+   GFSD   GN  FSLISFNLSVLRSGFC E S
Sbjct: 5   SNWYPISSSGVELGSYSVVHRSWKRINCVGFSDSCYGNGNFSLISFNLSVLRSGFCCETS 64

Query: 65  RFVCNCEFRHGCSKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRDGLPE 124
           RF C  EFRHGCSKLRV PLMK  RNSLGA+ L AWAVEQPTIDDEI RVESNSRD LPE
Sbjct: 65  RFECIREFRHGCSKLRVAPLMKPKRNSLGAWFLFAWAVEQPTIDDEIARVESNSRDDLPE 124

Query: 125 RGLDWDDDDDGKVNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDVDQVL 184
             LDWD  D G V+ ENSHG G+FKDE  +EG GDVRVDVRALA +LQLARTADDV+++L
Sbjct: 125 SSLDWDVYDPGNVDSENSHGRGNFKDEEGMEGEGDVRVDVRALARRLQLARTADDVEELL 184

Query: 185 KDMVELPLQVFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLLGAVK 244
           KD+  LPLQVFSS+IRGFGR+RRLECAVALV+WLK+KKIETNGRI PNLFIYNSLLGAVK
Sbjct: 185 KDVGVLPLQVFSSIIRGFGRNRRLECAVALVEWLKKKKIETNGRIVPNLFIYNSLLGAVK 244

Query: 245 QSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTLSPV 304
           QS E S+ME+VLTDMAQEGIVSNVVTYNTIMSIYLEQGLA+KALGILEEMP+KGLT  PV
Sbjct: 245 QSREFSKMEDVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAIKALGILEEMPRKGLTPCPV 304

Query: 305 SYSTALRAYRRMKDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRVCYQV 364
           SYSTAL+AYRRM DGNGALKFM+ELRERYRNGE+ KDDNVDWA++FLKLE FTRRVCYQV
Sbjct: 305 SYSTALQAYRRMNDGNGALKFMIELRERYRNGELVKDDNVDWADKFLKLEKFTRRVCYQV 364

Query: 365 MRIWLVKGDCASTKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFRIREK 424
           MRIWLVK D A+TKVLQLLMEMDKAGLSLD  E ERLIWACTCAEH+NVAKELY+RIREK
Sbjct: 365 MRIWLVKDDPANTKVLQLLMEMDKAGLSLDHVEEERLIWACTCAEHHNVAKELYYRIREK 424

Query: 425 QCGISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLTAAKK 484
           QC ISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNN+S ELIVSHFNVLLTAAKK
Sbjct: 425 QCSISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNLSNELIVSHFNVLLTAAKK 484

Query: 485 RGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKPTVLS 544
           RGIWRWGVRLL+KMEEKGL+PG REWNAVLVACSRAAE SAAIDIFR+MVE+GEKPTVLS
Sbjct: 485 RGIWRWGVRLLDKMEEKGLKPGIREWNAVLVACSRAAEMSAAIDIFRRMVEKGEKPTVLS 544

Query: 545 YGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTINDMV 604
           YGALLSALEKGKLYDEARSVWDHMI+VGVEPNIYAYTTM S+FTGQGKFNMVEVT+NDMV
Sbjct: 545 YGALLSALEKGKLYDEARSVWDHMIKVGVEPNIYAYTTMTSIFTGQGKFNMVEVTLNDMV 604

Query: 605 ASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKEGKPR 664
            SGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMK +NISP+EVSYELL+EALAKEGKPR
Sbjct: 605 TSGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKAQNISPDEVSYELLVEALAKEGKPR 664

Query: 665 LAYELYMRAKDEGLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNKSS 715
           LAYELY+ AKDEGLN SSK+YDAVI+SSQ++GAS++++L G RP D+NKSS
Sbjct: 665 LAYELYLSAKDEGLNFSSKIYDAVIQSSQVHGASIDVRLXGPRPLDKNKSS 715

BLAST of CsGy3G024330.2 vs. TAIR 10
Match: AT3G46610.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 765.8 bits (1976), Expect = 3.1e-221
Identity = 385/637 (60.44%), Positives = 484/637 (75.98%), Query Frame = 0

Query: 77  SKLRVVPLMKTNRNSLGAFCLSAWAVEQPTIDDEITRVESNSRDGLPERGLDWDDDDDGK 136
           S  +V+ L +  R+ LG+     WA EQ                    R L+  +++   
Sbjct: 61  SNRKVLFLCEPKRSLLGSSFGVGWATEQ--------------------RELELGEEEVST 120

Query: 137 VNGENSHGGGSFKDEGELEGVGDVRVDVRALAAQLQLARTADDVDQVLKDMVELPLQVFS 196
            +  +++GG             ++RVDVR LA  L+ A+TADDVD VLKD  ELPLQVF 
Sbjct: 121 EDLSSANGGEK----------NNLRVDVRELAFSLRAAKTADDVDAVLKDKGELPLQVFC 180

Query: 197 SMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLLGAVKQSGELSRMENVL 256
           +MI+GFG+D+RL+ AVA+VDWLKRKK E+ G I PNLFIYNSLLGA++  GE    E +L
Sbjct: 181 AMIKGFGKDKRLKPAVAVVDWLKRKKSESGGVIGPNLFIYNSLLGAMRGFGE---AEKIL 240

Query: 257 TDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTLSPVSYSTALRAYRRM 316
            DM +EGIV N+VTYNT+M IY+E+G  +KALGIL+   +KG   +P++YSTAL  YRRM
Sbjct: 241 KDMEEEGIVPNIVTYNTLMVIYMEEGEFLKALGILDLTKEKGFEPNPITYSTALLVYRRM 300

Query: 317 KDGNGALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRVCYQVMRIWLVKGDCAS 376
           +DG GAL+F VELRE+Y   EI  D   DW  EF+KLENF  R+CYQVMR WLVK D  +
Sbjct: 301 EDGMGALEFFVELREKYAKREIGNDVGYDWEFEFVKLENFIGRICYQVMRRWLVKDDNWT 360

Query: 377 TKVLQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFRIREKQCGISLSVCNHV 436
           T+VL+LL  MD AG+   R E ERLIWACT  EHY V KELY RIRE+   ISLSVCNH+
Sbjct: 361 TRVLKLLNAMDSAGVRPSREEHERLIWACTREEHYIVGKELYKRIRERFSEISLSVCNHL 420

Query: 437 IWLMGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLN 496
           IWLMGKAKKWWAALEIYEDLL++GP+PNN+SYEL+VSHFN+LL+AA KRGIWRWGVRLLN
Sbjct: 421 IWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRWGVRLLN 480

Query: 497 KMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKPTVLSYGALLSALEKGK 556
           KME+KGL+P  R WNAVLVACS+A+ET+AAI IF+ MV+ GEKPTV+SYGALLSALEKGK
Sbjct: 481 KMEDKGLKPQRRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGK 540

Query: 557 LYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTINDMVASGIEPTVVTYN 616
           LYDEA  VW+HMI+VG+EPN+YAYTTMASV TGQ KFN+++  + +M + GIEP+VVT+N
Sbjct: 541 LYDEAFRVWNHMIKVGIEPNLYAYTTMASVLTGQQKFNLLDTLLKEMASKGIEPSVVTFN 600

Query: 617 AIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIEALAKEGKPRLAYELYMRAKDE 676
           A+I+GC RNG+S VAYEWFHRMK  N+ PNE++YE+LIEALA + KPRLAYEL+++A++E
Sbjct: 601 AVISGCARNGLSGVAYEWFHRMKSENVEPNEITYEMLIEALANDAKPRLAYELHVKAQNE 660

Query: 677 GLNLSSKVYDAVIESSQLYGASVNIKLLGLRPPDRNK 714
           GL LSSK YDAV++S++ YGA++++ LLG RP  +N+
Sbjct: 661 GLKLSSKPYDAVVKSAETYGATIDLNLLGPRPDKKNR 664

BLAST of CsGy3G024330.2 vs. TAIR 10
Match: AT5G14350.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 162.5 bits (410), Expect = 1.2e-39
Identity = 91/201 (45.27%), Positives = 120/201 (59.70%), Query Frame = 0

Query: 442 KAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKMEEK 501
           K +   +ALE+YEDLL++GP+PNN+SYE                      +RL       
Sbjct: 95  KLRNGGSALEMYEDLLDEGPEPNNLSYE---------------------PMRL------- 154

Query: 502 GLRPGS-REWNAVLVACSRAAETSAAIDIFRKMVEQGEKPTVLSYGALLSALEKGKLYDE 561
            LRP S ++W                              TV S+GALLSALEKGKLYDE
Sbjct: 155 QLRPKSIKQW----------------------------LTTVKSHGALLSALEKGKLYDE 214

Query: 562 ARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTINDMVASG-IEPTVVTYNAII 621
              VW+HM++VG+EPN+YAYTTMASV TGQ K N+++  + +M + G I+P+VVTYNA+I
Sbjct: 215 VLRVWNHMVKVGIEPNLYAYTTMASVLTGQQKLNLLDTLLKEMPSKGIIKPSVVTYNAVI 239

Query: 622 TGCVRNGMSSVAYEWFHRMKV 641
           +GC RNG+S VAYEWFHRM++
Sbjct: 275 SGCTRNGLSGVAYEWFHRMRI 239

BLAST of CsGy3G024330.2 vs. TAIR 10
Match: AT2G18940.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 132.9 bits (333), Expect = 1.0e-30
Identity = 103/448 (22.99%), Positives = 192/448 (42.86%), Query Frame = 0

Query: 231 PNLFIYNSLLGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGI 290
           P    YN+LL    ++G  +   +VL +M +    ++ VTYN +++ Y+  G + +A G+
Sbjct: 314 PGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADSVTYNELVAAYVRAGFSKEAAGV 373

Query: 291 LEEMPKKGLTLSPVSYSTALRAYRRMKDGNGALKFMVELRER-------YRNGEIAKDDN 350
           +E M KKG+  + ++Y+T + AY +    + ALK    ++E          N  ++    
Sbjct: 374 IEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYSMKEAGCVPNTCTYNAVLSLLGK 433

Query: 351 VDWANEFLK-LENFTRRVCYQVMRIWLVKGDCASTK-----VLQLLMEMDKAGLSLDRAE 410
              +NE +K L +     C      W         K     V ++  EM   G   DR  
Sbjct: 434 KSRSNEMIKMLCDMKSNGCSPNRATWNTMLALCGNKGMDKFVNRVFREMKSCGFEPDRDT 493

Query: 411 AERLIWAC-TCAEHYNVAKELYFRIREKQCGISLSVCNHVIWLMGKAKKWWAALEIYEDL 470
              LI A   C    + +K +Y  +        ++  N ++  + +   W +   +  D+
Sbjct: 494 FNTLISAYGRCGSEVDASK-MYGEMTRAGFNACVTTYNALLNALARKGDWRSGENVISDM 553

Query: 471 LEKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVA 530
             KG KP   SY L       +L    K G +    R+ N+++E  + P       +L+A
Sbjct: 554 KSKGFKPTETSYSL-------MLQCYAKGGNYLGIERIENRIKEGQIFPSWMLLRTLLLA 613

Query: 531 ---CSRAAETSAAIDIFRKMVEQGEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGV 590
              C   A +  A  +F+K    G KP ++ + ++LS   +  +YD+A  + + +   G+
Sbjct: 614 NFKCRALAGSERAFTLFKK---HGYKPDMVIFNSMLSIFTRNNMYDQAEGILESIREDGL 673

Query: 591 EPNIYAYTTMASVFTGQGKFNMVEVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYE 650
            P++  Y ++  ++  +G+    E  +  +  S ++P +V+YN +I G  R G+   A  
Sbjct: 674 SPDLVTYNSLMDMYVRRGECWKAEEILKTLEKSQLKPDLVSYNTVIKGFCRRGLMQEAVR 733

Query: 651 WFHRMKVRNISPNEVSYELLIEALAKEG 662
               M  R I P   +Y   +      G
Sbjct: 734 MLSEMTERGIRPCIFTYNTFVSGYTAMG 750

BLAST of CsGy3G024330.2 vs. TAIR 10
Match: AT5G01110.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 132.5 bits (332), Expect = 1.4e-30
Identity = 112/493 (22.72%), Positives = 222/493 (45.03%), Query Frame = 0

Query: 217 WLKRKKIETNGRIAPNLFIYNSLLGAVKQSGELSRMENVLTDMAQEGIVSNVVTYNTIMS 276
           W   ++I  +G +  N++  N ++ A+ + G++ ++   L+ + ++G+  ++VTYNT++S
Sbjct: 220 WGVYQEISRSG-VGINVYTLNIMVNALCKDGKMEKVGTFLSQVQEKGVYPDIVTYNTLIS 279

Query: 277 IYLEQGLAMKALGILEEMPKKGLTLSPVSYSTALRA------YRRMKD----------GN 336
            Y  +GL  +A  ++  MP KG +    +Y+T +        Y R K+            
Sbjct: 280 AYSSKGLMEEAFELMNAMPGKGFSPGVYTYNTVINGLCKHGKYERAKEVFAEMLRSGLSP 339

Query: 337 GALKFMVELRERYRNGEIAKDDNVDWANEFLKLENFTRRVCY-QVMRIWLVKGDCASTKV 396
            +  +   L E  + G++ + + V   ++    +     VC+  +M ++   G+    K 
Sbjct: 340 DSTTYRSLLMEACKKGDVVETEKV--FSDMRSRDVVPDLVCFSSMMSLFTRSGNL--DKA 399

Query: 397 LQLLMEMDKAGLSLDRAEAERLIWACTCAEHYNVAKELYFRIREKQCGISLSVCNHVIWL 456
           L     + +AGL  D      LI         +VA  L   + ++ C + +   N ++  
Sbjct: 400 LMYFNSVKEAGLIPDNVIYTILIQGYCRKGMISVAMNLRNEMLQQGCAMDVVTYNTILHG 459

Query: 457 MGKAKKWWAALEIYEDLLEKGPKPNNMSYELIVSHFNVLLTAAKKRGIWRWGVRLLNKME 516
           + K K    A +++ ++ E+   P+  SY L      +L+    K G  +  + L  KM+
Sbjct: 460 LCKRKMLGEADKLFNEMTERALFPD--SYTL-----TILIDGHCKLGNLQNAMELFQKMK 519

Query: 517 EKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQGEKPTVLSYGALLSAL-EKGKLY 576
           EK +R     +N +L    +  +   A +I+  MV +   PT +SY  L++AL  KG L 
Sbjct: 520 EKRIRLDVVTYNTLLDGFGKVGDIDTAKEIWADMVSKEILPTPISYSILVNALCSKGHL- 579

Query: 577 DEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMVEVTINDMVASGIEPTVVTYNAI 636
            EA  VWD MI   ++P +    +M   +   G  +  E  +  M++ G  P  ++YN +
Sbjct: 580 AEAFRVWDEMISKNIKPTVMICNSMIKGYCRSGNASDGESFLEKMISEGFVPDCISYNTL 639

Query: 637 ITGCVRNGMSSVAYEWFHRMKVR--NISPNEVSYELLIEALAKEGKPRLAYELYMRAKDE 690
           I G VR    S A+    +M+     + P+  +Y  ++    ++ + + A  +  +  + 
Sbjct: 640 IYGFVREENMSKAFGLVKKMEEEQGGLVPDVFTYNSILHGFCRQNQMKEAEVVLRKMIER 699

BLAST of CsGy3G024330.2 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 131.7 bits (330), Expect = 2.3e-30
Identity = 118/479 (24.63%), Positives = 221/479 (46.14%), Query Frame = 0

Query: 194 VFSSMIRGFGRDRRLECAVALVDWLKRKKIETNGRIAPNLFIYNSLLGA-VKQSGELSRM 253
           VF  +++ + R   ++ A+++V        + +G   P +  YN++L A ++    +S  
Sbjct: 136 VFDLVVKSYSRLSLIDKALSIV-----HLAQAHG-FMPGVLSYNAVLDATIRSKRNISFA 195

Query: 254 ENVLTDMAQEGIVSNVVTYNTIMSIYLEQGLAMKALGILEEMPKKGLTLSPVSYSTALRA 313
           ENV  +M +  +  NV TYN ++  +   G    AL + ++M  KG   + V+Y+T +  
Sbjct: 196 ENVFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDG 255

Query: 314 Y---RRMKDG-----NGALKFMVELRERYR---NGEIAKDDNVDWANEFLKLENFTRRVC 373
           Y   R++ DG     + ALK +      Y    NG + ++  +   +  L   N      
Sbjct: 256 YCKLRKIDDGFKLLRSMALKGLEPNLISYNVVING-LCREGRMKEVSFVLTEMNRRGYSL 315

Query: 374 YQVMRIWLVKGDCASTKVLQLLM---EMDKAGLSLDRAEAERLIWACTCAEHYNVAKELY 433
            +V    L+KG C      Q L+   EM + GL+        LI +   A + N A E  
Sbjct: 316 DEVTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFL 375

Query: 434 FRIREKQCGISLSVCNHVIWLMGKAKKWW--AALEIYEDLLEKGPKPNNMSYELIVSHFN 493
            ++R +  G+  +   +   + G ++K +   A  +  ++ + G  P+ ++Y       N
Sbjct: 376 DQMRVR--GLCPNERTYTTLVDGFSQKGYMNEAYRVLREMNDNGFSPSVVTY-------N 435

Query: 494 VLLTAAKKRGIWRWGVRLLNKMEEKGLRPGSREWNAVLVACSRAAETSAAIDIFRKMVEQ 553
            L+      G     + +L  M+EKGL P    ++ VL    R+ +   A+ + R+MVE+
Sbjct: 436 ALINGHCVTGKMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEK 495

Query: 554 GEKPTVLSYGALLSALEKGKLYDEARSVWDHMIRVGVEPNIYAYTTMASVFTGQGKFNMV 613
           G KP  ++Y +L+    + +   EA  +++ M+RVG+ P+ + YT + + +  +G     
Sbjct: 496 GIKPDTITYSSLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKA 555

Query: 614 EVTINDMVASGIEPTVVTYNAIITGCVRNGMSSVAYEWFHRMKVRNISPNEVSYELLIE 656
               N+MV  G+ P VVTY+ +I G  +   +  A     ++      P++V+Y  LIE
Sbjct: 556 LQLHNEMVEKGVLPDVVTYSVLINGLNKQSRTREAKRLLLKLFYEESVPSDVTYHTLIE 598
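
The percentages in each HSP header above are the identity and positive counts divided by the alignment length. A minimal Python sketch of that arithmetic follows, assuming the two-line header layout shown in this section; the pattern and the parse_hsp helper are hypothetical illustrations, not part of any BLAST toolkit.

```python
import re

# Hypothetical pattern written for the two-line HSP header layout used on this page.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+) bits \((?P<raw>\d+)\), Expect = (?P<evalue>\S+)\s+"
    r"Identity = (?P<ident>\d+)/(?P<length>\d+) \([\d.]+%\), "
    r"Pos\w* = (?P<pos>\d+)/\d+ \([\d.]+%\)"  # match the positives label loosely
)

def parse_hsp(header: str) -> dict:
    """Extract bit score and E-value, and recompute the two percentages."""
    m = HSP_RE.search(header)
    if m is None:
        raise ValueError("text does not look like an HSP header")
    ident, pos, length = int(m["ident"]), int(m["pos"]), int(m["length"])
    return {
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
        "alignment_length": length,
        # Percentages are counts over the alignment length (gaps included).
        "identity_pct": round(100 * ident / length, 2),
        "positives_pct": round(100 * pos / length, 2),
    }

# Header of the AT3G46610.1 hit, copied from the TAIR 10 results above.
header = """HSP 1 Score: 765.8 bits (1976), Expect = 3.1e-221
Identity = 385/637 (60.44%), Positives = 484/637 (75.98%), Query Frame = 0"""
stats = parse_hsp(header)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing the values from the raw counts, rather than reading the printed percentages, gives a quick consistency check on a copied header.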

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9SNB7 | 4.4e-220 | 60.44 | Protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic OS=Arabidopsis thaliana O...
O64624 | 1.5e-29 | 22.99 | Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop...
Q9LFC5 | 1.9e-29 | 22.72 | Pentatricopeptide repeat-containing protein At5g01110 OS=Arabidopsis thaliana OX...
Q9FIX3 | 3.2e-29 | 24.63 | Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX...
Q3EDF8 | 2.7e-28 | 22.70 | Pentatricopeptide repeat-containing protein At1g09900 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity (%) | Description
XP_011651578.1 | 0.0 | 100.00 | protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic [Cucumis sativus]
KAA0066960.1 | 0.0 | 94.97 | pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK30239...
XP_038898205.1 | 0.0 | 92.60 | protein LOW PHOTOSYNTHETIC EFFICIENCY 1, chloroplastic [Benincasa hispida]
XP_022154192.1 | 0.0 | 85.60 | pentatricopeptide repeat-containing protein At3g46610 [Momordica charantia]
XP_022934968.1 | 0.0 | 84.90 | pentatricopeptide repeat-containing protein At3g46610-like isoform X1 [Cucurbita...

Match Name | E-value | Identity (%) | Description
A0A0A0LB88 | 0.0 | 100.00 | PPR_long domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_3G595200 PE...
A0A5D3E368 | 0.0 | 94.97 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
A0A6J1DL11 | 0.0 | 85.60 | pentatricopeptide repeat-containing protein At3g46610 OS=Momordica charantia OX=...
A0A6J1F973 | 0.0 | 84.90 | pentatricopeptide repeat-containing protein At3g46610-like isoform X1 OS=Cucurbi...
A0A6J1J1F6 | 0.0 | 84.67 | LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At3g46610-like ...

Match Name | E-value | Identity (%) | Description
AT3G46610.1 | 3.1e-221 | 60.44 | Pentatricopeptide repeat (PPR-like) superfamily protein
AT5G14350.1 | 1.2e-39 | 45.27 | Pentatricopeptide repeat (PPR) superfamily protein
AT2G18940.1 | 1.0e-30 | 22.99 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G01110.1 | 1.4e-30 | 22.72 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G39710.1 | 2.3e-30 | 24.63 | Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 557..699; e-value: 6.7E-33; score: 116.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 365..472; e-value: 1.5E-9; score: 39.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 473..556; e-value: 9.0E-16; score: 59.7
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 160..349; e-value: 7.6E-27; score: 96.5
IPR033443 | Pentacotripeptide-repeat region of PRORP | PFAM | PF17177 | PPR_long | coord: 443..575; e-value: 1.2E-8; score: 34.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 544..577; e-value: 1.4E-5; score: 22.9
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 613..647; e-value: 6.6E-9; score: 33.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 269..300; e-value: 7.6E-5; score: 20.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 510..541; e-value: 4.9E-5; score: 21.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 194..215; e-value: 0.68; score: 10.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 267..313; e-value: 1.4E-10; score: 41.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 610..659; e-value: 1.4E-14; score: 54.1
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 267..301; score: 10.829822
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 646..680; score: 9.843305
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 611..645; score: 11.564229
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 541..575; score: 10.599635
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 506..540; score: 9.711769
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 232..266; score: 8.856788
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 576..610; score: 8.95544
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 115..151
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 115..139
None | No IPR available | PANTHER | PTHR47940 | OS12G0283900 PROTEIN | coord: 3..714

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name | Unique Name | Type
CsGy3G024330 | CsGy3G024330 | gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CsGy3G024330.2.utr5p1 | CsGy3G024330.2.utr5p1 | five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CsGy3G024330.2.exon1 | CsGy3G024330.2.exon1 | exon


The following CDS feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
cds.CsGy3G024330.2 | cds.CsGy3G024330.2 | CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name | Unique Name | Type
CsGy3G024330.2.utr3p1 | CsGy3G024330.2.utr3p1 | three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature Name | Unique Name | Type
CsGy3G024330.2 | CsGy3G024330.2-protein | polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding