CsGy1G026460 (gene) Cucumber (Gy14) v2

NameCsGy1G026460
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v2)
Descriptionpentatricopeptide repeat-containing protein At3g18970
LocationChr1 : 25027995 .. 25029455 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCACCTGGCCCCAAGACTGAGCTGCATCCACCACCTAAGTAACGTAAGAATTTCTTCCCTTCAACTTCTACAAATTCATGCCCAATTGATAACCAATGGCTTCAAATCTCCCTCCCCTTACGCCAAGCTAATCACCCATTTATGCAAGAAATCTTCCTCAGAATCCATCGCCCACGCCCATTTGATCTTCCGGCACCACCAATACTCTCCAAATCTCTTCCTCTTCAACACTCTCATAAGATGCGCTCCACCTCATCATTCCATCTCCATTTTCGCCACTTGGGTCTCCACCTCCCACTTCGAATTTGACGATTTCACTTTCATTTTCGTGCTTGGAGCTTGCGCGCGAGCCCCATCTGTTTCTACGTTAATGATTGGTAGGCAAATTCATACTCATATTCTTAAACGTGGGATTGTTTCGAACATTTGGGTGCAGACTACTATGATACATTTTTATTCCATTAATAAAGATGTGGGTAGTGCGCGGAAGTTGTTTGATGAAATGTCTTTGAGAAATAGTGTTACTTGGAATGCGATGATTGCAGGGTACTGCTCACAAGGTGGTAAGGTTTCTCAGAAATATGCCCGAGATGCGTTGGAATTGTTTCGGGGGATGTTGGTTGAATCAACGAATTTTGAGGTGAAACCAACGGATACTACAATGGTTTGCATTCTTTCAGCTGCATCTCAACTAGGTATGCTTGAAACCGGTTCTTGTGTTCATGCTTATATCAAGAAGACAGTTGATTCCCCTGAAAAAGATGTGTTTATTGGCACTGGCTTGGTTAATATGTACTCAAAATGTGGGCTTCTTAACAGTGCCTCGTCAGTTTTTAAGCAGATGAAGCAGAAGAACGTTTTGACATGGACATCCATGGCGACAGGACTGGCTGTTCATGGAAGGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAGCTCATGGTGTAAAGCCAAACGCAGTAACTTTCACGAGTTTGCTTTCTGCTTGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCGTGTCATGGAGAGGAAGTTTGGTGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCTTGGGCGCTCTGGGCACTTGAGAGAGGCATACAAATTGATACTCGAAATGCCAATGGAACCTGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTATGCTCCATGGTGATGTTGAAATGGGAGAGAGAGTGGGTAAGTTGCTTGTTGAGAGACAGGGAGGGGAGAGTTTTGATGACGAATGGTGTGTTGGAAGTGAGGACTTTGTAGCTTTGTCAAATGTTTATGCTTCTGTCGAAAGGTGGGACGATGTGGAGGCTTTGAGAGATGAAATGAAGATCAAAGGGATTGAAAACAAAGCTGGGTGTAGTTCACTACAAACTACGGGTTCTCAAGGCTTGGTGGAGGCTTTATTATAG

mRNA sequence

ATGCACCTGGCCCCAAGACTGAGCTGCATCCACCACCTAAGTAACGTAAGAATTTCTTCCCTTCAACTTCTACAAATTCATGCCCAATTGATAACCAATGGCTTCAAATCTCCCTCCCCTTACGCCAAGCTAATCACCCATTTATGCAAGAAATCTTCCTCAGAATCCATCGCCCACGCCCATTTGATCTTCCGGCACCACCAATACTCTCCAAATCTCTTCCTCTTCAACACTCTCATAAGATGCGCTCCACCTCATCATTCCATCTCCATTTTCGCCACTTGGGTCTCCACCTCCCACTTCGAATTTGACGATTTCACTTTCATTTTCGTGCTTGGAGCTTGCGCGCGAGCCCCATCTGTTTCTACGTTAATGATTGGTAGGCAAATTCATACTCATATTCTTAAACGTGGGATTGTTTCGAACATTTGGGTGCAGACTACTATGATACATTTTTATTCCATTAATAAAGATGTGGGTAGTGCGCGGAAGTTGTTTGATGAAATGTCTTTGAGAAATAGTGTTACTTGGAATGCGATGATTGCAGGGTACTGCTCACAAGGTGGTAAGGTTTCTCAGAAATATGCCCGAGATGCGTTGGAATTGTTTCGGGGGATGTTGGTTGAATCAACGAATTTTGAGGTGAAACCAACGGATACTACAATGGTTTGCATTCTTTCAGCTGCATCTCAACTAGGTATGCTTGAAACCGGTTCTTGTGTTCATGCTTATATCAAGAAGACAGTTGATTCCCCTGAAAAAGATGTGTTTATTGGCACTGGCTTGGTTAATATGTACTCAAAATGTGGGCTTCTTAACAGTGCCTCGTCAGTTTTTAAGCAGATGAAGCAGAAGAACGTTTTGACATGGACATCCATGGCGACAGGACTGGCTGTTCATGGAAGGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAGCTCATGGTGTAAAGCCAAACGCAGTAACTTTCACGAGTTTGCTTTCTGCTTGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCGTGTCATGGAGAGGAAGTTTGGTGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCTTGGGCGCTCTGGGCACTTGAGAGAGGCATACAAATTGATACTCGAAATGCCAATGGAACCTGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTATGCTCCATGGTGATGTTGAAATGGGAGAGAGAGTGGGTAAGTTGCTTGTTGAGAGACAGGGAGGGGAGAGTTTTGATGACGAATGGTGTGTTGGAAGTGAGGACTTTGTAGCTTTGTCAAATGTTTATGCTTCTGTCGAAAGGTGGGACGATGTGGAGGCTTTGAGAGATGAAATGAAGATCAAAGGGATTGAAAACAAAGCTGGGTGTAGTTCACTACAAACTACGGGTTCTCAAGGCTTGGTGGAGGCTTTATTATAG

Coding sequence (CDS)

ATGCACCTGGCCCCAAGACTGAGCTGCATCCACCACCTAAGTAACGTAAGAATTTCTTCCCTTCAACTTCTACAAATTCATGCCCAATTGATAACCAATGGCTTCAAATCTCCCTCCCCTTACGCCAAGCTAATCACCCATTTATGCAAGAAATCTTCCTCAGAATCCATCGCCCACGCCCATTTGATCTTCCGGCACCACCAATACTCTCCAAATCTCTTCCTCTTCAACACTCTCATAAGATGCGCTCCACCTCATCATTCCATCTCCATTTTCGCCACTTGGGTCTCCACCTCCCACTTCGAATTTGACGATTTCACTTTCATTTTCGTGCTTGGAGCTTGCGCGCGAGCCCCATCTGTTTCTACGTTAATGATTGGTAGGCAAATTCATACTCATATTCTTAAACGTGGGATTGTTTCGAACATTTGGGTGCAGACTACTATGATACATTTTTATTCCATTAATAAAGATGTGGGTAGTGCGCGGAAGTTGTTTGATGAAATGTCTTTGAGAAATAGTGTTACTTGGAATGCGATGATTGCAGGGTACTGCTCACAAGGTGGTAAGGTTTCTCAGAAATATGCCCGAGATGCGTTGGAATTGTTTCGGGGGATGTTGGTTGAATCAACGAATTTTGAGGTGAAACCAACGGATACTACAATGGTTTGCATTCTTTCAGCTGCATCTCAACTAGGTATGCTTGAAACCGGTTCTTGTGTTCATGCTTATATCAAGAAGACAGTTGATTCCCCTGAAAAAGATGTGTTTATTGGCACTGGCTTGGTTAATATGTACTCAAAATGTGGGCTTCTTAACAGTGCCTCGTCAGTTTTTAAGCAGATGAAGCAGAAGAACGTTTTGACATGGACATCCATGGCGACAGGACTGGCTGTTCATGGAAGGGGTAAAGAAGCATTGGAGCTATTGGATGCAATGGGAGCTCATGGTGTAAAGCCAAACGCAGTAACTTTCACGAGTTTGCTTTCTGCTTGCTGTCATGGAGGGCTTATTGAAGAGGGGCTCCATTTGTTTCGTGTCATGGAGAGGAAGTTTGGTGTTGTGCCTCAAATGCAGCATTATGGCTGCATTGTTGACCTTCTTGGGCGCTCTGGGCACTTGAGAGAGGCATACAAATTGATACTCGAAATGCCAATGGAACCTGATGGTGTTTTATGGAGGAGTTTGCTGAGTTCTTGTATGCTCCATGGTGATGTTGAAATGGGAGAGAGAGTGGGTAAGTTGCTTGTTGAGAGACAGGGAGGGGAGAGTTTTGATGACGAATGGTGTGTTGGAAGTGAGGACTTTGTAGCTTTGTCAAATGTTTATGCTTCTGTCGAAAGGTGGGACGATGTGGAGGCTTTGAGAGATGAAATGAAGATCAAAGGGATTGAAAACAAAGCTGGGTGTAGTTCACTACAAACTACGGGTTCTCAAGGCTTGGTGGAGGCTTTATTATAG

Protein sequence

MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHAHLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPSVSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAMIAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSCVHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVHGRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQGGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQGLVEALL
BLAST of CsGy1G026460 vs. NCBI nr
Match: XP_011659935.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis sativus] >KGN66147.1 hypothetical protein Csa_1G573660 [Cucumis sativus])

HSP 1 Score: 983.0 bits (2540), Expect = 3.6e-283
Identity = 486/486 (100.00%), Postives = 486/486 (100.00%), Query Frame = 0

Query: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60
           MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA
Sbjct: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60

Query: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120
           HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS
Sbjct: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120

Query: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180
           VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM
Sbjct: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180

Query: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240
           IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC
Sbjct: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240

Query: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300
           VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH
Sbjct: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300

Query: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360
           GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH
Sbjct: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360

Query: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ
Sbjct: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420

Query: 421 GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG 480
           GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG
Sbjct: 421 GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG 480

Query: 481 LVEALL 487
           LVEALL
Sbjct: 481 LVEALL 486

BLAST of CsGy1G026460 vs. NCBI nr
Match: XP_008450741.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis melo])

HSP 1 Score: 939.9 bits (2428), Expect = 3.5e-270
Identity = 462/486 (95.06%), Postives = 474/486 (97.53%), Query Frame = 0

Query: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60
           MHLAPRLSCI+HLSN+RISSLQLLQIHAQ ITNGFKSPSPYAKLITHLCKKSSSESIAHA
Sbjct: 9   MHLAPRLSCINHLSNIRISSLQLLQIHAQFITNGFKSPSPYAKLITHLCKKSSSESIAHA 68

Query: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120
           HLIFRHHQ+SPNLFLFNTLIRCAPP +SISIFA WVST HFEFDDFTFIFVLGACARAPS
Sbjct: 69  HLIFRHHQHSPNLFLFNTLIRCAPPQYSISIFANWVSTPHFEFDDFTFIFVLGACARAPS 128

Query: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180
           VSTLMIGRQIHTHILKRGIVSNIW QTTMIHFYS NKDVGSARK+FDEMS+RNSVTWNAM
Sbjct: 129 VSTLMIGRQIHTHILKRGIVSNIWAQTTMIHFYSTNKDVGSARKVFDEMSVRNSVTWNAM 188

Query: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240
           IAGYCSQ GKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAAS LGMLETG C
Sbjct: 189 IAGYCSQSGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASHLGMLETGVC 248

Query: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300
           VHAYIKKT+DSPEKDVFIGTGLVNMYSKCGLL+SASSVFKQMKQ+NVLTWTSMATGLAVH
Sbjct: 249 VHAYIKKTIDSPEKDVFIGTGLVNMYSKCGLLSSASSVFKQMKQRNVLTWTSMATGLAVH 308

Query: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360
           GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH
Sbjct: 309 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 368

Query: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAY+LILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ
Sbjct: 369 YGCIVDLLGRSGHLREAYELILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 428

Query: 421 GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG 480
           GGESFDDEWCVGSEDFVALSNVYAS ERWDDVEALR+EMKIKGIENKAG SS+QTTGSQG
Sbjct: 429 GGESFDDEWCVGSEDFVALSNVYASAERWDDVEALREEMKIKGIENKAGFSSVQTTGSQG 488

Query: 481 LVEALL 487
           LVE LL
Sbjct: 489 LVETLL 494

BLAST of CsGy1G026460 vs. NCBI nr
Match: XP_023515866.1 (pentatricopeptide repeat-containing protein At3g18970 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 784.3 bits (2024), Expect = 2.5e-223
Identity = 382/478 (79.92%), Postives = 425/478 (88.91%), Query Frame = 0

Query: 1   MHLAPRLSCIHHLSNVRISS-LQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAH 60
           M L+PRLSCIHHL+N+  SS LQL QIHAQLITN FKSP PYAKLI H C   S E+ A+
Sbjct: 1   MRLSPRLSCIHHLNNILSSSLLQLKQIHAQLITNAFKSPIPYAKLIAHFCNNHSPEATAY 60

Query: 61  AHLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAP 120
           AHLI  HH++  NLFLFNTLIRCAPP HSISIFA  VST+HFEFDDFTFIF+LGACARAP
Sbjct: 61  AHLISLHHRHPTNLFLFNTLIRCAPPQHSISIFANSVSTAHFEFDDFTFIFLLGACARAP 120

Query: 121 SVSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNA 180
           S  TL  G+QIHTHILKRG+VSNIWVQTTMIHFY+INKDVG+ARK+FDEMS+RN+VTWNA
Sbjct: 121 SPPTLTTGKQIHTHILKRGVVSNIWVQTTMIHFYAINKDVGAARKVFDEMSVRNNVTWNA 180

Query: 181 MIAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGS 240
           MIAGYCSQ G+V+QKY R+ALELFRGMLVESTN +V PTDTTMVC+LSAASQLG+LETG 
Sbjct: 181 MIAGYCSQSGRVAQKYGREALELFRGMLVESTNSDVNPTDTTMVCLLSAASQLGVLETGV 240

Query: 241 CVHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAV 300
           CVHAYI+KT+DSPE DVFIGTGLVNMYSKCG LNSASSVFKQMKQ+NVLTWT+MATGLA+
Sbjct: 241 CVHAYIEKTIDSPENDVFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAI 300

Query: 301 HGRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQ 360
           HG+GKEALELL+AMG HGVKPNAVTFTSLLS CCHGGLIEEGLHLF VMERKFGVVPQMQ
Sbjct: 301 HGKGKEALELLEAMGGHGVKPNAVTFTSLLSGCCHGGLIEEGLHLFDVMERKFGVVPQMQ 360

Query: 361 HYGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVER 420
           HYGCIVDLLGR GHL+EAY++IL MPM PDGVLWR L+SSCM+H DVEMGE+VGK LVE 
Sbjct: 361 HYGCIVDLLGRCGHLKEAYEMILGMPMAPDGVLWRGLMSSCMVHCDVEMGEKVGKFLVES 420

Query: 421 QGGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTG 478
             G    DEWC GSEDFVALSNVYAS +RW++V+A+R+EMKIKGI+NK G SS+QT G
Sbjct: 421 VVG----DEWCDGSEDFVALSNVYASAQRWENVKAVREEMKIKGIQNKRGYSSVQTRG 474

BLAST of CsGy1G026460 vs. NCBI nr
Match: XP_022159202.1 (pentatricopeptide repeat-containing protein At3g18970 [Momordica charantia])

HSP 1 Score: 765.8 bits (1976), Expect = 9.0e-218
Identity = 376/482 (78.01%), Postives = 423/482 (87.76%), Query Frame = 0

Query: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60
           MHLAPRL CIHHL+N   SSLQL QIHAQLITNGFKSPSP+AKLI   C KSS ++ AHA
Sbjct: 1   MHLAPRLRCIHHLNNTPNSSLQLKQIHAQLITNGFKSPSPFAKLIAQFCDKSSPQATAHA 60

Query: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120
           HLIF+HH+  PNLFL NT IR +PP HSI +FA W ST   EFDDFTFIFVLGA ARAPS
Sbjct: 61  HLIFKHHR--PNLFLLNTFIRSSPPQHSIPVFAKWASTGDLEFDDFTFIFVLGAAARAPS 120

Query: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180
           ++ LMIG+QIHT I KRG++SNIWVQTT IHFY+ NKDV SARK+FDEMS+RNSVTWNAM
Sbjct: 121 LAALMIGKQIHTQIFKRGVISNIWVQTTAIHFYASNKDVSSARKVFDEMSVRNSVTWNAM 180

Query: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240
           IAGYCSQ  +V++KYAR+ALELF  MLV+S N EVKPTDTTMVC+LSA SQLG++ETG+ 
Sbjct: 181 IAGYCSQSERVAEKYARNALELFLVMLVDS-NSEVKPTDTTMVCLLSAVSQLGVVETGAS 240

Query: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300
           VHAYI+KT+ SPE DVF+GTGLV+MYSKCG L+SASSVFKQMKQ+NVLTWT+MATGLAVH
Sbjct: 241 VHAYIEKTIHSPESDVFVGTGLVDMYSKCGCLHSASSVFKQMKQRNVLTWTAMATGLAVH 300

Query: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360
           G+GKE+L LLD+M AHGVKPNAVTFTSLLSACCHGGL+EEGLHLFRVMERKFG VP MQH
Sbjct: 301 GKGKESLLLLDSMEAHGVKPNAVTFTSLLSACCHGGLVEEGLHLFRVMERKFGAVPHMQH 360

Query: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420
           YGCIVDLLGRSGHL+EAY+LI+ MPMEPD VLWRSLLSSCM HGDVEMGE+VGKLLVERQ
Sbjct: 361 YGCIVDLLGRSGHLKEAYELIMGMPMEPDAVLWRSLLSSCMSHGDVEMGEKVGKLLVERQ 420

Query: 421 GGESFDDEWCVG-SEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQ 480
            G    +EWCVG SEDFVALSNVYAS ++W DVEA+R+EM+IKGIENK G SSLQTTG +
Sbjct: 421 RGRHGFEEWCVGSSEDFVALSNVYASAQKWGDVEAVREEMRIKGIENKPGYSSLQTTGPR 479

Query: 481 GL 482
           GL
Sbjct: 481 GL 479

BLAST of CsGy1G026460 vs. NCBI nr
Match: XP_022988094.1 (pentatricopeptide repeat-containing protein At3g18970 [Cucurbita maxima])

HSP 1 Score: 751.1 bits (1938), Expect = 2.3e-213
Identity = 370/478 (77.41%), Postives = 412/478 (86.19%), Query Frame = 0

Query: 1   MHLAPRLSCIHHLSNVRISS-LQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAH 60
           MHL+PRLSCIHHL+N+  SS LQL QI AQLITN FKSPSPYA                 
Sbjct: 1   MHLSPRLSCIHHLNNILNSSLLQLKQIQAQLITNAFKSPSPYAXXXXXXXXXXXXXXXXX 60

Query: 61  AHLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAP 120
                  H++  NLFLFNTLIRCAPP HSISIFA  +ST+HFEFDDFTFIF+LGACARAP
Sbjct: 61  XXXXXXXHRHPTNLFLFNTLIRCAPPQHSISIFANSLSTAHFEFDDFTFIFLLGACARAP 120

Query: 121 SVSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNA 180
           S  TL  G+QIHTH+LKRG+VSNIWVQTTMIHFY+INKDVG+ARK+FDEMS+RN+VTWNA
Sbjct: 121 SPPTLTTGKQIHTHVLKRGVVSNIWVQTTMIHFYAINKDVGAARKVFDEMSVRNNVTWNA 180

Query: 181 MIAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGS 240
           MIAGYCSQ G+V+QKY R+ALELFRGMLVESTN +V PTDTTMVC+LSAASQLG+LETG 
Sbjct: 181 MIAGYCSQSGRVAQKYGREALELFRGMLVESTNSDVNPTDTTMVCLLSAASQLGVLETGV 240

Query: 241 CVHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAV 300
           CVHAYI+KT DSPE DVFIGTGLVNMYSKCG LNSASSVFKQMKQ+NVLTWT+MATGLA+
Sbjct: 241 CVHAYIEKTTDSPENDVFIGTGLVNMYSKCGCLNSASSVFKQMKQRNVLTWTAMATGLAI 300

Query: 301 HGRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQ 360
           HG+GKEALELL+AMG HGVKPNAVTFTSLLS CCHGGLIEEGLHLF VMERKFGVVPQMQ
Sbjct: 301 HGKGKEALELLEAMGGHGVKPNAVTFTSLLSGCCHGGLIEEGLHLFDVMERKFGVVPQMQ 360

Query: 361 HYGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVER 420
           HYGCIVDLLGR GHL+EAY+LIL MP+ PDGVLWRSL+SSCM+H DVEMGE+VGK LVE 
Sbjct: 361 HYGCIVDLLGRCGHLKEAYELILGMPVAPDGVLWRSLMSSCMVHCDVEMGEKVGKFLVES 420

Query: 421 QGGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTG 478
             G    DEWC GSEDFVALSNVYAS +RW++V+A+R+EMKIKGI+NKAG SS+QT G
Sbjct: 421 VVG----DEWCDGSEDFVALSNVYASAQRWENVKAVREEMKIKGIQNKAGYSSVQTRG 474

BLAST of CsGy1G026460 vs. TAIR10
Match: AT3G18970.1 (mitochondrial editing factor 20)

HSP 1 Score: 441.0 bits (1133), Expect = 9.3e-124
Identity = 240/456 (52.63%), Postives = 312/456 (68.42%), Query Frame = 0

Query: 21  LQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAH-AHLIFRHHQYSPNLFLFNTL 80
           +Q  QIHAQL+ NG    S + KLI H C K S+ES +  AHL+       P+ FLFNTL
Sbjct: 22  IQAKQIHAQLVINGCHDNSLFGKLIGHYCSKPSTESSSKLAHLLVFPRFGHPDKFLFNTL 81

Query: 81  IRCAPPHHSISIFATWVSTSHFEF-DDFTFIFVLGACARAPSVSTLMIGRQIHTHILKRG 140
           ++C+ P  SI IFA + S S   + ++ TF+FVLGACAR+ S S L +GR +H  + K G
Sbjct: 82  LKCSKPEDSIRIFANYASKSSLLYLNERTFVFVLGACARSASSSALRVGRIVHGMVKKLG 141

Query: 141 -IVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAMIAGYCSQGGKVSQKYAR 200
            +  +  + TT++HFY+ N D+  ARK+FDEM  R SVTWNAMI GYCS   K +   AR
Sbjct: 142 FLYESELIGTTLLHFYAKNGDLRYARKVFDEMPERTSVTWNAMIGGYCSHKDKGNHN-AR 201

Query: 201 DALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSCVHAYIKKTVDSPEKDVF 260
            A+ LFR          V+PTDTTMVC+LSA SQ G+LE GS VH YI+K   +PE DVF
Sbjct: 202 KAMVLFRRF--SCCGSGVRPTDTTMVCVLSAISQTGLLEIGSLVHGYIEKLGFTPEVDVF 261

Query: 261 IGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVHGRGKEALELLDAMGAHG 320
           IGT LV+MYSKCG LN+A SVF+ MK KNV TWTSMATGLA++GRG E   LL+ M   G
Sbjct: 262 IGTALVDMYSKCGCLNNAFSVFELMKVKNVFTWTSMATGLALNGRGNETPNLLNRMAESG 321

Query: 321 VKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQHYGCIVDLLGRSGHLREA 380
           +KPN +TFTSLLSA  H GL+EEG+ LF+ M+ +FGV P ++HYGCIVDLLG++G ++EA
Sbjct: 322 IKPNEITFTSLLSAYRHIGLVEEGIELFKSMKTRFGVTPVIEHYGCIVDLLGKAGRIQEA 381

Query: 381 YKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQGGESFDDEWCVGS--ED 440
           Y+ IL MP++PD +L RSL ++C ++G+  MGE +GK L+E +     +DE   GS  ED
Sbjct: 382 YQFILAMPIKPDAILLRSLCNACSIYGETVMGEEIGKALLEIER----EDEKLSGSECED 441

Query: 441 FVALSNVYASVERWDDVEALRDEMKIKGIENKAGCS 472
           +VALSNV A   +W +VE LR EMK + I+ + G S
Sbjct: 442 YVALSNVLAHKGKWVEVEKLRKEMKERRIKTRPGYS 470

BLAST of CsGy1G026460 vs. TAIR10
Match: AT1G59720.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 268.1 bits (684), Expect = 1.1e-71
Identity = 169/469 (36.03%), Postives = 251/469 (53.52%), Query Frame = 0

Query: 22  QLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSES-IAHAHLIFRHHQYSPNLFLFNTLI 81
           QL Q+HA  +   +        L   + + SSS S + +A  +F   + + + F++NTLI
Sbjct: 63  QLKQLHAFTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSIE-NHSSFMWNTLI 122

Query: 82  R-CAPPHHSIS-------IFATWVSTSHFEFDDFTFIFVLGACARAPSVSTLMIGRQIHT 141
           R CA   H +S       ++   +       D  TF FVL ACA     S    G+Q+H 
Sbjct: 123 RACA---HDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSE---GKQVHC 182

Query: 142 HILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAMIAGYCSQGGKVS 201
            I+K G   +++V   +IH Y     +  ARK+FDEM  R+ V+WN+MI      G   S
Sbjct: 183 QIVKHGFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMIDALVRFGEYDS 242

Query: 202 QKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSCVHAYIKKTVD-S 261
                 AL+LFR M     +FE  P   TM  +LSA + LG L  G+  HA++ +  D  
Sbjct: 243 ------ALQLFREM---QRSFE--PDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVD 302

Query: 262 PEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVHGRGKEALELLD 321
              DV +   L+ MY KCG L  A  VF+ M+++++ +W +M  G A HGR +EA+   D
Sbjct: 303 VAMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFD 362

Query: 322 AM--GAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQHYGCIVDLLG 381
            M      V+PN+VTF  LL AC H G + +G   F +M R + + P ++HYGCIVDL+ 
Sbjct: 363 RMVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIA 422

Query: 382 RSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHG-DVEMGERVGKLLVERQGGESFDDE 441
           R+G++ EA  +++ MPM+PD V+WRSLL +C   G  VE+ E + + ++  +      + 
Sbjct: 423 RAGYITEAIDMVMSMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNG 482

Query: 442 WCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTG 478
            C G+  +V LS VYAS  RW+DV  +R  M   GI  + GCSS++  G
Sbjct: 483 NCSGA--YVLLSRVYASASRWNDVGIVRKLMSEHGIRKEPGCSSIEING 511

BLAST of CsGy1G026460 vs. TAIR10
Match: AT5G56310.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 266.2 bits (679), Expect = 4.1e-71
Identity = 168/491 (34.22%), Postives = 264/491 (53.77%), Query Frame = 0

Query: 23  LLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHAHLIFRHHQYSPNLFLFNTLIRC 82
           L Q H  +I  G    +         C  S++  + +A+ +F  HQ  PN +L NT+IR 
Sbjct: 31  LKQSHCYMIITGLNRDNLNVAKFIEAC--SNAGHLRYAYSVFT-HQPCPNTYLHNTMIRA 90

Query: 83  -----APPHHSISIFA---TWVSTSHFEFDDFTFIFVLGACARAPSVSTLMIGRQIHTHI 142
                 P  HSI+I      W   +  + D FTF FVL    R   VS +  GRQIH  +
Sbjct: 91  LSLLDEPNAHSIAITVYRKLWALCA--KPDTFTFPFVLKIAVR---VSDVWFGRQIHGQV 150

Query: 143 LKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAMIAGYCSQGGKVSQ- 202
           +  G  S++ V T +I  Y     +G ARK+FDEM +++   WNA++AGY    GKV + 
Sbjct: 151 VVFGFDSSVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVWNALLAGY----GKVGEM 210

Query: 203 KYARDALEL------------------------------FRGMLVESTNFEVKPTDTTMV 262
             AR  LE+                              F+ ML+E+    V+P + T++
Sbjct: 211 DEARSLLEMMPCWVRNEVXXXXXXXXXXXXXXXXXXXXXFQRMLMEN----VEPDEVTLL 270

Query: 263 CILSAASQLGMLETGSCVHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMK 322
            +LSA + LG LE G  + +Y+        + V +   +++MY+K G +  A  VF+ + 
Sbjct: 271 AVLSACADLGSLELGERICSYVDHR--GMNRAVSLNNAVIDMYAKSGNITKALDVFECVN 330

Query: 323 QKNVLTWTSMATGLAVHGRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLH 382
           ++NV+TWT++  GLA HG G EAL + + M   GV+PN VTF ++LSAC H G ++ G  
Sbjct: 331 ERNVVTWTTIIAGLATHGHGAEALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKR 390

Query: 383 LFRVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLH 442
           LF  M  K+G+ P ++HYGC++DLLGR+G LREA ++I  MP + +  +W SLL++  +H
Sbjct: 391 LFNSMRSKYGIHPNIEHYGCMIDLLGRAGKLREADEVIKSMPFKANAAIWGSLLAASNVH 450

Query: 443 GDVEMGERVGKLLVERQGGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKG 475
            D+E+GER    L++ +            S +++ L+N+Y+++ RWD+   +R+ MK  G
Sbjct: 451 HDLELGERALSELIKLEPN---------NSGNYMLLANLYSNLGRWDESRMMRNMMKGIG 494

BLAST of CsGy1G026460 vs. TAIR10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 265.4 bits (677), Expect = 7.0e-71
Identity = 168/476 (35.29%), Postives = 258/476 (54.20%), Query Frame = 0

Query: 12  HLSNVRISS---LQLLQIHAQLI-TNGFKSPSPYAKLITHLCKKSSSESIAHAHLIFRHH 71
           HL ++ +SS   L L QIHA L+ T+  ++   +   ++ L        I ++  +F   
Sbjct: 13  HLLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVF-SQ 72

Query: 72  QYSPNLFLFNTLIRC----APPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPSVST 131
           + +P L   NT+IR       P     +F +    S    +  +  F L  C ++     
Sbjct: 73  RLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKS---GD 132

Query: 132 LMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAMIAG 191
           L+ G QIH  I   G +S+  + TT++  YS  ++   A K+FDE+  R++V+WN + + 
Sbjct: 133 LLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSC 192

Query: 192 YCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSCVHA 251
           Y      +  K  RD L LF  M     +  VKP   T +  L A + LG L+ G  VH 
Sbjct: 193 Y------LRNKRTRDVLVLFDKM-KNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHD 252

Query: 252 YIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVHGRG 311
           +I +   S   +  +   LV+MYS+CG ++ A  VF  M+++NV++WT++ +GLA++G G
Sbjct: 253 FIDENGLSGALN--LSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFG 312

Query: 312 KEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLF-RVMERKFGVVPQMQHYG 371
           KEA+E  + M   G+ P   T T LLSAC H GL+ EG+  F R+   +F + P + HYG
Sbjct: 313 KEAIEAFNEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYG 372

Query: 372 CIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQGG 431
           C+VDLLGR+  L +AY LI  M M+PD  +WR+LL +C +HGDVE+GERV   L+E +  
Sbjct: 373 CVVDLLGRARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAE 432

Query: 432 ESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGS 479
           E         + D+V L N Y++V +W+ V  LR  MK K I  K GCS+++  G+
Sbjct: 433 E---------AGDYVLLLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGT 466

BLAST of CsGy1G026460 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 261.2 bits (666), Expect = 1.3e-69
Identity = 134/371 (36.12%), Postives = 215/371 (57.95%), Query Frame = 0

Query: 104 DDFTFIFVLGACARAPSVSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSAR 163
           D+ T + V+ ACA++ S+    +GRQ+H  I   G  SN+ +   +I  YS   ++ +A 
Sbjct: 265 DESTMVTVVSACAQSGSIE---LGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETAC 324

Query: 164 KLFDEMSLRNSVTWNAMIAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMV 223
            LF+ +  ++ ++WN +I GY            ++AL LF+ ML         P D TM+
Sbjct: 325 GLFERLPYKDVISWNTLIGGY------THMNLYKEALLLFQEMLRSGET----PNDVTML 384

Query: 224 CILSAASQLGMLETGSCVHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMK 283
            IL A + LG ++ G  +H YI K +        + T L++MY+KCG + +A  VF  + 
Sbjct: 385 SILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSIL 444

Query: 284 QKNVLTWTSMATGLAVHGRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLH 343
            K++ +W +M  G A+HGR   + +L   M   G++P+ +TF  LLSAC H G+++ G H
Sbjct: 445 HKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRH 504

Query: 344 LFRVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLH 403
           +FR M + + + P+++HYGC++DLLG SG  +EA ++I  M MEPDGV+W SLL +C +H
Sbjct: 505 IFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMH 564

Query: 404 GDVEMGERVGKLLVERQGGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKG 463
           G+VE+GE   + L++ +           GS  +V LSN+YAS  RW++V   R  +  KG
Sbjct: 565 GNVELGESFAENLIKIEPENP-------GS--YVLLSNIYASAGRWNEVAKTRALLNDKG 613

Query: 464 IENKAGCSSLQ 475
           ++   GCSS++
Sbjct: 625 MKKVPGCSSIE 613

BLAST of CsGy1G026460 vs. Swiss-Prot
Match: sp|Q9LJ69|PP243_ARATH (Pentatricopeptide repeat-containing protein At3g18970 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E93 PE=2 SV=1)

HSP 1 Score: 441.0 bits (1133), Expect = 1.7e-122
Identity = 240/456 (52.63%), Postives = 312/456 (68.42%), Query Frame = 0

Query: 21  LQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAH-AHLIFRHHQYSPNLFLFNTL 80
           +Q  QIHAQL+ NG    S + KLI H C K S+ES +  AHL+       P+ FLFNTL
Sbjct: 22  IQAKQIHAQLVINGCHDNSLFGKLIGHYCSKPSTESSSKLAHLLVFPRFGHPDKFLFNTL 81

Query: 81  IRCAPPHHSISIFATWVSTSHFEF-DDFTFIFVLGACARAPSVSTLMIGRQIHTHILKRG 140
           ++C+ P  SI IFA + S S   + ++ TF+FVLGACAR+ S S L +GR +H  + K G
Sbjct: 82  LKCSKPEDSIRIFANYASKSSLLYLNERTFVFVLGACARSASSSALRVGRIVHGMVKKLG 141

Query: 141 -IVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAMIAGYCSQGGKVSQKYAR 200
            +  +  + TT++HFY+ N D+  ARK+FDEM  R SVTWNAMI GYCS   K +   AR
Sbjct: 142 FLYESELIGTTLLHFYAKNGDLRYARKVFDEMPERTSVTWNAMIGGYCSHKDKGNHN-AR 201

Query: 201 DALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSCVHAYIKKTVDSPEKDVF 260
            A+ LFR          V+PTDTTMVC+LSA SQ G+LE GS VH YI+K   +PE DVF
Sbjct: 202 KAMVLFRRF--SCCGSGVRPTDTTMVCVLSAISQTGLLEIGSLVHGYIEKLGFTPEVDVF 261

Query: 261 IGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVHGRGKEALELLDAMGAHG 320
           IGT LV+MYSKCG LN+A SVF+ MK KNV TWTSMATGLA++GRG E   LL+ M   G
Sbjct: 262 IGTALVDMYSKCGCLNNAFSVFELMKVKNVFTWTSMATGLALNGRGNETPNLLNRMAESG 321

Query: 321 VKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQHYGCIVDLLGRSGHLREA 380
           +KPN +TFTSLLSA  H GL+EEG+ LF+ M+ +FGV P ++HYGCIVDLLG++G ++EA
Sbjct: 322 IKPNEITFTSLLSAYRHIGLVEEGIELFKSMKTRFGVTPVIEHYGCIVDLLGKAGRIQEA 381

Query: 381 YKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQGGESFDDEWCVGS--ED 440
           Y+ IL MP++PD +L RSL ++C ++G+  MGE +GK L+E +     +DE   GS  ED
Sbjct: 382 YQFILAMPIKPDAILLRSLCNACSIYGETVMGEEIGKALLEIER----EDEKLSGSECED 441

Query: 441 FVALSNVYASVERWDDVEALRDEMKIKGIENKAGCS 472
           +VALSNV A   +W +VE LR EMK + I+ + G S
Sbjct: 442 YVALSNVLAHKGKWVEVEKLRKEMKERRIKTRPGYS 470

BLAST of CsGy1G026460 vs. Swiss-Prot
Match: sp|Q0WQW5|PPR85_ARATH (Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H51 PE=1 SV=2)

HSP 1 Score: 268.1 bits (684), Expect = 1.9e-70
Identity = 169/469 (36.03%), Postives = 251/469 (53.52%), Query Frame = 0

Query: 22  QLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSES-IAHAHLIFRHHQYSPNLFLFNTLI 81
           QL Q+HA  +   +        L   + + SSS S + +A  +F   + + + F++NTLI
Sbjct: 63  QLKQLHAFTLRTTYPEEPATLFLYGKILQLSSSFSDVNYAFRVFDSIE-NHSSFMWNTLI 122

Query: 82  R-CAPPHHSIS-------IFATWVSTSHFEFDDFTFIFVLGACARAPSVSTLMIGRQIHT 141
           R CA   H +S       ++   +       D  TF FVL ACA     S    G+Q+H 
Sbjct: 123 RACA---HDVSRKEEAFMLYRKMLERGESSPDKHTFPFVLKACAYIFGFSE---GKQVHC 182

Query: 142 HILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAMIAGYCSQGGKVS 201
            I+K G   +++V   +IH Y     +  ARK+FDEM  R+ V+WN+MI      G   S
Sbjct: 183 QIVKHGFGGDVYVNNGLIHLYGSCGCLDLARKVFDEMPERSLVSWNSMIDALVRFGEYDS 242

Query: 202 QKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSCVHAYIKKTVD-S 261
                 AL+LFR M     +FE  P   TM  +LSA + LG L  G+  HA++ +  D  
Sbjct: 243 ------ALQLFREM---QRSFE--PDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVD 302

Query: 262 PEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVHGRGKEALELLD 321
              DV +   L+ MY KCG L  A  VF+ M+++++ +W +M  G A HGR +EA+   D
Sbjct: 303 VAMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFD 362

Query: 322 AM--GAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQHYGCIVDLLG 381
            M      V+PN+VTF  LL AC H G + +G   F +M R + + P ++HYGCIVDL+ 
Sbjct: 363 RMVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIA 422

Query: 382 RSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHG-DVEMGERVGKLLVERQGGESFDDE 441
           R+G++ EA  +++ MPM+PD V+WRSLL +C   G  VE+ E + + ++  +      + 
Sbjct: 423 RAGYITEAIDMVMSMPMKPDAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNG 482

Query: 442 WCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTG 478
            C G+  +V LS VYAS  RW+DV  +R  M   GI  + GCSS++  G
Sbjct: 483 NCSGA--YVLLSRVYASASRWNDVGIVRKLMSEHGIRKEPGCSSIEING 511

BLAST of CsGy1G026460 vs. Swiss-Prot
Match: sp|Q9FMA1|PP433_ARATH (Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E13 PE=2 SV=1)

HSP 1 Score: 266.2 bits (679), Expect = 7.4e-70
Identity = 168/491 (34.22%), Postives = 264/491 (53.77%), Query Frame = 0

Query: 23  LLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHAHLIFRHHQYSPNLFLFNTLIRC 82
           L Q H  +I  G    +         C  S++  + +A+ +F  HQ  PN +L NT+IR 
Sbjct: 31  LKQSHCYMIITGLNRDNLNVAKFIEAC--SNAGHLRYAYSVFT-HQPCPNTYLHNTMIRA 90

Query: 83  -----APPHHSISIFA---TWVSTSHFEFDDFTFIFVLGACARAPSVSTLMIGRQIHTHI 142
                 P  HSI+I      W   +  + D FTF FVL    R   VS +  GRQIH  +
Sbjct: 91  LSLLDEPNAHSIAITVYRKLWALCA--KPDTFTFPFVLKIAVR---VSDVWFGRQIHGQV 150

Query: 143 LKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAMIAGYCSQGGKVSQ- 202
           +  G  S++ V T +I  Y     +G ARK+FDEM +++   WNA++AGY    GKV + 
Sbjct: 151 VVFGFDSSVHVVTGLIQMYFSCGGLGDARKMFDEMLVKDVNVWNALLAGY----GKVGEM 210

Query: 203 KYARDALEL------------------------------FRGMLVESTNFEVKPTDTTMV 262
             AR  LE+                              F+ ML+E+    V+P + T++
Sbjct: 211 DEARSLLEMMPCWVRNEVXXXXXXXXXXXXXXXXXXXXXFQRMLMEN----VEPDEVTLL 270

Query: 263 CILSAASQLGMLETGSCVHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMK 322
            +LSA + LG LE G  + +Y+        + V +   +++MY+K G +  A  VF+ + 
Sbjct: 271 AVLSACADLGSLELGERICSYVDHR--GMNRAVSLNNAVIDMYAKSGNITKALDVFECVN 330

Query: 323 QKNVLTWTSMATGLAVHGRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLH 382
           ++NV+TWT++  GLA HG G EAL + + M   GV+PN VTF ++LSAC H G ++ G  
Sbjct: 331 ERNVVTWTTIIAGLATHGHGAEALAMFNRMVKAGVRPNDVTFIAILSACSHVGWVDLGKR 390

Query: 383 LFRVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLH 442
           LF  M  K+G+ P ++HYGC++DLLGR+G LREA ++I  MP + +  +W SLL++  +H
Sbjct: 391 LFNSMRSKYGIHPNIEHYGCMIDLLGRAGKLREADEVIKSMPFKANAAIWGSLLAASNVH 450

Query: 443 GDVEMGERVGKLLVERQGGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKG 475
            D+E+GER    L++ +            S +++ L+N+Y+++ RWD+   +R+ MK  G
Sbjct: 451 HDLELGERALSELIKLEPN---------NSGNYMLLANLYSNLGRWDESRMMRNMMKGIG 494

BLAST of CsGy1G026460 vs. Swiss-Prot
Match: sp|Q9SN85|PP267_ARATH (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 1.3e-69
Identity = 168/476 (35.29%), Postives = 258/476 (54.20%), Query Frame = 0

Query: 12  HLSNVRISS---LQLLQIHAQLI-TNGFKSPSPYAKLITHLCKKSSSESIAHAHLIFRHH 71
           HL ++ +SS   L L QIHA L+ T+  ++   +   ++ L        I ++  +F   
Sbjct: 13  HLLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVF-SQ 72

Query: 72  QYSPNLFLFNTLIRC----APPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPSVST 131
           + +P L   NT+IR       P     +F +    S    +  +  F L  C ++     
Sbjct: 73  RLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKS---GD 132

Query: 132 LMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAMIAG 191
           L+ G QIH  I   G +S+  + TT++  YS  ++   A K+FDE+  R++V+WN + + 
Sbjct: 133 LLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSC 192

Query: 192 YCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSCVHA 251
           Y      +  K  RD L LF  M     +  VKP   T +  L A + LG L+ G  VH 
Sbjct: 193 Y------LRNKRTRDVLVLFDKM-KNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHD 252

Query: 252 YIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVHGRG 311
           +I +   S   +  +   LV+MYS+CG ++ A  VF  M+++NV++WT++ +GLA++G G
Sbjct: 253 FIDENGLSGALN--LSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFG 312

Query: 312 KEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLF-RVMERKFGVVPQMQHYG 371
           KEA+E  + M   G+ P   T T LLSAC H GL+ EG+  F R+   +F + P + HYG
Sbjct: 313 KEAIEAFNEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYG 372

Query: 372 CIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQGG 431
           C+VDLLGR+  L +AY LI  M M+PD  +WR+LL +C +HGDVE+GERV   L+E +  
Sbjct: 373 CVVDLLGRARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAE 432

Query: 432 ESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGS 479
           E         + D+V L N Y++V +W+ V  LR  MK K I  K GCS+++  G+
Sbjct: 433 E---------AGDYVLLLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIELQGT 466

BLAST of CsGy1G026460 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 261.2 bits (666), Expect = 2.4e-68
Identity = 134/371 (36.12%), Postives = 215/371 (57.95%), Query Frame = 0

Query: 104 DDFTFIFVLGACARAPSVSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSAR 163
           D+ T + V+ ACA++ S+    +GRQ+H  I   G  SN+ +   +I  YS   ++ +A 
Sbjct: 265 DESTMVTVVSACAQSGSIE---LGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETAC 324

Query: 164 KLFDEMSLRNSVTWNAMIAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMV 223
            LF+ +  ++ ++WN +I GY            ++AL LF+ ML         P D TM+
Sbjct: 325 GLFERLPYKDVISWNTLIGGY------THMNLYKEALLLFQEMLRSGET----PNDVTML 384

Query: 224 CILSAASQLGMLETGSCVHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMK 283
            IL A + LG ++ G  +H YI K +        + T L++MY+KCG + +A  VF  + 
Sbjct: 385 SILPACAHLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSIL 444

Query: 284 QKNVLTWTSMATGLAVHGRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLH 343
            K++ +W +M  G A+HGR   + +L   M   G++P+ +TF  LLSAC H G+++ G H
Sbjct: 445 HKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRH 504

Query: 344 LFRVMERKFGVVPQMQHYGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLH 403
           +FR M + + + P+++HYGC++DLLG SG  +EA ++I  M MEPDGV+W SLL +C +H
Sbjct: 505 IFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMH 564

Query: 404 GDVEMGERVGKLLVERQGGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKG 463
           G+VE+GE   + L++ +           GS  +V LSN+YAS  RW++V   R  +  KG
Sbjct: 565 GNVELGESFAENLIKIEPENP-------GS--YVLLSNIYASAGRWNEVAKTRALLNDKG 613

Query: 464 IENKAGCSSLQ 475
           ++   GCSS++
Sbjct: 625 MKKVPGCSSIE 613

BLAST of CsGy1G026460 vs. TrEMBL
Match: tr|A0A0A0LZ63|A0A0A0LZ63_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G573660 PE=4 SV=1)

HSP 1 Score: 983.0 bits (2540), Expect = 2.4e-283
Identity = 486/486 (100.00%), Postives = 486/486 (100.00%), Query Frame = 0

Query: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60
           MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA
Sbjct: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60

Query: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120
           HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS
Sbjct: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120

Query: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180
           VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM
Sbjct: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180

Query: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240
           IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC
Sbjct: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240

Query: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300
           VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH
Sbjct: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300

Query: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360
           GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH
Sbjct: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360

Query: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ
Sbjct: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420

Query: 421 GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG 480
           GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG
Sbjct: 421 GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG 480

Query: 481 LVEALL 487
           LVEALL
Sbjct: 481 LVEALL 486

BLAST of CsGy1G026460 vs. TrEMBL
Match: tr|A0A1S3BPA5|A0A1S3BPA5_CUCME (pentatricopeptide repeat-containing protein At3g18970 OS=Cucumis melo OX=3656 GN=LOC103492230 PE=4 SV=1)

HSP 1 Score: 939.9 bits (2428), Expect = 2.3e-270
Identity = 462/486 (95.06%), Postives = 474/486 (97.53%), Query Frame = 0

Query: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAHA 60
           MHLAPRLSCI+HLSN+RISSLQLLQIHAQ ITNGFKSPSPYAKLITHLCKKSSSESIAHA
Sbjct: 9   MHLAPRLSCINHLSNIRISSLQLLQIHAQFITNGFKSPSPYAKLITHLCKKSSSESIAHA 68

Query: 61  HLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAPS 120
           HLIFRHHQ+SPNLFLFNTLIRCAPP +SISIFA WVST HFEFDDFTFIFVLGACARAPS
Sbjct: 69  HLIFRHHQHSPNLFLFNTLIRCAPPQYSISIFANWVSTPHFEFDDFTFIFVLGACARAPS 128

Query: 121 VSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNAM 180
           VSTLMIGRQIHTHILKRGIVSNIW QTTMIHFYS NKDVGSARK+FDEMS+RNSVTWNAM
Sbjct: 129 VSTLMIGRQIHTHILKRGIVSNIWAQTTMIHFYSTNKDVGSARKVFDEMSVRNSVTWNAM 188

Query: 181 IAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGSC 240
           IAGYCSQ GKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAAS LGMLETG C
Sbjct: 189 IAGYCSQSGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASHLGMLETGVC 248

Query: 241 VHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAVH 300
           VHAYIKKT+DSPEKDVFIGTGLVNMYSKCGLL+SASSVFKQMKQ+NVLTWTSMATGLAVH
Sbjct: 249 VHAYIKKTIDSPEKDVFIGTGLVNMYSKCGLLSSASSVFKQMKQRNVLTWTSMATGLAVH 308

Query: 301 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 360
           GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH
Sbjct: 309 GRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQH 368

Query: 361 YGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 420
           YGCIVDLLGRSGHLREAY+LILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ
Sbjct: 369 YGCIVDLLGRSGHLREAYELILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVERQ 428

Query: 421 GGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGSQG 480
           GGESFDDEWCVGSEDFVALSNVYAS ERWDDVEALR+EMKIKGIENKAG SS+QTTGSQG
Sbjct: 429 GGESFDDEWCVGSEDFVALSNVYASAERWDDVEALREEMKIKGIENKAGFSSVQTTGSQG 488

Query: 481 LVEALL 487
           LVE LL
Sbjct: 489 LVETLL 494

BLAST of CsGy1G026460 vs. TrEMBL
Match: tr|A0A2P5B3D3|A0A2P5B3D3_PARAD (Pentatricopeptide repeat OS=Parasponia andersonii OX=3476 GN=PanWU01x14_275160 PE=4 SV=1)

HSP 1 Score: 596.3 bits (1536), Expect = 6.3e-167
Identity = 310/477 (64.99%), Postives = 364/477 (76.31%), Query Frame = 0

Query: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAH- 60
           M   PRL  I  L    +S  QL Q HAQLI NG  SPS  AKLI   C  S+ +S  H 
Sbjct: 1   MLFLPRLRAIGFLRLKLLSIYQLKQAHAQLIVNGLNSPSLVAKLIQQYCSLSNQKSKYHN 60

Query: 61  AHLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAP 120
           A L+F+H    PNLFL NTLIRC+ P  SI +FA WVS   FEFDDFT+IFVLGACAR+P
Sbjct: 61  AQLVFKHFD-QPNLFLLNTLIRCSQPKESILVFAQWVSRGDFEFDDFTYIFVLGACARSP 120

Query: 121 SVSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNA 180
           SV TL  GRQIH  I+K G +SNI VQTT+IH Y+ NKD+GSAR++FDEM +RN+VTWNA
Sbjct: 121 SVPTLWAGRQIHARIMKCGTISNIMVQTTVIHSYASNKDMGSARRVFDEMVVRNNVTWNA 180

Query: 181 MIAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGS 240
           MI GYCSQ G      A DAL LFR ML +      KPTDTT+VCILSAASQ G+LETG+
Sbjct: 181 MITGYCSQKGS-----ACDALVLFRDMLDDVDG--AKPTDTTIVCILSAASQFGVLETGA 240

Query: 241 CVHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAV 300
           CVH YI+KT   PE DVFIGTGLV+MYSKCG LNSA S+F +MK+KN+LTWT+MATGLA+
Sbjct: 241 CVHGYIEKTNWVPENDVFIGTGLVDMYSKCGCLNSALSIFIRMKEKNILTWTAMATGLAI 300

Query: 301 HGRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQ 360
           HG+GKEAL+LLDAMG  G+KPNAVTFTSLL ACCH GL+EEGLHLF  M  KF + PQMQ
Sbjct: 301 HGKGKEALQLLDAMGPSGLKPNAVTFTSLLLACCHSGLVEEGLHLFYNMS-KFNITPQMQ 360

Query: 361 HYGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVER 420
           HYGCIVDLL R+G L+EAY+ I+ MP+EPD +LWRSLLS+  +HGDV MGE+VGKLL++ 
Sbjct: 361 HYGCIVDLLSRTGLLKEAYEFIMAMPIEPDTILWRSLLSASRIHGDVSMGEKVGKLLLQI 420

Query: 421 QGGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTT 477
              +S  D   V SED+VALSN+YASV +W DVE LR+ MK+KGI+NKAGCSS+QTT
Sbjct: 421 HQEQSSLD---VTSEDYVALSNIYASVGKWADVEMLRENMKVKGIDNKAGCSSIQTT 465

BLAST of CsGy1G026460 vs. TrEMBL
Match: tr|M5WFE9|M5WFE9_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G343200 PE=4 SV=1)

HSP 1 Score: 593.6 bits (1529), Expect = 4.1e-166
Identity = 299/480 (62.29%), Postives = 370/480 (77.08%), Query Frame = 0

Query: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNG-FKSPSPYAKLITHLCKKSSSESI-A 60
           MH  PR+  +  L+    S+ QL + HAQLIT+G  KSP+ YAKLI      S  +S   
Sbjct: 1   MHHLPRVRALFLLNLKLKSTHQLKRTHAQLITSGLLKSPTLYAKLIQQYGALSDPQSTNL 60

Query: 61  HAHLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARA 120
           +AH +F+H    PNLFL NTLIRC  P  SI +FA WVS +   FDDFT+ FVLGACAR 
Sbjct: 61  YAHFVFKHFD-EPNLFLLNTLIRCTQPKDSILVFANWVSKATLIFDDFTYKFVLGACARL 120

Query: 121 PSVSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWN 180
           PSVSTL++G QIH  I+K  +VSNI VQTT++HFY+ NKD  SAR++FDEM+++NSVTWN
Sbjct: 121 PSVSTLLVGSQIHARIIKHDVVSNILVQTTLVHFYASNKDFVSARRVFDEMAVKNSVTWN 180

Query: 181 AMIAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETG 240
           AMI GYCSQ     ++ ARDAL LFR ML +     VKPTDTTMVC+LSAASQLG+LETG
Sbjct: 181 AMITGYCSQ-----RESARDALVLFRDMLDDVCG--VKPTDTTMVCVLSAASQLGVLETG 240

Query: 241 SCVHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLA 300
           +CVH YI+K +  P  DVFIGTGLV MYSKCG ++ A S+FK+MK+KN+LTWT+MATGLA
Sbjct: 241 ACVHGYIEKAIWVPHNDVFIGTGLVGMYSKCGCVDGALSIFKRMKEKNILTWTAMATGLA 300

Query: 301 VHGRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQM 360
           +HG+G EAL LLD M A+G+KPNAVTFTSLLSACCH GL+EEGLHLF +M+  F V+PQM
Sbjct: 301 IHGKGNEALVLLDVMEAYGIKPNAVTFTSLLSACCHSGLVEEGLHLFHMMKSNFDVMPQM 360

Query: 361 QHYGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVE 420
           QHYGCIVD+L R G+L+EAY+ ++ MP+EPD VLWRSLLS+C +HGDV MGE+VGK L+ 
Sbjct: 361 QHYGCIVDMLSRRGYLKEAYEFVVGMPVEPDAVLWRSLLSACKVHGDVAMGEKVGKKLLH 420

Query: 421 RQGGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTTGS 479
            Q  ++  D   + SED+VALSN+YAS ERW+DVE +R EMK+KGIENKAGCSS+QT+ +
Sbjct: 421 IQSAQTCAD-LTLKSEDYVALSNIYASAERWEDVEMVRQEMKVKGIENKAGCSSIQTSSN 471

BLAST of CsGy1G026460 vs. TrEMBL
Match: tr|A0A2P5F8P6|A0A2P5F8P6_9ROSA (Pentatricopeptide repeat OS=Trema orientalis OX=63057 GN=TorRG33x02_101730 PE=4 SV=1)

HSP 1 Score: 590.1 bits (1520), Expect = 4.5e-165
Identity = 307/477 (64.36%), Postives = 361/477 (75.68%), Query Frame = 0

Query: 1   MHLAPRLSCIHHLSNVRISSLQLLQIHAQLITNGFKSPSPYAKLITHLCKKSSSESIAH- 60
           M   PRL  I  L    +S  QL Q HAQLI +G  SPS  AKLI   C  S+ +S  H 
Sbjct: 1   MLFLPRLRAIGFLRLKLLSIYQLKQAHAQLIVSGLNSPSLVAKLIQQYCSLSNQQSKYHN 60

Query: 61  AHLIFRHHQYSPNLFLFNTLIRCAPPHHSISIFATWVSTSHFEFDDFTFIFVLGACARAP 120
           A L+F+H    PNLFL NTLIRC+ P  SI +FA WVS   FEFDDFT+IFVLGACAR+P
Sbjct: 61  AQLVFKHFD-KPNLFLLNTLIRCSQPKESILVFAQWVSRGDFEFDDFTYIFVLGACARSP 120

Query: 121 SVSTLMIGRQIHTHILKRGIVSNIWVQTTMIHFYSINKDVGSARKLFDEMSLRNSVTWNA 180
           SV TL  GRQIH  I+K G  SNI VQTT+IH Y+ NKD+GSAR++FDEM +RN+VTWNA
Sbjct: 121 SVPTLWAGRQIHARIMKCGTTSNIMVQTTVIHSYASNKDMGSARRVFDEMVVRNNVTWNA 180

Query: 181 MIAGYCSQGGKVSQKYARDALELFRGMLVESTNFEVKPTDTTMVCILSAASQLGMLETGS 240
           MI GYCSQ G      A DAL LFR ML +      KPTDTT+VCILSAASQ G+LETG+
Sbjct: 181 MITGYCSQKGS-----ACDALVLFRDMLDDVDG--AKPTDTTIVCILSAASQFGVLETGA 240

Query: 241 CVHAYIKKTVDSPEKDVFIGTGLVNMYSKCGLLNSASSVFKQMKQKNVLTWTSMATGLAV 300
           CVH YI+KT   PE DVFIGTGLV+MYSKCG LNSA  +F +MK+KN+LTWT+MATGLA+
Sbjct: 241 CVHGYIEKTNWVPENDVFIGTGLVDMYSKCGCLNSALCIFIRMKEKNILTWTAMATGLAI 300

Query: 301 HGRGKEALELLDAMGAHGVKPNAVTFTSLLSACCHGGLIEEGLHLFRVMERKFGVVPQMQ 360
           HG+GKEAL+LLDAMG  G+KPNAVTFTSLL ACCH GL+EEGLHLF  M  KF + PQMQ
Sbjct: 301 HGKGKEALQLLDAMGPSGLKPNAVTFTSLLLACCHSGLVEEGLHLFYSMS-KFNITPQMQ 360

Query: 361 HYGCIVDLLGRSGHLREAYKLILEMPMEPDGVLWRSLLSSCMLHGDVEMGERVGKLLVER 420
           HYGCIVDLL R+G L+EAY+ I+ MP+EPD +LWRSLLS+  +HGDV MGE+VGKLL++ 
Sbjct: 361 HYGCIVDLLSRTGLLKEAYEFIMAMPIEPDTILWRSLLSASRIHGDVSMGEKVGKLLLQI 420

Query: 421 QGGESFDDEWCVGSEDFVALSNVYASVERWDDVEALRDEMKIKGIENKAGCSSLQTT 477
              +S  D   V SED+VALSN+YAS  +W DVE LR++MK+KGIENKA CSS+QTT
Sbjct: 421 HQEQSSLD---VTSEDYVALSNIYASAGKWADVEMLREDMKVKGIENKAACSSIQTT 465

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011659935.13.6e-283100.00PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis sativu... [more]
XP_008450741.13.5e-27095.06PREDICTED: pentatricopeptide repeat-containing protein At3g18970 [Cucumis melo][more]
XP_023515866.12.5e-22379.92pentatricopeptide repeat-containing protein At3g18970 [Cucurbita pepo subsp. pep... [more]
XP_022159202.19.0e-21878.01pentatricopeptide repeat-containing protein At3g18970 [Momordica charantia][more]
XP_022988094.12.3e-21377.41pentatricopeptide repeat-containing protein At3g18970 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT3G18970.19.3e-12452.63mitochondrial editing factor 20[more]
AT1G59720.11.1e-7136.03Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G56310.14.1e-7134.22Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G47530.17.0e-7135.29Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G08070.11.3e-6936.12Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9LJ69|PP243_ARATH1.7e-12252.63Pentatricopeptide repeat-containing protein At3g18970 OS=Arabidopsis thaliana OX... [more]
sp|Q0WQW5|PPR85_ARATH1.9e-7036.03Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondri... [more]
sp|Q9FMA1|PP433_ARATH7.4e-7034.22Pentatricopeptide repeat-containing protein At5g56310 OS=Arabidopsis thaliana OX... [more]
sp|Q9SN85|PP267_ARATH1.3e-6935.29Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX... [more]
sp|Q9LN01|PPR21_ARATH2.4e-6836.12Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LZ63|A0A0A0LZ63_CUCSA2.4e-283100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G573660 PE=4 SV=1[more]
tr|A0A1S3BPA5|A0A1S3BPA5_CUCME2.3e-27095.06pentatricopeptide repeat-containing protein At3g18970 OS=Cucumis melo OX=3656 GN... [more]
tr|A0A2P5B3D3|A0A2P5B3D3_PARAD6.3e-16764.99Pentatricopeptide repeat OS=Parasponia andersonii OX=3476 GN=PanWU01x14_275160 P... [more]
tr|M5WFE9|M5WFE9_PRUPE4.1e-16662.29Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_6G343200 PE=4 SV=1[more]
tr|A0A2P5F8P6|A0A2P5F8P6_9ROSA4.5e-16564.36Pentatricopeptide repeat OS=Trema orientalis OX=63057 GN=TorRG33x02_101730 PE=4 ... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0080156 mitochondrial mRNA modification
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
cellular_component GO:0005739 mitochondrion
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy1G026460.1CsGy1G026460.1mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 175..188
e-value: 0.034
score: 14.3
coord: 147..171
e-value: 0.025
score: 14.7
coord: 361..385
e-value: 0.079
score: 13.1
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 285..334
e-value: 2.8E-11
score: 43.3
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 323..356
e-value: 2.2E-6
score: 25.5
coord: 147..171
e-value: 0.0033
score: 15.5
coord: 289..322
e-value: 1.6E-4
score: 19.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 432..466
score: 7.048
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 357..387
score: 6.697
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 142..172
score: 6.588
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 173..210
score: 7.596
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 321..351
score: 10.249
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 389..419
score: 6.182
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 286..320
score: 10.972
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 255..285
score: 7.947
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 81..254
e-value: 1.6E-22
score: 82.3
coord: 255..479
e-value: 2.0E-40
score: 141.0
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 18..476
NoneNo IPR availablePANTHERPTHR24015:SF873SUBFAMILY NOT NAMEDcoord: 18..476