CsGy1G012070 (gene) Cucumber (Gy14) v2

Name: CsGy1G012070
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like
Location: Chr1 : 7589196 .. 7592028 (+)
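
The location field can be turned into a span length with a few lines of Python. This is only a sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the GFF-style convention), which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr1 : 7589196 .. 7592028 (+)'."""
    pattern = r"\s*(\S+?)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, so both ends count.
        "length": end - start + 1,
    }

loc = parse_location("Chr1 : 7589196 .. 7592028 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2833 bp on the plus strand of Chr1; with 0-based half-open coordinates the result would be one base shorter.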
The following sequences are available for this feature:

Gene sequence (with intron)

CTAATTTCGACCTAAAGTTTTTCTTTACATCAGCTGAAGAGCTCACGTACTTGGGCAGCGCCTAGTTTCTCCATTTTTTCTCCTCTCTTTTGCCTTTCCAAATGAAGCTCTTACAATCTCTAAAACCAATTAATTTGATTGCATTCCGGAGTGAATTCGTGAATTTCTACTCTACAGTTGTGAAGGATAACCTTTACAGAAGGATTTCTCCGGTGGGTGACCCTAATATCTCTGTAACTCCACTTCTTGATCAGTGGGTGTTAGAAGGCAGGCTTGTTCAGCAAGACGAACTTCGGCATATCATCAAGGAGCTTAGGGTTTACAAGCGGTTCAAACATGCTCTTGAGGTTTGATTTCTTATTCATATCTTCGTCGAAGTATGATGGTTTTCATTCGCTCTTATGCGAAATGCTTTTGTTGATTGATGCAAATGGGTCCCAACGATTTGGCATTGTTGATTTTTTAAGATGAATAATGTGTTCATCTATGTTCCAGCATGGTTTCTTCTCCTATGAAATTGCTAAGCCGCCAGGATTTTTTATTTGCTTTGTTGAAATAGACAGTATGTGATAATTGGCGTGGAGTTGCATTGTTTCATGGATAAATGTCTTTTCGTAATTAACCAACTTCGAGAAGGGATATGGCTATTCAAATGGTGATGAATAGTCTACACATATGTCTTTCTTCCCAAATTTTGTTCGCCGGTAAAACATTCTGCCTTTTGTAAATTTTGTTGTTTGTTTATTTTCACAAAGTATCATCCATTGAATCTTGTGTACAATGTGCCGCCTCCAAAAAAAAGTAATTAATTTTTTCTTGAACACAATTACTCATGAAACTTCGACATTAGAGTGAGTAACATTGTTTCTAATGTGAACCAAGTTTTTTATTTCCTGTTTTTTTGGTAATATTATCCATATAACAGCTCTAACCAATGAATTAATGAAATTAGTTCGTCAGGAAACAACTCTAGACTCTTAAGGGAGATGATTTTATTTCTATTTCCTCGTAGATTACATGTCATTCTTGATTCGATCTCAAATTCATGTCTGATAACTATTTTTTGTCATTGATAGTTTGTCTTGCTGTTCTGCCAATGCTTTTTTTTTTCTACTGTAGAGTTCACTTTCATGGAACTAGCTTGCTTTTCCTCTCAATTCATAACTGGCAAGAATAACGAATACATTGGAGAGCATTTGCTTTCCATGTTGTTCTTTCTGACATATCAAAGAAGCCACATTGTGTAGAGTTCTTGGCTTATTGATTTCATATTAAATTGAACTGTTCATTGTGTTTTACCCAGTATTATTGTTTTGACCCATCTCTTCAAACCCCCTTTCATCTTCAGATATCAAAGTGGATGAGTGATAAAAGATACTTTCCTTTATCGACTGCTGATATCGCAATACGGATGAATTTGATCTTAAGAGTTCATGGGTTGGAACAAGTGGAAGATTATTTCGATAACATGCCTAGTCAGTTGAAAAGGTACCAAGTTCATATAGCTCTTCTTAACTGCTATGCGCATGAAAAGTGCGTGGATAAAGCCAATGCCTTCATGCAGAAAATTAAGGAAATGGGTTTTGCTAATTCTCCTCTTCCATACAATATCATGATGAATCTTTATCACCAAATTGGAGAATTTGAGAGATTAGATTCTCTGTTGAAAGAAATGAAAGAAAGGGGTGTTTATTATGATCGATTCACATACAGCATCCGAATAAGTGCATATGCTGCTGCATCTGATTTTAGGGGAATCGAAAAGATCATGGAACAAATGGAATCAAATCCGAGTATTGTTCTAGATTGGAACTGTTATGTCATTGCTGCAAATGCTTACAATAAGGTTGGCTTAATAGACAAATCCATTTCCATGCTGAAGAAATCAGAAGGTCTCCTAGCAAATGTCAAAAAGAAAGGTTTTGCATTTAATGTCTACCTCAAACTATATGCCAGAAATGGAAAGAAAGACGAGATACACCGCATTTGGAATCTCTACA
AGAAAGAAAAAATCTTCAACAAAGGTTTCATCAGCATGATAACATCACTTTTGATATTAGACGATATCAAAGGTGCAGAGCGTATTTACAAGGAATGGGAGACCAGGAAACTGTCATACGACTTGCGGATTCCAAACTTGTTGGTTGATGCGTATTGTAGAGCTGGTCTAATGGAGAAAGCTGAAGTGCTTCTAAATGAGATGGTGATTGTAAGACGCAAGTTTTCGGTCGAGTCGTGGTGCTATTTAGCGAGTGGATATCTTCAGAAAGATCAACTACCTCAGGCAGTTGAGACACTGAAGTTAGCAGCCAGTGTGTGTCCATCACGACTGAACTACGTCAAGGAAATTTTGGCAGCATTTTTGGATGGGAAGCAAGATGTGGAAGAAACTGAGAAAGTGGTTAATTTGTTGAGGGAAAAAGATGACTCTCATCCTGCTCGTGCTCATGATTACATTGTTGGAGCGATTATGACCGAATCCGCCTAATTTAACTTTTATTCTAAAAGAACCCTTAAGTACTTAAGGGATATTTAGTAAGCAATGGTGATTATCAGTCGGTTAGGTTTTTTTATATAAAAAATTGATCTACCATAGTTGGTTTAGTGAAGTCTCAAAACGTCCTTAACATCGATGAGTAAGGGTCGGTCAGGTTGGTTTAACACTTAAAATATTTTTTGGAAATTCTCGATTTGAAACTTTTCAAAATCGACCCCTCTCATTTTTCGAACGATCGACTTAGGTTTGGTCAATTAGTTCTGTTTTTTTTTATATCATATTCACTCAATTATCGTGTTTCTTAGGGAAAAAAATGAAAATGATAAAAGAATGAAA

mRNA sequence

CTAATTTCGACCTAAAGTTTTTCTTTACATCAGCTGAAGAGCTCACGTACTTGGGCAGCGCCTAGTTTCTCCATTTTTTCTCCTCTCTTTTGCCTTTCCAAATGAAGCTCTTACAATCTCTAAAACCAATTAATTTGATTGCATTCCGGAGTGAATTCGTGAATTTCTACTCTACAGTTGTGAAGGATAACCTTTACAGAAGGATTTCTCCGGTGGGTGACCCTAATATCTCTGTAACTCCACTTCTTGATCAGTGGGTGTTAGAAGGCAGGCTTGTTCAGCAAGACGAACTTCGGCATATCATCAAGGAGCTTAGGGTTTACAAGCGGTTCAAACATGCTCTTGAGATATCAAAGTGGATGAGTGATAAAAGATACTTTCCTTTATCGACTGCTGATATCGCAATACGGATGAATTTGATCTTAAGAGTTCATGGGTTGGAACAAGTGGAAGATTATTTCGATAACATGCCTAGTCAGTTGAAAAGGTACCAAGTTCATATAGCTCTTCTTAACTGCTATGCGCATGAAAAGTGCGTGGATAAAGCCAATGCCTTCATGCAGAAAATTAAGGAAATGGGTTTTGCTAATTCTCCTCTTCCATACAATATCATGATGAATCTTTATCACCAAATTGGAGAATTTGAGAGATTAGATTCTCTGTTGAAAGAAATGAAAGAAAGGGGTGTTTATTATGATCGATTCACATACAGCATCCGAATAAGTGCATATGCTGCTGCATCTGATTTTAGGGGAATCGAAAAGATCATGGAACAAATGGAATCAAATCCGAGTATTGTTCTAGATTGGAACTGTTATGTCATTGCTGCAAATGCTTACAATAAGGTTGGCTTAATAGACAAATCCATTTCCATGCTGAAGAAATCAGAAGGTCTCCTAGCAAATGTCAAAAAGAAAGGTTTTGCATTTAATGTCTACCTCAAACTATATGCCAGAAATGGAAAGAAAGACGAGATACACCGCATTTGGAATCTCTACAAGAAAGAAAAAATCTTCAACAAAGGTTTCATCAGCATGATAACATCACTTTTGATATTAGACGATATCAAAGGTGCAGAGCGTATTTACAAGGAATGGGAGACCAGGAAACTGTCATACGACTTGCGGATTCCAAACTTGTTGGTTGATGCGTATTGTAGAGCTGGTCTAATGGAGAAAGCTGAAGTGCTTCTAAATGAGATGGTGATTGTAAGACGCAAGTTTTCGGTCGAGTCGTGGTGCTATTTAGCGAGTGGATATCTTCAGAAAGATCAACTACCTCAGGCAGTTGAGACACTGAAGTTAGCAGCCAGTGTGTGTCCATCACGACTGAACTACGTCAAGGAAATTTTGGCAGCATTTTTGGATGGGAAGCAAGATGTGGAAGAAACTGAGAAAGTGGTTAATTTGTTGAGGGAAAAAGATGACTCTCATCCTGCTCGTGCTCATGATTACATTGTTGGAGCGATTATGACCGAATCCGCCTAATTTAACTTTTATTCTAAAAGAACCCTTAAGTACTTAAGGGATATTTAGTAAGCAATGGTGATTATCAGTCGGTTAGGTTTTTTTATATAAAAAATTGATCTACCATAGTTGGTTTAGTGAAGTCTCAAAACGTCCTTAACATCGATGAGTAAGGGTCGGTCAGGTTGGTTTAACACTTAAAATATTTTTTGGAAATTCTCGATTTGAAACTTTTCAAAATCGACCCCTCTCATTTTTCGAACGATCGACTTAGGTTTGGTCAATTAGTTCTGTTTTTTTTTATATCATATTCACTCAATTATCGTGTTTCTTAGGGAAAAAAATGAAAATGATAAAAGAATGAAA

Coding sequence (CDS)

ATGAAGCTCTTACAATCTCTAAAACCAATTAATTTGATTGCATTCCGGAGTGAATTCGTGAATTTCTACTCTACAGTTGTGAAGGATAACCTTTACAGAAGGATTTCTCCGGTGGGTGACCCTAATATCTCTGTAACTCCACTTCTTGATCAGTGGGTGTTAGAAGGCAGGCTTGTTCAGCAAGACGAACTTCGGCATATCATCAAGGAGCTTAGGGTTTACAAGCGGTTCAAACATGCTCTTGAGATATCAAAGTGGATGAGTGATAAAAGATACTTTCCTTTATCGACTGCTGATATCGCAATACGGATGAATTTGATCTTAAGAGTTCATGGGTTGGAACAAGTGGAAGATTATTTCGATAACATGCCTAGTCAGTTGAAAAGGTACCAAGTTCATATAGCTCTTCTTAACTGCTATGCGCATGAAAAGTGCGTGGATAAAGCCAATGCCTTCATGCAGAAAATTAAGGAAATGGGTTTTGCTAATTCTCCTCTTCCATACAATATCATGATGAATCTTTATCACCAAATTGGAGAATTTGAGAGATTAGATTCTCTGTTGAAAGAAATGAAAGAAAGGGGTGTTTATTATGATCGATTCACATACAGCATCCGAATAAGTGCATATGCTGCTGCATCTGATTTTAGGGGAATCGAAAAGATCATGGAACAAATGGAATCAAATCCGAGTATTGTTCTAGATTGGAACTGTTATGTCATTGCTGCAAATGCTTACAATAAGGTTGGCTTAATAGACAAATCCATTTCCATGCTGAAGAAATCAGAAGGTCTCCTAGCAAATGTCAAAAAGAAAGGTTTTGCATTTAATGTCTACCTCAAACTATATGCCAGAAATGGAAAGAAAGACGAGATACACCGCATTTGGAATCTCTACAAGAAAGAAAAAATCTTCAACAAAGGTTTCATCAGCATGATAACATCACTTTTGATATTAGACGATATCAAAGGTGCAGAGCGTATTTACAAGGAATGGGAGACCAGGAAACTGTCATACGACTTGCGGATTCCAAACTTGTTGGTTGATGCGTATTGTAGAGCTGGTCTAATGGAGAAAGCTGAAGTGCTTCTAAATGAGATGGTGATTGTAAGACGCAAGTTTTCGGTCGAGTCGTGGTGCTATTTAGCGAGTGGATATCTTCAGAAAGATCAACTACCTCAGGCAGTTGAGACACTGAAGTTAGCAGCCAGTGTGTGTCCATCACGACTGAACTACGTCAAGGAAATTTTGGCAGCATTTTTGGATGGGAAGCAAGATGTGGAAGAAACTGAGAAAGTGGTTAATTTGTTGAGGGAAAAAGATGACTCTCATCCTGCTCGTGCTCATGATTACATTGTTGGAGCGATTATGACCGAATCCGCCTAA

Protein sequence

MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKKEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAFLDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA
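
As a consistency check, the coding sequence can be translated and compared with the protein sequence above. The following is a minimal sketch assuming the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGCTCTTACAATCTCTAAAACCA"  # first nine codons of the CDS
cds_end = "ACCGAATCCGCCTAA"                # last five codons, including the stop

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MKLLQSLKP` and `TESA`, matching the start and end of the protein sequence. Passing the full CDS string to `translate` would allow the whole protein to be compared in the same way.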
BLAST of CsGy1G012070 vs. NCBI nr
Match: XP_011653157.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis sativus] >KGN64672.1 hypothetical protein Csa_1G073790 [Cucumis sativus])

HSP 1 Score: 785.4 bits (2027), Expect = 1.0e-223
Identity = 459/461 (99.57%), Positives = 460/461 (99.78%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ
Sbjct: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX
Sbjct: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMITSL +LDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF
Sbjct: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA
Sbjct: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 461
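
The percentages in each HSP summary line are the match count divided by the alignment length. A small sketch of that arithmetic follows; the helper names `percent` and `fractions_in` are hypothetical, and rounding to two decimals is an assumption based on how the values are printed in this report.

```python
import re

def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to two decimals."""
    return round(100 * matches / aligned_length, 2)

def fractions_in(line):
    """Extract every 'N/M' pair from an HSP summary line as integer tuples."""
    return [(int(n), int(m)) for n, m in re.findall(r"(\d+)/(\d+)", line)]

# Identity figures of the first hit (XP_011653157.1).
identical, length = fractions_in("Identity = 459/461 (99.57%)")[0]
print(percent(identical, length))

# Conservative substitutions included: 460 of 461 aligned positions.
print(percent(460, 461))
```

For the first hit this reproduces the reported 99.57% identity and 99.78% positives; the same calculation applies to the other hits, where the denominator is the length of the aligned region rather than the full protein.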

BLAST of CsGy1G012070 vs. NCBI nr
Match: XP_008442434.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis melo])

HSP 1 Score: 672.2 bits (1733), Expect = 1.3e-189
Identity = 398/453 (87.86%), Positives = 420/453 (92.72%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQS+KP NLIA R   VNFYST VKDNLYRRISPVGDPNISV P+LDQWVLEGR+VQ
Sbjct: 1   MKLLQSVKPTNLIALRRGLVNFYSTFVKDNLYRRISPVGDPNISVIPVLDQWVLEGRVVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           ++EL+ IIKELRVYKRFKHALEISKWMSDKRY PLST D+A RMNLILRVHGLEQVEDYF
Sbjct: 61  KEELQKIIKELRVYKRFKHALEISKWMSDKRYLPLSTDDVATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +NMPSQLKRY VHIALLNCYAHEKCVDKANAF+QKIKEMG+A S LPYNIMM  XXXXXX
Sbjct: 121 NNMPSQLKRYHVHIALLNCYAHEKCVDKANAFLQKIKEMGYAKSTLPYNIMMNLXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX N SIVLDWNCYV
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKS+SMLKKSEG LA  KKKG AFNVYLKLYARNGKKDE+HRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSVSMLKKSEGRLATDKKKGHAFNVYLKLYARNGKKDEVHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMI SLLILDDI+GAE IYKEWET+KLSYD+RIPNLLVDAYCRAGL+EKA
Sbjct: 301 KEKIFNKGFISMIRSLLILDDIRGAEDIYKEWETQKLSYDVRIPNLLVDAYCRAGLIEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           E L+NE+V VR KFSVESWCYLASGYLQKDQLPQAVETLK AAS+CPS LNYVKEILAAF
Sbjct: 361 EELVNEIVNVRGKFSVESWCYLASGYLQKDQLPQAVETLKKAASLCPSELNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIV 454
            DGKQDVEE EKVVNLLREKD+ +PARAHD +V
Sbjct: 421 SDGKQDVEEAEKVVNLLREKDNLNPARAHDILV 453

BLAST of CsGy1G012070 vs. NCBI nr
Match: XP_008442448.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis melo])

HSP 1 Score: 639.8 bits (1649), Expect = 7.1e-180
Identity = 371/467 (79.44%), Positives = 403/467 (86.30%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQS+K INLIAFR E VNFYST V D+LYRR+SPVGDPNIS+ P+LDQWV EGR VQ
Sbjct: 1   MKLLQSVKSINLIAFRRELVNFYSTFVNDDLYRRLSPVGDPNISIVPILDQWVSEGRPVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             ELR IIKELRVYKR+KHALE+SKWMSDK   PLSTADIA RMNLILRVHGLEQVEDYF
Sbjct: 61  IVELRLIIKELRVYKRYKHALEMSKWMSDKVCLPLSTADIATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +NMPS+LKRYQVHIALLNCYAHEKCVDKANA +QKIKE+GFA +P PYNIMM        
Sbjct: 121 NNMPSKLKRYQVHIALLNCYAHEKCVDKANALLQKIKELGFATTPHPYNIMMNLYHQIGE 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX            N SIVLDWNCYV
Sbjct: 181 FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAGIEKITEQMESNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLA-NVKKKGFAFNVYLKLYARNGKKDEIHRIWNLY 300
           +AA+AY KVGLIDKSISMLKKSE LLA   + K  AFN+YL LYA+NGKKDE +RIWNLY
Sbjct: 241 VAADAYYKVGLIDKSISMLKKSEELLAKTAENKCHAFNIYLTLYAKNGKKDETYRIWNLY 300

Query: 301 KKEKIFNKGFISMITSLLILDDIKGAERIYKE-----WETRKLSYDLRIPNLLVDAYCRA 360
           KKEK+FNKGFISMITSLLILDDIKGA RI +E     WET+KLSYDLRIPNLLVDAYCRA
Sbjct: 301 KKEKVFNKGFISMITSLLILDDIKGARRICEEWETQVWETQKLSYDLRIPNLLVDAYCRA 360

Query: 361 GLMEKAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVK 420
           GLME+AEVL+ EM+ VRRKFSV+SWCY+ASGYLQKDQLP+AVETLK+AAS+CPS+L+YVK
Sbjct: 361 GLMEEAEVLVYEMMTVRRKFSVKSWCYIASGYLQKDQLPEAVETLKIAASLCPSKLDYVK 420

Query: 421 EILAAFLDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           EILAAFLDGKQDVEE EKVVNLLREKD+SHPAR HDY   AIMTESA
Sbjct: 421 EILAAFLDGKQDVEEVEKVVNLLREKDNSHPARGHDY---AIMTESA 464

BLAST of CsGy1G012070 vs. NCBI nr
Match: XP_022140106.1 (pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Momordica charantia])

HSP 1 Score: 544.7 bits (1402), Expect = 3.1e-151
Identity = 322/453 (71.08%), Positives = 380/453 (83.89%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLL   K ++ +AFR    +FYS + +D+LYRRISPVGDPN+SV P+LDQWV EGR VQ
Sbjct: 1   MKLLVPGKQVHPVAFRRILGHFYSMIARDSLYRRISPVGDPNVSVIPVLDQWVREGRPVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           ++EL+ IIKELR+YKR KHALEISKWM DKRY PLS+ DIA RMNLILRVHGLEQVEDYF
Sbjct: 61  REELQKIIKELRIYKRSKHALEISKWMGDKRYLPLSSVDIATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +++PSQLK++QVH+ALLNCYAHEKCVDKANA +QKIKEMGF  +PLPYNIMM       X
Sbjct: 121 NSIPSQLKKFQVHVALLNCYAHEKCVDKANAILQKIKEMGFDGAPLPYNIMMNLYYQIGX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX +P IV DW+CYV
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLDPRIVPDWSCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAAN Y KVGL+DKS+SML+KSE LLA  K++G AF++ LKLYA +GKKDE+HRIW LYK
Sbjct: 241 IAANGYLKVGLVDKSLSMLRKSEALLATAKRRGSAFDILLKLYAESGKKDELHRIWKLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEK++NKG++SM+ SLLILDDI+ AE I+K+WET KLS DLRIPN+L++AYCR GLMEKA
Sbjct: 301 KEKVYNKGYMSMMRSLLILDDIEAAEHIFKDWETWKLSNDLRIPNMLIEAYCRRGLMEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           E L+N+ V  + KFSV SWCYLA+ Y+QKDQL   VE LK AA++CP  LN+ KEILA F
Sbjct: 361 EALINKAVTGKSKFSVHSWCYLANAYIQKDQLQHTVEALKKAATLCPPELNHFKEILATF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIV 454
           L+GKQDV+E EKVV LLR + +S    AHD ++
Sbjct: 421 LEGKQDVKEAEKVVGLLRAEANS--LFAHDVLI 451

BLAST of CsGy1G012070 vs. NCBI nr
Match: XP_011653151.1 (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 540.0 bits (1390), Expect = 7.7e-150
Identity = 283/358 (79.05%), Positives = 289/358 (80.73%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQSLKPINLIAFR EFVNFYSTVVKD+LYRRISPVGDPNISVTPLLDQWVLE  LVQ
Sbjct: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDSLYRRISPVGDPNISVTPLLDQWVLESGLVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIA RMNLILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +NMPSQLKR QVHIALLNCYAHEK  DKANA +QKIKEMGFA + LPYNI M        
Sbjct: 121 NNMPSQLKRCQVHIALLNCYAHEKYADKANAVLQKIKEMGFAKTSLPYNITM---NLYHQ 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
                                                          XN SIVLDWNCYV
Sbjct: 181 IGEFERLDSPLKETDVDHDQFTYTTRLSAYATAFDFTGIEKIMEQMEXNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIH IWNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHLIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLME 359
           KEKIFNKGFISMITSL +LDDIKGAERIYKEWET+KLSYDLRIPNLLVDAYCRAGLME
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETQKLSYDLRIPNLLVDAYCRAGLME 355

BLAST of CsGy1G012070 vs. TAIR10
Match: AT2G20710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 271.6 bits (693), Expect = 9.2e-73
Identity = 148/416 (35.58%), Positives = 231/416 (55.53%), Query Frame = 0

Query: 29  DNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMS 88
           D L RR++  GDP+ S+  +LD W+ +G LV+  EL  IIK LR + RF HAL+IS WMS
Sbjct: 38  DTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMS 97

Query: 89  DKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDK 148
           + R   +S  D+AIR++LI +V GL + E +F+ +P + + Y ++ ALLNCYA +K + K
Sbjct: 98  EHRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHK 157

Query: 149 ANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 208
           A    Q++KE+GF    LPYN+M+                                    
Sbjct: 158 AEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLH 217

Query: 209 XXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLAN 268
                               +  + LDW  Y   AN Y K GL +K++ ML+KSE ++ N
Sbjct: 218 AYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMV-N 277

Query: 269 VKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK-EKIFNKGFISMITSLLILDDIKGAER 328
            +K+  A+ V +  Y   GKK+E++R+W+LYK+ +  +N G+IS+I++LL +DDI+  E+
Sbjct: 278 AQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEK 337

Query: 329 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 388
           I +EWE     +D+RIP+LL+  YC+ G+MEKAE ++N +V   R     +W  LA GY 
Sbjct: 338 IMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYK 397

Query: 389 QKDQLPQAVETLKLAASVCPSRLNYVKEILAA---FLDGKQDVEETEKVVNLLREK 441
              ++ +AVE  K A  V        + +L +   +L+G++D+E   K++ LL E+
Sbjct: 398 MAGKMEKAVEKWKRAIEVSKPGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLLSER 452

BLAST of CsGy1G012070 vs. TAIR10
Match: AT4G21705.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 221.1 bits (562), Expect = 1.4e-57
Identity = 136/407 (33.42%), Positives = 204/407 (50.12%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           M +L+ + P NLIA R  + N    V K  LY +ISP+GDP  SV P L  WV  G+ V 
Sbjct: 1   MNILRRI-PANLIASRYYYTN---RVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVS 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             EL  I+ +LR  KRF HALE+SKWM++      S  + A+ ++LI RV+G    E+YF
Sbjct: 61  VAELIRIVHDLRRRKRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +N+  Q K  + + ALLNCY  ++ V+K+    +K+KEMGF  S L YN +M        
Sbjct: 121 ENLKEQYKNDKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQ 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
                                                              I +DWN Y 
Sbjct: 181 HEKVPKVLEEMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYA 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           +AA  Y   G  D+++ +LK SE  L   KK G  +N  + LYAR GKK E+ R+W+L K
Sbjct: 241 VAAKFYIDGGDCDRAVELLKMSENRLE--KKDGEGYNHLITLYARLGKKIEVLRLWDLEK 300

Query: 301 K--EKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLME 360
              ++  N+ +++++ SL+ +D +  AE +  EW++    YD R+PN ++  Y    + E
Sbjct: 301 DVCKRRINQDYLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEE 360

Query: 361 KAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASV 406
           KAE +L ++    +  + ESW  +A+ Y +K  L  A + +K A  V
Sbjct: 361 KAEAMLEDLARRGKATTPESWELVATAYAEKGTLENAFKCMKTALGV 401

BLAST of CsGy1G012070 vs. TAIR10
Match: AT1G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 193.7 bits (491), Expect = 2.4e-49
Identity = 181/420 (43.10%), Positives = 263/420 (62.62%), Query Frame = 0

Query: 31  LYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDK 90
           +Y++IS +  P +    +L+QW   GR + + EL  ++KELR YKR   ALE+  WM+++
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 91  -RYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDKA 150
              F LS +D AI+++LI +V G+   E++F  +P   K  +V+ +LLN Y   K  +KA
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 151 NAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210
            A +  +++ G+A  PLP+N+  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 189 EALLNTMRDKGYALHPLPFNVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 248

Query: 211 XXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANV 270
           XXXXXXXXXXXXXXXXXX + SI  +W  +   A  Y K+G  +K+   L+K E  +   
Sbjct: 249 XXXXXXXXXXXXXXXXXXSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITG- 308

Query: 271 KKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLLILDDIKGAER 330
            +    ++  L LY   G K E++R+W++YK     I N G+ ++++SL+ + DI+GAE+
Sbjct: 309 -RNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEK 368

Query: 331 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 390
           +Y+EW   K SYD RIPNLL++AY +   +E AE L + MV +  K S  +W  LA G+ 
Sbjct: 369 VYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHT 428

Query: 391 QKDQLPQAVETLKLAASVCPSRLNYVKEILA-----AFLDGKQDVEETEKVVNLLREKDD 443
           +K  + +A+  L+ A S   S  N+  ++L         + + DV   E V+ LLR+  D
Sbjct: 429 RKRCISEALTCLRNAFSAEGSS-NWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGD 485

BLAST of CsGy1G012070 vs. TAIR10
Match: AT1G28020.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 170.6 bits (431), Expect = 2.2e-42
Identity = 109/345 (31.59%), Positives = 168/345 (48.70%), Query Frame = 0

Query: 34  RISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDKRYF 93
           RI+     N  + P+L+QW  +G  V    +R IIK+LR   +   AL++S+WMS ++  
Sbjct: 41  RITDALHRNAQIIPVLEQWRQQGNQVNPSHVRVIIKKLRDSDQSLQALQVSEWMSKEKIC 100

Query: 94  PLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAH-EKCVDKANAF 153
            L   D A R++LI  V GLE+ E +F+++P   +   V+ +LLN YA  +K + KA A 
Sbjct: 101 NLIPEDFAARLHLIENVVGLEEAEKFFESIPKNARGDSVYTSLLNSYARSDKTLCKAEAT 160

Query: 154 MQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 213
            QK++++G    P+PYN MM                                        
Sbjct: 161 FQKMRDLGLLLRPVPYNAMMSLYSALKNREKVEELLLEMKDNDVEADNVTVNNVLKLYSA 220

Query: 214 XXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANVKKK 273
                              I L+W+  +  A AY +     K++ ML+ +E L+     K
Sbjct: 221 VCDVTEMEKFLNKWEGIHGIKLEWHTTLDMAKAYLRARSSGKAMKMLRLTEQLVDQKSLK 280

Query: 274 GFAFNVYLKLYARNGKKDEIHRIWNLYKKE--KIFNKGFISMITSLLILDDIKGAERIYK 333
             A++  +KLY   G ++E+ R+W LYK +  +  N G+ ++I SLL +DDI GAE IYK
Sbjct: 281 S-AYDHLMKLYGEAGNREEVLRVWKLYKSKIGERDNNGYRTVIRSLLKVDDIVGAEEIYK 340

Query: 334 EWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFS 376
            WE+  L +D RIP +L   Y   G+ EKAE L+N   I  R+ +
Sbjct: 341 VWESLPLEFDHRIPTMLASGYRDRGMTEKAEKLMNSKTIKDRRMN 384

BLAST of CsGy1G012070 vs. TAIR10
Match: AT5G27460.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 154.8 bits (390), Expect = 1.3e-37
Identity = 103/382 (26.96%), Positives = 179/382 (46.86%), Query Frame = 0

Query: 24  STVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEI 83
           S+V   N  + I     P  SVT LL + +  G  V   ELR I K L    R+  AL++
Sbjct: 33  SSVANRNSLKEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQM 92

Query: 84  SKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNM----PSQLKRYQVHIALLNC 143
            +WM +++    S  DIA+R++LI++ HGL+Q E+YF+ +     S       ++ LL  
Sbjct: 93  MEWMENQKDIEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKSAYLPLLRA 152

Query: 144 YAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXX 203
           Y   K V +A A M+K+  +GF  +P P+N MM                           
Sbjct: 153 YVKNKMVKEAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRN 212

Query: 204 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISML 263
                                        + S+ + W+     AN Y K G  +K+  +L
Sbjct: 213 VLSYNLWMNACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVL 272

Query: 264 KKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLL 323
           + +E +L    + G+ F   + LYA  G K+ + R+W + K    +I    +I +++SL+
Sbjct: 273 EDAEKMLNRSNRLGYFF--LITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLV 332

Query: 324 ILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVE 383
              D++ AER++ EWE +  +YD+R+ N+L+ AY R G + KAE L   ++      + +
Sbjct: 333 KTGDLEEAERVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGCVLERGGTPNYK 392

Query: 384 SWCYLASGYLQKDQLPQAVETL 400
           +W  L  G+++ + + +A++ +
Sbjct: 393 TWEILMEGWVKCENMEKAIDAM 412

BLAST of CsGy1G012070 vs. Swiss-Prot
Match: sp|Q9SKU6|PP166_ARATH (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 1.7e-71
Identity = 148/416 (35.58%), Positives = 231/416 (55.53%), Query Frame = 0

Query: 29  DNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMS 88
           D L RR++  GDP+ S+  +LD W+ +G LV+  EL  IIK LR + RF HAL+IS WMS
Sbjct: 38  DTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMS 97

Query: 89  DKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDK 148
           + R   +S  D+AIR++LI +V GL + E +F+ +P + + Y ++ ALLNCYA +K + K
Sbjct: 98  EHRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHK 157

Query: 149 ANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 208
           A    Q++KE+GF    LPYN+M+                                    
Sbjct: 158 AEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLH 217

Query: 209 XXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLAN 268
                               +  + LDW  Y   AN Y K GL +K++ ML+KSE ++ N
Sbjct: 218 AYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMV-N 277

Query: 269 VKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK-EKIFNKGFISMITSLLILDDIKGAER 328
            +K+  A+ V +  Y   GKK+E++R+W+LYK+ +  +N G+IS+I++LL +DDI+  E+
Sbjct: 278 AQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEK 337

Query: 329 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 388
           I +EWE     +D+RIP+LL+  YC+ G+MEKAE ++N +V   R     +W  LA GY 
Sbjct: 338 IMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYK 397

Query: 389 QKDQLPQAVETLKLAASVCPSRLNYVKEILAA---FLDGKQDVEETEKVVNLLREK 441
              ++ +AVE  K A  V        + +L +   +L+G++D+E   K++ LL E+
Sbjct: 398 MAGKMEKAVEKWKRAIEVSKPGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLLSER 452

BLAST of CsGy1G012070 vs. Swiss-Prot
Match: sp|Q84JR3|PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 2.6e-56
Identity = 136/407 (33.42%), Positives = 204/407 (50.12%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           M +L+ + P NLIA R  + N    V K  LY +ISP+GDP  SV P L  WV  G+ V 
Sbjct: 1   MNILRRI-PANLIASRYYYTN---RVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVS 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             EL  I+ +LR  KRF HALE+SKWM++      S  + A+ ++LI RV+G    E+YF
Sbjct: 61  VAELIRIVHDLRRRKRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +N+  Q K  + + ALLNCY  ++ V+K+    +K+KEMGF  S L YN +M        
Sbjct: 121 ENLKEQYKNDKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQ 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
                                                              I +DWN Y 
Sbjct: 181 HEKVPKVLEEMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYA 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           +AA  Y   G  D+++ +LK SE  L   KK G  +N  + LYAR GKK E+ R+W+L K
Sbjct: 241 VAAKFYIDGGDCDRAVELLKMSENRLE--KKDGEGYNHLITLYARLGKKIEVLRLWDLEK 300

Query: 301 K--EKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLME 360
              ++  N+ +++++ SL+ +D +  AE +  EW++    YD R+PN ++  Y    + E
Sbjct: 301 DVCKRRINQDYLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEE 360

Query: 361 KAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASV 406
           KAE +L ++    +  + ESW  +A+ Y +K  L  A + +K A  V
Sbjct: 361 KAEAMLEDLARRGKATTPESWELVATAYAEKGTLENAFKCMKTALGV 401

BLAST of CsGy1G012070 vs. Swiss-Prot
Match: sp|Q8LPS6|PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX=3702 GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 193.7 bits (491), Expect = 4.4e-48
Identity = 181/420 (43.10%), Positives = 263/420 (62.62%), Query Frame = 0

Query: 31  LYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDK 90
           +Y++IS +  P +    +L+QW   GR + + EL  ++KELR YKR   ALE+  WM+++
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 91  -RYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDKA 150
              F LS +D AI+++LI +V G+   E++F  +P   K  +V+ +LLN Y   K  +KA
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 151 NAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210
            A +  +++ G+A  PLP+N+  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 189 EALLNTMRDKGYALHPLPFNVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 248

Query: 211 XXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANV 270
           XXXXXXXXXXXXXXXXXX + SI  +W  +   A  Y K+G  +K+   L+K E  +   
Sbjct: 249 XXXXXXXXXXXXXXXXXXSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITG- 308

Query: 271 KKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLLILDDIKGAER 330
            +    ++  L LY   G K E++R+W++YK     I N G+ ++++SL+ + DI+GAE+
Sbjct: 309 -RNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEK 368

Query: 331 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 390
           +Y+EW   K SYD RIPNLL++AY +   +E AE L + MV +  K S  +W  LA G+ 
Sbjct: 369 VYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHT 428

Query: 391 QKDQLPQAVETLKLAASVCPSRLNYVKEILA-----AFLDGKQDVEETEKVVNLLREKDD 443
           +K  + +A+  L+ A S   S  N+  ++L         + + DV   E V+ LLR+  D
Sbjct: 429 RKRCISEALTCLRNAFSAEGSS-NWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGD 485

BLAST of CsGy1G012070 vs. Swiss-Prot
Match: sp|Q9C7F1|PPR61_ARATH (Putative pentatricopeptide repeat-containing protein At1g28020 OS=Arabidopsis thaliana OX=3702 GN=At1g28020 PE=3 SV=2)

HSP 1 Score: 170.6 bits (431), Expect = 4.0e-41
Identity = 109/345 (31.59%), Positives = 168/345 (48.70%), Query Frame = 0

Query: 34  RISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDKRYF 93
           RI+     N  + P+L+QW  +G  V    +R IIK+LR   +   AL++S+WMS ++  
Sbjct: 41  RITDALHRNAQIIPVLEQWRQQGNQVNPSHVRVIIKKLRDSDQSLQALQVSEWMSKEKIC 100

Query: 94  PLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAH-EKCVDKANAF 153
            L   D A R++LI  V GLE+ E +F+++P   +   V+ +LLN YA  +K + KA A 
Sbjct: 101 NLIPEDFAARLHLIENVVGLEEAEKFFESIPKNARGDSVYTSLLNSYARSDKTLCKAEAT 160

Query: 154 MQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 213
            QK++++G    P+PYN MM                                        
Sbjct: 161 FQKMRDLGLLLRPVPYNAMMSLYSALKNREKVEELLLEMKDNDVEADNVTVNNVLKLYSA 220

Query: 214 XXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANVKKK 273
                              I L+W+  +  A AY +     K++ ML+ +E L+     K
Sbjct: 221 VCDVTEMEKFLNKWEGIHGIKLEWHTTLDMAKAYLRARSSGKAMKMLRLTEQLVDQKSLK 280

Query: 274 GFAFNVYLKLYARNGKKDEIHRIWNLYKKE--KIFNKGFISMITSLLILDDIKGAERIYK 333
             A++  +KLY   G ++E+ R+W LYK +  +  N G+ ++I SLL +DDI GAE IYK
Sbjct: 281 S-AYDHLMKLYGEAGNREEVLRVWKLYKSKIGERDNNGYRTVIRSLLKVDDIVGAEEIYK 340

Query: 334 EWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFS 376
            WE+  L +D RIP +L   Y   G+ EKAE L+N   I  R+ +
Sbjct: 341 VWESLPLEFDHRIPTMLASGYRDRGMTEKAEKLMNSKTIKDRRMN 384

BLAST of CsGy1G012070 vs. Swiss-Prot
Match: sp|Q3E911|PP400_ARATH (Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX=3702 GN=At5g27460 PE=2 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 2.3e-36
Identity = 103/382 (26.96%), Positives = 179/382 (46.86%), Query Frame = 0

Query: 24  STVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEI 83
           S+V   N  + I     P  SVT LL + +  G  V   ELR I K L    R+  AL++
Sbjct: 33  SSVANRNSLKEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQM 92

Query: 84  SKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNM----PSQLKRYQVHIALLNC 143
            +WM +++    S  DIA+R++LI++ HGL+Q E+YF+ +     S       ++ LL  
Sbjct: 93  MEWMENQKDIEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKSAYLPLLRA 152

Query: 144 YAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXX 203
           Y   K V +A A M+K+  +GF  +P P+N MM                           
Sbjct: 153 YVKNKMVKEAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRN 212

Query: 204 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISML 263
                                        + S+ + W+     AN Y K G  +K+  +L
Sbjct: 213 VLSYNLWMNACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVL 272

Query: 264 KKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLL 323
           + +E +L    + G+ F   + LYA  G K+ + R+W + K    +I    +I +++SL+
Sbjct: 273 EDAEKMLNRSNRLGYFF--LITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLV 332

Query: 324 ILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVE 383
              D++ AER++ EWE +  +YD+R+ N+L+ AY R G + KAE L   ++      + +
Sbjct: 333 KTGDLEEAERVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGCVLERGGTPNYK 392

Query: 384 SWCYLASGYLQKDQLPQAVETL 400
           +W  L  G+++ + + +A++ +
Sbjct: 393 TWEILMEGWVKCENMEKAIDAM 412

BLAST of CsGy1G012070 vs. TrEMBL
Match: tr|A0A0A0LV44|A0A0A0LV44_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G073790 PE=4 SV=1)

HSP 1 Score: 785.4 bits (2027), Expect = 6.9e-224
Identity = 459/461 (99.57%), Positives = 460/461 (99.78%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ
Sbjct: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX
Sbjct: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMITSL +LDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF
Sbjct: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA
Sbjct: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 461

BLAST of CsGy1G012070 vs. TrEMBL
Match: tr|A0A1S3B572|A0A1S3B572_CUCME (pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103486303 PE=4 SV=1)

HSP 1 Score: 672.2 bits (1733), Expect = 8.5e-190
Identity = 398/453 (87.86%), Positives = 420/453 (92.72%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQS+KP NLIA R   VNFYST VKDNLYRRISPVGDPNISV P+LDQWVLEGR+VQ
Sbjct: 1   MKLLQSVKPTNLIALRRGLVNFYSTFVKDNLYRRISPVGDPNISVIPVLDQWVLEGRVVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           ++EL+ IIKELRVYKRFKHALEISKWMSDKRY PLST D+A RMNLILRVHGLEQVEDYF
Sbjct: 61  KEELQKIIKELRVYKRFKHALEISKWMSDKRYLPLSTDDVATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +NMPSQLKRY VHIALLNCYAHEKCVDKANAF+QKIKEMG+A S LPYNIMM  XXXXXX
Sbjct: 121 NNMPSQLKRYHVHIALLNCYAHEKCVDKANAFLQKIKEMGYAKSTLPYNIMMNLXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX N SIVLDWNCYV
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKS+SMLKKSEG LA  KKKG AFNVYLKLYARNGKKDE+HRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSVSMLKKSEGRLATDKKKGHAFNVYLKLYARNGKKDEVHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMI SLLILDDI+GAE IYKEWET+KLSYD+RIPNLLVDAYCRAGL+EKA
Sbjct: 301 KEKIFNKGFISMIRSLLILDDIRGAEDIYKEWETQKLSYDVRIPNLLVDAYCRAGLIEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           E L+NE+V VR KFSVESWCYLASGYLQKDQLPQAVETLK AAS+CPS LNYVKEILAAF
Sbjct: 361 EELVNEIVNVRGKFSVESWCYLASGYLQKDQLPQAVETLKKAASLCPSELNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIV 454
            DGKQDVEE EKVVNLLREKD+ +PARAHD +V
Sbjct: 421 SDGKQDVEEAEKVVNLLREKDNLNPARAHDILV 453

BLAST of CsGy1G012070 vs. TrEMBL
Match: tr|A0A1S3B6G4|A0A1S3B6G4_CUCME (pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103486307 PE=4 SV=1)

HSP 1 Score: 639.8 bits (1649), Expect = 4.7e-180
Identity = 371/467 (79.44%), Positives = 403/467 (86.30%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQS+K INLIAFR E VNFYST V D+LYRR+SPVGDPNIS+ P+LDQWV EGR VQ
Sbjct: 1   MKLLQSVKSINLIAFRRELVNFYSTFVNDDLYRRLSPVGDPNISIVPILDQWVSEGRPVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             ELR IIKELRVYKR+KHALE+SKWMSDK   PLSTADIA RMNLILRVHGLEQVEDYF
Sbjct: 61  IVELRLIIKELRVYKRYKHALEMSKWMSDKVCLPLSTADIATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +NMPS+LKRYQVHIALLNCYAHEKCVDKANA +QKIKE+GFA +P PYNIMM        
Sbjct: 121 NNMPSKLKRYQVHIALLNCYAHEKCVDKANALLQKIKELGFATTPHPYNIMMNLYHQIGE 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX            N SIVLDWNCYV
Sbjct: 181 FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAGIEKITEQMESNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLA-NVKKKGFAFNVYLKLYARNGKKDEIHRIWNLY 300
           +AA+AY KVGLIDKSISMLKKSE LLA   + K  AFN+YL LYA+NGKKDE +RIWNLY
Sbjct: 241 VAADAYYKVGLIDKSISMLKKSEELLAKTAENKCHAFNIYLTLYAKNGKKDETYRIWNLY 300

Query: 301 KKEKIFNKGFISMITSLLILDDIKGAERIYKE-----WETRKLSYDLRIPNLLVDAYCRA 360
           KKEK+FNKGFISMITSLLILDDIKGA RI +E     WET+KLSYDLRIPNLLVDAYCRA
Sbjct: 301 KKEKVFNKGFISMITSLLILDDIKGARRICEEWETQVWETQKLSYDLRIPNLLVDAYCRA 360

Query: 361 GLMEKAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVK 420
           GLME+AEVL+ EM+ VRRKFSV+SWCY+ASGYLQKDQLP+AVETLK+AAS+CPS+L+YVK
Sbjct: 361 GLMEEAEVLVYEMMTVRRKFSVKSWCYIASGYLQKDQLPEAVETLKIAASLCPSKLDYVK 420

Query: 421 EILAAFLDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           EILAAFLDGKQDVEE EKVVNLLREKD+SHPAR HDY   AIMTESA
Sbjct: 421 EILAAFLDGKQDVEEVEKVVNLLREKDNSHPARGHDY---AIMTESA 464

BLAST of CsGy1G012070 vs. TrEMBL
Match: tr|A0A0A0LSC2|A0A0A0LSC2_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G073800 PE=4 SV=1)

HSP 1 Score: 470.3 bits (1209), Expect = 4.9e-129
Identity = 255/367 (69.48%), Positives = 264/367 (71.93%), Query Frame = 0

Query: 87  MSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCV 146
           MSDKRYFPLSTADIA RMNLILRVHGLEQVEDYF+NMPSQLKR QVHIALLNCYAHEK  
Sbjct: 1   MSDKRYFPLSTADIATRMNLILRVHGLEQVEDYFNNMPSQLKRCQVHIALLNCYAHEKYA 60

Query: 147 DKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 206
           DKANA +QKIKEMGFA + LPYNI M                                  
Sbjct: 61  DKANAVLQKIKEMGFAKTSLPYNITMNLYHQIGEFERLDSPLKETDVDHDQFTY------ 120

Query: 207 XXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLL 266
                                   +  L+WNCYVIAANAYNKVGLIDKSISMLKKSEGLL
Sbjct: 121 ------------------------TTRLNWNCYVIAANAYNKVGLIDKSISMLKKSEGLL 180

Query: 267 ANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKKEKIFNKGFISMITSLLILDDIKGAE 326
           ANVKKKGFAFNVYLKLYARNGKKDEIH IWNLYKKEKIFNKGFISMITSL +LDDIKGAE
Sbjct: 181 ANVKKKGFAFNVYLKLYARNGKKDEIHLIWNLYKKEKIFNKGFISMITSLFVLDDIKGAE 240

Query: 327 RIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGY 386
           RIYKEWET+KLSYDLRIPNLLVDAYCRA                            ASGY
Sbjct: 241 RIYKEWETQKLSYDLRIPNLLVDAYCRA----------------------------ASGY 300

Query: 387 LQKDQLPQAVETLKLAASVCPSRLNYVKEILAAFLDGKQDVEETEKVVNLLREKDDSHPA 446
           LQKDQLPQAVETLK AAS+CPS LNY KEILAAFLDGKQD EETEKVVNLLREKDDSHPA
Sbjct: 301 LQKDQLPQAVETLKKAASLCPSELNYAKEILAAFLDGKQDEEETEKVVNLLREKDDSHPA 309

Query: 447 RAHDYIV 454
           RAHD +V
Sbjct: 361 RAHDILV 309

BLAST of CsGy1G012070 vs. TrEMBL
Match: tr|A0A2N9I6A5|A0A2N9I6A5_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS47431 PE=4 SV=1)

HSP 1 Score: 408.3 bits (1048), Expect = 2.3e-110
Identity = 278/455 (61.10%), Positives = 342/455 (75.16%), Query Frame = 0

Query: 6   SLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELR 65
           SL+P  L   +    +F+S+    +LYRRIS +G PN+SV PLLDQWV EGR V ++EL+
Sbjct: 10  SLRP--LFPSQKLLCHFFSSNALHSLYRRISQLGGPNVSVVPLLDQWVQEGRPVPKEELQ 69

Query: 66  HIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPS 125
            IIKELRVYKRF HALEIS+WMSDKRY  LS  DIA RMNLI RVHGLEQVE+YF+N+P+
Sbjct: 70  RIIKELRVYKRFNHALEISQWMSDKRYIILSCGDIATRMNLIFRVHGLEQVENYFNNIPT 129

Query: 126 QLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXX 185
            +K + V+ ALLNCYAHEK V+KA   MQ +K+MG            XXXXXXXXXXXXX
Sbjct: 130 NMKGFTVYTALLNCYAHEKSVEKAEIVMQSMKDMGLVXXXXXXXXXXXXXXXXXXXXXXX 189

Query: 186 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANA 245
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX+P +VL+WN Y IAAN 
Sbjct: 190 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPQLVLNWNSYSIAANG 249

Query: 246 YNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK-KEKI 305
           Y KVGL+DK+++M+KKSEGL+ N KKK FAF++ LK YA  GKKDE++RIW LYK KEKI
Sbjct: 250 YLKVGLLDKALAMVKKSEGLIDNAKKKNFAFDLLLKQYAEIGKKDELYRIWKLYKEKEKI 309

Query: 306 FNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLL 365
           +NKG+ISMI+SLL  DDI+GAE I++EWE+RKLSYD R+PNLL++AY + GL+ KAE LL
Sbjct: 310 YNKGYISMISSLLAFDDIEGAENIFEEWESRKLSYDFRVPNLLINAYGQKGLLAKAEALL 369

Query: 366 NEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAA---FL 425
           N  +    + S +SW YLASGYL  +Q+P+A+E +K A +VCP      KE LA    +L
Sbjct: 370 NRGMTRGGQPSADSWYYLASGYLDNNQIPKALEVMKKAVAVCPPGWRPSKETLATCLEYL 429

Query: 426 DGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAI 457
           +GK D E  +  +N LR +    P   H+ ++  I
Sbjct: 430 EGKGDTERADDFINSLRVQ-HIFPTSVHNRLLNYI 461

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_011653157.1	1.0e-223	99.57	PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
XP_008442434.1	1.3e-189	87.86	PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
XP_008442448.1	7.1e-180	79.44	PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-...
XP_022140106.1	3.1e-151	71.08	pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Momor...
XP_011653151.1	7.7e-150	79.05	PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g...

Match Name	E-value	Identity	Description
AT2G20710.1	9.2e-73	35.58	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G21705.1	1.4e-57	33.42	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G02150.1	2.4e-49	43.10	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G28020.1	2.2e-42	31.59	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G27460.1	1.3e-37	26.96	Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name	E-value	Identity	Description
sp|Q9SKU6|PP166_ARATH	1.7e-71	35.58	Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop...
sp|Q84JR3|PP334_ARATH	2.6e-56	33.42	Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop...
sp|Q8LPS6|PPR3_ARATH	4.4e-48	43.10	Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX...
sp|Q9C7F1|PPR61_ARATH	4.0e-41	31.59	Putative pentatricopeptide repeat-containing protein At1g28020 OS=Arabidopsis th...
sp|Q3E911|PP400_ARATH	2.3e-36	26.96	Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity	Description
tr|A0A0A0LV44|A0A0A0LV44_CUCSA	6.9e-224	99.57	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G073790 PE=4 SV=1
tr|A0A1S3B572|A0A1S3B572_CUCME	8.5e-190	87.86	pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like OS=Cuc...
tr|A0A1S3B6G4|A0A1S3B6G4_CUCME	4.7e-180	79.44	pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like OS=Cuc...
tr|A0A0A0LSC2|A0A0A0LSC2_CUCSA	4.9e-129	69.48	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G073800 PE=4 SV=1
tr|A0A2N9I6A5|A0A2N9I6A5_FAGSY	2.3e-110	61.10	Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS47431 PE=4 SV=1

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR011990	TPR-like_helical_dom_sf
IPR002885	Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0009058	biosynthetic process
biological_process	GO:0055114	oxidation-reduction process
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0005506	iron ion binding
molecular_function	GO:0016491	oxidoreductase activity
molecular_function	GO:0005515	protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CsGy1G012070.1	CsGy1G012070.1	mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF12854	PPR_1	coord: 345..368; e-value: 1.8E-6; score: 27.4
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 168..211; e-value: 2.9E-8; score: 33.7
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 135..161; e-value: 0.27; score: 11.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 345..368; e-value: 1.0E-4; score: 20.2
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 168..199; e-value: 1.7E-4; score: 19.5
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 199..229; score: 7.706
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 340..374; score: 8.342
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 164..198; score: 9.24
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 272..306; score: 6.599
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 129..163; score: 6.358
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	coord: 309..449; e-value: 4.4E-10; score: 41.6
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	coord: 176..308; e-value: 4.9E-15; score: 57.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	coord: 49..162; e-value: 3.8E-5; score: 25.0
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	SSF48452	TPR-like	coord: 312..418
None	No IPR available	PANTHER	PTHR24015:SF1625	SUBFAMILY NOT NAMED	coord: 29..442
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 29..442

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CsGy1G012070Cucumber (Gy14) v2cgybcgybB025
CsGy1G012070Cucumber (Gy14) v2cgybcgybB035
CsGy1G012070Cucumber (Gy14) v2cgybcgybB041
CsGy1G012070Cucurbita maxima (Rimu)cgybcmaB002
CsGy1G012070Cucurbita maxima (Rimu)cgybcmaB074
CsGy1G012070Cucurbita maxima (Rimu)cgybcmaB086
CsGy1G012070Cucurbita maxima (Rimu)cgybcmaB108
CsGy1G012070Cucurbita maxima (Rimu)cgybcmaB142
CsGy1G012070Cucurbita moschata (Rifu)cgybcmoB003
CsGy1G012070Cucurbita moschata (Rifu)cgybcmoB068
CsGy1G012070Cucurbita moschata (Rifu)cgybcmoB082
CsGy1G012070Cucurbita moschata (Rifu)cgybcmoB101
CsGy1G012070Cucurbita moschata (Rifu)cgybcmoB133
CsGy1G012070Cucurbita pepo (Zucchini)cgybcpeB018
CsGy1G012070Cucurbita pepo (Zucchini)cgybcpeB066
CsGy1G012070Cucurbita pepo (Zucchini)cgybcpeB070
CsGy1G012070Cucurbita pepo (Zucchini)cgybcpeB102
CsGy1G012070Cucurbita pepo (Zucchini)cgybcpeB140
CsGy1G012070Cucumber (Chinese Long) v2cgybcuB031
CsGy1G012070Cucumber (Chinese Long) v2cgybcuB040
CsGy1G012070Bottle gourd (USVL1VR-Ls)cgyblsiB015
CsGy1G012070Bottle gourd (USVL1VR-Ls)cgyblsiB046
CsGy1G012070Melon (DHL92) v3.5.1cgybmeB076
CsGy1G012070Melon (DHL92) v3.6.1cgybmedB020
CsGy1G012070Melon (DHL92) v3.6.1cgybmedB072
CsGy1G012070Watermelon (Charleston Gray)cgybwcgB010
CsGy1G012070Watermelon (Charleston Gray)cgybwcgB013
CsGy1G012070Watermelon (97103) v1cgybwmB044
CsGy1G012070Watermelon (97103) v1cgybwmB077
CsGy1G012070Wild cucumber (PI 183967)cgybcpiB028
CsGy1G012070Wild cucumber (PI 183967)cgybcpiB037
CsGy1G012070Wild cucumber (PI 183967)cgybcpiB044
CsGy1G012070Silver-seed gourdcarcgybB0161
CsGy1G012070Silver-seed gourdcarcgybB0181
CsGy1G012070Silver-seed gourdcarcgybB0638
CsGy1G012070Silver-seed gourdcarcgybB0936
CsGy1G012070Cucumber (Chinese Long) v3cgybcucB034
CsGy1G012070Cucumber (Chinese Long) v3cgybcucB045
CsGy1G012070Watermelon (97103) v2cgybwmbB002
CsGy1G012070Watermelon (97103) v2cgybwmbB024
CsGy1G012070Watermelon (97103) v2cgybwmbB032
CsGy1G012070Wax gourdcgybwgoB020
CsGy1G012070Wax gourdcgybwgoB035
CsGy1G012070Wax gourdcgybwgoB070