CsaV3_1G012220 (gene) Cucumber (Chinese Long) v3

NameCsaV3_1G012220
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
Descriptionpentatricopeptide repeat-containing protein At2g20710, mitochondrial-like
Locationchr1 : 7592048 .. 7594434 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGCTCTTACAATCTCTAAAACCAATTAATTTGATTGCATTCCGGAGTGAATTCGTGAATTTCTACTCTACAGTTGTGAAGGATAACCTTTACAGAAGGATTTCTCCGGTGGGTGACCCTAATATCTCTGTAACTCCACTTCTTGATCAGTGGGTGTTAGAAGGCAGGCTTGTTCAGCAAGACGAACTTCGGCATATCATCAAGGAGCTTAGGGTTTACAAGCGGTTCAAACATGCTCTTGAGGTTTGATTTCTTATTCATATCTTCGTCGAAGTATGATGGTTTTCATTCGCTCTTATGCGAAATGCTTTTGTTGATTGATGCAAATGGGTCCCAACGATTTGGCATTGTTGATTTTTTAAGATGAATAATGTGTTCATCTATGTTCCAGCATGGTTTCTTCTCCTATGAAATTGCTAAGCCGCCAGGATTTTTTATTTGCTTTGTTGAAATAGACAGTATGTGATAATTGGCGTGGAGTTGCATTGTTTCATGGATAAATGTCTTTTCGTAATTAACCAACTTCGAGAAGGGATATGGCTATTCAAATGGTGATGAATAGTCTACACATATGTCTTTCTTCCCAAATTTTGTTCGCCGGTAAAACATTCTGCCTTTTGTAAATTTTGTTGTTTGTTTATTTTCACAAAGTATCATCCATTGAATCTTGTGTACAATGTGCCGCCTCCAAAAAAAAGTAATTAATTTTTTCTTGAACACAATTACTCATGAAACTTCGACATTAGAGTGAGTAACATTGTTTCTAATGTGAACCAAGTTTTTTATTTCCTGTTTTTTTGGTAATATTATCCATATAACAGCTCTAACCAATGAATTAATGAAATTAGTTCGTCAGGAAACAACTCTAGACTCTTAAGGGAGATGATTTTATTTCTATTTCCTCGTAGATTACATGTCATTCTTGATTCGATCTCAAATTCATGTCTGATAACTATTTTTTGTCATTGATAGTTTGTCTTGCTGTTCTGCCAATGCTTTTTTTTTTCTACTGTAGAGTTCACTTTCATGGAACTAGCTTGCTTTTCCTCTCAATTCATAACTGGCAAGAATAACGAATACATTGGAGAGCATTTGCTTTCCATGTTGTTCTTTCTGACATATCAAAGAAGCCACATTGTGTAGAGTTCTTGGCTTATTGATTTCATATTAAATTGAACTGTTCATTGTGTTTTACCCAGTATTATTGTTTTGACCCATCTCTTCAAACCCCCTTTCATCTTCAGATATCAAAGTGGATGAGTGATAAAAGATACTTTCCTTTATCGACTGCTGATATCGCAATACGGATGAATTTGATCTTAAGAGTTCATGGGTTGGAACAAGTGGAAGATTATTTCGATAACATGCCTAGTCAGTTGAAAAGGTACCAAGTTCATATAGCTCTTCTTAACTGCTATGCGCATGAAAAGTGCGTGGATAAAGCCAATGCCTTCATGCAGAAAATTAAGGAAATGGGTTTTGCTAATTCTCCTCTTCCATACAATATCATGATGAATCTTTATCACCAAATTGGAGAATTTGAGAGATTAGATTCTCTGTTGAAAGAAATGAAAGAAAGGGGTGTTTATTATGATCGATTCACATACAGCATCCGAATAAGTGCATATGCTGCTGCATCTGATTTTAGGGGAATCGAAAAGATCATGGAACAAATGGAATCAAATCCGAGTATTGTTCTAGATTGGAACTGTTATGTCATTGCTGCAAATGCTTACAATAAGGTTGGCTTAATAGACAAATCCATTTCCATGCTGAAGAAATCAGAAGGTCTCCTAGCAAATGTCAAAAAGAAAGGTTTTGCATTTAATGTCTACCTCAAACTATATGCCAGAAATGGAAAGAAAGACGAGATACACCGCATTTGGAATCTCTACAAGAAAGAAAAAATCTTCAACAAAGGTTTCATCAGCATGATAACATCACTTTTGATATTAGACGATATCAAAGGTGCAGAGCGTATTTACAAGGAATGGGAGACCAGGAAACTGTCATACGACTTGCGGATTCCAAACTTGTTGGTTGATGCGTATTGTAGAGCTGGTCTAATGGAGAAAGCTGAAGTGCTTCTAAATGAGATGGTGATTGTAAGACGCAAGTTTTCGGTCGAGTCGTGGTGCTATTTAGCGAGTGGATATCTTCAGAAAGATCAACTACCTCAGGCAGTTGAGACACTGAAGTTAGCAGCCAGTGTGTGTCCATCACGACTGAACTACGTCAAGGAAATTTTGGCAGCATTTTTGGATGGGAAGCAAGATGTGGAAGAAACTGAGAAAGTGGTTAATTTGTTGAGGGAAAAAGATGACTCTCATCCTGCTCGTGCTCATGATTACATTGTTGGAGCGATTATGACCGAATCCGCCTAA

mRNA sequence

ATGAAGCTCTTACAATCTCTAAAACCAATTAATTTGATTGCATTCCGGAGTGAATTCGTGAATTTCTACTCTACAGTTGTGAAGGATAACCTTTACAGAAGGATTTCTCCGGTGGGTGACCCTAATATCTCTGTAACTCCACTTCTTGATCAGTGGGTGTTAGAAGGCAGGCTTGTTCAGCAAGACGAACTTCGGCATATCATCAAGGAGCTTAGGGTTTACAAGCGGTTCAAACATGCTCTTGAGATATCAAAGTGGATGAGTGATAAAAGATACTTTCCTTTATCGACTGCTGATATCGCAATACGGATGAATTTGATCTTAAGAGTTCATGGGTTGGAACAAGTGGAAGATTATTTCGATAACATGCCTAGTCAGTTGAAAAGGTACCAAGTTCATATAGCTCTTCTTAACTGCTATGCGCATGAAAAGTGCGTGGATAAAGCCAATGCCTTCATGCAGAAAATTAAGGAAATGGGTTTTGCTAATTCTCCTCTTCCATACAATATCATGATGAATCTTTATCACCAAATTGGAGAATTTGAGAGATTAGATTCTCTGTTGAAAGAAATGAAAGAAAGGGGTGTTTATTATGATCGATTCACATACAGCATCCGAATAAGTGCATATGCTGCTGCATCTGATTTTAGGGGAATCGAAAAGATCATGGAACAAATGGAATCAAATCCGAGTATTGTTCTAGATTGGAACTGTTATGTCATTGCTGCAAATGCTTACAATAAGGTTGGCTTAATAGACAAATCCATTTCCATGCTGAAGAAATCAGAAGGTCTCCTAGCAAATGTCAAAAAGAAAGGTTTTGCATTTAATGTCTACCTCAAACTATATGCCAGAAATGGAAAGAAAGACGAGATACACCGCATTTGGAATCTCTACAAGAAAGAAAAAATCTTCAACAAAGGTTTCATCAGCATGATAACATCACTTTTGATATTAGACGATATCAAAGGTGCAGAGCGTATTTACAAGGAATGGGAGACCAGGAAACTGTCATACGACTTGCGGATTCCAAACTTGTTGGTTGATGCGTATTGTAGAGCTGGTCTAATGGAGAAAGCTGAAGTGCTTCTAAATGAGATGGTGATTGTAAGACGCAAGTTTTCGGTCGAGTCGTGGTGCTATTTAGCGAGTGGATATCTTCAGAAAGATCAACTACCTCAGGCAGTTGAGACACTGAAGTTAGCAGCCAGTGTGTGTCCATCACGACTGAACTACGTCAAGGAAATTTTGGCAGCATTTTTGGATGGGAAGCAAGATGTGGAAGAAACTGAGAAAGTGGTTAATTTGTTGAGGGAAAAAGATGACTCTCATCCTGCTCGTGCTCATGATTACATTGTTGGAGCGATTATGACCGAATCCGCCTAA

Coding sequence (CDS)

ATGAAGCTCTTACAATCTCTAAAACCAATTAATTTGATTGCATTCCGGAGTGAATTCGTGAATTTCTACTCTACAGTTGTGAAGGATAACCTTTACAGAAGGATTTCTCCGGTGGGTGACCCTAATATCTCTGTAACTCCACTTCTTGATCAGTGGGTGTTAGAAGGCAGGCTTGTTCAGCAAGACGAACTTCGGCATATCATCAAGGAGCTTAGGGTTTACAAGCGGTTCAAACATGCTCTTGAGATATCAAAGTGGATGAGTGATAAAAGATACTTTCCTTTATCGACTGCTGATATCGCAATACGGATGAATTTGATCTTAAGAGTTCATGGGTTGGAACAAGTGGAAGATTATTTCGATAACATGCCTAGTCAGTTGAAAAGGTACCAAGTTCATATAGCTCTTCTTAACTGCTATGCGCATGAAAAGTGCGTGGATAAAGCCAATGCCTTCATGCAGAAAATTAAGGAAATGGGTTTTGCTAATTCTCCTCTTCCATACAATATCATGATGAATCTTTATCACCAAATTGGAGAATTTGAGAGATTAGATTCTCTGTTGAAAGAAATGAAAGAAAGGGGTGTTTATTATGATCGATTCACATACAGCATCCGAATAAGTGCATATGCTGCTGCATCTGATTTTAGGGGAATCGAAAAGATCATGGAACAAATGGAATCAAATCCGAGTATTGTTCTAGATTGGAACTGTTATGTCATTGCTGCAAATGCTTACAATAAGGTTGGCTTAATAGACAAATCCATTTCCATGCTGAAGAAATCAGAAGGTCTCCTAGCAAATGTCAAAAAGAAAGGTTTTGCATTTAATGTCTACCTCAAACTATATGCCAGAAATGGAAAGAAAGACGAGATACACCGCATTTGGAATCTCTACAAGAAAGAAAAAATCTTCAACAAAGGTTTCATCAGCATGATAACATCACTTTTGATATTAGACGATATCAAAGGTGCAGAGCGTATTTACAAGGAATGGGAGACCAGGAAACTGTCATACGACTTGCGGATTCCAAACTTGTTGGTTGATGCGTATTGTAGAGCTGGTCTAATGGAGAAAGCTGAAGTGCTTCTAAATGAGATGGTGATTGTAAGACGCAAGTTTTCGGTCGAGTCGTGGTGCTATTTAGCGAGTGGATATCTTCAGAAAGATCAACTACCTCAGGCAGTTGAGACACTGAAGTTAGCAGCCAGTGTGTGTCCATCACGACTGAACTACGTCAAGGAAATTTTGGCAGCATTTTTGGATGGGAAGCAAGATGTGGAAGAAACTGAGAAAGTGGTTAATTTGTTGAGGGAAAAAGATGACTCTCATCCTGCTCGTGCTCATGATTACATTGTTGGAGCGATTATGACCGAATCCGCCTAA

Protein sequence

MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMNLYHQIGEFERLDSLLKEMKERGVYYDRFTYSIRISAYAAASDFRGIEKIMEQMESNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKKEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAFLDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA
BLAST of CsaV3_1G012220 vs. NCBI nr
Match: XP_011653157.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis sativus] >KGN64672.1 hypothetical protein Csa_1G073790 [Cucumis sativus])

HSP 1 Score: 785.8 bits (2028), Expect = 8.0e-224
Identity = 459/461 (99.57%), Postives = 460/461 (99.78%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ
Sbjct: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX
Sbjct: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMITSL +LDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF
Sbjct: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA
Sbjct: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 461

BLAST of CsaV3_1G012220 vs. NCBI nr
Match: XP_008442434.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis melo])

HSP 1 Score: 672.5 bits (1734), Expect = 9.9e-190
Identity = 398/453 (87.86%), Postives = 420/453 (92.72%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQS+KP NLIA R   VNFYST VKDNLYRRISPVGDPNISV P+LDQWVLEGR+VQ
Sbjct: 1   MKLLQSVKPTNLIALRRGLVNFYSTFVKDNLYRRISPVGDPNISVIPVLDQWVLEGRVVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           ++EL+ IIKELRVYKRFKHALEISKWMSDKRY PLST D+A RMNLILRVHGLEQVEDYF
Sbjct: 61  KEELQKIIKELRVYKRFKHALEISKWMSDKRYLPLSTDDVATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +NMPSQLKRY VHIALLNCYAHEKCVDKANAF+QKIKEMG+A S LPYNIMM  XXXXXX
Sbjct: 121 NNMPSQLKRYHVHIALLNCYAHEKCVDKANAFLQKIKEMGYAKSTLPYNIMMNLXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX N SIVLDWNCYV
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKS+SMLKKSEG LA  KKKG AFNVYLKLYARNGKKDE+HRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSVSMLKKSEGRLATDKKKGHAFNVYLKLYARNGKKDEVHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMI SLLILDDI+GAE IYKEWET+KLSYD+RIPNLLVDAYCRAGL+EKA
Sbjct: 301 KEKIFNKGFISMIRSLLILDDIRGAEDIYKEWETQKLSYDVRIPNLLVDAYCRAGLIEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           E L+NE+V VR KFSVESWCYLASGYLQKDQLPQAVETLK AAS+CPS LNYVKEILAAF
Sbjct: 361 EELVNEIVNVRGKFSVESWCYLASGYLQKDQLPQAVETLKKAASLCPSELNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIV 454
            DGKQDVEE EKVVNLLREKD+ +PARAHD +V
Sbjct: 421 SDGKQDVEEAEKVVNLLREKDNLNPARAHDILV 453

BLAST of CsaV3_1G012220 vs. NCBI nr
Match: XP_008442448.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis melo])

HSP 1 Score: 640.2 bits (1650), Expect = 5.4e-180
Identity = 371/467 (79.44%), Postives = 403/467 (86.30%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQS+K INLIAFR E VNFYST V D+LYRR+SPVGDPNIS+ P+LDQWV EGR VQ
Sbjct: 1   MKLLQSVKSINLIAFRRELVNFYSTFVNDDLYRRLSPVGDPNISIVPILDQWVSEGRPVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             ELR IIKELRVYKR+KHALE+SKWMSDK   PLSTADIA RMNLILRVHGLEQVEDYF
Sbjct: 61  IVELRLIIKELRVYKRYKHALEMSKWMSDKVCLPLSTADIATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +NMPS+LKRYQVHIALLNCYAHEKCVDKANA +QKIKE+GFA +P PYNIMM        
Sbjct: 121 NNMPSKLKRYQVHIALLNCYAHEKCVDKANALLQKIKELGFATTPHPYNIMMNLYHQIGE 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX            N SIVLDWNCYV
Sbjct: 181 FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAGIEKITEQMESNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLA-NVKKKGFAFNVYLKLYARNGKKDEIHRIWNLY 300
           +AA+AY KVGLIDKSISMLKKSE LLA   + K  AFN+YL LYA+NGKKDE +RIWNLY
Sbjct: 241 VAADAYYKVGLIDKSISMLKKSEELLAKTAENKCHAFNIYLTLYAKNGKKDETYRIWNLY 300

Query: 301 KKEKIFNKGFISMITSLLILDDIKGAERIYKE-----WETRKLSYDLRIPNLLVDAYCRA 360
           KKEK+FNKGFISMITSLLILDDIKGA RI +E     WET+KLSYDLRIPNLLVDAYCRA
Sbjct: 301 KKEKVFNKGFISMITSLLILDDIKGARRICEEWETQVWETQKLSYDLRIPNLLVDAYCRA 360

Query: 361 GLMEKAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVK 420
           GLME+AEVL+ EM+ VRRKFSV+SWCY+ASGYLQKDQLP+AVETLK+AAS+CPS+L+YVK
Sbjct: 361 GLMEEAEVLVYEMMTVRRKFSVKSWCYIASGYLQKDQLPEAVETLKIAASLCPSKLDYVK 420

Query: 421 EILAAFLDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           EILAAFLDGKQDVEE EKVVNLLREKD+SHPAR HDY   AIMTESA
Sbjct: 421 EILAAFLDGKQDVEEVEKVVNLLREKDNSHPARGHDY---AIMTESA 464

BLAST of CsaV3_1G012220 vs. NCBI nr
Match: XP_022140106.1 (pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Momordica charantia])

HSP 1 Score: 545.0 bits (1403), Expect = 2.4e-151
Identity = 322/453 (71.08%), Postives = 380/453 (83.89%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLL   K ++ +AFR    +FYS + +D+LYRRISPVGDPN+SV P+LDQWV EGR VQ
Sbjct: 1   MKLLVPGKQVHPVAFRRILGHFYSMIARDSLYRRISPVGDPNVSVIPVLDQWVREGRPVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           ++EL+ IIKELR+YKR KHALEISKWM DKRY PLS+ DIA RMNLILRVHGLEQVEDYF
Sbjct: 61  REELQKIIKELRIYKRSKHALEISKWMGDKRYLPLSSVDIATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +++PSQLK++QVH+ALLNCYAHEKCVDKANA +QKIKEMGF  +PLPYNIMM       X
Sbjct: 121 NSIPSQLKKFQVHVALLNCYAHEKCVDKANAILQKIKEMGFDGAPLPYNIMMNLYYQIGX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX +P IV DW+CYV
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLDPRIVPDWSCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAAN Y KVGL+DKS+SML+KSE LLA  K++G AF++ LKLYA +GKKDE+HRIW LYK
Sbjct: 241 IAANGYLKVGLVDKSLSMLRKSEALLATAKRRGSAFDILLKLYAESGKKDELHRIWKLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEK++NKG++SM+ SLLILDDI+ AE I+K+WET KLS DLRIPN+L++AYCR GLMEKA
Sbjct: 301 KEKVYNKGYMSMMRSLLILDDIEAAEHIFKDWETWKLSNDLRIPNMLIEAYCRRGLMEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           E L+N+ V  + KFSV SWCYLA+ Y+QKDQL   VE LK AA++CP  LN+ KEILA F
Sbjct: 361 EALINKAVTGKSKFSVHSWCYLANAYIQKDQLQHTVEALKKAATLCPPELNHFKEILATF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIV 454
           L+GKQDV+E EKVV LLR + +S    AHD ++
Sbjct: 421 LEGKQDVKEAEKVVGLLRAEANS--LFAHDVLI 451

BLAST of CsaV3_1G012220 vs. NCBI nr
Match: XP_011653151.1 (PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 540.0 bits (1390), Expect = 7.7e-150
Identity = 283/358 (79.05%), Postives = 289/358 (80.73%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQSLKPINLIAFR EFVNFYSTVVKD+LYRRISPVGDPNISVTPLLDQWVLE  LVQ
Sbjct: 1   MKLLQSLKPINLIAFRREFVNFYSTVVKDSLYRRISPVGDPNISVTPLLDQWVLESGLVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIA RMNLILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +NMPSQLKR QVHIALLNCYAHEK  DKANA +QKIKEMGFA + LPYNI M        
Sbjct: 121 NNMPSQLKRCQVHIALLNCYAHEKYADKANAVLQKIKEMGFAKTSLPYNITM---NLYHQ 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
                                                          XN SIVLDWNCYV
Sbjct: 181 IGEFERLDSPLKETDVDHDQFTYTTRLSAYATAFDFTGIEKIMEQMEXNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIH IWNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHLIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLME 359
           KEKIFNKGFISMITSL +LDDIKGAERIYKEWET+KLSYDLRIPNLLVDAYCRAGLME
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETQKLSYDLRIPNLLVDAYCRAGLME 355

BLAST of CsaV3_1G012220 vs. TAIR10
Match: AT2G20710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 271.6 bits (693), Expect = 9.2e-73
Identity = 148/416 (35.58%), Postives = 231/416 (55.53%), Query Frame = 0

Query: 29  DNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMS 88
           D L RR++  GDP+ S+  +LD W+ +G LV+  EL  IIK LR + RF HAL+IS WMS
Sbjct: 38  DTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMS 97

Query: 89  DKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDK 148
           + R   +S  D+AIR++LI +V GL + E +F+ +P + + Y ++ ALLNCYA +K + K
Sbjct: 98  EHRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHK 157

Query: 149 ANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 208
           A    Q++KE+GF    LPYN+M+                                    
Sbjct: 158 AEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLH 217

Query: 209 XXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLAN 268
                               +  + LDW  Y   AN Y K GL +K++ ML+KSE ++ N
Sbjct: 218 AYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMV-N 277

Query: 269 VKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK-EKIFNKGFISMITSLLILDDIKGAER 328
            +K+  A+ V +  Y   GKK+E++R+W+LYK+ +  +N G+IS+I++LL +DDI+  E+
Sbjct: 278 AQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEK 337

Query: 329 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 388
           I +EWE     +D+RIP+LL+  YC+ G+MEKAE ++N +V   R     +W  LA GY 
Sbjct: 338 IMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYK 397

Query: 389 QKDQLPQAVETLKLAASVCPSRLNYVKEILAA---FLDGKQDVEETEKVVNLLREK 441
              ++ +AVE  K A  V        + +L +   +L+G++D+E   K++ LL E+
Sbjct: 398 MAGKMEKAVEKWKRAIEVSKPGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLLSER 452

BLAST of CsaV3_1G012220 vs. TAIR10
Match: AT4G21705.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 221.1 bits (562), Expect = 1.4e-57
Identity = 136/407 (33.42%), Postives = 204/407 (50.12%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           M +L+ + P NLIA R  + N    V K  LY +ISP+GDP  SV P L  WV  G+ V 
Sbjct: 1   MNILRRI-PANLIASRYYYTN---RVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVS 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             EL  I+ +LR  KRF HALE+SKWM++      S  + A+ ++LI RV+G    E+YF
Sbjct: 61  VAELIRIVHDLRRRKRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +N+  Q K  + + ALLNCY  ++ V+K+    +K+KEMGF  S L YN +M        
Sbjct: 121 ENLKEQYKNDKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQ 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
                                                              I +DWN Y 
Sbjct: 181 HEKVPKVLEEMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYA 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           +AA  Y   G  D+++ +LK SE  L   KK G  +N  + LYAR GKK E+ R+W+L K
Sbjct: 241 VAAKFYIDGGDCDRAVELLKMSENRLE--KKDGEGYNHLITLYARLGKKIEVLRLWDLEK 300

Query: 301 K--EKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLME 360
              ++  N+ +++++ SL+ +D +  AE +  EW++    YD R+PN ++  Y    + E
Sbjct: 301 DVCKRRINQDYLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEE 360

Query: 361 KAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASV 406
           KAE +L ++    +  + ESW  +A+ Y +K  L  A + +K A  V
Sbjct: 361 KAEAMLEDLARRGKATTPESWELVATAYAEKGTLENAFKCMKTALGV 401

BLAST of CsaV3_1G012220 vs. TAIR10
Match: AT1G02150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 193.7 bits (491), Expect = 2.4e-49
Identity = 181/420 (43.10%), Postives = 263/420 (62.62%), Query Frame = 0

Query: 31  LYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDK 90
           +Y++IS +  P +    +L+QW   GR + + EL  ++KELR YKR   ALE+  WM+++
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 91  -RYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDKA 150
              F LS +D AI+++LI +V G+   E++F  +P   K  +V+ +LLN Y   K  +KA
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 151 NAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210
            A +  +++ G+A  PLP+N+  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 189 EALLNTMRDKGYALHPLPFNVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 248

Query: 211 XXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANV 270
           XXXXXXXXXXXXXXXXXX + SI  +W  +   A  Y K+G  +K+   L+K E  +   
Sbjct: 249 XXXXXXXXXXXXXXXXXXSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITG- 308

Query: 271 KKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLLILDDIKGAER 330
            +    ++  L LY   G K E++R+W++YK     I N G+ ++++SL+ + DI+GAE+
Sbjct: 309 -RNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEK 368

Query: 331 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 390
           +Y+EW   K SYD RIPNLL++AY +   +E AE L + MV +  K S  +W  LA G+ 
Sbjct: 369 VYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHT 428

Query: 391 QKDQLPQAVETLKLAASVCPSRLNYVKEILA-----AFLDGKQDVEETEKVVNLLREKDD 443
           +K  + +A+  L+ A S   S  N+  ++L         + + DV   E V+ LLR+  D
Sbjct: 429 RKRCISEALTCLRNAFSAEGSS-NWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGD 485

BLAST of CsaV3_1G012220 vs. TAIR10
Match: AT1G28020.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 170.6 bits (431), Expect = 2.2e-42
Identity = 109/345 (31.59%), Postives = 168/345 (48.70%), Query Frame = 0

Query: 34  RISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDKRYF 93
           RI+     N  + P+L+QW  +G  V    +R IIK+LR   +   AL++S+WMS ++  
Sbjct: 41  RITDALHRNAQIIPVLEQWRQQGNQVNPSHVRVIIKKLRDSDQSLQALQVSEWMSKEKIC 100

Query: 94  PLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAH-EKCVDKANAF 153
            L   D A R++LI  V GLE+ E +F+++P   +   V+ +LLN YA  +K + KA A 
Sbjct: 101 NLIPEDFAARLHLIENVVGLEEAEKFFESIPKNARGDSVYTSLLNSYARSDKTLCKAEAT 160

Query: 154 MQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 213
            QK++++G    P+PYN MM                                        
Sbjct: 161 FQKMRDLGLLLRPVPYNAMMSLYSALKNREKVEELLLEMKDNDVEADNVTVNNVLKLYSA 220

Query: 214 XXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANVKKK 273
                              I L+W+  +  A AY +     K++ ML+ +E L+     K
Sbjct: 221 VCDVTEMEKFLNKWEGIHGIKLEWHTTLDMAKAYLRARSSGKAMKMLRLTEQLVDQKSLK 280

Query: 274 GFAFNVYLKLYARNGKKDEIHRIWNLYKKE--KIFNKGFISMITSLLILDDIKGAERIYK 333
             A++  +KLY   G ++E+ R+W LYK +  +  N G+ ++I SLL +DDI GAE IYK
Sbjct: 281 S-AYDHLMKLYGEAGNREEVLRVWKLYKSKIGERDNNGYRTVIRSLLKVDDIVGAEEIYK 340

Query: 334 EWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFS 376
            WE+  L +D RIP +L   Y   G+ EKAE L+N   I  R+ +
Sbjct: 341 VWESLPLEFDHRIPTMLASGYRDRGMTEKAEKLMNSKTIKDRRMN 384

BLAST of CsaV3_1G012220 vs. TAIR10
Match: AT5G27460.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 154.8 bits (390), Expect = 1.3e-37
Identity = 103/382 (26.96%), Postives = 179/382 (46.86%), Query Frame = 0

Query: 24  STVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEI 83
           S+V   N  + I     P  SVT LL + +  G  V   ELR I K L    R+  AL++
Sbjct: 33  SSVANRNSLKEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQM 92

Query: 84  SKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNM----PSQLKRYQVHIALLNC 143
            +WM +++    S  DIA+R++LI++ HGL+Q E+YF+ +     S       ++ LL  
Sbjct: 93  MEWMENQKDIEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKSAYLPLLRA 152

Query: 144 YAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXX 203
           Y   K V +A A M+K+  +GF  +P P+N MM                           
Sbjct: 153 YVKNKMVKEAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRN 212

Query: 204 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISML 263
                                        + S+ + W+     AN Y K G  +K+  +L
Sbjct: 213 VLSYNLWMNACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVL 272

Query: 264 KKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLL 323
           + +E +L    + G+ F   + LYA  G K+ + R+W + K    +I    +I +++SL+
Sbjct: 273 EDAEKMLNRSNRLGYFF--LITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLV 332

Query: 324 ILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVE 383
              D++ AER++ EWE +  +YD+R+ N+L+ AY R G + KAE L   ++      + +
Sbjct: 333 KTGDLEEAERVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGCVLERGGTPNYK 392

Query: 384 SWCYLASGYLQKDQLPQAVETL 400
           +W  L  G+++ + + +A++ +
Sbjct: 393 TWEILMEGWVKCENMEKAIDAM 412

BLAST of CsaV3_1G012220 vs. Swiss-Prot
Match: sp|Q9SKU6|PP166_ARATH (Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At2g20710 PE=2 SV=1)

HSP 1 Score: 271.6 bits (693), Expect = 1.7e-71
Identity = 148/416 (35.58%), Postives = 231/416 (55.53%), Query Frame = 0

Query: 29  DNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMS 88
           D L RR++  GDP+ S+  +LD W+ +G LV+  EL  IIK LR + RF HAL+IS WMS
Sbjct: 38  DTLQRRVARSGDPSASIIKVLDGWLDQGNLVKTSELHSIIKMLRKFSRFSHALQISDWMS 97

Query: 89  DKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDK 148
           + R   +S  D+AIR++LI +V GL + E +F+ +P + + Y ++ ALLNCYA +K + K
Sbjct: 98  EHRVHEISEGDVAIRLDLIAKVGGLGEAEKFFETIPMERRNYHLYGALLNCYASKKVLHK 157

Query: 149 ANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 208
           A    Q++KE+GF    LPYN+M+                                    
Sbjct: 158 AEQVFQEMKELGFLKGCLPYNVMLNLYVRTGKYTMVEKLLREMEDETVKPDIFTVNTRLH 217

Query: 209 XXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLAN 268
                               +  + LDW  Y   AN Y K GL +K++ ML+KSE ++ N
Sbjct: 218 AYSVVSDVEGMEKFLMRCEADQGLHLDWRTYADTANGYIKAGLTEKALEMLRKSEQMV-N 277

Query: 269 VKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK-EKIFNKGFISMITSLLILDDIKGAER 328
            +K+  A+ V +  Y   GKK+E++R+W+LYK+ +  +N G+IS+I++LL +DDI+  E+
Sbjct: 278 AQKRKHAYEVLMSFYGAAGKKEEVYRLWSLYKELDGFYNTGYISVISALLKMDDIEEVEK 337

Query: 329 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 388
           I +EWE     +D+RIP+LL+  YC+ G+MEKAE ++N +V   R     +W  LA GY 
Sbjct: 338 IMEEWEAGHSLFDIRIPHLLITGYCKKGMMEKAEEVVNILVQKWRVEDTSTWERLALGYK 397

Query: 389 QKDQLPQAVETLKLAASVCPSRLNYVKEILAA---FLDGKQDVEETEKVVNLLREK 441
              ++ +AVE  K A  V        + +L +   +L+G++D+E   K++ LL E+
Sbjct: 398 MAGKMEKAVEKWKRAIEVSKPGWRPHQVVLMSCVDYLEGQRDMEGLRKILRLLSER 452

BLAST of CsaV3_1G012220 vs. Swiss-Prot
Match: sp|Q84JR3|PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 221.1 bits (562), Expect = 2.6e-56
Identity = 136/407 (33.42%), Postives = 204/407 (50.12%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           M +L+ + P NLIA R  + N    V K  LY +ISP+GDP  SV P L  WV  G+ V 
Sbjct: 1   MNILRRI-PANLIASRYYYTN---RVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVS 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             EL  I+ +LR  KRF HALE+SKWM++      S  + A+ ++LI RV+G    E+YF
Sbjct: 61  VAELIRIVHDLRRRKRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +N+  Q K  + + ALLNCY  ++ V+K+    +K+KEMGF  S L YN +M        
Sbjct: 121 ENLKEQYKNDKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQ 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
                                                              I +DWN Y 
Sbjct: 181 HEKVPKVLEEMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYA 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           +AA  Y   G  D+++ +LK SE  L   KK G  +N  + LYAR GKK E+ R+W+L K
Sbjct: 241 VAAKFYIDGGDCDRAVELLKMSENRLE--KKDGEGYNHLITLYARLGKKIEVLRLWDLEK 300

Query: 301 K--EKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLME 360
              ++  N+ +++++ SL+ +D +  AE +  EW++    YD R+PN ++  Y    + E
Sbjct: 301 DVCKRRINQDYLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEE 360

Query: 361 KAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASV 406
           KAE +L ++    +  + ESW  +A+ Y +K  L  A + +K A  V
Sbjct: 361 KAEAMLEDLARRGKATTPESWELVATAYAEKGTLENAFKCMKTALGV 401

BLAST of CsaV3_1G012220 vs. Swiss-Prot
Match: sp|Q8LPS6|PPR3_ARATH (Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX=3702 GN=At1g02150 PE=2 SV=2)

HSP 1 Score: 193.7 bits (491), Expect = 4.4e-48
Identity = 181/420 (43.10%), Postives = 263/420 (62.62%), Query Frame = 0

Query: 31  LYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDK 90
           +Y++IS +  P +    +L+QW   GR + + EL  ++KELR YKR   ALE+  WM+++
Sbjct: 69  IYKKISLMEKPELGAASVLNQWEKAGRKLTKWELCRVVKELRKYKRANQALEVYDWMNNR 128

Query: 91  -RYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCVDKA 150
              F LS +D AI+++LI +V G+   E++F  +P   K  +V+ +LLN Y   K  +KA
Sbjct: 129 GERFRLSASDAAIQLDLIGKVRGIPDAEEFFLQLPENFKDRRVYGSLLNAYVRAKSREKA 188

Query: 151 NAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 210
            A +  +++ G+A  PLP+N+  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 189 EALLNTMRDKGYALHPLPFNVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 248

Query: 211 XXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANV 270
           XXXXXXXXXXXXXXXXXX + SI  +W  +   A  Y K+G  +K+   L+K E  +   
Sbjct: 249 XXXXXXXXXXXXXXXXXXSDVSIYPNWTTFSTMATMYIKMGETEKAEDALRKVEARITG- 308

Query: 271 KKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLLILDDIKGAER 330
            +    ++  L LY   G K E++R+W++YK     I N G+ ++++SL+ + DI+GAE+
Sbjct: 309 -RNRIPYHYLLSLYGSLGNKKELYRVWHVYKSVVPSIPNLGYHALVSSLVRMGDIEGAEK 368

Query: 331 IYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGYL 390
           +Y+EW   K SYD RIPNLL++AY +   +E AE L + MV +  K S  +W  LA G+ 
Sbjct: 369 VYEEWLPVKSSYDPRIPNLLMNAYVKNDQLETAEGLFDHMVEMGGKPSSSTWEILAVGHT 428

Query: 391 QKDQLPQAVETLKLAASVCPSRLNYVKEILA-----AFLDGKQDVEETEKVVNLLREKDD 443
           +K  + +A+  L+ A S   S  N+  ++L         + + DV   E V+ LLR+  D
Sbjct: 429 RKRCISEALTCLRNAFSAEGSS-NWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGD 485

BLAST of CsaV3_1G012220 vs. Swiss-Prot
Match: sp|Q9C7F1|PPR61_ARATH (Putative pentatricopeptide repeat-containing protein At1g28020 OS=Arabidopsis thaliana OX=3702 GN=At1g28020 PE=3 SV=2)

HSP 1 Score: 170.6 bits (431), Expect = 4.0e-41
Identity = 109/345 (31.59%), Postives = 168/345 (48.70%), Query Frame = 0

Query: 34  RISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEISKWMSDKRYF 93
           RI+     N  + P+L+QW  +G  V    +R IIK+LR   +   AL++S+WMS ++  
Sbjct: 41  RITDALHRNAQIIPVLEQWRQQGNQVNPSHVRVIIKKLRDSDQSLQALQVSEWMSKEKIC 100

Query: 94  PLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAH-EKCVDKANAF 153
            L   D A R++LI  V GLE+ E +F+++P   +   V+ +LLN YA  +K + KA A 
Sbjct: 101 NLIPEDFAARLHLIENVVGLEEAEKFFESIPKNARGDSVYTSLLNSYARSDKTLCKAEAT 160

Query: 154 MQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 213
            QK++++G    P+PYN MM                                        
Sbjct: 161 FQKMRDLGLLLRPVPYNAMMSLYSALKNREKVEELLLEMKDNDVEADNVTVNNVLKLYSA 220

Query: 214 XXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLLANVKKK 273
                              I L+W+  +  A AY +     K++ ML+ +E L+     K
Sbjct: 221 VCDVTEMEKFLNKWEGIHGIKLEWHTTLDMAKAYLRARSSGKAMKMLRLTEQLVDQKSLK 280

Query: 274 GFAFNVYLKLYARNGKKDEIHRIWNLYKKE--KIFNKGFISMITSLLILDDIKGAERIYK 333
             A++  +KLY   G ++E+ R+W LYK +  +  N G+ ++I SLL +DDI GAE IYK
Sbjct: 281 S-AYDHLMKLYGEAGNREEVLRVWKLYKSKIGERDNNGYRTVIRSLLKVDDIVGAEEIYK 340

Query: 334 EWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFS 376
            WE+  L +D RIP +L   Y   G+ EKAE L+N   I  R+ +
Sbjct: 341 VWESLPLEFDHRIPTMLASGYRDRGMTEKAEKLMNSKTIKDRRMN 384

BLAST of CsaV3_1G012220 vs. Swiss-Prot
Match: sp|Q3E911|PP400_ARATH (Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX=3702 GN=At5g27460 PE=2 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 2.3e-36
Identity = 103/382 (26.96%), Postives = 179/382 (46.86%), Query Frame = 0

Query: 24  STVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELRHIIKELRVYKRFKHALEI 83
           S+V   N  + I     P  SVT LL + +  G  V   ELR I K L    R+  AL++
Sbjct: 33  SSVANRNSLKEILRKNGPRRSVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQM 92

Query: 84  SKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNM----PSQLKRYQVHIALLNC 143
            +WM +++    S  DIA+R++LI++ HGL+Q E+YF+ +     S       ++ LL  
Sbjct: 93  MEWMENQKDIEFSVYDIALRLDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKSAYLPLLRA 152

Query: 144 YAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXX 203
           Y   K V +A A M+K+  +GF  +P P+N MM                           
Sbjct: 153 YVKNKMVKEAEALMEKLNGLGFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRN 212

Query: 204 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISML 263
                                        + S+ + W+     AN Y K G  +K+  +L
Sbjct: 213 VLSYNLWMNACCEVSGVAAVETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVL 272

Query: 264 KKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKK--EKIFNKGFISMITSLL 323
           + +E +L    + G+ F   + LYA  G K+ + R+W + K    +I    +I +++SL+
Sbjct: 273 EDAEKMLNRSNRLGYFF--LITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLV 332

Query: 324 ILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVE 383
              D++ AER++ EWE +  +YD+R+ N+L+ AY R G + KAE L   ++      + +
Sbjct: 333 KTGDLEEAERVFSEWEAQCFNYDVRVSNVLLGAYVRNGEIRKAESLHGCVLERGGTPNYK 392

Query: 384 SWCYLASGYLQKDQLPQAVETL 400
           +W  L  G+++ + + +A++ +
Sbjct: 393 TWEILMEGWVKCENMEKAIDAM 412

BLAST of CsaV3_1G012220 vs. TrEMBL
Match: tr|A0A0A0LV44|A0A0A0LV44_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G073790 PE=4 SV=1)

HSP 1 Score: 785.8 bits (2028), Expect = 5.3e-224
Identity = 459/461 (99.57%), Postives = 460/461 (99.78%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ
Sbjct: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF
Sbjct: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX
Sbjct: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMITSL +LDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA
Sbjct: 301 KEKIFNKGFISMITSLFVLDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF
Sbjct: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA
Sbjct: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 461

BLAST of CsaV3_1G012220 vs. TrEMBL
Match: tr|A0A1S3B572|A0A1S3B572_CUCME (pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103486303 PE=4 SV=1)

HSP 1 Score: 672.5 bits (1734), Expect = 6.5e-190
Identity = 398/453 (87.86%), Postives = 420/453 (92.72%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQS+KP NLIA R   VNFYST VKDNLYRRISPVGDPNISV P+LDQWVLEGR+VQ
Sbjct: 1   MKLLQSVKPTNLIALRRGLVNFYSTFVKDNLYRRISPVGDPNISVIPVLDQWVLEGRVVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
           ++EL+ IIKELRVYKRFKHALEISKWMSDKRY PLST D+A RMNLILRVHGLEQVEDYF
Sbjct: 61  KEELQKIIKELRVYKRFKHALEISKWMSDKRYLPLSTDDVATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +NMPSQLKRY VHIALLNCYAHEKCVDKANAF+QKIKEMG+A S LPYNIMM  XXXXXX
Sbjct: 121 NNMPSQLKRYHVHIALLNCYAHEKCVDKANAFLQKIKEMGYAKSTLPYNIMMNLXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX N SIVLDWNCYV
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK 300
           IAANAYNKVGLIDKS+SMLKKSEG LA  KKKG AFNVYLKLYARNGKKDE+HRIWNLYK
Sbjct: 241 IAANAYNKVGLIDKSVSMLKKSEGRLATDKKKGHAFNVYLKLYARNGKKDEVHRIWNLYK 300

Query: 301 KEKIFNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKA 360
           KEKIFNKGFISMI SLLILDDI+GAE IYKEWET+KLSYD+RIPNLLVDAYCRAGL+EKA
Sbjct: 301 KEKIFNKGFISMIRSLLILDDIRGAEDIYKEWETQKLSYDVRIPNLLVDAYCRAGLIEKA 360

Query: 361 EVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAAF 420
           E L+NE+V VR KFSVESWCYLASGYLQKDQLPQAVETLK AAS+CPS LNYVKEILAAF
Sbjct: 361 EELVNEIVNVRGKFSVESWCYLASGYLQKDQLPQAVETLKKAASLCPSELNYVKEILAAF 420

Query: 421 LDGKQDVEETEKVVNLLREKDDSHPARAHDYIV 454
            DGKQDVEE EKVVNLLREKD+ +PARAHD +V
Sbjct: 421 SDGKQDVEEAEKVVNLLREKDNLNPARAHDILV 453

BLAST of CsaV3_1G012220 vs. TrEMBL
Match: tr|A0A1S3B6G4|A0A1S3B6G4_CUCME (pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103486307 PE=4 SV=1)

HSP 1 Score: 640.2 bits (1650), Expect = 3.6e-180
Identity = 371/467 (79.44%), Postives = 403/467 (86.30%), Query Frame = 0

Query: 1   MKLLQSLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQ 60
           MKLLQS+K INLIAFR E VNFYST V D+LYRR+SPVGDPNIS+ P+LDQWV EGR VQ
Sbjct: 1   MKLLQSVKSINLIAFRRELVNFYSTFVNDDLYRRLSPVGDPNISIVPILDQWVSEGRPVQ 60

Query: 61  QDELRHIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYF 120
             ELR IIKELRVYKR+KHALE+SKWMSDK   PLSTADIA RMNLILRVHGLEQVEDYF
Sbjct: 61  IVELRLIIKELRVYKRYKHALEMSKWMSDKVCLPLSTADIATRMNLILRVHGLEQVEDYF 120

Query: 121 DNMPSQLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXX 180
           +NMPS+LKRYQVHIALLNCYAHEKCVDKANA +QKIKE+GFA +P PYNIMM        
Sbjct: 121 NNMPSKLKRYQVHIALLNCYAHEKCVDKANALLQKIKELGFATTPHPYNIMMNLYHQIGE 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYV 240
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX            N SIVLDWNCYV
Sbjct: 181 FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAGIEKITEQMESNTSIVLDWNCYV 240

Query: 241 IAANAYNKVGLIDKSISMLKKSEGLLA-NVKKKGFAFNVYLKLYARNGKKDEIHRIWNLY 300
           +AA+AY KVGLIDKSISMLKKSE LLA   + K  AFN+YL LYA+NGKKDE +RIWNLY
Sbjct: 241 VAADAYYKVGLIDKSISMLKKSEELLAKTAENKCHAFNIYLTLYAKNGKKDETYRIWNLY 300

Query: 301 KKEKIFNKGFISMITSLLILDDIKGAERIYKE-----WETRKLSYDLRIPNLLVDAYCRA 360
           KKEK+FNKGFISMITSLLILDDIKGA RI +E     WET+KLSYDLRIPNLLVDAYCRA
Sbjct: 301 KKEKVFNKGFISMITSLLILDDIKGARRICEEWETQVWETQKLSYDLRIPNLLVDAYCRA 360

Query: 361 GLMEKAEVLLNEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVK 420
           GLME+AEVL+ EM+ VRRKFSV+SWCY+ASGYLQKDQLP+AVETLK+AAS+CPS+L+YVK
Sbjct: 361 GLMEEAEVLVYEMMTVRRKFSVKSWCYIASGYLQKDQLPEAVETLKIAASLCPSKLDYVK 420

Query: 421 EILAAFLDGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAIMTESA 462
           EILAAFLDGKQDVEE EKVVNLLREKD+SHPAR HDY   AIMTESA
Sbjct: 421 EILAAFLDGKQDVEEVEKVVNLLREKDNSHPARGHDY---AIMTESA 464

BLAST of CsaV3_1G012220 vs. TrEMBL
Match: tr|A0A0A0LSC2|A0A0A0LSC2_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G073800 PE=4 SV=1)

HSP 1 Score: 471.1 bits (1211), Expect = 2.9e-129
Identity = 255/367 (69.48%), Postives = 264/367 (71.93%), Query Frame = 0

Query: 87  MSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPSQLKRYQVHIALLNCYAHEKCV 146
           MSDKRYFPLSTADIA RMNLILRVHGLEQVEDYF+NMPSQLKR QVHIALLNCYAHEK  
Sbjct: 1   MSDKRYFPLSTADIATRMNLILRVHGLEQVEDYFNNMPSQLKRCQVHIALLNCYAHEKYA 60

Query: 147 DKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 206
           DKANA +QKIKEMGFA + LPYNI M                                  
Sbjct: 61  DKANAVLQKIKEMGFAKTSLPYNITMNLYHQIGEFERLDSPLKETDVDHDQFTY------ 120

Query: 207 XXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANAYNKVGLIDKSISMLKKSEGLL 266
                                   +  L+WNCYVIAANAYNKVGLIDKSISMLKKSEGLL
Sbjct: 121 ------------------------TTRLNWNCYVIAANAYNKVGLIDKSISMLKKSEGLL 180

Query: 267 ANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYKKEKIFNKGFISMITSLLILDDIKGAE 326
           ANVKKKGFAFNVYLKLYARNGKKDEIH IWNLYKKEKIFNKGFISMITSL +LDDIKGAE
Sbjct: 181 ANVKKKGFAFNVYLKLYARNGKKDEIHLIWNLYKKEKIFNKGFISMITSLFVLDDIKGAE 240

Query: 327 RIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLLNEMVIVRRKFSVESWCYLASGY 386
           RIYKEWET+KLSYDLRIPNLLVDAYCRA                            ASGY
Sbjct: 241 RIYKEWETQKLSYDLRIPNLLVDAYCRA----------------------------ASGY 300

Query: 387 LQKDQLPQAVETLKLAASVCPSRLNYVKEILAAFLDGKQDVEETEKVVNLLREKDDSHPA 446
           LQKDQLPQAVETLK AAS+CPS LNY KEILAAFLDGKQD EETEKVVNLLREKDDSHPA
Sbjct: 301 LQKDQLPQAVETLKKAASLCPSELNYAKEILAAFLDGKQDEEETEKVVNLLREKDDSHPA 309

Query: 447 RAHDYIV 454
           RAHD +V
Sbjct: 361 RAHDILV 309

BLAST of CsaV3_1G012220 vs. TrEMBL
Match: tr|A0A2N9I6A5|A0A2N9I6A5_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS47431 PE=4 SV=1)

HSP 1 Score: 408.7 bits (1049), Expect = 1.8e-110
Identity = 278/455 (61.10%), Postives = 342/455 (75.16%), Query Frame = 0

Query: 6   SLKPINLIAFRSEFVNFYSTVVKDNLYRRISPVGDPNISVTPLLDQWVLEGRLVQQDELR 65
           SL+P  L   +    +F+S+    +LYRRIS +G PN+SV PLLDQWV EGR V ++EL+
Sbjct: 10  SLRP--LFPSQKLLCHFFSSNALHSLYRRISQLGGPNVSVVPLLDQWVQEGRPVPKEELQ 69

Query: 66  HIIKELRVYKRFKHALEISKWMSDKRYFPLSTADIAIRMNLILRVHGLEQVEDYFDNMPS 125
            IIKELRVYKRF HALEIS+WMSDKRY  LS  DIA RMNLI RVHGLEQVE+YF+N+P+
Sbjct: 70  RIIKELRVYKRFNHALEISQWMSDKRYIILSCGDIATRMNLIFRVHGLEQVENYFNNIPT 129

Query: 126 QLKRYQVHIALLNCYAHEKCVDKANAFMQKIKEMGFANSPLPYNIMMXXXXXXXXXXXXX 185
            +K + V+ ALLNCYAHEK V+KA   MQ +K+MG            XXXXXXXXXXXXX
Sbjct: 130 NMKGFTVYTALLNCYAHEKSVEKAEIVMQSMKDMGLVXXXXXXXXXXXXXXXXXXXXXXX 189

Query: 186 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPSIVLDWNCYVIAANA 245
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX+P +VL+WN Y IAAN 
Sbjct: 190 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDPQLVLNWNSYSIAANG 249

Query: 246 YNKVGLIDKSISMLKKSEGLLANVKKKGFAFNVYLKLYARNGKKDEIHRIWNLYK-KEKI 305
           Y KVGL+DK+++M+KKSEGL+ N KKK FAF++ LK YA  GKKDE++RIW LYK KEKI
Sbjct: 250 YLKVGLLDKALAMVKKSEGLIDNAKKKNFAFDLLLKQYAEIGKKDELYRIWKLYKEKEKI 309

Query: 306 FNKGFISMITSLLILDDIKGAERIYKEWETRKLSYDLRIPNLLVDAYCRAGLMEKAEVLL 365
           +NKG+ISMI+SLL  DDI+GAE I++EWE+RKLSYD R+PNLL++AY + GL+ KAE LL
Sbjct: 310 YNKGYISMISSLLAFDDIEGAENIFEEWESRKLSYDFRVPNLLINAYGQKGLLAKAEALL 369

Query: 366 NEMVIVRRKFSVESWCYLASGYLQKDQLPQAVETLKLAASVCPSRLNYVKEILAA---FL 425
           N  +    + S +SW YLASGYL  +Q+P+A+E +K A +VCP      KE LA    +L
Sbjct: 370 NRGMTRGGQPSADSWYYLASGYLDNNQIPKALEVMKKAVAVCPPGWRPSKETLATCLEYL 429

Query: 426 DGKQDVEETEKVVNLLREKDDSHPARAHDYIVGAI 457
           +GK D E  +  +N LR +    P   H+ ++  I
Sbjct: 430 EGKGDTERADDFINSLRVQ-HIFPTSVHNRLLNYI 461

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011653157.18.0e-22499.57PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-... [more]
XP_008442434.19.9e-19087.86PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-... [more]
XP_008442448.15.4e-18079.44PREDICTED: pentatricopeptide repeat-containing protein At2g20710, mitochondrial-... [more]
XP_022140106.12.4e-15171.08pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like [Momor... [more]
XP_011653151.17.7e-15079.05PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At2g... [more]
Match NameE-valueIdentityDescription
AT2G20710.19.2e-7335.58Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G21705.11.4e-5733.42Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G02150.12.4e-4943.10Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G28020.12.2e-4231.59Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G27460.11.3e-3726.96Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9SKU6|PP166_ARATH1.7e-7135.58Pentatricopeptide repeat-containing protein At2g20710, mitochondrial OS=Arabidop... [more]
sp|Q84JR3|PP334_ARATH2.6e-5633.42Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop... [more]
sp|Q8LPS6|PPR3_ARATH4.4e-4843.10Pentatricopeptide repeat-containing protein At1g02150 OS=Arabidopsis thaliana OX... [more]
sp|Q9C7F1|PPR61_ARATH4.0e-4131.59Putative pentatricopeptide repeat-containing protein At1g28020 OS=Arabidopsis th... [more]
sp|Q3E911|PP400_ARATH2.3e-3626.96Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LV44|A0A0A0LV44_CUCSA5.3e-22499.57Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G073790 PE=4 SV=1[more]
tr|A0A1S3B572|A0A1S3B572_CUCME6.5e-19087.86pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like OS=Cuc... [more]
tr|A0A1S3B6G4|A0A1S3B6G4_CUCME3.6e-18079.44pentatricopeptide repeat-containing protein At2g20710, mitochondrial-like OS=Cuc... [more]
tr|A0A0A0LSC2|A0A0A0LSC2_CUCSA2.9e-12969.48Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G073800 PE=4 SV=1[more]
tr|A0A2N9I6A5|A0A2N9I6A5_FAGSY1.8e-11061.10Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS47431 PE=4 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0009058 biosynthetic process
biological_process GO:0055114 oxidation-reduction process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0005506 iron ion binding
molecular_function GO:0016491 oxidoreductase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_1G012220.1CsaV3_1G012220.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 309..449
e-value: 4.4E-10
score: 41.6
coord: 176..308
e-value: 4.9E-15
score: 57.9
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 49..162
e-value: 3.8E-5
score: 25.0
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 312..418
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 345..368
e-value: 1.8E-6
score: 27.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 345..368
e-value: 1.0E-4
score: 20.2
coord: 168..199
e-value: 1.7E-4
score: 19.5
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 168..211
e-value: 2.9E-8
score: 33.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 135..161
e-value: 0.27
score: 11.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 129..163
score: 6.358
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 164..198
score: 9.24
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 340..374
score: 8.342
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 199..229
score: 7.706
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 272..306
score: 6.599
NoneNo IPR availablePANTHERPTHR24015:SF1625SUBFAMILY NOT NAMEDcoord: 29..442
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 29..442

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CsaV3_1G012220CSPI01G12050Wild cucumber (PI 183967)cpicucB001
CsaV3_1G012220Cucsa.126620Cucumber (Gy14) v1cgycucB174
CsaV3_1G012220CsGy1G012070Cucumber (Gy14) v2cgybcucB002
CsaV3_1G012220MELO3C002597Melon (DHL92) v3.5.1cucmeB024
CsaV3_1G012220MELO3C002597.2Melon (DHL92) v3.6.1cucmedB022
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CsaV3_1G012220Cucumber (Gy14) v1cgycucB018
CsaV3_1G012220Cucumber (Gy14) v1cgycucB165
CsaV3_1G012220Cucurbita maxima (Rimu)cmacucB0011
CsaV3_1G012220Cucurbita maxima (Rimu)cmacucB0249
CsaV3_1G012220Cucurbita maxima (Rimu)cmacucB0465
CsaV3_1G012220Cucurbita maxima (Rimu)cmacucB0515
CsaV3_1G012220Cucurbita maxima (Rimu)cmacucB1005
CsaV3_1G012220Cucurbita moschata (Rifu)cmocucB0000
CsaV3_1G012220Cucurbita moschata (Rifu)cmocucB0235
CsaV3_1G012220Cucurbita moschata (Rifu)cmocucB0457
CsaV3_1G012220Cucurbita moschata (Rifu)cmocucB0503
CsaV3_1G012220Cucurbita moschata (Rifu)cmocucB0989
CsaV3_1G012220Cucurbita pepo (Zucchini)cpecucB0031
CsaV3_1G012220Cucurbita pepo (Zucchini)cpecucB0585
CsaV3_1G012220Cucurbita pepo (Zucchini)cpecucB0624
CsaV3_1G012220Cucurbita pepo (Zucchini)cpecucB0668
CsaV3_1G012220Cucurbita pepo (Zucchini)cpecucB0812
CsaV3_1G012220Cucurbita pepo (Zucchini)cpecucB0953
CsaV3_1G012220Wild cucumber (PI 183967)cpicucB202
CsaV3_1G012220Wild cucumber (PI 183967)cpicucB310
CsaV3_1G012220Bottle gourd (USVL1VR-Ls)cuclsiB020
CsaV3_1G012220Bottle gourd (USVL1VR-Ls)cuclsiB022
CsaV3_1G012220Bottle gourd (USVL1VR-Ls)cuclsiB056
CsaV3_1G012220Melon (DHL92) v3.5.1cucmeB092
CsaV3_1G012220Melon (DHL92) v3.6.1cucmedB085
CsaV3_1G012220Watermelon (Charleston Gray)cucwcgB012
CsaV3_1G012220Watermelon (Charleston Gray)cucwcgB015
CsaV3_1G012220Watermelon (Charleston Gray)cucwcgB038
CsaV3_1G012220Watermelon (97103) v1cucwmB013
CsaV3_1G012220Watermelon (97103) v1cucwmB050
CsaV3_1G012220Watermelon (97103) v1cucwmB088
CsaV3_1G012220Watermelon (97103) v2cucwmbB001
CsaV3_1G012220Watermelon (97103) v2cucwmbB022
CsaV3_1G012220Watermelon (97103) v2cucwmbB030
CsaV3_1G012220Wax gourdcucwgoB026
CsaV3_1G012220Wax gourdcucwgoB041
CsaV3_1G012220Wax gourdcucwgoB080
CsaV3_1G012220Cucumber (Chinese Long) v3cuccucB030
CsaV3_1G012220Cucumber (Chinese Long) v3cuccucB044
CsaV3_1G012220Silver-seed gourdcarcucB0170
CsaV3_1G012220Silver-seed gourdcarcucB0193
CsaV3_1G012220Silver-seed gourdcarcucB0672
CsaV3_1G012220Silver-seed gourdcarcucB0991