CsGy4G017810 (gene) Cucumber (Gy14) v2

Name: CsGy4G017810
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v2)
Description: Pentatricopeptide repeat-containing family protein
Location: Chr4 : 22998337 .. 23000463 (-)
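
The location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3); the function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr4 : 22998337 .. 23000463 (-)'.

    Returns (seqid, start, end, strand). The format is inferred from this
    record only; other records may need a more permissive pattern.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both end points count.
    return end - start + 1


seqid, start, end, strand = parse_location("Chr4 : 22998337 .. 23000463 (-)")
length = span_length(start, end)
print(seqid, strand, length)  # Chr4 - 2127
```

Under that assumption the feature spans 2127 bp, which is a multiple of three (709 codons including the stop codon).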
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCTCTGGATTAATGTTCTTCCGGTTGGTGCTAAATCGTTTCTACTGTTCTAATTTTATAATTTCACGTAATTCTCTTATTACCCGGTATTCTCGATTGGGTCAAATCGAAAAGGCTCGGGTTGTGTTCGATGAAATGCGTGACAAAAACATCATTTCATGGAACTCAATTGTTGCTGGGTACTTTCAGAACAAACGGCCTCAGGAAGCCCAGAACATGTTTGATAAAATGTCTGAGAGGAATACTATATCTTGGAATGGTTTAGTTTCTGGGTATATTAACAATGGGATGATCAATGAAGCTAGGGAAGTGTTTGATAGAATGCCTGAGAGGAATGTTGTTTCCTGGACTGCAATGGTCAGAGGGTACGTGAAGGAAGGTATGATTTCTGAGGCAGAGACACTTTTTTGGCAAATGCCTGAAAAGAATGTAGTATCTTGGACGGTGATGTTGGGTGGCCTTCTTCAAGAAGGACGGATTGATGAGGCTTGTAGGCTTTTCGATATGATGCCTGAGAAGGATGTGGTGACACGAACTAATATGATTGGAGGGTATTGCCAAGTAGGCCGTTTAGTGGAAGCTCGTATGCTTTTCGATGAGATGCCTCGTCGGAATGTCGTGTCATGGACTACGATGATAACAGGATATGTGCAGAACCAACAAGTGGATATTGCTAGAAAGCTGTTTGAAGTCATGCCAGAGAAAAATGAGGTTTCATGGACTGCTATGCTGAAAGGCTACACCAATTGTGGGCGGCTTGATGAGGCTTCAGAACTTTTTAATGCAATGCCAATTAAGTCGGTTGTTGCTTGTAATGCAATGATTCTTTGTTTTGGCCAGAATGGGGAGGTCCCAAAAGCAAGACAAGTGTTTGATCAAATGAGAGAAAAGGATGAAGGAACATGGAGTGCTATGATTAAAGTGTATGAACGGAAAGGCCTTGAGTTAGATGCACTTGAGTTGTTTCGTATGATGCAAAGAGAAGGAATAAGGCCAAATTTTCCTTCTTTGATCAGCGTTCTTTCCGTTTGTGCTGGCTTGGCTAATCTTGATCATGGTAGAGAGATACATGCCCAGCTTGTGAGATCCCAATTTGACCTTGATGTCTATGTTGCCTCCGTTCTTCTCTCAATGTACATAAAGTGTGGCAATCTGGCGAAAGCAAAACAAGTATTTGATAGGTTTGCAGTTAAGGATGTTGTCATGTGGAACTCGATTATCACAGGATATGCCCAACATGGTCTAGGGGTGGAAGCATTACGAGTTTTCCACGATATGCATTTTTCAGGTATCATGCCAGATGATGTCACATTTGTTGGAGTTCTTTCGGCATGTAGTTACACTGGCAATGTGAAAAAAGGCTTAGAAATTTTCAATTCCATGGAAACGAAGTATCAAGTGGAACAAAAAATTGAACATTATGCCTGCATGGTTGATTTGCTTGGTCGAGCTGGTAAACTAAATGAGGCAATGGATCTAATAGAAAAAATGCCAATGGAAGCCGATGCTATTATTTGGGGTGCTTTATTAGGTGCTTGCAGAACCCACATGAAATTGGATTTGGCTGAAGTTGCAGCCAAAAAGCTTCTAGTGCTTGAGCCTAAAAATGCAGGGCCTTTCATTTTGTTGTCAAATATCTATGCATCTCAAGGTAGATGGGATGATGTTGCTGAGTTGAGGAGAAATATGAGAGATAGGCGTGTGAGCAAGTACCCAGGCTGTAGCTGGATTGTTGTCGAGAAGAAAGTTCATAAGTTCACGGGAGGTGATAGCTCGGGACACCCTGAGCATTCTGAGATCAATAGAATATTAGAGTGGTTGTCTGGATTGCTAAGAGAGGCAGGATATTACCCAGATCAAAGTTTTGTGCTACATGACGTAGATGAAGAAGAGAAAGTGCAAAGCTTGGAGTACCATAGTGAGAAACTGGCTGTGGCATACGGACTTCTCAAAATACCAATAGGTATGCCGATTCGTGTAATGAAGAATCTCCGTGT
CTGTGGGGATTGCCACGCTGCAATTAAACTAATTGCGAAGGTTACAGGAAGAGAGATCATCTTGAGAGATGCTAATCGGTTCCATCATTTTAAGGATGGCTCATGCTCTTGTCGGGATTATTGGTGA

mRNA sequence

ATGTTCTCTGGATTAATGTTCTTCCGGTTGGTGCTAAATCGTTTCTACTGTTCTAATTTTATAATTTCACGTAATTCTCTTATTACCCGGTATTCTCGATTGGGTCAAATCGAAAAGGCTCGGGTTGTGTTCGATGAAATGCGTGACAAAAACATCATTTCATGGAACTCAATTGTTGCTGGGTACTTTCAGAACAAACGGCCTCAGGAAGCCCAGAACATGTTTGATAAAATGTCTGAGAGGAATACTATATCTTGGAATGGTTTAGTTTCTGGGTATATTAACAATGGGATGATCAATGAAGCTAGGGAAGTGTTTGATAGAATGCCTGAGAGGAATGTTGTTTCCTGGACTGCAATGGTCAGAGGGTACGTGAAGGAAGGTATGATTTCTGAGGCAGAGACACTTTTTTGGCAAATGCCTGAAAAGAATGTAGTATCTTGGACGGTGATGTTGGGTGGCCTTCTTCAAGAAGGACGGATTGATGAGGCTTGTAGGCTTTTCGATATGATGCCTGAGAAGGATGTGGTGACACGAACTAATATGATTGGAGGGTATTGCCAAGTAGGCCGTTTAGTGGAAGCTCGTATGCTTTTCGATGAGATGCCTCGTCGGAATGTCGTGTCATGGACTACGATGATAACAGGATATGTGCAGAACCAACAAGTGGATATTGCTAGAAAGCTGTTTGAAGTCATGCCAGAGAAAAATGAGGTTTCATGGACTGCTATGCTGAAAGGCTACACCAATTGTGGGCGGCTTGATGAGGCTTCAGAACTTTTTAATGCAATGCCAATTAAGTCGGTTGTTGCTTGTAATGCAATGATTCTTTGTTTTGGCCAGAATGGGGAGGTCCCAAAAGCAAGACAAGTGTTTGATCAAATGAGAGAAAAGGATGAAGGAACATGGAGTGCTATGATTAAAGTGTATGAACGGAAAGGCCTTGAGTTAGATGCACTTGAGTTGTTTCGTATGATGCAAAGAGAAGGAATAAGGCCAAATTTTCCTTCTTTGATCAGCGTTCTTTCCGTTTGTGCTGGCTTGGCTAATCTTGATCATGGTAGAGAGATACATGCCCAGCTTGTGAGATCCCAATTTGACCTTGATGTCTATGTTGCCTCCGTTCTTCTCTCAATGTACATAAAGTGTGGCAATCTGGCGAAAGCAAAACAAGTATTTGATAGGTTTGCAGTTAAGGATGTTGTCATGTGGAACTCGATTATCACAGGATATGCCCAACATGGTCTAGGGGTGGAAGCATTACGAGTTTTCCACGATATGCATTTTTCAGGTATCATGCCAGATGATGTCACATTTGTTGGAGTTCTTTCGGCATGTAGTTACACTGGCAATGTGAAAAAAGGCTTAGAAATTTTCAATTCCATGGAAACGAAGTATCAAGTGGAACAAAAAATTGAACATTATGCCTGCATGGTTGATTTGCTTGGTCGAGCTGGTAAACTAAATGAGGCAATGGATCTAATAGAAAAAATGCCAATGGAAGCCGATGCTATTATTTGGGGTGCTTTATTAGGTGCTTGCAGAACCCACATGAAATTGGATTTGGCTGAAGTTGCAGCCAAAAAGCTTCTAGTGCTTGAGCCTAAAAATGCAGGGCCTTTCATTTTGTTGTCAAATATCTATGCATCTCAAGGTAGATGGGATGATGTTGCTGAGTTGAGGAGAAATATGAGAGATAGGCGTGTGAGCAAGTACCCAGGCTGTAGCTGGATTGTTGTCGAGAAGAAAGTTCATAAGTTCACGGGAGGTGATAGCTCGGGACACCCTGAGCATTCTGAGATCAATAGAATATTAGAGTGGTTGTCTGGATTGCTAAGAGAGGCAGGATATTACCCAGATCAAAGTTTTGTGCTACATGACGTAGATGAAGAAGAGAAAGTGCAAAGCTTGGAGTACCATAGTGAGAAACTGGCTGTGGCATACGGACTTCTCAAAATACCAATAGGTATGCCGATTCGTGTAATGAAGAATCTCCGTGT
CTGTGGGGATTGCCACGCTGCAATTAAACTAATTGCGAAGGTTACAGGAAGAGAGATCATCTTGAGAGATGCTAATCGGTTCCATCATTTTAAGGATGGCTCATGCTCTTGTCGGGATTATTGGTGA

Coding sequence (CDS)

ATGTTCTCTGGATTAATGTTCTTCCGGTTGGTGCTAAATCGTTTCTACTGTTCTAATTTTATAATTTCACGTAATTCTCTTATTACCCGGTATTCTCGATTGGGTCAAATCGAAAAGGCTCGGGTTGTGTTCGATGAAATGCGTGACAAAAACATCATTTCATGGAACTCAATTGTTGCTGGGTACTTTCAGAACAAACGGCCTCAGGAAGCCCAGAACATGTTTGATAAAATGTCTGAGAGGAATACTATATCTTGGAATGGTTTAGTTTCTGGGTATATTAACAATGGGATGATCAATGAAGCTAGGGAAGTGTTTGATAGAATGCCTGAGAGGAATGTTGTTTCCTGGACTGCAATGGTCAGAGGGTACGTGAAGGAAGGTATGATTTCTGAGGCAGAGACACTTTTTTGGCAAATGCCTGAAAAGAATGTAGTATCTTGGACGGTGATGTTGGGTGGCCTTCTTCAAGAAGGACGGATTGATGAGGCTTGTAGGCTTTTCGATATGATGCCTGAGAAGGATGTGGTGACACGAACTAATATGATTGGAGGGTATTGCCAAGTAGGCCGTTTAGTGGAAGCTCGTATGCTTTTCGATGAGATGCCTCGTCGGAATGTCGTGTCATGGACTACGATGATAACAGGATATGTGCAGAACCAACAAGTGGATATTGCTAGAAAGCTGTTTGAAGTCATGCCAGAGAAAAATGAGGTTTCATGGACTGCTATGCTGAAAGGCTACACCAATTGTGGGCGGCTTGATGAGGCTTCAGAACTTTTTAATGCAATGCCAATTAAGTCGGTTGTTGCTTGTAATGCAATGATTCTTTGTTTTGGCCAGAATGGGGAGGTCCCAAAAGCAAGACAAGTGTTTGATCAAATGAGAGAAAAGGATGAAGGAACATGGAGTGCTATGATTAAAGTGTATGAACGGAAAGGCCTTGAGTTAGATGCACTTGAGTTGTTTCGTATGATGCAAAGAGAAGGAATAAGGCCAAATTTTCCTTCTTTGATCAGCGTTCTTTCCGTTTGTGCTGGCTTGGCTAATCTTGATCATGGTAGAGAGATACATGCCCAGCTTGTGAGATCCCAATTTGACCTTGATGTCTATGTTGCCTCCGTTCTTCTCTCAATGTACATAAAGTGTGGCAATCTGGCGAAAGCAAAACAAGTATTTGATAGGTTTGCAGTTAAGGATGTTGTCATGTGGAACTCGATTATCACAGGATATGCCCAACATGGTCTAGGGGTGGAAGCATTACGAGTTTTCCACGATATGCATTTTTCAGGTATCATGCCAGATGATGTCACATTTGTTGGAGTTCTTTCGGCATGTAGTTACACTGGCAATGTGAAAAAAGGCTTAGAAATTTTCAATTCCATGGAAACGAAGTATCAAGTGGAACAAAAAATTGAACATTATGCCTGCATGGTTGATTTGCTTGGTCGAGCTGGTAAACTAAATGAGGCAATGGATCTAATAGAAAAAATGCCAATGGAAGCCGATGCTATTATTTGGGGTGCTTTATTAGGTGCTTGCAGAACCCACATGAAATTGGATTTGGCTGAAGTTGCAGCCAAAAAGCTTCTAGTGCTTGAGCCTAAAAATGCAGGGCCTTTCATTTTGTTGTCAAATATCTATGCATCTCAAGGTAGATGGGATGATGTTGCTGAGTTGAGGAGAAATATGAGAGATAGGCGTGTGAGCAAGTACCCAGGCTGTAGCTGGATTGTTGTCGAGAAGAAAGTTCATAAGTTCACGGGAGGTGATAGCTCGGGACACCCTGAGCATTCTGAGATCAATAGAATATTAGAGTGGTTGTCTGGATTGCTAAGAGAGGCAGGATATTACCCAGATCAAAGTTTTGTGCTACATGACGTAGATGAAGAAGAGAAAGTGCAAAGCTTGGAGTACCATAGTGAGAAACTGGCTGTGGCATACGGACTTCTCAAAATACCAATAGGTATGCCGATTCGTGTAATGAAGAATCTCCGTGT
CTGTGGGGATTGCCACGCTGCAATTAAACTAATTGCGAAGGTTACAGGAAGAGAGATCATCTTGAGAGATGCTAATCGGTTCCATCATTTTAAGGATGGCTCATGCTCTTGTCGGGATTATTGGTGA

Protein sequence

MFSGLMFFRLVLNRFYCSNFIISRNSLITRYSRLGQIEKARVVFDEMRDKNIISWNSIVAGYFQNKRPQEAQNMFDKMSERNTISWNGLVSGYINNGMINEAREVFDRMPERNVVSWTAMVRGYVKEGMISEAETLFWQMPEKNVVSWTVMLGGLLQEGRIDEACRLFDMMPEKDVVTRTNMIGGYCQVGRLVEARMLFDEMPRRNVVSWTTMITGYVQNQQVDIARKLFEVMPEKNEVSWTAMLKGYTNCGRLDEASELFNAMPIKSVVACNAMILCFGQNGEVPKARQVFDQMREKDEGTWSAMIKVYERKGLELDALELFRMMQREGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLAKAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACSYTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIWGALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDRRVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHDVDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREIILRDANRFHHFKDGSCSCRDYW
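
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and uses only the first 18 and last 9 bases of the CDS listed above; `translate` is an illustrative helper, not a function from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA string; unknown codons become 'X'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


first_codons = translate("ATGTTCTCTGGATTAATG")  # start of the CDS
last_codons = translate("TATTGGTGA")            # end of the CDS
print(first_codons)  # MFSGLM
print(last_codons)   # YW*
```

The first six residues match the start of the protein sequence (MFSGLM), and the final codons give the terminal YW followed by a stop codon.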
BLAST of CsGy4G017810 vs. NCBI nr
Match: KGN54664.1 (hypothetical protein Csa_4G418570 [Cucumis sativus])

HSP 1 Score: 785.8 bits (2028), Expect = 1.2e-223
Identity = 381/381 (100.00%), Positives = 381/381 (100.00%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA
Sbjct: 394 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 453

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS
Sbjct: 454 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 513

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW
Sbjct: 514 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 573

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR
Sbjct: 574 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 633

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
           RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD
Sbjct: 634 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 693

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII
Sbjct: 694 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 753

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHFKDGSCSCRDYW
Sbjct: 754 LRDANRFHHFKDGSCSCRDYW 774

BLAST of CsGy4G017810 vs. NCBI nr
Match: XP_011653842.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like [Cucumis sativus])

HSP 1 Score: 785.8 bits (2028), Expect = 1.2e-223
Identity = 381/381 (100.00%), Positives = 381/381 (100.00%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA
Sbjct: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS
Sbjct: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW
Sbjct: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR
Sbjct: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
           RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD
Sbjct: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII
Sbjct: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHFKDGSCSCRDYW
Sbjct: 688 LRDANRFHHFKDGSCSCRDYW 708

BLAST of CsGy4G017810 vs. NCBI nr
Match: XP_008455763.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like [Cucumis melo])

HSP 1 Score: 767.7 bits (1981), Expect = 3.5e-218
Identity = 372/381 (97.64%), Positives = 374/381 (98.16%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA
Sbjct: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAKQVFDRFAVKDVVMWNSIITGYAQHGLG EALRVFHDMHFSGIMPDD+TFVGVLSACS
Sbjct: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGEEALRVFHDMHFSGIMPDDITFVGVLSACS 447

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           YTGNVKKGLEIF SMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW
Sbjct: 448 YTGNVKKGLEIFYSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           GALLGACRTHMKLDLAEVAAKKLL LEPKNAGPFILLSNIYASQGRW DVAELRRNMRDR
Sbjct: 508 GALLGACRTHMKLDLAEVAAKKLLYLEPKNAGPFILLSNIYASQGRWGDVAELRRNMRDR 567

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
            VSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD
Sbjct: 568 HVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKVQSLEYHSEKLAVAYGL KIP GMPIRVMKNLRVCGDCHAAIKLIAKVTGREII
Sbjct: 628 VDEEEKVQSLEYHSEKLAVAYGLFKIPKGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHFKDGSCSC+DYW
Sbjct: 688 LRDANRFHHFKDGSCSCQDYW 708

BLAST of CsGy4G017810 vs. NCBI nr
Match: XP_022158877.1 (pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like isoform X1 [Momordica charantia] >XP_022158878.1 pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like isoform X1 [Momordica charantia] >XP_022158879.1 pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like isoform X1 [Momordica charantia] >XP_022158880.1 pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like isoform X1 [Momordica charantia] >XP_022158881.1 pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like isoform X1 [Momordica charantia])

HSP 1 Score: 723.8 bits (1867), Expect = 5.7e-205
Identity = 345/381 (90.55%), Positives = 360/381 (94.49%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           REGIRPNFPSLISVLSVCA LANLDHGRE+HAQLVRSQFDLDVYVASVL++MY+KCGNL 
Sbjct: 328 REGIRPNFPSLISVLSVCASLANLDHGREVHAQLVRSQFDLDVYVASVLITMYVKCGNLV 387

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAKQVFDRF +KDVVMWNSIITGYAQHGLG EAL+VFHDMHFSG+MPDD+TFVGVLSACS
Sbjct: 388 KAKQVFDRFPIKDVVMWNSIITGYAQHGLGEEALQVFHDMHFSGVMPDDITFVGVLSACS 447

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           YTGNVKKGLEIFNSME KY VE+K EHYACMVDLLGRAGKLNEAM LIEKMPMEADAIIW
Sbjct: 448 YTGNVKKGLEIFNSMEPKYHVERKTEHYACMVDLLGRAGKLNEAMGLIEKMPMEADAIIW 507

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           GALLGACR HMKLDLAE+AAKKLL LEPKNAGP+ILLSNIYASQGRW DV ELRRNMRDR
Sbjct: 508 GALLGACRIHMKLDLAEIAAKKLLELEPKNAGPYILLSNIYASQGRWSDVVELRRNMRDR 567

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
            VSK PGCSWI VEKKVH FTGGDS GHPEHSEI RIL+WLSGLLREAGYYPD+SFVLHD
Sbjct: 568 SVSKSPGCSWIDVEKKVHMFTGGDSMGHPEHSEIMRILDWLSGLLREAGYYPDRSFVLHD 627

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKVQSL YHSEKLAVAYGLLKIP GMPIRVMKNLRVCGDCH AIKLIAKV+GREII
Sbjct: 628 VDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIRVMKNLRVCGDCHTAIKLIAKVSGREII 687

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHFKDGSCSC+DYW
Sbjct: 688 LRDANRFHHFKDGSCSCKDYW 708

BLAST of CsGy4G017810 vs. NCBI nr
Match: XP_023517429.1 (pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 719.5 bits (1856), Expect = 1.1e-203
Identity = 343/380 (90.26%), Positives = 360/380 (94.74%), Query Frame = 0

Query: 329 EGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLAK 388
           EGIRPNFPSLISVLSVCA LA LDHGREIHAQLVRSQFDLDVYVASVLL+MY+KCGNL K
Sbjct: 329 EGIRPNFPSLISVLSVCASLATLDHGREIHAQLVRSQFDLDVYVASVLLTMYVKCGNLVK 388

Query: 389 AKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACSY 448
           AKQ+FD+FA KDVVMWNSIITGYAQHGLG EAL+VFHDMHFS ++PDD+TF+GVLSACSY
Sbjct: 389 AKQIFDKFATKDVVMWNSIITGYAQHGLGEEALQVFHDMHFSRVVPDDITFIGVLSACSY 448

Query: 449 TGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIWG 508
           TGNVKKGLEIFNSME KYQVEQKIEHYACMVDLLGRAGKLNEAMDLIE MPMEADAIIWG
Sbjct: 449 TGNVKKGLEIFNSMEMKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIENMPMEADAIIWG 508

Query: 509 ALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDRR 568
           +LLGACRTHM+LDLAEVAAKKLL LEPKNAGP+ILLSNIYASQGRW+DVAELRR MRDR 
Sbjct: 509 SLLGACRTHMRLDLAEVAAKKLLQLEPKNAGPYILLSNIYASQGRWNDVAELRRTMRDRS 568

Query: 569 VSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHDV 628
           VSK PGCSWI V+KKVH FTGG+SSGHPEHSEI R LEWLSGLLRE GYYPD+SFVLHDV
Sbjct: 569 VSKSPGCSWIDVDKKVHMFTGGESSGHPEHSEIMRTLEWLSGLLREEGYYPDRSFVLHDV 628

Query: 629 DEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREIIL 688
           DEEEKVQSL YHSEKLAVAYGLLK+P GMPIRVMKNLRVCGDCH AIKLIAKVTGREIIL
Sbjct: 629 DEEEKVQSLGYHSEKLAVAYGLLKLPKGMPIRVMKNLRVCGDCHTAIKLIAKVTGREIIL 688

Query: 689 RDANRFHHFKDGSCSCRDYW 709
           RDANRFHHFKDGSCSCRDYW
Sbjct: 689 RDANRFHHFKDGSCSCRDYW 708
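
The identity and positive-match figures in each HSP header are counts over the aligned columns. As a rough illustration, the sketch below recomputes them for a single alignment row, using the last row of the Cucumis melo hit (XP_008455763.1) above. It assumes the usual BLAST midline convention (a letter for an identical residue, `+` for a conservative substitution, a space otherwise); `hsp_stats` and `percent` are hypothetical helper names.

```python
def hsp_stats(query, midline, sbjct):
    """Return (identities, positives, aligned_length) for one alignment row."""
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    # Identical columns: same residue in both rows, ignoring gap characters.
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    # Positive columns: anything the midline marks (letter or '+').
    positives = sum(1 for m in midline if m != " ")
    return identities, positives, len(query)


def percent(count, total):
    return round(100 * count / total, 2)


query = "LRDANRFHHFKDGSCSCRDYW"
midline = "LRDANRFHHFKDGSCSC+DYW"
sbjct = "LRDANRFHHFKDGSCSCQDYW"

identities, positives, length = hsp_stats(query, midline, sbjct)
print(identities, positives, length)  # 20 21 21

# The header percentages for the whole HSP follow the same arithmetic.
print(percent(372, 381), percent(374, 381))  # 97.64 98.16
```

Summing these per-row counts over all rows of an HSP would give the totals reported in its header line.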

BLAST of CsGy4G017810 vs. TAIR10
Match: AT1G56690.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 553.5 bits (1425), Expect = 1.9e-157
Identity = 258/381 (67.72%), Positives = 314/381 (82.41%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           ++G+RP+FPSLIS+LSVCA LA+L +GR++HA LVR QFD DVYVASVL++MY+KCG L 
Sbjct: 324 KQGVRPSFPSLISILSVCATLASLQYGRQVHAHLVRCQFDDDVYVASVLMTMYVKCGELV 383

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAK VFDRF+ KD++MWNSII+GYA HGLG EAL++FH+M  SG MP+ VT + +L+ACS
Sbjct: 384 KAKLVFDRFSSKDIIMWNSIISGYASHGLGEEALKIFHEMPSSGTMPNKVTLIAILTACS 443

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           Y G +++GLEIF SME+K+ V   +EHY+C VD+LGRAG++++AM+LIE M ++ DA +W
Sbjct: 444 YAGKLEEGLEIFESMESKFCVTPTVEHYSCTVDMLGRAGQVDKAMELIESMTIKPDATVW 503

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           GALLGAC+TH +LDLAEVAAKKL   EP NAG ++LLS+I AS+ +W DVA +R+NMR  
Sbjct: 504 GALLGACKTHSRLDLAEVAAKKLFENEPDNAGTYVLLSSINASRSKWGDVAVVRKNMRTN 563

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
            VSK+PGCSWI V KKVH FT G    HPE + I  +LE   GLLREAGY PD S VLHD
Sbjct: 564 NVSKFPGCSWIEVGKKVHMFTRGGIKNHPEQAMILMMLEKTDGLLREAGYSPDCSHVLHD 623

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKV SL  HSE+LAVAYGLLK+P G+PIRVMKNLRVCGDCHAAIKLI+KVT REII
Sbjct: 624 VDEEEKVDSLSRHSERLAVAYGLLKLPEGVPIRVMKNLRVCGDCHAAIKLISKVTEREII 683

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHF +G CSCRDYW
Sbjct: 684 LRDANRFHHFNNGECSCRDYW 704

BLAST of CsGy4G017810 vs. TAIR10
Match: AT1G09410.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 453.4 bits (1165), Expect = 2.6e-127
Identity = 220/382 (57.59%), Positives = 267/382 (69.90%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           ++G+RP FP+LIS+LSVCA LA+L HG+++HAQLVR QFD+DVYVASVL++MYIKCG L 
Sbjct: 324 KQGVRPTFPTLISILSVCASLASLHHGKQVHAQLVRCQFDVDVYVASVLMTMYIKCGELV 383

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSG-IMPDDVTFVGVLSAC 447
           K+K +FDRF  KD++MWNSII+GYA HGLG EAL+VF +M  SG   P++VTFV      
Sbjct: 384 KSKLIFDRFPSKDIIMWNSIISGYASHGLGEEALKVFCEMPLSGSTKPNEVTFVATXXXX 443

Query: 448 SYTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAII 507
                                                                +E DA +
Sbjct: 444 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTVEPDAAV 503

Query: 508 WGALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRD 567
           WG+LLGACRTH +LD+AE  AKKL+ +EP+N+G +ILLSN+YASQGRW DVAELR+ M+ 
Sbjct: 504 WGSLLGACRTHSQLDVAEFCAKKLIEIEPENSGTYILLSNMYASQGRWADVAELRKLMKT 563

Query: 568 RRVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLH 627
           R V K PGCSW  VE KVH FT G  + HPE   I +IL+ L GLLREAGY PD S+ LH
Sbjct: 564 RLVRKSPGCSWTEVENKVHAFTRGGINSHPEQESILKILDELDGLLREAGYNPDCSYALH 623

Query: 628 DVDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREI 687
           DVDEEEKV SL+YHSE+LAVAY LLK+  G+PIRVMKNLRVC DCH AIK+I+KV  REI
Sbjct: 624 DVDEEEKVNSLKYHSERLAVAYALLKLSEGIPIRVMKNLRVCSDCHTAIKIISKVKEREI 683

Query: 688 ILRDANRFHHFKDGSCSCRDYW 709
           ILRDANRFHHF++G CSC+DYW
Sbjct: 684 ILRDANRFHHFRNGECSCKDYW 705

BLAST of CsGy4G017810 vs. TAIR10
Match: AT4G02750.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 429.1 bits (1102), Expect = 5.3e-120
Identity = 205/380 (53.95%), Positives = 262/380 (68.95%), Query Frame = 0

Query: 329 EGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLAK 388
           EG R N  S  S LS CA +  L+ G+++H +LV+  ++   +V + LL MY KCG++ +
Sbjct: 403 EGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGSIEE 462

Query: 389 AKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACSY 448
           A  +F   A KD+V WN++I GY++HG G  ALR F  M   G+ PDD T V VLSACS+
Sbjct: 463 ANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSACSH 522

Query: 449 TGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIWG 508
           TG V KG + F +M   Y V    +HYACMVDLLGRAG L +A +L++ MP E DA IWG
Sbjct: 523 TGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWG 582

Query: 509 ALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDRR 568
            LLGA R H   +LAE AA K+  +EP+N+G ++LLSN+YAS GRW DV +LR  MRD+ 
Sbjct: 583 TLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMRDKG 642

Query: 569 VSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHDV 628
           V K PG SWI ++ K H F+ GD   HPE  EI   LE L   +++AGY    S VLHDV
Sbjct: 643 VKKVPGYSWIEIQNKTHTFSVGDEF-HPEKDEIFAFLEELDLRMKKAGYVSKTSVVLHDV 702

Query: 629 DEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREIIL 688
           +EEEK + + YHSE+LAVAYG++++  G PIRV+KNLRVC DCH AIK +A++TGR IIL
Sbjct: 703 EEEEKERMVRYHSERLAVAYGIMRVSSGRPIRVIKNLRVCEDCHNAIKYMARITGRLIIL 762

Query: 689 RDANRFHHFKDGSCSCRDYW 709
           RD NRFHHFKDGSCSC DYW
Sbjct: 763 RDNNRFHHFKDGSCSCGDYW 781

BLAST of CsGy4G017810 vs. TAIR10
Match: AT2G22070.1 (pentatricopeptide (PPR) repeat-containing protein)

HSP 1 Score: 400.2 bits (1027), Expect = 2.6e-111
Identity = 187/380 (49.21%), Positives = 261/380 (68.68%), Query Frame = 0

Query: 330 GIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLAKA 389
           G RPN  +L ++LSV + LA+L HG++IH   V+S     V V++ L++MY K GN+  A
Sbjct: 408 GQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSA 467

Query: 390 KQVFDRF-AVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACSY 449
            + FD     +D V W S+I   AQHG   EAL +F  M   G+ PD +T+VGV SAC++
Sbjct: 468 SRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTH 527

Query: 450 TGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIWG 509
            G V +G + F+ M+   ++   + HYACMVDL GRAG L EA + IEKMP+E D + WG
Sbjct: 528 AGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWG 587

Query: 510 ALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDRR 569
           +LL ACR H  +DL +VAA++LL+LEP+N+G +  L+N+Y++ G+W++ A++R++M+D R
Sbjct: 588 SLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGR 647

Query: 570 VSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHDV 629
           V K  G SWI V+ KVH F G +   HPE +EI   ++ +   +++ GY PD + VLHD+
Sbjct: 648 VKKEQGFSWIEVKHKVHVF-GVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDL 707

Query: 630 DEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREIIL 689
           +EE K Q L +HSEKLA+A+GL+  P    +R+MKNLRVC DCH AIK I+K+ GREII+
Sbjct: 708 EEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREIIV 767

Query: 690 RDANRFHHFKDGSCSCRDYW 709
           RD  RFHHFKDG CSCRDYW
Sbjct: 768 RDTTRFHHFKDGFCSCRDYW 786

BLAST of CsGy4G017810 vs. TAIR10
Match: AT4G16835.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 388.3 bits (996), Expect = 1.0e-107
Identity = 192/380 (50.53%), Positives = 248/380 (65.26%), Query Frame = 0

Query: 329 EGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLAK 388
           EGIRPN   L S L  C+ L+ L  GR+IH  + +S    DV   + L+SMY KCG L  
Sbjct: 278 EGIRPNSSGLSSALLGCSELSALQLGRQIHQIVSKSTLCNDVTALTSLISMYCKCGELGD 337

Query: 389 AKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACSY 448
           A ++F+    KDVV WN++I+GYAQHG   +AL +F +M  + I PD +TFV VL AC++
Sbjct: 338 AWKLFEVMKKKDVVAWNAMISGYAQHGNADKALCLFREMIDNKIRPDWITFVAVLLACNH 397

Query: 449 TGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIWG 508
            G V  G+  F SM   Y+VE + +HY CMVDLLGRAGKL EA+ LI  MP    A ++G
Sbjct: 398 AGLVNIGMAYFESMVRDYKVEPQPDHYTCMVDLLGRAGKLEEALKLIRSMPFRPHAAVFG 457

Query: 509 ALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDRR 568
            LLGACR H  ++LAE AA+KLL L  +NA  ++ L+NIYAS+ RW+DVA +R+ M++  
Sbjct: 458 TLLGACRVHKNVELAEFAAEKLLQLNSQNAAGYVQLANIYASKNRWEDVARVRKRMKESN 517

Query: 569 VSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHDV 628
           V K PG SWI +  KVH F   D   HPE                  GY P+  F LH+V
Sbjct: 518 VVKVPGYSWIEIRNKVHHFRSSDRI-HPELXXXXXXXXXXXXXXXXXGYKPELEFALHNV 577

Query: 629 DEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREIIL 688
           +EE+K + L +HSEKLAVA+G +K+P G  I+V KNLR+CGDCH AIK I+++  REII+
Sbjct: 578 EEEQKEKLLLWHSEKLAVAFGCIKLPQGSQIQVFKNLRICGDCHKAIKFISEIEKREIIV 637

Query: 689 RDANRFHHFKDGSCSCRDYW 709
           RD  RFHHFKDGSCSC DYW
Sbjct: 638 RDTTRFHHFKDGSCSCGDYW 656

BLAST of CsGy4G017810 vs. Swiss-Prot
Match: sp|Q9FXB9|PPR84_ARATH (Pentatricopeptide repeat-containing protein At1g56690, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H69 PE=2 SV=1)

HSP 1 Score: 553.5 bits (1425), Expect = 3.4e-156
Identity = 258/381 (67.72%), Positives = 314/381 (82.41%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           ++G+RP+FPSLIS+LSVCA LA+L +GR++HA LVR QFD DVYVASVL++MY+KCG L 
Sbjct: 324 KQGVRPSFPSLISILSVCATLASLQYGRQVHAHLVRCQFDDDVYVASVLMTMYVKCGELV 383

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAK VFDRF+ KD++MWNSII+GYA HGLG EAL++FH+M  SG MP+ VT + +L+ACS
Sbjct: 384 KAKLVFDRFSSKDIIMWNSIISGYASHGLGEEALKIFHEMPSSGTMPNKVTLIAILTACS 443

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           Y G +++GLEIF SME+K+ V   +EHY+C VD+LGRAG++++AM+LIE M ++ DA +W
Sbjct: 444 YAGKLEEGLEIFESMESKFCVTPTVEHYSCTVDMLGRAGQVDKAMELIESMTIKPDATVW 503

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           GALLGAC+TH +LDLAEVAAKKL   EP NAG ++LLS+I AS+ +W DVA +R+NMR  
Sbjct: 504 GALLGACKTHSRLDLAEVAAKKLFENEPDNAGTYVLLSSINASRSKWGDVAVVRKNMRTN 563

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
            VSK+PGCSWI V KKVH FT G    HPE + I  +LE   GLLREAGY PD S VLHD
Sbjct: 564 NVSKFPGCSWIEVGKKVHMFTRGGIKNHPEQAMILMMLEKTDGLLREAGYSPDCSHVLHD 623

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKV SL  HSE+LAVAYGLLK+P G+PIRVMKNLRVCGDCHAAIKLI+KVT REII
Sbjct: 624 VDEEEKVDSLSRHSERLAVAYGLLKLPEGVPIRVMKNLRVCGDCHAAIKLISKVTEREII 683

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHF +G CSCRDYW
Sbjct: 684 LRDANRFHHFNNGECSCRDYW 704

BLAST of CsGy4G017810 vs. Swiss-Prot
Match: sp|Q56XI1|PPR25_ARATH (Pentatricopeptide repeat-containing protein At1g09410, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H18 PE=1 SV=2)

HSP 1 Score: 453.4 bits (1165), Expect = 4.7e-126
Identity = 220/382 (57.59%), Positives = 267/382 (69.90%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           ++G+RP FP+LIS+LSVCA LA+L HG+++HAQLVR QFD+DVYVASVL++MYIKCG L 
Sbjct: 324 KQGVRPTFPTLISILSVCASLASLHHGKQVHAQLVRCQFDVDVYVASVLMTMYIKCGELV 383

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSG-IMPDDVTFVGVLSAC 447
           K+K +FDRF  KD++MWNSII+GYA HGLG EAL+VF +M  SG   P++VTFV      
Sbjct: 384 KSKLIFDRFPSKDIIMWNSIISGYASHGLGEEALKVFCEMPLSGSTKPNEVTFVATXXXX 443

Query: 448 SYTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAII 507
                                                                +E DA +
Sbjct: 444 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTVEPDAAV 503

Query: 508 WGALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRD 567
           WG+LLGACRTH +LD+AE  AKKL+ +EP+N+G +ILLSN+YASQGRW DVAELR+ M+ 
Sbjct: 504 WGSLLGACRTHSQLDVAEFCAKKLIEIEPENSGTYILLSNMYASQGRWADVAELRKLMKT 563

Query: 568 RRVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLH 627
           R V K PGCSW  VE KVH FT G  + HPE   I +IL+ L GLLREAGY PD S+ LH
Sbjct: 564 RLVRKSPGCSWTEVENKVHAFTRGGINSHPEQESILKILDELDGLLREAGYNPDCSYALH 623

Query: 628 DVDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREI 687
           DVDEEEKV SL+YHSE+LAVAY LLK+  G+PIRVMKNLRVC DCH AIK+I+KV  REI
Sbjct: 624 DVDEEEKVNSLKYHSERLAVAYALLKLSEGIPIRVMKNLRVCSDCHTAIKIISKVKEREI 683

Query: 688 ILRDANRFHHFKDGSCSCRDYW 709
           ILRDANRFHHF++G CSC+DYW
Sbjct: 684 ILRDANRFHHFRNGECSCKDYW 705

BLAST of CsGy4G017810 vs. Swiss-Prot
Match: sp|Q9SY02|PP301_ARATH (Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H24 PE=3 SV=1)

HSP 1 Score: 429.1 bits (1102), Expect = 9.6e-119
Identity = 205/380 (53.95%), Positives = 262/380 (68.95%), Query Frame = 0

Query: 329 EGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLAK 388
           EG R N  S  S LS CA +  L+ G+++H +LV+  ++   +V + LL MY KCG++ +
Sbjct: 403 EGGRLNRSSFSSALSTCADVVALELGKQLHGRLVKGGYETGCFVGNALLLMYCKCGSIEE 462

Query: 389 AKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACSY 448
           A  +F   A KD+V WN++I GY++HG G  ALR F  M   G+ PDD T V VLSACS+
Sbjct: 463 ANDLFKEMAGKDIVSWNTMIAGYSRHGFGEVALRFFESMKREGLKPDDATMVAVLSACSH 522

Query: 449 TGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIWG 508
           TG V KG + F +M   Y V    +HYACMVDLLGRAG L +A +L++ MP E DA IWG
Sbjct: 523 TGLVDKGRQYFYTMTQDYGVMPNSQHYACMVDLLGRAGLLEDAHNLMKNMPFEPDAAIWG 582

Query: 509 ALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDRR 568
            LLGA R H   +LAE AA K+  +EP+N+G ++LLSN+YAS GRW DV +LR  MRD+ 
Sbjct: 583 TLLGASRVHGNTELAETAADKIFAMEPENSGMYVLLSNLYASSGRWGDVGKLRVRMRDKG 642

Query: 569 VSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHDV 628
           V K PG SWI ++ K H F+ GD   HPE  EI   LE L   +++AGY    S VLHDV
Sbjct: 643 VKKVPGYSWIEIQNKTHTFSVGDEF-HPEKDEIFAFLEELDLRMKKAGYVSKTSVVLHDV 702

Query: 629 DEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREIIL 688
           +EEEK + + YHSE+LAVAYG++++  G PIRV+KNLRVC DCH AIK +A++TGR IIL
Sbjct: 703 EEEEKERMVRYHSERLAVAYGIMRVSSGRPIRVIKNLRVCEDCHNAIKYMARITGRLIIL 762

Query: 689 RDANRFHHFKDGSCSCRDYW 709
           RD NRFHHFKDGSCSC DYW
Sbjct: 763 RDNNRFHHFKDGSCSCGDYW 781

BLAST of CsGy4G017810 vs. Swiss-Prot
Match: sp|Q9SHZ8|PP168_ARATH (Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H41 PE=3 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 4.8e-110
Identity = 187/380 (49.21%), Positives = 261/380 (68.68%), Query Frame = 0

Query: 330 GIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLAKA 389
           G RPN  +L ++LSV + LA+L HG++IH   V+S     V V++ L++MY K GN+  A
Sbjct: 408 GQRPNSYTLAAMLSVASSLASLSHGKQIHGSAVKSGEIYSVSVSNALITMYAKAGNITSA 467

Query: 390 KQVFDRF-AVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACSY 449
            + FD     +D V W S+I   AQHG   EAL +F  M   G+ PD +T+VGV SAC++
Sbjct: 468 SRAFDLIRCERDTVSWTSMIIALAQHGHAEEALELFETMLMEGLRPDHITYVGVFSACTH 527

Query: 450 TGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIWG 509
            G V +G + F+ M+   ++   + HYACMVDL GRAG L EA + IEKMP+E D + WG
Sbjct: 528 AGLVNQGRQYFDMMKDVDKIIPTLSHYACMVDLFGRAGLLQEAQEFIEKMPIEPDVVTWG 587

Query: 510 ALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDRR 569
           +LL ACR H  +DL +VAA++LL+LEP+N+G +  L+N+Y++ G+W++ A++R++M+D R
Sbjct: 588 SLLSACRVHKNIDLGKVAAERLLLLEPENSGAYSALANLYSACGKWEEAAKIRKSMKDGR 647

Query: 570 VSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHDV 629
           V K  G SWI V+ KVH F G +   HPE +EI   ++ +   +++ GY PD + VLHD+
Sbjct: 648 VKKEQGFSWIEVKHKVHVF-GVEDGTHPEKNEIYMTMKKIWDEIKKMGYVPDTASVLHDL 707

Query: 630 DEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREIIL 689
           +EE K Q L +HSEKLA+A+GL+  P    +R+MKNLRVC DCH AIK I+K+ GREII+
Sbjct: 708 EEEVKEQILRHHSEKLAIAFGLISTPDKTTLRIMKNLRVCNDCHTAIKFISKLVGREIIV 767

Query: 690 RDANRFHHFKDGSCSCRDYW 709
           RD  RFHHFKDG CSCRDYW
Sbjct: 768 RDTTRFHHFKDGFCSCRDYW 786
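
The identity and positive percentages in the HSP header above are the match counts divided by the alignment length. As a minimal sketch (the function name `hsp_percentages` is hypothetical and not part of any BLAST tool), the figures for this HSP can be rechecked as follows:

```python
import re


def hsp_percentages(stat_line):
    """Recompute percentages from 'Name = matches/length' pairs in an HSP header."""
    # Each pair looks like "Identity = 187/380"; the parenthesised percentage is ignored.
    pairs = re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", stat_line)
    return {name: round(100 * int(n) / int(d), 2) for name, n, d in pairs}


# Counts copied from the HSP against sp|Q9SHZ8|PP168_ARATH.
line = "Identity = 187/380 (49.21%), Positives = 261/380 (68.68%)"
print(hsp_percentages(line))
```

The recomputed values should agree with the percentages printed in parentheses; a mismatch would suggest the header line was damaged during extraction.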

BLAST of CsGy4G017810 vs. Swiss-Prot
Match: sp|Q9M4P3|PP316_ARATH (Pentatricopeptide repeat-containing protein At4g16835, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=DYW10 PE=2 SV=3)

HSP 1 Score: 388.3 bits (996), Expect = 1.9e-106
Identity = 192/380 (50.53%), Positives = 248/380 (65.26%), Query Frame = 0

Query: 329 EGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLAK 388
           EGIRPN   L S L  C+ L+ L  GR+IH  + +S    DV   + L+SMY KCG L  
Sbjct: 278 EGIRPNSSGLSSALLGCSELSALQLGRQIHQIVSKSTLCNDVTALTSLISMYCKCGELGD 337

Query: 389 AKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACSY 448
           A ++F+    KDVV WN++I+GYAQHG   +AL +F +M  + I PD +TFV VL AC++
Sbjct: 338 AWKLFEVMKKKDVVAWNAMISGYAQHGNADKALCLFREMIDNKIRPDWITFVAVLLACNH 397

Query: 449 TGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIWG 508
            G V  G+  F SM   Y+VE + +HY CMVDLLGRAGKL EA+ LI  MP    A ++G
Sbjct: 398 AGLVNIGMAYFESMVRDYKVEPQPDHYTCMVDLLGRAGKLEEALKLIRSMPFRPHAAVFG 457

Query: 509 ALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDRR 568
            LLGACR H  ++LAE AA+KLL L  +NA  ++ L+NIYAS+ RW+DVA +R+ M++  
Sbjct: 458 TLLGACRVHKNVELAEFAAEKLLQLNSQNAAGYVQLANIYASKNRWEDVARVRKRMKESN 517

Query: 569 VSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHDV 628
           V K PG SWI +  KVH F   D   HPE                  GY P+  F LH+V
Sbjct: 518 VVKVPGYSWIEIRNKVHHFRSSDRI-HPELXXXXXXXXXXXXXXXXXGYKPELEFALHNV 577

Query: 629 DEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREIIL 688
           +EE+K + L +HSEKLAVA+G +K+P G  I+V KNLR+CGDCH AIK I+++  REII+
Sbjct: 578 EEEQKEKLLLWHSEKLAVAFGCIKLPQGSQIQVFKNLRICGDCHKAIKFISEIEKREIIV 637

Query: 689 RDANRFHHFKDGSCSCRDYW 709
           RD  RFHHFKDGSCSC DYW
Sbjct: 638 RDTTRFHHFKDGSCSCGDYW 656

BLAST of CsGy4G017810 vs. TrEMBL
Match: tr|A0A0A0L0T3|A0A0A0L0T3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G418570 PE=4 SV=1)

HSP 1 Score: 785.8 bits (2028), Expect = 8.1e-224
Identity = 381/381 (100.00%), Positives = 381/381 (100.00%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA
Sbjct: 394 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 453

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS
Sbjct: 454 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 513

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW
Sbjct: 514 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 573

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR
Sbjct: 574 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 633

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
           RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD
Sbjct: 634 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 693

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII
Sbjct: 694 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 753

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHFKDGSCSCRDYW
Sbjct: 754 LRDANRFHHFKDGSCSCRDYW 774

BLAST of CsGy4G017810 vs. TrEMBL
Match: tr|A0A1S3C183|A0A1S3C183_CUCME (pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like OS=Cucumis melo OX=3656 GN=LOC103495864 PE=4 SV=1)

HSP 1 Score: 767.7 bits (1981), Expect = 2.3e-218
Identity = 372/381 (97.64%), Positives = 374/381 (98.16%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA
Sbjct: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAKQVFDRFAVKDVVMWNSIITGYAQHGLG EALRVFHDMHFSGIMPDD+TFVGVLSACS
Sbjct: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGEEALRVFHDMHFSGIMPDDITFVGVLSACS 447

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           YTGNVKKGLEIF SMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW
Sbjct: 448 YTGNVKKGLEIFYSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           GALLGACRTHMKLDLAEVAAKKLL LEPKNAGPFILLSNIYASQGRW DVAELRRNMRDR
Sbjct: 508 GALLGACRTHMKLDLAEVAAKKLLYLEPKNAGPFILLSNIYASQGRWGDVAELRRNMRDR 567

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
            VSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD
Sbjct: 568 HVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKVQSLEYHSEKLAVAYGL KIP GMPIRVMKNLRVCGDCHAAIKLIAKVTGREII
Sbjct: 628 VDEEEKVQSLEYHSEKLAVAYGLFKIPKGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHFKDGSCSC+DYW
Sbjct: 688 LRDANRFHHFKDGSCSCQDYW 708

BLAST of CsGy4G017810 vs. TrEMBL
Match: tr|A0A2I4EAU2|A0A2I4EAU2_9ROSI (pentatricopeptide repeat-containing protein At1g56690, mitochondrial OS=Juglans regia OX=51240 GN=LOC108987906 PE=4 SV=1)

HSP 1 Score: 639.8 bits (1649), Expect = 7.2e-180
Identity = 303/381 (79.53%), Positives = 340/381 (89.24%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           REGI PNFPSLISVL VC+ LA+LDHG+++HA+LVRS +D DVYV+SVL++MY+KCG+L 
Sbjct: 323 REGISPNFPSLISVLCVCSSLASLDHGKQVHARLVRSHYDRDVYVSSVLITMYVKCGDLV 382

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAK VFDRFA KD+VMWNSIITGYAQHGLG EAL+VFH+M    I+PDDVTF+GVLSACS
Sbjct: 383 KAKLVFDRFAPKDIVMWNSIITGYAQHGLGEEALQVFHEMCSLNILPDDVTFIGVLSACS 442

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           YTG V +G EIF SM +KYQVE   EHYACMVDLLGRAG++NEAM+LIE MP+EADAI+W
Sbjct: 443 YTGKVHEGHEIFESMNSKYQVEPTTEHYACMVDLLGRAGQVNEAMNLIENMPVEADAIVW 502

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           GALLGACRTHMKLDLAEVAAKKL+ LEPKN+GP+ILLSNIYAS+G+W DVA+LR+ MR R
Sbjct: 503 GALLGACRTHMKLDLAEVAAKKLVQLEPKNSGPYILLSNIYASKGKWHDVADLRKKMRAR 562

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
           RVSK PGCSWI VEKKVH FTGGDS GHPE   I R+LE LSG+LREAGY PD +FVLHD
Sbjct: 563 RVSKSPGCSWIEVEKKVHMFTGGDSMGHPEQEIIMRMLERLSGMLREAGYCPDGTFVLHD 622

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKVQSL YHSEKLAV YGLLK+P GMPIRVMKNLRVCGDCH+AIKLIAK+TGREII
Sbjct: 623 VDEEEKVQSLGYHSEKLAVVYGLLKLPEGMPIRVMKNLRVCGDCHSAIKLIAKITGREII 682

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHFKDG CSCRDYW
Sbjct: 683 LRDANRFHHFKDGLCSCRDYW 703

BLAST of CsGy4G017810 vs. TrEMBL
Match: tr|A0A2P5D016|A0A2P5D016_9ROSA (DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_267430 PE=4 SV=1)

HSP 1 Score: 639.8 bits (1649), Expect = 7.2e-180
Identity = 307/381 (80.58%), Positives = 335/381 (87.93%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           R  +RPNFPSLISVLSVCA LA+LDHGRE+H QLVRSQFD DVYV+SVL++MY+KCGNL 
Sbjct: 323 RHRVRPNFPSLISVLSVCASLASLDHGREVHGQLVRSQFDHDVYVSSVLITMYVKCGNLV 382

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAK VFD FA KDVVMWNSIITGYAQHGLG EAL++FH+M   G+ PDD+TF+G+LSACS
Sbjct: 383 KAKLVFDSFAAKDVVMWNSIITGYAQHGLGEEALQIFHEMCSLGLAPDDITFIGLLSACS 442

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           YTG V +GLEIF SM+ KY VE + EHYACMVDLLGRAGK+NEAM+LIEKMPMEADAI+W
Sbjct: 443 YTGKVVEGLEIFESMKCKYLVEPRTEHYACMVDLLGRAGKVNEAMNLIEKMPMEADAIVW 502

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           G+LLGACRTHMKLDLAEVAAKKLL L+P+NAGP ILLSNIYAS+ RW DV ELR NMR R
Sbjct: 503 GSLLGACRTHMKLDLAEVAAKKLLQLDPRNAGPCILLSNIYASKSRWRDVEELRENMRAR 562

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
            VSK PGCSWI VEKKVH FTGGDS GHPEH  I R+LE L GLLREAGY PD +FVLHD
Sbjct: 563 SVSKSPGCSWIEVEKKVHMFTGGDSMGHPEHPMILRMLERLGGLLREAGYCPDGTFVLHD 622

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKV SL YHSEKLAVAYGLLK+P  MPIRVMKNLRVCGDCH+AIKLIAKVT REII
Sbjct: 623 VDEEEKVHSLRYHSEKLAVAYGLLKLPEPMPIRVMKNLRVCGDCHSAIKLIAKVTRREII 682

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHFKDG CSCRDYW
Sbjct: 683 LRDANRFHHFKDGLCSCRDYW 703

BLAST of CsGy4G017810 vs. TrEMBL
Match: tr|A0A1Q3BZV9|A0A1Q3BZV9_CEPFO (PPR domain-containing protein/PPR_2 domain-containing protein/DYW_deaminase domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_16864 PE=4 SV=1)

HSP 1 Score: 639.4 bits (1648), Expect = 9.4e-180
Identity = 301/381 (79.00%), Positives = 341/381 (89.50%), Query Frame = 0

Query: 328 REGIRPNFPSLISVLSVCAGLANLDHGREIHAQLVRSQFDLDVYVASVLLSMYIKCGNLA 387
           R+ +RPNFP +ISVLSVC+ LA+LDHGR++HAQLVR QFD+DVYV+SVL++MYIKCG+LA
Sbjct: 323 RQRVRPNFPCMISVLSVCSSLASLDHGRQVHAQLVRFQFDVDVYVSSVLITMYIKCGDLA 382

Query: 388 KAKQVFDRFAVKDVVMWNSIITGYAQHGLGVEALRVFHDMHFSGIMPDDVTFVGVLSACS 447
           KAKQVF+R  +KDVV+WNSIITGYAQHGL  E L+VFH+M FSGIMPD VTFVGVL+ACS
Sbjct: 383 KAKQVFNRVPLKDVVIWNSIITGYAQHGLAEEVLQVFHEMSFSGIMPDKVTFVGVLTACS 442

Query: 448 YTGNVKKGLEIFNSMETKYQVEQKIEHYACMVDLLGRAGKLNEAMDLIEKMPMEADAIIW 507
           YTG +K+G +IF SM++KY VE   EHYACMVDLLGRAG++NEAM LIE M +EADAI+W
Sbjct: 443 YTGKIKEGRQIFESMKSKYLVEPGAEHYACMVDLLGRAGQVNEAMSLIETMLVEADAIVW 502

Query: 508 GALLGACRTHMKLDLAEVAAKKLLVLEPKNAGPFILLSNIYASQGRWDDVAELRRNMRDR 567
           GALLGACR HMK+DLAEVAAKKLL +EPKNAGP+ILLSN+YASQGRWDDVAELR+ M +R
Sbjct: 503 GALLGACRIHMKMDLAEVAAKKLLQIEPKNAGPYILLSNLYASQGRWDDVAELRKTMSER 562

Query: 568 RVSKYPGCSWIVVEKKVHKFTGGDSSGHPEHSEINRILEWLSGLLREAGYYPDQSFVLHD 627
            VSK PGCSW+ V KKVH FTGGDS+GHPEH  I R+LE + GLLREAGY+PD SFVLHD
Sbjct: 563 AVSKSPGCSWLEVGKKVHMFTGGDSAGHPEHERIIRMLEKIVGLLREAGYFPDGSFVLHD 622

Query: 628 VDEEEKVQSLEYHSEKLAVAYGLLKIPIGMPIRVMKNLRVCGDCHAAIKLIAKVTGREII 687
           VDEEEKVQSL YHSEKLAVAYGLLK+P G PIRVMKNLRVCGDCH+AIKLIAKVTGREII
Sbjct: 623 VDEEEKVQSLRYHSEKLAVAYGLLKVPDGTPIRVMKNLRVCGDCHSAIKLIAKVTGREII 682

Query: 688 LRDANRFHHFKDGSCSCRDYW 709
           LRDANRFHHFKDG CSCRDYW
Sbjct: 683 LRDANRFHHFKDGLCSCRDYW 703

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
KGN54664.1      1.2e-223  100.00    hypothetical protein Csa_4G418570 [Cucumis sativus]
XP_011653842.1  1.2e-223  100.00    PREDICTED: pentatricopeptide repeat-containing protein At1g56690, mitochondrial-...
XP_008455763.1  3.5e-218  97.64     PREDICTED: pentatricopeptide repeat-containing protein At1g56690, mitochondrial-...
XP_022158877.1  5.7e-205  90.55     pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like isofor...
XP_023517429.1  1.1e-203  90.26     pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like [Cucur...
Match Name   E-value   Identity  Description
AT1G56690.1  1.9e-157  67.72     Pentatricopeptide repeat (PPR) superfamily protein
AT1G09410.1  2.6e-127  57.59     pentatricopeptide (PPR) repeat-containing protein
AT4G02750.1  5.3e-120  53.95     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G22070.1  2.6e-111  49.21     pentatricopeptide (PPR) repeat-containing protein
AT4G16835.1  1.0e-107  50.53     Tetratricopeptide repeat (TPR)-like superfamily protein
Match Name             E-value   Identity  Description
sp|Q9FXB9|PPR84_ARATH  3.4e-156  67.72     Pentatricopeptide repeat-containing protein At1g56690, mitochondrial OS=Arabidop...
sp|Q56XI1|PPR25_ARATH  4.7e-126  57.59     Pentatricopeptide repeat-containing protein At1g09410, mitochondrial OS=Arabidop...
sp|Q9SY02|PP301_ARATH  9.6e-119  53.95     Pentatricopeptide repeat-containing protein At4g02750 OS=Arabidopsis thaliana OX...
sp|Q9SHZ8|PP168_ARATH  4.8e-110  49.21     Pentatricopeptide repeat-containing protein At2g22070 OS=Arabidopsis thaliana OX...
sp|Q9M4P3|PP316_ARATH  1.9e-106  50.53     Pentatricopeptide repeat-containing protein At4g16835, mitochondrial OS=Arabidop...
Match Name                      E-value   Identity  Description
tr|A0A0A0L0T3|A0A0A0L0T3_CUCSA  8.1e-224  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G418570 PE=4 SV=1
tr|A0A1S3C183|A0A1S3C183_CUCME  2.3e-218  97.64     pentatricopeptide repeat-containing protein At1g56690, mitochondrial-like OS=Cuc...
tr|A0A2I4EAU2|A0A2I4EAU2_9ROSI  7.2e-180  79.53     pentatricopeptide repeat-containing protein At1g56690, mitochondrial OS=Juglans ...
tr|A0A2P5D016|A0A2P5D016_9ROSA  7.2e-180  80.58     DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_267430 ...
tr|A0A1Q3BZV9|A0A1Q3BZV9_CEPFO  9.4e-180  79.00     PPR domain-containing protein/PPR_2 domain-containing protein/DYW_deaminase doma...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR032867  DYW_dom
IPR002885  Pentatricopeptide_repeat
IPR011990  TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0008270      zinc ion binding
molecular_function  GO:0005515      protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CsGy4G017810.1  CsGy4G017810.1  mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR Term  IPR Description  Source  Source Term  Source Description  Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  G3DSA:1.25.40.10
    coord: 335..444  e-value: 1.6E-23  score: 84.9
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  G3DSA:1.25.40.10
    coord: 113..174  e-value: 2.3E-16  score: 62.0
    coord: 175..266  e-value: 2.6E-24  score: 88.1
    coord: 17..112  e-value: 1.4E-21  score: 79.1
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D  G3DSA:1.25.40.10
    coord: 267..329  e-value: 2.6E-7  score: 32.3
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  SSF48452  TPR-like
    coord: 521..558
    coord: 31..116
    coord: 269..305
IPR002885  Pentatricopeptide repeat  PFAM  PF13041  PPR_2
    coord: 399..446  e-value: 2.2E-12  score: 46.8
IPR002885  Pentatricopeptide repeat  TIGRFAM  TIGR00756  TIGR00756
    coord: 208..237  e-value: 2.2E-4  score: 19.2
    coord: 402..435  e-value: 1.1E-7  score: 29.5
    coord: 53..82  e-value: 4.9E-4  score: 18.1
    coord: 302..334  e-value: 4.7E-6  score: 24.4
    coord: 84..115  e-value: 6.3E-7  score: 27.2
    coord: 270..299  e-value: 3.7E-6  score: 24.8
    coord: 180..208  e-value: 9.3E-6  score: 23.5
    coord: 25..52  e-value: 1.7E-5  score: 22.7
    coord: 239..265  e-value: 9.4E-5  score: 20.3
    coord: 115..146  e-value: 1.7E-5  score: 22.7
    coord: 146..177  e-value: 1.6E-6  score: 25.9
    coord: 475..500  e-value: 0.0022  score: 16.0
IPR002885  Pentatricopeptide repeat  PFAM  PF01535  PPR
    coord: 374..395  e-value: 0.58  score: 10.4
    coord: 208..237  e-value: 1.4E-5  score: 24.9
    coord: 146..175  e-value: 2.7E-6  score: 27.2
    coord: 115..145  e-value: 1.6E-7  score: 31.0
    coord: 270..298  e-value: 2.3E-5  score: 24.2
    coord: 53..82  e-value: 1.5E-5  score: 24.8
    coord: 475..499  e-value: 0.0065  score: 16.5
    coord: 25..52  e-value: 1.7E-5  score: 24.6
    coord: 239..265  e-value: 4.2E-6  score: 26.5
    coord: 177..207  e-value: 6.6E-7  score: 29.1
    coord: 302..331  e-value: 3.3E-6  score: 26.9
    coord: 84..114  e-value: 6.3E-7  score: 29.1
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 117..143  score: 6.993
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 471..501  score: 7.826
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 82..116  score: 11.762
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 435..465  score: 7.739
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 20..50  score: 8.287
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 400..434  score: 11.893
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 272..298  score: 6.665
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 237..271  score: 10.983
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 537..571  score: 7.388
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 369..399  score: 7.322
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 299..333  score: 11.904
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 144..178  score: 11.192
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 51..81  score: 9.964
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 206..236  score: 9.635
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 334..368  score: 5.36
IPR002885  Pentatricopeptide repeat  PROSITE  PS51375  PPR
    coord: 180..205  score: 6.566
IPR032867  DYW domain  PFAM  PF14432  DYW_deaminase
    coord: 573..698  e-value: 3.5E-40  score: 136.7
None  No IPR available  PANTHER  PTHR24015:SF553  SUBFAMILY NOT NAMED
    coord: 269..620
    coord: 225..271
    coord: 22..64
    coord: 62..232
None  No IPR available  PANTHER  PTHR24015  FAMILY NOT NAMED
    coord: 269..620
    coord: 225..271
    coord: 22..64
    coord: 62..232
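
Each `coord: start..end` entry above gives the span of a domain hit along the protein. As a rough sketch, assuming the coordinates are 1-based and inclusive (an assumption about this page's convention, not something it states), the length of a hit can be worked out as follows; `coord_length` is a hypothetical helper name:

```python
import re


def coord_length(entry):
    """Return the number of residues covered by a 'coord: start..end' entry."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", entry)
    if match is None:
        raise ValueError(f"no coordinate range found in {entry!r}")
    start, end = int(match.group(1)), int(match.group(2))
    # Assumed 1-based inclusive coordinates, hence the +1.
    return end - start + 1


# The PF14432 (DYW_deaminase) hit listed above.
print(coord_length("coord: 573..698"))
```

Under that assumption the DYW domain hit covers 126 residues near the C-terminus of the 709-residue query shown in the alignments.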