CsaV3_1G002680 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_1G002680
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein
Location: chr1 : 1668697 .. 1670520 (+)
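
As a quick consistency check on the location above, the span length can be computed directly. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, but not stated on this page); `span_length` is a hypothetical helper name.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


gene_len = span_length(1668697, 1670520)
# Complete codons in the span, minus one for the terminal stop codon.
# This only equals the protein length if the gene has no intron (assumed here).
protein_len = gene_len // 3 - 1
print(gene_len, protein_len)
```

Under those assumptions the span is 1824 bp, or 608 codons including the stop, which agrees with the 607-residue query length reported in the first BLAST hit (607/607).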
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAATGTATAACCGGCTTCTGCCTTTCTCGTATAGAATTATTCGAAGGTCTCGGGTCCAACAAGAAATTTGTACAATCTCGAACTTGGATTTTTTAGAATCAGAAATGTTGAAATTTGTACACACCCAAGCAATGGATCTTCCGTTTCAGGCAACTAACGGTAGCAAAATTCCTGATTACAATGACGTGCGAAGAGGGCATTTCCTCATGAAACTCATAGACGACTCTGTTTCGCGTAATGGGTTCGAATCTATTGCTCGTATTTTCTCTAAGTATCGTGGTTCTATCAATTCTCAACAGTGTAACTCGATGATCAGGACTTATTTGGATTTGAATAAGCATTTAAATTCTCTGTACATTTTTGCCCTTATGCATAAGTTTAGTATTCTGCCCGATTCATCCACTTTTCCTGCTGTTCTTAAAGCAACTGCGCAGCTATGTGATACTGGAGTTGGAAAAATGATACATGGTATTGTTATTCAGATGGGTTTTATTTGTGATGTCTACACAAGTACCGCTTTAGTTCATCTGTATTGTACTTGTTTGTCTATATCTGATGCTTCTCAGTTGTTCGACGAAATGCCCGAGAGAAATGCAGTTACTTGGAATGCTTTGATTACTGGTTATACTCATAATAGAAAGTTTGTGAAAGCTATCGATGCTTTTCGAGGAATGTTGGCAGATGGGGCTCAACCGAGTGAGAGAACCGTGGTTGTAGTTCTATCGGCTTGTTCTCATTTGGGAGCTTTTAATCAGGGAAAGTGGATCCATGAGTTTATTTATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCTCTTATTGATATGTATGCTAAATGTGGGGCTGTTTATGAGGTCGAGAAGGTCTTCGAAGAAATTAGAGAGAAGAACGTGTATACATGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAAATTTCAAGCCAGATGAGGTTACCTTTCTAGGTGTTTTGTGTGCATGCTGTCACCAAGGTCTGGTAACGGAAGGGCGCTGGCAATTCATGAGCATGAAACAACAGTTTGGACTGCAACCAAGGATAGAGCATTATGGATGTATGGTAGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTAGAGTTGATCCAATCCATGAGCATAGAGCCAGACCCTATCATTTGGAGGGCTTTGCTTTGTGCTTGCAGAGTCCATGGGAATACGAAATTGGGTGAATATATTATCAAAAGACTTATAGAACTAGAACCAAACAATGGGGAAAATTATGTCTTGCTGTCAAATATATACTCAAGAGAACGACGGTGGGCTGAAGTAGGGAAGTTGAGGGGAATGATGAATCTAAGAGGGATCAGAAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAACGTAGTTTATGAGTTTGTTGCATCAAATGACAGAAAACCAGAATTTGAGGCAATATACAAGCAGTTGGACAATTTGATTAAGAAATTGAAAGAAAATGGTTATGTTACAGGCACGGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAGCATTCTGTGATGTACCATAGTGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTAAGAATTTGCTTGGACTGCCACGAGTTTTTCAAGGTTTTATCACTCGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTCCACCATTTTTATGAAGGTTTCTGTTCGTGCCGAGACTATTGGTGA

mRNA sequence

ATGAAAATGTATAACCGGCTTCTGCCTTTCTCGTATAGAATTATTCGAAGGTCTCGGGTCCAACAAGAAATTTGTACAATCTCGAACTTGGATTTTTTAGAATCAGAAATGTTGAAATTTGTACACACCCAAGCAATGGATCTTCCGTTTCAGGCAACTAACGGTAGCAAAATTCCTGATTACAATGACGTGCGAAGAGGGCATTTCCTCATGAAACTCATAGACGACTCTGTTTCGCGTAATGGGTTCGAATCTATTGCTCGTATTTTCTCTAAGTATCGTGGTTCTATCAATTCTCAACAGTGTAACTCGATGATCAGGACTTATTTGGATTTGAATAAGCATTTAAATTCTCTGTACATTTTTGCCCTTATGCATAAGTTTAGTATTCTGCCCGATTCATCCACTTTTCCTGCTGTTCTTAAAGCAACTGCGCAGCTATGTGATACTGGAGTTGGAAAAATGATACATGGTATTGTTATTCAGATGGGTTTTATTTGTGATGTCTACACAAGTACCGCTTTAGTTCATCTGTATTGTACTTGTTTGTCTATATCTGATGCTTCTCAGTTGTTCGACGAAATGCCCGAGAGAAATGCAGTTACTTGGAATGCTTTGATTACTGGTTATACTCATAATAGAAAGTTTGTGAAAGCTATCGATGCTTTTCGAGGAATGTTGGCAGATGGGGCTCAACCGAGTGAGAGAACCGTGGTTGTAGTTCTATCGGCTTGTTCTCATTTGGGAGCTTTTAATCAGGGAAAGTGGATCCATGAGTTTATTTATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCTCTTATTGATATGTATGCTAAATGTGGGGCTGTTTATGAGGTCGAGAAGGTCTTCGAAGAAATTAGAGAGAAGAACGTGTATACATGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAAATTTCAAGCCAGATGAGGTTACCTTTCTAGGTGTTTTGTGTGCATGCTGTCACCAAGGTCTGGTAACGGAAGGGCGCTGGCAATTCATGAGCATGAAACAACAGTTTGGACTGCAACCAAGGATAGAGCATTATGGATGTATGGTAGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTAGAGTTGATCCAATCCATGAGCATAGAGCCAGACCCTATCATTTGGAGGGCTTTGCTTTGTGCTTGCAGAGTCCATGGGAATACGAAATTGGGTGAATATATTATCAAAAGACTTATAGAACTAGAACCAAACAATGGGGAAAATTATGTCTTGCTGTCAAATATATACTCAAGAGAACGACGGTGGGCTGAAGTAGGGAAGTTGAGGGGAATGATGAATCTAAGAGGGATCAGAAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAACGTAGTTTATGAGTTTGTTGCATCAAATGACAGAAAACCAGAATTTGAGGCAATATACAAGCAGTTGGACAATTTGATTAAGAAATTGAAAGAAAATGGTTATGTTACAGGCACGGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAGCATTCTGTGATGTACCATAGTGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTAAGAATTTGCTTGGACTGCCACGAGTTTTTCAAGGTTTTATCACTCGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTCCACCATTTTTATGAAGGTTTCTGTTCGTGCCGAGACTATTGGTGA

Coding sequence (CDS)

ATGAAAATGTATAACCGGCTTCTGCCTTTCTCGTATAGAATTATTCGAAGGTCTCGGGTCCAACAAGAAATTTGTACAATCTCGAACTTGGATTTTTTAGAATCAGAAATGTTGAAATTTGTACACACCCAAGCAATGGATCTTCCGTTTCAGGCAACTAACGGTAGCAAAATTCCTGATTACAATGACGTGCGAAGAGGGCATTTCCTCATGAAACTCATAGACGACTCTGTTTCGCGTAATGGGTTCGAATCTATTGCTCGTATTTTCTCTAAGTATCGTGGTTCTATCAATTCTCAACAGTGTAACTCGATGATCAGGACTTATTTGGATTTGAATAAGCATTTAAATTCTCTGTACATTTTTGCCCTTATGCATAAGTTTAGTATTCTGCCCGATTCATCCACTTTTCCTGCTGTTCTTAAAGCAACTGCGCAGCTATGTGATACTGGAGTTGGAAAAATGATACATGGTATTGTTATTCAGATGGGTTTTATTTGTGATGTCTACACAAGTACCGCTTTAGTTCATCTGTATTGTACTTGTTTGTCTATATCTGATGCTTCTCAGTTGTTCGACGAAATGCCCGAGAGAAATGCAGTTACTTGGAATGCTTTGATTACTGGTTATACTCATAATAGAAAGTTTGTGAAAGCTATCGATGCTTTTCGAGGAATGTTGGCAGATGGGGCTCAACCGAGTGAGAGAACCGTGGTTGTAGTTCTATCGGCTTGTTCTCATTTGGGAGCTTTTAATCAGGGAAAGTGGATCCATGAGTTTATTTATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCTCTTATTGATATGTATGCTAAATGTGGGGCTGTTTATGAGGTCGAGAAGGTCTTCGAAGAAATTAGAGAGAAGAACGTGTATACATGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAAATTTCAAGCCAGATGAGGTTACCTTTCTAGGTGTTTTGTGTGCATGCTGTCACCAAGGTCTGGTAACGGAAGGGCGCTGGCAATTCATGAGCATGAAACAACAGTTTGGACTGCAACCAAGGATAGAGCATTATGGATGTATGGTAGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTAGAGTTGATCCAATCCATGAGCATAGAGCCAGACCCTATCATTTGGAGGGCTTTGCTTTGTGCTTGCAGAGTCCATGGGAATACGAAATTGGGTGAATATATTATCAAAAGACTTATAGAACTAGAACCAAACAATGGGGAAAATTATGTCTTGCTGTCAAATATATACTCAAGAGAACGACGGTGGGCTGAAGTAGGGAAGTTGAGGGGAATGATGAATCTAAGAGGGATCAGAAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAACGTAGTTTATGAGTTTGTTGCATCAAATGACAGAAAACCAGAATTTGAGGCAATATACAAGCAGTTGGACAATTTGATTAAGAAATTGAAAGAAAATGGTTATGTTACAGGCACGGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAGCATTCTGTGATGTACCATAGTGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTAAGAATTTGCTTGGACTGCCACGAGTTTTTCAAGGTTTTATCACTCGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTCCACCATTTTTATGAAGGTTTCTGTTCGTGCCGAGACTATTGGTGA

Protein sequence

MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPDYNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW
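
The relationship between the coding sequence and the protein sequence above can be illustrated with a small translator. This is a sketch, assuming the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are illustrative names, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAAATGTATAACCGG"))  # first six codons -> MKMYNR
print(translate("CGAGACTATTGGTGA"))     # last five codons -> RDYW
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above, provided the standard-code assumption holds for this nuclear gene.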
BLAST of CsaV3_1G002680 vs. NCBI nr
Match: XP_004138309.2 (PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis sativus] >KGN63701.1 hypothetical protein Csa_1G011530 [Cucumis sativus])

HSP 1 Score: 1243.8 bits (3217), Expect = 0.0e+00
Identity = 607/607 (100.00%), Positives = 607/607 (100.00%), Query Frame = 0

Query: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60
           MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD
Sbjct: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60

Query: 61  YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY 120
           YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY
Sbjct: 61  YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY 120

Query: 121 IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC 180
           IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC
Sbjct: 121 IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC 180

Query: 181 TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV 240
           TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV
Sbjct: 181 TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV 240

Query: 241 VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN 300
           VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN
Sbjct: 241 VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN 300

Query: 301 VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM 360
           VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM
Sbjct: 301 VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM 360

Query: 361 SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 420
           SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT
Sbjct: 361 SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 420

Query: 421 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 480
           KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI
Sbjct: 421 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 480

Query: 481 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS 540
           NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS
Sbjct: 481 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS 540

Query: 541 EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF 600
           EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF
Sbjct: 541 EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF 600

Query: 601 CSCRDYW 607
           CSCRDYW
Sbjct: 601 CSCRDYW 607
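
The HSP header reports both a bit score and, in parentheses, a raw score. The two are related by the Karlin-Altschul normalisation, sketched below. The values of `LAMBDA` and `K` are assumptions: they are the parameters commonly quoted for gapped BLOSUM62 searches with gap costs 11/1, and the page does not state which scoring system was actually used.

```python
import math

# Assumed statistical parameters (gapped BLOSUM62, gap open 11, extend 1).
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


print(round(bit_score(3217), 1))  # raw score of the first hit
print(round(bit_score(2837), 1))  # raw score of the second hit
```

With these assumed parameters the results land close to the reported 1243.8 and 1097.4 bits, which suggests (but does not prove) that the default protein scoring settings were used.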

BLAST of CsaV3_1G002680 vs. NCBI nr
Match: XP_016901378.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis melo])

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 531/562 (94.48%), Positives = 547/562 (97.33%), Query Frame = 0

Query: 46  MDLPFQATNGSKIPDYNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSM 105
           MDLPFQ TN  K PDYNDVRRGHF+MKLIDDSVS NGFESIARIFSKYRGSINSQQCNSM
Sbjct: 1   MDLPFQETNDRKTPDYNDVRRGHFVMKLIDDSVSHNGFESIARIFSKYRGSINSQQCNSM 60

Query: 106 IRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGF 165
           IR YLDLNKHLNSLYIFA MHKFSILPD STFPAVLKATAQLCDT VGKMIHGIVIQMGF
Sbjct: 61  IRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQLCDTEVGKMIHGIVIQMGF 120

Query: 166 ICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRG 225
           ICDVYTSTALVH+Y TCLSISDASQ+FDEM ERNAVTWNALITGYTHNRKF++AIDAFRG
Sbjct: 121 ICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALITGYTHNRKFMEAIDAFRG 180

Query: 226 MLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGA 285
           MLA GAQPSERTVV+VLSACSHLGA NQGKWIH+FIYHNRLRLNVFVGTALIDMYAKCGA
Sbjct: 181 MLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLRLNVFVGTALIDMYAKCGA 240

Query: 286 VYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA 345
           V EVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA
Sbjct: 241 VDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA 300

Query: 346 CCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPI 405
           CCHQGLVTEGR QFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMS+EPDPI
Sbjct: 301 CCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSMEPDPI 360

Query: 406 IWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMN 465
           IWRALLCACRVHGNTKLGEYI+KRL+ELEPNNGENYVLLSNIY+RERRWAEVGKLRGMMN
Sbjct: 361 IWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNIYARERRWAEVGKLRGMMN 420

Query: 466 LRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALY 525
           LRGIRKVPGCSSIEINNVVYEFVASNDRKPE+EAIYKQLDNLIKKLKENGYVTGTDMALY
Sbjct: 421 LRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNLIKKLKENGYVTGTDMALY 480

Query: 526 DIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYI 585
           D+EKEEKEHS+MYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV+SLVYKRYI
Sbjct: 481 DVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYI 540

Query: 586 VVRDRNRFHHFYEGFCSCRDYW 607
           VVRDRNRFHHF+EGFCSCRDYW
Sbjct: 541 VVRDRNRFHHFFEGFCSCRDYW 562
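
The percentages in the HSP header are simple ratios over the alignment length. A minimal sketch using the counts from the Cucumis melo hit above; `percent` is a hypothetical helper name.

```python
def percent(matches: int, aligned: int) -> float:
    """Share of aligned columns that match, as a percentage with 2 decimals."""
    return round(100.0 * matches / aligned, 2)


print(percent(531, 562))  # identical residues -> 94.48
print(percent(547, 562))  # identical or conservatively substituted -> 97.33
```

The denominator is the number of alignment columns (562 here), not the full query length of 607, so a short but perfect HSP can still report 100%.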

BLAST of CsaV3_1G002680 vs. NCBI nr
Match: XP_023529316.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1054.7 bits (2726), Expect = 1.2e-304
Identity = 521/618 (84.30%), Positives = 561/618 (90.78%), Query Frame = 0

Query: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFL--ESEMLKFVHTQAMDLPFQATNGSKI 60
           MKM  R LPFS+R+IRR+R+ Q+ CTISNLDFL  +S++ +FVHT+ M+LP Q     KI
Sbjct: 1   MKMDLRFLPFSFRLIRRARL-QDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKI 60

Query: 61  PDYNDVRR---------GHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTY 120
           PD  D RR         G+FLMKLI+DSVS NGFESIA IFSK+RGSINSQ CNSMIR Y
Sbjct: 61  PDCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGY 120

Query: 121 LDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDV 180
           LDLN+HLNSL IFA MHKFSILPDSSTFPAVLKATAQLCD  +GKMIHG V+QMGFI DV
Sbjct: 121 LDLNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDV 180

Query: 181 YTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLAD 240
           YTSTALVH+YC+CLSISDASQLFDEMPERN+VTWNALITGYTHNRKF +AI+AFRGMLA 
Sbjct: 181 YTSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFKEAINAFRGMLAA 240

Query: 241 GAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEV 300
           GA+PSERTVVVVLSACSHLGA NQGKWIH+FIY N+LRLNVFVGTALIDMYAKCG V E 
Sbjct: 241 GAEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEA 300

Query: 301 EKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQ 360
           EKVFEEIR+KNVYTWNVLISGY MNGQGDAALQAFSRMLMENFKPD VTFLG+LCACCHQ
Sbjct: 301 EKVFEEIRDKNVYTWNVLISGYGMNGQGDAALQAFSRMLMENFKPDAVTFLGLLCACCHQ 360

Query: 361 GLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRA 420
           GLVTEGR QF+SMKQQFGLQP+IEHYGCMVDLLGRAGLLEEALELI+SMS+EPDPIIWRA
Sbjct: 361 GLVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRA 420

Query: 421 LLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGI 480
           LLCACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+YSRERRW EVGKLRGMM+LRGI
Sbjct: 421 LLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGI 480

Query: 481 RKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEK 540
            KVPGCSSIEINN V+EF ASNDRK EF AIYKQLDN++KKLKENGYVTGTDM+L+DIEK
Sbjct: 481 EKVPGCSSIEINNSVHEFTASNDRKLEFNAIYKQLDNVMKKLKENGYVTGTDMSLFDIEK 540

Query: 541 EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRD 600
           EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKV+SLVYKRYIVVRD
Sbjct: 541 EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRD 600

Query: 601 RNRFHHFYEGFCSCRDYW 608
           RNRFHHF EG CSCRDYW
Sbjct: 601 RNRFHHFSEGVCSCRDYW 617

BLAST of CsaV3_1G002680 vs. NCBI nr
Match: XP_022925029.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata])

HSP 1 Score: 1050.0 bits (2714), Expect = 3.0e-303
Identity = 518/618 (83.82%), Positives = 560/618 (90.61%), Query Frame = 0

Query: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFL--ESEMLKFVHTQAMDLPFQATNGSKI 60
           MKM  RLLPFS+R+IRR+R+ Q+ CTISNLDFL  +S++ +FVHT+ M+LP Q     KI
Sbjct: 1   MKMDLRLLPFSFRLIRRARL-QDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKI 60

Query: 61  PDYNDVRR---------GHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTY 120
           PD  D RR         G+FLMKLI+DSVS NGFESIA IFSK+RGSINSQ CNSMIR Y
Sbjct: 61  PDCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGY 120

Query: 121 LDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDV 180
           LD N+HLNSL IFA MHKFSILPDSSTFPAVLKATAQLCD  +GKMIHG V+QMGFI DV
Sbjct: 121 LDSNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDV 180

Query: 181 YTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLAD 240
           YTSTALVH+YC+CLSISDASQLFDEMPERN+VTWNALITGYTHNRKF +AI+AFRGMLA 
Sbjct: 181 YTSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFREAINAFRGMLAA 240

Query: 241 GAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEV 300
           GA+PSERTVVVVLSACSHLGA NQGKWIH+FIY N+LRLNVFVGTALIDMYAKCG V E 
Sbjct: 241 GAEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEA 300

Query: 301 EKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQ 360
           EKVFEEIR++NVYTWNVLISGY MNGQG+AALQ FSRMLMENFKPD VTFLG+LCACCHQ
Sbjct: 301 EKVFEEIRDRNVYTWNVLISGYGMNGQGNAALQVFSRMLMENFKPDAVTFLGLLCACCHQ 360

Query: 361 GLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRA 420
           GLVTEGR QF+SMKQQFGLQP+IEHYGCMVDLLGRAGLLEEALELI+SMS+EPDPIIWRA
Sbjct: 361 GLVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRA 420

Query: 421 LLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGI 480
           LLCACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+YSRERRW EVGKLRGMM+LRGI
Sbjct: 421 LLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGI 480

Query: 481 RKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEK 540
            KVPGCSSIEINN V+EF ASNDRK EF AIYKQLDN++KKLKENGYVTGTDM+L+DIEK
Sbjct: 481 EKVPGCSSIEINNAVHEFTASNDRKREFSAIYKQLDNVMKKLKENGYVTGTDMSLFDIEK 540

Query: 541 EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRD 600
           EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKV+SLVYKRYIVVRD
Sbjct: 541 EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRD 600

Query: 601 RNRFHHFYEGFCSCRDYW 608
           RNRFHHF EG CSCRDYW
Sbjct: 601 RNRFHHFSEGVCSCRDYW 617

BLAST of CsaV3_1G002680 vs. NCBI nr
Match: XP_023003968.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima])

HSP 1 Score: 999.2 bits (2582), Expect = 6.1e-288
Identity = 490/571 (85.81%), Positives = 523/571 (91.59%), Query Frame = 0

Query: 46  MDLPFQATNGSKIPDY--------NDVRR-GHFLMKLIDDSVSRNGFESIARIFSKYRGS 105
           M+LP Q     KIPD         ND+R  G+FLMKLI+DSVS NGFESIA IFSK+RGS
Sbjct: 1   MNLPSQGGIERKIPDCLDALRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGS 60

Query: 106 INSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMI 165
           INSQ CNSMIR YLDLN+HLNSL IFA MHKFSILPDSSTFPAVLKATAQLCD  +GKMI
Sbjct: 61  INSQICNSMIRGYLDLNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMI 120

Query: 166 HGIVIQMGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKF 225
           HG V+QMGFI DVYTSTALVH+YC+CLSISDASQLFDEMPERN+VTWNALITGYTHNRKF
Sbjct: 121 HGAVVQMGFIRDVYTSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKF 180

Query: 226 VKAIDAFRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTAL 285
            +AI+AFRGMLA GA+PSERT+VVVLSACSHLGA NQGKWIH+FIY N+LRLNVFVGTAL
Sbjct: 181 KEAINAFRGMLAAGAEPSERTMVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTAL 240

Query: 286 IDMYAKCGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE 345
           IDMYAKCG V E EKVFEEIR+KNVYTWNVLISGY MNGQG+AALQAFSRMLMENFKPD 
Sbjct: 241 IDMYAKCGVVEEAEKVFEEIRDKNVYTWNVLISGYGMNGQGNAALQAFSRMLMENFKPDA 300

Query: 346 VTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ 405
           VTFLG+LCACCHQGLVTEGR QF+SMKQQFGLQP+IEHYGCMVDLLGRAGLLEEALELI+
Sbjct: 301 VTFLGLLCACCHQGLVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIE 360

Query: 406 SMSIEPDPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAE 465
           SMS+EPDPIIWRALLCACRVHGNTK+GEY I+RLIELEPNNGENYVLLSN+YSRERRW E
Sbjct: 361 SMSMEPDPIIWRALLCACRVHGNTKMGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIE 420

Query: 466 VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGY 525
           VGKLRGMM+LRGI KVPGCSSIEINN VYEF ASNDRK EF AIYKQLDN++KKLKENGY
Sbjct: 421 VGKLRGMMSLRGIEKVPGCSSIEINNAVYEFTASNDRKLEFSAIYKQLDNVMKKLKENGY 480

Query: 526 VTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 585
           VTGTDM+L+DIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKV
Sbjct: 481 VTGTDMSLFDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKV 540

Query: 586 LSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW 608
           +SLVYKRYIVVRDRNRFHHF E  CSCRDYW
Sbjct: 541 VSLVYKRYIVVRDRNRFHHFSERVCSCRDYW 571

BLAST of CsaV3_1G002680 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 451.1 bits (1159), Expect = 1.1e-126
Identity = 217/484 (44.83%), Positives = 319/484 (65.91%), Query Frame = 0

Query: 127 KFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSIS 186
           K ++ PD ST   V+ A AQ     +G+ +H  +   GF  ++    AL+ LY  C  + 
Sbjct: 259 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 318

Query: 187 DASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACS 246
            A  LF+ +P ++ ++WN LI GYTH   + +A+  F+ ML  G  P++ T++ +L AC+
Sbjct: 319 TACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACA 378

Query: 247 HLGAFNQGKWIHEFIYHNRLR--LNV-FVGTALIDMYAKCGAVYEVEKVFEEIREKNVYT 306
           HLGA + G+WIH +I   RL+   N   + T+LIDMYAKCG +    +VF  I  K++ +
Sbjct: 379 HLGAIDIGRWIHVYI-DKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSS 438

Query: 307 WNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMK 366
           WN +I G+AM+G+ DA+   FSRM     +PD++TF+G+L AC H G++  GR  F +M 
Sbjct: 439 WNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMT 498

Query: 367 QQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLG 426
           Q + + P++EHYGCM+DLLG +GL +EA E+I  M +EPD +IW +LL AC++HGN +LG
Sbjct: 499 QDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELG 558

Query: 427 EYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNV 486
           E   + LI++EP N  +YVLLSNIY+   RW EV K R ++N +G++KVPGCSSIEI++V
Sbjct: 559 ESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSV 618

Query: 487 VYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKL 546
           V+EF+  +   P    IY  L+ +   L++ G+V  T   L ++E+E KE ++ +HSEKL
Sbjct: 619 VHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKL 678

Query: 547 ALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSC 606
           A+AFGL+++     L IVKNLR+C +CHE  K++S +YKR I+ RDR RFHHF +G CSC
Sbjct: 679 AIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSC 738

Query: 607 RDYW 608
            DYW
Sbjct: 739 NDYW 741

BLAST of CsaV3_1G002680 vs. TAIR10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 428.3 bits (1100), Expect = 7.8e-120
Identity = 231/548 (42.15%), Positives = 331/548 (60.40%), Query Frame = 0

Query: 68  HFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHK 127
           HFL +L    + R+      R+FS+ R +     CN+MIR +           +F  + +
Sbjct: 48  HFLSRLALSLIPRD-INYSCRVFSQ-RLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRR 107

Query: 128 FSILPD---SSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLS 187
            S LP    SS+F   LK   +  D   G  IHG +   GF+ D    T L+ LY TC +
Sbjct: 108 NSSLPANPLSSSF--ALKCCIKSGDLLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCEN 167

Query: 188 ISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLAD---GAQPSERTVVVV 247
            +DA ++FDE+P+R+ V+WN L + Y  N++    +  F  M  D     +P   T ++ 
Sbjct: 168 STDACKVFDEIPKRDTVSWNVLFSCYLRNKRTRDVLVLFDKMKNDVDGCVKPDGVTCLLA 227

Query: 248 LSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKNV 307
           L AC++LGA + GK +H+FI  N L   + +   L+ MY++CG++ +  +VF  +RE+NV
Sbjct: 228 LQACANLGALDFGKQVHDFIDENGLSGALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNV 287

Query: 308 YTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMS 367
            +W  LISG AMNG G  A++AF+ ML     P+E T  G+L AC H GLV EG   F  
Sbjct: 288 VSWTALISGLAMNGFGKEAIEAFNEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDR 347

Query: 368 MKQ-QFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 427
           M+  +F ++P + HYGC+VDLLGRA LL++A  LI+SM ++PD  IWR LL ACRVHG+ 
Sbjct: 348 MRSGEFKIKPNLHHYGCVVDLLGRARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDV 407

Query: 428 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 487
           +LGE +I  LIEL+     +YVLL N YS   +W +V +LR +M  + I   PGCS+IE+
Sbjct: 408 ELGERVISHLIELKAEEAGDYVLLLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIEL 467

Query: 488 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIE-KEEKEHSVMYH 547
              V+EF+  +   P  E IYK L  + ++LK  GYV      L+++E +EEK +++ YH
Sbjct: 468 QGTVHEFIVDDVSHPRKEEIYKMLAEINQQLKIAGYVAEITSELHNLESEEEKGYALRYH 527

Query: 548 SEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEG 607
           SEKLA+AFG+L +P   T+R+ KNLR C+DCH F K +S VY R ++VRDR+RFHHF  G
Sbjct: 528 SEKLAIAFGILVTPPGTTIRVTKNLRTCVDCHNFAKFVSDVYDRIVIVRDRSRFHHFKGG 587

BLAST of CsaV3_1G002680 vs. TAIR10
Match: AT4G30700.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 425.6 bits (1093), Expect = 5.0e-119
Identity = 214/505 (42.38%), Positives = 308/505 (60.99%), Query Frame = 0

Query: 103 NSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQ 162
           N+MI  Y    +   SL +F  +        SST  +++  +  L    +   IHG  ++
Sbjct: 291 NAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVSGHLM---LIYAIHGYCLK 350

Query: 163 MGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDA 222
             F+     STAL  +Y     I  A +LFDE PE++  +WNA+I+GYT N     AI  
Sbjct: 351 SNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISL 410

Query: 223 FRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAK 282
           FR M      P+  T+  +LSAC+ LGA + GKW+H+ +       +++V TALI MYAK
Sbjct: 411 FREMQKSEFSPNPVTITCILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAK 470

Query: 283 CGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGV 342
           CG++ E  ++F+ + +KN  TWN +ISGY ++GQG  AL  F  ML     P  VTFL V
Sbjct: 471 CGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCV 530

Query: 343 LCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEP 402
           L AC H GLV EG   F SM  ++G +P ++HY CMVD+LGRAG L+ AL+ I++MSIEP
Sbjct: 531 LYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEP 590

Query: 403 DPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRG 462
              +W  LL ACR+H +T L   + ++L EL+P+N   +VLLSNI+S +R + +   +R 
Sbjct: 591 GSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQ 650

Query: 463 MMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDM 522
               R + K PG + IEI    + F + +   P+ + IY++L+ L  K++E GY   T++
Sbjct: 651 TAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPETEL 710

Query: 523 ALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYK 582
           AL+D+E+EE+E  V  HSE+LA+AFGL+ +     +RI+KNLR+CLDCH   K++S + +
Sbjct: 711 ALHDVEEEERELMVKVHSERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITE 770

Query: 583 RYIVVRDRNRFHHFYEGFCSCRDYW 608
           R IVVRD NRFHHF +G CSC DYW
Sbjct: 771 RVIVVRDANRFHHFKDGVCSCGDYW 792

BLAST of CsaV3_1G002680 vs. TAIR10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 424.9 bits (1091), Expect = 8.6e-119
Identity = 220/543 (40.52%), Positives = 319/543 (58.75%), Query Frame = 0

Query: 69  FLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHKF 128
           F+   + D+ SRN     A I  + R + +    N+M+  Y   +    +L +FALMHK 
Sbjct: 453 FVSTALIDAYSRNRCMKEAEILFE-RHNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQ 512

Query: 129 SILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSISDA 188
               D  T   V K    L     GK +H   I+ G+  D++ S+ ++ +Y  C  +S A
Sbjct: 513 GERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAA 572

Query: 189 SQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACSHL 248
              FD +P  + V W  +I+G   N +  +A   F  M   G  P E T+  +  A S L
Sbjct: 573 QFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCL 632

Query: 249 GAFNQGKWIHEFIYHNRLRLNV----FVGTALIDMYAKCGAVYEVEKVFEEIREKNVYTW 308
            A  QG+ IH     N L+LN     FVGT+L+DMYAKCG++ +   +F+ I   N+  W
Sbjct: 633 TALEQGRQIHA----NALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAW 692

Query: 309 NVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMKQ 368
           N ++ G A +G+G   LQ F +M     KPD+VTF+GVL AC H GLV+E      SM  
Sbjct: 693 NAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHG 752

Query: 369 QFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLGE 428
            +G++P IEHY C+ D LGRAGL+++A  LI+SMS+E    ++R LL ACRV G+T+ G+
Sbjct: 753 DYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGK 812

Query: 429 YIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVV 488
            +  +L+ELEP +   YVLLSN+Y+   +W E+   R MM    ++K PG S IE+ N +
Sbjct: 813 RVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKI 872

Query: 489 YEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLA 548
           + FV  +    + E IY+++ ++I+ +K+ GYV  TD  L D+E+EEKE ++ YHSEKLA
Sbjct: 873 HIFVVDDRSNRQTELIYRKVKDMIRDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLA 932

Query: 549 LAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSCR 608
           +AFGLL++P    +R++KNLR+C DCH   K ++ VY R IV+RD NRFH F +G CSC 
Sbjct: 933 VAFGLLSTPPSTPIRVIKNLRVCGDCHNAMKYIAKVYNREIVLRDANRFHRFKDGICSCG 990

BLAST of CsaV3_1G002680 vs. TAIR10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 420.6 bits (1080), Expect = 1.6e-117
Identity = 211/506 (41.70%), Positives = 309/506 (61.07%), Query Frame = 0

Query: 103 NSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQ 162
           NSM R Y      L    +F  + +  ILPD+ TFP++LKA A       G+ +H + ++
Sbjct: 98  NSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMK 157

Query: 163 MGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDA 222
           +G   +VY    L+++Y  C  +  A  +FD + E   V +NA+ITGY    +  +A+  
Sbjct: 158 LGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSL 217

Query: 223 FRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAK 282
           FR M     +P+E T++ VLS+C+ LG+ + GKWIH++   +     V V TALIDM+AK
Sbjct: 218 FREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAK 277

Query: 283 CGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGV 342
           CG++ +   +FE++R K+   W+                           +PDE+TFLG+
Sbjct: 278 CGSLDDAVSIFEKMRYKDTQAWSXXXXXXXXXXXXXXXXXXXXXXXXXXXQPDEITFLGL 337

Query: 343 LCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEP 402
           L AC H G V EGR  F  M  +FG+ P I+HYG MVDLL RAG LE+A E I  + I P
Sbjct: 338 LNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISP 397

Query: 403 DPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRG 462
            P++WR LL AC  H N  L E + +R+ EL+ ++G +YV+LSN+Y+R ++W  V  LR 
Sbjct: 398 TPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRK 457

Query: 463 MMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDM 522
           +M  R   KVPGCSSIE+NNVV+EF + +  K     +++ LD ++K+LK +GYV  T M
Sbjct: 458 VMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSM 517

Query: 523 ALY-DIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVY 582
            ++ ++  +EKE ++ YHSEKLA+ FGLLN+P   T+R+VKNLR+C DCH   K++SL++
Sbjct: 518 VVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIF 577

Query: 583 KRYIVVRDRNRFHHFYEGFCSCRDYW 608
            R +V+RD  RFHHF +G CSC D+W
Sbjct: 578 GRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CsaV3_1G002680 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 451.1 bits (1159), Expect = 2.0e-125
Identity = 217/484 (44.83%), Positives = 319/484 (65.91%), Query Frame = 0

Query: 127 KFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSIS 186
           K ++ PD ST   V+ A AQ     +G+ +H  +   GF  ++    AL+ LY  C  + 
Sbjct: 259 KTNVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELE 318

Query: 187 DASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACS 246
            A  LF+ +P ++ ++WN LI GYTH   + +A+  F+ ML  G  P++ T++ +L AC+
Sbjct: 319 TACGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACA 378

Query: 247 HLGAFNQGKWIHEFIYHNRLR--LNV-FVGTALIDMYAKCGAVYEVEKVFEEIREKNVYT 306
           HLGA + G+WIH +I   RL+   N   + T+LIDMYAKCG +    +VF  I  K++ +
Sbjct: 379 HLGAIDIGRWIHVYI-DKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSS 438

Query: 307 WNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMK 366
           WN +I G+AM+G+ DA+   FSRM     +PD++TF+G+L AC H G++  GR  F +M 
Sbjct: 439 WNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMT 498

Query: 367 QQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLG 426
           Q + + P++EHYGCM+DLLG +GL +EA E+I  M +EPD +IW +LL AC++HGN +LG
Sbjct: 499 QDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELG 558

Query: 427 EYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNV 486
           E   + LI++EP N  +YVLLSNIY+   RW EV K R ++N +G++KVPGCSSIEI++V
Sbjct: 559 ESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSV 618

Query: 487 VYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKL 546
           V+EF+  +   P    IY  L+ +   L++ G+V  T   L ++E+E KE ++ +HSEKL
Sbjct: 619 VHEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKL 678

Query: 547 ALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSC 606
           A+AFGL+++     L IVKNLR+C +CHE  K++S +YKR I+ RDR RFHHF +G CSC
Sbjct: 679 AIAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSC 738

Query: 607 RDYW 608
            DYW
Sbjct: 739 NDYW 741

BLAST of CsaV3_1G002680 vs. Swiss-Prot
Match: sp|Q9SN85|PP267_ARATH (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 428.3 bits (1100), Expect = 1.4e-118
Identity = 231/548 (42.15%), Positives = 331/548 (60.40%), Query Frame = 0

Query: 68  HFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHK 127
           HFL +L    + R+      R+FS+ R +     CN+MIR +           +F  + +
Sbjct: 48  HFLSRLALSLIPRD-INYSCRVFSQ-RLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRR 107

Query: 128 FSILPD---SSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLS 187
            S LP    SS+F   LK   +  D   G  IHG +   GF+ D    T L+ LY TC +
Sbjct: 108 NSSLPANPLSSSF--ALKCCIKSGDLLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCEN 167

Query: 188 ISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLAD---GAQPSERTVVVV 247
            +DA ++FDE+P+R+ V+WN L + Y  N++    +  F  M  D     +P   T ++ 
Sbjct: 168 STDACKVFDEIPKRDTVSWNVLFSCYLRNKRTRDVLVLFDKMKNDVDGCVKPDGVTCLLA 227

Query: 248 LSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKNV 307
           L AC++LGA + GK +H+FI  N L   + +   L+ MY++CG++ +  +VF  +RE+NV
Sbjct: 228 LQACANLGALDFGKQVHDFIDENGLSGALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNV 287

Query: 308 YTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMS 367
            +W  LISG AMNG G  A++AF+ ML     P+E T  G+L AC H GLV EG   F  
Sbjct: 288 VSWTALISGLAMNGFGKEAIEAFNEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDR 347

Query: 368 MKQ-QFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 427
           M+  +F ++P + HYGC+VDLLGRA LL++A  LI+SM ++PD  IWR LL ACRVHG+ 
Sbjct: 348 MRSGEFKIKPNLHHYGCVVDLLGRARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDV 407

Query: 428 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 487
           +LGE +I  LIEL+     +YVLL N YS   +W +V +LR +M  + I   PGCS+IE+
Sbjct: 408 ELGERVISHLIELKAEEAGDYVLLLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIEL 467

Query: 488 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIE-KEEKEHSVMYH 547
              V+EF+  +   P  E IYK L  + ++LK  GYV      L+++E +EEK +++ YH
Sbjct: 468 QGTVHEFIVDDVSHPRKEEIYKMLAEINQQLKIAGYVAEITSELHNLESEEEKGYALRYH 527

Query: 548 SEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEG 607
           SEKLA+AFG+L +P   T+R+ KNLR C+DCH F K +S VY R ++VRDR+RFHHF  G
Sbjct: 528 SEKLAIAFGILVTPPGTTIRVTKNLRTCVDCHNFAKFVSDVYDRIVIVRDRSRFHHFKGG 587

BLAST of CsaV3_1G002680 vs. Swiss-Prot
Match: sp|Q9SUH6|PP341_ARATH (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 9.1e-118
Identity = 214/505 (42.38%), Positives = 308/505 (60.99%), Query Frame = 0

Query: 103 NSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQ 162
           N+MI  Y    +   SL +F  +        SST  +++  +  L    +   IHG  ++
Sbjct: 291 NAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVSGHLM---LIYAIHGYCLK 350

Query: 163 MGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDA 222
             F+     STAL  +Y     I  A +LFDE PE++  +WNA+I+GYT N     AI  
Sbjct: 351 SNFLSHASVSTALTTVYSKLNEIESARKLFDESPEKSLPSWNAMISGYTQNGLTEDAISL 410

Query: 223 FRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAK 282
           FR M      P+  T+  +LSAC+ LGA + GKW+H+ +       +++V TALI MYAK
Sbjct: 411 FREMQKSEFSPNPVTITCILSACAQLGALSLGKWVHDLVRSTDFESSIYVSTALIGMYAK 470

Query: 283 CGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGV 342
           CG++ E  ++F+ + +KN  TWN +ISGY ++GQG  AL  F  ML     P  VTFL V
Sbjct: 471 CGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNIFYEMLNSGITPTPVTFLCV 530

Query: 343 LCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEP 402
           L AC H GLV EG   F SM  ++G +P ++HY CMVD+LGRAG L+ AL+ I++MSIEP
Sbjct: 531 LYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGRAGHLQRALQFIEAMSIEP 590

Query: 403 DPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRG 462
              +W  LL ACR+H +T L   + ++L EL+P+N   +VLLSNI+S +R + +   +R 
Sbjct: 591 GSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLLSNIHSADRNYPQAATVRQ 650

Query: 463 MMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDM 522
               R + K PG + IEI    + F + +   P+ + IY++L+ L  K++E GY   T++
Sbjct: 651 TAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEKLEKLEGKMREAGYQPETEL 710

Query: 523 ALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYK 582
           AL+D+E+EE+E  V  HSE+LA+AFGL+ +     +RI+KNLR+CLDCH   K++S + +
Sbjct: 711 ALHDVEEEERELMVKVHSERLAIAFGLIATEPGTEIRIIKNLRVCLDCHTVTKLISKITE 770

Query: 583 RYIVVRDRNRFHHFYEGFCSCRDYW 608
           R IVVRD NRFHHF +G CSC DYW
Sbjct: 771 RVIVVRDANRFHHFKDGVCSCGDYW 792

BLAST of CsaV3_1G002680 vs. Swiss-Prot
Match: sp|Q9SMZ2|PP347_ARATH (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 424.9 bits (1091), Expect = 1.5e-117
Identity = 220/543 (40.52%), Positives = 319/543 (58.75%), Query Frame = 0

Query: 69  FLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHKF 128
           F+   + D+ SRN     A I  + R + +    N+M+  Y   +    +L +FALMHK 
Sbjct: 453 FVSTALIDAYSRNRCMKEAEILFE-RHNFDLVAWNAMMAGYTQSHDGHKTLKLFALMHKQ 512

Query: 129 SILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSISDA 188
               D  T   V K    L     GK +H   I+ G+  D++ S+ ++ +Y  C  +S A
Sbjct: 513 GERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLWVSSGILDMYVKCGDMSAA 572

Query: 189 SQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACSHL 248
              FD +P  + V W  +I+G   N +  +A   F  M   G  P E T+  +  A S L
Sbjct: 573 QFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMGVLPDEFTIATLAKASSCL 632

Query: 249 GAFNQGKWIHEFIYHNRLRLNV----FVGTALIDMYAKCGAVYEVEKVFEEIREKNVYTW 308
            A  QG+ IH     N L+LN     FVGT+L+DMYAKCG++ +   +F+ I   N+  W
Sbjct: 633 TALEQGRQIHA----NALKLNCTNDPFVGTSLVDMYAKCGSIDDAYCLFKRIEMMNITAW 692

Query: 309 NVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMKQ 368
           N ++ G A +G+G   LQ F +M     KPD+VTF+GVL AC H GLV+E      SM  
Sbjct: 693 NAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSGLVSEAYKHMRSMHG 752

Query: 369 QFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLGE 428
            +G++P IEHY C+ D LGRAGL+++A  LI+SMS+E    ++R LL ACRV G+T+ G+
Sbjct: 753 DYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTLLAACRVQGDTETGK 812

Query: 429 YIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVV 488
            +  +L+ELEP +   YVLLSN+Y+   +W E+   R MM    ++K PG S IE+ N +
Sbjct: 813 RVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVKKDPGFSWIEVKNKI 872

Query: 489 YEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLA 548
           + FV  +    + E IY+++ ++I+ +K+ GYV  TD  L D+E+EEKE ++ YHSEKLA
Sbjct: 873 HIFVVDDRSNRQTELIYRKVKDMIRDIKQEGYVPETDFTLVDVEEEEKERALYYHSEKLA 932

Query: 549 LAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSCR 608
           +AFGLL++P    +R++KNLR+C DCH   K ++ VY R IV+RD NRFH F +G CSC 
Sbjct: 933 VAFGLLSTPPSTPIRVIKNLRVCGDCHNAMKYIAKVYNREIVLRDANRFHRFKDGICSCG 990

BLAST of CsaV3_1G002680 vs. Swiss-Prot
Match: sp|Q8LK93|PP145_ARATH (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 420.6 bits (1080), Expect = 2.9e-116
Identity = 211/506 (41.70%), Positives = 309/506 (61.07%), Query Frame = 0

Query: 103 NSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQ 162
           NSM R Y      L    +F  + +  ILPD+ TFP++LKA A       G+ +H + ++
Sbjct: 98  NSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMK 157

Query: 163 MGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDA 222
           +G   +VY    L+++Y  C  +  A  +FD + E   V +NA+ITGY    +  +A+  
Sbjct: 158 LGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSL 217

Query: 223 FRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAK 282
           FR M     +P+E T++ VLS+C+ LG+ + GKWIH++   +     V V TALIDM+AK
Sbjct: 218 FREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAK 277

Query: 283 CGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGV 342
           CG++ +   +FE++R K+   W+                           +PDE+TFLG+
Sbjct: 278 CGSLDDAVSIFEKMRYKDTQAWSXXXXXXXXXXXXXXXXXXXXXXXXXXXQPDEITFLGL 337

Query: 343 LCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEP 402
           L AC H G V EGR  F  M  +FG+ P I+HYG MVDLL RAG LE+A E I  + I P
Sbjct: 338 LNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISP 397

Query: 403 DPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRG 462
            P++WR LL AC  H N  L E + +R+ EL+ ++G +YV+LSN+Y+R ++W  V  LR 
Sbjct: 398 TPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRK 457

Query: 463 MMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDM 522
           +M  R   KVPGCSSIE+NNVV+EF + +  K     +++ LD ++K+LK +GYV  T M
Sbjct: 458 VMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSM 517

Query: 523 ALY-DIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVY 582
            ++ ++  +EKE ++ YHSEKLA+ FGLLN+P   T+R+VKNLR+C DCH   K++SL++
Sbjct: 518 VVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIF 577

Query: 583 KRYIVVRDRNRFHHFYEGFCSCRDYW 608
            R +V+RD  RFHHF +G CSC D+W
Sbjct: 578 GRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CsaV3_1G002680 vs. TrEMBL
Match: tr|A0A0A0LUK8|A0A0A0LUK8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G011530 PE=4 SV=1)

HSP 1 Score: 1243.8 bits (3217), Expect = 0.0e+00
Identity = 607/607 (100.00%), Positives = 607/607 (100.00%), Query Frame = 0

Query: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60
           MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD
Sbjct: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60

Query: 61  YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY 120
           YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY
Sbjct: 61  YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY 120

Query: 121 IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC 180
           IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC
Sbjct: 121 IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC 180

Query: 181 TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV 240
           TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV
Sbjct: 181 TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV 240

Query: 241 VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN 300
           VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN
Sbjct: 241 VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN 300

Query: 301 VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM 360
           VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM
Sbjct: 301 VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM 360

Query: 361 SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 420
           SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT
Sbjct: 361 SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 420

Query: 421 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 480
           KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI
Sbjct: 421 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 480

Query: 481 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS 540
           NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS
Sbjct: 481 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS 540

Query: 541 EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF 600
           EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF
Sbjct: 541 EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF 600

Query: 601 CSCRDYW 608
           CSCRDYW
Sbjct: 601 CSCRDYW 607

BLAST of CsaV3_1G002680 vs. TrEMBL
Match: tr|A0A1S4DZH3|A0A1S4DZH3_CUCME (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=3656 GN=LOC103494017 PE=4 SV=1)

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 531/562 (94.48%), Positives = 547/562 (97.33%), Query Frame = 0

Query: 46  MDLPFQATNGSKIPDYNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSM 105
           MDLPFQ TN  K PDYNDVRRGHF+MKLIDDSVS NGFESIARIFSKYRGSINSQQCNSM
Sbjct: 1   MDLPFQETNDRKTPDYNDVRRGHFVMKLIDDSVSHNGFESIARIFSKYRGSINSQQCNSM 60

Query: 106 IRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGF 165
           IR YLDLNKHLNSLYIFA MHKFSILPD STFPAVLKATAQLCDT VGKMIHGIVIQMGF
Sbjct: 61  IRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQLCDTEVGKMIHGIVIQMGF 120

Query: 166 ICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRG 225
           ICDVYTSTALVH+Y TCLSISDASQ+FDEM ERNAVTWNALITGYTHNRKF++AIDAFRG
Sbjct: 121 ICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALITGYTHNRKFMEAIDAFRG 180

Query: 226 MLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGA 285
           MLA GAQPSERTVV+VLSACSHLGA NQGKWIH+FIYHNRLRLNVFVGTALIDMYAKCGA
Sbjct: 181 MLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLRLNVFVGTALIDMYAKCGA 240

Query: 286 VYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA 345
           V EVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA
Sbjct: 241 VDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA 300

Query: 346 CCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPI 405
           CCHQGLVTEGR QFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMS+EPDPI
Sbjct: 301 CCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSMEPDPI 360

Query: 406 IWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMN 465
           IWRALLCACRVHGNTKLGEYI+KRL+ELEPNNGENYVLLSNIY+RERRWAEVGKLRGMMN
Sbjct: 361 IWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNIYARERRWAEVGKLRGMMN 420

Query: 466 LRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALY 525
           LRGIRKVPGCSSIEINNVVYEFVASNDRKPE+EAIYKQLDNLIKKLKENGYVTGTDMALY
Sbjct: 421 LRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNLIKKLKENGYVTGTDMALY 480

Query: 526 DIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYI 585
           D+EKEEKEHS+MYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV+SLVYKRYI
Sbjct: 481 DVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYI 540

Query: 586 VVRDRNRFHHFYEGFCSCRDYW 608
           VVRDRNRFHHF+EGFCSCRDYW
Sbjct: 541 VVRDRNRFHHFFEGFCSCRDYW 562

BLAST of CsaV3_1G002680 vs. TrEMBL
Match: tr|D7SI59|D7SI59_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_17s0000g06770 PE=4 SV=1)

HSP 1 Score: 776.5 bits (2004), Expect = 4.2e-221
Identity = 375/538 (69.70%), Positives = 443/538 (82.34%), Query Frame = 0

Query: 70  LMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHKFS 129
           LMKLID SVS +GF + A +F+++ G I+S  CNSMIR Y D NKHL+S++I+  M K  
Sbjct: 78  LMKLIDFSVSSHGFAASALLFTQFYGFIDSDLCNSMIRCYTDSNKHLHSVFIYTQMWKNG 137

Query: 130 ILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSISDAS 189
           I PDSSTFP VLK+ AQLC   +GK IH  +IQMGF  +VY STALV++Y TC S+SDA 
Sbjct: 138 IFPDSSTFPTVLKSVAQLCRQELGKAIHCCIIQMGFESNVYVSTALVNMYGTCSSVSDAR 197

Query: 190 QLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACSHLG 249
           Q+FDE+P+RN V+WNALITGY HNR F K ID FR M   GA+P E T+V VL AC+HLG
Sbjct: 198 QVFDEIPDRNIVSWNALITGYNHNRMFRKVIDVFREMQIAGAKPVEVTMVGVLLACAHLG 257

Query: 250 AFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKNVYTWNVLIS 309
           A NQG+WI ++I HNRLRLNVFVGTALIDMYAKCG V E EK+F+ +R KNVYTWNVLIS
Sbjct: 258 ALNQGRWIDDYIDHNRLRLNVFVGTALIDMYAKCGVVDEAEKIFKAMRVKNVYTWNVLIS 317

Query: 310 GYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQ 369
           GYAMNG+G++ALQAFSRM+ME FKPDEVTFLGVLCACCHQGLV EGR  F SMK++FGL+
Sbjct: 318 GYAMNGRGESALQAFSRMIMEKFKPDEVTFLGVLCACCHQGLVNEGRTYFTSMKEEFGLR 377

Query: 370 PRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLGEYIIKR 429
           PRIEHYGCMVDLLGRAG L+EA +LIQ+MS++PDPIIWR LL ACR+HGN +LGE+ IK+
Sbjct: 378 PRIEHYGCMVDLLGRAGFLDEAQQLIQAMSMQPDPIIWRELLGACRIHGNIQLGEFAIKK 437

Query: 430 LIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVA 489
           L+ELEPNNGENYVLL+N+Y+R++RW +VG++R MM+ R +RKVPGCSSIEI+NVVYEFV 
Sbjct: 438 LLELEPNNGENYVLLANLYARDQRWDKVGEVREMMDCRRVRKVPGCSSIEIDNVVYEFVV 497

Query: 490 SNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAFGL 549
           SN  KP FE +YK L ++ KKLK  GYV  T MA YDIE+EEKEHS+MYHSEKLALAFGL
Sbjct: 498 SNYIKPGFEEVYKLLADMNKKLKLAGYVADTGMASYDIEEEEKEHSLMYHSEKLALAFGL 557

Query: 550 LNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW 608
           L SP   TLRIVKNLRIC DCH FFK++S VY+R I VRDRNRFHHF  G CSC+DYW
Sbjct: 558 LKSPSGLTLRIVKNLRICQDCHGFFKIVSKVYRRDISVRDRNRFHHFVGGACSCKDYW 615

BLAST of CsaV3_1G002680 vs. TrEMBL
Match: tr|A0A2R6PCT2|A0A2R6PCT2_ACTCH (Pentatricopeptide repeat-containing protein OS=Actinidia chinensis var. chinensis OX=1590841 GN=CEY00_Acc29396 PE=4 SV=1)

HSP 1 Score: 745.7 bits (1924), Expect = 8.0e-212
Identity = 355/538 (65.99%), Positives = 434/538 (80.67%), Query Frame = 0

Query: 70  LMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHKFS 129
           LMKLI+ SVS  GF S A +F+++   I+S+ CNS+IR+Y  LNKH++S++++  M K  
Sbjct: 51  LMKLINSSVSSYGFASSAPLFAQFNHFIDSELCNSVIRSYTHLNKHVHSVFVYTQMCKAG 110

Query: 130 ILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSISDAS 189
           I PDSSTFPAVLK+  +L    +GK IH  V++MGF+ D+YT+TALVH+YCTCL   +  
Sbjct: 111 ISPDSSTFPAVLKSVTKLGRGDIGKSIHCCVVKMGFVSDLYTNTALVHMYCTCLLPGEGR 170

Query: 190 QLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACSHLG 249
           Q+FD MPERNAV+WNALI+GY HNRKF +AIDAFR M A GA+P E T+V VLSACSHLG
Sbjct: 171 QVFDVMPERNAVSWNALISGYAHNRKFREAIDAFRDMQAAGAKPGEVTMVGVLSACSHLG 230

Query: 250 AFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKNVYTWNVLIS 309
           A NQGKWIH++I  N+LRLNVFVGTALIDMYAKCG V E ++VF  +R KNVYTWNVLIS
Sbjct: 231 ALNQGKWIHDYIVRNKLRLNVFVGTALIDMYAKCGVVDEAQRVFGAVRVKNVYTWNVLIS 290

Query: 310 GYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQ 369
           GYAMNGQG+AALQAF  M++EN++PDEVTFLG+LCACCHQGLV  GR    +MK+++GL 
Sbjct: 291 GYAMNGQGEAALQAFETMIVENYRPDEVTFLGILCACCHQGLVEVGRRHLRNMKEEYGLN 350

Query: 370 PRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLGEYIIKR 429
           PRIEHYGCMVDLLGRAGL  EA EL+ +M+++PDPIIWRA L ACR+HG+T+LGE  IK 
Sbjct: 351 PRIEHYGCMVDLLGRAGLFVEAQELMHTMNMKPDPIIWRAFLGACRIHGHTQLGETAIKN 410

Query: 430 LIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVA 489
           LIELEP NGENY+LLSN+Y+R+ +W+EVG++R +MN  GIRKVPGCSSIEI N VYEFV 
Sbjct: 411 LIELEPENGENYILLSNLYARDHKWSEVGEVREIMNRGGIRKVPGCSSIEIENAVYEFVV 470

Query: 490 SNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAFGL 549
           SN   P +E +YK L N+ ++LK  GY   TDM  YDIE+EEKE ++ YHSEKLALAFGL
Sbjct: 471 SNLMGPGYEELYKLLANVKRELKVAGYAEYTDMVSYDIEEEEKEQTLTYHSEKLALAFGL 530

Query: 550 LNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW 608
           LNS  D TLRI+KNLRIC DCH+FFK++S +Y+R I VRDRNRFHHF  G CSC+DYW
Sbjct: 531 LNSLPDTTLRILKNLRICQDCHQFFKLVSELYRRDITVRDRNRFHHFSGGVCSCKDYW 588

BLAST of CsaV3_1G002680 vs. TrEMBL
Match: tr|A0A1U8A8W5|A0A1U8A8W5_NELNU (pentatricopeptide repeat-containing protein At4g21065-like OS=Nelumbo nucifera OX=4432 GN=LOC104598183 PE=4 SV=1)

HSP 1 Score: 730.7 bits (1885), Expect = 2.7e-207
Identity = 350/563 (62.17%), Positives = 433/563 (76.91%), Query Frame = 0

Query: 58  IPDYNDVRR--GHF-----------LMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNS 117
           I DYN +++  GH            LMKLID S S +G +  A +F++++  INS+ C S
Sbjct: 53  ITDYNHMKQFLGHIITNNIAIDEFSLMKLIDLSFSSSGSDVSAHLFTQFQDFINSEICTS 112

Query: 118 MIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMG 177
           MIR++   NKH  S++++ +MHK+  +PDSSTFPAVLK+TAQ+C    GK +H  + Q G
Sbjct: 113 MIRSFTHSNKHFLSIFVYIMMHKYGYVPDSSTFPAVLKSTAQVCRRRFGKSVHAYIFQTG 172

Query: 178 FICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFR 237
           F  DV+T+TALVH+Y TC SI +A +LFDEMP +N+V+WNALITGYTHNRKF +AI  FR
Sbjct: 173 FNSDVFTNTALVHMYATCTSIGEARRLFDEMPVKNSVSWNALITGYTHNRKFREAISTFR 232

Query: 238 GMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCG 297
            M   G +P E T+V VLSAC HLGA NQGKWIH++I   RLRLNVFVGTALIDMYAKCG
Sbjct: 233 EMQISGFEPGEVTMVGVLSACGHLGALNQGKWIHDYIVQKRLRLNVFVGTALIDMYAKCG 292

Query: 298 AVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLC 357
            V E EKVF  +R KNVYTWNVLISG+ MNGQG+AALQAFSRM+MENFKPD VT L VLC
Sbjct: 293 VVDEAEKVFGAMRVKNVYTWNVLISGFTMNGQGEAALQAFSRMVMENFKPDGVTLLAVLC 352

Query: 358 ACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDP 417
           ACC QGL+ EGR  F+SM++++GL+P IEHYGCMVDLLGRAG L EA ELI++M  +PDP
Sbjct: 353 ACCRQGLIKEGRRYFVSMEKEYGLRPGIEHYGCMVDLLGRAGFLNEAQELIRTMPYKPDP 412

Query: 418 IIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMM 477
           ++WRALL ACR+HG+T+LGE  I+ L+ LEPNNGENYVLLSN+Y+R  RW +VG++R MM
Sbjct: 413 VVWRALLGACRIHGSTQLGEVAIRNLLGLEPNNGENYVLLSNLYARGHRWTKVGEVRDMM 472

Query: 478 NLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMAL 537
           N +GIRK+PGCSSIE+++ VYEFV SN    E   +Y  L ++  ++K  GYV  T+M  
Sbjct: 473 NRKGIRKIPGCSSIEVDDAVYEFVVSNSLDVELGEVYNMLADMKNEMKLAGYVAETEMVS 532

Query: 538 YDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRY 597
           YDIE+EEKE+S+MYHSEKLALAFGLL +  D T+RIVKNLRIC DCH F K++S +YKR 
Sbjct: 533 YDIEEEEKENSLMYHSEKLALAFGLLKTSSDSTIRIVKNLRICKDCHGFCKIVSKIYKRN 592

Query: 598 IVVRDRNRFHHFYEGFCSCRDYW 608
           IVVRDRN FHHF  G CSC+DYW
Sbjct: 593 IVVRDRNLFHHFAGGLCSCKDYW 615
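
The identity and positives percentages on each HSP statistics line are the reported counts divided by the alignment length. A minimal Python sketch for reading such a line and recomputing the percentages is given here; the function name `parse_hsp_stats` and the dictionary keys are illustrative assumptions, not part of any BLAST tool, and the pattern accepts either spelling of the positives label.

```python
import re

# Matches the statistics line of an HSP as formatted in this record.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed percentages for one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identities, length, _, positives, pos_length, _ = m.groups()
    identities, length = int(identities), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / pos_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 217/484 (44.83%), Positives = 319/484 (65.91%), Query Frame = 0"
)
print(stats)
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check on each report line.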

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_004138309.2  0.0e+00   100.00    PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis s...
XP_016901378.1  0.0e+00   94.48     PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis m...
XP_023529316.1  1.2e-304  84.30     pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp...
XP_022925029.1  3.0e-303  83.82     pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita moschata]
XP_023003968.1  6.1e-288  85.81     pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita maxima]

Match Name   E-value   Identity  Description
AT1G08070.1  1.1e-126  44.83     Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G47530.1  7.8e-120  42.15     Pentatricopeptide repeat (PPR) superfamily protein
AT4G30700.1  5.0e-119  42.38     Pentatricopeptide repeat (PPR) superfamily protein
AT4G33170.1  8.6e-119  40.52     Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G02980.1  1.6e-117  41.70     Pentatricopeptide repeat (PPR) superfamily protein

Match Name             E-value   Identity  Description
sp|Q9LN01|PPR21_ARATH  2.0e-125  44.83     Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop...
sp|Q9SN85|PP267_ARATH  1.4e-118  42.15     Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX...
sp|Q9SUH6|PP341_ARATH  9.1e-118  42.38     Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX...
sp|Q9SMZ2|PP347_ARATH  1.5e-117  40.52     Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX...
sp|Q8LK93|PP145_ARATH  2.9e-116  41.70     Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop...

Match Name                      E-value   Identity  Description
tr|A0A0A0LUK8|A0A0A0LUK8_CUCSA  0.0e+00   100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G011530 PE=4 SV=1
tr|A0A1S4DZH3|A0A1S4DZH3_CUCME  0.0e+00   94.48     pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=36...
tr|D7SI59|D7SI59_VITVI          4.2e-221  69.70     Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_17s0000g06770 PE=4 SV=...
tr|A0A2R6PCT2|A0A2R6PCT2_ACTCH  8.0e-212  65.99     Pentatricopeptide repeat-containing protein OS=Actinidia chinensis var. chinensi...
tr|A0A1U8A8W5|A0A1U8A8W5_NELNU  2.7e-207  62.17     pentatricopeptide repeat-containing protein At4g21065-like OS=Nelumbo nucifera O...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
Vocabulary: INTERPRO
Term        Definition
IPR032867   DYW_dom
IPR011990   TPR-like_helical_dom_sf
IPR002885   Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_1G002680.1  CsaV3_1G002680.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                                    Source       Source Term       Source Description   Alignment
None       No IPR available                                   COILS        Coil              Coil                 coord: 494..514
None       No IPR available                                   PANTHER      PTHR24015:SF1602  SUBFAMILY NOT NAMED  coord: 64..175
None       No IPR available                                   PANTHER      PTHR24015         FAMILY NOT NAMED     coord: 179..480
None       No IPR available                                   PANTHER      PTHR24015:SF1602  SUBFAMILY NOT NAMED  coord: 179..480
None       No IPR available                                   PANTHER      PTHR24015         FAMILY NOT NAMED     coord: 64..175
IPR002885  Pentatricopeptide repeat                           PFAM         PF01535           PPR                  coord: 374..398; e-value: 0.0018; score: 18.3
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041           PPR_2                coord: 199..246; e-value: 2.2E-9; score: 37.3
IPR002885  Pentatricopeptide repeat                           PFAM         PF13041           PPR_2                coord: 299..348; e-value: 1.9E-11; score: 43.9
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            coord: 201..234; e-value: 1.0E-6; score: 26.5
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            coord: 302..336; e-value: 3.4E-8; score: 31.2
IPR002885  Pentatricopeptide repeat                           TIGRFAM      TIGR00756         TIGR00756            coord: 375..399; e-value: 0.0019; score: 16.2
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 403..433; score: 5.623
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 98..132; score: 7.213
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 437..471; score: 6.566
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 234..268; score: 5.218
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 335..370; score: 7.443
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 300..334; score: 12.079
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 371..401; score: 7.388
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 168..198; score: 8.21
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 199..233; score: 11.093
IPR002885  Pentatricopeptide repeat                           PROSITE      PS51375           PPR                  coord: 269..299; score: 8.079
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10                       coord: 354..547; e-value: 1.1E-13; score: 53.4
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10                       coord: 151..253; e-value: 1.3E-18; score: 69.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       G3DSA:1.25.40.10                       coord: 254..353; e-value: 4.0E-24; score: 86.9
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  SSF48452          TPR-like             coord: 295..324
IPR011990  Tetratricopeptide-like helical domain superfamily  SUPERFAMILY  SSF48452          TPR-like             coord: 386..456
IPR032867  DYW domain                                         PFAM         PF14432           DYW_deaminase        coord: 473..596; e-value: 1.1E-35; score: 122.2
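
The `coord:` ranges reported for each InterPro match can be converted into match lengths, for example to see how many residues the PROSITE PPR profile (PS51375) matches span in total. The sketch below assumes the ranges are 1-based and inclusive, which this page does not state; `parse_coord` and `span_length` are hypothetical helper names.

```python
import re


def parse_coord(text):
    """Parse a string such as 'coord: 403..433' into (403, 433)."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinate range in: {text!r}")
    return int(m.group(1)), int(m.group(2))


def span_length(span):
    """Length of a range, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


# PROSITE PS51375 (PPR) ranges copied from the InterPro table above.
PROSITE_PPR = [
    "coord: 403..433", "coord: 98..132", "coord: 437..471",
    "coord: 234..268", "coord: 335..370", "coord: 300..334",
    "coord: 371..401", "coord: 168..198", "coord: 199..233",
    "coord: 269..299",
]

# Sort the repeats by start position and sum their lengths.
spans = sorted(parse_coord(c) for c in PROSITE_PPR)
total = sum(span_length(s) for s in spans)

for start, end in spans:
    print(f"{start:>4}..{end:<4} length {span_length((start, end))}")
print("repeats:", len(spans), "residues covered:", total)
```

Summing the lengths is only meaningful here because these ten ranges do not overlap; overlapping matches from other sources would need to be merged first.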

The following gene(s) are orthologous to this gene:
Gene            Orthologue      Organism                   Block
CsaV3_1G002680  CmaCh11G001190  Cucurbita maxima (Rimu)    cmacucB0112
CsaV3_1G002680  CsGy1G002660    Cucumber (Gy14) v2         cgybcucB002
CsaV3_1G002680  Lsi06G014380    Bottle gourd (USVL1VR-Ls)  cuclsiB081
CsaV3_1G002680  Bhi02G000191    Wax gourd                  cucwgoB125
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene            Organism                      Block
CsaV3_1G002680  Cucumber (Chinese Long) v3    cuccucB000
CsaV3_1G002680  Cucumber (Chinese Long) v3    cuccucB035
CsaV3_1G002680  Silver-seed gourd             carcucB0059
CsaV3_1G002680  Silver-seed gourd             carcucB0419
CsaV3_1G002680  Silver-seed gourd             carcucB0496
CsaV3_1G002680  Melon (DHL92) v3.5.1          cucmeB053
CsaV3_1G002680  Cucumber (Gy14) v2            cgybcucB003
CsaV3_1G002680  Cucumber (Gy14) v1            cgycucB144
CsaV3_1G002680  Cucumber (Gy14) v1            cgycucB314
CsaV3_1G002680  Cucumber (Gy14) v1            cgycucB435
CsaV3_1G002680  Cucurbita maxima (Rimu)       cmacucB0058
CsaV3_1G002680  Cucurbita maxima (Rimu)       cmacucB0252
CsaV3_1G002680  Cucurbita maxima (Rimu)       cmacucB0482
CsaV3_1G002680  Cucurbita maxima (Rimu)       cmacucB0514
CsaV3_1G002680  Cucurbita maxima (Rimu)       cmacucB0814
CsaV3_1G002680  Cucurbita moschata (Rifu)     cmocucB0242
CsaV3_1G002680  Cucurbita moschata (Rifu)     cmocucB0044
CsaV3_1G002680  Cucurbita moschata (Rifu)     cmocucB0091
CsaV3_1G002680  Cucurbita moschata (Rifu)     cmocucB0502
CsaV3_1G002680  Cucurbita moschata (Rifu)     cmocucB0798
CsaV3_1G002680  Cucurbita pepo (Zucchini)     cpecucB0027
CsaV3_1G002680  Cucurbita pepo (Zucchini)     cpecucB0442
CsaV3_1G002680  Cucurbita pepo (Zucchini)     cpecucB0470
CsaV3_1G002680  Cucurbita pepo (Zucchini)     cpecucB0627
CsaV3_1G002680  Cucurbita pepo (Zucchini)     cpecucB0667
CsaV3_1G002680  Cucurbita pepo (Zucchini)     cpecucB0814
CsaV3_1G002680  Wild cucumber (PI 183967)     cpicucB001
CsaV3_1G002680  Wild cucumber (PI 183967)     cpicucB002
CsaV3_1G002680  Wild cucumber (PI 183967)     cpicucB254
CsaV3_1G002680  Bottle gourd (USVL1VR-Ls)     cuclsiB061
CsaV3_1G002680  Bottle gourd (USVL1VR-Ls)     cuclsiB024
CsaV3_1G002680  Melon (DHL92) v3.5.1          cucmeB005
CsaV3_1G002680  Melon (DHL92) v3.5.1          cucmeB026
CsaV3_1G002680  Melon (DHL92) v3.6.1          cucmedB005
CsaV3_1G002680  Melon (DHL92) v3.6.1          cucmedB024
CsaV3_1G002680  Melon (DHL92) v3.6.1          cucmedB050
CsaV3_1G002680  Watermelon (Charleston Gray)  cucwcgB064
CsaV3_1G002680  Watermelon (Charleston Gray)  cucwcgB070
CsaV3_1G002680  Watermelon (Charleston Gray)  cucwcgB088
CsaV3_1G002680  Watermelon (97103) v1         cucwmB044
CsaV3_1G002680  Watermelon (97103) v1         cucwmB071
CsaV3_1G002680  Watermelon (97103) v1         cucwmB105
CsaV3_1G002680  Watermelon (97103) v2         cucwmbB055
CsaV3_1G002680  Watermelon (97103) v2         cucwmbB059
CsaV3_1G002680  Watermelon (97103) v2         cucwmbB076
CsaV3_1G002680  Wax gourd                     cucwgoB056
CsaV3_1G002680  Wax gourd                     cucwgoB076
CsaV3_1G002680  Wax gourd                     cucwgoB103