CsaV3_1G002680 (gene) Cucumber (Chinese Long) v3

Overview
NameCsaV3_1G002680
Typegene
OrganismCucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
DescriptionPentatricopeptide repeat-containing protein
Locationchr1: 1668697 .. 1670520 (+)
RNA-Seq ExpressionCsaV3_1G002680
SyntenyCsaV3_1G002680
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAATGTATAACCGGCTTCTGCCTTTCTCGTATAGAATTATTCGAAGGTCTCGGGTCCAACAAGAAATTTGTACAATCTCGAACTTGGATTTTTTAGAATCAGAAATGTTGAAATTTGTACACACCCAAGCAATGGATCTTCCGTTTCAGGCAACTAACGGTAGCAAAATTCCTGATTACAATGACGTGCGAAGAGGGCATTTCCTCATGAAACTCATAGACGACTCTGTTTCGCGTAATGGGTTCGAATCTATTGCTCGTATTTTCTCTAAGTATCGTGGTTCTATCAATTCTCAACAGTGTAACTCGATGATCAGGACTTATTTGGATTTGAATAAGCATTTAAATTCTCTGTACATTTTTGCCCTTATGCATAAGTTTAGTATTCTGCCCGATTCATCCACTTTTCCTGCTGTTCTTAAAGCAACTGCGCAGCTATGTGATACTGGAGTTGGAAAAATGATACATGGTATTGTTATTCAGATGGGTTTTATTTGTGATGTCTACACAAGTACCGCTTTAGTTCATCTGTATTGTACTTGTTTGTCTATATCTGATGCTTCTCAGTTGTTCGACGAAATGCCCGAGAGAAATGCAGTTACTTGGAATGCTTTGATTACTGGTTATACTCATAATAGAAAGTTTGTGAAAGCTATCGATGCTTTTCGAGGAATGTTGGCAGATGGGGCTCAACCGAGTGAGAGAACCGTGGTTGTAGTTCTATCGGCTTGTTCTCATTTGGGAGCTTTTAATCAGGGAAAGTGGATCCATGAGTTTATTTATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCTCTTATTGATATGTATGCTAAATGTGGGGCTGTTTATGAGGTCGAGAAGGTCTTCGAAGAAATTAGAGAGAAGAACGTGTATACATGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAAATTTCAAGCCAGATGAGGTTACCTTTCTAGGTGTTTTGTGTGCATGCTGTCACCAAGGTCTGGTAACGGAAGGGCGCTGGCAATTCATGAGCATGAAACAACAGTTTGGACTGCAACCAAGGATAGAGCATTATGGATGTATGGTAGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTAGAGTTGATCCAATCCATGAGCATAGAGCCAGACCCTATCATTTGGAGGGCTTTGCTTTGTGCTTGCAGAGTCCATGGGAATACGAAATTGGGTGAATATATTATCAAAAGACTTATAGAACTAGAACCAAACAATGGGGAAAATTATGTCTTGCTGTCAAATATATACTCAAGAGAACGACGGTGGGCTGAAGTAGGGAAGTTGAGGGGAATGATGAATCTAAGAGGGATCAGAAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAACGTAGTTTATGAGTTTGTTGCATCAAATGACAGAAAACCAGAATTTGAGGCAATATACAAGCAGTTGGACAATTTGATTAAGAAATTGAAAGAAAATGGTTATGTTACAGGCACGGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAGCATTCTGTGATGTACCATAGTGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTAAGAATTTGCTTGGACTGCCACGAGTTTTTCAAGGTTTTATCACTCGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTCCACCATTTTTATGAAGGTTTCTGTTCGTGCCGAGACTATTGGTGA

mRNA sequence

ATGAAAATGTATAACCGGCTTCTGCCTTTCTCGTATAGAATTATTCGAAGGTCTCGGGTCCAACAAGAAATTTGTACAATCTCGAACTTGGATTTTTTAGAATCAGAAATGTTGAAATTTGTACACACCCAAGCAATGGATCTTCCGTTTCAGGCAACTAACGGTAGCAAAATTCCTGATTACAATGACGTGCGAAGAGGGCATTTCCTCATGAAACTCATAGACGACTCTGTTTCGCGTAATGGGTTCGAATCTATTGCTCGTATTTTCTCTAAGTATCGTGGTTCTATCAATTCTCAACAGTGTAACTCGATGATCAGGACTTATTTGGATTTGAATAAGCATTTAAATTCTCTGTACATTTTTGCCCTTATGCATAAGTTTAGTATTCTGCCCGATTCATCCACTTTTCCTGCTGTTCTTAAAGCAACTGCGCAGCTATGTGATACTGGAGTTGGAAAAATGATACATGGTATTGTTATTCAGATGGGTTTTATTTGTGATGTCTACACAAGTACCGCTTTAGTTCATCTGTATTGTACTTGTTTGTCTATATCTGATGCTTCTCAGTTGTTCGACGAAATGCCCGAGAGAAATGCAGTTACTTGGAATGCTTTGATTACTGGTTATACTCATAATAGAAAGTTTGTGAAAGCTATCGATGCTTTTCGAGGAATGTTGGCAGATGGGGCTCAACCGAGTGAGAGAACCGTGGTTGTAGTTCTATCGGCTTGTTCTCATTTGGGAGCTTTTAATCAGGGAAAGTGGATCCATGAGTTTATTTATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCTCTTATTGATATGTATGCTAAATGTGGGGCTGTTTATGAGGTCGAGAAGGTCTTCGAAGAAATTAGAGAGAAGAACGTGTATACATGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAAATTTCAAGCCAGATGAGGTTACCTTTCTAGGTGTTTTGTGTGCATGCTGTCACCAAGGTCTGGTAACGGAAGGGCGCTGGCAATTCATGAGCATGAAACAACAGTTTGGACTGCAACCAAGGATAGAGCATTATGGATGTATGGTAGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTAGAGTTGATCCAATCCATGAGCATAGAGCCAGACCCTATCATTTGGAGGGCTTTGCTTTGTGCTTGCAGAGTCCATGGGAATACGAAATTGGGTGAATATATTATCAAAAGACTTATAGAACTAGAACCAAACAATGGGGAAAATTATGTCTTGCTGTCAAATATATACTCAAGAGAACGACGGTGGGCTGAAGTAGGGAAGTTGAGGGGAATGATGAATCTAAGAGGGATCAGAAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAACGTAGTTTATGAGTTTGTTGCATCAAATGACAGAAAACCAGAATTTGAGGCAATATACAAGCAGTTGGACAATTTGATTAAGAAATTGAAAGAAAATGGTTATGTTACAGGCACGGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAGCATTCTGTGATGTACCATAGTGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTAAGAATTTGCTTGGACTGCCACGAGTTTTTCAAGGTTTTATCACTCGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTCCACCATTTTTATGAAGGTTTCTGTTCGTGCCGAGACTATTGGTGA

Coding sequence (CDS)

ATGAAAATGTATAACCGGCTTCTGCCTTTCTCGTATAGAATTATTCGAAGGTCTCGGGTCCAACAAGAAATTTGTACAATCTCGAACTTGGATTTTTTAGAATCAGAAATGTTGAAATTTGTACACACCCAAGCAATGGATCTTCCGTTTCAGGCAACTAACGGTAGCAAAATTCCTGATTACAATGACGTGCGAAGAGGGCATTTCCTCATGAAACTCATAGACGACTCTGTTTCGCGTAATGGGTTCGAATCTATTGCTCGTATTTTCTCTAAGTATCGTGGTTCTATCAATTCTCAACAGTGTAACTCGATGATCAGGACTTATTTGGATTTGAATAAGCATTTAAATTCTCTGTACATTTTTGCCCTTATGCATAAGTTTAGTATTCTGCCCGATTCATCCACTTTTCCTGCTGTTCTTAAAGCAACTGCGCAGCTATGTGATACTGGAGTTGGAAAAATGATACATGGTATTGTTATTCAGATGGGTTTTATTTGTGATGTCTACACAAGTACCGCTTTAGTTCATCTGTATTGTACTTGTTTGTCTATATCTGATGCTTCTCAGTTGTTCGACGAAATGCCCGAGAGAAATGCAGTTACTTGGAATGCTTTGATTACTGGTTATACTCATAATAGAAAGTTTGTGAAAGCTATCGATGCTTTTCGAGGAATGTTGGCAGATGGGGCTCAACCGAGTGAGAGAACCGTGGTTGTAGTTCTATCGGCTTGTTCTCATTTGGGAGCTTTTAATCAGGGAAAGTGGATCCATGAGTTTATTTATCATAATAGGTTGAGACTGAACGTGTTTGTGGGCACAGCTCTTATTGATATGTATGCTAAATGTGGGGCTGTTTATGAGGTCGAGAAGGTCTTCGAAGAAATTAGAGAGAAGAACGTGTATACATGGAATGTCTTGATTTCTGGATATGCCATGAATGGGCAAGGCGATGCAGCTTTGCAGGCTTTTTCTAGGATGTTGATGGAAAATTTCAAGCCAGATGAGGTTACCTTTCTAGGTGTTTTGTGTGCATGCTGTCACCAAGGTCTGGTAACGGAAGGGCGCTGGCAATTCATGAGCATGAAACAACAGTTTGGACTGCAACCAAGGATAGAGCATTATGGATGTATGGTAGACCTACTTGGTCGAGCGGGATTGTTGGAGGAAGCTCTAGAGTTGATCCAATCCATGAGCATAGAGCCAGACCCTATCATTTGGAGGGCTTTGCTTTGTGCTTGCAGAGTCCATGGGAATACGAAATTGGGTGAATATATTATCAAAAGACTTATAGAACTAGAACCAAACAATGGGGAAAATTATGTCTTGCTGTCAAATATATACTCAAGAGAACGACGGTGGGCTGAAGTAGGGAAGTTGAGGGGAATGATGAATCTAAGAGGGATCAGAAAAGTCCCTGGTTGCAGTTCAATTGAAATAAACAACGTAGTTTATGAGTTTGTTGCATCAAATGACAGAAAACCAGAATTTGAGGCAATATACAAGCAGTTGGACAATTTGATTAAGAAATTGAAAGAAAATGGTTATGTTACAGGCACGGACATGGCTTTATATGATATTGAGAAAGAAGAGAAAGAGCATTCTGTGATGTACCATAGTGAGAAACTTGCTTTAGCATTTGGTCTCTTAAACTCTCCTTTAGATTGCACCCTAAGGATAGTGAAAAATCTAAGAATTTGCTTGGACTGCCACGAGTTTTTCAAGGTTTTATCACTCGTCTATAAAAGATATATTGTTGTGAGAGACAGAAACCGTTTCCACCATTTTTATGAAGGTTTCTGTTCGTGCCGAGACTATTGGTGA

Protein sequence

MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPDYNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW*
Homology
BLAST of CsaV3_1G002680 vs. NCBI nr
Match: XP_004138309.2 (pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus] >KGN63701.1 hypothetical protein Csa_014271 [Cucumis sativus])

HSP 1 Score: 1243.8 bits (3217), Expect = 0.0e+00
Identity = 607/607 (100.00%), Postives = 607/607 (100.00%), Query Frame = 0

Query: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60
           MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD
Sbjct: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60

Query: 61  YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY 120
           YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY
Sbjct: 61  YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY 120

Query: 121 IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC 180
           IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC
Sbjct: 121 IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC 180

Query: 181 TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV 240
           TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV
Sbjct: 181 TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV 240

Query: 241 VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN 300
           VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN
Sbjct: 241 VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN 300

Query: 301 VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM 360
           VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM
Sbjct: 301 VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM 360

Query: 361 SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 420
           SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT
Sbjct: 361 SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 420

Query: 421 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 480
           KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI
Sbjct: 421 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 480

Query: 481 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS 540
           NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS
Sbjct: 481 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS 540

Query: 541 EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF 600
           EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF
Sbjct: 541 EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF 600

Query: 601 CSCRDYW 608
           CSCRDYW
Sbjct: 601 CSCRDYW 607

BLAST of CsaV3_1G002680 vs. NCBI nr
Match: KAA0057970.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 1115.5 bits (2884), Expect = 0.0e+00
Identity = 540/571 (94.57%), Postives = 556/571 (97.37%), Query Frame = 0

Query: 37  MLKFVHTQAMDLPFQATNGSKIPDYNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGS 96
           MLKFVHTQAMDLPFQ TN  K PDYNDVRRGHF+MKLIDDSVS NGFESIARIFSKYRGS
Sbjct: 1   MLKFVHTQAMDLPFQETNDRKTPDYNDVRRGHFVMKLIDDSVSHNGFESIARIFSKYRGS 60

Query: 97  INSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMI 156
           INSQQCNSMIR YLDLNKHLNSLYIFA MHKFSILPD STFPAVLKATAQLCDT VGKMI
Sbjct: 61  INSQQCNSMIRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQLCDTEVGKMI 120

Query: 157 HGIVIQMGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKF 216
           HGIVIQMGFICDVYTSTALVH+Y TCLSISDASQ+FDEM ERNAVTWNALITGYTHNRKF
Sbjct: 121 HGIVIQMGFICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALITGYTHNRKF 180

Query: 217 VKAIDAFRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTAL 276
           ++AIDAFRGMLA GAQPSERTVV+VLSACSHLGA NQGKWIH+FIYHNRLRLNVFVGTAL
Sbjct: 181 MEAIDAFRGMLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLRLNVFVGTAL 240

Query: 277 IDMYAKCGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE 336
           IDMYAKCGAV EVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE
Sbjct: 241 IDMYAKCGAVDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE 300

Query: 337 VTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ 396
           VTFLGVLCACCHQGLVTEGR QFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ
Sbjct: 301 VTFLGVLCACCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ 360

Query: 397 SMSIEPDPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAE 456
           SMS+EPDPIIWRALLCACRVHGNTKLGEYI+KRL+ELEPNNGENYVLLSNIY+RERRWAE
Sbjct: 361 SMSMEPDPIIWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNIYARERRWAE 420

Query: 457 VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGY 516
           VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPE+EAIYKQLDNLIKKLKENGY
Sbjct: 421 VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNLIKKLKENGY 480

Query: 517 VTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 576
           VTGTDMALYD+EKEEKEHS+MYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV
Sbjct: 481 VTGTDMALYDVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 540

Query: 577 LSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW 608
           +SLVYKRYIVVRDRNRFHHF+EGFCSCRDYW
Sbjct: 541 VSLVYKRYIVVRDRNRFHHFFEGFCSCRDYW 571

BLAST of CsaV3_1G002680 vs. NCBI nr
Match: XP_016901378.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis melo])

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 531/562 (94.48%), Postives = 547/562 (97.33%), Query Frame = 0

Query: 46  MDLPFQATNGSKIPDYNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSM 105
           MDLPFQ TN  K PDYNDVRRGHF+MKLIDDSVS NGFESIARIFSKYRGSINSQQCNSM
Sbjct: 1   MDLPFQETNDRKTPDYNDVRRGHFVMKLIDDSVSHNGFESIARIFSKYRGSINSQQCNSM 60

Query: 106 IRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGF 165
           IR YLDLNKHLNSLYIFA MHKFSILPD STFPAVLKATAQLCDT VGKMIHGIVIQMGF
Sbjct: 61  IRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQLCDTEVGKMIHGIVIQMGF 120

Query: 166 ICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRG 225
           ICDVYTSTALVH+Y TCLSISDASQ+FDEM ERNAVTWNALITGYTHNRKF++AIDAFRG
Sbjct: 121 ICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALITGYTHNRKFMEAIDAFRG 180

Query: 226 MLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGA 285
           MLA GAQPSERTVV+VLSACSHLGA NQGKWIH+FIYHNRLRLNVFVGTALIDMYAKCGA
Sbjct: 181 MLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLRLNVFVGTALIDMYAKCGA 240

Query: 286 VYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA 345
           V EVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA
Sbjct: 241 VDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA 300

Query: 346 CCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPI 405
           CCHQGLVTEGR QFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMS+EPDPI
Sbjct: 301 CCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSMEPDPI 360

Query: 406 IWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMN 465
           IWRALLCACRVHGNTKLGEYI+KRL+ELEPNNGENYVLLSNIY+RERRWAEVGKLRGMMN
Sbjct: 361 IWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNIYARERRWAEVGKLRGMMN 420

Query: 466 LRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALY 525
           LRGIRKVPGCSSIEINNVVYEFVASNDRKPE+EAIYKQLDNLIKKLKENGYVTGTDMALY
Sbjct: 421 LRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNLIKKLKENGYVTGTDMALY 480

Query: 526 DIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYI 585
           D+EKEEKEHS+MYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV+SLVYKRYI
Sbjct: 481 DVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYI 540

Query: 586 VVRDRNRFHHFYEGFCSCRDYW 608
           VVRDRNRFHHF+EGFCSCRDYW
Sbjct: 541 VVRDRNRFHHFFEGFCSCRDYW 562

BLAST of CsaV3_1G002680 vs. NCBI nr
Match: XP_038878567.1 (pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida])

HSP 1 Score: 1092.8 bits (2825), Expect = 0.0e+00
Identity = 543/616 (88.15%), Postives = 568/616 (92.21%), Query Frame = 0

Query: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60
           MKMY RLLP S  II+RSR+ QEICTI N   LESEM KFVHTQAMDLP   TN  KIPD
Sbjct: 1   MKMYFRLLPLSCGIIQRSRL-QEICTILNSVILESEMSKFVHTQAMDLPPPRTNERKIPD 60

Query: 61  Y--------NDVRR-GHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLD 120
           Y        NDVRR G+FLMKLIDDSVS NGFESIA IFSK+R SINSQ CNSMIR YLD
Sbjct: 61  YKDALHKEGNDVRRDGYFLMKLIDDSVSHNGFESIALIFSKFRSSINSQLCNSMIRGYLD 120

Query: 121 LNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYT 180
           LNKHLNSLYIFA MHKFSILPDSSTFPAVLKATAQLCDT VGKMIHG VIQMGFI DVYT
Sbjct: 121 LNKHLNSLYIFAHMHKFSILPDSSTFPAVLKATAQLCDTEVGKMIHGTVIQMGFIHDVYT 180

Query: 181 STALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGA 240
           STALVH+YC CLSISDAS++FDEMPERNAVTWNALITGYTHNRKF++AI+AFRGMLA GA
Sbjct: 181 STALVHMYCACLSISDASRVFDEMPERNAVTWNALITGYTHNRKFMEAINAFRGMLAAGA 240

Query: 241 QPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEK 300
           +PSERT+VVVLSACSHLGA NQGKW+HEFIYHNRLRLNVFVGTALIDMYAKCGAV E EK
Sbjct: 241 EPSERTMVVVLSACSHLGALNQGKWVHEFIYHNRLRLNVFVGTALIDMYAKCGAVDEAEK 300

Query: 301 VFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGL 360
           VFEEIREKNVYTWNVLISGYAMNGQGDAAL AFSRMLMENFKPDEVTFLG+LCACCHQGL
Sbjct: 301 VFEEIREKNVYTWNVLISGYAMNGQGDAALAAFSRMLMENFKPDEVTFLGILCACCHQGL 360

Query: 361 VTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALL 420
           VTEGR QFMSMKQ FGLQP+IEHYGCMVDLLGRAG L+EALELIQSMS+EPDPIIWRALL
Sbjct: 361 VTEGRRQFMSMKQHFGLQPKIEHYGCMVDLLGRAGFLDEALELIQSMSMEPDPIIWRALL 420

Query: 421 CACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRK 480
           CACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+YSRE+RWAEVGKLRGMM+LRGI K
Sbjct: 421 CACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSREQRWAEVGKLRGMMSLRGIGK 480

Query: 481 VPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEE 540
           VPGCSSIEINNVVYEF ASNDRKPEFEAIYKQLDNL +KLKENGYVTGTDMALYDIEKEE
Sbjct: 481 VPGCSSIEINNVVYEFAASNDRKPEFEAIYKQLDNLSEKLKENGYVTGTDMALYDIEKEE 540

Query: 541 KEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRN 600
           KEHSVMYHSEKLALAFGLLNSPL CTLRIVKNLRICLDCHEFFKV+S+VY+RYIVVRDRN
Sbjct: 541 KEHSVMYHSEKLALAFGLLNSPLGCTLRIVKNLRICLDCHEFFKVVSIVYQRYIVVRDRN 600

Query: 601 RFHHFYEGFCSCRDYW 608
           RFHHF EGFCSCRDYW
Sbjct: 601 RFHHFSEGFCSCRDYW 615

BLAST of CsaV3_1G002680 vs. NCBI nr
Match: XP_023529316.1 (pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1054.7 bits (2726), Expect = 3.1e-304
Identity = 521/618 (84.30%), Postives = 561/618 (90.78%), Query Frame = 0

Query: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFL--ESEMLKFVHTQAMDLPFQATNGSKI 60
           MKM  R LPFS+R+IRR+R+ Q+ CTISNLDFL  +S++ +FVHT+ M+LP Q     KI
Sbjct: 1   MKMDLRFLPFSFRLIRRARL-QDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKI 60

Query: 61  PDYNDVRR---------GHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTY 120
           PD  D RR         G+FLMKLI+DSVS NGFESIA IFSK+RGSINSQ CNSMIR Y
Sbjct: 61  PDCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGY 120

Query: 121 LDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDV 180
           LDLN+HLNSL IFA MHKFSILPDSSTFPAVLKATAQLCD  +GKMIHG V+QMGFI DV
Sbjct: 121 LDLNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDV 180

Query: 181 YTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLAD 240
           YTSTALVH+YC+CLSISDASQLFDEMPERN+VTWNALITGYTHNRKF +AI+AFRGMLA 
Sbjct: 181 YTSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFKEAINAFRGMLAA 240

Query: 241 GAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEV 300
           GA+PSERTVVVVLSACSHLGA NQGKWIH+FIY N+LRLNVFVGTALIDMYAKCG V E 
Sbjct: 241 GAEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEA 300

Query: 301 EKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQ 360
           EKVFEEIR+KNVYTWNVLISGY MNGQGDAALQAFSRMLMENFKPD VTFLG+LCACCHQ
Sbjct: 301 EKVFEEIRDKNVYTWNVLISGYGMNGQGDAALQAFSRMLMENFKPDAVTFLGLLCACCHQ 360

Query: 361 GLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRA 420
           GLVTEGR QF+SMKQQFGLQP+IEHYGCMVDLLGRAGLLEEALELI+SMS+EPDPIIWRA
Sbjct: 361 GLVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRA 420

Query: 421 LLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGI 480
           LLCACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+YSRERRW EVGKLRGMM+LRGI
Sbjct: 421 LLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGI 480

Query: 481 RKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEK 540
            KVPGCSSIEINN V+EF ASNDRK EF AIYKQLDN++KKLKENGYVTGTDM+L+DIEK
Sbjct: 481 EKVPGCSSIEINNSVHEFTASNDRKLEFNAIYKQLDNVMKKLKENGYVTGTDMSLFDIEK 540

Query: 541 EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRD 600
           EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKV+SLVYKRYIVVRD
Sbjct: 541 EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRD 600

Query: 601 RNRFHHFYEGFCSCRDYW 608
           RNRFHHF EG CSCRDYW
Sbjct: 601 RNRFHHFSEGVCSCRDYW 617

BLAST of CsaV3_1G002680 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 464.2 bits (1193), Expect = 2.3e-129
Identity = 237/574 (41.29%), Postives = 355/574 (61.85%), Query Frame = 0

Query: 40  FVHTQAMDLPFQ---ATNGSKIPDYNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGS 99
           +VHT  + +  Q     +  K+ D +  R       LI    SR   E+  ++F +    
Sbjct: 170 YVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVK 229

Query: 100 INSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMI 159
            +    N+MI  Y +   +  +L +F  M K ++ PD ST   V+ A AQ     +G+ +
Sbjct: 230 -DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQV 289

Query: 160 HGIVIQMGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKF 219
           H  +   GF  ++    AL+ LY  C  +  A  LF+ +P ++ ++WN LI GYTH   +
Sbjct: 290 HLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY 349

Query: 220 VKAIDAFRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLR--LNV-FVG 279
            +A+  F+ ML  G  P++ T++ +L AC+HLGA + G+WIH +I   RL+   N   + 
Sbjct: 350 KEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI-DKRLKGVTNASSLR 409

Query: 280 TALIDMYAKCGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFK 339
           T+LIDMYAKCG +    +VF  I  K++ +WN +I G+AM+G+ DA+   FSRM     +
Sbjct: 410 TSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQ 469

Query: 340 PDEVTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALE 399
           PD++TF+G+L AC H G++  GR  F +M Q + + P++EHYGCM+DLLG +GL +EA E
Sbjct: 470 PDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEE 529

Query: 400 LIQSMSIEPDPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERR 459
           +I  M +EPD +IW +LL AC++HGN +LGE   + LI++EP N  +YVLLSNIY+   R
Sbjct: 530 MINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGR 589

Query: 460 WAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKE 519
           W EV K R ++N +G++KVPGCSSIEI++VV+EF+  +   P    IY  L+ +   L++
Sbjct: 590 WNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEK 649

Query: 520 NGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEF 579
            G+V  T   L ++E+E KE ++ +HSEKLA+AFGL+++     L IVKNLR+C +CHE 
Sbjct: 650 AGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEA 709

Query: 580 FKVLSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW 608
            K++S +YKR I+ RDR RFHHF +G CSC DYW
Sbjct: 710 TKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CsaV3_1G002680 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 461.5 bits (1186), Expect = 1.5e-128
Identity = 219/522 (41.95%), Postives = 343/522 (65.71%), Query Frame = 0

Query: 88  RIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSIL-PDSSTFPAVLKATAQ 147
           ++FSK    IN    N++IR Y ++   +++  ++  M    ++ PD+ T+P ++KA   
Sbjct: 74  KVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTT 133

Query: 148 LCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNAL 207
           + D  +G+ IH +VI+ GF   +Y   +L+HLY  C  ++ A ++FD+MPE++ V WN++
Sbjct: 134 MADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSV 193

Query: 208 ITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRL 267
           I G+  N K  +A+  +  M + G +P   T+V +LSAC+ +GA   GK +H ++    L
Sbjct: 194 INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 253

Query: 268 RLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSR 327
             N+     L+D+YA+CG V E + +F+E+ +KN  +W  LI G A+NG G  A++ F  
Sbjct: 254 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 313

Query: 328 M-LMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRA 387
           M   E   P E+TF+G+L AC H G+V EG   F  M++++ ++PRIEH+GCMVDLL RA
Sbjct: 314 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 373

Query: 388 GLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLS 447
           G +++A E I+SM ++P+ +IWR LL AC VHG++ L E+   ++++LEPN+  +YVLLS
Sbjct: 374 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 433

Query: 448 NIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLD 507
           N+Y+ E+RW++V K+R  M   G++KVPG S +E+ N V+EF+  +   P+ +AIY +L 
Sbjct: 434 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 493

Query: 508 NLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLR 567
            +  +L+  GYV        D+E+EEKE++V+YHSEK+A+AF L+++P    + +VKNLR
Sbjct: 494 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 553

Query: 568 ICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW 608
           +C DCH   K++S VY R IVVRDR+RFHHF  G CSC+DYW
Sbjct: 554 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CsaV3_1G002680 vs. ExPASy Swiss-Prot
Match: Q8LK93 (Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H26 PE=2 SV=2)

HSP 1 Score: 448.4 bits (1152), Expect = 1.3e-124
Identity = 220/506 (43.48%), Postives = 324/506 (64.03%), Query Frame = 0

Query: 103 NSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQ 162
           NSM R Y      L    +F  + +  ILPD+ TFP++LKA A       G+ +H + ++
Sbjct: 98  NSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMK 157

Query: 163 MGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDA 222
           +G   +VY    L+++Y  C  +  A  +FD + E   V +NA+ITGY    +  +A+  
Sbjct: 158 LGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSL 217

Query: 223 FRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAK 282
           FR M     +P+E T++ VLS+C+ LG+ + GKWIH++   +     V V TALIDM+AK
Sbjct: 218 FREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAK 277

Query: 283 CGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGV 342
           CG++ +   +FE++R K+   W+ +I  YA +G+ + ++  F RM  EN +PDE+TFLG+
Sbjct: 278 CGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGL 337

Query: 343 LCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEP 402
           L AC H G V EGR  F  M  +FG+ P I+HYG MVDLL RAG LE+A E I  + I P
Sbjct: 338 LNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISP 397

Query: 403 DPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRG 462
            P++WR LL AC  H N  L E + +R+ EL+ ++G +YV+LSN+Y+R ++W  V  LR 
Sbjct: 398 TPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRK 457

Query: 463 MMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDM 522
           +M  R   KVPGCSSIE+NNVV+EF + +  K     +++ LD ++K+LK +GYV  T M
Sbjct: 458 VMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSM 517

Query: 523 ALY-DIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVY 582
            ++ ++  +EKE ++ YHSEKLA+ FGLLN+P   T+R+VKNLR+C DCH   K++SL++
Sbjct: 518 VVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIF 577

Query: 583 KRYIVVRDRNRFHHFYEGFCSCRDYW 608
            R +V+RD  RFHHF +G CSC D+W
Sbjct: 578 GRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CsaV3_1G002680 vs. ExPASy Swiss-Prot
Match: Q683I9 (Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H82 PE=2 SV=1)

HSP 1 Score: 435.3 bits (1118), Expect = 1.2e-120
Identity = 228/550 (41.45%), Postives = 328/550 (59.64%), Query Frame = 0

Query: 96  SINSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKM 155
           +++S Q +S I  YL +  H              + PD  TFP +L +        +G+ 
Sbjct: 37  NVSSPQRHSPISVYLRMRNH-------------RVSPDFHTFPFLLPSFHNPLHLPLGQR 96

Query: 156 IHGIVIQMGFICDVYTSTALVHLYCTCLS------------------------------- 215
            H  ++  G   D +  T+L+++Y +C                                 
Sbjct: 97  THAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGSKDLPAWNSVVNAYAKAGL 156

Query: 216 ISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADG-----AQPSERTVV 275
           I DA +LFDEMPERN ++W+ LI GY    K+ +A+D FR M          +P+E T+ 
Sbjct: 157 IDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFTMS 216

Query: 276 VVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEI-RE 335
            VLSAC  LGA  QGKW+H +I    + +++ +GTALIDMYAKCG++   ++VF  +  +
Sbjct: 217 TVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLERAKRVFNALGSK 276

Query: 336 KNVYTWNVLISGYAMNGQGDAALQAFSRM-LMENFKPDEVTFLGVLCACCHQGLVTEGRW 395
           K+V  ++ +I   AM G  D   Q FS M   +N  P+ VTF+G+L AC H+GL+ EG+ 
Sbjct: 277 KDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGACVHRGLINEGKS 336

Query: 396 QFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVH 455
            F  M ++FG+ P I+HYGCMVDL GR+GL++EA   I SM +EPD +IW +LL   R+ 
Sbjct: 337 YFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVLIWGSLLSGSRML 396

Query: 456 GNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSS 515
           G+ K  E  +KRLIEL+P N   YVLLSN+Y++  RW EV  +R  M ++GI KVPGCS 
Sbjct: 397 GDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEMEVKGINKVPGCSY 456

Query: 516 IEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVM 575
           +E+  VV+EFV  ++ + E E IY  LD ++++L+E GYVT T   L D+ +++KE ++ 
Sbjct: 457 VEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVTDTKEVLLDLNEKDKEIALS 516

Query: 576 YHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFY 608
           YHSEKLA+AF L+ +     +RI+KNLRIC DCH   K++S ++ R IVVRD NRFHHF 
Sbjct: 517 YHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMISKLFSREIVVRDCNRFHHFR 573

BLAST of CsaV3_1G002680 vs. ExPASy Swiss-Prot
Match: Q9SN85 (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 428.3 bits (1100), Expect = 1.4e-118
Identity = 231/548 (42.15%), Postives = 331/548 (60.40%), Query Frame = 0

Query: 68  HFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHK 127
           HFL +L    + R+      R+FS+ R +     CN+MIR +           +F  + +
Sbjct: 48  HFLSRLALSLIPRD-INYSCRVFSQ-RLNPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRR 107

Query: 128 FSILPD---SSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLS 187
            S LP    SS+F   LK   +  D   G  IHG +   GF+ D    T L+ LY TC +
Sbjct: 108 NSSLPANPLSSSF--ALKCCIKSGDLLGGLQIHGKIFSDGFLSDSLLMTTLMDLYSTCEN 167

Query: 188 ISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLAD---GAQPSERTVVVV 247
            +DA ++FDE+P+R+ V+WN L + Y  N++    +  F  M  D     +P   T ++ 
Sbjct: 168 STDACKVFDEIPKRDTVSWNVLFSCYLRNKRTRDVLVLFDKMKNDVDGCVKPDGVTCLLA 227

Query: 248 LSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKNV 307
           L AC++LGA + GK +H+FI  N L   + +   L+ MY++CG++ +  +VF  +RE+NV
Sbjct: 228 LQACANLGALDFGKQVHDFIDENGLSGALNLSNTLVSMYSRCGSMDKAYQVFYGMRERNV 287

Query: 308 YTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMS 367
            +W  LISG AMNG G  A++AF+ ML     P+E T  G+L AC H GLV EG   F  
Sbjct: 288 VSWTALISGLAMNGFGKEAIEAFNEMLKFGISPEEQTLTGLLSACSHSGLVAEGMMFFDR 347

Query: 368 MKQ-QFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 427
           M+  +F ++P + HYGC+VDLLGRA LL++A  LI+SM ++PD  IWR LL ACRVHG+ 
Sbjct: 348 MRSGEFKIKPNLHHYGCVVDLLGRARLLDKAYSLIKSMEMKPDSTIWRTLLGACRVHGDV 407

Query: 428 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 487
           +LGE +I  LIEL+     +YVLL N YS   +W +V +LR +M  + I   PGCS+IE+
Sbjct: 408 ELGERVISHLIELKAEEAGDYVLLLNTYSTVGKWEKVTELRSLMKEKRIHTKPGCSAIEL 467

Query: 488 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIE-KEEKEHSVMYH 547
              V+EF+  +   P  E IYK L  + ++LK  GYV      L+++E +EEK +++ YH
Sbjct: 468 QGTVHEFIVDDVSHPRKEEIYKMLAEINQQLKIAGYVAEITSELHNLESEEEKGYALRYH 527

Query: 548 SEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEG 607
           SEKLA+AFG+L +P   T+R+ KNLR C+DCH F K +S VY R ++VRDR+RFHHF  G
Sbjct: 528 SEKLAIAFGILVTPPGTTIRVTKNLRTCVDCHNFAKFVSDVYDRIVIVRDRSRFHHFKGG 587

BLAST of CsaV3_1G002680 vs. ExPASy TrEMBL
Match: A0A0A0LUK8 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G011530 PE=3 SV=1)

HSP 1 Score: 1243.8 bits (3217), Expect = 0.0e+00
Identity = 607/607 (100.00%), Postives = 607/607 (100.00%), Query Frame = 0

Query: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60
           MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD
Sbjct: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFLESEMLKFVHTQAMDLPFQATNGSKIPD 60

Query: 61  YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY 120
           YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY
Sbjct: 61  YNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLY 120

Query: 121 IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC 180
           IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC
Sbjct: 121 IFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYC 180

Query: 181 TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV 240
           TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV
Sbjct: 181 TCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVV 240

Query: 241 VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN 300
           VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN
Sbjct: 241 VLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKN 300

Query: 301 VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM 360
           VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM
Sbjct: 301 VYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEGRWQFM 360

Query: 361 SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 420
           SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT
Sbjct: 361 SMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVHGNT 420

Query: 421 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 480
           KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI
Sbjct: 421 KLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEI 480

Query: 481 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS 540
           NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS
Sbjct: 481 NNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHS 540

Query: 541 EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF 600
           EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF
Sbjct: 541 EKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGF 600

Query: 601 CSCRDYW 608
           CSCRDYW
Sbjct: 601 CSCRDYW 607

BLAST of CsaV3_1G002680 vs. ExPASy TrEMBL
Match: A0A5A7US40 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G002740 PE=3 SV=1)

HSP 1 Score: 1115.5 bits (2884), Expect = 0.0e+00
Identity = 540/571 (94.57%), Postives = 556/571 (97.37%), Query Frame = 0

Query: 37  MLKFVHTQAMDLPFQATNGSKIPDYNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGS 96
           MLKFVHTQAMDLPFQ TN  K PDYNDVRRGHF+MKLIDDSVS NGFESIARIFSKYRGS
Sbjct: 1   MLKFVHTQAMDLPFQETNDRKTPDYNDVRRGHFVMKLIDDSVSHNGFESIARIFSKYRGS 60

Query: 97  INSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMI 156
           INSQQCNSMIR YLDLNKHLNSLYIFA MHKFSILPD STFPAVLKATAQLCDT VGKMI
Sbjct: 61  INSQQCNSMIRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQLCDTEVGKMI 120

Query: 157 HGIVIQMGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKF 216
           HGIVIQMGFICDVYTSTALVH+Y TCLSISDASQ+FDEM ERNAVTWNALITGYTHNRKF
Sbjct: 121 HGIVIQMGFICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALITGYTHNRKF 180

Query: 217 VKAIDAFRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTAL 276
           ++AIDAFRGMLA GAQPSERTVV+VLSACSHLGA NQGKWIH+FIYHNRLRLNVFVGTAL
Sbjct: 181 MEAIDAFRGMLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLRLNVFVGTAL 240

Query: 277 IDMYAKCGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE 336
           IDMYAKCGAV EVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE
Sbjct: 241 IDMYAKCGAVDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDE 300

Query: 337 VTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ 396
           VTFLGVLCACCHQGLVTEGR QFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ
Sbjct: 301 VTFLGVLCACCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQ 360

Query: 397 SMSIEPDPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAE 456
           SMS+EPDPIIWRALLCACRVHGNTKLGEYI+KRL+ELEPNNGENYVLLSNIY+RERRWAE
Sbjct: 361 SMSMEPDPIIWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNIYARERRWAE 420

Query: 457 VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGY 516
           VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPE+EAIYKQLDNLIKKLKENGY
Sbjct: 421 VGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNLIKKLKENGY 480

Query: 517 VTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 576
           VTGTDMALYD+EKEEKEHS+MYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV
Sbjct: 481 VTGTDMALYDVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV 540

Query: 577 LSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW 608
           +SLVYKRYIVVRDRNRFHHF+EGFCSCRDYW
Sbjct: 541 VSLVYKRYIVVRDRNRFHHFFEGFCSCRDYW 571

BLAST of CsaV3_1G002680 vs. ExPASy TrEMBL
Match: A0A1S4DZH3 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=3656 GN=LOC103494017 PE=3 SV=1)

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 531/562 (94.48%), Postives = 547/562 (97.33%), Query Frame = 0

Query: 46  MDLPFQATNGSKIPDYNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSM 105
           MDLPFQ TN  K PDYNDVRRGHF+MKLIDDSVS NGFESIARIFSKYRGSINSQQCNSM
Sbjct: 1   MDLPFQETNDRKTPDYNDVRRGHFVMKLIDDSVSHNGFESIARIFSKYRGSINSQQCNSM 60

Query: 106 IRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGF 165
           IR YLDLNKHLNSLYIFA MHKFSILPD STFPAVLKATAQLCDT VGKMIHGIVIQMGF
Sbjct: 61  IRRYLDLNKHLNSLYIFAQMHKFSILPDLSTFPAVLKATAQLCDTEVGKMIHGIVIQMGF 120

Query: 166 ICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRG 225
           ICDVYTSTALVH+Y TCLSISDASQ+FDEM ERNAVTWNALITGYTHNRKF++AIDAFRG
Sbjct: 121 ICDVYTSTALVHMYSTCLSISDASQVFDEMAERNAVTWNALITGYTHNRKFMEAIDAFRG 180

Query: 226 MLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGA 285
           MLA GAQPSERTVV+VLSACSHLGA NQGKWIH+FIYHNRLRLNVFVGTALIDMYAKCGA
Sbjct: 181 MLAAGAQPSERTVVLVLSACSHLGALNQGKWIHDFIYHNRLRLNVFVGTALIDMYAKCGA 240

Query: 286 VYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA 345
           V EVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA
Sbjct: 241 VDEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCA 300

Query: 346 CCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPI 405
           CCHQGLVTEGR QFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMS+EPDPI
Sbjct: 301 CCHQGLVTEGRRQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSMEPDPI 360

Query: 406 IWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMN 465
           IWRALLCACRVHGNTKLGEYI+KRL+ELEPNNGENYVLLSNIY+RERRWAEVGKLRGMMN
Sbjct: 361 IWRALLCACRVHGNTKLGEYIMKRLVELEPNNGENYVLLSNIYARERRWAEVGKLRGMMN 420

Query: 466 LRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALY 525
           LRGIRKVPGCSSIEINNVVYEFVASNDRKPE+EAIYKQLDNLIKKLKENGYVTGTDMALY
Sbjct: 421 LRGIRKVPGCSSIEINNVVYEFVASNDRKPEYEAIYKQLDNLIKKLKENGYVTGTDMALY 480

Query: 526 DIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYI 585
           D+EKEEKEHS+MYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKV+SLVYKRYI
Sbjct: 481 DVEKEEKEHSLMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVVSLVYKRYI 540

Query: 586 VVRDRNRFHHFYEGFCSCRDYW 608
           VVRDRNRFHHF+EGFCSCRDYW
Sbjct: 541 VVRDRNRFHHFFEGFCSCRDYW 562

BLAST of CsaV3_1G002680 vs. ExPASy TrEMBL
Match: A0A6J1EAY2 (pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita moschata OX=3662 GN=LOC111432395 PE=3 SV=1)

HSP 1 Score: 1050.0 bits (2714), Expect = 3.7e-303
Identity = 518/618 (83.82%), Postives = 560/618 (90.61%), Query Frame = 0

Query: 1   MKMYNRLLPFSYRIIRRSRVQQEICTISNLDFL--ESEMLKFVHTQAMDLPFQATNGSKI 60
           MKM  RLLPFS+R+IRR+R+ Q+ CTISNLDFL  +S++ +FVHT+ M+LP Q     KI
Sbjct: 1   MKMDLRLLPFSFRLIRRARL-QDTCTISNLDFLANQSQISRFVHTRVMNLPSQGGIERKI 60

Query: 61  PDYNDVRR---------GHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTY 120
           PD  D RR         G+FLMKLI+DSVS NGFESIA IFSK+RGSINSQ CNSMIR Y
Sbjct: 61  PDCLDARRKEGNDMRSDGYFLMKLIEDSVSNNGFESIALIFSKFRGSINSQICNSMIRGY 120

Query: 121 LDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDV 180
           LD N+HLNSL IFA MHKFSILPDSSTFPAVLKATAQLCD  +GKMIHG V+QMGFI DV
Sbjct: 121 LDSNEHLNSLIIFAHMHKFSILPDSSTFPAVLKATAQLCDIKLGKMIHGAVVQMGFIRDV 180

Query: 181 YTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLAD 240
           YTSTALVH+YC+CLSISDASQLFDEMPERN+VTWNALITGYTHNRKF +AI+AFRGMLA 
Sbjct: 181 YTSTALVHMYCSCLSISDASQLFDEMPERNSVTWNALITGYTHNRKFREAINAFRGMLAA 240

Query: 241 GAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEV 300
           GA+PSERTVVVVLSACSHLGA NQGKWIH+FIY N+LRLNVFVGTALIDMYAKCG V E 
Sbjct: 241 GAEPSERTVVVVLSACSHLGALNQGKWIHDFIYQNKLRLNVFVGTALIDMYAKCGVVEEA 300

Query: 301 EKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQ 360
           EKVFEEIR++NVYTWNVLISGY MNGQG+AALQ FSRMLMENFKPD VTFLG+LCACCHQ
Sbjct: 301 EKVFEEIRDRNVYTWNVLISGYGMNGQGNAALQVFSRMLMENFKPDAVTFLGLLCACCHQ 360

Query: 361 GLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRA 420
           GLVTEGR QF+SMKQQFGLQP+IEHYGCMVDLLGRAGLLEEALELI+SMS+EPDPIIWRA
Sbjct: 361 GLVTEGRRQFISMKQQFGLQPKIEHYGCMVDLLGRAGLLEEALELIESMSMEPDPIIWRA 420

Query: 421 LLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGI 480
           LLCACRVHGNTKLGEY I+RLIELEPNNGENYVLLSN+YSRERRW EVGKLRGMM+LRGI
Sbjct: 421 LLCACRVHGNTKLGEYTIRRLIELEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGI 480

Query: 481 RKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEK 540
            KVPGCSSIEINN V+EF ASNDRK EF AIYKQLDN++KKLKENGYVTGTDM+L+DIEK
Sbjct: 481 EKVPGCSSIEINNAVHEFTASNDRKREFSAIYKQLDNVMKKLKENGYVTGTDMSLFDIEK 540

Query: 541 EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRD 600
           EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRIC DCHEFFKV+SLVYKRYIVVRD
Sbjct: 541 EEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICSDCHEFFKVVSLVYKRYIVVRD 600

Query: 601 RNRFHHFYEGFCSCRDYW 608
           RNRFHHF EG CSCRDYW
Sbjct: 601 RNRFHHFSEGVCSCRDYW 617

BLAST of CsaV3_1G002680 vs. ExPASy TrEMBL
Match: A0A6J1BZP4 (pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charantia OX=3673 GN=LOC111006955 PE=3 SV=1)

HSP 1 Score: 1028.5 bits (2658), Expect = 1.1e-296
Identity = 510/612 (83.33%), Postives = 550/612 (89.87%), Query Frame = 0

Query: 8   LPFSYRIIRRSRVQQEICTISNLDFL--ESEMLKFVHTQ-AMDLPFQATNGSKIPDYNDV 67
           +  S+R+IRR+R+ Q+ICTISN  FL  +S++ KF+HTQ  M+LP Q+TN  KIPDY DV
Sbjct: 43  IEMSFRLIRRARL-QDICTISNSAFLANQSQISKFMHTQLTMNLPPQSTNERKIPDYMDV 102

Query: 68  RR---------GHFLMKLIDDSVSRNGFESIARIFSKYRGSINSQQCNSMIRTYLDLNKH 127
            R         G+FLMKLIDDSVS +GFESIA IFSK+RG IN Q CN MIR YLD NKH
Sbjct: 103 VRKEGNDMRSDGYFLMKLIDDSVSHDGFESIAPIFSKFRGVINCQLCNWMIRGYLDSNKH 162

Query: 128 LNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQMGFICDVYTSTAL 187
           LNSL IFA MHKFSILPDSSTFPAV+KATA+ C+  +GKMIHG VIQMGFI DVYTSTAL
Sbjct: 163 LNSLLIFAHMHKFSILPDSSTFPAVIKATARSCNVELGKMIHGTVIQMGFIRDVYTSTAL 222

Query: 188 VHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADGAQPSE 247
           VH+YCTCLSISDA QLFDEMPERN+VTWNALITGYTHNRKF++AI+AFRGMLA GA+PSE
Sbjct: 223 VHMYCTCLSISDAYQLFDEMPERNSVTWNALITGYTHNRKFMEAINAFRGMLAAGAEPSE 282

Query: 248 RTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEE 307
           RTVVVVLSACSHLGA NQG WIHEFIY N+LRLNVFVGTALIDMYAKCGAV E EKVFEE
Sbjct: 283 RTVVVVLSACSHLGALNQGTWIHEFIYQNKLRLNVFVGTALIDMYAKCGAVEEAEKVFEE 342

Query: 308 IREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGVLCACCHQGLVTEG 367
           IREKNVYTWNVLISGYAMNGQGD ALQAFS ML ENFKPDEVTFLGVLCACCHQGLVTEG
Sbjct: 343 IREKNVYTWNVLISGYAMNGQGDEALQAFSMMLRENFKPDEVTFLGVLCACCHQGLVTEG 402

Query: 368 RWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACR 427
           R QF+SMKQ FGL+PRIEHYGCMVDLLGRAGLLEEALELIQSMS+EPDPIIWRALLCACR
Sbjct: 403 RRQFVSMKQHFGLRPRIEHYGCMVDLLGRAGLLEEALELIQSMSMEPDPIIWRALLCACR 462

Query: 428 VHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGC 487
           VHGNTKLGEY I+RLI+LEPNNGENYVLLSN+YSRERRW EVGKLRGMM+LRGI KVPGC
Sbjct: 463 VHGNTKLGEYAIRRLIDLEPNNGENYVLLSNLYSRERRWIEVGKLRGMMSLRGIGKVPGC 522

Query: 488 SSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHS 547
           SSIEI NVVYEF ASNDRKPEF+AIYKQLDN+I+KLK NGY+TGT MAL+DIE+EEKEH 
Sbjct: 523 SSIEIKNVVYEFAASNDRKPEFDAIYKQLDNVIEKLKANGYITGTGMALFDIEEEEKEHC 582

Query: 548 VMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHH 607
           VMYHSEKLALAFGLLNSPLDC LRIVKNLRICLDCHEFFKV SLVYKR+IVVRDRNRFHH
Sbjct: 583 VMYHSEKLALAFGLLNSPLDCALRIVKNLRICLDCHEFFKVASLVYKRFIVVRDRNRFHH 642

BLAST of CsaV3_1G002680 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 464.2 bits (1193), Expect = 1.7e-130
Identity = 237/574 (41.29%), Postives = 355/574 (61.85%), Query Frame = 0

Query: 40  FVHTQAMDLPFQ---ATNGSKIPDYNDVRRGHFLMKLIDDSVSRNGFESIARIFSKYRGS 99
           +VHT  + +  Q     +  K+ D +  R       LI    SR   E+  ++F +    
Sbjct: 170 YVHTSLISMYVQNGRLEDAHKVFDKSPHRDVVSYTALIKGYASRGYIENAQKLFDEIPVK 229

Query: 100 INSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMI 159
            +    N+MI  Y +   +  +L +F  M K ++ PD ST   V+ A AQ     +G+ +
Sbjct: 230 -DVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQSGSIELGRQV 289

Query: 160 HGIVIQMGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKF 219
           H  +   GF  ++    AL+ LY  C  +  A  LF+ +P ++ ++WN LI GYTH   +
Sbjct: 290 HLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTLIGGYTHMNLY 349

Query: 220 VKAIDAFRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLR--LNV-FVG 279
            +A+  F+ ML  G  P++ T++ +L AC+HLGA + G+WIH +I   RL+   N   + 
Sbjct: 350 KEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYI-DKRLKGVTNASSLR 409

Query: 280 TALIDMYAKCGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFK 339
           T+LIDMYAKCG +    +VF  I  K++ +WN +I G+AM+G+ DA+   FSRM     +
Sbjct: 410 TSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQ 469

Query: 340 PDEVTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALE 399
           PD++TF+G+L AC H G++  GR  F +M Q + + P++EHYGCM+DLLG +GL +EA E
Sbjct: 470 PDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEE 529

Query: 400 LIQSMSIEPDPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERR 459
           +I  M +EPD +IW +LL AC++HGN +LGE   + LI++EP N  +YVLLSNIY+   R
Sbjct: 530 MINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGR 589

Query: 460 WAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKE 519
           W EV K R ++N +G++KVPGCSSIEI++VV+EF+  +   P    IY  L+ +   L++
Sbjct: 590 WNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGMLEEMEVLLEK 649

Query: 520 NGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEF 579
            G+V  T   L ++E+E KE ++ +HSEKLA+AFGL+++     L IVKNLR+C +CHE 
Sbjct: 650 AGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNLRVCRNCHEA 709

Query: 580 FKVLSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW 608
            K++S +YKR I+ RDR RFHHF +G CSC DYW
Sbjct: 710 TKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of CsaV3_1G002680 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 461.5 bits (1186), Expect = 1.1e-129
Identity = 219/522 (41.95%), Postives = 343/522 (65.71%), Query Frame = 0

Query: 88  RIFSKYRGSINSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSIL-PDSSTFPAVLKATAQ 147
           ++FSK    IN    N++IR Y ++   +++  ++  M    ++ PD+ T+P ++KA   
Sbjct: 74  KVFSKIEKPINVFIWNTLIRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTT 133

Query: 148 LCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNAL 207
           + D  +G+ IH +VI+ GF   +Y   +L+HLY  C  ++ A ++FD+MPE++ V WN++
Sbjct: 134 MADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSV 193

Query: 208 ITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRL 267
           I G+  N K  +A+  +  M + G +P   T+V +LSAC+ +GA   GK +H ++    L
Sbjct: 194 INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 253

Query: 268 RLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSR 327
             N+     L+D+YA+CG V E + +F+E+ +KN  +W  LI G A+NG G  A++ F  
Sbjct: 254 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 313

Query: 328 M-LMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRA 387
           M   E   P E+TF+G+L AC H G+V EG   F  M++++ ++PRIEH+GCMVDLL RA
Sbjct: 314 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 373

Query: 388 GLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLS 447
           G +++A E I+SM ++P+ +IWR LL AC VHG++ L E+   ++++LEPN+  +YVLLS
Sbjct: 374 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 433

Query: 448 NIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLD 507
           N+Y+ E+RW++V K+R  M   G++KVPG S +E+ N V+EF+  +   P+ +AIY +L 
Sbjct: 434 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 493

Query: 508 NLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLR 567
            +  +L+  GYV        D+E+EEKE++V+YHSEK+A+AF L+++P    + +VKNLR
Sbjct: 494 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 553

Query: 568 ICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW 608
           +C DCH   K++S VY R IVVRDR+RFHHF  G CSC+DYW
Sbjct: 554 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 595

BLAST of CsaV3_1G002680 vs. TAIR 10
Match: AT2G02980.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 448.4 bits (1152), Expect = 9.5e-126
Identity = 220/506 (43.48%), Postives = 324/506 (64.03%), Query Frame = 0

Query: 103 NSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKMIHGIVIQ 162
           NSM R Y      L    +F  + +  ILPD+ TFP++LKA A       G+ +H + ++
Sbjct: 98  NSMARGYSRFTNPLEVFSLFVEILEDGILPDNYTFPSLLKACAVAKALEEGRQLHCLSMK 157

Query: 163 MGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDA 222
           +G   +VY    L+++Y  C  +  A  +FD + E   V +NA+ITGY    +  +A+  
Sbjct: 158 LGLDDNVYVCPTLINMYTECEDVDSARCVFDRIVEPCVVCYNAMITGYARRNRPNEALSL 217

Query: 223 FRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAK 282
           FR M     +P+E T++ VLS+C+ LG+ + GKWIH++   +     V V TALIDM+AK
Sbjct: 218 FREMQGKYLKPNEITLLSVLSSCALLGSLDLGKWIHKYAKKHSFCKYVKVNTALIDMFAK 277

Query: 283 CGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSRMLMENFKPDEVTFLGV 342
           CG++ +   +FE++R K+   W+ +I  YA +G+ + ++  F RM  EN +PDE+TFLG+
Sbjct: 278 CGSLDDAVSIFEKMRYKDTQAWSAMIVAYANHGKAEKSMLMFERMRSENVQPDEITFLGL 337

Query: 343 LCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEP 402
           L AC H G V EGR  F  M  +FG+ P I+HYG MVDLL RAG LE+A E I  + I P
Sbjct: 338 LNACSHTGRVEEGRKYFSQMVSKFGIVPSIKHYGSMVDLLSRAGNLEDAYEFIDKLPISP 397

Query: 403 DPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRG 462
            P++WR LL AC  H N  L E + +R+ EL+ ++G +YV+LSN+Y+R ++W  V  LR 
Sbjct: 398 TPMLWRILLAACSSHNNLDLAEKVSERIFELDDSHGGDYVILSNLYARNKKWEYVDSLRK 457

Query: 463 MMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDM 522
           +M  R   KVPGCSSIE+NNVV+EF + +  K     +++ LD ++K+LK +GYV  T M
Sbjct: 458 VMKDRKAVKVPGCSSIEVNNVVHEFFSGDGVKSATTKLHRALDEMVKELKLSGYVPDTSM 517

Query: 523 ALY-DIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVY 582
            ++ ++  +EKE ++ YHSEKLA+ FGLLN+P   T+R+VKNLR+C DCH   K++SL++
Sbjct: 518 VVHANMNDQEKEITLRYHSEKLAITFGLLNTPPGTTIRVVKNLRVCRDCHNAAKLISLIF 577

Query: 583 KRYIVVRDRNRFHHFYEGFCSCRDYW 608
            R +V+RD  RFHHF +G CSC D+W
Sbjct: 578 GRKVVLRDVQRFHHFEDGKCSCGDFW 603

BLAST of CsaV3_1G002680 vs. TAIR 10
Match: AT3G62890.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 435.3 bits (1118), Expect = 8.3e-122
Identity = 228/550 (41.45%), Postives = 328/550 (59.64%), Query Frame = 0

Query: 96  SINSQQCNSMIRTYLDLNKHLNSLYIFALMHKFSILPDSSTFPAVLKATAQLCDTGVGKM 155
           +++S Q +S I  YL +  H              + PD  TFP +L +        +G+ 
Sbjct: 37  NVSSPQRHSPISVYLRMRNH-------------RVSPDFHTFPFLLPSFHNPLHLPLGQR 96

Query: 156 IHGIVIQMGFICDVYTSTALVHLYCTCLS------------------------------- 215
            H  ++  G   D +  T+L+++Y +C                                 
Sbjct: 97  THAQILLFGLDKDPFVRTSLLNMYSSCGDLRSAQRVFDDSGSKDLPAWNSVVNAYAKAGL 156

Query: 216 ISDASQLFDEMPERNAVTWNALITGYTHNRKFVKAIDAFRGMLADG-----AQPSERTVV 275
           I DA +LFDEMPERN ++W+ LI GY    K+ +A+D FR M          +P+E T+ 
Sbjct: 157 IDDARKLFDEMPERNVISWSCLINGYVMCGKYKEALDLFREMQLPKPNEAFVRPNEFTMS 216

Query: 276 VVLSACSHLGAFNQGKWIHEFIYHNRLRLNVFVGTALIDMYAKCGAVYEVEKVFEEI-RE 335
            VLSAC  LGA  QGKW+H +I    + +++ +GTALIDMYAKCG++   ++VF  +  +
Sbjct: 217 TVLSACGRLGALEQGKWVHAYIDKYHVEIDIVLGTALIDMYAKCGSLERAKRVFNALGSK 276

Query: 336 KNVYTWNVLISGYAMNGQGDAALQAFSRM-LMENFKPDEVTFLGVLCACCHQGLVTEGRW 395
           K+V  ++ +I   AM G  D   Q FS M   +N  P+ VTF+G+L AC H+GL+ EG+ 
Sbjct: 277 KDVKAYSAMICCLAMYGLTDECFQLFSEMTTSDNINPNSVTFVGILGACVHRGLINEGKS 336

Query: 396 QFMSMKQQFGLQPRIEHYGCMVDLLGRAGLLEEALELIQSMSIEPDPIIWRALLCACRVH 455
            F  M ++FG+ P I+HYGCMVDL GR+GL++EA   I SM +EPD +IW +LL   R+ 
Sbjct: 337 YFKMMIEEFGITPSIQHYGCMVDLYGRSGLIKEAESFIASMPMEPDVLIWGSLLSGSRML 396

Query: 456 GNTKLGEYIIKRLIELEPNNGENYVLLSNIYSRERRWAEVGKLRGMMNLRGIRKVPGCSS 515
           G+ K  E  +KRLIEL+P N   YVLLSN+Y++  RW EV  +R  M ++GI KVPGCS 
Sbjct: 397 GDIKTCEGALKRLIELDPMNSGAYVLLSNVYAKTGRWMEVKCIRHEMEVKGINKVPGCSY 456

Query: 516 IEINNVVYEFVASNDRKPEFEAIYKQLDNLIKKLKENGYVTGTDMALYDIEKEEKEHSVM 575
           +E+  VV+EFV  ++ + E E IY  LD ++++L+E GYVT T   L D+ +++KE ++ 
Sbjct: 457 VEVEGVVHEFVVGDESQQESERIYAMLDEIMQRLREAGYVTDTKEVLLDLNEKDKEIALS 516

Query: 576 YHSEKLALAFGLLNSPLDCTLRIVKNLRICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFY 608
           YHSEKLA+AF L+ +     +RI+KNLRIC DCH   K++S ++ R IVVRD NRFHHF 
Sbjct: 517 YHSEKLAIAFCLMKTRPGTPVRIIKNLRICGDCHLVMKMISKLFSREIVVRDCNRFHHFR 573

BLAST of CsaV3_1G002680 vs. TAIR 10
Match: AT4G21065.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 435.3 bits (1118), Expect = 8.3e-122
Identity = 203/462 (43.94%), Postives = 310/462 (67.10%), Query Frame = 0

Query: 147 LCDTGVGKMIHGIVIQMGFICDVYTSTALVHLYCTCLSISDASQLFDEMPERNAVTWNAL 206
           + D  +G+ IH +VI+ GF   +Y   +L+HLY  C  ++ A ++FD+MPE++ V WN++
Sbjct: 1   MADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSV 60

Query: 207 ITGYTHNRKFVKAIDAFRGMLADGAQPSERTVVVVLSACSHLGAFNQGKWIHEFIYHNRL 266
           I G+  N K  +A+  +  M + G +P   T+V +LSAC+ +GA   GK +H ++    L
Sbjct: 61  INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 120

Query: 267 RLNVFVGTALIDMYAKCGAVYEVEKVFEEIREKNVYTWNVLISGYAMNGQGDAALQAFSR 326
             N+     L+D+YA+CG V E + +F+E+ +KN  +W  LI G A+NG G  A++ F  
Sbjct: 121 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 180

Query: 327 M-LMENFKPDEVTFLGVLCACCHQGLVTEGRWQFMSMKQQFGLQPRIEHYGCMVDLLGRA 386
           M   E   P E+TF+G+L AC H G+V EG   F  M++++ ++PRIEH+GCMVDLL RA
Sbjct: 181 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 240

Query: 387 GLLEEALELIQSMSIEPDPIIWRALLCACRVHGNTKLGEYIIKRLIELEPNNGENYVLLS 446
           G +++A E I+SM ++P+ +IWR LL AC VHG++ L E+   ++++LEPN+  +YVLLS
Sbjct: 241 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 300

Query: 447 NIYSRERRWAEVGKLRGMMNLRGIRKVPGCSSIEINNVVYEFVASNDRKPEFEAIYKQLD 506
           N+Y+ E+RW++V K+R  M   G++KVPG S +E+ N V+EF+  +   P+ +AIY +L 
Sbjct: 301 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 360

Query: 507 NLIKKLKENGYVTGTDMALYDIEKEEKEHSVMYHSEKLALAFGLLNSPLDCTLRIVKNLR 566
            +  +L+  GYV        D+E+EEKE++V+YHSEK+A+AF L+++P    + +VKNLR
Sbjct: 361 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 420

Query: 567 ICLDCHEFFKVLSLVYKRYIVVRDRNRFHHFYEGFCSCRDYW 608
           +C DCH   K++S VY R IVVRDR+RFHHF  G CSC+DYW
Sbjct: 421 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 462

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004138309.20.0e+00100.00pentatricopeptide repeat-containing protein At4g21065 [Cucumis sativus] >KGN6370... [more]
KAA0057970.10.0e+0094.57pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa][more]
XP_016901378.10.0e+0094.48PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Cucumis m... [more]
XP_038878567.10.0e+0088.15pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida][more]
XP_023529316.13.1e-30484.30pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp... [more]
Match NameE-valueIdentityDescription
Q9LN012.3e-12941.29Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
A8MQA31.5e-12841.95Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q8LK931.3e-12443.48Pentatricopeptide repeat-containing protein At2g02980, chloroplastic OS=Arabidop... [more]
Q683I91.2e-12041.45Pentatricopeptide repeat-containing protein At3g62890 OS=Arabidopsis thaliana OX... [more]
Q9SN851.4e-11842.15Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A0A0LUK80.0e+00100.00DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G0115... [more]
A0A5A7US400.0e+0094.57Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S4DZH30.0e+0094.48pentatricopeptide repeat-containing protein At4g21065-like OS=Cucumis melo OX=36... [more]
A0A6J1EAY23.7e-30383.82pentatricopeptide repeat-containing protein At4g21065-like OS=Cucurbita moschata... [more]
A0A6J1BZP41.1e-29683.33pentatricopeptide repeat-containing protein At4g21065-like OS=Momordica charanti... [more]
Match NameE-valueIdentityDescription
AT1G08070.11.7e-13041.29Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.11.1e-12941.95Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G02980.19.5e-12643.48Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G62890.18.3e-12241.45Pentatricopeptide repeat (PPR) superfamily protein [more]
AT4G21065.28.3e-12243.94Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 494..514
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 77..581
NoneNo IPR availablePANTHERPTHR47928:SF133SUBFAMILY NOT NAMEDcoord: 77..581
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 375..399
e-value: 0.0019
score: 16.2
coord: 201..234
e-value: 1.0E-6
score: 26.5
coord: 302..336
e-value: 3.4E-8
score: 31.2
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 374..398
e-value: 0.0019
score: 18.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 299..348
e-value: 2.5E-11
score: 43.6
coord: 199..246
e-value: 2.2E-9
score: 37.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 300..334
score: 12.079411
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 199..233
score: 11.092894
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 80..256
e-value: 1.3E-25
score: 92.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 356..434
e-value: 1.7E-5
score: 26.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 257..355
e-value: 2.3E-25
score: 91.0
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 295..456
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 473..596
e-value: 1.2E-35
score: 122.2

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_1G002680.1CsaV3_1G002680.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding