CsaV3_1G038250 (gene) Cucumber (Chinese Long) v3

NameCsaV3_1G038250
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionPentatricopeptide repeat-containing protein
Locationchr1 : 24037916 .. 24041383 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGATAGAAACTTAGCAACTTCAATTCTAAAGTTAATTAACTCTTTCCACTTAAATTATCCGAGAAATCCCAAATTACCCAACGAACTTTCCAACGGTTAATATAAATGAGTTTGGTTTGATTCCCTTTCATCAGCTGAGGGAAGAAGAATCTAAGGATGAGCTCAATCTCCACCTCACACCTCCCTTCTCCTTTCAAACCAGTGGATTTCTCGGCAGAGAAGAACATTCCCACATCAAAACTTCCACAAAAAACTGTTTTGAAGCTTTTCGACTCAAAATCCATCACTTCTTTGCAATATCTCACCCAACTTCATGGCCTTGTATTGCGTAGTGGTCATTTCCAAGACCATTACGTCTCCGGCGCGTTGCTCAAGTGTTATGCAAATCCCCATTTCAGCAATTTTGACTTTGCTTTGAAGGTATTCTCCTCAATCCCAAATCCCAACGTTTTCATTTGGAATATTGTGATTAAAGGGTGTTTAGAGAACAACAAACTGTTTAAAGCTATTTACTTCTATGGTAGGATGGTTATTGATGCTAGGCCCAATAAATTCACGTACCCAACTTTGTTTAAAGCTTGTTCTGTGGCACAAGCTGTACAAGAAGGGAGGCAAATTCATGGTCATGTGGTGAAACATGGGATTGGTAGTGACGTGCATATCAAAAGTGCTGGAATTCACATGTATGCCTCTTTTGGTAGATTAGAGGATGCAAGGAAAATGTTTTACAGTGGGGAGTCCGATGTTGTCTGTTGGAATACAATGATTGATGGGTACCTGAAATGTGGGGTTCTTGAAGCTGCTAAAGGGTTGTTTGCTCAAATGCCAGTAAAAAATATTGGCTCATGGAATGTGATGATCAATGGTTTAGCTAAGGGTGGGAATTTGGGAGATGCAAGGAAGTTGTTCGATGAAATGAGTGAAAGAGACGAAATTTCTTGGAGTTCTATGGTAGATGGTTACATATCAGCAGGTCGTTACAAAGAAGCATTAGAGATTTTCCAGCAAATGCAAAGAGAGGAGACCAGGCCTGGAAGGTTCATTTTGTCTAGTGTTCTAGCTGCTTGTTCAAATATTGGAGCCATTGATCAAGGGAGATGGGTTCATGCTTATCTCAAGAGGAACTCCATTAAATTGGATGCAGTGTTGGGGACTGCCTTATTGGATATGTATGCAAAATGTGGTAGGCTGGACATGGGATGGGAGGTATTTGAAGAAATGAAAGAAAGAGAGATCTTCACTTGGAATGCCATGATTGGTGGGCTTGCTATACATGGAAGAGCAGAGGATGCACTTGAGCTTTTCTCTAAGTTGCAGGAGGGGAGGATGAAACCGAATGGAATCACACTGGTTGGTGTTCTTACTGCTTGTGCTCATGCAGGTTTTGTTGACAAAGGCCTGAGAATTTTCCAAACAATGAGAGAGTTTTATGGTGTTGATCCTGAACTGGAACATTATGGATGTATGGTTGATTTGCTAGGGAGGTCAGGGTTGTTTTCTGAAGCTGAGGATTTGATAAACTCAATGCCAATGAAACCCAATGCAGCTGTTTGGGGAGCGCTCTTGGGTGCCTGCAGGATTCATGGAAATTTCGATTTGGCTGAAAGAGTGGGGAAGATTTTGCTCGAATTAGAGCCACAGAACAGTGGCCGGTATGTGTTACTGTCAAATATATATGCAAAAGTAGGGAGGTTTGATGATGTTTCTAAAATTAGAAAGTTGATGAAGGATAGGGGGATAAAAACAGTGCCTGGTGTCAGCATAGTTGATTTAAATGGTACAGTTCATGAATTCAAAATGGGCGACGGATCACATCCACAAATGAAAGAAATTTATAGGAAGCTAAAAATAATAAAAGAGAGGCTACAGATGGCAGGACATTCACCTGATACATCTCAAGTTTTATTTGATATTGATGAGGAAGAGAAGGAAACTGCAGTTAATTACCATAGTGAAAAGCTTGCAATTGCTTTTGGATTGATTAATACCTTGCCAGGTAAACGCATACACATAGTGAAGAACTTGAGGGTTTGTGACGATTGTCATTCAGCTACAAAGCTTATTTCTCAAATTTTTGATCGAGAAATAATTGTGAGGGACCGTGTTCGTTATCACCATTTCAAAAATGGAACTTGTTCATGTAAAGATTTTTGGTGATCTATGCTCTTCAAAAGGTGTACATACTCAATATATTGAGAAAGCATCTTAAGCAATAGAAGGCTATAGTTGACAACTAAAAATGCGAACCTAGTCAAGATCAGTAGAATATAGGAAACCCGTGGAATCAAGCTCTGAACGATAAGGTAAACATGCTCATATATCTGTAACCTCACGATAATTTGCTTCTTAAGTTCTAAATTAAATTCAATGTATGAAAAGTTCAATTTCATCTACATATGGAAAGGTGAAATGTTATCTCAATTTCAATGTGTAATCCATAAAGCTTAAGCAAAGTATGAATTAGTTAAAATCGTTTTTCATATCATATCTTTTCATTTCATCAGAAAGCAGTTTGCACAAAAGCCACCATTGTCGTGTACAGACTATTGCATATTTCTCAATATTTTCTCTAACCAAACATTTTTGTGCTGCTTGACTTTTCACCGACTGTTTTGGATGACTAAAATTAGTAACTTAAGTTTGCTGCCTGAAACAAATCATTTCCATCTCTTTGTAAATTCAAAAACAGATGTCTTGATCCATTAGCACCTGTGTGCCAATGATAGTTTTAAGTTTGCGGGCTTTTAAAAGTTTAGAGCTGAGTCGTATTACATATACCTTATTCTAAAATCTTCCTGCTCATCACTGCTTTTGTTAATTAATTCCAATGGAGTTATGAGAAAATTACTGACATTTTATGAAAAACAGGAGTTATCCAGCTGACGGTTCAGAACTTCCCAAGTGCGTAAGGCTCATTAGAGACAGGGCAGAGTACTATACGTGCTCATAAATATGGAGTGAATGCTGTCTATGAACTTTAGAGATAGGGAAATACAACATGCTTCTTCTCCTCCTTCTCTTGTAGGCTGTAGCATGGTCATTTTACGAACGGTTTAGGTGCTACAATATTACACCACTAACGAGCACCGATCCCGACCCTTACGTTATTTTTCTGGATCATAGTGTATGCAAAAGATAGGATGCCAAACTTATTAGCCCACAATCATTGAGCTAAAGCTGTGCAAGATAAAAAGAAAAGAATTAAACCCAATGAAGACAAGTTTTGGCACAAAAATAAAAAAAAATGAGTATGGGATATCACTAGTCTGTGTTTGTGAAAAAAAATTAATATTATTCAAAATGAAGCTATCTAGTAGTTATATGTGAATTTTCGAGACTTTTTTACACTAATCAAAATAGGGTCATTTCTTTTTAGGCTATAACATGACCAATTTTCGAACATTATGGGTGCTTCCATTGTACACCACTAACAAACAATAATCCGTACCCTTTGGCTATTTTTC

mRNA sequence

ATGGTTATTGATGCTAGGCCCAATAAATTCACGTACCCAACTTTGTTTAAAGCTTGTTCTGTGGCACAAGCTGTACAAGAAGGGAGGCAAATTCATGGTCATGTGGTGAAACATGGGATTGGTAGTGACGTGCATATCAAAAGTGCTGGAATTCACATGTATGCCTCTTTTGGTAGATTAGAGGATGCAAGGAAAATGTTTTACAGTGGGGAGTCCGATGTTGTCTGTTGGAATACAATGATTGATGGGTACCTGAAATGTGGGGTTCTTGAAGCTGCTAAAGGGTTGTTTGCTCAAATGCCAGTAAAAAATATTGGCTCATGGAATGTGATGATCAATGGTTTAGCTAAGGGTGGGAATTTGGGAGATGCAAGGAAGTTGTTCGATGAAATGAGTGAAAGAGACGAAATTTCTTGGAGTTCTATGGTAGATGGTTACATATCAGCAGGTCGTTACAAAGAAGCATTAGAGATTTTCCAGCAAATGCAAAGAGAGGAGACCAGGCCTGGAAGGTTCATTTTGTCTAGTGTTCTAGCTGCTTGTTCAAATATTGGAGCCATTGATCAAGGGAGATGGGTTCATGCTTATCTCAAGAGGAACTCCATTAAATTGGATGCAGTGTTGGGGACTGCCTTATTGGATATGTATGCAAAATGTGGTAGGCTGGACATGGGATGGGAGGTATTTGAAGAAATGAAAGAAAGAGAGATCTTCACTTGGAATGCCATGATTGGTGGGCTTGCTATACATGGAAGAGCAGAGGATGCACTTGAGCTTTTCTCTAAGTTGCAGGAGGGGAGGATGAAACCGAATGGAATCACACTGGTTGGTGTTCTTACTGCTTGTGCTCATGCAGGTTTTGTTGACAAAGGCCTGAGAATTTTCCAAACAATGAGAGAGTTTTATGGTGTTGATCCTGAACTGGAACATTATGGATGTATGGTTGATTTGCTAGGGAGGTCAGGGTTGTTTTCTGAAGCTGAGGATTTGATAAACTCAATGCCAATGAAACCCAATGCAGCTGTTTGGGGAGCGCTCTTGGGTGCCTGCAGGATTCATGGAAATTTCGATTTGGCTGAAAGAGTGGGGAAGATTTTGCTCGAATTAGAGCCACAGAACAGTGGCCGGTATGTGTTACTGTCAAATATATATGCAAAAGTAGGGAGGTTTGATGATGTTTCTAAAATTAGAAAGTTGATGAAGGATAGGGGGATAAAAACAGTGCCTGGTGTCAGCATAGTTGATTTAAATGGTACAGTTCATGAATTCAAAATGGGCGACGGATCACATCCACAAATGAAAGAAATTTATAGGAAGCTAAAAATAATAAAAGAGAGGCTACAGATGGCAGGACATTCACCTGATACATCTCAAGTTTTATTTGATATTGATGAGGAAGAGAAGGAAACTGCAGTTAATTACCATAGTGAAAAGCTTGCAATTGCTTTTGGATTGATTAATACCTTGCCAGGTAAACGCATACACATAGTGAAGAACTTGAGGGTTTGTGACGATTGTCATTCAGCTACAAAGCTTATTTCTCAAATTTTTGATCGAGAAATAATTGTGAGGGACCGTGTTCGTTATCACCATTTCAAAAATGGAACTTGTTCATGTAAAGATTTTTGGTGA

Coding sequence (CDS)

ATGGTTATTGATGCTAGGCCCAATAAATTCACGTACCCAACTTTGTTTAAAGCTTGTTCTGTGGCACAAGCTGTACAAGAAGGGAGGCAAATTCATGGTCATGTGGTGAAACATGGGATTGGTAGTGACGTGCATATCAAAAGTGCTGGAATTCACATGTATGCCTCTTTTGGTAGATTAGAGGATGCAAGGAAAATGTTTTACAGTGGGGAGTCCGATGTTGTCTGTTGGAATACAATGATTGATGGGTACCTGAAATGTGGGGTTCTTGAAGCTGCTAAAGGGTTGTTTGCTCAAATGCCAGTAAAAAATATTGGCTCATGGAATGTGATGATCAATGGTTTAGCTAAGGGTGGGAATTTGGGAGATGCAAGGAAGTTGTTCGATGAAATGAGTGAAAGAGACGAAATTTCTTGGAGTTCTATGGTAGATGGTTACATATCAGCAGGTCGTTACAAAGAAGCATTAGAGATTTTCCAGCAAATGCAAAGAGAGGAGACCAGGCCTGGAAGGTTCATTTTGTCTAGTGTTCTAGCTGCTTGTTCAAATATTGGAGCCATTGATCAAGGGAGATGGGTTCATGCTTATCTCAAGAGGAACTCCATTAAATTGGATGCAGTGTTGGGGACTGCCTTATTGGATATGTATGCAAAATGTGGTAGGCTGGACATGGGATGGGAGGTATTTGAAGAAATGAAAGAAAGAGAGATCTTCACTTGGAATGCCATGATTGGTGGGCTTGCTATACATGGAAGAGCAGAGGATGCACTTGAGCTTTTCTCTAAGTTGCAGGAGGGGAGGATGAAACCGAATGGAATCACACTGGTTGGTGTTCTTACTGCTTGTGCTCATGCAGGTTTTGTTGACAAAGGCCTGAGAATTTTCCAAACAATGAGAGAGTTTTATGGTGTTGATCCTGAACTGGAACATTATGGATGTATGGTTGATTTGCTAGGGAGGTCAGGGTTGTTTTCTGAAGCTGAGGATTTGATAAACTCAATGCCAATGAAACCCAATGCAGCTGTTTGGGGAGCGCTCTTGGGTGCCTGCAGGATTCATGGAAATTTCGATTTGGCTGAAAGAGTGGGGAAGATTTTGCTCGAATTAGAGCCACAGAACAGTGGCCGGTATGTGTTACTGTCAAATATATATGCAAAAGTAGGGAGGTTTGATGATGTTTCTAAAATTAGAAAGTTGATGAAGGATAGGGGGATAAAAACAGTGCCTGGTGTCAGCATAGTTGATTTAAATGGTACAGTTCATGAATTCAAAATGGGCGACGGATCACATCCACAAATGAAAGAAATTTATAGGAAGCTAAAAATAATAAAAGAGAGGCTACAGATGGCAGGACATTCACCTGATACATCTCAAGTTTTATTTGATATTGATGAGGAAGAGAAGGAAACTGCAGTTAATTACCATAGTGAAAAGCTTGCAATTGCTTTTGGATTGATTAATACCTTGCCAGGTAAACGCATACACATAGTGAAGAACTTGAGGGTTTGTGACGATTGTCATTCAGCTACAAAGCTTATTTCTCAAATTTTTGATCGAGAAATAATTGTGAGGGACCGTGTTCGTTATCACCATTTCAAAAATGGAACTTGTTCATGTAAAGATTTTTGGTGA

Protein sequence

MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKNIGSWNVMINGLAKGGNLGDARKLFDEMSERDEISWSSMVDGYISAGRYKEALEIFQQMQREETRPGRFILSSVLAACSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTWNAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMREFYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAERVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTVHEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLAIAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCKDFW
BLAST of CsaV3_1G038250 vs. NCBI nr
Match: XP_004135765.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X1 [Cucumis sativus] >KGN66009.1 hypothetical protein Csa_1G561400 [Cucumis sativus])

HSP 1 Score: 972.2 bits (2512), Expect = 7.1e-280
Identity = 543/543 (100.00%), Postives = 543/543 (100.00%), Query Frame = 0

Query: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60
           MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL
Sbjct: 124 MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 183

Query: 61  EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 120
           EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX
Sbjct: 184 EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 243

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA
Sbjct: 244 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 303

Query: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240
           CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW
Sbjct: 304 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 363

Query: 241 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 300
           NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE
Sbjct: 364 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 423

Query: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 360
           FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE
Sbjct: 424 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 483

Query: 361 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 420
           RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV
Sbjct: 484 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 543

Query: 421 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 480
           HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA
Sbjct: 544 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 603

Query: 481 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 540
           IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK
Sbjct: 604 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 663

Query: 541 DFW 544
           DFW
Sbjct: 664 DFW 666

BLAST of CsaV3_1G038250 vs. NCBI nr
Match: XP_011659572.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X2 [Cucumis sativus])

HSP 1 Score: 972.2 bits (2512), Expect = 7.1e-280
Identity = 543/543 (100.00%), Postives = 543/543 (100.00%), Query Frame = 0

Query: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60
           MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL
Sbjct: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60

Query: 61  EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 120
           EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX
Sbjct: 61  EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180

Query: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240
           CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW
Sbjct: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240

Query: 241 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 300
           NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE
Sbjct: 241 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 300

Query: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 360
           FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE
Sbjct: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 360

Query: 361 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 420
           RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV
Sbjct: 361 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 420

Query: 421 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 480
           HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA
Sbjct: 421 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 480

Query: 481 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 540
           IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK
Sbjct: 481 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 540

Query: 541 DFW 544
           DFW
Sbjct: 541 DFW 543

BLAST of CsaV3_1G038250 vs. NCBI nr
Match: XP_016900941.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X1 [Cucumis melo] >XP_016900942.1 PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X1 [Cucumis melo] >XP_016900943.1 PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X1 [Cucumis melo])

HSP 1 Score: 943.3 bits (2437), Expect = 3.5e-271
Identity = 525/543 (96.69%), Postives = 536/543 (98.71%), Query Frame = 0

Query: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60
           MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSD+HIKSAGI MYASFGRL
Sbjct: 124 MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDMHIKSAGIQMYASFGRL 183

Query: 61  EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 120
           EDARK+FYSGESDVVCWNTMIDGYLKCG LEAAKGLFAQMPV+ XXXXXXXXXXXXXXXX
Sbjct: 184 EDARKLFYSGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPVRNXXXXXXXXXXXXXXXX 243

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA
Sbjct: 244 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 303

Query: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240
           CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW
Sbjct: 304 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 363

Query: 241 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 300
           NAMIGGLAIHGRAEDALELFSK+QEGRMKPNG+TLVGVLTACAHAGFVDKGLRIFQTMRE
Sbjct: 364 NAMIGGLAIHGRAEDALELFSKMQEGRMKPNGVTLVGVLTACAHAGFVDKGLRIFQTMRE 423

Query: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 360
           FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNF+LAE
Sbjct: 424 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFNLAE 483

Query: 361 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 420
           RVGKILLELEPQNSGRYVLLSNIYA VGRFDDV+KIRKLMKDRGIKT+PGVS VDLNGTV
Sbjct: 484 RVGKILLELEPQNSGRYVLLSNIYANVGRFDDVAKIRKLMKDRGIKTLPGVSTVDLNGTV 543

Query: 421 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 480
           HEFKMGDGSH QMKEIYRKLKIIKERLQMAGHSPDTSQVLFDI+EEEKETAV YHSEKLA
Sbjct: 544 HEFKMGDGSHLQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIEEEEKETAVQYHSEKLA 603

Query: 481 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 540
           IAFGLINTLPG+RIHIVKNLRVCDDCHSATKLISQI+DREIIVRDRVRYHHFKNGTCSCK
Sbjct: 604 IAFGLINTLPGERIHIVKNLRVCDDCHSATKLISQIYDREIIVRDRVRYHHFKNGTCSCK 663

Query: 541 DFW 544
           DFW
Sbjct: 664 DFW 666

BLAST of CsaV3_1G038250 vs. NCBI nr
Match: XP_016900944.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X2 [Cucumis melo])

HSP 1 Score: 943.3 bits (2437), Expect = 3.5e-271
Identity = 525/543 (96.69%), Postives = 536/543 (98.71%), Query Frame = 0

Query: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60
           MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSD+HIKSAGI MYASFGRL
Sbjct: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDMHIKSAGIQMYASFGRL 60

Query: 61  EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 120
           EDARK+FYSGESDVVCWNTMIDGYLKCG LEAAKGLFAQMPV+ XXXXXXXXXXXXXXXX
Sbjct: 61  EDARKLFYSGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPVRNXXXXXXXXXXXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180

Query: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240
           CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW
Sbjct: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240

Query: 241 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 300
           NAMIGGLAIHGRAEDALELFSK+QEGRMKPNG+TLVGVLTACAHAGFVDKGLRIFQTMRE
Sbjct: 241 NAMIGGLAIHGRAEDALELFSKMQEGRMKPNGVTLVGVLTACAHAGFVDKGLRIFQTMRE 300

Query: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 360
           FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNF+LAE
Sbjct: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFNLAE 360

Query: 361 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 420
           RVGKILLELEPQNSGRYVLLSNIYA VGRFDDV+KIRKLMKDRGIKT+PGVS VDLNGTV
Sbjct: 361 RVGKILLELEPQNSGRYVLLSNIYANVGRFDDVAKIRKLMKDRGIKTLPGVSTVDLNGTV 420

Query: 421 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 480
           HEFKMGDGSH QMKEIYRKLKIIKERLQMAGHSPDTSQVLFDI+EEEKETAV YHSEKLA
Sbjct: 421 HEFKMGDGSHLQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIEEEEKETAVQYHSEKLA 480

Query: 481 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 540
           IAFGLINTLPG+RIHIVKNLRVCDDCHSATKLISQI+DREIIVRDRVRYHHFKNGTCSCK
Sbjct: 481 IAFGLINTLPGERIHIVKNLRVCDDCHSATKLISQIYDREIIVRDRVRYHHFKNGTCSCK 540

Query: 541 DFW 544
           DFW
Sbjct: 541 DFW 543

BLAST of CsaV3_1G038250 vs. NCBI nr
Match: XP_023529862.1 (pentatricopeptide repeat-containing protein At5g48910-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 845.1 bits (2182), Expect = 1.3e-241
Identity = 470/543 (86.56%), Postives = 510/543 (93.92%), Query Frame = 0

Query: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60
           MVIDARPNKF+YPTLFKACSVAQAV EGRQIH HVVKHG GSD+HIKSAGI MY SFG  
Sbjct: 1   MVIDARPNKFSYPTLFKACSVAQAVHEGRQIHCHVVKHGFGSDMHIKSAGIQMYTSFGSF 60

Query: 61  EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 120
           EDARK+  +GESDVVCWNT+IDGYLKCG LEAAKGLF QMP  XXXXXXXXXXXXXXXXX
Sbjct: 61  EDARKLLDNGESDVVCWNTLIDGYLKCGNLEAAKGLFEQMPTTXXXXXXXXXXXXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  PGRFIL SVLAA
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGRFILCSVLAA 180

Query: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240
           CS++GAIDQGRWVHAYL+RNSIKLDAVLGTALLDMYAKCGRLDM W+VF E++ERE+FTW
Sbjct: 181 CSSVGAIDQGRWVHAYLERNSIKLDAVLGTALLDMYAKCGRLDMAWKVFNELQEREVFTW 240

Query: 241 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 300
           NAMIGGLAIHGRAEDALELFSK+Q+GR+KPNG+TLV +LTACAHAGFVD+GLRIF+TM+E
Sbjct: 241 NAMIGGLAIHGRAEDALELFSKMQKGRLKPNGVTLVSILTACAHAGFVDRGLRIFETMKE 300

Query: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 360
           FYGV+PE+ HYGC+VDLLGRSGLFSEAE+LI+SMPMKPNAAVWGALLG CRIHGN +LAE
Sbjct: 301 FYGVEPEMVHYGCVVDLLGRSGLFSEAEELISSMPMKPNAAVWGALLGGCRIHGNVELAE 360

Query: 361 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 420
           RVGKILLEL+  NSG Y LLSNIYAK GRFDDV+KIRK+MK++GIKTVPGVS+VDLNGTV
Sbjct: 361 RVGKILLELDVHNSGYYTLLSNIYAKAGRFDDVAKIRKMMKEKGIKTVPGVSMVDLNGTV 420

Query: 421 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 480
           HEFKMGD SHPQMKEIYRKL+ IKERLQMAG+SPDTSQVLFDI+EEEKE+AV  HSEKLA
Sbjct: 421 HEFKMGDRSHPQMKEIYRKLERIKERLQMAGYSPDTSQVLFDIEEEEKESAVRCHSEKLA 480

Query: 481 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 540
           IAFGLIN+ PGK IHIVKNLR+CDDCHSATKLISQI+D+EIIVRDRVRYHHFKNGTCSCK
Sbjct: 481 IAFGLINSSPGKTIHIVKNLRICDDCHSATKLISQIYDQEIIVRDRVRYHHFKNGTCSCK 540

Query: 541 DFW 544
           DFW
Sbjct: 541 DFW 543

BLAST of CsaV3_1G038250 vs. TAIR10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 490.0 bits (1260), Expect = 1.9e-138
Identity = 264/552 (47.83%), Postives = 374/552 (67.75%), Query Frame = 0

Query: 7   PNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDARKM 66
           PN+FT+P++ KAC+    +QEG+QIHG  +K+G G D  + S  + MY   G ++DAR +
Sbjct: 126 PNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVL 185

Query: 67  FYSG---------------ESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXX 126
           FY                 + ++V WN MIDGY++ G  +AA+ LF              
Sbjct: 186 FYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLF-------------- 245

Query: 127 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGR 186
                             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   RP  
Sbjct: 246 -----------------DXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDIRPNY 305

Query: 187 FILSSVLAACSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEE 246
             L SVL A S +G+++ G W+H Y + + I++D VLG+AL+DMY+KCG ++    VFE 
Sbjct: 306 VTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFER 365

Query: 247 MKEREIFTWNAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKG 306
           +    + TW+AMI G AIHG+A DA++ F K+++  ++P+ +  + +LTAC+H G V++G
Sbjct: 366 LPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEG 425

Query: 307 LRIFQTMREFYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACR 366
            R F  M    G++P +EHYGCMVDLLGRSGL  EAE+ I +MP+KP+  +W ALLGACR
Sbjct: 426 RRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACR 485

Query: 367 IHGNFDLAERVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGV 426
           + GN ++ +RV  IL+++ P +SG YV LSN+YA  G + +VS++R  MK++ I+  PG 
Sbjct: 486 MQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGC 545

Query: 427 SIVDLNGTVHEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETA 486
           S++D++G +HEF + D SHP+ KEI   L  I ++L++AG+ P T+QVL +++EE+KE  
Sbjct: 546 SLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENV 605

Query: 487 VNYHSEKLAIAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHH 544
           ++YHSEK+A AFGLI+T PGK I IVKNLR+C+DCHS+ KLIS+++ R+I VRDR R+HH
Sbjct: 606 LHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHH 646

BLAST of CsaV3_1G038250 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 469.5 bits (1207), Expect = 2.7e-132
Identity = 228/543 (41.99%), Postives = 338/543 (62.25%), Query Frame = 0

Query: 4   DARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDA 63
           + RP++ T  T+  AC+ + +++ GRQ+H  +  HG GS++ I +A I +Y+  G LE A
Sbjct: 261 NVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETA 320

Query: 64  RKMFYS-GESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXXXX 123
             +F      DV+ WNT+I GY    + + A  LF +M                      
Sbjct: 321 CGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEM---------------------- 380

Query: 124 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAACS 183
                                                         P    + S+L AC+
Sbjct: 381 ----------------------------------------LRSGETPNDVTMLSILPACA 440

Query: 184 NIGAIDQGRWVHAYLKR--NSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 243
           ++GAID GRW+H Y+ +    +   + L T+L+DMYAKCG ++   +VF  +  + + +W
Sbjct: 441 HLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSW 500

Query: 244 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 303
           NAMI G A+HGRA+ + +LFS++++  ++P+ IT VG+L+AC+H+G +D G  IF+TM +
Sbjct: 501 NAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQ 560

Query: 304 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 363
            Y + P+LEHYGCM+DLLG SGLF EAE++IN M M+P+  +W +LL AC++HGN +L E
Sbjct: 561 DYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGE 620

Query: 364 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 423
              + L+++EP+N G YVLLSNIYA  GR+++V+K R L+ D+G+K VPG S ++++  V
Sbjct: 621 SFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVV 680

Query: 424 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 483
           HEF +GD  HP+ +EIY  L+ ++  L+ AG  PDTS+VL +++EE KE A+ +HSEKLA
Sbjct: 681 HEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLA 740

Query: 484 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 543
           IAFGLI+T PG ++ IVKNLRVC +CH ATKLIS+I+ REII RDR R+HHF++G CSC 
Sbjct: 741 IAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCN 741

BLAST of CsaV3_1G038250 vs. TAIR10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 458.4 bits (1178), Expect = 6.3e-129
Identity = 274/610 (44.92%), Postives = 376/610 (61.64%), Query Frame = 0

Query: 7   PNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDARKM 66
           PNK+T+P L KA +   ++  G+ +HG  VK  +GSDV + ++ IH Y S G L+ A K+
Sbjct: 129 PNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKV 188

Query: 67  F----------------------------------------------------------- 126
           F                                                           
Sbjct: 189 FTTIKEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTMVGVLSACAKIRNL 248

Query: 127 --------YSGES----DVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXX 186
                   Y  E+    ++   N M+D Y KCG +E AK LF  M  K      XXXXXX
Sbjct: 249 EFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTXXXXXX 308

Query: 187 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXT-RPGRFIL 246
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                    +  +  L
Sbjct: 309 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNGKPNEALIVFHELQLQKNMKLNQITL 368

Query: 247 SSVLAACSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKE 306
            S L+AC+ +GA++ GRW+H+Y+K++ I+++  + +AL+ MY+KCG L+   EVF  +++
Sbjct: 369 VSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEK 428

Query: 307 REIFTWNAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRI 366
           R++F W+AMIGGLA+HG   +A+++F K+QE  +KPNG+T   V  AC+H G VD+   +
Sbjct: 429 RDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESL 488

Query: 367 FQTMREFYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHG 426
           F  M   YG+ PE +HY C+VD+LGRSG   +A   I +MP+ P+ +VWGALLGAC+IH 
Sbjct: 489 FHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHA 548

Query: 427 NFDLAERVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIV 486
           N +LAE     LLELEP+N G +VLLSNIYAK+G++++VS++RK M+  G+K  PG S +
Sbjct: 549 NLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSI 608

Query: 487 DLNGTVHEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEE-KETAVN 544
           +++G +HEF  GD +HP  +++Y KL  + E+L+  G+ P+ SQVL  I+EEE KE ++N
Sbjct: 609 EIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLN 668

BLAST of CsaV3_1G038250 vs. TAIR10
Match: AT1G59720.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 448.7 bits (1153), Expect = 5.0e-126
Identity = 227/556 (40.83%), Postives = 330/556 (59.35%), Query Frame = 0

Query: 4   DARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDA 63
           ++ P+K T+P + KAC+      EG+Q+H  +VKHG G DV++ +  IH+Y S G L+ A
Sbjct: 146 ESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIVKHGFGGDVYVNNGLIHLYGSCGCLDLA 205

Query: 64  RKMF-YSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXXXX 123
           RK+F    E  +V WN+MID  ++ G  ++A  LF +M                      
Sbjct: 206 RKVFDEMPERSLVSWNSMIDALVRFGEYDSALQLFREM---------------------- 265

Query: 124 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAACS 183
                                                         P  + + SVL+AC+
Sbjct: 266 -----------------------------------------QRSFEPDGYTMQSVLSACA 325

Query: 184 NIGAIDQGRWVHAYLKRN---SIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFT 243
            +G++  G W HA+L R     + +D ++  +L++MY KCG L M  +VF+ M++R++ +
Sbjct: 326 GLGSLSLGTWAHAFLLRKCDVDVAMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLAS 385

Query: 244 WNAMIGGLAIHGRAEDALELFSKLQEGR--MKPNGITLVGVLTACAHAGFVDKGLRIFQT 303
           WNAMI G A HGRAE+A+  F ++ + R  ++PN +T VG+L AC H GFV+KG + F  
Sbjct: 386 WNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFDM 445

Query: 304 MREFYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHG-NF 363
           M   Y ++P LEHYGC+VDL+ R+G  +EA D++ SMPMKP+A +W +LL AC   G + 
Sbjct: 446 MVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVMSMPMKPDAVIWRSLLDACCKKGASV 505

Query: 364 DLAERVGKILLELEPQN-------SGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVP 423
           +L+E + + ++  +  N       SG YVLLS +YA   R++DV  +RKLM + GI+  P
Sbjct: 506 ELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVYASASRWNDVGIVRKLMSEHGIRKEP 565

Query: 424 GVSIVDLNGTVHEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQV-LFD-IDEEE 483
           G S +++NG  HEF  GD SHPQ K+IY++LK+I +RL+  G+ PD SQ  L D  ++  
Sbjct: 566 GCSSIEINGISHEFFAGDTSHPQTKQIYQQLKVIDDRLRSIGYLPDRSQAPLVDATNDGS 625

Query: 484 KETAVNYHSEKLAIAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRV 543
           KE ++  HSE+LAIAFGLIN  P   I I KNLRVC+DCH  TKLIS++F+ EIIVRDRV
Sbjct: 626 KEYSLRLHSERLAIAFGLINLPPQTPIRIFKNLRVCNDCHEVTKLISKVFNTEIIVRDRV 638

BLAST of CsaV3_1G038250 vs. TAIR10
Match: AT5G66520.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 438.0 bits (1125), Expect = 8.8e-123
Identity = 271/537 (50.47%), Postives = 371/537 (69.09%), Query Frame = 0

Query: 8   NKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDARKMF 67
           N +T+P+L KACS   A +E  QIH  + K G                            
Sbjct: 114 NAYTFPSLLKACSNLSAFEETTQIHAQITKLGY--------------------------- 173

Query: 68  YSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXXXXXXXXX 127
              E+DV   N++I+ Y   G  + A  LF ++P  XXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 174 ---ENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPXXXXXXXXXXXXXXXXXXXXXXXX 233

Query: 128 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAACSNIGAI 187
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  P    L++ L+AC+ +GA+
Sbjct: 234 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVEPDNVSLANALSACAQLGAL 293

Query: 188 DQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTWNAMIGGL 247
           +QG+W+H+YL +  I++D+VLG  L+DMYAKCG ++   EVF+ +K++ +  W A+I G 
Sbjct: 294 EQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGY 353

Query: 248 AIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMREFYGVDPE 307
           A HG   +A+  F ++Q+  +KPN IT   VLTAC++ G V++G  IF +M   Y + P 
Sbjct: 354 AYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNLKPT 413

Query: 308 LEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAERVGKILL 367
           +EHYGC+VDLLGR+GL  EA+  I  MP+KPNA +WGALL ACRIH N +L E +G+IL+
Sbjct: 414 IEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGEILI 473

Query: 368 ELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTVHEFKMGD 427
            ++P + GRYV  +NI+A   ++D  ++ R+LMK++G+  VPG S + L GT HEF  GD
Sbjct: 474 AIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFLAGD 533

Query: 428 GSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFD-IDEEEKETAVNYHSEKLAIAFGLI 487
            SHP++++I  K +I++ +L+  G+ P+  ++L D +D++E+E  V+ HSEKLAI +GLI
Sbjct: 534 RSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITYGLI 593

Query: 488 NTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCKDFW 544
            T PG  I I+KNLRVC DCH  TKLIS+I+ R+I++RDR R+HHF++G CSC D+W
Sbjct: 594 KTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCGDYW 620

BLAST of CsaV3_1G038250 vs. Swiss-Prot
Match: sp|Q9FI80|PP425_ARATH (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 490.0 bits (1260), Expect = 3.5e-137
Identity = 264/552 (47.83%), Postives = 374/552 (67.75%), Query Frame = 0

Query: 7   PNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDARKM 66
           PN+FT+P++ KAC+    +QEG+QIHG  +K+G G D  + S  + MY   G ++DAR +
Sbjct: 126 PNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFGGDEFVMSNLVRMYVMCGFMKDARVL 185

Query: 67  FYSG---------------ESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXX 126
           FY                 + ++V WN MIDGY++ G  +AA+ LF              
Sbjct: 186 FYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMRLGDCKAARMLF-------------- 245

Query: 127 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGR 186
                             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   RP  
Sbjct: 246 -----------------DXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGDIRPNY 305

Query: 187 FILSSVLAACSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEE 246
             L SVL A S +G+++ G W+H Y + + I++D VLG+AL+DMY+KCG ++    VFE 
Sbjct: 306 VTLVSVLPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFER 365

Query: 247 MKEREIFTWNAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKG 306
           +    + TW+AMI G AIHG+A DA++ F K+++  ++P+ +  + +LTAC+H G V++G
Sbjct: 366 LPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEG 425

Query: 307 LRIFQTMREFYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACR 366
            R F  M    G++P +EHYGCMVDLLGRSGL  EAE+ I +MP+KP+  +W ALLGACR
Sbjct: 426 RRYFSQMVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACR 485

Query: 367 IHGNFDLAERVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGV 426
           + GN ++ +RV  IL+++ P +SG YV LSN+YA  G + +VS++R  MK++ I+  PG 
Sbjct: 486 MQGNVEMGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGC 545

Query: 427 SIVDLNGTVHEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETA 486
           S++D++G +HEF + D SHP+ KEI   L  I ++L++AG+ P T+QVL +++EE+KE  
Sbjct: 546 SLIDIDGVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENV 605

Query: 487 VNYHSEKLAIAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHH 544
           ++YHSEK+A AFGLI+T PGK I IVKNLR+C+DCHS+ KLIS+++ R+I VRDR R+HH
Sbjct: 606 LHYHSEKIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHH 646

BLAST of CsaV3_1G038250 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 469.5 bits (1207), Expect = 4.9e-131
Identity = 228/543 (41.99%), Postives = 338/543 (62.25%), Query Frame = 0

Query: 4   DARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDA 63
           + RP++ T  T+  AC+ + +++ GRQ+H  +  HG GS++ I +A I +Y+  G LE A
Sbjct: 261 NVRPDESTMVTVVSACAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETA 320

Query: 64  RKMFYS-GESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXXXX 123
             +F      DV+ WNT+I GY    + + A  LF +M                      
Sbjct: 321 CGLFERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEM---------------------- 380

Query: 124 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAACS 183
                                                         P    + S+L AC+
Sbjct: 381 ----------------------------------------LRSGETPNDVTMLSILPACA 440

Query: 184 NIGAIDQGRWVHAYLKR--NSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 243
           ++GAID GRW+H Y+ +    +   + L T+L+DMYAKCG ++   +VF  +  + + +W
Sbjct: 441 HLGAIDIGRWIHVYIDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSW 500

Query: 244 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 303
           NAMI G A+HGRA+ + +LFS++++  ++P+ IT VG+L+AC+H+G +D G  IF+TM +
Sbjct: 501 NAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQ 560

Query: 304 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 363
            Y + P+LEHYGCM+DLLG SGLF EAE++IN M M+P+  +W +LL AC++HGN +L E
Sbjct: 561 DYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGE 620

Query: 364 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 423
              + L+++EP+N G YVLLSNIYA  GR+++V+K R L+ D+G+K VPG S ++++  V
Sbjct: 621 SFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVV 680

Query: 424 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 483
           HEF +GD  HP+ +EIY  L+ ++  L+ AG  PDTS+VL +++EE KE A+ +HSEKLA
Sbjct: 681 HEFIIGDKFHPRNREIYGMLEEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLA 740

Query: 484 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 543
           IAFGLI+T PG ++ IVKNLRVC +CH ATKLIS+I+ REII RDR R+HHF++G CSC 
Sbjct: 741 IAFGLISTKPGTKLTIVKNLRVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCN 741

BLAST of CsaV3_1G038250 vs. Swiss-Prot
Match: sp|O82380|PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 458.4 bits (1178), Expect = 1.1e-127
Identity = 274/610 (44.92%), Postives = 376/610 (61.64%), Query Frame = 0

Query: 7   PNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDARKM 66
           PNK+T+P L KA +   ++  G+ +HG  VK  +GSDV + ++ IH Y S G L+ A K+
Sbjct: 129 PNKYTFPFLIKAAAEVSSLSLGQSLHGMAVKSAVGSDVFVANSLIHCYFSCGDLDSACKV 188

Query: 67  F----------------------------------------------------------- 126
           F                                                           
Sbjct: 189 FTTIKEKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVTMVGVLSACAKIRNL 248

Query: 127 --------YSGES----DVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXX 186
                   Y  E+    ++   N M+D Y KCG +E AK LF  M  K      XXXXXX
Sbjct: 249 EFGRQVCSYIEENRVNVNLTLANAMLDMYTKCGSIEDAKRLFDAMEEKDNVTWTXXXXXX 308

Query: 187 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXT-RPGRFIL 246
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                    +  +  L
Sbjct: 309 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNGKPNEALIVFHELQLQKNMKLNQITL 368

Query: 247 SSVLAACSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKE 306
            S L+AC+ +GA++ GRW+H+Y+K++ I+++  + +AL+ MY+KCG L+   EVF  +++
Sbjct: 369 VSTLSACAQVGALELGRWIHSYIKKHGIRMNFHVTSALIHMYSKCGDLEKSREVFNSVEK 428

Query: 307 REIFTWNAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRI 366
           R++F W+AMIGGLA+HG   +A+++F K+QE  +KPNG+T   V  AC+H G VD+   +
Sbjct: 429 RDVFVWSAMIGGLAMHGCGNEAVDMFYKMQEANVKPNGVTFTNVFCACSHTGLVDEAESL 488

Query: 367 FQTMREFYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHG 426
           F  M   YG+ PE +HY C+VD+LGRSG   +A   I +MP+ P+ +VWGALLGAC+IH 
Sbjct: 489 FHQMESNYGIVPEEKHYACIVDVLGRSGYLEKAVKFIEAMPIPPSTSVWGALLGACKIHA 548

Query: 427 NFDLAERVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIV 486
           N +LAE     LLELEP+N G +VLLSNIYAK+G++++VS++RK M+  G+K  PG S +
Sbjct: 549 NLNLAEMACTRLLELEPRNDGAHVLLSNIYAKLGKWENVSELRKHMRVTGLKKEPGCSSI 608

Query: 487 DLNGTVHEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEE-KETAVN 544
           +++G +HEF  GD +HP  +++Y KL  + E+L+  G+ P+ SQVL  I+EEE KE ++N
Sbjct: 609 EIDGMIHEFLSGDNAHPMSEKVYGKLHEVMEKLKSNGYEPEISQVLQIIEEEEMKEQSLN 668

BLAST of CsaV3_1G038250 vs. Swiss-Prot
Match: sp|Q0WQW5|PPR85_ARATH (Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H51 PE=1 SV=2)

HSP 1 Score: 448.7 bits (1153), Expect = 9.0e-125
Identity = 227/556 (40.83%), Postives = 330/556 (59.35%), Query Frame = 0

Query: 4   DARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDA 63
           ++ P+K T+P + KAC+      EG+Q+H  +VKHG G DV++ +  IH+Y S G L+ A
Sbjct: 146 ESSPDKHTFPFVLKACAYIFGFSEGKQVHCQIVKHGFGGDVYVNNGLIHLYGSCGCLDLA 205

Query: 64  RKMF-YSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXXXX 123
           RK+F    E  +V WN+MID  ++ G  ++A  LF +M                      
Sbjct: 206 RKVFDEMPERSLVSWNSMIDALVRFGEYDSALQLFREM---------------------- 265

Query: 124 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAACS 183
                                                         P  + + SVL+AC+
Sbjct: 266 -----------------------------------------QRSFEPDGYTMQSVLSACA 325

Query: 184 NIGAIDQGRWVHAYLKRN---SIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFT 243
            +G++  G W HA+L R     + +D ++  +L++MY KCG L M  +VF+ M++R++ +
Sbjct: 326 GLGSLSLGTWAHAFLLRKCDVDVAMDVLVKNSLIEMYCKCGSLRMAEQVFQGMQKRDLAS 385

Query: 244 WNAMIGGLAIHGRAEDALELFSKLQEGR--MKPNGITLVGVLTACAHAGFVDKGLRIFQT 303
           WNAMI G A HGRAE+A+  F ++ + R  ++PN +T VG+L AC H GFV+KG + F  
Sbjct: 386 WNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNSVTFVGLLIACNHRGFVNKGRQYFDM 445

Query: 304 MREFYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHG-NF 363
           M   Y ++P LEHYGC+VDL+ R+G  +EA D++ SMPMKP+A +W +LL AC   G + 
Sbjct: 446 MVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVMSMPMKPDAVIWRSLLDACCKKGASV 505

Query: 364 DLAERVGKILLELEPQN-------SGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVP 423
           +L+E + + ++  +  N       SG YVLLS +YA   R++DV  +RKLM + GI+  P
Sbjct: 506 ELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVYASASRWNDVGIVRKLMSEHGIRKEP 565

Query: 424 GVSIVDLNGTVHEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQV-LFD-IDEEE 483
           G S +++NG  HEF  GD SHPQ K+IY++LK+I +RL+  G+ PD SQ  L D  ++  
Sbjct: 566 GCSSIEINGISHEFFAGDTSHPQTKQIYQQLKVIDDRLRSIGYLPDRSQAPLVDATNDGS 625

Query: 484 KETAVNYHSEKLAIAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRV 543
           KE ++  HSE+LAIAFGLIN  P   I I KNLRVC+DCH  TKLIS++F+ EIIVRDRV
Sbjct: 626 KEYSLRLHSERLAIAFGLINLPPQTPIRIFKNLRVCNDCHEVTKLISKVFNTEIIVRDRV 638

BLAST of CsaV3_1G038250 vs. Swiss-Prot
Match: sp|Q9FJY7|PP449_ARATH (Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H61 PE=2 SV=1)

HSP 1 Score: 438.0 bits (1125), Expect = 1.6e-121
Identity = 271/537 (50.47%), Postives = 371/537 (69.09%), Query Frame = 0

Query: 8   NKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRLEDARKMF 67
           N +T+P+L KACS   A +E  QIH  + K G                            
Sbjct: 114 NAYTFPSLLKACSNLSAFEETTQIHAQITKLGY--------------------------- 173

Query: 68  YSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXXXXXXXXX 127
              E+DV   N++I+ Y   G  + A  LF ++P  XXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 174 ---ENDVYAVNSLINSYAVTGNFKLAHLLFDRIPEPXXXXXXXXXXXXXXXXXXXXXXXX 233

Query: 128 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAACSNIGAI 187
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  P    L++ L+AC+ +GA+
Sbjct: 234 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVEPDNVSLANALSACAQLGAL 293

Query: 188 DQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTWNAMIGGL 247
           +QG+W+H+YL +  I++D+VLG  L+DMYAKCG ++   EVF+ +K++ +  W A+I G 
Sbjct: 294 EQGKWIHSYLNKTRIRMDSVLGCVLIDMYAKCGEMEEALEVFKNIKKKSVQAWTALISGY 353

Query: 248 AIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMREFYGVDPE 307
           A HG   +A+  F ++Q+  +KPN IT   VLTAC++ G V++G  IF +M   Y + P 
Sbjct: 354 AYHGHGREAISKFMEMQKMGIKPNVITFTAVLTACSYTGLVEEGKLIFYSMERDYNLKPT 413

Query: 308 LEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAERVGKILL 367
           +EHYGC+VDLLGR+GL  EA+  I  MP+KPNA +WGALL ACRIH N +L E +G+IL+
Sbjct: 414 IEHYGCIVDLLGRAGLLDEAKRFIQEMPLKPNAVIWGALLKACRIHKNIELGEEIGEILI 473

Query: 368 ELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTVHEFKMGD 427
            ++P + GRYV  +NI+A   ++D  ++ R+LMK++G+  VPG S + L GT HEF  GD
Sbjct: 474 AIDPYHGGRYVHKANIHAMDKKWDKAAETRRLMKEQGVAKVPGCSTISLEGTTHEFLAGD 533

Query: 428 GSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFD-IDEEEKETAVNYHSEKLAIAFGLI 487
            SHP++++I  K +I++ +L+  G+ P+  ++L D +D++E+E  V+ HSEKLAI +GLI
Sbjct: 534 RSHPEIEKIQSKWRIMRRKLEENGYVPELEEMLLDLVDDDEREAIVHQHSEKLAITYGLI 593

Query: 488 NTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCKDFW 544
            T PG  I I+KNLRVC DCH  TKLIS+I+ R+I++RDR R+HHF++G CSC D+W
Sbjct: 594 KTKPGTIIRIMKNLRVCKDCHKVTKLISKIYKRDIVMRDRTRFHHFRDGKCSCGDYW 620

BLAST of CsaV3_1G038250 vs. TrEMBL
Match: tr|A0A0A0LYD3|A0A0A0LYD3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G561400 PE=4 SV=1)

HSP 1 Score: 972.2 bits (2512), Expect = 4.7e-280
Identity = 543/543 (100.00%), Postives = 543/543 (100.00%), Query Frame = 0

Query: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60
           MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL
Sbjct: 124 MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 183

Query: 61  EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 120
           EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX
Sbjct: 184 EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 243

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA
Sbjct: 244 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 303

Query: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240
           CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW
Sbjct: 304 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 363

Query: 241 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 300
           NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE
Sbjct: 364 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 423

Query: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 360
           FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE
Sbjct: 424 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 483

Query: 361 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 420
           RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV
Sbjct: 484 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 543

Query: 421 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 480
           HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA
Sbjct: 544 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 603

Query: 481 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 540
           IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK
Sbjct: 604 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 663

Query: 541 DFW 544
           DFW
Sbjct: 664 DFW 666

BLAST of CsaV3_1G038250 vs. TrEMBL
Match: tr|A0A1S4DY81|A0A1S4DY81_CUCME (pentatricopeptide repeat-containing protein At5g48910-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103492111 PE=4 SV=1)

HSP 1 Score: 943.3 bits (2437), Expect = 2.3e-271
Identity = 525/543 (96.69%), Postives = 536/543 (98.71%), Query Frame = 0

Query: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60
           MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSD+HIKSAGI MYASFGRL
Sbjct: 124 MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDMHIKSAGIQMYASFGRL 183

Query: 61  EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 120
           EDARK+FYSGESDVVCWNTMIDGYLKCG LEAAKGLFAQMPV+ XXXXXXXXXXXXXXXX
Sbjct: 184 EDARKLFYSGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPVRNXXXXXXXXXXXXXXXX 243

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA
Sbjct: 244 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 303

Query: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240
           CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW
Sbjct: 304 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 363

Query: 241 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 300
           NAMIGGLAIHGRAEDALELFSK+QEGRMKPNG+TLVGVLTACAHAGFVDKGLRIFQTMRE
Sbjct: 364 NAMIGGLAIHGRAEDALELFSKMQEGRMKPNGVTLVGVLTACAHAGFVDKGLRIFQTMRE 423

Query: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 360
           FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNF+LAE
Sbjct: 424 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFNLAE 483

Query: 361 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 420
           RVGKILLELEPQNSGRYVLLSNIYA VGRFDDV+KIRKLMKDRGIKT+PGVS VDLNGTV
Sbjct: 484 RVGKILLELEPQNSGRYVLLSNIYANVGRFDDVAKIRKLMKDRGIKTLPGVSTVDLNGTV 543

Query: 421 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 480
           HEFKMGDGSH QMKEIYRKLKIIKERLQMAGHSPDTSQVLFDI+EEEKETAV YHSEKLA
Sbjct: 544 HEFKMGDGSHLQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIEEEEKETAVQYHSEKLA 603

Query: 481 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 540
           IAFGLINTLPG+RIHIVKNLRVCDDCHSATKLISQI+DREIIVRDRVRYHHFKNGTCSCK
Sbjct: 604 IAFGLINTLPGERIHIVKNLRVCDDCHSATKLISQIYDREIIVRDRVRYHHFKNGTCSCK 663

Query: 541 DFW 544
           DFW
Sbjct: 664 DFW 666

BLAST of CsaV3_1G038250 vs. TrEMBL
Match: tr|A0A1S4DY82|A0A1S4DY82_CUCME (pentatricopeptide repeat-containing protein At5g48910-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103492111 PE=4 SV=1)

HSP 1 Score: 943.3 bits (2437), Expect = 2.3e-271
Identity = 525/543 (96.69%), Postives = 536/543 (98.71%), Query Frame = 0

Query: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60
           MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSD+HIKSAGI MYASFGRL
Sbjct: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDMHIKSAGIQMYASFGRL 60

Query: 61  EDARKMFYSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXXX 120
           EDARK+FYSGESDVVCWNTMIDGYLKCG LEAAKGLFAQMPV+ XXXXXXXXXXXXXXXX
Sbjct: 61  EDARKLFYSGESDVVCWNTMIDGYLKCGDLEAAKGLFAQMPVRNXXXXXXXXXXXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLAA 180

Query: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240
           CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW
Sbjct: 181 CSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFTW 240

Query: 241 NAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMRE 300
           NAMIGGLAIHGRAEDALELFSK+QEGRMKPNG+TLVGVLTACAHAGFVDKGLRIFQTMRE
Sbjct: 241 NAMIGGLAIHGRAEDALELFSKMQEGRMKPNGVTLVGVLTACAHAGFVDKGLRIFQTMRE 300

Query: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLAE 360
           FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNF+LAE
Sbjct: 301 FYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFNLAE 360

Query: 361 RVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGTV 420
           RVGKILLELEPQNSGRYVLLSNIYA VGRFDDV+KIRKLMKDRGIKT+PGVS VDLNGTV
Sbjct: 361 RVGKILLELEPQNSGRYVLLSNIYANVGRFDDVAKIRKLMKDRGIKTLPGVSTVDLNGTV 420

Query: 421 HEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKLA 480
           HEFKMGDGSH QMKEIYRKLKIIKERLQMAGHSPDTSQVLFDI+EEEKETAV YHSEKLA
Sbjct: 421 HEFKMGDGSHLQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIEEEEKETAVQYHSEKLA 480

Query: 481 IAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSCK 540
           IAFGLINTLPG+RIHIVKNLRVCDDCHSATKLISQI+DREIIVRDRVRYHHFKNGTCSCK
Sbjct: 481 IAFGLINTLPGERIHIVKNLRVCDDCHSATKLISQIYDREIIVRDRVRYHHFKNGTCSCK 540

Query: 541 DFW 544
           DFW
Sbjct: 541 DFW 543

BLAST of CsaV3_1G038250 vs. TrEMBL
Match: tr|A0A2P5AHC4|A0A2P5AHC4_9ROSA (DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_350360 PE=4 SV=1)

HSP 1 Score: 700.7 bits (1807), Expect = 2.6e-198
Identity = 392/544 (72.06%), Postives = 457/544 (84.01%), Query Frame = 0

Query: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60
           + ++ RPNKFT+P   KAC++ QAV+EG Q+H HVVKH    D H+KSAGI MYASFG +
Sbjct: 128 VAMNCRPNKFTFPAALKACTLVQAVEEGVQVHAHVVKHRFSGDGHVKSAGIQMYASFGCM 187

Query: 61  EDARKMF-YSGESDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXX 120
           E+AR+M   +GESDV+CWN MIDG                    XXXXXXXXXXXXXXXX
Sbjct: 188 EEARRMLGEAGESDVICWNAMIDGCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 247

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLA 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    +P +F+LSSVLA
Sbjct: 248 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREEIKPRKFVLSSVLA 307

Query: 181 ACSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFT 240
           AC+N+GA+DQGRW+HAY+KRN I+LDAV+GTA+LDMYAKCGRLD+ WEVFE+M+ RE FT
Sbjct: 308 ACANVGALDQGRWIHAYVKRNLIRLDAVVGTAVLDMYAKCGRLDLAWEVFEKMRPRETFT 367

Query: 241 WNAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMR 300
           WNAMIGGLA+HGRAEDA+ELFSK++  + KP+GIT V VL ACAHAG VDKGLRIF +M+
Sbjct: 368 WNAMIGGLAMHGRAEDAIELFSKMERNKSKPDGITFVNVLNACAHAGLVDKGLRIFSSMK 427

Query: 301 EFYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLA 360
           + YGVDPE+EHY C+VDL GR+GL ++AED I+SMP++PNAAV+GALLGACRIHGN  L 
Sbjct: 428 KLYGVDPEVEHYACIVDLFGRAGLLADAEDFISSMPVRPNAAVFGALLGACRIHGNVKLG 487

Query: 361 ERVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGT 420
           E+VGKILLELEP NSGRY LLSNIYAK GR+ DV K+RKLMK RGI+T PG+S++DL+GT
Sbjct: 488 EKVGKILLELEPHNSGRYALLSNIYAKAGRWKDVEKVRKLMKQRGIRTTPGISMIDLDGT 547

Query: 421 VHEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKL 480
           VHEFKMGD SHPQMKEIY  L+ I ERLQ+ G+SP+TSQVLFDI EEEKET + YH EKL
Sbjct: 548 VHEFKMGDSSHPQMKEIYLMLERIIERLQIEGYSPNTSQVLFDISEEEKETELRYHCEKL 607

Query: 481 AIAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSC 540
           AIAFGL+NT PG  I IVKNLRVC+DCHSATKLISQI++R+IIVRDRVRYHHF+NG CSC
Sbjct: 608 AIAFGLLNTTPGTTIRIVKNLRVCEDCHSATKLISQIYNRDIIVRDRVRYHHFRNGNCSC 667

Query: 541 KDFW 544
            DFW
Sbjct: 668 MDFW 671

BLAST of CsaV3_1G038250 vs. TrEMBL
Match: tr|A0A2P5D8S9|A0A2P5D8S9_PARAD (DYW domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_087790 PE=4 SV=1)

HSP 1 Score: 689.5 bits (1778), Expect = 6.1e-195
Identity = 390/544 (71.69%), Postives = 454/544 (83.46%), Query Frame = 0

Query: 1   MVIDARPNKFTYPTLFKACSVAQAVQEGRQIHGHVVKHGIGSDVHIKSAGIHMYASFGRL 60
           + ++  PNKFT P + KAC++ +AV+EG Q+H HVVKH    D H+KSAGI MYASFG +
Sbjct: 128 VAVNCMPNKFTLPAVLKACTLVKAVEEGVQVHAHVVKHRFSGDGHVKSAGIQMYASFGCM 187

Query: 61  EDARKMFYSGE-SDVVCWNTMIDGYLKCGVLEAAKGLFAQMPVKXXXXXXXXXXXXXXXX 120
           E+AR++    E SDV+CWN MIDG                    XXXXXXXXXXXXXXXX
Sbjct: 188 EEARRVLGEAEKSDVICWNAMIDGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 247

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTRPGRFILSSVLA 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   +P +F+LSSVLA
Sbjct: 248 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEEIKPRKFVLSSVLA 307

Query: 181 ACSNIGAIDQGRWVHAYLKRNSIKLDAVLGTALLDMYAKCGRLDMGWEVFEEMKEREIFT 240
           AC+N+GA+DQGRWVHAY+KRN I+LDAVLGTALLDMYAKCGRLD+ WEVFE+M+ RE FT
Sbjct: 308 ACANVGALDQGRWVHAYVKRNLIQLDAVLGTALLDMYAKCGRLDLAWEVFEKMRPRETFT 367

Query: 241 WNAMIGGLAIHGRAEDALELFSKLQEGRMKPNGITLVGVLTACAHAGFVDKGLRIFQTMR 300
           WNAMIGGLA+HGRAEDA+ELF K+Q  ++KP+GIT V VL ACAHAG VD+GLRIF +M+
Sbjct: 368 WNAMIGGLAMHGRAEDAIELFFKMQRNKLKPDGITFVNVLNACAHAGLVDEGLRIFSSMK 427

Query: 301 EFYGVDPELEHYGCMVDLLGRSGLFSEAEDLINSMPMKPNAAVWGALLGACRIHGNFDLA 360
           + YGV+PE+EHYGC+VDL GR+GL ++AED I+SMP++PNAAV+GALLGACRIHGN +L 
Sbjct: 428 KLYGVEPEVEHYGCIVDLFGRAGLLADAEDFISSMPVRPNAAVFGALLGACRIHGNVELG 487

Query: 361 ERVGKILLELEPQNSGRYVLLSNIYAKVGRFDDVSKIRKLMKDRGIKTVPGVSIVDLNGT 420
           E+VGKILLELEP NSGRY LLSNIYAK GR+ DV K+RKLMK RGI+T  G+S++DL+GT
Sbjct: 488 EKVGKILLELEPHNSGRYALLSNIYAKAGRWKDVEKVRKLMKQRGIRTTHGISMIDLDGT 547

Query: 421 VHEFKMGDGSHPQMKEIYRKLKIIKERLQMAGHSPDTSQVLFDIDEEEKETAVNYHSEKL 480
           VHEFKMGD SHPQMKEIY  L+ I ERLQ+ G+SP+TSQVLFDI EEEKET + YH EKL
Sbjct: 548 VHEFKMGDSSHPQMKEIYLMLERIIERLQIEGYSPNTSQVLFDISEEEKETELQYHCEKL 607

Query: 481 AIAFGLINTLPGKRIHIVKNLRVCDDCHSATKLISQIFDREIIVRDRVRYHHFKNGTCSC 540
           AIAFGL+NT PG  I IVKNLRVC DCHSATKLISQI+ R+IIVRDRVRYHHF+NG CSC
Sbjct: 608 AIAFGLLNTTPGTTIRIVKNLRVCKDCHSATKLISQIYHRDIIVRDRVRYHHFRNGKCSC 667

Query: 541 KDFW 544
            DFW
Sbjct: 668 MDFW 671

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004135765.17.1e-280100.00PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X1... [more]
XP_011659572.17.1e-280100.00PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X2... [more]
XP_016900941.13.5e-27196.69PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X1... [more]
XP_016900944.13.5e-27196.69PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like isoform X2... [more]
XP_023529862.11.3e-24186.56pentatricopeptide repeat-containing protein At5g48910-like isoform X2 [Cucurbita... [more]
Match NameE-valueIdentityDescription
AT5G48910.11.9e-13847.83Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G08070.12.7e-13241.99Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G29760.16.3e-12944.92Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G59720.15.0e-12640.83Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G66520.18.8e-12350.47Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9FI80|PP425_ARATH3.5e-13747.83Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
sp|Q9LN01|PPR21_ARATH4.9e-13141.99Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
sp|O82380|PP175_ARATH1.1e-12744.92Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
sp|Q0WQW5|PPR85_ARATH9.0e-12540.83Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondri... [more]
sp|Q9FJY7|PP449_ARATH1.6e-12150.47Pentatricopeptide repeat-containing protein At5g66520 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LYD3|A0A0A0LYD3_CUCSA4.7e-280100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G561400 PE=4 SV=1[more]
tr|A0A1S4DY81|A0A1S4DY81_CUCME2.3e-27196.69pentatricopeptide repeat-containing protein At5g48910-like isoform X1 OS=Cucumis... [more]
tr|A0A1S4DY82|A0A1S4DY82_CUCME2.3e-27196.69pentatricopeptide repeat-containing protein At5g48910-like isoform X2 OS=Cucumis... [more]
tr|A0A2P5AHC4|A0A2P5AHC4_9ROSA2.6e-19872.06DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_350360 ... [more]
tr|A0A2P5D8S9|A0A2P5D8S9_PARAD6.1e-19571.69DYW domain containing protein OS=Parasponia andersonii OX=3476 GN=PanWU01x14_087... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO:0008270zinc ion binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
IPR032867DYW_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_1G038250.1CsaV3_1G038250.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 409..533
e-value: 1.7E-38
score: 131.3
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 75..105
e-value: 3.9E-6
score: 26.6
coord: 379..405
e-value: 0.16
score: 12.2
coord: 137..165
e-value: 1.8E-7
score: 30.9
coord: 107..134
e-value: 1.7E-7
score: 30.9
coord: 210..236
e-value: 2.1E-5
score: 24.3
coord: 310..335
e-value: 0.0057
score: 16.7
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 238..271
e-value: 3.2E-6
score: 24.9
coord: 107..135
e-value: 6.3E-8
score: 30.3
coord: 75..104
e-value: 1.2E-5
score: 23.2
coord: 210..237
e-value: 3.3E-5
score: 21.8
coord: 137..169
e-value: 1.8E-7
score: 28.9
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 237..283
e-value: 7.3E-8
score: 32.4
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 307..337
score: 7.235
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 339..369
score: 5.722
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 135..169
score: 12.792
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 236..270
score: 11.246
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 170..204
score: 6.16
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 271..301
score: 7.191
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 8..42
score: 7.18
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 205..235
score: 9.854
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 73..103
score: 9.657
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 373..407
score: 8.44
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 104..134
score: 10.863
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 174..282
e-value: 5.1E-25
score: 89.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 283..454
e-value: 3.9E-20
score: 74.4
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 117..159
coord: 346..392
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 7..119
NoneNo IPR availablePANTHERPTHR24015:SF929SUBFAMILY NOT NAMEDcoord: 103..456
NoneNo IPR availablePANTHERPTHR24015:SF929SUBFAMILY NOT NAMEDcoord: 7..119
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 103..456