CsaV3_7G008020 (gene) Cucumber (Chinese Long) v3

NameCsaV3_7G008020
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionPentatricopeptide repeat-containing protein
Locationchr7 : 5019619 .. 5021421 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGCTGTCTGTGTGTCAATCCTGAGTTCATCAGTCTCTCCGACCTCCTTCAGGGCCGCATTAACAATTCCCATCTTCGTCAAATCCATGCCCGAGTTTTTCGTTTGCTGAAACATCAAGACAATCTAATTGCAACTCGACTTATCGGCCATTACCCACATTCTGTTGGACTCAGAGTTTTCAATCAACTCATACGCCCCAACATATTTCCTTGCAACGCCATCATCAGAGTACTTGCTGAACACAACTCTTCGTTCTTTGCTTTATCCATCTTTAAATATTTGAAGCACCTTTCACTTTCCCCTAATGACTTCACTTTTTCTTTCCTTCTCAAAGCGTTTCACCGTTCCTGCAATGCTCTCAATGTGAAACAAGTTCATACCCATGTCCTTAAAATGGGTTATTTTGGTGATTCGTTTATCTCCAATTCTCTTCTTGGAGTTTATGCGAGAGGCTTGAAGGAGATGGCTTCTGCACATAAGTTATTCGATGAAATGTCGGATAGAGAAATGGCTTGTTGTTGGACTTCTTTAATTGCTGGCTATGCCCAGATGGGTCTTGCTGAAAAGGCAATGCTGCTTTTTTTTATGATGGTCAAAGAGAATATACAGCCTGAGGATGACACCATAGTTAGTGTTCTGTCTGCTTGTTCCAAGTTGCAAATTGCGGAAATTGAGAAATGGGTTGTAGAATTAAGACAATTGGTTAATAAATGTGATTCCAAGAGGTCTTGTTGTGACTCAATTAATATTGTTCTTATTTATCTATATGGAAAGTGGGGGATGGTTGAGAAGAGTGAAGAAAAGTTCAATGAAGTTGTTGATAAGAGAAGTGTGCTTGTTTGGAATTCAATGATAAATGCATATTTTCAGAATGGTTTCCCTGTGGAAGCCTTGACCCTTTTCCGTCTAATGGTTGAAAATCCCCATTGCAAACCCAACCATGTTACAATGGTTACTGTCATTTCGGCTTGTGCTCAAATAGGAGATTTGCAGCTTGGAAGTTGGGTTCACGAAGTTCTCCAACGTGGTGGCCGTAAAGGTATTATTGCATCAAACAAAATGTTAGCCACTTCATTGATTGATATGTATTGTAAATGTGGTAGCTTGGAGAGGGCAAAGGAAGTTTTTCATCAACTAATCAACAAAGATGTAATCACCTTCAATGCCATGATCATGGGTCTTGCCGTTAACAGCAAAGGCGATGAGGCATTGAAGCTTTTTGCCCAAATGCAAGAGATTAATATAATACCATCAACTGGAACATTTATTGGCCTATTATCTGCTTGTAGCCATTCAGGGTTTCTAGAACAAGGGCGTCAAATCTTTATTGAAATGACTACGCACTATTTAGTATCACCTAGTTTAGAACACTATGCTTGTTACATTGATCTCCTTGCTCGGGCAGGCCACTTTGACGATGCTCTTGAAGTTATTTCAACCATGCCTTTTGAACCCAATAATTTTGTTTGGAGTTCTCTTCTGAGAGGCTGTCTACTCCATTCGAGGTTTGAGTTGGCACAATATGTTTCAAAGAAGCTTGTTGAAGTGGATCCTGAAAACTCTGCTGGCTACGTAATGCAGGCCAATTCATTTGCTACGGATCTTCAATGGGATGATGTCTCGGCTTTGAGATGGTTTATGAGAGAAAAGGGTGTTCATAAGCAGCCAGGCCAGAGTTGGATCAGTATAGATGGAACTGTACACGAATTTTTTTCGGCAACCAAATCACATCCTTATGTTGATTTATTATACACTACCTTGAATGAGCTTGAGAAGCAAATGAAACTAGTTATCCCATAG

mRNA sequence

ATGCGCTGTCTGTGTGTCAATCCTGAGTTCATCAGTCTCTCCGACCTCCTTCAGGGCCGCATTAACAATTCCCATCTTCGTCAAATCCATGCCCGAGTTTTTCGTTTGCTGAAACATCAAGACAATCTAATTGCAACTCGACTTATCGGCCATTACCCACATTCTGTTGGACTCAGAGTTTTCAATCAACTCATACGCCCCAACATATTTCCTTGCAACGCCATCATCAGACCTGAGGATGACACCATAGTTAGTGTTCTGTCTGCTTGTTCCAAGTTGCAAATTGCGGAAATTGAGAAATGGGTTGTAGAATTAAGACAATTGGTTAATAAATGTGATTCCAAGAGGTCTTGTTGTGACTCAATTAATATTGTTCTTATTTATCTATATGGAAAGTGGGGGATGGTTGAGAAGAGTGAAGAAAAGTTCAATGAAGTTGTTGATAAGAGAAGTGTGCTTGTTTGGAATTCAATGATAAATGCATATTTTCAGAATGGTTTCCCTGTGGAAGCCTTGACCCTTTTCCGTCTAATGGTTGAAAATCCCCATTGCAAACCCAACCATGTTACAATGGTTACTGTCATTTCGGCTTGTGCTCAAATAGGAGATTTGCAGCTTGGAAGTTGGGTTCACGAAGTTCTCCAACGTGGTGGCCGTAAAGGTATTATTGCATCAAACAAAATGTTAGCCACTTCATTGATTGATATGTATTGTAAATGTGGTAGCTTGGAGAGGGCAAAGGAAGTTTTTCATCAACTAATCAACAAAGATGTAATCACCTTCAATGCCATGATCATGGGTCTTGCCGTTAACAGCAAAGGCGATGAGGCATTGAAGCTTTTTGCCCAAATGCAAGAGATTAATATAATACCATCAACTGGAACATTTATTGGCCTATTATCTGCTTGTAGCCATTCAGGGTTTCTAGAACAAGGGCGTCAAATCTTTATTGAAATGACTACGCACTATTTAGTATCACCTAGTTTAGAACACTATGCTTGTTACATTGATCTCCTTGCTCGGGCAGGCCACTTTGACGATGCTCTTGAAGTTATTTCAACCATGCCTTTTGAACCCAATAATTTTGTTTGGAGTTCTCTTCTGAGAGGCTGTCTACTCCATTCGAGGTTTGAGTTGGCACAATATGTTTCAAAGAAGCTTGTTGAAGTGGATCCTGAAAACTCTGCTGGCTACGTAATGCAGGCCAATTCATTTGCTACGGATCTTCAATGGGATGATGTCTCGGCTTTGAGATGGTTTATGAGAGAAAAGGGTGTTCATAAGCAGCCAGGCCAGAGTTGGATCAGTATAGATGGAACTGTACACGAATTTTTTTCGGCAACCAAATCACATCCTTATGTTGATTTATTATACACTACCTTGAATGAGCTTGAGAAGCAAATGAAACTAGTTATCCCATAG

Coding sequence (CDS)

ATGCGCTGTCTGTGTGTCAATCCTGAGTTCATCAGTCTCTCCGACCTCCTTCAGGGCCGCATTAACAATTCCCATCTTCGTCAAATCCATGCCCGAGTTTTTCGTTTGCTGAAACATCAAGACAATCTAATTGCAACTCGACTTATCGGCCATTACCCACATTCTGTTGGACTCAGAGTTTTCAATCAACTCATACGCCCCAACATATTTCCTTGCAACGCCATCATCAGACCTGAGGATGACACCATAGTTAGTGTTCTGTCTGCTTGTTCCAAGTTGCAAATTGCGGAAATTGAGAAATGGGTTGTAGAATTAAGACAATTGGTTAATAAATGTGATTCCAAGAGGTCTTGTTGTGACTCAATTAATATTGTTCTTATTTATCTATATGGAAAGTGGGGGATGGTTGAGAAGAGTGAAGAAAAGTTCAATGAAGTTGTTGATAAGAGAAGTGTGCTTGTTTGGAATTCAATGATAAATGCATATTTTCAGAATGGTTTCCCTGTGGAAGCCTTGACCCTTTTCCGTCTAATGGTTGAAAATCCCCATTGCAAACCCAACCATGTTACAATGGTTACTGTCATTTCGGCTTGTGCTCAAATAGGAGATTTGCAGCTTGGAAGTTGGGTTCACGAAGTTCTCCAACGTGGTGGCCGTAAAGGTATTATTGCATCAAACAAAATGTTAGCCACTTCATTGATTGATATGTATTGTAAATGTGGTAGCTTGGAGAGGGCAAAGGAAGTTTTTCATCAACTAATCAACAAAGATGTAATCACCTTCAATGCCATGATCATGGGTCTTGCCGTTAACAGCAAAGGCGATGAGGCATTGAAGCTTTTTGCCCAAATGCAAGAGATTAATATAATACCATCAACTGGAACATTTATTGGCCTATTATCTGCTTGTAGCCATTCAGGGTTTCTAGAACAAGGGCGTCAAATCTTTATTGAAATGACTACGCACTATTTAGTATCACCTAGTTTAGAACACTATGCTTGTTACATTGATCTCCTTGCTCGGGCAGGCCACTTTGACGATGCTCTTGAAGTTATTTCAACCATGCCTTTTGAACCCAATAATTTTGTTTGGAGTTCTCTTCTGAGAGGCTGTCTACTCCATTCGAGGTTTGAGTTGGCACAATATGTTTCAAAGAAGCTTGTTGAAGTGGATCCTGAAAACTCTGCTGGCTACGTAATGCAGGCCAATTCATTTGCTACGGATCTTCAATGGGATGATGTCTCGGCTTTGAGATGGTTTATGAGAGAAAAGGGTGTTCATAAGCAGCCAGGCCAGAGTTGGATCAGTATAGATGGAACTGTACACGAATTTTTTTCGGCAACCAAATCACATCCTTATGTTGATTTATTATACACTACCTTGAATGAGCTTGAGAAGCAAATGAAACTAGTTATCCCATAG

Protein sequence

MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRVFNQLIRPNIFPCNAIIRPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEPNNFVWSSLLRGCLLHSRFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGVHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLVIP
BLAST of CsaV3_7G008020 vs. NCBI nr
Match: KGN43869.1 (hypothetical protein Csa_7G071580 [Cucumis sativus])

HSP 1 Score: 916.8 bits (2368), Expect = 3.1e-263
Identity = 473/600 (78.83%), Postives = 473/600 (78.83%), Query Frame = 0

Query: 1   MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 60
           MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV
Sbjct: 1   MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 60

Query: 61  FNQLIRPNIFPCNAIIR------------------------------------------- 120
           FNQLIRPNIFPCNAIIR                                           
Sbjct: 61  FNQLIRPNIFPCNAIIRVLAEHNSSFFALSIFKYLKHLSLSPNDFTFSFLLKAFHRSCNA 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 LNVKQVHTHVLKMGYFGDSFISNSLLGVYARGLKEMASAHKLFDEMSDREMACCWTSLIA 180

Query: 181 ------------------------PEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD 240
                                   PEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD
Sbjct: 181 GYAQMGLAEKAMLLFFMMVKENIQPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD 240

Query: 241 SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT 300
           SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT
Sbjct: 241 SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT 300

Query: 301 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL 360
           LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL
Sbjct: 301 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL 360

Query: 361 IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST 420
           IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST
Sbjct: 361 IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST 420

Query: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS 474
           GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS
Sbjct: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS 480

BLAST of CsaV3_7G008020 vs. NCBI nr
Match: XP_008455016.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucumis melo])

HSP 1 Score: 864.0 bits (2231), Expect = 2.4e-247
Identity = 444/600 (74.00%), Postives = 459/600 (76.50%), Query Frame = 0

Query: 1   MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 60
           MRCL VNPEFI+LSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV
Sbjct: 9   MRCLFVNPEFINLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 68

Query: 61  FNQLIRPNIFPCNAIIR------------------------------------------- 120
           FNQLIRPNIFPCNAIIR                                           
Sbjct: 69  FNQLIRPNIFPCNAIIRVLAEHNTSFLALSIFKSLKHLSLSPNDFTFSFLLKAFHRSCNA 128

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 129 LDVKQVHTHVLKMGYFGDSFISNALLGVYARGLKDMASAHKVFDEMSDREMACCWTSLIA 188

Query: 181 ------------------------PEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD 240
                                   PEDDT+VSVLSACSK QIAEIEKWVV LR+LVNK D
Sbjct: 189 GYAQMGLAEKAMLIFVTMIKENMQPEDDTMVSVLSACSKFQIAEIEKWVVALRELVNKFD 248

Query: 241 SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT 300
           SK SCCDSINIVLIYLYGKWGMVEKSEEKFNE++DK+SVLVWNSMINAYFQNGFPVEALT
Sbjct: 249 SKSSCCDSINIVLIYLYGKWGMVEKSEEKFNEIIDKKSVLVWNSMINAYFQNGFPVEALT 308

Query: 301 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL 360
           LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQR GRKGIIASNKMLAT+L
Sbjct: 309 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRSGRKGIIASNKMLATAL 368

Query: 361 IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST 420
           IDMYCKCGSLERAKEVFHQLINKDVI+FNAMIMGLAVN KGDEALKLFAQMQEI+I PST
Sbjct: 369 IDMYCKCGSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEIDIRPST 428

Query: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS 474
           GTFIGLLSACSHSGFLEQG QIFIEMTT YL+SPSLEHYACYIDLLARAG F+DALEV+S
Sbjct: 429 GTFIGLLSACSHSGFLEQGHQIFIEMTTQYLISPSLEHYACYIDLLARAGRFEDALEVVS 488

BLAST of CsaV3_7G008020 vs. NCBI nr
Match: XP_011658883.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Cucumis sativus])

HSP 1 Score: 814.3 bits (2102), Expect = 2.2e-232
Identity = 397/398 (99.75%), Postives = 398/398 (100.00%), Query Frame = 0

Query: 76  IRPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIVLIYLYGKWGM 135
           I+PEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIVLIYLYGKWGM
Sbjct: 71  IQPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIVLIYLYGKWGM 130

Query: 136 VEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVI 195
           VEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVI
Sbjct: 131 VEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVI 190

Query: 196 SACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCGSLERAKEVFHQLIN 255
           SACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCGSLERAKEVFHQLIN
Sbjct: 191 SACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCGSLERAKEVFHQLIN 250

Query: 256 KDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFIGLLSACSHSGFLEQGRQI 315
           KDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFIGLLSACSHSGFLEQGRQI
Sbjct: 251 KDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFIGLLSACSHSGFLEQGRQI 310

Query: 316 FIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEPNNFVWSSLLRGCLLHS 375
           FIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEPNNFVWSSLLRGCLLHS
Sbjct: 311 FIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEPNNFVWSSLLRGCLLHS 370

Query: 376 RFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGVHKQPGQSWI 435
           RFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGVHKQPGQSWI
Sbjct: 371 RFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGVHKQPGQSWI 430

Query: 436 SIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLVIP 474
           SIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLVIP
Sbjct: 431 SIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLVIP 468

BLAST of CsaV3_7G008020 vs. NCBI nr
Match: XP_022952095.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 789.3 bits (2037), Expect = 7.4e-225
Identity = 408/600 (68.00%), Postives = 442/600 (73.67%), Query Frame = 0

Query: 1   MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 60
           MR  C+N EFI+LSDLLQGRIN+S LRQIH RVFRLLKHQDNLIATRLIGHYP+SVG+RV
Sbjct: 9   MRSSCINLEFINLSDLLQGRINSSRLRQIHGRVFRLLKHQDNLIATRLIGHYPYSVGIRV 68

Query: 61  FNQLIRPNIFPCNAIIR------------------------------------------- 120
           FNQL+RPNIFPCNAIIR                                           
Sbjct: 69  FNQLLRPNIFPCNAIIRVLAESNCSFLAFSIFKSLKRLSLSPNDFTFSFLLKAFHRSSHS 128

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 129 PNVKQVHTHVMKMGYLGDSFISNALLGVYARGLKDMGSAHNMFDEMSEREMACCWTSLIA 188

Query: 181 ------------------------PEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD 240
                                   PEDDT+VSVLSACSKLQIAEIEKWV EL QLVN+ D
Sbjct: 189 GYAHMGLAEKALLLFVMMIKENIQPEDDTMVSVLSACSKLQIAEIEKWVAELTQLVNEFD 248

Query: 241 SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT 300
              SCCDSINIVL+YLYGKWG +EKSEEKFNE+VDKRS +VWNSMINAYFQNG PVEALT
Sbjct: 249 ---SCCDSINIVLVYLYGKWGKIEKSEEKFNEIVDKRSAIVWNSMINAYFQNGCPVEALT 308

Query: 301 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL 360
           LFRLM+ENPHCKPNHVTMVTV+SACAQIGDLQLG  VHE L+ GGR+GIIASNKMLAT+L
Sbjct: 309 LFRLMLENPHCKPNHVTMVTVLSACAQIGDLQLGRRVHEALEHGGRRGIIASNKMLATAL 368

Query: 361 IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST 420
           IDMYCK GSLE+AK+VFH+LI KDVI+FNAMIMGLAVN K DEALKLF+QMQE +I P+T
Sbjct: 369 IDMYCKSGSLEKAKQVFHELICKDVISFNAMIMGLAVNGKADEALKLFSQMQESDIKPTT 428

Query: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS 474
           GTFIGLLSACSHSGFLEQGRQIFI+M T Y  SPSL+HYACYIDLLARAG  +DALEV+S
Sbjct: 429 GTFIGLLSACSHSGFLEQGRQIFIQMATRYSTSPSLDHYACYIDLLARAGCVEDALEVVS 488

BLAST of CsaV3_7G008020 vs. NCBI nr
Match: XP_022972521.1 (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 776.2 bits (2003), Expect = 6.5e-221
Identity = 402/600 (67.00%), Postives = 438/600 (73.00%), Query Frame = 0

Query: 1   MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 60
           MR  C+NPEFI+LSDLLQGRIN+S LRQIH RVFRLLKHQDNLIATRLIGHYPHSVG+RV
Sbjct: 9   MRSSCINPEFINLSDLLQGRINSSRLRQIHGRVFRLLKHQDNLIATRLIGHYPHSVGIRV 68

Query: 61  FNQLIRPNIFPCNAIIR------------------------------------------- 120
           FNQL+RPNIFPCNAIIR                                           
Sbjct: 69  FNQLLRPNIFPCNAIIRVLAESNCSFLAFSIFKSLKRLSLSPNDFTFSFLLKAFHRSSHS 128

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 129 PNVKQVHTQVMKMGYLGDSFISNALLGVYARGLQDMCSAHNMFDEMSEREMACCWTSLIA 188

Query: 181 ------------------------PEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD 240
                                   P DDT+VSVLSACSKLQIAEIEKWV EL QL+N+  
Sbjct: 189 GYAHMGLVEKALLLFVMMIKENIQPVDDTMVSVLSACSKLQIAEIEKWVAELTQLINEF- 248

Query: 241 SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT 300
              SC DSINIVL+YLYGKWG +EKSEEKF+E+VDKRS +VWNSMINAYFQNG PVEALT
Sbjct: 249 --ASCGDSINIVLVYLYGKWGKIEKSEEKFSEIVDKRSAIVWNSMINAYFQNGCPVEALT 308

Query: 301 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL 360
           LFRLM+ENPHCKPNHVTMVTV+SACAQIGDLQLG  VHE L+ GGR+GIIASNKMLAT+L
Sbjct: 309 LFRLMLENPHCKPNHVTMVTVLSACAQIGDLQLGRRVHEALEHGGRRGIIASNKMLATAL 368

Query: 361 IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST 420
           IDMYCK GSLE+AK+VFH+LI KDVI+FNAMIMGLAVN K DEALKLF+QMQE +I P+T
Sbjct: 369 IDMYCKSGSLEKAKQVFHELICKDVISFNAMIMGLAVNGKADEALKLFSQMQESDIKPTT 428

Query: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS 474
           GTFIGLLSACSHSGFLEQG QIFI+M T Y  SPSLEHYACYIDLLARAG  +DAL+V+S
Sbjct: 429 GTFIGLLSACSHSGFLEQGHQIFIQMATRYSTSPSLEHYACYIDLLARAGCVEDALKVVS 488

BLAST of CsaV3_7G008020 vs. TAIR10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 294.3 bits (752), Expect = 1.4e-79
Identity = 152/392 (38.78%), Postives = 236/392 (60.20%), Query Frame = 0

Query: 76  IRPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIV--LIYLYGKW 135
           +RP++ T+V+V+SAC+       +   +EL + V+          ++ IV  LI LY K 
Sbjct: 262 VRPDESTMVTVVSACA-------QSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKC 321

Query: 136 GMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVT 195
           G +E +   F E +  + V+ WN++I  Y       EAL LF+ M+ +    PN VTM++
Sbjct: 322 GELETACGLF-ERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGE-TPNDVTMLS 381

Query: 196 VISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCGSLERAKEVFHQL 255
           ++ ACA +G + +G W+H  + +  R   + +   L TSLIDMY KCG +E A +VF+ +
Sbjct: 382 ILPACAHLGAIDIGRWIHVYIDK--RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSI 441

Query: 256 INKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFIGLLSACSHSGFLEQGR 315
           ++K + ++NAMI G A++ + D +  LF++M++I I P   TF+GLLSACSHSG L+ GR
Sbjct: 442 LHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGR 501

Query: 316 QIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEPNNFVWSSLLRGCLL 375
            IF  MT  Y ++P LEHY C IDLL  +G F +A E+I+ M  EP+  +W SLL+ C +
Sbjct: 502 HIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKM 561

Query: 376 HSRFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGVHKQPGQS 435
           H   EL +  ++ L++++PEN   YV+ +N +A+  +W++V+  R  + +KG+ K PG S
Sbjct: 562 HGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCS 621

Query: 436 WISIDGTVHEFFSATKSHPYVDLLYTTLNELE 466
            I ID  VHEF    K HP    +Y  L E+E
Sbjct: 622 SIEIDSVVHEFIIGDKFHPRNREIYGMLEEME 642

BLAST of CsaV3_7G008020 vs. TAIR10
Match: AT3G08820.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 288.1 bits (736), Expect = 9.8e-78
Identity = 142/396 (35.86%), Postives = 227/396 (57.32%), Query Frame = 0

Query: 76  IRPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIVLIYLYGKWGM 135
           ++P+   IV VLSAC  +   +  +W+V+  +     + +      +   L+ LY K G 
Sbjct: 208 VKPDSYFIVQVLSACVHVGDLDSGEWIVKYME-----EMEMQKNSFVRTTLVNLYAKCGK 267

Query: 136 VEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVI 195
           +EK+   F+ +V+K  ++ W++MI  Y  N FP E + LF  M++  + KP+  ++V  +
Sbjct: 268 MEKARSVFDSMVEK-DIVTWSTMIQGYASNSFPKEGIELFLQMLQE-NLKPDQFSIVGFL 327

Query: 196 SACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCGSLERAKEVFHQLIN 255
           S+CA +G L LG W   ++ R        +N  +A +LIDMY KCG++ R  EVF ++  
Sbjct: 328 SSCASLGALDLGEWGISLIDRHE----FLTNLFMANALIDMYAKCGAMARGFEVFKEMKE 387

Query: 256 KDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFIGLLSACSHSGFLEQGRQI 315
           KD++  NA I GLA N     +  +F Q +++ I P   TF+GLL  C H+G ++ G + 
Sbjct: 388 KDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGLLCGCVHAGLIQDGLRF 447

Query: 316 FIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEPNNFVWSSLLRGCLLHS 375
           F  ++  Y +  ++EHY C +DL  RAG  DDA  +I  MP  PN  VW +LL GC L  
Sbjct: 448 FNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRPNAIVWGALLSGCRLVK 507

Query: 376 RFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGVHKQPGQSWI 435
             +LA+ V K+L+ ++P N+  YV  +N ++   +WD+ + +R  M +KG+ K PG SWI
Sbjct: 508 DTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRDMMNKKGMKKIPGYSWI 567

Query: 436 SIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLV 472
            ++G VHEF +  KSHP  D +Y  L +L  +M+L+
Sbjct: 568 ELEGKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLM 592

BLAST of CsaV3_7G008020 vs. TAIR10
Match: AT4G30700.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 270.0 bits (689), Expect = 2.8e-72
Identity = 158/488 (32.38%), Postives = 259/488 (53.07%), Query Frame = 0

Query: 5   CVNPEFISLSDLLQGRINNSHLR---QIHARVFRLLKHQDNLIATRLIGHYPH----SVG 64
           C   +  +L D+L        LR   QIH+   +   +  + + T  I  Y       +G
Sbjct: 216 CTRLDTTTLLDILPAVAELQELRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGKIKMG 275

Query: 65  LRVFNQLIRPNIFPCNAIIR---PEDDTIVSV------------LSACSKLQIAEIEKWV 124
             +F +  +P+I   NA+I       +T +S+            L + + + +  +   +
Sbjct: 276 SALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVSGHL 335

Query: 125 VELRQLVNKC-DSKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINA 184
           + +  +   C  S      S++  L  +Y K   +E + + F+E  +K S+  WN+MI+ 
Sbjct: 336 MLIYAIHGYCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEK-SLPSWNAMISG 395

Query: 185 YFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKG 244
           Y QNG   +A++LFR M +     PN VT+  ++SACAQ+G L LG WVH+++    R  
Sbjct: 396 YTQNGLTEDAISLFREM-QKSEFSPNPVTITCILSACAQLGALSLGKWVHDLV----RST 455

Query: 245 IIASNKMLATSLIDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLF 304
              S+  ++T+LI MY KCGS+  A+ +F  +  K+ +T+N MI G  ++ +G EAL +F
Sbjct: 456 DFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNIF 515

Query: 305 AQMQEINIIPSTGTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLAR 364
            +M    I P+  TF+ +L ACSH+G +++G +IF  M   Y   PS++HYAC +D+L R
Sbjct: 516 YEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGR 575

Query: 365 AGHFDDALEVISTMPFEPNNFVWSSLLRGCLLHSRFELAQYVSKKLVEVDPENSAGYVMQ 424
           AGH   AL+ I  M  EP + VW +LL  C +H    LA+ VS+KL E+DP+N   +V+ 
Sbjct: 576 AGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLL 635

Query: 425 ANSFATDLQWDDVSALRWFMREKGVHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYTTL 470
           +N  + D  +   + +R   +++ + K PG + I I  T H F S  +SHP V  +Y  L
Sbjct: 636 SNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEKL 695

BLAST of CsaV3_7G008020 vs. TAIR10
Match: AT1G59720.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 266.5 bits (680), Expect = 3.0e-71
Identity = 135/360 (37.50%), Postives = 222/360 (61.67%), Query Frame = 0

Query: 122 INIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVEN 181
           +N  LI+LYG  G ++ + + F+E + +RS++ WNSMI+A  + G    AL LFR M  +
Sbjct: 188 VNNGLIHLYGSCGCLDLARKVFDE-MPERSLVSWNSMIDALVRFGEYDSALQLFREMQRS 247

Query: 182 PHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCG 241
              +P+  TM +V+SACA +G L LG+W H  L R      +A + ++  SLI+MYCKCG
Sbjct: 248 --FEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVD-VAMDVLVKNSLIEMYCKCG 307

Query: 242 SLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQM--QEINIIPSTGTFIGL 301
           SL  A++VF  +  +D+ ++NAMI+G A + + +EA+  F +M  +  N+ P++ TF+GL
Sbjct: 308 SLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNSVTFVGL 367

Query: 302 LSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEP 361
           L AC+H GF+ +GRQ F  M   Y + P+LEHY C +DL+ARAG+  +A++++ +MP +P
Sbjct: 368 LIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVMSMPMKP 427

Query: 362 NNFVWSSLLRGCLLH-SRFELAQYVSKKLVEVDPEN-------SAGYVMQANSFATDLQW 421
           +  +W SLL  C    +  EL++ +++ ++    +N       S  YV+ +  +A+  +W
Sbjct: 428 DAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVYASASRW 487

Query: 422 DDVSALRWFMREKGVHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLV 472
           +DV  +R  M E G+ K+PG S I I+G  HEFF+   SHP    +Y  L  ++ +++ +
Sbjct: 488 NDVGIVRKLMSEHGIRKEPGCSSIEINGISHEFFAGDTSHPQTKQIYQQLKVIDDRLRSI 543

BLAST of CsaV3_7G008020 vs. TAIR10
Match: AT2G29760.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 265.4 bits (677), Expect = 6.8e-71
Identity = 141/423 (33.33%), Postives = 237/423 (56.03%), Query Frame = 0

Query: 82  TIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIVL----IYLYGKWGMVE 141
           T+V VLSAC+K++  E        RQ+ +  +  R    ++N+ L    + +Y K G +E
Sbjct: 234 TMVGVLSACAKIRNLEFG------RQVCSYIEENRV---NVNLTLANAMLDMYTKCGSIE 293

Query: 142 KSEEKFNEVVDKRSVLVWN-------------------------------SMINAYFQNG 201
            ++  F+ + +K +V  W                                        NG
Sbjct: 294 DAKRLFDAMEEKDNV-TWTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNG 353

Query: 202 FPVEALTLFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASN 261
            P EAL +F  +    + K N +T+V+ +SACAQ+G L+LG W+H  +++ G    I  N
Sbjct: 354 KPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHG----IRMN 413

Query: 262 KMLATSLIDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQE 321
             + ++LI MY KCG LE+++EVF+ +  +DV  ++AMI GLA++  G+EA+ +F +MQE
Sbjct: 414 FHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQE 473

Query: 322 INIIPSTGTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFD 381
            N+ P+  TF  +  ACSH+G +++   +F +M ++Y + P  +HYAC +D+L R+G+ +
Sbjct: 474 ANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLE 533

Query: 382 DALEVISTMPFEPNNFVWSSLLRGCLLHSRFELAQYVSKKLVEVDPENSAGYVMQANSFA 441
            A++ I  MP  P+  VW +LL  C +H+   LA+    +L+E++P N   +V+ +N +A
Sbjct: 534 KAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYA 593

Query: 442 TDLQWDDVSALRWFMREKGVHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYTTLNELEK 470
              +W++VS LR  MR  G+ K+PG S I IDG +HEF S   +HP  + +Y  L+E+ +
Sbjct: 594 KLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVME 642

BLAST of CsaV3_7G008020 vs. Swiss-Prot
Match: sp|Q9LN01|PPR21_ARATH (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 294.3 bits (752), Expect = 2.5e-78
Identity = 152/392 (38.78%), Postives = 236/392 (60.20%), Query Frame = 0

Query: 76  IRPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIV--LIYLYGKW 135
           +RP++ T+V+V+SAC+       +   +EL + V+          ++ IV  LI LY K 
Sbjct: 262 VRPDESTMVTVVSACA-------QSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKC 321

Query: 136 GMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVT 195
           G +E +   F E +  + V+ WN++I  Y       EAL LF+ M+ +    PN VTM++
Sbjct: 322 GELETACGLF-ERLPYKDVISWNTLIGGYTHMNLYKEALLLFQEMLRSGE-TPNDVTMLS 381

Query: 196 VISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCGSLERAKEVFHQL 255
           ++ ACA +G + +G W+H  + +  R   + +   L TSLIDMY KCG +E A +VF+ +
Sbjct: 382 ILPACAHLGAIDIGRWIHVYIDK--RLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSI 441

Query: 256 INKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFIGLLSACSHSGFLEQGR 315
           ++K + ++NAMI G A++ + D +  LF++M++I I P   TF+GLLSACSHSG L+ GR
Sbjct: 442 LHKSLSSWNAMIFGFAMHGRADASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGR 501

Query: 316 QIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEPNNFVWSSLLRGCLL 375
            IF  MT  Y ++P LEHY C IDLL  +G F +A E+I+ M  EP+  +W SLL+ C +
Sbjct: 502 HIFRTMTQDYKMTPKLEHYGCMIDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKM 561

Query: 376 HSRFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGVHKQPGQS 435
           H   EL +  ++ L++++PEN   YV+ +N +A+  +W++V+  R  + +KG+ K PG S
Sbjct: 562 HGNVELGESFAENLIKIEPENPGSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCS 621

Query: 436 WISIDGTVHEFFSATKSHPYVDLLYTTLNELE 466
            I ID  VHEF    K HP    +Y  L E+E
Sbjct: 622 SIEIDSVVHEFIIGDKFHPRNREIYGMLEEME 642

BLAST of CsaV3_7G008020 vs. Swiss-Prot
Match: sp|Q9SR82|PP219_ARATH (Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H84 PE=3 SV=1)

HSP 1 Score: 288.1 bits (736), Expect = 1.8e-76
Identity = 142/396 (35.86%), Postives = 227/396 (57.32%), Query Frame = 0

Query: 76  IRPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIVLIYLYGKWGM 135
           ++P+   IV VLSAC  +   +  +W+V+  +     + +      +   L+ LY K G 
Sbjct: 208 VKPDSYFIVQVLSACVHVGDLDSGEWIVKYME-----EMEMQKNSFVRTTLVNLYAKCGK 267

Query: 136 VEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVI 195
           +EK+   F+ +V+K  ++ W++MI  Y  N FP E + LF  M++  + KP+  ++V  +
Sbjct: 268 MEKARSVFDSMVEK-DIVTWSTMIQGYASNSFPKEGIELFLQMLQE-NLKPDQFSIVGFL 327

Query: 196 SACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCGSLERAKEVFHQLIN 255
           S+CA +G L LG W   ++ R        +N  +A +LIDMY KCG++ R  EVF ++  
Sbjct: 328 SSCASLGALDLGEWGISLIDRHE----FLTNLFMANALIDMYAKCGAMARGFEVFKEMKE 387

Query: 256 KDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFIGLLSACSHSGFLEQGRQI 315
           KD++  NA I GLA N     +  +F Q +++ I P   TF+GLL  C H+G ++ G + 
Sbjct: 388 KDIVIMNAAISGLAKNGHVKLSFAVFGQTEKLGISPDGSTFLGLLCGCVHAGLIQDGLRF 447

Query: 316 FIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEPNNFVWSSLLRGCLLHS 375
           F  ++  Y +  ++EHY C +DL  RAG  DDA  +I  MP  PN  VW +LL GC L  
Sbjct: 448 FNAISCVYALKRTVEHYGCMVDLWGRAGMLDDAYRLICDMPMRPNAIVWGALLSGCRLVK 507

Query: 376 RFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGVHKQPGQSWI 435
             +LA+ V K+L+ ++P N+  YV  +N ++   +WD+ + +R  M +KG+ K PG SWI
Sbjct: 508 DTQLAETVLKELIALEPWNAGNYVQLSNIYSVGGRWDEAAEVRDMMNKKGMKKIPGYSWI 567

Query: 436 SIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLV 472
            ++G VHEF +  KSHP  D +Y  L +L  +M+L+
Sbjct: 568 ELEGKVHEFLADDKSHPLSDKIYAKLEDLGNEMRLM 592

BLAST of CsaV3_7G008020 vs. Swiss-Prot
Match: sp|Q9SUH6|PP341_ARATH (Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX=3702 GN=DYW9 PE=2 SV=1)

HSP 1 Score: 270.0 bits (689), Expect = 5.0e-71
Identity = 158/488 (32.38%), Postives = 259/488 (53.07%), Query Frame = 0

Query: 5   CVNPEFISLSDLLQGRINNSHLR---QIHARVFRLLKHQDNLIATRLIGHYPH----SVG 64
           C   +  +L D+L        LR   QIH+   +   +  + + T  I  Y       +G
Sbjct: 216 CTRLDTTTLLDILPAVAELQELRLGMQIHSLATKTGCYSHDYVLTGFISLYSKCGKIKMG 275

Query: 65  LRVFNQLIRPNIFPCNAIIR---PEDDTIVSV------------LSACSKLQIAEIEKWV 124
             +F +  +P+I   NA+I       +T +S+            L + + + +  +   +
Sbjct: 276 SALFREFRKPDIVAYNAMIHGYTSNGETELSLSLFKELMLSGARLRSSTLVSLVPVSGHL 335

Query: 125 VELRQLVNKC-DSKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINA 184
           + +  +   C  S      S++  L  +Y K   +E + + F+E  +K S+  WN+MI+ 
Sbjct: 336 MLIYAIHGYCLKSNFLSHASVSTALTTVYSKLNEIESARKLFDESPEK-SLPSWNAMISG 395

Query: 185 YFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKG 244
           Y QNG   +A++LFR M +     PN VT+  ++SACAQ+G L LG WVH+++    R  
Sbjct: 396 YTQNGLTEDAISLFREM-QKSEFSPNPVTITCILSACAQLGALSLGKWVHDLV----RST 455

Query: 245 IIASNKMLATSLIDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLF 304
              S+  ++T+LI MY KCGS+  A+ +F  +  K+ +T+N MI G  ++ +G EAL +F
Sbjct: 456 DFESSIYVSTALIGMYAKCGSIAEARRLFDLMTKKNEVTWNTMISGYGLHGQGQEALNIF 515

Query: 305 AQMQEINIIPSTGTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLAR 364
            +M    I P+  TF+ +L ACSH+G +++G +IF  M   Y   PS++HYAC +D+L R
Sbjct: 516 YEMLNSGITPTPVTFLCVLYACSHAGLVKEGDEIFNSMIHRYGFEPSVKHYACMVDILGR 575

Query: 365 AGHFDDALEVISTMPFEPNNFVWSSLLRGCLLHSRFELAQYVSKKLVEVDPENSAGYVMQ 424
           AGH   AL+ I  M  EP + VW +LL  C +H    LA+ VS+KL E+DP+N   +V+ 
Sbjct: 576 AGHLQRALQFIEAMSIEPGSSVWETLLGACRIHKDTNLARTVSEKLFELDPDNVGYHVLL 635

Query: 425 ANSFATDLQWDDVSALRWFMREKGVHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYTTL 470
           +N  + D  +   + +R   +++ + K PG + I I  T H F S  +SHP V  +Y  L
Sbjct: 636 SNIHSADRNYPQAATVRQTAKKRKLAKAPGYTLIEIGETPHVFTSGDQSHPQVKEIYEKL 695

BLAST of CsaV3_7G008020 vs. Swiss-Prot
Match: sp|Q0WQW5|PPR85_ARATH (Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H51 PE=1 SV=2)

HSP 1 Score: 266.5 bits (680), Expect = 5.5e-70
Identity = 135/360 (37.50%), Postives = 222/360 (61.67%), Query Frame = 0

Query: 122 INIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVEN 181
           +N  LI+LYG  G ++ + + F+E + +RS++ WNSMI+A  + G    AL LFR M  +
Sbjct: 188 VNNGLIHLYGSCGCLDLARKVFDE-MPERSLVSWNSMIDALVRFGEYDSALQLFREMQRS 247

Query: 182 PHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCG 241
              +P+  TM +V+SACA +G L LG+W H  L R      +A + ++  SLI+MYCKCG
Sbjct: 248 --FEPDGYTMQSVLSACAGLGSLSLGTWAHAFLLRKCDVD-VAMDVLVKNSLIEMYCKCG 307

Query: 242 SLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQM--QEINIIPSTGTFIGL 301
           SL  A++VF  +  +D+ ++NAMI+G A + + +EA+  F +M  +  N+ P++ TF+GL
Sbjct: 308 SLRMAEQVFQGMQKRDLASWNAMILGFATHGRAEEAMNFFDRMVDKRENVRPNSVTFVGL 367

Query: 302 LSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEP 361
           L AC+H GF+ +GRQ F  M   Y + P+LEHY C +DL+ARAG+  +A++++ +MP +P
Sbjct: 368 LIACNHRGFVNKGRQYFDMMVRDYCIEPALEHYGCIVDLIARAGYITEAIDMVMSMPMKP 427

Query: 362 NNFVWSSLLRGCLLH-SRFELAQYVSKKLVEVDPEN-------SAGYVMQANSFATDLQW 421
           +  +W SLL  C    +  EL++ +++ ++    +N       S  YV+ +  +A+  +W
Sbjct: 428 DAVIWRSLLDACCKKGASVELSEEIARNIIGTKEDNESSNGNCSGAYVLLSRVYASASRW 487

Query: 422 DDVSALRWFMREKGVHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLV 472
           +DV  +R  M E G+ K+PG S I I+G  HEFF+   SHP    +Y  L  ++ +++ +
Sbjct: 488 NDVGIVRKLMSEHGIRKEPGCSSIEINGISHEFFAGDTSHPQTKQIYQQLKVIDDRLRSI 543

BLAST of CsaV3_7G008020 vs. Swiss-Prot
Match: sp|O82380|PP175_ARATH (Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H33 PE=2 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 1.2e-69
Identity = 141/423 (33.33%), Postives = 237/423 (56.03%), Query Frame = 0

Query: 82  TIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIVL----IYLYGKWGMVE 141
           T+V VLSAC+K++  E        RQ+ +  +  R    ++N+ L    + +Y K G +E
Sbjct: 234 TMVGVLSACAKIRNLEFG------RQVCSYIEENRV---NVNLTLANAMLDMYTKCGSIE 293

Query: 142 KSEEKFNEVVDKRSVLVWN-------------------------------SMINAYFQNG 201
            ++  F+ + +K +V  W                                        NG
Sbjct: 294 DAKRLFDAMEEKDNV-TWTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNG 353

Query: 202 FPVEALTLFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASN 261
            P EAL +F  +    + K N +T+V+ +SACAQ+G L+LG W+H  +++ G    I  N
Sbjct: 354 KPNEALIVFHELQLQKNMKLNQITLVSTLSACAQVGALELGRWIHSYIKKHG----IRMN 413

Query: 262 KMLATSLIDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQE 321
             + ++LI MY KCG LE+++EVF+ +  +DV  ++AMI GLA++  G+EA+ +F +MQE
Sbjct: 414 FHVTSALIHMYSKCGDLEKSREVFNSVEKRDVFVWSAMIGGLAMHGCGNEAVDMFYKMQE 473

Query: 322 INIIPSTGTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFD 381
            N+ P+  TF  +  ACSH+G +++   +F +M ++Y + P  +HYAC +D+L R+G+ +
Sbjct: 474 ANVKPNGVTFTNVFCACSHTGLVDEAESLFHQMESNYGIVPEEKHYACIVDVLGRSGYLE 533

Query: 382 DALEVISTMPFEPNNFVWSSLLRGCLLHSRFELAQYVSKKLVEVDPENSAGYVMQANSFA 441
            A++ I  MP  P+  VW +LL  C +H+   LA+    +L+E++P N   +V+ +N +A
Sbjct: 534 KAVKFIEAMPIPPSTSVWGALLGACKIHANLNLAEMACTRLLELEPRNDGAHVLLSNIYA 593

Query: 442 TDLQWDDVSALRWFMREKGVHKQPGQSWISIDGTVHEFFSATKSHPYVDLLYTTLNELEK 470
              +W++VS LR  MR  G+ K+PG S I IDG +HEF S   +HP  + +Y  L+E+ +
Sbjct: 594 KLGKWENVSELRKHMRVTGLKKEPGCSSIEIDGMIHEFLSGDNAHPMSEKVYGKLHEVME 642

BLAST of CsaV3_7G008020 vs. TrEMBL
Match: tr|A0A0A0K4I3|A0A0A0K4I3_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G071580 PE=4 SV=1)

HSP 1 Score: 916.8 bits (2368), Expect = 2.0e-263
Identity = 473/600 (78.83%), Postives = 473/600 (78.83%), Query Frame = 0

Query: 1   MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 60
           MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV
Sbjct: 1   MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 60

Query: 61  FNQLIRPNIFPCNAIIR------------------------------------------- 120
           FNQLIRPNIFPCNAIIR                                           
Sbjct: 61  FNQLIRPNIFPCNAIIRVLAEHNSSFFALSIFKYLKHLSLSPNDFTFSFLLKAFHRSCNA 120

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 121 LNVKQVHTHVLKMGYFGDSFISNSLLGVYARGLKEMASAHKLFDEMSDREMACCWTSLIA 180

Query: 181 ------------------------PEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD 240
                                   PEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD
Sbjct: 181 GYAQMGLAEKAMLLFFMMVKENIQPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD 240

Query: 241 SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT 300
           SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT
Sbjct: 241 SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT 300

Query: 301 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL 360
           LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL
Sbjct: 301 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL 360

Query: 361 IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST 420
           IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST
Sbjct: 361 IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST 420

Query: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS 474
           GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS
Sbjct: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS 480

BLAST of CsaV3_7G008020 vs. TrEMBL
Match: tr|A0A1S3BZG8|A0A1S3BZG8_CUCME (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103495291 PE=4 SV=1)

HSP 1 Score: 864.0 bits (2231), Expect = 1.6e-247
Identity = 444/600 (74.00%), Postives = 459/600 (76.50%), Query Frame = 0

Query: 1   MRCLCVNPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 60
           MRCL VNPEFI+LSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV
Sbjct: 9   MRCLFVNPEFINLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRV 68

Query: 61  FNQLIRPNIFPCNAIIR------------------------------------------- 120
           FNQLIRPNIFPCNAIIR                                           
Sbjct: 69  FNQLIRPNIFPCNAIIRVLAEHNTSFLALSIFKSLKHLSLSPNDFTFSFLLKAFHRSCNA 128

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 129 LDVKQVHTHVLKMGYFGDSFISNALLGVYARGLKDMASAHKVFDEMSDREMACCWTSLIA 188

Query: 181 ------------------------PEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCD 240
                                   PEDDT+VSVLSACSK QIAEIEKWVV LR+LVNK D
Sbjct: 189 GYAQMGLAEKAMLIFVTMIKENMQPEDDTMVSVLSACSKFQIAEIEKWVVALRELVNKFD 248

Query: 241 SKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALT 300
           SK SCCDSINIVLIYLYGKWGMVEKSEEKFNE++DK+SVLVWNSMINAYFQNGFPVEALT
Sbjct: 249 SKSSCCDSINIVLIYLYGKWGMVEKSEEKFNEIIDKKSVLVWNSMINAYFQNGFPVEALT 308

Query: 301 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSL 360
           LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQR GRKGIIASNKMLAT+L
Sbjct: 309 LFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRSGRKGIIASNKMLATAL 368

Query: 361 IDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPST 420
           IDMYCKCGSLERAKEVFHQLINKDVI+FNAMIMGLAVN KGDEALKLFAQMQEI+I PST
Sbjct: 369 IDMYCKCGSLERAKEVFHQLINKDVISFNAMIMGLAVNGKGDEALKLFAQMQEIDIRPST 428

Query: 421 GTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVIS 474
           GTFIGLLSACSHSGFLEQG QIFIEMTT YL+SPSLEHYACYIDLLARAG F+DALEV+S
Sbjct: 429 GTFIGLLSACSHSGFLEQGHQIFIEMTTQYLISPSLEHYACYIDLLARAGRFEDALEVVS 488

BLAST of CsaV3_7G008020 vs. TrEMBL
Match: tr|A0A1S3C008|A0A1S3C008_CUCME (pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103495291 PE=4 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 5.8e-218
Identity = 369/398 (92.71%), Postives = 385/398 (96.73%), Query Frame = 0

Query: 76  IRPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCCDSINIVLIYLYGKWGM 135
           ++PEDDT+VSVLSACSK QIAEIEKWVV LR+LVNK DSK SCCDSINIVLIYLYGKWGM
Sbjct: 71  MQPEDDTMVSVLSACSKFQIAEIEKWVVALRELVNKFDSKSSCCDSINIVLIYLYGKWGM 130

Query: 136 VEKSEEKFNEVVDKRSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVI 195
           VEKSEEKFNE++DK+SVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVI
Sbjct: 131 VEKSEEKFNEIIDKKSVLVWNSMINAYFQNGFPVEALTLFRLMVENPHCKPNHVTMVTVI 190

Query: 196 SACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMYCKCGSLERAKEVFHQLIN 255
           SACAQIGDLQLGSWVHEVLQR GRKGIIASNKMLAT+LIDMYCKCGSLERAKEVFHQLIN
Sbjct: 191 SACAQIGDLQLGSWVHEVLQRSGRKGIIASNKMLATALIDMYCKCGSLERAKEVFHQLIN 250

Query: 256 KDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFIGLLSACSHSGFLEQGRQI 315
           KDVI+FNAMIMGLAVN KGDEALKLFAQMQEI+I PSTGTFIGLLSACSHSGFLEQG QI
Sbjct: 251 KDVISFNAMIMGLAVNGKGDEALKLFAQMQEIDIRPSTGTFIGLLSACSHSGFLEQGHQI 310

Query: 316 FIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPFEPNNFVWSSLLRGCLLHS 375
           FIEMTT YL+SPSLEHYACYIDLLARAG F+DALEV+STMPFEPNNFVWSSLLRGCLLHS
Sbjct: 311 FIEMTTQYLISPSLEHYACYIDLLARAGRFEDALEVVSTMPFEPNNFVWSSLLRGCLLHS 370

Query: 376 RFELAQYVSKKLVEVDPENSAGYVMQANSFATDLQWDDVSALRWFMREKGVHKQPGQSWI 435
            FELAQYVSKKLVEVDPENSAGYVMQANSFA+D QWDDVSALRWFMREKGVHKQPGQSWI
Sbjct: 371 SFELAQYVSKKLVEVDPENSAGYVMQANSFASDRQWDDVSALRWFMREKGVHKQPGQSWI 430

Query: 436 SIDGTVHEFFSATKSHPYVDLLYTTLNELEKQMKLVIP 474
           SIDGTVHEFFSATKSHPYVDLLY+TLNEL+KQ KLVIP
Sbjct: 431 SIDGTVHEFFSATKSHPYVDLLYSTLNELDKQTKLVIP 468

BLAST of CsaV3_7G008020 vs. TrEMBL
Match: tr|A0A2N9J9J9|A0A2N9J9J9_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS61410 PE=4 SV=1)

HSP 1 Score: 585.5 bits (1508), Expect = 1.1e-163
Identity = 302/596 (50.67%), Postives = 377/596 (63.26%), Query Frame = 0

Query: 7   NPEFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSVGLRVFNQLIR 66
           +P F  LS +LQ R+++SHL QIHA++ +L  H  NLIATRLIGHYP  + L+VF+QL  
Sbjct: 37  SPNFTHLSAILQARVSHSHLLQIHAQIIQLGAHHHNLIATRLIGHYPSYIALKVFHQLQN 96

Query: 67  PNIFPCNAII-------------------------------------------------- 126
           PNIFP NAI+                                                  
Sbjct: 97  PNIFPFNAIVRVLAGEGLFYHAFHIFKTLKKRSLSPNEFSFCFLLKACFRSSNAHYVKQI 156

Query: 127 ------------------------------------------------------------ 186
                                                                       
Sbjct: 157 HTHVVKMGFLGDPFVCNGLLAAYAKRVKDLVSARKVFDEMPDKVVVCCWTSLIAGYAQSG 216

Query: 187 -----------------RPEDDTIVSVLSACSKLQIAEIEKWVVELRQLVNKCDSKRSCC 246
                            RPE+DT+VSVLSACS L+I EIEKWV  L ++    DSK S C
Sbjct: 217 QTEDVLQFFVMMVRLNLRPENDTLVSVLSACSNLEIVEIEKWVKILTEVGKNFDSKNSSC 276

Query: 247 DSINIVLIYLYGKWGMVEKSEEKFNEVVD--KRSVLVWNSMINAYFQNGFPVEALTLFRL 306
           D++N VLIYLYGKW  +EKS E F+E+VD  KRSVL WN+MI AY QNG P+EAL++FR+
Sbjct: 277 DAVNTVLIYLYGKWEKIEKSREIFDEIVDNAKRSVLPWNAMIGAYVQNGCPMEALSIFRI 336

Query: 307 MVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNKMLATSLIDMY 366
           MVE+P+ KPNHVTMV+V+SACAQ+GDL LG WVHE ++  G KGII SNK+LAT+ IDMY
Sbjct: 337 MVEDPNPKPNHVTMVSVLSACAQVGDLDLGRWVHEYMKSKGSKGIIGSNKILATAFIDMY 396

Query: 367 CKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEINIIPSTGTFI 426
            KCGSLERAKEVF Q+++KDV++FNAMIMGLAVN +G+EAL+LF++M+E  + P+ GTF+
Sbjct: 397 SKCGSLERAKEVFDQMVSKDVVSFNAMIMGLAVNGEGEEALRLFSKMREFGLQPNAGTFL 456

Query: 427 GLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDDALEVISTMPF 474
           G+L AC HSG  E+GRQIF++MT+ + VSP LEHYACYIDLLAR G  ++ALEV+ +MPF
Sbjct: 457 GVLCACCHSGLAEKGRQIFVDMTSRFSVSPKLEHYACYIDLLARVGLVEEALEVVFSMPF 516

BLAST of CsaV3_7G008020 vs. TrEMBL
Match: tr|A0A2P4KGQ5|A0A2P4KGQ5_QUESU (Pentatricopeptide repeat-containing protein, chloroplastic OS=Quercus suber OX=58331 GN=CFP56_53897 PE=4 SV=1)

HSP 1 Score: 578.6 bits (1490), Expect = 1.3e-161
Identity = 310/606 (51.16%), Postives = 380/606 (62.71%), Query Frame = 0

Query: 1   MRCLCVNP----EFISLSDLLQGRINNSHLRQIHARVFRLLKHQDNLIATRLIGHYPHSV 60
           +R L  NP     F  LS +LQG++++S LRQIHA++F+L  HQDNLIATRLIGHYP  V
Sbjct: 24  VRFLSSNPPTSANFTDLSAILQGQVSHSRLRQIHAQIFQLGAHQDNLIATRLIGHYPLHV 83

Query: 61  GLRVFNQLIRPNIFPCNAIIR--------------------------------------- 120
            L+VF+QL  PNIFP NAIIR                                       
Sbjct: 84  ALKVFHQLQNPNIFPFNAIIRVLAEKGLFSHAFFIFKTLKQRSLSPNDFSFCFLLKACFR 143

Query: 121 ------------------------------------------------------------ 180
                                                                       
Sbjct: 144 SNNAYYVNQIHTHVVKMGFLGDPFVCNGFLAAYGKSLKDLVSARKLFDEMSDKGVVCCWT 203

Query: 181 ----------------------------PEDDTIVSVLSACSKLQIAEIEKWVVELRQLV 240
                                       PE+DT+VSVLSACS L+  +IEKWV  L ++ 
Sbjct: 204 SLISGYAQSDQTEDVLQFFLMMVQLNLQPENDTLVSVLSACSNLENVKIEKWVTVLAEIG 263

Query: 241 NKCDSKRSCCDSINIVLIYLYGKWGMVEKSEEKFNEVVD--KRSVLVWNSMINAYFQNGF 300
              DSK S CD++N VLIYLYGKWG +EKS E F+E+V   KRSVL WN+MI AY QNG 
Sbjct: 264 KNFDSKNSGCDAVNTVLIYLYGKWGRIEKSREIFDEIVHNAKRSVLPWNAMIGAYVQNGC 323

Query: 301 PVEALTLFRLMVENPHCKPNHVTMVTVISACAQIGDLQLGSWVHEVLQRGGRKGIIASNK 360
           P+EAL++FRLMVE+P+ KPNHVTMV+V+SACAQIGDL LG WVHE ++  G KGII SNK
Sbjct: 324 PMEALSIFRLMVEDPNHKPNHVTMVSVLSACAQIGDLDLGRWVHEYMKSKGSKGIIGSNK 383

Query: 361 MLATSLIDMYCKCGSLERAKEVFHQLINKDVITFNAMIMGLAVNSKGDEALKLFAQMQEI 420
           +LAT+ IDMY KCGSLERAKEVF Q+++KDV++FNAMIMGLAVN +G+EAL+LF++MQE 
Sbjct: 384 ILATAFIDMYSKCGSLERAKEVFDQMVSKDVVSFNAMIMGLAVNGEGEEALRLFSKMQEF 443

Query: 421 NIIPSTGTFIGLLSACSHSGFLEQGRQIFIEMTTHYLVSPSLEHYACYIDLLARAGHFDD 474
            + P+ GTF+GLL AC HSG  E+GR+IF++MT+ + VSP LEHYACYIDLLAR G  ++
Sbjct: 444 GLQPNAGTFLGLLCACCHSGLAEKGREIFVDMTSRFSVSPELEHYACYIDLLARVGLVEE 503

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KGN43869.13.1e-26378.83hypothetical protein Csa_7G071580 [Cucumis sativus][more]
XP_008455016.12.4e-24774.00PREDICTED: pentatricopeptide repeat-containing protein At1g08070, chloroplastic-... [more]
XP_011658883.12.2e-23299.75PREDICTED: putative pentatricopeptide repeat-containing protein At3g08820 [Cucum... [more]
XP_022952095.17.4e-22568.00pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isofor... [more]
XP_022972521.16.5e-22167.00pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like [Cucur... [more]
Match NameE-valueIdentityDescription
AT1G08070.11.4e-7938.78Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G08820.19.8e-7835.86Pentatricopeptide repeat (PPR) superfamily protein[more]
AT4G30700.12.8e-7232.38Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G59720.13.0e-7137.50Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT2G29760.16.8e-7133.33Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9LN01|PPR21_ARATH2.5e-7838.78Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
sp|Q9SR82|PP219_ARATH1.8e-7635.86Putative pentatricopeptide repeat-containing protein At3g08820 OS=Arabidopsis th... [more]
sp|Q9SUH6|PP341_ARATH5.0e-7132.38Pentatricopeptide repeat-containing protein At4g30700 OS=Arabidopsis thaliana OX... [more]
sp|Q0WQW5|PPR85_ARATH5.5e-7037.50Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondri... [more]
sp|O82380|PP175_ARATH1.2e-6933.33Pentatricopeptide repeat-containing protein At2g29760, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0K4I3|A0A0A0K4I3_CUCSA2.0e-26378.83Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G071580 PE=4 SV=1[more]
tr|A0A1S3BZG8|A0A1S3BZG8_CUCME1.6e-24774.00pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isofor... [more]
tr|A0A1S3C008|A0A1S3C008_CUCME5.8e-21892.71pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isofor... [more]
tr|A0A2N9J9J9|A0A2N9J9J9_FAGSY1.1e-16350.67Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS61410 PE=4 SV=1[more]
tr|A0A2P4KGQ5|A0A2P4KGQ5_QUESU1.3e-16151.16Pentatricopeptide repeat-containing protein, chloroplastic OS=Quercus suber OX=5... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0005515 protein binding
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_7G008020.1CsaV3_7G008020.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 231..257
e-value: 1.6E-4
score: 19.6
coord: 154..187
e-value: 5.5E-8
score: 30.5
coord: 259..291
e-value: 7.2E-5
score: 20.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 334..355
e-value: 1.1
score: 9.6
coord: 126..149
e-value: 0.54
score: 10.5
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 256..304
e-value: 2.0E-9
score: 37.4
coord: 151..199
e-value: 1.5E-9
score: 37.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 226..256
score: 8.725
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 257..291
score: 11.224
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 187..221
score: 6.73
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 328..358
score: 7.037
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 151..181
score: 9.547
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 360..394
score: 6.204
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 292..326
score: 7.191
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 222..449
e-value: 7.3E-38
score: 132.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 73..221
e-value: 2.5E-16
score: 62.1
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 211..396
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 11..77
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 148..455
NoneNo IPR availablePANTHERPTHR24015:SF991SUBFAMILY NOT NAMEDcoord: 11..77
coord: 148..455

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CsaV3_7G008020Cucumber (Chinese Long) v3cuccucB100
CsaV3_7G008020Cucumber (Chinese Long) v3cuccucB188
CsaV3_7G008020Silver-seed gourdcarcucB0216
CsaV3_7G008020Silver-seed gourdcarcucB0329
CsaV3_7G008020Silver-seed gourdcarcucB0412
CsaV3_7G008020Silver-seed gourdcarcucB0490
CsaV3_7G008020Cucumber (Gy14) v2cgybcucB198
CsaV3_7G008020Cucumber (Gy14) v1cgycucB540
CsaV3_7G008020Cucurbita maxima (Rimu)cmacucB0057
CsaV3_7G008020Cucurbita maxima (Rimu)cmacucB0171
CsaV3_7G008020Cucurbita maxima (Rimu)cmacucB0232
CsaV3_7G008020Cucurbita maxima (Rimu)cmacucB0414
CsaV3_7G008020Cucurbita maxima (Rimu)cmacucB0589
CsaV3_7G008020Cucurbita maxima (Rimu)cmacucB0639
CsaV3_7G008020Cucurbita maxima (Rimu)cmacucB0918
CsaV3_7G008020Cucurbita maxima (Rimu)cmacucB0959
CsaV3_7G008020Cucurbita moschata (Rifu)cmocucB0160
CsaV3_7G008020Cucurbita moschata (Rifu)cmocucB0409
CsaV3_7G008020Cucurbita moschata (Rifu)cmocucB0910
CsaV3_7G008020Cucurbita moschata (Rifu)cmocucB0949
CsaV3_7G008020Cucurbita pepo (Zucchini)cpecucB0141
CsaV3_7G008020Cucurbita pepo (Zucchini)cpecucB0582
CsaV3_7G008020Cucurbita pepo (Zucchini)cpecucB0736
CsaV3_7G008020Cucurbita pepo (Zucchini)cpecucB0868
CsaV3_7G008020Cucurbita pepo (Zucchini)cpecucB0995
CsaV3_7G008020Wild cucumber (PI 183967)cpicucB123
CsaV3_7G008020Wild cucumber (PI 183967)cpicucB248
CsaV3_7G008020Bottle gourd (USVL1VR-Ls)cuclsiB558
CsaV3_7G008020Bottle gourd (USVL1VR-Ls)cuclsiB573
CsaV3_7G008020Melon (DHL92) v3.5.1cucmeB548
CsaV3_7G008020Melon (DHL92) v3.5.1cucmeB581
CsaV3_7G008020Melon (DHL92) v3.5.1cucmeB569
CsaV3_7G008020Melon (DHL92) v3.6.1cucmedB536
CsaV3_7G008020Melon (DHL92) v3.6.1cucmedB555
CsaV3_7G008020Melon (DHL92) v3.6.1cucmedB568
CsaV3_7G008020Watermelon (Charleston Gray)cucwcgB572
CsaV3_7G008020Watermelon (Charleston Gray)cucwcgB585
CsaV3_7G008020Watermelon (Charleston Gray)cucwcgB601
CsaV3_7G008020Watermelon (97103) v1cucwmB607
CsaV3_7G008020Watermelon (97103) v1cucwmB619
CsaV3_7G008020Watermelon (97103) v1cucwmB635
CsaV3_7G008020Watermelon (97103) v2cucwmbB550
CsaV3_7G008020Watermelon (97103) v2cucwmbB562
CsaV3_7G008020Watermelon (97103) v2cucwmbB563
CsaV3_7G008020Watermelon (97103) v2cucwmbB577
CsaV3_7G008020Wax gourdcucwgoB702
CsaV3_7G008020Wax gourdcucwgoB724