CsaV3_4G004520 (gene) Cucumber (Chinese Long) v3

NameCsaV3_4G004520
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionPentatricopeptide repeat-containing protein
Locationchr4 : 2826221 .. 2828571 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CCAAACGAAGAGCGGATTCTTGGTTAGACCCTAACAAAACCTCGGGGCCGTATAACGCTCTCAAGGCTCAGGAGGAGAAGTGATTTTCGAGAGAAATGAATCGTCGGAGTTTAATCTCAAGGGCGCCGGCAGGTTTCCGGCAGCTCTGTACTTCATTGAACGAGTTGATGCGCAGTCCTGCGAATAATCAACGGGGGCTCTACCCGAGGTTGTCGGCCCTGGGTGCTACTGGCGGTAGCGTGGCGAAGACAATAAACCAGTTCATCATGGAGGGAAATATCGTCAAAAAATATGAACTCGAAAAATGCATAAAGGAGCTCCGGAAGTATCGTAGATATCACCATTGTCTTCAGGTTTGTTCTTTCTCTAGAAGTTATGTTAATGAAGTTGCCACTGATGAGATGCTATCAGATTTAATTGCACGAGGTTTGTTCGCTGAAATTGGTGTGAGAAACATAAATGCATAAAATTACTGCTTCTTTCCATGTTTCTTCATTTTCTGTTGCCTTCTGTCAATTGTTGGTTGAATTTTTCCTTGCTATTTTGTTGAGATTTTTATTGGGATGACCCATCTTCAGCCCCTTAAGTGATCAATGAACTACCTTCTCCAATGAAAACAAACCTTTTGCTTTCACTTCCGTGGTTTATGATAGCGCCTAAATTGATGGCCTTGCCTGTGGCAATTTTCATTTTTCTCTCCTCCACCGAAAATGGTGAAAGAACAGCAAAAGCAAAGTGTGAGAGAAATGAAGATCGCTATAGACGTGACCAGTGCGTTAGAGCTACAATAGACTTGGTCATCCCTAACTCAATACCTTTGCTTCGGTTGCAGATAATGGAATGGATGGAGACAAGAAAAATTAACTATTCATTCACTGACTATGCCCTGCGTTTAGATCTTATATCAAAAGTTAATGGAGTAACTGCTGCGGAGAAATATTTCTATGATCTTCCACCATCTGCGAAGAATCGATGTACTTATGGAGCCCTTTTGAACTGCTATTGCAAGGAAATGATGGAGGAAAAGGCTTTGACTCTTTTTAAGAAGATGGATGAGTTGAAGATTTCTACTAGTTTGTCCTTTAACAATCTTATGACCATGTACATGAGAATGGATCACCCCGAGAAAGTACCTCCTCTAATAGGTGAGATGAAGCAGAGAGGCTTTTATCTTACTACGTTCACATACAATGTGTGGATGAACAGTTGTGCTTCCTTGAATGACATTGGAAAAGTTGAAGAAATTCTTGAGGAGATGAAAATGGAAGACAGAAACAAATTTGATTGGACGACATATTCGAATTTGGCTTCTTTCTATGTCAAGGCAGGGCAGTTTGAGAAAGCTGAATTAGCTCTTAAAAAGTTAGAGGAAGAGATGAAATCCGATAAGAACGATCGTCTTGTATACCATTGCTTGATAAGCCTCTATGCGTCGACTTCCAATCTGAGTGAGGTGAATAGGATATGGAATGCACTGAAATCAGTTTATTCAACGATGACTAATATAAGCTATCTCGTCATGCTTCAGGCCCTAAGAAAACTAAAGGATATTGAGGGTCTTAAAAGAACTTATAAAGAATGGGAATCAAATTGTCGCAACTTCGATTTGCGGATAGTTAATGATATCATCGGGGCTTACCTACAACAGGACATGTACGAAGATGCTGCAATGATCTTTGAGGATGCCACTAAGAGAAGTAAAGGACCTTTCTCTAGGGCCCGAGAAATGTTCATGGTTTACTTCTTGAAGTTGAAGCAAGTGGATTCAGCATTCAGTCATTTGGAATCAGCTTTATCTGAAAGCAAGGAGAAGGAATGGCATCCATCACTAGCAACGACAACTGCTTTTCTAAATTACTTTGAGGAAGAGAAAGATGTTGAGGGTGCTGAAGATTTTGCTAGGATTTTAAAGAGACTTAAGTGTCTAGACGCAAGTGGATACCATCTGTTGCTCAAGACTTATGTAGCGGCAGGAAAATTGGCCCCCGATATGCGAAAGAGATTGAAAGAAGACGACATTGAGATAAGTAGTGAGCTTGAGGAGTTGCTCGGAACGGTTTGCCCCCAATAAATACGTGGTCGCCAAAGATATACGTACCACTTATTGGAGTTCTTTCGAATTCATAAACATTTGTTTAATAGGTCGTAAAATGCTATTGTTACAAATGGTATTGAGTTTTAATGGAATGTCATGCATATGCAAATTGATAAATCTAGTGTATTATTAAATTACAATGATTGACTTATGAAATTTTTCATTGTAGAGGTTAAATTGTTTTACTTTTCGTGGTATAAAGACTAATTTCCCTTTTTTATAACAATTGTGTTTTTGAAAATTATGTTGATAAACA

mRNA sequence

ATGAATCGTCGGAGTTTAATCTCAAGGGCGCCGGCAGGTTTCCGGCAGCTCTGTACTTCATTGAACGAGTTGATGCGCAGTCCTGCGAATAATCAACGGGGGCTCTACCCGAGGTTGTCGGCCCTGGGTGCTACTGGCGGTAGCGTGGCGAAGACAATAAACCAGTTCATCATGGAGGGAAATATCGTCAAAAAATATGAACTCGAAAAATGCATAAAGGAGCTCCGGAAGTATCGTAGATATCACCATTGTCTTCAGATAATGGAATGGATGGAGACAAGAAAAATTAACTATTCATTCACTGACTATGCCCTGCGTTTAGATCTTATATCAAAAGTTAATGGAGTAACTGCTGCGGAGAAATATTTCTATGATCTTCCACCATCTGCGAAGAATCGATGTACTTATGGAGCCCTTTTGAACTGCTATTGCAAGGAAATGATGGAGGAAAAGGCTTTGACTCTTTTTAAGAAGATGGATGAGTTGAAGATTTCTACTAGTTTGTCCTTTAACAATCTTATGACCATGTACATGAGAATGGATCACCCCGAGAAAGTACCTCCTCTAATAGGTGAGATGAAGCAGAGAGGCTTTTATCTTACTACGTTCACATACAATGTGTGGATGAACAGTTGTGCTTCCTTGAATGACATTGGAAAAGTTGAAGAAATTCTTGAGGAGATGAAAATGGAAGACAGAAACAAATTTGATTGGACGACATATTCGAATTTGGCTTCTTTCTATGTCAAGGCAGGGCAGTTTGAGAAAGCTGAATTAGCTCTTAAAAAGTTAGAGGAAGAGATGAAATCCGATAAGAACGATCGTCTTGTATACCATTGCTTGATAAGCCTCTATGCGTCGACTTCCAATCTGAGTGAGGTGAATAGGATATGGAATGCACTGAAATCAGTTTATTCAACGATGACTAATATAAGCTATCTCGTCATGCTTCAGGCCCTAAGAAAACTAAAGGATATTGAGGGTCTTAAAAGAACTTATAAAGAATGGGAATCAAATTGTCGCAACTTCGATTTGCGGATAGTTAATGATATCATCGGGGCTTACCTACAACAGGACATGTACGAAGATGCTGCAATGATCTTTGAGGATGCCACTAAGAGAAGTAAAGGACCTTTCTCTAGGGCCCGAGAAATGTTCATGGTTTACTTCTTGAAGTTGAAGCAAGTGGATTCAGCATTCAGTCATTTGGAATCAGCTTTATCTGAAAGCAAGGAGAAGGAATGGCATCCATCACTAGCAACGACAACTGCTTTTCTAAATTACTTTGAGGAAGAGAAAGATGTTGAGGGTGCTGAAGATTTTGCTAGGATTTTAAAGAGACTTAAGTGTCTAGACGCAAGTGGATACCATCTGTTGCTCAAGACTTATGTAGCGGCAGGAAAATTGGCCCCCGATATGCGAAAGAGATTGAAAGAAGACGACATTGAGATAAGTAGTGAGCTTGAGGAGTTGCTCGGAACGGTTTGCCCCCAATAA

Coding sequence (CDS)

ATGAATCGTCGGAGTTTAATCTCAAGGGCGCCGGCAGGTTTCCGGCAGCTCTGTACTTCATTGAACGAGTTGATGCGCAGTCCTGCGAATAATCAACGGGGGCTCTACCCGAGGTTGTCGGCCCTGGGTGCTACTGGCGGTAGCGTGGCGAAGACAATAAACCAGTTCATCATGGAGGGAAATATCGTCAAAAAATATGAACTCGAAAAATGCATAAAGGAGCTCCGGAAGTATCGTAGATATCACCATTGTCTTCAGATAATGGAATGGATGGAGACAAGAAAAATTAACTATTCATTCACTGACTATGCCCTGCGTTTAGATCTTATATCAAAAGTTAATGGAGTAACTGCTGCGGAGAAATATTTCTATGATCTTCCACCATCTGCGAAGAATCGATGTACTTATGGAGCCCTTTTGAACTGCTATTGCAAGGAAATGATGGAGGAAAAGGCTTTGACTCTTTTTAAGAAGATGGATGAGTTGAAGATTTCTACTAGTTTGTCCTTTAACAATCTTATGACCATGTACATGAGAATGGATCACCCCGAGAAAGTACCTCCTCTAATAGGTGAGATGAAGCAGAGAGGCTTTTATCTTACTACGTTCACATACAATGTGTGGATGAACAGTTGTGCTTCCTTGAATGACATTGGAAAAGTTGAAGAAATTCTTGAGGAGATGAAAATGGAAGACAGAAACAAATTTGATTGGACGACATATTCGAATTTGGCTTCTTTCTATGTCAAGGCAGGGCAGTTTGAGAAAGCTGAATTAGCTCTTAAAAAGTTAGAGGAAGAGATGAAATCCGATAAGAACGATCGTCTTGTATACCATTGCTTGATAAGCCTCTATGCGTCGACTTCCAATCTGAGTGAGGTGAATAGGATATGGAATGCACTGAAATCAGTTTATTCAACGATGACTAATATAAGCTATCTCGTCATGCTTCAGGCCCTAAGAAAACTAAAGGATATTGAGGGTCTTAAAAGAACTTATAAAGAATGGGAATCAAATTGTCGCAACTTCGATTTGCGGATAGTTAATGATATCATCGGGGCTTACCTACAACAGGACATGTACGAAGATGCTGCAATGATCTTTGAGGATGCCACTAAGAGAAGTAAAGGACCTTTCTCTAGGGCCCGAGAAATGTTCATGGTTTACTTCTTGAAGTTGAAGCAAGTGGATTCAGCATTCAGTCATTTGGAATCAGCTTTATCTGAAAGCAAGGAGAAGGAATGGCATCCATCACTAGCAACGACAACTGCTTTTCTAAATTACTTTGAGGAAGAGAAAGATGTTGAGGGTGCTGAAGATTTTGCTAGGATTTTAAAGAGACTTAAGTGTCTAGACGCAAGTGGATACCATCTGTTGCTCAAGACTTATGTAGCGGCAGGAAAATTGGCCCCCGATATGCGAAAGAGATTGAAAGAAGACGACATTGAGATAAGTAGTGAGCTTGAGGAGTTGCTCGGAACGGTTTGCCCCCAATAA

Protein sequence

MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEGNIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLATTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKEDDIEISSELEELLGTVCPQ
BLAST of CsaV3_4G004520 vs. NCBI nr
Match: XP_004146883.2 (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial [Cucumis sativus] >KGN53198.1 Pentatricopeptide repeat-containing protein [Cucumis sativus])

HSP 1 Score: 996.1 bits (2574), Expect = 4.2e-287
Identity = 498/498 (100.00%), Postives = 498/498 (100.00%), Query Frame = 0

Query: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG 60
           MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG
Sbjct: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG 60

Query: 61  NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE 120
           NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE
Sbjct: 61  NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE 120

Query: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM 180
           KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM
Sbjct: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM 180

Query: 181 DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT 240
           DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT
Sbjct: 181 DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT 240

Query: 241 YSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNA 300
           YSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNA
Sbjct: 241 YSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNA 300

Query: 301 LKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDM 360
           LKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDM
Sbjct: 301 LKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDM 360

Query: 361 YEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLA 420
           YEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLA
Sbjct: 361 YEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLA 420

Query: 421 TTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKED 480
           TTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKED
Sbjct: 421 TTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKED 480

Query: 481 DIEISSELEELLGTVCPQ 499
           DIEISSELEELLGTVCPQ
Sbjct: 481 DIEISSELEELLGTVCPQ 498

BLAST of CsaV3_4G004520 vs. NCBI nr
Match: XP_008453822.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isoform X1 [Cucumis melo])

HSP 1 Score: 907.1 bits (2343), Expect = 2.6e-260
Identity = 455/498 (91.37%), Postives = 467/498 (93.78%), Query Frame = 0

Query: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG 60
           MNRRSLISRAPAG RQLCTS+ EL RSPANN RGLYPRLS LGATGGSVA+TIN+FIMEG
Sbjct: 1   MNRRSLISRAPAGLRQLCTSVAELTRSPANNHRGLYPRLSVLGATGGSVAQTINRFIMEG 60

Query: 61  NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE 120
           NIVKKYELEKCIKELRKYRRY H LQIMEWME RKINYSFTDYALRLDLISKVNG+TAAE
Sbjct: 61  NIVKKYELEKCIKELRKYRRYDHSLQIMEWMEIRKINYSFTDYALRLDLISKVNGITAAE 120

Query: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM 180
           KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKA TLFKKMDELK  TSL+FNNLMTMYMRM
Sbjct: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKASTLFKKMDELKFVTSLAFNNLMTMYMRM 180

Query: 181 DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT 240
           D PEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMED NK DWTT
Sbjct: 181 DQPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDSNKLDWTT 240

Query: 241 YSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNA 300
           +SNLASFYVKAGQ EKAELALKK+EEE+KSDK DRL YHCLISLYASTSNLSEVNRIWN 
Sbjct: 241 FSNLASFYVKAGQLEKAELALKKVEEEIKSDKKDRLAYHCLISLYASTSNLSEVNRIWNL 300

Query: 301 LKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDM 360
           LKSVY TMTN SYLVMLQAL KLKDIEGLK+TYKEWES C  FDLR+VN IIGAYLQQDM
Sbjct: 301 LKSVYPTMTNTSYLVMLQALSKLKDIEGLKKTYKEWESICHIFDLRLVNVIIGAYLQQDM 360

Query: 361 YEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLA 420
           YEDAAMIFEDA KRSKGPFSRARE FMVYFLKLKQVDSAFSHLESA+SESKEKEWHPSLA
Sbjct: 361 YEDAAMIFEDAIKRSKGPFSRAREKFMVYFLKLKQVDSAFSHLESAISESKEKEWHPSLA 420

Query: 421 TTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKED 480
           TT AFLNYFEEEKDVEGAEDFARILKRLKCLD SGYHLLLKTYVAAGK APDMR+RLKED
Sbjct: 421 TTNAFLNYFEEEKDVEGAEDFARILKRLKCLDESGYHLLLKTYVAAGKSAPDMRQRLKED 480

Query: 481 DIEISSELEELLGTVCPQ 499
           DI ISSELEELLGTVCPQ
Sbjct: 481 DIGISSELEELLGTVCPQ 498

BLAST of CsaV3_4G004520 vs. NCBI nr
Match: XP_022141328.1 (pentatricopeptide repeat-containing protein At1g02370, mitochondrial [Momordica charantia])

HSP 1 Score: 776.5 bits (2004), Expect = 5.3e-221
Identity = 389/496 (78.43%), Postives = 429/496 (86.49%), Query Frame = 0

Query: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG 60
           MNRRSLIS  PAG RQLCTS  EL R P N+QR LYPRLSALGA+GGSVA+T+NQFIMEG
Sbjct: 1   MNRRSLIS--PAGLRQLCTSAAELTRGPLNDQRRLYPRLSALGASGGSVARTLNQFIMEG 60

Query: 61  NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE 120
            IVKKYELE+CIKELRKYRRYHH LQIMEWME R IN SFTDYALRLDLISKV G+ AAE
Sbjct: 61  KIVKKYELERCIKELRKYRRYHHALQIMEWMEMRNINSSFTDYALRLDLISKVKGIAAAE 120

Query: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM 180
            YF DL PSAKNR TYGALLNCYCKEMM+EKAL L KK+DELK +++LSFNNLMTMYMRM
Sbjct: 121 SYFCDLSPSAKNRFTYGALLNCYCKEMMQEKALALSKKIDELKFASNLSFNNLMTMYMRM 180

Query: 181 DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT 240
           + PEKVPPLI EMKQRG +L+T+TYNVWMNSCASL+D+GKVEEILEEMK EDRNKFDWTT
Sbjct: 181 NQPEKVPPLIDEMKQRGIFLSTYTYNVWMNSCASLDDVGKVEEILEEMKNEDRNKFDWTT 240

Query: 241 YSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNA 300
           +SNLA+ YVK GQ +KAELALKK+EEE+KS+K   L YH LISLYASTSNLSEVNRIW A
Sbjct: 241 FSNLAAIYVKVGQLDKAELALKKVEEEIKSNKQKCLAYHFLISLYASTSNLSEVNRIWKA 300

Query: 301 LKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDM 360
           LKSVY    NISYLV+LQAL KLKD +GLK+TY EWES+C  FD R+    IGAYL+QDM
Sbjct: 301 LKSVYPMTNNISYLVILQALSKLKDFDGLKKTYNEWESSCSIFDFRLAAVTIGAYLRQDM 360

Query: 361 YEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLA 420
           Y+DAA+IFEDA KRSKGPF RAREMFM+YFLKLKQVDSA SHLESA+SESK+ EWHPS A
Sbjct: 361 YKDAALIFEDAIKRSKGPFLRAREMFMIYFLKLKQVDSALSHLESAISESKDNEWHPSPA 420

Query: 421 TTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKED 480
              AFL YFEEEKDV+GAEDFA+ILK   CLD+S YHLLLKTYVAAGK +PDMR+RLKED
Sbjct: 421 MVNAFLKYFEEEKDVKGAEDFAKILKSFNCLDSSAYHLLLKTYVAAGKSSPDMRQRLKED 480

Query: 481 DIEISSELEELLGTVC 497
            IEISSELEELL  +C
Sbjct: 481 KIEISSELEELLEAIC 494

BLAST of CsaV3_4G004520 vs. NCBI nr
Match: XP_022942939.1 (pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 760.0 bits (1961), Expect = 5.1e-216
Identity = 381/484 (78.72%), Postives = 418/484 (86.36%), Query Frame = 0

Query: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG 60
           MNRRSL+SRA AG R LCTS  E  R P N+Q+ LYPRLS LGATGGSVA+T+NQ+IMEG
Sbjct: 1   MNRRSLLSRASAGLRHLCTSTAESKRGPVNDQQRLYPRLSKLGATGGSVAQTLNQYIMEG 60

Query: 61  NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE 120
            IVKKYELE+CIKELRKYRRYHH LQIMEWME RKINYSFTDYALRLDLISKV G+ AAE
Sbjct: 61  KIVKKYELERCIKELRKYRRYHHALQIMEWMEMRKINYSFTDYALRLDLISKVKGIAAAE 120

Query: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM 180
            YF DL  SAKNR TYGALLNCYCKE+MEEKAL L KK+DELK +++LSFNNLMTMYMRM
Sbjct: 121 NYFCDLSSSAKNRFTYGALLNCYCKELMEEKALALSKKIDELKFASNLSFNNLMTMYMRM 180

Query: 181 DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT 240
           D PEKVPPLI EMK+RG +L+T+TYNVWMNSCASLN +GKVEEILEEMK EDRNKFDWTT
Sbjct: 181 DQPEKVPPLIDEMKRRGIFLSTYTYNVWMNSCASLNGVGKVEEILEEMKNEDRNKFDWTT 240

Query: 241 YSNLASFYVKAGQFEKAELALKKLEEEMKSDK-NDRLVYHCLISLYASTSNLSEVNRIWN 300
           +SNLA+ YVKAGQ EKAELALKK+E E+KS+K  DRL YH LISLYASTSN SEV RIWN
Sbjct: 241 FSNLAAIYVKAGQLEKAELALKKVENEIKSNKQQDRLAYHFLISLYASTSNRSEVYRIWN 300

Query: 301 ALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQD 360
           ALKSVY    N+SYLVMLQAL KLKD EGLK TYKEWES+C +FDLR+ +  IGAYL+QD
Sbjct: 301 ALKSVYPMTNNMSYLVMLQALSKLKDFEGLKSTYKEWESSCSSFDLRLADVTIGAYLRQD 360

Query: 361 MYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSL 420
           MYEDAA++FEDA KRSKGPF RAREMFM+YFLK KQVD A SHLESA+SES + EWHPS 
Sbjct: 361 MYEDAALVFEDAIKRSKGPFFRAREMFMIYFLKFKQVDLALSHLESAISESMDDEWHPSP 420

Query: 421 ATTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKE 480
           A   AFL YFEEEKD+EGAEDFARILKR KCLDAS YHLLLKTY AAGK APDMR+RL E
Sbjct: 421 AMANAFLMYFEEEKDIEGAEDFARILKRFKCLDASAYHLLLKTYAAAGKRAPDMRQRLIE 480

Query: 481 DDIE 484
           D+IE
Sbjct: 481 DNIE 484

BLAST of CsaV3_4G004520 vs. NCBI nr
Match: XP_008453823.1 (PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isoform X2 [Cucumis melo])

HSP 1 Score: 758.4 bits (1957), Expect = 1.5e-215
Identity = 378/411 (91.97%), Postives = 387/411 (94.16%), Query Frame = 0

Query: 88  MEWMETRKINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKNRCTYGALLNCYCKEM 147
           MEWME RKINYSFTDYALRLDLISKVNG+TAAEKYFYDLPPSAKNRCTYGALLNCYCKEM
Sbjct: 1   MEWMEIRKINYSFTDYALRLDLISKVNGITAAEKYFYDLPPSAKNRCTYGALLNCYCKEM 60

Query: 148 MEEKALTLFKKMDELKISTSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYNV 207
           MEEKA TLFKKMDELK  TSL+FNNLMTMYMRMD PEKVPPLIGEMKQRGFYLTTFTYNV
Sbjct: 61  MEEKASTLFKKMDELKFVTSLAFNNLMTMYMRMDQPEKVPPLIGEMKQRGFYLTTFTYNV 120

Query: 208 WMNSCASLNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEEE 267
           WMNSCASLNDIGKVEEILEEMKMED NK DWTT+SNLASFYVKAGQ EKAELALKK+EEE
Sbjct: 121 WMNSCASLNDIGKVEEILEEMKMEDSNKLDWTTFSNLASFYVKAGQLEKAELALKKVEEE 180

Query: 268 MKSDKNDRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDIE 327
           +KSDK DRL YHCLISLYASTSNLSEVNRIWN LKSVY TMTN SYLVMLQAL KLKDIE
Sbjct: 181 IKSDKKDRLAYHCLISLYASTSNLSEVNRIWNLLKSVYPTMTNTSYLVMLQALSKLKDIE 240

Query: 328 GLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMFM 387
           GLK+TYKEWES C  FDLR+VN IIGAYLQQDMYEDAAMIFEDA KRSKGPFSRARE FM
Sbjct: 241 GLKKTYKEWESICHIFDLRLVNVIIGAYLQQDMYEDAAMIFEDAIKRSKGPFSRAREKFM 300

Query: 388 VYFLKLKQVDSAFSHLESALSESKEKEWHPSLATTTAFLNYFEEEKDVEGAEDFARILKR 447
           VYFLKLKQVDSAFSHLESA+SESKEKEWHPSLATT AFLNYFEEEKDVEGAEDFARILKR
Sbjct: 301 VYFLKLKQVDSAFSHLESAISESKEKEWHPSLATTNAFLNYFEEEKDVEGAEDFARILKR 360

Query: 448 LKCLDASGYHLLLKTYVAAGKLAPDMRKRLKEDDIEISSELEELLGTVCPQ 499
           LKCLD SGYHLLLKTYVAAGK APDMR+RLKEDDI ISSELEELLGTVCPQ
Sbjct: 361 LKCLDESGYHLLLKTYVAAGKSAPDMRQRLKEDDIGISSELEELLGTVCPQ 411

BLAST of CsaV3_4G004520 vs. TAIR10
Match: AT1G02370.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 468.4 bits (1204), Expect = 5.6e-132
Identity = 239/470 (50.85%), Postives = 326/470 (69.36%), Query Frame = 0

Query: 32  QRGLYPRLSALGATGGSVAKTINQFIMEGNIVKKYELEKCIKELRKYRRYHHCLQIMEWM 91
           QR LY +LS L  TGG+VA+T+NQFIMEG  V+K +L +C K LRK+RR  H  +I +WM
Sbjct: 70  QRELYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEIFDWM 129

Query: 92  ETRKINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKN-RCTYGALLNCYCKEMMEE 151
           E RK+ +S +D+A+ LDLI K  G+ AAE YF +L PSAKN + TYGAL+NCYC E+ EE
Sbjct: 130 EKRKMTFSVSDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEEE 189

Query: 152 KALTLFKKMDELK-ISTSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYNVWM 211
           KA   F+ MDEL  ++ SL FNN+M+MYMR+  PEKVP L+  MKQRG      TY++WM
Sbjct: 190 KAKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIWM 249

Query: 212 NSCASLNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEEEMK 271
            SC SLND+  +E+I++EM  +   K  W T+SNLA+ Y KAG +EKA+ ALK +EE+M 
Sbjct: 250 QSCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSMEEKM- 309

Query: 272 SDKNDRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDIEGL 331
            + N+R  +H L+SLYA  S   EV R+W +LK     + N+SYLVMLQA+ KL D++G+
Sbjct: 310 -NPNNRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDLDGI 369

Query: 332 KRTYKEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMFMVY 391
           K+ + EWES C  +D+R+ N  I  YL+ +MYE+A  I + A K+SKGPFS+AR++ M++
Sbjct: 370 KKIFTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQLLMIH 429

Query: 392 FLKLKQVDSAFSHLESALSESKEK--EWHPSLATTTAFLNYFEEEKDVEGAEDFARILKR 451
            L+  + D A  HLE+A+S+S E   EW  S    + F  +FE+ KDV+GAEDF +IL  
Sbjct: 430 LLENDKADLAMKHLEAAVSDSAENKDEWGWSSELVSLFFLHFEKAKDVDGAEDFCKILSN 489

Query: 452 LKCLDASGYHLLLKTYVAAGKLAPDMRKRLKEDDIEISSELEELLGTVCP 498
            K LD+     L+KTY AA K +PDMR+RL +  IE+S E+++LL TVCP
Sbjct: 490 WKPLDSETMTFLIKTYAAAEKTSPDMRERLSQQQIEVSEEIQDLLKTVCP 537

BLAST of CsaV3_4G004520 vs. TAIR10
Match: AT4G01990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 432.2 bits (1110), Expect = 4.4e-121
Identity = 222/471 (47.13%), Postives = 310/471 (65.82%), Query Frame = 0

Query: 29  ANNQRGLYPRLSALGAT-GGSVAKTINQFIMEGNIVKKYELEKCIKELRKYRRYHHCLQI 88
           A   R +Y +LS+LG   GG + +T+NQF+MEG  VKK++L +  K+LRK+R+    L+I
Sbjct: 35  AKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDLIRYAKDLRKFRQPQRALEI 94

Query: 89  MEWMETRKINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKNRCTYGALLNCYCKEM 148
            EWME ++I ++ +D+A+RL+LI+K  G+ AAE YF  L  S KN+ TYG+LLNCYC E 
Sbjct: 95  FEWMERKEIAFTGSDHAIRLNLIAKSKGLEAAETYFNSLDDSIKNQSTYGSLLNCYCVEK 154

Query: 149 MEEKALTLFKKMDELK-ISTSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYN 208
            E KA   F+ M +L  +S SL FNNLM MYM +  PEKVP L+  MK++       TY+
Sbjct: 155 EEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVPALVVAMKEKSITPCDITYS 214

Query: 209 VWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEE 268
           +W+ SC SL D+  VE++L+EMK E    F W T++NLA+ Y+K G + KAE ALK LE 
Sbjct: 215 MWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAIYIKVGLYGKAEEALKSLEN 274

Query: 269 EMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDI 328
            M  D  D   YH LI+LY   +N SEV R+W+ LK  Y  + N SYL ML+AL KL DI
Sbjct: 275 NMNPDVRD--CYHFLINLYTGIANASEVYRVWDLLKKRYPNVNNSSYLTMLRALSKLDDI 334

Query: 329 EGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMF 388
           +G+K+ + EWES C  +D+R+ N  I +YL+Q+MYE+A  +F  A K+ KG FS+AR++ 
Sbjct: 335 DGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEAVFNGAMKKCKGQFSKARQLL 394

Query: 389 MVYFLKLKQVDSAFSHLESALSESKEKEWHPSLATTTAFLNYFEEEKDVEGAEDFARILK 448
           M++ LK  Q D A  H E+A+ + ++K W  S    ++F  +FEE KDV+GAE+F + L 
Sbjct: 395 MMHLLKNDQADLALKHFEAAVLD-QDKNWTWSSELISSFFLHFEEAKDVDGAEEFCKTLT 454

Query: 449 RLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKEDDIEISSELEELLGTVCP 498
           +   L +  Y LL+KTY+AAGK  PDM+KRL+E  I +  E E LL  +CP
Sbjct: 455 KWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGILVDEEQECLLSKICP 502

BLAST of CsaV3_4G004520 vs. TAIR10
Match: AT1G60770.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 253.1 bits (645), Expect = 3.7e-67
Identity = 162/465 (34.84%), Postives = 237/465 (50.97%), Query Frame = 0

Query: 35  LYPRLSALGATGGSVAKTINQFIMEGNIVKKYELEKCIKELRKYRRYHHCLQIMEWMETR 94
           LY RL   G T   V + +NQF+     V K+E+   IK+LR    Y+  L++ E ME R
Sbjct: 25  LYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALKLSEVMEER 84

Query: 95  KINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALT 154
            +N + +D A+ LDL++K   +TA E YF DLP ++K   TYG+LLNCYCKE++ EKA  
Sbjct: 85  GMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCKELLTEKAEG 144

Query: 155 LFKKMDELKIS-TSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCA 214
           L  KM EL I+                                                 
Sbjct: 145 LLNKMKELNITPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 204

Query: 215 SLNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEEEMKSDKN 274
                                       SN+AS YV AG  +KAE AL++L  EMK+ + 
Sbjct: 205 XXXXXXXXXXXXXXXXXXXXXXXXXXXXSNMASIYVDAGLSQKAEKALQEL--EMKNTQR 264

Query: 275 DRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTY 334
           D   Y  LI+LY     L+EV RIW +L+      +N++YL M+Q L KL D+ G +  +
Sbjct: 265 DFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDLPGAETLF 324

Query: 335 KEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKL 394
           KEW++NC  +D+RIVN +IGAY Q+ + + A  + E A +R     ++  E+FM Y++K 
Sbjct: 325 KEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIFMDYYVKS 384

Query: 395 KQVDSAFSHLESALSESKEK--EWHPSLATTTAFLNYFEEEKDVEGAEDFARILKR-LKC 454
             +  A   +  A+S  K    +W PS  T  A ++YFE++KDV GAE+   ILK     
Sbjct: 385 GDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLEILKNGTDN 444

Query: 455 LDASGYHLLLKTYVAAGKLAPDMRKRLKEDDIEISSELEELLGTV 496
           + A  +  L++TY AAGK  P MR+RLK +++E++   ++LL  V
Sbjct: 445 IGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLLDEV 487

BLAST of CsaV3_4G004520 vs. TAIR10
Match: AT4G21705.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 251.9 bits (642), Expect = 8.2e-67
Identity = 154/493 (31.24%), Postives = 258/493 (52.33%), Query Frame = 0

Query: 21  LNELMRSPAN------------NQRGLYPRLSALGATGGSVAKTINQFIMEGNIVKKYEL 80
           +N L R PAN             +  LY ++S LG    SV   +  ++  G  V   EL
Sbjct: 1   MNILRRIPANLIASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAEL 60

Query: 81  EKCIKELRKYRRYHHCLQIMEWM-ETRKINYSFTDYALRLDLISKVNGVTAAEKYFYDLP 140
            + + +LR+ +R+ H L++ +WM ET    +S T++A+ LDLI +V G   AE+YF +L 
Sbjct: 61  IRIVHDLRRRKRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLK 120

Query: 141 PSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELK-ISTSLSFNNLMTMYMRMDHPEKV 200
              KN  TYGALLNCY ++   EK+L  F+KM E+  +++SL++NN+M +Y  +   EKV
Sbjct: 121 EQYKNDKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKV 180

Query: 201 PPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTTYSNLAS 260
           P ++ EMK+       ++Y + +N+  ++ D+ ++   L +M+       DW TY+  A 
Sbjct: 181 PKVLEEMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAK 240

Query: 261 FYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNALKSVYS 320
           FY+  G  ++A   LK  E  +  +K D   Y+ LI+LYA      EV R+W+  K V  
Sbjct: 241 FYIDGGDCDRAVELLKMSENRL--EKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCK 300

Query: 321 TMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAM 380
              N  YL +LQ+L K+  +   +    EW+S+   +D R+ N +I  Y+ + M E A  
Sbjct: 301 RRINQDYLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEA 360

Query: 381 IFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALS-ESKEKEWHPSLATTTAF 440
           + ED  +R K     + E+    + +   +++AF  +++AL  E   ++W P L   T+ 
Sbjct: 361 MLEDLARRGKATTPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSV 420

Query: 441 LNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYV-AAGKLAPDMRKRLKEDDIEI 498
           L++  +E  ++  E F   L+    ++   YH L+K  +   G+    + +R+K+D IEI
Sbjct: 421 LSWVGDEGSLKEVESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEI 480

BLAST of CsaV3_4G004520 vs. TAIR10
Match: AT5G27460.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 224.6 bits (571), Expect = 1.4e-58
Identity = 142/441 (32.20%), Postives = 229/441 (51.93%), Query Frame = 0

Query: 48  SVAKTINQFIMEGNIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRK-INYSFTDYALR 107
           SV   + + I  G+ V   EL    K L +  RY   LQ+MEWME +K I +S  D ALR
Sbjct: 53  SVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQMMEWMENQKDIEFSVYDIALR 112

Query: 108 LDLISKVNGVTAAEKYFYDLPPSAKN----RCTYGALLNCYCKEMMEEKALTLFKKMDEL 167
           LDLI K +G+   E+YF  L  S+ +    +  Y  LL  Y K  M ++A  L +K++ L
Sbjct: 113 LDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKSAYLPLLRAYVKNKMVKEAEALMEKLNGL 172

Query: 168 K-ISTSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKV 227
             + T   FN +M +Y      EKV  ++  MK         +YN+WMN+C  ++ +  V
Sbjct: 173 GFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRNVLSYNLWMNACCEVSGVAAV 232

Query: 228 EEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCL 287
           E + +EM  +   +  W++   LA+ Y+K+G  EKA L L+  E+ +  ++++RL Y  L
Sbjct: 233 ETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVLEDAEKML--NRSNRLGYFFL 292

Query: 288 ISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCR 347
           I+LYAS  N   V R+W   KSV   ++ ++Y+ +L +L K  D+E  +R + EWE+ C 
Sbjct: 293 ITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLVKTGDLEEAERVFSEWEAQCF 352

Query: 348 NFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFS 407
           N+D+R+ N ++GAY++      A  +     +R   P  +  E+ M  ++K + ++ A  
Sbjct: 353 NYDVRVSNVLLGAYVRNGEIRKAESLHGCVLERGGTPNYKTWEILMEGWVKCENMEKAID 412

Query: 408 HLESALSESKEKEWHPSLATTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLK 467
            +       +   W PS     A   YFE+E+ +E A  + R L RL       Y LLL+
Sbjct: 413 AMHQVFVLMRRCHWRPSHNIVMAIAEYFEKEEKIEEATAYVRDLHRLGLASLPLYRLLLR 472

Query: 468 TYVAAGKLAPDMRKRLKEDDI 483
            +  A + A D+ + +K D +
Sbjct: 473 MHEHAKRPAYDIYEMMKLDKL 491

BLAST of CsaV3_4G004520 vs. Swiss-Prot
Match: sp|Q9FZ24|PPR4_ARATH (Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g02370 PE=2 SV=1)

HSP 1 Score: 468.4 bits (1204), Expect = 1.0e-130
Identity = 239/470 (50.85%), Postives = 326/470 (69.36%), Query Frame = 0

Query: 32  QRGLYPRLSALGATGGSVAKTINQFIMEGNIVKKYELEKCIKELRKYRRYHHCLQIMEWM 91
           QR LY +LS L  TGG+VA+T+NQFIMEG  V+K +L +C K LRK+RR  H  +I +WM
Sbjct: 70  QRELYKKLSMLSVTGGTVAETLNQFIMEGITVRKDDLFRCAKTLRKFRRPQHAFEIFDWM 129

Query: 92  ETRKINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKN-RCTYGALLNCYCKEMMEE 151
           E RK+ +S +D+A+ LDLI K  G+ AAE YF +L PSAKN + TYGAL+NCYC E+ EE
Sbjct: 130 EKRKMTFSVSDHAICLDLIGKTKGLEAAENYFNNLDPSAKNHQSTYGALMNCYCVELEEE 189

Query: 152 KALTLFKKMDELK-ISTSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYNVWM 211
           KA   F+ MDEL  ++ SL FNN+M+MYMR+  PEKVP L+  MKQRG      TY++WM
Sbjct: 190 KAKAHFEIMDELNFVNNSLPFNNMMSMYMRLSQPEKVPVLVDAMKQRGISPCGVTYSIWM 249

Query: 212 NSCASLNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEEEMK 271
            SC SLND+  +E+I++EM  +   K  W T+SNLA+ Y KAG +EKA+ ALK +EE+M 
Sbjct: 250 QSCGSLNDLDGLEKIIDEMGKDSEAKTTWNTFSNLAAIYTKAGLYEKADSALKSMEEKM- 309

Query: 272 SDKNDRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDIEGL 331
            + N+R  +H L+SLYA  S   EV R+W +LK     + N+SYLVMLQA+ KL D++G+
Sbjct: 310 -NPNNRDSHHFLMSLYAGISKGPEVYRVWESLKKARPEVNNLSYLVMLQAMSKLGDLDGI 369

Query: 332 KRTYKEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMFMVY 391
           K+ + EWES C  +D+R+ N  I  YL+ +MYE+A  I + A K+SKGPFS+AR++ M++
Sbjct: 370 KKIFTEWESKCWAYDMRLANIAINTYLKGNMYEEAEKILDGAMKKSKGPFSKARQLLMIH 429

Query: 392 FLKLKQVDSAFSHLESALSESKEK--EWHPSLATTTAFLNYFEEEKDVEGAEDFARILKR 451
            L+  + D A  HLE+A+S+S E   EW  S    + F  +FE+ KDV+GAEDF +IL  
Sbjct: 430 LLENDKADLAMKHLEAAVSDSAENKDEWGWSSELVSLFFLHFEKAKDVDGAEDFCKILSN 489

Query: 452 LKCLDASGYHLLLKTYVAAGKLAPDMRKRLKEDDIEISSELEELLGTVCP 498
            K LD+     L+KTY AA K +PDMR+RL +  IE+S E+++LL TVCP
Sbjct: 490 WKPLDSETMTFLIKTYAAAEKTSPDMRERLSQQQIEVSEEIQDLLKTVCP 537

BLAST of CsaV3_4G004520 vs. Swiss-Prot
Match: sp|Q93WC5|PP300_ARATH (Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g01990 PE=2 SV=1)

HSP 1 Score: 432.2 bits (1110), Expect = 8.0e-120
Identity = 222/471 (47.13%), Postives = 310/471 (65.82%), Query Frame = 0

Query: 29  ANNQRGLYPRLSALGAT-GGSVAKTINQFIMEGNIVKKYELEKCIKELRKYRRYHHCLQI 88
           A   R +Y +LS+LG   GG + +T+NQF+MEG  VKK++L +  K+LRK+R+    L+I
Sbjct: 35  AKKHRSIYKKLSSLGTRGGGKMEETLNQFVMEGVPVKKHDLIRYAKDLRKFRQPQRALEI 94

Query: 89  MEWMETRKINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKNRCTYGALLNCYCKEM 148
            EWME ++I ++ +D+A+RL+LI+K  G+ AAE YF  L  S KN+ TYG+LLNCYC E 
Sbjct: 95  FEWMERKEIAFTGSDHAIRLNLIAKSKGLEAAETYFNSLDDSIKNQSTYGSLLNCYCVEK 154

Query: 149 MEEKALTLFKKMDELK-ISTSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYN 208
            E KA   F+ M +L  +S SL FNNLM MYM +  PEKVP L+  MK++       TY+
Sbjct: 155 EEVKAKAHFENMVDLNHVSNSLPFNNLMAMYMGLGQPEKVPALVVAMKEKSITPCDITYS 214

Query: 209 VWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEE 268
           +W+ SC SL D+  VE++L+EMK E    F W T++NLA+ Y+K G + KAE ALK LE 
Sbjct: 215 MWIQSCGSLKDLDGVEKVLDEMKAEGEGIFSWNTFANLAAIYIKVGLYGKAEEALKSLEN 274

Query: 269 EMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDI 328
            M  D  D   YH LI+LY   +N SEV R+W+ LK  Y  + N SYL ML+AL KL DI
Sbjct: 275 NMNPDVRD--CYHFLINLYTGIANASEVYRVWDLLKKRYPNVNNSSYLTMLRALSKLDDI 334

Query: 329 EGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMF 388
           +G+K+ + EWES C  +D+R+ N  I +YL+Q+MYE+A  +F  A K+ KG FS+AR++ 
Sbjct: 335 DGVKKVFAEWESTCWTYDMRMANVAISSYLKQNMYEEAEAVFNGAMKKCKGQFSKARQLL 394

Query: 389 MVYFLKLKQVDSAFSHLESALSESKEKEWHPSLATTTAFLNYFEEEKDVEGAEDFARILK 448
           M++ LK  Q D A  H E+A+ + ++K W  S    ++F  +FEE KDV+GAE+F + L 
Sbjct: 395 MMHLLKNDQADLALKHFEAAVLD-QDKNWTWSSELISSFFLHFEEAKDVDGAEEFCKTLT 454

Query: 449 RLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKEDDIEISSELEELLGTVCP 498
           +   L +  Y LL+KTY+AAGK  PDM+KRL+E  I +  E E LL  +CP
Sbjct: 455 KWSPLSSETYTLLMKTYLAAGKACPDMKKRLEEQGILVDEEQECLLSKICP 502

BLAST of CsaV3_4G004520 vs. Swiss-Prot
Match: sp|O22714|PPR86_ARATH (Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX=3702 GN=At1g60770 PE=2 SV=1)

HSP 1 Score: 253.1 bits (645), Expect = 6.6e-66
Identity = 162/465 (34.84%), Postives = 237/465 (50.97%), Query Frame = 0

Query: 35  LYPRLSALGATGGSVAKTINQFIMEGNIVKKYELEKCIKELRKYRRYHHCLQIMEWMETR 94
           LY RL   G T   V + +NQF+     V K+E+   IK+LR    Y+  L++ E ME R
Sbjct: 25  LYNRLFKDGGTEVKVRQQLNQFLKGTKHVFKWEVGDTIKKLRNRGLYYPALKLSEVMEER 84

Query: 95  KINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALT 154
            +N + +D A+ LDL++K   +TA E YF DLP ++K   TYG+LLNCYCKE++ EKA  
Sbjct: 85  GMNKTVSDQAIHLDLVAKAREITAGENYFVDLPETSKTELTYGSLLNCYCKELLTEKAEG 144

Query: 155 LFKKMDELKIS-TSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCA 214
           L  KM EL I+                                                 
Sbjct: 145 LLNKMKELNITPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 204

Query: 215 SLNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEEEMKSDKN 274
                                       SN+AS YV AG  +KAE AL++L  EMK+ + 
Sbjct: 205 XXXXXXXXXXXXXXXXXXXXXXXXXXXXSNMASIYVDAGLSQKAEKALQEL--EMKNTQR 264

Query: 275 DRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTY 334
           D   Y  LI+LY     L+EV RIW +L+      +N++YL M+Q L KL D+ G +  +
Sbjct: 265 DFTAYQFLITLYGRLGKLTEVYRIWRSLRLAIPKTSNVAYLNMIQVLVKLNDLPGAETLF 324

Query: 335 KEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKL 394
           KEW++NC  +D+RIVN +IGAY Q+ + + A  + E A +R     ++  E+FM Y++K 
Sbjct: 325 KEWQANCSTYDIRIVNVLIGAYAQEGLIQKANELKEKAPRRGGKLNAKTWEIFMDYYVKS 384

Query: 395 KQVDSAFSHLESALSESKEK--EWHPSLATTTAFLNYFEEEKDVEGAEDFARILKR-LKC 454
             +  A   +  A+S  K    +W PS  T  A ++YFE++KDV GAE+   ILK     
Sbjct: 385 GDMARALECMSKAVSIGKGDGGKWLPSPETVRALMSYFEQKKDVNGAENLLEILKNGTDN 444

Query: 455 LDASGYHLLLKTYVAAGKLAPDMRKRLKEDDIEISSELEELLGTV 496
           + A  +  L++TY AAGK  P MR+RLK +++E++   ++LL  V
Sbjct: 445 IGAEIFEPLIRTYAAAGKSHPAMRRRLKMENVEVNEATKKLLDEV 487

BLAST of CsaV3_4G004520 vs. Swiss-Prot
Match: sp|Q84JR3|PP334_ARATH (Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g21705 PE=2 SV=1)

HSP 1 Score: 251.9 bits (642), Expect = 1.5e-65
Identity = 154/493 (31.24%), Postives = 258/493 (52.33%), Query Frame = 0

Query: 21  LNELMRSPAN------------NQRGLYPRLSALGATGGSVAKTINQFIMEGNIVKKYEL 80
           +N L R PAN             +  LY ++S LG    SV   +  ++  G  V   EL
Sbjct: 1   MNILRRIPANLIASRYYYTNRVKKTTLYSKISPLGDPKSSVYPELQNWVQCGKKVSVAEL 60

Query: 81  EKCIKELRKYRRYHHCLQIMEWM-ETRKINYSFTDYALRLDLISKVNGVTAAEKYFYDLP 140
            + + +LR+ +R+ H L++ +WM ET    +S T++A+ LDLI +V G   AE+YF +L 
Sbjct: 61  IRIVHDLRRRKRFLHALEVSKWMNETGVCVFSPTEHAVHLDLIGRVYGFVTAEEYFENLK 120

Query: 141 PSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELK-ISTSLSFNNLMTMYMRMDHPEKV 200
              KN  TYGALLNCY ++   EK+L  F+KM E+  +++SL++NN+M +Y  +   EKV
Sbjct: 121 EQYKNDKTYGALLNCYVRQQNVEKSLLHFEKMKEMGFVTSSLTYNNIMCLYTNIGQHEKV 180

Query: 201 PPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTTYSNLAS 260
           P ++ EMK+       ++Y + +N+  ++ D+ ++   L +M+       DW TY+  A 
Sbjct: 181 PKVLEEMKEENVAPDNYSYRICINAFGAMYDLERIGGTLRDMERRQDITMDWNTYAVAAK 240

Query: 261 FYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNALKSVYS 320
           FY+  G  ++A   LK  E  +  +K D   Y+ LI+LYA      EV R+W+  K V  
Sbjct: 241 FYIDGGDCDRAVELLKMSENRL--EKKDGEGYNHLITLYARLGKKIEVLRLWDLEKDVCK 300

Query: 321 TMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAM 380
              N  YL +LQ+L K+  +   +    EW+S+   +D R+ N +I  Y+ + M E A  
Sbjct: 301 RRINQDYLTVLQSLVKIDALVEAEEVLTEWKSSGNCYDFRVPNTVIRGYIGKSMEEKAEA 360

Query: 381 IFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALS-ESKEKEWHPSLATTTAF 440
           + ED  +R K     + E+    + +   +++AF  +++AL  E   ++W P L   T+ 
Sbjct: 361 MLEDLARRGKATTPESWELVATAYAEKGTLENAFKCMKTALGVEVGSRKWRPGLTLVTSV 420

Query: 441 LNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYV-AAGKLAPDMRKRLKEDDIEI 498
           L++  +E  ++  E F   L+    ++   YH L+K  +   G+    + +R+K+D IEI
Sbjct: 421 LSWVGDEGSLKEVESFVASLRNCIGVNKQMYHALVKADIREGGRNIDTLLQRMKDDKIEI 480

BLAST of CsaV3_4G004520 vs. Swiss-Prot
Match: sp|Q3E911|PP400_ARATH (Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX=3702 GN=At5g27460 PE=2 SV=1)

HSP 1 Score: 224.6 bits (571), Expect = 2.5e-57
Identity = 142/441 (32.20%), Postives = 229/441 (51.93%), Query Frame = 0

Query: 48  SVAKTINQFIMEGNIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRK-INYSFTDYALR 107
           SV   + + I  G+ V   EL    K L +  RY   LQ+MEWME +K I +S  D ALR
Sbjct: 53  SVTSLLQERIDSGHAVSLSELRLISKRLIRSNRYDLALQMMEWMENQKDIEFSVYDIALR 112

Query: 108 LDLISKVNGVTAAEKYFYDLPPSAKN----RCTYGALLNCYCKEMMEEKALTLFKKMDEL 167
           LDLI K +G+   E+YF  L  S+ +    +  Y  LL  Y K  M ++A  L +K++ L
Sbjct: 113 LDLIIKTHGLKQGEEYFEKLLHSSVSMRVAKSAYLPLLRAYVKNKMVKEAEALMEKLNGL 172

Query: 168 K-ISTSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKV 227
             + T   FN +M +Y      EKV  ++  MK         +YN+WMN+C  ++ +  V
Sbjct: 173 GFLVTPHPFNEMMKLYEASGQYEKVVMVVSMMKGNKIPRNVLSYNLWMNACCEVSGVAAV 232

Query: 228 EEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCL 287
           E + +EM  +   +  W++   LA+ Y+K+G  EKA L L+  E+ +  ++++RL Y  L
Sbjct: 233 ETVYKEMVGDKSVEVGWSSLCTLANVYIKSGFDEKARLVLEDAEKML--NRSNRLGYFFL 292

Query: 288 ISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCR 347
           I+LYAS  N   V R+W   KSV   ++ ++Y+ +L +L K  D+E  +R + EWE+ C 
Sbjct: 293 ITLYASLGNKEGVVRLWEVSKSVCGRISCVNYICVLSSLVKTGDLEEAERVFSEWEAQCF 352

Query: 348 NFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFS 407
           N+D+R+ N ++GAY++      A  +     +R   P  +  E+ M  ++K + ++ A  
Sbjct: 353 NYDVRVSNVLLGAYVRNGEIRKAESLHGCVLERGGTPNYKTWEILMEGWVKCENMEKAID 412

Query: 408 HLESALSESKEKEWHPSLATTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLK 467
            +       +   W PS     A   YFE+E+ +E A  + R L RL       Y LLL+
Sbjct: 413 AMHQVFVLMRRCHWRPSHNIVMAIAEYFEKEEKIEEATAYVRDLHRLGLASLPLYRLLLR 472

Query: 468 TYVAAGKLAPDMRKRLKEDDI 483
            +  A + A D+ + +K D +
Sbjct: 473 MHEHAKRPAYDIYEMMKLDKL 491

BLAST of CsaV3_4G004520 vs. TrEMBL
Match: tr|A0A0A0KUH1|A0A0A0KUH1_CUCSA (Pentatricopeptide repeat-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G026260 PE=4 SV=1)

HSP 1 Score: 996.1 bits (2574), Expect = 2.8e-287
Identity = 498/498 (100.00%), Postives = 498/498 (100.00%), Query Frame = 0

Query: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG 60
           MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG
Sbjct: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG 60

Query: 61  NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE 120
           NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE
Sbjct: 61  NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE 120

Query: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM 180
           KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM
Sbjct: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM 180

Query: 181 DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT 240
           DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT
Sbjct: 181 DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT 240

Query: 241 YSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNA 300
           YSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNA
Sbjct: 241 YSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNA 300

Query: 301 LKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDM 360
           LKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDM
Sbjct: 301 LKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDM 360

Query: 361 YEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLA 420
           YEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLA
Sbjct: 361 YEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLA 420

Query: 421 TTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKED 480
           TTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKED
Sbjct: 421 TTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKED 480

Query: 481 DIEISSELEELLGTVCPQ 499
           DIEISSELEELLGTVCPQ
Sbjct: 481 DIEISSELEELLGTVCPQ 498

BLAST of CsaV3_4G004520 vs. TrEMBL
Match: tr|A0A1S3BYD8|A0A1S3BYD8_CUCME (pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103494433 PE=4 SV=1)

HSP 1 Score: 907.1 bits (2343), Expect = 1.7e-260
Identity = 455/498 (91.37%), Postives = 467/498 (93.78%), Query Frame = 0

Query: 1   MNRRSLISRAPAGFRQLCTSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEG 60
           MNRRSLISRAPAG RQLCTS+ EL RSPANN RGLYPRLS LGATGGSVA+TIN+FIMEG
Sbjct: 1   MNRRSLISRAPAGLRQLCTSVAELTRSPANNHRGLYPRLSVLGATGGSVAQTINRFIMEG 60

Query: 61  NIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAE 120
           NIVKKYELEKCIKELRKYRRY H LQIMEWME RKINYSFTDYALRLDLISKVNG+TAAE
Sbjct: 61  NIVKKYELEKCIKELRKYRRYDHSLQIMEWMEIRKINYSFTDYALRLDLISKVNGITAAE 120

Query: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKISTSLSFNNLMTMYMRM 180
           KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKA TLFKKMDELK  TSL+FNNLMTMYMRM
Sbjct: 121 KYFYDLPPSAKNRCTYGALLNCYCKEMMEEKASTLFKKMDELKFVTSLAFNNLMTMYMRM 180

Query: 181 DHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTT 240
           D PEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMED NK DWTT
Sbjct: 181 DQPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDSNKLDWTT 240

Query: 241 YSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNA 300
           +SNLASFYVKAGQ EKAELALKK+EEE+KSDK DRL YHCLISLYASTSNLSEVNRIWN 
Sbjct: 241 FSNLASFYVKAGQLEKAELALKKVEEEIKSDKKDRLAYHCLISLYASTSNLSEVNRIWNL 300

Query: 301 LKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDM 360
           LKSVY TMTN SYLVMLQAL KLKDIEGLK+TYKEWES C  FDLR+VN IIGAYLQQDM
Sbjct: 301 LKSVYPTMTNTSYLVMLQALSKLKDIEGLKKTYKEWESICHIFDLRLVNVIIGAYLQQDM 360

Query: 361 YEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLA 420
           YEDAAMIFEDA KRSKGPFSRARE FMVYFLKLKQVDSAFSHLESA+SESKEKEWHPSLA
Sbjct: 361 YEDAAMIFEDAIKRSKGPFSRAREKFMVYFLKLKQVDSAFSHLESAISESKEKEWHPSLA 420

Query: 421 TTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKED 480
           TT AFLNYFEEEKDVEGAEDFARILKRLKCLD SGYHLLLKTYVAAGK APDMR+RLKED
Sbjct: 421 TTNAFLNYFEEEKDVEGAEDFARILKRLKCLDESGYHLLLKTYVAAGKSAPDMRQRLKED 480

Query: 481 DIEISSELEELLGTVCPQ 499
           DI ISSELEELLGTVCPQ
Sbjct: 481 DIGISSELEELLGTVCPQ 498

BLAST of CsaV3_4G004520 vs. TrEMBL
Match: tr|A0A1S3BWN7|A0A1S3BWN7_CUCME (pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103494433 PE=4 SV=1)

HSP 1 Score: 758.4 bits (1957), Expect = 9.8e-216
Identity = 378/411 (91.97%), Postives = 387/411 (94.16%), Query Frame = 0

Query: 88  MEWMETRKINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKNRCTYGALLNCYCKEM 147
           MEWME RKINYSFTDYALRLDLISKVNG+TAAEKYFYDLPPSAKNRCTYGALLNCYCKEM
Sbjct: 1   MEWMEIRKINYSFTDYALRLDLISKVNGITAAEKYFYDLPPSAKNRCTYGALLNCYCKEM 60

Query: 148 MEEKALTLFKKMDELKISTSLSFNNLMTMYMRMDHPEKVPPLIGEMKQRGFYLTTFTYNV 207
           MEEKA TLFKKMDELK  TSL+FNNLMTMYMRMD PEKVPPLIGEMKQRGFYLTTFTYNV
Sbjct: 61  MEEKASTLFKKMDELKFVTSLAFNNLMTMYMRMDQPEKVPPLIGEMKQRGFYLTTFTYNV 120

Query: 208 WMNSCASLNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAGQFEKAELALKKLEEE 267
           WMNSCASLNDIGKVEEILEEMKMED NK DWTT+SNLASFYVKAGQ EKAELALKK+EEE
Sbjct: 121 WMNSCASLNDIGKVEEILEEMKMEDSNKLDWTTFSNLASFYVKAGQLEKAELALKKVEEE 180

Query: 268 MKSDKNDRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNISYLVMLQALRKLKDIE 327
           +KSDK DRL YHCLISLYASTSNLSEVNRIWN LKSVY TMTN SYLVMLQAL KLKDIE
Sbjct: 181 IKSDKKDRLAYHCLISLYASTSNLSEVNRIWNLLKSVYPTMTNTSYLVMLQALSKLKDIE 240

Query: 328 GLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDATKRSKGPFSRAREMFM 387
           GLK+TYKEWES C  FDLR+VN IIGAYLQQDMYEDAAMIFEDA KRSKGPFSRARE FM
Sbjct: 241 GLKKTYKEWESICHIFDLRLVNVIIGAYLQQDMYEDAAMIFEDAIKRSKGPFSRAREKFM 300

Query: 388 VYFLKLKQVDSAFSHLESALSESKEKEWHPSLATTTAFLNYFEEEKDVEGAEDFARILKR 447
           VYFLKLKQVDSAFSHLESA+SESKEKEWHPSLATT AFLNYFEEEKDVEGAEDFARILKR
Sbjct: 301 VYFLKLKQVDSAFSHLESAISESKEKEWHPSLATTNAFLNYFEEEKDVEGAEDFARILKR 360

Query: 448 LKCLDASGYHLLLKTYVAAGKLAPDMRKRLKEDDIEISSELEELLGTVCPQ 499
           LKCLD SGYHLLLKTYVAAGK APDMR+RLKEDDI ISSELEELLGTVCPQ
Sbjct: 361 LKCLDESGYHLLLKTYVAAGKSAPDMRQRLKEDDIGISSELEELLGTVCPQ 411

BLAST of CsaV3_4G004520 vs. TrEMBL
Match: tr|A0A251PTA2|A0A251PTA2_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G010700 PE=4 SV=1)

HSP 1 Score: 606.7 bits (1563), Expect = 4.8e-170
Identity = 313/502 (62.35%), Postives = 379/502 (75.50%), Query Frame = 0

Query: 1   MNRRSLISRAPAGFRQLCTSL---NELMRSPANNQRGLYPRLSALGATGGSVAKTINQFI 60
           MN    IS      R+LCT++    E  RS   N   LY RLSALGATGGSVAKT+NQ+I
Sbjct: 1   MNSSRSISAGTWLVRKLCTAVEAATESARSQPGNPNRLYRRLSALGATGGSVAKTLNQYI 60

Query: 61  MEGNIVKKYELEKCIKELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVT 120
           MEG ++KKYELE+CIKELRKYR++ H L+IMEWME RK+NYS  D+A+RLDL SKV G+ 
Sbjct: 61  MEGKMLKKYELERCIKELRKYRKFQHALEIMEWMEFRKMNYSKADFAIRLDLTSKVKGIE 120

Query: 121 AAEKYFYDLPPSAKNRCTYGALLNCYCKEMMEEKALTLFKKMDELKI-STSLSFNNLMTM 180
           AAE YF  L PS K+R TYGALLNCYCKE+MEEKAL L++ MDEL+  S+SL FNNLM+M
Sbjct: 121 AAEDYFSGLSPSLKDRFTYGALLNCYCKELMEEKALALYETMDELEFASSSLVFNNLMSM 180

Query: 181 YMRMDHPEKVPPLIGEMKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKF 240
           +MR   PEKV PL+ EMKQR   L TFTYN+WM S ASLND    E +L+EM+ +D N+ 
Sbjct: 181 HMRKQQPEKVAPLVQEMKQRNIPLDTFTYNIWMQSFASLNDFEGAERVLDEMQKQDGNQC 240

Query: 241 DWTTYSNLASFYVKAGQFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNR 300
            W+TYSNLA+ YVKA  F+KAELALKK EE MK  K  R  YH LISLYA TSNL EV R
Sbjct: 241 SWSTYSNLAAIYVKAKIFDKAELALKKSEEMMKPLK-QRNTYHFLISLYACTSNLGEVKR 300

Query: 301 IWNALKSVYSTMTNISYLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYL 360
           +W +LK  +    N+SYL+MLQAL KL DIEGLK  ++EWE  C ++D+R+ N  I  YL
Sbjct: 301 VWESLKKAFPATNNMSYLIMLQALCKLNDIEGLKECFEEWECKCSSYDMRLANTAIRGYL 360

Query: 361 QQDMYEDAAMIFEDATKRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWH 420
            QDMYE+AA++F DA KR+KGPF +AREMFM+YFLK  QVD A S+L +A+SE+ + EWH
Sbjct: 361 SQDMYEEAALVFADACKRTKGPFFKAREMFMLYFLKNCQVDLAVSYLGAAVSETADGEWH 420

Query: 421 PSLATTTAFLNYFEEEKDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKR 480
           PS  TT+AF  YFEEEKDVE AE+F +ILKRL CL ++ Y+LLLKTY+AAGKL P+MR+R
Sbjct: 421 PSPDTTSAFFKYFEEEKDVESAENFCKILKRLNCLCSNEYYLLLKTYIAAGKLDPEMRQR 480

Query: 481 LKEDDIEISSELEELLGTVCPQ 499
           LKE+DIEIS ELE LL  V P+
Sbjct: 481 LKEEDIEISPELESLLERVSPE 501

BLAST of CsaV3_4G004520 vs. TrEMBL
Match: tr|A0A2I4ERV2|A0A2I4ERV2_9ROSI (pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Juglans regia OX=51240 GN=LOC108992119 PE=4 SV=1)

HSP 1 Score: 595.1 bits (1533), Expect = 1.4e-166
Identity = 305/486 (62.76%), Postives = 375/486 (77.16%), Query Frame = 0

Query: 15  RQLC-TSLNELMRSPANNQRGLYPRLSALGATGGSVAKTINQFIMEGNIVKKYELEKCIK 74
           R  C  S  E  R P      LY RLSALGATGGSV++T+ Q++ EGN V KY LE+CIK
Sbjct: 15  RHFCRVSETETERKPER----LYRRLSALGATGGSVSETLKQYLREGNSVDKYNLERCIK 74

Query: 75  ELRKYRRYHHCLQIMEWMETRKINYSFTDYALRLDLISKVNGVTAAEKYFYDLPPSAKNR 134
            LRKYRR+ H L+IMEWME ++  Y   D+ALRLDLISK  G+ AAE+YF  LPP+ KNR
Sbjct: 75  ALRKYRRFQHALEIMEWMEAKEFTYYSKDHALRLDLISKTKGIPAAEEYFSGLPPTGKNR 134

Query: 135 CTYGALLNCYCKEMMEEKALTLFKKMDELK-ISTSLSFNNLMTMYMRMDHPEKVPPLIGE 194
            TYGALLNCYCKE+ME+KAL LFK+MDEL  +S+ L+FN+LM++YMR + PEKVPPL+ E
Sbjct: 135 LTYGALLNCYCKEVMEDKALALFKEMDELNFVSSDLAFNSLMSLYMRTNQPEKVPPLVQE 194

Query: 195 MKQRGFYLTTFTYNVWMNSCASLNDIGKVEEILEEMKMEDRNKFDWTTYSNLASFYVKAG 254
           MK+R   L+ FT NVWMNS A LNDI  VE +L+    ED +K DWTTYSNLA+ YVKAG
Sbjct: 195 MKRRSIPLSIFTRNVWMNSYALLNDIEGVERVLK----EDESKCDWTTYSNLAAIYVKAG 254

Query: 255 QFEKAELALKKLEEEMKSDKNDRLVYHCLISLYASTSNLSEVNRIWNALKSVYSTMTNIS 314
            FEKA+LALKKLEE MK  K  R  YH LISLYA T+NL EVNR+W +LKSV+ T  N+S
Sbjct: 255 LFEKAQLALKKLEENMKPPK--REAYHFLISLYAGTNNLGEVNRVWKSLKSVFRTTNNMS 314

Query: 315 YLVMLQALRKLKDIEGLKRTYKEWESNCRNFDLRIVNDIIGAYLQQDMYEDAAMIFEDAT 374
            LVMLQALRKLKD++GL + YK WES+C ++D+R+ N +I AYL QDM+++A ++F+DA 
Sbjct: 315 NLVMLQALRKLKDVDGLTKCYKAWESSCFSYDVRLANVVISAYLSQDMFKEATLVFDDAV 374

Query: 375 KRSKGPFSRAREMFMVYFLKLKQVDSAFSHLESALSESKEKEWHPSLATTTAFLNYFEEE 434
           KR KGPF  AREMFMVYFLK++QVD A S+LE+A+SE ++ EW P+ AT +AFL YFE E
Sbjct: 375 KRCKGPFFLAREMFMVYFLKIRQVDLARSYLEAAVSE-EDNEWRPAPATASAFLKYFEGE 434

Query: 435 KDVEGAEDFARILKRLKCLDASGYHLLLKTYVAAGKLAPDMRKRLKEDDIEISSELEELL 494
           KDV+GAE+F   LK  KCL ++ YHLLLKTY+AAGK AP+MR+RLKED IEISS+LE LL
Sbjct: 435 KDVDGAEEFCTTLKTFKCLTSNIYHLLLKTYIAAGKQAPEMRRRLKEDHIEISSDLENLL 489

Query: 495 GTVCPQ 499
             VCP+
Sbjct: 495 EKVCPE 489

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004146883.24.2e-287100.00PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial ... [more]
XP_008453822.12.6e-26091.37PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-... [more]
XP_022141328.15.3e-22178.43pentatricopeptide repeat-containing protein At1g02370, mitochondrial [Momordica ... [more]
XP_022942939.15.1e-21678.72pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isofor... [more]
XP_008453823.11.5e-21591.97PREDICTED: pentatricopeptide repeat-containing protein At1g02370, mitochondrial-... [more]
Match NameE-valueIdentityDescription
AT1G02370.15.6e-13250.85Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G01990.14.4e-12147.13Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G60770.13.7e-6734.84Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT4G21705.18.2e-6731.24Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G27460.11.4e-5832.20Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9FZ24|PPR4_ARATH1.0e-13050.85Pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Arabidop... [more]
sp|Q93WC5|PP300_ARATH8.0e-12047.13Pentatricopeptide repeat-containing protein At4g01990, mitochondrial OS=Arabidop... [more]
sp|O22714|PPR86_ARATH6.6e-6634.84Pentatricopeptide repeat-containing protein At1g60770 OS=Arabidopsis thaliana OX... [more]
sp|Q84JR3|PP334_ARATH1.5e-6531.24Pentatricopeptide repeat-containing protein At4g21705, mitochondrial OS=Arabidop... [more]
sp|Q3E911|PP400_ARATH2.5e-5732.20Pentatricopeptide repeat-containing protein At5g27460 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0KUH1|A0A0A0KUH1_CUCSA2.8e-287100.00Pentatricopeptide repeat-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G... [more]
tr|A0A1S3BYD8|A0A1S3BYD8_CUCME1.7e-26091.37pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isofor... [more]
tr|A0A1S3BWN7|A0A1S3BWN7_CUCME9.8e-21691.97pentatricopeptide repeat-containing protein At1g02370, mitochondrial-like isofor... [more]
tr|A0A251PTA2|A0A251PTA2_PRUPE4.8e-17062.35Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G010700 PE=4 SV=1[more]
tr|A0A2I4ERV2|A0A2I4ERV2_9ROSI1.4e-16662.76pentatricopeptide repeat-containing protein At1g02370, mitochondrial OS=Juglans ... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_4G004520.1CsaV3_4G004520.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 326..346
NoneNo IPR availableCOILSCoilCoilcoord: 254..274
NoneNo IPR availablePANTHERPTHR24015:SF504SUBFAMILY NOT NAMEDcoord: 26..491
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 26..491
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 134..165
e-value: 9.9E-7
score: 26.6
coord: 203..231
e-value: 7.3E-5
score: 20.7
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 240..266
e-value: 0.071
score: 13.3
coord: 203..230
e-value: 3.8E-4
score: 20.4
coord: 135..162
e-value: 7.7E-6
score: 25.7
coord: 169..198
e-value: 0.7
score: 10.2
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 132..166
score: 9.767
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 274..304
score: 6.149
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 237..267
score: 7.585
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 201..235
score: 8.221
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 344..378
score: 7.213
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 309..339
score: 5.338
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 167..200
score: 5.601
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 271..406
e-value: 1.4E-8
score: 36.4
coord: 64..270
e-value: 2.1E-23
score: 85.2
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 135..321