CsaV3_5G013110 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_5G013110
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein
Location: chr5 : 9494682 .. 9496808 (-)
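As a quick consistency check, the span implied by the location can be compared with the 708-residue protein reported for this gene. The sketch below assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location such as 'chr5 : 9494682 .. 9496808 (-)'.

    Assumption: coordinates are 1-based and inclusive.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("chr5 : 9494682 .. 9496808 (-)")
length = end - start + 1   # inclusive span in base pairs
codons = length // 3       # whole codons in the span
print(seqid, strand, length, codons - 1)   # last value: residues if one codon is the stop
```

Under that assumption the span is 2127 bp, i.e. 709 codons, or 708 residues plus a stop codon. That is what one would expect if the whole feature is a single coding exon with no annotated untranslated regions.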
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTATTTTAAAACTACCAATTACTGACATTATGCCTGTGAAGTTTACACCATTTCTTTCCAGGTCCAATTTCCTTGCTTCTCCTCACCAGGACCCCATCAAGCTGTTGAAGGTAGCTGCAGATGCCAAGAACTTAAAATTTGGTAGAACAATCCATGCCCATCTGACCATCACCAATCACAACTATAGAGACTCCAAAGTAAACCAACTGAATTCCCTTATTAATTTGTACGTGAAATGTGATGAAGTATCGATTGCTCGGAAATTGTTCGATAGTATGCCTAGAAGAAATGTTGTGTCCTGGAGTGCTTTAATGGCTGGGTACATGCAAAATGGTAATCCCTTGGAAGTTTTCGAGTTGTTCAAAAAGATGGTCGTGAAGGATAATATCTTCCCCAATGAATATGTGATTGCTACTGCTATATCTTCTTGTGATAGTCAAATGTATGTAGAAGGGAAACAATGTCATGGGTATGCATTAAAGTCTGGATTGGAGTTTCATCAATATGTTAAGAATGCACTTATTCAGTTGTACTCTAAATGTTCAGATGTAGGAGCAGCAATCCAGATATTATATACTGTCCCAGGTAATGACATATTTTGTTATAATTTGGTAGTGAATGGGCTTCTACAGCACACACATATGGCAGAAGCTGTAGACGTTCTGAAGTTAATAATTAGTGAAGGCATAGAATGGAATAATGCCACTTATGTTACAATTTTCCGCCTTTGTGCTAGTCTTAAAGATATAACATTAGGTAAGCAAGTTCACGCTCAAATGTTGAAAAGTGATATTGACTGTGATGTCTATATTGGAAGTTCTATCATTGATATGTACGGGAAGTGTGGTAATGTGTTGAGTGGAAGAACCTTTTTTGATCGGTTACAAAGCCGAAACGTTGTTTCTTGGACATCGATCATAGCAGCTTATTTTCAGAATGAATTCTTTGAAGAAGCATTGAATCTGTTTTCAAAGATGGAAATTGATTGTATTCCTCCCAATGAATATACAATGGCGGTGTTGTTCAACTCTGCGGCAGGTTTGTCTGCACTATGCCTTGGGGATCAGTTACATGCACGTGCTGAGAAATCAGGTCTCAAAGGCAATGTTATGGTAGGTAATGCCTTGATCATTATGTATTTTAAGAGTGGGGACATTTTAGCAGCACAAAGTGTGTTTTCAAACATGACGTGTTGTAATATCATTACCTGGAATGCAATAATAACTGGCCACTCACACCATGGTCTGGGCAAGGAAGCGTTAAGCATGTTCCAGGACATGATGGCTACTGGAGAACGCCCTAATTATGTAACTTTTATTGGTGTAATATTAGCTTGTGCCCATTTAAAACTGGTGGATGAAGGATTCTACTATTTTAATCATTTGATGAAACAGTTCAGAATTGTTCCTGGATTGGAGCACTATACCTGTATTGTTGGACTTTTAAGTAGATCTGGACGACTGGATGAAGCCGAGAATTTTATGCGGTCACATCAAATAAATTGGGATGTTGTTTCCTGGCGCACCCTTCTCAACGCTTGTTATGTTCATAAACATTATGATAAAGGGAGGAAAATAGCAGAGTACTTGCTACAGTTGGAGCCTAGGGATGTTGGAACTTATATTCTATTGTCAAACATGCATGCGAGAGTTAGGAGGTGGGATCATGTTGTTGAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAGGAACCTGGAGTAAGTTGGTTAGAAATAAGAAATGTTGCCCATGTTTTTACATCTGAGGATATTAAACACCCTGAGGCCAATCTGATTTATGAAAATGTAAAGGATTTATTATCTAAGATTCGACCTTTGGGGTATGTTCCTGATATTGATAATGTATTGCATGATATTGAGGACGAGCAGAAGGTCGACAATCTTAGCTATCACAGTGAGAAGCTTGCTGTAGCATATGGCCTAATGAAGACACCATCTGGCGCACCAATCACAGTGATTAAAAACCTTAGGAT
GTGTGATGATTGTCACACTGCTATCAAACTTATTTCCAAAGTTGCTAATAGGGTTATAGTTGTTAGAGATGCCAATCGCTTCCATCATTTTCAAAATGGTTGTTGTTCGTGTGGAGATTATTGGTGA

mRNA sequence

ATGTCTATTTTAAAACTACCAATTACTGACATTATGCCTGTGAAGTTTACACCATTTCTTTCCAGGTCCAATTTCCTTGCTTCTCCTCACCAGGACCCCATCAAGCTGTTGAAGGTAGCTGCAGATGCCAAGAACTTAAAATTTGGTAGAACAATCCATGCCCATCTGACCATCACCAATCACAACTATAGAGACTCCAAAGTAAACCAACTGAATTCCCTTATTAATTTGTACGTGAAATGTGATGAAGTATCGATTGCTCGGAAATTGTTCGATAGTATGCCTAGAAGAAATGTTGTGTCCTGGAGTGCTTTAATGGCTGGGTACATGCAAAATGGTAATCCCTTGGAAGTTTTCGAGTTGTTCAAAAAGATGGTCGTGAAGGATAATATCTTCCCCAATGAATATGTGATTGCTACTGCTATATCTTCTTGTGATAGTCAAATGTATGTAGAAGGGAAACAATGTCATGGGTATGCATTAAAGTCTGGATTGGAGTTTCATCAATATGTTAAGAATGCACTTATTCAGTTGTACTCTAAATGTTCAGATGTAGGAGCAGCAATCCAGATATTATATACTGTCCCAGGTAATGACATATTTTGTTATAATTTGGTAGTGAATGGGCTTCTACAGCACACACATATGGCAGAAGCTGTAGACGTTCTGAAGTTAATAATTAGTGAAGGCATAGAATGGAATAATGCCACTTATGTTACAATTTTCCGCCTTTGTGCTAGTCTTAAAGATATAACATTAGGTAAGCAAGTTCACGCTCAAATGTTGAAAAGTGATATTGACTGTGATGTCTATATTGGAAGTTCTATCATTGATATGTACGGGAAGTGTGGTAATGTGTTGAGTGGAAGAACCTTTTTTGATCGGTTACAAAGCCGAAACGTTGTTTCTTGGACATCGATCATAGCAGCTTATTTTCAGAATGAATTCTTTGAAGAAGCATTGAATCTGTTTTCAAAGATGGAAATTGATTGTATTCCTCCCAATGAATATACAATGGCGGTGTTGTTCAACTCTGCGGCAGGTTTGTCTGCACTATGCCTTGGGGATCAGTTACATGCACGTGCTGAGAAATCAGGTCTCAAAGGCAATGTTATGGTAGGTAATGCCTTGATCATTATGTATTTTAAGAGTGGGGACATTTTAGCAGCACAAAGTGTGTTTTCAAACATGACGTGTTGTAATATCATTACCTGGAATGCAATAATAACTGGCCACTCACACCATGGTCTGGGCAAGGAAGCGTTAAGCATGTTCCAGGACATGATGGCTACTGGAGAACGCCCTAATTATGTAACTTTTATTGGTGTAATATTAGCTTGTGCCCATTTAAAACTGGTGGATGAAGGATTCTACTATTTTAATCATTTGATGAAACAGTTCAGAATTGTTCCTGGATTGGAGCACTATACCTGTATTGTTGGACTTTTAAGTAGATCTGGACGACTGGATGAAGCCGAGAATTTTATGCGGTCACATCAAATAAATTGGGATGTTGTTTCCTGGCGCACCCTTCTCAACGCTTGTTATGTTCATAAACATTATGATAAAGGGAGGAAAATAGCAGAGTACTTGCTACAGTTGGAGCCTAGGGATGTTGGAACTTATATTCTATTGTCAAACATGCATGCGAGAGTTAGGAGGTGGGATCATGTTGTTGAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAGGAACCTGGAGTAAGTTGGTTAGAAATAAGAAATGTTGCCCATGTTTTTACATCTGAGGATATTAAACACCCTGAGGCCAATCTGATTTATGAAAATGTAAAGGATTTATTATCTAAGATTCGACCTTTGGGGTATGTTCCTGATATTGATAATGTATTGCATGATATTGAGGACGAGCAGAAGGTCGACAATCTTAGCTATCACAGTGAGAAGCTTGCTGTAGCATATGGCCTAATGAAGACACCATCTGGCGCACCAATCACAGTGATTAAAAACCTTAGGAT
GTGTGATGATTGTCACACTGCTATCAAACTTATTTCCAAAGTTGCTAATAGGGTTATAGTTGTTAGAGATGCCAATCGCTTCCATCATTTTCAAAATGGTTGTTGTTCGTGTGGAGATTATTGGTGA

Coding sequence (CDS)

ATGTCTATTTTAAAACTACCAATTACTGACATTATGCCTGTGAAGTTTACACCATTTCTTTCCAGGTCCAATTTCCTTGCTTCTCCTCACCAGGACCCCATCAAGCTGTTGAAGGTAGCTGCAGATGCCAAGAACTTAAAATTTGGTAGAACAATCCATGCCCATCTGACCATCACCAATCACAACTATAGAGACTCCAAAGTAAACCAACTGAATTCCCTTATTAATTTGTACGTGAAATGTGATGAAGTATCGATTGCTCGGAAATTGTTCGATAGTATGCCTAGAAGAAATGTTGTGTCCTGGAGTGCTTTAATGGCTGGGTACATGCAAAATGGTAATCCCTTGGAAGTTTTCGAGTTGTTCAAAAAGATGGTCGTGAAGGATAATATCTTCCCCAATGAATATGTGATTGCTACTGCTATATCTTCTTGTGATAGTCAAATGTATGTAGAAGGGAAACAATGTCATGGGTATGCATTAAAGTCTGGATTGGAGTTTCATCAATATGTTAAGAATGCACTTATTCAGTTGTACTCTAAATGTTCAGATGTAGGAGCAGCAATCCAGATATTATATACTGTCCCAGGTAATGACATATTTTGTTATAATTTGGTAGTGAATGGGCTTCTACAGCACACACATATGGCAGAAGCTGTAGACGTTCTGAAGTTAATAATTAGTGAAGGCATAGAATGGAATAATGCCACTTATGTTACAATTTTCCGCCTTTGTGCTAGTCTTAAAGATATAACATTAGGTAAGCAAGTTCACGCTCAAATGTTGAAAAGTGATATTGACTGTGATGTCTATATTGGAAGTTCTATCATTGATATGTACGGGAAGTGTGGTAATGTGTTGAGTGGAAGAACCTTTTTTGATCGGTTACAAAGCCGAAACGTTGTTTCTTGGACATCGATCATAGCAGCTTATTTTCAGAATGAATTCTTTGAAGAAGCATTGAATCTGTTTTCAAAGATGGAAATTGATTGTATTCCTCCCAATGAATATACAATGGCGGTGTTGTTCAACTCTGCGGCAGGTTTGTCTGCACTATGCCTTGGGGATCAGTTACATGCACGTGCTGAGAAATCAGGTCTCAAAGGCAATGTTATGGTAGGTAATGCCTTGATCATTATGTATTTTAAGAGTGGGGACATTTTAGCAGCACAAAGTGTGTTTTCAAACATGACGTGTTGTAATATCATTACCTGGAATGCAATAATAACTGGCCACTCACACCATGGTCTGGGCAAGGAAGCGTTAAGCATGTTCCAGGACATGATGGCTACTGGAGAACGCCCTAATTATGTAACTTTTATTGGTGTAATATTAGCTTGTGCCCATTTAAAACTGGTGGATGAAGGATTCTACTATTTTAATCATTTGATGAAACAGTTCAGAATTGTTCCTGGATTGGAGCACTATACCTGTATTGTTGGACTTTTAAGTAGATCTGGACGACTGGATGAAGCCGAGAATTTTATGCGGTCACATCAAATAAATTGGGATGTTGTTTCCTGGCGCACCCTTCTCAACGCTTGTTATGTTCATAAACATTATGATAAAGGGAGGAAAATAGCAGAGTACTTGCTACAGTTGGAGCCTAGGGATGTTGGAACTTATATTCTATTGTCAAACATGCATGCGAGAGTTAGGAGGTGGGATCATGTTGTTGAGATTCGAAAATTGATGAGGGAAAGAAATGTCAAGAAGGAACCTGGAGTAAGTTGGTTAGAAATAAGAAATGTTGCCCATGTTTTTACATCTGAGGATATTAAACACCCTGAGGCCAATCTGATTTATGAAAATGTAAAGGATTTATTATCTAAGATTCGACCTTTGGGGTATGTTCCTGATATTGATAATGTATTGCATGATATTGAGGACGAGCAGAAGGTCGACAATCTTAGCTATCACAGTGAGAAGCTTGCTGTAGCATATGGCCTAATGAAGACACCATCTGGCGCACCAATCACAGTGATTAAAAACCTTAGGAT
GTGTGATGATTGTCACACTGCTATCAAACTTATTTCCAAAGTTGCTAATAGGGTTATAGTTGTTAGAGATGCCAATCGCTTCCATCATTTTCAAAATGGTTGTTGTTCGTGTGGAGATTATTGGTGA

Protein sequence

MSILKLPITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSCDSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW
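The relationship between the coding sequence and the protein sequence above can be spot-checked by translating short stretches of the CDS. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are illustrative names, and the two test strings are the first 24 and last 15 nucleotides of the CDS listed on this page.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds, to_stop=True):
    """Translate a nucleotide string in frame 0; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break                      # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCTATTTTAAAACTACCAATT"))  # start of the CDS
print(translate("GGAGATTATTGGTGA"))           # end of the CDS, including TGA
```

The two calls give `MSILKLPI` and `GDYW`, matching the first eight and last four residues of the protein sequence.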
BLAST of CsaV3_5G013110 vs. NCBI nr
Match: XP_011655117.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g39680 [Cucumis sativus] >XP_011655118.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 [Cucumis sativus] >XP_011655119.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 [Cucumis sativus] >KGN50862.1 hypothetical protein Csa_5G292190 [Cucumis sativus])

HSP 1 Score: 1449.9 bits (3752), Expect = 0.0e+00
Identity = 708/708 (100.00%), Positives = 708/708 (100.00%), Query Frame = 0

Query: 1   MSILKLPITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITN 60
           MSILKLPITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITN
Sbjct: 1   MSILKLPITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITN 60

Query: 61  HNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFE 120
           HNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFE
Sbjct: 61  HNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFE 120

Query: 121 LFKKMVVKDNIFPNEYVIATAISSCDSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS 180
           LFKKMVVKDNIFPNEYVIATAISSCDSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS
Sbjct: 121 LFKKMVVKDNIFPNEYVIATAISSCDSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS 180

Query: 181 KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVT 240
           KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVT
Sbjct: 181 KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVT 240

Query: 241 IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN 300
           IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN
Sbjct: 241 IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN 300

Query: 301 VVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHA 360
           VVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHA
Sbjct: 301 VVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHA 360

Query: 361 RAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKE 420
           RAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKE
Sbjct: 361 RAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKE 420

Query: 421 ALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIV 480
           ALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIV
Sbjct: 421 ALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIV 480

Query: 481 GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVG 540
           GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVG
Sbjct: 481 GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVG 540

Query: 541 TYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANL 600
           TYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANL
Sbjct: 541 TYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANL 600

Query: 601 IYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIT 660
           IYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIT
Sbjct: 601 IYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIT 660

Query: 661 VIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709
           VIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW
Sbjct: 661 VIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 708

BLAST of CsaV3_5G013110 vs. NCBI nr
Match: XP_008456851.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902038.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902039.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902040.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902041.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902042.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902043.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902044.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902045.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902046.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902047.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo] >XP_016902048.1 PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cucumis melo])

HSP 1 Score: 1377.8 bits (3565), Expect = 0.0e+00
Identity = 671/708 (94.77%), Positives = 689/708 (97.32%), Query Frame = 0

Query: 1   MSILKLPITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITN 60
           MSILKLPI+DIMPVKFTPFLSRS+F ASPHQDPIKLLKVAADAKNL FGRTI AHLTITN
Sbjct: 1   MSILKLPISDIMPVKFTPFLSRSDFFASPHQDPIKLLKVAADAKNLIFGRTIQAHLTITN 60

Query: 61  HNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFE 120
           HNYRDSKVNQLNSLINLYVKC EVSIARK+FDSMPRRNVVSWS LMAGYMQNGNP EVFE
Sbjct: 61  HNYRDSKVNQLNSLINLYVKCGEVSIARKVFDSMPRRNVVSWSTLMAGYMQNGNPSEVFE 120

Query: 121 LFKKMVVKDNIFPNEYVIATAISSCDSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS 180
           LFKKMV+KDNI PN+YVIAT ISSC+SQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS
Sbjct: 121 LFKKMVLKDNILPNKYVIATVISSCNSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS 180

Query: 181 KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVT 240
           KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHM EAVDVLKLIIS+GIEWN+ATYVT
Sbjct: 181 KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMREAVDVLKLIISKGIEWNSATYVT 240

Query: 241 IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN 300
           IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN
Sbjct: 241 IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN 300

Query: 301 VVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHA 360
           VVSWTSI+AAYFQNEFFEEAL+LFSKMEID IPPNEYTMAVLFNSAAGLSALCLGDQLHA
Sbjct: 301 VVSWTSIMAAYFQNEFFEEALDLFSKMEIDRIPPNEYTMAVLFNSAAGLSALCLGDQLHA 360

Query: 361 RAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKE 420
           RAEKSGLKGNVMVGNALIIMYFKSGDILAAQ VFSNMTCC+IITWNAIITGHSHHGLGKE
Sbjct: 361 RAEKSGLKGNVMVGNALIIMYFKSGDILAAQRVFSNMTCCDIITWNAIITGHSHHGLGKE 420

Query: 421 ALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIV 480
           ALSMFQDMM TGERPNYVTFIGVI ACAHLKLVDEGFYYFNHLMKQF IVPGLEHYTCIV
Sbjct: 421 ALSMFQDMMTTGERPNYVTFIGVISACAHLKLVDEGFYYFNHLMKQFGIVPGLEHYTCIV 480

Query: 481 GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVG 540
           GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKG++IAEYLLQLEPRDVG
Sbjct: 481 GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGKQIAEYLLQLEPRDVG 540

Query: 541 TYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANL 600
           TYILLSNMHARVRRWD VVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHP+ANL
Sbjct: 541 TYILLSNMHARVRRWDRVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPDANL 600

Query: 601 IYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIT 660
           IYENVK+LLSKIRPLGYVPDIDNVLHDIEDEQKV+NLSYHSEKLAVAYGLMKT SG PI 
Sbjct: 601 IYENVKNLLSKIRPLGYVPDIDNVLHDIEDEQKVNNLSYHSEKLAVAYGLMKTTSGTPIR 660

Query: 661 VIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709
           VIKNLRMCDDCHTAIKLIS+VANRVI+VRD NRFHHFQNGCCSCGDYW
Sbjct: 661 VIKNLRMCDDCHTAIKLISQVANRVIIVRDVNRFHHFQNGCCSCGDYW 708
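The percentages in each HSP summary are the match counts divided by the alignment length. The sketch below recomputes them from a summary line. `hsp_percentages` is a hypothetical helper, and the pattern deliberately accepts either spelling of the positives label, since exported reports are not always consistent.

```python
import re

# Matches e.g. "Identity = 671/708" or "Positives = 689/708".
HSP_FIELD = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")

def hsp_percentages(summary_line):
    """Return identity/positives percentages recomputed from the raw counts."""
    out = {}
    for label, num, den in HSP_FIELD.findall(summary_line):
        key = "identity" if label == "Identity" else "positives"
        out[key] = round(100.0 * int(num) / int(den), 2)
    return out

line = "Identity = 671/708 (94.77%), Positives = 689/708 (97.32%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Cucumis melo hit this reproduces the reported 94.77% identity and 97.32% positives. The denominator is the alignment length including gaps, so it can exceed the query length for hits with insertions.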

BLAST of CsaV3_5G013110 vs. NCBI nr
Match: XP_023002421.1 (pentatricopeptide repeat-containing protein At5g39680 [Cucurbita maxima] >XP_023002429.1 pentatricopeptide repeat-containing protein At5g39680 [Cucurbita maxima])

HSP 1 Score: 1205.3 bits (3117), Expect = 0.0e+00
Identity = 583/711 (82.00%), Positives = 649/711 (91.28%), Query Frame = 0

Query: 1   MSILKL--PITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTI 60
           M++LKL  PI+ + PVKFTPFLS+SN LASP  DP+KLLKVAADAKNLKFGR IHAHL I
Sbjct: 28  MAMLKLPVPISSLAPVKFTPFLSKSNELASPLLDPMKLLKVAADAKNLKFGRVIHAHLVI 87

Query: 61  TNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEV 120
           TN   RD +VNQ+NSLINLYVKCDE+ IAR++FD M +RNVVSW ALMAGYMQNG+PL+V
Sbjct: 88  TNRIPRDCRVNQINSLINLYVKCDELFIARQMFDRMSKRNVVSWCALMAGYMQNGSPLDV 147

Query: 121 FELFKKMVVKDNIFPNEYVIATAISSC-DSQMYVEGKQCHGYALKSGLEFHQYVKNALIQ 180
           FELFKKM+VKDNIFPNEYVIAT ISSC DSQMYVEGKQCHG++LKSGLE HQYVKNALIQ
Sbjct: 148 FELFKKMIVKDNIFPNEYVIATVISSCFDSQMYVEGKQCHGFSLKSGLELHQYVKNALIQ 207

Query: 181 LYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNAT 240
           +YSKCSDV AA++IL TVPG D+FCYNLV+NGLL+H+H+ EA++VLKL+I EG +WNNAT
Sbjct: 208 MYSKCSDVRAALKILDTVPGYDVFCYNLVLNGLLEHSHVGEAIEVLKLMIDEGTKWNNAT 267

Query: 241 YVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQ 300
           +VTIFR+CASLKD+ LGK VHA+MLKSDID DVYIGSSIIDMYGKCGNVLSGR FFD+LQ
Sbjct: 268 FVTIFRICASLKDLKLGKHVHARMLKSDIDDDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ 327

Query: 301 SRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQ 360
           +RNVVSWT+I+AAYFQN FFEEALNLFSKMEID IPPNEYT+AVL NSAAGLSAL  GDQ
Sbjct: 328 NRNVVSWTAIMAAYFQNGFFEEALNLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQ 387

Query: 361 LHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGL 420
           LHARAEKSGLKGNV+VGNALIIMY KSGDILAAQ VFSNM CC+ ITWNAIITGHSHH +
Sbjct: 388 LHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQLVFSNMKCCDSITWNAIITGHSHHCI 447

Query: 421 GKEALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYT 480
           GKEAL++F DM+   E PNYVTFIGV+ ACAHL LVDEG YYFNHLMKQ  IVPGLEHYT
Sbjct: 448 GKEALNIFHDMLTARECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQLGIVPGLEHYT 507

Query: 481 CIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPR 540
           CIVGLLSRSGRLDEAENFMRS+ INWDVV+WRTLLNACYVH++YDKG++IAEYLLQ++  
Sbjct: 508 CIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHE 567

Query: 541 DVGTYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPE 600
           DVG+YILLSNMHARVRRWD VV++RKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHPE
Sbjct: 568 DVGSYILLSNMHARVRRWDGVVKVRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHPE 627

Query: 601 ANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGA 660
           ++ IYE ++DLL+KIRPLGYVPDI  VLHDIEDEQK+DNLSYHSEKLAVAYGLMK+PSGA
Sbjct: 628 SSQIYEMIRDLLTKIRPLGYVPDIAGVLHDIEDEQKIDNLSYHSEKLAVAYGLMKSPSGA 687

Query: 661 PITVIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709
           PI VIKNLRMCDDCHTAIKLISKVANR I+VRDANRFHHFQ+G CSCGDYW
Sbjct: 688 PIRVIKNLRMCDDCHTAIKLISKVANRTIIVRDANRFHHFQDGFCSCGDYW 738

BLAST of CsaV3_5G013110 vs. NCBI nr
Match: XP_022144415.1 (pentatricopeptide repeat-containing protein At5g39680 [Momordica charantia])

HSP 1 Score: 1204.5 bits (3115), Expect = 0.0e+00
Identity = 582/709 (82.09%), Positives = 643/709 (90.69%), Query Frame = 0

Query: 1   MSILKLPITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITN 60
           M  LKLP + +      PFL +SN+ ASP Q+PIKLLK+AADAKNLKFGR IHAHL ITN
Sbjct: 1   MPTLKLPTSGL-----PPFLFKSNYFASPTQNPIKLLKLAADAKNLKFGRIIHAHLIITN 60

Query: 61  HNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFE 120
           H   D +VNQ+NSLIN Y KCDE+ +AR++FD MP+RNVVSWSALMAGYMQNG+ LEVF 
Sbjct: 61  HTPGDCRVNQINSLINFYAKCDELLVARQMFDRMPKRNVVSWSALMAGYMQNGSSLEVFR 120

Query: 121 LFKKMVVKDNIFPNEYVIATAISS-CDSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLY 180
           L KKMVV+D+I PNEYVIAT +SS C SQMYVEGKQCHGYALKSGLE HQYVKNALIQ+Y
Sbjct: 121 LLKKMVVEDDICPNEYVIATIVSSCCGSQMYVEGKQCHGYALKSGLELHQYVKNALIQMY 180

Query: 181 SKCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYV 240
           SKCSDV AA+QIL TVPG DIFCYNLV+NGLL+H+H+ EA++VL L+I E IEWNNATYV
Sbjct: 181 SKCSDVRAAMQILDTVPGYDIFCYNLVLNGLLEHSHLREAIEVLNLMIGEEIEWNNATYV 240

Query: 241 TIFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSR 300
           TIFRLCASLKD+ LGKQVHAQML++DID DVYIGSSIIDMYGKCG VLSGRTFFDRLQS+
Sbjct: 241 TIFRLCASLKDLELGKQVHAQMLRNDIDYDVYIGSSIIDMYGKCGKVLSGRTFFDRLQSQ 300

Query: 301 NVVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLH 360
           NVVSWT+I+AAYFQN FFEEALNLFSKMEIDCIPPNEYT+AV  NSAAGLSAL  GDQLH
Sbjct: 301 NVVSWTTIMAAYFQNGFFEEALNLFSKMEIDCIPPNEYTLAVSLNSAAGLSALSHGDQLH 360

Query: 361 ARAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGK 420
           ARAEKSGLKGNV+VGNALIIMY KSGDILAAQ VFSNM CC+ ITWNAIITGHSHHGLGK
Sbjct: 361 ARAEKSGLKGNVIVGNALIIMYSKSGDILAAQHVFSNMICCDSITWNAIITGHSHHGLGK 420

Query: 421 EALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCI 480
           EALSMFQDM+ATGE PNYVTFIGV+ ACAHL LV EGFYYFNHLMKQF IVPGLEHYTCI
Sbjct: 421 EALSMFQDMLATGECPNYVTFIGVLSACAHLSLVHEGFYYFNHLMKQFGIVPGLEHYTCI 480

Query: 481 VGLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDV 540
           +GLLSRSG+LDEAENFMRS+ INWDVV+WRTLL ACYVH++YDKG++IAEYLLQ++P DV
Sbjct: 481 IGLLSRSGQLDEAENFMRSNPINWDVVAWRTLLTACYVHRNYDKGKQIAEYLLQMDPEDV 540

Query: 541 GTYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEAN 600
           G+YILLSNMHARVRRWD VV+IRKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHPE++
Sbjct: 541 GSYILLSNMHARVRRWDGVVKIRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHPESS 600

Query: 601 LIYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPI 660
            IYE V+DLLS+I+PLGYVPDI  VLHDI+DEQK+DNLSYHSEKLAVAYGLMKTP GAPI
Sbjct: 601 QIYEKVRDLLSRIQPLGYVPDIAGVLHDIDDEQKLDNLSYHSEKLAVAYGLMKTPLGAPI 660

Query: 661 TVIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709
            VIKNLRMCDDCHTA+KLISKVANRVI+VRDANRFHHF++GCCSCGDYW
Sbjct: 661 RVIKNLRMCDDCHTAVKLISKVANRVIIVRDANRFHHFEDGCCSCGDYW 704

BLAST of CsaV3_5G013110 vs. NCBI nr
Match: XP_022933883.1 (pentatricopeptide repeat-containing protein At5g39680 [Cucurbita moschata])

HSP 1 Score: 1203.0 bits (3111), Expect = 0.0e+00
Identity = 583/711 (82.00%), Positives = 648/711 (91.14%), Query Frame = 0

Query: 1   MSILKL--PITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTI 60
           M++LKL  PI+ + PVKFTPFLS+SN LASP  DP+KLLKVAADAKNLKFGRTIHAHL I
Sbjct: 1   MAMLKLPVPISSLAPVKFTPFLSKSNDLASPLLDPMKLLKVAADAKNLKFGRTIHAHLII 60

Query: 61  TNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEV 120
           TN    D +VNQ+NSLINLYVKCDE+ IAR++FD M +RNVVSW ALMAGYMQNG+PLEV
Sbjct: 61  TNRLPGDCRVNQINSLINLYVKCDELFIARQMFDRMSKRNVVSWCALMAGYMQNGSPLEV 120

Query: 121 FELFKKMVVKDNIFPNEYVIATAISSC-DSQMYVEGKQCHGYALKSGLEFHQYVKNALIQ 180
           FELFKKM+VKDNIFPNEYVIAT ISSC DSQMYVEG+QCHG++LKSGLE HQYVKNALIQ
Sbjct: 121 FELFKKMIVKDNIFPNEYVIATVISSCRDSQMYVEGRQCHGFSLKSGLELHQYVKNALIQ 180

Query: 181 LYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNAT 240
           +YSKCSDV AA++IL TVPG D+FCYNLV+NGLL+H+H+ EA++VLKL+I EG +WNNAT
Sbjct: 181 MYSKCSDVRAALKILDTVPGYDVFCYNLVLNGLLEHSHVREAIEVLKLMIGEGTKWNNAT 240

Query: 241 YVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQ 300
           +VTIFR+CASLKD+  GK VHA+MLKSDID DVYIGSSIIDMYGKCGNVLSGR FFD+LQ
Sbjct: 241 FVTIFRICASLKDLKFGKHVHARMLKSDIDYDVYIGSSIIDMYGKCGNVLSGRAFFDQLQ 300

Query: 301 SRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQ 360
           +RNVVSWT+I+AAYFQN FFEEALNLFSKMEID IPPNEYT+AVL NSAAGLSAL  GDQ
Sbjct: 301 NRNVVSWTAIMAAYFQNGFFEEALNLFSKMEIDHIPPNEYTLAVLLNSAAGLSALSHGDQ 360

Query: 361 LHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGL 420
           LHARAEKSGLKGNV+VGNALIIMY KSGDILAAQ VFSNM CC+ ITWNAIITGHSHH +
Sbjct: 361 LHARAEKSGLKGNVIVGNALIIMYSKSGDILAAQCVFSNMKCCDSITWNAIITGHSHHCI 420

Query: 421 GKEALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYT 480
           GKEAL++F DM+   E PNYVTFIGV+ ACAHL LVDEG YYFNHLMKQF IVPGLEHYT
Sbjct: 421 GKEALNIFHDMLTARECPNYVTFIGVLSACAHLSLVDEGLYYFNHLMKQFGIVPGLEHYT 480

Query: 481 CIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPR 540
           CIVGLLSRSGRLDEAENFMRS+ INWDVV+WRTLLNACYVH++YDKG++IAEYLLQ++  
Sbjct: 481 CIVGLLSRSGRLDEAENFMRSNPINWDVVAWRTLLNACYVHRNYDKGKQIAEYLLQMDHE 540

Query: 541 DVGTYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPE 600
           DVG+YILLSNMHARVRRWD VV++RKLMRERNVKKEPGVSWLEIRN+AHVFTSED KHPE
Sbjct: 541 DVGSYILLSNMHARVRRWDGVVKVRKLMRERNVKKEPGVSWLEIRNIAHVFTSEDNKHPE 600

Query: 601 ANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGA 660
           ++ IYE V+DLL+KIRPLGYVPDI  VLHDIEDEQK+DNLSYHSEKLAVAYGLMK PSGA
Sbjct: 601 SSQIYEMVRDLLTKIRPLGYVPDIAGVLHDIEDEQKLDNLSYHSEKLAVAYGLMKLPSGA 660

Query: 661 PITVIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709
           PI VIKNLRMCDDCHTAIKLISK+ANR I+VRDANRFHHFQ+G CSCGDYW
Sbjct: 661 PIRVIKNLRMCDDCHTAIKLISKLANRTIIVRDANRFHHFQDGFCSCGDYW 711

BLAST of CsaV3_5G013110 vs. TAIR10
Match: AT5G39680.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 749.2 bits (1933), Expect = 2.3e-216
Identity = 359/675 (53.19%), Positives = 477/675 (70.67%), Query Frame = 0

Query: 35  KLLKVAADAKNLKFGRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSM 94
           +LLKV A++  L+ G +IHAHL +TN + R     Q+NSLINLYVKC E   ARKLFD M
Sbjct: 36  ELLKVCANSSYLRIGESIHAHLIVTNQSSRAEDAYQINSLINLYVKCRETVRARKLFDLM 95

Query: 95  PRRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSC-DSQMYVEG 154
           P RNVVSW A+M GY  +G   EV +LFK M       PNE+V      SC +S    EG
Sbjct: 96  PERNVVSWCAMMKGYQNSGFDFEVLKLFKSMFFSGESRPNEFVATVVFKSCSNSGRIEEG 155

Query: 155 KQCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQH 214
           KQ HG  LK GL  H++V+N L+ +YS CS  G AI++L  +P  D+  ++  ++G L+ 
Sbjct: 156 KQFHGCFLKYGLISHEFVRNTLVYMYSLCSGNGEAIRVLDDLPYCDLSVFSSALSGYLEC 215

Query: 215 THMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVYIG 274
               E +DVL+   +E   WNN TY++  RL ++L+D+ L  QVH++M++   + +V   
Sbjct: 216 GAFKEGLDVLRKTANEDFVWNNLTYLSSLRLFSNLRDLNLALQVHSRMVRFGFNAEVEAC 275

Query: 275 SSIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIP 334
            ++I+MYGKCG VL  +  FD   ++N+   T+I+ AYFQ++ FEEALNLFSKM+   +P
Sbjct: 276 GALINMYGKCGKVLYAQRVFDDTHAQNIFLNTTIMDAYFQDKSFEEALNLFSKMDTKEVP 335

Query: 335 PNEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQSV 394
           PNEYT A+L NS A LS L  GD LH    KSG + +VMVGNAL+ MY KSG I  A+  
Sbjct: 336 PNEYTFAILLNSIAELSLLKQGDLLHGLVLKSGYRNHVMVGNALVNMYAKSGSIEDARKA 395

Query: 395 FSNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGERPNYVTFIGVILACAHLKLV 454
           FS MT  +I+TWN +I+G SHHGLG+EAL  F  M+ TGE PN +TFIGV+ AC+H+  V
Sbjct: 396 FSGMTFRDIVTWNTMISGCSHHGLGREALEAFDRMIFTGEIPNRITFIGVLQACSHIGFV 455

Query: 455 DEGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTLLN 514
           ++G +YFN LMK+F + P ++HYTCIVGLLS++G   +AE+FMR+  I WDVV+WRTLLN
Sbjct: 456 EQGLHYFNQLMKKFDVQPDIQHYTCIVGLLSKAGMFKDAEDFMRTAPIEWDVVAWRTLLN 515

Query: 515 ACYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMRERNVKKE 574
           ACYV ++Y  G+K+AEY ++  P D G Y+LLSN+HA+ R W+ V ++R LM  R VKKE
Sbjct: 516 ACYVRRNYRLGKKVAEYAIEKYPNDSGVYVLLSNIHAKSREWEGVAKVRSLMNNRGVKKE 575

Query: 575 PGVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQK 634
           PGVSW+ IRN  HVF +ED +HPE  LIY  VK+++SKI+PLGY PD+    HD+++EQ+
Sbjct: 576 PGVSWIGIRNQTHVFLAEDNQHPEITLIYAKVKEVMSKIKPLGYSPDVAGAFHDVDEEQR 635

Query: 635 VDNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVIVVRDANR 694
            DNLSYHSEKLAVAYGL+KTP  +P+ V KN+R+CDDCH+AIKLISK++ R IV+RD+NR
Sbjct: 636 EDNLSYHSEKLAVAYGLIKTPEKSPLYVTKNVRICDDCHSAIKLISKISKRYIVIRDSNR 695

Query: 695 FHHFQNGCCSCGDYW 709
           FHHF +G CSC DYW
Sbjct: 696 FHHFLDGQCSCCDYW 710

BLAST of CsaV3_5G013110 vs. TAIR10
Match: AT4G33170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 499.2 bits (1284), Expect = 4.2e-141
Identity = 260/677 (38.40%), Positives = 400/677 (59.08%), Query Frame = 0

Query: 34  IKLLKVAADAKNLKFGRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDS 93
           I +L  A    +L  G+ +H           D  +   NSLIN+Y K  +   AR +FD+
Sbjct: 319 ILMLATAVKVDSLALGQQVHCMALKLG---LDLMLTVSNSLINMYCKLRKFGFARTVFDN 378

Query: 94  MPRRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSCDS--QMYV 153
           M  R+++SW++++AG  QNG  +E   LF ++ ++  + P++Y + + + +  S  +   
Sbjct: 379 MSERDLISWNSVIAGIAQNGLEVEAVCLFMQL-LRCGLKPDQYTMTSVLKAASSLPEGLS 438

Query: 154 EGKQCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLL 213
             KQ H +A+K       +V  ALI  YS+   +  A +IL+     D+  +N ++ G  
Sbjct: 439 LSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEA-EILFERHNFDLVAWNAMMAGYT 498

Query: 214 QHTHMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVY 273
           Q     + + +  L+  +G   ++ T  T+F+ C  L  I  GKQVHA  +KS  D D++
Sbjct: 499 QSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLW 558

Query: 274 IGSSIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDC 333
           + S I+DMY KCG++ + +  FD +   + V+WT++I+   +N   E A ++FS+M +  
Sbjct: 559 VSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMG 618

Query: 334 IPPNEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQ 393
           + P+E+T+A L  +++ L+AL  G Q+HA A K     +  VG +L+ MY K G I  A 
Sbjct: 619 VLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAY 678

Query: 394 SVFSNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGERPNYVTFIGVILACAHLK 453
            +F  +   NI  WNA++ G + HG GKE L +F+ M + G +P+ VTFIGV+ AC+H  
Sbjct: 679 CLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSG 738

Query: 454 LVDEGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTL 513
           LV E + +   +   + I P +EHY+C+   L R+G + +AEN + S  +      +RTL
Sbjct: 739 LVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTL 798

Query: 514 LNACYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMRERNVK 573
           L AC V    + G+++A  LL+LEP D   Y+LLSNM+A   +WD +   R +M+   VK
Sbjct: 799 LAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVK 858

Query: 574 KEPGVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDE 633
           K+PG SW+E++N  H+F  +D  + +  LIY  VKD++  I+  GYVP+ D  L D+E+E
Sbjct: 859 KDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMIRDIKQEGYVPETDFTLVDVEEE 918

Query: 634 QKVDNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVIVVRDA 693
           +K   L YHSEKLAVA+GL+ TP   PI VIKNLR+C DCH A+K I+KV NR IV+RDA
Sbjct: 919 EKERALYYHSEKLAVAFGLLSTPPSTPIRVIKNLRVCGDCHNAMKYIAKVYNREIVLRDA 978

Query: 694 NRFHHFQNGCCSCGDYW 709
           NRFH F++G CSCGDYW
Sbjct: 979 NRFHRFKDGICSCGDYW 990

BLAST of CsaV3_5G013110 vs. TAIR10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 480.7 bits (1236), Expect = 1.5e-135
Identity = 244/682 (35.78%), Positives = 392/682 (57.48%), Query Frame = 0

Query: 29   PHQDPIKLLKVAADAKNLKF-GRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIA 88
            P  + +  L VA  A    F G+ +HA+ T       ++K+    +L+NLY KC ++  A
Sbjct: 387  PDSNTLASLVVACSADGTLFRGQQLHAYTTKLGF-ASNNKIE--GALLNLYAKCADIETA 446

Query: 89   RKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSCDS 148
               F      NVV W+ ++  Y    +    F +F++M +++ I PN+Y   + + +C  
Sbjct: 447  LDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEE-IVPNQYTYPSILKTCIR 506

Query: 149  QMYVE-GKQCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLV 208
               +E G+Q H   +K+  + + YV + LI +Y+K   +  A  IL    G D+  +  +
Sbjct: 507  LGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTM 566

Query: 209  VNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDI 268
            + G  Q+    +A+   + ++  GI  +          CA L+ +  G+Q+HAQ   S  
Sbjct: 567  IAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGF 626

Query: 269  DCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAYFQNEFFEEALNLFSK 328
              D+   ++++ +Y +CG +      F++ ++ + ++W ++++ + Q+   EEAL +F +
Sbjct: 627  SSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVR 686

Query: 329  MEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGD 388
            M  + I  N +T      +A+  + +  G Q+HA   K+G      V NALI MY K G 
Sbjct: 687  MNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGS 746

Query: 389  ILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGERPNYVTFIGVILA 448
            I  A+  F  ++  N ++WNAII  +S HG G EAL  F  M+ +  RPN+VT +GV+ A
Sbjct: 747  ISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSA 806

Query: 449  CAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVV 508
            C+H+ LVD+G  YF  +  ++ + P  EHY C+V +L+R+G L  A+ F++   I  D +
Sbjct: 807  CSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDAL 866

Query: 509  SWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMR 568
             WRTLL+AC VHK+ + G   A +LL+LEP D  TY+LLSN++A  ++WD     R+ M+
Sbjct: 867  VWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMK 926

Query: 569  ERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLH 628
            E+ VKKEPG SW+E++N  H F   D  HP A+ I+E  +DL  +   +GYV D  ++L+
Sbjct: 927  EKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCFSLLN 986

Query: 629  DIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVI 688
            +++ EQK   +  HSEKLA+++GL+  P+  PI V+KNLR+C+DCH  IK +SKV+NR I
Sbjct: 987  ELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVCNDCHAWIKFVSKVSNREI 1046

Query: 689  VVRDANRFHHFQNGCCSCGDYW 709
            +VRDA RFHHF+ G CSC DYW
Sbjct: 1047 IVRDAYRFHHFEGGACSCKDYW 1064

BLAST of CsaV3_5G013110 vs. TAIR10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 473.8 bits (1218), Expect = 1.9e-133
Identity = 250/675 (37.04%), Positives = 408/675 (60.44%), Query Frame = 0

Query: 41  ADAKNLKFGRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVV 100
           A+   LK GR +H H+  T     D  V   N L+N+Y KC  ++ AR++F  M  ++ V
Sbjct: 324 AEEVGLKKGREVHGHVITT--GLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSV 383

Query: 101 SWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSCDSQMYVE-GKQCHGY 160
           SW++++ G  QNG  +E  E +K M  + +I P  + + +++SSC S  + + G+Q HG 
Sbjct: 384 SWNSMITGLDQNGCFIEAVERYKSM-RRHDILPGSFTLISSLSSCASLKWAKLGQQIHGE 443

Query: 161 ALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQ-HTHMAE 220
           +LK G++ +  V NAL+ LY++   +    +I  ++P +D   +N ++  L +    + E
Sbjct: 444 SLKLGIDLNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNSIIGALARSERSLPE 503

Query: 221 AVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIID 280
           AV         G + N  T+ ++    +SL    LGKQ+H   LK++I  +    +++I 
Sbjct: 504 AVVCFLNAQRAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALIA 563

Query: 281 MYGKCGNVLSGRTFFDRL-QSRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEY 340
            YGKCG +      F R+ + R+ V+W S+I+ Y  NE   +AL+L   M       + +
Sbjct: 564 CYGKCGEMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSF 623

Query: 341 TMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNM 400
             A + ++ A ++ L  G ++HA + ++ L+ +V+VG+AL+ MY K G +  A   F+ M
Sbjct: 624 MYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTM 683

Query: 401 TCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGER-PNYVTFIGVILACAHLKLVDEG 460
              N  +WN++I+G++ HG G+EAL +F+ M   G+  P++VTF+GV+ AC+H  L++EG
Sbjct: 684 PVRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEG 743

Query: 461 FYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNAC- 520
           F +F  +   + + P +EH++C+  +L R+G LD+ E+F+    +  +V+ WRT+L AC 
Sbjct: 744 FKHFESMSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACC 803

Query: 521 -YVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMRERNVKKEP 580
               +  + G+K AE L QLEP +   Y+LL NM+A   RW+ +V+ RK M++ +VKKE 
Sbjct: 804 RANGRKAELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEA 863

Query: 581 GVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKV 640
           G SW+ +++  H+F + D  HP+A++IY+ +K+L  K+R  GYVP     L+D+E E K 
Sbjct: 864 GYSWVTMKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRDAGYVPQTGFALYDLEQENKE 923

Query: 641 DNLSYHSEKLAVAYGL-MKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVIVVRDANR 700
           + LSYHSEKLAVA+ L  +  S  PI ++KNLR+C DCH+A K ISK+  R I++RD+NR
Sbjct: 924 EILSYHSEKLAVAFVLAAQRSSTLPIRIMKNLRVCGDCHSAFKYISKIEGRQIILRDSNR 983

Query: 701 FHHFQNGCCSCGDYW 709
           FHHFQ+G CSC D+W
Sbjct: 984 FHHFQDGACSCSDFW 995
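
The percentages in each HSP summary are the match counts divided by the alignment length. As a minimal sketch, the two summary lines can be parsed and the percentages recomputed as shown here. The function name `parse_hsp` and the dictionary keys are hypothetical and not part of any BLAST tool; only the field labels come from the report text. The pattern accepts both "Positives" and the "Postives" spelling.

```python
import re

# Field labels ("Score", "Expect", "Identity", "Positives") follow the report
# text; all function and key names below are illustrative assumptions.
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)"
)
# "Posi?tives" also matches the misspelled label "Postives".
HSP_COUNTS = re.compile(
    r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp(score_line, counts_line):
    """Parse one HSP summary (score line plus counts line) into a dict."""
    m = HSP_SCORE.search(score_line)
    if m is None:
        raise ValueError("not an HSP score line: %r" % score_line)
    result = {
        "bits": float(m.group(1)),
        "raw_score": int(m.group(2)),
        "expect": float(m.group(3)),
    }
    for label, hits, length, pct in HSP_COUNTS.findall(counts_line):
        key = "identity" if label == "Identity" else "positives"
        result[key] = {
            "count": int(hits),
            "length": int(length),
            "reported_pct": float(pct),
            # Recompute the percentage from the raw counts.
            "computed_pct": round(100 * int(hits) / int(length), 2),
        }
    return result


hsp = parse_hsp(
    "HSP 1 Score: 473.8 bits (1218), Expect = 1.9e-133",
    "Identity = 250/675 (37.04%), Postives = 408/675 (60.44%), Query Frame = 0",
)
print(hsp["identity"], hsp["positives"])
```

For the AT5G09950.1 hit above, the recomputed values (250/675 and 408/675) agree with the reported 37.04% and 60.44%.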

BLAST of CsaV3_5G013110 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 468.8 bits (1205), Expect = 6.0e-132
Identity = 247/674 (36.65%), Positives = 391/674 (58.01%), Query Frame = 0

Query: 36  LLKVAADAKNLKFGRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMP 95
           LLKV  D   L+ G+ IH  L  +  +     +  +  L N+Y KC +V+ ARK+FD MP
Sbjct: 141 LLKVCGDEAELRVGKEIHGLLVKSGFSL---DLFAMTGLENMYAKCRQVNEARKVFDRMP 200

Query: 96  RRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSCDSQMYVE-GK 155
            R++VSW+ ++AGY QNG      E+ K M  ++N+ P+   I + + +  +   +  GK
Sbjct: 201 ERDLVSWNTIVAGYSQNGMARMALEMVKSM-CEENLKPSFITIVSVLPAVSALRLISVGK 260

Query: 156 QCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHT 215
           + HGYA++SG +    +  AL+ +Y+KC  +  A Q+   +   ++  +N +++  +Q+ 
Sbjct: 261 EIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNE 320

Query: 216 HMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGS 275
           +  EA+ + + ++ EG++  + + +     CA L D+  G+ +H   ++  +D +V + +
Sbjct: 321 NPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVN 380

Query: 276 SIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPP 335
           S+I MY KC  V +  + F +LQSR +VSW ++I  + QN    +ALN FS+M    + P
Sbjct: 381 SLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKP 440

Query: 336 NEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVF 395
           + +T   +  + A LS       +H    +S L  NV V  AL+ MY K G I+ A+ +F
Sbjct: 441 DTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIF 500

Query: 396 SNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGERPNYVTFIGVILACAHLKLVD 455
             M+  ++ TWNA+I G+  HG GK AL +F++M     +PN VTF+ VI AC+H  LV+
Sbjct: 501 DMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVE 560

Query: 456 EGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNA 515
            G   F  + + + I   ++HY  +V LL R+GRL+EA +F+    +   V  +  +L A
Sbjct: 561 AGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGA 620

Query: 516 CYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMRERNVKKEP 575
           C +HK+ +   K AE L +L P D G ++LL+N++     W+ V ++R  M  + ++K P
Sbjct: 621 CQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTP 680

Query: 576 GVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKV 635
           G S +EI+N  H F S    HP++  IY  ++ L+  I+  GYVPD + VL  +E++ K 
Sbjct: 681 GCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLVL-GVENDVKE 740

Query: 636 DNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVIVVRDANRF 695
             LS HSEKLA+++GL+ T +G  I V KNLR+C DCH A K IS V  R IVVRD  RF
Sbjct: 741 QLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRF 800

Query: 696 HHFQNGCCSCGDYW 709
           HHF+NG CSCGDYW
Sbjct: 801 HHFKNGACSCGDYW 809

BLAST of CsaV3_5G013110 vs. Swiss-Prot
Match: sp|Q9FK93|PP406_ARATH (Pentatricopeptide repeat-containing protein At5g39680 OS=Arabidopsis thaliana OX=3702 GN=EMB2744 PE=1 SV=1)

HSP 1 Score: 749.2 bits (1933), Expect = 4.2e-215
Identity = 359/675 (53.19%), Positives = 477/675 (70.67%), Query Frame = 0

Query: 35  KLLKVAADAKNLKFGRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSM 94
           +LLKV A++  L+ G +IHAHL +TN + R     Q+NSLINLYVKC E   ARKLFD M
Sbjct: 36  ELLKVCANSSYLRIGESIHAHLIVTNQSSRAEDAYQINSLINLYVKCRETVRARKLFDLM 95

Query: 95  PRRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSC-DSQMYVEG 154
           P RNVVSW A+M GY  +G   EV +LFK M       PNE+V      SC +S    EG
Sbjct: 96  PERNVVSWCAMMKGYQNSGFDFEVLKLFKSMFFSGESRPNEFVATVVFKSCSNSGRIEEG 155

Query: 155 KQCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQH 214
           KQ HG  LK GL  H++V+N L+ +YS CS  G AI++L  +P  D+  ++  ++G L+ 
Sbjct: 156 KQFHGCFLKYGLISHEFVRNTLVYMYSLCSGNGEAIRVLDDLPYCDLSVFSSALSGYLEC 215

Query: 215 THMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVYIG 274
               E +DVL+   +E   WNN TY++  RL ++L+D+ L  QVH++M++   + +V   
Sbjct: 216 GAFKEGLDVLRKTANEDFVWNNLTYLSSLRLFSNLRDLNLALQVHSRMVRFGFNAEVEAC 275

Query: 275 SSIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIP 334
            ++I+MYGKCG VL  +  FD   ++N+   T+I+ AYFQ++ FEEALNLFSKM+   +P
Sbjct: 276 GALINMYGKCGKVLYAQRVFDDTHAQNIFLNTTIMDAYFQDKSFEEALNLFSKMDTKEVP 335

Query: 335 PNEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQSV 394
           PNEYT A+L NS A LS L  GD LH    KSG + +VMVGNAL+ MY KSG I  A+  
Sbjct: 336 PNEYTFAILLNSIAELSLLKQGDLLHGLVLKSGYRNHVMVGNALVNMYAKSGSIEDARKA 395

Query: 395 FSNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGERPNYVTFIGVILACAHLKLV 454
           FS MT  +I+TWN +I+G SHHGLG+EAL  F  M+ TGE PN +TFIGV+ AC+H+  V
Sbjct: 396 FSGMTFRDIVTWNTMISGCSHHGLGREALEAFDRMIFTGEIPNRITFIGVLQACSHIGFV 455

Query: 455 DEGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTLLN 514
           ++G +YFN LMK+F + P ++HYTCIVGLLS++G   +AE+FMR+  I WDVV+WRTLLN
Sbjct: 456 EQGLHYFNQLMKKFDVQPDIQHYTCIVGLLSKAGMFKDAEDFMRTAPIEWDVVAWRTLLN 515

Query: 515 ACYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMRERNVKKE 574
           ACYV ++Y  G+K+AEY ++  P D G Y+LLSN+HA+ R W+ V ++R LM  R VKKE
Sbjct: 516 ACYVRRNYRLGKKVAEYAIEKYPNDSGVYVLLSNIHAKSREWEGVAKVRSLMNNRGVKKE 575

Query: 575 PGVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQK 634
           PGVSW+ IRN  HVF +ED +HPE  LIY  VK+++SKI+PLGY PD+    HD+++EQ+
Sbjct: 576 PGVSWIGIRNQTHVFLAEDNQHPEITLIYAKVKEVMSKIKPLGYSPDVAGAFHDVDEEQR 635

Query: 635 VDNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVIVVRDANR 694
            DNLSYHSEKLAVAYGL+KTP  +P+ V KN+R+CDDCH+AIKLISK++ R IV+RD+NR
Sbjct: 636 EDNLSYHSEKLAVAYGLIKTPEKSPLYVTKNVRICDDCHSAIKLISKISKRYIVIRDSNR 695

Query: 695 FHHFQNGCCSCGDYW 709
           FHHF +G CSC DYW
Sbjct: 696 FHHFLDGQCSCCDYW 710

BLAST of CsaV3_5G013110 vs. Swiss-Prot
Match: sp|Q9SMZ2|PP347_ARATH (Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H53 PE=3 SV=1)

HSP 1 Score: 499.2 bits (1284), Expect = 7.5e-140
Identity = 260/677 (38.40%), Positives = 400/677 (59.08%), Query Frame = 0

Query: 34  IKLLKVAADAKNLKFGRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDS 93
           I +L  A    +L  G+ +H           D  +   NSLIN+Y K  +   AR +FD+
Sbjct: 319 ILMLATAVKVDSLALGQQVHCMALKLG---LDLMLTVSNSLINMYCKLRKFGFARTVFDN 378

Query: 94  MPRRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSCDS--QMYV 153
           M  R+++SW++++AG  QNG  +E   LF ++ ++  + P++Y + + + +  S  +   
Sbjct: 379 MSERDLISWNSVIAGIAQNGLEVEAVCLFMQL-LRCGLKPDQYTMTSVLKAASSLPEGLS 438

Query: 154 EGKQCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLL 213
             KQ H +A+K       +V  ALI  YS+   +  A +IL+     D+  +N ++ G  
Sbjct: 439 LSKQVHVHAIKINNVSDSFVSTALIDAYSRNRCMKEA-EILFERHNFDLVAWNAMMAGYT 498

Query: 214 QHTHMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVY 273
           Q     + + +  L+  +G   ++ T  T+F+ C  L  I  GKQVHA  +KS  D D++
Sbjct: 499 QSHDGHKTLKLFALMHKQGERSDDFTLATVFKTCGFLFAINQGKQVHAYAIKSGYDLDLW 558

Query: 274 IGSSIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDC 333
           + S I+DMY KCG++ + +  FD +   + V+WT++I+   +N   E A ++FS+M +  
Sbjct: 559 VSSGILDMYVKCGDMSAAQFAFDSIPVPDDVAWTTMISGCIENGEEERAFHVFSQMRLMG 618

Query: 334 IPPNEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQ 393
           + P+E+T+A L  +++ L+AL  G Q+HA A K     +  VG +L+ MY K G I  A 
Sbjct: 619 VLPDEFTIATLAKASSCLTALEQGRQIHANALKLNCTNDPFVGTSLVDMYAKCGSIDDAY 678

Query: 394 SVFSNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGERPNYVTFIGVILACAHLK 453
            +F  +   NI  WNA++ G + HG GKE L +F+ M + G +P+ VTFIGV+ AC+H  
Sbjct: 679 CLFKRIEMMNITAWNAMLVGLAQHGEGKETLQLFKQMKSLGIKPDKVTFIGVLSACSHSG 738

Query: 454 LVDEGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTL 513
           LV E + +   +   + I P +EHY+C+   L R+G + +AEN + S  +      +RTL
Sbjct: 739 LVSEAYKHMRSMHGDYGIKPEIEHYSCLADALGRAGLVKQAENLIESMSMEASASMYRTL 798

Query: 514 LNACYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMRERNVK 573
           L AC V    + G+++A  LL+LEP D   Y+LLSNM+A   +WD +   R +M+   VK
Sbjct: 799 LAACRVQGDTETGKRVATKLLELEPLDSSAYVLLSNMYAAASKWDEMKLARTMMKGHKVK 858

Query: 574 KEPGVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDE 633
           K+PG SW+E++N  H+F  +D  + +  LIY  VKD++  I+  GYVP+ D  L D+E+E
Sbjct: 859 KDPGFSWIEVKNKIHIFVVDDRSNRQTELIYRKVKDMIRDIKQEGYVPETDFTLVDVEEE 918

Query: 634 QKVDNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVIVVRDA 693
           +K   L YHSEKLAVA+GL+ TP   PI VIKNLR+C DCH A+K I+KV NR IV+RDA
Sbjct: 919 EKERALYYHSEKLAVAFGLLSTPPSTPIRVIKNLRVCGDCHNAMKYIAKVYNREIVLRDA 978

Query: 694 NRFHHFQNGCCSCGDYW 709
           NRFH F++G CSCGDYW
Sbjct: 979 NRFHRFKDGICSCGDYW 990

BLAST of CsaV3_5G013110 vs. Swiss-Prot
Match: sp|Q9SVP7|PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 480.7 bits (1236), Expect = 2.8e-134
Identity = 244/682 (35.78%), Positives = 392/682 (57.48%), Query Frame = 0

Query: 29   PHQDPIKLLKVAADAKNLKF-GRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIA 88
            P  + +  L VA  A    F G+ +HA+ T       ++K+    +L+NLY KC ++  A
Sbjct: 387  PDSNTLASLVVACSADGTLFRGQQLHAYTTKLGF-ASNNKIE--GALLNLYAKCADIETA 446

Query: 89   RKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSCDS 148
               F      NVV W+ ++  Y    +    F +F++M +++ I PN+Y   + + +C  
Sbjct: 447  LDYFLETEVENVVLWNVMLVAYGLLDDLRNSFRIFRQMQIEE-IVPNQYTYPSILKTCIR 506

Query: 149  QMYVE-GKQCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLV 208
               +E G+Q H   +K+  + + YV + LI +Y+K   +  A  IL    G D+  +  +
Sbjct: 507  LGDLELGEQIHSQIIKTNFQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTM 566

Query: 209  VNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDI 268
            + G  Q+    +A+   + ++  GI  +          CA L+ +  G+Q+HAQ   S  
Sbjct: 567  IAGYTQYNFDDKALTTFRQMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGF 626

Query: 269  DCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAYFQNEFFEEALNLFSK 328
              D+   ++++ +Y +CG +      F++ ++ + ++W ++++ + Q+   EEAL +F +
Sbjct: 627  SSDLPFQNALVTLYSRCGKIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVR 686

Query: 329  MEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGD 388
            M  + I  N +T      +A+  + +  G Q+HA   K+G      V NALI MY K G 
Sbjct: 687  MNREGIDNNNFTFGSAVKAASETANMKQGKQVHAVITKTGYDSETEVCNALISMYAKCGS 746

Query: 389  ILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGERPNYVTFIGVILA 448
            I  A+  F  ++  N ++WNAII  +S HG G EAL  F  M+ +  RPN+VT +GV+ A
Sbjct: 747  ISDAEKQFLEVSTKNEVSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSA 806

Query: 449  CAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVV 508
            C+H+ LVD+G  YF  +  ++ + P  EHY C+V +L+R+G L  A+ F++   I  D +
Sbjct: 807  CSHIGLVDKGIAYFESMNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDAL 866

Query: 509  SWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMR 568
             WRTLL+AC VHK+ + G   A +LL+LEP D  TY+LLSN++A  ++WD     R+ M+
Sbjct: 867  VWRTLLSACVVHKNMEIGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMK 926

Query: 569  ERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLH 628
            E+ VKKEPG SW+E++N  H F   D  HP A+ I+E  +DL  +   +GYV D  ++L+
Sbjct: 927  EKGVKKEPGQSWIEVKNSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIGYVQDCFSLLN 986

Query: 629  DIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVI 688
            +++ EQK   +  HSEKLA+++GL+  P+  PI V+KNLR+C+DCH  IK +SKV+NR I
Sbjct: 987  ELQHEQKDPIIFIHSEKLAISFGLLSLPATVPINVMKNLRVCNDCHAWIKFVSKVSNREI 1046

Query: 689  VVRDANRFHHFQNGCCSCGDYW 709
            +VRDA RFHHF+ G CSC DYW
Sbjct: 1047 IVRDAYRFHHFEGGACSCKDYW 1064

BLAST of CsaV3_5G013110 vs. Swiss-Prot
Match: sp|Q9FIB2|PP373_ARATH (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 473.8 bits (1218), Expect = 3.4e-132
Identity = 250/675 (37.04%), Positives = 408/675 (60.44%), Query Frame = 0

Query: 41  ADAKNLKFGRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVV 100
           A+   LK GR +H H+  T     D  V   N L+N+Y KC  ++ AR++F  M  ++ V
Sbjct: 324 AEEVGLKKGREVHGHVITT--GLVDFMVGIGNGLVNMYAKCGSIADARRVFYFMTDKDSV 383

Query: 101 SWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSCDSQMYVE-GKQCHGY 160
           SW++++ G  QNG  +E  E +K M  + +I P  + + +++SSC S  + + G+Q HG 
Sbjct: 384 SWNSMITGLDQNGCFIEAVERYKSM-RRHDILPGSFTLISSLSSCASLKWAKLGQQIHGE 443

Query: 161 ALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQ-HTHMAE 220
           +LK G++ +  V NAL+ LY++   +    +I  ++P +D   +N ++  L +    + E
Sbjct: 444 SLKLGIDLNVSVSNALMTLYAETGYLNECRKIFSSMPEHDQVSWNSIIGALARSERSLPE 503

Query: 221 AVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIID 280
           AV         G + N  T+ ++    +SL    LGKQ+H   LK++I  +    +++I 
Sbjct: 504 AVVCFLNAQRAGQKLNRITFSSVLSAVSSLSFGELGKQIHGLALKNNIADEATTENALIA 563

Query: 281 MYGKCGNVLSGRTFFDRL-QSRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEY 340
            YGKCG +      F R+ + R+ V+W S+I+ Y  NE   +AL+L   M       + +
Sbjct: 564 CYGKCGEMDGCEKIFSRMAERRDNVTWNSMISGYIHNELLAKALDLVWFMLQTGQRLDSF 623

Query: 341 TMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNM 400
             A + ++ A ++ L  G ++HA + ++ L+ +V+VG+AL+ MY K G +  A   F+ M
Sbjct: 624 MYATVLSAFASVATLERGMEVHACSVRACLESDVVVGSALVDMYSKCGRLDYALRFFNTM 683

Query: 401 TCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGER-PNYVTFIGVILACAHLKLVDEG 460
              N  +WN++I+G++ HG G+EAL +F+ M   G+  P++VTF+GV+ AC+H  L++EG
Sbjct: 684 PVRNSYSWNSMISGYARHGQGEEALKLFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEG 743

Query: 461 FYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNAC- 520
           F +F  +   + + P +EH++C+  +L R+G LD+ E+F+    +  +V+ WRT+L AC 
Sbjct: 744 FKHFESMSDSYGLAPRIEHFSCMADVLGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACC 803

Query: 521 -YVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMRERNVKKEP 580
               +  + G+K AE L QLEP +   Y+LL NM+A   RW+ +V+ RK M++ +VKKE 
Sbjct: 804 RANGRKAELGKKAAEMLFQLEPENAVNYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEA 863

Query: 581 GVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKV 640
           G SW+ +++  H+F + D  HP+A++IY+ +K+L  K+R  GYVP     L+D+E E K 
Sbjct: 864 GYSWVTMKDGVHMFVAGDKSHPDADVIYKKLKELNRKMRDAGYVPQTGFALYDLEQENKE 923

Query: 641 DNLSYHSEKLAVAYGL-MKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVIVVRDANR 700
           + LSYHSEKLAVA+ L  +  S  PI ++KNLR+C DCH+A K ISK+  R I++RD+NR
Sbjct: 924 EILSYHSEKLAVAFVLAAQRSSTLPIRIMKNLRVCGDCHSAFKYISKIEGRQIILRDSNR 983

Query: 701 FHHFQNGCCSCGDYW 709
           FHHFQ+G CSC D+W
Sbjct: 984 FHHFQDGACSCSDFW 995

BLAST of CsaV3_5G013110 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 468.8 bits (1205), Expect = 1.1e-130
Identity = 247/674 (36.65%), Positives = 391/674 (58.01%), Query Frame = 0

Query: 36  LLKVAADAKNLKFGRTIHAHLTITNHNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMP 95
           LLKV  D   L+ G+ IH  L  +  +     +  +  L N+Y KC +V+ ARK+FD MP
Sbjct: 141 LLKVCGDEAELRVGKEIHGLLVKSGFSL---DLFAMTGLENMYAKCRQVNEARKVFDRMP 200

Query: 96  RRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYVIATAISSCDSQMYVE-GK 155
            R++VSW+ ++AGY QNG      E+ K M  ++N+ P+   I + + +  +   +  GK
Sbjct: 201 ERDLVSWNTIVAGYSQNGMARMALEMVKSM-CEENLKPSFITIVSVLPAVSALRLISVGK 260

Query: 156 QCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHT 215
           + HGYA++SG +    +  AL+ +Y+KC  +  A Q+   +   ++  +N +++  +Q+ 
Sbjct: 261 EIHGYAMRSGFDSLVNISTALVDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNE 320

Query: 216 HMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGS 275
           +  EA+ + + ++ EG++  + + +     CA L D+  G+ +H   ++  +D +V + +
Sbjct: 321 NPKEAMLIFQKMLDEGVKPTDVSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVN 380

Query: 276 SIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPP 335
           S+I MY KC  V +  + F +LQSR +VSW ++I  + QN    +ALN FS+M    + P
Sbjct: 381 SLISMYCKCKEVDTAASMFGKLQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKP 440

Query: 336 NEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVF 395
           + +T   +  + A LS       +H    +S L  NV V  AL+ MY K G I+ A+ +F
Sbjct: 441 DTFTYVSVITAIAELSITHHAKWIHGVVMRSCLDKNVFVTTALVDMYAKCGAIMIARLIF 500

Query: 396 SNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGERPNYVTFIGVILACAHLKLVD 455
             M+  ++ TWNA+I G+  HG GK AL +F++M     +PN VTF+ VI AC+H  LV+
Sbjct: 501 DMMSERHVTTWNAMIDGYGTHGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVE 560

Query: 456 EGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNA 515
            G   F  + + + I   ++HY  +V LL R+GRL+EA +F+    +   V  +  +L A
Sbjct: 561 AGLKCFYMMKENYSIELSMDHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGA 620

Query: 516 CYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWDHVVEIRKLMRERNVKKEP 575
           C +HK+ +   K AE L +L P D G ++LL+N++     W+ V ++R  M  + ++K P
Sbjct: 621 CQIHKNVNFAEKAAERLFELNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTP 680

Query: 576 GVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKV 635
           G S +EI+N  H F S    HP++  IY  ++ L+  I+  GYVPD + VL  +E++ K 
Sbjct: 681 GCSMVEIKNEVHSFFSGSTAHPDSKKIYAFLEKLICHIKEAGYVPDTNLVL-GVENDVKE 740

Query: 636 DNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDCHTAIKLISKVANRVIVVRDANRF 695
             LS HSEKLA+++GL+ T +G  I V KNLR+C DCH A K IS V  R IVVRD  RF
Sbjct: 741 QLLSTHSEKLAISFGLLNTTAGTTIHVRKNLRVCADCHNATKYISLVTGREIVVRDMQRF 800

Query: 696 HHFQNGCCSCGDYW 709
           HHF+NG CSCGDYW
Sbjct: 801 HHFKNGACSCGDYW 809

BLAST of CsaV3_5G013110 vs. TrEMBL
Match: tr|A0A0A0KR26|A0A0A0KR26_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G292190 PE=4 SV=1)

HSP 1 Score: 1449.9 bits (3752), Expect = 0.0e+00
Identity = 708/708 (100.00%), Positives = 708/708 (100.00%), Query Frame = 0

Query: 1   MSILKLPITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITN 60
           MSILKLPITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITN
Sbjct: 1   MSILKLPITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITN 60

Query: 61  HNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFE 120
           HNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFE
Sbjct: 61  HNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFE 120

Query: 121 LFKKMVVKDNIFPNEYVIATAISSCDSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS 180
           LFKKMVVKDNIFPNEYVIATAISSCDSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS
Sbjct: 121 LFKKMVVKDNIFPNEYVIATAISSCDSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS 180

Query: 181 KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVT 240
           KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVT
Sbjct: 181 KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVT 240

Query: 241 IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN 300
           IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN
Sbjct: 241 IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN 300

Query: 301 VVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHA 360
           VVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHA
Sbjct: 301 VVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHA 360

Query: 361 RAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKE 420
           RAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKE
Sbjct: 361 RAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKE 420

Query: 421 ALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIV 480
           ALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIV
Sbjct: 421 ALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIV 480

Query: 481 GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVG 540
           GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVG
Sbjct: 481 GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVG 540

Query: 541 TYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANL 600
           TYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANL
Sbjct: 541 TYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANL 600

Query: 601 IYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIT 660
           IYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIT
Sbjct: 601 IYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIT 660

Query: 661 VIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709
           VIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW
Sbjct: 661 VIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 708
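
The Identity count in an HSP summary should be recoverable from the alignment rows themselves by comparing the Query and Sbjct rows column by column. The sketch below does this for a single row pair; the names `aligned_segment` and `count_identities` are hypothetical helpers, and the row layout (label, start coordinate, residues, end coordinate) is assumed from the rows shown in this report.

```python
import re

# One alignment row: label, start coordinate, aligned residues, end coordinate.
ROW = re.compile(r"^(Query|Sbjct):\s*(\d+)\s+(\S+)\s+(\d+)\s*$")


def aligned_segment(row):
    """Return the residue string (gaps included) from a Query/Sbjct row."""
    m = ROW.match(row.strip())
    if m is None:
        raise ValueError("not an alignment row: %r" % row)
    return m.group(3)


def count_identities(query_row, sbjct_row):
    """Return (identical columns, aligned columns) for one row pair."""
    q = aligned_segment(query_row)
    s = aligned_segment(sbjct_row)
    if len(q) != len(s):
        raise ValueError("rows cover different numbers of columns")
    # A column counts as an identity when both residues match and it is not a gap.
    identical = sum(1 for a, b in zip(q, s) if a == b and a != "-")
    return identical, len(q)


print(count_identities(
    "Query: 661 VIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709",
    "Sbjct: 661 VIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 708",
))
```

Summing the per-row counts over every row pair of an HSP gives totals that can be compared with the Identity fraction in its summary line.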

BLAST of CsaV3_5G013110 vs. TrEMBL
Match: tr|A0A1S4E243|A0A1S4E243_CUCME (pentatricopeptide repeat-containing protein At5g39680 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496677 PE=4 SV=1)

HSP 1 Score: 1377.8 bits (3565), Expect = 0.0e+00
Identity = 671/708 (94.77%), Positives = 689/708 (97.32%), Query Frame = 0

Query: 1   MSILKLPITDIMPVKFTPFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITN 60
           MSILKLPI+DIMPVKFTPFLSRS+F ASPHQDPIKLLKVAADAKNL FGRTI AHLTITN
Sbjct: 1   MSILKLPISDIMPVKFTPFLSRSDFFASPHQDPIKLLKVAADAKNLIFGRTIQAHLTITN 60

Query: 61  HNYRDSKVNQLNSLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFE 120
           HNYRDSKVNQLNSLINLYVKC EVSIARK+FDSMPRRNVVSWS LMAGYMQNGNP EVFE
Sbjct: 61  HNYRDSKVNQLNSLINLYVKCGEVSIARKVFDSMPRRNVVSWSTLMAGYMQNGNPSEVFE 120

Query: 121 LFKKMVVKDNIFPNEYVIATAISSCDSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS 180
           LFKKMV+KDNI PN+YVIAT ISSC+SQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS
Sbjct: 121 LFKKMVLKDNILPNKYVIATVISSCNSQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYS 180

Query: 181 KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVT 240
           KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHM EAVDVLKLIIS+GIEWN+ATYVT
Sbjct: 181 KCSDVGAAIQILYTVPGNDIFCYNLVVNGLLQHTHMREAVDVLKLIISKGIEWNSATYVT 240

Query: 241 IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN 300
           IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN
Sbjct: 241 IFRLCASLKDITLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRN 300

Query: 301 VVSWTSIIAAYFQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHA 360
           VVSWTSI+AAYFQNEFFEEAL+LFSKMEID IPPNEYTMAVLFNSAAGLSALCLGDQLHA
Sbjct: 301 VVSWTSIMAAYFQNEFFEEALDLFSKMEIDRIPPNEYTMAVLFNSAAGLSALCLGDQLHA 360

Query: 361 RAEKSGLKGNVMVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKE 420
           RAEKSGLKGNVMVGNALIIMYFKSGDILAAQ VFSNMTCC+IITWNAIITGHSHHGLGKE
Sbjct: 361 RAEKSGLKGNVMVGNALIIMYFKSGDILAAQRVFSNMTCCDIITWNAIITGHSHHGLGKE 420

Query: 421 ALSMFQDMMATGERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIV 480
           ALSMFQDMM TGERPNYVTFIGVI ACAHLKLVDEGFYYFNHLMKQF IVPGLEHYTCIV
Sbjct: 421 ALSMFQDMMTTGERPNYVTFIGVISACAHLKLVDEGFYYFNHLMKQFGIVPGLEHYTCIV 480

Query: 481 GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVG 540
           GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKG++IAEYLLQLEPRDVG
Sbjct: 481 GLLSRSGRLDEAENFMRSHQINWDVVSWRTLLNACYVHKHYDKGKQIAEYLLQLEPRDVG 540

Query: 541 TYILLSNMHARVRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANL 600
           TYILLSNMHARVRRWD VVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHP+ANL
Sbjct: 541 TYILLSNMHARVRRWDRVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPDANL 600

Query: 601 IYENVKDLLSKIRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPIT 660
           IYENVK+LLSKIRPLGYVPDIDNVLHDIEDEQKV+NLSYHSEKLAVAYGLMKT SG PI 
Sbjct: 601 IYENVKNLLSKIRPLGYVPDIDNVLHDIEDEQKVNNLSYHSEKLAVAYGLMKTTSGTPIR 660

Query: 661 VIKNLRMCDDCHTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709
           VIKNLRMCDDCHTAIKLIS+VANRVI+VRD NRFHHFQNGCCSCGDYW
Sbjct: 661 VIKNLRMCDDCHTAIKLISQVANRVIIVRDVNRFHHFQNGCCSCGDYW 708

BLAST of CsaV3_5G013110 vs. TrEMBL
Match: tr|A0A2I4E991|A0A2I4E991_9ROSI (pentatricopeptide repeat-containing protein At5g39680-like OS=Juglans regia OX=51240 GN=LOC108987495 PE=4 SV=1)

HSP 1 Score: 941.0 bits (2431), Expect = 1.5e-270
Identity = 450/697 (64.56%), Positives = 551/697 (79.05%), Query Frame = 0

Query: 14  VKFTPFLSRSN-FLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITNHNYRDSKVNQLN 73
           + + PFL + N   +SP  DPIKLLK +A+ KNL+ G+ +HAHL  +    +   +   N
Sbjct: 16  LSYVPFLFKPNRGSSSPFADPIKLLKKSAETKNLRIGKLVHAHLITSTLASKTQDLFHAN 75

Query: 74  SLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIF 133
           SLIN Y KC ++ IAR+LFD M  RN+VSWSALMAGY+ NG  LEV  LFKKM   DN+ 
Sbjct: 76  SLINFYAKCGDIFIARQLFDQMSERNIVSWSALMAGYLHNGLALEVLVLFKKMFSVDNLR 135

Query: 134 PNEYVIATAISSCD-SQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQI 193
           PNEY+ A A++SC  S    EGKQCHGY LKSGLEFHQYVKNALI +YS+CSDV  A+ +
Sbjct: 136 PNEYIFAIALASCSVSGRIEEGKQCHGYVLKSGLEFHQYVKNALIHMYSRCSDVEGAMWV 195

Query: 194 LYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDI 253
           L T+PG D+  YN VVNGL++  ++ EA++V   + SE   W+N TY+++F L A LKD+
Sbjct: 196 LNTLPGYDVVSYNSVVNGLVELGYLKEALEVTGRMASECKAWDNVTYISLFGLSARLKDL 255

Query: 254 TLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAY 313
            LGKQVH QMLKSD+DCDV++ S+I+DMYGKCGN+L+ R  FD LQ RNVVSWT+++AAY
Sbjct: 256 NLGKQVHGQMLKSDLDCDVFVSSAIVDMYGKCGNILNARKVFDCLQDRNVVSWTALMAAY 315

Query: 314 FQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNV 373
           FQN  FEEALNLFS ME++   PNEYT AVL NS+A LSAL LGD LHAR +KSG KG+ 
Sbjct: 316 FQNGCFEEALNLFSVMEVEDFMPNEYTFAVLLNSSASLSALRLGDLLHARIKKSGFKGHT 375

Query: 374 MVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMAT 433
           +VGNALIIMY KSG+I  A  VFS +   + +TWNA+I+G+SHHGLGKEAL +FQDM+ +
Sbjct: 376 IVGNALIIMYSKSGNIKGANKVFSELIFRDSVTWNAMISGYSHHGLGKEALDLFQDMLTS 435

Query: 434 GERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDE 493
           G  PNYVTFIGV+ ACAHL +V EG YY NHLM++  I PGLEHYTCIVGLLSR+G LDE
Sbjct: 436 GVSPNYVTFIGVLSACAHLGMVQEGLYYLNHLMRKMGIEPGLEHYTCIVGLLSRAGLLDE 495

Query: 494 AENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHAR 553
           AENFMRS  + WDV++WRTL+NAC+VH+++  G++IAE ++ ++P DVGTYILLSNM+A+
Sbjct: 496 AENFMRSTPVKWDVIAWRTLVNACHVHRNFGLGKRIAESVMLMDPHDVGTYILLSNMYAK 555

Query: 554 VRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSK 613
            RRWD VV+IRKLMRERN+KKEPGVSWLEIRN+ HVF SED  HPE++ I+E V +LL+K
Sbjct: 556 ERRWDGVVKIRKLMRERNIKKEPGVSWLEIRNITHVFVSEDNTHPESSQIHEKVGELLAK 615

Query: 614 IRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDC 673
           I+PLGYVP+I  VLHD+EDEQK + LSYHSEKLA+AYGLMKTPS A I VIKNLRMCDDC
Sbjct: 616 IKPLGYVPNIAAVLHDVEDEQKENYLSYHSEKLAIAYGLMKTPSKASIRVIKNLRMCDDC 675

Query: 674 HTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709
           H A KLISKV NRVI+VRDANRFHHFQ+GCCSC DYW
Sbjct: 676 HIAAKLISKVTNRVIIVRDANRFHHFQDGCCSCADYW 712

BLAST of CsaV3_5G013110 vs. TrEMBL
Match: tr|A0A2I4GUC2|A0A2I4GUC2_9ROSI (LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g39680-like OS=Juglans regia OX=51240 GN=LOC109010973 PE=4 SV=1)

HSP 1 Score: 934.5 bits (2414), Expect = 1.4e-268
Identity = 448/697 (64.28%), Positives = 548/697 (78.62%), Query Frame = 0

Query: 14  VKFTPFLSRSN-FLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITNHNYRDSKVNQLN 73
           + + PFL + N   +SP  DPIKLLK +A+ KNL+ G+ +HAHL  +    +   V   N
Sbjct: 16  LSYVPFLFKPNRGSSSPFADPIKLLKKSAETKNLRIGKLVHAHLITSTQASKTQDVFHAN 75

Query: 74  SLINLYVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIF 133
           SLIN Y KC ++SIAR+LFD M  RN+VSWSALMAGY+ NG  LEV  LFK+M   DN+ 
Sbjct: 76  SLINFYAKCGDISIARQLFDQMSERNIVSWSALMAGYLHNGLALEVLVLFKQMFSVDNLR 135

Query: 134 PNEYVIATAISSCD-SQMYVEGKQCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQI 193
           PNEY+ A A++SC  S    EGKQCHGY LKSGLEFHQYVKNALI +YS+CSDV  AI +
Sbjct: 136 PNEYIFAIALASCSVSGRIEEGKQCHGYVLKSGLEFHQYVKNALIHMYSRCSDVEGAIWV 195

Query: 194 LYTVPGNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDI 253
           L T+PG D+  YN V+NGL++  ++ EA++    + SE   W+N TY+++F L A LKD+
Sbjct: 196 LNTLPGYDVVSYNSVINGLVELGYLKEALEATARMASECKAWDNVTYISLFGLSARLKDL 255

Query: 254 TLGKQVHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAY 313
            LGKQVH QMLKSD+DCDV++ S+I+DMYGKCGN+L+ R  FD LQ RNVVSWT+++AAY
Sbjct: 256 NLGKQVHGQMLKSDLDCDVFVSSAIVDMYGKCGNILNARKVFDCLQDRNVVSWTALMAAY 315

Query: 314 FQNEFFEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNV 373
           FQN  FEEALNLFS ME++   PNEYT AVL NS+A LSAL  GD LHAR +KSG KG+ 
Sbjct: 316 FQNGCFEEALNLFSVMEVEDFMPNEYTFAVLLNSSASLSALKTGDLLHARIKKSGFKGHT 375

Query: 374 MVGNALIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMAT 433
           +VGNALIIMY KSG+I  A  VFS++   + +TWNA+I+G+SHHGLGKEAL +F DM+ +
Sbjct: 376 IVGNALIIMYSKSGNIKGANKVFSDLIFRDSVTWNAMISGYSHHGLGKEALDLFHDMLTS 435

Query: 434 GERPNYVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDE 493
              PNYVTFIGV+ ACAHL LV EG YY NHLM++  I PGLEHYTCIVGLLSR+G LDE
Sbjct: 436 VVSPNYVTFIGVLSACAHLGLVQEGLYYLNHLMRKMGIEPGLEHYTCIVGLLSRAGLLDE 495

Query: 494 AENFMRSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHAR 553
           AENFMRS  + WDV++WRTL+NAC+V ++Y  G++IAE ++ ++P DVGTYILLSNM+A+
Sbjct: 496 AENFMRSTPVKWDVIAWRTLVNACHVLRNYGLGKRIAESVMLMDPHDVGTYILLSNMYAK 555

Query: 554 VRRWDHVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSK 613
             RWD VV+I KLMRERN+KKEPGVSWLEIRN+ HVF SED KHPE++ IYE V +LL+K
Sbjct: 556 ESRWDGVVKIWKLMRERNIKKEPGVSWLEIRNITHVFVSEDNKHPESSQIYEKVGELLAK 615

Query: 614 IRPLGYVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDC 673
           I+PLGYVP+I  VLHD+EDEQK + LSYHSEKLA+AYGLMKTPS A I VIKNLRMCDDC
Sbjct: 616 IKPLGYVPNIAAVLHDVEDEQKENYLSYHSEKLAIAYGLMKTPSEASIRVIKNLRMCDDC 675

Query: 674 HTAIKLISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709
           H+A KLISKV NRVI+VRDANRFHHFQ+GCCSC DYW
Sbjct: 676 HSAAKLISKVTNRVIIVRDANRFHHFQDGCCSCADYW 712

BLAST of CsaV3_5G013110 vs. TrEMBL
Match: tr|M5WY68|M5WY68_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G265900 PE=4 SV=1)

HSP 1 Score: 920.6 bits (2378), Expect = 2.1e-264
Identity = 435/692 (62.86%), Positives = 543/692 (78.47%), Query Frame = 0

Query: 18  PFLSRSNFLASPHQDPIKLLKVAADAKNLKFGRTIHAHLTITNHNYRDSKVNQLNSLINL 77
           PFL +   +    +DPIKLLK AAD KNL+ G+T+HAHL +++   +   +   NSLINL
Sbjct: 13  PFLFKPKVIPGSIEDPIKLLKKAADTKNLRLGKTVHAHLILSSETSKFLDIFHANSLINL 72

Query: 78  YVKCDEVSIARKLFDSMPRRNVVSWSALMAGYMQNGNPLEVFELFKKMVVKDNIFPNEYV 137
           Y KCD ++ AR LF+ MP+RNVVSW+ALMAGY+  G  LEV  LFK MV  DN+ PNE+V
Sbjct: 73  YAKCDRITTARHLFECMPKRNVVSWTALMAGYLHKGLTLEVLGLFKTMVSVDNLCPNEFV 132

Query: 138 IATAISSCDSQMYV-EGKQCHGYALKSGLEFHQYVKNALIQLYSKCSDVGAAIQILYTVP 197
            AT +SSC     V EGKQCHGY LKSGL  +QYVKNAL+ +YS CS+V AA+++L TVP
Sbjct: 133 FATVLSSCSGSGRVEEGKQCHGYVLKSGLLSYQYVKNALVHMYSSCSEVEAAMRVLNTVP 192

Query: 198 GNDIFCYNLVVNGLLQHTHMAEAVDVLKLIISEGIEWNNATYVTIFRLCASLKDITLGKQ 257
           G+DI  YN VVNGLL+H H+ EA+D+L ++I +   W+N TY+TIF +CA LKD+ LG Q
Sbjct: 193 GDDILSYNSVVNGLLEHGHVKEAMDILDMMIGQCKAWDNVTYITIFGVCAHLKDLRLGLQ 252

Query: 258 VHAQMLKSDIDCDVYIGSSIIDMYGKCGNVLSGRTFFDRLQSRNVVSWTSIIAAYFQNEF 317
           VH+QMLK+DIDCDV++ S++IDMYGKCG VL+    FD LQ+RN+VSWT+I+AAYFQN  
Sbjct: 253 VHSQMLKTDIDCDVFLSSAMIDMYGKCGKVLNALKVFDGLQTRNIVSWTAIMAAYFQNGC 312

Query: 318 FEEALNLFSKMEIDCIPPNEYTMAVLFNSAAGLSALCLGDQLHARAEKSGLKGNVMVGNA 377
           FEEAL L S+ME + I PNEYT AVL NS AGLSAL  GD LHA  EKSG K + +VGNA
Sbjct: 313 FEEALGLLSQMEFEDILPNEYTFAVLLNSCAGLSALRHGDLLHASVEKSGFKDHAIVGNA 372

Query: 378 LIIMYFKSGDILAAQSVFSNMTCCNIITWNAIITGHSHHGLGKEALSMFQDMMATGERPN 437
           L+ MY K G+I AA  VF +MT  + +TWNA+I+G SHHGLG EAL++FQDM+  GERPN
Sbjct: 373 LVNMYSKCGNIQAANDVFLDMTSRDAVTWNAMISGFSHHGLGNEALNVFQDMLEAGERPN 432

Query: 438 YVTFIGVILACAHLKLVDEGFYYFNHLMKQFRIVPGLEHYTCIVGLLSRSGRLDEAENFM 497
            +TF+GV+ ACAHL LV EGFYY N LMKQ  I PGLEH+TCIVGLLSR+G+LD+AE +M
Sbjct: 433 NITFVGVLSACAHLGLVQEGFYYLNQLMKQIGIEPGLEHHTCIVGLLSRAGQLDQAEKYM 492

Query: 498 RSHQINWDVVSWRTLLNACYVHKHYDKGRKIAEYLLQLEPRDVGTYILLSNMHARVRRWD 557
           R+  + WD+V+WR+LLNAC+VHK Y  G+++AE ++Q++P DVGTY LLSNM+A+  RWD
Sbjct: 493 RTMPVKWDIVAWRSLLNACHVHKSYGLGKRVAEVVVQMDPNDVGTYTLLSNMYAKANRWD 552

Query: 558 HVVEIRKLMRERNVKKEPGVSWLEIRNVAHVFTSEDIKHPEANLIYENVKDLLSKIRPLG 617
            VV+IRKLMRE+N+KKEPGVSW+EIRN  H+F S+D  HPE++ I+E V +LL+KI+ LG
Sbjct: 553 GVVQIRKLMREKNIKKEPGVSWVEIRNTTHIFVSDDNIHPESSQIHEKVGELLAKIKLLG 612

Query: 618 YVPDIDNVLHDIEDEQKVDNLSYHSEKLAVAYGLMKTPSGAPITVIKNLRMCDDCHTAIK 677
           YVPDI  VLHD++DEQK D LSYHSEKLA+AY LMKTP+  PI VIKNLR+CDDCH A+K
Sbjct: 613 YVPDIAAVLHDVDDEQKEDYLSYHSEKLAIAYALMKTPTEVPIRVIKNLRICDDCHAAVK 672

Query: 678 LISKVANRVIVVRDANRFHHFQNGCCSCGDYW 709
           LISKV NR+I+VRDANRFH FQ+G CSC DYW
Sbjct: 673 LISKVTNRLIIVRDANRFHQFQDGKCSCADYW 704
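
The percentages in the HSP summary above follow directly from the reported counts (435 identical and 543 positive positions over 692 aligned columns). A minimal sketch of that arithmetic, assuming the usual BLAST convention of dividing by the alignment length; the helper name `percent` is hypothetical and not part of any BLAST tool:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header for the M5WY68_PRUPE match.
identity = percent(435, 692)
positives = percent(543, 692)
print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
```

The same helper can be applied to the counts of any other HSP header on this page to check the reported values.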

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_011655117.1  0.0e+00   100.00    PREDICTED: pentatricopeptide repeat-containing protein At5g39680 [Cucumis sativu...
XP_008456851.1  0.0e+00   94.77     PREDICTED: pentatricopeptide repeat-containing protein At5g39680 isoform X2 [Cuc...
XP_023002421.1  0.0e+00   82.00     pentatricopeptide repeat-containing protein At5g39680 [Cucurbita maxima] >XP_023...
XP_022144415.1  0.0e+00   82.09     pentatricopeptide repeat-containing protein At5g39680 [Momordica charantia]
XP_022933883.1  0.0e+00   82.00     pentatricopeptide repeat-containing protein At5g39680 [Cucurbita moschata]
Match Name   E-value   Identity  Description
AT5G39680.1  2.3e-216  53.19     Pentatricopeptide repeat (PPR) superfamily protein
AT4G33170.1  4.2e-141  38.40     Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G13650.1  1.5e-135  35.78     Pentatricopeptide repeat (PPR) superfamily protein
AT5G09950.1  1.9e-133  37.04     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G11290.1  6.0e-132  36.65     Pentatricopeptide repeat (PPR) superfamily protein
Match Name             E-value   Identity  Description
sp|Q9FK93|PP406_ARATH  4.2e-215  53.19     Pentatricopeptide repeat-containing protein At5g39680 OS=Arabidopsis thaliana OX...
sp|Q9SMZ2|PP347_ARATH  7.5e-140  38.40     Pentatricopeptide repeat-containing protein At4g33170 OS=Arabidopsis thaliana OX...
sp|Q9SVP7|PP307_ARATH  2.8e-134  35.78     Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
sp|Q9FIB2|PP373_ARATH  3.4e-132  37.04     Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th...
sp|Q3E6Q1|PPR32_ARATH  1.1e-130  36.65     Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
Match Name                      E-value   Identity  Description
tr|A0A0A0KR26|A0A0A0KR26_CUCSA  0.0e+00   100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G292190 PE=4 SV=1
tr|A0A1S4E243|A0A1S4E243_CUCME  0.0e+00   94.77     pentatricopeptide repeat-containing protein At5g39680 isoform X2 OS=Cucumis melo...
tr|A0A2I4E991|A0A2I4E991_9ROSI  1.5e-270  64.56     pentatricopeptide repeat-containing protein At5g39680-like OS=Juglans regia OX=5...
tr|A0A2I4GUC2|A0A2I4GUC2_9ROSI  1.4e-268  64.28     LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At5g39680-like ...
tr|M5WY68|M5WY68_PRUPE          2.1e-264  62.86     Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_3G265900 PE=4 SV=1
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0008270  zinc ion binding
GO:0005515  protein binding
Vocabulary: INTERPRO
Term       Definition
IPR032867  DYW_dom
IPR011990  TPR-like_helical_dom_sf
IPR002885  Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0016787 hydrolase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CsaV3_5G013110.1  CsaV3_5G013110.1  mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term   IPR Description                                    Source   Source Term       Source Description   Alignment
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 201..234; e-value: 8.6E-4; score: 17.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 100..135; e-value: 5.2E-6; score: 24.3
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 302..336; e-value: 3.7E-5; score: 21.6
IPR002885  Pentatricopeptide repeat                           TIGRFAM  TIGR00756         TIGR00756            coord: 403..436; e-value: 8.5E-7; score: 26.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF01535           PPR                  coord: 201..231; e-value: 0.2; score: 11.9
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 97..143; e-value: 2.3E-7; score: 30.8
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 401..448; e-value: 3.6E-9; score: 36.6
IPR002885  Pentatricopeptide repeat                           PFAM     PF13041           PPR_2                coord: 299..345; e-value: 3.2E-10; score: 39.9
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 199..233; score: 8.495
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 504..534; score: 6.445
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 98..133; score: 9.788
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 67..97; score: 7.245
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 300..334; score: 10.479
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 436..471; score: 7.081
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 538..572; score: 8.199
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 401..435; score: 11.17
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 370..400; score: 5.963
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 234..268; score: 6.96
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 472..502; score: 5.897
IPR002885  Pentatricopeptide repeat                           PROSITE  PS51375           PPR                  coord: 269..299; score: 6.686
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 20..167; e-value: 1.6E-16; score: 62.2
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 352..451; e-value: 2.2E-17; score: 65.0
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 452..611; e-value: 1.5E-14; score: 56.3
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D   G3DSA:1.25.40.10  -                    coord: 210..351; e-value: 1.6E-26; score: 95.4
IPR032867  DYW domain                                         PFAM     PF14432           DYW_deaminase        coord: 574..698; e-value: 3.1E-41; score: 140.1
None       No IPR available                                   PANTHER  PTHR24015         FAMILY NOT NAMED     coord: 30..625
None       No IPR available                                   PANTHER  PTHR24015:SF382   SUBFAMILY NOT NAMED  coord: 30..625
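
The PROSITE PS51375 hits above are listed out of order, which makes it hard to see how the PPR motifs are laid out along the protein. A minimal sketch, assuming the `coord: start..end` strings are copied verbatim from the table and that coordinates are 1-based and inclusive; the names `RAW_HITS` and `parse_coords` are hypothetical:

```python
import re

# PROSITE PS51375 coordinates as they appear in the InterPro table.
RAW_HITS = [
    "coord: 199..233", "coord: 504..534", "coord: 98..133", "coord: 67..97",
    "coord: 300..334", "coord: 436..471", "coord: 538..572", "coord: 401..435",
    "coord: 370..400", "coord: 234..268", "coord: 472..502", "coord: 269..299",
]

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(lines):
    """Parse 'coord: a..b' strings into (start, end) tuples sorted by start."""
    hits = []
    for line in lines:
        match = COORD_RE.search(line)
        if match is None:
            raise ValueError(f"no coordinates found in {line!r}")
        hits.append((int(match.group(1)), int(match.group(2))))
    return sorted(hits)


hits = parse_coords(RAW_HITS)
# Inclusive coordinates, so a hit spanning a..b covers b - a + 1 residues.
lengths = [end - start + 1 for start, end in hits]

for (start, end), length in zip(hits, lengths):
    print(f"{start:>3}..{end:<3}  {length} aa")
print(f"{len(hits)} motifs, {sum(lengths)} residues covered")
```

Sorting by start position makes adjacent motifs and the gaps between them easy to read off.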

The following gene(s) are paralogous to this gene:

None