CsaV3_1G029170 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_1G029170
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein
Location: chr1 : 15971039 .. 15973723 (-)
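
As a rough illustration of how the Location field can be read programmatically, the sketch below splits it into chromosome, start, end and strand. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a 'chrom : start .. end (strand)' string into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("chr1 : 15971039 .. 15973723 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # chr1 - 2685
```

Under that assumption the feature spans 2685 bp on the minus strand of chr1.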
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCCCTCAAACTTTCCTTCTCTTTACATTCTTTTGATTCCAATAAGTTCGATTTTCCTCTCAATTCACCTCTGCTCTCTGATTATTGCTCTCTTTTCTCCATCAATGCTCATCTTCATCTCAATAAGTCCTCCATAATTTACTCTCTGGCTAGGGTTCACAAGCCCTCTAAAGTTTCTCAGGTAGAACAGGACGCGTCCGACGTTTCCCAATCCAGATTTGATGAAATTGTCGCCAGGAAAAAGTATTTTACCTCTAAGAAGCCTTCAAAGAGAGCAGCAGGTTCGCATTTTAGTTTCAGTAGGAATTGTAATGACAATATTCTTTTTAATGGTGGTGAATTGGATGTCAATTACTCAACTATATCCTCTGATTTGAGCTTAGAGGATTGCAATGCTATTTTGAAAAGGCTAGAGAAGTGTAACGATTCCAAAACACTGGGTTTCTTTGAGTGGATGAGAAGTAATGGGAAATTAAAACACAATGTGAGTGCTTATAATTTGGTTCTTCGAGTGTTGGGTAGGCAAGAAGATTGGGATGCTGCCGAGAAGCTAATTGAGGAAGTTAGAGCTGAGTTGGGTTCTCAATTGGATTTTCAGGTTTTTAACACTCTTATCTATGCTTGTTATAAATCGAGGTTTGTGGAGCAGGGTACGAAATGGTTTCGAATGATGTTGGAATGCCAAGTGCAGCCCAATGTCGCAACATTTGGAATGCTTATGGGTCTCTATCAGAAGAAGTGTGATATTAAGGAATCGGAGTTTGCCTTTAATCAGATGAGAAACTTTGGTATTGTTTGTGAAACAGCATATGCATCTATGATTACTATATACATACGTATGAATTTATACGATAAAGCAGAAGAGGTGATTCAATTAATGCAAGAAGATAAAGTGATTCCTAATCTAGAGAACTGGGTAGTAATGCTTAATGCTTATTGTCAGCAAGGCAAAATGGAGGAAGCTGAACTTGTATTTGCCTCAATGGAAGAAGCTGGGTTTTCATCCAATATCATTGCATATAATACCTTGATTACTGGGTATGGGAAGGCATCAAATATGGATACTGCTCAACGCCTGTTCTTGGGCATCAAGAACTCTGGAGTAGAACCTGATGAAACGACTTACCGCTCCATGATTGAAGGTTGGGGTCGAGCTGGTAATTATAAAATGGCAGAATGGTATTATAAGGAGCTCAAACGGAGAGGATATATGCCGAATTCCTCTAACTTGTTCACCCTCATAAATCTACAAGCCAAACATGAGGATGAAGCAGGTACACTTAAAACTCTAAATGATATGCTAAAGATTGGATGCCGGCCTTCTTCCATTGTTGGAAATGTTTTGCAAGCATATGAAAAGGCTAGAAGAATGAAAAGTGTGCCTGTCCTCTTGACAGGGTCGTTCTATCGGAAAGTTCTGAGCAGCCAGACATCTTGCTCAATTCTGGTAATGGCTTATGTGAAGCACTGTTTAGTGGATGACGCTTTAAAAGTGTTGAGGGAAAAGGAGTGGAAAGATCATCATTTTGAGGAGAATTTGTATCATTTGCTAATTTGTTCATGTAAAGAGTTGGGCCATCTCGAGAATGCAATTAAGATATACACACAACTGCCTAAACGTGAAAACAAACCGAACTTGCATATCACATGCACAATGATTGATATCTACAGCATCATGGGTAGGTTCTCTGACGGGGAGAAGCTTTATCTAAGCCTGAGATCATCAGGCATTCCTTTGGATTTGATTGCTTATAATGTTGTTGTGAGAATGTATGTTAAAGCTGGATCATTGGAAGATGCATGCTCAGTTCTTGACTTGATGGCTGAGCAGCAGGACATTGTTCCGGACATATATCTGTTACGGGACATGCTTCGTATTTACCAACGATGTGGCATGGTGCATAAGCTAGCAGATCTGTACTATAGGATATTGAAGAGTGGAGTGTCTTGGGATCAGGAAATGTATAATTGTGTCATAAATTGCTGTTCCCGTGC
TCTGCCTGTTGATGAGCTTTCTAGGCTTTTTGATGAAATGCTTCAATGTGGTTTTGCCCCAAATACAGTGACCTTGAATGTCATGCTTGACGTTTATGGGAAATCCAAGCTTTTCACTAAGGCTAGAAATCTCTTTGGGCTTGCTCAGAAAAGAGGTTTGGTTGATGCAATCTCTTATAATACTATGATATCTGTCTATGGGAAGAATAAGGACTTCAAAAACATGTCATCTACGGTTCAGAAAATGAAATTTAACGGGTTTTCAGTTTCCCTTGAAGCCTACAATTGTATGTTGGATGCTTATGGCAAAGAATGCCAAATGGAGAATTTCAGAAGTGTCTTGCAGCGAATGCAGGAGACAAGTTCTGAATGTGACCATTATACGTACAACATCATGATCAACATCTATGGAGAACAAGGATGGATAGATGAAGTTGCGGAAGTTCTGACAGAATTGAAAGCATGTGGACTTGAACCCGATCTGTACAGCTACAACACATTGATCAAGGCATATGGAATAGCAGGGATGGTTGAAGAAGCTGCTCAGTTGGTGAAAGAAATGAGAGAAAAGAGGATAGAACCAGATAGGATAACTTATATTAACATGATTAGAGCACTGCAAAGAAACGATCAATTTTTAGAGGCAGTCAAGTGGTCATTGTGGATGAAGCAGATGAAATATTGA

mRNA sequence

ATGGCTTCCCTCAAACTTTCCTTCTCTTTACATTCTTTTGATTCCAATAAGTTCGATTTTCCTCTCAATTCACCTCTGCTCTCTGATTATTGCTCTCTTTTCTCCATCAATGCTCATCTTCATCTCAATAAGTCCTCCATAATTTACTCTCTGGCTAGGGTTCACAAGCCCTCTAAAGTTTCTCAGGTAGAACAGGACGCGTCCGACGTTTCCCAATCCAGATTTGATGAAATTGTCGCCAGGAAAAAGTATTTTACCTCTAAGAAGCCTTCAAAGAGAGCAGCAGGTTCGCATTTTAGTTTCAGTAGGAATTGTAATGACAATATTCTTTTTAATGGTGGTGAATTGGATGTCAATTACTCAACTATATCCTCTGATTTGAGCTTAGAGGATTGCAATGCTATTTTGAAAAGGCTAGAGAAGTGTAACGATTCCAAAACACTGGGTTTCTTTGAGTGGATGAGAAGTAATGGGAAATTAAAACACAATGTGAGTGCTTATAATTTGGTTCTTCGAGTGTTGGGTAGGCAAGAAGATTGGGATGCTGCCGAGAAGCTAATTGAGGAAGTTAGAGCTGAGTTGGGTTCTCAATTGGATTTTCAGGTTTTTAACACTCTTATCTATGCTTGTTATAAATCGAGGTTTGTGGAGCAGGGTACGAAATGGTTTCGAATGATGTTGGAATGCCAAGTGCAGCCCAATGTCGCAACATTTGGAATGCTTATGGGTCTCTATCAGAAGAAGTGTGATATTAAGGAATCGGAGTTTGCCTTTAATCAGATGAGAAACTTTGGTATTGTTTGTGAAACAGCATATGCATCTATGATTACTATATACATACGTATGAATTTATACGATAAAGCAGAAGAGGTGATTCAATTAATGCAAGAAGATAAAGTGATTCCTAATCTAGAGAACTGGGTAGTAATGCTTAATGCTTATTGTCAGCAAGGCAAAATGGAGGAAGCTGAACTTGTATTTGCCTCAATGGAAGAAGCTGGGTTTTCATCCAATATCATTGCATATAATACCTTGATTACTGGGTATGGGAAGGCATCAAATATGGATACTGCTCAACGCCTGTTCTTGGGCATCAAGAACTCTGGAGTAGAACCTGATGAAACGACTTACCGCTCCATGATTGAAGGTTGGGGTCGAGCTGGTAATTATAAAATGGCAGAATGGTATTATAAGGAGCTCAAACGGAGAGGATATATGCCGAATTCCTCTAACTTGTTCACCCTCATAAATCTACAAGCCAAACATGAGGATGAAGCAGGTACACTTAAAACTCTAAATGATATGCTAAAGATTGGATGCCGGCCTTCTTCCATTGTTGGAAATGTTTTGCAAGCATATGAAAAGGCTAGAAGAATGAAAAGTGTGCCTGTCCTCTTGACAGGGTCGTTCTATCGGAAAGTTCTGAGCAGCCAGACATCTTGCTCAATTCTGGTAATGGCTTATGTGAAGCACTGTTTAGTGGATGACGCTTTAAAAGTGTTGAGGGAAAAGGAGTGGAAAGATCATCATTTTGAGGAGAATTTGTATCATTTGCTAATTTGTTCATGTAAAGAGTTGGGCCATCTCGAGAATGCAATTAAGATATACACACAACTGCCTAAACGTGAAAACAAACCGAACTTGCATATCACATGCACAATGATTGATATCTACAGCATCATGGGTAGGTTCTCTGACGGGGAGAAGCTTTATCTAAGCCTGAGATCATCAGGCATTCCTTTGGATTTGATTGCTTATAATGTTGTTGTGAGAATGTATGTTAAAGCTGGATCATTGGAAGATGCATGCTCAGTTCTTGACTTGATGGCTGAGCAGCAGGACATTGTTCCGGACATATATCTGTTACGGGACATGCTTCGTATTTACCAACGATGTGGCATGGTGCATAAGCTAGCAGATCTGTACTATAGGATATTGAAGAGTGGAGTGTCTTGGGATCAGGAAATGTATAATTGTGTCATAAATTGCTGTTCCCGTGC
TCTGCCTGTTGATGAGCTTTCTAGGCTTTTTGATGAAATGCTTCAATGTGGTTTTGCCCCAAATACAGTGACCTTGAATGTCATGCTTGACGTTTATGGGAAATCCAAGCTTTTCACTAAGGCTAGAAATCTCTTTGGGCTTGCTCAGAAAAGAGGTTTGGTTGATGCAATCTCTTATAATACTATGATATCTGTCTATGGGAAGAATAAGGACTTCAAAAACATGTCATCTACGGTTCAGAAAATGAAATTTAACGGGTTTTCAGTTTCCCTTGAAGCCTACAATTGTATGTTGGATGCTTATGGCAAAGAATGCCAAATGGAGAATTTCAGAAGTGTCTTGCAGCGAATGCAGGAGACAAGTTCTGAATGTGACCATTATACGTACAACATCATGATCAACATCTATGGAGAACAAGGATGGATAGATGAAGTTGCGGAAGTTCTGACAGAATTGAAAGCATGTGGACTTGAACCCGATCTGTACAGCTACAACACATTGATCAAGGCATATGGAATAGCAGGGATGGTTGAAGAAGCTGCTCAGTTGGTGAAAGAAATGAGAGAAAAGAGGATAGAACCAGATAGGATAACTTATATTAACATGATTAGAGCACTGCAAAGAAACGATCAATTTTTAGAGGCAGTCAAGTGGTCATTGTGGATGAAGCAGATGAAATATTGA

Coding sequence (CDS)

ATGGCTTCCCTCAAACTTTCCTTCTCTTTACATTCTTTTGATTCCAATAAGTTCGATTTTCCTCTCAATTCACCTCTGCTCTCTGATTATTGCTCTCTTTTCTCCATCAATGCTCATCTTCATCTCAATAAGTCCTCCATAATTTACTCTCTGGCTAGGGTTCACAAGCCCTCTAAAGTTTCTCAGGTAGAACAGGACGCGTCCGACGTTTCCCAATCCAGATTTGATGAAATTGTCGCCAGGAAAAAGTATTTTACCTCTAAGAAGCCTTCAAAGAGAGCAGCAGGTTCGCATTTTAGTTTCAGTAGGAATTGTAATGACAATATTCTTTTTAATGGTGGTGAATTGGATGTCAATTACTCAACTATATCCTCTGATTTGAGCTTAGAGGATTGCAATGCTATTTTGAAAAGGCTAGAGAAGTGTAACGATTCCAAAACACTGGGTTTCTTTGAGTGGATGAGAAGTAATGGGAAATTAAAACACAATGTGAGTGCTTATAATTTGGTTCTTCGAGTGTTGGGTAGGCAAGAAGATTGGGATGCTGCCGAGAAGCTAATTGAGGAAGTTAGAGCTGAGTTGGGTTCTCAATTGGATTTTCAGGTTTTTAACACTCTTATCTATGCTTGTTATAAATCGAGGTTTGTGGAGCAGGGTACGAAATGGTTTCGAATGATGTTGGAATGCCAAGTGCAGCCCAATGTCGCAACATTTGGAATGCTTATGGGTCTCTATCAGAAGAAGTGTGATATTAAGGAATCGGAGTTTGCCTTTAATCAGATGAGAAACTTTGGTATTGTTTGTGAAACAGCATATGCATCTATGATTACTATATACATACGTATGAATTTATACGATAAAGCAGAAGAGGTGATTCAATTAATGCAAGAAGATAAAGTGATTCCTAATCTAGAGAACTGGGTAGTAATGCTTAATGCTTATTGTCAGCAAGGCAAAATGGAGGAAGCTGAACTTGTATTTGCCTCAATGGAAGAAGCTGGGTTTTCATCCAATATCATTGCATATAATACCTTGATTACTGGGTATGGGAAGGCATCAAATATGGATACTGCTCAACGCCTGTTCTTGGGCATCAAGAACTCTGGAGTAGAACCTGATGAAACGACTTACCGCTCCATGATTGAAGGTTGGGGTCGAGCTGGTAATTATAAAATGGCAGAATGGTATTATAAGGAGCTCAAACGGAGAGGATATATGCCGAATTCCTCTAACTTGTTCACCCTCATAAATCTACAAGCCAAACATGAGGATGAAGCAGGTACACTTAAAACTCTAAATGATATGCTAAAGATTGGATGCCGGCCTTCTTCCATTGTTGGAAATGTTTTGCAAGCATATGAAAAGGCTAGAAGAATGAAAAGTGTGCCTGTCCTCTTGACAGGGTCGTTCTATCGGAAAGTTCTGAGCAGCCAGACATCTTGCTCAATTCTGGTAATGGCTTATGTGAAGCACTGTTTAGTGGATGACGCTTTAAAAGTGTTGAGGGAAAAGGAGTGGAAAGATCATCATTTTGAGGAGAATTTGTATCATTTGCTAATTTGTTCATGTAAAGAGTTGGGCCATCTCGAGAATGCAATTAAGATATACACACAACTGCCTAAACGTGAAAACAAACCGAACTTGCATATCACATGCACAATGATTGATATCTACAGCATCATGGGTAGGTTCTCTGACGGGGAGAAGCTTTATCTAAGCCTGAGATCATCAGGCATTCCTTTGGATTTGATTGCTTATAATGTTGTTGTGAGAATGTATGTTAAAGCTGGATCATTGGAAGATGCATGCTCAGTTCTTGACTTGATGGCTGAGCAGCAGGACATTGTTCCGGACATATATCTGTTACGGGACATGCTTCGTATTTACCAACGATGTGGCATGGTGCATAAGCTAGCAGATCTGTACTATAGGATATTGAAGAGTGGAGTGTCTTGGGATCAGGAAATGTATAATTGTGTCATAAATTGCTGTTCCCGTGC
TCTGCCTGTTGATGAGCTTTCTAGGCTTTTTGATGAAATGCTTCAATGTGGTTTTGCCCCAAATACAGTGACCTTGAATGTCATGCTTGACGTTTATGGGAAATCCAAGCTTTTCACTAAGGCTAGAAATCTCTTTGGGCTTGCTCAGAAAAGAGGTTTGGTTGATGCAATCTCTTATAATACTATGATATCTGTCTATGGGAAGAATAAGGACTTCAAAAACATGTCATCTACGGTTCAGAAAATGAAATTTAACGGGTTTTCAGTTTCCCTTGAAGCCTACAATTGTATGTTGGATGCTTATGGCAAAGAATGCCAAATGGAGAATTTCAGAAGTGTCTTGCAGCGAATGCAGGAGACAAGTTCTGAATGTGACCATTATACGTACAACATCATGATCAACATCTATGGAGAACAAGGATGGATAGATGAAGTTGCGGAAGTTCTGACAGAATTGAAAGCATGTGGACTTGAACCCGATCTGTACAGCTACAACACATTGATCAAGGCATATGGAATAGCAGGGATGGTTGAAGAAGCTGCTCAGTTGGTGAAAGAAATGAGAGAAAAGAGGATAGAACCAGATAGGATAACTTATATTAACATGATTAGAGCACTGCAAAGAAACGATCAATTTTTAGAGGCAGTCAAGTGGTCATTGTGGATGAAGCAGATGAAATATTGA

Protein sequence

MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKVSQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCNDNILFNGGELDVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQEDKVIPNLENWVVMLNAYCQQGKMEEAELVFASMEEAGFSSNIIAYNTLITGYGKASNMDTAQRLFLGIKNSGVEPDETTYRSMIEGWGRAGNYKMAEWYYKELKRRGYMPNSSNLFTLINLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIYTQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKAGSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEMYNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQKRGLVDAISYNTMISVYGKNKDFKNMSSTVQKMKFNGFSVSLEAYNCMLDAYGKECQMENFRSVLQRMQETSSECDHYTYNIMINIYGEQGWIDEVAEVLTELKACGLEPDLYSYNTLIKAYGIAGMVEEAAQLVKEMREKRIEPDRITYINMIRALQRNDQFLEAVKWSLWMKQMKY
BLAST of CsaV3_1G029170 vs. NCBI nr
Match: XP_004146719.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucumis sativus] >XP_011655645.1 PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucumis sativus] >KGN65183.1 hypothetical protein Csa_1G257890 [Cucumis sativus])

HSP 1 Score: 1219.5 bits (3154), Expect = 0.0e+00
Identity = 757/757 (100.00%), Positives = 757/757 (100.00%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60
           MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV
Sbjct: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60

Query: 61  SQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCNDNILFNGGELDVNY 120
           SQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCNDNILFNGGELDVNY
Sbjct: 61  SQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCNDNILFNGGELDVNY 120

Query: 121 STISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDW 180
           STISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDW
Sbjct: 121 STISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDW 180

Query: 181 DAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGM 240
           DAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGM
Sbjct: 181 DAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGM 240

Query: 241 LMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQEDKV 300
           LMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQEDKV
Sbjct: 241 LMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQEDKV 300

Query: 301 IPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           IPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 IPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLINLQA 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLINLQA
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLINLQA 420

Query: 421 KHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLSSQTS 480
           KHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLSSQTS
Sbjct: 421 KHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLSSQTS 480

Query: 481 CSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIYTQLP 540
           CSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIYTQLP
Sbjct: 481 CSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIYTQLP 540

Query: 541 KRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKAGSLE 600
           KRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKAGSLE
Sbjct: 541 KRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKAGSLE 600

Query: 601 DACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEMYNCV 660
           DACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEMYNCV
Sbjct: 601 DACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEMYNCV 660

Query: 661 INCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQKRGL 720
           INCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQKRGL
Sbjct: 661 INCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQKRGL 720

Query: 721 VDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSVS 758
           VDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSVS
Sbjct: 721 VDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSVS 757
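
The identity and positive percentages in each HSP header are simply the counts divided by the alignment length. A minimal sketch of recomputing them from a header line follows; the names `HSP_RE` and `parse_hsp_stats` are hypothetical, and the pattern is deliberately lenient about the spelling of the second label.

```python
import re

# Matches e.g. "Identity = 719/761 (94.48%), Positives = 739/761 (97.11%)"
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)

def parse_hsp_stats(line):
    """Return recomputed identity/positive percentages for one HSP header."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, pos, pos_len = (int(g) for g in m.groups())
    return {
        "aligned_length": ident_len,
        "identity_pct": round(100 * ident / ident_len, 2),
        "positive_pct": round(100 * pos / pos_len, 2),
    }

stats = parse_hsp_stats(
    "Identity = 719/761 (94.48%), Positives = 739/761 (97.11%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])  # 94.48 97.11
```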

BLAST of CsaV3_1G029170 vs. NCBI nr
Match: XP_016899838.1 (PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucumis melo])

HSP 1 Score: 1149.0 bits (2971), Expect = 0.0e+00
Identity = 719/761 (94.48%), Positives = 739/761 (97.11%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60
           MASLKLSFSLHSFDSNKFDFP+NSP LSDYCSLFSIN ++HLNKS I+YSLARVHKPSKV
Sbjct: 1   MASLKLSFSLHSFDSNKFDFPVNSPPLSDYCSLFSINGYIHLNKSCILYSLARVHKPSKV 60

Query: 61  SQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCN----DNILFNGGEL 120
           SQVE +ASDVSQSRFD+I +RKKYFT+KKPSKRAAGSHFSFSRNC+    +NILF+GGEL
Sbjct: 61  SQVEPEASDVSQSRFDDIDSRKKYFTAKKPSKRAAGSHFSFSRNCSEKIFENILFSGGEL 120

Query: 121 DVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGR 180
           DVNYSTISSDLSLE CNAILKRLEKCNDSKTL FFEWMRSNGKLKHNVSAYNLVLRVLGR
Sbjct: 121 DVNYSTISSDLSLEGCNAILKRLEKCNDSKTLDFFEWMRSNGKLKHNVSAYNLVLRVLGR 180

Query: 181 QEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVA 240
           QEDWDAAEKLI+EVRAELGSQLDFQVFNTLIYACYKS FVE GTKWFRMMLECQVQPNVA
Sbjct: 181 QEDWDAAEKLIKEVRAELGSQLDFQVFNTLIYACYKSGFVEWGTKWFRMMLECQVQPNVA 240

Query: 241 TFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQ 300
           TFGMLMGLYQK CDI+ESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQ
Sbjct: 241 TFGMLMGLYQKSCDIEESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQ 300

Query: 301 EDKVIPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           +DKVIPNLENW+XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 KDKVIPNLENWLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLI 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX I
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXI 420

Query: 421 NLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLS 480
           NLQAKHEDEAG LKTLNDMLKIGCRPSSIVGNVLQAYEKARR+KSVPVLLTGSFYRKVLS
Sbjct: 421 NLQAKHEDEAGALKTLNDMLKIGCRPSSIVGNVLQAYEKARRIKSVPVLLTGSFYRKVLS 480

Query: 481 SQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIY 540
           SQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGH E+AIKIY
Sbjct: 481 SQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHFESAIKIY 540

Query: 541 TQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKA 600
            Q PKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKA
Sbjct: 541 AQRPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKA 600

Query: 601 GSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEM 660
           GSLEDACSVLDLMAEQQDIVPD+YLLRDMLRIYQRCGMVHKL+DLYYRILKSGVSWDQEM
Sbjct: 601 GSLEDACSVLDLMAEQQDIVPDVYLLRDMLRIYQRCGMVHKLSDLYYRILKSGVSWDQEM 660

Query: 661 YNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQ 720
           YNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLF KARNLFG AQ
Sbjct: 661 YNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFAKARNLFGFAQ 720

Query: 721 KRGLVDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSVS 758
           KRGLVD XXXXXXXXXXXXXXXXXNMSSTVQ+MKFNGFSVS
Sbjct: 721 KRGLVDAXXXXXXXXXXXXXXXXXNMSSTVQQMKFNGFSVS 761

BLAST of CsaV3_1G029170 vs. NCBI nr
Match: XP_022135004.1 (pentatricopeptide repeat-containing protein At4g30825, chloroplastic isoform X1 [Momordica charantia] >XP_022135005.1 pentatricopeptide repeat-containing protein At4g30825, chloroplastic isoform X2 [Momordica charantia] >XP_022135006.1 pentatricopeptide repeat-containing protein At4g30825, chloroplastic isoform X3 [Momordica charantia])

HSP 1 Score: 1055.4 bits (2728), Expect = 1.1e-304
Identity = 665/762 (87.27%), Positives = 710/762 (93.18%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60
           MASLK+SF L SFDS KFDFP+ S LLSD CS+FSI  ++HLNKS I+YSLARVHKPSKV
Sbjct: 1   MASLKISFPLDSFDSKKFDFPVKSALLSDICSVFSITGYIHLNKSCILYSLARVHKPSKV 60

Query: 61  SQVEQDASDVSQSRF--DEIVARKKYFTSKKPSKRAAGSHFSFSRNCN----DNILFNGG 120
           SQVE +ASD+ QS+F  DEI ARKKY  +KKPSKRA GS+FSFSRNC+    DNI+FNGG
Sbjct: 61  SQVEPEASDIYQSKFVDDEIGARKKYVGNKKPSKRAPGSYFSFSRNCSEKVFDNIIFNGG 120

Query: 121 ELDVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVL 180
           E+DVNYSTISSDLSLEDCNAIL++LEKCND KTL FFEWMR NGKL+HNV+AYNLVLRVL
Sbjct: 121 EMDVNYSTISSDLSLEDCNAILRKLEKCNDGKTLVFFEWMRRNGKLEHNVTAYNLVLRVL 180

Query: 181 GRQEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPN 240
           GRQEDWDAAEKLI +VRA+LGSQLDFQ+FNTLIYACYKS  V++G KWFRMMLEC+VQPN
Sbjct: 181 GRQEDWDAAEKLIRQVRADLGSQLDFQIFNTLIYACYKSGLVDRGAKWFRMMLECRVQPN 240

Query: 241 VATFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQL 300
           VATFGMLMGL QK C+++E+EFAF+QMR+FGIVCE  YASMITIY R++LYDKAEEVIQL
Sbjct: 241 VATFGMLMGLCQKGCNVEEAEFAFSQMRSFGIVCEAMYASMITIYARLSLYDKAEEVIQL 300

Query: 301 MQEDKVIPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           MQEDKV PNLENW+ XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 MQEDKVTPNLENWLVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXT 420

Query: 421 LINLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKV 480
           LINLQAKHEDEAG L+TL+DMLKIGCRPSSIVGNVLQAYEKARR+KSVP+LLTGSFY KV
Sbjct: 421 LINLQAKHEDEAGALETLDDMLKIGCRPSSIVGNVLQAYEKARRIKSVPLLLTGSFYCKV 480

Query: 481 LSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIK 540
           LSSQTSCSILVMAY+KHCLVDDALK+LREKEW DH+FEENLYHLLICSCKELG LENAIK
Sbjct: 481 LSSQTSCSILVMAYMKHCLVDDALKILREKEWNDHNFEENLYHLLICSCKELGQLENAIK 540

Query: 541 IYTQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYV 600
           IYTQLPKRENKPNLHITCTMIDIYSIMG+FS+GEKLYLSLRSS IPLDLIA+NVVVRMYV
Sbjct: 541 IYTQLPKRENKPNLHITCTMIDIYSIMGKFSEGEKLYLSLRSSDIPLDLIAFNVVVRMYV 600

Query: 601 KAGSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQ 660
           KAGSLEDAC VLDLM +QQDIVPD+YLLRDMLRIYQRCGMV KLADLYYRILKSGVSWDQ
Sbjct: 601 KAGSLEDACLVLDLMDQQQDIVPDVYLLRDMLRIYQRCGMVDKLADLYYRILKSGVSWDQ 660

Query: 661 EMYNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGL 720
           EMYNCVINCCSRALPVDELSRLFDEML  GFAPNTVTLNVMLDVYGKSKLFTKARNLFGL
Sbjct: 661 EMYNCVINCCSRALPVDELSRLFDEMLHRGFAPNTVTLNVMLDVYGKSKLFTKARNLFGL 720

Query: 721 AQKRGLVDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSV 757
           AQKRGLVD   XXXXXXXXXXXX   NMSSTVQKMKFNGFSV
Sbjct: 721 AQKRGLVDVISXXXXXXXXXXXXDFKNMSSTVQKMKFNGFSV 762

BLAST of CsaV3_1G029170 vs. NCBI nr
Match: XP_023516176.1 (pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 993.0 bits (2566), Expect = 6.4e-286
Identity = 646/761 (84.89%), Positives = 689/761 (90.54%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60
           MASLKLSFSL SF S KFDFP+NS LLSD CS+FSI  ++HLNKS ++YSL R HKPSKV
Sbjct: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSCVLYSLVRAHKPSKV 60

Query: 61  SQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCN----DNILFNGGEL 120
            + E      S+   DEI  RKKYF  KKPSKRA GS+FSFS+NC+    D+I+F+GGEL
Sbjct: 61  -EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGEL 120

Query: 121 DVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGR 180
           DVNYSTISSDLSLEDCNAILKRLEKCND K LGFFEWMR N KL+HNVSAYNL+LRVLGR
Sbjct: 121 DVNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGR 180

Query: 181 QEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVA 240
           Q+DWDAA+KLI EVRAEL  QLDFQVFNTLIYACYKS  VEQG KWF+MMLE QV PNVA
Sbjct: 181 QQDWDAADKLIREVRAELSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVA 240

Query: 241 TFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQ 300
           TFGMLMGLYQK C++KE+EFAFNQMRNFGIVCETAYASMITIY R++LYDKAEEVI+LMQ
Sbjct: 241 TFGMLMGLYQKSCNLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQ 300

Query: 301 EDKVIPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           EDKVIPN+ENW+ XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 EDKVIPNVENWLVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLI 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   L+
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFTLM 420

Query: 421 NLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLS 480
           NLQAKHED+AG LKTLNDMLKIGCR SSIVGNVLQAYEKARR+KSVP+LLTGSFYRKVL+
Sbjct: 421 NLQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLA 480

Query: 481 SQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIY 540
           SQTSCSILVMAYVKH LVDDALKVLREKEW D  FEENLYHLLICSCKEL HLENAIKIY
Sbjct: 481 SQTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIY 540

Query: 541 TQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKA 600
           TQLPKR+NKPNLHIT TMIDIYSIMGRFSDGEKLYLSL+SSGI LDLIA++VVVRMYVKA
Sbjct: 541 TQLPKRKNKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKA 600

Query: 601 GSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEM 660
           GSLEDACSVLD M +QQDIVPDIYL RDMLRIYQRCGMV KL D+YYRIL S VSWDQEM
Sbjct: 601 GSLEDACSVLDFMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEM 660

Query: 661 YNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQ 720
           YNCVINCCSRAL VDELS LFDEMLQ GFAPNTVTLNVMLDVYGKSKLF+KAR L  LAQ
Sbjct: 661 YNCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKLFSKARKLLLLAQ 720

Query: 721 KRGLVDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSVS 758
           K+GLVD XXXXXXXXXXXXXXXXXNMSSTV+ M+FNGFS+S
Sbjct: 721 KKGLVDVXXXXXXXXXXXXXXXXXNMSSTVRTMEFNGFSLS 760

BLAST of CsaV3_1G029170 vs. NCBI nr
Match: XP_022922044.1 (pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucurbita moschata])

HSP 1 Score: 984.9 bits (2545), Expect = 1.7e-283
Identity = 645/761 (84.76%), Positives = 685/761 (90.01%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60
           MASLKLSFSL SF S KFDFP+NS LLSD CS+FSI  ++HLNKS ++YSL R HKPSKV
Sbjct: 1   MASLKLSFSLDSFHSKKFDFPVNSSLLSDCCSVFSITGYIHLNKSFVLYSLVRAHKPSKV 60

Query: 61  SQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCN----DNILFNGGEL 120
            + E      S+   DEI  RKKYF  KKPSKRA GS+FSFS+NC+    D+I+F+GGEL
Sbjct: 61  -EPETSGGYESKCAVDEIDTRKKYFGGKKPSKRAPGSYFSFSKNCSEKVFDSIVFHGGEL 120

Query: 121 DVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGR 180
           DVNYSTISSDLSLEDCNAILKRLEKCND K LGFFEWMR N KL+HNVSAYNL+LRVLGR
Sbjct: 121 DVNYSTISSDLSLEDCNAILKRLEKCNDRKALGFFEWMRINRKLEHNVSAYNLILRVLGR 180

Query: 181 QEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVA 240
           Q+DWDAAEKLI EVRAE   QLDFQVFNTLIYACYKS  VEQG KWF+MMLE QV PNVA
Sbjct: 181 QQDWDAAEKLIREVRAESSDQLDFQVFNTLIYACYKSGLVEQGAKWFQMMLEWQVLPNVA 240

Query: 241 TFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQ 300
           TFGMLMGLYQK C +KE+EFAFNQMRNFGIVCETAYASMITIY R++LYDKAEEVI+LMQ
Sbjct: 241 TFGMLMGLYQKSCSLKEAEFAFNQMRNFGIVCETAYASMITIYTRLSLYDKAEEVIRLMQ 300

Query: 301 EDKVIPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           EDKV PN+ENW+ XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 EDKVTPNVENWLVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLI 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   L+
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLFTLM 420

Query: 421 NLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLS 480
           NLQAKHED+AG LKTLNDMLKIGCR SSIVGNVLQAYEKARR+KSVP+LLTGSFYRKVL+
Sbjct: 421 NLQAKHEDDAGALKTLNDMLKIGCRLSSIVGNVLQAYEKARRIKSVPLLLTGSFYRKVLA 480

Query: 481 SQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIY 540
           SQTSCSILVMAYVKH LVDDALKVLREKEW D  FEENLYHLLICSCKEL HLENAIKIY
Sbjct: 481 SQTSCSILVMAYVKHGLVDDALKVLREKEWNDLRFEENLYHLLICSCKELDHLENAIKIY 540

Query: 541 TQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKA 600
           TQLPK ENKPNLHIT TMIDIYSIMGRFSDGEKLYLSL+SSGI LDLIA++VVVRMYVKA
Sbjct: 541 TQLPKHENKPNLHITSTMIDIYSIMGRFSDGEKLYLSLKSSGIRLDLIAFSVVVRMYVKA 600

Query: 601 GSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEM 660
           GSLEDACSVLDLM +QQDIVPDIYL RDMLRIYQRCGMV KL D+YYRIL S VSWDQEM
Sbjct: 601 GSLEDACSVLDLMDKQQDIVPDIYLFRDMLRIYQRCGMVDKLQDVYYRILNSDVSWDQEM 660

Query: 661 YNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQ 720
           YNCVINCCSRAL VDELS LFDEMLQ GFAPNTVTLNVMLDVYGKSK F+KAR L  LAQ
Sbjct: 661 YNCVINCCSRALLVDELSSLFDEMLQRGFAPNTVTLNVMLDVYGKSKHFSKARKLLLLAQ 720

Query: 721 KRGLVDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSVS 758
           K+GLVDXXXXXXXXXXXXXXXXXX MSSTV+ M+FNGFS+S
Sbjct: 721 KKGLVDXXXXXXXXXXXXXXXXXXXMSSTVRTMEFNGFSLS 760

BLAST of CsaV3_1G029170 vs. TAIR10
Match: AT4G30825.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 453.0 bits (1164), Expect = 4.3e-127
Identity = 380/766 (49.61%), Positives = 462/766 (60.31%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDS--NKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPS 60
           M SL+ S  L  FDS   +F F  N     D   +  + + +H  ++S I S  RV    
Sbjct: 1   MGSLRFSIPLDPFDSKRKRFHFSANPSQFPDQFPIHFVTSSIHATRASSIGSSTRVLDKI 60

Query: 61  KVS----QVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCND----NIL 120
           +VS    +  ++A + + +   E     K    ++ +K+     FSF R  ND    N+ 
Sbjct: 61  RVSSLGTEANENAINSASAAPVERSRSSKLSGDQRGTKKYVARKFSFRRGSNDLELENLF 120

Query: 121 FNGGELDVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLV 180
            N GE+DVNYS I    SLE CN ILKRLE C+D+  + FF+WMR NGKL  N  AY+L+
Sbjct: 121 VNNGEIDVNYSAIKPGQSLEHCNGILKRLESCSDTNAIKFFDWMRCNGKLVGNFVAYSLI 180

Query: 181 LRVLGRQEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQ 240
           LRVLGR+E+WD AE LI+E+      Q  +QVFNT+IYAC K   V+  +KWF MMLE  
Sbjct: 181 LRVLGRREEWDRAEDLIKELCGFHEFQKSYQVFNTVIYACTKKGNVKLASKWFHMMLEFG 240

Query: 241 VQPNVATFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEE 300
           V+PNVAT GMLMGLYQK  +++E+EFAF+ MR FGIVCE+AY+SMITIY R+ LYDKAEE
Sbjct: 241 VRPNVATIGMLMGLYQKNWNVEEAEFAFSHMRKFGIVCESAYSSMITIYTRLRLYDKAEE 300

Query: 301 VIQLMQEDKVIPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           VI LM++D+V   LENW+          XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 VIDLMKQDRVRLKLENWLVMLNAYSQQGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXLINLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSF 480
           XXXX INLQAK+ D  G +KT+ DM  IGC+ SSI+G +LQAYEK  ++  VP +L GSF
Sbjct: 421 XXXXXINLQAKYGDRDGAIKTIEDMTGIGCQYSSILGIILQAYEKVGKIDVVPCVLKGSF 480

Query: 481 YRKVLSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLE 540
           +  +  +QTS S LVM                                            
Sbjct: 481 HNHIRLNQTSFSSLVMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 540

Query: 541 NAIKIYTQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVV 600
                                                                       
Sbjct: 541 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 600

Query: 601 RMYVKAGSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGV 660
                      ACSVL++M EQ+DIVPD+YL RDMLRIYQ+C +  KL  LYYRI KSG+
Sbjct: 601 XXXXXXXXXXXACSVLEIMDEQKDIVPDVYLFRDMLRIYQKCDLQDKLQHLYYRIRKSGI 660

Query: 661 SWDQEMYNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARN 720
            W+QEMYNCVINCC+RALP+DELS  F+EM++ GF PNTVT NV+LDVYGK+KLF K   
Sbjct: 661 HWNQEMYNCVINCCARALPLDELSGTFEEMIRYGFTPNTVTFNVLLDVYGKAKLFKKVNE 720

Query: 721 LFGLAQKRGLVDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSV 757
           LF LA++ G+VD                  NMSS ++ M+F+GFSV
Sbjct: 721 LFLLAKRHGVVDVISYNTIIAAYGKNKDYTNMSSAIKNMQFDGFSV 766

BLAST of CsaV3_1G029170 vs. TAIR10
Match: AT3G23020.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 60.5 bits (145), Expect = 6.3e-09
Identity = 46/181 (25.41%), Positives = 84/181 (46.41%), Query Frame = 0

Query: 127 LSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDWDAAEKL 186
           LS ++   ILK  E+ +  + +  FEW +S G  + NV  YN++LR+LG+   W   + L
Sbjct: 152 LSNKERTIILK--EQIHWERAVEIFEWFKSKGCYELNVIHYNIMLRILGKACKWRYVQSL 211

Query: 187 IEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGMLMGLYQ 246
            +E+    G +     + TLI    K         W   M +  +QP+  T G+++ +Y+
Sbjct: 212 WDEM-IRKGIKPINSTYGTLIDVYSKGGLKVHALCWLGKMSKIGMQPDEVTTGIVLQMYK 271

Query: 247 KKCDIKESEFAF-------NQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQEDK 301
           K  + +++E  F       N+  +   +    Y +MI  Y +     +A E  + M E+ 
Sbjct: 272 KAREFQKAEEFFKKWSCDENKADSHVCLSSYTYNTMIDTYGKSGQIKEASETFKRMLEEG 329

BLAST of CsaV3_1G029170 vs. TAIR10
Match: AT1G50270.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 60.1 bits (144), Expect = 8.2e-09
Identity = 40/167 (23.95%), Positives = 79/167 (47.31%), Query Frame = 0

Query: 546 PNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKAGSLEDACSV 605
           PN     +++   + +G    G +++  +  + I ++  A   ++ +YVK G LE+A  V
Sbjct: 304 PNEKTLSSVLSACAHVGALHRGRRVHCYMIKNSIEINTTAGTTLIDLYVKCGCLEEAILV 363

Query: 606 LDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEMYNCVINCCS 665
            + + E+     ++Y    M+  +   G      DL+Y +L S VS ++  +  V++ C+
Sbjct: 364 FERLHEK-----NVYTWTAMINGFAAHGYARDAFDLFYTMLSSHVSPNEVTFMAVLSACA 423

Query: 666 RALPVDELSRLFDEML-QCGFAPNTVTLNVMLDVYGKSKLFTKARNL 712
               V+E  RLF  M  +    P       M+D++G+  L  +A+ L
Sbjct: 424 HGGLVEEGRRLFLSMKGRFNMEPKADHYACMVDLFGRKGLLEEAKAL 465

BLAST of CsaV3_1G029170 vs. TAIR10
Match: AT5G39350.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 60.1 bits (144), Expect = 8.2e-09
Identity = 53/269 (19.70%), Positives = 117/269 (43.49%), Query Frame = 0

Query: 415 LINLQAKHEDEAGTLKTLNDMLKIGCRPSSI-VGNVLQAYEKARRMKSVPVLLTGSFYRK 474
           +IN   +  D    L+    M   G RP+++ + +++     A ++     L   +  ++
Sbjct: 290 MINGYTEDGDVENALELCRLMQFEGVRPNAVTIASLVSVCGDALKVNDGKCLHGWAVRQQ 349

Query: 475 VLSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAI 534
           V S     + L+  Y K   VD   +V          +    +  +I  C +   + +A+
Sbjct: 350 VYSDIIIETSLISMYAKCKRVDLCFRVFSGAS----KYHTGPWSAIIAGCVQNELVSDAL 409

Query: 535 KIYTQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMY 594
            ++ ++ + + +PN+    +++  Y+ +        ++  L  +G    L A   +V +Y
Sbjct: 410 GLFKRMRREDVEPNIATLNSLLPAYAALADLRQAMNIHCYLTKTGFMSSLDAATGLVHVY 469

Query: 595 VKAGSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWD 654
            K G+LE A  + + + E+     D+ L   ++  Y   G  H    ++  +++SGV+ +
Sbjct: 470 SKCGTLESAHKIFNGIQEKHK-SKDVVLWGALISGYGMHGDGHNALQVFMEMVRSGVTPN 529

Query: 655 QEMYNCVINCCSRALPVDELSRLFDEMLQ 683
           +  +   +N CS +  V+E   LF  ML+
Sbjct: 530 EITFTSALNACSHSGLVEEGLTLFRFMLE 553

BLAST of CsaV3_1G029170 vs. TAIR10
Match: AT3G20730.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 58.5 bits (140), Expect = 2.4e-08
Identity = 56/246 (22.76%), Positives = 111/246 (45.12%), Query Frame = 0

Query: 484 LVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLE------NAIKIYT 543
           LV AYVK   + +A K+    + +D         LL C+    G  +      +A  I+ 
Sbjct: 255 LVNAYVKCGSLANAWKLHEGTKKRD---------LLSCTALITGFSQQNNCTSDAFDIFK 314

Query: 544 QLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLY-LSLRSSGIPLDLIAYNVVVRMYVKA 603
            + + + K +  +  +M+ I + +   + G +++  +L+SS I  D+   N ++ MY K+
Sbjct: 315 DMIRMKTKMDEVVVSSMLKICTTIASVTIGRQIHGFALKSSQIRFDVALGNSLIDMYAKS 374

Query: 604 GSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEM 663
           G +EDA    + M E+     D+     ++  Y R G   K  DLY R+    +  +   
Sbjct: 375 GEIEDAVLAFEEMKEK-----DVRSWTSLIAGYGRHGNFEKAIDLYNRMEHERIKPNDVT 434

Query: 664 YNCVINCCSRALPVDELSRLFDEML-QCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLA 722
           +  +++ CS     +   +++D M+ + G       L+ ++D+  +S    +A  L  + 
Sbjct: 435 FLSLLSACSHTGQTELGWKIYDTMINKHGIEAREEHLSCIIDMLARSGYLEEAYAL--IR 484

BLAST of CsaV3_1G029170 vs. Swiss-Prot
Match: sp|O65567|PP342_ARATH (Pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At4g30825 PE=2 SV=2)

HSP 1 Score: 453.0 bits (1164), Expect = 7.8e-126
Identity = 380/766 (49.61%), Positives = 462/766 (60.31%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDS--NKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPS 60
           M SL+ S  L  FDS   +F F  N     D   +  + + +H  ++S I S  RV    
Sbjct: 1   MGSLRFSIPLDPFDSKRKRFHFSANPSQFPDQFPIHFVTSSIHATRASSIGSSTRVLDKI 60

Query: 61  KVS----QVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCND----NIL 120
           +VS    +  ++A + + +   E     K    ++ +K+     FSF R  ND    N+ 
Sbjct: 61  RVSSLGTEANENAINSASAAPVERSRSSKLSGDQRGTKKYVARKFSFRRGSNDLELENLF 120

Query: 121 FNGGELDVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLV 180
            N GE+DVNYS I    SLE CN ILKRLE C+D+  + FF+WMR NGKL  N  AY+L+
Sbjct: 121 VNNGEIDVNYSAIKPGQSLEHCNGILKRLESCSDTNAIKFFDWMRCNGKLVGNFVAYSLI 180

Query: 181 LRVLGRQEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQ 240
           LRVLGR+E+WD AE LI+E+      Q  +QVFNT+IYAC K   V+  +KWF MMLE  
Sbjct: 181 LRVLGRREEWDRAEDLIKELCGFHEFQKSYQVFNTVIYACTKKGNVKLASKWFHMMLEFG 240

Query: 241 VQPNVATFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEE 300
           V+PNVAT GMLMGLYQK  +++E+EFAF+ MR FGIVCE+AY+SMITIY R+ LYDKAEE
Sbjct: 241 VRPNVATIGMLMGLYQKNWNVEEAEFAFSHMRKFGIVCESAYSSMITIYTRLRLYDKAEE 300

Query: 301 VIQLMQEDKVIPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           VI LM++D+V   LENW+          XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 VIDLMKQDRVRLKLENWLVMLNAYSQQGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXLINLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSF 480
           XXXX INLQAK+ D  G +KT+ DM  IGC+ SSI+G +LQAYEK  ++  VP +L GSF
Sbjct: 421 XXXXXINLQAKYGDRDGAIKTIEDMTGIGCQYSSILGIILQAYEKVGKIDVVPCVLKGSF 480

Query: 481 YRKVLSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLE 540
           +  +  +QTS S LVM                                            
Sbjct: 481 HNHIRLNQTSFSSLVMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 540

Query: 541 NAIKIYTQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVV 600
                                                                       
Sbjct: 541 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 600

Query: 601 RMYVKAGSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGV 660
                      ACSVL++M EQ+DIVPD+YL RDMLRIYQ+C +  KL  LYYRI KSG+
Sbjct: 601 XXXXXXXXXXXACSVLEIMDEQKDIVPDVYLFRDMLRIYQKCDLQDKLQHLYYRIRKSGI 660

Query: 661 SWDQEMYNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARN 720
            W+QEMYNCVINCC+RALP+DELS  F+EM++ GF PNTVT NV+LDVYGK+KLF K   
Sbjct: 661 HWNQEMYNCVINCCARALPLDELSGTFEEMIRYGFTPNTVTFNVLLDVYGKAKLFKKVNE 720

Query: 721 LFGLAQKRGLVDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSV 757
           LF LA++ G+VD                  NMSS ++ M+F+GFSV
Sbjct: 721 LFLLAKRHGVVDVISYNTIIAAYGKNKDYTNMSSAIKNMQFDGFSV 766

BLAST of CsaV3_1G029170 vs. Swiss-Prot
Match: sp|Q9LS88|PP250_ARATH (Pentatricopeptide repeat-containing protein At3g23020 OS=Arabidopsis thaliana OX=3702 GN=At3g23020 PE=2 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 1.1e-07
Identity = 46/181 (25.41%), Positives = 84/181 (46.41%), Query Frame = 0

Query: 127 LSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDWDAAEKL 186
           LS ++   ILK  E+ +  + +  FEW +S G  + NV  YN++LR+LG+   W   + L
Sbjct: 152 LSNKERTIILK--EQIHWERAVEIFEWFKSKGCYELNVIHYNIMLRILGKACKWRYVQSL 211

Query: 187 IEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGMLMGLYQ 246
            +E+    G +     + TLI    K         W   M +  +QP+  T G+++ +Y+
Sbjct: 212 WDEM-IRKGIKPINSTYGTLIDVYSKGGLKVHALCWLGKMSKIGMQPDEVTTGIVLQMYK 271

Query: 247 KKCDIKESEFAF-------NQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQEDK 301
           K  + +++E  F       N+  +   +    Y +MI  Y +     +A E  + M E+ 
Sbjct: 272 KAREFQKAEEFFKKWSCDENKADSHVCLSSYTYNTMIDTYGKSGQIKEASETFKRMLEEG 329

BLAST of CsaV3_1G029170 vs. Swiss-Prot
Match: sp|Q9FLZ9|PP405_ARATH (Pentatricopeptide repeat-containing protein At5g39350 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E16 PE=2 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 1.5e-07
Identity = 53/269 (19.70%), Positives = 117/269 (43.49%), Query Frame = 0

Query: 415 LINLQAKHEDEAGTLKTLNDMLKIGCRPSSI-VGNVLQAYEKARRMKSVPVLLTGSFYRK 474
           +IN   +  D    L+    M   G RP+++ + +++     A ++     L   +  ++
Sbjct: 290 MINGYTEDGDVENALELCRLMQFEGVRPNAVTIASLVSVCGDALKVNDGKCLHGWAVRQQ 349

Query: 475 VLSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAI 534
           V S     + L+  Y K   VD   +V          +    +  +I  C +   + +A+
Sbjct: 350 VYSDIIIETSLISMYAKCKRVDLCFRVFSGAS----KYHTGPWSAIIAGCVQNELVSDAL 409

Query: 535 KIYTQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMY 594
            ++ ++ + + +PN+    +++  Y+ +        ++  L  +G    L A   +V +Y
Sbjct: 410 GLFKRMRREDVEPNIATLNSLLPAYAALADLRQAMNIHCYLTKTGFMSSLDAATGLVHVY 469

Query: 595 VKAGSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWD 654
            K G+LE A  + + + E+     D+ L   ++  Y   G  H    ++  +++SGV+ +
Sbjct: 470 SKCGTLESAHKIFNGIQEKHK-SKDVVLWGALISGYGMHGDGHNALQVFMEMVRSGVTPN 529

Query: 655 QEMYNCVINCCSRALPVDELSRLFDEMLQ 683
           +  +   +N CS +  V+E   LF  ML+
Sbjct: 530 EITFTSALNACSHSGLVEEGLTLFRFMLE 553

BLAST of CsaV3_1G029170 vs. Swiss-Prot
Match: sp|Q9SX45|PPR75_ARATH (Pentatricopeptide repeat-containing protein At1g50270 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E42 PE=2 SV=1)

HSP 1 Score: 60.1 bits (144), Expect = 1.5e-07
Identity = 40/167 (23.95%), Positives = 79/167 (47.31%), Query Frame = 0

Query: 546 PNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKAGSLEDACSV 605
           PN     +++   + +G    G +++  +  + I ++  A   ++ +YVK G LE+A  V
Sbjct: 304 PNEKTLSSVLSACAHVGALHRGRRVHCYMIKNSIEINTTAGTTLIDLYVKCGCLEEAILV 363

Query: 606 LDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEMYNCVINCCS 665
            + + E+     ++Y    M+  +   G      DL+Y +L S VS ++  +  V++ C+
Sbjct: 364 FERLHEK-----NVYTWTAMINGFAAHGYARDAFDLFYTMLSSHVSPNEVTFMAVLSACA 423

Query: 666 RALPVDELSRLFDEML-QCGFAPNTVTLNVMLDVYGKSKLFTKARNL 712
               V+E  RLF  M  +    P       M+D++G+  L  +A+ L
Sbjct: 424 HGGLVEEGRRLFLSMKGRFNMEPKADHYACMVDLFGRKGLLEEAKAL 465

BLAST of CsaV3_1G029170 vs. Swiss-Prot
Match: sp|Q9LT48|PP244_ARATH (Pentatricopeptide repeat-containing protein At3g20730 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E94 PE=2 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 4.3e-07
Identity = 56/246 (22.76%), Positives = 111/246 (45.12%), Query Frame = 0

Query: 484 LVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLE------NAIKIYT 543
           LV AYVK   + +A K+    + +D         LL C+    G  +      +A  I+ 
Sbjct: 256 LVNAYVKCGSLANAWKLHEGTKKRD---------LLSCTALITGFSQQNNCTSDAFDIFK 315

Query: 544 QLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLY-LSLRSSGIPLDLIAYNVVVRMYVKA 603
            + + + K +  +  +M+ I + +   + G +++  +L+SS I  D+   N ++ MY K+
Sbjct: 316 DMIRMKTKMDEVVVSSMLKICTTIASVTIGRQIHGFALKSSQIRFDVALGNSLIDMYAKS 375

Query: 604 GSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEM 663
           G +EDA    + M E+     D+     ++  Y R G   K  DLY R+    +  +   
Sbjct: 376 GEIEDAVLAFEEMKEK-----DVRSWTSLIAGYGRHGNFEKAIDLYNRMEHERIKPNDVT 435

Query: 664 YNCVINCCSRALPVDELSRLFDEML-QCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLA 722
           +  +++ CS     +   +++D M+ + G       L+ ++D+  +S    +A  L  + 
Sbjct: 436 FLSLLSACSHTGQTELGWKIYDTMINKHGIEAREEHLSCIIDMLARSGYLEEAYAL--IR 485

BLAST of CsaV3_1G029170 vs. TrEMBL
Match: tr|A0A0A0LTR9|A0A0A0LTR9_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G257890 PE=4 SV=1)

HSP 1 Score: 1219.5 bits (3154), Expect = 0.0e+00
Identity = 757/757 (100.00%), Positives = 757/757 (100.00%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60
           MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV
Sbjct: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60

Query: 61  SQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCNDNILFNGGELDVNY 120
           SQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCNDNILFNGGELDVNY
Sbjct: 61  SQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCNDNILFNGGELDVNY 120

Query: 121 STISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDW 180
           STISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDW
Sbjct: 121 STISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGRQEDW 180

Query: 181 DAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGM 240
           DAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGM
Sbjct: 181 DAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVATFGM 240

Query: 241 LMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQEDKV 300
           LMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQEDKV
Sbjct: 241 LMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQEDKV 300

Query: 301 IPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           IPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 IPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLINLQA 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLINLQA
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLINLQA 420

Query: 421 KHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLSSQTS 480
           KHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLSSQTS
Sbjct: 421 KHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLSSQTS 480

Query: 481 CSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIYTQLP 540
           CSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIYTQLP
Sbjct: 481 CSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIYTQLP 540

Query: 541 KRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKAGSLE 600
           KRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKAGSLE
Sbjct: 541 KRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKAGSLE 600

Query: 601 DACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEMYNCV 660
           DACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEMYNCV
Sbjct: 601 DACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEMYNCV 660

Query: 661 INCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQKRGL 720
           INCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQKRGL
Sbjct: 661 INCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQKRGL 720

Query: 721 VDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSVS 758
           VDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSVS
Sbjct: 721 VDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSVS 757

BLAST of CsaV3_1G029170 vs. TrEMBL
Match: tr|A0A1S4DV41|A0A1S4DV41_CUCME (pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103487334 PE=4 SV=1)

HSP 1 Score: 1149.0 bits (2971), Expect = 0.0e+00
Identity = 719/761 (94.48%), Positives = 739/761 (97.11%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60
           MASLKLSFSLHSFDSNKFDFP+NSP LSDYCSLFSIN ++HLNKS I+YSLARVHKPSKV
Sbjct: 1   MASLKLSFSLHSFDSNKFDFPVNSPPLSDYCSLFSINGYIHLNKSCILYSLARVHKPSKV 60

Query: 61  SQVEQDASDVSQSRFDEIVARKKYFTSKKPSKRAAGSHFSFSRNCN----DNILFNGGEL 120
           SQVE +ASDVSQSRFD+I +RKKYFT+KKPSKRAAGSHFSFSRNC+    +NILF+GGEL
Sbjct: 61  SQVEPEASDVSQSRFDDIDSRKKYFTAKKPSKRAAGSHFSFSRNCSEKIFENILFSGGEL 120

Query: 121 DVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFFEWMRSNGKLKHNVSAYNLVLRVLGR 180
           DVNYSTISSDLSLE CNAILKRLEKCNDSKTL FFEWMRSNGKLKHNVSAYNLVLRVLGR
Sbjct: 121 DVNYSTISSDLSLEGCNAILKRLEKCNDSKTLDFFEWMRSNGKLKHNVSAYNLVLRVLGR 180

Query: 181 QEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFVEQGTKWFRMMLECQVQPNVA 240
           QEDWDAAEKLI+EVRAELGSQLDFQVFNTLIYACYKS FVE GTKWFRMMLECQVQPNVA
Sbjct: 181 QEDWDAAEKLIKEVRAELGSQLDFQVFNTLIYACYKSGFVEWGTKWFRMMLECQVQPNVA 240

Query: 241 TFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQ 300
           TFGMLMGLYQK CDI+ESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQ
Sbjct: 241 TFGMLMGLYQKSCDIEESEFAFNQMRNFGIVCETAYASMITIYIRMNLYDKAEEVIQLMQ 300

Query: 301 EDKVIPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           +DKVIPNLENW+XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 KDKVIPNLENWLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLI 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX I
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXI 420

Query: 421 NLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKARRMKSVPVLLTGSFYRKVLS 480
           NLQAKHEDEAG LKTLNDMLKIGCRPSSIVGNVLQAYEKARR+KSVPVLLTGSFYRKVLS
Sbjct: 421 NLQAKHEDEAGALKTLNDMLKIGCRPSSIVGNVLQAYEKARRIKSVPVLLTGSFYRKVLS 480

Query: 481 SQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHLENAIKIY 540
           SQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGH E+AIKIY
Sbjct: 481 SQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLYHLLICSCKELGHFESAIKIY 540

Query: 541 TQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKA 600
            Q PKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKA
Sbjct: 541 AQRPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRSSGIPLDLIAYNVVVRMYVKA 600

Query: 601 GSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVHKLADLYYRILKSGVSWDQEM 660
           GSLEDACSVLDLMAEQQDIVPD+YLLRDMLRIYQRCGMVHKL+DLYYRILKSGVSWDQEM
Sbjct: 601 GSLEDACSVLDLMAEQQDIVPDVYLLRDMLRIYQRCGMVHKLSDLYYRILKSGVSWDQEM 660

Query: 661 YNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFTKARNLFGLAQ 720
           YNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLF KARNLFG AQ
Sbjct: 661 YNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVMLDVYGKSKLFAKARNLFGFAQ 720

Query: 721 KRGLVDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSVS 758
           KRGLVD XXXXXXXXXXXXXXXXXNMSSTVQ+MKFNGFSVS
Sbjct: 721 KRGLVDAXXXXXXXXXXXXXXXXXNMSSTVQQMKFNGFSVS 761

BLAST of CsaV3_1G029170 vs. TrEMBL
Match: tr|A0A2N9HKQ3|A0A2N9HKQ3_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40694 PE=4 SV=1)

HSP 1 Score: 760.8 bits (1963), Expect = 3.5e-216
Identity = 524/783 (66.92%), Positives = 620/783 (79.18%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60
           MAS+KLS  L S+DS K  F +N P  S++CS+ SI +++H+++   I SL R +   KV
Sbjct: 1   MASMKLSILLDSYDSKK--FTVNPPQPSNWCSVSSIFSYIHVSRVCTINSLNR-NTRIKV 60

Query: 61  SQVEQDASDVSQSR--FDEIV----------------------ARKKYFTSKKPSKRAAG 120
           S+ + D  ++S+S    D+IV                        K++  +KK  KR  G
Sbjct: 61  SRFDTDLPNISESNGVDDDIVLSPTKGMVNESLIEQNPDFEGRVEKRFRGTKKGIKREVG 120

Query: 121 SHFSFSRNCN----DNILFNGGELDVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFFE 180
             + F RN +    +N+  + GELDVNYS I SDLSLE CNA+LKRLEKC+DSKTL FFE
Sbjct: 121 LKYRFRRNGSEREIENLFVDDGELDVNYSGIGSDLSLEHCNAVLKRLEKCSDSKTLEFFE 180

Query: 181 WMRSNGKLKHNVSAYNLVLRVLGRQEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYK 240
           WMRSNGKL+ NVSAYNLVLRVLGR+EDWDAAE ++ E+  +LG +LD +VFNT+IYAC K
Sbjct: 181 WMRSNGKLEQNVSAYNLVLRVLGRREDWDAAETMVRELCNKLGCELDCRVFNTVIYACCK 240

Query: 241 SRFVEQGTKWFRMMLECQVQPNVATFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETAY 300
            R VE G KWFRMMLE  V+PNVATFGMLMGLYQK  +++E+EF F QMR+F IVC++AY
Sbjct: 241 LRRVELGAKWFRMMLENGVRPNVATFGMLMGLYQKGWNVEEAEFTFCQMRDFEIVCQSAY 300

Query: 301 ASMITIYIRMNLYDKAEEVIQLMQEDKVIPNLENWVXXXXXXXXXXXXXXXXXXXXXXXX 360
           ++MITIY R++LYDKAE VI LM+EDKV  NLENW+XXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SAMITIYTRLSLYDKAEGVIGLMREDKVDKNLENWLXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXLINLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQA 480
           XXXXXXXXXXXXXXXXXXXXXX   LQA+H DE G ++TL+DMLK+ C+ SSI+G +LQA
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXLQARHGDEEGAVRTLDDMLKMECQHSSILGTLLQA 480

Query: 481 YEKARRMKSVPVLLTGSFYRKVLSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFE 540
           YE+A R+  VP++L GSFY+ VL +QTSCSILVMAYVK+CLVDDA+KVL +K WKD  FE
Sbjct: 481 YERAGRIDKVPLILKGSFYQHVLVNQTSCSILVMAYVKNCLVDDAIKVLGDKFWKDPLFE 540

Query: 541 ENLYHLLICSCKELGHLENAIKIYTQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYL 600
            NLYHLLICSCKE GHLENAIK+YTQ+PK ++KPNLHI+CTMIDIYS+M  F + E+LYL
Sbjct: 541 NNLYHLLICSCKEWGHLENAIKVYTQMPKHDDKPNLHISCTMIDIYSVMSLFPEAEQLYL 600

Query: 601 SLRSSGIPLDLIAYNVVVRMYVKAGSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRC 660
            L+SSGI LD+IA+++VVRMY+KAG L  ACSVLD+M +Q DIVPDIYL RDML      
Sbjct: 601 ELKSSGIALDMIAFSIVVRMYIKAGLLSKACSVLDMMDKQGDIVPDIYLFRDMLXXXXXX 660

Query: 661 GMVHKLADLYYRILKSGVSWDQEMYNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTL 720
            M+ KLADLYY+ILKSGV+WDQEMYNCVINCC+RALPVDELSRLFDEMLQ GF+PNT+T 
Sbjct: 661 XMLGKLADLYYKILKSGVTWDQEMYNCVINCCARALPVDELSRLFDEMLQLGFSPNTITF 720

Query: 721 NVMLDVYGKSKLFTKARNLFGLAQKRGLVDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFN 756
           NVMLDVYGKSKLFTKA+ LF +AQK+GLVD                  NMSSTV+KM+FN
Sbjct: 721 NVMLDVYGKSKLFTKAKRLFWMAQKQGLVDVISYNTIIAAYGQNKDFKNMSSTVRKMQFN 780

BLAST of CsaV3_1G029170 vs. TrEMBL
Match: tr|A0A2I4EW02|A0A2I4EW02_9ROSI (pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Juglans regia OX=51240 GN=LOC108993204 PE=4 SV=1)

HSP 1 Score: 743.0 bits (1917), Expect = 7.6e-211
Identity = 503/785 (64.08%), Positives = 601/785 (76.56%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLH-----------LNKSSIIY 60
           MA LKLS SL +FDS KF+     P  S++CS +SI +++H           LN+  I  
Sbjct: 1   MALLKLSISLDTFDSKKFNVSAAQP--SNWCSEYSIFSYIHVSRACASSINPLNRKQINV 60

Query: 61  SLARVHKPSKVSQVEQDASDVSQSRFDEIV--------------ARKKYFTSKKPSKRAA 120
           S   +  P ++S   Q   D+  S    +V                K++  SKK  KR  
Sbjct: 61  SRFNIELP-EISDSNQANKDIVLSSTKSLVNESVIEQKPDFKHKVGKRFLGSKKGIKREV 120

Query: 121 GSHFSFSRNCND----NILFNGGELDVNYSTISSDLSLEDCNAILKRLEKCNDSKTLGFF 180
           G  F F RN  D    N+    GELDV+YS I SDLSLE CNAILKRLEKC D+KT+ FF
Sbjct: 121 GLEFRFGRNDTDREIENLFVGDGELDVDYSAIGSDLSLEHCNAILKRLEKCCDTKTMVFF 180

Query: 181 EWMRSNGKLKHNVSAYNLVLRVLGRQEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACY 240
           EWMRSNGKL+ N+SA+N+VLRVLGR+EDWDAAE ++  +  +L  +LD +VFNT+IY+  
Sbjct: 181 EWMRSNGKLEQNMSAHNIVLRVLGRKEDWDAAEAMVRGLSIKLAGELDCRVFNTVIYSFC 240

Query: 241 KSRFVEQGTKWFRMMLECQVQPNVATFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETA 300
           K    E G KWFRMMLE  V+PNVATFGMLMGLYQK  +++E+EF F QMRNFGIVCE+A
Sbjct: 241 KLGRAELGAKWFRMMLENGVRPNVATFGMLMGLYQKGWNVEEAEFTFRQMRNFGIVCESA 300

Query: 301 YASMITIYIRMNLYDKAEEVIQLMQEDKVIPNLENWVXXXXXXXXXXXXXXXXXXXXXXX 360
           Y++MITI+ R+NLYDKAE+VI LM EDKV+ NLENW+      XXXXXXXXXXXXXXXXX
Sbjct: 301 YSAMITIFTRLNLYDKAEQVIGLMTEDKVVLNLENWLVILNTFXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXXXXXXLINLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQ 480
           XXXXXXXXXXXXXXXXXXXXXXX          E G ++TL+DMLK+GC+ SSI+G +LQ
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEEGAVRTLDDMLKMGCQCSSILGTLLQ 480

Query: 481 AYEKARRMKSVPVLLTGSFYRKVLSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHF 540
           AYE+A R+  V  +L G  Y+ +L +QTSCSILV AYVKHCLVDDA++VL +K WKD  F
Sbjct: 481 AYERAGRIDKVAQILNGPLYQHILVNQTSCSILVAAYVKHCLVDDAIRVLEDKVWKDLPF 540

Query: 541 EENLYHLLICSCKELGHLENAIKIYTQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLY 600
           E NLYHLLICSCKELG LE A+KIYTQ+PK ++KPNLHI+CTMIDIYS+MG F + EK+Y
Sbjct: 541 ENNLYHLLICSCKELGQLEQAVKIYTQMPKHDDKPNLHISCTMIDIYSVMGLFPEAEKIY 600

Query: 601 LSLRSSGIPLDLIAYNVVVRMYVKAGSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQR 660
           L L+SSGI LD+IA+++VVRMY+KAGSL +AC+VLD++ +Q+DIVPD+YLLRDMLRIYQR
Sbjct: 601 LKLKSSGIALDMIAFSIVVRMYIKAGSLRNACAVLDILDKQRDIVPDVYLLRDMLRIYQR 660

Query: 661 CGMVHKLADLYYRILKSGVSWDQEMYNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVT 720
           CGM+ KLADLY++I+K G++WDQEMYNCVINCCSRALPVDELSRLFDEMLQ GF+PNT+T
Sbjct: 661 CGMLDKLADLYHKIMKIGMTWDQEMYNCVINCCSRALPVDELSRLFDEMLQRGFSPNTIT 720

Query: 721 LNVMLDVYGKSKLFTKARNLFGLAQKRGLVDXXXXXXXXXXXXXXXXXXNMSSTVQKMKF 757
           +NVMLDVYGKSKLFTKA+ LF +AQK+GL+D                  NMSS V+KM+F
Sbjct: 721 VNVMLDVYGKSKLFTKAKRLFWVAQKQGLIDVISYNTVIAAYGQNKKFKNMSSMVRKMQF 780

BLAST of CsaV3_1G029170 vs. TrEMBL
Match: tr|A0A2C9ULY6|A0A2C9ULY6_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_14G112800 PE=4 SV=1)

HSP 1 Score: 725.7 bits (1872), Expect = 1.3e-205
Identity = 495/780 (63.46%), Positives = 615/780 (78.85%), Query Frame = 0

Query: 1   MASLKLSFSLHSFDSNKFDFPLNSPLLSDYCSLFSINAHLHLNKSSIIYSLARVHKPSKV 60
           MASL+LS SL++  SN    PL  P    + SLFSI++ +  +K  II  L+R + P KV
Sbjct: 1   MASLRLSISLNANKSNFSKNPLQFP---THVSLFSISSCIPSSKPCIITILSRFN-PVKV 60

Query: 61  SQVEQDASD-----------VSQSRFDEIV---------ARKKYFTSKKPSKRAAGSHFS 120
           S+VE + S+           V +S   +++          RK Y  +KK  K   G  F+
Sbjct: 61  SRVETELSESEPVLSTSRDLVQESLNQDLIERNQDLKRKIRKNYRGAKKGRKSQVGFKFN 120

Query: 121 FSRNCN---DNILFNGGELDVNYSTISSDLSLEDCNAILKRLEKC-NDSKTLGFFEWMRS 180
           + R+ +   ++   +  +LDV+YS I+S+LSLE CN ILK+LE C ++SKTL FFEWM+S
Sbjct: 121 YKRHGSQQREDFFVHDTDLDVDYSVINSNLSLEQCNYILKQLEGCSSESKTLRFFEWMKS 180

Query: 181 NGKLKHNVSAYNLVLRVLGRQEDWDAAEKLIEEVRAELGSQLDFQVFNTLIYACYKSRFV 240
           NGKL+ NV+AYN++LRVL R+EDWD AE++I E+    GS LDF++FNTLIY C K   +
Sbjct: 181 NGKLEKNVNAYNVILRVLARREDWDCAERMIRELSDSFGSALDFRIFNTLIYICSKRGHM 240

Query: 241 EQGTKWFRMMLECQVQPNVATFGMLMGLYQKKCDIKESEFAFNQMRNFGIVCETAYASMI 300
           + G KWF MMLE  VQPNVATFGMLMGLYQK  +++E+EF F+QMR+F I+C++AY++MI
Sbjct: 241 KLGGKWFLMMLELGVQPNVATFGMLMGLYQKGWNVEEAEFVFSQMRSFRIICQSAYSAMI 300

Query: 301 TIYIRMNLYDKAEEVIQLMQEDKVIPNLENWVXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           TIY R+ LYDKAEEVI +M++D V  NLENW+ XXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 TIYTRLRLYDKAEEVIGIMRKDNVALNLENWLVXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXXXXXXXLINLQAKHEDEAGTLKTLNDMLKIGCRPSSIVGNVLQAYEKA 480
           XXXXXXXXXXXXXXXXXX  NLQAKH+DE G ++T+ DMLK+GC+ SSI+G +L++YE+A
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXNLQAKHDDEEGAVRTIQDMLKMGCQYSSILGTLLKSYERA 480

Query: 481 RRMKSVPVLLTGSFYRKVLSSQTSCSILVMAYVKHCLVDDALKVLREKEWKDHHFEENLY 540
            ++  VP+LL GSFY+ VL +QTSCSILVMAYVKHCLV DAL+VL++KEW D  FE+NLY
Sbjct: 481 GKIDKVPLLLKGSFYQHVLVNQTSCSILVMAYVKHCLVHDALEVLQDKEWNDPAFEDNLY 540

Query: 541 HLLICSCKELGHLENAIKIYTQLPKRENKPNLHITCTMIDIYSIMGRFSDGEKLYLSLRS 600
           HLLICSCKELGHLENA+KIY+Q+PK   KPNLHI CTMID+YS +G F++GEKLYL L+S
Sbjct: 541 HLLICSCKELGHLENAVKIYSQMPKSNGKPNLHILCTMIDVYSSLGLFTEGEKLYLQLKS 600

Query: 601 SGIPLDLIAYNVVVRMYVKAGSLEDACSVLDLMAEQQDIVPDIYLLRDMLRIYQRCGMVH 660
           SGI LD+IA+++VVRMYVKAG L+DAC+VL+ + +Q+DI+PDIYL RDMLRIYQRCGM+ 
Sbjct: 601 SGIALDMIAFSIVVRMYVKAGLLKDACTVLETIEKQKDIIPDIYLFRDMLRIYQRCGMMS 660

Query: 661 KLADLYYRILKSGVSWDQEMYNCVINCCSRALPVDELSRLFDEMLQCGFAPNTVTLNVML 720
           KL DLYY+IL+SGV WDQE+Y+C+INCC+RALPV E+SRLF+EML+CGF+PNT+T NVML
Sbjct: 661 KLNDLYYKILRSGVVWDQELYSCIINCCARALPVYEISRLFNEMLRCGFSPNTITFNVML 720

Query: 721 DVYGKSKLFTKARNLFGLAQKRGLVDXXXXXXXXXXXXXXXXXXNMSSTVQKMKFNGFSV 757
           DVYGK+K F K + LF +A+KRGLVD                  NM+S +QKM+F+GFSV
Sbjct: 721 DVYGKAKNFRKVKELFWMARKRGLVDVISYNTVIAAYGHNRDFKNMASAIQKMQFDGFSV 776
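
The percentages on each HSP summary line above are simply the identical (or positive) column count divided by the alignment length. The following is a minimal Python sketch, not part of the database record, that parses such a line and recomputes both values; the function name `parse_hsp_stats` and the returned field names are hypothetical, and the pattern also accepts the "Postives" spelling that some BLAST report renderers emit.

```python
import re

# Matches e.g. "Identity = 53/269 (19.70%), Positives = 117/269 (43.49%)".
# "Posi?tives" also tolerates the misspelled "Postives" variant.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP summary line and recompute its percentages.

    Hypothetical helper for illustration only.
    """
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Values printed in the report
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # Values recomputed from the raw counts
        "identity_pct": round(100.0 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100.0 * int(pos) / int(pos_len), 2),
    }


# Example call on the AT5G39350.1 HSP summary from this record
stats = parse_hsp_stats(
    "Identity = 53/269 (19.70%), Positives = 117/269 (43.49%), Query Frame = 0"
)
```

Comparing `identity_pct` with `reported_identity_pct` is a quick consistency check when alignment blocks are copied out of a rendered page.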

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_004146719.1 | 0.0e+00 | 100.00 | PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic ...
XP_016899838.1 | 0.0e+00 | 94.48 | PREDICTED: pentatricopeptide repeat-containing protein At4g30825, chloroplastic ...
XP_022135004.1 | 1.1e-304 | 87.27 | pentatricopeptide repeat-containing protein At4g30825, chloroplastic isoform X1 ...
XP_023516176.1 | 6.4e-286 | 84.89 | pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucurbita ...
XP_022922044.1 | 1.7e-283 | 84.76 | pentatricopeptide repeat-containing protein At4g30825, chloroplastic [Cucurbita ...

Match Name | E-value | Identity | Description
AT4G30825.1 | 4.3e-127 | 49.61 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G23020.1 | 6.3e-09 | 25.41 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G50270.1 | 8.2e-09 | 23.95 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G39350.1 | 8.2e-09 | 19.70 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G20730.1 | 2.4e-08 | 22.76 | Tetratricopeptide repeat (TPR)-like superfamily protein

Match Name | E-value | Identity | Description
sp|O65567|PP342_ARATH | 7.8e-126 | 49.61 | Pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Arabidop...
sp|Q9LS88|PP250_ARATH | 1.1e-07 | 25.41 | Pentatricopeptide repeat-containing protein At3g23020 OS=Arabidopsis thaliana OX...
sp|Q9FLZ9|PP405_ARATH | 1.5e-07 | 19.70 | Pentatricopeptide repeat-containing protein At5g39350 OS=Arabidopsis thaliana OX...
sp|Q9SX45|PPR75_ARATH | 1.5e-07 | 23.95 | Pentatricopeptide repeat-containing protein At1g50270 OS=Arabidopsis thaliana OX...
sp|Q9LT48|PP244_ARATH | 4.3e-07 | 22.76 | Pentatricopeptide repeat-containing protein At3g20730 OS=Arabidopsis thaliana OX...

Match Name | E-value | Identity | Description
tr|A0A0A0LTR9|A0A0A0LTR9_CUCSA | 0.0e+00 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G257890 PE=4 SV=1
tr|A0A1S4DV41|A0A1S4DV41_CUCME | 0.0e+00 | 94.48 | pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Cucumis ...
tr|A0A2N9HKQ3|A0A2N9HKQ3_FAGSY | 3.5e-216 | 66.92 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS40694 PE=4 SV=1
tr|A0A2I4EW02|A0A2I4EW02_9ROSI | 7.6e-211 | 64.08 | pentatricopeptide repeat-containing protein At4g30825, chloroplastic OS=Juglans ...
tr|A0A2C9ULY6|A0A2C9ULY6_MANES | 1.3e-205 | 63.46 | Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_14G112800 PE=4 SV=...
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term | Definition
GO:0005515 | protein binding
Vocabulary: INTERPRO
Term | Definition
IPR011990 | TPR-like_helical_dom_sf
IPR002885 | Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0007049 cell cycle
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CsaV3_1G029170.1 | CsaV3_1G029170.1 | mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
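IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 840..860
None | No IPR available | PANTHER | PTHR12683:SF10 | SUBFAMILY NOT NAMED | coord: 58..894
None | No IPR available | PANTHER | PTHR12683 | FAMILY NOT NAMED | coord: 58..894
None | No IPR available | SUPERFAMILY | SSF81901 | HCP-like | coord: 176..402
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 307..338; e-value: 4.8E-7; score: 27.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 202..235; e-value: 0.0013; score: 16.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 794..827; e-value: 4.1E-7; score: 27.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 271..303; e-value: 3.3E-5; score: 21.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 585..619; e-value: 3.2E-7; score: 28.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 341..374; e-value: 1.6E-5; score: 22.8
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 829..862; e-value: 6.3E-10; score: 36.6
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 724..757; e-value: 1.1E-5; score: 23.3
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 376..408; e-value: 8.3E-4; score: 17.4
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 690..721; e-value: 0.0014; score: 16.7
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 760..792; e-value: 4.1E-5; score: 21.5
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 656..689; e-value: 2.4E-6; score: 25.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 724..754; e-value: 2.4E-4; score: 21.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 166..191; e-value: 0.35; score: 11.1
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 585..611; e-value: 0.0015; score: 18.5
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 553..579; e-value: 0.4; score: 10.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 307..335; e-value: 2.6E-6; score: 27.2
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 480..503; e-value: 0.19; score: 12.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 271..299; e-value: 0.0045; score: 17.0
IPR002885 | Pentatricopeptide repeat | PFAM | PF13812 | PPR_3 | coord: 194..245; e-value: 0.0028; score: 17.6
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 826..873; e-value: 2.1E-12; score: 46.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 338..383; e-value: 5.9E-11; score: 42.3
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 760..801; e-value: 1.4E-8; score: 34.7
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 656..699; e-value: 8.2E-8; score: 32.2
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 234..264; score: 6.369
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 582..612; score: 9.339
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 373..407; score: 11.772
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 722..756; score: 9.734
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 199..233; score: 9.262
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 303..337; score: 11.389
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 338..372; score: 11.378
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 477..511; score: 6.39
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 862..894; score: 7.333
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 827..861; score: 13.197
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 547..581; score: 7.87
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 757..791; score: 8.177
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 268..302; score: 9.24
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 512..546; score: 7.958
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 163..193; score: 8.396
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 688..718; score: 7.541
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 618..652; score: 6.96
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 408..442; score: 6.588
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 653..687; score: 10.819
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 792..826; score: 11.575
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 | | coord: 791..890; e-value: 5.9E-26; score: 92.9
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 | | coord: 115..268; e-value: 3.2E-23; score: 84.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 | | coord: 505..625; e-value: 9.9E-19; score: 69.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 | | coord: 626..790; e-value: 3.1E-35; score: 124.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 | | coord: 326..480; e-value: 1.4E-25; score: 92.3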
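
The `coord: start..end` values in the InterPro table above can be turned into numeric spans for length or overlap checks. The sketch below is illustrative only and assumes the coordinates are 1-based, inclusive residue positions on the predicted protein, which this record does not state explicitly; `parse_coord`, `span_length` and `overlaps` are hypothetical helper names.

```python
import re

# Matches "coord: 827..861" anywhere in an alignment cell.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: a..b' string."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError("no coordinate found in %r" % text)
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def overlaps(a, b):
    """True if two (start, end) spans share at least one position."""
    return a[0] <= b[1] and b[0] <= a[1]


# Example: one PROSITE PS51375 hit and one TIGRFAM TIGR00756 hit from the table
prosite_hit = parse_coord("coord: 827..861")
tigrfam_hit = parse_coord("coord: 829..862; e-value: 6.3E-10; score: 36.6")
motif_length = span_length(*prosite_hit)
same_repeat = overlaps(prosite_hit, tigrfam_hit)
```

Under the inclusive-coordinate assumption, the PROSITE hit at 827..861 spans 35 residues, which is consistent with the roughly 35-residue pentatricopeptide repeat motif, and it overlaps the TIGRFAM hit at 829..862, suggesting that both methods report the same repeat.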

The following gene(s) are paralogous to this gene:

None