CsaV3_4G024260 (gene) Cucumber (Chinese Long) v3

Name: CsaV3_4G024260
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v3)
Description: Pentatricopeptide repeat-containing protein
Location: chr4 : 14194415 .. 14199171 (+)
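
The location field can be split into its parts with a short script. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which the page itself does not state.

```python
import re

def parse_location(text):
    """Split a string such as 'chr4 : 14194415 .. 14199171 (+)'
    into (sequence id, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr4 : 14194415 .. 14199171 (+)")
print(seqid, strand, span_length(start, end))  # chr4 + 4757
```

Under that assumption the feature spans 4757 bp on the forward strand of chr4.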
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAAAGGCGGGTCTCACACCCAATTCTCGTACTGTTGTGGCTCTACTTTTGGCATGTGGTGAGATGTTGGAGTTGAGATTAGGACAAGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTTGATATGGATGCTTATGTTGGTACTGCTTTGGTAGGATTTTATATGAGATTTGATGCAGTACTTTCACACCGTGTTTTTAGCTTGATGTTGGTGAGAAATATAGTGAGTTGGAATGCAATAATAACCGGATTTCTTAATGTTGGAGATTGCGCAAAAGCTTTGAAGCTTTATAGTAGTATGCTGATAGAAGGTATAAAGTTTGATGCTGTTACCATGTTGGTGGTAATTCAAGCATGCGCAGAATATGGATGTCTTCGATTAGGCATGCAACTGCACCAGTTGGCTATCAAGTTCAATTTGATTAATGACTTGTTTATATTAAATGCACTATTGAATATGTATAGTGATAATGGAAGTTTGGAATCATCATGGGCGTTGTTTAATGCTGTTCCCACCTCTGATGCTGCTTTATGGAATTCCATGATATCTTCTTACATTGGTTTTGGATTTCATGCAGAAGCTATAGCTTTGTTTATTAAAATGCGTTTAGAACGCATAAAAGAAGATGTTAGAACCATTGCGATTATGTTATCTTTATGCAATGATCTAAATGATGGTTCAATATGGGGCAGAGGCTTACATGCTCATGCCATGAAAAGTGGAATAGAACTAGATGCATATCTGGGCAATGCGTTATTAAGCATGTATGTCAAGCACAATCAAATAACTGCTGCACAGTATGTTTTTGAGAAGATGAGAGGTTTGGATGTCATCTCCTGGAACACAATGATATCAGCATTTGCTCAGAGTATGTTTCGAGCCAAAGCATTTGAACTCTTTTTGATGATGTGTGAATCAGAAATCAAGTTTAATTCTTACACGATAGTATCTCTCCTCGCATTCTGTAAAGATGGAAGTGATTTAGTGTTTGGGAGATCGATCCATGGTTTTGCAATAAAAAATGGTCTCGAAATAAATACTTCTTTGAACACTTCACTAACTGAAATGTACATCAATTGTGGTGATGAAAGAGCAGCTACAAATATGTTTACTAGATGTCCTCAAAGAGATTTAGTCTCATGGAATTCCCTAATTTCAAGTTATATAAAGAATGACAATGCAGGAAAAGCTCTATTACTTTTTAACCATATGATTTCTGAGCTGGAGCCGAACTCTGTGACAATCATAAATATCCTCACATCTTGTACGCAGCTTGCTCATCTACCACTAGGACAGTGCTTGCATGCTTACACAACCAGAAGGGAAGTATCTCTTGAAATGGATGCTTCTCTGGCAAATGCTTTTATAACTATGTATGCACGATGCGGTAAATTGCAATATGCAGAAAAGATTTTTTGCACCCTGCAGACAAGAAGTATTGTCTCATGGAATGCCATGATAACAGGATATGGCATGCACGGTCGTGGACGCGATGCCACTCTAGCATTTGCACAGATGTTGGATGATGGTTTCAAGCCTAACAATGTATCTTTTGCATCTGTTTTATCTGCCTGCAGCCATTCTGGTTTGACCGTGACAGGTTTGCAACTTTTCCATTCCATGGTGCGGGATTTTGGTATTGCTCCTCAACTTACTCACTATGGTTGTATGGTTGATCTGCTTGGTCGTGGGGGCCATTTTTCTGAAGCTATAGCTTTCATCAACTCAATGCCCATTGAACCTGATGCATCAATTTGGAGAGCTTTGCTCAGTTCATGTCAGATTAAAAGCAATAACAAGCTATTGGAGACCATCTTTGGAAAGCTTGTTGAATTAGAACCAAGCAATCCAGGCAATTTTATTTTGCTTTCCAATATTTATGCAGCAGCAGGCCTTTGGTCAGAAGTTGTACAGATAAGAAAGTGGCTTCGAGAAAGAGGTCTAGGAAAGCCTCCAGGAACTAGCTGGATTGTAATTGGCAATCAGGTTCACCA
TTTCACTGCTACTGATGTATTACACCCTCAATCAGAAAGAATTTACGAAAATTTGAATTCTTTGACATCATTGATCCGAGATTTGGGCTGACATTAAGGATTTTTCTGTCATCCAGCCAGACGTGATTTTGTGCGTGTTATGAAAAGAGCTTTTCTTGAAAAAAATTAAGCCATTGTTCACTCCTTGCATTATGCTCATGATTCAATCCTGCTATTCAAAACTGAATTGAGATGTGCTTTGTGAGAGAATTGTAAAATGAAAGGGATGAAGCCCAGATCGATATTCTCTGTTCTTGTCCGATATATCTTACAAGAGATCCACTTCTTGCAGCATTCTTCTTTGTGATCTGCAGCGGTCTTTTGGTCAATTGCCTGGTTCCCCCAACTTGATCTAGATTCCTCCGAGAACCGAATCAGTACGTATACTTTGTACTGATTTTCTCATTATCCTTTCTAATGTATGAGTGCGTTGGTTTATCATGCATAAACAAACCATTCTACTTGCTGGTACAAATTATGATTAGATATATGACGTGCATTTCAAATATGTATTTCTCATGTTATTTTACTGAAAGAGCTGTAAATCACAACCATGAACCAACACCTTTTACTAAAGATTACCGCCTTTATCATCTTCATGATGGTGGTGGATGTGGATCTGCATAAATTTCTCTTGAAGATAGTAAATGTCACCTTCGTGATTTCGCTTTAGAAATTGAGAAATTATTACTCTTTCAACTGGAAAGGTGTTGACACAGAAAGTGAAACATTATTATATGTTTTTGAGTTCAACAACATATGGGATTCGTGATTTTCGAACCTATGACCTTTTAGTCAAAGGTTTAGGCCTAAACCAATAGAATCGTACCTAGATTAGCACAAAATGAGATATCCTGTGAGGTTTGATTACTGCAATTACAACAGATATGAAACGGTTGGACTAATCTCCAAAATGCCTTTTGTTTATAGAAACATATTCAATGACATTGATACTTTTAACATGGTATAAGATGTTGAAAGTATTTACATTGATTTTTTTTGTAGTAGTGGATGTAAGAATTGTTATTAGTTGGGGGAAAATAATATGAATGTGAATGACAACTAATTAGGAACAGGACCAGATTGGTCGACACTAAAGGAAAATAAATAAGATAAAAAAATCTTCTAACTCCTAGTCTGCCAATCTCCTAAAAGCTCTAGTCGTCGGAAGGCCATATCACAGTCTCCATTAGCCTCTCTCGCATCAACTCCAAGTTCTCATCAGATATCCCTTTGTATATTTCAGCATGATCACATTTCTATCCAACAACACCAAATCACAATTCAATATAATAATAATAATAAAAAAAACCCCTCAAAATACCAAGTTTGTTCTTACCTTAACCCATTTGTTATAGAGATGAAGCCTTGTTATCATCACCCTCTCAGCCAAATCTTGCTTCTCCTGGTAATCATCCAATTTGTTTCAGTACTCAATCATTTACATATCGTTCATGTAATTCAAATAGACTCTTTAGACTTCTTTTTCCTTCCCTTTTGAGTTCAACACGTGAATGCATAGGGATTTGAACCCCTAATGTGCTGGTTGATGATACTGTGTCTTAATCAAGTGAGTTATGTTTAGGTTTAGGTTTAGGTTTACAATCTTTTAGAGATTGGTATATTACCTTTACAAGAGTTCGAATGAAACGCTTCCCTTCCCCAGGCTTGTGATTTACAACAAAGCTGTTGATGTAGAAAAAGGTGAAATGCAAGATTAGATGGGGAAATGAAATAATGTTCTAAAAACACTATTTTGATTTCCACAGAATAAGCATTTACATTGGCAGTCATAATGAGTTCTTAAAAACTAAAGCAAAAGGCTTCACTTACTTATAAAACCATCTGTACTGTGTTGGGTTCATCTCATAAAGCTGATTCAAAACTGTCTTCACAGCTTTGTATGTGAAATAATTCTGTATTTGCTGCAAATAGAAAGCCTCTGGTAAGCTTAAAACAGT
TTTTTCTTCCCTTTTTAAATGCAGAATCCAAGTGCCAAGAACATTGATCATTCAGGAAGCAGGAGAAGAGACAAGTAGTTGATAACAATTCATGCCGTTTGGTAACCATTTCGTTTATTTTTTTTTTAAATTAAACCAAAAATTAGAAGACAGATCAAAGCATTTTGCATTGAAGAAGAGAGTGATATAATATCCCTACCATTTTCACATCATCAAAACTATCCTCATACTGCCCTGCAAATTCATTGACAACAACAAGCCTCCGATTCCTTCGTTGGTTCCTCTTACTCTGACGGCCACCACCGGCCGAGAGCTGCCACTCGTCACCAAGATTGACAAACGAACTTCTCAAGTCCACCGATTTAGACACGGTCAAATAACTCTTAGCCTTGGATTTCCTTTGCAAAACCAAATCCCCACCACTTCTAAGACTCATATTGGTAACCGGCGACGAGTCGAAAGACAAACACGGCGGCAGGGAGTCGACCACAGGTGCACCAACAACAAACAAAGCTCCCACCATTTTGGAAAAAAAGATTCCAAGATCACAAGCCAACCGTTGAGAGAAGGAGGATATATGAAGTGGGAATAAAGGAATTGAAAGAGAAAGAAGTGAATATAAAGGAAAAGTGGGAGAGAATAGGAGATTGGGTCAAAGATTCAGCTATCGTAAGAAACTGATCAAACTTTGAGAATTCAATTTAGAATTGTACAAGAAAGAGGATGAAAGAAAGGTTTCATATTGTTGGAAGTTTGT

mRNA sequence

ATGAAAAAGGCGGGTCTCACACCCAATTCTCGTACTGTTGTGGCTCTACTTTTGGCATGTGGTGAGATGTTGGAGTTGAGATTAGGACAAGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTTGATATGGATGCTTATGTTGGTACTGCTTTGGTAGGATTTTATATGAGATTTGATGCAGTACTTTCACACCGTGTTTTTAGCTTGATGTTGGTGAGAAATATAGTGAGTTGGAATGCAATAATAACCGGATTTCTTAATGTTGGAGATTGCGCAAAAGCTTTGAAGCTTTATAGTAGTATGCTGATAGAAGGTATAAAGTTTGATGCTGTTACCATGTTGGTGGTAATTCAAGCATGCGCAGAATATGGATGTCTTCGATTAGGCATGCAACTGCACCAGTTGGCTATCAAGTTCAATTTGATTAATGACTTGTTTATATTAAATGCACTATTGAATATGTATAGTGATAATGGAAGTTTGGAATCATCATGGGCGTTGTTTAATGCTGTTCCCACCTCTGATGCTGCTTTATGGAATTCCATGATATCTTCTTACATTGGTTTTGGATTTCATGCAGAAGCTATAGCTTTGTTTATTAAAATGCGTTTAGAACGCATAAAAGAAGATGTTAGAACCATTGCGATTATGTTATCTTTATGCAATGATCTAAATGATGGTTCAATATGGGGCAGAGGCTTACATGCTCATGCCATGAAAAGTGGAATAGAACTAGATGCATATCTGGGCAATGCGTTATTAAGCATGTATGTCAAGCACAATCAAATAACTGCTGCACAGTATGTTTTTGAGAAGATGAGAGGTTTGGATGTCATCTCCTGGAACACAATGATATCAGCATTTGCTCAGAGTATGTTTCGAGCCAAAGCATTTGAACTCTTTTTGATGATGTGTGAATCAGAAATCAAGTTTAATTCTTACACGATAGTATCTCTCCTCGCATTCTGTAAAGATGGAAGTGATTTAGTGTTTGGGAGATCGATCCATGGTTTTGCAATAAAAAATGGTCTCGAAATAAATACTTCTTTGAACACTTCACTAACTGAAATGTACATCAATTGTGGTGATGAAAGAGCAGCTACAAATATGTTTACTAGATGTCCTCAAAGAGATTTAGTCTCATGGAATTCCCTAATTTCAAGTTATATAAAGAATGACAATGCAGGAAAAGCTCTATTACTTTTTAACCATATGATTTCTGAGCTGGAGCCGAACTCTGTGACAATCATAAATATCCTCACATCTTGTACGCAGCTTGCTCATCTACCACTAGGACAGTGCTTGCATGCTTACACAACCAGAAGGGAAGTATCTCTTGAAATGGATGCTTCTCTGGCAAATGCTTTTATAACTATGTATGCACGATGCGGTAAATTGCAATATGCAGAAAAGATTTTTTGCACCCTGCAGACAAGAAGTATTGTCTCATGGAATGCCATGATAACAGGATATGGCATGCACGGTCGTGGACGCGATGCCACTCTAGCATTTGCACAGATGTTGGATGATGGTTTCAAGCCTAACAATGTATCTTTTGCATCTGTTTTATCTGCCTGCAGCCATTCTGGTTTGACCGTGACAGGTTTGCAACTTTTCCATTCCATGGTGCGGGATTTTGGTATTGCTCCTCAACTTACTCACTATGGTTGTATGGTTGATCTGCTTGGTCGTGGGGGCCATTTTTCTGAAGCTATAGCTTTCATCAACTCAATGCCCATTGAACCTGATGCATCAATTTGGAGAGCTTTGCTCAGTTCATGTCAGATTAAAAGCAATAACAAGCTATTGGAGACCATCTTTGGAAAGCTTGTTGAATTAGAACCAAGCAATCCAGGCAATTTTATTTTGCTTTCCAATATTTATGCAGCAGCAGGCCTTTGGTCAGAAGTTGTACAGATAAGAAAGTGGCTTCGAGAAAGAGGTCTAGGAAAGCCTCCAGGAACTAGCTGGATTGTAATTGGCAATCAGGTTCACCA
TTTCACTGCTACTGATGTATTACACCCTCAATCAGAAAGAATTTACGAAAATTTGAATTCTTTGACATCATTGATCCGAGATTTGGGCTGA

Coding sequence (CDS)

ATGAAAAAGGCGGGTCTCACACCCAATTCTCGTACTGTTGTGGCTCTACTTTTGGCATGTGGTGAGATGTTGGAGTTGAGATTAGGACAAGAGATTCATGGTTATTGTTTGAGAAATGGGTTGTTTGATATGGATGCTTATGTTGGTACTGCTTTGGTAGGATTTTATATGAGATTTGATGCAGTACTTTCACACCGTGTTTTTAGCTTGATGTTGGTGAGAAATATAGTGAGTTGGAATGCAATAATAACCGGATTTCTTAATGTTGGAGATTGCGCAAAAGCTTTGAAGCTTTATAGTAGTATGCTGATAGAAGGTATAAAGTTTGATGCTGTTACCATGTTGGTGGTAATTCAAGCATGCGCAGAATATGGATGTCTTCGATTAGGCATGCAACTGCACCAGTTGGCTATCAAGTTCAATTTGATTAATGACTTGTTTATATTAAATGCACTATTGAATATGTATAGTGATAATGGAAGTTTGGAATCATCATGGGCGTTGTTTAATGCTGTTCCCACCTCTGATGCTGCTTTATGGAATTCCATGATATCTTCTTACATTGGTTTTGGATTTCATGCAGAAGCTATAGCTTTGTTTATTAAAATGCGTTTAGAACGCATAAAAGAAGATGTTAGAACCATTGCGATTATGTTATCTTTATGCAATGATCTAAATGATGGTTCAATATGGGGCAGAGGCTTACATGCTCATGCCATGAAAAGTGGAATAGAACTAGATGCATATCTGGGCAATGCGTTATTAAGCATGTATGTCAAGCACAATCAAATAACTGCTGCACAGTATGTTTTTGAGAAGATGAGAGGTTTGGATGTCATCTCCTGGAACACAATGATATCAGCATTTGCTCAGAGTATGTTTCGAGCCAAAGCATTTGAACTCTTTTTGATGATGTGTGAATCAGAAATCAAGTTTAATTCTTACACGATAGTATCTCTCCTCGCATTCTGTAAAGATGGAAGTGATTTAGTGTTTGGGAGATCGATCCATGGTTTTGCAATAAAAAATGGTCTCGAAATAAATACTTCTTTGAACACTTCACTAACTGAAATGTACATCAATTGTGGTGATGAAAGAGCAGCTACAAATATGTTTACTAGATGTCCTCAAAGAGATTTAGTCTCATGGAATTCCCTAATTTCAAGTTATATAAAGAATGACAATGCAGGAAAAGCTCTATTACTTTTTAACCATATGATTTCTGAGCTGGAGCCGAACTCTGTGACAATCATAAATATCCTCACATCTTGTACGCAGCTTGCTCATCTACCACTAGGACAGTGCTTGCATGCTTACACAACCAGAAGGGAAGTATCTCTTGAAATGGATGCTTCTCTGGCAAATGCTTTTATAACTATGTATGCACGATGCGGTAAATTGCAATATGCAGAAAAGATTTTTTGCACCCTGCAGACAAGAAGTATTGTCTCATGGAATGCCATGATAACAGGATATGGCATGCACGGTCGTGGACGCGATGCCACTCTAGCATTTGCACAGATGTTGGATGATGGTTTCAAGCCTAACAATGTATCTTTTGCATCTGTTTTATCTGCCTGCAGCCATTCTGGTTTGACCGTGACAGGTTTGCAACTTTTCCATTCCATGGTGCGGGATTTTGGTATTGCTCCTCAACTTACTCACTATGGTTGTATGGTTGATCTGCTTGGTCGTGGGGGCCATTTTTCTGAAGCTATAGCTTTCATCAACTCAATGCCCATTGAACCTGATGCATCAATTTGGAGAGCTTTGCTCAGTTCATGTCAGATTAAAAGCAATAACAAGCTATTGGAGACCATCTTTGGAAAGCTTGTTGAATTAGAACCAAGCAATCCAGGCAATTTTATTTTGCTTTCCAATATTTATGCAGCAGCAGGCCTTTGGTCAGAAGTTGTACAGATAAGAAAGTGGCTTCGAGAAAGAGGTCTAGGAAAGCCTCCAGGAACTAGCTGGATTGTAATTGGCAATCAGGTTCACCA
TTTCACTGCTACTGATGTATTACACCCTCAATCAGAAAGAATTTACGAAAATTTGAATTCTTTGACATCATTGATCCGAGATTTGGGCTGA

Protein sequence

MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDAVLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQACAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWNSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMKSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFELFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINILTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRSIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFHSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNNKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVIGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG
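
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below is illustrative only: it assumes the standard nuclear genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any tool used to build this record.

```python
# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G
# at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Only A, C, G and T are handled; any other symbol raises KeyError.
    Trailing bases that do not form a full codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGAAAAAGGCGGGTCTCACACCC"))  # MKKAGLTP
```

Applied to the last twelve bases of the CDS (`GATTTGGGCTGA`), the same function returns `DLG`, which matches the end of the protein sequence followed by a TGA stop codon.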
BLAST of CsaV3_4G024260 vs. NCBI nr
Match: XP_004142223.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580 [Cucumis sativus])

HSP 1 Score: 1388.6 bits (3593), Expect = 0.0e+00
Identity = 695/695 (100.00%), Positives = 695/695 (100.00%), Query Frame = 0

Query: 2   KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDA 61
           KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDA
Sbjct: 153 KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDA 212

Query: 62  VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQAC 121
           VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQAC
Sbjct: 213 VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQAC 272

Query: 122 AEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWN 181
           AEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWN
Sbjct: 273 AEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWN 332

Query: 182 SMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMK 241
           SMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMK
Sbjct: 333 SMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMK 392

Query: 242 SGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFEL 301
           SGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFEL
Sbjct: 393 SGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFEL 452

Query: 302 FLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 361
           FLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN
Sbjct: 453 FLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 512

Query: 362 CGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINIL 421
           CGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINIL
Sbjct: 513 CGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINIL 572

Query: 422 TSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRS 481
           TSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRS
Sbjct: 573 TSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRS 632

Query: 482 IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFH 541
           IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFH
Sbjct: 633 IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFH 692

Query: 542 SMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNN 601
           SMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNN
Sbjct: 693 SMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNN 752

Query: 602 KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVI 661
           KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVI
Sbjct: 753 KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVI 812

Query: 662 GNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           GNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG
Sbjct: 813 GNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 847
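
The percentages in the HSP summary lines are the ratio of matching columns to alignment length. As a rough sketch (the helper names `fractions` and `percent` are hypothetical, and the two-decimal rounding is an assumption based on how the values are printed on this page), they can be recomputed from the counts like this:

```python
import re

def fractions(line):
    """Return every 'matches/length' pair found in an HSP summary line."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)/(\d+)", line)]

def percent(matches, length):
    """Ratio of matches to alignment length as a percentage, two decimals."""
    return f"{100 * matches / length:.2f}"

# Counts taken from the first hit reported on this page.
for matches, length in fractions("Identity = 695/695 (100.00%)"):
    print(matches, length, percent(matches, length))  # 695 695 100.00
```

The same calculation reproduces the values shown for the other hits, for example 655/696 gives 94.11 and 243/713 gives 34.08.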

BLAST of CsaV3_4G024260 vs. NCBI nr
Match: XP_016900722.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isoform X2 [Cucumis melo])

HSP 1 Score: 1313.1 bits (3397), Expect = 0.0e+00
Identity = 655/696 (94.11%), Positives = 675/696 (96.98%), Query Frame = 0

Query: 1   MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFD 60
           MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFY+RFD
Sbjct: 32  MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYLRFD 91

Query: 61  AVLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQA 120
           AVLSHRVFSLM+VRNIVSWNAIITGFLNVGD  KALKL+SSMLIEGIKFDAVTMLVVIQA
Sbjct: 92  AVLSHRVFSLMVVRNIVSWNAIITGFLNVGDYTKALKLFSSMLIEGIKFDAVTMLVVIQA 151

Query: 121 CAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALW 180
           CAEYGCLRLGMQLHQLAIKFNLIND+F+LNALLNMYSDNGSLESS  LFNAVPTSDAALW
Sbjct: 152 CAEYGCLRLGMQLHQLAIKFNLINDVFVLNALLNMYSDNGSLESSCVLFNAVPTSDAALW 211

Query: 181 NSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAM 240
           NSMIS YIGFGFHAEAIALFIKMRLERIKEDVRTI IMLSLCNDLNDGS+WGRGLHAHAM
Sbjct: 212 NSMISCYIGFGFHAEAIALFIKMRLERIKEDVRTIVIMLSLCNDLNDGSLWGRGLHAHAM 271

Query: 241 KSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFE 300
           KSGIELDA+LGNALLSMYVKHNQI AAQ VFEK RGLDVISWNTMISA AQSMFRAKAFE
Sbjct: 272 KSGIELDAFLGNALLSMYVKHNQINAAQNVFEKTRGLDVISWNTMISALAQSMFRAKAFE 331

Query: 301 LFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI 360
           LF MMCESEIKFNSYTI+SLLA CKDG+DLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI
Sbjct: 332 LFFMMCESEIKFNSYTIISLLALCKDGNDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI 391

Query: 361 NCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINI 420
           NCGDERAA +MFTRCPQRDL+SWNSLI SYIKNDNAGKALLLFNHMISELEPNSVTIINI
Sbjct: 392 NCGDERAAIDMFTRCPQRDLISWNSLILSYIKNDNAGKALLLFNHMISELEPNSVTIINI 451

Query: 421 LTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTR 480
           LTSCTQLAHLPLGQCLHAY TRRE SLEMDASLANAFITMYARCGK+QYAE+IF TLQTR
Sbjct: 452 LTSCTQLAHLPLGQCLHAYATRREESLEMDASLANAFITMYARCGKMQYAEQIFRTLQTR 511

Query: 481 SIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLF 540
           +IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLT TGL LF
Sbjct: 512 NIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTETGLLLF 571

Query: 541 HSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSN 600
           HSMVRDFG+APQLTHYGCMVDLLGRGGHFSEAIAFIN+MPIEPDASIWRALLSS QIKSN
Sbjct: 572 HSMVRDFGLAPQLTHYGCMVDLLGRGGHFSEAIAFINTMPIEPDASIWRALLSSWQIKSN 631

Query: 601 NKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIV 660
            KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIV
Sbjct: 632 KKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIV 691

Query: 661 IGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           IGNQVH+FTATDVLHPQSE+IYENLNSLTSLI+D+G
Sbjct: 692 IGNQVHYFTATDVLHPQSEKIYENLNSLTSLIQDMG 727

BLAST of CsaV3_4G024260 vs. NCBI nr
Match: XP_008449715.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like isoform X1 [Cucumis melo])

HSP 1 Score: 1311.2 bits (3392), Expect = 0.0e+00
Identity = 654/695 (94.10%), Positives = 674/695 (96.98%), Query Frame = 0

Query: 2   KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDA 61
           KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFY+RFDA
Sbjct: 153 KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYLRFDA 212

Query: 62  VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQAC 121
           VLSHRVFSLM+VRNIVSWNAIITGFLNVGD  KALKL+SSMLIEGIKFDAVTMLVVIQAC
Sbjct: 213 VLSHRVFSLMVVRNIVSWNAIITGFLNVGDYTKALKLFSSMLIEGIKFDAVTMLVVIQAC 272

Query: 122 AEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWN 181
           AEYGCLRLGMQLHQLAIKFNLIND+F+LNALLNMYSDNGSLESS  LFNAVPTSDAALWN
Sbjct: 273 AEYGCLRLGMQLHQLAIKFNLINDVFVLNALLNMYSDNGSLESSCVLFNAVPTSDAALWN 332

Query: 182 SMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMK 241
           SMIS YIGFGFHAEAIALFIKMRLERIKEDVRTI IMLSLCNDLNDGS+WGRGLHAHAMK
Sbjct: 333 SMISCYIGFGFHAEAIALFIKMRLERIKEDVRTIVIMLSLCNDLNDGSLWGRGLHAHAMK 392

Query: 242 SGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFEL 301
           SGIELDA+LGNALLSMYVKHNQI AAQ VFEK RGLDVISWNTMISA AQSMFRAKAFEL
Sbjct: 393 SGIELDAFLGNALLSMYVKHNQINAAQNVFEKTRGLDVISWNTMISALAQSMFRAKAFEL 452

Query: 302 FLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 361
           F MMCESEIKFNSYTI+SLLA CKDG+DLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN
Sbjct: 453 FFMMCESEIKFNSYTIISLLALCKDGNDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 512

Query: 362 CGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINIL 421
           CGDERAA +MFTRCPQRDL+SWNSLI SYIKNDNAGKALLLFNHMISELEPNSVTIINIL
Sbjct: 513 CGDERAAIDMFTRCPQRDLISWNSLILSYIKNDNAGKALLLFNHMISELEPNSVTIINIL 572

Query: 422 TSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRS 481
           TSCTQLAHLPLGQCLHAY TRRE SLEMDASLANAFITMYARCGK+QYAE+IF TLQTR+
Sbjct: 573 TSCTQLAHLPLGQCLHAYATRREESLEMDASLANAFITMYARCGKMQYAEQIFRTLQTRN 632

Query: 482 IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFH 541
           IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLT TGL LFH
Sbjct: 633 IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTETGLLLFH 692

Query: 542 SMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNN 601
           SMVRDFG+APQLTHYGCMVDLLGRGGHFSEAIAFIN+MPIEPDASIWRALLSS QIKSN 
Sbjct: 693 SMVRDFGLAPQLTHYGCMVDLLGRGGHFSEAIAFINTMPIEPDASIWRALLSSWQIKSNK 752

Query: 602 KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVI 661
           KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVI
Sbjct: 753 KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVI 812

Query: 662 GNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           GNQVH+FTATDVLHPQSE+IYENLNSLTSLI+D+G
Sbjct: 813 GNQVHYFTATDVLHPQSEKIYENLNSLTSLIQDMG 847

BLAST of CsaV3_4G024260 vs. NCBI nr
Match: XP_023512048.1 (pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023512049.1 pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo] >XP_023512050.1 pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1221.8 bits (3160), Expect = 0.0e+00
Identity = 600/696 (86.21%), Positives = 650/696 (93.39%), Query Frame = 0

Query: 1   MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFD 60
           M+KAGLTPNSRTVV LLLAC EMLELRLG EIHGYCLRNGLFDMDA+VGTAL+GFYMRFD
Sbjct: 150 MQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFD 209

Query: 61  AVLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQA 120
           A +SHRVFSLM VRN+VSWNA+ITG+LN+GD  KALKL+SSML EGIKFDAVTML+VIQA
Sbjct: 210 AAVSHRVFSLMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQA 269

Query: 121 CAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALW 180
           CAE   L+LGMQLHQLAIKFN ++DLF+LNALLNMYSDNG LESS ALFNAVPTSDAALW
Sbjct: 270 CAESESLQLGMQLHQLAIKFNFVDDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALW 329

Query: 181 NSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAM 240
           NSMIS+YI FGFHAEAIAL+IKMRLE +KED RT+AIMLSLC DLNDGSIWGRGLHAHAM
Sbjct: 330 NSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVAIMLSLCEDLNDGSIWGRGLHAHAM 389

Query: 241 KSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFE 300
           KSG+ELD +LGNALLSMYV+HNQI AAQ +F+KMRGLDVISWNTMI A AQS FRAKAF+
Sbjct: 390 KSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISWNTMILALAQSKFRAKAFQ 449

Query: 301 LFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI 360
           LF+ MCESEIKFNSYT++SLLA CKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI
Sbjct: 450 LFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI 509

Query: 361 NCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINI 420
           NC DE +ATN+F RCPQRDL+SWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTII+I
Sbjct: 510 NCSDEGSATNLFIRCPQRDLISWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIISI 569

Query: 421 LTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTR 480
           LTSCTQLAHLPLGQCLHAYTTRR  S E+DASLANAFITMYARCGK+QYAEKIF TLQ R
Sbjct: 570 LTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITMYARCGKMQYAEKIFNTLQAR 629

Query: 481 SIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLF 540
           +IVSWNAMITGYGMHGRG DATLAFAQMLDDGFKPNN+SF SVLSACSHSGLT TGLQLF
Sbjct: 630 NIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISFVSVLSACSHSGLTKTGLQLF 689

Query: 541 HSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSN 600
            SMVRDFGIAPQL HYGC+VDLLGRGGHF+EAIA I+SMP+EPDASIWRALLSSCQ+KSN
Sbjct: 690 SSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMPVEPDASIWRALLSSCQVKSN 749

Query: 601 NKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIV 660
            KL+ETIF KLVELEPSNPGNF+LLSN+YAAAGLWSEV QIRKW+R++GL KPPGTSWIV
Sbjct: 750 KKLVETIFRKLVELEPSNPGNFVLLSNVYAAAGLWSEVSQIRKWVRDKGLVKPPGTSWIV 809

Query: 661 IGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           IG+QVH+FTATDV HPQSE IYENLNSLTSLI+D+G
Sbjct: 810 IGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 845

BLAST of CsaV3_4G024260 vs. NCBI nr
Match: XP_022944564.1 (pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like [Cucurbita moschata] >XP_022944565.1 pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1212.6 bits (3136), Expect = 0.0e+00
Identity = 600/696 (86.21%), Positives = 646/696 (92.82%), Query Frame = 0

Query: 1   MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFD 60
           M+KAGLTPNSRTVV LLLAC EMLELRLG EIHGYCLRNGLFDMDA+VGTAL+GFYMRFD
Sbjct: 150 MQKAGLTPNSRTVVPLLLACAEMLELRLGHEIHGYCLRNGLFDMDAHVGTALIGFYMRFD 209

Query: 61  AVLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQA 120
           A +SHRVFS M VRN+VSWNA+ITG+LN+GD  KALKL+SSML EGIKFDAVTML+VIQA
Sbjct: 210 ATVSHRVFSSMEVRNVVSWNAMITGYLNIGDYTKALKLFSSMLTEGIKFDAVTMLLVIQA 269

Query: 121 CAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALW 180
           CAE   L+LGMQLHQLAIKFN I DLF+LNALLNMYSDNG LESS ALFNAVPTSDAALW
Sbjct: 270 CAESESLQLGMQLHQLAIKFNFIGDLFVLNALLNMYSDNGRLESSCALFNAVPTSDAALW 329

Query: 181 NSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAM 240
           NSMIS+YI FGFHAEAIAL+IKMRLE +KED RT+ IMLSLC DLNDGSIWGRGLHAHAM
Sbjct: 330 NSMISAYIAFGFHAEAIALYIKMRLEGLKEDKRTVEIMLSLCEDLNDGSIWGRGLHAHAM 389

Query: 241 KSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFE 300
           KSG+ELD +LGNALLSMYV+HNQI AAQ +F+KMRGLDVIS NTMI A A+S FRAKAFE
Sbjct: 390 KSGMELDVFLGNALLSMYVEHNQIDAAQKLFDKMRGLDVISCNTMILALARSKFRAKAFE 449

Query: 301 LFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI 360
           LF+ MCESEIKFNSYT++SLLA CKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI
Sbjct: 450 LFMTMCESEIKFNSYTMISLLALCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI 509

Query: 361 NCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINI 420
           NC DE +ATN+F RCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTII+I
Sbjct: 510 NCRDEGSATNLFIRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIISI 569

Query: 421 LTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTR 480
           LTSCTQLAHLPLGQCLHAYTTRR  S E+DASLANAFITMYARCGK+QYAEKIF TL+ R
Sbjct: 570 LTSCTQLAHLPLGQCLHAYTTRRGESFELDASLANAFITMYARCGKMQYAEKIFSTLKAR 629

Query: 481 SIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLF 540
           +IVSWNAMITGYGMHGRG DATLAFAQMLDDGFKPNN+SF SVLSACSHSGLT TGLQLF
Sbjct: 630 NIVSWNAMITGYGMHGRGHDATLAFAQMLDDGFKPNNISFVSVLSACSHSGLTKTGLQLF 689

Query: 541 HSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSN 600
            SMVRDFGIAPQL HYGC+VDLLGRGGHF+EAIA I+SMP+EPDASIWRALLSSCQ+KSN
Sbjct: 690 SSMVRDFGIAPQLAHYGCIVDLLGRGGHFAEAIALISSMPVEPDASIWRALLSSCQVKSN 749

Query: 601 NKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIV 660
            KL+ETIF KLVELEPSNPGNF+LLSNIYAAAGLWSEV QIRKWLR++GL KPPGTSWIV
Sbjct: 750 KKLVETIFRKLVELEPSNPGNFVLLSNIYAAAGLWSEVSQIRKWLRDKGLVKPPGTSWIV 809

Query: 661 IGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           IG+QVH+FTATDV HPQSE IYENLNSLTSLI+D+G
Sbjct: 810 IGSQVHYFTATDVSHPQSEEIYENLNSLTSLIQDMG 845

BLAST of CsaV3_4G024260 vs. TAIR10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 420.2 bits (1079), Expect = 2.4e-117
Identity = 243/713 (34.08%), Positives = 402/713 (56.38%), Query Frame = 0

Query: 5   GLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMR---FDA 64
           G+ P++    ALL A  ++ ++ LG++IH +  + G       V   LV  Y +   F A
Sbjct: 92  GIKPDNYAFPALLKAVADLQDMELGKQIHAHVYKFGYGVDSVTVANTLVNLYRKCGDFGA 151

Query: 65  VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQAC 124
           V  ++VF  +  RN VSWN++I+   +      AL+ +  ML E ++  + T++ V+ AC
Sbjct: 152 V--YKVFDRISERNQVSWNSLISSLCSFEKWEMALEAFRCMLDENVEPSSFTLVSVVTAC 211

Query: 125 AEYGC---LRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAA 184
           +       L +G Q+H   ++   +N  FI+N L+ MY   G L SS  L  +    D  
Sbjct: 212 SNLPMPEGLMMGKQVHAYGLRKGELNS-FIINTLVAMYGKLGKLASSKVLLGSFGGRDLV 271

Query: 185 LWNSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAH 244
            WN+++SS        EA+    +M LE ++ D  TI+ +L  C+ L +    G+ LHA+
Sbjct: 272 TWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEFTISSVLPACSHL-EMLRTGKELHAY 331

Query: 245 AMKSG-IELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAK 304
           A+K+G ++ ++++G+AL+ MY    Q+ + + VF+ M    +  WN MI+ ++Q+    +
Sbjct: 332 ALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKE 391

Query: 305 AFELFLMMCESE-IKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLT 364
           A  LF+ M ES  +  NS T+  ++  C          +IHGF +K GL+ +  +  +L 
Sbjct: 392 ALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRDRFVQNTLM 451

Query: 365 EMYINCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMIS-------- 424
           +MY   G    A  +F +   RDLV+WN++I+ Y+ +++   ALLL + M +        
Sbjct: 452 DMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVSKG 511

Query: 425 ----ELEPNSVTIINILTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARC 484
                L+PNS+T++ IL SC  L+ L  G+ +HAY  +   +L  D ++ +A + MYA+C
Sbjct: 512 ASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKN--NLATDVAVGSALVDMYAKC 571

Query: 485 GKLQYAEKIFCTLQTRSIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVL 544
           G LQ + K+F  +  +++++WN +I  YGMHG G++A      M+  G KPN V+F SV 
Sbjct: 572 GCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVF 631

Query: 545 SACSHSGLTVTGLQLFHSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIE-P 604
           +ACSHSG+   GL++F+ M  D+G+ P   HY C+VDLLGR G   EA   +N MP +  
Sbjct: 632 AACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFN 691

Query: 605 DASIWRALLSSCQIKSNNKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRK 664
            A  W +LL + +I +N ++ E     L++LEP+   +++LL+NIY++AGLW +  ++R+
Sbjct: 692 KAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRR 751

Query: 665 WLRERGLGKPPGTSWIVIGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
            ++E+G+ K PG SWI  G++VH F A D  HPQSE++   L +L   +R  G
Sbjct: 752 NMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEG 798

BLAST of CsaV3_4G024260 vs. TAIR10
Match: AT4G13650.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 404.8 bits (1039), Expect = 1.1e-112
Identity = 226/694 (32.56%), Positives = 373/694 (53.75%), Query Frame = 0

Query: 5   GLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDAVLS 64
           G+ P      ++L AC ++  L +G+++HG  L+ G F  D YV  ALV  Y     ++S
Sbjct: 283 GIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLG-FSSDTYVCNALVSLYFHLGNLIS 342

Query: 65  -HRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQACAE 124
              +FS M  R+ V++N +I G    G   KA++L+  M ++G++ D+ T+  ++ AC+ 
Sbjct: 343 AEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSA 402

Query: 125 YGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWNSM 184
            G L  G QLH    K    ++  I  ALLN+Y+    +E++   F      +  LWN M
Sbjct: 403 DGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVM 462

Query: 185 ISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMKSG 244
           + +Y        +  +F +M++E I  +  T   +L  C  L D  + G  +H+  +K+ 
Sbjct: 463 LVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLEL-GEQIHSQIIKTN 522

Query: 245 IELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFELFL 304
            +L+AY+ + L+ MY K  ++  A  +  +  G DV+SW TMI+ + Q  F  KA   F 
Sbjct: 523 FQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFR 582

Query: 305 MMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCG 364
            M +  I+ +   + + ++ C     L  G+ IH  A  +G   +     +L  +Y  CG
Sbjct: 583 QMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCG 642

Query: 365 DERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISE-LEPNSVTIINILT 424
               +   F +    D ++WN+L+S + ++ N  +AL +F  M  E ++ N+ T  + + 
Sbjct: 643 KIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVK 702

Query: 425 SCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRSI 484
           + ++ A++  G+ +HA  T+     + +  + NA I+MYA+CG +  AEK F  + T++ 
Sbjct: 703 AASETANMKQGKQVHAVITK--TGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNE 762

Query: 485 VSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFHS 544
           VSWNA+I  Y  HG G +A  +F QM+    +PN+V+   VLSACSH GL   G+  F S
Sbjct: 763 VSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFES 822

Query: 545 MVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNNK 604
           M  ++G++P+  HY C+VD+L R G  S A  FI  MPI+PDA +WR LLS+C +  N +
Sbjct: 823 MNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNME 882

Query: 605 LLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVIG 664
           + E     L+ELEP +   ++LLSN+YA +  W      R+ ++E+G+ K PG SWI + 
Sbjct: 883 IGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVK 942

Query: 665 NQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           N +H F   D  HP ++ I+E    LT    ++G
Sbjct: 943 NSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIG 972

BLAST of CsaV3_4G024260 vs. TAIR10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 394.4 bits (1012), Expect = 1.4e-109
Identity = 229/683 (33.53%), Positives = 366/683 (53.59%), Query Frame = 0

Query: 16  LLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDAV-LSHRVFSLMLVR 75
           LL  C  + ELR   +I     +NGL+  + +  T LV  + R+ +V  + RVF  +  +
Sbjct: 43  LLERCSSLKELR---QILPLVFKNGLY-QEHFFQTKLVSLFCRYGSVDEAARVFEPIDSK 102

Query: 76  NIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQACAEYGCLRLGMQLH 135
             V ++ ++ GF  V D  KAL+ +  M  + ++        +++ C +   LR+G ++H
Sbjct: 103 LNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIH 162

Query: 136 QLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWNSMISSYIGFGFHA 195
            L +K     DLF +  L NMY+    +  +  +F+ +P  D   WN++++ Y   G   
Sbjct: 163 GLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMAR 222

Query: 196 EAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMKSGIELDAYLGNAL 255
            A+ +   M  E +K    TI  +L   + L   S+ G+ +H +AM+SG +    +  AL
Sbjct: 223 MALEMVKSMCEENLKPSFITIVSVLPAVSALRLISV-GKEIHGYAMRSGFDSLVNISTAL 282

Query: 256 LSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFELFLMMCESEIKFNS 315
           + MY K   +  A+ +F+ M   +V+SWN+MI A+ Q+    +A  +F  M +  +K   
Sbjct: 283 VDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTD 342

Query: 316 YTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCGDERAATNMFTR 375
            +++  L  C D  DL  GR IH  +++ GL+ N S+  SL  MY  C +   A +MF +
Sbjct: 343 VSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGK 402

Query: 376 CPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISE-LEPNSVTIINILTSCTQLAHLPLG 435
              R LVSWN++I  + +N     AL  F+ M S  ++P++ T ++++T+  +L+     
Sbjct: 403 LQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHA 462

Query: 436 QCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRSIVSWNAMITGYG 495
           + +H    R    L+ +  +  A + MYA+CG +  A  IF  +  R + +WNAMI GYG
Sbjct: 463 KWIHGVVMRS--CLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYG 522

Query: 496 MHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFHSMVRDFGIAPQL 555
            HG G+ A   F +M     KPN V+F SV+SACSHSGL   GL+ F+ M  ++ I   +
Sbjct: 523 THGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSM 582

Query: 556 THYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNNKLLETIFGKLVE 615
            HYG MVDLLGR G  +EA  FI  MP++P  +++ A+L +CQI  N    E    +L E
Sbjct: 583 DHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFE 642

Query: 616 LEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVIGNQVHHFTATDV 675
           L P + G  +LL+NIY AA +W +V Q+R  +  +GL K PG S + I N+VH F +   
Sbjct: 643 LNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGST 702

Query: 676 LHPQSERIYENLNSLTSLIRDLG 697
            HP S++IY  L  L   I++ G
Sbjct: 703 AHPDSKKIYAFLEKLICHIKEAG 718

BLAST of CsaV3_4G024260 vs. TAIR10
Match: AT1G15510.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 393.3 bits (1009), Expect = 3.2e-109
Identity = 223/698 (31.95%), Positives = 366/698 (52.44%), Query Frame = 0

Query: 1   MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRF- 60
           M++  +  +    VAL+  C        G +++   L + +  +   +G A +  ++RF 
Sbjct: 85  MQELRVAVDEDVFVALVRLCEWKRAQEEGSKVYSIAL-SSMSSLGVELGNAFLAMFVRFG 144

Query: 61  DAVLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSML-IEGIKFDAVTMLVVI 120
           + V +  VF  M  RN+ SWN ++ G+   G   +A+ LY  ML + G+K D  T   V+
Sbjct: 145 NLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRMLWVGGVKPDVYTFPCVL 204

Query: 121 QACAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAA 180
           + C     L  G ++H   +++    D+ ++NAL+ MY   G ++S+  LF+ +P  D  
Sbjct: 205 RTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDII 264

Query: 181 LWNSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAH 240
            WN+MIS Y   G   E + LF  MR   +  D+ T+  ++S C  L D  + GR +HA+
Sbjct: 265 SWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACELLGDRRL-GRDIHAY 324

Query: 241 AMKSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKA 300
            + +G  +D  + N+L  MY+       A+ +F +M   D++SW TMIS +  +    KA
Sbjct: 325 VITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKA 384

Query: 301 FELFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEM 360
            + + MM +  +K +  T+ ++L+ C    DL  G  +H  AIK  L     +  +L  M
Sbjct: 385 IDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLINM 444

Query: 361 YINCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTII 420
           Y  C     A ++F   P+++++SW S+I+    N+   +AL+    M   L+PN++T+ 
Sbjct: 445 YSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQMKMTLQPNAITLT 504

Query: 421 NILTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQ 480
             L +C ++  L  G+ +HA+  R  V L  D  L NA + MY RCG++  A   F   Q
Sbjct: 505 AALAACARIGALMCGKEIHAHVLRTGVGL--DDFLPNALLDMYVRCGRMNTAWSQF-NSQ 564

Query: 481 TRSIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQ 540
            + + SWN ++TGY   G+G      F +M+    +P+ ++F S+L  CS S +   GL 
Sbjct: 565 KKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISLLCGCSKSQMVRQGLM 624

Query: 541 LFHSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIK 600
            F S + D+G+ P L HY C+VDLLGR G   EA  FI  MP+ PD ++W ALL++C+I 
Sbjct: 625 YF-SKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPAVWGALLNACRIH 684

Query: 601 SNNKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSW 660
               L E     + EL+  + G +ILL N+YA  G W EV ++R+ ++E GL    G SW
Sbjct: 685 HKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMMKENGLTVDAGCSW 744

Query: 661 IVIGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           + +  +VH F + D  HPQ++ I   L      + ++G
Sbjct: 745 VEVKGKVHAFLSDDKYHPQTKEINTVLEGFYEKMSEVG 776

BLAST of CsaV3_4G024260 vs. TAIR10
Match: AT3G02330.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 390.6 bits (1002), Expect = 2.1e-108
Identity = 241/713 (33.80%), Positives = 370/713 (51.89%), Query Frame = 0

Query: 1   MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFD 60
           M + G+  + RT   +L  C  + +  LG +IHG  +R G  D D    +AL+  Y +  
Sbjct: 171 MGREGIEFDGRTFAIILKVCSFLEDTSLGMQIHGIVVRVGC-DTDVVAASALLDMYAKGK 230

Query: 61  A-VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQ 120
             V S RVF  +  +N VSW+AII G +     + ALK +  M              V++
Sbjct: 231 RFVESLRVFQGIPEKNSVSWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQSIYASVLR 290

Query: 121 ACAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAAL 180
           +CA    LRLG QLH  A+K +   D  +  A L+MY+   +++ +  LF+     +   
Sbjct: 291 SCAALSELRLGGQLHAHALKSDFAADGIVRTATLDMYAKCDNMQDAQILFDNSENLNRQS 350

Query: 181 WNSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHA 240
           +N+MI+ Y       +A+ LF ++    +  D  +++ +   C  L  G   G  ++  A
Sbjct: 351 YNAMITGYSQEEHGFKALLLFHRLMSSGLGFDEISLSGVFRAC-ALVKGLSEGLQIYGLA 410

Query: 241 MKSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAF 300
           +KS + LD  + NA + MY K   +  A  VF++MR  D +SWN +I+A  Q+    +  
Sbjct: 411 IKSSLSLDVCVANAAIDMYGKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETL 470

Query: 301 ELFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMY 360
            LF+ M  S I+ + +T  S+L  C  GS L +G  IH   +K+G+  N+S+  SL +MY
Sbjct: 471 FLFVSMLRSRIEPDEFTFGSILKACTGGS-LGYGMEIHSSIVKSGMASNSSVGCSLIDMY 530

Query: 361 INCGDERAATNMFTRCPQR--------------------DLVSWNSLISSYIKNDNAGKA 420
             CG    A  + +R  QR                      VSWNS+IS Y+  + +  A
Sbjct: 531 SKCGMIEEAEKIHSRFFQRANVXXXXXXXXXXXXXXXXXXXVSWNSIISGYVMKEQSEDA 590

Query: 421 LLLFNHMIS-ELEPNSVTIINILTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFI 480
            +LF  M+   + P+  T   +L +C  LA   LG+ +HA   ++E  L+ D  + +  +
Sbjct: 591 QMLFTRMMEMGITPDKFTYATVLDTCANLASAGLGKQIHAQVIKKE--LQSDVYICSTLV 650

Query: 481 TMYARCGKLQYAEKIFCTLQTRSIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNV 540
            MY++CG L  +  +F     R  V+WNAMI GY  HG+G +A   F +M+ +  KPN+V
Sbjct: 651 DMYSKCGDLHDSRLMFEKSLRRDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHV 710

Query: 541 SFASVLSACSHSGLTVTGLQLFHSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINS 600
           +F S+L AC+H GL   GL+ F+ M RD+G+ PQL HY  MVD+LG+ G    A+  I  
Sbjct: 711 TFISILRACAHMGLIDKGLEYFYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIRE 770

Query: 601 MPIEPDASIWRALLSSCQIKSNN-KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSE 660
           MP E D  IWR LL  C I  NN ++ E     L+ L+P +   + LLSN+YA AG+W +
Sbjct: 771 MPFEADDVIWRTLLGVCTIHRNNVEVAEEATAALLRLDPQDSSAYTLLSNVYADAGMWEK 830

Query: 661 VVQIRKWLRERGLGKPPGTSWIVIGNQVHHFTATDVLHPQSERIYENLNSLTS 691
           V  +R+ +R   L K PG SW+ + +++H F   D  HP+ E IYE L  + S
Sbjct: 831 VSDLRRNMRGFKLKKEPGCSWVELKDELHVFLVGDKAHPRWEEIYEELGLIYS 878

BLAST of CsaV3_4G024260 vs. Swiss-Prot
Match: sp|Q7Y211|PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 420.2 bits (1079), Expect = 4.4e-116
Identity = 243/713 (34.08%), Positives = 402/713 (56.38%), Query Frame = 0

Query: 5   GLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMR---FDA 64
           G+ P++    ALL A  ++ ++ LG++IH +  + G       V   LV  Y +   F A
Sbjct: 92  GIKPDNYAFPALLKAVADLQDMELGKQIHAHVYKFGYGVDSVTVANTLVNLYRKCGDFGA 151

Query: 65  VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQAC 124
           V  ++VF  +  RN VSWN++I+   +      AL+ +  ML E ++  + T++ V+ AC
Sbjct: 152 V--YKVFDRISERNQVSWNSLISSLCSFEKWEMALEAFRCMLDENVEPSSFTLVSVVTAC 211

Query: 125 AEYGC---LRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAA 184
           +       L +G Q+H   ++   +N  FI+N L+ MY   G L SS  L  +    D  
Sbjct: 212 SNLPMPEGLMMGKQVHAYGLRKGELNS-FIINTLVAMYGKLGKLASSKVLLGSFGGRDLV 271

Query: 185 LWNSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAH 244
            WN+++SS        EA+    +M LE ++ D  TI+ +L  C+ L +    G+ LHA+
Sbjct: 272 TWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEFTISSVLPACSHL-EMLRTGKELHAY 331

Query: 245 AMKSG-IELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAK 304
           A+K+G ++ ++++G+AL+ MY    Q+ + + VF+ M    +  WN MI+ ++Q+    +
Sbjct: 332 ALKNGSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKE 391

Query: 305 AFELFLMMCESE-IKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLT 364
           A  LF+ M ES  +  NS T+  ++  C          +IHGF +K GL+ +  +  +L 
Sbjct: 392 ALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRDRFVQNTLM 451

Query: 365 EMYINCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMIS-------- 424
           +MY   G    A  +F +   RDLV+WN++I+ Y+ +++   ALLL + M +        
Sbjct: 452 DMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVSKG 511

Query: 425 ----ELEPNSVTIINILTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARC 484
                L+PNS+T++ IL SC  L+ L  G+ +HAY  +   +L  D ++ +A + MYA+C
Sbjct: 512 ASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKN--NLATDVAVGSALVDMYAKC 571

Query: 485 GKLQYAEKIFCTLQTRSIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVL 544
           G LQ + K+F  +  +++++WN +I  YGMHG G++A      M+  G KPN V+F SV 
Sbjct: 572 GCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVF 631

Query: 545 SACSHSGLTVTGLQLFHSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIE-P 604
           +ACSHSG+   GL++F+ M  D+G+ P   HY C+VDLLGR G   EA   +N MP +  
Sbjct: 632 AACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFN 691

Query: 605 DASIWRALLSSCQIKSNNKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRK 664
            A  W +LL + +I +N ++ E     L++LEP+   +++LL+NIY++AGLW +  ++R+
Sbjct: 692 KAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRR 751

Query: 665 WLRERGLGKPPGTSWIVIGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
            ++E+G+ K PG SWI  G++VH F A D  HPQSE++   L +L   +R  G
Sbjct: 752 NMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEG 798

BLAST of CsaV3_4G024260 vs. Swiss-Prot
Match: sp|Q9SVP7|PP307_ARATH (Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H42 PE=2 SV=2)

HSP 1 Score: 404.8 bits (1039), Expect = 1.9e-111
Identity = 226/694 (32.56%), Positives = 373/694 (53.75%), Query Frame = 0

Query: 5   GLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDAVLS 64
           G+ P      ++L AC ++  L +G+++HG  L+ G F  D YV  ALV  Y     ++S
Sbjct: 283 GIMPTPYAFSSVLSACKKIESLEIGEQLHGLVLKLG-FSSDTYVCNALVSLYFHLGNLIS 342

Query: 65  -HRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQACAE 124
              +FS M  R+ V++N +I G    G   KA++L+  M ++G++ D+ T+  ++ AC+ 
Sbjct: 343 AEHIFSNMSQRDAVTYNTLINGLSQCGYGEKAMELFKRMHLDGLEPDSNTLASLVVACSA 402

Query: 125 YGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWNSM 184
            G L  G QLH    K    ++  I  ALLN+Y+    +E++   F      +  LWN M
Sbjct: 403 DGTLFRGQQLHAYTTKLGFASNNKIEGALLNLYAKCADIETALDYFLETEVENVVLWNVM 462

Query: 185 ISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMKSG 244
           + +Y        +  +F +M++E I  +  T   +L  C  L D  + G  +H+  +K+ 
Sbjct: 463 LVAYGLLDDLRNSFRIFRQMQIEEIVPNQYTYPSILKTCIRLGDLEL-GEQIHSQIIKTN 522

Query: 245 IELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFELFL 304
            +L+AY+ + L+ MY K  ++  A  +  +  G DV+SW TMI+ + Q  F  KA   F 
Sbjct: 523 FQLNAYVCSVLIDMYAKLGKLDTAWDILIRFAGKDVVSWTTMIAGYTQYNFDDKALTTFR 582

Query: 305 MMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCG 364
            M +  I+ +   + + ++ C     L  G+ IH  A  +G   +     +L  +Y  CG
Sbjct: 583 QMLDRGIRSDEVGLTNAVSACAGLQALKEGQQIHAQACVSGFSSDLPFQNALVTLYSRCG 642

Query: 365 DERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISE-LEPNSVTIINILT 424
               +   F +    D ++WN+L+S + ++ N  +AL +F  M  E ++ N+ T  + + 
Sbjct: 643 KIEESYLAFEQTEAGDNIAWNALVSGFQQSGNNEEALRVFVRMNREGIDNNNFTFGSAVK 702

Query: 425 SCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRSI 484
           + ++ A++  G+ +HA  T+     + +  + NA I+MYA+CG +  AEK F  + T++ 
Sbjct: 703 AASETANMKQGKQVHAVITK--TGYDSETEVCNALISMYAKCGSISDAEKQFLEVSTKNE 762

Query: 485 VSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFHS 544
           VSWNA+I  Y  HG G +A  +F QM+    +PN+V+   VLSACSH GL   G+  F S
Sbjct: 763 VSWNAIINAYSKHGFGSEALDSFDQMIHSNVRPNHVTLVGVLSACSHIGLVDKGIAYFES 822

Query: 545 MVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNNK 604
           M  ++G++P+  HY C+VD+L R G  S A  FI  MPI+PDA +WR LLS+C +  N +
Sbjct: 823 MNSEYGLSPKPEHYVCVVDMLTRAGLLSRAKEFIQEMPIKPDALVWRTLLSACVVHKNME 882

Query: 605 LLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVIG 664
           + E     L+ELEP +   ++LLSN+YA +  W      R+ ++E+G+ K PG SWI + 
Sbjct: 883 IGEFAAHHLLELEPEDSATYVLLSNLYAVSKKWDARDLTRQKMKEKGVKKEPGQSWIEVK 942

Query: 665 NQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           N +H F   D  HP ++ I+E    LT    ++G
Sbjct: 943 NSIHSFYVGDQNHPLADEIHEYFQDLTKRASEIG 972

BLAST of CsaV3_4G024260 vs. Swiss-Prot
Match: sp|Q3E6Q1|PPR32_ARATH (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 394.4 bits (1012), Expect = 2.6e-108
Identity = 229/683 (33.53%), Positives = 366/683 (53.59%), Query Frame = 0

Query: 16  LLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDAV-LSHRVFSLMLVR 75
           LL  C  + ELR   +I     +NGL+  + +  T LV  + R+ +V  + RVF  +  +
Sbjct: 43  LLERCSSLKELR---QILPLVFKNGLY-QEHFFQTKLVSLFCRYGSVDEAARVFEPIDSK 102

Query: 76  NIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQACAEYGCLRLGMQLH 135
             V ++ ++ GF  V D  KAL+ +  M  + ++        +++ C +   LR+G ++H
Sbjct: 103 LNVLYHTMLKGFAKVSDLDKALQFFVRMRYDDVEPVVYNFTYLLKVCGDEAELRVGKEIH 162

Query: 136 QLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWNSMISSYIGFGFHA 195
            L +K     DLF +  L NMY+    +  +  +F+ +P  D   WN++++ Y   G   
Sbjct: 163 GLLVKSGFSLDLFAMTGLENMYAKCRQVNEARKVFDRMPERDLVSWNTIVAGYSQNGMAR 222

Query: 196 EAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMKSGIELDAYLGNAL 255
            A+ +   M  E +K    TI  +L   + L   S+ G+ +H +AM+SG +    +  AL
Sbjct: 223 MALEMVKSMCEENLKPSFITIVSVLPAVSALRLISV-GKEIHGYAMRSGFDSLVNISTAL 282

Query: 256 LSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFELFLMMCESEIKFNS 315
           + MY K   +  A+ +F+ M   +V+SWN+MI A+ Q+    +A  +F  M +  +K   
Sbjct: 283 VDMYAKCGSLETARQLFDGMLERNVVSWNSMIDAYVQNENPKEAMLIFQKMLDEGVKPTD 342

Query: 316 YTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYINCGDERAATNMFTR 375
            +++  L  C D  DL  GR IH  +++ GL+ N S+  SL  MY  C +   A +MF +
Sbjct: 343 VSVMGALHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGK 402

Query: 376 CPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISE-LEPNSVTIINILTSCTQLAHLPLG 435
              R LVSWN++I  + +N     AL  F+ M S  ++P++ T ++++T+  +L+     
Sbjct: 403 LQSRTLVSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHA 462

Query: 436 QCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRSIVSWNAMITGYG 495
           + +H    R    L+ +  +  A + MYA+CG +  A  IF  +  R + +WNAMI GYG
Sbjct: 463 KWIHGVVMRS--CLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYG 522

Query: 496 MHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFHSMVRDFGIAPQL 555
            HG G+ A   F +M     KPN V+F SV+SACSHSGL   GL+ F+ M  ++ I   +
Sbjct: 523 THGFGKAALELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSM 582

Query: 556 THYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNNKLLETIFGKLVE 615
            HYG MVDLLGR G  +EA  FI  MP++P  +++ A+L +CQI  N    E    +L E
Sbjct: 583 DHYGAMVDLLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFE 642

Query: 616 LEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVIGNQVHHFTATDV 675
           L P + G  +LL+NIY AA +W +V Q+R  +  +GL K PG S + I N+VH F +   
Sbjct: 643 LNPDDGGYHVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGST 702

Query: 676 LHPQSERIYENLNSLTSLIRDLG 697
            HP S++IY  L  L   I++ G
Sbjct: 703 AHPDSKKIYAFLEKLICHIKEAG 718

BLAST of CsaV3_4G024260 vs. Swiss-Prot
Match: sp|Q9M9E2|PPR45_ARATH (Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H73 PE=1 SV=1)

HSP 1 Score: 393.3 bits (1009), Expect = 5.7e-108
Identity = 223/698 (31.95%), Positives = 366/698 (52.44%), Query Frame = 0

Query: 1   MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRF- 60
           M++  +  +    VAL+  C        G +++   L + +  +   +G A +  ++RF 
Sbjct: 85  MQELRVAVDEDVFVALVRLCEWKRAQEEGSKVYSIAL-SSMSSLGVELGNAFLAMFVRFG 144

Query: 61  DAVLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSML-IEGIKFDAVTMLVVI 120
           + V +  VF  M  RN+ SWN ++ G+   G   +A+ LY  ML + G+K D  T   V+
Sbjct: 145 NLVDAWYVFGKMSERNLFSWNVLVGGYAKQGYFDEAMCLYHRMLWVGGVKPDVYTFPCVL 204

Query: 121 QACAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAA 180
           + C     L  G ++H   +++    D+ ++NAL+ MY   G ++S+  LF+ +P  D  
Sbjct: 205 RTCGGIPDLARGKEVHVHVVRYGYELDIDVVNALITMYVKCGDVKSARLLFDRMPRRDII 264

Query: 181 LWNSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAH 240
            WN+MIS Y   G   E + LF  MR   +  D+ T+  ++S C  L D  + GR +HA+
Sbjct: 265 SWNAMISGYFENGMCHEGLELFFAMRGLSVDPDLMTLTSVISACELLGDRRL-GRDIHAY 324

Query: 241 AMKSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKA 300
            + +G  +D  + N+L  MY+       A+ +F +M   D++SW TMIS +  +    KA
Sbjct: 325 VITTGFAVDISVCNSLTQMYLNAGSWREAEKLFSRMERKDIVSWTTMISGYEYNFLPDKA 384

Query: 301 FELFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEM 360
            + + MM +  +K +  T+ ++L+ C    DL  G  +H  AIK  L     +  +L  M
Sbjct: 385 IDTYRMMDQDSVKPDEITVAAVLSACATLGDLDTGVELHKLAIKARLISYVIVANNLINM 444

Query: 361 YINCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTII 420
           Y  C     A ++F   P+++++SW S+I+    N+   +AL+    M   L+PN++T+ 
Sbjct: 445 YSKCKCIDKALDIFHNIPRKNVISWTSIIAGLRLNNRCFEALIFLRQMKMTLQPNAITLT 504

Query: 421 NILTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQ 480
             L +C ++  L  G+ +HA+  R  V L  D  L NA + MY RCG++  A   F   Q
Sbjct: 505 AALAACARIGALMCGKEIHAHVLRTGVGL--DDFLPNALLDMYVRCGRMNTAWSQF-NSQ 564

Query: 481 TRSIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQ 540
            + + SWN ++TGY   G+G      F +M+    +P+ ++F S+L  CS S +   GL 
Sbjct: 565 KKDVTSWNILLTGYSERGQGSMVVELFDRMVKSRVRPDEITFISLLCGCSKSQMVRQGLM 624

Query: 541 LFHSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIK 600
            F S + D+G+ P L HY C+VDLLGR G   EA  FI  MP+ PD ++W ALL++C+I 
Sbjct: 625 YF-SKMEDYGVTPNLKHYACVVDLLGRAGELQEAHKFIQKMPVTPDPAVWGALLNACRIH 684

Query: 601 SNNKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSW 660
               L E     + EL+  + G +ILL N+YA  G W EV ++R+ ++E GL    G SW
Sbjct: 685 HKIDLGELSAQHIFELDKKSVGYYILLCNLYADCGKWREVAKVRRMMKENGLTVDAGCSW 744

Query: 661 IVIGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           + +  +VH F + D  HPQ++ I   L      + ++G
Sbjct: 745 VEVKGKVHAFLSDDKYHPQTKEINTVLEGFYEKMSEVG 776

BLAST of CsaV3_4G024260 vs. Swiss-Prot
Match: sp|Q9FWA6|PP207_ARATH (Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-E90 PE=2 SV=2)

HSP 1 Score: 390.6 bits (1002), Expect = 3.7e-107
Identity = 241/713 (33.80%), Positives = 370/713 (51.89%), Query Frame = 0

Query: 1   MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFD 60
           M + G+  + RT   +L  C  + +  LG +IHG  +R G  D D    +AL+  Y +  
Sbjct: 171 MGREGIEFDGRTFAIILKVCSFLEDTSLGMQIHGIVVRVGC-DTDVVAASALLDMYAKGK 230

Query: 61  A-VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQ 120
             V S RVF  +  +N VSW+AII G +     + ALK +  M              V++
Sbjct: 231 RFVESLRVFQGIPEKNSVSWSAIIAGCVQNNLLSLALKFFKEMQKVNAGVSQSIYASVLR 290

Query: 121 ACAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAAL 180
           +CA    LRLG QLH  A+K +   D  +  A L+MY+   +++ +  LF+     +   
Sbjct: 291 SCAALSELRLGGQLHAHALKSDFAADGIVRTATLDMYAKCDNMQDAQILFDNSENLNRQS 350

Query: 181 WNSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHA 240
           +N+MI+ Y       +A+ LF ++    +  D  +++ +   C  L  G   G  ++  A
Sbjct: 351 YNAMITGYSQEEHGFKALLLFHRLMSSGLGFDEISLSGVFRAC-ALVKGLSEGLQIYGLA 410

Query: 241 MKSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAF 300
           +KS + LD  + NA + MY K   +  A  VF++MR  D +SWN +I+A  Q+    +  
Sbjct: 411 IKSSLSLDVCVANAAIDMYGKCQALAEAFRVFDEMRRRDAVSWNAIIAAHEQNGKGYETL 470

Query: 301 ELFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMY 360
            LF+ M  S I+ + +T  S+L  C  GS L +G  IH   +K+G+  N+S+  SL +MY
Sbjct: 471 FLFVSMLRSRIEPDEFTFGSILKACTGGS-LGYGMEIHSSIVKSGMASNSSVGCSLIDMY 530

Query: 361 INCGDERAATNMFTRCPQR--------------------DLVSWNSLISSYIKNDNAGKA 420
             CG    A  + +R  QR                      VSWNS+IS Y+  + +  A
Sbjct: 531 SKCGMIEEAEKIHSRFFQRANVXXXXXXXXXXXXXXXXXXXVSWNSIISGYVMKEQSEDA 590

Query: 421 LLLFNHMIS-ELEPNSVTIINILTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFI 480
            +LF  M+   + P+  T   +L +C  LA   LG+ +HA   ++E  L+ D  + +  +
Sbjct: 591 QMLFTRMMEMGITPDKFTYATVLDTCANLASAGLGKQIHAQVIKKE--LQSDVYICSTLV 650

Query: 481 TMYARCGKLQYAEKIFCTLQTRSIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNV 540
            MY++CG L  +  +F     R  V+WNAMI GY  HG+G +A   F +M+ +  KPN+V
Sbjct: 651 DMYSKCGDLHDSRLMFEKSLRRDFVTWNAMICGYAHHGKGEEAIQLFERMILENIKPNHV 710

Query: 541 SFASVLSACSHSGLTVTGLQLFHSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINS 600
           +F S+L AC+H GL   GL+ F+ M RD+G+ PQL HY  MVD+LG+ G    A+  I  
Sbjct: 711 TFISILRACAHMGLIDKGLEYFYMMKRDYGLDPQLPHYSNMVDILGKSGKVKRALELIRE 770

Query: 601 MPIEPDASIWRALLSSCQIKSNN-KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSE 660
           MP E D  IWR LL  C I  NN ++ E     L+ L+P +   + LLSN+YA AG+W +
Sbjct: 771 MPFEADDVIWRTLLGVCTIHRNNVEVAEEATAALLRLDPQDSSAYTLLSNVYADAGMWEK 830

Query: 661 VVQIRKWLRERGLGKPPGTSWIVIGNQVHHFTATDVLHPQSERIYENLNSLTS 691
           V  +R+ +R   L K PG SW+ + +++H F   D  HP+ E IYE L  + S
Sbjct: 831 VSDLRRNMRGFKLKKEPGCSWVELKDELHVFLVGDKAHPRWEEIYEELGLIYS 878

BLAST of CsaV3_4G024260 vs. TrEMBL
Match: tr|A0A1S4DXL9|A0A1S4DXL9_CUCME (pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491511 PE=4 SV=1)

HSP 1 Score: 1313.1 bits (3397), Expect = 0.0e+00
Identity = 655/696 (94.11%), Positives = 675/696 (96.98%), Query Frame = 0

Query: 1   MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFD 60
           MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFY+RFD
Sbjct: 32  MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYLRFD 91

Query: 61  AVLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQA 120
           AVLSHRVFSLM+VRNIVSWNAIITGFLNVGD  KALKL+SSMLIEGIKFDAVTMLVVIQA
Sbjct: 92  AVLSHRVFSLMVVRNIVSWNAIITGFLNVGDYTKALKLFSSMLIEGIKFDAVTMLVVIQA 151

Query: 121 CAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALW 180
           CAEYGCLRLGMQLHQLAIKFNLIND+F+LNALLNMYSDNGSLESS  LFNAVPTSDAALW
Sbjct: 152 CAEYGCLRLGMQLHQLAIKFNLINDVFVLNALLNMYSDNGSLESSCVLFNAVPTSDAALW 211

Query: 181 NSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAM 240
           NSMIS YIGFGFHAEAIALFIKMRLERIKEDVRTI IMLSLCNDLNDGS+WGRGLHAHAM
Sbjct: 212 NSMISCYIGFGFHAEAIALFIKMRLERIKEDVRTIVIMLSLCNDLNDGSLWGRGLHAHAM 271

Query: 241 KSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFE 300
           KSGIELDA+LGNALLSMYVKHNQI AAQ VFEK RGLDVISWNTMISA AQSMFRAKAFE
Sbjct: 272 KSGIELDAFLGNALLSMYVKHNQINAAQNVFEKTRGLDVISWNTMISALAQSMFRAKAFE 331

Query: 301 LFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI 360
           LF MMCESEIKFNSYTI+SLLA CKDG+DLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI
Sbjct: 332 LFFMMCESEIKFNSYTIISLLALCKDGNDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI 391

Query: 361 NCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINI 420
           NCGDERAA +MFTRCPQRDL+SWNSLI SYIKNDNAGKALLLFNHMISELEPNSVTIINI
Sbjct: 392 NCGDERAAIDMFTRCPQRDLISWNSLILSYIKNDNAGKALLLFNHMISELEPNSVTIINI 451

Query: 421 LTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTR 480
           LTSCTQLAHLPLGQCLHAY TRRE SLEMDASLANAFITMYARCGK+QYAE+IF TLQTR
Sbjct: 452 LTSCTQLAHLPLGQCLHAYATRREESLEMDASLANAFITMYARCGKMQYAEQIFRTLQTR 511

Query: 481 SIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLF 540
           +IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLT TGL LF
Sbjct: 512 NIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTETGLLLF 571

Query: 541 HSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSN 600
           HSMVRDFG+APQLTHYGCMVDLLGRGGHFSEAIAFIN+MPIEPDASIWRALLSS QIKSN
Sbjct: 572 HSMVRDFGLAPQLTHYGCMVDLLGRGGHFSEAIAFINTMPIEPDASIWRALLSSWQIKSN 631

Query: 601 NKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIV 660
            KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIV
Sbjct: 632 KKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIV 691

Query: 661 IGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           IGNQVH+FTATDVLHPQSE+IYENLNSLTSLI+D+G
Sbjct: 692 IGNQVHYFTATDVLHPQSEKIYENLNSLTSLIQDMG 727

BLAST of CsaV3_4G024260 vs. TrEMBL
Match: tr|A0A1S3BNK9|A0A1S3BNK9_CUCME (pentatricopeptide repeat-containing protein At2g13600-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491511 PE=4 SV=1)

HSP 1 Score: 1311.2 bits (3392), Expect = 0.0e+00
Identity = 654/695 (94.10%), Positives = 674/695 (96.98%), Query Frame = 0

Query: 2   KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDA 61
           KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFY+RFDA
Sbjct: 153 KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYLRFDA 212

Query: 62  VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQAC 121
           VLSHRVFSLM+VRNIVSWNAIITGFLNVGD  KALKL+SSMLIEGIKFDAVTMLVVIQAC
Sbjct: 213 VLSHRVFSLMVVRNIVSWNAIITGFLNVGDYTKALKLFSSMLIEGIKFDAVTMLVVIQAC 272

Query: 122 AEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWN 181
           AEYGCLRLGMQLHQLAIKFNLIND+F+LNALLNMYSDNGSLESS  LFNAVPTSDAALWN
Sbjct: 273 AEYGCLRLGMQLHQLAIKFNLINDVFVLNALLNMYSDNGSLESSCVLFNAVPTSDAALWN 332

Query: 182 SMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMK 241
           SMIS YIGFGFHAEAIALFIKMRLERIKEDVRTI IMLSLCNDLNDGS+WGRGLHAHAMK
Sbjct: 333 SMISCYIGFGFHAEAIALFIKMRLERIKEDVRTIVIMLSLCNDLNDGSLWGRGLHAHAMK 392

Query: 242 SGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFEL 301
           SGIELDA+LGNALLSMYVKHNQI AAQ VFEK RGLDVISWNTMISA AQSMFRAKAFEL
Sbjct: 393 SGIELDAFLGNALLSMYVKHNQINAAQNVFEKTRGLDVISWNTMISALAQSMFRAKAFEL 452

Query: 302 FLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 361
           F MMCESEIKFNSYTI+SLLA CKDG+DLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN
Sbjct: 453 FFMMCESEIKFNSYTIISLLALCKDGNDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 512

Query: 362 CGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINIL 421
           CGDERAA +MFTRCPQRDL+SWNSLI SYIKNDNAGKALLLFNHMISELEPNSVTIINIL
Sbjct: 513 CGDERAAIDMFTRCPQRDLISWNSLILSYIKNDNAGKALLLFNHMISELEPNSVTIINIL 572

Query: 422 TSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRS 481
           TSCTQLAHLPLGQCLHAY TRRE SLEMDASLANAFITMYARCGK+QYAE+IF TLQTR+
Sbjct: 573 TSCTQLAHLPLGQCLHAYATRREESLEMDASLANAFITMYARCGKMQYAEQIFRTLQTRN 632

Query: 482 IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFH 541
           IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLT TGL LFH
Sbjct: 633 IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTETGLLLFH 692

Query: 542 SMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNN 601
           SMVRDFG+APQLTHYGCMVDLLGRGGHFSEAIAFIN+MPIEPDASIWRALLSS QIKSN 
Sbjct: 693 SMVRDFGLAPQLTHYGCMVDLLGRGGHFSEAIAFINTMPIEPDASIWRALLSSWQIKSNK 752

Query: 602 KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVI 661
           KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVI
Sbjct: 753 KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVI 812

Query: 662 GNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           GNQVH+FTATDVLHPQSE+IYENLNSLTSLI+D+G
Sbjct: 813 GNQVHYFTATDVLHPQSEKIYENLNSLTSLIQDMG 847

BLAST of CsaV3_4G024260 vs. TrEMBL
Match: tr|A0A2N9IVN7|A0A2N9IVN7_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS56003 PE=4 SV=1)

HSP 1 Score: 916.4 bits (2367), Expect = 3.9e-263
Identity = 449/696 (64.51%), Positives = 553/696 (79.45%), Query Frame = 0

Query: 1   MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFD 60
           M++ G  PNSRTVVALLLA  E+LELR+GQEIHGYCLRNGLFD+D +VGTAL+GFY+RFD
Sbjct: 160 MEREGFKPNSRTVVALLLASEEVLELRIGQEIHGYCLRNGLFDLDPHVGTALIGFYLRFD 219

Query: 61  AVLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQA 120
             +S  VF LM+VR+IVSWNA+ITG+++VG+  +ALKL+ SML  G+ FD+VTMLVVIQA
Sbjct: 220 VRISRLVFDLMVVRSIVSWNAMITGYVDVGNYLEALKLFMSMLENGVPFDSVTMLVVIQA 279

Query: 121 CAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALW 180
           C E+G L+LGMQ+HQ+AIK   + DL+I+NALLNMYS+NGSL SS ALF+ +P  D ALW
Sbjct: 280 CVEFGSLQLGMQIHQMAIKLKYVGDLYIVNALLNMYSENGSLASSCALFDTIPACDVALW 339

Query: 181 NSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAM 240
           NSMIS+YIGFG + EA+ LFI MR E IKED RTIAI+LS C    DG   G+ LHAHA 
Sbjct: 340 NSMISAYIGFGCYEEALRLFISMRTEGIKEDERTIAIILSSCAKFTDGLRKGKSLHAHAT 399

Query: 241 KSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFE 300
           KS +++D  LGNALLSMY   N + A Q VF ++R LD++SWNT+I A A +  R +A++
Sbjct: 400 KSRMKIDISLGNALLSMYADLNCVEAVQKVFAELRSLDIVSWNTLILALAHNRMRIEAWK 459

Query: 301 LFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI 360
           LF +M ESE K NSYTI+S+LA C+D + L  GRSIHGF +K+G+E+N SLNT+LT+MY+
Sbjct: 460 LFGVMRESEFKPNSYTIISILATCEDETCLNVGRSIHGFVVKHGIEVNLSLNTTLTDMYM 519

Query: 361 NCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINI 420
           NCGDE +A  +F  CP RDL+SWNS+I++YI ++   +ALLLFN MI E+EPNSVTIIN+
Sbjct: 520 NCGDEASARILFEGCPNRDLISWNSMIANYINDNQVREALLLFNRMIQEVEPNSVTIINV 579

Query: 421 LTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTR 480
           L+SCT LA+LP GQCLHAY  RR   L  + SLANAFITMYARCG +Q AEKIF TL  R
Sbjct: 580 LSSCTDLANLPQGQCLHAYVIRRHSYLSFNLSLANAFITMYARCGSMQKAEKIFKTLPRR 639

Query: 481 SIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLF 540
           +I+SWNAMITGY MHGRG DA LAF QML++G++PN V+F SV+SACSHSGL   GL+LF
Sbjct: 640 NIISWNAMITGYRMHGRGDDAILAFLQMLEEGYQPNGVTFLSVISACSHSGLIEKGLELF 699

Query: 541 HSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSN 600
           HSMV+DF I P+L HYGC+VDLLGRGG   EA  FI SMPIEPDAS+WRALLSSC++ ++
Sbjct: 700 HSMVKDFKITPELVHYGCVVDLLGRGGRLDEAREFIQSMPIEPDASVWRALLSSCRVNND 759

Query: 601 NKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIV 660
            KL E IF KLVELEP NPGN++LLSNIYAAA  W EV +IR WL+E+GL KPPG SWIV
Sbjct: 760 TKLAENIFEKLVELEPMNPGNYVLLSNIYAAADCWLEVTRIRTWLKEKGLHKPPGISWIV 819

Query: 661 IGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           + +QV +FTA D+ HPQS +IY NLNSL   I++ G
Sbjct: 820 VRSQVQYFTAGDLSHPQSGQIYANLNSLLCTIKENG 855

BLAST of CsaV3_4G024260 vs. TrEMBL
Match: tr|A0A2I4FH18|A0A2I4FH18_9ROSI (putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic isoform X1 OS=Juglans regia OX=51240 GN=LOC108998716 PE=4 SV=1)

HSP 1 Score: 882.9 bits (2280), Expect = 4.8e-253
Identity = 433/695 (62.30%), Positives = 542/695 (77.99%), Query Frame = 0

Query: 2   KKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFDA 61
           ++ G  PNSRTVVALLLAC E+LEL+LGQE+HGYCLRNGL D++ +VGTAL+GFY+RFDA
Sbjct: 149 EREGFRPNSRTVVALLLACEEVLELKLGQEMHGYCLRNGLLDLNPHVGTALIGFYLRFDA 208

Query: 62  VLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQAC 121
            +S  VF LM+VR+IVSWNA+I G+++VGD  +ALKL+ SML++G+ FD+VT L  IQAC
Sbjct: 209 RISRLVFDLMVVRSIVSWNAMINGYVDVGDYLEALKLFLSMLMDGVHFDSVTTLGCIQAC 268

Query: 122 AEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALWN 181
           AE G L LG QLHQ+AIK    NDL+++NALL MYS+NGSL+S++ LF   P  D ALWN
Sbjct: 269 AELGYLGLGTQLHQMAIKLKYANDLYVINALLTMYSENGSLDSAYKLFGGTPARDVALWN 328

Query: 182 SMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAMK 241
           SMIS+YI FG   EA++LF  MR + IKED RTI  +LS C+ L DG   G+ LHAHA K
Sbjct: 329 SMISAYINFGDFEEALSLFSSMRTKGIKEDERTIVTVLSSCSKLADGLRKGKSLHAHATK 388

Query: 242 SGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFEL 301
           SG+++D  LGNALLSMY + N + A Q VF K+R  D+ISWNT+I A A +  R++A++L
Sbjct: 389 SGMKMDVSLGNALLSMYAELNCVEAVQKVFAKLRDSDIISWNTVILALAHNKMRSEAWKL 448

Query: 302 FLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYIN 361
           F +M ESE K N YTI+S+LA C D + L  GRSIHGF +K+ +EIN SLNT+LT+MY+ 
Sbjct: 449 FGLMRESEFKPNPYTIISILATCDDETCLNIGRSIHGFVVKHAIEINLSLNTALTDMYMK 508

Query: 362 CGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINIL 421
            GDE +A  +F  CP+RDL+SWN++I+ YI ++ A KAL LFNHMI E+EPNSVTI+++L
Sbjct: 509 SGDEASARTLFDCCPKRDLISWNAMITGYINDNQAIKALYLFNHMILEVEPNSVTIMSVL 568

Query: 422 TSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTRS 481
           +SCT LA+LP G+CLHAY TRR+     + SLANAFI MYARCG +Q A+ IF TL  R+
Sbjct: 569 SSCTDLANLPQGKCLHAYATRRDSYFGFNLSLANAFIMMYARCGSMQSAKNIFETLPRRN 628

Query: 482 IVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLFH 541
           I++WNAMI GY MHGRG DA  +F QML+DG+ PN  +F SV+SACSHSG    GL+LFH
Sbjct: 629 IIAWNAMINGYRMHGRGYDAIHSFLQMLEDGYTPNGATFLSVISACSHSGFIEKGLELFH 688

Query: 542 SMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSNN 601
           SMV+DF I P+L HYGC+VDLLGRGG   EA  FI SMPI+PDAS+WRALLS+C++ S+ 
Sbjct: 689 SMVQDFKIEPELVHYGCVVDLLGRGGRLDEAREFIESMPIKPDASVWRALLSACRVNSDI 748

Query: 602 KLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIVI 661
           KL E IF KL+ELEP NPGN+ILLSNIYAA G WSEV  IR WLRE+GL KPPG SWIV+
Sbjct: 749 KLAENIFEKLIELEPMNPGNYILLSNIYAAVGRWSEVRHIRTWLREKGLNKPPGFSWIVV 808

Query: 662 GNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
            +Q H+FTA+DV HPQSE+IYENLNSL SLI++ G
Sbjct: 809 RSQPHYFTASDVSHPQSEKIYENLNSLLSLIKENG 843

BLAST of CsaV3_4G024260 vs. TrEMBL
Match: tr|A0A061GS93|A0A061GS93_THECC (Pentatricopeptide repeat-containing protein OS=Theobroma cacao OX=3641 GN=TCM_040700 PE=4 SV=1)

HSP 1 Score: 854.7 bits (2207), Expect = 1.4e-244
Identity = 417/696 (59.91%), Positives = 535/696 (76.87%), Query Frame = 0

Query: 1   MKKAGLTPNSRTVVALLLACGEMLELRLGQEIHGYCLRNGLFDMDAYVGTALVGFYMRFD 60
           M++ G  PNSRT+VA+LLAC E+ E+RLG+EIHGYCLRNGLFD+D +VGTAL+GFY+ F+
Sbjct: 150 MQREGFRPNSRTLVAMLLACQEVAEVRLGKEIHGYCLRNGLFDLDPHVGTALIGFYLSFN 209

Query: 61  AVLSHRVFSLMLVRNIVSWNAIITGFLNVGDCAKALKLYSSMLIEGIKFDAVTMLVVIQA 120
              SH VF LM VRN V WNA+I G+ ++G+  KALKL+  ML++G++FD+VTML +IQA
Sbjct: 210 VRASHTVFDLMAVRNTVCWNAMIKGYFDIGESLKALKLFEKMLMDGVEFDSVTMLALIQA 269

Query: 121 CAEYGCLRLGMQLHQLAIKFNLINDLFILNALLNMYSDNGSLESSWALFNAVPTSDAALW 180
           CAE+G L LG Q+HQ+AIK +  NDLFI+NALLNMY+D GSL+S+  LF+  P  D ALW
Sbjct: 270 CAEFGSLELGSQIHQMAIKCSYSNDLFIVNALLNMYADIGSLKSACKLFDVTPRRDVALW 329

Query: 181 NSMISSYIGFGFHAEAIALFIKMRLERIKEDVRTIAIMLSLCNDLNDGSIWGRGLHAHAM 240
           NSMIS+Y  +  + EA +LF+ MR E  KED RTI IM SLC +  DG   G+ LHA+A 
Sbjct: 330 NSMISAYFEYSCNEEATSLFVHMRTEGNKEDDRTIVIMFSLCAESADGLRKGKSLHAYAS 389

Query: 241 KSGIELDAYLGNALLSMYVKHNQITAAQYVFEKMRGLDVISWNTMISAFAQSMFRAKAFE 300
           KSG+ +D  LGNA+L+MY + N I + Q VF +M  +DVIS+NT+I A A++   ++A+E
Sbjct: 390 KSGMRMDVNLGNAMLNMYAQQNCIDSVQKVFSEMSNVDVISFNTVILALARNKLGSEAWE 449

Query: 301 LFLMMCESEIKFNSYTIVSLLAFCKDGSDLVFGRSIHGFAIKNGLEINTSLNTSLTEMYI 360
           +F +M E +++ NSYTI+S+LA CKD + L  GRS+HGF IK G+E+N SLNT+LT+MYI
Sbjct: 450 VFGLMWELDVEPNSYTIISILAACKDETCLNIGRSLHGFVIKQGIEVNVSLNTALTDMYI 509

Query: 361 NCGDERAATNMFTRCPQRDLVSWNSLISSYIKNDNAGKALLLFNHMISELEPNSVTIINI 420
           NCGDE  A N+F  CP RDL+SWN+LI++Y+KN+ A +A L+F+ MISE+EPNSVTIINI
Sbjct: 510 NCGDEATARNLFESCPGRDLISWNALIATYVKNNLAHEAFLVFSRMISEVEPNSVTIINI 569

Query: 421 LTSCTQLAHLPLGQCLHAYTTRREVSLEMDASLANAFITMYARCGKLQYAEKIFCTLQTR 480
           L+SCT LAHLP GQC HAY  R+E SL  + SL NAFITMYARCG +Q AE+IF TL  R
Sbjct: 570 LSSCTHLAHLPQGQCFHAYMLRQESSLGHNLSLGNAFITMYARCGSMQSAERIFKTLPRR 629

Query: 481 SIVSWNAMITGYGMHGRGRDATLAFAQMLDDGFKPNNVSFASVLSACSHSGLTVTGLQLF 540
           +I+SWNA+ITGYGMHGRG DA LAF+QML+DG+ PN V+F SVLSACSHSG+   GL+LF
Sbjct: 630 NIISWNAIITGYGMHGRGSDAILAFSQMLEDGYYPNEVTFISVLSACSHSGMIEEGLRLF 689

Query: 541 HSMVRDFGIAPQLTHYGCMVDLLGRGGHFSEAIAFINSMPIEPDASIWRALLSSCQIKSN 600
            SMV DF I PQL HYGC+VDLLGR G   EA  FI SMPI+PDAS+WRALLS+ +    
Sbjct: 690 DSMVHDFHITPQLAHYGCVVDLLGRAGCLDEARGFIESMPIKPDASVWRALLSAYRDHCY 749

Query: 601 NKLLETIFGKLVELEPSNPGNFILLSNIYAAAGLWSEVVQIRKWLRERGLGKPPGTSWIV 660
            K  + IF K+VEL+P NPGN++L+ N YAAAGLWS+V QIR  L+ +GL KPPG SWIV
Sbjct: 750 TKEAKAIFEKIVELDPMNPGNYVLVCNAYAAAGLWSDVRQIRTCLKAKGLRKPPGMSWIV 809

Query: 661 IGNQVHHFTATDVLHPQSERIYENLNSLTSLIRDLG 697
           + +Q+H F A D  HP +++IY NLNSL   ++++G
Sbjct: 810 VRSQIHSFAAGDRSHPMADKIYANLNSLLHSMKEIG 845

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_004142223.1 | 0.0e+00 | 100.00 | PREDICTED: putative pentatricopeptide repeat-containing protein At3g01580 [Cucum...
XP_016900722.1 | 0.0e+00 | 94.11 | PREDICTED: pentatricopeptide repeat-containing protein At3g57430, chloroplastic-...
XP_008449715.1 | 0.0e+00 | 94.10 | PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like isoform X1...
XP_023512048.1 | 0.0e+00 | 86.21 | pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isofor...
XP_022944564.1 | 0.0e+00 | 86.21 | pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like [Cucur...

Match Name | E-value | Identity | Description
AT3G57430.1 | 2.4e-117 | 34.08 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G13650.1 | 1.1e-112 | 32.56 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G11290.1 | 1.4e-109 | 33.53 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G15510.1 | 3.2e-109 | 31.95 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G02330.1 | 2.1e-108 | 33.80 | Pentatricopeptide repeat (PPR) superfamily protein

Match Name | E-value | Identity | Description
sp|Q7Y211|PP285_ARATH | 4.4e-116 | 34.08 | Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...
sp|Q9SVP7|PP307_ARATH | 1.9e-111 | 32.56 | Pentatricopeptide repeat-containing protein At4g13650 OS=Arabidopsis thaliana OX...
sp|Q3E6Q1|PPR32_ARATH | 2.6e-108 | 33.53 | Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop...
sp|Q9M9E2|PPR45_ARATH | 5.7e-108 | 31.95 | Pentatricopeptide repeat-containing protein At1g15510, chloroplastic OS=Arabidop...
sp|Q9FWA6|PP207_ARATH | 3.7e-107 | 33.80 | Pentatricopeptide repeat-containing protein At3g02330, mitochondrial OS=Arabidop...

Match Name | E-value | Identity | Description
tr|A0A1S4DXL9|A0A1S4DXL9_CUCME | 0.0e+00 | 94.11 | pentatricopeptide repeat-containing protein At3g57430, chloroplastic-like isofor...
tr|A0A1S3BNK9|A0A1S3BNK9_CUCME | 0.0e+00 | 94.10 | pentatricopeptide repeat-containing protein At2g13600-like isoform X1 OS=Cucumis...
tr|A0A2N9IVN7|A0A2N9IVN7_FAGSY | 3.9e-263 | 64.51 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS56003 PE=4 SV=1
tr|A0A2I4FH18|A0A2I4FH18_9ROSI | 4.8e-253 | 62.30 | putative pentatricopeptide repeat-containing protein At5g59200, chloroplastic is...
tr|A0A061GS93|A0A061GS93_THECC | 1.4e-244 | 59.91 | Pentatricopeptide repeat-containing protein OS=Theobroma cacao OX=3641 GN=TCM_04...

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term | Definition
GO:0005515 | protein binding
Vocabulary: INTERPRO
Term | Definition
IPR002885 | Pentatricopeptide_repeat
IPR011990 | TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CsaV3_4G024260.1 | CsaV3_4G024260.1 | mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 | | coord: 336..431, e-value: 1.6E-11, score: 45.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | G3DSA:1.25.40.10 | | coord: 231..335, e-value: 1.8E-17, score: 65.8; coord: 432..546, e-value: 3.3E-21, score: 78.1
IPR002885 | Pentatricopeptide repeat | TIGRFAM | TIGR00756 | TIGR00756 | coord: 280..313, e-value: 5.9E-4, score: 17.8; coord: 381..408, e-value: 5.8E-5, score: 21.0; coord: 77..110, e-value: 5.0E-5, score: 21.2; coord: 483..516, e-value: 7.7E-7, score: 26.9
IPR002885 | Pentatricopeptide repeat | PFAM | PF01535 | PPR | coord: 252..277, e-value: 0.088, score: 13.0; coord: 555..579, e-value: 0.71, score: 10.2; coord: 455..479, e-value: 0.31, score: 11.3; coord: 381..408, e-value: 1.3E-5, score: 25.1; coord: 180..206, e-value: 8.2E-4, score: 19.4
IPR002885 | Pentatricopeptide repeat | PFAM | PF13041 | PPR_2 | coord: 75..122, e-value: 2.1E-10, score: 40.5; coord: 278..325, e-value: 1.3E-7, score: 31.6; coord: 481..528, e-value: 1.4E-8, score: 34.7
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 75..109, score: 10.052
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 110..144, score: 5.985
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 278..312, score: 9.12
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 481..515, score: 11.06
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 618..652, score: 6.599
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 145..175, score: 7.015
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 450..480, score: 7.147
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 176..210, score: 8.912
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 552..582, score: 6.193
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 247..277, score: 7.147
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 516..551, score: 8.21
IPR002885 | Pentatricopeptide repeat | PROSITE | PS51375 | PPR | coord: 379..409, score: 9.219
None | No IPR available | PANTHER | PTHR24015 | FAMILY NOT NAMED | coord: 22..211; coord: 270..678; coord: 127..324
None | No IPR available | PANTHER | PTHR24015 | FAMILY NOT NAMED | coord: 228..424
None | No IPR available | PANTHER | PTHR24015:SF75 | SUBFAMILY NOT NAMED | coord: 22..211; coord: 270..678; coord: 127..324; coord: 228..424
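
The PROSITE PS51375 (PPR) hits above are reported in no particular order, and many of them are directly adjacent on the protein. As a rough sketch of how they group into contiguous repeat blocks, the coordinates can be sorted and merged. The helper name `merge_adjacent` and the choice to treat ranges as 1-based inclusive residue positions are assumptions made for illustration, and the coordinate list is copied by hand from the PROSITE rows.

```python
# PROSITE PS51375 coordinates as listed in the annotation (start, end).
# Assumed to be 1-based, inclusive residue positions.
PROSITE_PPR = [
    (75, 109), (110, 144), (278, 312), (481, 515), (618, 652), (145, 175),
    (450, 480), (176, 210), (552, 582), (247, 277), (516, 551), (379, 409),
]


def merge_adjacent(ranges):
    """Merge ranges that overlap or touch (next start <= previous end + 1)."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            # Extend the current block to cover this range.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_residues(ranges):
    """Total number of residues covered by the merged blocks."""
    return sum(end - start + 1 for start, end in merge_adjacent(ranges))


if __name__ == "__main__":
    for start, end in merge_adjacent(PROSITE_PPR):
        print(f"{start}..{end} ({end - start + 1} residues)")
    print("total covered:", covered_residues(PROSITE_PPR))
```

Under these assumptions the twelve hits collapse into five blocks (75..210, 247..312, 379..409, 450..582 and 618..652) covering 401 residues in total.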

The following gene(s) are paralogous to this gene:

None